Research Tools & Code·arXiv cs.LG·May 3

Federated Semi-Supervised Graph Neural Networks with Prototype-Guided Pseudo-Labeling for Privacy-Preserving Gestational Diabetes Mellitus Prediction

Federated learning and semi-supervised graph neural networks are converging on a real clinical bottleneck: hospitals cannot share patient data across institutions, yet most EHR records lack diagnostic labels. This paper demonstrates a concrete architecture (FedTGNN-SS) that trains GNNs locally on unlabeled patient similarity graphs while using prototype-guided pseudo-labeling to bootstrap confidence in unlabeled cases, keeping raw data on-premise. The work signals growing maturity in privacy-preserving ML for healthcare, where federated + semi-supervised methods are becoming table stakes rather than research novelty. Insiders should track this pattern: as regulatory friction around data sharing hardens, techniques that extract signal from sparse labels without centralization become infrastructure.

Modelwire context

Explainer

The paper's actual contribution is narrower than the framing suggests: it's not that federated plus semi-supervised learning is new, but that prototype-guided pseudo-labeling specifically solves the cold-start problem when hospitals have unlabeled patient graphs. The architectural choice (using prototypes rather than confidence thresholds) is the differentiator, not the federated wrapper.

This work sits at the intersection of two parallel Modelwire threads. The EASE paper from May 1st showed that federated multimodal systems face coupling problems across data modalities; FedTGNN-SS sidesteps this by staying within a single graph representation. More directly, the temporal readmission prediction study from the same day identified observation window selection as a deployment friction point. FedTGNN-SS implicitly assumes hospitals have sufficient historical patient similarity data to bootstrap prototypes, but the paper doesn't specify what minimum temporal depth is required. That gap matters for practitioners.

If the authors release code and benchmark FedTGNN-SS against centralized semi-supervised GNN baselines on public EHR datasets (MIMIC, eICU) within the next six months, that confirms the privacy cost is acceptable. If they only publish results on synthetic or proprietary data, the practical deployment barrier remains unquantified.

Coverage we drew on

Temporal Data Requirement for Predicting Unplanned Hospital Readmissions · arXiv cs.LG

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsFedTGNN-SS · Graph Neural Networks · Federated Learning · Semi-supervised Learning · Gestational Diabetes Mellitus

Read full story at arXiv cs.LG →(arxiv.org)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.