A Federated Benchmark for Clinical Natural Language Processing (FedDRAGON)

B. Abrahamsen, J.S. Bosma, H. Huisman and M. Elschot

Studies in Health Technology and Informatics 2026.

We introduce the FedDRAGON challenge, a federated learning benchmark for clinical natural language processing. The challenge comprises 12 information extraction tasks on clinical reports from 4 Dutch care centers. Baseline results show that the federated models outperform single-center models and approach the performance of centralized models. The benchmark, code, and pre-trained LLMs are publicly available.