BMJ Paediatr Open. 2025 Aug 14;9(1):e003742. doi: 10.1136/bmjpo-2025-003742.
ABSTRACT
This study assessed how ChatGPT 3.5, ChatGPT 4.0 and Google Gemini perform in providing educational content about coeliac disease and type 1 diabetes mellitus. We analysed 76 frequently asked questions for accuracy, comprehensiveness, readability and consistency. The models delivered highly accurate and comprehensive responses across the board. While ChatGPT 4.0 offered the most readable content, all models struggled with overall readability. Each model maintained consistent performance throughout testing. These results indicate that large language models show promise as supplementary tools for patient education in chronic paediatric conditions, though improvements in readability are needed to enhance accessibility.
PMID:40813141 | DOI:10.1136/bmjpo-2025-003742