NPJ Digit Med. 2025 Nov 19;8(1):697. doi: 10.1038/s41746-025-02074-3.
ABSTRACT
Current AI approaches for cardiac diagnosis require condition-specific supervised learning with extensive labeled datasets, leading to fundamental scalability barriers. We developed an ECG-CLIP model, applying contrastive multimodal learning to enable zero-shot cardiac diagnosis from 12-lead ECGs using natural language supervision. Trained on 800,034 ECG-text pairs from MIMIC-IV-ECG, ECG-CLIP evaluated 18 cardiac conditions without condition-specific training. The model achieved superior performance for rhythm abnormalities (AUROC > 0.90) compared to morphological conditions. External validation demonstrated robust AUROC rank consistency (
PMID:41261169 | DOI:10.1038/s41746-025-02074-3