Clinical Utility of Protein Language Models in Resolution of Variants of Uncertain Significance in KCNQ1, KCNH2, and SCN5A Compared With Patch-Clamp Functional Characterization

Circ Genom Precis Med. 2024 Aug 9:e004584. doi: 10.1161/CIRCGEN.124.004584. Online ahead of print.

ABSTRACT

BACKGROUND: Genetic testing for cardiac channelopathies is the standard of care. However, many rare genetic variants remain classified as variants of uncertain significance (VUS) due to lack of epidemiological and functional data. Whether deep protein language models may aid in VUS resolution remains unknown. Here, we set out to evaluate how 2 deep protein language models perform at VUS resolution in the 3 most common long-QT syndrome-causative genes, using the gold-standard patch clamp as the reference.

METHODS: A total of 72 rare nonsynonymous VUS (9 KCNQ1, 19 KCNH2, and 50 SCN5A) were engineered by site-directed mutagenesis and expressed in either HEK293 cells or TSA201 cells. The whole-cell patch-clamp technique was used to functionally characterize these variants. The protein language models, ESM1b and AlphaMissense, were used to predict the effect of the missense variants, and their predictions were compared with the patch-clamp results.

RESULTS: Considering variants in all 3 genes, the ESM1b model had an area under the receiver operating characteristic curve (AUC) of 0.75 (P=0.0003), with a sensitivity of 88% and a specificity of 50%. AlphaMissense performed well compared with patch clamp, with an AUC of 0.85 (P<0.0001), sensitivity of 80%, and specificity of 76%.
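
For readers unfamiliar with these metrics, the following is a minimal sketch of how sensitivity, specificity, and AUC can be computed when model scores are compared against a binary functional readout. It is not the authors' analysis code. The labels, scores, and the 0.5 threshold are hypothetical toy values, not data from this study, and the sketch assumes scores are oriented so that higher means more likely abnormal (a model with the opposite convention would need its scores negated first).

```python
from typing import Sequence, Tuple


def sensitivity_specificity(labels: Sequence[int], calls: Sequence[int]) -> Tuple[float, float]:
    """Return (sensitivity, specificity); 1 = abnormal, 0 = normal."""
    pairs = list(zip(labels, calls))
    tp = sum(1 for y, c in pairs if y == 1 and c == 1)  # abnormal, called abnormal
    fn = sum(1 for y, c in pairs if y == 1 and c == 0)  # abnormal, called normal
    tn = sum(1 for y, c in pairs if y == 0 and c == 0)  # normal, called normal
    fp = sum(1 for y, c in pairs if y == 0 and c == 1)  # normal, called abnormal
    return tp / (tp + fn), tn / (tn + fp)


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a randomly chosen abnormal variant
    scores higher than a randomly chosen normal one (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = abnormal on the functional assay, 0 = normal.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
# Hypothetical model scores, higher = more likely abnormal (assumed orientation).
scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]
threshold = 0.5  # hypothetical cutoff for calling a variant abnormal
calls = [1 if s >= threshold else 0 for s in scores]

sens, spec = sensitivity_specificity(labels, calls)
auc = roc_auc(labels, scores)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} AUC={auc:.4f}")
```

Sensitivity and specificity depend on the chosen threshold, whereas the AUC summarizes ranking performance across all thresholds, which is why a model can show a higher AUC alongside a lower sensitivity.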

CONCLUSIONS: Deep protein language models aid in VUS resolution with high sensitivity but lower specificity. Thus, these tools cannot fully replace functional characterization but can aid in reducing the number of variants that may require functional analysis.

PMID:39119706 | DOI:10.1161/CIRCGEN.124.004584