Article Text

Download PDFPDF

88 Exploring the prognostic significance and important phenotypic and genotypic associations of neural network-derived electrocardiographic features
Free
  1. Arunashis Sau1,
  2. Luana Giatti2,
  3. Fu Siong Ng3,
  4. Nicholas Peters4,
  5. Martin Shipley5,
  6. Sandhi Barreto6,
  7. Libor Pastika7,
  8. Antônio Ribeiro8,
  9. Ester Sabino9,
  10. James Ware10,
  11. Kathryn McGurk11,
  12. Irena Andršová12,
  13. Huiyi Wu13,
  14. Marek Malik14,
  15. Antonio Ribeiro15,
  16. Sean Zheng16,
  17. Declan O’Regan17,
  18. Jonathan Waks18,
  19. Maddalena Ardissino19,
  20. Nikesh Bajaj1,
  21. Katerina Hnatkova1,
  22. Xili Shi1,
  23. Danilo Mandic1,
  24. Daniel Kramer18,
  25. Annie Britton5,
  26. Jun Yu Chen1,
  27. Tomáš Novotný12
  1. 1Imperial College London
  2. 2Universidade Federal de Minas Gerais
  3. 3Imperial College London
  4. 4Imperial College London
  5. 5University College London
  6. 6Universidade Federal de Minas Gerais
  7. 7Imperial College London
  8. 8Uppsala University
  9. 9University of São Paulo
  10. 10Imperial College London & MRC London Institute of Medical Sciences
  11. 11Imperial College London
  12. 12University Hospital Brno and Masaryk University
  13. 13Imperial College London
  14. 14Imperial College London
  15. 15Universidade Federal de Minas Gerais
  16. 16Imperial College London
  17. 17Imperial College London
  18. 18Beth Israel Deaconess Medical Center
  19. 19n.bajaj@imperial.ac.uk

Abstract

Background Subtle, prognostically-meaningful ECG features may not be apparent to physicians. In the course of supervised machine learning training, many thousands of ECG features are identified. These are not limited to conventional ECG parameters and morphology. These novel neural network (NN)-derived ECG features may have clinical, phenotypic, and genotypic associations and prognostic significance.

Methods and Results We extracted 5120 NN-derived ECG features from an AI-ECG model trained for six simple diagnoses and applied unsupervised machine learning to identify three phenogroups. The derivation set, the Clinical Outcomes in Digital Electrocardiography (CODE) cohort (n = 1,558,421), is a database of ECGs recorded in primary care in Brazil. The three phenogroups had significantly different mortality profiles (Figure 1). After adjusting for known covariates (including age, gender, and comorbidities), phenogroup B had a 1.2-fold increase in long-term mortality compared to phenogroup A (HR 1.20, 95% CI 1.17-1.23, p < 0.0001).

We then externally validated our findings in four diverse cohorts. The Whitehall II cohort (n = 5,066) consists of British civil servants. The UK Biobank is longitudinal study of volunteers (n = 42,386). The Longitudinal Study of Adult Health (ELSA-Brasil) cohort (n = 13,739) consists of Brazilian public servants. Lastly the São Paulo-Minas Gerais Tropical Medicine Research Center (SaMi-Trop) is a cohort (n = 1,631) of patients with chronic Chagas cardiomyopathy.

We found phenogroup B had a significantly greater risk of mortality in all cohorts (Figure 1). We performed a phenome-wide association study (PheWAS) in the UK Biobank. We found ECG phenogroup significantly associated with cardiac and non-cardiac phenotypes, including cardiac chamber volumes and cardiac output (Figure 2A). A single-trait genome-wide association study (GWAS) was conducted. The GWAS yielded four significant loci (Figure 2B). SCN10A, SCN5A and CAV1 have well described roles in cardiac conduction and arrhythmia. ARHGAP24 has been previously associated with ECG parameters, however, our analysis has identified for the first time ARHGAP24 as a gene associated with a prognostically significant phenogroup. Mendelian randomisation demonstrated the higher risk ECG phenogroup was causally associated with higher odds of atrioventricular block but lower odds of atrial fibrillation and ischaemic heart disease.

Conclusion NN-derived ECG features have important applications beyond the original model from which they are derived and may be transferable and applicable for risk prediction in a wide range of settings, in addition to mortality prediction. We have shown the significant potential of NN-derived ECG features, as a highly transferable and potentially universal risk marker, that may be applied to a wide range of clinical contexts.

Abstract 88 Figure 1

(A) Hybrid machine learning approach (B) Flow of data and analyses performed in this study. LBBB: left bundle branch block, RBBB: right bundle branch block, AVB: atrioventricular block.

Abstract 88 Figure 2

Survival analysis in the derivation dataset, phenogroup B has a markedly worse prognosis. (A) whole CODE cohort, (B) CODE cohort without subjects with CODE-CNN diagnoses (C) Subset of CODE cohort with normal ECGs.

Abstract 88 Figure 3

Survival analysis in four external datasets. Phenogroup B has a significantly higher event rate.

Abstract 88 Figure 4

UK Biobank Phenome-wide association study. Manhattan plot showing negative logarithm of the univariate correlation P-value between phenotypes (A) and correlation coefficient (B). EDV: end-diastolic volume, ESV: end-systolic volume, RV: right ventricle, LV: left ventricle, RAV: right atrial volume, SV: stroke volume, BMC: bone mineral content, SHBG: sex hormone binding globulin.

Abstract 88 Figure 5

(A) Genome-wide association study. Manhattan plots of genomic loci associated with ECG phenogroup. Nearest genes are annotated on the plot. (B) Mendelian randomisation analyses of associations between genetically predicted ECG phenogroup and cardiovascular outcomes/phenotypes (C) Grad-CAM is used to generate importance maps showing the sections of the ECG signal deemed most important for phenogroup determination. HF: heart failure, BMI: body mass index, SBP: systolic blood pressure. SCD: sudden cardiac death.

Conflict of Interest None

  • ECG
  • machine learning
  • risk prediction

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.