Identification of Incident Atrial Fibrillation From Electronic Medical Records.
Alanna M ChamberlainVéronique L RogerPeter A NoseworthyLin Yee ChenSusan A WestonRuoxiang JiangAlvaro AlonsoPublished in: Journal of the American Heart Association (2022)
Background Electronic medical records are increasingly used to identify disease cohorts; however, computable phenotypes using electronic medical record data are often unable to distinguish between prevalent and incident cases. Methods and Results We identified all Olmsted County, Minnesota residents aged ≥18 with a first-ever International Classification of Diseases, Ninth Revision (ICD-9) diagnostic code for atrial fibrillation or atrial flutter from 2000 to 2014 (N=6177), and a random sample with an International Classification of Diseases, Tenth Revision (ICD-10) code from 2016 to 2018 (N=200). Trained nurse abstractors reviewed all medical records to validate the events and ascertain the date of onset (incidence date). Various algorithms based on number and types of codes (inpatient/outpatient), medications, and procedures were evaluated. Positive predictive value (PPV) and sensitivity of the algorithms were calculated. The lowest PPV was observed for 1 code (64.4%), and the highest PPV was observed for 2 codes (any type) >7 days apart but within 1 year (71.6%). Requiring either 1 inpatient or 2 outpatient codes separated by >7 days but within 1 year had the best balance between PPV (69.9%) and sensitivity (95.5%). PPVs were slightly higher using ICD-10 codes. Requiring an anticoagulant or antiarrhythmic prescription or electrical cardioversion in addition to diagnostic code(s) modestly improved the PPVs at the expense of large reductions in sensitivity. Conclusions We developed simple, exportable, computable phenotypes for atrial fibrillation using structured electronic medical record data. However, use of diagnostic codes to identify incident atrial fibrillation is prone to some misclassification. Further study is warranted to determine whether more complex phenotypes, including unstructured data sources or using machine learning techniques, may improve the accuracy of identifying incident atrial fibrillation.
Keyphrases
- atrial fibrillation
- catheter ablation
- left atrial
- oral anticoagulants
- machine learning
- left atrial appendage
- direct oral anticoagulants
- deep learning
- cardiovascular disease
- heart failure
- big data
- percutaneous coronary intervention
- electronic health record
- mental health
- healthcare
- primary care
- artificial intelligence
- palliative care
- type diabetes
- venous thromboembolism
- risk factors
- acute care
- high intensity