Development and validation of a model to categorize cardiovascular cause of death using health administrative data
Patel S, Thompson W, Sivaswamy A, Khan A, Ferreira-Legere L, Lee DS, Abdel-Qadir H, Jackevicius C, Goodman S, Farkouh ME, Tu K, Kapral MK, Wijeysundera HC, Tam D, Austin PC, Fang J, Ko DT, Udell JA. Am Heart J Plus. 2022; Sep 16 [Epub ahead of print]. DOI: https://doi.org/10.1016/j.ahjo.2022.100207
Study Objective — Develop and evaluate a model that uses health administrative data to categorize cardiovascular (CV) cause of death (COD).
Design — Population-based cohort.
Setting — Ontario, Canada.
Participants — Decedents ≥ 40 years with known COD between 2008 and 2015 in the CANHEART cohort, split into derivation (2008 to 2012; n = 363,778) and validation (2013 to 2015; n = 239,672) cohorts.
Main Outcome Measures — Model performance. COD was categorized as CV or non-CV with ICD-10 codes as the gold standard. We developed a logistic regression model that uses routinely collected healthcare administrative to categorize CV versus non-CV COD. We assessed model discrimination and calibration in the validation cohort.
Results The strongest predictors for CV COD were history of stroke, history of myocardial infarction, history of heart failure, and CV hospitalization one month before death. In the validation cohort, the c-statistic was 0.80, the sensitivity 0.75 (95 % CI 0.74 to 0.75) and the specificity 0.71 (95 % CI 0.70 to 0.71). In the primary prevention validation sub-cohort, the c-statistic was 0.81, the sensitivity 0.71 (95 % CI 0.70 to 0.71) and the specificity 0.75 (95 % CI 0.75 to 0.75) while in the secondary prevention sub-cohort the c-statistic was 0.74, the sensitivity 0.81 (95 % CI 0.81 to 0.82) and the specificity 0.54 (95 % CI 0.53 to 0.54),
Conclusion — Modelling approaches using health administrative data show potential in categorizing CV COD, though further work is necessary before this approach is employed in clinical studies.
View full text