Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality

Objectives — Automated variable selection methods are frequently used to determine the independent predictors of an outcome. The objective of this study was to determine the reproducibility of logistic regression models developed using automated variable selection methods.

Study Design and Setting — An initial set of 29 candidate variables was considered for predicting mortality after acute myocardial infarction (AMI). We drew 1,000 bootstrap samples from a dataset of 4,911 patients admitted to hospital with an AMI. In each bootstrap sample, logistic regression models predicting 30-day mortality were developed using backward elimination, forward selection, and stepwise selection. We then assessed the agreement among the three selection methods and the agreement across the 1,000 bootstrap samples.
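The resampling design described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' code: the toy data, the variable names, and the helper functions are all hypothetical, and the selector is a simple univariate screen standing in for backward, forward, or stepwise logistic regression, which would need a statistics library.

```python
import random


def bootstrap_sample(rows, rng):
    """Draw len(rows) rows with replacement (one bootstrap sample)."""
    n = len(rows)
    return [rows[rng.randrange(n)] for _ in range(n)]


def univariate_screen(rows, names, min_gap=0.15):
    """Stand-in selector, NOT backward elimination (assumption for brevity).

    Keeps a binary variable when the observed death rate differs by at
    least min_gap between rows where it is 1 and rows where it is 0.
    """
    selected = set()
    for j, name in enumerate(names):
        with_x = [y for x, y in rows if x[j] == 1]
        without_x = [y for x, y in rows if x[j] == 0]
        if not with_x or not without_x:
            continue  # variable is constant in this sample
        gap = abs(sum(with_x) / len(with_x) - sum(without_x) / len(without_x))
        if gap >= min_gap:
            selected.add(name)
    return frozenset(selected)


def selected_models(rows, names, select, n_boot=1000, seed=0):
    """Apply `select` to n_boot bootstrap samples; return one model per sample."""
    rng = random.Random(seed)
    return [select(bootstrap_sample(rows, rng), names) for _ in range(n_boot)]


def make_toy_data(n=300, seed=1):
    """Synthetic data with hypothetical variable names (not the AMI dataset)."""
    rng = random.Random(seed)
    names = ["shock", "age_over_75", "noise_a", "noise_b"]
    rows = []
    for _ in range(n):
        x = [int(rng.random() < 0.3) for _ in names]
        p_death = 0.05 + 0.40 * x[0] + 0.12 * x[1]
        rows.append((x, int(rng.random() < p_death)))
    return names, rows


if __name__ == "__main__":
    names, rows = make_toy_data()
    models = selected_models(rows, names, univariate_screen, n_boot=200)
    print("unique models across 200 bootstrap samples:", len(set(models)))
```

Passing the selector as an argument keeps the resampling loop independent of the selection method, so the same loop could be rerun with each of the three methods compared in the study.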

Results — Across the 1,000 bootstrap samples, backward elimination identified 940 unique models for predicting mortality. Similar results were obtained for forward and stepwise selection. Three variables were identified as independent predictors of mortality in every bootstrap sample. Over half of the candidate prognostic variables were identified as independent predictors in fewer than half of the bootstrap samples.
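The summary measures reported here (number of unique models, variables selected in every sample, variables selected in fewer than half of the samples) could be computed along the following lines. This is an illustrative sketch only: the function name, the variable names, and the four example models are hypothetical and are not taken from the study.

```python
from collections import Counter


def summarize_stability(models, candidates):
    """Summarize how consistently each candidate variable is selected.

    models: one frozenset of selected variable names per bootstrap sample.
    candidates: every candidate variable name considered for selection.
    """
    n = len(models)
    counts = Counter(v for model in models for v in model)
    return {
        "unique_models": len(set(models)),
        "selection_frequency": {v: counts[v] / n for v in candidates},
        "selected_in_all": sorted(v for v in candidates if counts[v] == n),
        "selected_in_under_half": sorted(
            v for v in candidates if counts[v] < n / 2
        ),
    }


# Hypothetical selections from four bootstrap samples.
candidates = ["age", "shock", "sbp", "diabetes", "smoker"]
models = [
    frozenset({"age", "shock"}),
    frozenset({"age", "shock", "sbp"}),
    frozenset({"age", "shock"}),
    frozenset({"age", "sbp", "diabetes"}),
]

summary = summarize_stability(models, candidates)
print(summary["unique_models"])           # 3
print(summary["selected_in_all"])         # ['age']
print(summary["selected_in_under_half"])  # ['diabetes', 'smoker']
```

Storing each model as a frozenset makes two bootstrap samples count as the same model only when they select exactly the same set of variables, regardless of the order in which the variables entered or left the model.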

Conclusion — Automated variable selection methods result in models that are unstable and not reproducible. The variables selected as independent predictors are sensitive to random fluctuations in the data.

Information

Citation

Austin PC, Tu JV. J Clin Epidemiol. 2004; 57(11):1138-46.
