Using methods from the data-mining and machine-learning literature for disease classification and prediction: a case study examining classification of heart failure subtypes

Objective — Physicians classify patients into those with or without a specific disease. Furthermore, there is often interest in classifying patients according to disease etiology or subtype. Classification trees are frequently used to classify patients according to the presence or absence of a disease. However, classification trees can suffer from limited accuracy. In the data-mining and machine-learning literature, alternate classification schemes have been developed. These include bootstrap aggregation (bagging), boosting, random forests, and support vector machines.

Study Design and Setting — The researchers compared the performance of these classification methods with that of conventional classification trees to classify patients with heart failure (HF) according to the following subtypes: HF with preserved ejection fraction (HFPEF) and HF with reduced ejection fraction. The researchers also compared the ability of these methods to predict the probability of the presence of HFPEF with that of conventional logistic regression.

Results — The researchers found that modern, flexible tree-based methods from the data-mining literature offer substantial improvement in prediction and classification of HF subtype compared with conventional classification and regression trees. However, conventional logistic regression had superior performance for predicting the probability of the presence of HFPEF compared with the methods proposed in the data-mining literature.

Conclusion — The use of tree-based methods offers superior performance over conventional classification and regression trees for predicting and classifying HF subtypes in a population-based sample of patients from Ontario, Canada. However, these methods do not offer substantial improvements over logistic regression for predicting the presence of HFPEF.

View Source

Information

Citation

Austin PC, Tu JV, Ho JE, Levy D, Lee DS. J Clin Epidemiol. 2013; 66(4):398-407. Epub 2013 Feb 5.

View Source

Contributing ICES Scientists

Research Programs

Cardiovascular

Associated Sites

ICES Central

Discover More

Journal Article

16/04/2024

Association of blood mitochondrial DNA copy number with risk of acute kidney injury after cardiac surgery

Jotwani V, Thiessen-Philbrook H, rking DE, Yang SY, McArthur E, Garg AX, Katz R, Tranah GJ, Ix JH, Cummings S, Waikar SS, Sarnak MJ, Shlipak MG, Parikh SM, Parikh CR. Am J Kidney Dis. 2024; Apr 16 [Epub ahead of print].

Journal Article

16/04/2024

Evaluating readability, understandability, and actionability of online printable patient education materials for cholesterol management: a systematic review

Bhatt C, Lin E, Ferreira-Legere LE, Jackevicius CA, Ko DT, Lee DS, Schade K, Johnston S, Anderson TJ, Udell JA. J Am Heart Assoc. 2024; 13(8):e030140. Epub 2024 Apr 3.

Journal Article

16/04/2024

Renal transplantation in HIV-positive and HIV-negative patients with advanced stages of kidney disease: equity in transplantation

Hosseini-Moghaddam SM, Kang Y, Bota SE, Weir MA. Open Forum Infect Dis. 2024; Apr 16 [Epub ahead of print].

See All