Using multiple data features improved the validity of osteoporosis case ascertainment from administrative databases

Objectives — The aim was to construct and validate algorithms for osteoporosis case ascertainment from administrative databases and to estimate the population prevalence of osteoporosis with each algorithm.

Study Design and Setting — Artificial neural networks, classification trees, and logistic regression were applied to hospital, physician, and pharmacy data from Manitoba, Canada. Discriminative performance and calibration (i.e., error) were compared for algorithms defined from different sets of diagnosis, prescription drug, comorbidity, and demographic variables. Algorithms were validated against a regional bone mineral density testing program.

Results — Discriminative performance and calibration were poorer, and sensitivity was generally lower, for algorithms based on diagnosis codes alone than for algorithms based on an expanded set of data features that included osteoporosis prescriptions and age. Validation measures were similar for neural networks and classification trees, but prevalence estimates were lower for neural networks.

Conclusion — Using multiple features of administrative data generally improved the sensitivity of osteoporosis case-detection algorithms without loss of specificity. However, prevalence estimates using an expanded set of features were still slightly lower than estimates from a population-based study with primary data collection. The classification methods developed in this study can be extended to other chronic diseases for which there may be multiple markers in administrative data.
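The kind of comparison summarized in the abstract can be sketched in a few lines of Python. This is an illustration only, not the study's method: the records are invented, the field names are hypothetical, and the "expanded" rule (a diagnosis code, or a prescription at age 50 or older) is an assumed simple rule standing in for the neural networks, classification trees, and logistic regression models the study actually used. It shows how sensitivity and specificity are computed for two case definitions against a bone mineral density reference standard.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Person:
    # All fields are hypothetical stand-ins for administrative data features.
    has_dx_code: bool       # osteoporosis diagnosis code in hospital/physician data
    has_rx: bool            # osteoporosis prescription in pharmacy data
    age: int
    bmd_osteoporosis: bool  # reference standard from bone mineral density testing


def diagnosis_only(p: Person) -> bool:
    """Case definition based on diagnosis codes alone."""
    return p.has_dx_code


def expanded(p: Person) -> bool:
    """Assumed rule: diagnosis code, or a prescription at age 50 or older."""
    return p.has_dx_code or (p.has_rx and p.age >= 50)


def sensitivity_specificity(
    people: List[Person], rule: Callable[[Person], bool]
) -> Tuple[float, float]:
    """Compare a case definition with the reference standard."""
    tp = sum(rule(p) and p.bmd_osteoporosis for p in people)
    fn = sum((not rule(p)) and p.bmd_osteoporosis for p in people)
    tn = sum((not rule(p)) and (not p.bmd_osteoporosis) for p in people)
    fp = sum(rule(p) and (not p.bmd_osteoporosis) for p in people)
    return tp / (tp + fn), tn / (tn + fp)


# Invented toy cohort: four people with osteoporosis on BMD, four without.
cohort = [
    Person(True, False, 70, True),
    Person(False, True, 68, True),
    Person(False, True, 75, True),
    Person(False, False, 62, True),
    Person(False, False, 55, False),
    Person(False, False, 45, False),
    Person(True, False, 66, False),
    Person(False, False, 71, False),
]

dx_sens, dx_spec = sensitivity_specificity(cohort, diagnosis_only)
ex_sens, ex_spec = sensitivity_specificity(cohort, expanded)
print(f"diagnosis only: sensitivity={dx_sens:.2f}, specificity={dx_spec:.2f}")
print(f"expanded:       sensitivity={ex_sens:.2f}, specificity={ex_spec:.2f}")
```

In this toy cohort the expanded rule raises sensitivity from 0.25 to 0.75 while specificity stays at 0.75. The numbers come from the invented records and say nothing about the magnitudes reported in the study.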

Information

Citation

Lix LM, Yogendran MS, Leslie WD, Shaw SY, Baumgartner R, Bowman C, Metge C, Gumel A, Hux J, James RC. J Clin Epidemiol. 2008; 61(12):1250-60. Epub 2008 Jul 10.

Contributing ICES Scientists

Jan Hux

Research Programs

Chronic Disease & Pharmacotherapy

Associated Sites

ICES Central
