Go to content

Case-ascertainment models to identify adults with obstructive sleep apnea using health administrative data: internal and external validation


Background — There is limited evidence on whether obstructive sleep apnea (OSA) can be accurately identified using health administrative data.

Study Design and Methods — We derived and validated a case-ascertainment model to identify OSA using linked provincial health administrative and clinical data from all consecutive adults who underwent a diagnostic sleep study (index date) at two large academic centers (Ontario, Canada) from 2007 to 2017. The presence of moderate/severe OSA (an apnea–hypopnea index≥ 15) was defined using clinical data. Of 39 candidate health administrative variables considered, 32 were tested. We used classification and regression tree (CART) methods to identify the most parsimonious models via cost-complexity pruning. Identified variables were also used to create parsimonious logistic regression models. All individuals with an estimated probability of 0.5 or greater using the predictive models were classified as having OSA.

Results — The case-ascertainment models were derived and validated internally through bootstrapping on 5099 individuals from one center (33% moderate/severe OSA) and validated externally on 13,486 adults from the other (45% moderate/severe OSA). On the external cohort, parsimonious models demonstrated c-statistics of 0.75– 0.81, sensitivities of 59– 60%, specificities of 87– 88%, positive predictive values of 79%, negative predictive values of 73%, positive likelihood ratios (+LRs) of 4.5– 5.0 and –LRs of 0.5. Logistic models performed better than CART models (mean integrated calibration indices of 0.02– 0.03 and 0.06– 0.12, respectively). The best model included: sex, age, and hypertension at the index date, as well as an outpatient specialty physician visit for OSA, a repeated sleep study, and a positive airway pressure treatment claim within 1 year since the index date.

Interpretation — Among adults who underwent a sleep study, case-ascertainment models for identifying moderate/severe OSA using health administrative data had relatively low sensitivity but high specificity and good discriminative ability. These findings could help study trends and outcomes of OSA individuals using routinely collected healthcare data.



Kendzerska T, van Walraven C, McIsaac DI, Povitz M, Mulpuru S, Lima I, Talarico R, Aaron SD, Reisman W, Gershon AS. Clin Epidemiol. 2021; 13:453-67. Epub 2021 Jun 17.

View Source

Associated Sites