Identifying transitional high cost users from unstructured patient profiles written by primary care physicians

Identification and subsequent intervention of patients at risk of becoming High Cost Users (HCUs) presents the opportunity to improve outcomes while also providing significant savings for the healthcare system. In this paper, the 2016 HCU status of patients was predicted using free-form text data from the 2015 cumulative patient profiles within the electronic medical records of family care practices in Ontario. These unstructured notes make substantial use of domain-specific spellings and abbreviations; we show that word embeddings derived from the same context provide more informative features than pre-trained ones based on Wikipedia, MIMIC, and Pubmed. We further demonstrate that a model using features derived from aggregated word embeddings (EmbEncode) provides a significant performance improvement over the bag-of-words representation (82.48±0.35% versus 81.85±0.36% held-out AUROC, p = 3.2 × 10-4), using far fewer input features (5,492 versus 214,750) and fewer non-zero coefficients (1,177 versus 4,284). The future HCUs of greatest interest are the transitional ones who are not already HCUs, because they provide the greatest scope for interventions. Predicting these new HCU is challenging because most HCUs recur. We show that removing recurrent HCUs from the training set improves the ability of EmbEncode to predict new HCUs, while only slightly decreasing its ability to predict recurrent ones.

View Source

Information

Citation

Zhang H, Candido E, Wilton AS, Duchen R, Jaakkimainen L, Wodchis W, Morris Q. Pac Symp Biocomput. 2020; 25:127-38. Epub 2020 Jan 1.

View Source

Contributing ICES Scientists

Research Programs

Life Stage

Associated Sites

ICES Central

Discover More

Journal Article

25/04/2024

Multifetal pregnancy after implementation of a publicly funded fertility program

Velez MP, Soule A, Gaudet L, Pudwell J, Nguyen P, Ray JG. JAMA Netw Open. 2024; 7(4):e248496. Epub 2024 Apr 25.

Journal Article

25/04/2024

Proportion of life spent in Canada and the incidence of multiple sclerosis in permanent immigrants

Vyas MV, Kapral MK, Rea A, Fang J, Rotstein DL. Neurology. 2024; 102(10):e209350. Epub 2024 Apr 24.

Journal Article

18/04/2024

Incidence of total knee arthroplasty after arthroscopic surgery for knee osteoarthritis: a secondary analysis of a randomized clinical trial

Birmingham TB, Primeau CA, Shariff SZ, Reid JNS, Marsh JD, Lam M, Dixon SN, Giffin JR, Willits KR, Litchfield RB, Feagan BG, Fowler PJ. JAMA Netw Open. 2024; 7(4):e246578. Epub 2024 Apr 18.

See All