The impact of two data-generating processes for competing risk data on the discrimination and calibration of two types of competing risk regression models

Monte Carlo simulations are an important tool in modern statistical research. The data-generating process is foundational to any simulation. In survival analysis, a competing risk is an event whose occurrence precludes the occurrence of the primary event of interest. Two data-generating processes have been described for simulating competing risk data: one based on all the cause-specific hazard functions for the different types of events, and one based on a subdistribution hazard model for the primary event of interest. There is a paucity of research on the impact of the choice of data-generating process. We used a series of Monte Carlo simulations to evaluate the impact of the choice of data-generating process on the performance of prediction models when assessing discrimination using the time-dependent AUC and accuracy using the time-dependent Brier score. We also assessed the impact of the choice of competing risk regression used for computing smoothed event probabilities for use when computing the calibration metrics ICI (integrated calibration index), E50, and E90. The impact of discordance between the fitted model and the data-generating process on both the time-dependent AUC and the time-dependent Brier score was minimal. When computing the ICI, E50, and E90, we recommend that researchers use a model for computing smoothed event probabilities that is concordant with the type of model whose calibration is being assessed.
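As a rough illustration of the first data-generating process named above (the one based on all cause-specific hazards), the following minimal sketch assumes constant cause-specific hazards, a single standard normal covariate, and administrative censoring at a fixed time. It also shows how the ICI, E50, and E90 are conventionally summarised as the mean, median, and 90th percentile of the absolute difference between predicted and smoothed event probabilities. The function names, parameter values, and the nearest-rank percentile rule are assumptions made for illustration; they are not the settings or code used in the paper.

```python
import math
import random
import statistics


def simulate_cause_specific(n, base_hazards=(0.10, 0.05), betas=(0.5, -0.3),
                            censor_time=10.0, seed=0):
    """Simulate competing risk data from two constant cause-specific hazards.

    All defaults are hypothetical values chosen for illustration only.
    Returns a list of (x, time, status); status 0 = censored, 1 or 2 = event type.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)  # single covariate (assumed standard normal)
        # Cause-specific hazards under a proportional hazards form.
        hazards = [h0 * math.exp(b * x) for h0, b in zip(base_hazards, betas)]
        total = sum(hazards)
        # Time to the first event of any type is driven by the all-cause hazard.
        t = rng.expovariate(total)
        if t > censor_time:
            rows.append((x, censor_time, 0))
        else:
            # With constant hazards, the event is of type 1 with
            # probability h1 / (h1 + h2), otherwise type 2.
            status = 1 if rng.random() < hazards[0] / total else 2
            rows.append((x, t, status))
    return rows


def calibration_summaries(predicted, smoothed):
    """Return (ICI, E50, E90) from predicted and smoothed event probabilities.

    `smoothed` is assumed to come from a separately fitted model that regresses
    the observed outcome on the predicted probabilities.
    """
    abs_diff = sorted(abs(p - s) for p, s in zip(predicted, smoothed))
    n = len(abs_diff)
    ici = statistics.fmean(abs_diff)       # mean absolute difference
    e50 = statistics.median(abs_diff)      # median absolute difference
    e90 = abs_diff[(9 * n + 9) // 10 - 1]  # nearest-rank 90th percentile
    return ici, e50, e90
```

The second data-generating process, based on a subdistribution hazard model for the primary event, is not sketched here; it requires inverting the cumulative incidence function and is more involved.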

Information

Citation

Austin PC, Putter H. Stat Med. 2026; 45(6-7):e70468.

Discover More

Journal Article

03/03/2026

Population-based clustering of co-occurring social determinants: an application of unsupervised machine learning

Giesinger I, Buajitti E, Siddiqi A, Smith PM, Krishnan RG, Postill G, Rosella LC. Ann Epidemiol. 2026; Mar 3 [Epub ahead of print].

Journal Article

30/01/2026

Development and validation of the Predicting Risk of Ischemic Stroke in Malignancy Estimation tool

Lun R, Leentjens J, Cerasuolo JO, Kirkwood D, Kapral MK, Carrier M, Siegal D, Sutradhar R. J Am Heart Assoc. 2026; e045631. Epub 2026 Jan 30.

Journal Article

21/01/2026

Quantitative assessment of neonatal health using dried blood spot metabolite profiles and deep learning

Chang AL, Reiss JD, Culos A, Becker M, Mayo JA, Marić I, De Francesco D, Phongpreecha T, Espinosa CA, Mataraso SJ, Berson E, Kim Y, Xue L, Xie F, Shu CH, Fallahzadeh R, Bidoki NH, Xenochristou M, Zhang M, Profit J, Lee HC, Gaudillière B, Angst MS, Hawken S, Wilson K, Stevenson DK, Shaw GM, Sylvester KG, Aghaeepour N. Sci Transl Med. 2026; 18(833): eadv4942. Epub 2026 Jan 21.
