{"id":22287,"date":"2025-08-17T10:25:02","date_gmt":"2025-08-17T14:25:02","guid":{"rendered":"https:\/\/www.ices.on.ca\/?post_type=journal_article&#038;p=22287"},"modified":"2025-08-20T11:12:43","modified_gmt":"2025-08-20T15:12:43","slug":"imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching","status":"publish","type":"journal_article","link":"https:\/\/www.ices.on.ca\/fr\/publications\/journal-articles\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\/","title":{"rendered":"Imputation of incomplete ordinal and nominal data by predictive mean matching"},"content":{"rendered":"<p>Multivariate imputation using chained equations is a popular algorithm for imputing missing data that entails specifying multivariable models through conditional distributions. Two standard imputation methods for imputing missing continuous variables are parametric imputation using a linear model and predictive mean matching. The default methods for imputing missing categorical variables are parametric imputation using multinomial logistic regression and ordinal logistic regression for imputing nominal and ordinal categorical variables, respectively. There is a paucity of research into the relative computational burden and the quality of statistical inferences when using predictive mean matching versus parametric imputation for imputing missing non-binary categorical variables. We used simulations to compare the performance of predictive mean matching with that of multinomial logistic regression and ordinal logistic regression for imputing categorical variables when the analysis model of scientific interest was a logistic or linear regression model. We varied the sample size (N\u2009=\u2009500, 1000, 2500, and 5000), the rate of missing data (5%\u201350% in increments of 5%), and the number of levels of the categorical variable (3, 4, 5, and 6). In general, the performance of predictive mean matching compared very favorably to that of multinomial or ordinal logistic regression for imputing categorical variables when the analysis model was a logistic or linear regression model. This was true across a range of scenarios defined by sample size and the rate of missing data. Furthermore, the use of predictive mean matching was substantially faster, by a factor of 2\u20136. In conclusion, predictive mean matching can be used to impute categorical variables. The use of predictive mean matching to impute missing non-binary categorical variables substantially reduces computer processing time when conducting multiple imputation.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Multivariate imputation using chained equations is a popular algorithm for imputing missing data that entails specifying multivariable models through conditional distributions. Two standard imputation methods for imputing missing continuous variables are parametric imputation using a linear model and predictive mean matching. The default methods for imputing missing categorical variables are parametric imputation using multinomial logistic [&hellip;]<\/p>\n","protected":false},"template":"","migration-helper-automated":[],"migration-manual":[],"topic":[59],"migration-helper-qa-sample-set":[],"class_list":["post-22287","journal_article","type-journal_article","status-publish","hentry","topic-data-science"],"acf":{"citation":"Austin PC, van Buuren S. <em>Stat Methods Med Res<\/em>. 2025; Aug 17 [Epub ahead of print].","source_url":"https:\/\/doi.org\/10.1177\/09622802251362642","ices_scientist":[1385],"site":[6733],"research_program":[6742],"news_release":"","journal_article":"","atlas":"","research_report":"","infographic":"","video":"","downloads":null,"links":null,"sitecore_item_id":"","sitecore_item_name":"","sitecore_field_values":"","previous_url":""},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>ICES | Imputation of incomplete ordinal and nominal data by predictive mean matching<\/title>\n<meta name=\"description\" content=\"Multivariate imputation using chained equations is a popular algorithm for imputing missing data that entails specifying multivariable models through\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.ices.on.ca\/fr\/publications\/journal-articles\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\/\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"ICES | Imputation of incomplete ordinal and nominal data by predictive mean matching\" \/>\n<meta property=\"og:description\" content=\"Multivariate imputation using chained equations is a popular algorithm for imputing missing data that entails specifying multivariable models through\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.ices.on.ca\/fr\/publications\/journal-articles\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\/\" \/>\n<meta property=\"og:site_name\" content=\"ICES\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/ICESOntario\/\" \/>\n<meta property=\"article:modified_time\" content=\"2025-08-20T15:12:43+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.ices.on.ca\/wp-content\/uploads\/2024\/11\/ic-es-data-discovery-better-health-logo.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"675\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/publications\\\/journal-articles\\\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\\\/\",\"url\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/publications\\\/journal-articles\\\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\\\/\",\"name\":\"ICES | Imputation of incomplete ordinal and nominal data by predictive mean matching\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/#website\"},\"datePublished\":\"2025-08-17T14:25:02+00:00\",\"dateModified\":\"2025-08-20T15:12:43+00:00\",\"description\":\"Multivariate imputation using chained equations is a popular algorithm for imputing missing data that entails specifying multivariable models through\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/publications\\\/journal-articles\\\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\\\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/publications\\\/journal-articles\\\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/publications\\\/journal-articles\\\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Journal Articles\",\"item\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/publications\\\/journal-articles\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Imputation of incomplete ordinal and nominal data by predictive mean matching\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/#website\",\"url\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/\",\"name\":\"ICES\",\"description\":\"POPULATION-BASED HEALTH RESEARCH THAT MAKES A DIFFERENCE\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/#organization\"},\"alternateName\":\"Institute for Clinical Evaluative Sciences\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/#organization\",\"name\":\"ICES\",\"alternateName\":\"Institute for Clinical Evaluative Sciences\",\"url\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.ices.on.ca\\\/wp-content\\\/uploads\\\/2023\\\/04\\\/ices-logo.png\",\"contentUrl\":\"https:\\\/\\\/www.ices.on.ca\\\/wp-content\\\/uploads\\\/2023\\\/04\\\/ices-logo.png\",\"width\":\"676\",\"height\":\"618\",\"caption\":\"ICES\"},\"image\":{\"@id\":\"https:\\\/\\\/www.ices.on.ca\\\/fr\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/ICESOntario\\\/\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/ices-research-institute\\\/\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"ICES | Imputation of incomplete ordinal and nominal data by predictive mean matching","description":"Multivariate imputation using chained equations is a popular algorithm for imputing missing data that entails specifying multivariable models through","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.ices.on.ca\/fr\/publications\/journal-articles\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\/","og_locale":"fr_FR","og_type":"article","og_title":"ICES | Imputation of incomplete ordinal and nominal data by predictive mean matching","og_description":"Multivariate imputation using chained equations is a popular algorithm for imputing missing data that entails specifying multivariable models through","og_url":"https:\/\/www.ices.on.ca\/fr\/publications\/journal-articles\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\/","og_site_name":"ICES","article_publisher":"https:\/\/www.facebook.com\/ICESOntario\/","article_modified_time":"2025-08-20T15:12:43+00:00","og_image":[{"width":1200,"height":675,"url":"https:\/\/www.ices.on.ca\/wp-content\/uploads\/2024\/11\/ic-es-data-discovery-better-health-logo.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.ices.on.ca\/fr\/publications\/journal-articles\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\/","url":"https:\/\/www.ices.on.ca\/fr\/publications\/journal-articles\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\/","name":"ICES | Imputation of incomplete ordinal and nominal data by predictive mean matching","isPartOf":{"@id":"https:\/\/www.ices.on.ca\/fr\/#website"},"datePublished":"2025-08-17T14:25:02+00:00","dateModified":"2025-08-20T15:12:43+00:00","description":"Multivariate imputation using chained equations is a popular algorithm for imputing missing data that entails specifying multivariable models through","breadcrumb":{"@id":"https:\/\/www.ices.on.ca\/fr\/publications\/journal-articles\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.ices.on.ca\/fr\/publications\/journal-articles\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.ices.on.ca\/fr\/publications\/journal-articles\/imputation-of-incomplete-ordinal-and-nominal-data-by-predictive-mean-matching\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.ices.on.ca\/fr\/"},{"@type":"ListItem","position":2,"name":"Journal Articles","item":"https:\/\/www.ices.on.ca\/fr\/publications\/journal-articles\/"},{"@type":"ListItem","position":3,"name":"Imputation of incomplete ordinal and nominal data by predictive mean matching"}]},{"@type":"WebSite","@id":"https:\/\/www.ices.on.ca\/fr\/#website","url":"https:\/\/www.ices.on.ca\/fr\/","name":"ICES","description":"POPULATION-BASED HEALTH RESEARCH THAT MAKES A DIFFERENCE","publisher":{"@id":"https:\/\/www.ices.on.ca\/fr\/#organization"},"alternateName":"Institute for Clinical Evaluative Sciences","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.ices.on.ca\/fr\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"fr-FR"},{"@type":"Organization","@id":"https:\/\/www.ices.on.ca\/fr\/#organization","name":"ICES","alternateName":"Institute for Clinical Evaluative Sciences","url":"https:\/\/www.ices.on.ca\/fr\/","logo":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/www.ices.on.ca\/fr\/#\/schema\/logo\/image\/","url":"https:\/\/www.ices.on.ca\/wp-content\/uploads\/2023\/04\/ices-logo.png","contentUrl":"https:\/\/www.ices.on.ca\/wp-content\/uploads\/2023\/04\/ices-logo.png","width":"676","height":"618","caption":"ICES"},"image":{"@id":"https:\/\/www.ices.on.ca\/fr\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/ICESOntario\/","https:\/\/www.linkedin.com\/company\/ices-research-institute\/"]}]}},"_links":{"self":[{"href":"https:\/\/www.ices.on.ca\/fr\/wp-json\/wp\/v2\/journal_article\/22287","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ices.on.ca\/fr\/wp-json\/wp\/v2\/journal_article"}],"about":[{"href":"https:\/\/www.ices.on.ca\/fr\/wp-json\/wp\/v2\/types\/journal_article"}],"acf:post":[{"embeddable":true,"href":"https:\/\/www.ices.on.ca\/fr\/wp-json\/wp\/v2\/research_program\/6742"},{"embeddable":true,"href":"https:\/\/www.ices.on.ca\/fr\/wp-json\/wp\/v2\/site\/6733"},{"embeddable":true,"href":"https:\/\/www.ices.on.ca\/fr\/wp-json\/wp\/v2\/ices_scientist\/1385"}],"wp:attachment":[{"href":"https:\/\/www.ices.on.ca\/fr\/wp-json\/wp\/v2\/media?parent=22287"}],"wp:term":[{"taxonomy":"migration-helper-automated","embeddable":true,"href":"https:\/\/www.ices.on.ca\/fr\/wp-json\/wp\/v2\/migration-helper-automated?post=22287"},{"taxonomy":"migration-manual","embeddable":true,"href":"https:\/\/www.ices.on.ca\/fr\/wp-json\/wp\/v2\/migration-manual?post=22287"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/www.ices.on.ca\/fr\/wp-json\/wp\/v2\/topic?post=22287"},{"taxonomy":"migration-helper-qa-sample-set","embeddable":true,"href":"https:\/\/www.ices.on.ca\/fr\/wp-json\/wp\/v2\/migration-helper-qa-sample-set?post=22287"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}