09 September 2014 6 5K Report

I've already asked this on another forum so apologies if you've seen it before. Does anyone have any experience with either the propensity score matching  or with multiple imputation of missing values in SPSS?

I have data on a group with a particular diagnosis (treatment group) and also for a much larger group without that diagnosis (control group). As the data is cross sectional I am using propensity score matching (using the psmatch plug-in for SPSS developed by Felix Thoemmes)  to generate a subset of the control group that best matches my treatment group on a number of covariates. However psmatch cannot handle missing values so I first have to use multiple imputation to impute the missing values. By default SPSS gives me a multiple imputation (MI) dataset with five imputations of the complete data along with the original. I then take each imputed dataset and create a matched dataset using psmatch. After a bit of fiddling about I join them all up again to get a matched MI dataset.

SPSS very cleverly can recognise an MI file so that when, for example, I compare cholesterol levels for treatment and control it runs the analysis 6 times, once for the original data and once for each imputed dataset. It then combines the five imputed analyses into a single pooled test using (I think) Rubin's rules. My problem is that the matched control cases are not the same in each of the imputed datasets and I wonder if this violates some assumptions of Rubin's rules?

Can anyone help with this or perhaps suggest an accessible introduction to Rubin's rules.

Similar questions and discussions