Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, is already up and running.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: Analysing compound data types from imputed variables

From   roland andersson <>
Subject   st: Analysing compound data types from imputed variables
Date   Wed, 3 Aug 2011 17:29:53 +0200

We are analysing the discriminating capacity of a clinical score,
which is constructed from 7 variables. One variable is the ratio of
two variables, two are binary, one is an ordered variabel with 4
levels, and finally we have 4 continous variables which have been
divided into 2-3 intervals. We now want to 1) validate the score on
new data and 2) analyse if some additional variables can add
discriminating capacity to the original score.

Our problem is missing variables. It seems the missingnes is random.
We want to impute these missing values. This is straightforward using
ice or mi and using the passive alternative for the ratio-variable.
The problem is how to calculate and analyse the score for the cases
with missing variables. As we need to divide the continous variables
into intervals after imputation we can not use passive.

I am thinking to construct the imputed dataset, calculate the score
for each imputed dataset according to the intervals of the continous
variables. I suppose I can use mim for the analysis of the
discriminating capacity of the score. Is there any other alternatives?
Should we use the variables after dividing into intervals and then do
the imputation instead? It would then be possible to use passive
imputation for the score. Other proposals or comments are wellcome

Roland Andersson
*   For searches and help try:

© Copyright 1996–2016 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index