These methods are frowned upon? Which method in particular is the object
of the frown? I guess I am puzzled, because mean substitution comes in a
few flavors. There is substituting the mean and leaving it at that. I
assume this is widely understood as problematic. And then there is
substituting the mean and adding a dummy variable to the model that
indicates whether the mean has been substituted. Is there any problem
with this second approach?
I taught and recommended the latter method for several years - and then
retracted my support a few years ago! See Allison's book on Missing
Data. For the Cliff's Notes version, see pp. 5-6 of