I have 50 individual-specific datasets (i.e., only one subject ID per
dataset), all of which contain data on the same 20 variables. The original
files were ASCII text; I read them into Stata, creating 50 Stata datasets,
and ran basic descriptive statistics for each of the 50 subjects.
I then appended all 50 datasets into one very large dataset and discovered
that some of the subjects now have the wrong number of observations! Some
have too many observations and some have too few, although the total
number of observations across all 50 subjects is correct.
Because the number of observations is so large for many of these subjects,
I'm not sure how to go about finding which observations were dropped from,
or wrongly added to, each subject.
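
A minimal sketch of one way such a check might look, under stated
assumptions: the file names subj1.dta through subj50.dta, the ID variable
id, and the marker variable source are all hypothetical names chosen for
illustration. It tags each observation with the file it came from as the
files are appended, then checks that every source file still maps to a
single subject ID.

```stata
* Hypothetical names: subj1.dta ... subj50.dta, ID variable id
clear
use subj1
generate int source = 1

* Append one file at a time and tag the newly added observations
forvalues i = 2/50 {
    append using subj`i'
    replace source = `i' if source == .
}

* Within each source file the first and last ID should match
sort source id
by source: assert id[1] == id[_N]

* Per-file counts, to compare with the original descriptive stats
tabulate source
```

If the assert fails, something like `tabulate id if source == 7` (for
whichever file is affected) should show which IDs that file's observations
ended up under, and `describe id` shows how the ID variable is stored.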
I am using StataSE 7.0 on a Windows 2000 machine with 384 MB of RAM.
The StataSE 7.0 executable is dated 11 Jun 2002,
and the ado-files are dated 9 Aug 2002.