I have a question about studying with large datasets and problems
occuring after "save.., raplace".
I am working with a data set with 1.5 million rows, and 20 columns in
the beginnig. For this reason I can not work on my oersonel computer, I
am working on my university's server.
I am creating necesarry dummies for my work(like around 700 dummies).
They are being created properly. Before saving the new file, I am
checking for some simple stats for variables, everything seems to be OK.
But after saving the new file with a new name and using replace command
in any case; clearing and using the lastly saved dataset, some of my
variables start to have troubles! I really don't know what is happening
really, but it seems that variables shift into each other.
For example when I tabulate gender(which was inherent before creating
dummies), the following output arrives: