Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: How to reference results from a big dataset within a  program
From 
 
Richard Williams <[email protected]> 
To 
 
[email protected],        "'[email protected]'" <[email protected]> 
Subject 
 
Re: st: How to reference results from a big dataset within a  program 
Date 
 
Tue, 27 Aug 2013 18:15:54 -0500 
I think you could tell people that simudata must be placed somewhere 
along Stata's adopath. Then the program could say -sysuse simudata-. 
My guess is that simudata.dta would have to be specified as an 
ancillary file in the installation package, which means that the user 
will have to do a little more work than usual to get the package installed.
Alternatively, how were the simulations done? If with drawnorm or 
something like that maybe you could include code that would recreate the data.
At 04:25 PM 8/27/2013, Chen,Minxing wrote:
Dear all,
I submitted a manuscript and program to Stat Journal several months 
ago, and now got feedback from reviewer. I need a big help from your 
all regarding how to reference a big dataset within a program.
Basically, in the program I submitted, I had to reference results 
from a big pre-simulated dataset (four variables, but around 400,000 
observations). In my previous submission, I simply submitted the 
pre-simulated dataset with my program, and within the program I 
called up that simulated dataset by using code such as " use 
c:\ado\personal\simudata". I was hoping when people download the 
program from SSC, the pre-simulated dataset will be also downloaded 
to the directory "c:\ado\personal\".
Now my reviewer indicated that I can't expect users to do that, I 
can't even tell the user to place the file there because such a 
directory may not be creatable for the user (e.g. they might not 
have a C: drive). The reviewer suggested me to find some other way 
to get the information in my pre-simulated dataset, such as 
incorporating the data into the program.
I tried to copy of the simulated data within my program by using 
syntax such as "input x y z k", however, since there are so many 
observations (a little more than 400,000), and there are system 
limit for the maximum lines of syntax within a program (around 
3500), I was not able to do this way. The reviewer also mentioned 
that I may use "Mata library" function, but I am pretty new to Stata 
Mata. Is there anyone that may be able to help regarding this issue?
Thank you so much!
Minxing Chen
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/
-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
HOME:   (574)289-5227
EMAIL:  [email protected]
WWW:    http://www.nd.edu/~rwilliam
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/