Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

re: st: Reading large data sets in Stata

From	Christopher Baum <[email protected]>
To	[email protected]
Subject	re: st: Reading large data sets in Stata
Date	Mon, 22 Feb 2010 15:19:07 -0500

<>
Stas said

That's 13Gb of data, right? If you really want to put everything intomemory, then you would probably need a computer with 24Gb of RAM. Idon't really know if you can buy anything like that in the desktopformat, and what kind of OS you would need to look at, although I amsure there are clusters with much larger memory capacities. If youonly need subsets of that data set, then you could use <list of thevariables that you REALLY need> if <subsetting to the conditions youREALLY want to analyze> using <this huge data set name> That way, youmay have a data set of a more realistic 2Gb size that you can workwith on a 4Gb RAM machine.

That's not necessarily 13 Gb of data. Using the interactive calculatoron the FAQ, if you assume all 37 variables can be held in 4 byteseach, it's under 7 Gb. If on average they only need 3 bytes each, it'sunder 6 Gb. Stat/Transfer can optimize the dataset as it converts itto Stata format. Stas' suggestions are well taken, but one more isimportant--if any of these variables are 0/1 indicators, or integerstaking on values 1..5, etc. they need not chew up nearly as muchmemory. I don't know if you can get it down to a 2 Gb size, though. Touse more than 2 Gb, you need a 64-bit machine (almost all machines arethese days), and Stata 11 will automatically install the 64-bitversion on such a machine.


Kit Baum   |   Boston College Economics and DIW Berlin   |   http://ideas.repec.org/e/pba1.html
An Introduction to Stata Programming   |   http://www.stata-press.com/books/isp.html
An Introduction to Modern Econometrics Using Stata   |   http://www.stata-press.com/books/imeus.html

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: Reading large data sets in Stata
  - From: Michael Norman Mitchell <[email protected]>

Prev by Date: Re: st: RE: hbox line
Next by Date: st: RE: RE: hbox line
Previous by thread: Re: st: Reading large data sets in Stata
Next by thread: Re: st: Reading large data sets in Stata
Index(es):
- Date
- Thread