Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, statalist.org is already up and running.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: st: Reading large data sets in Stata


From   "Martin Weiss" <martin.weiss1@gmx.de>
To   <statalist@hsphsun2.harvard.edu>
Subject   RE: st: Reading large data sets in Stata
Date   Mon, 22 Feb 2010 18:11:19 +0100

<>


" That's 13Gb of data, right?"


Depends very much on the " Average width per variable", as seen in
http://www.stata.com/support/faqs/data/datacalc.cgi
If they were all just binary vars, the dataset described by Jibonayan could
be below 2G. 


HTH
Martin


-----Original Message-----
From: owner-statalist@hsphsun2.harvard.edu
[mailto:owner-statalist@hsphsun2.harvard.edu] On Behalf Of Stas Kolenikov
Sent: Montag, 22. Februar 2010 17:49
To: statalist@hsphsun2.harvard.edu
Subject: Re: st: Reading large data sets in Stata

That's 13Gb of data, right? If you really want to put everything into
memory, then you would probably need a computer with 24Gb of RAM. I don't
really know if you can buy anything like that in the desktop format, and
what kind of OS you would need to look at, although I am sure there are
clusters with much larger memory capacities. If you only need subsets of
that data set, then you could

use <list of the variables that you REALLY need> if <subsetting to the
conditions you REALLY want to analyze> using <this huge data set name>

That way, you may have a data set of a more realistic 2Gb size that you can
work with on a 4Gb RAM machine.

On Mon, Feb 22, 2010 at 9:35 AM, Jibonayan Raychaudhuri <
jibonayanrc@yahoo.com> wrote:

> Dear Statalist Users,
>
> I have a large data set with  48249390 observations (rows) and 37
variables
> (columns). I am currently using SAS 9.1 to work on this data set. I want
to
> use Stata/MP to work on this data set. I have version 11.0/MP (dual core).
> Could anyone please let me know a computer specification (i.e. hardware
> specifications) that will enable me to work on this data set with Stata
> 11.0? I need to enter the data, generate variables, sort, etc. Thanks in
> advance for the help.
>
> Best Regards,
> Jibonayan
>
>
>
>
> *
> *   For searches and help try:
> *   http://www.stata.com/help.cgi?search
> *   http://www.stata.com/support/statalist/faq
> *   http://www.ats.ucla.edu/stat/stata/
>



-- 
Stas Kolenikov, also found at http://stas.kolenikov.name
Small print: I use this email account for mailing lists only.
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index