I am confused about the size of my data files. Here's why:
I have two files: <b.dta>, with 6,771,434 bytes on disk, and <b1.dta>, with
3,385,856 bytes (sizes according to Win Commander).
Stata, however, says:
. uc b
. d
Contains data from b.dta
  obs:     1,692,789
 vars:             1                          5 Jul 2002 16:40
 size:    13,542,312 (87.1% of memory free)
-------------------------------------------------------------------------------
              storage  display     value
variable name   type   format      label      variable label
-------------------------------------------------------------------------------
b               float  %9.0g
-------------------------------------------------------------------------------
Sorted by:
. uc b1
. d
Contains data from b1.dta
  obs:     1,692,789
 vars:             1                          5 Jul 2002 16:41
 size:    10,156,734 (90.3% of memory free)
-------------------------------------------------------------------------------
              storage  display     value
variable name   type   format      label      variable label
-------------------------------------------------------------------------------
b               int    %9.0g
-------------------------------------------------------------------------------
I read the FAQ at http://www.stata.com/support/faqs/data/howbig.html and also
tried the interactive dataset calculator
(http://www.stata.com/support/faqs/data/datacalc.cgi). For <b.dta>, with a
width of *4* bytes, it returns -Approximate dataset size is 12.91
megabytes-. For <b1.dta>, with a width of *2* bytes, I get -Approximate
dataset size is 9.69 megabytes-. That still looks like quite a difference
compared to the sizes returned by -describe-.
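To make the comparison concrete, here is a rough sketch of the arithmetic behind the figures above, written in Python rather than Stata. All byte counts are copied from this post. The helper names (`raw_bytes`, `summarize`) are made up for illustration, and treating a megabyte as 1,048,576 bytes is my assumption, not something I found stated in the FAQ.

```python
# Figures copied from the post; helper names are hypothetical.
OBS = 1_692_789                  # observations in both datasets

WIDTHS = {"float": 4, "int": 2}  # storage width in bytes per value

DISK_BYTES = {"float": 6_771_434, "int": 3_385_856}        # from the file manager
DESCRIBE_BYTES = {"float": 13_542_312, "int": 10_156_734}  # "size:" in -describe-

BYTES_PER_MB = 1024 * 1024       # assumption: 1 megabyte = 1,048,576 bytes


def raw_bytes(obs, width):
    """Bytes needed for the data values alone: observations times width."""
    return obs * width


def summarize(kind):
    """Compare the raw data size with the disk and -describe- figures."""
    raw = raw_bytes(OBS, WIDTHS[kind])
    return {
        "raw_bytes": raw,
        "disk_minus_raw": DISK_BYTES[kind] - raw,
        "describe_bytes_per_obs": DESCRIBE_BYTES[kind] / OBS,
        "describe_in_mb": round(DESCRIBE_BYTES[kind] / BYTES_PER_MB, 2),
    }


for kind in ("float", "int"):
    print(kind, summarize(kind))
```

If I have this right, both disk files are 278 bytes larger than observations times width, while the -describe- sizes work out to 8 and 6 bytes per observation, that is, 4 bytes more than the storage width in each case. Under the 1,048,576-byte assumption, the -describe- sizes convert to 12.91 and 9.69 megabytes. I can only report what the numbers divide out to; I do not know what the extra bytes are used for.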
Where do the differences come from?
So how big are the files really?
Thanks and best wishes,
Daniel