Econometric Analysis of Cross Section and Panel Data

By Jeffrey Wooldridge

Official site for downloadable materials

Provided here are the datasets that were used to produce the output in the book Econometric Analysis of Cross Section and Panel Data by Jeffrey Wooldridge.

You can download the datasets from within Stata using the net command. At the Stata prompt, type

       . net from http://www.stata.com/data/jwooldridge/
       . net describe eacsap
       . net get eacsap
This will download all files associated with the book to your current directory.

If you do not have an Internet connection from within Stata, you can download one of the following files:

eacsap.zip ZIP format, 2MB
eacsap.tar.Z Unix tar.Z format, 3MB

We suggest you create a new directory and copy the materials there.
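
The same advice applies when downloading from within Stata, because net get writes to the current directory. A minimal sketch follows; the directory name wooldridge_data is only a placeholder, and the sketch assumes Stata has Internet access and write permission in the current location.

```stata
* Create a fresh folder for the book's files (the name is a placeholder)
mkdir wooldridge_data
cd wooldridge_data

* Fetch everything into the new current directory
net from http://www.stata.com/data/jwooldridge/
net get eacsap

* List the Stata datasets that arrived
dir *.dta
```

Changing into the new folder first keeps the book's files separate from other work.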

The book sometimes refers to filenames using capital letters. Due to differences across operating systems, the filenames stored here are all lowercase.

Besides downloading all files associated with the book at once, you can obtain any individual file. Click on a file in the list of datasets below to download it to a local folder on your machine, or, from within Stata, type

        . use http://www.stata.com/data/jwooldridge/eacsap/filename
where filename is the name of the dataset in the book, typed as all lowercase without the extension. For example, on page 59, the file MROZ.RAW is discussed. For ease of use, we have made this data available in Stata format as mroz.dta. To load the dataset into Stata, type
        . use http://www.stata.com/data/jwooldridge/eacsap/mroz
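
Before loading a dataset over the network, it can help to look at its contents first. The following sketch assumes that describe using accepts the same web address as use. The clear option discards whatever data are currently in memory, so save any unsaved work beforehand.

```stata
* Inspect variable names and labels without loading the data
describe using http://www.stata.com/data/jwooldridge/eacsap/mroz

* Load the dataset, replacing any data currently in memory
use http://www.stata.com/data/jwooldridge/eacsap/mroz, clear

* Quick overview of the number of observations and variables
describe, short
```
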
Many users find it convenient to define a global macro holding the path so that files can be obtained with less typing:
        . global JW http://www.stata.com/data/jwooldridge/eacsap
        . use $JW/mroz
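
If only a few datasets are needed, the macro can also be combined with a loop to save local copies. The sketch below is an illustration rather than part of the book's instructions: the three dataset names are an arbitrary selection, and it assumes each file is served with the .dta extension that use appends automatically.

```stata
global JW http://www.stata.com/data/jwooldridge/eacsap

* Save local copies of a few datasets (the selection is arbitrary)
foreach f in mroz card wagepan {
    copy "$JW/`f'.dta" "`f'.dta", replace
}

* Later sessions can then load from disk without a connection
use mroz, clear
```
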
Here is a list of the datasets:
401ksubs.dta Cross-sectional data on eligibility for and participation in 401k plans along with income and demographic information
airfare.dta Panel data on airline flights
apple.dta Data on quantity of ecologically friendly apples desired by a survey of individuals
attend.dta Data on class attendance and final exam performance
bwght.dta Data on birth weight and smoking
card.dta Card (1995) wage/proximity to four-year college data
cornwell.dta Cornwell and Trumball (1994) data on county-level crime rates
cps78_85.dta CPS data on wages from 1978 and 1985
cps91.dta CPS data on wife and husband earnings along with family demographic information
ezunem.dta Papke (1994) panel data on the number of unemployment claims and the presence of enterprise zones for 22 cities over 9 years
fertil1.dta Data on number of children born to 7 different cross-sections of U.S. women
fertil2.dta Data on number of living children, education, and demographic information of a sample of women from Botswana
fringe.dta Data on hourly benefits and demographic information
gpa.dta In- and out-of-season grade point average of a sample of student athletes
hprice.dta Subset of the Kiel and McClain (1995) data on housing prices
injury.dta Meyer, Viscusi, and Durbin (1995) data on the length of time workers received workers' compensation after an injury
jtrain1.dta Holzer et al. (1993) data on the scrap rate of 157 Michigan firms
jtrain2.dta Data on real earnings and demographics of a sample of men who participated in a job training experiment in the 1970s
keane.dta Subset of Keane and Wolpin (1997) panel data on schooling and employment for a sample of men from 1981-1987
labsup.dta Cross-sectional data on the number of children born and the mother's work history and demographics
loanapp.dta Data on loan applications and demographics of individuals seeking loans
lowbirth.dta Panel data on percentage of state-level births that qualify as low birth weight, percentage of families on AFDC, and other covariates
mathpnl.dta Panel data on percentages of fourth and seventh graders performing satisfactorily along with school-district-level information
mroz.dta Mroz (1987) PSID data on the wages of 428 working, married women
murder.dta Panel data on murder rates in U.S. states
nbasal.dta Data on earnings, position played and demographics of a sample of NBA players
nls80.dta Blackburn and Neumark (1992) NLS data on the wages of working men
nls81_87.dta Panel data on earnings, schooling, and demographic information for 530 individuals from the National Longitudinal Survey
norway.dta Crime rates in 53 districts in Norway in 1972 and 1978
openness.dta Romer (1993) data on openness and inflation
patent.dta Panel data on the number of patents sought and obtained by a sample of firms along with some firm specific information
pension.dta Subset of Papke (1998) data on 401k portfolio choice and family demographics
prison.dta Levitt (1996) state-level panel data on crime rates and prison populations
q.dta Data on Tobin's Q for 2,068 firms
recid.dta Chung, Schmidt, and Witte (1991) data on the time between being released from prison and being arrested again, with some demographics
smoke.dta Data on cigarettes smoked per day and demographic information
vote.dta Data on incumbents and challengers in elections in 42 U.S. congressional districts in 1988 and 1990
wagepan.dta Vella and Verbeek (1998) panel data on 545 men who worked every year from 1980-1987
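
Several of the files above are panel datasets, which usually need to be declared as such before panel commands can be used. A minimal sketch for wagepan.dta follows. The identifier names nr (person) and year are assumptions about that file and should be checked against the output of describe before running xtset.

```stata
use http://www.stata.com/data/jwooldridge/eacsap/wagepan, clear

* Check the variable names before relying on them
describe

* Declare the panel structure (nr and year are assumed variable names)
xtset nr year

* Summarize the pattern of observations per person
xtdescribe
```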


Blackburn, M. and D. Neumark. 1992.
Unobserved ability, efficiency wages, and interindustry wage differentials. Quarterly Journal of Economics 107: 1421–1436.
Card, D. 1995.
Using geographic variation in college proximity to estimate the return to schooling. In Aspects of Labour Market Behavior: Essays in Honour of John Vanderkamp, ed. L. N. Christophides, E. K. Grant, and R. Swidinsky, 201–222. Toronto: University of Toronto Press.
Chung, C.-F., P. Schmidt, and A. D. Witte. 1991.
Survival analysis: A survey. Journal of Quantitative Criminology 7: 59–98.
Cornwell, C. and D. Trumball. 1994.
Estimating the economic model of crime with panel data. Review of Economics and Statistics 76: 360–366.
Grogger, J. 1991.
Certainty vs. severity of punishment. Economic Inquiry 29: 297–309.
Holzer, H., R. Block, M. Cheatham, and J. Knott. 1993.
Are training subsidies effective? The Michigan experience. Industrial and Labor Relations Review 46: 625–636.
Keane, M. P. and K. I. Wolpin. 1997.
The career decisions of young men. Journal of Political Economy 105: 473–522.
Kiel, K. A. and K. T. McClain. 1995.
House prices during siting decision stages: The case of an incinerator from rumor through operation. Journal of Environmental Economics and Management 28: 241–255.
Levitt, S. D. 1996.
The effect of prison population size on crime rates: evidence from prison overcrowding legislation. Quarterly Journal of Economics 111: 319–351.
Meyer, B. D., W. K. Viscusi, and D. L. Durbin. 1995.
Workers' compensation and injury duration: evidence from a natural experiment. American Economic Review 85: 322–340.
Mroz, T. A. 1987.
The sensitivity of an empirical model of married women's hours of work to economic and statistical assumptions. Econometrica 55: 765–799.
Papke, L. E. 1994.
Tax policy and urban development: Evidence from the Indiana enterprise zone program. Journal of Public Economics 54: 37–49.
——. 1998.
How are participants directing their participant-directed individual account pension plans? American Economic Review 88: 212–216.
Romer, D. 1993.
Openness and inflation: Theory and evidence. Quarterly Journal of Economics 108: 869–903.
Vella, F. and M. Verbeek. 1998.
Whose wages do unions raise? A dynamic model of unionism and wage rate determination for young men. Journal of Applied Econometrics 13: 163–183.