1998 UK Stata Users Group meeting

Tools for longitudinal data management

Speaker: Michael Hills

In Stata, longitudinal data are usually stored in long form: each set of measurements at a new time point constitutes a new record, and all records for a subject share the same subject id. When exploring such data interactively, most simple operations refer to records, but the answers required often refer to subjects. The most obvious example is "How many subjects are there?" This is the number of unique codes for subject id and is returned, along with much else, by the Stata command codebook id. A simpler alternative is the new command unique id, which generalizes to, for example, unique id visit, which reports the number of unique combinations of id and visit. In general, the command

        . unique varlist, by(varname) gen(newvar)

will give the number of unique combinations of varlist. When the by() option is present, the command also creates a new variable, newvar, which contains the number of unique combinations of varlist within each level of varname. For example,

        . unique job, by(id) gen(jobvar)

reports the overall number of unique values for the variable job, and creates the variable jobvar which contains the number of different job codes for each subject.
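
Where unique is not available, similar counts can be obtained with built-in commands. The following is a minimal sketch, not the implementation of unique; it assumes a long-form dataset with variables id and job, and the variable names first, firstjob, and njobs are hypothetical.

```stata
* Sketch using only built-in commands; not the source of -unique-.
* Assumes long-form data with variables id and job.

* Number of subjects: tag one record per distinct id, then count the tags.
egen byte first = tag(id)
count if first

* Number of distinct job codes per subject, repeated on every record of
* that subject (compare the variable jobvar described above).
* Note: tag() leaves records with a missing id or job untagged unless its
* missing option is specified.
egen byte firstjob = tag(id job)
bysort id: egen njobs = total(firstjob)
```

Because the tag marks exactly one record per group, any subject-level summary can then be tabulated with a condition such as if first, so that each subject is counted once rather than once per record.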

Slightly more complex questions take the form "How many records satisfy the condition C?", where C refers to a single variable. An example is the condition height == . (height is missing). The command longch takes the form

        . longch id, c(height == .)

where id is the subject id variable name and c( ) contains the condition. The output looks like this:

        71  records fulfill the condition height == .
        
        some : 46  subjects have height == . in at least one record
        none : 51  subjects have height == . in no records
        every: 0  subjects have height == . in every record

In addition, three logical variables, _some, _none, and _every, are created for convenience in further manipulation (e.g., dropping or keeping records). Each flags all records belonging to subjects in the corresponding group; for example, _some flags every record of a subject who has at least one record satisfying the condition.
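
The quantities reported by longch can be approximated with built-in commands. This is a sketch under the assumption of a long-form dataset with variables id and height; it is not the source of longch, and the names hit, anyrec, allrec, norec, and tagid are hypothetical (chosen so that they do not clash with the variables longch creates).

```stata
* Sketch using only built-in commands; not the source of -longch-.
* Assumes long-form data with variables id and height.

* Record-level indicator: 1 if the record satisfies the condition.
generate byte hit = (height == .)
* Number of records fulfilling the condition.
count if hit

* Subject-level flags, repeated on every record of the subject.
* anyrec: condition holds in at least one record of the subject.
bysort id: egen byte anyrec = max(hit)
* allrec: condition holds in every record of the subject.
bysort id: egen byte allrec = min(hit)
* norec: condition holds in no record of the subject.
generate byte norec = !anyrec

* Count subjects rather than records by tagging one record per id.
egen byte tagid = tag(id)
count if tagid & anyrec
count if tagid & norec
count if tagid & allrec
```

With these flags in place, a command such as keep if norec would retain only the records of subjects whose height is never missing.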
