Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: Appending unique cases based on two variables

From	"Logan-Greene, Patricia" <[email protected]>
To	"'[email protected]'" <[email protected]>
Subject	st: Appending unique cases based on two variables
Date	Thu, 9 Aug 2012 11:50:49 -0400

Hello,

I am doing a fairly complicated merge between two sets of data (from criminal court records) that each contain an ID number and dates (along with many other variables). Here's some background:
1. The two files represent a) an assessment, given at approximately the same time as the beginning of probation, and b) discharge records.
2. Each file contains ID numbers that can be used to match individuals across files. The ID number can appear multiple times in each dataset (multiple entries reflect recidivism).
3. The entries are dated, which represents for a) the date on which the assessment in given, and for b) the official start date for probation. Although there are multiple entries for many ID numbers, there is only one instance of a particular ID and a particular date in each file.
2. As the dates don't match identically, we conducted a fuzzy match that paired assessment entries with discharge information (based on the beginning of probation) when the dates were within 6 weeks of each other.
3. We now need to add the unique cases from the assessment data (that may represent, for example, an incomplete probation). I know how to append unique cases based on a single identifier, but not with two. Will append even work if there are duplicates for one of the identifiers? 

Can anyone help?

Thanks!



*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: Appending unique cases based on two variables
  - From: Nick Cox <[email protected]>

Prev by Date: Re: st: rename
Next by Date: st: Stata implementation of alias method
Previous by thread: st: base outcome in mlogit
Next by thread: Re: st: Appending unique cases based on two variables
Index(es):
- Date
- Thread