[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: Making sure identifiers are unique

From	"louis boakye-yiadom" <[email protected]>
To	[email protected]
Subject	st: Making sure identifiers are unique
Date	Tue, 08 Mar 2005 15:48:56 +0000

Dear all,
I've been trying to determine the identifiers of a data set, and to ensure they're unique. Suspecting the variables, "region" and "district" are the identifiers, I gave the commands below, and got the output shown:
. sort region district
. by region district: assert _N==1
62 contradictions in 97 by-groups
assertion is false
r(9);

Owing to the fact that I'm more interested in the "district"-level data, I wanted to know whether a collapsed version of the data will have unique identifiers. I therefore gave the following set of commands and got the results shown:
. gen x=1
. collapse (count) x, by (region district)
. sort region district
. by region district: assert _N==1

My question is: What can account for the collaped data being uniquely identified by "region" and "district", whilst the original data are not? I'm using version 8.2.

Many thanks,
Louis

*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- st: Re: Making sure identifiers are unique
  - From: "Michael Blasnik" <[email protected]>

Prev by Date: Re: RE: st: _rmcoll query
Next by Date: st: Re: Making sure identifiers are unique
Previous by thread: st: RE: some code that doesn't do what I expect
Next by thread: st: Re: Making sure identifiers are unique
Index(es):
- Date
- Thread