Statalist



Re: st: RE: Outlier: Detection


From   Maarten buis <[email protected]>
To   [email protected]
Subject   Re: st: RE: Outlier: Detection
Date   Wed, 20 Feb 2008 18:21:06 +0000 (GMT)

--- Sergiy Radyakin <[email protected]> wrote:
> Working with continuous variables, it makes more sense to drop, say,
> the top 1% of earners. Is that something you want?

I would not just drop the top 1%, but you could inspect those values
and see whether they are implausible. You might also want to look at
other variables. Say you also have information on occupation title and
the size of the company someone works for. If a high earner is a
CEO and the company is very large, then the high income is probably
legitimate and should not be removed. If a high earner is a
receptionist, then it is probably a typo and should be treated
accordingly (maybe go back to the original forms if they are still
available).
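
A minimal sketch of that kind of inspection in Stata could look like
the lines below. The variable names income, occupation, and firmsize
are hypothetical, so substitute whatever your data actually contain.
The idea is to flag the top 1% rather than drop it, and then list the
flagged cases next to the other variables.

```stata
* Sketch only: income, occupation, and firmsize are hypothetical names

* Get the 99th percentile of income (stored in r(p99))
summarize income, detail
local p99 = r(p99)

* Flag, rather than drop, non-missing incomes above that cutoff
generate byte top1 = income > `p99' & !missing(income)

* List the flagged cases next to the other variables, highest first
gsort -income
list income occupation firmsize if top1, noobs
```

Cases that still look implausible after this check can then be
compared with the original forms one by one.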

-- Maarten

-----------------------------------------
Maarten L. Buis
Department of Social Research Methodology
Vrije Universiteit Amsterdam
Boelelaan 1081
1081 HV Amsterdam
The Netherlands

visiting address:
Buitenveldertselaan 3 (Metropolitan), room Z434

+31 20 5986715

http://home.fsw.vu.nl/m.buis/
-----------------------------------------




