
Re: st: Re: RE: CASE-CONTROL STUDY

From	David Airey <[email protected]>
To	[email protected]
Subject	Re: st: Re: RE: CASE-CONTROL STUDY
Date	Sun, 15 Mar 2009 12:12:57 -0500


At that URL there are two files, optmatch2.ado and optmatch2.hlp.

Save these files in a directory where Stata looks for ado-files (somewhere on the ado-path), and the command will then be available to you at the Stata command line. There won't be any menus.
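To see which directories qualify, something like the following should do. It is only a sketch: it assumes you copied the two files by hand, and PERSONAL is just one reasonable choice of directory, not something the author's page prescribes.

```stata
* List Stata's system directories, including PERSONAL and PLUS
sysdir

* Show the full search path Stata uses to find ado-files
adopath

* After copying optmatch2.ado and optmatch2.hlp into one of those
* directories, check that Stata can find the command and its help file
which optmatch2
help optmatch2
```

If -which- reports that the command was not found, the files are not in a directory on the ado-path.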

You might also track down the author. He has offered email assistance with this program.


-Dave

----------------------------------------------------------------------------------------
help for optmatch2
----------------------------------------------------------------------------------------

Optimal Matching

Syntax

        optmatch2 casecontrol varlist [if] [in] [, options ]

    options               description
    ----------------------------------------------------------------------------------
    Main
      minc(#)              Minimum control:case ratio.
      maxc(#)              Maximum number of controls per set.
      nc(#)                Total number of controls to include in match.
      gen(newvar)          A new variable to contain the number of the case-control
                            set each subject belongs to.
      caliper(#)           Limit on acceptable matching.
      measure(string)      Type of dissimilarity measure to use.
      epsilon(#)           Stability constant.
      repeat               If requested number of controls cannot be matched, produce
                            match with as many controls as possible.
    ----------------------------------------------------------------------------------


Description

    The command optmatch2 performs optimal matching using the network flow
    methodology outlined in Rosenbaum (1989). The variable casecontrol contains 1
    for cases and 0 for controls. The variable(s) on which matching is to be
    performed are given by varlist. If there is more than one variable in varlist,
    there are a number of ways of calculating a distance between a case and a
    control: see the option measure below for more information.

Options

        +------+
    ----+ Main +---------------------------------------------------------------

    minc(#) Minimum control:case ratio. May be less than 1: e.g. 0.5 means the same
        control can be mapped to 2 cases. Default value is 1.

    maxc(#) Maximum number of controls per case-control set. Must be an integer >= 1:
        default value is 1.

    nc(#) Total number of controls to be used in the match. Defaults to all the
        controls in the dataset. Can be set to any integer less than or equal to
        this: requesting more controls than exist in the dataset will cause
        optmatch2 to fail with an error message.

    gen(newvar) If given, this will create a new variable containing an identifier
        for the case-control set this individual belongs to. If it is not given, a
        variable called set is created, unless it already exists, in which case
        optmatch2 will fail with an error message.

    caliper(#) This sets the maximum allowable discrepancy between a case and a
        control within a matched set. By default, no caliper is set and every
        control can, in theory, be matched to any case.

    measure(string) This is only of importance if there is more than one variable
        in varlist. In this case, it determines the metric to use when converting
        differences in several variables to one overall dissimilarity measure. The
        standard measures that Stata can use are outlined in measure_option. Of
        these, optmatch2 can use L(#), Lpower(#) and Linfinity and their various
        aliases, with the default being L2. In addition, it can accept a value
        mahal to use the Mahalanobis distance.

    epsilon(#) Default value is 0.000001. Technically, the optimal matching method
        only works if all discrepancies between cases and controls are greater than
        zero. This value is added to all discrepancies to ensure that this is the
        case. The value of epsilon can affect the matching if minc < 1: see
        Hansen and Klopfer (2006) for a discussion of this.

    repeat It may be impossible for optmatch2 to find a matching that matches the
        requested number of controls (nc). This may be a logical impossibility
        (there are not that many controls in the data) or an empirical one (if you
        use caliper to define the maximum allowable discrepancy in a match, it may
        not be possible to match all controls to a case). If you give the repeat
        option, it will report how many controls it can match, then perform the
        matching with that number of controls. Otherwise, it will simply report the
        maximum number of controls it could match.
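As a rough illustration of how caliper, repeat and measure might be combined, here is a sketch on simulated data. The dataset, the variable names (case, age, bmi), the caliper of 5 and the new variable names are all made up for the example. It also assumes the caliper is expressed in the units of the matching variable when only one variable is given, which the help text does not spell out. nc() is set explicitly to the number of cases, since these are 1:1 matches.

```stata
* Simulated stand-in data: 200 subjects, the first 40 flagged as cases
* (in older Stata versions uniform() plays the role of runiform())
clear
set seed 2009
set obs 200
generate byte case = (_n <= 40)
generate age = round(60 + 30*runiform())
generate bmi = 18 + 15*runiform()

* 1:1 matching on age with a caliper of 5; with repeat, optmatch2 should
* fall back to as many controls as it can match within the caliper
optmatch2 case age, nc(40) caliper(5) repeat gen(set_age)

* 1:1 matching on two variables using the Mahalanobis distance
optmatch2 case age bmi, nc(40) measure(mahal) gen(set_mahal)
```

Each call uses its own gen() name because, per the help text, optmatch2 fails if the default variable set already exists.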

Remarks

    The command optmatch2 produces matched sets, that is, groups consisting of one
    or more cases and one or more controls, with the dissimilarities between
    subjects in a set being as small as possible. By default, it produces matched
    pairs (1 case and 1 control), but this can be changed using the options minc,
    maxc and nc. For example, minc(1) maxc(1) will produce the default 1 to 1
    matching, whilst minc(3) maxc(3) will produce sets which all consist of 1 case
    and 3 controls.

    More complex matchings can be achieved by using values of minc less than 1.
    For example, minc(0.5) maxc(2) will produce sets consisting of either 1 control
    and 2 cases, 1 control and 1 case or 2 controls and 1 case.
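For the question that started this thread (3 controls per case), the call might look roughly like the sketch below. It runs on a small simulated dataset; the variable names (case, age, sex, matchset) and all values are hypothetical. The last line assumes unmatched subjects are left missing in the generated variable, which the help text does not confirm.

```stata
* Simulated stand-in data: 250 patients, the first 30 flagged as cases
* (in older Stata versions uniform() plays the role of runiform())
clear
set seed 2009
set obs 250
generate byte case = (_n <= 30)
generate age = round(60 + 30*runiform())
generate byte sex = (runiform() < 0.5)

* 3 controls per case: minc(3) maxc(3), nc() = 30 cases x 3 = 90 controls
optmatch2 case age sex, minc(3) maxc(3) nc(90) gen(matchset)

* Count cases and controls that ended up in a matched set
tabulate case if !missing(matchset)
```

With 300 cases out of about 2500 patients, the corresponding value would be nc(900). Note that sex enters the default L2 distance as an ordinary 0/1 number here, so this does not force exact matching on sex.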

References

    Ben B. Hansen and Stephanie Olsen Klopfer: "Optimal Full Matching and Related
        Designs via Network Flows" (2006) Journal of Computational and Graphical
        Statistics 15(3): 609-627.

    Paul R. Rosenbaum: "Optimal Matching for Observational Studies" (1989) JASA
        84(408): 1024-1032.

Author

    Mark Lunt, ARC Epidemiology Unit

    The University of Manchester

    Please email [email protected] if you encounter problems with this program.


On Mar 15, 2009, at 11:52 AM, Ishay Barat wrote:

Dear Kieran and David
There can be lots of arguments why one designs a backwards study and not a forward one. In my case, I am responsible for a lot of patients going through my department, and from time to time I need to carry out quality control of our patient management. It would have been nice to have a quarter of a million $ and 3 years' time to carry out a study, but that's not reality.
Sorry.



And now for your question.
As my focus is geriatric patients, and my data include the general internal medicine ward clientele, I would like to reduce the noise that younger and far healthier patients introduce into my data.
By matching on some crucial parameters like age, sex, medication and disease, I may get answers to my questions.
As to my anagram. It is just for fun and nothing else.



As to the reference to http://personalpages.manchester.ac.uk/staff/mark.lunt/optmatch.html

I installed the files, but cannot find the command in the menus.




*¸..· ´¨)) -:¦:-        *
  ¸.·´ .
(( -:¦:- * Ishay *  -:¦:-
  ´·..          ..·´
             ((¸¸.·´* -:¦:-

_________________________________________________________-
Matching is an element of the design of a study, planned before the data is collected, and should be done for efficiency, not control. If you already have the data, you gain nothing by matching. You have a sample size of 2,500. If you match these data in the way you have indicated, you will end up with a matched sample size of 1,200. Why would you want to discard over half of your data?
You should analyse the data as they are and control for age, sex, etc. in the analysis.
______________________________________________
Kieran McCaul MPH PhD
WA Centre for Health & Ageing (M573)
University of Western Australia
Level 6, Ainslie House
48 Murray St
Perth 6000
Phone: (08) 9224-2140
Fax: (08) 9224 8009
email: [email protected]
http://myprofile.cos.com/mccaul
http://www.researcherid.com/rid/B-8751-2008
______________________________________________
Epidemiology is so beautiful and provides such an important perspective on human life and death, but an incredible amount of rubbish is published. Richard Peto (2007)
-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Ishay Barat
Sent: Sunday, 15 March 2009 1:28 AM
To: [email protected]
Subject: st: CASE-CONTROL STUDY

HELLO
I've got a data set containing about 2500 patients, of which 300 are of interest to me (Group A).
I would like to extract a sample of 900 patients (Group B) out of the dataset that match Group A in age, sex and some other parameters: a classical case-control study with 3 controls for each case.



Does anybody have a clue what the syntax looks like?






*¸..· ´¨)) -:¦:-        *
  ¸.·´ .
(( -:¦:- * Ishay *  -:¦:-
  ´·..          ..·´
             ((¸¸.·´* -:¦:-

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



