# st: RE: data manipulation/reduction

 From "Nichols, Austin" <[email protected]> To "'[email protected]'" <[email protected]> Subject st: RE: data manipulation/reduction Date Fri, 27 Aug 2004 14:42:48 -0400

```My guess is that if you can pose the question more clearly, you will almost
have written the Stata code yourself. In your example, it's not clear which
vars you've generated, and which are given, and there is no indication what
wound~er measures, and percent does not seem to measure any percent in the
data as given.
Assuming you want the number of non-missing values of wound~er by name, by
approach, by machine, you would -egen AdverseEvents=count(wound~er), by(name
machine approach)- and then suppose the number of bites at the apple is
given by surger~s in one obs per (name machine approach) cell, so you would
-egen NEvents=max(surger~s), by(name machine approach)- and then -gen
if _n>1- to get one obs per cell.

-----Original Message-----
From: Clint Thompson [mailto:[email protected]]
Sent: Friday, August 27, 2004 2:23 PM
To: [email protected]
Subject: st: data manipulation/reduction

Hello All ---
I am using Intercooled, v.8.2.
I was presented w/ an Excel file wherein I need to generate rates of an
adverse outcome relative to the number of procedures.  Toward that end,
I was given the number of surgeries for each clinician and the number of
adverse outcomes for each clinician.  Computing crude rates was not
problematic, however, I now need to stratify by surgical approach & type
of machine.  My primary problem is that I'm struggling with how to
reduce the data from an essentially 'long' form to 'wide' form.  An
abbreviated example of the data as received is as follows:
+-----------------------------------------------------------------+
|   name   surger~s   technqe   total   percent wound~er approach |
|-----------------------------------------------------------------|
| John Doe   750       100     100        25         .            |
| John Doe     .         .       .        75         5     divide |
| John Doe     .         .       .         .         .     divide |
| John Doe     .         .       .         .         .     divide |
| John Doe     .         .       .         .         .     divide |
| John Doe     .         .       .         .         .     divide |
+-----------------------------------------------------------------+
where 'surger~s' denotes # of procedures for the respective clinician
and 'wound~er' denotes the number of adverse events for that clinician.
types (variable not shown) wherein I'd like to compute rates.  Note that
there are 100 unique clinicians but many of the clinicians occupy
multiple lines because they reported 2 or more adverse events.  Any
suggestions?  I've struggled w/ this for a while and haven't had any
success using the help file or Stata-list archives...

Many Thanks,
Clint Thompson
*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/
```