Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, statalist.org is already up and running.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: survey answers imported from google, checkbox type


From   Nick Cox <njcoxstata@gmail.com>
To   "statalist@hsphsun2.harvard.edu" <statalist@hsphsun2.harvard.edu>
Subject   Re: st: survey answers imported from google, checkbox type
Date   Mon, 22 Apr 2013 01:26:21 +0100

Force is on your mind. Better to think of persuasion. Specific answers below.

Nick
njcoxstata@gmail.com


On 21 April 2013 23:15, Steven Young <youngazn@gmail.com> wrote:
> Thanks Nick for your reply.
>
> I used tabsplit from tab_chi (SSC). However it just lists the
> tabulation. Is there a way to force it to create new variables based
> on the splitting?

-tabsplit- restructures the dataset temporarily to do what it does.
The tabulation uses -tabulate-, but the original data structure is
restored. That's deliberate. If you want something else, you are free
to clone the program and rewrite it accordingly.But this is the same
question as the next really, so see below.

> I read through the Stata support, and I liked split. I can use it to
> break the compound strings at the "," (comma).
>
> One thing I'm running into now, is that for instance in the original Var:
> 1    "boy girl, boy boy, girl girl"
> 2    "boy girl, girl girl"
> 3    "boy boy"
> 4    "girl girl"
>
> When using split, it of course makes 3 new vars called Var1, Var2, Var3.
> It also splits the data in the order it sees it.
>
>     Var1       Var2       Var3
> 1  boy girl  boy boy   girl girl
> 2  boy girl  girl girl
> 3  boy boy
> 4  girl girl
>
> Is there a way to force split to appropriately split them so that they
> are under the same var name?

You want a -stack- or -reshape-. Advice at length was given in the
references I gave earlier, so I have to guess you haven't read them.

Nick

> On Thu, Apr 18, 2013 at 1:40 AM, Nick Cox <njcoxstata@gmail.com> wrote:
>> See (for example)
>>
>> -tabsplit- in -tab_chi- (SSC)
>>
>> FAQ     . . . . . . . . . . . . . . . . . . .  Dealing with multiple responses
>>         . . . . . . . . . . . . . . . . . . . . . . .  N. J. Cox and U. Kohler
>>         4/05    How do I deal with multiple responses?
>>                 http://www.stata.com/support/faqs/data/multresp.html
>>
>> SJ-5-1  st0082  . . . . . . . . . . . . . . . Tabulation of multiple responses
>>         (help _mrsvmat, mrgraph, mrtab if installed)  . . . . . . . .  B. Jann
>>         Q1/05   SJ 5(1):92--122
>>         introduces new commands for the computation of one- and
>>         two-way tables of multiple responses
>>
>> SJ-3-1  pr0008   Speaking Stata: On structure & shape: the case of mult. resp.
>>         . . . . . . . . . . . . . . . . . . . . . . . .  N. J. Cox & U. Kohler
>>         Q1/03   SJ 3(1):81--99                                   (no commands)
>>         discussion of data manipulations for multiple response data

>> Nick
>> njcoxstata@gmail.com
>>
>> On 18 April 2013 08:29, Steven Young <youngazn@gmail.com> wrote:
>>
>>> So I have a survey with answers imported from Google.
>>>
>>> One of the questions asks "Which have you heard of" and lists 4 items below
>>> in a checkbox fashion (tick all that you know).
>>>
>>> Google aggregated the data into one cell, so a person (each row) may answer
>>> "a, b, d", a second may answer "a, b, c" and a third may answer "a, d".
>>> Unfortunately each of these answers are quite long... not as short as a, b,
>>> c, d. I also cannot change how Google "aggregates" this data into one cell.
>>>
>>> Now the issue I have is that when it's imported to stata, it will list in
>>> one cell, each of the selected items, separated by comma.
>>>
>>> How do I go about making a "do" file that will go through this and find out
>>> what each person answered, ie make sub-columns of answer choice a, b, c, d,
>>> and then assigning a value of 1 to each column that the person answered?
>>>
>>> For instance if Joe answered a, b, d, then his answer columsn will be 1, 1,
>>> 0, 1.
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index