
st: Re: Reshaping dataset


From   Andrea Molinari <anmolinari@gmail.com>
To   statalist <statalist@hsphsun2.harvard.edu>
Subject   st: Re: Reshaping dataset
Date   Wed, 1 May 2013 20:05:24 -0300

Dear statalisters,

It's me again, trying to reshape a piece of my dataset.

I need to assign values from one trade classification (SITC) to
another (chain), with the complication that a single SITC code may
correspond to more than one chain. I then need to sum the SITC-level
values (with -egen-) to group them into the chain classification.

When I tried to do this with -merge-, using SITC as the identifying
variable, Stata refused to merge the two datasets because SITC "does
not uniquely identify observations in the using data" (sic).

Does anyone know of any other command that allows me to do this?

Cheers!
Andrea
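
One possible route for a key that repeats in the using data is -joinby-, which forms every pairwise combination of observations within each group of the key. The lines below are only a sketch under assumptions: the file names trade.dta and concordance.dta and the variable names sitc and chain are hypothetical, since the post does not give them.

```stata
* Hypothetical names: trade.dta holds year flow sitc value;
* concordance.dta holds sitc chain, with some sitc codes on several rows.
use trade, clear

* Pair each trade record with every chain listed for its SITC code.
* unmatched(master) keeps SITC codes absent from the concordance.
joinby sitc using concordance, unmatched(master)

* _merge == 1 flags trade records whose SITC code has no chain.
tabulate _merge

* Sum the values within each chain.
collapse (sum) value, by(year flow chain)
```

Note that a SITC code mapped to several chains contributes its full value to each of them, so totals across chains can exceed the original total. If the original observations should be kept, `bysort year flow chain: egen double svalue = total(value)` could replace the -collapse- line.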

On 26 April 2013 13:24, Andrea Molinari <anmolinari@gmail.com> wrote:
> Dear statalisters,
>
> I'm working with a dataset which groups many dimensions and I'm having
> a little trouble reshaping the data for the (rather basic)
> calculations I need to do.
>
> The dataset has the following columns:
>
> year flow partner value cadena usoecon subcadena cadenacompartida1
> subcadenacompartida1 cadenacompartida2 subcadenacompartida2
>
> In order to regroup the data, summing "value" by year, flow, cadena,
> subcadena and usoecon, I need the following:
>
> - the values in cadenacompartida1 and cadenacompartida2 go under those
> in the column "cadena"
>
> - the values in subcadenacompartida1 and subcadenacompartida2 go
> under those in the column "subcadena"
>
> To do so, I tried several options with -reshape long-, but I can't
> seem to get the data into the shape I need in order to then
> calculate:
>
> bysort year flow cadena subcadena usoecon: egen double svalue=sum(value)
>
> Any ideas from those used to handling large datasets would be more than welcome!
>
> Cheers,
> Andrea
>
> --
> Andrea Molinari, PhD
> Investigadora Asistente
> Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)
> Instituto Interdisciplinario de Economía Política de Buenos Aires (IIEP- BAIRES)
> Córdoba 2122, 2do. piso (http://iiep-baires.econ.uba.ar)
> Tel: +54 11 4374-4448, int. 6362
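
The stacking described in the quoted message could be sketched with -reshape long- once the columns share a common stub. This is an illustration under assumptions, not a tested solution for the actual data: the id and slot variables and the numbered names (cadena0, cadena1, ...) are introduced here, and all cadena columns are assumed to have the same storage type.

```stata
* Assumed helper: a unique row identifier for -reshape-.
generate long id = _n

* Give the three cadena/subcadena pairs a common stub plus a number.
rename (cadena subcadena) (cadena0 subcadena0)
rename (cadenacompartida1 cadenacompartida2) (cadena1 cadena2)
rename (subcadenacompartida1 subcadenacompartida2) (subcadena1 subcadena2)

* Stack the pairs: each original row becomes up to three rows.
reshape long cadena subcadena, i(id) j(slot)

* Rows from empty shared slots carry no chain; drop them.
drop if missing(cadena)

* Sum value within each group.
collapse (sum) value, by(year flow cadena subcadena usoecon)
```

Each row's value is counted once for every chain it is shared with, which may or may not be the intended treatment. To keep the individual observations, the line from the quoted message (`bysort ... : egen double svalue = ...`) can be used in place of -collapse-.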




