Statalist


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: RE: replying to lists / keeping records: New - proposal for data documentation


From   Neil Shephard <nshephard@nhs.net>
To   statalist@hsphsun2.harvard.edu
Subject   Re: st: RE: replying to lists / keeping records: New - proposal for data documentation
Date   Fri, 28 Nov 2008 11:27:51 +0000

Allan Reese (Cefas) wrote:
> A suggestion from myself and colleagues is therefore a much simpler
> "standard" for documenting data so that it is human-usable.  It may also
> be immediately compatible between computers, but that depends on the
> file format and character coding conventions, etc.  Most data can be
> reduced to a 2-way table (or sequence of linked tables).  We therefore
> propose that a data table (think Stata dataset) should be documented
> with a second table that describes the fields (=columns=variables) as a
> Codebook, and a third table containing the discovery metadata as defined
> in ISO standards.  It's very simple, not technical, but would promote
> more computer users to notice concepts like missing values and
> procedures used when coding.
>   
Theres no real need for a secondary table describing columns and variables.

The beauty of Stata data files (if used correctly, which many of my
predecessors at my current place of work haven't done, grrr) is to use
the various -label- commands to describe the -label variables-, -label
data--, -label define/values- so the code-book is tied in with the data
itself.

More verbose comments can be attached to the data or variables
themselves with the -notes- command.

It should be habit to define these when importing and tidying data and
is good practice for reproducible research.

Neil

-- 
"We should make things as simple as possible, but not simpler" - Anon (not Albert Einstein)


***********************************************************************
This  message  may  contain  confidential and  privileged  information.
If you  are not the  intended recipient  you should not  disclose, copy
or distribute information in this e-mail or take any action in reliance
on its contents.  To do so is strictly  prohibited and may be unlawful.
Please  inform  the  sender that  this  message has  gone astray before
deleting it.  Thank you.

2008 marks the 60th anniversary of the NHS.  It's an opportunity to pay
tribute to the NHS staff and volunteers who help shape the service, and
celebrate their achievements.

If you work for the NHS  and  would like  an NHSmail  email account, go
to: www.connectingforhealth.nhs.uk/nhsmail
***********************************************************************

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index