Search
   >> Home >> Products >> Features >> Data management

Data management

Importing/exporting data

ODBC support

  • Import data from any ODBC data source, such as Oracle, SQL Server, Access, Excel, MySQL, and DB2
  • Export data to new or existing ODBC tables
  • Execute custom SQL commands individually or in batches
  • Customize ODBC connection strings
  • Support for ODBC
  • Support for VARCHARs/CLOBs and BLOBs

Built-in spreadsheet editor

  • Clipboard Preview Tool lets you control how data will be pasted
  • Manage variables with the Variables Tool
  • For Windows , Mac , and Unix

Properties window

  • Manage variables
  • Manage dataset properties
  • For Windows , Mac , and Unix

Variables Manager

  • Change storage types, names, and formats
  • Add and edit value labels
  • Attach notes to variables
  • Filter variables
  • For Windows , Mac , and Unix

Functions

Data reorganization

  • Row–column transposition
  • Data reshaping
  • Stacking of variables
  • Collapsing into means, totals, etc.

Labels

  • Dataset labels
  • Variable labels
  • Value labels (e.g., male and female for 0 and 1)
  • Ability to switch between multiple sets of data, variable, and value labels
  • Missing-value labels
  • Support for multiple languages

Notes

  • Extensive notes can be attached to a dataset

Data snapshots

  • Allow multiple levels of undo to modified datasets

Automatic memory management

  • Up to 1 TB of RAM supported
  • Up to 32,767 variables
  • Up to 2 billion observations

Sorting

  • Ascending or descending sorts
  • Multiple-key sorts
  • Numeric and string sorts

Combining datasets

  • Merge datasets
    • By key variables
    • By observations
  • Join datasets
  • Outer join
  • Append datasets
  • Append time series

Special datasets

PDF and image output

  • Export results to PDF files on Windows and Mac
  • Export results to PostScript files
  • Save graphs as PDFs on Windows and Mac
  • Save graphs to EPS or TIF files for publication
  • Save graphs to PNG files for the web

Utilities

  • Compress (make dataset as small as possible without loss of accuracy)
  • Count number of observations that satisify specified conditions
  • Formatted and unformatted disk I/O
  • Zip-file support
  • Custom filters to manipulate text files

Variable management

  • Generation of new variables
  • Replacement of existing variables
  • Renaming variables
  • Encoding and decoding string variables
  • Reordering variables in dataset

Dataset utilities

  • Flexible description of variables, labels, and types
  • List values of variables
  • Data signatures to verify the integrity of datasets
  • Codebooks for variables
  • Value-label reports
  • Duplicates and missing values tables

Variable types

  • Numeric storage types
    • Byte
    • Integer (int)
    • Long
    • Float
    • Double
  • String (including very long strings and BLOBs)
  • Dates and times
  • Business calendars

Long string support

  • Up to 2 billion character long string
  • Coalescing of duplicate values to save memory
  • Binary 'strings' (BLOBs)
  • Import and export entire files into long strings/BLOBs

Stored results

  • Save results to disk for later use
  • Store estimation results in memory
  • Create tables to compare results

Additional resources

See New in Stata 13 for more about what was added in Stata 13.

The Stata Blog: Not Elsewhere Classified Find us on Facebook Follow us on Twitter LinkedIn Google+ Watch us on YouTube