st: RE: Cyrillic texts in Stata
> -----Original Message-----
> From: firstname.lastname@example.org
> [mailto:email@example.com] On Behalf Of
> Oleh Protsyk
> Sent: 11 December 2006 17:32
> To: firstname.lastname@example.org
> Subject: st: Cyrillic texts in Stata
> Dear Statalisters,
> Is there any way I can make Stata recognize Cyrillic texts?
This method isn't great, but it works.
If you convert your text to a single-byte Cyrillic encoding such as
KOI-8, Windows Cyrillic (CP1251) or DOS Cyrillic (CP866), Stata will be
able to read it as text and display it properly, provided you also have
a font for that encoding installed. The reason is that these encodings
are "extended ASCII": every character is a single 8-bit code in the
range 0-255, with plain ASCII in 0-127 and the Cyrillic letters in
128-255. Stata supports 8-bit codes.
Just convert the text you are using into one of these encoding schemes,
and set the Stata display font to use a font employing the same scheme,
and it should work.
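
In case it helps, here is a minimal sketch of the conversion step, done
outside Stata in Python. It assumes your source text is UTF-8; the
function names and the file names in the usage note are hypothetical,
and "koi8_r" is just one member of the KOI-8 family. It only
illustrates that each Cyrillic letter becomes a single byte in the
128-255 range. It is not something Stata itself provides.

```python
from pathlib import Path


def to_single_byte(text: str, encoding: str = "cp1251") -> bytes:
    """Encode text in a single-byte Cyrillic code page.

    Characters that the code page cannot represent are replaced
    with '?' instead of raising an error.
    """
    return text.encode(encoding, errors="replace")


def convert_file(src: str, dst: str,
                 src_encoding: str = "utf-8",
                 dst_encoding: str = "cp1251") -> None:
    """Re-encode a whole text file (assumed UTF-8 by default)."""
    text = Path(src).read_text(encoding=src_encoding)
    Path(dst).write_bytes(to_single_byte(text, dst_encoding))


if __name__ == "__main__":
    sample = "Привет"
    for enc in ("cp1251", "koi8_r", "cp866"):
        raw = to_single_byte(sample, enc)
        # One byte per letter, every byte in the upper half (128-255).
        print(enc, len(raw), all(b >= 0x80 for b in raw))
```

For a file you would call something like
convert_file("manifesto_utf8.txt", "manifesto_cp1251.txt") and then read
the second file into Stata with the display font set to a CP1251 font.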
> I'd like to use Stata for computer analysis of party manifestos and other
> political texts. Latin-based texts have previously been analyzed in
> Stata with the help of a special program called 'wordscores',
> designed by Michael Laver and his colleagues (the basic logic:
> an analyst designates some texts as reference texts and then
> compares other texts to the reference documents by using word
> count procedures).
> At present, Stata does not treat Cyrillic input as text.
> Any advice on whether this problem can be addressed would be welcome.
> Oleh Protsyk
> Senior Research Associate
> European Centre for Minority Issues
> Schiffbruecke 12
> D-24939 Flensburg, Germany
> Tel. + 49-(0)461-14149-64
> Fax. + 49-(0)461-14149-19
> E-mail: email@example.com
> * For searches and help try:
> * http://www.stata.com/support/faqs/res/findit.html
> * http://www.stata.com/support/statalist/faq
> * http://www.ats.ucla.edu/stat/stata/