Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, statalist.org is already up and running.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: Convert string with special characters to lower case


From   Jorge Eduardo Pérez Pérez <perez.jorge@ur.edu.co>
To   <statalist@hsphsun2.harvard.edu>
Subject   st: Convert string with special characters to lower case
Date   Wed, 25 Apr 2012 13:54:56 -0400

I have some text in Spanish that includes accents and special
characters in uppercase, e.g
"ZUÑIGA", "RODRÍGUEZ"
and I would like to convert it to proper case. Stata 12.1 in a Windows
machine doesn't do it correctly: if I apply -proper- to the previous
text, I get:
"ZuÑiga", "RodrÍguez"

The correct conversion could be achieved by getting the positions of
the accented characters, replacing the accented versions of the
characters with their unaccented versions using -subisntr- or -regex-,
converting it to proper case, and then putting the accents back using
the previous positions and -substr-. However, this would require some
loops over observations and over characters. It seems terribly
cumbersome and inefficient for large datasets.

Does anyone know a better way to achieve this?

Thank you,
______________________
Jorge Eduardo Pérez Pérez

PD: I am hoping that the Spanish characters make it through the plain
text encoding and are displayed correctly when you get this e-mail. If
not, then probably this e-mail will not make much sense, specially to
speakers of languages without special characters. Sorry about that.


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index