Alphabetics with respect to a given locale

candide candide at free.invalid
Sat Apr 2 09:18:18 EDT 2011


On 01/04/2011 22:55, candide wrote:
> How can I retrieve the list of all characters defined as alphabetic for the
> current locale?


Thanks for the responses. Alas, neither solution works.

Under Ubuntu:

# ----------------------
import string
import locale

print locale.getdefaultlocale()
print locale.getpreferredencoding()

locale.setlocale(locale.LC_ALL, "")

print string.letters

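# Build a regex character class from every BMP code point whose
# unicode.isalpha() is true (a Unicode property, not a locale one).
letter_class = u"[" + u"".join(
    unichr(c) for c in range(0x10000) if unichr(c).isalpha()) + u"]"

# Printing is disabled: the output is a flood of characters.
#print letter_class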
# ----------------------

prints the following:


('fr_FR', 'UTF8')
UTF-8
ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz


I commented out the printing of letter_class because it outputs a flood of
characters that do not belong to the usual French character set.
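
The flood is to be expected, since isalpha() on a Unicode string consults
the Unicode character database rather than the locale. Here is a minimal
sketch of that behaviour, written in Python 3 syntax (an assumption: there,
str is Unicode and chr replaces unichr). The helper name
alphabetic_bmp_chars is hypothetical, and the sample code points were chosen
only for illustration.

```python
import locale
import unicodedata

def alphabetic_bmp_chars():
    """Return every BMP character for which str.isalpha() is true."""
    return [chr(c) for c in range(0x10000) if chr(c).isalpha()]

# Select the user's locale; ignore the error if it is not installed.
try:
    locale.setlocale(locale.LC_ALL, "")
except locale.Error:
    pass

letters = set(alphabetic_bmp_chars())

# The count is in the tens of thousands whatever the locale is.
print(len(letters))

# e acute, eth, Cyrillic ya and Greek omega are all reported as alphabetic.
for ch in ("\u00e9", "\u00d0", "\u044f", "\u03a9"):
    print(unicodedata.name(ch), ch in letters)
```

So setting the locale does not narrow the result: the test answers "is this
a letter in some script?", not "is this a letter of my language?".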


More or less the same problem occurs under Windows: for instance,
string.letters includes the "latin capital letter eth" as an alphabetic
character (which is wrong for French, since we never use this letter in
genuine French words).
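
Failing a locale-aware answer, one workaround is to spell out the alphabet
by hand. The following is only a sketch in Python 3 syntax: FRENCH_EXTRA is
a hand-written list of accented letters and ligatures that I assume covers
French spelling, and the names FRENCH_EXTRA, ASCII_LOWER and french_letters
are hypothetical. Nothing here is provided by the locale or string modules.

```python
import re

# Assumption: hand-maintained list of French accented letters and ligatures.
FRENCH_EXTRA = "àâæçéèêëîïôœùûüÿ"
ASCII_LOWER = "abcdefghijklmnopqrstuvwxyz"

def french_letters():
    """Return the sorted lower- and upper-case letters of the assumed alphabet."""
    lower = ASCII_LOWER + FRENCH_EXTRA
    return sorted(set(lower + lower.upper()))

# Character class built from the explicit list instead of isalpha().
letter_class = "[" + "".join(french_letters()) + "]"
word = re.compile(letter_class + "+")

print(len(french_letters()))                        # 84
print(bool(word.fullmatch("\u0152uvre")))           # True: OE ligature accepted
print(bool(word.fullmatch("\u00d0")))               # False: eth rejected
```

The obvious drawback is that the list has to be maintained by hand for each
language.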





