Can Python be used for UTF/Double Bytes/Asian Characters?

David LeBlanc whisper at oz.nospamnet
Fri Jun 1 17:30:46 EDT 2001


In article <5f28b650.0105310946.1298918b at posting.google.com>, 
akiyomiya at hotmail.com says...
> Does Python have a module or some sort of class that can be used for
> UTF/Double Bytes/Asian Characters?
> 
> I heard that Perl has a module that you can use for UTF/Double
> Bytes/Asian Characters.  So I am wondering if Python works same way or
> better.
> 
> We are modifying a search (Inktomi), so people can input UTF/Double
> Bytes/Asian Characters and search for some documents.
> 
> Thank you in advance.
> 
> AK
> 
I don't know about DBCS or Asian characters, but Unicode is a standard 
feature in Python 2.1 (and maybe also Python 2.0, but I'm only sure about 
2.1). I don't know whether the regular expression module has been made 
Unicode-aware. If it hasn't, you might want to look into Tcl, which does 
have a complete Unicode implementation, including regular expressions. 
From what I've read in the comp.lang.tcl newsgroup, Tcl seems quite well 
regarded in the Asian language community.
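
As a rough illustration of what Unicode handling looks like in practice, 
here is a minimal sketch. It assumes a current Python 3 interpreter rather 
than the 2.x releases discussed above, and the sample strings and the 
helper names round_trip and find_words are made up for the example. It 
round-trips a Japanese string through UTF-8 and Shift_JIS (a double-byte 
encoding), then runs a regular expression over mixed text.

```python
import re

# Hypothetical sample query: six Japanese characters.
query = "日本語の文書"


def round_trip(text, encoding):
    """Encode text to bytes in the given encoding, then decode it back."""
    return text.encode(encoding).decode(encoding)


def find_words(text):
    """Return runs of word characters; on str patterns, \\w is Unicode-aware."""
    return re.findall(r"\w+", text)


if __name__ == "__main__":
    # The same text occupies a different number of bytes per encoding.
    print(len(query))                      # 6 characters
    print(len(query.encode("utf-8")))      # 18 bytes (3 per character)
    print(len(query.encode("shift_jis")))  # 12 bytes (2 per character)

    # Both encodings reproduce the original string.
    print(round_trip(query, "utf-8") == query)
    print(round_trip(query, "shift_jis") == query)

    # The regular expression matches ASCII and Japanese words alike.
    print(find_words("Python は 日本語 OK"))
```

Whether this is enough for a search front end depends on which encodings 
the incoming queries actually use, which has to be known or detected 
before decoding.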

Hopefully Python will suit you though.

Dave LeBlanc


