unicode mystery

John Lenton john at grulic.org.ar
Tue Jan 11 00:23:04 EST 2005


On Mon, Jan 10, 2005 at 07:48:44PM -0800, Sean McIlroy wrote:
> I recently found out that unicode("\347", "iso-8859-1") is the
> lowercase c-with-cedilla, so I set out to round up the unicode numbers
> of the extra characters you need for French, and I found them all just
> fine EXCEPT for the o-e ligature (oeuvre, etc). I examined the unicode
> characters from 0 to 900 without finding it; then I looked at
> www.unicode.org but the numbers I got there (0152 and 0153) didn't
> work. Can anybody put a help on me wrt this? (Do I need to give a
> different value for the second parameter, maybe?)

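œ isn't part of ISO 8859-1, so you can't get it by decoding a byte with
that codec. The numbers you found at unicode.org (0152 and 0153) are
hexadecimal code points: U+0152 is Œ and U+0153 is œ. You can do
one of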

   u'\u0153'

or, if you must,

   unicode("\305\223", "utf-8")
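
For anyone reading this on Python 3, here is a rough sketch of the same
idea. It is not part of the original exchange: it assumes Python 3, where
str is already Unicode and the unicode() built-in no longer exists, and
the variable names are made up for illustration.

```python
import unicodedata

# Escape using the hexadecimal code point U+0153.
oe = "\u0153"

# The same character decoded from its two UTF-8 bytes (octal 305 223 = 0xC5 0x93).
from_utf8 = b"\305\223".decode("utf-8")

# 0x153 is 339 in decimal, so chr(339) gives the same character.
from_decimal = chr(339)

# Lookup by official Unicode character name.
from_name = unicodedata.lookup("LATIN SMALL LIGATURE OE")

# c-with-cedilla does exist in ISO 8859-1, at byte 0xE7 (octal 347).
c_cedilla = b"\347".decode("iso-8859-1")

# The oe ligature has no ISO 8859-1 byte, so encoding it with that codec fails.
try:
    oe.encode("iso-8859-1")
    latin1_ok = True
except UnicodeEncodeError:
    latin1_ok = False

print(oe == from_utf8 == from_decimal == from_name)  # True
print(latin1_ok)                                     # False
```

The failed encode is the reason no second argument to the ISO 8859-1
decode will ever produce œ: the character simply has no slot in that
character set.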

-- 
John Lenton (john at grulic.org.ar) -- Random fortune:
Lisp, Lisp, Lisp Machine,
Lisp Machine is Fun.
Lisp, Lisp, Lisp Machine,
Fun for everyone.