not quite 1252

Fredrik Lundh fredrik at pythonware.com
Wed Apr 26 08:42:10 EDT 2006


Anton Vredegoor wrote:

> I'm trying to import text from an open office document (save as .sxw and
>  read the data from content.xml inside the sxw-archive using
> elementtree and such tools).
>
> The encoding that gives me the least problems seems to be cp1252,
> however it's not completely perfect because there are still characters
> in it like \93 or \94. Has anyone handled this before?

this might help:

    http://effbot.org/zone/unicode-gremlins.htm

</F>








More information about the Python-list mailing list