codec to parse raw UCS data?

Paul Watson pwatson at knightsbridge.com
Tue Aug 19 19:23:35 EDT 2003


"Fredrik Lundh" <fredrik at pythonware.com> wrote in message
news:mailman.1061308184.25736.python-list at python.org...
> Oleg Leschov wrote:
>
> > Where can I find a list and documentation for codecs?
> > What I want to do is to make a unicode string out of unicode data.
> > for example. I am parsing NTFS metadata, that contains filenames as
> > UCS-2 code, so I have a binary string that pretends to be a UCS-2.
>
> the "utf-16-le" codec is probably what you want.
>
> (utf-16 is basically ucs-2 plus mechanisms to encode characters outside
> the 16-bit BMP set; IIRC, Windows 2k and later uses utf-16, not ucs-2).
>
> </F>

UTF-16
DSCS (Double Short Character Set)






More information about the Python-list mailing list