Where is the ucs-32 codec?

beni.cherniavsky at gmail.com beni.cherniavsky at gmail.com
Sun Jun 4 18:23:05 EDT 2006


Python seems to be missing a UCS-32 codec, even in wide builds (not
that it the build should matter).
Is there some deep reason or should I just contribute a patch?

If it's just a bug, should I call the codec 'ucs-32' or 'utf-32'?  Or
both (aliased)?
There should be  '-le' and '-be' variats, I suppose.  Should there be a
variant without explicit endianity, using a BOM to decide (like
'utf-16')?
And it should combine surrogates into valid characters (on all builds),
like the 'utf-8' codec does, right?

--
Beni Cherniavsky <cben at users.sf.net>, who can only read email on
weekends.




More information about the Python-list mailing list