[Python-Dev] Internationalization Toolkit

Tim Peters tim_one@email.msn.com
Tue, 16 Nov 1999 01:19:16 -0500


[MAL]
> sys.bom should return the byte order mark (BOM) for the format used
> internally. The unicodec module should provide symbols for all
> possible values of this variable:
>
>   BOM_BE: '\376\377' 
>     (corresponds to Unicode 0x0000FEFF in UTF-16 
>      == ZERO WIDTH NO-BREAK SPACE)
>
>   BOM_LE: '\377\376' 
>     (corresponds to Unicode 0x0000FFFE in UTF-16 
>      == illegal Unicode character)
>
>   BOM4_BE: '\000\000\377\376'
>     (corresponds to Unicode 0x0000FEFF in UCS-4)

Should be
    BOM4_BE: '\000\000\376\377'   
 
>   BOM4_LE: '\376\377\000\000'
>     (corresponds to Unicode 0x0000FFFE in UCS-4)

Should be
    BOM4_LE: '\377\376\000\000'