[Patches] [ python-Patches-1177307 ] UTF-8-Sig codec

SourceForge.net noreply at sourceforge.net
Sun Jan 8 11:46:06 CET 2006


Patches item #1177307, was opened at 2005-04-05 21:26
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1177307&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Walter Dörwald (doerwalter)
Assigned to: Martin v. Löwis (loewis)
Summary: UTF-8-Sig codec

Initial Comment:
This patch implements a UTF-8-Sig codec. This codec
works like UTF-8 but adds a BOM on writing and skips
(at most) one BOM on reading.
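
As a rough illustration of the behaviour described above, here is a minimal sketch. It assumes the codec is registered under the name "utf-8-sig", as it is in current Python 3; it is not taken from the patch itself, and the variable names are only for illustration.

```python
import codecs

text = "spam"

# Writing: the codec prepends the UTF-8 encoded BOM (EF BB BF).
encoded = text.encode("utf-8-sig")

# Reading: a leading BOM is skipped if present ...
decoded = encoded.decode("utf-8-sig")

# ... and input without a BOM is decoded like plain UTF-8.
plain = b"spam".decode("utf-8-sig")

# At most one BOM is skipped: a second one survives as U+FEFF.
double = (codecs.BOM_UTF8 * 2 + b"spam").decode("utf-8-sig")

print(repr(encoded), repr(decoded), repr(plain), repr(double))
```

The plain "utf-8" codec, by contrast, writes no BOM and leaves a leading BOM in the decoded text as U+FEFF.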

----------------------------------------------------------------------

>Comment By: Martin v. Löwis (loewis)
Date: 2006-01-08 11:46

Message:
Logged In: YES 
user_id=21627

Thanks for the patch. Committed as 41977.

----------------------------------------------------------------------

Comment By: Walter Dörwald (doerwalter)
Date: 2005-12-26 17:51

Message:
Logged In: YES 
user_id=89016

OK, here's a text that explains what the BOM is used for in
various Unicode encodings. I hope that this can be turned
into something useful.

----------------------------------------------------------------------

Comment By: Martin v. Löwis (loewis)
Date: 2005-08-09 20:53

Message:
Logged In: YES 
user_id=21627

The place is right, but I feel this documentation is
still incomplete. The library reference should explain
somewhere what the difference between utf-8 and utf-8-sig
is. Perhaps a footnote could be added, but I think I would
prefer a separate subsection on the BOM, explaining byte
order in UTF-{16,32}, and how the BOM can be used as a magic
signature for UTF-8.
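
To make the two roles of the BOM concrete, the following sketch may help. It uses only constants from the standard codecs module; the helper name sniff_encoding is hypothetical and not part of the patch or the library. It assumes current Python 3 behaviour.

```python
import codecs

# The generic "utf-16" codec writes a BOM in the platform's native
# byte order, so a reader can tell which order was used.
generic = "A".encode("utf-16")
has_bom = generic[:2] in (codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)

# The explicit-endian variants fix the byte order and write no BOM.
little = "A".encode("utf-16-le")  # b"A\x00"
big = "A".encode("utf-16-be")     # b"\x00A"

# UTF-8 has no byte order; its "BOM" is just U+FEFF encoded as UTF-8
# and can only serve as a signature.
signature = "\ufeff".encode("utf-8")


def sniff_encoding(data, default="utf-8"):
    """Hypothetical helper: guess a codec name from a leading BOM."""
    # UTF-32 must be checked before UTF-16, because the UTF-32-LE BOM
    # begins with the same two bytes as the UTF-16-LE BOM.
    candidates = (
        (codecs.BOM_UTF32_LE, "utf-32"),
        (codecs.BOM_UTF32_BE, "utf-32"),
        (codecs.BOM_UTF8, "utf-8-sig"),
        (codecs.BOM_UTF16_LE, "utf-16"),
        (codecs.BOM_UTF16_BE, "utf-16"),
    )
    for bom, name in candidates:
        if data.startswith(bom):
            return name
    return default
```

A caller could then decode with data.decode(sniff_encoding(data)); data without any BOM falls back to the default, which is only a guess.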

----------------------------------------------------------------------

Comment By: Walter Dörwald (doerwalter)
Date: 2005-08-09 15:41

Message:
Logged In: YES 
user_id=89016

This version (diff3.txt) of the patch adds a note to
Misc/NEWS and a section to Doc/lib/libcodecs.tex. Is this
the correct place to add the documentation?

----------------------------------------------------------------------

Comment By: Martin v. Löwis (loewis)
Date: 2005-08-07 23:51

Message:
Logged In: YES 
user_id=21627

The patch looks fine, but lacks documentation changes.

----------------------------------------------------------------------

Comment By: Walter Dörwald (doerwalter)
Date: 2005-04-05 22:28

Message:
Logged In: YES 
user_id=89016

This second version of the patch returns the initial bytes
immediately if they don't look like a BOM.
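
The effect can be observed with an incremental decoder. This is a sketch against the codec as it ships in current Python 3, on the assumption that it behaves like this version of the patch; it is not the patch code itself.

```python
import codecs

make_decoder = codecs.getincrementaldecoder("utf-8-sig")

# Input that cannot be the start of a BOM is returned right away.
d1 = make_decoder()
first = d1.decode(b"ab")

# Input that might be the start of a BOM is held back until the
# decoder has seen enough bytes to decide.
d2 = make_decoder()
step1 = d2.decode(b"\xef")      # could be a BOM: nothing returned yet
step2 = d2.decode(b"\xbb")      # still undecided
step3 = d2.decode(b"\xbfhi")    # BOM complete: skipped, rest decoded

print(repr(first), repr(step1), repr(step2), repr(step3))
```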

----------------------------------------------------------------------
