[Distutils] PEP 376 comments

Sridhar Ratnakumar sridharr at activestate.com
Mon Jun 8 19:36:23 CEST 2009


On 09-06-08 02:58 AM, Tarek Ziadé wrote:
>> >
>> >  * Should the PEP specify the encoding of text-files? PEP314 doesn't seem to specify the encoding of PKG-INFO files, which can cause problems when a field contains data that isn't ASCII.
>
> The encoding used is utf-8 since 2.6.  I think we should rather update
> PEP 314, and mention it in the upcoming PEP 345 as well,

For the python-wifi-0.3.1 package, I noticed that PKG-INFO uses 'latin1' 
encoding.

 > grep Author python_wifi.egg-info/PKG-INFO | head -n 1
Author: Róman Joost

 > grep Author python_wifi.egg-info/PKG-INFO | hexdump -C | head -n 1
00000000  41 75 74 68 6f 72 3a 20  52 f3 6d 61 6e 20 4a 6f  |Author: R.man Jo|

Note: latin1 encodes "ó" as the single byte "f3", while utf-8 encodes it
as the two bytes "c3 b3".
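
The same byte difference can be reproduced outside the shell. This is a
minimal Python 3 sketch, not part of the session above; the variable
names are only illustrative.

```python
# U+00F3 is the character behind the f3 byte in the hexdump above.
char = "\u00f3"

latin1_bytes = char.encode("latin-1")
utf8_bytes = char.encode("utf-8")

print(latin1_bytes.hex())  # f3
print(utf8_bytes.hex())    # c3b3

# A lone f3 byte is not valid utf-8, so a strict utf-8 reader rejects it.
try:
    latin1_bytes.decode("utf-8")
    valid_utf8 = True
except UnicodeDecodeError:
    valid_utf8 = False
print(valid_utf8)  # False
```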


PKG-INFO is perhaps not in utf-8 because setup.py contains
"# -*- coding: latin1 -*-". Cf. PEP 0263.

Indeed, even 'python setup.py --author' emits latin1-encoded bytes (not 
utf-8).
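
A tool that reads PKG-INFO could cope with both encodings by trying
utf-8 first and falling back to latin1. This is only a sketch of one
possible approach; decode_metadata is a hypothetical helper, not a
distutils function, and the fallback is a guess about what the author
intended.

```python
def decode_metadata(raw: bytes) -> str:
    """Decode PKG-INFO bytes, preferring utf-8 and falling back to latin-1."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # latin-1 maps every byte to a code point, so this cannot raise,
        # but the result is only correct if the file really was latin-1.
        return raw.decode("latin-1")


# The 16 bytes shown in the hexdump above.
line = bytes.fromhex("417574686f723a2052f36d616e204a6f")
print(decode_metadata(line))
```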

