[New-bugs-announce] [issue13025] mimetypes should read the rule file using UTF-8, not the locale encoding

STINNER Victor report at bugs.python.org
Wed Sep 21 01:07:07 CEST 2011


New submission from STINNER Victor <victor.stinner at haypocalc.com>:

On Debian and Ubuntu, /etc/mime.types file is pure ASCII, but on Fedora 15 it contains a non-ASCII character, ³ (U+00B3), in the line:
"application/vnd.geocube+xml                     g3 g³"

And the file is encoded in UTF-8.

That's why Python should read this file from UTF-8 instead of the locale encoding, because the locale encoding can be ASCII. Attached patch implements this idead.

I think that it is a bug and so it should also be fixed in Python 3.2.

(Python 2.7 reads the file in binary mode, it doesn't care of the encoding.)

----------
files: mimetypes_encoding.patch
keywords: patch
messages: 144357
nosy: haypo, sandro.tosi, terry.reedy
priority: normal
severity: normal
status: open
title: mimetypes should read the rule file using UTF-8, not the locale encoding
versions: Python 3.2, Python 3.3
Added file: http://bugs.python.org/file23219/mimetypes_encoding.patch

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue13025>
_______________________________________


More information about the New-bugs-announce mailing list