[issue38861] zipfile: Corrupts filenames containing non-UTF8 characters

Jon Nalley report at bugs.python.org
Sun Nov 24 20:04:39 EST 2019


Jon Nalley <code at bluebot.org> added the comment:

I think the Python implementation is adhering to the zip specification.

>From the specification v6.3.6 (Revised: April 26, 2019):

If general purpose bit 11 is unset, the file name and comment SHOULD conform 
to the original ZIP character encoding.  If general purpose bit 11 is set, the 
filename and comment MUST support The Unicode Standard, Version 4.1.0 or 
greater using the character encoding form defined by the UTF-8 storage 
specification.

https://pkware.cachefly.net/webdocs/casestudies/APPNOTE.TXT

----------
nosy: +jnalley

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue38861>
_______________________________________


More information about the Python-bugs-list mailing list