[issue41928] ZipFile does not supports Unicode Path Extra Field (0x7075) zip header field

Ivan Sorokin report at bugs.python.org
Sun Oct 4 07:21:41 EDT 2020


New submission from Ivan Sorokin <ivan.sorokin.tech at gmail.com>:

See attached sample. Well-known unzip command line tool lists its contents correctly:

$ unzip -l 23.zip
Archive:  23.zip
  Length      Date    Time    Name
---------  ---------- -----   ----
    81408  2012-10-23 19:03   Β' ΦΑΣΗ ΠΕ06 ΣΧΟΛΕΙΑ ΕΑΕΠ (ΙΝΤ).xls
---------                     -------
    81408                     1 file

But ZipFile lists the same file inside this archive as
ü' öÇæå Åä06 æòÄèäêÇ äÇäÅ (êîÆ).xls

It's because ZipFile completely ignores Unicode Path Extra Field (0x7075) zip header field.

See .ZIP specification for details on this field meaning and usage:
https://pkware.cachefly.net/webdocs/casestudies/APPNOTE.TXT

----------
components: Library (Lib)
files: 23.zip
messages: 377931
nosy: ivan.sorokin.tech
priority: normal
severity: normal
status: open
title: ZipFile does not supports Unicode Path Extra Field (0x7075) zip header field
type: enhancement
versions: Python 3.10
Added file: https://bugs.python.org/file49491/23.zip

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue41928>
_______________________________________


More information about the Python-bugs-list mailing list