[issue8633] tarfile doesn't support undecodable filename in PAX format

Martin v. Löwis report at bugs.python.org
Thu May 6 00:32:26 CEST 2010


Martin v. Löwis <martin at v.loewis.de> added the comment:

I think you are misinterpreting the spec. A PAX file MUST encode its file names in UTF-8. The "invalid" flag only applies when these invalid names cannot map to file names - either because they are not supported in the locale, or because they are not supported by the file system on which you want to extract the files (e.g. if they contain a colon ':' and you try to extract to a FAT filesystem).

The case that the file names are not actually in UTF-8 in the PAX file is a format error, just like any other format error in the file.

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue8633>
_______________________________________


More information about the Python-bugs-list mailing list