[issue8390] tarfile: use surrogates for undecodable fields
Martin v. Löwis
report at bugs.python.org
Mon May 3 21:51:57 CEST 2010
Martin v. Löwis <martin at v.loewis.de> added the comment:
I think it is helpful to read the pax specification here:
http://www.opengroup.org/onlinepubs/009695399/utilities/pax.html
pax defines (IIUC) that all strings in a pax-compliant tar file are UTF-8 encoded. For the "invalid" option, they offer the alternatives bypass, rename, UTF-8, and write. It may be useful to provide the same options, in some form.
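For illustration only, here is a minimal sketch of what "surrogates for undecodable fields" could look like with today's tarfile module. It assumes Python 3, where tarfile.open() accepts encoding and errors arguments. The helper name roundtrip_member_name is hypothetical, and the sketch does not implement the pax "invalid" actions listed above.

```python
import io
import tarfile


def roundtrip_member_name(raw_name: bytes) -> str:
    """Store raw_name bytes in a tar header, then read the name back as UTF-8.

    Hypothetical helper for illustration; it is not part of tarfile.
    """
    buf = io.BytesIO()
    # Writing with latin-1 lets arbitrary bytes reach the header unchanged,
    # since every byte value maps to exactly one latin-1 character.
    with tarfile.open(fileobj=buf, mode="w", format=tarfile.GNU_FORMAT,
                      encoding="latin-1") as tar:
        tar.addfile(tarfile.TarInfo(raw_name.decode("latin-1")))
    buf.seek(0)
    # Reading as UTF-8 with surrogateescape maps each undecodable byte to a
    # lone surrogate instead of raising UnicodeDecodeError.
    with tarfile.open(fileobj=buf, mode="r", encoding="utf-8",
                      errors="surrogateescape") as tar:
        return tar.getnames()[0]


name = roundtrip_member_name(b"caf\xe9.txt")  # 0xE9 alone is not valid UTF-8
# The original bytes can be recovered by encoding with the same error handler.
original = name.encode("utf-8", errors="surrogateescape")
```

The undecodable byte survives as a lone surrogate in the decoded name, so the original header bytes remain recoverable.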
----------
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue8390>
_______________________________________