[issue8390] tarfile: use surrogates for undecode fields

Martin v. Löwis report at bugs.python.org
Mon May 3 21:51:57 CEST 2010


Martin v. Löwis <martin at v.loewis.de> added the comment:

I think it is helpful to read the pax specification here:

http://www.opengroup.org/onlinepubs/009695399/utilities/pax.html

pax defines (IIUC) that all strings in a pax-compliant tar file are UTF-8 encoded. For the "invalid" option, they offer the alternatives bypass, rename, UTF-8, and write. It may be useful to provide the same options, in some form.

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue8390>
_______________________________________


More information about the Python-bugs-list mailing list