BadZipfile "file is not a zip file"
MRAB
google at mrabarnett.plus.com
Fri Jan 9 18:13:20 EST 2009
webcomm wrote:
> On Jan 8, 8:39 pm, "James Mills" <prolo... at shortcircuit.net.au> wrote:
>> Send us a sample of this file in question...
>
> Here's a sample with some dummy data from the web service:
> http://webcomm.webfactional.com/htdocs/data.zip
>
> That's the zip created in this line of my code...
> f = open('data.zip', 'wb')
>
> If I open the file it contains as unicode in my text editor (EditPlus)
> on Windows XP, there is ostensibly nothing wrong with it. It looks
> like valid XML. But if I return it to my browser with python+django,
> there are bad characters every other character.
>
> If I unzip it like this...
> popen("unzip data.zip")
> ...then the bad characters are 'FFFD' characters as described and
> pictured here...
> http://groups.google.com/group/comp.lang.python/browse_thread/thread/...
>
> If I unzip it like this...
> getzip('data.zip', ignoreable=30000)
> ...using Scott's function at...
> http://groups.google.com/group/comp.lang.python/msg/c2008e48368c6543
> ...then the bad characters are \x00 characters.
>
I can unzip it in Windows XP. The file within it (called "data") is XML
encoded as UTF-16LE (2 bytes per character, low byte first), but without
the initial byte order mark. Python's zipfile module says "BadZipfile:
File is not a zip file".
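
To illustrate the encoding point, here is a minimal sketch in Python 3, not the code from this thread. It assumes a made-up XML string in place of the real web-service data and builds a small archive in memory with a member named "data". It shows why UTF-16LE text looks like it has a \x00 after every character when read as single-byte text, and that naming the byte order explicitly when decoding recovers the text even though there is no byte order mark. It does not reproduce or explain the BadZipfile error itself.

```python
import io
import zipfile

# Hypothetical stand-in for the XML in the archive member named "data".
sample_text = '<?xml version="1.0"?><root>dummy</root>'

# The "utf-16-le" codec writes two bytes per character, low byte first,
# and does not prepend a byte order mark.
raw = sample_text.encode("utf-16-le")

# For ASCII-range characters the high byte is zero, so every second byte
# is \x00 when the data is viewed as single-byte text.
print(raw[:8])

# Build a small archive in memory so the sketch needs no file on disk.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("data", raw)

# Read the member back and decode it with the byte order stated explicitly.
with zipfile.ZipFile(buffer) as archive:
    decoded = archive.read("data").decode("utf-16-le")

print(decoded)
```

Naming "utf-16-le" explicitly avoids depending on a byte order mark that, in this file, is not there.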