extra gibberish interspersed into urllib2 output

Dan Stromberg strombrg at dcs.nac.uci.edu
Fri Oct 1 19:11:50 EDT 2004


I'm attempting to retrieve some data from an http server using basic auth
via python 2.3 with the urllib2 and cookielib modules.

I'm finding that I'm getting the data I need, but unfortunately, there are
small bits of gibberish interspersed in it, rendering the data difficult
to use at best.  For example:

p06,128.200.73.146,foobar,,,,,ES Servers,,li,,,,,
p07,128.200.73.147,foobar,,,,,
ffb
ES Servers,,li,,,,,
webmail2,128.200.224.22,foobar ,,,,,ES Servers,,li,blackhole:1,,,,

IOW, that "ffb" does not belong in the middle of the 2nd line of what
should be a 3 line snippet.  There are also some spurious carriage returns
in there I believe, which may not show up in this message.

Has anyone seen this before?  Is it premature to start using urllib2 from
python 2.4?  Is it a bad idea to use this 2.4 module on python 2.3?

BTW, when I cut out the cookielib stuff, I still get the same strange
results.

On a bit of a bizarre note, mozilla is also unable to display this page,
however mozilla simply shows no content instead of adding in nonsense. 
links (the text mode web browser) however displays the content of the page
just as it should.

TIA for any suggestions you can offer.





More information about the Python-list mailing list