What am I doing wrong with urllib.urlopen()?

Hernan M. Foffani hfoffani at yahoo.com
Sun Mar 24 09:48:58 EST 2002


I wrote:
> Polish the page with HTML Tidy first, then strip all script code tags,
> parse the resulting HTML to catch the rows you're interested in
> and save them into a DB.

Oops... There is no need to strip client-side script code from the
page after all. I was confused by a problem I had with another tool
several months ago. I tried htmllib.HTMLParser(...) from Python 2.2 and
it works as expected, even with the script code left in.
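
For illustration, here is a minimal sketch of the "parse the HTML and
catch the rows" step with the script code left in place. It is not the
code from the original test: htmllib was removed in Python 3, so the
sketch swaps in html.parser.HTMLParser from the standard library. The
names RowCollector and extract_rows and the sample page are made up for
the example, and it assumes the page has already been tidied so that
rows and cells are properly closed.

```python
from html.parser import HTMLParser


class RowCollector(HTMLParser):
    """Collect the text of every table row as a list of cell strings.

    Hypothetical helper for illustration; assumes well-formed
    <tr>/<td> markup (for example, output of HTML Tidy).
    """

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text chunks of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text is kept only while inside a cell, so anything inside
        # <script> elements (delivered as plain data) is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def extract_rows(html):
    """Return the table rows found in an HTML string."""
    parser = RowCollector()
    parser.feed(html)
    parser.close()
    return parser.rows


# Made-up sample page: the script contains markup that looks like a row.
SAMPLE = """<html><head>
<script type="text/javascript">
document.write("<tr><td>not a real row</td></tr>");
</script></head>
<body><table>
<tr><td>alpha</td><td>1</td></tr>
<tr><td>beta</td><td>2</td></tr>
</table></body></html>"""

rows = extract_rows(SAMPLE)
```

The parser hands the body of a script element to handle_data() as plain
text, so markup written inside the script does not show up as tags and
the rows list holds only the two real table rows. Saving them into a DB
is a separate step and is left out here.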

Regards,
-Hernan




