Beginner: HTML Parsing

Kragen Sitaker kragen at pobox.com
Fri May 17 03:14:38 EDT 2002


"J. David Lashar" <dlashar at sprynet.com> writes:
> As a beginner, I'm working through the O'Reilly books mentioned in an
> earlier posting, but I haven't found much guidance on parsing an HTML file
> once I've pulled it down with httplib.  And I'm finding the Python Library
> Reference to be a bit cryptic.   Could someone point to resources or provide
> examples?

If possible, use Perl and HTML::Parser (or HTML::LinkExtor, if all you
want is the links in the page) instead.  Python doesn't yet have
anything nearly as good.
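
If you do have to stay in Python, here is a minimal sketch of link
extraction, roughly the job HTML::LinkExtor does. It assumes the
standard library's html.parser module as it exists in Python 3, and it
assumes the page has already been decoded to a string. The names
LinkExtractor and extract_links are made up for illustration; treat
this as a starting point, not a robust solution.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect href values from <a> tags (hypothetical helper class)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag == "a":
            for name, value in attrs:
                # value is None for attributes written without a value.
                if name == "href" and value is not None:
                    self.links.append(value)


def extract_links(html_text):
    """Return the href values of all <a> tags in html_text, in order."""
    parser = LinkExtractor()
    parser.feed(html_text)
    parser.close()
    return parser.links
```

Calling extract_links() on the text of a fetched page returns the href
strings in document order. Relative links come back as written, so
resolving them against the page's URL is left to the caller.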



