Best HTML module?

Aahz Maruch aahz at netcom.com
Mon May 22 11:41:55 EDT 2000


In article <3.0.5.32.20000514125226.007d97c0 at earthlink.net>,
Robert Citek  <rwcitek at uci.edu> wrote:
>
>My question is similar to Jason's.  Instead of replacing certain tags, I
>would like to extract data from an HTML page.  The "Learning Python" book
>has an example on p.265 that uses the string module to find and get the
>information.  Using the string module and the re module for regular
>expressions, I can get the information I want but the code is becoming
>large, unsightly, and unwieldy.  Could htmllib be used to do the same
>thing?  Or is some other module better suited for extracting data from HTML
>pages (if so, which module)?

htmllib certainly *can* do this, but without information on what kind of
data you're trying to extract, I can't push you in the right direction.
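
As a rough illustration of the parser-based approach, here is a minimal
sketch that pulls link targets and link text out of a page. Several things
here are assumptions, not anything from the question: the choice of links
as the data of interest, the names LinkExtractor and extract_links (both
hypothetical), and the use of html.parser.HTMLParser, which stands in for
htmllib because htmllib was removed in Python 3. The idea is the same in
both: subclass the parser and override the tag and data handlers instead
of searching the raw text with string and re.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect (href, text) pairs for <a href=...> elements (hypothetical example)."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (href, text) pairs
        self._href = None    # href of the link we are currently inside, if any
        self._text = []      # text fragments seen inside that link

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; <a> tags without href are ignored
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside a link with an href
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Parse an HTML string and return the collected (href, text) pairs."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links
```

Calling extract_links(page_source) on a page's HTML string returns a list
of tuples. Extracting some other kind of data would mean watching for
different tags in the same three handlers.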
--
                      --- Aahz (Copyright 2000 by aahz at netcom.com)

Androgynous poly kinky vanilla queer het    <*>     http://www.rahul.net/aahz/
Hugs and backrubs -- I break Rule 6

"Not everything in life has a clue in front of it...."  --JMS


