Best HTML module?
Aahz Maruch
aahz at netcom.com
Mon May 22 11:41:55 EDT 2000
In article <3.0.5.32.20000514125226.007d97c0 at earthlink.net>,
Robert Citek <rwcitek at uci.edu> wrote:
>
>My question is similar to Jason's. Instead of replacing certain tags, I
>would like to extract data from an HTML page. The "Learning Python" book
>has an example on p.265 that uses the string module to find and get the
>information. Using the string module and the re module for regular
>expressions, I can get the information I want but the code is becoming
>large, unsightly, and unwieldy. Could htmllib be used to do the same
>thing? Or is some other module better suited for extracting data from HTML
>pages (if so, which module)?
htmllib certainly *can* do this, but without information on what kind of
data you're trying to extract, I can't push you in the right direction.
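
As a rough illustration of the parser-based approach, here is a minimal
sketch that pulls link targets and their text out of a page. It assumes
Python 3, where htmllib no longer exists, and so uses html.parser.HTMLParser
from the standard library instead. The class name LinkExtractor and the
sample markup are hypothetical, made up for this example; the kind of data
you actually want will decide which handler methods you need to override.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect (href, text) pairs for each <a href=...> element.

    Hypothetical example class; not part of the standard library.
    """

    def __init__(self):
        super().__init__()
        self.links = []      # finished (href, text) pairs
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text chunks seen inside that <a>

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; tag names arrive lowercased.
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an open <a href=...>.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


# Made-up sample input for the sketch.
sample = (
    '<p>See <a href="http://www.python.org/">the Python site</a> '
    'and <a href="/doc/">docs</a>.</p>'
)

parser = LinkExtractor()
parser.feed(sample)
parser.close()
print(parser.links)
```

The point of the design is that the parser tracks the tag structure for
you, so the extraction logic stays in three short handlers rather than
growing into a pile of string searches and regular expressions.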
--
--- Aahz (Copyright 2000 by aahz at netcom.com)
Androgynous poly kinky vanilla queer het <*> http://www.rahul.net/aahz/
Hugs and backrubs -- I break Rule 6
"Not everything in life has a clue in front of it...." --JMS