getfirst and re

Victor Subervi victorsubervi at gmail.com
Wed Jan 6 12:31:41 EST 2010


On Wed, Jan 6, 2010 at 1:27 PM, Tim Chase <python.list at tim.thechases.com>wrote:

> But if you're using it on HTML form text, regexps are usually the wrong
> tool, and you should be using an HTML parser (such as BeautifulSoup) that
> knows how to handle odd text and escapings better and more robustly than
> regexps will.
>

I have an automatically generated HTML form from which I need to extract
data to the script which this form calls (to which the information is sent).
I believe BeautifulSoup is geared to scraping pages that exist permanently
on the web. By the time BeautifulSoup was called, this page would be gone.
Any other ideas?
TIA
beno
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-list/attachments/20100106/09da1f3c/attachment-0001.html>


More information about the Python-list mailing list