Using regular expressions in internet searches

Diez B. Roggisch deets at web.de
Sun Jul 3 14:18:23 EDT 2005


mike.ceravolo at gmail.com wrote:
> What is the best way to use regular expressions to extract information
> from the internet if one wants to search multiple pages? Let's say I
> want to search all of www.cnn.com and get a list of all the words that
> follow "Michael."
> 
> (1) Is Python the best language for this? (Plus is it time-efficient?)
> Is there already a search engine that can do this?
> 
> (2) How can I search multiple web pages within a single location or
> path?

You'd probably be better off using htdig.
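
If you still want to try the regular-expression part in Python, here is a minimal sketch of it. It assumes the page text has already been downloaded into plain strings (crawling the site is a separate problem), and the names words_after_michael and count_across_pages are made up for illustration, not taken from any library.

```python
import re
from collections import Counter

# Matches "Michael" as a whole word, followed by whitespace and the next word.
# The next word is captured in group 1.
FOLLOWS_MICHAEL = re.compile(r"\bMichael\s+(\w+)")


def words_after_michael(text):
    """Return every word that directly follows "Michael" in text."""
    return FOLLOWS_MICHAEL.findall(text)


def count_across_pages(pages):
    """Tally the following words over an iterable of page texts (strings)."""
    counts = Counter()
    for page in pages:
        counts.update(words_after_michael(page))
    return counts
```

Calling count_across_pages on a list of page strings gives a Counter keyed by the word after "Michael". Real HTML would need its tags stripped first, which this sketch does not attempt.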

Diez


