Regex for URL extracting

Nikita the Spider NikitaTheSpider at gmail.com
Wed Jan 24 11:55:35 EST 2007


In article <1169655624.981833.29730 at j27g2000cwj.googlegroups.com>,
 "Johny" <python at hope.cz> wrote:

> Does anyone know about a good regular expression  for URL extracting?

Extracting URLs from what?

If it is HTML, then I'd look at an existing HTML parsing module such as
Beautiful Soup or Barnes' HTMLData rather than writing a regex.
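
For what it's worth, here is a rough sketch of the parser-based approach.
It uses the standard library's html.parser rather than either module
named above, so take it as an illustration of the idea and not as how
those modules work. The names LinkCollector and extract_urls, and the
sample markup with its example.com URL, are made up for the example.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href/src attribute values seen while parsing HTML.

    Hypothetical helper for illustration; not part of any library.
    """

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value can be None
        # for attributes written without a value.
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.urls.append(value)


def extract_urls(html):
    """Return the href/src values found in the given HTML string."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return collector.urls


# Assumed sample input, just to show the call.
sample = '<p><a href="http://example.com/">home</a> <img src="/logo.png"></p>'
print(extract_urls(sample))
```

This only picks up URLs that sit in href and src attributes. Relative
links come back as written, so you would probably want
urllib.parse.urljoin with the page's base URL if you need absolute ones.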

-- 
Philip
http://NikitaTheSpider.com/
Whole-site HTML validation, link checking and more


