spidering script
Nikita the Spider
NikitaTheSpider at gmail.com
Sat Jan 20 15:41:18 EST 2007
In article <8N6dnSE2eO6QKDLYnZ2dnUVZ_uejnZ2d at fdn.com>,
"David Waizer" <dwaizer at noreply.com> wrote:
> Hello..
>
> I'm looking for a script (perl, python, sh...)or program (such as wget)
> that will help me get a list of ALL the links on a website.
>
> For example ./magicscript.pl www.yahoo.com and outputs it to a file, it
> would be kind of like a spidering software..
David,
In addition to others' suggestions about Beautiful Soup, you might also
want to look at the HTMLData module:
http://oregonstate.edu/~barnesc/htmldata/
--
Philip
http://NikitaTheSpider.com/
Whole-site HTML validation, link checking and more
More information about the Python-list
mailing list