Trying to make a spider using mechanize

James Mills prologic at shortcircuit.net.au
Mon Sep 8 17:47:26 EDT 2008


Hi,

Perhaps you might want to
try out using a sample spider
I wrote and base your code of
this ?

See: http://hg.shortcircuit.net.au/index.wsgi/pymills/file/b9936ae2525c/examples/spider.py

cheers
James

On Tue, Sep 9, 2008 at 2:24 AM, tedpottel at gmail.com <tedpottel at gmail.com> wrote:
> Hi,
>
> I can read the home page using the mechanize lib.  Is there a way to
> load in web pages using filename.html instad of servername/
> filename.html.  Lots of time the links just have the file name.  I'm
> trying to read in the links name and then vsit those pages.
>
> here is the sample code I am ussing.
>
>
> import ClientForm
> import mechanize
>
>
> #get home page
> request = mechanize.Request("http://www.activetechconsulting.com")
> response = mechanize.urlopen(request)
> print response.read()
>
> #sub page (this does note work)
> request = mechanize.Request("service.html")
> response = mechanize.urlopen(request)
> print response.read-Ted
> --
> http://mail.python.org/mailman/listinfo/python-list
>



-- 
--
-- "Problems are solved by method"



More information about the Python-list mailing list