downloading a link with javascript in it..
Larry Bates
larry.bates at websafe.com`
Tue May 13 08:56:08 EDT 2008
Jetus wrote:
> I am able to download this page (enclosed code), but I then want to
> download a pdf file that I can view in a regular browser by clicking
> on the "view" link. I don't know how to automate this next part of my
> script. It seems like it uses Javascript.
> The line in the page source says
> href="javascript:openimagewin('JCCOGetImage.jsp?
> refnum=DN2007036179');" tabindex=-1>
>
> So, in summary, when I download this page, for each record, I would
> like to initiate the "view" link.
> Can anyone point me in the right direction?
>
> When the "view" link is clicked on in IE or Firefox, it returns a pdf
> file, so I should be able to download it with
> urllib.urlretrieve('pdffile, 'c:\temp\pdffile')
>
> Here is the following code I have been using
> ----------------------------------------------------------------
> import urllib, urllib2
>
> params = [
> ('booktype', 'L'),
> ('book', '930'),
> ('page', ''),
> ('hidPageName', 'S3Search'),
> ('DoItButton', 'Search'),]
>
> data = urllib.urlencode(params)
>
> f = urllib2.urlopen("http://www.landrecords.jcc.ky.gov/records/
> S3DataLKUP.jsp", data)
>
> s = f.read()
> f.close()
> open('jcolib.html','w').write(s)
>
You may want to take a look at mechanize, I'm having pretty good luck with using
it to do the types of things you describe.
http://wwwsearch.sourceforge.net/mechanize/
-Larry
More information about the Python-list
mailing list