downloading a link with javascript in it..
Jetus
stevegill7 at gmail.com
Tue May 13 20:12:31 EDT 2008
On May 12, 4:06 pm, "Diez B. Roggisch" <de... at nospam.web.de> wrote:
> Jetus schrieb:
>
>
>
> > I am able to download this page (enclosed code), but I then want to
> > download a pdf file that I can view in a regular browser by clicking
> > on the "view" link. I don't know how to automate this next part of my
> > script. It seems like it uses Javascript.
> > The line in the page source says
> > href="javascript:openimagewin('JCCOGetImage.jsp?
> > refnum=DN2007036179');" tabindex=-1>
>
> > So, in summary, when I download this page, for each record, I would
> > like to initiate the "view" link.
> > Can anyone point me in the right direction?
>
> > When the "view" link is clicked on in IE or Firefox, it returns a pdf
> > file, so I should be able to download it with
> > urllib.urlretrieve('pdffile, 'c:\temp\pdffile')
>
> > Here is the following code I have been using
> > ----------------------------------------------------------------
> > import urllib, urllib2
>
> > params = [
> > ('booktype', 'L'),
> > ('book', '930'),
> > ('page', ''),
> > ('hidPageName', 'S3Search'),
> > ('DoItButton', 'Search'),]
>
> > data = urllib.urlencode(params)
>
> > f = urllib2.urlopen("http://www.landrecords.jcc.ky.gov/records/
> > S3DataLKUP.jsp", data)
>
> > s = f.read()
> > f.close()
> > open('jcolib.html','w').write(s)
>
> Use something like the FireBug-extension to see what the
> openimagewin-function ultimately creates as reqest. Then issue that,
> parametrised from parsed information out of the above href.
>
> There is no way to interpret the JS in Python, let alone mimic possible
> browser dom behavior.
>
> Diez
Thanks Diez;
Never used Firebug, and could not find the http-header section, but it
lead me to Tamper Data, and that was perfect to give me the headers.
Thanks for the input.
More information about the Python-list
mailing list