Using Python 2.1 to download asp www pages

Zugz zugz.public at DEL-ete-MEbtinternet.com
Mon Jan 7 16:31:52 EST 2002


"Paul Rubin" <phr-n2002a at nightsong.com> wrote in message
news:7xofk6er0w.fsf at ruckus.brouhaha.com...
> It's just a matter of how the server side application works.
>
> If you got back an HTTP 3xx redirection response then you have to just
> have your client read the locator from it and open the new location.
>
> If you get back an HTML META HTTP-EQUIV tag with a redirection that's
> more or less the same thing and you have to check for that too.
>
> It gets more annoying if the page sends javascript that tries to
> navigate to another page.

Hi,

Thanks for the reply.

Well, I get this:
<head><title>Object moved</title></head>
<body><h1>Object Moved</h1>This object may be found <a
HREF="/messages/overview.asp?name=panther_xl&page=2">here</a>.</body>

With x.geturl() =
'http://boards.gamers.com/user/profiling/login/cookieread.asp?action=read&dest_url=%2Fmessages%2Foverview%2Easp%3Fname%3Dpanther%5Fxl%26page%3D2'
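
For what it's worth, the "here" link in that page can be pulled out and resolved against the geturl() value with nothing but the standard library. What follows is only a rough sketch, and it assumes a current Python 3 (html.parser and urllib.parse) rather than 2.1; the names FirstLinkParser and moved_target are made up for illustration, and it does not deal with any cookies the site may expect.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class FirstLinkParser(HTMLParser):
    """Hypothetical helper: remember the href of the first <a> tag seen."""

    def __init__(self):
        super().__init__()
        self.href = None

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so HREF arrives as "href".
        if tag == "a" and self.href is None:
            self.href = dict(attrs).get("href")


def moved_target(base_url, body):
    """Return the absolute URL an 'Object moved' page points at, or None."""
    parser = FirstLinkParser()
    parser.feed(body)
    if parser.href is None:
        return None
    # A root-relative href such as /messages/... is resolved against the host of base_url.
    return urljoin(base_url, parser.href)


# The page body and geturl() value quoted in this message.
body = ('<head><title>Object moved</title></head>\n'
        '<body><h1>Object Moved</h1>This object may be found <a\n'
        'HREF="/messages/overview.asp?name=panther_xl&page=2">here</a>.</body>')
base = ('http://boards.gamers.com/user/profiling/login/cookieread.asp'
        '?action=read&dest_url=%2Fmessages%2Foverview%2Easp'
        '%3Fname%3Dpanther%5Fxl%26page%3D2')

print(moved_target(base, body))
```

The result is the address to open next. If that request lands on the same "Object moved" page again, the server is probably waiting for a cookie, and following links alone will not be enough.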

> Basically, what you're trying to do (programmatically interpret pages
> that were intended for human interaction) is a pain in the neck and
> you will find yourself having to manually examine the html in the
> target pages and tweak your client for those specific pages, and
> continue tweaking it as the page author changes the format over time.

Now this is not such a problem. The task at hand is as much about learning
Python as anything, so the long-term viability of the program is not much of an
issue. It's almost fun to fiddle! I don't get to program during the day :(

If I'm going to have to fight Python then I may just fetch the pages with a
Windows macro tool instead. Those sound like they would do the job. But I
thought I'd go for a 100% Python solution first.

Regards,
Zugz.

