When to use mechanize and Windmill library during WebScraping ?

Raji Seetharaman sraji.me at gmail.com
Thu Dec 17 08:50:36 EST 2009


>
> ---------- Forwarded message ----------
> From: Javier Collado <javier.collado at gmail.com>
> To: Raji Seetharaman <sraji.me at gmail.com>
> Date: Sat, 12 Dec 2009 12:52:27 +0100
> Subject: Re: When to use mechanize and Windmill library during WebScraping
> ?
> Hello,
>
> If a script that uses mechanize fails to find an HTML node that you
> identified with Firebug, the node was probably generated by JavaScript
> after the page loaded (provided that the expression used to select the
> node is correct). mechanize does not execute JavaScript, so it only
> sees the HTML that the server sent.
>
> To verify this, download the HTML page and open it in your favourite
> editor. If some of the nodes that you can see in your browser are
> missing or empty, then one of the JavaScript scripts in the page
> probably created or populated them.
>
> If you're in doubt, try mechanize first and, if you run into problems
> such as those described above, move to Windmill or some other tool
> that executes JavaScript code before trying to get the desired data.
>
> Best regards,
>    Javier
>
>
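
The check described above (download the raw HTML and see whether the
node is missing or empty) could be automated roughly as follows. This
is only a sketch using the standard library: the names classify_node,
fetch_static_html and the "results" id are made up for illustration,
and the parser assumes reasonably well-formed HTML with explicit
closing tags.

```python
from html.parser import HTMLParser
from urllib.request import urlopen

# Elements that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class _IdProbe(HTMLParser):
    """Records whether an element with a given id exists and holds any text."""

    def __init__(self, element_id):
        super().__init__()
        self.element_id = element_id
        self.found = False
        self.text = []
        self._depth = 0  # > 0 while we are inside the target element

    def handle_starttag(self, tag, attrs):
        is_target = dict(attrs).get("id") == self.element_id
        if tag in VOID_TAGS:
            # A void element can match the id but can never contain text.
            if is_target and not self._depth:
                self.found = True
            return
        if self._depth:
            self._depth += 1
        elif is_target:
            self.found = True
            self._depth = 1

    def handle_endtag(self, tag):
        if self._depth and tag not in VOID_TAGS:
            self._depth -= 1

    def handle_data(self, data):
        if self._depth:
            self.text.append(data)


def classify_node(html, element_id):
    """Return 'missing', 'empty' or 'present' for the element in static HTML."""
    probe = _IdProbe(element_id)
    probe.feed(html)
    probe.close()
    if not probe.found:
        return "missing"
    if not "".join(probe.text).strip():
        return "empty"
    return "present"


def fetch_static_html(url):
    """Download a page without running any JavaScript."""
    with urlopen(url) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")
```

If classify_node(fetch_static_html(url), "results") reports "missing"
or "empty" for a node that Firebug shows as populated, the content is
most likely filled in by JavaScript, and a tool that runs a real
browser is the better fit.
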
Thanks for your help.

Raji. S

