[Tutor] Pictures

Wayne Werner waynejwerner at gmail.com
Thu Apr 28 07:02:43 CEST 2011


On Wed, Apr 27, 2011 at 10:41 PM, <l.leichtnam at gmail.com> wrote:

> Hello,
>
> I'm still quite new at this but I'm trying to get a list of the pictures
> adress (... .jpg) of a page of a website.
>
> I thought of using the import urllib and import re, trying to fetch the
> website, parse it, and collect the adresses but I don't know how to do it...
>
> Can you help me?
>

You should take a look at the lxml and or BeautifulSoup modules (I recommend
lxml as it's still being developed on).

You also might search for "image scraping with python" in your favorite
search engine.

HTH,
Wayne
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/tutor/attachments/20110428/a4f8db7f/attachment.html>


More information about the Tutor mailing list