[Tutor] Print images links

Noah Hall enalicho at gmail.com
Tue Apr 12 18:57:03 CEST 2011


On Tue, Apr 12, 2011 at 5:40 PM,  <l.leichtnam at gmail.com> wrote:
> Hello,
>
> I would like to print the addresses of all the images on the main page of www.columbia.edu. Do you know how to do this?


Yes, I do. ;)

As this sounds like homework, here are some hints rather than a full answer:
Firstly, you want to grab the source code of the webpage.
Secondly, you want to find what defines each image link. Here's a
clue: think of the HTML markup for images.
Thirdly, use a regex to extract the link and either print it directly,
or put it into something you can reuse later.
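
For anyone reading this in the archive later, the three steps above can
be sketched roughly as follows. This is only one possible reading of the
hints, not a reference solution: it assumes Python 3 and the standard
library only, the names IMG_SRC, extract_image_links, fetch and
print_image_links are made up for illustration, and the regex only
handles quoted src attributes, so it will miss unusual markup.

```python
import re
from urllib.parse import urljoin
from urllib.request import urlopen

# Assumption: images are marked up as <img ... src="..."> with a quoted src.
# Unquoted attributes or images set via CSS/JavaScript are not matched.
IMG_SRC = re.compile(r'<img\b[^>]*?\ssrc\s*=\s*["\']([^"\']+)["\']',
                     re.IGNORECASE)


def extract_image_links(html, base_url=""):
    """Return the src of every <img> tag in html, resolved against base_url."""
    return [urljoin(base_url, src) for src in IMG_SRC.findall(html)]


def fetch(url):
    """Step 1: grab the source code of the page as text."""
    with urlopen(url) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def print_image_links(url):
    """Steps 2 and 3: find the image links and print them one per line."""
    for link in extract_image_links(fetch(url), url):
        print(link)
```

Calling print_image_links("http://www.columbia.edu/") would print one
address per line; extract_image_links returns a list instead, if you
would rather keep the links for later use. A regex is fine for a quick
exercise like this, but a real HTML parser (html.parser in the standard
library, for instance) copes better with messy pages.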

Hope this helps.

