Scraping a web page

Lawrence D'Oliveiro ldo at geek-central.gen.new_zealand
Wed Apr 8 03:14:55 EDT 2009


In message <mailman.3480.1239112765.11746.python-list at python.org>, Support 
Desk wrote:

> You could do something like below to get the rendered page.
> 
> Import os
> site = 'website.com'
> X = os.popen('lynx --dump %s' % site).readlines()

I wonder how easy it would be to get the page image in SVG format? I believe 
the Gecko HTML engine in Firefox already uses Cairo for its rendering, and 
Cairo supports SVG as one of its surface types.




More information about the Python-list mailing list