help!! *extra* tricky web page to extract data from...

Max Erickson maxerickson at gmail.com
Tue Mar 13 17:41:33 EDT 2007


"seberino at spawar.navy.mil" <seberino at spawar.navy.mil> wrote:

> How extract the visible numerical data from this Microsoft
> financial web site?
> 
> http://tinyurl.com/yw2w4h
> 
> If you simply download the HTML file you'll see the data is *not*
> embedded in it but loaded from some other file.
> 
> Surely if I can see the data in my browser I can grab it somehow
> right in a Python script?
> 
> Any help greatly appreciated.
> 
> Sincerely,
> 
> Chris
> 

The url for the data is in an iframe. If you need to scrape the 
original page for some reason(instead of iframe url directly), you can 
use urlparse.urljoin to resolve the relative url.


max




More information about the Python-list mailing list