[Web-SIG] Extracting web data

Deb Midya debmidya at yahoo.com
Tue Feb 22 00:59:58 CET 2011


Joost,
 
Thank you very much for your response.
 
I have found that there is no binary installer for lxml in the package index on python.org.
 
I am using Python 2.6 on Windows XP.
 
Is there any alternative solution?
 
Once again, thank you very much for the time you have given.
 
Regards,
 
Deb

--- On Mon, 21/2/11, Joost Molenaar <j.j.molenaar at gmail.com> wrote:


From: Joost Molenaar <j.j.molenaar at gmail.com>
Subject: Re: [Web-SIG] Extracting web data
To: "Deb Midya" <debmidya at yahoo.com>
Received: Monday, 21 February, 2011, 5:19 PM



You should look at lxml. It can parse HTML and XML, and it lets you use XPath to find the data you need.
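
As a rough illustration of that suggestion, here is a minimal sketch that parses an HTML string with lxml and selects table cells with XPath. The sample markup, the table id "prices", the numbers in it, and the helper name extract_rows are all made up for this example; a real page will have a different structure, and the XPath expression would need to be adapted to it.

```python
from lxml import html

# Hypothetical page fragment, invented for illustration only.
# The id "prices" and the values below are assumptions, not real data.
SAMPLE_PAGE = """
<html><body>
  <table id="prices">
    <tr><th>Date</th><th>Close</th></tr>
    <tr><td>2011-02-18</td><td>27.10</td></tr>
    <tr><td>2011-02-21</td><td>27.35</td></tr>
  </table>
</body></html>
"""


def extract_rows(page_source):
    """Return (date, close) tuples from the hypothetical 'prices' table."""
    tree = html.fromstring(page_source)
    rows = []
    # XPath: rows of the table with id="prices" that contain data cells,
    # which skips the header row (it only has <th> cells).
    for tr in tree.xpath('//table[@id="prices"]//tr[td]'):
        cells = [td.text_content().strip() for td in tr.xpath('./td')]
        rows.append((cells[0], float(cells[1])))
    return rows


if __name__ == "__main__":
    for date, close in extract_rows(SAMPLE_PAGE):
        print("%s %s" % (date, close))
```

In practice the string would come from a downloaded page rather than a literal, and the same function could be called once per company code.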
Joost Molenaar

On 21 Feb 2011 05:28, "Deb Midya" <debmidya at yahoo.com> wrote:






Hi Python web-sig users,
 
Thanks in advance. I am new to Web-SIG.
 
I am using Python 2.6 on Windows XP.
 
May I ask for your help with the following?
 
I would like to extract web data from a site (http://finance.yahoo.com, for example).
 
The data may include Historical Prices, Key Statistics, News & Info, Headlines, etc. for a list of codes (such as WOW, ....; these are company ID codes).
 
I am trying to automate the extraction of data.
 
Is there a Python module for this, or could anyone offer some assistance?
 
Once again, thank you very much for the time you have given.
 
Regards,
 
Deb
 
 





      