how can i use lxml with win32com?
User
motoom at xs4all.nl
Sun Oct 25 02:56:45 EDT 2009
On 25 Oct 2009, at 07:45 , elca wrote:
> i want to make web scraper.
> if possible i really want to make it work together with
> beautifulsoup or
> lxml with PAMIE.
Scraping information from webpages falls apart in two tasks:
1. Getting the HTML data
2. Extracting information from the HTML data
It looks like you want to use Internet Explorer for getting the HTML
data; is there any reason you can't use a simpler approach like using
urllib2.urlopen()?
Once you have the HTML data, you could feed it into BeautifulSoup or
lxml.
Mixing up 1 and 2 into a single statement created some confusion for
you, I think.
Greetings,
More information about the Python-list
mailing list