how can i use lxml with win32com?

User motoom at xs4all.nl
Sun Oct 25 02:56:45 EDT 2009


On 25 Oct 2009, at 07:45 , elca wrote:

> i want to make web scraper.
> if possible i really want to make it work together with  
> beautifulsoup or
> lxml with PAMIE.

Scraping information from webpages falls apart in two tasks:

1. Getting the HTML data
2. Extracting information from the HTML data

It looks like you want to use Internet Explorer for getting the HTML  
data; is there any reason you can't use a simpler approach like using  
urllib2.urlopen()?

Once you have the HTML data, you could feed it into BeautifulSoup or  
lxml.

Mixing up 1 and 2 into a single statement created some confusion for  
you, I think.

Greetings,



More information about the Python-list mailing list