beautifulsoup VS lxml

iMath redstone-cold at 163.com
Fri Dec 12 09:25:28 EST 2014


在 2014年12月12日星期五UTC+8上午10时19分56秒,Michael Torrie写道:
> On 12/11/2014 07:02 PM, iMath wrote:
> > 
> > which is more easy and elegant for pulling data  out of HTML?
> 
> Beautiful Soup is specialized for HTML parsing, and it can deal with
> badly formed HTML, but if I recall correctly BeautifulSoup can use the
> lxml engine under the hood, so maybe it's the way to go for you, is it
> gives you the most flexibility.  It certainly has a good API that's easy
> to use for data scraping.  Try it and see if it's acceptable.

tried it, very elegant and Pythonic.
thank you very much !!!



More information about the Python-list mailing list