beautifulsoup VS lxml

Michael Torrie torriem at gmail.com
Thu Dec 11 21:19:21 EST 2014


On 12/11/2014 07:02 PM, iMath wrote:
> 
> which is more easy and elegant for pulling data  out of HTML?

Beautiful Soup is specialized for HTML parsing, and it can deal with
badly formed HTML, but if I recall correctly BeautifulSoup can use the
lxml engine under the hood, so maybe it's the way to go for you, is it
gives you the most flexibility.  It certainly has a good API that's easy
to use for data scraping.  Try it and see if it's acceptable.




More information about the Python-list mailing list