beautifulsoup VS lxml
Michael Torrie
torriem at gmail.com
Thu Dec 11 21:19:21 EST 2014
On 12/11/2014 07:02 PM, iMath wrote:
>
> which is more easy and elegant for pulling data out of HTML?
Beautiful Soup is specialized for HTML parsing, and it can deal with
badly formed HTML. If I recall correctly, Beautiful Soup can also use the
lxml engine under the hood, so it may be the way to go for you, as it
gives you the most flexibility. It certainly has a good API that's easy
to use for data scraping. Try it and see if it's acceptable.
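
For what it's worth, here is a minimal sketch of that combination, assuming
the bs4 package is installed. The sample HTML and the extract_links() helper
are made up for illustration. The parser name "lxml" only works when lxml is
installed, so the sketch falls back to the standard library's "html.parser"
otherwise.

```python
import importlib.util

from bs4 import BeautifulSoup

# Malformed on purpose: the <li> and <p> tags are never closed.
HTML = """
<html><body>
<ul>
  <li><a href="/a">First</a>
  <li><a href="/b">Second</a>
</ul>
<p>Trailing text
</body></html>
"""

# Use lxml as the backend when it is installed; otherwise use the
# parser that ships with Python.
PARSER = "lxml" if importlib.util.find_spec("lxml") else "html.parser"


def extract_links(html, parser=PARSER):
    """Return (text, href) pairs for every <a> tag that has an href."""
    soup = BeautifulSoup(html, parser)
    return [
        (a.get_text(strip=True), a["href"])
        for a in soup.find_all("a", href=True)
    ]


if __name__ == "__main__":
    for text, href in extract_links(HTML):
        print(text, href)
```

The calling code is the same whichever parser ends up being used, which is
the flexibility I mean: you can swap the backend without touching the
scraping logic.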