beautifulsoup VS lxml
iMath
redstone-cold at 163.com
Fri Dec 12 09:25:28 EST 2014
在 2014年12月12日星期五UTC+8上午10时19分56秒,Michael Torrie写道:
> On 12/11/2014 07:02 PM, iMath wrote:
> >
> > which is more easy and elegant for pulling data out of HTML?
>
> Beautiful Soup is specialized for HTML parsing, and it can deal with
> badly formed HTML, but if I recall correctly BeautifulSoup can use the
> lxml engine under the hood, so maybe it's the way to go for you, is it
> gives you the most flexibility. It certainly has a good API that's easy
> to use for data scraping. Try it and see if it's acceptable.
tried it, very elegant and Pythonic.
thank you very much !!!
More information about the Python-list
mailing list