[Python-Dev] Integrate BeautifulSoup into stdlib?

Ivan Krstić krstic at solarsail.hcs.harvard.edu
Wed Mar 4 22:17:52 CET 2009


On Mar 4, 2009, at 12:32 PM, James Y Knight wrote:
> I think html5lib would be a better candidate for an imrpoved HTML  
> parser in the stdlib than BeautifulSoup.


While we're talking about alternatives, Ian Bicking appears to swear  
by lxml:

<http://blog.ianbicking.org/2008/12/10/lxml-an-underappreciated-web-scraping-library/ 
 >

Cheers,

--
Ivan Krstić <krstic at solarsail.hcs.harvard.edu> | http://radian.org



More information about the Python-Dev mailing list