[Python-Dev] Integrate BeautifulSoup into stdlib?

Brett Cannon brett at python.org
Mon Mar 2 15:21:57 CET 2009


On Mon, Mar 2, 2009 at 04:23, Vaibhav Mallya <vaibhavmallya at gmail.com> wrote:

> I haven't seen a lot of discussion on this - maybe I didn't search hard
> enough - but what are people's thoughts on including BeautifulSoup in
> stdlib? It's small, fast, and pretty widely-liked by the people who know
> about it. Someone mentioned that web scraping needs are infrequent. My
> argument is that people ask questions about them less because they feel they
> can just reinvent the wheel really easily using urllib and regexes. It seems
> like this is similar to the CSV problem from a while back actually, with
> everyone implementing their own parsers.
>
> We do have HTMLParser, but that doesn't handle malformed pages well, and
> just isn't as nice as BeautifulSoup.
>
> In a not-entirely-unrelated vein, has there been any discussion on just
> throwing all of Mechanize into stdlib?


Discussions of including modules in the standard library only occur when
the module creators step forward to offer to support the modules. To my
knowledge, neither the creators of BeautifulSoup nor those of Mechanize have
come forward to offer to manage the code in Python's standard library.

-Brett