Modules for inclusion in standard library?

Ivan Van Laningham ivanlan at pauahtun.org
Fri Jul 1 09:07:57 EDT 2005


Hi All--

Max M wrote:
> 
> Another good bet is BeautifulSoup, which is absolutely great for
> scraping content from webpages.
> 
> http://crummy.com/software/BeautifulSoup/index.html
> 

Not if you want to handle HTML in anything but ASCII.  BeautifulSoup
insists that you edit your site.py to change Python's default encoding
before it will handle non-ASCII text.  It might work beautifully, but I
won't use it, at least not until it's fixed to understand encodings.
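
To illustrate what understanding encodings could look like without
touching site.py, here is a minimal sketch that decodes page bytes with an
explicitly named encoding.  It is not BeautifulSoup's API: the helper name
decode_html and the sample bytes are hypothetical, and the code assumes
Python 3.

```python
# Hypothetical sample: a small HTML fragment encoded as Latin-1 bytes.
raw = "<p>caf\u00e9</p>".encode("latin-1")


def decode_html(data, encoding):
    """Decode raw bytes using an explicitly named encoding.

    Illustrative helper only; the name is an assumption, not a library call.
    """
    return data.decode(encoding)


# Decoding as ASCII fails, because the byte 0xE9 is outside the ASCII range.
try:
    raw.decode("ascii")
    ascii_failed = False
except UnicodeDecodeError:
    ascii_failed = True

# Naming the encoding per document works without any process-wide setting.
text = decode_html(raw, "latin-1")
print(text)
```

The point of the sketch is that the encoding is passed per document, so
nothing global has to change.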

Metta,
Ivan
----------------------------------------------
Ivan Van Laningham
God N Locomotive Works
http://www.andi-holmes.com/
http://www.foretec.com/python/workshops/1998-11/proceedings.html
Army Signal Corps:  Cu Chi, Class of '70
Author:  Teach Yourself Python in 24 Hours


