[Baypiggies] replacement for urllib2 that can handle xhtml

ST1999 st1999 at gmail.com
Fri Dec 31 21:03:32 CET 2010


lxml is *much* faster than BeautifulSoup (as I recall, a speaker at PyCon
2009 suggested it was 20 to 30 times faster) and should be used unless
there is a compelling reason not to. Also, I'm not sure how actively
BeautifulSoup is being maintained at this point.
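
For what it's worth, here is a minimal sketch of pulling links out of an
XHTML document with lxml. It assumes the input is well-formed XHTML (so a
plain XML parse works); the sample document and the extract_links name are
made up for illustration. The fallback import is only there so the snippet
still runs where lxml isn't installed, since the stdlib ElementTree exposes
a similar API for this much.

```python
try:
    from lxml import etree  # third-party; assumed to be installed
except ImportError:
    # Stdlib fallback with a compatible API for the calls used below.
    import xml.etree.ElementTree as etree

# XHTML elements live in this namespace, so tag lookups must include it.
XHTML_NS = "http://www.w3.org/1999/xhtml"

# Hypothetical sample document, passed as bytes because it carries an
# XML declaration with an encoding.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Example</title></head>
  <body>
    <p>See <a href="http://mail.python.org/mailman/listinfo/baypiggies">the list</a>
    and <a href="http://www.python.org/">python.org</a>.</p>
  </body>
</html>"""


def extract_links(xhtml_bytes):
    """Return the href of every <a> element, in document order."""
    root = etree.fromstring(xhtml_bytes)
    return [a.get("href") for a in root.iter("{%s}a" % XHTML_NS)]


if __name__ == "__main__":
    for href in extract_links(SAMPLE):
        print(href)
```

If the markup is not well-formed, a strict XML parse like this will raise
an error, and a more tolerant HTML parser would be the better fit.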

- Shailen Tuli

On Thu, Dec 30, 2010 at 10:22 PM, Charles Merriam <charles.merriam at gmail.com> wrote:

> This shows up on the mailing list every now and then.
>
> lxml is faster, more tolerant, etc., than Beautiful Soup and the built-in
> parsers.
>
>
> Enjoy
>
> On Thu, Dec 30, 2010 at 7:46 PM, Bill Janssen <janssen at parc.com> wrote:
> >
> > BeautifulSoup does xhtml, too.
> >
> > Bill
> >
> > Tony Cappellini <cappy2112 at gmail.com> wrote:
> >
> > > What's the best module/package for parsing xhtml?
> > > HTMLParser is built in, but is there another package which is more
> > > like urllib2 or Beautiful Soup, but handles xhtml?
> > >
> > > thanks
> > > _______________________________________________
> > > Baypiggies mailing list
> > > Baypiggies at python.org
> > > To change your subscription options or unsubscribe:
> > > http://mail.python.org/mailman/listinfo/baypiggies