[Python-Dev] HTMLParser and HTML5

Brett Cannon brett at python.org
Fri Jul 29 22:31:48 CEST 2011


On Fri, Jul 29, 2011 at 11:31, Tres Seaver <tseaver at palladion.com> wrote:

> On 07/29/2011 07:22 AM, Joao S. O. Bueno wrote:
>
> > I disagree. Having proper html parsing out of the box is part of
> > the "batteries included" thing. And it is not a matter of "having
> > html 5" - as stated on this thread, fixing it for html5 will fix it
> > for html that exists in the "real world".
> >
> > Python _has_ to work for quick 30-50 line scripts deliverable
> > everywhere, not just have proper 3rd-party libraries that can work as
> > part of a huge project using buildout.
>
> Assuming it were merged today, that parser would only be available on
> Python 3.3 and later:  how is that "everywhere"?


Well, "everywhere, eventually". This comes down to the usual philosophical
debate over what should (or should not) be in the stdlib: giving those who
face strict restrictions on third-party code access to useful libraries,
while balancing that against the desire of those who want to keep the stdlib
lean or avoid stagnating a module's API.


>  Having scripts that
> work against html5lib (which *doesn't* need buildout to install, or even
> setuptools) makes them portable to any version of Python supported by
> the library (Python 2.3+, AFAICT).
>

If the library were brought in, those scripts could probably remain portable
with just the addition of a try/except around the import line.
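
A minimal sketch of that guarded import, assuming the stdlib copy would live
under a different name than the third-party package. The helper import_first
and the module name "html5" are made up for illustration; only html5lib is a
real package, and no stdlib module by that other name exists.

```python
import importlib


def import_first(*names):
    """Return the first module in `names` that can be imported."""
    for name in names:
        try:
            return importlib.import_module(name)
        except ImportError:
            # Not available on this Python; try the next candidate.
            continue
    raise ImportError("none of these could be imported: " + ", ".join(names))


def load_html5_parser():
    # "html5" is a hypothetical stdlib name; html5lib is the existing
    # third-party package a script would fall back to on older Pythons.
    return import_first("html5", "html5lib")
```

A script would call load_html5_parser() once at startup and use whichever
module comes back, provided the two expose the same API.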

-Brett


>
>
> Tres.
> --
> ===================================================================
> Tres Seaver          +1 540-429-0999          tseaver at palladion.com
> Palladion Software   "Excellence by Design"    http://palladion.com