[pypy-dev] HTML Parser?

Amaury Forgeot d'Arc amauryfa at gmail.com
Wed Feb 20 19:02:14 CET 2013


2013/2/20 Joe Hillenbrand <joehillen at gmail.com>

> What is the recommended HTML parser to run in PyPy?
>
> The typical goto for Python is lxml, but of course that doesn't work with
> PyPy.
>

This is not true anymore. There has been a lot of work on both sides to
make lxml work with PyPy.
You should try with latest versions.

In addition, there is a port of lxml that does not use Cython nor the C API:
   https://github.com/amauryfa/lxml/tree/lxml-cffi
most of the tests are passing (except objectify), but "setup.py install"
does not work yet.
It works from the source tree, though.

-- 
Amaury Forgeot d'Arc
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/pypy-dev/attachments/20130220/d0336e2a/attachment.html>


More information about the pypy-dev mailing list