Converting HTML to ASCII

Paul Rubin http
Sat Feb 26 05:36:31 EST 2005


Jorgen Grahn <jgrahn-nntq at algonet.se> writes:
> You should probably do what some other poster suggested -- download
> lynx or some other text-only browser and make your code execute it
> in -dump mode to get the text-formatted html. You'll get that
> working in an hour or so, and then you can see if you need something
> more complicated.

Lynx is pathetically slow for large files.  It seems to use a
quadratic algorithm for remembering where the links point, or
something.  I wrote a very crude but very fast renderer in C that I
can post if someone wants it, which is what I use for this purpose.



More information about the Python-list mailing list