Converting HTML to ASCII
William Park
opengeometry at yahoo.ca
Thu Feb 24 22:36:54 EST 2005
gf gf <unknownsoldier93 at yahoo.com> wrote:
> Hi. I'm looking for a Python lib to convert HTML to
> ASCII. Of course, a quick Google search showed
> several options (although, I must say, less than I
> would expect, considering how easy this is to do in
> *other* languages... :| ), but, I have 2 requirements,
> which none of them seem to meet:
>
> 1) Be able to handle badly formed, or illegal, HTML,
> as best as possible. Some of the converters I tried
> ended up dying on a weird character (that is, a high
> ascii char). Others somehow got confused and dumped
> the JavaScript as well.
>
> 2) Not embellish the text in any way - no asterisks,
> no bracket links, no __ for underlines.
>
> Can anyone direct me to something which could help me
> for this?
man lynx
man links
man w3m
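
If calling one of those browsers from Python is not an option, something along these lines may be enough. It is only a rough sketch using the standard library's html.parser, not a tested converter: the names TextExtractor, html_to_text and ascii_only are made up for illustration, the list of block-level tags is an assumption, and all whitespace is collapsed to single spaces. It skips script and style contents, adds no markup of its own, and tolerates unclosed tags.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, ignoring <script> and <style> contents."""

    SKIP = {"script", "style"}
    # Assumed set of tags that should separate words in the output.
    BLOCK = {"p", "br", "div", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in self.SKIP and self.skip_depth > 0:
            self.skip_depth -= 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_data(self, data):
        # Keep text only when we are outside script/style elements.
        if self.skip_depth == 0:
            self.parts.append(data)


def html_to_text(html, ascii_only=False):
    """Return the text of an HTML string with no added decoration."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()  # flush any text still buffered after the last tag
    text = " ".join("".join(parser.parts).split())
    if ascii_only:
        # Replace anything outside ASCII with "?" instead of failing.
        text = text.encode("ascii", "replace").decode("ascii")
    return text
```

Call html_to_text(page) on a string that has already been decoded; pass ascii_only=True if the output must be strictly ASCII.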
--
William Park <opengeometry at yahoo.ca>, Toronto, Canada
Slackware Linux -- because I can type.