Converting HTML to ASCII

William Park opengeometry at yahoo.ca
Thu Feb 24 22:36:54 EST 2005


gf gf <unknownsoldier93 at yahoo.com> wrote:
> Hi.  I'm looking for a Python lib to convert HTML to
> ASCII.  Of course, a quick Google search showed
> several options (although, I must say, fewer than I
> would expect, considering how easy this is to do in
> *other* languages... :| ), but, I have 2 requirements,
> which none of them seem to meet:
> 
> 1) Be able to handle badly formed, or illegal, HTML,
> as best as possible.  Some of the converters I tried
> ended up dying on a weird character (that is, a
> high-ASCII char).  Others somehow got confused and
> dumped the JavaScript as well.
> 
> 2) Not embellish the text in any way - no asterisks,
> no bracket links, no __ for underlines.
> 
> Can anyone direct me to something which could help me
> with this?

man lynx
man links
man w3m
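
If shelling out to one of those text browsers is not an option, here is a
rough sketch of the same idea in pure Python, assuming a modern Python 3
and its standard html.parser module.  The names TextExtractor and
html_to_ascii are made up for illustration, and the choice of which tags
count as "block" tags is my own guess, not anything the browsers above
do.  It skips script/style content, adds no decoration, and replaces
non-ASCII characters with "?" instead of dying on them.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, ignoring <script> and <style> content."""

    SKIP = {"script", "style"}
    # Assumed set of tags that should start a new line in the output.
    BLOCK = {"p", "br", "div", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in self.SKIP:
            if self.skip_depth:
                self.skip_depth -= 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_data(self, data):
        # Text inside <script>/<style> is dropped.
        if not self.skip_depth:
            self.parts.append(data)


def html_to_ascii(html):
    """Return plain ASCII text for an HTML string, tolerating bad markup."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    # Collapse runs of whitespace within each line and drop empty lines.
    lines = (" ".join(line.split()) for line in "".join(parser.parts).split("\n"))
    text = "\n".join(line for line in lines if line)
    # Non-ASCII characters become "?" rather than raising an error.
    return text.encode("ascii", "replace").decode("ascii")


if __name__ == "__main__":
    print(html_to_ascii("<p>Hello <b>world</p><script>var x = 1;</script>"))
```

This does no layout at all (no tables, no indentation), so for anything
beyond plain running text the browsers' dump modes will likely give
nicer output.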

-- 
William Park <opengeometry at yahoo.ca>, Toronto, Canada
Slackware Linux -- because I can type.
