Converting HTML to ASCII

gf gf unknownsoldier93 at yahoo.com
Thu Feb 24 20:26:36 EST 2005


Hi.  I'm looking for a Python lib to convert HTML to
ASCII.  Of course, a quick Google search showed
several options (although, I must say, less than I
would expect, considering how easy this is to do in
*other* languages... :| ), but, I have 2 requirements,
which none of them seem to meet:

1) Be able to handle badly formed, or illegal, HTML,
as best as possible.  Some of the converters I tried
ended up dieing on a weird character (that is, a high
ascii char).  Others somehow got confused and dumped
the JavaScript as well.

2) Not embellish the text in any way - no asterisks,
no bracket links, no __ for underlines.

Can anyone direct me to something which could help me
for this?

--Thanks a mil.


		
__________________________________ 
Do you Yahoo!? 
Yahoo! Mail - Helps protect you from nasty viruses. 
http://promotions.yahoo.com/new_mail



More information about the Python-list mailing list