capturing stdout from lynx..

sergio at village-buzz.com sergio at village-buzz.com
Fri Mar 10 23:17:25 EST 2006


i have a huge database that contains large amounts of html that i need
to translate to ascii..

i have tried using html2text.py:

http://www.aaronsw.com/2002/html2text/

but i could not figure out how to import it and use it as a library
without getting errors everywhere..

so i decided to try using lynx with the -dump switch..

it works great from the command line, but i am having trouble capturing
the output into a python variable..

the only way i have figured out how to do it is:

s = subprocess(args='/sw/bin/lynx',stdout=subprocess.PIPE)

but i can't figure out how to send it the "-dump" or the
<filename.html> and retrieve the ouput..

any help would be appreciated..




More information about the Python-list mailing list