How *extract* data from XHTML Transitional web pages? got xml.dom.minidom troubles..

seberino at spawar.navy.mil seberino at spawar.navy.mil
Fri Mar 2 18:32:58 EST 2007


I'm trying to extract some data from an XHTML Transitional web page.

What is best way to do this?

xml.dom.minidom.parseString("text of web page") gives errors about it
not being well formed XML.

Do I just need to add something like <?xml ...?> or what?

Chris




More information about the Python-list mailing list