Parsing HTML

Anders Eriksson anders.eriksson at morateknikutveckling.se
Thu Sep 23 02:42:08 EDT 2004


Hello!

I want to extract some information from some specific HTML pages: Microsoft's
International Word Lists (e.g.
http://msdn.microsoft.com/library/en-us/dnwue/html/swe_word_list.htm). I
want to take all the words, both the English ones and those in the other
language, and create a dictionary, so that I can look up "About" and get
"Om" as the answer.
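
As a minimal sketch of the goal, here is one possible approach using only
the standard library's html.parser. It assumes that each word pair sits in
one table row, with the English term in the first cell and the translation
in the second; the real page layout has not been checked and may differ.
The names WordListParser and build_dictionary are hypothetical, and the
sample markup is made up for illustration.

```python
from html.parser import HTMLParser


class WordListParser(HTMLParser):
    """Collect English -> translation pairs from two-column table rows.

    Assumption: one word pair per <tr>, English in the first <td>,
    translation in the second. Adjust if the real page is laid out
    differently.
    """

    def __init__(self):
        super().__init__()
        self.words = {}
        self._row = None   # cell texts of the row being read, or None
        self._cell = None  # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
            self._cell = None
        elif tag == "td" and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._cell is not None:
            # Join the fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            # Keep only rows that have a non-empty first cell and a second cell.
            if len(self._row) >= 2 and self._row[0]:
                self.words[self._row[0]] = self._row[1]
            self._row = None
            self._cell = None


def build_dictionary(html_text):
    """Return a dict mapping each English term to its translation."""
    parser = WordListParser()
    parser.feed(html_text)
    parser.close()
    return parser.words


# Made-up sample in the assumed layout, not the actual page content.
sample = """
<table>
  <tr><td>About</td><td>Om</td></tr>
  <tr><td>Cancel</td><td>Avbryt</td></tr>
</table>
"""
words = build_dictionary(sample)
print(words["About"])
```

For the real page, the HTML text would first have to be downloaded (for
example with urllib.request) and decoded with the page's character
encoding before being passed to build_dictionary.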

What is the best way to do this?

Please help!

// Anders



More information about the Python-list mailing list