Scraping Wikipedia with Python

Dotan Cohen dotancohen at gmail.com
Tue Aug 11 13:33:39 EDT 2009


I plan on making a geography-learning Anki [1] deck, and Wikipedia has
the information that I need in nicely formatted tables on the side of
each country's page. Has someone already invented a wheel to scrape
(parse and store) that data? It is probably not difficult to code, and
reuse is permitted under the Wikipedia license, but if that wheel has
already been invented then I don't want to redo it. I tried googling
for a Wikipedia-specific solution but found none. Is there a
general-purpose solution that I could use?
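
For concreteness, the kind of parsing involved could look roughly like
the sketch below. It is only an illustration, not a tested scraper: it
assumes the side table is an HTML table carrying the class "infobox",
with the label of each row in a th cell and the value in a td cell. The
names InfoboxParser, parse_infobox and SAMPLE are made up for the
example, and SAMPLE is invented markup, not a real Wikipedia page.
Fetching the page is left out on purpose.

```python
from html.parser import HTMLParser


class InfoboxParser(HTMLParser):
    """Collect (label, value) pairs from a table with class 'infobox'.

    Assumption: each useful row has its label in <th> and its value in <td>.
    """

    def __init__(self):
        super().__init__()
        self.rows = []      # finished (label, value) pairs
        self._depth = 0     # table nesting depth, counted from the infobox
        self._cell = None   # "th" or "td" while inside a cell
        self._label = []
        self._value = []

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if tag == "table":
            # Start counting at the infobox; also count tables nested in it.
            if self._depth or "infobox" in classes:
                self._depth += 1
        elif self._depth == 1:
            if tag == "tr":
                self._label, self._value = [], []
            elif tag in ("th", "td"):
                self._cell = tag

    def handle_endtag(self, tag):
        if not self._depth:
            return
        if tag == "table":
            self._depth -= 1
        elif self._depth == 1:
            if tag in ("th", "td"):
                self._cell = None
            elif tag == "tr":
                # Join the text fragments and collapse runs of whitespace.
                label = " ".join("".join(self._label).split())
                value = " ".join("".join(self._value).split())
                if label and value:
                    self.rows.append((label, value))

    def handle_data(self, data):
        # Text inside nested tables (depth > 1) is ignored.
        if self._depth == 1:
            if self._cell == "th":
                self._label.append(data)
            elif self._cell == "td":
                self._value.append(data)


def parse_infobox(html):
    """Return the infobox rows of an HTML document as a dict."""
    parser = InfoboxParser()
    parser.feed(html)
    return dict(parser.rows)


# Invented markup standing in for a country page.
SAMPLE = """
<table class="infobox geography">
  <tr><th>Capital</th><td>Exampleville</td></tr>
  <tr><th>Currency</th><td>Example dollar (<a href="#">EXD</a>)</td></tr>
  <tr><td colspan="2">Map image</td></tr>
</table>
<table><tr><th>Ignored</th><td>row</td></tr></table>
"""

facts = parse_infobox(SAMPLE)
for label, value in facts.items():
    print(label, value, sep="\t")
```

Rows without both a label and a value (such as the map row in the
sample) are skipped, and tables without the "infobox" class are
ignored.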

Note that I am a regular Wikipedia contributor and plan on staying
within the realm of Wikipedia's rules.


[1] http://ichi2.net/anki/

-- 
Dotan Cohen

http://what-is-what.com
http://gibberish.co.il


