vBulletin scraper -- feasible?

Nick Cash nick.cash at npcinternational.com
Mon Jun 25 15:44:10 EDT 2012


You may want to look into BeautifulSoup: http://www.crummy.com/software/BeautifulSoup/
It's made for parsing (potentially malformed) HTML and is easy to use, so I'd say this is quite feasible.
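
As a rough sketch of what the parsing step could look like with BeautifulSoup 4 (the `bs4` package): the sample markup, the class names `post`, `username` and `content`, and the helper `extract_posts` are all made up for illustration. Real vBulletin templates vary by version and skin, so the selectors would need adjusting after inspecting the actual pages.

```python
from bs4 import BeautifulSoup

# Hypothetical, simplified markup. This is NOT real vBulletin output;
# the ids and class names are assumptions chosen for the example.
SAMPLE_HTML = """
<div id="posts">
  <div class="post" id="post_101">
    <span class="username">alice</span>
    <div class="content">First post<br>with a line break</div>
  </div>
  <div class="post" id="post_102">
    <span class="username">bob</span>
    <div class="content">A reply with an <b>unclosed tag</div>
  </div>
</div>
"""


def extract_posts(html):
    """Return a list of dicts (id, author, body), one per post found in html."""
    # "html.parser" is the parser bundled with Python, so no extra
    # dependency beyond bs4 itself is needed.
    soup = BeautifulSoup(html, "html.parser")
    posts = []
    for node in soup.find_all("div", class_="post"):
        author = node.find("span", class_="username")
        body = node.find("div", class_="content")
        if author is None or body is None:
            # Skip blocks that do not match the assumed layout.
            continue
        posts.append({
            "id": node.get("id"),
            "author": author.get_text(strip=True),
            # Join text fragments with a space so <br> and inline tags
            # do not glue words together.
            "body": body.get_text(" ", strip=True),
        })
    return posts


if __name__ == "__main__":
    for post in extract_posts(SAMPLE_HTML):
        print(post["id"], post["author"], post["body"])
```

Once posts are plain dicts like this, turning each one into an article for an NNTP server is a separate step that does not depend on the HTML at all.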

Thanks,
Nick Cash
NPC International

-----Original Message-----
From: python-list-bounces+nick.cash=npcinternational.com at python.org [mailto:python-list-bounces+nick.cash=npcinternational.com at python.org] On Behalf Of Andrew D'Angelo
Sent: Monday, June 25, 2012 14:10
To: python-list at python.org
Subject: vBulletin scraper -- feasible?

Taking a look through vBulletin's HTML, I was wondering whether it would be overly difficult to parse it into nice, manipulable data.
I suppose my ultimate goal would be to dynamically parse a vBulletin forum and feed it into a locally hosted NNTP server.


--
http://mail.python.org/mailman/listinfo/python-list




