[TriZPUG] jump start needed

Paul McLanahan pmclanahan at gmail.com
Sun Dec 7 21:57:06 CET 2008


On Fri, Dec 5, 2008 at 8:34 PM, Scott Hicks <cshicks71 at gmail.com> wrote:
> * I have a particular project in mind.  Pretty simple html parsing /
> processing and reading and writing to an sqlite database (database is
> complete).  I have looked at and played around with HTMLParser, urllib, and
> other libraries such as twill.  They all seem to do the same job and I am
> not sure which one would best suit my needs.  Eventually, I want to get into
> driving Selenium Grid with Python.

Congrats on getting a Mac :) I have a 3GHz 24" iMac as my primary
machine and I love it.

I'm no Python veteran, but I'm starting to use it a lot at work. My
only suggestion is in the HTML parsing realm. I've looked into this
some, and a lot of people love the Beautiful Soup library for HTML
parsing. It's easy to use and feels Pythonic, and it will also power
through really bad HTML, which helps when you don't control the markup
you'll be parsing.
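
In case a concrete starting point helps, here is a rough sketch of the
parse-then-store flow you described: pull the links out of some sloppy
HTML and write them to sqlite. Treat it as an illustration only. I'm
assuming a current Python with Beautiful Soup installed as the bs4
package (it falls back to the standard library's HTMLParser if the
import fails), and the "links" table, the SAMPLE markup, and the
function names are all made up for the example. Your real schema will
differ.

```python
import sqlite3
from html.parser import HTMLParser

try:
    # Third-party; assumed to be installed as the "beautifulsoup4" package.
    from bs4 import BeautifulSoup
except ImportError:
    BeautifulSoup = None


class LinkCollector(HTMLParser):
    """Stdlib fallback: collect (href, text) for every <a> that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Return a list of (href, label) tuples found in the given HTML."""
    if BeautifulSoup is not None:
        soup = BeautifulSoup(html, "html.parser")
        return [(a["href"], a.get_text().strip())
                for a in soup.find_all("a", href=True)]
    parser = LinkCollector()
    parser.feed(html)
    return parser.links


def store_links(conn, links):
    """Write the links into a hypothetical 'links' table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS links (href TEXT PRIMARY KEY, label TEXT)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO links (href, label) VALUES (?, ?)", links
    )
    conn.commit()


# Deliberately messy markup: unquoted attribute, unclosed <li> and <ul>.
SAMPLE = """
<html><body>
<ul>
  <li><a href=/one>First</a>
  <li><a href="/two">Second </a>
  <li><a>no link here</a>
</body>
"""

if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # swap in the path to your own database
    store_links(conn, extract_links(SAMPLE))
    for row in conn.execute("SELECT href, label FROM links ORDER BY href"):
        print(row)
```

The parsing and the database write are kept in separate functions so
you can swap either half out (a different parser, or your existing
tables) without touching the other.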

Good luck!

http://www.crummy.com/software/BeautifulSoup/

Paul

