Rss/xml namespaces sgmllib, sax, minidom

uche.ogbuji at gmail.com uche.ogbuji at gmail.com
Mon Jan 2 01:26:27 EST 2006


Sakcee wrote:
> I want to build a simple validator for rss2 feeds, that checks basic
> structure and reports channels , items , and their attributes etc.
>
> I have been reading Mark Pilgrims articles on xml.com, diveintopython
> and someother stuff on sgmllib, sax.handlers and content handlers,
> xml.dom.minidom
>
> why is all of this necessary, what is the difference between all these
> libraries, it seems to me that I can parse the rss2 feed with any of
> these libraries.!! ?
>
> what is the difference between namespaces and non-namspaces functions
> in sax.handlers.contenthandler , is the namespace defined like domain
> names on some website?

Based on this question, I tend to think you might want to leave the XML
processing to someone else's code.  How about using Pilgrim's
feedparser?

http://feedparser.org/

--
Uche Ogbuji                               Fourthought, Inc.
http://uche.ogbuji.net                    http://fourthought.com
http://copia.ogbuji.net                   http://4Suite.org
Articles: http://uche.ogbuji.net/tech/publications/




More information about the Python-list mailing list