retrieving ATOM/FSS feeds

Lawrence Oluyede raims at dot.com
Mon Aug 13 05:21:05 EDT 2007


_spitFIRE <timid.gentoo at gmail.com> wrote:
>   I'm using feedparser library to parser ATOM/RSS feeds. However, I don't
> get the entire post! but only summaries! How do I retrieve the entire feed?
> I believe that the parser library should have support for doing that or the
> specification should detail how it can be done? Or should I simply get the
> feed link and do HTML scraping?

If the content producer doesn't provide the full article via RSS/ATOM
there's no way you can get it from there. Search for full content feeds
if any, otherwise get the article URL and feed it to BeautifulSoup to
scrape the content.

-- 
Lawrence, oluyede.org - neropercaso.it
"It is difficult to get a man to understand 
something when his salary depends on not
understanding it" - Upton Sinclair



More information about the Python-list mailing list