extracting HTML fragments and counting words

Ksenia Marasanova ksenia.marasanova at gmail.com
Fri Feb 18 15:28:24 EST 2005


Hi,

I want to show preview  of several HTML formatted newsitems on one
page, preserving markup (and images) intact, but showing not more
thatn X first _readable_ words of every page. Is anyone aware of some
Python library that makes programming this easy? I already started to
program it with Beautiful Soup, but maybe there is a more easy way...

Thanks!
-- 
Ksenia



More information about the Python-list mailing list