Using Beautiful Soup to entangle bookmarks.html

Diez B. Roggisch deets at nospam.web.de
Thu Sep 7 11:51:23 EDT 2006


Francach schrieb:
> Hi,
> 
> I'm trying to use the Beautiful Soup package to parse through the
> "bookmarks.html" file which Firefox exports all your bookmarks into.
> I've been struggling with the documentation trying to figure out how to
> extract all the urls. Has anybody got a couple of longer examples using
> Beautiful Soup I could play around with?

Why do you use BeautifulSoup on that? It's generated content, and I 
suppose it is well-formed, most probably even xml. So use a standard 
parser here, better yet somthing like lxml/elementtree

Diez



More information about the Python-list mailing list