HTML Structure Extraction
dayzman at hotmail.com
dayzman at hotmail.com
Wed Dec 8 01:31:16 EST 2004
Hi,
I'm going to write a program that extracts the structure of HTML
documents. The structure would be in the form of a tree, separating the
tags and grouping the start and end tags. I think I will use
htmllib.HTMLParser, is it appropriate for my application? If so, I
believe I will need to keep track of the depth reached.
Any tips for such application will be much appreciated.
Cheers,
Michael
More information about the Python-list
mailing list