HTML filtering
Stuart D. Gathman
stuart at bmsi.com
Wed May 1 15:06:36 EDT 2002
I need to filter HTML to remove certain constructs (e.g. <script ...> ...
</script>). I am trying to use the batteries. The htmllib module helps
with the parsing, but it seems like a lot of work to create a formatter
that passes everything (except script) through in HTML syntax, especially
while trying to preserve the original syntax. Am I missing something? Is
there another module I should be using for filtering HTML? Perhaps one of
those ad-stripping filters written in Python would provide a usable
example?
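
For illustration, here is a minimal sketch of the kind of pass-through
filter described above. It is not based on htmllib: it assumes a current
Python 3 and uses html.parser.HTMLParser from the standard library
instead. The names ScriptStripper and strip_scripts are hypothetical,
chosen only for this example. Start tags are re-emitted verbatim via
get_starttag_text(); end tags, entities, comments and declarations are
rebuilt from the parsed pieces, so their original spelling (case, a
missing semicolon) is only approximately preserved.

```python
from html.parser import HTMLParser


class ScriptStripper(HTMLParser):
    """Hypothetical filter: echo HTML as parsed, dropping <script> elements."""

    def __init__(self):
        # convert_charrefs=False so entities reach handle_entityref /
        # handle_charref and can be written back out instead of decoded.
        super().__init__(convert_charrefs=False)
        self.out = []
        self.skipping = False  # True while inside a <script> element

    def emit(self, text):
        if not self.skipping:
            self.out.append(text)

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            self.skipping = True
        else:
            # get_starttag_text() returns the start tag's original source text
            self.emit(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/>; a self-closing script is dropped.
        if tag != "script":
            self.emit(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag == "script":
            self.skipping = False
        else:
            # Rebuilt from the lowercased tag name, not the original text
            self.emit("</%s>" % tag)

    def handle_data(self, data):
        self.emit(data)

    def handle_entityref(self, name):
        self.emit("&%s;" % name)

    def handle_charref(self, name):
        self.emit("&#%s;" % name)

    def handle_comment(self, data):
        self.emit("<!--%s-->" % data)

    def handle_decl(self, decl):
        self.emit("<!%s>" % decl)

    def handle_pi(self, data):
        self.emit("<?%s>" % data)


def strip_scripts(html):
    """Return html with every <script> element removed."""
    parser = ScriptStripper()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.out)


sample = ('<p class="x">Hi &amp; bye</p>'
          '<script type="text/javascript">alert("<b>");</script>'
          '<!-- c --><br>')
print(strip_scripts(sample))
```

Extending the filter to other constructs would mean adding their tag
names to the checks in handle_starttag and handle_endtag. Note that this
only removes script elements; it does not touch inline event-handler
attributes such as onclick, so it should not be taken as a complete
sanitizer.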
--
Stuart D. Gathman <stuart at bmsi.com>
Business Management Systems Inc. Phone: 703 591-0911 Fax: 703 591-6154
"Confutatis maledictis, flammis acribus addictis" - background song for
a Microsoft sponsored "Where do you want to go from here?" commercial.