Hi,

Does anyone have a suggestion for the following problem? Some Word documents have been converted to HTML pages in MS Word. I want to filter out the Word-specific markup, such as <o:p></o:p>, <![if !supportEmptyParas]> <![endif]>, <?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" />, etc. I couldn't solve this using SGMLParser.

Thanks,
Sanjay
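
To make the kind of filtering described above concrete, here is a minimal sketch. It assumes plain regular expressions are acceptable in place of SGMLParser. The function name strip_word_markup and the three patterns are illustrative assumptions that cover only the examples quoted in the post; this is not a complete Word-HTML cleaner.

```python
import re

# Hypothetical patterns, one per example from the post (assumptions, not an exhaustive list).
_WORD_PATTERNS = [
    # Conditional markers such as <![if !supportEmptyParas]> and <![endif]>
    re.compile(r"<!\[[^\]]*\]>"),
    # XML processing instructions such as <?xml:namespace prefix = o ... />
    re.compile(r"<\?xml[^>]*>"),
    # Opening or closing tags with a namespace prefix, such as <o:p> and </o:p>
    re.compile(r"</?\w+:\w+[^>]*>"),
]


def strip_word_markup(html):
    """Return html with the Word-specific markup matched above removed."""
    for pattern in _WORD_PATTERNS:
        html = pattern.sub("", html)
    return html


if __name__ == "__main__":
    sample = (
        "<p>Hello<o:p></o:p></p>"
        "<![if !supportEmptyParas]>&nbsp;<![endif]>"
        '<?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" />'
    )
    print(strip_word_markup(sample))
```

Note that this removes only the tags themselves, so any text between <o:p> and </o:p> is kept. The second pattern would also remove an ordinary <?xml ...?> declaration, and comment-style conditionals such as <!--[if ...]> are not handled.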