[issue18753] [c]ElementTree.fromstring fails to parse <value>]]></value>

Kees Bos report at bugs.python.org
Fri Aug 16 21:13:14 CEST 2013


Kees Bos added the comment:

OK. I got clarification from the lxml list. It's not a bug, and it's specified in section 2.4 of the XML spec (http://www.w3.org/TR/REC-xml/#syntax):

The ampersand character (&) and the left angle bracket (<) MUST NOT appear in their literal form, except when used as markup delimiters, or within a comment, a processing instruction, or a CDATA section. If they are needed elsewhere, they MUST be escaped using either numeric character references or the strings " &amp; " and " &lt; " respectively. The right angle bracket (>) may be represented using the string " &gt; ", and MUST, for compatibility, be escaped using either " &gt; " or a character reference when it appears in the string " ]]> " in content, when that string is not marking the end of a CDATA section.

In the content of elements, character data is any string of characters which does not contain the start-delimiter of any markup and does not include the CDATA-section-close delimiter, " ]]> ". In a CDATA section, character data is any string of characters not including the CDATA-section-close delimiter, " ]]> ".
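
To illustrate the rule quoted above, here is a minimal sketch using only the standard library. The helper name parses() is made up for this example, and the exact error message from the parser is deliberately not shown, since it may differ between versions. The point is that the literal "]]>" in element content is rejected, while the escaped form round-trips to the same text:

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def parses(xml_text):
    """Return True if xml_text is accepted by ElementTree (hypothetical helper)."""
    try:
        ET.fromstring(xml_text)
    except ET.ParseError:
        return False
    return True


# Literal "]]>" in element content: not well-formed per section 2.4.
raw_ok = parses("<value>]]></value>")

# escape() replaces &, < and > with entity references, so "]]>" becomes "]]&gt;".
escaped_doc = "<value>" + escape("]]>") + "</value>"

# The parser turns the entity back into ">", so the text is preserved.
roundtrip = ET.fromstring(escaped_doc).text

print(raw_ok, escaped_doc, roundtrip)
```

When the document is built with ElementTree itself (setting element.text and serializing with ET.tostring), the escaping is done automatically; the problem only shows up when XML is assembled by hand from strings.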


Sorry for the confusion and for taking up your time with a bogus report.

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue18753>
_______________________________________

