Readlines returns non ASCII character

paul.hermeneutic at gmail.com paul.hermeneutic at gmail.com
Wed Sep 23 20:07:14 EDT 2015


After looking at this briefly, I am not sure that this is a plain-text
file. Interpreting it as UTF-16 LE shows that the characters are as
they appear.

Immediately after the BOM is:

SINGLE LOW-9 QUOTATION MARK' (U+201A)
START OF HEADING (U+0001)
SPACE (U+0020)
SPACE (U+0020)
LATIN SMALL LETTER Y WITH DIAERESIS (U+00FF)
LATIN SMALL LETTER THORN (U+00FE)
LATIN SMALL LETTER Y WITH DIAERESIS (U+00FF)



More information about the Python-list mailing list