Python 2.6 StreamReader.readline()

Walter Dörwald walter at livinglogic.de
Wed Jul 25 05:02:01 EDT 2012


On 25.07.12 08:09, Ulrich Eckhardt wrote:

> Am 24.07.2012 17:01, schrieb cpppwner at gmail.com:
>>      reader = codecs.getreader(encoding)
>>      lines  =  []
>>      with open(filename, 'rb') as f:
>>          lines  = reader(f, 'strict').readlines(keepends=False)
>>
>> where encoding == 'utf-16-be'
>> Everything works fine, except that lines[0] is equal to
>> codecs.BOM_UTF16_BE
>> Is this behaviour correct, that the BOM is still present?
>
> Yes, assuming the first line only contains that BOM. Technically it's a
> space character, and why should those be removed?

If the first "character" in the file is a BOM the file encoding is 
probably not utf-16-be but utf-16.

Servus,
    Walter




More information about the Python-list mailing list