[I18n-sig] UTF-8 and BOM

Martin v. Loewis martin@loewis.home.cs.tu-berlin.de
Thu, 17 May 2001 06:25:36 +0200


> > I disagree that putting the BOM into a file is a good thing - I think
> > it is stupid to do so. First of all, auto-detection can always be
> > fooled, so there should be a higher-level protocol for reliable data
> > processing. 
> 
> There should be, but there isn't always. What is the standard way for
> tagging UTF-8 documents on the Windows file system?

There probably is none, although giving them a .txt extension is a
good starting point. What is the standard for tagging KOI8-R documents
on the Windows file system?
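
The point quoted at the top, that auto-detection can always be fooled,
can be sketched as follows. This is only an illustration under stated
assumptions: guess_encoding and its Latin-1 default are hypothetical
names invented for the example, and the "utf-8-sig" codec comes from
later Python versions than the ones discussed in this thread. A legacy
8-bit file has no signature of its own, so a BOM sniffer must fall back
to a default, and it misreads any such file that happens to start with
the bytes EF BB BF.

```python
import codecs

def guess_encoding(data, default="latin-1"):
    """Hypothetical BOM sniffer: assume UTF-8 if the data starts with EF BB BF."""
    return "utf-8-sig" if data.startswith(codecs.BOM_UTF8) else default

# A genuine Latin-1 text whose first three characters are U+00EF, U+00BB, U+00BF.
latin1_text = "\xef\xbb\xbf is how a UTF-8 BOM looks in Latin-1"
latin1_bytes = latin1_text.encode("latin-1")

# The sniffer sees EF BB BF and wrongly concludes the file is UTF-8.
guessed = guess_encoding(latin1_bytes)

# Decoding with the guessed codec silently drops the first three characters.
misread = latin1_bytes.decode(guessed)
```

Here the three leading characters are lost without any error being
raised, which is why a higher-level protocol is the more reliable way
to state the encoding.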

> So what if there is a BOM in the middle of the data stream? MAL's
> decoder will just remove it anyhow. :)

Yes, and I think this is a bug.
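
A minimal sketch of why removing it everywhere loses data. This is not
MAL's decoder: strip_leading_bom and strip_all_boms are hypothetical
helpers, and the "utf-8-sig" codec used for comparison was added to
Python after this thread. U+FEFF acts as a signature only at the very
start of a stream; anywhere else it is the character ZERO WIDTH
NO-BREAK SPACE and belongs to the text.

```python
import codecs

BOM = "\ufeff"  # signature at stream start, ZERO WIDTH NO-BREAK SPACE elsewhere

def strip_leading_bom(text):
    """Remove U+FEFF only when it is the very first character."""
    return text[1:] if text.startswith(BOM) else text

def strip_all_boms(text):
    """Remove every U+FEFF, including ones that are part of the text."""
    return text.replace(BOM, "")

# A UTF-8 stream with a leading BOM and a U+FEFF in the middle of the data.
data = codecs.BOM_UTF8 + "foo\ufeffbar".encode("utf-8")
decoded = data.decode("utf-8")          # plain UTF-8 keeps both U+FEFF characters

leading_only = strip_leading_bom(decoded)
everywhere = strip_all_boms(decoded)
via_codec = data.decode("utf-8-sig")    # stdlib codec drops only the leading BOM
```

Stripping everywhere turns "foo\ufeffbar" into "foobar", so the result
no longer round-trips to the original bytes, while stripping only the
leading signature preserves the interior character.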

Regards,
Martin