[I18n-sig] XML and UTF-16

Tom Emerson tree@basistech.com
Thu, 31 May 2001 17:45:26 -0400


Paul Prescod writes:
> Would it matter if you were looking at <!DOCTYPE? Anyhow, a UTF-32
> document without an XML declaration would be in error. The declaration
> is required for everything other than UTF-8 and UTF-16.

I guess my point is that it is better to be overly conservative up
front and look for at least two complete characters (in whatever
encoding) before attempting to process the document.
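
As a rough illustration of that conservative check, here is a minimal
Python sketch. It is an assumption-laden example, not any parser's
actual code: the function name sniff_xml_encoding and the tables _BOMS,
_ENCODINGS and _OPENERS are hypothetical. It only accepts an encoding
once it has seen either a byte order mark or two complete characters
("<?" or "<!") in that encoding, so a lone 3C 00 00 00 is not enough to
decide between UTF-32LE and UTF-16LE.

```python
import codecs
from typing import Optional

# Byte order marks, widest first, so that the UTF-32LE BOM (FF FE 00 00)
# is not mistaken for the UTF-16LE BOM (FF FE).
_BOMS = [
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
]

# Candidate encodings for a document that has no BOM (assumed list).
_ENCODINGS = ["utf-32-be", "utf-32-le", "utf-16-be", "utf-16-le", "utf-8"]

# Two-character openers: "<?" for an XML declaration or processing
# instruction, "<!" for <!DOCTYPE or a comment.
_OPENERS = ["<?", "<!"]


def sniff_xml_encoding(data: bytes) -> Optional[str]:
    """Guess the encoding family from the first bytes of a document.

    Returns None when fewer than two complete characters match, so the
    caller can read more input or fall back to a default.
    """
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name
    for name in _ENCODINGS:
        for opener in _OPENERS:
            # Encoding with an explicit byte order does not emit a BOM.
            if data.startswith(opener.encode(name)):
                return name
    return None
```

A document that begins directly with a root element such as <doc>
returns None here; in that case a caller would presumably fall back to
UTF-8 or examine more of the input before deciding.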

    -tree

-- 
Tom Emerson                                          Basis Technology Corp.
Sr. Sinostringologist                              http://www.basistech.com
  "Beware the lollipop of mediocrity: lick it once and you suck forever"