[Python-Dev] Unicode and Windows
M.-A. Lemburg
mal@lemburg.com
Wed, 22 Mar 2000 17:31:26 +0100
Jack Jansen wrote:
>
> > > [on the user-supplies-buffer interface]
> > > I think this would be much less error-prone than having fixed-length buffers
> > > all over the place.
> >
> > PyArg_ParseTuple() should probably raise an error in case the
> > data doesn't fit into the buffer.
>
> Ah, that's right, that solves most of that problem.
>
> > > [on the malloced interface]
> > Good point. You'll still need the buffer_len output parameter
> > though -- otherwise you wouldn't be able tell the size of the
> > allocated buffer (the returned data may not be terminated).
>
> Are you sure? I would expect the "eS" format to be used to obtain 8-bit data
> in some local encoding, and I would expect that all 8-bit encodings of unicode
> data would still allow for null-termination. Or are there 8-bit encodings out
> there where a zero byte is normal occurrence and where it can't be used as
> terminator?
Not sure whether these exist or not, but they are certainly
a possibility to keep in mind.
Perhaps adding "es#" and "es" (with 0-byte check) would be
ideal ?!
--
Marc-Andre Lemburg
______________________________________________________________________
Business: http://www.lemburg.com/
Python Pages: http://www.lemburg.com/python/