[pypy-dev] Interest in GSoC project: UTF-8 internal unicode storage

Armin Rigo arigo at tunes.org
Tue Mar 8 11:30:12 EST 2016


Hi Robin,

On 8 March 2016 at 17:10, Robin Kruppe <robin.kruppe at gmail.com> wrote:
> I just wanted to mention that several other language implementors have faced
> ...
> While everything described there may be pretty obvious (for those immersed
> in the guts of Unicode), I wanted to raise awareness that this has a name
> and other users.

Thanks!  In principle, we'd be using the "generalized UTF-8" described
at https://simonsapin.github.io/wtf-8/.  We'd not be using WTF-8 itself,
because it considers that u'\ud805\udc09' == u'\U00011409', whereas
CPython generally does not.
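
To make the distinction concrete, here is a minimal sketch, assuming
CPython 3.3 or later (where a str is a sequence of code points).  The
variable names are purely illustrative, and the 'surrogatepass' error
handler is used here only as a stand-in for a generalized-UTF-8 encoder:

```python
# A surrogate pair written as two separate code points, and the single
# supplementary code point that the same pair would denote in UTF-16.
pair = '\ud805\udc09'
astral = '\U00011409'

# CPython 3.3+ keeps the two strings distinct.
assert pair != astral
assert (len(pair), len(astral)) == (2, 1)

# Encoding each surrogate on its own gives two 3-byte sequences, which is
# what generalized UTF-8 permits.  WTF-8 forbids this byte sequence and
# only allows the 4-byte form for a paired surrogate.
pair_bytes = pair.encode('utf-8', 'surrogatepass')
astral_bytes = astral.encode('utf-8')

print(pair_bytes.hex(), astral_bytes.hex())

# Decoding the 6-byte form gives back two surrogates, not one code point.
roundtrip = pair_bytes.decode('utf-8', 'surrogatepass')
assert roundtrip == pair
```

With a storage format that keeps the 6-byte and 4-byte forms apart, the
two strings stay unequal after a round trip, matching the CPython
behaviour shown by the assertions.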


See you soon,

Armin.

