[Python-ideas] Support Unicode code point notation

M.-A. Lemburg mal at egenix.com
Sat Jul 27 12:07:22 CEST 2013


Steven D'Aprano wrote:
> Unicode's standard notation for code points is U+ followed by a 4, 5 or 6 hex digit string, such as
> π = U+03C0. This notation is found throughout the Unicode Consortium's website, e.g.:
> 
> http://www.unicode.org/versions/corrigendum2.html
> 
> as well as in third party sites that have reason to discuss Unicode code points, e.g.:
> 
> https://en.wikipedia.org/wiki/Eth#Computer_input
> 
> I propose that Python strings support this as the preferred escape notation for Unicode code points:
> 
> '\U+03C0'
> => 'π'

-1.

The \u and \U notations are already established in several other
programming languages: C and C++ accept both forms, and Java accepts
\u. So we're in good company.
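
As a rough sketch of what Python 3 already offers for the π example
above: the escapes and chr() below are existing features, while
from_uplus() is a hypothetical helper name introduced only for
illustration, for the case where U+XXXX text arrives at runtime.

```python
# Existing ways to spell U+03C0 (GREEK SMALL LETTER PI) in Python 3.
pi_u4 = "\u03c0"                        # \u takes exactly 4 hex digits
pi_u8 = "\U000003c0"                    # \U takes exactly 8 hex digits
pi_name = "\N{GREEK SMALL LETTER PI}"   # lookup by Unicode character name
pi_chr = chr(0x03C0)                    # runtime conversion from an integer


def from_uplus(notation):
    """Hypothetical helper: turn text such as 'U+03C0' into a character."""
    if not notation.upper().startswith("U+"):
        raise ValueError("expected U+ followed by hex digits: %r" % notation)
    # Drop the 'U+' prefix and parse the rest as a hexadecimal code point.
    return chr(int(notation[2:], 16))
```

All four spellings produce the same one-character string, and
from_uplus("U+03C0") returns that character as well, without any new
escape syntax.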

-- 
Marc-Andre Lemburg
eGenix.com

