[Python-Dev] Draft Guide for code migration and modernation

M.-A. Lemburg mal@lemburg.com
Tue, 04 Jun 2002 16:17:26 +0200


Guido van Rossum wrote:
>>>          c in string.whitespace --> c.isspace()
>>
>>This changes the meaning slightly for unicode characters, because
>>chr(i).isspace() != unichr(i).isspace()
>>for i in { 0x1c, 0x1d, 0x1e, 0x1f, 0x85, 0xa0 }
> 
> 
> That's unfortunate, because I'd like unicode to be an extension of
> ASCII also in this kind of functionality.  What are these and why are
> they considered spaces?  Would it hurt to make them spaces in ASCII
> too?

 From the Unicode database:

001C;<control>;Cc;0;B;;;;;N;FILE SEPARATOR;;;;
001D;<control>;Cc;0;B;;;;;N;GROUP SEPARATOR;;;;
001E;<control>;Cc;0;B;;;;;N;RECORD SEPARATOR;;;;
001F;<control>;Cc;0;S;;;;;N;UNIT SEPARATOR;;;;

0085;<control>;Cc;0;B;;;;;N;NEXT LINE;;;;

00A0;NO-BREAK SPACE;Zs;0;CS;<noBreak> 0020;;;;N;NON-BREAKING SPACE;;;;

-- 
Marc-Andre Lemburg
CEO eGenix.com Software GmbH
______________________________________________________________________
Company & Consulting:                           http://www.egenix.com/
Python Software:                   http://www.egenix.com/files/python/
Meet us at EuroPython 2002:                 http://www.europython.org/