From jackdied at gmail.com Tue Jun 1 01:31:06 2010
From: jackdied at gmail.com (Jack Diederich)
Date: Mon, 31 May 2010 19:31:06 -0400
Subject: [Python-ideas] An identity dict
In-Reply-To: References: <79AAB0F9-3775-40A4-9408-2A8286FC6EDB@gmail.com>
Message-ID:

On Mon, May 31, 2010 at 2:05 PM, Terry Reedy wrote:
> current vote: -.3
> I am also not yet convinced, but perhaps could be, that either type, with or
> without generalization, should be in the stdlib. Instances of a user class
> without custom equality are already compared by identity. The use cases for
> keying immutables by identity are pretty sparse. That pretty much leaves
> mutables with custom equality (by value rather than identity).

I'm -1 on the idea without a strong use case. I vaguely recall implementing one of these before, but I think I was using it as a hacky weakrefdict. Looking in my libmisc.py for dict-alikes I see an OrderedDict (obsoleted), a ForgivingDict (obsoleted by defaultdict), a ProxyDict, and a DecorateDict. The ProxyDict can push/pop dicts and does lookups across all of them, most recent first, and performs sets in the most recent. The DecorateDict calls a function on the value before returning it. Django has classes with almost the exact same code (not contributed by me).

Django: http://code.djangoproject.com/svn/django/trunk/django/utils/datastructures.py
Me: http://bazaar.launchpad.net/~odbrazen/leanlyn/trunk/annotate/head:/libmisc.py

-Jack

From benjamin at python.org Tue Jun 1 02:44:41 2010
From: benjamin at python.org (Benjamin Peterson)
Date: Tue, 1 Jun 2010 00:44:41 +0000 (UTC)
Subject: [Python-ideas] An identity dict
References: <79AAB0F9-3775-40A4-9408-2A8286FC6EDB@gmail.com>
Message-ID:

Raymond Hettinger writes:
> Also, I haven't seen much of a discussion of use cases.

Here's a selection of use cases from PyPy's source (you can search for "identity_dict" to see its use):

In an algorithm for breaking cycles in graphs:
http://codespeak.net/svn/pypy/trunk/pypy/tool/algo/graphlib.py

Keeping track of all the allocated objects in a model of a low-level runtime:
http://codespeak.net/svn/pypy/trunk/pypy/rpython/lltypesystem/lltype.py

Tracing the source of a certain kind of type as our type checker annotates RPython:
http://codespeak.net/svn/pypy/trunk/pypy/annotation/bookkeeper.py

Traversing the blocks of a function's graph:
http://codespeak.net/svn/pypy/trunk/pypy/objspace/flow/model.py

Essentially these are places where defined equality should not matter.

I could also use it here:
http://code.activestate.com/recipes/577242-calling-c-level-finalizers-without-__del__/

From benjamin at python.org Tue Jun 1 02:45:45 2010
From: benjamin at python.org (Benjamin Peterson)
Date: Tue, 1 Jun 2010 00:45:45 +0000 (UTC)
Subject: [Python-ideas] An identity dict
References: <4C030321.4050803@canterbury.ac.nz> <89E5DB78-B304-4A9F-B140-96888B2FCCC7@gmail.com>
Message-ID:

Raymond Hettinger writes:
> Also, there hasn't been much discussion of implementation,
> but unless you're willing to copy and paste most of the
> code in dictobject.c, you're going to end up with something
> much slower than d[id(obj)]=value.
It can be implemented simply in Python:
http://codespeak.net/svn/pypy/trunk/pypy/lib/identity_dict.py

From raymond.hettinger at gmail.com Tue Jun 1 03:23:18 2010
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Mon, 31 May 2010 18:23:18 -0700
Subject: [Python-ideas] An identity dict
In-Reply-To: References: <4C030321.4050803@canterbury.ac.nz> <89E5DB78-B304-4A9F-B140-96888B2FCCC7@gmail.com>
Message-ID:

On May 31, 2010, at 5:45 PM, Benjamin Peterson wrote:
> Raymond Hettinger writes:
>> Also, there hasn't been much discussion of implementation,
>> but unless you're willing to copy and paste most of the
>> code in dictobject.c, you're going to end up with something
>> much slower than d[id(obj)]=value.
>
> It can be implemented simply in Python:
> http://codespeak.net/svn/pypy/trunk/pypy/lib/identity_dict.py

That code is pretty much what I expected. In CPython, it is dramatically slower than using a regular dictionary with d[id(obj)]=value. In PyPy, it makes sense because the code gets optimized as if it were hand-coded in C. IOW, identity_dict.py doesn't make much sense for other implementations.

> Here's a selection of use cases from PyPy's source (you can search for
> "identity_dict" to see its use):
>
> In an algorithm for breaking cycles in graphs:
> http://codespeak.net/svn/pypy/trunk/pypy/tool/algo/graphlib.py

This is code that doesn't require or benefit from using an identity dictionary. Regular dicts work just fine here. And since identity implies equality for regular CPython dicts, you already get excellent performance (i.e. the __eq__ methods never get called when the object identities already match).

> Keeping track of all the allocated objects in a model of a low-level runtime:
> http://codespeak.net/svn/pypy/trunk/pypy/rpython/lltypesystem/lltype.py

This is a ton of code and I can't easily tell what it is doing or comment on it.

> Tracing the source of a certain kind of type as our type checker annotates
> RPython: http://codespeak.net/svn/pypy/trunk/pypy/annotation/bookkeeper.py

Looks to be another case where a regular dict works just fine.

> Traversing the blocks of a function's graph:
> http://codespeak.net/svn/pypy/trunk/pypy/objspace/flow/model.py

This code also works fine with a regular dictionary or a regular Python set. If you used the identity_dict.py code mentioned above, it would just slow down the code. This isn't really even a dictionary use case; a set would be a better choice.

> Essentially these are places where defined equality should not matter.

Essentially, these are cases where an identity dictionary isn't necessary and would in fact be worse performance-wise in every implementation except for PyPy, which can compile the pure Python code for identity_dict.py. Since instances have a default hash equal to the id, and since identity implies equality for dictionary keys, we already have a dictionary that handles these cases. You don't even have to type d[id(k)]=value; it would suffice to write d[k]=value.

Sorry, but I think this idea is a total waste. Perhaps post it as a recipe, but it doesn't make sense to try to inject it into the standard library.

Raymond

From benjamin at python.org Tue Jun 1 04:31:39 2010
From: benjamin at python.org (Benjamin Peterson)
Date: Tue, 1 Jun 2010 02:31:39 +0000 (UTC)
Subject: [Python-ideas] An identity dict
References: Message-ID:

Lie Ryan writes:
> that their id() is expensive is an implementation detail, and the
> developer of PyPy should solve that instead of adding a crutch to the
> stdlib.
The stdlib isn't just about CPython. We already have optimized primitives for CPython, so I don't see why helping other implementations isn't a good cause.

From raymond.hettinger at gmail.com Tue Jun 1 06:37:05 2010
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Mon, 31 May 2010 21:37:05 -0700
Subject: [Python-ideas] An identity dict
In-Reply-To: References: Message-ID: <7AC8DB63-DAD6-46EA-89B1-AA339E4D7B43@gmail.com>

On May 31, 2010, at 7:31 PM, Benjamin Peterson wrote:
> Lie Ryan writes:
>> that their id() is expensive is an implementation detail, and the
>> developer of PyPy should solve that instead of adding a crutch to the
>> stdlib.
>
> The stdlib isn't just about CPython. We already have optimized primitives for
> CPython, so I don't see why helping other implementations isn't a good cause.

Benjamin, could you elaborate on several points that are unclear:

* If id() is expensive in PyPy, then how are they helped by the code in http://codespeak.net/svn/pypy/trunk/pypy/lib/identity_dict.py which uses id() for the gets and sets and contains?

* In the examples you posted (such as http://codespeak.net/svn/pypy/trunk/pypy/tool/algo/graphlib.py ), it appears that PyPy already has an identity dict, so how are they helped by adding one to the collections module?

* Most of the posted examples already work with regular dicts (which check identity before they check equality) -- don't the other implementations already implement regular dicts, which need to have identity-implied-equality in order to pass the test suite? I would expect the following snippet to work under all versions and implementations of Python:

    >>> class A:
    ...     pass
    >>> a = A()
    >>> d = {a: 10}
    >>> assert d[a] == 10   # uses a's identity for lookup

* Is the proposal something needed for all implementations, or is it just an optimization for a particular, non-CPython implementation?

Raymond

From jh at improva.dk Tue Jun 1 09:52:44 2010
From: jh at improva.dk (Jacob Holm)
Date: Tue, 01 Jun 2010 09:52:44 +0200
Subject: [Python-ideas] An identity dict
In-Reply-To: References: <4C030321.4050803@canterbury.ac.nz> <89E5DB78-B304-4A9F-B140-96888B2FCCC7@gmail.com>
Message-ID: <4C04BC4C.2030103@improva.dk>

On 2010-06-01 03:23, Raymond Hettinger wrote:
>
> On May 31, 2010, at 5:45 PM, Benjamin Peterson wrote:
>
>> Essentially these are places where defined equality should not matter.

"should not matter" is the important part here. It might have been clearer to say "should be ignored" instead. I think Raymond is misunderstanding it.

> Essentially, these are cases where an identity dictionary isn't
> necessary and would in fact be worse performance-wise
> in every implementation except for PyPy, which can compile
> the pure Python code for identity_dict.py.

It is necessary, because the objects involved might define their own __hash__ and __cmp__/__eq__, and these should *not* be used.

> Sorry, but I think this idea is a total waste. Perhaps post it as
> a recipe, but it doesn't make sense to try to inject it into the
> standard library.

I don't think it is a total waste, but I have seen two ideas in this thread that I find more generally useful. One is "collections.keyfuncdict", which could be trivially used as an identitydict. The other is a WeakIdentityDict, which is a WeakKeyDict that uses only the identity of the keys for hashing/equality.
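For concreteness, here is a minimal sketch of what such a keyfunc-based mapping could look like. This is an illustration only: "keyfuncdict" is the name proposed in this thread, not an existing collections class, and with keyfunc=id it bypasses the keys' own __hash__/__eq__ and behaves as an identity dict.

try:
    from collections.abc import MutableMapping  # Python 3.3+
except ImportError:
    from collections import MutableMapping      # Python 2.6 - 3.2

class keyfuncdict(MutableMapping):
    """Mapping that hashes and compares keys via keyfunc(key)."""

    def __init__(self, keyfunc=id):
        self._keyfunc = keyfunc
        self._data = {}  # keyfunc(key) -> (key, value)

    def __getitem__(self, key):
        return self._data[self._keyfunc(key)][1]

    def __setitem__(self, key, value):
        # Storing the real key keeps a strong reference to it, so in
        # CPython its id() cannot be recycled while the entry exists.
        self._data[self._keyfunc(key)] = (key, value)

    def __delitem__(self, key):
        del self._data[self._keyfunc(key)]

    def __iter__(self):
        return (key for key, _ in self._data.values())

    def __len__(self):
        return len(self._data)

d = keyfuncdict(keyfunc=id)  # identity semantics: equal-but-distinct keys stay distinct

Note that the strong reference kept to each key is exactly what separates this from the WeakIdentityDict idea mentioned above.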
These two are independent; one cannot be used to implement the other (unless collections.keyfuncdict grows an option to not keep strong refs to the keys, perhaps by providing the inverse keyfunc instead).

Anyway, +0.1 on identitydict and +1 on each of collections.keyfuncdict and WeakIdentityDict.

- Jacob

From ziade.tarek at gmail.com Tue Jun 1 10:54:27 2010
From: ziade.tarek at gmail.com (=?ISO-8859-1?Q?Tarek_Ziad=E9?=)
Date: Tue, 1 Jun 2010 10:54:27 +0200
Subject: [Python-ideas] stdlib upgrades
Message-ID:

Hello,

That's not a new idea, but I'd like to throw it here again.

Some modules/packages in the stdlib are pretty isolated, which means that they could be upgraded with no harm, independently from the rest. For example the unittest package, or the email package.

Here's an idea:

1 - add a version number in each package or module of the stdlib that is potentially upgradable

2 - create standalone releases of these modules/packages at PyPI, in a restricted area 'stdlib upgrades' that can be used only by core devs to upload new versions. Each release lists the precise Python versions it's compatible with.

3 - once distutils2 is back in the stdlib, provide a command line interface to list upgradable packages, and make it possible to upgrade them

4 - an upgraded package lands in a new specific site-packages directory and is loaded *before* the one in Lib

Regards
Tarek

--
Tarek Ziadé | http://ziade.org

From dickinsm at gmail.com Tue Jun 1 11:00:23 2010
From: dickinsm at gmail.com (Mark Dickinson)
Date: Tue, 1 Jun 2010 10:00:23 +0100
Subject: [Python-ideas] Date/time literals
In-Reply-To: References: Message-ID:

On Sun, May 30, 2010 at 9:28 AM, Masklinn wrote:
>
> datetime does have a bunch of issues and limitations which I believe soon
> become harmful when doing serious date/calendaring work (which I don't
> claim to do, but I've seen colleagues in serious trouble due to both
> personal lack of knowledge in the field and issues with datetime itself):
> it only supports a Gregorian calendar, for instance; it's horrendous
> in dealing with timezones; some methods are pretty much broken; the
> constructor refuses "24" as an hour value; it blows up on positive leap
> seconds ...

Are there tracker issues open for all these problems? If not, please would you consider opening some? The datetime module has recently been getting a lot more attention than it used to, thanks largely to the efforts of Alexander Belopolsky, so I think opening relevant tracker issues would be worthwhile.

Some of the issues you mention look like easy fixes (e.g., allowing positive leap seconds, allowing '24:00:00' as a valid time).

The API problems for timezones look a little bit more serious.

What are the use-cases for non-Gregorian calendars, and why do you think the datetime module should support them? This seems like a specialist need to me.

And which methods are 'pretty much broken'?

If you want to see progress on these issues, please do open some bugtracker issues. Or if open issues already exist, it might be worth pinging them.
Mark

From guido at python.org Tue Jun 1 16:08:50 2010
From: guido at python.org (Guido van Rossum)
Date: Tue, 1 Jun 2010 07:08:50 -0700
Subject: [Python-ideas] Date/time literals
In-Reply-To: References: Message-ID:

On Tue, Jun 1, 2010 at 2:00 AM, Mark Dickinson wrote:
> On Sun, May 30, 2010 at 9:28 AM, Masklinn wrote:
>>
>> datetime does have a bunch of issues and limitations which I believe soon
>> become harmful when doing serious date/calendaring work (which I don't
>> claim to do, but I've seen colleagues in serious trouble due to both
>> personal lack of knowledge in the field and issues with datetime itself):
>> it only supports a Gregorian calendar, for instance; it's horrendous
>> in dealing with timezones; some methods are pretty much broken; the
>> constructor refuses "24" as an hour value; it blows up on positive leap
>> seconds ...
>
> Are there tracker issues open for all these problems? If not, please
> would you consider opening some? The datetime module has recently
> been getting a lot more attention than it used to, thanks largely to
> the efforts of Alexander Belopolsky, so I think opening relevant
> tracker issues would be worthwhile.
>
> Some of the issues you mention look like easy fixes (e.g., allowing
> positive leap seconds, allowing '24:00:00' as a valid time).

Whoa, the datetime module was explicitly designed not to support leap seconds. This matches the POSIX standard for timestamps, which, although commonly explained as "seconds since 1/1/1970 UTC", doesn't count leap seconds either (it would make the conversions between timestamps and date/time objects horribly complicated, since leap seconds are not determined by an algorithm). This is all intentional, since leap seconds are designed to be ignorable by most people except a few communities like astronomers, who have their own clocks.

> The API problems for timezones look a little bit more serious.

Isn't the main problem that no timezone implementations are provided by the standard library? There is a reason for that too (although we should at least have UTC in the stdlib).

> What are the use-cases for non-Gregorian calendars, and why do you
> think the datetime module should support them? This seems like a
> specialist need to me.

I believe the main use case is compatibility with Java, which does support other calendars. Not a big motivation for me. :-)

> And which methods are 'pretty much broken'?
>
> If you want to see progress on these issues, please do open some
> bugtracker issues. Or if open issues already exist, it might be worth
> pinging them.

In general I would hesitate about attempts to "fix" "problems" with the datetime module that were carefully considered API properties when the design was first made. The only problems that I currently take seriously are issues with dates before 1900, which IIRC stem from reliance on C stdlib functions for manipulating time structs.

--
--Guido van Rossum (python.org/~guido)

From dickinsm at gmail.com Tue Jun 1 16:41:10 2010
From: dickinsm at gmail.com (Mark Dickinson)
Date: Tue, 1 Jun 2010 15:41:10 +0100
Subject: [Python-ideas] Date/time literals
In-Reply-To: References: Message-ID:

On Tue, Jun 1, 2010 at 3:08 PM, Guido van Rossum wrote:
> On Tue, Jun 1, 2010 at 2:00 AM, Mark Dickinson wrote:
>> Some of the issues you mention look like easy fixes (e.g., allowing
>> positive leap seconds, allowing '24:00:00' as a valid time).
>
> Whoa, the datetime module was explicitly designed not to support leap seconds.
> This matches the POSIX standard for timestamps, which, although commonly
> explained as "seconds since 1/1/1970 UTC", doesn't count leap seconds either
> (it would make the conversions between timestamps and date/time objects
> horribly complicated, since leap seconds are not determined by an algorithm).
> This is all intentional, since leap seconds are designed to be ignorable by
> most people except a few communities like astronomers, who have their own
> clocks.

Yes, I understand these issues: UTC is not POSIX time. By 'support for leap seconds', all I meant (and all I was assuming Masklinn meant) was that it would be helpful for e.g. datetime.datetime(1985, 6, 30, 23, 59, 60) to be accepted, rather than producing a ValueError as it currently does:

>>> datetime.datetime(1985, 6, 30, 23, 59, 60)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: second must be in 0..59

As per the POSIX standard (IIUC), that would be immediately converted to datetime.datetime(1985, 7, 1, 0, 0, 0) internally. So the datetime object itself wouldn't support leap seconds, and would continue to use POSIX time; only the constructor would support leap seconds.

Similar comments apply to accepting a time of 24:00:00 (and converting it internally to 00:00:00 on the following day).

Mark

From jnoller at gmail.com Tue Jun 1 16:46:48 2010
From: jnoller at gmail.com (Jesse Noller)
Date: Tue, 1 Jun 2010 10:46:48 -0400
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: References: Message-ID:

On Tue, Jun 1, 2010 at 4:54 AM, Tarek Ziadé wrote:
> Hello,
>
> That's not a new idea, but I'd like to throw it here again.
>
> Some modules/packages in the stdlib are pretty isolated, which means
> that they could be upgraded with no harm, independently from the rest.
> For example the unittest package, or the email package.
>
> Here's an idea:
>
> 1 - add a version number in each package or module of the stdlib that
> is potentially upgradable
>
> 2 - create standalone releases of these modules/packages at PyPI, in a
> restricted area 'stdlib upgrades' that can be used only by core devs
> to upload new versions. Each release lists the precise Python versions
> it's compatible with.
>
> 3 - once distutils2 is back in the stdlib, provide a command line
> interface to list upgradable packages, and make it possible to
> upgrade them
>
> 4 - an upgraded package lands in a new specific site-packages
> directory and is loaded *before* the one in Lib
>
> Regards
> Tarek

I dislike this more than I thought I would - I would rather have the stdlib broken out from core and have it have more releases than the whole of Python than allowing for piecemeal "blessed" upgrades. Allowing piecemeal upgrades of the stdlib means you have to say something akin to:

"I support Python 2.6, with the upgraded unittest (2.6.1.3), socket (2.6.1.2) and multiprocessing modules"

And so on. Sure, API compatibility should be "fine" - but we all know that there are exceptions to the rule all the time, and that alone is enough to put the nix on allowing arbitrary upgrades of individual modules within the standard lib. For package authors, and users, the simple "I support 2.6" statement is key. For corporations with strict upgrade checks and verifications, the same applies.
jesse From alexander.belopolsky at gmail.com Tue Jun 1 17:07:51 2010 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Tue, 1 Jun 2010 11:07:51 -0400 Subject: [Python-ideas] Date/time literals In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 10:08 AM, Guido van Rossum wrote: .. >> Some of the issues you mention look like easy fixes (e.g., allowing >> positive leap seconds, allowing '24:00:00' as a valid time). > > Whoa, the datetime module was explicitly designed not to support leap > seconds. This matches the POSIX standard for timestamps, which, > although commonly explained as "seconds since 1/1/1970 UTC" doesn't > count leap seconds either (it would make the conversions between > timestamps and date/time objects horribly complicated since leap > seconds are not determined by an algorithm). This is all intentional, > since leap seconds are designed to be ignorable by most people except > a few communities like astronomers, who have their own clocks. > The POSIX standard was heavily influenced by the desire to preserve last century's existing practices. Its notable that even the leap year rule was initially specified incorrectly and only fixed in 2001 version. Here is the post that I find intriguing: http://www.mail-archive.com/leapsecs at rom.usno.navy.mil/msg00109.html An excerpt: """ In addition these "glued to the table" cards, there were a number of unfortunate attitudes: "Don't confuse people with UTC. Everyone uses GMT and knows what it means". "Lets not complicate things by worrying about the fact that the year 2100 is not a leap year." "You mean the year 2000 is, but 2100 is not a leap year?" "Everyone knows there are only 60 seconds in a minute." "I'm lucky if my system's clock is accurate to the minute, so I could care less about sometime as small as a leap second". "It takes hours, sometime days, for my EMail message to reach most people. Why should I worry about something as small as a second?" "What matters to me is just that POSIX systems produce the same ctime(3) string (i.e., Wed Jun 30 21:49:08 1993\n") when given the same time(2) time-stamp." "SI? TAI? UT1? I'm having trouble with using UTC instead of good old GMT!". """ Systems that are aware of leap seconds are not that uncommon. BSD derivatives including Mac OS X have time2posix() and posix2time() functions. NTP distributes leap seconds notifications. Any system that takes time from a GPS source needs to make a leap second translation. I think what Mark meant by "easy fixes" was not leap second aware timestamp to datetime and back translations or datetime arithmetics, but instead just the ability to store 23:59:60 in time/datetime object. This would allow leap second aware applications to use standard objects to store time and implement arithmetics as correction to the standard datetime arithmetics. This is much easier than to reimplement the entire datetime module from scratch. From alexander.belopolsky at gmail.com Tue Jun 1 17:23:55 2010 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Tue, 1 Jun 2010 11:23:55 -0400 Subject: [Python-ideas] Date/time literals In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 10:41 AM, Mark Dickinson wrote: .. > As per the POSIX standard (IIUC), [datetime(1985, 6, 30, 23, 59, 60)] would be > immediately converted > to datetime.datetime(1985, 7, 1, 0, 0, 0) internally. ?So the datetime > object itself wouldn't support leap seconds, and would continue to use > POSIX time; ?only the constructor would support leap seconds. 
> It is my understanding that POSIX mandates that mktime() function normalizes the tm structure and therefore converts (1985, 6, 30, 23, 59, 60, ...) to (1985, 7, 1, 0, 0, 0, ...). It is not quite accurate to say that tm structure is converted "immediately". It is perfectly legal to pass around non-normalized tm structures and have for example utc2gps() function that would produce different values for Y-M-D 23:59:60 and T-M-[D+1] 00:00:00. I would prefer a similar behavior for datetime constructor: >>> datetime(1985, 6, 30, 23, 59, 60).second 60 >>> datetime(1985, 6, 30, 23, 59, 60).timetuple() (1985, 6, 30, 23, 59, 60, ...) but >>> datetime(1985, 6, 30, 23, 59, 60) - datetime(1985, 7, 1, 0, 0, 0) datetime.timedelta(0) From guido at python.org Tue Jun 1 18:07:36 2010 From: guido at python.org (Guido van Rossum) Date: Tue, 1 Jun 2010 09:07:36 -0700 Subject: [Python-ideas] Date/time literals In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 7:41 AM, Mark Dickinson wrote: > On Tue, Jun 1, 2010 at 3:08 PM, Guido van Rossum wrote: >> On Tue, Jun 1, 2010 at 2:00 AM, Mark Dickinson wrote: >>> Some of the issues you mention look like easy fixes (e.g., allowing >>> positive leap seconds, allowing '24:00:00' as a valid time). >> >> Whoa, the datetime module was explicitly designed not to support leap >> seconds. This matches the POSIX standard for timestamps, which, >> although commonly explained as "seconds since 1/1/1970 UTC" doesn't >> count leap seconds either (it would make the conversions between >> timestamps and date/time objects horribly complicated since leap >> seconds are not determined by an algorithm). This is all intentional, >> since leap seconds are designed to be ignorable by most people except >> a few communities like astronomers, who have their own clocks. > > Yes, I understand these issues: UTC is not POSIX time. ?By 'support > for leap seconds', all I meant (and all I was assuming Masklinn meant) > was that it would be helpful for e.g., datetime.datetime(1985, 6, 30, > 23, 59, 60) to be accepted, rather producing a ValueError as it > currently does: > >>>> datetime.datetime(1985, 6, 30, 23, 59, 60) > Traceback (most recent call last): > ?File "", line 1, in > ValueError: second must be in 0..59 > > As per the POSIX standard (IIUC), that would be immediately converted > to datetime.datetime(1985, 7, 1, 0, 0, 0) internally. ?So the datetime > object itself wouldn't support leap seconds, and would continue to use > POSIX time; ?only the constructor would support leap seconds. > > Similar comments apply to accepting a time of 24:00:00 (and converting > it internally to 00:00:00 on the following day). What's the use case for these relaxations in argument range checking? I'd say they are more confusing, since they might lead one to suspect that leap seconds are in fact supported. -- --Guido van Rossum (python.org/~guido) From guido at python.org Tue Jun 1 18:17:53 2010 From: guido at python.org (Guido van Rossum) Date: Tue, 1 Jun 2010 09:17:53 -0700 Subject: [Python-ideas] Date/time literals In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 8:23 AM, Alexander Belopolsky wrote: > On Tue, Jun 1, 2010 at 10:41 AM, Mark Dickinson wrote: > .. >> As per the POSIX standard (IIUC), [datetime(1985, 6, 30, 23, 59, 60)] would be >> immediately converted >> to datetime.datetime(1985, 7, 1, 0, 0, 0) internally. ?So the datetime >> object itself wouldn't support leap seconds, and would continue to use >> POSIX time; ?only the constructor would support leap seconds. 
>> > > It is my understanding that POSIX mandates that mktime() function > normalizes the tm structure and therefore converts (1985, 6, 30, 23, > 59, 60, ...) to (1985, 7, 1, 0, 0, 0, ...). ?It is not quite accurate > to say that tm structure is converted "immediately". ?It is perfectly > legal to pass around non-normalized tm structures and have for example > utc2gps() function that would produce different values for Y-M-D > 23:59:60 and T-M-[D+1] 00:00:00. > > I would prefer a similar behavior for datetime constructor: > >>>> datetime(1985, 6, 30, 23, 59, 60).second > 60 >>>> datetime(1985, 6, 30, 23, 59, 60).timetuple() > (1985, 6, 30, 23, 59, 60, ...) > > but >>>> datetime(1985, 6, 30, 23, 59, 60) - datetime(1985, 7, 1, 0, 0, 0) > datetime.timedelta(0) I expect this will cause a lot of subtle issues. E.g. What should comparison of an unnormalized datetime value to an equivalent normalized datetime value yield? How far will you go? Is datetime.datetime(2010, 6, 1, 36, 0, 0) a way of spelling datetime.datetime(2010, 6, 2, 12, 0 0) ? How do you force normalization? Won't it break apps if the .seconds attribute can be out of range or if normalization calls need to be inserted? The datetime module was written with "commercial" and everyday use in mind. In such use, there is no need to carry leap seconds around. -- --Guido van Rossum (python.org/~guido) From dickinsm at gmail.com Tue Jun 1 18:28:32 2010 From: dickinsm at gmail.com (Mark Dickinson) Date: Tue, 1 Jun 2010 17:28:32 +0100 Subject: [Python-ideas] Date/time literals In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 5:07 PM, Guido van Rossum wrote: > What's the use case for these relaxations in argument range checking? > I'd say they are more confusing, since they might lead one to suspect > that leap seconds are in fact supported. For the first, it would prevent tuples corresponding to valid UTC times (or local times) causing an exception in the datetime constructor. I don't have any specific use-cases, but it's not hard to imagine passing a tuple from some external UTC-supporting source to datetime.datetime. The second relaxation (allowing 24:00:00) comes from ISO 8601, but I don't really know how widespread its use is. I admit I don't find this one particularly convincing; perhaps Masklinn can expand on why it's useful. Mark From guido at python.org Tue Jun 1 18:29:18 2010 From: guido at python.org (Guido van Rossum) Date: Tue, 1 Jun 2010 09:29:18 -0700 Subject: [Python-ideas] Date/time literals In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 8:07 AM, Alexander Belopolsky wrote: > On Tue, Jun 1, 2010 at 10:08 AM, Guido van Rossum wrote: > .. >>> Some of the issues you mention look like easy fixes (e.g., allowing >>> positive leap seconds, allowing '24:00:00' as a valid time). >> >> Whoa, the datetime module was explicitly designed not to support leap >> seconds. This matches the POSIX standard for timestamps, which, >> although commonly explained as "seconds since 1/1/1970 UTC" doesn't >> count leap seconds either (it would make the conversions between >> timestamps and date/time objects horribly complicated since leap >> seconds are not determined by an algorithm). This is all intentional, >> since leap seconds are designed to be ignorable by most people except >> a few communities like astronomers, who have their own clocks. >> > > The POSIX standard was heavily influenced by the desire to preserve > last century's existing practices. I don't expect this century's practices will change much. 
Show me a labor contract with a provision to pay for work during leap seconds and I might change my mind.

> Its notable that even the leap
> year rule was initially specified incorrectly and only fixed in 2001
> version.

I don't see how that's an argument for supporting leap seconds. The change here is really about the expectation of the lifetime of software systems, not unlike what caused Y2K.

> Here is the post that I find intriguing:
> http://www.mail-archive.com/leapsecs at rom.usno.navy.mil/msg00109.html

A rant by someone with a grudge.

> An excerpt:
>
> """
> In addition these "glued to the table" cards, there were a number
> of unfortunate attitudes:
>
>     "Don't confuse people with UTC. Everyone uses GMT and knows
>     what it means".
>
>     "Lets not complicate things by worrying about the fact that
>     the year 2100 is not a leap year."
>
>     "You mean the year 2000 is, but 2100 is not a leap year?"
>
>     "Everyone knows there are only 60 seconds in a minute."
>
>     "I'm lucky if my system's clock is accurate to the minute, so
>     I could care less about sometime as small as a leap second".
>
>     "It takes hours, sometime days, for my EMail message to
>     reach most people. Why should I worry about something as
>     small as a second?"
>
>     "What matters to me is just that POSIX systems produce the
>     same ctime(3) string (i.e., Wed Jun 30 21:49:08 1993\n") when
>     given the same time(2) time-stamp."
>
>     "SI? TAI? UT1? I'm having trouble with using UTC instead
>     of good old GMT!".
> """

He throws ripe and green comments together in a way to make it sound as if not knowing the Gregorian leap year rule is of the same magnitude as not caring about leap seconds or TAI.

> Systems that are aware of leap seconds are not that uncommon. BSD
> derivatives including Mac OS X have time2posix() and posix2time()
> functions. NTP distributes leap seconds notifications. Any system
> that takes time from a GPS source needs to make a leap second
> translation.

That's a solved problem though, is it? The accounting for leap seconds properly belongs in the layers closest to NTP / GPS. All other software running on the typical computer (even Google's servers) uses interfaces that use POSIX timestamps (albeit often with fractions of a second supported) or something logically equivalent.

> I think what Mark meant by "easy fixes" was not leap second aware
> timestamp to datetime and back translations or datetime arithmetics,
> but instead just the ability to store 23:59:60 in time/datetime
> object. This would allow leap second aware applications to use
> standard objects to store time and implement arithmetics as correction
> to the standard datetime arithmetics. This is much easier than to
> reimplement the entire datetime module from scratch.

Tell us about the use case. Note that if you're talking about times in the future (a very useful use case for the datetime module) you *can't* account for leap seconds, since it is not known (far) ahead when they will be.

--
--Guido van Rossum (python.org/~guido)

From guido at python.org Tue Jun 1 18:36:42 2010
From: guido at python.org (Guido van Rossum)
Date: Tue, 1 Jun 2010 09:36:42 -0700
Subject: [Python-ideas] Date/time literals
In-Reply-To: References: Message-ID:

On Tue, Jun 1, 2010 at 9:28 AM, Mark Dickinson wrote:
> On Tue, Jun 1, 2010 at 5:07 PM, Guido van Rossum wrote:
>> What's the use case for these relaxations in argument range checking?
>> I'd say they are more confusing, since they might lead one to suspect >> that leap seconds are in fact supported. > > For the first, it would prevent tuples corresponding to valid UTC > times (or local times) causing an exception in the datetime > constructor. ?I don't have any specific use-cases, but it's not hard > to imagine passing a tuple from some external UTC-supporting source to > datetime.datetime. Imagined use cases are just that. > The second relaxation (allowing 24:00:00) comes from ISO 8601, but I > don't really know how widespread its use is. ?I admit I don't find > this one particularly convincing; ?perhaps Masklinn can expand on why > it's useful. This I can understand, but more for output than for input. It is useful to specify the end time of an event (e.g. a party) ending at midnight as ending at 24:00 on a given date rather than at 00:00 on the next day, since that might confuse humans. -- --Guido van Rossum (python.org/~guido) From janssen at parc.com Tue Jun 1 19:03:44 2010 From: janssen at parc.com (Bill Janssen) Date: Tue, 1 Jun 2010 10:03:44 PDT Subject: [Python-ideas] An identity dict In-Reply-To: <20100530132047.39a5875a@o> References: <20100530132047.39a5875a@o> Message-ID: <50046.1275411824@parc.com> Denis, if you're going to post to python-ideas, would you mind taking that biohazard symbol out of your user name? My Emacs-based mail reader thrashes for quite a while trying to find a glyph for it before it gives up and renders it as a hollow rectangle. I'd normally just add you to my kill-file, but I hate to give up on python-ideas people that fast. I'm sure you're not really a biohazard :-). Bill From alexander.belopolsky at gmail.com Tue Jun 1 19:28:21 2010 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Tue, 1 Jun 2010 13:28:21 -0400 Subject: [Python-ideas] Date/time literals In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 12:36 PM, Guido van Rossum wrote: > On Tue, Jun 1, 2010 at 9:28 AM, Mark Dickinson wrote: .. >> For the first, it would prevent tuples corresponding to valid UTC >> times (or local times) causing an exception in the datetime >> constructor. ?I don't have any specific use-cases, but it's not hard >> to imagine passing a tuple from some external UTC-supporting source to >> datetime.datetime. > > Imagined use cases are just that. Developers writing generic libraries have to deal with imagined use cases all the time. If I write an rfc3339 timestamp parser, I cannot ignore the fact that XXXX-12-31T23:59:60Z is a valid timestamp. If I do, I cannot claim that my parser implements rfc3339. An application that uses python datetime objects to represent time may crash parsing logs produced in December 2008 on the systems that keeps time in UTC. If all my application does is to read timestamps from some source, store them in the database and display them on a later date, I don't want to worry that it will crash when presented with 23:59:60. Of course, allowing leap seconds in time/datetime constructor may be a way to delay detection of a bug. An application may accept XXXX-12-31T23:59:60Z, but later rely on the fact that dt1-dt2 == timedelta(0) implies dt1 == dt2. Such issues, if exist, can be addressed by the application without replacing datetime object as a means of storing timestamps. On the other hand the current restriction in the constructor makes datetime fundamentally incompatible with a number of standards. 
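To make the parsing point above concrete, here is a sketch (an illustration only, not code from any standard library, and restricted to the simple 'Z'-suffixed form without fractions) of an RFC 3339 reader that tolerates a positive leap second, folding 23:59:60 into 00:00:00 of the next day in the POSIX manner just discussed:

from datetime import datetime, timedelta

def parse_rfc3339_utc(stamp):
    """Parse 'YYYY-MM-DDTHH:MM:SSZ', tolerating second=60."""
    date_part, time_part = stamp.rstrip("Z").split("T")
    year, month, day = (int(p) for p in date_part.split("-"))
    hour, minute, second = (int(p) for p in time_part.split(":"))
    leap = 1 if second == 60 else 0  # positive leap second, e.g. 23:59:60
    parsed = datetime(year, month, day, hour, minute, second - leap)
    return parsed + timedelta(seconds=leap)  # 23:59:60 folds forward

print(parse_rfc3339_utc("2008-12-31T23:59:60Z"))  # 2009-01-01 00:00:00

A parser written this way never crashes on a standards-conforming timestamp, which is the point being argued: the fold-forward can live in the application even while the datetime constructor itself rejects second=60.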
PS: I believe systems capable of producing 23:59:60 in timestamps are already more common than those that don't use IEEE standard for floating point values. Nevertheless, CPython contains a lot of code designed to deal with imagined deviations from IEEE 754. From cool-rr at cool-rr.com Tue Jun 1 19:36:10 2010 From: cool-rr at cool-rr.com (cool-RR) Date: Tue, 1 Jun 2010 19:36:10 +0200 Subject: [Python-ideas] Having unbound methods refer to the classes their defined on Message-ID: Hello, I would like to raise an issue here that I've been discussing at python-porting. (And I'd like to preface by saying that I'm not intimately familiar with Python's innards, so if I make any mistakes please correct me.) In Python 2.x there was an "unbound method" type. An unbound method would have an attribute `.im_class` that would refer to the class on which the method was defined. This allowed users to use the `copy_reg` module to pickle unbound methods by name. (In a similar way to how functions and classes are pickled by default.) In Python 3.x unbound methods are plain functions. There is no way of knowing on which class they are defined, so therefore it's impossible to pickle them. It is even impossible to tell `copyreg` to use a custom reducer: http://stackoverflow.com/questions/2932742/python-using-copyreg-to-define-reducers-for-types-that-already-have-reducers (To the people who wonder why would anyone want to pickle unbound methods: I know that it sounds like a weird thing to do. Keep in mind that sometimes your objects need to get pickled. For example if you're using the multiprocessing module, and you pass into it an object that somehow refers to an unbound method, then that method has to be picklable.) The idea is: Let's give unbound methods an attribute that will refer to the class on which they were defined. What do you think? Ram. -------------- next part -------------- An HTML attachment was scrubbed... URL: From python at mrabarnett.plus.com Tue Jun 1 19:44:23 2010 From: python at mrabarnett.plus.com (MRAB) Date: Tue, 01 Jun 2010 18:44:23 +0100 Subject: [Python-ideas] Date/time literals In-Reply-To: References: Message-ID: <4C0546F7.8040404@mrabarnett.plus.com> Alexander Belopolsky wrote: > On Tue, Jun 1, 2010 at 10:41 AM, Mark Dickinson wrote: > .. >> As per the POSIX standard (IIUC), [datetime(1985, 6, 30, 23, 59, 60)] would be >> immediately converted >> to datetime.datetime(1985, 7, 1, 0, 0, 0) internally. So the datetime >> object itself wouldn't support leap seconds, and would continue to use >> POSIX time; only the constructor would support leap seconds. >> > > It is my understanding that POSIX mandates that mktime() function > normalizes the tm structure and therefore converts (1985, 6, 30, 23, > 59, 60, ...) to (1985, 7, 1, 0, 0, 0, ...). It is not quite accurate > to say that tm structure is converted "immediately". It is perfectly > legal to pass around non-normalized tm structures and have for example > utc2gps() function that would produce different values for Y-M-D > 23:59:60 and T-M-[D+1] 00:00:00. > > I would prefer a similar behavior for datetime constructor: > >>>> datetime(1985, 6, 30, 23, 59, 60).second > 60 >>>> datetime(1985, 6, 30, 23, 59, 60).timetuple() > (1985, 6, 30, 23, 59, 60, ...) > > but >>>> datetime(1985, 6, 30, 23, 59, 60) - datetime(1985, 7, 1, 0, 0, 0) > datetime.timedelta(0) > Actually, that's wrong because there was a leap second. 
The clock went:

    1985-06-30 23:59:59
    1985-06-30 23:59:60
    1985-07-01 00:00:00

The following year, however, it went:

    1986-06-30 23:59:59
    1986-07-01 00:00:00

From marcos.bonci at gmail.com Tue Jun 1 19:47:12 2010
From: marcos.bonci at gmail.com (Marcos Bonci)
Date: Tue, 1 Jun 2010 14:47:12 -0300
Subject: [Python-ideas] Date/time literals
In-Reply-To: References: Message-ID:

On 1 June 2010 13:17, Guido van Rossum wrote:
> On Tue, Jun 1, 2010 at 8:23 AM, Alexander Belopolsky wrote:
> > On Tue, Jun 1, 2010 at 10:41 AM, Mark Dickinson wrote:
> > ..
> >> As per the POSIX standard (IIUC), [datetime(1985, 6, 30, 23, 59, 60)] would be
> >> immediately converted
> >> to datetime.datetime(1985, 7, 1, 0, 0, 0) internally. So the datetime
> >> object itself wouldn't support leap seconds, and would continue to use
> >> POSIX time; only the constructor would support leap seconds.
> >>
> >
> > It is my understanding that POSIX mandates that mktime() function
> > normalizes the tm structure and therefore converts (1985, 6, 30, 23,
> > 59, 60, ...) to (1985, 7, 1, 0, 0, 0, ...). It is not quite accurate
> > to say that tm structure is converted "immediately". It is perfectly
> > legal to pass around non-normalized tm structures and have for example
> > utc2gps() function that would produce different values for Y-M-D
> > 23:59:60 and T-M-[D+1] 00:00:00.
> >
> > I would prefer a similar behavior for datetime constructor:
> >
> >>>> datetime(1985, 6, 30, 23, 59, 60).second
> > 60
> >>>> datetime(1985, 6, 30, 23, 59, 60).timetuple()
> > (1985, 6, 30, 23, 59, 60, ...)
> >
> > but
> >>>> datetime(1985, 6, 30, 23, 59, 60) - datetime(1985, 7, 1, 0, 0, 0)
> > datetime.timedelta(0)
>
> I expect this will cause a lot of subtle issues. E.g. What should
> comparison of an unnormalized datetime value to an equivalent
> normalized datetime value yield? How far will you go? Is
> datetime.datetime(2010, 6, 1, 36, 0, 0) a way of spelling
> datetime.datetime(2010, 6, 2, 12, 0, 0)? How do you force
> normalization? Won't it break apps if the .seconds attribute can be
> out of range or if normalization calls need to be inserted?
>
> The datetime module was written with "commercial" and everyday use in
> mind. In such use, there is no need to carry leap seconds around.

If this is really a design choice, then I guess my suggestions about date+time literals and a unique/"official" date+time interpretation as a number really aren't good ideas. (Although I can't see why a precise/scientific approach wouldn't be better than a commercial one, as commercial applications often rely on precise standards.)

But I still don't understand why datetime.datetime.toordinal returns an int that truncates time information. Is this deliberate?

> --
> --Guido van Rossum (python.org/~guido)

--
Marcos

From alexander.belopolsky at gmail.com Tue Jun 1 20:10:17 2010
From: alexander.belopolsky at gmail.com (Alexander Belopolsky)
Date: Tue, 1 Jun 2010 14:10:17 -0400
Subject: [Python-ideas] Date/time literals
In-Reply-To: References: Message-ID:

On Tue, Jun 1, 2010 at 12:17 PM, Guido van Rossum wrote:
..
> I expect this will cause a lot of subtle issues.

I will try to answer those.

> E.g. What should
> comparison of an unnormalized datetime value to an equivalent
> normalized datetime value yield?

I am not proposing supporting arbitrary unnormalized datetime values, only to allow seconds (0 - 60). I am not proposing any notion of "equivalent datetime" objects either.
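The mktime normalization appealed to just below can be checked directly (a small illustration; note that mktime interprets the tuple in local time, so the comparison assumes the local zone has no DST transition between the two instants):

import time

# (1985, 6, 30, 23, 59, 60) and (1985, 7, 1, 0, 0, 0) denote the same
# POSIX instant once mktime has normalized the out-of-range tm_sec.
t1 = time.mktime((1985, 6, 30, 23, 59, 60, 0, 0, -1))
t2 = time.mktime((1985, 7, 1, 0, 0, 0, 0, 0, -1))
assert t1 == t2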
POSIX will require that for t1 = datetime(1985, 6, 30, 23, 59, 60) and t2 =datetime(1985, 7, 1, 0, 0, 0) time.mktime(t1.timetuple()) == time.mktime(t2.timetuple()), but this does not mean that t1 and t2 should compare equal. It is a more subtle issue, what difference t1 - t2 should produce. I think it can be defined as difference in corresponding POSIX times. > How far will you go? Is > datetime.datetime(2010, 6, 1, 36, 0, 0) a way of spelling > datetime.datetime(2010, 6, 2, 12, 0 0) ? I would not go any further than extending seconds to 0-60 range which is common to many modern standards. > How do you force > normalization? Normalization is never forced. A round trip through POSIX timestamp will naturally produce normalized datetime objects. > Won't it break apps if the .seconds attribute can be > out of range or if normalization calls need to be inserted? > Many standards require that seconds range be 0-60. Applications that obtain time from timetuples should already be prepared to handle this range to be POSIX compliant. Note that I do not propose changing internal sources of datetime objects such as datetime.now() to return dt.seconds == 60. Therefore all extended range times will originate outside of the datetime library. Current application should already validate such sources before passing them to datetime library. Of course an application that relies on constructor throwing an exception for validation and then asserts that seconds < 60 will break, but this can be addressed by proper deprecation schedule. Maybe even starting with enabling extended seconds range with a from __future__ import. From debatem1 at gmail.com Tue Jun 1 20:12:00 2010 From: debatem1 at gmail.com (geremy condra) Date: Tue, 1 Jun 2010 11:12:00 -0700 Subject: [Python-ideas] stdlib upgrades In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 1:54 AM, Tarek Ziad? wrote: > Hello, > > That's not a new idea, but I'd like to throw it here again. > > Some modules/packages in the stdlib are pretty isolated, which means > that they could be upgraded with no > harm, independently from the rest. For example the unittest package, > or the email package. What advantage do you see in this relative to, say, breaking off the stdlib or introducing a sumo addon? > Here's an idea: > > 1 - add a version number in each package or module of the stdlib that > is potentially upgradable As in, append it to the module name, or add an interface to modules to query their version? > 2 - create standalone releases of these modules/packages at PyPI, in a > restricted area 'stdlib upgrades' > ? ? that can be used only by core devs to upload new versions. Each > release lists the precise > ? ? Python versions it's compatible with. > > 3 - once distutils2 is back in the stdlib, provide a command line > interface to list upgradable packages, and make > ? ? it possible to upgrade them +1 on this for all packages, not just stdlib > 4 - an upgraded package lands in a new specific site-packages > directory and is loaded *before* the one in Lib > > Regards > Tarek Geremy Condra From tim.peters at gmail.com Tue Jun 1 20:12:10 2010 From: tim.peters at gmail.com (Tim Peters) Date: Tue, 1 Jun 2010 14:12:10 -0400 Subject: [Python-ideas] Date/time literals In-Reply-To: References: Message-ID: [Marcos Bonci] > ... > But I still don't understand why?datetime.datetime.toordinal returns > an int that truncates time information. Is this deliberate? 
That it does exactly what it's documented to do is a clue about that ;-) As the module docs say, the notion of "ordinal" was deliberately defined in this way: This matches the definition of the ?proleptic Gregorian? calendar in Dershowitz and Reingold?s book "Calendrical Calculations", where it?s the base calendar for all computations. See the book for algorithms for converting between proleptic Gregorian ordinals and many other calendar systems. That's the primary use case we had in mind for date & datetime ordinals. Indeed, the meaning of "ordinal" is "an integer indicating position in a sequence". From ianb at colorstudy.com Tue Jun 1 20:13:16 2010 From: ianb at colorstudy.com (Ian Bicking) Date: Tue, 1 Jun 2010 13:13:16 -0500 Subject: [Python-ideas] stdlib upgrades Message-ID: Threading will probably break here as I wasn't on the list for the first email... My concern with the standard library is that there's a couple things going on: 1. The standard library represents "accepted" functionality, kind of best practice, kind of just conventional. Everyone (roughly) knows what you are talking about when you use things from the standard library. 2. The standard library has some firm backward compatibility guarantees. It also has some firm stability guarantees, especially within releases (though in practice, nearly for eternity). 3. The standard library is kind of collectively owned; it's not up to the whims of one person, and can't be abandoned. 4. The standard library is one big chunk of functionality, upgraded all under one version number, and specifically works together (though in practice cross-module refactorings are uncommon). There's positive things about these features, but 4 really drives me nuts, and I think is a strong disincentive to putting stuff into the standard library. For packaging I think 4 actively damages maintainability. Packaging is at the intersection of several systems: * Python versions * Forward and backward compatibility with distributed libraries * System policies (e.g., Debian has changed things around a lot in the last few years) * A whole other ecosystem of libraries outside of Python (e.g., binding to C libraries) * Various developer toolkits, some Python specific (e.g., Cython) some not (gcc) I don't think it's practical to think that we can determine some scope of packaging where it will be stable in the long term, all these things are changing and many are changing without any particular concern for how it affects Python (i.e., packaging must be reactive). And frankly we clearly do not have packaging figured out, we're still circling in on something... and I think the circling will be more like a Strange Attractor than a sink drain. The issues exist for other libraries that aren't packaging-related, of course, it's just worse for packaging. argparse for instance is not "done"... it has bugs that won't be fixed before release, and functionality that it should reasonably include. But there's no path for it to get better. Will it have new and better features in Python 3.3? Who seriously wants to write code that is only compatible with Python 3.3+ just because of some feature in argparse? Instead everyone will work around argparse as it currently exists. In the process they'll probably use undocumented APIs, further calcifying the library and making future improvements disruptive. It's not very specific to argparse, I think ElementTree has similar issues. The json library is fairly unique in that it has a scope that can be "done". 
I don't know what to say about wsgiref... it's completely irrelevant in Python 3 because it was upgraded along the Python schedule despite being unready to be released (this is relatively harmless as I don't think anyone is using wsgiref in Python 3). So, this is the tension I see. I think aspects of the standard library process and its guarantees are useful, but the current process means releasing code that isn't ready or not releasing code that should be released, and neither is good practice and both compromise those guarantees. Lots of moving versions can indeed be difficult to manage... though it can be made a lot easier with good practices. Though even then distutils2 (and pip) does not even fit into that... they both enter into the workflow before you start working with libraries and versions, making them somewhat unique (though also giving them some more flexibility as they are not so strongly tied to the Python runtime, which is where stability requirements are most needed). -- Ian Bicking | http://blog.ianbicking.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From janssen at parc.com Tue Jun 1 20:32:34 2010 From: janssen at parc.com (Bill Janssen) Date: Tue, 1 Jun 2010 11:32:34 PDT Subject: [Python-ideas] lack of time zone support In-Reply-To: References: Message-ID: <51245.1275417154@parc.com> To me, the single most irritating problem with the Python support for date/time is the lack of support for time-zone understanding. This breaks down into two major issues, %z and lack of a standard time-zone table. First, let's say I have to parse a Medusa log file, which contains time stamps in the form "DD/Mon/YYYY:HH:MM:SS [+|-]HHMM", e.g. "31/May/2010:07:10:04 -0800". What I'd like to write is tm = time.mktime(time.strptime(timestamp, "%d/%b/%Y:%H:%M:%S %z")) which is what I'd do if I was writing in C. But no! The Python _strptime module doesn't support "%z". So instead, I have to pick the timestamp apart and do things separately and remember that "-0800" isn't octal, and also isn't the same as -800, and remember whether to add or subtract it. This seems insane. So, IMO, support for %z should be added to Lib/_strptime.py. We need a patch. Secondly, we really need concrete subclasses of tzinfo, and some sort of mapping. Lots of people have spent lots of time trying to figure out this cryptic hint in datetime: "The datetime module does not supply any concrete subclasses of tzinfo." I'm not sure whether pytz is the best ideas, or what I use, the "zoneinfo" module from python-dateutil. With that, I still have to add the Windows timezone names, using the table at http://unicode.org/repos/cldr/trunk/common/supplemental/windowsZones.xml, because the code in python-dateutil only works with Windows timezone names when running on Windows. Bill From alexander.belopolsky at gmail.com Tue Jun 1 20:36:06 2010 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Tue, 1 Jun 2010 14:36:06 -0400 Subject: [Python-ideas] Date/time literals In-Reply-To: <4C0546F7.8040404@mrabarnett.plus.com> References: <4C0546F7.8040404@mrabarnett.plus.com> Message-ID: On Tue, Jun 1, 2010 at 1:44 PM, MRAB wrote: .. >> but >>>>> >>>>> datetime(1985, 6, 30, 23, 59, 60) - datetime(1985, 7, 1, 0, 0, 0) >> >> datetime.timedelta(0) >> > Actually, that's wrong because there was a leap second. The clock went: > > ? ?1985-06-30 23:59:59 > ? ?1985-06-30 23:59:60 > ? ?1985-07-01 00:00:00 > > The following year, however, it went: > > ? ?1986-06-30 23:59:59 > ? 
?1986-07-01 00:00:00 It is only wrong if you expect datetime difference to reflect the actual duration between the corresponding UTC events. The datetime library does not do it even for dates. For example, on my system >>> date(1752, 9, 14) - date(1752, 9, 2) datetime.timedelta(12) even though calendar application on the same machine shows that September 14 was the day after September 2 in 1752. $ cal 9 1752 September 1752 Su Mo Tu We Th Fr Sa 1 2 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 This was a deliberate design choice to implement proleptic calendar rather than a historically more accurate variant. Similarly I see nothing wrong with datetime difference not capturing leap seconds. An application interested in leap seconds effects, however should still be able to use the basic datetime object and define its own duration functions. From alexander.belopolsky at gmail.com Tue Jun 1 20:41:40 2010 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Tue, 1 Jun 2010 14:41:40 -0400 Subject: [Python-ideas] lack of time zone support In-Reply-To: <51245.1275417154@parc.com> References: <51245.1275417154@parc.com> Message-ID: On Tue, Jun 1, 2010 at 2:32 PM, Bill Janssen wrote: > To me, the single most irritating problem with the Python support for > date/time is the lack of support for time-zone understanding. There are two related issues on the tracker: http://bugs.python.org/issue5094 "datetime lacks concrete tzinfo impl. for UTC" http://bugs.python.org/issue6641 "strptime doesn't support %z format ?" From benjamin at python.org Tue Jun 1 23:17:25 2010 From: benjamin at python.org (Benjamin Peterson) Date: Tue, 1 Jun 2010 21:17:25 +0000 (UTC) Subject: [Python-ideas] An identity dict References: <7AC8DB63-DAD6-46EA-89B1-AA339E4D7B43@gmail.com> Message-ID: Raymond Hettinger writes: > Benjamin, could you elaborate of several points that are unclear: > > * If id() is expensive in PyPy, then how are they helped?by the code in? > http://codespeak.net/svn/pypy/trunk/pypy/lib/identity_dict.py > which uses id() for the gets and sets and contains? At the top of that file, it imports from the special module __pypy__ which contains an optimized version of the dict. > > * In the examples you posted (such as?http://codespeak.net/svn/pypy/trunk/pypy/tool/algo/graphlib.py?), > it appears that PyPy already has an identity dict, ?so how are they helped by adding one to the collections module? My purpose with those examples was to prove it as a generally useful utility. > > * Most of the posted examples already work with regular dicts (which check identity before they check equality) -- don't the other implementations already implement regular dicts which need to have identity-implied-equality in order to pass the test suite? ?I would expect the following snippet to work under all versions and implementations of Python: > > > ?? ?>>> class A:? > ?? ?... ? ? ? ? pass > ?? ?>>> a = A() > ?? ?>>> d = {a: 10} > ?? ?>>> assert d[a] == 10 ? # uses a's identity for lookup Yes, but that would be different if you have two "a"s with __eq__ defined to be equal and you want to hash them separately. > > * Is the proposal something needed for all implementations or is it just an optimization for a particular, non-CPython implementation? My contention is that an identity dictionary or at least a dictionary with custom hash and keys is a useful primitive that should be in the standard library. 
From ziade.tarek at gmail.com Tue Jun 1 23:40:57 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Tue, 1 Jun 2010 23:40:57 +0200
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: 
References: 
Message-ID: 

On Tue, Jun 1, 2010 at 4:46 PM, Jesse Noller wrote:
[..]
> I dislike this more than I thought I would - I would rather have the
> stdlib broken out from core and have it have more releases than the
> whole of python than allowing for piecemeal "blessed" upgrades.
> Allowing piecemeal upgrades of the stdlib means you have to say
> something akin to:
>
> "I support Python 2.6, with the upgraded unittest (2.6.1.3), socket
> (2.6.1.2) and multiprocessing modules"
>
> And so on. Sure, API compatibility should be "fine" - but we all know
> that there are exceptions to the rule all the time, and that alone is
> enough to put the nix on allowing arbitrary upgrades of individual
> modules within the standard lib. For package authors, and users, the
> simple "I support 2.6" statement is key. For corporations with strict
> upgrade checks and verifications, the same applies.

What I expect would be for some projects to state:

  "I support Python 2.6, with the upgraded unittest (2.6.1.3), or Python 3.2"

Instead of:

  "I support Python 2.6, with unittest2, or Python 3.2 with its own unittest"

Because the latter makes for more work in the project itself (and no
difference on the corporation/end-user side), since it has to deal with
two different unittest versions -- well, the same code, but under a
different namespace, used so that it can be installed on previous Python
versions alongside the stdlib one.

At some point, if a package or module in the stdlib evolves in a
backward compatible way, it would be nice to be able to upgrade an
existing Python installation.  And this is going to be more and more
true with the moratorium, I guess: what people are creating now for
Python should work in a wider range of Pythons.

Now, releasing the stdlib on its own and shortening its cycle would also
resolve the problem we have.  But then, while there will be fewer
combinations, the problems you have mentioned will remain the same.
Just replace in your example "I support Python 2.6, with the upgraded
unittest (2.6.1.3), socket (2.6.1.2) and multiprocessing modules" with
"I support Python 2.6, with the upgraded stdlib 2.6.1.2".

Regards,
Tarek

-- 
Tarek Ziadé | http://ziade.org
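As a sketch of what such a support statement could mean mechanically --
assuming the per-module __version__ Tarek proposes in his next message
existed (the helper and the version format here are illustrative only):

    import unittest

    def require(module, minimum):
        # assumes upgradable stdlib modules grow a __version__ string
        have = getattr(module, '__version__', '0')
        if (tuple(int(p) for p in have.split('.')) <
                tuple(int(p) for p in minimum.split('.'))):
            raise ImportError('%s %s or later required, found %s'
                              % (module.__name__, minimum, have))

    require(unittest, '2.6.1.3')   # "upgraded unittest" from the example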
From ziade.tarek at gmail.com Tue Jun 1 23:45:49 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Tue, 1 Jun 2010 23:45:49 +0200
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: 
References: 
Message-ID: 

On Tue, Jun 1, 2010 at 8:12 PM, geremy condra wrote:
> On Tue, Jun 1, 2010 at 1:54 AM, Tarek Ziadé wrote:
>> Hello,
>>
>> That's not a new idea, but I'd like to throw it here again.
>>
>> Some modules/packages in the stdlib are pretty isolated, which means
>> that they could be upgraded with no harm, independently from the rest.
>> For example the unittest package, or the email package.
>
> What advantage do you see in this relative to, say, breaking off the
> stdlib or introducing a sumo addon?

Making it easier for package or module maintainers to take care of
doing those smaller releases.

>
>> Here's an idea:
>>
>> 1 - add a version number in each package or module of the stdlib that
>> is potentially upgradable
>
> As in, append it to the module name, or add an interface to modules
> to query their version?

Probably by adding a __version__ in the package's __init__.py or in the
module itself.

Regards,
Tarek

-- 
Tarek Ziadé | http://ziade.org

From ziade.tarek at gmail.com Tue Jun 1 23:53:20 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Tue, 1 Jun 2010 23:53:20 +0200
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: 
References: 
Message-ID: 

On Tue, Jun 1, 2010 at 8:13 PM, Ian Bicking wrote:
[..]
> 4. The standard library is one big chunk of functionality, upgraded all
> under one version number, and specifically works together (though in
> practice cross-module refactorings are uncommon).
>
> There's positive things about these features, but 4 really drives me nuts,
> and I think is a strong disincentive to putting stuff into the standard
> library.  For packaging I think 4 actively damages maintainability.
>
> Packaging is at the intersection of several systems:
>
> * Python versions
> * Forward and backward compatibility with distributed libraries
> * System policies (e.g., Debian has changed things around a lot in the
> last few years)
> * A whole other ecosystem of libraries outside of Python (e.g., binding
> to C libraries)
> * Various developer toolkits, some Python specific (e.g., Cython) some
> not (gcc)
>
> I don't think it's practical to think that we can determine some scope of
> packaging where it will be stable in the long term, all these things are
> changing and many are changing without any particular concern for how it
> affects Python (i.e., packaging must be reactive).  And frankly we clearly
> do not have packaging figured out, we're still circling in on something...
> and I think the circling will be more like a Strange Attractor than a sink
> drain.

Are you suggesting to have a third layer?

* Python
* stdlib
* stdlib-extras (distutils2, pip, etc)

Is that what some people have called a "sumo" release of Python?

Tarek

-- 
Tarek Ziadé | http://ziade.org
From qrczak at knm.org.pl Wed Jun 2 00:05:31 2010
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Wed, 2 Jun 2010 00:05:31 +0200
Subject: [Python-ideas] An identity dict
In-Reply-To: 
References: <7AC8DB63-DAD6-46EA-89B1-AA339E4D7B43@gmail.com>
Message-ID: 

2010/6/1 Benjamin Peterson :
> My contention is that an identity dictionary, or at least a dictionary
> with custom hash and keys, is a useful primitive that should be in the
> standard library.  However, I also see its advantage in avoiding the bad
> performance of id()-based identity dicts in non-CPython implementations.
>
> It is useful to let the implementation optimize it any time there is a
> moving GC, as in Jython and IronPython, where id is also expensive.
> (Basically a mapping has to be maintained for all objects on which id
> is called.)

Here is how I designed this for my language:

You can request the ObjectId of the given object. If an ObjectId
corresponding to the given object is still alive, you always get it
back again, but it can be GC'ed and later created afresh. ObjectIds are
hashable and comparable (with an arbitrary ordering). Hash values and
the ordering are preserved when ObjectIds are kept alive, but they may
be different if ObjectIds are created afresh. An ObjectId contains an
integer index which is unique among ObjectIds alive at the same time.

You can make a dictionary with a specified key function. It is
internally backed by something equivalent to an f(k) -> (k, v) dict. A
dictionary with the ObjectId constructor as the key is an identity
dictionary; it works because it keeps both k and f(k) alive.

An advantage of this scheme is that with a moving GC the id mapping
must be maintained only for objects for which the program keeps their
ObjectIds alive. A disadvantage is that the program must be careful to
not use ObjectIds in a manner which does not keep them alive yet
expects consistent hashing and ordering. In particular a key-mapped
dict which would store only (k, v) pairs and compute f(k) on the fly
would not work. Also ObjectIds cannot be used to generate printable
unique identifiers which would be valid without having to keep the
ObjectIds alive, like in Python's default repr.

-- 
Marcin Kowalczyk
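In Python terms, the key-function dictionary Marcin describes might look
like the following sketch (the class name is mine; storing k next to v
is what keeps both k and f(k) alive, exactly as he says):

    class KeyedDict(object):
        """Dict with a key function f, backed by f(k) -> (k, v)."""
        def __init__(self, keyfunc):
            self._key = keyfunc
            self._store = {}
        def __setitem__(self, k, v):
            # the stored k pins the object, so f(k) stays consistent
            self._store[self._key(k)] = (k, v)
        def __getitem__(self, k):
            return self._store[self._key(k)][1]
        def __delitem__(self, k):
            del self._store[self._key(k)]
        def __contains__(self, k):
            return self._key(k) in self._store
        def __len__(self):
            return len(self._store)

    d = KeyedDict(id)   # with id as the key function: an identity dict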
From debatem1 at gmail.com Wed Jun 2 00:06:15 2010
From: debatem1 at gmail.com (geremy condra)
Date: Tue, 1 Jun 2010 18:06:15 -0400
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: 
References: 
Message-ID: 

On Tue, Jun 1, 2010 at 5:53 PM, Tarek Ziadé wrote:
> On Tue, Jun 1, 2010 at 8:13 PM, Ian Bicking wrote:
> [..]
>> 4. The standard library is one big chunk of functionality, upgraded all
>> under one version number, and specifically works together (though in
>> practice cross-module refactorings are uncommon).
>>
>> There's positive things about these features, but 4 really drives me nuts,
>> and I think is a strong disincentive to putting stuff into the standard
>> library.  For packaging I think 4 actively damages maintainability.
>>
>> Packaging is at the intersection of several systems:
>>
>> * Python versions
>> * Forward and backward compatibility with distributed libraries
>> * System policies (e.g., Debian has changed things around a lot in the
>> last few years)
>> * A whole other ecosystem of libraries outside of Python (e.g., binding
>> to C libraries)
>> * Various developer toolkits, some Python specific (e.g., Cython) some
>> not (gcc)
>>
>> I don't think it's practical to think that we can determine some scope of
>> packaging where it will be stable in the long term, all these things are
>> changing and many are changing without any particular concern for how it
>> affects Python (i.e., packaging must be reactive).  And frankly we clearly
>> do not have packaging figured out, we're still circling in on something...
>> and I think the circling will be more like a Strange Attractor than a sink
>> drain.
>
> Are you suggesting to have a third layer?
>
> * Python
> * stdlib
> * stdlib-extras (distutils2, pip, etc)
>
> Is that what some people have called a "sumo" release of Python?
>
> Tarek

That's what I've been advocating.

Geremy Condra

From guido at python.org Wed Jun 2 00:40:04 2010
From: guido at python.org (Guido van Rossum)
Date: Tue, 1 Jun 2010 15:40:04 -0700
Subject: [Python-ideas] Date/time literals
In-Reply-To: 
References: 
Message-ID: 

On Tue, Jun 1, 2010 at 11:10 AM, Alexander Belopolsky wrote:
> On Tue, Jun 1, 2010 at 12:17 PM, Guido van Rossum wrote:
> ..
>> I expect this will cause a lot of subtle issues.
>
> I will try to answer to those.
>
>> E.g. What should
>> comparison of an unnormalized datetime value to an equivalent
>> normalized datetime value yield?
>
> I am not proposing supporting arbitrary unnormalized datetime values,
> only to allow seconds (0 - 60).  I am not proposing any notion of
> "equivalent datetime" objects either.  POSIX will require that for
> t1 = datetime(1985, 6, 30, 23, 59, 60) and t2 = datetime(1985, 7, 1,
> 0, 0, 0), time.mktime(t1.timetuple()) == time.mktime(t2.timetuple()),
> but this does not mean that t1 and t2 should compare equal.
>
> It is a more subtle issue, what difference t1 - t2 should produce.  I
> think it can be defined as the difference in corresponding POSIX times.

But consistency within the API is out the window. Currently datetimes
are linearly ordered like numbers, and if a difference is zero the two
values are the same. I think it would be safer for your use case to
either store the tuple or the string representation, if you really
need to represent a leap second.

Also note that there will be no validation possible for future
datetimes (and for past dates it would require an up-to-date leap
second database).

>> How far will you go? Is
>> datetime.datetime(2010, 6, 1, 36, 0, 0) a way of spelling
>> datetime.datetime(2010, 6, 2, 12, 0, 0) ?
>
> I would not go any further than extending seconds to the 0-60 range,
> which is common to many modern standards.

That's good.

>> How do you force
>> normalization?
>
> Normalization is never forced.  A round trip through a POSIX timestamp
> will naturally produce normalized datetime objects.

Well, code that for whatever reason wants normalized timestamps only
will have to know about this method to force normalization, so it
would be a backwards incompatibility (since currently one can assume
that *all* datetime objects are normalized).

>> Won't it break apps if the .seconds attribute can be
>> out of range or if normalization calls need to be inserted?
>
> Many standards require that the seconds range be 0-60.  Applications
> that obtain time from timetuples should already be prepared to handle
> this range to be POSIX compliant.  Note that I do not propose changing
> internal sources of datetime objects such as datetime.now() to return
> dt.seconds == 60.  Therefore all extended-range times will originate
> outside of the datetime library.  Current applications should already
> validate such sources before passing them to the datetime library.  Of
> course an application that relies on the constructor throwing an
> exception for validation, and then asserts that seconds < 60, will
> break, but this can be addressed by a proper deprecation schedule.
> Maybe even starting with enabling the extended seconds range with a
> from __future__ import.

I see nothing but trouble down this road. Also:
http://en.wikipedia.org/wiki/Leap_second#Proposal_to_abolish_leap_seconds

[and later]

On Tue, Jun 1, 2010 at 11:36 AM, Alexander Belopolsky wrote:
> On Tue, Jun 1, 2010 at 1:44 PM, MRAB wrote:
> ..
>>> but
>>>>>>
>>>>>> datetime(1985, 6, 30, 23, 59, 60) - datetime(1985, 7, 1, 0, 0, 0)
>>>
>>> datetime.timedelta(0)
>>>
>> Actually, that's wrong because there was a leap second. The clock went:
>>
>>     1985-06-30 23:59:59
>>     1985-06-30 23:59:60
>>     1985-07-01 00:00:00
>>
>> The following year, however, it went:
>>
>>     1986-06-30 23:59:59
>>     1986-07-01 00:00:00
>
> It is only wrong if you expect datetime difference to reflect the
> actual duration between the corresponding UTC events.

What on earth do you mean by *actual duration*? Most datetimes are
derived from clocks that aren't accurate to a second even.

> The datetime
> library does not do it even for dates.
>
> For example, on my system
>
>>>> date(1752, 9, 14) - date(1752, 9, 2)
> datetime.timedelta(12)
>
> even though the calendar application on the same machine shows that
> September 14 was the day after September 2 in 1752.

And here you are mixing topics completely -- calendar reform is a
completely different topic from leap seconds.

> $ cal 9 1752
>    September 1752
> Su Mo Tu We Th Fr Sa
>        1  2 14 15 16
> 17 18 19 20 21 22 23
> 24 25 26 27 28 29 30
>
> This was a deliberate design choice: to implement a proleptic calendar
> rather than a historically more accurate variant.  Similarly, I see
> nothing wrong with datetime differences not capturing leap seconds.  An
> application interested in leap second effects, however, should still be
> able to use the basic datetime object and define its own duration
> functions.

You haven't proven this need at all, and your reference to calendar
reform (which by the way didn't happen in the same year or even century
everywhere) makes it weaker still. I've put my foot down against leap
seconds once before (when datetime was introduced) and I will do it
again.

-- 
--Guido van Rossum (python.org/~guido)

From jnoller at gmail.com Wed Jun 2 02:06:21 2010
From: jnoller at gmail.com (Jesse Noller)
Date: Tue, 1 Jun 2010 20:06:21 -0400
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: 
References: 
Message-ID: 

On Tue, Jun 1, 2010 at 5:40 PM, Tarek Ziadé wrote:
> On Tue, Jun 1, 2010 at 4:46 PM, Jesse Noller wrote:
> [..]
>> I dislike this more than I thought I would - I would rather have the
>> stdlib broken out from core and have it have more releases than the
>> whole of python than allowing for piecemeal "blessed" upgrades.
>> Allowing piecemeal upgrades of the stdlib means you have to say
>> something akin to:
>>
>> "I support Python 2.6, with the upgraded unittest (2.6.1.3), socket
>> (2.6.1.2) and multiprocessing modules"
>>
>> And so on. Sure, API compatibility should be "fine" - but we all know
>> that there are exceptions to the rule all the time, and that alone is
>> enough to put the nix on allowing arbitrary upgrades of individual
>> modules within the standard lib. For package authors, and users, the
>> simple "I support 2.6" statement is key. For corporations with strict
>> upgrade checks and verifications, the same applies.
>
> What I expect would be for some projects to state:
>
>   "I support Python 2.6, with the upgraded unittest (2.6.1.3), or Python 3.2"
>
> Instead of:
>
>   "I support Python 2.6, with unittest2, or Python 3.2 with its own unittest"
>
> Because the latter makes for more work in the project itself (and no
> difference on the corporation/end-user side), since it has to deal with
> two different unittest versions -- well, the same code, but under a
> different namespace, used so that it can be installed on previous Python
> versions alongside the stdlib one.

Either fight is a losing one. In the first, you're requiring that
someone *fundamentally alter* their standard library install to monkey
patch in something with the same name, which means it may or may not
break something else - which makes it a non-zero risk, and therefore,
unacceptable to a lot of people.

The second requires that the user install an external package, which,
until we include something as a standard, is a fool's errand, only to
be taken on by the bravest of people (that might be hyperbole ;))

In all seriousness - the second you ask people to alter one tiny sliver
of the stdlib for the sake of your unique-snowflake project or app,
you've lost.
The stdlib is a critical piece of Python - and its relative integrity
is assumed when people download it from the python.org website. Asking
them to download it from the site, and then possibly install piecemeal
upgrades, seems like a bad idea. Imagine a future where project
dependencies look like this:

Python 2.7.1
Python 2.7.1 with upgraded unittest
Python 2.7.1 with upgraded unittest, socket, multiprocessing
Python 2.7.1 with upgraded unittest, socket, multiprocessing, httplib

And so on - sure, eventually (say, 6 months later) there might be a
2.7.2 with all of those changes rolled in, but that raises the
question: why release them individually at all, when you know there's
another release coming shortly, rather than wait and avoid the
confusion?

> At some point, if a package or module in the stdlib evolves in a
> backward compatible way, it would be nice to be able to upgrade an
> existing Python installation.

Yes, but wouldn't it also be nice to simply have a built-in package
installation script, and a shorter (say, 6 month) release cycle for
patch releases which maintain the backwards compatibility guarantee?
This way, bug fixes can move more quickly. We're really discussing the
window from one release to the next - which could easily be shortened
absent language changes (stdlib bugfixes only).

> And this is going to be more and more true with the moratorium, I
> guess: what people are creating now for Python should work in a wider
> range of Pythons.
>
> Now, releasing the stdlib on its own and shortening its cycle would
> also resolve the problem we have.  But then, while there will be fewer
> combinations, the problems you have mentioned will remain the same.
> Just replace in your example "I support Python 2.6, with the upgraded
> unittest (2.6.1.3), socket (2.6.1.2) and multiprocessing modules" with
> "I support Python 2.6, with the upgraded stdlib 2.6.1.2".

Then don't fragment it - just release everything more rapidly. And then
suffer from the fact that OS vendors don't pick up your releases
quickly, and so on and so forth.

All I'm trying to say is that allowing piecemeal upgrades of stdlib
modules is a risky prospect - I know plenty of people (myself included)
who intentionally write code which is ONLY dependent on the standard
library, to save ourselves from the packaging/installation/etc
heartache. This isn't just because of the pain of installing, or
dependency management issues - it's because the stdlib is known, stable
and the one requirement we can rely on, other than the language itself.
The stdlib doesn't require anyone to install from github, or bitbucket,
or figure out distribute vs. distutils2 - it's just there, despite all
its warts and dusty corners.

jesse

From brett at python.org Wed Jun 2 02:50:57 2010
From: brett at python.org (Brett Cannon)
Date: Tue, 1 Jun 2010 17:50:57 -0700
Subject: [Python-ideas] lack of time zone support
In-Reply-To: <51245.1275417154@parc.com>
References: <51245.1275417154@parc.com>
Message-ID: 

On Tue, Jun 1, 2010 at 11:32, Bill Janssen wrote:
> To me, the single most irritating problem with the Python support for
> date/time is the lack of support for time-zone understanding.  This
> breaks down into two major issues: %z, and the lack of a standard
> time-zone table.
>
> First, let's say I have to parse a Medusa log file, which contains time
> stamps in the form "DD/Mon/YYYY:HH:MM:SS [+|-]HHMM", e.g.
> "31/May/2010:07:10:04 -0800".
> What I'd like to write is
>
>   tm = time.mktime(time.strptime(timestamp, "%d/%b/%Y:%H:%M:%S %z"))
>
> which is what I'd do if I were writing in C.  But no!  The Python
> _strptime module doesn't support "%z".  So instead, I have to pick the
> timestamp apart and do things separately, and remember that "-0800"
> isn't octal, and also isn't the same as -800, and remember whether to
> add or subtract it.  This seems insane.  So, IMO, support for %z should
> be added to Lib/_strptime.py.  We need a patch.
>
> Secondly, we really need concrete subclasses of tzinfo, and some sort
> of mapping.  Lots of people have spent lots of time trying to figure
> out this cryptic hint in datetime: "The datetime module does not supply
> any concrete subclasses of tzinfo."  I'm not sure whether pytz is the
> best idea, or what I use, the "zoneinfo" module from python-dateutil.
> With that, I still have to add the Windows timezone names, using the
> table at
> http://unicode.org/repos/cldr/trunk/common/supplemental/windowsZones.xml,
> because the code in python-dateutil only works with Windows timezone
> names when running on Windows.

First of all, there will never be a timezone table in the stdlib,
period. This has been brought up before and is always shot down because
python-dev does not want to have to keep track of timezone changes.
pytz and other modules fit that bill fine.

Now if you want UTC, that's different. Alexander already linked to an
issue that is discussing that end. The current proposal is to provide a
generic class that creates fixed UTC-offset timezones, with an instance
for UTC set on the datetime module.

If you get that class in, you could then patch _strptime to support the
%z directive so as to return a timezone that had a set UTC-offset. Not
optimal, but it's something. Otherwise you would need to patch
_strptime to simply consume the number, which I don't think anyone
wants.

From janssen at parc.com Wed Jun 2 03:18:28 2010
From: janssen at parc.com (Bill Janssen)
Date: Tue, 1 Jun 2010 18:18:28 PDT
Subject: [Python-ideas] lack of time zone support
In-Reply-To: 
References: <51245.1275417154@parc.com>
Message-ID: <56465.1275441508@parc.com>

Brett Cannon wrote:

> First of all, there will never be a timezone table in the stdlib,
> period. This has been brought up before and is always shot down
> because python-dev does not want to have to keep track of timezone
> changes. pytz and other modules fit that bill fine.

Sure, sure.  Though I'm not sure that it has to be "in" the standard
library to be part of the standard library.  Past time for CPython to
start thinking about on-demand data, pulled dynamically from "the
cloud", with a static version for backup.  Just a thought...

> Now if you want UTC, that's different. Alexander already linked to an
> issue that is discussing that end. The current proposal is to provide
> a generic class that creates fixed UTC-offset timezones, with an
> instance for UTC set on the datetime module.

Yes, I've been following that.  Very promising.

> If you get that class in, you could then patch _strptime to support
> the %z directive so as to return a timezone that had a set UTC-offset.
> Not optimal, but it's something.

Yes, exactly.

> Otherwise you would need to patch _strptime to simply consume the
> number, which I don't think anyone wants.

No.

Bill
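The "generic class" under discussion is straightforward to picture. A
rough sketch (names and details are mine; the actual patch on issue
5094 may differ):

    from datetime import tzinfo, timedelta

    class FixedOffset(tzinfo):
        """A timezone at a fixed offset from UTC, with no DST."""
        def __init__(self, minutes, name=None):
            self._offset = timedelta(minutes=minutes)
            if name is None:
                sign = '-' if minutes < 0 else '+'
                hours, mins = divmod(abs(minutes), 60)
                name = 'UTC%s%02d:%02d' % (sign, hours, mins)
            self._name = name
        def utcoffset(self, dt):
            return self._offset
        def tzname(self, dt):
            return self._name
        def dst(self, dt):
            return timedelta(0)

    utc = FixedOffset(0, 'UTC')   # the proposed instance on datetime

A %z-aware strptime could then hand back datetimes carrying such
fixed-offset instances, which is the "not optimal, but something"
option described above.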
From brett at python.org Wed Jun 2 03:22:38 2010
From: brett at python.org (Brett Cannon)
Date: Tue, 1 Jun 2010 18:22:38 -0700
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: 
References: 
Message-ID: 

On Tue, Jun 1, 2010 at 11:13, Ian Bicking wrote:
> Threading will probably break here as I wasn't on the list for the
> first email...
>
> My concern with the standard library is that there's a couple things
> going on:
>
> 1. The standard library represents "accepted" functionality, kind of
> best practice, kind of just conventional.  Everyone (roughly) knows what
> you are talking about when you use things from the standard library.
> 2. The standard library has some firm backward compatibility
> guarantees.  It also has some firm stability guarantees, especially
> within releases (though in practice, nearly for eternity).
> 3. The standard library is kind of collectively owned; it's not up to
> the whims of one person, and can't be abandoned.
> 4. The standard library is one big chunk of functionality, upgraded all
> under one version number, and specifically works together (though in
> practice cross-module refactorings are uncommon).
>
> There's positive things about these features, but 4 really drives me
> nuts, and I think is a strong disincentive to putting stuff into the
> standard library.  For packaging I think 4 actively damages
> maintainability.
>
> Packaging is at the intersection of several systems:
>
> * Python versions
> * Forward and backward compatibility with distributed libraries
> * System policies (e.g., Debian has changed things around a lot in the
> last few years)
> * A whole other ecosystem of libraries outside of Python (e.g., binding
> to C libraries)
> * Various developer toolkits, some Python specific (e.g., Cython) some
> not (gcc)
>
> I don't think it's practical to think that we can determine some scope
> of packaging where it will be stable in the long term, all these things
> are changing and many are changing without any particular concern for
> how it affects Python (i.e., packaging must be reactive).  And frankly
> we clearly do not have packaging figured out, we're still circling in on
> something... and I think the circling will be more like a Strange
> Attractor than a sink drain.
>
> The issues exist for other libraries that aren't packaging-related, of
> course, it's just worse for packaging.  argparse for instance is not
> "done"... it has bugs that won't be fixed before release, and
> functionality that it should reasonably include.  But there's no path
> for it to get better.  Will it have new and better features in Python
> 3.3?  Who seriously wants to write code that is only compatible with
> Python 3.3+ just because of some feature in argparse?  Instead everyone
> will work around argparse as it currently exists.  In the process
> they'll probably use undocumented APIs, further calcifying the library
> and making future improvements disruptive.
>
> It's not very specific to argparse, I think ElementTree has similar
> issues.  The json library is fairly unique in that it has a scope that
> can be "done".  I don't know what to say about wsgiref... it's
> completely irrelevant in Python 3 because it was upgraded along the
> Python schedule despite being unready to be released (this is relatively
> harmless as I don't think anyone is using wsgiref in Python 3).
>
> So, this is the tension I see.
> I think aspects of the standard library process and its guarantees are
> useful, but the current process means releasing code that isn't ready
> or not releasing code that should be released; neither is good
> practice, and both compromise those guarantees.  Lots of moving
> versions can indeed be difficult to manage... though it can be made a
> lot easier with good practices.  Though even then distutils2 (and pip)
> do not even fit into that... they both enter into the workflow before
> you start working with libraries and versions, making them somewhat
> unique (though also giving them some more flexibility, as they are not
> so strongly tied to the Python runtime, which is where stability
> requirements are most needed).

I can only see two scenarios that might be considered acceptable to
address these issues.

One is that when new modules are accepted into the stdlib they are
flagged with an ExpermintalWarning so that people know that no
backwards-compatibility promises have been made yet. That gets the
module more exposure and gets python-dev real-world feedback to fix
issues before the module calcifies into a strong
backwards-compatibility promise. With that experience, more informed
decisions can be made as to how to change things (e.g. the logging
module's default timestamp including microseconds, which strptime
cannot parse).

Otherwise we shift to an annual release schedule, but alternate Python
versions have a language moratorium. That would mean only new language
features every two years, but a new stdlib annually.

But one thing I can tell you is that having separate module releases
of what is in the stdlib under the same name, or doing a separate
stdlib release, will not happen. Python-dev as a whole does not like
this idea and I don't see that changing.
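Mechanically, scenario one could be as simple as the following sketch
(module and message are invented; the category name is spelled
conventionally here -- its spelling gets a comment from Antoine later
in the thread):

    import warnings

    class ExperimentalWarning(Warning):
        """The flagged module makes no backwards-compatibility promises yet."""

    # at the top of the newly accepted module:
    warnings.warn("this module is new in this release; its API may still "
                  "change before it is frozen",
                  ExperimentalWarning, stacklevel=2)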
From brett at python.org Wed Jun 2 03:24:01 2010
From: brett at python.org (Brett Cannon)
Date: Tue, 1 Jun 2010 18:24:01 -0700
Subject: [Python-ideas] lack of time zone support
In-Reply-To: <56465.1275441508@parc.com>
References: <51245.1275417154@parc.com> <56465.1275441508@parc.com>
Message-ID: 

On Tue, Jun 1, 2010 at 18:18, Bill Janssen wrote:
> Brett Cannon wrote:
>
>> First of all, there will never be a timezone table in the stdlib,
>> period. This has been brought up before and is always shot down
>> because python-dev does not want to have to keep track of timezone
>> changes. pytz and other modules fit that bill fine.
>
> Sure, sure.  Though I'm not sure that it has to be "in" the standard
> library to be part of the standard library.  Past time for CPython to
> start thinking about on-demand data, pulled dynamically from "the
> cloud", with a static version for backup.  Just a thought...
>
>> Now if you want UTC, that's different. Alexander already linked to an
>> issue that is discussing that end. The current proposal is to provide
>> a generic class that creates fixed UTC-offset timezones, with an
>> instance for UTC set on the datetime module.
>
> Yes, I've been following that.  Very promising.

Just need a patch. =)

>
>> If you get that class in, you could then patch _strptime to support
>> the %z directive so as to return a timezone that had a set UTC-offset.
>> Not optimal, but it's something.
>
> Yes, exactly.

Then that's fine. Get the fixed offset timezone in and then get a patch
for this and I don't see resistance.

From jnoller at gmail.com Wed Jun 2 03:33:43 2010
From: jnoller at gmail.com (Jesse Noller)
Date: Tue, 1 Jun 2010 21:33:43 -0400
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: 
References: 
Message-ID: 

On Tue, Jun 1, 2010 at 9:22 PM, Brett Cannon wrote:
> On Tue, Jun 1, 2010 at 11:13, Ian Bicking wrote:
>> Threading will probably break here as I wasn't on the list for the
>> first email...
>>
>> My concern with the standard library is that there's a couple things
>> going on:
>>
>> 1. The standard library represents "accepted" functionality, kind of
>> best practice, kind of just conventional.  Everyone (roughly) knows
>> what you are talking about when you use things from the standard
>> library.
>> 2. The standard library has some firm backward compatibility
>> guarantees.  It also has some firm stability guarantees, especially
>> within releases (though in practice, nearly for eternity).
>> 3. The standard library is kind of collectively owned; it's not up to
>> the whims of one person, and can't be abandoned.
>> 4. The standard library is one big chunk of functionality, upgraded
>> all under one version number, and specifically works together (though
>> in practice cross-module refactorings are uncommon).
>>
>> There's positive things about these features, but 4 really drives me
>> nuts, and I think is a strong disincentive to putting stuff into the
>> standard library.  For packaging I think 4 actively damages
>> maintainability.
>>
>> Packaging is at the intersection of several systems:
>>
>> * Python versions
>> * Forward and backward compatibility with distributed libraries
>> * System policies (e.g., Debian has changed things around a lot in the
>> last few years)
>> * A whole other ecosystem of libraries outside of Python (e.g.,
>> binding to C libraries)
>> * Various developer toolkits, some Python specific (e.g., Cython) some
>> not (gcc)
>>
>> I don't think it's practical to think that we can determine some scope
>> of packaging where it will be stable in the long term, all these
>> things are changing and many are changing without any particular
>> concern for how it affects Python (i.e., packaging must be reactive).
>> And frankly we clearly do not have packaging figured out, we're still
>> circling in on something... and I think the circling will be more like
>> a Strange Attractor than a sink drain.
>>
>> The issues exist for other libraries that aren't packaging-related, of
>> course, it's just worse for packaging.  argparse for instance is not
>> "done"... it has bugs that won't be fixed before release, and
>> functionality that it should reasonably include.  But there's no path
>> for it to get better.  Will it have new and better features in Python
>> 3.3?  Who seriously wants to write code that is only compatible with
>> Python 3.3+ just because of some feature in argparse?  Instead
>> everyone will work around argparse as it currently exists.  In the
>> process they'll probably use undocumented APIs, further calcifying the
>> library and making future improvements disruptive.
>>
>> It's not very specific to argparse, I think ElementTree has similar
>> issues.  The json library is fairly unique in that it has a scope that
>> can be "done".  I don't know what to say about wsgiref... it's
>> completely irrelevant in Python 3 because it was upgraded along the
>> Python schedule despite being unready to be released (this is
>> relatively harmless as I don't think anyone is using wsgiref in
>> Python 3).
>>
>> So, this is the tension I see.
>> I think aspects of the standard library process and its guarantees
>> are useful, but the current process means releasing code that isn't
>> ready or not releasing code that should be released; neither is good
>> practice, and both compromise those guarantees.  Lots of moving
>> versions can indeed be difficult to manage... though it can be made a
>> lot easier with good practices.  Though even then distutils2 (and
>> pip) do not even fit into that... they both enter into the workflow
>> before you start working with libraries and versions, making them
>> somewhat unique (though also giving them some more flexibility, as
>> they are not so strongly tied to the Python runtime, which is where
>> stability requirements are most needed).
>
> I can only see two scenarios that might be considered acceptable to
> address these issues.
>
> One is that when new modules are accepted into the stdlib they are
> flagged with an ExpermintalWarning so that people know that no
> backwards-compatibility promises have been made yet. That gets the
> module more exposure and gets python-dev real-world feedback to fix
> issues before the module calcifies into a strong
> backwards-compatibility promise. With that experience, more informed
> decisions can be made as to how to change things (e.g. the logging
> module's default timestamp including microseconds, which strptime
> cannot parse).
>
> Otherwise we shift to an annual release schedule, but alternate Python
> versions have a language moratorium. That would mean only new language
> features every two years, but a new stdlib annually.

I'm actually partial to this idea - the stdlib, by its very existence,
has to evolve more quickly than the language itself, and it should
fundamentally see more releases to stay up to date, and slightly fresh.

jesse

From ianb at colorstudy.com Wed Jun 2 04:29:10 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Tue, 1 Jun 2010 21:29:10 -0500
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: 
References: 
Message-ID: 

On Tue, Jun 1, 2010 at 8:22 PM, Brett Cannon wrote:
> But one thing I can tell you is that having separate module releases
> of what is in the stdlib under the same name, or doing a separate
> stdlib release, will not happen. Python-dev as a whole does not like
> this idea and I don't see that changing.

I have no particular interest in changing the stdlib as it exists now.
It is what it is; I don't care if there's extra stuff in it, and I'm
now long settled into working around all bugs I encounter.  While I
think there are past situations that exemplify certain problems, I'm
really just bringing them up as examples.

But pip and distutils2 aren't settled into anything, and I don't want
us to retrace bad paths just because they are so well trod.

-- 
Ian Bicking | http://blog.ianbicking.org

From tjreedy at udel.edu Wed Jun 2 05:01:23 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Tue, 01 Jun 2010 23:01:23 -0400
Subject: [Python-ideas] Having unbound methods refer to the classes their
	defined on
In-Reply-To: 
References: 
Message-ID: 

On 6/1/2010 1:36 PM, cool-RR wrote:

> In Python 2.x there was an "unbound method" type. An unbound method
> would have an attribute `.im_class` that would refer to the class on
> which the method was defined.

Actually, I believe it referred to the class through which the function
was accessed. The object in the class __dict__ was still the function.
In both 2.x and 3.x, a function can be an attribute of more than one
class and might not have been defined 'on' any of them. Right or wrong,
I believe it was thought that adding the wrapper was more of a nuisance
than a benefit.

I suppose you could propose that when a function is directly accessed
via a class (as opposed to via an instance, when wrapping as a bound
method is still done), an __access_class__ attribute could be added,
but I do not know if that would even help you. Perhaps a custom
metaclass could be written to do this now (I definitely do not know
this for sure).

Terry Jan Reedy

From arnodel at googlemail.com Wed Jun 2 08:36:54 2010
From: arnodel at googlemail.com (Gmail)
Date: Wed, 2 Jun 2010 07:36:54 +0100
Subject: [Python-ideas] Having unbound methods refer to the classes their
	defined on
In-Reply-To: 
References: 
Message-ID: <0B87A480-0392-4EFF-BBEC-8947D010844F@gmail.com>

On 1 Jun 2010, at 18:36, cool-RR wrote:

> Hello,
>
> I would like to raise an issue here that I've been discussing at
> python-porting.
>
> (And I'd like to preface by saying that I'm not intimately familiar
> with Python's innards, so if I make any mistakes please correct me.)
>
> In Python 2.x there was an "unbound method" type. An unbound method
> would have an attribute `.im_class` that would refer to the class on
> which the method was defined. This allowed users to use the `copy_reg`
> module to pickle unbound methods by name. (In a similar way to how
> functions and classes are pickled by default.)

Not exactly (python 2.6):

>>> class Foo(object):
...     def f(self): pass
... 
>>> Foo.f
<unbound method Foo.f>
>>> Foo.f.im_class
<class '__main__.Foo'>
>>> class Bar(Foo): pass
... 
>>> Bar.f
<unbound method Bar.f>
>>> Bar.f.im_class
<class '__main__.Bar'>

> In Python 3.x unbound methods are plain functions. There is no way of
> knowing on which class they are defined, so therefore it's impossible
> to pickle them. It is even impossible to tell `copyreg` to use a custom
> reducer:
> http://stackoverflow.com/questions/2932742/python-using-copyreg-to-define-reducers-for-types-that-already-have-reducers
>
> (To the people who wonder why would anyone want to pickle unbound
> methods: I know that it sounds like a weird thing to do. Keep in mind
> that sometimes your objects need to get pickled. For example if you're
> using the multiprocessing module, and you pass into it an object that
> somehow refers to an unbound method, then that method has to be
> picklable.)
>
> The idea is: Let's give unbound methods an attribute that will refer
> to the class on which they were defined.
>
> What do you think?

Unbound methods in Python 2.X were objects that were created on class
attribute access, not when the class was created, so what you are
asking for is different from what Python 2.X provided. Here is a very
simplified way to mimic 2.X in 3.X via metaclasses (Python 3.2):

>>> class FooType(type):
...     def __getattribute__(self, attrname):
...         attr = super().__dict__[attrname]
...         if isinstance(attr, type(lambda: 0)):
...             return ("unbound method", self, attr)
...         else:
...             return attr
... 
>>> class Foo(metaclass=FooType):
...     def f(self): pass
... 
>>> Foo.f
('unbound method', <class '__main__.Foo'>, <function f at 0x...>)
>>> Foo().f()
>>> 

What you want maybe instead is a metaclass that overrides type.__new__
or type.__init__ so that each function in the attributes of the class
is wrapped in some kind of wrapper like this:

class DefinedIn:
    def __init__(self, f, classdef):
        self.classdef = classdef
        self.f = f
    def __call__(self, *args, **kwargs):
        return self.f(*args, **kwargs)

-- 
Arnaud
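Combining Terry's and Arnaud's suggestions, a metaclass along these
lines (the attribute and class names are invented) would record the
defining class at class-creation time in current 3.x:

    class RecordingMeta(type):
        def __new__(mcls, name, bases, ns):
            cls = super().__new__(mcls, name, bases, ns)
            # stamp every plain function defined in this class body
            for attr in ns.values():
                if callable(attr) and not isinstance(attr, type):
                    try:
                        attr.__defined_on__ = cls
                    except (AttributeError, TypeError):
                        pass    # skip builtins and other read-only objects
            return cls

    class Foo(metaclass=RecordingMeta):
        def f(self):
            pass

    assert Foo.f.__defined_on__ is Foo

Note this records where the function was *defined*, not the class it
was accessed through -- the distinction Arnaud's im_class session
demonstrates.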
From solipsis at pitrou.net Wed Jun 2 09:53:51 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Wed, 2 Jun 2010 09:53:51 +0200
Subject: [Python-ideas] stdlib upgrades
References: 
Message-ID: <20100602095351.60ba0f2c@pitrou.net>

On Tue, 1 Jun 2010 18:22:38 -0700
Brett Cannon wrote:
>
> One is that when new modules are accepted into the stdlib they are
> flagged with an ExpermintalWarning

Are you advocating this specific spelling?

> Otherwise we shift to an annual release schedule, but alternate Python
> versions have a language moratorium. That would mean only new language
> features every two years, but a new stdlib annually.

I think this has already been shot down by Guido (I think I was the one
who asked last time :-)). Basically, even if you aren't adding new
language features, you are still compelling people to upgrade to a new
version with (very probably) slight compatibility annoyances.

From cool-rr at cool-rr.com Wed Jun 2 10:59:45 2010
From: cool-rr at cool-rr.com (cool-RR)
Date: Wed, 2 Jun 2010 10:59:45 +0200
Subject: [Python-ideas] Having unbound methods refer to the classes their
	defined on
In-Reply-To: <0B87A480-0392-4EFF-BBEC-8947D010844F@gmail.com>
References: <0B87A480-0392-4EFF-BBEC-8947D010844F@gmail.com>
Message-ID: 

On Wed, Jun 2, 2010 at 8:36 AM, Gmail wrote:
>
> On 1 Jun 2010, at 18:36, cool-RR wrote:
>
> Hello,
>
> I would like to raise an issue here that I've been discussing at
> python-porting.
>
> (And I'd like to preface by saying that I'm not intimately familiar
> with Python's innards, so if I make any mistakes please correct me.)
>
> In Python 2.x there was an "unbound method" type. An unbound method
> would have an attribute `.im_class` that would refer to the class on
> which the method was defined. This allowed users to use the `copy_reg`
> module to pickle unbound methods by name. (In a similar way to how
> functions and classes are pickled by default.)
>
> Not exactly (python 2.6):
>
> >>> class Foo(object):
> ...     def f(self): pass
> ...
> >>> Foo.f
> <unbound method Foo.f>
> >>> Foo.f.im_class
> <class '__main__.Foo'>
> >>> class Bar(Foo): pass
> ...
> >>> Bar.f
> <unbound method Bar.f>
> >>> Bar.f.im_class
> <class '__main__.Bar'>
>
> In Python 3.x unbound methods are plain functions. There is no way of
> knowing on which class they are defined, so therefore it's impossible
> to pickle them. It is even impossible to tell `copyreg` to use a custom
> reducer:
> http://stackoverflow.com/questions/2932742/python-using-copyreg-to-define-reducers-for-types-that-already-have-reducers
>
> (To the people who wonder why would anyone want to pickle unbound
> methods: I know that it sounds like a weird thing to do. Keep in mind
> that sometimes your objects need to get pickled. For example if you're
> using the multiprocessing module, and you pass into it an object that
> somehow refers to an unbound method, then that method has to be
> picklable.)
>
> The idea is: Let's give unbound methods an attribute that will refer
> to the class on which they were defined.
>
> What do you think?
>
> Unbound methods in Python 2.X were objects that were created on class
> attribute access, not when the class was created, so what you are
> asking for is different from what Python 2.X provided. Here is a very
> simplified way to mimic 2.X in 3.X via metaclasses (Python 3.2):
>
> >>> class FooType(type):
> ...     def __getattribute__(self, attrname):
> ...         attr = super().__dict__[attrname]
> ...         if isinstance(attr, type(lambda: 0)):
> ...             return ("unbound method", self, attr)
> ...         else:
> ...             return attr
> ...
> >>> class Foo(metaclass=FooType):
> ...     def f(self): pass
> ...
> >>> Foo.f
> ('unbound method', <class '__main__.Foo'>, <function f at 0x...>)
> >>> Foo().f()
> >>>
>
> What you want maybe instead is a metaclass that overrides type.__new__
> or type.__init__ so that each function in the attributes of the class
> is wrapped in some kind of wrapper like this:
>
> class DefinedIn:
>     def __init__(self, f, classdef):
>         self.classdef = classdef
>         self.f = f
>     def __call__(self, *args, **kwargs):
>         return self.f(*args, **kwargs)
>
> --
> Arnaud
>

Thanks for the corrections and the metaclass, Arnaud. (And thanks to
you too, Terry.) I might use it in my project.

> so what you are asking for is different from what Python 2.X provided.

Yes, I have been imprecise. So I'll correct my idea: I want Python 3.x
to tell me the class from which the unbound method was accessed. (It
can be done either on creation or on access, whichever seems better to
you.)

So I propose this as a modification of Python.

Ram.

From mal at egenix.com Wed Jun 2 12:13:00 2010
From: mal at egenix.com (M.-A. Lemburg)
Date: Wed, 02 Jun 2010 12:13:00 +0200
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: 
References: 
Message-ID: <4C062EAC.60302@egenix.com>

While I played with this idea a long time ago as well, I have since
found that it causes more trouble than it's worth.

Apart from having the user maintain at least two different versioned
packages (Python and (part of) the stdlib), it also causes problems if
you use this Python installation for more than one project: it's easily
possible to have project A require version 2 of a stdlib module and
project B version 3 of that same module.

If you then load both projects in an application, you end up either
with a broken project A or B (depending on whether you have version 2
or 3 of that stdlib module installed), or you allow loading multiple
versions of the same module, in which case you will likely break your
application, since it will find multiple class implementations (and
objects) for the same instances.

Things like exception catching, pickling (and esp. unpickling),
security checks based on classes, interface adapters and even simply
isinstance() checks would then fail in various hard to reproduce ways.

IMHO, we've so far done well by issuing new Python patch level releases
whenever there was a problem in the stdlib (and only then).

Introducing new features by way of updates is left to minor releases,
which then require more testing by the user.

This additional testing is what causes many corporates to not follow
the Python release cycle or to skip a few minor releases: the work
involved often just doesn't warrant the advantages of the added new
features.

The situation won't get any better if we start releasing partial or
complete stdlib updates even more often.

If users really want bleeding edge, they can just use the SVN version
of the stdlib or cherry-pick updates to modules or packages they care
about from SVN.

-- 
Marc-Andre Lemburg
eGenix.com

Professional Python Services directly from the Source  (#1, Jun 02 2010)
>>> Python/Zope Consulting and Support ...        http://www.egenix.com/
>>> mxODBC.Zope.Database.Adapter ...             http://zope.egenix.com/
>>> mxODBC, mxDateTime, mxTextTools ...        http://python.egenix.com/
________________________________________________________________________
2010-07-19: EuroPython 2010, Birmingham, UK                46 days to go

::: Try our new mxODBC.Connect Python Database Interface for free ! ::::
   eGenix.com Software, Skills and Services GmbH  Pastor-Loeh-Str.48
    D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg
           Registered at Amtsgericht Duesseldorf: HRB 46611
               http://www.egenix.com/company/contact/

From dickinsm at gmail.com Wed Jun 2 12:50:11 2010
From: dickinsm at gmail.com (Mark Dickinson)
Date: Wed, 2 Jun 2010 11:50:11 +0100
Subject: [Python-ideas] Date/time literals
In-Reply-To: 
References: 
Message-ID: 

On Tue, Jun 1, 2010 at 11:40 PM, Guido van Rossum wrote:
> Also note that there will be no validation possible for future
> datetimes (and for past dates it would require an up-to-date leap
> second database).

It's even worse than that :(.  Complete validation would also require
timezone knowledge, because leap seconds happen at the same instant
the world over: e.g., the leap second that occurred at 23:59:60 UTC
on 31st December 2008 occurred at 19:29:60 local time in Caracas.  So
for naive datetime objects validation is going to be difficult.  Given
that timezone offsets can be an arbitrary number of minutes, the only
reasonable options as far as I can see would be either *always* to
accept seconds in the range 0-60, or *always* to restrict the range to
0-59, as now.  (Well, you could only accept seconds=60 for timestamps
within 24 hours of midnight on Jun 30 or Dec 31st, but that's fairly
horrible. :-)

I'm still not convinced that incorrectly accepting some invalid UTC
times is worse than incorrectly rejecting some (rare) valid UTC times,
but I'll let it drop for now.

-- 
Mark

From dickinsm at gmail.com Wed Jun 2 12:55:50 2010
From: dickinsm at gmail.com (Mark Dickinson)
Date: Wed, 2 Jun 2010 11:55:50 +0100
Subject: [Python-ideas] Date/time literals
In-Reply-To: 
References: 
Message-ID: 

On Wed, Jun 2, 2010 at 11:50 AM, Mark Dickinson wrote:
> within 24 hours of midnight on Jun 30 or Dec 31st, but that's fairly

To avoid ambiguity, that should read "within 24 hours of 24:00:00 on
Jun 30th or Dec 31st", of course. :)

-- 
Mark
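To put Mark's point in code: even for timestamps known to be in UTC, a
rough check like the following (the helper name is invented) is about
all the validation available, and for naive or offset-local timestamps
not even this much works:

    import calendar

    def could_be_utc_leap_second(year, month, day, hour, minute, second):
        # Ordinary seconds need no special handling here.
        if second != 60:
            return True
        # A UTC leap second can only occur at 23:59:60 on the last day
        # of a month (in practice only June 30 or December 31 so far).
        last_day = calendar.monthrange(year, month)[1]
        return (hour, minute) == (23, 59) and day == last_day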
From mal at egenix.com Wed Jun 2 13:18:30 2010
From: mal at egenix.com (M.-A. Lemburg)
Date: Wed, 02 Jun 2010 13:18:30 +0200
Subject: [Python-ideas] Date/time literals
In-Reply-To: 
References: 
Message-ID: <4C063E06.7050606@egenix.com>

Mark Dickinson wrote:
> On Tue, Jun 1, 2010 at 11:40 PM, Guido van Rossum wrote:
>> Also note that there will be no validation possible for future
>> datetimes (and for past dates it would require an up-to-date leap
>> second database).
>
> It's even worse than that :(.  Complete validation would also require
> timezone knowledge, because leap seconds happen at the same instant
> the world over: e.g., the leap second that occurred at 23:59:60 UTC
> on 31st December 2008 occurred at 19:29:60 local time in Caracas.  So
> for naive datetime objects validation is going to be difficult.  Given
> that timezone offsets can be an arbitrary number of minutes, the only
> reasonable options as far as I can see would be either *always* to
> accept seconds in the range 0-60, or *always* to restrict the range to
> 0-59, as now.  (Well, you could only accept seconds=60 for timestamps
> within 24 hours of midnight on Jun 30 or Dec 31st, but that's fairly
> horrible. :-)
>
> I'm still not convinced that incorrectly accepting some invalid UTC
> times is worse than incorrectly rejecting some (rare) valid UTC times,
> but I'll let it drop for now.

You can use mxDateTime to store such values. I added support for
storing leap seconds long ago, but only for the UTC variants, not for
arbitrary time zones.

This was mainly done to support those values when using mxDateTime as a
storage container, rather than for calculations (those are all
POSIX-conformant, i.e. they omit leap seconds).

Note that most C libs nowadays only support the POSIX interpretation of
time_t values. Those don't include leap seconds:

>>> DateTime(1986,12,31,23,59,59).gmticks()
536457599.0
>>> DateTime(1986,12,31,23,59,60).gmticks()
536457600.0
>>> DateTime(1987,1,1,0,0,0).gmticks()
536457600.0

With leap seconds, you'd get 536457612 for
DateTime(1986,12,31,23,59,59).gmticks().  As a result, conversion to
time_t will be lossy.

IIRC, the BSDs were the last to switch off leap second support, but I
could be mistaken.

-- 
Marc-Andre Lemburg
eGenix.com

From ncoghlan at gmail.com Wed Jun 2 14:45:10 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 02 Jun 2010 22:45:10 +1000
Subject: [Python-ideas] Date/time literals
In-Reply-To: 
References: 
Message-ID: <4C065256.6070100@gmail.com>

On 02/06/10 03:28, Alexander Belopolsky wrote:
> Developers writing generic libraries have to deal with imagined use
> cases all the time.  If I write an rfc3339 timestamp parser, I cannot
> ignore the fact that XXXX-12-31T23:59:60Z is a valid timestamp.  If I
> do, I cannot claim that my parser implements rfc3339.  An application
> that uses python datetime objects to represent time may crash parsing
> logs produced in December 2008 on the systems that keep time in UTC.
>
> If all my application does is to read timestamps from some source,
> store them in the database and display them on a later date, I don't
> want to worry that it will crash when presented with 23:59:60.
>
> Of course, allowing leap seconds in the time/datetime constructor may
> be a way to delay detection of a bug.  An application may accept
> XXXX-12-31T23:59:60Z, but later rely on the fact that dt1-dt2 ==
> timedelta(0) implies dt1 == dt2.  Such issues, if they exist, can be
> addressed by the application without replacing the datetime object as
> a means of storing timestamps.  On the other hand, the current
> restriction in the constructor makes datetime fundamentally
> incompatible with a number of standards.

The case for allowing a "60" value for seconds in the datetime
constructor seems reasonable to me (i.e. prevent leap seconds from
breaking date parsing), but I don't see the use case for delaying
normalisation to a valid POSIX time.

If the constructor just converts the 60 to a zero and adds 1 minute
immediately, then the chance of subtle breakages would be minimal and
the current ValueError would be replaced by a far more graceful
behaviour.

(Allowing 2400 hours just seems plain odd to me, but if it was adopted
I'd suggest immediate normalisation be similarly applied in that case
as well).

Cheers,
Nick.

-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
---------------------------------------------------------------
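Nick's immediate-normalisation behaviour is easy to sketch as a plain
helper (a stand-in for the proposed constructor change, not actual
datetime code):

    from datetime import datetime, timedelta

    def make_datetime(year, month, day, hour=0, minute=0, second=0):
        if second == 60:                     # a leap second in the input
            return (datetime(year, month, day, hour, minute, 59)
                    + timedelta(seconds=1))  # normalise to the next minute
        return datetime(year, month, day, hour, minute, second)

    # make_datetime(1985, 6, 30, 23, 59, 60) == datetime(1985, 7, 1)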
From guido at python.org Wed Jun 2 15:32:44 2010
From: guido at python.org (Guido van Rossum)
Date: Wed, 2 Jun 2010 06:32:44 -0700
Subject: [Python-ideas] Date/time literals
In-Reply-To: <4C065256.6070100@gmail.com>
References: <4C065256.6070100@gmail.com>
Message-ID: 

On Wed, Jun 2, 2010 at 5:45 AM, Nick Coghlan wrote:
> On 02/06/10 03:28, Alexander Belopolsky wrote:
>>
>> Developers writing generic libraries have to deal with imagined use
>> cases all the time.  If I write an rfc3339 timestamp parser, I cannot
>> ignore the fact that XXXX-12-31T23:59:60Z is a valid timestamp.  If I
>> do, I cannot claim that my parser implements rfc3339.  An application
>> that uses python datetime objects to represent time may crash parsing
>> logs produced in December 2008 on the systems that keep time in UTC.
>>
>> If all my application does is to read timestamps from some source,
>> store them in the database and display them on a later date, I don't
>> want to worry that it will crash when presented with 23:59:60.
>>
>> Of course, allowing leap seconds in the time/datetime constructor may
>> be a way to delay detection of a bug.  An application may accept
>> XXXX-12-31T23:59:60Z, but later rely on the fact that dt1-dt2 ==
>> timedelta(0) implies dt1 == dt2.  Such issues, if they exist, can be
>> addressed by the application without replacing the datetime object as
>> a means of storing timestamps.  On the other hand, the current
>> restriction in the constructor makes datetime fundamentally
>> incompatible with a number of standards.
>
> The case for allowing a "60" value for seconds in the datetime
> constructor seems reasonable to me (i.e. prevent leap seconds from
> breaking date parsing), but I don't see the use case for delaying
> normalisation to a valid POSIX time.
>
> If the constructor just converts the 60 to a zero and adds 1 minute
> immediately, then the chance of subtle breakages would be minimal and
> the current ValueError would be replaced by a far more graceful
> behaviour.
>
> (Allowing 2400 hours just seems plain odd to me, but if it was adopted
> I'd suggest immediate normalisation be similarly applied in that case
> as well).

I'd be okay with immediate normalization in both of these cases as
well. Immediate normalization cuts off all concerns about unnormalized
datetimes, consistent comparisons, etc.

-- 
--Guido van Rossum (python.org/~guido)

From eric at trueblade.com Wed Jun 2 16:21:47 2010
From: eric at trueblade.com (Eric Smith)
Date: Wed, 02 Jun 2010 10:21:47 -0400
Subject: [Python-ideas] Date/time literals
In-Reply-To: <4C065256.6070100@gmail.com>
References: <4C065256.6070100@gmail.com>
Message-ID: <4C0668FB.5020200@trueblade.com>

Nick Coghlan wrote:
> The case for allowing a "60" value for seconds in the datetime
> constructor seems reasonable to me (i.e. prevent leap seconds from
> breaking date parsing), but I don't see the use case for delaying
> normalisation to a valid POSIX time.
>
> If the constructor just converts the 60 to a zero and adds 1 minute
> immediately, then the chance of subtle breakages would be minimal and
> the current ValueError would be replaced by a far more graceful
> behaviour.

I think this is the best we can do and not get sucked into supporting
leap seconds.

> (Allowing 2400 hours just seems plain odd to me, but if it was adopted
> I'd suggest immediate normalisation be similarly applied in that case
> as well).
I'm not as concerned about this, but I've had occasions where it would have been handy.

-- 
Eric.

From ianb at colorstudy.com  Wed Jun  2 17:03:30 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Wed, 2 Jun 2010 10:03:30 -0500
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: <4C062EAC.60302@egenix.com>
References: <4C062EAC.60302@egenix.com>
Message-ID:

On Wed, Jun 2, 2010 at 5:13 AM, M.-A. Lemburg wrote:

> While I played with this idea a long time ago as well, I have
> since found that it causes more trouble than it's worth.
>
> Apart from having the user maintain at least two different
> versioned packages (Python and (part of) the stdlib), it also
> causes problems if you use this Python installation for more
> than one project: it's easily possible to have project A require
> version 2 of a stdlib module and project B version 3 of that
> same module.

This exists for normal libraries currently, and using virtualenv I've found it to be manageable. It does require process separation (and sys.path separation) in some cases. I agree that global upgrades are dangerous. distutils2/pip may be different because projects don't generally get used except when managing a project, and very few projects will require any particular version of these libraries.

> If you then load both projects in an application, you end up
> either with a broken project A or B (depending on whether you have
> version 2 or 3 of that stdlib module installed), or you allow
> loading multiple versions of the same module, in which case you
> will likely break your application, since it will find multiple
> class implementations (and objects) for the same instances.
>
> Things like exception catching, pickling (and esp. unpickling),
> security checks based on classes, interface adapters and even
> simply isinstance() checks would then fail in various hard to
> reproduce ways.

Yes, multiple versions of a library loaded at the same time is not a good idea.

> IMHO, we've so far done well by issuing new Python patch level
> releases whenever there was a problem in the stdlib (and only
> then).
>
> Introducing new features by way of updates is left to
> minor releases, which then require more testing by the
> user.
>
> This additional testing is what causes many corporates to
> not follow the Python release cycle or skip a few minor
> releases: the work involved often just doesn't warrant the
> advantages of the added new features.

Yes, and so applications and libraries have to work around bugs instead of using fixed versions, generally making upgrades even more danger-prone. In the case of package management, the hardest libraries to support are those libraries that have included a large number of fixes for installation problems in their setup.py.

Futzing around with most of the standard library right now would just add complexity, and applying changes that might be more aesthetic than functional would be a really bad choice and lead to tedious discussions. But new functionality can't usefully *just* exist in the standard library because basically no one is using 2.7, few people are using 3.x, and lots of people are using 2.5 or at best 2.6 -- so new functionality should be available to all those people. Which means there *has* to be releases of any new functionality. argparse was already released, and so there will be "argparse" out in the wild that anyone can install on any version of Python shadowing the existing module. unittest improvements are being released as unittest2...
meaning I guess that the "proper" way to use that functionality would be: import sys if sys.version_info >= (2, 7): import unittest else: import unittest2 as unittest > The situations won't get any better if we start releasing > partial or complete stdlib updates even more often. > That a stdlib release means potentially *any* part of the standard library could have been upgraded (even though it probably won't be) probably will throw people off. The advantage of versions on specific functionality is that you can upgrade just what you care about. It's much less burdensome to test something that actually fixes a problem for you, and of course people do that all the time with non-standard libraries. -- Ian Bicking | http://blog.ianbicking.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From mal at egenix.com Wed Jun 2 18:35:39 2010 From: mal at egenix.com (M.-A. Lemburg) Date: Wed, 02 Jun 2010 18:35:39 +0200 Subject: [Python-ideas] stdlib upgrades In-Reply-To: References: <4C062EAC.60302@egenix.com> Message-ID: <4C06885B.9000505@egenix.com> Ian Bicking wrote: >> IMHO, we've so far done well by issuing new Python patch level >> releases whenever there was a problem in the stdlib (and only >> then). >> >> Introducing new features by way of updates is left to >> minor releases, which then require more testing by the >> user. >> >> This additional testing is what causes many corporates to >> not follow the Python release cycle or skip a few minor >> releases: the work involved often just doesn't warrant the >> advantages of the added new features. >> > > Yes, and so applications and libraries have to work around bugs instead of > using fixed versions, generally making upgrades even more danger-prone. In > the case of package management, the hardest libraries to support are those > libraries that have included a large number of fixes for installation > problems in their setup.py. True, but at least you know which work-arounds to remove in case you upgrade to a new stdlib version which fixes those problems. Users put a lot of trust into the stdlib reliability as a single package and are well aware of the fact that using many 3rd party packages put them at risk due to sometimes missing interoperability checks of those packages (which, of course, is hard for the 3rd package authors to do, since they can hardly know what combination a particular is using). > Futzing around with most of the standard library right now would just add > complexity, and applying changes that might be more aesthetic than > functional would be a really bad choice and lead to tedious discussions. Agreed. > But new functionality can't usefully *just* exist in the standard library > because basically no one is using 2.7, few people are using 3.x, and lots of > people are using 2.5 or at best 2.6 -- so new functionality should be > available to all those people. Which means there *has* to be releases of > any new functionality. argparse was already released, and so there will be > "argparse" out in the wild that anyone can install on any version of Python > shadowing the existing module. unittest improvements are being released as > unittest2... meaning I guess that the "proper" way to use that functionality > would be: > > import sys > if sys.version_info >= (2, 7): > import unittest > else: > import unittest2 as unittest True and that's how most code bases I've seen work: they start with a 3rd party package and then revert to stdlib one once it's integrated. 
>> The situations won't get any better
>> if we start releasing
>> partial or complete stdlib updates even more often.
>
> That a stdlib release means potentially *any* part of the standard library
> could have been upgraded (even though it probably won't be) probably will
> throw people off.

I'm not sure about that one. Users I've worked with typically trust the interoperability of the stdlib a lot more than that of a set of 3rd party packages, and even though there may be lots of different changes in the stdlib, the package as a whole is assumed to be more robust and better tested than the average set of 3rd party packages (even though this may not be a true assumption).

> The advantage of versions on specific functionality is that you can upgrade
> just what you care about. It's much less burdensome to test something that
> actually fixes a problem for you, and of course people do that all the time
> with non-standard libraries.

True, that's why people add work-arounds or specific fixes for the stdlib to their setup.py ... so that they can (conditionally) remove them again when the next Python version is released.

-- 
Marc-Andre Lemburg
eGenix.com

Professional Python Services directly from the Source  (#1, Jun 02 2010)
>>> Python/Zope Consulting and Support ...        http://www.egenix.com/
>>> mxODBC.Zope.Database.Adapter ...             http://zope.egenix.com/
>>> mxODBC, mxDateTime, mxTextTools ...        http://python.egenix.com/
________________________________________________________________________
2010-07-19: EuroPython 2010, Birmingham, UK                46 days to go

::: Try our new mxODBC.Connect Python Database Interface for free ! ::::

   eGenix.com Software, Skills and Services GmbH  Pastor-Loeh-Str.48
    D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg
           Registered at Amtsgericht Duesseldorf: HRB 46611
               http://www.egenix.com/company/contact/

From raymond.hettinger at gmail.com  Wed Jun  2 18:37:07 2010
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Wed, 2 Jun 2010 09:37:07 -0700
Subject: [Python-ideas] An identity dict
In-Reply-To:
References: <7AC8DB63-DAD6-46EA-89B1-AA339E4D7B43@gmail.com>
Message-ID:

>> * In the examples you posted (such as
>> http://codespeak.net/svn/pypy/trunk/pypy/tool/algo/graphlib.py ),
>> it appears that PyPy already has an identity dict, so how are they helped by
>> adding one to the collections module?
>
> My purpose with those examples was to prove it as a generally useful utility.
>
>> * Most of the posted examples already work with regular dicts (which check
>> identity before they check equality) -- don't the other implementations already
>> implement regular dicts which need to have identity-implied-equality in order to
>> pass the test suite? I would expect the following snippet to work under all
>> versions and implementations of Python:
>>
>> >>> class A:
>> ...     pass
>> >>> a = A()
>> >>> d = {a: 10}
>> >>> assert d[a] == 10   # uses a's identity for lookup
>
> Yes, but that would be different if you have two "a"s with __eq__ defined to be
> equal and you want to hash them separately.

None of the presented examples take advantage of that property. All of them work with regular dictionaries. This proposal is still use case challenged.

AFAICT from code searches, the idea of needing to override an existing __eq__ with an identity-only comparison seems to never come up. It would not even be popular as an ASPN recipe.

Moreover, I think that including it in the standard library would be harmful. The language makes very few guarantees about object identity.
In most cases a user would be far better off using a regular dictionary. If a rare case arose where __eq__ needed to be overridden with an identity-only check, it is not hard to write d[id(obj)]=value.

Strong -1 on including this in the standard library.

Raymond

P.S. ISTM that including subtly different variations of a data type does more harm than good. Understanding how to use an identity dictionary correctly requires understanding the nuances of object identity, how to keep the object alive outside the dictionary (even if the dictionary keeps it alive, a user still needs an external reference to be able to do a lookup), and knowing that the version proposed for CPython has dramatically worse speed/space performance than a regular dictionary. The very existence of an identity dictionary in collections is likely to distract a user away from a better solution using: d[id(obj)]=value.

From benjamin at python.org  Wed Jun  2 19:38:46 2010
From: benjamin at python.org (Benjamin Peterson)
Date: Wed, 2 Jun 2010 17:38:46 +0000 (UTC)
Subject: [Python-ideas] An identity dict
References: <7AC8DB63-DAD6-46EA-89B1-AA339E4D7B43@gmail.com>
Message-ID:

Raymond Hettinger writes:
> None of the presented examples take advantage of that property.
> All of them work with regular dictionaries. This proposal is still
> use case challenged.

Besides that ASPN recipe of mine I mentioned before [1], here are some more examples:

- copy.deepcopy() and pickle use it for an object memo.
- keeping track of protocol versions in Twisted. [2]
- memo in a serialization protocol. [3]

[1] http://code.activestate.com/recipes/577242-calling-c-level-finalizers-without-__del__/
[2] http://twistedmatrix.com/trac/browser/trunk/twisted/persisted/styles.py
[3] http://twistedmatrix.com/trac/browser/trunk/twisted/spread/jelly.py

> AFAICT from code searches, the idea of needing to override
> an existing __eq__ with an identity-only comparison seems
> to never come up. It would not even be popular as an ASPN recipe.
>
> Moreover, I think that including it in the standard library would be
> harmful.
> The language makes very few guarantees about object identity.
> In most cases a user would be far better off using a regular dictionary.
> If a rare case arose where __eq__ needed to be overridden with an
> identity-only check, it is not hard to write d[id(obj)]=value.

On the other hand, d[id(obj)] can be dangerous and incorrect on CPython:

>>> d = {}
>>> d[id([])] = 10
>>> d[id([])]
10

> Strong -1 on including this in the standard library.

How do you feel about Antoine's keyfuncdict proposal?

> Raymond
>
> P.S. ISTM that including subtly different variations of a data type
> does more harm than good. Understanding how to use an
> identity dictionary correctly requires understanding the nuances
> of object identity,

We're all adults here. We provide WeakKeyDictionary and friends even though they rely on unpredictable and subtle garbage collection.

> how to keep the object alive outside the dictionary
> (even if the dictionary keeps it alive, a user still needs an
> external reference to be able to do a lookup), and knowing that
> the version proposed for CPython has dramatically worse
> speed/space performance than a regular dictionary. The very
> existence of an identity dictionary in collections is likely to
> distract a user away from a better solution using: d[id(obj)]=value.

I would argue that that's not a better solution given the above example.
Anyone using id(obj) would have to understand the nuances of object identity perhaps more than with a real identity dictionary.

From benjamin at python.org  Wed Jun  2 19:42:38 2010
From: benjamin at python.org (Benjamin Peterson)
Date: Wed, 2 Jun 2010 17:42:38 +0000 (UTC)
Subject: [Python-ideas] An identity dict
References: <4C030321.4050803@canterbury.ac.nz> <89E5DB78-B304-4A9F-B140-96888B2FCCC7@gmail.com>
Message-ID:

Raymond Hettinger writes:
> Also, there hasn't been much discussion of implementation,
> but unless you're willing to copy and paste most of the
> code in dictobject.c, you're going to end-up with something
> much slower than d[id(obj)]=value.

A slightly hacky way to implement this on CPython without copying much would be to implement __getitem__ and __setitem__ in C and subclass it in Python to call those methods in the rest of the dictionary implementation.

From raymond.hettinger at gmail.com  Wed Jun  2 19:58:50 2010
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Wed, 2 Jun 2010 10:58:50 -0700
Subject: [Python-ideas] An identity dict
In-Reply-To:
References: <7AC8DB63-DAD6-46EA-89B1-AA339E4D7B43@gmail.com>
Message-ID:

On Jun 2, 2010, at 10:38 AM, Benjamin Peterson wrote:
>>
>> Strong -1 on including this in the standard library.
>
> How do you feel about Antoine's keyfuncdict proposal?

His proposal is more interesting because it is more general. For example, a keyfuncdict may make it easy to implement a case-insensitive dictionary. I would like to see a concrete implementation to see what design choices it makes.

Raymond
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From raymond.hettinger at gmail.com  Wed Jun  2 20:25:13 2010
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Wed, 2 Jun 2010 11:25:13 -0700
Subject: [Python-ideas] keyfuncdict -- the good, the bad, and the ugly
Message-ID:

Was thinking of all the things that could be done with Antoine's generalization:

The Good
========

d = keyfuncdict(key=str.lower)        # case insensitive dict

def track(obj):
    logging.info(obj)
    return obj
d = keyfuncdict(key=track)            # monitored dict

d = keyfuncdict(key=id)               # makes benjamin happy

The Bad
=======

d = keyfuncdict(key=tuple)            # lets you use lists as keys
d = keyfuncdict(key=repr)             # support many kinds of mutable or unhashable keys
d = keyfuncdict(key=pickle.dumps)     # use anything picklable as a key
d = keyfuncdict(key=getuser)          # track one most recent entry per user

The Ugly
========

d = keyfuncdict(key=random.random)            # just plain weird
d = keyfuncdict(key=itertools.count().next)   # all entries are unique and unretrievable ;-)

def remove(obj):
    d.pop(obj)
    return obj
d = keyfuncdict(key=remove)           # self deleting dict ;-)

Raymond

From jimjjewett at gmail.com  Wed Jun  2 21:31:47 2010
From: jimjjewett at gmail.com (Jim Jewett)
Date: Wed, 2 Jun 2010 15:31:47 -0400
Subject: [Python-ideas] keyfuncdict -- the good, the bad, and the ugly
In-Reply-To:
References:
Message-ID:

On Wed, Jun 2, 2010 at 2:25 PM, Raymond Hettinger wrote:
> Was thinking of all the things that could be done with Antoine's generalization:

> The Good
> ========
> d = keyfuncdict(key=str.lower)        # case insensitive dict

Will the keyfunc be an immutable attribute? If so, this satisfies some of the security proxy use cases too. (Those might also want to be able to transform the value -- perhaps on the way out. But allowing access to key, value, item, and in/out ... I'm not sure exactly where to draw YAGNI.)

> The Bad
> =======
> d = keyfuncdict(key=tuple)            # lets you use lists as keys

Obviously, this would be for a limited domain, to avoid tupling numbers. But given that ... why is this bad? Is it just because this might be an attractive nuisance for people who really ought to be using key=id? (Same question for key in (repr, pickle.dumps))

> d = keyfuncdict(key=getuser)          # track one most recent entry per user

Assuming a reasonable getuser, what is wrong with this?

> The Ugly
> ========
> d = keyfuncdict(key=random.random)    # just plain weird

This reminds me of the gremlins testing tools, though I admit that I can't quite come up with a good use.

> d = keyfuncdict(key=itertools.count().next)   # all entries are unique and unretrievable ;-)

But you could still iterate over them, in creation order. The advantage over a list is unclear.

> def remove(obj):
>     d.pop(obj)
>     return obj
> d = keyfuncdict(key=remove)           # self deleting dict ;-)

Again, probably most useful for testing ... is it actually harmful to allow this?

-jJ

From brett at python.org  Wed Jun  2 23:28:55 2010
From: brett at python.org (Brett Cannon)
Date: Wed, 2 Jun 2010 14:28:55 -0700
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: <20100602095351.60ba0f2c@pitrou.net>
References: <20100602095351.60ba0f2c@pitrou.net>
Message-ID:

On Wed, Jun 2, 2010 at 00:53, Antoine Pitrou wrote:
> On Tue, 1 Jun 2010 18:22:38 -0700
> Brett Cannon wrote:
>>
>> One is that when new modules are accepted into the stdlib they are
>> flagged with a ExpermintalWarning
>
> Are you advocating this specific spelling? Or ExperimentWarning.
>
>> Otherwise we shift to an annual release schedule, but alternate Python
>> versions have a language moratorium. That would mean only new language
>> features every two years, but a new stdlib annually.
>
> I think this has already been shot down by Guido (I think I was the one
> who asked last time :-)). Basically, even if you aren't adding new
> language features, you are still compelling people to upgrade to a new
> version with (very probably) slight compatibility annoyances.

Not surprised.

From grflanagan at gmail.com  Thu Jun  3 08:40:22 2010
From: grflanagan at gmail.com (Gerard Flanagan)
Date: Thu, 03 Jun 2010 07:40:22 +0100
Subject: [Python-ideas] stdlib upgrades
In-Reply-To:
References:
Message-ID:

Tarek Ziadé wrote:
> Hello,
>
> That's not a new idea, but I'd like to throw it here again.
>
> Some modules/packages in the stdlib are pretty isolated, which means
> that they could be upgraded with no harm, independently from the rest.
> For example the unittest package, or the email package.
>
> Here's an idea:
>
> 1 - add a version number in each package or module of the stdlib that
> is potentially upgradable
>
> 2 - create standalone releases of these modules/packages at PyPI, in a
> restricted area 'stdlib upgrades' that can be used only by core devs
> to upload new versions. Each release lists the precise Python versions
> it's compatible with.

Not a packaging expert, but I think in the context of a virtualenv this all makes sense. The ability to have a pip requirements file (for example) with

    stdlib-email==2.6
    stdlib-unittest==2.7

would be a useful flexibility in my view. Any given application or library will only exercise a certain subset of the stdlib after all. Also it might give you more confidence to upgrade to a higher Python if you had this flexibility. Whether you wanted to incorporate this in the absence of a virtualenv is another question, I suppose.
> 4 - an upgraded package lands in a new specific site-packages
> directory and is loaded *before* the one in Lib

For a quick test, I added a "prioritize_site_packages" function to a virtualenv's site.py, which just rearranged sys.path so that anything containing the string 'site-packages' was prior to anything else. Would this be sufficient in the general case?

From pjenvey at underboss.org  Thu Jun  3 21:02:44 2010
From: pjenvey at underboss.org (Philip Jenvey)
Date: Thu, 3 Jun 2010 12:02:44 -0700
Subject: [Python-ideas] An identity dict
In-Reply-To:
References: <7AC8DB63-DAD6-46EA-89B1-AA339E4D7B43@gmail.com>
Message-ID: <05D86C53-5804-4E3A-A339-4750451DEEEE@underboss.org>

On Jun 2, 2010, at 9:37 AM, Raymond Hettinger wrote:
>
> Moreover, I think that including it in the standard library would be harmful.
> The language makes very few guarantees about object identity.
> In most cases a user would be far better off using a regular dictionary.
> If a rare case arose where __eq__ needed to be overridden with an
> identity-only check, it is not hard to write d[id(obj)]=value.
>
> Strong -1 on including this in the standard library.
>
> P.S. ISTM that including subtly different variations of a data type
> does more harm than good. Understanding how to use an
> identity dictionary correctly requires understanding the nuances
> of object identity, how to keep the object alive outside the dictionary
> (even if the dictionary keeps it alive, a user still needs an external reference
> to be able to do a lookup), and knowing that the version proposed for
> CPython has dramatically worse speed/space performance than
> a regular dictionary. The very existence of an identity dictionary in
> collections is likely to distract a user away from a better solution using:
> d[id(obj)]=value.
>>
>> Essentially these are places where defined equality should not matter.
>>
> Essentially, these are cases where an identity dictionary isn't
> necessary and would in-fact be worse performance-wise
> in every implementation except for PyPy which can compile
> the pure python code for identity_dict.py.

Using id() is a workaround but again, a potentially expensive one for platforms with moving GCs. Every object calling for an id() forces additional bookkeeping on their end. This is only a better solution for CPython. Whereas abstracting this out into an identitydict type gives all platforms the chance to provide their own optimized versions.

> Since instances have a default hash equal to the id and since
> identity-implies-equality for dictionary keys, we already have
> a dictionary that handles these cases. You don't even
> have to type: d[id(k)]=value, it would suffice to write: d[k]=value.

No, the default hash backed by id is a CPython implementation detail.

Another use case is just the fact that Python allows you to completely change the semantics of __eq__ (and for good reason). Though this is rare, take SQLAlchemy's SQL expression DSL, for example, which uses __eq__ to generate a where clause:

table.select(table.c.id == 4)   # table.c.id == 4 returns a "where id == 4" object

I don't see how a platform like Jython can provide an optimized identitydict that avoids id() calls via keyfuncdict(key=id). The keys() of said dict would need to be actual results of id() calls.

I'm +1 on an identitydict as long as its CPython implementation doesn't provide worse performance than the id workaround.
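As a concrete strawman of the kind of implementation Raymond asked to see (a sketch only -- not Antoine's actual proposal; the constructor signature and the key=id default are invented here), a keyfuncdict could store key_func(key) -> (key, value) pairs, so that with key=id it behaves as an identity dict and itself keeps the keys alive:

class keyfuncdict(dict):
    # Sketch: store entries under key_func(obj) but remember the
    # original object, so keys() can return real objects rather than,
    # e.g., raw id() results. Only the core methods are shown;
    # __iter__, get(), etc. are omitted.
    def __init__(self, key=id):
        dict.__init__(self)
        self._key = key

    def __setitem__(self, obj, value):
        dict.__setitem__(self, self._key(obj), (obj, value))

    def __getitem__(self, obj):
        return dict.__getitem__(self, self._key(obj))[1]

    def __delitem__(self, obj):
        dict.__delitem__(self, self._key(obj))

    def __contains__(self, obj):
        return dict.__contains__(self, self._key(obj))

    def keys(self):
        return [obj for obj, value in dict.values(self)]

>>> d = keyfuncdict(key=id)
>>> k1, k2 = [], []        # equal, but distinct objects
>>> d[k1], d[k2] = 1, 2
>>> (d[k1], d[k2], k1 in d)
(1, 2, True)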
-- 
Philip Jenvey

From raymond.hettinger at gmail.com  Thu Jun  3 21:43:10 2010
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Thu, 3 Jun 2010 12:43:10 -0700
Subject: [Python-ideas] An identity dict
In-Reply-To: <05D86C53-5804-4E3A-A339-4750451DEEEE@underboss.org>
References: <7AC8DB63-DAD6-46EA-89B1-AA339E4D7B43@gmail.com> <05D86C53-5804-4E3A-A339-4750451DEEEE@underboss.org>
Message-ID: <087642E9-29FA-44A6-B4F3-BEC42D1D3C22@gmail.com>

>> P.S. ISTM that including subtly different variations of a data type
>> does more harm than good. Understanding how to use an
>> identity dictionary correctly requires understanding the nuances
>> of object identity, how to keep the object alive outside the dictionary
>> (even if the dictionary keeps it alive, a user still needs an external reference
>> to be able to do a lookup), and knowing that the version proposed for
>> CPython has dramatically worse speed/space performance than
>> a regular dictionary. The very existence of an identity dictionary in
>> collections is likely to distract a user away from a better solution using:
>> d[id(obj)]=value.
>>>
>>> Essentially these are places where defined equality should not matter.
>>>
>> Essentially, these are cases where an identity dictionary isn't
>> necessary and would in-fact be worse performance-wise
>> in every implementation except for PyPy which can compile
>> the pure python code for identity_dict.py.
>
> Using id() is a workaround but again, a potentially expensive one for
> platforms with moving GCs. Every object calling for an id() forces
> additional bookkeeping on their end. This is only a better solution
> for CPython.

To be clear, most of the examples given so far work with regular dictionaries even without using id(). The exception was something system specific such as the pickling mechanism. So what we're talking about is the comparatively rare case when an object has an __eq__ method and you want that method to be ignored. For example, you have two tuples (3,5) and (3,5) which are equal but happen to be distinct in memory and your needs are:

* to treat the two equal objects as being distinct for some purpose
* to run faster than id() runs on non-CPython implementations
* don't care if the code is dog slow on CPython (i.e. slower than if you had used id())
* don't care that the two tuples being distinct in memory is not a guaranteed behavior across implementations (i.e. any implementation is free to make all equal tuples share the same id via interning)

FWIW, I spoke with Jim Baker about this yesterday and he believes that Jython has no need for an identity dict.

Raymond

P.S. If Antoine's keyfuncdict proposal gains traction, it would be possible for other implementations to create a fast special case for key=id.

From solipsis at pitrou.net  Thu Jun  3 22:04:29 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Thu, 3 Jun 2010 22:04:29 +0200
Subject: [Python-ideas] An identity dict
References: <7AC8DB63-DAD6-46EA-89B1-AA339E4D7B43@gmail.com> <05D86C53-5804-4E3A-A339-4750451DEEEE@underboss.org>
Message-ID: <20100603220429.0e727428@pitrou.net>

On Thu, 3 Jun 2010 12:02:44 -0700
Philip Jenvey wrote:
>
> Using id() is a workaround but again, a potentially expensive one for platforms with moving
> GCs. Every object calling for an id() forces additional bookkeeping on their end. This is
> only a better solution for CPython.

Well, CPython and all other implementations with a non-moving GC.
> Whereas abstracting this out into an identitydict type gives all platforms
> the chance to provide their own optimized versions.

That's really premature optimization.

Regards

Antoine.

From cool-rr at cool-rr.com  Fri Jun  4 01:17:46 2010
From: cool-rr at cool-rr.com (Ram Rachum)
Date: Thu, 3 Jun 2010 23:17:46 +0000 (UTC)
Subject: [Python-ideas] reiter: decorator to make generators reiterable
References: <4C02857B.2030502@gmx.net>
Message-ID:

Mathias Panzenböck writes:
>
> I think this decorator should be included in itertools:
>
> from functools import wraps
>
> class ReIter(object):
>
> ...
>
> Or is there already such a thing and I missed it?
>
> -panzi

It looks pretty cool to me. I'll probably include it in my personal project, and it would be cool if it were added to itertools. (Along with many other things that should be added to itertools.)

Ram.

From lie.1296 at gmail.com  Fri Jun  4 22:19:32 2010
From: lie.1296 at gmail.com (Lie Ryan)
Date: Sat, 05 Jun 2010 06:19:32 +1000
Subject: [Python-ideas] lack of time zone support
In-Reply-To:
References: <51245.1275417154@parc.com>
Message-ID:

On 06/02/10 10:50, Brett Cannon wrote:
> On Tue, Jun 1, 2010 at 11:32, Bill Janssen wrote:
>> To me, the single most irritating problem with the Python support for
>> date/time is the lack of support for time-zone understanding. This
>> breaks down into two major issues, %z and lack of a standard time-zone
>> table.
>>
>> First, let's say I have to parse a Medusa log file, which contains time
>> stamps in the form "DD/Mon/YYYY:HH:MM:SS [+|-]HHMM", e.g.
>> "31/May/2010:07:10:04 -0800". What I'd like to write is
>>
>> tm = time.mktime(time.strptime(timestamp, "%d/%b/%Y:%H:%M:%S %z"))
>>
>> which is what I'd do if I was writing in C. But no! The Python
>> _strptime module doesn't support "%z". So instead, I have to pick the
>> timestamp apart and do things separately and remember that "-0800" isn't
>> octal, and also isn't the same as -800, and remember whether to add or
>> subtract it. This seems insane. So, IMO, support for %z should be
>> added to Lib/_strptime.py. We need a patch.
>>
>> Secondly, we really need concrete subclasses of tzinfo, and some sort of
>> mapping. Lots of people have spent lots of time trying to figure out
>> this cryptic hint in datetime: "The datetime module does not supply any
>> concrete subclasses of tzinfo." I'm not sure whether pytz is the best
>> idea, or what I use, the "zoneinfo" module from python-dateutil. With
>> that, I still have to add the Windows timezone names, using the table at
>> http://unicode.org/repos/cldr/trunk/common/supplemental/windowsZones.xml,
>> because the code in python-dateutil only works with Windows timezone
>> names when running on Windows.
>
> First of all, there will never be a timezone table in the stdlib,
> period. This has been brought up before and is always shot down
> because python-dev does not want to have to keep track of timezone
> changes. pytz and other modules fit that bill fine.

Has a module to pull timezone data from the OS been proposed before?

From alexander.belopolsky at gmail.com  Fri Jun  4 22:57:34 2010
From: alexander.belopolsky at gmail.com (Alexander Belopolsky)
Date: Fri, 4 Jun 2010 16:57:34 -0400
Subject: [Python-ideas] lack of time zone support
In-Reply-To:
References: <51245.1275417154@parc.com>
Message-ID:

> Has a module to pull timezone data from the OS been proposed before?

Yes. "We really could use a module that accesses the entire timezone database but that's even more platform specific."
(Guido van Rossum, 2007-03-06, http://bugs.python.org/issue1647654#msg31139).

From george.sakkis at gmail.com  Sun Jun  6 14:30:11 2010
From: george.sakkis at gmail.com (George Sakkis)
Date: Sun, 6 Jun 2010 14:30:11 +0200
Subject: [Python-ideas] @setattr(obj, [name])
Message-ID:

It would be nice if setattr() was extended to allow usage as a decorator:

class Foo(object):
    pass

@setattr(Foo)
def bar(self):
    print 'bar'

@setattr(Foo, 'baz')
def get_baz(self):
    print 'baz'

>>> Foo().bar()
bar
>>> Foo().baz()
baz

Here's a pure Python implementation:

_setattr = setattr
def setattr(obj, *args):
    if len(args) >= 2:
        return _setattr(obj, *args)
    return lambda f: _setattr(obj, args[0] if args else f.__name__, f) or f

Thoughts ?

George

From g.brandl at gmx.net  Sun Jun  6 15:26:11 2010
From: g.brandl at gmx.net (Georg Brandl)
Date: Sun, 06 Jun 2010 15:26:11 +0200
Subject: [Python-ideas] @setattr(obj, [name])
In-Reply-To:
References:
Message-ID:

On 06.06.2010 14:30, George Sakkis wrote:
> It would be nice if setattr() was extended to allow usage as a decorator:
>
> class Foo(object):
>     pass
>
> @setattr(Foo)
> def bar(self):
>     print 'bar'
>
> @setattr(Foo, 'baz')
> def get_baz(self):
>     print 'baz'
>
> >>> Foo().bar()
> bar
> >>> Foo().baz()
> baz
>
> Here's a pure Python implementation:
>
> _setattr = setattr
> def setattr(obj, *args):
>     if len(args) >= 2:
>         return _setattr(obj, *args)
>     return lambda f: _setattr(obj, args[0] if args else f.__name__, f) or f
>
> Thoughts ?

Since this is useful for functions only, I would not try to overload a simple builtin; I'd call it def_on() and put it in my utility module:

@def_on(Foo)
def method(self):
    pass

Georg

-- 
Thus spake the Lord: Thou shalt indent with four spaces. No more, no less.
Four shall be the number of spaces thou shalt indent, and the number of thy
indenting shall be four. Eight shalt thou not indent, nor either indent thou
two, excepting that thou then proceed to four. Tabs are right out.

From anfedorov at gmail.com  Sun Jun  6 18:05:03 2010
From: anfedorov at gmail.com (Andrey Fedorov)
Date: Sun, 6 Jun 2010 12:05:03 -0400
Subject: [Python-ideas] @setattr(obj, [name])
In-Reply-To:
References:
Message-ID:

George Sakkis wrote:
> Thoughts ?

I liked the idea, then realized that I was misunderstanding it deeply.
I would have expected

@setattr('key', "value")
def bar(): pass

to make

bar.key == "value"

This was just my intuition from hearing "setattr decorator" and glancing over your email. But deceiving name aside, I think this is a useful decorator. I would just call it @method_of.

- Andrey
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From dstanek at dstanek.com  Sun Jun  6 18:16:44 2010
From: dstanek at dstanek.com (David Stanek)
Date: Sun, 6 Jun 2010 12:16:44 -0400
Subject: [Python-ideas] @setattr(obj, [name])
In-Reply-To:
References:
Message-ID:

On Sun, Jun 6, 2010 at 8:30 AM, George Sakkis wrote:
>
> Thoughts ?

I'm not sure I understand why you would want to do this. I may just be combative because I hate the overuse of decorators.

-- 
David
blog: http://www.traceback.org
twitter: http://twitter.com/dstanek

From george.sakkis at gmail.com  Sun Jun  6 22:26:07 2010
From: george.sakkis at gmail.com (George Sakkis)
Date: Sun, 6 Jun 2010 22:26:07 +0200
Subject: [Python-ideas] @setattr(obj, [name])
In-Reply-To:
References:
Message-ID:

On Sun, Jun 6, 2010 at 6:05 PM, Andrey Fedorov wrote:
> George Sakkis wrote:
>>
>> Thoughts ?
>
> I liked the idea, then realized that I was misunderstanding it deeply. I
> would have expected
>
> @setattr('key', "value")
> def bar(): pass
>
> to make
>
> bar.key == "value"

Yeah, that would be a useful decorator too; I'd actually use **kwargs so that you could do multiple bindings at once with @setattr(key1=value1, key2=value2, ...).

> This was just my intuition from hearing "setattr decorator" and glancing
> over your email. But deceiving name aside, I think this is a useful
> decorator. I would just call it @method_of.

I agree that overloading setattr() would be a bad idea given the two (at least) different interpretations. Still, "method_of" is not quite right either since it can also be used as a class decorator; moreover the 'obj' argument does not have to be a class, it can be a plain instance.

George

From george.sakkis at gmail.com  Sun Jun  6 22:43:22 2010
From: george.sakkis at gmail.com (George Sakkis)
Date: Sun, 6 Jun 2010 22:43:22 +0200
Subject: [Python-ideas] @setattr(obj, [name])
In-Reply-To:
References:
Message-ID:

On Sun, Jun 6, 2010 at 6:16 PM, David Stanek wrote:
> On Sun, Jun 6, 2010 at 8:30 AM, George Sakkis wrote:
>>
>> Thoughts ?
>
> I'm not sure I understand why you would want to do this. I may just be
> combative because I hate the overuse of decorators.

It depends on what you mean by "this": binding a function (or class) to an object as an attribute, or using a decorator to achieve it? I find a decorator quite elegant in this case, although the use case itself is admittedly much less common in Python than, say, Javascript (with ``obj.property = function(x,y) {...}`` expressions everywhere).

George

From lie.1296 at gmail.com  Mon Jun  7 00:21:09 2010
From: lie.1296 at gmail.com (Lie Ryan)
Date: Mon, 07 Jun 2010 08:21:09 +1000
Subject: [Python-ideas] @setattr(obj, [name])
In-Reply-To:
References:
Message-ID:

On 06/06/10 22:30, George Sakkis wrote:
> Thoughts ?

Reminds me of C++:

class Foo {
    void bar(int baz);
};

void Foo::bar(int baz) {
    ...
}

That's not to say that the decorator isn't useful though. So +0.1.

From rrr at ronadam.com  Mon Jun  7 08:33:18 2010
From: rrr at ronadam.com (Ron Adam)
Date: Mon, 07 Jun 2010 01:33:18 -0500
Subject: [Python-ideas] stdlib upgrades
In-Reply-To:
References:
Message-ID: <4C0C92AE.5030105@ronadam.com>

On 06/01/2010 08:22 PM, Brett Cannon wrote:
> I can only see two scenarios that might be considered acceptable to
> address these issues.
>
> One is that when new modules are accepted into the stdlib they are
> flagged with a ExpermintalWarning so that people know that no
> backwards-compatibility promises have been made yet. That gets the
> module more exposure and gets python-dev real-world feedback to fix
> issues before the module calcifies into a strong
> backwards-compatibility. With that experience more proper decisions
> can be made as to how to change things (e.g. the logging module's
> default timestamp including microseconds which strptime cannot parse).

Would it be possible to have a future_lib that gets enabled with something like...

    from __future__ import future_lib

These *new* library modules and packages won't be visible by default. Maybe they stay there until the next major version or possibly for some set period of time.
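Something in that spirit could at least be approximated today with an explicit opt-in shim rather than a real __future__ import (a hypothetical sketch -- nothing called future_lib exists, and a real __future__ import only affects the current module, so this is really just sys.path manipulation):

# future_lib.py -- hypothetical opt-in shim
import os
import sys

def enable():
    # Make a directory of provisional stdlib modules shadow the
    # regular ones for subsequent imports.
    provisional = os.path.join(sys.prefix, 'lib', 'provisional')
    if provisional not in sys.path:
        sys.path.insert(0, provisional)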
Ron

From lie.1296 at gmail.com  Mon Jun  7 12:23:13 2010
From: lie.1296 at gmail.com (Lie Ryan)
Date: Mon, 07 Jun 2010 20:23:13 +1000
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: <4C0C92AE.5030105@ronadam.com>
References: <4C0C92AE.5030105@ronadam.com>
Message-ID:

On 06/07/10 16:33, Ron Adam wrote:
>
> On 06/01/2010 08:22 PM, Brett Cannon wrote:
>
>> I can only see two scenarios that might be considered acceptable to
>> address these issues.
>>
>> One is that when new modules are accepted into the stdlib they are
>> flagged with a ExpermintalWarning so that people know that no
>> backwards-compatibility promises have been made yet. That gets the
>> module more exposure and gets python-dev real-world feedback to fix
>> issues before the module calcifies into a strong
>> backwards-compatibility. With that experience more proper decisions
>> can be made as to how to change things (e.g. the logging module's
>> default timestamp including microseconds which strptime cannot parse).
>
> Would it be possible to have a future_lib that gets enabled with
> something like...
>
>     from __future__ import future_lib
>
> These *new* library modules and packages won't be visible by default.
> Maybe they stay there until the next major version or possibly for some
> set period of time.

Reading that gives me a chuckle. Perhaps that would be a vote to change the name 'future'.

From george.sakkis at gmail.com  Mon Jun  7 12:51:51 2010
From: george.sakkis at gmail.com (George Sakkis)
Date: Mon, 7 Jun 2010 12:51:51 +0200
Subject: [Python-ideas] Callable properties
Message-ID:

I'm wondering if there is any downside in making properties callable:

class callableproperty(property):
    def __call__(self, obj):
        return self.fget(obj)

class Foo(object):
    @property
    def bar(self):
        return self

    @callableproperty
    def baz(self):
        return self

>>> foo = Foo()
>>> foo.baz is Foo.baz(foo)
True
>>> foo.bar is Foo.bar(foo)
...
TypeError: 'property' object is not callable

As for the motivation, having callable properties would make it easier to stack them with other decorators that typically expect callables. Am I missing something ?

George

From masklinn at masklinn.net  Mon Jun  7 13:06:43 2010
From: masklinn at masklinn.net (Masklinn)
Date: Mon, 7 Jun 2010 13:06:43 +0200
Subject: [Python-ideas] Callable properties
In-Reply-To:
References:
Message-ID: <6DA25C56-56E8-4574-9E05-EC9C221B367D@masklinn.net>

On 2010-06-07, at 12:51 , George Sakkis wrote:
>
> I'm wondering if there is any downside in making properties callable:

It already exists, it's called a method.
> Due to the way calling works in Python (you get a callable object and
> you apply the `()` operator to it) I don't think it's possible to
> discriminate based on the context to perform the same operation
> whether or not the value is called.
>
> Your best bet would probably be to wrap the output of the property
> in a subtype of itself (dynamically created subtype) able to return
> self on call. Or you just create a lambda wrapping the property call.

I think you misunderstood, he was suggesting making the property descriptor instances callable.

Not a bad idea, but as a change to a builtin it would be covered by the language moratorium. Easy to do in a subclass of property though.

At the moment you do the following, which is a bit ugly:

>>> class Foo(object):
...     @property
...     def foo(self):
...         return 'foo'
...
>>> f = Foo()
>>> Foo.foo.__get__(f)
'foo'

The reasons for wanting to do this are the same as the reasons you would call an unbound method.

Michael

> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas

-- 
http://www.voidspace.org.uk
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From fuzzyman at voidspace.org.uk  Mon Jun  7 13:46:19 2010
From: fuzzyman at voidspace.org.uk (Michael Foord)
Date: Mon, 7 Jun 2010 12:46:19 +0100
Subject: [Python-ideas] Callable properties
In-Reply-To:
References:
Message-ID:

On 7 June 2010 11:51, George Sakkis wrote:
> I'm wondering if there is any downside in making properties callable:
>
> class callableproperty(property):
>     def __call__(self, obj):
>         return self.fget(obj)
>
> class Foo(object):
>     @property
>     def bar(self):
>         return self
>
>     @callableproperty
>     def baz(self):
>         return self
>
> >>> foo = Foo()
> >>> foo.baz is Foo.baz(foo)
> True
> >>> foo.bar is Foo.bar(foo)
> ...
> TypeError: 'property' object is not callable
>
> As for the motivation, having callable properties would make it easier
> to stack them with other decorators that typically expect callables.
> Am I missing something ?

Not sure it would specifically help with stacking decorators on properties though. If you get them in the wrong order then you would not end up with a property descriptor in the class dict but with an arbitrary callable (or function which would be wrapped as a method) that no longer behaves as a property.

Michael

> George
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas

-- 
http://www.voidspace.org.uk
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From fdrake at acm.org  Mon Jun  7 15:16:28 2010
From: fdrake at acm.org (Fred Drake)
Date: Mon, 7 Jun 2010 09:16:28 -0400
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: <4C0C92AE.5030105@ronadam.com>
References: <4C0C92AE.5030105@ronadam.com>
Message-ID:

On Mon, Jun 7, 2010 at 2:33 AM, Ron Adam wrote:
> Would it be possible to have a future_lib that gets enabled with something
> like...
>
>     from __future__ import future_lib

This doesn't seem workable, since __future__ imports have local effects, but the side effect here is really about the global module space. How would you expect this to work if the "old" version has already been imported?

-Fred

-- 
Fred L. Drake, Jr.
"Chaos is the score upon which reality is written."
--Henry Miller

From fuzzyman at voidspace.org.uk  Mon Jun  7 15:40:36 2010
From: fuzzyman at voidspace.org.uk (Michael Foord)
Date: Mon, 7 Jun 2010 14:40:36 +0100
Subject: [Python-ideas] Callable properties
In-Reply-To:
References: <6DA25C56-56E8-4574-9E05-EC9C221B367D@masklinn.net>
Message-ID:

On 7 June 2010 12:34, Michael Foord wrote:
>
> [snip...]
> I think you misunderstood, he was suggesting making the property descriptor
> instances callable.
>
> Not a bad idea, but as a change to a builtin it would be covered by the
> language moratorium. Easy to do in a subclass of property though.
>
> At the moment you do the following, which is a bit ugly:
>
> >>> class Foo(object):
> ...     @property
> ...     def foo(self):
> ...         return 'foo'
> ...
> >>> f = Foo()
> >>> Foo.foo.__get__(f)
> 'foo'
>
> The reasons for wanting to do this are the same as the reasons you would
> call an unbound method.

Or this, which is still slightly ugly but at least gets rid of the magic method call:

>>> class Foo(object):
...     @property
...     def foo(self):
...         return 'foo'
...
>>> f = Foo()
>>> Foo.foo.fget(f)
'foo'

Michael

-- 
http://www.voidspace.org.uk
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From dangyogi at gmail.com  Mon Jun  7 16:37:25 2010
From: dangyogi at gmail.com (Bruce Frederiksen)
Date: Mon, 7 Jun 2010 10:37:25 -0400
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: <4C0C92AE.5030105@ronadam.com>
References: <4C0C92AE.5030105@ronadam.com>
Message-ID:

Or perhaps:

from experimental import new_module

This is kind of a guarantee that the interface will change; since at some point, if new_module is "calcified", this will have to be changed to just:

import new_module

For experimental language features, maybe:

from __experimental__ import new_feature

This makes it clear that new_feature may change (perhaps even not be adopted?), vs the from __future__ semantics.

Is it too complicated to try to differentiate between the decision of whether some capability will be provided or not, vs ironing out the API for that capability? For example,

from experimental import new_capability

means that there is no commitment for new_capability at all -- it may simply be dropped entirely. The danger of using this is that new_capability may simply disappear completely with no replacement.

While,

from proposed import new_capability

represents a commitment that new_capability will be provided at some point, but the API will likely change. Here the danger of using it is that you will likely have to change your program to conform to a new API.

A capability might start as "experimental", and if the value of it is demonstrated, move to "proposed" to work out the details before mainstreaming it.

-Bruce

On Mon, Jun 7, 2010 at 2:33 AM, Ron Adam wrote:
>
> On 06/01/2010 08:22 PM, Brett Cannon wrote:
>
>> I can only see two scenarios that might be considered acceptable to
>> address these issues.
>>
>> One is that when new modules are accepted into the stdlib they are
>> flagged with a ExpermintalWarning so that people know that no
>> backwards-compatibility promises have been made yet. That gets the
>> module more exposure and gets python-dev real-world feedback to fix
>> issues before the module calcifies into a strong
>> backwards-compatibility.
>> With that experience more proper decisions
>> can be made as to how to change things (e.g. the logging module's
>> default timestamp including microseconds which strptime cannot parse).
>
> Would it be possible to have a future_lib that gets enabled with
> something like...
>
>     from __future__ import future_lib
>
> These *new* library modules and packages won't be visible by default.
> Maybe they stay there until the next major version or possibly for some
> set period of time.
>
> Ron
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From guido at python.org  Mon Jun  7 17:33:20 2010
From: guido at python.org (Guido van Rossum)
Date: Mon, 7 Jun 2010 08:33:20 -0700
Subject: [Python-ideas] Callable properties
In-Reply-To:
References: <6DA25C56-56E8-4574-9E05-EC9C221B367D@masklinn.net>
Message-ID:

Not sure I follow all of this, but in general overloading of __call__ should be used rarely -- it's too easy for code to become unreadable otherwise.

--Guido

On Mon, Jun 7, 2010 at 6:40 AM, Michael Foord wrote:
>
> On 7 June 2010 12:34, Michael Foord wrote:
>
>> [snip...]
>> I think you misunderstood, he was suggesting making the property
>> descriptor instances callable.
>>
>> Not a bad idea, but as a change to a builtin it would be covered by the
>> language moratorium. Easy to do in a subclass of property though.
>>
>> At the moment you do the following, which is a bit ugly:
>>
>> >>> class Foo(object):
>> ...     @property
>> ...     def foo(self):
>> ...         return 'foo'
>> ...
>> >>> f = Foo()
>> >>> Foo.foo.__get__(f)
>> 'foo'
>>
>> The reasons for wanting to do this are the same as the reasons you would
>> call an unbound method.
>
> Or this, which is still slightly ugly but at least gets rid of the magic
> method call:
>
> >>> class Foo(object):
> ...     @property
> ...     def foo(self):
> ...         return 'foo'
> ...
> >>> f = Foo()
> >>> Foo.foo.fget(f)
> 'foo'
>
> Michael

-- 
--Guido van Rossum (python.org/~guido)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ianb at colorstudy.com  Mon Jun  7 17:35:46 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 10:35:46 -0500
Subject: [Python-ideas] stdlib upgrades
In-Reply-To: <4C0C92AE.5030105@ronadam.com>
References: <4C0C92AE.5030105@ronadam.com>
Message-ID:

On Mon, Jun 7, 2010 at 1:33 AM, Ron Adam wrote:
> On 06/01/2010 08:22 PM, Brett Cannon wrote:
>
>> I can only see two scenarios that might be considered acceptable to
>> address these issues.
>>
>> One is that when new modules are accepted into the stdlib they are
>> flagged with a ExpermintalWarning so that people know that no
>> backwards-compatibility promises have been made yet. That gets the
>> module more exposure and gets python-dev real-world feedback to fix
>> issues before the module calcifies into a strong
>> backwards-compatibility.
>> With that experience more proper decisions
>> can be made as to how to change things (e.g. the logging module's
>> default timestamp including microseconds which strptime cannot parse).
>
> Would it be possible to have a future_lib that gets enabled with something
> like...
>
>     from __future__ import future_lib
>
> These *new* library modules and packages won't be visible by default.
> Maybe they stay there until the next major version or possibly for some
> set period of time.

The only place where any of this seems even slightly useful would be a library closely associated with Python itself, e.g., a new ast module or something with imports. Everything else should be developed as an external installable library. At least things that are importable (str.partition for instance isn't something you import).

-- 
Ian Bicking  |  http://blog.ianbicking.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From python at mrabarnett.plus.com  Mon Jun  7 17:41:17 2010
From: python at mrabarnett.plus.com (MRAB)
Date: Mon, 07 Jun 2010 16:41:17 +0100
Subject: [Python-ideas] stdlib upgrades
In-Reply-To:
References: <4C0C92AE.5030105@ronadam.com>
Message-ID: <4C0D131D.2060302@mrabarnett.plus.com>

Bruce Frederiksen wrote:
> Or perhaps:
>
> from experimental import new_module
>
> This is kind of a guarantee that the interface will change; since at
> some point, if new_module is "calcified", this will have to be changed
> to just:
>
> import new_module
>
> For experimental language features, maybe:
>
> from __experimental__ import new_feature
>
> This makes it clear that new_feature may change (perhaps even not be
> adopted?), vs the from __future__ semantics.
>
> Is it too complicated to try to differentiate between the decision of
> whether some capability will be provided or not, vs ironing out the API
> for that capability?
>
> For example,
>
> from experimental import new_capability
>
> means that there is no commitment for new_capability at all -- it may
> simply be dropped entirely. The danger of using this is that
> new_capability may simply disappear completely with no replacement.
>
> While,
>
> from proposed import new_capability
>
> represents a commitment that new_capability will be provided at some
> point, but the API will likely change. Here the danger of using it is
> that you will likely have to change your program to conform to a new API.

A proposal isn't a commitment. It can still be rejected.

> A capability might start as "experimental", and if the value of it is
> demonstrated, move to "proposed" to work out the details before
> mainstreaming it.

From ianb at colorstudy.com  Mon Jun  7 17:46:26 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 10:46:26 -0500
Subject: [Python-ideas] Callable properties
In-Reply-To:
References:
Message-ID:

On Mon, Jun 7, 2010 at 5:51 AM, George Sakkis wrote:
> I'm wondering if there is any downside in making properties callable:
>
> class callableproperty(property):
>     def __call__(self, obj):
>         return self.fget(obj)
>
> class Foo(object):
>     @property
>     def bar(self):
>         return self
>
>     @callableproperty
>     def baz(self):
>         return self
>
> >>> foo = Foo()
> >>> foo.baz is Foo.baz(foo)
> True
> >>> foo.bar is Foo.bar(foo)
> ...
> TypeError: 'property' object is not callable
>
> As for the motivation, having callable properties would make it easier
> to stack them with other decorators that typically expect callables.
> Am I missing something ?
I find stacking descriptors to be easier than it might at first appear (and making decorators descriptors is also advantageous). Treating properties like Yet Another Descriptor helps here. As it is you could use Foo.bar.__get__(foo), or generally Foo.bar.__get__ as a callable.

-- 
Ian Bicking  |  http://blog.ianbicking.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ianb at colorstudy.com  Mon Jun  7 18:35:16 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 11:35:16 -0500
Subject: [Python-ideas] Moving development out of the standard library
Message-ID:

OK... after a bit of off-list discussion I realize what I am really concerned about with respect to the standard library wasn't well expressed. So here's my real assertion:

There is no reason any new library or functionality should be tied to a Python release.

Outside of a few exceptions (like ast or importlib), functionality in the standard library seldom relies on anything in a particular Python release; e.g., code might use conditional expressions, but it never *has* to use conditional expressions. The standard library that most people know and love is really the least common denominator of Pythons that person has to handle; for someone writing an open source library that's probably 2.5, for someone using Zope 2 that's traditionally been 2.4, and if you have a controlled environment (e.g., internal development) maybe you can do 2.6.

I think there is a general consensus that functionality should not be tied to a Python release, but the results are ad hoc. That is, truly useful libraries that are added to the stdlib are backported, or more often were originally maintained as a library with backward compatibility before being integrated into the standard library. I think we should have a more formalized process for how this functionality is maintained, including a process that considers the years of ongoing maintenance and improvement that should happen on these libraries. (Most specifically, without serious thought about this development process I am pessimistic about an orderly or positive inclusion of distutils2 in packaging workflows.)

Another alternative is to simply not make improvements to the standard library beyond a very well-defined set of appropriate functionality. This would be much closer to the status quo. Defining what categories would be "appropriate" would be contentious, I am sure, but would sharply focus future discussions.

-- 
Ian Bicking  |  http://blog.ianbicking.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From solipsis at pitrou.net  Mon Jun  7 20:14:34 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Mon, 7 Jun 2010 20:14:34 +0200
Subject: [Python-ideas] Moving development out of the standard library
References:
Message-ID: <20100607201434.66d9bdbd@pitrou.net>

On Mon, 7 Jun 2010 11:35:16 -0500
Ian Bicking wrote:
>
> I think there is a general consensus that functionality should not be tied
> to a Python release, but the results are ad hoc.

I disagree with this. Tying new functionality to a Python release vastly simplifies dependency management (instead of having to track the versions of N external libraries, sometimes with inter-dependencies).

> (Most specifically without serious
> thought about this development process I am pessimistic about an
> orderly or positive inclusion of distutils2 in packaging workflows.)

Without any discussion of specifics, I find it hard to understand what you are concerned about.
From ianb at colorstudy.com Mon Jun 7 20:33:48 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 13:33:48 -0500
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: <20100607201434.66d9bdbd@pitrou.net>
References: <20100607201434.66d9bdbd@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 1:14 PM, Antoine Pitrou wrote:
> On Mon, 7 Jun 2010 11:35:16 -0500
> Ian Bicking wrote:
> >
> > I think there is a general consensus that functionality should not be
> > tied to a Python release, but the results are ad hoc.
>
> I disagree with this. Tying new functionality to a Python release
> vastly simplifies dependency management (instead of having to track the
> versions of N external libraries, sometimes with inter-dependencies).

I say there is consensus because as far as I know anything substantial has
a maintained version outside the standard library; argparse is implicitly,
unittest is unittest2, ElementTree always has maintained a separate
existence, simplejson implicitly.

> > (Most specifically without serious
> > thought about this development process I am pessimistic about an
> > orderly or positive inclusion of distutils2 in packaging workflows.)
>
> Without any discussion of specifics, I find it hard to understand what
> you are concerned about.

1. How will distutils2 updates be made available between Python releases?
2. How will distutils2 features be made available in older Python releases?
3. How will old standard library releases of distutils2 be managed? E.g.,
if pip starts using distutils2, at the time distutils2 ends up in the
standard library there is a version of distutils2 I cannot simply reject
as incompatible, meaning I can't make use of any new features or bug fixes.

--
Ian Bicking | http://blog.ianbicking.org

From brett at python.org Mon Jun 7 20:42:34 2010
From: brett at python.org (Brett Cannon)
Date: Mon, 7 Jun 2010 11:42:34 -0700
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References:
Message-ID:

On Mon, Jun 7, 2010 at 09:35, Ian Bicking wrote:
[SNIP]
> Another alternative is to simply not make improvements to the standard
> library beyond a very well-defined set of appropriate functionality. This
> would be much closer to the status quo. Defining what categories would be
> "appropriate" would be contentious, I am sure, but would sharply focus
> future discussions.

I personally would love to see this happen. Having a more clear focus
for the stdlib would be a good thing in my opinion since as of right
now it's just what the group thinks it is at that point.

From ianb at colorstudy.com Mon Jun 7 20:50:49 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 13:50:49 -0500
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References:
Message-ID:

On Mon, Jun 7, 2010 at 1:42 PM, Brett Cannon wrote:
> On Mon, Jun 7, 2010 at 09:35, Ian Bicking wrote:
> [SNIP]
> > Another alternative is to simply not make improvements to the standard
> > library beyond a very well-defined set of appropriate functionality.
> > This would be much closer to the status quo. Defining what categories
> > would be "appropriate" would be contentious, I am sure, but would
> > sharply focus future discussions.
>
> I personally would love to see this happen. Having a more clear focus
> for the stdlib would be a good thing in my opinion since as of right
> now it's just what the group thinks it is at that point.

Indeed, each person projects different ideas and motivations onto the
standard library and I don't see a great deal of shared understanding
about what it is.

--
Ian Bicking | http://blog.ianbicking.org

From solipsis at pitrou.net Mon Jun 7 20:52:07 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Mon, 7 Jun 2010 20:52:07 +0200
Subject: [Python-ideas] Moving development out of the standard library
References: <20100607201434.66d9bdbd@pitrou.net>
Message-ID: <20100607205207.5532b939@pitrou.net>

On Mon, 7 Jun 2010 13:33:48 -0500
Ian Bicking wrote:
> On Mon, Jun 7, 2010 at 1:14 PM, Antoine Pitrou wrote:
>
> > On Mon, 7 Jun 2010 11:35:16 -0500
> > Ian Bicking wrote:
> > >
> > > I think there is a general consensus that functionality should not be
> > tied
> > > to a Python release, but the results are ad hoc.
> >
> > I disagree with this. Tying new functionality to a Python release
> > vastly simplifies dependency management (instead of having to track the
> > versions of N external libraries, sometimes with inter-dependencies).
>
> I say there is consensus because as far as I know anything substantial has a
> maintained version outside the standard library; argparse is implicitly,
> unittest is unittest2, ElementTree always has maintained a separate
> existence, simplejson implicitly.

"Anything substantial" is more than exaggerated. The modules you are
mentioning are exceptions, two of which may even be temporary (argparse
and unittest2). Most stdlib modules don't have external releases, and
many of them are still "substantial".

> 1. How will distutils2 updates be made available between Python releases?
> 2. How will distutils2 features be made available in older Python releases?

Why are you expecting any of these to happen? I don't know what Tarek
intends to do in that respect, but he certainly doesn't have any moral
obligation to do external releases.

Regards

Antoine.

From ziade.tarek at gmail.com Mon Jun 7 21:04:13 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Mon, 7 Jun 2010 21:04:13 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References:
Message-ID:

On Mon, Jun 7, 2010 at 8:50 PM, Ian Bicking wrote:
> On Mon, Jun 7, 2010 at 1:42 PM, Brett Cannon wrote:
>>
>> On Mon, Jun 7, 2010 at 09:35, Ian Bicking wrote:
>> [SNIP]
>> > Another alternative is to simply not make improvements to the standard
>> > library beyond a very well-defined set of appropriate functionality.
>> > This would be much closer to the status quo. Defining what categories
>> > would be "appropriate" would be contentious, I am sure, but would
>> > sharply focus future discussions.
>>
>> I personally would love to see this happen. Having a more clear focus
>> for the stdlib would be a good thing in my opinion since as of right
>> now it's just what the group thinks it is at that point.
>
> Indeed, each person projects different ideas and motivations onto the
> standard library and I don't see a great deal of shared understanding about
> what it is.

There's one thing that is clear though: Distutils is in the standard
library, and it would be nonsense not to have Distutils2 included to
replace it.
While I understand your motivations not to see Pip included in the
standard library, I will strongly object to the exclusion of Distutils2
from the standard library. I have agreed to temporarily develop Distutils2
outside the stdlib with the sole condition that it will be included back
as soon as it is ready, because we badly need such a system in a vanilla
Python.

Distutils2 is the place where we are implementing the PEPs that were
accepted lately, and its inclusion in the standard library will provide a
working packaging system for Python (that is *batteries included*) and a
blessed playground for third party packaging tools.

Regards
Tarek

--
Tarek Ziadé | http://ziade.org

From george.sakkis at gmail.com Mon Jun 7 21:05:00 2010
From: george.sakkis at gmail.com (George Sakkis)
Date: Mon, 7 Jun 2010 21:05:00 +0200
Subject: [Python-ideas] Callable properties
In-Reply-To:
References:
Message-ID:

On Mon, Jun 7, 2010 at 5:46 PM, Ian Bicking wrote:
> On Mon, Jun 7, 2010 at 5:51 AM, George Sakkis wrote:
>>
>> I'm wondering if there is any downside in making properties callable:
>>
>> class callableproperty(property):
>>     def __call__(self, obj):
>>         return self.fget(obj)
>>
>> class Foo(object):
>>     @property
>>     def bar(self):
>>         return self
>>
>>     @callableproperty
>>     def baz(self):
>>         return self
>>
>>
>> >>> foo = Foo()
>> >>> foo.baz is Foo.baz(foo)
>> True
>> >>> foo.bar is Foo.bar(foo)
>> ...
>> TypeError: 'property' object is not callable
>>
>>
>> As for the motivation, having callable properties would make it easier
>> to stack them with other decorators that typically expect callables.
>> Am I missing something ?
>
> I find stacking descriptors to be easier than it might at first appear (and
> making decorators descriptors is also advantageous). Treating properties
> like Yet Another Descriptor helps here. As it is you could use
> Foo.bar.__get__(foo), or generally Foo.bar.__get__ as a callable.

The problem though is that most existing decorators expect their input to
be a function or a callable at best, not a descriptor; they have to do
something like ``func = getattr(func, '__get__', func)`` to cover all
cases. But regardless, I'm not proposing to make all descriptors callable
(it probably doesn't make sense in general), just properties.

George

From ianb at colorstudy.com Mon Jun 7 21:20:41 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 14:20:41 -0500
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: <20100607205207.5532b939@pitrou.net>
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 1:52 PM, Antoine Pitrou wrote:
> > I say there is consensus because as far as I know anything substantial
> has a
> > maintained version outside the standard library; argparse is implicitly,
> > unittest is unittest2, ElementTree always has maintained a separate
> > existence, simplejson implicitly.
>
> "Anything substantial" is more than exaggerated. The modules you are
> mentioning are exceptions, two of which may even be temporary (argparse
> and unittest2). Most stdlib modules don't have external releases, and
> many of them are still "substantial".

Most other modules are very old. In cases where it hasn't happened, e.g.,
doctest in 2.4, I at least personally have had to backport that module on
my own.

> > 1. How will distutils2 updates be made available between Python releases?
> > 2. How will distutils2 features be made available in older Python releases?
>
> Why are you expecting any of these to happen? I don't know what Tarek
> intends to do in that respect, but he certainly doesn't have any moral
> obligation to do external releases.

distutils2 won't be in 2.7 at least, and any packaging system not available
for Python 2 would be irrelevant.

--
Ian Bicking | http://blog.ianbicking.org

From ziade.tarek at gmail.com Mon Jun 7 21:36:03 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Mon, 7 Jun 2010 21:36:03 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 9:20 PM, Ian Bicking wrote:
[..]
>> > 1. How will distutils2 updates be made available between Python
>> > releases?
>> > 2. How will distutils2 features be made available in older Python
>> > releases?
>>
>> Why are you expecting any of these to happen? I don't know what Tarek
>> intends to do in that respect, but he certainly doesn't have any moral
>> obligation to do external releases.
>
> distutils2 won't be in 2.7 at least, and any packaging system not
> available for Python 2 would be irrelevant.

Distutils2 will be provided for Python 2.4 to 3.x with frequent releases.

Then, once it's added in 3.2 (maybe 3.3 if not ready by then), its release
cycle will be driven by Python's, with backports released at the same pace
for Python versions that didn't have it in the stdlib.

Regards
Tarek

--
Tarek Ziadé | http://ziade.org

From p.f.moore at gmail.com Mon Jun 7 21:42:01 2010
From: p.f.moore at gmail.com (Paul Moore)
Date: Mon, 7 Jun 2010 20:42:01 +0100
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net>
Message-ID:

On 7 June 2010 20:20, Ian Bicking wrote:
> distutils2 won't be in 2.7 at least, and any packaging system not
> available for Python 2 would be irrelevant.

I find it hard to interpret this in any way that doesn't pretty much
imply "Python 3 is irrelevant". While I disagree with that, I suspect
it isn't what you mean. So can you clarify?

Paul.

From ianb at colorstudy.com Mon Jun 7 21:39:50 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 14:39:50 -0500
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 2:36 PM, Tarek Ziadé wrote:
> On Mon, Jun 7, 2010 at 9:20 PM, Ian Bicking wrote:
> [..]
> >> > 1. How will distutils2 updates be made available between Python
> >> > releases?
> >> > 2. How will distutils2 features be made available in older Python
> >> > releases?
> >>
> >> Why are you expecting any of these to happen? I don't know what Tarek
> >> intends to do in that respect, but he certainly doesn't have any moral
> >> obligation to do external releases.
> >
> > distutils2 won't be in 2.7 at least, and any packaging system not
> > available for Python 2 would be irrelevant.
>
> Distutils2 will be provided for Python 2.4 to 3.x with frequent releases.
>
> Then, once it's added in 3.2 (maybe 3.3 if not ready by then), its
> release cycle will be driven by Python's, with backports released at the
> same pace for Python versions that didn't have it in the stdlib.

So let's say distutils2 gets version parity with Python, so distutils2 3.3
is released with Python 3.3. Then distutils2 3.4 is released with Python
3.4, and is backported to all previous versions of Python... except for
Python 3.3, which will always have distutils2 3.3?

--
Ian Bicking | http://blog.ianbicking.org

From ianb at colorstudy.com Mon Jun 7 21:49:38 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 14:49:38 -0500
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 2:42 PM, Paul Moore wrote:
> On 7 June 2010 20:20, Ian Bicking wrote:
> > distutils2 won't be in 2.7 at least, and any packaging system not
> > available for Python 2 would be irrelevant.
>
> I find it hard to interpret this in any way that doesn't pretty much
> imply "Python 3 is irrelevant". While I disagree with that, I suspect
> it isn't what you mean. So can you clarify?

It's kind of a moot point as Tarek isn't planning to only support Python 3.

But developing packaging libraries for *only* Python 3 would mean that an
alternate ecosystem would have to continue to support Python 2, and that
alternate ecosystem would produce superior tools because that's where all
the users actually are. For the next few years at least Python 3 needs to
ride on Python 2's coat tails if it's going to keep up.

--
Ian Bicking | http://blog.ianbicking.org

From solipsis at pitrou.net Mon Jun 7 21:56:22 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Mon, 7 Jun 2010 21:56:22 +0200
Subject: [Python-ideas] Moving development out of the standard library
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net>
Message-ID: <20100607215622.56961a58@pitrou.net>

On Mon, 7 Jun 2010 14:20:41 -0500
Ian Bicking wrote:
> On Mon, Jun 7, 2010 at 1:52 PM, Antoine Pitrou wrote:
>
> > > I say there is consensus because as far as I know anything substantial
> > has a
> > > maintained version outside the standard library; argparse is implicitly,
> > > unittest is unittest2, ElementTree always has maintained a separate
> > > existence, simplejson implicitly.
> >
> > "Anything substantial" is more than exaggerated. The modules you are
> > mentioning are exceptions, two of which may even be temporary (argparse
> > and unittest2). Most stdlib modules don't have external releases, and
> > many of them are still "substantial".
>
> Most other modules are very old.

Well, even if that's true (I haven't checked and I guess we wouldn't
agree on the meaning of "old"), so what?
I guess what I'm asking is: what is your line of reasoning?
You started with a contention that:

"There is no reason any new library or functionality should be tied to a
Python release"

and, in my humble opinion, you failed to demonstrate that. In
particular, you haven't replied to my argument that it
dramatically eases dependency management.

> distutils2 won't be in 2.7 at least, and any packaging system not available
> for Python 2 would be irrelevant.
That's your opinion and I guess some people would disagree. Besides,
decreeing that distutils2 be kept out of the stdlib won't make its code
magically compatible with Python 2.x.

Regards

Antoine.

From ziade.tarek at gmail.com Mon Jun 7 21:57:26 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Mon, 7 Jun 2010 21:57:26 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 9:39 PM, Ian Bicking wrote:
> On Mon, Jun 7, 2010 at 2:36 PM, Tarek Ziadé wrote:
>>
>> On Mon, Jun 7, 2010 at 9:20 PM, Ian Bicking wrote:
>> [..]
>> >> > 1. How will distutils2 updates be made available between Python
>> >> > releases?
>> >> > 2. How will distutils2 features be made available in older Python
>> >> > releases?
>> >>
>> >> Why are you expecting any of these to happen? I don't know what Tarek
>> >> intends to do in that respect, but he certainly doesn't have any moral
>> >> obligation to do external releases.
>> >
>> > distutils2 won't be in 2.7 at least, and any packaging system not
>> > available for Python 2 would be irrelevant.
>>
>> Distutils2 will be provided for Python 2.4 to 3.x with frequent releases.
>>
>> Then, once it's added in 3.2 (maybe 3.3 if not ready by then), its
>> release cycle will be driven by Python's, with backports released at the
>> same pace for Python versions that didn't have it in the stdlib.
>
> So let's say distutils2 gets version parity with Python,

yes, in other words, it's part of the stdlib

> so distutils2 3.3
> is released with Python 3.3. Then distutils2 3.4 is released with Python
> 3.4, and is backported to all previous versions of Python... except for
> Python 3.3, which will always have distutils2 3.3?

Distutils2 will behave exactly like other packages in the standard
library. That is:

- the development is going on in trunk
- bug fixes are backported to older python releases that have this
package present
- some features or small refactorings are also backported when it makes
sense

So Python 3.3.2 will have Distutils2 3.3.2, but also os.path 3.3.2, and
shutil 3.3.2. Likewise, Python 3.4 will have its own versions.

Now for older versions of Python, I will provide a backport at PyPI, so
people can use it under Python 2.x. This backport will probably be made
from the trunk so the 2.x line has the latest code. IOW the latest 2.7
release might be more advanced than the one provided in 3.3 for example,
but I don't see this as a problem.

Tarek

--
Tarek Ziadé | http://ziade.org

From ianb at colorstudy.com Mon Jun 7 22:00:17 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 15:00:17 -0500
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 2:57 PM, Tarek Ziadé wrote:
> Now for older versions of Python, I will provide a backport at PyPI, so
> people can use it under Python 2.x. This backport will probably be made
> from the trunk so the 2.x line has the latest code. IOW the latest 2.7
> release might be more advanced than the one provided in 3.3 for example,
> but I don't see this as a problem.

It means that, at least for pip, distutils2 3.3 would be effectively the
last version. If there are important bugs we'll have to work around them.
If there are added features we'll have to ignore them.
--
Ian Bicking | http://blog.ianbicking.org

From ianb at colorstudy.com Mon Jun 7 22:09:36 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 15:09:36 -0500
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: <20100607215622.56961a58@pitrou.net>
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <20100607215622.56961a58@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 2:56 PM, Antoine Pitrou wrote:
> On Mon, 7 Jun 2010 14:20:41 -0500
> Ian Bicking wrote:
> > On Mon, Jun 7, 2010 at 1:52 PM, Antoine Pitrou
> > <solipsis-xNDA5Wrcr86sTnJN9+BGXg at public.gmane.org> wrote:
> >
> > > > I say there is consensus because as far as I know anything
> substantial
> > > has a
> > > > maintained version outside the standard library; argparse is
> implicitly,
> > > > unittest is unittest2, ElementTree always has maintained a separate
> > > > existence, simplejson implicitly.
> > >
> > > "Anything substantial" is more than exaggerated. The modules you are
> > > mentioning are exceptions, two of which may even be temporary (argparse
> > > and unittest2). Most stdlib modules don't have external releases, and
> > > many of them are still "substantial".
> >
> > Most other modules are very old.
>
> Well, even if that's true (I haven't checked and I guess we wouldn't
> agree on the meaning of "old"), so what?
> I guess what I'm asking is: what is your line of reasoning?
> You started with a contention that:
>
> "There is no reason any new library or functionality should be tied to a
> Python release"
>
> and, in my humble opinion, you failed to demonstrate that. In
> particular, you haven't replied to my argument that it
> dramatically eases dependency management.

It only eases dependency management in closed systems with a fixed Python
version. If you support more than one Python version then you still have
dependency management; it is just tied to ad hoc workarounds when there
are compatibility problems, or ignoring new functionality.

The importance of "old" vs. "new" modules is that people tend to have a
lowest version of Python they support, as simply a hard stop. This is
currently Python 2.5 for most people, 2.4 for some groups (and just a very
few stragglers with 2.3). So long as you never use anything beyond what
2.5 provides then it's okay, which is most of the standard library. I am
not aware of anything added since 2.5 that isn't backported or previously
available as a separate library (I'm sure there's *something*, just
nothing I can think of).

There is no clear policy about how backports are managed. There's some
disfavor for backporting under the same name (PyXML being a primary source
of this disfavor), but implicitly argparse *will* be backported as such as
it is already installed under argparse and that name matches the stdlib
name. It's unclear what should happen when you install a backport in a
version of Python that already has the module. E.g., if distutils2 is
distributed as distutils2 it would be able to override the standard
library when installed unless there was code specifically to disallow it.

Also we have implicit dependency management and versioning already because
a few libraries in the standard library have .egg-info files installed.

--
Ian Bicking | http://blog.ianbicking.org
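[Ian's closing point can be seen directly with setuptools installed -- a
sketch, assuming pkg_resources picks up the wsgiref.egg-info file that
ships inside the 2.5/2.6 standard library, so the stdlib module answers
version queries like any other distribution:]

import pkg_resources

dist = pkg_resources.get_distribution('wsgiref')
print(dist.version)    # e.g. 0.1.2 -- versioned independently of Python
print(dist.location)   # resolves to the standard library directory itself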
From ncoghlan at gmail.com Mon Jun 7 22:24:54 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 08 Jun 2010 06:24:54 +1000
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References:
Message-ID: <4C0D5596.7080606@gmail.com>

On 08/06/10 02:35, Ian Bicking wrote:
> OK... after a bit of off-list discussion I realize what I am really
> concerned about with respect to the standard library wasn't well
> expressed. So here's my real assertion:
>
> There is no reason any new library or functionality should be tied to
> a Python release.

Rather than rehash the point Antoine has already made regarding
dependency management, I'll mention the other key benefit of standard
library inclusion:

Inclusion of a module or package in the standard library makes sense
when the benefit of having "One Obvious Way To Do It" simplifies
teaching of Python and development and maintenance of future Python
code sufficiently to justify the slower rate of release associated with
standard library inclusion.

I'm one of those who believe that the volume of all currently written
Python code is a small subset of the Python code that will ever be
written, hence it makes sense to "raise the bar" in improving the
quality of the baseline toolset provided to developers. Additions like
argparse and the new futures module in PEP 3148, as well as historical
additions like itertools, collections, ElementTree, simplejson, etc all
serve that purpose well.

However, I *also* like the pattern that has emerged of many standard
library modules being kept backwards compatible with previous Python
releases, and sometimes even being separately released on PyPI (or
elsewhere). This approach allows the "one obvious way" to be extended
back to earlier Python versions, since eventual standard library
inclusion is a big point in a module or package's favour, even if a
developer isn't currently using the latest Python release.

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
---------------------------------------------------------------

From ziade.tarek at gmail.com Mon Jun 7 22:40:15 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Mon, 7 Jun 2010 22:40:15 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 10:00 PM, Ian Bicking wrote:
> On Mon, Jun 7, 2010 at 2:57 PM, Tarek Ziadé wrote:
>>
>> Now for older versions of Python, I will provide a backport at PyPI, so
>> people can use it under Python 2.x. This backport will probably be made
>> from the trunk so the 2.x line has the latest code. IOW the latest 2.7
>> release might be more advanced than the one provided in 3.3 for example,
>> but I don't see this as a problem.
>
> It means that, at least for pip, distutils2 3.3 would be effectively the
> last version.

To make sure it's clear: the latest would be 3.4 here, with its backport
in 2.7 and an older version in 3.3.

> If there are important bugs we'll have to work around them.
> If there are added features we'll have to ignore them.

Not for the bug fixes, because they will likely be backported in all
versions (3.3 and 2.7).

Now for new features, if pip uses the latest 2.x and the latest 3.x
versions, you will get them.
I am not sure why you would have to ignore them.
You would probably want to use the new features when they are released,
and still make your code work with older versions.

This is not a new problem btw: if you want to support several versions
of Python, you have to work around the differences.

Example: There's a big bug in tarfile in Python 2.4, and I had to
backport part of the 2.5 version for a while in my 2.4 projects. That
doesn't mean I don't want tarfile to be in the stdlib.

Tarek

From ziade.tarek at gmail.com Mon Jun 7 22:42:44 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Mon, 7 Jun 2010 22:42:44 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <20100607215622.56961a58@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 10:09 PM, Ian Bicking wrote:
[..]
>
> Also we have implicit dependency management and versioning already because
> a few libraries in the standard library have .egg-info files installed.

If you are talking about wsgiref, that's the only one that has the
egg-info in the stdlib, and it will go away soonish (probably in 3.2)
since:

- this format will be replaced by the new dist-info (PEP 376).
- it was a mistake imho to add it, since wsgiref was not a distribution
in that case

--
Tarek Ziadé | http://ziade.org

From eric at trueblade.com Mon Jun 7 23:20:48 2010
From: eric at trueblade.com (Eric Smith)
Date: Mon, 07 Jun 2010 17:20:48 -0400
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net>
Message-ID: <4C0D62B0.4020000@trueblade.com>

Tarek Ziadé wrote:
> On Mon, Jun 7, 2010 at 10:00 PM, Ian Bicking wrote:
>> On Mon, Jun 7, 2010 at 2:57 PM, Tarek Ziadé wrote:
>> If there are important bugs we'll have to work around them.
>> If there are added features we'll have to ignore them.
>
> Not for the bug fixes, because they will likely be backported in all
> versions (3.3 and 2.7).
>
> Now for new features, if pip uses the latest 2.x and the latest 3.x
> versions, you will get them.
> I am not sure why you would have to ignore them. You would probably want
> to use the new features when they are released, and still make your code
> work with older versions.

There's no way for the new features to show up in 3.3, is there? You
can't add them to a micro release, and you can't replace a module in the
standard library. I think that's Ian's point.

pip could use the new features in 3.4, and it could get the new features
in 2.x if the users were willing to install the updated library, since
it's not in the stdlib. But for 3.3 you'd be stuck.

--
Eric.

From ianb at colorstudy.com Mon Jun 7 23:22:56 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 16:22:56 -0500
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 3:40 PM, Tarek Ziadé wrote:
> On Mon, Jun 7, 2010 at 10:00 PM, Ian Bicking wrote:
> >
> >> Now for older versions of Python, I will provide a backport at PyPI,
> >> so people can use
> >> it under Python 2.x. This backport will probably be made from the
> >> trunk so the 2.x line has the latest code. IOW the latest 2.7 release
> >> might be more advanced than the one provided in 3.3 for example, but
> >> I don't see this as a problem.
> >
> > It means that, at least for pip, distutils2 3.3 would be effectively the
> > last version.
>
> To make sure it's clear: the latest would be 3.4 here, with its backport
> in 2.7 and an older version in 3.3.

The latest version pip could *depend on* would be 3.3, meaning all
subsequent releases would be relatively unimportant. If you have to work
around a known-bad version of something, then all later versions become
liabilities instead of improvements (unless the workaround is egregiously
ugly). I'd be okay saying that pip could require 3.4+, but not if it means
Python 3.3 users would be excluded.

> > If there are important bugs we'll have to work around them.
> > If there are added features we'll have to ignore them.
>
> Not for the bug fixes, because they will likely be backported in all
> versions (3.3 and 2.7).

Only in minor releases, which are new Python releases and take a long time
to be widely enough distributed to depend on the bug fix. Though
admittedly less time than waiting for a major release to die.

> Now for new features, if pip uses the latest 2.x and the latest 3.x
> versions, you will get them.
> I am not sure why you would have to ignore them. You would probably want
> to use the new features when they are released, and still make your code
> work with older versions.

A feature that may or may not be available is not a useful feature. We'll
just have to backport distutils2 features in an ad hoc way using
conditional imports and other nuisances.

> This is not a new problem btw: if you want to support several versions
> of Python, you have to work around the differences.

Yes, but because the standard library changes so little it's not too bad,
and in some cases we can rely on backports, and otherwise we simply ignore
new functionality.

> Example: There's a big bug in tarfile in Python 2.4, and I had to
> backport part of the 2.5 version for a while in my 2.4 projects. That
> doesn't mean I don't want tarfile to be in the stdlib.

Instead of conditionally monkeypatching tarfile, I'd rather I just had a
known-good version of tarfile. And maybe in a sense that is a solution;
why try to patch tarfile at all, why not just include swaths of the
standard library inline with libraries? Right now typically in projects
I've noticed we carefully tease apart a library's bugs when monkeypatching
in an upgrade... but that's probably not worth it.

OTOH, I don't think people would be happy if I just included all of
distutils2 in pip with some sys.path magic to "upgrade" distutils2 as
needed. But then... it might be the most reasonable approach.

Or another option, allow versioning of portions of the standard library
as need demands. Versioned portions of the standard library may still be
quite constrained with respect to backward compatibility, but at least
there would be an orderly way to handle backports and for libraries to
require bugfixes instead of monkeypatching them in. Maybe an additional
constraint would be that all new features have to be in new modules or
via new names, and so upgrades would be additive and less likely to
affect backward compatibility. And we just keep bad stuff out of the
standard library (since perhaps the PyXML lesson conflates a namespace
issue with what was simply a lot of bad code).
As an example of how this might have worked, unittest enhancements would
have been in a separate module or as a TestCase subclass (TestCase2) or
something else highly unintrusive, and the result could be installed in
any version of Python with little danger of conflicts. I.e., API
versioning (for the standard library only) gets pushed into module and
class names and isn't separate metadata.

--
Ian Bicking | http://blog.ianbicking.org

From solipsis at pitrou.net Mon Jun 7 23:27:12 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Mon, 7 Jun 2010 23:27:12 +0200
Subject: [Python-ideas] Moving development out of the standard library
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <20100607215622.56961a58@pitrou.net>
Message-ID: <20100607232712.037570f0@pitrou.net>

On Mon, 7 Jun 2010 15:09:36 -0500
Ian Bicking wrote:
> > I guess what I'm asking is: what is your line of reasoning?
> > You started with a contention that:
> >
> > "There is no reason any new library or functionality should be tied to a
> > Python release"
> >
> > and, in my humble opinion, you failed to demonstrate that. In
> > particular, you haven't replied to my argument that it
> > dramatically eases dependency management.
>
> It only eases dependency management in closed systems with a fixed Python
> version. If you support more than one Python version then you still have
> dependency management; it is just tied to ad hoc workarounds when there
> are compatibility problems, or ignoring new functionality.

We're misunderstanding each other. I'm talking about dependency
management for users (or application developers), you seem to be talking
about dependency management for library developers.

As for "ad hoc workarounds" and various "compatibility problems", they
wouldn't disappear if the stdlib became smaller; I have trouble
understanding what kind of solution you think would eliminate these
issues.

> The importance of "old" vs. "new" modules is that people tend to have a
> lowest version of Python they support, as simply a hard stop.

Yes, and it's the same for external modules too. For example, they will
support Twisted 8.0 and up. So what's the difference?

> I am not
> aware of anything added since 2.5 that isn't backported or previously
> available as a separate library (I'm sure there's *something*, just nothing
> I can think of).
Really, I'm too lazy to go and read the changelogs, but there
definitely are many improvements that are not available in Python 2.5
and older.

> There is no clear policy about how backports are managed.

That's because, contrary to what you seem to think, external backports
are the exception and not the rule.

Regards

Antoine.

From ianb at colorstudy.com Tue Jun 8 00:16:17 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 17:16:17 -0500
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: <20100607232712.037570f0@pitrou.net>
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <20100607215622.56961a58@pitrou.net> <20100607232712.037570f0@pitrou.net>
Message-ID:

On Mon, Jun 7, 2010 at 4:27 PM, Antoine Pitrou wrote:
> > I am not
> > aware of anything added since 2.5 that isn't backported or previously
> > available as a separate library (I'm sure there's *something*, just
> nothing
> > I can think of).
>
> Really, I'm too lazy to go and read the changelogs, but there
> definitely are many improvements that are not available in Python 2.5
> and older.
>
> > There is no clear policy about how backports are managed.
>
> That's because, contrary to what you seem to think, external backports
> are the exception and not the rule.

I offered examples, you were too lazy to read the changelogs, and your
sweeping declaration does not seem justified.

From What's New in Python 2.6:

contextlib: attached to a language feature.
multiprocessing: backported
new string formatting: though a method, it'd be easy to produce in a
module form. I'm not aware of a backport.
abstract base classes: I don't think this can be backported without all
kinds of contortions
ast: associated with the implementation/language
json: backported (implicitly, as it is simplejson)
plistlib: I'm guessing this was added to support packaging. It exists
separately.

What's New in Python 2.7:

argparse: implicitly backported
changes to logging: not sure what will happen with this; the module has
been backported in the past
memoryview: not aware of it being backported
importlib: technically including it in 2.7 is a backport, but otherwise no
ttk: appears to be backported (http://pypi.python.org/pypi/pyttk/)
unittest: backported
ElementTree: backported

Digging deeper into 2.5:

functools: apparently backported at one time, now defunct
ctypes: the backport appears to be dead
sqlite3: actively developed (different name?)
wsgiref: backported
hashlib: backported

Every release there are some additions to collections, which have not
been backported.

So in summary, of 17 additions which seemed "backportable" to me (not
counting 3 modules that seemed tied to language features):

* 4 were not backported
* 3 have defunct or ambiguous backports
* 10 were backported

--
Ian Bicking | http://blog.ianbicking.org

From solipsis at pitrou.net Tue Jun 8 00:32:30 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Tue, 08 Jun 2010 00:32:30 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <20100607215622.56961a58@pitrou.net> <20100607232712.037570f0@pitrou.net>
Message-ID: <1275949950.3222.22.camel@localhost.localdomain>

On Monday, 07 June 2010 at 17:16 -0500, Ian Bicking wrote:

> I offered examples, you were too lazy to read the changelogs, and your
> sweeping declaration does not seem justified.

Sure, but my sweeping declaration is justified by the fact that I'm a
daily contributor to Python core and know what kinds of things happen
here.

> What's New in Python 2.7:
[snip]

Your list seems to forget lots of module-specific improvements. There
are many more things in
http://docs.python.org/dev/whatsnew/2.7.html#new-and-improved-modules ,
and most of them aren't "backported" in any fashion.

> So in summary, of 17 additions which seemed "backportable" to me (not
> counting 3 modules that seemed tied to language features):
> * 4 were not backported
> * 3 have defunct or ambiguous backports
> * 10 were backported

Of course this all depends on your definition of "backportable". If you
remove that arbitrary condition, the fact remains that most
improvements, small or big, don't get backported.

Regards

Antoine.
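[The backports in Ian's inventory are typically consumed with a
conditional import. A sketch of the usual pattern, using two of the real
pairs listed above:]

try:
    import simplejson as json      # external release, updated between Python releases
except ImportError:
    import json                    # stdlib copy, present since 2.6

try:
    import unittest2 as unittest   # backport of the 2.7 unittest additions
except ImportError:
    import unittest                # fall back to whatever the stdlib provides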
From fuzzyman at voidspace.org.uk Tue Jun 8 00:45:21 2010
From: fuzzyman at voidspace.org.uk (Michael Foord)
Date: Mon, 7 Jun 2010 23:45:21 +0100
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: <4C0D62B0.4020000@trueblade.com>
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
Message-ID:

On 7 June 2010 22:20, Eric Smith wrote:
> Tarek Ziadé wrote:
>
>> On Mon, Jun 7, 2010 at 10:00 PM, Ian Bicking wrote:
>>
>>> On Mon, Jun 7, 2010 at 2:57 PM, Tarek Ziadé wrote:
>>> If there are important bugs we'll have to work around them.
>>> If there are added features we'll have to ignore them.
>>
>> Not for the bug fixes, because they will likely be backported in all
>> versions (3.3 and 2.7).
>>
>> Now for new features, if pip uses the latest 2.x and the latest 3.x
>> versions, you will get them.
>> I am not sure why you would have to ignore them. You would probably want
>> to use the new features when they are released, and still make your code
>> work with older versions.
>
> There's no way for the new features to show up in 3.3, is there? You
> can't add them to a micro release, and you can't replace a module in the
> standard library. I think that's Ian's point.

But that's no different to pip using *any* standard library module. If you
want to support Python 2.4 you can't use os.path.relpath (or you have to
provide it yourself anyway) for example.

Michael

> pip could use the new features in 3.4, and it could get the new features
> in 2.x if the users were willing to install the updated library, since
> it's not in the stdlib. But for 3.3 you'd be stuck.
>
> --
> Eric.

--
http://www.voidspace.org.uk

From ianb at colorstudy.com Tue Jun 8 03:49:46 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Mon, 7 Jun 2010 20:49:46 -0500
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
Message-ID:

On Mon, Jun 7, 2010 at 5:45 PM, Michael Foord wrote:
> On 7 June 2010 22:20, Eric Smith wrote:
>> Tarek Ziadé wrote:
>>>
>>> On Mon, Jun 7, 2010 at 10:00 PM, Ian Bicking wrote:
>>>>
>>>> On Mon, Jun 7, 2010 at 2:57 PM, Tarek Ziadé wrote:
>>>> If there are important bugs we'll have to work around them.
>>>> If there are added features we'll have to ignore them.
>>>
>>> Not for the bug fixes, because they will likely be backported in all
>>> versions (3.3 and 2.7).
>>>
>>> Now for new features, if pip uses the latest 2.x and the latest 3.x
>>> versions, you will get them.
>>> I am not sure why you would have to ignore them. You would probably want
>>> to use the new features when they are released, and still make your code
>>> work with older versions.
>>
>> There's no way for the new features to show up in 3.3, is there? You
>> can't add them to a micro release, and you can't replace a module in the
>> standard library. I think that's Ian's point.
>
> But that's no different to pip using *any* standard library module. If you
> want to support Python 2.4 you can't use os.path.relpath (or you have to
> provide it yourself anyway) for example.
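[The "provide it yourself" shim Michael mentions usually looks something
like the following -- a simplified sketch that ignores edge cases such as
Windows drive letters:]

import os

try:
    relpath = os.path.relpath          # in the stdlib from Python 2.6 on
except AttributeError:
    def relpath(path, start=os.path.curdir):
        # minimal stand-in for older Pythons
        path_parts = os.path.abspath(path).split(os.path.sep)
        start_parts = os.path.abspath(start).split(os.path.sep)
        common = 0
        for p, s in zip(path_parts, start_parts):
            if p != s:
                break
            common += 1
        parts = [os.path.pardir] * (len(start_parts) - common)
        parts.extend(path_parts[common:])
        if not parts:
            return os.path.curdir
        return os.path.join(*parts)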
This is part of why I don't care about reforming or modifying what's in
the standard library now -- I know the constraints well, and they can't
be changed. I'm solely concerned about new functionality, which need not
repeat this pattern.

--
Ian Bicking | http://blog.ianbicking.org

From rrr at ronadam.com Tue Jun 8 07:29:20 2010
From: rrr at ronadam.com (Ron Adam)
Date: Tue, 08 Jun 2010 00:29:20 -0500
Subject: [Python-ideas] stdlib upgrades
In-Reply-To:
References: <4C0C92AE.5030105@ronadam.com>
Message-ID: <4C0DD530.8000907@ronadam.com>

On 06/07/2010 09:37 AM, Bruce Frederiksen wrote:
> Or perhaps:
>
> from experimental import new_module
>
> This is kind of a guarantee that the interface will change; since at
> some point, if new_module is "calcified", this will have to be changed
> to just:
>
> import new_module
>
> For experimental language features, maybe:
>
> from __experimental__ import new_feature
>
> This makes it clear that new_feature may change (perhaps even not be
> adapted?), vs the from __future__ semantics.
>
> Is it too complicated to try to differentiate between the decision of
> whether some capability will be provided or not, vs ironing out the API
> for that capability?
>
> For example,
>
> from experimental import new_capability
>
> means that there is no commitment for new_capability at all -- it may
> simply be dropped entirely. The danger of using this is that
> new_capability may simply disappear completely with no replacement.
>
> While,
>
> from proposed import new_capability
>
> represents a commitment that new_capability will be provided at some
> point, but the API will likely change. Here the danger of using it is
> that you will likely have to change your program to conform to a new API.
>
> A capability might start as "experimental", and if the value of it is
> demonstrated, move to "proposed" to work out the details before
> mainstreaming it.
>
> -Bruce

I was thinking of something a bit more limited in scope, i.e. only those
modules already accepted for inclusion in a future release. But you have
grasped the concept I was thinking of. However, experimental and proposed
libraries cover quite a lot more.

I think the svn sandbox directory sort of serves that purpose now,
although it isn't organized in any way that makes it easy for someone to
tell what is what as far as being experimental, proposed, accepted and
under active development, or something that is just lying around for
future or past reference. Maybe a bit of organizing the sandbox with
categorized sub-folders would be good?

I really was just throwing out an idea in regard to limiting some of the
problems of after-the-fact fixes or changes. I.e., give a new module a
bit more exposure to a wider audience before it's actually included.

Ron

> On Mon, Jun 7, 2010 at 2:33 AM, Ron Adam wrote:
>
> On 06/01/2010 08:22 PM, Brett Cannon wrote:
>
> I can only see two scenarios that might be considered acceptable to
> address these issues.
>
> One is that when new modules are accepted into the stdlib they are
> flagged with an ExperimentalWarning so that people know that no
> backwards-compatibility promises have been made yet. That gets the
> module more exposure and gets python-dev real-world feedback to fix
> issues before the module calcifies into a strong
> backwards-compatibility promise. With that experience more proper
> decisions can be made as to how to change things (e.g. the logging module's
> default timestamp including microseconds which strptime cannot parse).
>
> Would it be possible to have a future_lib that gets enabled with
> something like...
>
> from __future__ import future_lib
>
> These *new* library modules and packages won't be visible by default.
> Maybe they stay there until the next major version or possibly some set
> period of time.
>
> Ron

From ziade.tarek at gmail.com Tue Jun 8 10:05:26 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Tue, 8 Jun 2010 10:05:26 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: <4C0D62B0.4020000@trueblade.com>
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
Message-ID:

On Mon, Jun 7, 2010 at 11:20 PM, Eric Smith wrote:
> Tarek Ziadé wrote:
>>
>> On Mon, Jun 7, 2010 at 10:00 PM, Ian Bicking wrote:
>>>
>>> On Mon, Jun 7, 2010 at 2:57 PM, Tarek Ziadé wrote:
>>> If there are important bugs we'll have to work around them.
>>> If there are added features we'll have to ignore them.
>>
>> Not for the bug fixes, because they will likely be backported in all
>> versions (3.3 and 2.7).
>>
>> Now for new features, if pip uses the latest 2.x and the latest 3.x
>> versions, you will get them.
>> I am not sure why you would have to ignore them. You would probably want
>> to use the new features when they are released, and still make your code
>> work with older versions.
>
> There's no way for the new features to show up in 3.3, is there? You
> can't add them to a micro release, and you can't replace a module in the
> standard library. I think that's Ian's point.

Yes, I understood that. My point is that you can adapt your software when
the Python version you use is an old one and you don't have the latest
feature. That's how we work with all modules/packages from the stdlib,
and features are added at every Python version. The stdlib is not frozen.

> pip could use the new features in 3.4, and it could get the new features
> in 2.x if the users were willing to install the updated library, since
> it's not in the stdlib. But for 3.3 you'd be stuck.

Not stuck, but you will definitely need to deal with it in your project.
And this is not new.

That's why I have suggested earlier authorizing partial updates of the
stdlib, but it seemed that the idea was disliked because of the
complexity it would bring.

Regards
Tarek

--
Tarek Ziadé | http://ziade.org

From ziade.tarek at gmail.com Tue Jun 8 10:08:04 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Tue, 8 Jun 2010 10:08:04 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
Message-ID:

On Tue, Jun 8, 2010 at 3:49 AM, Ian Bicking wrote:
> On Mon, Jun 7, 2010 at 5:45 PM, Michael Foord wrote:
>>
>> On 7 June 2010 22:20, Eric Smith wrote:
>>>
>>> Tarek Ziadé wrote:
>>>>
>>>> On Mon, Jun 7, 2010 at 10:00 PM, Ian Bicking wrote:
>>>>>
>>>>> On Mon, Jun 7, 2010 at 2:57 PM, Tarek Ziadé wrote:
>>>>> If there are important bugs we'll have to work around them.
>>>>> If there are added features we'll have to ignore them.
>>>>
>>>> Not for the bug fixes, because they will likely be backported in all
>>>> versions (3.3 and 2.7).
>>>>
>>>> Now for new features, if pip uses the latest 2.x and the latest 3.x
>>>> versions, you will get them.
>>>> I am not sure why you would have to ignore them. You would probably
>>>> want to use the new features when they are released, and still make
>>>> your code work with older versions.
>>>
>>> There's no way for the new features to show up in 3.3, is there? You
>>> can't add them to a micro release, and you can't replace a module in
>>> the standard library. I think that's Ian's point.
>>
>> But that's no different to pip using *any* standard library module. If
>> you want to support Python 2.4 you can't use os.path.relpath (or you
>> have to provide it yourself anyway) for example.
>
> This is part of why I don't care about reforming or modifying what's in
> the standard library now -- I know the constraints well, and they can't
> be changed. I'm solely concerned about new functionality, which need not
> repeat this pattern.

Are you suggesting we freeze stdlib development, so you don't have to
deal with different Python versions at your level?

If so, that doesn't sound right, because making the "batteries included"
evolve is part (imho) of the Python spirit, and the constraints we are
talking about right now are not as huge a problem as you seem to think,
in my opinion. I don't find it extremely hard to cope with various Python
versions.

--
Tarek Ziadé | http://ziade.org

From tjreedy at udel.edu Tue Jun 8 21:33:30 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Tue, 08 Jun 2010 15:33:30 -0400
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: <4C0D62B0.4020000@trueblade.com>
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
Message-ID:

On 6/7/2010 5:20 PM, Eric Smith wrote:

> pip could use the new features in 3.4, and it could get the new features
> in 2.x if the users were willing to install the updated library, since
> it's not in the stdlib. But for 3.3 you'd be stuck.

I see no reason why 3.3 users could not also download the 3.4 version of
something just as well as users of earlier versions (who might well
already have the 3.3 version).

From anfedorov at gmail.com Sun Jun 6 22:40:36 2010
From: anfedorov at gmail.com (Andrey Fedorov)
Date: Sun, 6 Jun 2010 16:40:36 -0400
Subject: [Python-ideas] @setattr(obj, [name])
In-Reply-To:
References:
Message-ID:

George Sakkis wrote:
> Still "method_of" is not quite right either since it can also be used as
> a class decorator

Great point.
From ncoghlan at gmail.com Tue Jun 8 22:56:35 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 09 Jun 2010 06:56:35 +1000
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To:
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
Message-ID: <4C0EAE83.30300@gmail.com>

On 09/06/10 05:33, Terry Reedy wrote:
> On 6/7/2010 5:20 PM, Eric Smith wrote:
>
>> pip could use the new features in 3.4, and it could get the new features
>> in 2.x if the users were willing to install the updated library, since
>> it's not in the stdlib. But for 3.3 you'd be stuck.
>
> I see no reason why 3.3 users could not also download the 3.4 version of
> something just as well as users of earlier versions (who might well
> already have the 3.3 version).

I believe Eric was pointing out the fact that, by default, there is no
directory on sys.path that will override the standard library versions of
a module or package for all applications using that interpreter
installation.

So you're forced to either resort to destructive replacement (actually
overwriting the standard library module on disk) or else tinker with
sys.path in each app or library to insert an "override" directory before
the normal standard library paths.

It sometimes seems to me that, for the advocates of a more granular
standard library, proposing standardisation of such an override directory
would be an interesting way to test the waters (since it would make it
much easier to drop in backported updates to standard library modules).

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
---------------------------------------------------------------

From ianb at colorstudy.com Tue Jun 8 23:01:34 2010
From: ianb at colorstudy.com (Ian Bicking)
Date: Tue, 8 Jun 2010 16:01:34 -0500
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: <4C0EAE83.30300@gmail.com>
References: <20100607201434.66d9bdbd@pitrou.net> <20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com>
Message-ID:

On Tue, Jun 8, 2010 at 3:56 PM, Nick Coghlan wrote:
> On 09/06/10 05:33, Terry Reedy wrote:
>
>> On 6/7/2010 5:20 PM, Eric Smith wrote:
>>
>>> pip could use the new features in 3.4, and it could get the new features
>>> in 2.x if the users were willing to install the updated library, since
>>> it's not in the stdlib. But for 3.3 you'd be stuck.
>>
>> I see no reason why 3.3 users could not also download the 3.4 version of
>> something just as well as users of earlier versions (who might well
>> already have the 3.3 version).
>
> I believe Eric was pointing out the fact that, by default, there is no
> directory on sys.path that will override the standard library versions of
> a module or package for all applications using that interpreter
> installation.
>
> So you're forced to either resort to destructive replacement (actually
> overwriting the standard library module on disk) or else tinker with
> sys.path in each app or library to insert an "override" directory before
> the normal standard library paths.
>
> It sometimes seems to me that, for the advocates of a more granular
> standard library, proposing standardisation of such an override directory
> would be an interesting way to test the waters (since it would make it
> much easier to drop in backported updates to standard library modules).
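[The override directory Nick describes can be prototyped in a couple of
lines -- a sketch, with the directory name invented for illustration.
Today this would have to run in every application, e.g. from
sitecustomize, which is exactly the per-app tinkering he objects to:]

import os
import sys

# Hypothetical blessed location for backported stdlib modules.
OVERRIDE_DIR = os.path.join(sys.prefix, 'lib', 'stdlib-overrides')

if os.path.isdir(OVERRIDE_DIR) and OVERRIDE_DIR not in sys.path:
    sys.path.insert(0, OVERRIDE_DIR)   # searched ahead of the stdlib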
>

Setuptools uses .pth file hackery to handle this case (not to intentionally
override the stdlib, but it would also do that), and I think it could apply
here as well. Also the way Setuptools installs eggs might be useful in this
case, as it makes it very obvious in tracebacks what version of a module is
being used (and if it is from the standard library). Even if the particular
mechanics were revisited, I think these basic ideas would be helpful.

--
Ian Bicking | http://blog.ianbicking.org

From ziade.tarek at gmail.com  Tue Jun  8 23:08:57 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Tue, 8 Jun 2010 23:08:57 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: 
References: <20100607201434.66d9bdbd@pitrou.net>
	<20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
	<4C0EAE83.30300@gmail.com>
Message-ID: 

On Tue, Jun 8, 2010 at 11:01 PM, Ian Bicking wrote:
> On Tue, Jun 8, 2010 at 3:56 PM, Nick Coghlan wrote:
>>
>> On 09/06/10 05:33, Terry Reedy wrote:
>>>
>>> On 6/7/2010 5:20 PM, Eric Smith wrote:
>>>
>>>> pip could use the new features in 3.4, and it could get the new features
>>>> in 2.x if the users were willing to install the updated library, since
>>>> it's not in the stdlib. But for 3.3 you'd be stuck.
>>>
>>> I see no reason why 3.3 users could not also download the 3.4 version of
>>> something just as well as users of earlier versions (who might well
>>> already have the 3.3 version).
>>
>> I believe Eric was pointing out the fact that, by default, there is no
>> directory on sys.path that will override the standard library versions of a
>> module or package for all applications using that interpreter installation.
>>
>> So you're forced to either resort to destructive replacement (actually
>> overwriting the standard library module on disk) or else tinkering with
>> sys.path in each app or a library to insert an "override" directory before
>> the normal standard library paths.
>>
>> It sometimes seems to me that, for the advocates of a more granular
>> standard library, proposing standardisation of such an override directory
>> would be an interesting way to test the waters (since it would make it much
>> easier to drop in backported updates to standard library modules).
>
> Setuptools uses .pth file hackery to handle this case (not to intentionally
> override the stdlib, but it would also do that), and I think it could apply
> here as well. Also the way Setuptools installs eggs might be useful in this
> case, as it makes it very obvious in tracebacks what version of a module is
> being used (and if it is from the standard library). Even if the particular
> mechanics were revisited, I think these basic ideas would be helpful.

The problem is, any project would start overriding the stdlib to fix
things or change some behavior, unless this is somehow controlled.

In the proposal I've made earlier to update partially the stdlib, I've
proposed to have a specific area at PyPI for distributions that are
"blessed" to override the stdlib packages/modules.
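To make the mechanics being debated here concrete, a minimal sketch of the
sys.path tinkering Nick and Ian describe -- the directory name
stdlib-overrides is invented for illustration, not an agreed design:

    import os
    import sys

    # Hypothetical per-installation directory holding "blessed" overrides.
    overrides = os.path.join(sys.prefix, 'stdlib-overrides')

    if os.path.isdir(overrides) and overrides not in sys.path:
        # Prepend it so imports find an override before the stdlib copy.
        sys.path.insert(0, overrides)

Anything dropped into such a directory would shadow the stdlib version for
every program run from that interpreter, which is exactly why Tarek wants
such overrides controlled.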
From stephen at xemacs.org  Wed Jun  9 02:30:33 2010
From: stephen at xemacs.org (Stephen J. Turnbull)
Date: Wed, 09 Jun 2010 09:30:33 +0900
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: 
References: <20100607201434.66d9bdbd@pitrou.net>
	<20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
	<4C0EAE83.30300@gmail.com>
Message-ID: <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp>

Tarek Ziadé writes:

 > The problem is, any project would start overriding the stdlib to
 > fix things or change some behavior, unless this is somehow
 > controlled.

But in the end, that's precisely what you propose to do yourself with
"partial stdlib upgrades"!

It's just that you trust yourself more than you trust "any project".
But that just doesn't fly from the point of the third party clients of
the stdlib. Either stability of any particular version's stdlib
applies to the stdlib developers too, or it doesn't really apply at
all.

From ziade.tarek at gmail.com  Wed Jun  9 09:17:59 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Wed, 9 Jun 2010 09:17:59 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp>
References: <20100607201434.66d9bdbd@pitrou.net>
	<20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
	<4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: 

On Wed, Jun 9, 2010 at 2:30 AM, Stephen J. Turnbull wrote:
> Tarek Ziadé writes:
>
>  > The problem is, any project would start overriding the stdlib to
>  > fix things or change some behavior, unless this is somehow
>  > controlled.
>
> But in the end, that's precisely what you propose to do yourself with
> "partial stdlib upgrades"!
>
> It's just that you trust yourself more than you trust "any project".
> But that just doesn't fly from the point of the third party clients of
> the stdlib. Either stability of any particular version's stdlib
> applies to the stdlib developers too, or it doesn't really apply at
> all.

If the maintainer of unittest, for example, provides an upgrade for this
package, don't you think we can trust that he will provide a more stable
upgrade for the unittest package in the stdlib than another project that
would implement a unittest package?

So no, I don't think you can compare a potential upgrade from a stdlib
package maintainer with an upgrade issued from someone else.

So, by "controlled" I mean releasing official upgrades of the stdlib, so
people know they were built by the same maintainers.

From stephen at xemacs.org  Wed Jun  9 10:25:06 2010
From: stephen at xemacs.org (Stephen J. Turnbull)
Date: Wed, 09 Jun 2010 17:25:06 +0900
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: 
References: <20100607201434.66d9bdbd@pitrou.net>
	<20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
	<4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: <87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp>

Tarek Ziadé writes:

 > If the maintainer of unittest, for example, provides an upgrade for this
 > package, don't you think we can trust that he will provide a more stable
 > upgrade for the unittest package in the stdlib than another project that
 > would implement a unittest package?

For the users who really care about this, it's not a question of
relative stability. Either the only changes in documented behavior
involve successful completion of jobs that used to fail, or
instability has been introduced.
For many people (though a small fraction) that is *very bad*, and they
have complained vociferously in the past.

I really don't understand where the big benefits are to having minor
improvements introduced in bugfix releases. People who want those benefits
should upgrade to a more recent series. The people who really need them
but must stick to an older series for Python itself can get the most
recent version of the few packages that have "must-have" improvements from
PyPI.

"No behavior changes in micro releases" is an easily understood,
reasonably easily followed policy. The policy you propose requires
judgment calls that will differ from module maintainer to module
maintainer, and every upgrade will involve discussion on python-dev.
Yuck.

From ziade.tarek at gmail.com  Wed Jun  9 11:05:15 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Wed, 9 Jun 2010 11:05:15 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: <87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp>
References: <20100607201434.66d9bdbd@pitrou.net>
	<20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
	<4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: 

On Wed, Jun 9, 2010 at 10:25 AM, Stephen J. Turnbull wrote:
[..]
>
> I really don't understand where the big benefits are to having minor
> improvements introduced in bugfix releases.

To try to find a solution to the problems described earlier in this
thread.

If I summarize the threads so far as I understood them, people don't want
to rely on stdlib packages because their release cycles are too slow for
what they want/need to do with them *today*. A package that enters the
stdlib suffers from being slowed down. That's also a huge benefit for many
reasons: stability, blessing, etc.

The initial reason is that Ian doesn't want Pip to depend on distutils2 if
it's in the stdlib, because he will have to cope with various versions of
Python to make sure his users will have the same set of features, I guess.

So he needs to provide his own backports of any new distutils2 features.

If we can find a way to facilitate this work, that would be great. IOW, if
we can provide somehow a backport of these features so some projects can
use it no matter what the Python version is...

And "not putting distutils2 in the stdlib" is not the solution, because
this is a problem for all packages in there.

That's exactly what unittest currently does (but with a new name,
"unittest2"), and as soon as Python 2.7 final is out, unittest will have
the same problem: it won't be able to backport new features anymore under
the same namespace.

[..]
> "No behavior changes in micro releases" is an easily understood,
> reasonably easily followed policy. The policy you propose requires
> judgment calls that will differ from module maintainer to module
> maintainer, and every upgrade will involve discussion on python-dev.
> Yuck.

This idea of mine was kind of rejected earlier in Python-ideas, if you
read back the two/three latest threads on the topic. I have reintroduced
it here because someone proposed to allow overriding stdlib packages. If
this was to happen, I'd rather have upgrades decided in python-dev than
arbitrary ones.

But yes, that's too much burden...
Maybe the status quo is the best idea :)

Regards
Tarek

From konryd at gmail.com  Wed Jun  9 11:20:14 2010
From: konryd at gmail.com (Konrad Delong)
Date: Wed, 9 Jun 2010 11:20:14 +0200
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: 
References: <20100607201434.66d9bdbd@pitrou.net>
	<20100607205207.5532b939@pitrou.net> <20100607215622.56961a58@pitrou.net>
Message-ID: 

On 7 June 2010 22:09, Ian Bicking wrote:
> On Mon, Jun 7, 2010 at 2:56 PM, Antoine Pitrou wrote:
>>
>> On Mon, 7 Jun 2010 14:20:41 -0500
>> Ian Bicking wrote:
>> > On Mon, Jun 7, 2010 at 1:52 PM, Antoine Pitrou wrote:
>> >
>> > > > I say there is consensus because as far as I know anything
>> > > > substantial has a maintained version outside the standard
>> > > > library; argparse is implicitly, unittest is unittest2,
>> > > > ElementTree always has maintained a separate existence,
>> > > > simplejson implicitly.
>> > >
>> > > "Anything substantial" is more than exaggerated. The modules you are
>> > > mentioning are exceptions, two of which may even be temporary
>> > > (argparse and unittest2). Most stdlib modules don't have external
>> > > releases, and many of them are still "substantial".
>> >
>> > Most other modules are very old.
>>
>> Well, even if that's true (I haven't checked and I guess we wouldn't
>> agree on the meaning of "old"), so what?
>> I guess what I'm asking is: what is your line of reasoning?
>> You started with a contention that:
>>
>>   "There is no reason any new library or functionality should be tied
>>   to a Python release"
>>
>> and, in my humble opinion, you failed to demonstrate that. In
>> particular, you haven't replied to my argument that it
>> dramatically eases dependency management.
>
> It only eases dependency management in closed systems with a fixed Python
> version. If you support more than one Python version then you still have
> dependency management, it is just tied to ad hoc workarounds when there are
> compatibility problems, or ignoring new functionality.
>
> The importance of "old" vs. "new" modules is that people tend to have a
> lowest version of Python they support, as simply a hard stop. This is
> currently Python 2.5 for most people, 2.4 for some groups (and just a very
> few stragglers with 2.3). So long as you never use anything beyond what 2.5
> provides then it's okay, which is most of the standard library. I am not
> aware of anything added since 2.5 that isn't backported or previously
> available as a separate library (I'm sure there's *something*, just nothing
> I can think of).
>
> There is no clear policy about how backports are managed.

Which gives me an idea:

What if all the backports were managed within a single PyPI package
(e.g. backport26, backport27) with a clear policy on the older Python
versions supported?

Then I could write in my py2.4 script:

    from backport26.os.path import relpath

Konrad
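For readers wondering what such a package would contain: a minimal sketch
of the relpath case Konrad mentions, under the assumption that the
hypothetical backport26 simply re-exports the stdlib function where it
exists and carries a copy of the 2.6 implementation everywhere else:

    # backport26/os/path.py -- hypothetical module layout
    import os

    try:
        from os.path import relpath  # already present on Python 2.6+
    except ImportError:
        # Fallback for 2.4/2.5, mirroring the 2.6 POSIX semantics.
        def relpath(path, start=os.curdir):
            if not path:
                raise ValueError("no path specified")
            start_list = os.path.abspath(start).split(os.sep)
            path_list = os.path.abspath(path).split(os.sep)
            # Count the leading components the two paths share.
            i = len(os.path.commonprefix([start_list, path_list]))
            rel_list = [os.pardir] * (len(start_list) - i) + path_list[i:]
            if not rel_list:
                return os.curdir
            return os.path.join(*rel_list)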
From ncoghlan at gmail.com  Wed Jun  9 13:08:13 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 09 Jun 2010 21:08:13 +1000
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: 
References: <20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
	<4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: <4C0F761D.80600@gmail.com>

On 09/06/10 19:05, Tarek Ziadé wrote:
> And "not putting distutils2 in the stdlib" is not the solution, because
> this is a problem for all packages in there.
>
> That's exactly what unittest currently does (but with a new name,
> "unittest2"), and as soon as Python 2.7 final is out, unittest will have
> the same problem: it won't be able to backport new features anymore under
> the same namespace.

Something we may want to seriously consider is maintaining parallel
releases of packages indefinitely when the benefits are deemed to justify
the additional overheads.

So, for example, many of the current features of unittest2 will be
available in unittest in 2.7 and 3.2. This makes those features available
to those that rely almost entirely on standard library functionality
rather than third party packages.

In the meantime, users of previous versions of Python can already use
unittest2, and that package will work *without name conflicts* in both 2.7
and 3.2, even though many of its features have been added to the standard
library. unittest2 may then even go through a few external releases before
being synced up again with the standard library's unittest when 3.3 comes
around.

Something similar may turn out to be a good idea for distutils2: rather
than consider the anticipated merge back into distutils prior to 3.3 the
end of the road, instead continue to use distutils2 to release faster
updates while evolving the API design towards 3.4. Users then have the
choice - the solid, stable standard library version, or the distinctly
named, more rapidly updated PyPI version.

As others have suggested, this namespace separation approach could be
standardised through the use of a PEP 382 namespace package so that users
could choose between (e.g.) "unittest" and "distutils" and
"cutting_edge.unittest" and "cutting_edge.distutils" (with the latter
being the regularly updated, new-features-and-all PyPI versions, and the
former the traditional stable-API, bugfix-only standard library versions).
That would probably be an improvement over the current ad hoc approach to
naming separation for standard library updates.

I don't see any way to ever resolve the two competing goal sets (stability
vs latest features) without permanently maintaining separate namespaces.

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
---------------------------------------------------------------

From stephen at xemacs.org  Wed Jun  9 13:18:12 2010
From: stephen at xemacs.org (Stephen J. Turnbull)
Date: Wed, 9 Jun 2010 20:18:12 +0900
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: 
References: <20100607201434.66d9bdbd@pitrou.net>
	<20100607205207.5532b939@pitrou.net> <20100607215622.56961a58@pitrou.net>
Message-ID: <19471.30836.817195.129782@uwakimon.sk.tsukuba.ac.jp>

Konrad Delong writes:

 > What if all the backports were managed within a single PyPI package
 > (e.g. backport26, backport27) with a clear policy on the older Python
 > versions supported?

This is an interesting idea. I thought about something similar, but I
decided that this would basically end up being the same problem as
managing the stdlib (I could be wrong, of course, but I'm reasonably
confident of that :-), while decreasing returns would have long since set
in. I.e., even though the effort would probably be quite a bit smaller
than managing the stdlib itself, the benefits would decrease more rapidly
(that's just a wild-ass guess, and since you bring it up, I'm curious to
see what folks will say).
From stephen at xemacs.org  Wed Jun  9 13:22:14 2010
From: stephen at xemacs.org (Stephen J. Turnbull)
Date: Wed, 9 Jun 2010 20:22:14 +0900
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: 
References: <20100607201434.66d9bdbd@pitrou.net>
	<20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
	<4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: <19471.31078.149844.95310@uwakimon.sk.tsukuba.ac.jp>

Tarek Ziadé writes:

 > Maybe the status quo is the best idea :)

+1 to that, and I think your post stated the issues concisely and fairly.
(I add that since I guess you tend to lean the other way; I don't want to
claim you're arguing for my position. :-)

Regards,

From dstanek at dstanek.com  Wed Jun  9 13:34:56 2010
From: dstanek at dstanek.com (David Stanek)
Date: Wed, 9 Jun 2010 07:34:56 -0400
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: <4C0F761D.80600@gmail.com>
References: <20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com>
	<4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com>
Message-ID: 

On Wed, Jun 9, 2010 at 7:08 AM, Nick Coghlan wrote:
> On 09/06/10 19:05, Tarek Ziadé wrote:
>>
>> And "not putting distutils2 in the stdlib" is not the solution, because
>> this is a problem for all packages in there.
>>
>> That's exactly what unittest currently does (but with a new name,
>> "unittest2"), and as soon as Python 2.7 final is out, unittest will have
>> the same problem: it won't be able to backport new features anymore under
>> the same namespace.
>
> Something we may want to seriously consider is maintaining parallel
> releases of packages indefinitely when the benefits are deemed to justify
> the additional overheads.

I had a very similar thought. Why not have all the real development of
those packages happen outside of the standard lib and just grab the latest
stable version when cutting a new version of Python?

Namespaces are fine, but I'd be happy enough with a way for these packages
to show up on the Python path before the stdlib.

--
David
blog: http://www.traceback.org
twitter: http://twitter.com/dstanek

From solipsis at pitrou.net  Wed Jun  9 13:38:43 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Wed, 9 Jun 2010 13:38:43 +0200
Subject: [Python-ideas] Moving development out of the standard library
References: <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com>
	<878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com>
Message-ID: <20100609133843.14c6c19f@pitrou.net>

On Wed, 9 Jun 2010 07:34:56 -0400
David Stanek wrote:
>
> I had a very similar thought. Why not have all the real development of
> those packages happen outside of the standard lib and just grab the
> latest stable version when cutting a new version of Python?

-1. We have had too much trouble with externally-maintained modules
such as elementtree and json. The Python SVN (or hg) tree should be the
primary place where development takes place.

Regards

Antoine.
From ncoghlan at gmail.com  Wed Jun  9 14:13:51 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 09 Jun 2010 22:13:51 +1000
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: 
References: <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com>
	<878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com>
Message-ID: <4C0F857F.4060509@gmail.com>

On 09/06/10 21:34, David Stanek wrote:
> On Wed, Jun 9, 2010 at 7:08 AM, Nick Coghlan wrote:
>> Something we may want to seriously consider is maintaining parallel
>> releases of packages indefinitely when the benefits are deemed to justify
>> the additional overheads.
>
> I had a very similar thought. Why not have all the real development of
> those packages happen outside of the standard lib and just grab the
> latest stable version when cutting a new version of Python?
>
> Namespaces are fine, but I'd be happy enough with a way for these
> packages to show up on the Python path before the stdlib.

Another parallel that occurs to me is the stable/testing/unstable
distinction in Debian - if you want rock solid (but old) you stick with
stable, if you want comparatively cutting edge you go with unstable, and
testing splits the difference.

Ubuntu's "normal release" vs "Long Term Support release" distinction is
also worth thinking about. Most home desktop users will upgrade every 6
months (modulo rocky transitions like the upgrade to KDE 4), while
corporate users will wait for the next LTS to arrive.

To bring that back to a Python context and revisit the proposal that Guido
rejected for biennial Python releases with annual standard library
releases, suppose that, instead of the standard library itself having
annual releases, there was a python-dev maintained (or even third-party
maintained, python-dev blessed) "pybackports" project.

The idea being, that package would not only make many new standard library
features of the current Python release available on widely used previous
versions, but also make upcoming features of the *next* Python release
available on the current version. The periods of support would be shorter
to reflect the faster release cycle.

Third party libraries and applications could then either target the "rock
solid" market and depend solely on the standard library, or go for the
"latest and greatest" crowd and require pybackports.

Make no mistake, something like this would be quite a bit of work, but it
would be far from impossible and could go a long way towards fighting the
impression that the standard library is the place where modules go to die.

TLDR version:

Year A: Python 3.x release, pybackports 3.x.0 release
Year B: No Python release, pybackports 3.x.5 release (with 3.y features)
Year C: Python 3.y release, pybackports 3.y.0 release
Year D: No Python release, pybackports 3.y.5 release (with 3.z features)
...etc

Unlike Python itself, pybackports would not provide bugfix support for
prior releases (since it would only be for the latest-and-greatest crowd;
those that want the longer support should stick with the standard
library).

Something to think about, anyway.

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
---------------------------------------------------------------
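A quick sketch of how a library might consume such a project -- pybackports
here is Nick's hypothetical name, nothing by that name actually exists:

    try:
        # Prefer the faster-moving pybackports release when installed.
        from pybackports import unittest
    except ImportError:
        # Fall back to the stable standard library version.
        import unittest

The "rock solid" crowd never installs the extra package and never notices;
the "latest and greatest" crowd gets the new features on their existing
interpreter.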
From p.f.moore at gmail.com  Wed Jun  9 14:40:37 2010
From: p.f.moore at gmail.com (Paul Moore)
Date: Wed, 9 Jun 2010 13:40:37 +0100
Subject: [Python-ideas] Moving development out of the standard library
In-Reply-To: 
References: <20100607201434.66d9bdbd@pitrou.net>
	<20100607205207.5532b939@pitrou.net> <20100607215622.56961a58@pitrou.net>
Message-ID: 

On 9 June 2010 10:20, Konrad Delong wrote:
> Which gives me an idea:
>
> What if all the backports were managed within a single PyPI package
> (e.g. backport26, backport27) with a clear policy on the older Python
> versions supported?
>
> Then I could write in my py2.4 script:
>
>     from backport26.os.path import relpath

It's not a bad idea. A key benefit seems to be that it can be done by
anyone, whether or not they are a core developer. So it can be set up
right now, without taking up any of the limited core-dev resource.

Of course, conversely, the disadvantage is that nobody's done this
already, implying that either nobody's thought of it before or there's
actually little motivation for someone to put the work into such a
solution :-)
> > Of course, conversely, the disadvantage is that nobody's done this > already, implying that either nobody's thought of it before or there's > actually little motivation for someone to put the work into such a > solution :-) Yeah, I am aware of that :-) Another question is whether such a package is going to find use. I doubt Michael would introduce a dependency into unittest2 just to get functools.wrap and os.path.relpath out of his code. Distutils2 contains a whole _backport module [1] which could go away, but again: at the cost of introducing dependency into the package. Konrad [1] http://bitbucket.org/tarek/distutils2/src/tip/src/distutils2/_backport/ From ncoghlan at gmail.com Wed Jun 9 15:08:43 2010 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 09 Jun 2010 23:08:43 +1000 Subject: [Python-ideas] Moving development out of the standard library In-Reply-To: <20100609144057.65e92226@pitrou.net> References: <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkz4fw9p.fsf@uwakimon .sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com> <4C0F857F.4060509@gmail.com> <20100609144057.65e92226@pitrou.net> Message-ID: <4C0F925B.2060607@gmail.com> On 09/06/10 22:40, Antoine Pitrou wrote: > Isn't that "impression" largely constructed, and propagated by a > limited number of people who apparently don't like the very idea of a > "batteries included" stdlib? There has been an amount of anti-stdlib > activism (including in this thread) that I find both antagonizing and > unconstructive. Outside of that vocal minority, there doesn't seem to > be that much criticism against the stdlib. The "where modules go to die" version of it is overstated, but the standard library definitely evolves more slowly than many third party packages. To use numpy as an example (just going off their SF file dates): Dec 2007: 1.0.4 May 2008: 1.1.0 Sep 2008: 1.2.0 Jul 2009: 1.3.0 Apr 2010: 1.4.1 Faster cycle times allow developers to be much more responsive to feedback when changes don't turn out as well as was hoped. The comparatively slow evolution of the standard library is the grain of truth that underlies the exaggerated form. The trick is to explore avenues that make these faster cycle times available to those that want them, while still providing a stable foundation for those that need it. It is exactly this situation that the Ubuntu release cycle is designed around: regular 6-monthly releases for most people, less frequent Long Term Support releases for those that need stability. That said, the status quo, with ad hoc porting to PyPI by module maintainers that consider doing so to be worthwhile is certainly a viable option going forward as well. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From fdrake at acm.org Wed Jun 9 15:10:23 2010 From: fdrake at acm.org (Fred Drake) Date: Wed, 9 Jun 2010 09:10:23 -0400 Subject: [Python-ideas] Moving development out of the standard library In-Reply-To: <4C0F761D.80600@gmail.com> References: <20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com> Message-ID: On Wed, Jun 9, 2010 at 7:08 AM, Nick Coghlan wrote: > Something we may want to seriously consider is maintaining parallel releases > of packages indefinitely when the benefits are deemed to justify the > additional overheads. 
And this provides an opportunity to bring the workload back under control, as well: if we continue the non-stdlib releases for these packages, all we really need to do is remember not to add them to the stdlib in the first place. The one case where we clearly get some win by having the newer package rolled in is distutils2, since that's really about a core service that needs to be available for everyone. -Fred -- Fred L. Drake, Jr. "Chaos is the score upon which reality is written." --Henry Miller From fdrake at acm.org Wed Jun 9 15:19:41 2010 From: fdrake at acm.org (Fred Drake) Date: Wed, 9 Jun 2010 09:19:41 -0400 Subject: [Python-ideas] Moving development out of the standard library In-Reply-To: <20100609144057.65e92226@pitrou.net> References: <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com> <4C0F857F.4060509@gmail.com> <20100609144057.65e92226@pitrou.net> Message-ID: On Wed, Jun 9, 2010 at 8:40 AM, Antoine Pitrou wrote: > Isn't that "impression" largely constructed, and propagated by a > limited number of people who apparently don't like the very idea of a > "batteries included" stdlib? I don't think so. I've no particular dislike of "batteries included" per se. What I don't like is dealing with packages that may or may not be in the standard library (that affects the requirements for my software), or that may have different names or update policies depending on whether they're part of the standard library (because that affects my code, always negatively). >?There has been an amount of anti-stdlib > activism (including in this thread) that I find both antagonizing and > unconstructive. ?Outside of that vocal minority, there doesn't seem to > be that much criticism against the stdlib. Unconstructive in what way? Writing cross-Python-version code that deals with the differences between the stdlib and 3rd-party versions of packages is certainly unconstructive, but that's an argument to avoid moving packages into the standard library. One thing that seems to be happening is that the so-called "vocal minority" is growing. I think that should be expected as the acceptance of Python and applications built on it gain wider penetration. -Fred -- Fred L. Drake, Jr. "Chaos is the score upon which reality is written." --Henry Miller From solipsis at pitrou.net Wed Jun 9 15:34:23 2010 From: solipsis at pitrou.net (Antoine Pitrou) Date: Wed, 09 Jun 2010 15:34:23 +0200 Subject: [Python-ideas] Moving development out of the standard library In-Reply-To: References: <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com> <4C0F857F.4060509@gmail.com> <20100609144057.65e92226@pitrou.net> Message-ID: <1276090463.3143.7.camel@localhost.localdomain> Le mercredi 09 juin 2010 ? 09:19 -0400, Fred Drake a ?crit : > > I don't think so. I've no particular dislike of "batteries included" > per se. What I don't like is dealing with packages that may or may not > be in the standard library (that affects the requirements for my > software), I don't understand what you mean. Are these packages in the stdlib or aren't they? It can't be both. > Writing cross-Python-version code that deals with the differences > between the stdlib and 3rd-party versions of packages is certainly > unconstructive, but that's an argument to avoid moving packages into the > standard library. I don't see how, really. 
You make it sound like you have to deal with several versions of the stdlib, but not with several versions of external packages. I wonder why: do you force your users to install version X of module Y? More generally, it seems that people are reproaching things to the stdlib that are equally true for non-stdlib modules. This is quite bewildering to me. From p.f.moore at gmail.com Wed Jun 9 15:59:55 2010 From: p.f.moore at gmail.com (Paul Moore) Date: Wed, 9 Jun 2010 14:59:55 +0100 Subject: [Python-ideas] Moving development out of the standard library In-Reply-To: <20100609144057.65e92226@pitrou.net> References: <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com> <4C0F857F.4060509@gmail.com> <20100609144057.65e92226@pitrou.net> Message-ID: On 9 June 2010 13:40, Antoine Pitrou wrote: > On Wed, 09 Jun 2010 22:13:51 +1000 > Nick Coghlan wrote: >> >> Make no mistake, something like this would be quite a bit of work, but >> it would be far from impossible and could go a long way towards fighting >> the impression that the standard library is the place where modules go >> to die. > > Isn't that "impression" largely constructed, and propagated by a > limited number of people who apparently don't like the very idea of a > "batteries included" stdlib? ?There has been an amount of anti-stdlib > activism (including in this thread) that I find both antagonizing and > unconstructive. ?Outside of that vocal minority, there doesn't seem to > be that much criticism against the stdlib. I agree - I think the "where modules go to die" argument is very overstated (but sadly, just as a result of repetition, it seems to be gaining traction :-(). Certainly, stdlib modules evolve at a slower rate than 3rd party ones in many cases. But they do evolve, as Antoine points out, and the slower evolution can just as easily be viewed as stability. What I don't understand is why the "activists" actually care if the stdlib is big or small. Surely if you don't like the "fat stdlib", just ignore it? Why inconvenience those of us who find it a benefit? So can someone clarify (from the point of view of a "thin stdlib" proponent) - what is the benefit to you, personally (i.e., ignoring things like "frees up core developer time" - let them speak for themselves if they feel that is a benefit), of actually removing items from the stdlib, rather than just ignoring them? Paul. From ianb at colorstudy.com Wed Jun 9 17:17:52 2010 From: ianb at colorstudy.com (Ian Bicking) Date: Wed, 9 Jun 2010 10:17:52 -0500 Subject: [Python-ideas] Moving development out of the standard library In-Reply-To: <20100609133843.14c6c19f@pitrou.net> References: <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com> <20100609133843.14c6c19f@pitrou.net> Message-ID: On Wed, Jun 9, 2010 at 6:38 AM, Antoine Pitrou wrote: > On Wed, 9 Jun 2010 07:34:56 -0400 > David Stanek wrote: > > > > I had a very similar thought. Why not have all the real development of > > those packages happen outside of the standard lib and just grab the > > latest stable version when cutting a new version of Python. > > -1. We have had too much trouble with externally-maintained modules > such as elementtree and json. The Python SVN (or hg) tree should be the > primary place where development takes place. > New releases could also be cut from the Python tree. 
I believe everyone here agrees that entering the standard library in any form should imply a greater sense of collective ownership of a package. -- Ian Bicking | http://blog.ianbicking.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From tjreedy at udel.edu Wed Jun 9 20:59:45 2010 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 09 Jun 2010 14:59:45 -0400 Subject: [Python-ideas] Moving development out of the standard library In-Reply-To: References: <20100607205207.5532b939@pitrou.net> <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On 6/9/2010 5:05 AM, Tarek Ziad? wrote: > To try to find a solution to the problems described in this thread earlier. > > If I summarize so far the threads as I understood them, people don't > want to rely > on stdlib packages because their release cycles are too slow for what > they want/need to do with it *today*. A package that enters the stdlib > suffers from being slowed down. That's also a huge benefit for many reasons: > stability, blessing, etc. > > The initial reason is that Ian doesn't want Pip do depend on distutils2 if > it's in the stdlib, because he will have to cope with various versions > of Python to make sure his users will have the same set of features I guess. > > So he needs to provide his own backports of any new distutils2 features. > > If we can find a way to facilitate this work, that would be great. IOW, if > we can provide somehow a backport of these features so some projects > can use it no matter what the python version is... > > And "not putting distutils2 in the stdlib" is not the solution because > this is a problem > for all packages in there. > > That's exactly what unitest currently do (but with a new name "unittest2") > and as soon as Python 2.7 final will be out, unittest will have the > same problem: > it won't be able to backport new features anymore under the same namespace. I do not see that changing the 'no new features in micro (bugfix) releases' policy would solve Ian's problem at all. Suppose unittest2 were sufficiently done in September to go into 3.2 as unittest. Suppose the policy were changed and unittest2 were also backported into (3.1.final) and 2.7.1. That still would not help Ian with respect to 2.6, 2.5, 2.4, and however far back he wants to support. Since whatever solution he uses for 2.6- should also work for 2.7 (and 3.1) what is the use? Perhaps the reason for the policy needs to be restated: if new features are introduced in every x.y.z release, then 'Python x.y' has no particular meaning. This *was* the case somewhat during Python 1 and early Python 2 days, when the choice between moving from x.y.z to either x.y.(z+1) and x.(y+1) was somewhat arbitrary. (The move from '1.6' to '2.0' rather than '1.7' was for legal reasons only. Python2 really began with 2.2) The precipitating event for the new policy was the introduction of bool() and prebound True and False late in 2.2 series. People who download the latest '2.2' release and who used the new feature found that their code would not run on the great majority of '2.2' installations. It caused enough problems for enough people that Guido decided that he should have waited for 2.3 for official introduction, and perhaps first released bool as a separate module for earlier use. 
People who propose to change the policy back (are part way back) to what it used to be should at least be aware that it is a reversion and that there are reasons, and not just arbitrary whim or accidental happenstance, for the change. Terry Jan Reedy From tjreedy at udel.edu Wed Jun 9 21:10:40 2010 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 09 Jun 2010 15:10:40 -0400 Subject: [Python-ideas] Moving development out of the standard library In-Reply-To: References: <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com> <4C0F857F.4060509@gmail.com> <20100609144057.65e92226@pitrou.net> Message-ID: On 6/9/2010 9:19 AM, Fred Drake wrote: > One thing that seems to be happening is that the so-called "vocal > minority" is growing. I think that should be expected as the acceptance > of Python and applications built on it gain wider penetration. Yes, as N grows, any constant fraction f times N will grow. In fact, f*N can grow even if f shrinks. but just more slowly than N grows. Terry Jan Reedy From fdrake at acm.org Wed Jun 9 21:20:23 2010 From: fdrake at acm.org (Fred Drake) Date: Wed, 9 Jun 2010 15:20:23 -0400 Subject: [Python-ideas] Moving development out of the standard library In-Reply-To: References: <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com> <4C0F857F.4060509@gmail.com> <20100609144057.65e92226@pitrou.net> Message-ID: On Wed, Jun 9, 2010 at 3:10 PM, Terry Reedy wrote: > Yes, as N grows, any constant fraction f times N will grow. In fact, f*N can > grow even if f shrinks. but just more slowly than N grows. True. Since there's no empirical measure of either N (number of Python users) or VM (the Vocal Minority), it's hard to tell if f is a constant fraction or something more (or less) interesting. I also suspect that VM is less likely to be readily measurable than N. For those in the VM, the severity of their objections only increases over time, until they exit both N and VM. -Fred -- Fred L. Drake, Jr. "Chaos is the score upon which reality is written." --Henry Miller From brett at python.org Wed Jun 9 21:52:55 2010 From: brett at python.org (Brett Cannon) Date: Wed, 9 Jun 2010 12:52:55 -0700 Subject: [Python-ideas] Moving development out of the standard library In-Reply-To: References: <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com> <20100609133843.14c6c19f@pitrou.net> Message-ID: On Wed, Jun 9, 2010 at 08:17, Ian Bicking wrote: > On Wed, Jun 9, 2010 at 6:38 AM, Antoine Pitrou wrote: >> >> On Wed, 9 Jun 2010 07:34:56 -0400 >> David Stanek wrote: >> > >> > I had a very similar thought. Why not have all the real development of >> > those packages happen outside of the standard lib and just grab the >> > latest stable version when cutting a new version of Python. >> >> -1. ?We have had too much trouble with externally-maintained modules >> such as elementtree and json. ?The Python SVN (or hg) tree should be the >> primary place where development takes place. > > New releases could also be cut from the Python tree.? I believe everyone > here agrees that entering the standard library in any form should imply a > greater sense of collective ownership of a package. 
But this makes the assumption that core developers are going to choose to develop modules such that they can be released before the next release of Python goes out. There seems to be a lot of extrapolation from the fact that Michael takes the time to do unittest2 (which, now that I think about it, should probably have been named unittest_external or something as it really isn't a sequel to unittest, just an external release) and that Tarek is doing the initial development of distutils2 externally. Out of all the core developers that is not that many. Sure you could maybe toss in ElementTree, but that probably is it (e.g. simplejson only gets new releases when Bob fixes stuff for a new Python release or simply makes an external release of stdlib fixes). For me, importlib will never work in this environment. When new features in modules or the language come in I try to update code to use it when I can. That means that importlib is not going to have a stable release cycle outside of Python's. Nor am I going to be willing to change that practice as that makes development harder for me -- I don't want to have to check if some new feature has been made available externally -- and honestly more boring -- I like using the new features of the language as that helps make core development fun. The only way I see this working is for individual core developers to decide they want to keep a module dependent only on the last minor release of Python (or older). At that point you can try to convince the developers to use a common package name (stdlib, py, stdlib_ext, etc.) and then use namespace packages or pkgutil.extend_path to pull them in under the common package name to signify that they are "early" releases of the modules from the stdlib. That gives you an easier way to gauge usage as you can look at use of the package as a whole to see what kind of community pickup there is. If you can do that *and* show that there was a clear benefit to the community then you might have a chance to get more developers to participate in this development scheme. But in terms of a general python-dev policy, it simply won't happen as it is just too much extra work to force upon everyone (Nick seems to be the only one I can think of still throwing ideas out there, but even with that I don't know how much work he wants to put in). From ncoghlan at gmail.com Thu Jun 10 00:31:23 2010 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 10 Jun 2010 08:31:23 +1000 Subject: [Python-ideas] Moving development out of the standard library In-Reply-To: References: <4C0D62B0.4020000@trueblade.com> <4C0EAE83.30300@gmail.com> <878w6pgi8m.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkz4fw9p.fsf@uwakimon.sk.tsukuba.ac.jp> <4C0F761D.80600@gmail.com> <20100609133843.14c6c19f@pitrou.net> Message-ID: <4C10163B.9070406@gmail.com> On 10/06/10 05:52, Brett Cannon wrote: > But in terms of a general > python-dev policy, it simply won't happen as it is just too much extra > work to force upon everyone (Nick seems to be the only one I can think > of still throwing ideas out there, but even with that I don't know how > much work he wants to put in). Pure speculation as to ways this could be done such that it produces a net benefit for the Python ecosystem as a whole. I have close to zero interest in actually doing the work involved myself, since I'm one of those that just uses the standard library for stuff and doesn't worry if it isn't the latest and greatest. Cheers, Nick. 
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From junkmute at hotmail.com Fri Jun 11 23:06:42 2010 From: junkmute at hotmail.com (Fake Name) Date: Fri, 11 Jun 2010 17:06:42 -0400 Subject: [Python-ideas] Globalize lonely augmented assignment Message-ID: Mark Dickinson suggested discussion take place here, rather than at bugs.python http://bugs.python.org/issue8977 To note, his counter example would currently raise an UnboundLocalError _________________________________________________________________ Jeux Messenger : mettez vos amis au d?fi! http://go.microsoft.com/?linkid=9734397 -------------- next part -------------- An HTML attachment was scrubbed... URL: From merwok at netwok.org Fri Jun 11 23:37:45 2010 From: merwok at netwok.org (=?UTF-8?B?w4lyaWMgQXJhdWpv?=) Date: Fri, 11 Jun 2010 23:37:45 +0200 Subject: [Python-ideas] Globalize lonely augmented assignment In-Reply-To: References: Message-ID: <4C12ACA9.6010205@netwok.org> Hello list I?m really uncomfortable with your proposal too. FTR, let me copy it: You want this code to work: A = [1, 2, 3] def f(x): A += [x] f(4) # appends 4 to A It fails today to be consistent with this equivalent code: def f(x): B = A + [x] A = B I can?t see one way working but not the other. > To note, his counter example would currently raise an > UnboundLocalError It?s not an alternate example that would work today, but a consequence of your proposal that we definitely don?t want (mutating immutable objects). To add a third reason to reject your patch: f = A.append # :) Regards From ncoghlan at gmail.com Sat Jun 12 02:39:20 2010 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 12 Jun 2010 10:39:20 +1000 Subject: [Python-ideas] Globalize lonely augmented assignment In-Reply-To: References: Message-ID: <4C12D738.4070906@gmail.com> On 12/06/10 07:06, Fake Name wrote: > Mark Dickinson suggested discussion take place here, rather than at > bugs.python > > http://bugs.python.org/issue8977 > > To note, his counter example would currently raise an UnboundLocalError For those not checking the issue discussion, I'll note that Guido's opinion is that this is a borderline case. Either behaviour (mutate/rebind the global or raise UnboundLocalError) is going to be confusing in some cases and intuitive in others. To quote Mark: """ I guess there's a mismatch either way around: currently, "A += [4]" and "A.append(4)" behave differently for (e.g.,) a list A. With the proposed change, "n += 3" and "n = n + 3" behave differently for a integer n.""" I agree with both of those points. However, I believe this is a case where the cognitive cost of changing the status quo isn't worthwhile. New Python users can be taught very quickly that assigning to a variable from a different scope requires a global or nonlocal declaration. While existing users could be taught easily enough that that was no longer necessary for augmented assignment, how much real world code would actually benefit? (now, Paul Graham's accumulator hobby horse doesn't count as real world code) Call it a -0. I'm not implacably opposed, I just don't see it as a good use of developer (and documentation author!) time. Cheers, Nick. P.S. Any such change would have to wait until after the moratorium anyway. 
From junkmute at hotmail.com  Fri Jun 11 23:06:42 2010
From: junkmute at hotmail.com (Fake Name)
Date: Fri, 11 Jun 2010 17:06:42 -0400
Subject: [Python-ideas] Globalize lonely augmented assignment
Message-ID: 

Mark Dickinson suggested discussion take place here, rather than at
bugs.python:

http://bugs.python.org/issue8977

To note, his counter example would currently raise an UnboundLocalError.

From merwok at netwok.org  Fri Jun 11 23:37:45 2010
From: merwok at netwok.org (Éric Araujo)
Date: Fri, 11 Jun 2010 23:37:45 +0200
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: 
Message-ID: <4C12ACA9.6010205@netwok.org>

Hello list

I'm really uncomfortable with your proposal too. FTR, let me copy it: you
want this code to work:

    A = [1, 2, 3]
    def f(x):
        A += [x]
    f(4)  # appends 4 to A

It fails today, to be consistent with this equivalent code:

    def f(x):
        B = A + [x]
        A = B

I can't see one way working but not the other.

> To note, his counter example would currently raise an
> UnboundLocalError

It's not an alternate example that would work today, but a consequence of
your proposal that we definitely don't want (mutating immutable objects).

To add a third reason to reject your patch:

    f = A.append  # :)

Regards

From ncoghlan at gmail.com  Sat Jun 12 02:39:20 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 12 Jun 2010 10:39:20 +1000
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: 
Message-ID: <4C12D738.4070906@gmail.com>

On 12/06/10 07:06, Fake Name wrote:
> Mark Dickinson suggested discussion take place here, rather than at
> bugs.python
>
> http://bugs.python.org/issue8977
>
> To note, his counter example would currently raise an UnboundLocalError

For those not checking the issue discussion, I'll note that Guido's
opinion is that this is a borderline case. Either behaviour
(mutate/rebind the global or raise UnboundLocalError) is going to be
confusing in some cases and intuitive in others.

To quote Mark:

"""I guess there's a mismatch either way around: currently, "A += [4]" and
"A.append(4)" behave differently for (e.g.,) a list A. With the proposed
change, "n += 3" and "n = n + 3" behave differently for an integer n."""

I agree with both of those points.

However, I believe this is a case where the cognitive cost of changing the
status quo isn't worthwhile. New Python users can be taught very quickly
that assigning to a variable from a different scope requires a global or
nonlocal declaration.

While existing users could be taught easily enough that that was no longer
necessary for augmented assignment, how much real world code would
actually benefit? (now, Paul Graham's accumulator hobby horse doesn't
count as real world code)

Call it a -0. I'm not implacably opposed, I just don't see it as a good
use of developer (and documentation author!) time.

Cheers,
Nick.

P.S. Any such change would have to wait until after the moratorium anyway.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
---------------------------------------------------------------
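Since the whole argument turns on that borderline, here is the status quo
in a form anyone can paste into an interpreter -- this is current CPython
behaviour, not anything proposed:

    n = 3

    def read_only():
        return n + 1    # fine: no binding of n here, so the global is read

    def augmented():
        n += 1          # n += 1 makes n local to augmented(), so this
        return n        # read of the unbound local fails

    print(read_only())  # 4
    print(augmented())  # raises UnboundLocalError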
Assignment is NOT NOT NOT an operator. It is a type of statement. > I view the augmented assignment operators as different beasts. Your view is one that leads to buggy code. It is wrong in that respect. An augmented assignment STATEMEMT is both a STATEMENT, not an operator, and an ASSIGNMENT statement. Misunderstanding this leads to buggy code and posts on python list "why doesnt my code not work righ?" -1+ on the proposal as it will lead to confusion and bugs. Terry Jan Reedy From ncoghlan at gmail.com Sat Jun 12 10:12:52 2010 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 12 Jun 2010 18:12:52 +1000 Subject: [Python-ideas] Globalize lonely augmented assignment In-Reply-To: References: , <4C12D738.4070906@gmail.com> Message-ID: <4C134184.8060104@gmail.com> On 12/06/10 11:18, Demur Rumed wrote: > I believe it would be simpler to learn that variables are _only_ local > if bound with the assignment operator. I view the augmented assignment > operators as different beasts. This patch doesn't quite meet its goals > in that respect. I'd like to further the locality of a variable to "A > variable is local if, and only if, it is first referenced as the left > hand side of an assignment on all code paths." This patch fails to set > that rule The only thing even *remotely* on the table here is to take augmented assignment out of the list of statements that will create a new local variable. For 3.x, that list is currently: - assignment (i.e. '=') - augmented assignment (i.e. '+=', '*=', etc) - function/generator definitions (i.e. def)* - class definitions - for loops - try/except statements - import statements - with statements *Unlike other statements in this list, def statements can affect two different scopes. The defined name becomes a local in the scope containing the statement, while the names of any declared parameters become local inside the statement's own scope. The compiler identifies local variables via static analysis of the function as a whole to see if they are used as name binding targets in any of the above statements *anywhere* in the function. We are *not* going to change that, not just because doing anything else would be far to error-prone, but also because any other interpretation would make compilation far too difficult. For example, consider the following example: def f(x): if randint(2): a = [5] return a[x] # Emit code for global or local lookup? The compiler has to choose to emit a global or local lookup opcode at compile time - it doesn't have the luxury of knowing whether or not the name binding statement will actually be executed at runtime, so it ignores any conditional execution when deciding whether or not a name is bound locally. UnboundLocalError then covers all cases where you attempt to use a local variable name before you have bound it to something. Now, as to the reason we can even consider taking augmented assignment out of the list above: of the current name binding statements, it is the *only* one which requires that a referenced name already be bound using one of the *other* statements in the list. If augmented assignment is currently used in a function *without* raising UnboundLocalError at runtime, then that can only be because the target has been bound by other means, either in the current scope, or else in a different scope and then explicitly declared as coming from another scope via a global or nonlocal statement. 
So, without breaking existing code (that wasn't already broken), we
could change the default scope for augmented assignment from "always
use the local scope" to be:

- if a name is declared local by other means, treat it as local
- if the name exists in a surrounding scope, treat it as nonlocal
- otherwise treat it as global

That would almost certainly be more useful than the current behaviour.
The question is whether it is *sufficiently* useful to justify the
effort in updating the documentation and implementing this not just for
CPython, but for other VMs such as Jython, IronPython and PyPy.

Cheers,
Nick.

-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
---------------------------------------------------------------

From dangyogi at gmail.com  Sat Jun 12 17:43:25 2010
From: dangyogi at gmail.com (Bruce Frederiksen)
Date: Sat, 12 Jun 2010 11:43:25 -0400
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com>
Message-ID: 

On Sat, Jun 12, 2010 at 3:10 AM, Terry Reedy wrote:
>
> On 6/11/2010 9:18 PM, Demur Rumed wrote:
>
>> I view the augmented assignment operators as different beasts.
>
> Your view is one that leads to buggy code. It is wrong in that respect.
> An augmented assignment STATEMENT is both a STATEMENT, not an operator,
> and an ASSIGNMENT statement. Misunderstanding this leads to buggy code
> and posts on python-list asking "why doesn't my code work right?"

I am curious about these buggy code examples.  Do you have any?

The standard assignment statement _binds_ the local variable.  But the
augmented assignment only _rebinds_ it.  The augmented assignment does
not give the variable a value if it doesn't already have one.

I think that we all agree that if the function has an assignment to
the variable some place else, the variable is a local variable.

So we are considering the case where no assignment to the variable
exists within the function, but there is an augmented assignment.  But
in this case, if we say that the variable is a local variable, how did
this local variable get a value that the augmented assignment can then
use?

The only way that I can think of is:

def foo():
    def bar():
        nonlocal A
        A = []
    bar()
    A += [2]

What am I missing?

-Bruce

From g.brandl at gmx.net  Sat Jun 12 20:28:03 2010
From: g.brandl at gmx.net (Georg Brandl)
Date: Sat, 12 Jun 2010 20:28:03 +0200
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com>
Message-ID: 

Am 12.06.2010 17:43, schrieb Bruce Frederiksen:
> So we are considering the case where no assignment to the variable
> exists within the function, but there is an augmented assignment.  But
> in this case, if we say that the variable is a local variable, how did
> this local variable get a value that the augmented assignment can then
> use?
>
> The only way that I can think of is:
>
> def foo():
>     def bar():
>         nonlocal A
>         A = []
>     bar()
>     A += [2]

(I assume you intended a global A somewhere outside of foo().)
With the proposed semantics, this would actually be a compilation error,
since there is no nonlocal binding of A that the "nonlocal" statement
could bring into scope.

> What am I missing?

Currently, augmented assignment has a very straightforward translation to
plain assignment::

   a += b   is equivalent to
   a = a.__iadd__(b)

(Not just ``a.__iadd__(b)``, as some people think at first.  That's a
problem, see below.)
With the proposal, it would be much more complicated and dependent on
the context: "... it's the same as , but if the name would only
be made a local by augmented assignment statements, it's automatically
made nonlocal if there's a matching non-local binding, or global
otherwise."  Pretty scary.

And while I think about it, it's pretty implicit as well, since it
basically makes a global or nonlocal binding (this would be different
if the translation didn't include the actual assignment).

Georg

-- 
Thus spake the Lord: Thou shalt indent with four spaces. No more, no less.
Four shall be the number of spaces thou shalt indent, and the number of thy
indenting shall be four. Eight shalt thou not indent, nor either indent thou
two, excepting that thou then proceed to four. Tabs are right out.

From dangyogi at gmail.com  Sat Jun 12 21:40:16 2010
From: dangyogi at gmail.com (Bruce Frederiksen)
Date: Sat, 12 Jun 2010 15:40:16 -0400
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com>
Message-ID: 

On Sat, Jun 12, 2010 at 2:28 PM, Georg Brandl wrote:
> Am 12.06.2010 17:43, schrieb Bruce Frederiksen:
>
>> So we are considering the case where no assignment to the variable
>> exists within the function, but there is an augmented assignment.  But
>> in this case, if we say that the variable is a local variable, how did
>> this local variable get a value that the augmented assignment can then
>> use?
>>
>> The only way that I can think of is:
>>
>> def foo():
>>     def bar():
>>         nonlocal A
>>         A = []
>>     bar()
>>     A += [2]
>
> (I assume you intended a global A somewhere outside of foo().)
> With the proposed semantics, this would actually be a compilation error,
> since there is no nonlocal binding of A that the "nonlocal" statement
> could bring into scope.

No, I didn't intend a global A, and yes, this would be a compilation
error under the proposed semantics.

What I meant was that the current semantics are broken.  I think that
it could be argued that it doesn't make sense to have augmented
assignment cause a variable to be made local; because, outside of my
example above, execution of the augmented assignment would _always_
produce an UnboundLocalError.  And that's because the augmented
assignment also refers to the variable _before_ setting it.

So, to be a legal program (ie, one that doesn't raise
UnboundLocalError), there _must_ be some other assignment to the
variable in the function.  And if so, this other assignment would
cause the variable to be made local, and the augmented assignment is
immaterial to this decision.

So my challenge (to see if I'm overlooking something) is to show me a
current Python program that only uses augmented assignment to cause a
variable to be made local, but does not raise UnboundLocalError when
the augmented assignment is run.  If these examples don't exist, it
sounds like this is a bug in the current language design.

def foo():
    # this causes 'a' to be made local; both current and proposed.
    a = 5

    # so 'a' here is local; both current and proposed.
    a += 7

def foo():
    # without an assignment to 'a', this is currently always an error!
    # it can only make sense if 'a' is global!
    a += 7

If you can't do that, then this is a bug!

>
>> What am I missing?
>
> Currently, augmented assignment has a very straightforward translation to
> plain assignment::
>
>    a += b   is equivalent to
>    a = a.__iadd__(b)
>
> (Not just ``a.__iadd__(b)``, as some people think at first.  That's a
> problem, see below.)
But this can't be treated as a simple macro expansion inside the
compiler, as that would cause the lhs to be evaluated twice.  For
example in:

a[fn_with_side_effects()] += b

> With the proposal, it would be much more complicated and dependent on
> the context: "... it's the same as , but if the name would only
> be made a local by augmented assignment statements, it's automatically
> made nonlocal if there's a matching non-local binding, or global
> otherwise."  Pretty scary.

I agree completely that anything more complicated than striking the
augmented assignment from the list of statements that cause a variable
to be made local is scary.  It should not depend on what global
bindings are present at this or that time, or whether some other local
assignment has or has not been executed prior to the augmented
assignment.  The augmented assignment statement should simply not be a
factor in the decision to make a variable local, and having it be a
factor is a bug in the current language design as it can never lead to
legal programs (at least before the addition of the "nonlocal"
statement), only misbehaving ones.

-Bruce

From g.brandl at gmx.net  Sat Jun 12 23:08:50 2010
From: g.brandl at gmx.net (Georg Brandl)
Date: Sat, 12 Jun 2010 23:08:50 +0200
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com>
Message-ID: 

Am 12.06.2010 21:40, schrieb Bruce Frederiksen:
> On Sat, Jun 12, 2010 at 2:28 PM, Georg Brandl wrote:
>> Am 12.06.2010 17:43, schrieb Bruce Frederiksen:
>>
>>> So we are considering the case where no assignment to the variable
>>> exists within the function, but there is an augmented assignment.  But
>>> in this case, if we say that the variable is a local variable, how did
>>> this local variable get a value that the augmented assignment can then
>>> use?
>>>
>>> The only way that I can think of is:
>>>
>>> def foo():
>>>     def bar():
>>>         nonlocal A
>>>         A = []
>>>     bar()
>>>     A += [2]
>>
>> (I assume you intended a global A somewhere outside of foo().)
>> With the proposed semantics, this would actually be a compilation error,
>> since there is no nonlocal binding of A that the "nonlocal" statement
>> could bring into scope.
>
> No, I didn't intend a global A, and yes, this would be a compilation
> error under the proposed semantics.
>
> What I meant was that the current semantics are broken.  I think that
> it could be argued that it doesn't make sense to have augmented
> assignment cause a variable to be made local; because, outside of my
> example above, execution of the augmented assignment would _always_
> produce an UnboundLocalError.  And that's because the augmented
> assignment also refers to the variable _before_ setting it.
>
> So, to be a legal program (ie, one that doesn't raise
> UnboundLocalError), there _must_ be some other assignment to the
> variable in the function.  And if so, this other assignment would
> cause the variable to be made local, and the augmented assignment is
> immaterial to this decision.

Yes, but why does that make the current semantics broken?  Is it broken
if you do this::

def foo():
    print a
    a = 1

I would rather say it's a programming error.  (It could be argued that
the compiler should warn about it; I would be in favor of that, if it
weren't impossible to implement correctly for all cases.)
> So my challenge (to see if I'm overlooking something) is to show me a > current Python program that only uses augmented assignment to cause a > variable to be made local, but does not raise UnboundLocalError when > the augmented assignment is run. If these examples don't exist, it > sounds like this is a bug in the current language design. > > def foo(): > # this causes 'a' to be made local; both current and proposed. > a = 5 > > # so 'a' here is local; both current and proposed. > a += 7 > > def foo(): > # without an assignment to 'a', this is currently always an error! > # it can only make sense if 'a' is global! > a += 7 > > If you can't do that, then this is a bug! Yes, it is a bug -- a bug in your code. I don't understand your reasoning here. Just because you can't use a construct under some circumstances, its semantics are buggy? Does division have a bug because you can't divide by zero? >>> What am I missing? >> >> Currently, augmented assignment has a very straightforward translation to >> plain assignment:: >> >> a += b is equivalent to >> a = a.__iadd__(b) >> >> (Not just ``a.__iadd__(b)``, as some people think at first. That's a >> problem, see below.) > > But this can't be treated as a simple macro expansion inside the > compiler, as that would cause the lhs to be evaluated twice. For > example in: > > a[fn_with_side_effects()] += b That's true, but it's also unimportant and a special case. >> With the proposal, it would be much more complicated and dependent on >> the context: "... it's the same as , but if the name would only >> be made a local by augmented assignment statements, it's automatically >> made nonlocal if there's a matching non-local binding, or global >> otherwise." Pretty scary. > > I agree completely that anything more complicated than striking the > augmented assignment from the list of statements that cause a variable > to be made local is scary. It should not depend on what global > bindings are present at this or that time, or whether some other local > assignment has or has not been executed prior to the augmented > assignment. But by "striking" the augassign from that list, you *are* making it that complicated. As I explained, augmented assignment *does* contain an assignment, so some namespace must be determined where to assign. That gives you the complication, because assignment is always local in Python, except if you explicitly put a global or nonlocal statement. > The augmented assignment statement should simply not be a > factor in the decision to make a variable local, and having it be a > factor is a bug in the current language design as it can never lead to > legal programs (at least before the addition of the "nonlocal" > statement), only misbehaving ones. Again, I can see no bug in language design just because you can't use every construct in every context. Nobody forces you to write functions with only an augassign in them. Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. 
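Georg's expansion is easy to observe directly.  In this sketch (Tracker
and key are invented names for illustration), the augmented assignment
performs one read, one write-back, and evaluates the index expression
only once:

class Tracker:
    def __init__(self):
        self.data = {'k': 1}
    def __getitem__(self, key):
        print('get', key)          # the read half of '+='
        return self.data[key]
    def __setitem__(self, key, value):
        print('set', key)          # the assignment half of '+='
        self.data[key] = value

def key():
    print('computing key')
    return 'k'

t = Tracker()
t[key()] += 1    # prints: computing key / get k / set k

The 'set k' line is the assignment half of the statement -- the part
that makes augmented assignment a binding operation when the target is
a bare name.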
From greg.ewing at canterbury.ac.nz  Sun Jun 13 02:07:25 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sun, 13 Jun 2010 12:07:25 +1200
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com>
Message-ID: <4C14213D.4040701@canterbury.ac.nz>

Terry Reedy wrote:

> An augmented assignment STATEMENT is both a STATEMENT, not an operator,
> and an ASSIGNMENT statement.

This is just a statement of the way things are, not an argument
for keeping them that way.

> -1+ on the proposal as it will lead to confusion and bugs.

I don't see how it would lead to substantially greater chance
of bugs than there currently is for other cases where you
reference an intended local without assigning to it.  If you do
that in any other way, it gets looked up as a global, which
almost certainly results in a NameError.  If the name happens
to clash with an existing global, then something more obscure
happens, but that's relatively rare.

It seems to me that getting an error message about a global
name in this case would be less confusing.  The thought process
would then be "Global?  Eh?  But it's supposed to be local!  Oh,
I see, I haven't initialised it, how silly of me."  Whereas
currently it's "Local?  It's supposed to be global, why the heck
does the stupid interpreter think it's local?"

-- 
Greg

From greg.ewing at canterbury.ac.nz  Sun Jun 13 02:19:38 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sun, 13 Jun 2010 12:19:38 +1200
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com>
Message-ID: <4C14241A.709@canterbury.ac.nz>

Georg Brandl wrote:

> a += b   is equivalent to
> a = a.__iadd__(b)

It's not quite the same as that, because if a stands for an
indexed expression, the index is only evaluated once.

> With the proposal, it would be much more complicated and dependent on
> the context: "... it's the same as , but if the name would only
> be made a local by augmented assignment statements, it's automatically
> made nonlocal if there's a matching non-local binding, or global
> otherwise."

It doesn't have to be as complicated as that -- you only need
to add something like "except that if the LHS is a bare name,
it does not imply that the name is local."  Any context-dependent
effects then follow from the existing scope rules.

-- 
Greg

From tjreedy at udel.edu  Sun Jun 13 02:25:37 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Sat, 12 Jun 2010 20:25:37 -0400
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com>
Message-ID: 

On 6/12/2010 11:43 AM, Bruce Frederiksen wrote:
> On Sat, Jun 12, 2010 at 3:10 AM, Terry Reedy wrote:
>>
>> On 6/11/2010 9:18 PM, Demur Rumed wrote:
>>
>>> I view the augmented assignment operators as different beasts.
>>
>> Your view is one that leads to buggy code. It is wrong in that respect.
>> An augmented assignment STATEMENT is both a STATEMENT, not an operator,
>> and an ASSIGNMENT statement. Misunderstanding this leads to buggy code
>> and posts on python-list asking "why doesn't my code work right?"
>
> I am curious about these buggy code examples.  Do you have any?

Yes. Think a bit, or search the python-list archives, where I have been
answering newbie questions and confusions for a decade.

> The standard assignment statement _binds_ the local variable.  But the
> augmented assignment only _rebinds_ it.  The augmented assignment does
> not give the variable a value if it doesn't already have one.
> I think that we all agree that if the function has an assignment to
> the variable some place else, the variable is a local variable.

And this proposal would break that simple rule. It also would break the
simple rule that one can only rebind a global or nonlocal name if one
explicitly declares them. Namespaces are complex enough that any
simplicity is a virtue.

> So we are considering the case where no assignment to the variable
> exists within the function, but there is an augmented assignment.

This *is* an assignment, documented in 6.2.1. Augmented assignment
statements as a subsection of 6.2. Assignment statements. There it says
"the assignment done by augmented assignment statements is handled the
same way as normal assignments." This proposal would add a fiddly
exception. If there is no previous assignment, it is a bug and should
be flagged.

To expand on what Georg said,

  x op= y

is equivalent to something like

  target('x') '=' <x> iop <y>

where <x> and target('x') are interpreter-level reference constructs,
'=' is internal, interpreter-level binding, and iop is the Python
operation on the object references. This proposal would break that
equivalence.

It would also make the meaning of x op= y depend on what other
statements (other than the exceptional global/nonlocal declarations)
are present in the same block. Consider:

x = 1
...
def f(y, z):
    x = y + z
    ...
    x /= 2.0

runs fine. Now during editing/refactoring, the x = y+z line is removed
or x is changed to something else. The program has a bug and an error
should be raised. This proposal would mask the bug and have the
x /= 2.0 statement change its meaning.

Now, one can tell what global vars a function rebinds by looking for a
global statement, which sensible programmers always put at the top of
the function body after any doc string. (I think that someone argued
that "well, globals can be mutated without declaration" as if
compounding a somewhat bad thing is a good thing. As a matter of style,
I think adding a global declaration for non-args mutated by a function,
when possible, would also be a good thing. Or the doc string should
mention such.)

In summary, this proposal creates several problems, all for the sake of
a programmer who does not want to type 'global x'. -10.

From greg.ewing at canterbury.ac.nz  Sun Jun 13 02:49:46 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sun, 13 Jun 2010 12:49:46 +1200
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com>
Message-ID: <4C142B2A.4060001@canterbury.ac.nz>

Georg Brandl wrote:

> Yes, but why does that make the current semantics broken?

The current semantics perversely make certain code useless
that would otherwise have an obvious and useful interpretation.

Maybe "suboptimal" would be a better adjective.

> Am 12.06.2010 21:40, schrieb Bruce Frederiksen:
>>
>> def foo():
>>     # without an assignment to 'a', this is currently always an error!
>>     # it can only make sense if 'a' is global!
>>     a += 7
>>
>> If you can't do that, then this is a bug!
>
> Yes, it is a bug -- a bug in your code.

But if the programmer intended a to be global, the *only*
reason it's a bug is the current somewhat arbitrary
interpretation placed on the augmented assignment.
-- Greg From g.brandl at gmx.net Sun Jun 13 12:12:52 2010 From: g.brandl at gmx.net (Georg Brandl) Date: Sun, 13 Jun 2010 12:12:52 +0200 Subject: [Python-ideas] Globalize lonely augmented assignment In-Reply-To: <4C14241A.709@canterbury.ac.nz> References: <4C12D738.4070906@gmail.com> <4C14241A.709@canterbury.ac.nz> Message-ID: Am 13.06.2010 02:19, schrieb Greg Ewing: > Georg Brandl wrote: > >> a += b is equivalent to >> a = a.__iadd__(b) > > It's not quite the same as that, because if a stands for an > indexed expression, the index is only evaluated once. > >> With the proposal, it would be much more complicated and dependent on >> the context: "... it's the same as , but if the name would only >> be made a local by augmented assignment statements, it's automatically >> made nonlocal if there's a matching non-local binding, or global >> otherwise." > > It doesn't have to be as complicated as that -- you only need > to add something like "except that if the LHS is a bare name, > it does not imply that the name is local." Any context-dependent > effects then follow from the existing scope rules. They don't -- as I said in the other mail, there is no "existing scope rule" that covers assignments that are implicitly global or nonlocal. Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. From g.brandl at gmx.net Sun Jun 13 12:16:08 2010 From: g.brandl at gmx.net (Georg Brandl) Date: Sun, 13 Jun 2010 12:16:08 +0200 Subject: [Python-ideas] Globalize lonely augmented assignment In-Reply-To: <4C142B2A.4060001@canterbury.ac.nz> References: <4C12D738.4070906@gmail.com> <4C142B2A.4060001@canterbury.ac.nz> Message-ID: Am 13.06.2010 02:49, schrieb Greg Ewing: > Georg Brandl wrote: > >> Yes, but why does that make the current semantics broken? > > The current semantics perversely make certain code useless > that would otherwise have an obvious and useful interpretation. While introducing a special case. > Maybe "suboptimal" would be a better adjective. Yes, I wouldn't argue against that, since it allows for subjectiveness :) > >Am 12.06.2010 21:40, schrieb Bruce Frederiksen: > >> >>> def foo(): >>> # without an assignment to 'a', this is currently always an error! >>> # it can only make sense if 'a' is global! >>> a += 7 >>> >>>If you can't do that, then this is a bug! >> >> Yes, it is a bug -- a bug in your code. > > But if the programmer intended a to be global, the *only* > reason it's a bug is the current somewhat arbitrary > interpretation placed on the augmented assignment. Hmm, I would call it consistent rather than arbitrary. Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. 
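For reference, the accumulator mentioned at the start of the thread is
the canonical victim of the current rule; under today's semantics it
needs an explicit nonlocal declaration (a minimal sketch, not anyone's
proposed semantics):

def make_accumulator():
    total = 0
    def add(n):
        nonlocal total   # omit this, and 'total += n' raises UnboundLocalError
        total += n
        return total
    return add

acc = make_accumulator()
acc(1)   # -> 1
acc(2)   # -> 3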
From junkmute at hotmail.com  Sun Jun 13 14:53:44 2010
From: junkmute at hotmail.com (Demur Rumed)
Date: Sun, 13 Jun 2010 08:53:44 -0400
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com>, <4C142B2A.4060001@canterbury.ac.nz>,
Message-ID: 

> > But if the programmer intended a to be global, the *only*
> > reason it's a bug is the current somewhat arbitrary
> > interpretation placed on the augmented assignment.
>
> Hmm, I would call it consistent rather than arbitrary.
>
> Georg

a = [1, 2, 3]

def f(x): a[x] = x

f(0)

Some like to think of []= as a form of augmented assignment. Currently,
[]= doesn't align with the other augmenteds on this point. That doesn't
seem very consistent. Add on that augmented assignment is the only
globalizing store statement which also dereferences, and consistency
doesn't seem to be a strong point against this proposal.

From solipsis at pitrou.net  Sun Jun 13 15:08:07 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sun, 13 Jun 2010 15:08:07 +0200
Subject: [Python-ideas] Globalize lonely augmented assignment
References: <4C12D738.4070906@gmail.com> <4C142B2A.4060001@canterbury.ac.nz>
Message-ID: <20100613150807.2361888d@pitrou.net>

On Sun, 13 Jun 2010 08:53:44 -0400
Demur Rumed wrote:
>
> Some like to think of []= as a form of augmented assignment

"x[...] = ..." calls x.__setitem__, it has nothing to do with
assignment.

Regards

Antoine.

From bruce at leapyear.org  Sun Jun 13 15:15:33 2010
From: bruce at leapyear.org (Bruce Leban)
Date: Sun, 13 Jun 2010 06:15:33 -0700
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com> <4C142B2A.4060001@canterbury.ac.nz>
Message-ID: 

Huh? That makes no sense. a[x]=x is not a = a [] x or anything like
that. Language decisions shouldn't be made based on wrong
understandings of how the language works.

As to the idea of turning a guaranteed run time error into a compile
time error, I'm usually in favor of that. If it doesn't muck up the
compiler.

--- Bruce (via android)

On Jun 13, 2010 5:54 AM, "Demur Rumed" wrote:

> > But if the programmer intended a to be global, the *only*
> > reason it's a bug is the current s...

a = [1, 2, 3]

def f(x): a[x] = x

f(0)

Some like to think of []= as a form of augmented assignment. Currently,
[]= doesn't align with the other augmenteds on this point. That doesn't
seem very consistent. Add on that augmented assignment is the only
globalizing store statement which also dereferences, and consistency
doesn't seem to be a strong point against this proposal.
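Antoine's point can be checked directly: subscript assignment
dispatches to __setitem__ on the existing object and never rebinds the
name, which is why Demur's f needs no global declaration.  A small
sketch (LoggingList is an invented name for illustration):

class LoggingList(list):
    def __setitem__(self, index, value):
        print('__setitem__', index, value)   # shows the method call
        list.__setitem__(self, index, value)

a = LoggingList([1, 2, 3])

def f(x):
    a[x] = x    # mutates the global list in place; the name 'a' is untouched

f(0)            # prints: __setitem__ 0 0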
From solipsis at pitrou.net  Sun Jun 13 15:36:34 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sun, 13 Jun 2010 15:36:34 +0200
Subject: [Python-ideas] local is safer than global
References: <4C12D738.4070906@gmail.com>
Message-ID: <20100613153634.6c01f914@pitrou.net>

On Fri, 11 Jun 2010 21:18:48 -0400
Demur Rumed wrote:
>
> For example,
>
> def f(x):
>     a.append(x)
>     if len(a) > 5: a = [5]
>
> If a is bound as a local, this throws an UnboundLocalError. Why then is
> it not set to try the global namespace, the place where we cannot be so
> certain of the exception?
>
> It comes down to the view of UnboundLocalError as a feature or a bug.

Certainly a feature. In case of ambiguity, a variable should be
considered local rather than global. It makes the language much safer.

It's also why I'm -1 on your proposal.

Regards

Antoine.

From cesare.di.mauro at gmail.com  Sun Jun 13 15:39:55 2010
From: cesare.di.mauro at gmail.com (Cesare Di Mauro)
Date: Sun, 13 Jun 2010 15:39:55 +0200
Subject: [Python-ideas] local is safer than global
In-Reply-To: <20100613153634.6c01f914@pitrou.net>
References: <4C12D738.4070906@gmail.com> <20100613153634.6c01f914@pitrou.net>
Message-ID: 

2010/6/13 Antoine Pitrou

> On Fri, 11 Jun 2010 21:18:48 -0400
> Demur Rumed wrote:
> >
> > For example,
> >
> > def f(x):
> >     a.append(x)
> >     if len(a) > 5: a = [5]
> >
> > If a is bound as a local, this throws an UnboundLocalError. Why then is
> it not set to try the global namespace, the place where we cannot be so
> certain of the exception?
> >
> > It comes down to the view of UnboundLocalError as a feature or a bug.
>
> Certainly a feature. In case of ambiguity, a variable should be
> considered local rather than global. It makes the language much safer.
>
> It's also why I'm -1 on your proposal.
>
> Regards
>
> Antoine.

Locals are also MUCH faster...

Cesare

From merwok at netwok.org  Sun Jun 13 20:39:28 2010
From: merwok at netwok.org (=?UTF-8?B?w4lyaWMgQXJhdWpv?=)
Date: Sun, 13 Jun 2010 20:39:28 +0200
Subject: [Python-ideas] reiter: decorator to make generators reiterable
In-Reply-To: 
References: <4C02857B.2030502@gmx.net>
Message-ID: <4C1525E0.3050300@netwok.org>

> It looks pretty cool to me. I'll probably include it in my personal project, and
> it would be cool if it'll be added to itertools. (Along with many other things
> that should be added to itertools.)
>
> Ram.

There is a collection of recipes in the docs. I think there has been a
discussion about making them available in the stdlib, but I'm not sure.
If you want to add things to itertools, open feature requests on the bug
tracker or launch a discussion thread here first.

Regards

From debatem1 at gmail.com  Sun Jun 13 23:15:20 2010
From: debatem1 at gmail.com (geremy condra)
Date: Sun, 13 Jun 2010 14:15:20 -0700
Subject: [Python-ideas] reiter: decorator to make generators reiterable
In-Reply-To: <4C1525E0.3050300@netwok.org>
References: <4C02857B.2030502@gmx.net> <4C1525E0.3050300@netwok.org>
Message-ID: 

On Sun, Jun 13, 2010 at 11:39 AM, Éric Araujo wrote:
>> It looks pretty cool to me. I'll probably include it in my personal project, and
>> it would be cool if it'll be added to itertools. (Along with many other things
>> that should be added to itertools.)
>>
>> Ram.
>
> There is a collection of recipes in the docs.  I think there has been a
> discussion about making them available in the stdlib, but I'm not sure.
> If you want to add things to itertools, open feature requests on the bug
> tracker or launch a discussion thread here first.

This would be much appreciated.

Geremy Condra

From greg.ewing at canterbury.ac.nz  Mon Jun 14 02:43:24 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Mon, 14 Jun 2010 12:43:24 +1200
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com> <4C14241A.709@canterbury.ac.nz>
Message-ID: <4C157B2C.1070006@canterbury.ac.nz>

Georg Brandl wrote:

> They don't -- as I said in the other mail, there is no "existing scope
> rule" that covers assignments that are implicitly global or nonlocal.

I don't see how you come to that conclusion.  You just
need to disregard the augmented assignment and follow
the normal rules based on the presence of plain
assignments and global and nonlocal declarations.

-- 
Greg

From greg.ewing at canterbury.ac.nz  Mon Jun 14 02:43:32 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Mon, 14 Jun 2010 12:43:32 +1200
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: 
References: <4C12D738.4070906@gmail.com> <4C142B2A.4060001@canterbury.ac.nz>
Message-ID: <4C157B34.5000802@canterbury.ac.nz>

Georg Brandl wrote:
> Am 13.06.2010 02:49, schrieb Greg Ewing:
>
>> Maybe "suboptimal" would be a better adjective.
>
> Yes, I wouldn't argue against that, since it allows for subjectiveness :)

All language design decisions are subjective.  (If they
weren't, there would be no decision to make.)

> Hmm, I would call it consistent rather than arbitrary.

But it's a foolish consistency, IMO.  It makes the description
in the manual about half a sentence shorter, at the expense
of semantics that are unintuitive and useless.

-- 
Greg

From ncoghlan at gmail.com  Mon Jun 14 04:03:58 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 14 Jun 2010 12:03:58 +1000
Subject: [Python-ideas] Globalize lonely augmented assignment
In-Reply-To: <4C157B2C.1070006@canterbury.ac.nz>
References: <4C12D738.4070906@gmail.com> <4C14241A.709@canterbury.ac.nz> <4C157B2C.1070006@canterbury.ac.nz>
Message-ID: <4C158E0E.7080600@gmail.com>

On 14/06/10 10:43, Greg Ewing wrote:
> Georg Brandl wrote:
>
>> They don't -- as I said in the other mail, there is no "existing scope
>> rule" that covers assignments that are implicitly global or nonlocal.
>
> I don't see how you come to that conclusion. You just
> need to disregard the augmented assignment and follow
> the normal rules based on the presence of plain
> assignments and global and nonlocal declarations.

What we would actually be doing is going from "augmented assignment
determines the scope to assign to based on the same rules as normal
assignment" to "augmented assignment determines the scope to assign to
based on the same rules as variable referencing" (i.e. wherever we find
the value when looking it up on the right hand side, we would put it
back in the same place).

The semantics aren't the problem here - they can be made perfectly
clear and reasonable. The only question is whether they are
*sufficiently* useful to justify the effort involved in getting from
the status quo to new (more sensible) semantics.

Keep in mind that effort is a lot more than just a patch to CPython to
fix our implementation, unit tests and documentation. There's also:

- doing the same thing for other implementations (e.g.
Jython, IronPython, PyPy)
- impact on CPython branches/forks (e.g. Unladen Swallow)
- impact on Python-like languages (e.g. Cython)
- updating assorted non-PSF documentation (including books)
- updating training materials

It's for exactly these reasons that the language moratorium has been
put in place: so we can't even be *tempted* by this kind of change
until everyone has had a chance to at least catch up to the 3.2 state
of the world.

So, for this suggestion to go any further, it will need:
- a PEP (one that acknowledges this is a post-moratorium change)
- solid examples of real-world code that would be improved by this
(e.g. from the standard library, or from major third party Python
applications)

Since the most this will save anyone is the occasional global or
nonlocal statement, I suspect the second point is going to be a
difficult bar to achieve.

Cheers,
Nick.

-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
---------------------------------------------------------------

From alexander.belopolsky at gmail.com  Tue Jun 15 02:09:02 2010
From: alexander.belopolsky at gmail.com (Alexander Belopolsky)
Date: Mon, 14 Jun 2010 20:09:02 -0400
Subject: [Python-ideas] Rename time module to "posixtime"
Message-ID: 

One of the common complaints about working with time values in Python,
is that some functionality is available in time module, some in
datetime module and some in both.

I propose a series of steps towards improving this situation.

1. Create posixtime.py initially containing just "from time import *"
2. Add python implementation of time.* functions to posixtime.py.
3. Rename time module to _posixtime and add time.py with a deprecation
warning and "from _posixtime import *".

Note that #2 may require moving some code from timemodule.c to
datetimemodule.c, but at the binary level code compiled from these
files is already linked together in datetimemodule.  Moving the
necessary code to datetime.c will help to eliminate current circular
dependency between time and datetime.

From mal at egenix.com  Tue Jun 15 10:07:27 2010
From: mal at egenix.com (M.-A. Lemburg)
Date: Tue, 15 Jun 2010 10:07:27 +0200
Subject: [Python-ideas] Rename time module to "posixtime"
In-Reply-To: 
References: 
Message-ID: <4C1734BF.6050902@egenix.com>

Alexander Belopolsky wrote:
> One of the common complaints about working with time values in Python,
> is that some functionality is available in time module, some in
> datetime module and some in both.
>
> I propose a series of steps towards improving this situation.
>
> 1. Create posixtime.py initially containing just "from time import *"
> 2. Add python implementation of time.* functions to posixtime.py.
> 3. Rename time module to _posixtime and add time.py with a deprecation
> warning and "from _posixtime import *".
>
> Note that #2 may require moving some code from timemodule.c to
> datetimemodule.c, but at the binary level code compiled from these
> files is already linked together in datetimemodule.  Moving the
> necessary code to datetime.c will help to eliminate current circular
> dependency between time and datetime.

I'm not sure I understand the point in renaming the module.

Note that the time module works based on Unix ticks (seconds
since the Unix Epoch), while the datetime module works based
on its own set of types.

As such, the two are different implementations for managing
date/time. Mixing them won't make things easier to understand.
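The split is visible right at the interactive prompt -- three stock
calls, three different representations of the current instant (the
sample float is the value Alexander quotes later in this thread):

>>> import time, datetime
>>> time.time()              # float seconds since the Unix Epoch,
...                          # e.g. 1276609479.559051
>>> time.localtime()         # a 9-element struct_time / time tuple
>>> datetime.datetime.now()  # a datetime object for the same instant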
The time module is very close to the C lib API, while the datetime
module focuses more on date/time storage in a more accessible way.

I agree on one point, though: the shared C APIs for getting the
current time would be better put into a separate C extension
which both can then load without creating circular references,
e.g. _getcurrenttime.c.

-- 
Marc-Andre Lemburg
eGenix.com

Professional Python Services directly from the Source  (#1, Jun 15 2010)
>>> Python/Zope Consulting and Support ...        http://www.egenix.com/
>>> mxODBC.Zope.Database.Adapter ...             http://zope.egenix.com/
>>> mxODBC, mxDateTime, mxTextTools ...        http://python.egenix.com/
________________________________________________________________________
2010-07-19: EuroPython 2010, Birmingham, UK                33 days to go

::: Try our new mxODBC.Connect Python Database Interface for free ! ::::

   eGenix.com Software, Skills and Services GmbH  Pastor-Loeh-Str.48
    D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg
           Registered at Amtsgericht Duesseldorf: HRB 46611
               http://www.egenix.com/company/contact/

From solipsis at pitrou.net  Tue Jun 15 12:16:26 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Tue, 15 Jun 2010 12:16:26 +0200
Subject: [Python-ideas] Rename time module to "posixtime"
References: 
Message-ID: <20100615121626.503d6318@pitrou.net>

On Mon, 14 Jun 2010 20:09:02 -0400
Alexander Belopolsky wrote:
> One of the common complaints about working with time values in Python,
> is that some functionality is available in time module, some in
> datetime module and some in both.

Is it a common complaint, really?
The common complaint, IMO, is that *none* of those two modules provides
a complete feature set in itself.

> I propose a series of steps towards improving this situation.
>
> 1. Create posixtime.py initially containing just "from time import *"
> 2. Add python implementation of time.* functions to posixtime.py.
> 3. Rename time module to _posixtime and add time.py with a deprecation
> warning and "from _posixtime import *".

I don't understand the purpose. I certainly like time.time() and I
don't see the point of making it go away (will we have to use one of
these "obvious" datetime-based one-liners instead?).

Regards

Antoine.

From merwok at netwok.org  Tue Jun 15 12:29:41 2010
From: merwok at netwok.org (=?UTF-8?B?w4lyaWMgQXJhdWpv?=)
Date: Tue, 15 Jun 2010 12:29:41 +0200
Subject: [Python-ideas] Rename time module to "posixtime"
In-Reply-To: <20100615121626.503d6318@pitrou.net>
References: <20100615121626.503d6318@pitrou.net>
Message-ID: <4C175615.7070309@netwok.org>

Le 15/06/2010 12:16, Antoine Pitrou a écrit :
> On Mon, 14 Jun 2010 20:09:02 -0400 Alexander Belopolsky wrote:
>> One of the common complaints about working with time values in Python,
>> is that some functionality is available in time module, some in
>> datetime module and some in both.
>
> Is it a common complaint, really?
> The common complaint, IMO, is that *none* of those two modules provides
> a complete feature set in itself.

The fact that we need dateutil or pytz to do some calculations is not
optimal but it's another concern. I agree that the overlap between
time, datetime and calendar is annoying. More specifically, the
multitude of types is bad (integer timestamp, time tuple, datetime
object). Some bad one-liners that use some datetime methods with
unpacked (*arg) time tuples coming from another datetime method scream
"shenanigans" to me.

> I don't understand the purpose.  I certainly like time.time() and I
> don't see the point of making it go away (will we have to use one of
> these "obvious" datetime-based one-liners instead?).

time.time will still be available, just under another name that makes
it clear it's a binding to a low-level C concept.

A user vote: +1 on renaming, +1 on improving datetime.

Regards

From alexander.belopolsky at gmail.com  Tue Jun 15 16:47:12 2010
From: alexander.belopolsky at gmail.com (Alexander Belopolsky)
Date: Tue, 15 Jun 2010 10:47:12 -0400
Subject: [Python-ideas] Rename time module to "posixtime"
In-Reply-To: <4C1734BF.6050902@egenix.com>
References: <4C1734BF.6050902@egenix.com>
Message-ID: 

On Tue, Jun 15, 2010 at 4:07 AM, M.-A. Lemburg wrote:
> Alexander Belopolsky wrote:
>> One of the common complaints about working with time values in Python,
>> is that some functionality is available in time module, some in
>> datetime module and some in both.
> ..
> I'm not sure I understand the point in renaming the module.

I've reread my post and have to admit that I did not explain this
point clearly.  There are currently three different ways to represent
a point in time: datetime object, unix timestamp, and a 9-element time
tuple.  While the datetime module has its share of criticism, its
interfaces are more user friendly and more "pythonic" than C inspired
time module interfaces.  For example,

>>> print(datetime.now())

is self-explanatory, but

>>> time.time()
1276609479.559051

requires a lot of explaining even to people with C/POSIX background.
For the latter, the immediate questions would be why the output is a
float and what is the precision of the result.

For people without C background, time module interfaces are cryptic
and arbitrary.  Why time() produces a float while localtime() produces
a tuple?  Why asctime takes tuple while ctime takes float?

Conversions between timestamp/timetuple and datetime are quite awkward
as well.  We have datetime.timetuple(), but no fromtimetuple() (you
have to write cryptic datetime(*tt[:6])).  With timestamps, it is the
opposite.  We have a full complement of
fromtimestamp/utcfromtimestamp, but no functions to go in the opposite
direction.

Finally, we have a 3-way name conflict: time is a module, a function
and a type.

I believe most applications are better off using datetime module
exclusively.  The time module should be used for interoperability with
POSIX interfaces, but not for general date/time manipulations.
Renaming the module will make its name match its purpose better.

> Note that the time module works based on Unix ticks (seconds
> since the Unix Epoch), while the datetime module works based
> on its own set of types.
>

I certainly know that.  What some people don't understand, though, is
that translation between Unix ticks (or more accurately POSIX time_t
value) and broken down UTC time is just an arithmetic operation.  The
formula is convoluted, but it is just a formula independent of any
system databases.  There is no good reason for a python application to
keep time values as POSIX timestamps rather than datetime objects.
The correspondence between the two is one to one, the ordering is the
same and arithmetic is cleaner with datetime because it is explicit
about (decimal) precision.

> As such, the two are different implementations for managing
> date/time. Mixing them won't make things easier to understand.
> The time module is very close to the C lib API, while the datetime
> module focuses more on date/time storage in a more accessible way.
>

I am not proposing mixing them.
To the contrary, I want to make it clearer that users should not mix
them: use posixtime module if you need to interoperate with posix
interfaces and datetime for everything else.

> I agree on one point, though: the shared C APIs for getting the
> current time would be better put into a separate C extension
> which both can then load without creating circular references,
> e.g. _getcurrenttime.c.

What I would like to do is to expose the POSIX gettimeofday interface
as both a C API and a python function returning a (seconds,
microseconds) tuple.  In my view, the C implementation should go to
_posixtimemodule.c and posixtime.py should have

from datetime import datetime, timedelta

def gettimeofday():
    # (seconds, microseconds) since the Unix epoch
    q, r = divmod(datetime.utcnow() - datetime(1970, 1, 1),
                  timedelta(seconds=1))
    return q, r.microseconds

From cs at zip.com.au  Wed Jun 16 01:01:02 2010
From: cs at zip.com.au (Cameron Simpson)
Date: Wed, 16 Jun 2010 09:01:02 +1000
Subject: [Python-ideas] Rename time module to "posixtime"
In-Reply-To: 
References: 
Message-ID: <20100615230102.GA8439@cskk.homeip.net>

On 15Jun2010 10:47, Alexander Belopolsky wrote:
| On Tue, Jun 15, 2010 at 4:07 AM, M.-A. Lemburg wrote:
| > Alexander Belopolsky wrote:
| >> One of the common complaints about working with time values in Python,
| >> is that some functionality is available in time module, some in
| >> datetime module and some in both.
| ..
| > I'm not sure I understand the point in renaming the module.
|
| I've reread my post and have to admit that I did not explain this
| point clearly.  There are currently three different ways to represent
| a point in time: datetime object, unix timestamp, and a 9-element time
| tuple.  While the datetime module has its share of criticism, its
| interfaces are more user friendly and more "pythonic" than C inspired
| time module interfaces.

Personally, I would be happy to see unix-timestamp and datetime object,
and see the time tuples go away.

The tuples are a direct mirror of the unix "struct tm" structures and
should really only be visible in a "posixtime" module of some kind - the
datetime objects are their direct equivalents anyway to my mind and should be
what are dealt with for human calendar stuff.

However, the unix timestamps should stay (or anything equivalent that
measures real world seconds, but since any epoch will do for that purpose
and we've got the unix one in use I'd stay with it). They represent an
absolute timeframe and let one do direct arithmetic. If I'm not doing
calendar things (or only doing them for presentation) then the unix
timestamp is usually my preferred time item.

| Conversions between timestamp/timetuple and datetime are quite awkward
| as well.  We have datetime.timetuple(), but no fromtimetuple() (you
| have to write cryptic datetime(*tt[:6])).  With timestamps, it is the
| opposite.  We have a full complement of
| fromtimestamp/utcfromtimestamp, but no functions to go in the opposite
| direction.

Yes, awful. Having spent a fair chunk of yesterday trying to obtain an
adapter (or chain of adapters) to join a 3G modem to an antenna with a
different end, I feel your pain. And I've felt it with the time
functions too.

Cheers,
-- 
Cameron Simpson DoD#743
http://www.cskk.ezoshosting.com/cs/

A man with one watch knows what time it is; a man with two watches is
never sure.
        - Lee Segall

From brett at python.org  Wed Jun 16 08:03:06 2010
From: brett at python.org (Brett Cannon)
Date: Tue, 15 Jun 2010 23:03:06 -0700
Subject: [Python-ideas] Rename time module to "posixtime"
In-Reply-To: <20100615230102.GA8439@cskk.homeip.net>
References: <20100615230102.GA8439@cskk.homeip.net>
Message-ID: 

On Tue, Jun 15, 2010 at 16:01, Cameron Simpson wrote:
> On 15Jun2010 10:47, Alexander Belopolsky wrote:
> | On Tue, Jun 15, 2010 at 4:07 AM, M.-A. Lemburg wrote:
> | > Alexander Belopolsky wrote:
> | >> One of the common complaints about working with time values in Python,
> | >> is that some functionality is available in time module, some in
> | >> datetime module and some in both.
> | ..
> | > I'm not sure I understand the point in renaming the module.
> |
> | I've reread my post and have to admit that I did not explain this
> | point clearly.  There are currently three different ways to represent
> | a point in time: datetime object, unix timestamp, and a 9-element time
> | tuple.  While the datetime module has its share of criticism, its
> | interfaces are more user friendly and more "pythonic" than C inspired
> | time module interfaces.
>
> Personally, I would be happy to see unix-timestamp and datetime object,
> and see the time tuples go away.
>
> The tuples are a direct mirror of the unix "struct tm" structures and
> should really only be visible in a "posixtime" module of some kind - the
> datetime objects are their direct equivalents anyway to my mind and should be
> what are dealt with for human calendar stuff.
>
> However, the unix timestamps should stay (or anything equivalent that
> measures real world seconds, but since any epoch will do for that purpose
> and we've got the unix one in use I'd stay with it). They represent an
> absolute timeframe and let one do direct arithmetic. If I'm not doing
> calendar things (or only doing them for presentation) then the unix
> timestamp is usually my preferred time item.

I agree with this sentiment. The UNIX timestamp stuff should stay in
time, the time tuple stuff should just go, and datetime should be
fleshed out to handle all the stuff that is not a direct wrapping
around libc. That way people deal with accurate datetimes as well as
well understood concepts with UNIX timestamps and datetime objects.
Time tuples are just not accurate enough.

Datetime objects can keep the ability to export and import from time
tuples for extensions that need to interface with 'struct tm' code,
but otherwise it should be considered a lossy datetime encoding that
we do not really support, else we are going to constantly be trying to
fix the time tuple to be more accurate when it was simply just not
well designed.

From mal at egenix.com  Wed Jun 16 09:56:08 2010
From: mal at egenix.com (M.-A. Lemburg)
Date: Wed, 16 Jun 2010 09:56:08 +0200
Subject: [Python-ideas] Rename time module to "posixtime"
In-Reply-To: 
References: <20100615230102.GA8439@cskk.homeip.net>
Message-ID: <4C188398.9010706@egenix.com>

Brett Cannon wrote:
> On Tue, Jun 15, 2010 at 16:01, Cameron Simpson wrote:
>> On 15Jun2010 10:47, Alexander Belopolsky wrote:
>> | On Tue, Jun 15, 2010 at 4:07 AM, M.-A. Lemburg wrote:
>> | > Alexander Belopolsky wrote:
>> | >> One of the common complaints about working with time values in Python,
>> | >> is that some functionality is available in time module, some in
>> | >> datetime module and some in both.
>> | ..
>> | > I'm not sure I understand the point in renaming the module.
>> |
>> | I've reread my post and have to admit that I did not explain this
>> | point clearly.  There are currently three different ways to represent
>> | a point in time: datetime object, unix timestamp, and a 9-element time
>> | tuple.  While the datetime module has its share of criticism, its
>> | interfaces are more user friendly and more "pythonic" than C inspired
>> | time module interfaces.
>>
>> Personally, I would be happy to see unix-timestamp and datetime object,
>> and see the time tuples go away.
>>
>> The tuples are a direct mirror of the unix "struct tm" structures and
>> should really only be visible in a "posixtime" module of some kind - the
>> datetime objects are their direct equivalents anyway to my mind and should be
>> what are dealt with for human calendar stuff.
>>
>> However, the unix timestamps should stay (or anything equivalent that
>> measures real world seconds, but since any epoch will do for that purpose
>> and we've got the unix one in use I'd stay with it). They represent an
>> absolute timeframe and let one do direct arithmetic. If I'm not doing
>> calendar things (or only doing them for presentation) then the unix
>> timestamp is usually my preferred time item.
>
> I agree with this sentiment. The UNIX timestamp stuff should stay in
> time, the time tuple stuff should just go, and datetime should be
> fleshed out to handle all the stuff that is not a direct wrapping
> around libc. That way people deal with accurate datetimes as well as
> well understood concepts with UNIX timestamps and datetime objects.
> Time tuples are just not accurate enough.
>
> Datetime objects can keep the ability to export and import from time
> tuples for extensions that need to interface with 'struct tm' code,
> but otherwise it should be considered a lossy datetime encoding that
> we do not really support, else we are going to constantly be trying to
> fix the time tuple to be more accurate when it was simply just not
> well designed.

-1.

Please note that the time module provides access to low-level OS
provided services which the datetime module does not expose.

You cannot seriously expect an application which happily uses the time
module (only) for its limited date/time functionality to have to be
rewritten just to stay compatible with Python.

Note that not all applications are interested in sub-second accuracy
and a computer without properly configured NTP and good internal clock
doesn't even provide this accuracy to begin with (even if they happily
pretend they do by exposing sub-second floats).

You might want to do that for Python4 and then add all those time
module functions using struct_time to the datetime module (returning
datetime instances), but for Python3, we've had the stdlib reorg
already. Renaming time -> posixtime falls into the same category.

The only improvement I could see would be to move calendar.timegm()
to the time module, since that's where it belongs (keeping an alias
in the calendar module, of course).

-- 
Marc-Andre Lemburg
eGenix.com

Professional Python Services directly from the Source  (#1, Jun 16 2010)
>>> Python/Zope Consulting and Support ...        http://www.egenix.com/
>>> mxODBC.Zope.Database.Adapter ...             http://zope.egenix.com/
>>> mxODBC, mxDateTime, mxTextTools ...        http://python.egenix.com/
________________________________________________________________________
2010-07-19: EuroPython 2010, Birmingham, UK                32 days to go

::: Try our new mxODBC.Connect Python Database Interface for free !
:::: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From solipsis at pitrou.net Wed Jun 16 11:45:44 2010 From: solipsis at pitrou.net (Antoine Pitrou) Date: Wed, 16 Jun 2010 11:45:44 +0200 Subject: [Python-ideas] Rename time module to "posixtime" References: <20100615230102.GA8439@cskk.homeip.net> Message-ID: <20100616114544.14696040@pitrou.net> On Tue, 15 Jun 2010 23:03:06 -0700 Brett Cannon wrote: > > I agree with this sentiment. The UNIX timestamp stuff should stay in > time, the time tuple stuff should just go, and datetime should be > fleshed out to handle all the stuff that is not a direct wrapping > around libc. That way people deal with accurate datetimes as well as > well understood concepts with UNIX timestamps and datetime objects. Agreed. What? We all agree? From mal at egenix.com Wed Jun 16 11:59:20 2010 From: mal at egenix.com (M.-A. Lemburg) Date: Wed, 16 Jun 2010 11:59:20 +0200 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: <20100616114544.14696040@pitrou.net> References: <20100615230102.GA8439@cskk.homeip.net> <20100616114544.14696040@pitrou.net> Message-ID: <4C18A078.8060001@egenix.com> Antoine Pitrou wrote: > On Tue, 15 Jun 2010 23:03:06 -0700 > Brett Cannon wrote: >> >> I agree with this sentiment. The UNIX timestamp stuff should stay in >> time, the time tuple stuff should just go, and datetime should be >> fleshed out to handle all the stuff that is not a direct wrapping >> around libc. That way people deal with accurate datetimes as well as >> well understood concepts with UNIX timestamps and datetime objects. > > Agreed. > > What? We all agree? I don't :-) We've done the stdlib reorg already, now it's time to focus on improving what's there, not removing things. -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Jun 16 2010) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2010-07-19: EuroPython 2010, Birmingham, UK 32 days to go ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From fuzzyman at voidspace.org.uk Wed Jun 16 13:09:51 2010 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Wed, 16 Jun 2010 12:09:51 +0100 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: <4C18A078.8060001@egenix.com> References: <20100615230102.GA8439@cskk.homeip.net> <20100616114544.14696040@pitrou.net> <4C18A078.8060001@egenix.com> Message-ID: On 16 June 2010 10:59, M.-A. Lemburg wrote: > Antoine Pitrou wrote: > > On Tue, 15 Jun 2010 23:03:06 -0700 > > Brett Cannon wrote: > >> > >> I agree with this sentiment. The UNIX timestamp stuff should stay in > >> time, the time tuple stuff should just go, and datetime should be > >> fleshed out to handle all the stuff that is not a direct wrapping > >> around libc. That way people deal with accurate datetimes as well as > >> well understood concepts with UNIX timestamps and datetime objects. > > > > Agreed. > > > > What? We all agree? 
> > I don't :-) > > We've done the stdlib reorg already, now it's time to focus on > improving what's there, not removing things. > The standard library will continue to evolve in Python 3 though, with both additions and deprecations - following our standard deprecation policy of course. Michael > > -- > Marc-Andre Lemburg > eGenix.com > > Professional Python Services directly from the Source (#1, Jun 16 2010) > >>> Python/Zope Consulting and Support ... http://www.egenix.com/ > >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ > >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ > ________________________________________________________________________ > 2010-07-19: EuroPython 2010, Birmingham, UK 32 days to go > > ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: > > > eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 > D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg > Registered at Amtsgericht Duesseldorf: HRB 46611 > http://www.egenix.com/company/contact/ > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- http://www.voidspace.org.uk -------------- next part -------------- An HTML attachment was scrubbed... URL: From solipsis at pitrou.net Wed Jun 16 13:25:32 2010 From: solipsis at pitrou.net (Antoine Pitrou) Date: Wed, 16 Jun 2010 13:25:32 +0200 Subject: [Python-ideas] Rename time module to "posixtime" References: <20100615230102.GA8439@cskk.homeip.net> <20100616114544.14696040@pitrou.net> <4C18A078.8060001@egenix.com> Message-ID: <20100616132532.3f66d618@pitrou.net> On Wed, 16 Jun 2010 11:59:20 +0200 "M.-A. Lemburg" wrote: > Antoine Pitrou wrote: > > On Tue, 15 Jun 2010 23:03:06 -0700 > > Brett Cannon wrote: > >> > >> I agree with this sentiment. The UNIX timestamp stuff should stay in > >> time, the time tuple stuff should just go, and datetime should be > >> fleshed out to handle all the stuff that is not a direct wrapping > >> around libc. That way people deal with accurate datetimes as well as > >> well understood concepts with UNIX timestamps and datetime objects. > > > > Agreed. > > > > What? We all agree? > > I don't :-) > > We've done the stdlib reorg already, now it's time to focus on > improving what's there, not removing things. Well, I agree that adding functionality is what's mostly needed in the date/time area right now. Removing old stuff should be quite low priority compared to that. (perhaps I should stop agreeing with everyone, sorry :-)) Regards Antoine. From alexander.belopolsky at gmail.com Wed Jun 16 15:44:59 2010 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Wed, 16 Jun 2010 09:44:59 -0400 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: <20100616132532.3f66d618@pitrou.net> References: <20100615230102.GA8439@cskk.homeip.net> <20100616114544.14696040@pitrou.net> <4C18A078.8060001@egenix.com> <20100616132532.3f66d618@pitrou.net> Message-ID: On Wed, Jun 16, 2010 at 7:25 AM, Antoine Pitrou wrote: > On Wed, 16 Jun 2010 11:59:20 +0200 > "M.-A. Lemburg" wrote: >> Antoine Pitrou wrote: >> > On Tue, 15 Jun 2010 23:03:06 -0700 >> > Brett Cannon wrote: >> >> >> >> I agree with this sentiment. The UNIX timestamp stuff should stay in >> >> time, the time tuple stuff should just go, and datetime should be >> >> fleshed out to handle all the stuff that is not a direct wrapping >> >> around libc. 
That way people deal with accurate datetimes as well as >> >> well understood concepts with UNIX timestamps and datetime objects. >> > >> > Agreed. >> > >> > What? We all agree? >> >> I don't :-) >> >> We've done the stdlib reorg already, now it's time to focus on >> improving what's there, not removing things. I don't either. :-) I am not proposing to eliminate any functionality. My proposal is primarily driven by the desire to untangle low level circular dependency between time and datetime modules and to clarify the purpose of keeping functionality in time module that duplicates that in datetime. Another part of my proposal is to provide pure python implementation for time module functions in terms of datetime API. This will serve as both executable documentation and best practices guide. (Assuming the best practice is to use datetime module exclusively.) Let me repeat the three step proposal: 1. Create posixtime.py initially containing just "from time import *" 2. Add python implementation of time.* functions to posixtime.py. 3. Rename time module to _posixtime and add time.py with a deprecation warning and "from _posixtime import *". I would not mind keeping time.py indefinitely with or without deprecation warnings.
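A minimal sketch of what step 2 could look like, with the function names mirroring the time module (the module layout is hypothetical, and the epoch and tm_isdst subtleties discussed elsewhere in this thread are glossed over):

import time as _time  # step 3 would turn this into _posixtime
from datetime import datetime, timedelta

_EPOCH = datetime(1970, 1, 1)

def gmtime(secs=None):
    # time.gmtime() expressed through the datetime API; returns a
    # struct_time just like the C version does
    if secs is None:
        secs = _time.time()
    return (_EPOCH + timedelta(seconds=secs)).timetuple()

def asctime(t=None):
    # time.asctime() via datetime.ctime(), which uses the same
    # 'Wed Jun 16 14:14:46 2010' layout
    if t is None:
        t = _time.localtime()
    return datetime(*t[:6]).ctime()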
From merwok at netwok.org Wed Jun 16 15:54:45 2010 From: merwok at netwok.org (Éric Araujo) Date: Wed, 16 Jun 2010 15:54:45 +0200 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: References: <20100615230102.GA8439@cskk.homeip.net> <20100616114544.14696040@pitrou.net> <4C18A078.8060001@egenix.com> <20100616132532.3f66d618@pitrou.net> Message-ID: <4C18D7A5.1090609@netwok.org> > I would not mind keeping time.py indefinitely with or without > deprecation warnings. As long as the documentation points beginners to datetime, and people looking for a unixy API to posixtime, I agree that deprecation warnings are not mandatory. From bruce at leapyear.org Wed Jun 16 16:48:23 2010 From: bruce at leapyear.org (Bruce Leban) Date: Wed, 16 Jun 2010 07:48:23 -0700 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: References: <20100615230102.GA8439@cskk.homeip.net> <4C188398.9010706@egenix.com> Message-ID: -1 to moving anything The situation is confusing and moving things will add to that confusion for a significant length of time. What I would instead suggest is improving the docs. If I could look in one place to find any time function it would mitigate the fact that they're implemented in multiple places. --- Bruce (via android) On Jun 16, 2010 12:56 AM, "M.-A. Lemburg" wrote: Brett Cannon wrote: > On Tue, Jun 15, 2010 at 16:01, Cameron Simpson wrote: >> On 15... -1. Please note that the time module provides access to low-level OS provided services which the datetime module does not expose. You cannot seriously expect an application which happily uses the time module (only) for its limited date/time functionality to be rewritten just to stay compatible with Python. Note that not all applications are interested in sub-second accuracy and a computer without properly configured NTP and a good internal clock doesn't even provide this accuracy to begin with (even if they happily pretend they do by exposing sub-second floats). You might want to do that for Python4 and then add all those time module functions using struct_time to the datetime module (returning datetime instances), but for Python3, we've had the stdlib reorg already. Renaming time -> posixtime falls into the same category. The only improvement I could see, would be to move calendar.timegm() to the time module, since that's where it belongs (keeping an alias in the calendar module, of course). -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Jun 16 2010) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Ad... 2010-07-19: EuroPython 2010, Birmingham, UK 32 days to go ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: eGenix.com Software, ... From alexander.belopolsky at gmail.com Wed Jun 16 17:25:59 2010 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Wed, 16 Jun 2010 11:25:59 -0400 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: References: <20100615230102.GA8439@cskk.homeip.net> <4C188398.9010706@egenix.com> Message-ID: On Wed, Jun 16, 2010 at 10:48 AM, Bruce Leban wrote: > -1 to moving anything > I am getting a feeling that you are attacking a strawman. Deprecating time module in favor of posixtime is only the third part of my proposal and I don't insist on any particular deprecation schedule. All I want is to give users one obvious way to avoid conflict between time and datetime.time. Note that since datetime only defines a handful of module level symbols, it is quite common to see from datetime import date, datetime, time and it is quite confusing when this conflicts with import time. > The situation is confusing and moving things will add to that confusion for > a significant length of time. > Let's be constructive. What specifically do you find confusing? Do you agree with the list of confusing things that I listed in my previous posts? > What I would instead suggest is improving the docs. If I could look in one > place to find any time function it would mitigate the fact that they're > implemented in multiple places. I think having datetime.datetime.strftime and time.strftime documented in one place will not help anyone. And I am not even mentioning datetime.time.strftime which is of course not the same as time.strftime, and that for the {date,datetime,time}.strftime function you need to specify the date/time object first and the format last, but for the time module strftime it is the other way around. Etc. etc. I think most users will be happier not knowing about the time module strftime function. Even better, not knowing about strftime at all and using "..".format(dt) instead. And where in the docs would you explain the following: :-) >>> from datetime import datetime >>> import time >>> time.strftime("%c %z %Z", datetime.utcnow().utctimetuple()) 'Wed Jun 16 15:22:15 2010 -0500 EST' (Note utc in datetime calls and EST in time.strftime output.) From orsenthil at gmail.com Wed Jun 16 19:10:00 2010 From: orsenthil at gmail.com (Senthil Kumaran) Date: Wed, 16 Jun 2010 22:40:00 +0530 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: References: <20100615230102.GA8439@cskk.homeip.net> <20100616114544.14696040@pitrou.net> <4C18A078.8060001@egenix.com> <20100616132532.3f66d618@pitrou.net> Message-ID: <20100616171000.GA5731@remy> On Wed, Jun 16, 2010 at 09:44:59AM -0400, Alexander Belopolsky wrote: > > I am not proposing to eliminate any functionality.
My proposal is > primarily driven by the desire to untangle low level circular > dependency between time and datetime modules and to clarify the > purpose of keeping functionality in time module that duplicates that > in datetime. > > Another part of my proposal is to provide pure python implementation > for time module functions in terms of datetime API. This will serve > as both executable documentation and best practices guide. (Assuming > the best practice is to use datetime module exclusively.) > This is a clear idea you are having for datetime + (possibly a) pure python posixtime module. The reference implementation as well as documentation will definitely be beneficial in the long run. So, +1 for your proposal. > I would not mind keeping time.py indefinitely with or without > deprecation warnings. Yeah, this would ensure the backwards compatibility. -- Senthil Have a place for everything and keep the thing somewhere else; this is not advice, it is merely custom. -- Mark Twain From brett at python.org Wed Jun 16 19:37:42 2010 From: brett at python.org (Brett Cannon) Date: Wed, 16 Jun 2010 10:37:42 -0700 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: <4C188398.9010706@egenix.com> References: <20100615230102.GA8439@cskk.homeip.net> <4C188398.9010706@egenix.com> Message-ID: On Wed, Jun 16, 2010 at 00:56, M.-A. Lemburg wrote: > Brett Cannon wrote: >> On Tue, Jun 15, 2010 at 16:01, Cameron Simpson wrote: >>> On 15Jun2010 10:47, Alexander Belopolsky wrote: >>> | On Tue, Jun 15, 2010 at 4:07 AM, M.-A. Lemburg wrote: >>> | > Alexander Belopolsky wrote: >>> | >> One of the common complains about working with time values in Python, >>> | >> is that it some functionality is available in time module, some in >>> | >> datetime module and some in both. >>> | .. >>> | > I'm not sure I understand the point in renaming the module. >>> | >>> | I've reread my post and have to admit that I did not explain this >>> | point clearly. ?There are currently three different ways to represent >>> | a point in time: datetime object, unix timestamp, and a 9-element time >>> | tuple. ?While the datetime module has its share of criticism, its >>> | interfaces are more user friendly and more "pythonic" than C inspired >>> | time module interfaces. >>> >>> Personally, I would be happy to see unix-timestamp and datetime object, >>> and see the time tuples go away. >>> >>> The tuples are a direct mirror of the unix "struct tm" structures and and >>> should really only be visible in a "posixtime" module of some kind - the >>> datetime objects are their direct equivalents anyway to my mind and should be >>> what are dealt with for human calendar stuff. >>> >>> However, the unix timestamps should stay (or anything equivalent that >>> measures real world seconds, but since any epoch will do for that purpose >>> and we've got the unix one in use I'd stay with it). They represent an >>> absolute timeframe and let one do direct arithmetic. If I'm not doing >>> calendar things (or only doing them for presentation) then the unix >>> timestamp is usually my preferred time item. >> >> I agree with this sentiment. The UNIX timestamp stuff should stay in >> time, the time tuple stuff should just go, and datetime should be >> fleshed out to handle all the stuff that is not a direct wrapping >> around libc. That way people deal with accurate datetimes as well as >> well understood concepts with UNIX timestamps and datetime objects. >> Time tuples are just not accurate enough. 
>> >> Datetime objects can keep the ability to export and import from time >> tuples for extensions that need to interface with 'struct tm' code, >> but otherwise it should be considered a lossy datetime encoding that >> we do not really support else we are going to constantly be trying to >> fix the time tuple to be more accurate when it was simply just not >> well designed. > > -1. > > Please note that the time module provides access to low-level OS > provided services which the datetime module does not expose. > > You cannot seriously expect an application which happily uses > the time module (only) for its limited date/time functionality > to be rewritten just to stay compatible with Python. No, but the work to move people off of time tuples and over to datetime objects or timestamps can start so that the next stdlib reorg can drop time tuples without causing major pains. > > Note that not all applications are interested in sub-second > accuracy and a computer without properly configured NTP and a good > internal clock doesn't even provide this accuracy to begin with > (even if they happily pretend they do by exposing sub-second > floats). > > You might want to do that for Python4 and then add all those > time module functions using struct_time to the datetime > module (returning datetime instances), but for Python3, we've > had the stdlib reorg already. > > Renaming time -> posixtime falls into the same category. > I don't care as much about the rename as I do about losing time tuples in the long run. > The only improvement I could see, would be to move > calendar.timegm() to the time module, since that's where > it belongs (keeping an alias in the calendar module, of > course). That should definitely happen at some point. From alexander.belopolsky at gmail.com Wed Jun 16 20:14:46 2010 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Wed, 16 Jun 2010 14:14:46 -0400 Subject: [Python-ideas] Moving calendar.timegm() to time module Was: Rename time module to "posixtime" Message-ID: On Wed, Jun 16, 2010 at 1:37 PM, Brett Cannon wrote: .. >> The only improvement I could see, would be to move >> calendar.timegm() to the time module, since that's where >> it belongs (keeping an alias in the calendar module, of >> course). > > That should definitely happen at some point. > This is discussed in Issue 6280 <http://bugs.python.org/issue6280>. There are several issues with this proposal: 1. According to help(time), """ The Epoch is system-defined; on Unix, it is generally January 1st, 1970. The actual value can be retrieved by calling gmtime(0). """ Current calendar.gmtime implementation ignores this. The solution to this may be to change help(time), though. 2. Current calendar.gmtime supports float values for hours, minutes and seconds in timedelta tuple. This is probably an unintended implementation artifact, but it is relied upon even in stdlib. See http://bugs.python.org/issue6280#msg107808 . From tjreedy at udel.edu Wed Jun 16 20:56:14 2010 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 16 Jun 2010 14:56:14 -0400 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: <4C18A078.8060001@egenix.com> References: <20100615230102.GA8439@cskk.homeip.net> <20100616114544.14696040@pitrou.net> <4C18A078.8060001@egenix.com> Message-ID: On 6/16/2010 5:59 AM, M.-A. Lemburg wrote: > We've done the stdlib reorg already, now it's time to focus on > improving what's there, not removing things.
I believe deprecating (in the docs, at least) confusing near duplicates would be improvement for newcomers. From solipsis at pitrou.net Wed Jun 16 21:01:26 2010 From: solipsis at pitrou.net (Antoine Pitrou) Date: Wed, 16 Jun 2010 21:01:26 +0200 Subject: [Python-ideas] Moving calendar.timegm() to time module Was: Rename time module to "posixtime" References: Message-ID: <20100616210126.4a40a1db@pitrou.net> On Wed, 16 Jun 2010 14:14:46 -0400 Alexander Belopolsky wrote: > > This is discussed in Issue 6280 . > There are several issues with this proposal: > > 1. According to help(time), > > """ > The Epoch is system-defined; on Unix, it is generally January 1st, 1970. What does *generally* mean? Are there, practically, other systems where the epoch is another reference? From mal at egenix.com Wed Jun 16 21:49:18 2010 From: mal at egenix.com (M.-A. Lemburg) Date: Wed, 16 Jun 2010 21:49:18 +0200 Subject: [Python-ideas] Moving calendar.timegm() to time module Was: Rename time module to "posixtime" In-Reply-To: References: Message-ID: <4C192ABE.3000000@egenix.com> Alexander Belopolsky wrote: > On Wed, Jun 16, 2010 at 1:37 PM, Brett Cannon wrote: > .. >>> The only improvement I could see, would be to move >>> calendar.timegm() to the time module, since that's where >>> it belongs (keeping an alias in the calendar module, of >>> course). >> >> That should definitely happen at some point. >> > > This is discussed in Issue 6280 . > There are several issues with this proposal: > > 1. According to help(time), > > """ > The Epoch is system-defined; on Unix, it is generally January 1st, 1970. > The actual value can be retrieved by calling gmtime(0). > """ > > Current calendar.gmtime implementation ignores this. The solution to > this, may be to change help(time), though. > > 2. Current calendar.gmtime supports float values for hours, minutes > and seconds in timedelta tuple. This is probably an unintended > implementation artifact, but it is relied upon even in stdlib. See > http://bugs.python.org/issue6280#msg107808 . I think you have this mixed up: I was suggesting to move calendar.timegm() to the time module, not the non existing calendar.gmtime() :-) If you're looking for a portable implementation in C that doesn't mess with TZ hacks, have a look at the mxDateTime sources. -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Jun 16 2010) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2010-07-19: EuroPython 2010, Birmingham, UK 32 days to go ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From mal at egenix.com Wed Jun 16 22:31:54 2010 From: mal at egenix.com (M.-A. Lemburg) Date: Wed, 16 Jun 2010 22:31:54 +0200 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: References: <20100615230102.GA8439@cskk.homeip.net> <20100616114544.14696040@pitrou.net> <4C18A078.8060001@egenix.com> Message-ID: <4C1934BA.4010005@egenix.com> Terry Reedy wrote: > On 6/16/2010 5:59 AM, M.-A. 
Lemburg wrote: > >> We've done the stdlib reorg already, now it's time to focus on >> improving what's there, not removing things. > > I believe deprecating (in the docs, at least) confusing near duplicates > would be improvement for newcomers. Agreed. It would be useful to add a note to the time module docs pointing newbies directly to the datetime module. For experts, the time module is still very useful to have around. -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Jun 16 2010) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2010-07-19: EuroPython 2010, Birmingham, UK 32 days to go ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From ncoghlan at gmail.com Wed Jun 16 23:30:44 2010 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 17 Jun 2010 07:30:44 +1000 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: <4C1934BA.4010005@egenix.com> References: <20100615230102.GA8439@cskk.homeip.net> <20100616114544.14696040@pitrou.net> <4C18A078.8060001@egenix.com> <4C1934BA.4010005@egenix.com> Message-ID: On Thu, Jun 17, 2010 at 6:31 AM, M.-A. Lemburg wrote: > Terry Reedy wrote: >> I believe deprecating (in the docs, at least) confusing near duplicates >> would be improvement for newcomers. > > Agreed. It would be useful to add a note to the time module > docs pointing newbies directly to the datetime module. For experts, > the time module is still very useful to have around. For myself, I think a long term plan (i.e. Py4k'ish) to move to a _posixtime/posixtime C/Python hybrid implementation for the POSIX timestamp manipulation would be beneficial. Largely, as Alexander points out, to make the distinction between the time module, the time.time function and datetime.time objects significantly clearer than it is now: >>> import time as time1 >>> from time import time as time2 >>> from datetime import time as time3 >>> time1 <module 'time' (built-in)> >>> time2 <built-in function time> >>> time3 <class 'datetime.time'> We can at least cut down the naming conflict to only exist between the latter two items. Such a transition could be made in the documentation (with a note in the "posixtime" documentation to say that it remains available as the time module for backwards compatibility reasons) as early as 3.2. Of course, any functionality gaps identified in datetime would still need to be closed. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
From bruce at leapyear.org Thu Jun 17 07:01:16 2010 From: bruce at leapyear.org (Bruce Leban) Date: Wed, 16 Jun 2010 22:01:16 -0700 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: References: <20100615230102.GA8439@cskk.homeip.net> <4C188398.9010706@egenix.com> Message-ID: OK, let me revise that. There were lots of different things discussed on this thread. I agree that import time confounds with from datetime import time and it would be nice to fix that. An alias would be reasonable. Change the documentation to refer to the new name and leave the old name for legacy apps. I know TOOWTDI but if the time module has two names until python 4 is that a major problem? I'm not in favor of moving things around among the modules etc. When you say "And where in the docs would you explain the following: :-)" that sounds like you're saying "this is too confusing we shouldn't document it." To which I can only say :-( --- Bruce On Wed, Jun 16, 2010 at 8:25 AM, Alexander Belopolsky < alexander.belopolsky at gmail.com> wrote: > On Wed, Jun 16, 2010 at 10:48 AM, Bruce Leban wrote: > > -1 to moving anything > > > I am getting a feeling that you are attacking a strawman. Deprecating > time module in favor of posixtime is only the third part of my > proposal and I don't insist on any particular deprecation schedule. > All I want is to give users one obvious way to avoid conflict between > time and datetime.time. Note that since datetime only defines a > handful of module level symbols, it is quite common to see from > datetime import date, datetime, time and it is quite confusing when > this conflicts with import time. > > > The situation is confusing and moving things will add to that confusion > for > > a significant length of time. > > > Let's be constructive. What specifically do you find confusing? Do > you agree with the list of confusing things that I listed in my > previous posts? > > > What I would instead suggest is improving the docs. If I could look in > one > > place to find any time function it would mitigate the fact that they're > > implemented in multiple places. > > I think having datetime.datetime.strftime and time.strftime documented > in one place will not help anyone. And I am not even mentioning > datetime.time.strftime which is of course not the same as > time.strftime, and that for the {date,datetime,time}.strftime function you > need to specify the date/time object first and the format last, but for > the time module strftime it is the other way around. Etc. etc. > > I think most users will be happier not knowing about the time module > strftime function. Even better, not knowing about strftime at all and > using "..".format(dt) instead. > > And where in the docs would you explain the following: :-) > > >>> from datetime import datetime > >>> import time > >>> time.strftime("%c %z %Z", datetime.utcnow().utctimetuple()) > 'Wed Jun 16 15:22:15 2010 -0500 EST' > > (Note utc in datetime calls and EST in time.strftime output.) From alexander.belopolsky at gmail.com Thu Jun 17 15:38:02 2010 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Thu, 17 Jun 2010 09:38:02 -0400 Subject: [Python-ideas] Issue9004 Was: Rename time module to "posixtime" Message-ID: On Thu, Jun 17, 2010 at 1:01 AM, Bruce Leban wrote: .. > When you say "And where in the docs would you explain the following: :-)" > that sounds like you're saying "this is too confusing we shouldn't document > it." To which I can only say :-( I presented what I consider to be a bug. I opened issue 9004, [1] "datetime.utctimetuple() should not set tm_isdst flag to 0", for that. There is no point in documenting the following as expected behavior: >>> time.strftime('%c %z %Z', datetime.utcnow().utctimetuple()) 'Wed Jun 16 03:26:26 2010 -0500 EST' I believe it is better to fix it so that it produces >>> time.strftime('%c %z %Z', datetime.utcnow().utctimetuple()) 'Wed Jun 16 03:26:26 2010 ' instead. This, however, shows a limitation of the datetime to timetuple conversion: there is currently no mechanism to store daylight saving time info in a datetime object. See issue 9013. [2] Rather than fixing that, it would be much better to eliminate the need for datetime to timetuple conversion in the first place. [1] http://bugs.python.org/issue9004 [2] http://bugs.python.org/issue9013
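Formatting through the datetime object directly sidesteps the pitfall, because a naive datetime simply leaves %z and %Z empty rather than borrowing the local zone; an illustrative session (the exact output depends on platform and locale):

>>> from datetime import datetime
>>> datetime.utcnow().strftime('%c %z %Z')
'Wed Jun 16 03:26:26 2010  '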
From cs at zip.com.au Fri Jun 18 00:50:28 2010 From: cs at zip.com.au (Cameron Simpson) Date: Fri, 18 Jun 2010 08:50:28 +1000 Subject: [Python-ideas] Rename time module to "posixtime" In-Reply-To: References: Message-ID: <20100617225028.GA19332@cskk.homeip.net> On 16Jun2010 10:37, Brett Cannon wrote: | On Wed, Jun 16, 2010 at 00:56, M.-A. Lemburg wrote: | > Brett Cannon wrote: | >> On Tue, Jun 15, 2010 at 16:01, Cameron Simpson wrote: | >>> On 15Jun2010 10:47, Alexander Belopolsky wrote: | >>> | I've reread my post and have to admit that I did not explain this | >>> | point clearly. There are currently three different ways to represent | >>> | a point in time: datetime object, unix timestamp, and a 9-element time | >>> | tuple. While the datetime module has its share of criticism, its | >>> | interfaces are more user friendly and more "pythonic" than C inspired | >>> | time module interfaces. | >>> | >>> Personally, I would be happy to see unix-timestamp and datetime object, | >>> and see the time tuples go away. [...] | >> I agree with this sentiment. The UNIX timestamp stuff should stay in | >> time, the time tuple stuff should just go, and datetime should be | >> fleshed out to handle all the stuff that is not a direct wrapping | >> around libc. [...] | > -1. | > Please note that the time module provides access to low-level OS | > provided services which the datetime module does not expose. | > You cannot seriously expect an application which happily uses | > the time module (only) for its limited date/time functionality | > to be rewritten just to stay compatible with Python. | | No, but the work to move people off of time tuples and over to | datetime objects or timestamps can start so that the next stdlib reorg | can drop time tuples without causing major pains. "I agree with this sentiment." :-) I, also, was insufficiently clear. I don't want any code to break, and Alexander's proposal describes a non-breaking approach. I would like my earlier statement to be read as wanting it to be possible to work with unixtimes and datetimes and never need to use a time tuple, and for the documentation to direct users to datetimes and unixtimes as the obvious and sufficient way to do things. [...] | I don't care as much about the rename as I do about losing time tuples | in the long run. | | > The only improvement I could see, would be to move | > calendar.timegm() to the time module, since that's where | > it belongs (keeping an alias in the calendar module, of | > course). | | That should definitely happen at some point. +1 to the above too. That the "Use the following functions to convert between time representations" table near the top of the "time" module documentation reaches for the calendar module grates. Cheers, -- Cameron Simpson DoD#743 http://www.cskk.ezoshosting.com/cs/ In any event, this is a straw herring for debate. - solovay at netcom.com (Andrew Solovay) From danieldelay at gmail.com Fri Jun 25 21:08:31 2010 From: danieldelay at gmail.com (Daniel DELAY) Date: Fri, 25 Jun 2010 21:08:31 +0200 Subject: [Python-ideas] explicitation lines in python ?
Message-ID: <4C24FEAF.4030304@gmail.com> If we could explicitate a too complex expression in an indented next line, I would use this feature very often : htmltable = ''.join( '<tr>{}</tr>'.format(htmlline) for line in table) : # main line htmlline : ''.join( '<td>{}</td>'.format(cell) for cell in line) # explicitation(s) line(s) (Sorry if this has already been discussed earlier on this list, I have not read all the archives) ******* in details : List comprehension "[<expression> for x in mylist]" often greatly improves readability of python programs, when <expression> is not too complex. When <expression> is too complex (ex: nested lists), this becomes unreadable, so we have to find another solution : a) defining a function expression(x), or an iterator function, which will only be used once in the code b) or dropping this beautiful syntax to replace it with the very basic list construction : newlist = [] for x in myiterable : newlist.append(<expression>) I often choose b), but I dislike both solutions : - in solution a) the function def can be far from the list comprehension; in fact the instructions to build the new list are split in two different places in the code. - solution b) seems a bit better to me, but the fact that we build a new list from myiterable is not visible at a glance, unlike list comprehensions. Renouncing list comprehensions occurs rather often when I write python code. I think we could greatly improve readability if we could keep list comprehension anywhere in all cases, but when necessary explicit a too complex expression in an indented line : htmltable = ''.join( '<tr>{}</tr>'.format(htmlline) for line in table) : # main line htmlline : ''.join( '<td>{}</td>'.format(cell) for cell in line) # explicitation(s) line(s) In the case the main line is the header of a "classical" indented block (starting with "for", "if", "with"...), this indented block would simply follow the explicitation(s) line(s). The explicitation lines can be surely identified as the lines that begin with "identifier :" (when we are not in an unclosed dict) with open('data.txt') as f : if line in enumerate(mylist) : # main line mylist : f.read().strip().lower() # explicitation(s) line(s) print line # "classical" indented block Another possible use of "explicitation lines" is a coding style which would start with "the whole picture" first, completing with details after, which is the usual way we mentally solve problems. Let's take an example : we want to write a function which returns a multiplication table in a simple html document. When I solve this problem, I think a bit like this : - I need to return an html page. For that I need a "header" and a "body". My body will contain an "htmltable", which will be built from a "table" of numbers etc. My code could look like this : def tag(content, *tags): # little convenience function retval = content for t in tags: retval = '<{0}>{1}</{0}>'.format(t, retval) return retval def xhtml_mult_table(a, b): return tag(header + body, 'html') : header : tag('multiplication table', 'title') body : tag(htmltable, 'tbody', 'table', 'body') : htmltable : ''.join(tag(xhtmlline, 'tr') for line in table) : table : headerline + otherlines : headerline : [[''] + range(a)] otherlines : [[y] + [x*y for x in range(a)] for y in range(b)] xhtmlline : ''.join(tag(str(cell), 'td') for cell in line) This example is a "heavy use" of the "explicitation line" feature, to illustrate how it could work. I don't mean this should replace the "classical" syntax everywhere possible, but for me this would be a nice way to explicitate complex expressions from time to time, and the ability to use list comprehension everywhere I want. Daniel
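For comparison, a sketch of the nested example above in today's syntax, using option a) with a named helper (the helper name is illustrative; table is assumed to be a sequence of rows):

def format_line(line):
    # one <td> per cell, the whole line wrapped in a <tr>
    return '<tr>{}</tr>'.format(''.join('<td>{}</td>'.format(cell) for cell in line))

htmltable = ''.join(format_line(line) for line in table)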
From danieldelay at gmail.com Sat Jun 26 04:25:09 2010 From: danieldelay at gmail.com (Daniel DELAY) Date: Sat, 26 Jun 2010 04:25:09 +0200 Subject: [Python-ideas] explicitation lines in python ? In-Reply-To: <4C24FEAF.4030304@gmail.com> References: <4C24FEAF.4030304@gmail.com> Message-ID: <4C256505.9000703@gmail.com> Le 25/06/2010 21:08, Daniel DELAY a écrit : > with open('data.txt') as f : > if line in enumerate(mylist) : # main line > mylist : f.read().strip().lower() # explicitation(s) line(s) > print line # "classical" indented block oops, sorry, I meant something like : with open('data.txt') as f : for i, line in enumerate(mylist) : # main line mylist : f.read().split('\n') # explicitation(s) line(s) print(i, line) # "classical" indented block From guido at python.org Sat Jun 26 04:36:15 2010 From: guido at python.org (Guido van Rossum) Date: Fri, 25 Jun 2010 19:36:15 -0700 Subject: [Python-ideas] explicitation lines in python ? In-Reply-To: <4C256505.9000703@gmail.com> References: <4C24FEAF.4030304@gmail.com> <4C256505.9000703@gmail.com> Message-ID: On Fri, Jun 25, 2010 at 7:25 PM, Daniel DELAY wrote: > Le 25/06/2010 21:08, Daniel DELAY a écrit : >> >> with open('data.txt') as f : >> if line in enumerate(mylist) : # main line >> mylist : f.read().strip().lower() # explicitation(s) line(s) >> print line # "classical" indented block >> > oops, sorry, I meant something like : > > with open('data.txt') as f : > for i, line in enumerate(mylist) : # main line > mylist : f.read().split('\n') # explicitation(s) line(s) > print(i, line) # "classical" indented block I don't know where you got the word "explicitation" -- I've never heard of it. (Maybe it's French? You sound French. :-) However, this feature existed in ABC under the name "refinement". See http://homepages.cwi.nl/~steven/abc/qr.html#Refinements -- --Guido van Rossum (python.org/~guido) From ghazel at gmail.com Sat Jun 26 04:48:08 2010 From: ghazel at gmail.com (ghazel at gmail.com) Date: Fri, 25 Jun 2010 19:48:08 -0700 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals Message-ID: Hi, I'm interested in a feature which allows users to discard the locals and globals references from frames held by a traceback object. Currently, traceback objects are used when capturing and re-raising exceptions. However, they hold a reference to all frames, which hold a reference to their locals and globals. These are not needed by the default traceback output, and can cause serious memory bloat if a reference to a traceback object is kept for any significant length of time, and there are even big red warnings in the Python docs about using them in one frame. ( http://docs.python.org/release/3.1/library/sys.html#sys.exc_info ). Example usage would be something like: import sys try: 1/0 except: t, v, tb = sys.exc_info() tb.clean() # ... much later ... raise t, v, tb Which would be basically a function to do this: import sys try: 1/0 except: t, v, tb = sys.exc_info() c = tb while c: c.tb_frame.f_locals = None c.tb_frame.f_globals = None c = c.tb_next # ... much later ... raise t, v, tb Twisted has done a very similar thing with their twisted.python.failure.Failure object, which stringifies the traceback data and discards the reference to the Python traceback entirely ( http://twistedmatrix.com/trac/browser/tags/releases/twisted-10.0.0/twisted/python/failure.py#L437 ) - they also replicate a lot of traceback printing functions to make use of this stringified data. It's worth noting that cgitb and other applications make use of locals and globals in their traceback output. However, I believe the vast majority of traceback usage does not make use of these references, and a significant penalty is paid as a result. Is there any interest in such a feature? -Greg
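When the traceback is only ever needed for display, the stringify-and-drop approach Twisted takes is already possible with the stdlib alone; a minimal sketch (unlike the proposed tb.clean(), it gives up re-raising with the original traceback):

import sys
import traceback

try:
    1/0
except ZeroDivisionError:
    t, v, tb = sys.exc_info()
    # keep only the rendered text; once tb is dropped, the frames
    # (with their locals and globals) become collectable
    stack_text = ''.join(traceback.format_tb(tb))
    del tb
# ... much later: the frames are gone, but their rendering survives
print stack_text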
From guido at python.org Sat Jun 26 04:58:38 2010 From: guido at python.org (Guido van Rossum) Date: Fri, 25 Jun 2010 19:58:38 -0700 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: References: Message-ID: Do you have profiling data to support your claim? On Fri, Jun 25, 2010 at 7:48 PM, wrote: > Hi, > > I'm interested in a feature which allows users to discard the locals > and globals references from frames held by a traceback object. > > Currently, traceback objects are used when capturing and re-raising > exceptions. However, they hold a reference to all frames, which hold a > reference to their locals and globals. These are not needed by the > default traceback output, and can cause serious memory bloat if a > reference to a traceback object is kept for any significant length of > time, and there are even big red warnings in the Python docs about > using them in one frame. ( > http://docs.python.org/release/3.1/library/sys.html#sys.exc_info ). > > Example usage would be something like: > > import sys > try: > 1/0 > except: > t, v, tb = sys.exc_info() > tb.clean() > # ... much later ... > raise t, v, tb > > Which would be basically a function to do this: > > import sys > try: > 1/0 > except: > t, v, tb = sys.exc_info() > c = tb > while c: > c.tb_frame.f_locals = None > c.tb_frame.f_globals = None > c = c.tb_next > # ... much later ... > raise t, v, tb > > Twisted has done a very similar thing with their > twisted.python.failure.Failure object, which stringifies the traceback > data and discards the reference to the Python traceback entirely ( > http://twistedmatrix.com/trac/browser/tags/releases/twisted-10.0.0/twisted/python/failure.py#L437 > ) - they also replicate a lot of traceback printing functions to make > use of this stringified data. > > It's worth noting that cgitb and other applications make use of locals > and globals in their traceback output. However, I believe the vast > majority of traceback usage does not make use of these references, and > a significant penalty is paid as a result. > > Is there any interest in such a feature? > > -Greg -- --Guido van Rossum (python.org/~guido) From stephen at xemacs.org Sat Jun 26 04:58:35 2010 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Sat, 26 Jun 2010 11:58:35 +0900 Subject: [Python-ideas] explicitation lines in python ?
In-Reply-To: <4C24FEAF.4030304@gmail.com> References: <4C24FEAF.4030304@gmail.com> Message-ID: <87pqzemrdw.fsf@uwakimon.sk.tsukuba.ac.jp> Daniel DELAY writes: > (Sorry if this has already been discussed earlier on this list, I have > not read all the archives) I think if you search for "first-class blocks" and "lambdas", or similar, you'll find related discussion (although not exactly the same thing). It also looks very similar to the Haskell "where", maybe searching for "Haskell where" would bring it up. > Renouncing list comprehensions occurs rather often when I write python > code > > I think we could greatly improve readability if we could keep list > comprehension anywhere in all cases, but when necessary explicit a too > complex expression in an indented line : > > htmltable = ''.join( '<tr>{}</tr>'.format(htmlline) for line in table): > htmlline : ''.join( '<td>{}</td>'.format(cell) for cell in line) (Edited for readability; it was munged by your mail client. ;-) I'm not sure I like this better than the alternative of rewriting the outer loops explicitly. But if you're going to add syntax, I think the more verbose htmltable = ''.join('<tr>{}</tr>'.format(htmlline) for line in table) \ with htmlline = ''.join('<td>{}</td>'.format(cell) for cell in line) looks better. Note that the "with" clause becomes an optional part of an assignment statement rather than a suite controlled by the assignment, and the indentation is decorative rather than syntactic. I considered "as" instead of "=" in the with clause, but preferred the "=" because that allows nested "with" in a natural way. (Maybe, I haven't thought carefully about that at all.) Obviously "with" was chosen because it's already a keyword. I suspect this has been shot down before, though. From ncoghlan at gmail.com Sat Jun 26 05:14:31 2010 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 26 Jun 2010 13:14:31 +1000 Subject: [Python-ideas] explicitation lines in python ? In-Reply-To: <87pqzemrdw.fsf@uwakimon.sk.tsukuba.ac.jp> References: <4C24FEAF.4030304@gmail.com> <87pqzemrdw.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: > I suspect this has been shot down before, though. Not so much shot down, as "never found a syntax and semantics that were sufficiently clear". Looking up 'statement local namespaces' for Python brings up some old discussions of the idea: http://www.mail-archive.com/python-list at python.org/msg07034.html (the 'new' compiler in that message is the AST compiler adopted in Python 2.5. The 'nonlocal' keyword now gives us more options in deciding how to handle assignment statements) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From pyideas at rebertia.com Sat Jun 26 05:26:58 2010 From: pyideas at rebertia.com (Chris Rebert) Date: Fri, 25 Jun 2010 20:26:58 -0700 Subject: [Python-ideas] explicitation lines in python ? In-Reply-To: <4C24FEAF.4030304@gmail.com> References: <4C24FEAF.4030304@gmail.com> Message-ID: On Fri, Jun 25, 2010 at 12:08 PM, Daniel DELAY wrote: > If we could explicitate That's not a word. > a too complex expression in an indented next line, > I would use this feature very often : > > htmltable = ''.join( '<tr>{}</tr>'.format(htmlline) for line in table) : > # main line > htmlline : ''.join( '<td>{}</td>'.format(cell) for cell in line) # > explicitation(s) line(s) Again, not a word, and not a great name for this either, IMO. > in details :
> > List comprehension "[<expression> for x in mylist]" often greatly improves > readability of python programs, when <expression> is not too complex. > When <expression> is too complex (ex: nested lists), this becomes unreadable, > so we have to find another solution : > a) defining a function expression(x), or an iterator function, which will > only be used once in the code > b) or dropping this beautiful syntax to replace it with the very basic list > construction : > newlist = [] > for x in myiterable : > newlist.append(<expression>) > > I often choose b), but I dislike both solutions : > - in solution a) the function def can be far from the list comprehension; in fact > the instructions to build the new list are split in two different places in the > code. What do you mean, you can put them right next to each other, and even better, give the expression a meaningful name: def line2html(line): return ''.join( '<td>{}</td>'.format(cell) for cell in line) htmltable = ''.join( '<tr>{}</tr>'.format(line2html(line)) for line in table) > - solution b) seems a bit better to me, I'm gonna disagree with you there, but it is a somewhat subjective stylistic issue. > but the fact that we build a new list > from myiterable is not visible at a glance, unlike list comprehensions. > > Renouncing list comprehensions occurs rather often when I write python > code > > I think we could greatly improve readability if we could keep list > comprehension anywhere in all cases, but when necessary explicit a too > complex expression in an indented line : > > htmltable = ''.join( '<tr>{}</tr>'.format(htmlline) for line in table) : > # main line > htmlline : ''.join( '<td>{}</td>'.format(cell) for cell in line) # > explicitation(s) line(s) > > In the case the main line is the header of a "classical" indented block > (starting with "for", "if", "with"...), this indented block would simply > follow the explicitation(s) line(s). > The explicitation lines can be surely identified as the lines that begin > with "identifier :" (when we are not in an unclosed dict) > > with open('data.txt') as f : > if line in enumerate(mylist) : # main line > mylist : f.read().strip().lower() # explicitation(s) line(s) > print line # "classical" indented block > > Another possible use of "explicitation lines" is a coding style which would > start with "the whole picture" first, completing with details after, which > is the usual way we mentally solve problems. In other words, "where" clauses, à la Haskell (see http://www.haskell.org/tutorial/patterns.html section 4.5); just tweak the syntax from expr_involving_bar : bar : expr to expr_involving_bar where: bar = expr which avoids overloading colons further (i.e. constipation ;-P) and the equals sign makes more sense anyway. Having used Haskell a little bit, I can say "where" clauses can indeed make some code easier to read. However, adding them to an imperative language like Python is more problematic, since order of evaluation matters and it complicates the flow of control by causing it to go backward in "where" clauses. Basically, I don't see the problem with solution "(a)"; your general idea isn't without merit though. Cheers, Chris -- http://blog.rebertia.com From pyideas at rebertia.com Sat Jun 26 05:55:54 2010 From: pyideas at rebertia.com (Chris Rebert) Date: Fri, 25 Jun 2010 20:55:54 -0700 Subject: [Python-ideas] explicitation lines in python ? In-Reply-To: <87pqzemrdw.fsf@uwakimon.sk.tsukuba.ac.jp> References: <4C24FEAF.4030304@gmail.com> <87pqzemrdw.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Fri, Jun 25, 2010 at 7:58 PM, Stephen J.
Turnbull wrote: > Daniel DELAY writes: > > ?> (Sorry if this has already been discussed earlier on this list, I have > ?> not read all the archives) > > I think if you search for "first-class blocks" and "lambdas", or > similar, you'll find related discussion (although not exactly the same > thing). ?It also looks very similar to the Haskell "where", maybe > searching for "Haskell where" would bring it up. > > ?> Renouncing to list comprehension occurs rather often when I write python > ?> code > ?> > ?> I think we could greatly improve readability if we could keep list > ?> comprehension anywhere in all cases, but when necessary explicit a too > ?> complex expression in an indented line : > ?> > ?> htmltable = ''.join( '{}'.format(htmlline) for line in table): > ?> ? ? htmlline : ''.join( '{}'.format(cell) for cell in line) > > (Edited for readability; it was munged by your mail client. ;-) > > I'm not sure I like this better than the alternative of rewriting the > outer loops explicitly. ?But if you're going to add syntax, I think > the more verbose > > ? ?htmltable = ''.join('{}'.format(htmlline) for line in table) \ > ? ? ? ?with htmlline = ''.join('{}'.format(cell) for cell in line) > > looks better. ?Note that the "with" clause becomes an optional part of > an assignment statement rather than a suite controlled by the > assignment, and the indentation is decorative rather than syntactic. > I considered "as" instead of "=" in the with clause, but preferred the > "=" because that allows nested "with" in a natural way. ?(Maybe, I > haven't thought carefully about that at all.) ?Obviously "with" was > chosen because it's already a keyword. > > I suspect this has been shot down before, though. Prior thread: [Python-ideas] Where-statement (Proposal for function expressions) http://mail.python.org/pipermail/python-ideas/2009-July/005114.html There certainly was criticism: http://mail.python.org/pipermail/python-ideas/2009-July/005213.html However, the BDFL seemed receptive: http://mail.python.org/pipermail/python-ideas/2009-July/005299.html Cheers, Chris -- http://blog.rebertia.com From ghazel at gmail.com Sat Jun 26 07:10:24 2010 From: ghazel at gmail.com (ghazel at gmail.com) Date: Fri, 25 Jun 2010 22:10:24 -0700 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: References: Message-ID: Well, I discovered this property of traceback objects when a real-world server of mine began eating all the memory on the server. To me, this is the most convincing reason to address the issue. I'm not sure what sort of profiling you're looking for, but I have since then produced a contrived example which demonstrates a serious memory consumption difference with a very short traceback object lifetime: http://codepad.org/F23cwezb If you run the test with "s.e = sys.exc_info()" commented out, the observed memory footprint of the process quickly approaches and sits at 5,677,056 bytes. Totally reasonable. If you uncomment that line, the memory footprint climbs to 283,316,224 bytes quite rapidly. That's a two order of magnitude difference! If you uncomment the "gc.collect()" line, the process still hits 148,910,080 bytes. -Greg On Fri, Jun 25, 2010 at 7:58 PM, Guido van Rossum wrote: > Do you have profiling data to support your claim? > > On Fri, Jun 25, 2010 at 7:48 PM, ? wrote: >> Hi, >> >> I'm interested in a feature which allows users to discard the locals >> and globals references from frames held by a traceback object. 
>> >> Currently, traceback objects are used when capturing and re-raising >> exceptions. However, they hold a reference to all frames, which hold a >> reference to their locals and globals. These are not needed by the >> default traceback output, and can cause serious memory bloat if a >> reference to a traceback object is kept for any significant length of >> time, and there are even big red warnings in the Python docs about >> using them in one frame. ( >> http://docs.python.org/release/3.1/library/sys.html#sys.exc_info ). >> >> Example usage would be something like: >> >> import sys >> try: >> ? ?1/0 >> except: >> ? ?t, v, tb = sys.exc_info() >> ? ?tb.clean() >> # ... much later ... >> raise t, v, tb >> >> >> Which would be basically a function to do this: >> >> import sys >> try: >> ? ?1/0 >> except: >> ? ?t, v, tb = sys.exc_info() >> ? ?c = tb >> ? ?while c: >> ? ? ? ?c.tb_frame.f_locals = None >> ? ? ? ?c.tb_frame.f_globals = None >> ? ? ? ?c = c.tb_next >> # ... much later ... >> raise t, v, tb >> >> >> Twisted has done a very similar thing with their >> twisted.python.failure.Failure object, which stringifies the traceback >> data and discards the reference to the Python traceback entirely ( >> http://twistedmatrix.com/trac/browser/tags/releases/twisted-10.0.0/twisted/python/failure.py#L437 >> ) - they also replicate a lot of traceback printing functions to make >> use of this stringified data. >> >> It's worth noting that cgitb and other applications make use of locals >> and globals in its traceback output. However, I believe the vast >> majority of traceback usage does not make use of these references, and >> a significant penalty is paid as a result. >> >> Is there any interest in such a feature? >> >> >> -Greg >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas >> > > > > -- > --Guido van Rossum (python.org/~guido) > From debatem1 at gmail.com Sat Jun 26 08:48:30 2010 From: debatem1 at gmail.com (geremy condra) Date: Sat, 26 Jun 2010 02:48:30 -0400 Subject: [Python-ideas] explicitation lines in python ? In-Reply-To: References: <4C24FEAF.4030304@gmail.com> <87pqzemrdw.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Fri, Jun 25, 2010 at 11:55 PM, Chris Rebert wrote: > On Fri, Jun 25, 2010 at 7:58 PM, Stephen J. Turnbull wrote: >> Daniel DELAY writes: >> >> ?> (Sorry if this has already been discussed earlier on this list, I have >> ?> not read all the archives) >> >> I think if you search for "first-class blocks" and "lambdas", or >> similar, you'll find related discussion (although not exactly the same >> thing). ?It also looks very similar to the Haskell "where", maybe >> searching for "Haskell where" would bring it up. >> >> ?> Renouncing to list comprehension occurs rather often when I write python >> ?> code >> ?> >> ?> I think we could greatly improve readability if we could keep list >> ?> comprehension anywhere in all cases, but when necessary explicit a too >> ?> complex expression in an indented line : >> ?> >> ?> htmltable = ''.join( '{}'.format(htmlline) for line in table): >> ?> ? ? htmlline : ''.join( '{}'.format(cell) for cell in line) >> >> (Edited for readability; it was munged by your mail client. ;-) >> >> I'm not sure I like this better than the alternative of rewriting the >> outer loops explicitly. ?But if you're going to add syntax, I think >> the more verbose >> >> ? ?htmltable = ''.join('{}'.format(htmlline) for line in table) \ >> ? ? ? 
>>         with htmlline = ''.join('<td>{}</td>'.format(cell) for cell in line)
>>
>> looks better. Note that the "with" clause becomes an optional part of
>> an assignment statement rather than a suite controlled by the
>> assignment, and the indentation is decorative rather than syntactic.
>> I considered "as" instead of "=" in the with clause, but preferred the
>> "=" because that allows nested "with" in a natural way. (Maybe, I
>> haven't thought carefully about that at all.) Obviously "with" was
>> chosen because it's already a keyword.
>>
>> I suspect this has been shot down before, though.
>
> Prior thread:
> [Python-ideas] Where-statement (Proposal for function expressions)
> http://mail.python.org/pipermail/python-ideas/2009-July/005114.html

I was all set to dislike this syntax, but after reading over it a bit I
find myself liking it a lot. Was the code for this (or similar) ever
written, or was it just proposed?

Geremy Condra

From bruce at leapyear.org Sat Jun 26 09:43:19 2010
From: bruce at leapyear.org (Bruce Leban)
Date: Sat, 26 Jun 2010 00:43:19 -0700
Subject: [Python-ideas] explicitation lines in python ?
In-Reply-To: References: <4C24FEAF.4030304@gmail.com> <87pqzemrdw.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: 

I really dislike the idea that when I read an expression I'd have to scan
to the end of the statement to figure out if it's a forward or backward
reference. Is this

    def foo(a, b):
        return x * y:
            x : a + b
            y : a - b

really significantly better than:

    def foo(a, b):
        x = lambda: a + b
        y = lambda: a - b
        return x() * y()

Note that when I see the () there's an explicit marker that x and y are not
simple variables so personally I wouldn't want to "save" those few
characters. So really what you're doing is allowing me to put these in a
different order and saving 7 characters. But I can reorder them easily
enough if I want to:

    def foo(a, b):
        result = lambda: x() * y()
        x = lambda: a + b
        y = lambda: a - b
        return result()

--- Bruce
http://www.vroospeak.com
http://jarlsberg.appspot.com

On Fri, Jun 25, 2010 at 11:48 PM, geremy condra wrote:
> [full quote of the previous message snipped]

From greg.ewing at canterbury.ac.nz Sat Jun 26 10:23:52 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sat, 26 Jun 2010 20:23:52 +1200
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
In-Reply-To: References: Message-ID: <4C25B918.8010307@canterbury.ac.nz>

ghazel at gmail.com wrote:

> I'm interested in a feature which allows users to discard the locals
> and globals references from frames held by a traceback object.

I'd like to take this further and remove the need for
traceback objects to refer to a frame object at all.
The standard traceback printout only needs two pieces of
information from the traceback, the file name and line
number.

The line number is already present in the traceback
object. All it would take is the addition of a file name
attribute to the traceback object, and the frame reference
could be made optional.

This would be a big help for Pyrex and Cython, which
currently have to create entire dummy frame objects in
order to add entries to the traceback. Not only is this
tedious and inefficient, it ties them to internal details
of the frame object that are vulnerable to change.

It would be much nicer to have a simple API function
such as PyTraceback_AddEntry(filename, lineno) to add
a frameless traceback object.

-- 
Greg

From solipsis at pitrou.net Sat Jun 26 12:04:35 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sat, 26 Jun 2010 12:04:35 +0200
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
References: <4C25B918.8010307@canterbury.ac.nz>
Message-ID: <20100626120435.7012847e@pitrou.net>

On Sat, 26 Jun 2010 20:23:52 +1200
Greg Ewing wrote:
> ghazel at gmail.com wrote:
>
> > I'm interested in a feature which allows users to discard the locals
> > and globals references from frames held by a traceback object.
>
> I'd like to take this further and remove the need for
> traceback objects to refer to a frame object at all.
> The standard traceback printout only needs two pieces of
> information from the traceback, the file name and line
> number.

Both ideas seem reasonable, but they need a concrete proposal and/or a
patch.

Regards

Antoine.

From mal at egenix.com Sat Jun 26 13:03:38 2010
From: mal at egenix.com (M.-A. Lemburg)
Date: Sat, 26 Jun 2010 13:03:38 +0200
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
In-Reply-To: <4C25B918.8010307@canterbury.ac.nz>
References: <4C25B918.8010307@canterbury.ac.nz>
Message-ID: <4C25DE8A.1030209@egenix.com>

Greg Ewing wrote:
> ghazel at gmail.com wrote:
>
>> I'm interested in a feature which allows users to discard the locals
>> and globals references from frames held by a traceback object.

Wouldn't it be better to write safer code and not store
a reference to the traceback object in the first place ?

Working with traceback objects can easily introduce hidden
circular references, so it is usually better not to access them
at all, if you don't have a need for them:

Either like this:

    try:
        raise Exception
    except Exception, reason:
        pass

or by using slicing:

    try:
        raise Exception
    except Exception, reason:
        errorclass, errorobject = sys.exc_info()[:2]
        pass

If you do need to access them, make sure you clean up the
reference as soon as you can:

    try:
        raise Exception
    except Exception, reason:
        errorclass, errorobject, tb = sys.exc_info()
        ...
        tb = None

> I'd like to take this further and remove the need for
> traceback objects to refer to a frame object at all.
> The standard traceback printout only needs two pieces of
> information from the traceback, the file name and line
> number.
>
> The line number is already present in the traceback
> object. All it would take is the addition of a file name
> attribute to the traceback object, and the frame reference
> could be made optional.

How would you make that reference optional ?

The frames are needed to inspect the locals and globals of the
call stack and debugging code relies on them being available.

Also: What's the use case for creating traceback objects
outside the Python interpreter core ?

-- 
Marc-Andre Lemburg
eGenix.com

Professional Python Services directly from the Source  (#1, Jun 26 2010)
>>> Python/Zope Consulting and Support ...        http://www.egenix.com/
>>> mxODBC.Zope.Database.Adapter ...             http://zope.egenix.com/
>>> mxODBC, mxDateTime, mxTextTools ...        http://python.egenix.com/
________________________________________________________________________
2010-07-19: EuroPython 2010, Birmingham, UK                22 days to go

::: Try our new mxODBC.Connect Python Database Interface for free ! ::::

   eGenix.com Software, Skills and Services GmbH  Pastor-Loeh-Str.48
    D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg
           Registered at Amtsgericht Duesseldorf: HRB 46611
               http://www.egenix.com/company/contact/

From ghazel at gmail.com Sat Jun 26 13:35:31 2010
From: ghazel at gmail.com (ghazel at gmail.com)
Date: Sat, 26 Jun 2010 04:35:31 -0700
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
In-Reply-To: <4C25DE8A.1030209@egenix.com>
References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com>
Message-ID: 

On Sat, Jun 26, 2010 at 4:03 AM, M.-A. Lemburg wrote:
> Greg Ewing wrote:
>> I'd like to take this further and remove the need for
>> traceback objects to refer to a frame object at all.
>> The standard traceback printout only needs two pieces of
>> information from the traceback, the file name and line
>> number.

First off, Greg Ewing's idea fully covers my use case and may even
simplify implementation, so I'm in favor of it. I have never used (and
very much question the use of) references to locals and globals.
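The usual workaround today is to snapshot just the printable
information and throw the real traceback away - roughly like this
(an untested sketch; it gives up the ability to re-raise with the
original traceback, which is the whole problem):

    import sys
    import traceback

    def capture_light_traceback():
        t, v, tb = sys.exc_info()
        # extract_tb() copies out (filename, lineno, funcname, source_line)
        # tuples, so the whole frame chain can be released immediately
        stack = traceback.extract_tb(tb)
        del tb
        return t, v, stack

    def format_light_traceback(t, v, stack):
        lines = ['Traceback (most recent call last):\n']
        lines += traceback.format_list(stack)
        lines += traceback.format_exception_only(t, v)
        return ''.join(lines)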
Having some backwards-compatible way to avoid ever having to deal with
them would be preferable.

On Sat, Jun 26, 2010 at 4:03 AM, M.-A. Lemburg wrote:
> Greg Ewing wrote:
>> ghazel at gmail.com wrote:
>>
>>> I'm interested in a feature which allows users to discard the locals
>>> and globals references from frames held by a traceback object.
>
> Wouldn't it be better to write safer code and not store
> a reference to the traceback object in the first place ?
>
> Working with traceback objects can easily introduce hidden
> circular references, so it is usually better not to access them
> at all, if you don't have a need for them:

Those are strong words against using traceback objects. This feature
idea is about creating a way to make traceback objects usable without
the gotcha you're referencing.

-Greg

From solipsis at pitrou.net Sat Jun 26 14:00:31 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sat, 26 Jun 2010 14:00:31 +0200
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com>
Message-ID: <20100626140031.6adff16e@pitrou.net>

On Sat, 26 Jun 2010 13:03:38 +0200
"M.-A. Lemburg" wrote:
> Greg Ewing wrote:
> > ghazel at gmail.com wrote:
> >
> >> I'm interested in a feature which allows users to discard the locals
> >> and globals references from frames held by a traceback object.
>
> Wouldn't it be better to write safer code and not store
> a reference to the traceback object in the first place ?

In Python 3, tracebacks are stored as an attribute of the corresponding
exception:

>>> try: 1/0
... except Exception as _: e = _
...
>>> e.__traceback__
<traceback object at 0x...>

> Also: What's the use case for creating traceback objects
> outside the Python interpreter core ?

He's not talking about creating traceback objects outside the core, but
being able to reuse tracebacks created by the core without keeping alive
a whole chain of objects.

It's a real need when you want to do careful error handling/reporting
without wasting too many resources. As already mentioned, Twisted has a
bunch of code to work around that problem, since errors can be quite
long-lived in a pipelined asynchronous execution model.

Antoine.

From alexander.belopolsky at gmail.com Sat Jun 26 17:34:43 2010
From: alexander.belopolsky at gmail.com (Alexander Belopolsky)
Date: Sat, 26 Jun 2010 11:34:43 -0400
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
In-Reply-To: <4C25B918.8010307@canterbury.ac.nz>
References: <4C25B918.8010307@canterbury.ac.nz>
Message-ID: 

On Sat, Jun 26, 2010 at 4:23 AM, Greg Ewing wrote:
> ghazel at gmail.com wrote: ..
> I'd like to take this further and remove the need for
> traceback objects to refer to a frame object at all.
> The standard traceback printout only needs two pieces of
> information from the traceback, the file name and line
> number.

Wouldn't that make it impossible to do postmortem analysis in pdb?

From mal at egenix.com Sat Jun 26 23:53:04 2010
From: mal at egenix.com (M.-A. Lemburg)
Date: Sat, 26 Jun 2010 23:53:04 +0200
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
In-Reply-To: <20100626140031.6adff16e@pitrou.net>
References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net>
Message-ID: <4C2676C0.3000407@egenix.com>

Antoine Pitrou wrote:
> On Sat, 26 Jun 2010 13:03:38 +0200
> "M.-A.
Lemburg" wrote: >> Greg Ewing wrote: >>> ghazel at gmail.com wrote: >>> >>>> I'm interested in a feature which allows users to discard the locals >>>> and globals references from frames held by a traceback object. >> >> Wouldn't it be better to write safer code and not store >> a reference to the traceback object in the first place ? > > In Python 3, tracebacks are stored as an attribute of the corresponding > exception: > >>>> try: 1/0 > ... except Exception as _: e = _ > ... >>>> e.__traceback__ > Ouch. So you explicitly need get rid off the traceback in Python3 if you want to avoid keeping the associated objects alive during exception processing ? I think that design decision needs to be revisited. Tracebacks are needed for error reporting, but (normally) not for managing error handling or recovery. E.g. it is not uncommon to store exception objects in a list for later batched error reporting. With the traceback being referenced on those object and the traceback chain keeping references to all frames alive, this kind of processing won't be feasible anymore. What's even more important is that programmers are unlikely going to be aware of this detail and its implications. >> Also: What's the use case for creating traceback objects >> outside the Python interpreter core ? > > He's not talking about creating traceback objects outside the core, but > being able to reuse tracebacks created by the core without keeping alive > a whole chain of objects. With the question I was referring to the suggestion by Greg Ewing in which he seemed to imply that Pyrex and Cython create traceback objects. > It's a real need when you want to do careful error handling/reporting > without wasting too many resources. As already mentioned, Twisted has a > bunch of code to work around that problem, since errors can be quite > long-lived in a pipelined asynchronous execution model. With the above detail, I completely agree. In fact, more than that: I think we should make storing the traceback in exception.__traceback__ optional and not the default, much like .__context__ and .__cause__. -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Jun 26 2010) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2010-07-19: EuroPython 2010, Birmingham, UK 22 days to go ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From guido at python.org Sun Jun 27 01:05:17 2010 From: guido at python.org (Guido van Rossum) Date: Sat, 26 Jun 2010 16:05:17 -0700 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: <4C2676C0.3000407@egenix.com> References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> Message-ID: Please don't act so surprised. There are about 4 relevant PEPs: 344, 3109, 3110, 3134 (the latter replacing 344). 
Also note that the
traceback is only kept alive if the exception object is explicitly
copied out of the except block that caught it -- normally the
exception object is deleted when that block is left.

--Guido

On Sat, Jun 26, 2010 at 2:53 PM, M.-A. Lemburg wrote:
> [full quote of the previous message snipped]

-- 
--Guido van Rossum (python.org/~guido)

From mal at egenix.com Sun Jun 27 01:45:48 2010
From: mal at egenix.com (M.-A. Lemburg)
Date: Sun, 27 Jun 2010 01:45:48 +0200
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
In-Reply-To: References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com>
Message-ID: <4C26912C.8010709@egenix.com>

Guido van Rossum wrote:
> Please don't act so surprised. There are about 4 relevant PEPs: 344,
> 3109, 3110, 3134 (the latter replacing 344).

I knew about the discussions around chained exceptions. I wasn't
aware of the idea to keep a traceback object on the exception object
itself.

PEP 3134 also mentioned the case we're currently discussing:

"""
Open Issue: Garbage Collection

The strongest objection to this proposal has been that it creates
cycles between exceptions and stack frames [12]. Collection of cyclic
garbage (and therefore resource release) can be greatly delayed.

>>> try:
>>>     1/0
>>> except Exception, err:
>>>     pass

will introduce a cycle from err -> traceback -> stack frame -> err,
keeping all locals in the same scope alive until the next GC happens.

Today, these locals would go out of scope. There is lots of code which
assumes that "local" resources -- particularly open files -- will be
closed quickly. If closure has to wait for the next GC, a program
(which runs fine today) may run out of file handles.

Making the __traceback__ attribute a weak reference would avoid the
problems with cyclic garbage. Unfortunately, it would make saving
the Exception for later (as unittest does) more awkward, and it would
not allow as much cleanup of the sys module.

A possible alternate solution, suggested by Adam Olsen, would be to
instead turn the reference from the stack frame to the 'err' variable
into a weak reference when the variable goes out of scope [13].
"""

So obviously this case had already been discussed before. Was a
solution found and implemented that addresses the problem ?

> Also note that the
> traceback is only kept alive if the exception object is explicitly
> copied out of the except block that caught it -- normally the
> exception object is deleted when that block is left.

Right, but only if you do not use the exception object for
other purposes elsewhere. If you do that a lot in your application,
it appears that the only way around keeping lots of traceback objects
alive is by explicitly setting .__traceback__ to None before storing
away the exception object.

Think of e.g. an application that does a long running calculation.
Such applications typically want to continue processing even in case
of errors and report all errors at the end of the run. If a programmer
is unaware of the traceback issue, he'd likely run into a memory
problem without really knowing where to look for the cause.

Also note that garbage collection will not necessarily do what
the user expects: it is well possible that big amounts of
memory will stay allocated as unused space in pymalloc.
This is not specific to the discussed case, but still a valid
user concern. Greg Hazel observed this situation in his
example.

> --Guido
>
> [rest of quoted message snipped]

-- 
Marc-Andre Lemburg
eGenix.com
From ncoghlan at gmail.com Sun Jun 27 02:21:41 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 27 Jun 2010 10:21:41 +1000
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
In-Reply-To: <4C2676C0.3000407@egenix.com>
References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com>
Message-ID: 

On Sun, Jun 27, 2010 at 7:53 AM, M.-A. Lemburg wrote:
>> He's not talking about creating traceback objects outside the core, but
>> being able to reuse tracebacks created by the core without keeping alive
>> a whole chain of objects.
>
> With the question I was referring to the suggestion by
> Greg Ewing in which he seemed to imply that Pyrex and Cython
> create traceback objects.

When Python code calls into Pyrex/C code which then calls back into
Python, I understand they insert dummy frames into the tracebacks to
make the call stack more complete.

Cheers,
Nick.

-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From danieldelay at gmail.com Sun Jun 27 08:45:35 2010
From: danieldelay at gmail.com (Daniel DELAY)
Date: Sun, 27 Jun 2010 08:45:35 +0200
Subject: [Python-ideas] explicitation lines in python ?
In-Reply-To: References: <4C24FEAF.4030304@gmail.com> <4C256505.9000703@gmail.com>
Message-ID: <4C26F38F.4080605@gmail.com>

Le 26/06/2010 04:36, Guido van Rossum a écrit :
> I don't know where you got the word "explicitation" -- I've never
> heard of it. (Maybe it's French? You sound French. :-) However, this
> feature existed in ABC under the name "refinement". See
> http://homepages.cwi.nl/~steven/abc/qr.html#Refinements
>

You guessed well: I'm French and a bad English speaker :-( ;
"explicitation" was a mistaken translation. From now on I'll use the
term "refinement" from ABC, as this language inspired Python.

Thanks for the link. It's a bit difficult for me to figure out
precisely how this feature works in ABC; this quick reference lacks
examples to illustrate each concept and it seems difficult to find
more documentation. But I'm happy to see this feature has been
implemented in a few languages.

Cheers

-- 
Daniel

From danieldelay at gmail.com Sun Jun 27 08:52:48 2010
From: danieldelay at gmail.com (Daniel DELAY)
Date: Sun, 27 Jun 2010 08:52:48 +0200
Subject: [Python-ideas] explicitation lines in python ?
In-Reply-To: <87pqzemrdw.fsf@uwakimon.sk.tsukuba.ac.jp>
References: <4C24FEAF.4030304@gmail.com> <87pqzemrdw.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: <4C26F540.3050101@gmail.com>

Le 26/06/2010 04:58, Stephen J. Turnbull a écrit :
> the more verbose
>
>     htmltable = ''.join('<tr>{}</tr>'.format(htmlline) for line in table) \
>         with htmlline = ''.join('<td>{}</td>'.format(cell) for cell in line)
>
> looks better. Note that the "with" clause becomes an optional part of
> an assignment statement rather than a suite controlled by the
> assignment, and the indentation is decorative rather than syntactic.
>
This syntax on one line is interesting if we see "refinement" as a way
to make a too-long line more readable.
But I'm not sure whether this syntax is compatible with nesting
different levels of refinement in a recursive way, as I did in an
example.

Using "with" as an optional part of assignment seems to me rather
restrictive, as overly complex expressions may appear anywhere an
expression is involved, not only where expressions are assigned to a
variable with "=".

Cheers,

Daniel

From danieldelay at gmail.com Sun Jun 27 09:00:32 2010
From: danieldelay at gmail.com (Daniel DELAY)
Date: Sun, 27 Jun 2010 09:00:32 +0200
Subject: [Python-ideas] explicitation lines in python ?
In-Reply-To: References: <4C24FEAF.4030304@gmail.com>
Message-ID: <4C26F710.4030902@gmail.com>

Le 26/06/2010 05:26, Chris Rebert a écrit :
>> If we could explicitate
>
> That's not a word.

Sorry, that was a wrong translation of the French word "expliciter"
(http://www.wordreference.com/fren/expliciter), which means something
like to express or to describe in more detail.

>> explicitation(s) line(s)
>
> Again, not a word, and not a great name for this either, IMO.

Yes, perhaps "refinement line" would be better, as GVR noticed this
feature is named "refinement" in ABC.

> What do you mean, you can put them right next to each other, and even
> better, give the expression a meaningful name:
>
> def line2html(line):
>     return ''.join( '<td>{}</td>'.format(cell) for cell in line)
> htmltable = ''.join( '<tr>{}</tr>'.format(line2html(line)) for line in table)

Yes, that's a solution if this piece of code is run only once.
But if this piece of code is in a function (or a loop), your function
line2html will be redefined for each function call (or iteration),
which is something I usually try to avoid.

> In other words, "where" clauses, à la Haskell (see
> http://www.haskell.org/tutorial/patterns.html section 4.5); just tweak
> the syntax from
>
> expr_involving_bar :
>     bar : expr
>
> to
>
> expr_involving_bar where:
>     bar = expr

Oh yes, that seems to be the same feature.

> and the equals sign makes more sense anyway.

The equals sign makes sense if you see that feature as a variable
assignment, but this would make it more difficult to distinguish
refinement lines from classical indented lines. I used ":" because I
see it more as something equivalent to a {key: value} substitution
which would be done before execution.

In fact there are several options for how refinement could work:

a) refinement as a code substitution: In this option, refinement is
just syntactic sugar. The refinement name ("htmlline" in my example)
is replaced with the refinement expression it means before execution
of the Python code. As "htmlline" disappears from the code at the time
of execution, it will not populate locals().

b) refinement as variable assignment: In this option, the values of
htmlline are really stored in a local variable "htmlline", which will
remain in locals() after execution of this line.

As "htmlline" is not intended to be used elsewhere in the code, I
would probably prefer option a), but the pure substitution option has
a disadvantage: when the refinement name "htmlline" appears twice or
more in the main line, the same expression is evaluated twice or more,
which is probably not what we want. That's why I would in fact prefer
a third option:

c) a refinement expression is only evaluated once even if the
refinement name (ex: "htmlline") appears twice or more, but that name
is not published in locals().
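Roughly, option c) would behave like this emulation with a throwaway
helper function, where "htmlline" is computed once per row and never
appears in the caller's locals() (just an illustrative sketch, not a
proposed implementation; the sample table is invented):

    table = [['a', 'b'], ['c', 'd']]   # sample data

    def _refined(table):
        def row(line):
            # "htmlline" lives only inside row(): evaluated once per
            # use, invisible to the caller's namespace
            htmlline = ''.join('<td>{}</td>'.format(cell) for cell in line)
            return '<tr>{}</tr>'.format(htmlline)
        return ''.join(row(line) for line in table)

    htmltable = _refined(table)
    # '<tr><td>a</td><td>b</td></tr><tr><td>c</td><td>d</td></tr>'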
Cheers

-- 
Daniel

From pyideas at rebertia.com Sun Jun 27 09:45:47 2010
From: pyideas at rebertia.com (Chris Rebert)
Date: Sun, 27 Jun 2010 00:45:47 -0700
Subject: [Python-ideas] explicitation lines in python ?
In-Reply-To: <4C26F710.4030902@gmail.com>
References: <4C24FEAF.4030304@gmail.com> <4C26F710.4030902@gmail.com>
Message-ID: 

On Sun, Jun 27, 2010 at 12:00 AM, Daniel DELAY wrote:
> Le 26/06/2010 05:26, Chris Rebert a écrit :
>> What do you mean, you can put them right next to each other, and even
>> better, give the expression a meaningful name:
>>
>> def line2html(line):
>>     return ''.join( '<td>{}</td>'.format(cell) for cell in line)
>> htmltable = ''.join( '<tr>{}</tr>'.format(line2html(line)) for line in
>> table)
>
> Yes, that's a solution if this piece of code is run only once.
> But if this piece of code is in a function (or a loop), your function
> line2html will be redefined for each function call (or iteration), which is
> something I usually try to avoid.

Yes, but Premature Optimization is The Root of All Evil, and you can
always define line2html() at the module level; this trades distance
for speed, but if you choose a good descriptive name, it should still
be plenty clear.

Cheers,
Chris
--
Again, "where" clauses are certainly an intriguing idea.
http://blog.rebertia.com

From stephen at xemacs.org Sun Jun 27 15:31:34 2010
From: stephen at xemacs.org (Stephen J. Turnbull)
Date: Sun, 27 Jun 2010 22:31:34 +0900
Subject: [Python-ideas] explicitation lines in python ?
In-Reply-To: <4C26F540.3050101@gmail.com>
References: <4C24FEAF.4030304@gmail.com> <87pqzemrdw.fsf@uwakimon.sk.tsukuba.ac.jp> <4C26F540.3050101@gmail.com>
Message-ID: <878w60mwjt.fsf@uwakimon.sk.tsukuba.ac.jp>

Daniel DELAY writes:
> Le 26/06/2010 04:58, Stephen J. Turnbull a écrit :
> > the more verbose
> >
> >     htmltable = ''.join('<tr>{}</tr>'.format(htmlline) for line in table) \
> >         with htmlline = ''.join('<td>{}</td>'.format(cell) for cell in line)
> >
> > looks better. Note that the "with" clause becomes an optional part of
> > an assignment statement rather than a suite controlled by the
> > assignment, and the indentation is decorative rather than syntactic.
>
> This syntax on one line is interesting if we see "refinement" as a way
> to make a too-long line more readable.
>
> But I'm not sure whether this syntax is compatible with nesting different
> levels of refinement in a recursive way, as I did in an example.

I'm not sure of all the corner cases myself, but it seems to me that
the above example could be extended to

    htmltable = ''.join(tr.format(htmlline) for line in table) \
        with tr = '<tr>{}</tr>', \
             htmlline = ''.join(td.format(cell) for cell in line) \
                 with td = '<td>{}</td>'

although it's not as prettily formatted as your examples.

> Using "with" as an optional part of assignment seems to me rather
> restrictive, as overly complex expressions may appear anywhere an
> expression is involved, not only where expressions are assigned to a
> variable with "=".

That was deliberate. If it's not an assignment, it's easy enough (and
preserves locality) to insert an assignment to a new variable on the
preceding line.
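For comparison, the nested "with" example above desugars into ordinary
assignments in today's Python (a sketch for illustration only; the
sample table is invented):

    table = [['a', 'b'], ['c', 'd']]

    tr = '<tr>{}</tr>'
    td = '<td>{}</td>'
    htmltable = ''.join(
        tr.format(''.join(td.format(cell) for cell in line))
        for line in table)
    # '<tr><td>a</td><td>b</td></tr><tr><td>c</td><td>d</td></tr>'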
From guido at python.org Sun Jun 27 17:33:23 2010 From: guido at python.org (Guido van Rossum) Date: Sun, 27 Jun 2010 08:33:23 -0700 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: <4C26912C.8010709@egenix.com> References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> Message-ID: On Sat, Jun 26, 2010 at 4:45 PM, M.-A. Lemburg wrote: > Also note that garbage collection will not necessarily do what > the user expects: it is well possible that big amounts of > memory will stay allocated as unused space in pymalloc. > This is not specific to the discussed case, but still a valid > user concern. Greg Hazel observed this situation in his > example. Aha. So whereas the process size ballooned, there is no actual memory leak (his example threw away the exception each time through the loop), it's just that looking at process size is a bad way to assess memory leaks. I would like to reject this then as "that's just how Python's memory allocation works". As you say, it's not specific to this case; it comes up occasionally and it's just a matter of user education. I don't think anything should be done about __traceback__ either -- frameworks that have this problem can work around it in various ways. Or, at least I don't see a reason to panic and roll back the feature. Maybe eventually it can be improved by adding some kind of functionality to control some details of the behavior. -- --Guido van Rossum (python.org/~guido) From benjamin at python.org Sun Jun 27 18:48:37 2010 From: benjamin at python.org (Benjamin Peterson) Date: Sun, 27 Jun 2010 16:48:37 +0000 (UTC) Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> Message-ID: M.-A. Lemburg writes: > With the above detail, I completely agree. In fact, more than that: > I think we should make storing the traceback in exception.__traceback__ > optional and not the default, much like .__context__ and .__cause__. I'm not sure why you consider __context__ non-default, since it is always automatically set when it applies. From ghazel at gmail.com Sun Jun 27 19:04:54 2010 From: ghazel at gmail.com (ghazel at gmail.com) Date: Sun, 27 Jun 2010 10:04:54 -0700 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> Message-ID: On Sun, Jun 27, 2010 at 8:33 AM, Guido van Rossum wrote: > On Sat, Jun 26, 2010 at 4:45 PM, M.-A. Lemburg wrote: >> Also note that garbage collection will not necessarily do what >> the user expects: it is well possible that big amounts of >> memory will stay allocated as unused space in pymalloc. >> This is not specific to the discussed case, but still a valid >> user concern. Greg Hazel observed this situation in his >> example. > > Aha. So whereas the process size ballooned, there is no actual memory > leak (his example threw away the exception each time through the > loop), it's just that looking at process size is a bad way to assess > memory leaks. 
I would like to reject this then as "that's just how > Python's memory allocation works". As you say, it's not specific to > this case; it comes up occasionally and it's just a matter of user > education. Leak? My example does not try to demonstrate a leak. It demonstrates excessive allocation. If you collect a few times after the test the memory usage of the process does drop to a reasonable level again. In a real-world application with long-lived traceback objects and more state, this excessive allocation becomes crippling. Go ahead, add a zero to the size of that list being created in the example. Without the traceback reference the process stays stable at 17MB, with the reference it balloons to consume all of the 2GB of RAM in my laptop, causing swapping. This is similar to the observed behavior of a real application, which is completely stable and requires relatively little memory when not using traceback objects, but quickly grows to an unmanageable size with traceback objects. > I don't think anything should be done about __traceback__ either -- > frameworks that have this problem can work around it in various ways. > Or, at least I don't see a reason to panic and roll back the feature. > Maybe eventually it can be improved by adding some kind of > functionality to control some details of the behavior. This idea is about an improvement to control some details of the behavior. Keeping __traceback__ in more cases would be nothing to "panic" about, if tracebacks were not such "unsafe" objects. I have not yet seen any way for a framework to work around the references issue without discarding the traceback object entirely and losing the ability to re-raise. -Greg From mal at egenix.com Sun Jun 27 19:20:07 2010 From: mal at egenix.com (M.-A. Lemburg) Date: Sun, 27 Jun 2010 19:20:07 +0200 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> Message-ID: <4C278847.5040600@egenix.com> Guido van Rossum wrote: > On Sat, Jun 26, 2010 at 4:45 PM, M.-A. Lemburg wrote: >> Also note that garbage collection will not necessarily do what >> the user expects: it is well possible that big amounts of >> memory will stay allocated as unused space in pymalloc. >> This is not specific to the discussed case, but still a valid >> user concern. Greg Hazel observed this situation in his >> example. > > Aha. So whereas the process size ballooned, there is no actual memory > leak (his example threw away the exception each time through the > loop), it's just that looking at process size is a bad way to assess > memory leaks. I would like to reject this then as "that's just how > Python's memory allocation works". As you say, it's not specific to > this case; it comes up occasionally and it's just a matter of user > education. pymalloc has gotten a lot better since it was fixed in Python 2.5 to return unused chunks of memory to the OS, but we still have the issue of fragmented arenas with cases of just a few bytes keeping 256kB (the size of an arena) allocated. > I don't think anything should be done about __traceback__ either -- > frameworks that have this problem can work around it in various ways. > Or, at least I don't see a reason to panic and roll back the feature. 
> Maybe eventually it can be improved by adding some kind of
> functionality to control some details of the behavior.

Not necessarily roll back the feature, but an implementation
that deliberately introduces circular references is not really
ideal.

Since tracebacks on exceptions are rarely used by applications,
I think it would be better to turn them into weak references.
The arguments against doing this in the PEP appear rather weak
compared to the potential issue for non-expert Python programmers:

"""
Making the __traceback__ attribute a weak reference would avoid the
problems with cyclic garbage. Unfortunately, it would make saving
the Exception for later (as unittest does) more awkward, and it would
not allow as much cleanup of the sys module.
"""

Special use cases that want to save the traceback for later use can
always explicitly convert the traceback into a real (non-weak)
reference.

I don't understand the reference to the sys module cleanup, so
can't comment on that.

-- 
Marc-Andre Lemburg
eGenix.com

From solipsis at pitrou.net Sun Jun 27 20:00:01 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sun, 27 Jun 2010 20:00:01 +0200
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com>
Message-ID: <20100627200001.4821ede9@pitrou.net>

On Sun, 27 Jun 2010 19:20:07 +0200
"M.-A. Lemburg" wrote:
>
> Not necessarily roll back the feature, but an implementation
> that deliberately introduces circular references is not really
> ideal.
>
> Since tracebacks on exceptions are rarely used by applications,
> I think it would be better to turn them into weak references.

How do you manage to get a strong reference before the traceback object
gets deleted?

Besides, an API which gives some information in an unreliable manner
does not seem very user-friendly to me.

I think I like the OP's idea better: allow releasing the references to
local and global variables from the frames in the traceback. This
keeps a lot of potentially large objects alive - some of which may also
keep some OS resources busy.

From brett at python.org Sun Jun 27 23:11:14 2010
From: brett at python.org (Brett Cannon)
Date: Sun, 27 Jun 2010 14:11:14 -0700
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
In-Reply-To: <4C278847.5040600@egenix.com>
References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com>
Message-ID: 

On Sun, Jun 27, 2010 at 10:20, M.-A.
Lemburg wrote: > Guido van Rossum wrote: >> On Sat, Jun 26, 2010 at 4:45 PM, M.-A. Lemburg wrote: >>> Also note that garbage collection will not necessarily do what >>> the user expects: it is well possible that big amounts of >>> memory will stay allocated as unused space in pymalloc. >>> This is not specific to the discussed case, but still a valid >>> user concern. Greg Hazel observed this situation in his >>> example. >> >> Aha. So whereas the process size ballooned, there is no actual memory >> leak (his example threw away the exception each time through the >> loop), it's just that looking at process size is a bad way to assess >> memory leaks. I would like to reject this then as "that's just how >> Python's memory allocation works". As you say, it's not specific to >> this case; it comes up occasionally and it's just a matter of user >> education. > > pymalloc has gotten a lot better since it was fixed in Python 2.5 > to return unused chunks of memory to the OS, but we still have the > issue of fragmented arenas with cases of just a few bytes > keeping 256kB (the size of an arena) allocated. > >> I don't think anything should be done about __traceback__ either -- >> frameworks that have this problem can work around it in various ways. >> Or, at least I don't see a reason to panic and roll back the feature. >> Maybe eventually it can be improved by adding some kind of >> functionality to control some details of the behavior. > > Not necessarily roll back the feature, but an implementation > that deliberately introduces circular references is not really > ideal. But the circular reference only occurs if you store a reference outside the 'except' clause; Python 3 explicitly deletes any caught exception variable to prevent the loop. > > Since tracebacks on exceptions are rarely used by applications, > I think it would be better to turn them into weak references. > While I would be fine with that if that many people save raised exceptions outside of an 'except' clause, I doubt that happens very often and there would be backward-compatibility issues at this point. From timothy.c.delaney at gmail.com Sun Jun 27 23:17:32 2010 From: timothy.c.delaney at gmail.com (Tim Delaney) Date: Mon, 28 Jun 2010 07:17:32 +1000 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: <20100627200001.4821ede9@pitrou.net> References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com> <20100627200001.4821ede9@pitrou.net> Message-ID: On 28 June 2010 04:00, Antoine Pitrou wrote: > On Sun, 27 Jun 2010 19:20:07 +0200 > "M.-A. Lemburg" wrote: > > > > Not necessarily roll back the feature, but an implementation > > that deliberately introduces circular references is not really > > ideal. > > > > Since tracebacks on exceptions are rarely used by applications, > > I think it would be better to turn them into weak references. > > How do you manage to get a strong reference before the traceback object > gets deleted? > At the beginning of the 'except' block, a strong local (but hidden) reference is obtained to the traceback (if it exists). This is deleted at the end of the 'except' block. Besides, an API which gives some information in an unreliable manner > does not seem very user-friendly to me. 
>
> I think I like the OP's idea better: allow releasing the references to
> local and global variables from the frames in the traceback. This
> keeps a lot of potentially large objects alive - some of which may also
> keep some OS resources busy.

I agree, with a variation - keep a weak reference to the frame in the
traceback, and have a way for the application to specify that it wants
to retain strong references to frames (so unittest for example can
guarantee access to locals and globals). Possibly a context manager
could be used for this, and decorators could be used to wrap an entire
method in the context manager.

A dummy frame would also be stored that contained enough info to
replicate the existing stack trace (file, line number, etc). A strong
reference could be obtained via the existing attribute, converted to a
property, which does:

a. return the internal reference if it is not a dummy frame;
b. return the result of the weak reference if it still exists;
c. return the dummy frame reference.

I think this gives us the best of all worlds:

1. No strong reference to locals/globals in tracebacks by default;
2. Able to force strong references to frames;
3. We don't lose the ability to compose a full and complete stack trace.

Tim Delaney

From ncoghlan at gmail.com Mon Jun 28 08:25:51 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 28 Jun 2010 16:25:51 +1000
Subject: [Python-ideas] explicitation lines in python ?
In-Reply-To: <4C26F710.4030902@gmail.com>
References: <4C24FEAF.4030304@gmail.com> <4C26F710.4030902@gmail.com>
Message-ID: 

On Sun, Jun 27, 2010 at 5:00 PM, Daniel DELAY wrote:
> Le 26/06/2010 05:26, Chris Rebert a écrit :
> That's why I would in fact prefer a third option:
> c) a refinement expression is only evaluated once even if the refinement
> name (ex: "htmlline") appears twice or more, but that name is not published
> in locals().

That's definitely the same goal as the old "statement local namespace"
idea. Trawling through a couple of old threads, the various keywords
suggested were:

- with (pro: already a keyword, con: now has completely different
meaning from normal with statement)
- where (pro: same name as used in Haskell, con: new keyword, also
completely different meaning from the SQL meaning many programmers
will find more familiar)
- using (pro: completely made up name at the time, con: new keyword,
now conflicts with the C# equivalents to Python's with and import
statements)

These days, I'm personally inclined to favour Haskell's "where"
terminology, but that preference isn't particularly strong. The
availability of "nonlocal" binding semantics also makes the semantics
much easier to define than they were in those previous discussions
(the lack of clear semantics for name binding statements with an
attached local namespace was the major factor blocking creation of a
reference implementation for this proposal back then). For example:

    c = sqrt(a*a + b*b) where:
        a = retrieve_a()
        b = retrieve_b()

could translate to something like:

    def _anon():  # *(see below)
        nonlocal c
        a = retrieve_a()
        b = retrieve_b()
        c = sqrt(a*a + b*b)
    _anon()

*(unlike Python code, the compiler can make truly anonymous functions
by storing them solely on the VM stack. It already does this when
executing class definitions.)

The major question mark over the idea is whether or not it would
actually help or hinder readability in practice.
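A runnable approximation of that translation, for anyone who wants to
play with the semantics (a sketch only: it returns the value instead of
using the nonlocal write, and the retrieve functions are made-up stubs):

    from math import sqrt

    def retrieve_a(): return 3.0   # stub inputs standing in for
    def retrieve_b(): return 4.0   # the example's retrieve_a()/retrieve_b()

    def _anon():
        # the "where" suite runs in its own throwaway namespace
        a = retrieve_a()
        b = retrieve_b()
        return sqrt(a*a + b*b)

    c = _anon()   # 5.0; 'a' and 'b' never appear in this scope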
That readability question could be addressed by someone trawling the
standard library or other large public Python code bases (e.g. things
in SciPy, the assorted web frameworks, bzr, hg) for existing code that
could be made clearer if less important details could easily be moved
out of the way into an indented suite.

It's also something that has no chance of being accepted until after
the language moratorium ends.

Cheers,
Nick.

-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From mal at egenix.com Mon Jun 28 13:14:21 2010
From: mal at egenix.com (M.-A. Lemburg)
Date: Mon, 28 Jun 2010 13:14:21 +0200
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
In-Reply-To: <20100627200001.4821ede9@pitrou.net>
References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com> <20100627200001.4821ede9@pitrou.net>
Message-ID: <4C28840D.5040703@egenix.com>

Antoine Pitrou wrote:
> On Sun, 27 Jun 2010 19:20:07 +0200
> "M.-A. Lemburg" wrote:
>>
>> Not necessarily roll back the feature, but an implementation
>> that deliberately introduces circular references is not really
>> ideal.
>>
>> Since tracebacks on exceptions are rarely used by applications,
>> I think it would be better to turn them into weak references.
>
> How do you manage to get a strong reference before the traceback object
> gets deleted?

IIUC, the traceback object will still be alive during processing of
the except clause, so all you'd have to do is turn the weak reference
into a real one.

Let's assume that the weakref object is called .__traceback_weakref__
and the proxy called .__traceback__ (to assure compatibility).

    ...
    except TypeError as exc:
        # Replace the weakref object with the referenced object
        exc.__traceback__ = exc.__traceback_weakref__()
        # Set the weakref object to None to have it collected and to
        # signal this operation to other code knowing about this
        # strategy.
        exc.__traceback_weakref__ = None

BTW: I wonder why proxy objects don't provide direct access to
the weakref object they are using. That would make keeping that
extra variable around unnecessary.

> Besides, an API which gives some information in an unreliable manner
> does not seem very user-friendly to me.

The argument so far has been that most error processing happens in the
except clause itself, making it unnecessary to deal with possible
circular references. That is certainly true in many cases.

Now under that argument, using the traceback stored on an exception
outside the except clause is even less likely to be needed, so I don't
follow your concern that using a weak reference is less user-friendly.

Perhaps someone could highlight a use case where the traceback
is needed outside the except clause ?!

> I think I like the OP's idea better: allow releasing the references to
> local and global variables from the frames in the traceback. This
> keeps a lot of potentially large objects alive - some of which may also
> keep some OS resources busy.

It's certainly a good idea to pay extra attention to this in Python3.

-- 
Marc-Andre Lemburg
eGenix.com
From mal at egenix.com Mon Jun 28 13:31:44 2010
From: mal at egenix.com (M.-A. Lemburg)
Date: Mon, 28 Jun 2010 13:31:44 +0200
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
In-Reply-To: References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com>
Message-ID: <4C288820.2000707@egenix.com>

Nick Coghlan wrote:
> On Sun, Jun 27, 2010 at 7:53 AM, M.-A. Lemburg wrote:
>>> He's not talking about creating traceback objects outside the core, but
>>> being able to reuse tracebacks created by the core without keeping alive
>>> a whole chain of objects.
>>
>> With the question I was referring to the suggestion by
>> Greg Ewing in which he seemed to imply that Pyrex and Cython
>> create traceback objects.
>
> When Python code calls into Pyrex/C code which then calls back into
> Python, I understand they insert dummy frames into the tracebacks to
> make the call stack more complete.

Thanks for that bit of information. I suppose they do this for
better error reporting, right ?

-- 
Marc-Andre Lemburg
eGenix.com

From ncoghlan at gmail.com Mon Jun 28 14:10:09 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 28 Jun 2010 22:10:09 +1000
Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals
In-Reply-To: <4C288820.2000707@egenix.com>
References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C288820.2000707@egenix.com>
Message-ID: 

On Mon, Jun 28, 2010 at 9:31 PM, M.-A. Lemburg wrote:
> Nick Coghlan wrote:
>> When Python code calls into Pyrex/C code which then calls back into
>> Python, I understand they insert dummy frames into the tracebacks to
>> make the call stack more complete.
>
> Thanks for that bit of information. I suppose they do this for
> better error reporting, right ?

I believe so, but keep in mind that I've never actually used them
myself, I've just seen this behaviour described elsewhere. It makes
sense for them to do it though, since following a call stack through
a plain C or C++ extension module can get rather confusing at times.

Cheers,
Nick.
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From solipsis at pitrou.net Mon Jun 28 14:39:24 2010 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 28 Jun 2010 14:39:24 +0200 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com> <20100627200001.4821ede9@pitrou.net> <4C28840D.5040703@egenix.com> Message-ID: <20100628143924.766c056d@pitrou.net> On Mon, 28 Jun 2010 13:14:21 +0200 "M.-A. Lemburg" wrote: > > BTW: I wonder why proxy objects don't provide a direct access to > the weakref object they are using. That would make keeping that > extra variable around unnecessary. Probably because the proxy would then have an additional attribute which isn't on the proxied object. Or, worse, it could also shadow one of the proxied object's existing attributes. > Perhaps someone could highlight a use case where the traceback > is needed outside the except clause ?! Well, it's needed if you want delayed error reporting and still display a comprehensive stack trace (rather than just the exception message). Frameworks often need this kind of behaviour; Twisted was already mentioned in this thread. But, even outside of frameworks, there are situations where you want to process a bunch of data and present all processing errors at the end. However, as the OP argued, most often you need the traceback in order to display file names and line numbers, but you don't need the attached variables (locals and globals). Regards Antoine. From mal at egenix.com Mon Jun 28 15:29:25 2010 From: mal at egenix.com (M.-A. Lemburg) Date: Mon, 28 Jun 2010 15:29:25 +0200 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: <20100628143924.766c056d@pitrou.net> References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com> <20100627200001.4821ede9@pitrou.net> <4C28840D.5040703@egenix.com> <20100628143924.766c056d@pitrou.net> Message-ID: <4C28A3B5.9000100@egenix.com> Antoine Pitrou wrote: > On Mon, 28 Jun 2010 13:14:21 +0200 > "M.-A. Lemburg" wrote: >> >> BTW: I wonder why proxy objects don't provide a direct access to >> the weakref object they are using. That would make keeping that >> extra variable around unnecessary. > > Probably because the proxy would then have an additional attribute > which isn't on the proxied object. Or, worse, it could also shadow > one of the proxied object's existing attributes. That's a very weak argument, IMHO. It all depends on the naming of the attribute. Also note that the proxied object won't know anything about that attribute, so it doesn't have any side-effects. We've used such an approach on our mxProxy object for years without any problems or naming conflicts so far: http://www.egenix.com/products/python/mxBase/mxProxy/ http://www.egenix.com/products/python/mxBase/mxProxy/doc/#_Toc162774452 >> Perhaps someone could highlight a use case where the traceback >> is needed outside the except clause ?! > > Well, it's needed if you want delayed error reporting and still display > a comprehensive stack trace (rather than just the exception message). 
> Frameworks often need this kind of behaviour; Twisted was already > mentioned in this thread. But, even outside of frameworks, there are > situations where you want to process a bunch of data and present all > processing errors at the end. I had already given that example myself, but in those cases I had in mind the stack trace is not really needed: instead, you add the relevant information to the list of errors directly from the except clause, since the error information needed to report the issues is not related to programming errors, but instead to data errors. > However, as the OP argued, most often you need the traceback in order > to display file names and line numbers, but you don't need the attached > variables (locals and globals). I guess all this just needs to be highlighted in the documentation to make programmers aware of the fact that they cannot just store exception objects away without considering the consequences of this first. -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Jun 28 2010) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2010-07-19: EuroPython 2010, Birmingham, UK 20 days to go ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From solipsis at pitrou.net Mon Jun 28 16:10:18 2010 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 28 Jun 2010 16:10:18 +0200 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com> <20100627200001.4821ede9@pitrou.net> <4C28840D.5040703@egenix.com> <20100628143924.766c056d@pitrou.net> <4C28A3B5.9000100@egenix.com> Message-ID: <20100628161018.15705b73@pitrou.net> On Mon, 28 Jun 2010 15:29:25 +0200 "M.-A. Lemburg" wrote: > Antoine Pitrou wrote: > > On Mon, 28 Jun 2010 13:14:21 +0200 > > "M.-A. Lemburg" wrote: > >> > >> BTW: I wonder why proxy objects don't provide a direct access to > >> the weakref object they are using. That would make keeping that > >> extra variable around unnecessary. > > > > Probably because the proxy would then have an additional attribute > > which isn't on the proxied object. Or, worse, it could also shadow > > one of the proxied object's existing attributes. > > That's a very weak argument, IMHO. It all depends on the > naming of the attribute. What name do you suggest that isn't cumbersome or awkward, and yet doesn't present any risk of conflict with attributes of the proxied object? > We've used such an approach on our mxProxy object for years without > any problems or naming conflicts so far: > > http://www.egenix.com/products/python/mxBase/mxProxy/ > http://www.egenix.com/products/python/mxBase/mxProxy/doc/#_Toc162774452 Well, if some features of mxProxy are useful, perhaps it would be worth integrating them in the stdlib. Regards Antoine. From mal at egenix.com Mon Jun 28 16:20:53 2010 From: mal at egenix.com (M.-A. 
Lemburg) Date: Mon, 28 Jun 2010 16:20:53 +0200 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: <20100628161018.15705b73@pitrou.net> References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com> <20100627200001.4821ede9@pitrou.net> <4C28840D.5040703@egenix.com> <20100628143924.766c056d@pitrou.net> <4C28A3B5.9000100@egenix.com> <20100628161018.15705b73@pitrou.net> Message-ID: <4C28AFC5.4090309@egenix.com> Antoine Pitrou wrote: > On Mon, 28 Jun 2010 15:29:25 +0200 > "M.-A. Lemburg" wrote: >> Antoine Pitrou wrote: >>> On Mon, 28 Jun 2010 13:14:21 +0200 >>> "M.-A. Lemburg" wrote: >>>> >>>> BTW: I wonder why proxy objects don't provide a direct access to >>>> the weakref object they are using. That would make keeping that >>>> extra variable around unnecessary. >>> >>> Probably because the proxy would then have an additional attribute >>> which isn't on the proxied object. Or, worse, it could also shadow >>> one of the proxied object's existing attributes. >> >> That's a very weak argument, IMHO. It all depends on the >> naming of the attribute. > > What name do you suggest that isn't cumbersome or awkward, and yet > doesn't present any risk of conflict with attributes of the proxied > object? If you want to play safe, use something like '__weakref_object__'. In mxProxy, we simply reserved all methods and attributes that start with 'proxy_' for use by the proxy object itself. That hasn't caused a conflict so far. >> We've used such an approach on our mxProxy object for years without >> any problems or naming conflicts so far: >> >> http://www.egenix.com/products/python/mxBase/mxProxy/ >> http://www.egenix.com/products/python/mxBase/mxProxy/doc/#_Toc162774452 > > Well, if some features of mxProxy are useful, perhaps it would be worth > integrating them in the stdlib. We mainly use mxProxy for low-level access control to objects, and as a way to implement a cleanup protocol for breaking circular references early. The weak reference feature was a later add-on and also serves as an additional way to prevent creation of circular references. All this was designed prior to Python implementing the GC protocol which now implements something similar to the cleanup protocol we have in mxProxy. Unlike the standard Python weakref implementation, mxProxy doesn't require changes to the proxy objects in order to create a weak reference. It works for all objects. I don't know why Fred used a different approach. -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Jun 28 2010) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2010-07-19: EuroPython 2010, Birmingham, UK 20 days to go ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. 
Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From guido at python.org Tue Jun 29 01:09:55 2010 From: guido at python.org (Guido van Rossum) Date: Mon, 28 Jun 2010 16:09:55 -0700 Subject: [Python-ideas] [Python-Dev] [ANN]: "newthreading" - an approach to simplified thread usage, and a path to getting rid of the GIL In-Reply-To: <4C262D37.7020807@animats.com> References: <4C259A25.1060705@animats.com> <4C2600B4.5020503@voidspace.org.uk> <4C262D37.7020807@animats.com> Message-ID: I'm moving this thread to python-ideas, where it belongs. I've looked at the implementation code (even stepped through it with pdb!), read the sample/test code, and read the two papers on animats.com fairly closely (they have a lot of overlap, and the memory model described below seems copied verbatim from http://www.animats.com/papers/languages/pythonconcurrency.html version 0.8). Some reactions (trying to hide my responses to the details of the code): - First of all, I'm very happy to see radical ideas proposed, even if they are at present unrealistic. We need a big brainstorm to come up with ideas from which an eventual solution to the multicore problem might be chosen. (Jesse Noller's multiprocessing is another; Adam Olsen's work yet another, at a different end of the spectrum.) - The proposed new semantics (frozen objects, memory model, auto-freezing of globals, enforcement of naming conventions) are radically different from Python's current semantics. They will break every 3rd party library in many more ways than Python 3. This is not surprising given the goals of the proposal (and its roots in Adam Olsen's work) but places a huge roadblock for acceptance. I see no choice but to keep trying to come up with a compromise that is more palatable and compatible without throwing away all the advantages. As it now stands, the proposal might as well be a new and different language. - SynchronizedObject looks like a mixture of a Java synchronized class (a non-standard concept in Java but easily understood as a class all whose public methods are synchronized) and a condition variable (which has the same semantics of releasing the lock while waiting but without crawling the stack for other locks to release). It looks like the examples showing off SynchronizedObject could be implemented just as elegantly using a condition variable (and voluntary abstention from using shared mutable objects). - If the goal is to experiment with new control structures, I recommend decoupling them from the memory model and frozen objects, instead relying (as is traditional in Python) on programmer caution to avoid races. This would make it much easier to see how programmers respond to the new control structures. - You could add the freeze() function for voluntary use, and you could even add automatic wrapping of arguments and return values for certain classes using a class decorator or a metaclass, but the performance overhead makes this unlikely to win over many converts. I don't see much use for the "whole program freezing" done by the current prototype -- there are way too many backdoors in Python for the prototype approach to be anywhere near foolproof, and if we want a non-foolproof approach, voluntary constraint (and, in some cases, voluntary, i.e. explicit, wrapping of modules or classes) would work just as well. - For a larger-scale experiment with the new memory model and semantic restrictions (or would it be better to call them syntactic restrictions? 
-- after all they are about statically detectable properties like naming conventions) I recommend looking at PyPy, which has as one of its explicitly stated project goals easy experimentation with different object models. - I'm sure I've forgotten something, but I wanted to keep my impressions fresh. - Again, John, thanks for taking the time to come up with an implementation of your idea! --Guido

On Sat, Jun 26, 2010 at 9:39 AM, John Nagle wrote:
> On 6/26/2010 7:44 AM, Jesse Noller wrote:
>> On Sat, Jun 26, 2010 at 9:29 AM, Michael Foord wrote:
>>> On 26/06/2010 07:11, John Nagle wrote:
>>>> We have just released a proof-of-concept implementation of a new approach to thread management - "newthreading".
> ....
>>> The import * form is considered bad practise in *general* and should not be recommended unless there is a good reason.
>
> I agree. I just did that to make the examples cleaner.
>
>>> however the introduction of free-threading in Python has not been hampered by lack of synchronization primitives but by the difficulty of changing the interpreter without unduly impacting single threaded code.
>
> That's what I'm trying to address here.
>
>>> Providing an alternative garbage collection mechanism other than reference counting would be a more interesting first-step as far as I can see, as that removes the locking required around every access to an object (which currently touches the reference count). Introducing free-threading by *changing* the threading semantics (so you can't share non-frozen objects between threads) would not be acceptable. That comment is likely to be based on a misunderstanding of your future intentions though. :-)
>
> This work comes out of a discussion a few of us had at a restaurant in Palo Alto after a Stanford talk by the group at Facebook which is building a JIT compiler for PHP. We were discussing how to make threading both safe for the average programmer and efficient. Javascript and PHP don't have threads at all; Python has safe threading, but it's slow. C/C++/Java all have race condition problems, of course. The Facebook guy pointed out that you can't redefine a function dynamically in PHP, and they get a performance win in their JIT by exploiting this.
>
> I haven't gone into the memory model in enough detail in the technical paper. The memory model I envision for this has three memory zones:
>
> 1. Shared fully-immutable objects: primarily strings, numbers, and tuples, all of whose elements are fully immutable. These can be shared without locking, and reclaimed by a concurrent garbage collector like Boehm's. They have no destructors, so finalization is not an issue.
>
> 2. Local objects. These are managed as at present, and require no locking. These can either be thread-local, or local to a synchronized object. There are no links between local objects under different "ownership". Whether each thread and object has its own private heap, or whether there's a common heap with locks at the allocator is an implementation decision.
>
> 3. Shared mutable objects: mostly synchronized objects, but also immutable objects like tuples which contain references to objects that aren't fully immutable. These are the high-overhead objects, and require locking during reference count updates, or atomic reference count operations if supported by the hardware.
> The general idea is to minimize the number of objects in this zone.
>
> The zone of an object is determined when the object is created, and never changes. This is relatively simple to implement. Tuples (and frozensets, frozendicts, etc.) are normally zone 2 objects. Only "freeze" creates collections in zones 1 and 3. Synchronized objects are always created in zone 3. There are no difficult handoffs, where an object that was previously thread-local now has to be shared and has to acquire locks during the transition.
>
> Existing interlinked data structures, like parse trees and GUIs, are by default zone 2 objects, with the same semantics as at present. They can be placed inside a SynchronizedObject if desired, which makes them usable from multiple threads. That's optional; they're thread-local otherwise.
>
> The rationale behind "freezing" some of the language semantics when the program goes multi-thread comes from two sources - Adam Olsen's Safethread work, and the acceptance of the multiprocessing module. Olsen tried to retain all the dynamism of the language in a multithreaded environment, but locking all the underlying dictionaries was a boat-anchor on the whole system, and slowed things down so much that he abandoned the project. The Unladen Swallow documentation indicates that early thinking on the project was that Olsen's approach would allow getting rid of the GIL, but later notes indicate that no path to a GIL-free JIT system is currently in development.
>
> The multiprocessing module provides semantics similar to threading with "freezing". Data passed between processes is "frozen" by pickling. Processes can't modify each other's code. Restrictive though the multiprocessing module is, it appears to be useful. It is sometimes recommended as the Pythonic approach to multi-core CPUs. This is an indication that "freezing" is not unacceptable to the user community.
>
> Most of the real-world use cases for extreme dynamism involve events that happen during startup. Configuration files are read, modules are selectively included, functions are overridden, tables of references to functions are set up, regular expressions are compiled, and the code is brought into the appropriately configured state. Then the worker threads are started and the real work starts.
>
> After two decades of failed attempts to remove the Global Interpreter Lock without making performance worse, it is perhaps time to take a harder look at scalable threading semantics.
>
> John Nagle
> Animats

-- --Guido van Rossum (python.org/~guido)

From solipsis at pitrou.net Tue Jun 29 01:40:23 2010 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 29 Jun 2010 01:40:23 +0200 Subject: [Python-ideas] [Python-Dev] [ANN]: "newthreading" - an approach to simplified thread usage, and a path to getting rid of the GIL References: <4C259A25.1060705@animats.com> <4C2600B4.5020503@voidspace.org.uk> <4C262D37.7020807@animats.com> Message-ID: <20100629014023.057a3d44@pitrou.net>

On Mon, 28 Jun 2010 16:09:55 -0700 Guido van Rossum wrote:
> I'm moving this thread to python-ideas, where it belongs.
[...]

For the record, I really think the solution to the "try to remove the GIL" problem is to... try to remove it.
I believe it implies several preparatory steps:

- take full control of memory allocation
- on top of that, devise a full garbage collector (probably including a notion of external references such that existing ways of writing C extensions are still correct)
- then, do the tedious, delicate grunt work of adding locking to critical structures without slowing them down (too much)

Trying to invent schemes to make multithreading easier to program with is a nice endeavour in itself, but quite orthogonal IMO. Regards Antoine.

From fuzzyman at gmail.com Tue Jun 29 01:54:11 2010 From: fuzzyman at gmail.com (Michael Foord) Date: Tue, 29 Jun 2010 00:54:11 +0100 Subject: [Python-ideas] [Python-Dev] [ANN]: "newthreading" - an approach to simplified thread usage, and a path to getting rid of the GIL In-Reply-To: <20100629014023.057a3d44@pitrou.net> References: <4C259A25.1060705@animats.com> <4C2600B4.5020503@voidspace.org.uk> <4C262D37.7020807@animats.com> <20100629014023.057a3d44@pitrou.net> Message-ID: <9BE39202-534C-4969-BD3C-6E94FF062384@gmail.com>

On 29 Jun 2010, at 00:40, Antoine Pitrou wrote:
> On Mon, 28 Jun 2010 16:09:55 -0700 Guido van Rossum wrote:
>> I'm moving this thread to python-ideas, where it belongs.
> [...]
> For the record, I really think the solution to the "try to remove the GIL" problem is to... try to remove it.
> I believe it implies several preparatory steps:
> - take full control of memory allocation
> - on top of that, devise a full garbage collector (probably including a notion of external references such that existing ways of writing C extensions are still correct)
> - then, do the tedious, delicate grunt work of adding locking to critical structures without slowing them down (too much)
> Trying to invent schemes to make multithreading easier to program with is a nice endeavour in itself, but quite orthogonal IMO.

Full agreement. Ironclad, a project to enable the use of Python C extensions with IronPython - which has a generational moving GC - uses a hybrid approach. It allows C extensions to use reference counting but manipulates the reference count so that it can only drop to zero once there are no references left on the IronPython side. There are complications with this approach, which Ironclad handles, but that would be much easier when we have control over the implementation (Ironclad doesn't change the IronPython implementation). No link I'm afraid, sending from a mobile device. Incidentally, Ironclad also 'fakes' the GIL as IronPython has no GIL. In theory this could cause problems for C extensions that aren't thread safe but that hasn't yet been a problem in production (Ironclad is mainly used with numpy). Michael Foord

From greg.ewing at canterbury.ac.nz Tue Jun 29 02:06:45 2010 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 29 Jun 2010 12:06:45 +1200 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: <4C288820.2000707@egenix.com> References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C288820.2000707@egenix.com> Message-ID: <4C293915.9060002@canterbury.ac.nz>
M.-A. Lemburg wrote:
> Nick Coghlan wrote:
>> When Python code calls into Pyrex/C code which then call back into Python, I understand they insert dummy frames into the tracebacks to make the call stack more complete.
> I suppose they do this for better error reporting, right?

Yes. This is one reason I would like to be able to have traceback objects without a corresponding frame. Having to create an entire frame just to have somewhere to put the file name is very annoying. Also, being able to remove the whole frame from a traceback object seems like a cleaner and more complete way to implement what the OP wanted. -- Greg

From greg.ewing at canterbury.ac.nz Tue Jun 29 02:21:23 2010 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 29 Jun 2010 12:21:23 +1200 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: <20100628161018.15705b73@pitrou.net> References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com> <20100627200001.4821ede9@pitrou.net> <4C28840D.5040703@egenix.com> <20100628143924.766c056d@pitrou.net> <4C28A3B5.9000100@egenix.com> <20100628161018.15705b73@pitrou.net> Message-ID: <4C293C83.1050804@canterbury.ac.nz>

>> Antoine Pitrou wrote:
>>> "M.-A. Lemburg" wrote:
>>>> BTW: I wonder why proxy objects don't provide a direct access to the weakref object they are using.
>>> Probably because the proxy would then have an additional attribute which isn't on the proxied object.

This problem could be avoided by providing a function to extract the proxied object. -- Greg

From rhamph at gmail.com Tue Jun 29 03:17:11 2010 From: rhamph at gmail.com (Adam Olsen) Date: Mon, 28 Jun 2010 19:17:11 -0600 Subject: [Python-ideas] [Python-Dev] [ANN]: "newthreading" - an approach to simplified thread usage, and a path to getting rid of the GIL In-Reply-To: <4C262D37.7020807@animats.com> References: <4C259A25.1060705@animats.com> <4C2600B4.5020503@voidspace.org.uk> <4C262D37.7020807@animats.com> Message-ID:

On Sat, Jun 26, 2010 at 10:39, John Nagle wrote:
> The rationale behind "freezing" some of the language semantics when the program goes multi-thread comes from two sources - Adam Olsen's Safethread work, and the acceptance of the multiprocessing module. Olsen tried to retain all the dynamism of the language in a multithreaded environment, but locking all the underlying dictionaries was a boat-anchor on the whole system, and slowed things down so much that he abandoned the project. The Unladen Swallow documentation indicates that early thinking on the project was that Olsen's approach would allow getting rid of the GIL, but later notes indicate that no path to a GIL-free JIT system is currently in development.

That's not true. Refcounting was the boat-anchor, not dicts. I was unable to come up with a relatively simple replacement that scaled fully. The dicts shared as module globals and class dicts were a design issue, but more of an ideological one: concurrency mentality says you should only share immutable objects. Python prefers ad-hoc design, where you can do what you want so long as it's not particularly nasty. I was unable to find a way to have both, so I declared the python mentality the winner. The shareddict I came up with uses a read/write lock, so that it's safe when you do mutate and doesn't bottleneck when you don't mutate.
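To make the idea concrete, here is a minimal pure-Python sketch of such a read/write-locked dict. It is illustrative only - not Adam's actual shareddict, and it omits the writer-preference and checkpointing machinery a production version would need (see the next paragraph):

    import threading

    class SharedDict(object):
        # Readers may overlap; a writer waits until no readers remain
        # and then mutates while still holding the lock.
        def __init__(self):
            self._data = {}
            self._lock = threading.Lock()
            self._no_readers = threading.Condition(self._lock)
            self._nreaders = 0

        def __getitem__(self, key):
            with self._lock:
                self._nreaders += 1
            try:
                return self._data[key]
            finally:
                with self._lock:
                    self._nreaders -= 1
                    if not self._nreaders:
                        self._no_readers.notify_all()

        def __setitem__(self, key, value):
            with self._lock:
                while self._nreaders:
                    self._no_readers.wait()
                self._data[key] = value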
The only thing fancy was my method of checkpointing when doing a readlock->writelock transition, but there's a hundred other ways to accomplish that.

From rhamph at gmail.com Tue Jun 29 03:23:20 2010 From: rhamph at gmail.com (Adam Olsen) Date: Mon, 28 Jun 2010 19:23:20 -0600 Subject: [Python-ideas] [Python-Dev] [ANN]: "newthreading" - an approach to simplified thread usage, and a path to getting rid of the GIL In-Reply-To: <20100629014023.057a3d44@pitrou.net> References: <4C259A25.1060705@animats.com> <4C2600B4.5020503@voidspace.org.uk> <4C262D37.7020807@animats.com> <20100629014023.057a3d44@pitrou.net> Message-ID:

On Mon, Jun 28, 2010 at 17:40, Antoine Pitrou wrote:
> On Mon, 28 Jun 2010 16:09:55 -0700 Guido van Rossum wrote:
>> I'm moving this thread to python-ideas, where it belongs.
> [...]
> For the record, I really think the solution to the "try to remove the GIL" problem is to... try to remove it.
> I believe it implies several preparatory steps:
> - take full control of memory allocation
> - on top of that, devise a full garbage collector (probably including a notion of external references such that existing ways of writing C extensions are still correct)
> - then, do the tedious, delicate grunt work of adding locking to critical structures without slowing them down (too much)
> Trying to invent schemes to make multithreading easier to program with is a nice endeavour in itself, but quite orthogonal IMO.

+1. Designing an API in C for a precise GC is tedious, and would probably be very ugly, but it's entirely doable. We simply need the will to go through with it. I can't say what the overhead would look like, but so long as it scales well and it's a compile-time option it should find plenty of users.

From tjreedy at udel.edu Tue Jun 29 03:26:52 2010 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 28 Jun 2010 21:26:52 -0400 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: <20100628143924.766c056d@pitrou.net> References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com> <20100627200001.4821ede9@pitrou.net> <4C28840D.5040703@egenix.com> <20100628143924.766c056d@pitrou.net> Message-ID:

On 6/28/2010 8:39 AM, Antoine Pitrou wrote:
> However, as the OP argued, most often you need the traceback in order to display file names and line numbers, but you don't need the attached variables (locals and globals).

It then seems to me that one should extract the file name and line number info one wants to save before exiting the exception clause, and let the exception and traceback go on exit. Is a library function needed to make extraction easier? Perhaps this: "The reason for this [deletion on exit] is that with the traceback attached to them, exceptions will form a reference cycle with the stack frame, keeping all locals in that frame alive until the next garbage collection occurs." could be strengthened into a better warning that this: "That means that you have to assign the exception to a different name if you want to be able to refer to it after the except clause." may really, really not be a good idea.
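A minimal sketch of the extract-early pattern described here, using only a stdlib call that already exists (traceback.extract_tb); job() is a stand-in for whatever work actually raises:

    import sys
    import traceback

    def run_all(jobs):
        errors = []
        for job in jobs:
            try:
                job()   # stand-in for the real work
            except Exception as exc:
                tb = sys.exc_info()[2]
                # Keep only the rendered (filename, lineno, funcname,
                # text) tuples; the traceback object itself, with its
                # frames, locals and globals, can then be collected.
                errors.append((repr(exc), traceback.extract_tb(tb)))
                del tb  # avoid prolonging the frame<->traceback cycle
        return errors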
-- Terry Jan Reedy

From ghazel at gmail.com Tue Jun 29 03:34:39 2010 From: ghazel at gmail.com (ghazel at gmail.com) Date: Mon, 28 Jun 2010 18:34:39 -0700 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com> <20100627200001.4821ede9@pitrou.net> <4C28840D.5040703@egenix.com> <20100628143924.766c056d@pitrou.net> Message-ID:

On Mon, Jun 28, 2010 at 6:26 PM, Terry Reedy wrote:
> On 6/28/2010 8:39 AM, Antoine Pitrou wrote:
>> However, as the OP argued, most often you need the traceback in order to display file names and line numbers, but you don't need the attached variables (locals and globals).
> It then seems to me that one should extract the file name and line number info one wants to save before exiting the exception clause, and let the exception and traceback go on exit. Is a library function needed to make extraction easier?

Unfortunately this is only half of the task. To re-raise the exception with the traceback later, a real traceback object is needed. To my knowledge there is no way to create a real traceback object from Python given only file name and line numbers. -Greg

From mark at qtrac.eu Tue Jun 29 10:20:56 2010 From: mark at qtrac.eu (Mark Summerfield) Date: Tue, 29 Jun 2010 09:20:56 +0100 Subject: [Python-ideas] Maybe allow br"" or rb"" e.g., for bytes regexes in Py3? Message-ID: <201006290920.56104.mark@qtrac.eu>

Hi, Python 3 has two string prefixes, r"" for raw strings and b"" for bytes. So if you want to create a regex based on bytes, as far as I can tell you have to do something like this:

    FONTNAME_RE = re.compile(r"/FontName\s+/(\S+)".encode("ascii"))
    # or
    FONTNAME_RE = re.compile(b"/FontName\\s+/(\\S+)")

I think it would be much nicer if one could write:

    FONTNAME_RE = re.compile(br"/FontName\s+/(\S+)")
    # or
    FONTNAME_RE = re.compile(rb"/FontName\s+/(\S+)")

I _slightly_ prefer rb"" to br"" but either would be great:-) Why would you want a bytes regex? In my case I am reading PostScript files and PostScript .pfa font files so that I can embed the latter into the former. But I don't know what encoding these files use beyond the fact that it is ASCII or some ASCII superset like Latin1. So in true Python style I don't assume: instead I read the files as bytes and do all my processing using bytes, at no point decoding since I only ever insert ASCII characters. I don't think this is a rare example: with Python 3's clean separation between strings & bytes (a major advance IMO), I think there will often be cases where all the processing is done using bytes. -- Mark Summerfield, Qtrac Ltd, www.qtrac.eu C++, Python, Qt, PyQt - training and consultancy "Advanced Qt Programming" - ISBN 0321635906 http://www.qtrac.eu/aqpbook.html I ordered a Dell netbook with Ubuntu...
I got no OS, no apology, no solution, & no refund (so far) http://www.qtrac.eu/dont-buy-dell.html

From ncoghlan at gmail.com Tue Jun 29 14:12:18 2010 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 29 Jun 2010 22:12:18 +1000 Subject: [Python-ideas] Maybe allow br"" or rb"" e.g., for bytes regexes in Py3? In-Reply-To: <201006290920.56104.mark@qtrac.eu> References: <201006290920.56104.mark@qtrac.eu> Message-ID:

On Tue, Jun 29, 2010 at 6:20 PM, Mark Summerfield wrote:
> FONTNAME_RE = re.compile(br"/FontName\s+/(\S+)")
> # or
> FONTNAME_RE = re.compile(rb"/FontName\s+/(\S+)")
> I _slightly_ prefer rb"" to br"" but either would be great:-)

According to my local build, we already picked 'br':

    Python 3.2a0 (py3k:81943, Jun 12 2010, 22:02:56)
    [GCC 4.4.3] on linux2
    Type "help", "copyright", "credits" or "license" for more information.
    >>> "\t"
    '\t'
    >>> r"\t"
    '\\t'
    >>> b"\t"
    b'\t'
    >>> br"\t"
    b'\\t'

I installed the system python3 to confirm that this isn't new:

    Python 3.1.2 (r312:79147, Apr 15 2010, 15:35:48)
    [GCC 4.4.3] on linux2
    Type "help", "copyright", "credits" or "license" for more information.
    >>> br"\t"
    b'\\t'

Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From mark at qtrac.eu Tue Jun 29 14:34:57 2010 From: mark at qtrac.eu (Mark Summerfield) Date: Tue, 29 Jun 2010 13:34:57 +0100 Subject: [Python-ideas] Maybe allow br"" or rb"" e.g., for bytes regexes in Py3?
In-Reply-To: References: <201006290920.56104.mark@qtrac.eu> Message-ID: <201006291334.57667.mark@qtrac.eu>

You're right, so I've raised it as a doc bug: http://bugs.python.org/issue9114

On 2010-06-29, Nick Coghlan wrote:
> On Tue, Jun 29, 2010 at 6:20 PM, Mark Summerfield wrote:
> > FONTNAME_RE = re.compile(br"/FontName\s+/(\S+)")
> > # or
> > FONTNAME_RE = re.compile(rb"/FontName\s+/(\S+)")
> > I _slightly_ prefer rb"" to br"" but either would be great:-)
>
> According to my local build, we already picked 'br':
>
>     Python 3.2a0 (py3k:81943, Jun 12 2010, 22:02:56)
>     [GCC 4.4.3] on linux2
>     Type "help", "copyright", "credits" or "license" for more information.
>     >>> "\t"
>     '\t'
>     >>> r"\t"
>     '\\t'
>     >>> b"\t"
>     b'\t'
>     >>> br"\t"
>     b'\\t'
>
> I installed the system python3 to confirm that this isn't new:
>
>     Python 3.1.2 (r312:79147, Apr 15 2010, 15:35:48)
>     [GCC 4.4.3] on linux2
>     Type "help", "copyright", "credits" or "license" for more information.
>     >>> br"\t"
>     b'\\t'
>
> Cheers, Nick.

-- Mark Summerfield, Qtrac Ltd, www.qtrac.eu C++, Python, Qt, PyQt - training and consultancy "Advanced Qt Programming" - ISBN 0321635906 http://www.qtrac.eu/aqpbook.html I ordered a Dell netbook with Ubuntu... I got no OS, no apology, no solution, & no refund (so far) http://www.qtrac.eu/dont-buy-dell.html

From stephen at xemacs.org Tue Jun 29 14:34:58 2010 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Tue, 29 Jun 2010 21:34:58 +0900 Subject: [Python-ideas] Maybe allow br"" or rb"" e.g., for bytes regexes in Py3? In-Reply-To: <201006290920.56104.mark@qtrac.eu> References: <201006290920.56104.mark@qtrac.eu> Message-ID: <87zkyeaufh.fsf@uwakimon.sk.tsukuba.ac.jp>

Mark Summerfield writes:
> Python 3 has two string prefixes r"" for raw strings and b"" for bytes.

And you *can* combine them, but it needs to be in the right order (although I'm not sure that's intentional):

    steve at uwakimon ~ $ python3.1
    Python 3.1.2 (release31-maint, May 12 2010, 20:15:06)
    [GCC 4.3.4] on linux2
    Type "help", "copyright", "credits" or "license" for more information.
    >>> rb"a\rc"
      File "<stdin>", line 1
        rb"a\rc"
               ^
    SyntaxError: invalid syntax
    >>> br"abc"
    b'abc'
    >>> br"a\rc"
    b'a\\rc'
    >>>

Watch out for that time machine!

From fuzzyman at gmail.com Tue Jun 29 14:55:40 2010 From: fuzzyman at gmail.com (Michael Foord) Date: Tue, 29 Jun 2010 13:55:40 +0100 Subject: [Python-ideas] [Python-Dev] [ANN]: "newthreading" - an approach to simplified thread usage, and a path to getting rid of the GIL In-Reply-To: References: <4C259A25.1060705@animats.com> <4C2600B4.5020503@voidspace.org.uk> <4C262D37.7020807@animats.com> <20100629014023.057a3d44@pitrou.net> <9BE39202-534C-4969-BD3C-6E94FF062384@gmail.com> Message-ID:

On 29 June 2010 13:05, Nick Coghlan wrote:
> On Tue, Jun 29, 2010 at 9:54 AM, Michael Foord wrote:
> > Full agreement. Ironclad, a project to enable the use of Python C extensions with IronPython - which has a generational moving GC - uses a hybrid approach. It allows C extensions to use reference counting but manipulates the reference count so that it can only drop to zero once there are no references left on the IronPython side. There are complications with this approach, which Ironclad handles, but that would be much easier when we have control over the implementation (Ironclad doesn't change the IronPython implementation).
> > No link I'm afraid, sending from a mobile device.
> > Incidentally, Ironclad also 'fakes' the GIL as IronPython has no GIL.
> > In theory this could cause problems for C extensions that aren't thread safe but that hasn't yet been a problem in production (Ironclad is mainly used with numpy).
>
> How much do you know about Resolver's licensing setup for Ironclad?

http://www.resolversystems.com/products/ironclad/

Ironclad is MIT licensed, but it is *very* tightly coupled to IronPython and .NET (it works primarily through the .NET FFI and uses a fair bit of C#). It may well be useful for inspiration, but I don't know how re-usable it is likely to be.

> Combining Ironclad with a Boehm GC enabled PyMalloc mechanism might be a fruitful avenue of research on the way to a free-threading CPython implementation.

Boehm is a conservative collector, so whilst it may well be a "good first step" it can leak memory like a sieve... Mono has always had this problem and is finally getting rid of its conservative collector. The PyPy guys have experience in this area.

> Losing deterministic refcounting for pure Python code is no longer as big an issue as it once was, as many of the tricks it used to enable are now better covered by context managers.

Right, and *most* of the alternative implementations are not reference counted - so relying on reference counting semantics has been discouraged by users of these platforms for a while now. All the best, Michael

> Cheers, > Nick. > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

-- http://www.voidspace.org.uk

From greg.ewing at canterbury.ac.nz Wed Jun 30 02:50:35 2010 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 30 Jun 2010 12:50:35 +1200 Subject: [Python-ideas] feature to make traceback objects usable without references to frame locals and globals In-Reply-To: References: <4C25B918.8010307@canterbury.ac.nz> <4C25DE8A.1030209@egenix.com> <20100626140031.6adff16e@pitrou.net> <4C2676C0.3000407@egenix.com> <4C26912C.8010709@egenix.com> <4C278847.5040600@egenix.com> <20100627200001.4821ede9@pitrou.net> <4C28840D.5040703@egenix.com> <20100628143924.766c056d@pitrou.net> Message-ID: <4C2A94DB.5070902@canterbury.ac.nz>

Terry Reedy wrote:
> It then seems to me that one should extract the file name and line number info one wants to save before exiting the exception clause and let the exception and traceback go on exit. Is a library function needed to make extraction easier?

That would require building your own custom traceback structure that would be incompatible with any of the standard functions available for formatting and printing tracebacks. -- Greg

From greg.ewing at canterbury.ac.nz Wed Jun 30 03:07:44 2010 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 30 Jun 2010 13:07:44 +1200 Subject: [Python-ideas] Maybe allow br"" or rb"" e.g., for bytes regexes in Py3? In-Reply-To: References: <201006290920.56104.mark@qtrac.eu> Message-ID: <4C2A98E0.2010504@canterbury.ac.nz>

Nick Coghlan wrote:
> According to my local build, we already picked 'br':

Wouldn't "raw bytes" sound better than "bytes raw"? Or do the Dutch say it differently? :-) -- Greg

From guido at python.org Wed Jun 30 03:13:02 2010 From: guido at python.org (Guido van Rossum) Date: Tue, 29 Jun 2010 18:13:02 -0700 Subject: [Python-ideas] Maybe allow br"" or rb"" e.g., for bytes regexes in Py3?
In-Reply-To: <4C2A98E0.2010504@canterbury.ac.nz> References: <201006290920.56104.mark@qtrac.eu> <4C2A98E0.2010504@canterbury.ac.nz> Message-ID:

On Tue, Jun 29, 2010 at 6:07 PM, Greg Ewing wrote:
> Nick Coghlan wrote:
>> According to my local build, we already picked 'br':
> Wouldn't "raw bytes" sound better than "bytes raw"?
> Or do the Dutch say it differently? :-)

I can pronounce "brrrrr" but I can't say "rrrrrb". :-) -- --Guido van Rossum (python.org/~guido)

From python at mrabarnett.plus.com Wed Jun 30 04:04:23 2010 From: python at mrabarnett.plus.com (MRAB) Date: Wed, 30 Jun 2010 03:04:23 +0100 Subject: [Python-ideas] Maybe allow br"" or rb"" e.g., for bytes regexes in Py3? In-Reply-To: References: <201006290920.56104.mark@qtrac.eu> <4C2A98E0.2010504@canterbury.ac.nz> Message-ID: <4C2AA627.8040502@mrabarnett.plus.com>

Guido van Rossum wrote:
> On Tue, Jun 29, 2010 at 6:07 PM, Greg Ewing wrote:
>> Nick Coghlan wrote:
>>> According to my local build, we already picked 'br':
>> Wouldn't "raw bytes" sound better than "bytes raw"?
>> Or do the Dutch say it differently? :-)
> I can pronounce "brrrrr" but I can't say "rrrrrb". :-)

And, of course, Python 2 has 'ur', but not 'ru'.

From tjreedy at udel.edu Wed Jun 30 19:33:27 2010 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 30 Jun 2010 13:33:27 -0400 Subject: [Python-ideas] Maybe allow br"" or rb"" e.g., for bytes regexes in Py3? In-Reply-To: <4C2AA627.8040502@mrabarnett.plus.com> References: <201006290920.56104.mark@qtrac.eu> <4C2A98E0.2010504@canterbury.ac.nz> <4C2AA627.8040502@mrabarnett.plus.com> Message-ID:

On 6/29/2010 10:04 PM, MRAB wrote:
> Guido van Rossum wrote:
>> I can pronounce "brrrrr" but I can't say "rrrrrb". :-)
> And, of course, Python 2 has 'ur', but not 'ru'.

Even though most say or think 'raw unicode' rather than 'unicode raw'. But ur and br strike me as logically correct. In both Py2 and Py3, string literals are str literals. The r prefix disables most of the cooking of the literal. The u and b prefixes are effectively abbreviations for unicode() and bytes() calls on, I presume, the buffer part of a partially formed str object. In other words, br'abc' has much the same effect as bytes(r'abc', 'ascii') but is easier to write and, I presume, faster to compute. It is easy for people who only use ascii chars in Python code to forget that Python 3 code is now actually a sequence of unicode chars rather than of (extended) ascii chars. -- Terry Jan Reedy
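To close the loop on Mark's original use case, a small self-contained demonstration of the br"" prefix doing bytes-regex work on Python 3.1+; the sample PostScript line is made up:

    import re

    # br"..." is raw *and* bytes: the backslashes reach the regex
    # engine untouched, and the pattern matches bytes, not str.
    FONTNAME_RE = re.compile(br"/FontName\s+/(\S+)")

    data = b"/FontName /NimbusSans-Regular def"  # invented sample
    match = FONTNAME_RE.search(data)
    if match:
        print(match.group(1))  # -> b'NimbusSans-Regular'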