From paul at boddie.org.uk Sun Jul 1 02:22:47 2012 From: paul at boddie.org.uk (Paul Boddie) Date: Sun, 1 Jul 2012 02:22:47 +0200 Subject: [Python-Dev] Announcing the python-static-type-checking google group Message-ID: <201207010222.48167.paul@boddie.org.uk> Edward K. Ream wrote: > Hello all, > > GvR has asked me to announce the python-static-type-checking google > group http://groups.google.com/group/python-static-type-checking to > python-dev. > > Consider it announced. Anyone from python-dev who likes may become a > member. Is there any reason why the compiler-sig mailing list wasn't chosen as a venue for such discussions? I know it has "compiler" in the title, but the mandate overlaps significantly with what you intend to discuss. It's obviously your choice where you host discussions and who you invite, and I know that the special interest group mailing lists aren't exactly well advertised these days, what with the lack of agility around updating the Web content that advertises such things (or the lack of visibility of the Wiki content), but I feel that you might have a more productive discussion if you don't insist on Google Groups membership and also allow a wider selection of participants. Please consider this as friendly advice: I too would like to see progress in the area concerned. Paul From raymond.hettinger at gmail.com Sun Jul 1 09:07:56 2012 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Sun, 1 Jul 2012 00:07:56 -0700 Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: <4FEF6B10.9040409@udel.edu> Message-ID: <2E44286F-D3DB-4739-89AF-3A8EAA9D8901@gmail.com> On Jun 30, 2012, at 10:44 PM, Stefan Behnel wrote: >> >> Another addition could be a new subsection on grouping (chunking) that >> would discuss post-processing of grouper (as discussed above), as well as >> other recipes, including ones specific to strings and sequences. It would >> essentially be a short how-to. 
Call it 9.1.3 "Grouping, Blocking, or >> Chunking Sequences and Iterables". The synonyms will help external >> searching. A toc would let people who have found this doc know to look for >> this at the bottom. > > If it really is such an important use case for so many people, I agree that > it's worth special casing it in the docs. It's not a trivial algorithmic > step from a sequential iterable to a grouped iterable. I'm not too keen on adding a section like this to the itertools docs. Instead, I would be open to adding a "further reading" section with external links to interesting iterator writeups in blogs, cookbooks, stack overflow answers, wikis, etc. If one of you wants to craft an elegant blog post on "Grouping, Blocking, or Chunking Sequences and Iterables", I would be happy to link to it. Raymond -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.brandl at gmx.net Sun Jul 1 10:03:00 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Sun, 01 Jul 2012 10:03:00 +0200 Subject: [Python-Dev] cpython: Add syntax highlighter tool In-Reply-To: References: Message-ID: <4FF00434.5020604@gmx.net> On 01.07.2012 01:58, raymond.hettinger wrote: > http://hg.python.org/cpython/rev/da4dd603030b > changeset: 77899:da4dd603030b > user: Raymond Hettinger > date: Sat Jun 30 16:58:06 2012 -0700 > summary: > Add syntax highlighter tool > > files: > Tools/scripts/pycolorize.py | 109 ++++++++++++++++++++++++ > 1 files changed, 109 insertions(+), 0 deletions(-) Uh, this looks quite a lot like a new feature... Since it's in Tools, I'm not going to veto it, just as with the improvements to the gdb helper, but it would have been nice to at least *ask*...
Georg From doko at ubuntu.com Sun Jul 1 10:22:10 2012 From: doko at ubuntu.com (Matthias Klose) Date: Sun, 01 Jul 2012 10:22:10 +0200 Subject: [Python-Dev] cross-compiling patches In-Reply-To: <20120630231742.2c896682@pitrou.net> References: <20120630231742.2c896682@pitrou.net> Message-ID: <4FF008B2.8080807@ubuntu.com> On 30.06.2012 23:17, Antoine Pitrou wrote: > > Hello, > > I think these patches are premature (they break compilation on OS X, > and they break ctypes configure on my Linux box). That was unrelated; it was fixed last night. > Furthermore, they > were committed post-beta, which means they should probably have waited > for after the 3.3 release. So I propose for these commits to be > reverted. Somebody (?) asked on IRC that either Martin or I should update the internal copies, and someone suggested to do the same with the expat sources. And I asked our release manager if he was ok with this post-beta change. > (to be clear, I'm talking about all configure / Makefile / setup.py / > libffi changes since and including > http://hg.python.org/cpython/rev/e6e99d449bdc876fa57111e7e534c44ecbc3bcbd > ) The first cross-build fixes went in in April; please consider these fixes as completing the then-incomplete cross-build fixes. The build issues you did see last night were fixed for the OS X build, and I reverted the update for the compiler search paths for the native build. So please let's keep this cross-build support for 3.3. It's working, tested with an arm-linux-gnueabi build on x86_64-linux-gnu.
Matthias From ncoghlan at gmail.com Sun Jul 1 11:13:47 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 1 Jul 2012 19:13:47 +1000 Subject: [Python-Dev] cross-compiling patches In-Reply-To: <4FF008B2.8080807@ubuntu.com> References: <20120630231742.2c896682@pitrou.net> <4FF008B2.8080807@ubuntu.com> Message-ID: On Sun, Jul 1, 2012 at 6:22 PM, Matthias Klose wrote: > the first cross-build fixes went in in April, please consider these fixes for > the then incomplete cros-build fixes. The build issues you did see last night, > were fixed for the OS X build, and I reverted the update for the compiler search > paths for the native build. So please lets keep this cross-build support for > 3.3. It's working, tested with a arm-linux-gnueabi build on x86_64-linux-gnu. Indeed, especially with the variety of ARM mini-boards coming onto the market at the moment. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From g.brandl at gmx.net Sun Jul 1 11:37:50 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Sun, 01 Jul 2012 11:37:50 +0200 Subject: [Python-Dev] cross-compiling patches In-Reply-To: <4FF008B2.8080807@ubuntu.com> References: <20120630231742.2c896682@pitrou.net> <4FF008B2.8080807@ubuntu.com> Message-ID: On 01.07.2012 10:22, Matthias Klose wrote: > On 30.06.2012 23:17, Antoine Pitrou wrote: >> >> Hello, >> >> I think these patches are premature (they break compilation on OS X, >> and they break ctypes configure on my Linux box). > > that was unrelated. fixed last night. It's also something the buildbots can catch easily. >> Furthermore, they >> were committed post-beta, which means they should probably have waited >> for after the 3.3 release. So I propose for these commits to be >> reverted. > > somebody (?) asked on irc that either Martin or I should update the internal > copies, and someone suggested to do the same with the expat sources. And I asked > our release manager if he was ok with this post beta change. 
> >> (to be clear, I'm talking about all configure / Makefile / setup.py / >> libffi changes since and including >> http://hg.python.org/cpython/rev/e6e99d449bdc876fa57111e7e534c44ecbc3bcbd >> ) > > the first cross-build fixes went in in April, please consider these fixes for > the then incomplete cros-build fixes. The build issues you did see last night, > were fixed for the OS X build, and I reverted the update for the compiler search > paths for the native build. So please lets keep this cross-build support for > 3.3. It's working, tested with a arm-linux-gnueabi build on x86_64-linux-gnu. FWIW, I agree that build fixes for platforms we already support are not a new feature. I am not happy about the amount of changes, and I hope that no such big changes will come until final now... Georg From solipsis at pitrou.net Sun Jul 1 13:43:49 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 1 Jul 2012 13:43:49 +0200 Subject: [Python-Dev] cross-compiling patches References: <20120630231742.2c896682@pitrou.net> <4FF008B2.8080807@ubuntu.com> Message-ID: <20120701134349.5390b738@pitrou.net> On Sun, 01 Jul 2012 11:37:50 +0200 Georg Brandl wrote: > > > > the first cross-build fixes went in in April, please consider these fixes for > > the then incomplete cros-build fixes. The build issues you did see last night, > > were fixed for the OS X build, and I reverted the update for the compiler search > > paths for the native build. So please lets keep this cross-build support for > > 3.3. It's working, tested with a arm-linux-gnueabi build on x86_64-linux-gnu. > > FWIW, I agree that build fixes for platforms we already support are not a new > feature. I am not happy about the amount of changes, and I hope that no such > big changes will come until final now... I don't think we have ever supported cross-compiling officially (which is different from supporting a given platform, e.g. ARM). But fair enough. Regards Antoine. 
From stefan_ml at behnel.de Sun Jul 1 14:01:36 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sun, 01 Jul 2012 14:01:36 +0200 Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: <2E44286F-D3DB-4739-89AF-3A8EAA9D8901@gmail.com> References: <4FEF6B10.9040409@udel.edu> <2E44286F-D3DB-4739-89AF-3A8EAA9D8901@gmail.com> Message-ID: Hi Raymond, Raymond Hettinger, 01.07.2012 09:07: > On Jun 30, 2012, at 10:44 PM, Stefan Behnel wrote: >>> Another addition could be a new subsection on grouping (chunking) that >>> would discuss post-processing of grouper (as discussed above), as well as >>> other recipes, including ones specific to strings and sequences. It would >>> essentially be a short how-to. Call it 9.1.3 "Grouping, Blocking, or >>> Chunking Sequences and Iterables". The synonyms will help external >>> searching. A toc would let people who have found this doc know to look for >>> this at the bottom. >> >> If it really is such an important use case for so many people, I agree that >> it's worth special casing it in the docs. It's not a trivial algorithmic >> step from a sequential iterable to a grouped iterable. > > I'm not too keen on adding a section like this to the itertools docs. I've only just seen that the recipes section is part of the same page since the 2.6 documentation was sphinxified. I had remembered it being on a separate page before. That resolves most of my original concerns. Sorry, should have looked earlier. To address the main problem of users not finding what they need, what about simply extending the docstring of the grouper() function with a sentence like this: "This functionality is also called 'chunking' or 'blocking' and can be used for load distribution and sharding." That would make it easy for users to find what they are looking for when they search the page for "chunk". I find that a much more common and less ambiguous name than "grouping", which reminds me more of "group by". 
It might be a good idea in general to add a short comment on a use case to each recipe where it's not immediately obvious or where there is a use case with a well-known name, simply to aid in text searches over the page. > Instead, I would be open adding "further reading" section with external links > to interesting iterator writeups in blogs, cookbooks, stack overflow answers, wikis, etc. > > If one of you wants to craft an elegant blog post on "Grouping, Blocking, or > Chunking Sequences and Iterables", I would be happy to link to it. That could be done in addition, but it bears the risk of bit-rotting the documentation through links dying, blogs moving or texts changing. Stefan From guido at python.org Sun Jul 1 17:23:57 2012 From: guido at python.org (Guido van Rossum) Date: Sun, 1 Jul 2012 17:23:57 +0200 Subject: [Python-Dev] Announcing the python-static-type-checking google group In-Reply-To: <201207010222.48167.paul@boddie.org.uk> References: <201207010222.48167.paul@boddie.org.uk> Message-ID: Hey Paul, This list is going to be high-volume, super-focused, and run by Edward; so far he's mostly been thinking out loud. I don't think that Edward would be comfortable doing that quite in the same way on the existing compiler list. But by all means sign up for his list! --Guido On Sun, Jul 1, 2012 at 2:22 AM, Paul Boddie wrote: > Edward K. Ream wrote: >> Hello all, >> >> GvR has asked me to announce the python-static-type-checking google >> group http://groups.google.com/group/python-static-type-checking to >> python-dev. >> >> Consider it announced. Anyone from python-dev who likes may become a >> member. > > Is there any reason why the compiler-sig mailing list wasn't chosen as a venue > for such discussions? I know it has "compiler" in the title, but the mandate > overlaps significantly with what you intend to discuss.
> > It's obviously your choice where you host discussions and who you invite, and > I know that the special interest group mailing lists aren't exactly well > advertised these days, what with the lack of agility around updating the Web > content that advertises such things (or the lack of visibility of the Wiki > content), but I feel that you might have a more productive discussion if you > don't insist on Google Groups membership and also allow a wider selection of > participants. > > Please consider this as friendly advice: I too would like to see progress in > the area concerned. > > Paul > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/guido%40python.org -- --Guido van Rossum (python.org/~guido) From stefan_ml at behnel.de Sun Jul 1 17:56:28 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sun, 01 Jul 2012 17:56:28 +0200 Subject: [Python-Dev] Announcing the python-static-type-checking google group In-Reply-To: <201207010222.48167.paul@boddie.org.uk> References: <201207010222.48167.paul@boddie.org.uk> Message-ID: Paul Boddie, 01.07.2012 02:22: > Is there any reason why the compiler-sig mailing list wasn't chosen as a venue Even I didn't know that this list even existed. And looking at the archives now, it's hard to see any relevant discussion in all the spam it received until it apparently died away in (almost) silence a couple of years ago. > It's obviously your choice where you host discussions and who you invite, and > I know that the special interest group mailing lists aren't exactly well > advertised these days True, but many (most?) of them are simply not very well frequented, which reduces the interest in joining them even further. Both SIG mailing lists that I read only receive a mail every so many months, often enough without any reply. 
And almost all of these mails deal with questions that would better be discussed on python-list to leverage the substantially higher number of eyeballs there. I think that's the basic problem: as long as more experts are lurking on python-list than on the dedicated SIG-ML, it's better not to use the SIG-ML for discussions but to go to python-list (or maybe python-ideas or python-dev) straight away. Stefan From paul at boddie.org.uk Sun Jul 1 23:43:45 2012 From: paul at boddie.org.uk (Paul Boddie) Date: Sun, 1 Jul 2012 23:43:45 +0200 Subject: [Python-Dev] Announcing the python-static-type-checking google group Message-ID: <201207012343.45808.paul@boddie.org.uk> Stefan Behnel wrote: > Paul Boddie, 01.07.2012 02:22: > > Is there any reason why the compiler-sig mailing list wasn't chosen as a > > venue > > Even I didn't know that this list even existed. And looking at the archives > now, it's hard to see any relevant discussion in all the spam it received > until it apparently died away in (almost) silence a couple of years ago. Yes, although the mailing lists for special interest groups are advertised on python.org, there is no longer the focus on steering discussion to those lists. And I see that the compiler SIG is "retired", as is the related types SIG: http://www.python.org/community/sigs/retired/ I seem to remember various procedures about SIGs and their retirement, but I don't really recall much discussion of such things recently. Still, the compiler SIG matches the scope of the Google group pretty well: http://www.python.org/community/sigs/retired/compiler-sig/ There's even a link to discussion of some tools you may be familiar with. > > It's obviously your choice where you host discussions and who you invite, > > and I know that the special interest group mailing lists aren't exactly > > well advertised these days > > True, but many (most?) of them are simply not very well frequented, which > reduces the interest in joining them even further. 
Both SIG mailing lists > that I read only receive a mail every so many months, often enough without > any reply. And almost all of these mails deal with questions that would > better be discussed on python-list to leverage the substantially higher > number of eyeballs there. Special interest group lists were always meant to be used as focused channels of communication where people are actively trying to get stuff done. The unfortunate thing is that they aren't as well known as they were. Another unfortunate thing is that getting stuff done of mutual benefit is frequently something that takes second place to whatever other motivations and goals people have, for whatever reason, good or bad. Thus, traffic drops away as people either do other things entirely or promote any related work in other channels instead. > I think that's the basic problem: as long as more experts are lurking on > python-list than on the dedicated SIG-ML, it's better not to use the SIG-ML > for discussions but to go to python-list (or maybe python-ideas or > python-dev) straight away. I think we really have to sort out what python-dev is for, because currently there's a tendency to target the list when any kind of "expert" discussion is required, but there are a number of people who would rather see only CPython-related discussion here. Another matter is that static analysis of Python is a topic that frequently hits the end of the road when one cannot, by definition, analyze Python in its most dynamic form, and when people refuse to accept that such analysis has anything to do with Python in its most pure, undiluted (and most contrived) form. But as I wrote, I still intend to follow the newly created group and see what people have to say.
Paul From solipsis at pitrou.net Sun Jul 1 23:39:59 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 1 Jul 2012 23:39:59 +0200 Subject: [Python-Dev] hg.python.org migrated Message-ID: <20120701233959.745ec8a6@pitrou.net> Hello, hg.python.org has just been migrated to a new (virtual) machine hosted by OSU OSL (*). Until the domain name fully propagates, you won't be able to push or pull from the repositories using the ssh protocol. If you notice other issues, don't hesitate to mention them. (*) http://osuosl.org/ Regards Antoine. From roundup-admin at psf.upfronthosting.co.za Mon Jul 2 20:36:31 2012 From: roundup-admin at psf.upfronthosting.co.za (Python tracker) Date: Mon, 02 Jul 2012 18:36:31 +0000 Subject: [Python-Dev] Failed issue tracker submission Message-ID: <20120702183631.DFC031CC82@psf.upfronthosting.co.za> There was a problem with the message you sent: This issue can't be closed until issue 15031 is closed. Mail Gateway Help ================= Incoming messages are examined for multiple parts: . In a multipart/mixed message or part, each subpart is extracted and examined. The text/plain subparts are assembled to form the textual body of the message, to be stored in the file associated with a "msg" class node. Any parts of other types are each stored in separate files and given "file" class nodes that are linked to the "msg" node. . In a multipart/alternative message or part, we look for a text/plain subpart and ignore the other parts. . A message/rfc822 is treated similarly to multipart/mixed (except for special handling of the first text part) if unpack_rfc822 is set in the mailgw config section. Summary ------- The "summary" property on message nodes is taken from the first non-quoting section in the message body. The message body is divided into sections by blank lines. Sections where the second and all subsequent lines begin with a ">" or "|" character are considered "quoting sections".
The first line of the first non-quoting section becomes the summary of the message. Addresses --------- All of the addresses in the To: and Cc: headers of the incoming message are looked up among the user nodes, and the corresponding users are placed in the "recipients" property on the new "msg" node. The address in the From: header similarly determines the "author" property of the new "msg" node. The default handling for addresses that don't have corresponding users is to create new users with no passwords and a username equal to the address. (The web interface does not permit logins for users with no passwords.) If we prefer to reject mail from outside sources, we can simply register an auditor on the "user" class that prevents the creation of user nodes with no passwords. Actions ------- The subject line of the incoming message is examined to determine whether the message is an attempt to create a new item or to discuss an existing item. A designator enclosed in square brackets is sought as the first thing on the subject line (after skipping any "Fwd:" or "Re:" prefixes). If an item designator (class name and id number) is found there, the newly created "msg" node is added to the "messages" property for that item, and any new "file" nodes are added to the "files" property for the item. If just an item class name is found there, we attempt to create a new item of that class with its "messages" property initialized to contain the new "msg" node and its "files" property initialized to contain any new "file" nodes. Triggers -------- Both cases may trigger detectors (in the first case we are calling the set() method to add the message to the item's spool; in the second case we are calling the create() method to create a new node). If an auditor raises an exception, the original message is bounced back to the sender with the explanatory message given in the exception. 
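The subject-line handling described above (skip "Re:"/"Fwd:" prefixes, then look for a bracketed designator made of a class name and an optional id number) can be sketched in a few lines of Python. This is an illustration only, not Roundup's actual implementation; the helper name parse_subject and the exact regular expressions are assumptions.

```python
import re

# Hypothetical re-implementation of the designator rule described above:
# a "[class123]" or bare "[class]" token at the start of the subject line.
DESIGNATOR = re.compile(r'\[(?P<classname>[a-z]+)(?P<nodeid>\d+)?\]')

def parse_subject(subject):
    # Strip any number of leading "Re:" / "Fwd:" prefixes.
    while True:
        stripped = re.sub(r'^\s*(re|fwd)\s*:\s*', '', subject,
                          flags=re.IGNORECASE)
        if stripped == subject:
            break
        subject = stripped
    m = DESIGNATOR.match(subject)
    if not m:
        return None  # no designator: would be handled as a new-item attempt
    # nodeid is None for a bare class name, meaning "create a new item"
    return m.group('classname'), m.group('nodeid')

print(parse_subject('Re: Fwd: [issue15030] status update'))  # ('issue', '15030')
print(parse_subject('[msg] a brand new message'))            # ('msg', None)
```

With a full designator the message is spooled onto the existing item; with a bare class name a new item is created, matching the two cases described above.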
$Id: mailgw.py,v 1.196 2008-07-23 03:04:44 richard Exp $ From solipsis at pitrou.net Mon Jul 2 20:51:35 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 2 Jul 2012 20:51:35 +0200 Subject: [Python-Dev] New stable buildbots
Message-ID: <20120702205135.1d9607c4@pitrou.net> Hello, I have decided to add two new buildslaves to the stable buildbots fleet: - ?ukasz Langa's AMD64 OS Lion buildbot (using clang as compiler) - Jeremy Kloth's AMD64 Windows7 buildbot (our first 64-bit Windows buildbot!) They bring the number of stable buildbots to twelve: 4 Windows, 5 Linux and 3 other Unices: http://buildbot.python.org/all/waterfall?category=3.x.stable Regards Antoine. From tjreedy at udel.edu Mon Jul 2 21:14:38 2012 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 02 Jul 2012 15:14:38 -0400 Subject: [Python-Dev] New stable buildbots In-Reply-To: <20120702205135.1d9607c4@pitrou.net> References: <20120702205135.1d9607c4@pitrou.net> Message-ID: On 7/2/2012 2:51 PM, Antoine Pitrou wrote: > > Hello, > > I have decided to add two new buildslaves to the stable buildbots fleet: > - ?ukasz Langa's AMD64 OS Lion buildbot (using clang as compiler) > - Jeremy Kloth's AMD64 Windows7 buildbot (our first 64-bit Windows > buildbot!) Great. > They bring the number of stable buildbots to twelve: 4 Windows, and all seems to have failed compiling a1c8302e6b27 after succeeding. Is that from the repository migration? > http://buildbot.python.org/all/waterfall?category=3.x.stable -- Terry Jan Reedy From solipsis at pitrou.net Mon Jul 2 21:54:53 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 2 Jul 2012 21:54:53 +0200 Subject: [Python-Dev] New stable buildbots References: <20120702205135.1d9607c4@pitrou.net> Message-ID: <20120702215453.07aedcf8@pitrou.net> On Mon, 02 Jul 2012 15:14:38 -0400 Terry Reedy wrote: > On 7/2/2012 2:51 PM, Antoine Pitrou wrote: > > > > Hello, > > > > I have decided to add two new buildslaves to the stable buildbots fleet: > > - ?ukasz Langa's AMD64 OS Lion buildbot (using clang as compiler) > > - Jeremy Kloth's AMD64 Windows7 buildbot (our first 64-bit Windows > > buildbot!) > > Great. 
> > > They bring the number of stable buildbots to twelve: 4 Windows, > > and all seems to have failed compiling a1c8302e6b27 after succeeding. > Is that from the repository migration? No, it's from the buildmaster restart. However, David Bolen's Windows 7 buildbot seems to show weird and persistent compile errors unrelated to Windows' file locking: ..\PC\python_nt.rc(35): error RC2104: undefined keyword or key name: MS_DLL_ID [D:\cygwin\home\db3l\buildarea\3.x.bolen-windows7\build\PCbuild\pythoncore.vcxproj] ..\PC\pylauncher.rc(16): error RC2104: undefined keyword or key name: FIELD3 [D:\cygwin\home\db3l\buildarea\3.x.bolen-windows7\build\PCbuild\pylauncher.vcxproj] ..\PC\pylauncher.rc(16): error RC2104: undefined keyword or key name: FIELD3 [D:\cygwin\home\db3l\buildarea\3.x.bolen-windows7\build\PCbuild\pywlauncher.vcxproj] Regards Antoine. From solipsis at pitrou.net Mon Jul 2 23:20:27 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 2 Jul 2012 23:20:27 +0200 Subject: [Python-Dev] cpython: Issue #15210: If _frozen_importlib is not found in sys.modules by References: <3WQyK70dd5zPCD@mail.python.org> Message-ID: <20120702232027.5be1f0e2@pitrou.net> On Mon, 2 Jul 2012 20:53:15 +0200 (CEST) brett.cannon wrote: > http://hg.python.org/cpython/rev/818db871d29a > changeset: 77916:818db871d29a > user: Brett Cannon > date: Mon Jul 02 14:53:10 2012 -0400 > summary: > Issue #15210: If _frozen_importlib is not found in sys.modules by > importlib.__init__, then catch the KeyError raised, not ImportError. There's a test failure here: http://buildbot.python.org/all/builders/AMD64%20Fedora%20without%20threads%203.x/builds/2774/steps/test/logs/stdio Regards Antoine. 
From raymond.hettinger at gmail.com Tue Jul 3 06:23:29 2012 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Mon, 2 Jul 2012 21:23:29 -0700 Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: <4FEF6B10.9040409@udel.edu> <2E44286F-D3DB-4739-89AF-3A8EAA9D8901@gmail.com> Message-ID: <877C3C8D-1DE3-4DB0-812A-B352E8608364@gmail.com> On Jul 1, 2012, at 5:01 AM, Stefan Behnel wrote: > To address the main problem of users not finding what they need, what about > simply extending the docstring of the grouper() Here's a small change to the docstring: http://hg.python.org/cpython/rev/d32f21d87363 FWIW, if you're interested in load balancing applications, George Sakkis's itertools recipe for roundrobin() may be of interest. Another interesting iterator technique that is not well known is the two-argument form of iter(), which is a marvel for transforming callables into iterators: for block in iter(partial(f.read, 1024), ''): ... for diceroll in iter(partial(randrange, 1, 7), 4): ... Raymond -------------- next part -------------- An HTML attachment was scrubbed... URL: From techtonik at gmail.com Wed Jul 4 10:42:38 2012 From: techtonik at gmail.com (anatoly techtonik) Date: Wed, 4 Jul 2012 11:42:38 +0300 Subject: [Python-Dev] GitHub mirror (Was: Bitbucket mirror?) Message-ID: On Fri, Jun 29, 2012 at 6:58 AM, Eli Bendersky wrote: > > The devguide (http://docs.python.org/devguide/committing.html) says: > > Bitbucket also maintain an up to date clone of the main cpython repository > that can be used as the basis for a new clone or patch queue. > > [the link goes to https://bitbucket.org/mirror/cpython/overview] This one is better than Atlassian's (looks nice, shorter URL) and worked long before Atlassian's promotion as "makers of Bitbucket", which is dead wrong BTW, because everybody knows they bought BB. I don't remember if I commented on the Python Insider or if it was censored. Whatever..
it hurts Atlassian, not me. On the subject. Is there a mirror of CPython on GitHub? I abandoned my efforts to create an online editor for docs, which should have been web-frontend independent, maintainable and clean. It appeared much easier to just use GitHub - so far it is the only one of the open source code hosting services that directly commits online changes to the repository (and allows anonymous users to do this). I've made more than a dozen proposals for fixing docs, because as a matter of fact - filing a bug AND explaining why docs are wrong, why they need to be fixed, what should be added - all of this is *much easier* (and less time consuming!) than just fixing them. Unfortunately. From ralf at systemexit.de Wed Jul 4 11:51:25 2012 From: ralf at systemexit.de (Ralf Schmitt) Date: Wed, 04 Jul 2012 11:51:25 +0200 Subject: [Python-Dev] GitHub mirror In-Reply-To: (anatoly techtonik's message of "Wed, 4 Jul 2012 11:42:38 +0300") References: Message-ID: <8762a326si.fsf@winserver.brainbot.com> anatoly techtonik writes: > On the subject. Is there a mirror of CPython on GitHub? https://github.com/schmir/python -- Cheers Ralf From techtonik at gmail.com Wed Jul 4 11:57:20 2012 From: techtonik at gmail.com (anatoly techtonik) Date: Wed, 4 Jul 2012 12:57:20 +0300 Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: Message-ID: On Fri, Jun 29, 2012 at 11:32 PM, Georg Brandl wrote: > On 26.06.2012 10:03, anatoly techtonik wrote: >> >> Now that Python 3 is all about iterators (which is a user killer >> feature for Python according to StackOverflow - >> http://stackoverflow.com/questions/tagged/python) would it be nice to
>> >> itertools.chunks(iterable, size, fill=None) >> >> Which is the 33th most voted Python question on SO - >> >> http://stackoverflow.com/questions/312443/how-do-you-split-a-list-into-evenly-sized-chunks-in-python/312464 >> >> P.S. CC'ing to python-dev@ to notify about the thread in python-ideas. >> > > Anatoly, so far there were no negative votes -- would you care to go > another step and propose a patch? Was about to say "no problem", but in fact - there is. Sorry from whining from my side and thanks for nudging. The only thought that a simple task of copy/pasting relevant code from http://docs.python.org/library/itertools.html?highlight=itertools#recipes will require a few hours waiting of download (still not everybody has a high-speed internet) makes me switch to other less time consuming tasks before getting around to it. These tasks become more important in a few hours, and basically I've passed through this many times before. It then becomes quite hard to switch back. I absolutely don't mind someone else being credited for the idea, because ideas usually worthless without implementation. It will be interesting to design how the process could work in a separate thread. For now the best thing I can do (I don't risk even to mention anything with 3.3) is to copy/paste code from the docs here: from itertools import izip_longest def chunks(iterable, size, fill=None): """Split an iterable into blocks of fixed-length""" # chunks('ABCDEFG', 3, 'x') --> ABC DEF Gxx args = [iter(iterable)] * size return izip_longest(fillvalue=fill, *args) BTW, this doesn't work as expected (at least for strings). Expected is: chunks('ABCDEFG', 3, 'x') --> 'ABC' 'DEF' 'Gxx' got: chunks('ABCDEFG', 3, 'x') --> ('A' 'B' 'C') ('D' 'E' 'F') ('G' 'x' 'x') Needs more round tuits definitely. 
From stefan_ml at behnel.de Wed Jul 4 14:37:12 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 04 Jul 2012 14:37:12 +0200 Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: Message-ID: anatoly techtonik, 04.07.2012 11:57: > On Fri, Jun 29, 2012 at 11:32 PM, Georg Brandl wrote: >> On 26.06.2012 10:03, anatoly techtonik wrote: >>> Now that Python 3 is all about iterators (which is a user killer >>> feature for Python according to StackOverflow - >>> http://stackoverflow.com/questions/tagged/python) would it be nice to >>> introduce more first class functions to work with them? One function >>> to be exact to split string into chunks. >>> >>> itertools.chunks(iterable, size, fill=None) >>> >>> Which is the 33th most voted Python question on SO - >>> >>> http://stackoverflow.com/questions/312443/how-do-you-split-a-list-into-evenly-sized-chunks-in-python/312464 >>> >>> P.S. CC'ing to python-dev@ to notify about the thread in python-ideas. >>> >> >> Anatoly, so far there were no negative votes -- would you care to go >> another step and propose a patch? > > Was about to say "no problem", but in fact - there is. Sorry from > whining from my side and thanks for nudging. The only thought that a > simple task of copy/pasting relevant code from > http://docs.python.org/library/itertools.html?highlight=itertools#recipes > will require a few hours waiting of download (still not everybody has > a high-speed internet) makes me switch to other less time consuming > tasks before getting around to it. These tasks become more important > in a few hours, and basically I've passed through this many times > before. It then becomes quite hard to switch back. > > I absolutely don't mind someone else being credited for the idea, > because ideas usually worthless without implementation. It will be > interesting to design how the process could work in a separate thread. 
> For now the best thing I can do (I don't risk even to mention anything
> with 3.3) is to copy/paste code from the docs here:
>
> from itertools import izip_longest
> def chunks(iterable, size, fill=None):
>     """Split an iterable into blocks of fixed-length"""
>     # chunks('ABCDEFG', 3, 'x') --> ABC DEF Gxx
>     args = [iter(iterable)] * size
>     return izip_longest(fillvalue=fill, *args)

I think Raymond's change fixes this issue quite nicely, no need to
touch the module code.

Stefan

From larry at hastings.org Wed Jul 4 18:46:21 2012
From: larry at hastings.org (Larry Hastings)
Date: Wed, 04 Jul 2012 18:46:21 +0200
Subject: [Python-Dev] EuroPython 2012 Language Summit Is In JEOPARDY *gasp*
Message-ID: <4FF4735D.5020801@hastings.org>

So far I've had exactly four reservations for the Language Summit at
EuroPython 2012. One of them is Guido--but he's threatening to skip it
if we don't get more people and just go to the sprints.

Also, honestly I have next-to-nothing on the docket. At this point
we've just hit feature freeze for Python 3.3--or at least that's what
people tell me. In general, anyone who wanted to get a language change
in has already either just succeeded or just failed. So after a flurry
of activity I feel like we've hit a quiet period. I mean, the trunk
for 3.4 won't even be open for a month or two.

So: if you're a core developer, and you're interested in attending the
Language Summit in Florence on Saturday, please email me your RSVP.
Also, if you have suggestions for things we should discuss, send those
along too. Please send all these emails directly to me, *off-list*,
tonight or Thursday. I'll send another email on Friday morning
decreeing the fate of the Language Summit.

Thanks!

//arry/

p.s. I'm assured the room we'd have for the Language Summit has
excellent air conditioning.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From tjreedy at udel.edu Wed Jul 4 20:31:19 2012
From: tjreedy at udel.edu (Terry Reedy)
Date: Wed, 04 Jul 2012 14:31:19 -0400
Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None)
In-Reply-To: 
References: 
Message-ID: 

On 7/4/2012 5:57 AM, anatoly techtonik wrote:
> On Fri, Jun 29, 2012 at 11:32 PM, Georg Brandl wrote:

>> Anatoly, so far there were no negative votes -- would you care to go
>> another step and propose a patch?
>
> Was about to say "no problem",

Did you read that there *are* strong negative votes? And that this idea
has been rejected before? I summarized the objections in my two
responses and pointed to the tracker issues. One of the objections is
that there are 4 different things one might want if the sequence length
is not an even multiple of the chunk size. Your original 'idea' did not
specify.

> For now the best thing I can do (I don't risk even to mention anything
> with 3.3) is to copy/paste code from the docs here:
>
> from itertools import izip_longest
> def chunks(iterable, size, fill=None):
>     """Split an iterable into blocks of fixed-length"""
>     # chunks('ABCDEFG', 3, 'x') --> ABC DEF Gxx
>     args = [iter(iterable)] * size
>     return izip_longest(fillvalue=fill, *args)

Python ideas is about Python 3 ideas. Please post Python 3 code.

This is actually a one-liner:

return zip_longest(*[iter(iterable)]*size, fillvalue=fill)

We don't generally add such to the stdlib.

> BTW, this doesn't work as expected (at least for strings). Expected is:
> chunks('ABCDEFG', 3, 'x') --> 'ABC' 'DEF' 'Gxx'
> got:
> chunks('ABCDEFG', 3, 'x') --> ('A', 'B', 'C') ('D', 'E', 'F') ('G', 'x', 'x')

One of the problems with the idea of 'add a chunker' is that there are
at least a dozen variants that different people want. I discussed the
return-type issue in my responses. I showed how to get the 'expected'
response above using grouper, but also suggested that it is the wrong
basis for splitting strings.
Repeated slicing makes more sense for concrete sequence types.

def seqchunk_odd(s, size):
    # include the odd-size leftover
    for i in range(0, len(s), size):
        yield s[i:i+size]

print(list(seqchunk_odd('ABCDEFG', 3)))
# ['ABC', 'DEF', 'G']

def seqchunk_even(s, size):
    # only include even (full-size) chunks
    for i in range(0, size*(len(s)//size), size):
        yield s[i:i+size]

print(list(seqchunk_even('ABCDEFG', 3)))
# ['ABC', 'DEF']

def strchunk_fill(s, size, fill):
    # fill the odd chunk
    q, r = divmod(len(s), size)
    even = size * q
    for i in range(0, even, size):
        yield s[i:i+size]
    if r:
        yield s[even:] + fill * (size - r)

print(list(strchunk_fill('ABCDEFG', 3, 'x')))
# ['ABC', 'DEF', 'Gxx']

Because the 'fill' value is necessarily a sequence for strings,
strchunk_fill would only work for lists and tuples if the fill value
were either required to be given as a tuple or list of length 1 or if
it were internally converted inside the function. Skipping that for
now.

Having written the fill version based on the even version, it is easy
to select among the three behaviors by modifying the fill version.

def strchunk(s, size, fill=NotImplemented):
    # fill the odd chunk
    q, r = divmod(len(s), size)
    even = size * q
    for i in range(0, even, size):
        yield s[i:i+size]
    if r and fill is not NotImplemented:
        yield s[even:] + fill * (size - r)

print(*strchunk('ABCDEFG', 3))
print(*strchunk('ABCDEFG', 3, ''))
print(*strchunk('ABCDEFG', 3, 'x'))
#
ABC DEF
ABC DEF G
ABC DEF Gxx

I already described how something similar could be done by checking
each grouper output tuple for a fill value, but that requires that the
fill value be a sentinel that could not otherwise appear in the tuple.
One could modify grouper to fill with a private object() and check the
last item of each group for that sentinel and act accordingly (delete,
truncate, or replace). A generic API needs some thought, though.
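The sentinel approach described in the last paragraph could be sketched
like this. The function name and the `odd` parameter are invented here
purely for illustration; this is not a stdlib API:

```python
from itertools import zip_longest

_SENTINEL = object()  # private fill value that cannot appear in the data

def grouper2(iterable, size, odd='truncate', fill=None):
    """Yield tuples of length `size`; `odd` controls the last partial group:
    'delete' drops it, 'truncate' yields it short, 'fill' pads it with `fill`.
    (A sketch of the approach described above, not a stdlib function.)"""
    args = [iter(iterable)] * size
    for group in zip_longest(*args, fillvalue=_SENTINEL):
        if group[-1] is _SENTINEL:
            if odd == 'delete':
                return
            # strip the private sentinel off the tail of the last group
            group = tuple(x for x in group if x is not _SENTINEL)
            if odd == 'fill':
                group += (fill,) * (size - len(group))
        yield group

print(list(grouper2('ABCDEFG', 3)))               # [('A','B','C'), ('D','E','F'), ('G',)]
print(list(grouper2('ABCDEFG', 3, 'delete')))     # [('A','B','C'), ('D','E','F')]
print(list(grouper2('ABCDEFG', 3, 'fill', 'x')))  # [('A','B','C'), ('D','E','F'), ('G','x','x')]
```

A delayed ValueError for the exception-wanting camp could be raised in
the same branch instead of truncating.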
---
An issue I did not previously mention is that people sometimes want
overlapping chunks rather than contiguous disjoint chunks. The slice
approach trivially adapts to that.

def seqlap(s, size):
    for i in range(len(s)-size+1):
        yield s[i:i+size]

print(*seqlap('ABCDEFG', 3))
# ABC BCD CDE DEF EFG

A sliding window for a generic iterable requires a deque or ring buffer
approach that is quite different from the zip-longest -- grouper
approach.

-- 
Terry Jan Reedy

From petri at digip.org Thu Jul 5 14:49:52 2012
From: petri at digip.org (Petri Lehtinen)
Date: Thu, 5 Jul 2012 15:49:52 +0300
Subject: [Python-Dev] GitHub mirror (Was: Bitbucket mirror?)
In-Reply-To: 
References: 
Message-ID: <20120705124952.GA17866@chang>

anatoly techtonik wrote:
> On the subject. Is there a mirror of CPython on GitHub?

https://github.com/akheron/cpython

> changes to repository (and allows anonymous to do this). I've made
> more than a dozen proposal for fixing docs, because as a matter of
> fact - filling a bug AND explaining why docs are wrong, why they need
> to be fixed, what should be added - all of this is a way *much easier*
> (and less time consuming!) than just fixing them. Unfortunately.

You won't get any changes into CPython by creating pull requests. We
use http://bugs.python.org/ for that, sorry.

From techtonik at gmail.com Thu Jul 5 15:36:24 2012
From: techtonik at gmail.com (anatoly techtonik)
Date: Thu, 5 Jul 2012 16:36:24 +0300
Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None)
In-Reply-To: <4FEF6B10.9040409@udel.edu>
References: <4FEF6B10.9040409@udel.edu>
Message-ID: 

Before anything else I must apologize for significant lags in my
replies. I cannot read all of them and hold them in my head, so I
reply one by one as it goes, trying not to miss a single point out
there. It would be much easier to do this in a unified interface for
threaded discussions, but for now there are no capabilities for that
in either Mailman or GMail.
And when it turns out that the amount of text is too big, I spend a
lot of time trying to squeeze it down, and then it becomes pointless
to send it at all. Now back on the topic:

On Sun, Jul 1, 2012 at 12:09 AM, Terry Reedy wrote:
> On 6/29/2012 4:32 PM, Georg Brandl wrote:
>>
>> On 26.06.2012 10:03, anatoly techtonik wrote:
>>>
>>> Now that Python 3 is all about iterators (which is a user killer
>>> feature for Python according to StackOverflow -
>>> http://stackoverflow.com/questions/tagged/python) would it be nice to
>>> introduce more first class functions to work with them? One function
>>> to be exact to split string into chunks.
>
> Nothing special about strings.

It seemed so, but it just appeared that the grouper recipe didn't work
for me.

>>> itertools.chunks(iterable, size, fill=None)
>
> This is a renaming of itertools.grouper in 9.1.2. Itertools Recipes. You
> should have mentioned this. I think of 'blocks' rather than 'chunks', but I
> notice several SO questions with 'chunk(s)' in the title.

I guess `block` gives too low a signal/noise ratio in search results.
That's probably why it is also called chunks in other languages, where
`block` stands for something else (I speak of Ruby blocks).

>>> Which is the 33th most voted Python question on SO -
>>>
>>> http://stackoverflow.com/questions/312443/how-do-you-split-a-list-into-evenly-sized-chunks-in-python/312464
>
> I am curious how you get that number. I do note that there are about 15
> other Python SO questions that seem to be variations on the theme. There
> might be more if 'blocks' and 'groups' were searched for.

It's easy:
1. Go to http://stackoverflow.com/
2. Search [python]
3. Click the `votes` tab
4. Choose `30 per page` at the bottom
5. Jump to the second page, there it is, 4th from the top:
http://stackoverflow.com/questions/tagged/python?page=2&sort=votes&pagesize=30

As for duplicates - feel free to mark them as such. SO allows
everybody to do this (unlike Roundup).
>> Anatoly, so far there were no negative votes -- would you care to go
>> another step and propose a patch?
>
> That is because Raymond H. is not reading either list right now ;-)
> Hence the Cc:. Also because I did not yet respond to a vague, very
> incomplete idea.
>
> From Raymond's first message on http://bugs.python.org/issue6021 , add
> grouper:
>
> "This has been rejected before.

I see such arguments quite often, and I cannot repeat often enough
that these are not arguments. It is good to know, but when people use
that as a reason to close tickets - that's just disgusting. To
Raymond's honor, he cares to explain.

> * It is not a fundamental itertool primitive. The recipes section in
> the docs shows a clean, fast implementation derived from zip_longest().

What is the definition of a 'fundamental primitive'? To me the fact
that the top answer for chunking strings on SO has 2+ times more votes
than the itertools versions is a clear 5 sigma indicator that
something is wrong with this Standard Model without a chunks boson.

> * There is some debate on a correct API for odd lengths. Some people
> want an exception, some want fill-in values, some want truncation, and
> some want a partially filled-in tuple. That alone is reason enough not
> to set one behavior in stone.

use case 3.1: odd lengths exception (CHOOSE ONE)
  1. I see that no itertools function throws exceptions; check manually:
     len(iterable) / float(size) == len(iterable) // float(size)
  2. Explicitly -
     - itertools.chunks(iterable, size, fill=None)
     + itertools.chunks(iterable, size, fill=None, exception=False)

use case 3.2: fill-in value. It is here (SOLVED)

use case 3.3: truncation. No itertools supports truncation; do it
manually:
     chunks(iter, size)[:len(iter)//size]

use case 4: partially filled-in tuple. What should be there?
     >>> chunks('ABCDEFG', 3, 'x')
     >>> |

More replies and workarounds to some of the raised points are below.

> * There is an issue with having too many itertools.
> The module taken as a whole becomes more difficult to use as new
> tools are added."

There can be only two reasons for that:
* the chosen basis is bad (many functions that are rarely used or
  easily emulated)
* the basis is good, but insufficient, because the universe of
  iterators is more complicated than we think

> This is not to say that the question should not be re-considered. Given the
> StackOverflow experience in addition to that of the tracker and python-list
> (and maybe python-ideas), a special exception might be made in relation to
> points 1 and 3.

--[offtopic about Python enhancements / proposals feedback]--
Yes, without SO I probably wouldn't have triggered this at all.
Because the tracker doesn't help with raising importance - there are
no votes, no feature proposals, no "stars". And what I "like" the most
is that very "nice" resolution status - "committed/rejected" - which
doesn't say anything at all. Python list? I try not to disrupt the
frequency there. Python ideas? Too low a participation level for
gathering signals. There are many people that read and support, but
don't want to reply (they don't want to stand out, or are just lazy).
There are many outside who don't want to be subscribed at all. There
are 2000+ people spending time on Python conferences all over the
world each year, yet we see only a couple of reactions for every
Python idea here. Quite often there are mistakes and omissions that
would be nice to correct, and you can't. So StackOverflow really helps
here, but it is a Q&A tool, which is still much better than mailing
lists that are solely for chatting, brainstorming and all the crazy
reading / writing stuff. They don't help to develop ideas
collaboratively. Quite often I am just lost in the amount of text to
handle.
--[/offtopic]--

> In regard to point 2: many 'proposals', including Anatoly's, neglect this
> detail. But the function has to do *something* when seqlen % grouplen != 0.
> So an 'idea' is not really a concrete programmable proposal until
> 'something' is specified.
>
> Exception -- not possible for an itertool until the end of the iteration
> (see below). To raise immediately for sequences, one could wrap grouper.
>
> def exactgrouper(sequence, k):  # untested
>     if len(sequence) % k:
>         raise ValueError('Sequence length {} must be a multiple of '
>                          'group length {}'.format(len(sequence), k))
>     else:
>         return itertools.grouper(sequence, k)

Right. An iterator is not a sequence, because it doesn't know the
length of its sequence. The method should not belong to itertools at
all then. Python 3 has definitely become more complicated. I'd prefer
to stay separated from the iterator stuff, but it seems harder with
every iteration.

> Of course, sequences can also be directly sequentially sliced (but should
> the result be an iterable or sequence of blocks?). But we do not have a
> seqtools module and I do not think there should be another method added to
> the seq protocol.

I'd expect strings chunked into strings and lists into lists. Don't
want to know anything about protocols.

> Fill -- grouper always does this, with a default of None.
>
> Truncate, Remainder -- grouper (zip_longest) cannot directly do this and no
> recipes are given in the itertools docs. (More could be, see below.)
>
> Discussions on python-list give various implementations either for
> sequences or iterables. For the latter, one approach is "it =
> iter(iterable)" followed by repeated islice of the first n items. Another is
> to use a sentinel for the 'fill' to detect a final incomplete block (tuple
> for grouper).
>
> def grouper_x(n, iterable):  # untested
>     sentinel = object()
>     for g in grouper(n, iterable, sentinel):
>         if g[-1] != sentinel:
>             yield g
>         else:
>             # pass to truncate
>             # yield g[:g.index(sentinel)] for remainder
>             # raise ValueError for delayed exception

We need a simple function to split a sequence into chunks(). Now we
face the problem of applying that technique to a sequence of infinite
length, when the last element of the infinite sequence is encountered.
You might be thinking now that this is a reduction to absurdity. But
I'd say it is an exit from the trap. Mathematically this problem can't
be solved. I am not ignoring your solution - I think it's quite
feasible, but isn't it an overcomplication? I mean the 160 people who
upvoted the answer are pretty happy with one that just outputs the
last chunk as-is:
http://stackoverflow.com/questions/312443/how-do-you-split-a-list-into-evenly-sized-chunks-in-python

chunks('ABCDEFG', 3) --> 'ABC' 'DEF' 'G'

And it is quite a nice solution to me, because you're free to do
anything you'd like if you expect your data to be of odd length:

for chunk in chunks('ABCDEFG', size):
    if len(chunk) < size:
        raise Tail  # Tail being an exception of your own choosing

You can make a helper iterator out of it too.

> ---
> The above discussion of point 2 touches on point 4, which Raymond neglected
> in the particular message above but which has come up before: What are the
> allowed input and output types? An idea is not a programmable proposal until
> the domain, range, and mapping are specified.

Domain? Mapping? I am not ignoring existing knowledge and experience.
I just don't want to complicate things, and I don't see an appropriate
`import usecase` in the current context, so I won't try to guess what
this means.

in string -> out list of strings
in list -> out list of lists

> Possible inputs are a specific sequence (string, for instance), any
> sequence, any iterable. Possible outputs are a sequence or iterator of
> sequence or iterator. The various python-list and stackoverflow
> questions ask for various combinations. zip_longest and hence grouper takes
> any iterable and returns an iterator of tuples. (An iterator of maps might
> be more useful as a building block.) This is not what one usually wants with
> string input, for instance, nor with range input. To illustrate:

All right. Got it. Sequences have a length and can be sliced with
[i:j]; an iterator can't be sliced (and hence no chunks can be made).
So this function doesn't belong to itertools - it is a missing string
or sequence method. We can't have a chunk from an iterator, because an
iterator over a string decomposes it into a group of pieces with no
reverse function. We can have a group and then join the group into
something. But this requires knowledge of the appropriate join()
function for the iterator, and is probably not efficient. As there is
no such function (it must be that Mapping you referenced above) - the
recomposition into chunks is impossible.

> import itertools as it
>
> def grouper(n, iterable, fillvalue=None):
>     "grouper(3, 'ABCDEFG', 'x') --> ABC DEF Gxx"
>     args = [iter(iterable)] * n
>     return it.zip_longest(*args, fillvalue=fillvalue)
>
> print(*(grouper(3, 'ABCDEFG', 'x')))  # probably not wanted
> print(*(''.join(g) for g in grouper(3, 'ABCDEFG', 'x')))
> #
> ('A', 'B', 'C') ('D', 'E', 'F') ('G', 'x', 'x')
> ABC DEF Gxx
>
> --
> What to do? One could easily write 20 different functions. So more thought
> is needed before adding anything. -1 on the idea as is.

I've learned a new English name for this type of argument - "straw
man" (I used to call this "hijacking"). This -1 doesn't belong to the
original idea. It belongs to a proposal of itertools.chunks() with a
long list of the above points and completely different user stories
(i.e. not "split string into chunks"). I hope you are still +1,
together with the 160 people on SO who think Python needs an easy way
to chunk sequences.

> For the doc, I think it would be helpful here and in most module subchapters
> if there were a subchapter table of contents at the top (under 9.1 in this
> case). Even though just 2 lines here (currently, but see below), it would
> let people know that there *is* a recipes section. After the appropriate
> tables, mention that there are example uses in the recipe section. Possibly
> add similar tables in the recipe section.

Unfortunately, it appeared that grouper() is not chunks().
Given a string as input, it delivers a list of tuples of chars instead
of a list of chunks.

> Another addition could be a new subsection on grouping (chunking) that would
> discuss post-processing of grouper (as discussed above), as well as other
> recipes, including ones specific to strings and sequences. It would
> essentially be a short how-to. Call it 9.1.3 "Grouping, Blocking, or
> Chunking Sequences and Iterables". The synonyms will help external
> searching. A toc would let people who have found this doc know to look for
> this at the bottom.

This makes matters pretty ugly. In an ideal language there should be
less documentation, not more.

From techtonik at gmail.com Thu Jul 5 15:47:53 2012
From: techtonik at gmail.com (anatoly techtonik)
Date: Thu, 5 Jul 2012 16:47:53 +0300
Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None)
In-Reply-To: 
References: <4FEF6B10.9040409@udel.edu>
	<2E44286F-D3DB-4739-89AF-3A8EAA9D8901@gmail.com>
Message-ID: 

On Sun, Jul 1, 2012 at 3:01 PM, Stefan Behnel wrote:
>
> To address the main problem of users not finding what they need, what about
> simply extending the docstring of the grouper() function with a sentence
> like this:
>
> "This functionality is also called 'chunking' or 'blocking' and can be used
> for load distribution and sharding."
>
> That would make it easy for users to find what they are looking for when
> they search the page for "chunk". I find that a much more common and less
> ambiguous name than "grouping", which reminds me more of "group by".

It appeared that "chunking" and "grouping" are different kinds of
tasks. You can chunk a sequence (string) by slicing it into smaller
sequences, but you cannot chunk an iterable - you can only group it.
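The difference is easy to show in code - a minimal sketch, where the
names `chunk` and `group` are ad hoc and not a proposed API:

```python
from itertools import zip_longest

def chunk(seq, size):
    # Chunking keeps the sequence type: slices of a string are strings,
    # slices of a list are lists.
    return [seq[i:i + size] for i in range(0, len(seq), size)]

def group(iterable, size, fill=None):
    # Grouping works on any iterable but decomposes it into tuples of items.
    args = [iter(iterable)] * size
    return list(zip_longest(*args, fillvalue=fill))

print(chunk('ABCDE', 2))  # ['AB', 'CD', 'E']
print(group('ABCDE', 2))  # [('A', 'B'), ('C', 'D'), ('E', None)]
```

Getting strings back out of the grouped form requires an extra
''.join() step, which is exactly the structural information the
iterator has dropped.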
There is a loss of structural information that occurs when a sequence
(string) becomes an iterator:

chunks ABCDE -> AB CD E
group  ABCDE -> A B C D E -> (A B) (C D) (E _)

From barry at python.org Thu Jul 5 15:52:49 2012
From: barry at python.org (Barry Warsaw)
Date: Thu, 5 Jul 2012 09:52:49 -0400
Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None)
In-Reply-To: 
References: <4FEF6B10.9040409@udel.edu>
Message-ID: <20120705095249.79ca136c@limelight.wooz.org>

On Jul 05, 2012, at 04:36 PM, anatoly techtonik wrote:

>It would be much easier to do this in unified interface for threaded
>discussions, but for now there is no capabilities for that neither in Mailman
>nor in GMail.

You might like to read the mailing lists via NNTP on Gmane.

Cheers,
-Barry

From techtonik at gmail.com Thu Jul 5 16:33:19 2012
From: techtonik at gmail.com (anatoly techtonik)
Date: Thu, 5 Jul 2012 17:33:19 +0300
Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None)
In-Reply-To: 
References: 
Message-ID: 

On Wed, Jul 4, 2012 at 9:31 PM, Terry Reedy wrote:
> On 7/4/2012 5:57 AM, anatoly techtonik wrote:
>>
>> On Fri, Jun 29, 2012 at 11:32 PM, Georg Brandl wrote:
>
>>> Anatoly, so far there were no negative votes -- would you care to go
>>> another step and propose a patch?
>>
>> Was about to say "no problem",
>
> Did you read that there *are* strong negative votes? And that this idea has
> been rejected before? I summarized the objections in my two responses and
> pointed to the tracker issues. One of the objections is that there are 4
> different things one might want if the sequence length is not an even
> multiple of the chunk size. Your original 'idea' did not specify.

I actually meant that there is a problem with proposing a patch, in
the sense of getting a checkout, working on a diff, and sending it by
attaching it to the bug tracker as the developer guide says.
>> For now the best thing I can do (I don't risk even to mention anything
>> with 3.3) is to copy/paste code from the docs here:
>>
>> from itertools import izip_longest
>> def chunks(iterable, size, fill=None):
>>     """Split an iterable into blocks of fixed-length"""
>>     # chunks('ABCDEFG', 3, 'x') --> ABC DEF Gxx
>>     args = [iter(iterable)] * size
>>     return izip_longest(fillvalue=fill, *args)
>
> Python ideas is about Python 3 ideas. Please post Python 3 code.
>
> This is actually a one-liner:
>
> return zip_longest(*[iter(iterable)]*size, fillvalue=fill)
>
> We don't generally add such to the stdlib.

Can you figure out from the code what this stuff does? It doesn't give
chunks of strings.

>> BTW, this doesn't work as expected (at least for strings). Expected is:
>> chunks('ABCDEFG', 3, 'x') --> 'ABC' 'DEF' 'Gxx'
>> got:
>> chunks('ABCDEFG', 3, 'x') --> ('A', 'B', 'C') ('D', 'E', 'F') ('G', 'x', 'x')
>
> One of the problems with the idea of 'add a chunker' is that there are
> at least a dozen variants that different people want.

That's not the problem. People always want something extra. The
problem is that we don't have a real distribution of the wishes. If
1000 people want chunks and 1 wants groups with an exception - we
still account these as equal variants. Therefore my idea is
deliberately limited to the "string to chunks" user story and the SO
implementation proposal.

> I discussed the
> return-type issue in my responses. I showed how to get the 'expected'
> response above using grouper, but also suggested that it is the wrong
> basis for splitting strings. Repeated slicing makes more sense for
> concrete sequence types.
>
> def seqchunk_odd(s, size):
>     # include the odd-size leftover
>     for i in range(0, len(s), size):
>         yield s[i:i+size]
>
> print(list(seqchunk_odd('ABCDEFG', 3)))
> #
> ['ABC', 'DEF', 'G']

Right. That's the top answer on SO that people think should be in the
stdlib. Great, we are talking about the same thing actually.
> def seqchunk_even(s, size):
>     # only include even (full-size) chunks
>     for i in range(0, size*(len(s)//size), size):
>         yield s[i:i+size]
>
> print(list(seqchunk_even('ABCDEFG', 3)))
> #
> ['ABC', 'DEF']

This is deducible from seqchunk_odd(s, size).

> def strchunk_fill(s, size, fill):
>     # fill the odd chunk
>     q, r = divmod(len(s), size)
>     even = size * q
>     for i in range(0, even, size):
>         yield s[i:i+size]
>     if r:
>         yield s[even:] + fill * (size - r)
>
> print(list(strchunk_fill('ABCDEFG', 3, 'x')))
> #
> ['ABC', 'DEF', 'Gxx']

Also deducible from seqchunk_odd(s, size).

> Because the 'fill' value is necessarily a sequence for strings,
> strchunk_fill would only work for lists and tuples if the fill value were
> either required to be given as a tuple or list of length 1 or if it were
> internally converted inside the function. Skipping that for now.
>
> Having written the fill version based on the even version, it is easy to
> select among the three behaviors by modifying the fill version.
>
> def strchunk(s, size, fill=NotImplemented):
>     # fill the odd chunk
>     q, r = divmod(len(s), size)
>     even = size * q
>     for i in range(0, even, size):
>         yield s[i:i+size]
>     if r and fill is not NotImplemented:
>         yield s[even:] + fill * (size - r)
>
> print(*strchunk('ABCDEFG', 3))
> print(*strchunk('ABCDEFG', 3, ''))
> print(*strchunk('ABCDEFG', 3, 'x'))
> #
> ABC DEF
> ABC DEF G
> ABC DEF Gxx

I now don't even think that the fill value is needed as an argument:

if len(chunk) < size:
    chunk.extend([fill] * (size - len(chunk)))

> I already described how something similar could be done by checking each
> grouper output tuple for a fill value, but that requires that the fill value
> be a sentinel that could not otherwise appear in the tuple. One could modify
> grouper to fill with a private object() and check the last item of each
> group for that sentinel and act accordingly (delete, truncate, or replace).
> A generic API needs some thought, though.
I just need to chunk strings and sequences. A generic API is too
complex without counting all the use cases and iterating over them.

> An issue I did not previously mention is that people sometimes want
> overlapping chunks rather than contiguous disjoint chunks. The slice
> approach trivially adapts to that.
>
> def seqlap(s, size):
>     for i in range(len(s)-size+1):
>         yield s[i:i+size]
>
> print(*seqlap('ABCDEFG', 3))
> #
> ABC BCD CDE DEF EFG
>
> A sliding window for a generic iterable requires a deque or ring buffer
> approach that is quite different from the zip-longest -- grouper approach.

That's why I'd like to drastically reduce the scope of the proposal.
itertools doesn't seem to be the best place anymore. How about a
sequence method?

string.chunks(size) -> ABC DEF G
list.chunks(size) -> [A,B,C], [D,E,F], [G]

If somebody needs a keyword argument - this can come later without
breaking compatibility.

From steve at pearwood.info Thu Jul 5 17:57:17 2012
From: steve at pearwood.info (Steven D'Aprano)
Date: Fri, 06 Jul 2012 01:57:17 +1000
Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None)
In-Reply-To: 
References: 
Message-ID: <4FF5B95D.3060208@pearwood.info>

anatoly techtonik wrote:
> On Wed, Jul 4, 2012 at 9:31 PM, Terry Reedy wrote:
>> A sliding window for a generic iterable requires a deque or ring buffer
>> approach that is quite different from the zip-longest -- grouper approach.
>
> That's why I'd like to drastically reduce the scope of the proposal.
> itertools doesn't seem to be the best place anymore. How about a
> sequence method?
>
> string.chunks(size) -> ABC DEF G
> list.chunks(size) -> [A,B,C], [D,E,F], [G]

-1

This is a fairly trivial problem to solve, and there are many
variations on it. Many people will not find the default behaviour
helpful, and will need to write their own. Why complicate the API for
all sequence types with this?
I don't believe that we should enshrine one variation as a built-in
method, without any evidence that it is the most useful or common
variation.

Even if there is one variation far more useful than the others, that
doesn't necessarily mean we ought to make it a builtin method unless
it is a fundamental sequence operation, has wide applicability, and is
genuinely hard to write. I don't believe chunking meets *any* of those
criteria, let alone all three. Not every six-line function needs to be
a builtin.

I believe that splitting a sequence (or a string) into fixed-size
chunks is more of a programming exercise problem than a genuinely
useful tool. That does not mean that there are never any real
use-cases for splitting into fixed-size chunks, only that this is the
sort of function that *seems* more useful in theory than it turns out
to be in practice.

Compare this with more useful sequence/iteration tools, like (say)
zip. You can hardly write a hundred lines of code without using zip at
least once. But I bet you can write tens of thousands of lines of code
without needing to split sequences into fixed chunks like this.

Besides, the name "chunks" is more general than how you are using it.
For example, I consider chunking to be splitting a sequence up at
various delimiters or separators, not at fixed character positions.
E.g. "the third word of item two of the fourth line" is a chunk. This
fits more with the non-programming use of the term chunk or chunking,
and has precedent in Apple's HyperTalk language, which literally
allowed you to talk about words, items and lines of text, each of
which is described as a chunk.

This might be a good candidate for a utility module made up of
assorted useful functions, but not for the string and sequence APIs.

-- 
Steven

From ctb at msu.edu Thu Jul 5 18:01:26 2012
From: ctb at msu.edu (C. Titus Brown)
Date: Thu, 5 Jul 2012 09:01:26 -0700
Subject: [Python-Dev] GitHub mirror (Was: Bitbucket mirror?)
In-Reply-To: <20120705124952.GA17866@chang> References: <20120705124952.GA17866@chang> Message-ID: <20120705160126.GG31815@idyll.org> On Thu, Jul 05, 2012 at 03:49:52PM +0300, Petri Lehtinen wrote: > anatoly techtonik wrote: > > On the subject. Is there a mirror of CPython on GitHub? > > https://github.com/akheron/cpython > > > changes to repository (and allows anonymous to do this). I've made > > more than a dozen proposal for fixing docs, because as a matter of > > fact - filling a bug AND explaining why docs are wrong, why they need > > to be fixed, what should be added - all of this is a way *much easier* > > (and less time consuming!) than just fixing them. Unfortunately. > > You won't get any changes in to CPython by creating pull requests. We > use http://bugs.python.org/ for that, sorry. Question -- is there a reason to abide by this rule for docs? That is, if we could get a sympathetic core dev to look at pull requests for docs as part of a streamlined process, would it cause problems? (What I'm really asking is whether or not the bugs.python.org process is considered critical for potentially minor doc changes and additions.) thanks, --titus -- C. Titus Brown, ctb at msu.edu From steve at pearwood.info Thu Jul 5 18:09:44 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 06 Jul 2012 02:09:44 +1000 Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: <4FEF6B10.9040409@udel.edu> Message-ID: <4FF5BC48.9080302@pearwood.info> anatoly techtonik wrote: >>>> Which is the 33th most voted Python question on SO - >>>> >>>> http://stackoverflow.com/questions/312443/how-do-you-split-a-list-into-evenly-sized-chunks-in-python/312464 >> I am curious how you get that number. I do note that there are about 15 >> other Python SO questions that seem to be variations on the theme. There >> might be more if 'blocks' and 'groups' were searched for. > > It's easy: > 1. Go http://stackoverflow.com/ > 2.
Search [python] > 3. Click `votes` tab > 4. Choose `30 per page` at the bottom > 5. Jump to the second page, there it is 4th from the top: > http://stackoverflow.com/questions/tagged/python?page=2&sort=votes&pagesize=30 Yes. I don't think this is particularly significant. Have a look at some of the questions with roughly the same number of votes: #26 "How can I remove (chomp) a newline in Python?" 176 votes #33 "How do you split a list into evenly sized chunks in Python?" 149 votes #36 "Accessing the index in Python for loops" 144 votes Being 33rd most voted question doesn't really mean much. By the way, why is this discussion going to both python-dev and python-ideas? -- Steven From stefan_ml at behnel.de Thu Jul 5 18:50:32 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Thu, 05 Jul 2012 18:50:32 +0200 Subject: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: <4FEF6B10.9040409@udel.edu> Message-ID: anatoly techtonik, 05.07.2012 15:36: > On Sun, Jul 1, 2012 at 12:09 AM, Terry Reedy wrote: >> From Raymond's first message on http://bugs.python.org/issue6021 , add >> grouper: >> >> "This has been rejected before. > > I quite often see such arguments and I can't stand to repeat that > these are not arguments. It is good to know, but when people use that > as a reason to close tickets - that's just disgusting. The *real* problem is that people keep bringing up topics (and even spell them out in the bug tracker) without searching for existing discussions and/or tickets first. That's why those who do such a search (or who know what they are talking about anyway) close these tickets with the remark "this has been rejected before", instead of repeating an entire heap of arguments all over again to feed a discussion that would only lead to the same result as it did before, often several times before. 
Stefan From martin at v.loewis.de Thu Jul 5 20:27:02 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Thu, 05 Jul 2012 20:27:02 +0200 Subject: [Python-Dev] GitHub mirror (Was: Bitbucket mirror?) In-Reply-To: <20120705160126.GG31815@idyll.org> References: <20120705124952.GA17866@chang> <20120705160126.GG31815@idyll.org> Message-ID: <20120705202702.Horde.Yh-RBqGZi1VP9dx2H7Nj-nA@webmail.df.eu> >> You won't get any changes in to CPython by creating pull requests. We >> use http://bugs.python.org/ for that, sorry. > > Question -- is there a reason to abide by this rule for docs? That is, if we > could get a sympathetic core dev to look at pull requests for docs as part of > a streamlined process, would it cause problems? How do you communicate a "pull request"? On bitbucket, there is a "pull request" UI resulting in a tracker item being generated (and an email being sent), but hg.python.org doesn't have a notion of pull requests. Of course, you could use any communication means (email, telephone call, carrier pigeon) to request a pull from a "sympathetic core dev". > (What I'm really asking is whether or the bugs.python.org process is > considered critical for potentially minor doc changes and additions.) The sympathetic core dev is mostly free to bypass any submission process initially; commits that bypass established procedures will likely be questioned only after the fact. In the specific case, I'd be worried to verify that the submitter has provided a contributor form. That's easy to do in the bug tracker, but difficult to do in an offline pull request. Of course, for a really minor doc change (e.g. typo fixes), no contrib form is necessary. 
Regards, Martin From techtonik at gmail.com Thu Jul 5 21:41:29 2012 From: techtonik at gmail.com (anatoly techtonik) Date: Thu, 5 Jul 2012 22:41:29 +0300 Subject: [Python-Dev] Bloody FAQ (Was: [Python-ideas] itertools.chunks(iterable, size, fill=None)) Message-ID: On Thu, Jul 5, 2012 at 7:50 PM, Stefan Behnel wrote: > anatoly techtonik, 05.07.2012 15:36: >> On Sun, Jul 1, 2012 at 12:09 AM, Terry Reedy wrote: >>> From Raymond's first message on http://bugs.python.org/issue6021 , add >>> grouper: >>> >>> "This has been rejected before. >> >> I quite often see such arguments and I can't stand to repeat that >> these are not arguments. It is good to know, but when people use that >> as a reason to close tickets - that's just disgusting. > > The *real* problem is that people keep bringing up topics (and even spell > them out in the bug tracker) without searching for existing discussions > and/or tickets first. That's why those who do such a search (or who know > what they are talking about anyway) close these tickets with the remark > "this has been rejected before", instead of repeating an entire heap of > arguments all over again to feed a discussion that would only lead to the > same result as it did before, often several times before. Make the bloody FAQ and summarize this stuff? Why waste each others time? If people don't enjoy repeating themselves over and over - there is a bloody wiki. What should happen to people to start extracting gems of knowledge from piles of dusty sheets called list "archives" for others to admire. No, it is easier to say "it was already discussed many times", "why don't you Google yourself", "so far you're only complaining", etc. If people can't find anything - why everybody thinks they are ignorant and lazy. Even if it so, why nobody thinks that maybe that bloody Xapian index is dead again for a bloody amount of moons nobody knows why and how many exactly? Why nobody thinks that lazy coders can also help with development? 
Maybe that laziness is the primary reason some major groups actually prefer Python to Java, C++ and other more interesting languages (such as PHP) when it comes to typing? Make it easy and the patches will follow. Answers like "this was discussed before" don't make it easy to understand, and leaving users rereading old 19xx archives that people don't reread themselves will likely make users bounce and never (NEVER!) come up with some proposal again. An "organic" way to keep traffic low. Miscommunication is a bad experience for users, bad experience for developers, everybody is annoyed and as a result such nice language as Python loses points on TIOBE (and convenient chunk() functions to munch-munch on the sequence data). Wheew. :-F From breamoreboy at yahoo.co.uk Thu Jul 5 21:55:09 2012 From: breamoreboy at yahoo.co.uk (Mark Lawrence) Date: Thu, 05 Jul 2012 20:55:09 +0100 Subject: [Python-Dev] Bloody FAQ (Was: [Python-ideas] itertools.chunks(iterable, size, fill=None)) In-Reply-To: References: Message-ID: On 05/07/2012 20:41, anatoly techtonik wrote: > On Thu, Jul 5, 2012 at 7:50 PM, Stefan Behnel wrote: >> anatoly techtonik, 05.07.2012 15:36: >>> On Sun, Jul 1, 2012 at 12:09 AM, Terry Reedy wrote: >>>> From Raymond's first message on http://bugs.python.org/issue6021 , add >>>> grouper: >>>> >>>> "This has been rejected before. >>> >>> I quite often see such arguments and I can't stand to repeat that >>> these are not arguments. It is good to know, but when people use that >>> as a reason to close tickets - that's just disgusting. >> >> The *real* problem is that people keep bringing up topics (and even spell >> them out in the bug tracker) without searching for existing discussions >> and/or tickets first. 
That's why those who do such a search (or who know >> what they are talking about anyway) close these tickets with the remark >> "this has been rejected before", instead of repeating an entire heap of >> arguments all over again to feed a discussion that would only lead to the >> same result as it did before, often several times before. > > Make the bloody FAQ and summarize this stuff? Why waste each others > time? If people don't enjoy repeating themselves over and over - there > is a bloody wiki. What should happen to people to start extracting > gems of knowledge from piles of dusty sheets called list "archives" > for others to admire. > > No, it is easier to say "it was already discussed many times", "why > don't you Google yourself", "so far you're only complaining", etc. If > people can't find anything - why everybody thinks they are ignorant > and lazy. Even if it so, why nobody thinks that maybe that bloody > Xapian index is dead again for a bloody amount of moons nobody knows > why and how many exactly? Why nobody thinks that lazy coders can also > help with development? Maybe that laziness is the primary reason some > major groups actually prefer Python to Java, C++ and other more > interesting languages (such as PHP) when it comes to typing? Make it > easy and the patches will follow. Answers like "this was discussed > before" don't make it easy to understand, and leaving users rereading > old 19xx archives that people don't reread themselves will likely make > users bounce and never (NEVER!) come up with some proposal again. An > "organic" way to keep traffic low. > > Miscommunication is a bad experience for users, bad experience for > developers, everybody is annoyed and as a result such nice language as > Python loses points on TIOBE (and convenient chunk() functions to > munch-munch on the sequence data). > > Wheew. :-F > Can I safely assume that you are volunteering to do the work required? -- Cheers. Mark Lawrence. 
From stefan_ml at behnel.de Thu Jul 5 21:58:52 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Thu, 05 Jul 2012 21:58:52 +0200 Subject: [Python-Dev] Bloody FAQ (Was: [Python-ideas] itertools.chunks(iterable, size, fill=None)) In-Reply-To: References: Message-ID: anatoly techtonik, 05.07.2012 21:41: > On Thu, Jul 5, 2012 at 7:50 PM, Stefan Behnel wrote: >> anatoly techtonik, 05.07.2012 15:36: >>> On Sun, Jul 1, 2012 at 12:09 AM, Terry Reedy wrote: >>>> From Raymond's first message on http://bugs.python.org/issue6021 , add >>>> grouper: >>>> >>>> "This has been rejected before. >>> >>> I quite often see such arguments and I can't stand to repeat that >>> these are not arguments. It is good to know, but when people use that >>> as a reason to close tickets - that's just disgusting. >> >> The *real* problem is that people keep bringing up topics (and even spell >> them out in the bug tracker) without searching for existing discussions >> and/or tickets first. That's why those who do such a search (or who know >> what they are talking about anyway) close these tickets with the remark >> "this has been rejected before", instead of repeating an entire heap of >> arguments all over again to feed a discussion that would only lead to the >> same result as it did before, often several times before. > > Make the bloody FAQ and summarize this stuff? Why waste each others > time? Yes, that is exactly the question. It takes time to write things up nicely. I mean, once someone has pointed out to you that this has been discussed before, you could just go, look it up (or search for it), and then put it into a Wiki or blog post yourself, or sum it up and send it to the mailing list as a reply. Why rely on others to do it for you? 
Stefan From paul at boddie.org.uk Thu Jul 5 23:11:46 2012 From: paul at boddie.org.uk (Paul Boddie) Date: Thu, 5 Jul 2012 23:11:46 +0200 Subject: [Python-Dev] Bloody FAQ (Was: [Python-ideas] itertools.chunks(iterable, size, fill=None)) Message-ID: <201207052311.46867.paul@boddie.org.uk> Stefan Behnel wrote: > anatoly techtonik, 05.07.2012 21:41: > > > > Make the bloody FAQ and summarize this stuff? Why waste each others > > time? > > Yes, that is exactly the question. > > It takes time to write things up nicely. I mean, once someone has pointed > out to you that this has been discussed before, you could just go, look it > up (or search for it), and then put it into a Wiki or blog post yourself, > or sum it up and send it to the mailing list as a reply. Why rely on others > to do it for you? To be fair, Anatoly has done quite a bit of maintenance on some of the Wiki content around various aspects of the project, so it's not as if he's demanding anything out of the ordinary or asking for others to do things that he isn't already doing in some sense. My experience is that there usually needs to be some willingness on the other end of the transaction, and if it takes repetition to encourage it amongst those who don't see the current situation as a problem for them, then so be it. Of course, this kind of documentation activity, where one gathers together historical decisions and the consensus from long-forgotten discussions, is pretty thankless work. I occasionally regard it as worthwhile if only to bring up something someone said as an inconvenient interruption in any current discussion, but that's a pretty minimal reward for all the effort unless one has such work as part of one's daily routine. Paul From larry at hastings.org Fri Jul 6 08:47:30 2012 From: larry at hastings.org (Larry Hastings) Date: Fri, 06 Jul 2012 08:47:30 +0200 Subject: [Python-Dev] EuroPython 2012 Language Summit is Canceled. 
Message-ID: <4FF68A02.8000500@hastings.org> I only got one more RSVP and zero topics for the docket. So let's sprint instead. See you at the PyCon 2013 Language Summit, //arry/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From merwok at netwok.org Fri Jul 6 16:54:39 2012 From: merwok at netwok.org (=?UTF-8?B?w4lyaWMgQXJhdWpv?=) Date: Fri, 06 Jul 2012 10:54:39 -0400 Subject: [Python-Dev] [Python-checkins] peps: specify Provides-Extra In-Reply-To: <3WTJQY0ZPZzNND@mail.python.org> References: <3WTJQY0ZPZzNND@mail.python.org> Message-ID: <4FF6FC2F.2070208@netwok.org> Le 06/07/2012 10:36, benjamin.peterson a écrit : > http://hg.python.org/peps/rev/4dc72bbf9ed2 > changeset: 4483:4dc72bbf9ed2 > user: Daniel Holth > date: Wed Jul 04 07:30:08 2012 -0400 > summary: > specify Provides-Extra Sorry, but this was premature. The mail on distutils-sig only describes the syntax for a new field without any explanation of the semantics and use case, not the relation to the Setup-Requires and Test-Requires fields that we've been discussing for some time (and which would make part of the Extra proposal obsolete). I know Alexis and I are not as active as other devs in mailing lists and PEPs, but these things are on our radar and we want to push them forward. Please allow the discussion to take place before editing the packaging PEPs. Regards From status at bugs.python.org Fri Jul 6 18:07:18 2012 From: status at bugs.python.org (Python tracker) Date: Fri, 6 Jul 2012 18:07:18 +0200 (CEST) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20120706160718.9092F1CD07@psf.upfronthosting.co.za> ACTIVITY SUMMARY (2012-06-29 - 2012-07-06) Python tracker at http://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message.
Issues counts and deltas: open 3495 (+10) closed 23546 (+31) total 27041 (+41) Open issues with patches: 1462 Issues opened (32) ================== #10571: "setup.py upload --sign" broken: TypeError: 'str' does not sup http://bugs.python.org/issue10571 reopened by eric.araujo #15223: datetime instances lack __module__ attribute http://bugs.python.org/issue15223 opened by cjerdonek #15227: Fatal Python error: PyEval_RestoreThread: NULL tstate on examp http://bugs.python.org/issue15227 opened by tmaslach #15228: os.utime() docs not clear on behavior on nonexistant files http://bugs.python.org/issue15228 opened by dewin #15230: runpy.run_path broken: Breaks scoping; pollutes sys.modules; d http://bugs.python.org/issue15230 opened by Benjamin.S.Wolf #15231: update PyPI upload doc to say --no-raw passed to rst2html.py http://bugs.python.org/issue15231 opened by cjerdonek #15232: email.generator.Generator doesn't mangle "From " lines in MIME http://bugs.python.org/issue15232 opened by Chris.Pickett #15233: atexit: guarantee order of execution of registered functions? 
http://bugs.python.org/issue15233 opened by georg.brandl #15234: avoid runtime library path for extensions found in system dire http://bugs.python.org/issue15234 opened by doko #15235: allow newer berkley db versions http://bugs.python.org/issue15235 opened by doko #15237: Add capsule API to _decimal http://bugs.python.org/issue15237 opened by skrah #15238: shutil.copystat should copy Linux extended attributes http://bugs.python.org/issue15238 opened by larry #15239: Abandoned Tools/unicode/mkstringprep.py http://bugs.python.org/issue15239 opened by storchaka #15242: PyImport_GetMagicTag() should use the same const char * as sys http://bugs.python.org/issue15242 opened by eric.snow #15243: Misleading documentation for __prepare__ http://bugs.python.org/issue15243 opened by William.Schwartz #15244: Support for opening files with FILE_SHARE_DELETE on Windows http://bugs.python.org/issue15244 opened by sbt #15245: ast.literal_eval fails on some literals http://bugs.python.org/issue15245 opened by JBernardo #15246: Line coverage for collectionts.abc.Set http://bugs.python.org/issue15246 opened by agentultra #15247: io.open() is inconsistent re os.open() http://bugs.python.org/issue15247 opened by apalala #15248: In "TypeError: 'tuple' object is not callable", explain that a http://bugs.python.org/issue15248 opened by o11c #15249: email.generator.BytesGenerator doesn't mangle "From " lines wh http://bugs.python.org/issue15249 opened by petri.lehtinen #15250: document that filecmp.dircmp comparisons are "shallow" http://bugs.python.org/issue15250 opened by cjerdonek #15251: new.code and new.function crashes Python iterpretter http://bugs.python.org/issue15251 opened by jeffdharper #15255: spam / garbage report http://bugs.python.org/issue15255 opened by andisthermal555 #15256: Typo in error message http://bugs.python.org/issue15256 opened by Marc.Abramowitz #15257: Misc/.gdbinit:pystack is too brittle http://bugs.python.org/issue15257 opened by rian #15258: argparse 
documentation: Improve optparse section regarding all http://bugs.python.org/issue15258 opened by nejucomo #15259: "Helping with Documentation" references missing dailybuild.py http://bugs.python.org/issue15259 opened by cjerdonek #15260: Mention how to order Misc/NEWS entries http://bugs.python.org/issue15260 opened by cjerdonek #15262: Idle does not show traceback in other threads http://bugs.python.org/issue15262 opened by Mark #15263: Guard against invalid file handles in os functions http://bugs.python.org/issue15263 opened by larry #1207613: Bottom Scroll Bar http://bugs.python.org/issue1207613 reopened by terry.reedy Most recent 15 issues with no replies (15) ========================================== #15262: Idle does not show traceback in other threads http://bugs.python.org/issue15262 #15260: Mention how to order Misc/NEWS entries http://bugs.python.org/issue15260 #15259: "Helping with Documentation" references missing dailybuild.py http://bugs.python.org/issue15259 #15258: argparse documentation: Improve optparse section regarding all http://bugs.python.org/issue15258 #15257: Misc/.gdbinit:pystack is too brittle http://bugs.python.org/issue15257 #15256: Typo in error message http://bugs.python.org/issue15256 #15250: document that filecmp.dircmp comparisons are "shallow" http://bugs.python.org/issue15250 #15244: Support for opening files with FILE_SHARE_DELETE on Windows http://bugs.python.org/issue15244 #15234: avoid runtime library path for extensions found in system dire http://bugs.python.org/issue15234 #15230: runpy.run_path broken: Breaks scoping; pollutes sys.modules; d http://bugs.python.org/issue15230 #15228: os.utime() docs not clear on behavior on nonexistant files http://bugs.python.org/issue15228 #15227: Fatal Python error: PyEval_RestoreThread: NULL tstate on examp http://bugs.python.org/issue15227 #15201: C argument errors and Python arguments error are different http://bugs.python.org/issue15201 #15199: Default mimetype for javascript 
should be application/javascri http://bugs.python.org/issue15199 #15195: test_distutils fails when ARCHFLAGS is set on a Mac http://bugs.python.org/issue15195 Most recent 15 issues waiting for review (15) ============================================= #15257: Misc/.gdbinit:pystack is too brittle http://bugs.python.org/issue15257 #15250: document that filecmp.dircmp comparisons are "shallow" http://bugs.python.org/issue15250 #15249: email.generator.BytesGenerator doesn't mangle "From " lines wh http://bugs.python.org/issue15249 #15246: Line coverage for collectionts.abc.Set http://bugs.python.org/issue15246 #15245: ast.literal_eval fails on some literals http://bugs.python.org/issue15245 #15244: Support for opening files with FILE_SHARE_DELETE on Windows http://bugs.python.org/issue15244 #15242: PyImport_GetMagicTag() should use the same const char * as sys http://bugs.python.org/issue15242 #15239: Abandoned Tools/unicode/mkstringprep.py http://bugs.python.org/issue15239 #15238: shutil.copystat should copy Linux extended attributes http://bugs.python.org/issue15238 #15234: avoid runtime library path for extensions found in system dire http://bugs.python.org/issue15234 #15233: atexit: guarantee order of execution of registered functions? 
http://bugs.python.org/issue15233 #15231: update PyPI upload doc to say --no-raw passed to rst2html.py http://bugs.python.org/issue15231 #15220: Reduce parsing overhead in email.feedparser.BufferedSubFile http://bugs.python.org/issue15220 #15209: Re-raising exceptions from an expression http://bugs.python.org/issue15209 #15207: mimetypes.read_windows_registry() uses the wrong regkey, creat http://bugs.python.org/issue15207 Top 10 most discussed issues (10) ================================= #14243: tempfile.NamedTemporaryFile not particularly useful on Windows http://bugs.python.org/issue14243 15 msgs #3754: cross-compilation support for python build http://bugs.python.org/issue3754 12 msgs #3871: cross and native build of python for mingw32 with packaging http://bugs.python.org/issue3871 11 msgs #15202: followlinks/follow_symlinks/symlinks flags unification http://bugs.python.org/issue15202 11 msgs #15233: atexit: guarantee order of execution of registered functions? http://bugs.python.org/issue15233 11 msgs #15245: ast.literal_eval fails on some literals http://bugs.python.org/issue15245 11 msgs #15180: Cryptic traceback from os.path.join when mixing str & bytes http://bugs.python.org/issue15180 6 msgs #15198: multiprocessing Pipe send of non-picklable objects doesn't rai http://bugs.python.org/issue15198 6 msgs #1602: windows console doesn't print or input Unicode http://bugs.python.org/issue1602 5 msgs #3561: Windows installer should add Python and Scripts directories to http://bugs.python.org/issue3561 5 msgs Issues closed (28) ================== #1677: Ctrl-C will exit out of Python interpreter in Windows http://bugs.python.org/issue1677 closed by tim.golden #2345: Using an exception variable outside an 'except' clause should http://bugs.python.org/issue2345 closed by brett.cannon #4485: fast swap of "default" Windows python versions http://bugs.python.org/issue4485 closed by tim.golden #9239: zipfile: truncating comment can corrupt the zipfile 
http://bugs.python.org/issue9239 closed by pitrou #14330: don't use host python, use host search paths for host compiler http://bugs.python.org/issue14330 closed by doko #14591: Value returned by random.random() out of valid range on 64-bit http://bugs.python.org/issue14591 closed by python-dev #14902: test_logging failed http://bugs.python.org/issue14902 closed by python-dev #14967: distutils2.utils.resolve_name cannot be implemented to give co http://bugs.python.org/issue14967 closed by eric.araujo #15020: default value for progname in pythonrun.c should be python3 fo http://bugs.python.org/issue15020 closed by pitrou #15030: PyPycLoader can't read cached .pyc files http://bugs.python.org/issue15030 closed by brett.cannon #15033: Different exit status when using -m http://bugs.python.org/issue15033 closed by orsenthil #15141: IDLE horizontal scroll bar missing (Win-XPsp3) http://bugs.python.org/issue15141 closed by terry.reedy #15166: Implement imp.get_tag() using sys.implementation http://bugs.python.org/issue15166 closed by brett.cannon #15174: amd64\python_d.exe -m test fails http://bugs.python.org/issue15174 closed by skrah #15210: importlib.__init__ checks for the wrong exception when looking http://bugs.python.org/issue15210 closed by brett.cannon #15212: Rename SC_GLOBAL_EXPLICT to SC_GLOBAL_EXPLICIT in compiler mod http://bugs.python.org/issue15212 closed by pitrou #15224: Range: Additional Methods (min/max/__and__) http://bugs.python.org/issue15224 closed by mark.dickinson #15225: Add negative tests for passing str to hmac.HMAC and hmac.new http://bugs.python.org/issue15225 closed by pitrou #15226: max( str ) should be fast with PEP 393 http://bugs.python.org/issue15226 closed by loewis #15229: stringification of subclasses of OSError can cause crash http://bugs.python.org/issue15229 closed by pitrou #15236: SEGFAULT in visit_decref http://bugs.python.org/issue15236 closed by amaury.forgeotdarc #15240: ctype Structure keeps reference to function pointers 
http://bugs.python.org/issue15240 closed by vpelletier #15241: venv module and pyvenv script documented, but not python behav http://bugs.python.org/issue15241 closed by python-dev #15252: Delivery reports about your e-mail http://bugs.python.org/issue15252 closed by benjamin.peterson #15253: xmlrpclib.ServerProxy does not support 2-tuple value for uri p http://bugs.python.org/issue15253 closed by loewis #15254: 08 is invalid token in lists. http://bugs.python.org/issue15254 closed by tim.golden #15261: os.stat(fd) crashes on Windows if fd does not exist http://bugs.python.org/issue15261 closed by sbt #798876: windows sys.path contains nonexistant directory http://bugs.python.org/issue798876 closed by tim.golden From g.brandl at gmx.net Fri Jul 6 21:27:19 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Fri, 06 Jul 2012 21:27:19 +0200 Subject: [Python-Dev] GitHub mirror (Was: Bitbucket mirror?) In-Reply-To: References: Message-ID: On 04.07.2012 10:42, anatoly techtonik wrote: > On Fri, Jun 29, 2012 at 6:58 AM, Eli Bendersky wrote: >> >> The devguide (http://docs.python.org/devguide/committing.html) says: >> >> Bitbucket also maintain an up to date clone of the main cpython repository >> that can be used as the basis for a new clone or patch queue. >> >> [the link goes to https://bitbucket.org/mirror/cpython/overview] > > This one is better than Atlassian's (looks nice, shorter URL) and > worked long before Atlassian promotion as "makers of Bitbucket", which > is dead wrong BTW, because everybody knows they bought BB. I don't > remember if I commented on the Python Insider or if it was censored. > Whatever.. it hurts Atlassian, not me. > > On the subject. Is there a mirror of CPython on GitHub? > I abandoned my efforts to create online editor for docs, which should > have been both web fronted independent, maintainable and clean. 
It > appeared much easier to just use GitHub - so far it is the only one > from open source code hosting services that directly commits online > changes to repository (and allows anonymous to do this). I've made > more than a dozen proposal for fixing docs, because as a matter of > fact - filling a bug AND explaining why docs are wrong, why they need > to be fixed, what should be added - all of this is a way *much easier* > (and less time consuming!) than just fixing them. Unfortunately. Then you didn't read http://docs.python.org/bugs -- an email to docs at python.org is enough. If the bug/change is minor, it is usually taken care of right away, if it is major, we (mostly Sandro -- thanks Sandro -- and I) create a tracker item for you. Georg From g.brandl at gmx.net Fri Jul 6 21:36:27 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Fri, 06 Jul 2012 21:36:27 +0200 Subject: [Python-Dev] cpython (2.7): Make it easier to search for the grouper() recipe. In-Reply-To: <3WRBfX2mWbzN2y@mail.python.org> References: <3WRBfX2mWbzN2y@mail.python.org> Message-ID: On 03.07.2012 06:09, raymond.hettinger wrote: > http://hg.python.org/cpython/rev/d32f21d87363 > changeset: 77921:d32f21d87363 > branch: 2.7 > parent: 77914:366df7ba1616 > user: Raymond Hettinger > date: Mon Jul 02 21:08:45 2012 -0700 > summary: > Make it easier to search for the grouper() recipe. > > files: > Doc/library/itertools.rst | 3 ++- > 1 files changed, 2 insertions(+), 1 deletions(-) > > > diff --git a/Doc/library/itertools.rst b/Doc/library/itertools.rst > --- a/Doc/library/itertools.rst > +++ b/Doc/library/itertools.rst > @@ -733,7 +733,8 @@ > return izip(a, b) > > def grouper(n, iterable, fillvalue=None): > - "grouper(3, 'ABCDEFG', 'x') --> ABC DEF Gxx" > + "Collect data into fixed-length chunks or blocks" > + # grouper(3, 'ABCDEFG', 'x') --> ABC DEF Gxx" > args = [iter(iterable)] * n > return izip_longest(fillvalue=fillvalue, *args) Will you please apply this also to 3.x? 
Georg From nad at acm.org Sat Jul 7 03:49:34 2012 From: nad at acm.org (Ned Deily) Date: Fri, 06 Jul 2012 18:49:34 -0700 Subject: [Python-Dev] Problem with hg hook for devguide repo? Message-ID: pushing to ssh://pyhg/devguide searching for changes remote: adding changesets remote: adding manifests remote: adding file changes remote: added 1 changesets with 1 changes to 1 files remote: sent email to roundup at report at bugs.python.org remote: notified python-checkins at python.org of incoming changeset bbe197bf57a1 remote: 1 files updated, 0 files merged, 0 files removed, 0 files unresolved remote: sh: 1: source: not found remote: sh: 1: sphinx-build: not found remote: warning: incoming.sphinxbuild hook exited with status 127 -- Ned Deily, nad at acm.org From benjamin at python.org Sat Jul 7 05:44:44 2012 From: benjamin at python.org (Benjamin Peterson) Date: Fri, 6 Jul 2012 22:44:44 -0500 Subject: [Python-Dev] Problem with hg hook for devguide repo? In-Reply-To: References: Message-ID: 2012/7/6 Ned Deily : > pushing to ssh://pyhg/devguide > searching for changes > remote: adding changesets > remote: adding manifests > remote: adding file changes > remote: added 1 changesets with 1 changes to 1 files > remote: sent email to roundup at report at bugs.python.org > remote: notified python-checkins at python.org of incoming changeset > bbe197bf57a1 > remote: 1 files updated, 0 files merged, 0 files removed, 0 files > unresolved > remote: sh: 1: source: not found > remote: sh: 1: sphinx-build: not found > remote: warning: incoming.sphinxbuild hook exited with status 127 Yes, this is because hg was migrated to OSUL. This is being discussed on the infrastructure list. 
-- Regards, Benjamin From solipsis at pitrou.net Sat Jul 7 15:55:35 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sat, 7 Jul 2012 15:55:35 +0200 Subject: [Python-Dev] cpython: Issue 14814: Correctly return NotImplemented from ipaddress._BaseNetwork.__eq__ References: <3WTtRG6h3vzN4C@mail.python.org> Message-ID: <20120707155535.1f7c4778@pitrou.net> On Sat, 7 Jul 2012 15:08:42 +0200 (CEST) nick.coghlan wrote: > > def __eq__(self, other): > - if not isinstance(other, _BaseNetwork): > - raise TypeError('%s and %s are not of the same type' % ( > - self, other)) > - return (self._version == other._version and > - self.network_address == other.network_address and > - int(self.netmask) == int(other.netmask)) > + try: > + return (self._version == other._version and > + self.network_address == other.network_address and > + int(self.netmask) == int(other.netmask)) > + except AttributeError: > + return NotImplemented I think the isinstance() test was correct. If you have an object which duck-types IPNetwork, you probably want its __eq__ to be called, not yours. Regards Antoine. From ncoghlan at gmail.com Sat Jul 7 16:59:51 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 8 Jul 2012 00:59:51 +1000 Subject: [Python-Dev] cpython: Issue 14814: Correctly return NotImplemented from ipaddress._BaseNetwork.__eq__ In-Reply-To: <20120707155535.1f7c4778@pitrou.net> References: <3WTtRG6h3vzN4C@mail.python.org> <20120707155535.1f7c4778@pitrou.net> Message-ID: On Sat, Jul 7, 2012 at 11:55 PM, Antoine Pitrou wrote: > I think the isinstance() test was correct. If you have an object which > duck-types IPNetwork, you probably want its __eq__ to be called, not > yours. This change was just to bring IPNetwork in line with the other types in the module and to stop it throwing TypeError itself, which meant the RHS was never even getting a chance to affect the result. Ducktyping and operator overloading has always been a tricky area though. 
If you use isinstance() checks, then the other side has to know how to reimplement your equality check, or temporarily create a real instance to do the comparison. If you use ducktyping internally, then the other side *has* to use inheritance if they want to completely control the result, but also have the option to just expose the appropriate attributes in order to interoperate with your class. The standard library tends to be a mixture of both based on how integral the author feels the ordering and comparison behaviour is to the classes involved. In this case, I currently think internal ducktyping is the right answer, but I'm open to being persuaded otherwise. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ncoghlan at gmail.com Sat Jul 7 17:18:22 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 8 Jul 2012 01:18:22 +1000 Subject: [Python-Dev] cpython: Issue 14814: Correctly return NotImplemented from ipaddress._BaseNetwork.__eq__ In-Reply-To: References: <3WTtRG6h3vzN4C@mail.python.org> <20120707155535.1f7c4778@pitrou.net> Message-ID: On Sun, Jul 8, 2012 at 12:59 AM, Nick Coghlan wrote: > On Sat, Jul 7, 2012 at 11:55 PM, Antoine Pitrou wrote: >> I think the isinstance() test was correct. If you have an object which >> duck-types IPNetwork, you probably want its __eq__ to be called, not >> yours. > > This change was just to bring IPNetwork in line with the other types > in the module and to stop it throwing TypeError itself, which meant > the RHS was never even getting a chance to affect the result. Hmm, I just noticed the __lt__ implementations still throw TypeError directly (at least in the IPNetwork case). Looks like some more cleanups are still needed in this area :P Cheers, Nick. 
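[Editor's note: a generic illustration of the tradeoff Nick describes -- not the actual ipaddress code; the Network/DuckNetwork classes here are hypothetical. Returning NotImplemented from a duck-typed __eq__ defers to the other operand instead of raising:]

```python
class Network:
    """Duck-typed equality: compare attributes, defer when they're missing."""
    def __init__(self, version, address):
        self._version = version
        self.address = address

    def __eq__(self, other):
        try:
            return (self._version == other._version and
                    self.address == other.address)
        except AttributeError:
            # Returning NotImplemented (rather than raising TypeError)
            # gives the right-hand operand a chance via its own __eq__;
            # if both sides defer, Python falls back to identity.
            return NotImplemented

class DuckNetwork:
    """Not a Network subclass, but exposes compatible attributes."""
    def __init__(self, version, address):
        self._version = version
        self.address = address

print(Network(4, '192.0.2.0') == DuckNetwork(4, '192.0.2.0'))  # True
print(Network(4, '192.0.2.0') == 'not a network')              # False, no TypeError
```

[With an isinstance() check in __eq__, the first comparison above would have raised TypeError (or returned NotImplemented) even though DuckNetwork exposes all the attributes the comparison needs; with internal duck-typing it just works.]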
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From solipsis at pitrou.net Sun Jul 8 02:27:46 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 8 Jul 2012 02:27:46 +0200 Subject: [Python-Dev] cpython (3.2): Fix issue14826 - make urllib.request.Request quoted url consistent with References: <3WV9D85ygYzNdL@mail.python.org> Message-ID: <20120708022746.041e64c7@pitrou.net> > > + def test_quote_url(self): > + Request = urllib.request.Request > + request = Request("http://www.python.org/foo bar") > + self.assertEqual(request.full_url, "http://www.python.org/foo%20bar") > + This means someone who already quotes URL will get a double escaping, right? Regards Antoine. From g.brandl at gmx.net Sun Jul 8 10:12:37 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Sun, 08 Jul 2012 10:12:37 +0200 Subject: [Python-Dev] cpython: Fix 5931 - Python runtime hardcoded in wsgiref.simple_server - Now it specifies In-Reply-To: <3WV5Xh3VfXzNcF@mail.python.org> References: <3WV5Xh3VfXzNcF@mail.python.org> Message-ID: On 07.07.2012 23:29, senthil.kumaran wrote: > http://hg.python.org/cpython/rev/34e705fa4da4 > changeset: 77983:34e705fa4da4 > user: Senthil Kumaran > date: Sat Jul 07 14:29:58 2012 -0700 > summary: > Fix 5931 - Python runtime hardcoded in wsgiref.simple_server - Now it specifies an implementation specific term. > > files: > Doc/library/wsgiref.rst | 5 +++++ > Lib/test/test_wsgiref.py | 10 ++++++++-- > Lib/wsgiref/simple_server.py | 3 ++- > Misc/NEWS | 3 +++ > 4 files changed, 18 insertions(+), 3 deletions(-) > > > diff --git a/Doc/library/wsgiref.rst b/Doc/library/wsgiref.rst > --- a/Doc/library/wsgiref.rst > +++ b/Doc/library/wsgiref.rst > @@ -609,6 +609,11 @@ > as :class:`BaseCGIHandler` and :class:`CGIHandler`) that are not HTTP origin > servers. > > + .. versionchanged:: 3.3 > + > + The term "Python" is replaced with implementation specific term like > + "CPython", "Jython" etc. Please fix the markup: no blank line, but indented. E.g. .. 
versionchanged:: 3.3 Blah, blah. Georg From senthil at uthcode.com Sun Jul 8 10:34:19 2012 From: senthil at uthcode.com (Senthil Kumaran) Date: Sun, 8 Jul 2012 01:34:19 -0700 Subject: [Python-Dev] cpython: Fix 5931 - Python runtime hardcoded in wsgiref.simple_server - Now it specifies In-Reply-To: References: <3WV5Xh3VfXzNcF@mail.python.org> Message-ID: On Sun, Jul 8, 2012 at 1:12 AM, Georg Brandl wrote: > > Please fix the markup: no blank line, but indented. E.g. > > .. versionchanged:: 3.3 > Blah, blah. Done. Sorry for that; I only built the docs, checked whether the directive was affected, and assumed it was okay. Fixed that. Thanks, Senthil From ncoghlan at gmail.com Mon Jul 9 04:13:24 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 9 Jul 2012 12:13:24 +1000 Subject: [Python-Dev] [Python-checkins] cpython: Issue 14814: The new systematic tests aren't just about error reporting any In-Reply-To: <4FF9C1E3.2000501@udel.edu> References: <3WVVWD22TgzNt6@mail.python.org> <4FF9C1E3.2000501@udel.edu> Message-ID: On Mon, Jul 9, 2012 at 3:22 AM, Terry Reedy wrote: > On 7/8/2012 9:14 AM, nick.coghlan wrote: > If you want to report both errors, when present: I don't really - when there are multiple things wrong with the address, ipaddress just reports the first one it notices (e.g. IPv4Address("google.com") will complain about the lack of "." separators rather than the fact that "google" and "com" are not valid octets). This was just a case where I started to reorder the checks to do the faster one first, and then realised it was more user friendly to check the other way around (since the success case will always include both checks, it only makes a difference to the speed of rejection of some invalid cases). Most of the other validation checks have a more obvious suitable order (for example, there's no reason to start checking individual parts if we can't even split the parts up appropriately).
>> +class BaseTestCase(unittest.TestCase): >> # One big change in ipaddress over the original ipaddr module is >> # error reporting that tries to assume users *don't know the rules* >> # for what constitutes an RFC compliant IP address > > > Good assumption. There *will* be people who learn by trial and error. *cough* It's conceivable I started down this path due to using the REPL for exploration while making an initial attempt at writing method and data attribute docs over the weekend. I'm firmly in the category of IPv4 network programming engineers with next to no IPv6 experience :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From solipsis at pitrou.net Mon Jul 9 11:14:15 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 9 Jul 2012 11:14:15 +0200 Subject: [Python-Dev] cpython: Issue #15283: Updated pyvenv documentation to expand on activation. References: <3WW0466grWzMhd@mail.python.org> Message-ID: <20120709111415.596e0047@pitrou.net> On Mon, 9 Jul 2012 10:25:58 +0200 (CEST) vinay.sajip wrote: > + > + Common installation tools such as ``distribute`` and ``pip`` work as > + expected with venvs - i.e. when a venv is active, they install Python > + packages into the venv without needing to be told to do so explicitly. Not exactly. They will do so if you *install* them in the venv. A system-wide distribute or pip, AFAIK, will still install packages into the system-wide locations. Also, I wonder why you are duplicating the exact same docs in two files. That's a maintenance burden. Regards Antoine. From g.brandl at gmx.net Mon Jul 9 12:21:29 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Mon, 09 Jul 2012 12:21:29 +0200 Subject: [Python-Dev] cpython: Issue 15265: document the exception raised for invalid sample sizes. 
In-Reply-To: <3WVlYQ41nmzNLp@mail.python.org> References: <3WVlYQ41nmzNLp@mail.python.org> Message-ID: On 09.07.2012 01:02, raymond.hettinger wrote: > http://hg.python.org/cpython/rev/72174d8af3ba > changeset: 78015:72174d8af3ba > user: Raymond Hettinger > date: Sun Jul 08 16:01:53 2012 -0700 > summary: > Issue 15265: document the exception raised for invalid sample sizes. > > files: > Doc/library/random.rst | 3 +++ > 1 files changed, 3 insertions(+), 0 deletions(-) > > > diff --git a/Doc/library/random.rst b/Doc/library/random.rst > --- a/Doc/library/random.rst > +++ b/Doc/library/random.rst > @@ -150,6 +150,9 @@ > argument. This is especially fast and space efficient for sampling from a large > population: ``sample(range(10000000), 60)``. > > + If the sample size is larger than the population size, a :exc:``ValueError`` Should be :exc:`ValueError` (one set of quotes only). Georg From msabramo at gmail.com Tue Jul 10 18:41:50 2012 From: msabramo at gmail.com (Marc Abramowitz) Date: Tue, 10 Jul 2012 09:41:50 -0700 Subject: [Python-Dev] Testing 3.3 framework builds on OS X Message-ID: Anyone know how to build and run tests on an OS X build with ?enable-framework, without installing the framework? I want to test that something works in my tree without polluting /Library/Frameworks with stuff that could mess up future tests. I've been mucking around with DYLD_FRAMEWORK_PATH but not having much luck. If I build with ./configure --enable-framework --with-dtrace && make and use DYLD_FRAMEWORK_PATH=. then I suspect it's still loading from /Library/Frameworks, because my tests fail in a way that suggests it's not finding the new bits in my working directory. If I build with ./configure --enable-framework=. --with-dtrace && make and use DYLD_FRAMEWORK_PATH=., then it fails with: dyld: Library not loaded: ./Python.framework/Versions/3.3/Python presumably because ./Python.framework doesn't have any .dylib's or .so's. 
Thanks, Marc From nad at acm.org Tue Jul 10 19:27:31 2012 From: nad at acm.org (Ned Deily) Date: Tue, 10 Jul 2012 10:27:31 -0700 Subject: [Python-Dev] Testing 3.3 framework builds on OS X References: Message-ID: In article , Marc Abramowitz wrote: > Anyone know how to build and run tests on an OS X build with > ?enable-framework, without installing the framework? I want to test > that something works in my tree without polluting /Library/Frameworks > with stuff that could mess up future tests. > > I've been mucking around with DYLD_FRAMEWORK_PATH but not having much luck. > > If I build with ./configure --enable-framework --with-dtrace && make > and use DYLD_FRAMEWORK_PATH=. then I suspect it's still loading from > /Library/Frameworks, because my tests fail in a way that suggests it's > not finding the new bits in my working directory. > > If I build with ./configure --enable-framework=. --with-dtrace && make > and use DYLD_FRAMEWORK_PATH=., then it fails with: > > dyld: Library not loaded: ./Python.framework/Versions/3.3/Python > > presumably because ./Python.framework doesn't have any .dylib's or .so's. Read the updated Mac/README for more information about framework builds. Here's a sample configure I use to minimize the chances of pollution from other installed Pythons: BASEDIR="$(dirname $(pwd -P))" [ -n "${BASEDIR}" ] && ./configure --enable-universalsdk --with-universal-archs=intel --enable-framework="${BASEDIR}"/root/Library/Frameworks --with-framework-name=pytest_10_7 --with-pydebug && rm -rf "${BASEDIR}"/root/* || echo "ERROR: configure failed" This is for 3.3 on 10.7. For 3.2 and 2.7 at the moment you also need to explicitly set MACOSX_DEPLOYMENT_TARGET=10.7 and CC=clang. The framework path probably needs to be an absolute path, BTW. 
-- Ned Deily, nad at acm.org From nad at acm.org Tue Jul 10 19:35:50 2012 From: nad at acm.org (Ned Deily) Date: Tue, 10 Jul 2012 10:35:50 -0700 Subject: [Python-Dev] Testing 3.3 framework builds on OS X References: Message-ID: In article , Ned Deily wrote: > Read the updated Mac/README for more information about framework builds. > Here's a sample configure I use to minimize the chances of pollution > from other installed Pythons: > > BASEDIR="$(dirname $(pwd -P))" > [ -n "${BASEDIR}" ] && ./configure --enable-universalsdk > --with-universal-archs=intel > --enable-framework="${BASEDIR}"/root/Library/Frameworks > --with-framework-name=pytest_10_7 --with-pydebug && rm -rf > "${BASEDIR}"/root/* || echo "ERROR: configure failed" This does require that you run "make install" but everything is safely installed at ../root, including Library/Frameworks, usr/bin, and Applications directories. -- Ned Deily, nad at acm.org From ryanpaullin at gmail.com Tue Jul 10 23:04:07 2012 From: ryanpaullin at gmail.com (Ryan Paullin) Date: Tue, 10 Jul 2012 14:04:07 -0700 Subject: [Python-Dev] Python-Dev Digest, Vol 108, Issue 7 In-Reply-To: References: Message-ID: thanks for the reply hastings ive been working on a loopback interface its done On Fri, Jul 6, 2012 at 3:00 AM, wrote: > Send Python-Dev mailing list submissions to > python-dev at python.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://mail.python.org/mailman/listinfo/python-dev > or, via email, send a message with subject or body 'help' to > python-dev-request at python.org > > You can reach the person managing the list at > python-dev-owner at python.org > > When replying, please edit your Subject line so it is more specific > than "Re: Contents of Python-Dev digest..." > > > Today's Topics: > > 1. Re: GitHub mirror (Was: Bitbucket mirror?) (martin at v.loewis.de) > 2. Bloody FAQ (Was: [Python-ideas] itertools.chunks(iterable, > size, fill=None)) (anatoly techtonik) > 3. 
Re: Bloody FAQ (Was: [Python-ideas] > itertools.chunks(iterable, size, fill=None)) (Mark Lawrence) > 4. Re: Bloody FAQ (Was: [Python-ideas] > itertools.chunks(iterable, size, fill=None)) (Stefan Behnel) > 5. Re: Bloody FAQ (Was: [Python-ideas] > itertools.chunks(iterable, size, fill=None)) (Paul Boddie) > 6. EuroPython 2012 Language Summit is Canceled. (Larry Hastings) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Thu, 05 Jul 2012 20:27:02 +0200 > From: martin at v.loewis.de > To: python-dev at python.org > Subject: Re: [Python-Dev] GitHub mirror (Was: Bitbucket mirror?) > Message-ID: > <20120705202702.Horde.Yh-RBqGZi1VP9dx2H7Nj-nA at webmail.df.eu> > Content-Type: text/plain; charset=ISO-8859-1; format=flowed; DelSp=Yes > > >> You won't get any changes in to CPython by creating pull requests. We > >> use http://bugs.python.org/ for that, sorry. > > > > Question -- is there a reason to abide by this rule for docs? That is, > if we > > could get a sympathetic core dev to look at pull requests for docs as > part of > > a streamlined process, would it cause problems? > > How do you communicate a "pull request"? On bitbucket, there is a > "pull request" > UI resulting in a tracker item being generated (and an email being sent), > but > hg.python.org doesn't have a notion of pull requests. Of course, you could > use any communication means (email, telephone call, carrier pigeon) to > request > a pull from a "sympathetic core dev". > > > (What I'm really asking is whether or the bugs.python.org process is > > considered critical for potentially minor doc changes and additions.) > > The sympathetic core dev is mostly free to bypass any submission process > initially; commits that bypass established procedures will likely be > questioned > only after the fact. > > In the specific case, I'd be worried to verify that the submitter has > provided > a contributor form. 
That's easy to do in the bug tracker, but difficult to > do > in an offline pull request. Of course, for a really minor doc change > (e.g. typo > fixes), no contrib form is necessary. > > Regards, > Martin > > > > > ------------------------------ > > Message: 2 > Date: Thu, 5 Jul 2012 22:41:29 +0300 > From: anatoly techtonik > To: Stefan Behnel > Cc: python-ideas at python.org, python-dev at python.org > Subject: [Python-Dev] Bloody FAQ (Was: [Python-ideas] > itertools.chunks(iterable, size, fill=None)) > Message-ID: > 6GnQn+o_Tb3LMnimHYs9zkYmWR1GTgA at mail.gmail.com> > Content-Type: text/plain; charset=UTF-8 > > On Thu, Jul 5, 2012 at 7:50 PM, Stefan Behnel wrote: > > anatoly techtonik, 05.07.2012 15:36: > >> On Sun, Jul 1, 2012 at 12:09 AM, Terry Reedy wrote: > >>> From Raymond's first message on http://bugs.python.org/issue6021 , add > >>> grouper: > >>> > >>> "This has been rejected before. > >> > >> I quite often see such arguments and I can't stand to repeat that > >> these are not arguments. It is good to know, but when people use that > >> as a reason to close tickets - that's just disgusting. > > > > The *real* problem is that people keep bringing up topics (and even spell > > them out in the bug tracker) without searching for existing discussions > > and/or tickets first. That's why those who do such a search (or who know > > what they are talking about anyway) close these tickets with the remark > > "this has been rejected before", instead of repeating an entire heap of > > arguments all over again to feed a discussion that would only lead to the > > same result as it did before, often several times before. > > Make the bloody FAQ and summarize this stuff? Why waste each others > time? If people don't enjoy repeating themselves over and over - there > is a bloody wiki. What should happen to people to start extracting > gems of knowledge from piles of dusty sheets called list "archives" > for others to admire. 
> > No, it is easier to say "it was already discussed many times", "why > don't you Google yourself", "so far you're only complaining", etc. If > people can't find anything - why everybody thinks they are ignorant > and lazy. Even if it so, why nobody thinks that maybe that bloody > Xapian index is dead again for a bloody amount of moons nobody knows > why and how many exactly? Why nobody thinks that lazy coders can also > help with development? Maybe that laziness is the primary reason some > major groups actually prefer Python to Java, C++ and other more > interesting languages (such as PHP) when it comes to typing? Make it > easy and the patches will follow. Answers like "this was discussed > before" don't make it easy to understand, and leaving users rereading > old 19xx archives that people don't reread themselves will likely make > users bounce and never (NEVER!) come up with some proposal again. An > "organic" way to keep traffic low. > > Miscommunication is a bad experience for users, bad experience for > developers, everybody is annoyed and as a result such nice language as > Python loses points on TIOBE (and convenient chunk() functions to > munch-munch on the sequence data). > > Wheew. :-F > > > ------------------------------ > > Message: 3 > Date: Thu, 05 Jul 2012 20:55:09 +0100 > From: Mark Lawrence > To: python-dev at python.org > Cc: python-ideas at python.org > Subject: Re: [Python-Dev] Bloody FAQ (Was: [Python-ideas] > itertools.chunks(iterable, size, fill=None)) > Message-ID: > Content-Type: text/plain; charset=ISO-8859-1; format=flowed > > On 05/07/2012 20:41, anatoly techtonik wrote: > > On Thu, Jul 5, 2012 at 7:50 PM, Stefan Behnel > wrote: > >> anatoly techtonik, 05.07.2012 15:36: > >>> On Sun, Jul 1, 2012 at 12:09 AM, Terry Reedy wrote: > >>>> From Raymond's first message on http://bugs.python.org/issue6021 , > add > >>>> grouper: > >>>> > >>>> "This has been rejected before. 
> >>> > >>> I quite often see such arguments and I can't stand to repeat that > >>> these are not arguments. It is good to know, but when people use that > >>> as a reason to close tickets - that's just disgusting. > >> > >> The *real* problem is that people keep bringing up topics (and even > spell > >> them out in the bug tracker) without searching for existing discussions > >> and/or tickets first. That's why those who do such a search (or who know > >> what they are talking about anyway) close these tickets with the remark > >> "this has been rejected before", instead of repeating an entire heap of > >> arguments all over again to feed a discussion that would only lead to > the > >> same result as it did before, often several times before. > > > > Make the bloody FAQ and summarize this stuff? Why waste each others > > time? If people don't enjoy repeating themselves over and over - there > > is a bloody wiki. What should happen to people to start extracting > > gems of knowledge from piles of dusty sheets called list "archives" > > for others to admire. > > > > No, it is easier to say "it was already discussed many times", "why > > don't you Google yourself", "so far you're only complaining", etc. If > > people can't find anything - why everybody thinks they are ignorant > > and lazy. Even if it so, why nobody thinks that maybe that bloody > > Xapian index is dead again for a bloody amount of moons nobody knows > > why and how many exactly? Why nobody thinks that lazy coders can also > > help with development? Maybe that laziness is the primary reason some > > major groups actually prefer Python to Java, C++ and other more > > interesting languages (such as PHP) when it comes to typing? Make it > > easy and the patches will follow. Answers like "this was discussed > > before" don't make it easy to understand, and leaving users rereading > > old 19xx archives that people don't reread themselves will likely make > > users bounce and never (NEVER!) 
come up with some proposal again. An > > "organic" way to keep traffic low. > > > > Miscommunication is a bad experience for users, bad experience for > > developers, everybody is annoyed and as a result such nice language as > > Python loses points on TIOBE (and convenient chunk() functions to > > munch-munch on the sequence data). > > > > Wheew. :-F > > > > Can I safely assume that you are volunteering to do the work required? > > -- > Cheers. > > Mark Lawrence. > > > > > > ------------------------------ > > Message: 4 > Date: Thu, 05 Jul 2012 21:58:52 +0200 > From: Stefan Behnel > To: python-dev at python.org > Cc: python-ideas at python.org > Subject: Re: [Python-Dev] Bloody FAQ (Was: [Python-ideas] > itertools.chunks(iterable, size, fill=None)) > Message-ID: > Content-Type: text/plain; charset=UTF-8 > > anatoly techtonik, 05.07.2012 21:41: > > On Thu, Jul 5, 2012 at 7:50 PM, Stefan Behnel wrote: > >> anatoly techtonik, 05.07.2012 15:36: > >>> On Sun, Jul 1, 2012 at 12:09 AM, Terry Reedy wrote: > >>>> From Raymond's first message on http://bugs.python.org/issue6021 , > add > >>>> grouper: > >>>> > >>>> "This has been rejected before. > >>> > >>> I quite often see such arguments and I can't stand to repeat that > >>> these are not arguments. It is good to know, but when people use that > >>> as a reason to close tickets - that's just disgusting. > >> > >> The *real* problem is that people keep bringing up topics (and even > spell > >> them out in the bug tracker) without searching for existing discussions > >> and/or tickets first. That's why those who do such a search (or who know > >> what they are talking about anyway) close these tickets with the remark > >> "this has been rejected before", instead of repeating an entire heap of > >> arguments all over again to feed a discussion that would only lead to > the > >> same result as it did before, often several times before. > > > > Make the bloody FAQ and summarize this stuff? Why waste each others > > time? 
> > Yes, that is exactly the question. > > It takes time to write things up nicely. I mean, once someone has pointed > out to you that this has been discussed before, you could just go, look it > up (or search for it), and then put it into a Wiki or blog post yourself, > or sum it up and send it to the mailing list as a reply. Why rely on others > to do it for you? > > Stefan > > > > ------------------------------ > > Message: 5 > Date: Thu, 5 Jul 2012 23:11:46 +0200 > From: Paul Boddie > To: python-dev at python.org > Subject: Re: [Python-Dev] Bloody FAQ (Was: [Python-ideas] > itertools.chunks(iterable, size, fill=None)) > Message-ID: <201207052311.46867.paul at boddie.org.uk> > Content-Type: text/plain; charset="utf-8" > > Stefan Behnel wrote: > > anatoly techtonik, 05.07.2012 21:41: > > > > > > Make the bloody FAQ and summarize this stuff? Why waste each others > > > time? > > > > Yes, that is exactly the question. > > > > It takes time to write things up nicely. I mean, once someone has pointed > > out to you that this has been discussed before, you could just go, look > it > > up (or search for it), and then put it into a Wiki or blog post yourself, > > or sum it up and send it to the mailing list as a reply. Why rely on > others > > to do it for you? > > To be fair, Anatoly has done quite a bit of maintenance on some of the Wiki > content around various aspects of the project, so it's not as if he's > demanding anything out of the ordinary or asking for others to do things > that > he isn't already doing in some sense. My experience is that there usually > needs to be some willingness on the other end of the transaction, and if it > takes repetition to encourage it amongst those who don't see the current > situation as a problem for them, then so be it. > > Of course, this kind of documentation activity, where one gathers together > historical decisions and the consensus from long-forgotten discussions, is > pretty thankless work. 
I occasionally regard it as worthwhile if only to > bring up something someone said as an inconvenient interruption in any > current discussion, but that's a pretty minimal reward for all the effort > unless one has such work as part of one's daily routine. > > Paul > > > ------------------------------ > > Message: 6 > Date: Fri, 06 Jul 2012 08:47:30 +0200 > From: Larry Hastings > To: python-dev at python.org, python-committers at python.org > Subject: [Python-Dev] EuroPython 2012 Language Summit is Canceled. > Message-ID: <4FF68A02.8000500 at hastings.org> > Content-Type: text/plain; charset="iso-8859-1"; Format="flowed" > > > > I only got one more RSVP and zero topics for the docket. So let's > sprint instead. > > See you at the PyCon 2013 Language Summit, > > > //arry/ > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: < > http://mail.python.org/pipermail/python-dev/attachments/20120706/f13295aa/attachment-0001.html > > > > ------------------------------ > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > > > End of Python-Dev Digest, Vol 108, Issue 7 > ****************************************** > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ryanpaullin at gmail.com Tue Jul 10 23:04:24 2012 From: ryanpaullin at gmail.com (Ryan Paullin) Date: Tue, 10 Jul 2012 14:04:24 -0700 Subject: [Python-Dev] Python-Dev Digest, Vol 108, Issue 7 In-Reply-To: References: Message-ID: spoke too early on its done sorry On Tue, Jul 10, 2012 at 2:04 PM, Ryan Paullin wrote: > thanks for the reply hastings ive been working on a loopback interface its > done
>> > >> > Miscommunication is a bad experience for users, bad experience for >> > developers, everybody is annoyed and as a result such nice language as >> > Python loses points on TIOBE (and convenient chunk() functions to >> > munch-munch on the sequence data). >> > >> > Wheew. :-F >> > >> >> Can I safely assume that you are volunteering to do the work required? >> >> -- >> Cheers. >> >> Mark Lawrence. >> >> >> >> >> >> ------------------------------ >> >> Message: 4 >> Date: Thu, 05 Jul 2012 21:58:52 +0200 >> From: Stefan Behnel >> To: python-dev at python.org >> Cc: python-ideas at python.org >> Subject: Re: [Python-Dev] Bloody FAQ (Was: [Python-ideas] >> itertools.chunks(iterable, size, fill=None)) >> Message-ID: >> Content-Type: text/plain; charset=UTF-8 >> >> anatoly techtonik, 05.07.2012 21:41: >> > On Thu, Jul 5, 2012 at 7:50 PM, Stefan Behnel wrote: >> >> anatoly techtonik, 05.07.2012 15:36: >> >>> On Sun, Jul 1, 2012 at 12:09 AM, Terry Reedy wrote: >> >>>> From Raymond's first message on http://bugs.python.org/issue6021 , >> add >> >>>> grouper: >> >>>> >> >>>> "This has been rejected before. >> >>> >> >>> I quite often see such arguments and I can't stand to repeat that >> >>> these are not arguments. It is good to know, but when people use that >> >>> as a reason to close tickets - that's just disgusting. >> >> >> >> The *real* problem is that people keep bringing up topics (and even >> spell >> >> them out in the bug tracker) without searching for existing discussions >> >> and/or tickets first. That's why those who do such a search (or who >> know >> >> what they are talking about anyway) close these tickets with the remark >> >> "this has been rejected before", instead of repeating an entire heap of >> >> arguments all over again to feed a discussion that would only lead to >> the >> >> same result as it did before, often several times before. >> > >> > Make the bloody FAQ and summarize this stuff? Why waste each others >> > time? 
>> >> Yes, that is exactly the question. >> >> It takes time to write things up nicely. I mean, once someone has pointed >> out to you that this has been discussed before, you could just go, look it >> up (or search for it), and then put it into a Wiki or blog post yourself, >> or sum it up and send it to the mailing list as a reply. Why rely on >> others >> to do it for you? >> >> Stefan >> >> >> >> ------------------------------ >> >> Message: 5 >> Date: Thu, 5 Jul 2012 23:11:46 +0200 >> From: Paul Boddie >> To: python-dev at python.org >> Subject: Re: [Python-Dev] Bloody FAQ (Was: [Python-ideas] >> itertools.chunks(iterable, size, fill=None)) >> Message-ID: <201207052311.46867.paul at boddie.org.uk> >> Content-Type: text/plain; charset="utf-8" >> >> Stefan Behnel wrote: >> > anatoly techtonik, 05.07.2012 21:41: >> > > >> > > Make the bloody FAQ and summarize this stuff? Why waste each others >> > > time? >> > >> > Yes, that is exactly the question. >> > >> > It takes time to write things up nicely. I mean, once someone has >> pointed >> > out to you that this has been discussed before, you could just go, look >> it >> > up (or search for it), and then put it into a Wiki or blog post >> yourself, >> > or sum it up and send it to the mailing list as a reply. Why rely on >> others >> > to do it for you? >> >> To be fair, Anatoly has done quite a bit of maintenance on some of the >> Wiki >> content around various aspects of the project, so it's not as if he's >> demanding anything out of the ordinary or asking for others to do things >> that >> he isn't already doing in some sense. My experience is that there usually >> needs to be some willingness on the other end of the transaction, and if >> it >> takes repetition to encourage it amongst those who don't see the current >> situation as a problem for them, then so be it. 
>>
>> Of course, this kind of documentation activity, where one gathers together
>> historical decisions and the consensus from long-forgotten discussions, is
>> pretty thankless work. I occasionally regard it as worthwhile if only to
>> bring up something someone said as an inconvenient interruption in any
>> current discussion, but that's a pretty minimal reward for all the effort
>> unless one has such work as part of one's daily routine.
>>
>> Paul
>>
>>
>> ------------------------------
>>
>> Message: 6
>> Date: Fri, 06 Jul 2012 08:47:30 +0200
>> From: Larry Hastings
>> To: python-dev at python.org, python-committers at python.org
>> Subject: [Python-Dev] EuroPython 2012 Language Summit is Canceled.
>> Message-ID: <4FF68A02.8000500 at hastings.org>
>> Content-Type: text/plain; charset="iso-8859-1"; Format="flowed"
>>
>>
>> I only got one more RSVP and zero topics for the docket.  So let's
>> sprint instead.
>>
>> See you at the PyCon 2013 Language Summit,
>>
>>
>> //arry/
>> -------------- next part --------------
>> An HTML attachment was scrubbed...
>> URL: <
>> http://mail.python.org/pipermail/python-dev/attachments/20120706/f13295aa/attachment-0001.html
>> >
>>
>> ------------------------------
>>
>> _______________________________________________
>> Python-Dev mailing list
>> Python-Dev at python.org
>> http://mail.python.org/mailman/listinfo/python-dev
>>
>>
>> End of Python-Dev Digest, Vol 108, Issue 7
>> ******************************************
>>
>
> -------------- next part --------------

An HTML attachment was scrubbed...
URL: 

From ronaldoussoren at mac.com  Wed Jul 11 07:30:43 2012
From: ronaldoussoren at mac.com (Ronald Oussoren)
Date: Wed, 11 Jul 2012 07:30:43 +0200
Subject: [Python-Dev] Testing 3.3 framework builds on OS X
In-Reply-To: 
References: 
Message-ID: 

On 10 Jul, 2012, at 18:41, Marc Abramowitz wrote:

> Anyone know how to build and run tests on an OS X build with
> --enable-framework, without installing the framework?
I want to test
> that something works in my tree without polluting /Library/Frameworks
> with stuff that could mess up future tests.
>
> I've been mucking around with DYLD_FRAMEWORK_PATH but not having much luck.
>

Setting DYLD_FRAMEWORK_PATH should work and is what's used for RUNSHARED
in the Makefile.  I regularly build using 'configure --enable-framework'
and then use 'make test' to test that build. This runs the python.exe in
the build directory with DYLD_FRAMEWORK_PATH set to the build directory
(as an absolute path).

Ronald

>
> Thanks,
> Marc
> _______________________________________________
> Python-Dev mailing list
> Python-Dev at python.org
> http://mail.python.org/mailman/listinfo/python-dev
> Unsubscribe: http://mail.python.org/mailman/options/python-dev/ronaldoussoren%40mac.com

-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 4788 bytes
Desc: not available
URL: 

From steve at holdenweb.com  Thu Jul 12 02:54:33 2012
From: steve at holdenweb.com (Steve Holden)
Date: Wed, 11 Jul 2012 19:54:33 -0500
Subject: [Python-Dev] Fwd: How to recruit open-source contributors – The Story of Data
Message-ID: <44E4E015-4C35-4265-A4EF-51B2D3A64086@holdenweb.com>

Interesting article about Gentoo's approach to developer recruitment from GSoC:

http://redmonk.com/dberkholz/2012/07/10/how-to-recruit-open-source-contributors/

S
--
Steve Holden steve at holdenweb.com, Holden Web, LLC http://holdenweb.com/
Python classes (and much more) through the web http://oreillyschool.com/
Conferences and technical event management at http://theopenbastion.com/
Next: DjangoCon US Sep 6-8, Washington DC http://djangocon.us/

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ryanpaullin at gmail.com  Thu Jul 12 12:30:59 2012
From: ryanpaullin at gmail.com (Ryan Paullin)
Date: Thu, 12 Jul 2012 03:30:59 -0700
Subject: [Python-Dev] Python-Dev Digest, Vol 108, Issue 14
In-Reply-To: 
References: 
Message-ID: 

looks like theres no forgiveness except for dj yoda

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ryanpaullin at gmail.com  Thu Jul 12 12:31:18 2012
From: ryanpaullin at gmail.com (Ryan Paullin)
Date: Thu, 12 Jul 2012 03:31:18 -0700
Subject: [Python-Dev] Python-Dev Digest, Vol 108, Issue 14
In-Reply-To: 
References: 
Message-ID: 

<- its just my gmail face

On Thu, Jul 12, 2012 at 3:30 AM, Ryan Paullin wrote:
> looks like theres no forgiveness except for dj yoda

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ryanpaullin at gmail.com  Thu Jul 12 12:34:20 2012
From: ryanpaullin at gmail.com (Ryan Paullin)
Date: Thu, 12 Jul 2012 03:34:20 -0700
Subject: [Python-Dev] Python-Dev Digest, Vol 108, Issue 14
In-Reply-To: 
References: 
Message-ID: 

$ python -m timeit -s 'import test' 'list(test.grouper(2,"abcdef"))'
100000 loops, best of 3: 5.34 usec per loop
$ python -m timeit -s 'import test' 'test.grouper(2,"abcdef")'
100000 loops, best of 3: 2.19 usec per loop
$ python -m timeit -s 'import test' 'list(test.chunks(2,"abcdef"))'
100000 loops, best of 3: 2.85 usec per loop
$ python -m timeit -s 'import test' 'test.chunks(2,"abcdef")'
1000000 loops, best of 3: 0.685 usec per loop

some woman wrote this

On Thu, Jul 12, 2012 at 3:31 AM, Ryan Paullin wrote:
> <- its just my gmail face

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From rdmurray at bitdance.com  Thu Jul 12 15:46:27 2012
From: rdmurray at bitdance.com (R.
David Murray)
Date: Thu, 12 Jul 2012 09:46:27 -0400
Subject: [Python-Dev] Python-Dev Digest, Vol 108, Issue 14
In-Reply-To: 
References: 
Message-ID: <20120712134628.2A523250823@webabinitio.net>

On Thu, 12 Jul 2012 03:34:20 -0700, Ryan Paullin wrote:
> $ python -m timeit -s 'import test' 'list(test.grouper(2,"abcdef"))'
> 100000 loops, best of 3: 5.34 usec per loop
> $ python -m timeit -s 'import test' 'test.grouper(2,"abcdef")'
> 100000 loops, best of 3: 2.19 usec per loop
> $ python -m timeit -s 'import test' 'list(test.chunks(2,"abcdef"))'
> 100000 loops, best of 3: 2.85 usec per loop
> $ python -m timeit -s 'import test' 'test.chunks(2,"abcdef")'
> 1000000 loops, best of 3: 0.685 usec per loop
>
> some woman wrote this

I don't have any idea what any of your recent posts mean, but this one
appears to be offensive.  I would appreciate it if you would stop posting
until you have something substantive to say, and can do so in a civil
fashion.

--David

From solipsis at pitrou.net  Thu Jul 12 18:20:32 2012
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Thu, 12 Jul 2012 18:20:32 +0200
Subject: [Python-Dev] How to recruit open-source contributors – The Story of Data
References: <44E4E015-4C35-4265-A4EF-51B2D3A64086@holdenweb.com>
Message-ID: <20120712182032.3d68e891@pitrou.net>

On Wed, 11 Jul 2012 19:54:33 -0500
Steve Holden wrote:
> Interesting article about Gentoo's approach to developer recruitment from GSoC:
>
> http://redmonk.com/dberkholz/2012/07/10/how-to-recruit-open-source-contributors/

This is quite an interesting article, but I don't think we are that high
in pyramid of needs; we would first need someone dedicated enough to keep
track of all GSoC projects.

(and we would also need mentoring of potential mentors)

Regards

Antoine.
-- Software development and contracting: http://pro.pitrou.net From status at bugs.python.org Fri Jul 13 18:07:18 2012 From: status at bugs.python.org (Python tracker) Date: Fri, 13 Jul 2012 18:07:18 +0200 (CEST) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20120713160718.808121C984@psf.upfronthosting.co.za> ACTIVITY SUMMARY (2012-07-06 - 2012-07-13) Python tracker at http://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. Issues counts and deltas: open 3520 (+25) closed 23603 (+57) total 27123 (+82) Open issues with patches: 1485 Issues opened (58) ================== #14826: urllib2.urlopen fails to load URL http://bugs.python.org/issue14826 reopened by rosslagerwall #15264: PyErr_SetFromErrnoWithFilenameObject() undocumented http://bugs.python.org/issue15264 opened by pitrou #15266: Perform the same checks as PyPI for Description field http://bugs.python.org/issue15266 opened by cjerdonek #15267: tempfile.TemporaryFile and httplib incompatibility http://bugs.python.org/issue15267 opened by tzs #15268: curses configure checks fail if only /usr/include/ncursesw/cur http://bugs.python.org/issue15268 opened by doko #15269: Document dircmp.left and dircmp.right http://bugs.python.org/issue15269 opened by cjerdonek #15270: "Economy of Expression" section outdated http://bugs.python.org/issue15270 opened by cjerdonek #15271: argparse: repeatedly specifying the same argument ignores the http://bugs.python.org/issue15271 opened by mapleoin #15272: pkgutil.find_loader accepts invalid module names http://bugs.python.org/issue15272 opened by ncoghlan #15273: Remove unnecessarily random behavior from test_unparse.py http://bugs.python.org/issue15273 opened by larry #15275: isinstance is called a more times that needed in ntpath http://bugs.python.org/issue15275 opened by mandel #15276: unicode format does not really work in Python 2.x http://bugs.python.org/issue15276 opened by 
Ariel.Ben-Yehuda #15278: UnicodeDecodeError when readline in codecs.py http://bugs.python.org/issue15278 opened by lovelylain #15279: Spurious unittest warnings http://bugs.python.org/issue15279 opened by lukasz.langa #15280: Don't use builtins as variable names in urllib.request http://bugs.python.org/issue15280 opened by bbrazil #15285: test_timeout failure when system on IPv4 10.x.x.x subnet http://bugs.python.org/issue15285 opened by flox #15286: normpath does not work with local literal paths http://bugs.python.org/issue15286 opened by mandel #15292: import hook behavior documentation improvement http://bugs.python.org/issue15292 opened by iko #15295: Document PEP 420 namespace packages http://bugs.python.org/issue15295 opened by brett.cannon #15297: pkgutil.iter_importers() includes an ImpImporter http://bugs.python.org/issue15297 opened by cjerdonek #15298: _sysconfigdata is generated in srcdir, not builddir http://bugs.python.org/issue15298 opened by doko #15299: pkgutil.ImpImporter(None).iter_modules() does not search sys.p http://bugs.python.org/issue15299 opened by cjerdonek #15301: os.chown: OverflowError: Python int too large to convert to C http://bugs.python.org/issue15301 opened by do1 #15302: Use argparse instead of getopt in test.regrtest http://bugs.python.org/issue15302 opened by cjerdonek #15303: Minor revision to the method in Tkinter http://bugs.python.org/issue15303 opened by Drew.French #15304: Wrong path in test.support.temp_cwd() error message http://bugs.python.org/issue15304 opened by cjerdonek #15305: Test harness unnecessarily disambiguating twice http://bugs.python.org/issue15305 opened by cjerdonek #15307: Patch for --symlink support in pyvenv with framework python http://bugs.python.org/issue15307 opened by ronaldoussoren #15308: IDLE - add an "Interrupt Execution" to shell menu http://bugs.python.org/issue15308 opened by serwy #15310: urllib: Support for multiple WWW-Authenticate headers and/or m http://bugs.python.org/issue15310 
opened by almost #15311: Developer Guide doesn't get updated once a day http://bugs.python.org/issue15311 opened by cjerdonek #15313: IDLE - remove all bare excepts http://bugs.python.org/issue15313 opened by serwy #15314: Use importlib instead of pkgutil in runpy http://bugs.python.org/issue15314 opened by ncoghlan #15315: Can't build Python extension with mingw32 on Windows http://bugs.python.org/issue15315 opened by cmcqueen1975 #15317: Source installation sets incorrect permissions for Grammar3.2. http://bugs.python.org/issue15317 opened by tpievila #15318: IDLE - sys.stdin is writeable http://bugs.python.org/issue15318 opened by serwy #15320: thread-safety issue in regrtest.main() http://bugs.python.org/issue15320 opened by cjerdonek #15321: bdist_wininst installers may terminate with "close failed in f http://bugs.python.org/issue15321 opened by mhammond #15322: sysconfig.get_config_var('srcdir') returns unexpected value http://bugs.python.org/issue15322 opened by cjerdonek #15323: Provide target name in output message when Mock.assert_called_ http://bugs.python.org/issue15323 opened by Brian.Jones #15324: --match does not work for regrtest http://bugs.python.org/issue15324 opened by cjerdonek #15325: --fromfile does not work for regrtest http://bugs.python.org/issue15325 opened by cjerdonek #15326: --random does not work for regrtest http://bugs.python.org/issue15326 opened by cjerdonek #15327: Argparse: main arguments and subparser arguments indistinguish http://bugs.python.org/issue15327 opened by Ingo.Fischer #15328: datetime.strptime slow http://bugs.python.org/issue15328 opened by Lars.Nordin #15329: clarify which deque methods are thread-safe http://bugs.python.org/issue15329 opened by cjerdonek #15331: Missing codec aliases for bytes-bytes codecs http://bugs.python.org/issue15331 opened by ncoghlan #15332: 2to3 should fix bad indentation (or warn about it) http://bugs.python.org/issue15332 opened by jwilk #15334: access denied for 
HKEY_PERFORMANCE_DATA http://bugs.python.org/issue15334 opened by jkloth #15335: IDLE - debugger steps through run.py internals http://bugs.python.org/issue15335 opened by serwy #15336: Argparse required arguments incorrectly displayed as optional http://bugs.python.org/issue15336 opened by rhettinger #15337: The cmd module incorrectly lists "help" as an undocument comma http://bugs.python.org/issue15337 opened by rhettinger #15338: test_UNC_path failure in test_import http://bugs.python.org/issue15338 opened by pitrou #15339: document the threading "facts of life" in Python http://bugs.python.org/issue15339 opened by cjerdonek #15340: OSError with "import random" when /dev/urandom doesn't exist ( http://bugs.python.org/issue15340 opened by iwienand #15343: "pydoc -w " writes out page with empty "Package Conte http://bugs.python.org/issue15343 opened by christopherthemagnificent #15344: devinabox: failure when running make_a_box multiple times http://bugs.python.org/issue15344 opened by eric.snow #15345: HOWTOs Argparse tutorial - code example raises SyntaxError http://bugs.python.org/issue15345 opened by simon.hayward Most recent 15 issues with no replies (15) ========================================== #15345: HOWTOs Argparse tutorial - code example raises SyntaxError http://bugs.python.org/issue15345 #15343: "pydoc -w " writes out page with empty "Package Conte http://bugs.python.org/issue15343 #15340: OSError with "import random" when /dev/urandom doesn't exist ( http://bugs.python.org/issue15340 #15337: The cmd module incorrectly lists "help" as an undocument comma http://bugs.python.org/issue15337 #15336: Argparse required arguments incorrectly displayed as optional http://bugs.python.org/issue15336 #15334: access denied for HKEY_PERFORMANCE_DATA http://bugs.python.org/issue15334 #15327: Argparse: main arguments and subparser arguments indistinguish http://bugs.python.org/issue15327 #15326: --random does not work for regrtest http://bugs.python.org/issue15326 
#15325: --fromfile does not work for regrtest
        http://bugs.python.org/issue15325
#15321: bdist_wininst installers may terminate with "close failed in f
        http://bugs.python.org/issue15321
#15303: Minor revision to the method in Tkinter
        http://bugs.python.org/issue15303
#15280: Don't use builtins as variable names in urllib.request
        http://bugs.python.org/issue15280
#15278: UnicodeDecodeError when readline in codecs.py
        http://bugs.python.org/issue15278
#15275: isinstance is called a more times that needed in ntpath
        http://bugs.python.org/issue15275
#15269: Document dircmp.left and dircmp.right
        http://bugs.python.org/issue15269


Most recent 15 issues waiting for review (15)
=============================================

#15345: HOWTOs Argparse tutorial - code example raises SyntaxError
        http://bugs.python.org/issue15345
#15334: access denied for HKEY_PERFORMANCE_DATA
        http://bugs.python.org/issue15334
#15323: Provide target name in output message when Mock.assert_called_
        http://bugs.python.org/issue15323
#15320: thread-safety issue in regrtest.main()
        http://bugs.python.org/issue15320
#15318: IDLE - sys.stdin is writeable
        http://bugs.python.org/issue15318
#15311: Developer Guide doesn't get updated once a day
        http://bugs.python.org/issue15311
#15310: urllib: Support for multiple WWW-Authenticate headers and/or m
        http://bugs.python.org/issue15310
#15308: IDLE - add an "Interrupt Execution" to shell menu
        http://bugs.python.org/issue15308
#15307: Patch for --symlink support in pyvenv with framework python
        http://bugs.python.org/issue15307
#15304: Wrong path in test.support.temp_cwd() error message
        http://bugs.python.org/issue15304
#15302: Use argparse instead of getopt in test.regrtest
        http://bugs.python.org/issue15302
#15299: pkgutil.ImpImporter(None).iter_modules() does not search sys.p
        http://bugs.python.org/issue15299
#15298: _sysconfigdata is generated in srcdir, not builddir
        http://bugs.python.org/issue15298
#15286: normpath does not work with local literal paths
        http://bugs.python.org/issue15286
#15280: Don't use builtins as variable names in urllib.request
        http://bugs.python.org/issue15280


Top 10 most discussed issues (10)
=================================

#15318: IDLE - sys.stdin is writeable
        http://bugs.python.org/issue15318   22 msgs
#14814: Implement PEP 3144 (the ipaddress module)
        http://bugs.python.org/issue14814   20 msgs
#14826: urllib2.urlopen fails to load URL
        http://bugs.python.org/issue14826   17 msgs
#15320: thread-safety issue in regrtest.main()
        http://bugs.python.org/issue15320   14 msgs
#15302: Use argparse instead of getopt in test.regrtest
        http://bugs.python.org/issue15302   10 msgs
#4832: IDLE does not supply a default ext of .py on Windows or OS X f
        http://bugs.python.org/issue4832    9 msgs
#15144: Possible integer overflow in operations with addresses and siz
        http://bugs.python.org/issue15144   9 msgs
#15231: update PyPI upload doc to say --no-raw passed to rst2html.py
        http://bugs.python.org/issue15231   9 msgs
#15338: test_UNC_path failure in test_import
        http://bugs.python.org/issue15338   9 msgs
#15285: test_timeout failure when system on IPv4 10.x.x.x subnet
        http://bugs.python.org/issue15285   8 msgs


Issues closed (53)
==================

#5931: Python runtime name hardcoded in wsgiref.simple_server
        http://bugs.python.org/issue5931  closed by  orsenthil
#9867: Interrupted system calls are not retried
        http://bugs.python.org/issue9867  closed by  pitrou
#10248: Fix resource warnings in test_xmlrpclib
        http://bugs.python.org/issue10248  closed by  bbrazil
#11153: urllib2 basic auth parser handle unquoted realm in WWW-Authent
        http://bugs.python.org/issue11153  closed by  orsenthil
#11319: Command line option -t (and -tt) does not work for a particula
        http://bugs.python.org/issue11319  closed by  gvanrossum
#11624: distutils should support a custom list of exported symbols for
        http://bugs.python.org/issue11624  closed by  dholth
#11796: Comprehensions in a class definition mostly cannot access clas
        http://bugs.python.org/issue11796  closed by  flox
#12081: Remove distributed copy of libffi
        http://bugs.python.org/issue12081  closed by  loewis
#12927: test_ctypes: segfault with suncc
        http://bugs.python.org/issue12927  closed by  skrah
#13532: In IDLE, sys.stdout and sys.stderr can write any pickleable ob
        http://bugs.python.org/issue13532  closed by  loewis
#13686: Some notes on the docs of multiprocessing
        http://bugs.python.org/issue13686  closed by  eli.bendersky
#13959: Re-implement parts of imp in pure Python
        http://bugs.python.org/issue13959  closed by  brett.cannon
#14190: Minor C API documentation bugs
        http://bugs.python.org/issue14190  closed by  eli.bendersky
#14241: io.UnsupportedOperation.__new__(io.UnsupportedOperation) fails
        http://bugs.python.org/issue14241  closed by  Mark.Shannon
#14590: ConfigParser doesn't strip inline comment when delimiter occur
        http://bugs.python.org/issue14590  closed by  lukasz.langa
#14990: detect_encoding should fail with SyntaxError on invalid encodi
        http://bugs.python.org/issue14990  closed by  flox
#15053: imp.lock_held() "Changed in Python 3.3" mention accidentally o
        http://bugs.python.org/issue15053  closed by  brett.cannon
#15056: Have imp.cache_from_source() raise NotImplementedError when ca
        http://bugs.python.org/issue15056  closed by  brett.cannon
#15110: strange Tracebacks with importlib
        http://bugs.python.org/issue15110  closed by  pitrou
#15111: Wrong ImportError message with importlib
        http://bugs.python.org/issue15111  closed by  brett.cannon
#15167: Re-implement imp.get_magic() in pure Python
        http://bugs.python.org/issue15167  closed by  brett.cannon
#15242: PyImport_GetMagicTag() should use the same const char * as sys
        http://bugs.python.org/issue15242  closed by  eric.snow
#15247: io.open() is inconsistent re os.open()
        http://bugs.python.org/issue15247  closed by  pitrou
#15256: Typo in error message
        http://bugs.python.org/issue15256  closed by  brett.cannon
#15259: "Helping with Documentation" references missing dailybuild.py
        http://bugs.python.org/issue15259  closed by  ned.deily
#15260: Mention how to order Misc/NEWS entries
        http://bugs.python.org/issue15260  closed by  ned.deily
#15262: Idle does not show traceback in other threads
        http://bugs.python.org/issue15262  closed by  terry.reedy
#15265: random.sample() docs unclear on k < len(population)
        http://bugs.python.org/issue15265  closed by  rhettinger
#15274: Patch for issue 5765: stack overflow evaluating eval("()" * 30
        http://bugs.python.org/issue15274  closed by  ag6502
#15277: Fix resource leak in support.py:_is_ipv6_enabled
        http://bugs.python.org/issue15277  closed by  rosslagerwall
#15281: pyvenv --symlinks option is a no-op?
        http://bugs.python.org/issue15281  closed by  python-dev
#15282: pysetup still installed
        http://bugs.python.org/issue15282  closed by  pitrou
#15283: pyvenv says nothing on success
        http://bugs.python.org/issue15283  closed by  vinay.sajip
#15284: Handle ipv6 not being enabled in test_socket
        http://bugs.python.org/issue15284  closed by  bbrazil
#15287: support.TESTFN was modified by test_builtin
        http://bugs.python.org/issue15287  closed by  flox
#15288: Clarify the pkgutil.walk_packages() note
        http://bugs.python.org/issue15288  closed by  brett.cannon
#15289: Adding __getitem__ as a class method doesn't work as expected
        http://bugs.python.org/issue15289  closed by  eric.snow
#15290: setAttribute() can fail
        http://bugs.python.org/issue15290  closed by  loewis
#15291: test_ast leaks memory a lot
        http://bugs.python.org/issue15291  closed by  pitrou
#15293: AST nodes do not support garbage collection
        http://bugs.python.org/issue15293  closed by  python-dev
#15294: regression with nested namespace packages
        http://bugs.python.org/issue15294  closed by  pitrou
#15296: Minidom can't create ASCII representation
        http://bugs.python.org/issue15296  closed by  eli.bendersky
#15300: test directory doubly-nested running tests with -j/--multiproc
        http://bugs.python.org/issue15300  closed by  pitrou
#15306: Python3 segfault? (works in Python2)
        http://bugs.python.org/issue15306  closed by  amaury.forgeotdarc
#15309: buffer/memoryview slice assignment uses only memcpy
        http://bugs.python.org/issue15309  closed by  skrah
#15312: Serial library not found
        http://bugs.python.org/issue15312  closed by  ezio.melotti
#15316: runpy swallows ImportError information with relative imports
        http://bugs.python.org/issue15316  closed by  amaury.forgeotdarc
#15319: IDLE - readline, isatty, and input broken
        http://bugs.python.org/issue15319  closed by  loewis
#15330: allow deque to act as a thread-safe circular buffer
        http://bugs.python.org/issue15330  closed by  rhettinger
#15333: import on Windows will recompile a pyc file created on Unix
        http://bugs.python.org/issue15333  closed by  pitrou
#15341: Cplex and python
        http://bugs.python.org/issue15341  closed by  amaury.forgeotdarc
#15342: os.path.join behavior
        http://bugs.python.org/issue15342  closed by  ned.deily
#1616125: Cached globals+builtins lookup optimization
        http://bugs.python.org/issue1616125  closed by  ag6502

From alex.gaynor at gmail.com  Sun Jul 15 00:11:46 2012
From: alex.gaynor at gmail.com (Alex Gaynor)
Date: Sat, 14 Jul 2012 22:11:46 +0000 (UTC)
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
Message-ID:

Hi all,

I've just submitted a PEP proposing making __length_hint__ a public API for
users to define and other VMs to implement:

PEP: 424
Title: A method for exposing a length hint
Version: $Revision$
Last-Modified: $Date
Author: Alex Gaynor
Status: Draft
Type: Standards Track
Content-Type: text/x-rst
Created: 14-July-2012
Python-Version: 3.4

Abstract
========

CPython currently defines an ``__length_hint__`` method on several types, such
as various iterators. This method is then used by various other functions
(such as ``map``) to presize lists based on the estimate returned by
``__length_hint__``.
Types can then define ``__length_hint__`` which are not sized, and thus should
not define ``__len__``, but can estimate or compute a size (such as many
iterators).

Proposal
========

This PEP proposes formally documenting ``__length_hint__`` for other
interpreter and non-standard library Python to implement.

``__length_hint__`` must return an integer, and is not required to be accurate.
It may return a value that is either larger or smaller than the actual size of
the container. It may raise a ``TypeError`` if a specific instance cannot have
its length estimated. It may not return a negative value.

Rationale
=========

Being able to pre-allocate lists based on the expected size, as estimated by
``__length_hint__``, can be a significant optimization. CPython has been
observed to run some code faster than PyPy, purely because of this optimization
being present.

Open questions
==============

There are two open questions for this PEP:

* Should ``list`` expose a kwarg in it's constructor for supplying a length
  hint.
* Should a function be added either to ``builtins`` or some other module which
  calls ``__length_hint__``, like ``builtins.len`` calls ``__len__``.

Copyright
=========

This document has been placed into the public domain.

..
   Local Variables:
   mode: indented-text
   indent-tabs-mode: nil
   sentence-end-double-space: t
   fill-column: 70
   coding: utf-8

Alex

From benjamin at python.org  Sun Jul 15 01:18:38 2012
From: benjamin at python.org (Benjamin Peterson)
Date: Sat, 14 Jul 2012 16:18:38 -0700
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To:
References:
Message-ID:

2012/7/14 Alex Gaynor :
>
> Proposal
> ========
>
> This PEP proposes formally documenting ``__length_hint__`` for other
> interpreter and non-standard library Python to implement.
>
> ``__length_hint__`` must return an integer, and is not required to be accurate.
> It may return a value that is either larger or smaller than the actual size of
> the container.
It may raise a ``TypeError`` if a specific instance cannot have
> its length estimated. It may not return a negative value.

And what happens if you return a negative value?

>
> Rationale
> =========
>
> Being able to pre-allocate lists based on the expected size, as estimated by
> ``__length_hint__``, can be a significant optimization. CPython has been
> observed to run some code faster than PyPy, purely because of this optimization
> being present.
>
> Open questions
> ==============
>
> There are two open questions for this PEP:
>
> * Should ``list`` expose a kwarg in it's constructor for supplying a length
> hint.
> * Should a function be added either to ``builtins`` or some other module which
> calls ``__length_hint__``, like ``builtins.len`` calls ``__len__``.

Let's try to keep this as limited as possible for a public API.

-- 
Regards,
Benjamin

From alex.gaynor at gmail.com  Sun Jul 15 01:21:45 2012
From: alex.gaynor at gmail.com (Alex Gaynor)
Date: Sat, 14 Jul 2012 16:21:45 -0700
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To:
References:
Message-ID:

On Sat, Jul 14, 2012 at 4:18 PM, Benjamin Peterson wrote:

> 2012/7/14 Alex Gaynor :
> >
> > Proposal
> > ========
> >
> > This PEP proposes formally documenting ``__length_hint__`` for other
> > interpreter and non-standard library Python to implement.
> >
> > ``__length_hint__`` must return an integer, and is not required to be
> accurate.
> > It may return a value that is either larger or smaller than the actual
> size of
> > the container. It may raise a ``TypeError`` if a specific instance
> cannot have
> > its length estimated. It may not return a negative value.
>
> And what happens if you return a negative value?
>

ValueError, the same as with len.

>
> > Rationale
> > =========
> >
> > Being able to pre-allocate lists based on the expected size, as
> estimated by
> > ``__length_hint__``, can be a significant optimization.
CPython has been
> > observed to run some code faster than PyPy, purely because of this
> optimization
> > being present.
> >
> > Open questions
> > ==============
> >
> > There are two open questions for this PEP:
> >
> > * Should ``list`` expose a kwarg in it's constructor for supplying a
> length
> > hint.
> > * Should a function be added either to ``builtins`` or some other module
> which
> > calls ``__length_hint__``, like ``builtins.len`` calls ``__len__``.
>
> Let's try to keep this as limited as possible for a public API.
>

Sounds reasonable to me! Should we just go ahead and strip those out now?

>
> --
> Regards,
> Benjamin
>

Alex

-- 
"I disapprove of what you say, but I will defend to the death your right to
say it." -- Evelyn Beatrice Hall (summarizing Voltaire)
"The people's good is the highest law." -- Cicero
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From alexandre.zani at gmail.com  Sun Jul 15 01:28:07 2012
From: alexandre.zani at gmail.com (Alexandre Zani)
Date: Sat, 14 Jul 2012 16:28:07 -0700
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To:
References:
Message-ID:

On Sat, Jul 14, 2012 at 4:21 PM, Alex Gaynor wrote:
>
>
> On Sat, Jul 14, 2012 at 4:18 PM, Benjamin Peterson
> wrote:
>>
>> 2012/7/14 Alex Gaynor :
>> >
>> > Proposal
>> > ========
>> >
>> > This PEP proposes formally documenting ``__length_hint__`` for other
>> > interpreter and non-standard library Python to implement.
>> >
>> > ``__length_hint__`` must return an integer, and is not required to be
>> > accurate.
>> > It may return a value that is either larger or smaller than the actual
>> > size of
>> > the container. It may raise a ``TypeError`` if a specific instance
>> > cannot have
>> > its length estimated. It may not return a negative value.
>>
>> And what happens if you return a negative value?
>>
>
> ValueError, the same as with len.
>
>>
>> >
>> > Rationale
>> > =========
>> >
>> > Being able to pre-allocate lists based on the expected size, as
>> > estimated by
>> > ``__length_hint__``, can be a significant optimization. CPython has been
>> > observed to run some code faster than PyPy, purely because of this
>> > optimization
>> > being present.
>> >
>> > Open questions
>> > ==============
>> >
>> > There are two open questions for this PEP:
>> >
>> > * Should ``list`` expose a kwarg in it's constructor for supplying a
>> > length
>> > hint.
>> > * Should a function be added either to ``builtins`` or some other module
>> > which
>> > calls ``__length_hint__``, like ``builtins.len`` calls ``__len__``.
>>
>> Let's try to keep this as limited as possible for a public API.
>>
>
> Sounds reasonable to me! Should we just go ahead and strip those out now?

I'm +1 on not having a public API for this. Ultimately the contract for a
length hint will depend heavily upon what you need it for. Some
applications would require a length hint to be an "at least" others an "at
most" and others something else entirely. Given that the contract here
appears to be >=0, I don't think the length hint is particularly useful to
the public at large.

>
>>
>>
>> --
>> Regards,
>> Benjamin
>
>
> Alex
>
> --
> "I disapprove of what you say, but I will defend to the death your right to
> say it." -- Evelyn Beatrice Hall (summarizing Voltaire)
> "The people's good is the highest law."
-- Cicero
>
>
> _______________________________________________
> Python-Dev mailing list
> Python-Dev at python.org
> http://mail.python.org/mailman/listinfo/python-dev
> Unsubscribe:
> http://mail.python.org/mailman/options/python-dev/alexandre.zani%40gmail.com
>

From benjamin at python.org  Sun Jul 15 01:37:18 2012
From: benjamin at python.org (Benjamin Peterson)
Date: Sat, 14 Jul 2012 16:37:18 -0700
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To:
References:
Message-ID:

2012/7/14 Alex Gaynor :
>
>
> On Sat, Jul 14, 2012 at 4:18 PM, Benjamin Peterson
> wrote:
>>
>> 2012/7/14 Alex Gaynor :
>> >
>> > Proposal
>> > ========
>> >
>> > This PEP proposes formally documenting ``__length_hint__`` for other
>> > interpreter and non-standard library Python to implement.
>> >
>> > ``__length_hint__`` must return an integer, and is not required to be
>> > accurate.
>> > It may return a value that is either larger or smaller than the actual
>> > size of
>> > the container. It may raise a ``TypeError`` if a specific instance
>> > cannot have
>> > its length estimated. It may not return a negative value.
>>
>> And what happens if you return a negative value?
>>
>
> ValueError, the same as with len.

CPython will probably have to be updated to not ignore it if you return
"melons".

>
>>
>> >
>> > Rationale
>> > =========
>> >
>> > Being able to pre-allocate lists based on the expected size, as
>> > estimated by
>> > ``__length_hint__``, can be a significant optimization. CPython has been
>> > observed to run some code faster than PyPy, purely because of this
>> > optimization
>> > being present.
>> >
>> > Open questions
>> > ==============
>> >
>> > There are two open questions for this PEP:
>> >
>> > * Should ``list`` expose a kwarg in it's constructor for supplying a
>> > length
>> > hint.
>> > * Should a function be added either to ``builtins`` or some other module
>> > which
>> > calls ``__length_hint__``, like ``builtins.len`` calls ``__len__``.
>>
>> Let's try to keep this as limited as possible for a public API.
>>
>
> Sounds reasonable to me! Should we just go ahead and strip those out now?

Certainly.

-- 
Regards,
Benjamin

From tjreedy at udel.edu  Sun Jul 15 03:03:31 2012
From: tjreedy at udel.edu (Terry Reedy)
Date: Sat, 14 Jul 2012 21:03:31 -0400
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To:
References:
Message-ID:

On 7/14/2012 6:11 PM, Alex Gaynor wrote:
...
Various thoughts:

"This method is then used by various other functions (such +as ``map``) to
presize lists" -- map no longer produces lists. This only makes sense in 3.x
if you mean that map can pass along the value of its inputs.

"Types can then define ``__length_hint__`` which are not +sized, and thus
should not define ``__len__``," is awkwardly phrased. I think you mean "Types
that are not sized and should not define __len__ can then define
__length_hint__. What do 'sized' and 'should' mean? Some iterators know
exactly how many items they have yet to yield.

The main implication of having a __len__ versus __length_hint__ methods seems
to be its bool() value when empty.

If lists were to get a new keyword arg, so should the other classes based on
one internal array. I see this has been removed.

Generator functions are the nicest way to define iterators in Python.
Generator instances returned from generator functions cannot be given a
length hint. They are not directly helped. However ...

Not addressed in the PEP: do consumers of __length_hint__ look for it (and
__len__) before or after calling iter(input), or both? If before, then the
following should work.

class gwlh:  # generator with length hint
    def __init__(self, gen, len):
        self.gen = gen
        self.len = len
    def __iter__(self):
        return self.gen
    def __length_hint__(self):
        return self.len

Do transformation iterators pass through hints from inputs? Does map(f,
iterable) look for len or hint on iterable? Ditto for some itertools, like
chain (add lengths).
Any guidelines in the PEP?

-- 
Terry Jan Reedy

From ncoghlan at gmail.com  Sun Jul 15 07:16:02 2012
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 15 Jul 2012 15:16:02 +1000
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To:
References:
Message-ID:

On Sun, Jul 15, 2012 at 9:18 AM, Benjamin Peterson wrote:
>> Open questions
>> ==============
>>
>> There are two open questions for this PEP:
>>
>> * Should ``list`` expose a kwarg in it's constructor for supplying a length
>> hint.
>> * Should a function be added either to ``builtins`` or some other module which
>> calls ``__length_hint__``, like ``builtins.len`` calls ``__len__``.
>
> Let's try to keep this as limited as possible for a public API.

Length hints are very useful for *any* container implementation,
whether those containers are in the standard library or not. Just as
we exposed operator.index when __index__ was added, we should expose
an "operator.length_hint" function with the following semantics:

    def length_hint(obj):
        """Return an estimate of the number of items in obj. This is
        useful for presizing containers when building from an iterable.

        If the object supports len(), the result will be exact.
        Otherwise, it may over or underestimate by an arbitrary amount. The
        result will be an integer >= 0.
        """
        try:
            return len(obj)
        except TypeError:
            try:
                get_hint = obj.__length_hint__
            except AttributeError:
                return 0
            hint = get_hint()
            if not isinstance(hint, int):
                raise TypeError("Length hint must be an integer, not %r"
                                % type(hint))
            if hint < 0:
                raise ValueError("Length hint (%r) must be >= 0" % hint)
            return hint

There's no reason to make pure Python container implementations
reimplement all that for themselves.

Cheers,
Nick.
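Nick's proposed helper can be exercised end-to-end. The sketch below pairs it with a toy iterator that implements ``__length_hint__``; the ``CountDown`` class is illustrative only, and at the time of this thread neither ``operator.length_hint`` nor any such class exists in the stdlib:

```python
# A runnable version of the proposed length_hint() helper (Nick's
# semantics: exact for sized objects, hint for iterators, 0 otherwise).
def length_hint(obj):
    """Return an estimate of the number of items in obj (0 if unknown)."""
    try:
        return len(obj)
    except TypeError:
        try:
            get_hint = obj.__length_hint__
        except AttributeError:
            return 0
        hint = get_hint()
        if not isinstance(hint, int):
            raise TypeError("Length hint must be an integer, not %r"
                            % type(hint))
        if hint < 0:
            raise ValueError("Length hint (%r) must be >= 0" % hint)
        return hint


class CountDown:
    """Unsized iterator that nevertheless knows how many items remain."""

    def __init__(self, n):
        self.n = n

    def __iter__(self):
        return self

    def __next__(self):
        if self.n <= 0:
            raise StopIteration
        self.n -= 1
        return self.n

    def __length_hint__(self):
        # Shrinks as the iterator is consumed.
        return self.n


print(length_hint([1, 2, 3]))      # sized object, exact: 3
print(length_hint(CountDown(5)))   # no __len__, but hinted: 5
print(length_hint(x for x in ()))  # no len, no hint: 0
```

The three calls cover the three branches: ``len()`` succeeding, ``len()`` failing but ``__length_hint__`` being present, and neither being available.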
-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From alex.gaynor at gmail.com  Sun Jul 15 07:20:03 2012
From: alex.gaynor at gmail.com (Alex Gaynor)
Date: Sat, 14 Jul 2012 22:20:03 -0700
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To:
References:
Message-ID:

On Sat, Jul 14, 2012 at 10:16 PM, Nick Coghlan wrote:

> On Sun, Jul 15, 2012 at 9:18 AM, Benjamin Peterson
> wrote:
> >> Open questions
> >> ==============
> >>
> >> There are two open questions for this PEP:
> >>
> >> * Should ``list`` expose a kwarg in it's constructor for supplying a
> length
> >> hint.
> >> * Should a function be added either to ``builtins`` or some other
> module which
> >> calls ``__length_hint__``, like ``builtins.len`` calls ``__len__``.
> >
> > Let's try to keep this as limited as possible for a public API.
>
> Length hints are very useful for *any* container implementation,
> whether those containers are in the standard library or not. Just as
> we exposed operator.index when __index__ was added, we should expose
> an "operator.length_hint" function with the following semantics:
>
>     def length_hint(obj):
>         """Return an estimate of the number of items in obj. This is
>         useful for presizing containers when building from an iterable.
>
>         If the object supports len(), the result will be exact.
>         Otherwise, it may over or underestimate by an arbitrary amount. The
>         result will be an integer >= 0.
>         """
>         try:
>             return len(obj)
>         except TypeError:
>             try:
>                 get_hint = obj.__length_hint__
>             except AttributeError:
>                 return 0
>             hint = get_hint()
>             if not isinstance(hint, int):
>                 raise TypeError("Length hint must be an integer, not
> %r" % type(hint))
>             if hint < 0:
>                 raise ValueError("Length hint (%r) must be >= 0" % hint)
>             return hint
>
> There's no reason to make pure Python container implementations
> reimplement all that for themselves.
>
> Cheers,
> Nick.
>
> --
> Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
>

Sounds reasonable to me, the only issue with your pseudocode (err... I mean
Python ;)), is that there's no way for the __length_hint__ to specify that
that particular instance can't have a length hint computed. e.g. imagine
some sort of lazy stream that cached itself, and only wanted to offer a
length hint if it had already been evaluated. Without an exception to
raise, it has to return whatever the magic value for length_hint is (in
your impl it appears to be 0, the current _PyObject_LengthHint method in
CPython has a required `default` parameter). The PEP proposes using
TypeError for that.

Anyways that code looks good, do you want to add it to the PEP?

Alex

-- 
"I disapprove of what you say, but I will defend to the death your right to
say it." -- Evelyn Beatrice Hall (summarizing Voltaire)
"The people's good is the highest law." -- Cicero
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From steve at pearwood.info  Sun Jul 15 10:21:13 2012
From: steve at pearwood.info (Steven D'Aprano)
Date: Sun, 15 Jul 2012 18:21:13 +1000
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To:
References:
Message-ID: <50027D79.8010500@pearwood.info>

Nick Coghlan wrote:
> On Sun, Jul 15, 2012 at 9:18 AM, Benjamin Peterson wrote:
>>> Open questions
>>> ==============
>>>
>>> There are two open questions for this PEP:
>>>
>>> * Should ``list`` expose a kwarg in it's constructor for supplying a length
>>> hint.
>>> * Should a function be added either to ``builtins`` or some other module which
>>> calls ``__length_hint__``, like ``builtins.len`` calls ``__len__``.
>> Let's try to keep this as limited as possible for a public API.
>
> Length hints are very useful for *any* container implementation,
> whether those containers are in the standard library or not.
Just as
> we exposed operator.index when __index__ was added, we should expose
> an "operator.length_hint" function with the following semantics:
[...]

As given, length_hint gives no way of distinguishing between iterables and
non-iterables:

py> length_hint([])
0
py> length_hint(42)
0

nor does it give iterable objects a way to indicate that either they don't
know their length, or that they are infinite.

I suggest:

* object (and hence all other types that don't explicitly override it)
  should have a __length_hint__ that raises TypeError;

* __length_hint__ should be allowed to return None to indicate "don't know"
  or -1 to indicate "infinite".

Presumably anything that wishes to create a list or other sequence from an
object with a hint of -1 could then raise an exception immediately.

-- 
Steven

From ncoghlan at gmail.com  Sun Jul 15 10:47:38 2012
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 15 Jul 2012 18:47:38 +1000
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To: <50027D79.8010500@pearwood.info>
References: <50027D79.8010500@pearwood.info>
Message-ID:

On Sun, Jul 15, 2012 at 6:21 PM, Steven D'Aprano wrote:
> I suggest:
>
> * object (and hence all other types that don't explicitly override it)
> should have a __length_hint__ that raises TypeError;

We can keep it simpler than that just by changing the order of the checks.

> * __length_hint__ should be allowed to return None to indicate "don't know"
> or -1 to indicate "infinite".
>
> Presumably anything that wishes to create a list or other sequence from an
> object with a hint of -1 could then raise an exception immediately.

I'm not seeing the value in returning None over 0 for the don't know
case - it just makes the API harder to use. Declaring negative results
as meaning "I'm infinite" sounds reasonable, though:

    def length_hint(obj):
        """Return an estimate of the number of items in obj. This is
        useful for presizing containers when building from an iterable.
        If the object supports len(), the result will be exact.
        Otherwise, it may over or underestimate by an arbitrary amount.
        """
        try:
            get_hint = obj.__length_hint__
        except AttributeError:
            return len(obj)
        hint = get_hint()
        if not isinstance(hint, int):
            msg = "Length hint must be an integer, not %r"
            raise TypeError(msg % type(hint))
        if hint < 0:
            raise ValueError("%r is an infinite iterator" % (obj,))
        return hint

Cheers,
Nick.

-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From stefan_ml at behnel.de  Sun Jul 15 11:11:50 2012
From: stefan_ml at behnel.de (Stefan Behnel)
Date: Sun, 15 Jul 2012 11:11:50 +0200
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To:
References:
Message-ID:

Alex Gaynor, 15.07.2012 07:20:
> there's no way for the __length_hint__ to specify that
> that particular instance can't have a length hint computed. e.g. imagine
> some sort of lazy stream that cached itself, and only wanted to offer a
> length hint if it had already been evaluated. Without an exception to
> raise, it has to return whatever the magic value for length_hint is (in
> your impl it appears to be 0, the current _PyObject_LengthHint method in
> CPython has a required `default` parameter). The PEP proposes using
> TypeError for that.

Yes, that's a major issue. I've been planning to add a length hint to
Cython's generator expressions for a while, but the problem is really that
in most cases it is only known at runtime if the underlying iterable has a
length hint, so propagating it needs a way to say "sorry, I thought I might
know, but I don't". It would be even better if this way was efficient.

Since we're at a point of making this an official protocol, why not change
the current behaviour and return -1 (or even just 0) to explicitly state
that "we don't know"? The problem with an exception here is that it might
have been raised accidentally inside of the __length_hint__()
implementation that is being asked.
Swallowing it just because it happened to be a TypeError rather than
something else may end up covering bugs. We had a similar issue with
hasattr() in the past.

Also, it would be nice if this became a type slot rather than requiring a
dict lookup and Python function call.

Stefan

From solipsis at pitrou.net  Sun Jul 15 14:36:41 2012
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sun, 15 Jul 2012 14:36:41 +0200
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
References: <50027D79.8010500@pearwood.info>
Message-ID: <20120715143641.3d2bb83d@pitrou.net>

On Sun, 15 Jul 2012 18:47:38 +1000
Nick Coghlan wrote:
>
> > * __length_hint__ should be allowed to return None to indicate "don't know"
> > or -1 to indicate "infinite".
> >
> > Presumably anything that wishes to create a list or other sequence from an
> > object with a hint of -1 could then raise an exception immediately.
>
> I'm not seeing the value in returning None over 0 for the don't know
> case - it just makes the API harder to use.

The point is that 0 is a legitimate value for a length hint. Simple
implementations of __length_hint__ will start returning 0 as a
legitimate value and you will wrongly interpret that as "don't know",
which kind of defeats the purpose of __length_hint__ ;)
-- Software development and contracting: http://pro.pitrou.net From solipsis at pitrou.net Sun Jul 15 14:43:31 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 15 Jul 2012 14:43:31 +0200 Subject: [Python-Dev] cpython: Actually initialize __main__.__loader__ with loader instances, not the References: <3WZhn613RWzPJM@mail.python.org> Message-ID: <20120715144331.16d7461c@pitrou.net> On Sun, 15 Jul 2012 11:10:50 +0200 (CEST) nick.coghlan wrote: > tstate = PyThreadState_GET(); > interp = tstate->interp; > - loader = PyObject_GetAttrString(interp->importlib, loader_name); > + loader_type = PyObject_GetAttrString(interp->importlib, loader_name); > + if (loader_type == NULL) { > + return -1; > + } > + loader = PyObject_CallFunction(loader_type, "ss", "__main__", filename); I think you may have a refleak on loader_type here. Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From steve at pearwood.info Sun Jul 15 15:47:07 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Sun, 15 Jul 2012 23:47:07 +1000 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <20120715143641.3d2bb83d@pitrou.net> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> Message-ID: <5002C9DB.2000409@pearwood.info> Antoine Pitrou wrote: > The point is that 0 is a legitimate value for a length hint. Simple > implementations of __length_hint__ will start returning 0 as a > legitimate value and you will wrongly interpret that as "don't know", > which kinds of defeat the purpose of __length-hint__ ;) > That said, I don't think a special value for "is infinite" is useful. > Just make -1 mean "I don't know". You've obviously never accidentally called list on an infinite iterator *wink* It's not the (eventual) MemoryError that is the problem. On some systems, this can cause the PC to become unresponsive as the OS tries to free an ever-increasing amount of memory. 
Been there, done that, on a production system. I had to do a hard reboot to fix it. I think having a hint that says "there's no way this can succeed, fail immediately" is more useful than caring about the difference between a hint of 0 and a hint of 1. -- Steven From ncoghlan at gmail.com Sun Jul 15 16:08:41 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 16 Jul 2012 00:08:41 +1000 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <5002C9DB.2000409@pearwood.info> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> Message-ID: Right, I agree on the value in being able to return something to say "this cannot be converted to a concrete container". I still haven't seen a use case where the appropriate response to "I don't know" differs from the appropriate response to a hint of zero - that is, you don't preallocate, you just start iterating. Cheers, Nick. -- Sent from my phone, thus the relative brevity :) -------------- next part -------------- An HTML attachment was scrubbed... URL: From solipsis at pitrou.net Sun Jul 15 16:22:32 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 15 Jul 2012 16:22:32 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> Message-ID: <20120715162232.25c040b7@pitrou.net> On Mon, 16 Jul 2012 00:08:41 +1000 Nick Coghlan wrote: > Right, I agree on the value in being able to return something to say "this > cannot be converted to a concrete container". Who would be able to return that, apart from trivial cases like itertools.cycle()? Regards Antoine. 
-- Software development and contracting: http://pro.pitrou.net From lists at cheimes.de Sun Jul 15 16:33:23 2012 From: lists at cheimes.de (Christian Heimes) Date: Sun, 15 Jul 2012 16:33:23 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <20120715162232.25c040b7@pitrou.net> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <20120715162232.25c040b7@pitrou.net> Message-ID: Am 15.07.2012 16:22, schrieb Antoine Pitrou: > On Mon, 16 Jul 2012 00:08:41 +1000 > Nick Coghlan wrote: >> Right, I agree on the value in being able to return something to say "this >> cannot be converted to a concrete container". > > Who would be able to return that, apart from trivial cases like > itertools.cycle()? For example most numerical sequence iterators like Fibonacci generator, prime number sequence generator and even trivial cases like even natural number generator. IMO it's a good idea to have a notation for infinitive iterators that can't be materialized as finite containers. +1 Christian From mark at hotpy.org Sun Jul 15 16:39:07 2012 From: mark at hotpy.org (Mark Shannon) Date: Sun, 15 Jul 2012 15:39:07 +0100 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> Message-ID: <5002D60B.4000203@hotpy.org> Nick Coghlan wrote: > Right, I agree on the value in being able to return something to say > "this cannot be converted to a concrete container". > > I still haven't seen a use case where the appropriate response to "I > don't know" differs from the appropriate response to a hint of zero - > that is, you don't preallocate, you just start iterating. > There seem to be 5 possible classes values of __length_hint__ that an iterator object can provide: 1. Don't implement it at all. 2. Implement __length_hint__() but don't want to return any value. 
Either raise an exception (TypeError) -- As suggested in the PEP. or return NotImplemented -- my preferred option. 3. Return a "don't know" value: Returning 0 would be fine for this, but the VM might want to respond differently to "don't know" and 0. __length_hint__() == 0 container should be minimum size. __length_hint__() == "unknown" container starts at default size. 4. Infinite iterator: Could return float('inf'), but given this is a "hint" then returning sys.maxsize or sys.maxsize + 1 might be OK. Alternatively raise an OverflowError 5. A meaningful length. No problem :) Also, what are the allowable return types? 1. int only 2. Any number (ie any type with a __int__() method)? 3. Or any integer-like object (ie a type with a __index__() method)? My suggestion: a) Don't want to return any value or "don't know": return NotImplemented b) For infinite iterators: raise an OverflowError c) All other cases: return an int or a type with a __index__() method. Cheers, Mark. From mark at hotpy.org Sun Jul 15 16:14:59 2012 From: mark at hotpy.org (Mark Shannon) Date: Sun, 15 Jul 2012 15:14:59 +0100 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: References: Message-ID: <5002D063.1060401@hotpy.org> Alex Gaynor wrote: > Hi all, > > I've just submitted a PEP proposing making __length_hint__ a public API for > users to define and other VMs to implement: These seems back-to-front. __length_hint__ is *used* by the VM, not provided by it. It should be part of the object model, rather than the API. > > PEP: 424 > Title: A method for exposing a length hint > Version: $Revision$ > Last-Modified: $Date > Author: Alex Gaynor > Status: Draft > Type: Standards Track > Content-Type: text/x-rst > Created: 14-July-2012 > Python-Version: 3.4 > > Abstract > ======== > > CPython currently defines an ``__length_hint__`` method on several types, such > as various iterators. 
This method is then used by various other functions (such > as > ``map``) to presize lists based on the estimated returned by Don't use "map" as an example. map returns an iterator so it doesn't need __length_hint__ > ``__length_hint__``. Types can then define ``__length_hint__`` which are not > sized, and thus should not define ``__len__``, but can estimate or compute a > size (such as many iterators). > > Proposal > ======== > > This PEP proposes formally documenting ``__length_hint__`` for other > interpreter and non-standard library Python to implement. > > ``__length_hint__`` must return an integer, and is not required to be accurate. > It may return a value that is either larger or smaller than the actual size of > the container. It may raise a ``TypeError`` if a specific instance cannot have > its length estimated. It may not return a negative value. Rather than raising a TypeError, why not return NotImplemented? > > Rationale > ========= > > Being able to pre-allocate lists based on the expected size, as estimated by > ``__length_hint__``, can be a significant optimization. CPython has been > observed to run some code faster than PyPy, purely because of this optimization > being present. > > Open questions > ============== > > There are two open questions for this PEP: > > * Should ``list`` expose a kwarg in it's constructor for supplying a length > hint. > * Should a function be added either to ``builtins`` or some other module which > calls ``__length_hint__``, like ``builtins.len`` calls ``__len__``. > > Copyright > ========= > > This document has been placed into the public domain. > > .. 
> Local Variables: > mode: indented-text > indent-tabs-mode: nil > sentence-end-double-space: t > fill-column: 70 > coding: utf-8 > > > > > Alex > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/mark%40hotpy.org From brett at python.org Sun Jul 15 16:47:09 2012 From: brett at python.org (Brett Cannon) Date: Sun, 15 Jul 2012 10:47:09 -0400 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <5002D60B.4000203@hotpy.org> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <5002D60B.4000203@hotpy.org> Message-ID: On Sun, Jul 15, 2012 at 10:39 AM, Mark Shannon wrote: > Nick Coghlan wrote: > >> Right, I agree on the value in being able to return something to say >> "this cannot be converted to a concrete container". >> >> I still haven't seen a use case where the appropriate response to "I >> don't know" differs from the appropriate response to a hint of zero - that >> is, you don't preallocate, you just start iterating. >> >> > There seem to be 5 possible classes values of __length_hint__ that an > iterator object can provide: > > 1. Don't implement it at all. > > 2. Implement __length_hint__() but don't want to return any value. > Either raise an exception (TypeError) -- As suggested in the PEP. > or return NotImplemented -- my preferred option. > > 3. Return a "don't know" value: > Returning 0 would be fine for this, but the VM might want to respond > differently to "don't know" and 0. > __length_hint__() == 0 container should be minimum > size. > __length_hint__() == "unknown" container starts at default > size. > 4. Infinite iterator: > Could return float('inf'), but given this is a "hint" then > returning sys.maxsize or sys.maxsize + 1 might be OK. 
> Alternatively raise an OverflowError > I am really having a hard time differentiating infinity with "I don't know" since they are both accurate from the point of view of __length_hint__ and its typical purpose of allocation. You have no clue how many values will be grabbed from an infinite iterator, so it's the same as just not knowing upfront how long the iterator will be, infinite or not, and thus not worth distinguishing. > > 5. A meaningful length. No problem :) > > Also, what are the allowable return types? > > 1. int only > 2. Any number (ie any type with a __int__() method)? > 3. Or any integer-like object (ie a type with a __index__() method)? > > My suggestion: > > a) Don't want to return any value or "don't know": return NotImplemented > b) For infinite iterators: raise an OverflowError > c) All other cases: return an int or a type with a __index__() method. > I'm fine with (a), drop (b), and for (c) use what we allow for __len__() since, as Nick's operator.length_hint pseudo-code suggests, people will call this as a fallback if __len__ isn't defined. -Brett > > Cheers, > Mark. > > > ______________________________**_________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/**mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/**mailman/options/python-dev/** > brett%40python.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From lists at cheimes.de Sun Jul 15 16:56:39 2012 From: lists at cheimes.de (Christian Heimes) Date: Sun, 15 Jul 2012 16:56:39 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <5002D60B.4000203@hotpy.org> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <5002D60B.4000203@hotpy.org> Message-ID: Am 15.07.2012 16:39, schrieb Mark Shannon: > 1. Don't implement it at all. > > 2. Implement __length_hint__() but don't want to return any value. 
> Either raise an exception (TypeError) -- As suggested in the PEP. > or return NotImplemented -- my preferred option. How is this different from "don't know"? What's the use case for knowing that the object doesn't want to say anything or doesn't know its possible length. > 3. Return a "don't know" value: > Returning 0 would be fine for this, but the VM might want to respond > differently to "don't know" and 0. How about None? It's the logical choice, simple and easy to test for in Python and C code. 0 is a valid number for "I know that's I'll return nothing". > 4. Infinite iterator: > Could return float('inf'), but given this is a "hint" then > returning sys.maxsize or sys.maxsize + 1 might be OK. > Alternatively raise an OverflowError Too complex, hard to remember and even harder to check for. Since a length is always positive or zero, -1 is a good return value for infinite. > a) Don't want to return any value or "don't know": return NotImplemented +1 > b) For infinite iterators: raise an OverflowError -1, I'm for -1. ;) I'm not a fan of using exception for valid and correct return values. > c) All other cases: return an int or a type with a __index__() method. +1 Christian From mark at hotpy.org Sun Jul 15 17:08:00 2012 From: mark at hotpy.org (Mark Shannon) Date: Sun, 15 Jul 2012 16:08:00 +0100 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <5002D60B.4000203@hotpy.org> Message-ID: <5002DCD0.6040107@hotpy.org> Brett Cannon wrote: > > > On Sun, Jul 15, 2012 at 10:39 AM, Mark Shannon > wrote: > > Nick Coghlan wrote: > > Right, I agree on the value in being able to return something to > say "this cannot be converted to a concrete container". 
> > I still haven't seen a use case where the appropriate response > to "I don't know" differs from the appropriate response to a > hint of zero - that is, you don't preallocate, you just start > iterating. > > > There seem to be 5 possible classes values of __length_hint__ that an > iterator object can provide: > > 1. Don't implement it at all. > > 2. Implement __length_hint__() but don't want to return any value. > Either raise an exception (TypeError) -- As suggested in the PEP. > or return NotImplemented -- my preferred option. > > 3. Return a "don't know" value: > Returning 0 would be fine for this, but the VM might want to respond > differently to "don't know" and 0. > __length_hint__() == 0 container should be > minimum size. > __length_hint__() == "unknown" container starts at > default size. > > > 4. Infinite iterator: > Could return float('inf'), but given this is a "hint" then > returning sys.maxsize or sys.maxsize + 1 might be OK. > Alternatively raise an OverflowError > > > I am really having a hard time differentiating infinity with "I don't > know" since they are both accurate from the point of view of > __length_hint__ and its typical purpose of allocation. You have no clue > how many values will be grabbed from an infinite iterator, so it's the > same as just not knowing upfront how long the iterator will be, infinite > or not, and thus not worth distinguishing. > > > > 5. A meaningful length. No problem :) > > Also, what are the allowable return types? > > 1. int only > 2. Any number (ie any type with a __int__() method)? > 3. Or any integer-like object (ie a type with a __index__() method)? > > My suggestion: > > a) Don't want to return any value or "don't know": return NotImplemented > b) For infinite iterators: raise an OverflowError > c) All other cases: return an int or a type with a __index__() method. 
> > > I'm fine with (a), drop (b), and for (c) use what we allow for __len__() > since, as Nick's operator.length_hint pseudo-code suggests, people will > call this as a fallback if __len__ isn't defined. So how does an iterator express infinite length? What should happen if I am silly enough to do this: >>> list(itertools.count()) This will fail; it should fail quickly. Cheers, Mark. From solipsis at pitrou.net Sun Jul 15 17:06:28 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 15 Jul 2012 17:06:28 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <20120715162232.25c040b7@pitrou.net> Message-ID: <20120715170628.1f6fcc04@pitrou.net> On Sun, 15 Jul 2012 16:33:23 +0200 Christian Heimes wrote: > Am 15.07.2012 16:22, schrieb Antoine Pitrou: > > On Mon, 16 Jul 2012 00:08:41 +1000 > > Nick Coghlan wrote: > >> Right, I agree on the value in being able to return something to say "this > >> cannot be converted to a concrete container". > > > > Who would be able to return that, apart from trivial cases like > > itertools.cycle()? > > For example most numerical sequence iterators like Fibonacci generator, > prime number sequence generator and even trivial cases like even natural > number generator. First, you can't implement __length_hint__ for a generator, which is the preferred (the most practical) way of writing iterators in pure Python. Second, not all iterators will implement __length_hint__ (because it's optional and, really, of rather little use). So, as a user, you cannot hope that `list(some_iterator)` will always raise instead of filling your memory with an infinite stream of values: you have to be careful anyway. Even if __length_hint__ is implemented, its result may be wrong. That's the whole point: it's a *hint*; an iterator might tell you it's finite while it's infinite, or the reverse. 
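To make the first point concrete, a quick check under current CPython (the hint method is still undocumented, but it is there on built-in iterators):

```python
def gen():
    yield 1

# Built-in iterators such as list iterators carry the (private) hint method...
print(hasattr(iter([1, 2, 3]), "__length_hint__"))  # True on CPython

# ...but there is no way to attach one to a generator you write yourself.
print(hasattr(gen(), "__length_hint__"))  # False
```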
My conclusion is that an infinite iterator is a documentation issue. Just tell the user that it doesn't stop, and let them shoot themselves in the foot in they want to. Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From solipsis at pitrou.net Sun Jul 15 17:14:30 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 15 Jul 2012 17:14:30 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <5002D60B.4000203@hotpy.org> <5002DCD0.6040107@hotpy.org> Message-ID: <20120715171430.0b1b1ddc@pitrou.net> On Sun, 15 Jul 2012 16:08:00 +0100 Mark Shannon wrote: > > What should happen if I am silly enough to do this: > >>> list(itertools.count()) > > This will fail; it should fail quickly. Why should it? AFAIK it's not a common complaint. You said it yourself: it's a silly thing to do. Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From alexandre.zani at gmail.com Sun Jul 15 17:38:20 2012 From: alexandre.zani at gmail.com (Alexandre Zani) Date: Sun, 15 Jul 2012 08:38:20 -0700 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <5002DCD0.6040107@hotpy.org> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <5002D60B.4000203@hotpy.org> <5002DCD0.6040107@hotpy.org> Message-ID: On Sun, Jul 15, 2012 at 8:08 AM, Mark Shannon wrote: > Brett Cannon wrote: > >> >> >> On Sun, Jul 15, 2012 at 10:39 AM, Mark Shannon > > wrote: >> >> Nick Coghlan wrote: >> >> Right, I agree on the value in being able to return something to >> say "this cannot be converted to a concrete container". >> >> I still haven't seen a use case where the appropriate response >> to "I don't know" differs from the appropriate response to a >> hint of zero - that is, you don't preallocate, you just start >> iterating. 
>> >> >> There seem to be 5 possible classes values of __length_hint__ that an >> iterator object can provide: >> >> 1. Don't implement it at all. >> >> 2. Implement __length_hint__() but don't want to return any value. >> Either raise an exception (TypeError) -- As suggested in the PEP. >> or return NotImplemented -- my preferred option. >> >> 3. Return a "don't know" value: >> Returning 0 would be fine for this, but the VM might want to >> respond >> differently to "don't know" and 0. >> __length_hint__() == 0 container should be >> minimum size. >> __length_hint__() == "unknown" container starts at >> default size. >> >> >> 4. Infinite iterator: >> Could return float('inf'), but given this is a "hint" then >> returning sys.maxsize or sys.maxsize + 1 might be OK. >> Alternatively raise an OverflowError >> >> >> I am really having a hard time differentiating infinity with "I don't >> know" since they are both accurate from the point of view of __length_hint__ >> and its typical purpose of allocation. You have no clue how many values will >> be grabbed from an infinite iterator, so it's the same as just not knowing >> upfront how long the iterator will be, infinite or not, and thus not worth >> distinguishing. >> >> >> 5. A meaningful length. No problem :) >> >> Also, what are the allowable return types? >> >> 1. int only >> 2. Any number (ie any type with a __int__() method)? >> 3. Or any integer-like object (ie a type with a __index__() method)? >> >> My suggestion: >> >> a) Don't want to return any value or "don't know": return >> NotImplemented >> b) For infinite iterators: raise an OverflowError >> c) All other cases: return an int or a type with a __index__() method. >> >> >> I'm fine with (a), drop (b), and for (c) use what we allow for __len__() >> since, as Nick's operator.length_hint pseudo-code suggests, people will call >> this as a fallback if __len__ isn't defined. > > > So how does an iterator express infinite length? 
> What should happen if I am silly enough to do this:
> >>> list(itertools.count())
>
> This will fail; it should fail quickly.
>
> Cheers,
> Mark.
> _______________________________________________
> Python-Dev mailing list
> Python-Dev at python.org
> http://mail.python.org/mailman/listinfo/python-dev
> Unsubscribe:
> http://mail.python.org/mailman/options/python-dev/alexandre.zani%40gmail.com

The PEP so far says: "It may raise a ``TypeError`` if a specific instance cannot have its length estimated." In many ways, "I don't know" is the same as this "specific instance cannot have its length estimated". Why not just raise a TypeError?

Also, regarding the code Nick posted above, I'm a little concerned about calling len as the first thing to try. That means that if I implement both __len__ and __len_hint__ (perhaps because __len__ is very expensive) __len_hint__ will never be used. It's relatively easy to say:

    try:
        hint = len_hint(l)
    except TypeError:
        hint = len(l)

From steve at pearwood.info Sun Jul 15 17:55:09 2012
From: steve at pearwood.info (Steven D'Aprano)
Date: Mon, 16 Jul 2012 01:55:09 +1000
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To: <5002DCD0.6040107@hotpy.org>
References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <5002D60B.4000203@hotpy.org>
Message-ID: <5002E7DD.7030400@pearwood.info>

Mark Shannon wrote:
> So how does an iterator express infinite length?

The suggestion was it should return -1.

> What should happen if I am silly enough to do this:
> >>> list(itertools.count())
>
> This will fail; it should fail quickly.

That depends on your OS. I've just tested it now on Linux Mint, and the Python process was terminated within seconds. I've also inadvertently done it on a Fedora system, which became completely unresponsive to user-input (including ctrl-alt-delete) within a few minutes.
I let it run overnight (16 hours) before literally pulling the plug. (I expect the difference in behaviour is due to the default ulimit under Debian/Mint and RedHat/Fedora systems.) Ignoring OS-specific features, the promise[1] of the language is that list will try to allocate enough space for every item yielded by the iterator, or fail with a MemoryError. No promise is made as to how long that will take: it could take hours, or days, depending on how badly memory allocation performance drops when faced with unreasonably large requests. You can't expect it to fail either quickly or with an exception. With a length hint, we could strengthen that promise: "if __length_hint__ returns a negative number, list, tuple and set will fail immediately with MemoryError" which I think is a good safety feature for some things which cannot possibly succeed, but risk DOSing your system. Does it prevent every possible failure mode? No, of course not. But just because you can't prevent *every* problem doesn't mean you should prevent the ones which you can. [1] I think. I'm sure I read this somewhere in the docs, but I can't find it now. -- Steven From rosuav at gmail.com Sun Jul 15 18:00:58 2012 From: rosuav at gmail.com (Chris Angelico) Date: Mon, 16 Jul 2012 02:00:58 +1000 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <5002E7DD.7030400@pearwood.info> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <5002D60B.4000203@hotpy.org> <5002DCD0.6040107@hotpy.org> <5002E7DD.7030400@pearwood.info> Message-ID: On Mon, Jul 16, 2012 at 1:55 AM, Steven D'Aprano wrote: > (I expect the difference in behaviour is due to the default ulimit under > Debian/Mint and RedHat/Fedora systems.) Possibly also virtual memory settings. Allocating gobs of memory with a huge page file slows everything down without raising an error. 
And since it's possible to have non-infinite but ridiculous-sized iterators, I'd not bother putting too much effort into protecting infinite iterators - although the "huge but not infinite" case is, admittedly, rather rarer than either "reasonable-sized" or "actually infinite". ChrisA From steve at pearwood.info Sun Jul 15 18:21:20 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Mon, 16 Jul 2012 02:21:20 +1000 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <20120715170628.1f6fcc04@pitrou.net> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <20120715162232.25c040b7@pitrou.net> <20120715170628.1f6fcc04@pitrou.net> Message-ID: <5002EE00.7000305@pearwood.info> Antoine Pitrou wrote: > First, you can't implement __length_hint__ for a generator, which is the > preferred (the most practical) way of writing iterators in pure Python. Limitations of generators are no reason for not improving iterators which are not generators. __length_hint__ already exists; this proposal simply proposes making it documented and officially supported. py> iter([]).__length_hint__ > Even if __length_hint__ is implemented, its result may be wrong. > That's the whole point: it's a *hint*; an iterator might tell you it's > finite while it's infinite, or the reverse. If it claims to be infinite, I see no reason to disbelieve it on the off-chance that it is actually both finite and small enough to fit into memory without crashing my system. If it claims to be finite, but is actually infinite, well that's not much of a hint, is it? There's an implied promise that the hint will be close to the real value, not infinitely distant. > My conclusion is that an infinite iterator is a documentation issue. > Just tell the user that it doesn't stop, and let them shoot themselves > in the foot in they want to. Buffer overflows are a documentation issue. 
Just tell the user not to overwrite memory they don't mean to, and let them shoot themselves in the foot if they want. *wink* -- Steven From solipsis at pitrou.net Sun Jul 15 18:19:47 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 15 Jul 2012 18:19:47 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <5002D60B.4000203@hotpy.org> <5002DCD0.6040107@hotpy.org> <5002E7DD.7030400@pearwood.info> Message-ID: <20120715181947.5c3a06e4@pitrou.net> On Mon, 16 Jul 2012 02:00:58 +1000 Chris Angelico wrote: > On Mon, Jul 16, 2012 at 1:55 AM, Steven D'Aprano wrote: > > (I expect the difference in behaviour is due to the default ulimit under > > Debian/Mint and RedHat/Fedora systems.) > > Possibly also virtual memory settings. Allocating gobs of memory with > a huge page file slows everything down without raising an error. > > And since it's possible to have non-infinite but ridiculous-sized > iterators, I'd not bother putting too much effort into protecting > infinite iterators - although the "huge but not infinite" case is, > admittedly, rather rarer than either "reasonable-sized" or "actually > infinite". In the real world, I'm sure "huge but not infinite" is much more frequent than "actually infinite". Trying to list() an infinite iterator is a programming error, so it shouldn't end up in production code. However, data that grows bigger than expected (or that gets disposed of too late) is quite a common thing. When hg.python.org died of OOM two weeks ago, it wasn't because of an infinite iterator: http://mail.python.org/pipermail/python-committers/2012-July/002084.html Regards Antoine. 
-- Software development and contracting: http://pro.pitrou.net From solipsis at pitrou.net Sun Jul 15 18:26:19 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 15 Jul 2012 18:26:19 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <20120715162232.25c040b7@pitrou.net> <20120715170628.1f6fcc04@pitrou.net> <5002EE00.7000305@pearwood.info> Message-ID: <20120715182619.201e6d1f@pitrou.net> On Mon, 16 Jul 2012 02:21:20 +1000 Steven D'Aprano wrote: > > > My conclusion is that an infinite iterator is a documentation issue. > > Just tell the user that it doesn't stop, and let them shoot themselves > > in the foot in they want to. > > Buffer overflows are a documentation issue. Just tell the user not to > overwrite memory they don't mean to, and let them shoot themselves in the foot > if they want. No, buffer overflows are bugs and they get fixed. Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From steve at pearwood.info Sun Jul 15 18:39:01 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Mon, 16 Jul 2012 02:39:01 +1000 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <5002E7DD.7030400@pearwood.info> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <5002D60B.4000203@hotpy.org> <5002DCD0.6040107@hotpy.org> <5002E7DD.7030400@pearwood.info> Message-ID: <5002F225.4080806@pearwood.info> Steven D'Aprano wrote: > With a length hint, we could strengthen that promise: > > "if __length_hint__ returns a negative number, list, tuple and set will > fail immediately with MemoryError" > > which I think is a good safety feature for some things which cannot > possibly succeed, but risk DOSing your system. Does it prevent every > possible failure mode? No, of course not. 
But just because you can't > prevent *every* problem doesn't mean you should prevent the ones which > you can. Gah, I messed that last sentence up. It should read: just because you can't prevent *every* problem doesn't mean you SHOULDN'T prevent the ones which you can. -- Steven From stephen at xemacs.org Mon Jul 16 05:52:18 2012 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Mon, 16 Jul 2012 12:52:18 +0900 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> Message-ID: <87zk702wi5.fsf@uwakimon.sk.tsukuba.ac.jp> Nick Coghlan writes: > Right, I agree on the value in being able to return something to say "this > cannot be converted to a concrete container". > > I still haven't seen a use case where the appropriate response to "I don't > know" differs from the appropriate response to a hint of zero - that is, > you don't preallocate, you just start iterating. Why wouldn't one just believe the hint and jump past the iteration? What about an alternative API such as length_hint(iter, bound) returning 'cannot say' (if no hint is available), 'small' (if the estimated length is less than bound), and 'large' (if it's greater than the bound or infinite)? (Or None, True, False which would give the boolean interpretation "do I know I'm small enough to be converted to a concrete container?") The point is that I don't really see the value in returning a precise estimate that cannot be relied on to be accurate. OK, Python is a "consenting adults" language, but returning an integer here seems like invitation to abuse. 
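To make the alternative concrete, here is a rough sketch; the helper name, the bound argument, and the None/True/False convention are all hypothetical, just the proposal above:

```python
def length_hint(it, bound):
    """Return None if no hint is available, True if the estimated
    length fits within bound, False if it exceeds bound (or is
    infinite, signalled here by a negative hint)."""
    try:
        hint_method = type(it).__length_hint__
    except AttributeError:
        return None                # the iterator offers no estimate at all
    hint = hint_method(it)
    if hint < 0:                   # negative hint: treat as unbounded
        return False
    return hint <= bound
```

A consumer would then preallocate only on True, start small on None, and refuse to build a concrete container at all on False.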
From ncoghlan at gmail.com Mon Jul 16 06:20:17 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 16 Jul 2012 14:20:17 +1000 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <87zk702wi5.fsf@uwakimon.sk.tsukuba.ac.jp> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <87zk702wi5.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Jul 16, 2012 1:52 PM, "Stephen J. Turnbull" wrote: > The point is that I don't really see the value in returning a precise > estimate that cannot be relied on to be accurate. OK, Python is a > "consenting adults" language, but returning an integer here seems like > invitation to abuse. Because preallocating memory is ridiculously faster than doing multiple resizes. That's all this API is for: how many objects should a container constructor preallocate space for when building from an iterable. It's an important optimisation in CPython when using itertools, and PyPy is planning to adopt it as well. Alex is doing the right thing in attempting to standardise it rather than risk the two implementations using subtly incompatible definitions. Skipping the iteration in the zero case is a pointless micro-optimisation that just makes the API more complex for no good reason. Allowing a negative hint to mean "infinite", on the other hand, avoids certain categories of errors without making the API any harder to use (since negatives have to be rejected anyway). -- Sent from my phone, thus the relative brevity :) -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From steve at pearwood.info Mon Jul 16 06:22:36 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Mon, 16 Jul 2012 14:22:36 +1000 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <87zk702wi5.fsf@uwakimon.sk.tsukuba.ac.jp> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <87zk702wi5.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: <5003970C.6040606@pearwood.info> Stephen J. Turnbull wrote: > The point is that I don't really see the value in returning a precise > estimate that cannot be relied on to be accurate. OK, Python is a > "consenting adults" language, but returning an integer here seems like > invitation to abuse. Since __length_hint__ already exists and is already used, we should probably hear from somebody who knows how it is used and what problems and/or benefits it leads to. -- Steven From stefan_ml at behnel.de Mon Jul 16 08:54:27 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 16 Jul 2012 08:54:27 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <5002D063.1060401@hotpy.org> References: <5002D063.1060401@hotpy.org> Message-ID: Mark Shannon, 15.07.2012 16:14: > Alex Gaynor wrote: >> CPython currently defines an ``__length_hint__`` method on several types, >> such >> as various iterators. This method is then used by various other functions >> (such as ``map``) to presize lists based on the estimated returned by > > Don't use "map" as an example. > map returns an iterator so it doesn't need __length_hint__ Right. It's a good example for something else, though. As I mentioned before, iterators should be able to propagate the length hint of an underlying iterator, e.g. in generator expressions or map(). I consider that an important feature that the protocol must support. 
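A toy sketch of the propagation I mean; the length_hint() helper is hand-rolled here in the spirit of Nick's pseudo-code, and none of this is an existing API:

```python
def length_hint(it, default=0):
    # Hand-rolled stand-in for the helper function discussed in this thread.
    try:
        hint = type(it).__length_hint__
    except AttributeError:
        return default
    return hint(it)

class MapIter:
    """Toy one-function map() that forwards the underlying hint."""
    def __init__(self, func, iterable):
        self.func = func
        self.it = iter(iterable)
    def __iter__(self):
        return self
    def __next__(self):
        return self.func(next(self.it))
    def __length_hint__(self):
        # Propagate the inner iterator's hint instead of answering
        # "don't know".
        return length_hint(self.it)
```

list(MapIter(str, [1, 2, 3])) could then presize to 3 elements instead of growing from scratch.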
Stefan From stefan_ml at behnel.de Mon Jul 16 09:10:45 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 16 Jul 2012 09:10:45 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <20120715170628.1f6fcc04@pitrou.net> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <20120715162232.25c040b7@pitrou.net> <20120715170628.1f6fcc04@pitrou.net> Message-ID: Antoine Pitrou, 15.07.2012 17:06: > On Sun, 15 Jul 2012 16:33:23 +0200 > Christian Heimes wrote: >> Am 15.07.2012 16:22, schrieb Antoine Pitrou: >>> On Mon, 16 Jul 2012 00:08:41 +1000 >>> Nick Coghlan wrote: >>>> Right, I agree on the value in being able to return something to say "this >>>> cannot be converted to a concrete container". >>> >>> Who would be able to return that, apart from trivial cases like >>> itertools.cycle()? >> >> For example most numerical sequence iterators like Fibonacci generator, >> prime number sequence generator and even trivial cases like even natural >> number generator. > > First, you can't implement __length_hint__ for a generator, which is the > preferred (the most practical) way of writing iterators in pure Python. It can be implemented for generator expressions without a conditional, though, including the case of comprehensions. I wanted to do this in Cython for a while, but the protocol wasn't very well adapted to that use case. The "don't know" case was just too common and inefficient. For the other points, I agree with the already presented counterarguments. Being able to prevent some obvious traps is a good thing, even if you can't prevent all of them. 
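One way to get a hint where a plain generator cannot provide one is to spell the generator as a class-based iterator. A hypothetical sketch (not stdlib code) for the "even natural numbers" case mentioned earlier in the thread, bounded here so the hint stays an ordinary integer:

```python
class EvenNumbers:
    """Class-based replacement for a generator, so __length_hint__ can
    be defined (hypothetical example; bounded to keep the hint finite)."""

    def __init__(self, stop):
        self._n = 0
        self._stop = stop

    def __iter__(self):
        return self

    def __next__(self):
        if self._n >= self._stop:
            raise StopIteration
        value = self._n
        self._n += 2
        return value

    def __length_hint__(self):
        # Exact in this case, though the protocol only asks for an estimate.
        return max(0, (self._stop - self._n + 1) // 2)
```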
Stefan From stefan_ml at behnel.de Mon Jul 16 09:36:57 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 16 Jul 2012 09:36:57 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: References: Message-ID: Alex Gaynor, 15.07.2012 00:11: > CPython currently defines an ``__length_hint__`` method on several types, such > as various iterators. This method is then used by various other functions (such > as ``map``) to presize lists based on the estimated returned by > ``__length_hint__``. Types can then define ``__length_hint__`` which are not > sized, and thus should not define ``__len__``, but can estimate or compute a > size (such as many iterators). > > Proposal > ======== > > This PEP proposes formally documenting ``__length_hint__`` for other > interpreter and non-standard library Python to implement. > > ``__length_hint__`` must return an integer, and is not required to be accurate. > It may return a value that is either larger or smaller than the actual size of > the container. It may raise a ``TypeError`` if a specific instance cannot have > its length estimated. It may not return a negative value. I'd like to more visibly repeat my suggestion to make this a slot method "tp_length_hint()" in extension types that returns a Py_ssize_t. That suggests that a negative return value would have a special meaning instead of relying on return values like NotImplemented. The Python wrapper of that slot method could still implement a mapping for this. Return values could be -1 for "don't know" and -2 for "infinite" at the C level, and NotImplemented for "don't know" at the Python level. Not sure about a good Python value for "infinite". Maybe return -1 for "infinite" at both levels and -2/NotImplemented for "don't know" in C/Python? That would suggest -3 to propagate exceptions at the C level. 
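In pure Python, the mapping between those proposed C-level sentinels and their Python-level spellings could look roughly like this (the numbering is only the suggestion above, not anything CPython implements):

```python
UNKNOWN = -1    # proposed C-level sentinel: "don't know"
INFINITE = -2   # proposed C-level sentinel: "infinite"

def length_hint_from_slot(raw_result):
    """Translate a raw tp_length_hint result for Python-level callers
    (sketch of the suggested convention, not real CPython code)."""
    if raw_result >= 0:
        return raw_result          # an ordinary length estimate
    if raw_result == UNKNOWN:
        return NotImplemented      # Python-level "don't know"
    if raw_result == INFINITE:
        raise TypeError("cannot build a concrete container "
                        "from an infinite iterator")
    raise ValueError("unexpected sentinel %d" % raw_result)
```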
Stefan From ncoghlan at gmail.com Mon Jul 16 10:26:11 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 16 Jul 2012 18:26:11 +1000 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: References: Message-ID: Proposing anything substantially more complicated than what is currently implemented in CPython will just get the idea rejected. The substantial immediate gain for PyPy is in skipping the memory resizing when building containers from itertools iterators, not anywhere else. -- Sent from my phone, thus the relative brevity :) -------------- next part -------------- An HTML attachment was scrubbed... URL: From mark at hotpy.org Mon Jul 16 10:37:33 2012 From: mark at hotpy.org (Mark Shannon) Date: Mon, 16 Jul 2012 09:37:33 +0100 Subject: [Python-Dev] Do we need __length_hint__ at all? (Was PEP 0424: A method for exposing a length hint) In-Reply-To: References: Message-ID: <5003D2CD.2030903@hotpy.org> To quote from PEP 424: > Rationale > ========= > > Being able to pre-allocate lists based on the expected size, as estimated by > ``__length_hint__``, can be a significant optimization. CPython has been > observed to run some code faster than PyPy, purely because of this optimization > being present. > Why is it a significant optimisation? How much slower is it? Where is the data? *If* resizing list is so slow, then why not make it faster? To quote wikipedia (http://en.wikipedia.org/wiki/Dynamic_array) """ As n elements are inserted, the capacities form a geometric progression. Expanding the array by any constant proportion ensures that inserting n elements takes O(n) time overall, meaning that each insertion takes amortized constant time. The value of this proportion a leads to a time-space tradeoff: the average time per insertion operation is about a/(a-1), while the number of wasted cells is bounded above by (a-1)n. 
The choice of a depends on the library or application: some textbooks use a = 2, but Java's ArrayList implementation uses a = 3/2 and the C implementation of Python's list data structure uses a = 9/8. """ If resizing of lists is too slow, then we should reconsider the 9/8 factor and/or look to tweak the resizing code. Cheers, Mark. From stefan_ml at behnel.de Mon Jul 16 10:53:54 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 16 Jul 2012 10:53:54 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: References: Message-ID: Nick Coghlan, 16.07.2012 10:26: > Proposing anything substantially more complicated than what is currently > implemented in CPython will just get the idea rejected. The substantial > immediate gain for PyPy is in skipping the memory resizing when building > containers from itertools iterators, not anywhere else. The same applies to Cython, where the extension types that implement generator expressions can benefit from propagating the length hint of the underlying iterator. A type slot would help in making this more efficient overall, also for CPython itself. Stefan From stefan_ml at behnel.de Mon Jul 16 11:02:42 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 16 Jul 2012 11:02:42 +0200 Subject: [Python-Dev] Do we need __length_hint__ at all? (Was PEP 0424: A method for exposing a length hint) In-Reply-To: <5003D2CD.2030903@hotpy.org> References: <5003D2CD.2030903@hotpy.org> Message-ID: Mark Shannon, 16.07.2012 10:37: > If resizing of lists is too slow, then we should reconsider the 9/8 factor > and/or look to tweak the resizing code. The thing is that the performance is platform specific. On systems with a fast memory allocator, especially on Linux, the difference is negligible. However, with a slow memory allocator, especially a slow realloc(), e.g. on Windows or Solaris, this can substantially hurt the performance, up to a quadratically increasing runtime in the worst case. 
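The effect can be sketched with a simplified copy-count model (an illustration only, not CPython's actual resize rule):

```python
def copies_when_growing(n, factor):
    """Count element copies made while appending n items to a dynamic
    array that grows geometrically by `factor` (simplified model in
    which every realloc() moves the existing data)."""
    capacity = copied = 0
    for size in range(1, n + 1):
        if size > capacity:
            copied += size - 1  # relocate the existing elements
            capacity = max(int(capacity * factor), size)
    return copied
```

In this model a factor of 2 keeps the total copying roughly linear in n, while a small factor like 9/8 multiplies it by roughly a/(a-1); when each move is served by a slow realloc(), that difference dominates the runtime.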
The length hint was implemented specifically to work around this problem. Stefan From raymond.hettinger at gmail.com Mon Jul 16 11:05:08 2012 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Mon, 16 Jul 2012 02:05:08 -0700 Subject: [Python-Dev] Do we need __length_hint__ at all? (Was PEP 0424: A method for exposing a length hint) In-Reply-To: <5003D2CD.2030903@hotpy.org> References: <5003D2CD.2030903@hotpy.org> Message-ID: On Jul 16, 2012, at 1:37 AM, Mark Shannon wrote: > To quote from PEP 424: > >> Rationale >> ========= >> Being able to pre-allocate lists based on the expected size, as estimated by ``__length_hint__``, can be a significant optimization. CPython has been >> observed to run some code faster than PyPy, purely because of this optimization >> being present. > > Why is it a significant optimisation? Unless pre-sized with a length prediction, a growing list periodically needs to call realloc() which can move all the data to a new location in memory. Pre-sizing avoids that entirely. > If resizing of lists is too slow, then we should reconsider the 9/8 factor > and/or look to tweak the resizing code. A great deal of thought and care went into the current design. It has already been "tweaked". Raymond P.S. The dictionary code also uses presizing for copies, updates, set conversion, etc. It is a perfectly reasonable technique to pre-allocate the correct size container when the ultimate length is knowable in advance. From fijall at gmail.com Mon Jul 16 11:14:54 2012 From: fijall at gmail.com (Maciej Fijalkowski) Date: Mon, 16 Jul 2012 11:14:54 +0200 Subject: [Python-Dev] Do we need __length_hint__ at all? (Was PEP 0424: A method for exposing a length hint) In-Reply-To: References: <5003D2CD.2030903@hotpy.org> Message-ID: On Mon, Jul 16, 2012 at 11:02 AM, Stefan Behnel wrote: > Mark Shannon, 16.07.2012 10:37: > > If resizing of lists is too slow, then we should reconsider the 9/8 > factor > > and/or look to tweak the resizing code. 
> > The thing is that the performance is platform specific. On systems with a > fast memory allocator, especially on Linux, the difference is negligible. > However, with a slow memory allocator, especially a slow realloc(), e.g. on > Windows or Solaris, this can substantially hurt the performance, up to a > quadratically increasing runtime in the worst case. > > The length hint was implemented specifically to work around this problem. > > Stefan > > It's not the actual allocation (typically), it's the copying that's the problem. As far as data goes - preallocation can make 4x difference (on PyPy, although the dominant operation is not different on CPython) on ''.join(some-iterable). It depends grossly on the sizes of the list, so you can't claim that there is a precise speedup of a constant factor, however, there are cases where it *can* be significant (as in the copying is by far the dominating operation), most notable giant templates with iterators written in C. Speaking of which - I find this bikeshed disgusting. The purpose of the PEP is to codify whatever is already written in code in CPython. If you guys don't want it, we'll just implement it anyway and try to follow the CPython current implementation from 2.7. Cheers, fijal -------------- next part -------------- An HTML attachment was scrubbed... URL: From mail at timgolden.me.uk Mon Jul 16 11:21:52 2012 From: mail at timgolden.me.uk (Tim Golden) Date: Mon, 16 Jul 2012 10:21:52 +0100 Subject: [Python-Dev] Do we need __length_hint__ at all? (Was PEP 0424: A method for exposing a length hint) In-Reply-To: References: <5003D2CD.2030903@hotpy.org> Message-ID: <5003DD30.4080407@timgolden.me.uk> > Speaking of which - I find this bikeshed disgusting. Disgusting? Irritating, perhaps, but why should a PEP -- even one whose purpose is to codify existing practice -- not result in discussions about its subject matter? The final P stands for Proposal, not for Pronouncement. 
TJG From stefan_ml at behnel.de Mon Jul 16 11:37:30 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 16 Jul 2012 11:37:30 +0200 Subject: [Python-Dev] Do we need __length_hint__ at all? (Was PEP 0424: A method for exposing a length hint) In-Reply-To: References: <5003D2CD.2030903@hotpy.org> Message-ID: Maciej Fijalkowski, 16.07.2012 11:14: > On Mon, Jul 16, 2012 at 11:02 AM, Stefan Behnel wrote: >> Mark Shannon, 16.07.2012 10:37: >>> If resizing of lists is too slow, then we should reconsider the 9/8 >>> factor and/or look to tweak the resizing code. >> >> The thing is that the performance is platform specific. On systems with a >> fast memory allocator, especially on Linux, the difference is negligible. >> However, with a slow memory allocator, especially a slow realloc(), e.g. on >> Windows or Solaris, this can substantially hurt the performance, up to a >> quadratically increasing runtime in the worst case. >> >> The length hint was implemented specifically to work around this problem. > > It's not the actual allocation (typically), it's the copying that's the > problem. Note that a call to realloc() includes that part and can avoid copying if possible. A good realloc() implementation can make this use case run in amortised linear time, at least on a system with sufficient memory. A bad one can result in quadratic runtime, which is way more than a change by a constant factor. Thus my above comment that it's platform specific. > there are cases where it *can* be significant (as in the copying is by far > the dominating operation), most notable giant templates with iterators > written in C. Absolutely. This is particularly visible at the C level because C implemented iterators have a very low overhead overall. > Speaking of which - I find this bikeshed disgusting. The purpose of the PEP > is to codify whatever is already written in code in CPython. 
If you guys > don't want it, we'll just implement it anyway and try to follow the CPython > current implementation from 2.7. The idea behind this bikeshedding is that at the moment where we make this an official protocol rather than an implementation detail, it should be able to communicate the different states on the callee side of such a protocol. I.e. it needs a better design than the current one. Stefan From raymond.hettinger at gmail.com Mon Jul 16 11:38:00 2012 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Mon, 16 Jul 2012 02:38:00 -0700 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <20120715162232.25c040b7@pitrou.net> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> <5002C9DB.2000409@pearwood.info> <20120715162232.25c040b7@pitrou.net> Message-ID: <1766DA38-1DBC-452A-9442-4DA5DD4E70BF@gmail.com> On Jul 15, 2012, at 7:22 AM, Antoine Pitrou wrote: > On Mon, 16 Jul 2012 00:08:41 +1000 > Nick Coghlan wrote: >> Right, I agree on the value in being able to return something to say "this >> cannot be converted to a concrete container". > > Who would be able to return that, apart from trivial cases like > itertools.cycle()? FWIW, here are the notes from the docstring in Lib/test/test_iterlen.py: """ Test Iterator Length Transparency Some functions or methods which accept general iterable arguments have optional, more efficient code paths if they know how many items to expect. For instance, map(func, iterable), will pre-allocate the exact amount of space required whenever the iterable can report its length. The desired invariant is: len(it)==len(list(it)). A complication is that an iterable and iterator can be the same object. To maintain the invariant, an iterator needs to dynamically update its length. For instance, an iterable such as xrange(10) always reports its length as ten, but it=iter(xrange(10)) starts at ten, and then goes to nine after it.next(). 
Having this capability means that map() can ignore the distinction between map(func, iterable) and map(func, iter(iterable)). When the iterable is immutable, the implementation can straight-forwardly report the original length minus the cumulative number of calls to next(). This is the case for tuples, xrange objects, and itertools.repeat(). Some containers become temporarily immutable during iteration. This includes dicts, sets, and collections.deque. Their implementation is equally simple though they need to permanently set their length to zero whenever there is an attempt to iterate after a length mutation. The situation slightly more involved whenever an object allows length mutation during iteration. Lists and sequence iterators are dynamically updatable. So, if a list is extended during iteration, the iterator will continue through the new items. If it shrinks to a point before the most recent iteration, then no further items are available and the length is reported at zero. Reversed objects can also be wrapped around mutable objects; however, any appends after the current position are ignored. Any other approach leads to confusion and possibly returning the same item more than once. The iterators not listed above, such as enumerate and the other itertools, are not length transparent because they have no way to distinguish between iterables that report static length and iterators whose length changes with each call (i.e. the difference between enumerate('abc') and enumerate(iter('abc')). """ Raymond -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From storchaka at gmail.com Mon Jul 16 11:42:09 2012 From: storchaka at gmail.com (Serhiy Storchaka) Date: Mon, 16 Jul 2012 12:42:09 +0300 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: References: Message-ID: On 16.07.12 10:36, Stefan Behnel wrote: > Return values could be -1 for "don't know" and -2 for "infinite" at the C > level, and NotImplemented for "don't know" at the Python level. PY_SSIZE_T_MAX is a better value for "infinite". In any case there is no difference for the consumer between PY_SSIZE_T_MAX and a real infinity. From kristjan at ccpgames.com Mon Jul 16 11:42:15 2012 From: kristjan at ccpgames.com (=?iso-8859-1?Q?Kristj=E1n_Valur_J=F3nsson?=) Date: Mon, 16 Jul 2012 09:42:15 +0000 Subject: [Python-Dev] early startup error reporting failure Message-ID: Hi there. I've been busy taking the current beta candidate and merging it into the stackless repo. As expected, things don't just go smoothly and there are the usual startup errors, this being a rather intrusive patch and all that. However, I found that early startup errors were not being reported correctly, so I had to make some changes to fix that. I'm not sure these are the correct fixes, so I'd like to start this here and see if anyone feels responsible. Right: The initial error occurs here: if (PyImport_ImportFrozenModule("_frozen_importlib") <= 0) { Py_FatalError("Py_Initialize: can't import _frozen_importlib"); My problem was that the actual exception was not being reported along with the FatalError message. Digging around a bit, I found the cause here: fileobject.c, PyFile_WriteString() } else if (!PyErr_Occurred()) { That is, this function declines to write anything if there is an exception present. My quick and dirty fix was to remove this test and just print even with a present exception. That fixes the issue. 
But perhaps the _correct_ way is to suppress the exception higher up in the callchain, which is this: > python33_d.dll!PyFile_WriteString(const char * s, _object * f) Line 179 C python33_d.dll!PyTraceBack_Print(_object * v, _object * f) Line 415 + 0x11 bytes C python33_d.dll!print_exception(_object * f, _object * value) Line 1748 + 0x12 bytes C python33_d.dll!print_exception_recursive(_object * f, _object * value, _object * seen) Line 1889 C python33_d.dll!PyErr_Display(_object * exception, _object * value, _object * tb) Line 1913 C python33_d.dll!sys_excepthook(_object * self, _object * args) Line 197 C python33_d.dll!PyCFunction_Call(_object * func, _object * arg, _object * kw) Line 99 + 0x46 bytes C python33_d.dll!PyObject_Call(_object * func, _object * arg, _object * kw) Line 2149 + 0x48 bytes C python33_d.dll!PyEval_CallObjectWithKeywords(_object * func, _object * arg, _object * kw) Line 4584 C python33_d.dll!PyErr_PrintEx(int set_sys_last_vars) Line 1686 + 0x12 bytes C python33_d.dll!Py_FatalError(const char * msg) Line 2358 C Perhaps error should be fetched and restored in PyTraceback_Print(), since it already does some exception juggling, obviously assuming that an exception state can be present that it is worthwhile to preserve. Ok, then I came to the second issue. When printing the tracebacks, this early in the process, I hit upon this code, in traceback.c, tb_displayline(), I made this change (line 344): - return _Py_DisplaySourceLine(f, filename, lineno, 4); + /* ignore IO errors here, IO may not be ready yet */ + if ( _Py_DisplaySourceLine(f, filename, lineno, 4) ) + PyErr_Clear(); + return err; This early in the process, IO cannot be imported so it is impossible to output source line. The source line output is a "bonus" feature anyway and we shouldn't, IMHO, fail outputting tracebacks if we cannot read the code. The actual failure was importing the IO library. 
Perhaps an alternative fix, then, is to fix the _Py_DisplaySourceLine() so that it deals with failure to import IO in the same way as failure to read the file, i.e. just returns a "success" value of 0. With these changes, I was able to successfully output the error. Hopefully I will be able to debug it too :) Any thoughts? -------------- next part -------------- An HTML attachment was scrubbed... URL: From raymond.hettinger at gmail.com Mon Jul 16 11:51:39 2012 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Mon, 16 Jul 2012 02:51:39 -0700 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: References: Message-ID: <9DBB02E3-26DE-4567-9F08-611E18730BD4@gmail.com> On Jul 16, 2012, at 12:36 AM, Stefan Behnel wrote: > I'd like to more visibly repeat my suggestion to make this a slot method > "tp_length_hint()" in extension types that returns a Py_ssize_t. That is merely an implementation detail, but it would be a nice improvement. Raymond -------------- next part -------------- An HTML attachment was scrubbed... URL: From kristjan at ccpgames.com Mon Jul 16 12:20:43 2012 From: kristjan at ccpgames.com (=?iso-8859-1?Q?Kristj=E1n_Valur_J=F3nsson?=) Date: Mon, 16 Jul 2012 10:20:43 +0000 Subject: [Python-Dev] early startup error reporting failure In-Reply-To: References: Message-ID: Looking better at the code, the fileobject change isn't necessary. A simpler fix is to just ignore and clear errors from _Py_DisplaySourceLine. I'll prepare a defect/patch K ________________________________ From: python-dev-bounces+kristjan=ccpgames.com at python.org [python-dev-bounces+kristjan=ccpgames.com at python.org] on behalf of Kristján Valur Jónsson [kristjan at ccpgames.com] Sent: 16 July 2012 09:42 To: python-dev at python.org Subject: [Python-Dev] early startup error reporting failure Hi there. I've been busy taking the current beta candidate and merging it into the stackless repo. 
As expected, things don't just go smoothly and there are the usual startup errors, this being a rather intrusive patch and all that. However, I found that early startup errors were not being reported correctly, so I had do make some changes to fix that. I'm not sure these are the correct fixes, so I'd like to start this here and see if anyone feels responsible. Right: The initial error occurs here: if (PyImport_ImportFrozenModule("_frozen_importlib") <= 0) { Py_FatalError("Py_Initialize: can't import _frozen_importlib"); My problem was that the actual exception was not being reported along with the FatalError message. Digging around a bit, I found the cause here: fileobject.c, PyFile_WriteString() } else if (!PyErr_Occurred()) { That is, this function declines to write anything if there is an exception present. My quick and dirty fix was to remove this test and just print even with a present exception. That fixes the issue. But perhaps the _correct_ way is to suppress the exception higher up in the callchain, which is this: > python33_d.dll!PyFile_WriteString(const char * s, _object * f) Line 179 C python33_d.dll!PyTraceBack_Print(_object * v, _object * f) Line 415 + 0x11 bytes C python33_d.dll!print_exception(_object * f, _object * value) Line 1748 + 0x12 bytes C python33_d.dll!print_exception_recursive(_object * f, _object * value, _object * seen) Line 1889 C python33_d.dll!PyErr_Display(_object * exception, _object * value, _object * tb) Line 1913 C python33_d.dll!sys_excepthook(_object * self, _object * args) Line 197 C python33_d.dll!PyCFunction_Call(_object * func, _object * arg, _object * kw) Line 99 + 0x46 bytes C python33_d.dll!PyObject_Call(_object * func, _object * arg, _object * kw) Line 2149 + 0x48 bytes C python33_d.dll!PyEval_CallObjectWithKeywords(_object * func, _object * arg, _object * kw) Line 4584 C python33_d.dll!PyErr_PrintEx(int set_sys_last_vars) Line 1686 + 0x12 bytes C python33_d.dll!Py_FatalError(const char * msg) Line 2358 C Perhaps 
error should be fetched and restored in PyTraceback_Print(), since it already does some exception juggling, obviously assuming that an exception state can be present that it is worthwhile to preserve. Ok, then I came to the second issue. When printing the tracebacks, this early in the process, I hit upon this code, in traceback.c, tb_displayline(), I made this change (line 344): - return _Py_DisplaySourceLine(f, filename, lineno, 4); + /* ignore IO errors here, IO may not be ready yet */ + if ( _Py_DisplaySourceLine(f, filename, lineno, 4) ) + PyErr_Clear(); + return err; This early in the process, IO cannot be imported so it is impossible to output source line. The source line output is a "bonus" feature anyway and we shouldn't, IMHO, fail outputting tracebacks if we cannot read the code. The actual failure was importing the IO library. Perhaps an alternative fix, then, is to fix the _Py_DisplaySourceLine() so that it deals with failure to import IO in the same way as failure to read the file, i.e. just returns a "success" value of 0. With these changes, I was able to successfully output the error. Hopefully I will be able to debug it too :) Any thoughts? -------------- next part -------------- An HTML attachment was scrubbed... URL: From victor.stinner at gmail.com Mon Jul 16 13:05:20 2012 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 16 Jul 2012 13:05:20 +0200 Subject: [Python-Dev] Do we need __length_hint__ at all? (Was PEP 0424: A method for exposing a length hint) In-Reply-To: <5003D2CD.2030903@hotpy.org> References: <5003D2CD.2030903@hotpy.org> Message-ID: > *If* resizing list is so slow, then why not make it faster? A simple solution to speed up such problem is to change the overallocation factor, but it may waste memory. Python tries to be fast and to not waste too much memory. > Why is it a significant optimisation? > How much slower is it? > Where is the data? I worked recently on optimizing str%args and str.format(args). 
Handling memory allocation correctly is critical for performance, especially for str with the PEP 393, because we have to shrink the buffer to the exact string length when the formatting function is done. I tried various overallocation factors and I chose 1.25 (5/4) because it was the fastest. See for example this issue for benchmark numbers: http://bugs.python.org/issue14687 The PyUnicodeWriter internal object uses various options to choose how many bytes should be allocated: * an overallocation flag to disable overallocation when we know that we are writing the last character/string into the buffer * a "minimal length" used for the first allocation * a hardcoded overallocation factor (1.25) PyUnicodeWriter is a little bit different from the __length_hint__ issue because PyUnicodeWriter has to shrink the buffer when it is done, but I can say that overallocation is very useful for speed. Victor From brett at python.org Mon Jul 16 15:01:14 2012 From: brett at python.org (Brett Cannon) Date: Mon, 16 Jul 2012 09:01:14 -0400 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: References: Message-ID: On Mon, Jul 16, 2012 at 3:36 AM, Stefan Behnel wrote: > Alex Gaynor, 15.07.2012 00:11: > > CPython currently defines an ``__length_hint__`` method on several > types, such > > as various iterators. This method is then used by various other > functions (such > > as ``map``) to presize lists based on the estimated returned by > > ``__length_hint__``. Types can then define ``__length_hint__`` which are > not > > sized, and thus should not define ``__len__``, but can estimate or > compute a > > size (such as many iterators). > > > > Proposal > > ======== > > > > This PEP proposes formally documenting ``__length_hint__`` for other > > interpreter and non-standard library Python to implement. > > > > ``__length_hint__`` must return an integer, and is not required to be > accurate. 
> > It may return a value that is either larger or smaller than the actual > size of > > the container. It may raise a ``TypeError`` if a specific instance > cannot have > > its length estimated. It may not return a negative value. > > I'd like to more visibly repeat my suggestion to make this a slot method > "tp_length_hint()" in extension types that returns a Py_ssize_t. > > That suggests that a negative return value would have a special meaning > instead of relying on return values like NotImplemented. The Python wrapper > of that slot method could still implement a mapping for this. > > Return values could be -1 for "don't know" and -2 for "infinite" at the C > level, and NotImplemented for "don't know" at the Python level. Not sure > about a good Python value for "infinite". > Gods no. Making the return value different in C vs. Python code is just asking for trouble in terms of having to remember that specific difference while coding. Plus asking for people to check for an explicit negative values instead of just >= 0 would be problematic and prone to error. > > Maybe return -1 for "infinite" at both levels and -2/NotImplemented for > "don't know" in C/Python? That would suggest -3 to propagate exceptions at > the C level. > See above. This is another reason why I don't think the infinite iterator concept is worth expressin. It's just mucking things up for no good reason. "infinite" == "I don't know" in the case of pre-allocation of a list. Just raise an exception or return None and be done with it. Nice and simple. And my vote is for an exception as EAFP. -Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Mon Jul 16 15:23:18 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 16 Jul 2012 23:23:18 +1000 Subject: [Python-Dev] Do we need __length_hint__ at all? 
(Was PEP 0424: A method for exposing a length hint) In-Reply-To: <5003DD30.4080407@timgolden.me.uk> References: <5003D2CD.2030903@hotpy.org> <5003DD30.4080407@timgolden.me.uk> Message-ID: On Mon, Jul 16, 2012 at 7:21 PM, Tim Golden wrote: >> Speaking of which - I find this bikeshed disgusting. > > Disgusting? Irritating, perhaps, but why should a PEP -- even one whose > purpose is to codify existing practice -- not result in discussions > about its subject matter? > > The final P stands for Proposal, not for Pronouncement. Indeed - I'd be worried if any PEP sailed through python-dev review without a thorough kicking of the tires. Yes, it can be annoying having to bring people up to speed on issues that they aren't familiar with, but that's generally a sign that there is relevant background information *missing from the PEP*. PEP's aren't supposed to be written just for people that are already intimately familiar with a problem - they're supposed to provide enough background that they stand on their own. In this case, the key points that I think need to be added: - more background on why the __length_hint__ API exists in CPython in the first place: to minimise potentially expensive data copies (due to memory reallocation) when creating a concrete container from an iterator. This includes when creating them from another concrete container via an intermediate iterator. This is why at least the following produce objects that define __length_hint__ in CPython: reversed itertools.repeat iter(dict()) iter(list()) iter(tuple()) iter(str()) iter(bytes()) iter(bytearray()) iter(set()) iter(frozenset()) dict.values() dict.keys() As well as any user defined sequence that relies on the default sequence iterator: >>> class MySeq(): ... def __getitem__(self, idx): ... return idx ... def __len__(self): ... return 10 ... 
>>> hasattr(iter(MySeq()), "__length_hint__") True - clarification on the implications of it only being a "hint": specifically, as it may be an over- or underestimate, you *cannot* rely on the hint to decide whether or not to iterate over the object when a valid length is returned (as a value of zero may be an underestimate). However, it does allow you to presize your container more appropriately than just starting at zero as usual, thus achieving the aim of reducing the risk of unnecessary memory copies. That's the basic proposal. Separate from that, there are a few suggestions for *enhancement* beyond what CPython currently uses (and has demonstrated a clear need for): - adding operator.length_hint(). This makes sense to me, as it makes it much easier to use the API when implementing a container type in Python. It's also a pretty simple addition. - adding a C level type slot. I'm personally -1 on this one in the context of the PEP. I don't think the current PEP (which is really aimed at standardisation for PyPy's benefit) should be weighed down with this CPython specific implementation detail. As a separate proposal, independent of this PEP, from someone that cares specifically about micro-optimising this API for CPython, and (preferably) can produce benchmark numbers to show the additional implementation complexity is worthwhile, then I wouldn't object. I just don't want this orthogonal concern to derail the standardisation effort. - distinguishing between different reasons for saying "don't preallocate any space" (i.e. returning zero). I still haven't heard a convincing rationale for this one - it seems to be based on the notion that the case of skipping the iteration step for a genuinely empty iterable is worth optimising. This idea just doesn't make sense when any legitimate length value that is returned can't be trusted to be completely accurate - you have to iterate to confirm the actual number. 
- making it possible to fail *fast* when a known infinite iterator
(like itertools.cycle or itertools.count) is passed to a concrete
container. I think this is best covered in the PEP by explicitly
stating that some types may implement __length_hint__ to always raise
TypeError rather than defining a magic return value that means "I'm
infinite".

- making it possible for iterators like enumerate, map and filter to
delegate __length_hint__ to their underlying iterator. This seems
potentially worth doing, but requires resolving the problem that
Raymond noted with handling the difference in internal behaviour
between enumerate("abc") and enumerate(iter("abc")). Again, it would
be unfortunate to see the PEP held up over this.

- making it possible to define __length_hint__ for generator-iterator
objects. While this is a nice idea, again, I don't think it's
something that this particular PEP should be burdened with.

My main point is that the current __length_hint__ behaviour has
already proven its value in the real world. The PyPy team would like
that behaviour codified, so they can be reasonably sure both
implementations are doing the same thing. Many of the enhancements I
have listed above may be suitable material for future enhancement
proposals, but I'm not seeing any requested functionality that would
be actively *blocked* by simply codifying the status quo.

The PEP itself already has this general tone, but I think that it
should be even more emphatic that it's about codifying the status quo,
*not* about modifying it with ideas that haven't been proven useful
through past experience.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From solipsis at pitrou.net  Mon Jul 16 16:01:54 2012
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Mon, 16 Jul 2012 16:01:54 +0200
Subject: [Python-Dev] Do we need __length_hint__ at all?
(Was PEP 0424: A method for exposing a length hint) References: <5003D2CD.2030903@hotpy.org> <5003DD30.4080407@timgolden.me.uk> Message-ID: <20120716160154.5aaa2591@pitrou.net> On Mon, 16 Jul 2012 23:23:18 +1000 Nick Coghlan wrote: > > - distinguishing between different reasons for saying "don't > preallocate any space" (i.e. returning zero). I still haven't heard a > convincing rationale for this one The point is that zero is a valid value for a length hint. By making it a special value for "don't know", you are making the protocol potentially confusing, and you are also departing from the current semantics. (and, yes, I think distinguishing between zero and "don't know" is useful: imagine a container that would preallocate 256 entries by default when the answer is "don't know") > The PEP itself already has this general tone, but I think that it > should be even more emphatic that it's about codifying the status quo, > *not* about modifying it with ideas haven't been proven useful through > past experience. Then the PEP shouldn't address infinite iterators at all. Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From ncoghlan at gmail.com Mon Jul 16 16:18:55 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 17 Jul 2012 00:18:55 +1000 Subject: [Python-Dev] Do we need __length_hint__ at all? (Was PEP 0424: A method for exposing a length hint) In-Reply-To: <20120716160154.5aaa2591@pitrou.net> References: <5003D2CD.2030903@hotpy.org> <5003DD30.4080407@timgolden.me.uk> <20120716160154.5aaa2591@pitrou.net> Message-ID: On Tue, Jul 17, 2012 at 12:01 AM, Antoine Pitrou wrote: > On Mon, 16 Jul 2012 23:23:18 +1000 > Nick Coghlan wrote: >> >> - distinguishing between different reasons for saying "don't >> preallocate any space" (i.e. returning zero). I still haven't heard a >> convincing rationale for this one > > The point is that zero is a valid value for a length hint. 
By making it > a special value for "don't know", you are making the protocol > potentially confusing, and you are also departing from the current > semantics. No, it just means the default estimate is always zero. If you don't do that, then *every* client of __length_hint__ has to check for the magic value. It's making the API harder to use for no good reason. > (and, yes, I think distinguishing between zero and "don't know" is > useful: imagine a container that would preallocate 256 entries by > default when the answer is "don't know") Such a container has to already deal with the case when it overestimates severely. The only cost of using zero as a default estimate is that such hypothetical containers will overallocate when they technically didn't need to, which will already happen for any empty iterator that doesn't provide __length_hint__ at all. Given that all standard library containers default to assuming a size of zero (absent information indicating otherwise), and a special value would need to be special cased by *every* client of the API (and almost always defaulted to zero), it's simply not a good trade-off. >> The PEP itself already has this general tone, but I think that it >> should be even more emphatic that it's about codifying the status quo, >> *not* about modifying it with ideas haven't been proven useful through >> past experience. > > Then the PEP shouldn't address infinite iterators at all. Noting that infinite iterators are free to define __length_hint__ to always throw an exception *is* the status quo. We just haven't done it for the stdlib ones. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From solipsis at pitrou.net Mon Jul 16 16:40:04 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 16 Jul 2012 16:40:04 +0200 Subject: [Python-Dev] Do we need __length_hint__ at all? 
(Was PEP 0424: A method for exposing a length hint)
In-Reply-To: 
References: <5003D2CD.2030903@hotpy.org> <5003DD30.4080407@timgolden.me.uk>
	<20120716160154.5aaa2591@pitrou.net>
Message-ID: <20120716164004.544d692a@pitrou.net>

On Tue, 17 Jul 2012 00:18:55 +1000
Nick Coghlan wrote:
>
> Given that all standard library containers default to assuming a size
> of zero (absent information indicating otherwise), and a special value
> would need to be special cased by *every* client of the API (and
> almost always defaulted to zero), it's simply not a good trade-off.

Actually, dict and set default to a non-zero internal size, but I agree
making containers harder to implement is not a good thing.

> >> The PEP itself already has this general tone, but I think that it
> >> should be even more emphatic that it's about codifying the status quo,
> >> *not* about modifying it with ideas haven't been proven useful through
> >> past experience.
> >
> > Then the PEP shouldn't address infinite iterators at all.
>
> Noting that infinite iterators are free to define __length_hint__ to
> always throw an exception *is* the status quo.

Being "free" to do unexpected or unconventional things is not the same
thing as codifying those behaviours in a PEP, especially when no one is
actually doing them. __length_hint__ is supposed to be informative; it
shouldn't error out on you.

So still -1 from me.

Regards

Antoine.

From mstefanro at gmail.com  Mon Jul 16 21:36:21 2012
From: mstefanro at gmail.com (M Stefan)
Date: Mon, 16 Jul 2012 22:36:21 +0300
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To: 
References: <5002D063.1060401@hotpy.org>
Message-ID: <50046D35.3000708@gmail.com>

On 7/16/2012 9:54 AM, Stefan Behnel wrote:
> Mark Shannon, 15.07.2012 16:14:
>> Alex Gaynor wrote:
>>> CPython currently defines an ``__length_hint__`` method on several types,
>>> such
>>> as various iterators.
This method is then used by various other functions
>>> (such as ``map``) to presize lists based on the estimate returned by
>> Don't use "map" as an example.
>> map returns an iterator so it doesn't need __length_hint__
> Right. It's a good example for something else, though. As I mentioned
> before, iterators should be able to propagate the length hint of an
> underlying iterator, e.g. in generator expressions or map(). I consider
> that an important feature that the protocol must support.
>
> Stefan
>
> _______________________________________________
> Python-Dev mailing list
> Python-Dev at python.org
> http://mail.python.org/mailman/listinfo/python-dev
> Unsubscribe: http://mail.python.org/mailman/options/python-dev/mstefanro%40gmail.com
>

map() is quite problematic in this matter, and may actually benefit
from the existence of __length_hint__. It is very easy to create an
infinite loop currently by doing stuff like x=[1]; x+=map(str,x)

[61081 refs]
>>> x=[1]; x+=map(str,x)
Traceback (most recent call last):
  ...
MemoryError
[120959834 refs]
>>> len(x)
120898752

Obviously, this won't cause an infinite loop in Python2 where map is
non-lazy. Also, this won't work for all mutable containers, because not
all of them permit adding elements while iterating:

>>> s=set([1]); s.update(map(str,s))
Traceback (most recent call last):
  ...
RuntimeError: Set changed size during iteration
[61101 refs]
>>> s
{1, '1'}
[61101 refs]
>>> del s
[61099 refs]

If map objects were to disallow changing the size of the container
while iterating (I can't really think of a use-case in which such a
limitation would be harmful), it might as well be with __length_hint__.

Also, what would iter([1,2,3]).__length_hint__() return? 3 or unknown?
If 3, then the semantics of l=[1,2,3]; l += iter(l) will change
(infinite loop without __length_hint__ vs. list of 6 elements with
__length_hint__).
If unknown, then it doesn't seem like there are very many places where __length_hint__ can return anything but unknown. Regards, Stefan M From ethan at stoneleaf.us Mon Jul 16 21:59:20 2012 From: ethan at stoneleaf.us (Ethan Furman) Date: Mon, 16 Jul 2012 12:59:20 -0700 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <50046D35.3000708@gmail.com> References: <5002D063.1060401@hotpy.org> <50046D35.3000708@gmail.com> Message-ID: <50047298.9040806@stoneleaf.us> M Stefan wrote: > Also, what would iter([1,2,3]).__length_hint__() return? 3 or unknown? > If 3, then the semantics of l=[1,2,3]; l += iter(l) will change > (infinite loop without __length_hint__ vs. list of 6 elements with __length_hint__). What __length_hint__ returns is irrelevant -- it's only a hint. Python will have to loop over all the items. So you would still get an infinite loop with the above code. ~Ethan~ From alex.gaynor at gmail.com Tue Jul 17 05:03:08 2012 From: alex.gaynor at gmail.com (Alex Gaynor) Date: Tue, 17 Jul 2012 03:03:08 +0000 (UTC) Subject: [Python-Dev] PEP 0424: A method for exposing a length hint References: Message-ID: I've updated the PEP to reflect the discussion. There are two major changes: 1) NotImplemented may be used by __length_hint__ to indicate that there is no finite length hint available. 2) callers of operator.length_hint() must provide their own default value, this is also required by the current C _PyObject_LengthHint implementation. There are no provisions for infinite iterators, that is not within the scope of this proposal. Alex From ncoghlan at gmail.com Tue Jul 17 05:19:55 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 17 Jul 2012 13:19:55 +1000 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: References: Message-ID: On Tue, Jul 17, 2012 at 1:03 PM, Alex Gaynor wrote: > I've updated the PEP to reflect the discussion. 
There are two major changes: > > 1) NotImplemented may be used by __length_hint__ to indicate that there is no > finite length hint available. I've been thinking about this a bit more, and I think it does provide good scope for eventually adding __length_hint__ to more iterators (including map, filter and enumerate). > 2) callers of operator.length_hint() must provide their own default value, this > is also required by the current C _PyObject_LengthHint implementation. And this makes it explicit that API users need to deal with the AttributeError/NotImplemented case, whilst making it easy to do so. Good call. > There are no provisions for infinite iterators, that is not within the scope of > this proposal. I'll repeat my observation that remaining silent on this point is effectively identical to blessing the practice of raising an exception in __length_hint__ to force fast failure of attempts to convert an infinite iterator to a concrete container. Rather than leaving people to figure this out on their own, we may as well make it official that TypeError can be raised in __length_hint__ to block conversion to concrete containers that use a preallocation strategy. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From eliben at gmail.com Tue Jul 17 05:34:14 2012 From: eliben at gmail.com (Eli Bendersky) Date: Tue, 17 Jul 2012 06:34:14 +0300 Subject: [Python-Dev] io.BytesIO slower than monkey-patching io.RawIOBase Message-ID: While working on #1767933, Serhiy came up with an observation that "monkey-patching" one of the base classes of io is faster than using BytesIO when in need of a file-like object for writing into. 
I've distilled it into this standalone test:

import io
data = [b'a'*10, b'bb'*5, b'ccc'*5] * 10000

def withbytesio():
    bio = io.BytesIO()
    for i in data:
        bio.write(i)
    return bio.getvalue()

def monkeypatching():
    mydata = []
    file = io.RawIOBase()
    file.writable = lambda: True
    file.write = mydata.append
    for i in data:
        file.write(i)
    return b''.join(mydata)

The second approach is consistently 10-20% faster than the first one
(depending on input) for trunk Python 3.3

Is there any reason for this to be so? What does BytesIO give us that
the second approach does not (I tried adding more methods to the
patched RawIOBase to make it more functional, like seekable() and
tell(), and it doesn't affect performance)?

This also raises a "moral" question - should I be using the second
approach deep inside the stdlib (ET.tostring) just because it's faster?

Eli

From jxo6948 at rit.edu  Tue Jul 17 06:57:04 2012
From: jxo6948 at rit.edu (John O'Connor)
Date: Tue, 17 Jul 2012 00:57:04 -0400
Subject: [Python-Dev] io.BytesIO slower than monkey-patching io.RawIOBase
In-Reply-To: 
References: 
Message-ID: 

>
> The second approach is consistently 10-20% faster than the first one
> (depending on input) for trunk Python 3.3
>

I think the difference is that BytesIO spends extra time reallocating
memory during the write loop as it grows, whereas bytes.join computes
the allocation size first since it already knows the final length.
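John's explanation is easy to check directly. The sketch below (an editor's reconstruction, not part of the original thread) repeats Eli's two functions so it runs standalone and times them with timeit; absolute numbers will vary by machine and Python version.

```python
import io
import timeit

data = [b'a' * 10, b'bb' * 5, b'ccc' * 5] * 10000

def withbytesio():
    bio = io.BytesIO()
    for chunk in data:
        bio.write(chunk)       # may trigger buffer reallocations as it grows
    return bio.getvalue()

def monkeypatching():
    mydata = []
    f = io.RawIOBase()
    f.writable = lambda: True
    f.write = mydata.append    # collect fragments, concatenate once at the end
    for chunk in data:
        f.write(chunk)
    return b''.join(mydata)    # join() preallocates the exact final size

# Both build the same 350000-byte result; only the buffer strategy differs.
assert withbytesio() == monkeypatching()

for func in (withbytesio, monkeypatching):
    best = min(timeit.repeat(func, number=10, repeat=3))
    print(func.__name__, '%.4f' % best)
```

On builds where BytesIO still grows its buffer incrementally, the append/join variant should come out ahead, consistent with the 10-20% Eli reports.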
From ncoghlan at gmail.com Tue Jul 17 07:48:44 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 17 Jul 2012 15:48:44 +1000 Subject: [Python-Dev] io.BytesIO slower than monkey-patching io.RawIOBase In-Reply-To: References: Message-ID: On Tue, Jul 17, 2012 at 2:57 PM, John O'Connor wrote: >> >> The second approach is consistently 10-20% faster than the first one >> (depending on input) for trunk Python 3.3 >> > > I think the difference is that StringIO spends extra time reallocating > memory during the write loop as it grows, whereas bytes.join computes > the allocation size first since it already knows the final length. BytesIO is actually missing an optimisation that is already used in StringIO: the StringIO C implementation uses a fragment accumulator internally, and collapses that into a single string object when getvalue() is called. BytesIO is still using the old "resize-the-buffer-as-you-go" strategy, and thus ends up repeatedly reallocating the buffer as the data sequence grows incrementally. It should be optimised to work the same way StringIO does (which is effectively the same way that the monkeypatched version works) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From dickinsm at gmail.com Tue Jul 17 09:49:26 2012 From: dickinsm at gmail.com (Mark Dickinson) Date: Tue, 17 Jul 2012 08:49:26 +0100 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint In-Reply-To: <20120715143641.3d2bb83d@pitrou.net> References: <50027D79.8010500@pearwood.info> <20120715143641.3d2bb83d@pitrou.net> Message-ID: On Sun, Jul 15, 2012 at 1:36 PM, Antoine Pitrou wrote: > On Sun, 15 Jul 2012 18:47:38 +1000 > Nick Coghlan wrote: >> I'm not seeing the value in returning None over 0 for the don't know >> case - it just makes the API harder to use. > > The point is that 0 is a legitimate value for a length hint. 
Simple > implementations of __length_hint__ will start returning 0 as a > legitimate value and you will wrongly interpret that as "don't know", > which kinds of defeat the purpose of __length-hint__ ;) I agree with this: giving special meaning to what's already a valid length value seems wrong. Mark From solipsis at pitrou.net Tue Jul 17 10:30:41 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 17 Jul 2012 10:30:41 +0200 Subject: [Python-Dev] PEP 0424: A method for exposing a length hint References: Message-ID: <20120717103041.2902d4b5@pitrou.net> On Tue, 17 Jul 2012 13:19:55 +1000 Nick Coghlan wrote: > > > There are no provisions for infinite iterators, that is not within the scope of > > this proposal. > > I'll repeat my observation that remaining silent on this point is > effectively identical to blessing the practice of raising an exception > in __length_hint__ to force fast failure of attempts to convert an > infinite iterator to a concrete container. And I'll repeat that it is false ;) Being silent is certainly not the same thing as blessing a non-existent practice. Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From solipsis at pitrou.net Tue Jul 17 10:35:02 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 17 Jul 2012 10:35:02 +0200 Subject: [Python-Dev] io.BytesIO slower than monkey-patching io.RawIOBase References: Message-ID: <20120717103502.6c9cad90@pitrou.net> On Tue, 17 Jul 2012 06:34:14 +0300 Eli Bendersky wrote: > > Is there any reason for this to be so? What does BytesIO give us that the > second approach does not (I tried adding more methods to the patched > RawIOBase to make it more functional, like seekable() and tell(), and it > doesn't affect performance)? Well, try implementing non-trivial methods such as readline() or seek(), and writing in the middle rather than at the end. As Nick said, we could implement the same optimization as in StringIO, i.e. 
only materialize the buffer when necessary.

Regards

Antoine.

-- 
Software development and contracting: http://pro.pitrou.net

From storchaka at gmail.com  Tue Jul 17 13:59:20 2012
From: storchaka at gmail.com (Serhiy Storchaka)
Date: Tue, 17 Jul 2012 14:59:20 +0300
Subject: [Python-Dev] io.BytesIO slower than monkey-patching io.RawIOBase
In-Reply-To: 
References: 
Message-ID: 

On 17.07.12 06:34, Eli Bendersky wrote:
> The second approach is consistently 10-20% faster than the first one
> (depending on input) for trunk Python 3.3
>
> Is there any reason for this to be so? What does BytesIO give us that
> the second approach does not (I tried adding more methods to the patched
> RawIOBase to make it more functional, like seekable() and tell(), and it
> doesn't affect performance)?

BytesIO resizes the underlying buffer when it overflows (overallocating
1/8 of the size and copying the old content to the new buffer). In total
it makes log[9/8](N) allocations and copies 8*N bytes (for large N). A
list uses the same strategy, but the number of chunks is usually
significantly smaller than the number of bytes. At the end, all these
chunks are concatenated by join(), which calculates the sum of the chunk
lengths and allocates the resulting array with the desired size. That is
why append/join is faster than BytesIO in this case.

There is another note, about ElementTree.tostringlist(). Creating the
DataStream class on every function call is too expensive, which is why
the "monkeypatched" version is several times faster than the DataStream
version for small data. But it is faster for long data too, because
data.append() is one attribute lookup slower than the "monkeypatched"
write=data.append.

> This also raises a "moral" question - should I be using the second
> approach deep inside the stdlib (ET.tostring) just because it's faster?

Please note that the previous version of Python used "monkeypatching".
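Serhiy's arithmetic can be sanity-checked with a simplified model of the overallocation policy (an editor's sketch, not the actual _io implementation): a buffer that grows by 1/8 of its size per reallocation needs roughly log base 9/8 of N allocations, and the copies sum to on the order of 8*N bytes.

```python
import math

def simulate_overallocation(n, start=64):
    """Model a buffer that grows by 1/8 of its size until it holds n bytes."""
    cap, allocs, copied = start, 0, 0
    while cap < n:
        copied += cap            # old contents are copied into the new buffer
        cap += max(cap // 8, 1)  # overallocate by 1/8 of the current size
        allocs += 1
    return allocs, copied

n = 10 ** 6
allocs, copied = simulate_overallocation(n)
print(allocs, round(math.log(n / 64, 9 / 8)))  # measured vs. log[9/8] estimate
print(copied / n)                              # amortized bytes copied per payload byte
```

Integer rounding makes the measured allocation count land a little above the closed-form estimate, and the copied/payload ratio comes out close to Serhiy's factor of 8.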
From eliben at gmail.com  Tue Jul 17 14:14:55 2012
From: eliben at gmail.com (Eli Bendersky)
Date: Tue, 17 Jul 2012 15:14:55 +0300
Subject: [Python-Dev] io.BytesIO slower than monkey-patching io.RawIOBase
In-Reply-To: 
References: 
Message-ID: 

On Tue, Jul 17, 2012 at 2:59 PM, Serhiy Storchaka wrote:

> On 17.07.12 06:34, Eli Bendersky wrote:
>
>> The second approach is consistently 10-20% faster than the first one
>> (depending on input) for trunk Python 3.3
>>
>> Is there any reason for this to be so? What does BytesIO give us that
>> the second approach does not (I tried adding more methods to the patched
>> RawIOBase to make it more functional, like seekable() and tell(), and it
>> doesn't affect performance)?
>>
>
> BytesIO resizes the underlying buffer when it overflows (overallocating
> 1/8 of the size and copying the old content to the new buffer). In total
> it makes log[9/8](N) allocations and copies 8*N bytes (for large N). A
> list uses the same strategy, but the number of chunks is usually
> significantly smaller than the number of bytes. At the end, all these
> chunks are concatenated by join(), which calculates the sum of the chunk
> lengths and allocates the resulting array with the desired size. That is
> why append/join is faster than BytesIO in this case.
>

I've created http://bugs.python.org/issue15381 to track this (optimizing
BytesIO).

> There is another note, about ElementTree.tostringlist(). Creating the
> DataStream class on every function call is too expensive, which is why
> the "monkeypatched" version is several times faster than the DataStream
> version for small data. But it is faster for long data too, because
> data.append() is one attribute lookup slower than the "monkeypatched"
> write=data.append.
>

I updated tostringlist() to use an outside class. This brings performance
back to the old code.

Eli
From regebro at gmail.com  Tue Jul 17 16:02:05 2012
From: regebro at gmail.com (Lennart Regebro)
Date: Tue, 17 Jul 2012 16:02:05 +0200
Subject: [Python-Dev] PEP 0424: A method for exposing a length hint
In-Reply-To: 
References: 
Message-ID: 

On Sun, Jul 15, 2012 at 1:28 AM, Alexandre Zani wrote:
> I'm +1 on not having a public API for this. Ultimately the contract
> for a length hint will depend heavily upon what you need it for. Some
> applications would require a length hint to be an "at least" others an
> "at most" and others something else entirely. Given that the contract
> here appears to be >=0, I don't think the length hint is particularly
> useful to the public at large.

Other possible related uses could be to get an approximate number of
results for a query without having to actually go through the whole
query, useful for databases and search engines. But then you *do* want
__len__ as well, so that also doesn't fit with the current PEP. But
maybe that's a completely different use case, even though it seems
related to me?

//Lennart

From victor.stinner at gmail.com  Tue Jul 17 19:34:05 2012
From: victor.stinner at gmail.com (Victor Stinner)
Date: Tue, 17 Jul 2012 19:34:05 +0200
Subject: [Python-Dev] Use function names instead of functions for
	os.supports_dir_fd?
Message-ID: 

Hi,

Python 3.3 introduced os.supports_dir_fd to check whether some os
functions accept a file descriptor instead of a path. The problem is
that os.supports_dir_fd is a set of functions, not a set of function
names. If os functions are monkey-patched, you can no longer test
whether a function supports file descriptors.

Monkey-patching is a common practice in Python. test_os.py replaces
os.exec*() functions temporarily, for example.

It's also inconsistent with the new time.get_clock_info() function,
which expects the name of a time function, not the function directly.
Victor

From victor.stinner at gmail.com  Tue Jul 17 20:38:37 2012
From: victor.stinner at gmail.com (Victor Stinner)
Date: Tue, 17 Jul 2012 20:38:37 +0200
Subject: [Python-Dev] A new JIT compiler for a faster CPython?
Message-ID: 

Hi,

I would like to write yet another JIT compiler for CPython. Before
writing anything, I would like your opinion because I don't know the
other Python compilers well. I also want to prepare a possible
integration into CPython from the beginning of the project, or at
least stay very close to the CPython project (and CPython
developers!). I did not understand exactly why the Unladen Swallow and
psyco projects failed, so please tell me if you think that my project
is going to fail too!


== Why? ==

CPython is still the reference implementation, new features are first
added to this implementation (ex: PyPy is not supporting Python 3 yet,
but there is a project to support Python 3). Some projects still rely
on low level properties of CPython, especially its C API (ex: numpy;
PyPy has a cpyext module to emulate the CPython C API).

A JIT is the most promising solution to speed up the main evaluation
loop: using a JIT, it is possible to compile a function for a specific
type on the fly and so enable deeper optimizations.

psyco is no longer maintained. It had its own JIT which is complex to
maintain. For example, it is hard to port it to new hardware.

LLVM is fast and the next version will be faster. LLVM has a
community, documentation, a lot of tools, and is active.

There are many Python compilers which are very fast, but most of them
only support a subset of Python or require modifying the code (ex:
specifying the type of all parameters and variables). For example, you
cannot run Django with Shedskin.

IMO PyPy is complex and hard to maintain. PyPy has a design completely
different from CPython's, and is much faster and has a better memory
footprint. I don't expect to be as fast as PyPy, just faster than
CPython.
== General idea ==

I don't want to replace CPython. This is an important point. All
other Python compilers try to write something completely new, which is
a huge task and makes it hard to stay compatible with CPython. I would
like to reuse as much CPython code as possible and not try to fight
against the GIL or reference counting, but try to cooperate instead.

I would like to use a JIT to generate specialized functions for a
combination of argument types. Specialization enables more
optimizations. I would like to use LLVM because LLVM is an active
project, has many developers and users, is fast, and the next version
will be faster! LLVM already supports common optimizations like
inlining.

My idea is to emit the same code as ceval.c from the bytecode to be
fully compatible with CPython, and then write a JIT to optimize
functions for a specific type.


== Roadmap ==

-- Milestone 1: Proof of concept --

* Use the bytecode produced by the CPython parser and compiler
* Only compile a single function
* Emit the same code as ceval.c using LLVM, but without tracing,
exceptions or signal handling (they will be added later)
* Support compiling and calling the following function: "def func(a,
b): return a+b"

The pymothoa project can be used as a base to quickly implement such a
proof of concept.

-- Milestone 2: Specialized function for the int type --

* Use type annotations to generate specialized functions for the int type
* Use a C int with a guard detecting integer overflow to fall back on
Python int

-- Milestone 3: JIT --

* Depending on the type seen at runtime, recompile the function to
generate specialized functions
* Use guards to fall back to a generic implementation if the type is
not the expected type
* Maybe drop the code using function annotations

At this step, we can start to benchmark to check if the (JIT) compiler
is faster than CPython.

-- Later (unsorted ideas) --

* Support exceptions
* Full support of Python
  - classes
  - list comprehension
  - etc.
* Optimizations:
  - avoid reference counting when possible
  - avoid temporary objects when possible
  - release the GIL when possible
  - inlining: should be very interesting with list comprehension
  - unroll loops?
  - lazy creation of the frame?
* Use registers instead of a stack in the "evaluation loop"?
* Add code to allow tracing and profiling
* Add code to handle signals (pending calls)
* Write a compiler using the AST, with a fallback to the bytecode?
(would it be faster? easier or more complex to maintain?)
* Test LLVM optimizers
* Compile a whole module or even a whole program
* Reduce memory footprint
* Type annotation to help the optimizer? (with guards?)
* "const" annotation to help the optimizer? (with guards?)
* Support any build option of Python:
  - support Python 2 (2.5, 2.6, 2.7) and 3 (3.1, 3.2, 3.3, 3.4)
  - support narrow and wide mode: flag at runtime?
  - support debug and release mode: flag at runtime?
  - support 32 and 64 bits mode on Windows?


== Other Python VMs and compilers ==

-- Fully Python compliant --

* `PyPy `_
* `Jython `_ based on the JVM
* `IronPython `_ based on the .NET VM
* `Unladen Swallow `_ fork of CPython 2.6 using LLVM
  - `Unladen Swallow Retrospective `_
  - `PEP 3146 `_
* `psyco `_ (fully Python compliant?), no longer maintained

-- Subset of Python to C++ --

* `Nuitka `_
* `Python2C `_
* `Shedskin `_
* `pythran `_ (no class, set, dict, exception, file handling, ...)

-- Subset of Python --

* `pymothoa `_: use LLVM; don't
support classes nor exceptions.
* `unpython `_: Python to C
* `Perthon `_: Python to Perl
* `Copperhead `_: Python to GPU (Nvidia)

-- Language very close to Python --

* `Cython `_: "Cython is a programming language based
on Python, with extra syntax allowing for optional static type
declarations." Based on `Pyrex `_


== See also ==

* `Volunteer developed free-threaded cross platform virtual machines?
`_

Victor Stinner

From stefan_ml at behnel.de  Tue Jul 17 20:58:55 2012
From: stefan_ml at behnel.de (Stefan Behnel)
Date: Tue, 17 Jul 2012 20:58:55 +0200
Subject: [Python-Dev] A new JIT compiler for a faster CPython?
In-Reply-To: 
References: 
Message-ID: 

Victor Stinner, 17.07.2012 20:38:
> -- Subset of Python --
>
> * `pymothoa `_: use LLVM; don't
> support classes nor exceptions.
> * `unpython `_: Python to C
> * `Perthon `_: Python to Perl
> * `Copperhead `_: Python to GPU (Nvidia)

You might also want to add numexpr and numba to that list. Numba might
actually be quite close to pymothoa (hadn't heard of it before).

Personally, I like the idea of having a JIT compiler more or less as an
extension module at hand. Sort-of like a co-processor, just in software.
Lets you run your code either interpreted or JITed, just as you need.

Note that the Cython project is working on a protocol to efficiently call
external C implemented Python functions by effectively unboxing them. That
explicitly includes JIT compiled code, and a JIT compiler could obviously
make good use of it from the other side as well.

Stefan

From d.s.seljebotn at astro.uio.no  Tue Jul 17 20:44:19 2012
From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn)
Date: Tue, 17 Jul 2012 20:44:19 +0200
Subject: [Python-Dev] A new JIT compiler for a faster CPython?
In-Reply-To: 
References: 
Message-ID: <5005B283.3030007@astro.uio.no>

I'll admit I didn't read through your email, but you should absolutely
check out Numba which is ramping up just now to do this:

https://github.com/numba

(I'm CC-ing their mailing list, perhaps some of them will read this and
respond.)

It is probably much less ambitious but that hopefully shouldn't stop you
cooperating. It's started by Travis Oliphant (who started NumPy); here
are his thoughts on PyPy and NumPy, which provide some of the background
for this project.
http://technicaldiscovery.blogspot.no/2011/10/thoughts-on-porting-numpy-to-pypy.html Dag On 07/17/2012 08:38 PM, Victor Stinner wrote: > Hi, > > I would like to write yet another JIT compiler for CPython. Before > writing anything, I would like your opinion, because I don't know the > other Python compilers well. I also want to prepare a possible integration > into CPython from the beginning of the project, or at least stay very > close to the CPython project (and CPython developers!). I did not > understand exactly why the Unladen Swallow and psyco projects failed, so > please tell me if you think that my project is going to fail too! > > > == Why? == > > CPython is still the reference implementation; new features are first > added to this implementation (ex: PyPy does not support Python 3 yet, > but there is a project to support Python 3). Some projects still rely > on low-level properties of CPython, especially its C API (ex: numpy; > PyPy has a cpyext module to emulate the CPython C API). > > A JIT is the most promising solution to speed up the main evaluation > loop: using a JIT, it is possible to compile a function for a specific > type on the fly and so enable deeper optimizations. > > psyco is no longer maintained. It had its own JIT, which is complex to > maintain. For example, it is hard to port it to new hardware. > > LLVM is fast and the next version will be faster. LLVM has a > community, documentation, a lot of tools, and is active. > > There are many Python compilers which are very fast, but most of them > only support a subset of Python or require modifying the code (ex: > specifying the type of all parameters and variables). For example, you > cannot run Django with Shedskin. > > IMO PyPy is complex and hard to maintain. PyPy has a design completely > different from CPython's, and it is much faster and has a better memory > footprint. I don't expect to be as fast as PyPy, just faster than > CPython. > > > == General idea == > > I don't want to replace CPython.
This is an important point. All > other Python compilers try to write something completely new, which is > a huge task and makes it hard to stay compatible with CPython. I would > like to reuse as much CPython code as possible, and not try to > fight against the GIL or reference counting, but try to cooperate > instead. > > I would like to use a JIT to generate specialized functions for a > combination of argument types. Specialization enables more > optimizations. I would like to use LLVM because LLVM is an active > project, has many developers and users, is fast, and the next version > will be faster! LLVM already supports common optimizations like > inlining. > > My idea is to emit the same code as ceval.c from the bytecode, to be > fully compatible with CPython, and then write a JIT to optimize > functions for a specific type. > > > == Roadmap == > > -- Milestone 1: Proof of concept -- > > * Use the bytecode produced by the CPython parser and compiler > * Only compile a single function > * Emit the same code as ceval.c using LLVM, but without tracing, > exceptions, or signal handling (they will be added later) > * Support compiling and calling the following function: "def func(a, > b): return a+b" > > The pymothoa project can be used as a base to quickly implement such > a proof of concept. > > -- Milestone 2: Specialized function for the int type -- > > * Use type annotations to generate specialized functions for the int type > * Use a C int with a guard detecting integer overflow to fall back on Python int > > -- Milestone 3: JIT -- > > * Depending on the types seen at runtime, recompile the function to > generate specialized functions > * Use a guard to fall back to a generic implementation if the type is > not the expected type > * Maybe drop the code using function annotations > > At this point, we can start benchmarking to check whether the (JIT) compiler > is faster than CPython.
> > -- Later (unsorted ideas) -- > > * Support exceptions > * Full support of Python > > - classes > - list comprehension > - etc. > > * Optimizations: > > - avoid reference counting when possible > - avoid temporary objects when possible > - release the GIL when possible > - inlining: should be very interesting with list comprehension > - unroll loops? > - lazy creation of the frame? > > * Use registers instead of a stack in the "evaluation loop"? > * Add code to allow tracing and profiling > * Add code to handle signals (pending calls) > * Write a compiler using the AST, with a fallback to the bytecode? > (would it be faster? easier or more complex to maintain?) > * Test LLVM optimizers > * Compile a whole module or even a whole program > * Reduce memory footprint > * Type annotation to help the optimizer? (with guards?) > * "const" annotation to help the optimizer? (with guards?) > * Support any build option of Python: > > - support Python 2 (2.5, 2.6, 2.7) and 3 (3.1, 3.2, 3.3, 3.4) > - support narrow and wide mode: flag at runtime? > - support debug and release mode: flag at runtime? > - support 32 and 64 bits mode on Windows? > > > == Other Python VM and compilers == > > -- Fully Python compliant -- > > * `PyPy`_ > * `Jython`_ based on the JVM > * `IronPython`_ based on the .NET VM > * `Unladen Swallow`_ fork > of CPython 2.6 using LLVM > > - `Unladen Swallow Retrospective > `_ > - `PEP 3146`_ > > * `psyco`_ (fully Python compliant?), > no more maintained > > -- Subset of Python to C++ -- > > * `Nuitka`_ > * `Python2C`_ > * `Shedskin`_ > * `pythran`_ (no > class, set, dict, exception, file handling, ...) > > -- Subset of Python -- > > * `pymothoa`_: use LLVM; don't > support classes nor exceptions. 
> * `unpython`_: Python to C > * `Perthon`_: Python to Perl > * `Copperhead`_: Python to GPU (Nvidia) > > -- Language very close to Python -- > > * `Cython`_: "Cython is a programming > language based on Python, with extra syntax allowing for optional > static type declarations." Based on `Pyrex > `_ > > > == See also == > > * `Volunteer developed free-threaded cross platform virtual machines? > `_ > > Victor Stinner > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/d.s.seljebotn%40astro.uio.no From amauryfa at gmail.com Tue Jul 17 21:16:30 2012 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Tue, 17 Jul 2012 21:16:30 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: Hi, 2012/7/17 Victor Stinner : > -- Milestone 3: JIT -- > > * Depending on the type seen at runtime, recompile the function to > generate specialized functions > * Use guard to fallback to a generic implementation if the type is > not the expected type >From my understanding, psyco did exactly this. -- Amaury Forgeot d'Arc From fijall at gmail.com Tue Jul 17 21:16:32 2012 From: fijall at gmail.com (Maciej Fijalkowski) Date: Tue, 17 Jul 2012 21:16:32 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: Hi Victor. I'm willing to explain to you details why having LLVM does not solve almost any issues and why PyPy is complex, or why you think it's complex. Find me on IRC if you want (fijal, can be found on #pypy on freenode for example). In our opinion something like psyco that gets brought to the levels of speed of pypy would be massively more complex than PyPy, most importantly it would be incredibly fragile. It's possible, but it's lots and lots of work. I don't think it possibly can be done with one person. 
Speaking about compatible with cpython and yet fast - I would strongly recommend talking to Mark Shannon (the author of HotPy). He's by far the best person who can answer some questions and have a rough plan how to go forward. It would be much better to concentrate efforts rather than write yet another half-finished JIT (because reading code is hard). Cheers, fijal -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan_ml at behnel.de Tue Jul 17 21:28:41 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 17 Jul 2012 21:28:41 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: Maciej Fijalkowski, 17.07.2012 21:16: > It would be much better to concentrate efforts rather than write > yet another half-finished JIT (because reading code is hard). +1 Stefan From glyph at twistedmatrix.com Tue Jul 17 21:43:33 2012 From: glyph at twistedmatrix.com (Glyph) Date: Tue, 17 Jul 2012 12:43:33 -0700 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: On Jul 17, 2012, at 11:38 AM, Victor Stinner wrote: > IMO PyPy is complex and hard to maintain. PyPy has a design completly > different than CPython and is much faster and has a better memory > footprint. I don't expect to be as fast as PyPy, just faster than > CPython. I think this criticism is misguided. Let's grant for the moment that you're right, and PyPy is complex and hard to maintain. If a high-level Python parser and JIT compiler written in Python came out as complex and unmaintainable, why do you believe that they'll be easy to write in C? You are correct that it has a different architecture than CPython: it has a different architecture because CPython's architecture is limiting because of its simplicity and makes it difficult to do things like write JIT compilers. The output of the Unladen Swallow project was illuminating in that regard. 
(Please note I said "output" and not "failure", the Unladen Swallow folks did the community a great service and produced many useful artifacts, even if they didn't meet their original goal.) Polluting the straightforward, portable architecture of CPython with significant machine-specific optimizations to bolt on extra features that are already being worked on elsewhere seems like a waste of effort to me. You could, instead, go work on documenting PyPy's architecture so it seems less arcane to newcomers. Some of the things in there which look like hideous black magic are actually fairly straightforward when explained, as I have learned by being lucky enough to receive explanations in person from Maciej, Benjamin and Alex at various conferences. I mean, don't get me wrong, if this worked out, I'd love a faster CPython, I do still use use many tools which don't support PyPy yet, so I can see the appeal of greater runtime compatibility with CPython than CPyExt offers. I just think that it will end up being a big expenditure of effort for relatively little return. If you disagree, you should feel no need to convince me; just go do it and prove me wrong, which I will be quite happy to be. I would just like to think about whether this is the best use of your energy first. But definitely listen to Maciej's suggestion about concentrating efforts with other people engaged in similar efforts, regardless :). As your original message shows, there has already been enough duplication of effort in this area. -glyph -------------- next part -------------- An HTML attachment was scrubbed... URL: From martin at v.loewis.de Wed Jul 18 00:04:53 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Wed, 18 Jul 2012 00:04:53 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> > I would like to write yet another JIT compiler for CPython. FWIW, so do I. 
> I did not > understand exactly why Unladen Swallow and psyco projects failed, so > please tell me if you think that my project is going to fail too! It may well happen that your project fails, or doesn't even start. Mine didn't start for the last two years (but now may eventually do start). I'm not sure psyco really "failed"; if it did, it was because of PyPy: PyPy was created to do the same stuff as psyco, just better. It was abandoned in favor of PyPy - whether that's a failure of psyco, I don't know. IMO, the psyco implementation itself failed because it was unmaintainable, containing very complicated code that nobody but its authors could understand. Also, I know for a fact that Unladen Swallow (the project) didn't fail; some interesting parts were contributed to Python and are now part of its code base. It's the JIT compiler of Unladen Swallow that "failed"; in my understanding because LLVM is crap (i.e. it is slow, memory-consuming, and buggy) - as a low-level virtual machine; it may be ok as a compiler backend (but I still think it is buggy there as well). > psyco is no more maintained. I think this is factually incorrect: Christian Tismer maintains it (IIUC). > I would like to use a JIT to generate specialized functions for a > combinaison of arguments types. I think history has moved past specializing JITs. Tracing JITs are the status quo; they provide specialization as a side effect. Regards, Martin From martin at v.loewis.de Wed Jul 18 00:12:26 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Wed, 18 Jul 2012 00:12:26 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: <20120718001226.Horde.kvoaR6GZi1VQBeNKvDRTJ7A@webmail.df.eu> > If you disagree, you should feel no need to convince me; just go do > it and prove me wrong, which I will be quite happy to be. I would > just like to think about whether this is the best use of your energy > first. 
While I follow most of your reasoning, I think this is a flaw in your logic. This is free software: the only person to decide where energy is best used is the person providing the energy. It may well be that Victor gives up after the first three steps, or it may be that he comes back with a working prototype in August. He may well find that his energy is *best* spent in this project, since it may get him a highly-payed job, a university diploma, or reputation. If not that, he'll learn a lot. > But definitely listen to Maciej's suggestion about concentrating > efforts with other people engaged in similar efforts, regardless :). Again, this thinking is flawed, IMO. It might be in the community's interest if people coordinate, but not in the interest of the individual contributor. > As your original message shows, there has already been enough > duplication of effort in this area. And that's not really a problem, IMO. Regards, Martin From amauryfa at gmail.com Wed Jul 18 00:14:36 2012 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Wed, 18 Jul 2012 00:14:36 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: 2012/7/18 : > > I would like to write yet another JIT compiler for CPython. > > FWIW, so do I. I don't know whether it's good news (that Martin wants to put his expertise in this area) or a bad sign (that he did not start after so many years of Python development - the problem becomes more and more difficult each time one thinks about it) -- Amaury Forgeot d'Arc From victor.stinner at gmail.com Wed Jul 18 00:15:53 2012 From: victor.stinner at gmail.com (Victor Stinner) Date: Wed, 18 Jul 2012 00:15:53 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? 
In-Reply-To: References: Message-ID: > Personally, I like the idea of having a JIT compiler more or less as an > extension module at hand. Sort-of like a co-processor, just in software. > Lets you run your code either interpreter or JITed, just as you need. Me too, so something like psyco. LLVM is written in C++ and may have licensing issues, so I don't really want to add an LLVM dependency to CPython. For an experimental project, a third-party module is also more convenient. Victor From victor.stinner at gmail.com Wed Jul 18 00:20:01 2012 From: victor.stinner at gmail.com (Victor Stinner) Date: Wed, 18 Jul 2012 00:20:01 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: > It's the JIT compiler of Unladen Swallow that "failed"; in > my understanding because LLVM is crap (i.e. it is slow, memory-consuming, > and buggy) - as a low-level virtual machine; it may be ok as a compiler > backend (but I still think it is buggy there as well). What is the status of LLVM nowadays? Is it not a good solution to write a portable JIT? I don't want to write my own library to generate machine code. >> psyco is no more maintained. > > I think this is factually incorrect: Christian Tismer maintains it (IIUC). http://psyco.sourceforge.net/ says: "News, 12 March 2012 Psyco is unmaintained and dead. Please look at PyPy for the state-of-the-art in JIT compilers for Python." Victor From ncoghlan at gmail.com Wed Jul 18 00:28:11 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 18 Jul 2012 08:28:11 +1000 Subject: [Python-Dev] A new JIT compiler for a faster CPython?
In-Reply-To: References: Message-ID: On the cpyext front, it would be rather helpful if developers interested in a high speed Python interpreter with good C extension compatibility worked with Dave Malcolm on his static analyser for Python C extensions. One of the reasons cpyext has trouble is that many refcounting bugs in extensions aren't fatal on CPython?s due to additional internal references - a refcount of 1 when it should be 2 is survivable in a way that 0 vs 1 is not. Get rid of that drudgery from hacking on cpyext and it becomes significantly easier to expand the number of extensions that will work across multiple implementations of the API. Cheers, Nick. -- Sent from my phone, thus the relative brevity :) -------------- next part -------------- An HTML attachment was scrubbed... URL: From jeanpierreda at gmail.com Wed Jul 18 00:38:59 2012 From: jeanpierreda at gmail.com (Devin Jeanpierre) Date: Tue, 17 Jul 2012 18:38:59 -0400 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: On Tue, Jul 17, 2012 at 6:20 PM, Victor Stinner wrote: > What is the status of LLVM nowadays? Is it not a good solution to > write a portable JIT? > > I don't want to write my own library to generate machine code. You don't have to, even if you don't want to use LLVM. There are plenty of "ligher-weight" approaches to that. For example, GNU Lightning [1] or sljit [2]. [1] http://www.gnu.org/software/lightning/ [2] http://sljit.sourceforge.net/ -- Devin From amauryfa at gmail.com Wed Jul 18 00:49:18 2012 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Wed, 18 Jul 2012 00:49:18 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? 
In-Reply-To: References: Message-ID: 2012/7/18 Nick Coghlan : > On the cpyext front, it would be rather helpful if developers interested in > a high speed Python interpreter with good C extension compatibility worked > with Dave Malcolm on his static analyser for Python C extensions. One of the > reasons cpyext has trouble is that many refcounting bugs in extensions > aren't fatal on CPython?s due to additional internal references - a refcount > of 1 when it should be 2 is survivable in a way that 0 vs 1 is not. It's not only about bugs. Even when reference counts are correctly managed, cpyext is slow: - each time an object crosses the C|pypy boundary, there is a dict lookup (!) - each time a new object is passed or returned to C, a PyObject structure must be allocated (and sometime much more, specially for strings and types). Py_DECREF will of course free the PyObject, so next time will allocate the object again. - borrowed references are a nightmare. > Get rid of that drudgery from hacking on cpyext and it becomes significantly > easier to expand the number of extensions that will work across multiple > implementations of the API. There are also some extension modules that play tricky games with the API; PyQt for example uses metaclasses with a custom tp_alloc slot, to have access to the PyTypeObject structure during the construction of the type... The Python C API is quite complete, but some use cases are still poorly supported. -- Amaury Forgeot d'Arc From yselivanov.ml at gmail.com Wed Jul 18 01:07:50 2012 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Tue, 17 Jul 2012 19:07:50 -0400 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: <02FB69A0-6FCE-4528-B88A-1B68DDDA4532@gmail.com> On 2012-07-17, at 6:38 PM, Devin Jeanpierre wrote: > On Tue, Jul 17, 2012 at 6:20 PM, Victor Stinner > wrote: >> What is the status of LLVM nowadays? 
Is it not a good solution to >> write a portable JIT? >> >> I don't want to write my own library to generate machine code. > > You don't have to, even if you don't want to use LLVM. There are > plenty of "lighter-weight" approaches to that. For example, GNU > Lightning [1] or sljit [2]. > > [1] http://www.gnu.org/software/lightning/ > [2] http://sljit.sourceforge.net/ And, there is also DynASM [1], [2]. This one was built for LuaJIT and is under the MIT licence. [1] http://luajit.org/dynasm.html [2] https://github.com/LuaDist/luajit/tree/master/dynasm - Yury From victor.stinner at gmail.com Wed Jul 18 01:09:23 2012 From: victor.stinner at gmail.com (Victor Stinner) Date: Wed, 18 Jul 2012 01:09:23 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: > As your original message shows, there has already been > enough duplication of effort in this area. I haven't yet found a project reusing ceval.c: most projects implement their own eval loop and don't use CPython at all. My idea is not to write something new, but just to try to optimize the existing ceval.c code. Pseudo-code: * read the bytecode of a function * replace each bytecode by its "C code" * optimize * compile the "C code" to machine code (I don't know if "C code" is the right expression here, it's just for the example) Dummy example: ---- def mysum(a, b): return a+b ---- Python compiles it to bytecode as: ---- >>> dis.dis(mysum) 0 LOAD_FAST 0 (a) 3 LOAD_FAST 1 (b) 6 BINARY_ADD 7 RETURN_VALUE ---- The bytecode can be compiled to something like: ---- x = GETLOCAL(0); # "a" if (x == NULL) /* error */ Py_INCREF(x); PUSH(x); x = GETLOCAL(1); # "b" if (x == NULL) /* error */ Py_INCREF(x); PUSH(x); w = POP(); v = TOP(); x = PyNumber_Add(v, w); Py_DECREF(v); Py_DECREF(w); if (x == NULL) /* error */ SET_TOP(x); retval = POP(); return retval; ---- The calls to Py_INCREF() and Py_DECREF() can be removed. The code is no longer based on a loop: CPUs prefer sequential code.
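To make the translation pass concrete, here is a tiny illustrative sketch (the C templates are hypothetical, not real ceval.c code) that unrolls the instruction sequence shown by dis.dis() above into straight-line pseudo-C:

```python
# Illustrative only: map a few opcodes to hypothetical C templates, then
# unroll an instruction list into sequential code, one line per opcode.
TEMPLATES = {
    "LOAD_FAST": "x = GETLOCAL({arg}); Py_INCREF(x); PUSH(x);",
    "BINARY_ADD": "w = POP(); v = POP(); PUSH(PyNumber_Add(v, w));",
    "RETURN_VALUE": "return POP();",
}

def translate(instructions):
    """Unroll (opname, arg) pairs into straight-line pseudo-C."""
    lines = []
    for opname, arg in instructions:
        template = TEMPLATES.get(opname)
        if template is None:
            # Unsupported opcode: a real compiler would fall back to ceval.c
            raise NotImplementedError(opname)
        lines.append(template.format(arg=arg))
    return lines

# The dis output of mysum() above, written as (opname, arg) pairs:
mysum_code = [
    ("LOAD_FAST", 0),      # a
    ("LOAD_FAST", 1),      # b
    ("BINARY_ADD", None),
    ("RETURN_VALUE", None),
]
for line in translate(mysum_code):
    print(line)
```

A real pass would of course generate LLVM IR rather than text, but the structure is the same.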
The stack can be replaced by variables: the compiler (LLVM?) knows how to replace many variables with a few variables, or even use CPU registers instead. Example: ---- a = GETLOCAL(0); # "a" if (a == NULL) /* error */ b = GETLOCAL(1); # "b" if (b == NULL) /* error */ return PyNumber_Add(a, b); ---- I don't expect to run a program 10x faster, but I would be happy if I can run arbitrary Python code 25% faster. -- Specialization / tracing JIT can be seen as another project, or at least added later. Victor From breamoreboy at yahoo.co.uk Wed Jul 18 01:10:33 2012 From: breamoreboy at yahoo.co.uk (Mark Lawrence) Date: Wed, 18 Jul 2012 00:10:33 +0100 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: On 17/07/2012 23:20, Victor Stinner wrote: > > http://psyco.sourceforge.net/ says: > > "News, 12 March 2012 > > Psyco is unmaintained and dead. Please look at PyPy for the > state-of-the-art in JIT compilers for Python." > > Victor > A search on pypi for JIT compilers gives no matches. -- Cheers. Mark Lawrence. From larry at hastings.org Wed Jul 18 01:13:33 2012 From: larry at hastings.org (Larry Hastings) Date: Tue, 17 Jul 2012 16:13:33 -0700 Subject: [Python-Dev] Use function names instead of functions for os.supports_dir_fd? In-Reply-To: References: Message-ID: <5005F19D.80707@hastings.org> On 07/17/2012 10:34 AM, Victor Stinner wrote: > Python 3.3 introduced os.supports_dir_fd to check if some os functions > do accept a file descriptor instead of a path. The problem is that > os.supports_dir_fd is a list of functions, not a list of function > names. If os functions are monkey patched, you cannot test anymore if > a function supports file descriptor. If you're monkey-patching the function, you can monkey-patch os.supports_dir_fd too. > Monkey patching is a common practice in Python. test_os.py replaces > os.exec*() functions temporary for example. For testing, yes.
It's not recommended for production code. //arry/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve at pearwood.info Wed Jul 18 01:34:15 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Wed, 18 Jul 2012 09:34:15 +1000 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: <5005F677.20906@pearwood.info> Victor Stinner wrote: > == Other Python VM and compilers == As far as I know, these are all still active, although possibly experimental: Pynie (Python for the Parrot VM) WPython (16-bit word-codes instead of byte-codes) HotPy (high-performance optimizing VM for Python) Skulpt (Javascript implementation) HoPe (Python in Haskell) Berp (another Python in Haskell) WPython in particular seems to be very promising, and quite fast. I don't understand why it doesn't get more attention (although I admit I can't criticise, since I haven't installed or used it myself). http://www.pycon.it/media/stuff/slides/beyond-bytecode-a-wordcode-based-python.pdf In the Java world, there are byte-code optimizers such as Soot, BLOAT and ProGuard which apparently can speed up Java significantly. As far as I can tell, in the Python world byte-code optimization is a severely neglected area. For good reason? No idea. -- Steven From steve at pearwood.info Wed Jul 18 01:45:28 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Wed, 18 Jul 2012 09:45:28 +1000 Subject: [Python-Dev] Use function names instead of functions for os.supports_dir_fd? In-Reply-To: References: Message-ID: <5005F918.1050708@pearwood.info> Victor Stinner wrote: > Hi, > > Python 3.3 introduced os.supports_dir_fd to check if some os functions > do accept a file descriptor instead of a path. The problem is that > os.supports_dir_fd is a list of functions, not a list of function > names. If os functions are monkey patched, you cannot test anymore if > a function supports file descriptor. One of the dangers of monkey-patching. 
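The failure mode is easy to demonstrate (a minimal sketch; `patched_stat` is a made-up wrapper): once `os.stat` is rebound, the capability test fails even though dir_fd support is unchanged underneath.

```python
import os

# os.supports_dir_fd is a set of the *original* function objects, so
# membership is effectively an identity test against those exact objects.
original_stat = os.stat

def patched_stat(*args, **kwargs):
    # hypothetical monkey-patch: do something extra, then delegate
    return original_stat(*args, **kwargs)

os.stat = patched_stat
try:
    # The wrapper was never added to the set, so this is always False,
    # regardless of whether the platform actually supports dir_fd.
    print(os.stat in os.supports_dir_fd)  # prints False
finally:
    os.stat = original_stat  # undo the patch
```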
> Monkey patching is a common practice in Python. test_os.py replaces > os.exec*() functions temporary for example. Perhaps for testing, but I don't think monkey-patching is common in production code. Perhaps you are thinking of Ruby :) > It's also inconsistent with the new time.get_clock_info() function > which expects the name of a time function, not the function directly. Since functions are first-class objects in Python, and people should be used to passing functions around as parameters, perhaps it is better to say that get_clock_info is inconsistent with supports_dir_fd. Personally, I prefer passing function objects rather than names, since the *name* of the function shouldn't matter. But since I recognise that other people may think differently, I would probably support passing both the name or the function object itself. -- Steven From larry at hastings.org Wed Jul 18 01:55:23 2012 From: larry at hastings.org (Larry Hastings) Date: Tue, 17 Jul 2012 16:55:23 -0700 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: <5005F677.20906@pearwood.info> References: <5005F677.20906@pearwood.info> Message-ID: <5005FB6B.80304@hastings.org> On 07/17/2012 04:34 PM, Steven D'Aprano wrote: > As far as I know, these are all still active, although possibly > experimental: > > Pynie (Python for the Parrot VM) > WPython (16-bit word-codes instead of byte-codes) > [...] > WPython in particular seems to be very promising, and quite fast. I > don't understand why it doesn't get more attention (although I admit I > can't criticise, since I haven't installed or used it myself). Cesar (sp?) was at Mark's talk on HotPy at EuroPython. We asked him if WPython was still active, and he said, nope, no community interest. IIRC Pynie is basically dead too. I don't know about the others, //arry/ -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From victor.stinner at gmail.com Wed Jul 18 02:26:14 2012 From: victor.stinner at gmail.com (Victor Stinner) Date: Wed, 18 Jul 2012 02:26:14 +0200 Subject: [Python-Dev] Use function names instead of functions for os.supports_dir_fd? In-Reply-To: <5005F918.1050708@pearwood.info> References: <5005F918.1050708@pearwood.info> Message-ID: >> Monkey patching is a common practice in Python. test_os.py replaces >> os.exec*() functions temporary for example. > > Perhaps for testing, but I don't think monkey-patching is common in > production code. Perhaps you are thinking of Ruby :) The gevent library does monkey-patch os.fork (and time.sleep and many other functions), but gevent is maybe not ready for production? :-) Victor From alex.gaynor at gmail.com Wed Jul 18 03:24:49 2012 From: alex.gaynor at gmail.com (Alex Gaynor) Date: Wed, 18 Jul 2012 01:24:49 +0000 (UTC) Subject: [Python-Dev] A new JIT compiler for a faster CPython? References: Message-ID: Victor Stinner gmail.com> writes: > Example: > ---- > a = GETLOCAL(0); # "a" > if (a == NULL) /* error */ > b = GETLOCAL(1); # "b" > if (b == NULL) /* error */ > return PyNumber_Add(a, b); > ---- > > I don't expect to run a program 10x faster, but I would be happy if I > can run arbitrary Python code 25% faster. > > -- > > Specialization / tracing JIT can be seen as another project, or at > least added later. > > Victor > This is almost exactly what Unladen Swallow originally did. First, LLVM will not do all of the optimizations you are expecting it to do out of the box. It will still have all the stack accesses, and it will have all of the ref counting operations. You can get a small speed boost from removing the interpretation dispatch overhead, but you also explode your memory usage, and the speedups are tiny. Please, learn from Unladen Swallow and other's experiences, otherwise they're for naught, and frankly we (python-dev) waste a lot of time. 
Alex From stefan_ml at behnel.de Wed Jul 18 06:23:02 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 18 Jul 2012 06:23:02 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: Victor Stinner, 18.07.2012 00:15: >> Personally, I like the idea of having a JIT compiler more or less as an >> extension module at hand. Sort-of like a co-processor, just in software. >> Lets you run your code either interpreter or JITed, just as you need. > > Me too, so something like psyco. In the sense that it's a third party module, yes. Not in the sense of how it hooks into the runtime. The intention would be that users explicitly run their code in a JIT compiled environment, e.g. their template processing or math code. The runtime wouldn't switch to a JIT compiler automatically for "normal" code. I mean, that could still become a feature at some point, but I find a decorator or an exec-like interface quite acceptable, as long as it fails loudly with "can't do that" if the JIT compiler doesn't support a specific language feature. Stefan From stefan_ml at behnel.de Wed Jul 18 07:14:25 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 18 Jul 2012 07:14:25 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: Alex Gaynor, 18.07.2012 03:24: > Victor Stinner writes: >> Example: >> ---- >> a = GETLOCAL(0); # "a" >> if (a == NULL) /* error */ >> b = GETLOCAL(1); # "b" >> if (b == NULL) /* error */ >> return PyNumber_Add(a, b); >> ---- >> >> I don't expect to run a program 10x faster, but I would be happy if I >> can run arbitrary Python code 25% faster. >> >> -- >> >> Specialization / tracing JIT can be seen as another project, or at >> least added later. > > This is almost exactly what Unladen Swallow originally did. First, LLVM will > not > do all of the optimizations you are expecting it to do out of the box. 
It will > still have all the stack accesses, and it will have all of the ref counting > operations. You can get a small speed boost from removing the interpretation > dispatch overhead, but you also explode your memory usage, and the speedups are > tiny. My experience with Cython tells me that even if you move the entire interpretation overhead out of the way, you'd only get some 5-20% speedup for real code, rarely more if you have some really tight loops. Adding a full-blown JIT compiler to the dependencies just for that is usually not worth it, and Unladen Swallow succeeded in showing that pretty clearly. It's when you start specialising and optimising code patterns that it becomes really interesting, but you can do that statically at build time or compile time in most cases (at least in the more interesting ones) and Cython is one way to do it. Again, no need to add a JIT compiler. The nice thing about JIT compilers is that you can give them your code and they'll try to optimise it for you without further interaction. That doesn't mean you get the fastest code ever, it just means that they do all the profiling for you and try to figure it out all by themselves. That may or may not work out, but it usually works quite ok (and you'll love JIT compilers for it) and only rarely gets seriously in the way (and that's when you'll hate JIT compilers). However, it requires that the JIT compiler knows about a lot of optimisations. PyPy's JIT is full of those. It's not the fact that it has a JIT compiler at all that makes it fast and not the fact that they compile Python to machine code, it's the fact that they came up with a huge bunch of specialisations that makes lots of code patterns fast once it detected them. LLVM (or any other low-level JIT compiler) won't help at all with that. 
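[An illustrative aside added in editing, not part of the original thread; all names below are invented.] Stefan's point about specialising on detected code patterns can be shown in miniature with a monomorphic inline cache: the first call through a call site pays for the full generic lookup, and later calls with the same receiver type skip it entirely.

```python
class CachedMethodCall:
    """A monomorphic inline cache for one call site (toy sketch only)."""

    def __init__(self, name):
        self.name = name
        self.cached_type = None   # receiver type seen on the last miss
        self.cached_func = None   # the function looked up for that type

    def __call__(self, obj, *args):
        tp = type(obj)
        if tp is self.cached_type:
            # Cache hit: call the remembered function, no lookup at all.
            return self.cached_func(obj, *args)
        # Cache miss: do the generic lookup once and remember the result.
        func = getattr(tp, self.name)
        self.cached_type, self.cached_func = tp, func
        return func(obj, *args)

call_upper = CachedMethodCall("upper")
call_upper("abc")   # miss: looks up str.upper and caches it
call_upper("def")   # hit: reuses the cached str.upper directly
```

A real specialising runtime does this (and far more) in the interpreter or JIT itself rather than in Python, but the shape of the win is the same: once a pattern has been observed, the generic machinery is bypassed.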
Stefan From cesare.di.mauro at gmail.com Wed Jul 18 07:50:23 2012 From: cesare.di.mauro at gmail.com (Cesare Di Mauro) Date: Wed, 18 Jul 2012 07:50:23 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: 2012/7/18 Victor Stinner > I don't expect to run a program 10x faster, but I would be happy if I > can run arbitrary Python code 25% faster. > If that's your target, you don't need to resort to a bytecode-to-binary-equivalent compiler. WPython already gave similar results with Python 2.6. The idea behind it is that with a hybrid stack-register VM, you spend less time on the ceval loop's "constant stuff" (checking for events / GIL release, etc.). That's because superinstructions aggregate several bytecodes into a single "wordcode", which requires only one decoding phase, avoids many pushes/pops, and drops some unnecessary incref/decref reference counting. A better peephole optimizer is provided, and some other optimizations as well. There's also room for more optimizations. I have many ideas to improve either WPython or just the ceval loop. For example, at the last EuroPython sprint I was working on a ceval optimization that gave about a 10% speed improvement on the CPython 3.3 beta trunk (on my old MacBook Air, running the 32-bit Windows 8 preview), but it still needs to be checked for correctness (I'm spending much more time running and checking the standard tests than on its implementation ;-) In the end, I think that a lot can be done to improve the good old CPython VM, without resorting to a JIT compiler. Lack of time is the enemy... Regards, Cesare > -- > > Specialization / tracing JIT can be seen as another project, or at > least added later.
> > Victor > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > http://mail.python.org/mailman/options/python-dev/cesare.di.mauro%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From martin at v.loewis.de Wed Jul 18 07:53:14 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Wed, 18 Jul 2012 07:53:14 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: <20120718075314.Horde.ty5wC7uWis5QBk9Krz1hyUA@webmail.df.eu> > What is the status of LLVM nowadays? Is it not a good solution to > write a portable JIT? I don't think it is. It is still slow and memory hungry. The fact that the version that Apple ships with Xcode still miscompiles Python 3.3 tells me that it is still buggy. > I don't want to write my own library to generate machine code. I plan to use nanojit. Regards, Martin From martin at v.loewis.de Wed Jul 18 07:55:01 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Wed, 18 Jul 2012 07:55:01 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: <20120718075501.Horde.GV5ecLuWis5QBk_1SHahyAA@webmail.df.eu> Zitat von Mark Lawrence : > On 17/07/2012 23:20, Victor Stinner wrote: >> >> http://psyco.sourceforge.net/ says: >> >> "News, 12 March 2012 >> >> Psyco is unmaintained and dead. Please look at PyPy for the >> state-of-the-art in JIT compilers for Python." >> >> Victor >> > > A search on pypi for JIT compilers gives no matches. I think you misread: PyPy, not pypi. 
Regards, Martin From martin at v.loewis.de Wed Jul 18 07:56:45 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Wed, 18 Jul 2012 07:56:45 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: <20120718075645.Horde.57dfEruWis5QBlAduIbRwiA@webmail.df.eu> > Please, learn from Unladen Swallow and other's experiences, otherwise they're > for naught, and frankly we (python-dev) waste a lot of time. Again: we (python-dev) won't waste much time (unless we chose to in discussions); Victor may lose time, but then he may not. Regards, Martin From cesare.di.mauro at gmail.com Wed Jul 18 08:05:29 2012 From: cesare.di.mauro at gmail.com (Cesare Di Mauro) Date: Wed, 18 Jul 2012 08:05:29 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: <5005F677.20906@pearwood.info> References: <5005F677.20906@pearwood.info> Message-ID: 2012/7/18 Steven D'Aprano > WPython in particular seems to be very promising, and quite fast. I don't > understand why it doesn't get more attention (although I admit I can't > criticise, since I haven't installed or used it myself). > > > http://www.pycon.it/media/stuff/slides/beyond-bytecode-a-wordcode-based-python.pdf Yes, that was the reason that brought me to stop the project: lack of interest from python community. But at the last EuroPython I had the opportunity to talk to Guido, so I think that I can try to port WPython (and check for some ideas). However the fault with WPython was mine: it wasn't a simple patch, so it was very difficult to review. My bad. In the Java world, there are byte-code optimizers such as Soot, BLOAT and > ProGuard which apparently can speed up Java significantly. As far as I can > tell, in the Python world byte-code optimization is a severely neglected > area. For good reason? No idea. > > -- > Steven > I think that Python case is different. 
You can't spend so much time optimizing the generated code, because usually the code is compiled at execution time. Startup time is an issue for Python, which is influenced so much by the source-to-bytecode compilation. Java is statically compiled, and then executed. So you can think about using better optimizers before the code finally runs. Regards, Cesare -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan_ml at behnel.de Wed Jul 18 08:27:14 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 18 Jul 2012 08:27:14 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: <20120718075314.Horde.ty5wC7uWis5QBk9Krz1hyUA@webmail.df.eu> References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> <20120718075314.Horde.ty5wC7uWis5QBk9Krz1hyUA@webmail.df.eu> Message-ID: martin at v.loewis.de, 18.07.2012 07:53: > [Victor Stinner] >> I don't want to write my own library to generate machine code. > > I plan to use nanojit. As I said, generating machine code is the uninteresting part of it and won't give you much of a win. The changes you make to the way the code works while walking the path from seeing the Python code to generating machine code are what make the difference. You could even skip the machine code generation altogether and just go with triggering pre-implemented high-level patterns from the interpreter. If you code up the right patterns, your win would be bigger than with a bare 1:1 mapping from Python code to machine code. Both Cython and WPython are clear examples of that. Stefan From ncoghlan at gmail.com Wed Jul 18 08:50:24 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 18 Jul 2012 16:50:24 +1000 Subject: [Python-Dev] Use function names instead of functions for os.supports_dir_fd?
In-Reply-To: References: <5005F918.1050708@pearwood.info> Message-ID: On Wed, Jul 18, 2012 at 10:26 AM, Victor Stinner wrote: >>> Monkey patching is a common practice in Python. test_os.py replaces >>> os.exec*() functions temporary for example. >> >> Perhaps for testing, but I don't think monkey-patching is common in >> production code. Perhaps you are thinking of Ruby :) > > The gevent library does monkey-patch os.fork (and time.sleep and many > other functions), but gevent is maybe not ready for production? :-) In many cases, it isn't. Quite often third party libraries require updates to be compatible with gevent's modifications (and if they make a blocking call to an API that gevent doesn't patch, then things really don't work). However, if you don't have third party dependencies that conflict with gevent, then gevent is a brilliant approach to scaling network IO bound applications. Monkey patching needs to be recognised for what it is: a fork of the monkey-patched project. It's a fork that happens at runtime, but it's a fork nonetheless. It comes with all the increased coupling that a fork implies, just without the distribution complexity of trying to have two versions of the monkey-patched project installed at the same time. In this case, it's a *good* thing that monkey-patching will change the answers from the query functions, because we *don't know* if any monkey-patched versions will support the additional arguments (in fact, they probably won't). So, unless the monkey-patching project: 1. Updates their monkey-patched versions to support the additional parameters; and 2. Correctly registers their patched versions in the query sets Then 3.3 will actually give the right answer by assuming that any monkey-patched versions don't support the new parameters. When they *do* support the API additions, then they're easy to register (by adding them to the relevant sets) when putting the monkey-patch in place. 
Using the functions directly also avoids the classic problem of triggering the fallback path *all* the time by making a typo in the query string (we actually had exactly that problem during the 3.3. development cycle when the initial hasattr() checks in shutil degraded gracefully when the previously separate functions were replaced with the dir_fd keyword argument support). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From breamoreboy at yahoo.co.uk Wed Jul 18 09:55:11 2012 From: breamoreboy at yahoo.co.uk (Mark Lawrence) Date: Wed, 18 Jul 2012 08:55:11 +0100 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: <20120718075501.Horde.GV5ecLuWis5QBk_1SHahyAA@webmail.df.eu> References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> <20120718075501.Horde.GV5ecLuWis5QBk_1SHahyAA@webmail.df.eu> Message-ID: On 18/07/2012 06:55, martin at v.loewis.de wrote: > > Zitat von Mark Lawrence : > >> On 17/07/2012 23:20, Victor Stinner wrote: >>> >>> http://psyco.sourceforge.net/ says: >>> >>> "News, 12 March 2012 >>> >>> Psyco is unmaintained and dead. Please look at PyPy for the >>> state-of-the-art in JIT compilers for Python." >>> >>> Victor >>> >> >> A search on pypi for JIT compilers gives no matches. > > I think you misread: PyPy, not pypi. > > Regards, > Martin > > No think about it I did misread, my apologies for time wasting :-( -- Cheers. Mark Lawrence. From nad at acm.org Wed Jul 18 10:55:13 2012 From: nad at acm.org (Ned Deily) Date: Wed, 18 Jul 2012 01:55:13 -0700 Subject: [Python-Dev] A new JIT compiler for a faster CPython? References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> <20120718075314.Horde.ty5wC7uWis5QBk9Krz1hyUA@webmail.df.eu> Message-ID: In article <20120718075314.Horde.ty5wC7uWis5QBk9Krz1hyUA at webmail.df.eu>, martin at v.loewis.de wrote: > I don't think it is. It is still slow and memory hungry. 
The fact that > the version that Apple ships with Xcode still miscompiles Python 3.3 > tells me that it is still buggy. Whether LLVM is suitable for a JIT is an interesting question but it's not LLVM per se that is the problem with compiling 3.3. Apple ships two C compiler chains with Xcode 4 for OS X 10.7, both of them are based on LLVM. It's the Apple transitional gcc-4.2 frontend with an old LLVM backend that is problematic (and not to be confused with the "pure" gcc-4.2 shipped with Xcode 3). That compiler was the default compiler for early releases of Xcode 4 and for building OS X 10.7. It has been frozen for a long time because Apple's efforts have been going into transitioning the OS X world to the new compiler: a clang frontend with a more current LLVM backend. The latest releases of Xcode 4 now use clang-llvm as the default and that's what we now use as a default for building Python 3.3 with Xcode 4. That transition will be complete with the imminent release of OS X 10.8 Mountain Lion when the whole OS is built with clang-llvm. The iOS world is already there. -- Ned Deily, nad at acm.org From fijall at gmail.com Wed Jul 18 11:10:32 2012 From: fijall at gmail.com (Maciej Fijalkowski) Date: Wed, 18 Jul 2012 11:10:32 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: > > > However, it requires that the JIT compiler knows about a lot of > optimisations. PyPy's JIT is full of those. It's not the fact that it has a > JIT compiler at all that makes it fast and not the fact that they compile > Python to machine code, it's the fact that they came up with a huge bunch > of specialisations that makes lots of code patterns fast once it detected > them. LLVM (or any other low-level JIT compiler) won't help at all with > that. 
> > Stefan > Very good point Stefan I would just like to add that a lot of those also require changes in the object model which might render changes in CPython C API (like the introduction of maps). Certainly you can't keep the current C structures, which will already break some code. Cheers, fijal -------------- next part -------------- An HTML attachment was scrubbed... URL: From fijall at gmail.com Wed Jul 18 11:14:33 2012 From: fijall at gmail.com (Maciej Fijalkowski) Date: Wed, 18 Jul 2012 11:14:33 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> <20120718075314.Horde.ty5wC7uWis5QBk9Krz1hyUA@webmail.df.eu> Message-ID: On Wed, Jul 18, 2012 at 8:27 AM, Stefan Behnel wrote: > martin at v.loewis.de, 18.07.2012 07:53: > > [Victor Stinner] > >> I don't want to write my own library to generate machine code. > > > > I plan to use nanojit. > > As I said, generating machine code is the uninteresting part of it and > won't give you much of a win. The changes you make to the way the code > works while walking the path from seeing the Python code to generating > machine code is what makes the difference. > > You could even skip the machine code generation all together and just go > with triggering pre-implemented high-level patterns from the interpreter. > If you code up the right patterns, your win would be bigger than with a > bare 1:1 mapping from Python code to machine code. Both Cython and WPython > are clear examples for that. > > Stefan > > It's uninteresting but it's completely necessary and it's still quite a bit of work. For the PyPy needs llvm failed to provide some features (besides being buggy), like dynamic patching of compiled assembler (you kind of need that for the tracing JIT when you discover new paths) or speed of execution. Cheers, fijal -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From mark at hotpy.org Wed Jul 18 11:45:25 2012 From: mark at hotpy.org (Mark Shannon) Date: Wed, 18 Jul 2012 10:45:25 +0100 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: Message-ID: <500685B5.2020507@hotpy.org> Some of my (reasonably well informed) opinions on this subject... The theory ---------- Don't think in terms of speeding up your program. Think in terms of reducing the time spent executing your program. Performance is improved by removing aspects of the execution overhead. In a talk I gave at EuroPython 2010(1), I divided the overhead into 5 parts: Interpretive Imprecise type information Parameter Handling & Call overhead Lookups (globals/builtins/attributes) Memory management (garbage collection) For optimising CPython, we cannot change the GC from ref-counting, but the other 4 apply, plus boxing and unboxing of floats and ints. Compilation (by which I assume people mean converting bytecodes to machine code) addresses only the first point (by definition). I worry that Victor is proposing to make the same mistake made by Unladen Swallow, which is to attack the interpretive overhead first, then attack the other overheads. This is the wrong way around. If you want good performance, JIT compilation should come last not first. Results from my PhD thesis(2) show that the original HotPy without any JIT compilation outperformed Unladen Swallow using JIT compilation. In other words, an optimising interpreter for Python will be faster than a naive JIT compiler. The optimised bytecode traces in an optimising interpreter are much better input for a JIT compiler than the original bytecodes. The practice ------------ If you want modest speedup for modest effort, then look at Cesare's WPython. Also take a look at Stefan Brunthaler's work on inline caching in an interpreter. If you want a larger speedup then you need to tackle most or all of the causes of execution overhead listed above. 
HotPy (version 2, a fork of CPython) aims to tackle all of these causes except the GC overhead. As far as I am aware, it is the only project that does so. Please take a look at www.hotpy.org for more information on HotPy. You can see my talk from EuroPython 2011(3) on the ideas behind it and from EuroPython 2012(4) on the current implementation. Finally, a defence of LLVM. LLVM is a quality piece of software. It may have some bugs, so does all software. The code-generation components are designed with static compilation in mind, so they do use a lot of memory and run slowly for a JIT compiler, but they produce excellent quality code. And don't forget the old saying about blaming your tools ;) If HotPy (version 2) were to have an (optional) JIT I would expect it to be LLVM based. The JIT can run in a separate thread, while the optimised code continues to run in the interpreter, patching in the machine code when it is complete. Cheers, Mark. 1) Talk at EuroPython 2010 Slides: www.dcs.gla.ac.uk/~marks/comparison.pdf Video: http://blip.tv/europythonvideos/mark_shannon-_hotpy_a_comparison-3999872 The information in the talk is a bit out of date; PyPy now includes out-of-line guards. 2) theses.gla.ac.uk/2975/01/2011shannonphd.pdf 3) Talk at EuroPython 2011 https://ep2012.europython.eu/conference/talks/making-cpython-fast-using-trace-based-optimisations 4) Talk at EuroPythnon 2012 https://ep2012.europython.eu/conference/talks/hotpy-2-a-high-performance-binary-compatible-virtual-machine-for-python From ncoghlan at gmail.com Wed Jul 18 12:30:04 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 18 Jul 2012 20:30:04 +1000 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: <500685B5.2020507@hotpy.org> References: <500685B5.2020507@hotpy.org> Message-ID: On Wed, Jul 18, 2012 at 7:45 PM, Mark Shannon wrote: > The practice > ------------ > > If you want modest speedup for modest effort, then look at Cesare's WPython. 
Also take a look at Stefan Brunthaler's work on inline caching in an interpreter. If you want a larger speedup then you need to tackle most or all of the causes of execution overhead listed above. HotPy (version 2, a fork of CPython) aims to tackle all of these causes except the GC overhead. As far as I am aware, it is the only project that does so. Please take a look at www.hotpy.org for more information on HotPy. You can see my talk from EuroPython 2011(3) on the ideas behind it and from EuroPython 2012(4) on the current implementation. Finally, a defence of LLVM. LLVM is a quality piece of software. It may have some bugs, but so does all software. The code-generation components are designed with static compilation in mind, so they do use a lot of memory and run slowly for a JIT compiler, but they produce excellent quality code. And don't forget the old saying about blaming your tools ;) If HotPy (version 2) were to have an (optional) JIT I would expect it to be LLVM based. The JIT can run in a separate thread, while the optimised code continues to run in the interpreter, patching in the machine code when it is complete. Cheers, Mark. 1) Talk at EuroPython 2010 Slides: www.dcs.gla.ac.uk/~marks/comparison.pdf Video: http://blip.tv/europythonvideos/mark_shannon-_hotpy_a_comparison-3999872 The information in the talk is a bit out of date; PyPy now includes out-of-line guards. 2) theses.gla.ac.uk/2975/01/2011shannonphd.pdf 3) Talk at EuroPython 2011 https://ep2012.europython.eu/conference/talks/making-cpython-fast-using-trace-based-optimisations 4) Talk at EuroPython 2012 https://ep2012.europython.eu/conference/talks/hotpy-2-a-high-performance-binary-compatible-virtual-machine-for-python From ncoghlan at gmail.com Wed Jul 18 12:30:04 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 18 Jul 2012 20:30:04 +1000 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: <500685B5.2020507@hotpy.org> References: <500685B5.2020507@hotpy.org> Message-ID: On Wed, Jul 18, 2012 at 7:45 PM, Mark Shannon wrote: > The practice > ------------ > > If you want modest speedup for modest effort, then look at Cesare's WPython.
The fact that > the version that Apple ships with Xcode still miscompiles Python 3.3 > tells me that it is still buggy. Does it miscompile? I regularly run the 3.3 testsuite using the latest Xcode from the Appstore on a OSX Lion machine and that works properly. The only unexpected test failures are in ctypes, which is probably caused by the way the clang developers interpret the ABI w.r.t. passing small values. Ronald -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 4788 bytes Desc: not available URL: From eliben at gmail.com Wed Jul 18 14:46:38 2012 From: eliben at gmail.com (Eli Bendersky) Date: Wed, 18 Jul 2012 15:46:38 +0300 Subject: [Python-Dev] PyBytes_Join In-Reply-To: References: Message-ID: On Wed, Jul 18, 2012 at 3:35 PM, Eli Bendersky wrote: > Hi, > > PyUnicode_Join is a public C API function. However, its cousin for the > bytes object is tucked privately in Objects/bytesobject.c as _PyBytes_Join. > Is there any harm in exposing it publicly? > > A more correctly formulated question would be: why is _PyBytes_Join part of the limited API, while PyUnicode_Join is in the full API. Is there a reason for the former to be less stable than the latter? Thanks in advance, Eli -------------- next part -------------- An HTML attachment was scrubbed... URL: From eliben at gmail.com Wed Jul 18 14:35:15 2012 From: eliben at gmail.com (Eli Bendersky) Date: Wed, 18 Jul 2012 15:35:15 +0300 Subject: [Python-Dev] PyBytes_Join Message-ID: Hi, PyUnicode_Join is a public C API function. However, its cousin for the bytes object is tucked privately in Objects/bytesobject.c as _PyBytes_Join. Is there any harm in exposing it publicly? Eli -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From martin at v.loewis.de Wed Jul 18 16:59:14 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Wed, 18 Jul 2012 16:59:14 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> <20120718075314.Horde.ty5wC7uWis5QBk9Krz1hyUA@webmail.df.eu> Message-ID: <20120718165914.Horde.a4ccVklCcOxQBs9CvpTDH4A@webmail.df.eu> Zitat von Ronald Oussoren : > On 18 Jul, 2012, at 7:53, martin at v.loewis.de wrote: > >>> What is the status of LLVM nowadays? Is it not a good solution to >>> write a portable JIT? >> >> I don't think it is. It is still slow and memory hungry. The fact that >> the version that Apple ships with Xcode still miscompiles Python 3.3 >> tells me that it is still buggy. > > Does it miscompile? I'm talking about the bug in http://mail.python.org/pipermail/python-dev/2011-September/113731.html > I regularly run the 3.3 testsuite using the latest Xcode from the > Appstore on a OSX Lion machine and that works properly. I'm not actually using the latest Xcode. So if you could test my test program, that would be much appreciated. Regards, Martin From ronaldoussoren at mac.com Wed Jul 18 17:15:18 2012 From: ronaldoussoren at mac.com (Ronald Oussoren) Date: Wed, 18 Jul 2012 17:15:18 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: <20120718165914.Horde.a4ccVklCcOxQBs9CvpTDH4A@webmail.df.eu> References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> <20120718075314.Horde.ty5wC7uWis5QBk9Krz1hyUA@webmail.df.eu> <20120718165914.Horde.a4ccVklCcOxQBs9CvpTDH4A@webmail.df.eu> Message-ID: <33283836-894E-4018-8B3F-9F799DB8B064@mac.com> On 18 Jul, 2012, at 16:59, martin at v.loewis.de wrote: > > Zitat von Ronald Oussoren : > >> On 18 Jul, 2012, at 7:53, martin at v.loewis.de wrote: >> >>>> What is the status of LLVM nowadays? Is it not a good solution to >>>> write a portable JIT? >>> >>> I don't think it is. 
It is still slow and memory hungry. The fact that >>> the version that Apple ships with Xcode still miscompiles Python 3.3 >>> tells me that it is still buggy. >> >> Does it miscompile? > > I'm talking about the bug in > > http://mail.python.org/pipermail/python-dev/2011-September/113731.html > >> I regularly run the 3.3 testsuite using the latest Xcode from the Appstore on a OSX Lion machine and that works properly. > > I'm not actually using the latest Xcode. So if you could test my test > program, that would be much appreciated. That bug in llvm-gcc still exists, and is unlikely to get fixed. That's a bug in the integration of the GCC frontend and LLVM backend; clang (LLVM project frontend + LLVM backend) does work. Ronald -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 4788 bytes Desc: not available URL: From jimjjewett at gmail.com Wed Jul 18 18:50:30 2012 From: jimjjewett at gmail.com (Jim Jewett) Date: Wed, 18 Jul 2012 12:50:30 -0400 Subject: [Python-Dev] [Python-checkins] cpython: Close #15387: inspect.getmodulename() now uses a new In-Reply-To: <3Wcf3h3GZTzPFg@mail.python.org> References: <3Wcf3h3GZTzPFg@mail.python.org> Message-ID: Why is inspect.getmoduleinfo() deprecated? Is it just to remove circular dependencies? FWIW, I much prefer an API like: tell_me_about(object) to one like: for test_data in (X, Y, Z): usable = tester(object, test_data) if valid(usable): return possible_results[test_data] and to me, inspect.getmoduleinfo(path) looks like the first, while checking the various importlib.machinery.*SUFFIXES looks like the second.
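[An illustrative aside added in editing, not part of Jim's mail.] The behaviour being discussed is easy to poke at interactively; getmodulename() answers only "does this path look like a module file", and returns None for everything else:

```python
import inspect

# A recognised module suffix yields the bare module name...
assert inspect.getmodulename("/tmp/pkg/spam.py") == "spam"
# ...while unrecognised suffixes and package directories yield None.
assert inspect.getmodulename("/tmp/pkg/spam.txt") is None
assert inspect.getmodulename("/tmp/pkg") is None
```

The paths here are arbitrary examples; the function never touches the filesystem, it only inspects the name against the recognised suffix list.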
-jJ On 7/18/12, nick.coghlan wrote: > http://hg.python.org/cpython/rev/af7961e1c362 > changeset: 78161:af7961e1c362 > user: Nick Coghlan > date: Wed Jul 18 23:14:57 2012 +1000 > summary: > Close #15387: inspect.getmodulename() now uses a new > importlib.machinery.all_suffixes() API rather than the deprecated > inspect.getmoduleinfo() > > files: > Doc/library/importlib.rst | 13 ++++++++++++- > Doc/library/inspect.rst | 15 ++++++++++++--- > Lib/importlib/machinery.py | 4 ++++ > Lib/inspect.py | 11 +++++++++-- > Misc/NEWS | 3 +++ > 5 files changed, 40 insertions(+), 6 deletions(-) > > > diff --git a/Doc/library/importlib.rst b/Doc/library/importlib.rst > --- a/Doc/library/importlib.rst > +++ b/Doc/library/importlib.rst > @@ -533,12 +533,23 @@ > > .. attribute:: EXTENSION_SUFFIXES > > - A list of strings representing the the recognized file suffixes for > + A list of strings representing the recognized file suffixes for > extension modules. > > .. versionadded:: 3.3 > > > +.. func:: all_suffixes() > + > + Returns a combined list of strings representing all file suffixes for > + Python modules recognized by the standard import machinery. This is a > + helper for code which simply needs to know if a filesystem path > + potentially represents a Python module (for example, > + :func:`inspect.getmodulename`) > + > + .. versionadded:: 3.3 > + > + > .. class:: BuiltinImporter > > An :term:`importer` for built-in modules. All known built-in modules > are > diff --git a/Doc/library/inspect.rst b/Doc/library/inspect.rst > --- a/Doc/library/inspect.rst > +++ b/Doc/library/inspect.rst > @@ -198,9 +198,18 @@ > .. function:: getmodulename(path) > > Return the name of the module named by the file *path*, without > including the > - names of enclosing packages. This uses the same algorithm as the > interpreter > - uses when searching for modules. If the name cannot be matched > according to the > - interpreter's rules, ``None`` is returned. > + names of enclosing packages. 
The file extension is checked against all > of > + the entries in :func:`importlib.machinery.all_suffixes`. If it matches, > + the final path component is returned with the extension removed. > + Otherwise, ``None`` is returned. > + > + Note that this function *only* returns a meaningful name for actual > + Python modules - paths that potentially refer to Python packages will > + still return ``None``. > + > + .. versionchanged:: 3.3 > + This function is now based directly on :mod:`importlib` rather than > the > + deprecated :func:`getmoduleinfo`. > > > .. function:: ismodule(object) > diff --git a/Lib/importlib/machinery.py b/Lib/importlib/machinery.py > --- a/Lib/importlib/machinery.py > +++ b/Lib/importlib/machinery.py > @@ -13,3 +13,7 @@ > from ._bootstrap import ExtensionFileLoader > > EXTENSION_SUFFIXES = _imp.extension_suffixes() > + > +def all_suffixes(): > + """Returns a list of all recognized module suffixes for this process""" > + return SOURCE_SUFFIXES + BYTECODE_SUFFIXES + EXTENSION_SUFFIXES > diff --git a/Lib/inspect.py b/Lib/inspect.py > --- a/Lib/inspect.py > +++ b/Lib/inspect.py > @@ -450,8 +450,15 @@ > > def getmodulename(path): > """Return the module name for a given file, or None.""" > - info = getmoduleinfo(path) > - if info: return info[0] > + fname = os.path.basename(path) > + # Check for paths that look like an actual module file > + suffixes = [(-len(suffix), suffix) > + for suffix in importlib.machinery.all_suffixes()] > + suffixes.sort() # try longest suffixes first, in case they overlap > + for neglen, suffix in suffixes: > + if fname.endswith(suffix): > + return fname[:neglen] > + return None > > def getsourcefile(object): > """Return the filename that can be used to locate an object's source. > diff --git a/Misc/NEWS b/Misc/NEWS > --- a/Misc/NEWS > +++ b/Misc/NEWS > @@ -41,6 +41,9 @@ > Library > ------- > > +- Issue #15397: inspect.getmodulename() is now based directly on importlib > + via a new importlib.machinery.all_suffixes() API. 
> + > - Issue #14635: telnetlib will use poll() rather than select() when > possible > to avoid failing due to the select() file descriptor limit. > > > -- > Repository URL: http://hg.python.org/cpython > From ncoghlan at gmail.com Thu Jul 19 01:31:52 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 19 Jul 2012 09:31:52 +1000 Subject: [Python-Dev] [Python-checkins] cpython: Close #15387: inspect.getmodulename() now uses a new In-Reply-To: References: <3Wcf3h3GZTzPFg@mail.python.org> Message-ID: Because the concepts it is based on are no longer used internally - determining the kind of module is now the province of importlib's finders and loaders. -- Sent from my phone, thus the relative brevity :) On Jul 19, 2012 2:50 AM, "Jim Jewett" wrote: > Why is inspect.getmoduleinfo() deprecated? Is it just to remove > circular dependencies? > > FWIW, I much prefer an API like: > > tell_me_about(object) > > to one like: > > for test_data in (X, Y, Z): > usable = tester(object, test_data) > if valid(usable): > return possible_results[test_data] > > and to me, inspect.getmoduleinfo(path) looks like the first, while > checking the various import.machinery.*SUFFIXES looks like the second. 
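[For reference, the suffix-matching loop from the patch quoted in this thread can be run standalone. A minimal sketch; the suffix list below is a trimmed stand-in for what importlib.machinery.all_suffixes() returns in a real interpreter:

```python
import os.path

# Assumed, trimmed suffix list for illustration only; the patched
# inspect.getmodulename() calls importlib.machinery.all_suffixes().
ALL_SUFFIXES = [".py", ".pyc", ".so"]

def getmodulename(path):
    """Return the module name for a given file, or None."""
    fname = os.path.basename(path)
    # Sort on negated length so the longest suffixes are tried first,
    # in case one suffix is the tail of another (".py" vs ".pyc").
    suffixes = sorted((-len(suffix), suffix) for suffix in ALL_SUFFIXES)
    for neglen, suffix in suffixes:
        if fname.endswith(suffix):
            return fname[:neglen]
    return None

print(getmodulename("/tmp/spam.py"))   # -> spam
print(getmodulename("/tmp/notes.txt")) # -> None
```

The negated-length trick avoids a separate `reverse=True` sort while keeping ties ordered alphabetically. -- ed.]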
> > -jJ > > On 7/18/12, nick.coghlan wrote: > > http://hg.python.org/cpython/rev/af7961e1c362 > > changeset: 78161:af7961e1c362 > > user: Nick Coghlan > > date: Wed Jul 18 23:14:57 2012 +1000 > > summary: > > Close #15387: inspect.getmodulename() now uses a new > > importlib.machinery.all_suffixes() API rather than the deprecated > > inspect.getmoduleinfo() > > > > files: > > Doc/library/importlib.rst | 13 ++++++++++++- > > Doc/library/inspect.rst | 15 ++++++++++++--- > > Lib/importlib/machinery.py | 4 ++++ > > Lib/inspect.py | 11 +++++++++-- > > Misc/NEWS | 3 +++ > > 5 files changed, 40 insertions(+), 6 deletions(-) > > > > > > diff --git a/Doc/library/importlib.rst b/Doc/library/importlib.rst > > --- a/Doc/library/importlib.rst > > +++ b/Doc/library/importlib.rst > > @@ -533,12 +533,23 @@ > > > > .. attribute:: EXTENSION_SUFFIXES > > > > - A list of strings representing the the recognized file suffixes for > > + A list of strings representing the recognized file suffixes for > > extension modules. > > > > .. versionadded:: 3.3 > > > > > > +.. func:: all_suffixes() > > + > > + Returns a combined list of strings representing all file suffixes for > > + Python modules recognized by the standard import machinery. This is a > > + helper for code which simply needs to know if a filesystem path > > + potentially represents a Python module (for example, > > + :func:`inspect.getmodulename`) > > + > > + .. versionadded:: 3.3 > > + > > + > > .. class:: BuiltinImporter > > > > An :term:`importer` for built-in modules. All known built-in modules > > are > > diff --git a/Doc/library/inspect.rst b/Doc/library/inspect.rst > > --- a/Doc/library/inspect.rst > > +++ b/Doc/library/inspect.rst > > @@ -198,9 +198,18 @@ > > .. function:: getmodulename(path) > > > > Return the name of the module named by the file *path*, without > > including the > > - names of enclosing packages. This uses the same algorithm as the > > interpreter > > - uses when searching for modules. 
If the name cannot be matched > > according to the > > - interpreter's rules, ``None`` is returned. > > + names of enclosing packages. The file extension is checked against > all > > of > > + the entries in :func:`importlib.machinery.all_suffixes`. If it > matches, > > + the final path component is returned with the extension removed. > > + Otherwise, ``None`` is returned. > > + > > + Note that this function *only* returns a meaningful name for actual > > + Python modules - paths that potentially refer to Python packages will > > + still return ``None``. > > + > > + .. versionchanged:: 3.3 > > + This function is now based directly on :mod:`importlib` rather > than > > the > > + deprecated :func:`getmoduleinfo`. > > > > > > .. function:: ismodule(object) > > diff --git a/Lib/importlib/machinery.py b/Lib/importlib/machinery.py > > --- a/Lib/importlib/machinery.py > > +++ b/Lib/importlib/machinery.py > > @@ -13,3 +13,7 @@ > > from ._bootstrap import ExtensionFileLoader > > > > EXTENSION_SUFFIXES = _imp.extension_suffixes() > > + > > +def all_suffixes(): > > + """Returns a list of all recognized module suffixes for this > process""" > > + return SOURCE_SUFFIXES + BYTECODE_SUFFIXES + EXTENSION_SUFFIXES > > diff --git a/Lib/inspect.py b/Lib/inspect.py > > --- a/Lib/inspect.py > > +++ b/Lib/inspect.py > > @@ -450,8 +450,15 @@ > > > > def getmodulename(path): > > """Return the module name for a given file, or None.""" > > - info = getmoduleinfo(path) > > - if info: return info[0] > > + fname = os.path.basename(path) > > + # Check for paths that look like an actual module file > > + suffixes = [(-len(suffix), suffix) > > + for suffix in importlib.machinery.all_suffixes()] > > + suffixes.sort() # try longest suffixes first, in case they overlap > > + for neglen, suffix in suffixes: > > + if fname.endswith(suffix): > > + return fname[:neglen] > > + return None > > > > def getsourcefile(object): > > """Return the filename that can be used to locate an object's > source. 
> > diff --git a/Misc/NEWS b/Misc/NEWS > > --- a/Misc/NEWS > > +++ b/Misc/NEWS > > @@ -41,6 +41,9 @@ > > Library > > ------- > > > > +- Issue #15397: inspect.getmodulename() is now based directly on > importlib > > + via a new importlib.machinery.all_suffixes() API. > > + > > - Issue #14635: telnetlib will use poll() rather than select() when > > possible > > to avoid failing due to the select() file descriptor limit. > > > > > > -- > > Repository URL: http://hg.python.org/cpython > > > _______________________________________________ > Python-checkins mailing list > Python-checkins at python.org > http://mail.python.org/mailman/listinfo/python-checkins > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at epy.co.at Thu Jul 19 14:28:15 2012 From: stefan at epy.co.at (Stefan H. Holek) Date: Thu, 19 Jul 2012 14:28:15 +0200 Subject: [Python-Dev] Why no venv in existing directory? Message-ID: <26A4F829-8091-4CA4-A4F5-A18B5813C1CC@epy.co.at> Hi All, While trying 3.3 beta I found that I cannot use my favorite virtualenv pattern with pyvenv: $ virtualenv . Installing.....done. $ pyvenv . Error: Directory exists: /Users/stefan/sandbox/foo I appreciate that this behavior is documented and was in the PEP from the start: "If the target directory already exists an error will be raised, unless the --clear or --upgrade option was provided." Still, I am curious what the rationale behind this restriction is. For me, being able to "bless" an existing directory with a virtualenv has always been one of its attractions. Thanks, Stefan -- Stefan H. Holek stefan at epy.co.at From andrew.svetlov at gmail.com Thu Jul 19 18:26:42 2012 From: andrew.svetlov at gmail.com (Andrew Svetlov) Date: Thu, 19 Jul 2012 19:26:42 +0300 Subject: [Python-Dev] venv scripts for fish and csh shells Message-ID: virtualenv has virtualenv.csh and virtualenv.fish files. Is there any reason for restricting venv to bash/zsh only? 
-- Thanks, Andrew Svetlov From carl at oddbird.net Thu Jul 19 18:39:31 2012 From: carl at oddbird.net (Carl Meyer) Date: Thu, 19 Jul 2012 10:39:31 -0600 Subject: [Python-Dev] Why no venv in existing directory? In-Reply-To: <26A4F829-8091-4CA4-A4F5-A18B5813C1CC@epy.co.at> References: <26A4F829-8091-4CA4-A4F5-A18B5813C1CC@epy.co.at> Message-ID: <50083843.2040105@oddbird.net> Hi Stefan, On 07/19/2012 06:28 AM, Stefan H. Holek wrote: > While trying 3.3 beta I found that I cannot use my favorite > virtualenv pattern with pyvenv: > > $ virtualenv . Installing.....done. > > $ pyvenv . Error: Directory exists: /Users/stefan/sandbox/foo > > I appreciate that this behavior is documented and was in the PEP from > the start: "If the target directory already exists an error will be > raised, unless the --clear or --upgrade option was provided." > > Still, I am curious what the rationale behind this restriction is. I'd have no problem with lifting the restriction. I don't recall any clear rationale; I think it was probably just the simplest implementation initially, and no one ever raised it as an issue in the PEP process. Carl From carl at oddbird.net Thu Jul 19 18:31:45 2012 From: carl at oddbird.net (Carl Meyer) Date: Thu, 19 Jul 2012 10:31:45 -0600 Subject: [Python-Dev] venv scripts for fish and csh shells In-Reply-To: References: Message-ID: <50083671.6050903@oddbird.net> On 07/19/2012 10:26 AM, Andrew Svetlov wrote: > virtualenv has virtualenv.csh and virtualenv.fish files. > Is there any reason for restricting venv to bash/zsh only? No. As far as I'm concerned, a patch to port the virtualenv csh and fish activate scripts to pyvenv would be welcome (though I can't commit said patch, so it might be good to hear if Vinay has a different opinion). 
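[Such a port could be prototyped without patching the stdlib at all: venv's EnvBuilder documents a post_setup() hook for exactly this kind of customization. A sketch, where ExtraScriptsBuilder is an invented name and the script body is only a placeholder, not a real fish port:

```python
import os
import tempfile
import venv

class ExtraScriptsBuilder(venv.EnvBuilder):
    """Drop an extra activation script into a freshly created
    environment via the documented post_setup() hook."""

    def post_setup(self, context):
        # context.bin_path is the new environment's scripts directory
        stub = os.path.join(context.bin_path, "activate.fish")
        with open(stub, "w") as f:
            f.write("# placeholder: a real fish activation script goes here\n")

target = os.path.join(tempfile.mkdtemp(), "env")
ExtraScriptsBuilder().create(target)  # runs post_setup() last
```

-- ed.]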
Carl From mstefanro at gmail.com Thu Jul 19 18:53:27 2012 From: mstefanro at gmail.com (M Stefan) Date: Thu, 19 Jul 2012 19:53:27 +0300 Subject: [Python-Dev] Unbinding of methods Message-ID: <50083B87.3050502@gmail.com> Hey, As part of pickle4, I found it interesting to add the possibility of pickling bound functions (instance methods). This is done by pickling f.__self__ and f.__func__ separately, and then adding a BIND opcode to tie them together. While this appears to work fine for python methods (non-builtin), some issues arise with builtins. These are partly caused because not all builtin function types support __func__, partly because not all of them fill __module__ when they should and partly because there are many (7) types a function can actually have: ClassMethodDescriptorType = type(??) BuiltinFunctionType = type(len) FunctionType = type(f) MethodType = type(A().f()) MethodDescriptorType = type(list.append) WrapperDescriptorType = type(list.__add__) MethodWrapperType = type([].__add__) AllFunctionTypes = (ClassMethodDescriptorType, BuiltinFunctionType, FunctionType, MethodType, MethodDescriptorType, WrapperDescriptorType, MethodWrapperType) repr(AllFunctionTypes) = ( , , , , , , ) I have created a patch at [1], which adds __func__ to some other function types, as well as: 1) adds AllFunctionTypes etc. to Lib/types.py 2) inspect.isanyfunction(), inspect.isanyboundfunction(), inspect.isanyunboundfunction() 3) functools.unbind Note that I am not knowledgeable of cpython internals and therefore the patch needs to be carefully reviewed. Possible issues: Should classmethods be considered bound or unbound? If cm is a classmethod, then should cm.__func__.__self__ = cm.__self__ or cm.__func__.__self__ = None? Currently does the latter: >>> cm.__self__, hasattr(cm,'__self__'), hasattr(cm.__func__, '__self__') (, True, False) This requires treating classmethods separately when pickling, so I'm not sure if this is ideal. 
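[The classmethod behaviour described above can be checked directly on current CPython; a minimal sketch:

```python
class A:
    @classmethod
    def cm(cls):
        return cls.__name__

bound = A.cm  # accessing the classmethod binds it to the class

# The bound object knows its class ...
assert bound.__self__ is A
# ... but the underlying plain function carries no __self__ of its own.
assert not hasattr(bound.__func__, "__self__")
print(bound())  # -> A
```

-- ed.]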
Let me know if I should have opened an issue instead. I look forward to hearing your opinions/suggestions on this matter. Regards, Stefan M [1] https://gist.github.com/3145210 From brett at python.org Thu Jul 19 19:00:47 2012 From: brett at python.org (Brett Cannon) Date: Thu, 19 Jul 2012 13:00:47 -0400 Subject: [Python-Dev] Unbinding of methods In-Reply-To: <50083B87.3050502@gmail.com> References: <50083B87.3050502@gmail.com> Message-ID: On Thu, Jul 19, 2012 at 12:53 PM, M Stefan wrote: > Hey, > > As part of pickle4, I found it interesting to add the possibility > of pickling bound functions (instance methods). This is done by > pickling f.__self__ and f.__func__ separately, and then adding > a BIND opcode to tie them together. > > While this appears to work fine for python methods (non-builtin), some > issues arise with builtins. These are partly caused because > not all builtin function types support __func__, partly because > not all of them fill __module__ when they should and partly > because there are many (7) types a function can actually have: > > ClassMethodDescriptorType = type(??) > BuiltinFunctionType = type(len) > FunctionType = type(f) > MethodType = type(A().f()) > MethodDescriptorType = type(list.append) > WrapperDescriptorType = type(list.__add__) > MethodWrapperType = type([].__add__) > > AllFunctionTypes = (ClassMethodDescriptorType, BuiltinFunctionType, > FunctionType, MethodType, MethodDescriptorType, > WrapperDescriptorType, MethodWrapperType) > repr(AllFunctionTypes) = ( > , > , , > , , > , ) > > I have created a patch at [1], which adds __func__ to some other > function types, as well as: > 1) adds AllFunctionTypes etc. to Lib/types.py > 2) inspect.isanyfunction(), inspect.isanyboundfunction(), > inspect.isanyunboundfunction() > 3) functools.unbind > Note that I am not knowledgeable of cpython internals and therefore > the patch needs to be carefully reviewed. > > Possible issues: Should classmethods be considered bound or unbound? 
> If cm is a classmethod, then should > cm.__func__.__self__ = cm.__self__ or cm.__func__.__self__ = None? > Currently does the latter: > >>> cm.__self__, hasattr(cm,'__self__'), hasattr(cm.__func__, > '__self__') > (, True, False) > This requires treating classmethods separately when pickling, > so I'm not sure if this is ideal. > > Let me know if I should have opened an issue instead. I look > forward to hearing your opinions/suggestions on this matter. > Yes, open an issue for your patch and reply here with the issue #. -------------- next part -------------- An HTML attachment was scrubbed... URL: From brett at python.org Thu Jul 19 19:43:52 2012 From: brett at python.org (Brett Cannon) Date: Thu, 19 Jul 2012 13:43:52 -0400 Subject: [Python-Dev] Unbinding of methods In-Reply-To: References: <50083B87.3050502@gmail.com> Message-ID: The issue is http://bugs.python.org/issue15397 [Stefan accidentally replied privately to me] On Thu, Jul 19, 2012 at 1:00 PM, Brett Cannon wrote: > > > On Thu, Jul 19, 2012 at 12:53 PM, M Stefan wrote: > >> Hey, >> >> As part of pickle4, I found it interesting to add the possibility >> of pickling bound functions (instance methods). This is done by >> pickling f.__self__ and f.__func__ separately, and then adding >> a BIND opcode to tie them together. >> >> While this appears to work fine for python methods (non-builtin), some >> issues arise with builtins. These are partly caused because >> not all builtin function types support __func__, partly because >> not all of them fill __module__ when they should and partly >> because there are many (7) types a function can actually have: >> >> ClassMethodDescriptorType = type(??) 
>> BuiltinFunctionType = type(len) >> FunctionType = type(f) >> MethodType = type(A().f()) >> MethodDescriptorType = type(list.append) >> WrapperDescriptorType = type(list.__add__) >> MethodWrapperType = type([].__add__) >> >> AllFunctionTypes = (ClassMethodDescriptorType, BuiltinFunctionType, >> FunctionType, MethodType, MethodDescriptorType, >> WrapperDescriptorType, MethodWrapperType) >> repr(AllFunctionTypes) = ( >> , >> , , >> , , >> , ) >> >> I have created a patch at [1], which adds __func__ to some other >> function types, as well as: >> 1) adds AllFunctionTypes etc. to Lib/types.py >> 2) inspect.isanyfunction(), inspect.isanyboundfunction(), >> inspect.isanyunboundfunction() >> 3) functools.unbind >> Note that I am not knowledgeable of cpython internals and therefore >> the patch needs to be carefully reviewed. >> >> Possible issues: Should classmethods be considered bound or unbound? >> If cm is a classmethod, then should >> cm.__func__.__self__ = cm.__self__ or cm.__func__.__self__ = None? >> Currently does the latter: >> >>> cm.__self__, hasattr(cm,'__self__'), hasattr(cm.__func__, >> '__self__') >> (, True, False) >> This requires treating classmethods separately when pickling, >> so I'm not sure if this is ideal. >> >> Let me know if I should have opened an issue instead. I look >> forward to hearing your opinions/suggestions on this matter. >> > > Yes, open an issue for your patch and reply here with the issue #. > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From solipsis at pitrou.net Thu Jul 19 20:44:09 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Thu, 19 Jul 2012 20:44:09 +0200 Subject: [Python-Dev] clang References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> <20120718075314.Horde.ty5wC7uWis5QBk9Krz1hyUA@webmail.df.eu> <20120718165914.Horde.a4ccVklCcOxQBs9CvpTDH4A@webmail.df.eu> <33283836-894E-4018-8B3F-9F799DB8B064@mac.com> Message-ID: <20120719204409.19c9996d@pitrou.net> On Wed, 18 Jul 2012 17:15:18 +0200 Ronald Oussoren wrote: > > > >> I regularly run the 3.3 testsuite using the latest Xcode from the Appstore on a OSX Lion machine and that works properly. > > > > I'm not actually using the latest Xcode. So if you could test my test > > program, that would be much appreciated. > > That bug in llvm-gcc still exists, and is unlikely to get fixed. That's a bug in the integretion of the GCC frontend and LLVM backend, clang (LLVM project frontend + LLVM backend) does work. Not only clang seems to work, but we have a stable buildbot running on it: http://buildbot.python.org/all/buildslaves/langa-lion Regards Antoine. From solipsis at pitrou.net Thu Jul 19 20:48:21 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Thu, 19 Jul 2012 20:48:21 +0200 Subject: [Python-Dev] Use function names instead of functions for os.supports_dir_fd? References: <5005F918.1050708@pearwood.info> Message-ID: <20120719204821.4654b294@pitrou.net> On Wed, 18 Jul 2012 02:26:14 +0200 Victor Stinner wrote: > >> Monkey patching is a common practice in Python. test_os.py replaces > >> os.exec*() functions temporary for example. > > > > Perhaps for testing, but I don't think monkey-patching is common in > > production code. Perhaps you are thinking of Ruby :) > > The gevent library does monkey-patch os.fork (and time.sleep and many > other functions), but gevent is maybe not ready for production? 
:-) Extensive monkey-patching of core OS functions would certainly make me wary of using such a third-party library, even if it claims to be "serious". Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From solipsis at pitrou.net Thu Jul 19 20:54:08 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Thu, 19 Jul 2012 20:54:08 +0200 Subject: [Python-Dev] Unbinding of methods References: <50083B87.3050502@gmail.com> Message-ID: <20120719205408.17bd5ceb@pitrou.net> On Thu, 19 Jul 2012 19:53:27 +0300 M Stefan wrote: > Hey, > > As part of pickle4, I found it interesting to add the possibility > of pickling bound functions (instance methods). This is done by > pickling f.__self__ and f.__func__ separately, and then adding > a BIND opcode to tie them together. Instead of a specific opcode, can't you use a suitable __reduce__ magic (or __getnewargs__, perhaps)? We want to limit the number of opcodes except for performance-critical types (and I don't think bound methods are performance-critical for the purpose of serialization). > I have created a patch at [1], which adds __func__ to some other > function types, as well as: > 1) adds AllFunctionTypes etc. to Lib/types.py > 2) inspect.isanyfunction(), inspect.isanyboundfunction(), > inspect.isanyunboundfunction() > 3) functools.unbind That sounds like a lot of changes if the goal is simply to make those types picklable. Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From shibturn at gmail.com Thu Jul 19 21:51:39 2012 From: shibturn at gmail.com (Richard Oudkerk) Date: Thu, 19 Jul 2012 20:51:39 +0100 Subject: [Python-Dev] Unbinding of methods In-Reply-To: <20120719205408.17bd5ceb@pitrou.net> References: <50083B87.3050502@gmail.com> <20120719205408.17bd5ceb@pitrou.net> Message-ID: On 19/07/2012 7:54pm, Antoine Pitrou wrote: > Instead of a specific opcode, can't you use a suitable __reduce__ > magic (or __getnewargs__, perhaps)?
We want to limit the number of > opcodes except for performance-critical types (and I don't think > bound methods are performance-critical for the purpose of > serialization). The one wrinkle is that BuiltinFunctionType is special cased to be pickled with save_global, and has no fallback to __reduce__/__reduce_ex__/copyreg. (The C implementation for FunctionType *does* have such a fallback, whereas the Python implementation doesn't -- see bug http://bugs.python.org/issue14336.) If the fallback is added for BuiltinFunctionType then "__reduce__ magic" should be enough. The following code works as expected: import pickle import copyreg class A(object): def f(self): pass @classmethod def g(cls): pass def f(self): pass ClassMethodDescriptorType = type(A.g) BuiltinFunctionType = type(len) FunctionType = type(f) MethodType = type(A().f) MethodDescriptorType = type(list.append) WrapperDescriptorType = type(list.__add__) MethodWrapperType = type([].__add__) obj_list = [A.g, len, f, A().f, list.append, list.__add__, [].__add__] assert ClassMethodDescriptorType is MethodType def reduce_self(self): return getattr, (self.__self__, self.__name__) def reduce_objclass(self): return getattr, (self.__objclass__, self.__name__) copyreg.pickle(MethodType, reduce_self) copyreg.pickle(BuiltinFunctionType, reduce_self) copyreg.pickle(MethodWrapperType, reduce_self) copyreg.pickle(MethodDescriptorType, reduce_objclass) copyreg.pickle(WrapperDescriptorType, reduce_objclass) for obj in obj_list: data = pickle.dumps(obj) new_obj = pickle.loads(data) print('%s\n%s\n' % (obj, new_obj)) From mstefanro at gmail.com Fri Jul 20 00:32:58 2012 From: mstefanro at gmail.com (M Stefan) Date: Fri, 20 Jul 2012 01:32:58 +0300 Subject: [Python-Dev] Unbinding of methods In-Reply-To: <20120719205408.17bd5ceb@pitrou.net> References: <50083B87.3050502@gmail.com> <20120719205408.17bd5ceb@pitrou.net> Message-ID: <50088B1A.8040705@gmail.com> On 7/19/2012 9:54 PM, Antoine Pitrou wrote: > On Thu, 19 Jul 2012 
19:53:27 +0300 > M Stefan wrote: >> Hey, >> >> As part of pickle4, I found it interesting to add the possibility >> of pickling bound functions (instance methods). This is done by >> pickling f.__self__ and f.__func__ separately, and then adding >> a BIND opcode to tie them together. > Instead of a specific opcode, can't you use a suitable __reduce__ magic > (or __getnewargs__, perhaps)? We want to limit the number of opcodes > except for performance-critical types (and I don't think bound methods > are performance-critical for the purpose of serialization). Yes, I agree that doing it with __reduce__ would be better approach than adding a new opcode, I'll consider switching. >> I have created a patch at [1], which adds __func__ to some other >> function types, as well as: >> 1) adds AllFunctionTypes etc. to Lib/types.py >> 2) inspect.isanyfunction(), inspect.isanyboundfunction(), >> inspect.isanyunboundfunction() >> 3) functools.unbind > That sounds like a lot of changes if the goal is simply to make those > types picklable. > > Regards > > Antoine. > > Indeed they are, I just thought there may be a chance this code would be used elsewhere too. It's a bit weird that you can use inspect to check for certain types of functions but not others, as well as be able to "unbind" certain types of methods but not others. Admittedly, these changes have little use-case and are not a priority. Yours, Stefan M From ncoghlan at gmail.com Fri Jul 20 02:30:16 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 20 Jul 2012 10:30:16 +1000 Subject: [Python-Dev] Runtime forks through monkeypatching (was Re: Use function names instead of functions for os.supports_dir_fd?) Message-ID: On Fri, Jul 20, 2012 at 4:48 AM, Antoine Pitrou wrote: > On Wed, 18 Jul 2012 02:26:14 +0200 > Victor Stinner wrote: > >> >> Monkey patching is a common practice in Python. test_os.py replaces >> >> os.exec*() functions temporary for example. 
>> > >> > Perhaps for testing, but I don't think monkey-patching is common in >> > production code. Perhaps you are thinking of Ruby :) >> >> The gevent library does monkey-patch os.fork (and time.sleep and many >> other functions), but gevent is maybe not ready for production? :-) > > Extensive monkey-patching of core OS functions would certainly make me > weary of using such a third-party library, even if it claims to be > "serious". gevent is one of the well-behaved players in that game - it doesn't monkeypatch anything unless you explicitly tell it to [1]. You do need to have some idea what you're doing if using the monkey patching aspect (as not all third party modules will behave well in that case - just as not all third party modules behave well on other Python implementations), but you can also just use gevent as a normal library without monkey patching anything (however, adhering strictly to that rule leaves you in the same situation as Twisted, where modules that use the standard blocking APIs are completely useless to you. Thus, if your goal is "pure async" code without any need for monkeypatching, Twisted is a much better bet than gevent, just because it's been playing that game for longer and has more fully async aware components available). That's why I call features like the gevent.monkey module "runtime forks" - when they're implemented well, they effectively give you two Python implementations without the complexity of actually having parallel Python installations (compare the complexity of using Stackless, a traditional fork, vs gevent.monkey, a runtime fork - there's a reason the core greenlet component was lifted out of Stackless and made available as a CPython extension module). Applications can decide at startup which Python variant (standard CPython, or CPython+gevent) they want to use without affecting other Python applications installed on the system and without needing to duplicate all installed Python modules into a second interpreter.
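[The explicit, reversible style of patching can be illustrated with a tiny stdlib-only helper. monkeypatched() here is an invented sketch, far simpler than what gevent.monkey actually does:

```python
import contextlib
import time

@contextlib.contextmanager
def monkeypatched(module, name, replacement):
    """Temporarily replace module.name, restoring the original on exit."""
    original = getattr(module, name)
    setattr(module, name, replacement)
    try:
        yield
    finally:
        setattr(module, name, original)

calls = []
with monkeypatched(time, "sleep", calls.append):
    time.sleep(5)  # patched: records the delay instead of sleeping

print(calls)                        # -> [5]
print(time.sleep is calls.append)   # -> False (original restored)
```

The try/finally guarantees the original binding comes back even if the patched code raises, which is the key difference between disciplined opt-in patching and import-time action at a distance. -- ed.]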
It's only when libraries monkeypatch standard interfaces *implicitly* as a side effect of import that you start to get seriously broken "action at a distance" behaviour. Or, as happened with distutils, you get a third party project (setuptools) effectively freezing implementation details of the stdlib version - when monkeypatching starts delving too deep into implementation details, it's time to switch to a traditional fork and start figuring out better ways to solve the problem (which process is now grinding away on the distutils2/packaging front). Monkeypatching is a powerful tool - used well, it lets you achieve impressive things that would otherwise be substantially more difficult to either create in the first place or maintain in the long run. As with many powerful tools though, used badly it can lead to a lot of pain. Static languages like Java decide the risk of action at a distance from monkeypatching isn't worth the power it gives you to work around issues in third party libraries without resorting to creating a traditional fork, so they strictly enforce access controls and have completely closed class definitions. Python permits monkeypatching at a language level (because when you need it, you really need it), but discourages it a social level (because there is usually a less magical technique, such as creating and using a wrapper or subclass, that will achieve the desired effect). Cheers, Nick. [1] http://www.gevent.org/intro.html#monkey-patching P.S. for a non-monkeypatching approach to reusing a synchronous library when creating an asynchronous equivalent, take a look at 10gen's attempt to create a *maintainable* Tornado-compatible async wrapper for PyMongo by using greenlets ([2][3]). It's a clever idea, but still a lot more work than just using PyMongo itself with gevent if you're only interested in the "async" part rather than the "Tornado-compatible" part. 
[2] http://emptysquare.net/blog/motor-internals-how-i-asynchronized-a-synchronous-library/ [3] http://emptysquare.net/blog/motor-four-strategies-for-maintainability/ -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From fuzzyman at voidspace.org.uk Fri Jul 20 14:55:53 2012 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Fri, 20 Jul 2012 13:55:53 +0100 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: On 17 Jul 2012, at 23:04, martin at v.loewis.de wrote: >> [snip...] > >> I would like to use a JIT to generate specialized functions for a >> combinaison of arguments types. > > I think history has moved past specializing JITs. Tracing JITs are the > status quo; they provide specialization as a side effect. > Mozilla implemented a method-JIT (compile whole methods) on top of their tracing JIT because a tracing JIT only optimises part of your code (only in loops and only if executed more times than the threshold) and there are further performance improvements to be had. So tracing JITs are not the *whole* of the state of the art. Michael > Regards, > Martin > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk > -- http://www.voidspace.org.uk/ May you do good and not evil May you find forgiveness for yourself and forgive others May you share freely, never taking more than you give. -- the sqlite blessing http://www.sqlite.org/different.html From alex.gaynor at gmail.com Fri Jul 20 17:01:56 2012 From: alex.gaynor at gmail.com (Alex Gaynor) Date: Fri, 20 Jul 2012 15:01:56 +0000 (UTC) Subject: [Python-Dev] A new JIT compiler for a faster CPython? 
References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: That's not, strictly speaking, true. Mozilla added a method-JIT (Jaegermonkey) and then added another one (IonMonkey) because their tracing JIT (Tracemonkey) was bad. There's no fundamental reason that tracing has to only cover loops, indeed PyPy's tracing has been generalized to compile individual functions, recursion, etc. And any profiling JIT, in practice, needs a compile heuristic for how many calls must occur before a unit is compiled, even the Hotspot JVM has one. Alex From jyasskin at gmail.com Fri Jul 20 17:59:34 2012 From: jyasskin at gmail.com (Jeffrey Yasskin) Date: Fri, 20 Jul 2012 08:59:34 -0700 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: On Tue, Jul 17, 2012 at 3:20 PM, Victor Stinner wrote: >> It's the JIT compiler of Unladen Swallow that "failed"; in >> my understanding because LLVM is crap (i.e. it is slow, memory-consuming, >> and buggy) - as a low-level virtual machine; it may be ok as a compiler >> backend (but I still think it is buggy there as well). > > What is the status of LLVM nowadays? Is it not a good solution to > write a portable JIT? Its code generator is still fairly slow. You could probably get a faster one committed, but you'd have to write it. LLVM also still doesn't have great profile-guided optimizations (what you need in a JIT), although the infrastructure is starting to be built. You'd probably have to contribute to that too. It's probably a better use of your time to contribute to PyPy. 
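[The compile heuristic Alex mentions - count calls, compile a unit once it crosses a threshold - can be sketched in pure Python. A toy illustration only, not a model of any real JIT; every name here is invented:

```python
import functools

HOT_THRESHOLD = 3  # toy value; real JITs tune such thresholds carefully

compile_events = []

def fake_compile(func):
    """Stand-in for a code generator: record that 'compilation'
    happened and return the function unchanged as the specialized one."""
    compile_events.append(func.__name__)
    return func

def hotspot(specialize):
    """Toy decorator: dispatch to the generic implementation until the
    call count reaches HOT_THRESHOLD, then switch to specialize(func)."""
    def decorator(func):
        state = {"calls": 0, "impl": func}

        @functools.wraps(func)
        def wrapper(*args):
            state["calls"] += 1
            if state["calls"] == HOT_THRESHOLD:
                state["impl"] = specialize(func)
            return state["impl"](*args)

        return wrapper
    return decorator

@hotspot(fake_compile)
def add(a, b):
    return a + b

for _ in range(5):
    add(1, 2)

print(compile_events)  # -> ['add']
```

-- ed.]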
Jeffrey From status at bugs.python.org Fri Jul 20 18:07:38 2012 From: status at bugs.python.org (Python tracker) Date: Fri, 20 Jul 2012 18:07:38 +0200 (CEST) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20120720160738.1ABCB1CCCC@psf.upfronthosting.co.za> ACTIVITY SUMMARY (2012-07-13 - 2012-07-20) Python tracker at http://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. Issues counts and deltas: open 3528 ( +8) closed 23658 (+55) total 27186 (+63) Open issues with patches: 1501 Issues opened (39) ================== #9374: urlparse should parse query and fragment for arbitrary schemes http://bugs.python.org/issue9374 reopened by eric.araujo #9803: IDLE closes with save while breakpoint open http://bugs.python.org/issue9803 reopened by serwy #15346: Tkinter dnd has no documentation http://bugs.python.org/issue15346 opened by weirdink13 #15347: IDLE - does not close if the debugger was active http://bugs.python.org/issue15347 opened by serwy #15348: IDLE - shell becomes unresponsive if debugger windows is close http://bugs.python.org/issue15348 opened by serwy #15350: {urllib,urllib.parse}.urlencode.__doc__ is unclear http://bugs.python.org/issue15350 opened by samwyse #15351: Add to unittest.TestCase support for using context managers http://bugs.python.org/issue15351 opened by cjerdonek #15352: importlib.h should be regenerated when the marshaling code cha http://bugs.python.org/issue15352 opened by meador.inge #15353: ld: library not found for -lpython3.3m because of wrong LDFLAG http://bugs.python.org/issue15353 opened by Jerry.Jacobs #15355: generator.__next__() docs should mention exception if already http://bugs.python.org/issue15355 opened by cjerdonek #15358: Test pkgutil.walk_packages in test_pkgutil instead of test_run http://bugs.python.org/issue15358 opened by ncoghlan #15359: Sockets support for CAN_BCM http://bugs.python.org/issue15359 opened by Thorney 
#15360: Behavior of assigning to __dict__ is not documented http://bugs.python.org/issue15360 opened by davide.rizzo #15362: pyport.h includes antiquated UTF handling for FreeBSD http://bugs.python.org/issue15362 opened by JohnSchneider #15363: Idle/tkinter ~x.py 'save as' fails. closes idle http://bugs.python.org/issue15363 opened by terry.reedy #15364: sysconfig confused by relative paths http://bugs.python.org/issue15364 opened by sbt #15365: Traceback reporting can fail if IO cannot be imported http://bugs.python.org/issue15365 opened by kristjan.jonsson #15369: pybench and test.pystone poorly documented http://bugs.python.org/issue15369 opened by flox #15370: test_runpy should include namespace package tests http://bugs.python.org/issue15370 opened by ncoghlan #15371: test_cmd_line_script should include namespace package tests http://bugs.python.org/issue15371 opened by ncoghlan #15373: copy.copy() does not properly copy os.environment http://bugs.python.org/issue15373 opened by htgoebel #15376: Refactor the test_runpy walk_package support code into a commo http://bugs.python.org/issue15376 opened by cjerdonek #15378: Underported Tools/unicode/comparecodecs.py http://bugs.python.org/issue15378 opened by storchaka #15379: Charmap decoding of no-BMP characters http://bugs.python.org/issue15379 opened by storchaka #15381: Optimize BytesIO to so less reallocations when written, simila http://bugs.python.org/issue15381 opened by eli.bendersky #15382: os.utime() mishandles some arguments http://bugs.python.org/issue15382 opened by Arfrever #15388: SAX parse (ExpatParser) leaks file handle when given filename http://bugs.python.org/issue15388 opened by mth #15389: PEP 3121, 384 refactoring applied to curses module http://bugs.python.org/issue15389 opened by Robin.Schreiber #15390: PEP 3121, 384 refactoring applied to datetime module http://bugs.python.org/issue15390 opened by Robin.Schreiber #15392: Create a unittest framework for IDLE 
http://bugs.python.org/issue15392 opened by terry.reedy #15393: JSONDecoder.raw_decode breaks on leading whitespace http://bugs.python.org/issue15393 opened by aalien #15397: Unbinding of methods http://bugs.python.org/issue15397 opened by mstefanro #15398: intermittence on UnicodeFileTests.test_rename at test_pep277 o http://bugs.python.org/issue15398 opened by flavio.ribeiro #15402: Correct __sizeof__ support for struct http://bugs.python.org/issue15402 opened by storchaka #15403: Refactor package creation support code into a common location http://bugs.python.org/issue15403 opened by cjerdonek #15406: Deprecation Warnings fixes on test suite http://bugs.python.org/issue15406 opened by flavio.ribeiro #15407: CSV parser fails to iterate properly on 2.6.6 http://bugs.python.org/issue15407 opened by sleepycal #15408: os.fork/os.popen behaviour change between 2.7 and 3.2 http://bugs.python.org/issue15408 opened by doerwalter #1492704: distinct error type if shutil.copyfile() fails because of src http://bugs.python.org/issue1492704 reopened by hynek Most recent 15 issues with no replies (15) ========================================== #15408: os.fork/os.popen behaviour change between 2.7 and 3.2 http://bugs.python.org/issue15408 #15390: PEP 3121, 384 refactoring applied to datetime module http://bugs.python.org/issue15390 #15388: SAX parse (ExpatParser) leaks file handle when given filename http://bugs.python.org/issue15388 #15371: test_cmd_line_script should include namespace package tests http://bugs.python.org/issue15371 #15370: test_runpy should include namespace package tests http://bugs.python.org/issue15370 #15362: pyport.h includes antiquated UTF handling for FreeBSD http://bugs.python.org/issue15362 #15355: generator.__next__() docs should mention exception if already http://bugs.python.org/issue15355 #15353: ld: library not found for -lpython3.3m because of wrong LDFLAG http://bugs.python.org/issue15353 #15350: {urllib,urllib.parse}.urlencode.__doc__ is 
unclear http://bugs.python.org/issue15350 #15348: IDLE - shell becomes unresponsive if debugger windows is close http://bugs.python.org/issue15348 #15347: IDLE - does not close if the debugger was active http://bugs.python.org/issue15347 #15346: Tkinter dnd has no documentation http://bugs.python.org/issue15346 #15340: OSError with "import random" when /dev/urandom doesn't exist ( http://bugs.python.org/issue15340 #15336: Argparse required arguments incorrectly displayed as optional http://bugs.python.org/issue15336 #15326: --random does not work for regrtest http://bugs.python.org/issue15326 Most recent 15 issues waiting for review (15) ============================================= #15406: Deprecation Warnings fixes on test suite http://bugs.python.org/issue15406 #15403: Refactor package creation support code into a common location http://bugs.python.org/issue15403 #15402: Correct __sizeof__ support for struct http://bugs.python.org/issue15402 #15397: Unbinding of methods http://bugs.python.org/issue15397 #15393: JSONDecoder.raw_decode breaks on leading whitespace http://bugs.python.org/issue15393 #15390: PEP 3121, 384 refactoring applied to datetime module http://bugs.python.org/issue15390 #15389: PEP 3121, 384 refactoring applied to curses module http://bugs.python.org/issue15389 #15382: os.utime() mishandles some arguments http://bugs.python.org/issue15382 #15381: Optimize BytesIO to so less reallocations when written, simila http://bugs.python.org/issue15381 #15379: Charmap decoding of no-BMP characters http://bugs.python.org/issue15379 #15378: Underported Tools/unicode/comparecodecs.py http://bugs.python.org/issue15378 #15373: copy.copy() does not properly copy os.environment http://bugs.python.org/issue15373 #15365: Traceback reporting can fail if IO cannot be imported http://bugs.python.org/issue15365 #15364: sysconfig confused by relative paths http://bugs.python.org/issue15364 #15363: Idle/tkinter ~x.py 'save as' fails. 
closes idle http://bugs.python.org/issue15363 Top 10 most discussed issues (10) ================================= #1492704: distinct error type if shutil.copyfile() fails because of src http://bugs.python.org/issue1492704 21 msgs #15364: sysconfig confused by relative paths http://bugs.python.org/issue15364 17 msgs #15398: intermittence on UnicodeFileTests.test_rename at test_pep277 o http://bugs.python.org/issue15398 14 msgs #15381: Optimize BytesIO to so less reallocations when written, simila http://bugs.python.org/issue15381 11 msgs #15352: importlib.h should be regenerated when the marshaling code cha http://bugs.python.org/issue15352 10 msgs #15301: os.chown: OverflowError: Python int too large to convert to C http://bugs.python.org/issue15301 9 msgs #14988: _elementtree: Raise ImportError when importing of pyexpat fail http://bugs.python.org/issue14988 8 msgs #15379: Charmap decoding of no-BMP characters http://bugs.python.org/issue15379 7 msgs #15393: JSONDecoder.raw_decode breaks on leading whitespace http://bugs.python.org/issue15393 7 msgs #4130: Intel icc 9.1 does not support __int128_t used by ctypes http://bugs.python.org/issue4130 6 msgs Issues closed (53) ================== #6068: ctypes is not correctly handling bitfields backed by 64 bit in http://bugs.python.org/issue6068 closed by meador.inge #6257: Idle terminates on source save while debugging http://bugs.python.org/issue6257 closed by serwy #6493: Can not set value for structure members larger than 32 bits http://bugs.python.org/issue6493 closed by meador.inge #9254: __import__ docstring should recommend importlib.import_module( http://bugs.python.org/issue9254 closed by python-dev #9458: xml.etree.ElementTree.ElementTree.write(): encoding handling p http://bugs.python.org/issue9458 closed by eli.bendersky #9591: kqueue not reporting EOF under certain circumstances http://bugs.python.org/issue9591 closed by Volodymyr.Kostyrko #13823: xml.etree.ElementTree.ElementTree.write - argument checking 
http://bugs.python.org/issue13823 closed by eli.bendersky #14600: Change ImportError reference handling, naming http://bugs.python.org/issue14600 closed by brian.curtin #14635: telnetlib uses select instead of poll - limited to FD_SETSIZE http://bugs.python.org/issue14635 closed by gregory.p.smith #14790: use packaging in setup.py http://bugs.python.org/issue14790 closed by loewis #15169: Clear C code under PyImport_ExecCodeModuleObject() http://bugs.python.org/issue15169 closed by brett.cannon #15170: Fix 64-bit building for buildbot scripts (2.7) http://bugs.python.org/issue15170 closed by skrah #15171: Fix 64-bit building for buildbot scripts (3.2) http://bugs.python.org/issue15171 closed by skrah #15180: Cryptic traceback from os.path.join when mixing str & bytes http://bugs.python.org/issue15180 closed by hynek #15230: runpy.run_path doesn't set __package__ correctly http://bugs.python.org/issue15230 closed by python-dev #15238: shutil.copystat should copy Linux extended attributes http://bugs.python.org/issue15238 closed by larry #15297: pkgutil.iter_importers() includes an ImpImporter http://bugs.python.org/issue15297 closed by cjerdonek #15299: pkgutil.ImpImporter(None).iter_modules() does not search sys.p http://bugs.python.org/issue15299 closed by ncoghlan #15307: Patch for --symlink support in pyvenv with framework python http://bugs.python.org/issue15307 closed by python-dev #15314: Use importlib instead of pkgutil in runpy http://bugs.python.org/issue15314 closed by ncoghlan #15334: access denied for HKEY_PERFORMANCE_DATA http://bugs.python.org/issue15334 closed by pitrou #15337: The cmd module incorrectly lists "help" as an undocumented com http://bugs.python.org/issue15337 closed by rhettinger #15338: test_UNC_path failure in test_import http://bugs.python.org/issue15338 closed by pitrou #15343: "pydoc -w " writes out page with empty "Package Conte http://bugs.python.org/issue15343 closed by ncoghlan #15345: HOWTOs Argparse tutorial - code example 
raises SyntaxError http://bugs.python.org/issue15345 closed by ezio.melotti #15349: SyntaxError b0rked grammar http://bugs.python.org/issue15349 closed by serwy #15354: _PyObject_LengthHint only accepts longs http://bugs.python.org/issue15354 closed by python-dev #15356: '\xa0' isspace returns true while compiling python on solaris http://bugs.python.org/issue15356 closed by r.david.murray #15357: Deprecate redundant pieces of pkgutil http://bugs.python.org/issue15357 closed by ncoghlan #15361: venv's Activate.ps1 causes broken prompt with powershell http://bugs.python.org/issue15361 closed by python-dev #15366: venv assumes header files in sys._home + '/Include' http://bugs.python.org/issue15366 closed by python-dev #15367: build_ext in a venv on Windows assumes pyconfig.h in sys.exec_ http://bugs.python.org/issue15367 closed by python-dev #15368: bytecode generation is not deterministic http://bugs.python.org/issue15368 closed by meador.inge #15372: Python is missing alternative for common quoting character http://bugs.python.org/issue15372 closed by r.david.murray #15374: venv environment variable should follow the conventions http://bugs.python.org/issue15374 closed by pitrou #15375: Trivial for fix in the subprocess documentation http://bugs.python.org/issue15375 closed by r.david.murray #15377: os.path.join() error misleading with path1=None http://bugs.python.org/issue15377 closed by hynek #15380: bytes/str mismatch in distribute http://bugs.python.org/issue15380 closed by ned.deily #15383: Autocompletion crashes Python if the __builtins__ module canno http://bugs.python.org/issue15383 closed by r.david.murray #15384: pkgutil importlib migration test was failing on Windows buildb http://bugs.python.org/issue15384 closed by ncoghlan #15385: Behaviour change in runpy for __file__ attributes between 3.2 http://bugs.python.org/issue15385 closed by ncoghlan #15386: Still getting two copies of importlib._bootstrap http://bugs.python.org/issue15386 closed by 
python-dev #15387: pkgutil.walk_packages is using a deprecated API http://bugs.python.org/issue15387 closed by python-dev #15391: Add bitlength function to the math module http://bugs.python.org/issue15391 closed by jcea #15394: memory leak in PyModule_Create2 http://bugs.python.org/issue15394 closed by meador.inge #15395: memory leaks in selectmodule.c http://bugs.python.org/issue15395 closed by python-dev #15396: memory leak in tkinter http://bugs.python.org/issue15396 closed by python-dev #15399: processName key is un-/mis-documented in 2.6 and up http://bugs.python.org/issue15399 closed by python-dev #15400: int('12345L', 10) raises ValueError http://bugs.python.org/issue15400 closed by jcea #15401: Typo in inspect.getclosurevars docstring http://bugs.python.org/issue15401 closed by meador.inge #15404: Refleak in PyMethodObject repr http://bugs.python.org/issue15404 closed by asvetlov #15405: Invitation to connect on LinkedIn http://bugs.python.org/issue15405 closed by nadeem.vawda #1767933: Badly formed XML using etree and utf-16 http://bugs.python.org/issue1767933 closed by python-dev From fijall at gmail.com Fri Jul 20 18:50:44 2012 From: fijall at gmail.com (Maciej Fijalkowski) Date: Fri, 20 Jul 2012 18:50:44 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: On Fri, Jul 20, 2012 at 2:55 PM, Michael Foord wrote: > > On 17 Jul 2012, at 23:04, martin at v.loewis.de wrote: > > >> [snip...] > > > >> I would like to use a JIT to generate specialized functions for a > >> combinaison of arguments types. > > > > I think history has moved past specializing JITs. Tracing JITs are the > > status quo; they provide specialization as a side effect. 
> > > Mozilla implemented a method-JIT (compile whole methods) on top of their > tracing JIT because a tracing JIT only optimises part of your code (only in > loops and only if executed more times than the threshold) and there are > further performance improvements to be had. So tracing JITs are not the > *whole* of the state of the art. > > Michael > I'm sorry Michael, but you're like the 100th person I have to explain this to. The mere fact that Mozilla did not make a tracing JIT work does not mean the entire approach is horribly doomed, as many people would like to assume. The reasons are multiple, but a lot of them are connected to poor engineering (for example the part inherited from Adobe is notoriously bad, have a look if you want). Cheers, fijal From solipsis at pitrou.net Fri Jul 20 22:12:26 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Fri, 20 Jul 2012 22:12:26 +0200 Subject: [Python-Dev] cpython: Issue #15168: Move importlb.test to test.test_importlib. References: <3Wf1Mz3DGzzPPy@mail.python.org> Message-ID: <20120720221226.3323fc33@pitrou.net> On Fri, 20 Jul 2012 20:49:03 +0200 (CEST) brett.cannon wrote: > diff --git a/Lib/importlib/test/__init__.py b/Lib/test/test_importlib/__init__.py > rename from Lib/importlib/test/__init__.py > rename to Lib/test/test_importlib/__init__.py > --- a/Lib/importlib/test/__init__.py > +++ b/Lib/test/test_importlib/__init__.py > @@ -1,5 +1,6 @@ > import os > import sys > +from .. import support I don't know if I'm only speaking for myself, but I really have trouble parsing non-trivial relative imports, and I personally prefer when people use absolute imports (e.g. "from test import support"). cheers Antoine. -- Software development and contracting: http://pro.pitrou.net From rdmurray at bitdance.com Fri Jul 20 22:25:04 2012 From: rdmurray at bitdance.com (R.
David Murray) Date: Fri, 20 Jul 2012 16:25:04 -0400 Subject: [Python-Dev] cpython: Issue #15168: Move importlb.test to test.test_importlib. In-Reply-To: <20120720221226.3323fc33@pitrou.net> References: <3Wf1Mz3DGzzPPy@mail.python.org> <20120720221226.3323fc33@pitrou.net> Message-ID: <20120720202505.2572C2502C2@webabinitio.net> On Fri, 20 Jul 2012 22:12:26 +0200, Antoine Pitrou wrote: > On Fri, 20 Jul 2012 20:49:03 +0200 (CEST) > brett.cannon wrote: > > diff --git a/Lib/importlib/test/__init__.py b/Lib/test/test_importlib/__init__.py > > rename from Lib/importlib/test/__init__.py > > rename to Lib/test/test_importlib/__init__.py > > --- a/Lib/importlib/test/__init__.py > > +++ b/Lib/test/test_importlib/__init__.py > > @@ -1,5 +1,6 @@ > > import os > > import sys > > +from .. import support > > I don't know if I'm only speaking for myself, but I really have trouble > parsing non-trivial relative imports, and I personally prefer when > people use absolute imports (e.g. "from test import support"). Agreed. I don't see that there is any reason to use relative imports in the stdlib. --David From andrew.svetlov at gmail.com Fri Jul 20 22:25:48 2012 From: andrew.svetlov at gmail.com (Andrew Svetlov) Date: Fri, 20 Jul 2012 23:25:48 +0300 Subject: [Python-Dev] cpython: Issue #15168: Move importlb.test to test.test_importlib. In-Reply-To: <20120720202505.2572C2502C2@webabinitio.net> References: <3Wf1Mz3DGzzPPy@mail.python.org> <20120720221226.3323fc33@pitrou.net> <20120720202505.2572C2502C2@webabinitio.net> Message-ID: +1 On Fri, Jul 20, 2012 at 11:25 PM, R. 
David Murray wrote: > On Fri, 20 Jul 2012 22:12:26 +0200, Antoine Pitrou wrote: >> On Fri, 20 Jul 2012 20:49:03 +0200 (CEST) >> brett.cannon wrote: >> > diff --git a/Lib/importlib/test/__init__.py b/Lib/test/test_importlib/__init__.py >> > rename from Lib/importlib/test/__init__.py >> > rename to Lib/test/test_importlib/__init__.py >> > --- a/Lib/importlib/test/__init__.py >> > +++ b/Lib/test/test_importlib/__init__.py >> > @@ -1,5 +1,6 @@ >> > import os >> > import sys >> > +from .. import support >> >> I don't know if I'm only speaking for myself, but I really have trouble >> parsing non-trivial relative imports, and I personally prefer when >> people use absolute imports (e.g. "from test import support"). > > Agreed. I don't see that there is any reason to use relative > imports in the stdlib. > > --David > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/andrew.svetlov%40gmail.com -- Thanks, Andrew Svetlov From raymond.hettinger at gmail.com Sat Jul 21 01:59:35 2012 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Fri, 20 Jul 2012 16:59:35 -0700 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: <500685B5.2020507@hotpy.org> Message-ID: <20C8EA74-683B-4E3B-AA53-1795CE0437AA@gmail.com> On Jul 18, 2012, at 3:30 AM, Nick Coghlan wrote: > - Eugene Toder's patch to add an AST optimisation step to the compiler > chain (http://bugs.python.org/issue11549) (I've asked Eugene about > this patch more recently and his current thought is that subsequent > improvements to the peephole optimisation have rendered it less > valuable. However, the patch is still a potentially useful resource > for anyone investigating bytecode optimisation ideas) +1 for furthering Eugene's patch. 
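As a rough illustration of the kind of optimisation an AST step enables (a toy sketch written against today's ast module API, not Eugene's actual patch), constant folding of numeric binary operations fits naturally into an ast.NodeTransformer:

```python
import ast
import operator

# Operators the toy folder knows how to evaluate at compile time.
_OPS = {ast.Add: operator.add, ast.Sub: operator.sub,
        ast.Mult: operator.mul, ast.Div: operator.truediv}

class ConstantFolder(ast.NodeTransformer):
    """Toy AST-level constant folder: collapses BinOps whose operands are
    numeric literals. A real pass would handle many more node types."""

    def visit_BinOp(self, node):
        self.generic_visit(node)  # fold children bottom-up first
        op = _OPS.get(type(node.op))
        if (op is not None
                and isinstance(node.left, ast.Constant)
                and isinstance(node.right, ast.Constant)
                and isinstance(node.left.value, (int, float))
                and isinstance(node.right.value, (int, float))):
            try:
                value = op(node.left.value, node.right.value)
            except ZeroDivisionError:
                return node  # leave the error to be raised at runtime
            return ast.copy_location(ast.Constant(value=value), node)
        return node

def fold(source):
    tree = ConstantFolder().visit(ast.parse(source))
    ast.fix_missing_locations(tree)
    return tree
```

Running fold("x = 2 * 3 + 4") rewrites the assignment to the single constant 10 before any bytecode exists, which is exactly the kind of work the peephole optimizer otherwise has to reverse-engineer out of bytecode.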
The AST is the correct place to do some of the optimizations currently done by the peepholer. Raymond From fuzzyman at voidspace.org.uk Sat Jul 21 12:45:21 2012 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Sat, 21 Jul 2012 11:45:21 +0100 Subject: [Python-Dev] A new JIT compiler for a faster CPython? In-Reply-To: References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> Message-ID: <166C01FF-BE24-4904-9E87-A3B5696399D9@voidspace.org.uk> On 20 Jul 2012, at 17:50, Maciej Fijalkowski wrote: > On Fri, Jul 20, 2012 at 2:55 PM, Michael Foord wrote: > > On 17 Jul 2012, at 23:04, martin at v.loewis.de wrote: > > >> [snip...] > > > >> I would like to use a JIT to generate specialized functions for a > >> combinaison of arguments types. > > > > I think history has moved past specializing JITs. Tracing JITs are the > > status quo; they provide specialization as a side effect. > > > > Mozilla implemented a method-JIT (compile whole methods) on top of their tracing JIT because a tracing JIT only optimises part of your code (only in loops and only if executed more times than the threshold) and there are further performance improvements to be had. So tracing JITs are not the *whole* of the state of the art. > > Michael > > I'm sorry michael but you're like a 100th person I have to explain this to. The pure reason that mozilla did not make a tracing JIT work does not mean the entire approach is horribly doomed as many people would like to assume. The reasons are multiple, but a lot of them are connected to poor engineering (for example the part inherited from adobe is notoriously bad, have a look if you want). Well, that isn't how they describe it. If it is the case, it's *still* interesting that rather than putting their efforts into improving the tracing JIT they put them into adding a method-JIT *as well*.
Also note that where I said "tracing JITs are not the whole of the state of the art" you somehow managed to translate this into "the entire approach is horribly doomed". That seems an ungenerous reading of what I wrote... :-) Michael > > Cheers, > fijal > > -- http://www.voidspace.org.uk/ May you do good and not evil May you find forgiveness for yourself and forgive others May you share freely, never taking more than you give. -- the sqlite blessing http://www.sqlite.org/different.html From solipsis at pitrou.net Sat Jul 21 12:56:29 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sat, 21 Jul 2012 12:56:29 +0200 Subject: [Python-Dev] A new JIT compiler for a faster CPython? References: <20120718000453.Horde.VBVcWaGZi1VQBeGFK6UzHFA@webmail.df.eu> <166C01FF-BE24-4904-9E87-A3B5696399D9@voidspace.org.uk> Message-ID: <20120721125629.1b4d47f0@pitrou.net> On Sat, 21 Jul 2012 11:45:21 +0100 Michael Foord wrote: > > On 20 Jul 2012, at 17:50, Maciej Fijalkowski wrote: > > > On Fri, Jul 20, 2012 at 2:55 PM, Michael Foord wrote: > > > > On 17 Jul 2012, at 23:04, martin at v.loewis.de wrote: > > > > >> [snip...] > > > > > >> I would like to use a JIT to generate specialized functions for a > > >> combinaison of arguments types. > > > > > > I think history has moved past specializing JITs. Tracing JITs are the > > > status quo; they provide specialization as a side effect. > > > > > > > Mozilla implemented a method-JIT (compile whole methods) on top of their tracing JIT because a tracing JIT only optimises part of your code (only in loops and only if executed more times than the threshold) and there are further performance improvements to be had. So tracing JITs are not the *whole* of the state of the art. > > > > Michael > > > > I'm sorry michael but you're like a 100th person I have to explain this to. The pure reason that mozilla did not make a tracing JIT work does not mean the entire approach is horribly doomed as many people would like to assume. 
The reasons are multiple, but a lot of them are connected to poor engineering (for example the part inherited from adobe is notoriously bad, have a look if you want). > > Well, that isn't how they describe it. If it is the case, it's *still* interesting > that rather than putting their efforts into improving the tracing JIT they put them > into adding a method-JIT *as well*. Honestly I'm not sure that's a very interesting discussion. First, JavaScript performance is not based on the same priorities as Python performance: for the former, startup time is key. Second, whether method-based or tracing-based, a well-written JIT would certainly bring significant performance improvements over a bytecode interpreter anyway. Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From brett at python.org Sat Jul 21 15:56:48 2012 From: brett at python.org (Brett Cannon) Date: Sat, 21 Jul 2012 09:56:48 -0400 Subject: [Python-Dev] cpython: Issue #15168: Move importlb.test to test.test_importlib. In-Reply-To: <20120720221226.3323fc33@pitrou.net> References: <3Wf1Mz3DGzzPPy@mail.python.org> <20120720221226.3323fc33@pitrou.net> Message-ID: On Fri, Jul 20, 2012 at 4:12 PM, Antoine Pitrou wrote: > On Fri, 20 Jul 2012 20:49:03 +0200 (CEST) > brett.cannon wrote: > > diff --git a/Lib/importlib/test/__init__.py > b/Lib/test/test_importlib/__init__.py > > rename from Lib/importlib/test/__init__.py > > rename to Lib/test/test_importlib/__init__.py > > --- a/Lib/importlib/test/__init__.py > > +++ b/Lib/test/test_importlib/__init__.py > > @@ -1,5 +1,6 @@ > > import os > > import sys > > +from .. import support > > I don't know if I'm only speaking for myself, but I really have trouble parsing non-trivial relative imports, I obviously don't, but then again I would hope I of all people could comprehend them after all this time. =) > and I personally prefer when > people use absolute imports (e.g. "from test import support").
> > So I changed the import above, but I'm not going to go through and change every other relative import that stays within test.test_importlib. I'm fine if someone else wants to change them, but I leave that as an exercise for the reader. -Brett > cheers > > Antoine. > > > -- > Software development and contracting: http://pro.pitrou.net > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > http://mail.python.org/mailman/options/python-dev/brett%40python.org From g.brandl at gmx.net Sat Jul 21 16:41:51 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Sat, 21 Jul 2012 16:41:51 +0200 Subject: [Python-Dev] cpython (3.2): ISsue #14988: restore Python 2's behavior of raising ImportError when unable to In-Reply-To: <3Wbzgn72ckzPJr@mail.python.org> References: <3Wbzgn72ckzPJr@mail.python.org> Message-ID: This looks like it will give an "error return without exception set" if the "if (expat_capi)" directly before the change is true, and expat_capi is then set to NULL in the branch.
Georg On 07/17/2012 01:25 PM, eli.bendersky wrote: > http://hg.python.org/cpython/rev/d896fd0a8ba7 > changeset: 78152:d896fd0a8ba7 > branch: 3.2 > parent: 78142:71537aba3a0a > user: Eli Bendersky > date: Tue Jul 17 14:20:38 2012 +0300 > summary: > ISsue #14988: restore Python 2's behavior of raising ImportError when unable to load pyexpat, instead of a SystemError/RuntimeError > > files: > Modules/_elementtree.c | 2 ++ > 1 files changed, 2 insertions(+), 0 deletions(-) > > > diff --git a/Modules/_elementtree.c b/Modules/_elementtree.c > --- a/Modules/_elementtree.c > +++ b/Modules/_elementtree.c > @@ -3117,6 +3117,8 @@ > expat_capi->MICRO_VERSION != XML_MICRO_VERSION) > expat_capi = NULL; > } > + if (!expat_capi) > + return NULL; > #endif > > elementtree_parseerror_obj = PyErr_NewException( > > > > _______________________________________________ > Python-checkins mailing list > Python-checkins at python.org > http://mail.python.org/mailman/listinfo/python-checkins > From g.brandl at gmx.net Sat Jul 21 16:43:02 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Sat, 21 Jul 2012 16:43:02 +0200 Subject: [Python-Dev] cpython: Optimize tostringlist by taking the stream class outside the function. It's now In-Reply-To: <3Wc0gJ62Z5zPBM@mail.python.org> References: <3Wc0gJ62Z5zPBM@mail.python.org> Message-ID: Uh, optimizations are not exactly what I want to see during feature freeze. Georg On 07/17/2012 02:10 PM, eli.bendersky wrote: > http://hg.python.org/cpython/rev/51978f89e5ed > changeset: 78156:51978f89e5ed > user: Eli Bendersky > date: Tue Jul 17 15:09:12 2012 +0300 > summary: > Optimize tostringlist by taking the stream class outside the function. It's now 2x faster on short calls. 
Related to #1767933 > > files: > Lib/xml/etree/ElementTree.py | 38 +++++++++++++---------- > 1 files changed, 22 insertions(+), 16 deletions(-) > > > diff --git a/Lib/xml/etree/ElementTree.py b/Lib/xml/etree/ElementTree.py > --- a/Lib/xml/etree/ElementTree.py > +++ b/Lib/xml/etree/ElementTree.py > @@ -1184,23 +1184,29 @@ > # @defreturn sequence > # @since 1.3 > > +class _ListDataStream(io.BufferedIOBase): > + """ An auxiliary stream accumulating into a list reference > + """ > + def __init__(self, lst): > + self.lst = lst > + > + def writable(self): > + return True > + > + def seekable(self): > + return True > + > + def write(self, b): > + self.lst.append(b) > + > + def tell(self): > + return len(self.lst) > + > def tostringlist(element, encoding=None, method=None): > - data = [] > - class DataStream(io.BufferedIOBase): > - def writable(self): > - return True > - > - def seekable(self): > - return True > - > - def write(self, b): > - data.append(b) > - > - def tell(self): > - return len(data) > - > - ElementTree(element).write(DataStream(), encoding, method=method) > - return data > + lst = [] > + stream = _ListDataStream(lst) > + ElementTree(element).write(stream, encoding, method=method) > + return lst > > ## > # Writes an element tree or element structure to sys.stdout. This > > > > _______________________________________________ > Python-checkins mailing list > Python-checkins at python.org > http://mail.python.org/mailman/listinfo/python-checkins > From eliben at gmail.com Sat Jul 21 19:11:52 2012 From: eliben at gmail.com (Eli Bendersky) Date: Sat, 21 Jul 2012 20:11:52 +0300 Subject: [Python-Dev] cpython: Optimize tostringlist by taking the stream class outside the function. It's now In-Reply-To: References: <3Wc0gJ62Z5zPBM@mail.python.org> Message-ID: On Sat, Jul 21, 2012 at 5:43 PM, Georg Brandl wrote: > Uh, optimizations are not exactly what I want to see during feature freeze. > This is not a new optimization. 
It fixes a performance regression introduced by an earlier bugfix changeset (for http://bugs.python.org/issue1767933). Eli From eliben at gmail.com Sat Jul 21 19:17:21 2012 From: eliben at gmail.com (Eli Bendersky) Date: Sat, 21 Jul 2012 20:17:21 +0300 Subject: [Python-Dev] cpython (3.2): ISsue #14988: restore Python 2's behavior of raising ImportError when unable to In-Reply-To: References: <3Wbzgn72ckzPJr@mail.python.org> Message-ID: On Sat, Jul 21, 2012 at 5:41 PM, Georg Brandl wrote: > This looks like it will give an "error return without exception set" > if the "if (expat_capi)" directly before the change is true, and > expat_capi is then set to NULL in the branch. > > Georg > Good catch! I'll fix it soon. Eli > On 07/17/2012 01:25 PM, eli.bendersky wrote: >> http://hg.python.org/cpython/rev/d896fd0a8ba7 >> changeset: 78152:d896fd0a8ba7 >> branch: 3.2 >> parent: 78142:71537aba3a0a >> user: Eli Bendersky >> date: Tue Jul 17 14:20:38 2012 +0300 >> summary: >> ISsue #14988: restore Python 2's behavior of raising ImportError when unable to load pyexpat, instead of a SystemError/RuntimeError >> >> files: >> Modules/_elementtree.c | 2 ++ >> 1 files changed, 2 insertions(+), 0 deletions(-) >> >> >> diff --git a/Modules/_elementtree.c b/Modules/_elementtree.c >> --- a/Modules/_elementtree.c >> +++ b/Modules/_elementtree.c >> @@ -3117,6 +3117,8 @@ >> expat_capi->MICRO_VERSION != XML_MICRO_VERSION) >> expat_capi = NULL; >> } >> + if (!expat_capi) >> + return NULL; >> #endif >> >> elementtree_parseerror_obj = PyErr_NewException( >> >> >> >> _______________________________________________ >> Python-checkins mailing list >> Python-checkins at python.org >> http://mail.python.org/mailman/listinfo/python-checkins >> > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/eliben%40gmail.com From chris.jerdonek 
at gmail.com Sat Jul 21 21:11:56 2012 From: chris.jerdonek at gmail.com (Chris Jerdonek) Date: Sat, 21 Jul 2012 12:11:56 -0700 Subject: [Python-Dev] four easy documentation patches Message-ID: I was wondering if someone could review the following minor documentation patches: Document that filecmp.dircmp comparisons are "shallow" (7/5/2012): http://bugs.python.org/issue15250 Document dircmp.left and dircmp.right (7/6/2012): http://bugs.python.org/issue15269 Update "PyPI package display" documentation re: rst2html.py (7/5/2012): http://bugs.python.org/issue15231 [Dev Guide] Update documentation around Dev Guide process (7/10/2012): http://bugs.python.org/issue15311 Thanks, --Chris From kristjan at ccpgames.com Sat Jul 21 23:29:50 2012 From: kristjan at ccpgames.com (Kristján Valur Jónsson) Date: Sat, 21 Jul 2012 21:29:50 +0000 Subject: [Python-Dev] test_hashlib Message-ID: I was hit by this today. In test_hashlib.py there is this: def test_unknown_hash(self): self.assertRaises(ValueError, hashlib.new, 'spam spam spam spam spam') self.assertRaises(TypeError, hashlib.new, 1) but in hashlib.py, there is this code: except ImportError: pass # no extension module, this hash is unsupported. raise ValueError('unsupported hash type %s' % name) The code will raise ValueError when int(1) is passed in, but the unittests expect a TypeError. So, which is correct? K From solipsis at pitrou.net Sat Jul 21 23:58:36 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sat, 21 Jul 2012 23:58:36 +0200 Subject: [Python-Dev] test_hashlib References: Message-ID: <20120721235836.7947cc54@pitrou.net> On Sat, 21 Jul 2012 21:29:50 +0000 Kristján Valur Jónsson wrote: > > The code will raise ValueError when int(1) is passed in, but the > unittests expect a TypeError. Well, if test_hashlib passes, surely your analysis is wrong, no?
-- Software development and contracting: http://pro.pitrou.net From amauryfa at gmail.com Sun Jul 22 00:56:55 2012 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Sun, 22 Jul 2012 00:56:55 +0200 Subject: [Python-Dev] test_hashlib In-Reply-To: <20120721235836.7947cc54@pitrou.net> References: <20120721235836.7947cc54@pitrou.net> Message-ID: 2012/7/21 Antoine Pitrou : > Kristján Valur Jónsson wrote: >> >> The code will raise ValueError when int(1) is passed in, but the >> unittests expect a TypeError. > > Well, if test_hashlib passes, surely your analysis is wrong, no? In the normal case, yes: >>> import hashlib >>> hashlib.new(1) TypeError: name must be a string But if the _hashlib extension module is not available, the python version is used and ValueError is raised: >>> import sys >>> sys.modules['_hashlib'] = None >>> import hashlib >>> hashlib.new(1) ValueError: unsupported hash type 1 -- Amaury Forgeot d'Arc From guido at python.org Sun Jul 22 02:08:01 2012 From: guido at python.org (Guido van Rossum) Date: Sat, 21 Jul 2012 17:08:01 -0700 Subject: [Python-Dev] test_hashlib In-Reply-To: References: <20120721235836.7947cc54@pitrou.net> Message-ID: I think I see Kristján's point: the pure Python implementation handles errors differently than the C implementation, so the unittest fails if the pure Python version is enabled. I imagine this is a general problem that often occurs when a pure Python version is normally shadowed by a C extension, unless the unittest is rigged so that it tests the pure Python version as well as the C version. But it still remains a matter of judgment whether in a particular case the unittest is overspecified or the Python version is incorrect. I think that in this case Kristján's hunch is correct, and the pure Python version needs to be adjusted to pass the test. I also think this is a low priority issue since it only affects behavior of error cases.
--Guido On Sat, Jul 21, 2012 at 3:56 PM, Amaury Forgeot d'Arc wrote: > 2012/7/21 Antoine Pitrou : >> Kristj?n Valur J?nsson wrote: >>> >>> The code will raise ValueError when int(1) is passed in, but the >>> unittests expect a TypeError. >> >> Well, if test_hashlib passes, surely your analysis is wrong, no? > > In the normal case, yes: > >>>> import hashlib >>>> hashlib.new(1) > TypeError: name must be a string > > But if the _hashlib extension module is not available, the python > version is used and ValueError is raised: > >>>> import sys >>>> sys.modules['_hashlib'] = None >>>> import hashlib >>>> hashlib.new(1) > ValueError: unsupported hash type 1 > > -- > Amaury Forgeot d'Arc > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/guido%40python.org -- --Guido van Rossum (python.org/~guido) From andrew.svetlov at gmail.com Sun Jul 22 02:12:00 2012 From: andrew.svetlov at gmail.com (Andrew Svetlov) Date: Sun, 22 Jul 2012 03:12:00 +0300 Subject: [Python-Dev] venv scripts for fish and csh shells In-Reply-To: <50083671.6050903@oddbird.net> References: <50083671.6050903@oddbird.net> Message-ID: I created http://bugs.python.org/issue15417 with required activation scripts. If there are no objections I like to commit it after, say, three days. On Thu, Jul 19, 2012 at 7:31 PM, Carl Meyer wrote: > On 07/19/2012 10:26 AM, Andrew Svetlov wrote: >> virtualenv has virtualenv.csh and virtualenv.fish files. >> Is there any reason for restricting venv to bash/zsh only? > > No. As far as I'm concerned, a patch to port the virtualenv csh and fish > activate scripts to pyvenv would be welcome (though I can't commit said > patch, so it might be good to hear if Vinay has a different opinion). 
> > Carl > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/andrew.svetlov%40gmail.com -- Thanks, Andrew Svetlov From kristjan at ccpgames.com Sun Jul 22 03:05:35 2012 From: kristjan at ccpgames.com (=?iso-8859-1?Q?Kristj=E1n_Valur_J=F3nsson?=) Date: Sun, 22 Jul 2012 01:05:35 +0000 Subject: [Python-Dev] test_hashlib In-Reply-To: References: <20120721235836.7947cc54@pitrou.net>, Message-ID: Indeed, shame on me for not mentioning this. I rarely have the full complement of externals available when I'm doing python work, and it struck me that this unittest was failing. I suppose it should be possible to write unittests that test more than one particular implementation. K ________________________________________ From: python-dev-bounces+kristjan=ccpgames.com at python.org [python-dev-bounces+kristjan=ccpgames.com at python.org] on behalf of Amaury Forgeot d'Arc [amauryfa at gmail.com] Sent: 21 July 2012 22:56 To: Antoine Pitrou Cc: python-dev at python.org Subject: Re: [Python-Dev] test_hashlib
In the normal case, yes: >>> import hashlib >>> hashlib.new(1) TypeError: name must be a string But if the _hashlib extension module is not available, the python version is used and ValueError is raised: >>> import sys >>> sys.modules['_hashlib'] = None >>> import hashlib >>> hashlib.new(1) ValueError: unsupported hash type 1 -- Amaury Forgeot d'Arc _______________________________________________ Python-Dev mailing list Python-Dev at python.org http://mail.python.org/mailman/listinfo/python-dev Unsubscribe: http://mail.python.org/mailman/options/python-dev/kristjan%40ccpgames.com From rdmurray at bitdance.com Sun Jul 22 04:10:55 2012 From: rdmurray at bitdance.com (R. David Murray) Date: Sat, 21 Jul 2012 22:10:55 -0400 Subject: [Python-Dev] test_hashlib In-Reply-To: References: <20120721235836.7947cc54@pitrou.net>, Message-ID: <20120722021056.256732502C2@webabinitio.net> On Sun, 22 Jul 2012 01:05:35 -0000, =?iso-8859-1?Q?Kristj=E1n_Valur_J=F3nsson?= wrote: > Indeed, shame on me for not mentioning this. > I rarely have the full complement of externals available when I'm doing python work, and it struck me that this unitest was failing. > I suppose it should be possible to write unittests that test more than one particular implementation. It is indeed, and we have been moving the unit tests to do exactly that for the cases where there is a Python implementation as well as a C implementation. It seems like it would be appropriate to open an issue for doing that for hashlib, in addition to one for fixing this particular issue with the Python version. --David From greg at krypto.org Sun Jul 22 06:25:18 2012 From: greg at krypto.org (Gregory P. Smith) Date: Sat, 21 Jul 2012 21:25:18 -0700 Subject: [Python-Dev] test_hashlib In-Reply-To: <20120722021056.256732502C2@webabinitio.net> References: <20120721235836.7947cc54@pitrou.net> <20120722021056.256732502C2@webabinitio.net> Message-ID: Fixed. 
The TypeError in this nonsense never gonna work use case is now consistent in 2.7, 3.2 and 3.3. On Sat, Jul 21, 2012 at 7:10 PM, R. David Murray wrote: > On Sun, 22 Jul 2012 01:05:35 -0000, > =?iso-8859-1?Q?Kristj=E1n_Valur_J=F3nsson?= wrote: > > Indeed, shame on me for not mentioning this. > > I rarely have the full complement of externals available when I'm doing > python work, and it struck me that this unitest was failing. > > I suppose it should be possible to write unittests that test more than > one particular implementation. > > It is indeed, and we have been moving the unit tests to do exactly > that for the cases where there is a Python implementation as well as > a C implementation. It seems like it would be appropriate to open an > issue for doing that for hashlib, in addition to one for fixing this > particular issue with the Python version. > > --David > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > http://mail.python.org/mailman/options/python-dev/greg%40krypto.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From techtonik at gmail.com Sun Jul 22 10:21:38 2012 From: techtonik at gmail.com (anatoly techtonik) Date: Sun, 22 Jul 2012 11:21:38 +0300 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question Message-ID: http://docs.python.org/py3k/howto/pyporting.html#during-installation What's the point in making implicit Python 3 check here: try: # Python 3 from distutils.command.build_py import build_py_2to3 as build_py except ImportError: # Python 2 from distutils.command.build_py import build_py instead of explicit check like: import sys if sys.version_info[0] >= 3: from distutils.command.build_py import build_py_2to3 as build_py ? -- anatoly t. 
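The trade-off in anatoly's question can be made concrete without touching distutils at runtime. Below is an illustrative sketch only: the `choose_build_command` helper and its `available` parameter are hypothetical stand-ins for the real imports (a real setup.py would simply attempt the import), used here to contrast the two selection styles side by side:

```python
import sys

def choose_build_command(available, version_info=sys.version_info):
    """Contrast the two selection styles from the porting HOWTO snippet.

    `available` is a hypothetical stand-in for the set of command names
    importable from distutils.command.build_py on a given interpreter.
    """
    # Style 1: feature detection ("test the thing that actually matters"):
    # use the 2to3-aware command whenever it can be imported.
    by_feature = "build_py_2to3" if "build_py_2to3" in available else "build_py"

    # Style 2: explicit version check: decide purely from the version of
    # the interpreter the package is being installed into.
    by_version = "build_py_2to3" if version_info >= (3,) else "build_py"

    return by_feature, by_version

# Under a normal Python 3 both names are importable and the styles agree:
print(choose_build_command({"build_py", "build_py_2to3"}, (3, 2)))

# Oscar's scenario: if build_py_2to3 were ever importable under a 2.x
# interpreter, the styles would disagree, and only the version check
# avoids running 2to3 at install time:
print(choose_build_command({"build_py", "build_py_2to3"}, (2, 7)))
```

Whichever style is used, the selected class is then passed to `setup()` via `cmdclass={'build_py': build_py}`.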
From techtonik at gmail.com Sun Jul 22 12:18:35 2012 From: techtonik at gmail.com (anatoly techtonik) Date: Sun, 22 Jul 2012 13:18:35 +0300 Subject: [Python-Dev] Print policy for deprecated modules Message-ID: What is a print policy for deprecated modules? "new" module is deprecated in 2.6, but 2.7.3 doesn't print any warnings. Is it a bug? python -Wd -c "import new" -- anatoly t. From rdmurray at bitdance.com Sun Jul 22 15:08:48 2012 From: rdmurray at bitdance.com (R. David Murray) Date: Sun, 22 Jul 2012 09:08:48 -0400 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: References: Message-ID: <20120722130849.A2968250064@webabinitio.net> On Sun, 22 Jul 2012 11:21:38 +0300, anatoly techtonik wrote: > http://docs.python.org/py3k/howto/pyporting.html#during-installation > > What's the point in making implicit Python 3 check here: > try: # Python 3 > from distutils.command.build_py import build_py_2to3 as build_py > except ImportError: # Python 2 > from distutils.command.build_py import build_py > > instead of explicit check like: > import sys > if sys.version_info[0] >= 3: > from distutils.command.build_py import build_py_2to3 as build_py It's called testing for the thing that actually matters, rather than testing a constant with a much broader meaning. Yes, in this case the results are the same, but IMO it is better programming practice to test the thing that actually matters when you can. --David From g.brandl at gmx.net Sun Jul 22 17:21:54 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Sun, 22 Jul 2012 17:21:54 +0200 Subject: [Python-Dev] venv scripts for fish and csh shells In-Reply-To: References: <50083671.6050903@oddbird.net> Message-ID: These are new features, please do not commit before 3.3 is released. Georg On 07/22/2012 02:12 AM, Andrew Svetlov wrote: > I created http://bugs.python.org/issue15417 with required activation scripts. > If there are no objections I like to commit it after, say, three days. 
> > On Thu, Jul 19, 2012 at 7:31 PM, Carl Meyer wrote: >> On 07/19/2012 10:26 AM, Andrew Svetlov wrote: >>> virtualenv has virtualenv.csh and virtualenv.fish files. >>> Is there any reason for restricting venv to bash/zsh only? >> >> No. As far as I'm concerned, a patch to port the virtualenv csh and fish >> activate scripts to pyvenv would be welcome (though I can't commit said >> patch, so it might be good to hear if Vinay has a different opinion). >> >> Carl >> >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> http://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: http://mail.python.org/mailman/options/python-dev/andrew.svetlov%40gmail.com From andrew.svetlov at gmail.com Sun Jul 22 18:10:59 2012 From: andrew.svetlov at gmail.com (Andrew Svetlov) Date: Sun, 22 Jul 2012 19:10:59 +0300 Subject: [Python-Dev] venv scripts for fish and csh shells In-Reply-To: References: <50083671.6050903@oddbird.net> Message-ID: Georg, sorry, I've committed it before you answered. If you want, I will revert it. But, please, explain to me why those scripts are too late for 3.3? I strongly checked the correctness of both activation scripts for fish and csh/tcsh shells. Works fine. It's not a *new feature*, it is just extending support to csh and fish shells. Virtualenv (the base for venv) already contains these activation scripts. I just converted those files to venv, replacing virtualenv-specific environment variables with the scheme supported by venv. I see nothing bad in adding venv support for fish and csh as well as the already supported bash/zsh. Again, I triple-checked the correctness of the added scripts, with VIRTUAL_ENV_DISABLE_PROMPT defined as well as without it. My addition is safe, trivial, and correct; it is not a new feature - just a fix for missing support for widely used shells. Anyway. You are the Release Manager - the main person for the 3.3 release.
If you feel that my changes are potentially dangerous and should be removed from 3.3 - I'll do it. But, please, be benevolent. On Sun, Jul 22, 2012 at 6:21 PM, Georg Brandl wrote: > These are new features, please do not commit before 3.3 is released. > > Georg > > On 07/22/2012 02:12 AM, Andrew Svetlov wrote: >> I created http://bugs.python.org/issue15417 with required activation scripts. >> If there are no objections I like to commit it after, say, three days. >> >> On Thu, Jul 19, 2012 at 7:31 PM, Carl Meyer wrote: >>> On 07/19/2012 10:26 AM, Andrew Svetlov wrote: >>>> virtualenv has virtualenv.csh and virtualenv.fish files. >>>> Is there any reason for restricting venv to bash/zsh only? >>> >>> No. As far as I'm concerned, a patch to port the virtualenv csh and fish >>> activate scripts to pyvenv would be welcome (though I can't commit said >>> patch, so it might be good to hear if Vinay has a different opinion). >>> >>> Carl >>> >>> _______________________________________________ >>> Python-Dev mailing list >>> Python-Dev at python.org >>> http://mail.python.org/mailman/listinfo/python-dev >>> Unsubscribe: http://mail.python.org/mailman/options/python-dev/andrew.svetlov%40gmail.com >> >> >> > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/andrew.svetlov%40gmail.com -- Thanks, Andrew Svetlov From g.brandl at gmx.net Sun Jul 22 18:50:24 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Sun, 22 Jul 2012 18:50:24 +0200 Subject: [Python-Dev] venv scripts for fish and csh shells In-Reply-To: References: <50083671.6050903@oddbird.net> Message-ID: On 07/22/2012 06:10 PM, Andrew Svetlov wrote: > Georg, sorry, I've committed it before you answered. After you wrote that you'd wait three days... anyway, I can't find the commit in the repository, so that's fine. > If you want, I will revert it.
> But, please, explain to me why those scripts are too late for 3.3? > I strongly checked the correctness of both activation scripts for fish and > csh/tcsh shells. > Works fine. "Works fine" is not good enough. There are hundreds of potential patches on the tracker implementing new features of which the same could be said. > It's not a *new feature*, it is just extending support to csh and fish shells. That is a new feature in my view. If it was an important feature, someone would have thought of it while features were allowed. > Virtualenv (the base for venv) already contains these activation scripts. > I just converted those files to venv, replacing virtualenv > specific environment variables with the scheme supported by venv. > I see nothing bad in adding venv support for fish and csh as well > as the already supported bash/zsh. > > Again, I triple-checked the correctness of the added scripts, with > VIRTUAL_ENV_DISABLE_PROMPT defined as well as without it. > My addition is safe, trivial, and correct; it is not a new feature - just > a fix for missing support for widely used shells. > > Anyway. > You are the Release Manager - the main person for the 3.3 release. > If you feel that my changes are potentially dangerous and should be > removed from 3.3 - I'll do it. > But, please, be benevolent. The point of feature freeze is that there are no new features. Where can I draw the line, when everybody argues like that and I agree? Georg From andrew.svetlov at gmail.com Sun Jul 22 19:39:15 2012 From: andrew.svetlov at gmail.com (Andrew Svetlov) Date: Sun, 22 Jul 2012 20:39:15 +0300 Subject: [Python-Dev] venv scripts for fish and csh shells In-Reply-To: References: <50083671.6050903@oddbird.net> Message-ID: Ok. Sorry for my mistake - there are really no commits for http://bugs.python.org/issue15417 It looks important to me - but you are the release manager. If you consider the patch as potentially dangerous - I have to agree with you.
You are the master :) On Sun, Jul 22, 2012 at 7:50 PM, Georg Brandl wrote: > On 07/22/2012 06:10 PM, Andrew Svetlov wrote: >> Georg, sorry, I've committed it before you answered. > > After you wrote that you'd wait three days... anyway, I can't find the > commit in the repository, so that's fine. > >> If you want, I will revert it. >> But, please, explain to me why those scripts are too late for 3.3? >> I strongly checked the correctness of both activation scripts for fish and >> csh/tcsh shells. >> Works fine. > > "Works fine" is not good enough. There are hundreds of potential patches > on the tracker implementing new features of which the same could be said. > >> It's not a *new feature*, it is just extending support to csh and fish shells. > > That is a new feature in my view. If it was an important feature, someone > would have thought of it while features were allowed. > >> Virtualenv (the base for venv) already contains these activation scripts. >> I just converted those files to venv, replacing virtualenv >> specific environment variables with the scheme supported by venv. >> I see nothing bad in adding venv support for fish and csh as well >> as the already supported bash/zsh. >> >> Again, I triple-checked the correctness of the added scripts, with >> VIRTUAL_ENV_DISABLE_PROMPT defined as well as without it. >> My addition is safe, trivial, and correct; it is not a new feature - just >> a fix for missing support for widely used shells. >> >> Anyway. >> You are the Release Manager - the main person for the 3.3 release. >> If you feel that my changes are potentially dangerous and should be >> removed from 3.3 - I'll do it. >> But, please, be benevolent. > > The point of feature freeze is that there are no new features. Where can I draw the line, when everybody argues like that and I agree?
> > Georg > > _______________________________________________ > > Python-Dev mailing list > > Python-Dev at python.org > > http://mail.python.org/mailman/listinfo/python-dev > > Unsubscribe: http://mail.python.org/mailman/options/python-dev/andrew.svetlov%40gmail.com -- Thanks, Andrew Svetlov From solipsis at pitrou.net Sun Jul 22 19:54:40 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 22 Jul 2012 19:54:40 +0200 Subject: [Python-Dev] venv scripts for fish and csh shells References: <50083671.6050903@oddbird.net> Message-ID: <20120722195440.2cfc10a2@pitrou.net> On Sun, 22 Jul 2012 20:39:15 +0300 Andrew Svetlov wrote: > Ok. > Sorry for my mistake - there are really no commits for > http://bugs.python.org/issue15417 > It looks important to me - but you are the release manager. > If you consider the patch as potentially dangerous - I have to agree with you. > You are the master :) This is not because Georg is the master. When a release is nearing we think it is important to avoid introducing potential regressions, except when fixing existing bugs. That's why we have a feature freeze which extends to many kinds of "enhancements", including performance improvements: really, it is more of a "bugfix-only period". One could propose other mechanisms for release preparation, but in the meantime, it is important as a community that we all follow similar rules. Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From oscar.benjamin at bristol.ac.uk Sun Jul 22 21:23:42 2012 From: oscar.benjamin at bristol.ac.uk (Oscar Benjamin) Date: Sun, 22 Jul 2012 20:23:42 +0100 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: References: <20120722130849.A2968250064@webabinitio.net> Message-ID: On 22 July 2012 14:08, R.
David Murray wrote: > On Sun, 22 Jul 2012 11:21:38 +0300, anatoly techtonik > wrote: > > http://docs.python.org/py3k/howto/pyporting.html#during-installation > > > > What's the point in making implicit Python 3 check here: > > try: # Python 3 > > from distutils.command.build_py import build_py_2to3 as build_py > > except ImportError: # Python 2 > > from distutils.command.build_py import build_py > > > > instead of explicit check like: > > import sys > > if sys.version_info[0] >= 3: > > from distutils.command.build_py import build_py_2to3 as build_py > > It's called testing for the thing that actually matters, rather than > testing a constant with a much broader meaning. Yes, in this case the > results are the same, but IMO it is better programming practice to test > the thing that actually matters when you can. I recently changed a setup.py from try/ImportError to an explicit sys.version_info check. I'm not totally sure how to reproduce this but I had a problem where I was installing into a 2.x virtualenv and it was running 2to3 during install and subsequently failing to import the 3.x code (the problem didn't occur when using the same python that generated the virtualenv). I may be wrong but I imagined that sometimes build_py_2to3 is importable on 2.x, perhaps for cross-building or something. In any case 'testing the thing that matters' means testing what version of Python you are about to install into not whether the python version supports running 2to3. Cheers, Oscar. -------------- next part -------------- An HTML attachment was scrubbed... URL: From rdmurray at bitdance.com Sun Jul 22 21:57:10 2012 From: rdmurray at bitdance.com (R. David Murray) Date: Sun, 22 Jul 2012 15:57:10 -0400 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: References: <20120722130849.A2968250064@webabinitio.net> Message-ID: <20120722195711.16863250054@webabinitio.net> Benjamin sent me this message separately(*) privately and I responded privately. 
Here is my response. (*) or his mailer did On Sun, 22 Jul 2012 20:22:50 +0100, Oscar Benjamin wrote: > On 22 July 2012 14:08, R. David Murray wrote: > > > On Sun, 22 Jul 2012 11:21:38 +0300, anatoly techtonik > > wrote: > > > http://docs.python.org/py3k/howto/pyporting.html#during-installation > > > > > > What's the point in making implicit Python 3 check here: > > > try: # Python 3 > > > from distutils.command.build_py import build_py_2to3 as build_py > > > except ImportError: # Python 2 > > > from distutils.command.build_py import build_py > > > > > > instead of explicit check like: > > > import sys > > > if sys.version_info[0] >= 3: > > > from distutils.command.build_py import build_py_2to3 as build_py > > > > It's called testing for the thing that actually matters, rather than > > testing a constant with a much broader meaning. Yes, in this case the > > results are the same, but IMO it is better programming practice to test > > the thing that actually matters when you can. > > > I recently changed a setup.py from try/ImportError to an explicit > sys.version_info check. I'm not totally sure how to reproduce this but I > had a problem where I was installing into a 2.x virtualenv and it was > running 2to3 during install and subsequently failing to import the 3.x code > (the problem didn't occur when using the same python that generated the > virtualenv). > > I may be wrong but I imagined that sometimes build_py_2to3 is importable on > 2.x, perhaps for cross-building or something. In any case 'testing the > thing that matters' means testing what version of Python you are about to > install into not whether the python version supports running 2to3. I'm not familiar with distutils, really, so you could be right about what it is important to test. I was commenting based on the code snippet presented, which just deciding which "build" object to use. If build_py_2to3 can be imported by python2 and subsequently screws up the build, then yes the logic is incorrect. 
But I have to defer to the packaging people on that. (I wish I had time to help with packaging because it is important, but it doesn't seem like a sensible place for me personally to spend my currently available time.) --David From ezio.melotti at gmail.com Sun Jul 22 23:06:42 2012 From: ezio.melotti at gmail.com (Ezio Melotti) Date: Sun, 22 Jul 2012 23:06:42 +0200 Subject: [Python-Dev] Print policy for deprecated modules In-Reply-To: References: Message-ID: On Sun, Jul 22, 2012 at 12:18 PM, anatoly techtonik wrote: > What is a print policy for deprecated modules? "new" module is > deprecated in 2.6, but 2.7.3 doesn't print any warnings. Is it a bug? > python -Wd -c "import new" In theory this should show a warning, but for some reason it doesn't. Reading the messages on http://bugs.python.org/issue1247765 it seems that there wasn't a clear consensus about the deprecation schedule, so that might be the reason. If the warning is missing just because no one remembered to add it, I guess it can still be fixed on 2.7, but for 2.6 is too late now. FWIW you get a warning if you use the -3 flag: $ python -Wd -3 -c "import new" -c:1: DeprecationWarning: The 'new' module has been removed in Python 3.0; use the 'types' module instead. From oscar.j.benjamin at gmail.com Sun Jul 22 23:21:04 2012 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Sun, 22 Jul 2012 22:21:04 +0100 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: <20120722195711.16863250054@webabinitio.net> References: <20120722130849.A2968250064@webabinitio.net> <20120722195711.16863250054@webabinitio.net> Message-ID: On 22 July 2012 20:57, R. David Murray wrote: > Benjamin sent me this message separately(*) privately and I responded > privately. Here is my response. > (*) or his mailer did > I think I accidentally replied from my work email address (which is not subscribed to python-dev) and so the second copy I sent to python-dev wasn't posted. 
> > On Sun, 22 Jul 2012 20:22:50 +0100, Oscar Benjamin > wrote: > > On 22 July 2012 14:08, R. David Murray wrote: > > > > > On Sun, 22 Jul 2012 11:21:38 +0300, anatoly techtonik < > techtonik at gmail.com> > > > wrote: > > > > http://docs.python.org/py3k/howto/pyporting.html#during-installation > > > > > > > > What's the point in making implicit Python 3 check here: > > > > try: # Python 3 > > > > from distutils.command.build_py import build_py_2to3 as build_py > > > > except ImportError: # Python 2 > > > > from distutils.command.build_py import build_py > > > > > > > > instead of explicit check like: > > > > import sys > > > > if sys.version_info[0] >= 3: > > > > from distutils.command.build_py import build_py_2to3 as build_py > > > > > > It's called testing for the thing that actually matters, rather than > > > testing a constant with a much broader meaning. Yes, in this case the > > > results are the same, but IMO it is better programming practice to test > > > the thing that actually matters when you can. > > > > > > I recently changed a setup.py from try/ImportError to an explicit > > sys.version_info check. I'm not totally sure how to reproduce this but I > > had a problem where I was installing into a 2.x virtualenv and it was > > running 2to3 during install and subsequently failing to import the 3.x > code > > (the problem didn't occur when using the same python that generated the > > virtualenv). > > > > I may be wrong but I imagined that sometimes build_py_2to3 is importable > on > > 2.x, perhaps for cross-building or something. In any case 'testing the > > thing that matters' means testing what version of Python you are about to > > install into not whether the python version supports running 2to3. > > I'm not familiar with distutils, really, so you could be right about > what it is important to test. I was commenting based on the code > snippet presented, which just deciding which "build" object to use. 
> If build_py_2to3 can be imported by python2 and subsequently screws up > the build, then yes the logic is incorrect. But I have to defer to the > packaging people on that. (I wish I had time to help with packaging > because it is important, but it doesn't seem like a sensible place for > me personally to spend my currently available time.) > I'm not currently able to reproduce the problem on this machine. I think I was using pip/easy_install to install distribution X from PyPI that depended on distribution Y also on PyPI into an isolated 2.x virtualenv and found that distribution Y was converted with 2to3 when it was automatically installed. It could be a bug but I'm not confident enough with virtualenv to say that it wasn't just me messing things up somehow. Either way I still think that in this particular case a version check is the most explicit and appropriate thing to do. The author of a distribution that is distributed as Python 2.x code and installed on Python 3.x using 2to3 knows precisely when they want 2to3 to run and when they don't so why not make that explicit? As an aside, I find the check slightly easier to read if it is written like: if sys.version_info >= (3, 0): from distutils.build_py import build_py_2to3 as build_py Cheers, Oscar. -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Mon Jul 23 00:38:48 2012 From: guido at python.org (Guido van Rossum) Date: Sun, 22 Jul 2012 15:38:48 -0700 Subject: [Python-Dev] Print policy for deprecated modules In-Reply-To: References: Message-ID: On Sun, Jul 22, 2012 at 2:06 PM, Ezio Melotti wrote: > On Sun, Jul 22, 2012 at 12:18 PM, anatoly techtonik wrote: >> What is a print policy for deprecated modules? "new" module is >> deprecated in 2.6, but 2.7.3 doesn't print any warnings. Is it a bug? >> python -Wd -c "import new" > > In theory this should show a warning, but for some reason it doesn't. 
> Reading the messages on http://bugs.python.org/issue1247765 it seems > that there wasn't a clear consensus about the deprecation schedule, so > that might be the reason. > If the warning is missing just because no one remembered to add it, I > guess it can still be fixed on 2.7, but for 2.6 is too late now. > > FWIW you get a warning if you use the -3 flag: > $ python -Wd -3 -c "import new" > -c:1: DeprecationWarning: The 'new' module has been removed in > Python 3.0; use the 'types' module instead. IIRC we decided not to add new deprecations to 2.7, since the deprecation would only affect the Python 3 line anyway -- nothing that exists in Python 2 is going away in Python 2... -- --Guido van Rossum (python.org/~guido) From meadori at gmail.com Mon Jul 23 07:34:02 2012 From: meadori at gmail.com (Meador Inge) Date: Mon, 23 Jul 2012 00:34:02 -0500 Subject: [Python-Dev] [Python-checkins] cpython (merge 3.2 -> default): Merge #15232: correctly mangle From lines in MIME preamble and epilogue In-Reply-To: <3WgQlD5GcszPGf@mail.python.org> References: <3WgQlD5GcszPGf@mail.python.org> Message-ID: On Sun, Jul 22, 2012 at 8:55 PM, r.david.murray wrote: > http://hg.python.org/cpython/rev/80b81658455b > changeset: 78246:80b81658455b > parent: 78244:c43d73277756 > parent: 78245:b97f65f2298d > user: R David Murray > date: Sun Jul 22 21:53:54 2012 -0400 > summary: > Merge #15232: correctly mangle From lines in MIME preamble and epilogue > > files: > Lib/email/generator.py | 12 ++++++++- > Lib/test/test_email/test_email.py | 22 +++++++++++++++++++ > Misc/NEWS | 3 ++ > 3 files changed, 35 insertions(+), 2 deletions(-) I'm not quite sure what happened, but something seems to have gone wrong with this merge. After doing the following while on the "default" branch: $ hg merge 3.2 I see: $ hg st M Lib/email/generator.py M Lib/test/test_email/test_email.py M Misc/NEWS and a bunch of conflicts in 'Misc/NEWS'. 
> > diff --git a/Lib/email/generator.py b/Lib/email/generator.py > --- a/Lib/email/generator.py > +++ b/Lib/email/generator.py > @@ -252,7 +252,11 @@ > msg.set_boundary(boundary) > # If there's a preamble, write it out, with a trailing CRLF > if msg.preamble is not None: > - self.write(msg.preamble + self._NL) > + if self._mangle_from_: > + preamble = fcre.sub('>From ', msg.preamble) > + else: > + preamble = msg.preamble > + self.write(preamble + self._NL) > # dash-boundary transport-padding CRLF > self.write('--' + boundary + self._NL) > # body-part > @@ -270,7 +274,11 @@ > self.write(self._NL + '--' + boundary + '--') > if msg.epilogue is not None: > self.write(self._NL) > - self.write(msg.epilogue) > + if self._mangle_from_: > + epilogue = fcre.sub('>From ', msg.epilogue) > + else: > + epilogue = msg.epilogue > + self.write(epilogue) > > def _handle_multipart_signed(self, msg): > # The contents of signed parts has to stay unmodified in order to keep > diff --git a/Lib/test/test_email/test_email.py b/Lib/test/test_email/test_email.py > --- a/Lib/test/test_email/test_email.py > +++ b/Lib/test/test_email/test_email.py > @@ -1283,6 +1283,28 @@ > Blah blah blah > """) > > + def test_mangle_from_in_preamble_and_epilog(self): > + s = StringIO() > + g = Generator(s, mangle_from_=True) > + msg = email.message_from_string(textwrap.dedent("""\ > + From: foo at bar.com > + Mime-Version: 1.0 > + Content-Type: multipart/mixed; boundary=XXX > + > + From somewhere unknown > + > + --XXX > + Content-Type: text/plain > + > + foo > + > + --XXX-- > + > + From somewhere unknowable > + """)) > + g.flatten(msg) > + self.assertEqual(len([1 for x in s.getvalue().split('\n') > + if x.startswith('>From ')]), 2) > > > # Test the basic MIMEAudio class > diff --git a/Misc/NEWS b/Misc/NEWS > --- a/Misc/NEWS > +++ b/Misc/NEWS > @@ -52,6 +52,9 @@ > Library > ------- > > +- Issue #15232: when mangle_from is True, email.Generator now correctly mangles > + lines that start with 'From' that occur in 
a MIME preamble or epilogue. + > - Issue #15094: Incorrectly placed #endif in _tkinter.c. > Patch by Serhiy Storchaka. > > > -- > Repository URL: http://hg.python.org/cpython > > _______________________________________________ > Python-checkins mailing list > Python-checkins at python.org > http://mail.python.org/mailman/listinfo/python-checkins > -- # Meador From stefan at epy.co.at Mon Jul 23 09:09:52 2012 From: stefan at epy.co.at (Stefan H. Holek) Date: Mon, 23 Jul 2012 09:09:52 +0200 Subject: [Python-Dev] Why no venv in existing directory? In-Reply-To: <50083843.2040105@oddbird.net> References: <26A4F829-8091-4CA4-A4F5-A18B5813C1CC@epy.co.at> <50083843.2040105@oddbird.net> Message-ID: Hi Carl, On 19.07.2012, at 18:39, Carl Meyer wrote: > Hi Stefan, > > On 07/19/2012 06:28 AM, Stefan H. Holek wrote: >> While trying 3.3 beta I found that I cannot use my favorite >> virtualenv pattern with pyvenv: > > I'd have no problem with lifting the restriction. > > I don't recall any clear rationale; I think it was probably just the > simplest implementation initially, and no one ever raised it as an issue > in the PEP process. Thanks for your reply. The feature certainly is on *my* wish-list but I might be alone here. ;-) Stefan -- Stefan H. Holek stefan at epy.co.at From lukasz at langa.pl Mon Jul 23 09:52:23 2012 From: lukasz at langa.pl (Łukasz Langa) Date: Mon, 23 Jul 2012 09:52:23 +0200 Subject: [Python-Dev] Why no venv in existing directory? In-Reply-To: References: <26A4F829-8091-4CA4-A4F5-A18B5813C1CC@epy.co.at> <50083843.2040105@oddbird.net> Message-ID: <1979E575-41E7-4DCD-A0A7-D3E03EDCA310@langa.pl> On 23.07.2012, at 09:09, Stefan H. Holek wrote: > Hi Carl, > > On 19.07.2012, at 18:39, Carl Meyer wrote: > >> Hi Stefan, >> >> On 07/19/2012 06:28 AM, Stefan H. 
Holek wrote: >>> While trying 3.3 beta I found that I cannot use my favorite >>> virtualenv pattern with pyvenv: >> >> I'd have no problem with lifting the restriction. >> >> I don't recall any clear rationale; I think it was probably just the >> simplest implementation initially, and no one ever raised it as an issue >> in the PEP process. > > Thanks for your reply. The feature certainly is on *my* wish-list but I might be alone here. ;-) You are not. -- Best regards, Łukasz Langa Senior Systems Architecture Engineer IT Infrastructure Department Grupa Allegro Sp. z o.o. http://lukasz.langa.pl/ +48 791 080 144 -------------- next part -------------- An HTML attachment was scrubbed... URL: From meadori at gmail.com Mon Jul 23 16:17:33 2012 From: meadori at gmail.com (Meador Inge) Date: Mon, 23 Jul 2012 09:17:33 -0500 Subject: [Python-Dev] [Python-checkins] cpython (merge 3.2 -> default): Merge #15232: correctly mangle From lines in MIME preamble and epilogue In-Reply-To: References: <3WgQlD5GcszPGf@mail.python.org> Message-ID: On Mon, Jul 23, 2012 at 12:34 AM, Meador Inge wrote: > On Sun, Jul 22, 2012 at 8:55 PM, r.david.murray > wrote: > >> http://hg.python.org/cpython/rev/80b81658455b >> changeset: 78246:80b81658455b >> parent: 78244:c43d73277756 >> parent: 78245:b97f65f2298d >> user: R David Murray >> date: Sun Jul 22 21:53:54 2012 -0400 >> summary: >> Merge #15232: correctly mangle From lines in MIME preamble and epilogue >> >> files: >> Lib/email/generator.py | 12 ++++++++- >> Lib/test/test_email/test_email.py | 22 +++++++++++++++++++ >> Misc/NEWS | 3 ++ >> 3 files changed, 35 insertions(+), 2 deletions(-) > > I'm not quite sure what happened, but something seems to have gone wrong > with this merge. After doing the following while on the "default" branch: > > $ hg merge 3.2 > > I see: > > $ hg st > M Lib/email/generator.py > M Lib/test/test_email/test_email.py > M Misc/NEWS > > and a bunch of conflicts in 'Misc/NEWS'. 
Hmmm, actually it looks like this head merge that Senthil performed: http://hg.python.org/cpython/rev/af2e044609ca. Anyway, I resolved the conflicts. -- Meador From andrew.svetlov at gmail.com Mon Jul 23 18:24:15 2012 From: andrew.svetlov at gmail.com (Andrew Svetlov) Date: Mon, 23 Jul 2012 19:24:15 +0300 Subject: [Python-Dev] venv scripts for fish and csh shells In-Reply-To: <20120722195440.2cfc10a2@pitrou.net> References: <50083671.6050903@oddbird.net> <20120722195440.2cfc10a2@pitrou.net> Message-ID: I thought my proposition was a minor change, but if it's too late for 3.3, I'm OK. On Sun, Jul 22, 2012 at 8:54 PM, Antoine Pitrou wrote: > On Sun, 22 Jul 2012 20:39:15 +0300 > Andrew Svetlov wrote: >> Ok. >> Sorry for my mistake -- there are really no commits for >> http://bugs.python.org/issue15417 >> It looks important to me -- but you are the release manager. >> If you consider the patch as potentially dangerous -- I have to agree with you. >> You are the master :) > > This is not because Georg is the master. When a release is nearing we > think it is important to avoid introducing potential regressions, > except when fixing existing bugs. That's why we have a feature freeze > which extends to many kinds of "enhancements", including performance > improvements: really, it is more of a "bugfix-only period". > > One could propose other mechanisms for release preparation, but in the > meantime, it is important as a community that we all follow similar > rules. > > Regards > > Antoine. 
> > > -- > Software development and contracting: http://pro.pitrou.net > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/andrew.svetlov%40gmail.com -- Thanks, Andrew Svetlov From meadori at gmail.com Mon Jul 23 18:27:44 2012 From: meadori at gmail.com (Meador Inge) Date: Mon, 23 Jul 2012 11:27:44 -0500 Subject: [Python-Dev] [Python-checkins] cpython (merge 3.2 -> default): MERGE: Better test for Issue #15402: Add a __sizeof__ method to struct.Struct In-Reply-To: <3Wgnsd0h5RzNS2@mail.python.org> References: <3Wgnsd0h5RzNS2@mail.python.org> Message-ID: On Mon, Jul 23, 2012 at 11:17 AM, jesus.cea wrote: > http://hg.python.org/cpython/rev/b9a3ed1b14b9 > changeset: 78260:b9a3ed1b14b9 > parent: 78257:03063e718f5f > parent: 78259:1911e192af0d > user: Jesus Cea > date: Mon Jul 23 18:16:18 2012 +0200 > summary: > MERGE: Better test for Issue #15402: Add a __sizeof__ method to struct.Struct > > files: > Doc/ACKS.txt | 1 + > Lib/test/test_struct.py | 8 ++++---- > 2 files changed, 5 insertions(+), 4 deletions(-) Jesús, Doc/ACKS.txt is *only* for acknowledging documentation contributions. Serhiy is already in Misc/ACKS. No need to add him to Doc/ACKS.txt. As for the tests, I intentionally kept them the way that Serhiy contributed them -- using >= instead of >. I kept them this way because we also discussed in issue14596 the prospect of optimizing the way repeat counts are handled. These tests would start failing if (when) that optimization happens. So, neither of these changes are really necessary. Although, it wouldn't hurt to have *additional* tests using the > relation. 
> > diff --git a/Doc/ACKS.txt b/Doc/ACKS.txt > --- a/Doc/ACKS.txt > +++ b/Doc/ACKS.txt > @@ -205,6 +205,7 @@ > * Anthony Starks > * Greg Stein > * Peter Stoehr > + * Serhiy Storchaka > * Mark Summerfield > * Reuben Sumner > * Kalle Svensson > diff --git a/Lib/test/test_struct.py b/Lib/test/test_struct.py > --- a/Lib/test/test_struct.py > +++ b/Lib/test/test_struct.py > @@ -575,12 +575,12 @@ > def test_sizeof(self): > self.assertGreater(sys.getsizeof(struct.Struct('BHILfdspP')), > sys.getsizeof(struct.Struct('B'))) > - self.assertGreaterEqual(sys.getsizeof(struct.Struct('123B')), > + self.assertGreater(sys.getsizeof(struct.Struct('123B')), > sys.getsizeof(struct.Struct('B'))) > - self.assertGreaterEqual(sys.getsizeof(struct.Struct('B' * 123)), > + self.assertGreater(sys.getsizeof(struct.Struct('B' * 1234)), > sys.getsizeof(struct.Struct('123B'))) > - self.assertGreaterEqual(sys.getsizeof(struct.Struct('123xB')), > - sys.getsizeof(struct.Struct('B'))) > + self.assertGreater(sys.getsizeof(struct.Struct('1234B')), > + sys.getsizeof(struct.Struct('123B'))) > > def test_main(): > run_unittest(StructTest) > > -- > Repository URL: http://hg.python.org/cpython > > _______________________________________________ > Python-checkins mailing list > Python-checkins at python.org > http://mail.python.org/mailman/listinfo/python-checkins > -- # Meador From brian at python.org Mon Jul 23 18:34:57 2012 From: brian at python.org (Brian Curtin) Date: Mon, 23 Jul 2012 11:34:57 -0500 Subject: [Python-Dev] venv scripts for fish and csh shells In-Reply-To: References: <50083671.6050903@oddbird.net> <20120722195440.2cfc10a2@pitrou.net> Message-ID: On Mon, Jul 23, 2012 at 11:24 AM, Andrew Svetlov wrote: > I thought my proposition was a minor change, but if it's too late for 3.3, > I'm OK. Very simply, the first beta is when feature freeze goes into effect. This is a really common policy that has been in effect for a long time in CPython and most projects. 
From jcea at jcea.es Mon Jul 23 18:38:30 2012 From: jcea at jcea.es (Jesus Cea) Date: Mon, 23 Jul 2012 18:38:30 +0200 Subject: [Python-Dev] [Python-checkins] cpython (merge 3.2 -> default): MERGE: Better test for Issue #15402: Add a __sizeof__ method to struct.Struct In-Reply-To: References: <3Wgnsd0h5RzNS2@mail.python.org> Message-ID: <500D7E06.4020606@jcea.es> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 23/07/12 18:27, Meador Inge wrote: > Doc/ACKS.txt is *only* for acknowledging documentation > contributions. Serhiy is already in Misc/ACKS. No need to add him > to Doc/ACKS.txt. Oh, I missed that. Thanks for the heads up. > As for the tests, I intentionally kept them the way that Serhiy > contributed them -- using >= instead of >. I kept them this way > because we also discussed in issue14596 the prospect of optimizing > the way repeat counts are handled. These tests would start failing > if (when) that optimization happens. The problem is that if we do ">=", then an unpatched python interpreter could pass the test too. So we are not actually testing the feature. If the repeat counters are going to be optimized, the obvious step would be to upgrade the test to do something like "BHHIL" instead of "123B". I would wait until this feature is implemented to update the test. What do you think? > > So, neither of these changes are really necessary. Although, it > wouldn't hurt to have *additional* tests using the > relation. 
> >> >> diff --git a/Doc/ACKS.txt b/Doc/ACKS.txt --- a/Doc/ACKS.txt +++ >> b/Doc/ACKS.txt @@ -205,6 +205,7 @@ * Anthony Starks * Greg Stein >> * Peter Stoehr + * Serhiy Storchaka * Mark Summerfield * Reuben >> Sumner * Kalle Svensson diff --git a/Lib/test/test_struct.py >> b/Lib/test/test_struct.py --- a/Lib/test/test_struct.py +++ >> b/Lib/test/test_struct.py @@ -575,12 +575,12 @@ def >> test_sizeof(self): >> self.assertGreater(sys.getsizeof(struct.Struct('BHILfdspP')), >> sys.getsizeof(struct.Struct('B'))) - >> self.assertGreaterEqual(sys.getsizeof(struct.Struct('123B')), + >> self.assertGreater(sys.getsizeof(struct.Struct('123B')), >> sys.getsizeof(struct.Struct('B'))) - >> self.assertGreaterEqual(sys.getsizeof(struct.Struct('B' * 123)), >> + self.assertGreater(sys.getsizeof(struct.Struct('B' * >> 1234)), sys.getsizeof(struct.Struct('123B'))) - >> self.assertGreaterEqual(sys.getsizeof(struct.Struct('123xB')), - >> sys.getsizeof(struct.Struct('B'))) + >> self.assertGreater(sys.getsizeof(struct.Struct('1234B')), + >> sys.getsizeof(struct.Struct('123B'))) >> >> def test_main(): run_unittest(StructTest) >> >> -- Repository URL: http://hg.python.org/cpython >> >> _______________________________________________ Python-checkins >> mailing list Python-checkins at python.org >> http://mail.python.org/mailman/listinfo/python-checkins >> > > > - -- Jesus Cea Avion _/_/ _/_/_/ _/_/_/ jcea at jcea.es - http://www.jcea.es/ _/_/ _/_/ _/_/ _/_/ _/_/ jabber / xmpp:jcea at jabber.org _/_/ _/_/ _/_/_/_/_/ . 
_/_/ _/_/ _/_/ _/_/ _/_/ "Things are not so easy" _/_/ _/_/ _/_/ _/_/ _/_/ _/_/ "My name is Dump, Core Dump" _/_/_/ _/_/_/ _/_/ _/_/ "El amor es poner tu felicidad en la felicidad de otro" - Leibniz -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iQCVAwUBUA1+Bplgi5GaxT1NAQJtSAQAkv5DyoQ1N1YdOH2QLHnFbOsvp/1aG0Vy hHMlD6cu/L7Ub+gyWWo65v9Dp4sLahV+CYem1wL4Fzd2QyBNQdg+BNou9eqoDzGF IJbY2HALwOwz1vgeBiamFOSvpyWya/hzXR9I7rkBqXdR9c2Njdl/ioZQNKETO05k TRfd/BQas4k= =TKFO -----END PGP SIGNATURE----- From solipsis at pitrou.net Mon Jul 23 19:17:32 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 23 Jul 2012 19:17:32 +0200 Subject: [Python-Dev] Doc/ACKS and Misc/ACKS References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> Message-ID: <20120723191732.7e1093b6@pitrou.net> On Mon, 23 Jul 2012 18:38:30 +0200 Jesus Cea wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > On 23/07/12 18:27, Meador Inge wrote: > > Doc/ACKS.txt is *only* for acknowledging documentation > > contributions. Serhiy is already in Misc/ACKS. No need to add him > > to Doc/ACKS.txt. > > Oh, I missed that. Thanks for the heads up. That said, we could probably merge Doc/ACKS and Misc/ACKS (*). There doesn't seem to be any strong argument for separating doc contributions from other contributions. (*) I think perhaps Éric already proposed it at some point Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From eliben at gmail.com Mon Jul 23 19:30:00 2012 From: eliben at gmail.com (Eli Bendersky) Date: Mon, 23 Jul 2012 20:30:00 +0300 Subject: [Python-Dev] Doc/ACKS and Misc/ACKS In-Reply-To: <20120723191732.7e1093b6@pitrou.net> References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> <20120723191732.7e1093b6@pitrou.net> Message-ID: >> On 23/07/12 18:27, Meador Inge wrote: >> > Doc/ACKS.txt is *only* for acknowledging documentation >> > contributions. 
Serhiy is already in Misc/ACKS. No need to add him >> > to Doc/ACKS.txt. >> >> Oh, I missed that. Thanks for the heads up. > > That said, we could probably merge Doc/ACKS and Misc/ACKS (*). There > doesn't seem to be any strong argument for separating doc contributions > from other contributions. > > (*) I think perhaps Éric already proposed it at some point > +1 From meadori at gmail.com Mon Jul 23 20:05:27 2012 From: meadori at gmail.com (Meador Inge) Date: Mon, 23 Jul 2012 13:05:27 -0500 Subject: [Python-Dev] Doc/ACKS and Misc/ACKS In-Reply-To: <20120723191732.7e1093b6@pitrou.net> References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> <20120723191732.7e1093b6@pitrou.net> Message-ID: On Mon, Jul 23, 2012 at 12:17 PM, Antoine Pitrou wrote: > On Mon, 23 Jul 2012 18:38:30 +0200 > Jesus Cea wrote: >> -----BEGIN PGP SIGNED MESSAGE----- >> Hash: SHA1 >> >> On 23/07/12 18:27, Meador Inge wrote: >> > Doc/ACKS.txt is *only* for acknowledging documentation >> > contributions. Serhiy is already in Misc/ACKS. No need to add him >> > to Doc/ACKS.txt. >> >> Oh, I missed that. Thanks for the heads up. > > That said, we could probably merge Doc/ACKS and Misc/ACKS (*). There > doesn't seem to be any strong argument for separating doc contributions > from other contributions. > > (*) I think perhaps Éric already proposed it at some point +1 -- Meador From meadori at gmail.com Mon Jul 23 20:19:51 2012 From: meadori at gmail.com (Meador Inge) Date: Mon, 23 Jul 2012 13:19:51 -0500 Subject: [Python-Dev] [Python-checkins] cpython (merge 3.2 -> default): MERGE: Better test for Issue #15402: Add a __sizeof__ method to struct.Struct In-Reply-To: <500D7E06.4020606@jcea.es> References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> Message-ID: On Mon, Jul 23, 2012 at 11:38 AM, Jesus Cea wrote: >> As for the tests, I intentionally kept them the way that Serhiy >> contributed them -- using >= instead of >. 
I kept them this way >> because we also discussed in issue14596 the prospect of optimizing >> the way repeat counts are handled. These tests would start failing >> if (when) that optimization happens. > > The problem is that if we do ">=", then an unpatched python > interpreter could pass the test too. So we are not actually testing > the feature. We are testing the feature because the first test looks like: self.assertGreater(sys.getsizeof(struct.Struct('BHILfdspP')), sys.getsizeof(struct.Struct('B'))) The way things were written 'sys.getsizeof' would return the same answer regardless of the format string. The remaining tests looked like: self.assertGreaterEqual(sys.getsizeof(struct.Struct('123B')), sys.getsizeof(struct.Struct('B'))) self.assertGreaterEqual(sys.getsizeof(struct.Struct('B' * 123)), sys.getsizeof(struct.Struct('123B'))) self.assertGreaterEqual(sys.getsizeof(struct.Struct('123xB')), sys.getsizeof(struct.Struct('B'))) and while they didn't fail without the patch I felt they were still useful in documenting that there is nothing that guarantees 'sizeof("123B") > sizeof("B")', 'sizeof("B" * 123) > sizeof("123B")', or 'sizeof("123xB") > sizeof("B")'. > If the repeat counters are going to be optimized, the obvious step > would be to upgrade the test to do something like "BHHIL" instead of > "123B". I would wait until this feature is implemented to update the test. That is what the first test basically already does :-) > What do you think? It isn't that big of a deal. We can just leave the tests as you changed them. In the future it would probably be better to hash this stuff out in the tracker. The patch was out for review for several days ... 
Thanks, -- Meador From jcea at jcea.es Mon Jul 23 20:28:04 2012 From: jcea at jcea.es (Jesus Cea) Date: Mon, 23 Jul 2012 20:28:04 +0200 Subject: [Python-Dev] Doc/ACKS and Misc/ACKS In-Reply-To: References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> <20120723191732.7e1093b6@pitrou.net> Message-ID: <500D97B4.90005@jcea.es> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 23/07/12 19:30, Eli Bendersky wrote: >>> That said, we could probably merge Doc/ACKS and Misc/ACKS (*). >>> There doesn't seem to be any strong argument for separating doc >>> contributions from other contributions. >>> >>> (*) I think perhaps Éric already proposed it at some point >>> >> >> +1 > > +1 too. - -- Jesus Cea Avion _/_/ _/_/_/ _/_/_/ jcea at jcea.es - http://www.jcea.es/ _/_/ _/_/ _/_/ _/_/ _/_/ jabber / xmpp:jcea at jabber.org _/_/ _/_/ _/_/_/_/_/ . _/_/ _/_/ _/_/ _/_/ _/_/ "Things are not so easy" _/_/ _/_/ _/_/ _/_/ _/_/ _/_/ "My name is Dump, Core Dump" _/_/_/ _/_/_/ _/_/ _/_/ "El amor es poner tu felicidad en la felicidad de otro" - Leibniz -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iQCVAwUBUA2XtJlgi5GaxT1NAQKiXQQAnOmVaALBmcAbEK7vImQ03m6tdh86ZyU/ VyRuoHVgHxsOn83h2VG+94zjNutedIMK9rq1hEhhPApJcXnYwftMpgEwlyj7vLFA RUz8c02sKpoi/T8BGv2xVdW09yeMCUwzTDAuaS73NqscwcGplibaSPU5oKOjqetc NhS0JdGQcr8= =Ifpc -----END PGP SIGNATURE----- From brian at python.org Mon Jul 23 20:32:02 2012 From: brian at python.org (Brian Curtin) Date: Mon, 23 Jul 2012 13:32:02 -0500 Subject: [Python-Dev] Doc/ACKS and Misc/ACKS In-Reply-To: <500D97B4.90005@jcea.es> References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> <20120723191732.7e1093b6@pitrou.net> <500D97B4.90005@jcea.es> Message-ID: On Mon, Jul 23, 2012 at 1:28 PM, Jesus Cea wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > On 23/07/12 19:30, Eli Bendersky wrote: >>> That said, we could probably merge Doc/ACKS and Misc/ACKS (*). 
>>> There doesn't seem to be any strong argument for separating doc >>> contributions from other contributions. >>> >>> (*) I think perhaps Éric already proposed it at some point >>> >> >> +1 > > +1 too. Before everyone else on the list just writes a two character "+1" response, can we just assume that if you don't speak up, you're ok with it? Especially when it's about an ack file... From jcea at jcea.es Mon Jul 23 20:35:53 2012 From: jcea at jcea.es (Jesus Cea) Date: Mon, 23 Jul 2012 20:35:53 +0200 Subject: [Python-Dev] [Python-checkins] cpython (merge 3.2 -> default): MERGE: Better test for Issue #15402: Add a __sizeof__ method to struct.Struct In-Reply-To: References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> Message-ID: <500D9989.1050506@jcea.es> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 23/07/12 20:19, Meador Inge wrote: > self.assertGreaterEqual(sys.getsizeof(struct.Struct('123B')), [...] > and while they didn't fail without the patch I felt they were still > useful in documenting that there is nothing that guarantees > 'sizeof("123B") > sizeof("B")', 'sizeof("B" * 123) > > sizeof("123B")', or 'sizeof("123xB") > sizeof("B")'. No guarantee, but I would find it "interesting" that "sizeof("1234B")==sizeof("B")". If someday we implement some clever idea here (like the repeat counter optimization discussed), we can simply change "sizeof("123B")" to "sizeof("12345B")", or to "sizeof("BHBHBHBH")", etc. > It isn't that big of a deal. We can just leave the tests as you > changed them. In the future it would probably be better to hash > this stuff out in the tracker. The patch was out for review for > several days ... I agree. I should have raised this issue in the tracker. The fact is that I was checking the patch carefully today, when the two of us collided mid-air working on the same issue :-). I disliked the proposed tests at that time. Thanks for raising the issue. I will try to be more careful. 
- -- Jesus Cea Avion _/_/ _/_/_/ _/_/_/ jcea at jcea.es - http://www.jcea.es/ _/_/ _/_/ _/_/ _/_/ _/_/ jabber / xmpp:jcea at jabber.org _/_/ _/_/ _/_/_/_/_/ . _/_/ _/_/ _/_/ _/_/ _/_/ "Things are not so easy" _/_/ _/_/ _/_/ _/_/ _/_/ _/_/ "My name is Dump, Core Dump" _/_/_/ _/_/_/ _/_/ _/_/ "El amor es poner tu felicidad en la felicidad de otro" - Leibniz -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iQCVAwUBUA2ZiZlgi5GaxT1NAQKwLQP/RqrP5qbvUtZ9MCuyTaT45l8+7QzqlJrx Nyh2t98jWVxiso0FDyT2vw839lX0CwssuKyNPFkXzKicNiX4mW0rC1uxNajCk0kG aVHKL6aC+65iJhA7+9uOW1yfRFyhqQbUc3aRlvg7UJMj4YEfB82Okk/2Wu0hgyiU 4Ti5VvFuOZ8= =G/WJ -----END PGP SIGNATURE----- From storchaka at gmail.com Mon Jul 23 20:49:56 2012 From: storchaka at gmail.com (Serhiy Storchaka) Date: Mon, 23 Jul 2012 21:49:56 +0300 Subject: [Python-Dev] cpython (merge 3.2 -> default): MERGE: Better test for Issue #15402: Add a __sizeof__ method to struct.Struct In-Reply-To: <500D7E06.4020606@jcea.es> References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> Message-ID: On 23.07.12 19:38, Jesus Cea wrote: > The problem is that if we do ">=", then an unpatched python > interpreter could pass the test too. So we are not actually testing > the feature. > > If the repeat counters are going to be optimized, the obvious step > would be to upgrade the test to do something like "BHHIL" instead of > "123B". I would wait until this feature is implemented to update the test. > > What do you think?. I think any __sizeof__ tests are meaningless, because any result is implementation detail. For other implementations we get other values and other relations. Any of our a priori assumptions could be incorrect. Even my first assert may fail, if implementation uses a continuous array with overallocation. 
I am now prepared a set of 14 __sizeof__ patches (should it be one issue or 14 individual issues in bugtracker?), and I feel a great desire not to write tests at all. From storchaka at gmail.com Mon Jul 23 20:59:40 2012 From: storchaka at gmail.com (Serhiy Storchaka) Date: Mon, 23 Jul 2012 21:59:40 +0300 Subject: [Python-Dev] cpython (merge 3.2 -> default): MERGE: Better test for Issue #15402: Add a __sizeof__ method to struct.Struct In-Reply-To: References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> Message-ID: On 23.07.12 21:19, Meador Inge wrote: > The patch was out for review for several days ... Actually for several months (in issue14596). From fuzzyman at voidspace.org.uk Mon Jul 23 21:46:44 2012 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Mon, 23 Jul 2012 20:46:44 +0100 Subject: [Python-Dev] cpython (merge 3.2 -> default): MERGE: Better test for Issue #15402: Add a __sizeof__ method to struct.Struct In-Reply-To: References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> Message-ID: <531E3DFF-5302-4045-BE27-1AD61D3713F0@voidspace.org.uk> On 23 Jul 2012, at 19:49, Serhiy Storchaka wrote: > On 23.07.12 19:38, Jesus Cea wrote: >> The problem is that if we do ">=", then an unpatched python >> interpreter could pass the test too. So we are not actually testing >> the feature. >> >> If the repeat counters are going to be optimized, the obvious step >> would be to upgrade the test to do something like "BHHIL" instead of >> "123B". I would wait until this feature is implemented to update the test. >> >> What do you think?. > > I think any __sizeof__ tests are meaningless, because any result is implementation detail. For other implementations we get other values and other relations. Any of our a priori assumptions could be incorrect. Even my first assert may fail, if implementation uses a continuous array with overallocation. 
> > I am now prepared a set of 14 __sizeof__ patches (should it be one issue or 14 individual issues in bugtracker?), and I feel a great desire not to write tests at all. > Without tests how can you have any confidence the patches are correct (or will continue to be correct)? Just because something is implementation dependent doesn't mean it should not be tested - instead the tests should be marked as cpython specific so that they aren't executed on other implementations. All the best, Michael > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk > -- http://www.voidspace.org.uk/ May you do good and not evil May you find forgiveness for yourself and forgive others May you share freely, never taking more than you give. -- the sqlite blessing http://www.sqlite.org/different.html From chris.jerdonek at gmail.com Mon Jul 23 22:11:13 2012 From: chris.jerdonek at gmail.com (Chris Jerdonek) Date: Mon, 23 Jul 2012 13:11:13 -0700 Subject: [Python-Dev] Doc/ACKS and Misc/ACKS Message-ID: > Date: Mon, 23 Jul 2012 19:17:32 +0200 > From: Antoine Pitrou > To: python-dev at python.org > Subject: [Python-Dev] Doc/ACKS and Misc/ACKS > >> > Doc/ACKS.txt is *only* for acknowledging documentation >> > contributions. Serhiy is already in Misc/ACKS. No need to add him >> > to Doc/ACKS.txt. >> > That said, we could probably merge Doc/ACKS and Misc/ACKS (*). There > doesn't seem to be any strong argument for separating doc contributions > from other contributions. 
> > (*) I think perhaps Éric already proposed it at some point I created an issue for this here: http://bugs.python.org/issue15437 --Chris From storchaka at gmail.com Mon Jul 23 22:31:14 2012 From: storchaka at gmail.com (Serhiy Storchaka) Date: Mon, 23 Jul 2012 23:31:14 +0300 Subject: [Python-Dev] cpython (merge 3.2 -> default): MERGE: Better test for Issue #15402: Add a __sizeof__ method to struct.Struct In-Reply-To: <531E3DFF-5302-4045-BE27-1AD61D3713F0@voidspace.org.uk> References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> <531E3DFF-5302-4045-BE27-1AD61D3713F0@voidspace.org.uk> Message-ID: On 23.07.12 22:46, Michael Foord wrote: >> I am now prepared a set of 14 __sizeof__ patches (should it be one issue or 14 individual issues in bugtracker?), and I feel a great desire not to write tests at all. > > Without tests how can you have any confidence the patches are correct (or will continue to be correct)? Tests may not provide this. If we add new dynamically allocated data or change the size of existing data and forget to reflect this in __sizeof__, then unmodified tests will not notice it. __sizeof__ returns 42. What is 42? From ethan at stoneleaf.us Mon Jul 23 22:46:29 2012 From: ethan at stoneleaf.us (Ethan Furman) Date: Mon, 23 Jul 2012 13:46:29 -0700 Subject: [Python-Dev] Doc/ACKS and Misc/ACKS In-Reply-To: References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> <20120723191732.7e1093b6@pitrou.net> <500D97B4.90005@jcea.es> Message-ID: <500DB825.9030908@stoneleaf.us> Brian Curtin wrote: > On Mon, Jul 23, 2012 at 1:28 PM, Jesus Cea wrote: >> -----BEGIN PGP SIGNED MESSAGE----- >> Hash: SHA1 >> >> On 23/07/12 19:30, Eli Bendersky wrote: >>>> That said, we could probably merge Doc/ACKS and Misc/ACKS (*). >>>> There doesn't seem to be any strong argument for separating doc >>>> contributions from other contributions. >>>> >>>> (*) I think perhaps Éric already proposed it at some point >>>> >>> +1 >> +1 too. 
> > Before everyone else on the list just writes a two character "+1" > response, can we just assume that if you don't speak up, you're ok > with it? You mean I don't get to be clever and say +1 + too == +3 ? ;) ~Ethan~ From martin at v.loewis.de Mon Jul 23 23:44:47 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Mon, 23 Jul 2012 23:44:47 +0200 Subject: [Python-Dev] cpython (merge 3.2 -> default): MERGE: Better test for Issue #15402: Add a __sizeof__ method to struct.Struct In-Reply-To: References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> <531E3DFF-5302-4045-BE27-1AD61D3713F0@voidspace.org.uk> Message-ID: <20120723234447.Horde.9EtVNtjz9kRQDcXPZFbhwBA@webmail.df.eu> Quoting Serhiy Storchaka: > On 23.07.12 22:46, Michael Foord wrote: >>> I am now prepared a set of 14 __sizeof__ patches (should it be one >>> issue or 14 individual issues in bugtracker?), and I feel a great >>> desire not to write tests at all. >> >> Without tests how can you have any confidence the patches are >> correct (or will continue to be correct)? > > Tests may not provide this. If we add new dynamically allocated > data or change the size of existing data and forget to reflect this in > __sizeof__, then unmodified tests will not notice it. __sizeof__ > returns 42. What is 42? 42 is most likely not the right answer, as the size should be a multiple of four. The point of writing tests for the sizeof code is that you detect your own mistakes *in writing the test case*. The value of the test case is not so much that it provides detection of regressions, but that you notice the bug once you write the test case. You may argue that then there is no value in committing the test case, but there is also no harm in doing so. In addition, it *may* catch regressions: if somebody changes the layout of an object, the corresponding test most likely fails. 
Maybe the fix is trivial and just in the test, but maybe the layout changed fundamentally, and the author forgot to change sizeof along with the structure change. You wouldn't have to write 14 patches if implementing sizeof correctly was easy. Please see the existing extensive tests for inspiration on how to add new ones. Regards, Martin From techtonik at gmail.com Mon Jul 23 23:50:52 2012 From: techtonik at gmail.com (anatoly techtonik) Date: Tue, 24 Jul 2012 00:50:52 +0300 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: References: <20120722130849.A2968250064@webabinitio.net> <20120722195711.16863250054@webabinitio.net> Message-ID: On Mon, Jul 23, 2012 at 12:21 AM, Oscar Benjamin wrote: >> On Sun, 22 Jul 2012 20:22:50 +0100, Oscar Benjamin >> wrote: >> > On 22 July 2012 14:08, R. David Murray wrote: >> > >> > > On Sun, 22 Jul 2012 11:21:38 +0300, anatoly techtonik >> > > >> > > wrote: >> > > > http://docs.python.org/py3k/howto/pyporting.html#during-installation >> > > > >> > > > What's the point in making implicit Python 3 check here: >> > > > try: # Python 3 >> > > > from distutils.command.build_py import build_py_2to3 as build_py >> > > > except ImportError: # Python 2 >> > > > from distutils.command.build_py import build_py >> > > > >> > > > instead of explicit check like: >> > > > import sys >> > > > if sys.version_info[0] >= 3: >> > > > from distutils.command.build_py import build_py_2to3 as build_py >> > > >> > > It's called testing for the thing that actually matters, rather than >> > > testing a constant with a much broader meaning. Yes, in this case the >> > > results are the same, but IMO it is better programming practice to >> > > test >> > > the thing that actually matters when you can. >> > >> > >> > I recently changed a setup.py from try/ImportError to an explicit >> > sys.version_info check. 
I'm not totally sure how to reproduce this but I >> > had a problem where I was installing into a 2.x virtualenv and it was >> > running 2to3 during install and subsequently failing to import the 3.x >> > code >> > (the problem didn't occur when using the same python that generated the >> > virtualenv). >> > >> > I may be wrong but I imagined that sometimes build_py_2to3 is importable >> > on >> > 2.x, perhaps for cross-building or something. In any case 'testing the >> > thing that matters' means testing what version of Python you are about >> > to >> > install into not whether the python version supports running 2to3. >> >> I'm not familiar with distutils, really, so you could be right about >> what it is important to test. I was commenting based on the code >> snippet presented, which just deciding which "build" object to use. >> If build_py_2to3 can be imported by python2 and subsequently screws up >> the build, then yes the logic is incorrect. But I have to defer to the >> packaging people on that. (I wish I had time to help with packaging >> because it is important, but it doesn't seem like a sensible place for >> me personally to spend my currently available time.) > > > I'm not currently able to reproduce the problem on this machine. I think I > was using pip/easy_install to install distribution X from PyPI that depended > on distribution Y also on PyPI into an isolated 2.x virtualenv and found > that distribution Y was converted with 2to3 when it was automatically > installed. It could be a bug but I'm not confident enough with virtualenv to > say that it wasn't just me messing things up somehow. > > Either way I still think that in this particular case a version check is the > most explicit and appropriate thing to do. The author of a distribution that > is distributed as Python 2.x code and installed on Python 3.x using 2to3 > knows precisely when they want 2to3 to run and when they don't so why not > make that explicit? 
> > As an aside, I find the check slightly easier to read if it is written like: > > if sys.version_info >= (3, 0): > from distutils.command.build_py import build_py_2to3 as build_py Yes. This looks better. If we reached consensus, I wonder how hard it is to find somebody who has the rights and is able to fix the documentation: http://docs.python.org/py3k/howto/pyporting.html#during-installation -- anatoly t. From merwok at netwok.org Tue Jul 24 00:27:54 2012 From: merwok at netwok.org (Éric Araujo) Date: Mon, 23 Jul 2012 18:27:54 -0400 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: <20120722195711.16863250054@webabinitio.net> References: <20120722130849.A2968250064@webabinitio.net> <20120722195711.16863250054@webabinitio.net> Message-ID: <500DCFEA.8030503@netwok.org> On 22/07/2012 15:57, R. David Murray wrote: > I'm not familiar with distutils, really, so you could be right about > what it is important to test. I was commenting based on the code > snippet presented, which just deciding which "build" object to use. > If build_py_2to3 can be imported by python2 and subsequently screws up > the build, then yes the logic is incorrect. That can't happen. The *_2to3 classes (don't forget build_scripts_2to3) only exist in 3.x and work with a version check or an import with fallback. There is no cross-version-build at all in distutils. Regards From oscar.benjamin at bristol.ac.uk Tue Jul 24 00:41:40 2012 From: oscar.benjamin at bristol.ac.uk (Oscar Benjamin) Date: Mon, 23 Jul 2012 23:41:40 +0100 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: <500DCFEA.8030503@netwok.org> References: <20120722130849.A2968250064@webabinitio.net> <20120722195711.16863250054@webabinitio.net> <500DCFEA.8030503@netwok.org> Message-ID: On 23 July 2012 23:27, Éric Araujo wrote: > On 22/07/2012 15:57, R. David Murray wrote: > >> I'm not familiar with distutils, really, so you could be right about >> what it is important to test. 
I was commenting based on the code >> snippet presented, which just deciding which "build" object to use. >> If build_py_2to3 can be imported by python2 and subsequently screws up >> the build, then yes the logic is incorrect. >> > > That can't happen. The *_2to3 classes (don't forget build_scripts_2to3) > only exist in 3.x and work with a version check or an import with fallback. > There is no cross-version-build at all in distutils. > I'm regretting the fact that I didn't keep notes of exactly what I was doing and I can't reproduce this now but this did happen to me when using one of pip/easy_install in a virtualenv. As I said earlier it may have been a mistake on my part as I'm not confident with virtualenv. At the time when I looked at the files in my pretty much empty virtualenv the main things I could see were related to setuptools so I guessed that some kind of setuptools monkey-patch had made build_py_2to3 importable and changed the setup.py to an explicit version check which fixed the problem in that environment. Oscar. -------------- next part -------------- An HTML attachment was scrubbed... URL: From techtonik at gmail.com Tue Jul 24 06:44:47 2012 From: techtonik at gmail.com (anatoly techtonik) Date: Tue, 24 Jul 2012 07:44:47 +0300 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: <500DCFEA.8030503@netwok.org> References: <20120722130849.A2968250064@webabinitio.net> <20120722195711.16863250054@webabinitio.net> <500DCFEA.8030503@netwok.org> Message-ID: On Tue, Jul 24, 2012 at 1:27 AM, Éric Araujo wrote: > On 22/07/2012 15:57, R. David Murray wrote: >> I'm not familiar with distutils, really, so you could be right about >> what it is important to test. I was commenting based on the code >> snippet presented, which just deciding which "build" object to use. >> If build_py_2to3 can be imported by python2 and subsequently screws up >> the build, then yes the logic is incorrect. > > That can't happen. 
The *_2to3 classes (don't forget build_scripts_2to3) > only exist in 3.x and work with a version check or an import with fallback. > There is no cross-version-build at all in distutils. The Python 3 check explicitly tells the reader that 2to3 should only be used in Python 3. Otherwise everybody needs to guess when these *_2to3 tools are triggered. As for me, I see no technical limitation why *_2to3 cannot be run by Python 2 (PyPy, RPython or whatever). Maybe I don't have Python 3, but want to build my package for Python 3. In an ideal world it is possible. -- anatoly t. From storchaka at gmail.com Tue Jul 24 09:49:38 2012 From: storchaka at gmail.com (Serhiy Storchaka) Date: Tue, 24 Jul 2012 10:49:38 +0300 Subject: [Python-Dev] cpython (merge 3.2 -> default): MERGE: Better test for Issue #15402: Add a __sizeof__ method to struct.Struct In-Reply-To: <20120723234447.Horde.9EtVNtjz9kRQDcXPZFbhwBA@webmail.df.eu> References: <3Wgnsd0h5RzNS2@mail.python.org> <500D7E06.4020606@jcea.es> <531E3DFF-5302-4045-BE27-1AD61D3713F0@voidspace.org.uk> <20120723234447.Horde.9EtVNtjz9kRQDcXPZFbhwBA@webmail.df.eu> Message-ID: On 24.07.12 00:44, martin at v.loewis.de wrote: > 42 is most likely not the right answer, as the size should be a > multiple of four. >>> ''.__sizeof__() 25 From tjreedy at udel.edu Tue Jul 24 11:30:43 2012 From: tjreedy at udel.edu (Terry Reedy) Date: Tue, 24 Jul 2012 05:30:43 -0400 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: References: <20120722130849.A2968250064@webabinitio.net> <20120722195711.16863250054@webabinitio.net> <500DCFEA.8030503@netwok.org> Message-ID: On 7/24/2012 12:44 AM, anatoly techtonik wrote: > Python 3 check explicitly tells the reader that 2to3 should only be > used in Python 3. Otherwise everybody need to guess when this *_2to3 > tools are triggered. As for me, I see no technical limitations why > *_2to3 can not be run by Python 2 (PyPy, RPython or whatever). 
Maybe I > don't have Python3, but want to build my package for Python 3. In > ideal world it is possible. This is not an ideal world and 2to3 is not good enough to convert files without further intervention and testing. -- Terry Jan Reedy From fuzzyman at voidspace.org.uk Tue Jul 24 12:07:42 2012 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Tue, 24 Jul 2012 11:07:42 +0100 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: References: <20120722130849.A2968250064@webabinitio.net> <20120722195711.16863250054@webabinitio.net> <500DCFEA.8030503@netwok.org> Message-ID: On 24 Jul 2012, at 10:30, Terry Reedy wrote: > On 7/24/2012 12:44 AM, anatoly techtonik wrote: > >> Python 3 check explicitly tells the reader that 2to3 should only be >> used in Python 3. Otherwise everybody need to guess when this *_2to3 >> tools are triggered. As for me, I see no technical limitations why >> *_2to3 can not be run by Python 2 (PyPy, RPython or whatever). Maybe I >> don't have Python3, but want to build my package for Python 3. In >> ideal world it is possible. > > This is not an ideal world and 2to3 is not good enough to convert files without further intervention and testing. It is if you design your code *to be converted* by 2to3 and do regular testing of the result. Michael > > -- > Terry Jan Reedy > > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk > -- http://www.voidspace.org.uk/ May you do good and not evil May you find forgiveness for yourself and forgive others May you share freely, never taking more than you give. 
-- the sqlite blessing http://www.sqlite.org/different.html From oscar.j.benjamin at gmail.com Tue Jul 24 12:14:49 2012 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Tue, 24 Jul 2012 11:14:49 +0100 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: References: <20120722130849.A2968250064@webabinitio.net> <20120722195711.16863250054@webabinitio.net> <500DCFEA.8030503@netwok.org> Message-ID: On Jul 24, 2012 10:32 AM, "Terry Reedy" wrote: > > On 7/24/2012 12:44 AM, anatoly techtonik wrote: > >> Python 3 check explicitly tells the reader that 2to3 should only be >> used in Python 3. Otherwise everybody need to guess when this *_2to3 >> tools are triggered. As for me, I see no technical limitations why >> *_2to3 can not be run by Python 2 (PyPy, RPython or whatever). Maybe I >> don't have Python3, but want to build my package for Python 3. In >> ideal world it is possible. > > This is not an ideal world and 2to3 is not good enough to convert files without further intervention and testing. Which is exactly why its use should be explicit. To go back to the original question, is it not better to be explicit about the version check? The try/ImportError snippet in setup.py is often accompanied by a comment that explains the fact that it is implicitly performing a version check, whereas I find the explicit version check self-documenting. I know Python users often frown upon explicitly checking with if/else, preferring the flexibility afforded by duck typing and the possibility of monkey patching, but I don't see any advantage in this case. As said above, "it's called checking the thing that matters", which is definitely the Python version. 
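The explicit form Oscar is arguing for can be sketched as follows. This is a sketch, not code from the thread: the distutils import lines are kept in comments so it stays self-contained, and the helper name is invented for illustration.

```python
import sys

def choose_build_py():
    """Pick the build_py command class for a 2012-era setup.py.

    The explicit version check makes the intent visible at a glance:
    2to3 runs only when installing under Python 3, never under Python 2.
    """
    if sys.version_info >= (3, 0):
        # from distutils.command.build_py import build_py_2to3 as build_py
        return "build_py_2to3"
    # from distutils.command.build_py import build_py
    return "build_py"

# The chosen class would then be wired into setup(), e.g.:
#   setup(name="example", ..., cmdclass={"build_py": build_py})
print(choose_build_py())
```

Unlike the try/ImportError form, a reader (or a failing install log) immediately sees which interpreter version selected which command class.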
Oscar > > -- > Terry Jan Reedy -------------- next part -------------- An HTML attachment was scrubbed... URL: From jeanpierreda at gmail.com Tue Jul 24 12:52:16 2012 From: jeanpierreda at gmail.com (Devin Jeanpierre) Date: Tue, 24 Jul 2012 06:52:16 -0400 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: References: <20120722130849.A2968250064@webabinitio.net> <20120722195711.16863250054@webabinitio.net> <500DCFEA.8030503@netwok.org> Message-ID: On Tue, Jul 24, 2012 at 6:07 AM, Michael Foord wrote: >> This is not an ideal world and 2to3 is not good enough to convert files without further intervention and testing. > > It is if you design your code *to be converted* by 2to3 and do regular testing of the result. That's hardly without testing! -- Devin From fuzzyman at voidspace.org.uk Tue Jul 24 13:09:24 2012 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Tue, 24 Jul 2012 12:09:24 +0100 Subject: [Python-Dev] 2to3 porting HOWTO: setup.py question In-Reply-To: References: <20120722130849.A2968250064@webabinitio.net> <20120722195711.16863250054@webabinitio.net> <500DCFEA.8030503@netwok.org> Message-ID: <4A1B3EA8-3FDE-4FAA-B097-1D2921C6E186@voidspace.org.uk> On 24 Jul 2012, at 11:52, Devin Jeanpierre wrote: > On Tue, Jul 24, 2012 at 6:07 AM, Michael Foord > wrote: >>> This is not an ideal world and 2to3 is not good enough to convert files without further intervention and testing. >> >> It is if you design your code *to be converted* by 2to3 and do regular testing of the result. > > That's hardly without testing! > Well, *no* code can safely be created without testing. The OP did say intervention *and* testing... 
Michael > -- Devin > -- http://www.voidspace.org.uk/ May you do good and not evil May you find forgiveness for yourself and forgive others May you share freely, never taking more than you give. -- the sqlite blessing http://www.sqlite.org/different.html From wimcolgate at gmail.com Tue Jul 24 20:02:40 2012 From: wimcolgate at gmail.com (Wim Colgate) Date: Tue, 24 Jul 2012 11:02:40 -0700 Subject: [Python-Dev] Building python 2.7.3 with Visual Studio 2012 (VS11.0) Message-ID: Please forgive me if this is not the prescribed method for asking this question. For various reasons, I would like to build python 2.7.3 from source using the latest VS tools (VS11.0 is in RC -- which is close enough for my purposes). I have seen the various sub-directories (VC6, VS7.1 and VS8.0) in the sources for specific VS tool chains. I have also seen the patch for VS 10.0 (http://wiki.python.org/moin/VS2010). If building with VS11.0, is there more than just applying the equivalent VS10.0 patch to also include VS11? Are the other VS sub-directories unneeded? Regards, Wim From brian at python.org Tue Jul 24 21:05:45 2012 From: brian at python.org (Brian Curtin) Date: Tue, 24 Jul 2012 14:05:45 -0500 Subject: [Python-Dev] Building python 2.7.3 with Visual Studio 2012 (VS11.0) In-Reply-To: References: Message-ID: On Tue, Jul 24, 2012 at 1:02 PM, Wim Colgate wrote: > Please forgive me if this is not the prescribed method for asking this > question. > > For various reasons, I would like to build python 2.7.3 from source > using the latest VS tools (VS11.0 is in RC -- which is close enough > for my purposes). I have seen the various sub-directories (VC6, VS7.1 > and VS8.0) in the sources for specific VS tool chains. I have also > seen the patch for VS 10.0 (http://wiki.python.org/moin/VS2010). > > If building with VS11.0, is there more than just applying the > equivalent VS10.0 patch to also include VS11? Are the other VS > sub-directories unneeded? 
If you can get it working on VS2010 first, VS2012 can read that project file, but without converting it'll just run the 2010 compiler and allow you to use the 2012 IDE. Completing the actual port from 2010 to 2012 did not appear to be very hard, but I didn't look too far into it. You don't need the old VS sub-directories unless you are compiling with those versions. From martin at v.loewis.de Tue Jul 24 21:20:46 2012 From: martin at v.loewis.de (Martin v. Löwis) Date: Tue, 24 Jul 2012 21:20:46 +0200 Subject: [Python-Dev] Building python 2.7.3 with Visual Studio 2012 (VS11.0) In-Reply-To: References: Message-ID: <500EF58E.7030007@v.loewis.de> > If building with VS11.0, is there more than just applying the > equivalent VS10.0 patch to also include VS11? I think nobody *really* knows at this point. Microsoft has a tradition of breaking Python with every VS release, by making slight incompatible changes in the C library. With VS 2012, on the one hand, they give explicit consideration to VS 2010 and continued use of its tools; OTOH, they also deliberately broke XP support in the CRT. So you have to try for yourself. If Python passes the test suite (as good as the official release), then the build was successful. A different matter is dependent libraries (zlib, openssl, Tcl/Tk, ...). You also have to build those with VS 2012 (if you want to use them), each one likely posing its own challenges. If you manage to succeed, don't forget to post your findings here. Also if you fail. Good luck, Martin From vincenzo.ampolo at gmail.com Wed Jul 25 02:58:34 2012 From: vincenzo.ampolo at gmail.com (Vincenzo Ampolo) Date: Tue, 24 Jul 2012 17:58:34 -0700 Subject: [Python-Dev] datetime nanosecond support Message-ID: <500F44BA.1060802@gmail.com> Hi all, This is the first time I write to this list so thank you for considering this message (if you will) :) I know that this has been debated many times but until now there was no real use case. 
If you look on Google about "python datetime nanosecond" you can find more than 141k answers about that. They all say that "you can't due to hardware imprecisions" or "you don't need it", even though there is a good number of people looking for this feature. But let me explain my use case: most OSes let users capture network packets (using tools like tcpdump or wireshark) and store them using file formats like pcap or pcap-ng. These formats include a timestamp for each of the captured packets, and this timestamp usually has nanosecond precision. The reason is that on gigabit and 10 gigabit networks the frame rate is so high that microsecond precision is not enough to tell two frames apart. pcap (and now pcap-ng) are extremely popular file formats, with millions of files stored around the world. Support for nanoseconds in datetime would make it possible to properly parse these files inside python to compute precise statistics, for example network delays or round trip times. More about this issue at http://bugs.python.org/issue15443 I completely agree with the YAGNI principle that seems to have driven decisions in this area until now, but is it the case to reconsider it now that this real use case has shown up? Thank you for your attention Best Regards, -- Vincenzo Ampolo http://vincenzo-ampolo.net http://goshawknest.wordpress.com From guido at python.org Wed Jul 25 03:46:43 2012 From: guido at python.org (Guido van Rossum) Date: Tue, 24 Jul 2012 18:46:43 -0700 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: <500F44BA.1060802@gmail.com> References: <500F44BA.1060802@gmail.com> Message-ID: On Tue, Jul 24, 2012 at 5:58 PM, Vincenzo Ampolo wrote: > Hi all, > > This is the first time I write to this list so thank you for considering > this message (if you will) :) You're welcome. > I know that this has been debated many times but until now there was no > a real use case. 
If you look on google about "python datetime > nanosecond" you can find more than 141k answer about that. They all say > that "you can't due to hardware imprecisions" or "you don't need it" > even if there is a good amount of people looking for this feature. Have you read PEP 410 and my rejection of it (http://mail.python.org/pipermail/python-dev/2012-February/116837.html)? Even though that's about using Decimal for timestamps, it could still be considered related. > But let me explain my use case: > > most OSes let users capture network packets (using tools like tcpdump or > wireshark) and store them using file formats like pcap or pcap-ng. These > formats include a timestamp for each of the captured packets, and this > timestamp usually has nanosecond precision. The reason is that on > gigabit and 10 gigabit networks the frame rate is so high that > microsecond precision is not enough to tell two frames apart. > pcap (and now pcap-ng) are extremely popular file formats, with millions > of files stored around the world. Support for nanoseconds in datetime > would make it possible to properly parse these files inside python to > compute precise statistics, for example network delays or round trip times. > > More about this issue at http://bugs.python.org/issue15443 > > I completely agree with the YAGNI principle that seems to have driven > decisions in this area until now but It is the case to reconsider it > since this real use case has shown up? Not every use case deserves an API change. :-) First you will have to show how you'd have to code this *without* nanosecond precision in datetime and how tedious that is. (I expect that representing the timestamp as a long integer expressing a posix timestamp times a billion would be very reasonable.) I didn't read the entire bug, but it mentioned something about storing datetimes in databases. Do databases support nanosecond precision? 
-- --Guido van Rossum (python.org/~guido) From chris at kateandchris.net Wed Jul 25 05:12:05 2012 From: chris at kateandchris.net (Chris Lambacher) Date: Tue, 24 Jul 2012 23:12:05 -0400 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: References: <500F44BA.1060802@gmail.com> Message-ID: On Tue, Jul 24, 2012 at 9:46 PM, Guido van Rossum wrote: > I didn't read the entire bug, but it mentioned something about storing > datetimes in databases. Do databases support nanosecond precision? > MS SQL Server 2008 R2 has the datetime2 data type which supports 100 nanosecond (.1 microsecond) precision: http://msdn.microsoft.com/en-us/library/bb677335(v=sql.105) PostgreSQL does 1 microsecond: http://www.postgresql.org/docs/8.0/static/datatype-datetime.html If I am reading this correctly the Oracle TIMESTAMP type allows up to 9 digits of fractional seconds (1 nanosecond): http://docs.oracle.com/cd/B19306_01/server.102/b14195/sqlqr06.htm#r9c1-t3 -Chris -- Christopher Lambacher chris at kateandchris.net -------------- next part -------------- An HTML attachment was scrubbed... URL: From vincenzo.ampolo at gmail.com Wed Jul 25 05:25:38 2012 From: vincenzo.ampolo at gmail.com (Vincenzo Ampolo) Date: Tue, 24 Jul 2012 20:25:38 -0700 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: References: <500F44BA.1060802@gmail.com> Message-ID: <500F6732.8050604@gmail.com> On 07/24/2012 06:46 PM, Guido van Rossum wrote: > > You're welcome. Hi Guido, I'm glad you spent your time reading my mail. I would have never imagined that my mail could come to your attention. > Have you read PEP 410 and my rejection of it > (http://mail.python.org/pipermail/python-dev/2012-February/116837.html)? > Even though that's about using Decimal for timestamps, it could still > be considered related. I've read it, and point 5 is very much like this issue. You said: "[...] 
I see only one real use case for nanosecond precision: faithful copying of the mtime/atime recorded by filesystems, in cases where the filesystem (like e.g. ext4) records these times with nanosecond precision. Even if such timestamps can't be trusted to be accurate, converting them to floats and back loses precision, and verification using tools not written in Python will flag the difference. But for this specific use case a much simpler set of API changes will suffice; only os.stat() and os.utime() need to change slightly (and variants of os.stat() like os.fstat()). [...]" I think that's based on a wrong hypothesis: just one case -> let's handle it in a different way (modifying os.stat() and os.utime()). I would say: It's not just one case, there are at least two other scenarios. One is like mine, parsing network packets, the other one is in parsing stock trading data. But in these cases there is no os.stat() or os.utime() that can be modified. I have to write my own class to deal with time and lose all the power and flexibility that the datetime module adds to the Python language. > Not every use case deserves an API change. :-) > > First you will have to show how you'd have to code this *without* > nanosecond precision in datetime and how tedious that is. (I expect > that representing the timestamp as a long integer expressing a posix > timestamp times a billion would be very reasonable.) Yeah, that's exactly how we built our Time class to handle this, and we also wrote a Duration class to represent timedelta. 
I just need to wait tomorrow morning (PDT time) so they approve my request. Looking at the code you can see how tedious is to try to remake all the conversions that are already implemented on the datetime module. Just let me know if you actually want to have a look at the code. > > I didn't read the entire bug, but it mentioned something about storing > datetimes in databases. Do databases support nanosecond precision? > Yeah. According to http://wiki.ispirer.com/sqlways/postgresql/data-types/timestamp at least Oracle support timestamps with nanoseconds accuracy, SQL server supports 100 nanosecond accuracy. Since I use Postgresql personally the best way to accomplish it (also suggested by the #postgresql on freenode) is to store the timestamp with nanosecond (like 1343158283.880338907242) as bigint and let the ORM (so a python ORM) do all the conversion job. An yet again, having nanosecond support in datetime would make things much more easy. While writing this mail Chris Lambacher answered with more data about nanosecond support on databases Best Regards, -- Vincenzo Ampolo http://vincenzo-ampolo.net http://goshawknest.wordpress.com From guido at python.org Wed Jul 25 05:47:54 2012 From: guido at python.org (Guido van Rossum) Date: Tue, 24 Jul 2012 20:47:54 -0700 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: <500F6732.8050604@gmail.com> References: <500F44BA.1060802@gmail.com> <500F6732.8050604@gmail.com> Message-ID: On Tue, Jul 24, 2012 at 8:25 PM, Vincenzo Ampolo wrote: > On 07/24/2012 06:46 PM, Guido van Rossum wrote: >> >> You're welcome. > > Hi Guido, > > I'm glad you spent your time reading my mail. I would have never > imagined that my mail could come to your attention. Stop brownnosing already. :-) If you'd followed python-dev you'd known I read it. >> Have you read PEP 410 and my rejection of it >> (http://mail.python.org/pipermail/python-dev/2012-February/116837.html)? 
>> Even though that's about using Decimal for timestamps, it could still >> be considered related. > > I've read it and point 5 is very like in this issue. You said: > > "[...] > I see only one real use case for nanosecond precision: faithful > copying of the mtime/atime recorded by filesystems, in cases where the > filesystem (like e.g. ext4) records these times with nanosecond > precision. Even if such timestamps can't be trusted to be accurate, > converting them to floats and back loses precision, and verification > using tools not written in Python will flag the difference. But for > this specific use case a much simpler set of API changes will suffice; > only os.stat() and os.utime() need to change slightly (and variants of > os.stat() like os.fstat()). > [...]" > > I think that's based on a wrong hypothesis: just one case -> let's > handle in a different way (modifying os.stat() and os.utime()). > I would say: It's not just one case, there are at lest other two > scenarios. One is like mine, parsing network packets, the other one is > in parsing stock trading data. > But in this case there is no os.stat() or os.utime() that can be > modified. I've to write my own class to deal with time and loose all the > power and flexibility that the datetime module adds to the python language. Also, this use case is unlike the PEP 410 use case, because the timestamps there use a numeric type, not datetime (and that was separately argued). >> Not every use case deserves an API change. :-) >> >> First you will have to show how you'd have to code this *without* >> nanosecond precision in datetime and how tedious that is. (I expect >> that representing the timestamp as a long integer expressing a posix >> timestamp times a billion would be very reasonable.) > > Yeah that's exactly how we built our Time class to handle this, and we > wrote also a Duration class to represent timedelta. 
> The code we developed is 383 python lines long but is not comparable > with all the functionalities that the datetime module offers and it's > also really slow compared to native datetime module which is written in C. So what functionality specifically do you require? You speak in generalities but I need specifics. > As you may think using that approach in a web application is very > limiting since there is no strftime() in this custom class. Apparently you didn't need it? :-) Web frameworks usually have their own date/time formatting anyway. > I cannot share the code right now since It's copyrighted by the company > I work for but I've asked permission to do so. I just need to wait > tomorrow morning (PDT time) so they approve my request. Looking at the > code you can see how tedious is to try to remake all the conversions > that are already implemented on the datetime module. > Just let me know if you actually want to have a look at the code. I believe you. >> I didn't read the entire bug, but it mentioned something about storing >> datetimes in databases. Do databases support nanosecond precision? >> > > Yeah. According to > http://wiki.ispirer.com/sqlways/postgresql/data-types/timestamp at least > Oracle support timestamps with nanoseconds accuracy, SQL server supports > 100 nanosecond accuracy. > Since I use Postgresql personally the best way to accomplish it (also > suggested by the #postgresql on freenode) is to store the timestamp with > nanosecond (like 1343158283.880338907242) as bigint and let the ORM (so > a python ORM) do all the conversion job. > An yet again, having nanosecond support in datetime would make things > much more easy. How so, given that the database you use doesn't support it? > While writing this mail Chris Lambacher answered with more data about > nanosecond support on databases Thanks, Chris. TBH, I think that adding nanosecond precision to the datetime type is not unthinkable. 
You'll have to come up with some clever backward compatibility in the API though, and that will probably be a bit ugly (you'd have a microsecond parameter with a range of 0-1000000 and a nanosecond parameter with a range of 0-1000). Also the space it takes in memory would probably increase (there's no room for an extra 10 bits in the carefully arranged 8-byte internal representation). But let me be clear -- are you willing to help implement any of this? You can't just order a feature, you know... -- --Guido van Rossum (python.org/~guido) From vincenzo.ampolo at gmail.com Wed Jul 25 10:03:58 2012 From: vincenzo.ampolo at gmail.com (Vincenzo Ampolo) Date: Wed, 25 Jul 2012 01:03:58 -0700 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: References: <500F44BA.1060802@gmail.com> <500F6732.8050604@gmail.com> Message-ID: <500FA86E.3080405@gmail.com> On 07/24/2012 08:47 PM, Guido van Rossum wrote: > > So what functionality specifically do you require? You speak in > generalities but I need specifics. The ability to think of datetime.datetime as a flexible class that can give you the representation you need when you need it. To be more specific, think about this case: a user selects the year, month, day, hour, minute, millisecond, and nanosecond of a network event from a browser; the javascript code does an ajax call with the time in this format (a variant of iso8601): YYYY-MM-DDTHH:MM:SS.mmmmmmnnn (where nnn is the nanosecond representation). The Python server takes that string, converts it to a datetime, does all the math with its data, and gives the output back, labeling data with int(nano_datetime.strftime('MMSSmmmmmmnnn')), so I have a sequence number that javascript can sort and handle easily. It's this flexibility of conversion I'm talking about. > >> As you may think, using that approach in a web application is very >> limiting, since there is no strftime() in this custom class. > > Apparently you didn't need it?
:-) Web frameworks usually have their > own date/time formatting anyway. Which is usually derived from Python's datetime, as in web2py ( http://web2py.com/books/default/chapter/29/6#Record-representation ), in which timestamps are real Python datetime objects and it's the ORM's responsibility to find the right representation of that data at the database level. This leads, as you know, to one of the main advantages of any ORM: abstracting away the database layer and the SQL syntax. The same applies to another well known framework, Django (your personal favorite :) ), in which DateTimeField ( https://docs.djangoproject.com/en/dev/ref/models/fields/#django.db.models.DateTimeField ) is a date and time represented in Python by a datetime.datetime instance. We didn't need to build a webapp yet; I've been hired for it :) so I'll do so very soon. Unluckily, if datetime does not support nanoseconds, I cannot blame any ORM for not supporting it natively. > >>> I didn't read the entire bug, but it mentioned something about storing >>> datetimes in databases. Do databases support nanosecond precision? >>> >> >> Yeah. According to >> http://wiki.ispirer.com/sqlways/postgresql/data-types/timestamp at least >> Oracle supports timestamps with nanosecond accuracy, and SQL Server supports >> 100 nanosecond accuracy. >> Since I use Postgresql personally, the best way to accomplish it (also >> suggested by #postgresql on freenode) is to store the timestamp with >> nanoseconds (like 1343158283.880338907242) as a bigint and let the ORM (so >> a Python ORM) do all the conversion work. >> And yet again, having nanosecond support in datetime would make things >> much easier. > > How so, given that the database you use doesn't support it? Isn't it the job of an ORM to abstract away the actual database (relational or not) so that people who use the ORM do not care about how data is represented behind it?
If so, it's the job of the ORM to figure out the best representation of the data on the given relational or non-relational database. > > TBH, I think that adding nanosecond precision to the datetime type is > not unthinkable. You'll have to come up with some clever backward > compatibility in the API though, and that will probably be a bit ugly > (you'd have a microsecond parameter with a range of 0-1000000 and a > nanosecond parameter with a range of 0-1000). Also the space it takes > in memory would probably increase (there's no room for an extra 10 > bits in the carefully arranged 8-byte internal representation). Sure, those are all open issues, but as soon as you are in favour of adding nanosecond support we can start addressing them. I'm sure there would be other people here that would like to participate in those issues too. > > But let me be clear -- are you willing to help implement any of this? > You can't just order a feature, you know... > Of course, as I wrote in my second message in the issue ( http://bugs.python.org/issue15443#msg166333 ) I'm ready and excited to contribute to the Python core if I can. Best Regards, -- Vincenzo Ampolo http://vincenzo-ampolo.net http://goshawknest.wordpress.com From lists at cheimes.de Wed Jul 25 11:24:14 2012 From: lists at cheimes.de (Christian Heimes) Date: Wed, 25 Jul 2012 11:24:14 +0200 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: References: <500F44BA.1060802@gmail.com> Message-ID: <500FBB3E.60404@cheimes.de> Am 25.07.2012 03:46, schrieb Guido van Rossum: > First you will have to show how you'd have to code this *without* > nanosecond precision in datetime and how tedious that is. (I expect > that representing the timestamp as a long integer expressing a posix > timestamp times a billion would be very reasonable.) I'd vote for two separate numbers, the first similar to JDN (Julian Day Number [1]), the second for nanoseconds per day. 3600 * 1000000 fit nicely into an unsigned 32bit int.
This approach has the neat benefit that we'd get rid of the time_t limitations and the year 2038 bug at once. IIRC datetime used to break for dates before 1970 on some systems because time_t was unsigned. Python could finally support dates BC! JDN is widely used by astronomers and historians to support a wide range of years as well as to convert between calendar systems. Its day 0 is January 1, 4713 BC in the proleptic Julian calendar. The conversion between the Julian and Gregorian calendars makes JDN hard to use. Rata Die (January 1, 1 AD at midnight in the proleptic Gregorian calendar) sounds like a good idea. People in need of a high precision timer should also consider TAI [2] instead of UTC, as TAI doesn't have leap seconds. DJB's daemontools specifies a tai64n log format [3] that is similar to your idea. Christian [1] http://en.wikipedia.org/wiki/Julian_Day_Number [2] http://en.wikipedia.org/wiki/International_Atomic_Time [3] http://cr.yp.to/daemontools/tai64n.html From solipsis at pitrou.net Wed Jul 25 13:48:53 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Wed, 25 Jul 2012 13:48:53 +0200 Subject: [Python-Dev] datetime nanosecond support References: <500F44BA.1060802@gmail.com> <500FBB3E.60404@cheimes.de> Message-ID: <20120725134853.3d98b87f@pitrou.net> On Wed, 25 Jul 2012 11:24:14 +0200 Christian Heimes wrote: > Am 25.07.2012 03:46, schrieb Guido van Rossum: > > First you will have to show how you'd have to code this *without* > > nanosecond precision in datetime and how tedious that is. (I expect > > that representing the timestamp as a long integer expressing a posix > > timestamp times a billion would be very reasonable.) > > I'd vote for two separate numbers, the first similar to JDN (Julian Day > Number [1]), the second for nanoseconds per day. 3600 * 1000000 fit > nicely into an unsigned 32bit int. But 24 * 3600 * 1e9 doesn't. Perhaps I didn't understand your proposal. Regards Antoine.
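The field widths at stake in this exchange are easy to check directly; the following sketch (variable names are just for illustration) computes the bit counts rather than assuming them:

```python
# How many bits do the proposed sub-day fields actually need?
us_per_day = 24 * 3600 * 10**6   # microseconds in a civil day
ns_per_day = 24 * 3600 * 10**9   # nanoseconds in a civil day
ps_per_day = 24 * 3600 * 10**12  # picoseconds in a civil day

print(us_per_day.bit_length())   # 37 -> even microseconds per day overflow 32 bits
print(ns_per_day.bit_length())   # 47 -> nanoseconds need a 64-bit field
print(ps_per_day.bit_length())   # 57 -> picoseconds would still fit in 64 bits
```

These are the 47-bit and 57-bit figures Christian arrives at in his follow-up.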
-- Software development and contracting: http://pro.pitrou.net From ncoghlan at gmail.com Wed Jul 25 14:11:13 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 25 Jul 2012 22:11:13 +1000 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: <500FBB3E.60404@cheimes.de> References: <500F44BA.1060802@gmail.com> <500FBB3E.60404@cheimes.de> Message-ID: On Wed, Jul 25, 2012 at 7:24 PM, Christian Heimes wrote: > Am 25.07.2012 03:46, schrieb Guido van Rossum: > I'd vote for two separate numbers, the first similar to JDN (Julian Day > Number [1]), the second for nanoseconds per day. 3600 * 1000000 fit > nicely into an unsigned 32bit int. This approach has the neat benefit > that we'd get rid of the timestamp_t limitations and year 2038 bug at > once. IIRC datetime used to break for dates before 1970 on some system > because timestamp_t was unsigned. Python could finally support dates BC! Alternatively, use Decimal as the internal representation (backed by cdecimal if additional speed is needed). However, rather than getting buried in the weeds right here: 1. For the reasons presented, I think it's worth attempting to define a common API that is based on datetime, but is tailored towards high precision time operations (at least using a different internal representation, perhaps supporting TAI). 2. I don't think the stdlib is the right place to define the initial version of this. It seems most sensible to first fork the pure Python version of datetime, figure out the details to get that working as a new distribution on PyPI, and then fork the C implementation to make the PyPI version faster. Assuming it can be completed in time, the revised API could then be brought back as a PEP (alternatively, depending on the details of the proposal, the use case may be deemed sufficiently rare that it is just kept as a specialist module on PyPI). Cheers, Nick. 
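As a rough sketch of the Decimal-backed representation idea, here is a toy class (the name and API are purely illustrative assumptions, not a proposed design) showing that nanosecond differences survive exactly where binary floats would round:

```python
from decimal import Decimal, getcontext

getcontext().prec = 30  # plenty of digits for epoch seconds plus nanoseconds

class HiResTimestamp:
    """Toy timestamp: seconds since the POSIX epoch, held as an exact Decimal."""

    def __init__(self, seconds):
        # Accept a str (or Decimal) to avoid binary floating-point rounding
        self.seconds = Decimal(seconds)

    def __sub__(self, other):
        # Differences stay exact down to the last stored digit
        return self.seconds - other.seconds

a = HiResTimestamp("1343158283.880338907")
b = HiResTimestamp("1343158283.880338906")
print(a - b)  # 1E-9, i.e. a one-nanosecond difference is preserved exactly
```

With a float-based representation, both of the values above collapse to the same 53-bit double and the difference is lost.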
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From lists at cheimes.de Wed Jul 25 15:59:03 2012 From: lists at cheimes.de (Christian Heimes) Date: Wed, 25 Jul 2012 15:59:03 +0200 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: <20120725134853.3d98b87f@pitrou.net> References: <500F44BA.1060802@gmail.com> <500FBB3E.60404@cheimes.de> <20120725134853.3d98b87f@pitrou.net> Message-ID: Am 25.07.2012 13:48, schrieb Antoine Pitrou: >> I'd vote for two separate numbers, the first similar to JDN (Julian Day >> Number [1]), the second for nanoseconds per day. 3600 * 1000000 fit >> nicely into an unsigned 32bit int. > > But 24 * 3600 * 1e9 doesn't. Perhaps I didn't understand your proposal. What the h... was I thinking? I confused nano with micro and forgot the hours, how embarrassing. :( days ---- 32bit signed integer number of days since Jan 1, 1 AD in the proleptic Gregorian calendar (aka modern civil calendar). That's Rata Die minus one day, since it defines Jan 1, 1 AD as day 1. This allows dates between 5.8 million years in the past and 5.8 million years in the future ((1<<31) // 365.242 ~ 5879618). nanoseconds ----------- 64bit signed or unsigned integer more than enough for nanosecond granularity (47 bits); we could easily push it to picosecond resolution (57 bits) in the future. Christian From lists at cheimes.de Wed Jul 25 16:11:25 2012 From: lists at cheimes.de (Christian Heimes) Date: Wed, 25 Jul 2012 16:11:25 +0200 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: References: <500F44BA.1060802@gmail.com> <500FBB3E.60404@cheimes.de> Message-ID: <500FFE8D.8040207@cheimes.de> Am 25.07.2012 14:11, schrieb Nick Coghlan: > 1. For the reasons presented, I think it's worth attempting to define > a common API that is based on datetime, but is tailored towards high > precision time operations (at least using a different internal > representation, perhaps supporting TAI). This is a great opportunity to implement two requests at once.
Some people want high precision datetime objects while others would like to see a datetime implementation that works with dates BC. >>> from datetime import datetime, timedelta >>> epoch = datetime(1970, 1, 1) >>> epoch - timedelta(days=1969*365.242) datetime.datetime(1, 1, 1, 12, 2, 52, 799998) >>> epoch - timedelta(days=1970*365.242) Traceback (most recent call last): File "<stdin>", line 1, in <module> OverflowError: date value out of range > 2. I don't think the stdlib is the right place to define the initial > version of this. +1 From charlesc-lists-python-dev2 at pyropus.ca Wed Jul 25 16:21:25 2012 From: charlesc-lists-python-dev2 at pyropus.ca (Charles Cazabon) Date: Wed, 25 Jul 2012 08:21:25 -0600 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: References: <500F44BA.1060802@gmail.com> <500FBB3E.60404@cheimes.de> <20120725134853.3d98b87f@pitrou.net> Message-ID: <20120725142125.GA29390@pyropus.ca> Christian Heimes wrote: > > days > ---- > 32bit signed integer > > number of days since Jan 1, 1 AD in the proleptic Gregorian calendar (aka > modern civil calendar). That's Rata Die minus one day, since it defines > Jan 1, 1 AD as day 1. This allows dates between 5.8 million years in the past > and 5.8 million years in the future ((1<<31) // 365.242 ~ 5879618). > > nanoseconds > ----------- > 64bit signed or unsigned integer > > more than enough for nanosecond granularity (47 bits); we could easily > push it to picosecond resolution (57 bits) in the future. An alternate strategy might be to use tai64/tai64n/tai64na, which can represent any time over the course of a few hundred billion years to second/nanosecond/attosecond precision, respectively. They're well-defined, and there's a fair bit of software that can use or manipulate dates in these formats. tai64 is defined similarly to the proleptic Gregorian calendar in that it uses an idealized 24*60*60 second day, etc.
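For the curious, the tai64n label format referenced above [3] is straightforward to produce; this sketch assumes its input is already a TAI second count (leap-second bookkeeping is deliberately ignored):

```python
def tai64n_label(seconds, nanoseconds):
    """Return the printable tai64n label for a (TAI seconds, nanoseconds) pair.

    Per DJB's spec, the external format is 12 bytes: 2**62 + seconds as a
    big-endian value in the first 8 bytes, the nanosecond count in the last 4;
    the printable form is '@' followed by those bytes in hex.
    """
    return "@%016x%08x" % (2**62 + seconds, nanoseconds)

print(tai64n_label(0, 0))          # @400000000000000000000000  (the TAI epoch)
print(tai64n_label(1, 500000000))  # @40000000000000011dcd6500
```

Since the seconds field is a plain integer, dates billions of years before or after the epoch encode without any calendar logic at all.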
Charles -- ----------------------------------------------------------------------- Charles Cazabon GPL'ed software available at: http://pyropus.ca/software/ ----------------------------------------------------------------------- From guido at python.org Wed Jul 25 16:38:52 2012 From: guido at python.org (Guido van Rossum) Date: Wed, 25 Jul 2012 07:38:52 -0700 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: <500FFE8D.8040207@cheimes.de> References: <500F44BA.1060802@gmail.com> <500FBB3E.60404@cheimes.de> <500FFE8D.8040207@cheimes.de> Message-ID: On Wed, Jul 25, 2012 at 7:11 AM, Christian Heimes wrote: > Am 25.07.2012 14:11, schrieb Nick Coghlan: >> 1. For the reasons presented, I think it's worth attempting to define >> a common API that is based on datetime, but is tailored towards high >> precision time operations (at least using a different internal >> representation, perhaps supporting TAI). > > This is a great opportunity to implement two requests at once. Some > people want high precision datetime objects while others would like to > see a datetime implementation that works with dates BC. Beware, people requesting dates BC rarely know what they are asking for. (E.g. Jesus wasn't born on 12/25/0001.) The calendrical ambiguities are such that representing dates that far in the past is better left to a specialized class. Read the original discussions about the datetime type; it loses meaning for dates long ago even if it can represent them, but the choice was made to ignore these and to offer a uniform abstraction for 1 <= year <= 9999. TBH I'm more worried about years >= 10000. :-) >>>> from datetime import datetime, timedelta >>>> epoch = datetime(1970, 1, 1) >>>> epoch - timedelta(days=1969*365.242) > datetime.datetime(1, 1, 1, 12, 2, 52, 799998) >>>> epoch - timedelta(days=1970*365.242) > Traceback (most recent call last): > File "<stdin>", line 1, in <module> > OverflowError: date value out of range > > >> 2.
I don't think the stdlib is the right place to define the initial >> version of this. > > +1 +1 -- --Guido van Rossum (python.org/~guido) From martin at v.loewis.de Wed Jul 25 19:02:09 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Wed, 25 Jul 2012 19:02:09 +0200 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: References: <500F44BA.1060802@gmail.com> <500FBB3E.60404@cheimes.de> Message-ID: <20120725190209.Horde.z0JGVKGZi1VQECaRKA_jMDA@webmail.df.eu> > 1. For the reasons presented, I think it's worth attempting to define > a common API that is based on datetime, but is tailored towards high > precision time operations (at least using a different internal > representation, perhaps supporting TAI). IIUC, the concern is about the DB-API, which does sort-of mandate that the datetime module is used to represent SQL time types. So not basing this off datetime isn't really an option. Providing a subtype should work, though. Regards, Martin From chris.jerdonek at gmail.com Thu Jul 26 01:28:38 2012 From: chris.jerdonek at gmail.com (Chris Jerdonek) Date: Wed, 25 Jul 2012 16:28:38 -0700 Subject: [Python-Dev] feature freeze questions Message-ID: I have a few questions about feature freeze: (1) Is increasing test coverage acceptable during feature freeze, even if it does not involve fixing a bug? (2) When adding new tests (e.g. in the course of fixing a bug or increasing test coverage), are we allowed to refactor other tests so that supporting test code can be shared? Or should the tests be added in a less DRY fashion and refactored only after the branch goes back to pre-alpha? (3) What types of documentation changes are allowed during feature freeze? For example, are we only allowed to fix incorrect information, or is it acceptable to improve or add to the information about existing functionality? 
Thanks, --Chris From benjamin at python.org Thu Jul 26 01:29:45 2012 From: benjamin at python.org (Benjamin Peterson) Date: Wed, 25 Jul 2012 16:29:45 -0700 Subject: [Python-Dev] feature freeze questions In-Reply-To: References: Message-ID: 2012/7/25 Chris Jerdonek : > I have a few questions about feature freeze: > > (1) Is increasing test coverage acceptable during feature freeze, even > if it does not involve fixing a bug? > > (2) When adding new tests (e.g. in the course of fixing a bug or > increasing test coverage), are we allowed to refactor other tests so > that supporting test code can be shared? Or should the tests be added > in a less DRY fashion and refactored only after the branch goes back > to pre-alpha? You can do basically anything you want to tests just as long as you don't make them less stable. > > (3) What types of documentation changes are allowed during feature > freeze? For example, are we only allowed to fix incorrect > information, or is it acceptable to improve or add to the information > about existing functionality? All documentation changes are accepted. -- Regards, Benjamin From ncoghlan at gmail.com Thu Jul 26 03:03:05 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 26 Jul 2012 11:03:05 +1000 Subject: [Python-Dev] [Python-checkins] cpython (3.2): Issue #15445: Updated logging configuration documentation to highlight In-Reply-To: <3Wj4WF1jHjzPCy@mail.python.org> References: <3Wj4WF1jHjzPCy@mail.python.org> Message-ID: On Thu, Jul 26, 2012 at 4:20 AM, vinay.sajip wrote: > + .. note:: Because portions of the configuration are passed through > + :func:`eval`, use of this function may open its users to a security risk. > + While the function only binds to a socket on ``localhost``, and so does > + not accept connections from remote machines, there are scenarios where > + untrusted code could be run under the account of the process which calls > + :func:`listen`. 
Specifically, if the process calling :func:`listen` runs > + on a multi-user machine where users cannot trust each other, then a > + malicious user could arrange to run essentially arbitrary code in a > + victim user's process, simply by connecting to the victim's > + :func:`listen` socket and sending a configuration which runs whatever > + code the attacker wants to have executed in the victim's process. This is > + especially easy to do if the default port is used, but not hard even if a > + different port is used. Looking at PEP 391, it appears it should be possible to replace the current use of eval() with a combination of the much safer ast.literal_eval() and the str.format attribute/item access micro-language. Worth exploring for 3.4 (http://bugs.python.org/issue15452), as it would be better to actually try to close this attack vector rather than just documenting that it exists. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From chris at simplistix.co.uk Thu Jul 26 09:59:19 2012 From: chris at simplistix.co.uk (Chris Withers) Date: Thu, 26 Jul 2012 08:59:19 +0100 Subject: [Python-Dev] Who's maintaining Sphinx nowadays? Message-ID: <5010F8D7.6090502@simplistix.co.uk> Hi All, Sorry if this is considered slightly off topic, but Sphinx is the tool we use for the Python core docs so: Who's looking after Sphinx nowadays?
I've hit what I consider to be a bug: https://groups.google.com/group/sphinx-dev/browse_thread/thread/197fc26ba570913d?hl=en So I forked on bitbucket and wanted to work up a fix, but running the unit tests on my fresh fork was problematic: https://groups.google.com/group/sphinx-dev/browse_thread/thread/1e1f45978a84577?hl=en The sphinx-dev group seems devoid of sphinx developers currently, so thought I'd ask here to see if anyone could point me in the right direction ;-) cheers, Chris -- Simplistix - Content Management, Batch Processing & Python Consulting - http://www.simplistix.co.uk From eliben at gmail.com Thu Jul 26 10:24:26 2012 From: eliben at gmail.com (Eli Bendersky) Date: Thu, 26 Jul 2012 11:24:26 +0300 Subject: [Python-Dev] feature freeze questions In-Reply-To: References: Message-ID: >> I have a few questions about feature freeze: >> >> (1) Is increasing test coverage acceptable during feature freeze, even >> if it does not involve fixing a bug? >> >> (2) When adding new tests (e.g. in the course of fixing a bug or >> increasing test coverage), are we allowed to refactor other tests so >> that supporting test code can be shared? Or should the tests be added >> in a less DRY fashion and refactored only after the branch goes back >> to pre-alpha? > > You can do basically anything you want to tests just as long as you > don't make them less stable. > >> >> (3) What types of documentation changes are allowed during feature >> freeze? For example, are we only allowed to fix incorrect >> information, or is it acceptable to improve or add to the information >> about existing functionality? > > All documentation changes are accepted. A follow-up question: during the RC stage, do documentation & test changes require code review by an additional core dev? This also leads me to think about minor (= bugfix) releases. For example, current commits to the 3.2 branch will (presumably) go into 3.2.4 once it's out. 
Should these commits have the same restrictions as commits to an upcoming release that's in RC stage? Common sense would suggest they should (since any change can break existing functionality and lead to regressions in a previously stable release), but in practice I don't think this is followed. At least not to the letter, with the devguide stating: You cannot skip the peer review during an RC, no matter how small! Even if it is a simple copy-and-paste change, everything requires peer review from a core developer. Eli From ncoghlan at gmail.com Thu Jul 26 10:40:40 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 26 Jul 2012 18:40:40 +1000 Subject: [Python-Dev] feature freeze questions In-Reply-To: References: Message-ID: On Thu, Jul 26, 2012 at 6:24 PM, Eli Bendersky wrote: > You cannot skip the peer review during an RC, no matter how small! > Even if it is a simple copy-and-paste change, everything requires peer > review from a core developer. The extra restrictions during RC are more about not doing things that might force the release to be postponed while a broken change gets unwound. During normal maintenance, we have a bit more time to recover based on buildbot failures or post-checkin review. Another way of reading that paragraph is "once we hit RC, start postponing everything to the next bugfix release, *unless* at least 2 core devs agree it really needs to be in the current release" Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From lists at cheimes.de Thu Jul 26 15:00:52 2012 From: lists at cheimes.de (Christian Heimes) Date: Thu, 26 Jul 2012 15:00:52 +0200 Subject: [Python-Dev] datetime nanosecond support In-Reply-To: References: <500F44BA.1060802@gmail.com> <500FBB3E.60404@cheimes.de> <500FFE8D.8040207@cheimes.de> Message-ID: <50113F84.5060608@cheimes.de> Am 25.07.2012 16:38, schrieb Guido van Rossum: > Beware, people requesting dates BC rarely know what they are asking > for. (E.g. 
Jesus wasn't born on 12/25/0001.) The calendrical > ambiguities are such that representing dates that far in the past is > better left to a specialized class. Read the original discussions > about the datetime type; it loses meaning for dates long ago even if > it can represent them, but the choice was made to ignore these and to > offer a uniform abstraction for 1 <= year <= 9999. For starters. Calendars have more subtle edge cases, for example TAI has a 10 second offset from UTC plus 15 leap seconds. Or the leap year errors in the Julian calendar, which are handled differently in the proleptic Julian calendar, which has unsystematic leap years between 45 BC and 4 AD. The rotation velocity of the Earth isn't constant, either. It's a major PITB! > TBH I'm more worried about years >= 10000. :-) Why live in the past? The future is ... err the future! :) Christian From theller at ctypes.org Thu Jul 26 19:56:12 2012 From: theller at ctypes.org (Thomas Heller) Date: Thu, 26 Jul 2012 19:56:12 +0200 Subject: [Python-Dev] 2.7 releases Message-ID: Will there be more 2.7 bugfix releases, and when the next one? In other words; if I submit a patch and it is accepted, can I expect that patch be committed also to the 2.7 branch? Thanks, Thomas Been a long time that I've been here - but still using Python (2.7 now) more and more... From senthil at uthcode.com Thu Jul 26 20:15:52 2012 From: senthil at uthcode.com (Senthil Kumaran) Date: Thu, 26 Jul 2012 11:15:52 -0700 Subject: [Python-Dev] 2.7 releases In-Reply-To: References: Message-ID: On Thu, Jul 26, 2012 at 10:56 AM, Thomas Heller wrote: > Will there be more 2.7 bugfix releases, and when the next one? > > In other words; if I submit a patch and it is accepted, can I > expect that patch be committed also to the 2.7 branch? > > We are still back-porting bug fixes to the 2.7 branch, which means that yes, we will definitely see a release. Benjamin might be the right person to answer the question of when.
Thanks, Senthil From martin at v.loewis.de Thu Jul 26 20:16:00 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Thu, 26 Jul 2012 20:16:00 +0200 Subject: [Python-Dev] 2.7 releases In-Reply-To: References: Message-ID: <20120726201600.Horde.lF6WBtjz9kRQEYlgtzJ3-KA@webmail.df.eu> > Will there be more 2.7 bugfix releases Yes. > and when the next one? That's up for Benjamin to decide. My view is that one bugfix release every year is more than enough. > In other words; if I submit a patch and it is accepted, can I > expect that patch be committed also to the 2.7 branch? Don't you have commit rights still? Regards, Martin From theller at ctypes.org Thu Jul 26 20:50:21 2012 From: theller at ctypes.org (Thomas Heller) Date: Thu, 26 Jul 2012 20:50:21 +0200 Subject: [Python-Dev] 2.7 releases In-Reply-To: <20120726201600.Horde.lF6WBtjz9kRQEYlgtzJ3-KA@webmail.df.eu> References: <20120726201600.Horde.lF6WBtjz9kRQEYlgtzJ3-KA@webmail.df.eu> Message-ID: Am 26.07.2012 20:16, schrieb martin at v.loewis.de: >> Will there be more 2.7 bugfix releases > > Yes. > >> and when the next one? > > That's up for Benjamin to decide. My view is that one bugfix > release every year is more than enough. Ok. I expect we will still be using 2.7 next year in my company. >> In other words; if I submit a patch and it is accepted, can I >> expect that patch be committed also to the 2.7 branch? > > Don't you have commit rights still? I don't know. IIRC, I asked for them to be retracted some years ago. Anyway, I know nearly nothing about hg and don't have time to learn it.
So, I have uploaded a patch and asked for review (since I'm not 1000% sure that it is absolutely correct): http://bugs.python.org/issue15459 Thomas From tjreedy at udel.edu Thu Jul 26 21:47:28 2012 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 26 Jul 2012 15:47:28 -0400 Subject: [Python-Dev] 2.7 releases In-Reply-To: References: <20120726201600.Horde.lF6WBtjz9kRQEYlgtzJ3-KA@webmail.df.eu> Message-ID: On 7/26/2012 2:50 PM, Thomas Heller wrote: > Am 26.07.2012 20:16, schrieb martin at v.loewis.de: >> Don't you have commit rights still? > > I don't know. The tracker thinks you do. That is what the Python logo next to your name means. > Anyway, I know nearly nothing about hg and don't have time to > learn it. I started from nothing too. TortoiseHG and its GUI Workbench make the basics pretty easy. -- Terry Jan Reedy From g.brandl at gmx.net Thu Jul 26 22:07:16 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Thu, 26 Jul 2012 22:07:16 +0200 Subject: [Python-Dev] Who's maintaining Sphinx nowadays? In-Reply-To: <5010F8D7.6090502@simplistix.co.uk> References: <5010F8D7.6090502@simplistix.co.uk> Message-ID: Am 26.07.2012 09:59, schrieb Chris Withers: > Hi All, > > Sorry if this is considered slightly off topic, but Sphinx is the tool > we use for the Python core docs so: > > Who's looking after Sphinx nowadays? I've hit what I consider to be a bug: > > https://groups.google.com/group/sphinx-dev/browse_thread/thread/197fc26ba570913d?hl=en > > So I forked on bitbucket and wanted to work up a fix, but running the > unit tests on my fresh fork was problematic: > > https://groups.google.com/group/sphinx-dev/browse_thread/thread/1e1f45978a84577?hl=en > > The sphinx-dev group seems devoid of sphinx developers currently, so > thought I'd ask here to see if anyone could point me in the right > direction ;-) If you're patient enough, I'll take care of your problem eventually.
If not, and you're looking for a project to co-maintain, we can set something up :) Georg From chris at simplistix.co.uk Fri Jul 27 00:03:41 2012 From: chris at simplistix.co.uk (Chris Withers) Date: Thu, 26 Jul 2012 23:03:41 +0100 Subject: [Python-Dev] Who's maintaining Sphinx nowadays? In-Reply-To: References: <5010F8D7.6090502@simplistix.co.uk> Message-ID: <5011BEBD.8090908@simplistix.co.uk> Hi Georg, On 26/07/2012 21:07, Georg Brandl wrote: > If you're patient enough, I'll take care of your problem eventually. > > If not, and you're looking for a project to co-maintain, we can set > something up :) I'm certainly interested in helping out where I can; Sphinx has been a fantastic tool... It's mainly the test issues I'm worried about; the change in return code is the bug I was happy to work on. Does Sphinx have any CI anywhere? Chris -- Simplistix - Content Management, Batch Processing & Python Consulting - http://www.simplistix.co.uk From g.brandl at gmx.net Fri Jul 27 07:14:24 2012 From: g.brandl at gmx.net (Georg Brandl) Date: Fri, 27 Jul 2012 07:14:24 +0200 Subject: [Python-Dev] feature freeze questions In-Reply-To: References: Message-ID: Am 26.07.2012 10:24, schrieb Eli Bendersky: >>> I have a few questions about feature freeze: >>> >>> (1) Is increasing test coverage acceptable during feature freeze, even >>> if it does not involve fixing a bug? >>> >>> (2) When adding new tests (e.g. in the course of fixing a bug or >>> increasing test coverage), are we allowed to refactor other tests so >>> that supporting test code can be shared? Or should the tests be added >>> in a less DRY fashion and refactored only after the branch goes back >>> to pre-alpha? >> >> You can do basically anything you want to tests just as long as you >> don't make them less stable. >> >>> >>> (3) What types of documentation changes are allowed during feature >>> freeze?
For example, are we only allowed to fix incorrect >>> information, or is it acceptable to improve or add to the information >>> about existing functionality? >> >> All documentation changes are accepted. > > A follow-up question: during the RC stage, do documentation & test > changes require code review by an additional core dev? Test changes: yes, doc changes: only if you think it's necessary. The thing about the docs is: we can fix them basically at any time, since most people use the online version anyway, which is rebuilt daily. > This also leads me to think about minor (= bugfix) releases. For > example, current commits to the 3.2 branch will (presumably) go into > 3.2.4 once it's out. Should these commits have the same restrictions > as commits to an upcoming release that's in RC stage? Common sense > would suggest they should (since any change can break existing > functionality and lead to regressions in a previously stable release), But that can also happen for 3.2 -> 3.3. > but in practice I don't think this is followed. At least not to the > letter, with the devguide stating: > > You cannot skip the peer review during an RC, no matter how small! > Even if it is a simple copy-and-paste change, everything requires peer > review from a core developer. That is only for RC phase, yes. Reviews are always useful, of course (calling them "peer review" sounds funny to me though), and should be always done for larger patches -- which we are quite used to nowadays. Georg From status at bugs.python.org Fri Jul 27 18:07:10 2012 From: status at bugs.python.org (Python tracker) Date: Fri, 27 Jul 2012 18:07:10 +0200 (CEST) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20120727160710.3F4291C87E@psf.upfronthosting.co.za> ACTIVITY SUMMARY (2012-07-20 - 2012-07-27) Python tracker at http://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. 
Issues counts and deltas:
  open    3532 ( +4)
  closed 23716 (+58)
  total  27248 (+62)

Open issues with patches: 1514

Issues opened (48)
==================

#14330: don't use host python, use host search paths for host compiler
       http://bugs.python.org/issue14330  reopened by ezio.melotti
#15231: update PyPI upload doc to say --no-raw passed to rst2html.py
       http://bugs.python.org/issue15231  reopened by eric.araujo
#15402: Correct __sizeof__ support for struct
       http://bugs.python.org/issue15402  reopened by jcea
#15411: os.chmod() does not follow symlinks on Windows
       http://bugs.python.org/issue15411  opened by ishimoto
#15412: Note in documentation for weakrefs
       http://bugs.python.org/issue15412  opened by sbt
#15414: os.path.join behavior on Windows (ntpath.join) is unexpected a
       http://bugs.python.org/issue15414  opened by r.david.murray
#15415: Add temp_dir() and change_cwd() to test.support
       http://bugs.python.org/issue15415  opened by cjerdonek
#15417: Add support for csh and fish in venv activation scripts
       http://bugs.python.org/issue15417  opened by asvetlov
#15418: 2to3 docs should mention setup.py fixes required to install co
       http://bugs.python.org/issue15418  opened by techtonik
#15419: distutils: build_py_2to3 should use a separate build directory
       http://bugs.python.org/issue15419  opened by techtonik
#15421: Calendar.itermonthdates OverflowError
       http://bugs.python.org/issue15421  opened by ced
#15422: Get rid of PyCFunction_New macro
       http://bugs.python.org/issue15422  opened by asvetlov
#15424: __sizeof__ of array should include size of items
       http://bugs.python.org/issue15424  opened by Pankrat
#15425: Another strange Tracebacks with importlib
       http://bugs.python.org/issue15425  opened by amaury.forgeotdarc
#15426: On a x86_64 Linux workstation, the build-from-source is borked
       http://bugs.python.org/issue15426  opened by Shocked.Pancake
#15427: Describe use of args parameter of argparse.ArgumentParser.pars
       http://bugs.python.org/issue15427  opened by bethard
#15428: add "Name Collision" section to argparse docs
       http://bugs.python.org/issue15428  opened by bethard
#15430: Improve filecmp documentation
       http://bugs.python.org/issue15430  opened by orsenthil
#15431: Cannot build importlib.h on Windows
       http://bugs.python.org/issue15431  opened by amaury.forgeotdarc
#15432: gzip.py: mtime argument only since python 2.7
       http://bugs.python.org/issue15432  opened by gomyhr
#15436: __sizeof__ is not documented
       http://bugs.python.org/issue15436  opened by storchaka
#15437: Merge Doc/ACKS.txt names into Misc/ACKS
       http://bugs.python.org/issue15437  opened by cjerdonek
#15438: Incredible issue in math.pow
       http://bugs.python.org/issue15438  opened by andrea.bergamini
#15439: Include Misc/ACKS names into the documentation
       http://bugs.python.org/issue15439  opened by cjerdonek
#15440: multiprocess fails to re-raise exception which has mandatory a
       http://bugs.python.org/issue15440  opened by Bala.FA
#15441: test_posixpath fails on Japanese edition of Windows
       http://bugs.python.org/issue15441  opened by ishimoto
#15442: Expand the list of default dirs filecmp.dircmp ignores
       http://bugs.python.org/issue15442  opened by eli.bendersky
#15443: datetime module has no support for nanoseconds
       http://bugs.python.org/issue15443  opened by goshawk
#15444: Incorrectly written contributor's names
       http://bugs.python.org/issue15444  opened by storchaka
#15447: A file is not properly closed by webbrowser._invoke
       http://bugs.python.org/issue15447  opened by anton.barkovsky
#15448: utimes() functions fail with ENOSYS even when detected by conf
       http://bugs.python.org/issue15448  opened by richmose
#15450: Allow dircmp.subdirs to behave well under subclassing
       http://bugs.python.org/issue15450  opened by cjerdonek
#15451: PATH is not honored in subprocess.Popen in win32
       http://bugs.python.org/issue15451  opened by Grissiom.Gu
#15452: Eliminate the use of eval() in the logging config implementati
       http://bugs.python.org/issue15452  opened by ncoghlan
#15453: ctype with packed bitfields does not match native compiler
       http://bugs.python.org/issue15453  opened by Mike.Castle
#15454: Allow dircmp.report() output stream to be customized
       http://bugs.python.org/issue15454  opened by cjerdonek
#15455: index entries not showing up in glossary
       http://bugs.python.org/issue15455  opened by cjerdonek
#15457: consistent treatment of generator terminology
       http://bugs.python.org/issue15457  opened by cjerdonek
#15459: ctypes Structures with subclassed types in the _fields_ list
       http://bugs.python.org/issue15459  opened by theller
#15460: SQLite cursor.description is not DB-API compatible
       http://bugs.python.org/issue15460  opened by CharlieX
#15463: test_faulthandler can fail if install path is too long
       http://bugs.python.org/issue15463  opened by ned.deily
#15464: ssl: add set_msg_callback function
       http://bugs.python.org/issue15464  opened by tweksteen
#15465: Improved documentation for C API version info
       http://bugs.python.org/issue15465  opened by ncoghlan
#15466: Python/importlib.h is different on 32bit and 64bit
       http://bugs.python.org/issue15466  opened by amaury.forgeotdarc
#15467: Updating __sizeof__ tests
       http://bugs.python.org/issue15467  opened by storchaka
#15468: Edit docs to hide hashlib.md5()
       http://bugs.python.org/issue15468  opened by dholth
#15469: Correct __sizeof__ support for deque
       http://bugs.python.org/issue15469  opened by storchaka
#15470: Stuck/hang when reading ssl object
       http://bugs.python.org/issue15470  opened by seamus.mckenna

Most recent 15 issues with no replies (15)
==========================================

#15470: Stuck/hang when reading ssl object
       http://bugs.python.org/issue15470
#15469: Correct __sizeof__ support for deque
       http://bugs.python.org/issue15469
#15468: Edit docs to hide hashlib.md5()
       http://bugs.python.org/issue15468
#15467: Updating __sizeof__ tests
       http://bugs.python.org/issue15467
#15465: Improved documentation for C API version info
       http://bugs.python.org/issue15465
#15460: SQLite cursor.description is not DB-API compatible
       http://bugs.python.org/issue15460
#15455: index entries not showing up in glossary
       http://bugs.python.org/issue15455
#15448: utimes() functions fail with ENOSYS even when detected by conf
       http://bugs.python.org/issue15448
#15442: Expand the list of default dirs filecmp.dircmp ignores
       http://bugs.python.org/issue15442
#15440: multiprocess fails to re-raise exception which has mandatory a
       http://bugs.python.org/issue15440
#15432: gzip.py: mtime argument only since python 2.7
       http://bugs.python.org/issue15432
#15428: add "Name Collision" section to argparse docs
       http://bugs.python.org/issue15428
#15427: Describe use of args parameter of argparse.ArgumentParser.pars
       http://bugs.python.org/issue15427
#15425: Another strange Tracebacks with importlib
       http://bugs.python.org/issue15425
#15422: Get rid of PyCFunction_New macro
       http://bugs.python.org/issue15422

Most recent 15 issues waiting for review (15)
=============================================

#15469: Correct __sizeof__ support for deque
       http://bugs.python.org/issue15469
#15467: Updating __sizeof__ tests
       http://bugs.python.org/issue15467
#15464: ssl: add set_msg_callback function
       http://bugs.python.org/issue15464
#15459: ctypes Structures with subclassed types in the _fields_ list
       http://bugs.python.org/issue15459
#15457: consistent treatment of generator terminology
       http://bugs.python.org/issue15457
#15450: Allow dircmp.subdirs to behave well under subclassing
       http://bugs.python.org/issue15450
#15448: utimes() functions fail with ENOSYS even when detected by conf
       http://bugs.python.org/issue15448
#15447: A file is not properly closed by webbrowser._invoke
       http://bugs.python.org/issue15447
#15444: Incorrectly written contributor's names
       http://bugs.python.org/issue15444
#15441: test_posixpath fails on Japanese edition of Windows
       http://bugs.python.org/issue15441
#15437: Merge Doc/ACKS.txt names into Misc/ACKS
       http://bugs.python.org/issue15437
#15431: Cannot build importlib.h on Windows
       http://bugs.python.org/issue15431
#15430: Improve filecmp documentation
       http://bugs.python.org/issue15430
#15426: On a x86_64 Linux workstation, the build-from-source is borked
       http://bugs.python.org/issue15426
#15424: __sizeof__ of array should include size of items
       http://bugs.python.org/issue15424

Top 10 most discussed issues (10)
=================================

#15402: Correct __sizeof__ support for struct
       http://bugs.python.org/issue15402  35 msgs
#15438: Incredible issue in math.pow
       http://bugs.python.org/issue15438  21 msgs
#15441: test_posixpath fails on Japanese edition of Windows
       http://bugs.python.org/issue15441  17 msgs
#15437: Merge Doc/ACKS.txt names into Misc/ACKS
       http://bugs.python.org/issue15437  15 msgs
#15439: Include Misc/ACKS names into the documentation
       http://bugs.python.org/issue15439  14 msgs
#15443: datetime module has no support for nanoseconds
       http://bugs.python.org/issue15443  14 msgs
#14578: importlib doesn't check Windows registry for paths
       http://bugs.python.org/issue14578  13 msgs
#15381: Optimize BytesIO to so less reallocations when written, simila
       http://bugs.python.org/issue15381  11 msgs
#15444: Incorrectly written contributor's names
       http://bugs.python.org/issue15444  11 msgs
#1692335: Fix exception pickling: Move initial args assignment to BaseEx
       http://bugs.python.org/issue1692335  10 msgs

Issues closed (56)
==================

#4221: inconsistent exception from int is confusing
       http://bugs.python.org/issue4221  closed by flox
#7003: finish documentation of user local paths
       http://bugs.python.org/issue7003  closed by orsenthil
#7163: IDLE suppresses sys.stdout.write() return value
       http://bugs.python.org/issue7163  closed by loewis
#7578: Behavior of operations on a closed file object is not document
       http://bugs.python.org/issue7578  closed by pitrou
#7710: Inconsistent Exception for int() conversion
       http://bugs.python.org/issue7710  closed by flox
#7996: concurrency problem in regrtest -jX
       http://bugs.python.org/issue7996  closed by rosslagerwall
#9610: buildbot: uncaptured python exception (smtpd), but no failure
       http://bugs.python.org/issue9610  closed by flox
#9692: UnicodeDecodeError in ElementTree.tostring()
       http://bugs.python.org/issue9692  closed by flox
#9772: test_pep277 failure on AMD64 debian parallel buildbot
       http://bugs.python.org/issue9772  closed by flox
#9773: test_tarfile fails because of inaccurate mtime on AMD64 debian
       http://bugs.python.org/issue9773  closed by flox
#9783: _elementtree.c warnings under 64-bit Windows
       http://bugs.python.org/issue9783  closed by flox
#10017: pprint.pprint raises TypeError on dictionaries with user-defin
       http://bugs.python.org/issue10017  closed by flox
#12353: argparse cannot handle empty arguments
       http://bugs.python.org/issue12353  closed by r.david.murray
#13818: argparse: -h listening required options under optional argumen
       http://bugs.python.org/issue13818  closed by bethard
#13922: argparse handling multiple "--" in args improperly
       http://bugs.python.org/issue13922  closed by r.david.murray
#14197: OS X framework builds do not create ABI-suffixed libpython3.x
       http://bugs.python.org/issue14197  closed by ned.deily
#14392: type=bool doesn't raise error in argparse.Action
       http://bugs.python.org/issue14392  closed by shima__shima
#14449: argparse optional arguments should follow getopt_long(3)
       http://bugs.python.org/issue14449  closed by bethard
#14579: CVE-2012-2135: Vulnerability in the utf-16 decoder after error
       http://bugs.python.org/issue14579  closed by pitrou
#14895: test_warnings.py EnvironmentVariableTests is a bad test
       http://bugs.python.org/issue14895  closed by brett.cannon
#14998: pprint._safe_key is not always safe enough
       http://bugs.python.org/issue14998  closed by flox
#15041: tkinter documentation: update "see also" list
       http://bugs.python.org/issue15041  closed by asvetlov
#15063: Source code links for JSON documentation
       http://bugs.python.org/issue15063  closed by orsenthil
#15091: ImportError when package is symlinked on Unix
       http://bugs.python.org/issue15091  closed by brett.cannon
#15093: ntpath.isdir returns False for directory symlinks
       http://bugs.python.org/issue15093  closed by jason.coombs
#15094: Incorrectly placed #endif in _tkinter.c.
       http://bugs.python.org/issue15094  closed by asvetlov
#15168: Move importlib.test to test.importlib
       http://bugs.python.org/issue15168  closed by brett.cannon
#15184: Test failure in test_sysconfig_module
       http://bugs.python.org/issue15184  closed by ned.deily
#15188: test_ldshared_value failure on OS X using python.org Pythons
       http://bugs.python.org/issue15188  closed by ned.deily
#15195: test_distutils fails when ARCHFLAGS is set on a Mac
       http://bugs.python.org/issue15195  closed by ned.deily
#15232: email.generator.Generator doesn't mangle "From " lines in MIME
       http://bugs.python.org/issue15232  closed by r.david.murray
#15250: document that filecmp.dircmp comparisons are "shallow"
       http://bugs.python.org/issue15250  closed by orsenthil
#15271: argparse: repeatedly specifying the same argument ignores the
       http://bugs.python.org/issue15271  closed by bethard
#15320: thread-safety issue in regrtest.main()
       http://bugs.python.org/issue15320  closed by pitrou
#15327: Argparse: main arguments and subparser arguments indistinguish
       http://bugs.python.org/issue15327  closed by bethard
#15336: Argparse required arguments incorrectly displayed as optional
       http://bugs.python.org/issue15336  closed by bethard
#15353: ld: library not found for -lpython3.3m because of wrong LDFLAG
       http://bugs.python.org/issue15353  closed by ned.deily
#15406: Deprecation Warning fix on ctypes module test
       http://bugs.python.org/issue15406  closed by meador.inge
#15407: CSV parser fails to iterate properly on 2.6.6
       http://bugs.python.org/issue15407  closed by r.david.murray
#15409: Deprecation Warning fix on cookiejar module
       http://bugs.python.org/issue15409  closed by meador.inge
#15410: urllib.request.Request.is_unverifiable deprecation documentati
       http://bugs.python.org/issue15410  closed by meador.inge
#15413: os.times() disappeared under Windows
       http://bugs.python.org/issue15413  closed by pitrou
#15416: 3 * [] gives a list of 3 cross-referenced lists, a[1]='blah' w
       http://bugs.python.org/issue15416  closed by amaury.forgeotdarc
#15420: issue6964 reminder
       http://bugs.python.org/issue15420  closed by loewis
#15423: Minidom tries to parse tags inside of