From storchaka at gmail.com Sat Jul 1 04:16:32 2017
From: storchaka at gmail.com (Serhiy Storchaka)
Date: Sat, 1 Jul 2017 11:16:32 +0300
Subject: [Python-Dev] Summary of Python tracker Issues
In-Reply-To: <20170630160920.087DD11A882@psf.upfronthosting.co.za>
References: <20170630160920.087DD11A882@psf.upfronthosting.co.za>
Message-ID: 

30.06.17 19:09, Python tracker пише:
> ACTIVITY SUMMARY (2017-06-23 - 2017-06-30)
> Python tracker at http://bugs.python.org/
>
> To view or respond to any of the issues listed below, click on the issue.
> Do NOT respond to this message.
>
> Issues counts and deltas:
> open 6006 (-20)

Victor closed half a hundred of his issues.

From storchaka at gmail.com Sat Jul 1 04:19:28 2017
From: storchaka at gmail.com (Serhiy Storchaka)
Date: Sat, 1 Jul 2017 11:19:28 +0300
Subject: [Python-Dev] Summary of Python tracker Issues
In-Reply-To: <20170623160917.C695056BD7@psf.upfronthosting.co.za>
References: <20170623160917.C695056BD7@psf.upfronthosting.co.za>
Message-ID: 

23.06.17 19:09, Python tracker пише:
>
> ACTIVITY SUMMARY (2017-06-16 - 2017-06-23)
> Python tracker at http://bugs.python.org/
>
> To view or respond to any of the issues listed below, click on the issue.
> Do NOT respond to this message.
>
> Issues counts and deltas:
> open 6026 ( -8)

Terry closed a third of a hundred outdated IDLE issues.

From victor.stinner at gmail.com Sat Jul 1 11:09:52 2017
From: victor.stinner at gmail.com (Victor Stinner)
Date: Sat, 1 Jul 2017 17:09:52 +0200
Subject: [Python-Dev] Summary of Python tracker Issues
In-Reply-To: 
References: <20170630160920.087DD11A882@psf.upfronthosting.co.za>
Message-ID: 

Le 1 juil. 2017 10:18 AM, "Serhiy Storchaka" a écrit :

Victor closed half a hundred of his issues.


Let me elaborate :-) I am still learning the new GitHub workflow, and it's
common that I forget to close issues after merging a change. I had many
issues that I forgot to update since we moved to GitHub.

I also had complex bugs split into small issues, and it took time to fix all
branches and to wait for buildbots to confirm that it's really fixed.

To finish, I closed a lot of issues that were older than 2 years because the
bug was fixed in the meantime in another issue, or just because I lost track
of the issue and also lost interest, and I don't consider that the bug or
feature was worth it.

Victor
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From bhavishyagopesh at gmail.com Sat Jul 1 12:02:56 2017
From: bhavishyagopesh at gmail.com (Bhavishya)
Date: Sat, 1 Jul 2017 21:32:56 +0530
Subject: [Python-Dev] Lazy_loading to pickle/unpickle_pure?
Message-ID: 

Hi,
I added lazy_loading in pickle.py. Here are some statistics if you consider
them of any importance:

1)Unpickle_pure:
Lazy -> unpickle_pure_python: Mean +- std dev: 728 us +- 24 us

Original -> unpickle_pure_python: Mean +- std dev: 771 us +- 22 us

2)Pickle_pure:

Lazy -> pickle_pure_python: Mean +- std dev: 919 us +- 18 us

Original -> pickle_pure_python: Mean +- std dev: 1.04 ms +- 0.03 ms

Thanks
bhavishya
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From victor.stinner at gmail.com Sat Jul 1 18:27:42 2017
From: victor.stinner at gmail.com (Victor Stinner)
Date: Sun, 2 Jul 2017 00:27:42 +0200
Subject: [Python-Dev] Lazy_loading to pickle/unpickle_pure?
In-Reply-To: 
References: 
Message-ID: 

What is lazy loading? How does it work?

Victor

Le 1 juil.
2017 6:02 PM, "Bhavishya" a écrit :

> Hi,
> I added lazy_loading in pickle.py. Here are some statistics if you consider
> them of any importance:
>
> 1)Unpickle_pure:
> Lazy -> unpickle_pure_python: Mean +- std dev: 728 us +- 24 us
>
> Original -> unpickle_pure_python: Mean +- std dev: 771 us +- 22 us
>
> 2)Pickle_pure:
>
> Lazy -> pickle_pure_python: Mean +- std dev: 919 us +- 18 us
>
> Original -> pickle_pure_python: Mean +- std dev: 1.04 ms +- 0.03 ms
>
> Thanks
> bhavishya
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From nad at python.org Mon Jul 3 00:43:06 2017
From: nad at python.org (Ned Deily)
Date: Mon, 3 Jul 2017 00:43:06 -0400
Subject: [Python-Dev] 3.6.2 update: 3.6.2rc2 coming
Message-ID: 

As you may recall, 3.6.2rc1 was released on 2017-06-17 with an expected
release date of 2017-06-30 for 3.6.2 final. As you also may recall,
3.6.2rc1 was delayed a bit to address some security issues including
updating our version of libexpat. Shortly after 3.6.2rc1 was released, the
expat project released another version of libexpat fixing another security
problem. Also since 3.6.2rc1, fixes for several other security issues in
Python 3.6 itself have become available and it would be better to get them
out sooner rather than later. Therefore, I have decided to do a second
release candidate with select fixes cherry-picked from the 3.6 branch;
continue to merge bug fixes and doc fixes into 3.6 as usual for release in
3.6.3. Since there haven't been regressions reported so far with 3.6.2rc1,
we will plan to compress the rest of the release cycle. Expect to see
3.6.2rc2 available within the next couple of days (2017-07-04 expected)
and, assuming no new issues, 3.6.2 final about a week later (around
2017-07-11).

--Ned

--
  Ned Deily
  nad at python.org -- []

From netheril96 at gmail.com Mon Jul 3 00:52:16 2017
From: netheril96 at gmail.com (Siyuan Ren)
Date: Mon, 3 Jul 2017 12:52:16 +0800
Subject: [Python-Dev] 64 bit units in PyLong
Message-ID: 

The current PyLong implementation represents arbitrary precision integers
in units of 15 or 30 bits. I presume the purpose is to avoid overflow in
addition , subtraction and multiplication. But compilers these days offer
intrinsics that allow one to access the overflow flag, and to obtain the
result of 64 bit multiplication as a 128 bit number. Or at least on x86-64,
which is the dominant platform. Any reason why it is not done? If it is
only because no one bothers, I may be able to do it.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ma3yuki.8mamo10 at gmail.com Mon Jul 3 03:32:51 2017
From: ma3yuki.8mamo10 at gmail.com (Masayuki YAMAMOTO)
Date: Mon, 3 Jul 2017 16:32:51 +0900
Subject: [Python-Dev] Remove own implementation for thread-local storage
Message-ID: 

Hi, python-dev.

I'd propose removing code which I think out-of-date.
CPython has provided the own implementation for thread-local storage (TLS)
on Python/thread.c, it's used in the case which a platform has not supplied
native TLS. However, currently all supported platforms (NT and pthreads)
have provided native TLS and defined the Py_HAVE_NATIVE_TLS macro with
unconditional in any case.
If the code is removed, the new TLS API for PEP 539 won't have to care the
reinitialization of the thread keys managed by the interpreter (i.e.
PyThread_ReInitTLS function has been working for own implementation and will
be no longer necessary for new API). Does anyone have a reason we should
keep it?
Regards, Masayuki -------------- next part -------------- An HTML attachment was scrubbed... URL: From victor.stinner at gmail.com Mon Jul 3 04:05:20 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 3 Jul 2017 10:05:20 +0200 Subject: [Python-Dev] 64 bit units in PyLong In-Reply-To: References: Message-ID: 2017-07-03 6:52 GMT+02:00 Siyuan Ren : > The current PyLong implementation represents arbitrary precision integers in > units of 15 or 30 bits. I presume the purpose is to avoid overflow in > addition , subtraction and multiplication. But compilers these days offer > intrinsics that allow one to access the overflow flag, and to obtain the > result of 64 bit multiplication as a 128 bit number. The question is the performance. Is it fast? :-) You can try to write a patch and run a benchmark. See for example http://pyperformance.readthedocs.io/ for benchmarks. Victor From victor.stinner at gmail.com Mon Jul 3 04:07:06 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 3 Jul 2017 10:07:06 +0200 Subject: [Python-Dev] Remove own implementation for thread-local storage In-Reply-To: References: Message-ID: I'm in favor of removing it. I know that it confused people many times, they look at this fallback and found an issue, whereas I'm not aware of any platform using this fallback anymore. Can you please write a PR just to remove this fallback? We can merge it and then check buildbots :-) So in the worst case, we can revert it. Victor 2017-07-03 9:32 GMT+02:00 Masayuki YAMAMOTO : > Hi, python-dev. > > I'd propose removing code which I think out-of-date. > CPython has provided the own implementation for thread-local storage (TLS) > on Python/thread.c, it's used in the case which a platform has not supplied > native TLS. However, currently all supported platforms (NT and pthreads) > have provided native TLS and defined the Py_HAVE_NATIVE_TLS macro with > unconditional in any case. > If the code is removed, the new TLS API for PEP 539 won't have to care the > reinitialization of the thread keys managed by the interpreter (i.e. > PyThread_ReInitTLS function has been working for own implementation and will > be no longer necessary for new API). Does anyone have a reason we should > keep it? > > Regards, > Masayuki > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/victor.stinner%40gmail.com > From solipsis at pitrou.net Mon Jul 3 04:19:29 2017 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 3 Jul 2017 10:19:29 +0200 Subject: [Python-Dev] Remove own implementation for thread-local storage References: Message-ID: <20170703101929.65aea0b0@fsol> Also note that C11, MSVC and some Unix C compilers have built-in support for thread-local variables. Example: https://github.com/numba/numba/blob/master/numba/_random.c#L114-L119 Regards Antoine. On Mon, 3 Jul 2017 10:07:06 +0200 Victor Stinner wrote: > I'm in favor of removing it. I know that it confused people many > times, they look at this fallback and found an issue, whereas I'm not > aware of any platform using this fallback anymore. > > Can you please write a PR just to remove this fallback? We can merge > it and then check buildbots :-) So in the worst case, we can revert > it. > > Victor > > 2017-07-03 9:32 GMT+02:00 Masayuki YAMAMOTO : > > Hi, python-dev. > > > > I'd propose removing code which I think out-of-date. 
> > CPython has provided the own implementation for thread-local storage (TLS) > > on Python/thread.c, it's used in the case which a platform has not supplied > > native TLS. However, currently all supported platforms (NT and pthreads) > > have provided native TLS and defined the Py_HAVE_NATIVE_TLS macro with > > unconditional in any case. > > If the code is removed, the new TLS API for PEP 539 won't have to care the > > reinitialization of the thread keys managed by the interpreter (i.e. > > PyThread_ReInitTLS function has been working for own implementation and will > > be no longer necessary for new API). Does anyone have a reason we should > > keep it? > > > > Regards, > > Masayuki > > > > _______________________________________________ > > Python-Dev mailing list > > Python-Dev at python.org > > https://mail.python.org/mailman/listinfo/python-dev > > Unsubscribe: > > https://mail.python.org/mailman/options/python-dev/victor.stinner%40gmail.com > > From victor.stinner at gmail.com Mon Jul 3 06:02:13 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 3 Jul 2017 12:02:13 +0200 Subject: [Python-Dev] Remove own implementation for thread-local storage In-Reply-To: <20170703101929.65aea0b0@fsol> References: <20170703101929.65aea0b0@fsol> Message-ID: Sadly, we only require C99 yet :-/ Victor 2017-07-03 10:19 GMT+02:00 Antoine Pitrou : > > Also note that C11, MSVC and some Unix C compilers have built-in support > for thread-local variables. Example: > https://github.com/numba/numba/blob/master/numba/_random.c#L114-L119 > > Regards > > Antoine. > > > On Mon, 3 Jul 2017 10:07:06 +0200 > Victor Stinner wrote: > >> I'm in favor of removing it. I know that it confused people many >> times, they look at this fallback and found an issue, whereas I'm not >> aware of any platform using this fallback anymore. >> >> Can you please write a PR just to remove this fallback? We can merge >> it and then check buildbots :-) So in the worst case, we can revert >> it. >> >> Victor >> >> 2017-07-03 9:32 GMT+02:00 Masayuki YAMAMOTO : >> > Hi, python-dev. >> > >> > I'd propose removing code which I think out-of-date. >> > CPython has provided the own implementation for thread-local storage (TLS) >> > on Python/thread.c, it's used in the case which a platform has not supplied >> > native TLS. However, currently all supported platforms (NT and pthreads) >> > have provided native TLS and defined the Py_HAVE_NATIVE_TLS macro with >> > unconditional in any case. >> > If the code is removed, the new TLS API for PEP 539 won't have to care the >> > reinitialization of the thread keys managed by the interpreter (i.e. >> > PyThread_ReInitTLS function has been working for own implementation and will >> > be no longer necessary for new API). Does anyone have a reason we should >> > keep it? 
>> > >> > Regards, >> > Masayuki >> > >> > _______________________________________________ >> > Python-Dev mailing list >> > Python-Dev at python.org >> > https://mail.python.org/mailman/listinfo/python-dev >> > Unsubscribe: >> > https://mail.python.org/mailman/options/python-dev/victor.stinner%40gmail.com >> > > > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/victor.stinner%40gmail.com From victor.stinner at gmail.com Mon Jul 3 07:03:25 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 3 Jul 2017 13:03:25 +0200 Subject: [Python-Dev] Need help to review a test_nntplib enhancement: Message-ID: Hi, Sometimes, for an unknown reason, test_nntplib fails randomly: http://bugs.python.org/issue19613 Martin Panter wrote a patch, but since I don't know how to reproduce the bug, I'm unable to test it. Moreover, I don't know nntplib nor test_nntplib, so I don't feel able to review it. Sadly, Martin doesn't feel confident enough in his patch neither, since again, he is unable to test it. As a result, the issue is stuck and the bug continues to occur sometimes on buildbots. Victor From ncoghlan at gmail.com Mon Jul 3 10:34:32 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 4 Jul 2017 00:34:32 +1000 Subject: [Python-Dev] Remove own implementation for thread-local storage In-Reply-To: References: <20170703101929.65aea0b0@fsol> Message-ID: On 3 July 2017 at 20:02, Victor Stinner wrote: > Sadly, we only require C99 yet :-/ Handling fallbacks when shiny new features are unavailable is what autoconf is for, though :) The fact thread specific storage support made it directly into C11 makes me more confident in dropping our emulation, so +1 for investigating that *before* we finalize PEP 539. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From victor.stinner at gmail.com Mon Jul 3 10:38:19 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 3 Jul 2017 16:38:19 +0200 Subject: [Python-Dev] Remove own implementation for thread-local storage In-Reply-To: References: Message-ID: >> I'd propose removing code which I think out-of-date. Already done! https://github.com/python/cpython/commit/aa0aa0492c5fffe750a26d2ab13737a1a6d7d63c (and no buildbot complained). Victor From victor.stinner at gmail.com Tue Jul 4 05:55:01 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Tue, 4 Jul 2017 11:55:01 +0200 Subject: [Python-Dev] New work-in-progress bisection tool for the Python test suite (in regrtest) In-Reply-To: References: Message-ID: On Python 2, the addition of Lib/test/bisect.py caused conflict with Lib/bisect.py when running the Python test suite :-( I chose to rename Lib/test/bisect.py to Lib/test/bisectcmd.py to reduce changes caused by this new debug tool. So only in Python 2.7, you have to run: ./python -m test.bisectcmd ... instead of ./python -m tes.bisect ... See http://bugs.python.org/issue30843 for more information. Victor 2017-06-16 18:05 GMT+02:00 Victor Stinner : > Hi, > > Last weeks, I worked on a new tool to bisect failing tests because > it's painful to bisect manually reference leaks (I remove as much code > as possible until the code is small enough to be reviewable manually). 
> > See the bisect_test.py script attached to this issue: > http://bugs.python.org/issue29512 > > With the help of Louie Lu, I added new --list-cases option to "python > -m test", so you can now list all test cases and write it into a text > file: > > ./python -m test --list-cases test_os > tests > > I also added a new --matchfile option, to filter tests using a text > file which contains one pattern per line: > > ./python -m test --matchfile=tests test_os > > fnmatch is used to match test names, so "*" joker character can be > used in test names. > > > My bisection tool takes a text file with the --matchfile format (one > pattern per line) and creates a random subset of tests with half of > the tests. If tests still fail, use the subset. Otherwise, create a > new random subset. Loop until the subset contains a single test > (configurable threshold, -n command line option). > > The long term plan is to integrate the bisection feature directly into regrtest. > > > > Right now, my script is hardcoded to bisect reference leak bugs, but > it should be easy to modify it to bisect other test issues like test > creating files without removing it ("ENV_CHANGED" failure in > regrtest). > > For example, a core file is dumped when running test_subprocess on > FreeBSD buildbots: > > http://bugs.python.org/issue30448 > > But I'm unable to reproduce the issue on my FreeBSD. It would be nice > to be able to automate the bisection on the buildbot directly. > > > --list-cases and --matchfile options are now available in 2.7, 3.5, > 3.6 and master (3.7) branches. > > TODO: doctest tests are only partially supported, see: > > http://bugs.python.org/issue30683 > > Victor From ncoghlan at gmail.com Tue Jul 4 06:52:34 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 4 Jul 2017 20:52:34 +1000 Subject: [Python-Dev] New work-in-progress bisection tool for the Python test suite (in regrtest) In-Reply-To: References: Message-ID: On 4 July 2017 at 19:55, Victor Stinner wrote: > On Python 2, the addition of Lib/test/bisect.py caused conflict with > Lib/bisect.py when running the Python test suite :-( I chose to rename > Lib/test/bisect.py to Lib/test/bisectcmd.py to reduce changes caused > by this new debug tool. So only in Python 2.7, you have to run: > > ./python -m test.bisectcmd ... I know it's longer, but perhaps it would make sense to put the bisection helper under "python -m test.support.bisect" in both Python 2 & 3? Even in Python 3 "test.bisect" looks a bit like the test suite for the bisect module to me - you have to "just know" that the latter actually lives at "test.test_bisect". Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From victor.stinner at gmail.com Tue Jul 4 07:03:26 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Tue, 4 Jul 2017 13:03:26 +0200 Subject: [Python-Dev] New work-in-progress bisection tool for the Python test suite (in regrtest) In-Reply-To: References: Message-ID: 2017-07-04 12:52 GMT+02:00 Nick Coghlan : > I know it's longer, but perhaps it would make sense to put the > bisection helper under "python -m test.support.bisect" in both Python > 2 & 3? For me, test.support is a toolkit to *write* tests, not really to run tests. I don't really care where my bisect tool lives. Serhiy proposed test.bisect, I like because it's short and easy to remind. 
Technically it is possible to get test.bisect on Python 2, it just requires to modify 4 .py files which import Lib/bisect.py to add "from __future__ import absolute_import": Lib/urllib2.py:import bisect Lib/mhlib.py:from bisect import bisect Lib/test/test_bisect.py:import bisect as py_bisect Lib/multiprocessing/heap.py:import bisect I modified Lib/test/test_bisect.py, but I missed these other ones in my first commit. And then I got a failure in multiprocessing. I chose the conservative approach: rename the new Lib/test/bisect.py file. Do you prefer to get test.bisect, and so modify the 4 files to add "from __future__ import absolute_import"? I didn't recall the subtle details of "relative import" in Python 2. Since I'm now used to Python 3, the Python 2 behaviour now really looks weird to me :-) Victor From lele at metapensiero.it Tue Jul 4 07:20:36 2017 From: lele at metapensiero.it (Lele Gaifax) Date: Tue, 04 Jul 2017 13:20:36 +0200 Subject: [Python-Dev] New work-in-progress bisection tool for the Python test suite (in regrtest) References: Message-ID: <87a84k5tyz.fsf@metapensiero.it> Nick Coghlan writes: > On 4 July 2017 at 19:55, Victor Stinner wrote: >> On Python 2, the addition of Lib/test/bisect.py caused conflict with >> Lib/bisect.py when running the Python test suite :-( I chose to rename >> Lib/test/bisect.py to Lib/test/bisectcmd.py to reduce changes caused >> by this new debug tool. So only in Python 2.7, you have to run: >> >> ./python -m test.bisectcmd ... > > I know it's longer, but perhaps it would make sense to put the > bisection helper under "python -m test.support.bisect" in both Python > 2 & 3? Or test.tool.bisect, similar to json.tool. ciao, lele. -- nickname: Lele Gaifax | Quando vivr? di quello che ho pensato ieri real: Emanuele Gaifas | comincer? ad aver paura di chi mi copia. lele at metapensiero.it | -- Fortunato Depero, 1929. From ncoghlan at gmail.com Tue Jul 4 07:22:08 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 4 Jul 2017 21:22:08 +1000 Subject: [Python-Dev] New work-in-progress bisection tool for the Python test suite (in regrtest) In-Reply-To: References: Message-ID: On 4 July 2017 at 21:03, Victor Stinner wrote: > 2017-07-04 12:52 GMT+02:00 Nick Coghlan : >> I know it's longer, but perhaps it would make sense to put the >> bisection helper under "python -m test.support.bisect" in both Python >> 2 & 3? > > For me, test.support is a toolkit to *write* tests, not really to run tests. > > I don't really care where my bisect tool lives. Serhiy proposed > test.bisect, I like because it's short and easy to remind. > > Technically it is possible to get test.bisect on Python 2, it just > requires to modify 4 .py files which import Lib/bisect.py to add "from > __future__ import absolute_import": > > Lib/urllib2.py:import bisect > Lib/mhlib.py:from bisect import bisect > Lib/test/test_bisect.py:import bisect as py_bisect > Lib/multiprocessing/heap.py:import bisect That doesn't sound right, as implicit relative imports in Python 2 are relative to the *package* (they're akin to writing "from . import name"). That means if test.bisect is shadowing the top level bisect module when backported, it suggests that the test.regrtest directory is ending up on sys.path for the affected test run (e.g. because the tests were run as "python Lib/test/regrtest.py" rather than via the -m switch). 
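To make the implicit-relative-import point concrete, here is a small
hypothetical sketch (the file name and layout are invented for illustration,
not code from the actual test suite): imagine a module living inside the
"test" package while a sibling Lib/test/bisect.py exists.

```
# Hypothetical Lib/test/example.py -- a module inside the "test" package,
# sitting next to a local test/bisect.py (names are illustrative only).
from __future__ import absolute_import  # PEP 328: bare imports become absolute

import bisect                        # with the future import: always Lib/bisect.py
from . import bisect as test_bisect  # explicit relative import of the sibling

# Without the future import, Python 2 would resolve the bare "import bisect"
# above as an implicit relative import and silently pick up test/bisect.py.
print(bisect.bisect_left([1, 2, 4], 3))  # stdlib bisect -> prints 2
```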
Checking test.regrtest in 2.7, it looks to me like it's missing the stanza added to Py3 that makes sure "Lib/test/" isn't present on sys.path - if you add a similar operation to Py2.7, I'd expect the test.bisect name for the command to work there as well. Cheers, Nick. P.S. As far the multiprocessing failure you saw goes, my guess would be that the 2.7 version actually is relying on implicit relative imports to find peer modules, and would need some "from . import ..." adjustments to handle "from __future__ import absolute_import" at the top of the file. However, as noted above, I don't think that's actually what's happening - I think the "Lib/test/" directory is still sometimes ending up on sys.path while running the tests in 2.7. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From victor.stinner at gmail.com Tue Jul 4 08:10:14 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Tue, 4 Jul 2017 14:10:14 +0200 Subject: [Python-Dev] New work-in-progress bisection tool for the Python test suite (in regrtest) In-Reply-To: References: Message-ID: 2017-07-04 13:22 GMT+02:00 Nick Coghlan : > That means if test.bisect is shadowing the top level bisect module > when backported, it suggests that the test.regrtest directory is > ending up on sys.path for the affected test run (e.g. because the > tests were run as "python Lib/test/regrtest.py" rather than via the -m > switch). I don't think that Lib/test/ is in sys.path. It's more subtle than test. When you run "./python -m test test_bisect", Lib/test/regrtest.py imports "test.test_bisect", and so test_bisect is imported with __package__=['test']. With test_bisect.__package__=['test'], "import bisect" in Lib/test/test_bisect.py imports Lib/test/bisect.py. The question is more when Lib/multiprocessing/heap.py got Lib/test/bisect.py instead of Lib/bisect.py. I didn't dig into this issue. The Python 2 import machinery blows my mind :-) Victor From victor.stinner at gmail.com Tue Jul 4 09:15:53 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Tue, 4 Jul 2017 15:15:53 +0200 Subject: [Python-Dev] Star the CPython GitHub project if you like Python! In-Reply-To: References: Message-ID: 4 days later, we got +2,389 new stars, thank you! (8,539 => 10,928) Python moved from the 11th place to the 9th, before Elixir and Julia. Python is still behind Ruby (12,511) and PHP (12,318), but it's already much better than before! Victor 2017-06-30 15:59 GMT+02:00 Victor Stinner : > Hi, > > GitHub has a showcase page of hosted programming languages: > > https://github.com/showcases/programming-languages > > Python is only #11 with 8,539 stars, behind PHP and Ruby! > > Hey, you should "like" ("star"?) the CPython project if you like Python! > > https://github.com/python/cpython/ > Click on "Star" at the top right. > > Thank you! > Victor From victor.stinner at gmail.com Tue Jul 4 09:23:56 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Tue, 4 Jul 2017 15:23:56 +0200 Subject: [Python-Dev] Buildbot report (almost July) In-Reply-To: References: Message-ID: 2017-06-29 17:09 GMT+02:00 Victor Stinner : > Correct me if I'm wrong, but, for the first time, *all reference > leaks* have been fixed on *all branches* (2.7, 3.5, 3.6 and master), > on *Linux and Windows*! Before, we mostly focused on the master branch > (called "default" in Mercurial) on Linux. > > I also started to fix a few "memory block" leaks, most (or all?) of > them should also be fixed (on all branches, on Linux and Windows). 
The "AMD64 Windows8.1 Refleaks" now pass on 2.7, 3.5, 3.6 and master branches. I finished to backport my changes to fix false alarms in hunting reference and memory leaks. The next introduced reference leak should now send an email to the buildbot-status mailing list ;-) Victor From benhoyt at gmail.com Tue Jul 4 10:22:48 2017 From: benhoyt at gmail.com (Ben Hoyt) Date: Tue, 4 Jul 2017 10:22:48 -0400 Subject: [Python-Dev] Star the CPython GitHub project if you like Python! In-Reply-To: References: Message-ID: Nice! I also posted it on reddit.com/r/Python, where it got a bit of traction: https://www.reddit.com/r/Python/comments/6kg4w0/cpython_recently_moved_to_github_star_the_project/ -Ben On Tue, Jul 4, 2017 at 9:15 AM, Victor Stinner wrote: > 4 days later, we got +2,389 new stars, thank you! (8,539 => 10,928) > > Python moved from the 11th place to the 9th, before Elixir and Julia. > > Python is still behind Ruby (12,511) and PHP (12,318), but it's > already much better than before! > > Victor > > 2017-06-30 15:59 GMT+02:00 Victor Stinner : > > Hi, > > > > GitHub has a showcase page of hosted programming languages: > > > > https://github.com/showcases/programming-languages > > > > Python is only #11 with 8,539 stars, behind PHP and Ruby! > > > > Hey, you should "like" ("star"?) the CPython project if you like Python! > > > > https://github.com/python/cpython/ > > Click on "Star" at the top right. > > > > Thank you! > > Victor > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > benhoyt%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Tue Jul 4 10:27:34 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 5 Jul 2017 00:27:34 +1000 Subject: [Python-Dev] New work-in-progress bisection tool for the Python test suite (in regrtest) In-Reply-To: References: Message-ID: On 4 July 2017 at 22:10, Victor Stinner wrote: > 2017-07-04 13:22 GMT+02:00 Nick Coghlan : >> That means if test.bisect is shadowing the top level bisect module >> when backported, it suggests that the test.regrtest directory is >> ending up on sys.path for the affected test run (e.g. because the >> tests were run as "python Lib/test/regrtest.py" rather than via the -m >> switch). > > I don't think that Lib/test/ is in sys.path. It's more subtle than > test. When you run "./python -m test test_bisect", > Lib/test/regrtest.py imports "test.test_bisect", and so test_bisect is > imported with __package__=['test']. > > With test_bisect.__package__=['test'], "import bisect" in > Lib/test/test_bisect.py imports Lib/test/bisect.py. Right, for test_bisect specifically, the implicit relative import problem applies, and "from __future__ import absolute_import" is the relevant fix. That concern just doesn't apply to the *stdlib* modules doing a normal top-level "import bisect". > The question is more when Lib/multiprocessing/heap.py got > Lib/test/bisect.py instead of Lib/bisect.py. I didn't dig into this > issue. 
The Python 2 import machinery blows my mind :-) *This* is the case that I think is due to "Lib/test" being on sys.path when the tests are run: ``` $ ./python -i Lib/test/regrtest.py --help [snip output] >>> import sys >>> sys.path[0] '/home/ncoghlan/devel/py27/Lib/test' ``` Using test_urllib2 as the example: ``` $ touch Lib/test/bisect.py $ ./python -m test.regrtest test_urllib2 Run tests sequentially 0:00:00 [1/1] test_urllib2 1 test OK. Total duration: 800 ms Tests result: SUCCESS $ ./python Lib/test/regrtest.py test_urllib2 Run tests sequentially 0:00:00 [1/1] test_urllib2 Warning -- os.environ was modified by test_urllib2 [snip output] test test_urllib2 failed -- multiple errors occurred; run in verbose mode for details 1 test failed: test_urllib2 Total duration: 107 ms Tests result: FAILURE ``` Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From victor.stinner at gmail.com Tue Jul 4 10:46:00 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Tue, 4 Jul 2017 16:46:00 +0200 Subject: [Python-Dev] New work-in-progress bisection tool for the Python test suite (in regrtest) In-Reply-To: References: Message-ID: 2017-07-04 16:27 GMT+02:00 Nick Coghlan : > That concern just doesn't apply to the *stdlib* modules doing a normal > top-level "import bisect". Hum, ok. I created a PR which removes '' and Lib/test/ from sys.path, and rename again test.bisectcmd to test.bisect. Would you mind to review it? https://github.com/python/cpython/pull/2567 Victor From chris.jerdonek at gmail.com Tue Jul 4 15:11:42 2017 From: chris.jerdonek at gmail.com (Chris Jerdonek) Date: Tue, 4 Jul 2017 12:11:42 -0700 Subject: [Python-Dev] Star the CPython GitHub project if you like Python! In-Reply-To: References: Message-ID: Great work, Victor! It seems like this would be an easy thing to mention at the beginning of conference talks and meetup presentations, and also something to ask coworkers if you work at a company that uses Python (e.g. on workplace Slack channels, etc). --Chris On Tue, Jul 4, 2017 at 6:15 AM, Victor Stinner wrote: > 4 days later, we got +2,389 new stars, thank you! (8,539 => 10,928) > > Python moved from the 11th place to the 9th, before Elixir and Julia. > > Python is still behind Ruby (12,511) and PHP (12,318), but it's > already much better than before! > > Victor > > 2017-06-30 15:59 GMT+02:00 Victor Stinner : >> Hi, >> >> GitHub has a showcase page of hosted programming languages: >> >> https://github.com/showcases/programming-languages >> >> Python is only #11 with 8,539 stars, behind PHP and Ruby! >> >> Hey, you should "like" ("star"?) the CPython project if you like Python! >> >> https://github.com/python/cpython/ >> Click on "Star" at the top right. >> >> Thank you! >> Victor > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/chris.jerdonek%40gmail.com From tjreedy at udel.edu Tue Jul 4 16:00:09 2017 From: tjreedy at udel.edu (Terry Reedy) Date: Tue, 4 Jul 2017 16:00:09 -0400 Subject: [Python-Dev] Star the CPython GitHub project if you like Python! In-Reply-To: References: Message-ID: On 7/4/2017 10:22 AM, Ben Hoyt wrote: > Nice! > > I also posted it on reddit.com/r/Python , > where it got a bit of traction: > https://www.reddit.com/r/Python/comments/6kg4w0/cpython_recently_moved_to_github_star_the_project/ I just posted on python-list. 
-- Terry Jan Reedy From dickinsm at gmail.com Wed Jul 5 15:05:06 2017 From: dickinsm at gmail.com (Mark Dickinson) Date: Wed, 5 Jul 2017 20:05:06 +0100 Subject: [Python-Dev] 64 bit units in PyLong In-Reply-To: References: Message-ID: On Mon, Jul 3, 2017 at 5:52 AM, Siyuan Ren wrote: > The current PyLong implementation represents arbitrary precision integers in > units of 15 or 30 bits. I presume the purpose is to avoid overflow in > addition , subtraction and multiplication. But compilers these days offer > intrinsics that allow one to access the overflow flag, and to obtain the > result of 64 bit multiplication as a 128 bit number. Or at least on x86-64, > which is the dominant platform. Any reason why it is not done? Portability matters, so any use of these intrinsics would likely also have to be accompanied by fallback code that doesn't depend on them, as well as some buildsystem complexity to figure out whether those intrinsics are supported or not. And then the Objects/longobject.c would suffer in terms of simplicity and readability, so there would have to be some clear gains to offset that. Note that the typical Python workload does not involve thousand-digit integers: what would matter would be performance of smaller integers, and it seems conceivable that 64-bit limbs would speed up those operations simply because so many more integers would become single-limb and so there would be more opportunities to take fast paths, but there would need to be benchmarks demonstrating that. Oh, and you'd have to rewrite the power algorithm, which currently depends on the size of a limb in bytes being a multiple of 5. :-) -- Mark From breamoreboy at yahoo.co.uk Wed Jul 5 15:33:23 2017 From: breamoreboy at yahoo.co.uk (Mark Lawrence) Date: Wed, 5 Jul 2017 20:33:23 +0100 Subject: [Python-Dev] 64 bit units in PyLong In-Reply-To: References: Message-ID: On 05/07/2017 20:05, Mark Dickinson wrote: > Oh, and you'd have to rewrite the power algorithm, which currently > depends on the size of a limb in bytes being a multiple of 5. :-) > What is a limb, as my search foo has let me down? -- My fellow Pythonistas, ask not what our language can do for you, ask what you can do for our language. Mark Lawrence --- This email has been checked for viruses by AVG. http://www.avg.com From rosuav at gmail.com Wed Jul 5 15:39:08 2017 From: rosuav at gmail.com (Chris Angelico) Date: Thu, 6 Jul 2017 05:39:08 +1000 Subject: [Python-Dev] 64 bit units in PyLong In-Reply-To: References: Message-ID: On Thu, Jul 6, 2017 at 5:33 AM, Mark Lawrence via Python-Dev wrote: > On 05/07/2017 20:05, Mark Dickinson wrote: > >> Oh, and you'd have to rewrite the power algorithm, which currently >> depends on the size of a limb in bytes being a multiple of 5. :-) >> > > What is a limb, as my search foo has let me down? A thing that has a bunch of digits, but fits inside a machine word. https://gmplib.org/manual/Nomenclature-and-Types.html ChrisA From greg at krypto.org Wed Jul 5 16:18:05 2017 From: greg at krypto.org (Gregory P. Smith) Date: Wed, 05 Jul 2017 20:18:05 +0000 Subject: [Python-Dev] 64 bit units in PyLong In-Reply-To: References: Message-ID: On Wed, Jul 5, 2017 at 12:05 PM Mark Dickinson wrote: > On Mon, Jul 3, 2017 at 5:52 AM, Siyuan Ren wrote: > > The current PyLong implementation represents arbitrary precision > integers in > > units of 15 or 30 bits. I presume the purpose is to avoid overflow in > > addition , subtraction and multiplication. 
But compilers these days offer > > intrinsics that allow one to access the overflow flag, and to obtain the > > result of 64 bit multiplication as a 128 bit number. Or at least on > x86-64, > > which is the dominant platform. Any reason why it is not done? > > Portability matters, so any use of these intrinsics would likely also > have to be accompanied by fallback code that doesn't depend on them, > as well as some buildsystem complexity to figure out whether those > intrinsics are supported or not. And then the Objects/longobject.c > would suffer in terms of simplicity and readability, so there would > have to be some clear gains to offset that. Note that the typical > Python workload does not involve thousand-digit integers: what would > matter would be performance of smaller integers, and it seems > conceivable that 64-bit limbs would speed up those operations simply > because so many more integers would become single-limb and so there > would be more opportunities to take fast paths, but there would need > to be benchmarks demonstrating that. > > Oh, and you'd have to rewrite the power algorithm, which currently > depends on the size of a limb in bytes being a multiple of 5. :-) > > -- > Mark > When I pushed to get us to adopt 30-bit digits instead of 15-bit digits, I hoped it could also happen on 32-bit x86 builds as the hardware has fast 32bit multiply -> 64 result support. But it made the configuration more difficult and IIRC would've increase the memory use of the PyLong type for single digit numbers on 32-bit platforms so we settled on moving to 30 bits only for 64-bit platforms (which were obviously going to become the norm). A reasonable digit size for hardware that supports 128bit results of 64bit multiplies would be 60 bits (to keep our multiple of 5 bits logic working). But I doubt you will see any notable performance gains in practical applications by doing so. Sure, numbers in the 1-4 billion range are slightly less efficient today, but those are not likely to appear in hot loops in a typical application. Microbenchmarks alone should not be used to make this decision. remember: We had native integer support in Python 1 & 2 via the old PyInt type. In Python 3 we ditched that in favor of PyLong everywhere. This was a performance hit for the sake of proper high level language simplicity. We've already regained all of that since then in other areas of the interpreter. -gps -------------- next part -------------- An HTML attachment was scrubbed... URL: From status at bugs.python.org Fri Jul 7 12:09:18 2017 From: status at bugs.python.org (Python tracker) Date: Fri, 7 Jul 2017 18:09:18 +0200 (CEST) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20170707160918.4A8D056BE1@psf.upfronthosting.co.za> ACTIVITY SUMMARY (2017-06-30 - 2017-07-07) Python tracker at http://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. Issues counts and deltas: open 6021 (+15) closed 36604 (+39) total 42625 (+54) Open issues with patches: 2344 Issues opened (48) ================== #23644: g++ module compile fails with ???_Atomic??? 
does not name a ty http://bugs.python.org/issue23644 reopened by haypo #27945: Various segfaults with dict http://bugs.python.org/issue27945 reopened by ned.deily #30819: Linking with 'ld -b' fails with 64-bit using Itanium HP compil http://bugs.python.org/issue30819 opened by Robert Boehne #30820: email.contentmanager.raw_data_manager fails to create multipar http://bugs.python.org/issue30820 opened by elenril #30821: unittest.mock.Mocks with specs aren't aware of default argumen http://bugs.python.org/issue30821 opened by Max Rothman #30822: Python implementation of datetime module is not being tested c http://bugs.python.org/issue30822 opened by musically_ut #30823: os.startfile("") craches Python 2.7, 3.4 in Windows 7 32 bit i http://bugs.python.org/issue30823 opened by mikeee #30824: Add mimetype for extension .json http://bugs.python.org/issue30824 opened by quentel #30825: csv.Sniffer does not detect lineterminator http://bugs.python.org/issue30825 opened by vmax #30826: More details in reference 'Looping through a list in Python an http://bugs.python.org/issue30826 opened by the_darklord #30828: Out of bounds write in _asyncio_Future_remove_done_callback http://bugs.python.org/issue30828 opened by Ned Williamson #30830: HTTPHandlerTest of test_logging leaks a "dangling" thread on A http://bugs.python.org/issue30830 opened by haypo #30831: Inconsistent or wrong documentation around Asynchronous Contex http://bugs.python.org/issue30831 opened by dmiyakawa #30833: UnloadUserprofile displays the error "The handle is invalid" http://bugs.python.org/issue30833 opened by d doe #30834: Warning -- files was modified by test_import, After: ['@test http://bugs.python.org/issue30834 opened by haypo #30835: AttributeError when parsing multipart email with invalid non-d http://bugs.python.org/issue30835 opened by Andrew Donnellan #30836: test_c_locale_coercion fails on AIX http://bugs.python.org/issue30836 opened by haypo #30837: Mac OS High Sierra Beta - Python Crash http://bugs.python.org/issue30837 opened by ayesjm #30839: Larger and/or configurable _MAX_LENGTH for unittest messages http://bugs.python.org/issue30839 opened by maarten-treewalker #30840: Contrary to documentation, relative imports cannot pass throug http://bugs.python.org/issue30840 opened by Malcolm Smith #30841: A shadowing variable naming emitted for Python-ast.c http://bugs.python.org/issue30841 opened by OswinC #30842: pyenv activate for bash and tcsh http://bugs.python.org/issue30842 opened by PyAcrisel #30844: selectors: Add urgent data to read event http://bugs.python.org/issue30844 opened by pklanke #30845: [3.5] test_first_completed_some_already_completed() of test_co http://bugs.python.org/issue30845 opened by haypo #30846: [3.6] test_rapid_restart() of test_multiprocessing_fork fails http://bugs.python.org/issue30846 opened by haypo #30847: asyncio: selector_events: add_urgent() for urgent data to read http://bugs.python.org/issue30847 opened by pklanke #30848: test_multiprocessing_forkserver hangs on AMD64 FreeBSD CURRENT http://bugs.python.org/issue30848 opened by haypo #30849: test_stress_delivery_dependent() of test_signal randomly fails http://bugs.python.org/issue30849 opened by haypo #30850: [2.7] bsddb3: test01_basic_replication() of test_bsddb3 fails http://bugs.python.org/issue30850 opened by haypo #30851: IDLE: configdialog -- fix tkinter Variables http://bugs.python.org/issue30851 opened by terry.reedy #30852: _PyObject_GC_UNTRACK corruption when call a lambda function wi 
http://bugs.python.org/issue30852 opened by ????????? #30853: IDLE: configdialog -- factor out Variable subclass http://bugs.python.org/issue30853 opened by terry.reedy #30856: unittest.TestResult.addSubTest should be called immediately af http://bugs.python.org/issue30856 opened by sir-sigurd #30857: test_bsddb3 hangs longer than 37 minutes on x86 Tiger 2.7 http://bugs.python.org/issue30857 opened by haypo #30858: Keyword can't be an expression? http://bugs.python.org/issue30858 opened by veky #30859: Can't install Python for Windows 3.6.1 on multiple profiles http://bugs.python.org/issue30859 opened by Joe Jacobs #30860: Consolidate stateful C globals under a single struct. http://bugs.python.org/issue30860 opened by eric.snow #30861: StreamReader does not return reamaing and ready data buffer be http://bugs.python.org/issue30861 opened by pfreixes #30863: Rewrite PyUnicode_AsWideChar() and PyUnicode_AsWideCharString( http://bugs.python.org/issue30863 opened by serhiy.storchaka #30864: Compile failure for linux socket CAN support http://bugs.python.org/issue30864 opened by Riccardo Magliocchetti #30865: python cannot import module located on a "VOLUME" directory http://bugs.python.org/issue30865 opened by apre #30866: Add _testcapi.stack_pointer() to measure the C stack consumpti http://bugs.python.org/issue30866 opened by haypo #30867: Add necessary macro that insure `HAVE_OPENSSL_VERIFY_PARAM` to http://bugs.python.org/issue30867 opened by signal1587 #30868: IDLE: Improve configuration tests with mock Save. http://bugs.python.org/issue30868 opened by terry.reedy #30869: regrtest: Add .idlerc to saved_test_environment http://bugs.python.org/issue30869 opened by louielu #30870: IDLE: configdialog/fonts: change font when select by key up/do http://bugs.python.org/issue30870 opened by louielu #30871: Add a "python info" command somewhere to dump versions of all http://bugs.python.org/issue30871 opened by haypo #30872: Update curses docs to Python 3 http://bugs.python.org/issue30872 opened by serhiy.storchaka Most recent 15 issues with no replies (15) ========================================== #30872: Update curses docs to Python 3 http://bugs.python.org/issue30872 #30870: IDLE: configdialog/fonts: change font when select by key up/do http://bugs.python.org/issue30870 #30868: IDLE: Improve configuration tests with mock Save. 
http://bugs.python.org/issue30868 #30867: Add necessary macro that insure `HAVE_OPENSSL_VERIFY_PARAM` to http://bugs.python.org/issue30867 #30864: Compile failure for linux socket CAN support http://bugs.python.org/issue30864 #30863: Rewrite PyUnicode_AsWideChar() and PyUnicode_AsWideCharString( http://bugs.python.org/issue30863 #30856: unittest.TestResult.addSubTest should be called immediately af http://bugs.python.org/issue30856 #30853: IDLE: configdialog -- factor out Variable subclass http://bugs.python.org/issue30853 #30852: _PyObject_GC_UNTRACK corruption when call a lambda function wi http://bugs.python.org/issue30852 #30851: IDLE: configdialog -- fix tkinter Variables http://bugs.python.org/issue30851 #30846: [3.6] test_rapid_restart() of test_multiprocessing_fork fails http://bugs.python.org/issue30846 #30842: pyenv activate for bash and tcsh http://bugs.python.org/issue30842 #30841: A shadowing variable naming emitted for Python-ast.c http://bugs.python.org/issue30841 #30833: UnloadUserprofile displays the error "The handle is invalid" http://bugs.python.org/issue30833 #30831: Inconsistent or wrong documentation around Asynchronous Contex http://bugs.python.org/issue30831 Most recent 15 issues waiting for review (15) ============================================= #30872: Update curses docs to Python 3 http://bugs.python.org/issue30872 #30867: Add necessary macro that insure `HAVE_OPENSSL_VERIFY_PARAM` to http://bugs.python.org/issue30867 #30863: Rewrite PyUnicode_AsWideChar() and PyUnicode_AsWideCharString( http://bugs.python.org/issue30863 #30860: Consolidate stateful C globals under a single struct. http://bugs.python.org/issue30860 #30828: Out of bounds write in _asyncio_Future_remove_done_callback http://bugs.python.org/issue30828 #30817: Abort in PyErr_PrintEx() when no memory http://bugs.python.org/issue30817 #30814: Import dotted name as alias breaks with concurrency http://bugs.python.org/issue30814 #30808: Use _Py_atomic API for concurrency-sensitive signal state http://bugs.python.org/issue30808 #30747: _Py_atomic_* not actually atomic on Windows with MSVC http://bugs.python.org/issue30747 #30714: test_ssl fails with openssl 1.1.0f http://bugs.python.org/issue30714 #30711: getaddrinfo invalid port number http://bugs.python.org/issue30711 #30710: getaddrinfo raises OverflowError http://bugs.python.org/issue30710 #30696: infinite loop in PyRun_InteractiveLoopFlags() http://bugs.python.org/issue30696 #30695: add a nomemory_allocator to the _testcapi module http://bugs.python.org/issue30695 #30693: tarfile add uses random order http://bugs.python.org/issue30693 Top 10 most discussed issues (10) ================================= #30822: Python implementation of datetime module is not being tested c http://bugs.python.org/issue30822 29 msgs #30302: Improve .__repr__ implementation for datetime.timedelta http://bugs.python.org/issue30302 14 msgs #30861: StreamReader does not return reamaing and ready data buffer be http://bugs.python.org/issue30861 12 msgs #29796: [2.7] test_weakref hangs on Python 2.7 on Windows http://bugs.python.org/issue29796 9 msgs #23644: g++ module compile fails with ???_Atomic??? 
does not name a ty http://bugs.python.org/issue23644 8 msgs #29854: Segfault when readline history is more then 2 * history size http://bugs.python.org/issue29854 8 msgs #30779: IDLE: configdialog -- factor out Changes class http://bugs.python.org/issue30779 8 msgs #30844: selectors: Add urgent data to read event http://bugs.python.org/issue30844 8 msgs #30847: asyncio: selector_events: add_urgent() for urgent data to read http://bugs.python.org/issue30847 8 msgs #30814: Import dotted name as alias breaks with concurrency http://bugs.python.org/issue30814 7 msgs Issues closed (40) ================== #6691: Support for nested classes and function for pyclbr http://bugs.python.org/issue6691 closed by terry.reedy #19325: _osx_support imports many modules http://bugs.python.org/issue19325 closed by haypo #20042: Python Launcher, Windows, fails on scripts w/ non-latin names http://bugs.python.org/issue20042 closed by terry.reedy #20669: OpenBSD: socket.recvmsg tests fail with OSError: [Errno 40] Me http://bugs.python.org/issue20669 closed by haypo #29293: Missing parameter "n" on multiprocessing.Condition.notify() http://bugs.python.org/issue29293 closed by pitrou #30259: Test somehow that generated files are up to date: run make reg http://bugs.python.org/issue30259 closed by haypo #30315: test_ftplib.TestTLS_FTPClass: "[Errno 54] Connection reset by http://bugs.python.org/issue30315 closed by haypo #30319: Change socket.close() to ignore ECONNRESET http://bugs.python.org/issue30319 closed by pitrou #30328: test_ssl.test_connect_with_context(): ConnectionResetError on http://bugs.python.org/issue30328 closed by haypo #30330: test_socket.test_idna(): socket.gaierror: [Errno 11001] getadd http://bugs.python.org/issue30330 closed by haypo #30351: [2.7] regrtest hangs on Python 2.7 (test_threading?) http://bugs.python.org/issue30351 closed by haypo #30371: test_long_lines() fails randomly on AMD64 Windows7 SP1 3.x http://bugs.python.org/issue30371 closed by haypo #30441: os.environ raises RuntimeError: dictionary changed size during http://bugs.python.org/issue30441 closed by serhiy.storchaka #30448: test_subprocess creates a core dump on FreeBSD http://bugs.python.org/issue30448 closed by haypo #30532: email.policy.SMTP.fold() mangles long headers http://bugs.python.org/issue30532 closed by r.david.murray #30543: test_timeout fails on AMD64 FreeBSD CURRENT Debug 3.x: Connect http://bugs.python.org/issue30543 closed by haypo #30623: python-nightly import numpy fails since recently http://bugs.python.org/issue30623 closed by ned.deily #30649: test_utime_current_old() of test_os fails randomy on x86 Windo http://bugs.python.org/issue30649 closed by haypo #30651: test_poplib.test_stls_context() access violation on x86 Window http://bugs.python.org/issue30651 closed by haypo #30652: test_threading_not_handled() of test_socketserver hangs random http://bugs.python.org/issue30652 closed by haypo #30703: Non-reentrant signal handler (test_multiprocessing_forkserver http://bugs.python.org/issue30703 closed by pitrou #30726: [Windows] Warnings in elementtree due to new expat http://bugs.python.org/issue30726 closed by haypo #30739: pypi ssl errors [CERTIFICATE_VERIFY_FAILED] http://bugs.python.org/issue30739 closed by ned.deily #30741: https://www.pypi-mirrors.org/ error 503 http://bugs.python.org/issue30741 closed by ned.deily #30758: regrtest hangs sometimes on the master branch (test_pydoc? 
tes http://bugs.python.org/issue30758 closed by haypo #30759: [2.7] Fix python2 -m test --list-cases test_multibytecodec_sup http://bugs.python.org/issue30759 closed by haypo #30777: IDLE: configdialog -- add docstrings and improve comments http://bugs.python.org/issue30777 closed by terry.reedy #30789: Redesign PyCodeObject.co_extras to use a single memory block, http://bugs.python.org/issue30789 closed by serhiy.storchaka #30791: tkinter.Tk() adds suffix to window class name when launching m http://bugs.python.org/issue30791 closed by serhiy.storchaka #30795: OS X failures in test_site http://bugs.python.org/issue30795 closed by ned.deily #30804: bolen-dmg-3.x build-installer.py failed http://bugs.python.org/issue30804 closed by haypo #30818: Warning -- asyncore.socket_map was modified by test_ftplib on http://bugs.python.org/issue30818 closed by haypo #30827: Tweak order of links in https://www.python.org/downloads/sourc http://bugs.python.org/issue30827 closed by berker.peksag #30829: 'Cannot serialize socket object' after ssl.wrap_socket http://bugs.python.org/issue30829 closed by pitrou #30832: Remove own implementation for thread-local storage http://bugs.python.org/issue30832 closed by haypo #30838: re \w does not match some valid Unicode characters http://bugs.python.org/issue30838 closed by davidism #30843: [2.7] Lib/test/bisect.py conflicts with Lib/bisect.py when run http://bugs.python.org/issue30843 closed by haypo #30854: Compile error on Python/ceval.c without threads http://bugs.python.org/issue30854 closed by haypo #30855: [3.5] test_tk: test_use() of test_tkinter.test_widgets randoml http://bugs.python.org/issue30855 closed by haypo #30862: parent logger should also check the level http://bugs.python.org/issue30862 closed by vinay.sajip From nad at python.org Sat Jul 8 01:22:32 2017 From: nad at python.org (Ned Deily) Date: Sat, 8 Jul 2017 01:22:32 -0400 Subject: [Python-Dev] [RELEASE] Python 3.6.2rc2 is now available for testing Message-ID: <4B22E08B-9C6C-47F6-B908-CC7676D43B77@python.org> On behalf of the Python development community and the Python 3.6 release team, I would like to announce the availability of Python 3.6.2rc2. 3.6.2rc2 is the second release candidate for Python 3.6.2, the next maintenance release of Python 3.6. 3.6.2rc2 includes fixes for three security-related issues resolved since the previous release candidate; see the change log (link below). While 3.6.2rc2 is a preview release and, thus, not intended for production environments, we encourage you to explore it and provide feedback via the Python bug tracker (https://bugs.python.org). Please see "What?s New In Python 3.6" for more information: https://docs.python.org/3.6/whatsnew/3.6.html You can find Python 3.6.2rc2 here: https://www.python.org/downloads/release/python-362rc2/ and its change log here: https://docs.python.org/3.6/whatsnew/changelog.html#python-3-6-2-release-candidate-2 3.6.2 is now planned for final release on 2017-07-17 with the next maintenance release expected to follow in about 3 months. More information about the 3.6 release schedule can be found here: https://www.python.org/dev/peps/pep-0494/ -- Ned Deily nad at python.org -- [] From bhavishyagopesh at gmail.com Sun Jul 9 10:08:09 2017 From: bhavishyagopesh at gmail.com (Bhavishya) Date: Sun, 9 Jul 2017 19:38:09 +0530 Subject: [Python-Dev] Pure pickle bechmark. 
Message-ID: 

Hello,
1) I was going through the code of *python pickle* to look for any
optimization possibilities. But the only thing that I found very alarming was
again the import time (I tried lazy imports, but it didn't help much).

I found py3 to be ~45 times slower on *initial imports* (a very raw
measure, using "time.time()") as compared to py2 on a usual example.

py3->
./python -c '
favorite_color = { "lion": "yellow", "kitty": "red" }
pickle.dump( favorite_color, open( "save.p", "wb" ) )'
0.009715557098388672 (time taken to do initial imports... measured using
time.time())

py2->
./python -c '
favorite_color = { "lion": "yellow", "kitty": "red" }
pickle.dump( favorite_color, open( "save.p", "wb" ) )'
0.000236034393311 (time taken to do initial imports... measured using
time.time())

Do you have any thoughts/ideas on improving this?


Thank You.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From python at mrabarnett.plus.com Sun Jul 9 12:12:28 2017
From: python at mrabarnett.plus.com (MRAB)
Date: Sun, 9 Jul 2017 17:12:28 +0100
Subject: [Python-Dev] Pure pickle bechmark.
In-Reply-To: 
References: 
Message-ID: <1ccb6b93-65dd-0d4e-6d4b-1426aea6b65a@mrabarnett.plus.com>

On 2017-07-09 15:08, Bhavishya wrote:
> Hello,
> 1) I was going through the code of *python pickle* to look for any
> optimization possibilities. But the only thing that I found very alarming
> was again the import time (I tried lazy imports, but it didn't help
> much).
>
> I found py3 to be ~45 times slower on *initial imports* (a very raw
> measure, using "time.time()") as compared to py2 on a usual example.
>
> py3->
> ./python -c '
> favorite_color = { "lion": "yellow", "kitty": "red" }
> pickle.dump( favorite_color, open( "save.p", "wb" ) )'
> 0.009715557098388672 (time taken to do initial imports... measured using
> time.time())
>
> py2->
> ./python -c '
> favorite_color = { "lion": "yellow", "kitty": "red" }
> pickle.dump( favorite_color, open( "save.p", "wb" ) )'
> 0.000236034393311 (time taken to do initial imports... measured using
> time.time())
>
> Do you have any thoughts/ideas on improving this?
>
Python 3 is using Unicode strings, whereas Python 2 is using bytestrings.

What you show above are very short (in time) examples (less than 1/100 of a
second), so they're not that meaningful.

If you had timed pickling a substantial object (the same object in both
cases) and it took a significant amount of time and you found a significant
slowdown, then it would be worth looking into further.

From solipsis at pitrou.net Sun Jul 9 12:58:36 2017
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sun, 9 Jul 2017 18:58:36 +0200
Subject: [Python-Dev] Pure pickle bechmark.
References: 
Message-ID: <20170709185836.67138939@fsol>

Hi,

On Sun, 9 Jul 2017 19:38:09 +0530
Bhavishya wrote:
> Hello,
>
> 1) I was going through the code of *python pickle* to look for any
> optimization possibilities. But the only thing that I found very alarming
> was again the import time (I tried lazy imports, but it didn't help much).
>
> I found py3 to be ~45 times slower on *initial imports* (a very raw
> measure, using "time.time()") as compared to py2 on a usual example.

Can you explain how you measured exactly?

Regards

Antoine.

From songofacandy at gmail.com Sun Jul 9 19:17:30 2017
From: songofacandy at gmail.com (INADA Naoki)
Date: Mon, 10 Jul 2017 08:17:30 +0900
Subject: [Python-Dev] Pure pickle bechmark.
In-Reply-To: 
References: 
Message-ID: 

I don't know this is relating to your case.
When I saw Victor's report [1], I researched why Python 3 is slower than Python 2 on unpickle_pure_python benchmark. [1] https://mail.python.org/pipermail/speed/2017-February/000503.html And I found Python 2 and 3 uses different version of pickle format. Current Python 3 uses "framing" format. While unpickling, `read(1)` is very performance critical. Python 2 uses `cStringIO.read` which is implemented in C. On the other hand, Python 3 uses `_Unframer.read` which is implemented in Python. Since this is not relating to "first import time", I don't know this is what you want to optimize. (Since _pickle is used for normal case, pure Python unpickle performance is not a common problem). If you want to optimize it, _Unframer uses BytesIO internally and performance critical part may be able to call BytesIO.read directly instead of _Unframer.read. Regards, INADA Naoki On Sun, Jul 9, 2017 at 11:08 PM, Bhavishya wrote: > Hello, > > 1).I was going through the code of python pickle to search any optimization > possibility.But the only thing that I found very alarming was again the > import time(I tried with lazy-import but it didn't helped much.) > > I found py3 to be ~45 times slower on initial imports(very raw > measure..using "time." ) as compared to py2 on an usual example. > > py3-> > ./python -c ' > favorite_color = { "lion": "yellow", "kitty": "red" } > pickle.dump( favorite_color, open( "save.p", "wb" ) )' > 0.009715557098388672(time taken to do initial imports...measured using > time.time() ) > > py2-> > ./python -c ' > favorite_color = { "lion": "yellow", "kitty": "red" } > pickle.dump( favorite_color, open( "save.p", "wb" ) )' > 0.000236034393311(time taken to do initial imports...measured using > time.time() ) > > Do you have any thought/ideas on improving this? > > > Thank You. From victor.stinner at gmail.com Sun Jul 9 19:36:03 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 10 Jul 2017 01:36:03 +0200 Subject: [Python-Dev] Pure pickle bechmark. In-Reply-To: References: Message-ID: Wait. Are we talking about the C accelerator or the pure Python implementation of pickle on Python 3? Victor Le 10 juil. 2017 01:19, "INADA Naoki" a ?crit : > I don't know this is relating to your case. > > When I saw Victor's report [1], I researched why Python 3 is slower than > Python 2 on unpickle_pure_python benchmark. > > [1] https://mail.python.org/pipermail/speed/2017-February/000503.html > > > And I found Python 2 and 3 uses different version of pickle format. > > Current Python 3 uses "framing" format. While unpickling, `read(1)` is > very performance critical. Python 2 uses `cStringIO.read` which is > implemented in C. > On the other hand, Python 3 uses `_Unframer.read` which is implemented > in Python. > > Since this is not relating to "first import time", I don't know this > is what you want to optimize. > (Since _pickle is used for normal case, pure Python unpickle > performance is not a common > problem). > > If you want to optimize it, _Unframer uses BytesIO internally and > performance critical > part may be able to call BytesIO.read directly instead of _Unframer.read. > > Regards, > INADA Naoki > > > On Sun, Jul 9, 2017 at 11:08 PM, Bhavishya > wrote: > > Hello, > > > > 1).I was going through the code of python pickle to search any > optimization > > possibility.But the only thing that I found very alarming was again the > > import time(I tried with lazy-import but it didn't helped much.) 
> > > > I found py3 to be ~45 times slower on initial imports(very raw > > measure..using "time." ) as compared to py2 on an usual example. > > > > py3-> > > ./python -c ' > > favorite_color = { "lion": "yellow", "kitty": "red" } > > pickle.dump( favorite_color, open( "save.p", "wb" ) )' > > 0.009715557098388672(time taken to do initial imports...measured using > > time.time() ) > > > > py2-> > > ./python -c ' > > favorite_color = { "lion": "yellow", "kitty": "red" } > > pickle.dump( favorite_color, open( "save.p", "wb" ) )' > > 0.000236034393311(time taken to do initial imports...measured using > > time.time() ) > > > > Do you have any thought/ideas on improving this? > > > > > > Thank You. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > victor.stinner%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From songofacandy at gmail.com Sun Jul 9 19:39:54 2017 From: songofacandy at gmail.com (INADA Naoki) Date: Mon, 10 Jul 2017 08:39:54 +0900 Subject: [Python-Dev] Pure pickle bechmark. In-Reply-To: References: Message-ID: I said about pure Python implementation (unpickle_pure_python), because mail title is "Pure pickle bechmark". INADA Naoki On Mon, Jul 10, 2017 at 8:36 AM, Victor Stinner wrote: > Wait. Are we talking about the C accelerator or the pure Python > implementation of pickle on Python 3? > > Victor > > Le 10 juil. 2017 01:19, "INADA Naoki" a ?crit : >> >> I don't know this is relating to your case. >> >> When I saw Victor's report [1], I researched why Python 3 is slower than >> Python 2 on unpickle_pure_python benchmark. >> >> [1] https://mail.python.org/pipermail/speed/2017-February/000503.html >> >> >> And I found Python 2 and 3 uses different version of pickle format. >> >> Current Python 3 uses "framing" format. While unpickling, `read(1)` is >> very performance critical. Python 2 uses `cStringIO.read` which is >> implemented in C. >> On the other hand, Python 3 uses `_Unframer.read` which is implemented >> in Python. >> >> Since this is not relating to "first import time", I don't know this >> is what you want to optimize. >> (Since _pickle is used for normal case, pure Python unpickle >> performance is not a common >> problem). >> >> If you want to optimize it, _Unframer uses BytesIO internally and >> performance critical >> part may be able to call BytesIO.read directly instead of _Unframer.read. >> >> Regards, >> INADA Naoki >> >> >> On Sun, Jul 9, 2017 at 11:08 PM, Bhavishya >> wrote: >> > Hello, >> > >> > 1).I was going through the code of python pickle to search any >> > optimization >> > possibility.But the only thing that I found very alarming was again the >> > import time(I tried with lazy-import but it didn't helped much.) >> > >> > I found py3 to be ~45 times slower on initial imports(very raw >> > measure..using "time." ) as compared to py2 on an usual example. 
>> > >> > py3-> >> > ./python -c ' >> > favorite_color = { "lion": "yellow", "kitty": "red" } >> > pickle.dump( favorite_color, open( "save.p", "wb" ) )' >> > 0.009715557098388672(time taken to do initial imports...measured using >> > time.time() ) >> > >> > py2-> >> > ./python -c ' >> > favorite_color = { "lion": "yellow", "kitty": "red" } >> > pickle.dump( favorite_color, open( "save.p", "wb" ) )' >> > 0.000236034393311(time taken to do initial imports...measured using >> > time.time() ) >> > >> > Do you have any thought/ideas on improving this? >> > >> > >> > Thank You. >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: >> https://mail.python.org/mailman/options/python-dev/victor.stinner%40gmail.com From victor.stinner at gmail.com Sun Jul 9 19:47:16 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 10 Jul 2017 01:47:16 +0200 Subject: [Python-Dev] Pure pickle bechmark. In-Reply-To: References: Message-ID: Please explain how to reproduce your benchmark. Maybe write a shell script? Victor Le 9 juil. 2017 17:49, "Bhavishya" a ?crit : > Hello, > > 1).I was going through the code of *python pickle* to search any > optimization possibility.But the only thing that I found very alarming was > again the import time(I tried with lazy-import but it didn't helped much.) > > I found py3 to be ~45 times slower on* initial imports(very raw > measure..using "time." ) *as compared to py2 on an usual example. > > py3-> > ./python -c ' > favorite_color = { "lion": "yellow", "kitty": "red" } > pickle.dump( favorite_color, open( "save.p", "wb" ) )' > 0.009715557098388672(time taken to do initial imports...measured using > *time.time()* ) > > py2-> > ./python -c ' > favorite_color = { "lion": "yellow", "kitty": "red" } > pickle.dump( favorite_color, open( "save.p", "wb" ) )' > 0.000236034393311(time taken to do initial imports...measured using > *time.time()* ) > > Do you have any thought/ideas on improving this? > > > Thank You. > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > victor.stinner%40gmail.com > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From victor.stinner at gmail.com Sun Jul 9 20:35:37 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 10 Jul 2017 02:35:37 +0200 Subject: [Python-Dev] Pure pickle bechmark. In-Reply-To: References: Message-ID: (Oops, I didn't notice that we started to talk off the list, let's discuss that in python-dev please.) I don't see the point of optimizing "pickle/unpickle pure python" benchmark on Python 3. This benchmark doesn't make sense on Python 3, since I don't know anyone using the pure Python pickle. The C accelerator is now used by default. I already proposed to remove this benchmark: https://mail.python.org/pipermail/speed/2017-April/000554.html *but* Antoine Pitrou mentionned that the cloudpickle project uses it. Maybe we should try to understand what's wrong with _pickle (C module) for cloudpickle? Victor 2017-07-10 2:10 GMT+02:00 Bhavishya : > I was working on the two regressed benchmarks (i.e. pickle/unpickle > pure-python), and as it was a case with other benchmarks....that performance > is affected by import ...I thought that could be a case with pickle.py > too. 
And thus tried adding the above patch to Lib/pickle.py to measure the > initial import time. > > I haven't tried it for any practical use-case. > > > On Mon, Jul 10, 2017 at 5:27 AM, Victor Stinner > wrote: >> >> Sorry, I don't understand the direct link between the import time of 4 >> modules and the pickle module. Can you please elaborate? >> >> What are you trying to optimize? >> >> What is your use case? >> >> Victor From solipsis at pitrou.net Mon Jul 10 05:13:13 2017 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 10 Jul 2017 11:13:13 +0200 Subject: [Python-Dev] Extendability of C vs Python pickle References: Message-ID: <20170710111313.129f67db@fsol> On Mon, 10 Jul 2017 02:35:37 +0200 Victor Stinner wrote: > > I already proposed to remove this benchmark: > https://mail.python.org/pipermail/speed/2017-April/000554.html > > *but* Antoine Pitrou mentionned that the cloudpickle project uses it. > > Maybe we should try to understand what's wrong with _pickle (C module) > for cloudpickle? That's a good question, Victor. cloudpickle uses three hooks inside pickle.py's Pickler: - the "dispatch" dictionary - overriding the "save_global" method to support saving more objects (such as closures, etc.) - overriding the "save_reduce" method; this one doesn't seem really necessary, perhaps some leftover from previous attempts _pickle.c's Pickler does seem to allow a custom "dispatch_table", but it doesn't allow overriding "save_global". Of course, if _pickle.c were improved to allow such extensions, it would suddenly allow cloudpickle to be much more performant! Regards Antoine. From artieua at gmail.com Mon Jul 10 10:37:40 2017 From: artieua at gmail.com (Artem Muterko) Date: Mon, 10 Jul 2017 17:37:40 +0300 Subject: [Python-Dev] Improve test coverage for standard library Message-ID: Good day, I've noticed that test coverage of standard library tools can be improved and wanted to ask which module is better to start with? I'm kindly asking to point to the module which would be good to start writing tests for and also easy to review for other contributors. Best regards, Artem Muterko -------------- next part -------------- An HTML attachment was scrubbed... URL: From brett at python.org Mon Jul 10 14:19:28 2017 From: brett at python.org (Brett Cannon) Date: Mon, 10 Jul 2017 18:19:28 +0000 Subject: [Python-Dev] Improve test coverage for standard library In-Reply-To: References: Message-ID: In general the answer to helping with code coverage is "whatever module motivates you to help". :) So unless a core dev has a specific module that they want to help you write tests for then it's whatever you want to work on. And for easy reference, the code coverage report can be found at https://codecov.io/gh/python/cpython (although it is somewhat inaccurate for any module imported at startup). On Mon, 10 Jul 2017 at 07:54 Artem Muterko wrote: > Good day, > > I've noticed that test coverage of standard library tools can be improved > and wanted to ask which module is better to start with? > > I'm kindly asking to point to the module which would be good to start > writing tests for and also easy to review for other contributors. > > Best regards, > Artem Muterko > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/brett%40python.org > -------------- next part -------------- An HTML attachment was scrubbed... 
URL:

From larry at hastings.org Wed Jul 12 09:09:50 2017
From: larry at hastings.org (Larry Hastings)
Date: Wed, 12 Jul 2017 15:09:50 +0200
Subject: [Python-Dev] Should I make a 3.4.7rc1 next weekend?
Message-ID: <20ec439d-5735-1285-5380-ae22b57c50c6@hastings.org>

I'm scheduled to tag and release 3.5.4rc1 next weekend.  I've been
releasing 3.4 and 3.5 at the same time for the last year; this is
convenient for me as it halves the frequency with which I have to put on
the "release manager" hat.

There are currently no scheduled dates to release 3.4.7.  The reason being
that until very recently there was almost no work done in 3.4 since 3.4.6
was tagged.  But!  The reason for /that/ was because of a change in the
workflow: once we switched to GitHub, for branches that are in
security-fixes-only mode, only the Release Manager is allowed to accept
PRs into that branch.  It turned out there were a bunch of PRs waiting
for my approval.

After a flurry of accepted PRs, I have now accrued about ten fresh
security fixes in the 3.4 branch.  (Mostly from Victor, but also Serhiy,
and one from Barry--thanks everyone!)  There are now no outstanding
security fix PRs against 3.4.

Since I'm releasing 3.5.4rc1 next weekend, I wouldn't mind /also/
releasing 3.4.7rc1 next weekend.  That would put 3.4.7 final the same day
as 3.5.4 final: just over three weeks from now, releasing on Sunday
August 5.

I realize it's not much notice, and that's normally not how we do things
in the CPython world.  (Sorry for the short notice--it's my fault for not
adjusting to the new workflow quickly enough.)

Anyway the point of this email is to call for a vote.  Which of these
statements do you agree with:

* Larry should tag and release 3.4.7rc1 next weekend.

* Larry should schedule 3.4.7rc1 for a month from now, to give people
  time to get their work in.

In particular, Victor and Serhiy, I'm interested in your votes.  You both
get veto powers for the short notice--if either of you say "do it a month
from now" then it'll be a month from now.

Also, if anybody has security fixes you want to get in to the next
release of 3.4, but you haven't made a PR yet, please reply and describe
them.  (Please reply to list if appropriate, but if it should be kept
quiet please reply to me directly.)

Braising in my own juices at EuroPython,

//arry/
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From larry at hastings.org Wed Jul 12 09:12:50 2017
From: larry at hastings.org (Larry Hastings)
Date: Wed, 12 Jul 2017 15:12:50 +0200
Subject: [Python-Dev] Reminder: 3.5.4rc1 will be tagged next Saturday, July 22 2017
Message-ID:

Just a quick reminder.  I'll be tagging 3.5.4rc1 next Saturday, July 22.
3.5.4 final will be the last release of 3.5 that accepts bugfixes; after
that, the 3.5 branch will transition to security-fixes-only mode.  If you
have bugfixes you want to ship with 3.5.4, please get them committed in
the next nine days.

Happy hacking,

//arry/
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From victor.stinner at gmail.com Wed Jul 12 09:18:42 2017
From: victor.stinner at gmail.com (Victor Stinner)
Date: Wed, 12 Jul 2017 15:18:42 +0200
Subject: [Python-Dev] [python-committers] Should I make a 3.4.7rc1 next weekend?
In-Reply-To: <20ec439d-5735-1285-5380-ae22b57c50c6@hastings.org>
References: <20ec439d-5735-1285-5380-ae22b57c50c6@hastings.org>
Message-ID:

I would love to have a new 3.4 release including all security fixes, sure!
It would reduce the number of known vulnerability in Python 3.4: http://python-security.readthedocs.io/vulnerabilities.html 2017-07-12 15:09 GMT+02:00 Larry Hastings : > After a flurry of accepted PRs, I have now accrued about ten fresh security > fixes in the 3.4 branch. (Mostly from Victor, but also Serhiy, and one from > Barry--thanks everyone!) There are now no outstanding security fix PRs > against 3.4. Thanks for merging them ;-) I would like to see my "[3.4] Backport CI config from master" PR merged into 3.4 to get at least a check from Travis and AppVeyor that there is no major regression on Linux and Windows: https://github.com/python/cpython/pull/2475 If I recall correctly, it would be the first time that we have a CI for a branch in security-fix only mode, no? Victor From victor.stinner at gmail.com Thu Jul 13 11:33:45 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Thu, 13 Jul 2017 17:33:45 +0200 Subject: [Python-Dev] Articles on my contributions to CPython during 2017 Q1 and Q2 Message-ID: Hi, I wrote a serie of new articles on my contributions to CPython during 2017 Q1 and Q2. "My contributions to CPython during 2017 Q1" https://haypo.github.io/contrib-cpython-2017q1.html "New Python test.bisect tool" https://haypo.github.io/python-test-bisect.html "Work on Python buildbots, 2017 Q2" https://haypo.github.io/python-buildbots-2017q2.html "My contributions to CPython during 2017 Q2 (part 1)" https://haypo.github.io/contrib-cpython-2017q2-part1.html "My contributions to CPython during 2017 Q2 (part 2)" https://haypo.github.io/contrib-cpython-2017q2-part2.html "My contributions to CPython during 2017 Q2 (part 3)" https://haypo.github.io/contrib-cpython-2017q2-part3.html Good reading ;-) Victor From benhoyt at gmail.com Fri Jul 14 07:37:45 2017 From: benhoyt at gmail.com (Ben Hoyt) Date: Fri, 14 Jul 2017 07:37:45 -0400 Subject: [Python-Dev] Articles on my contributions to CPython during 2017 Q1 and Q2 In-Reply-To: References: Message-ID: Wow, amazing work. The Stinnerbot strikes again! A lot of great optimizations and bugfixes. Speaking of optimizations, I just wrote some code which takes 12s on Python 2.7 and 5s on Python 3.5. so we're doing something right! I might post about it shortly. -Ben On Jul 13, 2017 11:34 AM, "Victor Stinner" wrote: Hi, I wrote a serie of new articles on my contributions to CPython during 2017 Q1 and Q2. "My contributions to CPython during 2017 Q1" https://haypo.github.io/contrib-cpython-2017q1.html "New Python test.bisect tool" https://haypo.github.io/python-test-bisect.html "Work on Python buildbots, 2017 Q2" https://haypo.github.io/python-buildbots-2017q2.html "My contributions to CPython during 2017 Q2 (part 1)" https://haypo.github.io/contrib-cpython-2017q2-part1.html "My contributions to CPython during 2017 Q2 (part 2)" https://haypo.github.io/contrib-cpython-2017q2-part2.html "My contributions to CPython during 2017 Q2 (part 3)" https://haypo.github.io/contrib-cpython-2017q2-part3.html Good reading ;-) Victor _______________________________________________ Python-Dev mailing list Python-Dev at python.org https://mail.python.org/mailman/listinfo/python-dev Unsubscribe: https://mail.python.org/mailman/options/python-dev/ benhoyt%40gmail.com -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From victor.stinner at gmail.com Fri Jul 14 09:33:05 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Fri, 14 Jul 2017 15:33:05 +0200 Subject: [Python-Dev] Articles on my contributions to CPython during 2017 Q1 and Q2 In-Reply-To: References: Message-ID: 2017-07-14 13:37 GMT+02:00 Ben Hoyt : > Wow, amazing work. The Stinnerbot strikes again! Thanks. > A lot of great optimizations and bugfixes. Speaking of optimizations, I just > wrote some code which takes 12s on Python 2.7 and 5s on Python 3.5. so we're > doing something right! I might post about it shortly. Hum, I'm curious to see which kind of code becomes so much faster on Python 3. Victor From p.f.moore at gmail.com Fri Jul 14 10:20:47 2017 From: p.f.moore at gmail.com (Paul Moore) Date: Fri, 14 Jul 2017 15:20:47 +0100 Subject: [Python-Dev] Articles on my contributions to CPython during 2017 Q1 and Q2 In-Reply-To: References: Message-ID: On 14 July 2017 at 14:33, Victor Stinner wrote: >> A lot of great optimizations and bugfixes. Speaking of optimizations, I just >> wrote some code which takes 12s on Python 2.7 and 5s on Python 3.5. so we're >> doing something right! I might post about it shortly. > > Hum, I'm curious to see which kind of code becomes so much faster on Python 3. time.sleep(5 if sys.version_info >= (3,) else 12) :-) Paul From benhoyt at gmail.com Fri Jul 14 10:53:10 2017 From: benhoyt at gmail.com (Ben Hoyt) Date: Fri, 14 Jul 2017 10:53:10 -0400 Subject: [Python-Dev] Articles on my contributions to CPython during 2017 Q1 and Q2 In-Reply-To: References: Message-ID: Yeah, it was surprising to me too. I thought it'd be faster, but not that much. I did some quick cProfile tests, but that didn't show anything, and I think it's improvements to the bytecode interpreter and various bytecode instructions. (This particular test hammers the bytecode interpreter.) I'll post details in the next week or so. -Ben On Fri, Jul 14, 2017 at 9:33 AM, Victor Stinner wrote: > 2017-07-14 13:37 GMT+02:00 Ben Hoyt : > > Wow, amazing work. The Stinnerbot strikes again! > > Thanks. > > > A lot of great optimizations and bugfixes. Speaking of optimizations, I > just > > wrote some code which takes 12s on Python 2.7 and 5s on Python 3.5. so > we're > > doing something right! I might post about it shortly. > > Hum, I'm curious to see which kind of code becomes so much faster on > Python 3. > > Victor > -------------- next part -------------- An HTML attachment was scrubbed... URL: From status at bugs.python.org Fri Jul 14 12:09:13 2017 From: status at bugs.python.org (Python tracker) Date: Fri, 14 Jul 2017 18:09:13 +0200 (CEST) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20170714160913.E6BDA56AF5@psf.upfronthosting.co.za> ACTIVITY SUMMARY (2017-07-07 - 2017-07-14) Python tracker at http://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. 
Issues counts and deltas: open 6042 (+21) closed 36641 (+37) total 42683 (+58) Open issues with patches: 2342 Issues opened (39) ================== #26617: Assertion failed in gc with __del__ and weakref http://bugs.python.org/issue26617 reopened by serhiy.storchaka #29766: --with-lto still implied by --enable-optimizations in Python 2 http://bugs.python.org/issue29766 reopened by Arfrever #30296: Remove unnecessary tuples, lists, sets, and dicts from Lib http://bugs.python.org/issue30296 reopened by serhiy.storchaka #30758: test_pydoc hangs sometimes on 3.6 and master branches http://bugs.python.org/issue30758 reopened by haypo #30814: Import dotted name as alias breaks with concurrency http://bugs.python.org/issue30814 reopened by haypo #30874: unittest execute tests twice in some conditions http://bugs.python.org/issue30874 opened by ale5000 #30876: SystemError on importing module from unloaded package http://bugs.python.org/issue30876 opened by serhiy.storchaka #30877: possibe typo in json/scanner.py http://bugs.python.org/issue30877 opened by c-fos #30882: Built-in list disappeared from Python 2.7 intersphinx inventor http://bugs.python.org/issue30882 opened by Christoph.Deil #30883: test_urllib2net failed on s390x Debian 3.6: ftp.debian.org err http://bugs.python.org/issue30883 opened by haypo #30884: regrtest -jN --timeout=TIMEOUT should kill child process runni http://bugs.python.org/issue30884 opened by haypo #30885: test_subprocess hangs on AMD64 Windows8.1 Refleaks 3.x http://bugs.python.org/issue30885 opened by haypo #30888: import class not isinstance of the class http://bugs.python.org/issue30888 opened by ????????? #30889: distutils extra_link_args not working because it is added to c http://bugs.python.org/issue30889 opened by Liu Cailiang #30891: importlib: _find_and_load() race condition on sys.modules[name http://bugs.python.org/issue30891 opened by haypo #30892: _elementtree: assertion error if stdlib copy module is overrid http://bugs.python.org/issue30892 opened by haypo #30893: Expose importlib._bootstrap._ModuleLockManager in importlib.ma http://bugs.python.org/issue30893 opened by brett.cannon #30897: Add a is_mount() to pathlib http://bugs.python.org/issue30897 opened by cooperlees #30898: SSL cert failure running make test during Python 3.6 install http://bugs.python.org/issue30898 opened by Ben Johnston #30903: IPv4Network's hostmask attribute doesn't returns string value http://bugs.python.org/issue30903 opened by Abhijit Mamarde #30904: Python 3 logging HTTPHandler sends duplicate Host header http://bugs.python.org/issue30904 opened by lhelwerd #30905: Embedding should have public API for interactive mode http://bugs.python.org/issue30905 opened by steveire #30907: speed up comparisons to self for built-in containers http://bugs.python.org/issue30907 opened by wbolster #30908: test_os.TestSendfile.test_keywords() leaks dangling threads http://bugs.python.org/issue30908 opened by haypo #30909: ServerProxy should not make requests with malformed XML http://bugs.python.org/issue30909 opened by Alex Corcoles #30910: Add -fexception to ppc64le build http://bugs.python.org/issue30910 opened by brunoalr #30912: python 3 git master fails to find libffi and build _ctypes on http://bugs.python.org/issue30912 opened by shlomif #30915: distutils sometimes assumes wrong C compiler http://bugs.python.org/issue30915 opened by moxian #30916: Pre-build OpenSSL and Tcl/Tk for Windows http://bugs.python.org/issue30916 opened by steve.dower #30917: IDLE: Add 
idlelib.config.IdleConf unittest http://bugs.python.org/issue30917 opened by louielu #30918: Unable to launch IDLE in windows 7 http://bugs.python.org/issue30918 opened by trencyclo #30919: Shared Array Memory Allocation Regression http://bugs.python.org/issue30919 opened by dtasev #30920: Sequence Matcher from diff lib is not implementing longest com http://bugs.python.org/issue30920 opened by Syam Mohan #30923: Add -Wimplicit-fallthrough=0 to Makefile ? http://bugs.python.org/issue30923 opened by matrixise #30924: RPM build doc_files needs files separated into separate lines http://bugs.python.org/issue30924 opened by warthog9 #30925: RPM build lacks ability to include other files similar to doc_ http://bugs.python.org/issue30925 opened by warthog9 #30926: KeyError with cgitb inspecting exception in generator expressi http://bugs.python.org/issue30926 opened by jason.coombs #30928: Copy modified blurbs to idlelib/NEWS.txt http://bugs.python.org/issue30928 opened by terry.reedy #30929: AttributeErrors after import in multithreaded environment http://bugs.python.org/issue30929 opened by boytsovea Most recent 15 issues with no replies (15) ========================================== #30929: AttributeErrors after import in multithreaded environment http://bugs.python.org/issue30929 #30926: KeyError with cgitb inspecting exception in generator expressi http://bugs.python.org/issue30926 #30925: RPM build lacks ability to include other files similar to doc_ http://bugs.python.org/issue30925 #30924: RPM build doc_files needs files separated into separate lines http://bugs.python.org/issue30924 #30918: Unable to launch IDLE in windows 7 http://bugs.python.org/issue30918 #30915: distutils sometimes assumes wrong C compiler http://bugs.python.org/issue30915 #30912: python 3 git master fails to find libffi and build _ctypes on http://bugs.python.org/issue30912 #30910: Add -fexception to ppc64le build http://bugs.python.org/issue30910 #30905: Embedding should have public API for interactive mode http://bugs.python.org/issue30905 #30904: Python 3 logging HTTPHandler sends duplicate Host header http://bugs.python.org/issue30904 #30903: IPv4Network's hostmask attribute doesn't returns string value http://bugs.python.org/issue30903 #30898: SSL cert failure running make test during Python 3.6 install http://bugs.python.org/issue30898 #30889: distutils extra_link_args not working because it is added to c http://bugs.python.org/issue30889 #30874: unittest execute tests twice in some conditions http://bugs.python.org/issue30874 #30872: Update curses docs to Python 3 http://bugs.python.org/issue30872 Most recent 15 issues waiting for review (15) ============================================= #30891: importlib: _find_and_load() race condition on sys.modules[name http://bugs.python.org/issue30891 #30877: possibe typo in json/scanner.py http://bugs.python.org/issue30877 #30876: SystemError on importing module from unloaded package http://bugs.python.org/issue30876 #30872: Update curses docs to Python 3 http://bugs.python.org/issue30872 #30867: Add necessary macro that insure `HAVE_OPENSSL_VERIFY_PARAM` to http://bugs.python.org/issue30867 #30863: Rewrite PyUnicode_AsWideChar() and PyUnicode_AsWideCharString( http://bugs.python.org/issue30863 #30860: Consolidate stateful C globals under a single struct. 
http://bugs.python.org/issue30860 #30828: Out of bounds write in _asyncio_Future_remove_done_callback http://bugs.python.org/issue30828 #30817: Abort in PyErr_PrintEx() when no memory http://bugs.python.org/issue30817 #30808: Use _Py_atomic API for concurrency-sensitive signal state http://bugs.python.org/issue30808 #30747: _Py_atomic_* not actually atomic on Windows with MSVC http://bugs.python.org/issue30747 #30714: test_ssl fails with openssl 1.1.0f http://bugs.python.org/issue30714 #30711: getaddrinfo invalid port number http://bugs.python.org/issue30711 #30710: getaddrinfo raises OverflowError http://bugs.python.org/issue30710 #30696: infinite loop in PyRun_InteractiveLoopFlags() http://bugs.python.org/issue30696 Top 10 most discussed issues (10) ================================= #30870: IDLE: configdialog/fonts: change font when select by key up/do http://bugs.python.org/issue30870 19 msgs #30891: importlib: _find_and_load() race condition on sys.modules[name http://bugs.python.org/issue30891 18 msgs #30876: SystemError on importing module from unloaded package http://bugs.python.org/issue30876 13 msgs #30919: Shared Array Memory Allocation Regression http://bugs.python.org/issue30919 11 msgs #30730: [security] Injecting environment variable in subprocess on Win http://bugs.python.org/issue30730 8 msgs #27099: IDLE: turn builting extensions into regular modules http://bugs.python.org/issue27099 7 msgs #27584: New addition of vSockets to the python socket module http://bugs.python.org/issue27584 7 msgs #30907: speed up comparisons to self for built-in containers http://bugs.python.org/issue30907 7 msgs #8231: Unable to run IDLE without write-access to home directory http://bugs.python.org/issue8231 6 msgs #30171: Emit ResourceWarning in multiprocessing Queue destructor http://bugs.python.org/issue30171 6 msgs Issues closed (41) ================== #10438: list an example for calling static methods from WITHIN classes http://bugs.python.org/issue10438 closed by serhiy.storchaka #13220: print function unable while multiprocessing.Process is being r http://bugs.python.org/issue13220 closed by terry.reedy #22607: find by dichotomy the failing test http://bugs.python.org/issue22607 closed by haypo #25746: test_unittest failure in leaks searching mode http://bugs.python.org/issue25746 closed by serhiy.storchaka #29464: Specialize FASTCALL for functions with positional-only paramet http://bugs.python.org/issue29464 closed by serhiy.storchaka #29591: expat 2.2.0: Various security vulnerabilities in bundled expat http://bugs.python.org/issue29591 closed by larry #29812: test for token.py, and consistency tests for tokenize.py http://bugs.python.org/issue29812 closed by ammar2 #29854: Segfault when readline history is more then 2 * history size http://bugs.python.org/issue29854 closed by berker.peksag #30251: Windows Visual Studio solution does not have an install target http://bugs.python.org/issue30251 closed by steve.dower #30444: Add ability to change "-- more --" text in pager module http://bugs.python.org/issue30444 closed by Gautam krishna.R #30731: Use correct executable manifest for windows http://bugs.python.org/issue30731 closed by steve.dower #30779: IDLE: configdialog -- factor out Changes class http://bugs.python.org/issue30779 closed by terry.reedy #30801: shoutdown process error with python 3.4 and pyqt/PySide http://bugs.python.org/issue30801 closed by larry #30823: os.startfile("") craches Python 2.7, 3.4 in Windows 7 32 bit i http://bugs.python.org/issue30823 closed by 
terry.reedy #30837: Mac OS High Sierra Beta - Python Crash http://bugs.python.org/issue30837 closed by ned.deily #30851: IDLE: configdialog -- fix tkinter Variables http://bugs.python.org/issue30851 closed by terry.reedy #30859: Can't install Python for Windows 3.6.1 on multiple profiles http://bugs.python.org/issue30859 closed by terry.reedy #30873: `SystemError: returned NULL http://bugs.python.org/issue30873 closed by haypo #30875: round(number[, digits]) does not return value with >12 decima http://bugs.python.org/issue30875 closed by john Forgue #30878: The staticmethod doesn't properly reject keyword arguments http://bugs.python.org/issue30878 closed by serhiy.storchaka #30879: os.listdir(bytes) gives a list of bytes, but os.listdir(buffer http://bugs.python.org/issue30879 closed by serhiy.storchaka #30880: PCG random number generator http://bugs.python.org/issue30880 closed by rhettinger #30881: IDLE: add docstrings to browser.py http://bugs.python.org/issue30881 closed by terry.reedy #30886: multiprocessing.Queue.join_thread() does nothing if created an http://bugs.python.org/issue30886 closed by haypo #30887: Syntax checking confuses Try: class_instance_name as ... is us http://bugs.python.org/issue30887 closed by steven.daprano #30890: IDLE: Input method error in comment with Korean language http://bugs.python.org/issue30890 closed by terry.reedy #30894: Python 3.6.1 String Literal Error Not Going to sys.stderr http://bugs.python.org/issue30894 closed by steven.daprano #30895: Decimal arithmetic sum error http://bugs.python.org/issue30895 closed by zach.ware #30896: BytesWarning in re module when compiling certain bytes pattern http://bugs.python.org/issue30896 closed by serhiy.storchaka #30899: IDLE: Add idle config parser unittest http://bugs.python.org/issue30899 closed by terry.reedy #30900: IDLE: Fix configdialog should use wm_withdraw http://bugs.python.org/issue30900 closed by louielu #30901: "503 HTTP ERROR" on attempt to access some points of python li http://bugs.python.org/issue30901 closed by brett.cannon #30902: Python-Redmine plugin not seeing python install MacOS http://bugs.python.org/issue30902 closed by berker.peksag #30906: os.path.join misjoins paths http://bugs.python.org/issue30906 closed by paul.moore #30911: Warning in _json.c on platforms where char is unsigned http://bugs.python.org/issue30911 closed by haypo #30913: IDLE: Document tk Vars, attributes, methods by tab page http://bugs.python.org/issue30913 closed by terry.reedy #30914: test_alpn_protocols (test.test_ssl.ThreadedTests) fails on Fed http://bugs.python.org/issue30914 closed by ned.deily #30921: Process in not get killed using subprocess.call() in python th http://bugs.python.org/issue30921 closed by gvanrossum #30922: Process in not get killed using subprocess.call() in python th http://bugs.python.org/issue30922 closed by matrixise #30927: re.sub() does not work correctly on '.' pattern and \n http://bugs.python.org/issue30927 closed by mrabarnett #30930: Element wise multiplication issue http://bugs.python.org/issue30930 closed by mark.dickinson From tjreedy at udel.edu Fri Jul 14 13:59:37 2017 From: tjreedy at udel.edu (Terry Reedy) Date: Fri, 14 Jul 2017 13:59:37 -0400 Subject: [Python-Dev] Articles on my contributions to CPython during 2017 Q1 and Q2 In-Reply-To: References: Message-ID: On 7/13/2017 11:33 AM, Victor Stinner wrote: > I wrote a serie of new articles on my contributions to CPython during > 2017 Q1 and Q2. ... 
> "Work on Python buildbots, 2017 Q2" > https://haypo.github.io/python-buildbots-2017q2.html "During this quarter, I tried to mark "easy" issues using a "[EASY]" tag in their title and the "easy" or "easy C" keyword. ... I mentored St?phane Wirtel and Louie Lu to fix issues (easy or not)." Thank you for doing this. Louie Lu has made very helpful contributions to IDLE also. -- Terry Jan Reedy From larry at hastings.org Sat Jul 15 08:26:10 2017 From: larry at hastings.org (Larry Hastings) Date: Sat, 15 Jul 2017 14:26:10 +0200 Subject: [Python-Dev] Announcing the schedule for 3.4.7 In-Reply-To: <20ec439d-5735-1285-5380-ae22b57c50c6@hastings.org> References: <20ec439d-5735-1285-5380-ae22b57c50c6@hastings.org> Message-ID: <882df9f6-fbb3-9e2b-c63a-af00028361f0@hastings.org> In reply to my proposal of a few days ago, I received two +1s and no other feedback. So I'm going to issue 3.4.7 with relatively-little notice.t Here's the schedule for 3.4.7; it mirrors the schedule for 3.5.4. Saturday, July 22, 2017 - tag 3.4.7 rc1 Sunday, July 23, 2017 - release 3.4.7 rc1 Sunday, August 6, 2017 - tag 3.4.7 final Monday, August 7, 2017 - release 3.4.7 final Cheers, //arry/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From maxmoroz at gmail.com Sat Jul 15 03:01:36 2017 From: maxmoroz at gmail.com (Max Moroz) Date: Sat, 15 Jul 2017 00:01:36 -0700 Subject: [Python-Dev] deque implementation question In-Reply-To: References: Message-ID: What would be the disadvantage of implementing collections.deque as a circular array (rather than a doubly linked list of blocks)? My naive thinking was that a circular array would maintain the current O(1) append/pop from either side, and would improve index lookup in the middle from O(n) to O(1). What am I missing? The insertion/removal of an arbitrary item specified by a pointer would increase from constant time to linear, but since we don't have pointers this is a moot point. Of course when the circular array is full, it will need to be reallocated, but the amortized cost of that is still O(1). (Moreover, for a bounded deque, there's even an option of preallocation, which would completely eliminate reallocations.) Thanks Max -------------- next part -------------- An HTML attachment was scrubbed... URL: From nad at python.org Sat Jul 15 17:51:37 2017 From: nad at python.org (Ned Deily) Date: Sat, 15 Jul 2017 17:51:37 -0400 Subject: [Python-Dev] Python 3.3.7 release schedule and end-of-life Message-ID: <32CB89BE-A49E-4888-B8C7-4A8CAB8F15F1@python.org> Python 3.3 is fast approaching its end-of-life date, 2017-09-29. Per our release policy, that date is five years after the initial release of 3.3, 3.3.0 final on 2012-09-29. Note that 3.3 has been in security-fix only mode since the 2014-03-08 release of 3.3.5. It has been a while since we produced a 3.3.x security-fix release and, due to his commitments elsewhere, Georg has agreed for me to lead 3.3 to its well-deserved retirement. To that end, I would like to schedule its next, and hopefully final, security-fix release to coincide with the already announced 3.4.7 security-fix release. In particular, we'll plan to tag and release 3.3.7rc1 on Monday 2017-07-24 (UTC) and tag and release 3.3.7 final on Monday 2017-08-07. In the coming days, I'll be reviewing the outstanding 3.3 security issues and merging appropriate 3.3 PRs. 
Some of them have been sitting as patches for a long time so, if you have any such security issues that you think belong in 3.3, it would be very helpful if you would review such patches and turn them into 3.3 PRs. As a reminder, here are the guidelines from the devguide as to what is appropriate for a security-fix only branch: "The only changes made to a security branch are those fixing issues exploitable by attackers such as crashes, privilege escalation and, optionally, other issues such as denial of service attacks. Any other changes are not considered a security risk and thus not backported to a security branch. You should also consider fixing hard-failing tests in open security branches since it is important to be able to run the tests successfully before releasing." Note that documentation changes, other than any that might be related to a security fix, are also out of scope. Assuming no new security issues arise prior to the EOL date, 3.3.7 will likely be the final release of 3.3. And you really shouldn't be using 3.3 at all at this point; while downstream distributors are, of course, free to provide support of 3.3 to their customers, in a little over two months when EOL is reached python-dev will no longer accept any issues or make any changes available for 3.3. If you are still using 3.3, you really owe it to your applications, to your users, and to yourself to upgrade to a more recent release of Python 3, preferably 3.6! Many, many fixes, new features, and substantial performance improvements await you. https://www.python.org/dev/peps/pep-0398/ https://devguide.python.org/devcycle/#security-branches -- Ned Deily nad at python.org -- [] From francismb at email.de Sun Jul 16 11:30:43 2017 From: francismb at email.de (francismb) Date: Sun, 16 Jul 2017 17:30:43 +0200 Subject: [Python-Dev] Articles on my contributions to CPython during 2017 Q1 and Q2 In-Reply-To: References: Message-ID: <7f04b678-fe0d-2dab-f86f-7aaabbd5069e@email.de> Hi Victor, On 07/13/2017 05:33 PM, Victor Stinner wrote: > Hi, > > I wrote a serie of new articles on my contributions to CPython during > 2017 Q1 and Q2. > > "My contributions to CPython during 2017 Q1" > https://haypo.github.io/contrib-cpython-2017q1.html > > "New Python test.bisect tool" > https://haypo.github.io/python-test-bisect.html would it make sense too add a test that passes if the tool doesn't find anything ? if it fails so should the reason be already there (?) (Or to add a test step after running the test suite if some error happened (?)) Thanks ! -- francis From victor.stinner at gmail.com Sun Jul 16 12:19:10 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Sun, 16 Jul 2017 18:19:10 +0200 Subject: [Python-Dev] Articles on my contributions to CPython during 2017 Q1 and Q2 In-Reply-To: <7f04b678-fe0d-2dab-f86f-7aaabbd5069e@email.de> References: <7f04b678-fe0d-2dab-f86f-7aaabbd5069e@email.de> Message-ID: I'm not sure that I understood your suggestion. Basically, if a test file fails, you would like to automatically re-run the failing test with test.bisect to identify the failing *methods*? Yeah, it's doable, but I didn't write it :-) It's very easy to run bisect: just replace "-m test" with "-m test.bisect" in your command line, and you are done. Victor 2017-07-16 17:30 GMT+02:00 francismb : > Hi Victor, > > On 07/13/2017 05:33 PM, Victor Stinner wrote: >> Hi, >> >> I wrote a serie of new articles on my contributions to CPython during >> 2017 Q1 and Q2. 
>> >> "My contributions to CPython during 2017 Q1" >> https://haypo.github.io/contrib-cpython-2017q1.html >> >> "New Python test.bisect tool" >> https://haypo.github.io/python-test-bisect.html > would it make sense too add a test that passes if the tool doesn't find > anything ? if it fails so should the reason be already there (?) > (Or to add a test step after running the test suite if some error > happened (?)) > > Thanks ! > -- francis > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/victor.stinner%40gmail.com From songofacandy at gmail.com Sun Jul 16 12:42:38 2017 From: songofacandy at gmail.com (INADA Naoki) Date: Mon, 17 Jul 2017 01:42:38 +0900 Subject: [Python-Dev] deque implementation question In-Reply-To: References: Message-ID: I found the answer in _collectionsmodule.c /* Data for deque objects is stored in a doubly-linked list of fixed * length blocks. This assures that appends or pops never move any * other data elements besides the one being appended or popped. * * Another advantage is that it completely avoids use of realloc(), * resulting in more predictable performance. Regards, INADA Naoki On Sat, Jul 15, 2017 at 4:01 PM, Max Moroz wrote: > What would be the disadvantage of implementing collections.deque as a > circular array (rather than a doubly linked list of blocks)? My naive > thinking was that a circular array would maintain the current O(1) > append/pop from either side, and would improve index lookup in the middle > from O(n) to O(1). What am I missing? > > The insertion/removal of an arbitrary item specified by a pointer would > increase from constant time to linear, but since we don't have pointers this > is a moot point. > > Of course when the circular array is full, it will need to be reallocated, > but the amortized cost of that is still O(1). (Moreover, for a bounded > deque, there's even an option of preallocation, which would completely > eliminate reallocations.) > > Thanks > > Max > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/songofacandy%40gmail.com > From brett at python.org Sun Jul 16 17:15:00 2017 From: brett at python.org (Brett Cannon) Date: Sun, 16 Jul 2017 21:15:00 +0000 Subject: [Python-Dev] [python-committers] Python 3.3.7 release schedule and end-of-life In-Reply-To: <32CB89BE-A49E-4888-B8C7-4A8CAB8F15F1@python.org> References: <32CB89BE-A49E-4888-B8C7-4A8CAB8F15F1@python.org> Message-ID: A quick thanks from me, Ned, for stepping forward to help 3.3 pine for the fjords. On Sat, Jul 15, 2017, 14:51 Ned Deily, wrote: > Python 3.3 is fast approaching its end-of-life date, 2017-09-29. Per our > release policy, that date is five years after the initial release of 3.3, > 3.3.0 final on 2012-09-29. Note that 3.3 has been in security-fix only > mode since the 2014-03-08 release of 3.3.5. It has been a while since we > produced a 3.3.x security-fix release and, due to his commitments > elsewhere, Georg has agreed for me to lead 3.3 to its well-deserved > retirement. > > To that end, I would like to schedule its next, and hopefully final, > security-fix release to coincide with the already announced 3.4.7 > security-fix release. 
In particular, we'll plan to tag and release 3.3.7rc1 > on Monday 2017-07-24 (UTC) and tag and release 3.3.7 final on Monday > 2017-08-07. In the coming days, I'll be reviewing the outstanding 3.3 > security issues and merging appropriate 3.3 PRs. Some of them have been > sitting as patches for a long time so, if you have any such security issues > that you think belong in 3.3, it would be very helpful if you would review > such patches and turn them into 3.3 PRs. > > As a reminder, here are the guidelines from the devguide as to what is > appropriate for a security-fix only branch: > > "The only changes made to a security branch are those fixing issues > exploitable by attackers such as crashes, privilege escalation and, > optionally, other issues such as denial of service attacks. Any other > changes are not considered a security risk and thus not backported to a > security branch. You should also consider fixing hard-failing tests in open > security branches since it is important to be able to run the tests > successfully before releasing." > > Note that documentation changes, other than any that might be related to a > security fix, are also out of scope. > > Assuming no new security issues arise prior to the EOL date, 3.3.7 will > likely be the final release of 3.3. And you really shouldn't be using 3.3 > at all at this point; while downstream distributors are, of course, free to > provide support of 3.3 to their customers, in a little over two months when > EOL is reached python-dev will no longer accept any issues or make any > changes available for 3.3. If you are still using 3.3, you really owe it > to your applications, to your users, and to yourself to upgrade to a more > recent release of Python 3, preferably 3.6! Many, many fixes, new > features, and substantial performance improvements await you. > > https://www.python.org/dev/peps/pep-0398/ > https://devguide.python.org/devcycle/#security-branches > > -- > Ned Deily > nad at python.org -- [] > > _______________________________________________ > python-committers mailing list > python-committers at python.org > https://mail.python.org/mailman/listinfo/python-committers > Code of Conduct: https://www.python.org/psf/codeofconduct/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tim.peters at gmail.com Sun Jul 16 20:09:39 2017 From: tim.peters at gmail.com (Tim Peters) Date: Sun, 16 Jul 2017 19:09:39 -0500 Subject: [Python-Dev] deque implementation question In-Reply-To: References: Message-ID: [Max Moroz ] > What would be the disadvantage of implementing collections.deque as a > circular array (rather than a doubly linked list of blocks)? ... You answered it yourself ;-) > ... > Of course when the circular array is full, it will need to be reallocated, > but the amortized cost of that is still O(1). Bingo. The primary point of a deque is to make popping and pushing at both ends efficient. That's what the current implementation does: worst-case constant time per push or pop regardless of how many items are in the deque. That beats "amortized O(1)" in the small and in the large. That's why it was done this way. Some other deque methods are consequently slower than they are for lists, but who cares? For example, the only indices I've ever used with a deque are 0 and -1 (to peek at one end or the other of a deque), and the implementation makes accessing those specific indices constant-time too. 
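[Editorial note: Tim's point about the deque trade-off can be made concrete with a small timing sketch. This is an illustration added here, not code from the thread; the element count N is arbitrary.]

# deque is O(1) worst-case at both ends, while list pays O(n) to insert at
# the left; conversely, list indexing is O(1) anywhere, while deque indexing
# away from the ends is O(n) because it walks the linked blocks.
from collections import deque
from timeit import timeit

N = 10000

# Pushing on the left end.
print(timeit("d.appendleft(0)",
             setup="from collections import deque; d = deque()", number=N))
print(timeit("l.insert(0, 0)", setup="l = []", number=N))  # shifts every element

d = deque(range(N))
l = list(range(N))

# Indexing: the ends are cheap for deque, the middle is not.
print(timeit("d[0]; d[-1]", globals=globals(), number=N))
print(timeit("d[N // 2]", globals=globals(), number=N))   # middle of a deque: O(n)
print(timeit("l[N // 2]", globals=globals(), number=N))   # middle of a list: O(1)

On a typical build the appendleft and end-indexing timings stay roughly flat as N grows, while list.insert(0, ...) and deque middle indexing grow roughly linearly, which is the behavior Tim describes.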
From aleksejs at mit.edu Sun Jul 16 22:37:47 2017
From: aleksejs at mit.edu (Aleksejs Popovs)
Date: Sun, 16 Jul 2017 22:37:47 -0400
Subject: [Python-Dev] curses: error handling and the lower right margin of the screen
Message-ID: <6d09e791-6e1d-e0b9-f06e-d8809b65a01e@mit.edu>

Hello everyone,

My name is Aleksejs, and this is my first time posting here. I'm working
on a Python project (a client for Zephyr, MIT's instant messaging system)
that uses curses, and I believe I've found either a bug in the Python
curses bindings or a deficiency in their documentation.

The problem has to do with ncurses' cursor advancing behavior. The manpage
addch(3x) says:

> If the advance is at the right margin:
> * The cursor automatically wraps to the beginning of the next line.
> * At the bottom of the current scrolling region, and if scrollok is
>   enabled, the scrolling region is scrolled up one line.
> * If scrollok is not enabled, writing a character at the lower right
>   margin succeeds. However, an error is returned because it is not
>   possible to wrap to a new line

Python's window.addch(y, x, ch[, attr]) function seems to be internally
calling addch or one of the related functions, and so, if scrollok is off
and (y, x) is the lower right corner of the screen, addch returns an
error, which window.addch then detects and turns into an exception.

I'd like to argue that this is not expected behavior, and does not follow
from the documentation. The documentation for window.addch makes no
mention of the fact that the cursor is advanced at all, so there's no
reason that a user should expect window.addch(height - 1, width - 1, ch)
to fail (and the exception raised, "_curses.error: add_wch() returned
ERR", is not very helpful in understanding what the deal is). Because the
documentation doesn't say anything about the cursor, this is not even an
error in any meaningful way, as the character is successfully written to
the screen, and wrapping the code in a "try: ... except curses.error:
pass" block "fixes" the error.

The same problem affects window.addstr() if the end of the string being
painted ends up in the lower right corner.

This behavior is so unintuitive that it is not even accounted for
elsewhere in the curses bindings: the implementation of
curses.textpad.rectangle contains a line [1]

> win.addch(lry, lrx, curses.ACS_LRCORNER)

which raises an exception when trying to draw a rectangle spanning an
entire window (or in fact any rectangle touching the lower right corner).
I also wrote a little example script [2] demonstrating the problem.

I believe that this is a problem, but I am not sure how it could be
resolved. It seems that there's no way to distinguish this "error" from a
legitimate ncurses error. Perhaps the documentation ought to at least
mention this behavior, and the implementation of curses.textpad.rectangle
should check for the case where the rectangle touches the lower right
corner and use a try-except block there.

Best regards,
Aleksejs Popovs

[1] https://github.com/python/cpython/blob/3.6/Lib/curses/textpad.py#L16
[2] https://gist.github.com/popoffka/e21299967f5739d18c4fa393fa5cf20b

From nad at python.org Mon Jul 17 01:50:33 2017
From: nad at python.org (Ned Deily)
Date: Mon, 17 Jul 2017 01:50:33 -0400
Subject: [Python-Dev] [RELEASE] Python 3.6.2 is now available
Message-ID:

On behalf of the Python development community and the Python 3.6 release
team, I am happy to announce the availability of Python 3.6.2, the second
maintenance release of Python 3.6.
3.6.0 was released on 2016-12-22 to great interest and we are now providing the second set of bugfixes and documentation updates for it; the first maintenance release, 3.6.1, was released on 2017-03-31. Detailed information about the changes made in 3.6.2 can be found in the change log here: https://docs.python.org/3.6/whatsnew/changelog.html#python-3-6-2 Please see "What?s New In Python 3.6" for more information about the new features in Python 3.6: https://docs.python.org/3.6/whatsnew/3.6.html You can download Python 3.6.2 here: https://www.python.org/downloads/release/python-362/ The next maintenance release of Python 3.6 is expected to follow in about 3 months, around the end of 2017-09. More information about the 3.6 release schedule can be found here: https://www.python.org/dev/peps/pep-0494/ Enjoy! P.S. If you need to download the documentation set for 3.6.2 immediately, you can always find the release version here: https://docs.python.org/release/3.6.2/download.html The most current updated versions will appear here: https://docs.python.org/3.6/ -- Ned Deily nad at python.org -- [] From solipsis at pitrou.net Mon Jul 17 08:43:19 2017 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 17 Jul 2017 14:43:19 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time Message-ID: <20170717144319.7fdbf64b@fsol> Hello, Cost of creating a namedtuple has been identified as a contributor to Python startup time. Not only Python core and the stdlib, but any third-party library creating namedtuple classes (there are many of them). An issue was created for this: https://bugs.python.org/issue28638 Raymond decided to close the issue because: 1) the proposed resolution makes the "_source" attribute empty (or, at least, something else than it currently is). Raymond claims the "_source" attribute is an essential feature of namedtuples. 2) optimizing startup cost is supposedly not worth the effort. To this, I will counter-argument: As for 1), a search for "namedtuple" and "_source" in a code search engine (*) brings *only* false positives of different kinds: * clones of the CPython repo * copies of the namedtuple class instantiation source code with slight tweaks (*not* reading the _source attribute of an existing namedtuple) * modules using namedtuples and also using a "_source" attribute on unrelated objects (*) https://searchcode.com/?q=namedtuple+_source As for 2), startup time is actually a very important consideration nowadays, both for small scripts *and* for interactive use with the now very wide-spread use of Jupyter Notebooks. A 1 ms. cost when importing a single module can translate into a large slowdown when your library imports (directly or indirectly) hundreds of modules, many of which may create their own namedtuple classes. Nick pointed out that one alternative is to make the C-written "struct sequence" class user-visible. My opinion is that, while better than nothing, this would complicate things by exposing two very similar primitives in the stdlib, without there being a clear choice for users. Should I use the well-known namedtuple? Should I use the new-ish "struct sequence", with similar characteristics and better performance, but worse compatibility (now I have to write fallback code for Python versions where the "struct sequence" isn't exposed)? And not to mention all third-party libraries must be migrated to the newly-exposed "struct sequence" + compatibility fallback code... 
So my take is: 1) Usage of "_source" in open source code (as per the search above) seems non-existent. 2) If the primary intent of "_source" is to show-case how to write a tuple subclass, well, why not write a recipe or tutorial somewhere? The Python stdlib is generally not a place where we reify tutorials or educational snippets as public APIs. 3) The well-known namedtuple would really benefit from a performance boost, without asking all maintainers of dependent code (that's a *ton*) to migrate to a new idiom + compatibility fallback. Regards Antoine. From solipsis at pitrou.net Mon Jul 17 08:53:12 2017 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 17 Jul 2017 14:53:12 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time References: <20170717144319.7fdbf64b@fsol> Message-ID: <20170717145312.455c4a28@fsol> On Mon, 17 Jul 2017 14:43:19 +0200 Antoine Pitrou wrote: > Hello, > > Cost of creating a namedtuple has been identified as a contributor to > Python startup time. Imprecise wording: that's the cost of creating a namedtuple *class*, i.e. anytime someone writes `MyClass = namedtuple('MyClass', ...)`. Regards Antoine. From levkivskyi at gmail.com Mon Jul 17 09:03:26 2017 From: levkivskyi at gmail.com (Ivan Levkivskyi) Date: Mon, 17 Jul 2017 15:03:26 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <20170717145312.455c4a28@fsol> References: <20170717144319.7fdbf64b@fsol> <20170717145312.455c4a28@fsol> Message-ID: Interesting coincidence, just two days ago I have heard that a team at one large company completely abandoned namedtuple because of the creation time problem. Concerning _source, why it is not possible to make it a property so that all the string formatting will happen on request, thus saving some time for users who doesn't need it? (Of course this will not be an actual source, but it can be made practically equivalent to the no-compile version.) -- Ivan On 17 July 2017 at 14:53, Antoine Pitrou wrote: > On Mon, 17 Jul 2017 14:43:19 +0200 > Antoine Pitrou wrote: > > Hello, > > > > Cost of creating a namedtuple has been identified as a contributor to > > Python startup time. > > Imprecise wording: that's the cost of creating a namedtuple *class*, > i.e. anytime someone writes `MyClass = namedtuple('MyClass', ...)`. > > Regards > > Antoine. > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > levkivskyi%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From isaac.morland at gmail.com Mon Jul 17 09:26:21 2017 From: isaac.morland at gmail.com (Isaac Morland) Date: Mon, 17 Jul 2017 09:26:21 -0400 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <20170717144319.7fdbf64b@fsol> References: <20170717144319.7fdbf64b@fsol> Message-ID: On 17 July 2017 at 08:43, Antoine Pitrou wrote: > > Hello, > > Cost of creating a namedtuple has been identified as a contributor to > Python startup time. Not only Python core and the stdlib, but any > third-party library creating namedtuple classes (there are many of > them). An issue was created for this: > https://bugs.python.org/issue28638 > > Raymond decided to close the issue because: > > 1) the proposed resolution makes the "_source" attribute empty (or, at > least, something else than it currently is). 
Raymond claims the > "_source" attribute is an essential feature of namedtuples. > I think I understand well enough to say something intelligent? While actual references to _source are likely rare (certainly I?ve never used it), my understanding is that the way namedtuple works is to construct _source, and then exec it to create the class. Once that is done, there is no significant saving to be had by throwing away the constructed _source value. When namedtuple was being considered for inclusion, I actually went so far as to write a proof-of-concept version that worked by creating a class, creating attributes on it, etc. I don?t remember how far I got but the exec version is the version included in the stdlib. I come from a non-Pythonic background so use of exec still feels a bit weird to me but I absolutely love namedtuple and use it constantly. I don't know whether a polished and completed version of my idea could be faster than using exec, but I wouldn't expect a major saving ? a whole bunch of code has to run either way. -------------- next part -------------- An HTML attachment was scrubbed... URL: From solipsis at pitrou.net Mon Jul 17 09:11:51 2017 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 17 Jul 2017 15:11:51 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <20170717145312.455c4a28@fsol> Message-ID: <20170717151151.66465bdb@fsol> On Mon, 17 Jul 2017 15:03:26 +0200 Ivan Levkivskyi wrote: > Interesting coincidence, just two days ago I have heard that a team at one > large company completely abandoned namedtuple because of the creation time > problem. > > Concerning _source, why it is not possible to make it a property so that > all the string formatting will happen on request, thus saving some time for > users who doesn't need it? It was proposed in https://bugs.python.org/issue19640 but rejected. Regards Antoine. From antoine at python.org Mon Jul 17 09:31:35 2017 From: antoine at python.org (Antoine Pitrou) Date: Mon, 17 Jul 2017 15:31:35 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> Message-ID: <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Le 17/07/2017 ? 15:26, Isaac Morland a ?crit : > > I think I understand well enough to say something intelligent? > > While actual references to _source are likely rare (certainly I?ve never > used it), my understanding is that the way namedtuple works is to > construct _source, and then exec it to create the class. Once that is > done, there is no significant saving to be had by throwing away the > constructed _source value. The proposed resolution on https://bugs.python.org/issue28638 is to avoid exec() on most parts of the namedtuple class, hence speeding up the class creation. > I come from > a non-Pythonic background so use of exec still feels a bit weird to me > but I absolutely love namedtuple and use it constantly. I think for most Python programmers, it still feels a bit un-Pythonic. While exec() is part of Python, it's generally only used in fringe cases where nothing else works. Regards Antoine. 
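P.S. To make the idea more concrete, here is a deliberately simplified sketch
of the "only exec() what really needs it" approach. It is *not* the actual
patch on the issue (the name namedtuple_sketch is purely illustrative), and it
skips _make, _replace, pickling support, the zero-field case and most input
validation:

    from operator import itemgetter
    from keyword import iskeyword

    def namedtuple_sketch(typename, field_names):
        fields = tuple(field_names.replace(',', ' ').split())
        for name in (typename,) + fields:
            if not name.isidentifier() or iskeyword(name):
                raise ValueError('invalid name: %r' % name)

        # Only __new__ is generated and exec'ed, because its signature
        # must embed the field names.
        args = ', '.join(fields)
        ns = {}
        exec('def __new__(cls, %s):\n'
             '    return tuple.__new__(cls, (%s,))\n' % (args, args), ns)

        # Everything else is built with ordinary Python, no exec() needed.
        body = {
            '__slots__': (),
            '_fields': fields,
            '__new__': ns['__new__'],
            '__repr__': lambda self: '%s(%s)' % (
                typename,
                ', '.join('%s=%r' % pair for pair in zip(fields, self))),
            '_asdict': lambda self: dict(zip(fields, self)),
        }
        for index, name in enumerate(fields):
            body[name] = property(itemgetter(index),
                                  doc='Alias for field number %d' % index)
        return type(typename, (tuple,), body)

    # Point = namedtuple_sketch('Point', 'x y')
    # Point(1, 2).x   -> 1
    # Point(1, 2)     -> Point(x=1, y=2)

The _source string could then become a lazily computed property on the class
(the idea from issue 19640), built only when someone actually asks for it.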
From facundobatista at gmail.com Mon Jul 17 10:56:56 2017 From: facundobatista at gmail.com (Facundo Batista) Date: Mon, 17 Jul 2017 11:56:56 -0300 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <20170717144319.7fdbf64b@fsol> References: <20170717144319.7fdbf64b@fsol> Message-ID: On Mon, Jul 17, 2017 at 9:43 AM, Antoine Pitrou wrote: > As for 2), startup time is actually a very important consideration > nowadays, both for small scripts *and* for interactive use with the > now very wide-spread use of Jupyter Notebooks. A 1 ms. cost when > importing a single module can translate into a large slowdown when your > library imports (directly or indirectly) hundreds of modules, many of > which may create their own namedtuple classes. My experience inside Canonical is that golang stole a lot of "codebase share" from Python, and (others and mine) talks hit two walls, mainly: one is memory consumption, and the other is startup time. So yes, startup time is important for user-faced scripts and services. Regards, -- . Facundo Blog: http://www.taniquetil.com.ar/plog/ PyAr: http://www.python.org/ar/ Twitter: @facundobatista From raymond.hettinger at gmail.com Mon Jul 17 10:59:51 2017 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Mon, 17 Jul 2017 07:59:51 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: > On Jul 17, 2017, at 6:31 AM, Antoine Pitrou wrote: > >> I think I understand well enough to say something intelligent? >> >> While actual references to _source are likely rare (certainly I?ve never >> used it), my understanding is that the way namedtuple works is to >> construct _source, and then exec it to create the class. Once that is >> done, there is no significant saving to be had by throwing away the >> constructed _source value. There are considerable benefits to namedtuple being able to generate and match its own source. * It makes it is really easy for a user to generate the code, drop it into another another module, and customize it. * It makes the named tuple factory function completely self-documenting. * The verbose/_source option teaches you exactly what named tuple does. That makes the tool relatively easy to learn, understand, and debug. I really don't want to throw away these benefits to save a couple of milliseconds. As Nick Coghlan recently posted, "Speed isn't everything, and it certainly isn't adequate justification for breaking public APIs that have been around for years." FWIW, the template/exec implementation has had excellent benefits for maintainability making it very easy to fix and update. As other parts of Python have changed (limitations on number of arguments, what is allowed as an identifier, etc), it mostly automatically stays in sync with the rest of the language. ISTM this issue is being pressed by micro-optimizers who are being very aggressive and not responding to actual user needs (it is more an invented issue than a real one). Named tuple has been around for a long time and users have been somewhat happy with it. If someone truly cares about the exec time for a particular named tuple, the _source option makes it trivially easy to just replace the generator call with the expanded code in that particular circumstance. Raymond P.S. 
I'm fully supportive of Victor's efforts to build-out structseq to make it sufficiently expressive to do more of what collections.namedtuple() does. That is a perfectly reasonable path to optimization. We've wanted that for a long time and no one has had the spare clock cycles to make it come true. From barry at python.org Mon Jul 17 11:13:51 2017 From: barry at python.org (Barry Warsaw) Date: Mon, 17 Jul 2017 11:13:51 -0400 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: On Jul 17, 2017, at 10:59, Raymond Hettinger wrote: > > ISTM this issue is being pressed by micro-optimizers who are being very aggressive and not responding to actual user needs (it is more an invented issue than a real one). Named tuple has been around for a long time and users have been somewhat happy with it. Regardless of whether this particular optimization is a good idea or not, start up time *is* a serious challenge in many environments for CPython in particular and the perception of Python?s applicability to many problems. I think we?re better off trying to identify and address such problems than ignoring or minimizing them. Cheers, -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: Message signed with OpenPGP URL: From victor.stinner at gmail.com Mon Jul 17 11:14:07 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 17 Jul 2017 17:14:07 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> Message-ID: 2017-07-17 16:56 GMT+02:00 Facundo Batista : > My experience inside Canonical is that golang stole a lot of "codebase > share" from Python, and (others and mine) talks hit two walls, mainly: > one is memory consumption, and the other is startup time. > > So yes, startup time is important for user-faced scripts and services. Removing the _source attribute would allow to: (1) Reduce the memory consumption http://bugs.python.org/issue19640#msg213949 (2) Pyhon startup up time https://bugs.python.org/issue28638#msg280277 Victor From christian at python.org Mon Jul 17 11:19:46 2017 From: christian at python.org (Christian Heimes) Date: Mon, 17 Jul 2017 17:19:46 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <20170717144319.7fdbf64b@fsol> References: <20170717144319.7fdbf64b@fsol> Message-ID: On 2017-07-17 14:43, Antoine Pitrou wrote: > So my take is: > > 1) Usage of "_source" in open source code (as per the search above) > seems non-existent. > > 2) If the primary intent of "_source" is to show-case how to write a > tuple subclass, well, why not write a recipe or tutorial somewhere? > The Python stdlib is generally not a place where we reify tutorials or > educational snippets as public APIs. > > 3) The well-known namedtuple would really benefit from a performance > boost, without asking all maintainers of dependent code (that's a > *ton*) to migrate to a new idiom + compatibility fallback. I have an additional take on named tuples 4) The current approach uses exec() to generate the namedtuple class on the fly. The exec() function isn't necessarily evil and the use of exec() in namedtuple is safe. However I would appreciate if Python interpreter could be started without requiring the exec() function. 
It would make it easier to harden the interpreter for embedding and system
integration use cases. It's not about sandboxing Python. My goal is to make
it harder to abuse Python. See Steve's lightning talk "Python as a security
vulnerability" at the language summit, https://lwn.net/Articles/723823/ .

Christian

From steve at holdenweb.com  Mon Jul 17 11:22:38 2017
From: steve at holdenweb.com (Steve Holden)
Date: Mon, 17 Jul 2017 16:22:38 +0100
Subject: [Python-Dev] Impact of Namedtuple on startup time
In-Reply-To:
References: <20170717144319.7fdbf64b@fsol>
	<4d161c7b-87cf-7d78-6967-07be1c584591@python.org>
Message-ID:

On Mon, Jul 17, 2017 at 3:59 PM, Raymond Hettinger <
raymond.hettinger at gmail.com> wrote:

> I really don't want to throw away these benefits to save a couple of
> milliseconds.  As Nick Coghlan recently posted, "Speed isn't everything,
> and it certainly isn't adequate justification for breaking public APIs that
> have been around for years."

My only question is "what's a variable called _source doing in the public
API?"

regards
Steve

Steve Holden
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From raymond.hettinger at gmail.com  Mon Jul 17 11:29:18 2017
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Mon, 17 Jul 2017 08:29:18 -0700
Subject: [Python-Dev] Impact of Namedtuple on startup time
In-Reply-To:
References: <20170717144319.7fdbf64b@fsol>
	<4d161c7b-87cf-7d78-6967-07be1c584591@python.org>
Message-ID: <34A86EF5-8A12-40E7-881D-1E2C5FFE7B18@gmail.com>

> On Jul 17, 2017, at 8:22 AM, Steve Holden wrote:
>
> My only question is "what's a variable called _source doing in the public API?"

The convention for named tuple has been for all the methods and attributes
to be prefixed with an underscore so that the names won't conflict with
field names in the named tuple itself. For example, we want to allow
Path=namedtuple('Path', ['source', 'destination']).

If I had it all to do over again, it might have been better to have had a
different convention like source_ with a trailing underscore, but that ship
sailed long ago :-)


Raymond

From steve at holdenweb.com  Mon Jul 17 11:34:50 2017
From: steve at holdenweb.com (Steve Holden)
Date: Mon, 17 Jul 2017 16:34:50 +0100
Subject: [Python-Dev] Impact of Namedtuple on startup time
In-Reply-To: <34A86EF5-8A12-40E7-881D-1E2C5FFE7B18@gmail.com>
References: <20170717144319.7fdbf64b@fsol>
	<4d161c7b-87cf-7d78-6967-07be1c584591@python.org>
	<34A86EF5-8A12-40E7-881D-1E2C5FFE7B18@gmail.com>
Message-ID:

Makes sense. Thanks.

S

Steve Holden

On Mon, Jul 17, 2017 at 4:29 PM, Raymond Hettinger <
raymond.hettinger at gmail.com> wrote:

>
> > On Jul 17, 2017, at 8:22 AM, Steve Holden wrote:
> >
> > My only question is "what's a variable called _source doing in the
> public API?"
>
> The convention for named tuple has been for all the methods and
> attributes to be prefixed with an underscore so that the names won't
> conflict with field names in the named tuple itself.  For example, we want
> to allow Path=namedtuple('Path', ['source', 'destination']).
>
> If I had it all to do over again, it might have been better to have had a
> different convention like source_ with a trailing underscore, but that ship
> sailed long ago :-)
>
>
> Raymond
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From guido at python.org Mon Jul 17 11:49:13 2017 From: guido at python.org (Guido van Rossum) Date: Mon, 17 Jul 2017 08:49:13 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: I am firmly with Antoine here. The cumulative startup time of large Python programs is a serious problem and namedtuple is one of the major contributors -- especially because it is so convenient that it is ubiquitous. The approach of generating source code and exec()ing it, is a cool demonstration of Python's expressive power, but it's always been my sense that whenever we encounter a popular idiom that uses exec() and eval(), we should augment the language (or the builtins) to avoid these calls -- that's for example how we ended up with getattr(). One of the reasons to be wary of exec()/eval() other than the usual security concerns is that in some Python implementations they have a high overhead to initialize the parser and compiler. (Even in CPython it's not that fast.) Regarding the argument that it's easier to learn what namedtuple does if the generated source is available, while I don't feel this is important, supposedly it is important to Raymond. But surely there are other approaches possible that work just as well in an educational setting while being more efficient in production use. (E.g. the approach taken by itertools, where the docs show equivalent Python code.) Concluding, I think we should move on from the original implementation and optimize the heck out of namedtuple. The original has served us well. The world is constantly changing. Python should adapt to the (happy) fact that it's being used for systems larger than any of us could imagine 15 years ago. --Guido On Mon, Jul 17, 2017 at 7:59 AM, Raymond Hettinger < raymond.hettinger at gmail.com> wrote: > > > On Jul 17, 2017, at 6:31 AM, Antoine Pitrou wrote: > > > >> I think I understand well enough to say something intelligent? > >> > >> While actual references to _source are likely rare (certainly I?ve never > >> used it), my understanding is that the way namedtuple works is to > >> construct _source, and then exec it to create the class. Once that is > >> done, there is no significant saving to be had by throwing away the > >> constructed _source value. > > There are considerable benefits to namedtuple being able to generate and > match its own source. > > * It makes it is really easy for a user to generate the code, drop it into > another another module, and customize it. > > * It makes the named tuple factory function completely self-documenting. > > * The verbose/_source option teaches you exactly what named tuple does. > That makes the tool relatively easy to learn, understand, and debug. > > I really don't want to throw away these benefits to save a couple of > milliseconds. As Nick Coghlan recently posted, "Speed isn't everything, > and it certainly isn't adequate justification for breaking public APIs that > have been around for years." > > FWIW, the template/exec implementation has had excellent benefits for > maintainability making it very easy to fix and update. As other parts of > Python have changed (limitations on number of arguments, what is allowed as > an identifier, etc), it mostly automatically stays in sync with the rest of > the language. 
> > ISTM this issue is being pressed by micro-optimizers who are being very > aggressive and not responding to actual user needs (it is more an invented > issue than a real one). Named tuple has been around for a long time and > users have been somewhat happy with it. > > If someone truly cares about the exec time for a particular named tuple, > the _source option makes it trivially easy to just replace the generator > call with the expanded code in that particular circumstance. > > > Raymond > > > P.S. I'm fully supportive of Victor's efforts to build-out structseq to > make it sufficiently expressive to do more of what collections.namedtuple() > does. That is a perfectly reasonable path to optimization. We've wanted > that for a long time and no one has had the spare clock cycles to make it > come true. > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > guido%40python.org > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg at krypto.org Mon Jul 17 12:13:39 2017 From: greg at krypto.org (Gregory P. Smith) Date: Mon, 17 Jul 2017 16:13:39 +0000 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: On Mon, Jul 17, 2017 at 8:00 AM Raymond Hettinger < raymond.hettinger at gmail.com> wrote: > > > On Jul 17, 2017, at 6:31 AM, Antoine Pitrou wrote: > > > >> I think I understand well enough to say something intelligent? > >> > >> While actual references to _source are likely rare (certainly I?ve never > >> used it), my understanding is that the way namedtuple works is to > >> construct _source, and then exec it to create the class. Once that is > >> done, there is no significant saving to be had by throwing away the > >> constructed _source value. > > There are considerable benefits to namedtuple being able to generate and > match its own source. > > * It makes it is really easy for a user to generate the code, drop it into > another another module, and customize it. > > * It makes the named tuple factory function completely self-documenting. > > * The verbose/_source option teaches you exactly what named tuple does. > That makes the tool relatively easy to learn, understand, and debug. > > I really don't want to throw away these benefits to save a couple of > milliseconds. As Nick Coghlan recently posted, "Speed isn't everything, > and it certainly isn't adequate justification for breaking public APIs that > have been around for years." > > FWIW, the template/exec implementation has had excellent benefits for > maintainability making it very easy to fix and update. As other parts of > Python have changed (limitations on number of arguments, what is allowed as > an identifier, etc), it mostly automatically stays in sync with the rest of > the language. > > ISTM this issue is being pressed by micro-optimizers who are being very > aggressive and not responding to actual user needs (it is more an invented > issue than a real one). Named tuple has been around for a long time and > users have been somewhat happy with it. > Raymond, you keep repeating statements similar to "only a millisecond" and "aggressive micro-optimizers who don't care about user needs" in your comments on issues like this. That simply isn't true. 
These issues come up in the first place *because of* users who need fast startup. Please don't be so dismissive. The reason people care about this has been stated many times. It isn't just "a millisecond", it's 100s or 1000s of milliseconds in any application of reasonable size where namedtuples were adopted as a design pattern in various libraries. Real world use cases for startup time mattering exist: interactive command line tools are the most obvious one people keep citing. I'll toss another where Python startup time has raised eyebrows at work: unittest startup and completion time. When the bulk of a processes time is spent in startup before hitting unittest.main(), people take notice and consider it a problem. Developer productivity is reduced. The hacks individual developers come up with to try and workaround things like this are not pretty. If someone truly cares about the exec time for a particular named tuple, > the _source option makes it trivially easy to just replace the generator > call with the expanded code in that particular circumstance. > In real world applications you do not control the bulk of the code that has chosen to use namedtuple. They're scattered through 100-1000s of other transitive dependency libraries (not just the standard library), the modification of each of which faces hurdles both technical and non-technical in nature. To me the desired resolution to this is clear: Optimize the default use case of namedtuple and everybody wins. This isn't just about the stdlib's namedtuple uses being fast, those a small portion of all uses in any application where startup time matters. This is about making Python better for the world. ie: What Antoine's original write-up suggested in his #3. I get that namedtuple ._source is a public API. We may need to keep it. If so, that just means revisiting lazily generating it as a property - issue19640. -gps PS - Good call on the naming hindsight! A trailing underscore would've been nice. Oh well, too late for that. > > Raymond > > > P.S. I'm fully supportive of Victor's efforts to build-out structseq to > make it sufficiently expressive to do more of what collections.namedtuple() > does. That is a perfectly reasonable path to optimization. We've wanted > that for a long time and no one has had the spare clock cycles to make it > come true. > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/greg%40krypto.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From victor.stinner at gmail.com Mon Jul 17 12:25:23 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 17 Jul 2017 18:25:23 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: 2017-07-17 18:13 GMT+02:00 Gregory P. Smith : > I get that namedtuple ._source is a public API. We may need to keep it. If > so, that just means revisiting lazily generating it as a property - > issue19640. I agree. Technically speaking, optimizing namedtuple doesn't have to mean "remove the _source attribute". I wouldn't discuss here if _source should be kept or not, but even if we rewrite the namedtuple implementation, I agree that we *can* technically keep a _source property which would create the same Python code. 
It would allow it to speedup namedtuple, reduce the memory footprint, and have a smooth deprecation policy (*if* we decide to deprecate this attribute). Victor From steve at pearwood.info Mon Jul 17 12:45:20 2017 From: steve at pearwood.info (Steven D'Aprano) Date: Tue, 18 Jul 2017 02:45:20 +1000 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <20170717144319.7fdbf64b@fsol> References: <20170717144319.7fdbf64b@fsol> Message-ID: <20170717164519.GY3149@ando.pearwood.info> On Mon, Jul 17, 2017 at 02:43:19PM +0200, Antoine Pitrou wrote: > > Hello, > > Cost of creating a namedtuple has been identified as a contributor to > Python startup time. Not only Python core and the stdlib, but any > third-party library creating namedtuple classes (there are many of > them). An issue was created for this: > https://bugs.python.org/issue28638 Some time ago, I needed to backport a version of namedtuple to Python 2.4, so I started with Raymond's recipe on Activestate and modified it to only exec the code needed for __new__. The rest of the class is an ordinary inner class: # a short sketch def namedtuple(...): class Inner(tuple): ... exec(source, ns) Inner.__new__ = ns['__new__'] return Inner Here's my fork of Raymond's recipe: https://code.activestate.com/recipes/578918-yet-another-namedtuple/ Out of curiosity, I took that recipe, updated it to work in Python 3, and compared it to the std lib version. Here are some representative timings: [steve at ando ~]$ python3.5 -m timeit -s "from collections import namedtuple" "K = namedtuple('K', 'a b c')" 1000 loops, best of 3: 1.02 msec per loop [steve at ando ~]$ python3.5 -m timeit -s "from nt3 import namedtuple" "K = namedtuple('K', 'a b c')" 1000 loops, best of 3: 255 usec per loop I think that proves that this approach is viable and can lead to a big speed up. I don't think that merely dropping the _source attribute will save much time. It might save a bit of memory, but in my experiements dropping it only saves about 10?s more. I think the real bottleneck is the cost of exec'ing the entire class. -- Steve From jelle.zijlstra at gmail.com Mon Jul 17 13:04:39 2017 From: jelle.zijlstra at gmail.com (Jelle Zijlstra) Date: Mon, 17 Jul 2017 10:04:39 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <20170717164519.GY3149@ando.pearwood.info> References: <20170717144319.7fdbf64b@fsol> <20170717164519.GY3149@ando.pearwood.info> Message-ID: 2017-07-17 9:45 GMT-07:00 Steven D'Aprano : > On Mon, Jul 17, 2017 at 02:43:19PM +0200, Antoine Pitrou wrote: > > > > Hello, > > > > Cost of creating a namedtuple has been identified as a contributor to > > Python startup time. Not only Python core and the stdlib, but any > > third-party library creating namedtuple classes (there are many of > > them). An issue was created for this: > > https://bugs.python.org/issue28638 > > Some time ago, I needed to backport a version of namedtuple to Python > 2.4, so I started with Raymond's recipe on Activestate and modified it > to only exec the code needed for __new__. The rest of the class is an > ordinary inner class: > > # a short sketch > def namedtuple(...): > class Inner(tuple): > ... > exec(source, ns) > Inner.__new__ = ns['__new__'] > return Inner > > > Here's my fork of Raymond's recipe: > > https://code.activestate.com/recipes/578918-yet-another-namedtuple/ > > > Out of curiosity, I took that recipe, updated it to work in Python 3, > and compared it to the std lib version. 
Here are some representative > timings: > > [steve at ando ~]$ python3.5 -m timeit -s "from collections import > namedtuple" "K = namedtuple('K', 'a b c')" > 1000 loops, best of 3: 1.02 msec per loop > > [steve at ando ~]$ python3.5 -m timeit -s "from nt3 import namedtuple" "K = > namedtuple('K', 'a b c')" > 1000 loops, best of 3: 255 usec per loop > > > I think that proves that this approach is viable and can lead to a big > speed up. > > I have an open pull request implementing this approach: https://github.com/python/cpython/pull/2736. We can discuss the exact form the code should take there (Ivan already added some good suggestions). > I don't think that merely dropping the _source attribute will save much > time. It might save a bit of memory, but in my experiements dropping it > only saves about 10?s more. I think the real bottleneck is the cost of > exec'ing the entire class. > > > > -- > Steve > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > jelle.zijlstra%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From raymond.hettinger at gmail.com Mon Jul 17 15:42:45 2017 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Mon, 17 Jul 2017 12:42:45 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: <9E9AD48B-155B-4357-98D8-65C088E1F02F@gmail.com> > On Jul 17, 2017, at 8:49 AM, Guido van Rossum wrote: > > The approach of generating source code and exec()ing it, is a cool demonstration of Python's expressive power, but it's always been my sense that whenever we encounter a popular idiom that uses exec() and eval(), we should augment the language (or the builtins) to avoid these calls -- that's for example how we ended up with getattr(). FYI, the proposal (from Jelle) isn't to remove exec. It is to only exec a smaller piece of code and make the rest of it static. It isn't bad idea, it just complicates the implementation (generating _source lazily) and the subsequence maintenance (which is currently really easy). > Concluding, I think we should move on from the original implementation and optimize the heck out of namedtuple. The original has served us well. The world is constantly changing. Python should adapt to the (happy) fact that it's being used for systems larger than any of us could imagine 15 years ago. Okay, then Nick and I are overruled. I'll move Jelle's patch forward. We'll also need to lazily generate _source but I don't think that will be hard. One minor grumble: I think we need to give careful cost/benefit considerations to optimizations that complicate the implementation. Over the last several years, the source for Python has grown increasingly complicated. Fewer people understand it now. It is much harder to newcomers to on-ramp. The old-timers (myself included) find that their knowledge is out of date. And complexity leads to bugs (the C optimization of random number seeding caused a major bug in the 3.6.0 release; the C optimization of the lru_cache resulted in multiple releases having a hard to find threading bugs, etc.). 
It is becoming increasingly difficult to look at code and tell whether it
is correct (I still don't fully understand the implications of the
recursive constant folding in the peephole optimizer for example). In the
case of this named tuple proposal, the complexity is manageable, but the
overall trend isn't good and I get the feeling the aggressive optimization
is causing us to forget key parts of the zen-of-python.

Cheers,

Raymond

P.S. Ironically, a lot of my consulting work comes from people who have
created something complex out of something that could have been simple.
So, in a strange way, I should be happy about these trends -- just
saying ;-)

From raymond.hettinger at gmail.com  Mon Jul 17 16:27:47 2017
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Mon, 17 Jul 2017 13:27:47 -0700
Subject: [Python-Dev] Impact of Namedtuple on startup time
In-Reply-To:
References: <20170717144319.7fdbf64b@fsol>
	<4d161c7b-87cf-7d78-6967-07be1c584591@python.org>
Message-ID: <0DC5C68F-953D-493A-BD18-6D77D2C047CA@gmail.com>

> On Jul 17, 2017, at 8:49 AM, Guido van Rossum wrote:
>
> One of the reasons to be wary of exec()/eval() other than the usual
> security concerns is that in some Python implementations they have a high
> overhead to initialize the parser and compiler. (Even in CPython it's not
> that fast.)

BTW, if getting rid of the template/exec pair is a goal, Joe Jevnik
proposed a patch a couple of years ago that completely reimplemented
namedtuple() in C. The patch was somewhat complex and hard to verify for
semantic equivalence, but we could resurrect it and clean it up. That way,
we could leave the existing namedtuple() code in place and do a subsequent
import from the C version.

This path won't be fun (whenever we have both a C version and Python
version, we get years of trying to sync-up tiny differences); however, it
will give you the fastest startup times, the fastest lookups at runtime,
and eliminate use of exec.

> On Jul 17, 2017, at 8:13 AM, Barry Warsaw wrote:
> Regardless of whether this particular optimization is a good idea or not,
> start up time *is* a serious challenge in many environments for CPython in
> particular and the perception of Python's applicability to many problems.
> I think we're better off trying to identify and address such problems than
> ignoring or minimizing them.

I agree with that sentiment but think we ought to look at places where the
payoffs would actually matter, such as minimizing the number of disk
accesses (Python performs a lot of I/O on startup). Whenever I've addressed
start-up time for my clients, named tuples were never the issue. Also, it
would have been trivially easy to replace the factory function call with
the generated code, but that never proved necessary or beneficial.

IMO, we're about to turn the named tuple code into a mess but will find
that most users, most of the time will get nearly zero benefit.

Raymond

From g.rodola at gmail.com  Mon Jul 17 16:31:21 2017
From: g.rodola at gmail.com (Giampaolo Rodola')
Date: Mon, 17 Jul 2017 22:31:21 +0200
Subject: [Python-Dev] Impact of Namedtuple on startup time
In-Reply-To:
References: <20170717144319.7fdbf64b@fsol>
	<4d161c7b-87cf-7d78-6967-07be1c584591@python.org>
Message-ID:

I completely agree. I love namedtuples but I've never been too happy about
the additional overhead vs. plain tuples (both for creation and attribute
access times), to the point that I explicitly avoid to use them in certain
circumstances (e.g. a busy loop) and only for public end-user APIs
returning multiple values.
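Just to be concrete about the kind of overhead I mean, a quick
micro-benchmark sketch (only the shape of the comparison matters here, the
exact figures vary from machine to machine):

    import timeit
    from collections import namedtuple

    Point = namedtuple('Point', 'x y')
    g = globals()

    # creation: plain tuple vs. namedtuple
    print(timeit.timeit('(a, b)', setup='a = 1; b = 2', number=10**6))
    print(timeit.timeit('Point(a, b)', setup='a = 1; b = 2',
                        globals=g, number=10**6))

    # element access: indexing vs. named attribute
    print(timeit.timeit('t[0]', setup='t = (1, 2)', number=10**6))
    print(timeit.timeit('p.x', setup='p = Point(1, 2)',
                        globals=g, number=10**6))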
To be entirely honest, I'm not even sure why they need to be forcefully declared upfront in the first place, instead of just having a first-class function (builtin?) written in C: >>> ntuple(x=1, y=0) (x=1, y=0) ...or even a literal as in: >>> (x=1, y=0) (x=1, y=0) Most of the times this is what I really want: quickly returning an anonymous tuple with named attributes and nothing else, similarly to os.times() & others. I believe that if something like this would exist we would witness a big transition from tuple() to ntuple() for all those functions returning more than 1 value. We witnessed a similar transition in many parts of the stdlib when collections.namedtuple was first introduced, but not everywhere, probably because declaring a namedtuple is more work, it's more expensive, and it still feels like you're dealing with some kind of too high-level second-class citizen with too much overhead and too many sugar in terms of API (e.g. "verbose", "rename", "module" and "_source"). If something like this were to happen I expect collections.namedtuple to be used only by those who want to subclass it in order to attach methods, whereas the rest would stick and use ntuple() pretty much everywhere (both in "private" and "public" functions). On Mon, Jul 17, 2017 at 5:49 PM, Guido van Rossum wrote: > I am firmly with Antoine here. The cumulative startup time of large Python > programs is a serious problem and namedtuple is one of the major > contributors -- especially because it is so convenient that it is > ubiquitous. The approach of generating source code and exec()ing it, is a > cool demonstration of Python's expressive power, but it's always been my > sense that whenever we encounter a popular idiom that uses exec() and > eval(), we should augment the language (or the builtins) to avoid these > calls -- that's for example how we ended up with getattr(). > > One of the reasons to be wary of exec()/eval() other than the usual > security concerns is that in some Python implementations they have a high > overhead to initialize the parser and compiler. (Even in CPython it's not > that fast.) > > Regarding the argument that it's easier to learn what namedtuple does if > the generated source is available, while I don't feel this is important, > supposedly it is important to Raymond. But surely there are other > approaches possible that work just as well in an educational setting while > being more efficient in production use. (E.g. the approach taken by > itertools, where the docs show equivalent Python code.) > > Concluding, I think we should move on from the original implementation and > optimize the heck out of namedtuple. The original has served us well. The > world is constantly changing. Python should adapt to the (happy) fact that > it's being used for systems larger than any of us could imagine 15 years > ago. > > --Guido > > On Mon, Jul 17, 2017 at 7:59 AM, Raymond Hettinger < > raymond.hettinger at gmail.com> wrote: > >> >> > On Jul 17, 2017, at 6:31 AM, Antoine Pitrou wrote: >> > >> >> I think I understand well enough to say something intelligent? >> >> >> >> While actual references to _source are likely rare (certainly I?ve >> never >> >> used it), my understanding is that the way namedtuple works is to >> >> construct _source, and then exec it to create the class. Once that is >> >> done, there is no significant saving to be had by throwing away the >> >> constructed _source value. >> >> There are considerable benefits to namedtuple being able to generate and >> match its own source. 
>> >> * It makes it is really easy for a user to generate the code, drop it >> into another another module, and customize it. >> >> * It makes the named tuple factory function completely self-documenting. >> >> * The verbose/_source option teaches you exactly what named tuple does. >> That makes the tool relatively easy to learn, understand, and debug. >> >> I really don't want to throw away these benefits to save a couple of >> milliseconds. As Nick Coghlan recently posted, "Speed isn't everything, >> and it certainly isn't adequate justification for breaking public APIs that >> have been around for years." >> >> FWIW, the template/exec implementation has had excellent benefits for >> maintainability making it very easy to fix and update. As other parts of >> Python have changed (limitations on number of arguments, what is allowed as >> an identifier, etc), it mostly automatically stays in sync with the rest of >> the language. >> >> ISTM this issue is being pressed by micro-optimizers who are being very >> aggressive and not responding to actual user needs (it is more an invented >> issue than a real one). Named tuple has been around for a long time and >> users have been somewhat happy with it. >> >> If someone truly cares about the exec time for a particular named tuple, >> the _source option makes it trivially easy to just replace the generator >> call with the expanded code in that particular circumstance. >> >> >> Raymond >> >> >> P.S. I'm fully supportive of Victor's efforts to build-out structseq to >> make it sufficiently expressive to do more of what collections.namedtuple() >> does. That is a perfectly reasonable path to optimization. We've wanted >> that for a long time and no one has had the spare clock cycles to make it >> come true. >> >> >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: https://mail.python.org/mailman/options/python-dev/guido% >> 40python.org >> > > > > -- > --Guido van Rossum (python.org/~guido) > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/g. > rodola%40gmail.com > > -- Giampaolo - http://grodola.blogspot.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From python at mrabarnett.plus.com Mon Jul 17 16:46:48 2017 From: python at mrabarnett.plus.com (MRAB) Date: Mon, 17 Jul 2017 21:46:48 +0100 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: <08dd8581-ac2d-059c-0b8e-e7bfb5576569@mrabarnett.plus.com> On 2017-07-17 21:31, Giampaolo Rodola' wrote: > I completely agree. I love namedtuples but I've never been too happy > about the additional overhead vs. plain tuples (both for creation and > attribute access times), to the point that I explicitly avoid to use > them in certain circumstances (e.g. a busy loop) and only for public > end-user APIs returning multiple values. > > To be entirely honest, I'm not even sure why they need to be forcefully > declared upfront in the first place, instead of just having a > first-class function (builtin?) 
written in C: > > >>> ntuple(x=1, y=0) > (x=1, y=0) > > ...or even a literal as in: > > >>> (x=1, y=0) > (x=1, y=0) > [snip] I know it's a bit early to bikeshed, but shouldn't that be: >>> (x: 1, y: 0) (x: 1, y: 0) instead if it's a display/literal? From isaac.morland at gmail.com Mon Jul 17 16:48:02 2017 From: isaac.morland at gmail.com (Isaac Morland) Date: Mon, 17 Jul 2017 16:48:02 -0400 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: My apologies, I misunderstood what had been proposed (and rejected). So it sounds like the _source is a pre-requisite for the current exec-based implementation, but the proposal is to replace with a non-exec-based implementation, meaning _source would no longer be needed for the module to work and might be eliminated. But _source could continue to be generated lazily (and cached if thought helpful) using an @property, so even the (apparently rare) uses of _source would continue to work. This would in some sense be a DRY violation, but of a very pragmatic Pythonic sort, where we have two implementations, one for documentation and one for efficiency. How different would this be from all those modules that have both Python and C implementations? On 17 July 2017 at 09:31, Antoine Pitrou wrote: > > Le 17/07/2017 ? 15:26, Isaac Morland a ?crit : > > > > I think I understand well enough to say something intelligent? > > > > While actual references to _source are likely rare (certainly I?ve never > > used it), my understanding is that the way namedtuple works is to > > construct _source, and then exec it to create the class. Once that is > > done, there is no significant saving to be had by throwing away the > > constructed _source value. > > The proposed resolution on https://bugs.python.org/issue28638 is to > avoid exec() on most parts of the namedtuple class, hence speeding up > the class creation. > > > I come from > > a non-Pythonic background so use of exec still feels a bit weird to me > > but I absolutely love namedtuple and use it constantly. > > I think for most Python programmers, it still feels a bit un-Pythonic. > While exec() is part of Python, it's generally only used in fringe cases > where nothing else works. > > Regards > > Antoine. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > isaac.morland%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From python at mrabarnett.plus.com Mon Jul 17 16:57:24 2017 From: python at mrabarnett.plus.com (MRAB) Date: Mon, 17 Jul 2017 21:57:24 +0100 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <08dd8581-ac2d-059c-0b8e-e7bfb5576569@mrabarnett.plus.com> References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <08dd8581-ac2d-059c-0b8e-e7bfb5576569@mrabarnett.plus.com> Message-ID: <56649d1a-90ea-2807-2586-59605dba2633@mrabarnett.plus.com> On 2017-07-17 21:46, MRAB wrote: > On 2017-07-17 21:31, Giampaolo Rodola' wrote: >> I completely agree. I love namedtuples but I've never been too happy >> about the additional overhead vs. 
plain tuples (both for creation and >> attribute access times), to the point that I explicitly avoid to use >> them in certain circumstances (e.g. a busy loop) and only for public >> end-user APIs returning multiple values. >> >> To be entirely honest, I'm not even sure why they need to be forcefully >> declared upfront in the first place, instead of just having a >> first-class function (builtin?) written in C: >> >> >>> ntuple(x=1, y=0) >> (x=1, y=0) >> >> ...or even a literal as in: >> >> >>> (x=1, y=0) >> (x=1, y=0) >> > [snip] > > I know it's a bit early to bikeshed, but shouldn't that be: > > >>> (x: 1, y: 0) > (x: 1, y: 0) > > instead if it's a display/literal? > Actually, come to think of it, a dict's keys would be quoted, so there would be a slight inconsistency there... From encukou at gmail.com Mon Jul 17 17:07:11 2017 From: encukou at gmail.com (Petr Viktorin) Date: Mon, 17 Jul 2017 23:07:11 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: <45a5391d-2760-d613-156b-6e5884d8123d@gmail.com> On 07/17/2017 10:31 PM, Giampaolo Rodola' wrote: > I completely agree. I love namedtuples but I've never been too happy > about the additional overhead vs. plain tuples (both for creation and > attribute access times), to the point that I explicitly avoid to use > them in certain circumstances (e.g. a busy loop) and only for public > end-user APIs returning multiple values. > > To be entirely honest, I'm not even sure why they need to be forcefully > declared upfront in the first place, instead of just having a > first-class function (builtin?) written in C: > > >>> ntuple(x=1, y=0) > (x=1, y=0) > > ...or even a literal as in: > > >>> (x=1, y=0) > (x=1, y=0) > > Most of the times this is what I really want: quickly returning an > anonymous tuple with named attributes and nothing else, similarly to > os.times() & others. [...] It seems that you want `types.SimpleNamespace(x=1, y=0)`. From g.rodola at gmail.com Mon Jul 17 17:09:16 2017 From: g.rodola at gmail.com (Giampaolo Rodola') Date: Mon, 17 Jul 2017 23:09:16 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <45a5391d-2760-d613-156b-6e5884d8123d@gmail.com> References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <45a5391d-2760-d613-156b-6e5884d8123d@gmail.com> Message-ID: On Mon, Jul 17, 2017 at 11:07 PM, Petr Viktorin wrote: > On 07/17/2017 10:31 PM, Giampaolo Rodola' wrote: > >> I completely agree. I love namedtuples but I've never been too happy >> about the additional overhead vs. plain tuples (both for creation and >> attribute access times), to the point that I explicitly avoid to use them >> in certain circumstances (e.g. a busy loop) and only for public end-user >> APIs returning multiple values. >> >> To be entirely honest, I'm not even sure why they need to be forcefully >> declared upfront in the first place, instead of just having a first-class >> function (builtin?) written in C: >> >> >>> ntuple(x=1, y=0) >> (x=1, y=0) >> >> ...or even a literal as in: >> >> >>> (x=1, y=0) >> (x=1, y=0) >> >> Most of the times this is what I really want: quickly returning an >> anonymous tuple with named attributes and nothing else, similarly to >> os.times() & others. [...] >> > > It seems that you want `types.SimpleNamespace(x=1, y=0)`. > That doesn't support indexing (obj[0]). 
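For example (a quick interpreter session; the hypothetical ntuple() would
need to behave like the namedtuple below, not like SimpleNamespace):

>>> from types import SimpleNamespace
>>> from collections import namedtuple
>>> ns = SimpleNamespace(x=1, y=0)
>>> ns.x
1
>>> ns[0]
Traceback (most recent call last):
  ...
TypeError: 'types.SimpleNamespace' object is not subscriptable
>>> Point = namedtuple('Point', 'x y')
>>> p = Point(x=1, y=0)
>>> p.x, p[0], tuple(p)
(1, 1, (1, 0))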
-- Giampaolo - http://grodola.blogspot.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From tim.peters at gmail.com Mon Jul 17 17:24:17 2017 From: tim.peters at gmail.com (Tim Peters) Date: Mon, 17 Jul 2017 16:24:17 -0500 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: [Giampaolo Rodola' ] > .... > To be entirely honest, I'm not even sure why they need to be forcefully > declared upfront in the first place, instead of just having a first-class > function (builtin?) written in C: > > >>> ntuple(x=1, y=0) > (x=1, y=0) > > ...or even a literal as in: > > >>> (x=1, y=0) > (x=1, y=0) How do you propose that the resulting object T know that T.x is 1. T.y is 0, and T.z doesn't make sense? Declaring a namedtuple up front allows the _class_ to know that all of its instances map attribute "x" to index 0 and attribute "y" to index 1. The instances know nothing about that on their own, and consume no more memory than a plain tuple. If your `ntuple()` returns an object implementing its own mapping, it loses a primary advantage (0 memory overhead) of namedtuples. From barry at python.org Mon Jul 17 17:26:24 2017 From: barry at python.org (Barry Warsaw) Date: Mon, 17 Jul 2017 17:26:24 -0400 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: namedtuple is great and clever, but it?s also a bit clunky. It has a weird signature and requires a made up type name. It?s also rather unPythonic if you want to support default arguments when creating namedtuple instances. Maybe as you say, a lot of the typical use cases for namedtuples could be addressed by a better builtin, but I fear we?ll end up down the bikeshedding hole for that. -Barry > On Jul 17, 2017, at 16:31, Giampaolo Rodola' wrote: > > I completely agree. I love namedtuples but I've never been too happy about the additional overhead vs. plain tuples (both for creation and attribute access times), to the point that I explicitly avoid to use them in certain circumstances (e.g. a busy loop) and only for public end-user APIs returning multiple values. > > To be entirely honest, I'm not even sure why they need to be forcefully declared upfront in the first place, instead of just having a first-class function (builtin?) written in C: > > >>> ntuple(x=1, y=0) > (x=1, y=0) > > ...or even a literal as in: > > >>> (x=1, y=0) > (x=1, y=0) > > Most of the times this is what I really want: quickly returning an anonymous tuple with named attributes and nothing else, similarly to os.times() & others. I believe that if something like this would exist we would witness a big transition from tuple() to ntuple() for all those functions returning more than 1 value. We witnessed a similar transition in many parts of the stdlib when collections.namedtuple was first introduced, but not everywhere, probably because declaring a namedtuple is more work, it's more expensive, and it still feels like you're dealing with some kind of too high-level second-class citizen with too much overhead and too many sugar in terms of API (e.g. "verbose", "rename", "module" and "_source"). -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: Message signed with OpenPGP URL: From brett at python.org Mon Jul 17 17:31:20 2017 From: brett at python.org (Brett Cannon) Date: Mon, 17 Jul 2017 21:31:20 +0000 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <0DC5C68F-953D-493A-BD18-6D77D2C047CA@gmail.com> References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <0DC5C68F-953D-493A-BD18-6D77D2C047CA@gmail.com> Message-ID: On Mon, 17 Jul 2017 at 13:28 Raymond Hettinger wrote: > > > On Jul 17, 2017, at 8:49 AM, Guido van Rossum wrote: > > > > One of the reasons to be wary of exec()/eval() other than the usual > security concerns is that in some Python implementations they have a high > overhead to initialize the parser and compiler. (Even in CPython it's not > that fast.) > > BTW, if getting rid of the template/exec pair is a goal, Joe Jevnik > proposed a patch a couple of years ago the completely reimplemented > namedtuple() in C. The patch was somewhat complex and hard to semantic > equivalence, but we could resurrect it and clean it up. That way, we > could like the existing namedtuple() code in-place and do a subsequent > import from the C-version. > > This path won't be fun (whenever we have both a C version and Python > version, we get years of trying to sync-up tiny differences); however, it > will give you take fastest startup times, the fastest lookups at runtime, > and eliminate use of exec. > I vaguely remember some years ago someone proposing a patch that used metaclasses to avoid using exec() (I think it was to benefit PyPy or one of the JIT-backed interpreters). Would that work to remove the need for exec() while keeping the code in pure Python? As for removing exec() as a goal, I'll back up Christian's point and the one Steve made at the language summit that removing the use of exec() from the critical path in Python is a laudable goal from a security perspective. -Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsbueno at python.org.br Mon Jul 17 17:38:38 2017 From: jsbueno at python.org.br (Joao S. O. Bueno) Date: Mon, 17 Jul 2017 18:38:38 -0300 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: Just for sake of completeness since people are talking about a namedtuple overhaul, I have a couple implementations here - https://github.com/jsbueno/extradict/blob/master/extradict/extratuple.py If any idea there can help inspiring someone, I will be happy. js -><- On 17 July 2017 at 18:26, Barry Warsaw wrote: > namedtuple is great and clever, but it?s also a bit clunky. It has a > weird signature and requires a made up type name. It?s also rather > unPythonic if you want to support default arguments when creating > namedtuple instances. Maybe as you say, a lot of the typical use cases for > namedtuples could be addressed by a better builtin, but I fear we?ll end up > down the bikeshedding hole for that. > > -Barry > > > On Jul 17, 2017, at 16:31, Giampaolo Rodola' wrote: > > > > I completely agree. I love namedtuples but I've never been too happy > about the additional overhead vs. plain tuples (both for creation and > attribute access times), to the point that I explicitly avoid to use them > in certain circumstances (e.g. a busy loop) and only for public end-user > APIs returning multiple values. 
> > > > To be entirely honest, I'm not even sure why they need to be forcefully > declared upfront in the first place, instead of just having a first-class > function (builtin?) written in C: > > > > >>> ntuple(x=1, y=0) > > (x=1, y=0) > > > > ...or even a literal as in: > > > > >>> (x=1, y=0) > > (x=1, y=0) > > > > Most of the times this is what I really want: quickly returning an > anonymous tuple with named attributes and nothing else, similarly to > os.times() & others. I believe that if something like this would exist we > would witness a big transition from tuple() to ntuple() for all those > functions returning more than 1 value. We witnessed a similar transition in > many parts of the stdlib when collections.namedtuple was first introduced, > but not everywhere, probably because declaring a namedtuple is more work, > it's more expensive, and it still feels like you're dealing with some kind > of too high-level second-class citizen with too much overhead and too many > sugar in terms of API (e.g. "verbose", "rename", "module" and "_source"). > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > jsbueno%40python.org.br > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg.ewing at canterbury.ac.nz Mon Jul 17 18:27:21 2017 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 18 Jul 2017 10:27:21 +1200 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: <596D39C9.4030805@canterbury.ac.nz> Barry Warsaw wrote: > namedtuple is great and clever, but it?s also a bit clunky. It has a weird > signature and requires a made up type name. Maybe a metaclass could be used to make something like this possible: class Foo(NamedTuple, fields = 'x,y,z'): ... Then the name is explicit and you get to add methods etc. if you want. -- Greg From ethan at stoneleaf.us Mon Jul 17 19:10:16 2017 From: ethan at stoneleaf.us (Ethan Furman) Date: Mon, 17 Jul 2017 16:10:16 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: <596D43D8.7010909@stoneleaf.us> On 07/17/2017 02:26 PM, Barry Warsaw wrote: > namedtuple is great and clever, but it?s also a bit clunky. It has a weird > signature and requires a made up type name. It?s also rather unPythonic if > you want to support default arguments when creating namedtuple instances. > Maybe as you say, a lot of the typical use cases for namedtuples could be > addressed by a better builtin, but I fear we?ll end up down the bikeshedding > hole for that. My aenum library [1] has a metaclass-based NamedTuple that allows for default arguments as well as other goodies (which would probably not make it to the stdlib since they are mostly fluff). 
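For anyone curious what the general shape of that approach looks like without digging through aenum, here is a deliberately minimal sketch -- this is *not* aenum's actual code, just an illustration that layers a metaclass over collections.namedtuple and uses the old __new__.__defaults__ trick for the default values Barry was asking about:

    from collections import namedtuple

    class NamedTupleMeta(type):
        def __new__(mcls, name, bases, ns, fields=None, defaults=()):
            if fields is None:
                # Creating the plain 'NamedTuple' base class itself.
                return super().__new__(mcls, name, bases, ns)
            nt = namedtuple(name, fields)
            # Classic recipe: right-aligned default values for __new__.
            nt.__new__.__defaults__ = tuple(defaults)
            # Put the generated namedtuple first in the bases so methods
            # defined in the class body are kept.
            return super().__new__(mcls, name, (nt,) + bases, ns)

        def __init__(cls, name, bases, ns, fields=None, defaults=()):
            super().__init__(name, bases, ns)

    class NamedTuple(metaclass=NamedTupleMeta):
        pass

    class Point(NamedTuple, fields='x y z', defaults=(0,)):
        def norm(self):
            return (self.x ** 2 + self.y ** 2 + self.z ** 2) ** 0.5

    print(Point(3, 4))         # Point(x=3, y=4, z=0)
    print(Point(3, 4).norm())  # 5.0

The real thing obviously needs more error checking, and since all the heavy lifting still happens inside collections.namedtuple it does nothing for the startup cost -- it only tidies up the declaration syntax.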
-- ~Ethan~ [1] https://pypi.python.org/pypi/aenum From ethan at stoneleaf.us Mon Jul 17 19:05:38 2017 From: ethan at stoneleaf.us (Ethan Furman) Date: Mon, 17 Jul 2017 16:05:38 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <0DC5C68F-953D-493A-BD18-6D77D2C047CA@gmail.com> Message-ID: <596D42C2.7050901@stoneleaf.us> On 07/17/2017 02:31 PM, Brett Cannon wrote: > I vaguely remember some years ago someone proposing a patch that used metaclasses to avoid using exec() (I think it was > to benefit PyPy or one of the JIT-backed interpreters). Would that work to remove the need for exec() while keeping the > code in pure Python? The aenum library [1] uses the same techniques as Enum for a metaclass-based namedtuple. I don't expect it to be faster, but somebody could do the benchmarks and then we'd know for sure. ;) -- ~Ethan~ [1] https://pypi.python.org/pypi/aenum From ethan at stoneleaf.us Mon Jul 17 19:17:57 2017 From: ethan at stoneleaf.us (Ethan Furman) Date: Mon, 17 Jul 2017 16:17:57 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <596D39C9.4030805@canterbury.ac.nz> References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <596D39C9.4030805@canterbury.ac.nz> Message-ID: <596D45A5.3090001@stoneleaf.us> On 07/17/2017 03:27 PM, Greg Ewing wrote: > Barry Warsaw wrote: >> namedtuple is great and clever, but it?s also a bit clunky. It has a weird >> signature and requires a made up type name. > > Maybe a metaclass could be used to make something > like this possible: > > > class Foo(NamedTuple, fields = 'x,y,z'): > ... > > Then the name is explicit and you get to add methods > etc. if you want. From the NamedTuple tests from my aenum library [1]: LifeForm = NamedTuple('LifeForm', 'branch genus species', module=__name__) class DeathForm(NamedTuple): color = 0 rigidity = 1 odor = 2 class WhatsIt(NamedTuple): def what(self): return self[0] class ThatsIt(WhatsIt): blah = 0 bleh = 1 class Character(NamedTuple): # second argument is doc string name = 0 gender = 1, None, 'male' klass = 2, None, 'fighter' class Point(NamedTuple): x = 0, 'horizondal coordinate', 0 y = 1, 'vertical coordinate', 0 class Point(NamedTuple): x = 0, 'horizontal coordinate', 1 y = 1, 'vertical coordinate', -1 class Color(NamedTuple): r = 0, 'red component', 11 g = 1, 'green component', 29 b = 2, 'blue component', 37 Pixel1 = NamedTuple('Pixel', Point+Color, module=__name__) class Pixel2(Point, Color): "a colored dot" class Pixel3(Point): r = 2, 'red component', 11 g = 3, 'green component', 29 b = 4, 'blue component', 37 -- ~Ethan~ [1] https://pypi.python.org/pypi/aenum From g.rodola at gmail.com Mon Jul 17 19:17:24 2017 From: g.rodola at gmail.com (Giampaolo Rodola') Date: Tue, 18 Jul 2017 01:17:24 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: On Mon, Jul 17, 2017 at 11:24 PM, Tim Peters wrote: > [Giampaolo Rodola' ] > > .... > > To be entirely honest, I'm not even sure why they need to be forcefully > > declared upfront in the first place, instead of just having a first-class > > function (builtin?) written in C: > > > > >>> ntuple(x=1, y=0) > > (x=1, y=0) > > > > ...or even a literal as in: > > > > >>> (x=1, y=0) > > (x=1, y=0) > > How do you propose that the resulting object T know that T.x is 1. 
T.y > is 0, and T.z doesn't make sense? I'm not sure I understand your concern. That's pretty much what PyStructSequence already does. > Declaring a namedtuple up front > allows the _class_ to know that all of its instances map attribute "x" > to index 0 and attribute "y" to index 1. The instances know nothing > about that on their own Hence why I was talking about a "(lightweight) anonymous tuple with named attributes". The primary use case for namedtuples is accessing values by name (obj.x). Personally I've always considered the upfront module-level declaration only an annoyance which unnecessarily pollutes the API and adds extra overhead. I typically end up putting all namedtuples in a private module: https://github.com/giampaolo/psutil/blob/8b8da39e0c62432504fb5f67c418715aad35b291/psutil/_common.py#L156-L225 ...then import them from elsewhere and make sure they are not exposed publicly because the intermediate object returned by collections.namedtuple() is basically useless for the end-user. Also picking up a sensible name for the namedtuple is an annoyance and kinda weird. Consider this: from collections import namedtuple Coordinates = namedtuple('coordinates', ['x', 'y']) def get_coordinates(): return Coordinates(10, 20) ...vs. this: def get_coordinates(): return ntuple(x=10, y=20) ...or this: def get_coordinates(): return (x=10, y=20) If your `ntuple()` returns an object implementing its own > mapping, it loses a primary advantage (0 memory overhead) of > namedtuples. > The extra memory overhead is a price I would be happy to pay considering that collections.namedtuple is considerably slower than a plain tuple. Other than the additional overhead on startup / import time, instantiation is 4.5x slower than a plain tuple: $ python3.7 -m timeit -s "from collections import namedtuple; nt = namedtuple('xxx', ('x', 'y'))" "nt(1, 2)" 1000000 loops, best of 5: 313 nsec per loop $ python3.7 -m timeit "tuple((1, 2))" 5000000 loops, best of 5: 68.4 nsec per loop ...and name access is 2x slower than index access: $ python3.7 -m timeit -s "from collections import namedtuple; nt = namedtuple('xxx', ('x', 'y')); x = nt(1, 2)" "x.x" 5000000 loops, best of 5: 41.9 nsec per loop $ python3.7 -m timeit -s "from collections import namedtuple; nt = namedtuple('xxx', ('x', 'y')); x = nt(1, 2)" "x[0]" 10000000 loops, best of 5: 20.2 nsec per loop $ python3.7 -m timeit -s "x = (1, 2)" "x[0]" 10000000 loops, best of 5: 20.5 nsec per loop -- Giampaolo - http://grodola.blogspot.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Mon Jul 17 19:45:17 2017 From: guido at python.org (Guido van Rossum) Date: Mon, 17 Jul 2017 16:45:17 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: Raymond agreed to reopen the issue. Everyone who's eager to redesign namedtuple, please go to python-ideas. -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From alexander.belopolsky at gmail.com Mon Jul 17 19:48:05 2017 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Mon, 17 Jul 2017 19:48:05 -0400 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <596D39C9.4030805@canterbury.ac.nz> References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <596D39C9.4030805@canterbury.ac.nz> Message-ID: On Mon, Jul 17, 2017 at 6:27 PM, Greg Ewing wrote: > > > Maybe a metaclass could be used to make something > like this possible: > > > class Foo(NamedTuple, fields = 'x,y,z'): > ... > > If you think of it, collection.namedtuple *is* a metaclass. A simple wrapper will make it usable as such: import collections def namedtuple(name, bases, attrs, fields=()): # Override __init_subclass__ for Python 3.6 return collections.namedtuple(name, fields) class Foo(metaclass=namedtuple, fields='x,y'): pass print(Foo(1, 2)) # ---> Foo(x=1, y=2) -------------- next part -------------- An HTML attachment was scrubbed... URL: From barry at python.org Mon Jul 17 20:16:42 2017 From: barry at python.org (Barry Warsaw) Date: Mon, 17 Jul 2017 20:16:42 -0400 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <596D39C9.4030805@canterbury.ac.nz> References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <596D39C9.4030805@canterbury.ac.nz> Message-ID: <1046D04A-6EA5-40E8-AEB0-4B40F9648ACF@python.org> On Jul 17, 2017, at 18:27, Greg Ewing wrote: > > Barry Warsaw wrote: >> namedtuple is great and clever, but it?s also a bit clunky. It has a weird >> signature and requires a made up type name. > > Maybe a metaclass could be used to make something > like this possible: > > > class Foo(NamedTuple, fields = 'x,y,z'): > ... > > Then the name is explicit and you get to add methods > etc. if you want. Yes, I like how that reads. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: Message signed with OpenPGP URL: From steve at pearwood.info Mon Jul 17 20:21:45 2017 From: steve at pearwood.info (Steven D'Aprano) Date: Tue, 18 Jul 2017 10:21:45 +1000 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <0DC5C68F-953D-493A-BD18-6D77D2C047CA@gmail.com> Message-ID: <20170718002143.GZ3149@ando.pearwood.info> On Mon, Jul 17, 2017 at 09:31:20PM +0000, Brett Cannon wrote: > As for removing exec() as a goal, I'll back up Christian's point and the > one Steve made at the language summit that removing the use of exec() from > the critical path in Python is a laudable goal from a security perspective. I'm sorry, I don't understand this point. What do you mean by "critical path"? Is the intention to remove exec from builtins? From the entire language? If not, how does its use in namedtuple introduce a security problem? -- Steve From alexander.belopolsky at gmail.com Mon Jul 17 21:14:38 2017 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Mon, 17 Jul 2017 21:14:38 -0400 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <1046D04A-6EA5-40E8-AEB0-4B40F9648ACF@python.org> References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <596D39C9.4030805@canterbury.ac.nz> <1046D04A-6EA5-40E8-AEB0-4B40F9648ACF@python.org> Message-ID: On Mon, Jul 17, 2017 at 8:16 PM, Barry Warsaw wrote: > .. 
> > class Foo(NamedTuple, fields = 'x,y,z'): > > ... > > > > Then the name is explicit and you get to add methods > > etc. if you want. > > Yes, I like how that reads. > > I would prefer class Foo(metaclass=namedtuple, fields = 'x,y,z'): ... which while slightly more verbose, does not lie about what namedtuple is - a factory of classes. This, however is completely orthogonal to the issue of performance. As I mentioned in my previous post, namedtuple metaclass above can be a simple function: def namedtuple(name, bases, attrs, fields=()): # Override __init_subclass__ for Python 3.6 return collections.namedtuple(name, fields) -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve at pearwood.info Mon Jul 17 21:22:48 2017 From: steve at pearwood.info (Steven D'Aprano) Date: Tue, 18 Jul 2017 11:22:48 +1000 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: <20170718012248.GA3149@ando.pearwood.info> On Tue, Jul 18, 2017 at 01:17:24AM +0200, Giampaolo Rodola' wrote: > The extra memory overhead is a price I would be happy to pay considering > that collections.namedtuple is considerably slower than a plain tuple. > Other than the additional overhead on startup / import time, instantiation > is 4.5x slower than a plain tuple: > > $ python3.7 -m timeit -s "from collections import namedtuple; nt = > namedtuple('xxx', ('x', 'y'))" "nt(1, 2)" > 1000000 loops, best of 5: 313 nsec per loop > > $ python3.7 -m timeit "tuple((1, 2))" > 5000000 loops, best of 5: 68.4 nsec per loop I don't think that is a fair comparision. As far as I can tell, that gets compiled to a name lookup for "tuple" which then returns its argument unchanged, the tuple itself being constant-folded at compile time. py> dis.dis("tuple((1, 2))") 1 0 LOAD_NAME 0 (tuple) 3 LOAD_CONST 2 ((1, 2)) 6 CALL_FUNCTION 1 (1 positional, 0 keyword pair) 9 RETURN_VALUE -- Steve From njs at pobox.com Mon Jul 17 22:25:27 2017 From: njs at pobox.com (Nathaniel Smith) Date: Mon, 17 Jul 2017 19:25:27 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <20170718002143.GZ3149@ando.pearwood.info> References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <0DC5C68F-953D-493A-BD18-6D77D2C047CA@gmail.com> <20170718002143.GZ3149@ando.pearwood.info> Message-ID: On Jul 17, 2017 5:28 PM, "Steven D'Aprano" wrote: On Mon, Jul 17, 2017 at 09:31:20PM +0000, Brett Cannon wrote: > As for removing exec() as a goal, I'll back up Christian's point and the > one Steve made at the language summit that removing the use of exec() from > the critical path in Python is a laudable goal from a security perspective. I'm sorry, I don't understand this point. What do you mean by "critical path"? Is the intention to remove exec from builtins? From the entire language? If not, how does its use in namedtuple introduce a security problem? I think the intention is to allow users with a certain kind of security requirement to opt in to a restricted version of the language that doesn't support exec. This is difficult if the stdlib is calling exec all over the place. But nobody is suggesting to change the language in regular usage, just provide another option. -n -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ethan at stoneleaf.us Mon Jul 17 22:40:39 2017 From: ethan at stoneleaf.us (Ethan Furman) Date: Mon, 17 Jul 2017 19:40:39 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> Message-ID: <596D7527.6@stoneleaf.us> On 07/17/2017 04:45 PM, Guido van Rossum wrote: > Raymond agreed to reopen the issue. Everyone who's eager to redesign namedtuple, please go to python-ideas. Python Ideas thread started. -- ~Ethan~ From storchaka at gmail.com Tue Jul 18 01:20:27 2017 From: storchaka at gmail.com (Serhiy Storchaka) Date: Tue, 18 Jul 2017 08:20:27 +0300 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <20170717144319.7fdbf64b@fsol> References: <20170717144319.7fdbf64b@fsol> Message-ID: 17.07.17 15:43, Antoine Pitrou ????: > Cost of creating a namedtuple has been identified as a contributor to > Python startup time. Not only Python core and the stdlib, but any > third-party library creating namedtuple classes (there are many of > them). An issue was created for this: > https://bugs.python.org/issue28638 > > Raymond decided to close the issue because: > > 1) the proposed resolution makes the "_source" attribute empty (or, at > least, something else than it currently is). Raymond claims the > "_source" attribute is an essential feature of namedtuples. > > 2) optimizing startup cost is supposedly not worth the effort. The implementations of namedtuple that don't use compilation were written by different developers (including me) multiple times before issue28638. I provided my patch in issue28638 as an example, but I understand Raymond's arguments, and they look weighty to me. I don't know how much the _source attribute is used, but it is a part of public API. The drawback of these implementation is slower __new__ and __repr__ methods. This can be avoided if use compilation for creating __new__, but this makes the creation of a namedtuple class slower (but still faster than compiling full namedtuple class). The drawback of generating _source without using it to create a namedtuple class is complicating the code and possible quickly desynchronization of two implementations in future. I think that the right solution of this issue is generalizing the import machinery and allowing it to cache not just files, but arbitrary chunks of code. We already use precompiled bytecode files for exactly same goal -- speed up the startup by avoiding compilation. This solution could be used for caching other generated code, not just namedtuples. From songofacandy at gmail.com Tue Jul 18 06:04:37 2017 From: songofacandy at gmail.com (INADA Naoki) Date: Tue, 18 Jul 2017 19:04:37 +0900 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> Message-ID: > > I think that the right solution of this issue is generalizing the import > machinery and allowing it to cache not just files, but arbitrary chunks of > code. We already use precompiled bytecode files for exactly same goal -- > speed up the startup by avoiding compilation. This solution could be used > for caching other generated code, not just namedtuples. > I thought about adding C implementation based on PyStructSequence. But I like Jelle's approach because it may improve performance on all Python implementation. It's reducing source to eval. It shares code objects for most methods. 
(refer https://github.com/python/cpython/pull/2736#issuecomment-316014866 for quick and dirty bench on PyPy) I agree that template + eval pattern is nice for readability when comparing to other meta-programming magic. And code cache machinery can improve template + eval pattern in CPython. But namedtuple is very widely used. It's loved enough to get optimized for not only CPython. So I prefer Jelle's approach to adding code cache machinery in this case. Regards, INADA Naoki From ncoghlan at gmail.com Tue Jul 18 08:13:01 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 18 Jul 2017 22:13:01 +1000 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: <9E9AD48B-155B-4357-98D8-65C088E1F02F@gmail.com> References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <9E9AD48B-155B-4357-98D8-65C088E1F02F@gmail.com> Message-ID: On 18 July 2017 at 05:42, Raymond Hettinger wrote: > One minor grumble: I think we need to give careful cost/benefit considerations to optimizations that complicate the implementation. Over the last several years, the source for Python has grown increasingly complicated. Fewer people understand it now. It is much harder to newcomers to on-ramp. The old-timers (myself included) find that their knowledge is out of date. And complexity leads to bugs (the C optimization of random number seeding caused a major bug in the 3.6.0 release; the C optimization of the lru_cache resulted in multiple releases having a hard to find threading bugs, etc.). It is becoming increasingly difficult to look at code and tell whether it is correct (I still don't fully understand the implications of the recursive constant folding in the peephole optimizer for example). In the case of this named tuple proposal, the complexity is manageable, but the overall trend isn't good and I get the feeling the aggressive optimization is causing us to forget key par > ts of the zen-of-python. As another example of this: while trading the global import lock for per-module locks eliminated most of the old import deadlocks, it turns out that it *also* left us with some fairly messy race conditions and more fragile code (I still count that particular case as a win overall, but it definitely raises the barrier to entry for maintaining that code). Unfortunately, these are frequently cases where the benefits are immediately visible (e.g. faster benchmark results, removing longstanding limitations on user code), but the downsides can literally take years to make themselves felt (e.g. higher defect rates in the interpreter, subtle bugs in previously correct user code that are eventually traced back to interpreter changes). Cheers, Nick. 
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From solipsis at pitrou.net Tue Jul 18 08:28:51 2017 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 18 Jul 2017 14:28:51 +0200 Subject: [Python-Dev] Impact of Namedtuple on startup time References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <9E9AD48B-155B-4357-98D8-65C088E1F02F@gmail.com> Message-ID: <20170718142851.0ee9bda0@fsol> On Tue, 18 Jul 2017 22:13:01 +1000 Nick Coghlan wrote: > > As another example of this: while trading the global import lock for > per-module locks eliminated most of the old import deadlocks, it turns > out that it *also* left us with some fairly messy race conditions and > more fragile code (I still count that particular case as a win > overall, but it definitely raises the barrier to entry for maintaining > that code). > > Unfortunately, these are frequently cases where the benefits are > immediately visible (e.g. faster benchmark results, removing > longstanding limitations on user code), but the downsides can > literally take years to make themselves felt (e.g. higher defect rates > in the interpreter, subtle bugs in previously correct user code that > are eventually traced back to interpreter changes). The import deadlocks were really in the category of "subtle bugs" that only occur in certain timing conditions (especially when combined with PyImport_ImportModuleNoBlock and/or stdlib modules which can try to import stuff silently, such as the codecs module). So we traded a category of "subtle bugs" due to a core design deficiency for another category of "subtle bugs" due to an imperfect implementation, the latter being actually fixable incrementally :-) Disclaimer: I wrote the initial per-module lock implementation, which was motivated by those long-standing "subtle bugs" in multi-threaded applications. Regards Antoine. From guido at python.org Tue Jul 18 11:12:50 2017 From: guido at python.org (Guido van Rossum) Date: Tue, 18 Jul 2017 08:12:50 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <9E9AD48B-155B-4357-98D8-65C088E1F02F@gmail.com> Message-ID: There are some weighty things being said in this subthread that shouldn't be hidden under the heading of improving NamedTuple. For continued discussion of our development philosophy let's open a new thread. (I have an opinion but I expect I'm not the only one with that opinion, so I'll let others express theirs first.) --Guido On Tue, Jul 18, 2017 at 5:13 AM, Nick Coghlan wrote: > On 18 July 2017 at 05:42, Raymond Hettinger > wrote: > > One minor grumble: I think we need to give careful cost/benefit > considerations to optimizations that complicate the implementation. Over > the last several years, the source for Python has grown increasingly > complicated. Fewer people understand it now. It is much harder to > newcomers to on-ramp. The old-timers (myself included) find that their > knowledge is out of date. And complexity leads to bugs (the C optimization > of random number seeding caused a major bug in the 3.6.0 release; the C > optimization of the lru_cache resulted in multiple releases having a hard > to find threading bugs, etc.). It is becoming increasingly difficult to > look at code and tell whether it is correct (I still don't fully understand > the implications of the recursive constant folding in the peephole > optimizer for example). 
In the case of this named tuple proposal, the > complexity is manageable, but the overall trend isn't good and I get the > feeling the aggressive optimization is causing us to forget key par > > ts of the zen-of-python. > > As another example of this: while trading the global import lock for > per-module locks eliminated most of the old import deadlocks, it turns > out that it *also* left us with some fairly messy race conditions and > more fragile code (I still count that particular case as a win > overall, but it definitely raises the barrier to entry for maintaining > that code). > > Unfortunately, these are frequently cases where the benefits are > immediately visible (e.g. faster benchmark results, removing > longstanding limitations on user code), but the downsides can > literally take years to make themselves felt (e.g. higher defect rates > in the interpreter, subtle bugs in previously correct user code that > are eventually traced back to interpreter changes). > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From benhoyt at gmail.com Tue Jul 18 12:03:36 2017 From: benhoyt at gmail.com (Ben Hoyt) Date: Tue, 18 Jul 2017 12:03:36 -0400 Subject: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? Message-ID: Hi folks, (Not entirely sure this is the right place for this question, but hopefully it's of interest to several folks.) A few days ago I posted a note in response to Victor Stinner's articles on his CPython contributions, noting that I wrote a program that ran in 11.7 seconds on Python 2.7, but only takes 5.1 seconds on Python 3.5 (on my 2.5 GHz macOS i7), more than 2x as fast. Obviously this is a Good Thing, but I'm curious as to why there's so much difference. The program is a pentomino puzzle solver, and it works via code generation, generating a ton of nested "if" statements, so I believe it's exercising the Python bytecode interpreter heavily. Obviously there have been some big optimizations to make this happen, but I'm curious what the main improvements are that are causing this much difference. There's a writeup about my program here, with benchmarks at the bottom: http://benhoyt.com/writings/python-pentomino/ This is the generated Python code that's being exercised: https://github.com/benhoyt/python-pentomino/blob/master/generated_solve.py For reference, on Python 3.6 it runs in 4.6 seconds (same on Python 3.7 alpha). This smallish increase from Python 3.5 to Python 3.6 was more expected to me due to the bytecode changing to wordcode in 3.6. I tried using cProfile on both Python versions, but that didn't say much, because the functions being called aren't taking the majority of the time. How does one benchmark at a lower level, or otherwise explain what's going on here? Thanks, Ben -------------- next part -------------- An HTML attachment was scrubbed... URL: From ethan at stoneleaf.us Tue Jul 18 12:08:08 2017 From: ethan at stoneleaf.us (Ethan Furman) Date: Tue, 18 Jul 2017 09:08:08 -0700 Subject: [Python-Dev] Design Philosophy: Performance vs Robustness/Maintainability Message-ID: <596E3268.4070705@stoneleaf.us> Raymond Hettinger: ----------------- > One minor grumble: I think we need to give careful cost/benefit considerations to > optimizations that complicate the implementation. 
Over the last several years, the > source for Python has grown increasingly complicated. Fewer people understand it > now. It is much harder to newcomers to on-ramp. The old-timers (myself included) > find that their knowledge is out of date. And complexity leads to bugs (the C > optimization of random number seeding caused a major bug in the 3.6.0 release; the > C optimization of the lru_cache resulted in multiple releases having a hard to find > threading bugs, etc.). It is becoming increasingly difficult to look at code and > tell whether it is correct (I still don't fully understand the implications of the > recursive constant folding in the peephole optimizer for example). In the case > of this named tuple proposal, the complexity is manageable, but the overall trend > isn't good and I get the feeling the aggressive optimization is causing us to > forget key parts of the zen-of-python. Nick Coughlan: ------------- > As another example of this: while trading the global import lock for > per-module locks eliminated most of the old import deadlocks, it turns > out that it *also* left us with some fairly messy race conditions and > more fragile code (I still count that particular case as a win > overall, but it definitely raises the barrier to entry for maintaining > that code). > > Unfortunately, these are frequently cases where the benefits are > immediately visible (e.g. faster benchmark results, removing > longstanding limitations on user code), but the downsides can > literally take years to make themselves felt (e.g. higher defect rates > in the interpreter, subtle bugs in previously correct user code that > are eventually traced back to interpreter changes). Barry Warsaw: ------------ > Regardless of whether [namedtuple] optimization is a good idea or not, start up > time *is* a serious challenge in many environments for CPython in particular and > the perception of Python?s applicability to many problems. I think we?re better > off trying to identify and address such problems than ignoring or minimizing them. Ethan Furman: ------------ Speed is not the only factor, and certainly shouldn't be the first concern, but once we have correct code we need to follow our own advice: find the bottlenecks and optimize them. Optimized code will never be as pretty or maintainable as simple, unoptimized code but real-world applications often require as much performance as can be obtained. [My apologies if I missed any points from the namedtuple thread.] -- ~Ethan~ From ethan at stoneleaf.us Tue Jul 18 12:08:33 2017 From: ethan at stoneleaf.us (Ethan Furman) Date: Tue, 18 Jul 2017 09:08:33 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <9E9AD48B-155B-4357-98D8-65C088E1F02F@gmail.com> Message-ID: <596E3281.1050804@stoneleaf.us> On 07/18/2017 08:12 AM, Guido van Rossum wrote: > There are some weighty things being said in this subthread that shouldn't be hidden under the heading of improving > NamedTuple. For continued discussion of our development philosophy let's open a new thread. (I have an opinion but I > expect I'm not the only one with that opinion, so I'll let others express theirs first.) New thread created. -- ~Ethan~ From jsbueno at python.org.br Tue Jul 18 12:08:49 2017 From: jsbueno at python.org.br (Joao S. O. 
Bueno) Date: Tue, 18 Jul 2017 13:08:49 -0300 Subject: [Python-Dev] deque implementation question In-Reply-To: References: Message-ID: On 15 July 2017 at 04:01, Max Moroz wrote: > What would be the disadvantage of implementing collections.deque as a > circular array (rather than a doubly linked list of blocks)? My naive > thinking was that a circular array would maintain the current O(1) append/pop > from either side, and would improve index lookup in the middle from O(n) to > O(1). What am I missing? > > The insertion/removal of an arbitrary item specified by a pointer would > increase from constant time to linear, but since we don't have pointers > this is a moot point. > > Of course when the circular array is full, it will need to be reallocated, > but the amortized cost of that is still O(1). (Moreover, for a bounded > deque, there's even an option of preallocation, which would completely > eliminate reallocations.) > Now - since you are at it, you could possibly mine pypi for interesting efficient data structures that could cover use cases lists and deque not suffice. I am pretty sure one could find a couple, - and once we get a few that are well behaved and somewhat popular, they could be made candidates for inclusion in collections, I guess. > Thanks > > Max > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > jsbueno%40python.org.br > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From victor.stinner at gmail.com Tue Jul 18 12:12:09 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Tue, 18 Jul 2017 18:12:09 +0200 Subject: [Python-Dev] Design Philosophy: Performance vs Robustness/Maintainability In-Reply-To: <596E3268.4070705@stoneleaf.us> References: <596E3268.4070705@stoneleaf.us> Message-ID: [Python-Dev] Design Philosophy: Performance vs Robustness/Maintainability 2017-07-18 18:08 GMT+02:00 Ethan Furman : > Nick Coughlan: > ------------- >> >> As another example of this: while trading the global import lock for >> per-module locks eliminated most of the old import deadlocks, (...) Minor remark: the email subject is inaccurate, this change is not related to performance. I would more say that it's about correctness. Python 3 doesn't hung on deadlock in "legit" import anymore ;-) Victor From solipsis at pitrou.net Tue Jul 18 12:16:30 2017 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 18 Jul 2017 18:16:30 +0200 Subject: [Python-Dev] Design Philosophy: Performance vs Robustness/Maintainability References: <596E3268.4070705@stoneleaf.us> Message-ID: <20170718181630.0fe90f85@fsol> On Tue, 18 Jul 2017 09:08:08 -0700 Ethan Furman wrote: > > Nick Coughlan: > ------------- It is "Nick Coghlan" not "Coughlan". > > As another example of this: while trading the global import lock for > > per-module locks eliminated most of the old import deadlocks, it turns > > out that it *also* left us with some fairly messy race conditions and > > more fragile code (I still count that particular case as a win > > overall, but it definitely raises the barrier to entry for maintaining > > that code). > > > > Unfortunately, these are frequently cases where the benefits are > > immediately visible (e.g. faster benchmark results, removing > > longstanding limitations on user code), but the downsides can > > literally take years to make themselves felt (e.g. 
higher defect rates > > in the interpreter, subtle bugs in previously correct user code that > > are eventually traced back to interpreter changes). I'll reply here again: the original motivation for the per-module import lock was not performance but correctness. The import deadlocks were really in the category of "subtle bugs" that only occur in certain timing conditions (especially when combined with PyImport_ImportModuleNoBlock and/or stdlib modules which can try to import stuff silently, such as the codecs module). So we traded a category of "subtle bugs" due to a core design deficiency for another category of "subtle bugs" due to an imperfect implementation, the latter being actually fixable incrementally :-) Disclaimer: I wrote the initial per-module lock implementation. Regards Antoine. From victor.stinner at gmail.com Tue Jul 18 12:16:52 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Tue, 18 Jul 2017 18:16:52 +0200 Subject: [Python-Dev] Design Philosophy: Performance vs Robustness/Maintainability In-Reply-To: <596E3268.4070705@stoneleaf.us> References: <596E3268.4070705@stoneleaf.us> Message-ID: 2017-07-18 18:08 GMT+02:00 Ethan Furman : > Raymond Hettinger: > ----------------- >> And complexity leads to bugs >> (the C >> optimization of random number seeding caused a major bug in the 3.6.0 >> release Hum, I guess that Raymond is referring to http://bugs.python.org/issue29085 This regression was not by an optimization at all, but a change to harden Python: https://www.python.org/dev/peps/pep-0524/ Victor From solipsis at pitrou.net Tue Jul 18 12:18:06 2017 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 18 Jul 2017 18:18:06 +0200 Subject: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? References: Message-ID: <20170718181806.14fed37f@fsol> On Tue, 18 Jul 2017 12:03:36 -0400 Ben Hoyt wrote: > Hi folks, > > (Not entirely sure this is the right place for this question, but hopefully > it's of interest to several folks.) > > A few days ago I posted a note in response to Victor Stinner's articles on > his CPython contributions, noting that I wrote a program that ran in 11.7 > seconds on Python 2.7, but only takes 5.1 seconds on Python 3.5 (on my 2.5 > GHz macOS i7), more than 2x as fast. Obviously this is a Good Thing, but > I'm curious as to why there's so much difference. > > The program is a pentomino puzzle solver, and it works via code generation, > generating a ton of nested "if" statements, so I believe it's exercising > the Python bytecode interpreter heavily. A first step would be to see if the generated bytecode has changed substantially. Otherwise, you can try to comment out parts of the function until the performance difference has been nullified. Regards Antoine. From ethan at stoneleaf.us Tue Jul 18 12:27:13 2017 From: ethan at stoneleaf.us (Ethan Furman) Date: Tue, 18 Jul 2017 09:27:13 -0700 Subject: [Python-Dev] Design Philosophy: Performance vs Robustness/Maintainability In-Reply-To: <20170718181630.0fe90f85@fsol> References: <596E3268.4070705@stoneleaf.us> <20170718181630.0fe90f85@fsol> Message-ID: <596E36E1.2020506@stoneleaf.us> On 07/18/2017 09:16 AM, Antoine Pitrou wrote: > On Tue, 18 Jul 2017 09:08:08 -0700 > Ethan Furman wrote: >> >> Nick Coughlan: >> ------------- > > It is "Nick Coghlan" not "Coughlan". Argh. Sorry, Nick, and thank you, Antoine! 
>>> As another example of this: while trading the global import lock for >>> per-module locks eliminated most of the old import deadlocks, it turns >>> out that it *also* left us with some fairly messy race conditions and >>> more fragile code (I still count that particular case as a win >>> overall, but it definitely raises the barrier to entry for maintaining >>> that code). >>> >>> Unfortunately, these are frequently cases where the benefits are >>> immediately visible (e.g. faster benchmark results, removing >>> longstanding limitations on user code), but the downsides can >>> literally take years to make themselves felt (e.g. higher defect rates >>> in the interpreter, subtle bugs in previously correct user code that >>> are eventually traced back to interpreter changes). > > I'll reply here again: the original motivation for the per-module > import lock was not performance but correctness. I meant that as an example of the dangers of increased code complexity. -- ~Ethan~ From brett at python.org Tue Jul 18 18:00:44 2017 From: brett at python.org (Brett Cannon) Date: Tue, 18 Jul 2017 22:00:44 +0000 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <0DC5C68F-953D-493A-BD18-6D77D2C047CA@gmail.com> <20170718002143.GZ3149@ando.pearwood.info> Message-ID: On Mon, 17 Jul 2017 at 19:26 Nathaniel Smith wrote: > On Jul 17, 2017 5:28 PM, "Steven D'Aprano" wrote: > > On Mon, Jul 17, 2017 at 09:31:20PM +0000, Brett Cannon wrote: > > > As for removing exec() as a goal, I'll back up Christian's point and the > > one Steve made at the language summit that removing the use of exec() > from > > the critical path in Python is a laudable goal from a security > perspective. > > I'm sorry, I don't understand this point. What do you mean by "critical > path"? > > Is the intention to remove exec from builtins? From the entire language? > If not, how does its use in namedtuple introduce a security problem? > > > I think the intention is to allow users with a certain kind of security > requirement to opt in to a restricted version of the language that doesn't > support exec. This is difficult if the stdlib is calling exec all over the > place. But nobody is suggesting to change the language in regular usage, > just provide another option. > What Nathaniel said. :) -------------- next part -------------- An HTML attachment was scrubbed... URL: From larry at hastings.org Tue Jul 18 18:22:16 2017 From: larry at hastings.org (Larry Hastings) Date: Tue, 18 Jul 2017 15:22:16 -0700 Subject: [Python-Dev] Impact of Namedtuple on startup time In-Reply-To: References: <20170717144319.7fdbf64b@fsol> <4d161c7b-87cf-7d78-6967-07be1c584591@python.org> <0DC5C68F-953D-493A-BD18-6D77D2C047CA@gmail.com> <20170718002143.GZ3149@ando.pearwood.info> Message-ID: <484e070d-633a-35c0-7f04-756805568dad@hastings.org> On 07/17/2017 07:25 PM, Nathaniel Smith wrote: > I think the intention is to allow users with a certain kind of > security requirement to opt in to a restricted version of the language > that doesn't support exec. This is difficult if the stdlib is calling > exec all over the place. But nobody is suggesting to change the > language in regular usage, just provide another option. An anecdote about removing exec(). Back in 2012 I interviewed Kristjan Valur Jonsson, then of CCP Games, for my podcast Radio Free Python. 
He said that due to memory constraints they'd had to remove the compiler from the Playstation 3 build of Python for some game project. This meant that namedtuple didn't work, which had knock-on effects for other bits of the standard library So security concerns aren't the only reason for removing the compiler, //arry/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From brett at python.org Tue Jul 18 18:59:58 2017 From: brett at python.org (Brett Cannon) Date: Tue, 18 Jul 2017 22:59:58 +0000 Subject: [Python-Dev] Design Philosophy: Performance vs Robustness/Maintainability In-Reply-To: <596E3268.4070705@stoneleaf.us> References: <596E3268.4070705@stoneleaf.us> Message-ID: On Tue, 18 Jul 2017 at 09:07 Ethan Furman wrote: > Raymond Hettinger: > ----------------- > > One minor grumble: I think we need to give careful cost/benefit > considerations to > > optimizations that complicate the implementation. Over the last > several years, the > > source for Python has grown increasingly complicated. Fewer people > understand it > > now. It is much harder to newcomers to on-ramp. The old-timers (myself > included) > > find that their knowledge is out of date. And complexity leads to bugs > (the C > > optimization of random number seeding caused a major bug in the 3.6.0 > release; the > > C optimization of the lru_cache resulted in multiple releases having a > hard to find > > threading bugs, etc.). It is becoming increasingly difficult to look > at code and > > tell whether it is correct (I still don't fully understand the > implications of the > > recursive constant folding in the peephole optimizer for example). > In the case > > of this named tuple proposal, the complexity is manageable, but the > overall trend > > isn't good and I get the feeling the aggressive optimization is causing > us to > > forget key parts of the zen-of-python. > > Nick Coghlan: > ------------- > > As another example of this: while trading the global import lock for > > per-module locks eliminated most of the old import deadlocks, it turns > > out that it *also* left us with some fairly messy race conditions and > > more fragile code (I still count that particular case as a win > > overall, but it definitely raises the barrier to entry for maintaining > > that code). > > > > Unfortunately, these are frequently cases where the benefits are > > immediately visible (e.g. faster benchmark results, removing > > longstanding limitations on user code), but the downsides can > > literally take years to make themselves felt (e.g. higher defect rates > > in the interpreter, subtle bugs in previously correct user code that > > are eventually traced back to interpreter changes). > > Barry Warsaw: > ------------ > > Regardless of whether [namedtuple] optimization is a good idea or not, > start up > > time *is* a serious challenge in many environments for CPython in > particular and > > the perception of Python?s applicability to many problems. I think > we?re better > > off trying to identify and address such problems than ignoring or > minimizing them. > > Ethan Furman: > ------------ > Speed is not the only factor, and certainly shouldn't be the first > concern, but once > we have correct code we need to follow our own advice: find the > bottlenecks and optimize > them. Optimized code will never be as pretty or maintainable as simple, > unoptimized > code but real-world applications often require as much performance as can > be obtained. 
> For me it's a balance based on how critical the code is and how complicated the code will become long-term. I think between Victor and me we maybe have 1 person/week of paid work time on CPython and the rest is volunteer time, so there always has to be some consideration as to whether maintenance will become untenable long-term (this is why complex is better than complicated pretty much no matter what). In namedtuple's case, Raymond designed something that was useful with an elegant solution. Unfortunately namedtuple is a victim of its own success and became a bottleneck when it came to startup time in apps that used it extensively as well as being a sticking point for anyone who wanted to askew exec(). So now we're keeping the usefulness/API design aspect and are being pragmatic about the fact that we want to rework the elegant design to be computationally cheaper so it's no longer an obvious performance penalty at app startup for people who use it a lot. And so now the work is trying to balance the pragmatic performance aspect with the long-term maintenance aspect. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Tue Jul 18 21:35:20 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 19 Jul 2017 11:35:20 +1000 Subject: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? In-Reply-To: <20170718181806.14fed37f@fsol> References: <20170718181806.14fed37f@fsol> Message-ID: On 19 July 2017 at 02:18, Antoine Pitrou wrote: > On Tue, 18 Jul 2017 12:03:36 -0400 > Ben Hoyt wrote: >> The program is a pentomino puzzle solver, and it works via code generation, >> generating a ton of nested "if" statements, so I believe it's exercising >> the Python bytecode interpreter heavily. > > A first step would be to see if the generated bytecode has changed > substantially. Scanning over them, the Python 2.7 bytecode appears to have many more JUMP_FORWARD and JUMP_ABSOLUTE opcodes than appear in the 3.6 version (I didn't dump them into a Counter instance to tally them properly though, since 2.7's dis module is missing the structured opcode iteration APIs). With the shift to wordcode, the overall size of the bytecode is also significantly *smaller*: >>> len(co.co_consts[0].co_code) # 2.7 14427 >>> len(co.co_consts[0].co_code) # 3.6 11850 However, I'm not aware of any Python profilers that currently offer opcode level profiling - the closest would probably be VMProf's JIT profiling, and that aspect of VMProf is currently PyPy specific (although could presumably be extended to CPython 3.6+ by way of the opcode evaluation hook). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From benhoyt at gmail.com Tue Jul 18 21:41:38 2017 From: benhoyt at gmail.com (Ben Hoyt) Date: Tue, 18 Jul 2017 21:41:38 -0400 Subject: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? In-Reply-To: References: <20170718181806.14fed37f@fsol> Message-ID: Thanks, Nick -- that's interesting. I just saw the extra JUMP_FORWARD and JUMP_ABSOLUTE instructions on my commute home (I guess those are something Python 3.x optimizes away). VERY strangely, on Windows Python 2.7 is faster! Comparing 64-bit Python 2.7.12 against Python 3.5.3 on my Windows 10 laptop: * Python 2.7.12: 4.088s * Python 3.5.3: 5.792s I'm pretty sure MSVC/Windows doesn't support computed gotos, but that doesn't explain why 3.5 is so much faster than 2.7 on Mac. I have yet to try it on Linux. 
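In case it's useful to anyone else poking at this, here's a quick (untested) way to tally the opcodes Nick mentioned -- it needs dis.get_instructions(), so Python 3.4+ only; on 2.7 you'd have to scrape dis.dis() output instead:

    import dis
    from collections import Counter

    with open("generated_solve.py") as f:
        code = compile(f.read(), "generated_solve.py", "exec")

    # Same code object Nick sized up with len(co.co_consts[0].co_code).
    solver_code = code.co_consts[0]
    counts = Counter(ins.opname for ins in dis.get_instructions(solver_code))
    print(counts.most_common(10))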
-Ben On Tue, Jul 18, 2017 at 9:35 PM, Nick Coghlan wrote: > On 19 July 2017 at 02:18, Antoine Pitrou wrote: > > On Tue, 18 Jul 2017 12:03:36 -0400 > > Ben Hoyt wrote: > >> The program is a pentomino puzzle solver, and it works via code > generation, > >> generating a ton of nested "if" statements, so I believe it's exercising > >> the Python bytecode interpreter heavily. > > > > A first step would be to see if the generated bytecode has changed > > substantially. > > Scanning over them, the Python 2.7 bytecode appears to have many more > JUMP_FORWARD and JUMP_ABSOLUTE opcodes than appear in the 3.6 version > (I didn't dump them into a Counter instance to tally them properly > though, since 2.7's dis module is missing the structured opcode > iteration APIs). > > With the shift to wordcode, the overall size of the bytecode is also > significantly *smaller*: > > >>> len(co.co_consts[0].co_code) # 2.7 > 14427 > > >>> len(co.co_consts[0].co_code) # 3.6 > 11850 > > However, I'm not aware of any Python profilers that currently offer > opcode level profiling - the closest would probably be VMProf's JIT > profiling, and that aspect of VMProf is currently PyPy specific > (although could presumably be extended to CPython 3.6+ by way of the > opcode evaluation hook). > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > benhoyt%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Jul 18 21:59:54 2017 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 18 Jul 2017 18:59:54 -0700 Subject: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? In-Reply-To: References: Message-ID: I'd probably start with a regular C-level profiler, like perf or callgrind. They're not very useful for comparing two versions of code written in Python, but here the Python code is the same (modulo changes in the stdlib), and it's changes in the interpreter's C code that probably make the difference. On Tue, Jul 18, 2017 at 9:03 AM, Ben Hoyt wrote: > Hi folks, > > (Not entirely sure this is the right place for this question, but hopefully > it's of interest to several folks.) > > A few days ago I posted a note in response to Victor Stinner's articles on > his CPython contributions, noting that I wrote a program that ran in 11.7 > seconds on Python 2.7, but only takes 5.1 seconds on Python 3.5 (on my 2.5 > GHz macOS i7), more than 2x as fast. Obviously this is a Good Thing, but I'm > curious as to why there's so much difference. > > The program is a pentomino puzzle solver, and it works via code generation, > generating a ton of nested "if" statements, so I believe it's exercising the > Python bytecode interpreter heavily. Obviously there have been some big > optimizations to make this happen, but I'm curious what the main > improvements are that are causing this much difference. > > There's a writeup about my program here, with benchmarks at the bottom: > http://benhoyt.com/writings/python-pentomino/ > > This is the generated Python code that's being exercised: > https://github.com/benhoyt/python-pentomino/blob/master/generated_solve.py > > For reference, on Python 3.6 it runs in 4.6 seconds (same on Python 3.7 > alpha). 
This smallish increase from Python 3.5 to Python 3.6 was more > expected to me due to the bytecode changing to wordcode in 3.6. > > I tried using cProfile on both Python versions, but that didn't say much, > because the functions being called aren't taking the majority of the time. > How does one benchmark at a lower level, or otherwise explain what's going > on here? > > Thanks, > Ben > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/njs%40pobox.com > -- Nathaniel J. Smith -- https://vorpus.org From victor.stinner at gmail.com Wed Jul 19 08:59:52 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Wed, 19 Jul 2017 14:59:52 +0200 Subject: [Python-Dev] Python startup time Message-ID: Hi, On Twitter, Raymond Hettinger wrote: "The decision making process on Python-dev is an anti-pattern, governed by anecdotal data and ambiguity over what problem is solved." https://twitter.com/raymondh/status/887069454693158912 About "anecdotal data", I would like to discuss the Python startup time. == Python 3.7 compared to 2.7 == First of all, on speed.python.org, we have: * Python 2.7: 6.4 ms with site, 3.0 ms without site (-S) * master (3.7): 14.5 ms with site, 8.4 ms without site (-S) Python 3.7 startup time is 2.3x slower with site (default mode), or 2.8x slower without site (-S command line option). (I will skip Python 3.4, 3.5 and 3.6 which are much worse than Python 3.7...) So if a user complained about Python 2.7 startup time: be prepared for a 2x - 3x more angry user when "forced" to upgrade to Python 3! == Mercurial vs Git, Python vs C, startup time == Startup time matters a lot for Mercurial since Mercurial is compared to Git. Git and Mercurial have similar features, but Git is written in C whereas Mercurial is written in Python. Quick benchmark on the speed.python.org server: * hg version: 44.6 ms +- 0.2 ms * git --version: 974 us +- 7 us Mercurial startup time is already 45.8x slower than Git, even though the tested Mercurial runs on Python 2.7.12. Now try to sell Python 3 to Mercurial developers, with a startup time 2x - 3x slower... I tested Mercurial 3.7.3 and Git 2.7.4 on Ubuntu 16.04.1 using "python3 -m perf command -- ...". == CPython core developers don't care? no, they do care == Christian Heimes, Naoki INADA, Serhiy Storchaka, Yury Selivanov, me (Victor Stinner) and other core developers made multiple changes over the last few years to reduce the number of imports at startup, optimize importlib, etc. IMHO all these core developers are well aware of the competition between programming languages, and honestly Python startup time isn't "good". So let's compare it to other programming languages similar to Python. == PHP, Ruby, Perl == I measured the startup time of other programming languages which are similar to Python, still on the speed.python.org server using "python3 -m perf command -- ...": * perl -e ' ': 1.18 ms +- 0.01 ms * php -r ' ': 8.57 ms +- 0.05 ms * ruby -e ' ': 32.8 ms +- 0.1 ms Wow, Perl is quite good! PHP seems as good as Python 2 (but Python 3 is worse). Ruby startup time seems less optimized than other languages. Tested versions: * perl 5, version 22, subversion 1 (v5.22.1) * PHP 7.0.18-0ubuntu0.16.04.1 (cli) ( NTS ) * ruby 2.3.1p112 (2016-04-26) [x86_64-linux-gnu] == Quick Google search == I also searched for "python startup time" and "python slow startup time" on Google and found many articles.
Some examples: "Reducing the Python startup time" http://www.draketo.de/book/export/html/498 => "The python startup time always nagged me (17-30ms) and I just searched again for a way to reduce it, when I found this: The Python-Launcher caches GTK imports and forks new processes to reduce the startup time of python GUI programs." https://nelsonslog.wordpress.com/2013/04/08/python-startup-time/ => "Wow, Python startup time is worse than I thought." "How to speed up python starting up and/or reduce file search while loading libraries?" https://stackoverflow.com/questions/15474160/how-to-speed-up-python-starting-up-and-or-reduce-file-search-while-loading-libra => "The first time I log to the system and start one command it takes 6 seconds just to show a few line of help. If I immediately issue the same command again it takes 0.1s. After a couple of minutes it gets back to 6s. (proof of short-lived cache)" "How does one optimise the startup of a Python script/program?" https://www.quora.com/How-does-one-optimise-the-startup-of-a-Python-script-program => "I wrote a Python program that would be used very often (imagine 'cd' or 'ls') for very short runtimes, how would I make it start up as fast as possible?" "Python Interpreter Startup time" https://bytes.com/topic/python/answers/34469-pyhton-interpreter-startup-time "Python is very slow to start on Windows 7" https://stackoverflow.com/questions/29997274/python-is-very-slow-to-start-on-windows-7 => "Python takes 17 times longer to load on my Windows 7 machine than Ubuntu 14.04 running on a VM" => "returns in 0.614s on Windows and 0.036s on Linux" "How to make a fast command line tool in Python" (old article Python 2.5.2) https://files.bemusement.org/talks/OSDC2008-FastPython/ => "(...) some techniques Bazaar uses to start quickly, such as lazy imports." -- So please continue efforts for make Python startup even faster to beat all other programming languages, and finally convince Mercurial to upgrade ;-) Victor From phd at phdru.name Wed Jul 19 09:22:11 2017 From: phd at phdru.name (Oleg Broytman) Date: Wed, 19 Jul 2017 15:22:11 +0200 Subject: [Python-Dev] Python startup time In-Reply-To: References: Message-ID: <20170719132211.GA30782@phdru.name> On Wed, Jul 19, 2017 at 02:59:52PM +0200, Victor Stinner wrote: > "Python is very slow to start on Windows 7" > https://stackoverflow.com/questions/29997274/python-is-very-slow-to-start-on-windows-7 However hard you are going to optimize Python you cannot fix those "defenders", "guards" and "protectors". :-) This particular link can be excluded from consideration. Oleg. -- Oleg Broytman http://phdru.name/ phd at phdru.name Programmers don't die, they just GOSUB without RETURN. From ncoghlan at gmail.com Wed Jul 19 10:05:50 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 20 Jul 2017 00:05:50 +1000 Subject: [Python-Dev] Python startup time In-Reply-To: References: Message-ID: On 19 July 2017 at 22:59, Victor Stinner wrote: > == CPython core developers don't care? no, they do care == > > Christian Heimes, Naoki INADA, Serhiy Storchaka, Yury Selivanov, me > (Victor Stinner) and other core developers made multiple changes last > years to reduce the number of imports at startup, optimize impotlib, > etc. I actually also care myself, since interpreter startup time feeds directly into cost of execution when running in environments like AWS Lambda which charge by the "gigabyte second" (i.e. 
you allocate a certain amount of RAM to a particular command, and then get charged for that RAM for the amount of time it takes to run, as measured with subsecond precision - if you exceed the limits of the free tier, anything you 're losing to language runtime startup in such an environment translates almost directly to higher costs). In aggregate, shaving time off CPython startup saves *scary* amounts of collective compute time around the world - even though most runtime environments don't track that as closely in financial terms as Lambda does, we're still nudging the power & cooling requirements of data centers slightly higher than they would otherwise be. So even when the per-invocation impact of a performance improvement is small, it's worth keeping in mind that CPython gets invoked a *lot*, whether it's to respond to a web request, run a test, run a build, deploy another application, analyse some data, etc :) However, I'm also of the view that module & API maintainers *do* have the authority to set the design priorities for the parts of the standard library that they're personally responsible for, and if we'd like them to change their minds based on information we have that they don't, then reopening enhancement requests that they already closed is *not* the way to go about it (as while the issue tracker is an excellent venue for figuring out the technical details of a change, or deciding whether or not an RFE is a good idea given a common understanding of the relevant design priorities, it's almost always a *terrible* venue for resolving outright disagreements as to what the most relevant design priorities actually are). Rather, the best available way to publicly request reconsideration is the way Antoine did when he escalated the namedtuple question to python-dev: by explicitly acknowledging that there's a conflict in design priorities between core developers, and asking for a collective discussion (and potentially a determination from Guido) as to the right way forward for the project as a whole. Cheers, Nick. P.S. I'll also note that we're not *actually* limited to resolving such conflicts in public venues (even though I think that's a good default habit for us to retain): as long as we report the outcome of any mutual agreements about design priorities back to the relevant public venue (e.g. a tracker issue), there's nothing wrong with shifting our attempts to better understand each other's perspectives to private email, IRC, video chat, etc. A non-trivial number of previously vociferous arguments have been resolved amicably once the main parties involved have had a chance to discuss them in person at a conference or sprint. 
It can even make sense to reach out to other core devs for help, since it's almost always easier for someone not caught in the midst of an argument to see both sides of it, and potentially spot a core of agreement amidst various surface level disagreements :) -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From victor.stinner at gmail.com Wed Jul 19 10:26:56 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Wed, 19 Jul 2017 16:26:56 +0200 Subject: [Python-Dev] Python startup time In-Reply-To: <20170719132211.GA30782@phdru.name> References: <20170719132211.GA30782@phdru.name> Message-ID: 2017-07-19 15:22 GMT+02:00 Oleg Broytman : > On Wed, Jul 19, 2017 at 02:59:52PM +0200, Victor Stinner wrote: >> "Python is very slow to start on Windows 7" >> https://stackoverflow.com/questions/29997274/python-is-very-slow-to-start-on-windows-7 > > However hard you are going to optimize Python you cannot fix those > "defenders", "guards" and "protectors". :-) This particular link can be > excluded from consideration. Sorry, I didn't read carefully each link I posted. Even for me knowing what Python does at startup, it's hard to explain why 3 people have different timing: 15 ms, 75 ms and 300 ms for example. In my experience, the following things impact Python startup: * -S option: loading or not the site module * Paths in sys.path: PYTHONPATH environment variable for example * .pth files files in sys.path * Python running in a virtual environment or not * Operating system: Python loads different modules at startup depending on the OS. Naoki INADA just removed _osx_support from being imported in the site module on macOS for example. My list is likely incomplete. In the performance benchmark suite, a controlled virtual environment is created to have a known set of modules. FYI running Python is a virtual environment is slower than "system" python which runs outside a virtual environment... Victor From solipsis at pitrou.net Wed Jul 19 11:13:08 2017 From: solipsis at pitrou.net (Antoine Pitrou) Date: Wed, 19 Jul 2017 17:13:08 +0200 Subject: [Python-Dev] anecdotal data References: Message-ID: <20170719171308.0d50ee6a@fsol> On Wed, 19 Jul 2017 14:59:52 +0200 Victor Stinner wrote: > Hi, > > On Twitter, Raymond Hettinger wrote: > > "The decision making process on Python-dev is an anti-pattern, > governed by anecdotal data and ambiguity over what problem is solved." > > https://twitter.com/raymondh/status/887069454693158912 > > About "anecdotal data", I would like to discuss the Python startup time. And I would like to step back and examine the general criticism of "anecdotal data". Large software and hardware companies have the resources to conduct comprehensive surveys of how people use their products. For example, Intel might have accumulated millions of traces of critical production x86 code that they want to keep running efficiently (or even keep running at all). Apple might have thousands of third-party applications which they can simulate running on a newer version of whatever OS, core library or pieces of hardware those applications rely on. Even Google may nowadays have hundreds or thousands of critical services written in Go, and they may be able to assess the effect of further changes of the Go runtime on those services (not sure they do, but they would certainly have the resources to). CPython is a comparatively small, disorganized and volunteer-based community. It doesn't have the resources or organization required to lead such studies on a regular basis. 
Chances are it will never have. So all we can rely on is 1) our
respective individual experiences in the field 2) anecdotal data.

When we rewrote the Python 3 IO stack in C, we were relying on our
intuition that high-performance IO is important, and on anecdotal data
(micro-benchmarks) that the pure Python IO stack is slow. When Tim or
Raymond tweak the lookup function for dicts, they rely on anecdotal data
delivered by a few select micro-benchmarks, and their intuition that some
use cases need to be fast (for example dicts with string keys or keys made
up of consecutive integers). We don't have any hard data that all those
optimizations are necessary for the majority of Python applications. I
don't think anybody in the world has statistically sound data about the
entire body of Python code, or even a sufficiently large and relevant
subset thereof (such as "Python code used in production for critical
services").

We aren't scientists. We are engineers and have to make do with whatever
anecdotes we are aware of (be they from our own experiences, or users'
complaints). We can't just say "yes, there seems to be a performance
issue, but I'll wait until we have non-anecdotal data that it's
important". Because that day will probably never come, and in the
meantime our users will have fled elsewhere.

Regards

Antoine.

From solipsis at pitrou.net  Wed Jul 19 11:20:56 2017
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Wed, 19 Jul 2017 17:20:56 +0200
Subject: [Python-Dev] [OT] Twitter echo chamber (Python startup time)
References:
Message-ID: <20170719172056.19e392af@fsol>

On Wed, 19 Jul 2017 14:59:52 +0200
Victor Stinner wrote:
> Hi,
>
> On Twitter, Raymond Hettinger wrote:
>
> "The decision making process on Python-dev is an anti-pattern,
> governed by anecdotal data and ambiguity over what problem is solved."
>
> https://twitter.com/raymondh/status/887069454693158912

Kind-of OT: while I understand (and have sometimes felt myself) the
desire to vent frustration about a decision one doesn't agree with,
there should be *at least* a link to the discussion alluded to so that
readers can make up their own minds. Otherwise, it feels to me like any
disagreement here may end up chastised on Twitter by some influential
figure of authority. That's not a pleasant place to be in.

Regards

Antoine.

From gvanrossum at gmail.com  Wed Jul 19 11:57:42 2017
From: gvanrossum at gmail.com (Guido van Rossum)
Date: Wed, 19 Jul 2017 08:57:42 -0700
Subject: [Python-Dev] anecdotal data
In-Reply-To: <20170719171308.0d50ee6a@fsol>
References: <20170719171308.0d50ee6a@fsol>
Message-ID:

Exactly. This is how Python came to be in the first place. Benchmarks are
great, but don't underestimate creativity.

On Jul 19, 2017 8:15 AM, "Antoine Pitrou" wrote:

On Wed, 19 Jul 2017 14:59:52 +0200
Victor Stinner wrote:
> Hi,
>
> On Twitter, Raymond Hettinger wrote:
>
> "The decision making process on Python-dev is an anti-pattern,
> governed by anecdotal data and ambiguity over what problem is solved."
>
> https://twitter.com/raymondh/status/887069454693158912
>
> About "anecdotal data", I would like to discuss the Python startup time.

And I would like to step back and examine the general criticism of
"anecdotal data".

Large software and hardware companies have the resources to conduct
comprehensive surveys of how people use their products. For example,
Intel might have accumulated millions of traces of critical production
x86 code that they want to keep running efficiently (or even keep
running at all).
Apple might have thousands of third-party applications which they can simulate running on a newer version of whatever OS, core library or pieces of hardware those applications rely on. Even Google may nowadays have hundreds or thousands of critical services written in Go, and they may be able to assess the effect of further changes of the Go runtime on those services (not sure they do, but they would certainly have the resources to). CPython is a comparatively small, disorganized and volunteer-based community. It doesn't have the resources or organization required to lead such studies on a regular basis. Chances are it will never have. So all we can rely on is 1) our respective individual experiences in the field 2) anecdotal data. When we rewrote the Python 3 IO stack in C, we were relying on our intuition that high-performance IO is important, and on anecdotal data (micro-benchmarks) that the pure Python IO stack is slow. When Tim or Raymond tweak the lookup function for dicts, they rely on anecdotal data delivered by a few select micro-benchmarks, and their intuition that some use cases need to be fast (for example dicts with string keys or keys made up of consecutive integers). We don't have any hard data that all those optimizations are necessary for the majority of Python applications. I don't think anybody in the world has statistically sound data about the entire body of Python code, or even a sufficiently large and relevant subset thereof (such as "Python code used in production for critical services"). We aren't scientists. We are engineers and have to make with whatever anecdotes we are aware of (be they from our own experiences, or users' complaints). We can't just say "yes, there seems be a performance issue, but I'll wait until we have non-anecdotal data that it's important". Because that day will probably never come, and in the meantime our users will have fled elsewhere. Regards Antoine. _______________________________________________ Python-Dev mailing list Python-Dev at python.org https://mail.python.org/mailman/listinfo/python-dev Unsubscribe: https://mail.python.org/mailman/options/python-dev/ guido%40python.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From larry at hastings.org Wed Jul 19 15:15:46 2017 From: larry at hastings.org (Larry Hastings) Date: Wed, 19 Jul 2017 12:15:46 -0700 Subject: [Python-Dev] Python startup time In-Reply-To: References: Message-ID: <12e547e7-de6f-2dca-d3fe-47b63e108a8b@hastings.org> On 07/19/2017 05:59 AM, Victor Stinner wrote: > Mercurial startup time is already 45.8x slower than Git whereas tested > Mercurial runs on Python 2.7.12. Now try to sell Python 3 to Mercurial > developers, with a startup time 2x - 3x slower... When Matt Mackall spoke at the Python Language Summit some years back, I recall that he specifically complained about Python startup time. He said Python 3 "didn't solve any problems for [them]"--they'd already solved their Unicode hygiene problems--and that Python's slow startup time was already a big problem for them. Python 3 being /even slower/ to start was absolutely one of the reasons why they didn't want to upgrade. You might think "what's a few milliseconds matter". But if you run hundreds of commands in a shell script it adds up. git's speed is one of the few bright spots in its UX, and hg's comparative slowness here is a palpable disadvantage. 
> So please continue efforts for make Python startup even faster to beat > all other programming languages, and finally convince Mercurial to > upgrade ;-) I believe Mercurial is, finally, slowly porting to Python 3. https://www.mercurial-scm.org/wiki/Python3 Nevertheless, I can't really be annoyed or upset at them moving slowly to adopt Python 3, as Matt's objections were entirely legitimate. Cheers, //arry/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From benhoyt at gmail.com Wed Jul 19 15:26:47 2017 From: benhoyt at gmail.com (Ben Hoyt) Date: Wed, 19 Jul 2017 15:26:47 -0400 Subject: [Python-Dev] Python startup time In-Reply-To: <12e547e7-de6f-2dca-d3fe-47b63e108a8b@hastings.org> References: <12e547e7-de6f-2dca-d3fe-47b63e108a8b@hastings.org> Message-ID: Yes, agreed that startup time matters for scripting. I was talking to someone on the Google Cloud SDK (CLI) team recently, and they said startup time is a big deal for them ... it's especially problematic for shell tab completion helpers, because every time you press tab the shell has to load your Python program to do the completion. Even a couple dozen milliseconds is noticeable when you're typing quickly. -Ben On Wed, Jul 19, 2017 at 3:15 PM, Larry Hastings wrote: > > > On 07/19/2017 05:59 AM, Victor Stinner wrote: > > Mercurial startup time is already 45.8x slower than Git whereas tested > Mercurial runs on Python 2.7.12. Now try to sell Python 3 to Mercurial > developers, with a startup time 2x - 3x slower... > > > When Matt Mackall spoke at the Python Language Summit some years back, I > recall that he specifically complained about Python startup time. He said > Python 3 "didn't solve any problems for [them]"--they'd already solved > their Unicode hygiene problems--and that Python's slow startup time was > already a big problem for them. Python 3 being *even slower* to start > was absolutely one of the reasons why they didn't want to upgrade. > > You might think "what's a few milliseconds matter". But if you run > hundreds of commands in a shell script it adds up. git's speed is one of > the few bright spots in its UX, and hg's comparative slowness here is a > palpable disadvantage. > > > So please continue efforts for make Python startup even faster to beat > all other programming languages, and finally convince Mercurial to > upgrade ;-) > > > I believe Mercurial is, finally, slowly porting to Python 3. > > https://www.mercurial-scm.org/wiki/Python3 > > Nevertheless, I can't really be annoyed or upset at them moving slowly to > adopt Python 3, as Matt's objections were entirely legitimate. > > > Cheers, > > > */arry* > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > benhoyt%40gmail.com > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tjreedy at udel.edu Wed Jul 19 16:02:09 2017 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 19 Jul 2017 16:02:09 -0400 Subject: [Python-Dev] Python most popular progamming language on github? Message-ID: https://blog.sourced.tech/post/language_migrations/ Waren Long analyzed several years of Github data for 22 top languages (excluding browser Javascript) with respect to language use and change of use, defined a 'centrality measure' based on the stationary distribution of a markov chain model of language switching. 
Time trend: Python rose from about 2002 to 2007, stayed flat until 2013,
then has risen since.

Conclusion: The Python sky is not falling; Python3 did not kill Python.
(This is not a call for complacency.)

The measure is *not* based on lines of code. The 4 after Python (Java, C,
C++, and PHP) have more lines on Github. Well, we all know that
non-cryptic conciseness is good ;-).

Java has at least doubled since 2007. Perhaps that is mostly portable
Android devices.

This analysis was stimulated (provoked?) by Erik Bernhardsson's analysis
of Google searches related to changing language. Go won in that. It seems
that people learn and use Python without asking Google so much.

--
Terry Jan Reedy

From solipsis at pitrou.net  Wed Jul 19 16:35:16 2017
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Wed, 19 Jul 2017 22:35:16 +0200
Subject: [Python-Dev] Python startup time
References: <12e547e7-de6f-2dca-d3fe-47b63e108a8b@hastings.org>
Message-ID: <20170719223516.51db2c00@fsol>

On Wed, 19 Jul 2017 15:26:47 -0400
Ben Hoyt wrote:
> Yes, agreed that startup time matters for scripting. I was talking to
> someone on the Google Cloud SDK (CLI) team recently, and they said startup
> time is a big deal for them ... it's especially problematic for shell tab
> completion helpers, because every time you press tab the shell has to load
> your Python program to do the completion.

And also, for the same reason, for shell prompt additions such as
git-prompt. Mercurial had to write a C client (chg) to make this usable.

Regards

Antoine.

From chris.barker at noaa.gov  Wed Jul 19 19:11:24 2017
From: chris.barker at noaa.gov (Chris Barker)
Date: Wed, 19 Jul 2017 16:11:24 -0700
Subject: [Python-Dev] Python startup time
In-Reply-To: <20170719223516.51db2c00@fsol>
References: <12e547e7-de6f-2dca-d3fe-47b63e108a8b@hastings.org>
 <20170719223516.51db2c00@fsol>
Message-ID:

As long as we are talking anecdotes:

If it could save a person's life, could you find a way to save ten seconds
off the boot time? If there were five million people using the Mac, and it
took ten seconds extra to turn it on every day, that added up to three
hundred million or so hours per year people would save, which was the
equivalent of at least one hundred lifetimes saved per year.

Steve Jobs.

(http://stevejobsdailyquote.com/2014/03/26/boot-time/)

It really does depend on how/what users are using Python for. In general,
Python has been moving more and more toward a "systems development
language" from a "scripting language". Which may make us think "scripting"
issues like startup time don't matter -- but, of course, they matter a lot
to those use cases.

-CHB

On Wed, Jul 19, 2017 at 1:35 PM, Antoine Pitrou wrote:

> On Wed, 19 Jul 2017 15:26:47 -0400
> Ben Hoyt wrote:
> > Yes, agreed that startup time matters for scripting. I was talking to
> > someone on the Google Cloud SDK (CLI) team recently, and they said
> startup
> > time is a big deal for them ... it's especially problematic for shell tab
> > completion helpers, because every time you press tab the shell has to
> load
> > your Python program to do the completion.
>
> And also, for the same reason, for shell prompt additions such as
> git-prompt. Mercurial had to write a C client (chg) to make this
> usable.
>
> Regards
>
> Antoine.
> > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > chris.barker%40noaa.gov > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve at pearwood.info Wed Jul 19 21:19:25 2017 From: steve at pearwood.info (Steven D'Aprano) Date: Thu, 20 Jul 2017 11:19:25 +1000 Subject: [Python-Dev] Python startup time In-Reply-To: References: <12e547e7-de6f-2dca-d3fe-47b63e108a8b@hastings.org> <20170719223516.51db2c00@fsol> Message-ID: <20170720011924.GH3149@ando.pearwood.info> On Wed, Jul 19, 2017 at 04:11:24PM -0700, Chris Barker wrote: > As long as we are talking anecdotes: > > If it could save a person?s life, could you find a way to save ten seconds > off the boot time? If there were five million people using the Mac, and it > took ten seconds extra to turn it on every day, that added up to three > hundred million or so hours per year people would save, which was the > equivalent of at least one hundred lifetimes saved per year. > > Steve Jobs. And about a fifth of the time they spent standing in lines waiting to buy the latest unnecessary iGadget... But seriously, that calculation is completely bogus. Not only is Steve Job's arithmetic *completely* wrong, but the whole premise is nonsense. Do the maths yourself: ten seconds per day is 3650 seconds in a year, which is slightly over an hour (3600 seconds). Multiply by five million users, that's about five million hours, not 300 million. So Jobs exaggerates the time saved by a factor of sixty. (Or maybe Jobs was warning that Macs crash sixty times a day...) But the premise is wrong too. Those hypothetical people don't turn their Macs on in sequence, each person turning their computer on only after the previous person's Mac had finished booting. They effectively boot them up in parallel but offset, spread out over a 24 hour period, so about 3472 people booting up at the same time each minute of the day. Time savings for parallel processes don't add in the way Jobs adds them, if we treat this as 1440 parallel processes (one per minute of the day) we save 1440 hours a year. But really, the only meaningful calculation is the each person saves 10 seconds per day. We can't even meaningfully say they save one hour a year: it doesn't come nicely packaged up for you all at once, so you can actually do something useful with it, nor can you save those ten seconds from one day to the next. You only get one shot at using them. What can you do with ten seconds per day? By the time you decide what to do with the extra time, it's already gone. There are good reasons for speeding up boot time, but this sort of calculation is not one of them. I think it is in particularly bad taste to exaggerate the significance of it by putting it in terms of saving lives. You want to save real lives? How about fixing the conditions in the sweatshops that make Apple phones? And installing suicide nets around the building doesn't count. 
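For the record, here is that arithmetic spelled out (a quick sketch, using
the same round numbers as the quote):

seconds_per_user_per_year = 10 * 365                # 3650 s, just over an hour
hours_per_user_per_year = seconds_per_user_per_year / 3600.0
total_person_hours = 5 * 10**6 * hours_per_user_per_year
print(round(hours_per_user_per_year, 2))        # ~1.01 hours per user per year
print(round(total_person_hours))                # ~5.07 million person-hours, not 300 million
print(round(300 * 10**6 / total_person_hours))  # Jobs' figure is too high by a factor of ~59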
-- Steve From schesis at gmail.com Wed Jul 19 22:40:47 2017 From: schesis at gmail.com (Zero Piraeus) Date: Wed, 19 Jul 2017 22:40:47 -0400 Subject: [Python-Dev] Python startup time In-Reply-To: <20170720011924.GH3149@ando.pearwood.info> References: <12e547e7-de6f-2dca-d3fe-47b63e108a8b@hastings.org> <20170719223516.51db2c00@fsol> <20170720011924.GH3149@ando.pearwood.info> Message-ID: : On 19 July 2017 at 21:19, Steven D'Aprano wrote: > But the premise is wrong too. Those hypothetical people don't turn their > Macs on in sequence, each person turning their computer on only after > the previous person's Mac had finished booting. They effectively boot > them up in parallel but offset, spread out over a 24 hour period, so > about 3472 people booting up at the same time each minute of the day. > Time savings for parallel processes don't add in the way Jobs adds them, > if we treat this as 1440 parallel processes (one per minute of the day) > we save 1440 hours a year. Ah, but the relevant unit here is person-hours, not hours: Jobs is claiming that *each* Mac user loses X% of *their* life to boot times, and then adds all those slices of life together into N lifetimes (which again, are counted in person-years, not years). It's still wrong, though: longer boot times actually increase the proportion of your life spent in meaningful activity (e.g. going to the canteen and talking to someone). -[]z. From ncoghlan at gmail.com Wed Jul 19 23:42:11 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 20 Jul 2017 13:42:11 +1000 Subject: [Python-Dev] Python most popular progamming language on github? In-Reply-To: References: Message-ID: On 20 July 2017 at 06:02, Terry Reedy wrote: > https://blog.sourced.tech/post/language_migrations/ > Waren Long analyzed several years of Github data for 22 top languages > (excluding browser Javascript) with respect to language use and change of > use, defined a 'centrality measure' based on the stationary distribution of > a markov chain model of language switching. > > Time trend: Python rose from about 2002 to 2007, stayed flat until 2013, > then has risen since. > > Conclusion: The Python sky is not falling; Python3 did not kill Python. > (This is not a call for complacency.) Folks may also be interested in this year's IEEE Spectrum language popularity analysis, which slots Python in at number 1 for the first time: http://spectrum.ieee.org/computing/software/the-2017-top-programming-languages Folks involved in the Python community (whether as educators, advocates, event organisers, developers, or otherwise) should take a lot of pride in that outcome, since making the reference interpreter available for use is only step one in the process of enabling real world adoption :) Cheers, Nick. P.S. One of the nice things about the IEEE analysis is that they list all of their data sources and the relative weight they ascribe to each one in determining their overall summary rankings, and then provide the ability to customise the weightings to come up with your own ranking. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From tjreedy at udel.edu Thu Jul 20 02:20:39 2017 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 20 Jul 2017 02:20:39 -0400 Subject: [Python-Dev] Python startup time In-Reply-To: References: Message-ID: On 7/19/2017 10:05 AM, Nick Coghlan wrote: > P.S. 
I'll also note that we're not *actually* limited to resolving
> such conflicts in public venues (even though I think that's a good
> default habit for us to retain): as long as we report the outcome of
> any mutual agreements about design priorities back to the relevant
> public venue (e.g. a tracker issue), there's nothing wrong with
> shifting our attempts to better understand each other's perspectives
> to private email, IRC, video chat, etc.

I expect and hope that there will be discussion of this issue at the core
developer sprint in September, with summary reports back here on
python-dev.

> It can even make sense to reach out to other
> core devs for help, since it's almost always easier for someone not
> caught in the midst of an argument to see both sides of it, and
> potentially spot a core of agreement amidst various surface level
> disagreements :)

I always understood the Python development process, both for core and
users, to be "Make it right; then make it faster", with the second clause
conditioned on 'while keeping it right' and maybe, especially for core
development, 'if significantly slow'. (People can rightly work on the
speed of personal code for other reasons.) I believe we pretty much agree
on the principles. The disagreement seems to be on whether a particular
case is 'significantly slow'.

I believe that the burden of proof is with those who propose a change.
The burden of proof depends on the final qualification: 'without adding
unnecessary or extreme complexity'. If there is no added complication,
the burden is slight. If not, we will likely disagree about complexity
and its tradeoff with speed.

About 'keeping it right': It has been mentioned that more complicated code
*generally* makes it harder to 'see' that the code is (basically) correct.
The second line of defense is the automated test suite. I think, for
instance, that someone interested in changing namedtuple (to a faster and
presumably more complicated implementation) should check the coverage of
the current code, with branches checked both ways. Then, bring the
coverage up to 100% if it is not already, and carefully check the tests
for possible missing cases. A small static set of test cases cannot cover
everything.

The third test of an implementation is accumulated user experience. A new
implementation starts at 0. One way to increase that is to test the
implementation with 3rd-party code. Another, I think, is through
randomized testing.

Proposal 1: Depending on our confidence in a new implementation, simulate
user experience with randomized tests, perhaps running for hours.
Example: we develop a random (unicode) identifier generator that starts
with any of the legal initial codepoints and continues with a random
number of legal follow codepoints. Then test the old and new namedtuple
with a random class name and a random number of random field names (a
rough sketch follows below, after the notes). A developer could also use
third-party packages, like hypothesis. Code and a summary could be
uploaded to bpo. A summary could even go in the code file.

Note 1: Tim Peters did something like this when developing timsort. He
provided a nice summary of test cases and time results.

Note 2: Randomized tests require that either a) randomized inputs are
verified by property or predicate, rather than by hard-coded values, or
b) inputs are generated from outputs, where either the output or the
inverse generation is randomized. Tests of sorting can use either
is_sorted(list(sorted(random_input))) or
list(sorted(random_shuffle(output))) == output.
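A rough sketch of what such a randomized namedtuple test could look like
(illustrative only: it draws ASCII-only identifiers for brevity, where a
fuller version would use all legal identifier codepoints, and the property
checks are examples rather than actual test-suite code):

import random
import string
from collections import namedtuple
from keyword import iskeyword

FIRST = string.ascii_letters
FOLLOW = string.ascii_letters + string.digits + "_"

def random_identifier(rng, max_follow=12):
    # ASCII-only for brevity; a fuller generator would also draw
    # non-ASCII identifier codepoints.
    name = rng.choice(FIRST) + "".join(
        rng.choice(FOLLOW) for _ in range(rng.randrange(max_follow)))
    return name + "_" if iskeyword(name) else name

def check_random_namedtuple(seed):
    rng = random.Random(seed)
    wanted = rng.randrange(1, 20)
    fields = []
    while len(fields) < wanted:
        name = random_identifier(rng)
        if name not in fields:
            fields.append(name)
    cls = namedtuple(random_identifier(rng), fields)
    values = [rng.random() for _ in fields]
    inst = cls(*values)
    # Verify by property/predicate rather than by hard-coded values
    # (option a in Note 2).
    assert list(inst) == values
    assert [getattr(inst, f) for f in fields] == values
    assert list(inst._asdict()) == fields

for seed in range(1000):
    check_random_namedtuple(seed)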
Proposal 2: Add randomized tests here and there in the test suite. Each randomized test x 30 buildbots x 2 runs/day x 365 days/year is about 22000 random inputs a year. Since each buildbot would be running a slightly different test, we need to act on and not ignore sporadic failures. Victor Stinner's buildbot work is making this feasible. -- Terry Jan Reedy -- Terry Jan Reedy From victor.stinner at gmail.com Thu Jul 20 04:56:43 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Thu, 20 Jul 2017 10:56:43 +0200 Subject: [Python-Dev] Python startup time In-Reply-To: References: Message-ID: Hi, I applied the patch above to count the number of times that Python is run. Running the Python test suite with "./python -m test -j0 -rW" runs Python 2,256 times. Honestly, I expected more. I'm running tests with Python compiled in debug mode. And in debug mode, Python startup time is much worse: haypo at selma$ python3 -m perf command --inherit=PYTHONPATH -v -- ./python -c pass command: Mean +- std dev: 46.4 ms +- 2.3 ms FYI I'm using gcc -O0 rather than -Og to make compilation even faster. Victor diff --git a/Lib/site.py b/Lib/site.py index 7dc1b04..4b0c167 100644 --- a/Lib/site.py +++ b/Lib/site.py @@ -540,6 +540,21 @@ def execusercustomize(): (err.__class__.__name__, err)) +def run_counter(): + import fcntl + + fd = os.open("/home/haypo/prog/python/master/run_counter", + os.O_WRONLY | os.O_CREAT | os.O_APPEND) + try: + fcntl.flock(fd, fcntl.LOCK_EX) + try: + os.write(fd, b'\x01') + finally: + fcntl.flock(fd, fcntl.LOCK_UN) + finally: + os.close(fd) + + def main(): """Add standard site-specific directories to the module search path. @@ -568,6 +583,7 @@ def main(): execsitecustomize() if ENABLE_USER_SITE: execusercustomize() + run_counter() # Prevent extending of sys.path when python was started with -S and # site is imported later. From levkivskyi at gmail.com Thu Jul 20 07:24:32 2017 From: levkivskyi at gmail.com (Ivan Levkivskyi) Date: Thu, 20 Jul 2017 13:24:32 +0200 Subject: [Python-Dev] Python startup time In-Reply-To: References: Message-ID: I agree the start-up time is important. There is something that is related. ABCMeta is currently implemented in Python. This makes it slow, creation of an ABC is 2x slower than creation of a normal class. However, ABCs are used by many medium and large size projects. Also, both abc and _collections_abc are imported at start-up (in particular importlib uses several ABCs, os also needs them for environments). Finally, all generics in typing module and user-defined generic types are ABCs (to allow interoperability with collections.abc). My idea is to re-implement ABCMeta (and ingredients it depends on, like WeakSet) in C. I didn't find such proposal on b.p.o., I have two questions: * Are there some potential problems with this idea (except that it may take some time and effort)? * Is it something worth doing as an optimization? (If answers are no and yes, then maybe I would spend part of my vacation in August on it.) -- Ivan -------------- next part -------------- An HTML attachment was scrubbed... URL: From songofacandy at gmail.com Thu Jul 20 08:29:18 2017 From: songofacandy at gmail.com (INADA Naoki) Date: Thu, 20 Jul 2017 21:29:18 +0900 Subject: [Python-Dev] Python startup time In-Reply-To: References: Message-ID: Hi, Ivan. First of all, Yes, please do it! On Thu, Jul 20, 2017 at 8:24 PM, Ivan Levkivskyi wrote: > I agree the start-up time is important. There is something that is related. > ABCMeta is currently implemented in Python. 
> This makes it slow, creation of an ABC is 2x slower than creation of a
> normal class.

Additionally, ABCs spread by inheritance: when people use a mix-in
provided by collections.abc, the class becomes an ABC even if it is a
concrete class. There is no documented/recommended way to inherit from an
ABC without using ABCMeta.

> However, ABCs are used by many medium and large size projects.

Many people with a background in other languages use ABCs like Java's
interfaces or abstract classes. So it may be worth having just an
"Abstract" marker, without the full ABC machinery. See
https://mail.python.org/pipermail/python-ideas/2017-July/046495.html

> Also, both abc and _collections_abc are imported at start-up (in particular
> importlib uses several ABCs, os also needs them for environments).
> Finally, all generics in typing module and user-defined generic types are
> ABCs (to allow interoperability with collections.abc).
>

Yes. Even if site.py doesn't use typing, many applications and libraries
will start using typing. And it's much slower than collections.abc.

> My idea is to re-implement ABCMeta (and ingredients it depends on, like
> WeakSet) in C.
> I didn't find such proposal on b.p.o., I have two questions:
> * Are there some potential problems with this idea (except that it may take
> some time and effort)?

WeakSet needs special care. Maybe ABCMeta can be optimized first.

Currently, ABCMeta uses three WeakSets, but their creation can be delayed
until `register` or `issubclass` is called. So even if WeakSet stays
implemented in Python, I think ABCMeta can be made much faster.

> * Is it something worth doing as an optimization?
> (If answers are no and yes, then maybe I would spend part of my vacation in
> August on it.)
>
> --
> Ivan
>

Bests,

From solipsis at pitrou.net  Thu Jul 20 08:56:37 2017
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Thu, 20 Jul 2017 14:56:37 +0200
Subject: [Python-Dev] Python startup time
References:
Message-ID: <20170720145637.3afe5133@fsol>

On Thu, 20 Jul 2017 21:29:18 +0900
INADA Naoki wrote:
>
> WeakSet needs special care. Maybe ABCMeta can be optimized first.
>
> Currently, ABCMeta uses three WeakSets, but their creation can be delayed
> until `register` or `issubclass` is called. So even if WeakSet stays
> implemented in Python, I think ABCMeta can be made much faster.

Simple uses of WeakSet can probably be replaced with regular sets +
weakref callbacks. As long as you are not doing one of the delicate
things (such as iterate), it should be fine.

Regards

Antoine.

From stefan_ml at behnel.de  Thu Jul 20 09:32:48 2017
From: stefan_ml at behnel.de (Stefan Behnel)
Date: Thu, 20 Jul 2017 15:32:48 +0200
Subject: [Python-Dev] Python startup time
In-Reply-To:
References:
Message-ID:

Ivan Levkivskyi wrote on 20.07.2017 at 13:24:
> I agree the start-up time is important. There is something that is related.
> ABCMeta is currently implemented in Python.
> This makes it slow, creation of an ABC is 2x slower than creation of a
> normal class.
> However, ABCs are used by many medium and large size projects.
> Also, both abc and _collections_abc are imported at start-up (in particular
> importlib uses several ABCs, os also needs them for environments).
> Finally, all generics in typing module and user-defined generic types are
> ABCs (to allow interoperability with collections.abc).
>
> My idea is to re-implement ABCMeta (and ingredients it depends on, like
> WeakSet) in C.
I know that this hasn't really been an accepted option so far (and it's
actually not an option for a few really early modules during startup), but
compiling a Python module with Cython will usually speed it up quite
noticeably (often 10-30%, sometimes more if you're lucky, e.g. [1]). And
that also applies to the startup time, simply because it's pre-compiled.

So, before considering to write an accelerator module in C that replaces
some existing Python module, and thus duplicating its entire source code
with highly increased complexity, I'd like to remind you that simply
compiling the Python module itself to C should give at least reasonable
speed-ups *without* adding to the maintenance burden, and can be done
optionally as part of the build process. We do that for Cython itself
during its installation, for example.

Stefan (Cython core developer)

[1] 3x faster URL routing by compiling a single Django module with Cython:
https://us.pycon.org/2017/schedule/presentation/693/

From ncoghlan at gmail.com  Thu Jul 20 10:02:09 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 21 Jul 2017 00:02:09 +1000
Subject: [Python-Dev] Python startup time
In-Reply-To:
References:
Message-ID:

On 20 July 2017 at 23:32, Stefan Behnel wrote:
> So, before considering to write an accelerator module in C that replaces
> some existing Python module, and thus duplicating its entire source code
> with highly increased complexity, I'd like to remind you that simply
> compiling the Python module itself to C should give at least reasonable
> speed-ups *without* adding to the maintenance burden, and can be done
> optionally as part of the build process. We do that for Cython itself
> during its installation, for example.

And if folks are concerned about the potential bootstrapping issues with
this approach, the gist is that it would have to look something like this:

Phase 0: freeze importlib
- build a CPython with only builtin and frozen module support
- use it to freeze importlib

Phase 1: traditional CPython
- build the traditional Python interpreter with no Cython accelerated modules

Phase 2: accelerated CPython
- if not otherwise available, use the traditional Python interpreter to
download & install Cython in a virtual environment
- run Cython to selectively precompile key modules (such as those
implicitly imported at startup)

Technically, phase 2 doesn't actually *change* CPython itself, since the
import system is already set up such that if an extension module and a
source module are side-by-side in the same directory, then the extension
module will take precedence.

As a result, precompiling with Cython is similar in many ways to
precompiling to bytecode, it's just that the result is native machine code
with Python C API calls, rather than CPython bytecode.

Cheers,
Nick.
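P.S. A rough way to see that precedence from a regular interpreter session
(just an illustration; the exact suffixes vary by platform and build):

import importlib.machinery as machinery
# The default path finder tries loaders in this order: extension modules
# first, then pure Python source, then plain bytecode. That ordering is
# what lets a compiled mod.<tag>.so shadow a mod.py in the same directory.
print(machinery.EXTENSION_SUFFIXES)  # e.g. ['.cpython-36m-x86_64-linux-gnu.so', '.abi3.so', '.so']
print(machinery.SOURCE_SUFFIXES)     # ['.py']
print(machinery.BYTECODE_SUFFIXES)   # ['.pyc']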
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From cesare.di.mauro at gmail.com Thu Jul 20 13:09:38 2017 From: cesare.di.mauro at gmail.com (Cesare Di Mauro) Date: Thu, 20 Jul 2017 19:09:38 +0200 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> Message-ID: 2017-07-19 16:26 GMT+02:00 Victor Stinner : > 2017-07-19 15:22 GMT+02:00 Oleg Broytman : > > On Wed, Jul 19, 2017 at 02:59:52PM +0200, Victor Stinner < > victor.stinner at gmail.com> wrote: > >> "Python is very slow to start on Windows 7" > >> https://stackoverflow.com/questions/29997274/python-is- > very-slow-to-start-on-windows-7 > > > > However hard you are going to optimize Python you cannot fix those > > "defenders", "guards" and "protectors". :-) This particular link can be > > excluded from consideration. > > Sorry, I didn't read carefully each link I posted. Even for me knowing > what Python does at startup, it's hard to explain why 3 people have > different timing: 15 ms, 75 ms and 300 ms for example. In my > experience, the following things impact Python startup: > > * -S option: loading or not the site module > * Paths in sys.path: PYTHONPATH environment variable for example > * .pth files files in sys.path > * Python running in a virtual environment or not > * Operating system: Python loads different modules at startup > depending on the OS. Naoki INADA just removed _osx_support from being > imported in the site module on macOS for example. > > My list is likely incomplete. > > In the performance benchmark suite, a controlled virtual environment > is created to have a known set of modules. FYI running Python is a > virtual environment is slower than "system" python which runs outside > a virtual environment... > > Victor > > Hi Victor, I assume that Python loads compiled (.pyc and/or .pyo) from the stdlib. That's something that also influences the startup time (compiling source vs loading pre-compiled modules). Bests, Cesare Mail priva di virus. www.avast.com <#DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2> -------------- next part -------------- An HTML attachment was scrubbed... URL: From victor.stinner at gmail.com Thu Jul 20 13:23:28 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Thu, 20 Jul 2017 19:23:28 +0200 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> Message-ID: 2017-07-20 19:09 GMT+02:00 Cesare Di Mauro : > I assume that Python loads compiled (.pyc and/or .pyo) from the stdlib. That's something that also influences the startup time (compiling source vs loading pre-compiled modules). My benchmark was "python3 -m perf command -- python3 -c pass": I don't explicitly remove .pyc files, I expect that Python uses prebuilt .pyc files from __pycache__. Victor From jimjjewett at gmail.com Thu Jul 20 13:53:52 2017 From: jimjjewett at gmail.com (Jim J. Jewett) Date: Thu, 20 Jul 2017 13:53:52 -0400 Subject: [Python-Dev] startup time repeated? why not daemon Message-ID: I agree that startup time is a problem, but I wonder if some of the pain could be mitigated by using a persistent process. 
For example, in https://mail.python.org/pipermail/python-dev/2017-July/148664.html Ben Hoyt mentions that the Google Cloud SDK (CLI) team has found it "especially problematic for shell tab completion helpers, because every time you press tab the shell has to load your Python program" Decades ago, I learned to set my editor to vi instead of emacs for similar reasons -- but there was also an emacsclient option that simply opened a new window from an already running emacs process. tab completion seems like the exactly the sort of thing that should be sent to an existing process instead of creating a new one. Is it too hard to create a daemon server? Is the communication and context switch slower than a new startup? Is the pattern just not well-enough advertised? -jJ -------------- next part -------------- An HTML attachment was scrubbed... URL: From phd at phdru.name Thu Jul 20 14:15:35 2017 From: phd at phdru.name (Oleg Broytman) Date: Thu, 20 Jul 2017 20:15:35 +0200 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: <20170720181535.GA5957@phdru.name> On Thu, Jul 20, 2017 at 01:53:52PM -0400, "Jim J. Jewett" wrote: > I agree that startup time is a problem, but I wonder if some of the pain > could be mitigated by using a persistent process. > > For example, in > https://mail.python.org/pipermail/python-dev/2017-July/148664.html Ben Hoyt > mentions that the Google Cloud SDK (CLI) team has found it "especially > problematic for shell tab completion helpers, because every time you press > tab the shell has to load your Python program" > > Decades ago, I learned to set my editor to vi instead of emacs for similar > reasons -- but there was also an emacsclient option that simply opened a > new window from an already running emacs process. tab completion seems > like the exactly the sort of thing that should be sent to an existing > process instead of creating a new one. > > Is it too hard to create a daemon server? > Is the communication and context switch slower than a new startup? > Is the pattern just not well-enough advertised? Just yesterday there was a link to such a daemon that caches pyGTK. Eons ago I'd been using ReadyExec: http://readyexec.sourceforge.net/ > -jJ Oleg. -- Oleg Broytman http://phdru.name/ phd at phdru.name Programmers don't die, they just GOSUB without RETURN. From cesare.di.mauro at gmail.com Thu Jul 20 15:38:23 2017 From: cesare.di.mauro at gmail.com (Cesare Di Mauro) Date: Thu, 20 Jul 2017 21:38:23 +0200 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> Message-ID: 2017-07-20 19:23 GMT+02:00 Victor Stinner : > 2017-07-20 19:09 GMT+02:00 Cesare Di Mauro : > > I assume that Python loads compiled (.pyc and/or .pyo) from the stdlib. > That's something that also influences the startup time (compiling source vs > loading pre-compiled modules). > > My benchmark was "python3 -m perf command -- python3 -c pass": I don't > explicitly remove .pyc files, I expect that Python uses prebuilt .pyc > files from __pycache__. > > Victor > OK, that should be the best case. An idea to improve the situation might be to find an alternative structure for .pyc/pyo files, which allows to (partially) "parallelize" their loading (not execution, of course), or at least speed-up the process. Maybe a GSoC project for some student, if no core dev has time to investigate it. Cesare Mail priva di virus. 
www.avast.com <#DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From p.f.moore at gmail.com  Thu Jul 20 16:47:30 2017
From: p.f.moore at gmail.com (Paul Moore)
Date: Thu, 20 Jul 2017 21:47:30 +0100
Subject: [Python-Dev] startup time repeated? why not daemon
In-Reply-To:
References:
Message-ID:

On 20 July 2017 at 18:53, Jim J. Jewett wrote:
> Is it too hard to create a daemon server?
> Is the communication and context switch slower than a new startup?
> Is the pattern just not well-enough advertised?

Managing a daemon (including things like stopping it when it's been idle
for "too long") is hard to get right, and even more so when it needs to be
cross-platform. That's not always a problem, but probably is enough of the
time to make "use a daemon" a somewhat specialist solution.

Paul

From ericsnowcurrently at gmail.com  Thu Jul 20 17:16:10 2017
From: ericsnowcurrently at gmail.com (Eric Snow)
Date: Thu, 20 Jul 2017 15:16:10 -0600
Subject: [Python-Dev] startup time repeated? why not daemon
In-Reply-To:
References:
Message-ID:

On Thu, Jul 20, 2017 at 11:53 AM, Jim J. Jewett wrote:
> I agree that startup time is a problem, but I wonder if some of the pain
> could be mitigated by using a persistent process.
>
> [snip]
>
> Is it too hard to create a daemon server?
> Is the communication and context switch slower than a new startup?
> Is the pattern just not well-enough advertised?

A couple years ago I suggested the same idea (i.e. "pythond") during a
conversation with MAL at PyCon UK.  IIRC, security and complexity were
the two major obstacles.  Assuming you use fork, you must ensure that
the daemon gets into just the right state.  Otherwise you're leaking
(potentially sensitive) info into the forked processes or you're wasting
cycles/memory.

Relatedly, at PyCon this year Barry and I were talking about the idea of
bootstrapping the interpreter from a memory snapshot on disk, rather
than from scratch (thus drastically reducing the number of IO events).
From what I gather, emacs does (or did) something like this.

The key thing for both solutions is getting the CPython runtime in a
very specific state.  Any such solution needs to get as much of the
runtime ready as possible, but only as much as is common to "most"
possible "python" invocations.  Furthermore, it has to be extremely
careful about security, e.g. protecting sensitive data and not
escalating privileges.  Having a python daemon that runs as root is
probably out of the question for now, meaning each user would have to
run their own daemon, paying for startup the first time they run
"python".

Aside from security concerns, there are parts of the CPython runtime
that depend on CLI flags and environment variables during startup.
Each "python" invocation must respect those inputs, as happens now,
rather than preserving the inputs from when the daemon was started.

FWIW, the startup-related code we landed in May (at PyCon), as a
precursor for Nick Coghlan's PEP 432, improves the technical situation
somewhat by more clearly organizing startup of CPython's runtime (and
the main interpreter).  Also, as part of my (slowly progressing)
multi-core Python project, I'm currently working on consolidating the
(CPython) global runtime state into an explicit struct.  This will help
us reason better about the state of the runtime, allowing us to be more
confident about (and more able to implement) solutions for
isolating/protecting/optimizing the CPython runtime.
These efforts have been all about improving the understandability of CPython's runtime through more concise encapsulation. The overarching goals have been: reducing out maintenance burden, lowering the cost of enhancement, improving the embedding story, and even enabling better runtime portability (e.g. across threads, processes, and even hosts). There is a direct correlation there with better opportunities to improve startup time, including a python daemon. -eric From njs at pobox.com Thu Jul 20 20:19:19 2017 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 20 Jul 2017 17:19:19 -0700 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: On Jul 20, 2017 14:18, "Eric Snow" wrote: On Thu, Jul 20, 2017 at 11:53 AM, Jim J. Jewett wrote: > I agree that startup time is a problem, but I wonder if some of the pain > could be mitigated by using a persistent process. > > [snip] > > Is it too hard to create a daemon server? > Is the communication and context switch slower than a new startup? > Is the pattern just not well-enough advertised? A couple years ago I suggested the same idea (i.e. "pythond") during a conversation with MAL at PyCon UK. IIRC, security and complexity were the two major obstacles. Assuming you use fork, you must ensure that the daemon gets into just the right state. Otherwise you're leaking (potentially sensitive) info into the forked processes or you're wasting cycles/memory. Relatedly, at PyCon this year Barry and I were talking about the idea of bootstrapping the interpreter from a memory snapshot on disk, rather than from scatch (thus drastically reducing the number of IO events). From what I gather, emacs does (or did) something like this. There's a fair amount of prior art for both of these. The prestart/daemon approach is apparently somewhat popular in the Java world, because the jvm is super slow to start up. E.g.: https://github.com/ninjudd/drip The interesting thing there is probably their README's comparison of their strategy to previous attempts at the same thing. (They've explicitly moved away from a persistent daemon approach.) The emacs memory dump approach is *really* challenging. They've been struggling to move away from it for years. Have you ever wondered why jemalloc is so much better than the default glibc malloc on Linux? Apparently it's because for many years it was impossible to improve glibc malloc's internal memory layout because it would break emacs. I'm not sure either of these make much sense when python startup is already in the single digit milliseconds. While it's certainly great if we can lower that further, my impression is that for any real application, startup time is overwhelmingly spent importing user packages, not in the interpreter start up itself. And this is difficult to optimize with a daemon or memory dump, because you need a full list of modules to preload and it'll differ between programs. This suggests that optimizations to finding/loading/executing modules are likely to give the biggest startup time wins. -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg at krypto.org Thu Jul 20 20:49:31 2017 From: greg at krypto.org (Gregory P. Smith) Date: Fri, 21 Jul 2017 00:49:31 +0000 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: On Thu, Jul 20, 2017 at 10:56 AM Jim J. 
Jewett wrote: > I agree that startup time is a problem, but I wonder if some of the pain > could be mitigated by using a persistent process. > This is one strategy that works under some situations, but not all. There are downsides to daemons: * They only work on one machine. New instances being launched in a cloud (think kubernetes jobs, app engine workers, etc) cannot benefit. * A daemon that forks off new workers can lose the benefit of hash randomization as tons of processes at once share the same seed. Mitigation for this is possible by regularly relaunching new replacement daemons but that complicates the already complicated. * Correctly launching and managing a daemon process is hard. Even once you have done so, you now have a interprocess concurrency and synchronization issues. For example, in > https://mail.python.org/pipermail/python-dev/2017-July/148664.html Ben > Hoyt mentions that the Google Cloud SDK (CLI) team has found it "especially > problematic for shell tab completion helpers, because every time you press > tab the shell has to load your Python program" > I can imagine a daemon working well in this specific example. Is it too hard to create a daemon server? > That is my take on it. Is the communication and context switch slower than a new startup? > Is the pattern just not well-enough advertised? > I have experienced good daemon processes. Bazel (a Java based build system) uses that approach. I can imagine Mercurial being able to do so as well but have no idea if they've looked into it or not. Daemons are by their nature an application specific thing. -gps -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Thu Jul 20 22:44:38 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 21 Jul 2017 12:44:38 +1000 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> Message-ID: On 21 July 2017 at 05:38, Cesare Di Mauro wrote: > > > 2017-07-20 19:23 GMT+02:00 Victor Stinner : > >> 2017-07-20 19:09 GMT+02:00 Cesare Di Mauro : >> > I assume that Python loads compiled (.pyc and/or .pyo) from the stdlib. >> That's something that also influences the startup time (compiling source vs >> loading pre-compiled modules). >> >> My benchmark was "python3 -m perf command -- python3 -c pass": I don't >> explicitly remove .pyc files, I expect that Python uses prebuilt .pyc >> files from __pycache__. >> >> Victor >> > > OK, that should be the best case. > > An idea to improve the situation might be to find an alternative structure > for .pyc/pyo files, which allows to (partially) "parallelize" their loading > (not execution, of course), or at least speed-up the process. Maybe a GSoC > project for some student, if no core dev has time to investigate it. > Unmarshalling the code object from disk generally isn't the slow part - it's the module level execution that takes time. Using the typing module as an example, a full reload cycle takes almost 10 milliseconds: $ python3 -m perf timeit -s "import typing; from importlib import reload" "reload(typing)" ..................... 
Mean +- std dev: 9.89 ms +- 0.46 ms (Don't try timing "import typing" directly - the sys.modules cache amortises the cost down to being measured in nanoseconds, since you're effectively just measuring the speed of a dict lookup) We can separately measure the cost of unmarshalling the code object: $ python3 -m perf timeit -s "import typing; from marshal import loads; from importlib.util import cache_from_source; cache = cache_from_source(typing.__file__); data = open(cache, 'rb').read()[12:]" "loads(data)" ..................... Mean +- std dev: 286 us +- 4 us Finding the module spec: $ python3 -m perf timeit -s "from importlib.util import find_spec" "find_spec('typing')" ..................... Mean +- std dev: 69.2 us +- 2.3 us And actually running the module's code (this includes unmarshalling the code object, but *not* calculating the import spec): $ python3 -m perf timeit -s "import typing; loader_exec = typing.__spec__.loader.exec_module" "loader_exec(typing)" ..................... Mean +- std dev: 9.68 ms +- 0.43 ms Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Thu Jul 20 22:52:11 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 21 Jul 2017 12:52:11 +1000 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> Message-ID: On 21 July 2017 at 12:44, Nick Coghlan wrote: > We can separately measure the cost of unmarshalling the code object: > > $ python3 -m perf timeit -s "import typing; from marshal import loads; from > importlib.util import cache_from_source; cache = > cache_from_source(typing.__file__); data = open(cache, 'rb').read()[12:]" > "loads(data)" > ..................... > Mean +- std dev: 286 us +- 4 us Slight adjustment here, as the cost of locating the cached bytecode and reading it from disk should really be accounted for in each iteration: $ python3 -m perf timeit -s "import typing; from marshal import loads; from importlib.util import cache_from_source" "cache = cache_from_source(typing.__spec__.origin); data = open(cache, 'rb').read()[12:]; loads(data)" ..................... Mean +- std dev: 337 us +- 8 us That will have a bigger impact when loading from spinning disk or a network drive, but it's fairly negligible when loading from a local SSD or an already primed filesystem cache. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ncoghlan at gmail.com Thu Jul 20 23:49:53 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 21 Jul 2017 13:49:53 +1000 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: On 21 July 2017 at 10:19, Nathaniel Smith wrote: > I'm not sure either of these make much sense when python startup is already > in the single digit milliseconds. While it's certainly great if we can lower > that further, my impression is that for any real application, startup time > is overwhelmingly spent importing user packages, not in the interpreter > start up itself. And this is difficult to optimize with a daemon or memory > dump, because you need a full list of modules to preload and it'll differ > between programs. > > This suggests that optimizations to finding/loading/executing modules are > likely to give the biggest startup time wins. 
Agreed, and this is where both lazy loading and Cython precompilation are genuinely interesting: * Cython precompilation can have a significant impact on startup time, as it replaces module level code execution at import time with a combination of Cython translation to C code at build time and Python C API calls at import time * Lazy loading can have a significant impact on startup time, as it means you don't have to pay for the cost of finding and loading modules that you don't actually end up using on that particular run We've historically resisted adopting these techniques for the standard library because they *do* make things more complicated *and* harder to debug relative to plain old eagerly imported dynamic Python code. However, if we're going to recommend them as good practices for 3rd party developers looking to optimise the startup time of their Python applications, then it makes sense for us to embrace them for the standard library as well, rather than having our first reaction be to write more hand-crafted C code. On that last point, it's also worth keeping in mind that we have a much harder time finding new C-level contributors than we do new Python-level ones, and have every reason to expect that problem to get worse over time rather than better (since writing and maintaining handcrafted C code is likely to go the way of writing and maintaining handcrafted assembly code as a skillset: while it will still be genuinely necessary in some contexts, it will also be an increasingly niche technical specialty). Starting to migrate to using Cython for our acceleration modules instead of plain C should thus prove to be a win for everyone: - Cython structurally avoids a lot of typical bugs that arise in hand-coded extensions (e.g. refcount bugs) - by design, it's much easier to mentally switch between Python & Cython than it is between Python & C - Cython accelerated modules are easier to adapt to other interpeter implementations than handcrafted C modules - keeping Python modules and their C accelerated counterparts in sync will be easier, as they'll mostly be using the same code - we'd be able to start writing C API test cases in Cython rather than in handcrafted C (which currently mostly translates to only testing them indirectly) - CPython's own test suite would naturally help test Cython compatibility with any C API updates - we'd have an inherent incentive to help enhance Cython to take advantage of new C API features The are some genuine downsides in increasing the complexity of bootstrapping CPython when all you're starting with is a VCS clone and a C compiler, but those complications are ultimately no worse than those we already have with Argument Clinic, and hence amenable to the same solution: if we need to, we can check in the generated C files in order to make bootstrapping easier. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From rosuav at gmail.com Thu Jul 20 23:55:25 2017 From: rosuav at gmail.com (Chris Angelico) Date: Fri, 21 Jul 2017 13:55:25 +1000 Subject: [Python-Dev] startup time repeated? 
why not daemon In-Reply-To: References: Message-ID: On Fri, Jul 21, 2017 at 1:49 PM, Nick Coghlan wrote: > The are some genuine downsides in increasing the complexity of > bootstrapping CPython when all you're starting with is a VCS clone and > a C compiler, but those complications are ultimately no worse than > those we already have with Argument Clinic, and hence amenable to the > same solution: if we need to, we can check in the generated C files in > order to make bootstrapping easier. Are the generated C files perfectly identical? If you use Cython to compile the same file twice, will you always get a byte-for-byte identical file? If so, it should be safe to check them in, and then have a "make regenerate" that wipes out all Cython-generated files and rebuilds them. That followed by "git status" would immediately tell you if something failed to get checked in. ChrisA From ncoghlan at gmail.com Fri Jul 21 00:09:23 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 21 Jul 2017 14:09:23 +1000 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: On 21 July 2017 at 13:55, Chris Angelico wrote: > On Fri, Jul 21, 2017 at 1:49 PM, Nick Coghlan wrote: >> The are some genuine downsides in increasing the complexity of >> bootstrapping CPython when all you're starting with is a VCS clone and >> a C compiler, but those complications are ultimately no worse than >> those we already have with Argument Clinic, and hence amenable to the >> same solution: if we need to, we can check in the generated C files in >> order to make bootstrapping easier. > > Are the generated C files perfectly identical? If you use Cython to > compile the same file twice, will you always get a byte-for-byte > identical file? We that's certainly highly beneficial, we don't necessarily need it as an ironclad guarantee (it isn't true for autoconf, for example, especially if you change versions, but we still check in the autoconf output in order to avoid relying on autoconf as a build dependency). > If so, it should be safe to check them in, and then > have a "make regenerate" that wipes out all Cython-generated files and > rebuilds them. That followed by "git status" would immediately tell > you if something failed to get checked in. Yep, and we already have "make regen-all" as a target to cover various other build steps where this concern applies: regen-all: regen-opcode regen-opcode-targets regen-typeslots regen-grammar regen-ast regen-importlib Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From rosuav at gmail.com Fri Jul 21 00:12:00 2017 From: rosuav at gmail.com (Chris Angelico) Date: Fri, 21 Jul 2017 14:12:00 +1000 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: On Fri, Jul 21, 2017 at 2:09 PM, Nick Coghlan wrote: > On 21 July 2017 at 13:55, Chris Angelico wrote: >> On Fri, Jul 21, 2017 at 1:49 PM, Nick Coghlan wrote: >>> The are some genuine downsides in increasing the complexity of >>> bootstrapping CPython when all you're starting with is a VCS clone and >>> a C compiler, but those complications are ultimately no worse than >>> those we already have with Argument Clinic, and hence amenable to the >>> same solution: if we need to, we can check in the generated C files in >>> order to make bootstrapping easier. >> >> Are the generated C files perfectly identical? If you use Cython to >> compile the same file twice, will you always get a byte-for-byte >> identical file? 
> > We that's certainly highly beneficial, we don't necessarily need it as > an ironclad guarantee (it isn't true for autoconf, for example, > especially if you change versions, but we still check in the autoconf > output in order to avoid relying on autoconf as a build dependency). > >> If so, it should be safe to check them in, and then >> have a "make regenerate" that wipes out all Cython-generated files and >> rebuilds them. That followed by "git status" would immediately tell >> you if something failed to get checked in. > > Yep, and we already have "make regen-all" as a target to cover various > other build steps where this concern applies: > > regen-all: regen-opcode regen-opcode-targets regen-typeslots > regen-grammar regen-ast regen-importlib Cool. (Shows how much I know about the CPython build process.) Then I'm definitely +1 on using Cython. ChrisA From chris.jerdonek at gmail.com Fri Jul 21 01:11:01 2017 From: chris.jerdonek at gmail.com (Chris Jerdonek) Date: Thu, 20 Jul 2017 22:11:01 -0700 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: On Thu, Jul 20, 2017 at 8:49 PM, Nick Coghlan wrote: > ... > * Lazy loading can have a significant impact on startup time, as it > means you don't have to pay for the cost of finding and loading > modules that you don't actually end up using on that particular run > > We've historically resisted adopting these techniques for the standard > library because they *do* make things more complicated *and* harder to > debug relative to plain old eagerly imported dynamic Python code. > However, if we're going to recommend them as good practices for 3rd > party developers looking to optimise the startup time of their Python > applications, then it makes sense for us to embrace them for the > standard library as well, rather than having our first reaction be to > write more hand-crafted C code. Are there any good write-ups of best practices and techniques in this area for applications (other than obvious things like avoiding unnecessary imports)? I'm thinking of things like how to structure your project, things to look for, developer tools that might help, and perhaps third-party runtime libraries? --Chris > > On that last point, it's also worth keeping in mind that we have a > much harder time finding new C-level contributors than we do new > Python-level ones, and have every reason to expect that problem to get > worse over time rather than better (since writing and maintaining > handcrafted C code is likely to go the way of writing and maintaining > handcrafted assembly code as a skillset: while it will still be > genuinely necessary in some contexts, it will also be an increasingly > niche technical specialty). > > Starting to migrate to using Cython for our acceleration modules > instead of plain C should thus prove to be a win for everyone: > > - Cython structurally avoids a lot of typical bugs that arise in > hand-coded extensions (e.g. 
refcount bugs) > - by design, it's much easier to mentally switch between Python & > Cython than it is between Python & C > - Cython accelerated modules are easier to adapt to other interpeter > implementations than handcrafted C modules > - keeping Python modules and their C accelerated counterparts in sync > will be easier, as they'll mostly be using the same code > - we'd be able to start writing C API test cases in Cython rather than > in handcrafted C (which currently mostly translates to only testing > them indirectly) > - CPython's own test suite would naturally help test Cython > compatibility with any C API updates > - we'd have an inherent incentive to help enhance Cython to take > advantage of new C API features > > The are some genuine downsides in increasing the complexity of > bootstrapping CPython when all you're starting with is a VCS clone and > a C compiler, but those complications are ultimately no worse than > those we already have with Argument Clinic, and hence amenable to the > same solution: if we need to, we can check in the generated C files in > order to make bootstrapping easier. > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/chris.jerdonek%40gmail.com From cesare.di.mauro at gmail.com Fri Jul 21 01:30:06 2017 From: cesare.di.mauro at gmail.com (Cesare Di Mauro) Date: Fri, 21 Jul 2017 07:30:06 +0200 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> Message-ID: 2017-07-21 4:52 GMT+02:00 Nick Coghlan : > On 21 July 2017 at 12:44, Nick Coghlan wrote: > > We can separately measure the cost of unmarshalling the code object: > > > > $ python3 -m perf timeit -s "import typing; from marshal import loads; > from > > importlib.util import cache_from_source; cache = > > cache_from_source(typing.__file__); data = open(cache, > 'rb').read()[12:]" > > "loads(data)" > > ..................... > > Mean +- std dev: 286 us +- 4 us > > Slight adjustment here, as the cost of locating the cached bytecode > and reading it from disk should really be accounted for in each > iteration: > > $ python3 -m perf timeit -s "import typing; from marshal import loads; > from importlib.util import cache_from_source" "cache = > cache_from_source(typing.__spec__.origin); data = open(cache, > 'rb').read()[12:]; loads(data)" > ..................... > Mean +- std dev: 337 us +- 8 us > > That will have a bigger impact when loading from spinning disk or a > network drive, but it's fairly negligible when loading from a local > SSD or an already primed filesystem cache. > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > Thanks for your tests, Nick. It's quite evident that the marshal code cannot improve the situation, so I regret from my proposal. I took a look at the typing module, and there are some small things that can be optimized, but it'll not change the overall situation unfortunately. Code execution can be improved. :) However, it requires a massive amount of time experimenting... Bests, Cesare Mail priva di virus. www.avast.com <#DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2> -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ncoghlan at gmail.com Fri Jul 21 02:23:58 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 21 Jul 2017 16:23:58 +1000 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> Message-ID: On 21 July 2017 at 15:30, Cesare Di Mauro wrote: > > > 2017-07-21 4:52 GMT+02:00 Nick Coghlan : > >> On 21 July 2017 at 12:44, Nick Coghlan wrote: >> > We can separately measure the cost of unmarshalling the code object: >> > >> > $ python3 -m perf timeit -s "import typing; from marshal import loads; >> from >> > importlib.util import cache_from_source; cache = >> > cache_from_source(typing.__file__); data = open(cache, >> 'rb').read()[12:]" >> > "loads(data)" >> > ..................... >> > Mean +- std dev: 286 us +- 4 us >> >> Slight adjustment here, as the cost of locating the cached bytecode >> and reading it from disk should really be accounted for in each >> iteration: >> >> $ python3 -m perf timeit -s "import typing; from marshal import loads; >> from importlib.util import cache_from_source" "cache = >> cache_from_source(typing.__spec__.origin); data = open(cache, >> 'rb').read()[12:]; loads(data)" >> ..................... >> Mean +- std dev: 337 us +- 8 us >> >> That will have a bigger impact when loading from spinning disk or a >> network drive, but it's fairly negligible when loading from a local >> SSD or an already primed filesystem cache. >> >> Cheers, >> Nick. >> >> -- >> Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia >> > Thanks for your tests, Nick. It's quite evident that the marshal code > cannot improve the situation, so I regret from my proposal. > It was still a good suggestion, since it made me realise I *hadn't* actually measured the relative timings lately, so it was technically an untested assumption that module level code execution still dominated the overall import time. typing is also a particularly large & complex module, and bytecode unmarshalling represents a larger fraction of the import time for simpler modules like abc: $ python3 -m perf timeit -s "import abc; from marshal import loads; from importlib.util import cache_from_source" "cache = cache_from_source(abc.__spec__.origin); data = open(cache, 'rb').read()[12:]; loads(data)" ..................... Mean +- std dev: 45.2 us +- 1.1 us $ python3 -m perf timeit -s "import abc; loader_exec = abc.__spec__.loader.exec_module" "loader_exec(abc)" ..................... Mean +- std dev: 172 us +- 5 us $ python3 -m perf timeit -s "import abc; from importlib import reload" "reload(abc)" ..................... Mean +- std dev: 280 us +- 14 us And _weakrefset: $ python3 -m perf timeit -s "import _weakrefset; from marshal import loads; from importlib.util import cache_from_source" "cache = cache_from_source(_weakrefset.__spec__.origin); data = open(cache, 'rb').read()[12:]; loads(data)" ..................... Mean +- std dev: 57.7 us +- 1.3 us $ python3 -m perf timeit -s "import _weakrefset; loader_exec = _weakrefset.__spec__.loader.exec_module" "loader_exec(_weakrefset)" ..................... Mean +- std dev: 129 us +- 6 us $ python3 -m perf timeit -s "import _weakrefset; from importlib import reload" "reload(_weakrefset)" ..................... 
Mean +- std dev: 226 us +- 4 us The conclusion still holds (the absolute numbers here are likely still too small for the extra complexity of parallelising bytecode loading to pay off in any significant way), but it also helps us set reasonable expectations around how much of a gain we're likely to be able to get just from precompilation with Cython. That does actually raise a small microbenchmarking problem: for source and bytecode imports, we can force the import system to genuinely rerun the module or unmarshal the bytecode inside a single Python process, allowing perf to measure it independently of CPython startup. While I'm pretty sure it's possible to trick the import machinery into rerunning module level init functions even for old-style extension modules (hence allowing us to run similar tests to those above for a Cython compiled module), I don't actually remember how to do it off the top of my head. Cheers, Nick. P.S. I'll also note that in these cases where the import overhead is proportionally significant for always-imported modules, we may want to look at the benefits of freezing them (if they otherwise remain as pure Python modules), or compiling them as builtin modules (if we switch them over to Cython), in addition to looking at ways to make the modules themselves faster. Being built directly into the interpreter binary is pretty much the best case scenario for reducing import overhead. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia -------------- next part -------------- An HTML attachment was scrubbed... URL: From mertz at gnosis.cx Fri Jul 21 03:12:20 2017 From: mertz at gnosis.cx (David Mertz) Date: Fri, 21 Jul 2017 00:12:20 -0700 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> Message-ID: How implausible is it to write out the actual memory image of a loaded Python process? I.e. on a specific machine, OS, Python version, etc? This can only be overhead initially, of course, but on subsequent runs it's just one memory map, which the cheapest possible operation. E.g. $ python3.7 --write-image "import typing, re, os, numpy" I imagine this creating a file like: /tmp/__python__/python37-typing-re-os-numpy.mem Then just terminating as if just that line had run, however long it takes (but snapshotting before exit). Then subsequent invocations would only restore the image to memory. Maybe: $ pyrunner --load-image python37-typing-re-os-numpy myscript.py The last line could be aliased of course. I suppose we'd need to check if relevant file exists, and if not fall back to just ignoring the '--load-image' flag and running plain old Python. This helps not at all for something like AWS Lambda where each instance is spun up fresh. But for the use-case of running many Python shell commands at an interactive shell on one machine, it seems like that could be very fast. In my hypothetical I suppose pre-loading some collection of modules in the image. Of course, the script may need to load others, and it may not use some in the image. But users could decide their typical needed modules themselves under this idea. 
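(As an aside, and assuming a Unix-only context: the closest thing the stdlib offers today to "pay the import cost once and reuse it" is probably multiprocessing's forkserver start method, which imports a preload list once in a long-lived helper process and then forks every worker from that pre-warmed state. A rough sketch, with placeholder function and module names, and obviously not a substitute for the CLI idea above:

    import multiprocessing as mp

    def work(x):
        # anything imported by the forkserver parent is already loaded here
        return x * x

    if __name__ == "__main__":
        mp.set_start_method("forkserver")
        # these modules are imported once, in the forkserver process;
        # every worker is then forked from that pre-warmed state
        mp.set_forkserver_preload(["typing", "re", "os"])
        with mp.Pool(4) as pool:
            print(pool.map(work, range(8)))

It only helps workloads that can be phrased as "fork workers from a warm parent", so it does nothing for the shell-command case.)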
On Jul 20, 2017 11:27 PM, "Nick Coghlan" wrote: > On 21 July 2017 at 15:30, Cesare Di Mauro > wrote: > >> >> >> 2017-07-21 4:52 GMT+02:00 Nick Coghlan : >> >>> On 21 July 2017 at 12:44, Nick Coghlan wrote: >>> > We can separately measure the cost of unmarshalling the code object: >>> > >>> > $ python3 -m perf timeit -s "import typing; from marshal import loads; >>> from >>> > importlib.util import cache_from_source; cache = >>> > cache_from_source(typing.__file__); data = open(cache, >>> 'rb').read()[12:]" >>> > "loads(data)" >>> > ..................... >>> > Mean +- std dev: 286 us +- 4 us >>> >>> Slight adjustment here, as the cost of locating the cached bytecode >>> and reading it from disk should really be accounted for in each >>> iteration: >>> >>> $ python3 -m perf timeit -s "import typing; from marshal import loads; >>> from importlib.util import cache_from_source" "cache = >>> cache_from_source(typing.__spec__.origin); data = open(cache, >>> 'rb').read()[12:]; loads(data)" >>> ..................... >>> Mean +- std dev: 337 us +- 8 us >>> >>> That will have a bigger impact when loading from spinning disk or a >>> network drive, but it's fairly negligible when loading from a local >>> SSD or an already primed filesystem cache. >>> >>> Cheers, >>> Nick. >>> >>> -- >>> Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia >>> >> Thanks for your tests, Nick. It's quite evident that the marshal code >> cannot improve the situation, so I regret from my proposal. >> > > It was still a good suggestion, since it made me realise I *hadn't* > actually measured the relative timings lately, so it was technically an > untested assumption that module level code execution still dominated the > overall import time. > > typing is also a particularly large & complex module, and bytecode > unmarshalling represents a larger fraction of the import time for simpler > modules like abc: > > $ python3 -m perf timeit -s "import abc; from marshal import loads; from > importlib.util import cache_from_source" "cache = > cache_from_source(abc.__spec__.origin); data = open(cache, > 'rb').read()[12:]; loads(data)" > ..................... > Mean +- std dev: 45.2 us +- 1.1 us > > $ python3 -m perf timeit -s "import abc; loader_exec = > abc.__spec__.loader.exec_module" "loader_exec(abc)" > ..................... > Mean +- std dev: 172 us +- 5 us > > $ python3 -m perf timeit -s "import abc; from importlib import reload" > "reload(abc)" > ..................... > Mean +- std dev: 280 us +- 14 us > > And _weakrefset: > > $ python3 -m perf timeit -s "import _weakrefset; from marshal import > loads; from importlib.util import cache_from_source" "cache = > cache_from_source(_weakrefset.__spec__.origin); data = open(cache, > 'rb').read()[12:]; loads(data)" > ..................... > Mean +- std dev: 57.7 us +- 1.3 us > > $ python3 -m perf timeit -s "import _weakrefset; loader_exec = > _weakrefset.__spec__.loader.exec_module" "loader_exec(_weakrefset)" > ..................... > Mean +- std dev: 129 us +- 6 us > > $ python3 -m perf timeit -s "import _weakrefset; from importlib import > reload" "reload(_weakrefset)" > ..................... > Mean +- std dev: 226 us +- 4 us > > The conclusion still holds (the absolute numbers here are likely still too > small for the extra complexity of parallelising bytecode loading to pay off > in any significant way), but it also helps us set reasonable expectations > around how much of a gain we're likely to be able to get just from > precompilation with Cython. 
> > That does actually raise a small microbenchmarking problem: for source and > bytecode imports, we can force the import system to genuinely rerun the > module or unmarshal the bytecode inside a single Python process, allowing > perf to measure it independently of CPython startup. While I'm pretty sure > it's possible to trick the import machinery into rerunning module level > init functions even for old-style extension modules (hence allowing us to > run similar tests to those above for a Cython compiled module), I don't > actually remember how to do it off the top of my head. > > Cheers, > Nick. > > P.S. I'll also note that in these cases where the import overhead is > proportionally significant for always-imported modules, we may want to look > at the benefits of freezing them (if they otherwise remain as pure Python > modules), or compiling them as builtin modules (if we switch them over to > Cython), in addition to looking at ways to make the modules themselves > faster. Being built directly into the interpreter binary is pretty much the > best case scenario for reducing import overhead. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > mertz%40gnosis.cx > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From solipsis at pitrou.net Fri Jul 21 04:54:32 2017 From: solipsis at pitrou.net (Antoine Pitrou) Date: Fri, 21 Jul 2017 10:54:32 +0200 Subject: [Python-Dev] Python startup time References: <20170719132211.GA30782@phdru.name> Message-ID: <20170721105432.099a2a1c@fsol> On Fri, 21 Jul 2017 00:12:20 -0700 David Mertz wrote: > How implausible is it to write out the actual memory image of a loaded > Python process? I.e. on a specific machine, OS, Python version, etc? This > can only be overhead initially, of course, but on subsequent runs it's just > one memory map, which the cheapest possible operation. You can't rely on the file being remapped at the same address when you reload it. So you'd have to write a relocation routine that's able to find and fix *all* pointers inside the Python object tree and CPython's internal structures (fixing the pointers is not necessarily difficult, finding them without missing any is the difficult part). Regards Antoine. From songofacandy at gmail.com Fri Jul 21 05:28:30 2017 From: songofacandy at gmail.com (INADA Naoki) Date: Fri, 21 Jul 2017 18:28:30 +0900 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> Message-ID: On Fri, Jul 21, 2017 at 4:12 PM, David Mertz wrote: > How implausible is it to write out the actual memory image of a loaded > Python process? I.e. on a specific machine, OS, Python version, etc? This > can only be overhead initially, of course, but on subsequent runs it's just > one memory map, which the cheapest possible operation. FYI, you may be interested in very recent node.js security issue. 
https://nodejs.org/en/blog/vulnerability/july-2017-security-releases/#node-js-specific-security-flaws From victor.stinner at gmail.com Fri Jul 21 06:02:39 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Fri, 21 Jul 2017 12:02:39 +0200 Subject: [Python-Dev] Need help to fix urllib(.parse) vulnerabilities Message-ID: Hi, Recently, two security vulnerabilities were reported in the urllib module: https://bugs.python.org/issue30500 http://python-security.readthedocs.io/vuln/bpo-30500_urllib_connects_to_a_wrong_host.html#bpo-30500-urllib-connects-to-a-wrong-host => already fixed in Python 3.6.2 https://bugs.python.org/issue29606 http://python-security.readthedocs.io/vuln/urllib_ftp_protocol_stream_injection.html#urllib-ftp-protocol-stream-injection => not fixed yet I also proposed a more general protection: "Reject newline character (U+000A) in URLs in urllib.parse": http://bugs.python.org/issue30713 The problem with the urllib module is how we handle invalid URL. Right now, we return the URL unmodified if we cannot parse it. Should we raise an exception if an URL contains a newline for example? It's very hard to harden the urllib module without the backward compatibility. That's why it took 3 weeks to fix "urllib connects to a wrong host": find how to fix the vulnerability without brekaing the backward compatibility. Another proposed approach is to reject invalid data earlier or later, but not in urllib... So if you understand URLs, HTTP, etc. : please join these issues to help us to fix them! Victor From victor.stinner at gmail.com Fri Jul 21 06:45:36 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Fri, 21 Jul 2017 12:45:36 +0200 Subject: [Python-Dev] Need help to fix urllib(.parse) vulnerabilities In-Reply-To: References: Message-ID: 2017-07-21 12:02 GMT+02:00 Victor Stinner : > https://bugs.python.org/issue29606 > http://python-security.readthedocs.io/vuln/urllib_ftp_protocol_stream_injection.html#urllib-ftp-protocol-stream-injection > => not fixed yet Ok, I more concrete problem. To fix the "urllib FTP" bug, we have to find a balance between security (reject any URL looking like an attempt to counter the security protections) and backward compatibility (accept filenames containing newlines). Maybe we need to only reject an URL which contains a newline in the "host" part, but accept them in the "path" part of the URL? The question is if the code splits correctly "host" and "path" parts when the URL contains a newline. My bet is that no, it behaves badly :-) Victor From Nikolaus at rath.org Fri Jul 21 07:25:36 2017 From: Nikolaus at rath.org (Nikolaus Rath) Date: Fri, 21 Jul 2017 13:25:36 +0200 Subject: [Python-Dev] Python startup time In-Reply-To: (David Mertz's message of "Fri, 21 Jul 2017 00:12:20 -0700") References: <20170719132211.GA30782@phdru.name> Message-ID: <874lu62frj.fsf@vostro.rath.org> On Jul 21 2017, David Mertz wrote: > How implausible is it to write out the actual memory image of a loaded > Python process? That is what Emacs does, and it causes them a lot of trouble. They're trying to move away from it at the moment, but the direction is not yet clear. The keyword is "unexec", and it wrecks havoc with malloc. Best, -Nikolaus -- GPG Fingerprint: ED31 791B 2C5C 1613 AF38 8B8A D113 FCAC 3C4E 599F ?Time flies like an arrow, fruit flies like a Banana.? 
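Returning to Victor's question above about whether urllib splits the "host" and "path" parts correctly when the URL contains a newline: here is a minimal sketch of the kind of check being discussed, assuming the split is done with urllib.parse.urlsplit (illustration only, with a made-up helper name, not a proposed patch):

    from urllib.parse import urlsplit

    def split_rejecting_newline_in_host(url):
        # Refuse the URL if a CR/LF ends up in the authority (host:port)
        # component after parsing, while leaving the path component alone.
        parts = urlsplit(url)
        if '\r' in parts.netloc or '\n' in parts.netloc:
            raise ValueError('newline in URL host component: %r' % (url,))
        return parts

Whether this actually catches the FTP injection depends on where urlsplit puts the newline in the first place, which is exactly the open question.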
From g.rodola at gmail.com Fri Jul 21 08:43:18 2017 From: g.rodola at gmail.com (Giampaolo Rodola') Date: Fri, 21 Jul 2017 14:43:18 +0200 Subject: [Python-Dev] Need help to fix urllib(.parse) vulnerabilities In-Reply-To: References: Message-ID: On Fri, Jul 21, 2017 at 12:45 PM, Victor Stinner wrote: > 2017-07-21 12:02 GMT+02:00 Victor Stinner : > > https://bugs.python.org/issue29606 > > http://python-security.readthedocs.io/vuln/urllib_ > ftp_protocol_stream_injection.html#urllib-ftp-protocol-stream-injection > > => not fixed yet > > Ok, I more concrete problem. To fix the "urllib FTP" bug, we have to > find a balance between security (reject any URL looking like an > attempt to counter the security protections) and backward > compatibility (accept filenames containing newlines). > > Maybe we need to only reject an URL which contains a newline in the > "host" part, but accept them in the "path" part of the URL? The > question is if the code splits correctly "host" and "path" parts when > the URL contains a newline. My bet is that no, it behaves badly :-) > > Victor > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/g. > rodola%40gmail.com > It took me a while to understand the security implications of this FTP-related bug, but I believe I got the gist of it here (I can elaborate further if it's not clear): https://github.com/python/cpython/pull/1214#issuecomment-298393169 My proposal is to fix ftplib.py and guard against malicious strings involving the *PORT command only*. This way we fix the issue *and* maintain backward compatibility by allowing users to specify "\n" in their paths and username / password pairs. Java took a different approach and disallowed "\n" completely. To my understanding fixing ftplib would automatically mean fixing urllib as well. -- Giampaolo - http://grodola.blogspot.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From random832 at fastmail.com Fri Jul 21 10:23:10 2017 From: random832 at fastmail.com (Random832) Date: Fri, 21 Jul 2017 10:23:10 -0400 Subject: [Python-Dev] Need help to fix urllib(.parse) vulnerabilities In-Reply-To: References: Message-ID: <1500646990.321852.1048299248.02BB02F9@webmail.messagingengine.com> On Fri, Jul 21, 2017, at 08:43, Giampaolo Rodola' wrote: > It took me a while to understand the security implications of this > FTP-related bug, but I believe I got the gist of it here (I can > elaborate further if it's not clear): > https://github.com/python/cpython/pull/1214#issuecomment-298393169 > My proposal is to fix ftplib.py and guard against malicious > strings involving the *PORT command only*. This way we fix the > issue *and* maintain backward compatibility by allowing users to > specify "\n" in their paths and username / password pairs. Java > took a different approach and disallowed "\n" completely. To my > understanding fixing ftplib would automatically mean fixing urllib > as well. What would a \n in a path mean? What commands would you send over FTP to successfully retrieve a file (or enter a username or password) containing a newline in the name? In other words, what exactly are we being backward compatible *with*? 
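For the record, the stricter Java-style variant (reject any CR/LF anywhere in an FTP command, rather than only guarding the PORT handling) is small enough to sketch as a subclass, assuming one hooks FTP.putline, which is where every command ultimately goes out on the wire:

    import ftplib

    class StrictFTP(ftplib.FTP):
        # Illustration of the blanket approach, not the proposed ftplib patch:
        # refuse to send any command line that embeds a CR or LF.
        def putline(self, line):
            if '\r' in line or '\n' in line:
                raise ValueError('illegal newline in FTP command: %r' % (line,))
            super().putline(line)

The cost is exactly the backward-compatibility question raised above: a genuine "\n" in a path or password becomes impossible to send.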
From status at bugs.python.org Fri Jul 21 12:09:12 2017 From: status at bugs.python.org (Python tracker) Date: Fri, 21 Jul 2017 18:09:12 +0200 (CEST) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20170721160912.72EA711A869@psf.upfronthosting.co.za> ACTIVITY SUMMARY (2017-07-14 - 2017-07-21) Python tracker at http://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. Issues counts and deltas: open 6058 (+16) closed 36679 (+38) total 42737 (+54) Open issues with patches: 2343 Issues opened (37) ================== #19896: Exposing "q" and "Q" to multiprocessing.sharedctypes http://bugs.python.org/issue19896 reopened by haypo #30450: Pull Windows dependencies from GitHub rather than svn.python.o http://bugs.python.org/issue30450 reopened by steve.dower #30931: Race condition in asyncore may access the wrong dispatcher http://bugs.python.org/issue30931 opened by walkhour #30934: Document how to run coverage for repository idlelib files. http://bugs.python.org/issue30934 opened by terry.reedy #30935: document the new behavior of get_event_loop() in Python 3.6 http://bugs.python.org/issue30935 opened by chris.jerdonek #30937: csv module examples miss newline='' when opening files http://bugs.python.org/issue30937 opened by Pavel #30938: pdb lacks debugger command to list and show all user-defined v http://bugs.python.org/issue30938 opened by David Rieger #30939: Sphinx 1.6.3 deprecation warning for sphinx.util.compat.Direct http://bugs.python.org/issue30939 opened by ned.deily #30940: Documentation for round() is incorrect. http://bugs.python.org/issue30940 opened by George K #30944: Python 32 bit install fails on Windows - BitDefender false pos http://bugs.python.org/issue30944 opened by Arie van Wingerden #30945: loop.create_server does not detect if the interface is IPv6 en http://bugs.python.org/issue30945 opened by cecton #30947: Update embeded copy of libexpat to 2.2.2 http://bugs.python.org/issue30947 opened by haypo #30949: Provide assertion functions in unittest.mock http://bugs.python.org/issue30949 opened by odd_bloke #30950: Convert round() to Arument Clinic http://bugs.python.org/issue30950 opened by serhiy.storchaka #30951: Documentation error in inspect module http://bugs.python.org/issue30951 opened by jalexvig #30952: include Math extension in SQlite http://bugs.python.org/issue30952 opened by Big Stone #30953: Fatal python error when jumping into except clause http://bugs.python.org/issue30953 opened by ppperry #30956: ftplib behaves oddly if socket timeout is greater than the def http://bugs.python.org/issue30956 opened by arloclarke #30959: Constructor signature is duplicated in the help of namedtuples http://bugs.python.org/issue30959 opened by serhiy.storchaka #30962: Add caching to logging.Logger.isEnabledFor() http://bugs.python.org/issue30962 opened by aviso #30963: xxlimited.c XxoObject_Check should be XxoObject_CheckExact http://bugs.python.org/issue30963 opened by Jim.Jewett #30964: Mention ensurepip in package installation docs http://bugs.python.org/issue30964 opened by ncoghlan #30966: multiprocessing.queues.SimpleQueue leaks 2 fds http://bugs.python.org/issue30966 opened by arigo #30967: Crash in PyThread_ReInitTLS() in the child process after os.fo http://bugs.python.org/issue30967 opened by Thomas Mortensson #30969: Docs should say that `x is z or x == z` is used for `x in y` i http://bugs.python.org/issue30969 opened by ztane #30971: Improve code readability of json.tool 
http://bugs.python.org/issue30971 opened by dhimmel #30972: Event loop incorrectly inherited in child processes. http://bugs.python.org/issue30972 opened by Elvis.Pranskevichus #30974: Update os.samefile docstring to match documentation http://bugs.python.org/issue30974 opened by eMPee584 #30975: multiprocessing.Condition.notify_all() blocks indefinitely if http://bugs.python.org/issue30975 opened by mickp #30977: reduce uuid.UUID() memory footprint http://bugs.python.org/issue30977 opened by wbolster #30978: str.format_map() silences exceptions in __getitem__ http://bugs.python.org/issue30978 opened by Akuli #30979: the winapi fails to run shortcuts (because considers a shortcu http://bugs.python.org/issue30979 opened by Bern??t G??bor #30980: Calling asyncore.file_wrapper.close twice may close unrelated http://bugs.python.org/issue30980 opened by Nir Soffer #30981: IDLE: Add config dialog font page tests http://bugs.python.org/issue30981 opened by terry.reedy #30982: AMD64 Windows8.1 Refleaks 3.x: compilation error, cannot open http://bugs.python.org/issue30982 opened by haypo #30983: eval frame rename in pep 0523 broke gdb's python extension http://bugs.python.org/issue30983 opened by bcap #30984: traceback.print_exc return value documentation http://bugs.python.org/issue30984 opened by Jelle Zijlstra Most recent 15 issues with no replies (15) ========================================== #30984: traceback.print_exc return value documentation http://bugs.python.org/issue30984 #30983: eval frame rename in pep 0523 broke gdb's python extension http://bugs.python.org/issue30983 #30980: Calling asyncore.file_wrapper.close twice may close unrelated http://bugs.python.org/issue30980 #30972: Event loop incorrectly inherited in child processes. http://bugs.python.org/issue30972 #30963: xxlimited.c XxoObject_Check should be XxoObject_CheckExact http://bugs.python.org/issue30963 #30952: include Math extension in SQlite http://bugs.python.org/issue30952 #30950: Convert round() to Arument Clinic http://bugs.python.org/issue30950 #30938: pdb lacks debugger command to list and show all user-defined v http://bugs.python.org/issue30938 #30937: csv module examples miss newline='' when opening files http://bugs.python.org/issue30937 #30935: document the new behavior of get_event_loop() in Python 3.6 http://bugs.python.org/issue30935 #30926: KeyError with cgitb inspecting exception in generator expressi http://bugs.python.org/issue30926 #30925: RPM build lacks ability to include other files similar to doc_ http://bugs.python.org/issue30925 #30924: RPM build doc_files needs files separated into separate lines http://bugs.python.org/issue30924 #30915: distutils sometimes assumes wrong C compiler http://bugs.python.org/issue30915 #30912: python 3 git master fails to find libffi and build _ctypes on http://bugs.python.org/issue30912 Most recent 15 issues waiting for review (15) ============================================= #30978: str.format_map() silences exceptions in __getitem__ http://bugs.python.org/issue30978 #30971: Improve code readability of json.tool http://bugs.python.org/issue30971 #30950: Convert round() to Arument Clinic http://bugs.python.org/issue30950 #30945: loop.create_server does not detect if the interface is IPv6 en http://bugs.python.org/issue30945 #30939: Sphinx 1.6.3 deprecation warning for sphinx.util.compat.Direct http://bugs.python.org/issue30939 #30937: csv module examples miss newline='' when opening files http://bugs.python.org/issue30937 #30919: Shared Array Memory Allocation 
Regression http://bugs.python.org/issue30919 #30891: importlib: _find_and_load() race condition on sys.modules[name http://bugs.python.org/issue30891 #30877: possibe typo in json/scanner.py http://bugs.python.org/issue30877 #30876: SystemError on importing module from unloaded package http://bugs.python.org/issue30876 #30872: Update curses docs to Python 3 http://bugs.python.org/issue30872 #30867: Add necessary macro that insure `HAVE_OPENSSL_VERIFY_PARAM` to http://bugs.python.org/issue30867 #30863: Rewrite PyUnicode_AsWideChar() and PyUnicode_AsWideCharString( http://bugs.python.org/issue30863 #30860: Consolidate stateful C globals under a single struct. http://bugs.python.org/issue30860 #30828: Out of bounds write in _asyncio_Future_remove_done_callback http://bugs.python.org/issue30828 Top 10 most discussed issues (10) ================================= #28638: Optimize namedtuple creation http://bugs.python.org/issue28638 33 msgs #30450: Pull Windows dependencies from GitHub rather than svn.python.o http://bugs.python.org/issue30450 29 msgs #30940: Documentation for round() is incorrect. http://bugs.python.org/issue30940 11 msgs #30822: Python implementation of datetime module is not being tested c http://bugs.python.org/issue30822 10 msgs #18558: Iterable glossary entry needs clarification http://bugs.python.org/issue18558 9 msgs #30919: Shared Array Memory Allocation Regression http://bugs.python.org/issue30919 9 msgs #30934: Document how to run coverage for repository idlelib files. http://bugs.python.org/issue30934 9 msgs #19896: Exposing "q" and "Q" to multiprocessing.sharedctypes http://bugs.python.org/issue19896 8 msgs #25988: collections.abc.Indexable http://bugs.python.org/issue25988 8 msgs #30981: IDLE: Add config dialog font page tests http://bugs.python.org/issue30981 7 msgs Issues closed (37) ================== #11230: "Full unicode import system" not in 3.2 http://bugs.python.org/issue11230 closed by serhiy.storchaka #23267: multiprocessing pool.py doesn't close inqueue and outqueue pip http://bugs.python.org/issue23267 closed by pitrou #26781: os.walk max_depth http://bugs.python.org/issue26781 closed by rhettinger #28523: Idlelib.configdialog: use 'color' insteadof 'colour' http://bugs.python.org/issue28523 closed by terry.reedy #30466: Tutorial doesn't explain the use of classes http://bugs.python.org/issue30466 closed by Mariatta #30490: Allow pass an exception to the Event.set method http://bugs.python.org/issue30490 closed by pfreixes #30585: [security][3.3] Backport smtplib fix for TLS stripping vulnera http://bugs.python.org/issue30585 closed by ned.deily #30694: Update embedded copy of expat to 2.2.1 http://bugs.python.org/issue30694 closed by ned.deily #30730: [security] Injecting environment variable in subprocess on Win http://bugs.python.org/issue30730 closed by serhiy.storchaka #30794: Add multiprocessing.Process.kill() http://bugs.python.org/issue30794 closed by pitrou #30797: ./pyconfig.h:1438:0: warning: "_GNU_SOURCE" redefined [enabled http://bugs.python.org/issue30797 closed by ned.deily #30808: Use _Py_atomic API for concurrency-sensitive signal state http://bugs.python.org/issue30808 closed by pitrou #30836: test_c_locale_coercion fails on AIX http://bugs.python.org/issue30836 closed by ncoghlan #30870: IDLE: configdialog/fonts: change font when select by key up/do http://bugs.python.org/issue30870 closed by terry.reedy #30883: test_urllib2net failed on s390x Debian 3.6: ftp.debian.org err http://bugs.python.org/issue30883 closed by berker.peksag 
#30917: IDLE: Add idlelib.config.IdleConf unittest http://bugs.python.org/issue30917 closed by terry.reedy #30920: Sequence Matcher from diff lib is not implementing longest com http://bugs.python.org/issue30920 closed by terry.reedy #30929: AttributeErrors after import in multithreaded environment http://bugs.python.org/issue30929 closed by larry #30932: Identity comparison ("is") fails for floats in Python3 but wor http://bugs.python.org/issue30932 closed by steven.daprano #30933: Python not IEEE 754 float compliant for various zero operation http://bugs.python.org/issue30933 closed by r.david.murray #30936: json module ref leaks detected by test_json http://bugs.python.org/issue30936 closed by serhiy.storchaka #30941: Missing line in example program http://bugs.python.org/issue30941 closed by r.david.murray #30942: Implement lwalk(levelwalk) function on os.py http://bugs.python.org/issue30942 closed by r.david.murray #30943: printf-style Bytes Formatting sometimes do not worked. http://bugs.python.org/issue30943 closed by r.david.murray #30946: readline module has obsolete code http://bugs.python.org/issue30946 closed by pitrou #30948: Docs for __subclasses__(): Add hint that Python imports are la http://bugs.python.org/issue30948 closed by r.david.murray #30954: backport unittest.TestCase.assertLogs to 2.7? http://bugs.python.org/issue30954 closed by r.david.murray #30955: \\N in f-string causes next { to be literal if not escaped http://bugs.python.org/issue30955 closed by ned.deily #30957: pathlib: Path and PurePath cannot be subclassed http://bugs.python.org/issue30957 closed by serhiy.storchaka #30958: Scripts folder is empty http://bugs.python.org/issue30958 closed by ned.deily #30960: Python script is failing to run http://bugs.python.org/issue30960 closed by serhiy.storchaka #30961: Py_Decref a borrowed reference in _tracemalloc http://bugs.python.org/issue30961 closed by xiang.zhang #30965: Unexpected behavior of operator "in" http://bugs.python.org/issue30965 closed by ammar2 #30968: test_get_font (idlelib.idle_test.test_config.IdleConfTest) fai http://bugs.python.org/issue30968 closed by terry.reedy #30970: return-value of filecmp.dircmp.report_* http://bugs.python.org/issue30970 closed by r.david.murray #30973: Regular expression "hangs" interpreter http://bugs.python.org/issue30973 closed by r.david.murray #30976: multiprocessing.Process.is_alive can show True for dead proces http://bugs.python.org/issue30976 closed by mickp From brett at python.org Fri Jul 21 12:52:25 2017 From: brett at python.org (Brett Cannon) Date: Fri, 21 Jul 2017 16:52:25 +0000 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: On Thu, 20 Jul 2017 at 22:11 Chris Jerdonek wrote: > On Thu, Jul 20, 2017 at 8:49 PM, Nick Coghlan wrote: > > ... > > * Lazy loading can have a significant impact on startup time, as it > > means you don't have to pay for the cost of finding and loading > > modules that you don't actually end up using on that particular run > It should be mentioned that I have started designing an API to make using lazy loading much easier in Python 3.7 (i.e. "calling a single function" easier), but I still have to write the tests and such before I propose a patch and it will still be mainly for apps that know what they are doing since lazy loading makes debugging import errors harder. 
> > > > We've historically resisted adopting these techniques for the standard > > library because they *do* make things more complicated *and* harder to > > debug relative to plain old eagerly imported dynamic Python code. > > However, if we're going to recommend them as good practices for 3rd > > party developers looking to optimise the startup time of their Python > > applications, then it makes sense for us to embrace them for the > > standard library as well, rather than having our first reaction be to > > write more hand-crafted C code. > > Are there any good write-ups of best practices and techniques in this > area for applications (other than obvious things like avoiding > unnecessary imports)? I'm thinking of things like how to structure > your project, things to look for, developer tools that might help, and > perhaps third-party runtime libraries? > Nothing beyond "profile your application" and "don't do stuff during import as a side-effect" that I'm aware of. -Brett > > --Chris > > > > > > > On that last point, it's also worth keeping in mind that we have a > > much harder time finding new C-level contributors than we do new > > Python-level ones, and have every reason to expect that problem to get > > worse over time rather than better (since writing and maintaining > > handcrafted C code is likely to go the way of writing and maintaining > > handcrafted assembly code as a skillset: while it will still be > > genuinely necessary in some contexts, it will also be an increasingly > > niche technical specialty). > > > > Starting to migrate to using Cython for our acceleration modules > > instead of plain C should thus prove to be a win for everyone: > > > > - Cython structurally avoids a lot of typical bugs that arise in > > hand-coded extensions (e.g. refcount bugs) > > - by design, it's much easier to mentally switch between Python & > > Cython than it is between Python & C > > - Cython accelerated modules are easier to adapt to other interpeter > > implementations than handcrafted C modules > > - keeping Python modules and their C accelerated counterparts in sync > > will be easier, as they'll mostly be using the same code > > - we'd be able to start writing C API test cases in Cython rather than > > in handcrafted C (which currently mostly translates to only testing > > them indirectly) > > - CPython's own test suite would naturally help test Cython > > compatibility with any C API updates > > - we'd have an inherent incentive to help enhance Cython to take > > advantage of new C API features > > > > The are some genuine downsides in increasing the complexity of > > bootstrapping CPython when all you're starting with is a VCS clone and > > a C compiler, but those complications are ultimately no worse than > > those we already have with Argument Clinic, and hence amenable to the > > same solution: if we need to, we can check in the generated C files in > > order to make bootstrapping easier. > > > > Cheers, > > Nick. 
> > > > -- > > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > > _______________________________________________ > > Python-Dev mailing list > > Python-Dev at python.org > > https://mail.python.org/mailman/listinfo/python-dev > > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/chris.jerdonek%40gmail.com > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/brett%40python.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From raymond.hettinger at gmail.com Fri Jul 21 15:32:43 2017 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Fri, 21 Jul 2017 12:32:43 -0700 Subject: [Python-Dev] Need help to fix urllib(.parse) vulnerabilities In-Reply-To: References: Message-ID: > On Jul 21, 2017, at 3:45 AM, Victor Stinner wrote: > > Ok, I more concrete problem. To fix the "urllib FTP" bug, we have to > find a balance between security (reject any URL looking like an > attempt to counter the security protections) and backward > compatibility (accept filenames containing newlines). For this case, the balance should probably tilt more towards security than backwards compatibility. I would be very concerned about such odd URLs. That said, if backwards compatibility is going to be broken, consider giving users a temporary, clean way to opt-out of the additional projections (don't want to leave them high and dry if they happen to have a legitimate use case). Raymond From stefan_ml at behnel.de Fri Jul 21 16:43:36 2017 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 21 Jul 2017 22:43:36 +0200 Subject: [Python-Dev] Cython compiled stdlib modules - Re: Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> Message-ID: Nick Coghlan schrieb am 21.07.2017 um 08:23: > I'll also note that in these cases where the import overhead is > proportionally significant for always-imported modules, we may want to look > at the benefits of freezing them (if they otherwise remain as pure Python > modules), or compiling them as builtin modules (if we switch them over to > Cython), in addition to looking at ways to make the modules themselves > faster. Just for the sake of it, I gave the Cython compilation a try. I had to apply the attached hack to Lib/typing.py to get the test passing, because it uses frame call offsets in some places and Cython functions do not create frames when being called (they only create them for exception traces). I also had to disable the import of "abc" in the Cython generated module to remove the circular self dependency at startup when the "abc" module is compiled. That shouldn't have an impact on the runtime performance, though. Note that this is otherwise using the unmodified Python code, as provided in the current modules, constructing and using normal Python classes for everything, no extension types etc. Only two stdlib Python modules were compiled into shared libraries, and not statically linked into the CPython core. I used the "python_startup" benchmark in the "performance" project to measure the overall startup times of a clean non-debug non-pgo build of CPython 3.7 (rev d0969d6) against the same build with a compiled typing.py and abc.py. 
To compile these modules, I used the following command (plus the attached patch) $ cythonize -3 -X binding=True -i Lib/typing.py Lib/abc.py I modified the startup benchmark to run "python -c 'import typing'" etc. instead of just executing "pass". - stock CPython starting up and running "pass": Mean +- std dev: 14.7 ms +- 0.3 ms - stock CPython starting up and running "import abc": Mean +- std dev: 14.8 ms +- 0.3 ms - with compiled abc.py: Mean +- std dev: 14.9 ms +- 0.3 ms - stock CPython starting up and running "import typing": Mean +- std dev: 34.6 ms +- 1.0 ms - with compiled abc.py Mean +- std dev: 34.4 ms +- 0.6 ms - with compiled typing.py: Mean +- std dev: 33.5 ms +- 0.7 ms - with both compiled: Mean +- std dev: 33.1 ms +- 0.4 ms That's only a 4% improvement in the overall startup time on my machine, and about a 7% faster overall runtime of "import typing" compared to "pass". Note also that compiling abc.py leads to a slightly *increased* startup time in the "import abc" case, which might be due to the larger file size of the abc.so file compared to the abc.pyc file. This is amortised by the decreased runtime in the "import typing" case (I guess). I then ran the test suites for both modules in lack of a better post-startup runtime benchmark. The improvement for abc.py is in the order of 1-2%, but test_typing.py has many more tests and wins about 13% overall: - stock CPython executing essentially "runner.run(deepcopy(suite))" in "test_typing.py" (the deepcopy() takes about 6 ms): Mean +- std dev: 68.6 ms +- 0.8 ms - compiled abc.py and typing.py: Mean +- std dev: 60.7 ms +- 0.7 ms One more thing to note: the compiled modules are quite large. I get these file sizes: 8658 Lib/abc.py 7525 Lib/__pycache__/abc.cpython-37.pyc 369930 Lib/abc.c 122048 Lib/abc.cpython-37m-x86_64-linux-gnu.so 80290 Lib/typing.py 73921 Lib/__pycache__/typing.cpython-37.pyc 2951893 Lib/typing.c 1182632 Lib/typing.cpython-37m-x86_64-linux-gnu.so The .so files are about 16x as large as the .pyc files. The typing.so file weighs in with about 40% of the size of the stripped python binary: 2889136 python As it stands, the gain is probably not worth the increase in library file size, which also translates to a higher bottom line for the memory consumption. At least not for these two modules. Manually optimising the files would likely also reduce the .so file size in addition to giving better speedups, though, because the generated code would become less generic. Stefan -------------- next part -------------- A non-text attachment was scrubbed... Name: typing_frames.patch Type: text/x-patch Size: 1439 bytes Desc: not available URL: From brett at python.org Fri Jul 21 17:28:05 2017 From: brett at python.org (Brett Cannon) Date: Fri, 21 Jul 2017 21:28:05 +0000 Subject: [Python-Dev] Appending a link back to bugs.python.org in GitHub PRs Message-ID: Thanks to Kushal Das we now have one of the most requested features since the transition: a link in PRs back to bugs.python.org (in a more discoverable way since we have had them since Bedevere launched :) . When a pull request comes in with an issue number in the title (or one gets added), a link to bugs.python.org will be appended to the PR's body (the message you fill out when creating a PR). There's no logic to remove the link if the issue number is removed from the title, changed, or for multiple issue numbers since basically those cases are all rare and it was easier to launch without that kind of support. 
P.S.: Berker Peksag is working on providing commit emails with diffs in them which is the other most requested feature since the transition. -------------- next part -------------- An HTML attachment was scrubbed... URL: From barry at python.org Fri Jul 21 18:21:00 2017 From: barry at python.org (Barry Warsaw) Date: Fri, 21 Jul 2017 18:21:00 -0400 Subject: [Python-Dev] Python startup time In-Reply-To: <874lu62frj.fsf@vostro.rath.org> References: <20170719132211.GA30782@phdru.name> <874lu62frj.fsf@vostro.rath.org> Message-ID: <20170721182100.6da92da1@presto> On Jul 21, 2017, at 01:25 PM, Nikolaus Rath wrote: >That is what Emacs does, and it causes them a lot of trouble. They're >trying to move away from it at the moment, but the direction is not yet >clear. The keyword is "unexec", and it wrecks havoc with malloc. Emacs has been unexec'ing for as long as I can remember (which is longer than I can remember Python :). I know that it's been problematic and there have been many efforts over the years to replace it, but I think it's been a fairly successful technique in practice, at least on platforms that support it. That's another problem with the approach of course; it's not universally possible to implement. -Barry From barry at python.org Fri Jul 21 18:28:04 2017 From: barry at python.org (Barry Warsaw) Date: Fri, 21 Jul 2017 18:28:04 -0400 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: <20170721182804.492a8ab8@presto> On Jul 20, 2017, at 03:16 PM, Eric Snow wrote: >Relatedly, at PyCon this year Barry and I were talking about the idea of >bootstrapping the interpreter from a memory snapshot on disk, rather than >from scatch (thus drastically reducing the number of IO events). The TPI (Terrible Python Idea) I had at Pycon was some kind of (local) memcached of imported Python modules, which would theoretically allow avoiding loading the modules from the file system on start up. There would be all kinds of problems with this (i.e. putting the "terrible" in TPI), such as having to deal with module import side-effects, but perhaps those could be handled by enough APIs and engineering. Cheers, -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 801 bytes Desc: OpenPGP digital signature URL: From skip.montanaro at gmail.com Fri Jul 21 18:34:03 2017 From: skip.montanaro at gmail.com (Skip Montanaro) Date: Fri, 21 Jul 2017 17:34:03 -0500 Subject: [Python-Dev] Python startup time In-Reply-To: <20170721182100.6da92da1@presto> References: <20170719132211.GA30782@phdru.name> <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> Message-ID: Emacs has been unexec'ing for as long as I can remember (which is longer than I can remember Python :). I know that it's been problematic and there have been many efforts over the years to replace it, but I think it's been a fairly successful technique in practice, at least on platforms that support it. I've been using Emacs far longer than Python. I remember having to invoke temacs on something. Still, if I didn't know better, I could be convinced you were referring to the GIL. :-) Skip -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From mertz at gnosis.cx Fri Jul 21 18:53:32 2017 From: mertz at gnosis.cx (David Mertz) Date: Fri, 21 Jul 2017 15:53:32 -0700 Subject: [Python-Dev] Python startup time In-Reply-To: <20170721182100.6da92da1@presto> References: <20170719132211.GA30782@phdru.name> <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> Message-ID: I would guess that Windows users don't tend to run lots of command line tools where startup time dominates, as *nix users do. On Fri, Jul 21, 2017 at 3:21 PM, Barry Warsaw wrote: > On Jul 21, 2017, at 01:25 PM, Nikolaus Rath wrote: > > >That is what Emacs does, and it causes them a lot of trouble. They're > >trying to move away from it at the moment, but the direction is not yet > >clear. The keyword is "unexec", and it wrecks havoc with malloc. > > Emacs has been unexec'ing for as long as I can remember (which is longer > than > I can remember Python :). I know that it's been problematic and there have > been many efforts over the years to replace it, but I think it's been a > fairly > successful technique in practice, at least on platforms that support it. > That's another problem with the approach of course; it's not universally > possible to implement. > > -Barry > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > mertz%40gnosis.cx > -- Keeping medicines from the bloodstreams of the sick; food from the bellies of the hungry; books from the hands of the uneducated; technology from the underdeveloped; and putting advocates of freedom in prisons. Intellectual property is to the 21st century what the slave trade was to the 16th. -------------- next part -------------- An HTML attachment was scrubbed... URL: From larry at hastings.org Fri Jul 21 22:59:08 2017 From: larry at hastings.org (Larry Hastings) Date: Fri, 21 Jul 2017 19:59:08 -0700 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: <20170721182804.492a8ab8@presto> References: <20170721182804.492a8ab8@presto> Message-ID: <4735c42d-3a8c-81bc-00f6-e1d2ecec5719@hastings.org> On 07/21/2017 03:28 PM, Barry Warsaw wrote: > The TPI (Terrible Python Idea) I had at Pycon was some kind of (local) > memcached of imported Python modules, which would theoretically allow avoiding > loading the modules from the file system on start up. > > There would be all kinds of problems with this (i.e. putting the "terrible" in > TPI), such as having to deal with module import side-effects, but perhaps > those could be handled by enough APIs and engineering. This would be taking a page out of PHP's book. PHP--or at least, PHP ten years ago--doesn't have the equivalent of .pyc files. If you have mod_php running inside Apache with no other extensions, it literally tokenizes each .php script every time it's invoked. To solve this performance problem, someone wrote the "Alternative PHP Cache" (or "APC"), which runs /in Apache/. (Yep, it's not usable outside Apache!) APC stores the tokenized versions of PHP scripts in something approximating their actual in-memory representation. To use something stored in the cache, you'd iterate over all the variables/functions, copy each one into your local interpreter instance, and perform fixups on all the pointers to convert them from relative to absolute addresses. 
http://php.net/manual/en/book.apc.php I note that the introduction to APC says: Warning This extension is considered unmaintained and dead. However, the source code for this extension is still available within PECL. So perhaps the PHP folks have moved on from this technique. //arry/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.jerdonek at gmail.com Sat Jul 22 01:18:02 2017 From: chris.jerdonek at gmail.com (Chris Jerdonek) Date: Fri, 21 Jul 2017 22:18:02 -0700 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: On Fri, Jul 21, 2017 at 9:52 AM, Brett Cannon wrote: > On Thu, 20 Jul 2017 at 22:11 Chris Jerdonek > wrote: >> On Thu, Jul 20, 2017 at 8:49 PM, Nick Coghlan wrote: >> > ... >> > * Lazy loading can have a significant impact on startup time, as it >> > means you don't have to pay for the cost of finding and loading >> > modules that you don't actually end up using on that particular run > > It should be mentioned that I have started designing an API to make using > lazy loading much easier in Python 3.7 (i.e. "calling a single function" > easier), but I still have to write the tests and such before I propose a > patch and it will still be mainly for apps that know what they are doing > since lazy loading makes debugging import errors harder. > ... >> > However, if we're going to recommend them as good practices for 3rd >> > party developers looking to optimise the startup time of their Python >> > applications, then it makes sense for us to embrace them for the >> > standard library as well, rather than having our first reaction be to >> > write more hand-crafted C code. >> >> Are there any good write-ups of best practices and techniques in this >> area for applications (other than obvious things like avoiding >> unnecessary imports)? I'm thinking of things like how to structure >> your project, things to look for, developer tools that might help, and >> perhaps third-party runtime libraries? > > Nothing beyond "profile your application" and "don't do stuff during import > as a side-effect" that I'm aware of. One "project structure" idea of the sort I had in mind is to move frequently used functions in a module into their own module. This way the most common paths of execution don't load unneeded functions. Following this line of reasoning could lead to grouping functions in an application by when they're needed instead of by what they do, which is different from what we normally see. I don't recall seeing advice like this anywhere, so maybe the trade-offs aren't worth it. Thoughts? --Chris > > -Brett > >> >> >> --Chris >> >> >> >> > >> > On that last point, it's also worth keeping in mind that we have a >> > much harder time finding new C-level contributors than we do new >> > Python-level ones, and have every reason to expect that problem to get >> > worse over time rather than better (since writing and maintaining >> > handcrafted C code is likely to go the way of writing and maintaining >> > handcrafted assembly code as a skillset: while it will still be >> > genuinely necessary in some contexts, it will also be an increasingly >> > niche technical specialty). >> > >> > Starting to migrate to using Cython for our acceleration modules >> > instead of plain C should thus prove to be a win for everyone: >> > >> > - Cython structurally avoids a lot of typical bugs that arise in >> > hand-coded extensions (e.g. 
refcount bugs) >> > - by design, it's much easier to mentally switch between Python & >> > Cython than it is between Python & C >> > - Cython accelerated modules are easier to adapt to other interpeter >> > implementations than handcrafted C modules >> > - keeping Python modules and their C accelerated counterparts in sync >> > will be easier, as they'll mostly be using the same code >> > - we'd be able to start writing C API test cases in Cython rather than >> > in handcrafted C (which currently mostly translates to only testing >> > them indirectly) >> > - CPython's own test suite would naturally help test Cython >> > compatibility with any C API updates >> > - we'd have an inherent incentive to help enhance Cython to take >> > advantage of new C API features >> > >> > The are some genuine downsides in increasing the complexity of >> > bootstrapping CPython when all you're starting with is a VCS clone and >> > a C compiler, but those complications are ultimately no worse than >> > those we already have with Argument Clinic, and hence amenable to the >> > same solution: if we need to, we can check in the generated C files in >> > order to make bootstrapping easier. >> > >> > Cheers, >> > Nick. >> > >> > -- >> > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia >> > _______________________________________________ >> > Python-Dev mailing list >> > Python-Dev at python.org >> > https://mail.python.org/mailman/listinfo/python-dev >> > Unsubscribe: >> > https://mail.python.org/mailman/options/python-dev/chris.jerdonek%40gmail.com >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: >> https://mail.python.org/mailman/options/python-dev/brett%40python.org
From storchaka at gmail.com Sat Jul 22 02:01:57 2017 From: storchaka at gmail.com (Serhiy Storchaka) Date: Sat, 22 Jul 2017 09:01:57 +0300 Subject: [Python-Dev] Need help to fix urllib(.parse) vulnerabilities In-Reply-To: References: Message-ID: 21.07.17 13:02, Victor Stinner ????: > Recently, two security vulnerabilities were reported in the urllib module: > > https://bugs.python.org/issue30500 > http://python-security.readthedocs.io/vuln/bpo-30500_urllib_connects_to_a_wrong_host.html#bpo-30500-urllib-connects-to-a-wrong-host > => already fixed in Python 3.6.2 > > https://bugs.python.org/issue29606 > http://python-security.readthedocs.io/vuln/urllib_ftp_protocol_stream_injection.html#urllib-ftp-protocol-stream-injection > => not fixed yet > > I also proposed a more general protection: "Reject newline character > (U+000A) in URLs in urllib.parse": > http://bugs.python.org/issue30713 > > The problem with the urllib module is how we handle invalid URLs. Right > now, we return the URL unmodified if we cannot parse it. Should we > raise an exception if a URL contains a newline for example? > > It's very hard to harden the urllib module without the backward > compatibility. That's why it took 3 weeks to fix "urllib connects to a > wrong host": find how to fix the vulnerability without breaking the > backward compatibility. > > Another proposed approach is to reject invalid data earlier or later, > but not in urllib... Checking a URL in urllib.parse is too early and not enough. The urllib module is general, and different protocols have different limitations. There are other ways besides urllib to pass invalid parameters to low-level protocol implementations. I think the only reliable way of fixing the vulnerability is rejecting or escaping (as specified in RFC 2640) CR and LF inside sent lines. Adding the support of RFC 2640 is a new feature and can be added only in 3.7. And this feature should be optional since not all servers support RFC 2640. https://github.com/python/cpython/pull/1214 does the right thing. The other way of hardening the Python stdlib FTP implementation is making it accept only CRLF as a line delimiter, not sole CR or LF. Additional sanity checks can be added in FTP.login() for detecting problems earlier and raising more specific errors. Every protocol (FTP, HTTP, telnet, SMTP, POP3, IMAP, etc) should be fixed separately. If a protocol allows escaping special characters, they should be escaped. Otherwise such characters should be rejected.
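As a rough illustration of the rejecting option (a sketch only, not the actual ftplib patch), the check belongs right where a command line is put on the wire:

    CRLF = '\r\n'

    def send_cmd(sock, line):
        # Refuse to send a command line with an embedded CR or LF, since
        # either character would let an attacker smuggle extra commands
        # into the control connection.
        if '\r' in line or '\n' in line:
            raise ValueError('newline characters are not allowed in FTP commands')
        sock.sendall((line + CRLF).encode('latin-1'))

An escaping variant would instead rewrite the offending characters (per RFC 2640) before sending, behind an opt-in flag.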
From p.f.moore at gmail.com Sat Jul 22 04:13:48 2017 From: p.f.moore at gmail.com (Paul Moore) Date: Sat, 22 Jul 2017 09:13:48 +0100 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> Message-ID: On 21 July 2017 at 23:53, David Mertz wrote: > I would guess that Windows users don't tend to run lots of command line > tools where startup time dominates, as *nix users do. Well, in the sense that many Windows users don't use the command line at all, this is true. However, startup time is a definite problem for Windows users who *do* use the command line, because process creation cost is a lot higher than on Unix, so starting new commands is *already* costly, and therefore minimising additional overhead is crucial. It's a bit of a chicken and egg problem - Windows users avoid excessive command line program invocation because startup time is high, so no-one optimises startup time because Windows users don't use short-lived command line programs. But I'm seeing a trend away from that - more and more Windows tools these days seem to be comfortable spawning subprocesses. I don't know what prompted that trend. Paul From tritium-list at sdamon.com Sat Jul 22 04:37:06 2017 From: tritium-list at sdamon.com (Alex Walters) Date: Sat, 22 Jul 2017 04:37:06 -0400 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> Message-ID: <08a001d302c5$b42542b0$1c6fc810$@sdamon.com> > -----Original Message----- > From: Python-Dev [mailto:python-dev-bounces+tritium- > list=sdamon.com at python.org] On Behalf Of Paul Moore > Sent: Saturday, July 22, 2017 4:14 AM > To: David Mertz > Cc: Barry Warsaw ; Python-Dev dev at python.org> > Subject: Re: [Python-Dev] Python startup time > It's a bit of a chicken and egg problem - Windows users avoid > excessive command line program invocation because startup time is > high, so no-one optimises startup time because Windows users don't use > short-lived command line programs. But I'm seeing a trend away from > that - more and more Windows tools these days seem to be comfortable > spawning subprocesses. I don't know what prompted that trend. The programs I see that are comfortable spawning processes willy-nilly on windows are mostly .net, which has a lot of the runtime assemblies cached by the OS in the GAC - if you are spawning a second processes of yourself, or something that uses the same libraries as you, the compile step on those can be skipped. Unless you are talking about python/non-.NET programs, in which case, I have no answer. 
> Paul > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/tritium- > list%40sdamon.com
From steve.dower at python.org Sat Jul 22 10:21:07 2017 From: steve.dower at python.org (Steve Dower) Date: Sat, 22 Jul 2017 07:21:07 -0700 Subject: [Python-Dev] Python startup time In-Reply-To: <08a001d302c5$b42542b0$1c6fc810$@sdamon.com> References: <20170719132211.GA30782@phdru.name> <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> <08a001d302c5$b42542b0$1c6fc810$@sdamon.com> Message-ID: I believe the trend is due to languages like Python and Node.js, most of which aggressively discourage threading (more from the broader community than the core languages, but I see a lot of apps using these now), and also the higher reliability afforded by out-of-process tasks (that is, one crash doesn't kill the entire app - e.g. browser tabs). Optimizing startup time is incredibly valuable, and having tried it a few times I believe that the import system (in essence, stat calls) is the biggest culprit. The tens of ms prior to the first user import can't really go anywhere. Cheers, Steve Top-posted from my Windows phone From: Alex Walters Sent: Saturday, July 22, 2017 1:39 Cc: 'Python-Dev' Subject: Re: [Python-Dev] Python startup time > -----Original Message----- > From: Python-Dev [mailto:python-dev-bounces+tritium- > list=sdamon.com at python.org] On Behalf Of Paul Moore > Sent: Saturday, July 22, 2017 4:14 AM > To: David Mertz > Cc: Barry Warsaw ; Python-Dev dev at python.org> > Subject: Re: [Python-Dev] Python startup time > It's a bit of a chicken and egg problem - Windows users avoid > excessive command line program invocation because startup time is > high, so no-one optimises startup time because Windows users don't use > short-lived command line programs. But I'm seeing a trend away from > that - more and more Windows tools these days seem to be comfortable > spawning subprocesses. I don't know what prompted that trend. The programs I see that are comfortable spawning processes willy-nilly on windows are mostly .net, which has a lot of the runtime assemblies cached by the OS in the GAC - if you are spawning a second processes of yourself, or something that uses the same libraries as you, the compile step on those can be skipped. Unless you are talking about python/non-.NET programs, in which case, I have no answer. > Paul > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/tritium- > list%40sdamon.com _______________________________________________ Python-Dev mailing list Python-Dev at python.org https://mail.python.org/mailman/listinfo/python-dev Unsubscribe: https://mail.python.org/mailman/options/python-dev/steve.dower%40python.org -------------- next part -------------- An HTML attachment was scrubbed...
Adding the support of RFC 2640 is a new feature and can be added only in 3.7. And this feature should be optional since not all servers support RFC 2640. https://github.com/python/cpython/pull/1214 does the right thing. In that case, I suggest to reject newlines in ftplib, and maybe add an opt-in option to escape newlines. Java just rejected newlines, no? Or does Java allows to escape them? Victor -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.rodola at gmail.com Sat Jul 22 13:10:15 2017 From: g.rodola at gmail.com (Giampaolo Rodola') Date: Sat, 22 Jul 2017 19:10:15 +0200 Subject: [Python-Dev] Need help to fix urllib(.parse) vulnerabilities In-Reply-To: References: Message-ID: On Sat, Jul 22, 2017 at 6:38 PM, Victor Stinner wrote: > Le 22 juil. 2017 8:04 AM, "Serhiy Storchaka" a > ?crit : > > I think the only reliable way of fixing the vulnerability is rejecting or > escaping (as specified in RFC 2640) CR and LF inside sent lines. Adding the > support of RFC 2640 is a new feature and can be added only in 3.7. And this > feature should be optional since not all servers support RFC 2640. > https://github.com/python/cpython/pull/1214 does the right thing. > > > In that case, I suggest to reject newlines in ftplib, and maybe add an > opt-in option to escape newlines. > > Java just rejected newlines, no? Or does Java allows to escape them? > > Victor > > OK, let's just reject \n then and be done with it. It's a rare use case after all. Java just rejects \n for all commands and does not support escaping (aka RFC 2640). -- Giampaolo - http://grodola.blogspot.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From brett at python.org Sat Jul 22 13:17:05 2017 From: brett at python.org (Brett Cannon) Date: Sat, 22 Jul 2017 17:17:05 +0000 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> <08a001d302c5$b42542b0$1c6fc810$@sdamon.com> Message-ID: On Sat, Jul 22, 2017, 07:22 Steve Dower, wrote: > I believe the trend is due to language like Python and Node.js, most of > which aggressively discourage threading (more from the broader community > than the core languages, but I see a lot of apps using these now), and also > the higher reliability afforded by out-of-process tasks (that is, one crash > doesn?t kill the entire app ? e.g browser tabs). > > > > Optimizing startup time is incredibly valuable, and having tried it a few > times I believe that the import system (in essence, stat calls) is the > biggest culprit. The tens of ms prior to the first user import can?t really > go anywhere. > Stat calls in the import system were optimized in importlib a while back to be cached in finders so at this point you will have to remove a stat call to lower that cost or cache more which goes into breaking abstractions or designing new APIs. 
-brett > > Cheers, > > Steve > > > > Top-posted from my Windows phone > > > > *From: *Alex Walters > *Sent: *Saturday, July 22, 2017 1:39 > *Cc: *'Python-Dev' > > > *Subject: *Re: [Python-Dev] Python startup time > > > > > -----Original Message----- > > > From: Python-Dev [mailto:python-dev-bounces+tritium- > > > list=sdamon.com at python.org] On Behalf Of Paul Moore > > > Sent: Saturday, July 22, 2017 4:14 AM > > > To: David Mertz > > > Cc: Barry Warsaw ; Python-Dev > > dev at python.org> > > > Subject: Re: [Python-Dev] Python startup time > > > > > > > It's a bit of a chicken and egg problem - Windows users avoid > > > excessive command line program invocation because startup time is > > > high, so no-one optimises startup time because Windows users don't use > > > short-lived command line programs. But I'm seeing a trend away from > > > that - more and more Windows tools these days seem to be comfortable > > > spawning subprocesses. I don't know what prompted that trend. > > > > The programs I see that are comfortable spawning processes willy-nilly on > > windows are mostly .net, which has a lot of the runtime assemblies cached > by > > the OS in the GAC - if you are spawning a second processes of yourself, or > > something that uses the same libraries as you, the compile step on those > can > > be skipped. Unless you are talking about python/non-.NET programs, in > which > > case, I have no answer. > > > Paul > > > _______________________________________________ > > > Python-Dev mailing list > > > Python-Dev at python.org > > > https://mail.python.org/mailman/listinfo/python-dev > > > Unsubscribe: https://mail.python.org/mailman/options/python-dev/tritium- > > > list%40sdamon.com > > > > _______________________________________________ > > Python-Dev mailing list > > Python-Dev at python.org > > https://mail.python.org/mailman/listinfo/python-dev > > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/steve.dower%40python.org > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/brett%40python.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.rodola at gmail.com Sat Jul 22 13:29:37 2017 From: g.rodola at gmail.com (Giampaolo Rodola') Date: Sat, 22 Jul 2017 19:29:37 +0200 Subject: [Python-Dev] Need help to fix urllib(.parse) vulnerabilities In-Reply-To: References: Message-ID: On Sat, Jul 22, 2017 at 7:10 PM, Giampaolo Rodola' wrote: > > > On Sat, Jul 22, 2017 at 6:38 PM, Victor Stinner > wrote: > >> Le 22 juil. 2017 8:04 AM, "Serhiy Storchaka" a >> ?crit : >> >> I think the only reliable way of fixing the vulnerability is rejecting or >> escaping (as specified in RFC 2640) CR and LF inside sent lines. Adding the >> support of RFC 2640 is a new feature and can be added only in 3.7. And this >> feature should be optional since not all servers support RFC 2640. >> https://github.com/python/cpython/pull/1214 does the right thing. >> >> >> In that case, I suggest to reject newlines in ftplib, and maybe add an >> opt-in option to escape newlines. >> >> Java just rejected newlines, no? Or does Java allows to escape them? >> >> Victor >> >> > OK, let's just reject \n then and be done with it. It's a rare use case > after all. > Java just rejects \n for all commands and does not support escaping (aka > RFC 2640). > I've just merged the PR. 
There's the question whether to backport this to older versions, considering there's a small chance this may break some code/apps, but considering the chance is small and this a security fix I'd probably be +0.5 for backporting it (2.7 + 3.x - not sure up 'till when). -- Giampaolo - http://grodola.blogspot.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From victor.stinner at gmail.com Sat Jul 22 17:47:38 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Sat, 22 Jul 2017 23:47:38 +0200 Subject: [Python-Dev] Need help to fix urllib(.parse) vulnerabilities In-Reply-To: References: Message-ID: I consider that it is a security vulneraibility and so should be fixed in all supported branches including 3.3 and 3.4. If someone is blocked for a legit usecase, an old Python version can be used until we decide how to handle it. I concur with you, I don't think that anyone uses filenames containing newlines on FTP. FTP protocol is text based and uses newlines as the command separator. I expect a lot of not fun issues if someone uses such filename on legit files. Victor -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve.dower at python.org Sat Jul 22 19:35:31 2017 From: steve.dower at python.org (Steve Dower) Date: Sat, 22 Jul 2017 16:35:31 -0700 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> <08a001d302c5$b42542b0$1c6fc810$@sdamon.com> Message-ID: ?Stat calls in the import system were optimized in importlib a while back? Yes, I?m aware of that, which is why I don?t have any specific suggestions off-hand. But given the differences in file systems between Windows and other OSs, it wouldn?t surprise me if there were a more optimal approach for NTFS to amortize calls better. Perhaps not, but it is still the most expensive part of startup that we have any ability to change, so it?s worth investigating. Cheers, Steve Top-posted from my Windows phone From: Brett Cannon Sent: Saturday, July 22, 2017 10:18 To: Steve Dower; Alex Walters Cc: Python-Dev Subject: Re: [Python-Dev] Python startup time On Sat, Jul 22, 2017, 07:22 Steve Dower, wrote: I believe the trend is due to language like Python and Node.js, most of which aggressively discourage threading (more from the broader community than the core languages, but I see a lot of apps using these now), and also the higher reliability afforded by out-of-process tasks (that is, one crash doesn?t kill the entire app ? e.g browser tabs). ? Optimizing startup time is incredibly valuable, and having tried it a few times I believe that the import system (in essence, stat calls) is the biggest culprit. The tens of ms prior to the first user import can?t really go anywhere. Stat calls in the import system were optimized in importlib a while back to be cached in finders so at this point you will have to remove a stat call to lower that cost or cache more which goes into breaking abstractions or designing new APIs. -brett ? Cheers, Steve ? Top-posted from my Windows phone ? From: Alex Walters Sent: Saturday, July 22, 2017 1:39 Cc: 'Python-Dev' Subject: Re: [Python-Dev] Python startup time ? 
> -----Original Message----- > From: Python-Dev [mailto:python-dev-bounces+tritium- > list=sdamon.com at python.org] On Behalf Of Paul Moore > Sent: Saturday, July 22, 2017 4:14 AM > To: David Mertz > Cc: Barry Warsaw ; Python-Dev dev at python.org> > Subject: Re: [Python-Dev] Python startup time ? ? > It's a bit of a chicken and egg problem - Windows users avoid > excessive command line program invocation because startup time is > high, so no-one optimises startup time because Windows users don't use > short-lived command line programs. But I'm seeing a trend away from > that - more and more Windows tools these days seem to be comfortable > spawning subprocesses. I don't know what prompted that trend. ? The programs I see that are comfortable spawning processes willy-nilly on windows are mostly .net, which has a lot of the runtime assemblies cached by the OS in the GAC - if you are spawning a second processes of yourself, or something that uses the same libraries as you, the compile step on those can be skipped.? Unless you are talking about python/non-.NET programs, in which case, I have no answer. > Paul > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/tritium- > list%40sdamon.com ? _______________________________________________ Python-Dev mailing list Python-Dev at python.org https://mail.python.org/mailman/listinfo/python-dev Unsubscribe: https://mail.python.org/mailman/options/python-dev/steve.dower%40python.org ? _______________________________________________ Python-Dev mailing list Python-Dev at python.org https://mail.python.org/mailman/listinfo/python-dev Unsubscribe: https://mail.python.org/mailman/options/python-dev/brett%40python.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From desmoulinmichel at gmail.com Sun Jul 23 03:52:10 2017 From: desmoulinmichel at gmail.com (Michel Desmoulin) Date: Sun, 23 Jul 2017 09:52:10 +0200 Subject: [Python-Dev] Python startup time In-Reply-To: References: <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> <08a001d302c5$b42542b0$1c6fc810$@sdamon.com> Message-ID: <75cc1c12-77cd-ef08-bc8e-69978f5bbd1b@gmail.com> > Optimizing startup time is incredibly valuable, I've been reading that from the beginning of this thread but I've been using python since the 2.4 and I never felt the burden of the startup time. I'm guessing a lot of people are like me, they just don't express them self because "better startup time can't be bad so let's not put a barrier on this". I'm not against it, but since the necessity of a faster Python in general has been a debate for years and is only finally catching up with the work of Victor Stinner, can somebody explain me the deal with start up time ? I understand where it can improve your lives. I just don't get why it's suddenly such an explosion of expectations and needs. From solipsis at pitrou.net Sun Jul 23 04:18:48 2017 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 23 Jul 2017 10:18:48 +0200 Subject: [Python-Dev] Python startup time References: <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> <08a001d302c5$b42542b0$1c6fc810$@sdamon.com> Message-ID: <20170723101848.24f72130@fsol> On Sat, 22 Jul 2017 16:35:31 -0700 Steve Dower wrote: > > Yes, I?m aware of that, which is why I don?t have any specific suggestions off-hand. 
But given the differences in file systems between Windows and other OSs, it wouldn?t surprise me if there were a more optimal approach for NTFS to amortize calls better. Perhaps not, but it is still the most expensive part of startup that we have any ability to change, so it?s worth investigating. Can you expand on it being "the most expensive part of startup that we have any ability to change"? For example, how do Nick's benchmarks above fare on Windows? Regards Antoine. From victor.stinner at gmail.com Sun Jul 23 08:57:59 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Sun, 23 Jul 2017 14:57:59 +0200 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: We already did that. See _bootlocale for example. (Maybe also _collecctions_abc?) Victor Le 22 juil. 2017 07:20, "Chris Jerdonek" a ?crit : > On Fri, Jul 21, 2017 at 9:52 AM, Brett Cannon wrote: > > On Thu, 20 Jul 2017 at 22:11 Chris Jerdonek > > wrote: > >> On Thu, Jul 20, 2017 at 8:49 PM, Nick Coghlan > wrote: > >> > ... > >> > * Lazy loading can have a significant impact on startup time, as it > >> > means you don't have to pay for the cost of finding and loading > >> > modules that you don't actually end up using on that particular run > > > > It should be mentioned that I have started designing an API to make using > > lazy loading much easier in Python 3.7 (i.e. "calling a single function" > > easier), but I still have to write the tests and such before I propose a > > patch and it will still be mainly for apps that know what they are doing > > since lazy loading makes debugging import errors harder. > > ... > >> > However, if we're going to recommend them as good practices for 3rd > >> > party developers looking to optimise the startup time of their Python > >> > applications, then it makes sense for us to embrace them for the > >> > standard library as well, rather than having our first reaction be to > >> > write more hand-crafted C code. > >> > >> Are there any good write-ups of best practices and techniques in this > >> area for applications (other than obvious things like avoiding > >> unnecessary imports)? I'm thinking of things like how to structure > >> your project, things to look for, developer tools that might help, and > >> perhaps third-party runtime libraries? > > > > Nothing beyond "profile your application" and "don't do stuff during > import > > as a side-effect" that I'm aware of. > > One "project structure" idea of the sort I had in mind is to move > frequently used functions in a module into their own module. This way > the most common paths of execution don't load unneeded functions. > Following this line of reasoning could lead to grouping functions in > an application by when they're needed instead of by what they do, > which is different from what we normally see. I don't recall seeing > advice like this anywhere, so maybe the trade-offs aren't worth it. > Thoughts? 
> > --Chris > > > > > > -Brett > > > >> > >> > >> --Chris > >> > >> > >> > >> > > >> > On that last point, it's also worth keeping in mind that we have a > >> > much harder time finding new C-level contributors than we do new > >> > Python-level ones, and have every reason to expect that problem to get > >> > worse over time rather than better (since writing and maintaining > >> > handcrafted C code is likely to go the way of writing and maintaining > >> > handcrafted assembly code as a skillset: while it will still be > >> > genuinely necessary in some contexts, it will also be an increasingly > >> > niche technical specialty). > >> > > >> > Starting to migrate to using Cython for our acceleration modules > >> > instead of plain C should thus prove to be a win for everyone: > >> > > >> > - Cython structurally avoids a lot of typical bugs that arise in > >> > hand-coded extensions (e.g. refcount bugs) > >> > - by design, it's much easier to mentally switch between Python & > >> > Cython than it is between Python & C > >> > - Cython accelerated modules are easier to adapt to other interpeter > >> > implementations than handcrafted C modules > >> > - keeping Python modules and their C accelerated counterparts in sync > >> > will be easier, as they'll mostly be using the same code > >> > - we'd be able to start writing C API test cases in Cython rather than > >> > in handcrafted C (which currently mostly translates to only testing > >> > them indirectly) > >> > - CPython's own test suite would naturally help test Cython > >> > compatibility with any C API updates > >> > - we'd have an inherent incentive to help enhance Cython to take > >> > advantage of new C API features > >> > > >> > The are some genuine downsides in increasing the complexity of > >> > bootstrapping CPython when all you're starting with is a VCS clone and > >> > a C compiler, but those complications are ultimately no worse than > >> > those we already have with Argument Clinic, and hence amenable to the > >> > same solution: if we need to, we can check in the generated C files in > >> > order to make bootstrapping easier. > >> > > >> > Cheers, > >> > Nick. > >> > > >> > -- > >> > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > >> > _______________________________________________ > >> > Python-Dev mailing list > >> > Python-Dev at python.org > >> > https://mail.python.org/mailman/listinfo/python-dev > >> > Unsubscribe: > >> > https://mail.python.org/mailman/options/python-dev/ > chris.jerdonek%40gmail.com > >> _______________________________________________ > >> Python-Dev mailing list > >> Python-Dev at python.org > >> https://mail.python.org/mailman/listinfo/python-dev > >> Unsubscribe: > >> https://mail.python.org/mailman/options/python-dev/brett%40python.org > > On Fri, Jul 21, 2017 at 9:52 AM, Brett Cannon wrote: > > > > > > On Thu, 20 Jul 2017 at 22:11 Chris Jerdonek > > wrote: > >> > >> On Thu, Jul 20, 2017 at 8:49 PM, Nick Coghlan > wrote: > >> > ... > >> > * Lazy loading can have a significant impact on startup time, as it > >> > means you don't have to pay for the cost of finding and loading > >> > modules that you don't actually end up using on that particular run > > > > > > It should be mentioned that I have started designing an API to make using > > lazy loading much easier in Python 3.7 (i.e. 
"calling a single function" > > easier), but I still have to write the tests and such before I propose a > > patch and it will still be mainly for apps that know what they are doing > > since lazy loading makes debugging import errors harder. > > > >> > >> > > >> > We've historically resisted adopting these techniques for the standard > >> > library because they *do* make things more complicated *and* harder to > >> > debug relative to plain old eagerly imported dynamic Python code. > >> > However, if we're going to recommend them as good practices for 3rd > >> > party developers looking to optimise the startup time of their Python > >> > applications, then it makes sense for us to embrace them for the > >> > standard library as well, rather than having our first reaction be to > >> > write more hand-crafted C code. > >> > >> Are there any good write-ups of best practices and techniques in this > >> area for applications (other than obvious things like avoiding > >> unnecessary imports)? I'm thinking of things like how to structure > >> your project, things to look for, developer tools that might help, and > >> perhaps third-party runtime libraries? > > > > > > Nothing beyond "profile your application" and "don't do stuff during > import > > as a side-effect" that I'm aware of. > > > > -Brett > > > >> > >> > >> --Chris > >> > >> > >> > >> > > >> > On that last point, it's also worth keeping in mind that we have a > >> > much harder time finding new C-level contributors than we do new > >> > Python-level ones, and have every reason to expect that problem to get > >> > worse over time rather than better (since writing and maintaining > >> > handcrafted C code is likely to go the way of writing and maintaining > >> > handcrafted assembly code as a skillset: while it will still be > >> > genuinely necessary in some contexts, it will also be an increasingly > >> > niche technical specialty). > >> > > >> > Starting to migrate to using Cython for our acceleration modules > >> > instead of plain C should thus prove to be a win for everyone: > >> > > >> > - Cython structurally avoids a lot of typical bugs that arise in > >> > hand-coded extensions (e.g. refcount bugs) > >> > - by design, it's much easier to mentally switch between Python & > >> > Cython than it is between Python & C > >> > - Cython accelerated modules are easier to adapt to other interpeter > >> > implementations than handcrafted C modules > >> > - keeping Python modules and their C accelerated counterparts in sync > >> > will be easier, as they'll mostly be using the same code > >> > - we'd be able to start writing C API test cases in Cython rather than > >> > in handcrafted C (which currently mostly translates to only testing > >> > them indirectly) > >> > - CPython's own test suite would naturally help test Cython > >> > compatibility with any C API updates > >> > - we'd have an inherent incentive to help enhance Cython to take > >> > advantage of new C API features > >> > > >> > The are some genuine downsides in increasing the complexity of > >> > bootstrapping CPython when all you're starting with is a VCS clone and > >> > a C compiler, but those complications are ultimately no worse than > >> > those we already have with Argument Clinic, and hence amenable to the > >> > same solution: if we need to, we can check in the generated C files in > >> > order to make bootstrapping easier. > >> > > >> > Cheers, > >> > Nick. 
> >> > > >> > -- > >> > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > >> > _______________________________________________ > >> > Python-Dev mailing list > >> > Python-Dev at python.org > >> > https://mail.python.org/mailman/listinfo/python-dev > >> > Unsubscribe: > >> > https://mail.python.org/mailman/options/python-dev/ > chris.jerdonek%40gmail.com > >> _______________________________________________ > >> Python-Dev mailing list > >> Python-Dev at python.org > >> https://mail.python.org/mailman/listinfo/python-dev > >> Unsubscribe: > >> https://mail.python.org/mailman/options/python-dev/brett%40python.org > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > victor.stinner%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From brett at python.org Sun Jul 23 13:36:56 2017 From: brett at python.org (Brett Cannon) Date: Sun, 23 Jul 2017 17:36:56 +0000 Subject: [Python-Dev] Python startup time In-Reply-To: <75cc1c12-77cd-ef08-bc8e-69978f5bbd1b@gmail.com> References: <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> <08a001d302c5$b42542b0$1c6fc810$@sdamon.com> <75cc1c12-77cd-ef08-bc8e-69978f5bbd1b@gmail.com> Message-ID: On Sun, Jul 23, 2017, 00:53 Michel Desmoulin, wrote: > > > > Optimizing startup time is incredibly valuable, > > I've been reading that from the beginning of this thread but I've been > using python since the 2.4 and I never felt the burden of the startup time. > > I'm guessing a lot of people are like me, they just don't express them > self because "better startup time can't be bad so let's not put a > barrier on this". > > I'm not against it, but since the necessity of a faster Python in > general has been a debate for years and is only finally catching up with > the work of Victor Stinner, can somebody explain me the deal with start > up time ? > > I understand where it can improve your lives. I just don't get why it's > suddenly such an explosion of expectations and needs. > It's actually always been something we have tried to improve, it just comes in waves. For instance we occasionally re-examine what modules get pulled in during startup. Importlib was optimized to help with startup. This just happens to be the latest round of trying to improve the situation. As for why we care, every command-line app wants to at least appear faster if not be faster because just getting to the point of being able to e.g. print a version number is dominated by Python and app start-up. And this is not guessing; I work with a team that puts out a command line app and one of the biggest complaints they get is the startup time. -brett _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/brett%40python.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From desmoulinmichel at gmail.com Sun Jul 23 13:52:12 2017 From: desmoulinmichel at gmail.com (Michel Desmoulin) Date: Sun, 23 Jul 2017 19:52:12 +0200 Subject: [Python-Dev] Python startup time In-Reply-To: References: <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> <08a001d302c5$b42542b0$1c6fc810$@sdamon.com> <75cc1c12-77cd-ef08-bc8e-69978f5bbd1b@gmail.com> Message-ID: Le 23/07/2017 ? 
19:36, Brett Cannon a écrit : > > > > On Sun, Jul 23, 2017, 00:53 Michel Desmoulin, > > wrote: > > > > > > > > Optimizing startup time is incredibly valuable, > > > > I've been reading that from the beginning of this thread but I've been > > using python since the 2.4 and I never felt the burden of the > > startup time. > > > > I'm guessing a lot of people are like me, they just don't express them > > self because "better startup time can't be bad so let's not put a > > barrier on this". > > > > I'm not against it, but since the necessity of a faster Python in > > general has been a debate for years and is only finally catching up with > > the work of Victor Stinner, can somebody explain me the deal with start > > up time ? > > > > I understand where it can improve your lives. I just don't get why it's > > suddenly such an explosion of expectations and needs. > > > > > > It's actually always been something we have tried to improve, it just > > comes in waves. For instance we occasionally re-examine what modules get > > pulled in during startup. Importlib was optimized to help with startup. > > This just happens to be the latest round of trying to improve the situation. > > > > As for why we care, every command-line app wants to at least appear > > faster if not be faster because just getting to the point of being able > > to e.g. print a version number is dominated by Python and app start-up. Fair enough. > And this is not guessing; I work with a team that puts out a command > line app and one of the biggest complaints they get is the startup time. This I don't get. When I run any command line utility in python (grin, ffind, pyped, django-admin.py...), they execute in a split second. I can't even SEE the difference between: python3 -c "import os; [print(x) for x in os.listdir('.')]" and ls . I'm having a hard time understanding how the Python VM startup time can be perceived as a barrier here. I can understand if you have an application firing Python 1000 times a second, like a CGI service or some kind of code exec service. But scripting? Now I can imagine that a given Python program can be slow to start up, because it imports a lot of things. But not the VM itself. > > -brett > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/brett%40python.org >
From brett at python.org Sun Jul 23 16:28:32 2017 From: brett at python.org (Brett Cannon) Date: Sun, 23 Jul 2017 20:28:32 +0000 Subject: [Python-Dev] Python startup time In-Reply-To: References: <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> <08a001d302c5$b42542b0$1c6fc810$@sdamon.com> <75cc1c12-77cd-ef08-bc8e-69978f5bbd1b@gmail.com> Message-ID: On Sun, Jul 23, 2017, 10:52 Michel Desmoulin, wrote: > > > Le 23/07/2017 à 19:36, Brett Cannon a écrit : > > > > > > On Sun, Jul 23, 2017, 00:53 Michel Desmoulin, > > wrote: > > > > > > > > > Optimizing startup time is incredibly valuable, > > > > I've been reading that from the beginning of this thread but I've > been > > using python since the 2.4 and I never felt the burden of the > > startup time. > > > > I'm guessing a lot of people are like me, they just don't express > them > > self because "better startup time can't be bad so let's not put a > > barrier on this".
> > > > I'm not against it, but since the necessity of a faster Python in > > general has been a debate for years and is only finally catching up > with > > the work of Victor Stinner, can somebody explain me the deal with > start > > up time ? > > > > I understand where it can improve your lives. I just don't get why > it's > > suddenly such an explosion of expectations and needs. > > > > > > It's actually always been something we have tried to improve, it just > > comes in waves. For instance we occasionally re-examine what modules get > > pulled in during startup. Importlib was optimized to help with startup. > > This just happens to be the latest round of trying to improve the > situation. > > > > As for why we care, every command-line app wants to at least appear > > faster if not be faster because just getting to the point of being able > > to e.g. print a version number is dominated by Python and app start-up. > > > Fair enought. > > > And this is not guessing; I work with a team that puts out a command > > line app and one of the biggest complaints they get is the startup time. > > This I don't get. When I run any command line utility in python (grin, > ffind, pyped, django-admin.py...), the execute in a split second. > > I can't even SEE the different between: > > python3 -c "import os; [print(x) for x in os.listdir('.')]" > > and > > ls . > > I'm having a hard time understanding how the Python VM startup time can > be perceived as a barriere here. I can understand if you have an > application firing Python 1000 times a second, like a CGI service or > some kind of code exec service. But scripting ? > So you're viewing it from a single OS and single machine perspective. Stuff varies so much that you can't compare something like this based on a single experience. I also said "appear" on purpose. ? Some people just compare Python against other languages based on benchmarks like startup when choosing a language so part of this is optics. This also applies when people compare Python 2 to 3. > Now I can imagine that a given Python program can be slow to start up, > because it imports a lot of things. But not the VM itself. > There's also the fact that some things we might do to speed up Python's own startup will propagate to user code and so have a bigger effect, e.g. making namedtuple cheaper reaches into user code that uses namedtuple. IOW based on experience this is worth the time to look into. > > > > > -brett > > > > _______________________________________________ > > Python-Dev mailing list > > Python-Dev at python.org > > https://mail.python.org/mailman/listinfo/python-dev > > Unsubscribe: > > > https://mail.python.org/mailman/options/python-dev/brett%40python.org > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.jerdonek at gmail.com Sun Jul 23 19:48:23 2017 From: chris.jerdonek at gmail.com (Chris Jerdonek) Date: Sun, 23 Jul 2017 16:48:23 -0700 Subject: [Python-Dev] startup time repeated? why not daemon In-Reply-To: References: Message-ID: On Sun, Jul 23, 2017 at 5:57 AM, Victor Stinner wrote: > We already did that. See _bootlocale for example. (Maybe also > _collecctions_abc?) I was asking more in the context of recommended practices for third-party developers, as Nick mentioned earlier, because it's not a strategy I've ever seen mentioned (and common practice is to group only by functionality). It's good to know re: locale and collections though. 
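To make the grouping idea concrete, a made-up layout (illustrative only, not from any real project):

    # mytool/cli.py -- imported on every run; keeps the common path cheap
    def main(argv):
        if argv and argv[0] == 'report':
            # Deferred import: the heavy dependencies are only paid for
            # when this rarely-used command is actually requested.
            from mytool import reporting
            return reporting.run(argv[1:])
        print('nothing to do')

    # mytool/reporting.py -- rarely used, free to import expensive modules
    import json, decimal   # stand-ins for genuinely heavy imports

    def run(args):
        return json.dumps({'args': list(args)})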
Incidentally, from the issue thread it doesn't look like _bootlocale was motivated primarily by startup time, but _collections_abc was: locale: http://bugs.python.org/issue9548 collections.abc: http://bugs.python.org/issue19218 --Chris > > Victor > > Le 22 juil. 2017 07:20, "Chris Jerdonek" a ?crit > : >> >> On Fri, Jul 21, 2017 at 9:52 AM, Brett Cannon wrote: >> > On Thu, 20 Jul 2017 at 22:11 Chris Jerdonek >> > wrote: >> >> On Thu, Jul 20, 2017 at 8:49 PM, Nick Coghlan >> >> wrote: >> >> > ... >> >> > * Lazy loading can have a significant impact on startup time, as it >> >> > means you don't have to pay for the cost of finding and loading >> >> > modules that you don't actually end up using on that particular run >> > >> > It should be mentioned that I have started designing an API to make >> > using >> > lazy loading much easier in Python 3.7 (i.e. "calling a single function" >> > easier), but I still have to write the tests and such before I propose a >> > patch and it will still be mainly for apps that know what they are doing >> > since lazy loading makes debugging import errors harder. >> > ... >> >> > However, if we're going to recommend them as good practices for 3rd >> >> > party developers looking to optimise the startup time of their Python >> >> > applications, then it makes sense for us to embrace them for the >> >> > standard library as well, rather than having our first reaction be to >> >> > write more hand-crafted C code. >> >> >> >> Are there any good write-ups of best practices and techniques in this >> >> area for applications (other than obvious things like avoiding >> >> unnecessary imports)? I'm thinking of things like how to structure >> >> your project, things to look for, developer tools that might help, and >> >> perhaps third-party runtime libraries? >> > >> > Nothing beyond "profile your application" and "don't do stuff during >> > import >> > as a side-effect" that I'm aware of. >> >> One "project structure" idea of the sort I had in mind is to move >> frequently used functions in a module into their own module. This way >> the most common paths of execution don't load unneeded functions. >> Following this line of reasoning could lead to grouping functions in >> an application by when they're needed instead of by what they do, >> which is different from what we normally see. I don't recall seeing >> advice like this anywhere, so maybe the trade-offs aren't worth it. >> Thoughts? >> >> --Chris >> >> >> > >> > -Brett >> > >> >> >> >> >> >> --Chris >> >> >> >> >> >> >> >> > >> >> > On that last point, it's also worth keeping in mind that we have a >> >> > much harder time finding new C-level contributors than we do new >> >> > Python-level ones, and have every reason to expect that problem to >> >> > get >> >> > worse over time rather than better (since writing and maintaining >> >> > handcrafted C code is likely to go the way of writing and maintaining >> >> > handcrafted assembly code as a skillset: while it will still be >> >> > genuinely necessary in some contexts, it will also be an increasingly >> >> > niche technical specialty). >> >> > >> >> > Starting to migrate to using Cython for our acceleration modules >> >> > instead of plain C should thus prove to be a win for everyone: >> >> > >> >> > - Cython structurally avoids a lot of typical bugs that arise in >> >> > hand-coded extensions (e.g. 
refcount bugs) >> >> > - by design, it's much easier to mentally switch between Python & >> >> > Cython than it is between Python & C >> >> > - Cython accelerated modules are easier to adapt to other interpeter >> >> > implementations than handcrafted C modules >> >> > - keeping Python modules and their C accelerated counterparts in sync >> >> > will be easier, as they'll mostly be using the same code >> >> > - we'd be able to start writing C API test cases in Cython rather >> >> > than >> >> > in handcrafted C (which currently mostly translates to only testing >> >> > them indirectly) >> >> > - CPython's own test suite would naturally help test Cython >> >> > compatibility with any C API updates >> >> > - we'd have an inherent incentive to help enhance Cython to take >> >> > advantage of new C API features >> >> > >> >> > The are some genuine downsides in increasing the complexity of >> >> > bootstrapping CPython when all you're starting with is a VCS clone >> >> > and >> >> > a C compiler, but those complications are ultimately no worse than >> >> > those we already have with Argument Clinic, and hence amenable to the >> >> > same solution: if we need to, we can check in the generated C files >> >> > in >> >> > order to make bootstrapping easier. >> >> > >> >> > Cheers, >> >> > Nick. >> >> > >> >> > -- >> >> > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia >> >> > _______________________________________________ >> >> > Python-Dev mailing list >> >> > Python-Dev at python.org >> >> > https://mail.python.org/mailman/listinfo/python-dev >> >> > Unsubscribe: >> >> > >> >> > https://mail.python.org/mailman/options/python-dev/chris.jerdonek%40gmail.com >> >> _______________________________________________ >> >> Python-Dev mailing list >> >> Python-Dev at python.org >> >> https://mail.python.org/mailman/listinfo/python-dev >> >> Unsubscribe: >> >> https://mail.python.org/mailman/options/python-dev/brett%40python.org >> >> On Fri, Jul 21, 2017 at 9:52 AM, Brett Cannon wrote: >> > >> > >> > On Thu, 20 Jul 2017 at 22:11 Chris Jerdonek >> > wrote: >> >> >> >> On Thu, Jul 20, 2017 at 8:49 PM, Nick Coghlan >> >> wrote: >> >> > ... >> >> > * Lazy loading can have a significant impact on startup time, as it >> >> > means you don't have to pay for the cost of finding and loading >> >> > modules that you don't actually end up using on that particular run >> > >> > >> > It should be mentioned that I have started designing an API to make >> > using >> > lazy loading much easier in Python 3.7 (i.e. "calling a single function" >> > easier), but I still have to write the tests and such before I propose a >> > patch and it will still be mainly for apps that know what they are doing >> > since lazy loading makes debugging import errors harder. >> > >> >> >> >> > >> >> > We've historically resisted adopting these techniques for the >> >> > standard >> >> > library because they *do* make things more complicated *and* harder >> >> > to >> >> > debug relative to plain old eagerly imported dynamic Python code. >> >> > However, if we're going to recommend them as good practices for 3rd >> >> > party developers looking to optimise the startup time of their Python >> >> > applications, then it makes sense for us to embrace them for the >> >> > standard library as well, rather than having our first reaction be to >> >> > write more hand-crafted C code. 
>> >> >> >> Are there any good write-ups of best practices and techniques in this >> >> area for applications (other than obvious things like avoiding >> >> unnecessary imports)? I'm thinking of things like how to structure >> >> your project, things to look for, developer tools that might help, and >> >> perhaps third-party runtime libraries? >> > >> > >> > Nothing beyond "profile your application" and "don't do stuff during >> > import >> > as a side-effect" that I'm aware of. >> > >> > -Brett >> > >> >> >> >> >> >> --Chris >> >> >> >> >> >> >> >> > >> >> > On that last point, it's also worth keeping in mind that we have a >> >> > much harder time finding new C-level contributors than we do new >> >> > Python-level ones, and have every reason to expect that problem to >> >> > get >> >> > worse over time rather than better (since writing and maintaining >> >> > handcrafted C code is likely to go the way of writing and maintaining >> >> > handcrafted assembly code as a skillset: while it will still be >> >> > genuinely necessary in some contexts, it will also be an increasingly >> >> > niche technical specialty). >> >> > >> >> > Starting to migrate to using Cython for our acceleration modules >> >> > instead of plain C should thus prove to be a win for everyone: >> >> > >> >> > - Cython structurally avoids a lot of typical bugs that arise in >> >> > hand-coded extensions (e.g. refcount bugs) >> >> > - by design, it's much easier to mentally switch between Python & >> >> > Cython than it is between Python & C >> >> > - Cython accelerated modules are easier to adapt to other interpeter >> >> > implementations than handcrafted C modules >> >> > - keeping Python modules and their C accelerated counterparts in sync >> >> > will be easier, as they'll mostly be using the same code >> >> > - we'd be able to start writing C API test cases in Cython rather >> >> > than >> >> > in handcrafted C (which currently mostly translates to only testing >> >> > them indirectly) >> >> > - CPython's own test suite would naturally help test Cython >> >> > compatibility with any C API updates >> >> > - we'd have an inherent incentive to help enhance Cython to take >> >> > advantage of new C API features >> >> > >> >> > The are some genuine downsides in increasing the complexity of >> >> > bootstrapping CPython when all you're starting with is a VCS clone >> >> > and >> >> > a C compiler, but those complications are ultimately no worse than >> >> > those we already have with Argument Clinic, and hence amenable to the >> >> > same solution: if we need to, we can check in the generated C files >> >> > in >> >> > order to make bootstrapping easier. >> >> > >> >> > Cheers, >> >> > Nick. 
>> >> > >> >> > -- >> >> > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia >> >> > _______________________________________________ >> >> > Python-Dev mailing list >> >> > Python-Dev at python.org >> >> > https://mail.python.org/mailman/listinfo/python-dev >> >> > Unsubscribe: >> >> > >> >> > https://mail.python.org/mailman/options/python-dev/chris.jerdonek%40gmail.com >> >> _______________________________________________ >> >> Python-Dev mailing list >> >> Python-Dev at python.org >> >> https://mail.python.org/mailman/listinfo/python-dev >> >> Unsubscribe: >> >> https://mail.python.org/mailman/options/python-dev/brett%40python.org >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: >> https://mail.python.org/mailman/options/python-dev/victor.stinner%40gmail.com From ncoghlan at gmail.com Sun Jul 23 23:59:30 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 24 Jul 2017 13:59:30 +1000 Subject: [Python-Dev] Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> <874lu62frj.fsf@vostro.rath.org> <20170721182100.6da92da1@presto> <08a001d302c5$b42542b0$1c6fc810$@sdamon.com> Message-ID: On 23 July 2017 at 09:35, Steve Dower wrote: > Yes, I?m aware of that, which is why I don?t have any specific suggestions > off-hand. But given the differences in file systems between Windows and > other OSs, it wouldn?t surprise me if there were a more optimal approach for > NTFS to amortize calls better. Perhaps not, but it is still the most > expensive part of startup that we have any ability to change, so it?s worth > investigating. That does remind me of a capability we haven''t played with a lot recently: $ python3 -m site sys.path = [ '/home/ncoghlan', '/usr/lib64/python36.zip', '/usr/lib64/python3.6', '/usr/lib64/python3.6/lib-dynload', '/home/ncoghlan/.local/lib/python3.6/site-packages', '/usr/lib64/python3.6/site-packages', '/usr/lib/python3.6/site-packages', ] USER_BASE: '/home/ncoghlan/.local' (exists) USER_SITE: '/home/ncoghlan/.local/lib/python3.6/site-packages' (exists) ENABLE_USER_SITE: True The interpreter puts a zip file ahead of the regular unpacked standard library on sys.path because at one point in time that was a useful optimisation technique for reducing import costs on application startup. It was a potentially big win with the old "multiple stat calls" import implementation, but I'm not aware of any more recent benchmarks relative to the current listdir-caching based import implementation. So I think some interesting experiments to try measuring might be: - pushing the "always imported" modules into a dedicated zip archive - having the interpreter pre-seed sys.modules with the contents of that dedicated archive - freezing those modules and building them into the interpreter that way - compiling the standalone top-level modules with Cython, and loading them as extension modules - compiling in the Cython generated modules as builtins (not currently an option for packages & submodules due to [1]) The nice thing about those kinds of approaches is that they're all fairly general purpose, and relate primarily to how the Python interpreter is put together, rather than how the individual modules are written in the first place. 
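As a rough illustration of the first of those experiments, building such an archive needs nothing beyond the zipfile module (the module list and archive name below are only placeholders; PyZipFile stores the compiled .pyc for each source file it is given):

import zipfile

# Placeholder list - in practice it would be generated from whatever is
# already in sys.modules at startup (see the P.S. below).
ALWAYS_IMPORTED = ["abc", "codecs", "io", "os", "posixpath", "stat", "genericpath"]

def build_startup_archive(target="startup_modules.zip"):
    with zipfile.PyZipFile(target, "w") as archive:
        for name in ALWAYS_IMPORTED:
            module = __import__(name)
            archive.writepy(module.__file__)  # adds the compiled .pyc at the top level
    return target

The resulting file could then be prepended to sys.path the same way the existing python36.zip entry is, to see whether it actually buys anything with the current import system.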
(I'm not volunteering to run those experiments, though - just pointing out some of the technical options we have available to us that don't involve adding more handcrafted C extension modules to CPython) [1] https://bugs.python.org/issue1644818 Cheers, NIck. P.S. Checking the current list of source modules implicitly loaded at startup, I get: >>> import sys >>> sorted(k for k, m in sys.modules.items() if m.__spec__ is not None and type(m.__spec__.loader).__name__ == "SourceFileLoader") ['_collections_abc', '_sitebuiltins', '_weakrefset', 'abc', 'codecs', 'encodings', 'encodings.aliases', 'encodings.latin_1', 'encodings.utf_8', 'genericpath', 'io', 'os', 'os.path', 'posixpath', 'rlcompleter', 'site', 'stat'] -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From larry at hastings.org Mon Jul 24 03:00:07 2017 From: larry at hastings.org (Larry Hastings) Date: Mon, 24 Jul 2017 00:00:07 -0700 Subject: [Python-Dev] Python 3.5.4rc1 and 3.4.7rc1 slipping by a day, to July 24 2017 Message-ID: Release engineering for 3.5.4rc1 and 3.4.7rc1 took a lot longer than expected, because this is the first release using "blurb", and it turned out there was a lot of work left to do and a couple dark corners yet to stumble over. 3.5.4rc1 and 3.4.7rc1 will be released Monday, July 24, 2017. The release dates for 3.5.4 final and 3.4.7 final are not expected to change. Sorry about that, //arry/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From victor.stinner at gmail.com Mon Jul 24 05:04:52 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 24 Jul 2017 11:04:52 +0200 Subject: [Python-Dev] Is Windows XP still supported on Python 2.7? Message-ID: Hi, We have a Windows XP buildbot for Python 2.7, run by David Bolen: http://buildbot.python.org/all/builders/x86%20Windows%20XP%202.7/ test_bsddb3 fails randomly on this buildbot: http://bugs.python.org/issue30778 But Windows XP clearly reached its end-of-life, Microsoft doesn't support it anymore. So my question is if it makes sense to spend time on it? We have a rule for new x.y.0 released, but not if a Microsoft Windows support expires during the lifetime of a Python release (2.7 here): https://www.python.org/dev/peps/pep-0011/#microsoft-windows Firefox made great efforts to support Windows XP last years, but they decided to drop support last March with Firefox 52, last release supporting XP and Visa: https://support.mozilla.org/en-US/kb/end-support-windows-xp-and-vista Victor From tritium-list at sdamon.com Mon Jul 24 05:38:07 2017 From: tritium-list at sdamon.com (Alex Walters) Date: Mon, 24 Jul 2017 05:38:07 -0400 Subject: [Python-Dev] Is Windows XP still supported on Python 2.7? In-Reply-To: References: Message-ID: <09ff01d30460$8ed79160$ac86b420$@sdamon.com> The promise that PEP-11 is making is that as long as a python was released while Microsoft still supported that OS, and that python is still supported, there will still be a python that works for you. So, yes, Windows XP is long since unsupported by Microsoft, but a disturbing number of people still run it. (I think the NHS in the UK still runs embedded windows XP, just to name a big one). Yes, it's a support burden, but it's on the support burden version of python anyways. 2.7 is a very slow moving branch so it shouldn't be THAT big of a pain for the last 2 years of python 2 support. 
> -----Original Message----- > From: Python-Dev [mailto:python-dev-bounces+tritium- > list=sdamon.com at python.org] On Behalf Of Victor Stinner > Sent: Monday, July 24, 2017 5:05 AM > To: Python Dev ; David Bolen > > Subject: [Python-Dev] Is Windows XP still supported on Python 2.7? > > Hi, > > We have a Windows XP buildbot for Python 2.7, run by David Bolen: > http://buildbot.python.org/all/builders/x86%20Windows%20XP%202.7/ > > test_bsddb3 fails randomly on this buildbot: > http://bugs.python.org/issue30778 > > But Windows XP clearly reached its end-of-life, Microsoft doesn't > support it anymore. So my question is if it makes sense to spend time > on it? > > We have a rule for new x.y.0 released, but not if a Microsoft Windows > support expires during the lifetime of a Python release (2.7 here): > https://www.python.org/dev/peps/pep-0011/#microsoft-windows > > Firefox made great efforts to support Windows XP last years, but they > decided to drop support last March with Firefox 52, last release > supporting XP and Visa: > https://support.mozilla.org/en-US/kb/end-support-windows-xp-and-vista > > Victor > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/tritium- > list%40sdamon.com From victor.stinner at gmail.com Mon Jul 24 05:56:44 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 24 Jul 2017 11:56:44 +0200 Subject: [Python-Dev] Is Windows XP still supported on Python 2.7? In-Reply-To: <09ff01d30460$8ed79160$ac86b420$@sdamon.com> References: <09ff01d30460$8ed79160$ac86b420$@sdamon.com> Message-ID: 2017-07-24 11:38 GMT+02:00 Alex Walters : > The promise that PEP-11 is making is that as long as a python was released > while Microsoft still supported that OS, and that python is still supported, > there will still be a python that works for you. So, yes, Windows XP is > long since unsupported by Microsoft, but a disturbing number of people still > run it. (I think the NHS in the UK still runs embedded windows XP, just to > name a big one). Yes, it's a support burden, but it's on the support burden > version of python anyways. 2.7 is a very slow moving branch so it shouldn't > be THAT big of a pain for the last 2 years of python 2 support. Python 2.7.13 which was released at 2016-12-17 still supports Windows XP. It's not like you cannot install on Windows XP anymore. The question is who will fix bugs specific to Windows XP or Visa until 2020. Having a working Windows XP is probably required to debug and fix such bugs. I don't want to install Windows XP on my network. The last time I ran XP was something like 4 years ago, and I was reading an article saying that malwares are able to attack XP even during the installation time. So you cannot be sure that installed Windows is safe of malwares or not... 
Victor From ncoghlan at gmail.com Mon Jul 24 10:25:04 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 25 Jul 2017 00:25:04 +1000 Subject: [Python-Dev] Cython compiled stdlib modules - Re: Python startup time In-Reply-To: References: <20170719132211.GA30782@phdru.name> Message-ID: On 22 July 2017 at 06:43, Stefan Behnel wrote: > Nick Coghlan schrieb am 21.07.2017 um 08:23: >> I'll also note that in these cases where the import overhead is >> proportionally significant for always-imported modules, we may want to look >> at the benefits of freezing them (if they otherwise remain as pure Python >> modules), or compiling them as builtin modules (if we switch them over to >> Cython), in addition to looking at ways to make the modules themselves >> faster. > > Just for the sake of it, I gave the Cython compilation a try. I had to > apply the attached hack to Lib/typing.py to get the test passing, because > it uses frame call offsets in some places and Cython functions do not > create frames when being called (they only create them for exception > traces). I also had to disable the import of "abc" in the Cython generated > module to remove the circular self dependency at startup when the "abc" > module is compiled. That shouldn't have an impact on the runtime > performance, though. [snip] > As it stands, the gain is probably not worth the increase in library file > size, which also translates to a higher bottom line for the memory > consumption. At least not for these two modules. Manually optimising the > files would likely also reduce the .so file size in addition to giving > better speedups, though, because the generated code would become less generic. Thanks for trying the experiment! I agree with your conclusion that the file size impact likely rules it out as a general technique. Selective freezing may still be interesting though, since that at least avoids the import path searches and merges the disk read into the initial loading of the executable. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From katherinebobrovnik at gmail.com Mon Jul 24 04:04:48 2017 From: katherinebobrovnik at gmail.com (Katherine Bobrovnik) Date: Mon, 24 Jul 2017 11:04:48 +0300 Subject: [Python-Dev] dictionaries in Dataframe column Message-ID: Hello guys! I've stuck at this: I have pandas Dataframe with a lot of columns. One column contains dictionaries with emoji like {'count': 1, 'name': 'fire'}. My goal is to sort rows of this Dataframe by the number from 'count'. Like first row will be with {'count': 49, 'name': '+1'}, the last - {'count': 1, 'name': 'fire'}, for instance. Help, please :) Best regards, Kateryna From phd at phdru.name Mon Jul 24 11:11:21 2017 From: phd at phdru.name (Oleg Broytman) Date: Mon, 24 Jul 2017 17:11:21 +0200 Subject: [Python-Dev] dictionaries in Dataframe column In-Reply-To: References: Message-ID: <20170724151121.GA30536@phdru.name> Hello. This mailing list is to work on developing Python (adding new features to Python itself and fixing bugs); if you're having problems learning, understanding or using Python, please find another forum. Probably python-list/comp.lang.python mailing list/news group is the best place; there are Python developers who participate in it; you may get a faster, and probably more complete, answer there. See http://www.python.org/community/ for other lists/news groups/fora. Thank you for understanding. On Mon, Jul 24, 2017 at 11:04:48AM +0300, Katherine Bobrovnik wrote: > Hello guys! 
> I've stuck at this: > I have pandas Dataframe with a lot of columns. One column contains > dictionaries with emoji like {'count': 1, 'name': 'fire'}. My goal is > to sort rows of this Dataframe by the number from 'count'. Like first > row will be with {'count': 49, 'name': '+1'}, the last - {'count': > 1, 'name': 'fire'}, for instance. Help, please :) > > Best regards, > Kateryna > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/phd%40phdru.name Oleg. -- Oleg Broytman http://phdru.name/ phd at phdru.name Programmers don't die, they just GOSUB without RETURN. From kiuhnm03 at yahoo.it Mon Jul 24 11:35:03 2017 From: kiuhnm03 at yahoo.it (Kiuhnm) Date: Mon, 24 Jul 2017 17:35:03 +0200 Subject: [Python-Dev] for...else Message-ID: Hello, I think that the expression "for...else" or "while...else" is completely counter-intuitive. Wouldn't it be possible to make it clearer? Maybe something like break in for i in range(n): ... if cond: break else: ... I'm not an English native speaker so I don't know whether "break in" is acceptable English in this context or can only mean "to get into a building by force". Kiuhnm From steve at pearwood.info Mon Jul 24 12:14:48 2017 From: steve at pearwood.info (Steven D'Aprano) Date: Tue, 25 Jul 2017 02:14:48 +1000 Subject: [Python-Dev] for...else In-Reply-To: References: Message-ID: <20170724161447.GQ3149@ando.pearwood.info> Hello Kiuhnm, and welcome. On Mon, Jul 24, 2017 at 05:35:03PM +0200, Kiuhnm via Python-Dev wrote: > Hello, > > I think that the expression "for...else" or "while...else" is completely > counter-intuitive. You may be right -- this has been discussed many, many times before. In my personal opinion, the best (and only accurate!) phrase would have been: for item in sequence: # block then: # block If you look at the byte-code generated by a for...else statement, you see that the "else" block is unconditionally executed after the for loop completes, unless something causes a jump outside of the entire statement: return, break, or raise. So it is more like: - run the loop; - *then* run the following block rather than: - run the loop; - otherwise ("else") run the following block. Others disagree and would prefer other keywords. But regardless, backwards compatibility means that we must keep "for...else", so I'm afraid that discussing alternatives is *almost certainly* a waste of time. > Wouldn't it be possible to make it clearer? Maybe > something like At this point, no, it is not practical to change the syntax used. Maybe when Python 3.0 was first introduced, but that ship has long sailed. It is very, very unlikely that the syntax for this will ever change, but if it does, it probably won't be until something in the distant future like Python 5. But not Python 4: Guido has already ruled that Python 4 will not include major backwards-incompatible changes. Going from 3 to 4 will not be as disruptive as going from 2 to 3. So depending on how you look at it: discussing alternative syntax to for...else is either ten years too late or ten years too early. 
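In the meantime, a tiny runnable example may help make the actual behaviour concrete (there is nothing special about the values used here):

def describe(needle, haystack):
    for index, item in enumerate(haystack):
        if item == needle:
            print(needle, "found at index", index)
            break
    else:
        # Only reached if the loop ran to completion without hitting `break`.
        print(needle, "not found")

describe(3, [1, 2, 3])   # prints: 3 found at index 2
describe(9, [1, 2, 3])   # prints: 9 not found

Read the `else` as "if the loop was not broken out of", and the construct matches what the byte-code actually does.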
-- Steve From benhoyt at gmail.com Mon Jul 24 12:23:49 2017 From: benhoyt at gmail.com (Ben Hoyt) Date: Mon, 24 Jul 2017 12:23:49 -0400 Subject: [Python-Dev] for...else In-Reply-To: <20170724161447.GQ3149@ando.pearwood.info> References: <20170724161447.GQ3149@ando.pearwood.info> Message-ID: This is more of a python-ideas discussion, and Steven's answer is good. I'll just add one thing. Maybe it's obvious to others, but I've liked for...else since I found a kind of mnemonic to help me remember when the "else" part happens: I think of it not as "for ... else" but as "break ... else" -- saying it this way makes it clear to me that the break goes with the else. "If this condition inside the loop is true, break. ... *else* if we didn't break, do this other thing after the loop." -Ben On Mon, Jul 24, 2017 at 12:14 PM, Steven D'Aprano wrote: > Hello Kiuhnm, and welcome. > > On Mon, Jul 24, 2017 at 05:35:03PM +0200, Kiuhnm via Python-Dev wrote: > > Hello, > > > > I think that the expression "for...else" or "while...else" is completely > > counter-intuitive. > > > You may be right -- this has been discussed many, many times before. In > my personal opinion, the best (and only accurate!) phrase would have > been: > > for item in sequence: > # block > then: > # block > > If you look at the byte-code generated by a for...else statement, you > see that the "else" block is unconditionally executed after the for loop > completes, unless something causes a jump outside of the entire > statement: return, break, or raise. So it is more like: > > - run the loop; > - *then* run the following block > > rather than: > > - run the loop; > - otherwise ("else") run the following block. > > Others disagree and would prefer other keywords. But regardless, > backwards compatibility means that we must keep "for...else", so I'm > afraid that discussing alternatives is *almost certainly* a waste of > time. > > > > Wouldn't it be possible to make it clearer? Maybe > > something like > > At this point, no, it is not practical to change the syntax used. Maybe > when Python 3.0 was first introduced, but that ship has long sailed. It > is very, very unlikely that the syntax for this will ever change, but if > it does, it probably won't be until something in the distant future like > Python 5. > > But not Python 4: Guido has already ruled that Python 4 will not include > major backwards-incompatible changes. Going from 3 to 4 will not be as > disruptive as going from 2 to 3. > > So depending on how you look at it: discussing alternative syntax to > for...else is either ten years too late or ten years too early. > > > > -- > Steve > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > benhoyt%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From alexander.belopolsky at gmail.com Mon Jul 24 12:36:44 2017 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Mon, 24 Jul 2017 12:36:44 -0400 Subject: [Python-Dev] for...else In-Reply-To: References: <20170724161447.GQ3149@ando.pearwood.info> Message-ID: On Mon, Jul 24, 2017 at 12:23 PM, Ben Hoyt wrote: > .. I found a kind of mnemonic to help me remember when the > "else" part happens: I think of it not as "for ... else" but as "break ... > else" -- saying it this way makes it clear to me that the break goes with > the else. 
"If this condition inside the loop is true, break. ... *else* if > we didn't break, do this other thing after the loop." Note that since break itself is typically guarded by an "if" as in for i in x: ... if cond(i): break ... else: ... you can match the "else" above to the "if" inside the loop. From tjreedy at udel.edu Mon Jul 24 12:48:16 2017 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 24 Jul 2017 12:48:16 -0400 Subject: [Python-Dev] Is Windows XP still supported on Python 2.7? In-Reply-To: References: Message-ID: On 7/24/2017 5:04 AM, Victor Stinner wrote: > We have a Windows XP buildbot for Python 2.7, run by David Bolen: > http://buildbot.python.org/all/builders/x86%20Windows%20XP%202.7/ > > test_bsddb3 fails randomly on this buildbot: > http://bugs.python.org/issue30778 If that turns out to be an unfixable intermittent failure of two particular functions, then it becomes expected. To keep buildbots green, skip the one that crashes and turn the failure of the other into a skip. > But Windows XP clearly reached its end-of-life, Microsoft doesn't > support it anymore. So my question is if it makes sense to spend time > on it? When an entity *sells* 'support', that means an active effort to fix. For free, volunteer-built Python, 'support' for system Z means that we (continue to) allow system Z specific code, new patches, and a system Z buildbot. If we make an installer, it installs on system Z. In that minimal sense, xp is still generally supported, although experts for specific modules may refuse to review and merge patches for 'their' modules. But that exception typically has nothing to do with xp in particular. But it could. When XP became unsupported in 3.5, xp-specific code was removed, the xp buildbot was removed, we started rejecting xp-specific patches, and our windows installer refused to install on XP. Are you are proposing this for 2.7? > We have a rule for new x.y.0 released, but not if a Microsoft Windows > support expires during the lifetime of a Python release (2.7 here): > https://www.python.org/dev/peps/pep-0011/#microsoft-windows > Firefox made great efforts to support Windows XP last years, but they > decided to drop support last March with Firefox 52, last release > supporting XP and Visa: > https://support.mozilla.org/en-US/kb/end-support-windows-xp-and-vista The link says that they continue with security patches until next September, at which point they will review installation numbers. The extra 5 years of support for 2.7, above the originally intended 5 years, is mainly for security fixes, although some people continue with routine non-security bugfixes. -- Terry Jan Reedy From zachary.ware+pydev at gmail.com Mon Jul 24 13:05:47 2017 From: zachary.ware+pydev at gmail.com (Zachary Ware) Date: Mon, 24 Jul 2017 12:05:47 -0500 Subject: [Python-Dev] Is Windows XP still supported on Python 2.7? In-Reply-To: References: Message-ID: On Mon, Jul 24, 2017 at 11:48 AM, Terry Reedy wrote: > On 7/24/2017 5:04 AM, Victor Stinner wrote: >> We have a Windows XP buildbot for Python 2.7, run by David Bolen: >> http://buildbot.python.org/all/builders/x86%20Windows%20XP%202.7/ >> >> test_bsddb3 fails randomly on this buildbot: >> http://bugs.python.org/issue30778 > > > If that turns out to be an unfixable intermittent failure of two particular > functions, then it becomes expected. To keep buildbots green, skip the one > that crashes and turn the failure of the other into a skip. We are committed to support back to Windows 2000 in Python 2.7. 
In general, that just means "don't commit something that makes the platform unsupportable and accept a patch if somebody fixes something on that platform". In this case, considering that it's a test of a 2.x-only module on an out-of-vendor-support OS, skipping the tests (possibly even the entirety of test_bsddb3) on XP sounds just fine to me. -- Zach From victor.stinner at gmail.com Mon Jul 24 13:08:59 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 24 Jul 2017 19:08:59 +0200 Subject: [Python-Dev] Is Windows XP still supported on Python 2.7? In-Reply-To: References: Message-ID: 2017-07-24 19:05 GMT+02:00 Zachary Ware : > In this case, considering that it's a test of a > 2.x-only module on an out-of-vendor-support OS, skipping the tests > (possibly even the entirety of test_bsddb3) on XP sounds just fine to > me. Oh ok. Since Terry and you agree on that, I will skip the test on Windows XP. Victor From isaac.morland at gmail.com Mon Jul 24 13:12:45 2017 From: isaac.morland at gmail.com (Isaac Morland) Date: Mon, 24 Jul 2017 13:12:45 -0400 Subject: [Python-Dev] for...else In-Reply-To: References: Message-ID: The way I remember it is to observe that the following are *almost* exactly the same thing: if C: T else: E while C: T else: E The *only* differences are: 1) where execution jumps if it reaches the end of the T: in the "while", it jumps back to the while itself, resulting in the condition being rechecked, whereas in the "if" execution skips over the "else" to whatever follows; and 2) in the "while", inside the T "break" and "continue" relate to this control structure rather than to a containing loop. (At least I don't think I'm missing any other differences!) Seen this way, the meaning of the "else" is easy to understand and to remember. And the for loop else is like the while loop else. On 24 July 2017 at 11:35, Kiuhnm via Python-Dev wrote: > Hello, > > I think that the expression "for...else" or "while...else" is completely > counter-intuitive. Wouldn't it be possible to make it clearer? Maybe > something like > > break in for i in range(n): > ... > if cond: > break > else: > ... > > I'm not an English native speaker so I don't know whether "break in" is > acceptable English in this context or can only mean "to get into a building > by force". > > Kiuhnm > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/isaac. > morland%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From peter.xihong.wang at intel.com Mon Jul 24 13:49:44 2017 From: peter.xihong.wang at intel.com (Wang, Peter Xihong) Date: Mon, 24 Jul 2017 17:49:44 +0000 Subject: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? In-Reply-To: References: Message-ID: <371EBC7881C7844EAAF5556BFF21BCCC830D98CF@ORSMSX105.amr.corp.intel.com> Hi Ben, Out of curiosity with a quick experiment, I ran your pentomino.py with 2.7.12 PGO+LTO build (Ubuntu OS 16.04.2 LTS default at /usr/bin/python), and compared with 3.7.0 alpha1 PGO+LTO (which I built a while ago), on my SkyLake processor based desktop, and 2.7 outperforms 3.7 by 3.5%. On your 2.5 GHz i7 system, I'd recommend making sure the 2 Python binaries you are comparing are in equal footings (compiled with same optimization PGO+LTO). 
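One quick way to sanity-check that from inside the interpreter is to dump the recorded build configuration (a rough check only -- the variable names below come from the Unix Makefile, so some may be empty or absent on other builds):

import sysconfig

# Inspect these for optimization-related settings (e.g. --enable-optimizations
# for PGO on 3.6+, or -flto for LTO); get_config_var() returns None if a
# variable is not recorded for this build.
for name in ("CONFIG_ARGS", "PY_CFLAGS", "PY_LDFLAGS"):
    print(name, "=", sysconfig.get_config_var(name))

Running that under each of the binaries being compared makes it easier to tell whether an apparent 2-vs-3 difference is really just a build-flags difference.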
Thanks, Peter -----Original Message----- From: Python-Dev [mailto:python-dev-bounces+peter.xihong.wang=intel.com at python.org] On Behalf Of Nathaniel Smith Sent: Tuesday, July 18, 2017 7:00 PM To: Ben Hoyt Cc: Python-Dev Subject: Re: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? I'd probably start with a regular C-level profiler, like perf or callgrind. They're not very useful for comparing two versions of code written in Python, but here the Python code is the same (modulo changes in the stdlib), and it's changes in the interpreter's C code that probably make the difference. On Tue, Jul 18, 2017 at 9:03 AM, Ben Hoyt wrote: > Hi folks, > > (Not entirely sure this is the right place for this question, but > hopefully it's of interest to several folks.) > > A few days ago I posted a note in response to Victor Stinner's > articles on his CPython contributions, noting that I wrote a program > that ran in 11.7 seconds on Python 2.7, but only takes 5.1 seconds on > Python 3.5 (on my 2.5 GHz macOS i7), more than 2x as fast. Obviously > this is a Good Thing, but I'm curious as to why there's so much difference. > > The program is a pentomino puzzle solver, and it works via code > generation, generating a ton of nested "if" statements, so I believe > it's exercising the Python bytecode interpreter heavily. Obviously > there have been some big optimizations to make this happen, but I'm > curious what the main improvements are that are causing this much difference. > > There's a writeup about my program here, with benchmarks at the bottom: > http://benhoyt.com/writings/python-pentomino/ > > This is the generated Python code that's being exercised: > https://github.com/benhoyt/python-pentomino/blob/master/generated_solv > e.py > > For reference, on Python 3.6 it runs in 4.6 seconds (same on Python > 3.7 alpha). This smallish increase from Python 3.5 to Python 3.6 was > more expected to me due to the bytecode changing to wordcode in 3.6. > > I tried using cProfile on both Python versions, but that didn't say > much, because the functions being called aren't taking the majority of the time. > How does one benchmark at a lower level, or otherwise explain what's > going on here? > > Thanks, > Ben > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/njs%40pobox.com > -- Nathaniel J. Smith -- https://vorpus.org _______________________________________________ Python-Dev mailing list Python-Dev at python.org https://mail.python.org/mailman/listinfo/python-dev Unsubscribe: https://mail.python.org/mailman/options/python-dev/peter.xihong.wang%40intel.com From benhoyt at gmail.com Mon Jul 24 15:35:22 2017 From: benhoyt at gmail.com (Ben Hoyt) Date: Mon, 24 Jul 2017 15:35:22 -0400 Subject: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? In-Reply-To: <371EBC7881C7844EAAF5556BFF21BCCC830D98CF@ORSMSX105.amr.corp.intel.com> References: <371EBC7881C7844EAAF5556BFF21BCCC830D98CF@ORSMSX105.amr.corp.intel.com> Message-ID: Thanks for testing. Oddly, I just tested it in Linux (Ubuntu), and get the same results as you -- Python 2.7.13 outperforms 3 (3.5.3 in my case) by a few percent. And even under a Virtualbox VM it takes 3.4 and 3.6 seconds, compared to ~5s on the host macOS operating system. Very odd. 
I guess that means Virtualbox is very good, and that clang/LLVM is not as good at optimizing the Python VM as gcc is. I can't find anything majorly different about my macOS Python 2 and 3 builds. Both look like they have PGO turned on (from sysconfig.get_config_vars()). Both have HAVE_COMPUTED_GOTOS=1 but USE_COMPUTED_GOTOS=0 for some reason. My Python 2 version is the macOS system version (/usr/local/bin/python2), whereas my Python3 version is from "brew install", so that's probably the difference, though still doesn't explain exactly why. -Ben On Mon, Jul 24, 2017 at 1:49 PM, Wang, Peter Xihong < peter.xihong.wang at intel.com> wrote: > Hi Ben, > > Out of curiosity with a quick experiment, I ran your pentomino.py with > 2.7.12 PGO+LTO build (Ubuntu OS 16.04.2 LTS default at /usr/bin/python), > and compared with 3.7.0 alpha1 PGO+LTO (which I built a while ago), on my > SkyLake processor based desktop, and 2.7 outperforms 3.7 by 3.5%. > On your 2.5 GHz i7 system, I'd recommend making sure the 2 Python binaries > you are comparing are in equal footings (compiled with same optimization > PGO+LTO). > > Thanks, > > Peter > > > > -----Original Message----- > From: Python-Dev [mailto:python-dev-bounces+peter.xihong.wang=intel.com@ > python.org] On Behalf Of Nathaniel Smith > Sent: Tuesday, July 18, 2017 7:00 PM > To: Ben Hoyt > Cc: Python-Dev > Subject: Re: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on > Python 3.5 -- why so much difference? > > I'd probably start with a regular C-level profiler, like perf or > callgrind. They're not very useful for comparing two versions of code > written in Python, but here the Python code is the same (modulo changes in > the stdlib), and it's changes in the interpreter's C code that probably > make the difference. > > On Tue, Jul 18, 2017 at 9:03 AM, Ben Hoyt wrote: > > Hi folks, > > > > (Not entirely sure this is the right place for this question, but > > hopefully it's of interest to several folks.) > > > > A few days ago I posted a note in response to Victor Stinner's > > articles on his CPython contributions, noting that I wrote a program > > that ran in 11.7 seconds on Python 2.7, but only takes 5.1 seconds on > > Python 3.5 (on my 2.5 GHz macOS i7), more than 2x as fast. Obviously > > this is a Good Thing, but I'm curious as to why there's so much > difference. > > > > The program is a pentomino puzzle solver, and it works via code > > generation, generating a ton of nested "if" statements, so I believe > > it's exercising the Python bytecode interpreter heavily. Obviously > > there have been some big optimizations to make this happen, but I'm > > curious what the main improvements are that are causing this much > difference. > > > > There's a writeup about my program here, with benchmarks at the bottom: > > http://benhoyt.com/writings/python-pentomino/ > > > > This is the generated Python code that's being exercised: > > https://github.com/benhoyt/python-pentomino/blob/master/generated_solv > > e.py > > > > For reference, on Python 3.6 it runs in 4.6 seconds (same on Python > > 3.7 alpha). This smallish increase from Python 3.5 to Python 3.6 was > > more expected to me due to the bytecode changing to wordcode in 3.6. > > > > I tried using cProfile on both Python versions, but that didn't say > > much, because the functions being called aren't taking the majority of > the time. > > How does one benchmark at a lower level, or otherwise explain what's > > going on here? 
> > > > Thanks, > > Ben > > > > _______________________________________________ > > Python-Dev mailing list > > Python-Dev at python.org > > https://mail.python.org/mailman/listinfo/python-dev > > Unsubscribe: > > https://mail.python.org/mailman/options/python-dev/njs%40pobox.com > > > > > > -- > Nathaniel J. Smith -- https://vorpus.org ______________________________ > _________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > peter.xihong.wang%40intel.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From peter.xihong.wang at intel.com Mon Jul 24 16:47:37 2017 From: peter.xihong.wang at intel.com (Wang, Peter Xihong) Date: Mon, 24 Jul 2017 20:47:37 +0000 Subject: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? In-Reply-To: References: <371EBC7881C7844EAAF5556BFF21BCCC830D98CF@ORSMSX105.amr.corp.intel.com> Message-ID: <371EBC7881C7844EAAF5556BFF21BCCC830D9970@ORSMSX105.amr.corp.intel.com> I believe we have evaluated clang vs gcc before (long time ago), and gcc won at that time. PGO might have overshadowed impact from computed goto, and thus the latter may no longer be needed. When the performance difference is as large as 50%, there could be various options to nail down the root cause, including bytecode analysis. However, coming down to 3.6 sec vs 3.4 sec, a delta of ~5%, it could be hard to find out. Internally we use sampling based performance tools for micro-architecture level analysis. Or generic Linux based and open source tool ?perf? is very good to use. You could also do a disassembly analysis/comparison of the object files such as the main loop, ceval.o, looking at the efficiency of the generated codes (which gives generic info regarding to Python2 and 3, but may not tell you the run time behavior with respect your specific app, pentomino.py). Hope that helps. Peter From: Ben Hoyt [mailto:benhoyt at gmail.com] Sent: Monday, July 24, 2017 12:35 PM To: Wang, Peter Xihong Cc: Nathaniel Smith ; Python-Dev Subject: Re: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? Thanks for testing. Oddly, I just tested it in Linux (Ubuntu), and get the same results as you -- Python 2.7.13 outperforms 3 (3.5.3 in my case) by a few percent. And even under a Virtualbox VM it takes 3.4 and 3.6 seconds, compared to ~5s on the host macOS operating system. Very odd. I guess that means Virtualbox is very good, and that clang/LLVM is not as good at optimizing the Python VM as gcc is. I can't find anything majorly different about my macOS Python 2 and 3 builds. Both look like they have PGO turned on (from sysconfig.get_config_vars()). Both have HAVE_COMPUTED_GOTOS=1 but USE_COMPUTED_GOTOS=0 for some reason. My Python 2 version is the macOS system version (/usr/local/bin/python2), whereas my Python3 version is from "brew install", so that's probably the difference, though still doesn't explain exactly why. -Ben On Mon, Jul 24, 2017 at 1:49 PM, Wang, Peter Xihong > wrote: Hi Ben, Out of curiosity with a quick experiment, I ran your pentomino.py with 2.7.12 PGO+LTO build (Ubuntu OS 16.04.2 LTS default at /usr/bin/python), and compared with 3.7.0 alpha1 PGO+LTO (which I built a while ago), on my SkyLake processor based desktop, and 2.7 outperforms 3.7 by 3.5%. 
On your 2.5 GHz i7 system, I'd recommend making sure the 2 Python binaries you are comparing are in equal footings (compiled with same optimization PGO+LTO). Thanks, Peter -----Original Message----- From: Python-Dev [mailto:python-dev-bounces+peter.xihong.wang=intel.com at python.org] On Behalf Of Nathaniel Smith Sent: Tuesday, July 18, 2017 7:00 PM To: Ben Hoyt > Cc: Python-Dev > Subject: Re: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? I'd probably start with a regular C-level profiler, like perf or callgrind. They're not very useful for comparing two versions of code written in Python, but here the Python code is the same (modulo changes in the stdlib), and it's changes in the interpreter's C code that probably make the difference. On Tue, Jul 18, 2017 at 9:03 AM, Ben Hoyt > wrote: > Hi folks, > > (Not entirely sure this is the right place for this question, but > hopefully it's of interest to several folks.) > > A few days ago I posted a note in response to Victor Stinner's > articles on his CPython contributions, noting that I wrote a program > that ran in 11.7 seconds on Python 2.7, but only takes 5.1 seconds on > Python 3.5 (on my 2.5 GHz macOS i7), more than 2x as fast. Obviously > this is a Good Thing, but I'm curious as to why there's so much difference. > > The program is a pentomino puzzle solver, and it works via code > generation, generating a ton of nested "if" statements, so I believe > it's exercising the Python bytecode interpreter heavily. Obviously > there have been some big optimizations to make this happen, but I'm > curious what the main improvements are that are causing this much difference. > > There's a writeup about my program here, with benchmarks at the bottom: > http://benhoyt.com/writings/python-pentomino/ > > This is the generated Python code that's being exercised: > https://github.com/benhoyt/python-pentomino/blob/master/generated_solv > e.py > > For reference, on Python 3.6 it runs in 4.6 seconds (same on Python > 3.7 alpha). This smallish increase from Python 3.5 to Python 3.6 was > more expected to me due to the bytecode changing to wordcode in 3.6. > > I tried using cProfile on both Python versions, but that didn't say > much, because the functions being called aren't taking the majority of the time. > How does one benchmark at a lower level, or otherwise explain what's > going on here? > > Thanks, > Ben > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/njs%40pobox.com > -- Nathaniel J. Smith -- https://vorpus.org _______________________________________________ Python-Dev mailing list Python-Dev at python.org https://mail.python.org/mailman/listinfo/python-dev Unsubscribe: https://mail.python.org/mailman/options/python-dev/peter.xihong.wang%40intel.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg at krypto.org Mon Jul 24 17:03:42 2017 From: greg at krypto.org (Gregory P. Smith) Date: Mon, 24 Jul 2017 21:03:42 +0000 Subject: [Python-Dev] Program runs in 12s on Python 2.7, but 5s on Python 3.5 -- why so much difference? 
In-Reply-To: <371EBC7881C7844EAAF5556BFF21BCCC830D9970@ORSMSX105.amr.corp.intel.com> References: <371EBC7881C7844EAAF5556BFF21BCCC830D98CF@ORSMSX105.amr.corp.intel.com> <371EBC7881C7844EAAF5556BFF21BCCC830D9970@ORSMSX105.amr.corp.intel.com> Message-ID: On Mon, Jul 24, 2017 at 1:49 PM Wang, Peter Xihong < peter.xihong.wang at intel.com> wrote: > I believe we have evaluated clang vs gcc before (long time ago), and gcc > won at that time. > > > > PGO might have overshadowed impact from computed goto, and thus the latter > may no longer be needed. > Computed goto is still needed. PGO does not magically replace it. A PGO build with computed goto is faster than one without computed goto. ... as tested on gcc 4.9 a couple years ago. I doubt that has changed or changes between compilers; PGO and computed goto are different types of optimizations. -gps -------------- next part -------------- An HTML attachment was scrubbed... URL: From mariatta.wijaya at gmail.com Mon Jul 24 18:20:07 2017 From: mariatta.wijaya at gmail.com (Mariatta Wijaya) Date: Mon, 24 Jul 2017 15:20:07 -0700 Subject: [Python-Dev] Appending a link back to bugs.python.org in GitHub PRs In-Reply-To: References: Message-ID: Thanks for working on this, Kushal and Brett. Works great! Mariatta Wijaya On Fri, Jul 21, 2017 at 2:28 PM, Brett Cannon wrote: > Thanks to Kushal Das we now have one of the most requested features since > the transition: a link in PRs back to bugs.python.org (in a more > discoverable way since we have had them since Bedevere launched :) . When a > pull request comes in with an issue number in the title (or one gets > added), a link to bugs.python.org will be appended to the PR's body (the > message you fill out when creating a PR). There's no logic to remove the > link if the issue number is removed from the title, changed, or for > multiple issue numbers since basically those cases are all rare and it was > easier to launch without that kind of support. > > P.S.: Berker Peksag is working on providing commit emails with diffs in > them which is the other most requested feature since the transition. > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > mariatta.wijaya%40gmail.com > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Tue Jul 25 01:51:10 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 25 Jul 2017 15:51:10 +1000 Subject: [Python-Dev] for...else In-Reply-To: References: <20170724161447.GQ3149@ando.pearwood.info> Message-ID: On 25 July 2017 at 02:23, Ben Hoyt wrote: > This is more of a python-ideas discussion, and Steven's answer is good. > > I'll just add one thing. Maybe it's obvious to others, but I've liked > for...else since I found a kind of mnemonic to help me remember when the > "else" part happens: I think of it not as "for ... else" but as "break ... > else" -- saying it this way makes it clear to me that the break goes with > the else. "If this condition inside the loop is true, break. ... *else* if > we didn't break, do this other thing after the loop." 
For folks looking for a more in-depth explanation of the "if-break-else" approach to thinking about this construct: http://python-notes.curiousefficiency.org/en/latest/python_concepts/break_else.html That article also has a note explaining that we're unlikely to ever change this: http://python-notes.curiousefficiency.org/en/latest/python_concepts/break_else.html#but-couldn-t-python-be-different Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From larry at hastings.org Tue Jul 25 04:37:02 2017 From: larry at hastings.org (Larry Hastings) Date: Tue, 25 Jul 2017 01:37:02 -0700 Subject: [Python-Dev] RELEASED] Python 3.4.7rc1 and Python 3.5.4rc1 are now available Message-ID: On behalf of the Python development community and the Python 3.4 and Python 3.5 release teams, I'm relieved to announce the availability of Python 3.4.7rc1 and Python 3.5.4rc1. Python 3.4 is now in "security fixes only" mode. This is the final stage of support for Python 3.4. Python 3.4 now only receives security fixes, not bug fixes, and Python 3.4 releases are source code only--no more official binary installers will be produced. Python 3.5.4 will be the final 3.5 release in "bug fix" mode. After 3.5.4 is released, Python 3.5 will also move into "security fixes mode". Both these releases are "release candidates". They should not be considered the final releases, although the final releases should contain only minor differences. Python users are encouraged to test with these releases and report any problems they encounter. You can find Python 3.4.7rc1 here: https://www.python.org/downloads/release/python-347rc1/ And you can find Python 3.5.4rc1 here: https://www.python.org/downloads/release/python-354rc1/ Python 3.4.7 final and Python 3.5.4 final are both scheduled for release on August 6th, 2017. Happy Pythoning, //arry/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From victor.stinner at gmail.com Tue Jul 25 06:11:22 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Tue, 25 Jul 2017 12:11:22 +0200 Subject: [Python-Dev] [python-committers] RELEASED] Python 3.4.7rc1 and Python 3.5.4rc1 are now available In-Reply-To: References: Message-ID: 2017-07-25 10:37 GMT+02:00 Larry Hastings : > On behalf of the Python development community and the Python 3.4 and Python > 3.5 release teams, I'm relieved to announce the availability of Python > 3.4.7rc1 and Python 3.5.4rc1. I checked for known security vulnerabilities: except of the issue #29606, all known vulnerabilties are fixed in these versions. It's ok, the issue #29606 is not fixed in master yet: https://bugs.python.org/issue29606 Victor From berker.peksag at gmail.com Tue Jul 25 06:46:27 2017 From: berker.peksag at gmail.com (=?UTF-8?Q?Berker_Peksa=C4=9F?=) Date: Tue, 25 Jul 2017 13:46:27 +0300 Subject: [Python-Dev] Appending a link back to bugs.python.org in GitHub PRs In-Reply-To: References: Message-ID: On Sat, Jul 22, 2017 at 12:28 AM, Brett Cannon wrote: > P.S.: Berker Peksag is working on providing commit emails with diffs in them > which is the other most requested feature since the transition. I forgot to give a status update on this. I deployed it on Heroku last week. 
You can see an example email at https://mail.python.org/pipermail/python-checkins/2017-July/151296.html --Berker From rob.cliffe at btinternet.com Tue Jul 25 06:28:38 2017 From: rob.cliffe at btinternet.com (Rob Cliffe) Date: Tue, 25 Jul 2017 11:28:38 +0100 Subject: [Python-Dev] for...else In-Reply-To: References: <20170724161447.GQ3149@ando.pearwood.info> Message-ID: <711e812f-f84d-dc12-1402-3c9c6a0d672b@btinternet.com> On 25/07/2017 06:51, Nick Coghlan wrote: > On 25 July 2017 at 02:23, Ben Hoyt wrote: >> This is more of a python-ideas discussion, and Steven's answer is good. >> >> I'll just add one thing. Maybe it's obvious to others, but I've liked >> for...else since I found a kind of mnemonic to help me remember when the >> "else" part happens: I think of it not as "for ... else" but as "break ... >> else" -- saying it this way makes it clear to me that the break goes with >> the else. "If this condition inside the loop is true, break. ... *else* if >> we didn't break, do this other thing after the loop." > For folks looking for a more in-depth explanation of the > "if-break-else" approach to thinking about this construct: > http://python-notes.curiousefficiency.org/en/latest/python_concepts/break_else.html A helpful explanation. But that it is necessary at all underlines that (IMHO) this use of 'else' is unnatural and hard to understand. I always have to think twice about it, whether reading it or using it myself. Therefore I would have preferred a more obvious keyword such as 'ifnobreak' (others may think of something better). But as has been stated, it's not going to change. Rob Cliffe > > That article also has a note explaining that we're unlikely to ever > change this: http://python-notes.curiousefficiency.org/en/latest/python_concepts/break_else.html#but-couldn-t-python-be-different > > Cheers, > Nick. > From benhoyt at gmail.com Tue Jul 25 10:30:01 2017 From: benhoyt at gmail.com (Ben Hoyt) Date: Tue, 25 Jul 2017 10:30:01 -0400 Subject: [Python-Dev] Appending a link back to bugs.python.org in GitHub PRs In-Reply-To: References: Message-ID: With the linking back and forth, I'm curious why there wasn't a switch to use GitHub's issue tracker when we switched to GitHub. I'm sure there was previous discussion about this and good reasons not to, but couldn't find those quickly (PEP 512, Google search, etc) -- can someone point me in the right direction? -Ben On Fri, Jul 21, 2017 at 5:28 PM, Brett Cannon wrote: > Thanks to Kushal Das we now have one of the most requested features since > the transition: a link in PRs back to bugs.python.org (in a more > discoverable way since we have had them since Bedevere launched :) . When a > pull request comes in with an issue number in the title (or one gets > added), a link to bugs.python.org will be appended to the PR's body (the > message you fill out when creating a PR). There's no logic to remove the > link if the issue number is removed from the title, changed, or for > multiple issue numbers since basically those cases are all rare and it was > easier to launch without that kind of support. > > P.S.: Berker Peksag is working on providing commit emails with diffs in > them which is the other most requested feature since the transition. 
> > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > benhoyt%40gmail.com > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From brett at python.org Tue Jul 25 12:21:05 2017 From: brett at python.org (Brett Cannon) Date: Tue, 25 Jul 2017 16:21:05 +0000 Subject: [Python-Dev] Appending a link back to bugs.python.org in GitHub PRs In-Reply-To: References: Message-ID: On Tue, 25 Jul 2017 at 07:30 Ben Hoyt wrote: > With the linking back and forth, I'm curious why there wasn't a switch to > use GitHub's issue tracker when we switched to GitHub. I'm sure there was > previous discussion about this and good reasons not to, but couldn't find > those quickly (PEP 512, Google search, etc) -- can someone point me in the > right direction? -Ben > Basically there was push-back on the idea and I only had enough time and patience for one major infrastructure change that was somewhat controversial and not for two. -Brett > > On Fri, Jul 21, 2017 at 5:28 PM, Brett Cannon wrote: > >> Thanks to Kushal Das we now have one of the most requested features since >> the transition: a link in PRs back to bugs.python.org (in a more >> discoverable way since we have had them since Bedevere launched :) . When a >> pull request comes in with an issue number in the title (or one gets >> added), a link to bugs.python.org will be appended to the PR's body (the >> message you fill out when creating a PR). There's no logic to remove the >> link if the issue number is removed from the title, changed, or for >> multiple issue numbers since basically those cases are all rare and it was >> easier to launch without that kind of support. >> >> P.S.: Berker Peksag is working on providing commit emails with diffs in >> them which is the other most requested feature since the transition. >> >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> > Unsubscribe: >> https://mail.python.org/mailman/options/python-dev/benhoyt%40gmail.com >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jeanpierreda at gmail.com Wed Jul 26 00:59:10 2017 From: jeanpierreda at gmail.com (Devin Jeanpierre) Date: Tue, 25 Jul 2017 21:59:10 -0700 Subject: [Python-Dev] Am I allowed to use C++-style // comments? Message-ID: https://www.python.org/dev/peps/pep-0007/ says two things: > Python versions greater than or equal to 3.6 use C89 with several select C99 features: > [...] > C++-style line comments and also: > Never use C++ style // one-line comments. Which is it? -- Devin From benjamin at python.org Wed Jul 26 01:00:51 2017 From: benjamin at python.org (Benjamin Peterson) Date: Tue, 25 Jul 2017 22:00:51 -0700 Subject: [Python-Dev] Am I allowed to use C++-style // comments? In-Reply-To: References: Message-ID: <1501045251.2835685.1052807344.0F995199@webmail.messagingengine.com> On Tue, Jul 25, 2017, at 21:59, Devin Jeanpierre wrote: > https://www.python.org/dev/peps/pep-0007/ says two things: > > > Python versions greater than or equal to 3.6 use C89 with several select C99 features: > > [...] > > C++-style line comments This section overrides further edicts in the document for Python 3.6+. > > and also: > > > Never use C++ style // one-line comments. > > Which is it? 
From jeanpierreda at gmail.com Wed Jul 26 01:04:12 2017 From: jeanpierreda at gmail.com (Devin Jeanpierre) Date: Tue, 25 Jul 2017 22:04:12 -0700 Subject: [Python-Dev] Am I allowed to use C++-style // comments? In-Reply-To: References: Message-ID: I actually realized right after I sent this that I am writing C++, so maybe it's a moot point. (Still trying to figure out how to use C for this, but it's an optional extension module only exposed for testing, so maybe it really doesn't matter.) Context is https://github.com/python/cpython/pull/2878 -- Devin On Tue, Jul 25, 2017 at 9:59 PM, Devin Jeanpierre wrote: > https://www.python.org/dev/peps/pep-0007/ says two things: > >> Python versions greater than or equal to 3.6 use C89 with several select C99 features: >> [...] >> C++-style line comments > > and also: > >> Never use C++ style // one-line comments. > > Which is it? > > -- Devin From ncoghlan at gmail.com Wed Jul 26 11:35:43 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 27 Jul 2017 01:35:43 +1000 Subject: [Python-Dev] Appending a link back to bugs.python.org in GitHub PRs In-Reply-To: References: Message-ID: On 26 July 2017 at 02:21, Brett Cannon wrote: > > > On Tue, 25 Jul 2017 at 07:30 Ben Hoyt wrote: >> >> With the linking back and forth, I'm curious why there wasn't a switch to >> use GitHub's issue tracker when we switched to GitHub. I'm sure there was >> previous discussion about this and good reasons not to, but couldn't find >> those quickly (PEP 512, Google search, etc) -- can someone point me in the >> right direction? -Ben > > Basically there was push-back on the idea and I only had enough time and > patience for one major infrastructure change that was somewhat controversial > and not for two. And unlike repository management and review management, we don't have any major process bottlenecks specifically related to bugs.python.org, and Github's issue tracker is merely "good enough if you don't otherwise have an issue tracker" rather than being exemplary the way their repository and review management are. So given the volume of incoming references to the current issue URLs, the potential for increased lock-in to a proprietary service provider with non-public finances, the difficulty of actually doing such a migration, and the questionable practical benefits, "Integrate Roundup with GitHub" was the default winner over doing a second data migration. The idea of moving tracker development *itself* to GitHub (and hence getting to dispense with the metatracker in favour of a GitHub repo with issues enabled) *has* been raised, and may be worth considering, but that would be up to the folks that actually do the bulk of the work on tracker maintenance (Berker, Ezio, Maciej, etc) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From k7hoven at gmail.com Wed Jul 26 18:55:34 2017 From: k7hoven at gmail.com (Koos Zevenhoven) Date: Thu, 27 Jul 2017 01:55:34 +0300 Subject: [Python-Dev] for...else In-Reply-To: <20170724161447.GQ3149@ando.pearwood.info> References: <20170724161447.GQ3149@ando.pearwood.info> Message-ID: On Mon, Jul 24, 2017 at 7:14 PM, Steven D'Aprano wrote: > Hello Kiuhnm, and welcome. > > On Mon, Jul 24, 2017 at 05:35:03PM +0200, Kiuhnm via Python-Dev wrote: > > Hello, > > > > I think that the expression "for...else" or "while...else" is completely > > counter-intuitive. > > > You may be right -- this has been discussed many, many times before. In > my personal opinion, the best (and only accurate!) 
phrase would have > been: > > for item in sequence: > # block > then: > # block > > IMO, for item in sequence: # block nobreak: # or perhaps `if not break:` # block would be clearer (if the syntax is necessary at all). [...] > > > > Wouldn't it be possible to make it clearer? Maybe > > something like > > At this point, no, it is not practical to change the syntax used. Maybe > when Python 3.0 was first introduced, but that ship has long sailed. It > is very, very unlikely that the syntax for this will ever change, but if > it does, it probably won't be until something in the distant future like > Python 5. > I don't have a strong opinion on this particular case, but if something like this is changed in Python 5, I think the decision should be made much earlier (now?) so that the old else syntax could be discouraged (and new syntax potentially already introduced). The same thing would apply to many other "possibly in Python 5" changes, where there is no reason to expect that the situation is somehow different years later. -- Koos > > But not Python 4: Guido has already ruled that Python 4 will not include > major backwards-incompatible changes. Going from 3 to 4 will not be as > disruptive as going from 2 to 3. > > [...] -- + Koos Zevenhoven + http://twitter.com/k7hoven + -------------- next part -------------- An HTML attachment was scrubbed...
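For anyone skimming the thread, this is the construct under discussion, written with the syntax Python actually has today; the else block runs only when the loop finishes without hitting break:

    def find(needle, haystack):
        for item in haystack:
            if item == needle:
                print("found it")
                break
        else:
            # Runs only if the loop was not ended by `break`.
            print("not found")

    find(3, [1, 2, 3])   # prints "found it"
    find(9, [1, 2, 3])   # prints "not found"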
URL: From python at mrabarnett.plus.com Wed Jul 26 20:46:23 2017 From: python at mrabarnett.plus.com (MRAB) Date: Thu, 27 Jul 2017 01:46:23 +0100 Subject: [Python-Dev] for...else In-Reply-To: References: <20170724161447.GQ3149@ando.pearwood.info> <815112c0-68f0-341d-ea6d-6db979a14261@mrabarnett.plus.com> Message-ID: <70501486-23dc-2c91-ad48-c400ca979f2b@mrabarnett.plus.com> On 2017-07-27 01:07, Koos Zevenhoven wrote: > On Jul 27, 2017 02:38, "MRAB" > wrote: > > On 2017-07-26 23:55, Koos Zevenhoven wrote: > > > ?IMO, > > for item in sequence: > # block > nobreak: # or perhaps `if not break:` > # block > > would be clearer (if the syntax is necessary at all). > > > You couldn't have "if not break:" because that would look like the > start of an 'if' statement. > > > Do you mean as an implementation issue or for human readability? > I suppose you _could_ use "if not break:", but as 'if' normally indicates the start of an 'if' statement, you would get complaints about it! :-) Maybe it would be clearer if it was "elif not break:". :-) > "nobreak" would introduce a new keyword, but "not break" wouldn't. > > > Sure :) > From tjreedy at udel.edu Wed Jul 26 22:01:01 2017 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 26 Jul 2017 22:01:01 -0400 Subject: [Python-Dev] for...else In-Reply-To: References: <20170724161447.GQ3149@ando.pearwood.info> <815112c0-68f0-341d-ea6d-6db979a14261@mrabarnett.plus.com> Message-ID: This discussion belongs on python-list (where is it mostly a repeat). -- Terry Jan Reedy From python-dev at mgmiller.net Wed Jul 26 22:34:51 2017 From: python-dev at mgmiller.net (Mike Miller) Date: Wed, 26 Jul 2017 19:34:51 -0700 Subject: [Python-Dev] for...else In-Reply-To: <815112c0-68f0-341d-ea6d-6db979a14261@mrabarnett.plus.com> References: <20170724161447.GQ3149@ando.pearwood.info> <815112c0-68f0-341d-ea6d-6db979a14261@mrabarnett.plus.com> Message-ID: <7bb5440a-aea9-9986-5f77-5aa2bff10c6a@mgmiller.net> On 2017-07-26 16:36, MRAB wrote: > "nobreak" would introduce a new keyword, but "not break" wouldn't. Whenever I've used the for-else, I've put a # no-break right next to it, to remind myself as much as anyone else. for...: not break: is the best alternative I've yet seen, congrats. Perhaps in Python 5 it can be enabled, with for-else: used instead for empty iterables, as that's what I expected the first few dozen times. -Mike From jmatejek at suse.cz Thu Jul 27 09:48:34 2017 From: jmatejek at suse.cz (jan matejek) Date: Thu, 27 Jul 2017 15:48:34 +0200 Subject: [Python-Dev] Non-stable pyc results on python 3.6 Message-ID: <3b3357c2-1583-4dc4-06c6-a68f82ca301f@suse.cz> hello, we're seeing strange problems when trying to do reproducible builds of some python 3.6 modules. Namely, from one build to another, there will be something like the following difference in the compiled object: 00004e40 da 07 5f 5f 61 6c 6c 5f 5f da 0a 5f 5f 61 75 74 |..__all__..__aut| -00004e50 68 6f 72 5f 5f da 07 64 65 63 69 6d 61 6c 72 0c |hor__..decimalr.| +00004e50 68 6f 72 5f 5f 5a 07 64 65 63 69 6d 61 6c 72 0c |hor__Z.decimalr.| 00004e60 00 00 00 72 43 00 00 00 72 08 00 00 00 72 41 00 |...rC...r....rA.| This specific one is in the top-level co_names segment and the 0x5a vs 0xda byte is TYPE_SHORT_ASCII_INTERNED, with FLAG_REF set or unset. I'm also seeing off-by-one differences in reference ids, i.e., the number appearing after TYPE_REF. 
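A side note on the two byte values quoted in the hexdump above: marshal's type code for a short interned ASCII string is the character 'Z' (0x5a), and FLAG_REF is the high bit (0x80), so the 0x5a/0xda pair differs only in whether the string was flagged for back-referencing:

    >>> hex(ord('Z'))
    '0x5a'
    >>> hex(ord('Z') | 0x80)   # the same type code with the FLAG_REF bit set
    '0xda'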
Not in all cases, but it seems that when a "part" is affected, all references in that "part" are changed (for some value of "part"; all the knowledge of pycs I have was gained from about an hour of reading marshal.c). So that seems to imply that there's a reference that is sometimes included and sometimes not? This is most often found in __init__.py. Often this affects optimized pycs, but we can see it in un-optimized as well. The issue is rare -- 99% of all pycs are stable -- but when it occurs, it's easy to replicate it in the same place. This also happens on different machines, so that seems to rule out hardware memory errors :) The pycs in question are generated by normal "setup.py build" -> "setup.py install". It happens on Python 3.6 but not on Python 2.7. I'm not sure about Python 3.5 because we don't currently use it. It doesn't seem to depend on hash seed - the instability is observed even with PYTHONHASHSEED set to zero. What seems to fix it, however, is running the build on disorderfs, which ensures that the filesystem entries are in the same order. Any ideas why something like this would happen and why would it be correlated with filesystem ordering? thanks m. -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: OpenPGP digital signature URL: From ncoghlan at gmail.com Thu Jul 27 10:54:19 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 28 Jul 2017 00:54:19 +1000 Subject: [Python-Dev] Non-stable pyc results on python 3.6 In-Reply-To: <3b3357c2-1583-4dc4-06c6-a68f82ca301f@suse.cz> References: <3b3357c2-1583-4dc4-06c6-a68f82ca301f@suse.cz> Message-ID: On 27 July 2017 at 23:48, jan matejek wrote: > This is most often found in __init__.py. Often this affects optimized pycs, but we can see it in > un-optimized as well. > The issue is rare -- 99% of all pycs are stable -- but when it occurs, it's easy to replicate it in > the same place. This also happens on different machines, so that seems to rule out hardware memory > errors :) The marshal implementation received some significant optimisations in 3.5 [1] and a new marshal format in 3.4 [2], so if you're able to check the behaviour in 3.4 and 3.5 that would be helpful: if the problem occurs in 3.5, but *not* in 3.4, the hashtable based optimisations would be the place to start looking, while if 3.4 misbehaves as well, then there may be some general inconsistency to resolve in how the module decides which instances to mark with `FLAG_REF`. The fact that disorderfs makes a difference does make me a little suspicious, as there's an early exit from the FLAG_REF setting code related to objects having exactly one live reference. Courtesy of string interning and other immutable object caches, order of code compilation can affect how many live references there are to strings and other constants, and hence there may be cases where marshal *won't* flag an immutable object if *only* that particular code object has been compiled, but *will* flag it if some other code object has been created first. That check has been there since version 3 of the marshal format was defined, so if it *is* the culprit, then you'll see this misbehaviour with 3.4 as well. Cheers, Nick. 
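A rough way to probe this kind of instability, independent of the refcount theory above, is to compile the same module in two freshly started interpreters and compare everything after the 12-byte pyc header (magic number, source mtime and size on 3.6). The sketch below is only an illustration and not part of the original report; the module name is a placeholder, and because it compiles a single file at a time it may well not reproduce a problem that depends on build ordering:

    import subprocess
    import sys
    import tempfile
    from pathlib import Path

    SOURCE = "mymodule.py"   # placeholder for the module whose pyc looks unstable

    def compile_once(source, out_dir):
        # Use a separate interpreter so this script's own state cannot leak in.
        pyc = str(Path(out_dir) / "out.pyc")
        cmd = "import py_compile; py_compile.compile(%r, %r)" % (source, pyc)
        subprocess.run([sys.executable, "-c", cmd], check=True)
        return Path(pyc).read_bytes()

    with tempfile.TemporaryDirectory() as d1, tempfile.TemporaryDirectory() as d2:
        a = compile_once(SOURCE, d1)
        b = compile_once(SOURCE, d2)

    # Offset 12 is where the marshalled code object starts in a 3.6 pyc.
    print("payloads identical" if a[12:] == b[12:] else "payloads differ")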
[1] https://docs.python.org/3/whatsnew/3.5.html#optimizations [2] https://github.com/python/cpython/commit/d7009c69136a3809282804f460902ab42e9972f6 -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ncoghlan at gmail.com Thu Jul 27 11:08:53 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 28 Jul 2017 01:08:53 +1000 Subject: [Python-Dev] Non-stable pyc results on python 3.6 In-Reply-To: References: <3b3357c2-1583-4dc4-06c6-a68f82ca301f@suse.cz> Message-ID: On 28 Jul 2017 00:54, "Nick Coghlan" wrote: The fact that disorderfs makes a difference does make me a little suspicious, as there's an early exit from the FLAG_REF setting code related to objects having exactly one live reference. Courtesy of string interning and other immutable object caches, order of code compilation can affect how many live references there are to strings and other constants, and hence there may be cases where marshal *won't* flag an immutable object if *only* that particular code object has been compiled, but *will* flag it if some other code object has been created first. That check has been there since version 3 of the marshal format was defined, so if it *is* the culprit, then you'll see this misbehaviour with 3.4 as well. It occurs to me that it would likely be easier for you to just test that theory directly: the check that now seems suspicious to me is the one that calls "Py_REFCNT" (and is the only reference to that API in the file), so if you comment that out, and the problem goes away, it is most likely the cause of the currently variable behaviour. Cheers, Nick. -------------- next part -------------- An HTML attachment was scrubbed... URL: From python at mrabarnett.plus.com Thu Jul 27 15:19:08 2017 From: python at mrabarnett.plus.com (MRAB) Date: Thu, 27 Jul 2017 20:19:08 +0100 Subject: [Python-Dev] for...else In-Reply-To: <7bb5440a-aea9-9986-5f77-5aa2bff10c6a@mgmiller.net> References: <20170724161447.GQ3149@ando.pearwood.info> <815112c0-68f0-341d-ea6d-6db979a14261@mrabarnett.plus.com> <7bb5440a-aea9-9986-5f77-5aa2bff10c6a@mgmiller.net> Message-ID: <52163635-e530-cf54-a1e1-53d7bc8af091@mrabarnett.plus.com> On 2017-07-27 03:34, Mike Miller wrote: > > > On 2017-07-26 16:36, MRAB wrote: >> "nobreak" would introduce a new keyword, but "not break" wouldn't. > > Whenever I've used the for-else, I've put a # no-break right next to it, to > remind myself as much as anyone else. > > for...: not break: is the best alternative I've yet seen, congrats. Perhaps in > Python 5 it can be enabled, with for-else: used instead for empty iterables, as > that's what I expected the first few dozen times. > For empty iterables, how about "elif None:"? :-) From eric.lafontaine1 at gmail.com Thu Jul 27 20:27:04 2017 From: eric.lafontaine1 at gmail.com (Eric Lafontaine) Date: Thu, 27 Jul 2017 20:27:04 -0400 Subject: [Python-Dev] for...else In-Reply-To: <52163635-e530-cf54-a1e1-53d7bc8af091@mrabarnett.plus.com> References: <20170724161447.GQ3149@ando.pearwood.info> <815112c0-68f0-341d-ea6d-6db979a14261@mrabarnett.plus.com> <7bb5440a-aea9-9986-5f77-5aa2bff10c6a@mgmiller.net> <52163635-e530-cf54-a1e1-53d7bc8af091@mrabarnett.plus.com> Message-ID: funny ; this made me think of this talk; https://youtu.be/OSGv2VnC0go?t=1013 ?ric Lafontaine | Membre du Projet VUE, Groupe Contr?le G?nie ?lectrique, 54?me promotion UdeS | ?tudiant en maitrise TI ? l'ETS VAS OPS chez Bell Mobility ? 
We want to offer an alternative means of transport by presenting an electric vehicle designed specifically for urban travel. 2017-07-27 15:19 GMT-04:00 MRAB : > On 2017-07-27 03:34, Mike Miller wrote: > >> >> >> On 2017-07-26 16:36, MRAB wrote: >> >>> "nobreak" would introduce a new keyword, but "not break" wouldn't. >>> >> >> Whenever I've used the for-else, I've put a # no-break right next to it, >> to >> remind myself as much as anyone else. >> >> for...: not break: is the best alternative I've yet seen, congrats. >> Perhaps in >> Python 5 it can be enabled, with for-else: used instead for empty >> iterables, as >> that's what I expected the first few dozen times. >> >> For empty iterables, how about "elif None:"? :-) > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/eric.lafontaine1%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From desmoulinmichel at gmail.com Fri Jul 28 05:17:24 2017 From: desmoulinmichel at gmail.com (Michel Desmoulin) Date: Fri, 28 Jul 2017 11:17:24 +0200 Subject: [Python-Dev] for...else In-Reply-To: <52163635-e530-cf54-a1e1-53d7bc8af091@mrabarnett.plus.com> References: <20170724161447.GQ3149@ando.pearwood.info> <815112c0-68f0-341d-ea6d-6db979a14261@mrabarnett.plus.com> <7bb5440a-aea9-9986-5f77-5aa2bff10c6a@mgmiller.net> <52163635-e530-cf54-a1e1-53d7bc8af091@mrabarnett.plus.com> Message-ID: <9ede1c70-74fb-8f71-9666-d3512a42c7a2@gmail.com> elif break and elif None: I'd like that very much. It's weird to bend the semantics of break and None, but it's in such a dark corner of Python anyway that it doesn't bother me. On 27/07/2017 at 21:19, MRAB wrote: > On 2017-07-27 03:34, Mike Miller wrote: >> >> >> On 2017-07-26 16:36, MRAB wrote: >>> "nobreak" would introduce a new keyword, but "not break" wouldn't. >> >> Whenever I've used the for-else, I've put a # no-break right next to >> it, to >> remind myself as much as anyone else. >> >> for...: not break: is the best alternative I've yet seen, congrats. >> Perhaps in >> Python 5 it can be enabled, with for-else: used instead for empty >> iterables, as >> that's what I expected the first few dozen times. >> > For empty iterables, how about "elif None:"? :-) > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/desmoulinmichel%40gmail.com > From status at bugs.python.org Fri Jul 28 12:09:23 2017 From: status at bugs.python.org (Python tracker) Date: Fri, 28 Jul 2017 18:09:23 +0200 (CEST) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20170728160923.391DC5640B@psf.upfronthosting.co.za> ACTIVITY SUMMARY (2017-07-21 - 2017-07-28) Python tracker at http://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. Issues counts and deltas: open 6088 (+30) closed 36736 (+57) total 42824 (+87) Open issues with patches: 2337 Issues opened (72) ================== #26732: multiprocessing sentinel resource leak http://bugs.python.org/issue26732 reopened by haypo #28734: argparse: successive parsing wipes out nargs=?
values http://bugs.python.org/issue28734 reopened by paul.j3 #29512: regrtest refleak: implement bisection feature http://bugs.python.org/issue29512 reopened by haypo #30188: test_nntplib: random EOFError in setUpClass() http://bugs.python.org/issue30188 reopened by haypo #30778: test_bsddb3 crash on x86 Windows XP 2.7 http://bugs.python.org/issue30778 reopened by haypo #30985: Set closing variable in asyncore at close http://bugs.python.org/issue30985 opened by walkhour #30986: Add --include-py argument to Tools/msi/make_zip.py http://bugs.python.org/issue30986 opened by Segev Finer #30987: Support for ISO-TP protocol in SocketCAN http://bugs.python.org/issue30987 opened by Pier-Yves Lessard #30988: Exception parsing invalid email address headers starting or en http://bugs.python.org/issue30988 opened by timb07 #30989: Sort only when needed in TimedRotatingFileHandler's getFilesTo http://bugs.python.org/issue30989 opened by Lovesh Harchandani #30990: Calls to C functions using `.__call__` don't get sent to profi http://bugs.python.org/issue30990 opened by ppperry #30991: test_ctypes ,test_dbm and test_ssl fail on arm64 (aarch64) ar http://bugs.python.org/issue30991 opened by jonny789 #30992: Invalid PGP Key Prevents Archive Validation http://bugs.python.org/issue30992 opened by cwprogram #30995: Support logging.getLogger(style='{') http://bugs.python.org/issue30995 opened by mitar #30996: add coroutine AbstractEventLoop.sock_close http://bugs.python.org/issue30996 opened by cfy #30997: TestCase.subTest and expectedFailure http://bugs.python.org/issue30997 opened by ronaldoussoren #30999: statistics module: add "key" keyword argument to median, mode, http://bugs.python.org/issue30999 opened by gerion #31000: Test failure in resource module on ZFS http://bugs.python.org/issue31000 opened by larry #31001: IDLE: Add tests for configdialog highlight tab http://bugs.python.org/issue31001 opened by terry.reedy #31002: IDLE: Add tests for configdialog keys tab http://bugs.python.org/issue31002 opened by terry.reedy #31004: IDLE, configdialog: Factor out FontTab class from ConfigDialog http://bugs.python.org/issue31004 opened by terry.reedy #31005: caught and stored exception creates a reference cycle outside http://bugs.python.org/issue31005 opened by vojtechfried #31006: typing.NamedTuple should add annotations to its constructor (_ http://bugs.python.org/issue31006 opened by Antony.Lee #31007: ERROR: test_pipe_handle (test.test_asyncio.test_windows_utils. 
http://bugs.python.org/issue31007 opened by haypo #31008: FAIL: test_wait_for_handle (test.test_asyncio.test_windows_eve http://bugs.python.org/issue31008 opened by haypo #31010: test_socketserver.test_ForkingTCPServer(): threading_cleanup() http://bugs.python.org/issue31010 opened by haypo #31011: Users (except from the one who installed) not able to see pyt http://bugs.python.org/issue31011 opened by Debarshi.Goswami #31012: suggestion: allow termination argument in argparse to be speci http://bugs.python.org/issue31012 opened by Leon Avery #31013: gcc7 throws warning when pymem.h development header is used http://bugs.python.org/issue31013 opened by Gabriel Somlo #31014: webbrowser._synthesize uses outdated calling signature for web http://bugs.python.org/issue31014 opened by jmsdvl #31015: PyErr_WriteUnraisable should be more verbose in Python 2.7 http://bugs.python.org/issue31015 opened by christian.aguilera at foundry.com #31016: [Regression] sphinx shows an EOF error when using python2.7 fr http://bugs.python.org/issue31016 opened by doko #31020: Add support for custom compressor in tarfile http://bugs.python.org/issue31020 opened by insomniacslk #31021: Clarify programming faq. http://bugs.python.org/issue31021 opened by terry.reedy #31022: ERROR: testRegularFile (test.test_socket.SendfileUsingSendTest http://bugs.python.org/issue31022 opened by haypo #31024: typing.Tuple is class but is defined as data inside https://do http://bugs.python.org/issue31024 opened by Bern??t G??bor #31026: test_dbm fails when run directly http://bugs.python.org/issue31026 opened by serhiy.storchaka #31027: test_listcomps fails when run directly http://bugs.python.org/issue31027 opened by serhiy.storchaka #31028: test_pydoc fails when run directly http://bugs.python.org/issue31028 opened by serhiy.storchaka #31029: test_tokenize fails when run directly http://bugs.python.org/issue31029 opened by serhiy.storchaka #31030: sys.executable can be not normalized http://bugs.python.org/issue31030 opened by serhiy.storchaka #31031: Unify duplicate bits_in_digit and bit_length http://bugs.python.org/issue31031 opened by niklasf #31033: Add argument to .cancel() of Task and Future http://bugs.python.org/issue31033 opened by socketpair #31035: Document order of firing callbacks added with Future.add_done_ http://bugs.python.org/issue31035 opened by socketpair #31036: building the python docs requires the blurb module http://bugs.python.org/issue31036 opened by doko #31038: test_runpy causes running all Python tests when run directly http://bugs.python.org/issue31038 opened by serhiy.storchaka #31039: Python on android must use ashmem instead of shmem http://bugs.python.org/issue31039 opened by Alex Davies #31040: mimetypes.add_type should complain when you give it an undotte http://bugs.python.org/issue31040 opened by odd_bloke #31041: test_handle_called_with_mp_queue() of test_logging: threading_ http://bugs.python.org/issue31041 opened by haypo #31042: Inconsistency in documentation of operator.index http://bugs.python.org/issue31042 opened by madphysicist #31045: Add a language switch to the Python documentation http://bugs.python.org/issue31045 opened by mdk #31046: ensurepip does not honour the value of $(prefix) http://bugs.python.org/issue31046 opened by xdegaye #31047: Windows: os.path.isabs(os.path.abspath(" ")) == False http://bugs.python.org/issue31047 opened by lazka #31048: ResourceWarning in test_asyncio.test_events.ProactorEventLoopT http://bugs.python.org/issue31048 opened by Segev Finer #31050: 
IDLE, configdialog: Factor out GenTab class from ConfigDialog http://bugs.python.org/issue31050 opened by terry.reedy #31051: IDLE, configdialog, General tab: re-arrange, test user entries http://bugs.python.org/issue31051 opened by terry.reedy #31053: Unnecessary argument in command example http://bugs.python.org/issue31053 opened by cocoatomo #31054: Python 2.7.8 Release does not update the system Path variable http://bugs.python.org/issue31054 opened by Ekrem Saban #31055: All Sphinx generated pages could have a new "Edit This Page" l http://bugs.python.org/issue31055 opened by Paul Hammant #31056: Import Module Not Working According To Documentation Python 3. http://bugs.python.org/issue31056 opened by MrJman006 #31057: pydoc for tempfile.TemporaryDirectory should say it returns th http://bugs.python.org/issue31057 opened by Thomas Thurman #31059: asyncio.StreamReader.read hangs if n<0 http://bugs.python.org/issue31059 opened by s_kostyuk #31061: asyncio segfault when using threadpool and "_asyncio" native m http://bugs.python.org/issue31061 opened by thehesiod #31062: socket.makefile does not handle line buffering http://bugs.python.org/issue31062 opened by kchen #31064: test_ossaudiodev fails under padsp (Linux PulseAudio OSS emula http://bugs.python.org/issue31064 opened by ncoghlan #31065: Documentation for Popen.poll is unclear http://bugs.python.org/issue31065 opened by mark.dickinson #31066: FAIL: test_last_modified (test.test_httpservers.SimpleHTTPServ http://bugs.python.org/issue31066 opened by haypo #31067: test_subprocess.test_leak_fast_process_del_killed() fails rand http://bugs.python.org/issue31067 opened by haypo #31068: test_ttk_guionly hangs on AMD64 Windows8.1 Refleaks 2.7 http://bugs.python.org/issue31068 opened by haypo #31069: test_multiprocessing_spawn leaked a process on AMD64 Windows8. http://bugs.python.org/issue31069 opened by haypo #31070: test_threaded_import: KeyError ignored in _get_module_lock. References: <20170724161447.GQ3149@ando.pearwood.info> <815112c0-68f0-341d-ea6d-6db979a14261@mrabarnett.plus.com> <7bb5440a-aea9-9986-5f77-5aa2bff10c6a@mgmiller.net> <52163635-e530-cf54-a1e1-53d7bc8af091@mrabarnett.plus.com> <9ede1c70-74fb-8f71-9666-d3512a42c7a2@gmail.com> Message-ID: <2a41ff6d-6b3f-cfee-4508-af72e4b8a36e@mrabarnett.plus.com> On 2017-07-28 10:17, Michel Desmoulin wrote: > elif break and elif None: I'd like that very much. It's weird a break > the semantic of break and None, but it's in such a dark corner of Python > anyway I don't bother. > Surely it would not be "elif break", but "elif not break"? > Le 27/07/2017 ? 21:19, MRAB a ?crit : >> On 2017-07-27 03:34, Mike Miller wrote: >>> >>> >>> On 2017-07-26 16:36, MRAB wrote: >>>> "nobreak" would introduce a new keyword, but "not break" wouldn't. >>> >>> Whenever I've used the for-else, I've put a # no-break right next to >>> it, to >>> remind myself as much as anyone else. >>> >>> for...: not break: is the best alternative I've yet seen, congrats. >>> Perhaps in >>> Python 5 it can be enabled, with for-else: used instead for empty >>> iterables, as >>> that's what I expected the first few dozen times. >>> >> For empty iterables, how about "elif None:"? 
:-) From rob.cliffe at btinternet.com Fri Jul 28 18:11:25 2017 From: rob.cliffe at btinternet.com (Rob Cliffe) Date: Fri, 28 Jul 2017 23:11:25 +0100 Subject: [Python-Dev] for...else In-Reply-To: <2a41ff6d-6b3f-cfee-4508-af72e4b8a36e@mrabarnett.plus.com> References: <20170724161447.GQ3149@ando.pearwood.info> <815112c0-68f0-341d-ea6d-6db979a14261@mrabarnett.plus.com> <7bb5440a-aea9-9986-5f77-5aa2bff10c6a@mgmiller.net> <52163635-e530-cf54-a1e1-53d7bc8af091@mrabarnett.plus.com> <9ede1c70-74fb-8f71-9666-d3512a42c7a2@gmail.com> <2a41ff6d-6b3f-cfee-4508-af72e4b8a36e@mrabarnett.plus.com> Message-ID: <9c236fb3-3aaa-605b-a8c3-3ec9e53b5e3b@btinternet.com> On 28/07/2017 20:57, MRAB wrote: > On 2017-07-28 10:17, Michel Desmoulin wrote: >> elif break and elif None: I'd like that very much. It's weird a break >> the semantic of break and None, but it's in such a dark corner of Python >> anyway I don't bother. >> > Surely it would not be "elif break", but "elif not break"? To me, anything beginning with "else" or "elif" suggests an alternative branch, not an additional one (YMMV): if condition: do_something else: do_something_completely_different Therefore I would find "if not break" or even "and if not break" more intuitive. Best wishes Rob Cliffe > >> Le 27/07/2017 ? 21:19, MRAB a ?crit : >>> On 2017-07-27 03:34, Mike Miller wrote: >>>> >>>> >>>> On 2017-07-26 16:36, MRAB wrote: >>>>> "nobreak" would introduce a new keyword, but "not break" wouldn't. >>>> >>>> Whenever I've used the for-else, I've put a # no-break right next to >>>> it, to >>>> remind myself as much as anyone else. >>>> >>>> for...: not break: is the best alternative I've yet seen, congrats. >>>> Perhaps in >>>> Python 5 it can be enabled, with for-else: used instead for empty >>>> iterables, as >>>> that's what I expected the first few dozen times. >>>> >>> For empty iterables, how about "elif None:"? :-) > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/rob.cliffe%40btinternet.com > > > --- > This email has been checked for viruses by AVG. > http://www.avg.com From larry at hastings.org Sat Jul 29 20:48:37 2017 From: larry at hastings.org (Larry Hastings) Date: Sat, 29 Jul 2017 17:48:37 -0700 Subject: [Python-Dev] for...else In-Reply-To: <9c236fb3-3aaa-605b-a8c3-3ec9e53b5e3b@btinternet.com> References: <20170724161447.GQ3149@ando.pearwood.info> <815112c0-68f0-341d-ea6d-6db979a14261@mrabarnett.plus.com> <7bb5440a-aea9-9986-5f77-5aa2bff10c6a@mgmiller.net> <52163635-e530-cf54-a1e1-53d7bc8af091@mrabarnett.plus.com> <9ede1c70-74fb-8f71-9666-d3512a42c7a2@gmail.com> <2a41ff6d-6b3f-cfee-4508-af72e4b8a36e@mrabarnett.plus.com> <9c236fb3-3aaa-605b-a8c3-3ec9e53b5e3b@btinternet.com> Message-ID: <9d0b6b88-33dd-2f2e-93b9-a224ce1844bc@hastings.org> As previously requested: please take this discussion to python-ideas. If you reply, remove python-dev from the To: and Cc: lists, and add python-ideas instead. This speculative discussion was never appropriate for python-dev. //arry/ On 07/28/2017 03:11 PM, Rob Cliffe wrote: > > > On 28/07/2017 20:57, MRAB wrote: >> On 2017-07-28 10:17, Michel Desmoulin wrote: >>> elif break and elif None: I'd like that very much. It's weird a break >>> the semantic of break and None, but it's in such a dark corner of >>> Python >>> anyway I don't bother. >>> >> Surely it would not be "elif break", but "elif not break"? 
> To me, anything beginning with "else" or "elif" suggests an > alternative branch, not an additional one (YMMV): > if condition: > do_something > else: > do_something_completely_different > > Therefore I would find "if not break" or even "and if not break" more > intuitive. > Best wishes > Rob Cliffe > >> >>> Le 27/07/2017 ? 21:19, MRAB a ?crit : >>>> On 2017-07-27 03:34, Mike Miller wrote: >>>>> >>>>> >>>>> On 2017-07-26 16:36, MRAB wrote: >>>>>> "nobreak" would introduce a new keyword, but "not break" wouldn't. >>>>> >>>>> Whenever I've used the for-else, I've put a # no-break right next to >>>>> it, to >>>>> remind myself as much as anyone else. >>>>> >>>>> for...: not break: is the best alternative I've yet seen, >>>>> congrats. Perhaps in >>>>> Python 5 it can be enabled, with for-else: used instead for empty >>>>> iterables, as >>>>> that's what I expected the first few dozen times. >>>>> >>>> For empty iterables, how about "elif None:"? :-) >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: >> https://mail.python.org/mailman/options/python-dev/rob.cliffe%40btinternet.com >> >> >> --- >> This email has been checked for viruses by AVG. >> http://www.avg.com > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/larry%40hastings.org -------------- next part -------------- An HTML attachment was scrubbed... URL: