From J.Demeyer at UGent.be Mon Apr 1 01:31:03 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Mon, 1 Apr 2019 07:31:03 +0200 Subject: [Python-Dev] PEP 580/590 discussion In-Reply-To: <5C9FEF82.50207@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> Message-ID: <5CA1A217.1030007@UGent.be> I added benchmarks for PEP 590: https://gist.github.com/jdemeyer/f0d63be8f30dc34cc989cd11d43df248 From songofacandy at gmail.com Mon Apr 1 04:26:31 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Mon, 1 Apr 2019 17:26:31 +0900 Subject: [Python-Dev] Removing PendingDeprecationWarning In-Reply-To: References: <18ccdacf-8fc8-5130-b4ba-89df84e02987@python.org> Message-ID: On Sat, Mar 30, 2019 at 7:31 PM Nick Coghlan wrote: > > That's just a documentation fix: "If you're not sure whether to use > DeprecationWarning or PendingDeprecationWarning, use > DeprecationWarning". > Current proposed patch is: """ .. note:: PendingDeprecationWarning was introduced as an "ignored by default" version of DeprecationWarning. But :exc:`DeprecationWarning` is also ignored by default since Python 2.7 and 3.2. There is not much difference between PendingDeprecationWarning and DeprecationWarning nowadays. DeprecationWarning is recommended in general. """ https://github.com/python/cpython/pull/12505/files#diff-4d7187c7266c3f79727d358de3b3d228 -- Inada Naoki From steve.dower at python.org Mon Apr 1 12:12:26 2019 From: steve.dower at python.org (Steve Dower) Date: Mon, 1 Apr 2019 09:12:26 -0700 Subject: [Python-Dev] Strange umask(?)/st_mode issue In-Reply-To: References: <13f98061-6f64-2e8b-de66-d84a7be00a17@python.org> <20190330023947.GA62291@cskk.homeip.net> <55e6f051-179d-73b4-4cca-b91c5c81b498@python.org> <3210441d-6094-b53e-6bf7-4b7c4cfb16ea@python.org> Message-ID: On 30Mar2019 1130, Gregory P. Smith wrote: > I wouldn't expect it to be the case in a CI environment but I believe a > umask can be overridden if the filesystem is mounted and configured with > acls set?? (oh, hah, Ivan just said the same thing) Yep, it appears this is the case. The Pipelines team got back to me and it seems to be a known issue - the workaround they gave me was to run "sudo setfacl -Rb /home/vsts" at the start, so I've merged that in for now (to master and 3.7). Cheers, Steve From steve.dower at python.org Mon Apr 1 12:31:36 2019 From: steve.dower at python.org (Steve Dower) Date: Mon, 1 Apr 2019 09:31:36 -0700 Subject: [Python-Dev] PEP 578: Python Runtime Audit Hooks In-Reply-To: <8466c9c4-b5dc-c6c5-6fe4-a49dc2f4f968@python.org> References: <2cb3740e-ebc2-1839-1d2e-73d1b9f0a445@python.org> <8466c9c4-b5dc-c6c5-6fe4-a49dc2f4f968@python.org> Message-ID: On 31Mar2019 0538, Christian Heimes wrote: > I don't like the fact that the PEP requires users to learn and use an > additional layer to handle native code. Although we cannot provide a > fully secure hook for native code, we could at least try to provide a > best effort hook and document the limitations. A bit more information > would make the verified open function more useful, too. So instead they need to learn a significantly more complicated API? :) (I was very happy to be able to say "it's the same as open(p, 'rb')"). > PyObject *PyImport_OpenForExecution( > const char *path, > const char *intent, > int flags, > PyObject *context > ) > > - Path is an absolute (!) file path. The PEP doesn't specify if the file > name is relative or absolute. IMO it should be always absolute. 
Yeah, this is fair enough. I'll add it as a requirement. > - The new intent argument lets the caller pass information how it > intents to use the file, e.g. pythoncode, zipimport, nativecode (for > loading a shared library/DLL), ctypes, ... This allows the verify hook > to react on the intent and provide different verifications for e.g. > Python code and native modules. I had an intent argument at one point and the feedback I got (from teams who wanted to implement it) is that they wouldn't trust it anyway :) In each case there should be associated audit events for tracking the intent (and interrupting at that point if it doesn't like the intended action), but for the simple case of "let me open this specific file" it doesn't really add much. And it almost certainly shouldn't impact decision making. > - The flags argument is for additional flags, e.g. return an opened file > or None, open the file in text or binary mode, ... This just makes it harder for the hook implementer - now you have to allow encoding/errors arguments and probably more. And as mentioned above, there should be an audit event showing the intent before this call, and a hook can reject it at that point (rather than verify without actually returning the verified content). > - Context is an optional Python object from the caller's context. For > the import system, it could be the loader instance. I think the audit event covers this, unless you have some way of using this context in mind that I can't think of? Cheers, Steve From steve.dower at python.org Mon Apr 1 13:42:58 2019 From: steve.dower at python.org (Steve Dower) Date: Mon, 1 Apr 2019 10:42:58 -0700 Subject: [Python-Dev] PEP 578: Python Runtime Audit Hooks In-Reply-To: <6ded2c50-bf28-1376-7b0c-9cc6839be56b@python.org> References: <2cb3740e-ebc2-1839-1d2e-73d1b9f0a445@python.org> <6ded2c50-bf28-1376-7b0c-9cc6839be56b@python.org> Message-ID: <16bc5fae-47c0-e9f6-da60-e0e46cb84c78@python.org> On 30Mar2019 0913, Steve Dower wrote: > On 30Mar.2019 0747, Nick Coghlan wrote: >> I like this PEP in principle, but the specific "open_for_import" name >> bothers me a lot, as it implies that "importing" is the only situation >> where a file will be opened for code execution. >> >> If this part of the API were lower down the stack (e.g. >> "_io.open_for_code_execution") then I think it would make more sense - >> APIs like tokenize.open(), runpy.run_path(), PyRun_SimpleFile(), >> shelve, etc, could use that, without having to introduce a dependency >> on importlib to get access to the functionality. > > It was called "open_for_exec" at one point, though I forget exactly why > we changed it. But I have no problem with moving it. Something like this? > > PyImport_OpenForImport -> PyIO_OpenForExec > PyImport_SetOpenForImportHook -> PyIO_SetOpenForExecHook > importlib.util.open_for_import -> _io.open_for_exec > > Or more in line with Nick's suggestion: > > PyImport_OpenForImport -> PyIO_OpenExecutableCode > PyImport_SetOpenForImportHook -> PyIO_SetOpenExecutableCodeHook > importlib.util.open_for_import -> _io.open_executable_code > > I dropped "For", but I don't really care that much about the name. I'd > be okay dropping either "executable" or "code" as well - I don't really > have a good sense of which will make people more likely to use this > correctly. Looking at what we already have, perhaps putting it under "PyFile_OpenForExecute" would make the most sense? We don't currently have any public "PyIO" types or functions. 
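[For illustration: a minimal sketch of the runtime side of PEP 578 that this thread keeps referring to -- a C audit hook that can observe or reject audited operations. The PySys_AddAuditHook() and PySys_Audit() names are the ones proposed in the PEP draft; the exact hook signature and the event name used here are assumptions made for the example, not an excerpt from the PEP.]

    #include <Python.h>
    #include <string.h>

    /* Hedged sketch of a PEP 578 style audit hook. The hook receives the
     * event name, a tuple of event-specific arguments and the userData
     * pointer given at registration; returning non-zero with an exception
     * set is intended to abort the audited operation. */
    static int
    example_audit_hook(const char *event, PyObject *args, void *userData)
    {
        /* "example.open_for_exec" is a made-up event name for illustration. */
        if (strcmp(event, "example.open_for_exec") == 0) {
            PyErr_SetString(PyExc_RuntimeError,
                            "opening code for execution is not allowed here");
            return -1;
        }
        return 0;   /* allow everything else */
    }

    /* Typically registered once, early in the embedding application. */
    static int
    install_example_hook(void)
    {
        return PySys_AddAuditHook(example_audit_hook, NULL);
    }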
Bikeshedding now, but as I'm the only one really participating in it, I think it's allowed :) Cheers, Steve From cs at cskk.id.au Mon Apr 1 18:35:39 2019 From: cs at cskk.id.au (Cameron Simpson) Date: Tue, 2 Apr 2019 09:35:39 +1100 Subject: [Python-Dev] Strange umask(?)/st_mode issue In-Reply-To: References: Message-ID: <20190401223539.GA47505@cskk.homeip.net> On 01Apr2019 09:12, Steve Dower wrote: >On 30Mar2019 1130, Gregory P. Smith wrote: >>I wouldn't expect it to be the case in a CI environment but I >>believe a umask can be overridden if the filesystem is mounted and >>configured with acls set?? (oh, hah, Ivan just said the same thing) > >Yep, it appears this is the case. The Pipelines team got back to me >and it seems to be a known issue - the workaround they gave me was to >run "sudo setfacl -Rb /home/vsts" at the start, so I've merged that in >for now (to master and 3.7). Could that be done _without_ sudo to just the local directory containing the test tar file? If that works then you don't need any nasty privileged sudo use (which will just break on platforms without sudo anyway). Cheers, Cameron Simpson From steve.dower at python.org Mon Apr 1 18:44:13 2019 From: steve.dower at python.org (Steve Dower) Date: Mon, 1 Apr 2019 15:44:13 -0700 Subject: [Python-Dev] Strange umask(?)/st_mode issue In-Reply-To: <20190401223539.GA47505@cskk.homeip.net> References: <20190401223539.GA47505@cskk.homeip.net> Message-ID: On 01Apr2019 1535, Cameron Simpson wrote: > On 01Apr2019 09:12, Steve Dower wrote: >> On 30Mar2019 1130, Gregory P. Smith wrote: >>> I wouldn't expect it to be the case in a CI environment but I believe >>> a umask can be overridden if the filesystem is mounted and configured >>> with acls set?? (oh, hah, Ivan just said the same thing) >> >> Yep, it appears this is the case. The Pipelines team got back to me >> and it seems to be a known issue - the workaround they gave me was to >> run "sudo setfacl -Rb /home/vsts" at the start, so I've merged that in >> for now (to master and 3.7). > > Could that be done _without_ sudo to just the local directory containing > the test tar file? If that works then you don't need any nasty > privileged sudo use (which will just break on platforms without sudo > anyway). I tried something similar to that and it didn't work. My guess is it's to do with the actual mount point? (I also tried without sudo at first, and when I didn't work, I tried it with sudo. I hear that's how to decide whether you need it or not ;) ) In any case, it only applies to the Azure Pipelines build definition, so there aren't any other platforms where it'll be used. Cheers, Steve From cs at cskk.id.au Mon Apr 1 19:49:29 2019 From: cs at cskk.id.au (Cameron Simpson) Date: Tue, 2 Apr 2019 10:49:29 +1100 Subject: [Python-Dev] Strange umask(?)/st_mode issue In-Reply-To: References: Message-ID: <20190401234929.GA53667@cskk.homeip.net> On 01Apr2019 15:44, Steve Dower wrote: >On 01Apr2019 1535, Cameron Simpson wrote: >>On 01Apr2019 09:12, Steve Dower wrote: >>>On 30Mar2019 1130, Gregory P. Smith wrote: >>>>I wouldn't expect it to be the case in a CI environment but I >>>>believe a umask can be overridden if the filesystem is mounted >>>>and configured with acls set?? (oh, hah, Ivan just said the same >>>>thing) >>> >>>Yep, it appears this is the case. The Pipelines team got back to >>>me and it seems to be a known issue - the workaround they gave me >>>was to run "sudo setfacl -Rb /home/vsts" at the start, so I've >>>merged that in for now (to master and 3.7). 
>> >>Could that be done _without_ sudo to just the local directory >>containing the test tar file? If that works then you don't need any >>nasty privileged sudo use (which will just break on platforms >>without sudo anyway). > >I tried something similar to that and it didn't work. My guess is it's >to do with the actual mount point? (I also tried without sudo at >first, and when I didn't work, I tried it with sudo. I hear that's how >to decide whether you need it or not ;) ) > >In any case, it only applies to the Azure Pipelines build definition, >so there aren't any other platforms where it'll be used. Ok then. Curious: is the sudo now in the build setup? I'm just thinking that this isn't a tarfile specific fix but a "get correct POSIX umask semantics" fix, so it should apply to the entire environment. Or am I naive? Cheers, Cameron Simpson From greg at krypto.org Mon Apr 1 19:59:43 2019 From: greg at krypto.org (Gregory P. Smith) Date: Mon, 1 Apr 2019 16:59:43 -0700 Subject: [Python-Dev] Strange umask(?)/st_mode issue In-Reply-To: <20190401234929.GA53667@cskk.homeip.net> References: <20190401234929.GA53667@cskk.homeip.net> Message-ID: On Mon, Apr 1, 2019 at 4:49 PM Cameron Simpson wrote: > On 01Apr2019 15:44, Steve Dower wrote: > >On 01Apr2019 1535, Cameron Simpson wrote: > >>On 01Apr2019 09:12, Steve Dower wrote: > >>>On 30Mar2019 1130, Gregory P. Smith wrote: > >>>>I wouldn't expect it to be the case in a CI environment but I > >>>>believe a umask can be overridden if the filesystem is mounted > >>>>and configured with acls set? (oh, hah, Ivan just said the same > >>>>thing) > >>> > >>>Yep, it appears this is the case. The Pipelines team got back to > >>>me and it seems to be a known issue - the workaround they gave me > >>>was to run "sudo setfacl -Rb /home/vsts" at the start, so I've > >>>merged that in for now (to master and 3.7). > >> > >>Could that be done _without_ sudo to just the local directory > >>containing the test tar file? If that works then you don't need any > >>nasty privileged sudo use (which will just break on platforms > >>without sudo anyway). > > > >I tried something similar to that and it didn't work. My guess is it's > >to do with the actual mount point? (I also tried without sudo at > >first, and when I didn't work, I tried it with sudo. I hear that's how > >to decide whether you need it or not ;) ) > > > >In any case, it only applies to the Azure Pipelines build definition, > >so there aren't any other platforms where it'll be used. > > Ok then. > > Curious: is the sudo now in the build setup? I'm just thinking that this > isn't a tarfile specific fix but a "get correct POSIX umask semantics" > fix, so it should apply to the entire environment. > > Or am I naive? > I'm reading between the lines and assuming we're not the only user of their CI complaining about this environment change. ;) -------------- next part -------------- An HTML attachment was scrubbed... URL: From vano at mail.mipt.ru Mon Apr 1 23:14:00 2019 From: vano at mail.mipt.ru (Ivan Pozdeev) Date: Tue, 2 Apr 2019 06:14:00 +0300 Subject: [Python-Dev] Strange umask(?)/st_mode issue In-Reply-To: References: <20190401223539.GA47505@cskk.homeip.net> Message-ID: <67704a69-45aa-47dc-8b97-e18e886752bf@mail.mipt.ru> On 02.04.2019 1:44, Steve Dower wrote: > On 01Apr2019 1535, Cameron Simpson wrote: >> On 01Apr2019 09:12, Steve Dower wrote: >>> On 30Mar2019 1130, Gregory P. 
Smith wrote: >>>> I wouldn't expect it to be the case in a CI environment but I believe a umask can be overridden if the filesystem is mounted and >>>> configured with acls set? (oh, hah, Ivan just said the same thing) >>> >>> Yep, it appears this is the case. The Pipelines team got back to me and it seems to be a known issue - the workaround they gave me was >>> to run "sudo setfacl -Rb /home/vsts" at the start, so I've merged that in for now (to master and 3.7). >> >> Could that be done _without_ sudo to just the local directory containing the test tar file? If that works then you don't need any nasty >> privileged sudo use (which will just break on platforms without sudo anyway). > > I tried something similar to that and it didn't work. My guess is it's to do with the actual mount point? (I also tried without sudo at > first, and when I didn't work, I tried it with sudo. I hear that's how to decide whether you need it or not ;) ) > > In any case, it only applies to the Azure Pipelines build definition, so there aren't any other platforms where it'll be used. > https://github.com/python/cpython/pull/12655 > Cheers, > Steve > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/vano%40mail.mipt.ru -- Regards, Ivan From Peixing.Xin at windriver.com Tue Apr 2 05:46:14 2019 From: Peixing.Xin at windriver.com (Xin, Peixing) Date: Tue, 2 Apr 2019 09:46:14 +0000 Subject: [Python-Dev] =?windows-1252?q?how_to_rerun_the_job_=93Azure_Pipe?= =?windows-1252?q?lines_PR=94=3F?= Message-ID: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BAAF35@ALA-MBD.corp.ad.wrs.com> Hi, Experts: Anyone can tell how to rerun the job ?Azure Pipelines PR? for my PR? Sometimes my PR failed but this is caused by externals. The next day this external issue was fixed then I might want to rerun this specific job on my PR to get the new result. How can I reach this? [cid:image001.png at 01D4E97B.F67B1E20] Thanks, Peixing -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.png Type: image/png Size: 4252 bytes Desc: image001.png URL: From tir.karthi at gmail.com Tue Apr 2 08:22:56 2019 From: tir.karthi at gmail.com (Karthikeyan) Date: Tue, 2 Apr 2019 17:52:56 +0530 Subject: [Python-Dev] =?utf-8?q?how_to_rerun_the_job_=E2=80=9CAzure_Pipel?= =?utf-8?b?aW5lcyBQUuKAnT8=?= In-Reply-To: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BAAF35@ALA-MBD.corp.ad.wrs.com> References: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BAAF35@ALA-MBD.corp.ad.wrs.com> Message-ID: Closing and re-opening the PR will trigger the CI run again that might help in this case but it will run all the jobs. -- Regards, Karthikeyan S -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: image001.png Type: image/png Size: 4252 bytes Desc: not available URL: From cspealma at redhat.com Tue Apr 2 11:17:20 2019 From: cspealma at redhat.com (Calvin Spealman) Date: Tue, 2 Apr 2019 11:17:20 -0400 Subject: [Python-Dev] PEP-582 and multiple Python installations Message-ID: (I originally posted this to python-ideas, where I was told none of this PEP's authors subscribe so probably no one will see it there, so I'm posting it here to raise the issue where it can get seen and hopefully discussed) While the PEP does show the version number as part of the path to the actual packages, implying support for multiple versions, this doesn't seem to be spelled out in the actual text. Presumably __pypackages__/3.8/ might sit beside __pypackages__/3.9/, etc. to keep future versions capable of installing packages for each version, the way virtualenv today is bound to one version of Python. I'd like to raise a potential edge case that might be a problem, and likely an increasingly common one: users with multiple installations of the *same* version of Python. This is actually a common setup for Windows users who use WSL, Microsoft's Linux-on-Windows solution, as you could have both the Windows and Linux builds of a given Python version installed on the same machine. The currently implied support for multiple versions would not be able to separate these and could create problems if users pip install a Windows binary package through Powershell and then try to run a script in Bash from the same directory, causing the Linux version of Python to try to use Windows python packages. I'm not actually sure what the solution here is. Mostly I wanted to raise the concern, because I'm very keen on WSL being a great entry path for new developers and I want to make that a better experience, not a more confusing one. Maybe that version number could include some other unique identify, maybe based on Python's own executable. A hash maybe? I don't know if anything like that already exists to uniquely identify a Python build or installation. -- CALVIN SPEALMAN SENIOR QUALITY ENGINEER cspealma at redhat.com M: +1.336.210.5107 TRIED. TESTED. TRUSTED. -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve.dower at python.org Tue Apr 2 12:09:30 2019 From: steve.dower at python.org (Steve Dower) Date: Tue, 2 Apr 2019 09:09:30 -0700 Subject: [Python-Dev] =?utf-8?q?how_to_rerun_the_job_=E2=80=9CAzure_Pipel?= =?utf-8?b?aW5lcyBQUuKAnT8=?= In-Reply-To: References: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BAAF35@ALA-MBD.corp.ad.wrs.com> Message-ID: <05055b7d-922f-2b93-4425-4068564313ad@python.org> On 02Apr2019 0522, Karthikeyan wrote: > Closing and re-opening the PR will trigger the CI run again that might > help in this case but it will run all the jobs. Yes, I believe this is still the best way to re-run Pipelines jobs. For people with logins (not yet everyone in the GitHub org, but I hear that's coming) you can requeue the build, but last time I tried it didn't sync back to the pull request properly (I think it needs GitHub to cooperate, which is why triggering it from GitHub works best.) The Pipelines team is aware of this and working on it, so I expect the integration to improve over time. For now, close/reopen the PR. 
Cheers, Steve From steve.dower at python.org Tue Apr 2 12:10:59 2019 From: steve.dower at python.org (Steve Dower) Date: Tue, 2 Apr 2019 09:10:59 -0700 Subject: [Python-Dev] PEP-582 and multiple Python installations In-Reply-To: References: Message-ID: <2b889555-db6f-6c69-0347-ebb89d6fec21@python.org> On 02Apr2019 0817, Calvin Spealman wrote: > (I originally posted this to python-ideas, where I was told none of this > PEP's authors subscribe so probably no one will see it there, so I'm > posting it here to raise the issue where it can get seen and hopefully > discussed) Correct, thanks for posting. (I thought we had a "discussions-to" tag with distutils-sig on it, but apparently not.) > While the PEP does show the version number as part of the path to the > actual packages, implying support for multiple versions, this doesn't > seem to be spelled out in the actual text. Presumably > __pypackages__/3.8/ might sit beside __pypackages__/3.9/, etc. to keep > future versions capable of installing packages for each version, the way > virtualenv today is bound to one version of Python. > > I'd like to raise a potential edge case that might be a problem, and > likely an increasingly common one: users with multiple installations of > the *same* version of Python. This is actually a common setup for > Windows users who use WSL, Microsoft's Linux-on-Windows solution, as you > could have both the Windows and Linux builds of a given Python version > installed on the same machine. The currently implied support for > multiple versions would not be able to separate these and could create > problems if users pip install a Windows binary package through > Powershell and then try to run a script in Bash from the same directory, > causing the Linux version of Python to try to use Windows python packages. > > I'm not actually sure what the solution here is. Mostly I wanted to > raise the concern, because I'm very keen on WSL being a great entry path > for new developers and I want to make that a better experience, not a > more confusing one. Maybe that version number could include some other > unique identify, maybe based on Python's own executable. A hash maybe? I > don't know if anything like that already exists to uniquely identify a > Python build or installation. Yes, this is a situation we're aware of, and it's caught in the conflict of "who is this feature meant to support". Since all platforms have a unique extension module suffix (e.g. "module.cp38-win32.pyd"), it would be possible to support this with "fat" packages that include all binaries (or some clever way of merging wheels for multiple platforms). And since this is already in CPython itself, it leads to about the only reasonable solution - instead of "3.8", use the extension module suffix "cp38-win32". (Wheel tags are not in core CPython, so we can't use those.) But while this seems obvious, it also reintroduces problems that this has the potential to fix - suddenly, just like installing into your global environment, your packages are not project-specific anymore but are Python-specific. Which is one of the major confusions people run into ("I pip installed X but now can't import it in python"). So the main points of discussion right now are "whose problem does this solve" and "when do we tell people they need a full venv". 
And that discussion is mostly happening at https://discuss.python.org/t/pep-582-python-local-packages-directory/963/ Cheers, Steve From pviktori at redhat.com Tue Apr 2 08:49:56 2019 From: pviktori at redhat.com (Petr Viktorin) Date: Tue, 2 Apr 2019 14:49:56 +0200 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <5C9FEF82.50207@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> Message-ID: On 3/30/19 11:36 PM, Jeroen Demeyer wrote: > On 2019-03-30 17:30, Mark Shannon wrote: >> 2. The claim that PEP 580 allows "certain optimizations because other >> code can make assumptions" is flawed. In general, the caller cannot make >> assumptions about the callee or vice-versa. Python is a dynamic language. > > PEP 580 is meant for extension classes, not Python classes. Extension > classes are not dynamic. When you implement tp_call in a given way, the > user cannot change it. So if a class implements the C call protocol or > the vectorcall protocol, callers can make assumptions about what that > means. > >> PEP 579 is mainly a list of supposed flaws with the >> 'builtin_function_or_method' class. >> The general thrust of PEP 579 seems to be that builtin-functions and >> builtin-methods should be more flexible and extensible than they are. I >> don't agree. If you want different behaviour, then use a different >> object. Don't try an cram all this extra behaviour into a pre-existing >> object. > > I think that there is a misunderstanding here. I fully agree with the > "use a different object" solution. This isn't a new solution: it's > already possible to implement those different objects (Cython does it). > It's just that this solution comes at a performance cost and that's what > we want to avoid. It does seem like there is some misunderstanding. PEP 580 defines a CCall structure, which includes the function pointer, flags, "self" and "parent". Like the current implementation, it has various METH_ flags for various C signatures. When called, the info from CCall is matched up (in relatively complex ways) to what the C function expects. PEP 590 only adds the "vectorcall". It does away with flags and only has one C signatures, which is designed to fit all the existing ones, and is well optimized. Storing the "self"/"parent", and making sure they're passed to the C function is the responsibility of the callable object. There's an optimization for "self" (offsetting using PY_VECTORCALL_ARGUMENTS_OFFSET), and any supporting info can be provided as part of "self". >> I'll reiterate that PEP 590 is more general than PEP 580 and that once >> the callable's code has access to the callable object (as both PEPs >> allow) then anything is possible. You can't can get more extensible than >> that. Anything is possible, but if one of the possibilities becomes common and useful, PEP 590 would make it hard to optimize for it. Python has grown many "METH_*" signatures over the years as we found more things that need to be passed to callables. Why would "METH_VECTORCALL" be the last? If it won't (if you think about it as one more way to call functions), then dedicating a tp_* slot to it sounds quite expensive. 
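[For reference while comparing the two designs: a minimal sketch of the single C signature that PEP 590 builds on. This is a paraphrase of the draft rather than an excerpt, so the exact spelling -- including whether the args array is const, which is questioned below -- may differ.]

    #include <Python.h>

    /* Sketch of PEP 590's "vectorcall" convention. nargsf carries the number
     * of positional arguments and may have PY_VECTORCALL_ARGUMENTS_OFFSET
     * or-ed in, meaning the callee may temporarily overwrite args[-1] (for
     * example to prepend "self" without copying the argument array).
     * kwnames is NULL or a tuple of keyword-argument names; the corresponding
     * values follow the positional arguments in args. */
    typedef PyObject *(*vectorcallfunc)(PyObject *callable,
                                        PyObject *const *args,
                                        size_t nargsf,
                                        PyObject *kwnames);

    /* A callee recovers the plain positional count with something like: */
    #define EXAMPLE_VECTORCALL_NARGS(n) \
        ((Py_ssize_t)((n) & ~PY_VECTORCALL_ARGUMENTS_OFFSET))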
In one of the ways to call C functions in PEP 580, the function gets access to: - the arguments, - "self", the object - the class that the method was found in (which is not necessarily type(self)) I still have to read the details, but when combined with LOAD_METHOD/CALL_METHOD optimization (avoiding creation of a "bound method" object), it seems impossible to do this efficiently with just the callable's code and callable's object. > I would argue the opposite: PEP 590 defines a fixed protocol that is not > easy to extend. PEP 580 on the other hand uses a new data structure > PyCCallDef which could easily be extended in the future (this will > intentionally never be part of the stable ABI, so we can do that). > > I have also argued before that the generality of PEP 590 is a bad thing > rather than a good thing: by defining a more rigid protocol as in PEP > 580, more optimizations are possible. > >> PEP 580 has the same limitation for the same reasons. The limitation is >> necessary for correctness if an object supports calls via `__call__` and >> through another calling convention. > > I don't think that this limitation is needed in either PEP. As I > explained at the top of this email, it can easily be solved by not using > the protocol for Python classes. What is wrong with my proposal in PEP > 580: https://www.python.org/dev/peps/pep-0580/#inheritance I'll add Jeroen's notes from the review of the proposed PEP 590 (https://github.com/python/peps/pull/960): The statement "PEP 580 is specifically targetted at function-like objects, and doesn't support other callables like classes, partial functions, or proxies" is factually false. The motivation for PEP 580 is certainly function/method-like objects but it's a general protocol that every class can implement. For certain classes, it may not be easy or desirable to do that but it's always possible. Given that `PY_METHOD_DESCRIPTOR` is a flag for tp_flags, shouldn't it be called `Py_TPFLAGS_METHOD_DESCRIPTOR` or something? Py_TPFLAGS_HAVE_VECTOR_CALL should be Py_TPFLAGS_HAVE_VECTORCALL, to be consistent with tp_vectorcall_offset and other uses of "vectorcall" (not "vector call") And mine, so far: I'm not clear on the constness of the "args" array. If it is mutable (PyObject **), you can't, for example, directly pass a tuple's storage (or any other array that could be used in the call). If it is not (PyObject * const *), you can't insert the "self" argument in. The reference implementations seems to be inconsistent here. What's the intention? From mark at hotpy.org Tue Apr 2 15:38:23 2019 From: mark at hotpy.org (Mark Shannon) Date: Tue, 2 Apr 2019 20:38:23 +0100 Subject: [Python-Dev] PEP 580/590 discussion In-Reply-To: <5CA1A217.1030007@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <5CA1A217.1030007@UGent.be> Message-ID: Hi, On 01/04/2019 6:31 am, Jeroen Demeyer wrote: > I added benchmarks for PEP 590: > > https://gist.github.com/jdemeyer/f0d63be8f30dc34cc989cd11d43df248 Thanks. As expected for calls to C function for both PEPs and master perform about the same, as they are using almost the same calling convention under the hood. 
As an example of the advantage that a general fast calling convention gives you, I have implemented the vectorcall versions of list() and range() https://github.com/markshannon/cpython/compare/vectorcall-minimal...markshannon:vectorcall-examples Which gives a roughly 30% reduction in time for creating ranges, or lists from small tuples. https://gist.github.com/markshannon/5cef3a74369391f6ef937d52cca9bfc8 Cheers, Mark. From mark at hotpy.org Tue Apr 2 17:12:11 2019 From: mark at hotpy.org (Mark Shannon) Date: Tue, 2 Apr 2019 22:12:11 +0100 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> Message-ID: <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> Hi, On 02/04/2019 1:49 pm, Petr Viktorin wrote: > On 3/30/19 11:36 PM, Jeroen Demeyer wrote: >> On 2019-03-30 17:30, Mark Shannon wrote: >>> 2. The claim that PEP 580 allows "certain optimizations because other >>> code can make assumptions" is flawed. In general, the caller cannot make >>> assumptions about the callee or vice-versa. Python is a dynamic >>> language. >> >> PEP 580 is meant for extension classes, not Python classes. Extension >> classes are not dynamic. When you implement tp_call in a given way, >> the user cannot change it. So if a class implements the C call >> protocol or the vectorcall protocol, callers can make assumptions >> about what that means. >> >>> PEP 579 is mainly a list of supposed flaws with the >>> 'builtin_function_or_method' class. >>> The general thrust of PEP 579 seems to be that builtin-functions and >>> builtin-methods should be more flexible and extensible than they are. I >>> don't agree. If you want different behaviour, then use a different >>> object. Don't try an cram all this extra behaviour into a pre-existing >>> object. >> >> I think that there is a misunderstanding here. I fully agree with the >> "use a different object" solution. This isn't a new solution: it's >> already possible to implement those different objects (Cython does >> it). It's just that this solution comes at a performance cost and >> that's what we want to avoid. > > It does seem like there is some misunderstanding. > > PEP 580 defines a CCall structure, which includes the function pointer, > flags, "self" and "parent". Like the current implementation, it has > various METH_ flags for various C signatures. When called, the info from > CCall is matched up (in relatively complex ways) to what the C function > expects. > > PEP 590 only adds the "vectorcall". It does away with flags and only has > one C signatures, which is designed to fit all the existing ones, and is > well optimized. Storing the "self"/"parent", and making sure they're > passed to the C function is the responsibility of the callable object. > There's an optimization for "self" (offsetting using > PY_VECTORCALL_ARGUMENTS_OFFSET), and any supporting info can be provided > as part of "self". > >>> I'll reiterate that PEP 590 is more general than PEP 580 and that once >>> the callable's code has access to the callable object (as both PEPs >>> allow) then anything is possible. You can't can get more extensible than >>> that. > > Anything is possible, but if one of the possibilities becomes common and > useful, PEP 590 would make it hard to optimize for it. > Python has grown many "METH_*" signatures over the years as we found > more things that need to be passed to callables. Why would > "METH_VECTORCALL" be the last? 
If it won't (if you think about it as one > more way to call functions), then dedicating a tp_* slot to it sounds > quite expensive. I doubt METH_VECTORCALL will be the last. Let me give you an example: It is quite common for a function to take two arguments, so we might want add a METH_OO flag for builtin-functions with 2 parameters. To support this in PEP 590, you would make exactly the same change as you would now; which is to add another case to the switch statement in _PyCFunction_FastCallKeywords. For PEP 580, you would add another case to the switch in PyCCall_FastCall. No difference really. PEP 580 uses a slot as well. It's only 8 bytes per class. > > > In one of the ways to call C functions in PEP 580, the function gets > access to: > - the arguments, > - "self", the object > - the class that the method was found in (which is not necessarily > type(self)) > I still have to read the details, but when combined with > LOAD_METHOD/CALL_METHOD optimization (avoiding creation of a "bound > method" object), it seems impossible to do this efficiently with just > the callable's code and callable's object. It is possible, and relatively straightforward. Why do you think it is impossible? > > >> I would argue the opposite: PEP 590 defines a fixed protocol that is >> not easy to extend. PEP 580 on the other hand uses a new data >> structure PyCCallDef which could easily be extended in the future >> (this will intentionally never be part of the stable ABI, so we can do >> that). >> >> I have also argued before that the generality of PEP 590 is a bad >> thing rather than a good thing: by defining a more rigid protocol as >> in PEP 580, more optimizations are possible. >> >>> PEP 580 has the same limitation for the same reasons. The limitation is >>> necessary for correctness if an object supports calls via `__call__` and >>> through another calling convention. >> >> I don't think that this limitation is needed in either PEP. As I >> explained at the top of this email, it can easily be solved by not >> using the protocol for Python classes. What is wrong with my proposal >> in PEP 580: https://www.python.org/dev/peps/pep-0580/#inheritance > > > I'll add Jeroen's notes from the review of the proposed PEP 590 > (https://github.com/python/peps/pull/960): > > The statement "PEP 580 is specifically targetted at function-like > objects, and doesn't support other callables like classes, partial > functions, or proxies" is factually false. The motivation for PEP 580 is > certainly function/method-like objects but it's a general protocol that > every class can implement. For certain classes, it may not be easy or > desirable to do that but it's always possible. > > Given that `PY_METHOD_DESCRIPTOR` is a flag for tp_flags, shouldn't it > be called `Py_TPFLAGS_METHOD_DESCRIPTOR` or something? > > Py_TPFLAGS_HAVE_VECTOR_CALL should be Py_TPFLAGS_HAVE_VECTORCALL, to be > consistent with tp_vectorcall_offset and other uses of "vectorcall" (not > "vector call") > Thanks for the comments, I'll update the PEP when I get the chance. > > And mine, so far: > > I'm not clear on the constness of the "args" array. > If it is mutable (PyObject **), you can't, for example, directly pass a > tuple's storage (or any other array that could be used in the call). > If it is not (PyObject * const *), you can't insert the "self" argument in. > The reference implementations seems to be inconsistent here. What's the > intention? > I'll make it clearer in the PEP. 
My thinking was that if `PY_VECTORCALL_ARGUMENTS_OFFSET` is set then the caller is allowing the callee to mutate element -1. It would make sense to generalise that to any element of the vector (including -1). When passing the contents of a tuple, `PY_VECTORCALL_ARGUMENTS_OFFSET` should not be set, and thus the vector could not be mutated. Cheers, Mark. From J.Demeyer at UGent.be Wed Apr 3 01:33:49 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Wed, 3 Apr 2019 07:33:49 +0200 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> Message-ID: <5CA445BD.4040705@UGent.be> >> In one of the ways to call C functions in PEP 580, the function gets >> access to: >> - the arguments, >> - "self", the object >> - the class that the method was found in (which is not necessarily >> type(self)) >> I still have to read the details, but when combined with >> LOAD_METHOD/CALL_METHOD optimization (avoiding creation of a "bound >> method" object), it seems impossible to do this efficiently with just >> the callable's code and callable's object. > > It is possible, and relatively straightforward. Access to the class isn't possible currently and also not with PEP 590. But it's easy enough to fix that: PEP 573 adds a new METH_METHOD flag to change the signature of the C function (not the vectorcall wrapper). PEP 580 supports this "out of the box" because I'm reusing the class also to do type checks. But this shouldn't be an argument for or against either PEP. From J.Demeyer at UGent.be Wed Apr 3 01:43:28 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Wed, 3 Apr 2019 07:43:28 +0200 Subject: [Python-Dev] PEP 580/590 discussion In-Reply-To: References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <5CA1A217.1030007@UGent.be> Message-ID: <5CA44800.9050901@UGent.be> On 2019-04-02 21:38, Mark Shannon wrote: > Hi, > > On 01/04/2019 6:31 am, Jeroen Demeyer wrote: >> I added benchmarks for PEP 590: >> >> https://gist.github.com/jdemeyer/f0d63be8f30dc34cc989cd11d43df248 > > Thanks. As expected for calls to C function for both PEPs and master > perform about the same, as they are using almost the same calling > convention under the hood. While they are "about the same", in general PEP 580 is slightly faster than master and PEP 590. And PEP 590 actually has a minor slow-down for METH_VARARGS calls. I think that this happens because PEP 580 has less levels of indirection than PEP 590. The vectorcall protocol (PEP 590) changes a slower level (tp_call) by a faster level (vectorcall), while PEP 580 just removes that level entirely: it calls the C function directly. This shows that PEP 580 is really meant to have maximal performance in all cases, accidentally even making existing code faster. Jeroen. From J.Demeyer at UGent.be Wed Apr 3 11:41:06 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Wed, 3 Apr 2019 17:41:06 +0200 Subject: [Python-Dev] PEP 590 vs. 
bpo-29259 In-Reply-To: <5CA445BD.4040705@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <5CA445BD.4040705@UGent.be> Message-ID: <5CA4D412.7050906@UGent.be> As I'm reading the PEP 590 reference implementation, it strikes me how similar it is to https://bugs.python.org/issue29259 The main difference is that bpo-29259 has a per-class pointer tp_fastcall instead of a per-object pointer. But actually, the PEP 590 reference implementation does not make much use of the per-object pointer: for all classes except "type", the vectorcall wrapper is the same for all objects of a given type. One thing that bpo-29259 did not realize is that existing optimizations could be dropped in favor of using tp_fastcall. For example, bpo-29259 has code like if (PyFunction_Check(callable)) { return _PyFunction_FastCallKeywords(...); } if (PyCFunction_Check(callable)) { return _PyCFunction_FastCallKeywords(...); } else if (PyType_HasFeature(..., Py_TPFLAGS_HAVE_FASTCALL) ...) but the first 2 branches are superfluous given the third. Anyway, this is just putting PEP 590 a bit in perspective. It doesn't say anything about the merits of PEP 590. Jeroen. From J.Demeyer at UGent.be Thu Apr 4 07:51:40 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Thu, 4 Apr 2019 13:51:40 +0200 Subject: [Python-Dev] Deprecating "instance method" class Message-ID: <5CA5EFCC.2030400@UGent.be> During my investigations related to low-level function/method classes, I came across the "instance method" class. There is a C API for it: https://docs.python.org/3.7/c-api/method.html However, it's not used/exposed anywhere in CPython, except as _testcapi.instancemethod (for testing its functionality) This class was discussed at https://mail.python.org/pipermail/python-3000/2007-December/011456.html and implemented in https://bugs.python.org/issue1587 Reading that old thread, there are use cases presented related to classic classes, wrapping Kogut (http://kokogut.sourceforge.net/kogut.html) objects and Pyrex. But classic classes no longer exist and the latter two use cases aren't actually needed if you read the thread to the end. So there are no surviving use cases from that thread. Does anybody know actual use cases or any code in the wild using it? To me, the fact that it's only exposed in the C API is a good sign that it's not really useful. So, should we deprecate the instance method class? Jeroen. From christian at python.org Thu Apr 4 08:09:44 2019 From: christian at python.org (Christian Heimes) Date: Thu, 4 Apr 2019 14:09:44 +0200 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: <5CA5EFCC.2030400@UGent.be> References: <5CA5EFCC.2030400@UGent.be> Message-ID: On 04/04/2019 13.51, Jeroen Demeyer wrote: > During my investigations related to low-level function/method classes, I > came across the "instance method" class. There is a C API for it: > https://docs.python.org/3.7/c-api/method.html > However, it's not used/exposed anywhere in CPython, except as > _testcapi.instancemethod (for testing its functionality) > > This class was discussed at > https://mail.python.org/pipermail/python-3000/2007-December/011456.html > and implemented in https://bugs.python.org/issue1587 > Reading that old thread, there are use cases presented related to > classic classes, wrapping Kogut > (http://kokogut.sourceforge.net/kogut.html) objects and Pyrex. 
But > classic classes no longer exist and the latter two use cases aren't > actually needed if you read the thread to the end. So there are no > surviving use cases from that thread. > > Does anybody know actual use cases or any code in the wild using it? To > me, the fact that it's only exposed in the C API is a good sign that > it's not really useful. You are drawing the wrong conclusion here. The feature was explicitly designed for C code and C API wrappers like swig and Cython to make adaption to Python 3 simpler. I implemented it when I removed unbound methods. > So, should we deprecate the instance method class? I couldn't find any current code that uses PyInstanceMethod_New. Let's deprecate the feature and schedule it for removal in 3.10. Christian From J.Demeyer at UGent.be Thu Apr 4 09:45:03 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Thu, 4 Apr 2019 15:45:03 +0200 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: References: <5CA5EFCC.2030400@UGent.be> Message-ID: <5CA60A5F.7060406@UGent.be> On 2019-04-04 14:09, Christian Heimes wrote: > I couldn't find any current code that uses PyInstanceMethod_New. Let's > deprecate the feature and schedule it for removal in 3.10. Done at https://github.com/python/cpython/pull/12685 From chris.barker at noaa.gov Thu Apr 4 12:02:00 2019 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Thu, 4 Apr 2019 09:02:00 -0700 Subject: [Python-Dev] PEP-582 and multiple Python installations In-Reply-To: References: Message-ID: > I'd like to raise a potential edge case that might be a problem, and likely an increasingly common one: users with multiple installations of the *same* version of Python. I would suggest that that use case is best addressed by a system that isolates the entire python environment, such as conda. > This is actually a common setup for Windows users who use WSL, Microsoft's Linux-on-Windows solution, as you could have both the Windows and Linux builds of a given Python version installed on the same machine. Sure, but Isn't the WSL subsystem pretty isolated already? Would native Windows and WSL users be running in the same dir? That being said, I'm pretty skeptical of the PEP -- I understand the motivation -- I make a point of avoiding virtual environments in my intro classes, but at some point folks will need to learn them. I've had students think that virtualenv was a part of (or required by) e.g. flask, because the tutorials include it in the setup. But I think environments really need to be more distinct, not less, I'm quite concerned about mingling them in one place. Maybe I'm reading it wrong, but it seems that this could create serious clashes with other "environment" systems, such as conda. I suppose one could say: "don't do that" -- I.e. don't create a __pypackages__ dir if you are going to use conda -- but many folks want the same source to be runnable in multiple "styles" of Python. Also, I see a major benefit for teaching, but it does go a bit against my philosophy of not hiding important details from newbies -- that is, don't teach using an approach that is not suitable for production. And newbies could be really confused by the fact that pip installs stuff differently depending on what dir they are in and what is in that dir. The PEP is listed as a draft -- anyone know what's going on with it?
-CHB From chris.barker at noaa.gov Thu Apr 4 18:54:46 2019 From: chris.barker at noaa.gov (Chris Barker) Date: Thu, 4 Apr 2019 15:54:46 -0700 Subject: [Python-Dev] PEP-582 and multiple Python installations In-Reply-To: References: Message-ID: Sorry somehow missed Steve Dower's post: that discussion is mostly happening at https://discuss.python.org/t/pep-582-python-local-packages-directory/963/ I"ll go there to comment. -CHB On Thu, Apr 4, 2019 at 9:02 AM Chris Barker - NOAA Federal < chris.barker at noaa.gov> wrote: > > > I'd like to raise a potential edge case that might be a problem, and > likely an increasingly common one: users with multiple installations of the > *same* version of Python. > > I would suggest that that use case is best addressed by a system that > isolates the entire python environment, such as conda. > > > This is actually a common setup for Windows users who use WSL, > Microsoft's Linux-on-Windows solution, as you could have both the Windows > and Linux builds of a given Python version installed on the same machine. > > Sure, but Isn?t the WSL subsystem pretty isolated already? Would native > Windows and WSL users be running in the same dir? > > That being said, I?m pretty skeptical of the PEP ? I understand the > motivation ? I make a point of avoiding virtual environments in my intro > classes, but at some point folks will need to learn them. > > I?ve had students think that virtualenv was a part of (or required by) > e.g. flask, because the tutorials include it in the setup. > > But I think environments really need to be more distinct, not less, I?m > quite concerned about mingling them in one place. > > Maybe I?m reading it wrong, but it seems that this could create serious > clashes with other ?environment? systems, such as conda. > > I suppose one could say: ?don?t do that? ? I.e. don?t create a > __pypackages__ dir if you are going to use conda ? but many folks want the > same source to be runnable in multiple ?styles? of Python. > > Also, I see a major benefit for teaching, but it does go a bit against my > philosophy of not hiding important details from newbies ? that is, don?t > teach using an approach that is not suitable for production. > > And newbies could be really confused by the fact that pip installs stuff > differently depending on what dir they are in and what is in that dir. > > The PEP is listed as a draft ? anyone know what?s going on with it? > > -CHB > > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg.ewing at canterbury.ac.nz Thu Apr 4 18:57:00 2019 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 05 Apr 2019 11:57:00 +1300 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: References: <5CA5EFCC.2030400@UGent.be> Message-ID: <5CA68BBC.8060205@canterbury.ac.nz> Christian Heimes wrote: > I couldn't find any current code that uses PyInstanceMethod_New. Let's > deprecate the feature and schedule it for removal in 3.10. If it's designed for use by things outside of CPython, how can you be sure nothing is using it? 
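[For concreteness, since the question is what this class is actually for: a minimal sketch of how an extension module can use the documented C API (the c-api/method page linked earlier in the thread) to make a plain C function bind like a Python method. The PyCFunction_New() + PyInstanceMethod_New() combination is the one also mentioned later in this thread; the function and attribute names below are invented for the example.]

    #include <Python.h>

    static PyObject *
    example_method(PyObject *self, PyObject *args)
    {
        /* ... an ordinary METH_VARARGS implementation ... */
        Py_RETURN_NONE;
    }

    static PyMethodDef example_method_def = {
        "example_method", example_method, METH_VARARGS, NULL
    };

    /* Add the function to a type's dict so that it binds "self" implicitly.
     * A bare PyCFunction in a class dict behaves like a staticmethod; the
     * instancemethod wrapper turns it into a descriptor that passes the
     * instance as the first argument. */
    static int
    add_bound_method(PyObject *type_dict)
    {
        PyObject *func = PyCFunction_New(&example_method_def, NULL);
        if (func == NULL) {
            return -1;
        }
        PyObject *meth = PyInstanceMethod_New(func);
        Py_DECREF(func);
        if (meth == NULL) {
            return -1;
        }
        int res = PyDict_SetItemString(type_dict, "example_method", meth);
        Py_DECREF(meth);
        return res;
    }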
-- Greg From J.Demeyer at UGent.be Fri Apr 5 02:07:25 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Fri, 5 Apr 2019 08:07:25 +0200 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: <5CA68BBC.8060205@canterbury.ac.nz> References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> Message-ID: <5CA6F09D.3000900@UGent.be> On 2019-04-05 00:57, Greg Ewing wrote: > If it's designed for use by things outside of CPython, how > can you be sure nothing is using it? Of course I'm not sure. However: 1. So far, nobody in this thread knows of any code using it. 2. So far, nobody in this thread knows any use case for it. And if we end up deprecating and it was a mistake, we can easily revert the deprecation. From storchaka at gmail.com Fri Apr 5 08:10:54 2019 From: storchaka at gmail.com (Serhiy Storchaka) Date: Fri, 5 Apr 2019 15:10:54 +0300 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: <5CA6F09D.3000900@UGent.be> References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> Message-ID: 05.04.19 09:07, Jeroen Demeyer ????: > On 2019-04-05 00:57, Greg Ewing wrote: >> If it's designed for use by things outside of CPython, how >> can you be sure nothing is using it? > > Of course I'm not sure. However: > > 1. So far, nobody in this thread knows of any code using it. > > 2. So far, nobody in this thread knows any use case for it. > > And if we end up deprecating and it was a mistake, we can easily revert > the deprecation. I have a use case. I did not know this before, but it can be used to implement accelerated versions of separate methods instead of the whole class. I'm going to use it to further optimize total_ordering. Thanks Josh Rosenberg for the tip. From J.Demeyer at UGent.be Fri Apr 5 07:27:17 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Fri, 5 Apr 2019 13:27:17 +0200 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> Message-ID: <5CA73B95.6040509@UGent.be> On 2019-04-05 14:10, Serhiy Storchaka wrote: > it can be used to > implement accelerated versions of separate methods instead of the whole > class. Could you elaborate? I'm curious what you mean. > I'm going to use it to further optimize total_ordering. There are so many ways in which total_ordering is inefficient. If you really want it to be efficient, you should just implement it in C. From storchaka at gmail.com Fri Apr 5 09:13:26 2019 From: storchaka at gmail.com (Serhiy Storchaka) Date: Fri, 5 Apr 2019 16:13:26 +0300 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: <5CA73B95.6040509@UGent.be> References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> Message-ID: 05.04.19 14:27, Jeroen Demeyer ????: > On 2019-04-05 14:10, Serhiy Storchaka wrote: >> it can be used to >> implement accelerated versions of separate methods instead of the whole >> class. > > Could you elaborate? I'm curious what you mean. It is easy to implement a function in C. But there is a difference between functions implemented in Python and C -- the latter are not descriptors. They behave like static methods when assigned to a class attribute, i.e. there is no implicit passing of the "self" argument. >> I'm going to use it to further optimize total_ordering. > > There are so many ways in which total_ordering is inefficient. 
If you > really want it to be efficient, you should just implement it in C. Yes, this is what I want to do. I did not do this only because implementing method-like functions which which do not belong to concrete class implemented in C is not convention. But PyInstanceMethod_New() should help. From J.Demeyer at UGent.be Fri Apr 5 08:33:29 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Fri, 5 Apr 2019 14:33:29 +0200 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> Message-ID: <5CA74B19.70806@UGent.be> On 2019-04-05 15:13, Serhiy Storchaka wrote: > It is easy to implement a function in C. Why does it need to be a PyCFunction? You could put an actual method descriptor in the class. In other words, use PyDescr_NewMethod() instead of PyCFunction_New() + PyInstanceMethod_New(). It's probably going to be faster too since the instancemethod adds an unoptimized extra level of indirection. > Yes, this is what I want to do. I did not do this only because > implementing method-like functions which which do not belong to concrete > class implemented in C is not convention. Sure, you could implement separate methods like __gt__ in C, but that's still less efficient than just implementing a specific tp_richcompare for total_ordering and then having the usual wrapper descriptors for __gt__. From guido at python.org Fri Apr 5 11:46:00 2019 From: guido at python.org (Guido van Rossum) Date: Fri, 5 Apr 2019 08:46:00 -0700 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: <5CA74B19.70806@UGent.be> References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> Message-ID: Let's stop here. This API is doing no harm, it's not a maintenance burden, clearly *some* folks have a use for it. Let's just keep it, okay? There are bigger fish to fry. On Fri, Apr 5, 2019 at 5:36 AM Jeroen Demeyer wrote: > On 2019-04-05 15:13, Serhiy Storchaka wrote: > > It is easy to implement a function in C. > > Why does it need to be a PyCFunction? You could put an actual method > descriptor in the class. In other words, use PyDescr_NewMethod() instead > of PyCFunction_New() + PyInstanceMethod_New(). It's probably going to be > faster too since the instancemethod adds an unoptimized extra level of > indirection. > > > Yes, this is what I want to do. I did not do this only because > > implementing method-like functions which which do not belong to concrete > > class implemented in C is not convention. > > Sure, you could implement separate methods like __gt__ in C, but that's > still less efficient than just implementing a specific tp_richcompare > for total_ordering and then having the usual wrapper descriptors for > __gt__. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From vstinner at redhat.com Fri Apr 5 12:00:31 2019 From: vstinner at redhat.com (Victor Stinner) Date: Fri, 5 Apr 2019 18:00:31 +0200 Subject: [Python-Dev] New Python Initialization API In-Reply-To: <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: Le dim. 31 mars 2019 ? 01:49, Steve Dower a ?crit : > Here is my first review of https://www.python.org/dev/peps/pep-0587/ and > in general I think it's very good. Ah nice, that's a good start :-) Thanks for reviewing it. Your email is long, and answer makes it even longer, so I will reply in multiple emails. > > ``PyWideCharList`` is a list of ``wchar_t*`` strings. > > I always forget whether "const" is valid in C99, but if it is, can we > make this a list of const strings? Short answer: no :-( This structure mostly exists to simplify the implementation. Sadly, "const PyWideCharList" doesn't automatically make PyWideCharList.items an array of "const wchar_t*". I tried some hacks to have an array of const strings... but it would be very complicated and not natural at all in C. Sadly, it's way more simple to have "wchar_t*" in practice. > I also prefer a name like ``PyWideStringList``, since that's what it is > (the other places we use WideChar in the C API refer only to a single > string, as far as I'm aware). I'm fine with this name. > > ``PyInitError`` is a structure to store an error message or an exit code > > for the Python Initialization. > > I love this struct! Currently it's private, but I wonder whether it's > worth making it public as PyError (or PyErrorInfo)? The PEP 587 makes the structure public, but I'm not sure about calling it PyError because PyInitError also allows to exit Python with an exit status which is something specific to the initialization. If you want to use a structure to reporting errors, I would prefer to add a new simpler PyError structure to only report an error message, but never exit Python. PyInitError use case is really specific to Python initialization. Moreover, the API is inefficient since it is returned by copy, not by reference. That's fine for Python initialization which only happens once and is not part of "hot code". I'm not sure if PyError would need to store the C function name where the error is triggered. Usually, we try hard to hide Python internals to the user. > > * ``exitcode`` (``int``): if greater or equal to zero, argument passed to > > ``exit()`` > > Windows is likely to need/want negative exit codes, as many system > errors are represented as 0x80000000|(source of error)|(specific code). Hum, int was used in Python 3.6 code base. We change change PyInitError.exitcode type to DWORD on Windows, but use int on Unix. We can add a private field to check if the error is an error message or an exit code. Or maybe check if the error message is NULL or not. Py_INIT_ERR(MSG) must never be called with Py_INIT_ERR(NULL) and it should be called with a static string, not with a dynamically allocated string (since the API doesn't allow to release memory). > > * ``user_err`` (int): if non-zero, the error is caused by the user > > configuration, otherwise it's an internal Python error. > > Maybe we could just encode this as "positive exitcode is user error, > negative is internal error"? 
I'm pretty sure struct return values are > passed by reference in most C calling conventions, so the size of the > struct isn't a big deal, but without a proper bool type it may look like > this is a second error code (like errno/winerror in a few places). Honestly, I'm not sure that we really have to distinguish "user error" and "internal error". It's an old debate about calling abort()/DebugBreak() or not. It seems like most users are annoyed by getting a coredump on Unix when abort() is called. Maybe we should just remove Py_INIT_USER_ERR(), always use Py_INIT_ERR(), and never call abort()/DebugBreak() in Py_ExitInitError(). Or does anyone see a good reason to trigger a debugger on an initialization error? See https://bugs.python.org/issue19983 discussion: "When interrupted during startup, Python should not call abort() but exit()" Note: I'm not talking about Py_FatalError() here, this one will not change. Victor From vstinner at redhat.com Fri Apr 5 12:12:50 2019 From: vstinner at redhat.com (Victor Stinner) Date: Fri, 5 Apr 2019 18:12:50 +0200 Subject: [Python-Dev] New Python Initialization API In-Reply-To: <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: About PyPreConfig and encodings. > The appendix is excellent, by the way. Very useful detail to have > written down. Thanks. The appendix is based on Include/cpython/coreconfig.h comments which is now my reference documentation. By the way, this header file contains more information about PyConfig fields than the PEP 587. For example, the comment on filesystem_encoding and filesystem_errors lists every single cases and exceptions (it describes the implementation). > > ``PyPreConfig`` structure is used to pre-initialize Python: > > > > * Set the memory allocator > > * Configure the LC_CTYPE locale > > * Set the UTF-8 mode > > I think we should have the isolated flag in here - oh wait, we do - I > think we should have the isolated/use_environment options listed in this > list :) My introduction paragraph only explains the changes made by Py_PreInitialize(): calling Py_PreInitialize() doesn't "isolate" Python. PyPreConfig.isolated is used to decide if Python reads environment variables or not. Examples: PYTHONMALLOC, PYTHONUTF8, PYTHONDEVMODE (which has an impact on PyPreConfig.allocator), PYTHONCOERCECLOCALE, etc. That's why isolated and use_environment are present in PyPreConfig and PyConfig. In practice, values should be equal in both structures. Moreover, if PyConfig.isolated is equal to 1, Py_InitializeFromConfig() updates _PyRuntime.preconfig.isolated to 1 ;-) > > * ``PyInitError Py_PreInitialize(const PyPreConfig *config)`` > > * ``PyInitError Py_PreInitializeFromArgs( const PyPreConfig *config, > int argc, char **argv)`` > > * ``PyInitError Py_PreInitializeFromWideArgs( const PyPreConfig > *config, int argc, wchar_t **argv)`` > > I hope to one day be able to support multiple runtimes per process - can > we have an opaque PyRuntime object exposed publicly now and passed into > these functions? I hesitated to include a "_PyRuntimeState*" parameter somewhere, but I chose to not do so. Currently, there is a single global variable _PyRuntime which has the type _PyRuntimeState. The _PyRuntime_Initialize() API is designed around this global variable. 
For example, _PyRuntimeState contains the registry of interpreters: you don't want to have multiple registries :-) I understood that we should only have a single instance of _PyRuntimeState. So IMHO it's fine to keep it private at this point. There is no need to expose it in the API. > (FWIW, I think we're a long way from being able to support multiple > runtimes *simultaneously*, so the initial implementation here would be > to have a PyRuntime_Create() that returns our global one once and then > errors until it's finalised. The migration path is probably to enable > switching of the current runtime via a dedicated function (i.e. one > active at a time, probably with thread local storage), since we have no > "context" parameter for C API functions, and obviously there are still > complexities such as poorly written extension modules that nonetheless > can be dealt with in embedding scenarios by simply not using them. This > doesn't seem like an unrealistic future, *unless* we add a whole lot of > new APIs now that can't allow it :) ) FYI I tried to design an internal API with a "context" to pass _PyRuntimeState, PyPreConfig, _PyConfig, the current interpreter, etc. => https://bugs.python.org/issue35265 My first need was to pass a memory allocator to Py_DecodeLocale(). There are 2 possible implementations: * Modify *all* functions to add a new "context" parameter and modify *all* functions to pass this parameter to sub-functions. * Store the current "context" as a thread local variable or something like that. I wrote a proof-of-concept of the first option: the implementation was very painful to write: a lot of changes which looks useless and a lot of new private functions which to pass the argument. I had to modify way too much code. I gave up. For the second option: well, there is no API change needed! It can be done later. Moreover, we already have such API! PyThreadState_Get() gets the Python thread state of the current thread: the current interpreter can be accessed from there. > > ``PyPreConfig`` fields: > > > > * ``coerce_c_locale_warn``: if non-zero, emit a warning if the C locale > > is coerced. > > * ``coerce_c_locale``: if equals to 2, coerce the C locale; if equals to > > 1, read the LC_CTYPE to decide if it should be coerced. > > Can we use another value for coerce_c_locale to determine whether to > warn or not? Save a field. coerce_c_locale is already complex, it can have 4 values: -1, 0, 1 and 2. I prefer keep a separated field. Moreover, I understood that you might want to coerce the C locale *and* get the warning, or get the warning but *not* coerce the locale. > > * ``legacy_windows_fs_encoding`` (Windows only): if non-zero, set the > > Python filesystem encoding to ``"mbcs"``. > > * ``utf8_mode``: if non-zero, enable the UTF-8 mode > > Why not just set the encodings here? For different technical reasons, you simply cannot specify an encoding name. You can also pass options to tell Python that you have some preferences (PyPreConfig and PyConfig fields). Python doesn't support any encoding and encoding errors combinations. In practice, it only supports a narrow set of choices. The main implementation are Py_EncodeLocale() and Py_DecodeLocale() functions which uses the C codec of the current locale encoding to implement the filesystem encoding, before the codec implemented in Python can be used. Basically, only the current locale encoding or UTF-8 are supported. If you want UTF-8, enable the UTF-8 Mode. To load the Python codec, you need importlib. 
importlib needs to access the filesystem which requires a codec to encode/decode file names (PyConfig.module_search_paths uses Unicode wchar_t* strings, but the C API only supports bytes char* strings). Py_PreInitialize() doesn't set the filesystem encoding. It initializes the LC_CTYPE locale and Python global configuration variables (Py_UTF8Mode and Py_LegacyWindowsFSEncodingFlag). > Obviously we are not ready to import most encodings after pre > initialization, but I think that's okay. Embedders who set something > outside the range of what can be used without importing encodings will > get an error to that effect if we try. You need a C implementation of the Python filesystem encoding very early in Python initialization. You cannot start with one encoding and "later" switch the encoding. I tried multiple times the last 10 years and I always failed to do that. All attempts failed with mojibake at different levels. Unix pays the price of its history. Windows is a very different story: there are API to access the filesystem with Unicode strings, there is no such "bootstrap problem" for importlib. > In fact, I'd be totally okay with letting embedders specify their own > function pointer here to do encoding/decoding between Unicode and the OS > preferred encoding. In my experience, when someone wants to get a specific encoding: they only want UTF-8. There is now the UTF-8 Mode which ignores the locale and forces the usage of UTF-8. I'm not sure that there is a need to have a custom codec. Moreover, if there an API to pass a codec in C, you will need to expose it somehow at the Python level for os.fsencode() and os.fsdecode(). Currently, Python ensures during early stage of startup that codecs.lookup(sys.getfilesystemencoding()) works: there is a existing Python codec for the requested filesystem encoding. Victor From vstinner at redhat.com Fri Apr 5 12:22:17 2019 From: vstinner at redhat.com (Victor Stinner) Date: Fri, 5 Apr 2019 18:22:17 +0200 Subject: [Python-Dev] New Python Initialization API In-Reply-To: <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: > > Example of Python initialization enabling the isolated mode:: > > > > PyConfig config = PyConfig_INIT; > > config.isolated = 1; > > Haven't we already used extenal values by this point that should have > been isolated? On this specific example, "config.isolated = 1;" ensures that Py_PreInitialize() is also called internally with "PyPreConfig.isolated = 1". > I'd rather have the isolation up front. Or better yet, > make isolation the default unless you call one of the "FromArgs" > functions, and then we don't actually need the config setting at all. While there are supporters of an "isolated Python" (sometimes called "system python"), the fact that it doesn't exist in any Linux distribution nor on any other operating system (Windows, macOS, FreeBSD), whereas it's already doable in Python 3.6 with Py_IsolatedFlag=1 makes me think that users like the ability to control Python with environment variables and configuration files. I would prefer to leave Python as not isolated by default. It's just a matter of comment line arguments. > > * The PEP 432 stores ``PYTHONCASEOK`` into the config. Do we need > > to add something for that into ``PyConfig``? How would it be exposed > > at the Python level for ``importlib``? 
Passed as an argument to > > ``importlib._bootstrap._setup()`` maybe? It can be added later if > > needed. > > Could we convert it into an xoption? It's very rarely used, to my knowledge. The first question is if there is any need for an embedder to change this option. Currently, importlib._bootstrap_external._install() reads the environment variable and it's the only way to control the option. ... By the way, importlib reads PYTHONCASEOK environment varaible even if isolated mode is enabled (sys.flags.isolated is equal to 1). Is it a bug? :-) Victor From vstinner at redhat.com Fri Apr 5 12:24:27 2019 From: vstinner at redhat.com (Victor Stinner) Date: Fri, 5 Apr 2019 18:24:27 +0200 Subject: [Python-Dev] New Python Initialization API In-Reply-To: <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: > I think my biggest point (about halfway down) is that I'd rather use > argv/environ/etc. to *initialize* PyConfig and then only use the config > for initializing the runtime. That way it's more transparent for users > and more difficult for us to add options that embedders can't access. I chose to exclude PyConfig_Read() function from the PEP to try to start with the bare minimum public API and see how far we can go with that. The core of the PEP 587 implementation are PyPreConfig_Read() and PyConfig_Read() functions (currently called _PyPreConfig_Read() and _PyCoreConfig_Read()): they populate all fields so the read config becomes the reference config which will be applied. For example, PyConfig_Read() fills module_search_paths, from other PyConfig fields: it will become sys.path. I spent a lot of time to rework deeply the implementation of PyConfig_Read() to make sure that it has no side effect. Reading and writing the configuration are now strictly separated. So it is safe to call PyConfig_Read(), modify PyConfig afterwards, and pass the modified config to Py_InitializeFromConfig(). Do you think that exposing PyConfig_Read() would solve some of your problems? > Currently you have three functions, that take a PyConfig and optionally > also use the environment/argv to figure out the settings: > > > * ``PyInitError Py_InitializeFromConfig(const PyConfig *config)`` > > * ``PyInitError Py_InitializeFromArgs(const PyConfig *config, int > argc, char **argv)`` > > * ``PyInitError Py_InitializeFromWideArgs(const PyConfig *config, int > argc, wchar_t **argv)`` > > I would much prefer to see this flipped around, so that there is one > initialize function taking PyConfig, and two functions that will fill > out the PyConfig based on the environment: > > (note two of the "const"s are gone) > > * ``PyInitError Py_SetConfigFromArgs(PyConfig *config, int argc, char > **argv)`` > * ``PyInitError Py_SetConfigFromWideArgs(PyConfig *config, int argc, > wchar_t **argv)`` > * ``PyInitError Py_InitializeFromConfig(const PyConfig *config)`` This implementation evolved *A LOT* last months. I was *very confused* until the pre-initialization phase was introduced which solved a lot of bootstrap issues. After I wrote down the PEP and read it again, I also came to the same conclusion: Py_InitializeFromConfig(config) should be enough, and we can add helper functions to set arguments on PyConfig (as you showed). 
Victor From vstinner at redhat.com Fri Apr 5 12:36:37 2019 From: vstinner at redhat.com (Victor Stinner) Date: Fri, 5 Apr 2019 18:36:37 +0200 Subject: [Python-Dev] New Python Initialization API In-Reply-To: <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: For the PyMainConfig structure idea, I cannot comment at this point. I need more time to think about it. About the "path configuration" fields, maybe a first step to enhance the API would be to add the following function: PyInitError PyConfig_ComputePath(PyConfig *config, const wchar_t *home); where home can be NULL (and the PyConfig.module_search_paths_env field goes away: the function reads the PYTHONPATH env var internally). This function would "compute the path configuration", which is what's currently listed in _PyCoreConfig under:

    /* Path configuration outputs */
    int use_module_search_paths;      /* If non-zero, use module_search_paths */
    _PyWstrList module_search_paths;  /* sys.path paths. Computed if
                                         use_module_search_paths is equal to zero. */
    wchar_t *executable;        /* sys.executable */
    wchar_t *prefix;            /* sys.prefix */
    wchar_t *base_prefix;       /* sys.base_prefix */
    wchar_t *exec_prefix;       /* sys.exec_prefix */
    wchar_t *base_exec_prefix;  /* sys.base_exec_prefix */
    #ifdef MS_WINDOWS
    wchar_t *dll_path;          /* Windows DLL path */
    #endif

Victor From storchaka at gmail.com Fri Apr 5 13:53:10 2019 From: storchaka at gmail.com (Serhiy Storchaka) Date: Fri, 5 Apr 2019 20:53:10 +0300 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: <5CA74B19.70806@UGent.be> References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> Message-ID: 05.04.19 15:33, Jeroen Demeyer wrote: > On 2019-04-05 15:13, Serhiy Storchaka wrote: >> It is easy to implement a function in C. > > Why does it need to be a PyCFunction? You could put an actual method > descriptor in the class. In other words, use PyDescr_NewMethod() instead > of PyCFunction_New() + PyInstanceMethod_New(). It's probably going to be > faster too since the instancemethod adds an unoptimized extra level of > indirection. PyDescr_NewMethod() takes a PyTypeObject* which is not known at that moment. But maybe passing &PyBaseObject_Type will do the trick. I need to try. >> Yes, this is what I want to do. I did not do this only because >> implementing method-like functions in C which do not belong to a concrete >> class is not conventional. > > Sure, you could implement separate methods like __gt__ in C, but that's > still less efficient than just implementing a specific tp_richcompare > for total_ordering and then having the usual wrapper descriptors for > __gt__. At the Python level we can monkeypatch __gt__, but not tp_richcompare. In any case, removing a C API is a large breakage, and it is better to avoid it unless that API is inherently broken.
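For readers following along, here is a rough sketch of the pattern being discussed. The names (gt_from_lt, patch_gt) are made up for illustration and the comparison logic is elided; this is not the actual total_ordering patch:

    #include <Python.h>

    /* Hypothetical __gt__ derived from __lt__/__eq__; the real logic is
       elided because only the calling convention matters here.  With the
       PyInstanceMethod_New() pattern the instance arrives as the first
       item of "args", not as the C-level "self" argument. */
    static PyObject *
    gt_from_lt(PyObject *module, PyObject *args)
    {
        PyObject *self, *other;
        if (!PyArg_ParseTuple(args, "OO:__gt__", &self, &other)) {
            return NULL;
        }
        /* ...would compute the result from self.__lt__(other) here... */
        Py_RETURN_NOTIMPLEMENTED;
    }

    static PyMethodDef gt_def = {
        "__gt__", gt_from_lt, METH_VARARGS,
        "Return self > other, derived from __lt__()."
    };

    /* Wrap the plain C function so that it binds "self" when stored on a
       (heap) class.  A bare PyCFunction is not a descriptor, so without
       the PyInstanceMethod_New() wrapper it would behave like a
       staticmethod. */
    static int
    patch_gt(PyObject *cls)
    {
        PyObject *func = PyCFunction_New(&gt_def, NULL);
        if (func == NULL) {
            return -1;
        }
        PyObject *meth = PyInstanceMethod_New(func);
        Py_DECREF(func);
        if (meth == NULL) {
            return -1;
        }
        int res = PyObject_SetAttrString(cls, "__gt__", meth);
        Py_DECREF(meth);
        return res;
    }

Jeroen's alternative, PyDescr_NewMethod(type, &gt_def), produces a real method descriptor instead, but it needs the concrete PyTypeObject* up front and it passes the instance as the C-level first argument rather than inside "args" -- which is exactly the trade-off debated above.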
From christian at python.org Fri Apr 5 13:00:00 2019 From: christian at python.org (Christian Heimes) Date: Fri, 5 Apr 2019 19:00:00 +0200 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> Message-ID: <52930d72-803b-0f8b-b338-a96e914ff2eb@python.org> On 05/04/2019 17.46, Guido van Rossum wrote: > Let's stop here. This API is doing no harm, it's not a maintenance > burden, clearly *some* folks have a use for it. Let's just keep it, > okay? There are bigger fish to fry. Sounds good to me. My code is 12 years ago and I can't remember any complain. I have closed the BPO issue and PR. From J.Demeyer at UGent.be Fri Apr 5 13:56:08 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Fri, 5 Apr 2019 19:56:08 +0200 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> Message-ID: <5CA796B8.1000202@UGent.be> On 2019-04-05 19:53, Serhiy Storchaka wrote: > At Python level we can monkeypatch __gt__, but not tp_richcompare. Sure, but you're planning to use C anyway so that's not really an argument. From status at bugs.python.org Fri Apr 5 14:07:47 2019 From: status at bugs.python.org (Python tracker) Date: Fri, 5 Apr 2019 18:07:47 +0000 (UTC) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20190405180747.30B2D52B1D6@bugs.ams1.psf.io> ACTIVITY SUMMARY (2019-03-29 - 2019-04-05) Python tracker at https://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. Issues counts and deltas: open 7056 ( +8) closed 41231 (+55) total 48287 (+63) Open issues with patches: 2816 Issues opened (44) ================== #36260: Cpython/Lib vulnerability found and request a patch submission https://bugs.python.org/issue36260 reopened by krnick #36474: RecursionError resets trace function set via sys.settrace https://bugs.python.org/issue36474 opened by blueyed #36475: PyEval_AcquireLock() and PyEval_AcquireThread() do not handle https://bugs.python.org/issue36475 opened by eric.snow #36476: Runtime finalization assumes all other threads have exited. 
https://bugs.python.org/issue36476 opened by eric.snow #36478: backport of pickle fixes to Python 3.5.7 uses C99 for loops https://bugs.python.org/issue36478 opened by Anthony Sottile #36479: Exit threads when interpreter is finalizing rather than runtim https://bugs.python.org/issue36479 opened by eric.snow #36481: telnetlib process_rawq() callback https://bugs.python.org/issue36481 opened by Quanthir #36484: Can't reorder TLS 1.3 ciphersuites https://bugs.python.org/issue36484 opened by Eman Alashwali #36485: Establish a uniform way to clear all caches in a given module https://bugs.python.org/issue36485 opened by serhiy.storchaka #36486: Bugs and inconsistencies in unicodedata https://bugs.python.org/issue36486 opened by dscorbett #36487: Make C-API docs clear about what the "main" interpreter is https://bugs.python.org/issue36487 opened by nanjekyejoannah #36488: os.sendfile() on BSD and macOS does not return bytes sent on E https://bugs.python.org/issue36488 opened by giampaolo.rodola #36489: add filename_extension_map and/or content-types_map dict(s) to https://bugs.python.org/issue36489 opened by Daniel Black #36490: Modernize function signature format in Archiving section of sh https://bugs.python.org/issue36490 opened by CAM-Gerlach #36494: bdb.Bdb.set_trace: should set f_trace_lines = True https://bugs.python.org/issue36494 opened by blueyed #36495: Out-of-bounds array reads in Python/ast.c https://bugs.python.org/issue36495 opened by blarsen #36497: Undocumented behavior in csv.Sniffer (preferred delimiters) https://bugs.python.org/issue36497 opened by thomas #36500: Add "regen-*" equivalent projects for Windows builds https://bugs.python.org/issue36500 opened by anthony shaw #36501: Remove POSIX.1e ACLs in tests that rely on default permissions https://bugs.python.org/issue36501 opened by Ivan.Pozdeev #36502: The behavior of str.isspace() for U+00A0 and U+202F is differe https://bugs.python.org/issue36502 opened by Jun #36503: remove references to aix3 and aix4 in \*.py https://bugs.python.org/issue36503 opened by Michael.Felt #36504: Signed integer overflow in _ctypes.c's PyCArrayType_new() https://bugs.python.org/issue36504 opened by ZackerySpytz #36506: [security] CVE-2019-10268: An arbitrary execution vulnerabilit https://bugs.python.org/issue36506 opened by bigbigliang #36508: python-config --ldflags must not contain LINKFORSHARED ("-Xlin https://bugs.python.org/issue36508 opened by vstinner #36509: Add iot layout for windows iot containers https://bugs.python.org/issue36509 opened by Paul Monson #36511: Add Windows ARM32 buildbot https://bugs.python.org/issue36511 opened by Paul Monson #36512: future_factory argument for Thread/ProcessPoolExecutor https://bugs.python.org/issue36512 opened by stefanhoelzl #36513: Add support for building arm32 nuget package https://bugs.python.org/issue36513 opened by Paul Monson #36515: unaligned memory access in the _sha3 extension https://bugs.python.org/issue36515 opened by doko #36516: Python Launcher can not recognize pyw file as Python GUI Scrip https://bugs.python.org/issue36516 opened by gjj2828 #36517: typing.NamedTuple does not support mixins https://bugs.python.org/issue36517 opened by rectalogic #36518: Avoid conflicts when pass arbitrary keyword arguments to Pytho https://bugs.python.org/issue36518 opened by serhiy.storchaka #36519: Blake2b/s implementations have minor GIL issues https://bugs.python.org/issue36519 opened by gwk #36520: Email header folded incorrectly https://bugs.python.org/issue36520 opened by Jonathan Horn 
#36521: Consider removing docstrings from co_consts in code objects https://bugs.python.org/issue36521 opened by rhettinger #36523: missing docs for IOBase writelines https://bugs.python.org/issue36523 opened by Marcin Niemira #36527: unused parameter warnings in Include/object.h (affecting build https://bugs.python.org/issue36527 opened by AMDmi3 #36528: Remove duplicate tests in Lib/tests/re_tests.py https://bugs.python.org/issue36528 opened by xtreak #36529: Python from WindowsStore: can't install package using "-m pip" https://bugs.python.org/issue36529 opened by Ilya Kazakevich #36531: PyType_FromSpec wrong behavior with multiple Py_tp_members https://bugs.python.org/issue36531 opened by eelizondo #36532: Example of logging.formatter with new str.format style https://bugs.python.org/issue36532 opened by spaceman_spiff #36533: logging regression with threading + fork are mixed in 3.7.1rc2 https://bugs.python.org/issue36533 opened by gregory.p.smith #36534: tarfile: handling Windows (path) illegal characters in archive https://bugs.python.org/issue36534 opened by CristiFati #36535: Windows build failure when use the code from the GitHub master https://bugs.python.org/issue36535 opened by Manjusaka Most recent 15 issues with no replies (15) ========================================== #36535: Windows build failure when use the code from the GitHub master https://bugs.python.org/issue36535 #36531: PyType_FromSpec wrong behavior with multiple Py_tp_members https://bugs.python.org/issue36531 #36529: Python from WindowsStore: can't install package using "-m pip" https://bugs.python.org/issue36529 #36528: Remove duplicate tests in Lib/tests/re_tests.py https://bugs.python.org/issue36528 #36527: unused parameter warnings in Include/object.h (affecting build https://bugs.python.org/issue36527 #36523: missing docs for IOBase writelines https://bugs.python.org/issue36523 #36520: Email header folded incorrectly https://bugs.python.org/issue36520 #36517: typing.NamedTuple does not support mixins https://bugs.python.org/issue36517 #36516: Python Launcher can not recognize pyw file as Python GUI Scrip https://bugs.python.org/issue36516 #36515: unaligned memory access in the _sha3 extension https://bugs.python.org/issue36515 #36513: Add support for building arm32 nuget package https://bugs.python.org/issue36513 #36512: future_factory argument for Thread/ProcessPoolExecutor https://bugs.python.org/issue36512 #36511: Add Windows ARM32 buildbot https://bugs.python.org/issue36511 #36509: Add iot layout for windows iot containers https://bugs.python.org/issue36509 #36503: remove references to aix3 and aix4 in \*.py https://bugs.python.org/issue36503 Most recent 15 issues waiting for review (15) ============================================= #36532: Example of logging.formatter with new str.format style https://bugs.python.org/issue36532 #36531: PyType_FromSpec wrong behavior with multiple Py_tp_members https://bugs.python.org/issue36531 #36528: Remove duplicate tests in Lib/tests/re_tests.py https://bugs.python.org/issue36528 #36527: unused parameter warnings in Include/object.h (affecting build https://bugs.python.org/issue36527 #36523: missing docs for IOBase writelines https://bugs.python.org/issue36523 #36518: Avoid conflicts when pass arbitrary keyword arguments to Pytho https://bugs.python.org/issue36518 #36516: Python Launcher can not recognize pyw file as Python GUI Scrip https://bugs.python.org/issue36516 #36515: unaligned memory access in the _sha3 extension https://bugs.python.org/issue36515 
#36513: Add support for building arm32 nuget package https://bugs.python.org/issue36513 #36512: future_factory argument for Thread/ProcessPoolExecutor https://bugs.python.org/issue36512 #36509: Add iot layout for windows iot containers https://bugs.python.org/issue36509 #36508: python-config --ldflags must not contain LINKFORSHARED ("-Xlin https://bugs.python.org/issue36508 #36504: Signed integer overflow in _ctypes.c's PyCArrayType_new() https://bugs.python.org/issue36504 #36503: remove references to aix3 and aix4 in \*.py https://bugs.python.org/issue36503 #36501: Remove POSIX.1e ACLs in tests that rely on default permissions https://bugs.python.org/issue36501 Top 10 most discussed issues (10) ================================= #36485: Establish a uniform way to clear all caches in a given module https://bugs.python.org/issue36485 13 msgs #36466: Adding a way to strip annotations from compiled bytecode https://bugs.python.org/issue36466 12 msgs #36469: Stuck during interpreter exit, attempting to take the GIL https://bugs.python.org/issue36469 10 msgs #36506: [security] CVE-2019-10268: An arbitrary execution vulnerabilit https://bugs.python.org/issue36506 8 msgs #6721: Locks in the standard library should be sanitized on fork https://bugs.python.org/issue6721 7 msgs #36533: logging regression with threading + fork are mixed in 3.7.1rc2 https://bugs.python.org/issue36533 6 msgs #35866: concurrent.futures deadlock https://bugs.python.org/issue35866 5 msgs #36384: ipaddress Should not reject IPv4 addresses with leading zeroes https://bugs.python.org/issue36384 5 msgs #30661: Support tarfile.PAX_FORMAT in shutil.make_archive https://bugs.python.org/issue30661 4 msgs #35224: PEP 572: Assignment Expressions https://bugs.python.org/issue35224 4 msgs Issues closed (53) ================== #17110: sys.argv docs should explaining how to handle encoding issues https://bugs.python.org/issue17110 closed by inada.naoki #20844: SyntaxError: encoding problem: iso-8859-1 on Windows https://bugs.python.org/issue20844 closed by inada.naoki #21269: Provide args and kwargs attributes on mock call objects https://bugs.python.org/issue21269 closed by xtreak #22831: Use "with" to avoid possible fd leaks https://bugs.python.org/issue22831 closed by serhiy.storchaka #24214: UTF-8 incremental decoder doesn't support surrogatepass correc https://bugs.python.org/issue24214 closed by serhiy.storchaka #25451: tkinter: PhotoImage transparency methods https://bugs.python.org/issue25451 closed by serhiy.storchaka #29202: Improve dict iteration https://bugs.python.org/issue29202 closed by inada.naoki #31182: Suggested Enhancements to zipfile & tarfile command line inter https://bugs.python.org/issue31182 closed by brett.cannon #32413: Document that locals() may return globals() https://bugs.python.org/issue32413 closed by brett.cannon #32531: gdb.execute can not put string value. 
https://bugs.python.org/issue32531 closed by berker.peksag #32538: Multiprocessing Manager on 3D list - no change of the list pos https://bugs.python.org/issue32538 closed by berker.peksag #33261: inspect.isgeneratorfunction fails on hand-created methods https://bugs.python.org/issue33261 closed by petr.viktorin #34430: Symmetrical chaining futures in asyncio.future.wrap_future https://bugs.python.org/issue34430 closed by huji #35272: sqlite3 get the connected database url https://bugs.python.org/issue35272 closed by berker.peksag #35403: support application/wasm in mimetypes and http.server https://bugs.python.org/issue35403 closed by martin.panter #35838: ConfigParser: document optionxform must be idempotent https://bugs.python.org/issue35838 closed by inada.naoki #36010: Please provide a .zip Windows release of Python that is not cr https://bugs.python.org/issue36010 closed by steve.dower #36026: Different error message when sys.settrace is used https://bugs.python.org/issue36026 closed by inada.naoki #36085: Enable better DLL resolution https://bugs.python.org/issue36085 closed by steve.dower #36157: Document PyInterpreterState_Main(). https://bugs.python.org/issue36157 closed by eric.snow #36293: Nonblocking read sys.stdin raises error https://bugs.python.org/issue36293 closed by martin.panter #36322: Argument typo in dbm.ndbm.open https://bugs.python.org/issue36322 closed by brett.cannon #36377: Python 'datastructures.html' docs page needs improvement becau https://bugs.python.org/issue36377 closed by rhettinger #36404: Document PendingDeprecationWarning is not so useful. https://bugs.python.org/issue36404 closed by inada.naoki #36426: exec() issue when used inside function https://bugs.python.org/issue36426 closed by ncoghlan #36434: Zipfile breaks if signalled during write() https://bugs.python.org/issue36434 closed by serhiy.storchaka #36440: more helpful diagnostics for parser module https://bugs.python.org/issue36440 closed by pablogsal #36442: Different ValueError for the same operation in List and Tuple https://bugs.python.org/issue36442 closed by serhiy.storchaka #36445: bus error in test_gil test on armhf running with 64bit kernel https://bugs.python.org/issue36445 closed by doko #36448: Message "You will need to rebuild pythoncore to see the change https://bugs.python.org/issue36448 closed by steve.dower #36468: Treeview: wrong color change https://bugs.python.org/issue36468 closed by ned.deily #36472: Some old PR with CLA not signed https://bugs.python.org/issue36472 closed by brett.cannon #36473: dictkeysobject: Add maximum iteration check for .values() and https://bugs.python.org/issue36473 closed by inada.naoki #36477: Subinterpreters are not finalized during runtime finalization. 
https://bugs.python.org/issue36477 closed by eric.snow #36480: .strip() unexpected output on Windows https://bugs.python.org/issue36480 closed by eric.smith #36482: let struct's internal cache use FIFO policy https://bugs.python.org/issue36482 closed by rhettinger #36483: Missing line in documentation example https://bugs.python.org/issue36483 closed by martin.panter #36491: sum function's start optional parameter documented in help but https://bugs.python.org/issue36491 closed by rhettinger #36492: Deprecate passing some conflicting arguments by keyword https://bugs.python.org/issue36492 closed by serhiy.storchaka #36493: Add math.midpoint(a,b) function https://bugs.python.org/issue36493 closed by scoder #36496: Local variables can be used uninitialized in _PyPreConfig_Read https://bugs.python.org/issue36496 closed by vstinner #36498: combining dict comprehensing and lists lead to IndexError https://bugs.python.org/issue36498 closed by SilentGhost #36499: unpickling of a datetime object in 3.5 fails when pickled with https://bugs.python.org/issue36499 closed by josh.r #36505: PYTHON-CAN with vector https://bugs.python.org/issue36505 closed by SilentGhost #36507: frozenset type breaks ZFC https://bugs.python.org/issue36507 closed by rhettinger #36510: Regular Expression Dot-Star patter matching - re- text skippi https://bugs.python.org/issue36510 closed by SilentGhost #36514: -m switch revisited https://bugs.python.org/issue36514 closed by ronaldoussoren #36522: http/client.py does not print duplicate header values in debug https://bugs.python.org/issue36522 closed by serhiy.storchaka #36524: identity operator https://bugs.python.org/issue36524 closed by SilentGhost #36525: Deprecate instancemethod https://bugs.python.org/issue36525 closed by christian.heimes #36526: python crash when loading some .pyc file https://bugs.python.org/issue36526 closed by serhiy.storchaka #36530: Document codecs decode_encode() and encode_decode() APIs https://bugs.python.org/issue36530 closed by gregory.p.smith #36536: is there a python implementation of the cpython commandline in https://bugs.python.org/issue36536 closed by larry From J.Demeyer at UGent.be Fri Apr 5 14:29:00 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Fri, 5 Apr 2019 20:29:00 +0200 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> Message-ID: <5CA79E6C.904@UGent.be> On 2019-04-05 17:46, Guido van Rossum wrote: > This API is doing no harm, it's not a maintenance > burden What if the following happens? 1. For some reason (possibly because of this thread), people discover instancemethod and start using it. 2. People realize that it's slow. 3. It needs to be made more efficient, causing new code bloat and maintenance burden. > clearly *some* folks have a use for it. I'm not convinced. I don't think that instancemethod is the right solution for functools.total_ordering for example. Jeroen. 
From brett at python.org Fri Apr 5 15:58:47 2019 From: brett at python.org (Brett Cannon) Date: Fri, 5 Apr 2019 12:58:47 -0700 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: <5CA79E6C.904@UGent.be> References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> <5CA79E6C.904@UGent.be> Message-ID: On Fri, Apr 5, 2019 at 11:30 AM Jeroen Demeyer wrote: > On 2019-04-05 17:46, Guido van Rossum wrote: > > This API is doing no harm, it's not a maintenance > > burden > > What if the following happens? > > 1. For some reason (possibly because of this thread), people discover > instancemethod and start using it. > > 2. People realize that it's slow. > > 3. It needs to be made more efficient, causing new code bloat and > maintenance burden. > Then we can consider improving the documentation if there are performance implications. But the point is if there's code out there already using it without issue then ripping it out of the C API is painful since we don't have nearly as good of a deprecation setup as we do in Python code. Not everything about the C APi is about performance. -Brett > > > clearly *some* folks have a use for it. > > I'm not convinced. OK, but as of right now others like me are convinced and we typically err on the side of backwards-compatibility in these kinds of situations. -Brett > I don't think that instancemethod is the right > solution for functools.total_ordering for example. > > > Jeroen. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/brett%40python.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From J.Demeyer at UGent.be Fri Apr 5 16:09:48 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Fri, 5 Apr 2019 22:09:48 +0200 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> <5CA79E6C.904@UGent.be> Message-ID: <5CA7B60C.1060605@UGent.be> On 2019-04-05 21:58, Brett Cannon wrote: > Then we can consider improving the documentation if there are > performance implications. Sure, we could write in the docs something like "Don't use this, this is not what you want. It's slow and there are better alternatives like method descriptors". Should I do that (with better wording of course)? > since we don't have nearly as good of a deprecation setup as we > do in Python code. I don't get this. One can easily raise a DeprecationWarning from C code, there is plenty of code already doing that. Jeroen. From brett at python.org Fri Apr 5 20:30:31 2019 From: brett at python.org (Brett Cannon) Date: Fri, 5 Apr 2019 17:30:31 -0700 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: <5CA7B60C.1060605@UGent.be> References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> <5CA79E6C.904@UGent.be> <5CA7B60C.1060605@UGent.be> Message-ID: On Fri, Apr 5, 2019 at 1:11 PM Jeroen Demeyer wrote: > On 2019-04-05 21:58, Brett Cannon wrote: > > Then we can consider improving the documentation if there are > > performance implications. 
> > Sure, we could write in the docs something like "Don't use this, this is > not what you want. It's slow and there are better alternatives like > method descriptors". Should I do that (with better wording of course)? > Up to you. Obviously help is always appreciated, just a question of who feels qualified to review the PR. > > > since we don't have nearly as good of a deprecation setup as we > > do in Python code. > > I don't get this. One can easily raise a DeprecationWarning from C code, > there is plenty of code already doing that. > True. I personally prefer compile-time warnings for that sort of thing, but you're right we can do it at the Python "level" with a raise of a DeprecationWarning on those instances. -------------- next part -------------- An HTML attachment was scrubbed... URL: From doko at ubuntu.com Fri Apr 5 22:39:49 2019 From: doko at ubuntu.com (Matthias Klose) Date: Sat, 6 Apr 2019 04:39:49 +0200 Subject: [Python-Dev] PEP-582 and multiple Python installations In-Reply-To: <2b889555-db6f-6c69-0347-ebb89d6fec21@python.org> References: <2b889555-db6f-6c69-0347-ebb89d6fec21@python.org> Message-ID: <46858269-5ce0-5f70-9a08-f22135b5c1e9@ubuntu.com> On 02.04.19 18:10, Steve Dower wrote: > On 02Apr2019 0817, Calvin Spealman wrote: >> (I originally posted this to python-ideas, where I was told none of this PEP's >> authors subscribe so probably no one will see it there, so I'm posting it here >> to raise the issue where it can get seen and hopefully discussed) > > Correct, thanks for posting. (I thought we had a "discussions-to" tag with > distutils-sig on it, but apparently not.) > >> While the PEP does show the version number as part of the path to the actual >> packages, implying support for multiple versions, this doesn't seem to be >> spelled out in the actual text. Presumably __pypackages__/3.8/ might sit >> beside __pypackages__/3.9/, etc. to keep future versions capable of installing >> packages for each version, the way virtualenv today is bound to one version of >> Python. >> >> I'd like to raise a potential edge case that might be a problem, and likely an >> increasingly common one: users with multiple installations of the *same* >> version of Python. This is actually a common setup for Windows users who use >> WSL, Microsoft's Linux-on-Windows solution, as you could have both the Windows >> and Linux builds of a given Python version installed on the same machine. The >> currently implied support for multiple versions would not be able to separate >> these and could create problems if users pip install a Windows binary package >> through Powershell and then try to run a script in Bash from the same >> directory, causing the Linux version of Python to try to use Windows python >> packages. >> >> I'm not actually sure what the solution here is. Mostly I wanted to raise the >> concern, because I'm very keen on WSL being a great entry path for new >> developers and I want to make that a better experience, not a more confusing >> one. Maybe that version number could include some other unique identify, maybe >> based on Python's own executable. A hash maybe? I don't know if anything like >> that already exists to uniquely identify a Python build or installation. > > Yes, this is a situation we're aware of, and it's caught in the conflict of "who > is this feature meant to support". This smells the same like mixing system installed python packages (deb/rpm) with one managed by pip, and pip touching system installed packages. 
> Since all platforms have a unique extension module suffix (e.g. > "module.cp38-win32.pyd"), it would be possible to support this with "fat" > packages that include all binaries (or some clever way of merging wheels for > multiple platforms). unfortunately not. The Android developers opted out of that, reverting that change. Also how would you differentiate win32 builds for different architectures? But maybe this is already done. > And since this is already in CPython itself, it leads to about the only > reasonable solution - instead of "3.8", use the extension module suffix > "cp38-win32". (Wheel tags are not in core CPython, so we can't use those.) > > But while this seems obvious, it also reintroduces problems that this has the > potential to fix - suddenly, just like installing into your global environment, > your packages are not project-specific anymore but are Python-specific. Which is > one of the major confusions people run into ("I pip installed X but now can't > import it in python"). > > So the main points of discussion right now are "whose problem does this solve" > and "when do we tell people they need a full venv". And that discussion is > mostly happening at > https://discuss.python.org/t/pep-582-python-local-packages-directory/963/ > > Cheers, > Steve > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/doko%40ubuntu.com From songofacandy at gmail.com Sat Apr 6 01:09:37 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Sat, 6 Apr 2019 14:09:37 +0900 Subject: [Python-Dev] New Python Initialization API In-Reply-To: References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: On Sat, Apr 6, 2019 at 1:13 AM Victor Stinner wrote: > > > > ``PyPreConfig`` fields: > > > > > > * ``coerce_c_locale_warn``: if non-zero, emit a warning if the C locale > > > is coerced. > > > * ``coerce_c_locale``: if equals to 2, coerce the C locale; if equals to > > > 1, read the LC_CTYPE to decide if it should be coerced. > > > > Can we use another value for coerce_c_locale to determine whether to > > warn or not? Save a field. > > coerce_c_locale is already complex, it can have 4 values: -1, 0, 1 and 2. > I prefer keep a separated field. > > Moreover, I understood that you might want to coerce the C locale *and* > get the warning, or get the warning but *not* coerce the locale. > Are these configurations are really needed? Applications embedding Python may not initialize Python interpreter at first. For example, vim initializes Python when Python is used first time. On the other hand, C locale coercion should be done ASAP application starts. I think dedicated API for coercing C locale is better than preconfig. // When application starts: Py_CoerceCLocale(warn=0); // later... 
Py_Initialize(); -- Inada Naoki From vstinner at redhat.com Sat Apr 6 09:37:28 2019 From: vstinner at redhat.com (Victor Stinner) Date: Sat, 6 Apr 2019 15:37:28 +0200 Subject: [Python-Dev] New Python Initialization API In-Reply-To: References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: Maybe I should clarify in the PEP 587 Rationale what are the use cases for the API. Embeding Python is one kind of use case, but writing your own Python with a specific config like "isolated Python" or "system Python" is also a valid use case. For a custom Python, you might want to get C locale coercion and UTF-8 Mode. The most common case is to embed Python in an application like Blender or vim: the application already executes a lot of code and manipulated strings and encoding before Python is initialized, so Python must not coerce the C locale in that case. That's why Nick and me decided to disable C loclae coercion and UTF-8 Mode by default when the C API is used. Victor Le samedi 6 avril 2019, Inada Naoki a ?crit : > On Sat, Apr 6, 2019 at 1:13 AM Victor Stinner wrote: >> >> > > ``PyPreConfig`` fields: >> > > >> > > * ``coerce_c_locale_warn``: if non-zero, emit a warning if the C locale >> > > is coerced. >> > > * ``coerce_c_locale``: if equals to 2, coerce the C locale; if equals to >> > > 1, read the LC_CTYPE to decide if it should be coerced. >> > >> > Can we use another value for coerce_c_locale to determine whether to >> > warn or not? Save a field. >> >> coerce_c_locale is already complex, it can have 4 values: -1, 0, 1 and 2. >> I prefer keep a separated field. >> >> Moreover, I understood that you might want to coerce the C locale *and* >> get the warning, or get the warning but *not* coerce the locale. >> > > Are these configurations are really needed? > > Applications embedding Python may not initialize Python interpreter at first. > For example, vim initializes Python when Python is used first time. > > On the other hand, C locale coercion should be done ASAP application starts. > > I think dedicated API for coercing C locale is better than preconfig. > > // When application starts: > Py_CoerceCLocale(warn=0); > > // later... > Py_Initialize(); > > -- > Inada Naoki > -- Night gathers, and now my watch begins. It shall not end until my death. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Sat Apr 6 22:45:36 2019 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 7 Apr 2019 12:45:36 +1000 Subject: [Python-Dev] New Python Initialization API In-Reply-To: References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: On Sat, 6 Apr 2019 at 02:16, Victor Stinner wrote: > > > ``PyPreConfig`` fields: > > > > > > * ``coerce_c_locale_warn``: if non-zero, emit a warning if the C locale > > > is coerced. > > > * ``coerce_c_locale``: if equals to 2, coerce the C locale; if equals to > > > 1, read the LC_CTYPE to decide if it should be coerced. > > > > Can we use another value for coerce_c_locale to determine whether to > > warn or not? Save a field. > > coerce_c_locale is already complex, it can have 4 values: -1, 0, 1 and 2. > I prefer keep a separated field. 
> > Moreover, I understood that you might want to coerce the C locale *and* > get the warning, or get the warning but *not* coerce the locale. Yeah, that's how they ended up being two different fields in the first place. However, I wonder if the two fields might be better named: * warn_on_legacy_c_locale * coerce_legacy_c_locale Neither set: legacy C locale is left alone Only warning flag set: complain about the legacy C locale on stderr Only coercion flag set: silently attempt to coerce the legacy C locale to a UTF-8 based one Both flags set: attempt the coercion, and then complain about it on stderr (regardless of whether the coercion succeeded or not) The original PEP 580 implementation tried to keep the config API simpler by always complaining, but that turned out to break the world (plenty of contexts where things get upset by unexpected output on stderr). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ncoghlan at gmail.com Sat Apr 6 22:49:10 2019 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 7 Apr 2019 12:49:10 +1000 Subject: [Python-Dev] New Python Initialization API In-Reply-To: References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: On Sun, 7 Apr 2019 at 12:45, Nick Coghlan wrote: > The original PEP 580 implementation tried to keep the config API > simpler by always complaining, but that turned out to break the world > (plenty of contexts where things get upset by unexpected output on > stderr). Err, PEP 538. No idea why my brain swapped in the wrong PEP number :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From storchaka at gmail.com Sun Apr 7 03:48:45 2019 From: storchaka at gmail.com (Serhiy Storchaka) Date: Sun, 7 Apr 2019 10:48:45 +0300 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: <5CA796B8.1000202@UGent.be> References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> <5CA796B8.1000202@UGent.be> Message-ID: 05.04.19 20:56, Jeroen Demeyer ????: > On 2019-04-05 19:53, Serhiy Storchaka wrote: >> At Python level we can monkeypatch __gt__, but not tp_richcompare. > > Sure, but you're planning to use C anyway so that's not really an argument. total_ordering monkeypatches the decorated class. I'm planning to implement in C methods that implement __gt__ in terms of __lt__ etc. From J.Demeyer at UGent.be Sun Apr 7 03:15:57 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Sun, 7 Apr 2019 09:15:57 +0200 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> <5CA796B8.1000202@UGent.be> Message-ID: <5CA9A3AD.1010300@UGent.be> On 2019-04-07 09:48, Serhiy Storchaka wrote: > total_ordering monkeypatches the decorated class. I'm planning to > implement in C methods that implement __gt__ in terms of __lt__ etc. Yes, I understood that. I'm just saying: if you want to make it fast, that's not the best solution. The fastest would be to implement tp_richcompare from scratch (instead of relying on slot_tp_richcompare dispatching to methods). 
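To illustrate what "implement tp_richcompare from scratch" means here, a rough sketch follows; my_lt() is an assumed C-level "less than" primitive, and the derivation rules below assume a total order within a single type, so this is illustrative only and not the real total_ordering logic:

    /* Assumed primitive: returns 1 if self < other, 0 if not, -1 on error. */
    static int my_lt(PyObject *self, PyObject *other);

    /* Everything is derived from the single primitive directly in
       tp_richcompare, so no slot_tp_richcompare / __gt__ lookup happens
       on each comparison. */
    static PyObject *
    ordered_richcompare(PyObject *self, PyObject *other, int op)
    {
        int lt = my_lt(self, other);
        if (lt < 0) {
            return NULL;
        }
        int gt = my_lt(other, self);
        if (gt < 0) {
            return NULL;
        }
        switch (op) {
        case Py_LT: return PyBool_FromLong(lt);
        case Py_GT: return PyBool_FromLong(gt);
        case Py_LE: return PyBool_FromLong(!gt);
        case Py_GE: return PyBool_FromLong(!lt);
        case Py_EQ: return PyBool_FromLong(!lt && !gt);
        case Py_NE: return PyBool_FromLong(lt || gt);
        }
        Py_RETURN_NOTIMPLEMENTED;
    }

A type would install this as its tp_richcompare slot, and __lt__, __gt__ and friends would then show up as the usual wrapper descriptors, which is the arrangement described above.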
From xdegaye at gmail.com Sun Apr 7 07:31:13 2019 From: xdegaye at gmail.com (Xavier de Gaye) Date: Sun, 7 Apr 2019 13:31:13 +0200 Subject: [Python-Dev] bedevere pipelines hang on github Message-ID: bedevere/issue-number and bedevere/news are not triggered for some reason at https://github.com/python/cpython/pull/12708 and hang forever with "Expected ? Waiting for status to be reported ". Xavier From lisandrosnik at gmail.com Sun Apr 7 07:40:32 2019 From: lisandrosnik at gmail.com (Lysandros Nikolaou) Date: Sun, 7 Apr 2019 13:40:32 +0200 Subject: [Python-Dev] bedevere pipelines hang on github In-Reply-To: References: Message-ID: There is an issue with bedevere at the moment. As described by Mariatta in https://github.com/python/bedevere/issues/162 it is still not clear, if this is our issue or GitHub's. I may have some time to look into it a bit later. On Sun, Apr 7, 2019 at 1:32 PM Xavier de Gaye wrote: > bedevere/issue-number and bedevere/news are not triggered for some > reason at https://github.com/python/cpython/pull/12708 and hang > forever with "Expected ? Waiting for status to be reported ". > > Xavier > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/lisandrosnik%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tir.karthi at gmail.com Sun Apr 7 23:38:16 2019 From: tir.karthi at gmail.com (Karthikeyan) Date: Mon, 8 Apr 2019 09:08:16 +0530 Subject: [Python-Dev] bedevere pipelines hang on github In-Reply-To: References: Message-ID: This seems to be fixed now : https://github.com/python/core-workflow/issues/321 Regards, Karthikeyan S -------------- next part -------------- An HTML attachment was scrubbed... URL: From xdegaye at gmail.com Mon Apr 8 06:09:08 2019 From: xdegaye at gmail.com (Xavier de Gaye) Date: Mon, 8 Apr 2019 12:09:08 +0200 Subject: [Python-Dev] bedevere pipelines hang on github In-Reply-To: References: Message-ID: Thanks. Xavier From robert.wd.white at gmail.com Mon Apr 8 11:08:40 2019 From: robert.wd.white at gmail.com (Robert White) Date: Mon, 8 Apr 2019 10:08:40 -0500 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: <5CA9A3AD.1010300@UGent.be> References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> <5CA796B8.1000202@UGent.be> <5CA9A3AD.1010300@UGent.be> Message-ID: So we're making pretty heavy use of PyInstanceMethod_New in our python binding library that we've written for a bunch of in house tools. If this isn't the best / correct way to go about adding methods to objects, what should we be using instead? On Sun, Apr 7, 2019 at 2:17 AM Jeroen Demeyer wrote: > On 2019-04-07 09:48, Serhiy Storchaka wrote: > > total_ordering monkeypatches the decorated class. I'm planning to > > implement in C methods that implement __gt__ in terms of __lt__ etc. > > Yes, I understood that. I'm just saying: if you want to make it fast, > that's not the best solution. The fastest would be to implement > tp_richcompare from scratch (instead of relying on slot_tp_richcompare > dispatching to methods). 
> _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/robert.wd.white%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From J.Demeyer at UGent.be Mon Apr 8 11:24:34 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Mon, 8 Apr 2019 17:24:34 +0200 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> <5CA796B8.1000202@UGent.be> <5CA9A3AD.1010300@UGent.be> Message-ID: <5CAB67B2.6010106@UGent.be> On 2019-04-08 17:08, Robert White wrote: > So we're making pretty heavy use of PyInstanceMethod_New in our python > binding library that we've written for a bunch of in house tools. > If this isn't the best / correct way to go about adding methods to > objects, what should we be using instead? First of all, the consensus in this thread is not to deprecate instancemethod. Well, it depends what you mean with "adding methods to objects", that's vaguely formulated. Do you mean adding methods at run-time (a.k.a. monkey-patching) to a pre-existing class? And is the process of adding methods done in C or in Python? Do you only need PyInstanceMethod_New() or also other PyInstanceMethod_XXX functions/macros? From robert.wd.white at gmail.com Mon Apr 8 11:45:24 2019 From: robert.wd.white at gmail.com (Robert White) Date: Mon, 8 Apr 2019 10:45:24 -0500 Subject: [Python-Dev] Deprecating "instance method" class In-Reply-To: <5CAB67B2.6010106@UGent.be> References: <5CA5EFCC.2030400@UGent.be> <5CA68BBC.8060205@canterbury.ac.nz> <5CA6F09D.3000900@UGent.be> <5CA73B95.6040509@UGent.be> <5CA74B19.70806@UGent.be> <5CA796B8.1000202@UGent.be> <5CA9A3AD.1010300@UGent.be> <5CAB67B2.6010106@UGent.be> Message-ID: Just PyInstanceMethod_New, and by "adding methods to objects" this is adding C functions to types defined in C. Only appears to be called at module import / creation time. On Mon, Apr 8, 2019 at 10:24 AM Jeroen Demeyer wrote: > On 2019-04-08 17:08, Robert White wrote: > > So we're making pretty heavy use of PyInstanceMethod_New in our python > > binding library that we've written for a bunch of in house tools. > > If this isn't the best / correct way to go about adding methods to > > objects, what should we be using instead? > > First of all, the consensus in this thread is not to deprecate > instancemethod. > > Well, it depends what you mean with "adding methods to objects", that's > vaguely formulated. Do you mean adding methods at run-time (a.k.a. > monkey-patching) to a pre-existing class? And is the process of adding > methods done in C or in Python? > > Do you only need PyInstanceMethod_New() or also other > PyInstanceMethod_XXX functions/macros? > -------------- next part -------------- An HTML attachment was scrubbed... URL: From vstinner at redhat.com Tue Apr 9 10:22:56 2019 From: vstinner at redhat.com (Victor Stinner) Date: Tue, 9 Apr 2019 16:22:56 +0200 Subject: [Python-Dev] Need help to fix HTTP Header Injection vulnerability Message-ID: Hi, In May 2017, user "Orange" found a vulnerability in the urllib fix for CVE-2016-5699 (HTTP Header Injection vulnerability): https://bugs.python.org/issue30458 It allows to inject arbitrary HTTP headers. 
Copy of their message:

"""
Hi, the patch in CVE-2016-5699 can be broke by an addition space.
http://www.cvedetails.com/cve/CVE-2016-5699/
https://hg.python.org/cpython/rev/bf3e1c9b80e9
https://hg.python.org/cpython/rev/1c45047c5102

import urllib, urllib2
urllib.urlopen('http://127.0.0.1\r\n\x20hihi\r\n :11211')
urllib2.urlopen('http://127.0.0.1\r\n\x20hihi\r\n :11211')
"""

Last month, the same bug was rediscovered by user "ragdoll.guo": https://bugs.python.org/issue36276

Almost one year after the bug was reported, no one has come up with a solution. I'm not comfortable with having known security issues impacting HTTP. Can someone please have a look at the issue and try to write a change to fix it?

According to Karthikeyan Singaravelan, the Go language fixed a similar issue in Go 1.12: it throws an error if the URL contains any control character.

If we decide that the issue is not a security issue, we should document the behavior properly and close the issue.

See also this related issue: "urlopen URL with unescaped space" https://bugs.python.org/issue14826

Victor -- Night gathers, and now my watch begins. It shall not end until my death.

From vstinner at redhat.com Tue Apr 9 12:25:25 2019 From: vstinner at redhat.com (Victor Stinner) Date: Tue, 9 Apr 2019 18:25:25 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build Message-ID: Hi, When Python is built in debug mode, PyObject gets 2 new fields: _ob_prev and _ob_next. These fields change the offset of the following fields in the PyObject structure and so break the ABI. I propose to modify the debug build (Py_DEBUG) to no longer imply Py_TRACE_REFS. Antoine Pitrou proposed this idea when the C API was discussed to get a stable ABI. https://bugs.python.org/issue36465 https://github.com/python/cpython/pull/12615 This change makes the debug build ABI closer to the release build ABI, but I am not sure how to compare these two ABIs. Technically, C extensions still need to be recompiled. What do you think? -- I also wrote a PR to remove all code related to Py_TRACE_REFS: https://github.com/python/cpython/pull/12614 I don't think that it's the right approach. I prefer to keep this special build around to see if anyone needs it, and wait one or two Python releases to decide what to do with it. Victor -- Night gathers, and now my watch begins. It shall not end until my death.

From steve.dower at python.org Tue Apr 9 16:16:29 2019 From: steve.dower at python.org (Steve Dower) Date: Tue, 9 Apr 2019 13:16:29 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: Message-ID: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> On 09Apr2019 0925, Victor Stinner wrote: > This change makes the debug build ABI closer to the release build ABI, > but I am not sure how to compare these two ABIs. Technically, C > extensions still need to be recompiled. > > What do you think? What are the other changes that would be required? And is there another way to get the same functionality without ABI modifications? I think it's worthwhile if we can really get to debug and non-debug builds being ABI compatible. Getting partway there in this case doesn't seem to offer any benefits.
Cheers, Steve From steve.dower at python.org Tue Apr 9 16:20:46 2019 From: steve.dower at python.org (Steve Dower) Date: Tue, 9 Apr 2019 13:20:46 -0700 Subject: [Python-Dev] New Python Initialization API In-Reply-To: References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: Thanks for the replies. Anything I don't comment on means that I agree with you :) On 05Apr2019 0900, Victor Stinner wrote: > Honestly, I'm not sure that we really have to distinguish "user error" and > "internal error". It's an old debate about calling abort()/DebugBreak() or > not. It seems like most users are annoyed by getting a coredump on Unix > when abort() is called. I'm also annoyed by the crash reports on Windows when "encodings" cannot be found (because occasionally there are enough of them that the Windows team starts reviewing the issue, and I get pulled in to review and resolve their bugs). > Maybe we should just remove Py_INIT_USER_ERR(), always use Py_INIT_ERR(), > and never call abort()/DebugBreak() in Py_ExitInitError(). Not calling abort() sounds fine to me. Embedders would likely prefer an error code rather than a crash, but IIRC they'd have to call Py_ExitInitError() to get the crash anyway, right? > Or does anyone see a good reason to trigger a debugger on an > initialization error? Only before returning from the point where the error occurs. By the time you've returned the error value all the useful context is gone. > Note: I'm not talking about Py_FatalError() here, this one will not > change. Does this get called as part of initialization? If not, I'm fine with it not changing. Cheers, Steve From steve.dower at python.org Tue Apr 9 16:39:59 2019 From: steve.dower at python.org (Steve Dower) Date: Tue, 9 Apr 2019 13:39:59 -0700 Subject: [Python-Dev] New Python Initialization API In-Reply-To: References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: On 05Apr2019 0912, Victor Stinner wrote: > About PyPreConfig and encodings. > [...] >>> * ``PyInitError Py_PreInitialize(const PyPreConfig *config)`` >>> * ``PyInitError Py_PreInitializeFromArgs( const PyPreConfig *config, >> int argc, char **argv)`` >>> * ``PyInitError Py_PreInitializeFromWideArgs( const PyPreConfig >> *config, int argc, wchar_t **argv)`` >> >> I hope to one day be able to support multiple runtimes per process - can >> we have an opaque PyRuntime object exposed publicly now and passed into >> these functions? > > I hesitated to include a "_PyRuntimeState*" parameter somewhere, but I > chose to not do so. > > Currently, there is a single global variable _PyRuntime which has the type > _PyRuntimeState. The _PyRuntime_Initialize() API is designed around this > global variable. For example, _PyRuntimeState contains the registry of > interpreters: you don't want to have multiple registries :-) > > I understood that we should only have a single instance of > _PyRuntimeState. So IMHO it's fine to keep it private at this point. > There is no need to expose it in the API. So I didn't want to expose that particular object right now, but just some sort of "void*" parameter in the new APIs (and require either NULL or a known value be passed). 
That gives us the freedom to enable multiple runtimes in the future without having to change the API shape. > FYI I tried to design an internal API with a "context" to pass > _PyRuntimeState, PyPreConfig, _PyConfig, the current interpreter, etc. > [...] > There are 2 possible implementations: > > * Modify *all* functions to add a new "context" parameter and modify *all* > functions to pass this parameter to sub-functions. > * Store the current "context" as a thread local variable or something like > that. > [...] > For the second option: well, there is no API change needed! > It can be done later. > Moreover, we already have such API! PyThreadState_Get() gets the Python > thread state of the current thread: the current interpreter can be > accessed from there. Yes, this is what I had in mind as a transition. I think eventually it would be best to have the context parameter, as thread-local variables have overhead and add significant complexity (particularly when debugging crashes), but making that change is huge. >>> ``PyPreConfig`` fields: >>> >>> * ``coerce_c_locale_warn``: if non-zero, emit a warning if the C locale >>> is coerced. >>> * ``coerce_c_locale``: if equals to 2, coerce the C locale; if equals to >>> 1, read the LC_CTYPE to decide if it should be coerced. >> >> Can we use another value for coerce_c_locale to determine whether to >> warn or not? Save a field. > > coerce_c_locale is already complex, it can have 4 values: -1, 0, 1 and 2. > I prefer keep a separated field. > > Moreover, I understood that you might want to coerce the C locale *and* > get the warning, or get the warning but *not* coerce the locale. If we define meaningful constants, then it doesn't matter how many values it has. We could have PY_COERCE_LOCALE_AND_WARN, PY_COERCE_LOCALE_SILENTLY, PY_WARN_WITHOUT_COERCE etc. to represent the states. These actually make things simpler than trying to reason about how two similar parameters interact. >>> * ``legacy_windows_fs_encoding`` (Windows only): if non-zero, set the >>> Python filesystem encoding to ``"mbcs"``. >>> * ``utf8_mode``: if non-zero, enable the UTF-8 mode >> >> Why not just set the encodings here? > > For different technical reasons, you simply cannot specify an encoding > name. You can also pass options to tell Python that you have some > preferences (PyPreConfig and PyConfig fields). > > Python doesn't support any encoding and encoding errors combinations. In > practice, it only supports a narrow set of choices. The main implementation are > Py_EncodeLocale() and Py_DecodeLocale() functions which uses the C codec > of the current locale encoding to implement the filesystem encoding, > before the codec implemented in Python can be used. > > Basically, only the current locale encoding or UTF-8 are supported. > If you want UTF-8, enable the UTF-8 Mode. If we already had a trivial way to specify the default encodings as a string before any initialization has occurred, I think we would have made UTF-8 mode enabled by setting them to "utf-8" rather than a brand new flag. Again, we either have a huge set of flags to infer certain values at certain times, or we can just make them directly settable. If we make them settable, it's much easier for users to reason about what is going to happen. > To load the Python codec, you need importlib. importlib needs to access > the filesystem which requires a codec to encode/decode file names > (PyConfig.module_search_paths uses Unicode wchar_t* strings, but the C API > only supports bytes char* strings). 
Right, and the few places where we need an encoding *before* we can load any arbitrary ones we can easily compare the strings and fail if someone's trying to do something unusual (or if the platform can do the lookup itself, it could succeed). If we say "passing NULL means use the default" then we have that handled, and the actual encoding just gets set to the real default once we figure out what that is. > Py_PreInitialize() doesn't set the filesystem encoding. It initializes the > LC_CTYPE locale and Python global configuration variables (Py_UTF8Mode and > Py_LegacyWindowsFSEncodingFlag). Right, I'm proposing a simplification here where it *does* set the filesystem encoding (even though it doesn't get used until Py_Initialize() is called). That way we can use the filesystem encoding to access the filesystem during initialization, provided it's one of the built-in supported ones (e.g. NULL, which means the C locale, or "utf-8" which means UTF-8) rather than relying on the tables in the standard library. Oh look, I said all this in my original email: >> Obviously we are not ready to import most encodings after pre >> initialization, but I think that's okay. Embedders who set something >> outside the range of what can be used without importing encodings will >> get an error to that effect if we try. > > You need a C implementation of the Python filesystem encoding very early > in Python initialization. You cannot start with one encoding and "later" > switch the encoding. I tried multiple times the last 10 years and I always > failed to do that. All attempts failed with mojibake at different > levels. Again, this is for embedders. Regular Python users will only ever request "NULL" or "utf-8", depending on the UTF-8 mode flag. And embedders have to make sure they get what they ask for and also can't change it later. The problems you've hit in the past have always been to do with trying to infer or guess the actual encoding, rather than simply letting someone tell you what it is (via config) and letting them deal with the failure. >> In fact, I'd be totally okay with letting embedders specify their own >> function pointer here to do encoding/decoding between Unicode and the OS >> preferred encoding. > > In my experience, when someone wants to get a specific encoding: they > only want UTF-8. There is now the UTF-8 Mode which ignores the locale > and forces the usage of UTF-8. Your experience here sounds like it's limited to POSIX systems. I've wanted UTF-16 before, and been able to provide it (if Python had allowed me to provide a callback to encode/decode). And again, all this is about "why do we need to define a boolean that determines what the encoding is when we can just let people tell us what encoding they want". There's a good chance that an embedded Python isn't going to touch the real filesystem anyway. > I'm not sure that there is a need to have a custom codec. Moreover, if > there an API to pass a codec in C, you will need to expose it somehow > at the Python level for os.fsencode() and os.fsdecode(). We need to expose those operations anyway, and os.fsencode/fsdecode have their own issues (particularly since there *are* ways to change filesystem encoding while running). Turning them into actual native functions that might call out to a host-provided callback would not be difficult. > Currently, Python ensures during early stage of startup that > codecs.lookup(sys.getfilesystemencoding()) works: there is a existing > Python codec for the requested filesystem encoding. 
Right, it's a validation step. But we can also make codecs.lookup("whatever the file system encoding is") return something based on os.fsencode() and os.fsdecode(). We're not actually beholden to the current implementations here - we are allowed to change them! ;) From steve.dower at python.org Tue Apr 9 16:44:15 2019 From: steve.dower at python.org (Steve Dower) Date: Tue, 9 Apr 2019 13:44:15 -0700 Subject: [Python-Dev] New Python Initialization API In-Reply-To: References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: On 05Apr2019 0922, Victor Stinner wrote: > While there are supporters of an "isolated Python" (sometimes called > "system python"), the fact that it doesn't exist in any Linux distribution > nor on any other operating system (Windows, macOS, FreeBSD), whereas it's > already doable in Python 3.6 with Py_IsolatedFlag=1 makes me think that > users like the ability to control Python with environment variables and > configuration files. > > I would prefer to leave Python as not isolated by default. It's just a > matter of command line arguments. Not for embedders it isn't. When embedding you need to do a whole lot of special things to make sure that your private version of Python doesn't pick up settings relating to a regular Python install. We should make the Python runtime isolated by default, and only (automatically) pick up settings from the environment in the Python binary. >>> * The PEP 432 stores ``PYTHONCASEOK`` into the config. Do we need >>> to add something for that into ``PyConfig``? How would it be exposed >>> at the Python level for ``importlib``? Passed as an argument to >>> ``importlib._bootstrap._setup()`` maybe? It can be added later if >>> needed. >> >> Could we convert it into an xoption? It's very rarely used, to my knowledge. > > The first question is if there is any need for an embedder to change > this option. Currently, importlib._bootstrap_external._install() reads > the environment variable and it's the only way to control the option. I think the first question should be "is there any reason to prevent an embedder from changing this option". In general, the answer is going to be no. We should expose all the options we rely on to embedders, or else they're going to have to find workarounds. > ... By the way, importlib reads PYTHONCASEOK environment varaible even > if isolated mode is enabled (sys.flags.isolated is equal to 1). Is it > a bug? :-) Yes, I think it's a bug. Perhaps this should become a proper configuration option, rather than a directly-read environment variable? From steve.dower at python.org Tue Apr 9 16:51:03 2019 From: steve.dower at python.org (Steve Dower) Date: Tue, 9 Apr 2019 13:51:03 -0700 Subject: [Python-Dev] New Python Initialization API In-Reply-To: References: <70c245c7-8f40-a51b-934b-af958f7cd849@python.org> <119e26f0-d826-7e13-aa4a-e0a67bea3265@python.org> <91eec784-28ab-024f-42a6-8a1e5d37d9bf@python.org> <6a17e990-7e2a-0544-1e8e-9b16d05e4df6@python.org> Message-ID: On 05Apr2019 0936, Victor Stinner wrote: > For the PyMainConfig structure idea, I cannot comment at this point. I > need more time to think about it. 
> > > About the "path configuration" fields, maybe a first step to enhance > the API would be to add the the following function: > > PyInitError PyConfig_ComputePath(PyConfig *config, const wchar *home); > > where home can be NULL (and PyConfig.module_search_paths_env field > goes away: the function reads PYTHONPATH env var internally). Yes, I like this. Maybe pass PYTHONPATH value in as an "additional paths" parameter? Basically, this function would be the replacement for "Py_GetPath()" (which initializes paths to the defaults the first time it is called), and setting the path fields in PyConfig manually is the replacement for Py_SetPath() (or calling the various Py_Set*() functions to make the default logic infer the paths you want). Similarly, PyConfig_ComputeFromArgv() and/or PyConfig_ComputeFromEnviron() functions would also directly replace the magic we have scattered all over the place right now. It would also make it more obvious to the callers which values take precedence, and easier to see that there should be no side effects. I think it's easier to document as well. Cheers, Steve From tir.karthi at gmail.com Tue Apr 9 19:45:06 2019 From: tir.karthi at gmail.com (Karthikeyan) Date: Wed, 10 Apr 2019 05:15:06 +0530 Subject: [Python-Dev] Need help to fix HTTP Header Injection vulnerability In-Reply-To: References: Message-ID: I would recommend fixing it since it's potentially remote code execution on systems like Redis (latest versions of Redis have this mitigated) though I must admit I don't fully understand the complexity since there are multiple issues linked. Go was also assigned a CVE for linked issue and it seemed to be the same reporter by username : CVE-2019-9741 . I tried using go's approach in the commit but urlopen accepts more URLs like data URLs [0] that seemed to accept \n as a valid case and the patch broke some tests. Looking at the issue discussion complexity also involves backwards compatibility. golang also pushed an initial fix that seemed to broke their internal tests [0] to arrive at a more simpler fix. [0] https://github.com/python/cpython/blob/a40681dd5db8deaf05a635eecb91498dac882aa4/Lib/test/test_urllib.py#L482 [1] https://go-review.googlesource.com/c/go/+/159157/2#message-39c6be13a192bf760f6318ac641b432a6ab8fdc8 -- Regards, Karthikeyan S -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg at krypto.org Tue Apr 9 20:41:27 2019 From: greg at krypto.org (Gregory P. Smith) Date: Tue, 9 Apr 2019 17:41:27 -0700 Subject: [Python-Dev] Need help to fix HTTP Header Injection vulnerability In-Reply-To: References: Message-ID: On Tue, Apr 9, 2019 at 4:45 PM Karthikeyan wrote: > I would recommend fixing it since it's potentially remote code execution > on systems like Redis (latest versions of Redis have this mitigated) though > I must admit I don't fully understand the complexity since there are > multiple issues linked. Go was also assigned a CVE for linked issue and it > seemed to be the same reporter by username : CVE-2019-9741 . I tried using > go's approach in the commit but urlopen accepts more URLs like data URLs > [0] that seemed to accept \n as a valid case and the patch broke some > tests. Looking at the issue discussion complexity also involves backwards > compatibility. golang also pushed an initial fix that seemed to broke their > internal tests [0] to arrive at a more simpler fix. 
> > [0] > https://github.com/python/cpython/blob/a40681dd5db8deaf05a635eecb91498dac882aa4/Lib/test/test_urllib.py#L482 > [1] > https://go-review.googlesource.com/c/go/+/159157/2#message-39c6be13a192bf760f6318ac641b432a6ab8fdc8 > > -- > Regards, > Karthikeyan S > useful references, thanks! limiting the checks to only http and https as those are the text based protocols with urls transmitted in text form makes sense and avoids the data: test failures. proposed simple fix in https://github.com/python/cpython/pull/12755 but tests are needed as is an audit of the code to see where else we may potentially need to do such things. -gps -------------- next part -------------- An HTML attachment was scrubbed... URL: From tir.karthi at gmail.com Wed Apr 10 00:30:59 2019 From: tir.karthi at gmail.com (Karthikeyan) Date: Wed, 10 Apr 2019 10:00:59 +0530 Subject: [Python-Dev] Need help to fix HTTP Header Injection vulnerability In-Reply-To: References: Message-ID: Thanks Gregory. I think it's a good tradeoff to ensure this validation only for URLs of http scheme. I also agree handling newline is little problematic over the years and the discussion over the level at which validation should occur also prolongs some of the patches. https://bugs.python.org/issue35906 is another similar case where splitlines is used but it's better to raise an error and the proposed fix could be used there too. Victor seemed to wrote a similar PR like linked one for other urllib functions only to fix similar attack in ftplib to reject newlines that was eventually fixed only in ftplib * https://bugs.python.org/issue30713 * https://bugs.python.org/issue29606 Search also brings multiple issues with one duplicate over another that makes these attacks scattered over the tracker and some edge case missing. Slightly off topic, the last time I reported a cookie related issue where the policy can be overriden by third party library I was asked to fix it in stdlib itself since adding fixes to libraries causes maintenance burden to downstream libraries to keep up upstream. With urllib being a heavily used module across ecosystem it's good to have a fix landing in stdlib that secures downstream libraries encouraging users to upgrade Python too. Regards, Karthikeyan S > -------------- next part -------------- An HTML attachment was scrubbed... URL: From wes.turner at gmail.com Wed Apr 10 06:16:10 2019 From: wes.turner at gmail.com (Wes Turner) Date: Wed, 10 Apr 2019 06:16:10 -0400 Subject: [Python-Dev] Need help to fix HTTP Header Injection vulnerability In-Reply-To: References: Message-ID: 1. Is there a library of URL / Header injection tests e.g. for fuzzing that we could generate additional test cases with or from? 2. Are requests.get() and requests.post() also vulnerable? 3. Despite the much-heralded UNIX pipe protocols' utility, filenames containing newlines (the de-facto line record delimiter) are possible: "file"$'\n'"name" Should filenames containing newlines and control characters require a kwarg to be non-None in order to be passed through unescaped to the HTTP request? On Wednesday, April 10, 2019, Karthikeyan wrote: > Thanks Gregory. I think it's a good tradeoff to ensure this validation > only for URLs of http scheme. > > I also agree handling newline is little problematic over the years and the > discussion over the level at which validation should occur also prolongs > some of the patches. 
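To make the attack concrete for anyone skimming the thread: whatever reaches the request line unescaped is copied straight into the HTTP payload, so embedded CR/LF becomes attacker-chosen header lines. Below is a small self-contained sketch of the injection and of the style of control-character check being discussed; it is an illustration only, not the code from the linked PR:

    # What the injection looks like once the request line is formatted.
    malicious_path = "/search?q=x HTTP/1.1\r\nX-Injected: evil\r\nIgnore:"
    request = "GET %s HTTP/1.1\r\nHost: example.com\r\n\r\n" % malicious_path
    print(request)   # the payload now contains an X-Injected header line

    # The kind of check being discussed (similar in spirit to the Go fix):
    # refuse to send anything that still contains ASCII control characters.
    def check_url_part(part):
        if any(ord(ch) < 0x20 or ord(ch) == 0x7f for ch in part):
            raise ValueError("control character in URL: %r" % part)
        return part

    check_url_part("/search?q=x")        # passes
    # check_url_part(malicious_path)     # would raise ValueError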
https://bugs.python.org/issue35906 is another > similar case where splitlines is used but it's better to raise an error and > the proposed fix could be used there too. Victor seemed to wrote a similar > PR like linked one for other urllib functions only to fix similar attack in > ftplib to reject newlines that was eventually fixed only in ftplib > > * https://bugs.python.org/issue30713 > * https://bugs.python.org/issue29606 > > Search also brings multiple issues with one duplicate over another that > makes these attacks scattered over the tracker and some edge case missing. > Slightly off topic, the last time I reported a cookie related issue where > the policy can be overriden by third party library I was asked to fix it in > stdlib itself since adding fixes to libraries causes maintenance burden to > downstream libraries to keep up upstream. With urllib being a heavily used > module across ecosystem it's good to have a fix landing in stdlib that > secures downstream libraries encouraging users to upgrade Python too. > > Regards, > Karthikeyan S > >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From vstinner at redhat.com Wed Apr 10 06:51:50 2019 From: vstinner at redhat.com (Victor Stinner) Date: Wed, 10 Apr 2019 12:51:50 +0200 Subject: [Python-Dev] Need help to fix HTTP Header Injection vulnerability In-Reply-To: References: Message-ID: Hi, I dig into Python code history and the bug tracker. I would like to say that this issue is a work-in-progress since 2004. Different fixes have been pushed, but there are *A LOT* of open issues: https://bugs.python.org/issue30458#msg339846 I would suggest to discuss on https://bugs.python.org/issue30458 rather than here, just to avoid to duplicate discussions ;-) Note: the whole class of issue (HTTP Header Injection) got at least 3 CVE: CVE-2016-5699, CVE-2019-9740, CVE-2019-9947. I changed bpo-30458 title to "[security][CVE-2019-9740][CVE-2019-9947] HTTP Header Injection (follow-up of CVE-2016-5699)". Victor Le mer. 10 avr. 2019 ? 12:20, Wes Turner a ?crit : > > 1. Is there a library of URL / Header injection tests e.g. for fuzzing that we could generate additional test cases with or from? > > 2. Are requests.get() and requests.post() also vulnerable? > > 3. Despite the much-heralded UNIX pipe protocols' utility, filenames containing newlines (the de-facto line record delimiter) are possible: "file"$'\n'"name" > > Should filenames containing newlines and control characters require a kwarg to be non-None in order to be passed through unescaped to the HTTP request? > > On Wednesday, April 10, 2019, Karthikeyan wrote: >> >> Thanks Gregory. I think it's a good tradeoff to ensure this validation only for URLs of http scheme. >> >> I also agree handling newline is little problematic over the years and the discussion over the level at which validation should occur also prolongs some of the patches. https://bugs.python.org/issue35906 is another similar case where splitlines is used but it's better to raise an error and the proposed fix could be used there too. Victor seemed to wrote a similar PR like linked one for other urllib functions only to fix similar attack in ftplib to reject newlines that was eventually fixed only in ftplib >> >> * https://bugs.python.org/issue30713 >> * https://bugs.python.org/issue29606 >> >> Search also brings multiple issues with one duplicate over another that makes these attacks scattered over the tracker and some edge case missing. 
Slightly off topic, the last time I reported a cookie related issue where the policy can be overriden by third party library I was asked to fix it in stdlib itself since adding fixes to libraries causes maintenance burden to downstream libraries to keep up upstream. With urllib being a heavily used module across ecosystem it's good to have a fix landing in stdlib that secures downstream libraries encouraging users to upgrade Python too. >> >> Regards, >> Karthikeyan S > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/vstinner%40redhat.com -- Night gathers, and now my watch begins. It shall not end until my death. From vstinner at redhat.com Wed Apr 10 07:01:42 2019 From: vstinner at redhat.com (Victor Stinner) Date: Wed, 10 Apr 2019 13:01:42 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: Le mar. 9 avr. 2019 ? 22:16, Steve Dower a ?crit : > What are the other changes that would be required? I don't know. > And is there another > way to get the same functionality without ABI modifications? Py_TRACE_REFS is a double linked list of *all* Python objects. To get this functionality, you need to store the list somewhere. I don't know how to maintain such list outside the PyObject structure. One solution would be to enable Py_TRACE_REFS in release mode. Does anyone want to add 16 bytes to every PyObject? I don't want that :-) > I think it's worthwhile if we can really get to debug and non-debug > builds being ABI compatible. Getting partway there in this case doesn't > seem to offer any benefits. Disabling Py_TRACE_REFS by default in debug mode reduces the Python memory footprint. Py_TRACE_REFS costs 2 pointers per PyObject: 16 bytes on 64-bit platforms. I don't think that I ever used sys.getobjects(), whereas many projects use gc.get_objects() which is also available in release builds (not only in debug builds). I'm quite sure that almost nobody uses debug builds because the ABI is incompatible. The main question is if anyone ever used Py_TRACE_REFS? Does someone use sys.getobjects() or PYTHONDUMPREFS environment variable? Using PYTHONDUMPREFS=1 on a debug build (with Py_TRACE_REFS) does simply crash Python 3.7 at exit. So I don't think that anyone use it :-) I wrote PR 12614 to remove all code related to Py_TRACE_REFS. I wrote it to see which code depends on it: commit 63509498761a0e7f72585a8cd7df325ea2abd1b2 (HEAD -> remove_trace_refs, origin/remove_trace_refs) Author: Victor Stinner Date: Thu Mar 28 23:26:58 2019 +0100 WIP: bpo-36465: Remove Py_TRACE_REFS special build Remove _ob_prev and _ob_next fields of PyObject when Python is compiled in debug mode to make debug ABI closer to the release ABI. Remove: * sys.getobjects() * PYTHONDUMPREFS environment variable * _PyCoreConfig.dump_refs * PyObject._ob_prev and PyObject._ob_next fields * _PyObject_HEAD_EXTRA and _PyObject_EXTRA_INIT macros * _Py_AddToAllObjects() * _Py_PrintReferenceAddresses() * _Py_PrintReferences() Victor -- Night gathers, and now my watch begins. It shall not end until my death. 
From tir.karthi at gmail.com Wed Apr 10 07:07:03 2019 From: tir.karthi at gmail.com (Karthikeyan) Date: Wed, 10 Apr 2019 16:37:03 +0530 Subject: [Python-Dev] Need help to fix HTTP Header Injection vulnerability In-Reply-To: References: Message-ID: > 1. Is there a library of URL / Header injection tests e.g. for fuzzing > that we could generate additional test cases with or from? https://github.com/swisskyrepo/PayloadsAllTheThings seems to contain payload related stuff but not sure how useful it is for URL parsing. > > 2. Are requests.get() and requests.post() also vulnerable? > urllib3 seems to be vulnerable as noted in https://bugs.python.org/issue36276#msg337837 . requests uses urllib3 under the hood. The last time I checked requests passed encoded URL to urllib3 where this doesn't seem to be exploitable but I could be wrong. -- Regards, Karthikeyan S -------------- next part -------------- An HTML attachment was scrubbed... URL: From aranea.network at gmail.com Wed Apr 10 07:24:40 2019 From: aranea.network at gmail.com (Robert Okadar) Date: Wed, 10 Apr 2019 13:24:40 +0200 Subject: [Python-Dev] (no subject) Message-ID: Hi community, I have developed a tkinter GUI component, Python v3.7. It runs very well in Linux but seeing a huge performance impact in Windows 10. While in Linux an almost real-time performance is achieved, in Windows it is slow to an unusable level. The code is somewhat stripped down from the original, but the performance difference is the same anyway. The columns can be resized by clicking on the column border and dragging it. Resizing works only for the top row (but it resizes the entire column). In this demo, all bindings are avoided to exclude influence on the component performance and thus not included. If you resize the window (i.e., if you maximize it), you must call the function table.fit() from IDLE shell. Does anyone know where is this huge difference in performance coming from? Can anything be done about it? All the best, -- Robert Okadar IT Consultant Schedule an *online meeting * with me! Visit *aranea-mreze.hr* or call * +385 91 300 8887* -------------- next part -------------- An HTML attachment was scrubbed... 
URL: -------------- next part -------------- import tkinter class Resizer(tkinter.Frame): def __init__(self, info_grid, master, **cnf): self.table_grid = info_grid tkinter.Frame.__init__(self, master, **cnf) self.bind('', self.resize_column) self.bind('', self.resize_start) self.bind('', self.resize_end) self._resizing = False self.bind('', self.onDestroyEvent) def onDestroyEvent(self, event): self.table_grid = [] def resize_column(self, event, width = None): #if self._resizing: top = self.table_grid.Top grid = self.table_grid._grid col = self.master.grid_info()["column"] if not width: width = self._width + event.x_root - self._x_root top.columnconfigure(col, minsize = width) grid.columnconfigure(col, minsize = width) def resize_start(self, event): top = self.table_grid.Top self._resizing = True self._x_root = event.x_root col = self.master.grid_info()["column"] self._width = top.grid_bbox(row = 0, column = col)[2] #print event.__dict__ col = self.master.grid_info()["column"] #print top.grid_bbox(row = 0, column = col) def resize_end(self, event): pass #self.table_grid.xscrollcommand() #self.table_grid.column_resize_callback(col, self) class Table(tkinter.Frame): def __init__(self, master, columns = 10, rows = 20, width = 100,**kw): tkinter.Frame.__init__(self, master, **kw) self.columns = [] self._width = width self._grid = grid = tkinter.Frame(self, bg = "#CCCCCC") self.Top = top = tkinter.Frame(self, bg = "#DDDDDD") self.create_top(columns) self.create_grid(rows) #self.bind('', self.on_table_configure) #self.bind('', self.on_table_map) top.pack(anchor = 'nw')#, expand = 1, fill = "both") grid.pack(anchor = 'nw')#fill = "both",expand = 1 def on_table_map(self, event): theight = self.winfo_height() def fit(self):#on_table_configure(self, event): i = 0 for frame in self.Top.grid_slaves(row = 0): frame.resizer.resize_column(None, width = frame.winfo_width()) i += 1 theight = self.winfo_height() fheight = self._grid.winfo_height() + self.Top.winfo_height() #print('', theight, fheight) if theight > fheight: rheight = self.grid_array[0][0].winfo_height() ammount = int((-fheight + theight) / rheight) #print(rheight, ammount) for i in range(ammount): self.add_row() self.update() def add_row(self, ammount = 1): columnsw = self.columns row = [] i = len(self.grid_array) for j in range(len(columnsw)): bg = self.bgcolor0 if i % 2 == 1: bg = self.bgcolor1 entry = tkinter.Label(self._grid, bg = bg, text = '%i %i' % (i, j)) entry.grid(row = i, column = j, sticky = "we", padx = 2) row.append(entry) self.grid_array.append(row) bgcolor0 = "#FFFFFF" bgcolor1 = "#EEEEEE" def create_grid(self, height): #grid.grid(row = 0, column = 0, sticky = "nsew") columnsw = self.columns# = self.Top.grid_slaves(row = 1) self.grid_array = [] for i in range(height): row = [] for j in range(len(columnsw)): bg = self.bgcolor0 if i % 2 == 1: bg = self.bgcolor1 #entry = self.EntryClass(False, self, self._grid, bg = bg, width = 1, ) entry = tkinter.Label(self._grid, bg = bg, text = '%i %i' % (i,j)) entry.grid(row = i, column = j, sticky = "we", padx = 2) row.append(entry) self.grid_array.append(row) def create_top(self, columns = 10): top = self.Top #columns = self._columns #maybe to rename for i in range(columns): name = 'column %i' % i self.add_column(name, top) def add_column(self, name, top, width = None): if not width: width = self._width col = tkinter.Frame(top) i = len(self.columns) #filter = Filter(self, name, i, top) entry = tkinter.Entry(col, width = 1) #readonlybackground #col.ColumnIndex = i #col.array_index = i 
entry.insert(0, name) resizer = Resizer(self, col, bg = "#000000", width = 3, height = 21, cursor = 'sb_h_double_arrow') #entry.grid(row = 0, column = 0, sticky = "we") #resizer.grid(row = 0, column = 1, sticky = "e") entry.pack(side = "left", fill = "both", expand = 1) resizer.pack(side = "right") top.columnconfigure(i, minsize = width, weight = 1) #filter.grid(row = 0, column = i, sticky = "we") col.grid(row = 0, column = i, sticky = "we") col.entry = entry #col.filter = filter #col.index = filter.index = i col.resizer = resizer #filter.Column = col entry.Column = col self.columns.append(col) if __name__ == '__main__': columns = 30 rows = 20 width = 60 root = tkinter.Tk() root.wm_title('TableGridTest') table = self = Table(root, columns = columns, rows = rows, width = width) table.pack(expand = 1, fill = 'both') #table.create_grid( From steve at pearwood.info Wed Apr 10 11:35:56 2019 From: steve at pearwood.info (Steven D'Aprano) Date: Thu, 11 Apr 2019 01:35:56 +1000 Subject: [Python-Dev] (no subject) In-Reply-To: References: Message-ID: <20190410153555.GB3010@ando.pearwood.info> Hi Robert, This mailing list is for the development of the Python interpreter, not a general help desk. There are many other forums where you can ask for help, such as the comp.lang.python newsgroup, Stackoverflow, /r/python on Reddit, the IRC channel, and more. Perhaps you can help us though, I presume you signed up to this mailing list via the web interface at https://mail.python.org/mailman/listinfo/python-dev Is there something we could do to make it more clear that this is not the right place to ask for help? -- Steven From pviktori at redhat.com Wed Apr 10 12:25:57 2019 From: pviktori at redhat.com (Petr Viktorin) Date: Wed, 10 Apr 2019 18:25:57 +0200 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> Message-ID: <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> Hello! I've had time for a more thorough reading of PEP 590 and the reference implementation. Thank you for the work! Overall, I like PEP 590's direction. I'd now describe the fundamental difference between PEP 580 and PEP 590 as: - PEP 580 tries to optimize all existing calling conventions - PEP 590 tries to optimize (and expose) the most general calling convention (i.e. fastcall) PEP 580 also does a number of other things, as listed in PEP 579. But I think PEP 590 does not block future PEPs for the other items. On the other hand, PEP 580 has a much more mature implementation -- and that's where it picked up real-world complexity. PEP 590's METH_VECTORCALL is designed to handle all existing use cases, rather than mirroring the existing METH_* varieties. But both PEPs require the callable's code to be modified, so requiring it to switch calling conventions shouldn't be a problem. Jeroen's analysis from https://mail.python.org/pipermail/python-dev/2018-July/154238.html seems to miss a step at the top: a. CALL_FUNCTION* / CALL_METHOD opcode calls b. _PyObject_FastCallKeywords() which calls c. _PyCFunction_FastCallKeywords() which calls d. _PyMethodDef_RawFastCallKeywords() which calls e. the actual C function (*ml_meth)() I think it's more useful to say that both PEPs bridge a->e (via _Py_VectorCall or PyCCall_Call). PEP 590 is built on a simple idea, formalizing fastcall. 
But it is complicated by PY_VECTORCALL_ARGUMENTS_OFFSET and Py_TPFLAGS_METHOD_DESCRIPTOR. As far as I understand, both are there to avoid intermediate bound-method object for LOAD_METHOD/CALL_METHOD. (They do try to be general, but I don't see any other use case.) Is that right? (I'm running out of time today, but I'll write more on why I'm asking, and on the case I called "impossible" (while avoiding creation of a "bound method" object), later.) The way `const` is handled in the function signatures strikes me as too fragile for public API. I'd like if, as much as possible, PY_VECTORCALL_ARGUMENTS_OFFSET was treated as a special optimization that extension authors can either opt in to, or blissfully ignore. That might mean: - vectorcall, PyObject_VectorCallWithCallable, PyObject_VectorCall, PyCall_MakeTpCall all formally take "PyObject *const *args" - a na?ve callee must do "nargs &= ~PY_VECTORCALL_ARGUMENTS_OFFSET" (maybe spelled as "nargs &= PY_VECTORCALL_NARGS_MASK"), but otherwise writes compiler-enforced const-correct code. - if PY_VECTORCALL_ARGUMENTS_OFFSET is set, the callee may modify "args[-1]" (and only that, and after the author has read the docs). Another point I'd like some discussion on is that vectorcall function pointer is per-instance. It looks this is only useful for type objects, but it will add a pointer to every new-style callable object (including functions). That seems wasteful. Why not have a per-type pointer, and for types that need it (like PyTypeObject), make it dispatch to an instance-specific function? Minor things: - "Continued prohibition of callable classes as base classes" -- this section reads as a final. Would you be OK wording this as something other PEPs can tackle? - "PyObject_VectorCall" -- this looks extraneous, and the reference imlementation doesn't need it so far. Can it be removed, or justified? - METH_VECTORCALL is *not* strictly "equivalent to the currently undocumented METH_FASTCALL | METH_KEYWORD flags" (it has the ARGUMENTS_OFFSET complication). - I'd like to officially call this PEP "Vectorcall", see https://github.com/python/peps/pull/984 Mark, what are your plans for next steps with PEP 590? If a volunteer wanted to help you push this forward, what would be the best thing to work on? Jeroen, is there something in PEPs 579/580 that PEP 590 blocks, or should address? From vano at mail.mipt.ru Wed Apr 10 13:57:13 2019 From: vano at mail.mipt.ru (Ivan Pozdeev) Date: Wed, 10 Apr 2019 20:57:13 +0300 Subject: [Python-Dev] Need help to fix HTTP Header Injection vulnerability In-Reply-To: References: Message-ID: <8bd61b63-4fc7-91be-b717-226c205e3623@mail.mipt.ru> On 10.04.2019 7:30, Karthikeyan wrote: > Thanks Gregory. I think it's a good tradeoff to ensure this validation only for URLs of http scheme. > > I also agree handling newline is little problematic over the years and the discussion over the level at which validation should occur also > prolongs some of the patches. https://bugs.python.org/issue35906 is another similar case where splitlines is used but it's better to raise > an error and the proposed fix could be used there too. Victor seemed to wrote a similar PR like linked one for other urllib functions only > to fix similar attack in ftplib to reject newlines that was eventually fixed only in ftplib > > * https://bugs.python.org/issue30713 > * https://bugs.python.org/issue29606 > > Search also brings multiple issues with one duplicate over another that makes these attacks scattered over the tracker and some edge case > missing. 
Slightly off topic, the last time I reported a cookie related issue where the policy can be overriden by third party library I > was asked to fix it in stdlib itself since adding fixes to libraries causes maintenance burden to downstream libraries to keep up > upstream. With urllib being a heavily used module across ecosystem it's good to have a fix landing in stdlib that secures downstream > libraries encouraging users to upgrade Python too. > Validation should occur whenever user data crosses a trust boundary -- i.e. when the library starts to assume that an extracted chunk now contains something valid. https://tools.ietf.org/html/rfc3986 defines valid syntax (incl. valid characters) for every part of a URL -- _of any scheme_ (FYI, \r\n are invalid everywhere and the test code for ??? `data:' that Karthikeyan referred to is raw data to compare to rather than a part of a URL). It also obsoletes all the RFCs that the current code is written against. AFAICS, urllib.split* fns (obsoleted as public in 3.8) are used by both urllib and urllib2 to parse URLs. They can be made to each validate the chunk that they split off. urlparse can validate the entire URL altogether. Also, all modules ought to use the same code (urlparse looks like the best candidate) to parse URLs -- this will minimize the attack surface. I think I can look into this later this week. Fixing this is going to break code that relies on the current code accepting invalid URLs. But the docs have never said that e.g. in urlopen, anything apart from a (valid) URL is accepted (in particular, this implies that the user is responsible for escaping stuff properly before passing it). So I would say that we are within our right here and whoever is relying on those quirks is and has always been on unsupported territory. Determining which of those quirks are exploitable and which are not to fix just the former is an incomparably larger, more error-prone and avoidable work. If anything, the history of the issue referenced to by previous posters clearly shows that this is too much to ask from the Python team. I also see other undocumented behavior like accepting '>' (also obsoleted as public in 3.8) which I would like to but it's of no harm. -- Regards, Ivan -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve.dower at python.org Wed Apr 10 14:09:49 2019 From: steve.dower at python.org (Steve Dower) Date: Wed, 10 Apr 2019 11:09:49 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: On 10Apr2019 0401, Victor Stinner wrote: > Le mar. 9 avr. 2019 ? 22:16, Steve Dower a ?crit : >> What are the other changes that would be required? > > I don't know. > >> And is there another >> way to get the same functionality without ABI modifications? > > Py_TRACE_REFS is a double linked list of *all* Python objects. To get > this functionality, you need to store the list somewhere. I don't know > how to maintain such list outside the PyObject structure. There's certainly no more convenient way to do it. Maybe if we had detached reference counts it would be easier, but it would likely still result in ABI compatibility issues between debug builds of extensions and release builds of Python (the most common scenario, in my experience). > One solution would be to enable Py_TRACE_REFS in release mode. Does > anyone want to add 16 bytes to every PyObject? 
I don't want that :-) Yeah, nobody suggested that anyway :) >> I think it's worthwhile if we can really get to debug and non-debug >> builds being ABI compatible. Getting partway there in this case doesn't >> seem to offer any benefits. > > Disabling Py_TRACE_REFS by default in debug mode reduces the Python > memory footprint. Py_TRACE_REFS costs 2 pointers per PyObject: 16 > bytes on 64-bit platforms. Right, except it's debug mode. > I don't think that I ever used sys.getobjects(), whereas many projects > use gc.get_objects() which is also available in release builds (not > only in debug builds). > > I'm quite sure that almost nobody uses debug builds because the ABI is > incompatible. There were just over 250,000 downloads of the prebuilt debug binaries for Windows (which are optional in the installer and turned off by default) in March. Whether they are being used is another question, but I know for sure at least a few people who use them. When you want to use a debug build of your extension module, using a debug build of CPython is the only way to do it. So unless we can get rid of *all* the ABI incompatibilities, a debug build of CPython is still going to be necessary and disabling/removing reference tracking doesn't provide any benefit. > The main question is if anyone ever used Py_TRACE_REFS? Does someone > use sys.getobjects() or PYTHONDUMPREFS environment variable? > > Using PYTHONDUMPREFS=1 on a debug build (with Py_TRACE_REFS) does > simply crash Python 3.7 at exit. So I don't think that anyone use it > :-) How do we track reference leaks in the buildbots? Can/should we be using this? It doesn't crash on Python 3.8, so I suspect fixing the bug is a better option than using it as an excuse to remove the feature. From a quick test, it seems that a tuple element is being freed but not removed from the tuple, so it's probably a double-decref bug somewhere in 3.7. Cheers, Steve From steve.dower at python.org Wed Apr 10 14:45:51 2019 From: steve.dower at python.org (Steve Dower) Date: Wed, 10 Apr 2019 11:45:51 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: <2778f0f5-3a29-54da-8292-dfc23a0c6621@python.org> On 10Apr2019 1109, Steve Dower wrote: > On 10Apr2019 0401, Victor Stinner wrote: >>> I think it's worthwhile if we can really get to debug and non-debug >>> builds being ABI compatible. Getting partway there in this case doesn't >>> seem to offer any benefits. >> >> Disabling Py_TRACE_REFS by default in debug mode reduces the Python >> memory footprint. Py_TRACE_REFS costs 2 pointers per PyObject: 16 >> bytes on 64-bit platforms. > > Right, except it's debug mode. I left this comment unfinished :) It's debug mode, and so you should expect less efficient memory and CPU usage. That's why we have two modes - so that it's easier to debug issues. Now, if debug mode was unusably slow or had way too much overhead, we'd want to fix that. But it isn't unusable, so reducing memory usage at the cost of making debugging harder is not compelling. 
Cheers, Steve From guido at python.org Wed Apr 10 15:07:50 2019 From: guido at python.org (Guido van Rossum) Date: Wed, 10 Apr 2019 12:07:50 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: <2778f0f5-3a29-54da-8292-dfc23a0c6621@python.org> References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <2778f0f5-3a29-54da-8292-dfc23a0c6621@python.org> Message-ID: I recall finding memory leaks using this. (E.g. I remember a leak in Zope due to a cache that was never pruned.) But presumably gc.get_objects() would have been sufficient. (IIRC it didn't exist at the time.) On Wed, Apr 10, 2019 at 11:48 AM Steve Dower wrote: > On 10Apr2019 1109, Steve Dower wrote: > > On 10Apr2019 0401, Victor Stinner wrote: > >>> I think it's worthwhile if we can really get to debug and non-debug > >>> builds being ABI compatible. Getting partway there in this case doesn't > >>> seem to offer any benefits. > >> > >> Disabling Py_TRACE_REFS by default in debug mode reduces the Python > >> memory footprint. Py_TRACE_REFS costs 2 pointers per PyObject: 16 > >> bytes on 64-bit platforms. > > > > Right, except it's debug mode. > > I left this comment unfinished :) > > It's debug mode, and so you should expect less efficient memory and CPU > usage. That's why we have two modes - so that it's easier to debug issues. > > Now, if debug mode was unusably slow or had way too much overhead, we'd > want to fix that. But it isn't unusable, so reducing memory usage at the > cost of making debugging harder is not compelling. > > Cheers, > Steve > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (python.org/~guido) *Pronouns: he/him/his **(why is my pronoun here?)* -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg at krypto.org Wed Apr 10 15:20:25 2019 From: greg at krypto.org (Gregory P. Smith) Date: Wed, 10 Apr 2019 12:20:25 -0700 Subject: [Python-Dev] Need help to fix HTTP Header Injection vulnerability In-Reply-To: <8bd61b63-4fc7-91be-b717-226c205e3623@mail.mipt.ru> References: <8bd61b63-4fc7-91be-b717-226c205e3623@mail.mipt.ru> Message-ID: On Wed, Apr 10, 2019 at 11:00 AM Ivan Pozdeev via Python-Dev < python-dev at python.org> wrote: > > On 10.04.2019 7:30, Karthikeyan wrote: > > Thanks Gregory. I think it's a good tradeoff to ensure this validation > only for URLs of http scheme. > > I also agree handling newline is little problematic over the years and the > discussion over the level at which validation should occur also prolongs > some of the patches. https://bugs.python.org/issue35906 is another > similar case where splitlines is used but it's better to raise an error and > the proposed fix could be used there too. Victor seemed to wrote a similar > PR like linked one for other urllib functions only to fix similar attack in > ftplib to reject newlines that was eventually fixed only in ftplib > > * https://bugs.python.org/issue30713 > * https://bugs.python.org/issue29606 > > Search also brings multiple issues with one duplicate over another that > makes these attacks scattered over the tracker and some edge case missing. 
> Slightly off topic, the last time I reported a cookie related issue where > the policy can be overriden by third party library I was asked to fix it in > stdlib itself since adding fixes to libraries causes maintenance burden to > downstream libraries to keep up upstream. With urllib being a heavily used > module across ecosystem it's good to have a fix landing in stdlib that > secures downstream libraries encouraging users to upgrade Python too. > > Validation should occur whenever user data crosses a trust boundary -- > i.e. when the library starts to assume that an extracted chunk now contains > something valid. > > https://tools.ietf.org/html/rfc3986 defines valid syntax (incl. valid > characters) for every part of a URL -- _of any scheme_ (FYI, \r\n are > invalid everywhere and the test code for `data:' that Karthikeyan > referred to is raw data to compare to rather than a part of a URL). It also > obsoletes all the RFCs that the current code is written against. > > AFAICS, urllib.split* fns (obsoleted as public in 3.8) are used by both > urllib and urllib2 to parse URLs. They can be made to each validate the > chunk that they split off. urlparse can validate the entire URL altogether. > > Also, all modules ought to use the same code (urlparse looks like the best > candidate) to parse URLs -- this will minimize the attack surface. > > I think I can look into this later this week. > My PR as of last night cites that RFC and does validation in http.client while constructing the protocol request payload. Doing it within split functions was an initial hack that looked like it might work but didn't feel right as that isn't what people expect of those functions and that turned out to be the case as I tested things due to our mess of codepaths for opening URLs, but they all end with http.client so yay! I did *not* look at any of the async http client code paths. (legacy asyncore or new asyncio). If there is an issue there, those deserve to have their own bugs filed. As for third party PyPI libraries such as urllib3, they are on their own to fix bugs. If they happen to use a code path that a stdlib fix helps, good for them, but honestly they are much better off making and shipping their own update to avoid the bug. Users can get it much sooner as it's a mere pip install -U away rather than a python runtime upgrade. > Fixing this is going to break code that relies on the current code > accepting invalid URLs. But the docs have never said that e.g. in urlopen, > anything apart from a (valid) URL is accepted (in particular, this implies > that the user is responsible for escaping stuff properly before passing > it). So I would say that we are within our right here and whoever is > relying on those quirks is and has always been on unsupported territory. > yep. even http.client.HTTPConnection.request names the function parameter "url" so anyone embedding whitespace newlines and http protocol strings within that is well outside of supported territory (as one example in our own test_xmlrpc was taking advantage of to test a malformed request). I suggest following up on https://bugs.python.org/issue30458 rather than in this thread. the thread did its job, it directed our eyeballs at the problems. :) -gps > Determining which of those quirks are exploitable and which are not to fix > just the former is an incomparably larger, more error-prone and avoidable > work. If anything, the history of the issue referenced to by previous > posters clearly shows that this is too much to ask from the Python team. 
> > I also see other undocumented behavior like accepting '>' (also > obsoleted as public in 3.8) which I would like to but it's of no harm. > > -- > > Regards, > Ivan > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/greg%40krypto.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Apr 10 15:27:03 2019 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 10 Apr 2019 12:27:03 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: On Wed, Apr 10, 2019, 04:04 Victor Stinner wrote: > Le mar. 9 avr. 2019 ? 22:16, Steve Dower a ?crit > : > > What are the other changes that would be required? > > I don't know. > > > And is there another > > way to get the same functionality without ABI modifications? > > Py_TRACE_REFS is a double linked list of *all* Python objects. To get > this functionality, you need to store the list somewhere. I don't know > how to maintain such list outside the PyObject structure. > I assume these pointers get updated from some generic allocation/free code. Could that code instead overallocate by 16 bytes, use the first 16 bytes to hold the pointers, and then return the PyObject* as (actual allocated pointer + 16)? Basically the "container_of" trick. I don't think that I ever used sys.getobjects(), whereas many projects > use gc.get_objects() which is also available in release builds (not > only in debug builds). Can anyone explain what pydebug builds are... for? Confession: I've never used them myself, and don't know why I would want to. (I have to assume that most of Steve's Windows downloads are from folks who thought they were downloading a python debugger.) -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From aranea.network at gmail.com Wed Apr 10 15:39:27 2019 From: aranea.network at gmail.com (Robert Okadar) Date: Wed, 10 Apr 2019 21:39:27 +0200 Subject: [Python-Dev] (no subject) In-Reply-To: <20190410153555.GB3010@ando.pearwood.info> References: <20190410153555.GB3010@ando.pearwood.info> Message-ID: Hi Steven, Thank you for pointing me in the right direction. Will search for help on places you mentioned. Not sure how can we help you with developing the Python interpreter, as I doubt we have any knowledge that this project might use it. When I say 'we', I mean on my colleague and me. All the best, -- Robert Okadar IT Consultant Schedule an *online meeting * with me! Visit *aranea-mreze.hr* or call * +385 91 300 8887* On Wed, 10 Apr 2019 at 17:36, Steven D'Aprano wrote: > Hi Robert, > > This mailing list is for the development of the Python interpreter, not > a general help desk. There are many other forums where you can ask for > help, such as the comp.lang.python newsgroup, Stackoverflow, /r/python > on Reddit, the IRC channel, and more. > > Perhaps you can help us though, I presume you signed up to this mailing > list via the web interface at > > https://mail.python.org/mailman/listinfo/python-dev > > Is there something we could do to make it more clear that this is not > the right place to ask for help? > > > -- > Steven > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From brett at python.org Wed Apr 10 15:44:54 2019 From: brett at python.org (Brett Cannon) Date: Wed, 10 Apr 2019 12:44:54 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: On Wed, Apr 10, 2019 at 12:30 PM Nathaniel Smith wrote: > On Wed, Apr 10, 2019, 04:04 Victor Stinner wrote: > >> Le mar. 9 avr. 2019 ? 22:16, Steve Dower a >> ?crit : >> > What are the other changes that would be required? >> >> I don't know. >> >> > And is there another >> > way to get the same functionality without ABI modifications? >> >> Py_TRACE_REFS is a double linked list of *all* Python objects. To get >> this functionality, you need to store the list somewhere. I don't know >> how to maintain such list outside the PyObject structure. >> > > I assume these pointers get updated from some generic allocation/free > code. Could that code instead overallocate by 16 bytes, use the first 16 > bytes to hold the pointers, and then return the PyObject* as (actual > allocated pointer + 16)? Basically the "container_of" trick. > > I don't think that I ever used sys.getobjects(), whereas many projects >> use gc.get_objects() which is also available in release builds (not >> only in debug builds). > > > Can anyone explain what pydebug builds are... for? Confession: I've never > used them myself, and don't know why I would want to. > There is a bunch of extra things done in a debug build, e.g. all freed memory is blanked out with a known pattern so it's easy to tell when you're reading from freed memory (and thus probably messed up your refcounts). And then various extras are tossed on to the sys module to help with things. Basically anything people have found useful and require being compiled in typically get clumped in under the debug build. -Brett > > (I have to assume that most of Steve's Windows downloads are from folks > who thought they were downloading a python debugger.) > > -n > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/brett%40python.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve.dower at python.org Wed Apr 10 16:50:01 2019 From: steve.dower at python.org (Steve Dower) Date: Wed, 10 Apr 2019 13:50:01 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: <15ff965b-30a9-dc4b-cc60-940532f349b9@python.org> On 10Apr2019 1227, Nathaniel Smith wrote: > On Wed, Apr 10, 2019, 04:04 Victor Stinner > wrote: > I don't think that I ever used sys.getobjects(), whereas many projects > use gc.get_objects() which is also available in release builds (not > only in debug builds). > > > Can anyone explain what pydebug builds are... for? Confession: I've > never used them myself, and don't know why I would want to. > > (I have to assume that most of Steve's Windows downloads are from folks > who thought they were downloading a python debugger.) They're for debugging :) In general, debug builds are meant for faster inner-loop development. They generally do incremental builds properly and much faster by omitting most optimisations, which also enables source mapping to be more accurate when debugging. 
Assertions are typically enabled so that you are notified when a precondition is first identified rather than when it causes the crash (compiling these out later means you don't pay a runtime cost once you've got the inputs correct - generally these are used for developer-controlled values, rather than user-provided ones). So the idea is that you can quickly edit, build, debug, fix your code in a debug configuration, and then use a release configuration for the actual released build. Full release builds may take 2-3x longer than full debug builds, given the extra effort they make at optimisation, and very often can't do minimal incremental builds at all (so they may be 10-100x slower if you only modified one source file). But because the builds behave functionally equivalently, you can iterate with the faster configuration and get more done. (Disclaimer: I do most of my work on Windows where this has been properly developed. What I hear from non-Windows developers is that other tools can't actually handle this kind of workflow properly. Sorry.) The reason we ship debug Python binaries is because debug builds use a different C Runtime, so if you do a debug build of an extension module you're working on it won't actually work with a non-debug build of CPython. While it's possible that people misread "Download debug binaries" (the text in the installer) and think that it's an actual debugger, I'd suggest that your total lack of context here means you should avoid making assumptions about users you know nothing about. Cheers, Steve From tjreedy at udel.edu Wed Apr 10 17:00:42 2019 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 10 Apr 2019 17:00:42 -0400 Subject: [Python-Dev] (no subject) In-Reply-To: References: Message-ID: On 4/10/2019 7:24 AM, Robert Okadar wrote: > Hi community, > > I have developed a tkinter GUI component, Python v3.7. It runs very well in > Linux but seeing a huge performance impact in Windows 10. While in Linux an > almost real-time performance is achieved, in Windows it is slow to an > unusable level. > > The code is somewhat stripped down from the original, but the performance > difference is the same anyway. The columns can be resized by clicking on > the column border and dragging it. Resizing works only for the top row (but > it resizes the entire column). > In this demo, all bindings are avoided to exclude influence on the > component performance and thus not included. If you resize the window > (i.e., if you maximize it), you must call the function table.fit() from > IDLE shell. > > Does anyone know where is this huge difference in performance coming from? > Can anything be done about it? For reasons explained by Steve, please send this instead to python-list https://mail.python.org/mailman/listinfo/python-list To access python-list as a newsgroup, skip comp.lang.python and use newsgroup gmane.comp.python.general at news.gmane.org. I will respond there after testing/verifying and perhaps searching bugs.python.org for a similar issue. -- Terry Jan Reedy From tjreedy at udel.edu Wed Apr 10 17:06:28 2019 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 10 Apr 2019 17:06:28 -0400 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: <2778f0f5-3a29-54da-8292-dfc23a0c6621@python.org> References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <2778f0f5-3a29-54da-8292-dfc23a0c6621@python.org> Message-ID: On 4/10/2019 2:45 PM, Steve Dower wrote: > It's debug mode, and so you should expect less efficient memory and CPU > usage. 
On my Windows machine, 'python -m test -ugui' takes about twice as long. > That's why we have two modes - so that it's easier to debug issues. -- Terry Jan Reedy From python at mrabarnett.plus.com Wed Apr 10 17:15:06 2019 From: python at mrabarnett.plus.com (MRAB) Date: Wed, 10 Apr 2019 22:15:06 +0100 Subject: [Python-Dev] (no subject) In-Reply-To: References: Message-ID: <8eb6376a-5ce4-c0ad-ba35-438e140b304b@mrabarnett.plus.com> On 2019-04-10 22:00, Terry Reedy wrote: > On 4/10/2019 7:24 AM, Robert Okadar wrote: >> Hi community, >> >> I have developed a tkinter GUI component, Python v3.7. It runs very well in >> Linux but seeing a huge performance impact in Windows 10. While in Linux an >> almost real-time performance is achieved, in Windows it is slow to an >> unusable level. >> >> The code is somewhat stripped down from the original, but the performance >> difference is the same anyway. The columns can be resized by clicking on >> the column border and dragging it. Resizing works only for the top row (but >> it resizes the entire column). >> In this demo, all bindings are avoided to exclude influence on the >> component performance and thus not included. If you resize the window >> (i.e., if you maximize it), you must call the function table.fit() from >> IDLE shell. >> >> Does anyone know where is this huge difference in performance coming from? >> Can anything be done about it? > > For reasons explained by Steve, please send this instead to python-list > https://mail.python.org/mailman/listinfo/python-list > To access python-list as a newsgroup, skip comp.lang.python and use > newsgroup gmane.comp.python.general at news.gmane.org. > > I will respond there after testing/verifying and perhaps searching > bugs.python.org for a similar issue. > ttk has Treeview, which can be configured as a table. From vstinner at redhat.com Wed Apr 10 18:29:34 2019 From: vstinner at redhat.com (Victor Stinner) Date: Thu, 11 Apr 2019 00:29:34 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: Le mer. 10 avr. 2019 ? 20:09, Steve Dower a ?crit : > > The main question is if anyone ever used Py_TRACE_REFS? Does someone > > use sys.getobjects() or PYTHONDUMPREFS environment variable? > > > > Using PYTHONDUMPREFS=1 on a debug build (with Py_TRACE_REFS) does > > simply crash Python 3.7 at exit. So I don't think that anyone use it > > :-) > > How do we track reference leaks in the buildbots? Can/should we be using > this? Ah, maybe there is a misunderstanding. You don't need Py_TRACE_REFS to track memory leaks: "python3 -m test -R 3:3" works without that. test_regrtest contains an unit test for reference leaks (I know it that I wrote the test :-)), and you can see that the test pass on my PR. I also checked manually by adding a memory leak into a test: it is still detected :-) regrtest uses sys.gettotalrefcount(), sys.getallocatedblocks() and support.fd_count() to track reference, memory and file descriptor leaks. None of these functions are related to Py_TRACE_REFS. Again, the question is who rely on Py_TRACE_REFS. If nobody rely on it, I don't see the point of keeping this expensive feature (at least, not by default). > It doesn't crash on Python 3.8, so I suspect fixing the bug is a better > option than using it as an excuse to remove the feature. It's not what I said. I only said that it seems that nobody uses PYTHONDUMPREFS, since it's broken for a long time. 
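(To illustrate the "-R 3:3" mechanism mentioned above, the leak tracking boils
down to something like the sketch below. This is not the real regrtest code,
the helper name and its parameters are made up here, and sys.gettotalrefcount()
only exists in debug builds where Py_REF_DEBUG is defined:)

import sys

def refleak_delta(func, warmups=3, repeats=3):
    # Warm up first so caches and interned objects don't show up as "leaks".
    for _ in range(warmups):
        func()
    before = sys.gettotalrefcount()   # pydebug builds only (Py_REF_DEBUG)
    for _ in range(repeats):
        func()
    after = sys.gettotalrefcount()
    # A consistently positive delta per repeat hints at a reference leak.
    return (after - before) / repeats

The real regrtest does the same kind of bookkeeping with
sys.getallocatedblocks() and the count of open file descriptors.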
It's just a hint about the usage of Py_TRACE_REFS. I don't propose to remove the feature, but to disable it by default. Victor -- Night gathers, and now my watch begins. It shall not end until my death. From vstinner at redhat.com Wed Apr 10 18:41:29 2019 From: vstinner at redhat.com (Victor Stinner) Date: Thu, 11 Apr 2019 00:41:29 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: Le mer. 10 avr. 2019 ? 21:45, Brett Cannon a ?crit : >> Can anyone explain what pydebug builds are... for? Confession: I've never used them myself, and don't know why I would want to. > > There is a bunch of extra things done in a debug build, e.g. all freed memory is blanked out with a known pattern so it's easy to tell when you're reading from freed memory (and thus probably messed up your refcounts). Since the debug build ABI is incompatible, it's not easy to use a debug build. For that reasons, I'm working for a few years to add such debugging features into regular release build. For example, you can now get this debugger on memory allocations using PYTHONMALLOC=debug environment variable since Python 3.6. Since such debug feature is not easy to discover (especially if you don't read closely What's New In Python 3.x), I added a generic "-X dev" command line option to enable a "development mode". It enables various similar features to debug code: https://docs.python.org/dev/using/cmdline.html#id5 Effect of the developer mode: * Add default warning filter, as -W default. * Install debug hooks on memory allocators: see the PyMem_SetupDebugHooks() C function. * Enable the faulthandler module to dump the Python traceback on a crash. * Enable asyncio debug mode. * Set the dev_mode attribute of sys.flags to True See also https://pythondev.readthedocs.io/debug_tools.html where I started to document these debug tools and how to use them. > And then various extras are tossed on to the sys module to help with things. Basically anything people have found useful and require being compiled in typically get clumped in under the debug build. The debug build still contains many features which are useful to debug C extensions. For example, it adds sys.gettotalrefcnt() which is a convenient way to detect reference leaks. This funtion require Py_REF_DEBUG which modifies Py_INCREF() to add "_Py_RefTotal++;". Iit is has an impact on overall Python performance and should not be enabled in release build. Victor -- Night gathers, and now my watch begins. It shall not end until my death. From J.Demeyer at UGent.be Wed Apr 10 19:05:26 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Thu, 11 Apr 2019 01:05:26 +0200 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> Message-ID: <5CAE76B6.6090501@UGent.be> On 2019-04-10 18:25, Petr Viktorin wrote: > Hello! > I've had time for a more thorough reading of PEP 590 and the reference > implementation. Thank you for the work! And thank you for the review! > I'd now describe the fundamental > difference between PEP 580 and PEP 590 as: > - PEP 580 tries to optimize all existing calling conventions > - PEP 590 tries to optimize (and expose) the most general calling > convention (i.e. 
fastcall) And PEP 580 has better performance overall, even for METH_FASTCALL. See this thread: https://mail.python.org/pipermail/python-dev/2019-April/156954.html Since these PEPs are all about performance, I consider this a very relevant argument in favor of PEP 580. > PEP 580 also does a number of other things, as listed in PEP 579. But I > think PEP 590 does not block future PEPs for the other items. > On the other hand, PEP 580 has a much more mature implementation -- and > that's where it picked up real-world complexity. About complexity, please read what I wrote in https://mail.python.org/pipermail/python-dev/2019-March/156853.html I claim that the complexity in the protocol of PEP 580 is a good thing, as it removes complexity from other places, in particular from the users of the protocol (better have a complex protocol that's simple to use, rather than a simple protocol that's complex to use). As a more concrete example of the simplicity that PEP 580 could bring, CPython currently has 2 classes for bound methods implemented in C: - "builtin_function_or_method" for normal C methods - "method-descriptor" for slot wrappers like __eq__ or __add__ With PEP 590, these classes would need to stay separate to get maximal performance. With PEP 580, just one class for bound methods would be sufficient and there wouldn't be any performance loss. And this extends to custom third-party function/method classes, for example as implemented by Cython. > PEP 590's METH_VECTORCALL is designed to handle all existing use cases, > rather than mirroring the existing METH_* varieties. > But both PEPs require the callable's code to be modified, so requiring > it to switch calling conventions shouldn't be a problem. Agreed. > Jeroen's analysis from > https://mail.python.org/pipermail/python-dev/2018-July/154238.html seems > to miss a step at the top: > > a. CALL_FUNCTION* / CALL_METHOD opcode > calls > b. _PyObject_FastCallKeywords() > which calls > c. _PyCFunction_FastCallKeywords() > which calls > d. _PyMethodDef_RawFastCallKeywords() > which calls > e. the actual C function (*ml_meth)() > > I think it's more useful to say that both PEPs bridge a->e (via > _Py_VectorCall or PyCCall_Call). Not quite. For a builtin_function_or_method, we have with PEP 580: a. call_function() calls d. PyCCall_FastCall which calls e. the actual C function and with PEP 590 it's more like: a. call_function() calls c. _PyCFunction_FastCallKeywords which calls d. _PyMethodDef_RawFastCallKeywords which calls e. the actual C function Level c. above is the vectorcall wrapper, which is a level that PEP 580 doesn't have. > The way `const` is handled in the function signatures strikes me as too > fragile for public API. That's a detail which shouldn't influence the acceptance of either PEP. > Why not have a per-type pointer, and for types that need it (like > PyTypeObject), make it dispatch to an instance-specific function? That would be exactly https://bugs.python.org/issue29259 I'll let Mark comment on this. > Minor things: > - "Continued prohibition of callable classes as base classes" -- this > section reads as a final. Would you be OK wording this as something > other PEPs can tackle? > - "PyObject_VectorCall" -- this looks extraneous, and the reference > imlementation doesn't need it so far. Can it be removed, or justified? > - METH_VECTORCALL is *not* strictly "equivalent to the currently > undocumented METH_FASTCALL | METH_KEYWORD flags" (it has the > ARGUMENTS_OFFSET complication). 
> - I'd like to officially call this PEP "Vectorcall", see > https://github.com/python/peps/pull/984 Those are indeed details which shouldn't influence the acceptance of either PEP. If you go with PEP 590, then we should discuss this further. > Mark, what are your plans for next steps with PEP 590? If a volunteer > wanted to help you push this forward, what would be the best thing to > work on? Personally, I think what we need now is a decision between PEP 580 and PEP 590 (there is still the possibility of rejecting both but I really hope that this won't happen). There is a lot of work that still needs to be done after either PEP is accepted, such as: - finish and merge the reference implementation - document everything - use the protocol in more classes where it makes sense (for example, staticmethod, wrapper_descriptor) - use this in Cython - handle more issues from PEP 579 I volunteer to put my time into this, regardless of which PEP is accepted. Of course, I still think that PEP 580 is better, but I also want this functionality even if PEP 590 is accepted. > Jeroen, is there something in PEPs 579/580 that PEP 590 blocks, or > should address? Well, PEP 580 is an extensible protocol while PEP 590 is not. But, PyTypeObject is extensible, so even with PEP 590 one can always extend that (for example, PEP 590 uses a type flag Py_TPFLAGS_METHOD_DESCRIPTOR where PEP 580 instead uses the structs for the C call protocol). But I guess that extending PyTypeObject will be harder to justify (say, in a future PEP) than extending the C call protocol. Also, it's explicitly allowed for users of the PEP 580 protocol to extend the PyCCallDef structure with custom fields. But I don't have a concrete idea of whether that will be useful. Kind regards, Jeroen. From njs at pobox.com Wed Apr 10 22:17:28 2019 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 10 Apr 2019 19:17:28 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: <15ff965b-30a9-dc4b-cc60-940532f349b9@python.org> References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <15ff965b-30a9-dc4b-cc60-940532f349b9@python.org> Message-ID: On Wed, Apr 10, 2019 at 1:50 PM Steve Dower wrote: > > On 10Apr2019 1227, Nathaniel Smith wrote: > > On Wed, Apr 10, 2019, 04:04 Victor Stinner > > wrote: > > I don't think that I ever used sys.getobjects(), whereas many projects > > use gc.get_objects() which is also available in release builds (not > > only in debug builds). > > > > > > Can anyone explain what pydebug builds are... for? Confession: I've > > never used them myself, and don't know why I would want to. > > > > (I have to assume that most of Steve's Windows downloads are from folks > > who thought they were downloading a python debugger.) > > They're for debugging :) > > In general, debug builds are meant for faster inner-loop development. > They generally do incremental builds properly and much faster by > omitting most optimisations, which also enables source mapping to be > more accurate when debugging. Assertions are typically enabled so that > you are notified when a precondition is first identified rather than > when it causes the crash (compiling these out later means you don't pay > a runtime cost once you've got the inputs correct - generally these are > used for developer-controlled values, rather than user-provided ones). > > So the idea is that you can quickly edit, build, debug, fix your code in > a debug configuration, and then use a release configuration for the > actual released build. 
Full release builds may take 2-3x longer than > full debug builds, given the extra effort they make at optimisation, and > very often can't do minimal incremental builds at all (so they may be > 10-100x slower if you only modified one source file). But because the > builds behave functionally equivalently, you can iterate with the faster > configuration and get more done. Sure, I'm familiar with the idea of debug and optimization settings in compilers. I build python with custom -g and -O flags all the time. (I do it by setting OPT when running configure.) It's also handy that many Linux distros these days let you install debug metadata for all the binaries they ship ? I've used that when debugging third-party extension modules, to get a better idea of what was happening when a backtrace passes through libpython. But --with-pydebug is a whole other thing beyond that, that changes the ABI, has its own wheel tags, requires special cases in packages that use ctypes to access PyObject* internals, and appears to be almost entirely undocumented. It sounds like --with-pydebug has accumulated a big grab bag of unrelated features, mostly stuff that was useful at some point for some CPython dev trying to debug CPython itself? It's clearly not designed with end users as the primary audience, given that no-one knows what it actually does and that it makes third-party extensions really awkward to run. If that's right then I think Victor's plan of to sort through what it's actually doing makes a lot of sense, especially if we can remove the ABI breaking stuff, since that causes a disproportionate amount of trouble. > The reason we ship debug Python binaries is because debug builds use a > different C Runtime, so if you do a debug build of an extension module > you're working on it won't actually work with a non-debug build of CPython. ...But this is an important point. I'd forgotten that MSVC has a habit of changing the entire C runtime when you turn on the compiler's debugging mode. (On Linux, we generally don't bother rebuilding the C runtime unless you're actually debugging the C runtime, and anyway if you do want to switch to a debug version of the C runtime, it's ABI compatible so your program binaries don't have to be rebuilt.) Is it true that if the interpreter is built against ucrtd.lib, and an extension module is built against ucrt.lib, then they'll have incompatible ABIs and not work together? And that this detail is part of what's been glommed together into the "d" flag in the soabi tag on Windows? Is it possible for the Windows installer to include PDB files (/Zi /DEBUG) to allow debuggers to understand the regular release executable? (That's what I would have expected to get if I checked a box labeled "Download debug binaries".) -n -- Nathaniel J. Smith -- https://vorpus.org From storchaka at gmail.com Thu Apr 11 01:45:28 2019 From: storchaka at gmail.com (Serhiy Storchaka) Date: Thu, 11 Apr 2019 08:45:28 +0300 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: 10.04.19 14:01, Victor Stinner ????: > Disabling Py_TRACE_REFS by default in debug mode reduces the Python > memory footprint. Py_TRACE_REFS costs 2 pointers per PyObject: 16 > bytes on 64-bit platforms. Does not the memory allocator in debug mode have even larger cost per allocated block? 
From vstinner at redhat.com Thu Apr 11 05:28:51 2019 From: vstinner at redhat.com (Victor Stinner) Date: Thu, 11 Apr 2019 11:28:51 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: Le jeu. 11 avr. 2019 ? 07:49, Serhiy Storchaka a ?crit : > 10.04.19 14:01, Victor Stinner ????: > > Disabling Py_TRACE_REFS by default in debug mode reduces the Python > > memory footprint. Py_TRACE_REFS costs 2 pointers per PyObject: 16 > > bytes on 64-bit platforms. > > Does not the memory allocator in debug mode have even larger cost per > allocated block? What do you mean? That a debug build already waste too much memory and so doesn't deserve to have a smaller memory footprint? I'm not sure that I understand your point. A smaller footprint can mean that more people may be able to use debug build. Disabling Py_TRACE_REFS should make Python a little bit faster. My question stands: is it worth to keep a feature which "waste" resources (memory footprint and CPU) and nobody uses it? Debug hooks add 4 x sizeof(size_t) bytes to every memory allocation to detect buffer underflow and buffer overflow. That's 32 bytes per memory allocation. By the way, IMHO the "serial number" is not really useful and could be removed to only add 3 x sizeof(size_t) (24 bytes). But the debug hook is very useful, it's common that it helps me to find real bugs in the code. Whereas I don't recall that Py_TRACE_REFS helped me even once. Victor -- Night gathers, and now my watch begins. It shall not end until my death. From Peixing.Xin at windriver.com Thu Apr 11 05:45:00 2019 From: Peixing.Xin at windriver.com (Xin, Peixing) Date: Thu, 11 Apr 2019 09:45:00 +0000 Subject: [Python-Dev] checking "errno" for math operaton is safe to determine the error status? Message-ID: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BB0437@ALA-MBD.corp.ad.wrs.com> Hi, Math experts: Looking at the codes below, for many math operations, CPython is checking errno to determine the error status even though the math function returns normal value back. Is it a safe solution? From the description here http://man7.org/linux/man-pages/man3/errno.3.html and https://wiki.sei.cmu.edu/confluence/pages/viewpage.action?pageId=87152351, it looks apis probably set the errno when normal result is returned. Or being a side effect by calling other APIs in the implementation. In this situation, CPython's math operation might raise exceptions however in fact the result is correct. https://github.com/python/cpython/blob/master/Modules/mathmodule.c#L956 https://github.com/python/cpython/blob/master/Modules/mathmodule.c#L864 Thanks, Peixing From encukou at gmail.com Thu Apr 11 07:21:49 2019 From: encukou at gmail.com (Petr Viktorin) Date: Thu, 11 Apr 2019 13:21:49 +0200 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <5CAE76B6.6090501@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CAE76B6.6090501@UGent.be> Message-ID: <308f7b69-01d4-55c6-0bd6-d767ecc3fd99@gmail.com> On 4/11/19 1:05 AM, Jeroen Demeyer wrote: > On 2019-04-10 18:25, Petr Viktorin wrote: >> Hello! >> I've had time for a more thorough reading of PEP 590 and the reference >> implementation. Thank you for the work! > > And thank you for the review! 
One general note: I am not (yet) choosing between PEP 580 and PEP 590. I am not looking for arguments for/against whole PEPs, but individual ideas which, I believe, can still be mixed & matched. I see the situation this way: - I get about one day per week when I can properly concentrate on CPython. It's frustrating to be the bottleneck. - Jeroen has time, but it would frustrating to work on something that will later be discarded, and it's frustrating to not be able to move the project forward. - Mark has good ideas, but seems to lack the time to polish them, or even test out if they are good. It is probably frustrating to see unpolished ideas rejected. I'm looking for ways to reduce the frustration, given where we are. Jeroen, thank you for the comments. Apologies for not having the time to reply to all of them properly right now. Mark, if you could find the time to answer (even just a few of the points), it would be great. I ask you to share/clarify your thoughts, not defend your PEP. >> I'd now describe the fundamental >> difference between PEP 580 and PEP 590 as: >> - PEP 580 tries to optimize all existing calling conventions >> - PEP 590 tries to optimize (and expose) the most general calling >> convention (i.e. fastcall) > > And PEP 580 has better performance overall, even for METH_FASTCALL. See > this thread: > https://mail.python.org/pipermail/python-dev/2019-April/156954.html > > Since these PEPs are all about performance, I consider this a very > relevant argument in favor of PEP 580. > >> PEP 580 also does a number of other things, as listed in PEP 579. But I >> think PEP 590 does not block future PEPs for the other items. >> On the other hand, PEP 580 has a much more mature implementation -- and >> that's where it picked up real-world complexity. > About complexity, please read what I wrote in > https://mail.python.org/pipermail/python-dev/2019-March/156853.html > > I claim that the complexity in the protocol of PEP 580 is a good thing, > as it removes complexity from other places, in particular from the users > of the protocol (better have a complex protocol that's simple to use, > rather than a simple protocol that's complex to use). Sadly, I need more time on this than I have today; I'll get back to it next week. > As a more concrete example of the simplicity that PEP 580 could bring, > CPython currently has 2 classes for bound methods implemented in C: > - "builtin_function_or_method" for normal C methods > - "method-descriptor" for slot wrappers like __eq__ or __add__ > > With PEP 590, these classes would need to stay separate to get maximal > performance. With PEP 580, just one class for bound methods would be > sufficient and there wouldn't be any performance loss. And this extends > to custom third-party function/method classes, for example as > implemented by Cython. > >> PEP 590's METH_VECTORCALL is designed to handle all existing use cases, >> rather than mirroring the existing METH_* varieties. >> But both PEPs require the callable's code to be modified, so requiring >> it to switch calling conventions shouldn't be a problem. > > Agreed. > >> Jeroen's analysis from >> https://mail.python.org/pipermail/python-dev/2018-July/154238.html seems >> to miss a step at the top: >> >> a. CALL_FUNCTION* / CALL_METHOD opcode >> ?????? calls >> b. _PyObject_FastCallKeywords() >> ?????? which calls >> c. _PyCFunction_FastCallKeywords() >> ?????? which calls >> d. _PyMethodDef_RawFastCallKeywords() >> ?????? which calls >> e. 
the actual C function (*ml_meth)() >> >> I think it's more useful to say that both PEPs bridge a->e (via >> _Py_VectorCall or PyCCall_Call). > > Not quite. For a builtin_function_or_method, we have with PEP 580: > > a. call_function() > ??? calls > d. PyCCall_FastCall > ??? which calls > e. the actual C function > > and with PEP 590 it's more like: > > a. call_function() > ??? calls > c. _PyCFunction_FastCallKeywords > ??? which calls > d. _PyMethodDef_RawFastCallKeywords > ??? which calls > e. the actual C function > > Level c. above is the vectorcall wrapper, which is a level that PEP 580 > doesn't have. Again, I'll get back to this next week. >> The way `const` is handled in the function signatures strikes me as too >> fragile for public API. > > That's a detail which shouldn't influence the acceptance of either PEP. True. I guess what I want from the answer is to know how much thought went into const handling: is what's in the PEP an initial draft, or does it solve some hidden issue? >> Why not have a per-type pointer, and for types that need it (like >> PyTypeObject), make it dispatch to an instance-specific function? > > That would be exactly https://bugs.python.org/issue29259 > > I'll let Mark comment on this. > >> Minor things: >> - "Continued prohibition of callable classes as base classes" -- this >> section reads as a final. Would you be OK wording this as something >> other PEPs can tackle? >> - "PyObject_VectorCall" -- this looks extraneous, and the reference >> imlementation doesn't need it so far. Can it be removed, or justified? >> - METH_VECTORCALL is *not* strictly "equivalent to the currently >> undocumented METH_FASTCALL | METH_KEYWORD flags" (it has the >> ARGUMENTS_OFFSET complication). >> - I'd like to officially call this PEP "Vectorcall", see >> https://github.com/python/peps/pull/984 > > Those are indeed details which shouldn't influence the acceptance of > either PEP. If you go with PEP 590, then we should discuss this further. Here again, I mostly want to know if the details are there for deeper reasons, or just points to polish. >> Mark, what are your plans for next steps with PEP 590? If a volunteer >> wanted to help you push this forward, what would be the best thing to >> work on? > > Personally, I think what we need now is a decision between PEP 580 and > PEP 590 (there is still the possibility of rejecting both but I really > hope that this won't happen). There is a lot of work that still needs to > be done after either PEP is accepted, such as: > - finish and merge the reference implementation > - document everything > - use the protocol in more classes where it makes sense (for example, > staticmethod, wrapper_descriptor) > - use this in Cython > - handle more issues from PEP 579 > > I volunteer to put my time into this, regardless of which PEP is > accepted. Of course, I still think that PEP 580 is better, but I also > want this functionality even if PEP 590 is accepted. Thank you. Sorry for the way this is dragged out. Would it help to set some timeline/deadlines here? >> Jeroen, is there something in PEPs 579/580 that PEP 590 blocks, or >> should address? > > Well, PEP 580 is an extensible protocol while PEP 590 is not. But, > PyTypeObject is extensible, so even with PEP 590 one can always extend > that (for example, PEP 590 uses a type flag Py_TPFLAGS_METHOD_DESCRIPTOR > where PEP 580 instead uses the structs for the C call protocol). 
But I > guess that extending PyTypeObject will be harder to justify (say, in a > future PEP) than extending the C call protocol. Thanks. I also like PEP 580's extensibility. > Also, it's explicitly allowed for users of the PEP 580 protocol to > extend the PyCCallDef structure with custom fields. But I don't have a > concrete idea of whether that will be useful. I don't have good general experience with premature extensibility, so I'd not count this as a plus. From J.Demeyer at UGent.be Thu Apr 11 08:04:27 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Thu, 11 Apr 2019 14:04:27 +0200 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <308f7b69-01d4-55c6-0bd6-d767ecc3fd99@gmail.com> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CAE76B6.6090501@UGent.be> <308f7b69-01d4-55c6-0bd6-d767ecc3fd99@gmail.com> Message-ID: <5CAF2D4B.8070205@UGent.be> Petr, I realize that you are in a difficult position. You'll end up disappointing either me or Mark... I don't know if the steering council or somebody else has a good idea to deal with this situation. > Jeroen has time Speaking of time, maybe I should clarify that I have time until the end of August: I am working for the OpenDreamKit grant, which allows me to work basically full-time on open source software development but that ends at the end of August. > Here again, I mostly want to know if the details are there for deeper > reasons, or just points to polish. I would say: mostly shallow details. The subclassing thing would be good to resolve, but I don't see any difference between PEP 580 and PEP 590 there. In PEP 580, I wrote a strategy for dealing with subclassing. I believe that it works and that exactly the same idea would work for PEP 590 too. Of course, I may be overlooking something... > I don't have good general experience with premature extensibility, so > I'd not count this as a plus. Fair enough. I also see it more as a "nice to have", not as a big plus. From christian at python.org Thu Apr 11 08:23:48 2019 From: christian at python.org (Christian Heimes) Date: Thu, 11 Apr 2019 14:23:48 +0200 Subject: [Python-Dev] checking "errno" for math operaton is safe to determine the error status? In-Reply-To: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BB0437@ALA-MBD.corp.ad.wrs.com> References: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BB0437@ALA-MBD.corp.ad.wrs.com> Message-ID: <5fb2462a-ee2c-00e3-1928-d56510e29570@python.org> On 11/04/2019 11.45, Xin, Peixing wrote: > Hi, Math experts: > > Looking at the codes below, for many math operations, CPython is checking errno to determine the error status even though the math function returns normal value back. Is it a safe solution? From the description here http://man7.org/linux/man-pages/man3/errno.3.html and https://wiki.sei.cmu.edu/confluence/pages/viewpage.action?pageId=87152351, it looks apis probably set the errno when normal result is returned. Or being a side effect by calling other APIs in the implementation. In this situation, CPython's math operation might raise exceptions however in fact the result is correct. > > https://github.com/python/cpython/blob/master/Modules/mathmodule.c#L956 > https://github.com/python/cpython/blob/master/Modules/mathmodule.c#L864 This is safe because all places first set errno to 0. 
Errno is a thread local variable, so other threads cannot influence the variable during the calls. This is one of the many quirks that Mark has implemented for platforms bugs in various libm. Christian From steve.dower at python.org Thu Apr 11 11:19:51 2019 From: steve.dower at python.org (Steve Dower) Date: Thu, 11 Apr 2019 08:19:51 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: <3de8eb9f-e2ad-f5b2-5de6-ba02fd4e3895@python.org> On 11Apr2019 0228, Victor Stinner wrote: > Le jeu. 11 avr. 2019 ? 07:49, Serhiy Storchaka a ?crit : >> 10.04.19 14:01, Victor Stinner ????: >>> Disabling Py_TRACE_REFS by default in debug mode reduces the Python >>> memory footprint. Py_TRACE_REFS costs 2 pointers per PyObject: 16 >>> bytes on 64-bit platforms. >> >> Does not the memory allocator in debug mode have even larger cost per >> allocated block? > > What do you mean? That a debug build already waste too much memory and > so doesn't deserve to have a smaller memory footprint? I'm not sure > that I understand your point. He means you're micro-optimising something that doesn't matter. If you really wanted to reduce memory usage in debug builds, you'd go after one of the bigger "problems". > A smaller footprint can mean that more people may be able to use debug > build. Disabling Py_TRACE_REFS should make Python a little bit faster. This isn't one of the goals of a debug build though, and you haven't pointed at any examples of people not being able to use the debug build because of memory pressure. (Which is because most people who are not working on CPython itself should not be using the debug build.) > My question stands: is it worth to keep a feature which "waste" > resources (memory footprint and CPU) and nobody uses it? You haven't even tried to show that nobody uses it, other than pointing out that it exposes a crash due to a refcounting bug (which is kind of the point ;) ). Cheers, Steve From steve.dower at python.org Thu Apr 11 11:26:47 2019 From: steve.dower at python.org (Steve Dower) Date: Thu, 11 Apr 2019 08:26:47 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <15ff965b-30a9-dc4b-cc60-940532f349b9@python.org> Message-ID: On 10Apr2019 1917, Nathaniel Smith wrote: > It sounds like --with-pydebug has accumulated a big grab bag of > unrelated features, mostly stuff that was useful at some point for > some CPython dev trying to debug CPython itself? It's clearly not > designed with end users as the primary audience, given that no-one > knows what it actually does and that it makes third-party extensions > really awkward to run. If that's right then I think Victor's plan of > to sort through what it's actually doing makes a lot of sense, > especially if we can remove the ABI breaking stuff, since that causes > a disproportionate amount of trouble. Does it really cause a "disproportionate" amount of trouble? It's definitely not meant for anyone who isn't working on C code, whether in CPython, an extension or a host application. If you want to use third-party extensions and are not able to rebuild them, that's a very good sign that you probably shouldn't be on the debug build at all. Perhaps the "--with-pydebug" option is too attractive? (Is it the default?) That's easily fixed. 
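(As an aside, for anyone unsure which build they are actually running, a rough
check from Python itself looks like the sketch below: sys.gettotalrefcount()
is only compiled into pydebug builds, the Py_DEBUG config variable reflects
--with-pydebug on POSIX builds, and sys.abiflags contains 'd' there; Windows
may not expose the last two.)

import sys
import sysconfig

print("pydebug build:", hasattr(sys, "gettotalrefcount"))
print("Py_DEBUG:", sysconfig.get_config_var("Py_DEBUG"))   # 1 on --with-pydebug, may be None elsewhere
print("abiflags:", getattr(sys, "abiflags", ""))           # contains 'd' for a pydebug build on POSIX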
>> The reason we ship debug Python binaries is because debug builds use a >> different C Runtime, so if you do a debug build of an extension module >> you're working on it won't actually work with a non-debug build of CPython. > > ...But this is an important point. I'd forgotten that MSVC has a habit > of changing the entire C runtime when you turn on the compiler's > debugging mode. Technically they are separate options, but most project files are configured such that *their* Debug/Release switch affects both the compiler options (optimization) and the linker options (C runtime linkage). > Is it true that if the interpreter is built against ucrtd.lib, and an > extension module is built against ucrt.lib, then they'll have > incompatible ABIs and not work together? And that this detail is part > of what's been glommed together into the "d" flag in the soabi tag on > Windows? Yep, except it's not actually in the soabi tag, but it's the "_d" suffix on module/executable names. > Is it possible for the Windows installer to include PDB files (/Zi > /DEBUG) to allow debuggers to understand the regular release > executable? (That's what I would have expected to get if I checked a > box labeled "Download debug binaries".) That box is immediately below one labelled "Download debug symbols", so hopefully seeing it in context would have set the right expectation. (And since I have them, there were 1.3 million downloads of the symbol packages via this option in March, but we also enable it by default via Visual Studio and that's responsible for about 1 million of those.) Cheers, Steve From storchaka at gmail.com Thu Apr 11 11:30:18 2019 From: storchaka at gmail.com (Serhiy Storchaka) Date: Thu, 11 Apr 2019 18:30:18 +0300 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: 11.04.19 12:28, Victor Stinner ????: > Le jeu. 11 avr. 2019 ? 07:49, Serhiy Storchaka a ?crit : >> 10.04.19 14:01, Victor Stinner ????: >>> Disabling Py_TRACE_REFS by default in debug mode reduces the Python >>> memory footprint. Py_TRACE_REFS costs 2 pointers per PyObject: 16 >>> bytes on 64-bit platforms. >> >> Does not the memory allocator in debug mode have even larger cost per >> allocated block? > > What do you mean? That a debug build already waste too much memory and > so doesn't deserve to have a smaller memory footprint? I'm not sure > that I understand your point. If reducing the Python memory footprint is an argument for disabling Py_TRACE_REFS, it is a weak argument because there is larger overhead in the debug build. On other hand, since using the debug allocator doesn't cause problems with compatibility, it may be possible to use similar technique for the objects double list. Although this is not easy because of objects placed at static memory. From brett at python.org Thu Apr 11 16:32:15 2019 From: brett at python.org (Brett Cannon) Date: Thu, 11 Apr 2019 13:32:15 -0700 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <5CAF2D4B.8070205@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CAE76B6.6090501@UGent.be> <308f7b69-01d4-55c6-0bd6-d767ecc3fd99@gmail.com> <5CAF2D4B.8070205@UGent.be> Message-ID: On Thu, Apr 11, 2019 at 5:06 AM Jeroen Demeyer wrote: > Petr, > > I realize that you are in a difficult position. 
You'll end up > disappointing either me or Mark... > > I don't know if the steering council or somebody else has a good idea to > deal with this situation. > Our answer was "ask Petr to be BDFL Delegate". ;) In all seriousness, none of us on the council or as well equipped as Petr to handle this tough decision, else it would take even longer for us to learn enough to make an informed decision and we would be even worse off. -Brett > > > Jeroen has time > > Speaking of time, maybe I should clarify that I have time until the end > of August: I am working for the OpenDreamKit grant, which allows me to > work basically full-time on open source software development but that > ends at the end of August. > > > Here again, I mostly want to know if the details are there for deeper > > reasons, or just points to polish. > > I would say: mostly shallow details. > > The subclassing thing would be good to resolve, but I don't see any > difference between PEP 580 and PEP 590 there. In PEP 580, I wrote a > strategy for dealing with subclassing. I believe that it works and that > exactly the same idea would work for PEP 590 too. Of course, I may be > overlooking something... > > > I don't have good general experience with premature extensibility, so > > I'd not count this as a plus. > > Fair enough. I also see it more as a "nice to have", not as a big plus. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/brett%40python.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Thu Apr 11 21:06:14 2019 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 11 Apr 2019 18:06:14 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: On Thu, Apr 11, 2019 at 8:32 AM Serhiy Storchaka wrote: > On other hand, since using the debug allocator doesn't cause problems > with compatibility, it may be possible to use similar technique for the > objects double list. Although this is not easy because of objects placed > at static memory. I guess one could track static objects separately, e.g. keep a simple global PyList containing all statically allocated objects. (This is easy since we know they're all immortal.) And then sys.getobjects() could walk the heap objects and statically allocated objects separately. -n -- Nathaniel J. Smith -- https://vorpus.org From Peixing.Xin at windriver.com Thu Apr 11 22:19:32 2019 From: Peixing.Xin at windriver.com (Xin, Peixing) Date: Fri, 12 Apr 2019 02:19:32 +0000 Subject: [Python-Dev] checking "errno" for math operaton is safe to determine the error status? In-Reply-To: <5fb2462a-ee2c-00e3-1928-d56510e29570@python.org> References: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BB0437@ALA-MBD.corp.ad.wrs.com> <5fb2462a-ee2c-00e3-1928-d56510e29570@python.org> Message-ID: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BB0B64@ALA-MBD.corp.ad.wrs.com> Thanks for your explanation, Christian. Actually my question is not about thread safe or the original value 0 on errno. Probably I didn't express the point clearly. To be more clear, let me take expm1 as an example below. On certain platform, expm1() is implemented as exp() minus 1. To calculate expm1(-1420.0), that will call exp(-1420.0) then substract 1. 
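For reference, the expected results are just the ordinary stdlib calls below;
on a platform with a well-behaved libm both succeed:

import math

print(math.exp(-1420.0))    # 0.0  -- underflows to zero
print(math.expm1(-1420.0))  # -1.0 -- exact, no OverflowError expected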
You know, exp(-1420.0) will underflow to zero and errno is set to ERANGE. As a consequence the errno keeps set there when expm1() returns the correct result -1. So for this situation, CPthon's api is_error() will raise overflow unexpectedly. Whose bug should it be scoped to? A bug of the platform? Isn't errno allowed to be set when calculation gets normal result? Thanks, Peixing -----Original Message----- From: Christian Heimes [mailto:christian at python.org] Sent: Thursday, April 11, 2019 8:24 PM To: Xin, Peixing; python-dev at python.org; Mark Dickinson Subject: Re: checking "errno" for math operaton is safe to determine the error status? On 11/04/2019 11.45, Xin, Peixing wrote: > Hi, Math experts: > > Looking at the codes below, for many math operations, CPython is checking errno to determine the error status even though the math function returns normal value back. Is it a safe solution? From the description here http://man7.org/linux/man-pages/man3/errno.3.html and https://wiki.sei.cmu.edu/confluence/pages/viewpage.action?pageId=87152351, it looks apis probably set the errno when normal result is returned. Or being a side effect by calling other APIs in the implementation. In this situation, CPython's math operation might raise exceptions however in fact the result is correct. > > https://github.com/python/cpython/blob/master/Modules/mathmodule.c#L956 > https://github.com/python/cpython/blob/master/Modules/mathmodule.c#L864 This is safe because all places first set errno to 0. Errno is a thread local variable, so other threads cannot influence the variable during the calls. This is one of the many quirks that Mark has implemented for platforms bugs in various libm. Christian From greg.ewing at canterbury.ac.nz Fri Apr 12 01:44:33 2019 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 12 Apr 2019 17:44:33 +1200 Subject: [Python-Dev] checking "errno" for math operaton is safe to determine the error status? In-Reply-To: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BB0B64@ALA-MBD.corp.ad.wrs.com> References: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BB0437@ALA-MBD.corp.ad.wrs.com> <5fb2462a-ee2c-00e3-1928-d56510e29570@python.org> <8488FBC4EAAC5941BA4B85DD1ECCF1870133BB0B64@ALA-MBD.corp.ad.wrs.com> Message-ID: <5CB025C1.2060708@canterbury.ac.nz> Xin, Peixing wrote: > On certain platform, expm1() is implemented as exp() minus 1. To calculate > expm1(-1420.0), that will call exp(-1420.0) then substract 1. You know, > exp(-1420.0) will underflow to zero and errno is set to ERANGE. As a > consequence the errno keeps set there when expm1() returns the correct result > -1. This sounds like a bug in that platform's implementation of expm1() to me. Which platform is it? -- Greg From vstinner at redhat.com Fri Apr 12 06:57:04 2019 From: vstinner at redhat.com (Victor Stinner) Date: Fri, 12 Apr 2019 12:57:04 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: Le jeu. 11 avr. 2019 ? 17:33, Serhiy Storchaka a ?crit : > If reducing the Python memory footprint is an argument for disabling > Py_TRACE_REFS, it is a weak argument because there is larger overhead in > the debug build. The "serialno" field of debug memory allocators is documented as: "an excellent way to set a breakpoint on the next run, to capture the instant at which this block was passed out." I'm debugging crashes and memory leaks in CPython for 10 years, and I simply never had to use "serialno". 
I wrote https://bugs.python.org/issue36611 to remove the serialno field of debug hooks on Python memory allocators: it reduces the memory footprint by 5% (ex: 1.2 MiB on 33.0 MiB when running test_asyncio). Python is used on devices with low memory (ex: 256 MiB for the whole system). Allowing developers to use a debug build on such devices seem to be a legit rationale for such change. The debug build is very useful to identify bugs in C extensions. > On other hand, since using the debug allocator doesn't cause problems > with compatibility, it may be possible to use similar technique for the > objects double list. Although this is not easy because of objects placed > at static memory. I'm not sure of what you means by "objects placed at static memory": the double linked list of all Python objects is created at runtime. _ob_next and _ob_prev are initialized statically to NULL. I would be interested if Py_TRACE_REFS could be reimplemented in a more dynamic fashion. Even if it would still require a debug build, it would be nice to be able to "opt-in" for this feature (have it disabled by default, again, to reduce the overhead and reduce the memory footprint), as tracemalloc which plugs itself into memory allocators to attach traces to memory blocks. Except Guido who wrote "I recall finding memory leaks using this. (E.g. I remember a leak in Zope due to a cache that was never pruned.) But presumably gc.get_objects() would have been sufficient. (IIRC it didn't exist at the time.)", at this point, nobody said that they use Py_TRACE_REFS. So I'm not sure that it's worth it to invest time on a feature if nobody uses it? Victor -- Night gathers, and now my watch begins. It shall not end until my death. From vstinner at redhat.com Fri Apr 12 08:06:04 2019 From: vstinner at redhat.com (Victor Stinner) Date: Fri, 12 Apr 2019 14:06:04 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: Le ven. 12 avr. 2019 ? 12:57, Victor Stinner a ?crit : > I wrote https://bugs.python.org/issue36611 to remove the serialno field > of debug hooks on Python memory allocators: it reduces > the memory footprint by 5% (ex: 1.2 MiB on 33.0 MiB when running > test_asyncio). I measured the memory footprint when I combine my two changes: * disable Py_TRACE_REFS: https://bugs.python.org/issue36465 * disable/remove serialno field: https://bugs.python.org/issue36611 python3 -m test test_asyncio, without => with the change: 34,038.0 kB => 30,612.2 kB (-3,425.8 kiB, -10%) A reduction of 3.4 MiB on 34.0 MiB is quite significant, no? Victor -- Night gathers, and now my watch begins. It shall not end until my death. From vstinner at redhat.com Fri Apr 12 08:34:56 2019 From: vstinner at redhat.com (Victor Stinner) Date: Fri, 12 Apr 2019 14:34:56 +0200 Subject: [Python-Dev] PEP-582 and multiple Python installations In-Reply-To: References: Message-ID: Hi, Le mar. 2 avr. 2019 ? 17:20, Calvin Spealman a ?crit : > While the PEP does show the version number as part of the path to the actual packages, implying support for multiple versions, this doesn't seem to be spelled out in the actual text. Presumably __pypackages__/3.8/ might sit beside __pypackages__/3.9/, etc. to keep future versions capable of installing packages for each version, the way virtualenv today is bound to one version of Python. 
> > > I'd like to raise a potential edge case that might be a problem, and likely an increasingly common one: users with multiple installations of the *same* version of Python. Hum, I don't know if it's relevant to support multiple Python binaries of the same Python version, but just in case, let me share my experience with that in the pyperformance project. The pyperformance project uses virtual environment for two binaries of the exact Python version (and usually the same path!): one unpatched "reference" and one "patched" binary, to experiment an optimization. I needed a way to build a short text identifier to still be able to get a "cached" virtual environment per Python binary. I wrote a short code to generate the identifier using: * pyperformance version * requirements.txt * sys.executable * sys.version * sys.version_info * sys.implementation.name of platform.python_implementation() The script builds a long string using these info, hash it with SHA1 and take first 12 characters of the hexadecimal format of the hash. Script: --- import hashlib import platform import sys performance_version = sys.argv[1] requirements = sys.argv[2] data = performance_version + sys.executable + sys.version pyver= sys.version_info if hasattr(sys, 'implementation'): # PEP 421, Python 3.3 implementation = sys.implementation.name else: implementation = platform.python_implementation() implementation = implementation.lower() if not isinstance(data, bytes): data = data.encode('utf-8') with open(requirements, 'rb') as fp: data += fp.read() sha1 = hashlib.sha1(data).hexdigest() name = ('%s%s.%s-%s' % (implementation, pyver.major, pyver.minor, sha1[:12])) print(name) --- Examples: $ touch requirements.txt # empty file $ python3.7 x.py version requirements.txt cpython3.7-502d35b8e005 $ python3.6 x.py version requirements.txt cpython3.6-7f4febbec0be $ python3 x.py version requirements.txt cpython3.7-59ab636dfacb $ file /usr/bin/python3 /usr/bin/python3: symbolic link to python3.7 Hum, python3 and python3.7 produce the different hash whereas it's the same binary. Maybe os.path.realpath() should be called on sys.executable :-) Victor -- Night gathers, and now my watch begins. It shall not end until my death. From stefan_ml at behnel.de Fri Apr 12 08:48:30 2019 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 12 Apr 2019 14:48:30 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: Serhiy Storchaka schrieb am 11.04.19 um 17:30: > If reducing the Python memory footprint is an argument for disabling > Py_TRACE_REFS, it is a weak argument because there is larger overhead in > the debug build. I think what Victor is argueing is rather that we have better ways to debug memory problems these days, so we might be able to get rid of a relict that no-one is using (or should be using) anymore and that has its drawbacks (such as a very different ABI and higher memory load). I don't really have an opinion here, but I can at least say that I never found a use case for Py_TRACE_REFS myself and therefore certainly wouldn't miss it. Stefan From songofacandy at gmail.com Fri Apr 12 09:44:05 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Fri, 12 Apr 2019 22:44:05 +0900 Subject: [Python-Dev] Proposal: dict.with_values(iterable) Message-ID: Hi, all. I propose adding new method: dict.with_values(iterable) # Motivation Python is used to handle data. 
While a dict is not the most efficient way to handle many records, it is still a convenient one. When creating many dicts with the same keys, the dict needs to look up its internal hash table while inserting each key.

It is a costly operation. If we can reuse the existing keys of a dict, we can skip this insertion cost.

Additionally, we have the "Key-Sharing Dictionary" (PEP 412). When all keys are strings, many dicts can share one set of keys. It reduces memory consumption.

This might be usable for:

* csv.DictReader
* namedtuple._asdict()
* DB-API 2.0 implementations: (e.g. DictCursor of mysqlclient-python)

# Draft implementation

pull request: https://github.com/python/cpython/pull/12802

with_values(self, iterable, /)
    Create a new dictionary with keys from this dict and values from iterable.

    When length of iterable is different from len(self), ValueError is raised.
    This method does not support dict subclass.

## Memory usage (Key-Sharing dict)

>>> import sys
>>> keys = tuple("abcdefg")
>>> keys
('a', 'b', 'c', 'd', 'e', 'f', 'g')
>>> d = dict(zip(keys, range(7)))
>>> d
{'a': 0, 'b': 1, 'c': 2, 'd': 3, 'e': 4, 'f': 5, 'g': 6}
>>> sys.getsizeof(d)
360

>>> keys = dict.fromkeys("abcdefg")
>>> d = keys.with_values(range(7))
>>> d
{'a': 0, 'b': 1, 'c': 2, 'd': 3, 'e': 4, 'f': 5, 'g': 6}
>>> sys.getsizeof(d)
144

## Speed

$ ./python -m perf timeit -o zip_dict.json -s 'keys = tuple("abcdefg"); values=[*range(7)]' 'dict(zip(keys, values))'
$ ./python -m perf timeit -o with_values.json -s 'keys = dict.fromkeys("abcdefg"); values=[*range(7)]' 'keys.with_values(values)'
$ ./python -m perf compare_to zip_dict.json with_values.json
Mean +- std dev: [zip_dict] 935 ns +- 9 ns -> [with_values] 109 ns +- 2 ns: 8.59x faster (-88%)

What do you think? Any comments are appreciated.

Regards,
-- 
Inada Naoki

From vstinner at redhat.com Fri Apr 12 10:31:47 2019
From: vstinner at redhat.com (Victor Stinner)
Date: Fri, 12 Apr 2019 16:31:47 +0200
Subject: [Python-Dev] Proposal: dict.with_values(iterable)
In-Reply-To: 
References: 
Message-ID: 

Nice optimization! I have questions on the proposed API.

> with_values(self, iterable, /)
> Create a new dictionary with keys from this dict and values from iterable.
>
> When length of iterable is different from len(self), ValueError is raised.
> This method does not support dict subclass.

In short, mydict.with_values(values) behaves as dict(zip(mydict.keys(), values)), but is more efficient?

The method relies on the fact that dict preserves key insertion order, right?

On Fri, Apr 12, 2019 at 15:47, Inada Naoki wrote:
> This might be usable for:
>
> * csv.DictReader
> * namedtuple._asdict()
> * DB-API 2.0 implementations: (e.g. DictCursor of mysqlclient-python)

I guess that a new dict constructor taking keys and values like dict.from_keys_and_values(keys, values) would work, but would not benefit from the dict key-sharing optimization?

Would it be possible to implement the key-sharing optimization using a dict.from_keys_and_values(mydict.keys(), values) method: detect that keys are owned by a dict, and so create a new dict linked to the keys dict? A dict view contains a reference to the iterated dict (dictiterobject.di_dict).

I'm fine with dict.with_values() API, but I'm asking if it could be written differently.

Victor
-- 
Night gathers, and now my watch begins. It shall not end until my death.
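(For readers skimming the thread: the status quo that the proposed with_values() method aims to speed up is the per-row dict(zip(...)) pattern used by csv.DictReader and similar code. The sketch below works on any current Python 3; the with_values() part shown in comments is only the proposal, not an existing API, and the variable names are illustrative.)

# Works today: every row rebuilds the hash table from scratch.
keys = ("id", "name", "email")
rows = [(1, "alice", "alice@example.com"),
        (2, "bob", "bob@example.com")]

records = [dict(zip(keys, row)) for row in rows]
print(records[0])   # {'id': 1, 'name': 'alice', 'email': 'alice@example.com'}

# With the proposal, the keys would be prepared once and only the values
# would change per row (method name and semantics as proposed above):
#
#   template = dict.fromkeys(keys)
#   records = [template.with_values(row) for row in rows]
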
From guido at python.org Fri Apr 12 10:40:13 2019 From: guido at python.org (Guido van Rossum) Date: Fri, 12 Apr 2019 07:40:13 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: On Fri, Apr 12, 2019 at 5:51 AM Stefan Behnel wrote: > Serhiy Storchaka schrieb am 11.04.19 um 17:30: > > If reducing the Python memory footprint is an argument for disabling > > Py_TRACE_REFS, it is a weak argument because there is larger overhead in > > the debug build. > > I think what Victor is argueing is rather that we have better ways to debug > memory problems these days, so we might be able to get rid of a relict that > no-one is using (or should be using) anymore and that has its drawbacks > (such as a very different ABI and higher memory load). > > I don't really have an opinion here, but I can at least say that I never > found a use case for Py_TRACE_REFS myself and therefore certainly wouldn't > miss it. > I have a feeling that at some point someone might want to use this to debug some leak (presumably caused by C code) beyond what gc.get_objects() can report. But I agree that it isn't useful to the vast majority of users of a regular debug build. So let's leave it off by default even in debug builds. But let's not delete the macros. -- --Guido van Rossum (python.org/~guido) *Pronouns: he/him/his **(why is my pronoun here?)* -------------- next part -------------- An HTML attachment was scrubbed... URL: From mhroncok at redhat.com Fri Apr 12 10:53:16 2019 From: mhroncok at redhat.com (=?UTF-8?Q?Miro_Hron=c4=8dok?=) Date: Fri, 12 Apr 2019 16:53:16 +0200 Subject: [Python-Dev] Update PEP 394: Distributions can choose what does python command mean Message-ID: <82dd4715-9709-1fcf-769f-4902fd381578@redhat.com> Hello. Based on discussions in [1], Petr Viktorin and me have drafted a new update [2] to the PEP 394 (The "python" Command on Unix-Like Systems). The update gives distributors the opportunity to decide where does the "python" command lead to, whether it is present etc. Please, see the PR [2] for the suggested changes. [1]: https://mail.python.org/pipermail/python-dev/2019-February/156272.html [2]: https://github.com/python/peps/pull/989 Thanks, -- Miro Hron?ok -- Phone: +420777974800 IRC: mhroncok From encukou at gmail.com Fri Apr 12 10:59:18 2019 From: encukou at gmail.com (Petr Viktorin) Date: Fri, 12 Apr 2019 16:59:18 +0200 Subject: [Python-Dev] Update PEP 394: Distributions can choose what does python command mean In-Reply-To: <82dd4715-9709-1fcf-769f-4902fd381578@redhat.com> References: <82dd4715-9709-1fcf-769f-4902fd381578@redhat.com> Message-ID: <4c143e85-be0c-7995-0129-30d19b9bb690@gmail.com> On 4/12/19 4:53 PM, Miro Hron?ok wrote: > Hello. > > Based on discussions in [1], Petr Viktorin and me have drafted a new > update [2] to the PEP 394 (The "python" Command on Unix-Like Systems). > > The update gives distributors the opportunity to decide where does the > "python" command lead to, whether it is present etc. > > Please, see the PR [2] for the suggested changes. 
> > [1]: https://mail.python.org/pipermail/python-dev/2019-February/156272.html > [2]: https://github.com/python/peps/pull/989 The text is available at https://github.com/hroncok/peps/blob/pep394-2019/pep-0394.txt As a summary, I'll paste the rationale sections here: History of this PEP =================== In 2011, the majority of distributions aliased the ``python`` command to Python 2, but some started switching it to Python 3 ([5]_). As some of the former distributions did not provide a ``python2`` command by default, there was previously no way for Python 2 code (or any code that invokes the Python 2 interpreter directly rather than via ``sys.executable``) to reliably run on all Unix-like systems without modification, as the ``python`` command would invoke the wrong interpreter version on some systems, and the ``python2`` command would fail completely on others. This PEP originally provided a very simple mechanism to restore cross-platform support, with minimal additional work required on the part of distribution maintainers. Simplified, the recommendation was: 1. The ``python`` command was preferred for code compatible with both Python 2 and 3 (since it was available on all systems, even those that already aliased it to Python 3). 2. The ``python`` command should always invoke Python 2 (to prevent hard-to-diagnose errors when Python 2 code is run on Python 3). 3. The ``python2`` and ``python3`` commands should be available to specify the version explicitly. However, these recommendations implicitly assumed that Python 2 would always be available. As Python 2 is nearing its end of life in 2020 (PEP 373, PEP 404), distributions are making Python 2 optional or removing entirely. This means either removing the ``python`` command or switching it to invoke Python 3, invalidating respectively the first or second recommendation. Also, some distributors decided that their users are better served by ignoring the PEP's recommendations, making the PEP's supposedly cross-platform recommendations on ``python`` and ``python2`` in shebangs increasingly unreliable. Current Rationale ================= As of 2019, nearly all new systems include Python 3 and the ``python3`` command. This makes the ``python3`` command the best general choice for code that can run on either Python 3.x or 2.x, even though it is not available everywhere. The recommendation is skewed toward current and future systems, leaving behind ?*old systems*? (like RHEL 6 or default Python installed on macOS). On these systems, Python software is rarely updated and any recommendations this PEP makes would likely be ignored. Also, since distributors often ignored recommendations the PEP gave regarding the ``python`` command (for what they saw as legitimate special needs), this PEP now gives them broad control over the command. Correspondingly, users are advised to not use the ``python`` command in cross-platform code. Instead, this PEP specifies the expected behavior of the ``python3`` and ``python2`` commands, which is not controversial. From guido at python.org Fri Apr 12 11:13:42 2019 From: guido at python.org (Guido van Rossum) Date: Fri, 12 Apr 2019 08:13:42 -0700 Subject: [Python-Dev] Update PEP 394: Distributions can choose what does python command mean In-Reply-To: <4c143e85-be0c-7995-0129-30d19b9bb690@gmail.com> References: <82dd4715-9709-1fcf-769f-4902fd381578@redhat.com> <4c143e85-be0c-7995-0129-30d19b9bb690@gmail.com> Message-ID: I think this is reasonable. Thanks for making the rationale clear! 
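(For script authors wondering what this means in practice, a minimal sketch reflecting the recommendation quoted below, not new PEP text: ask for python3 explicitly rather than relying on the unversioned python command.)

#!/usr/bin/env python3
# Prefers the explicit "python3" command, so behaviour does not depend on
# what a given distribution chose to do with the bare "python" name.
import sys
print("running on", sys.version.split()[0])
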
On Fri, Apr 12, 2019 at 8:02 AM Petr Viktorin wrote: > On 4/12/19 4:53 PM, Miro Hron?ok wrote: > > Hello. > > > > Based on discussions in [1], Petr Viktorin and me have drafted a new > > update [2] to the PEP 394 (The "python" Command on Unix-Like Systems). > > > > The update gives distributors the opportunity to decide where does the > > "python" command lead to, whether it is present etc. > > > > Please, see the PR [2] for the suggested changes. > > > > [1]: > https://mail.python.org/pipermail/python-dev/2019-February/156272.html > > [2]: https://github.com/python/peps/pull/989 > > The text is available at > https://github.com/hroncok/peps/blob/pep394-2019/pep-0394.txt > > As a summary, I'll paste the rationale sections here: > > History of this PEP > =================== > > In 2011, the majority of distributions > aliased the ``python`` command to Python 2, but some started switching it > to > Python 3 ([5]_). As some of the former distributions did not provide a > ``python2`` command by default, there was previously no way for Python 2 > code > (or any code that invokes the Python 2 interpreter directly rather than via > ``sys.executable``) to reliably run on all Unix-like systems without > modification, as the ``python`` command would invoke the wrong interpreter > version on some systems, and the ``python2`` command would fail completely > on others. This PEP originally provided a very simple mechanism > to restore cross-platform support, with minimal additional work required > on the part of distribution maintainers. Simplified, the recommendation > was: > > 1. The ``python`` command was preferred for code compatible with both > Python 2 and 3 (since it was available on all systems, even those that > already aliased it to Python 3). > 2. The ``python`` command should always invoke Python 2 (to prevent > hard-to-diagnose errors when Python 2 code is run on Python 3). > 3. The ``python2`` and ``python3`` commands should be available to specify > the version explicitly. > > However, these recommendations implicitly assumed that Python 2 would > always be > available. As Python 2 is nearing its end of life in 2020 (PEP 373, PEP > 404), > distributions are making Python 2 optional or removing entirely. > This means either removing the ``python`` command or switching it to invoke > Python 3, invalidating respectively the first or second recommendation. > Also, some distributors decided that their users are better served by > ignoring the PEP's recommendations, making the PEP's supposedly > cross-platform recommendations on ``python`` and ``python2`` in shebangs > increasingly unreliable. > > > Current Rationale > ================= > > As of 2019, nearly all new systems include Python 3 and the ``python3`` > command. This makes the ``python3`` command the best general choice for > code that can run on either Python 3.x or 2.x, even though it is not > available everywhere. > > The recommendation is skewed toward current and future systems, leaving > behind ?*old systems*? (like RHEL 6 or default Python installed on macOS). > On these systems, Python software is rarely updated and any recommendations > this PEP makes would likely be ignored. > > Also, since distributors often ignored recommendations the PEP gave > regarding the ``python`` command (for what they saw as legitimate special > needs), this PEP now gives them broad control over the command. > Correspondingly, users are advised to not use the ``python`` command > in cross-platform code. 
> Instead, this PEP specifies the expected behavior of the ``python3`` and > ``python2`` commands, which is not controversial. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (python.org/~guido) *Pronouns: he/him/his **(why is my pronoun here?)* -------------- next part -------------- An HTML attachment was scrubbed... URL: From storchaka at gmail.com Fri Apr 12 11:17:37 2019 From: storchaka at gmail.com (Serhiy Storchaka) Date: Fri, 12 Apr 2019 18:17:37 +0300 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: 12.04.19 17:40, Guido van Rossum ????: > So let's leave it off > by default even in debug builds. But let's not delete the macros. Maybe switch it on (together with other disabled by default options) on some fast buildbot? From storchaka at gmail.com Fri Apr 12 11:35:00 2019 From: storchaka at gmail.com (Serhiy Storchaka) Date: Fri, 12 Apr 2019 18:35:00 +0300 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: Message-ID: 12.04.19 16:44, Inada Naoki ????: > When creating many dicts with same keys, dict need to > lookup internal hash table while inserting each keys. > > It is costful operation. If we can reuse existing keys of dict, > we can skip this inserting cost. > > Additionally, we have "Key-Sharing Dictionary (PEP 412)". > When all keys are string, many dict can share one key. > It reduces memory consumption. It looks contrary to simplification made in Python 3 when we get rid of some more efficient lists in favor of more general iterators. If this is a common case we can add an invisible optimization for dict(zip(keys, values)), especially if keys is a key-sharing dictionary. This will benefit all users without the need to rewrite the code to use the new special method. The interface of dict is already overloaded. It contains many methods which most users use rarely (and therefore which are not kept in the working set of memory). From songofacandy at gmail.com Fri Apr 12 12:07:23 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Sat, 13 Apr 2019 01:07:23 +0900 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: Message-ID: On Fri, Apr 12, 2019 at 11:31 PM Victor Stinner wrote: > > Nice optimization! I have questions on the proposed API. > > > with_values(self, iterable, /) > > Create a new dictionary with keys from this dict and values from iterable. > > > > When length of iterable is different from len(self), ValueError is raised. > > This method does not support dict subclass. > > In short, mydict.with_values(values) behaves as > dict(zip(mydict.keys(), values)), but is more efficient? Yes. But unlike zip, keys() and values must have exactly same length. > > The method rely on the fact that dict is preserving key insertion order, right? > Yes. > > This might be usable for: > > > > * csv.DictReader > > * namedtuple._asdict() > > * DB-API 2.0 implementations: (e.g. DictCursor of mysqlclient-python) > > I guess that a new dict constructor taken keys and values like > dict.from_keys_and_values(keys, values) would work, but would not > benefit from the dict key-sharing optimization? > I don't like more overloading. And this API is specialized to build multiple dicts, not one dict. 
So I want to have dedicated API for it. > Would it be possible to implement the key-sharing optimization using a > dict.from_keys_and_values(mydict.keys(), values) method: detect that > keys are owned by a dict, and so create a new dict linked to the keys > dict? A dict view contains a reference to the iterated dict > (dictiterobject.di_dict). I think it is possible. > > I'm fine with dict.with_values() API, but I'm asking if it could be > written differently. > > Victor I implemented it as instance method of dict because it may modify the dict internally (at first invocation). -- Inada Naoki From songofacandy at gmail.com Fri Apr 12 12:17:10 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Sat, 13 Apr 2019 01:17:10 +0900 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: Message-ID: On Sat, Apr 13, 2019 at 12:38 AM Serhiy Storchaka wrote: > > It looks contrary to simplification made in Python 3 when we get rid of > some more efficient lists in favor of more general iterators. > Yes. This is API for special use case creates many dict having same keys, like csv.DictReader. It is not good design for general purpose. strings module has strings.Template class. But I don't want to add dicts module. Maybe, collections.DictBuilder may be another option. e.g. >>> from collections import DictBuilder >>> builder = DictBuilder(tuple("abc")) >>> builder.build(range(3)) {"a": 0, "b": 1, "c": 2} > If this is a common case we can add an invisible optimization for > dict(zip(keys, values)), especially if keys is a key-sharing dictionary. > This will benefit all users without the need to rewrite the code to use > the new special method. But this optimization may slow down when creating one dict... > > The interface of dict is already overloaded. It contains many methods > which most users use rarely (and therefore which are not kept in the > working set of memory). Yes. -- Inada Naoki From J.Demeyer at UGent.be Fri Apr 12 12:19:36 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Fri, 12 Apr 2019 18:19:36 +0200 Subject: [Python-Dev] Removing PID check from signal handler Message-ID: <5CB0BA98.4070102@UGent.be> The signal handler (that receives signals from the OS) in Python starts with a check if (getpid() == main_pid) Looking at the comments, the intent was to do a check for the main *thread* but this is checking the *process* id. So this condition is basically always true. Therefore, I suggest to remove it in https://bugs.python.org/issue36601 If you have any objections or comments, I suggest to post them to that bpo. Jeroen. From brett at python.org Fri Apr 12 13:16:11 2019 From: brett at python.org (Brett Cannon) Date: Fri, 12 Apr 2019 10:16:11 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: Message-ID: On Fri, Apr 12, 2019 at 8:35 AM Serhiy Storchaka wrote: > 12.04.19 16:44, Inada Naoki ????: > > When creating many dicts with same keys, dict need to > > lookup internal hash table while inserting each keys. > > > > It is costful operation. If we can reuse existing keys of dict, > > we can skip this inserting cost. > > > > Additionally, we have "Key-Sharing Dictionary (PEP 412)". > > When all keys are string, many dict can share one key. > > It reduces memory consumption. > > It looks contrary to simplification made in Python 3 when we get rid of > some more efficient lists in favor of more general iterators. > > If this is a common case I think that "if" is my big sticking point. 
I don't think I've ever had a need for this and the zip() solution was what I originally thought of when I realized what the method was meant to do (which wasn't obvious to me initially). This doesn't strike me as needing an optimization through a dedicated method. -Brett > we can add an invisible optimization for > dict(zip(keys, values)), especially if keys is a key-sharing dictionary. > This will benefit all users without the need to rewrite the code to use > the new special method. > > The interface of dict is already overloaded. It contains many methods > which most users use rarely (and therefore which are not kept in the > working set of memory). > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/brett%40python.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve.dower at python.org Fri Apr 12 14:05:03 2019 From: steve.dower at python.org (Steve Dower) Date: Fri, 12 Apr 2019 11:05:03 -0700 Subject: [Python-Dev] Removing PID check from signal handler In-Reply-To: <5CB0BA98.4070102@UGent.be> References: <5CB0BA98.4070102@UGent.be> Message-ID: <8f8399d0-5a6b-5b72-3b24-183c800c31a6@python.org> On 12Apr.2019 0919, Jeroen Demeyer wrote: > The signal handler (that receives signals from the OS) in Python starts > with a check > > ??? if (getpid() == main_pid) > > Looking at the comments, the intent was to do a check for the main > *thread* but this is checking the *process* id. So this condition is > basically always true. Therefore, I suggest to remove it in > https://bugs.python.org/issue36601 > > If you have any objections or comments, I suggest to post them to that bpo. To add a little more context, the check was added about 25 years ago as a "hack" for some reason that we can't figure out anymore. So if you are a historian of ancient operating systems and know of one that might have raised signal handlers in a different process from the one where it was registered, we'd love to hear from you. Cheers, Steve From status at bugs.python.org Fri Apr 12 14:07:56 2019 From: status at bugs.python.org (Python tracker) Date: Fri, 12 Apr 2019 18:07:56 +0000 (UTC) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20190412180756.A925F52B212@bugs.ams1.psf.io> ACTIVITY SUMMARY (2019-04-05 - 2019-04-12) Python tracker at https://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. Issues counts and deltas: open 7063 ( +7) closed 41307 (+76) total 48370 (+83) Open issues with patches: 2815 Issues opened (60) ================== #25160: Stop using deprecated imp module; imp should now emit a real D https://bugs.python.org/issue25160 reopened by ncoghlan #35934: Add socket.create_server() utility function https://bugs.python.org/issue35934 reopened by vstinner #36235: distutils.sysconfig.customize_compiler() overrides CFLAGS var https://bugs.python.org/issue36235 reopened by vstinner #36537: except statement block incorrectly assumes end of scope(?). 
https://bugs.python.org/issue36537 opened by Saim Raza #36538: _thread.interrupt_main() no longer interrupts Lock.wait https://bugs.python.org/issue36538 opened by gregory.p.smith #36540: PEP 570: Python Positional-Only Parameters https://bugs.python.org/issue36540 opened by pablogsal #36541: Make lib2to3 grammar more closely match Python https://bugs.python.org/issue36541 opened by thatch #36542: Allow to overwrite the signature for Python functions https://bugs.python.org/issue36542 opened by serhiy.storchaka #36543: Remove old-deprecated ElementTree features (part 2) https://bugs.python.org/issue36543 opened by serhiy.storchaka #36545: Python 3.5 OOM during test_socket on make https://bugs.python.org/issue36545 opened by dekken #36546: Add quantiles() to the statistics module https://bugs.python.org/issue36546 opened by rhettinger #36548: Make the repr of re flags more readable https://bugs.python.org/issue36548 opened by serhiy.storchaka #36550: Avoid creating AttributeError exceptions in the debugger https://bugs.python.org/issue36550 opened by blueyed #36551: Optimize list comprehensions with preallocate size and protect https://bugs.python.org/issue36551 opened by anthony shaw #36552: Replace OverflowError with ValueError when calculating length https://bugs.python.org/issue36552 opened by anthony shaw #36553: inspect.is_decorator_call(frame) https://bugs.python.org/issue36553 opened by smarie #36556: Trashcan causing duplicated __del__ calls https://bugs.python.org/issue36556 opened by jdemeyer #36557: Python (Launcher)3.7.3 CMDLine install/uninstall https://bugs.python.org/issue36557 opened by mattcher_h #36558: Change time.mktime() return type from float to int? https://bugs.python.org/issue36558 opened by vstinner #36560: test_functools leaks randomly 1 memory block https://bugs.python.org/issue36560 opened by vstinner #36563: pdbrc is read twice if current directory is the home directory https://bugs.python.org/issue36563 opened by blueyed #36564: Infinite loop with short maximum line lengths in EmailPolicy https://bugs.python.org/issue36564 opened by p-ganssle #36567: DOC: manpage directive doesn't create hyperlink https://bugs.python.org/issue36567 opened by cheryl.sabella #36568: Typo in socket.CAN_RAW_FD_FRAMES library documentation https://bugs.python.org/issue36568 opened by Carl Cerecke #36569: @staticmethod seems to work with setUpClass, but docs say it s https://bugs.python.org/issue36569 opened by Peter de Blanc #36572: python-snappy install issue during Crossbar install with Pytho https://bugs.python.org/issue36572 opened by telatoa #36573: zipfile zipfile.BadZipFile: Bad CRC-32 for file '11_02_2019.pd https://bugs.python.org/issue36573 opened by Jozef Cernak #36576: Some test_ssl and test_asyncio tests fail with OpenSSL 1.1.1 o https://bugs.python.org/issue36576 opened by vstinner #36580: unittest.mock does not understand dataclasses https://bugs.python.org/issue36580 opened by John Parejko2 #36581: __dir__ on unittest.mock not safe for all spec types https://bugs.python.org/issue36581 opened by Dylan Semler #36582: collections.UserString encode method returns a string https://bugs.python.org/issue36582 opened by trey #36583: Do not swallow exceptions in the _ssl module https://bugs.python.org/issue36583 opened by serhiy.storchaka #36585: test_posix.py fails due to unsupported RWF_HIPRI https://bugs.python.org/issue36585 opened by jdemeyer #36586: multiprocessing.Queue.close doesn't behave as documented https://bugs.python.org/issue36586 opened by graingert #36587: 
race in logging code when fork() https://bugs.python.org/issue36587 opened by cagney #36589: Incorrect error handling in curses.update_lines_cols() https://bugs.python.org/issue36589 opened by ZackerySpytz #36590: Add Bluetooth RFCOMM Support for Windows https://bugs.python.org/issue36590 opened by topnotcher #36593: Trace function interferes with MagicMock isinstance? https://bugs.python.org/issue36593 opened by nedbat #36594: Undefined behavior due to incorrect usage of %p in format stri https://bugs.python.org/issue36594 opened by ZackerySpytz #36595: IDLE: Add search to Squeezed Output text viewer. https://bugs.python.org/issue36595 opened by Shane Smith #36596: tarfile module considers anything starting with 512 bytes of z https://bugs.python.org/issue36596 opened by cks #36600: re-enable test in nntplib https://bugs.python.org/issue36600 opened by Marcin Niemira #36601: signals can be caught by any thread https://bugs.python.org/issue36601 opened by jdemeyer #36602: Recursive directory list with pathlib.Path.iterdir https://bugs.python.org/issue36602 opened by Epic_Wink #36603: should pty.openpty() set pty/tty inheritable? https://bugs.python.org/issue36603 opened by cagney #36605: make tags should also parse Modules/_io/*.c and Modules/_io/*. https://bugs.python.org/issue36605 opened by vstinner #36606: calling super() causes __class__ to be not defined when sys.se https://bugs.python.org/issue36606 opened by xtreak #36607: asyncio.all_tasks() crashes if asyncio is used in multiple thr https://bugs.python.org/issue36607 opened by Nick Davies #36608: Replace bundled pip and setuptools with a downloader in the en https://bugs.python.org/issue36608 opened by webknjaz #36609: activate.ps1 in venv for Windows should encoded with BOM https://bugs.python.org/issue36609 opened by ????????? #36610: os.sendfile can return EINVAL on Solaris https://bugs.python.org/issue36610 opened by kulikjak #36611: Debug memory allocators: remove useless "serialno" field to re https://bugs.python.org/issue36611 opened by vstinner #36612: Unittest document is not clear on SetUpClass calls https://bugs.python.org/issue36612 opened by vrpolakatcisco #36613: asyncio._wait() don't remove callback in case of exception https://bugs.python.org/issue36613 opened by gescheit #36614: Popen output on windows server 2019 https://bugs.python.org/issue36614 opened by weispinc #36615: why call _Py_set_inheritable(0) from os.open() when O_CLOEXEC? https://bugs.python.org/issue36615 opened by cagney #36616: Optimize thread state handling in function call code https://bugs.python.org/issue36616 opened by jdemeyer #36617: The rich comparison operators are second class citizens https://bugs.python.org/issue36617 opened by bup #36618: clang expects memory aligned on 16 bytes, but pymalloc aligns https://bugs.python.org/issue36618 opened by vstinner #36619: when is os.posix_spawn(setsid=True) safe? https://bugs.python.org/issue36619 opened by cagney Most recent 15 issues with no replies (15) ========================================== #36619: when is os.posix_spawn(setsid=True) safe? 
https://bugs.python.org/issue36619 #36618: clang expects memory aligned on 16 bytes, but pymalloc aligns https://bugs.python.org/issue36618 #36616: Optimize thread state handling in function call code https://bugs.python.org/issue36616 #36613: asyncio._wait() don't remove callback in case of exception https://bugs.python.org/issue36613 #36607: asyncio.all_tasks() crashes if asyncio is used in multiple thr https://bugs.python.org/issue36607 #36606: calling super() causes __class__ to be not defined when sys.se https://bugs.python.org/issue36606 #36603: should pty.openpty() set pty/tty inheritable? https://bugs.python.org/issue36603 #36596: tarfile module considers anything starting with 512 bytes of z https://bugs.python.org/issue36596 #36594: Undefined behavior due to incorrect usage of %p in format stri https://bugs.python.org/issue36594 #36590: Add Bluetooth RFCOMM Support for Windows https://bugs.python.org/issue36590 #36589: Incorrect error handling in curses.update_lines_cols() https://bugs.python.org/issue36589 #36585: test_posix.py fails due to unsupported RWF_HIPRI https://bugs.python.org/issue36585 #36583: Do not swallow exceptions in the _ssl module https://bugs.python.org/issue36583 #36572: python-snappy install issue during Crossbar install with Pytho https://bugs.python.org/issue36572 #36567: DOC: manpage directive doesn't create hyperlink https://bugs.python.org/issue36567 Most recent 15 issues waiting for review (15) ============================================= #36618: clang expects memory aligned on 16 bytes, but pymalloc aligns https://bugs.python.org/issue36618 #36613: asyncio._wait() don't remove callback in case of exception https://bugs.python.org/issue36613 #36612: Unittest document is not clear on SetUpClass calls https://bugs.python.org/issue36612 #36611: Debug memory allocators: remove useless "serialno" field to re https://bugs.python.org/issue36611 #36610: os.sendfile can return EINVAL on Solaris https://bugs.python.org/issue36610 #36608: Replace bundled pip and setuptools with a downloader in the en https://bugs.python.org/issue36608 #36605: make tags should also parse Modules/_io/*.c and Modules/_io/*. https://bugs.python.org/issue36605 #36602: Recursive directory list with pathlib.Path.iterdir https://bugs.python.org/issue36602 #36601: signals can be caught by any thread https://bugs.python.org/issue36601 #36600: re-enable test in nntplib https://bugs.python.org/issue36600 #36594: Undefined behavior due to incorrect usage of %p in format stri https://bugs.python.org/issue36594 #36593: Trace function interferes with MagicMock isinstance? https://bugs.python.org/issue36593 #36590: Add Bluetooth RFCOMM Support for Windows https://bugs.python.org/issue36590 #36589: Incorrect error handling in curses.update_lines_cols() https://bugs.python.org/issue36589 #36585: test_posix.py fails due to unsupported RWF_HIPRI https://bugs.python.org/issue36585 Top 10 most discussed issues (10) ================================= #36551: Optimize list comprehensions with preallocate size and protect https://bugs.python.org/issue36551 24 msgs #30458: [security][CVE-2019-9740][CVE-2019-9947] HTTP Header Injection https://bugs.python.org/issue30458 13 msgs #36560: test_functools leaks randomly 1 memory block https://bugs.python.org/issue36560 13 msgs #36537: except statement block incorrectly assumes end of scope(?). 
https://bugs.python.org/issue36537 9 msgs #33608: Add a cross-interpreter-safe mechanism to indicate that an obj https://bugs.python.org/issue33608 8 msgs #36533: logging regression with threading + fork are mixed in 3.7.1rc2 https://bugs.python.org/issue36533 8 msgs #36389: Add gc.enable_object_debugger(): detect corrupted Python objec https://bugs.python.org/issue36389 7 msgs #36611: Debug memory allocators: remove useless "serialno" field to re https://bugs.python.org/issue36611 7 msgs #18748: io.IOBase destructor silence I/O error on close() by default https://bugs.python.org/issue18748 6 msgs #36573: zipfile zipfile.BadZipFile: Bad CRC-32 for file '11_02_2019.pd https://bugs.python.org/issue36573 6 msgs Issues closed (73) ================== #2281: Enhanced cPython profiler with high-resolution timer https://bugs.python.org/issue2281 closed by inada.naoki #12910: urllib.quote quotes too many chars, e.g., '()' https://bugs.python.org/issue12910 closed by orsenthil #14017: Make it easy to create a new TextIOWrapper based on an existin https://bugs.python.org/issue14017 closed by ncoghlan #16712: collections.abc.Sequence should not provide __reversed__ https://bugs.python.org/issue16712 closed by inada.naoki #17396: modulefinder fails if module contains syntax error https://bugs.python.org/issue17396 closed by ncoghlan #17561: Add socket.bind_socket() convenience function https://bugs.python.org/issue17561 closed by giampaolo.rodola #19417: Bdb: add a unittest file (test.test_bdb) https://bugs.python.org/issue19417 closed by xdegaye #19476: Add a dedicated specification for module "reloading" to the la https://bugs.python.org/issue19476 closed by eric.snow #21318: sdist fails with symbolic links do non-existing files https://bugs.python.org/issue21318 closed by cheryl.sabella #25922: canceling a repair install breaks the ability to uninstall, re https://bugs.python.org/issue25922 closed by cheryl.sabella #27181: Add geometric mean to `statistics` module https://bugs.python.org/issue27181 closed by rhettinger #28351: statistics.geometric_mean can enter infinite loop for Decimal https://bugs.python.org/issue28351 closed by cheryl.sabella #28626: Tutorial: rearrange discussion of output formatting to encoura https://bugs.python.org/issue28626 closed by cheryl.sabella #29209: Remove old-deprecated ElementTree features https://bugs.python.org/issue29209 closed by serhiy.storchaka #29707: os.path.ismount() always returns false for mount --bind on sam https://bugs.python.org/issue29707 closed by christian.heimes #30134: BytesWarning is missing from the documents https://bugs.python.org/issue30134 closed by inada.naoki #30661: Support tarfile.PAX_FORMAT in shutil.make_archive https://bugs.python.org/issue30661 closed by ncoghlan #31155: Encode set, frozenset, bytearray, and iterators as json arrays https://bugs.python.org/issue31155 closed by inada.naoki #31512: Add non-elevated symlink support for dev mode Windows 10 https://bugs.python.org/issue31512 closed by steve.dower #32534: Speed-up list.insert: use memmove() https://bugs.python.org/issue32534 closed by inada.naoki #33228: Use Random.choices in tempfile https://bugs.python.org/issue33228 closed by inada.naoki #33456: site.py: by default, a virtual environment is *not* isolated f https://bugs.python.org/issue33456 closed by vinay.sajip #33461: json.loads(encoding=) does not emit deprecation warn https://bugs.python.org/issue33461 closed by inada.naoki #33722: Document builtins in mock_open https://bugs.python.org/issue33722 closed by 
jcrotts #34060: regrtest: log "CPU usage" on Windows https://bugs.python.org/issue34060 closed by vstinner #34139: Remove stale unix datagram socket before binding https://bugs.python.org/issue34139 closed by asvetlov #34144: venv activate.bat reset codepage fails on windows 10 https://bugs.python.org/issue34144 closed by cheryl.sabella #34160: ElementTree not preserving attribute order https://bugs.python.org/issue34160 closed by scoder #34805: Explicitly specify `MyClass.__subclasses__()` returns classes https://bugs.python.org/issue34805 closed by cheryl.sabella #35376: modulefinder skips nested modules with same name as top-level https://bugs.python.org/issue35376 closed by ncoghlan #35416: Fix potential resource warnings in distutils https://bugs.python.org/issue35416 closed by inada.naoki #35488: pathlib Path.match does not behave as described https://bugs.python.org/issue35488 closed by anthony shaw #35848: readinto is not a method on io.TextIOBase https://bugs.python.org/issue35848 closed by benjamin.peterson #35906: [CVE-2019-9947] Header Injection in urllib https://bugs.python.org/issue35906 closed by gregory.p.smith #35936: Give modulefinder some much-needed updates. https://bugs.python.org/issue35936 closed by ncoghlan #36050: Why does http.client.HTTPResponse._safe_read use MAXAMOUNT https://bugs.python.org/issue36050 closed by inada.naoki #36378: Add support to load from paths to json.load https://bugs.python.org/issue36378 closed by inada.naoki #36416: bytes.rpartition bug in online documentation https://bugs.python.org/issue36416 closed by inada.naoki #36495: Out-of-bounds array reads in Python/ast.c https://bugs.python.org/issue36495 closed by levkivskyi #36501: Remove POSIX.1e ACLs in tests that rely on default permissions https://bugs.python.org/issue36501 closed by Ivan.Pozdeev #36503: remove references to aix3 and aix4 in \*.py https://bugs.python.org/issue36503 closed by inada.naoki #36504: Signed integer overflow in _ctypes.c's PyCArrayType_new() https://bugs.python.org/issue36504 closed by serhiy.storchaka #36506: [security] CVE-2019-10268: An arbitrary execution vulnerabilit https://bugs.python.org/issue36506 closed by serhiy.storchaka #36513: Add support for building arm32 nuget package https://bugs.python.org/issue36513 closed by steve.dower #36527: unused parameter warnings in Include/object.h (affecting build https://bugs.python.org/issue36527 closed by inada.naoki #36532: Example of logging.formatter with new str.format style https://bugs.python.org/issue36532 closed by spaceman_spiff #36535: Windows build failure when use the code from the GitHub master https://bugs.python.org/issue36535 closed by Manjusaka #36539: Distutils VC 6.0 Errors When Using mingw-w64 GCC https://bugs.python.org/issue36539 closed by danyeaw #36544: cannot import hashlib when openssl is missing https://bugs.python.org/issue36544 closed by xdegaye #36547: bedevere is not working https://bugs.python.org/issue36547 closed by xtreak #36549: str.capitalize should titlecase the first character not upperc https://bugs.python.org/issue36549 closed by steve.dower #36554: unittest.TestCase: "subTest" cannot be used together with "deb https://bugs.python.org/issue36554 closed by dmaurer #36555: PEP484 @overload vs. 
str/bytes https://bugs.python.org/issue36555 closed by levkivskyi #36559: "import random" should import hashlib on demand (nor load Open https://bugs.python.org/issue36559 closed by rhettinger #36561: Python argparse doesn't work in the presence of my custom modu https://bugs.python.org/issue36561 closed by xtreak #36562: Can't call a method from a module built in Python C API https://bugs.python.org/issue36562 closed by jjppof #36565: Reference hunting (python3 -m test -R 3:3) doesn't work if the https://bugs.python.org/issue36565 closed by vstinner #36566: Support password masking in getpass.getpass() https://bugs.python.org/issue36566 closed by cheryl.sabella #36570: ftplib timeouts for misconfigured server https://bugs.python.org/issue36570 closed by giampaolo.rodola #36571: Lib/smtplib.py have some pep8 issues https://bugs.python.org/issue36571 closed by Marcin Niemira #36574: Error with self in python https://bugs.python.org/issue36574 closed by steven.daprano #36575: Use _PyTime_GetPerfCounter() in lsprof https://bugs.python.org/issue36575 closed by inada.naoki #36577: setup doesn't report missing _ssl and _hashlib https://bugs.python.org/issue36577 closed by christian.heimes #36578: multiprocessing pool + subprocess ValueError: empty range for https://bugs.python.org/issue36578 closed by SilentGhost #36579: test_venv: test_with_pip() hangs on PPC64 AIX 3.x https://bugs.python.org/issue36579 closed by vstinner #36584: cython nametuple TypeError https://bugs.python.org/issue36584 closed by serhiy.storchaka #36588: change sys.platform() to just "aix" for AIX https://bugs.python.org/issue36588 closed by vstinner #36591: Should be a typing.UserNamedTuple https://bugs.python.org/issue36591 closed by levkivskyi #36592: is behave different for integers in 3.6 and 3.7 https://bugs.python.org/issue36592 closed by eric.smith #36597: Travis CI: doctest failure https://bugs.python.org/issue36597 closed by inada.naoki #36598: mock side_effect should be checked for iterable not callable https://bugs.python.org/issue36598 closed by xtreak #36599: doctest document says dict order is unstable https://bugs.python.org/issue36599 closed by inada.naoki #36604: Add recipe to itertools https://bugs.python.org/issue36604 closed by rhettinger From barry at python.org Fri Apr 12 14:08:45 2019 From: barry at python.org (Barry Warsaw) Date: Fri, 12 Apr 2019 11:08:45 -0700 Subject: [Python-Dev] Update PEP 394: Distributions can choose what does python command mean In-Reply-To: <82dd4715-9709-1fcf-769f-4902fd381578@redhat.com> References: <82dd4715-9709-1fcf-769f-4902fd381578@redhat.com> Message-ID: <50534007-27B6-482D-B4D2-30D624441146@python.org> Thanks for the update. I have made one small suggestion on the PR for clarification, but otherwise the changes LGTM. -Barry > On Apr 12, 2019, at 07:53, Miro Hron?ok wrote: > > Hello. > > Based on discussions in [1], Petr Viktorin and me have drafted a new update [2] to the PEP 394 (The "python" Command on Unix-Like Systems). > > The update gives distributors the opportunity to decide where does the "python" command lead to, whether it is present etc. > > Please, see the PR [2] for the suggested changes. 
> > [1]: https://mail.python.org/pipermail/python-dev/2019-February/156272.html > [2]: https://github.com/python/peps/pull/989 > > Thanks, > -- > Miro Hron?ok > -- > Phone: +420777974800 > IRC: mhroncok > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/barry%40python.org -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: Message signed with OpenPGP URL: From vano at mail.mipt.ru Fri Apr 12 14:15:59 2019 From: vano at mail.mipt.ru (Ivan Pozdeev) Date: Fri, 12 Apr 2019 21:15:59 +0300 Subject: [Python-Dev] Removing PID check from signal handler In-Reply-To: <8f8399d0-5a6b-5b72-3b24-183c800c31a6@python.org> References: <5CB0BA98.4070102@UGent.be> <8f8399d0-5a6b-5b72-3b24-183c800c31a6@python.org> Message-ID: On 12.04.2019 21:05, Steve Dower wrote: > On 12Apr.2019 0919, Jeroen Demeyer wrote: >> The signal handler (that receives signals from the OS) in Python starts >> with a check >> >> ??? if (getpid() == main_pid) >> >> Looking at the comments, the intent was to do a check for the main >> *thread* but this is checking the *process* id. So this condition is >> basically always true. Therefore, I suggest to remove it in >> https://bugs.python.org/issue36601 >> >> If you have any objections or comments, I suggest to post them to that bpo. > To add a little more context, the check was added about 25 years ago as > a "hack" for some reason that we can't figure out anymore. > > So if you are a historian of ancient operating systems and know of one > that might have raised signal handlers in a different process from the > one where it was registered, we'd love to hear from you. According to https://www.linuxquestions.org/questions/programming-9/the-return-value-of-getpid-called-from-main-thread-and-new-thread-r-identical-624399/ , threads used to have different PIDs in the 2.4 Linux kernel. > Cheers, > Steve > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/vano%40mail.mipt.ru -- Regards, Ivan From greg.ewing at canterbury.ac.nz Fri Apr 12 18:34:08 2019 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 13 Apr 2019 10:34:08 +1200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: <5CB11260.1060708@canterbury.ac.nz> Victor Stinner wrote: > Python is used on devices with low memory (ex: 256 MiB for the whole > system). Allowing developers to use a debug build on such devices seem > to be a legit rationale for such change. Rather than removing features altogether, maybe the debug build could be split into a number of separate features that can be enabled individually? 
-- Greg From greg.ewing at canterbury.ac.nz Fri Apr 12 18:38:43 2019 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 13 Apr 2019 10:38:43 +1200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> Message-ID: <5CB11373.90607@canterbury.ac.nz> Victor Stinner wrote: > I'm not sure of what you means by "objects placed at static memory": > the double linked list of all Python objects is created at runtime. > _ob_next and _ob_prev are initialized statically to NULL. The trick of allocating extra memory in front of the object would be harder to pull off for statically allocated objects, although probably not impossible. -- Greg From vstinner at redhat.com Fri Apr 12 19:13:57 2019 From: vstinner at redhat.com (Victor Stinner) Date: Sat, 13 Apr 2019 01:13:57 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: <5CB11260.1060708@canterbury.ac.nz> References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <5CB11260.1060708@canterbury.ac.nz> Message-ID: Le sam. 13 avr. 2019 ? 00:38, Greg Ewing a ?crit : > Rather than removing features altogether, maybe the debug > build could be split into a number of separate features > that can be enabled individually? I don't propose to *remove* a feature, but just to *disable* it *by default* (when Python is compiled in debug mode): "[WIP] bpo-36465: Py_DEBUG no longer implies Py_TRACE_REFS #12615" https://github.com/python/cpython/pull/12615/files In short, my change just removes: /* Py_DEBUG implies Py_TRACE_REFS. */ #if defined(Py_DEBUG) && !defined(Py_TRACE_REFS) #define Py_TRACE_REFS #endif The feature will still be accessible if you compile Python with Py_TRACE_REFS defined. In practice, I understood that the debug build of Python is not known by all core developers, and it seems like it's mostly used by core developers. Maybe it's even only used by core developers? It's hard to say. If it's only used by core developers, I hope that all core devs know to compile Python :-) Victor -- Night gathers, and now my watch begins. It shall not end until my death. From njs at pobox.com Fri Apr 12 19:43:51 2019 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 12 Apr 2019 16:43:51 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <15ff965b-30a9-dc4b-cc60-940532f349b9@python.org> Message-ID: On Thu, Apr 11, 2019 at 8:26 AM Steve Dower wrote: > > On 10Apr2019 1917, Nathaniel Smith wrote: > > It sounds like --with-pydebug has accumulated a big grab bag of > > unrelated features, mostly stuff that was useful at some point for > > some CPython dev trying to debug CPython itself? It's clearly not > > designed with end users as the primary audience, given that no-one > > knows what it actually does and that it makes third-party extensions > > really awkward to run. If that's right then I think Victor's plan of > > to sort through what it's actually doing makes a lot of sense, > > especially if we can remove the ABI breaking stuff, since that causes > > a disproportionate amount of trouble. > > Does it really cause a "disproportionate" amount of trouble? It's > definitely not meant for anyone who isn't working on C code, whether in > CPython, an extension or a host application. 
If you want to use > third-party extensions and are not able to rebuild them, that's a very > good sign that you probably shouldn't be on the debug build at all. Well, here's what I mean by "disproportionate". Some of the costs of the ABI divergence are: - The first time I had to debug a C extension, I wasted a bunch of time trying to figure out how I was supposed to use Debian's 'python-dbg' package (the --with-pydebug build), before eventually figuring out that it was a red herring and what I actually wanted was the -dbgsym package (their equivalent of MSVC's /Zi /DEBUG files). - The extension loading machinery has extra code and complexity to track the two different ABIs. The package ecosystem does too, e.g. distutils needs to name extensions appropriately, and we need special wheel tags, and pip needs code to handle these tags: https://github.com/pypa/pip/blob/54b6a91405adc79cdb8a2954e9614d6860799ccb/src/pip/_internal/pep425tags.py#L106-L109 - If you want some of the features of --with-pydebug that don't change the ABI, then you still have to rebuild third-party extensions to get at them, and that's a significant hassle. (I could do it if I had to, but my time has value.) - Everyone who uses ctypes to access a PyObject* has to include some extra hacks to handle the difference between the regular and debug ABIs. There are a few different versions that get copy/pasted around as folklore, and they're all pretty obscure. For example: https://github.com/pallets/jinja/blob/fd89fed7456e755e33ba70674c41be5ab222e193/jinja2/debug.py#L317-L334 https://github.com/johndpope/sims4-ai-engine/blob/865212e841c716dc4364e0dba286f02af8d716e8/core/framewrapper.py#L12-L41 https://github.com/python-trio/trio/blob/862ced04e1f19287e098380ed8a0635004c36dd1/trio/_core/_multierror.py#L282 And then if you want to test this code, it means you have to add a --with-pydebug build to your CI infrastructure... I don't know how many people use Py_TRACE_REFS, but if we can't find anyone on python-dev who uses it then it must be pretty rare. If dropping Py_TRACE_REFS would let us converge the ABIs and get rid of all the stuff above, then that seems like a pretty good trade! But maybe the Windows C runtime issue will foil this... > >> The reason we ship debug Python binaries is because debug builds use a > >> different C Runtime, so if you do a debug build of an extension module > >> you're working on it won't actually work with a non-debug build of CPython. > > > > ...But this is an important point. I'd forgotten that MSVC has a habit > > of changing the entire C runtime when you turn on the compiler's > > debugging mode. > > Technically they are separate options, but most project files are > configured such that *their* Debug/Release switch affects both the > compiler options (optimization) and the linker options (C runtime linkage). So how do other projects handle this? I guess historically the main target audience for Visual Studio was folks building monolithic apps, where you can just rebuild everything with whatever options you want, and compared to that Python extensions are messier. But Python isn't the only project in this boat. Do ruby, nodejs, R, etc., all provide separate debug builds with incompatible ABIs on Windows, and propagate that information throughout their module/package ecosystem? -n -- Nathaniel J. 
Smith -- https://vorpus.org From steve.dower at python.org Fri Apr 12 20:05:24 2019 From: steve.dower at python.org (Steve Dower) Date: Fri, 12 Apr 2019 17:05:24 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <15ff965b-30a9-dc4b-cc60-940532f349b9@python.org> Message-ID: On 12Apr.2019 1643, Nathaniel Smith wrote: > On Thu, Apr 11, 2019 at 8:26 AM Steve Dower wrote: >> >> On 10Apr2019 1917, Nathaniel Smith wrote: > I don't know how many people use Py_TRACE_REFS, but if we can't find > anyone on python-dev who uses it then it must be pretty rare. If > dropping Py_TRACE_REFS would let us converge the ABIs and get rid of > all the stuff above, then that seems like a pretty good trade! But > maybe the Windows C runtime issue will foil this... The very first question I asked was whether this would let us converge the ABIs, and the answer was "no". Otherwise I'd have said go for it, despite the C runtime issues. >>>> The reason we ship debug Python binaries is because debug builds use a >>>> different C Runtime, so if you do a debug build of an extension module >>>> you're working on it won't actually work with a non-debug build of CPython. >>> >>> ...But this is an important point. I'd forgotten that MSVC has a habit >>> of changing the entire C runtime when you turn on the compiler's >>> debugging mode. >> >> Technically they are separate options, but most project files are >> configured such that *their* Debug/Release switch affects both the >> compiler options (optimization) and the linker options (C runtime linkage). > > So how do other projects handle this? I guess historically the main > target audience for Visual Studio was folks building monolithic apps, > where you can just rebuild everything with whatever options you want, > and compared to that Python extensions are messier. But Python isn't > the only project in this boat. Do ruby, nodejs, R, etc., all provide > separate debug builds with incompatible ABIs on Windows, and propagate > that information throughout their module/package ecosystem? Mostly I hear complaints about those languages *not* providing any help here. Python is renowned for having significantly better Windows support than any of them, so they're the wrong comparison to make in my opinion. Arguing that we should regress because other languages haven't caught up to us yet makes no sense. The tools that are better than Python typically don't ship debug builds either, unless you specifically request them. But they also don't leak their implementation details all over the place. If we had a better C API, we wouldn't have users who needed to match ABIs. For the most part, disabling optimizations in your own extension but using the non-debug ABI is sufficient, and if you're having to deal with other people's packages then maybe you don't have any choice (though I do know of people who have built debug versions of numpy before - turns out Windows developers are often just as capable as non-Windows developers when it comes to building things ;) ). And yes, they could also build CPython from source as well to get the debug ABI, or get the debug symbols, but I saw enough need that I decided it was worth the effort to just solve that problem. 250k downloads a month is enough to justify it for me. 
Not to bring the packaging discussions to another venue, but maybe this is yet another area we need to stop pretending that we're able to solve every single problem with just the tools we already have available? People who want debug builds of packages can build them themselves, even numpy and scipy, they don't need us to preemptively do all their work for them. But we can (and should) help short-cut unnecessary effort or research by providing helpful tools and instruction. Cheers, Steve From njs at pobox.com Fri Apr 12 21:19:03 2019 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 12 Apr 2019 18:19:03 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <15ff965b-30a9-dc4b-cc60-940532f349b9@python.org> Message-ID: On Fri, Apr 12, 2019 at 5:05 PM Steve Dower wrote: > > On 12Apr.2019 1643, Nathaniel Smith wrote: > > On Thu, Apr 11, 2019 at 8:26 AM Steve Dower wrote: > >> > >> On 10Apr2019 1917, Nathaniel Smith wrote: > > I don't know how many people use Py_TRACE_REFS, but if we can't find > > anyone on python-dev who uses it then it must be pretty rare. If > > dropping Py_TRACE_REFS would let us converge the ABIs and get rid of > > all the stuff above, then that seems like a pretty good trade! But > > maybe the Windows C runtime issue will foil this... > > The very first question I asked was whether this would let us converge > the ABIs, and the answer was "no". > > Otherwise I'd have said go for it, despite the C runtime issues. I don't see that in the thread... just Victor saying he isn't sure whether there might be other ABI incompatibilities lurking that he hasn't found yet. Did I miss something? I'm mostly interested in this because of the possibility of converging the ABIs. If you think that the C runtime thing isn't a blocker for that, then that's useful information. Though obviously we still need to figure out whether there are any other blockers :-). > >>>> The reason we ship debug Python binaries is because debug builds use a > >>>> different C Runtime, so if you do a debug build of an extension module > >>>> you're working on it won't actually work with a non-debug build of CPython. > >>> > >>> ...But this is an important point. I'd forgotten that MSVC has a habit > >>> of changing the entire C runtime when you turn on the compiler's > >>> debugging mode. > >> > >> Technically they are separate options, but most project files are > >> configured such that *their* Debug/Release switch affects both the > >> compiler options (optimization) and the linker options (C runtime linkage). > > > > So how do other projects handle this? I guess historically the main > > target audience for Visual Studio was folks building monolithic apps, > > where you can just rebuild everything with whatever options you want, > > and compared to that Python extensions are messier. But Python isn't > > the only project in this boat. Do ruby, nodejs, R, etc., all provide > > separate debug builds with incompatible ABIs on Windows, and propagate > > that information throughout their module/package ecosystem? > > Mostly I hear complaints about those languages *not* providing any help > here. Python is renowned for having significantly better Windows support > than any of them, so they're the wrong comparison to make in my opinion. > Arguing that we should regress because other languages haven't caught up > to us yet makes no sense. 
> > The tools that are better than Python typically don't ship debug builds > either, unless you specifically request them. But they also don't leak > their implementation details all over the place. If we had a better C > API, we wouldn't have users who needed to match ABIs. Do you happen to have a list of places where the C API leaks details of the underlying CRT? (I'm mostly curious because whenever I've looked my conclusion was essentially: "Well....... I don't see any places that are *definitely* broken, so maybe mixing CRTs is fine? but I have zero confidence that I caught everything, so probably better to play it safe?". At least on py3 ? I know the py2 C API was definitely broken if you mixed CRTs, because of the exposed FILE*.) > For the most part, disabling optimizations in your own extension but > using the non-debug ABI is sufficient, and if you're having to deal with > other people's packages then maybe you don't have any choice (though I > do know of people who have built debug versions of numpy before - turns > out Windows developers are often just as capable as non-Windows > developers when it comes to building things ;) I'm not sure why you think I was implying otherwise? I'm sorry if you thought I was attacking your users or something. I did say that I thought most users downloading the debug builds were probably confused about what they were actually getting, but I didn't mean because they were stupid Windows users, I meant because the debug builds are so confusing that even folks on the Python core team are confused about what they're actually getting. -n -- Nathaniel J. Smith -- https://vorpus.org From vstinner at redhat.com Fri Apr 12 21:35:42 2019 From: vstinner at redhat.com (Victor Stinner) Date: Sat, 13 Apr 2019 03:35:42 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <15ff965b-30a9-dc4b-cc60-940532f349b9@python.org> Message-ID: >> The very first question I asked was whether this would let us converge >> the ABIs, and the answer was "no". The answer is yes and it's my primary goal. See my first email: "This change makes the debug build ABI closer to the release build ABI". To be honest, I am now lost in this long thread :-) I don't recall why I started to argue so much about the memory footprint, it's not really the main point here. Victor >> Otherwise I'd have said go for it, despite the C runtime issues. > > I don't see that in the thread... just Victor saying he isn't sure > whether there might be other ABI incompatibilities lurking that he > hasn't found yet. Did I miss something? > > I'm mostly interested in this because of the possibility of converging > the ABIs. If you think that the C runtime thing isn't a blocker for > that, then that's useful information. Though obviously we still need > to figure out whether there are any other blockers :-). > >> >>>> The reason we ship debug Python binaries is because debug builds use a >> >>>> different C Runtime, so if you do a debug build of an extension module >> >>>> you're working on it won't actually work with a non-debug build of CPython. >> >>> >> >>> ...But this is an important point. I'd forgotten that MSVC has a habit >> >>> of changing the entire C runtime when you turn on the compiler's >> >>> debugging mode. 
>> >> >> >> Technically they are separate options, but most project files are >> >> configured such that *their* Debug/Release switch affects both the >> >> compiler options (optimization) and the linker options (C runtime linkage). >> > >> > So how do other projects handle this? I guess historically the main >> > target audience for Visual Studio was folks building monolithic apps, >> > where you can just rebuild everything with whatever options you want, >> > and compared to that Python extensions are messier. But Python isn't >> > the only project in this boat. Do ruby, nodejs, R, etc., all provide >> > separate debug builds with incompatible ABIs on Windows, and propagate >> > that information throughout their module/package ecosystem? >> >> Mostly I hear complaints about those languages *not* providing any help >> here. Python is renowned for having significantly better Windows support >> than any of them, so they're the wrong comparison to make in my opinion. >> Arguing that we should regress because other languages haven't caught up >> to us yet makes no sense. >> >> The tools that are better than Python typically don't ship debug builds >> either, unless you specifically request them. But they also don't leak >> their implementation details all over the place. If we had a better C >> API, we wouldn't have users who needed to match ABIs. > > Do you happen to have a list of places where the C API leaks details > of the underlying CRT? > > (I'm mostly curious because whenever I've looked my conclusion was > essentially: "Well....... I don't see any places that are *definitely* > broken, so maybe mixing CRTs is fine? but I have zero confidence that > I caught everything, so probably better to play it safe?". At least on > py3 ? I know the py2 C API was definitely broken if you mixed CRTs, > because of the exposed FILE*.) > >> For the most part, disabling optimizations in your own extension but >> using the non-debug ABI is sufficient, and if you're having to deal with >> other people's packages then maybe you don't have any choice (though I >> do know of people who have built debug versions of numpy before - turns >> out Windows developers are often just as capable as non-Windows >> developers when it comes to building things ;) > > I'm not sure why you think I was implying otherwise? I'm sorry if you > thought I was attacking your users or something. I did say that I > thought most users downloading the debug builds were probably confused > about what they were actually getting, but I didn't mean because they > were stupid Windows users, I meant because the debug builds are so > confusing that even folks on the Python core team are confused about > what they're actually getting. > > -n > > -- > Nathaniel J. Smith -- https://vorpus.org > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/vstinner%40redhat.com > -- Night gathers, and now my watch begins. It shall not end until my death. -------------- next part -------------- An HTML attachment was scrubbed... URL: From xdegaye at gmail.com Sat Apr 13 12:42:17 2019 From: xdegaye at gmail.com (Xavier de Gaye) Date: Sat, 13 Apr 2019 18:42:17 +0200 Subject: [Python-Dev] duplicate method names in tests Message-ID: The last post [1] in issue bpo-16079 lists the methods in Lib/test that have duplicate names and that should be fixed. 
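The reason these are worth fixing is that Python raises no error for a duplicate method name: the second definition silently rebinds the name, so the first test never runs. A small illustration (the class and method names here are made up):

    import unittest

    class ExampleTests(unittest.TestCase):
        def test_feature(self):
            self.assertEqual(1 + 1, 2)

        def test_feature(self):  # same name: silently replaces the test above
            self.assertTrue(True)

    # Test discovery only ever sees the second test_feature; the first
    # assertion is never executed.
    if __name__ == "__main__":
        unittest.main()
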
Xavier [1] https://bugs.python.org/issue16079#msg340168 From mark at hotpy.org Sun Apr 14 07:30:48 2019 From: mark at hotpy.org (Mark Shannon) Date: Sun, 14 Apr 2019 12:30:48 +0100 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> Message-ID: Hi, Petr On 10/04/2019 5:25 pm, Petr Viktorin wrote: > Hello! > I've had time for a more thorough reading of PEP 590 and the reference > implementation. Thank you for the work! > Overall, I like PEP 590's direction. I'd now describe the fundamental > difference between PEP 580 and PEP 590 as: > - PEP 580 tries to optimize all existing calling conventions > - PEP 590 tries to optimize (and expose) the most general calling > convention (i.e. fastcall) > > PEP 580 also does a number of other things, as listed in PEP 579. But I > think PEP 590 does not block future PEPs for the other items. > On the other hand, PEP 580 has a much more mature implementation -- and > that's where it picked up real-world complexity. > > PEP 590's METH_VECTORCALL is designed to handle all existing use cases, > rather than mirroring the existing METH_* varieties. > But both PEPs require the callable's code to be modified, so requiring > it to switch calling conventions shouldn't be a problem. > > Jeroen's analysis from > https://mail.python.org/pipermail/python-dev/2018-July/154238.html seems > to miss a step at the top: > > a. CALL_FUNCTION* / CALL_METHOD opcode > ????? calls > b. _PyObject_FastCallKeywords() > ????? which calls > c. _PyCFunction_FastCallKeywords() > ????? which calls > d. _PyMethodDef_RawFastCallKeywords() > ????? which calls > e. the actual C function (*ml_meth)() > > I think it's more useful to say that both PEPs bridge a->e (via > _Py_VectorCall or PyCCall_Call). > > > PEP 590 is built on a simple idea, formalizing fastcall. But it is > complicated by PY_VECTORCALL_ARGUMENTS_OFFSET and > Py_TPFLAGS_METHOD_DESCRIPTOR. > As far as I understand, both are there to avoid intermediate > bound-method object for LOAD_METHOD/CALL_METHOD. (They do try to be > general, but I don't see any other use case.) > Is that right? Not quite. Py_TPFLAGS_METHOD_DESCRIPTOR is for LOAD_METHOD/CALL_METHOD, it allows any callable descriptor to benefit from the LOAD_METHOD/CALL_METHOD optimisation. PY_VECTORCALL_ARGUMENTS_OFFSET exists so that callables that make onward calls with an additional argument can do so efficiently. The obvious example is bound-methods, but classes are at least as important. cls(*args) -> cls.new(cls, *args) -> cls.__init__(self, *args) > (I'm running out of time today, but I'll write more on why I'm asking, > and on the case I called "impossible" (while avoiding creation of a > "bound method" object), later.) > > > The way `const` is handled in the function signatures strikes me as too > fragile for public API. > I'd like if, as much as possible, PY_VECTORCALL_ARGUMENTS_OFFSET was > treated as a special optimization that extension authors can either opt > in to, or blissfully ignore. 
> That might mean: > - vectorcall, PyObject_VectorCallWithCallable, PyObject_VectorCall, > PyCall_MakeTpCall all formally take "PyObject *const *args" > - a na?ve callee must do "nargs &= ~PY_VECTORCALL_ARGUMENTS_OFFSET" > (maybe spelled as "nargs &= PY_VECTORCALL_NARGS_MASK"), but otherwise > writes compiler-enforced const-correct code. > - if PY_VECTORCALL_ARGUMENTS_OFFSET is set, the callee may modify > "args[-1]" (and only that, and after the author has read the docs). The updated minimal implementation now uses `const` arguments. Code that uses args[-1] must explicitly cast away the const. https://github.com/markshannon/cpython/blob/vectorcall-minimal/Objects/classobject.c#L55 > > > Another point I'd like some discussion on is that vectorcall function > pointer is per-instance. It looks this is only useful for type objects, > but it will add a pointer to every new-style callable object (including > functions). That seems wasteful. > Why not have a per-type pointer, and for types that need it (like > PyTypeObject), make it dispatch to an instance-specific function? Firstly, each callable has different behaviour, so it makes sense to be able to do the dispatch from caller to callee in one step. Having a per-object function pointer allows that. Secondly, callables are either large or transient. If large, then the extra few bytes makes little difference. If transient then, it matters even less. The total increase in memory is likely to be only a few tens of kilobytes, even for a large program. > > > Minor things: > - "Continued prohibition of callable classes as base classes" -- this > section reads as a final. Would you be OK wording this as something > other PEPs can tackle? > - "PyObject_VectorCall" -- this looks extraneous, and the reference > imlementation doesn't need it so far. Can it be removed, or justified? Yes, removing it makes sense. I can then rename the clumsily named "PyObject_VectorCallWithCallable" as "PyObject_VectorCall". > - METH_VECTORCALL is *not* strictly "equivalent to the currently > undocumented METH_FASTCALL | METH_KEYWORD flags" (it has the > ARGUMENTS_OFFSET complication). METH_VECTORCALL is just making METH_FASTCALL | METH_KEYWORD documented and public. Would you prefer that it has a different name to prevent confusion with over PY_VECTORCALL_ARGUMENTS_OFFSET? I don't like calling things "fast" or "new" as the names can easily become misleading. New College, Oxford is over 600 years old. Not so "new" any more :) > - I'd like to officially call this PEP "Vectorcall", see > https://github.com/python/peps/pull/984 > > > > Mark, what are your plans for next steps with PEP 590? If a volunteer > wanted to help you push this forward, what would be the best thing to > work on? The minimal implementation is also a complete implementation. Third party code can use the vectorcall protocol immediately use and be called efficiently from the interpreter. I think it is very close to being mergeable. To gain the promised performance improvements is obviously a lot more work, but can be done incrementally over the next few months. Cheers, Mark. From mark at hotpy.org Sun Apr 14 07:34:17 2019 From: mark at hotpy.org (Mark Shannon) Date: Sun, 14 Apr 2019 12:34:17 +0100 Subject: [Python-Dev] PEP 580 and PEP 590 comparison. Message-ID: Hi Petr, Thanks for spending time on this. I think the comparison of the two PEPs falls into two broad categories, performance and capability. I'll address capability first. Let's try a thought experiment. Consider PEP 580. 
It uses the old `tp_print` slot as an offset to mark the location of the CCall structure within the callable. Now suppose instead that it uses a `tp_flag` to mark the presence of an offset field and that the offset field is moved to the end of the TypeObject. This would not impact the capabilities of PEP 580. Now add a single line nargs ~= PY_VECTORCALL_ARGUMENTS_OFFSET here https://github.com/python/cpython/compare/master...jdemeyer:pep580#diff-1160d7c87cbab324fda44e7827b36cc9R570 which would make PyCCall_FastCall compatible with the PEP 590 vectorcall protocol. Now rebase the PEP 580 reference code on top of PEP 590 minimal implementation and make the vectorcall field of CFunction point to PyCCall_FastCall. The resulting hybrid is both a PEP 590 conformant implementation, and is at least as capable as the reference PEP 580 implementation. Therefore PEP 590, must be at least as capable at PEP 580. Now performance. Currently the PEP 590 implementation is intentionally minimal. It does nothing for performance. The benchmark Jeroen provides is a micro-benchmark that calls the same functions repeatedly. This is trivial and unrealistic. So, there is no real evidence either way. I will try to provide some. The point of PEP 590 is that it allows performance improvements by allowing callables more freedom of implementation. To repeat an example from an earlier email, which may have been overlooked, this code reduces the time to create ranges and small lists by about 30% https://github.com/markshannon/cpython/compare/vectorcall-minimal...markshannon:vectorcall-examples https://gist.github.com/markshannon/5cef3a74369391f6ef937d52cca9bfc8 To speed up calls to builtin functions by a measurable amount will need some work on argument clinic. I plan to have that done before PyCon in May. Cheers, Mark. From wieser.eric+numpy at gmail.com Sun Apr 14 02:54:04 2019 From: wieser.eric+numpy at gmail.com (Eric Wieser) Date: Sat, 13 Apr 2019 23:54:04 -0700 Subject: [Python-Dev] Fixing the ctypes implementation of the PEP3118 buffer interface Message-ID: I've recently been adding better support to Numpy 1.16 for interoperability with ctypes. In doing so, I came across two bugs in the implementation of the PEP3118 buffer interface within ctypes, affecting `Structure`s and arrays. Rather than repeating the issue summaries here, I've linked their tracker issues below, and the patches I filed to fix them. * https://bugs.python.org/issue32782 (patch: https://github.com/python/cpython/pull/5576) * https://bugs.python.org/issue32780 (patch: https://github.com/python/cpython/pull/5561) I've seen little to no response on either the bug tracker or the github PRs regarding these, so at the recommendation of the "Lifecycle of a Pull Request" am emailing this list. Without these fixes, numpy has no choice but to ignore the broken buffer interface that ctypes provides, and instead try to parse the ctypes types manually. The sooner this makes a CPython release, the sooner numpy can remove those workarounds. Thanks, Eric From tjreedy at udel.edu Sun Apr 14 16:42:13 2019 From: tjreedy at udel.edu (Terry Reedy) Date: Sun, 14 Apr 2019 16:42:13 -0400 Subject: [Python-Dev] Fixing the ctypes implementation of the PEP3118 buffer interface In-Reply-To: References: Message-ID: On 4/14/2019 2:54 AM, Eric Wieser wrote: > I've recently been adding better support to Numpy 1.16 for > interoperability with ctypes. 
> > In doing so, I came across two bugs in the implementation of the > PEP3118 buffer interface within ctypes, affecting `Structure`s and > arrays. Rather than repeating the issue summaries here, I've linked > their tracker issues below, and the patches I filed to fix them. > * https://bugs.python.org/issue32782 (patch: > https://github.com/python/cpython/pull/5576) memoryview(object).itemsize is 0 when object is ctypes structure and format. C expert needed to review 30-line patch, most of which is error handling. Patch includes new tests and blurb. > * https://bugs.python.org/issue32780 (patch: > https://github.com/python/cpython/pull/5561) A partial fix for a more complicated memoryview, ctypes structure and format, and itemsize situation. > I've seen little to no response on either the bug tracker or the > github PRs regarding these, so at the recommendation of the "Lifecycle > of a Pull Request" am emailing this list. The problem is that the currently listed ctypes and memoryview experts are not currently active. > Without these fixes, numpy has no choice but to ignore the broken > buffer interface that ctypes provides, and instead try to parse the > ctypes types manually. The sooner this makes a CPython release, the > sooner numpy can remove those workarounds. -- Terry Jan Reedy From Peixing.Xin at windriver.com Sun Apr 14 21:42:11 2019 From: Peixing.Xin at windriver.com (Xin, Peixing) Date: Mon, 15 Apr 2019 01:42:11 +0000 Subject: [Python-Dev] checking "errno" for math operaton is safe to determine the error status? In-Reply-To: <5CB025C1.2060708@canterbury.ac.nz> References: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BB0437@ALA-MBD.corp.ad.wrs.com> <5fb2462a-ee2c-00e3-1928-d56510e29570@python.org> <8488FBC4EAAC5941BA4B85DD1ECCF1870133BB0B64@ALA-MBD.corp.ad.wrs.com> <5CB025C1.2060708@canterbury.ac.nz> Message-ID: <8488FBC4EAAC5941BA4B85DD1ECCF1870133BB22AC@ALA-MBD.corp.ad.wrs.com> VxWorks RTOS with 3rd party math lib. Thanks, Peixing -----Original Message----- From: Python-Dev [mailto:python-dev-bounces+peixing.xin=windriver.com at python.org] On Behalf Of Greg Ewing Sent: Friday, April 12, 2019 1:45 PM To: python-dev at python.org Subject: Re: [Python-Dev] checking "errno" for math operaton is safe to determine the error status? Xin, Peixing wrote: > On certain platform, expm1() is implemented as exp() minus 1. To calculate > expm1(-1420.0), that will call exp(-1420.0) then substract 1. You know, > exp(-1420.0) will underflow to zero and errno is set to ERANGE. As a > consequence the errno keeps set there when expm1() returns the correct result > -1. This sounds like a bug in that platform's implementation of expm1() to me. Which platform is it? -- Greg _______________________________________________ Python-Dev mailing list Python-Dev at python.org https://mail.python.org/mailman/listinfo/python-dev Unsubscribe: https://mail.python.org/mailman/options/python-dev/peixing.xin%40windriver.com From J.Demeyer at UGent.be Mon Apr 15 04:34:53 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Mon, 15 Apr 2019 10:34:53 +0200 Subject: [Python-Dev] PEP 580 and PEP 590 comparison. In-Reply-To: References: Message-ID: <5CB4422D.8030507@UGent.be> On 2019-04-14 13:34, Mark Shannon wrote: > I'll address capability first. I don't think that comparing "capability" makes a lot of sense since neither PEP 580 nor PEP 590 adds any new capabilities to CPython. They are meant to allow doing things faster, not to allow more things. 
And yes, the C call protocol can be implemented on top of the vectorcall protocol and conversely, but that doesn't mean much. > Now performance. > > Currently the PEP 590 implementation is intentionally minimal. It does > nothing for performance. So, we're missing some information here. What kind of performance improvements are possible with PEP 590 which are not in the reference implementation? > The benchmark Jeroen provides is a > micro-benchmark that calls the same functions repeatedly. This is > trivial and unrealistic. Well, it depends what you want to measure... I'm trying to measure precisely the thing that makes PEP 580 and PEP 590 different from the status-quo, so in that sense those benchmarks are very relevant. I think that the following 3 statements are objectively true: (A) Both PEP 580 and PEP 590 add a new calling convention, which is equally fast as builtin functions (and hence faster than tp_call). (B) Both PEP 580 and PEP 590 keep roughly the same performance as the status-quo for existing function/method calls. (C) While the performance of PEP 580 and PEP 590 is roughly the same, PEP 580 is slightly faster (based on the reference implementations linked from PEP 580 and PEP 590). Two caveats concerning (C): - the difference may be too small to matter. Relatively, it's a few percent of the call time but in absolute numbers, it's less than 10 CPU clock cycles. - there might be possible improvements to the reference implementation of either PEP 580/PEP 590. I don't expect big differences though. > To repeat an example > from an earlier email, which may have been overlooked, this code reduces > the time to create ranges and small lists by about 30% That's just a special case of the general fact (A) above and using the new calling convention for "type". It's an argument in favor of both PEP 580 and PEP 590, not for PEP 590 specifically. Jeroen. From J.Demeyer at UGent.be Mon Apr 15 04:38:11 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Mon, 15 Apr 2019 10:38:11 +0200 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> Message-ID: <5CB442F3.5060705@UGent.be> On 2019-04-14 13:30, Mark Shannon wrote: > PY_VECTORCALL_ARGUMENTS_OFFSET exists so that callables that make onward > calls with an additional argument can do so efficiently. The obvious > example is bound-methods, but classes are at least as important. > cls(*args) -> cls.new(cls, *args) -> cls.__init__(self, *args) But tp_new and tp_init take the "cls" and "self" as separate arguments, not as part of *args. So I don't see why you need PY_VECTORCALL_ARGUMENTS_OFFSET for this. > The updated minimal implementation now uses `const` arguments. > Code that uses args[-1] must explicitly cast away the const. > https://github.com/markshannon/cpython/blob/vectorcall-minimal/Objects/classobject.c#L55 That's better indeed. Jeroen. 
From solipsis at pitrou.net Mon Apr 15 06:50:00 2019 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 15 Apr 2019 12:50:00 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <15ff965b-30a9-dc4b-cc60-940532f349b9@python.org> Message-ID: <20190415125000.571364a0@fsol> On Thu, 11 Apr 2019 08:26:47 -0700 Steve Dower wrote: > On 10Apr2019 1917, Nathaniel Smith wrote: > > It sounds like --with-pydebug has accumulated a big grab bag of > > unrelated features, mostly stuff that was useful at some point for > > some CPython dev trying to debug CPython itself? It's clearly not > > designed with end users as the primary audience, given that no-one > > knows what it actually does and that it makes third-party extensions > > really awkward to run. If that's right then I think Victor's plan of > > to sort through what it's actually doing makes a lot of sense, > > especially if we can remove the ABI breaking stuff, since that causes > > a disproportionate amount of trouble. > > Does it really cause a "disproportionate" amount of trouble? It's > definitely not meant for anyone who isn't working on C code, whether in > CPython, an extension or a host application. If you want to use > third-party extensions and are not able to rebuild them, that's a very > good sign that you probably shouldn't be on the debug build at all. I can't really agree with that. There are third-party extensions that have non-trivial build requirements. The fact that you have to rebuild third-party dependencies is a strong deterrent against using pydebug builds even when they may be actually useful (for example when debugging an extension module of your own). If you could just install mainstream binary packages (e.g. from Anaconda or PyPI) on a debug build interpreter, the pain would go away. Regards Antoine. From senthil at uthcode.com Mon Apr 15 08:13:38 2019 From: senthil at uthcode.com (Senthil Kumaran) Date: Mon, 15 Apr 2019 05:13:38 -0700 Subject: [Python-Dev] Season of Docs Message-ID: Hello Python Developers, Google is running a program called Season of Docs ( https://developers.google.com/season-of-docs/) to encourage technical writers to improve the documentation of Open Source Projects. As Python-Dev, and Python Software Foundation, do you think: a) We should participate? b) If yes to a), are you willing to be a mentor and identify project ideas? If you are willing to mentor and have project ideas, please let us know and we can think about the next steps. The deadline for org application is April 23, 2019. This discussion started here https://discuss.python.org/t/will-python-apply-for-season-of-docs-and-allow-suborgs/ Thank you, Senthil -------------- next part -------------- An HTML attachment was scrubbed... URL: From stephane at wirtel.be Mon Apr 15 10:18:38 2019 From: stephane at wirtel.be (Stephane Wirtel) Date: Mon, 15 Apr 2019 16:18:38 +0200 Subject: [Python-Dev] [Core-mentorship] Season of Docs In-Reply-To: References: Message-ID: <20190415141838.245nppxwpt5fvwk6@xps> I don't know if Julien Palard is on this mailing list, but maybe he could be interested by this initiative. On 04/15, Senthil Kumaran wrote: >Hello Python Developers, > >Google is running a program called Season of Docs ( >https://developers.google.com/season-of-docs/) to encourage technical >writers to improve the documentation of Open Source Projects. > >As Python-Dev, and Python Software Foundation, do you think: > >a) We should participate? 
>b) If yes to a), are you willing to be a mentor and identify project ideas? > >If you are willing to mentor and have project ideas, please let us know and >we can think about the next steps. The deadline for org application is >April 23, 2019. > >This discussion started here >https://discuss.python.org/t/will-python-apply-for-season-of-docs-and-allow-suborgs/ > >Thank you, >Senthil >================================================== >Core-mentorship mailing list: core-mentorship at python.org >To unsubscribe send an email to core-mentorship-leave at python.org >https://mail.python.org/mm3/mailman3/lists/core-mentorship.python.org/ >Code of Conduct: https://www.python.org/psf/codeofconduct/ -- St?phane Wirtel - https://wirtel.be - @matrixise From alan.pope at canonical.com Mon Apr 15 07:21:17 2019 From: alan.pope at canonical.com (Alan Pope) Date: Mon, 15 Apr 2019 12:21:17 +0100 Subject: [Python-Dev] Collaboration on a set of Python snaps Message-ID: Hi Python devs, I work on the Snapcraft [0] team at Canonical. I'm looking for a Python contributor to collaborate with us on making snaps of supported releases of Python available in the Snap Store [1]. Travis CI and Canonical are looking for someone (preferably North-America based) to participate in an in-person Snapcraft Summit in downtown Montreal, Canada from 11th to 13th June. We're sponsoring a number of software vendors, device manufacturers people from the robotics sector to come. We'd love someone from the Python project to join us. We?ve published this blog post to explain the event in more detail: https://snapcraft.io/blog/snapcraft-summit-montreal The goal would be to create snaps of the major supported releases of Python, and authoritatively publish them in the Snap Store. This would enable users of many different Linux distributions to easily obtain up to date supported versions of Python directly from the Python project. It also enables providers of CI systems (such as Travis) to the latest builds of Python are easily available to developers who use their services. We've done this previously with NodeJS [2] and Ruby [3] - among others. It would be great to have Python available via this method too. All the best, Al. [0] - https://snapcraft.io/ [1] - https://snapcraft.io/store [2] - https://snapcraft.io/node [3] - https://snapcraft.io/ruby -- Alan Pope Community Advocate Canonical - Ubuntu Engineering and Services +44 (0) 7973 620 164 alan.pope at canonical.com http://ubuntu.com/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From jlehtosalo at gmail.com Mon Apr 15 12:42:23 2019 From: jlehtosalo at gmail.com (Jukka Lehtosalo) Date: Mon, 15 Apr 2019 09:42:23 -0700 Subject: [Python-Dev] PEP 589 discussion (TypedDict) happening at typing-sig@ Message-ID: Hi everyone, I submitted PEP 589 (TypedDict: Type Hints for Dictionaries with a Fixed Set of Keys) for discussion to typing-sig [1]. Here's an excerpt from the abstract of the PEP: PEP 484 defines the type Dict[K, V] for uniform dictionaries, where each value has the same type, and arbitrary key values are supported. It doesn't properly support the common pattern where the type of a dictionary value depends on the string value of the key. This PEP proposes a type constructor typing.TypedDict to support the use case where a dictionary object has a specific set of string keys, each with a value of a specific type. 
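As a concrete illustration, the class-based syntax proposed in the PEP looks roughly like this (a sketch only; until the PEP is accepted the same construct is available as mypy_extensions.TypedDict rather than typing.TypedDict):

    from mypy_extensions import TypedDict

    class Movie(TypedDict):
        name: str
        year: int

    movie: Movie = {'name': 'Blade Runner', 'year': 1982}
    movie['year'] = '1982'       # rejected by a type checker: 'year' expects int
    movie['genre'] = 'sci-fi'    # rejected: 'genre' is not a declared key

At runtime a TypedDict value is still a plain dict; the constraints above are enforced only by static type checkers.
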
Jukka Lehtosalo [1] https://mail.python.org/mailman3/lists/typing-sig.python.org/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve.dower at python.org Mon Apr 15 12:59:45 2019 From: steve.dower at python.org (Steve Dower) Date: Mon, 15 Apr 2019 09:59:45 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <15ff965b-30a9-dc4b-cc60-940532f349b9@python.org> Message-ID: <832f3456-3e46-fdca-4c51-8fe8ed111e38@python.org> On 12Apr2019 1819, Nathaniel Smith wrote: > On Fri, Apr 12, 2019 at 5:05 PM Steve Dower wrote: >> >> On 12Apr.2019 1643, Nathaniel Smith wrote: >>> On Thu, Apr 11, 2019 at 8:26 AM Steve Dower wrote: >> The very first question I asked was whether this would let us converge >> the ABIs, and the answer was "no". >> >> Otherwise I'd have said go for it, despite the C runtime issues. > > I don't see that in the thread... just Victor saying he isn't sure > whether there might be other ABI incompatibilities lurking that he > hasn't found yet. Did I miss something? "I don't know" means we can't say the APIs are converged, which is a no. I don't think you missed anything, but just read it through a different filter. > I'm mostly interested in this because of the possibility of converging > the ABIs. If you think that the C runtime thing isn't a blocker for > that, then that's useful information. Though obviously we still need > to figure out whether there are any other blockers :-). > [SNIP] > Do you happen to have a list of places where the C API leaks details > of the underlying CRT? > > (I'm mostly curious because whenever I've looked my conclusion was > essentially: "Well....... I don't see any places that are *definitely* > broken, so maybe mixing CRTs is fine? but I have zero confidence that > I caught everything, so probably better to play it safe?". At least on > py3 ? I know the py2 C API was definitely broken if you mixed CRTs, > because of the exposed FILE*.) Not since the discussions about migrating to VS 2015, but a few off the top of my head: * locale * file descriptors * stream buffers * thread locals * exception [handler] state (yes, there are exceptions used within the CRT, and they occasionally intentionally leak out past the C code) * atexit handlers * internal callbacks (mostly debug handlers, but since we're talking about debugging...) I'm pretty sure if I did some digging I'd be able to figure out which of these come from vcruntime140.dll vs ucrtbase.dll, and then come up with some far-too-clever linker options to make some of these more consistent, but there's no complete solution other than making sure you've got a complete debug or complete release build. >> For the most part, disabling optimizations in your own extension but >> using the non-debug ABI is sufficient, and if you're having to deal with >> other people's packages then maybe you don't have any choice (though I >> do know of people who have built debug versions of numpy before - turns >> out Windows developers are often just as capable as non-Windows >> developers when it comes to building things ;) > > I'm not sure why you think I was implying otherwise? I'm sorry if you > thought I was attacking your users or something. 
I did say that I > thought most users downloading the debug builds were probably confused > about what they were actually getting, but I didn't mean because they > were stupid Windows users, I meant because the debug builds are so > confusing that even folks on the Python core team are confused about > what they're actually getting. "Our users", please :) In my experience, Windows developers just treat debug and release builds as part of the normal development process. The only confusion I've seen has been related to CPython's not-quite-Windows-ish approach to debug builds, and in practically every case it's been enough to explain "release CPython uses a different CRT to your debug extension, but once you align those it'll be fine". I definitely *do not* want to force or encourage package developers to release debug ABI versions of their prebuilt packages. But at the same time I don't want to remove the benefits that debug builds currently include. Basically, I'm happy with the status quo, and the users I talk to are happy with it. So I'd rather not worry about optimising debug builds for speed or memory usage. (It's a question of direction more than anything else, and until we get some official statement of direction then I'll keep advocating a direction based on my experiences ;) ) Cheers, Steve From solipsis at pitrou.net Mon Apr 15 14:09:18 2019 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 15 Apr 2019 20:09:18 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build References: <0a1871f0-14f8-8591-cf35-21cb3741f354@python.org> <15ff965b-30a9-dc4b-cc60-940532f349b9@python.org> <20190415125000.571364a0@fsol> Message-ID: <20190415200918.760863f8@fsol> On Mon, 15 Apr 2019 12:50:00 +0200 Antoine Pitrou wrote: > On Thu, 11 Apr 2019 08:26:47 -0700 > Steve Dower wrote: > > On 10Apr2019 1917, Nathaniel Smith wrote: > > > It sounds like --with-pydebug has accumulated a big grab bag of > > > unrelated features, mostly stuff that was useful at some point for > > > some CPython dev trying to debug CPython itself? It's clearly not > > > designed with end users as the primary audience, given that no-one > > > knows what it actually does and that it makes third-party extensions > > > really awkward to run. If that's right then I think Victor's plan of > > > to sort through what it's actually doing makes a lot of sense, > > > especially if we can remove the ABI breaking stuff, since that causes > > > a disproportionate amount of trouble. > > > > Does it really cause a "disproportionate" amount of trouble? It's > > definitely not meant for anyone who isn't working on C code, whether in > > CPython, an extension or a host application. If you want to use > > third-party extensions and are not able to rebuild them, that's a very > > good sign that you probably shouldn't be on the debug build at all. > > I can't really agree with that. There are third-party extensions that > have non-trivial build requirements. The fact that you have to rebuild > third-party dependencies is a strong deterrent against using pydebug > builds even when they may be actually useful (for example when > debugging an extension module of your own). 
Oh, and as a datapoint, there are user requests for pydebug builds in Anaconda and conda-forge: https://github.com/ContinuumIO/anaconda-issues/issues/80 https://github.com/conda-forge/staged-recipes/issues/1593 The problem is, while it's technically relatively easy to build and distribute a special build of Python, to make it useful implies also building a whole separate distribution of Python libraries as well. I suspect the latter is why those issues were never acted upon. So, there's actual demand from people who would (probably) benefit from it, but are blocked by burden of recompiling all dependencies. Regards Antoine. From christian at python.org Mon Apr 15 16:44:58 2019 From: christian at python.org (Christian Heimes) Date: Mon, 15 Apr 2019 22:44:58 +0200 Subject: [Python-Dev] PEP 578: Python Runtime Audit Hooks In-Reply-To: References: Message-ID: <47700a82-7654-f36a-26c7-0fd13d7cd8f7@python.org> On 28/03/2019 23.35, Steve Dower wrote: > Hi all > > Time is short, but I'm hoping to get PEP 578 (formerly PEP 551) into > Python 3.8. Here's the current text for review and comment before I > submit to the Steering Council. > > The formatted text is at https://www.python.org/dev/peps/pep-0578/ > (update just pushed, so give it an hour or so, but it's fundamentally > the same as what's there) > > No Discourse post, because we don't have a python-dev equivalent there > yet, so please reply here for this one. > > Implementation is at https://github.com/zooba/cpython/tree/pep-578/ and > my backport to 3.7 (https://github.com/zooba/cpython/tree/pep-578-3.7/) > is already getting some real use (though this will not be added to 3.7, > unless people *really* want it, so the backport is just for reference). Hi Steve, (memory dump before I go to bed) Steve Grubb from Red Hat security pointed me to some interesting things [1]. For instance there is some work on a new O_MAYEXEC flag for open(). Steve came to similar conclusions like we, e.g. streaming code from stdin is insecure. I think it would be also beneficial to have auditing events for the import system to track when sys.path or import loaders are changed. Christian [1] https://marc.info/?l=linux-fsdevel&m=155535414414626&w=2 From steve.dower at python.org Mon Apr 15 17:17:04 2019 From: steve.dower at python.org (Steve Dower) Date: Mon, 15 Apr 2019 14:17:04 -0700 Subject: [Python-Dev] PEP 578: Python Runtime Audit Hooks In-Reply-To: <47700a82-7654-f36a-26c7-0fd13d7cd8f7@python.org> References: <47700a82-7654-f36a-26c7-0fd13d7cd8f7@python.org> Message-ID: <4e6ccd29-15c5-2704-04dd-0804f279638d@python.org> On 15Apr2019 1344, Christian Heimes wrote: > Hi Steve, > > (memory dump before I go to bed) > > Steve Grubb from Red Hat security pointed me to some interesting things > [1]. For instance there is some work on a new O_MAYEXEC flag for open(). > Steve came to similar conclusions like we, e.g. streaming code from > stdin is insecure. > > [1] https://marc.info/?l=linux-fsdevel&m=155535414414626&w=2 Thanks for the pointer! Using this for open_code() by default on platforms that support it might be a good opportunity in the future. But I'm glad I'm not the only one who thinks this is the right approach :) > I think it would be also beneficial to have auditing events for the > import system to track when sys.path or import loaders are changed. Already in there (kind of... 
the "import" events include the contents of the sys properties that are about to be used to resolve it - since these are plain-old lists, and can be easily reassigned, passing them through here allows you to add a check if you really want it but otherwise not pay the cost of replacing the sys module with a special implementation and its attributes with special lists). Cheers, Steve From stefano.borini at gmail.com Mon Apr 15 17:32:24 2019 From: stefano.borini at gmail.com (Stefano Borini) Date: Mon, 15 Apr 2019 22:32:24 +0100 Subject: [Python-Dev] Cannot find documented API in PEP-376 (Database of Installed Python Distributions) Message-ID: Hello, I am on a PEP scouting effort to check the current status of python packaging and its historical context, mostly for learning purposes. I noted that the PEP defines some functions for pkgutil (e.g. get_distributions), but I cannot find them. I tried to do some searching on the mailing list history, but I came up with pretty much nothing of value. It appears that the topic was last considered in 2009 (the year of the PEP). dist-info was then implemented, but I cannot find any information about the missing API, nor any additional PEP, except for a brief reference in PEP-427. Does anyone have some context for this? I understand it was 10 years ago, so it's mostly a curiosity. Thanks. -- Kind regards, Stefano Borini From p.f.moore at gmail.com Mon Apr 15 17:47:16 2019 From: p.f.moore at gmail.com (Paul Moore) Date: Mon, 15 Apr 2019 22:47:16 +0100 Subject: [Python-Dev] Cannot find documented API in PEP-376 (Database of Installed Python Distributions) In-Reply-To: References: Message-ID: On Mon, 15 Apr 2019 at 22:35, Stefano Borini wrote: > > Hello, > > I am on a PEP scouting effort to check the current status of python > packaging and its historical context, mostly for learning purposes. I > noted that the PEP defines some functions for pkgutil (e.g. > get_distributions), but I cannot find them. > I tried to do some searching on the mailing list history, but I came > up with pretty much nothing of value. It appears that the topic was > last considered in 2009 (the year of the PEP). dist-info was then > implemented, but I cannot find any information about the missing API, > nor any additional PEP, except for a brief reference in PEP-427. > > Does anyone have some context for this? > > I understand it was 10 years ago, so it's mostly a curiosity. Thanks. PEP 376 was part of a rather grand plan to re-engineer a lot of Python's packaging tools (distutils and setuptools at the time, mainly). Although the PEP was accepted, a lot of the coding never got done and ultimately the project was abandoned, and we moved over to a more incremental approach of improving what was there, rather than wholesale replacing things. So the PEP itself is something of a mixture now, some parts that are implemented, some parts that are relevant in principle but the details never got filled in, and some parts that simply never happened. >From what I recall (I was around at the time) a lot of the discussion was on distutils-sig - did you check the archives of that list in your searching? But there was a lot of what I would describe as "heated debate" going on at that point, so it may be hard to find anything particularly informative. Hopefully, that's of some use - good luck in your investigations! 
Paul From sully at msully.net Mon Apr 15 18:23:10 2019 From: sully at msully.net (Michael Sullivan) Date: Mon, 15 Apr 2019 15:23:10 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build Message-ID: > The main question is if anyone ever used Py_TRACE_REFS? Does someone > use sys.getobjects() or PYTHONDUMPREFS environment variable? I used sys.getobjects() today to track down a memory leak in the mypyc-compiled version of mypy. We were leaking memory badly but no sign of the leak was showing up in mypy's gc.get_objects() based profiler. Using a debug build and switching to sys.getobjects() showed that we were badly leaking int objects. A quick inspection of the values in question (large and random looking) suggested we were leaking hash values, and that quickly pointed me to https://github.com/mypyc/mypyc/pull/562. I don't have any strong feelings about whether to keep it in the "default" debug build, though. I was using a debug build that I built myself with every debug feature that seemed potentially useful. -sully -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Mon Apr 15 19:05:58 2019 From: njs at pobox.com (Nathaniel Smith) Date: Mon, 15 Apr 2019 16:05:58 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: Message-ID: On Mon, Apr 15, 2019, 15:27 Michael Sullivan wrote: > > The main question is if anyone ever used Py_TRACE_REFS? Does someone > > use sys.getobjects() or PYTHONDUMPREFS environment variable? > > I used sys.getobjects() today to track down a memory leak in the > mypyc-compiled version of mypy. > > We were leaking memory badly but no sign of the leak was showing up in > mypy's gc.get_objects() based profiler. Using a debug build and switching > to sys.getobjects() showed that we were badly leaking int objects. A quick > inspection of the values in question (large and random looking) suggested > we were leaking hash values, and that quickly pointed me to > https://github.com/mypyc/mypyc/pull/562. > > I don't have any strong feelings about whether to keep it in the "default" > debug build, though. I was using a debug build that I built myself with > every debug feature that seemed potentially useful. > This is mostly to satisfy my curiosity, so feel free to ignore: did you try using address sanitizer or valgrind? -n > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sully at msully.net Mon Apr 15 19:58:07 2019 From: sully at msully.net (Michael Sullivan) Date: Mon, 15 Apr 2019 16:58:07 -0700 Subject: [Python-Dev] PEP 591 discussion (final qualifier) happening at typing-sig@ Message-ID: I've submitted PEP 591 (Adding a final qualifier to typing) for discussion to typing-sig [1]. Here's the abstract: This PEP proposes a "final" qualifier to be added to the ``typing`` module---in the form of a ``final`` decorator and a ``Final`` type annotation---to serve three related purposes: * Declaring that a method should not be overridden * Declaring that a class should not be subclassed * Declaring that a variable or attribute should not be reassigned Full text at https://www.python.org/dev/peps/pep-0591/ -sully [1] https://mail.python.org/mailman3/lists/typing-sig.python.org/ -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From philgagnon1 at gmail.com Mon Apr 15 21:06:58 2019 From: philgagnon1 at gmail.com (Philippe Gagnon) Date: Mon, 15 Apr 2019 21:06:58 -0400 Subject: [Python-Dev] PEP 589 discussion (TypedDict) happening at typing-sig@ In-Reply-To: References: Message-ID: Hi Jukka, Thanks for submitting this PEP, I think it will be a net plus for the python language. I have been using TypedDict as a mypy_extensions module and it's been a great help. I found that the one thing that may be less intuitive and its design is the totality property. The fact that you need to use inheritance to compose TypedDicts that contain both required and optional keys create situations that may be a little verbose for some use cases. Perhaps an "optional" property taking a list of keys that type checkers would recognize as (no surprise) optional could be an alternative design with some merit. Best regards, Philippe On Mon, Apr 15, 2019 at 12:44 PM Jukka Lehtosalo wrote: > Hi everyone, > > I submitted PEP 589 (TypedDict: Type Hints for Dictionaries with a Fixed > Set of Keys) for discussion to typing-sig [1]. > > Here's an excerpt from the abstract of the PEP: > > PEP 484 defines the type Dict[K, V] for uniform dictionaries, where each > value has the same type, and arbitrary key values are supported. It doesn't > properly support the common pattern where the type of a dictionary value > depends on the string value of the key. This PEP proposes a type > constructor typing.TypedDict to support the use case where a dictionary > object has a specific set of string keys, each with a value of a specific > type. > > Jukka Lehtosalo > > [1] https://mail.python.org/mailman3/lists/typing-sig.python.org/ > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/philgagnon1%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Mon Apr 15 23:12:33 2019 From: njs at pobox.com (Nathaniel Smith) Date: Mon, 15 Apr 2019 20:12:33 -0700 Subject: [Python-Dev] PEP 591 discussion (final qualifier) happening at typing-sig@ In-Reply-To: References: Message-ID: On Mon, Apr 15, 2019 at 5:00 PM Michael Sullivan wrote: > > I've submitted PEP 591 (Adding a final qualifier to typing) for discussion to typing-sig [1]. I'm not on typing-sig [1] so I'm replying here. > Here's the abstract: > This PEP proposes a "final" qualifier to be added to the ``typing`` > module---in the form of a ``final`` decorator and a ``Final`` type > annotation---to serve three related purposes: > > * Declaring that a method should not be overridden > * Declaring that a class should not be subclassed > * Declaring that a variable or attribute should not be reassigned I've been meaning to start blocking subclassing at runtime (e.g. like [2]), so being able to express that to the typechecker seems like a nice addition. I'm assuming though that the '@final' decorator doesn't have any runtime effect, so I'd have to say it twice? @typing.final class MyClass(metaclass=othermod.Final): ... Or on 3.6+ with __init_subclass__, it's easy to define a @final decorator that works at runtime, but I guess this would have to be a different decorator? @typing.final @alsoruntime.final class MyClass: ... This seems kinda awkward. Have you considered giving it a runtime effect, or providing some way for users to combine these two things together on their own? 
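For illustration, a minimal sketch of the kind of runtime-enforcing decorator being asked about here, using __init_subclass__ on Python 3.6+; the name runtime_final is invented for this example and is not part of typing or any other stdlib module:

    def runtime_final(base):
        # Reject subclassing of *base* at class-creation time (sketch only).
        def __init_subclass__(cls, **kwargs):
            raise TypeError(
                f"{base.__name__} is final; {cls.__name__} may not subclass it")
        base.__init_subclass__ = classmethod(__init_subclass__)
        return base

    @runtime_final
    class MyClass:
        pass

    # class Broken(MyClass): pass   # raises TypeError at class creation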
-n [1] https://github.com/willingc/pep-communication/issues/1 [2] https://stackoverflow.com/a/3949004/1925449 -- Nathaniel J. Smith -- https://vorpus.org From sully at msully.net Mon Apr 15 23:57:53 2019 From: sully at msully.net (Michael Sullivan) Date: Mon, 15 Apr 2019 20:57:53 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: Message-ID: On Mon, Apr 15, 2019 at 4:06 PM Nathaniel Smith wrote: > On Mon, Apr 15, 2019, 15:27 Michael Sullivan wrote: > >> > The main question is if anyone ever used Py_TRACE_REFS? Does someone >> > use sys.getobjects() or PYTHONDUMPREFS environment variable? >> >> I used sys.getobjects() today to track down a memory leak in the >> mypyc-compiled version of mypy. >> >> We were leaking memory badly but no sign of the leak was showing up in >> mypy's gc.get_objects() based profiler. Using a debug build and switching >> to sys.getobjects() showed that we were badly leaking int objects. A quick >> inspection of the values in question (large and random looking) suggested >> we were leaking hash values, and that quickly pointed me to >> https://github.com/mypyc/mypyc/pull/562. >> >> I don't have any strong feelings about whether to keep it in the >> "default" debug build, though. I was using a debug build that I built >> myself with every debug feature that seemed potentially useful. >> > > This is mostly to satisfy my curiosity, so feel free to ignore: did you > try using address sanitizer or valgrind? > > I didn't, mostly because I assume that valgrind wouldn't play well with cpython. (I've never used address sanitizer.) I was curious, so I went back and tried it out. It turned out to not seem to need that much fiddling to get to work. It slows things down a *lot* and produced 17,000 "loss records", though, so maybe I don't have it working right. At a glance the records did not shed any light. I'd definitely believe that valgrind is up to the task of debugging this, but my initial take with it shed much less light than my sys.getobjects() approach. (Though note that my sys.getobjects() approach was slotting it into an existing python memory profiler we had hacked up, so...) -sully > -n > >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From sully at msully.net Tue Apr 16 03:48:20 2019 From: sully at msully.net (Michael Sullivan) Date: Tue, 16 Apr 2019 00:48:20 -0700 Subject: [Python-Dev] PEP 591 discussion (final qualifier) happening at typing-sig@ In-Reply-To: References: Message-ID: On Mon, Apr 15, 2019 at 8:12 PM Nathaniel Smith wrote: > On Mon, Apr 15, 2019 at 5:00 PM Michael Sullivan wrote: > > > > I've submitted PEP 591 (Adding a final qualifier to typing) for > discussion to typing-sig [1]. > > I'm not on typing-sig [1] so I'm replying here. > > > Here's the abstract: > > This PEP proposes a "final" qualifier to be added to the ``typing`` > > module---in the form of a ``final`` decorator and a ``Final`` type > > annotation---to serve three related purposes: > > > > * Declaring that a method should not be overridden > > * Declaring that a class should not be subclassed > > * Declaring that a variable or attribute should not be reassigned > > I've been meaning to start blocking subclassing at runtime (e.g. like > [2]), so being able to express that to the typechecker seems like a > nice addition. I'm assuming though that the '@final' decorator doesn't > have any runtime effect, so I'd have to say it twice? 
> > @typing.final > class MyClass(metaclass=othermod.Final): > ... > > Or on 3.6+ with __init_subclass__, it's easy to define a @final > decorator that works at runtime, but I guess this would have to be a > different decorator? > > @typing.final > @alsoruntime.final > class MyClass: > ... > > This seems kinda awkward. Have you considered giving it a runtime > effect, or providing some way for users to combine these two things > together on their own? > > Nothing else in typing does any type of runtime enforcement, so I'd be reluctant to start here. One approach would be doing something like this (maybe in a support module): if typing.TYPE_CHECKING: from typing import final else: from alsoruntime import final So that at checking time, the typechecker would use the typing final but at runtime we'd get something that does enforcement. (And for the pre-3.6 case, you could maybe use something like six.add_metaclass in order to specify the metaclass as a decorator.) I can add this as an example to the PEP. -sully > -n > > [1] https://github.com/willingc/pep-communication/issues/1 > [2] https://stackoverflow.com/a/3949004/1925449 > > -- > Nathaniel J. Smith -- https://vorpus.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From vstinner at redhat.com Tue Apr 16 05:11:05 2019 From: vstinner at redhat.com (Victor Stinner) Date: Tue, 16 Apr 2019 11:11:05 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: Message-ID: Hi Michael, Do you know the tracemalloc module? Did you try it? It works on a regular Python (compiled in debug mode). I would be curious to know if tracemalloc also allows you to track the memory leak. sys.getobjects() is just a list of objects. Do you have a tool written on top of it to track memory leaks? If yes, how? Victor Le mar. 16 avr. 2019 ? 00:28, Michael Sullivan a ?crit : > > > The main question is if anyone ever used Py_TRACE_REFS? Does someone > > use sys.getobjects() or PYTHONDUMPREFS environment variable? > > I used sys.getobjects() today to track down a memory leak in the mypyc-compiled version of mypy. > > We were leaking memory badly but no sign of the leak was showing up in mypy's gc.get_objects() based profiler. Using a debug build and switching to sys.getobjects() showed that we were badly leaking int objects. A quick inspection of the values in question (large and random looking) suggested we were leaking hash values, and that quickly pointed me to https://github.com/mypyc/mypyc/pull/562. > > I don't have any strong feelings about whether to keep it in the "default" debug build, though. I was using a debug build that I built myself with every debug feature that seemed potentially useful. > > -sully > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/vstinner%40redhat.com -- Night gathers, and now my watch begins. It shall not end until my death. 
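For readers following along, the usual tracemalloc workflow Victor has in mind looks roughly like this (a sketch; run_suspected_leaky_workload is a placeholder for whatever code is being measured):

    import tracemalloc

    tracemalloc.start(25)   # keep up to 25 Python frames per allocation

    before = tracemalloc.take_snapshot()
    run_suspected_leaky_workload()   # placeholder for the code under test
    after = tracemalloc.take_snapshot()

    # Show the allocation sites that grew the most between the snapshots.
    for stat in after.compare_to(before, "lineno")[:10]:
        print(stat)

Whether this pinpoints a leak coming from a C extension depends on how meaningful the Python-level tracebacks are, which is what the rest of the thread turns on.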
From njs at pobox.com Tue Apr 16 06:05:56 2019 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 16 Apr 2019 03:05:56 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: Message-ID: On Mon, Apr 15, 2019 at 8:58 PM Michael Sullivan wrote: > > On Mon, Apr 15, 2019 at 4:06 PM Nathaniel Smith wrote: >> >> On Mon, Apr 15, 2019, 15:27 Michael Sullivan wrote: >>> >>> > The main question is if anyone ever used Py_TRACE_REFS? Does someone >>> > use sys.getobjects() or PYTHONDUMPREFS environment variable? >>> >>> I used sys.getobjects() today to track down a memory leak in the mypyc-compiled version of mypy. >>> >>> We were leaking memory badly but no sign of the leak was showing up in mypy's gc.get_objects() based profiler. Using a debug build and switching to sys.getobjects() showed that we were badly leaking int objects. A quick inspection of the values in question (large and random looking) suggested we were leaking hash values, and that quickly pointed me to https://github.com/mypyc/mypyc/pull/562. >>> >>> I don't have any strong feelings about whether to keep it in the "default" debug build, though. I was using a debug build that I built myself with every debug feature that seemed potentially useful. >> >> >> This is mostly to satisfy my curiosity, so feel free to ignore: did you try using address sanitizer or valgrind? >> > I didn't, mostly because I assume that valgrind wouldn't play well with cpython. (I've never used address sanitizer.) > > I was curious, so I went back and tried it out. > It turned out to not seem to need that much fiddling to get to work. It slows things down a *lot* and produced 17,000 "loss records", though, so maybe I don't have it working right. At a glance the records did not shed any light. > > I'd definitely believe that valgrind is up to the task of debugging this, but my initial take with it shed much less light than my sys.getobjects() approach. (Though note that my sys.getobjects() approach was slotting it into an existing python memory profiler we had hacked up, so...) valgrind on CPython is definitely a bit fiddly ? if you need it again you might check out Misc/README.valgrind. Supposedly memory sanitizer is just './configure --with-memory-sanitizer', but I haven't tried it either :-) -n -- Nathaniel J. Smith -- https://vorpus.org From vstinner at redhat.com Tue Apr 16 06:17:39 2019 From: vstinner at redhat.com (Victor Stinner) Date: Tue, 16 Apr 2019 12:17:39 +0200 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: Message-ID: Since Python 3.6, you can use PYTHONMALLOC=malloc for Valgrind: it avoids false alarms produced by the pymalloc allocator. Victor Le mar. 16 avr. 2019 ? 12:09, Nathaniel Smith a ?crit : > > On Mon, Apr 15, 2019 at 8:58 PM Michael Sullivan wrote: > > > > On Mon, Apr 15, 2019 at 4:06 PM Nathaniel Smith wrote: > >> > >> On Mon, Apr 15, 2019, 15:27 Michael Sullivan wrote: > >>> > >>> > The main question is if anyone ever used Py_TRACE_REFS? Does someone > >>> > use sys.getobjects() or PYTHONDUMPREFS environment variable? > >>> > >>> I used sys.getobjects() today to track down a memory leak in the mypyc-compiled version of mypy. > >>> > >>> We were leaking memory badly but no sign of the leak was showing up in mypy's gc.get_objects() based profiler. Using a debug build and switching to sys.getobjects() showed that we were badly leaking int objects. 
A quick inspection of the values in question (large and random looking) suggested we were leaking hash values, and that quickly pointed me to https://github.com/mypyc/mypyc/pull/562. > >>> > >>> I don't have any strong feelings about whether to keep it in the "default" debug build, though. I was using a debug build that I built myself with every debug feature that seemed potentially useful. > >> > >> > >> This is mostly to satisfy my curiosity, so feel free to ignore: did you try using address sanitizer or valgrind? > >> > > I didn't, mostly because I assume that valgrind wouldn't play well with cpython. (I've never used address sanitizer.) > > > > I was curious, so I went back and tried it out. > > It turned out to not seem to need that much fiddling to get to work. It slows things down a *lot* and produced 17,000 "loss records", though, so maybe I don't have it working right. At a glance the records did not shed any light. > > > > I'd definitely believe that valgrind is up to the task of debugging this, but my initial take with it shed much less light than my sys.getobjects() approach. (Though note that my sys.getobjects() approach was slotting it into an existing python memory profiler we had hacked up, so...) > > valgrind on CPython is definitely a bit fiddly ? if you need it again > you might check out Misc/README.valgrind. > > Supposedly memory sanitizer is just './configure > --with-memory-sanitizer', but I haven't tried it either :-) > > -n > > -- > Nathaniel J. Smith -- https://vorpus.org > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/vstinner%40redhat.com -- Night gathers, and now my watch begins. It shall not end until my death. From J.Demeyer at UGent.be Tue Apr 16 07:15:45 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Tue, 16 Apr 2019 13:15:45 +0200 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <5CA445BD.4040705@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <5CA445BD.4040705@UGent.be> Message-ID: <5CB5B961.6020704@UGent.be> On 2019-04-03 07:33, Jeroen Demeyer wrote: > Access to the class isn't possible currently and also not with PEP 590. > But it's easy enough to fix that: PEP 573 adds a new METH_METHOD flag to > change the signature of the C function (not the vectorcall wrapper). PEP > 580 supports this "out of the box" because I'm reusing the class also to > do type checks. But this shouldn't be an argument for or against either > PEP. Actually, in the answer above I only considered "is implementing PEP 573 possible?" but I did not consider the complexity of doing that. And in line with what I claimed about complexity before, I think that PEP 580 scores better in this regard. Take PEP 580 and assume for the sake of argument that it didn't already have the cc_parent field. Then adding support for PEP 573 is easy: just add the cc_parent field to the C call protocol structure and set that field when initializing a method_descriptor. C functions can use the METH_DEFARG flag to get access to the PyCCallDef structure, which gives cc_parent. Implementing PEP 573 for a custom function class takes no extra effort: it doesn't require any changes to that class, except for correctly initializing the cc_parent field. 
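As a rough sketch of the layout being described (only PyCCallDef, cc_parent and METH_DEFARG are names taken from the discussion; the remaining field names and types are illustrative placeholders, not the PEP's exact definition):

    #include <Python.h>
    #include <stdint.h>

    /* Illustrative stand-in for PEP 580's per-function C call record: the
       point is that the defining class travels next to the function pointer,
       so a C function flagged with METH_DEFARG can reach it. */
    typedef struct {
        uint32_t  cc_flags;    /* e.g. METH_DEFARG (placeholder value set)   */
        void     *cc_func;     /* the C function to call (placeholder type)  */
        PyObject *cc_parent;   /* defining class (or module), per the thread */
    } PyCCallDef_sketch;

    /* Supporting PEP 573 in a custom function class then amounts to filling
       in cc_parent once, when the function object is created: */
    static void
    ccalldef_init(PyCCallDef_sketch *cc, void *func, PyObject *defining_class)
    {
        cc->cc_flags = 0;
        cc->cc_func = func;
        Py_XINCREF(defining_class);
        cc->cc_parent = defining_class;
    }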
Since PEP 580 has built-in support for methods, nothing special needs to be done to support methods too. With PEP 590 on the other hand, every single class which is involved in PEP 573 must be changed and every single vectorcall wrapper supporting PEP 573 must be changed. This is not limited to the function class itself, also the corresponding method class (for example, builtin_function_or_method for method_descriptor) needs to be changed. Jeroen From christian at python.org Tue Apr 16 07:39:59 2019 From: christian at python.org (Christian Heimes) Date: Tue, 16 Apr 2019 13:39:59 +0200 Subject: [Python-Dev] PEP 578: Python Runtime Audit Hooks In-Reply-To: <4e6ccd29-15c5-2704-04dd-0804f279638d@python.org> References: <47700a82-7654-f36a-26c7-0fd13d7cd8f7@python.org> <4e6ccd29-15c5-2704-04dd-0804f279638d@python.org> Message-ID: <229b7489-5d3f-8638-f8f2-eda0fa7c041d@python.org> On 15/04/2019 23.17, Steve Dower wrote: > On 15Apr2019 1344, Christian Heimes wrote: >> Hi Steve, >> >> (memory dump before I go to bed) >> >> Steve Grubb from Red Hat security pointed me to some interesting things >> [1]. For instance there is some work on a new O_MAYEXEC flag for open(). >> Steve came to similar conclusions like we, e.g. streaming code from >> stdin is insecure. >> >> [1] https://marc.info/?l=linux-fsdevel&m=155535414414626&w=2 > > Thanks for the pointer! Using this for open_code() by default on > platforms that support it might be a good opportunity in the future. But > I'm glad I'm not the only one who thinks this is the right approach :) Here is the original patch on LWN with some links to presentations: https://lwn.net/Articles/774676/ The approach has one downside: The current user must have DAC executable permission for a file in order to open a file with O_MAYEXEC. That means we have to +x all Python files and PYC files, not just the files that are designed as entry points. >> I think it would be also beneficial to have auditing events for the >> import system to track when sys.path or import loaders are changed. > > Already in there (kind of... the "import" events include the contents of > the sys properties that are about to be used to resolve it - since these > are plain-old lists, and can be easily reassigned, passing them through > here allows you to add a check if you really want it but otherwise not > pay the cost of replacing the sys module with a special implementation > and its attributes with special lists). Yeah, it's complicated :/ Steve Grubb mentioned remote importers or hacks like mem_fd + dlopen() from /proc/self/fd as attack vectors. Mitigations and audit systems like IMA Appraisal only work properly if code has to hit the disk first. If an attacker or user can perform the equivalent of PROT_EXEC | PROT_WRITE, then IMA won't be able to 'see' the malicious code. https://www.tutorialspoint.com/How-to-use-remote-python-modules https://github.com/operatorequals/httpimport https://0x00sec.org/t/pure-python-in-memory-so-loading-without-shm/6453 https://github.com/nullbites/SnakeEater/blob/master/SnakeEater2.py Christian From christian at python.org Tue Apr 16 08:32:14 2019 From: christian at python.org (Christian Heimes) Date: Tue, 16 Apr 2019 14:32:14 +0200 Subject: [Python-Dev] PEP 578: Python Runtime Audit Hooks In-Reply-To: References: <2cb3740e-ebc2-1839-1d2e-73d1b9f0a445@python.org> <8466c9c4-b5dc-c6c5-6fe4-a49dc2f4f968@python.org> Message-ID: <03d9d56d-540c-0eae-2b69-2c0960bc030b@python.org> Sorry, I forgot to reply. 
Do you think it would make sense to split the PEP into two PEPs? The auditing hook and import opener hook are related, but distinct improvements. The auditing part looks solid and ready now. The import opener may need some more refinement. I would also like to get feedback from some Linux Kernel security engineers first. On 01/04/2019 18.31, Steve Dower wrote: > On 31Mar2019 0538, Christian Heimes wrote: >> I don't like the fact that the PEP requires users to learn and use an >> additional layer to handle native code. Although we cannot provide a >> fully secure hook for native code, we could at least try to provide a >> best effort hook and document the limitations. A bit more information >> would make the verified open function more useful, too. > > So instead they need to learn a significantly more complicated API? :) > (I was very happy to be able to say "it's the same as open(p, 'rb')"). > >> PyObject *PyImport_OpenForExecution( >> ???? const char *path, >> ???? const char *intent, >> ???? int flags, >> ???? PyObject *context >> ) >> >> - Path is an absolute (!) file path. The PEP doesn't specify if the file >> name is relative or absolute. IMO it should be always absolute. > > Yeah, this is fair enough. I'll add it as a requirement. > >> - The new intent argument lets the caller pass information how it >> intents to use the file, e.g. pythoncode, zipimport, nativecode (for >> loading a shared library/DLL), ctypes, ... This allows the verify hook >> to react on the intent and provide different verifications for e.g. >> Python code and native modules. > > I had an intent argument at one point and the feedback I got (from teams > who wanted to implement it) is that they wouldn't trust it anyway :) > > In each case there should be associated audit events for tracking the > intent (and interrupting at that point if it doesn't like the intended > action), but for the simple case of "let me open this specific file" it > doesn't really add much. And it almost certainly shouldn't impact > decision making. There is no need to trust the intent flag that much. I would like to have a way to further narrow down the scope for an open call. This would allow the caller to tell the hook "I want to open something that should be a shared library suitable for ctypes". It would allow tighter control. Audit events are useful and powerful. But I don't want to put too much burden on the auditing framwork. I prefer to have checks that prevent operations rather than allow operations and audit them. >> - The flags argument is for additional flags, e.g. return an opened file >> or None, open the file in text or binary mode, ... > > This just makes it harder for the hook implementer - now you have to > allow encoding/errors arguments and probably more. And as mentioned > above, there should be an audit event showing the intent before this > call, and a hook can reject it at that point (rather than verify without > actually returning the verified content). I retract this part of my proposal. With O_MAYEXEC it's better to always open the file, but then use the file's FD to retrieve the actual file name for dlopen(). That approach allows the Kernel to verify DAC permissions, prevents memfd_create() hacks through readlink, and simplifies the hook. * Linux: readlink("/proc/self/fd/%i") * macOS: fcntl F_GETPATH * Windows: GetFileInformationByHandleEx >> - Context is an optional Python object from the caller's context. For >> the import system, it could be the loader instance. 
> > I think the audit event covers this, unless you have some way of using > this context in mind that I can't think of? To be honest I don't have a good use case yet. I just like the idea of having a way to pass some custom thing into an API and to know who called an API. You seem to like it, too. Your hook has a void *userData, but it's not passed into the Python function. :) int PyImport_SetOpenForImportHook(hook_func handler, void *userData) Christian From vstinner at redhat.com Tue Apr 16 08:57:35 2019 From: vstinner at redhat.com (Victor Stinner) Date: Tue, 16 Apr 2019 14:57:35 +0200 Subject: [Python-Dev] PEP 578: Python Runtime Audit Hooks In-Reply-To: <03d9d56d-540c-0eae-2b69-2c0960bc030b@python.org> References: <2cb3740e-ebc2-1839-1d2e-73d1b9f0a445@python.org> <8466c9c4-b5dc-c6c5-6fe4-a49dc2f4f968@python.org> <03d9d56d-540c-0eae-2b69-2c0960bc030b@python.org> Message-ID: On Tue, 16 Apr 2019 at 14:35, Christian Heimes wrote: > * Linux: readlink("/proc/self/fd/%i") That doesn't work if /proc is not mounted, which can occur in a container (where /proc is neither mounted nor bind-mounted from the host /proc). Victor From christian at python.org Tue Apr 16 09:09:34 2019 From: christian at python.org (Christian Heimes) Date: Tue, 16 Apr 2019 15:09:34 +0200 Subject: [Python-Dev] PEP 578: Python Runtime Audit Hooks In-Reply-To: References: <2cb3740e-ebc2-1839-1d2e-73d1b9f0a445@python.org> <8466c9c4-b5dc-c6c5-6fe4-a49dc2f4f968@python.org> <03d9d56d-540c-0eae-2b69-2c0960bc030b@python.org> Message-ID: <9bcfb93b-d52f-7284-7bc6-d629edebacd5@python.org> On 16/04/2019 14.57, Victor Stinner wrote: > On Tue, 16 Apr 2019 at 14:35, Christian Heimes wrote: >> * Linux: readlink("/proc/self/fd/%i") > > That doesn't work if /proc is not mounted, which can occur in a > container (where /proc is neither mounted nor bind-mounted from the host /proc). No, it won't work. But there is much more that breaks when /proc is not mounted. Therefore all container runtimes mount /proc and /sys into containers. I checked systemd-nspawn, podman, and docker. Christian From vstinner at redhat.com Tue Apr 16 10:24:07 2019 From: vstinner at redhat.com (Victor Stinner) Date: Tue, 16 Apr 2019 16:24:07 +0200 Subject: [Python-Dev] bpo-36558: Change time.mktime() return type from float to int? Message-ID: Hi, time.mktime() looks "inconsistent" to me and I would like to change it, but I'm not sure how it impacts backward compatibility. https://bugs.python.org/issue36558 time.mktime() returns a floating point number: >>> type(time.mktime(time.localtime())) <class 'float'> The documentation says: "It returns a floating point number, for compatibility with :func:`.time`." time.time() returns a float because it has sub-second resolution, but the C function mktime() returns an integer number of seconds. Would it make sense to change mktime() return type from float to int? I would like to change mktime() return type to make the function more consistent: all inputs are integers, so it sounds wrong to me to return a float. The result should be an integer as well. How much code would it break? I guess that the main impact is unit tests relying on the exact repr(time.mktime(t)) value. But it's easy to fix the tests: use int(time.mktime(t)) or "%.0f" % time.mktime(t) to never get ".0", or use float(time.mktime(t)) to explicitly cast to a float (which would be a bad but quick fix). Note: I wrote and implemented PEP 564 to avoid any precision loss. mktime() will not start losing precision before year 285,422,891 (which is quite far in the future ;-)).
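To make the behaviour under discussion concrete, here is a small illustration of the current float return value and of the test fixes suggested above (the numbers are examples only and depend on the local clock and timezone):

    import time

    t = time.localtime()
    print(time.mktime(t))            # e.g. 1555426567.0 -- currently a float

    # Workarounds for tests that compare repr() output:
    print(int(time.mktime(t)))       # e.g. 1555426567
    print("%.0f" % time.mktime(t))   # e.g. '1555426567' -- never ends in '.0'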
Victor -- Night gathers, and now my watch begins. It shall not end until my death. From paul at ganssle.io Tue Apr 16 10:41:05 2019 From: paul at ganssle.io (Paul Ganssle) Date: Tue, 16 Apr 2019 10:41:05 -0400 Subject: [Python-Dev] bpo-36558: Change time.mktime() return type from float to int? In-Reply-To: References: Message-ID: I already chimed in on the issue, but for the list, I'll boil my comments down to two questions: 1. For anyone who knows: when the documentation refers to "compatibility with `.time`", is that just saying it was designed that way because .time returns a float (i.e. for /consistency/ with `.time()`), or is there some practical reason that you would want `.time()` and `.mktime()` to return the same type? 2. Mainly for Victor, but anyone can answer: I agree that the natural output of `mktime()` would be `int` if I were designing it today, but would there be any /practical/ benefits for making this change? Are there problems cropping up because it's returning a float? Is it faster to return an integer? Best, Paul On 4/16/19 10:24 AM, Victor Stinner wrote: > Hi, > > time.mktime() looks "inconsistent" to me and I would like to change > it, but I'm not sure how it impacts backward compatibility. > https://bugs.python.org/issue36558 > > time.mktime() returns a floating point number: > >>>> type(time.mktime(time.localtime())) > > > The documentation says: > > "It returns a floating point number, for compatibility with :func:`.time`." > > time.time() returns a float because it has sub-second resolution, but > the C function mktime() returns an integer number of seconds. > > Would it make sense to change mktime() return type from float to int? > > I would like to change mktime() return type to make the function more > consistent: all inputs are integers, it sounds wrong to me to return > float. The result should be integer as well. > > How much code would it break? I guess that the main impact are unit > tests relying on repr(time.mktime(t)) exact value. But it's easy to > fix the tests: use int(time.mktime(t)) or "%.0f" % time.mktime(t) to > never get ".0", or use float(time.mktime(t))) to explicitly cast for a > float (that which be a bad but quick fix). > > Note: I wrote and implemented the PEP 564 to avoid any precision loss. > mktime() will not start loosing precision before year 285,422,891 > (which is quite far in the future ;-)). > > Victor -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: OpenPGP digital signature URL: From stephane at wirtel.be Tue Apr 16 10:44:22 2019 From: stephane at wirtel.be (=?utf-8?B?U3TDqXBoYW5l?= Wirtel) Date: Tue, 16 Apr 2019 16:44:22 +0200 Subject: [Python-Dev] bpo-36558: Change time.mktime() return type from float to int? In-Reply-To: References: Message-ID: <20190416144422.3bwbgl4cvr5rgn3d@xps> >I would like to change mktime() return type to make the function more >consistent: all inputs are integers, it sounds wrong to me to return >float. The result should be integer as well. In C, the signature of mktime is time_t mktime(struct tm *time); from Wikipedia, the Unix time_t data type, on many platforms, is a signed integer, tradionally (32bits). In the newer operating systems, time_t has been widened to 64 bits. 
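A minimal C illustration of that signature, for reference (the printed value is system-dependent):

    #include <stdio.h>
    #include <time.h>

    int main(void)
    {
        time_t now = time(NULL);
        struct tm *local = localtime(&now);   /* broken-down local time */
        time_t seconds = mktime(local);       /* back to integral seconds */

        /* time_t is an integral type (64-bit on most current systems),
           so there is no fractional part on the C side. */
        printf("%lld\n", (long long)seconds);
        return 0;
    }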
-- St?phane Wirtel - https://wirtel.be - @matrixise From vstinner at redhat.com Tue Apr 16 11:16:37 2019 From: vstinner at redhat.com (Victor Stinner) Date: Tue, 16 Apr 2019 17:16:37 +0200 Subject: [Python-Dev] bpo-36558: Change time.mktime() return type from float to int? In-Reply-To: References: Message-ID: Le mar. 16 avr. 2019 ? 16:44, Paul Ganssle a ?crit : > 2. Mainly for Victor, but anyone can answer: I agree that the natural output of `mktime()` would be `int` if I were designing it today, but would there be any practical benefits for making this change? It's just for the consistency of the function regarding to C function mktime() return type and its input types :-) > Are there problems cropping up because it's returning a float? None. Victor -- Night gathers, and now my watch begins. It shall not end until my death. From guido at python.org Tue Apr 16 11:46:31 2019 From: guido at python.org (Guido van Rossum) Date: Tue, 16 Apr 2019 08:46:31 -0700 Subject: [Python-Dev] bpo-36558: Change time.mktime() return type from float to int? In-Reply-To: References: Message-ID: On Tue, Apr 16, 2019 at 8:19 AM Victor Stinner wrote: > Le mar. 16 avr. 2019 ? 16:44, Paul Ganssle a ?crit : > > 2. Mainly for Victor, but anyone can answer: I agree that the natural > output of `mktime()` would be `int` if I were designing it today, but would > there be any practical benefits for making this change? > > It's just for the consistency of the function regarding to C function > mktime() return type and its input types :-) > But all Python times are reported or accept floats -- this allows sub-second precision without using complicated data structures. None of the C functions use floats. Consistency with C should not be the issue -- consistency between the time functions is important. > > Are there problems cropping up because it's returning a float? > > None. > So let's drop the idea. > Victor > -- > Night gathers, and now my watch begins. It shall not end until my death. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (python.org/~guido) *Pronouns: he/him/his **(why is my pronoun here?)* -------------- next part -------------- An HTML attachment was scrubbed... URL: From vano at mail.mipt.ru Tue Apr 16 11:47:35 2019 From: vano at mail.mipt.ru (Ivan Pozdeev) Date: Tue, 16 Apr 2019 18:47:35 +0300 Subject: [Python-Dev] bpo-36558: Change time.mktime() return type from float to int? In-Reply-To: References: Message-ID: <17dc3ff7-b9f1-998f-abe8-cd02e388f5f0@mail.mipt.ru> On 16.04.2019 17:24, Victor Stinner wrote: > Hi, > > time.mktime() looks "inconsistent" to me and I would like to change > it, but I'm not sure how it impacts backward compatibility. > https://bugs.python.org/issue36558 > > time.mktime() returns a floating point number: > >>>> type(time.mktime(time.localtime())) > > > The documentation says: > > "It returns a floating point number, for compatibility with :func:`.time`." > > time.time() returns a float because it has sub-second resolution, but > the C function mktime() returns an integer number of seconds. > > Would it make sense to change mktime() return type from float to int? > > I would like to change mktime() return type to make the function more > consistent: all inputs are integers, it sounds wrong to me to return > float. The result should be integer as well. 
> > How much code would it break? I guess that the main impact are unit > tests relying on repr(time.mktime(t)) exact value. But it's easy to > fix the tests: use int(time.mktime(t)) or "%.0f" % time.mktime(t) to > never get ".0", or use float(time.mktime(t))) to explicitly cast for a > float (that which be a bad but quick fix). I envision it breaking code that relies on implicitly inferring the type of the result from the types of both operands (e.g. arithmetic operations). But for mktime() specifically, I presume the amount of such code very small. > Note: I wrote and implemented the PEP 564 to avoid any precision loss. > mktime() will not start loosing precision before year 285,422,891 > (which is quite far in the future ;-)). > > Victor -- Regards, Ivan From vstinner at redhat.com Tue Apr 16 12:20:05 2019 From: vstinner at redhat.com (Victor Stinner) Date: Tue, 16 Apr 2019 18:20:05 +0200 Subject: [Python-Dev] bpo-36558: Change time.mktime() return type from float to int? In-Reply-To: References: Message-ID: Le mar. 16 avr. 2019 ? 17:46, Guido van Rossum a ?crit : > Consistency with C should not be the issue -- consistency between the time functions is important. > (...) > So let's drop the idea. Ok, I'm fine with that. It was just an idea ;-) I closed the issue. Victor From steve.dower at python.org Tue Apr 16 16:53:26 2019 From: steve.dower at python.org (Steve Dower) Date: Tue, 16 Apr 2019 13:53:26 -0700 Subject: [Python-Dev] PEP 578: Python Runtime Audit Hooks In-Reply-To: <03d9d56d-540c-0eae-2b69-2c0960bc030b@python.org> References: <2cb3740e-ebc2-1839-1d2e-73d1b9f0a445@python.org> <8466c9c4-b5dc-c6c5-6fe4-a49dc2f4f968@python.org> <03d9d56d-540c-0eae-2b69-2c0960bc030b@python.org> Message-ID: <709491c7-702b-a74d-f341-2af6c19f1363@python.org> On 16Apr2019 0532, Christian Heimes wrote: > Sorry, I forgot to reply. > > Do you think it would make sense to split the PEP into two PEPs? The > auditing hook and import opener hook are related, but distinct > improvements. The auditing part looks solid and ready now. The import > opener may need some more refinement. I would also like to get feedback > from some Linux Kernel security engineers first. That will make three PEPs... The only question for the security engineers is "how much context do you need from the calling process", as that's the only thing that will affect the API. It doesn't have to have any implementation right now. And so far, all the context that's been proposed is "may be executed", which is already implied in the open_code() call. I haven't heard any more requests than "give us the filename and let us return the open (and exclusive) handle/descriptor", so this feels like YAGNI. > On 01/04/2019 18.31, Steve Dower wrote: >> In each case there should be associated audit events for tracking the >> intent (and interrupting at that point if it doesn't like the intended >> action), but for the simple case of "let me open this specific file" it >> doesn't really add much. And it almost certainly shouldn't impact >> decision making. > > There is no need to trust the intent flag that much. I would like to > have a way to further narrow down the scope for an open call. This would > allow the caller to tell the hook "I want to open something that should > be a shared library suitable for ctypes". It would allow tighter control. But those don't go through open(), they'll go through dlopen(), right? It's already a totally different code path from "open and read arbitrary bytes". > Audit events are useful and powerful. 
But I don't want to put too much > burden on the auditing framwork. I prefer to have checks that prevent > operations rather than allow operations and audit them. Right, and this is the default position for security defenders (to try and block things) ;) Auditing has been found to be a working balance >>> - Context is an optional Python object from the caller's context. For >>> the import system, it could be the loader instance. >> >> I think the audit event covers this, unless you have some way of using >> this context in mind that I can't think of? > > To be honest I don't have a good use case yet. I just like the idea to > have a way to pass some custom thing into an API and now who called an > API. You seem to like it, too. Your hook has a void *userData, but it's > not passed into the Python function. :) > > int PyImport_SetOpenForImportHook(hook_func handler, void *userData) There is no Python function for the (now named) open_code hook. It can only be set as a C function by an embedder, and that's when the userData is provided. Nothing to do with each individual call - just one value per CPython runtime. (Similarly with the audit hook, but for the Python hooks you can pass a closure or a method - in C you need a separate pointer for this.) Cheers, Steve From sully at msully.net Tue Apr 16 22:05:50 2019 From: sully at msully.net (Michael Sullivan) Date: Tue, 16 Apr 2019 19:05:50 -0700 Subject: [Python-Dev] No longer enable Py_TRACE_REFS by default in debug build In-Reply-To: References: Message-ID: On Tue, Apr 16, 2019 at 2:11 AM Victor Stinner wrote: > Hi Michael, > > Do you know the tracemalloc module? Did you try it? It works on a > regular Python (compiled in debug mode). > > I would be curious to know if tracemalloc also allows you to track the > memory leak. > > Playing around with it a little it does not seem super helpful here (unless I am missing something): it tracks the allocations based on the python call stack, which doesn't help here, in a C extension module generated from python code. Though, in the the mypyc case, we could implement a debug option for creating dummy frames so that we always have a useful call stack. That seems like less of an option for actual hand-written extension modules, though. (Though on the flip side, the python call stacks might be more useful there.) sys.getobjects() is just a list of objects. Do you have a tool written > on top of it to track memory leaks? If yes, how? > > Not really. We have a very simple memory profiler built on top of gc.get_objects() that just reports how many of different types of objects there are and how much memory they are using: https://github.com/python/mypy/blob/master/mypy/memprofile.py. I swapped out gc.get_objects() for sys.getobjects(), observed that we were leaking int objects, and inspected the live int objects, which gave a pretty good clue where the leak was. > Victor > > Le mar. 16 avr. 2019 ? 00:28, Michael Sullivan a ?crit > : > > > > > The main question is if anyone ever used Py_TRACE_REFS? Does someone > > > use sys.getobjects() or PYTHONDUMPREFS environment variable? > > > > I used sys.getobjects() today to track down a memory leak in the > mypyc-compiled version of mypy. > > > > We were leaking memory badly but no sign of the leak was showing up in > mypy's gc.get_objects() based profiler. Using a debug build and switching > to sys.getobjects() showed that we were badly leaking int objects. 
A quick > inspection of the values in question (large and random looking) suggested > we were leaking hash values, and that quickly pointed me to > https://github.com/mypyc/mypyc/pull/562. > > > > I don't have any strong feelings about whether to keep it in the > "default" debug build, though. I was using a debug build that I built > myself with every debug feature that seemed potentially useful. > > > > -sully > > _______________________________________________ > > Python-Dev mailing list > > Python-Dev at python.org > > https://mail.python.org/mailman/listinfo/python-dev > > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/vstinner%40redhat.com > > > > -- > Night gathers, and now my watch begins. It shall not end until my death. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From songofacandy at gmail.com Wed Apr 17 07:10:44 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Wed, 17 Apr 2019 20:10:44 +0900 Subject: [Python-Dev] PEP 7: Adding anonymous union / struct Message-ID: Hi, all. PEP 7 includes some C99 features. I propose to add include anonymous union and struct to the list. https://www.geeksforgeeks.org/g-fact-38-anonymous-union-and-structure/ Anonymous union and struct are C11 feature, not C99. But gcc and MSVC supported it as language extension from before C11. Anonymous union is useful when all union members have different names. Especially, when we need to add dummy member only for padding / alignment, union name looks too verbose: ... # in some struct union { struct { int member1; int member2; } s; long double _dummy; // for largest alignment. } u; ... x.u.s.member1 = 42; vs ... union { struct { int member1; int member2; }; long double _dummy; // for largest alignment. }; ... x.member1 = 42; Does anyone know compiler which can be use to compile Python but doesn't support anonymous union / struct? Regards, -- Inada Naoki From vstinner at redhat.com Wed Apr 17 07:27:39 2019 From: vstinner at redhat.com (Victor Stinner) Date: Wed, 17 Apr 2019 13:27:39 +0200 Subject: [Python-Dev] PEP 7: Adding anonymous union / struct In-Reply-To: References: Message-ID: AIX is somehow supported and uses xlc compiler: does xlc support this C11 feature? Do you want to use it in Python 3.8 and newer only? Victor Le mer. 17 avr. 2019 ? 13:14, Inada Naoki a ?crit : > > Hi, all. > > PEP 7 includes some C99 features. > I propose to add include anonymous union and struct to the list. > https://www.geeksforgeeks.org/g-fact-38-anonymous-union-and-structure/ > > Anonymous union and struct are C11 feature, not C99. > But gcc and MSVC supported it as language extension from before C11. > > Anonymous union is useful when all union members have different names. > Especially, when we need to add dummy member only for padding / alignment, > union name looks too verbose: > > ... # in some struct > union { > struct { > int member1; > int member2; > } s; > long double _dummy; // for largest alignment. > } u; > ... > x.u.s.member1 = 42; > > vs > > ... > union { > struct { > int member1; > int member2; > }; > long double _dummy; // for largest alignment. > }; > ... > x.member1 = 42; > > > Does anyone know compiler which can be use to compile Python but > doesn't support anonymous union / struct? 
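A self-contained version of the comparison sketched above, which can be compiled as-is to check a given compiler (the struct and member names are illustrative, not CPython's actual PyGC_Head):

    #include <stdio.h>

    /* Named union: every access needs the extra ".u.s." path. */
    struct with_named_union {
        union {
            struct {
                int member1;
                int member2;
            } s;
            long double _dummy;   /* present only to force the largest alignment */
        } u;
    };

    /* Anonymous union and struct (C11, and a long-standing gcc/MSVC
       extension): the members are reachable directly on the outer struct. */
    struct with_anonymous_union {
        union {
            struct {
                int member1;
                int member2;
            };
            long double _dummy;
        };
    };

    int main(void)
    {
        struct with_named_union a = {{{1, 2}}};
        struct with_anonymous_union b;

        b.member1 = 42;   /* no ".u.s." prefix needed */
        printf("%d %d %zu %zu\n", a.u.s.member1, b.member1,
               sizeof a, sizeof b);   /* both get long double alignment */
        return 0;
    }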
> > Regards, > -- > Inada Naoki > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/vstinner%40redhat.com -- Night gathers, and now my watch begins. It shall not end until my death. From songofacandy at gmail.com Wed Apr 17 07:47:29 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Wed, 17 Apr 2019 20:47:29 +0900 Subject: [Python-Dev] PEP 7: Adding anonymous union / struct In-Reply-To: References: Message-ID: On Wed, Apr 17, 2019 at 8:27 PM Victor Stinner wrote: > > AIX is somehow supported and uses xlc compiler: does xlc support this > C11 feature? I found the Language Reference for v11.1 (2010/4/13): https://www-01.ibm.com/support/docview.wss?uid=swg27017991 It mentions "anonymous union" on p. 73. I cannot find a language reference for versions older than v11.1, and I cannot find "anonymous struct" in v11.1 either. Maybe we should consider only anonymous unions? > > Do you want to use it in Python 3.8 and newer only? > Yes. In the case of bpo-27987, Python 3.6 and 3.7 use a named union for PyGC_Head, so changing the dummy from "double" to "long double" is enough. In the case of Python 3.8, I removed the dummy from PyGC_Head and stopped using a named union because it is already (implicitly) aligned to two words (16 bytes on 64-bit, 8 bytes on 32-bit platforms). But we can align it more explicitly by using an anonymous union, without adding many `.gc.` prefixes again. Regards, -- Inada Naoki From paul at ganssle.io Wed Apr 17 11:44:52 2019 From: paul at ganssle.io (Paul Ganssle) Date: Wed, 17 Apr 2019 11:44:52 -0400 Subject: [Python-Dev] Adding shlex.join? Message-ID: Hey all, I've been reviewing old "awaiting review" PRs recently, and about a week ago I found PR #7605 , adding shlex.join(), with a corresponding bug at bpo-22454 . The PR's implementation is simple and seems reasonable and decently well-tested, but it has been unreviewed for ~10 months. The reason I'm bringing it up here is that I believe the major blocker here is getting agreement to actually add the function. There doesn't seem to be much /opposition/ in the BPO issue, but given how infrequently the shlex module is changed I'm worried that there may be no one around who feels confident to judge how the interface should evolve. Does anyone feel strongly about this issue? Is there anyone who wants to make a yes/no decision on this feature? Best, Paul P.S. The PR's submitter seems responsive. I made a comment on the documentation and it was addressed in something like 5 minutes. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: OpenPGP digital signature URL: From guido at python.org Wed Apr 17 11:53:26 2019 From: guido at python.org (Guido van Rossum) Date: Wed, 17 Apr 2019 08:53:26 -0700 Subject: [Python-Dev] Adding shlex.join? In-Reply-To: References: Message-ID: I think it's fine to add this. On Wed, Apr 17, 2019 at 8:47 AM Paul Ganssle wrote: > Hey all, > > I've been reviewing old "awaiting review" PRs recently, and about a week > ago I found PR #7605 , > adding shlex.join(), with a corresponding bug at bpo-22454 > . The PR's implementation is simple > and seems reasonable and decently well-tested, but it has been unreviewed > for ~10 months.
> > The reason I'm bringing it up here is that I believe the major blocker > here is getting agreement to actually add the function. There doesn't seem > to be much *opposition* in the BPO issue, but given how infrequently the > shlex module is changed I'm worried that there may be no one around who > feels confident to judge how the interface should evolve. > > Does anyone feel strongly about this issue? Is there anyone who wants to > make a yes/no decision on this feature? > > Best, > Paul > > P.S. The PR's submitter seems responsive. I made a comment on the > documentation and it was addressed in something like 5 minutes. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (python.org/~guido) *Pronouns: he/him/his **(why is my pronoun here?)* -------------- next part -------------- An HTML attachment was scrubbed... URL: From esr at thyrsus.com Wed Apr 17 12:29:25 2019 From: esr at thyrsus.com (Eric S. Raymond) Date: Wed, 17 Apr 2019 12:29:25 -0400 Subject: [Python-Dev] Adding shlex.join? In-Reply-To: References: Message-ID: <20190417162925.GA16594@thyrsus.com> Paul Ganssle : > Hey all, > > I've been reviewing old "awaiting review" PRs recently, and about a week > ago I found PR #7605 , > adding shlex.join(), with a corresponding bug at bpo-22454 > . The PR's implementation is simple > and seems reasonable and decently well-tested, but it has been > unreviewed for ~10 months. > > The reason I'm bringing it up here is that I believe the major blocker > here is getting agreement to actually add the function. There doesn't > seem to be much /opposition/ in the BPO issue, but given how > infrequently the shlex module is changed I'm worried that there may be > no one around who feels confident to judge how the interface should evolve. > > Does anyone feel strongly about this issue? Is there anyone who wants to > make a yes/no decision on this feature? > > Best, > Paul > > P.S. The PR's submitter seems responsive. I made a comment on the > documentation and it was addressed in something like 5 minutes. I'm the person who originally wrote shlex, which I guess makes me the authority on designer's intention. Providing this addition is properly unit-tested (which apparently it is) I don't have any objection to it. Seems like a reasonable idea. So I'll say yes. But I haven't touched this code in a long time. Maybe somebody on the core dev team thinks they own it now; if so, they might well be right. If so, that person should speak up. I suspect, however, that this code has nobody actively maintaining it because it Just Works. In which case, the authority to make this change should rest with the person who took the responsibility to review it. That would be *you.* So my advice is: pull the trigger. Get forgiveness if it turns out you need it. I don't expect you will. -- Eric S. Raymond -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: not available URL: From hodgestar+pythondev at gmail.com Wed Apr 17 13:54:29 2019 From: hodgestar+pythondev at gmail.com (Simon Cross) Date: Wed, 17 Apr 2019 19:54:29 +0200 Subject: [Python-Dev] Adding shlex.join? 
In-Reply-To: <20190417162925.GA16594@thyrsus.com> References: <20190417162925.GA16594@thyrsus.com> Message-ID: Software that "Just Works" and hasn't needed maintenance in years is the best software. :D -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Thu Apr 18 10:44:38 2019 From: guido at python.org (Guido van Rossum) Date: Thu, 18 Apr 2019 07:44:38 -0700 Subject: [Python-Dev] PEP 591 discussion (final qualifier) happening at typing-sig@ In-Reply-To: References: Message-ID: Yes, please add this to the PEP in the rejected ideas section, with the motivation for rejection -- the example can show how to work around it. On Tue, Apr 16, 2019 at 12:51 AM Michael Sullivan wrote: > On Mon, Apr 15, 2019 at 8:12 PM Nathaniel Smith wrote: > >> On Mon, Apr 15, 2019 at 5:00 PM Michael Sullivan >> wrote: >> > >> > I've submitted PEP 591 (Adding a final qualifier to typing) for >> discussion to typing-sig [1]. >> >> I'm not on typing-sig [1] so I'm replying here. >> >> > Here's the abstract: >> > This PEP proposes a "final" qualifier to be added to the ``typing`` >> > module---in the form of a ``final`` decorator and a ``Final`` type >> > annotation---to serve three related purposes: >> > >> > * Declaring that a method should not be overridden >> > * Declaring that a class should not be subclassed >> > * Declaring that a variable or attribute should not be reassigned >> >> I've been meaning to start blocking subclassing at runtime (e.g. like >> [2]), so being able to express that to the typechecker seems like a >> nice addition. I'm assuming though that the '@final' decorator doesn't >> have any runtime effect, so I'd have to say it twice? >> >> @typing.final >> class MyClass(metaclass=othermod.Final): >> ... >> >> Or on 3.6+ with __init_subclass__, it's easy to define a @final >> decorator that works at runtime, but I guess this would have to be a >> different decorator? >> >> @typing.final >> @alsoruntime.final >> class MyClass: >> ... >> >> This seems kinda awkward. Have you considered giving it a runtime >> effect, or providing some way for users to combine these two things >> together on their own? >> >> Nothing else in typing does any type of runtime enforcement, so I'd be > reluctant to start here. > > One approach would be doing something like this (maybe in a support > module): > if typing.TYPE_CHECKING: > from typing import final > else: > from alsoruntime import final > > So that at checking time, the typechecker would use the typing final but > at runtime we'd get something that does enforcement. > (And for the pre-3.6 case, you could maybe use something like > six.add_metaclass in order to specify the metaclass as a decorator.) > > I can add this as an example to the PEP. > > -sully > > >> -n >> >> [1] https://github.com/willingc/pep-communication/issues/1 >> [2] https://stackoverflow.com/a/3949004/1925449 >> >> -- >> Nathaniel J. Smith -- https://vorpus.org >> > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (python.org/~guido) *Pronouns: he/him/his **(why is my pronoun here?)* -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From status at bugs.python.org Fri Apr 19 14:07:53 2019 From: status at bugs.python.org (Python tracker) Date: Fri, 19 Apr 2019 18:07:53 +0000 (UTC) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20190419180753.DC2B552B253@bugs.ams1.psf.io> ACTIVITY SUMMARY (2019-04-12 - 2019-04-19) Python tracker at https://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. Issues counts and deltas: open 7058 ( -5) closed 41363 (+56) total 48421 (+51) Open issues with patches: 2807 Issues opened (33) ================== #30485: Element.findall(path, dict) doesn't insert null namespace https://bugs.python.org/issue30485 reopened by scoder #36345: Deprecate Tools/scripts/serve.py in favour of python -m http.s https://bugs.python.org/issue36345 reopened by matrixise #36620: Documentation missing parameter for Itertools.zip_longest https://bugs.python.org/issue36620 opened by CharlesMerriam #36621: shutil.rmtree follows junctions on windows https://bugs.python.org/issue36621 opened by Jordan Hueckstaedt #36624: cleanup the stdlib and tests with regard to sys.platform usage https://bugs.python.org/issue36624 opened by Michael.Felt #36630: failure of test_colors_funcs in test_curses with ncurses 6.1 https://bugs.python.org/issue36630 opened by xdegaye #36631: test_urllib2net: test_ftp_no_timeout() killed after a timeout https://bugs.python.org/issue36631 opened by vstinner #36632: test_multiprocessing_forkserver: test_rapid_restart() leaked a https://bugs.python.org/issue36632 opened by vstinner #36634: venv: activate.bat fails for venv with parentheses in PATH https://bugs.python.org/issue36634 opened by BWenzel #36635: Add _testinternalcapi module https://bugs.python.org/issue36635 opened by vstinner #36636: Inner exception is not being raised using asyncio.gather https://bugs.python.org/issue36636 opened by Drew Budwin #36640: python ibm_db setup.py post install script does not seem to wo https://bugs.python.org/issue36640 opened by sabakauser #36643: Forward reference is not resolved by dataclasses.fields() https://bugs.python.org/issue36643 opened by mdrachuk #36644: Improve documentation of slice.indices() https://bugs.python.org/issue36644 opened by pewscorner #36645: re.sub() library entry does not adequately document surprising https://bugs.python.org/issue36645 opened by mollison #36646: os.listdir() got permission error in Python3.6 but it's fine i https://bugs.python.org/issue36646 opened by Ryan_D at 163.com #36647: TextTestRunner doesn't honour "buffer" argument https://bugs.python.org/issue36647 opened by Jos?? 
Luis Segura Lucas #36648: MAP_SHARED isn't proper for anonymous mappings for VxWorks https://bugs.python.org/issue36648 opened by lzhao #36650: Cached method implementation no longer works on Python 3.7.3 https://bugs.python.org/issue36650 opened by jaraco #36654: Add example to tokenize.tokenize https://bugs.python.org/issue36654 opened by Windson Yang #36656: Allow os.symlink(src, target, force=True) to prevent race cond https://bugs.python.org/issue36656 opened by Tom Hale #36658: Py_Initialze() throws error 'unable to load the file system en https://bugs.python.org/issue36658 opened by rvq #36659: distutils UnixCCompiler: Remove standard library path from rpa https://bugs.python.org/issue36659 opened by vstinner #36661: Missing dataclass decorator import in dataclasses module docs https://bugs.python.org/issue36661 opened by mfisherlevine #36662: asdict/astuple Dataclass methods https://bugs.python.org/issue36662 opened by gsakkis #36663: pdb: store whole exception information in locals (via user_exc https://bugs.python.org/issue36663 opened by blueyed #36664: argparse: parser aliases in subparsers stores alias in dest va https://bugs.python.org/issue36664 opened by Peter McEldowney #36665: REPL doesn't ensure builtins are available when implicitly rec https://bugs.python.org/issue36665 opened by ncoghlan #36666: threading.Thread should have way to catch an exception thrown https://bugs.python.org/issue36666 opened by Joel Croteau #36667: pdb: restore SIGINT handler in sigint_handler already https://bugs.python.org/issue36667 opened by blueyed #36668: semaphore_tracker is not reused by child processes https://bugs.python.org/issue36668 opened by tomMoral #36669: weakref proxy doesn't support the matrix multiplication operat https://bugs.python.org/issue36669 opened by bup #36670: test suite broken due to cpu usage feature on win 10/ german https://bugs.python.org/issue36670 opened by LorenzMende Most recent 15 issues with no replies (15) ========================================== #36669: weakref proxy doesn't support the matrix multiplication operat https://bugs.python.org/issue36669 #36668: semaphore_tracker is not reused by child processes https://bugs.python.org/issue36668 #36667: pdb: restore SIGINT handler in sigint_handler already https://bugs.python.org/issue36667 #36663: pdb: store whole exception information in locals (via user_exc https://bugs.python.org/issue36663 #36654: Add example to tokenize.tokenize https://bugs.python.org/issue36654 #36647: TextTestRunner doesn't honour "buffer" argument https://bugs.python.org/issue36647 #36644: Improve documentation of slice.indices() https://bugs.python.org/issue36644 #36643: Forward reference is not resolved by dataclasses.fields() https://bugs.python.org/issue36643 #36621: shutil.rmtree follows junctions on windows https://bugs.python.org/issue36621 #36613: asyncio._wait() don't remove callback in case of exception https://bugs.python.org/issue36613 #36606: calling super() causes __class__ to be not defined when sys.se https://bugs.python.org/issue36606 #36603: should pty.openpty() set pty/tty inheritable? 
https://bugs.python.org/issue36603 #36590: Add Bluetooth RFCOMM Support for Windows https://bugs.python.org/issue36590 #36589: Incorrect error handling in curses.update_lines_cols() https://bugs.python.org/issue36589 #36583: Do not swallow exceptions in the _ssl module https://bugs.python.org/issue36583 Most recent 15 issues waiting for review (15) ============================================= #36668: semaphore_tracker is not reused by child processes https://bugs.python.org/issue36668 #36667: pdb: restore SIGINT handler in sigint_handler already https://bugs.python.org/issue36667 #36659: distutils UnixCCompiler: Remove standard library path from rpa https://bugs.python.org/issue36659 #36648: MAP_SHARED isn't proper for anonymous mappings for VxWorks https://bugs.python.org/issue36648 #36645: re.sub() library entry does not adequately document surprising https://bugs.python.org/issue36645 #36635: Add _testinternalcapi module https://bugs.python.org/issue36635 #36634: venv: activate.bat fails for venv with parentheses in PATH https://bugs.python.org/issue36634 #36624: cleanup the stdlib and tests with regard to sys.platform usage https://bugs.python.org/issue36624 #36618: clang expects memory aligned on 16 bytes, but pymalloc aligns https://bugs.python.org/issue36618 #36613: asyncio._wait() don't remove callback in case of exception https://bugs.python.org/issue36613 #36612: Unittest document is not clear on SetUpClass calls https://bugs.python.org/issue36612 #36610: os.sendfile can return EINVAL on Solaris https://bugs.python.org/issue36610 #36608: Replace bundled pip and setuptools with a downloader in the en https://bugs.python.org/issue36608 #36602: Recursive directory list with pathlib.Path.iterdir https://bugs.python.org/issue36602 #36601: signals can be caught by any thread https://bugs.python.org/issue36601 Top 10 most discussed issues (10) ================================= #35866: concurrent.futures deadlock https://bugs.python.org/issue35866 17 msgs #36618: clang expects memory aligned on 16 bytes, but pymalloc aligns https://bugs.python.org/issue36618 14 msgs #30485: Element.findall(path, dict) doesn't insert null namespace https://bugs.python.org/issue30485 10 msgs #36646: os.listdir() got permission error in Python3.6 but it's fine i https://bugs.python.org/issue36646 10 msgs #27987: obmalloc's 8-byte alignment causes undefined behavior https://bugs.python.org/issue27987 8 msgs #36624: cleanup the stdlib and tests with regard to sys.platform usage https://bugs.python.org/issue36624 8 msgs #16079: list duplicate test names with patchcheck https://bugs.python.org/issue16079 7 msgs #31904: Python should support VxWorks RTOS https://bugs.python.org/issue31904 6 msgs #36634: venv: activate.bat fails for venv with parentheses in PATH https://bugs.python.org/issue36634 6 msgs #32782: memoryview & ctypes: incorrect itemsize for empty array https://bugs.python.org/issue32782 5 msgs Issues closed (53) ================== #2007: cookielib lacks FileCookieJar class for Internet Explorer https://bugs.python.org/issue2007 closed by inada.naoki #15917: hg hook to detect unmerged changesets https://bugs.python.org/issue15917 closed by inada.naoki #16254: Make PyUnicode_AsWideCharString() increase temporary https://bugs.python.org/issue16254 closed by inada.naoki #18610: wsgiref.validate expects wsgi.input read to give exactly one a https://bugs.python.org/issue18610 closed by cheryl.sabella #22991: test_gdb leaves the terminal in raw mode with gdb 7.8.1 https://bugs.python.org/issue22991 closed 
by xdegaye #23768: assert on getting the representation of a thread in atexit fun https://bugs.python.org/issue23768 closed by xdegaye #27326: SIGSEV in test_window_funcs of test_curses https://bugs.python.org/issue27326 closed by xdegaye #28055: pyhash's siphash24 assumes alignment of the data pointer https://bugs.python.org/issue28055 closed by vstinner #28809: mention asyncio.gather non-deterministic task starting order https://bugs.python.org/issue28809 closed by cheryl.sabella #31658: xml.sax.parse won't accept path objects https://bugs.python.org/issue31658 closed by scoder #32849: Fatal Python error: Py_Initialize: can't initialize sys standa https://bugs.python.org/issue32849 closed by vstinner #32913: Improve regular expression HOWTO https://bugs.python.org/issue32913 closed by brett.cannon #33783: Use proper class markup for random.Random docs https://bugs.python.org/issue33783 closed by vstinner #34814: makesetup: must link C extensions to libpython when compiled i https://bugs.python.org/issue34814 closed by vstinner #35581: Document @typing.type_check_only https://bugs.python.org/issue35581 closed by gvanrossum #35697: _decimal: Implement the previously rejected changes from #7442 https://bugs.python.org/issue35697 closed by vstinner #35755: On Unix, shutil.which() and subprocess no longer look for the https://bugs.python.org/issue35755 closed by vstinner #36071: Add support for Windows ARM32 in ctypes/libffi https://bugs.python.org/issue36071 closed by steve.dower #36227: Add default_namespace argument to xml.etree.ElementTree.tostri https://bugs.python.org/issue36227 closed by scoder #36263: test_hashlib.test_scrypt() fails on Fedora 29 https://bugs.python.org/issue36263 closed by vstinner #36348: test_imaplib.RemoteIMAP_STARTTLSTest.test_logout() fails rando https://bugs.python.org/issue36348 closed by vstinner #36427: Document that PyEval_RestoreThread and PyGILState_Ensure can t https://bugs.python.org/issue36427 closed by pablogsal #36466: Adding a way to strip annotations from compiled bytecode https://bugs.python.org/issue36466 closed by cary #36508: python-config --ldflags must not contain LINKFORSHARED ("-Xlin https://bugs.python.org/issue36508 closed by vstinner #36558: Change time.mktime() return type from float to int? https://bugs.python.org/issue36558 closed by vstinner #36572: python-snappy install issue during Crossbar install with Pytho https://bugs.python.org/issue36572 closed by SilentGhost #36585: test_posix.py fails due to unsupported RWF_HIPRI https://bugs.python.org/issue36585 closed by pablogsal #36593: isinstance check fails for Mock objects with spec executed und https://bugs.python.org/issue36593 closed by pablogsal #36600: re-enable test in nntplib https://bugs.python.org/issue36600 closed by Marcin Niemira #36605: make tags should also parse Modules/_io/*.c and Modules/_io/*. 
https://bugs.python.org/issue36605 closed by vstinner #36611: Debug memory allocators: remove useless "serialno" field to re https://bugs.python.org/issue36611 closed by vstinner #36616: Optimize thread state handling in function call code https://bugs.python.org/issue36616 closed by vstinner #36622: Inconsistent exponent notation formatting https://bugs.python.org/issue36622 closed by mark.dickinson #36623: Clean unused parser headers https://bugs.python.org/issue36623 closed by pablogsal #36625: Obsolete comments in docstrings in fractions module https://bugs.python.org/issue36625 closed by mark.dickinson #36626: asyncio run_forever blocks indefinitely https://bugs.python.org/issue36626 closed by asvetlov #36627: composing generator expression doesn't work as expected https://bugs.python.org/issue36627 closed by SilentGhost #36628: Enhancement: i-Strings https://bugs.python.org/issue36628 closed by SilentGhost #36629: imaplib test fails with errno 101 https://bugs.python.org/issue36629 closed by vstinner #36633: py_compile.compile: AttributeError on importlib.utils https://bugs.python.org/issue36633 closed by xtreak #36637: Restrict syntax for tuple literals with one element https://bugs.python.org/issue36637 closed by brett.cannon #36638: typeperf.exe is not in all skus of Windows SKUs https://bugs.python.org/issue36638 closed by steve.dower #36639: Provide list.rindex() https://bugs.python.org/issue36639 closed by rhettinger #36641: make docstring in C const https://bugs.python.org/issue36641 closed by inada.naoki #36642: make unicodedata "const" https://bugs.python.org/issue36642 closed by inada.naoki #36649: Windows Store app install registry keys have incorrect paths https://bugs.python.org/issue36649 closed by steve.dower #36651: Asyncio Event Loop documentation inconsistency (call_later and https://bugs.python.org/issue36651 closed by asvetlov #36652: Non-embedded zip distribution https://bugs.python.org/issue36652 closed by steve.dower #36653: Dictionary Key is without ' ' quotes https://bugs.python.org/issue36653 closed by mrabarnett #36655: Division Precision Problem https://bugs.python.org/issue36655 closed by christian.heimes #36657: AttributeError https://bugs.python.org/issue36657 closed by xtreak #36660: TypeError https://bugs.python.org/issue36660 closed by vstinner #1402289: Allow mappings as globals (was: Fix dictionary subclass ...) https://bugs.python.org/issue1402289 closed by rhettinger From alessandro.cucci at gmail.com Sat Apr 20 04:14:08 2019 From: alessandro.cucci at gmail.com (Alessandro Cucci) Date: Sat, 20 Apr 2019 10:14:08 +0200 Subject: [Python-Dev] Python Documentation Translation in italian language Message-ID: Hello folks, I want to start a project for translating the Python Documentation in Italian. I'm reading the PEP545, trying to understand how it works. I founded a Python User Group in my city and I can work with them on the translations, plus next month I will be speaker at Pycon Italy, so I can easily sponsor this project during the talk and reclute more people to work on that. Is there anybody who can help me to start? Thanks, have a nice day. *Alessandro Cucci* -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ikamenshchikov at gmail.com Sat Apr 20 04:56:44 2019 From: ikamenshchikov at gmail.com (Ilya Kamenshchikov) Date: Sat, 20 Apr 2019 10:56:44 +0200 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm Message-ID: I am using concurrent.futures to parallelize independent tasks on multiple cores once in a while. Each time I have a difficulty remembering the specific syntax and have to look it up in old code or google. I would much prefer to be able to find the source through the PyCharm and have autocompletion. It takes adding two lines to the __init__.py of concurrent.futures: (insert on line 19) from .process import ProcessPoolExecutor from .thread import ThreadPoolExecutor I would also guess that it would make the __getattr__ redundant? Am I missing something or can this change be done this way and would indeed be an improvement? Best Regards, -- Ilya Kamenshchikov -------------- next part -------------- An HTML attachment was scrubbed... URL: From songofacandy at gmail.com Sat Apr 20 11:34:10 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Sun, 21 Apr 2019 00:34:10 +0900 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: See https://bugs.python.org/issue32596 -------------- next part -------------- An HTML attachment was scrubbed... URL: From ikamenshchikov at gmail.com Sat Apr 20 12:43:19 2019 From: ikamenshchikov at gmail.com (Ilya Kamenshchikov) Date: Sat, 20 Apr 2019 18:43:19 +0200 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: alright, so would an import under TYPE_CHECKING guard be an option? like: from typing import TYPE_CHECKING if TYPE_CHECKING: from .process import ProcessPoolExecutor from .thread import ThreadPoolExecutor Perhaps we can have both clarity and performance. -------------- next part -------------- An HTML attachment was scrubbed... URL: From tjreedy at udel.edu Sat Apr 20 13:17:57 2019 From: tjreedy at udel.edu (Terry Reedy) Date: Sat, 20 Apr 2019 13:17:57 -0400 Subject: [Python-Dev] Python Documentation Translation in italian language In-Reply-To: References: Message-ID: On 4/20/2019 4:14 AM, Alessandro Cucci wrote: > Hello folks, > I want to start a project for translating the Python Documentation in > Italian. > I'm reading the PEP545, trying to understand how it works. > > I founded a Python User Group in my city and I can work with them on the > translations, plus next month I will be speaker at Pycon Italy, so I can > easily sponsor this project during the talk and reclute more people to > work on that. > > Is there anybody who can help me to start? > Thanks, have a nice day. Devguide: "7.6. Translations There are now several official documentation translations (see section 21.5. Documentation Translations and PEP 545 for details). Discussions about translations occur on the doc-sig mailing list." There is no Italian translation yet. There may or may not be one in progress. Post to doc-sig. https://mail.python.org/mailman/listinfo/doc-sig -- Terry Jan Reedy From songofacandy at gmail.com Sat Apr 20 17:08:36 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Sun, 21 Apr 2019 06:08:36 +0900 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: "import typing" is slow too. 2019?4?21?(?) 1:43 Ilya Kamenshchikov : > alright, so would an import under TYPE_CHECKING guard be an option? 
like: > > from typing import TYPE_CHECKING > if TYPE_CHECKING: > from .process import ProcessPoolExecutor > from .thread import ThreadPoolExecutor > > > Perhaps we can have both clarity and performance. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From v+python at g.nevcal.com Sat Apr 20 18:26:14 2019 From: v+python at g.nevcal.com (Glenn Linderman) Date: Sat, 20 Apr 2019 15:26:14 -0700 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: <4bb10b8b-5cf7-926c-f6a0-f7f365594037@g.nevcal.com> On 4/20/2019 2:08 PM, Inada Naoki wrote: > "import typing" is slow too. > > 2019?4?21?(?) 1:43 Ilya Kamenshchikov >: > > alright, so would an import under TYPE_CHECKING guard be an > option? like: > > from typingimport TYPE_CHECKING > if TYPE_CHECKING: > from .processimport ProcessPoolExecutor > from .threadimport ThreadPoolExecutor > > Perhaps we can have both clarity and performance. > How about: from faketyping import TYPE_CHECKING where faketyping.py: TYPE_CHECKING = None I don't know enough about how TYPE_CHECKING (or typing) is optionally enabled to come up with an exactly correct proposal. -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Sat Apr 20 20:13:27 2019 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 20 Apr 2019 17:13:27 -0700 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: On Sat, Apr 20, 2019 at 2:11 PM Inada Naoki wrote: > > "import typing" is slow too. Many static analysis tools will also accept: TYPE_CHECKING = False if TYPE_CHECKING: ... At least mypy and pylint both treat all variables named TYPE_CHECKING as true, regardless of where they came from. I'm not sure if this is intentional or because they're cutting corners, but it works... -n -- Nathaniel J. Smith -- https://vorpus.org From chris.barker at noaa.gov Mon Apr 22 13:06:20 2019 From: chris.barker at noaa.gov (Chris Barker) Date: Mon, 22 Apr 2019 10:06:20 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: Message-ID: On Fri, Apr 12, 2019 at 10:20 AM Brett Cannon wrote: > >> This doesn't strike me as needing an optimization through a dedicated > method. > maybe a new dict mapping type -- "shared_dict" -- it would be used in places like the csv reader where it makes sense, but wouldn't impact the regular dict at all. you could get really clever an have it auto-convert to a regular dict when any changes were made that are incompatible with the shared keys... -CHB -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From brett at python.org Mon Apr 22 15:40:42 2019 From: brett at python.org (Brett Cannon) Date: Mon, 22 Apr 2019 12:40:42 -0700 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: On Sat, Apr 20, 2019 at 2:10 PM Inada Naoki wrote: > "import typing" is slow too. > But is it so slow as to not do the right thing here and use the 'typing' module as expected? If you have so much work you need to launch some threads or processes to deal with it then a single import isn't going to be your biggest bottleneck. -Brett > > 2019?4?21?(?) 
1:43 Ilya Kamenshchikov : > >> alright, so would an import under TYPE_CHECKING guard be an option? like: >> >> from typing import TYPE_CHECKING >> if TYPE_CHECKING: >> from .process import ProcessPoolExecutor >> from .thread import ThreadPoolExecutor >> >> >> Perhaps we can have both clarity and performance. >> >> _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/brett%40python.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From andrew.svetlov at gmail.com Mon Apr 22 18:21:35 2019 From: andrew.svetlov at gmail.com (Andrew Svetlov) Date: Tue, 23 Apr 2019 01:21:35 +0300 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: I see the chicken and egg problem here. If we are talking about typing module usage -- typeshed is the type hints provider. If PyCharm doesn't want to use it -- it is not CPython problem. I think there is no need to change python code itself but used tooling. On Mon, Apr 22, 2019 at 11:06 PM Brett Cannon wrote: > > > > On Sat, Apr 20, 2019 at 2:10 PM Inada Naoki wrote: >> >> "import typing" is slow too. > > > But is it so slow as to not do the right thing here and use the 'typing' module as expected? If you have so much work you need to launch some threads or processes to deal with it then a single import isn't going to be your biggest bottleneck. > > -Brett > >> >> >> 2019?4?21?(?) 1:43 Ilya Kamenshchikov : >>> >>> alright, so would an import under TYPE_CHECKING guard be an option? like: >>> >>> from typing import TYPE_CHECKING >>> if TYPE_CHECKING: >>> from .process import ProcessPoolExecutor >>> from .thread import ThreadPoolExecutor >>> >>> >>> Perhaps we can have both clarity and performance. >> >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: https://mail.python.org/mailman/options/python-dev/brett%40python.org > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/andrew.svetlov%40gmail.com -- Thanks, Andrew Svetlov From steve.dower at python.org Mon Apr 22 18:46:55 2019 From: steve.dower at python.org (Steve Dower) Date: Mon, 22 Apr 2019 15:46:55 -0700 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: On 22Apr2019 1521, Andrew Svetlov wrote: > I see the chicken and egg problem here. > If we are talking about typing module usage -- typeshed is the type > hints provider. > If PyCharm doesn't want to use it -- it is not CPython problem. > > I think there is no need to change python code itself but used tooling. It's not typeshed related, it's most likely because Python 3.7 Lib/concurrent/future/__init__.py switched from always importing the subclasses to doing it lazily in a module __getattr__ function. I assume for performance, since either submodule may have deep import chains. 
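For anyone who wants to see what that looks like, the pattern is roughly the following (a simplified sketch of the PEP 562 approach, not the exact stdlib code):

# package __init__.py with expensive submodules (simplified sketch)
# "import concurrent.futures" stays cheap; the heavy submodules are only
# imported the first time one of these attributes is actually requested.

__all__ = ['ThreadPoolExecutor', 'ProcessPoolExecutor']   # plus the other names

def __getattr__(name):
    # Module-level __getattr__ (PEP 562): called only when normal attribute
    # lookup on the module fails, i.e. on first access to a lazy name.
    global ThreadPoolExecutor, ProcessPoolExecutor
    if name == 'ThreadPoolExecutor':
        from .thread import ThreadPoolExecutor
        return ThreadPoolExecutor
    if name == 'ProcessPoolExecutor':
        from .process import ProcessPoolExecutor
        return ProcessPoolExecutor
    raise AttributeError("module %r has no attribute %r" % (__name__, name))

def __dir__():
    # Keep dir() and REPL completion honest even though the executor classes
    # are not bound as module attributes until first access.
    return sorted(set(globals()) | set(__all__))

The global statement makes the first access also cache the class on the module, so __getattr__ only runs once per name.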
Presumably PyCharm has not yet added support for this, and so it simply doesn't know how to resolve ThreadPoolExecutor or ProcessPoolExecutor without actually executing code (which most static analysers will hesitate to do, since you don't know if that code is "os.system('rm -rf /')" until it's too late). Perhaps for the sake of IDEs and static analysers we could make a policy for standard library modules to include an "if False:" or "TYPE_CHECKING = False; if TYPE_CHECKING:" block that includes the import statement when adding lazy module attribute resolution? Cheers, Steve From songofacandy at gmail.com Mon Apr 22 18:55:16 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Tue, 23 Apr 2019 07:55:16 +0900 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: On Tue, Apr 23, 2019 at 4:40 AM Brett Cannon wrote: > > On Sat, Apr 20, 2019 at 2:10 PM Inada Naoki wrote: >> >> "import typing" is slow too. > > But is it so slow as to not do the right thing here and use the 'typing' module as expected? I don't know it is not a "right thing" yet. It feel it is just a workaround for PyCharm at the moment. __dir__ and __all__ has ProcessPoolExecutor and ThreadPoolExecutor for interactive shell. So Python REPL can complete them. But we didn't discussed about "static hinting" version of __all__ in PEP 562. If we decide it's a "right way", we can update example code in PEP 562. But when we use lazy import, we want to make import faster. Adding more 3~5ms import time seems not so happy solution. Maybe, can we add TYPE_CHECKING=False in builtins? > If you have so much work you need to launch some threads or processes to deal with it then a single import isn't going to be your biggest bottleneck. Importing futures module doesn't mean the app really need thread or processes. That's why we defer importing ThreadPoolExecutor and ProcessPoolExecutor. And people who want apps like vim starts quickly (~200ms), we want avoid every "significant overhead" as possible. Not only "the biggest bottleneck" is the problem. -- Inada Naoki From songofacandy at gmail.com Mon Apr 22 19:03:05 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Tue, 23 Apr 2019 08:03:05 +0900 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: Message-ID: On Tue, Apr 23, 2019 at 2:18 AM Chris Barker via Python-Dev wrote: > > On Fri, Apr 12, 2019 at 10:20 AM Brett Cannon wrote: >>> >>> >> This doesn't strike me as needing an optimization through a dedicated method. > > maybe a new dict mapping type -- "shared_dict" -- it would be used in places like the csv reader where it makes sense, but wouldn't impact the regular dict at all. > > you could get really clever an have it auto-convert to a regular dict when any changes were made that are incompatible with the shared keys... My current idea is adding builder in somewhere in stdlib (maybe collections?): builder = DictBuilder(keys_tuple) value = builder(values) # repeatedly called. I don't want to add new mapping type because we already have shared key dict, and changing mapping type may cause backward compatibility problem. 
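To make the intended semantics concrete, a rough pure-Python equivalent would be (illustration only; the names and the error behaviour are placeholders, and the real point would be a C implementation that hashes keys_tuple once and stamps out dicts sharing that key table instead of repeating the lookups for every row):

class DictBuilder:
    # Illustrative stand-in for the proposed builder, written in pure Python.
    # A C version would build and validate the key table once in __init__ and
    # reuse it for every produced dict.
    def __init__(self, keys):
        self._keys = tuple(keys)

    def __call__(self, values):
        values = tuple(values)
        if len(values) != len(self._keys):
            raise ValueError("expected %d values, got %d"
                             % (len(self._keys), len(values)))
        return dict(zip(self._keys, values))

# typical use, e.g. in a csv.DictReader-style loop:
builder = DictBuilder(('id', 'name', 'price'))
rows = [builder(row) for row in ((1, 'spam', 12.0), (2, 'eggs', 3.5))]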
Regards, -- Inada Naoki From v+python at g.nevcal.com Mon Apr 22 19:36:32 2019 From: v+python at g.nevcal.com (Glenn Linderman) Date: Mon, 22 Apr 2019 16:36:32 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: Message-ID: <4ba59cfd-c82a-df10-9ff2-bd8d000f85d6@g.nevcal.com> On 4/22/2019 4:03 PM, Inada Naoki wrote: > On Tue, Apr 23, 2019 at 2:18 AM Chris Barker via Python-Dev > wrote: >> On Fri, Apr 12, 2019 at 10:20 AM Brett Cannon wrote: >>>> >>> This doesn't strike me as needing an optimization through a dedicated method. >> maybe a new dict mapping type -- "shared_dict" -- it would be used in places like the csv reader where it makes sense, but wouldn't impact the regular dict at all. >> >> you could get really clever an have it auto-convert to a regular dict when any changes were made that are incompatible with the shared keys... > > My current idea is adding builder in somewhere in stdlib (maybe collections?): > > builder = DictBuilder(keys_tuple) > value = builder(values) # repeatedly called. > > I don't want to add new mapping type because we already have shared key dict, > and changing mapping type may cause backward compatibility problem. > > > Regards, As a heavy user of some self-written code that does stuff very similar to csv reader, and creates lots of same-key dicts, I'd be supportive of a performance enhancing solution here, although I haven't done a detailed study of where the time is currently spent. Is the problem that the existing shared key dict isn't always detected? Or just that knowing in advance that it is expected to be a shared key dict can save the detection work? I do know that in my code, I have a complete list of keys and values when I create each dict, and would be happy to tweak it to use the most performance technique. The above looks like a nice interface, assuming that values is expected to be in the same iterable order as keys_tuple (but is there a need for keys_tuple to be a tuple? could it be any iterable?). -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve at pearwood.info Mon Apr 22 20:19:30 2019 From: steve at pearwood.info (Steven D'Aprano) Date: Tue, 23 Apr 2019 10:19:30 +1000 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: Message-ID: <20190423001930.GL3010@ando.pearwood.info> On Mon, Apr 22, 2019 at 10:06:20AM -0700, Chris Barker via Python-Dev wrote: > maybe a new dict mapping type -- "shared_dict" -- it would be used in > places like the csv reader where it makes sense, but wouldn't impact the > regular dict at all. > > you could get really clever an have it auto-convert to a regular dict when > any changes were made that are incompatible with the shared keys... Oh, you mean just like regular dicts with shared keys already do :-) https://www.python.org/dev/peps/pep-0412/ Perhaps I've missed something in this discussion, but isn't this a matter of just making the existing shared-keys functionality explicitly usable rather than just purely implicit? Quoting from the PEP: When dictionaries are created to fill the __dict__ slot of an object, they are created in split form. The keys table is cached in the type, potentially allowing all attribute dictionaries of instances of one class to share keys. In the event of the keys of these dictionaries starting to diverge, individual dictionaries will lazily convert to the combined-table form. There's no explicit interface to control this; it all happens by magic, behind the scenes. 
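You can see the magic indirectly, if you're curious. The sharing itself isn't exposed anywhere, but on current CPython the per-instance dicts are usually reported as smaller than an equivalent literal dict because the keys live on the type (exact numbers vary by version and build):

import sys

class Point:
    def __init__(self, x, y):
        self.x = x
        self.y = y

p, q = Point(1, 2), Point(3, 4)

# Both instance dicts share one keys table cached on the class, so each
# per-instance dict only needs to store its own values.
print(sys.getsizeof(p.__dict__), sys.getsizeof(q.__dict__))

# A standalone dict with the same contents carries its own keys and hashes,
# so it normally shows up larger.
print(sys.getsizeof({'x': 1, 'y': 2}))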
I think the proposal here is to add some sort of interface, possibly a new method, to explicitly use key sharing. -- Steven From v+python at g.nevcal.com Mon Apr 22 21:22:30 2019 From: v+python at g.nevcal.com (Glenn Linderman) Date: Mon, 22 Apr 2019 18:22:30 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: <20190423001930.GL3010@ando.pearwood.info> References: <20190423001930.GL3010@ando.pearwood.info> Message-ID: <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> On 4/22/2019 5:19 PM, Steven D'Aprano wrote: > Oh, you mean just like regular dicts with shared keys already do:-) > > https://www.python.org/dev/peps/pep-0412/ > > Perhaps I've missed something in this discussion, but isn't this a > matter of just making the existing shared-keys functionality explicitly > usable rather than just purely implicit? Quoting from the PEP: > > When dictionaries are created to fill the __dict__ slot of an object, > they are created in split form. The keys table is cached in the type, > potentially allowing all attribute dictionaries of instances of one > class to share keys. In the event of the keys of these dictionaries > starting to diverge, individual dictionaries will lazily convert to > the combined-table form. > > There's no explicit interface to control this; it all happens by magic, > behind the scenes. I think the proposal here is to add some sort of > interface, possibly a new method, to explicitly use key sharing. Thanks for the PEP reference; I'd forgotten some of the details, and hadn't yet gone to look them up. Yes, it is all magic, but is only available for object __dict__ slot dicts. I'd forgotten that that was the "detection" mechanism.? In the general case, it would be too time-consuming to examine all existing dicts to discover some that might accidentally have the same keys, whereas Mark realized that objects very frequently have __dict__ slot dictionaries with the same keys, and were ripe for (significant memory and minor performance) optimization. Inada is now proposing a way to allow the coder to suggest a group of dictionaries that might benefit from the same gains, by preclassifying non-__dict__ slot dictionaries to do similar sharing. CSV reader is an exemplary candidate, because it creates groups of dicts that use the same keys. (column names). I have other code that does similar things, that would get similar benefits. Seems like since it is just an interface to existing builtin code, that the one interface function (or dictionary factory class) could just as well be a builtin function, instead of requiring an import. -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve.dower at python.org Mon Apr 22 22:21:41 2019 From: steve.dower at python.org (Steve Dower) Date: Mon, 22 Apr 2019 19:21:41 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> Message-ID: On 22Apr2019 1822, Glenn Linderman wrote: > Inada is now proposing a way to allow the coder to suggest a group of > dictionaries that might benefit from the same gains, by preclassifying > non-__dict__ slot dictionaries to do similar sharing. > > CSV reader is an exemplary candidate, because it creates groups of dicts > that use the same keys. (column names). I have other code that does > similar things, that would get similar benefits. 
> > Seems like since it is just an interface to existing builtin code, that > the one interface function (or dictionary factory class) could just as > well be a builtin function, instead of requiring an import. Sounds like a similar optimisation to sys.intern() is for strings. I see no reason to try and avoid an import here - it's definitely a special-case situation - but otherwise having a function to say "clone and update this dict" that starts by sharing the keys in the same way that __dict__ does (including the transformation when necessary) seems like an okay addition. Maybe copy() could just be enabled for this? Cheers, Steve From steve.dower at python.org Mon Apr 22 22:27:43 2019 From: steve.dower at python.org (Steve Dower) Date: Mon, 22 Apr 2019 19:27:43 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> Message-ID: <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> On 22Apr2019 1921, Steve Dower wrote: > On 22Apr2019 1822, Glenn Linderman wrote: >> Inada is now proposing a way to allow the coder to suggest a group of >> dictionaries that might benefit from the same gains, by preclassifying >> non-__dict__ slot dictionaries to do similar sharing. >> >> CSV reader is an exemplary candidate, because it creates groups of >> dicts that use the same keys. (column names). I have other code that >> does similar things, that would get similar benefits. >> >> Seems like since it is just an interface to existing builtin code, >> that the one interface function (or dictionary factory class) could >> just as well be a builtin function, instead of requiring an import. > > Sounds like a similar optimisation to sys.intern() is for strings. > > I see no reason to try and avoid an import here - it's definitely a > special-case situation - but otherwise having a function to say "clone > and update this dict" that starts by sharing the keys in the same way > that __dict__ does (including the transformation when necessary) seems > like an okay addition. Maybe copy() could just be enabled for this? Or possibly just "dict(existing_dict).update(new_items)". My primary concern is still to avoid making CPython performance characteristics part of the Python language definition. That only makes it harder for alternate implementations. (Even though I was out-voted last time on this issue since all the publicly-known alternate implementations said it would be okay... I'm still going to put in a vote for avoiding new language semantics for the sake of a single runtime's performance characteristics.) Cheers, Steve From v+python at g.nevcal.com Tue Apr 23 00:19:08 2019 From: v+python at g.nevcal.com (Glenn Linderman) Date: Mon, 22 Apr 2019 21:19:08 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> Message-ID: On 4/22/2019 7:27 PM, Steve Dower wrote: > On 22Apr2019 1921, Steve Dower wrote: >> On 22Apr2019 1822, Glenn Linderman wrote: >>> Inada is now proposing a way to allow the coder to suggest a group >>> of dictionaries that might benefit from the same gains, by >>> preclassifying non-__dict__ slot dictionaries to do similar sharing. >>> >>> CSV reader is an exemplary candidate, because it creates groups of >>> dicts that use the same keys. 
(column names). I have other code that >>> does similar things, that would get similar benefits. >>> >>> Seems like since it is just an interface to existing builtin code, >>> that the one interface function (or dictionary factory class) could >>> just as well be a builtin function, instead of requiring an import. >> >> Sounds like a similar optimisation to sys.intern() is for strings. >> >> I see no reason to try and avoid an import here - it's definitely a >> special-case situation - but otherwise having a function to say >> "clone and update this dict" that starts by sharing the keys in the >> same way that __dict__ does (including the transformation when >> necessary) seems like an okay addition. Maybe copy() could just be >> enabled for this? > > Or possibly just "dict(existing_dict).update(new_items)". > > My primary concern is still to avoid making CPython performance > characteristics part of the Python language definition. That only > makes it harder for alternate implementations. (Even though I was > out-voted last time on this issue since all the publicly-known > alternate implementations said it would be okay... I'm still going to > put in a vote for avoiding new language semantics for the sake of a > single runtime's performance characteristics.) While Inada's suggested DictBuilder interface was immediately obvious, I don't get how either copy or update would achieve the goal. Perhaps you could explain? Particularly, what would be the trigger that would make dict() choose to create a shared key dictionary from the start? Unless it is known that there will be lots of (mostly static) dictionaries with the same set of keys at the time of creation of the first one, creating a shared key dictionary in every case would cause later inefficiencies in converting them, when additional items are added? (I'm assuming without knowledge that a single shared key dictionary is less efficient than a single regular dictionary.) -------------- next part -------------- An HTML attachment was scrubbed... URL: From v+python at g.nevcal.com Tue Apr 23 00:28:46 2019 From: v+python at g.nevcal.com (Glenn Linderman) Date: Mon, 22 Apr 2019 21:28:46 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> Message-ID: <72b0f6a8-1110-1b5b-d796-94a1e5d28d70@g.nevcal.com> On 4/22/2019 7:27 PM, Steve Dower wrote: > On 22Apr2019 1921, Steve Dower wrote: >> On 22Apr2019 1822, Glenn Linderman wrote: >>> Inada is now proposing a way to allow the coder to suggest a group >>> of dictionaries that might benefit from the same gains, by >>> preclassifying non-__dict__ slot dictionaries to do similar sharing. >>> >>> CSV reader is an exemplary candidate, because it creates groups of >>> dicts that use the same keys. (column names). I have other code that >>> does similar things, that would get similar benefits. >>> >>> Seems like since it is just an interface to existing builtin code, >>> that the one interface function (or dictionary factory class) could >>> just as well be a builtin function, instead of requiring an import. >> >> Sounds like a similar optimisation to sys.intern() is for strings. 
>> >> I see no reason to try and avoid an import here - it's definitely a >> special-case situation - but otherwise having a function to say >> "clone and update this dict" that starts by sharing the keys in the >> same way that __dict__ does (including the transformation when >> necessary) seems like an okay addition. Maybe copy() could just be >> enabled for this? > > Or possibly just "dict(existing_dict).update(new_items)". > > My primary concern is still to avoid making CPython performance > characteristics part of the Python language definition. That only > makes it harder for alternate implementations. (Even though I was > out-voted last time on this issue since all the publicly-known > alternate implementations said it would be okay... I'm still going to > put in a vote for avoiding new language semantics for the sake of a > single runtime's performance characteristics.) I note that dict() doesn't have a method to take two parallel iterables of keys/values and create a dict... if it did, that could be a trigger that a shared key dict might be appropriate... it seems more likely that data in that form is dealing with rows and columns, instead of the forms currently accepted by dict(). Perhaps an alternate constructor that took data in that form, AND defined an optional parameter to trigger a shared dict, would be a useful addition to the language.? Other implementations could ignore the optional parameter if they want, and the implementation would be a one-liner calling the current constructor and zip()ing the parameters. The alternate constructor would be nice even if shared key dicts were not particularly needed in an application, and would provide a method of adding a trigger for the shared key optimization when appropriate. -------------- next part -------------- An HTML attachment was scrubbed... URL: From songofacandy at gmail.com Tue Apr 23 00:43:07 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Tue, 23 Apr 2019 13:43:07 +0900 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> Message-ID: On Tue, Apr 23, 2019 at 11:30 AM Steve Dower wrote: > > Or possibly just "dict(existing_dict).update(new_items)". > Do you mean .update accepts values tuple? I can't think it's > My primary concern is still to avoid making CPython performance > characteristics part of the Python language definition. That only makes > it harder for alternate implementations. Note that this proposal is not only for key sharing dict: * We can avoid rebuilding hash table again and again. * We can avoid checking duplicated keys again and again. These characteristics are not only for Python, but for all mapping implementations using hash table. -- Inada Naoki From steve.dower at python.org Tue Apr 23 01:54:39 2019 From: steve.dower at python.org (Steve Dower) Date: Mon, 22 Apr 2019 22:54:39 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> Message-ID: <31b4e1aa-ef17-6ece-5698-acb1aee9fb35@python.org> On 22Apr2019 2143, Inada Naoki wrote: > On Tue, Apr 23, 2019 at 11:30 AM Steve Dower wrote: >> >> Or possibly just "dict(existing_dict).update(new_items)". 
>> > > Do you mean .update accepts values tuple? > I can't think it's Not sure what you were going to go on to say here, but why not? If it's a key-sharing dict, then all the keys are strings. We know that when we go to do the update, so we can intern all the strings (going to do that anyway) and then it's a quick check if it already exists. If it's a regular dict, then we calculate hashes as normal. Updating the value is just a decref, incref and assignment. If not all these conditions are met, we convert to a regular dict. The proposed function was going to raise an error in this case, so all we've done is make it transparent. The biggest downside is now you don't get a warning that your preferred optimization isn't actually working when you pass in new_items with different keys from what were in existing_dict. Note that it .update() would probably require a dict or key/value tuples here - but if you have the keys in a tuple already then zip() is going to be good enough for setting it (in fact, zip(existing_dict, new_values) should be fine, and we can internally special-case that scenario, too). I'd assumed the benefit was in memory usage after construction, rather than speed-to-construct, since everyone keeps talking about "key-sharing dictionaries" and not "arrays" ;) (Randomizing side note: is this scenario enough to make a case for a built-in data frame type?) >> My primary concern is still to avoid making CPython performance >> characteristics part of the Python language definition. That only makes >> it harder for alternate implementations. > > Note that this proposal is not only for key sharing dict: > > * We can avoid rebuilding hash table again and again. > * We can avoid checking duplicated keys again and again. > > These characteristics are not only for Python, but for all mapping > implementations using hash table. I believe all of these are met by making d2=dict(d1) construct a dict d2 that shares keys with d1 by default. Can you show how they are not? * when you only d2.update existing keys, no need to rebuild the table * a duplicated key overwrites multiple times - what else are you going to do? This is already easiest, fastest, uses the least memory and is most consistent with every other form of setting dict items. Why complicate things by checking them? Let the caller do it Cheers, Steve From steve.dower at python.org Tue Apr 23 01:59:56 2019 From: steve.dower at python.org (Steve Dower) Date: Mon, 22 Apr 2019 22:59:56 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> Message-ID: On 22Apr2019 2119, Glenn Linderman wrote: > While Inada's suggested DictBuilder interface was immediately obvious, I > don't get how either copy or update would achieve the goal. Perhaps you > could explain? Particularly, what would be the trigger that would make > dict() choose to create a shared key dictionary from the start? Unless > it is known that there will be lots of (mostly static) dictionaries with > the same set of keys at the time of creation of the first one, creating > a shared key dictionary in every case would cause later inefficiencies > in converting them, when additional items are added? (I'm assuming > without knowledge that a single shared key dictionary is less efficient > than a single regular dictionary.) 
Passing a dictionary to the dict() constructor creates a copy of that dictionary (as does copy.copy()). What other trigger do you need to decide "it contains the same keys"? It's a copy of the original dict, so by definition at that point it may as well share its entire contents with the original. Basically this is just a partial copy-on-write, where we copy values eagerly - since they're almost certainly going to change - and keys lazily - since there are known scenarios where they are not going to be changed, but we'll pay the cost later if it turns out they are. Cheers, Steve From songofacandy at gmail.com Tue Apr 23 03:08:01 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Tue, 23 Apr 2019 16:08:01 +0900 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: <31b4e1aa-ef17-6ece-5698-acb1aee9fb35@python.org> References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> <31b4e1aa-ef17-6ece-5698-acb1aee9fb35@python.org> Message-ID: On Tue, Apr 23, 2019 at 2:54 PM Steve Dower wrote: > > On 22Apr2019 2143, Inada Naoki wrote: > > On Tue, Apr 23, 2019 at 11:30 AM Steve Dower wrote: > >> > >> Or possibly just "dict(existing_dict).update(new_items)". > >> > > > > Do you mean .update accepts values tuple? > > I can't think it's > > Not sure what you were going to go on to say here, but why not? Sorry, I sent mail without finishing. dict.update() has too many overloading. Adding values_tuple is impossible without breaking backward compatibility. But I think you're saying about items_sequence, not values. > > If it's a key-sharing dict, then all the keys are strings. We know that > when we go to do the update, so we can intern all the strings (going to > do that anyway) and then it's a quick check if it already exists. If > it's a regular dict, then we calculate hashes as normal. Updating the > value is just a decref, incref and assignment. There are some problem. 1. Searching hash table is not zero-cost, comparing to appending to sequence. This cost is very near to building new hash tables. 2. In my proposal, main user is csv.DictReader or sql.DictCursor. They parse only values on each rows. So they need to use map. 3. (CPython only) dict.copy(), dict(dict), and dict.update() are general purpose methods. There is no obvious place to start using key-sharing dict. That's why I proposed specific method / function for specific purpose. > > Note that it .update() would probably require a dict or key/value tuples > here - but if you have the keys in a tuple already then zip() is going > to be good enough for setting it (in fact, zip(existing_dict, > new_values) should be fine, and we can internally special-case that > scenario, too). If *CPython* specialized dict(zip(dict, values)), it still be CPython implementation detail. Do you want recommend using such CPython hacky optimization? Should we use such optimization in stdlib, even if it will be slower than dict(zip(keys_tuple, values)) on some other Python implementations? Or do you propose making dict(zip(dict, values)) optimization as language specification? One obvious advantage of having DictBuilder is it is for specific purpose. It has at least same performance to dict(zip(keys, values)) on all Python implementations. Libraries like csv parser can use it without worrying about its performance on Python other than CPython. 
> I'd assumed the benefit was in memory usage after > construction, rather than speed-to-construct, since everyone keeps > talking about "key-sharing dictionaries" and not "arrays" ;) Both is important. I had talked about non key-sharing dict. > (Randomizing side note: is this scenario enough to make a case for a > built-in data frame type?) https://xkcd.com/927/ > >> My primary concern is still to avoid making CPython performance > >> characteristics part of the Python language definition. That only makes > >> it harder for alternate implementations. > > > > Note that this proposal is not only for key sharing dict: > > > > * We can avoid rebuilding hash table again and again. > > * We can avoid checking duplicated keys again and again. > > > > These characteristics are not only for Python, but for all mapping > > implementations using hash table. > > I believe all of these are met by making d2=dict(d1) construct a dict d2 > that shares keys with d1 by default. Can you show how they are not? If you want only copy, it's same. > > * when you only d2.update existing keys, no need to rebuild the table > * a duplicated key overwrites multiple times - what else are you going > to do? But all keys should be looked up. It is very similar overhead to rebuilding hash table. > This is already easiest, fastest, uses the least memory and is > most consistent with every other form of setting dict items. Why > complicate things by checking them? Let the caller do it As I wrote above, it is: * slower than my proposal. * no obvious place to start using key sharing dict. -- Inada Naoki From v+python at g.nevcal.com Tue Apr 23 03:34:53 2019 From: v+python at g.nevcal.com (Glenn Linderman) Date: Tue, 23 Apr 2019 00:34:53 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> Message-ID: <5feb2cd6-8075-3589-e091-a79ce27cc25f@g.nevcal.com> On 4/22/2019 10:59 PM, Steve Dower wrote: > On 22Apr2019 2119, Glenn Linderman wrote: >> While Inada's suggested DictBuilder interface was immediately >> obvious, I don't get how either copy or update would achieve the >> goal. Perhaps you could explain? Particularly, what would be the >> trigger that would make dict() choose to create a shared key >> dictionary from the start? Unless it is known that there will be lots >> of (mostly static) dictionaries with the same set of keys at the time >> of creation of the first one, creating a shared key dictionary in >> every case would cause later inefficiencies in converting them, when >> additional items are added? (I'm assuming without knowledge that a >> single shared key dictionary is less efficient than a single regular >> dictionary.) > > Passing a dictionary to the dict() constructor creates a copy of that > dictionary (as does copy.copy()). What other trigger do you need to > decide "it contains the same keys"? It's a copy of the original dict, > so by definition at that point it may as well share its entire > contents with the original. But if the original dictionary wasn't created with shared keys... the copy can't share them either.? Or are you suggesting adding new code to create a shared key dictionary from one that isn't? 
> > Basically this is just a partial copy-on-write, where we copy values > eagerly - since they're almost certainly going to change - and keys > lazily - since there are known scenarios where they are not going to > be changed, but we'll pay the cost later if it turns out they are. > > Cheers, > Steve > -------------- next part -------------- An HTML attachment was scrubbed... URL: From andrew.svetlov at gmail.com Tue Apr 23 07:58:53 2019 From: andrew.svetlov at gmail.com (Andrew Svetlov) Date: Tue, 23 Apr 2019 14:58:53 +0300 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: I agree that `from typing import TYPE_CHECKING` is not desirable from the import time reduction perspective. >From my understanding code completion *can* be based on type hinting to avoid actual code execution. That's why I've mentioned that typeshed already has the correct type information. if TYPE_CHECKING: import ... requires mypy modification. if False: import ... Works right now for stdlib (mypy ignores stdlib code but uses typeshed anyway) but looks a little cryptic. Requires a comprehensive comment at least. On Tue, Apr 23, 2019 at 1:59 AM Inada Naoki wrote: > > On Tue, Apr 23, 2019 at 4:40 AM Brett Cannon wrote: > > > > On Sat, Apr 20, 2019 at 2:10 PM Inada Naoki wrote: > >> > >> "import typing" is slow too. > > > > But is it so slow as to not do the right thing here and use the 'typing' module as expected? > > I don't know it is not a "right thing" yet. It feel it is just a > workaround for PyCharm at the moment. > > __dir__ and __all__ has ProcessPoolExecutor and ThreadPoolExecutor for > interactive shell. So Python REPL can complete them. But we didn't discussed > about "static hinting" version of __all__ in PEP 562. > > If we decide it's a "right way", we can update example code in PEP 562. > > But when we use lazy import, we want to make import faster. > Adding more 3~5ms import time seems not so happy solution. > > Maybe, can we add TYPE_CHECKING=False in builtins? > > > > If you have so much work you need to launch some threads or processes to deal with it then a single import isn't going to be your biggest bottleneck. > > Importing futures module doesn't mean the app really need > thread or processes. That's why we defer importing ThreadPoolExecutor > and ProcessPoolExecutor. > > And people who want apps like vim starts quickly (~200ms), we want avoid > every "significant overhead" as possible. Not only "the biggest bottleneck" > is the problem. 
> > -- > Inada Naoki > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/andrew.svetlov%40gmail.com -- Thanks, Andrew Svetlov From steve.dower at python.org Tue Apr 23 11:13:32 2019 From: steve.dower at python.org (Steve Dower) Date: Tue, 23 Apr 2019 08:13:32 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: <5feb2cd6-8075-3589-e091-a79ce27cc25f@g.nevcal.com> References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> <5feb2cd6-8075-3589-e091-a79ce27cc25f@g.nevcal.com> Message-ID: <8939429a-cc81-6cc5-c4a7-2f0a5ca01ac6@python.org> On 23Apr2019 0034, Glenn Linderman wrote: > On 4/22/2019 10:59 PM, Steve Dower wrote: >> On 22Apr2019 2119, Glenn Linderman wrote: >>> While Inada's suggested DictBuilder interface was immediately >>> obvious, I don't get how either copy or update would achieve the >>> goal. Perhaps you could explain? Particularly, what would be the >>> trigger that would make dict() choose to create a shared key >>> dictionary from the start? Unless it is known that there will be lots >>> of (mostly static) dictionaries with the same set of keys at the time >>> of creation of the first one, creating a shared key dictionary in >>> every case would cause later inefficiencies in converting them, when >>> additional items are added? (I'm assuming without knowledge that a >>> single shared key dictionary is less efficient than a single regular >>> dictionary.) >> >> Passing a dictionary to the dict() constructor creates a copy of that >> dictionary (as does copy.copy()). What other trigger do you need to >> decide "it contains the same keys"? It's a copy of the original dict, >> so by definition at that point it may as well share its entire >> contents with the original. > > But if the original dictionary wasn't created with shared keys... the > copy can't share them either.? Or are you suggesting adding new code to > create a shared key dictionary from one that isn't? This is a good point. Maybe dict.fromkeys() could do it? Or a sys.intern-like function (which is why I brought up that precedent). The point is to make it an optional benefit rather than strict language/library semantics. Is there a cost to using a key sharing dict that is prohibitive when the keys aren't actually being shared? Cheers, Steve From steve.dower at python.org Tue Apr 23 11:33:07 2019 From: steve.dower at python.org (Steve Dower) Date: Tue, 23 Apr 2019 08:33:07 -0700 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> <31b4e1aa-ef17-6ece-5698-acb1aee9fb35@python.org> Message-ID: On 23Apr2019 0008, Inada Naoki wrote: > On Tue, Apr 23, 2019 at 2:54 PM Steve Dower wrote: >> >> On 22Apr2019 2143, Inada Naoki wrote: >>> On Tue, Apr 23, 2019 at 11:30 AM Steve Dower wrote: >>>> >>>> Or possibly just "dict(existing_dict).update(new_items)". >>>> >>> >>> Do you mean .update accepts values tuple? >>> I can't think it's >> >> Not sure what you were going to go on to say here, but why not? > > Sorry, I sent mail without finishing. > > dict.update() has too many overloading. > Adding values_tuple is impossible without breaking backward compatibility. 
> > But I think you're saying about items_sequence, not values. Right. I'm specifically trying to avoid changing public APIs at all (including adding anything new, if possible) by identifying suitable patterns that we can handle specially to provide a transparent speed improvement. >> If it's a key-sharing dict, then all the keys are strings. We know that >> when we go to do the update, so we can intern all the strings (going to >> do that anyway) and then it's a quick check if it already exists. If >> it's a regular dict, then we calculate hashes as normal. Updating the >> value is just a decref, incref and assignment. > > There are some problem. > > 1. Searching hash table is not zero-cost, comparing to appending to sequence. > This cost is very near to building new hash tables. If we know that you're sharing keys with the new items then we can skip the search. This was my point about the d2 = copy(d1); d2.update(zip(d2, values)) idea: def update(self, items): if isinstance(items, ZipObject): # whatever the type is called if are_sharing_keys(self, items.sequence_1): # fast update from iter(items.sequence_2) return # regular update from iter(items) Totally transparent and encourages composition of existing builtins. It's a bit of a trick and may not be as obvious as a new method, but it's backwards compatible at least as far as ordered dicts (which is a requirement of any of these approaches anyway, yes?) > 2. In my proposal, main user is csv.DictReader or sql.DictCursor. > They parse only values on each rows. So they need to use map. In that case, use a private helper. _csv already has a native module. We don't need to add new public APIs for internal optimisations, provided there is a semantically equivalent way to do it without the internal API. > 3. (CPython only) dict.copy(), dict(dict), and dict.update() are general purpose > methods. There is no obvious place to start using key-sharing dict. See my reply to Glenn, but potentially fromkeys() could start with the key-sharing dict and then copy()/dict() could continue sharing it (hopefully they already do?). > That's why I proposed specific method / function for specific purpose. > >> >> Note that it .update() would probably require a dict or key/value tuples >> here - but if you have the keys in a tuple already then zip() is going >> to be good enough for setting it (in fact, zip(existing_dict, >> new_values) should be fine, and we can internally special-case that >> scenario, too). > > If *CPython* specialized dict(zip(dict, values)), it still be CPython > implementation detail. > Do you want recommend using such CPython hacky optimization? > Should we use such optimization in stdlib, even if it will be slower > than dict(zip(keys_tuple, values)) on some other Python implementations? We do "hacky" optimisations everywhere :) The point of the runtime is to let users write code that works and we do the effort behind the scenes to make it efficient. We're not C - we're here to help our users. The point is that it will work on other implementations - including previous versions of CPython - and those are free to optimise it however they like. > Or do you propose making dict(zip(dict, values)) optimization as > language specification? Definitely not! It's just a pattern that we have the ability to recognize and optimize at runtime, so why not do it? > One obvious advantage of having DictBuilder is it is for specific > purpose. It has at least same performance to dict(zip(keys, values)) > on all Python implementations. 
> Libraries like csv parser can use it without worrying about its performance > on Python other than CPython. A singular purpose isn't necessarily an obvious advantage. We're better off with generic building blocks that our users can compose in ways that were originally non-obvious (and then as patterns emerge we can look at ways to simplify or formalise them). >> (Randomizing side note: is this scenario enough to make a case for a >> built-in data frame type?) > > https://xkcd.com/927/ Yep. The difference is that as the language team, our standard wins by default ;) (For those who don't click links, it's pointing at the "let's make a new standard" XKCD comic) >> * when you only d2.update existing keys, no need to rebuild the table >> * a duplicated key overwrites multiple times - what else are you going >> to do? > > But all keys should be looked up. It is very similar overhead to rebuilding > hash table. See my suggestion above - when we know the keys are shared, we can skip the lookup, and there are ways we can detect that they're shared. (Perhaps it is also faster to start by assuming they are shared and test each one, rather than assuming they are unshared? That might be worth testing.) Cheers, Steve From songofacandy at gmail.com Tue Apr 23 11:53:10 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Wed, 24 Apr 2019 00:53:10 +0900 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: <8939429a-cc81-6cc5-c4a7-2f0a5ca01ac6@python.org> References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> <5feb2cd6-8075-3589-e091-a79ce27cc25f@g.nevcal.com> <8939429a-cc81-6cc5-c4a7-2f0a5ca01ac6@python.org> Message-ID: On Wed, Apr 24, 2019 at 12:28 AM Steve Dower wrote: > > > > > But if the original dictionary wasn't created with shared keys... the > > copy can't share them either. Or are you suggesting adding new code to > > create a shared key dictionary from one that isn't? > > This is a good point. Maybe dict.fromkeys() could do it? Or a > sys.intern-like function (which is why I brought up that precedent). The > point is to make it an optional benefit rather than strict > language/library semantics. > Then, why not support values when creating key sharing dict? That's one form of my proposal :) -- Inada Naoki From songofacandy at gmail.com Tue Apr 23 12:29:39 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Wed, 24 Apr 2019 01:29:39 +0900 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: <20190423001930.GL3010@ando.pearwood.info> <1514bfce-6aed-5b88-bbb6-78a9cc261282@g.nevcal.com> <25ca35f9-ae2b-ed4e-cb07-8d8fa87f6dbf@python.org> <31b4e1aa-ef17-6ece-5698-acb1aee9fb35@python.org> Message-ID: On Wed, Apr 24, 2019 at 12:34 AM Steve Dower wrote: > > >> If it's a key-sharing dict, then all the keys are strings. We know that > >> when we go to do the update, so we can intern all the strings (going to > >> do that anyway) and then it's a quick check if it already exists. If > >> it's a regular dict, then we calculate hashes as normal. Updating the > >> value is just a decref, incref and assignment. > > > > There are some problem. > > > > 1. Searching hash table is not zero-cost, comparing to appending to sequence. > > This cost is very near to building new hash tables. > > If we know that you're sharing keys with the new items then we can skip > the search. 
This was my point about the d2 = copy(d1); d2.update(zip(d2, > values)) idea: > OK, I got it. But note that zip object doesn't expose items, neither Python level or C level. > > 2. In my proposal, main user is csv.DictReader or sql.DictCursor. > > They parse only values on each rows. So they need to use map. > > In that case, use a private helper. _csv already has a native module. We > don't need to add new public APIs for internal optimisations, provided > there is a semantically equivalent way to do it without the internal API. csv is stdlib. But there are some third party extensions similar to csv. > > > 3. (CPython only) dict.copy(), dict(dict), and dict.update() are general purpose > > methods. There is no obvious place to start using key-sharing dict. > > See my reply to Glenn, but potentially fromkeys() could start with the > key-sharing dict and then copy()/dict() could continue sharing it > (hopefully they already do?). Key-sharing dict is used only for instance dict at the moment. 2nd argument of dict.fromkeys() is value, not values. How about adding dict.fromkeyvalues(keys, values)? When keys is dict, it's behavior is same to my first proposal (`dict.with_values(d1, values)`). > > > > If *CPython* specialized dict(zip(dict, values)), it still be CPython > > implementation detail. > > Do you want recommend using such CPython hacky optimization? > > Should we use such optimization in stdlib, even if it will be slower > > than dict(zip(keys_tuple, values)) on some other Python implementations? > > We do "hacky" optimisations everywhere :) The point of the runtime is to > let users write code that works and we do the effort behind the scenes > to make it efficient. We're not C - we're here to help our users. But we avoid CPython-only hack which will make stdlib slower on other Python implementations as possible. For example, we optimize `s1 += s` loop. But we use `''.join(list_of_str)` instead of it. > > The point is that it will work on other implementations - including > previous versions of CPython - and those are free to optimise it however > they like. > > > Or do you propose making dict(zip(dict, values)) optimization as > > language specification? > > Definitely not! It's just a pattern that we have the ability to > recognize and optimize at runtime, so why not do it? Why we need to recommend patterns fast only in CPython? d2 = dict.fromkeys(keys_dict) # make key sharing dict, only in CPython 3.8+ d2.update(zip(d2, row)) # update values without key lookup, only in CPython 3.8+ Obviously, this may be much slower than `d2 = dict(zip(keys_tuple, row))` on current CPython and other Python implementations. Note that this pattern will be used when dict creation is bottleneck. If we has specialized API, libraries can use it if the API is available, and use dict(zip(keys, row)) otherwise. > > > One obvious advantage of having DictBuilder is it is for specific > > purpose. It has at least same performance to dict(zip(keys, values)) > > on all Python implementations. > > Libraries like csv parser can use it without worrying about its performance > > on Python other than CPython. > > A singular purpose isn't necessarily an obvious advantage. We're better > off with generic building blocks that our users can compose in ways that > were originally non-obvious (and then as patterns emerge we can look at > ways to simplify or formalise them). In generic building blocks, we can not know user will create massive dicts with same keys or just creating one copy. 
We need to guess, and the guess may be wrong. Regards, -- Inada Naoki From njs at pobox.com Tue Apr 23 12:54:20 2019 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 23 Apr 2019 09:54:20 -0700 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: On Tue, Apr 23, 2019, 05:09 Andrew Svetlov wrote: > I agree that `from typing import TYPE_CHECKING` is not desirable from > the import time reduction perspective. > > From my understanding code completion *can* be based on type hinting > to avoid actual code execution. > That's why I've mentioned that typeshed already has the correct type > information. > > if TYPE_CHECKING: > import ... > > requires mypy modification. > > if False: > import ... > > Works right now for stdlib (mypy ignores stdlib code but uses typeshed > anyway) but looks a little cryptic. > Requires a comprehensive comment at least. > Last time I looked at this, I'm pretty sure `if False` broke at least one popular static analysis tool (ie it was clever enough to ignore everything inside `if False`) ? I think either pylint or jedi? I'd suggest checking any clever hacks against at least: mypy, pylint/astroid, jedi, pyflakes, and pycharm. They all have their own static analysis engines, and each one has its own idiosyncratic quirks. We've struggled with this a *lot* in trio, and eventually ended up giving up on all forms of dynamic export cleverness; we've even banned the use of __all__ entirely. Static analysis has gotten good enough that users won't accept it not working, but it hasn't gotten good enough to handle anything but the simplest static exports in a reliable way: https://github.com/python-trio/trio/pull/316 https://github.com/python-trio/trio/issues/542 The stdlib has more leeway because when tools don't work on the stdlib then they tend to eventually add workarounds. I'm just saying, think twice before diving into clever hacks to workaround static analysis limits, and if you're going to do it then be careful to be thorough. You're basically relying on undocumented bugs, and it gets really messy really quickly. -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Tue Apr 23 13:05:18 2019 From: guido at python.org (Guido van Rossum) Date: Tue, 23 Apr 2019 10:05:18 -0700 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: In any case I think this should be filed (by the OP) as an issue against JetBrains' PyCharm issue tracker. Who knows they may be able to special-case this in a jiffy. I don't think we should add any clever hacks to the stdlib for this. On Tue, Apr 23, 2019 at 9:59 AM Nathaniel Smith wrote: > On Tue, Apr 23, 2019, 05:09 Andrew Svetlov > wrote: > >> I agree that `from typing import TYPE_CHECKING` is not desirable from >> the import time reduction perspective. >> >> From my understanding code completion *can* be based on type hinting >> to avoid actual code execution. >> That's why I've mentioned that typeshed already has the correct type >> information. >> >> if TYPE_CHECKING: >> import ... >> >> requires mypy modification. >> >> if False: >> import ... >> >> Works right now for stdlib (mypy ignores stdlib code but uses typeshed >> anyway) but looks a little cryptic. >> Requires a comprehensive comment at least. >> > > Last time I looked at this, I'm pretty sure `if False` broke at least one > popular static analysis tool (ie it was clever enough to ignore everything > inside `if False`) ? 
I think either pylint or jedi? > > I'd suggest checking any clever hacks against at least: mypy, > pylint/astroid, jedi, pyflakes, and pycharm. They all have their own static > analysis engines, and each one has its own idiosyncratic quirks. > > We've struggled with this a *lot* in trio, and eventually ended up giving > up on all forms of dynamic export cleverness; we've even banned the use of > __all__ entirely. Static analysis has gotten good enough that users won't > accept it not working, but it hasn't gotten good enough to handle anything > but the simplest static exports in a reliable way: > https://github.com/python-trio/trio/pull/316 > https://github.com/python-trio/trio/issues/542 > > The stdlib has more leeway because when tools don't work on the stdlib > then they tend to eventually add workarounds. I'm just saying, think twice > before diving into clever hacks to workaround static analysis limits, and > if you're going to do it then be careful to be thorough. You're basically > relying on undocumented bugs, and it gets really messy really quickly. > > -n > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (python.org/~guido) *Pronouns: he/him/his **(why is my pronoun here?)* -------------- next part -------------- An HTML attachment was scrubbed... URL: From storchaka at gmail.com Tue Apr 23 13:48:15 2019 From: storchaka at gmail.com (Serhiy Storchaka) Date: Tue, 23 Apr 2019 20:48:15 +0300 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: Message-ID: 12.04.19 19:17, Inada Naoki ????: > Maybe, collections.DictBuilder may be another option. e.g. > >>>> from collections import DictBuilder >>>> builder = DictBuilder(tuple("abc")) >>>> builder.build(range(3)) > {"a": 0, "b": 1, "c": 2} Nitpicking: this is rather a factory than a builder. The difference between the patterns is that you create a new builder object for every dict: builder = DictBuilder() builder['a'] = 0 builder['b'] = 1 builder['c'] = 2 result = builder.build() and create a fabric only for the whole class of dicts: factory = DictFactory(tuple("abc")) # only once ... result = factory(range(3)) I like the idea of a factory object more than the idea of the dict method. From ikamenshchikov at gmail.com Tue Apr 23 15:32:22 2019 From: ikamenshchikov at gmail.com (Ilya Kamenshchikov) Date: Tue, 23 Apr 2019 21:32:22 +0200 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: How would we answer the same question if it was not a part of stdlib? I am not sure it is fair to expect of Pycharm to parse / execute the __getattr__ on modules, as more elaborate implementation could even contain different types per some condition at the runtime or anything at all. The code: TYPE_CHECKING = False if TYPE_CHECKING: from .process import ProcessPoolExecutor from .thread import ThreadPoolExecutor works for type checking in PyCharm and is fast. This is how stdlib can be an example to how side libraries can be implemented. If we can agree that this is the only clear, performant and sufficient code - then perhaps modifying mypy is a reasonable price to pay. Perhaps this particular case can be just patched locally by PyCharm /JetBrains, but what is a general solution to this class of problems? 
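For reference, a minimal pure-Python sketch of the "factory" shape Serhiy describes above; DictFactory is purely an illustration, not a proposed stdlib API, and it has none of the key-sharing or speed benefits a C implementation would aim for:

    class DictFactory:
        def __init__(self, keys):
            self._keys = tuple(keys)

        def __call__(self, values):
            values = tuple(values)
            if len(values) != len(self._keys):
                raise ValueError("number of values does not match number of keys")
            return dict(zip(self._keys, values))

    factory = DictFactory(tuple("abc"))   # created once per record layout
    row = factory(range(3))               # {'a': 0, 'b': 1, 'c': 2}

The builder variant would instead be instantiated once per resulting dict, which is the distinction drawn above.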
Best Regards, -- Ilya Kamenshchikov On Tue, Apr 23, 2019 at 7:05 PM Guido van Rossum wrote: > In any case I think this should be filed (by the OP) as an issue against > JetBrains' PyCharm issue tracker. Who knows they may be able to > special-case this in a jiffy. I don't think we should add any clever hacks > to the stdlib for this. > > On Tue, Apr 23, 2019 at 9:59 AM Nathaniel Smith wrote: > >> On Tue, Apr 23, 2019, 05:09 Andrew Svetlov >> wrote: >> >>> I agree that `from typing import TYPE_CHECKING` is not desirable from >>> the import time reduction perspective. >>> >>> From my understanding code completion *can* be based on type hinting >>> to avoid actual code execution. >>> That's why I've mentioned that typeshed already has the correct type >>> information. >>> >>> if TYPE_CHECKING: >>> import ... >>> >>> requires mypy modification. >>> >>> if False: >>> import ... >>> >>> Works right now for stdlib (mypy ignores stdlib code but uses typeshed >>> anyway) but looks a little cryptic. >>> Requires a comprehensive comment at least. >>> >> >> Last time I looked at this, I'm pretty sure `if False` broke at least one >> popular static analysis tool (ie it was clever enough to ignore everything >> inside `if False`) ? I think either pylint or jedi? >> >> I'd suggest checking any clever hacks against at least: mypy, >> pylint/astroid, jedi, pyflakes, and pycharm. They all have their own static >> analysis engines, and each one has its own idiosyncratic quirks. >> >> We've struggled with this a *lot* in trio, and eventually ended up giving >> up on all forms of dynamic export cleverness; we've even banned the use of >> __all__ entirely. Static analysis has gotten good enough that users won't >> accept it not working, but it hasn't gotten good enough to handle anything >> but the simplest static exports in a reliable way: >> https://github.com/python-trio/trio/pull/316 >> https://github.com/python-trio/trio/issues/542 >> >> The stdlib has more leeway because when tools don't work on the stdlib >> then they tend to eventually add workarounds. I'm just saying, think twice >> before diving into clever hacks to workaround static analysis limits, and >> if you're going to do it then be careful to be thorough. You're basically >> relying on undocumented bugs, and it gets really messy really quickly. >> >> -n >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: >> https://mail.python.org/mailman/options/python-dev/guido%40python.org >> > > > -- > --Guido van Rossum (python.org/~guido) > *Pronouns: he/him/his **(why is my pronoun here?)* > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Tue Apr 23 15:43:54 2019 From: guido at python.org (Guido van Rossum) Date: Tue, 23 Apr 2019 12:43:54 -0700 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: The general solution is import typing if typing.TYPE_CHECKING: The hack of starting with TYPE_CHECKING = False happens to work but is not endorsed by PEP 484 so is not guaranteed for the future. Note that 3rd party code is rarely in such a critical part for script startup that the cost of `import typing` is too much. 
But the stdlib often *is* in the critical path for script startup, and some consider the time spent in that import too much (startup time should be in the order of tens of msec so every msec counts -- but once you start importing 3rd party code you basically can't make it that fast regardless). Anyway, the stdlib should almost never be used as an example for non-stdlib code -- there are many reasons for this that I don't want to have to repeat here. On Tue, Apr 23, 2019 at 12:33 PM Ilya Kamenshchikov < ikamenshchikov at gmail.com> wrote: > How would we answer the same question if it was not a part of stdlib? > I am not sure it is fair to expect of Pycharm to parse / execute the > __getattr__ on modules, as more elaborate implementation could even contain > different types per some condition at the runtime or anything at all. > The code: > > TYPE_CHECKING = False > if TYPE_CHECKING: > from .process import ProcessPoolExecutor > from .thread import ThreadPoolExecutor > > works for type checking in PyCharm and is fast. > > This is how stdlib can be an example to how side libraries can be implemented. If we can agree that this is the only clear, performant and sufficient code - then perhaps modifying mypy is a reasonable price to pay. > > Perhaps this particular case can be just patched locally by PyCharm > /JetBrains, but what is a general solution to this class of problems? > > Best Regards, > -- > Ilya Kamenshchikov > > > On Tue, Apr 23, 2019 at 7:05 PM Guido van Rossum wrote: > >> In any case I think this should be filed (by the OP) as an issue against >> JetBrains' PyCharm issue tracker. Who knows they may be able to >> special-case this in a jiffy. I don't think we should add any clever hacks >> to the stdlib for this. >> >> On Tue, Apr 23, 2019 at 9:59 AM Nathaniel Smith wrote: >> >>> On Tue, Apr 23, 2019, 05:09 Andrew Svetlov >>> wrote: >>> >>>> I agree that `from typing import TYPE_CHECKING` is not desirable from >>>> the import time reduction perspective. >>>> >>>> From my understanding code completion *can* be based on type hinting >>>> to avoid actual code execution. >>>> That's why I've mentioned that typeshed already has the correct type >>>> information. >>>> >>>> if TYPE_CHECKING: >>>> import ... >>>> >>>> requires mypy modification. >>>> >>>> if False: >>>> import ... >>>> >>>> Works right now for stdlib (mypy ignores stdlib code but uses typeshed >>>> anyway) but looks a little cryptic. >>>> Requires a comprehensive comment at least. >>>> >>> >>> Last time I looked at this, I'm pretty sure `if False` broke at least >>> one popular static analysis tool (ie it was clever enough to ignore >>> everything inside `if False`) ? I think either pylint or jedi? >>> >>> I'd suggest checking any clever hacks against at least: mypy, >>> pylint/astroid, jedi, pyflakes, and pycharm. They all have their own static >>> analysis engines, and each one has its own idiosyncratic quirks. >>> >>> We've struggled with this a *lot* in trio, and eventually ended up >>> giving up on all forms of dynamic export cleverness; we've even banned the >>> use of __all__ entirely. Static analysis has gotten good enough that users >>> won't accept it not working, but it hasn't gotten good enough to handle >>> anything but the simplest static exports in a reliable way: >>> https://github.com/python-trio/trio/pull/316 >>> https://github.com/python-trio/trio/issues/542 >>> >>> The stdlib has more leeway because when tools don't work on the stdlib >>> then they tend to eventually add workarounds. 
I'm just saying, think twice >>> before diving into clever hacks to workaround static analysis limits, and >>> if you're going to do it then be careful to be thorough. You're basically >>> relying on undocumented bugs, and it gets really messy really quickly. >>> >>> -n >>> _______________________________________________ >>> Python-Dev mailing list >>> Python-Dev at python.org >>> https://mail.python.org/mailman/listinfo/python-dev >>> Unsubscribe: >>> https://mail.python.org/mailman/options/python-dev/guido%40python.org >>> >> >> >> -- >> --Guido van Rossum (python.org/~guido) >> *Pronouns: he/him/his **(why is my pronoun here?)* >> >> > -- --Guido van Rossum (python.org/~guido) *Pronouns: he/him/his **(why is my pronoun here?)* -------------- next part -------------- An HTML attachment was scrubbed... URL: From levkivskyi at gmail.com Tue Apr 23 15:44:14 2019 From: levkivskyi at gmail.com (Ivan Levkivskyi) Date: Tue, 23 Apr 2019 20:44:14 +0100 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: Mypy doesn't use source code of stlib for analysis and instead uses stub files from typeshed. IIUC PyCharm can also do that (i.e. use the typeshed stubs). The whole idea of typeshed is to avoid changing stlib solely for the sake of static analysis. Please open an issue on typeshed an/or PyCharm tracker. -- Ivan On Tue, 23 Apr 2019 at 20:38, Ilya Kamenshchikov wrote: > How would we answer the same question if it was not a part of stdlib? > I am not sure it is fair to expect of Pycharm to parse / execute the > __getattr__ on modules, as more elaborate implementation could even contain > different types per some condition at the runtime or anything at all. > The code: > > TYPE_CHECKING = False > if TYPE_CHECKING: > from .process import ProcessPoolExecutor > from .thread import ThreadPoolExecutor > > works for type checking in PyCharm and is fast. > > This is how stdlib can be an example to how side libraries can be implemented. If we can agree that this is the only clear, performant and sufficient code - then perhaps modifying mypy is a reasonable price to pay. > > Perhaps this particular case can be just patched locally by PyCharm > /JetBrains, but what is a general solution to this class of problems? > > Best Regards, > -- > Ilya Kamenshchikov > > > On Tue, Apr 23, 2019 at 7:05 PM Guido van Rossum wrote: > >> In any case I think this should be filed (by the OP) as an issue against >> JetBrains' PyCharm issue tracker. Who knows they may be able to >> special-case this in a jiffy. I don't think we should add any clever hacks >> to the stdlib for this. >> >> On Tue, Apr 23, 2019 at 9:59 AM Nathaniel Smith wrote: >> >>> On Tue, Apr 23, 2019, 05:09 Andrew Svetlov >>> wrote: >>> >>>> I agree that `from typing import TYPE_CHECKING` is not desirable from >>>> the import time reduction perspective. >>>> >>>> From my understanding code completion *can* be based on type hinting >>>> to avoid actual code execution. >>>> That's why I've mentioned that typeshed already has the correct type >>>> information. >>>> >>>> if TYPE_CHECKING: >>>> import ... >>>> >>>> requires mypy modification. >>>> >>>> if False: >>>> import ... >>>> >>>> Works right now for stdlib (mypy ignores stdlib code but uses typeshed >>>> anyway) but looks a little cryptic. >>>> Requires a comprehensive comment at least. 
>>>> >>> >>> Last time I looked at this, I'm pretty sure `if False` broke at least >>> one popular static analysis tool (ie it was clever enough to ignore >>> everything inside `if False`) ? I think either pylint or jedi? >>> >>> I'd suggest checking any clever hacks against at least: mypy, >>> pylint/astroid, jedi, pyflakes, and pycharm. They all have their own static >>> analysis engines, and each one has its own idiosyncratic quirks. >>> >>> We've struggled with this a *lot* in trio, and eventually ended up >>> giving up on all forms of dynamic export cleverness; we've even banned the >>> use of __all__ entirely. Static analysis has gotten good enough that users >>> won't accept it not working, but it hasn't gotten good enough to handle >>> anything but the simplest static exports in a reliable way: >>> https://github.com/python-trio/trio/pull/316 >>> https://github.com/python-trio/trio/issues/542 >>> >>> The stdlib has more leeway because when tools don't work on the stdlib >>> then they tend to eventually add workarounds. I'm just saying, think twice >>> before diving into clever hacks to workaround static analysis limits, and >>> if you're going to do it then be careful to be thorough. You're basically >>> relying on undocumented bugs, and it gets really messy really quickly. >>> >>> -n >>> _______________________________________________ >>> Python-Dev mailing list >>> Python-Dev at python.org >>> https://mail.python.org/mailman/listinfo/python-dev >>> Unsubscribe: >>> https://mail.python.org/mailman/options/python-dev/guido%40python.org >>> >> >> >> -- >> --Guido van Rossum (python.org/~guido) >> *Pronouns: he/him/his **(why is my pronoun here?)* >> >> > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/levkivskyi%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ikamenshchikov at gmail.com Tue Apr 23 15:58:02 2019 From: ikamenshchikov at gmail.com (Ilya Kamenshchikov) Date: Tue, 23 Apr 2019 21:58:02 +0200 Subject: [Python-Dev] Concurrent.futures: no type discovery for PyCharm In-Reply-To: References: Message-ID: Ok thanks for explaining. I will proceed by trying it with typeshed. Best Regards, -- Ilya Kamenshchikov On Tue, Apr 23, 2019 at 9:44 PM Ivan Levkivskyi wrote: > Mypy doesn't use source code of stlib for analysis and instead uses stub > files from typeshed. IIUC PyCharm can also do that (i.e. use the typeshed > stubs). > The whole idea of typeshed is to avoid changing stlib solely for the sake > of static analysis. Please open an issue on typeshed an/or PyCharm tracker. > > -- > Ivan > > > > On Tue, 23 Apr 2019 at 20:38, Ilya Kamenshchikov > wrote: > >> How would we answer the same question if it was not a part of stdlib? >> I am not sure it is fair to expect of Pycharm to parse / execute the >> __getattr__ on modules, as more elaborate implementation could even contain >> different types per some condition at the runtime or anything at all. >> The code: >> >> TYPE_CHECKING = False >> if TYPE_CHECKING: >> from .process import ProcessPoolExecutor >> from .thread import ThreadPoolExecutor >> >> works for type checking in PyCharm and is fast. >> >> This is how stdlib can be an example to how side libraries can be implemented. If we can agree that this is the only clear, performant and sufficient code - then perhaps modifying mypy is a reasonable price to pay. 
>> >> Perhaps this particular case can be just patched locally by PyCharm >> /JetBrains, but what is a general solution to this class of problems? >> >> Best Regards, >> -- >> Ilya Kamenshchikov >> >> >> On Tue, Apr 23, 2019 at 7:05 PM Guido van Rossum >> wrote: >> >>> In any case I think this should be filed (by the OP) as an issue against >>> JetBrains' PyCharm issue tracker. Who knows they may be able to >>> special-case this in a jiffy. I don't think we should add any clever hacks >>> to the stdlib for this. >>> >>> On Tue, Apr 23, 2019 at 9:59 AM Nathaniel Smith wrote: >>> >>>> On Tue, Apr 23, 2019, 05:09 Andrew Svetlov >>>> wrote: >>>> >>>>> I agree that `from typing import TYPE_CHECKING` is not desirable from >>>>> the import time reduction perspective. >>>>> >>>>> From my understanding code completion *can* be based on type hinting >>>>> to avoid actual code execution. >>>>> That's why I've mentioned that typeshed already has the correct type >>>>> information. >>>>> >>>>> if TYPE_CHECKING: >>>>> import ... >>>>> >>>>> requires mypy modification. >>>>> >>>>> if False: >>>>> import ... >>>>> >>>>> Works right now for stdlib (mypy ignores stdlib code but uses typeshed >>>>> anyway) but looks a little cryptic. >>>>> Requires a comprehensive comment at least. >>>>> >>>> >>>> Last time I looked at this, I'm pretty sure `if False` broke at least >>>> one popular static analysis tool (ie it was clever enough to ignore >>>> everything inside `if False`) ? I think either pylint or jedi? >>>> >>>> I'd suggest checking any clever hacks against at least: mypy, >>>> pylint/astroid, jedi, pyflakes, and pycharm. They all have their own static >>>> analysis engines, and each one has its own idiosyncratic quirks. >>>> >>>> We've struggled with this a *lot* in trio, and eventually ended up >>>> giving up on all forms of dynamic export cleverness; we've even banned the >>>> use of __all__ entirely. Static analysis has gotten good enough that users >>>> won't accept it not working, but it hasn't gotten good enough to handle >>>> anything but the simplest static exports in a reliable way: >>>> https://github.com/python-trio/trio/pull/316 >>>> https://github.com/python-trio/trio/issues/542 >>>> >>>> The stdlib has more leeway because when tools don't work on the stdlib >>>> then they tend to eventually add workarounds. I'm just saying, think twice >>>> before diving into clever hacks to workaround static analysis limits, and >>>> if you're going to do it then be careful to be thorough. You're basically >>>> relying on undocumented bugs, and it gets really messy really quickly. >>>> >>>> -n >>>> _______________________________________________ >>>> Python-Dev mailing list >>>> Python-Dev at python.org >>>> https://mail.python.org/mailman/listinfo/python-dev >>>> Unsubscribe: >>>> https://mail.python.org/mailman/options/python-dev/guido%40python.org >>>> >>> >>> >>> -- >>> --Guido van Rossum (python.org/~guido) >>> *Pronouns: he/him/his **(why is my pronoun here?)* >>> >>> >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: >> https://mail.python.org/mailman/options/python-dev/levkivskyi%40gmail.com >> > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From mark at hotpy.org Tue Apr 23 17:17:02 2019 From: mark at hotpy.org (Mark Shannon) Date: Tue, 23 Apr 2019 22:17:02 +0100 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: Message-ID: Hi, On 12/04/2019 2:44 pm, Inada Naoki wrote: > Hi, all. > > I propose adding new method: dict.with_values(iterable) You can already do something like this, if memory saving is the main concern. This should work on all versions from 3.3. def shared_keys_dict_maker(keys): class C: pass instance = C() for key in keys: for key in keys: setattr(instance, key, None) prototype = instance.__dict__ def maker(values): result = prototype.copy() result.update(zip(keys, values)) return result return maker m = shared_keys_dict_maker(('a', 'b')) >>> d1 = {'a':1, 'b':2} >>> print(sys.getsizeof(d1)) ... 248 >>> d2 = m((1,2)) >>> print(sys.getsizeof(d2)) ... 120 >>> d3 = m((None,"Hi")) >>> print(sys.getsizeof(d3)) ... 120 > > # Motivation > > Python is used to handle data. > While dict is not efficient way to handle may records, it is still > convenient way. > > When creating many dicts with same keys, dict need to > lookup internal hash table while inserting each keys. > > It is costful operation. If we can reuse existing keys of dict, > we can skip this inserting cost. > > Additionally, we have "Key-Sharing Dictionary (PEP 412)". > When all keys are string, many dict can share one key. > It reduces memory consumption. > > This might be usable for: > > * csv.DictReader > * namedtuple._asdict() > * DB-API 2.0 implementations: (e.g. DictCursor of mysqlclient-python) > > > # Draft implementation > > pull request: https://github.com/python/cpython/pull/12802 > > with_values(self, iterable, /) > Create a new dictionary with keys from this dict and values from iterable. > > When length of iterable is different from len(self), ValueError is raised. > This method does not support dict subclass. > > > ## Memory usage (Key-Sharing dict) > >>>> import sys >>>> keys = tuple("abcdefg") >>>> keys > ('a', 'b', 'c', 'd', 'e', 'f', 'g') >>>> d = dict(zip(keys, range(7))) >>>> d > {'a': 0, 'b': 1, 'c': 2, 'd': 3, 'e': 4, 'f': 5, 'g': 6} >>>> sys.getsizeof(d) > 360 > >>>> keys = dict.fromkeys("abcdefg") >>>> d = keys.with_values(range(7)) >>>> d > {'a': 0, 'b': 1, 'c': 2, 'd': 3, 'e': 4, 'f': 5, 'g': 6} >>>> sys.getsizeof(d) > 144 > > ## Speed > > $ ./python -m perf timeit -o zip_dict.json -s 'keys = > tuple("abcdefg"); values=[*range(7)]' 'dict(zip(keys, values))' > > $ ./python -m perf timeit -o with_values.json -s 'keys = > dict.fromkeys("abcdefg"); values=[*range(7)]' > 'keys.with_values(values)' > > $ ./python -m perf compare_to zip_dict.json with_values.json > Mean +- std dev: [zip_dict] 935 ns +- 9 ns -> [with_values] 109 ns +- > 2 ns: 8.59x faster (-88%) > > > How do you think? > Any comments are appreciated. > > Regards, > From vstinner at redhat.com Tue Apr 23 19:44:17 2019 From: vstinner at redhat.com (Victor Stinner) Date: Wed, 24 Apr 2019 01:44:17 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode Message-ID: Hi, Two weeks ago, I started a thread "No longer enable Py_TRACE_REFS by default in debug build", but I lost myself in details, I forgot the main purpose of my proposal... Let me retry from scratch with a more explicit title: I would like to be able to run C extensions compiled in release mode on a Python compiled in debug mode ("pydebug"). 
The use case is to debug bugs in C extensions thanks to additional runtime checks of a Python debug build, and more generally get a better debugging experiences on Python. Even for pure Python, a debug build is useful (to get the Pyhon traceback in gdb using "py-bt" command). Currently, using a Python compiled in debug mode means to have to recompile C extensions in debug mode. Compile a C extension requires a C compiler, header files, pull dependencies, etc. It can be very complicated in practical (and pollute your system with all these additional dependencies). On Linux, it's already hard, but on Windows it can be even harder. Just one concrete example: no debug build of numpy is provided at https://pypi.org/project/numpy/ Good luck to build numpy in debug mode manually (install OpenBLAS, ATLAS, Fortran compiler, Cython, etc.) :-) -- The first requirement for the use case is that a Python debug build supports the ABI of a release build. The current blocker issue is that the Py_DEBUG define imply the Py_TRACE_REFS define: PyObject gets 2 extra fields (_ob_prev and _ob_next) which change the offset of all attributes of all objects and makes the ABI completely incompatible. I propose to no longer imply Py_TRACE_REFS *by default* (but keep the code): https://bugs.python.org/issue36465 https://github.com/python/cpython/pull/12615 (Py_TRACE_REFS would be a different ABI.) The second issue is that library filenames are different for a debug build: SOABI gets an additional "d" flag for Py_DEBUG. A debug build should first look for "NAME.cpython-38dm.so" (flags: "dm"), but then also look for "NAME.cpython-38m.so" (flags: "m"). The opposite is not possible: a debug build contains many additional functions missing from a release build. For Windows, maybe we should provide a Python compiled in debug mode with the same C Runtime than a Python compiled in release mode. Otherwise, the debug C Runtime is causing another ABI issue. Maybe pip could be enhanced to support installing C extensions compiled in release mode when using a debug mode. But that's more for convenience, it's not really required, since it is easy to switch the Python runtime between release and debug build. Apart of Py_TRACE_REFS, I'm not aware of other ABI differences in structures. I know that the COUNT_ALLOCS define changes the ABI, but it's not implied by Py_DEBUG: you have to opt-in for COUNT_ALLOCS. (I propose to do the same for Py_TRACE_REFS ;-)) Note: Refleaks buildbots don't use Py_TRACE_REFS to track memory leaks, only sys.gettotalrefcount(). -- Python debug build has many benefit. If you ignore C extensions, the debug build is usually compiled with compiler optimization disabled which makes debugging in gdb a much better experience. If you never tried: on a release build, most (if not all) variables are "" and it's really painful to basic debug functions like displaying the current Python frame. Assertions are removed in release modes, whereas they can detect a wide range of bugs way earlier: integer overflow, buffer under- and overflow, exceptions ignored silently, etc. Nobody likes to see a bug for the first time in production. For example, I modified Python 3.8 to now logs I/O errors when a file is closed implicitly, but only in debug or development mode. In release Python silently ignored EBADF error on such case, whereas it can lead to very nasty bugs causing Python to call abort() (which creates a coredump on Linux): see https://bugs.python.org/issue18748 ... 
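For readers who want to check which kind of build they are running, a few lines are enough today (the values shown are typical of a Unix build; Windows reports them differently):

    import sys, sysconfig

    print(hasattr(sys, "gettotalrefcount"))      # True only on a --with-pydebug build
    print(sysconfig.get_config_var("Py_DEBUG"))  # 1 on a debug build, 0 on a release build
    print(sysconfig.get_config_var("ABIFLAGS"))  # e.g. 'dm' (debug) vs 'm' (release) on 3.7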
DeprecationWarning and ResourceWarning are shown by default in debug mode :-) There are too many different additional checks done at runtime: I cannot list them all here. -- Being able to switch between Python in release mode and Python in debug mode is a first step. My long term plan would be to better separate "Python" from its "runtime". CPython in release mode would be one runtime, CPython in debug mode would be another runtime, PyPy can seeen as another runtime, etc. The more general idea is: "compile your C extension once and use any Python runtime". https://pythoncapi.readthedocs.io/runtimes.html#runtimes If you opt-in for the stable ABI, you can already switch between runtimes of different Python versions (ex: Python 3.6 or Python 3.8). Victor -- Night gathers, and now my watch begins. It shall not end until my death. From vstinner at redhat.com Tue Apr 23 20:09:27 2019 From: vstinner at redhat.com (Victor Stinner) Date: Wed, 24 Apr 2019 02:09:27 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: Message-ID: Le mer. 24 avr. 2019 ? 01:44, Victor Stinner a ?crit : > The first requirement for the use case is that a Python debug build > supports the ABI of a release build. (...) I > propose to no longer imply Py_TRACE_REFS (...) > > Apart of Py_TRACE_REFS, I'm not aware of other ABI differences in > structures. (...) I tested manually: just by disabling Py_TRACE_REFS, the release ABI looks *fully* compatible with a debug build! I modified Python 3.7 to disable Py_TRACE_REFS and to omit "d" from SOABI when build in debug mode. I built Python in debug mode. I ran tests on numpy and lxml.etree: I can use .so libraries from /usr/lib64/python3.7/site-packages (compiled in release mode), it just works! I was very surprised of not getting any crash on such non trivial C extension, so I checked manually that I was running a debug build: yes, sys.gettotalrefcount is present :-) I also wanted to test an even more complex application: I installed gajim, a Jabber client written in Python 3 with PyGTK. It uses many C extensions. Running Gajim with my debug build is slower, well, that's not a surprise, but it works well! (no crash) About the SOABI, maybe we should only keep "d" when Py_TRACE_REFS is used, since technically, the ABI is same between release and debug mode without Py_TRACE_REFS. In that case, pip doesn't need to be modified ;-) If you also want to try, use: PYTHONPATH=/usr/lib64/python3.7/site-packages:/usr/lib/python3.7/site-packages ./python /usr/bin/gajim On a Python compiled with "./configure --with-pydebug && make" and the following patch: diff --git a/Include/object.h b/Include/object.h index bcf78afe6b..4c807981c4 100644 --- a/Include/object.h +++ b/Include/object.h @@ -51,13 +51,8 @@ A standard interface exists for objects that contain an array of items whose size is determined when the object is allocated. */ -/* Py_DEBUG implies Py_TRACE_REFS. */ -#if defined(Py_DEBUG) && !defined(Py_TRACE_REFS) -#define Py_TRACE_REFS -#endif - -/* Py_TRACE_REFS implies Py_REF_DEBUG. */ -#if defined(Py_TRACE_REFS) && !defined(Py_REF_DEBUG) +/* Py_DEBUG implies Py_REF_DEBUG. 
*/ +#if defined(Py_DEBUG) && !defined(Py_REF_DEBUG) #define Py_REF_DEBUG #endif diff --git a/configure b/configure index 2db11e6e86..7271e9de40 100755 --- a/configure +++ b/configure @@ -6365,7 +6365,6 @@ $as_echo "#define Py_DEBUG 1" >>confdefs.h { $as_echo "$as_me:${as_lineno-$LINENO}: result: yes" >&5 $as_echo "yes" >&6; }; Py_DEBUG='true' - ABIFLAGS="${ABIFLAGS}d" else { $as_echo "$as_me:${as_lineno-$LINENO}: result: no" >&5 $as_echo "no" >&6; }; Py_DEBUG='false' fi diff --git a/configure.ac b/configure.ac index e5fb7e7b0b..fa4bb1944f 100644 --- a/configure.ac +++ b/configure.ac @@ -1246,7 +1246,6 @@ then [Define if you want to build an interpreter with many run-time checks.]) AC_MSG_RESULT(yes); Py_DEBUG='true' - ABIFLAGS="${ABIFLAGS}d" else AC_MSG_RESULT(no); Py_DEBUG='false' fi], [AC_MSG_RESULT(no)]) Victor From vano at mail.mipt.ru Tue Apr 23 20:50:25 2019 From: vano at mail.mipt.ru (Ivan Pozdeev) Date: Wed, 24 Apr 2019 03:50:25 +0300 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: Message-ID: <73aa488a-614a-25a7-df7c-94f1fd285c9f@mail.mipt.ru> On 24.04.2019 2:44, Victor Stinner wrote: > Hi, > > Two weeks ago, I started a thread "No longer enable Py_TRACE_REFS by > default in debug build", but I lost myself in details, I forgot the > main purpose of my proposal... > > Let me retry from scratch with a more explicit title: I would like to > be able to run C extensions compiled in release mode on a Python > compiled in debug mode ("pydebug"). This is going to be impossible because debug Python links against debug C runtime which is binary incompatible with the release one (at least, in Windows). > The use case is to debug bugs in C > extensions thanks to additional runtime checks of a Python debug > build, and more generally get a better debugging experiences on > Python. Even for pure Python, a debug build is useful (to get the > Pyhon traceback in gdb using "py-bt" command). That said, debug vs release extension compilation is currently bugged. It's impossible to make a debug build of an extension against a release Python (linked against release runtime, so not fully debug, just without optimizations) and vice versa. pip fails to build extensions for a debug Python for the same reason. I've no idea how (and if at all) people manage to diagnose problems in extensions. https://bugs.python.org/issue33637 > > Currently, using a Python compiled in debug mode means to have to > recompile C extensions in debug mode. Compile a C extension requires a > C compiler, header files, pull dependencies, etc. It can be very > complicated in practical (and pollute your system with all these > additional dependencies). On Linux, it's already hard, but on Windows > it can be even harder. > > Just one concrete example: no debug build of numpy is provided at > https://pypi.org/project/numpy/ Good luck to build numpy in debug mode > manually (install OpenBLAS, ATLAS, Fortran compiler, Cython, etc.) > :-) The above paragraph is probably the reason ;-) > > -- > > The first requirement for the use case is that a Python debug build > supports the ABI of a release build. The current blocker issue is that > the Py_DEBUG define imply the Py_TRACE_REFS define: PyObject gets 2 > extra fields (_ob_prev and _ob_next) which change the offset of all > attributes of all objects and makes the ABI completely incompatible. 
I > propose to no longer imply Py_TRACE_REFS *by default* (but keep the > code): > > https://bugs.python.org/issue36465 > https://github.com/python/cpython/pull/12615 > > (Py_TRACE_REFS would be a different ABI.) > > The second issue is that library filenames are different for a debug > build: SOABI gets an additional "d" flag for Py_DEBUG. A debug build > should first look for "NAME.cpython-38dm.so" (flags: "dm"), but then > also look for "NAME.cpython-38m.so" (flags: "m"). The opposite is not > possible: a debug build contains many additional functions missing > from a release build. > > For Windows, maybe we should provide a Python compiled in debug mode > with the same C Runtime than a Python compiled in release mode. > Otherwise, the debug C Runtime is causing another ABI issue. > > Maybe pip could be enhanced to support installing C extensions > compiled in release mode when using a debug mode. But that's more for > convenience, it's not really required, since it is easy to switch the > Python runtime between release and debug build. > > Apart of Py_TRACE_REFS, I'm not aware of other ABI differences in > structures. I know that the COUNT_ALLOCS define changes the ABI, but > it's not implied by Py_DEBUG: you have to opt-in for COUNT_ALLOCS. (I > propose to do the same for Py_TRACE_REFS ;-)) > > Note: Refleaks buildbots don't use Py_TRACE_REFS to track memory > leaks, only sys.gettotalrefcount(). > > -- > > Python debug build has many benefit. If you ignore C extensions, the > debug build is usually compiled with compiler optimization disabled > which makes debugging in gdb a much better experience. If you never > tried: on a release build, most (if not all) variables are " out>" and it's really painful to basic debug functions like displaying > the current Python frame. > > Assertions are removed in release modes, whereas they can detect a > wide range of bugs way earlier: integer overflow, buffer under- and > overflow, exceptions ignored silently, etc. Nobody likes to see a bug > for the first time in production. For example, I modified Python 3.8 > to now logs I/O errors when a file is closed implicitly, but only in > debug or development mode. In release Python silently ignored EBADF > error on such case, whereas it can lead to very nasty bugs causing > Python to call abort() (which creates a coredump on Linux): see > https://bugs.python.org/issue18748 ... > > DeprecationWarning and ResourceWarning are shown by default in debug mode :-) > > There are too many different additional checks done at runtime: I > cannot list them all here. > > -- > > Being able to switch between Python in release mode and Python in > debug mode is a first step. My long term plan would be to better > separate "Python" from its "runtime". CPython in release mode would be > one runtime, CPython in debug mode would be another runtime, PyPy can > seeen as another runtime, etc. The more general idea is: "compile your > C extension once and use any Python runtime". > > https://pythoncapi.readthedocs.io/runtimes.html#runtimes > > If you opt-in for the stable ABI, you can already switch between > runtimes of different Python versions (ex: Python 3.6 or Python 3.8). > > Victor > -- > Night gathers, and now my watch begins. It shall not end until my death. 
> _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/vano%40mail.mipt.ru -- Regards, Ivan From vstinner at redhat.com Tue Apr 23 21:04:49 2019 From: vstinner at redhat.com (Victor Stinner) Date: Wed, 24 Apr 2019 03:04:49 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: Message-ID: Le mer. 24 avr. 2019 ? 01:44, Victor Stinner a ?crit : > The current blocker issue is that > the Py_DEBUG define imply the Py_TRACE_REFS define (...): > > https://bugs.python.org/issue36465 > https://github.com/python/cpython/pull/12615 I updated my PR: """ Release build and debug build are now ABI compatible: the Py_DEBUG define no longer implies Py_TRACE_REFS define which introduces the only ABI incompatibility. A new "./configure --with-trace-refs" build option is now required to get Py_TRACE_REFS define which adds sys.getobjects() function and PYTHONDUMPREFS environment variable. Changes: * Add ./configure --with-trace-refs * Py_DEBUG no longer implies Py_TRACE_REFS * The "d" flag of SOABI (sys.implementation.cache_tag) is now only added by --with-trace-refs. It is no longer added by --with-pydebug. """ > Maybe pip could be enhanced to support installing C extensions > compiled in release mode when using a debug mode. In fact, pip doesn't have to be modified. I "fixed" sys.implementation.cache_tag by removing "d" in debug mode instead ;-) By the way, the "m" ABI flag for pymalloc is outdated. I proposed the following change to simply remove it: https://bugs.python.org/issue36707 https://github.com/python/cpython/pull/12931/files With my PR 12931 and my PR 12615, the only remaining ABI flag which be "d" which would only be enabled by ./configure --with-trace-refs, whereas ./configure --with-pydebug has no more effect on SOABI (sys.implementation.cache_tag). Victor -- Night gathers, and now my watch begins. It shall not end until my death. From songofacandy at gmail.com Tue Apr 23 23:13:43 2019 From: songofacandy at gmail.com (Inada Naoki) Date: Wed, 24 Apr 2019 12:13:43 +0900 Subject: [Python-Dev] Proposal: dict.with_values(iterable) In-Reply-To: References: Message-ID: On Wed, Apr 24, 2019 at 6:17 AM Mark Shannon wrote: > > Hi, > > On 12/04/2019 2:44 pm, Inada Naoki wrote: > > Hi, all. > > > > I propose adding new method: dict.with_values(iterable) > > You can already do something like this, if memory saving is the main > concern. This should work on all versions from 3.3. > Of course, performance is main concern too. -- Inada Naoki From J.Demeyer at UGent.be Wed Apr 24 03:24:47 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Wed, 24 Apr 2019 09:24:47 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: Message-ID: <5CC00F3F.9090602@UGent.be> On 2019-04-24 01:44, Victor Stinner wrote: > I would like to > be able to run C extensions compiled in release mode on a Python > compiled in debug mode That seems like a very good idea. I would certainly use the debug mode while developing CPython or C extensions. 
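To make the filename tags in this thread concrete, these are the build variables that change between the two builds; the exact strings below are from a typical Linux CPython 3.7 and vary per platform:

    import sysconfig

    print(sysconfig.get_config_var("SOABI"))
    # release build:  cpython-37m-x86_64-linux-gnu
    # debug build:    cpython-37dm-x86_64-linux-gnu   (the extra 'd' is the ABI flag discussed above)

    print(sysconfig.get_config_var("EXT_SUFFIX"))
    # '.cpython-37m-x86_64-linux-gnu.so' -- the suffix a release extension gets;
    # a debug interpreter currently looks only for the 'd' variant of this name.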
From vano at mail.mipt.ru Wed Apr 24 07:44:35 2019 From: vano at mail.mipt.ru (Ivan Pozdeev) Date: Wed, 24 Apr 2019 14:44:35 +0300 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: <73aa488a-614a-25a7-df7c-94f1fd285c9f@mail.mipt.ru> References: <73aa488a-614a-25a7-df7c-94f1fd285c9f@mail.mipt.ru> Message-ID: <8d9ffb09-2a9a-c07a-357d-e7a35b22909d@mail.mipt.ru> On 24.04.2019 3:50, Ivan Pozdeev via Python-Dev wrote: > On 24.04.2019 2:44, Victor Stinner wrote: >> Hi, >> >> Two weeks ago, I started a thread "No longer enable Py_TRACE_REFS by >> default in debug build", but I lost myself in details, I forgot the >> main purpose of my proposal... >> >> Let me retry from scratch with a more explicit title: I would like to >> be able to run C extensions compiled in release mode on a Python >> compiled in debug mode ("pydebug"). > > This is going to be impossible because debug Python links against debug C runtime which is binary incompatible with the release one (at > least, in Windows). To elaborate: As per https://stackoverflow.com/questions/37541210/whats-the-difference-in-usage-between-shared-libraries-built-in-debug-and-relea/37580323#37580323 , Problems will occur if you have two modules that 1. use different versions or binary representations of a type and 2. exchange objects of that type Now, I trust Victor has ensured no discrepancies in explicitly exchanged types. But I'm not sure if Python and the extension still rely on implicitly sharing some C runtime entities. (In Py2, that would at least be descriptor table that MSVCRT maintains privately but Py3 doesn't rely on it AFAIK). >> The use case is to debug bugs in C >> extensions thanks to additional runtime checks of a Python debug >> build, and more generally get a better debugging experiences on >> Python. Even for pure Python, a debug build is useful (to get the >> Pyhon traceback in gdb using "py-bt" command). > That said, debug vs release extension compilation is currently bugged. It's impossible to make a debug build of an extension against a > release Python (linked against release runtime, so not fully debug, just without optimizations) and vice versa. pip fails to build > extensions for a debug Python for the same reason. I've no idea how (and if at all) people manage to diagnose problems in extensions. > https://bugs.python.org/issue33637 >> >> Currently, using a Python compiled in debug mode means to have to >> recompile C extensions in debug mode. Compile a C extension requires a >> C compiler, header files, pull dependencies, etc. It can be very >> complicated in practical (and pollute your system with all these >> additional dependencies). On Linux, it's already hard, but on Windows >> it can be even harder. >> >> Just one concrete example: no debug build of numpy is provided at >> https://pypi.org/project/numpy/ Good luck to build numpy in debug mode >> manually (install OpenBLAS, ATLAS, Fortran compiler, Cython, etc.) >> :-) > The above paragraph is probably the reason ;-) >> >> -- >> >> The first requirement for the use case is that a Python debug build >> supports the ABI of a release build. The current blocker issue is that >> the Py_DEBUG define imply the Py_TRACE_REFS define: PyObject gets 2 >> extra fields (_ob_prev and _ob_next) which change the offset of all >> attributes of all objects and makes the ABI completely incompatible. 
I >> propose to no longer imply Py_TRACE_REFS *by default* (but keep the >> code): >> >> https://bugs.python.org/issue36465 >> https://github.com/python/cpython/pull/12615 >> >> (Py_TRACE_REFS would be a different ABI.) >> >> The second issue is that library filenames are different for a debug >> build: SOABI gets an additional "d" flag for Py_DEBUG. A debug build >> should first look for "NAME.cpython-38dm.so" (flags: "dm"), but then >> also look for "NAME.cpython-38m.so" (flags: "m"). The opposite is not >> possible: a debug build contains many additional functions missing >> from a release build. >> >> For Windows, maybe we should provide a Python compiled in debug mode >> with the same C Runtime than a Python compiled in release mode. >> Otherwise, the debug C Runtime is causing another ABI issue. >> >> Maybe pip could be enhanced to support installing C extensions >> compiled in release mode when using a debug mode. But that's more for >> convenience, it's not really required, since it is easy to switch the >> Python runtime between release and debug build. >> >> Apart of Py_TRACE_REFS, I'm not aware of other ABI differences in >> structures. I know that the COUNT_ALLOCS define changes the ABI, but >> it's not implied by Py_DEBUG: you have to opt-in for COUNT_ALLOCS. (I >> propose to do the same for Py_TRACE_REFS ;-)) >> >> Note: Refleaks buildbots don't use Py_TRACE_REFS to track memory >> leaks, only sys.gettotalrefcount(). >> >> -- >> >> Python debug build has many benefit. If you ignore C extensions, the >> debug build is usually compiled with compiler optimization disabled >> which makes debugging in gdb a much better experience. If you never >> tried: on a release build, most (if not all) variables are "> out>" and it's really painful to basic debug functions like displaying >> the current Python frame. >> >> Assertions are removed in release modes, whereas they can detect a >> wide range of bugs way earlier: integer overflow, buffer under- and >> overflow, exceptions ignored silently, etc. Nobody likes to see a bug >> for the first time in production. For example, I modified Python 3.8 >> to now logs I/O errors when a file is closed implicitly, but only in >> debug or development mode. In release Python silently ignored EBADF >> error on such case, whereas it can lead to very nasty bugs causing >> Python to call abort() (which creates a coredump on Linux): see >> https://bugs.python.org/issue18748 ... >> >> DeprecationWarning and ResourceWarning are shown by default in debug mode :-) >> >> There are too many different additional checks done at runtime: I >> cannot list them all here. >> >> -- >> >> Being able to switch between Python in release mode and Python in >> debug mode is a first step. My long term plan would be to better >> separate "Python" from its "runtime". CPython in release mode would be >> one runtime, CPython in debug mode would be another runtime, PyPy can >> seeen as another runtime, etc. The more general idea is: "compile your >> C extension once and use any Python runtime". >> >> https://pythoncapi.readthedocs.io/runtimes.html#runtimes >> >> If you opt-in for the stable ABI, you can already switch between >> runtimes of different Python versions (ex: Python 3.6 or Python 3.8). >> >> Victor >> -- >> Night gathers, and now my watch begins. It shall not end until my death. 
>> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: https://mail.python.org/mailman/options/python-dev/vano%40mail.mipt.ru > -- Regards, Ivan From solipsis at pitrou.net Wed Apr 24 10:03:22 2019 From: solipsis at pitrou.net (Antoine Pitrou) Date: Wed, 24 Apr 2019 16:03:22 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode References: Message-ID: <20190424160322.63323ce1@fsol> On Wed, 24 Apr 2019 01:44:17 +0200 Victor Stinner wrote: > > The first requirement for the use case is that a Python debug build > supports the ABI of a release build. The current blocker issue is that > the Py_DEBUG define imply the Py_TRACE_REFS define: PyObject gets 2 > extra fields (_ob_prev and _ob_next) which change the offset of all > attributes of all objects and makes the ABI completely incompatible. I > propose to no longer imply Py_TRACE_REFS *by default* (but keep the > code): > > https://bugs.python.org/issue36465 > https://github.com/python/cpython/pull/12615 +1 from me. > The second issue is that library filenames are different for a debug > build: SOABI gets an additional "d" flag for Py_DEBUG. A debug build > should first look for "NAME.cpython-38dm.so" (flags: "dm"), but then > also look for "NAME.cpython-38m.so" (flags: "m"). Sounds fair (but only on Unix, I guess). > Maybe pip could be enhanced to support installing C extensions > compiled in release mode when using a debug mode. But that's more for > convenience, it's not really required, since it is easy to switch the > Python runtime between release and debug build. Not sure what you mean by "easy to switch the Python runtime". As soon as I want to use pip, I have to use a release build, right? Regards Antoine. From vstinner at redhat.com Wed Apr 24 12:02:18 2019 From: vstinner at redhat.com (Victor Stinner) Date: Wed, 24 Apr 2019 18:02:18 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: Message-ID: Hum, I found issues with libpython: C extensions are explicitly linked to libpython built in release mode. So a debug python loading a C extension may load libpython in release mode, whereas libpython in debug mode is already loaded. When Python is built with --enable-shared, the python3.7 program is linked to libpython3.7m.so.1.0 on Linux. C extensions are explicitly linked to libpython3.7m as well: $ python3.7-config --ldflags ... -lpython3.7m ... Example with numpy: $ ldd /usr/lib64/python3.7/site-packages/numpy/core/umath.cpython-37m-x86_64-linux-gnu.so ... libpython3.7m.so.1.0 => /lib64/libpython3.7m.so.1.0 (...) ... When Python 3.7 is compiled in debug mode, libpython gets a "d" flag for debug: libpython3.7dm.so.1.0. I see 2 solutions: (1) Use a different directory. If "libpython" gets the same filename in release and debug mode, at least, they must be installed in different directories. If libpython build in debug mode is installed in /usr/lib64/python3.7-dbg/ for example, python3.7-dbg should be compiled with -rpath /usr/lib64/python3.7-dbg/ to get the debug libpython. (2) If "libpython" gets a different filename in debug mode, C extensions should not be linked to libpython explicitly but *implicitly* to avoid picking the wrong libpython. For example, remove "-lpython3.7m" from "python3.7-config --ldflags" output. 
The option (1) rely on rpath which is discouraged by Linux vendors and may not be supported by all operating systems. The option (2) is simpler and likely more portable. Currently, C extensions of the standard library may or may not be linked to libpython depending on they are built. In practice, both work since python3.7 is already linked to libpython: so libpython is already loaded in memory before C extensions are loaded. I opened https://bugs.python.org/issue34814 to discuss how C extensions of the standard library should be linked but I closed it because we failed to find a consensus and the initial use case became a non-issue. It seems like we should reopen the discussion :-) Victor From solipsis at pitrou.net Wed Apr 24 13:00:21 2019 From: solipsis at pitrou.net (Antoine Pitrou) Date: Wed, 24 Apr 2019 19:00:21 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode References: Message-ID: <20190424190021.6e4852ec@fsol> On Wed, 24 Apr 2019 18:02:18 +0200 Victor Stinner wrote: > > I see 2 solutions: > > (1) Use a different directory. If "libpython" gets the same filename > in release and debug mode, at least, they must be installed in > different directories. If libpython build in debug mode is installed > in /usr/lib64/python3.7-dbg/ for example, python3.7-dbg should be > compiled with -rpath /usr/lib64/python3.7-dbg/ to get the debug > libpython. > > (2) If "libpython" gets a different filename in debug mode, C > extensions should not be linked to libpython explicitly but > *implicitly* to avoid picking the wrong libpython. For example, remove > "-lpython3.7m" from "python3.7-config --ldflags" output. > > The option (1) rely on rpath which is discouraged by Linux vendors and > may not be supported by all operating systems. > > The option (2) is simpler and likely more portable. > > Currently, C extensions of the standard library may or may not be > linked to libpython depending on they are built. In practice, both > work since python3.7 is already linked to libpython: so libpython is > already loaded in memory before C extensions are loaded. You can participate in https://bugs.python.org/issue21536 Regards Antoine. From stefan_ml at behnel.de Wed Apr 24 13:22:02 2019 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 24 Apr 2019 19:22:02 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: <5CC00F3F.9090602@UGent.be> References: <5CC00F3F.9090602@UGent.be> Message-ID: Jeroen Demeyer schrieb am 24.04.19 um 09:24: > On 2019-04-24 01:44, Victor Stinner wrote: >> I would like to >> be able to run C extensions compiled in release mode on a Python >> compiled in debug mode > > That seems like a very good idea. I would certainly use the debug mode > while developing CPython or C extensions. +1 Stefan From guido at python.org Wed Apr 24 14:23:40 2019 From: guido at python.org (Guido van Rossum) Date: Wed, 24 Apr 2019 11:23:40 -0700 Subject: [Python-Dev] Typing Summit at PyCon US In-Reply-To: References: Message-ID: I'd like to remind everyone that in 8 days, at PyCon US in Cleveland, we'll have the Typing Summit (the day after the Language Summit). There's still room to register ! So far I've received just under 20 registrations -- there's room for at least 20 more! The summit is for both developers and users of static type checkers for Python. 
Topics will include (not necessarily in this order): Michael Sullivan: Annotation growth at Dropbox, and how we used mypyc to speed up mypy 4x. Jelle Zijlstra: The future of typeshed. Jukka Lehtosalo: Modular typeshed. Ivan Levkivskyi: Typing and mypy usability. Andrey Vlasovskikh: Incremental static analysis in PyCharm. Guido van Rossum: Overview of upcoming typing PEPs (544: Protocols; 586: Literal; 589: TypedDict; 591: Final). There's also room to discuss more speculative changes to the type system, especially changes needed to support numpy, such as integer generics and variadic type variables, and special cases for wrapper functions using *(*args, **kwds). I'm looking for volunteers to speak about these topics. On Fri, Mar 22, 2019 at 11:23 AM Guido van Rossum wrote: > The typing summit is primarily a place for developers of type checkers to > collaborate, but we are also inviting (potential) users of type checkers. > For example, there are plans to extend the standard Python type system with > features intended to support numpy, Pandas, tensorflow and similar > libraries, and we will discuss these at the summit. Therefore developers > and power-users of such frameworks are especially welcome at the summit. > > With Ewa's and Dropbox's help I've arranged a room at PyCon. > > > *When: Thursday May 2nd, 1-5 pm (i.e. the day between the Language Summit > and the conference proper)* > *Where: Room 6 at PyCon in Cleveland* > > If you're planning to attend, please fill out this form: > > https://goo.gl/forms/rG9dVTBbgyBgDK8H2 > -- --Guido van Rossum (python.org/~guido) *Pronouns: he/him/his **(why is my pronoun here?)* -------------- next part -------------- An HTML attachment was scrubbed... URL: From vano at mail.mipt.ru Wed Apr 24 14:55:21 2019 From: vano at mail.mipt.ru (Ivan Pozdeev) Date: Wed, 24 Apr 2019 21:55:21 +0300 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: <20190424160322.63323ce1@fsol> References: <20190424160322.63323ce1@fsol> Message-ID: On 24.04.2019 17:03, Antoine Pitrou wrote: > On Wed, 24 Apr 2019 01:44:17 +0200 > Victor Stinner wrote: >> The first requirement for the use case is that a Python debug build >> supports the ABI of a release build. The current blocker issue is that >> the Py_DEBUG define imply the Py_TRACE_REFS define: PyObject gets 2 >> extra fields (_ob_prev and _ob_next) which change the offset of all >> attributes of all objects and makes the ABI completely incompatible. I >> propose to no longer imply Py_TRACE_REFS *by default* (but keep the >> code): >> >> https://bugs.python.org/issue36465 >> https://github.com/python/cpython/pull/12615 > +1 from me. > >> The second issue is that library filenames are different for a debug >> build: SOABI gets an additional "d" flag for Py_DEBUG. A debug build >> should first look for "NAME.cpython-38dm.so" (flags: "dm"), but then >> also look for "NAME.cpython-38m.so" (flags: "m"). > Sounds fair (but only on Unix, I guess). > >> Maybe pip could be enhanced to support installing C extensions >> compiled in release mode when using a debug mode. But that's more for >> convenience, it's not really required, since it is easy to switch the >> Python runtime between release and debug build. > Not sure what you mean by "easy to switch the Python runtime". As soon > as I want to use pip, I have to use a release build, right? No, pip works with a debug Python just as well (python.bat -m ensurepip) and installs modules to `/site-packages` IIRC. 
But building extensions is broken in this case as per https://mail.python.org/pipermail/python-dev/2019-April/157180.html . > Regards > > Antoine. > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/vano%40mail.mipt.ru -- Regards, Ivan From nas-python at arctrix.com Wed Apr 24 14:55:01 2019 From: nas-python at arctrix.com (Neil Schemenauer) Date: Wed, 24 Apr 2019 12:55:01 -0600 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: Message-ID: <20190424185501.j6kjdbmb4adj4x6x@python.ca> On 2019-04-24, Victor Stinner wrote: > The current blocker issue is that the Py_DEBUG define imply the > Py_TRACE_REFS define I think your change to make Py_TRACE_REFS as separate configure flag is fine. I've used the trace fields to debug occasionally but I don't use it often enough to need it enabled by Py_DEBUG. > Being able to switch between Python in release mode and Python in > debug mode is a first step. My long term plan would be to better > separate "Python" from its "runtime". Regarding the Py_TRACE_REFS fields, I think we can't do them without breaking the ABI because of the following. For GC objects, they are always allocated by _PyObject_GC_New/_PyObject_GC_NewVar. So, we can allocate the extra space needed for the GC linked list. For non-GC objects, that's not the case. Extensions can allocate using malloc() directly or their own allocator and then pass that memory to be initialized as a PyObject. I think that's a poor design and I think we should try to make slow progress in fixing it. I think non-GC objects should also get allocated by a Python API. In that case, the Py_TRACE_REFS functionality could be implemented in a way that doesn't break the ABI. It also makes the CPython API more friendly for alternative Python runtimes like PyPy, etc. Note that this change would not prevent an extension from allocating memory with it's own allocator. It just means that memory can't hold a PyObject. The extension PyObject would need to have a pointer that points to this externally allocated memory. I can imagine there could be some situations when people really want a PyObject to reside in a certain memory location. E.g. maybe you have some kind of special shared memory area. In that case, I think we could have specialized APIs to create PyObjects using a specialized allocator. Those APIs would not be supported by some runtimes (e.g. tracing/moving GC for PyObjects) and the APIs would not be used by most extensions. Regards, Neil From pviktori at redhat.com Wed Apr 24 18:24:15 2019 From: pviktori at redhat.com (Petr Viktorin) Date: Wed, 24 Apr 2019 18:24:15 -0400 Subject: [Python-Dev] PEP 580/590 discussion In-Reply-To: References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> Message-ID: So, I spent another day pondering the PEPs. I love PEP 590's simplicity and PEP 580's extensibility. As I hinted before, I hope they can they be combined, and I believe we can achieve that by having PEP 590's (o+offset) point not just to function pointer, but to a {function pointer; flags} struct with flags defined for two optimizations: - "Method-like", i.e. compatible with LOAD_METHOD/CALL_METHOD. 
- "Argument offsetting request", allowing PEP 590's PY_VECTORCALL_ARGUMENTS_OFFSET optimization. This would mean one basic call signature (today's METH_FASTCALL | METH_KEYWORD), with individual optimizations available if both the caller and callee support them. In case you want to know my thoughts or details, let me indulge in some detailed comparisons and commentary that led to this. I also give a more detailed proposal below. Keep in mind I wrote this before I distilled it to the paragraph above, and though the distillation is written as a diff to PEP 590, I still think of this as merging both PEPs. PEP 580 tries hard to work with existing call conventions (like METH_O, METH_VARARGS), making them fast. PEP 590 just defines a new convention. Basically, any callable that wants performance improvements must switch to METH_VECTORCALL (fastcall). I believe PEP 590's approach is OK. To stay as performant as possible, C extension authors will need to adapt their code regularly. If they don't, no harm -- the code will still work as before, and will still be about as fast as it was before. In exchange for this, Python (and Cython, etc.) can focus on optimizing one calling convention, rather than a variety, each with its own advantages and drawbacks. Extending PEP 580 to support a new calling convention will involve defining a new CCALL_* constant, and adding to existing dispatch code. Extending PEP 590 to support a new calling convention will most likely require a new type flag, and either changing the vectorcall semantics or adding a new pointer. To be a bit more concrete, I think of possible extensions to PEP 590 as things like: - Accepting a kwarg dict directly, without copying the items to tuple/array (as in PEP 580's CCALL_VARARGS|CCALL_KEYWORDS) - Prepending more than one positional argument, or appending positional arguments - When an optimization like LOAD_METHOD/CALL_METHOD turns out to no longer be relevant, removing it to simplify/speed up code. I expect we'll later find out that something along these lines might improve performance. PEP 590 would make it hard to experiment. I mentally split PEP 590 into two pieces: formalizing fastcall, plus one major "extension" -- making bound methods fast. When seen this way, this "extension" is quite heavy: it adds an additional type flag, Py_TPFLAGS_METHOD_DESCRIPTOR, and uses a bit in the "Py_ssize_t nargs" argument as additional flag. Both type flags and nargs bits are very limited resources. If I was sure vectorcall is the final best implementation we'll have, I'd go and approve it ? but I think we still need room for experimentation, in the form of more such extensions. PEP 580, with its collection of per-instance data and flags, is definitely more extensible. What I don't like about it is that it has the extensions built-in; mandatory for all callers/callees. PEP 580 adds a common data struct to callable instances. Currently these are all data bound methods want to use (cc_flags, cc_func, cc_parent, cr_self). Various flags are consulted in order to deliver the needed info to the underlying function. PEP 590 lets the callable object store data it needs independently. It provides a clever mechanism for pre-allocating space for bound methods' prepended "self" argument, so data can be provided cheaply, though it's still done by the callable itself. Callables that would need to e.g. prepend more than one argument won't be able to use this mechanism, but y'all convinced me that is not worth optimizing for. 
PEP 580's goal seems to be that making a callable behave like a Python function/method is just a matter of the right set of flags. Jeroen called this "complexity in the protocol". PEP 590, on the other hand, leaves much to individual callable types. This is "complexity in the users of the protocol". I now don't see a problem with PEP 590's approach. Not all users will need the complexity. We need to give CPython and Cython the tools to make implementing "def"-like functions possible (and fast), but if other extensions need to match the behavior of Python functions, they should just use Cython. Emulating Python functions is a special-enough use case that it doesn't justify complicating the protocol, and the same goes for implementing Python's built-in functions (with all their historical baggage). My more full proposal for a compromise between PEP 580 and 590 would go something like below. The type flag (Py_TPFLAGS_HAVE_VECTORCALL/Py_TPFLAGS_HAVE_CCALL) and offset (tp_vectorcall_offset/tp_ccalloffset; in tp_print's place) stay. The offset identifies a per-instance structure with two fields: - Function pointer (with the vectorcall signature) - Flags Storing any other per-instance data (like PEP 580's cr_self/cc_parent) is the responsibility of each callable type. Two flags are defined initially: 1. "Method-like" (like Py_TPFLAGS_METHOD_DESCRIPTOR in PEP 580, or non-NULL cr_self in PEP 580). Having the flag here instead of a type flag will prevent tp_call-only callables from taking advantage of LOAD_METHOD/CALL_METHOD optimisation, but I think that's OK. 2. Request to reserve space for one argument before the args array, as in PEP 590's argument offsetting. If the flag is missing, nargs may not include PY_VECTORCALL_ARGUMENTS_OFFSET. A mechanism incompatible with offsetting may use the bit for another purpose. Both flags may be simply ignored by the caller (or not be set by the callee in the first place), reverting to a more straightforward (but less performant) code path. This should also be the case for any flags added in the future. Note how without these flags, the protocol (and its documentation) will be extremely simple. This mechanism would work with my examples of possible future extensions: - "kwarg dict": A flag would enable the `kwnames` argument to be a dict instead of a tuple. - prepending/appending several positional arguments: The callable's request for how much space to allocate stored right after the {func; flags} struct. As in argument offsetting, a bit in nargs would indicate that the request was honored. (If this was made incompatible with one-arg offsetting, it could reuse the bit.) - removing an optimization: CPython would simply stop using an optimizations (but not remove the flag). Extensions could continue to use the optimization between themselves. As in PEP 590, any class that uses this mechanism shall not be usable as a base class. This will simplify implementation and tests, but hopefully the limitation will be removed in the future. (Maybe even in the initial implementation.) The METH_VECTORCALL (aka CCALL_FASTCALL|CCALL_KEYWORDS) calling convention is added to the public API. The other calling conventions (PEP 580's CCALL_O, CCALL_NOARGS, CCALL_VARARGS, CCALL_KEYWORDS, CCALL_FASTCALL, CCALL_DEFARG) as well as argument type checking (CCALL_OBJCLASS) and self slicing (CCALL_SELFARG) are left up to the callable. No equivalent of PEP 580's restrictions on the __name__ attribute. 
In my opinion, the PyEval_GetFuncName function should just be deprecated in favor of getting the __name__ attribute and checking if it's a string. It would be possible to add a public helper that returns a proper reference, but that doesn't seem worth it. Either way, I consider this out of scope of this PEP. No equivalent of PEP 580's PyCCall_GenericGetParent and PyCCall_GenericGetQualname either -- again, if needed, they should be retrieved as normal attributes. As I see it, the operation doesn't need to be particularly fast. No equivalent of PEP 580's PyCCall_Call, and no support for dict in PyCCall_FastCall's kwds argument. To be fast, extensions should avoid passing kwargs in a dict. Let's see how far that takes us. (FWIW, this also avoids subtle issues with dict mutability.) Profiling stays as in PEP 580: only exact function types generate the events. As in PEP 580, PyCFunction_GetFlags and PyCFunction_GET_FLAGS are deprecated As in PEP 580, nothing is added to the stable ABI Does that sound reasonable? From pviktori at redhat.com Wed Apr 24 18:24:04 2019 From: pviktori at redhat.com (Petr Viktorin) Date: Wed, 24 Apr 2019 18:24:04 -0400 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <5CAE76B6.6090501@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CAE76B6.6090501@UGent.be> Message-ID: <19993545-3fdf-4bf6-56dd-e926d2032a3a@redhat.com> On 4/10/19 7:05 PM, Jeroen Demeyer wrote: > On 2019-04-10 18:25, Petr Viktorin wrote: >> Hello! >> I've had time for a more thorough reading of PEP 590 and the reference >> implementation. Thank you for the work! > > And thank you for the review! > >> I'd now describe the fundamental >> difference between PEP 580 and PEP 590 as: >> - PEP 580 tries to optimize all existing calling conventions >> - PEP 590 tries to optimize (and expose) the most general calling >> convention (i.e. fastcall) > > And PEP 580 has better performance overall, even for METH_FASTCALL. See > this thread: > https://mail.python.org/pipermail/python-dev/2019-April/156954.html > > Since these PEPs are all about performance, I consider this a very > relevant argument in favor of PEP 580. All about performance as well as simplicity, correctness, testability, teachability... And PEP 580 touches some introspection :) >> PEP 580 also does a number of other things, as listed in PEP 579. But I >> think PEP 590 does not block future PEPs for the other items. >> On the other hand, PEP 580 has a much more mature implementation -- and >> that's where it picked up real-world complexity. > About complexity, please read what I wrote in > https://mail.python.org/pipermail/python-dev/2019-March/156853.html > > I claim that the complexity in the protocol of PEP 580 is a good thing, > as it removes complexity from other places, in particular from the users > of the protocol (better have a complex protocol that's simple to use, > rather than a simple protocol that's complex to use). I think we're talking past each other. I see now it as: PEP 580 takes existing complexity and makes it available to all users, in a simpler way. It makes existing code faster. PEP 590 defines a new simple/fast protocol for its users, and instead of making existing complexity faster and easier to use, it's left to be deprecated/phased out (or kept in existing classes for backwards compatibility). 
It makes it possible for future code to be faster/simpler. I think things should be simple by default, but if people want some extra performance, they can opt in to some extra complexity. > As a more concrete example of the simplicity that PEP 580 could bring, > CPython currently has 2 classes for bound methods implemented in C: > - "builtin_function_or_method" for normal C methods > - "method-descriptor" for slot wrappers like __eq__ or __add__ > > With PEP 590, these classes would need to stay separate to get maximal > performance. With PEP 580, just one class for bound methods would be > sufficient and there wouldn't be any performance loss. And this extends > to custom third-party function/method classes, for example as > implemented by Cython. Yet, for backwards compatibility reasons, we can't merge the classes. Also, I think CPython and Cython are exactly the users that can trade some extra complexity for better performance. >> Jeroen's analysis from >> https://mail.python.org/pipermail/python-dev/2018-July/154238.html seems >> to miss a step at the top: >> >> a. CALL_FUNCTION* / CALL_METHOD opcode >> ?????? calls >> b. _PyObject_FastCallKeywords() >> ?????? which calls >> c. _PyCFunction_FastCallKeywords() >> ?????? which calls >> d. _PyMethodDef_RawFastCallKeywords() >> ?????? which calls >> e. the actual C function (*ml_meth)() >> >> I think it's more useful to say that both PEPs bridge a->e (via >> _Py_VectorCall or PyCCall_Call). > > Not quite. For a builtin_function_or_method, we have with PEP 580: > > a. call_function() > ??? calls > d. PyCCall_FastCall > ??? which calls > e. the actual C function > > and with PEP 590 it's more like: > > a. call_function() > ??? calls > c. _PyCFunction_FastCallKeywords > ??? which calls > d. _PyMethodDef_RawFastCallKeywords > ??? which calls > e. the actual C function > > Level c. above is the vectorcall wrapper, which is a level that PEP 580 > doesn't have. PEP 580 optimizes all the code paths, where PEP 590 optimizes the fast path, and makes sure most/all use cases can use (or switch to) the fast path. Both fast paths are fast: bridging a->e using zero-copy arg passing with some C calls and flag checks. The PEP 580 approach is faster; PEP 590's is simpler. >> Jeroen, is there something in PEPs 579/580 that PEP 590 blocks, or >> should address? > > Well, PEP 580 is an extensible protocol while PEP 590 is not. But, > PyTypeObject is extensible, so even with PEP 590 one can always extend > that (for example, PEP 590 uses a type flag Py_TPFLAGS_METHOD_DESCRIPTOR > where PEP 580 instead uses the structs for the C call protocol). But I > guess that extending PyTypeObject will be harder to justify (say, in a > future PEP) than extending the C call protocol. That's a good point. > Also, it's explicitly allowed for users of the PEP 580 protocol to > extend the PyCCallDef structure with custom fields. But I don't have a > concrete idea of whether that will be useful. Unless I'm missing something, that would be effectively the same as extending their own instance struct. To bring any benefits, the extended PyCCallDef would need to be standardized in a PEP. 
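For readers following along, the distinct C-level method classes mentioned above can be inspected directly from a plain CPython release build:

    >>> type(list.append)       # C method, unbound
    <class 'method_descriptor'>
    >>> type([].append)         # C method, bound
    <class 'builtin_function_or_method'>
    >>> type(list.__add__)      # slot wrapper, unbound
    <class 'wrapper_descriptor'>
    >>> type([].__add__)        # slot wrapper, bound
    <class 'method-wrapper'>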
From pviktori at redhat.com Wed Apr 24 18:25:40 2019 From: pviktori at redhat.com (Petr Viktorin) Date: Wed, 24 Apr 2019 18:25:40 -0400 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> Message-ID: <5a0eda4c-1e34-132e-05bf-72bcb40f44e3@redhat.com> Hi Mark! See my more general reply; here I'll just tie loose ends with a few +1s. On 4/14/19 7:30 AM, Mark Shannon wrote: > On 10/04/2019 5:25 pm, Petr Viktorin wrote: [...] >> PEP 590 is built on a simple idea, formalizing fastcall. But it is >> complicated by PY_VECTORCALL_ARGUMENTS_OFFSET and >> Py_TPFLAGS_METHOD_DESCRIPTOR. >> As far as I understand, both are there to avoid intermediate >> bound-method object for LOAD_METHOD/CALL_METHOD. (They do try to be >> general, but I don't see any other use case.) >> Is that right? > > Not quite. > Py_TPFLAGS_METHOD_DESCRIPTOR is for LOAD_METHOD/CALL_METHOD, it allows > any callable descriptor to benefit from the LOAD_METHOD/CALL_METHOD > optimisation. > > PY_VECTORCALL_ARGUMENTS_OFFSET exists so that callables that make onward > calls with an additional argument can do so efficiently. The obvious > example is bound-methods, but classes are at least as important. > cls(*args) -> cls.new(cls, *args) -> cls.__init__(self, *args) I see. Thanks! >> (I'm running out of time today, but I'll write more on why I'm asking, >> and on the case I called "impossible" (while avoiding creation of a >> "bound method" object), later.) Let me drop this thread; I stand corrected. >> Another point I'd like some discussion on is that vectorcall function >> pointer is per-instance. It looks this is only useful for type >> objects, but it will add a pointer to every new-style callable object >> (including functions). That seems wasteful. >> Why not have a per-type pointer, and for types that need it (like >> PyTypeObject), make it dispatch to an instance-specific function? > > Firstly, each callable has different behaviour, so it makes sense to be > able to do the dispatch from caller to callee in one step. Having a > per-object function pointer allows that. > Secondly, callables are either large or transient. If large, then the > extra few bytes makes little difference. If transient then, it matters > even less. > The total increase in memory is likely to be only a few tens of > kilobytes, even for a large program. That makes sense. From njs at pobox.com Thu Apr 25 02:31:04 2019 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 24 Apr 2019 23:31:04 -0700 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: Message-ID: You don't necessarily need rpath actually. The Linux loader has a bug/feature where once it has successfully loaded a library with a given soname, then any future requests for that soname within the same process will automatically return that same library, regardless of rpath settings etc. So as long as the main interpreter has loaded libpython.whatever from the correct directory, then extension modules will all get that same version. The rpath won't matter at all. It is annoying in general that on Linux, we have these two different ways to build extension modules. It definitely violates TOOWTDI :-). It would be nice at some point to get rid of one of them. 
Note that we can't get rid of the two different ways entirely though ? on Windows, extension modules *must* link to libpython.dll, and on macOS, extension modules *can't* link to libpython.dylib. So the best we can hope for is to make Linux consistently do one of these, instead of supporting both. In principle, having extension modules link to libpython.so is a good thing. Suppose that someone wants to dynamically load the python interpreter into their program as some kind of plugin. (Examples: Apache's mod_python, LibreOffice's support for writing macros in Python.) It would be nice to be able to load python2 and python3 simultaneously into the same process as distinct plugins. And this is totally doable in theory, *but* it means that you can't assume that the interpreter's symbols will be automagically injected into extension modules, so it's only possible if extension modules link to libpython.so. In practice, extension modules have never consistently linked to libpython.so, so everybody who loads the interpreter as a plugin has already worked around this. Specifically, they use RTLD_GLOBAL to dump all the interpreter's symbols into the global namespace. This is why you can't have python2 and python3 mod_python at the same time in the same Apache. And since everyone is already working around this, linking to libpython.so currently has zero benefit... in fact manylinux wheels are actually forbidden to link to libpython.so, because this is the only way to get wheels that work on every interpreter. -n On Wed, Apr 24, 2019, 09:54 Victor Stinner wrote: > Hum, I found issues with libpython: C extensions are explicitly linked > to libpython built in release mode. So a debug python loading a C > extension may load libpython in release mode, whereas libpython in > debug mode is already loaded. > > When Python is built with --enable-shared, the python3.7 program is > linked to libpython3.7m.so.1.0 on Linux. C extensions are explicitly > linked to libpython3.7m as well: > > $ python3.7-config --ldflags > ... -lpython3.7m ... > > Example with numpy: > > $ ldd /usr/lib64/python3.7/site-packages/numpy/core/ > umath.cpython-37m-x86_64-linux-gnu.so > ... > libpython3.7m.so.1.0 => /lib64/libpython3.7m.so.1.0 (...) > ... > > When Python 3.7 is compiled in debug mode, libpython gets a "d" flag > for debug: libpython3.7dm.so.1.0. > > I see 2 solutions: > > (1) Use a different directory. If "libpython" gets the same filename > in release and debug mode, at least, they must be installed in > different directories. If libpython build in debug mode is installed > in /usr/lib64/python3.7-dbg/ for example, python3.7-dbg should be > compiled with -rpath /usr/lib64/python3.7-dbg/ to get the debug > libpython. > > (2) If "libpython" gets a different filename in debug mode, C > extensions should not be linked to libpython explicitly but > *implicitly* to avoid picking the wrong libpython. For example, remove > "-lpython3.7m" from "python3.7-config --ldflags" output. > > The option (1) rely on rpath which is discouraged by Linux vendors and > may not be supported by all operating systems. > > The option (2) is simpler and likely more portable. > > Currently, C extensions of the standard library may or may not be > linked to libpython depending on they are built. In practice, both > work since python3.7 is already linked to libpython: so libpython is > already loaded in memory before C extensions are loaded. 
> > I opened https://bugs.python.org/issue34814 to discuss how C > extensions of the standard library should be linked but I closed it > because we failed to find a consensus and the initial use case became > a non-issue. It seems like we should reopen the discussion :-) > > Victor > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/njs%40pobox.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From doko at ubuntu.com Thu Apr 25 03:18:54 2019 From: doko at ubuntu.com (Matthias Klose) Date: Thu, 25 Apr 2019 09:18:54 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: Message-ID: <92ffccf0-b990-6dd1-1ca5-5361ecb0bf31@ubuntu.com> On 24.04.19 18:02, Victor Stinner wrote: > Hum, I found issues with libpython: C extensions are explicitly linked > to libpython built in release mode.
It would > be nice to be able to load python2 and python3 simultaneously into the same > process as distinct plugins. And this is totally doable in theory, *but* it > means that you can't assume that the interpreter's symbols will be > automagically injected into extension modules, so it's only possible if > extension modules link to libpython.so. > > In practice, extension modules have never consistently linked to > libpython.so, so everybody who loads the interpreter as a plugin has > already worked around this. Specifically, they use RTLD_GLOBAL to dump all > the interpreter's symbols into the global namespace. This is why you can't > have python2 and python3 mod_python at the same time in the same Apache. > And since everyone is already working around this, linking to libpython.so > currently has zero benefit... in fact manylinux wheels are actually > forbidden to link to libpython.so, because this is the only way to get > wheels that work on every interpreter. extensions in Debian/Ubuntu packages are not linked against libpython.so, but the main reason here is that sometimes you have to extensions built in transition periods like for 3.6 and 3.7. And this is also the default when not configuring with --enable-shared. From doko at ubuntu.com Thu Apr 25 03:24:57 2019 From: doko at ubuntu.com (Matthias Klose) Date: Thu, 25 Apr 2019 09:24:57 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: Message-ID: <7aad8d69-2d66-54a3-26a4-20dcc5150f66@ubuntu.com> On 24.04.19 01:44, Victor Stinner wrote: > Hi, > > Two weeks ago, I started a thread "No longer enable Py_TRACE_REFS by > default in debug build", but I lost myself in details, I forgot the > main purpose of my proposal... > > Let me retry from scratch with a more explicit title: I would like to > be able to run C extensions compiled in release mode on a Python > compiled in debug mode ("pydebug"). The use case is to debug bugs in C > extensions thanks to additional runtime checks of a Python debug > build, and more generally get a better debugging experiences on > Python. Even for pure Python, a debug build is useful (to get the > Pyhon traceback in gdb using "py-bt" command). > > Currently, using a Python compiled in debug mode means to have to > recompile C extensions in debug mode. Compile a C extension requires a > C compiler, header files, pull dependencies, etc. It can be very > complicated in practical (and pollute your system with all these > additional dependencies). On Linux, it's already hard, but on Windows > it can be even harder. > > Just one concrete example: no debug build of numpy is provided at > https://pypi.org/project/numpy/ Good luck to build numpy in debug mode > manually (install OpenBLAS, ATLAS, Fortran compiler, Cython, etc.) > :-) there's a simple solution: apt install python3-numpy-dbg cython3-dbg ;) So depending on the package maintainer, you already have that available, but it is extra maintenance cost. Simplifying that would be a good idea. However I still would like to be able to have "debug" and "non-debug" builds co-installable at the same time. 
From J.Demeyer at UGent.be Thu Apr 25 05:12:18 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Thu, 25 Apr 2019 11:12:18 +0200 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <19993545-3fdf-4bf6-56dd-e926d2032a3a@redhat.com> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CAE76B6.6090501@UGent.be> <19993545-3fdf-4bf6-56dd-e926d2032a3a@redhat.com> Message-ID: <5CC179F2.5050200@UGent.be> On 2019-04-25 00:24, Petr Viktorin wrote: > PEP 590 defines a new simple/fast protocol for its users, and instead of > making existing complexity faster and easier to use, it's left to be > deprecated/phased out (or kept in existing classes for backwards > compatibility). It makes it possible for future code to be faster/simpler. Can you elaborate on what you mean with this deprecating/phasing out? What's your view on dealing with method classes (not necessarily right now, but in the future)? Do you think that having separate method classes like method-wrapper (for example [].__add__) is good or bad? Since the way how PEP 580 and PEP 590 deal with bound method classes is very different, I would like to know the roadmap for this. Jeroen. From vstinner at redhat.com Thu Apr 25 06:39:44 2019 From: vstinner at redhat.com (Victor Stinner) Date: Thu, 25 Apr 2019 12:39:44 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: Message-ID: Hi, I'm now convinced that C extensions must *not* be linked to libpython on Unix. I wrote PR 12946: https://github.com/python/cpython/pull/12946 bpo-21536: C extensions are no longer linked to libpython On Unix, C extensions are no longer linked to libpython. It is now possible to load a C extension built using a shared library Python with a statically linked Python. When Python is embedded, libpython must not be loaded with RTLD_LOCAL, but RTDL_GLOBAL instead. Previously, using RTLD_LOCAL, it was already not possible to load C extensions which were not linked to libpython, like C extensions of the standard library built by the "*shared*" section of Modules/Setup. distutils, python-config and python-config.py have been modified. This PR allows to load a C extension built by a shared libpython with a statically linked Python: https://bugs.python.org/issue21536#msg340819 It also allows to load C extension built in release mode with a Python built in debug mode: https://bugs.python.org/issue21536#msg340821 Le jeu. 25 avr. 2019 ? 08:31, Nathaniel Smith a ?crit : > In principle, having extension modules link to libpython.so is a good thing. Suppose that someone wants to dynamically load the python interpreter into their program as some kind of plugin. (Examples: Apache's mod_python, LibreOffice's support for writing macros in Python.) It would be nice to be able to load python2 and python3 simultaneously into the same process as distinct plugins. And this is totally doable in theory, *but* it means that you can't assume that the interpreter's symbols will be automagically injected into extension modules, so it's only possible if extension modules link to libpython.so. 
I'm aware of 2 special use cases of libpython: (A) Embed Python using RTLD_LOCAL: dlopen("libpython2.7.so.1.0", RTLD_LOCAL | RTLD_NOW) Example of issues describing this use case: * 2003: https://bugs.python.org/issue832799 * 2006: https://bugs.python.org/issue1429775 * 2018: https://bugs.python.org/issue34814 and https://bugzilla.redhat.com/show_bug.cgi?id=1585201 Python started to link C extensions to libpython in 2006 for this use case. (B) Load "libpython2" (Python 2) and "libpython3" (Python 3). I heard this idea... but I never saw anyone doing it in practice. I don't understand how it could work in a single address space. Linking C extensions to libpython is causing different issues: (1) C extension built by a shared library Python cannot be loaded with a statically linked Python: https://bugs.python.org/issue21536 (2) C extension built in release mode cannot be loaded with Python built in debug mode. That's the issue discussed in this thread ;-) (3) C extension built by Python 3.6 cannot be loaded in Python 3.7, even if it has been compiled using the stable ABI (Py_LIMITED_API). (4) C extensions of the standard library built by "*shared*" of Modules/Setup are *not* linked to libpython. For example, _struct.so on Fedora is not linked to libpython, whereas . If libpython is loaded with RTLD_LOCAL (use case A), import _struct fails. The use case (A) (RTLD_LOCAL) is trivial to fix: replace RTLD_LOCAL with RTLD_GLOBAL. The use case (B) (libpython2 + libpython3) is also easy to workaround: just use 2 separated processes. Python 2 will reach its end of life at the end of the year, I'm not sure that we should worry too much about this use case. The issue (1) (statically/shared) is a very practical issue. Fedora/RHEL uses libpython whereas Debian/Ubuntu uses statically linked Python. C extension compiled on Fedora/RHEL is linked to libpython and so cannot be loaded on Debian/Ubuntu (their Python doesn't have libpython!). That's why manylinux forbid link to link to libpython: be able to load C extensions on all Linux distributions. IMHO issues (1), (2), (3), (4) are more valuable to be fixed than supporting use cases (A) and (B). Victor -- Night gathers, and now my watch begins. It shall not end until my death. From vstinner at redhat.com Thu Apr 25 06:53:23 2019 From: vstinner at redhat.com (Victor Stinner) Date: Thu, 25 Apr 2019 12:53:23 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: <92ffccf0-b990-6dd1-1ca5-5361ecb0bf31@ubuntu.com> References: <92ffccf0-b990-6dd1-1ca5-5361ecb0bf31@ubuntu.com> Message-ID: Le jeu. 25 avr. 2019 ? 09:30, Matthias Klose a ?crit : > the purpose of python-config here is not clear. Whether it's intended to be used > for linking extensions, or embedded interpreters. Currently you are using the > same for both use cases. My PR 12946 removes libpython from distutils, python-config and python-config.py: https://github.com/python/cpython/pull/12946 Do you mean that this change will break the build of applications embedding Python? If yes, what can done to fix that? Provide a different script to the specific case of embedded Python? Or add a new option to specify that you are embedding Python? In Python 3.7, the required linker flag is "-lpython3.7m". It's not trivial to guess the "m" suffix. FYI Python 3.8 it becames just "-lpython3.8": I removed the "m" suffix which was useless. Victor -- Night gathers, and now my watch begins. It shall not end until my death. 
From vstinner at redhat.com Thu Apr 25 07:14:10 2019 From: vstinner at redhat.com (Victor Stinner) Date: Thu, 25 Apr 2019 13:14:10 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: <7aad8d69-2d66-54a3-26a4-20dcc5150f66@ubuntu.com> References: <7aad8d69-2d66-54a3-26a4-20dcc5150f66@ubuntu.com> Message-ID: Le jeu. 25 avr. 2019 ? 09:34, Matthias Klose a ?crit : > there's a simple solution: apt install python3-numpy-dbg cython3-dbg ;) So > depending on the package maintainer, you already have that available, but it is > extra maintenance cost. Simplifying that would be a good idea. Fedora provides "debuginfo" for all binarry packages (like numpy), but that's different from a debug build. Usually, C code of packages are optimized by gcc -O2 or even gcc -O3 which makes the debugging experience very painful: gdb fails to read C local variables and just say "". To debug internals, you want a debug build compiled by gcc -Og or (better IMHO) gcc -O0. If you want to inspect *Python* internals but you don't need to inspect numpy internals, being able to run a release numpy on a debug Python is convenient. With an additional change on SOABI (I will open a separated issue for that), my PR 12946 (no longer link C extensions to libpython) allows to load lxml built in release mode in a Python built in debug mode! That's *very* useful for debugging. I show an example of the gdb experience with a release Python vs debug Python: https://bugs.python.org/issue21536#msg340821 With a release Python, the basic function "py-bt" works as expected, but inspecting Python internals doesn't work: most local C variables are "optimized out" :-( With a debug Python, the debugging experience is *much* better: it's possible to inspect Python internals! > However I still > would like to be able to have "debug" and "non-debug" builds co-installable at > the same time. One option is to keep "d" flag in the SOABI so C extensions get a different SO filename (no change compared to Python 3.7): "NAME.cpython-38-x86_64-linux-gnu.so" for release vs "NAME.cpython-38d-x86_64-linux-gnu.so" for debug, debug gets "d" suffix ("cpython-38" vs "cpython-38d"). *But* modify importlib when Python is compiled in debug mode to look also to SO without the "d" suffix: first try load "NAME.cpython-38d-x86_64-linux-gnu.so" (debug: "d" suffix). If there is no match, look for "NAME.cpython-38-x86_64-linux-gnu.so" (release: no suffix). Since the ABI is now compatible in Python 3.8, it should "just work" :-) >From a Linux packager perspective, nothing changes ;-) We can still provide "apt install python3-numpy-dbg" (debug) which can is co-installable with "apt install python3-numpy" (release). The benefit is that it will be possible to load C extensions which are only available in the release flavor with a debug Python ;-) Victor -- Night gathers, and now my watch begins. It shall not end until my death. From vstinner at redhat.com Thu Apr 25 07:26:35 2019 From: vstinner at redhat.com (Victor Stinner) Date: Thu, 25 Apr 2019 13:26:35 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: <92ffccf0-b990-6dd1-1ca5-5361ecb0bf31@ubuntu.com> Message-ID: I looked how fonforge gets compiler and linker flags to embed Python: it seems like to "pkg-config --libs python-2.7" which returns "-lpython2.7". My PR doesn't change Misc/python.pc. Should I modify Misc/python.pc as well... or not? :-) I'm not used to pkg-config. 
I don't know if it's common that C extensions are built using pkg-config. I guess that distutils is more commonly used to build C extensions. Victor Le jeu. 25 avr. 2019 ? 12:53, Victor Stinner a ?crit : > Le jeu. 25 avr. 2019 ? 09:30, Matthias Klose a ?crit : > > the purpose of python-config here is not clear. Whether it's intended to be used > > for linking extensions, or embedded interpreters. Currently you are using the > > same for both use cases. > > My PR 12946 removes libpython from distutils, python-config and > python-config.py: > https://github.com/python/cpython/pull/12946 > > Do you mean that this change will break the build of applications > embedding Python? If yes, what can done to fix that? > > Provide a different script to the specific case of embedded Python? Or > add a new option to specify that you are embedding Python? > > In Python 3.7, the required linker flag is "-lpython3.7m". It's not > trivial to guess the "m" suffix. FYI Python 3.8 it becames just > "-lpython3.8": I removed the "m" suffix which was useless. > > Victor > -- > Night gathers, and now my watch begins. It shall not end until my death. -- Night gathers, and now my watch begins. It shall not end until my death. From doko at ubuntu.com Thu Apr 25 07:41:38 2019 From: doko at ubuntu.com (Matthias Klose) Date: Thu, 25 Apr 2019 13:41:38 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: <92ffccf0-b990-6dd1-1ca5-5361ecb0bf31@ubuntu.com> Message-ID: <9085f0c0-d84a-a339-3a34-ce3ac8e10ee7@ubuntu.com> On 25.04.19 13:26, Victor Stinner wrote: > I looked how fonforge gets compiler and linker flags to embed Python: > it seems like to "pkg-config --libs python-2.7" which returns > "-lpython2.7". My PR doesn't change Misc/python.pc. Should I modify > Misc/python.pc as well... or not? :-) I'm not used to pkg-config. I > don't know if it's common that C extensions are built using > pkg-config. I guess that distutils is more commonly used to build C > extensions. ... except for all the software which is doing some embedding (e.g. vim), or is building some bindings as part of the upstream software. So yes, there is some stuff ... The tendency seems to deprecate your own config helper in favor of pkgconfig. However I'm not sure how this would do with the current MacOS python-config python script. If we want to differentiate between embedding and extensions, then we need two different module names, maybe keeping the current one for extensions, and having a new one for embedding. Not sure about python-config, if we want a new helper for embedding, or add new options for the existing script. > Victor > > Le jeu. 25 avr. 2019 ? 12:53, Victor Stinner a ?crit : >> Le jeu. 25 avr. 2019 ? 09:30, Matthias Klose a ?crit : >>> the purpose of python-config here is not clear. Whether it's intended to be used >>> for linking extensions, or embedded interpreters. Currently you are using the >>> same for both use cases. >> >> My PR 12946 removes libpython from distutils, python-config and >> python-config.py: >> https://github.com/python/cpython/pull/12946 >> >> Do you mean that this change will break the build of applications >> embedding Python? If yes, what can done to fix that? >> >> Provide a different script to the specific case of embedded Python? Or >> add a new option to specify that you are embedding Python? >> >> In Python 3.7, the required linker flag is "-lpython3.7m". It's not >> trivial to guess the "m" suffix. 
FYI Python 3.8 it becames just >> "-lpython3.8": I removed the "m" suffix which was useless. >> >> Victor >> -- >> Night gathers, and now my watch begins. It shall not end until my death. > > > From doko at ubuntu.com Thu Apr 25 07:48:05 2019 From: doko at ubuntu.com (Matthias Klose) Date: Thu, 25 Apr 2019 13:48:05 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: <7aad8d69-2d66-54a3-26a4-20dcc5150f66@ubuntu.com> Message-ID: <6a395e77-d681-69f8-ba32-f6fd47f261cb@ubuntu.com> On 25.04.19 13:14, Victor Stinner wrote: > Le jeu. 25 avr. 2019 ? 09:34, Matthias Klose a ?crit : >> there's a simple solution: apt install python3-numpy-dbg cython3-dbg ;) So >> depending on the package maintainer, you already have that available, but it is >> extra maintenance cost. Simplifying that would be a good idea. > > Fedora provides "debuginfo" for all binarry packages (like numpy), but > that's different from a debug build. Usually, C code of packages are > optimized by gcc -O2 or even gcc -O3 which makes the debugging > experience very painful: gdb fails to read C local variables and just > say "". To debug internals, you want a debug build > compiled by gcc -Og or (better IMHO) gcc -O0. > > If you want to inspect *Python* internals but you don't need to > inspect numpy internals, being able to run a release numpy on a debug > Python is convenient. yes, the Debian/Ubuntu packages contain both the debug build, and the debug info for they normal build, e.g. /usr/lib/debug/.build-id/3a/8ea2ab6ee85ff68879a48170966873eb8da781.debug /usr/lib/debug/.build-id/78/5ff95f8d2d06c5990ae4e03cdff99452ca0de9.debug /usr/lib/debug/.build-id/92/e008cffa3f09106214bfb6b80b7fd02ceab74f.debug /usr/lib/debug/.build-id/ab/33160518c41acc0488bbc3af878995ef74e07f.debug /usr/lib/debug/.build-id/bd/65896626a4c6566e96ad008362922cf6a39cd6.debug /usr/lib/debug/.build-id/f1/e83b14a76dd9564e962dcdd2f70202e6fdb2b1.debug /usr/lib/debug/.build-id/ff/5eab5fd2d14f4bfa6a1ef2300358efdc7dd800.debug /usr/lib/python3/dist-packages/lxml/_elementpath.cpython-37dm-x86_64-linux-gnu.so /usr/lib/python3/dist-packages/lxml/builder.cpython-37dm-x86_64-linux-gnu.so /usr/lib/python3/dist-packages/lxml/etree.cpython-37dm-x86_64-linux-gnu.so /usr/lib/python3/dist-packages/lxml/html/clean.cpython-37dm-x86_64-linux-gnu.so /usr/lib/python3/dist-packages/lxml/html/diff.cpython-37dm-x86_64-linux-gnu.so /usr/lib/python3/dist-packages/lxml/objectify.cpython-37dm-x86_64-linux-gnu.so /usr/lib/python3/dist-packages/lxml/sax.cpython-37dm-x86_64-linux-gnu.so > With an additional change on SOABI (I will open a separated issue for > that), my PR 12946 (no longer link C extensions to libpython) allows > to load lxml built in release mode in a Python built in debug mode! > That's *very* useful for debugging. I show an example of the gdb > experience with a release Python vs debug Python: > > https://bugs.python.org/issue21536#msg340821 > > With a release Python, the basic function "py-bt" works as expected, > but inspecting Python internals doesn't work: most local C variables > are "optimized out" :-( > > With a debug Python, the debugging experience is *much* better: it's > possible to inspect Python internals! > > >> However I still >> would like to be able to have "debug" and "non-debug" builds co-installable at >> the same time. 
> > One option is to keep "d" flag in the SOABI so C extensions get a > different SO filename (no change compared to Python 3.7): > "NAME.cpython-38-x86_64-linux-gnu.so" for release vs > "NAME.cpython-38d-x86_64-linux-gnu.so" for debug, debug gets "d" > suffix ("cpython-38" vs "cpython-38d"). > > *But* modify importlib when Python is compiled in debug mode to look > also to SO without the "d" suffix: first try load > "NAME.cpython-38d-x86_64-linux-gnu.so" (debug: "d" suffix). If there > is no match, look for "NAME.cpython-38-x86_64-linux-gnu.so" (release: > no suffix). Since the ABI is now compatible in Python 3.8, it should > "just work" :-) > > From a Linux packager perspective, nothing changes ;-) We can still > provide "apt install python3-numpy-dbg" (debug) which can is > co-installable with "apt install python3-numpy" (release). > > The benefit is that it will be possible to load C extensions which are > only available in the release flavor with a debug Python ;-) yes, that sounds good. Are there use cases where you only want to load *some* debug extensions, even if more are installed? From J.Demeyer at UGent.be Thu Apr 25 10:42:45 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Thu, 25 Apr 2019 16:42:45 +0200 Subject: [Python-Dev] PEP 580/590 discussion In-Reply-To: References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> Message-ID: <5CC1C765.1020608@UGent.be> On 2019-04-25 00:24, Petr Viktorin wrote: > I believe we can achieve > that by having PEP 590's (o+offset) point not just to function pointer, > but to a {function pointer; flags} struct with flags defined for two > optimizations: What's the rationale for putting the flags in the instance? Do you expect flags to be different between one instance and another instance of the same class? > Both type flags and > nargs bits are very limited resources. Type flags are only a limited resource if you think that all flags ever added to a type must be put into tp_flags. There is nothing wrong with adding new fields tp_extraflags or tp_vectorcall_flags to a type. > What I don't like about it is that it has > the extensions built-in; mandatory for all callers/callees. I don't agree with the above sentence about PEP 580: - callers should use APIs like PyCCall_FastCall() and shouldn't need to worry about the implementation details at all. - callees can opt out of all the extensions by not setting any special flags and setting cr_self to a non-NULL value. When using the flags CCALL_FASTCALL | CCALL_KEYWORDS, then implementing the callee is exactly the same as PEP 590. > As in PEP 590, any class that uses this mechanism shall not be usable as > a base class. Can we please lift this restriction? There is really no reason for it. I'm not aware of any similar restriction anywhere in CPython. Note that allowing subclassing is not the same as inheriting the protocol. As a compromise, we could simply never inherit the protocol. Jeroen. 
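(Side note for readers following the thread: the FASTCALL-with-keywords
layout that both proposals build on can be modelled in pure Python. The
sketch below only illustrates the argument layout, a flat positional array
plus a tuple of keyword names, and is not either PEP's actual C API.)

# tp_call style: a tuple of positional arguments plus a dict of keyword
# arguments. FASTCALL|KEYWORDS style: one flat sequence holding positional
# arguments followed by keyword values, plus a tuple of the keyword names.

def call_tp_call_style(func, args, kwargs):
    return func(*args, **kwargs)

def call_fastcall_style(func, stack, kwnames):
    nkw = len(kwnames)
    npos = len(stack) - nkw
    positional = stack[:npos]
    keywords = dict(zip(kwnames, stack[npos:]))
    return func(*positional, **keywords)

# Both calls are equivalent to sorted([3, 1, 2], reverse=True).
print(call_tp_call_style(sorted, ([3, 1, 2],), {"reverse": True}))
print(call_fastcall_style(sorted, [[3, 1, 2], True], ("reverse",)))
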
From pviktori at redhat.com Thu Apr 25 12:17:53 2019 From: pviktori at redhat.com (Petr Viktorin) Date: Thu, 25 Apr 2019 12:17:53 -0400 Subject: [Python-Dev] PEP 580/590 discussion In-Reply-To: <5CC1C765.1020608@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CC1C765.1020608@UGent.be> Message-ID: <1309c111-359e-1af6-2655-a16ffaad2994@redhat.com> On 4/25/19 10:42 AM, Jeroen Demeyer wrote: > On 2019-04-25 00:24, Petr Viktorin wrote: >> I believe we can achieve >> that by having PEP 590's (o+offset) point not just to function pointer, >> but to a {function pointer; flags} struct with flags defined for two >> optimizations: > > What's the rationale for putting the flags in the instance? Do you > expect flags to be different between one instance and another instance > of the same class? I'm not tied to that idea. If there's a more reasonable place to put the flags, let's go for it, but it's not a big enough issue so it shouldn't complicate the protocol much. Quoting Mark from the other subthread: > Callables are either large or transient. If large, then the extra few bytes makes little difference. If transient then, it matters even less. >> Both type flags and >> nargs bits are very limited resources. > > Type flags are only a limited resource if you think that all flags ever > added to a type must be put into tp_flags. There is nothing wrong with > adding new fields tp_extraflags or tp_vectorcall_flags to a type. Indeed. Extra flags are just what I think PEP 590 is missing. >> What I don't like about it is that it has >> the extensions built-in; mandatory for all callers/callees. > > I don't agree with the above sentence about PEP 580: > - callers should use APIs like PyCCall_FastCall() and shouldn't need to > worry about the implementation details at all. > - callees can opt out of all the extensions by not setting any special > flags and setting cr_self to a non-NULL value. When using the flags > CCALL_FASTCALL | CCALL_KEYWORDS, then implementing the callee is exactly > the same as PEP 590. Imagine an extension author sitting down to read the docs and implement a callable: - PEP 580 introduces 6 CCALL_* combinations: you need to select the best one for your use case. Also, add two structs to the instance & link them via pointers, make sure you support descriptor behavior and the __name__ attribute. (Plus there are features for special purposes: CCALL_DEFARG, CCALL_OBJCLASS, self-slicing, but you can skip that initially.) - My proposal: to the instance, add a function pointer with known signature and flags which you set to zero. Add an offset to the type, and set a type flag. (There are additional possible optimizations, but you can skip them initially.) PEP 580 makes a lot of sense if you read it all, but I fear there'll be very few people who read and understand it. And is not important just for extension authors (admittedly, implementing a callable directly using the C API is often a bad idea). The more people understand the mechanism, the more people can help with further improvements. I don't see the benefit of supporting METH_VARARGS, METH_NOARGS, and METH_O calling conventions (beyond backwards compatibility and comptibility with Python's *args syntax). 
For keywords, I see a benefit in supporting *only one* of kwarg dict or kwarg tuple: if the caller and callee don't agree on which one to use, you need an expensive conversion. If we say tuple is the way, some of them will need to adapt, but within the set of those that do it any caller/callee combination will be fast. (And if tuple only turns out to be wrong choice, adding dict support in the future shouldn't be hard.) That leaves fastcall (with tuple only) as the focus of this PEP, and the other calling conventions essentially as implementation details of builtin functions/methods. >> As in PEP 590, any class that uses this mechanism shall not be usable as >> a base class. > > Can we please lift this restriction? There is really no reason for it. > I'm not aware of any similar restriction anywhere in CPython. Note that > allowing subclassing is not the same as inheriting the protocol. Sure, let's use PEP 580 treatment of inheritance. Even if we don't, I don't think dropping this restriction would be a PEP-level change. It can be dropped as soon as an implementation and tests are ready, and inheritance issues ironed out. But it doesn't need to be in the initial implementation. > As a compromise, we could simply never inherit the protocol. That also sounds reasonable for the initial implementation. From pviktori at redhat.com Thu Apr 25 17:11:36 2019 From: pviktori at redhat.com (Petr Viktorin) Date: Thu, 25 Apr 2019 17:11:36 -0400 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <5CC179F2.5050200@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CAE76B6.6090501@UGent.be> <19993545-3fdf-4bf6-56dd-e926d2032a3a@redhat.com> <5CC179F2.5050200@UGent.be> Message-ID: <49c8c9d4-1537-fc54-1ebd-e908d4a858d7@redhat.com> On 4/25/19 5:12 AM, Jeroen Demeyer wrote: > On 2019-04-25 00:24, Petr Viktorin wrote: >> PEP 590 defines a new simple/fast protocol for its users, and instead of >> making existing complexity faster and easier to use, it's left to be >> deprecated/phased out (or kept in existing classes for backwards >> compatibility). It makes it possible for future code to be >> faster/simpler. > > Can you elaborate on what you mean with this deprecating/phasing out? Kept for backwards compatibility, but not actively recommended or optimized. Perhaps made slower if that would help performance elsewhere. > What's your view on dealing with method classes (not necessarily right > now, but in the future)? Do you think that having separate method > classes like method-wrapper (for example [].__add__) is good or bad? I fully agree with PEP 579's point on complexity: > There are a huge number of classes involved to implement all variations of methods. This is not a problem by itself, but a compounding issue. The main problem is that, currently, you sometimes need to care about this (due to CPython special casing its own classes, without fallback to some public API). Ideally, what matters is the protocols the class implements rather than the class itself. If that is solved, having so many different classes becomes curious but unimportant -- merging them shouldn't be a priority. I'd concentrate on two efforts instead: - Calling should have a fast public API. (That's this PEP.) - Introspection should have well-defined, consistently used public API (but not necessarily fast). 
For introspection, I think the way is implementing the necessary API (e.g. dunder attributes) and changing things like inspect, traceback generation, etc. to use them. CPython's callable classes should stay as internal implementation details. (Specifically: I'm against making them subclassable: allowing subclasses basically makes everything about the superclass an API.) > Since the way how PEP 580 and PEP 590 deal with bound method classes is > very different, I would like to know the roadmap for this. My thoughts are not the roadmap, of course :) Speaking about roadmaps, I often use PEP 579 to check what I'm forgetting. Here are my thoughts on it: ## Naming (The word "built-in" is overused in Python) This is a social/docs problem, and out of scope of the technical efforts. PEPs should always define the terms they use (even in the case where there is an official definition, but it doesn't match popular usage). ## Not extendable As I mentioned above, I'm against opening the callables for subclassing. We should define and use protocols instead. ## cfunctions do not become methods If we were designing Python from scratch, this should have been done differently. Now this is a problem for Cython to solve. CPython should provide the tools to do so. ## Semantics of inspect.isfunction I don't like inspect.isfunction, because "Is it a function?" is almost never what you actually want to ask. I'd like to deprecate it in favor of explicit functions like "Does it have source code?", "Is it callable?", or even "Is it exactly types.FunctionType?". But I'm against changing its behavior -- people are expecting the current answer. ## C functions should have access to the function object That's where my stake in all this is; I want to move on with PEP 573 after 580/590 is sorted out. ## METH_FASTCALL is private and undocumented This is the intersection of PEP 580 and 590. ## Allowing native C arguments This would be a very experimental feature. Argument Clinic itself is not intended for public use, locking its "impl" functions as part of public API is off the table at this point. Cython's cpdef allows this nicely, and CPython's API is full of C functions. That should be good enough good for now. ## Complexity We should simpify, but I think the number of callable classes is not the best metric to focus on. ## PyMethodDef is too limited This is a valid point. But the PyMethodDef array is little more than a shortcut to creating methods directly in a loop. The immediate workaround could be to create a new constructor for methods. Then we can look into expressing the data declaratively again. ## Slot wrappers have no custom documentation I think this can now be done with a new custom slot wrapper class. Perhaps that can be added to CPython when it matures. ## Static methods and class methods should be callable This is a valid, though minor, point. I don't event think it would be a PEP-level change. From stefan_ml at behnel.de Thu Apr 25 17:56:56 2019 From: stefan_ml at behnel.de (Stefan Behnel) Date: Thu, 25 Apr 2019 23:56:56 +0200 Subject: [Python-Dev] Any core dev event plans for EP19? Message-ID: Hi core devs, there are several core dev events happening at the US PyCon this year, so I was wondering if we could organise something similar at EuroPython. Does anyone have any plans or ideas already? And, how many of us are planning to attend EP19 in Basel this year? Unless there's something already going on that I missed, I can (try to) set up a poll on dpo to count the interest and collect ideas. 
Sprints would probably be a straight-forward option, a mentoring session could be another, a language summit or PEP discussion/mentoring round would also be a possibility. More ideas welcome. Stefan From levkivskyi at gmail.com Thu Apr 25 18:05:20 2019 From: levkivskyi at gmail.com (Ivan Levkivskyi) Date: Thu, 25 Apr 2019 15:05:20 -0700 Subject: [Python-Dev] Any core dev event plans for EP19? In-Reply-To: References: Message-ID: Hi, I want to come to EP this year, but didn't register yet, is registration already open? -- Ivan On Thu, 25 Apr 2019 at 15:01, Stefan Behnel wrote: > Hi core devs, > > there are several core dev events happening at the US PyCon this year, so I > was wondering if we could organise something similar at EuroPython. Does > anyone have any plans or ideas already? And, how many of us are planning to > attend EP19 in Basel this year? Unless there's something already going on > that I missed, I can (try to) set up a poll on dpo to count the interest > and collect ideas. > > Sprints would probably be a straight-forward option, a mentoring session > could be another, a language summit or PEP discussion/mentoring round would > also be a possibility. More ideas welcome. > > Stefan > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/levkivskyi%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From berker.peksag at gmail.com Thu Apr 25 19:15:16 2019 From: berker.peksag at gmail.com (=?UTF-8?Q?Berker_Peksa=C4=9F?=) Date: Fri, 26 Apr 2019 02:15:16 +0300 Subject: [Python-Dev] Any core dev event plans for EP19? In-Reply-To: References: Message-ID: On Fri, Apr 26, 2019 at 1:01 AM Stefan Behnel wrote: > there are several core dev events happening at the US PyCon this year, so I > was wondering if we could organise something similar at EuroPython. Does > anyone have any plans or ideas already? And, how many of us are planning to > attend EP19 in Basel this year? Unless there's something already going on > that I missed, I can (try to) set up a poll on dpo to count the interest > and collect ideas. Note that this year's core dev sprint will be held in London. See https://discuss.python.org/t/2019-core-dev-sprint-location-date/489 for the previous discussion. There are only two months between both events, so perhaps we can leave things like discussions on active PEPs to the core dev sprint? (And welcome to the team!) --Berker From stefan_ml at behnel.de Fri Apr 26 01:56:06 2019 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 26 Apr 2019 07:56:06 +0200 Subject: [Python-Dev] Any core dev event plans for EP19? In-Reply-To: References: Message-ID: Berker Peksa? schrieb am 26.04.19 um 01:15: > Note that this year's core dev sprint will be held in London. See > https://discuss.python.org/t/2019-core-dev-sprint-location-date/489 > for the previous discussion. There are only two months between both > events, so perhaps we can leave things like discussions on active PEPs > to the core dev sprint? > (And welcome to the team!) Ah, nice! Thanks for telling me, I wasn't aware of it. London is just a day by train from where I live, I'm totally in for that. 
Stefan From J.Demeyer at UGent.be Fri Apr 26 06:54:35 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Fri, 26 Apr 2019 12:54:35 +0200 Subject: [Python-Dev] PEP 580/590 discussion In-Reply-To: <1309c111-359e-1af6-2655-a16ffaad2994@redhat.com> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CC1C765.1020608@UGent.be> <1309c111-359e-1af6-2655-a16ffaad2994@redhat.com> Message-ID: <5CC2E36B.1080302@UGent.be> Hello, after reading the various comments and thinking about it more, let me propose a real compromise between PEP 580 and PEP 590. My proposal is: take the general framework of PEP 580 but support only a single calling convention like PEP 590. The single calling convention supported would be what is currently specified by the flag combination CCALL_DEFARG|CCALL_FASTCALL|CCALL_KEYWORDS. This way, the flags CCALL_VARARGS, CCALL_FASTCALL, CCALL_O, CCALL_NOARGS, CCALL_KEYWORDS, CCALL_DEFARG can be dropped. This calling convention is very similar to the calling convention of PEP 590, except that: - the callable is replaced by a pointer to a PyCCallDef (the structure from PEP 580, but possibly without cc_parent) - there is a self argument like PEP 580. This implies support for the CCALL_SELFARG flag from PEP 580 and no support for the PY_VECTORCALL_ARGUMENTS_OFFSET trick of PEP 590. Background: I added support for all those calling conventions in PEP 580 because I didn't want to make any compromise regarding performance. When writing PEP 580, I assumed that any kind of performance regression would be a reason to reject PEP 580. However, it seems now that you're willing to accept PEP 590 instead which does introduce performance regressions in certain code paths. So that suggests that we could keep the good parts of PEP 580 but reduce its complexity by having a single calling convention like PEP 590. If you compare this compromise to PEP 590, the main difference is dealing with bound methods. Personally, I really like the idea of having a *single* bound method class which would be used by all kinds of function classes without any loss of performance (not only in CPython itself, but also by Cython and other C extensions). To support that, we need something like the PyCCallRoot structure from PEP 580, together with the special handling for self. About cc_parent and CCALL_OBJCLASS: I prefer to keep that because it allows to merge classes for bare functions (not inside a class) and unbound methods (functions inside a class). Concretely, that could reduce code duplication between builtin_function_or_method and method_descriptor. But I'm also fine with removing cc_parent and CCALL_OBJCLASS. In any case, we can decide that later. What do you think? Jeroen. From tir.karthi at gmail.com Fri Apr 26 06:56:45 2019 From: tir.karthi at gmail.com (Karthikeyan) Date: Fri, 26 Apr 2019 16:26:45 +0530 Subject: [Python-Dev] Any core dev event plans for EP19? In-Reply-To: References: Message-ID: On Fri, Apr 26, 2019 at 3:40 AM Ivan Levkivskyi wrote: > Hi, > > I want to come to EP this year, but didn't register yet, is registration > already open? > > Just to add to this core developers are eligible for free entry to the conference : https://www.europython-society.org/core-grant -- Regards, Karthikeyan S -------------- next part -------------- An HTML attachment was scrubbed... 
From pierre.glaser at inria.fr  Fri Apr 26 08:32:31 2019
From: pierre.glaser at inria.fr (Pierre Glaser)
Date: Fri, 26 Apr 2019 14:32:31 +0200 (CEST)
Subject: [Python-Dev] Increasing the C-optimized pickle extensibility
Message-ID: <2040555306.8345029.1556281951734.JavaMail.zimbra@inria.fr>

Hi All,

We (Antoine Pitrou, Olivier Grisel and myself) spent some effort recently on
enabling pickle extensions to extend the C-optimized Pickler instead of the
pure Python one.

Pickle extensions have a crucial role in many distributed computing libraries:
cloudpickle (https://github.com/cloudpipe/cloudpickle) for example is vendored
in dask, pyspark, ray, and joblib.
Early benchmarks show that relying on the C-optimized pickle yields
significant serialization speed improvements (up to 30x faster).
(draft PR of the CPickler-backed version of cloudpickle:
https://github.com/cloudpipe/cloudpickle/pull/253)

To make extending the C Pickler possible, we are currently moving forward with
a few enhancements to the public pickle API.

* First, we are enabling Pickler subclasses to implement a reducer_override
method, that will have priority over the registered reducers in the
dispatch_table and over the default handling of classes and functions.
(PR link: https://github.com/python/cpython/pull/12499)

* Then, we are adding a new keyword argument to save_reduce called
state_setter (consequently we allow a reducer's return value to have a new,
6th item). This state setter callable is useful to programmatically override
the state-updating behavior of an object, which would otherwise be restricted
to its static ``__setstate__`` method.
(PR link: https://github.com/python/cpython/pull/12588)

The PR review process of these changes is in progress, and anyone is welcome
to chime in and share some thoughts.

The first addition is very non-invasive. We estimated that the second point
did not require introducing a new opcode, as this change could be implemented
as a simple sequence of standard pickle instructions. We therefore think that
it is not necessary to make this change dependent on the new protocol 5
proposed in PEP 574.

The key advantage of not creating a new opcode is that this makes our change
backward-compatible, meaning that 3.8-written pickles will not break because
of our change if read using earlier Python versions.

OTOH, one might argue that a new OPCODE might
* make the code a little bit cleaner
* make it easier to interpret disassembled pickle strings.

If you are interested, here is an example of a disassembled pickle string
using our currently proposed solution:
https://github.com/pierreglaser/cpython/pull/2#issuecomment-486243350

Does anyone have an opinion on this?

Thanks,

Pierre

From guido at python.org  Fri Apr 26 10:49:45 2019
From: guido at python.org (Guido van Rossum)
Date: Fri, 26 Apr 2019 07:49:45 -0700
Subject: [Python-Dev] Steering Council Update for April 2019
Message-ID: 

I've posted an update from the Steering Council to our repo:

https://github.com/python/steering-council/blob/master/updates/2019-04-26_steering-council-update.md

I will also link to this on Discourse (discuss.python.org).

--
--Guido van Rossum (python.org/~guido)
*Pronouns: he/him/his **(why is my pronoun here?)*
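(To make Pierre's proposal above a bit more concrete, here is a rough sketch
of how the two proposed hooks might look from user code. The Point class and
the set_point_state function are hypothetical, and the exact API may still
change while the PRs are under review.)

import io
import pickle

class Point:                       # hypothetical example class
    def __init__(self, x, y):
        self.x, self.y = x, y

def set_point_state(obj, state):   # hypothetical state setter callable
    obj.x, obj.y = state["x"], state["y"]

class MyPickler(pickle.Pickler):
    # Proposed hook: consulted before the dispatch_table and before the
    # default handling of classes and functions; returning NotImplemented
    # falls back to the regular reduction behaviour.
    def reducer_override(self, obj):
        if isinstance(obj, Point):
            # A reduce value with a 6th item: the state setter callable.
            return (Point, (0, 0), {"x": obj.x, "y": obj.y},
                    None, None, set_point_state)
        return NotImplemented

buf = io.BytesIO()
MyPickler(buf).dump(Point(1, 2))
restored = pickle.loads(buf.getvalue())
print(restored.x, restored.y)      # 1 2
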
From J.Demeyer at UGent.be  Fri Apr 26 10:52:18 2019
From: J.Demeyer at UGent.be (Jeroen Demeyer)
Date: Fri, 26 Apr 2019 16:52:18 +0200
Subject: [Python-Dev] PEP 590 discussion
In-Reply-To: <49c8c9d4-1537-fc54-1ebd-e908d4a858d7@redhat.com>
References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CAE76B6.6090501@UGent.be> <19993545-3fdf-4bf6-56dd-e926d2032a3a@redhat.com> <5CC179F2.5050200@UGent.be> <49c8c9d4-1537-fc54-1ebd-e908d4a858d7@redhat.com>
Message-ID: <5CC31B22.7070705@UGent.be>

On 2019-04-25 23:11, Petr Viktorin wrote:
> My thoughts are not the roadmap, of course :)

I asked about methods because we should be aware of the consequences when
choosing between PEP 580 and PEP 590 (or some compromise). There are
basically 3 different ways of dealing with bound methods:

(A) put methods inside the protocol. This is PEP 580 and my 580/590
compromise proposal. The disadvantage here is complexity in the protocol.

(B) don't put methods inside the protocol and use a single generic method
class types.MethodType. This is the status quo for Python functions. It has
the disadvantage of being slightly slower: there is an additional level of
indirection when calling a bound method object.

(C) don't put methods inside the protocol but use multiple method classes,
one for every function class. This is the status quo for functions
implemented in C. This has the disadvantage of code duplication.

I think that the choice between PEP 580 or 590 should be done together with
a choice of one of the above options. For example, I really don't like the
code duplication of (C), so I would prefer PEP 590 with (B) over PEP 590
with (C).

From guido at python.org  Fri Apr 26 11:15:02 2019
From: guido at python.org (Guido van Rossum)
Date: Fri, 26 Apr 2019 08:15:02 -0700
Subject: [Python-Dev] Increasing the C-optimized pickle extensibility
In-Reply-To: <2040555306.8345029.1556281951734.JavaMail.zimbra@inria.fr>
References: <2040555306.8345029.1556281951734.JavaMail.zimbra@inria.fr>
Message-ID: 

I think it's better not to introduce a new opcode, for the reason you
stated -- you don't want your pickles to be unreadable by older Python
versions, if you can help it.

On Fri, Apr 26, 2019 at 5:59 AM Pierre Glaser wrote:
> Hi All,
>
> We (Antoine Pitrou, Olivier Grisel and myself) spent some effort recently on
> enabling pickle extensions to extend the C-optimized Pickler instead of the
> pure Python one.
>
> Pickle extensions have a crucial role in many distributed computing
> libraries: cloudpickle (https://github.com/cloudpipe/cloudpickle) for
> example is vendored in dask, pyspark, ray, and joblib.
> Early benchmarks show that relying on the C-optimized pickle yields
> significant serialization speed improvements (up to 30x faster).
> (draft PR of the CPickler-backed version of cloudpickle:
> https://github.com/cloudpipe/cloudpickle/pull/253)
>
> To make extending the C Pickler possible, we are currently moving forward
> with a few enhancements to the public pickle API.
>
> * First, we are enabling Pickler subclasses to implement a reducer_override
> method, that will have priority over the registered reducers in the
> dispatch_table and over the default handling of classes and functions.
> (PR link: https://github.com/python/cpython/pull/12499)
>
> * Then, we are adding a new keyword argument to save_reduce called
> state_setter.
> (consequently we allow a reducer's return value to have a new, 6th item). > This state setter callable is useful to override programmatically the > state updating > behavior of an object, that would otherwise be restricted to its static > ``__setstate__`` method. > (PR link: https://github.com/python/cpython/pull/12588) > > The PR review process of these changes is in progress, and anyone is > welcomed > to chime in and share some thoughts. > > The first addition is very non-invasive. We estimated that the second > point did > not require introducing a new opcode, as this change could be implemented > as > simple sequence of standard pickle instructions. We therefore think that > it is > not necessary to make this change dependent on the new protocol 5 proposed > in > PEP 574. > > The key advantage in not creating a new opcode that this makes our change > backward-compatible, meaning that 3.8-written pickles will not break > because of > our change if read using earlier Python versions. > > OTOH, one might argue that a new OPCODE might > * make the code a little bit cleaner > * make it easier to interpret disassembled pickle strings. > > If you are interested, here is an example of a disassembled pickle string > using our currently proposed solution: > https://github.com/pierreglaser/cpython/pull/2#issuecomment-486243350 > > Does anyone have an opinion on this? > > Thanks, > > Pierre > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (python.org/~guido) *Pronouns: he/him/his **(why is my pronoun here?)* -------------- next part -------------- An HTML attachment was scrubbed... URL: From ericsnowcurrently at gmail.com Fri Apr 26 11:18:27 2019 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Fri, 26 Apr 2019 09:18:27 -0600 Subject: [Python-Dev] Steering Council Update for April 2019 In-Reply-To: References: Message-ID: Thanks to each of you for the update and the hard work! -eric On Fri, Apr 26, 2019 at 8:52 AM Guido van Rossum wrote: > > I've posted an update from the Steering Council to our repo: > > https://github.com/python/steering-council/blob/master/updates/2019-04-26_steering-council-update.md > > I will also link to this on Discourse (discuss.python.org). > > -- > --Guido van Rossum (python.org/~guido) > Pronouns: he/him/his (why is my pronoun here?) > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ericsnowcurrently%40gmail.com From ericsnowcurrently at gmail.com Fri Apr 26 12:46:31 2019 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Fri, 26 Apr 2019 10:46:31 -0600 Subject: [Python-Dev] Actor Model in Python In-Reply-To: References: Message-ID: On Sun, Mar 31, 2019 at 9:19 AM Aratz Manterola Lasa via Python-Dev wrote: > I was wondering if there was any Python project aiming to implement the actor model for python concurrency. As far as the standard library goes, the explicitly supported concurrency models are: threading, multiprocessing, and async/await. Between these (and a few other parts provided by Python) anyone can build libraries that emulate various other concurrency models. 
Such libraries exist on the cheeseshop (PyPI), though I don't know about packages for the actor model specifically. I recommend searching there for such packages. If you don't find one then perhaps you've found a new project to start. :) Also, I have a proposal [1] for Python 3.9 that provides first class [low level] support for concurrency models like CSP and the actor model. This is done with multiple [mostly] isolated interpreters per process and with basic "channels" for safely passing messages between them. While the proposed library is intended to be useful on its own, it is also intended to provide effective building blocks for library authors. Note that the PEP has not been accepted and is not guaranteed to be accepted (though I'm hopeful). Regardless, consider posting to python-list at python.org for feedback from the broader Python community. This list is specifically used for the development of the Python language itself. Thanks! -eric [1] https://www.python.org/dev/peps/pep-0554/ From status at bugs.python.org Fri Apr 26 14:07:55 2019 From: status at bugs.python.org (Python tracker) Date: Fri, 26 Apr 2019 18:07:55 +0000 (UTC) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20190426180755.412D452BE98@bugs.ams1.psf.io> ACTIVITY SUMMARY (2019-04-19 - 2019-04-26) Python tracker at https://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. Issues counts and deltas: open 7083 (+25) closed 41402 (+39) total 48485 (+64) Open issues with patches: 2821 Issues opened (46) ================== #11871: test_default_timeout() of test_threading.BarrierTests failure: https://bugs.python.org/issue11871 reopened by vstinner #36672: A compiler warning in winreg.SetValue() https://bugs.python.org/issue36672 opened by ZackerySpytz #36673: Comment/PI parsing support for ElementTree https://bugs.python.org/issue36673 opened by scoder #36674: "unittest.TestCase.debug" should honour "skip" (and other test https://bugs.python.org/issue36674 opened by dmaurer #36675: Doctest directives and comments not visible or missing from co https://bugs.python.org/issue36675 opened by steven.daprano #36676: Make ET.XMLParser target aware of namespace prefixes https://bugs.python.org/issue36676 opened by scoder #36678: duplicate method definitions in Lib/test/test_dataclasses.py https://bugs.python.org/issue36678 opened by xdegaye #36679: duplicate method definition in Lib/test/test_genericclass.py https://bugs.python.org/issue36679 opened by xdegaye #36680: duplicate method definition in Lib/test/test_importlib/test_u https://bugs.python.org/issue36680 opened by xdegaye #36681: duplicate method definition in Lib/test/test_logging.py https://bugs.python.org/issue36681 opened by xdegaye #36683: duplicate method definition in Lib/test/test_utf8_mode.py https://bugs.python.org/issue36683 opened by xdegaye #36684: codecov.io code coverage has not updated since 2019-04-13 https://bugs.python.org/issue36684 opened by gphemsley #36685: C implementation of xml.etree.ElementTree does not make a copy https://bugs.python.org/issue36685 opened by gphemsley #36686: Docs: asyncio.loop.subprocess_exec documentation is confusing, https://bugs.python.org/issue36686 opened by sbstp #36688: _dummy_thread lacks an RLock implementaiton https://bugs.python.org/issue36688 opened by xtreak #36689: docs: os.path.commonpath raises ValueError for different drive https://bugs.python.org/issue36689 opened by lazka #36691: SystemExit & sys.exit : Allow both 
exit status and message https://bugs.python.org/issue36691 opened by takluyver #36692: Unexpected stderr output from test_sys_settrace https://bugs.python.org/issue36692 opened by ncoghlan #36694: Excessive memory use or memory fragmentation when unpickling m https://bugs.python.org/issue36694 opened by Ellenbogen #36697: inspect.getclosurevars returns wrong globals dict https://bugs.python.org/issue36697 opened by Noitul #36698: Shell restart when error message contains non-BMP characters https://bugs.python.org/issue36698 opened by TheMathsGod #36699: building for riscv multilib (patch attached) https://bugs.python.org/issue36699 opened by Andreas K. H??ttel #36700: base64 has old references that should be updated https://bugs.python.org/issue36700 opened by paulehoffman #36702: test_dtrace failed https://bugs.python.org/issue36702 opened by sayno996 #36703: [Easy][Windows] test_subprocess: test_close_fds_with_stdio() h https://bugs.python.org/issue36703 opened by vstinner #36704: logging.FileHandler currently hardcodes errors='strict' https://bugs.python.org/issue36704 opened by sourcejedi #36706: Python script on startup stucks at import https://bugs.python.org/issue36706 opened by serge g #36709: Asyncio SSL keep-alive connections raise errors after loop clo https://bugs.python.org/issue36709 opened by tomchristie #36710: Pass _PyRuntimeState as an argument rather than using the _PyR https://bugs.python.org/issue36710 opened by vstinner #36711: duplicate method definition in Lib/email/feedparser.py https://bugs.python.org/issue36711 opened by xdegaye #36712: duplicate method definition in Lib/email/test/test_email_renam https://bugs.python.org/issue36712 opened by xdegaye #36713: uplicate method definition in Lib/ctypes/test/test_unicode.py https://bugs.python.org/issue36713 opened by xdegaye #36714: Tweak doctest 'example' regex to allow a leading ellipsis in ' https://bugs.python.org/issue36714 opened by bskinn #36715: Dictionary initialization https://bugs.python.org/issue36715 opened by Aditya Sane #36716: Embedded Python fails to import module files with version_plat https://bugs.python.org/issue36716 opened by ecosatto #36717: Allow retrieval of return value from the target of a threading https://bugs.python.org/issue36717 opened by Joel Croteau2 #36719: regrtest --findleaks should fail if an uncollectable object is https://bugs.python.org/issue36719 opened by vstinner #36721: Add pkg-config python-3.8-embed https://bugs.python.org/issue36721 opened by vstinner #36723: Unittest Discovery for namespace subpackages dot notation fail https://bugs.python.org/issue36723 opened by mrwaffles #36724: Clear _PyRuntime at exit https://bugs.python.org/issue36724 opened by vstinner #36725: Reference leak regression with Python3.8a3 https://bugs.python.org/issue36725 opened by kayhayen #36728: Remove PyEval_ReInitThreads() from the public C API https://bugs.python.org/issue36728 opened by vstinner #36729: Delete unused text variable on tests https://bugs.python.org/issue36729 opened by eamanu #36730: Change outdated references to macOS https://bugs.python.org/issue36730 opened by Sebastian Bassi #36732: test_asyncio: test_huge_content_recvinto() fails randomly https://bugs.python.org/issue36732 opened by vstinner #36734: Modules/faulthandler.c does not compile on HP-UX due to bpo-35 https://bugs.python.org/issue36734 opened by michael-o Most recent 15 issues with no replies (15) ========================================== #36732: test_asyncio: test_huge_content_recvinto() fails randomly 
https://bugs.python.org/issue36732 #36730: Change outdated references to macOS https://bugs.python.org/issue36730 #36729: Delete unused text variable on tests https://bugs.python.org/issue36729 #36728: Remove PyEval_ReInitThreads() from the public C API https://bugs.python.org/issue36728 #36723: Unittest Discovery for namespace subpackages dot notation fail https://bugs.python.org/issue36723 #36721: Add pkg-config python-3.8-embed https://bugs.python.org/issue36721 #36717: Allow retrieval of return value from the target of a threading https://bugs.python.org/issue36717 #36713: uplicate method definition in Lib/ctypes/test/test_unicode.py https://bugs.python.org/issue36713 #36712: duplicate method definition in Lib/email/test/test_email_renam https://bugs.python.org/issue36712 #36703: [Easy][Windows] test_subprocess: test_close_fds_with_stdio() h https://bugs.python.org/issue36703 #36702: test_dtrace failed https://bugs.python.org/issue36702 #36699: building for riscv multilib (patch attached) https://bugs.python.org/issue36699 #36692: Unexpected stderr output from test_sys_settrace https://bugs.python.org/issue36692 #36676: Make ET.XMLParser target aware of namespace prefixes https://bugs.python.org/issue36676 #36675: Doctest directives and comments not visible or missing from co https://bugs.python.org/issue36675 Most recent 15 issues waiting for review (15) ============================================= #36734: Modules/faulthandler.c does not compile on HP-UX due to bpo-35 https://bugs.python.org/issue36734 #36729: Delete unused text variable on tests https://bugs.python.org/issue36729 #36725: Reference leak regression with Python3.8a3 https://bugs.python.org/issue36725 #36724: Clear _PyRuntime at exit https://bugs.python.org/issue36724 #36719: regrtest --findleaks should fail if an uncollectable object is https://bugs.python.org/issue36719 #36715: Dictionary initialization https://bugs.python.org/issue36715 #36710: Pass _PyRuntimeState as an argument rather than using the _PyR https://bugs.python.org/issue36710 #36688: _dummy_thread lacks an RLock implementaiton https://bugs.python.org/issue36688 #36685: C implementation of xml.etree.ElementTree does not make a copy https://bugs.python.org/issue36685 #36683: duplicate method definition in Lib/test/test_utf8_mode.py https://bugs.python.org/issue36683 #36681: duplicate method definition in Lib/test/test_logging.py https://bugs.python.org/issue36681 #36680: duplicate method definition in Lib/test/test_importlib/test_u https://bugs.python.org/issue36680 #36679: duplicate method definition in Lib/test/test_genericclass.py https://bugs.python.org/issue36679 #36678: duplicate method definitions in Lib/test/test_dataclasses.py https://bugs.python.org/issue36678 #36676: Make ET.XMLParser target aware of namespace prefixes https://bugs.python.org/issue36676 Top 10 most discussed issues (10) ================================= #36710: Pass _PyRuntimeState as an argument rather than using the _PyR https://bugs.python.org/issue36710 12 msgs #36725: Reference leak regression with Python3.8a3 https://bugs.python.org/issue36725 8 msgs #35224: PEP 572: Assignment Expressions https://bugs.python.org/issue35224 6 msgs #36670: test suite broken due to cpu usage feature on win 10/ german https://bugs.python.org/issue36670 6 msgs #16079: list duplicate test names with patchcheck https://bugs.python.org/issue16079 5 msgs #35824: http.cookies._CookiePattern modifying regular expressions https://bugs.python.org/issue35824 5 msgs #36661: Missing dataclass decorator 
import in dataclasses module docs https://bugs.python.org/issue36661 5 msgs #36719: regrtest --findleaks should fail if an uncollectable object is https://bugs.python.org/issue36719 5 msgs #32424: Synchronize copy methods between Python and C implementations https://bugs.python.org/issue32424 4 msgs #36624: cleanup the stdlib and tests with regard to sys.platform usage https://bugs.python.org/issue36624 4 msgs Issues closed (40) ================== #9194: winreg:fixupMultiSZ should check that P < Q in the inner loop https://bugs.python.org/issue9194 closed by steve.dower #17349: wsgiref.simple_server.demo_app is not PEP-3333 compatible https://bugs.python.org/issue17349 closed by berker.peksag #18372: _Pickler_New() doesn't call PyObject_GC_Track(self) https://bugs.python.org/issue18372 closed by inada.naoki #21536: extension built with a shared python cannot be loaded with a s https://bugs.python.org/issue21536 closed by vstinner #23078: unittest.mock patch autospec doesn't work on staticmethods https://bugs.python.org/issue23078 closed by berker.peksag #24011: Add error checks to PyInit_signal() https://bugs.python.org/issue24011 closed by berker.peksag #28552: Distutils fail if sys.executable is None https://bugs.python.org/issue28552 closed by vstinner #30840: Contrary to documentation, relative imports cannot pass throug https://bugs.python.org/issue30840 closed by ncoghlan #35149: pip3 show causing Error for ConfigParaser https://bugs.python.org/issue35149 closed by eryksun #36454: test_time: test_monotonic() failed on AMD64 FreeBSD 10-STABLE https://bugs.python.org/issue36454 closed by vstinner #36465: Make release and debug ABI compatible https://bugs.python.org/issue36465 closed by vstinner #36469: Stuck during interpreter exit, attempting to take the GIL https://bugs.python.org/issue36469 closed by mocramis #36523: Add docstring to io.IOBase.writelines https://bugs.python.org/issue36523 closed by inada.naoki #36546: Add quantiles() to the statistics module https://bugs.python.org/issue36546 closed by rhettinger #36635: Add _testinternalcapi module https://bugs.python.org/issue36635 closed by vstinner #36645: re.sub() library entry does not adequately document surprising https://bugs.python.org/issue36645 closed by berker.peksag #36650: Cached method implementation no longer works on Python 3.7.3 https://bugs.python.org/issue36650 closed by rhettinger #36658: Py_Initialze() throws error 'unable to load the file system en https://bugs.python.org/issue36658 closed by steve.dower #36659: distutils UnixCCompiler: Remove standard library path from rpa https://bugs.python.org/issue36659 closed by vstinner #36668: semaphore_tracker is not reused by child processes https://bugs.python.org/issue36668 closed by pitrou #36669: weakref proxy doesn't support the matrix multiplication operat https://bugs.python.org/issue36669 closed by SilentGhost #36671: str.lower() looses character information when working with UTF https://bugs.python.org/issue36671 closed by SilentGhost #36677: support visual studio multiprocess compile https://bugs.python.org/issue36677 closed by Manjusaka #36682: duplicate method definitions in Lib/test/test_sys_setprofile.p https://bugs.python.org/issue36682 closed by steve.dower #36687: subprocess encoding https://bugs.python.org/issue36687 closed by sbstp #36690: A typing error in demo rpython.py https://bugs.python.org/issue36690 closed by berker.peksag #36693: Reversing large ranges results in a minor type inconsistency https://bugs.python.org/issue36693 closed by 
rhettinger #36695: Change (regression?) in v3.8.0a3 doctest output after capturin https://bugs.python.org/issue36695 closed by bskinn #36696: possible multiple regressions on AIX https://bugs.python.org/issue36696 closed by vstinner #36701: module 'urllib' has no attribute 'request' https://bugs.python.org/issue36701 closed by brett.cannon #36705: Unexpected Behaviour of pprint.pprint https://bugs.python.org/issue36705 closed by fdrake #36707: The "m" ABI flag of SOABI for pymalloc is no longer needed https://bugs.python.org/issue36707 closed by vstinner #36708: can not execute the python + version, to launch python under w https://bugs.python.org/issue36708 closed by brett.cannon #36718: Python 2.7 compilation fails on AMD64 Ubuntu Shared 2.7 buildb https://bugs.python.org/issue36718 closed by vstinner #36720: Correct Should to Must in Definition of object.__len__ https://bugs.python.org/issue36720 closed by brett.cannon #36722: In debug build, load also C extensions compiled in release mod https://bugs.python.org/issue36722 closed by vstinner #36726: Empty select() on windows gives error. https://bugs.python.org/issue36726 closed by martin.panter #36727: python 3.6+ docs use ul tags instead of ol tags https://bugs.python.org/issue36727 closed by eric.smith #36731: Add example to priority queue https://bugs.python.org/issue36731 closed by rhettinger #36733: make regen-all doesn't work in subfolder: No module named Pars https://bugs.python.org/issue36733 closed by vstinner From pwang at anaconda.com Fri Apr 26 17:57:31 2019 From: pwang at anaconda.com (Peter Wang) Date: Fri, 26 Apr 2019 16:57:31 -0500 Subject: [Python-Dev] Increasing the C-optimized pickle extensibility In-Reply-To: References: <2040555306.8345029.1556281951734.JavaMail.zimbra@inria.fr> Message-ID: I strongly second not breaking backwards compatibility and interoperability, especially for persistent artifacts, unless there is a *REALLY* good reason. A potential unintended side effect of such breakages is that it slows down adoption of the new version. -Peter On Fri, Apr 26, 2019 at 10:27 AM Guido van Rossum wrote: > I think it's better not to introduce a new opcode, for the reason you > stated -- you don't want your pickles to be unreadable by older Python > versions, if you can help it. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From wes.turner at gmail.com Fri Apr 26 22:14:16 2019 From: wes.turner at gmail.com (Wes Turner) Date: Fri, 26 Apr 2019 22:14:16 -0400 Subject: [Python-Dev] Actor Model in Python In-Reply-To: References: Message-ID: https://trio.readthedocs.io/en/latest/reference-core.html#synchronizing-and-communicating-between-tasks https://pypi.org/search/?q=Actor+model https://en.wikipedia.org/wiki/Actor_model https://en.wikipedia.org/wiki/Bulk_synchronous_parallel#The_model On Friday, April 26, 2019, Eric Snow wrote: > On Sun, Mar 31, 2019 at 9:19 AM Aratz Manterola Lasa via Python-Dev > wrote: > > I was wondering if there was any Python project aiming to implement the > actor model for python concurrency. > > As far as the standard library goes, the explicitly supported > concurrency models are: threading, multiprocessing, and async/await. > Between these (and a few other parts provided by Python) anyone can > build libraries that emulate various other concurrency models. Such > libraries exist on the cheeseshop (PyPI), though I don't know about > packages for the actor model specifically. I recommend searching > there for such packages. 
If you don't find one then perhaps you've > found a new project to start. :) > > Also, I have a proposal [1] for Python 3.9 that provides first class > [low level] support for concurrency models like CSP and the actor > model. This is done with multiple [mostly] isolated interpreters per > process and with basic "channels" for safely passing messages between > them. While the proposed library is intended to be useful on its own, > it is also intended to provide effective building blocks for library > authors. Note that the PEP has not been accepted and is not > guaranteed to be accepted (though I'm hopeful). > > > Regardless, consider posting to python-list at python.org for feedback > from the broader Python community. This list is specifically used for > the development of the Python language itself. Thanks! Or python-ideas at python.org , though I'm not sure what would be needed from core Python or stdlib to create another actor model abstraction on top of the actual concurrency primitives. Truly functional actors are slow when/because the memory is not shared inter-process https://arrow.apache.org/docs/python/memory.html#referencing-and-allocating-memory https://arrow.apache.org/docs/python/ipc.html#arbitrary-object-serialization https://www.python.org/dev/peps/pep-0554/#interpreter-isolation > > -eric > > > [1] https://www.python.org/dev/peps/pep-0554/ "PEP 554 -- Multiple Interpreters in the Stdlib" https://www.python.org/dev/peps/pep-0554/ Is there / are there Issues, PRs, and Mailing List Threads regarding the status of this proposal? So sorry to interrupt, > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > wes.turner%40gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From armin.rigo at gmail.com Sat Apr 27 01:22:55 2019 From: armin.rigo at gmail.com (Armin Rigo) Date: Sat, 27 Apr 2019 07:22:55 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: <20190424185501.j6kjdbmb4adj4x6x@python.ca> References: <20190424185501.j6kjdbmb4adj4x6x@python.ca> Message-ID: Hi Neil, On Wed, 24 Apr 2019 at 21:17, Neil Schemenauer wrote: > Regarding the Py_TRACE_REFS fields, I think we can't do them without > breaking the ABI because of the following. For GC objects, they are > always allocated by _PyObject_GC_New/_PyObject_GC_NewVar. So, we > can allocate the extra space needed for the GC linked list. For > non-GC objects, that's not the case. Extensions can allocate using > malloc() directly or their own allocator and then pass that memory > to be initialized as a PyObject. > > I think that's a poor design and I think we should try to make slow > progress in fixing it. Such progress needs to start with the global static PyTypeObjects that all extensions define. This is going to be impossible to fix without requiring a big fix in of *all* of them. (Unless of course you mean to still allow them, but then Py_TRACE_REF can't be implemented in a way that doesn't break the ABI.) A bient?t, Armin. On Wed, 24 Apr 2019 at 21:17, Neil Schemenauer wrote: > > On 2019-04-24, Victor Stinner wrote: > > The current blocker issue is that the Py_DEBUG define imply the > > Py_TRACE_REFS define > > I think your change to make Py_TRACE_REFS as separate configure flag > is fine. 
I've used the trace fields to debug occasionally but I > don't use it often enough to need it enabled by Py_DEBUG. > > > Being able to switch between Python in release mode and Python in > > debug mode is a first step. My long term plan would be to better > > separate "Python" from its "runtime". > > Regarding the Py_TRACE_REFS fields, I think we can't do them without > breaking the ABI because of the following. For GC objects, they are > always allocated by _PyObject_GC_New/_PyObject_GC_NewVar. So, we > can allocate the extra space needed for the GC linked list. For > non-GC objects, that's not the case. Extensions can allocate using > malloc() directly or their own allocator and then pass that memory > to be initialized as a PyObject. > > I think that's a poor design and I think we should try to make slow > progress in fixing it. I think non-GC objects should also get > allocated by a Python API. In that case, the Py_TRACE_REFS > functionality could be implemented in a way that doesn't break the > ABI. It also makes the CPython API more friendly for alternative > Python runtimes like PyPy, etc. > > Note that this change would not prevent an extension from allocating > memory with it's own allocator. It just means that memory can't > hold a PyObject. The extension PyObject would need to have a > pointer that points to this externally allocated memory. > > I can imagine there could be some situations when people really > want a PyObject to reside in a certain memory location. E.g. maybe > you have some kind of special shared memory area. In that case, I > think we could have specialized APIs to create PyObjects using a > specialized allocator. Those APIs would not be supported by > some runtimes (e.g. tracing/moving GC for PyObjects) and the APIs > would not be used by most extensions. > > Regards, > > Neil > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/armin.rigo%40gmail.com From mark at hotpy.org Sat Apr 27 05:19:15 2019 From: mark at hotpy.org (Mark Shannon) Date: Sat, 27 Apr 2019 10:19:15 +0100 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <5CB442F3.5060705@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CB442F3.5060705@UGent.be> Message-ID: <988d377d-fcf1-bbcc-152f-3f54d820bdbf@hotpy.org> Hi Jeroen, On 15/04/2019 9:38 am, Jeroen Demeyer wrote: > On 2019-04-14 13:30, Mark Shannon wrote: >> PY_VECTORCALL_ARGUMENTS_OFFSET exists so that callables that make onward >> calls with an additional argument can do so efficiently. The obvious >> example is bound-methods, but classes are at least as important. >> cls(*args) -> cls.new(cls, *args) -> cls.__init__(self, *args) > > But tp_new and tp_init take the "cls" and "self" as separate arguments, > not as part of *args. So I don't see why you need > PY_VECTORCALL_ARGUMENTS_OFFSET for this. Here's some (untested) code for an implementation of vectorcall for object subtypes implemented in Python. It uses PY_VECTORCALL_ARGUMENTS_OFFSET to save memory allocation when calling the __init__ method. https://github.com/python/cpython/commit/9ff46e3ba0747f386f9519933910d63d5caae6ee#diff-c3cf251f16d5a03a9e7d4639f2d6f998R3820 Cheers, Mark. 
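For readers without the C context, the pattern being optimized is roughly the following pure-Python shape (a sketch only, not the actual C code linked above): each onward call needs the caller's argument array with exactly one extra object placed in front of it.

    def construct(cls, args, kwargs):
        # What type.__call__ does, conceptually: two onward calls, each
        # prepending one object to the same arguments -- cls for __new__,
        # then the new instance for __init__.
        self = cls.__new__(cls, *args, **kwargs)
        if isinstance(self, cls):
            self.__init__(*args, **kwargs)
        return self

The PY_VECTORCALL_ARGUMENTS_OFFSET flag is the C-level counterpart: the caller leaves one spare slot in front of the argument array, so a callee that only needs to prepend a single object can reuse the caller's array instead of allocating and copying a new one.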
From mark at hotpy.org Sat Apr 27 05:26:29 2019 From: mark at hotpy.org (Mark Shannon) Date: Sat, 27 Apr 2019 10:26:29 +0100 Subject: [Python-Dev] PEP 580 and PEP 590 comparison. In-Reply-To: <5CB4422D.8030507@UGent.be> References: <5CB4422D.8030507@UGent.be> Message-ID: <04dc374d-a648-a3f0-5a1f-fee4ec2b1b98@hotpy.org> Hi, On 15/04/2019 9:34 am, Jeroen Demeyer wrote: > On 2019-04-14 13:34, Mark Shannon wrote: >> I'll address capability first. > > I don't think that comparing "capability" makes a lot of sense since > neither PEP 580 nor PEP 590 adds any new capabilities to CPython. They > are meant to allow doing things faster, not to allow more things. > > And yes, the C call protocol can be implemented on top of the vectorcall > protocol and conversely, but that doesn't mean much. That isn't true. You cannot implement PEP 590 on top of PEP 580. PEP 580 isn't as general. Specifically, and this is important, PEP 580 cannot implement efficient calls to class objects without breaking the ABI. > >> Now performance. >> >> Currently the PEP 590 implementation is intentionally minimal. It does >> nothing for performance. > > So, we're missing some information here. What kind of performance > improvements are possible with PEP 590 which are not in the reference > implementation? Performance improvements include, but aren't limited to: 1. Much faster calls to common classes: range(), set(), type(), list(), etc. 2. Modifying argument clinic to produce C functions compatible with the vectorcall protocol, allowing the interpreter to call the C function directly, with no additional overhead beyond the vectorcall call sequence. 3. Customization of the C code for function objects depending on the Python code. This would probably be limited to treating closures and generator functions differently, but optimizing other aspects of the Python function call is possible. > >> The benchmark Jeroen provides is a >> micro-benchmark that calls the same functions repeatedly. This is >> trivial and unrealistic. > > Well, it depends what you want to measure... I'm trying to measure > precisely the thing that makes PEP 580 and PEP 590 different from the > status-quo, so in that sense those benchmarks are very relevant. > > I think that the following 3 statements are objectively true: > > (A) Both PEP 580 and PEP 590 add a new calling convention, which is > equally fast as builtin functions (and hence faster than tp_call). Yes > (B) Both PEP 580 and PEP 590 keep roughly the same performance as the > status-quo for existing function/method calls. For the minimal implementation of PEP 590, yes. I would expect a small improvement with an implementation of PEP 590 including optimizations. > (C) While the performance of PEP 580 and PEP 590 is roughly the same, > PEP 580 is slightly faster (based on the reference implementations > linked from PEP 580 and PEP 590) I quite deliberately used the term "minimal" to describe the implementation of PEP 590 you have been using. PEP 590 allows many optimizations. Comparing the performance of the four hundred line minimal diff for PEP 590 with the full four thousand line diff for PEP 580 is misleading. > > Two caveats concerning (C): > - the difference may be too small to matter. Relatively, it's a few > percent of the call time but in absolute numbers, it's less than 10 CPU > clock cycles. > - there might be possible improvements to the reference implementation > of either PEP 580/PEP 590. I don't expect big differences though.
> >> To repeat an example >> from an earlier email, which may have been overlooked, this code reduces >> the time to create ranges and small lists by about 30% > > That's just a special case of the general fact (A) above and using the > new calling convention for "type". It's an argument in favor of both PEP > 580 and PEP 590, not for PEP 590 specifically. It very much is an argument in favor of PEP 590. PEP 580 cannot do this. Cheers, Mark. From mark at hotpy.org Sat Apr 27 07:32:54 2019 From: mark at hotpy.org (Mark Shannon) Date: Sat, 27 Apr 2019 12:32:54 +0100 Subject: [Python-Dev] PEP 580/590 discussion In-Reply-To: References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> Message-ID: <4da05381-9cce-2e41-f76f-cc33f359268c@hotpy.org> Hi Petr, On 24/04/2019 11:24 pm, Petr Viktorin wrote: > So, I spent another day pondering the PEPs. > > I love PEP 590's simplicity and PEP 580's extensibility. As I hinted > before, I hope they can be combined, and I believe we can achieve > that by having PEP 590's (o+offset) point not just to function pointer, > but to a {function pointer; flags} struct with flags defined for two > optimizations: > - "Method-like", i.e. compatible with LOAD_METHOD/CALL_METHOD. > - "Argument offsetting request", allowing PEP 590's > PY_VECTORCALL_ARGUMENTS_OFFSET optimization. A big problem with adding another field to the structure is that it prevents classes from implementing vectorcall. A 30% reduction in the time to create ranges, small lists and sets and to call type(x) is easily worth a single tp_flag, IMO. As an aside, there are currently over 10 spare flags. As long as we don't consume more than one a year, we have over a decade to make tp_flags a uint64_t. It already consumes 64 bits on any 64 bit machine, due to the struct layout. As I've said before, PEP 590 is universal and capable of supporting an implementation of PEP 580 on top of it. Therefore, adding any flags or fields from PEP 580 to PEP 590 will not increase its capability. Since any extra fields will require at least as many memory accesses as before, it will not improve performance and by restricting layout may decrease it. > > This would mean one basic call signature (today's METH_FASTCALL | > METH_KEYWORD), with individual optimizations available if both the > caller and callee support them. > That would prevent the code having access to the callable object. That access is a fundamental part of both PEP 580 and PEP 590 and the key motivating factor for both. > > > In case you want to know my thoughts or details, let me indulge in some > detailed comparisons and commentary that led to this. > I also give a more detailed proposal below. > Keep in mind I wrote this before I distilled it to the paragraph above, > and though the distillation is written as a diff to PEP 590, I still > think of this as merging both PEPs. > > > PEP 580 tries hard to work with existing call conventions (like METH_O, > METH_VARARGS), making them fast. > PEP 590 just defines a new convention. Basically, any callable that > wants performance improvements must switch to METH_VECTORCALL (fastcall). > I believe PEP 590's approach is OK. To stay as performant as possible, C > extension authors will need to adapt their code regularly. If they > don't, no harm -- the code will still work as before, and will still be > about as fast as it was before.
As I see it, authors of C extensions have five options with PEP 590. Option 4, do nothing, is the recommended option :) 1. Use the PyMethodDef protocol, it will work exactly the same as before. It's already fairly quick in most cases. 2. Use Cython and let Cython take care of handling the vectorcall interface. 3. Use Argument Clinic, and let Argument Clinic take care of handling the vectorcall interface. 4. Do nothing. This the same as 1-3 above depending on what you were already doing. 5. Implement the vectorcall call directly. This might be a bit quicker than the above, but probably not enough to be worth it, unless you are implementing numpy or something like that. > In exchange for this, Python (and Cython, etc.) can focus on optimizing > one calling convention, rather than a variety, each with its own > advantages and drawbacks. > > Extending PEP 580 to support a new calling convention will involve > defining a new CCALL_* constant, and adding to existing dispatch code. > Extending PEP 590 to support a new calling convention will most likely > require a new type flag, and either changing the vectorcall semantics or > adding a new pointer. > To be a bit more concrete, I think of possible extensions to PEP 590 as > things like: > - Accepting a kwarg dict directly, without copying the items to > tuple/array (as in PEP 580's CCALL_VARARGS|CCALL_KEYWORDS) > - Prepending more than one positional argument, or appending positional > arguments > - When an optimization like LOAD_METHOD/CALL_METHOD turns out to no > longer be relevant, removing it to simplify/speed up code. > I expect we'll later find out that something along these lines might > improve performance. PEP 590 would make it hard to experiment. > > I mentally split PEP 590 into two pieces: formalizing fastcall, plus one > major "extension" -- making bound methods fast. Not just bound methods, any callable that adds an extra argument before dispatching to another callable. This includes builtin-methods, classes and a few others. Setting the Py_TPFLAGS_METHOD_DESCRIPTOR flag states the behaviour of the object when used as a descriptor. It is up to the implementation to use that information how it likes. If LOAD_METHOD/CALL_METHOD gets replaced, then the new implementation can still use this information. > When seen this way, this "extension" is quite heavy: it adds an > additional type flag, Py_TPFLAGS_METHOD_DESCRIPTOR, and uses a bit in > the "Py_ssize_t nargs" argument as additional flag. Both type flags and > nargs bits are very limited resources. If I was sure vectorcall is the > final best implementation we'll have, I'd go and approve it ? but I > think we still need room for experimentation, in the form of more such > extensions. > PEP 580, with its collection of per-instance data and flags, is > definitely more extensible. What I don't like about it is that it has > the extensions built-in; mandatory for all callers/callees. > > PEP 580 adds a common data struct to callable instances. Currently these > are all data bound methods want to use (cc_flags, cc_func, cc_parent, > cr_self). Various flags are consulted in order to deliver the needed > info to the underlying function. > PEP 590 lets the callable object store data it needs independently. It > provides a clever mechanism for pre-allocating space for bound methods' > prepended "self" argument, so data can be provided cheaply, though it's > still done by the callable itself. > Callables that would need to e.g. 
prepend more than one argument won't > be able to use this mechanism, but y'all convinced me that is not worth > optimizing for. > > PEP 580's goal seems to be that making a callable behave like a Python > function/method is just a matter of the right set of flags. Jeroen > called this "complexity in the protocol". > PEP 590, on the other hand, leaves much to individual callable types. > This is "complexity in the users of the protocol". > I now don't see a problem with PEP 590's approach. Not all users will > need the complexity. We need to give CPython and Cython the tools to > make implementing "def"-like functions possible (and fast), but if other > extensions need to match the behavior of Python functions, they should > just use Cython. Emulating Python functions is a special-enough use case > that it doesn't justify complicating the protocol, and the same goes for > implementing Python's built-in functions (with all their historical > baggage). > > > > My more full proposal for a compromise between PEP 580 and 590 would go > something like below. > > The type flag (Py_TPFLAGS_HAVE_VECTORCALL/Py_TPFLAGS_HAVE_CCALL) and > offset (tp_vectorcall_offset/tp_ccalloffset; in tp_print's place) stay. > > The offset identifies a per-instance structure with two fields: > - Function pointer (with the vectorcall signature) > - Flags > Storing any other per-instance data (like PEP 580's cr_self/cc_parent) > is the responsibility of each callable type. > > Two flags are defined initially: > 1. "Method-like" (like Py_TPFLAGS_METHOD_DESCRIPTOR in PEP 580, or > non-NULL cr_self in PEP 580). Having the flag here instead of a type > flag will prevent tp_call-only callables from taking advantage of > LOAD_METHOD/CALL_METHOD optimisation, but I think that's OK. > > 2. Request to reserve space for one argument before the args array, as > in PEP 590's argument offsetting. If the flag is missing, nargs may not > include PY_VECTORCALL_ARGUMENTS_OFFSET. A mechanism incompatible with > offsetting may use the bit for another purpose. > > Both flags may be simply ignored by the caller (or not be set by the > callee in the first place), reverting to a more straightforward (but > less performant) code path. This should also be the case for any flags > added in the future. > Note how without these flags, the protocol (and its documentation) will > be extremely simple. > This mechanism would work with my examples of possible future extensions: > - "kwarg dict": A flag would enable the `kwnames` argument to be a dict > instead of a tuple. > - prepending/appending several positional arguments: The callable's > request for how much space to allocate stored right after the {func; > flags} struct. As in argument offsetting, a bit in nargs would indicate > that the request was honored. (If this was made incompatible with > one-arg offsetting, it could reuse the bit.) > - removing an optimization: CPython would simply stop using an > optimizations (but not remove the flag). Extensions could continue to > use the optimization between themselves. This seems a lot more complex than the caller setting a bit to tell the callee whether it has allocated extra space. > > As in PEP 590, any class that uses this mechanism shall not be usable as > a base class. This will simplify implementation and tests, but hopefully > the limitation will be removed in the future. (Maybe even in the initial > implementation.) > > The METH_VECTORCALL (aka CCALL_FASTCALL|CCALL_KEYWORDS) calling > convention is added to the public API. 
The other calling conventions > (PEP 580's CCALL_O, CCALL_NOARGS, CCALL_VARARGS, CCALL_KEYWORDS, > CCALL_FASTCALL, CCALL_DEFARG) as well as argument type checking > (CCALL_OBJCLASS) and self slicing (CCALL_SELFARG) are left up to the > callable. > > No equivalent of PEP 580's restrictions on the __name__ attribute. In my > opinion, the PyEval_GetFuncName function should just be deprecated in > favor of getting the __name__ attribute and checking if it's a string. > It would be possible to add a public helper that returns a proper > reference, but that doesn't seem worth it. Either way, I consider this > out of scope of this PEP. > > No equivalent of PEP 580's PyCCall_GenericGetParent and > PyCCall_GenericGetQualname either -- again, if needed, they should be > retrieved as normal attributes. As I see it, the operation doesn't need > to be particularly fast. > > No equivalent of PEP 580's PyCCall_Call, and no support for dict in > PyCCall_FastCall's kwds argument. To be fast, extensions should avoid > passing kwargs in a dict. Let's see how far that takes us. (FWIW, this > also avoids subtle issues with dict mutability.) > > Profiling stays as in PEP 580: only exact function types generate the > events. > > As in PEP 580, PyCFunction_GetFlags and PyCFunction_GET_FLAGS are > deprecated > > As in PEP 580, nothing is added to the stable ABI > > > Does that sound reasonable? From mark at hotpy.org Sat Apr 27 08:07:25 2019 From: mark at hotpy.org (Mark Shannon) Date: Sat, 27 Apr 2019 13:07:25 +0100 Subject: [Python-Dev] PEP 580/590 discussion In-Reply-To: <5CC1C765.1020608@UGent.be> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CC1C765.1020608@UGent.be> Message-ID: Hi Jeroen, On 25/04/2019 3:42 pm, Jeroen Demeyer wrote: > On 2019-04-25 00:24, Petr Viktorin wrote: >> I believe we can achieve >> that by having PEP 590's (o+offset) point not just to function pointer, >> but to a {function pointer; flags} struct with flags defined for two >> optimizations: > > What's the rationale for putting the flags in the instance? Do you > expect flags to be different between one instance and another instance > of the same class? > >> Both type flags and >> nargs bits are very limited resources. > > Type flags are only a limited resource if you think that all flags ever > added to a type must be put into tp_flags. There is nothing wrong with > adding new fields tp_extraflags or tp_vectorcall_flags to a type. > >> What I don't like about it is that it has >> the extensions built-in; mandatory for all callers/callees. > > I don't agree with the above sentence about PEP 580: > - callers should use APIs like PyCCall_FastCall() and shouldn't need to > worry about the implementation details at all. > - callees can opt out of all the extensions by not setting any special > flags and setting cr_self to a non-NULL value. When using the flags > CCALL_FASTCALL | CCALL_KEYWORDS, then implementing the callee is exactly > the same as PEP 590. > >> As in PEP 590, any class that uses this mechanism shall not be usable as >> a base class. > > Can we please lift this restriction? There is really no reason for it. > I'm not aware of any similar restriction anywhere in CPython. Note that > allowing subclassing is not the same as inheriting the protocol. As a > compromise, we could simply never inherit the protocol. 
AFAICT, any limitations on subclassing exist solely to prevent tp_call and the PEP 580/590 function pointer being in conflict. This limitation is inherent and the same for both PEPs. Do you agree? Let us consider a class C that sets the Py_TPFLAGS_HAVE_CCALL/Py_TPFLAGS_HAVE_VECTORCALL flag. It will set the function pointer in a new instance, C(), when the object is created. If we create a new class D: class D(C): def __call__(self, ...): ... and then create an instance `d = D()` then calling d will have two contradictory behaviours; the one installed by C in the function pointer and the one specified by D.__call__ We can ensure correct behaviour by setting the function pointer to NULL or a forwarding function (depending on the implementation) if __call__ has been overridden. This would be enforced at class creation/readying time. Cheers, Mark. From J.Demeyer at UGent.be Sat Apr 27 08:27:53 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Sat, 27 Apr 2019 14:27:53 +0200 Subject: [Python-Dev] PEP 580/590 discussion In-Reply-To: References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CC1C765.1020608@UGent.be> Message-ID: <5CC44AC9.9020501@UGent.be> On 2019-04-27 14:07, Mark Shannon wrote: > class D(C): > def __call__(self, ...): > ... > > and then create an instance `d = D()` then calling d will have two > contradictory behaviours; the one installed by C in the function pointer > and the one specified by D.__call__ It's true that the function pointer in D will be wrong but it's also irrelevant since the function pointer won't be used: class D won't have the flag Py_TPFLAGS_HAVE_CCALL/Py_TPFLAGS_HAVE_VECTORCALL set. From stefan_ml at behnel.de Sat Apr 27 10:44:29 2019 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sat, 27 Apr 2019 16:44:29 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: <6a395e77-d681-69f8-ba32-f6fd47f261cb@ubuntu.com> References: <7aad8d69-2d66-54a3-26a4-20dcc5150f66@ubuntu.com> <6a395e77-d681-69f8-ba32-f6fd47f261cb@ubuntu.com> Message-ID: Matthias Klose schrieb am 25.04.19 um 13:48: > Are there use cases where you only want to load *some* > debug extensions, even if more are installed? Not sure if there are _important_ use cases (that could justify certain design decisions), but I can certainly imagine using a non-debug (and therefore faster) Pandas or NumPy for preparing some data that I need to debug my own code. More generally, whenever I can avoid using a debug version of a *dependency* that I don't need to include in my debug analysis, it's probably a good idea to not use the debug version. Even given venvs and virtualisation techniques, it would probably be nice if users could install debug+nondebug versions of libraries once and then import the right one at need, rather than having to set up a new environment (while they're on a train in the middle of nowhere without fast access to PyPI).
Stefan From mark at hotpy.org Sat Apr 27 05:47:46 2019 From: mark at hotpy.org (Mark Shannon) Date: Sat, 27 Apr 2019 10:47:46 +0100 Subject: [Python-Dev] PEP 590 discussion In-Reply-To: <19993545-3fdf-4bf6-56dd-e926d2032a3a@redhat.com> References: <6c8356e3-f9b2-c39e-63c4-17f146d326b7@hotpy.org> <15b8a3d7-00ed-a5eb-475c-a3adee671b5f@hotpy.org> <5C9FEF82.50207@UGent.be> <421f8182-4bc8-b8cf-82d6-ca4a4fbd2013@hotpy.org> <50d675b4-839c-6502-ad1a-a33ea9330000@redhat.com> <5CAE76B6.6090501@UGent.be> <19993545-3fdf-4bf6-56dd-e926d2032a3a@redhat.com> Message-ID: <73165133-21ec-b030-cbd8-0f57b85c3873@hotpy.org> Hi Petr, On 24/04/2019 11:24 pm, Petr Viktorin wrote: > On 4/10/19 7:05 PM, Jeroen Demeyer wrote: >> On 2019-04-10 18:25, Petr Viktorin wrote: >>> Hello! >>> I've had time for a more thorough reading of PEP 590 and the reference >>> implementation. Thank you for the work! >> >> And thank you for the review! >> >>> I'd now describe the fundamental >>> difference between PEP 580 and PEP 590 as: >>> - PEP 580 tries to optimize all existing calling conventions >>> - PEP 590 tries to optimize (and expose) the most general calling >>> convention (i.e. fastcall) >> >> And PEP 580 has better performance overall, even for METH_FASTCALL. >> See this thread: >> https://mail.python.org/pipermail/python-dev/2019-April/156954.html >> >> Since these PEPs are all about performance, I consider this a very >> relevant argument in favor of PEP 580. > > All about performance as well as simplicity, correctness, testability, > teachability... And PEP 580 touches some introspection :) > >>> PEP 580 also does a number of other things, as listed in PEP 579. But I >>> think PEP 590 does not block future PEPs for the other items. >>> On the other hand, PEP 580 has a much more mature implementation -- and >>> that's where it picked up real-world complexity. >> About complexity, please read what I wrote in >> https://mail.python.org/pipermail/python-dev/2019-March/156853.html >> >> I claim that the complexity in the protocol of PEP 580 is a good >> thing, as it removes complexity from other places, in particular from >> the users of the protocol (better have a complex protocol that's >> simple to use, rather than a simple protocol that's complex to use). > > I think we're talking past each other. I see now it as: > > PEP 580 takes existing complexity and makes it available to all users, > in a simpler way. It makes existing code faster. > > PEP 590 defines a new simple/fast protocol for its users, and instead of > making existing complexity faster and easier to use, it's left to be > deprecated/phased out (or kept in existing classes for backwards > compatibility). It makes it possible for future code to be faster/simpler. > > I think things should be simple by default, but if people want some > extra performance, they can opt in to some extra complexity. > > >> As a more concrete example of the simplicity that PEP 580 could bring, >> CPython currently has 2 classes for bound methods implemented in C: >> - "builtin_function_or_method" for normal C methods >> - "method-descriptor" for slot wrappers like __eq__ or __add__ >> >> With PEP 590, these classes would need to stay separate to get maximal >> performance. With PEP 580, just one class for bound methods would be >> sufficient and there wouldn't be any performance loss. And this >> extends to custom third-party function/method classes, for example as >> implemented by Cython. > > Yet, for backwards compatibility reasons, we can't merge the classes. 
> Also, I think CPython and Cython are exactly the users that can trade > some extra complexity for better performance. > >>> Jeroen's analysis from >>> https://mail.python.org/pipermail/python-dev/2018-July/154238.html seems >>> to miss a step at the top: >>> >>> a. CALL_FUNCTION* / CALL_METHOD opcode >>> ?????? calls >>> b. _PyObject_FastCallKeywords() >>> ?????? which calls >>> c. _PyCFunction_FastCallKeywords() >>> ?????? which calls >>> d. _PyMethodDef_RawFastCallKeywords() >>> ?????? which calls >>> e. the actual C function (*ml_meth)() >>> >>> I think it's more useful to say that both PEPs bridge a->e (via >>> _Py_VectorCall or PyCCall_Call). >> >> Not quite. For a builtin_function_or_method, we have with PEP 580: >> >> a. call_function() >> ???? calls >> d. PyCCall_FastCall >> ???? which calls >> e. the actual C function >> >> and with PEP 590 it's more like: >> >> a. call_function() >> ???? calls >> c. _PyCFunction_FastCallKeywords >> ???? which calls >> d. _PyMethodDef_RawFastCallKeywords >> ???? which calls >> e. the actual C function >> >> Level c. above is the vectorcall wrapper, which is a level that PEP >> 580 doesn't have. > > PEP 580 optimizes all the code paths, where PEP 590 optimizes the fast > path, and makes sure most/all use cases can use (or switch to) the fast > path. > Both fast paths are fast: bridging a->e using zero-copy arg passing with > some C calls and flag checks. > > The PEP 580 approach is faster; PEP 590's is simpler. Why do you say that PEP 580's approach is faster? There is no evidence for this. The only evidence so far is a couple of contrived benchmarks. Jeroen's showed a ~1% speedup for PEP 580 and mine showed a ~30% speed up for PEP 590. This clearly shows that I am better and coming up with contrived benchmarks :) PEP 590 was chosen as the fastest protocol I could come up with that was fully general, and wasn't so complex as to be unusable. > > >>> Jeroen, is there something in PEPs 579/580 that PEP 590 blocks, or >>> should address? >> >> Well, PEP 580 is an extensible protocol while PEP 590 is not. But, >> PyTypeObject is extensible, so even with PEP 590 one can always extend >> that (for example, PEP 590 uses a type flag >> Py_TPFLAGS_METHOD_DESCRIPTOR where PEP 580 instead uses the structs >> for the C call protocol). But I guess that extending PyTypeObject will >> be harder to justify (say, in a future PEP) than extending the C call >> protocol. > > That's a good point. Saying that PEP 590 is not extensible is true, but misleading. PEP 590 is fully universal, it supports callables that can do anything with anything. There is no need for it to be extended because it already supports any possible behaviour. Cheers, Mark. From paul at ganssle.io Sat Apr 27 12:39:57 2019 From: paul at ganssle.io (Paul Ganssle) Date: Sat, 27 Apr 2019 12:39:57 -0400 Subject: [Python-Dev] datetime.fromisocalendar Message-ID: <63756c7f-e07e-3556-7ab0-47c3fc3072de@ganssle.io> Greetings, Some time ago, I proposed adding a `.fromisocalendar` alternate constructor to `datetime` (bpo-36004 ), with a corresponding implementation (PR #11888 ). I advertised it on datetime-SIG some time ago but haven't seen much discussion there, so I'd like to bring it to python-dev's attention as we near the cut-off for new Python 3.8 features. 
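The behaviour I have in mind is simply the inverse of the existing isocalendar() method. A rough sketch of the intended round-trip, assuming the constructor takes the same (year, week, day) triple that isocalendar() returns (the exact signature is still open):

    from datetime import datetime

    dt = datetime(2019, 4, 27)                       # midnight, for a clean round-trip
    iso_year, iso_week, iso_day = dt.isocalendar()   # (2019, 17, 6)
    # proposed inverse:
    assert datetime.fromisocalendar(iso_year, iso_week, iso_day) == dt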
Other than the fact that I've needed this functionality in the past, I also think a good general principle for the datetime module is that when a class (time, date, datetime) has a "serialization" method (.strftime, .timestamp, .isoformat, .isocalendar, etc), there should be a corresponding /deserialization/ method (.strptime, .fromtimestamp, .fromisoformat) that constructs a datetime from the output. Now that `fromisoformat` was introduced in Python 3.7, I think `isocalendar` is the only remaining method without an inverse. Do people agree with this principle? Should we add the `fromisocalendar` method? Thanks, Paul -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: OpenPGP digital signature URL: From jp at nexedi.com Sat Apr 27 12:04:38 2019 From: jp at nexedi.com (Jean-Paul Smets) Date: Sat, 27 Apr 2019 16:04:38 +0000 Subject: [Python-Dev] Actor Model in Python In-Reply-To: References: Message-ID: Hello, ERP5 (https://erp5.nexedi.com) implements the "Actalk" actor model in a library called "CMFActivity". Processing (ex. financial transactions, machine learning) can be distributed on a cluster of servers. Actalk is interesting because it provides a way to unify and combine multiple OOCP models within the same runtime, rather than being forced to use only one. * Actalk: http://www-poleia.lip6.fr/~briot/actalk/papers/PAPERS.html * CMFActivity: https://lab.nexedi.com/nexedi/erp5/tree/master/product/CMFActivity Go channels concurrency model is ported to python: https://pypi.org/project/pygolang/ Nexedi has plans to experiment a port of Actalk to Cython with GIL-less concurrency. Regards, JPS. Le 2019-03-31 12:30, Aratz Manterola Lasa via Python-Dev a écrit : > Hello, > I was wondering if there was any Python project aiming to implement the actor model for python concurrency. Does anyone know it? > Aratz. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/nicolas%40nexedi.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris at withers.org Sat Apr 27 13:30:10 2019 From: chris at withers.org (Chris Withers) Date: Sat, 27 Apr 2019 18:30:10 +0100 Subject: [Python-Dev] git history conundrum Message-ID: <3140efd6-54ff-40d0-5906-fbb19e99ea76@withers.org> Hi All, I'm in the process of bringing the mock backport up to date, but this has got me stumped: $ git log --oneline --no-merges 5943ea76d529f9ea18c73a61e10c6f53bdcc864f.. -- Lib/unittest/mock.py Lib/unittest/test/testmock/ | tail 362f058a89 Issue #28735: Fixed the comparison of mock.MagickMock with mock.ANY. d9c956fb23 Issue #20804: The unittest.mock.sentinel attributes now preserve their identity when they are copied or pickled. 84b6fb0eea Fix unittest.mock._Call: don't ignore name 161a4dd495 Issue #28919: Simplify _copy_func_details() in unittest.mock ac5084b6c7 Fixes issue28380: unittest.mock Mock autospec functions now properly support assert_called, assert_not_called, and assert_called_once. 0be894b2f6 Issue #27895: Spelling fixes (Contributed by Ville Skyttä). 15f44ab043 Issue #27895: Spelling fixes (Contributed by Ville Skyttä). d4583d7fea Issue #26750: use inspect.isdatadescriptor instead of our own _is_data_descriptor().
9854789efe Issue #26750: unittest.mock.create_autospec() now works properly for subclasses of property() and other data descriptors. 204bf0b9ae English spelling and grammar fixes Right, so I've merged up to 15f44ab043, what comes next? $ git log --oneline? --no-merges 15f44ab043.. -- Lib/unittest/mock.py Lib/unittest/test/testmock/ | tail -n 3 161a4dd495 Issue #28919: Simplify _copy_func_details() in unittest.mock ac5084b6c7 Fixes issue28380: unittest.mock Mock autospec functions now properly support assert_called, assert_not_called, and assert_called_once. 0be894b2f6 Issue #27895:? Spelling fixes (Contributed by Ville Skytt?). Okay, no idea why 0be894b2f6 is there, appears to be a totally identical commit to 15f44ab043, so let's skip it: $ git log --oneline? --no-merges 0be894b2f6.. -- Lib/unittest/mock.py Lib/unittest/test/testmock/ | tail -n 3 161a4dd495 Issue #28919: Simplify _copy_func_details() in unittest.mock ac5084b6c7 Fixes issue28380: unittest.mock Mock autospec functions now properly support assert_called, assert_not_called, and assert_called_once. 15f44ab043 Issue #27895:? Spelling fixes (Contributed by Ville Skytt?). Wat?! Why is 15f44ab043 showing up again?! What's the git subtlety I'm missing here? Chris From jp at nexedi.com Sat Apr 27 12:02:02 2019 From: jp at nexedi.com (Jean-Paul Smets) Date: Sat, 27 Apr 2019 16:02:02 +0000 Subject: [Python-Dev] Actor Model in Python In-Reply-To: References: Message-ID: <61d5f630775e33d2e6d790e2398af069@nexedi.com> Hello, ERP5 https://erp5.nexedi.com) implements the "Actalk" actor model in a library called "CMFActivity". Processing (ex. financial transactions, machine learning) can be distributed on a cluster of servers. Actalk is interesting because it provides a way to unify and combine multiple OOCP models within the same runtime, rather than being forced to use only one. * Actalk: http://www-poleia.lip6.fr/~briot/actalk/papers/PAPERS.html * CMFActivity: https://lab.nexedi.com/nexedi/erp5/tree/master/product/CMFActivity Go channels concurrency model is ported to python: https://pypi.org/project/pygolang/ Nexedi has plans to experiment a port of Actalk to Cython with GIL-less concurrency. Regards, JPS. Le 2019-03-31 12:30, Aratz Manterola Lasa via Python-Dev a ?crit : > Hello, > I was wondering if there was any Python project aiming to implement the actor model for python concurrency. ?Does anyone know it? > Aratz. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/nicolas%40nexedi.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Sat Apr 27 15:24:10 2019 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 27 Apr 2019 12:24:10 -0700 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: <20190424185501.j6kjdbmb4adj4x6x@python.ca> Message-ID: On Sat, Apr 27, 2019, 04:27 Armin Rigo wrote: > Hi Neil, > > On Wed, 24 Apr 2019 at 21:17, Neil Schemenauer > wrote: > > Regarding the Py_TRACE_REFS fields, I think we can't do them without > > breaking the ABI because of the following. For GC objects, they are > > always allocated by _PyObject_GC_New/_PyObject_GC_NewVar. So, we > > can allocate the extra space needed for the GC linked list. For > > non-GC objects, that's not the case. 
Extensions can allocate using > > malloc() directly or their own allocator and then pass that memory > > to be initialized as a PyObject. > > > > I think that's a poor design and I think we should try to make slow > > progress in fixing it. > > Such progress needs to start with the global static PyTypeObjects that > all extensions define. This is going to be impossible to fix without > requiring a big fix in of *all* of them. (Unless of course you mean > to still allow them, but then Py_TRACE_REF can't be implemented in a > way that doesn't break the ABI.) > For Py_TRACE_REFS specifically, IIUC the only goal is to be able to produce a list of all live objects on demand. If that's the goal, then static type objects aren't a huge deal. You can't add extra data into the type objects themselves, but since there's a fixed set of them and they're immortal, you can just build a static list of all of them in PyType_Ready. -n > -------------- next part -------------- An HTML attachment was scrubbed... URL: From J.Demeyer at UGent.be Sat Apr 27 15:40:18 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Sat, 27 Apr 2019 21:40:18 +0200 Subject: [Python-Dev] PEP 580 and PEP 590 comparison. In-Reply-To: <04dc374d-a648-a3f0-5a1f-fee4ec2b1b98@hotpy.org> References: <5CB4422D.8030507@UGent.be> <04dc374d-a648-a3f0-5a1f-fee4ec2b1b98@hotpy.org> Message-ID: <5CC4B022.1080802@UGent.be> On 2019-04-27 11:26, Mark Shannon wrote: > Specifically, and this is important, PEP 580 cannot implement efficient > calls to class objects without breaking the ABI. First of all, the layout of PyTypeObject isn't actually part of the stable ABI (see PEP 384). So, we wouldn't be breaking anything by extending PyTypeObject. Second, even if you don't buy this argument and you really think that we should guarantee ABI-compatibility, we can still solve that in PEP 580 by special-casing instances of "type". Sure, that's an annoyance but it's not a fundamental obstruction. Jeroen. From J.Demeyer at UGent.be Sat Apr 27 16:04:28 2019 From: J.Demeyer at UGent.be (Jeroen Demeyer) Date: Sat, 27 Apr 2019 22:04:28 +0200 Subject: [Python-Dev] PEP 580 and PEP 590 comparison. In-Reply-To: <04dc374d-a648-a3f0-5a1f-fee4ec2b1b98@hotpy.org> References: <5CB4422D.8030507@UGent.be> <04dc374d-a648-a3f0-5a1f-fee4ec2b1b98@hotpy.org> Message-ID: <5CC4B5CC.3010102@UGent.be> On 2019-04-27 11:26, Mark Shannon wrote: > Performance improvements include, but aren't limited to: > > 1. Much faster calls to common classes: range(), set(), type(), list(), > etc. That's not specific to PEP 590. It can be done with any proposal. I know that there is the ABI issue with PEP 580, but that's not such a big problem as you seem to think (see my last e-mail). > 2. Modifying argument clinic to produce C functions compatible with the > vectorcall, allowing the interpreter to call the C function directly, > with no additional overhead beyond the vectorcall call sequence. This is a very good point. Doing this will certainly reduce the overhead of PEP 590 over PEP 580. > 3. Customization of the C code for function objects depending on the > Python code. The would probably be limited to treating closures and > generator function differently, but optimizing other aspects of the > Python function call is possible. I'm not entirely sure what you mean, but I'm pretty sure that it's not specific to PEP 590. Jeroen. 
From guido at python.org Sat Apr 27 16:33:45 2019 From: guido at python.org (Guido van Rossum) Date: Sat, 27 Apr 2019 13:33:45 -0700 Subject: [Python-Dev] datetime.fromisocalendar In-Reply-To: <63756c7f-e07e-3556-7ab0-47c3fc3072de@ganssle.io> References: <63756c7f-e07e-3556-7ab0-47c3fc3072de@ganssle.io> Message-ID: I think it?s a good idea. On Sat, Apr 27, 2019 at 11:43 AM Paul Ganssle wrote: > Greetings, > > Some time ago, I proposed adding a `.fromisocalendar` alternate > constructor to `datetime` (bpo-36004 ), > with a corresponding implementation (PR #11888 > ). I advertised it on > datetime-SIG some time ago but haven't seen much discussion there, so I'd > like to bring it to python-dev's attention as we near the cut-off for new > Python 3.8 features. > > Other than the fact that I've needed this functionality in the past, I > also think a good general principle for the datetime module is that when a > class (time, date, datetime) has a "serialization" method (.strftime, > .timestamp, .isoformat, .isocalendar, etc), there should be a corresponding > *deserialization* method (.strptime, .fromtimestamp, .fromisoformat) that > constructs a datetime from the output. Now that `fromisoformat` was > introduced in Python 3.7, I think `isocalendar` is the only remaining > method without an inverse. Do people agree with this principle? Should we > add the `fromisocalendar` method? > > Thanks, > Paul > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido (mobile) -------------- next part -------------- An HTML attachment was scrubbed... URL: From vadmium+py at gmail.com Sat Apr 27 22:51:14 2019 From: vadmium+py at gmail.com (Martin Panter) Date: Sun, 28 Apr 2019 02:51:14 +0000 Subject: [Python-Dev] git history conundrum In-Reply-To: <3140efd6-54ff-40d0-5906-fbb19e99ea76@withers.org> References: <3140efd6-54ff-40d0-5906-fbb19e99ea76@withers.org> Message-ID: On Sat, 27 Apr 2019 at 19:07, Chris Withers wrote: > Right, so I've merged up to 15f44ab043, what comes next? > > $ git log --oneline --no-merges 15f44ab043.. -- Lib/unittest/mock.py > Lib/unittest/test/testmock/ | tail -n 3 This Git command line means list all the revisions except 15f44ab043 and those leading up to it. > 161a4dd495 Issue #28919: Simplify _copy_func_details() in unittest.mock > ac5084b6c7 Fixes issue28380: unittest.mock Mock autospec functions now > properly support assert_called, assert_not_called, and assert_called_once. > 0be894b2f6 Issue #27895: Spelling fixes (Contributed by Ville Skytt?). > > Okay, no idea why 0be894b2f6 is there, appears to be a totally identical > commit to 15f44ab043 Git revision 15f44ab043 is the original spelling fixes, which were pushed to the Mercurial ?default? branch (= Git master) for Python 3.6 by Raymond. Revision 0be894b2f6 is my backport to the 3.5 branch, done about a week later. The backport is probably a subset of the original, rather than identical (e.g. the datetime.rst change was not applicable to 3.5). The convention at the time was to keep the 3.5 branch merged into Default (Master). That is why my 3.5 backport appears in your history of Master. > so let's skip it: > > $ git log --oneline --no-merges 0be894b2f6.. 
-- Lib/unittest/mock.py > Lib/unittest/test/testmock/ | tail -n 3 > 161a4dd495 Issue #28919: Simplify _copy_func_details() in unittest.mock > ac5084b6c7 Fixes issue28380: unittest.mock Mock autospec functions now > properly support assert_called, assert_not_called, and assert_called_once. > 15f44ab043 Issue #27895: Spelling fixes (Contributed by Ville Skytt?). > > Wat?! Why is 15f44ab043 showing up again?! Because you are asked for all the revisions except my backport and its ancestors. As far as Git is concerned, the original spelling fixes are not an ancestor of my backport. I don?t have a copy of the Git repository to try, but I suggest the following command is what you want: git log --oneline --no-merges HEAD ^15f44ab043 ^0be894b2f6 -- Lib/unittest/mock.py Lib/unittest/test/testmock/ From chris at withers.org Sun Apr 28 03:25:30 2019 From: chris at withers.org (Chris Withers) Date: Sun, 28 Apr 2019 08:25:30 +0100 Subject: [Python-Dev] git history conundrum In-Reply-To: References: <3140efd6-54ff-40d0-5906-fbb19e99ea76@withers.org> Message-ID: <3c7b8ee1-f119-20f9-0c25-3a96923976c8@withers.org> On 28/04/2019 03:51, Martin Panter wrote: > On Sat, 27 Apr 2019 at 19:07, Chris Withers wrote: >> Right, so I've merged up to 15f44ab043, what comes next? >> >> $ git log --oneline --no-merges 15f44ab043.. -- Lib/unittest/mock.py >> Lib/unittest/test/testmock/ | tail -n 3 > > This Git command line means list all the revisions except 15f44ab043 > and those leading up to it. That seems at odds with what I've found searching online and with the backporting instructions left in the mock backport docs. My understanding is that 15f44ab043.. expands out to 15f44ab043..HEAD and means "all revs between 15f44ab043 and master": https://stackoverflow.com/a/7693298/216229 Can you explain what leads you to expect that to behave differently? > The convention at the time was to keep the 3.5 branch merged into > Default (Master). That is why my 3.5 backport appears in your history > of Master. Ah, okay. > Because you are asked for all the revisions except my backport and its > ancestors. As far as Git is concerned, the original spelling fixes are > not an ancestor of my backport. > > I don?t have a copy of the Git repository to try, but I suggest the > following command is what you want: > > git log --oneline --no-merges HEAD ^15f44ab043 ^0be894b2f6 -- > Lib/unittest/mock.py Lib/unittest/test/testmock/ What's the best way to spell "show me all the revisions on master that affect {mock files} from commit x to HEAD, not including x"? cheers, Chris From chris at withers.org Sun Apr 28 03:10:14 2019 From: chris at withers.org (Chris Withers) Date: Sun, 28 Apr 2019 08:10:14 +0100 Subject: [Python-Dev] git history conundrum In-Reply-To: References: <3140efd6-54ff-40d0-5906-fbb19e99ea76@withers.org> Message-ID: <6b6e1fff-808b-2724-b392-21b03387323c@withers.org> On 28/04/2019 03:51, Martin Panter wrote: > On Sat, 27 Apr 2019 at 19:07, Chris Withers wrote: >> Right, so I've merged up to 15f44ab043, what comes next? >> >> $ git log --oneline --no-merges 15f44ab043.. -- Lib/unittest/mock.py >> Lib/unittest/test/testmock/ | tail -n 3 > > This Git command line means list all the revisions except 15f44ab043 > and those leading up to it. That seems at odds with what I've found searching online and with the backporting instructions left in the mock backport docs. My understanding is that 15f44ab043.. 
expands out to 15f44ab043..HEAD and means "all revs between 15f44ab043 and master": https://stackoverflow.com/a/7693298/216229 Can you explain what leads you to expect that to behave differently? > The convention at the time was to keep the 3.5 branch merged into > Default (Master). That is why my 3.5 backport appears in your history > of Master. Ah, okay. > Because you are asked for all the revisions except my backport and its > ancestors. As far as Git is concerned, the original spelling fixes are > not an ancestor of my backport. > > I don?t have a copy of the Git repository to try, but I suggest the > following command is what you want: > > git log --oneline --no-merges HEAD ^15f44ab043 ^0be894b2f6 -- > Lib/unittest/mock.py Lib/unittest/test/testmock/ What's the best way to spell "show me all the revisions on master that affect {mock files} from commit x to HEAD, not including x"? cheers, Chris From vstinner at redhat.com Sun Apr 28 04:53:25 2019 From: vstinner at redhat.com (Victor Stinner) Date: Sun, 28 Apr 2019 10:53:25 +0200 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: <7aad8d69-2d66-54a3-26a4-20dcc5150f66@ubuntu.com> <6a395e77-d681-69f8-ba32-f6fd47f261cb@ubuntu.com> Message-ID: FYI I pushed my 3 changes to implement my idea. It is now possible to install some extensions in release mode and some others in debug mode. Python in debug mode prefers debug extensions. I documented changes here: https://docs.python.org/dev/whatsnew/3.8.html#debug-build-uses-the-same-abi-as-release-build The library filename has to be different in debug mode, so it can be co-installable with release build of a C extension. Victor Le samedi 27 avril 2019, Stefan Behnel a ?crit : > Matthias Klose schrieb am 25.04.19 um 13:48: >> Are there use cases where you only want to load *some* >> debug extensions, even if more are installed? > > Not sure if there are _important_ use cases (that could justify certain > design decisions), but I can certainly imagine using a non-debug (and > therefore faster) Pandas or NumPy for preparing some data that I need to > debug my own code. More generally, whenever I can avoid using a debug > version of a *dependency* that I don't need to include in my debug > analysis, it's probably a good idea to not use the debug version. > > Even given venvs and virtualisation techniques, it would probably be nice > if users could install debug+nondebug versions of libraries once and then > import the right one at need, rather than having to set up a new > environment (while they're on a train in the middle of nowhere without fast > access to PyPI). > > Stefan > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/vstinner%40redhat.com > -- Night gathers, and now my watch begins. It shall not end until my death. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From solipsis at pitrou.net Sun Apr 28 16:51:01 2019 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 28 Apr 2019 22:51:01 +0200 Subject: [Python-Dev] git history conundrum References: <3140efd6-54ff-40d0-5906-fbb19e99ea76@withers.org> <3c7b8ee1-f119-20f9-0c25-3a96923976c8@withers.org> Message-ID: <20190428225101.63d2860b@fsol> On Sun, 28 Apr 2019 08:25:30 +0100 Chris Withers wrote: > > What's the best way to spell "show me all the revisions on master that > affect {mock files} from commit x to HEAD, not including x"? Something like: $ git log x...HEAD -- {mock files} perhaps? Regards Antoine. From robertc at robertcollins.net Sun Apr 28 17:21:39 2019 From: robertc at robertcollins.net (Robert Collins) Date: Mon, 29 Apr 2019 09:21:39 +1200 Subject: [Python-Dev] git history conundrum In-Reply-To: <3140efd6-54ff-40d0-5906-fbb19e99ea76@withers.org> References: <3140efd6-54ff-40d0-5906-fbb19e99ea76@withers.org> Message-ID: Thank you! If I understand correctly this is just the hg style branch backport consequence, multiple copies of a change. Should be safe to skip those. Rob On Sun, 28 Apr 2019, 07:11 Chris Withers, wrote: > Hi All, > > I'm in the process of bringing the mock backport up to date, but this > has got me stumped: > > $ git log --oneline --no-merges > 5943ea76d529f9ea18c73a61e10c6f53bdcc864f.. -- Lib/unittest/mock.py > Lib/unittest/test/testmock/ | tail > 362f058a89 Issue #28735: Fixed the comparison of mock.MagickMock with > mock.ANY. > d9c956fb23 Issue #20804: The unittest.mock.sentinel attributes now > preserve their identity when they are copied or pickled. > 84b6fb0eea Fix unittest.mock._Call: don't ignore name > 161a4dd495 Issue #28919: Simplify _copy_func_details() in unittest.mock > ac5084b6c7 Fixes issue28380: unittest.mock Mock autospec functions now > properly support assert_called, assert_not_called, and assert_called_once. > 0be894b2f6 Issue #27895: Spelling fixes (Contributed by Ville Skytt?). > 15f44ab043 Issue #27895: Spelling fixes (Contributed by Ville Skytt?). > d4583d7fea Issue #26750: use inspect.isdatadescriptor instead of our own > _is_data_descriptor(). > 9854789efe Issue #26750: unittest.mock.create_autospec() now works > properly for subclasses of property() and other data descriptors. > 204bf0b9ae English spelling and grammar fixes > > Right, so I've merged up to 15f44ab043, what comes next? > > $ git log --oneline --no-merges 15f44ab043.. -- Lib/unittest/mock.py > Lib/unittest/test/testmock/ | tail -n 3 > 161a4dd495 Issue #28919: Simplify _copy_func_details() in unittest.mock > ac5084b6c7 Fixes issue28380: unittest.mock Mock autospec functions now > properly support assert_called, assert_not_called, and assert_called_once. > 0be894b2f6 Issue #27895: Spelling fixes (Contributed by Ville Skytt?). > > Okay, no idea why 0be894b2f6 is there, appears to be a totally identical > commit to 15f44ab043, so let's skip it: > > $ git log --oneline --no-merges 0be894b2f6.. -- Lib/unittest/mock.py > Lib/unittest/test/testmock/ | tail -n 3 > 161a4dd495 Issue #28919: Simplify _copy_func_details() in unittest.mock > ac5084b6c7 Fixes issue28380: unittest.mock Mock autospec functions now > properly support assert_called, assert_not_called, and assert_called_once. > 15f44ab043 Issue #27895: Spelling fixes (Contributed by Ville Skytt?). > > Wat?! Why is 15f44ab043 showing up again?! > > What's the git subtlety I'm missing here? 
> > Chris > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/robertc%40robertcollins.net > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris at withers.org Sun Apr 28 17:55:10 2019 From: chris at withers.org (Chris Withers) Date: Sun, 28 Apr 2019 22:55:10 +0100 Subject: [Python-Dev] git history conundrum In-Reply-To: References: <3140efd6-54ff-40d0-5906-fbb19e99ea76@withers.org> Message-ID: <6259012a-f7ea-c294-3bef-f3bef7484a7f@withers.org> On 28/04/2019 22:21, Robert Collins wrote: > Thank you! Thank me when we get there ;-) Currently in Dec 2018 with a wonderful Py2 failure: ====================================================================== ERROR: test_autospec_getattr_partial_function (mock.tests.testhelpers.SpecSignatureTest) ---------------------------------------------------------------------- Traceback (most recent call last): File "mock/tests/testhelpers.py", line 973, in test_autospec_getattr_partial_function autospec = create_autospec(proxy) File "mock/mock.py", line 2392, in create_autospec for entry in dir(spec): TypeError: __dir__() must return a list, not str Once we're done, I'll need a username/password that can write to https://pypi.org/project/mock/ ... > If I understand correctly this is just the hg style branch backport > consequence, multiple copies of a change. Should be safe to skip those. Yep, current script I've been using is here, high level highlighted: https://github.com/cjw296/mock/blob/backporting/backport.py#L102-L125 cheers, Chris From fuzzyman at voidspace.org.uk Sun Apr 28 17:57:52 2019 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Sun, 28 Apr 2019 22:57:52 +0100 Subject: [Python-Dev] git history conundrum In-Reply-To: <6259012a-f7ea-c294-3bef-f3bef7484a7f@withers.org> References: <3140efd6-54ff-40d0-5906-fbb19e99ea76@withers.org> <6259012a-f7ea-c294-3bef-f3bef7484a7f@withers.org> Message-ID: > On 28 Apr 2019, at 22:55, Chris Withers wrote: > >> On 28/04/2019 22:21, Robert Collins wrote: >> Thank you! > > Thank me when we get there ;-) Currently in Dec 2018 with a wonderful Py2 failure: > > ====================================================================== > ERROR: test_autospec_getattr_partial_function (mock.tests.testhelpers.SpecSignatureTest) > ---------------------------------------------------------------------- > Traceback (most recent call last): > File "mock/tests/testhelpers.py", line 973, in test_autospec_getattr_partial_function > autospec = create_autospec(proxy) > File "mock/mock.py", line 2392, in create_autospec > for entry in dir(spec): > TypeError: __dir__() must return a list, not str > > Once we're done, I'll need a username/password that can write to https://pypi.org/project/mock/ ... I can add you as a maintainer. Ping me off-list. Michael > >> If I understand correctly this is just the hg style branch backport consequence, multiple copies of a change. Should be safe to skip those. 
> > Yep, current script I've been using is here, high level highlighted: > > https://github.com/cjw296/mock/blob/backporting/backport.py#L102-L125 > > cheers, > > Chris > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk From robertc at robertcollins.net Sun Apr 28 18:03:43 2019 From: robertc at robertcollins.net (Robert Collins) Date: Mon, 29 Apr 2019 10:03:43 +1200 Subject: [Python-Dev] git history conundrum In-Reply-To: <6259012a-f7ea-c294-3bef-f3bef7484a7f@withers.org> References: <3140efd6-54ff-40d0-5906-fbb19e99ea76@withers.org> <6259012a-f7ea-c294-3bef-f3bef7484a7f@withers.org> Message-ID: Share your own username with Michael or I and we'll add you there. Rob On Mon, 29 Apr 2019, 09:55 Chris Withers, wrote: > On 28/04/2019 22:21, Robert Collins wrote: > > Thank you! > > Thank me when we get there ;-) Currently in Dec 2018 with a wonderful > Py2 failure: > > ====================================================================== > ERROR: test_autospec_getattr_partial_function > (mock.tests.testhelpers.SpecSignatureTest) > ---------------------------------------------------------------------- > Traceback (most recent call last): > File "mock/tests/testhelpers.py", line 973, in > test_autospec_getattr_partial_function > autospec = create_autospec(proxy) > File "mock/mock.py", line 2392, in create_autospec > for entry in dir(spec): > TypeError: __dir__() must return a list, not str > > Once we're done, I'll need a username/password that can write to > https://pypi.org/project/mock/ ... > > > If I understand correctly this is just the hg style branch backport > > consequence, multiple copies of a change. Should be safe to skip those. > > Yep, current script I've been using is here, high level highlighted: > > https://github.com/cjw296/mock/blob/backporting/backport.py#L102-L125 > > cheers, > > Chris > -------------- next part -------------- An HTML attachment was scrubbed... URL: From vstinner at redhat.com Mon Apr 29 09:30:41 2019 From: vstinner at redhat.com (Victor Stinner) Date: Mon, 29 Apr 2019 15:30:41 +0200 Subject: [Python-Dev] datetime.fromisocalendar In-Reply-To: References: <63756c7f-e07e-3556-7ab0-47c3fc3072de@ganssle.io> Message-ID: I reviewed and merged Paul's PR. I concur with Guido, the new constructor perfectly makes sense and is useful. About the implementation: date and time are crazy beasts. Extract of the code: if not 0 < week < 53: out_of_range = True if week == 53: # ISO years have 53 weeks in them on years starting with a # Thursday and leap years starting on a Wednesday first_weekday = _ymd2ord(year, 1, 1) % 7 if (first_weekday == 4 or (first_weekday == 3 and _is_leap(year))): out_of_range = False if out_of_range: raise ValueError(f"Invalid week: {week}") "ISO years have 53 weeks in them on years starting with a Thursday and leap years starting on a Wednesday" !?! Victor Le sam. 27 avr. 2019 ? 22:37, Guido van Rossum a ?crit : > > I think it?s a good idea. > > On Sat, Apr 27, 2019 at 11:43 AM Paul Ganssle wrote: >> >> Greetings, >> >> Some time ago, I proposed adding a `.fromisocalendar` alternate constructor to `datetime` (bpo-36004), with a corresponding implementation (PR #11888). I advertised it on datetime-SIG some time ago but haven't seen much discussion there, so I'd like to bring it to python-dev's attention as we near the cut-off for new Python 3.8 features. 
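Coming back to the 53-week rule Victor quotes above: it can be cross-checked with nothing more than the calendar machinery that already exists, because 28 December always falls in the last ISO week of its year. A small, self-contained sanity check, independent of the merged implementation:

    import calendar
    from datetime import date

    def iso_weeks_in_year(year):
        # 28 December always falls in the last ISO week of its year.
        return date(year, 12, 28).isocalendar()[1]

    for year in range(1, 4000):
        jan1 = date(year, 1, 1).isoweekday()     # 1 = Monday ... 7 = Sunday
        long_year = jan1 == 4 or (jan1 == 3 and calendar.isleap(year))
        assert (iso_weeks_in_year(year) == 53) == long_year

So the comment is accurate: the 53-week ISO years are exactly the years starting on a Thursday, plus the leap years starting on a Wednesday.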
>> >> Other than the fact that I've needed this functionality in the past, I also think a good general principle for the datetime module is that when a class (time, date, datetime) has a "serialization" method (.strftime, .timestamp, .isoformat, .isocalendar, etc), there should be a corresponding deserialization method (.strptime, .fromtimestamp, .fromisoformat) that constructs a datetime from the output. Now that `fromisoformat` was introduced in Python 3.7, I think `isocalendar` is the only remaining method without an inverse. Do people agree with this principle? Should we add the `fromisocalendar` method? >> >> Thanks, >> Paul >> >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: https://mail.python.org/mailman/options/python-dev/guido%40python.org > > -- > --Guido (mobile) > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/vstinner%40redhat.com -- Night gathers, and now my watch begins. It shall not end until my death. From vano at mail.mipt.ru Mon Apr 29 16:56:08 2019 From: vano at mail.mipt.ru (Ivan Pozdeev) Date: Mon, 29 Apr 2019 23:56:08 +0300 Subject: [Python-Dev] datetime.fromisocalendar In-Reply-To: References: <63756c7f-e07e-3556-7ab0-47c3fc3072de@ganssle.io> Message-ID: <16fa808e-cb0c-c31e-274d-b60b53ef2a23@mail.mipt.ru> On 29.04.2019 16:30, Victor Stinner wrote: > I reviewed and merged Paul's PR. I concur with Guido, the new > constructor perfectly makes sense and is useful. > > About the implementation: date and time are crazy beasts. Extract of the code: > > if not 0 < week < 53: > out_of_range = True > > if week == 53: > # ISO years have 53 weeks in them on years starting with a > # Thursday and leap years starting on a Wednesday > first_weekday = _ymd2ord(year, 1, 1) % 7 > if (first_weekday == 4 or (first_weekday == 3 and > _is_leap(year))): > out_of_range = False > > if out_of_range: > raise ValueError(f"Invalid week: {week}") > > "ISO years have 53 weeks in them on years starting with a Thursday and > leap years starting on a Wednesday" !?! https://www.staff.science.uu.nl/~gent0113/calendar/isocalendar.htm , linked from https://docs.python.org/3/library/datetime.html?highlight=isocalendar#datetime.date.isocalendar > Victor > > Le sam. 27 avr. 2019 ? 22:37, Guido van Rossum a ?crit : >> I think it?s a good idea. >> >> On Sat, Apr 27, 2019 at 11:43 AM Paul Ganssle wrote: >>> Greetings, >>> >>> Some time ago, I proposed adding a `.fromisocalendar` alternate constructor to `datetime` (bpo-36004), with a corresponding implementation (PR #11888). I advertised it on datetime-SIG some time ago but haven't seen much discussion there, so I'd like to bring it to python-dev's attention as we near the cut-off for new Python 3.8 features. >>> >>> Other than the fact that I've needed this functionality in the past, I also think a good general principle for the datetime module is that when a class (time, date, datetime) has a "serialization" method (.strftime, .timestamp, .isoformat, .isocalendar, etc), there should be a corresponding deserialization method (.strptime, .fromtimestamp, .fromisoformat) that constructs a datetime from the output. Now that `fromisoformat` was introduced in Python 3.7, I think `isocalendar` is the only remaining method without an inverse. 
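That inverse is also easy to state, which supports the pairing principle: 4 January is always in ISO week 1, so an (ISO year, ISO week, ISO weekday) triple maps back to a date with plain timedelta arithmetic. An illustrative textbook recipe only, not the implementation from the PR (which also has to validate its inputs):

    from datetime import date, timedelta

    def date_from_isocalendar(iso_year, iso_week, iso_day):
        # 4 January is always in ISO week 1 of its ISO year.
        jan4 = date(iso_year, 1, 4)
        week1_monday = jan4 - timedelta(days=jan4.isoweekday() - 1)
        return week1_monday + timedelta(weeks=iso_week - 1, days=iso_day - 1)

    # Round-trips against the existing "serialization" direction:
    for d in (date(2019, 4, 27), date(2020, 1, 1), date(2021, 1, 3)):
        assert date_from_isocalendar(*d.isocalendar()) == d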
Do people agree with this principle? Should we add the `fromisocalendar` method? >>> >>> Thanks, >>> Paul >>> >>> _______________________________________________ >>> Python-Dev mailing list >>> Python-Dev at python.org >>> https://mail.python.org/mailman/listinfo/python-dev >>> Unsubscribe: https://mail.python.org/mailman/options/python-dev/guido%40python.org >> -- >> --Guido (mobile) >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: https://mail.python.org/mailman/options/python-dev/vstinner%40redhat.com > > -- Regards, Ivan From nas-python at arctrix.com Mon Apr 29 20:01:41 2019 From: nas-python at arctrix.com (Neil Schemenauer) Date: Mon, 29 Apr 2019 18:01:41 -0600 Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode In-Reply-To: References: <20190424185501.j6kjdbmb4adj4x6x@python.ca> Message-ID: <20190430000141.sbdhdez3xumf2qgz@python.ca> On 2019-04-27, Nathaniel Smith wrote: > For Py_TRACE_REFS specifically, IIUC the only goal is to be able to produce > a list of all live objects on demand. If that's the goal, then static type > objects aren't a huge deal. You can't add extra data into the type objects > themselves, but since there's a fixed set of them and they're immortal, you > can just build a static list of all of them in PyType_Ready. As far as I understand, we have a similar problem already for gc.get_objects() because those static type objects don't have a PyGC_Head. My 2-cent proposal for fixing things in the long term would be to introduce a function like PyType_Ready that returns a pointer to the new type. The argument to it would be what is the current static type structure. The function would copy things from the static type structure into a newly allocated type structure. We have a kind of solution already with PyType_FromSpec, etc. However, I think it is harder to convert existing extension module source code to use that API. We want to make it very easy for people to fix source code. If we can remove static types, that would allow us to kill off Py_TYPE(o)->tp_is_gc(o). I understand why that exists but I think it is quite an ugly detail of the current GC implementation. I wonder about the performance impact of it given current memory latencies. When we do a full GC run, we call PyObject_IS_GC() on many objects. I fear having to lookup and call tp_is_gc could be quite expensive. I've been playing with the idea of using memory bitmaps rather then the PyGC_Head. That idea seems to depend on removing static type objects. Initially I was thinking of it as reducing the memory overhead for GC types. Now I think the memory overhead doesn't matter too much but perhaps the bitmaps would be much faster due to memory latency. There is an interesting Youtube video that compares vector traversals vs linked list traversals in C++. Linked lists on modern machines are really terrible. 
Regards,

Neil

From vstinner at redhat.com Mon Apr 29 20:22:59 2019
From: vstinner at redhat.com (Victor Stinner)
Date: Tue, 30 Apr 2019 02:22:59 +0200
Subject: [Python-Dev] Use C extensions compiled in release mode on a Python compiled in debug mode
In-Reply-To: <20190430000141.sbdhdez3xumf2qgz@python.ca>
References: <20190424185501.j6kjdbmb4adj4x6x@python.ca> <20190430000141.sbdhdez3xumf2qgz@python.ca>
Message-ID: 

You have my support if you work on removing static types :-)

Here are my notes on the current C APIs to define a type:
https://pythoncapi.readthedocs.io/type_object.html

IMHO static types should go away in the long term. They are causing
too many practical issues.

Victor

On Tue, 30 Apr 2019 at 02:01, Neil Schemenauer wrote:
>
> On 2019-04-27, Nathaniel Smith wrote:
> > For Py_TRACE_REFS specifically, IIUC the only goal is to be able to produce
> > a list of all live objects on demand. If that's the goal, then static type
> > objects aren't a huge deal. You can't add extra data into the type objects
> > themselves, but since there's a fixed set of them and they're immortal, you
> > can just build a static list of all of them in PyType_Ready.
>
> As far as I understand, we have a similar problem already for
> gc.get_objects() because those static type objects don't have a
> PyGC_Head.
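The gap quoted just above is visible from pure Python: a class defined in Python code is a heap type and is tracked by the collector, while a built-in static type has no PyGC_Head and therefore never appears in gc.get_objects(). A minimal CPython-specific illustration:

    import gc

    class Heapy:                  # heap type, allocated with a PyGC_Head
        pass

    print(gc.is_tracked(Heapy))   # True  -> shows up in gc.get_objects()
    print(gc.is_tracked(int))     # False -> static type, invisible to the collector
    print(any(obj is int for obj in gc.get_objects()))   # False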
My 2-cent proposal for fixing things in the long term > would be to introduce a function like PyType_Ready that returns a > pointer to the new type. The argument to it would be what is the > current static type structure. The function would copy things from > the static type structure into a newly allocated type structure. I doubt you'll be able to get rid of static types entirely, due to the usual issues with C API breakage. And I'm guessing that static types make up such a tiny fraction of the address space that merely tweaking the percent up or down won't affect performance. But your proposed new API would make it *way* easier to migrate existing code to the stable ABI. -n -- Nathaniel J. Smith -- https://vorpus.org From solipsis at pitrou.net Tue Apr 30 04:14:28 2019 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 30 Apr 2019 10:14:28 +0200 Subject: [Python-Dev] PEP 574 ready for review Message-ID: <20190430101428.204457e6@fsol> Hello, I've put the final touches to PEP 574 - Pickle protocol 5 with out-of-band data (*). It is now ready for review. The implementation is fully functional, as well as its PyPI backport (**), and has regression tests against Numpy. Numpy and PyArrow have their own tests against the pickle5 backport. (*) https://www.python.org/dev/peps/pep-0574/ (**) https://pypi.org/project/pickle5/ Regards Antoine. From chris at withers.org Tue Apr 30 05:26:19 2019 From: chris at withers.org (Chris Withers) Date: Tue, 30 Apr 2019 10:26:19 +0100 Subject: [Python-Dev] drop jython support in mock backport? Message-ID: [resending to python-dev in case there are Jython users here...] Hi All, If you need Jython support in the mock backport, please shout now: https://github.com/testing-cabal/mock/issues/453 cheers, Chris From chris at withers.org Tue Apr 30 17:24:53 2019 From: chris at withers.org (Chris Withers) Date: Tue, 30 Apr 2019 22:24:53 +0100 Subject: [Python-Dev] "if __name__ == '__main__'" at the bottom of python unittest files Message-ID: <01ebef8d-d370-22c5-cb7a-194704a3906c@withers.org> Hi All, I have a crazy idea of getting unittest.mock up to 100% code coverage. I noticed at the bottom of all of the test files in testmock/, there's a: if __name__ == '__main__': ??? unittest.main() ...block. How would people feel about these going away? I don't *think* they're needed now that we have unittest discover, but thought I'd ask. Chris From robertc at robertcollins.net Tue Apr 30 18:41:45 2019 From: robertc at robertcollins.net (Robert Collins) Date: Wed, 1 May 2019 10:41:45 +1200 Subject: [Python-Dev] "if __name__ == '__main__'" at the bottom of python unittest files In-Reply-To: <01ebef8d-d370-22c5-cb7a-194704a3906c@withers.org> References: <01ebef8d-d370-22c5-cb7a-194704a3906c@withers.org> Message-ID: They were never needed ? Removal is fine with me. On Wed, 1 May 2019, 09:27 Chris Withers, wrote: > Hi All, > > I have a crazy idea of getting unittest.mock up to 100% code coverage. > > I noticed at the bottom of all of the test files in testmock/, there's a: > > if __name__ == '__main__': > unittest.main() > > ...block. > > How would people feel about these going away? I don't *think* they're > needed now that we have unittest discover, but thought I'd ask. 
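For what it's worth, discovery does cover these files: the per-file blocks only matter when a test module is executed directly. A sketch of the programmatic equivalent of `python -m unittest discover` for the directory in question; the start directory is illustrative and should be adjusted to the repository layout:

    import unittest

    # Roughly what `python -m unittest discover -s testmock` does:
    suite = unittest.defaultTestLoader.discover("testmock", pattern="test*.py")
    unittest.TextTestRunner(verbosity=2).run(suite)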
> > Chris > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/robertc%40robertcollins.net > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ja.py at farowl.co.uk Tue Apr 30 18:00:25 2019 From: ja.py at farowl.co.uk (Jeff Allen) Date: Tue, 30 Apr 2019 23:00:25 +0100 Subject: [Python-Dev] drop jython support in mock backport? In-Reply-To: References: Message-ID: Cross-posting to jython-users for obvious reasons. Jeff Allen On 30/04/2019 10:26, Chris Withers wrote: > [resending to python-dev in case there are Jython users here...] > > Hi All, > > If you need Jython support in the mock backport, please shout now: > > https://github.com/testing-cabal/mock/issues/453 > > cheers, > > Chris > -------------- next part -------------- An HTML attachment was scrubbed... URL: