From p.f.moore at gmail.com Thu Feb 1 04:30:04 2018
From: p.f.moore at gmail.com (Paul Moore)
Date: Thu, 1 Feb 2018 09:30:04 +0000
Subject: [Python-Dev] Why is Python for Windows compiled with MSVC?
In-Reply-To:
References: <3zWzR85rP2zFrJV@mail.python.org>
Message-ID:

On 1 February 2018 at 00:42, Gregory P. Smith wrote:
> TL;DR of Steve's post - MSVC is the compiler of choice for most serious
> software on Windows. So we use it to best integrate with the world. There is
> no compelling reason to change that.
>
> The free-as-in-beer MSVC community edition is finally non-sucky (their
> earlier efforts were crippled, they seem to have learned the lesson)
>
> There are other viable Windows compilers. If we want to support those in
> CPython someone needs to contribute the work to do so, ongoing maintenance,
> and buildbots. I'd love to see a Clang based Windows build (Google Chrome
> is built using that). But I have no motivating reason to do the work. I
> believe such a build could be made to integrate and inter-operate fully with
> MSVC builds and ABIs. We could probably even make cross-compilation of
> extensions from Linux -> Windows work that way.
>
> We're highly unlikely to ever ship python.org Windows binaries
> built with anything other than MSVC unless Microsoft takes a turn toward the
> dark side again.

I would also point out that CPython (distutils, specifically) supported mingw builds (that's the original mingw 32-bit version) for a long time. Support for that bit-rotted as the mingw project fragmented into various 64-bit versions and the mingw project(s) made slow progress on supporting newer CRTs on Windows. No-one from the community who used mingw was providing patches back to distutils or Python, and so the support for mingw was (I believe) dropped.

It sounds as if people have since then got things to a point where building extensions with (some form of) mingw is possible, but I don't know the details, and I'm not aware of any documentation that's been contributed back to Python on how "mingw support for building extensions" works these days.

We'd be more than happy to support building extensions with alternative compilers (such as one of the various gcc builds that go under the banner of "mingw") - that's a completely different matter than switching the CPython build process to use mingw - but it's down to the community of users of such a compiler to contribute that support. Expecting "someone else", and particularly someone who doesn't need it, to provide that support, is unreasonable.

Disclaimer: This all comes under "as far as I know" - I did a lot of work on mingw support back when it was supported (including trying to push patches back to the mingw people) but haven't been following it for a long time. Things could be very different now than I remember.

Paul

From j.orponen at 4teamwork.ch Thu Feb 1 06:35:35 2018
From: j.orponen at 4teamwork.ch (Joni Orponen)
Date: Thu, 1 Feb 2018 12:35:35 +0100
Subject: [Python-Dev] OS-X builds for 3.7.0
In-Reply-To:
References:
Message-ID:

On Thu, Feb 1, 2018 at 12:18 AM, Chris Barker wrote:

> On Wed, Jan 31, 2018 at 3:13 AM, Joni Orponen
> wrote:
>
>> On Wed, Jan 31, 2018 at 12:43 AM, Chris Barker - NOAA Federal <
>> chris.barker at noaa.gov> wrote:
>>
>>> And maybe we could even get rid of the "Framework" builds......
>>>>
>>>
>>> Please do not. These make life easier for doing things the Apple way for
>>> signed sandboxed applications.
>>>
>>> For the record, are you re-distributing the python.org builds, or
>>> re-building yourself?
>>>
>>
>> We are re-building ourselves.
>>
>
> Then it makes no difference to you if the python.org binaries are
> Framework builds... though maybe you want the configure target available.
>

And if the official distribution distributes a Framework, the configure target and build for that are also actually maintained, which makes them likelier to stay usable.

Were we not doing a couple of very specific things, we'd be able to use the distributed Framework as well. It is not inconceivable we could use it as-is in the future or for other projects.

--
Joni Orponen

From olegs at traiana.com Thu Feb 1 04:19:31 2018
From: olegs at traiana.com (Oleg Sivokon)
Date: Thu, 1 Feb 2018 09:19:31 +0000
Subject: [Python-Dev] Why is Python for Windows compiled with MSVC?
In-Reply-To: <69628ee3-d220-4934-aa73-f652cb5c6f2d@HE1EUR02FT027.eop-EUR02.prod.protection.outlook.com>
References: , <69628ee3-d220-4934-aa73-f652cb5c6f2d@HE1EUR02FT027.eop-EUR02.prod.protection.outlook.com>
Message-ID:

> so why shouldn't the one with the most users?

Because it makes compilation difficult, and cross-compilation completely impossible? Why is it difficult: a package maintainer needs to (1) buy MS Windows (2) create a special workflow for compiling on a different machine. This is both costly and inconsistent with free-as-in-freedom...
It makes cross-compilation impossible because libraries produced by any tool that can run on all target platforms are incompatible with Python binaries on MS Windows.

Again, many languages (i.e. projects similar in size and purpose to CPython) took a different approach: they use GNU compilers to be able to compile cross-platform. This is true for Ruby and Go at least. I would need to investigate further, but I think these two examples should be enough.

> I'm likely biased because I work there and I'm the main intermediary with python-dev, but these days Microsoft is one of the strongest supporters of CPython. We employ the most core developers of any private company and we all are allowed work time to contribute, we provide full access to our development tools and platforms to all core developers and some prominent projects, we've made fixes, enhancements and releases of core products such as the CRT, MSVC, Visual Studio, Visual Studio Code, and Azure SPECIFICALLY to support CPython development and users. As far as I know, ALL the Windows buildbots are running on Azure subscriptions that Microsoft provides (though managed by some awesome volunteers). You'll see us at PyCon US under the biggest banner and we'll have a booth filled with engineers and not recruiters. Crash reports from thousands of opted-in users come into our systems and have directly led to both CPython and Windows bug fixes.

Oh, so this is the real reason... well, corporate interests are hard to argue against. But, this is an interesting statistic nevertheless. Thanks for letting me know.

Best.

Oleg

This communication and all information contained in or attached to it is confidential, intended solely for the addressee, may be legally privileged and is the intellectual property of one of the companies of NEX Group plc ("NEX") or third parties. If you are not the intended addressee or receive this message in error, please immediately delete all copies of it and notify the sender. We have taken precautions to minimise the risk of transmitting software viruses, but we advise you to carry out your own virus checks on any attachments. We do not accept liability for any loss or damage caused by software viruses. NEX reserves the right to monitor all communications. We do not accept any legal responsibility for the content of communications, and no communication shall be considered legally binding. Furthermore, if the content of this communication is personal or unconnected with our business, we accept no liability or responsibility for it. NEX Group plc is a public limited company registered in England and Wales under number 10013770 and certain of its affiliates are authorised and regulated by regulatory authorities. For further regulatory information please see www.NEX.com.

From christian at python.org Thu Feb 1 07:14:43 2018
From: christian at python.org (Christian Heimes)
Date: Thu, 1 Feb 2018 13:14:43 +0100
Subject: [Python-Dev] Why is Python for Windows compiled with MSVC?
In-Reply-To:
References: <69628ee3-d220-4934-aa73-f652cb5c6f2d@HE1EUR02FT027.eop-EUR02.prod.protection.outlook.com>
Message-ID:

On 2018-02-01 10:19, Oleg Sivokon wrote:
>
>> so why shouldn't the one with the most users?
>
> Because it makes compilation difficult, and cross-compilation completely impossible? Why is it difficult: a package maintainer needs to (1) buy MS Windows (2) create a special workflow for compiling on a different machine. This is both costly and inconsistent with free-as-in-freedom... It makes cross-compilation impossible because libraries produced by any tool that can run on all target platforms are incompatible with Python binaries on MS Windows.
>
> Again, many languages (i.e. projects similar in size and purpose to CPython) took a different approach: they use GNU compilers to be able to compile cross-platform. This is true for Ruby and Go at least. I would need to investigate further, but I think these two examples should be enough.
>
>> I'm likely biased because I work there and I'm the main intermediary with python-dev, but these days Microsoft is one of the strongest supporters of CPython. We employ the most core developers of any private company and we all are allowed work time to contribute, we provide full access to our development tools and platforms to all core developers and some prominent projects, we've made fixes, enhancements and releases of core products such as the CRT, MSVC, Visual Studio, Visual Studio Code, and Azure SPECIFICALLY to support CPython development and users. As far as I know, ALL the Windows buildbots are running on Azure subscriptions that Microsoft provides (though managed by some awesome volunteers). You'll see us at PyCon US under the biggest banner and we'll have a booth filled with engineers and not recruiters. Crash reports from thousands of opted-in users come into our systems and have directly led to both CPython and Windows bug fixes.
>
> Oh, so this is the real reason... well, corporate interests are hard to argue against. But, this is an interesting statistic nevertheless. Thanks for letting me know.

You are drawing the wrong conclusion here. CPython had been using MSVC for many years before Microsoft even started to sponsor MSDN subscriptions for core developers. MSVC has been the default Windows compiler for over 20 years now. IIRC Microsoft started donating MSDN subscriptions about 5 years ago, and Steve has been helping out with Windows improvements for about 5 years.

Christian

From brian at python.org Thu Feb 1 08:15:32 2018
From: brian at python.org (Brian Curtin)
Date: Thu, 1 Feb 2018 08:15:32 -0500
Subject: [Python-Dev] Why is Python for Windows compiled with MSVC?
In-Reply-To:
References: <69628ee3-d220-4934-aa73-f652cb5c6f2d@HE1EUR02FT027.eop-EUR02.prod.protection.outlook.com>
Message-ID:

On Thu, Feb 1, 2018 at 4:19 AM, Oleg Sivokon wrote:

>
> > so why shouldn't the one with the most users?
>
> Because it makes compilation difficult, and cross-compilation completely
> impossible? Why is it difficult: a package maintainer needs to (1) buy MS
> Windows (2) create a special workflow for compiling on a different
> machine. This is both costly and inconsistent with free-as-in-freedom...
> It makes cross-compilation impossible because libraries produced by any
> tool that can run on all target platforms are incompatible with Python
> binaries on MS Windows.
>
> Again, many languages (i.e. projects similar in size and purpose to
> CPython) took a different approach: they use GNU compilers to be able to
> compile cross-platform. This is true for Ruby and Go at least. I would
> need to investigate further, but I think these two examples should be
> enough.
>

They should be enough for *what*, though? You can tell people what everyone else is doing, but the difference between that and what we have is someone's time and effort.

> I'm likely biased because I work there and I'm the main intermediary with
> python-dev, but these days Microsoft is one of the strongest supporters of
> CPython. We employ the most core developers of any private company and we
> all are allowed work time to contribute, we provide full access to our
> development tools and platforms to all core developers and some prominent
> projects, we've made fixes, enhancements and releases of core products such
> as the CRT, MSVC, Visual Studio, Visual Studio Code, and Azure SPECIFICALLY
> to support CPython development and users. As far as I know, ALL the Windows
> buildbots are running on Azure subscriptions that Microsoft provides
> (though managed by some awesome volunteers). You'll see us at PyCon US
> under the biggest banner and we'll have a booth filled with engineers and
> not recruiters. Crash reports from thousands of opted-in users come into
> our systems and have directly led to both CPython and Windows bug fixes.
>
> Oh, so this is the real reason... well, corporate interests are hard to
> argue against. But, this is an interesting statistic nevertheless. Thanks
> for letting me know.

I think that's a mischaracterization of the situation. The MS toolchain was chosen some time long before I (or Steve) got involved, and when I upgraded us from VS2008 to VS2010 for 3.3 ~6 years ago I received several messages similar to this thread.

As much as Steve is unlikely to do the work to initiate and maintain support of these other tools, whether due to his employer's interests or his own, I too was unlikely to do work like this thread is asking. In fact, the chances I would have done it were zero because I was sitting on my couch upgrading our Visual Studio versions because it let me do better stuff at my day job, though I was always open to review patches that supported alternatives without major disruption. However, they never came. I suspect the same could be said of Martin and anyone else working in this area prior to that, because nothing has really changed.

Like the previous times this sort of question has come up (and really, for any question on this list), it ultimately turns into a matter of how much the solution is wanted and how much effort people are willing to give to make it happen. Historically, the former has had small amounts, and the latter has had much smaller amounts. Without a change there I don't think one will materialize in a released version of Python.

Brian

From stefanrin at gmail.com Thu Feb 1 08:55:27 2018
From: stefanrin at gmail.com (Stefan Ring)
Date: Thu, 1 Feb 2018 14:55:27 +0100
Subject: [Python-Dev] Why is Python for Windows compiled with MSVC?
In-Reply-To:
References: <69628ee3-d220-4934-aa73-f652cb5c6f2d@HE1EUR02FT027.eop-EUR02.prod.protection.outlook.com>

Message-ID:

> As much as Steve is unlikely to do the work to initiate and
> maintain support of these other tools, whether due to his employer's
> interests or his own, I too was unlikely to do work like this thread is
> asking. In fact, the chances I would have done it were zero because I was
> sitting on my couch upgrading our Visual Studio versions because it let me
> do better stuff at my day job, though I was always open to review patches
> that supported alternatives without major disruption. However, they never
> came. I suspect the same could be said of Martin and anyone else working in
> this area prior to that, because nothing has really changed.

It would be cool though if Microsoft started providing a cross-compiler running on Linux. This could even be the only compiler shipped with Visual Studio, now that Windows can run Linux userland.

Cross-compilers from Microsoft would not be totally unheard of. IIRC, the last DOS versions (Visual C++ 1.5x) were Win32 binaries building for DOS 16 bit. Technically speaking, using a 32 bit compiler for building for 64 bit Windows or the other way around would probably count as cross-compilation anyway.

From barry at python.org Thu Feb 1 09:40:56 2018
From: barry at python.org (Barry Warsaw)
Date: Thu, 1 Feb 2018 09:40:56 -0500
Subject: [Python-Dev] Why is Python for Windows compiled with MSVC?
In-Reply-To:
References: <69628ee3-d220-4934-aa73-f652cb5c6f2d@HE1EUR02FT027.eop-EUR02.prod.protection.outlook.com>
Message-ID: <59693230-FC88-430A-9887-D50B49E196CE@python.org>

On Feb 1, 2018, at 04:19, Oleg Sivokon wrote:
>
> Oh, so this is the real reason... well, corporate interests are hard to argue against. But, this is an interesting statistic nevertheless. Thanks for letting me know.

Maybe it hasn't happened because no volunteer has stepped up to do it. Or maybe no corporation thinks it a good business investment to pay employees to do it. What other options are there, and if none, then which of those can you solve?

Or to paraphrase a timeless quote: Ask not what Python can do for you, ask what you can do for Python.

-Barry

From olegs at traiana.com Thu Feb 1 04:50:01 2018
From: olegs at traiana.com (Oleg Sivokon)
Date: Thu, 1 Feb 2018 09:50:01 +0000
Subject: [Python-Dev] Why is Python for Windows compiled with MSVC?
In-Reply-To:
References: <3zWzR85rP2zFrJV@mail.python.org> ,
Message-ID:

> I would also point out that CPython (distutils, specifically) supported mingw builds (that's the original mingw 32-bit version) for a long time. Support for that bit-rotted as the mingw project fragmented with various 64-bit versions, and slow progress from the mingw project(s) for supporting newer CRTs on Windows. No-one from the community who used mingw was providing patches back to distutils or Python, and so the support for mingw was (I believe) dropped.
>
> It sounds as if people have since then got things to a point where building extensions with (some form of) mingw is possible, but I don't know the details, and I'm not aware of any documentation that's been contributed back to Python on how "mingw support for building extensions" works these days.

The question isn't whether I can build Python with MinGW: I know I can. The problem is that if I do that, I make the users of my package depend on my special build of Python. This also means that the packages built by others (who are unaware of my special build of Python) may or may not work for them. In practice, if I want to make sure that the users of my packages can actually use them, I cannot allow myself to link them against a Python binary of my choice. It has to be their choice.
> We'd be more than happy to support building extensions with alternative compilers (such as one of the various gcc builds that go under the banner of "mingw") - that's a completely different matter than switching the CPython build process to use mingw - but it's down to the community of users of such a compiler to contribute that support. Expecting "someone else", and particularly someone who doesn't need it, to provide that support, is unreasonable.

Well, guys, since you are here, I assumed you were in the business of deciding which compiler to compile your code with. I really just asked a question. It's obvious that I don't agree with your decision, but I wanted to hear your argument. Nowhere did I suggest that you should do any work for me or anything of that kind. All I wanted was information to make an informed decision about using Python and its extensions ecosystem. I've got my answers, for which I'm thankful.

I'm sorry for the disclaimer that follows this email. Unfortunately, I too am a slave of circumstances, such as my employer and the IT department that would not let me send an email without this legal mumbo-jumbo.

Best.

Oleg

From chris.barker at noaa.gov Thu Feb 1 15:24:53 2018
From: chris.barker at noaa.gov (Chris Barker)
Date: Thu, 1 Feb 2018 12:24:53 -0800
Subject: [Python-Dev] OS-X builds for 3.7.0
In-Reply-To:
References:
Message-ID:

> Ned Deily is in charge of the Mac build (as well as current release
> manager). Within the last week, he revised the official builds (now two, I
> believe) for 3.7.0b1, due in a day or so. One will be a future oriented
> 64-bit build. The PR and What's New have more.
>

What's New doesn't mention it, but Ned's announcement does:

"""
Attention macOS users: with 3.7.0b1, we are providing a choice of two binary installers. The new variant provides a 64-bit-only version for macOS 10.9 and later systems; this variant also now includes its own built-in version of Tcl/Tk 8.6. We welcome your feedback.
"""

So that's a start -- thanks Ned!

> He may not be reading this thread, but will read MacOS tracker issues with
> a specific proposal, data and a patch. Comparisons should be against the
> current master or an installed 3.7.0b1.

I hope the folks on this thread that know what they are doing can test and make suggestions :-)

-CHB

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA 98115        (206) 526-6317   main reception

Chris.Barker at noaa.gov

From elprans at gmail.com Thu Feb 1 19:34:19 2018
From: elprans at gmail.com (Elvis Pranskevichus)
Date: Thu, 01 Feb 2018 19:34:19 -0500
Subject: [Python-Dev] Dataclasses and correct hashability
Message-ID: <3951276.IZdOgMHjlT@hammer.magicstack.net>

There appears to be a critical omission from the current dataclass implementation: it does not make hash=True fields immutable.
Per Python spec:

"the implementation of hashable collections requires that a key's hash
value is immutable (if the object's hash value changes, it will be in
the wrong hash bucket)"

Yet:

    import dataclasses

    @dataclasses.dataclass(hash=True)
    class A:
        foo: int = dataclasses.field(hash=True, compare=True)

    a = A(foo=1)

    s = set()
    s.add(a)   # s == {a}
    a.foo = 2

    print(a in s)
    print({a} == s)
    print(s == s)

    # prints False False True

This looks to me like a clearly wrong behavior.

                                     Elvis

From eric at trueblade.com Thu Feb 1 20:17:00 2018
From: eric at trueblade.com (Eric V. Smith)
Date: Thu, 1 Feb 2018 20:17:00 -0500
Subject: [Python-Dev] Dataclasses and correct hashability
In-Reply-To: <3951276.IZdOgMHjlT@hammer.magicstack.net>
References: <3951276.IZdOgMHjlT@hammer.magicstack.net>
Message-ID: <4b8e20d5-31db-69e3-2dbb-d2853d47c6c0@trueblade.com>

On 2/1/2018 7:34 PM, Elvis Pranskevichus wrote:
> There appears to be a critical omission from the current dataclass
> implementation: it does not make hash=True fields immutable.
>
> Per Python spec:
>
> "the implementation of hashable collections requires that a key's hash
> value is immutable (if the object's hash value changes, it will be in
> the wrong hash bucket)"
>
> Yet:
>
>     import dataclasses
>
>     @dataclasses.dataclass(hash=True)
>     class A:
>         foo: int = dataclasses.field(hash=True, compare=True)
>
>     a = A(foo=1)
>
>     s = set()
>     s.add(a)   # s == {a}
>     a.foo = 2
>
>     print(a in s)
>     print({a} == s)
>     print(s == s)
>
>     # prints False False True
>
>
> This looks to me like a clearly wrong behavior.
>
>
>                                      Elvis

Data classes do not protect you from doing the wrong thing. This is the same as writing:

class A:
    def __init__(self, foo):
        self.foo = foo
    def __hash__(self):
        return hash((self.foo,))

You're allowed to override the parameters to dataclasses.dataclass for cases where you know what you're doing. Consenting adults, and all.

Eric.

From eric at trueblade.com Thu Feb 1 20:21:03 2018
From: eric at trueblade.com (Eric V. Smith)
Date: Thu, 1 Feb 2018 20:21:03 -0500
Subject: [Python-Dev] Dataclasses and correct hashability
In-Reply-To: <4b8e20d5-31db-69e3-2dbb-d2853d47c6c0@trueblade.com>
References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <4b8e20d5-31db-69e3-2dbb-d2853d47c6c0@trueblade.com>
Message-ID: <441a5309-b4b6-adb4-27ca-3431f123f7dc@trueblade.com>

On 2/1/2018 8:17 PM, Eric V. Smith wrote:
> On 2/1/2018 7:34 PM, Elvis Pranskevichus wrote:
>> There appears to be a critical omission from the current dataclass
>> implementation: it does not make hash=True fields immutable.
>>
>> Per Python spec:
>>
>> "the implementation of hashable collections requires that a key's hash
>> value is immutable (if the object's hash value changes, it will be in
>> the wrong hash bucket)"
>>
>> Yet:
>>
>>     import dataclasses
>>
>>     @dataclasses.dataclass(hash=True)
>>     class A:
>>         foo: int = dataclasses.field(hash=True, compare=True)
>>
>>     a = A(foo=1)
>>
>>     s = set()
>>     s.add(a)   # s == {a}
>>     a.foo = 2
>>
>>     print(a in s)
>>     print({a} == s)
>>     print(s == s)
>>
>>     # prints False False True
>>
>>
>> This looks to me like a clearly wrong behavior.
>>
>>
>>                                      Elvis
>
> Data classes do not protect you from doing the wrong thing. This is the
> same as writing:
>
> class A:
>     def __init__(self, foo):
>         self.foo = foo
>     def __hash__(self):
>         return hash((self.foo,))
>
> You're allowed to override the parameters to dataclasses.dataclass for
> cases where you know what you're doing. Consenting adults, and all.

I should add: This is why you shouldn't override the default (hash=None) unless you know what the consequences are. Can I ask why you want to specify hash=True?
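
A minimal sketch of the trade-off under discussion, for readers who want to try it. It assumes the dataclasses API as released in Python 3.7, where the decorator flag spelled hash= in this thread (written against 3.7.0b1) is named unsafe_hash=. The class names Plain, Mutable and Frozen are hypothetical and chosen only for illustration.

```python
import dataclasses


# Default settings (eq=True, not frozen): instances are unhashable.
@dataclasses.dataclass
class Plain:
    foo: int


# Hashable by field value but still mutable: the risky combination.
# unsafe_hash= is the released-API spelling; the thread uses hash=.
@dataclasses.dataclass(unsafe_hash=True)
class Mutable:
    foo: int


# Frozen instances get a field-based __hash__ and reject assignment.
@dataclasses.dataclass(frozen=True)
class Frozen:
    foo: int


unhashable_by_default = Plain.__hash__ is None

a = Mutable(foo=1)
s = {a}
a.foo = 2            # changes hash(a) after it was stored in the set
lost = a not in s    # the lookup now uses the new hash

b = Frozen(foo=1)
t = {b}
try:
    b.foo = 2
    rejected = False
except dataclasses.FrozenInstanceError:
    rejected = True
still_found = b in t

print(unhashable_by_default, lost, rejected, still_found)
```

On CPython 3.7 or later all four values are expected to be True: the default class cannot be hashed at all, the mutated Mutable instance can no longer be found in its set, and the Frozen instance refuses the assignment and stays findable.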
Eric

From elprans at gmail.com Thu Feb 1 20:29:57 2018
From: elprans at gmail.com (Elvis Pranskevichus)
Date: Thu, 01 Feb 2018 20:29:57 -0500
Subject: [Python-Dev] Dataclasses and correct hashability
In-Reply-To: <441a5309-b4b6-adb4-27ca-3431f123f7dc@trueblade.com>
References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <4b8e20d5-31db-69e3-2dbb-d2853d47c6c0@trueblade.com> <441a5309-b4b6-adb4-27ca-3431f123f7dc@trueblade.com>
Message-ID: <34493569.S535hZLbfa@klinga.prans.org>

On Thursday, February 1, 2018 8:21:03 PM EST Eric V. Smith wrote:
> I should add: This is why you shouldn't override the default
> (hash=None) unless you know what the consequences are. Can I ask
> why you want to specify hash=True?

hash=None and compare=True leads to the same result, which, I think, is even worse.

> You're allowed to override the parameters to dataclasses.dataclass
> for cases where you know what you're doing. Consenting adults,
> and all.

I don't agree with this. You are comparing implicit dataclass behavior with an explicit shoot-in-the-foot __hash__() definition.

                                     Elvis

From eric at trueblade.com Thu Feb 1 20:37:41 2018
From: eric at trueblade.com (Eric V. Smith)
Date: Thu, 1 Feb 2018 20:37:41 -0500
Subject: [Python-Dev] Dataclasses and correct hashability
In-Reply-To: <34493569.S535hZLbfa@klinga.prans.org>
References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <4b8e20d5-31db-69e3-2dbb-d2853d47c6c0@trueblade.com> <441a5309-b4b6-adb4-27ca-3431f123f7dc@trueblade.com> <34493569.S535hZLbfa@klinga.prans.org>
Message-ID: <0a81521e-74f2-4883-6ab0-96e6fb03a419@trueblade.com>

On 2/1/2018 8:29 PM, Elvis Pranskevichus wrote:
> On Thursday, February 1, 2018 8:21:03 PM EST Eric V. Smith wrote:
>> I should add: This is why you shouldn't override the default
>> (hash=None) unless you know what the consequences are. Can I ask
>> why you want to specify hash=True?
>
> hash=None and compare=True leads to the same result, which, I think is
> even worse.

Have you actually tried that?

    >>> @dataclass(hash=None)
    ... class A:
    ...     foo: int = field(hash=True, compare=True)
    ...
    >>> hash(A(2))
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
    TypeError: unhashable type: 'A'

I believe the default hash=None on the class decorator does the right thing. Please provide a counter-example.

>> You're allowed to override the parameters to dataclasses.dataclass
>> for cases where you know what you're doing. Consenting adults,
>> and all.
>
> I don't agree with this. You are comparing implicit dataclass
> behavior with an explicit shoot-in-the-foot __hash__() definition.

I don't recommend ever specifying the decorator hash= parameter unless you have an unusual use case, in which case it's on you to get it right. There was recently a long python-dev discussion on this issue. I need to update the PEP to reflect it, but the advice still stands: you almost always want to use the default hash=None.

Do you have a use case for specifying a hash function on a class with mutable instances? Maybe you want frozen=True?

Eric

From elprans at gmail.com Thu Feb 1 20:49:44 2018
From: elprans at gmail.com (Elvis Pranskevichus)
Date: Thu, 01 Feb 2018 20:49:44 -0500
Subject: [Python-Dev] Dataclasses and correct hashability
In-Reply-To: <0a81521e-74f2-4883-6ab0-96e6fb03a419@trueblade.com>
References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <34493569.S535hZLbfa@klinga.prans.org> <0a81521e-74f2-4883-6ab0-96e6fb03a419@trueblade.com>
Message-ID: <1937305.WBmin8xbao@klinga.prans.org>

On Thursday, February 1, 2018 8:37:41 PM EST Eric V. Smith wrote:
> > hash=None and compare=True leads to the same result, which, I
> > think is even worse.
>
> Have you actually tried that?

I meant this:

    @dataclasses.dataclass(hash=True)
    class A:
        foo: int = dataclasses.field(compare=True)

> I don't recommend ever specifying the decorator hash= parameter
> unless you have an unusual use case, in which case it's on you to
> get it right.

In my experience this type of breakage is so subtle that people will happily write lots of code like this without noticing. My main objection here is that the dataclass does not go far enough to prevent obviously wrong behaviour. Or it goes too far with the whole hash/frozen distinction.

                                     Elvis

From ncoghlan at gmail.com Fri Feb 2 00:33:04 2018
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 2 Feb 2018 15:33:04 +1000
Subject: [Python-Dev] Dataclasses and correct hashability
In-Reply-To: <1937305.WBmin8xbao@klinga.prans.org>
References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <34493569.S535hZLbfa@klinga.prans.org> <0a81521e-74f2-4883-6ab0-96e6fb03a419@trueblade.com> <1937305.WBmin8xbao@klinga.prans.org>
Message-ID:

On 2 February 2018 at 11:49, Elvis Pranskevichus wrote:
> In my experience this type of breakage is so subtle that people will
> happily write lots of code like this without noticing. My main
> objection here is that the dataclass does not go far enough to prevent
> obviously wrong behaviour. Or it goes too far with the whole hash/
> frozen distinction.

For 3.7, I think we should seriously consider just straight up disallowing the "hash=True, frozen=False" combination, and instead require folks to provide their own hash function in that case. "Accidentally hashable" (whether by identity or field hash) isn't a thing that data classes should be allowing to happen.

If we did that, then the public "hash" parameter could potentially be dropped entirely for the time being - the replacement for "hash=True" would be a "def __hash__(self): ..." in the body of the class definition, and the replacement for "hash=False" would be "__hash__ = None" in the class body.

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From tjreedy at udel.edu Fri Feb 2 01:53:00 2018
From: tjreedy at udel.edu (Terry Reedy)
Date: Fri, 2 Feb 2018 01:53:00 -0500
Subject: [Python-Dev] Is object the most base type?
(bpo-20285) Message-ID: >>> object.__doc__ 'The most base type' I and several people on the python-list thread "interactive help on the base object" (Dec 2013) thought this could be improved. See https://bugs.python.org/issue20285 and https://github.com/python/cpython/pull/4759 After some research, I believe the following, which I wrote on the issue, explains the uneasiness many feel. 'Base' is actually two words. As a noun (or verb), it comes from Ancient Greek βάσις (básis), a foundation from which other things extend or derive. As an adjective, it comes from Late Latin bassus ('low'). In computer science and Python, the couplet 'base class' is being used, it seems to me and apparently others, as a noun-noun compound, meaning 'foundation class', not as an adjective-noun phrase meaning 'low class' (let alone 'depraved class'). However, 'most base class' must be parsed as '(most base) class', with 'base' re-interpreted as the adjective meaning 'low' (or worse). The switch in meaning of 'base' is similar in 'baseball' versus 'most base ball'. --- I have suggested "The superclass for all Python classes." "The starting base class of all types and classes other than itself." I intended to pick the second, but Serhiy Storchaka wants more opinions. -- Terry Jan Reedy From steve at pearwood.info Fri Feb 2 02:25:34 2018 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 2 Feb 2018 18:25:34 +1100 Subject: [Python-Dev] Is object the most base type? (bpo-20285) In-Reply-To: References: Message-ID: <20180202072534.GO26553@ando.pearwood.info> On Fri, Feb 02, 2018 at 01:53:00AM -0500, Terry Reedy wrote: > >>> object.__doc__ > 'The most base type' [...] > I have suggested > "The superclass for all Python classes." > "The starting base class of all types and classes other than itself." > > I intended to pick the second, but Serhiy Storchaka wants more opinions. Yay, bike-shedding! 
How about: "the base class of the class hierarchy" "the root of the class hierarchy" Java used to say this about Object: "The root class of the Java class hierarchy. All non-primitive types (including arrays) inherit either directly or indirectly from this class." but now says: "Class Object is the root of the class hierarchy. Every class has Object as a superclass. All objects, including arrays, implement the methods of this class." https://developer.android.com/reference/java/lang/Object.html Ruby says this about Object and BasicObject: "Object is the default root of all Ruby objects. Object inherits from BasicObject which allows creating alternate object hierarchies." "BasicObject is the parent class of all classes in Ruby. It's an explicit blank class." https://ruby-doc.org/core-2.5.0/Object.html https://ruby-doc.org/core-2.5.0/BasicObject.html The root of Julia's object hierarchy is Any, but if it is documented anywhere, I can't find it. The interactive help just says: Any No documentation found. Summary: abstract Any <: Any -- Steve From vadmium+py at gmail.com Fri Feb 2 06:23:13 2018 From: vadmium+py at gmail.com (Martin Panter) Date: Fri, 2 Feb 2018 11:23:13 +0000 Subject: [Python-Dev] Is object the most base type? (bpo-20285) In-Reply-To: <20180202072534.GO26553@ando.pearwood.info> References: <20180202072534.GO26553@ando.pearwood.info> Message-ID: > On Fri, Feb 02, 2018 at 01:53:00AM -0500, Terry Reedy wrote: >> >>> object.__doc__ >> 'The most base type' > [...] >> I have suggested >> "The superclass for all Python classes." >> "The starting base class of all types and classes other than itself." >> >> I intended to pick the second, but Serhiy Storchaka wants more opinions. On 2 February 2018 at 07:25, Steven D'Aprano wrote: > "the base class of the class hierarchy" > > "the root of the class hierarchy" The RST documentation says Object is a base for all classes. which is better than "the most base type" 
IMO, although I would write _the_ base rather than _a_ base; there is only one object class. In my patch for the RST documentation in I proposed This is the ultimate base class of all other classes. which I still think is okay (though other parts of my patch don't seem very good). From elprans at gmail.com Fri Feb 2 08:55:25 2018 From: elprans at gmail.com (Elvis Pranskevichus) Date: Fri, 02 Feb 2018 08:55:25 -0500 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <1937305.WBmin8xbao@klinga.prans.org> Message-ID: <2657249.hN7IFKUuXt@klinga.prans.org> On Friday, February 2, 2018 12:33:04 AM EST Nick Coghlan wrote: > For 3.7, I think we should seriously consider just straight up > disallowing the "hash=True, frozen=False" combination, and instead > require folks to provide their own hash function in that case. > "Accidentally hashable" (whether by identity or field hash) isn't a > thing that data classes should be allowing to happen. > > If we did that, then the public "hash" parameter could potentially > be dropped entirely for the time being - the replacement for > "hash=True" would be a "def __hash__(self): ..." in the body of the class > definition, and the replacement for "hash=False" would be "__hash__ > = None" in the class body. I think "frozen=True" should just imply hashability (by fields). You can always do "__hash__ = None" to opt out. I don't see the default hashability of an immutable object as a problem. tuples and frozensets are hashable after all. Elvis From eric at trueblade.com Fri Feb 2 10:08:43 2018 From: eric at trueblade.com (Eric V. 
Smith) Date: Fri, 2 Feb 2018 10:08:43 -0500 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <34493569.S535hZLbfa@klinga.prans.org> <0a81521e-74f2-4883-6ab0-96e6fb03a419@trueblade.com> <1937305.WBmin8xbao@klinga.prans.org> Message-ID: <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> On 2/2/2018 12:33 AM, Nick Coghlan wrote: > For 3.7, I think we should seriously consider just straight up > disallowing the "hash=True, frozen=False" combination, and instead > require folks to provide their own hash function in that case. > "Accidentally hashable" (whether by identity or field hash) isn't a > thing that data classes should be allowing to happen. > > If we did that, then the public "hash" parameter could potentially be > dropped entirely for the time being - the replacement for "hash=True" > would be a "def __hash__(self): ..." in the body of the class definition, > and the replacement for "hash=False" would be "__hash__ = None" in the > class body. attrs has the same behavior (if you ignore how dataclasses handles the cases where __hash__ or __eq__ already exist in the class definition). Here's what attrs says about adding __hash__ via hash=True: "Although not recommended, you can decide for yourself and force attrs to create one (e.g. if the class is immutable even though you didn't freeze it programmatically) by passing True or not. Both of these cases are rather special and should be used carefully." The problem with dropping hash=True is: how would you write __hash__ yourself? It seems like a bug magnet if you're adding fields to the class and forget to update __hash__, especially in the presence of per-field hash=False and eq=False settings. And you'd need to make sure it matches the generated __eq__ (if 2 objects are equal, they need to have the same hash value). If we're going to start disallowing things, how about the per-field hash=True, eq=False case? 
However, I don't feel very strongly about this. As I've said, I expect the use cases for hash=True to be very, very rare. And now that we allow overriding __hash__ in the class body without setting hash=False, there aren't a lot of uses for hash=False, either. But we would need to think through how you'd get the behavior of hash=False with multiple inheritance, if that's what you wanted. Again, a very, very rare case. In all, I think we're better off documenting best practices and making them the default, like attrs does, and leave it to the programmer to follow them. I realize we're handing out footguns, but the alternatives seem even more complex and are limiting. Eric From christian at python.org Fri Feb 2 10:23:20 2018 From: christian at python.org (Christian Heimes) Date: Fri, 2 Feb 2018 16:23:20 +0100 Subject: [Python-Dev] Deprecate crypt module and revert PR 3854 Message-ID: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org> Hi, in PR 3854 [1] Serhiy added blowfish, extended DES and NT-Hash to Python's crypt module. I vetoed the addition of these APIs because all these hashing algorithms are not state of the art. Their quality ranges from old to horribly, horribly broken beyond any repair. Shortly after the PR landed, I was made aware that glibc has deprecated the crypt(3) API [2] in favor of an external library called libxcrypt [3] from OpenWall Linux. I have patched Python 3.7 [4] to support libxcrypt. In light of the deprecation of the glibc crypt(3) function and the poor quality of these hashing algorithms, I'd like to raise the motion to revert 3854 and deprecate the crypt module. The whole module should rather be moved into a 3rd party library that wraps xcrypt. 
Regards, Christian [1] https://github.com/python/cpython/pull/3854 [2] https://www.phoronix.com/scan.php?page=news_item&px=Fedora-28-libxcrypt-Plans [3] https://github.com/besser82/libxcrypt [4] https://bugs.python.org/issue32635 From barry at python.org Fri Feb 2 10:24:47 2018 From: barry at python.org (Barry Warsaw) Date: Fri, 2 Feb 2018 10:24:47 -0500 Subject: [Python-Dev] Is object the most base type? (bpo-20285) In-Reply-To: <20180202072534.GO26553@ando.pearwood.info> References: <20180202072534.GO26553@ando.pearwood.info> Message-ID: On Feb 2, 2018, at 02:25, Steven D'Aprano wrote: > > On Fri, Feb 02, 2018 at 01:53:00AM -0500, Terry Reedy wrote: >>>>> object.__doc__ >> 'The most base type' Clearly that's a typo. It should be: "The most bass type" as in: "It all starts with the bass, the most important part of any band or class hierarchy. Sure, the drums type is very important, and the bass type and drums type work hand-in-hand to provide you with a solid funky foundation on which to groove, er, code. Don't let those guitar types tell you anything else." > "the base class of the class hierarchy" > > "the root of the class hierarchy" In all seriousness, either one of those works for me. While the first one does repeat the word "class", it seems the most clear description of what this thing is. -Barry From elprans at gmail.com Fri Feb 2 10:38:26 2018 From: elprans at gmail.com (Elvis Pranskevichus) Date: Fri, 02 Feb 2018 10:38:26 -0500 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> Message-ID: <1906618.bnMMLoM6on@hammer.magicstack.net> On Friday, February 2, 2018 10:08:43 AM EST Eric V. 
Smith wrote: > However, I don't feel very strongly about this. As I've said, I expect > the use cases for hash=True to be very, very rare. Why do you think that the requirement to make a dataclass hashable is a "very, very rare" requirement? The moment you want to use a dataclass as a dict key, or put it in a set, you need it to be hashable. Just put yourself in the shoes of an average Python developer. You try to put a dataclass in a set, you get a TypeError. Your immediate reaction is to add "hash=True". Things appear to work. Then, you, or someone else, decides to mutate the dataclass object and then you are looking at a very frustrating debug session. > In all, I think we're better off documenting best practices and making > them the default, like attrs does, and leave it to the programmer to > follow them. I realize we're handing out footguns I don't think attrs doing the same thing is a valid justification. This is a major footgun that is very easy to trigger, and there's really no precedent in standard data types. > the alternatives seem even more complex and are limiting. The alternative is simple and follows the design of other standard containers: immutable containers are hashable, mutable containers are not. @dataclass(frozen=False) gives you a SimpleNamespace-like and @dataclass(frozen=True) gives you a namedtuple-like. If you _really_ know what you are doing, then you can always declare an explicit __hash__. > The problem with dropping hash=True is: how would you write __hash__ > yourself? Is "def __hash__(self): return hash((self.field1, self.field2))" that hard? It is explicit, and you made a conscious choice, i.e. you understand how __hash__ works. IMO, the danger of "@dataclass(hash=True)" far outweighs whatever convenience it might provide. Elvis From eric at trueblade.com Fri Feb 2 10:51:11 2018 From: eric at trueblade.com (Eric V. 
Smith) Date: Fri, 2 Feb 2018 10:51:11 -0500 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: <1906618.bnMMLoM6on@hammer.magicstack.net> References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> Message-ID: <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com> On 2/2/2018 10:38 AM, Elvis Pranskevichus wrote: > On Friday, February 2, 2018 10:08:43 AM EST Eric V. Smith wrote: >> However, I don't feel very strongly about this. As I've said, I expect >> the use cases for hash=True to be very, very rare. > > Why do you think that the requirement to make a dataclass hashable is a > "very, very rare" requirement? The moment you want to use a dataclass as > a dict key, or put it in a set, you need it to be hashable. I was specifically talking about the case of a non-frozen, hashable class. If you want to make a class frozen and hashable, then: @dataclass(frozen=True) will do just that. The case you brought up initially is the non-frozen, hashable class. It's that case that I think is very rare. I'll ask again: what's your use case for wanting a non-frozen, hashable class? I'm genuinely interested. You seem to think that hash=True means "make the class hashable". That's not true. It means "add a __hash__ to this class". There are other, better ways to make the class hashable, specifically frozen=True. You might want to read all of https://bugs.python.org/issue32513 for the background on the current behavior. > Just put yourself in the shoes of an average Python developer. You try > to put a dataclass in a set, you get a TypeError. Your immediate > reaction is to add "hash=True". Things appear to work. Then, you, or > someone else, decides to mutate the dataclass object and then you are > looking at a very frustrating debug session. It will be documented that the correct way to do this is frozen=True. 
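A minimal sketch of the frozen=True route described above, assuming the dataclasses module as it ships in Python 3.7 and later. The class name Point and its fields are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass, FrozenInstanceError

# Hypothetical example class. frozen=True together with the default eq=True
# makes @dataclass generate a field-based __hash__.
@dataclass(frozen=True)
class Point:
    x: int
    y: int

p = Point(1, 2)
seen = {p}                  # usable in a set
lookup = {p: "first"}       # and as a dict key

# Equal field values give equal hashes, so a fresh instance finds the entry.
found = Point(1, 2) in seen

# Assignment is rejected, so the hash cannot change after insertion.
try:
    p.x = 10
    mutated = True
except FrozenInstanceError:
    mutated = False
```

Because the instance cannot be reassigned field by field, the hash computed at insertion time stays valid for as long as the object lives in the set or dict.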
>> In all, I think we're better off documenting best practices and making >> them the default, like attrs does, and leave it to the programmer to >> follow them. I realize we're handing out footguns > > I don't think attrs doing the same thing is a valid justification. This > is a major footgun that is very easy to trigger, and there's really no > precedent in standard data types. > >> the alternatives seem even more complex and are limiting. > > The alternative is simple and follows the design of other standard > containers: immutable containers are hashable, mutable containers are > not. @dataclass(frozen=False) gives you a SimpleNamespace-like and > @dataclass(frozen=True) gives you a namedtuple-like. If you _really_ > know what you are doing, then you can always declare an explicit > __hash__. I'm not sure what you're arguing for here. This is how dataclasses work. >> The problem with dropping hash=True is: how would you write __hash__ >> yourself? > > Is "def __hash__(self): return hash((self.field1, self.field2))" that > hard? It is explicit, and you made a conscious choice, i.e. you > understand how __hash__ works. I didn't say it was hard, I said it needed to be kept up to date as you add fields. That is, you'd have to duplicate the field list. dataclasses is trying to prevent you from repeating the field list anywhere. > IMO, the danger of > "@dataclass(hash=True)" far outweighs whatever convenience it might > provide. We'll just have to disagree about this. As I said, I don't feel very strongly about it, but I lean toward leaving it in and documenting it for what it is and does. 
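One way to hand-write a __hash__ without repeating the field list is to derive it from dataclasses.fields(). The sketch below is an illustration under stated assumptions, not how dataclasses generates its own hash: the helper name hash_from_fields and the class Version are hypothetical, and the helper consults only each field's compare flag so that the hash stays consistent with the generated __eq__.

```python
from dataclasses import dataclass, field, fields

def hash_from_fields(obj):
    """Hash exactly the fields that the generated __eq__ compares."""
    return hash(tuple(getattr(obj, f.name) for f in fields(obj) if f.compare))

# Hypothetical example class, deliberately left mutable.
@dataclass
class Version:
    major: int
    minor: int
    # compare=False keeps this field out of __eq__ and out of the hash above.
    note: str = field(default="", compare=False)

    # An explicit __hash__ in the class body is left untouched by @dataclass.
    __hash__ = hash_from_fields

a = Version(1, 2, note="first build")
b = Version(1, 2, note="rebuild")
```

Adding a field to Version later updates both __eq__ and this hash with no further edits. The class is still mutable, so the caveat about changing an instance after it has been stored in a set or dict applies unchanged.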
Eric From p.f.moore at gmail.com Fri Feb 2 10:51:37 2018 From: p.f.moore at gmail.com (Paul Moore) Date: Fri, 2 Feb 2018 15:51:37 +0000 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: <1906618.bnMMLoM6on@hammer.magicstack.net> References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> Message-ID: On 2 February 2018 at 15:38, Elvis Pranskevichus wrote: > On Friday, February 2, 2018 10:08:43 AM EST Eric V. Smith wrote: >> However, I don't feel very strongly about this. As I've said, I expect >> the use cases for hash=True to be very, very rare. > > Why do you think that the requirement to make a dataclass hashable is a > "very, very rare" requirement? The moment you want to use a dataclass as > a dict key, or put it in a set, you need it to be hashable. > > Just put yourself in the shoes of an average Python developer. You try > to put a dataclass in a set, you get a TypeError. Your immediate > reaction is to add "hash=True". Things appear to work. Then, you, or > someone else, decides to mutate the dataclass object and then you are > looking at a very frustrating debug session. If I saw someone try to put a dataclass into a set, I'd point out that dataclasses are *mutable*, and if they want immutable values they should use "frozen=True". If it were me in that situation, that's what I'd do as well. Adding hashability to a mutable object would *never* be my immediate reaction. To put it another way, using your words above, "The moment you want to use a dataclass as a dict key, or put it in a set, you need it to be *immutable*" (not hashable, unless you really know what you're doing). 
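The failure mode under debate can be sketched as follows. This is an illustration only: the class Key is hypothetical, and an explicit __hash__ in the class body stands in for the hash=True decorator argument discussed in this thread, whose final spelling had not been settled at this point.

```python
from dataclasses import dataclass

# Hypothetical mutable dataclass with a field-based hash.
@dataclass
class Key:
    value: int

    def __hash__(self):
        return hash((self.value,))

k = Key(1)
s = {k}

# Mutating the object changes its hash after it was stored in the set.
k.value = 2

# The very same object is no longer found: the lookup uses the new hash,
# while the set still files the entry under the old one.
still_found = k in s

# The entry has not gone anywhere; it is just unreachable by lookup.
still_present = any(item is k for item in s)
```

Nothing raises here, which is the point made above: the code appears to work until an instance is mutated after being used as a set member or dict key.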
Paul From elprans at gmail.com Fri Feb 2 10:56:46 2018 From: elprans at gmail.com (Elvis Pranskevichus) Date: Fri, 02 Feb 2018 10:56:46 -0500 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com> References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com> Message-ID: <3259286.YNdnAkMY4U@hammer.magicstack.net> On Friday, February 2, 2018 10:51:11 AM EST Eric V. Smith wrote: > I was specifically talking about the case of a non-frozen, hashable > class. If you want to make a class frozen and hashable, then: > > @dataclass(frozen=True) > > will do just that. > > The case you brought up initially is the non-frozen, hashable class. > It's that case that I think is very rare. I'll ask again: what's your > use case for wanting a non-frozen, hashable class? I'm genuinely > interested. My point is exactly that there is _no_ valid use case, so (hash=True, frozen=False) should not be a thing! Why are you so insistent on adding a dangerous option which you admit is nearly useless? Elvis From yselivanov.ml at gmail.com Fri Feb 2 11:14:54 2018 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Fri, 2 Feb 2018 11:14:54 -0500 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> Message-ID: On Fri, Feb 2, 2018 at 10:51 AM, Paul Moore wrote: [..] > To put it another way, using your words above, "The moment you want to > use a dataclass as a dict key, or put it in a set, you need it to be > *immutable*" (not hashable, unless you really know what you're doing). Can someone clarify what is the actual use case of someone *knowingly* making a mutable collection hashable? Why can't that advanced user write their own __hash__ implementation? It's easy to do so. 
For what it's worth I think this argument is being blindly used to justify the current questionable design of "dataclass(hash=True)" being the same as the "dataclass(hash=True, frozen=False)" case. At least a few other core developers are concerned with this, but all I see is "attrs does the same thing". Eric, in my opinion we shouldn't copy attrs. It was designed as an external package with its own backwards-compatibility story. At some point it was realized that "attrs(hash=True, frozen=False)" is an anti-pattern, but it couldn't be removed at that point. Hence the warning in the documentation. We can do better. We are designing a new API that is going to be hugely popular. Why can't we ship it with dangerous options prohibited in 3.7 (it's easy to do that!) and then enable them in 3.8 when there's an actual clear use case? Yury From guido at python.org Fri Feb 2 11:18:06 2018 From: guido at python.org (Guido van Rossum) Date: Fri, 2 Feb 2018 08:18:06 -0800 Subject: [Python-Dev] Deprecate crypt module and revert PR 3854 In-Reply-To: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org> References: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org> Message-ID: I'm all for nudging people in the direction of xcrypt. I assume we can't just switch the C-level crypt with xcrypt and leave the Python API unchanged? However until a usable solution exists (either in the stdlib or as 3rd party) I don't think we should deprecate anything (deprecating things before the replacement is ready is stressful for everyone involved). I'm also not sure I agree with removing support for old hashes. By all means put in the docs that they are unsafe. But if someone has a database full of old hashes it would be nice to be able to at least read/verify it, right? Was a release already made with blowfish, extended DES and NT-Hash? (And what's so bad with blowfish? It's mentioned in the heading of the xcrypt project too.) 
On Fri, Feb 2, 2018 at 7:23 AM, Christian Heimes wrote: > Hi, > > in PR 3854 [1] Serhiy added blowfish, extended DES and NT-Hash to > Python's crypt module. I vetoed the addition of these APIs because > all these hashing algorithms are not state of the art. Their quality > ranges from old to horribly, horribly broken beyond any repair. > > Shortly after the PR landed, I was made aware that glibc has > deprecated the crypt(3) API [2] in favor of an external library called > libxcrypt [3] from OpenWall Linux. I have patched Python 3.7 [4] to > support libxcrypt. > > In light of the deprecation of the glibc crypt(3) function and the poor quality of > these hashing algorithms, I'd like to raise the motion to revert 3854 and > deprecate the crypt module. The whole module should rather be moved into > a 3rd party library that wraps xcrypt. > > Regards, > Christian > > [1] https://github.com/python/cpython/pull/3854 > [2] > https://www.phoronix.com/scan.php?page=news_item&px=Fedora- > 28-libxcrypt-Plans > [3] https://github.com/besser82/libxcrypt > [4] https://bugs.python.org/issue32635 -- --Guido van Rossum (python.org/~guido) From elprans at gmail.com Fri Feb 2 11:48:04 2018 From: elprans at gmail.com (Elvis Pranskevichus) Date: Fri, 02 Feb 2018 11:48:04 -0500 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <3259286.YNdnAkMY4U@hammer.magicstack.net> Message-ID: <4737330.CapxtRJ2dz@hammer.magicstack.net> > Because it's not the default, it will be documented as being an > advanced use case, and it's useful in rare instances. 
> > And as I've said a number of times, both here and in other > discussions, I'm not arguing strenuously for this. I just think that, > given that it's not the default and it's not recommended and is > useful in advanced cases, I would prefer to leave it in. I understand > that you disagree with me. Is there a real world example of such an "advanced case"? Eric, have you read https://github.com/python-attrs/attrs/issues/136 ? Specifically this comment from Hynek [1]: "I never really thought about it, but yeah mutable objects shouldn't have a __hash__ at all." It is clear from that thread that "hash=True" was an early design mistake, which was left in for compatibility reasons. Why are we copying bad design to the standard library? Elvis [1] https://github.com/python-attrs/attrs/issues/136#issuecomment-277425421 From eric at trueblade.com Fri Feb 2 11:09:26 2018 From: eric at trueblade.com (Eric V. Smith) Date: Fri, 2 Feb 2018 11:09:26 -0500 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: <3259286.YNdnAkMY4U@hammer.magicstack.net> References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com> <3259286.YNdnAkMY4U@hammer.magicstack.net> Message-ID: On 2/2/2018 10:56 AM, Elvis Pranskevichus wrote: > On Friday, February 2, 2018 10:51:11 AM EST Eric V. Smith wrote: >> I was specifically talking about the case of a non-frozen, hashable >> class. If you want to make a class frozen and hashable, then: >> >> @dataclass(frozen=True) >> >> will do just that. >> >> The case you brought up initially is the non-frozen, hashable class. >> It's that case that I think is very rare. I'll ask again: what's your >> use case for wanting a non-frozen, hashable class? I'm genuinely >> interested. > > My point is exactly that there is _no_ valid use case, so (hash=True, > frozen=False) should not be a thing! 
Why are you so insistent on adding > a dangerous option which you admit is nearly useless? Because it's not the default, it will be documented as being an advanced use case, and it's useful in rare instances. And as I've said a number of times, both here and in other discussions, I'm not arguing strenuously for this. I just think that, given that it's not the default and it's not recommended and is useful in advanced cases, I would prefer to leave it in. I understand that you disagree with me. Eric From storchaka at gmail.com Fri Feb 2 12:05:43 2018 From: storchaka at gmail.com (Serhiy Storchaka) Date: Fri, 2 Feb 2018 19:05:43 +0200 Subject: [Python-Dev] Deprecate crypt module and revert PR 3854 In-Reply-To: References: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org> Message-ID: 02.02.18 18:18, Guido van Rossum wrote: > I'm all for nudging people in the direction of xcrypt. I assume we can't > just switch the C-level crypt with xcrypt and leave the Python API > unchanged? > > However until a usable solution exists (either in the stdlib or as 3rd > party) I don't think we should deprecate anything (deprecating things > before the replacement is ready is stressful for everyone involved). > > I'm also not sure I agree with removing support for old hashes. By all > means put in the docs that they are unsafe. But if someone has a > database full of old hashes it would be nice to be able to at least > read/verify it, right? > > Was a release already made with blowfish, extended DES and NT-Hash? (And > what's so bad with blowfish? It's mentioned in the heading of the xcrypt > project too.) To clarify, extended DES and NT-Hash were not added. They were removed from my PR after Christian's request. Only the Blowfish method was added, and it is as strong as the SHA-2 methods. It is the only method supported on OpenBSD. This PR is not the only enhancement made to the crypt module recently. I also extended tests and added support for configuring SHA-2 methods. 
There is an open PR (not merged before 3.7b1 unfortunately) for using crypt_r() instead of crypt(): https://bugs.python.org/issue28503. If we deprecate the crypt module, should the pwd, grp and spwd modules be deprecated too? The crypt module is needed for checking password hashes provided by spwd. From status at bugs.python.org Fri Feb 2 12:09:48 2018 From: status at bugs.python.org (Python tracker) Date: Fri, 2 Feb 2018 18:09:48 +0100 (CET) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20180202170948.85B4D56864@psf.upfronthosting.co.za> ACTIVITY SUMMARY (2018-01-26 - 2018-02-02) Python tracker at https://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. Issues counts and deltas: open 6411 (-34) closed 38089 (+101) total 44500 (+67) Open issues with patches: 2492 Issues opened (35) ================== #32221: Converting ipv6 address to string representation using getname https://bugs.python.org/issue32221 reopened by yselivanov #32622: Implement loop.sendfile https://bugs.python.org/issue32622 reopened by vstinner #32683: isinstance is calling ob.__getattribute__ as a fallback instea https://bugs.python.org/issue32683 opened by bup #32684: asyncio.gather(..., return_exceptions=True) swallows cancellat https://bugs.python.org/issue32684 opened by socketpair #32689: shutil.move raises AttributeError if first argument is a pathl https://bugs.python.org/issue32689 opened by craigh #32691: "pdb -m " sets __main__.__package__ incorrectly https://bugs.python.org/issue32691 opened by ncoghlan #32692: test_threading.test_set_and_clear fails in AppVeyor CI https://bugs.python.org/issue32692 opened by xiang.zhang #32694: Can no longer specify OpenSSL locations with CPPFLAGS / LDFLAG https://bugs.python.org/issue32694 opened by yselivanov #32695: tarfile.open() raises TypeError when using compresslevel param https://bugs.python.org/issue32695 opened by bbayles #32696: Fix pickling 
exceptions with multiple arguments https://bugs.python.org/issue32696 opened by slallum #32706: test_check_hostname() of test_ftplib started to fail randomly https://bugs.python.org/issue32706 opened by vstinner #32708: test_sendfile() hangs on AMD64 FreeBSD 10.x Shared 3.x buildbo https://bugs.python.org/issue32708 opened by vstinner #32710: test_asyncio leaked [4, 4, 3] memory blocks, sum=11 on AMD64 W https://bugs.python.org/issue32710 opened by vstinner #32713: tarfile.itn breaks if n is a negative float https://bugs.python.org/issue32713 opened by j0ffrey #32715: Make create_unix_server for SOCK_DGRAM work https://bugs.python.org/issue32715 opened by holger+lp #32716: setup.py register --repository is broken https://bugs.python.org/issue32716 opened by shimizukawa #32717: Document PEP 560 https://bugs.python.org/issue32717 opened by levkivskyi #32718: Install PowerShell activation scripts for venv for all platfor https://bugs.python.org/issue32718 opened by brett.cannon #32719: fatal error raised when Ctrl-C print loop https://bugs.python.org/issue32719 opened by xiang.zhang #32720: Format mini-language integer definition is incorrect https://bugs.python.org/issue32720 opened by ncoghlan #32723: codecs.open silently ignores argument errors https://bugs.python.org/issue32723 opened by xiang.zhang #32725: Instance of _multiprocessing.PipeConnection-subtype crash on d https://bugs.python.org/issue32725 opened by hakril #32726: macOS installer and framework enhancements and changes for 3.7 https://bugs.python.org/issue32726 opened by ned.deily #32728: Extend zipfile's compression level support to LZMA https://bugs.python.org/issue32728 opened by bbayles #32729: socket error handling needed https://bugs.python.org/issue32729 opened by rkdls #32730: Allow py launcher to launch other registered Pythons https://bugs.python.org/issue32730 opened by petsuter at gmail.com #32731: getpass.getuser() raises an unspecified exceptions (ImportErro 
https://bugs.python.org/issue32731 opened by gregory.p.smith #32732: LoggingAdapter ignores extra kwargs of Logger#log() https://bugs.python.org/issue32732 opened by mcoolive #32734: Asyncio Lock safety issue (unlimited acquire) https://bugs.python.org/issue32734 opened by bar.harel #32739: collections.deque rotate(n=1) default value not documented https://bugs.python.org/issue32739 opened by yuy #32742: zipfile extractall needlessly re-wraps ZipInfo instances https://bugs.python.org/issue32742 opened by peterbe #32743: Typo in hamt.c comments https://bugs.python.org/issue32743 opened by delimitry #32745: ctypes string pointer fields should accept embedded null chara https://bugs.python.org/issue32745 opened by theller #32746: More misspellings, mostly in source code. https://bugs.python.org/issue32746 opened by terry.reedy #32749: Remove dbm.dumb behavior deprecated in 3.6 https://bugs.python.org/issue32749 opened by serhiy.storchaka Most recent 15 issues with no replies (15) ========================================== #32749: Remove dbm.dumb behavior deprecated in 3.6 https://bugs.python.org/issue32749 #32746: More misspellings, mostly in source code. 
https://bugs.python.org/issue32746 #32728: Extend zipfile's compression level support to LZMA https://bugs.python.org/issue32728 #32718: Install PowerShell activation scripts for venv for all platfor https://bugs.python.org/issue32718 #32717: Document PEP 560 https://bugs.python.org/issue32717 #32716: setup.py register --repository is broken https://bugs.python.org/issue32716 #32713: tarfile.itn breaks if n is a negative float https://bugs.python.org/issue32713 #32708: test_sendfile() hangs on AMD64 FreeBSD 10.x Shared 3.x buildbo https://bugs.python.org/issue32708 #32696: Fix pickling exceptions with multiple arguments https://bugs.python.org/issue32696 #32695: tarfile.open() raises TypeError when using compresslevel param https://bugs.python.org/issue32695 #32692: test_threading.test_set_and_clear fails in AppVeyor CI https://bugs.python.org/issue32692 #32682: test_zlib improve version parsing https://bugs.python.org/issue32682 #32676: test_asyncio emits many warnings when run in debug mode https://bugs.python.org/issue32676 #32671: redesign Windows os.getlogin, and add os.getuser https://bugs.python.org/issue32671 #32663: SMTPUTF8SimTests are not actually being run https://bugs.python.org/issue32663 Most recent 15 issues waiting for review (15) ============================================= #32749: Remove dbm.dumb behavior deprecated in 3.6 https://bugs.python.org/issue32749 #32746: More misspellings, mostly in source code. 
https://bugs.python.org/issue32746 #32743: Typo in hamt.c comments https://bugs.python.org/issue32743 #32742: zipfile extractall needlessly re-wraps ZipInfo instances https://bugs.python.org/issue32742 #32739: collections.deque rotate(n=1) default value not documented https://bugs.python.org/issue32739 #32734: Asyncio Lock safety issue (unlimited acquire) https://bugs.python.org/issue32734 #32729: socket error handling needed https://bugs.python.org/issue32729 #32726: macOS installer and framework enhancements and changes for 3.7 https://bugs.python.org/issue32726 #32720: Format mini-language integer definition is incorrect https://bugs.python.org/issue32720 #32713: tarfile.itn breaks if n is a negative float https://bugs.python.org/issue32713 #32706: test_check_hostname() of test_ftplib started to fail randomly https://bugs.python.org/issue32706 #32695: tarfile.open() raises TypeError when using compresslevel param https://bugs.python.org/issue32695 #32694: Can no longer specify OpenSSL locations with CPPFLAGS / LDFLAG https://bugs.python.org/issue32694 #32691: "pdb -m " sets __main__.__package__ incorrectly https://bugs.python.org/issue32691 #32689: shutil.move raises AttributeError if first argument is a pathl https://bugs.python.org/issue32689 Top 10 most discussed issues (10) ================================= #32394: socket lib beahavior change in 3.6.4 https://bugs.python.org/issue32394 11 msgs #32622: Implement loop.sendfile https://bugs.python.org/issue32622 11 msgs #32604: Expose the subinterpreters C-API in Python for testing use. 
https://bugs.python.org/issue32604 10 msgs #32591: Deprecate sys.set_coroutine_wrapper and replace it with more f https://bugs.python.org/issue32591 8 msgs #17305: IDNA2008 encoding missing https://bugs.python.org/issue17305 7 msgs #25988: collections.abc.Indexable https://bugs.python.org/issue25988 7 msgs #32694: Can no longer specify OpenSSL locations with CPPFLAGS / LDFLAG https://bugs.python.org/issue32694 7 msgs #26219: implement per-opcode cache in ceval https://bugs.python.org/issue26219 6 msgs #30693: tarfile add uses random order https://bugs.python.org/issue30693 6 msgs #31368: Add os.preadv() and os.pwritev() https://bugs.python.org/issue31368 6 msgs Issues closed (93) ================== #14799: Tkinter ttk tests hang on linux https://bugs.python.org/issue14799 closed by asvetlov #16648: stdib should use new exception types from PEP 3151 https://bugs.python.org/issue16648 closed by asvetlov #16705: Use concrete classes inherited from OSError instead of errno c https://bugs.python.org/issue16705 closed by asvetlov #17044: Implement PEP 422: Simple class initialisation hook https://bugs.python.org/issue17044 closed by ncoghlan #20984: 'Add/Remove Programs' dialog missing entries for 32-bit CPytho https://bugs.python.org/issue20984 closed by steve.dower #21417: Compression level for zipfile https://bugs.python.org/issue21417 closed by gregory.p.smith #22908: ZipExtFile in zipfile can be seekable https://bugs.python.org/issue22908 closed by gregory.p.smith #24325: Speedup types.coroutine() https://bugs.python.org/issue24325 closed by yselivanov #25942: Do not immediately SIGKILL subprocess child processes upon ^C https://bugs.python.org/issue25942 closed by gregory.p.smith #27505: Missing documentation for setting module __class__ attribute https://bugs.python.org/issue27505 closed by ncoghlan #27931: Email parse IndexError <""@wiarcom.com> https://bugs.python.org/issue27931 closed by xiang.zhang #28864: Add devnull file-like object 
https://bugs.python.org/issue28864 closed by rhettinger #28914: selectmodule build fails https://bugs.python.org/issue28914 closed by vstinner #29120: Move hash randomisation initialisation out of Python/random.c https://bugs.python.org/issue29120 closed by ncoghlan #29216: Space saving step for the LRU cache https://bugs.python.org/issue29216 closed by rhettinger #30117: test_lib2to3.test_parser.test_all_project_files() fails https://bugs.python.org/issue30117 closed by benjamin.peterson #30306: release arguments of contextmanager https://bugs.python.org/issue30306 closed by ncoghlan #30715: Test_winreg, test_dynamic_key hangs on my Win 10 https://bugs.python.org/issue30715 closed by terry.reedy #31356: Add context manager to temporarily disable GC https://bugs.python.org/issue31356 closed by rhettinger #31429: TLS cipher suite compile time option for downstream https://bugs.python.org/issue31429 closed by christian.heimes #31853: Use super().method instead of socket.method in SSLSocket https://bugs.python.org/issue31853 closed by christian.heimes #31961: subprocess._execute_child doesn't accept a single PathLike arg https://bugs.python.org/issue31961 closed by gregory.p.smith #32045: Does json.dumps have a memory leak? 
https://bugs.python.org/issue32045 closed by csabella #32072: Issues with binary plists https://bugs.python.org/issue32072 closed by serhiy.storchaka #32102: Add "capture_output=True" option to subprocess.run https://bugs.python.org/issue32102 closed by gregory.p.smith #32137: Stack overflow in repr of deeply nested dicts https://bugs.python.org/issue32137 closed by serhiy.storchaka #32226: Implement PEP 560: Core support for typing module and generic https://bugs.python.org/issue32226 closed by levkivskyi #32228: truncate() changes current stream position https://bugs.python.org/issue32228 closed by pitrou #32251: Add asyncio.BufferedProtocol https://bugs.python.org/issue32251 closed by yselivanov #32336: Save OrderedDict import in argparse https://bugs.python.org/issue32336 closed by rhettinger #32373: Add socket.getblocking() method https://bugs.python.org/issue32373 closed by yselivanov #32433: Provide optimized HMAC digest https://bugs.python.org/issue32433 closed by christian.heimes #32446: ResourceLoader.get_data() should accept a PathLike https://bugs.python.org/issue32446 closed by barry #32454: Add socket.close(fd) function https://bugs.python.org/issue32454 closed by christian.heimes #32513: dataclasses: make it easier to use user-supplied special metho https://bugs.python.org/issue32513 closed by eric.smith #32521: NIS module fails to build due to the removal of interfaces rel https://bugs.python.org/issue32521 closed by christian.heimes #32532: improve sys.settrace and sys.setprofile documentation https://bugs.python.org/issue32532 closed by xiang.zhang #32546: Unusual TypeError with dataclass decorator https://bugs.python.org/issue32546 closed by eric.smith #32550: STORE_ANNOTATION bytecode is unnecessary and can be removed. 
https://bugs.python.org/issue32550 closed by rhettinger #32565: Document the version of adding opcodes https://bugs.python.org/issue32565 closed by serhiy.storchaka #32571: Speed up and clean up getting optional attributes in C code https://bugs.python.org/issue32571 closed by yselivanov #32572: Add the ftplib option, overrides the IP address. https://bugs.python.org/issue32572 closed by giampaolo.rodola #32583: Crash during decoding using UTF-16/32 and custom error handler https://bugs.python.org/issue32583 closed by xiang.zhang #32605: Should we really hide unawaited coroutine warnings when an exc https://bugs.python.org/issue32605 closed by yselivanov #32611: Tkinter taskbar icon (Windows) https://bugs.python.org/issue32611 closed by terry.reedy #32624: Implement WriteTransport.is_protocol_paused() https://bugs.python.org/issue32624 closed by asvetlov #32630: Migrate decimal to use PEP 567 context variables https://bugs.python.org/issue32630 closed by yselivanov #32640: Python 2.7 str.join documentation is incorrect https://bugs.python.org/issue32640 closed by Mariatta #32645: test_asyncio: TLS tests fail on "x86 Windows7" buildbot https://bugs.python.org/issue32645 closed by vstinner #32649: complete C API doc debug and profile part with new PyTrace_OPC https://bugs.python.org/issue32649 closed by ncoghlan #32650: Debug support for native coroutines is broken https://bugs.python.org/issue32650 closed by yselivanov #32664: Connector "|" missing between ImportError and LookupError https://bugs.python.org/issue32664 closed by Mariatta #32668: deepcopy() fails on ArgumentParser instances https://bugs.python.org/issue32668 closed by terry.reedy #32670: Enforce PEP 479???StopIteration and generators???in Python 3.7 https://bugs.python.org/issue32670 closed by yselivanov #32674: minor documentation fix for '__import__' https://bugs.python.org/issue32674 closed by Mariatta #32675: dict.__contains__(unhashable) raises TypeError where False was 
https://bugs.python.org/issue32675 closed by r.david.murray #32677: Add.isascii() to str, bytes and bytearray https://bugs.python.org/issue32677 closed by inada.naoki #32678: Lazy import ast in inspect https://bugs.python.org/issue32678 closed by inada.naoki #32681: Fix uninitialized variable in os_dup2_impl https://bugs.python.org/issue32681 closed by gregory.p.smith #32685: Erroneous suggestion in print statement https://bugs.python.org/issue32685 closed by ncoghlan #32686: python3.6.4 build hangs on test_asyncio macOS 10.13 https://bugs.python.org/issue32686 closed by ned.deily #32687: wrong meaning of arg in Py_tracefunc doc https://bugs.python.org/issue32687 closed by xiang.zhang #32688: weird comment out line of code https://bugs.python.org/issue32688 closed by xiang.zhang #32690: Return function locals() in order of creation? https://bugs.python.org/issue32690 closed by rhettinger #32693: os.path.ismount does not accept bytes objects https://bugs.python.org/issue32693 closed by eryksun #32697: Guarantee that Python preserves the order of kwonly parameters https://bugs.python.org/issue32697 closed by larry #32698: Improper gzip compression if output file extension is not "gz" https://bugs.python.org/issue32698 closed by Delgan #32699: pythonXY._pth : unclear how .pth files are handled https://bugs.python.org/issue32699 closed by steve.dower #32700: spam? 
https://bugs.python.org/issue32700 closed by r.david.murray #32701: Typo in quopri documentation https://bugs.python.org/issue32701 closed by Mariatta #32702: Minor markup typo https://bugs.python.org/issue32702 closed by rhettinger #32703: 'async with' somehow suppresses unawaited coroutine warnings https://bugs.python.org/issue32703 closed by yselivanov #32704: Tracking https://bugs.python.org/issue32704 closed by eric.smith #32705: Current Android does not have posix_spawn https://bugs.python.org/issue32705 closed by gregory.p.smith #32707: Python/hamt.c warnings https://bugs.python.org/issue32707 closed by yselivanov #32709: the iterable for itertools.groupby must be sorted https://bugs.python.org/issue32709 closed by matrixise #32711: Fix warnings for Python/ast_unparse.c https://bugs.python.org/issue32711 closed by christian.heimes #32712: Modifying a list/dict effects all variables sharing that addre https://bugs.python.org/issue32712 closed by r.david.murray #32714: remove resourceLoader related code since it's deprecated https://bugs.python.org/issue32714 closed by jayyin11043 #32721: test_hashlib fails if _md5 is not built https://bugs.python.org/issue32721 closed by benjamin.peterson #32722: Mislabeling of a dict comprehension as a generator expression https://bugs.python.org/issue32722 closed by xiang.zhang #32724: Fix references to commands in Doc/pdb.rst https://bugs.python.org/issue32724 closed by asvetlov #32727: smtplib's SMTP.send_message behaves differently with from_addr https://bugs.python.org/issue32727 closed by r.david.murray #32733: test_coroutines is failed when run with -We https://bugs.python.org/issue32733 closed by njs #32735: Typo in f-string example in docs https://bugs.python.org/issue32735 closed by Mariatta #32736: random.triangular yields unexpected distribution when args mix https://bugs.python.org/issue32736 closed by rhettinger #32737: str.format not producing strings for nan argument in accordanc 
https://bugs.python.org/issue32737 closed by zach.ware #32738: CDLL : Can't find module https://bugs.python.org/issue32738 closed by christian.heimes #32740: test_calendar and test_re fail with unknown locale: UTF-8 in _ https://bugs.python.org/issue32740 closed by vishalsingh #32741: Add asyncio.TimerHandle.when() function https://bugs.python.org/issue32741 closed by asvetlov #32744: PEP 342 double colons typos in code https://bugs.python.org/issue32744 closed by delimitry #32747: remove trailing spaces in docstring https://bugs.python.org/issue32747 closed by serhiy.storchaka #32748: Improve _asyncio.TaskStepMethWrapper and TaskWakeupMethWrapper https://bugs.python.org/issue32748 closed by asvetlov From chris.barker at noaa.gov Fri Feb 2 15:02:25 2018 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 2 Feb 2018 12:02:25 -0800 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: <1906618.bnMMLoM6on@hammer.magicstack.net> References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> Message-ID: On Fri, Feb 2, 2018 at 7:38 AM, Elvis Pranskevichus wrote: > On Friday, February 2, 2018 10:08:43 AM EST Eric V. Smith wrote: > > However, I don't feel very strongly about this. As I've said, I expect > > the use cases for hash=True to be very, very rare. > > Why do you think that the requirement to make a dataclass hashable is a > "very, very rare" requirement? I think what's rare is wanting hashability without it being frozen. > Just put yourself in the shoes of an average Python developer. You try > to put a dataclass in a set, you get a TypeError. Your immediate > reaction is to add "hash=True". Things appear to work. agreed, the easy and obvious way should be to make it frozen -- if it's hard to make it hashable and not frozen, then that's good. But it is nice to have the __hash__ generated more you.... 
so maybe a flag for "unfrozen_hashable" -- really klunky, but if that is a rare need, then there you go. Or maybe: If either hash or frozen is specified, it becomes both frozen and hashable. If both are specified, then it does what the user is asking for. -CHB -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From njs at pobox.com Fri Feb 2 15:21:58 2018 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 2 Feb 2018 12:21:58 -0800 Subject: [Python-Dev] Deprecate crypt module and revert PR 3854 In-Reply-To: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org> References: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org> Message-ID: On Feb 2, 2018 7:24 AM, "Christian Heimes" wrote: Shortly after the PR has landed, I was made aware that glibc has deprecated crypt(3) API [2] in favor of an external library called libxcrypt [3] from OpenWall Linux. I have patched Python 3.7 [4] to support libxcrypt. In light of deprecation of crypt(3) glibc function and bad quality of hashing algorithms, I'd like to raise the motion to revert 3854 and deprecate the crypt module. The whole module should rather be moved into a 3rd party library that wraps xcrypt. If the crypt module can just as well use libxcrypt, and that's how the distros are going to build it, then what's the advantage of deprecating it? Won't it just continue to work indefinitely? -n
From solipsis at pitrou.net Fri Feb 2 15:31:18 2018 From: solipsis at pitrou.net (Antoine Pitrou) Date: Fri, 2 Feb 2018 21:31:18 +0100 Subject: [Python-Dev] Deprecate crypt module and revert PR 3854 References: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org> Message-ID: <20180202213118.4143777d@fsol> On Fri, 2 Feb 2018 16:23:20 +0100 Christian Heimes wrote: > Hi, > > in PR 3854 [1] Serhiy added blowfish, extended DES and NT-Hash to > Python's crypt module. I vetoed against addition of the APIs because > all these hashing algorithms are not state of the art. Their quality > ranges from old to horribly, horribly broken beyond any repair. > > Shortly after the PR has landed, I was made aware that glibc has > deprecated crypt(3) API [2] in favor of an external library called > libxcrypt [3] from OpenWall Linux. I have patched Python 3.7 [4] to > support libxcrypt. > > In light of deprecation of crypt(3) glibc function and bad quality of > hashing algorithms, I'd like to raise the motion to revert 3854 and > deprecate the crypt module. Those are two separate proposals. On the topic of reverting PR #3854, I don't see the point. Is Blowfish more fragile than the other algorithms? If not, it sounds ok to add it. On the topic of deprecating the crypt module, that doesn't sound like a good idea right now. People may need to generate crypt()-compatible output for various reasons, such as being able to automate system administration tasks. Regards Antoine. From ja.py at farowl.co.uk Fri Feb 2 15:10:44 2018 From: ja.py at farowl.co.uk (Jeff Allen) Date: Fri, 2 Feb 2018 20:10:44 +0000 Subject: [Python-Dev] Is object the most base type? (bpo-20285) In-Reply-To: <20180202072534.GO26553@ando.pearwood.info> References: <20180202072534.GO26553@ando.pearwood.info> Message-ID: On 02/02/2018 07:25, Steven D'Aprano wrote: > How about: > "the base class of the class heirarchy" > > "the root of the class heirarchy" > > > Java ...
now says: > > "Class Object is the root of the class hierarchy. Every class has > Object as a superclass. All objects, including arrays, implement the > methods of this class." Either for me, but I feel I should draw attention to the spelling. (Java is right.) Ironically, the word derives from priesthood (hieratic), not from inheritance (heir). Jeff Allen From drsalists at gmail.com Fri Feb 2 16:25:34 2018 From: drsalists at gmail.com (Dan Stromberg) Date: Fri, 2 Feb 2018 13:25:34 -0800 Subject: [Python-Dev] Deprecate crypt module and revert PR 3854 In-Reply-To: <20180202213118.4143777d@fsol> References: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org> <20180202213118.4143777d@fsol> Message-ID: On Fri, Feb 2, 2018 at 12:31 PM, Antoine Pitrou wrote: > On Fri, 2 Feb 2018 16:23:20 +0100 > Christian Heimes wrote: >> Hi, >> >> in PR 3854 [1] Serhiy added blowfish, extended DES and NT-Hash to >> Python's crypt mdodule. I vetoed against addition of the APIs because >> all these hashing algorithms are not state of the art. Their quality >> ranges from old to horribly, horriblye broken beyond any repair. >> >> Shortly after the PR has landed, I was made aware that glibc has >> deprecated crypt(3) API [2] and favor of an external library called >> libxcrypt [3] from OpenWall Linux. I have patched Python 3.7 [4] to >> support libxcrypt. >> >> In light of deprecation of crypt(3) glibc function and bad quality of >> hashing algorithms, I'd like to raise the motion to revert 3854 and >> deprecate the crypt module. > > Those are two separate proposals. > > On the topic of reverting PR #3854, I don't see the point. Is Blowfish > more fragile than the other algorithms? If not, it sounds ok to add it. I'm no cryptographer, but I believe Blowfish's author recommends using Twofish instead now. > On the topic of deprecating the crypt module, that doesn't sound like a > good idea right now. 
People may need to generate crypt()-compatible > output for various reasons, such as being able to automate system > administration tasks. Encryption algorithms continue to be needed even after the time at which they should no longer be used in new code. But they probably should be documented as deprecated and warned about at runtime - after there are suitable alternatives. From ethan at stoneleaf.us Fri Feb 2 17:08:31 2018 From: ethan at stoneleaf.us (Ethan Furman) Date: Fri, 02 Feb 2018 14:08:31 -0800 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> Message-ID: <5A74E15F.9060809@stoneleaf.us> On 02/02/2018 08:14 AM, Yury Selivanov wrote: > Eric, in my opinion we shouldn't copy attrs. [...] > We are designing a new API that is going to be hugely popular. Why > can't we ship it with dangerous options prohibited in 3.7 (it's easy > to do that!) and then enable them in 3.8 when there's an actual clear > use case? +1 -- ~Ethan~ From ethan at stoneleaf.us Fri Feb 2 17:11:24 2018 From: ethan at stoneleaf.us (Ethan Furman) Date: Fri, 02 Feb 2018 14:11:24 -0800 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com> <3259286.YNdnAkMY4U@hammer.magicstack.net> Message-ID: <5A74E20C.6060204@stoneleaf.us> On 02/02/2018 08:09 AM, Eric V. Smith wrote: > On 2/2/2018 10:56 AM, Elvis Pranskevichus wrote: >> My point is exactly that there is _no_ valid use case, so (hash=True, >> frozen=False) should not be a thing! Why are you so insistent on adding >> a dangerous option which you admit is nearly useless? > > Because it's not the default, it will be documented as being an advanced use case, and it's useful in rare instances. 
Personally, I don't think advanced use-cases need to be supported by flags as they can be supported by just writing the __dunder__ methods. -- ~Ethan~ From mertz at gnosis.cx Fri Feb 2 19:04:25 2018 From: mertz at gnosis.cx (David Mertz) Date: Fri, 2 Feb 2018 16:04:25 -0800 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: <5A74E20C.6060204@stoneleaf.us> References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com> <3259286.YNdnAkMY4U@hammer.magicstack.net> <5A74E20C.6060204@stoneleaf.us> Message-ID: I agree with Ethan, Elvis, and a few others. I think 'hash=True, frozen=False' should be disabled in 3.7. It's an attractive nuisance. Maybe not so attractive because of its obscurity, but still with no clear reason to exist. If many users of dataclass find themselves defining '__hash__' with mutable dataclasses, it's perfectly possible to allow the switch combination later. But taking it out after previously allowing it, even if every use in the wild is actually a bug in waiting, is harder. On Feb 2, 2018 2:10 PM, "Ethan Furman" wrote: > On 02/02/2018 08:09 AM, Eric V. Smith wrote: > >> On 2/2/2018 10:56 AM, Elvis Pranskevichus wrote: >> > > My point is exactly that there is _no_ valid use case, so (hash=True, >>> frozen=False) should not be a thing! Why are you so insistent on adding >>> a dangerous option which you admit is nearly useless? >>> >> >> Because it's not the default, it will be documented as being an advanced >> use case, and it's useful in rare instances. >> > > Personally, I don't think advanced use-cases need to be supported by flags > as they can be supported by just writing the __dunder__ methods.
> > -- > ~Ethan~ > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/mertz% > 40gnosis.cx > From tjreedy at udel.edu Fri Feb 2 19:28:24 2018 From: tjreedy at udel.edu (Terry Reedy) Date: Fri, 2 Feb 2018 19:28:24 -0500 Subject: [Python-Dev] Is object the most base type? (bpo-20285) In-Reply-To: References: Message-ID: On 2/2/2018 1:53 AM, Terry Reedy wrote: > >>> object.__doc__ > 'The most base type' > > I and several people on python-list thread "interactive help on the base > object" (Dec 2013) thought this could be improved. On > https://bugs.python.org/issue20285 and > https://github.com/python/cpython/pull/4759 > > After some research, I believe the following, which I wrote on the > issue, explains the uneasiness many feel. > > 'Base' is actually two words. As a noun (or verb), it comes from > Ancient Greek βάσις (básis), a foundation from which other things extend > or derive. As an adjective, it comes from Late Latin bassus ("low"). > > In computer science and Python, the couplet 'base class' is being used, > it seems to me and apparently others, as a noun-noun compound, meaning, > 'foundation class', not as an adjective-noun phrase meaning 'low class' > (let alone 'depraved class'). However, 'most base class' must be parsed > as '(most base) class', with 'base' re-interpreted as the adjective > meaning 'low' (or worse). The switch in meaning of 'base' is similar in > 'baseball' versus 'most base ball'. > --- > > I have suggested > "The superclass for all Python classes." > "The starting base class of all types and classes other than itself." "a base for all classes." object entry in lib ref, function "the ultimate base class of all other classes."
Martin Panter object is neither a base nor a superclass of itself, so the first of each pair above is not quite right. "the base class of the class heirarchy ['hierarchy']" Steven D'Aprano "the root of the class heirarchy" ditto + quotes from Java and Ruby Jeff Allen and Barry Warsaw like either, with BW slightly preferring the first. I was thinking about 'tree' instead of 'hierarchy', but 'class tree' is wrong and 'directed acyclic graph of classes' a bit wordy. While hierarchies are often thought of as trees, it is possible for someone to report to more than one person. No one (yet) supported the status quo. Martin also suggested a continuation: "When called, it accepts no arguments and returns a new unique and featureless object." Even though object, unlike any other class, is primarily used as a base class rather than instance source, the latter should be included. For other built-in classes, the summary line describes instances. I would leave out 'unique' as that could be confused with 'singleton'. Provisional replacement, following the format of other class docstrings except for describing the class instead of instances, and intended to be helpful to beginners. "The base class of the class hierarchy. When called, it accepts no arguments and returns a new featureless instance that has no instance attributes and cannot be given any." Passing arguments and adding attributes are two common mistakes. -- Terry Jan Reedy From asrp at email.com Fri Feb 2 19:01:30 2018 From: asrp at email.com (asrp asrp) Date: Sat, 3 Feb 2018 01:01:30 +0100 Subject: [Python-Dev] A minimal Python interpreter written in Python for experimenting with language changes Message-ID: Hello, I don't know if this is the right place to post this. Please redirect as needed. I've made a small Python interpreter in Python with runtime AST node semantics and edit-and-continue. I thought it could make prototyping language changes easier and help visualize their usage before writing them in C.
Its here: https://github.com/asrp/python_terp So, for example, redefining the for_stmt function in the right scope changes the behaviour of future for loops at runtime. Although from discussion I've read in PEPs, actual implementation always look like a non-issue (which seems like magic to me) so maybe no-one here actually needs this. (I really needed edit-and-continue for one of my projects but of course, running it in this extra interpreter is much too slow.) asrp From tritium-list at sdamon.com Fri Feb 2 21:28:00 2018 From: tritium-list at sdamon.com (Alex Walters) Date: Fri, 2 Feb 2018 21:28:00 -0500 Subject: [Python-Dev] A minimal Python interpreter written in Python for experimenting with language changes In-Reply-To: References: Message-ID: <09cc01d39c96$9cb28910$d6179b30$@sdamon.com> Are you aware of pypy? > -----Original Message----- > From: Python-Dev [mailto:python-dev-bounces+tritium- > list=sdamon.com at python.org] On Behalf Of asrp asrp > Sent: Friday, February 2, 2018 7:02 PM > To: python-dev at python.org > Subject: [Python-Dev] A minimal Python interpreter written in Python for > experimenting with language changes > > Hello, > > I don't know if this is the right place to post this. Please redirect as needed. > > I've made a small Python interpreter in Python with runtime AST node > semantics and edit-and-continue. I thought it could make prototyping > language changes more easily and visualize usage before writing them in C. > > Its here: https://github.com/asrp/python_terp > > So, for example, redefining the for_stmt function in the right scope changes > the behaviour of future for loops at runtime. > > Although from discussion I've read in PEPs, actual implementation always > look like a non-issue (which seems like magic to me) so maybe no-one here > actually needs this. > > (I really needed edit-and-continue for one of my projects but of course, > running it in this extra interpreter is much too slow.) 
> > asrp > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/tritium- > list%40sdamon.com From tjreedy at udel.edu Sat Feb 3 00:50:11 2018 From: tjreedy at udel.edu (Terry Reedy) Date: Sat, 3 Feb 2018 00:50:11 -0500 Subject: [Python-Dev] A minimal Python interpreter written in Python for experimenting with language changes In-Reply-To: References: Message-ID: On 2/2/2018 7:01 PM, asrp asrp wrote: > I don't know if this is the right place to post this. Please redirect as needed. This list is for development *of* cpython. Development *with* python in general belongs on python-list. -- Terry Jan Reedy From ncoghlan at gmail.com Sat Feb 3 01:25:15 2018 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 3 Feb 2018 16:25:15 +1000 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <34493569.S535hZLbfa@klinga.prans.org> <0a81521e-74f2-4883-6ab0-96e6fb03a419@trueblade.com> <1937305.WBmin8xbao@klinga.prans.org> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> Message-ID: On 3 Feb. 2018 1:09 am, "Eric V. Smith" wrote: The problem with dropping hash=True is: how would you write __hash__ yourself? It seems like a bug magnet if you're adding fields to the class and forget to update __hash__, especially in the presence of per-field hash=False and eq=False settings. And you'd need to make sure it matches the generated __eq__ (if 2 objects are equal, they need to have the same hash value). I think anyone that does this needs to think *very* carefully about how they do it, and offering both "hash=True" and "frozen=True" is an attractive nuisance that means people will write "hash=True" when what they wanted was "frozen=True". 
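For readers following along, here is a minimal sketch of the hazard being argued about. It is an illustration only, not code from the thread. It assumes a released Python 3.7 or later, where the flag called hash=True in this discussion is spelled unsafe_hash=True; the class names MutablePoint and FrozenPoint are made up for the example.

```python
from dataclasses import dataclass, FrozenInstanceError


# Assumption: the thread's "hash=True" corresponds to "unsafe_hash=True"
# in released Python 3.7+. Class names here are hypothetical.
@dataclass(unsafe_hash=True)
class MutablePoint:
    x: int
    y: int


@dataclass(frozen=True)  # with the default eq=True this also generates __hash__
class FrozenPoint:
    x: int
    y: int


p = MutablePoint(1, 2)
seen = {p}
p.x = 99                  # the hash of p changes after it was stored in the set
lost = p not in seen      # the set can no longer find the very same object

q = FrozenPoint(1, 2)
try:
    q.x = 99
    mutated = True
except FrozenInstanceError:
    mutated = False       # frozen instances reject attribute assignment
```

The frozen variant stays usable as a set member or dict key precisely because its fields cannot be reassigned after construction, which seems to be the behaviour most people reaching for a hashable dataclass want.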
In particular, having to work out how to write a maintainable "__hash__" will encourage folks to separate out the hashed fields as a separate frozen subrecord or base class.

If this proves to be an intolerable burden then the shorthand spelling could be added back in 3.8, but once we ship it we're going to be stuck with explaining the interactions indefinitely.

Cheers,
Nick.

From guido at python.org Sat Feb 3 01:44:20 2018
From: guido at python.org (Guido van Rossum)
Date: Fri, 2 Feb 2018 22:44:20 -0800
Subject: [Python-Dev] Dataclasses and correct hashability
In-Reply-To:
References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <34493569.S535hZLbfa@klinga.prans.org> <0a81521e-74f2-4883-6ab0-96e6fb03a419@trueblade.com> <1937305.WBmin8xbao@klinga.prans.org> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com>
Message-ID:

It appears Eric and I are the only ones in favor of keeping the current behavior. But I still am not convinced by all the worries about "attractive nuisances" and all the other bad names this feature has been called. We don't know that any of the doomsday scenarios will happen. In my experience, usually once something is rolled out the set of issues that are *actually* raised is entirely different from the things its designers were worried about.

Please don't commit a change to roll this back without checking in with me; I have some misgivings about the problem being raised here that I still need to think through more carefully.

In the meantime, please try to use dataclasses with 3.7b1!

--
--Guido van Rossum (python.org/~guido)

From steve at pearwood.info Sat Feb 3 02:12:43 2018
From: steve at pearwood.info (Steven D'Aprano)
Date: Sat, 3 Feb 2018 18:12:43 +1100
Subject: [Python-Dev] A minimal Python interpreter written in Python for experimenting with language changes
In-Reply-To:
References:
Message-ID: <20180203071242.GQ26553@ando.pearwood.info>

On Sat, Feb 03, 2018 at 12:50:11AM -0500, Terry Reedy wrote:
> On 2/2/2018 7:01 PM, asrp asrp wrote:
>> I don't know if this is the right place to post this. Please redirect as needed.
>
> This list is for development *of* cpython. Development *with* python in general belongs on python-list.

This list is for development of Python the language, not just CPython the interpreter.
It seems to me that announcing a new Python interpreter, especially one designed for the purpose of allowing rapid experimentation with the language, is on topic for this list.

--
Steve

From steve at pearwood.info Sat Feb 3 02:27:46 2018
From: steve at pearwood.info (Steven D'Aprano)
Date: Sat, 3 Feb 2018 18:27:46 +1100
Subject: [Python-Dev] A minimal Python interpreter written in Python for experimenting with language changes
In-Reply-To:
References:
Message-ID: <20180203072745.GR26553@ando.pearwood.info>

On Sat, Feb 03, 2018 at 01:01:30AM +0100, asrp asrp wrote:

> I've made a small Python interpreter in Python with runtime AST node semantics and edit-and-continue. I thought it could make prototyping language changes more easily and visualize usage before writing them in C.

That sounds interesting.

> It's here: https://github.com/asrp/python_terp
>
> So, for example, redefining the for_stmt function in the right scope changes the behaviour of future for loops at runtime.

Can you give an example of how you would do that? I don't mean the mechanism used, I mean how would a developer implement a new syntactic feature. Suppose I wanted to add a new clause to for...else, let's say:

    for ... :
        block
    otherwise:
        # runs only if the for-loop was empty

How would I do that?

(I've read the example on your github page, and it doesn't have much detail.)

--
Steve

From christian at python.org Sat Feb 3 03:36:32 2018
From: christian at python.org (Christian Heimes)
Date: Sat, 3 Feb 2018 09:36:32 +0100
Subject: [Python-Dev] Deprecate crypt module and revert PR 3854
In-Reply-To:
References: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org>
Message-ID: <3f008741-3bc8-d3aa-890d-a45688669183@python.org>

On 2018-02-02 18:05, Serhiy Storchaka wrote:
> 02.02.18 18:18, Guido van Rossum wrote:
>> I'm all for nudging people in the direction of xcrypt. I assume we can't just switch the C-level crypt with xcrypt and leave the Python API unchanged?
>>
>> However until a usable solution exists (either in the stdlib or as 3rd party) I don't think we should deprecate anything (deprecating things before the replacement is ready is stressful for everyone involved).
>>
>> I'm also not sure I agree with removing support for old hashes. By all means put in the docs that they are unsafe. But if someone has a database full of old hashes it would be nice to be able to at least read/verify it, right?
>>
>> Was a release already made with blowfish, extended DES and NT-Hash? (And what's so bad with blowfish? It's mentioned in the heading of the xcrypt project too.)
>
> To clarify, extended DES and NT-Hash were not added. They were removed from my PR after Christian's request. Only the Blowfish method was added, and it is as strong as the SHA-2 methods. It is the only method supported on OpenBSD.

Ah, I misinterpreted the subject of the PR. The closed PR still mentions extended DES and NT-Hash. I'm sorry and blame my travel fatigue. The email was written at the airport after I had a conversation with somebody about new Python 3.7 features.

> This PR is not the only enhancement made in the crypt module recently. I also extended tests and added support for configuring SHA-2 methods. There is an open PR (not merged before 3.7b1 unfortunately) for using crypt_r() instead of crypt(): https://bugs.python.org/issue28503.

In general I'm all for more tests and improvements of existing modules. However in this case Python 3.7 is sending the wrong signals. For example, the addition of blowfish was prominently featured in the largest German newsletter for IT. Both blowfish and SSHA (salted sha) are legacy password hashing algorithms. Glibc has moved them out of the main library for a good reason. (*)

> If we deprecate the crypt module, should modules pwd, grp and spwd be deprecated too? The crypt module is needed for checking password hashes provided by spwd.

The pwd and grp modules are fine.
The modules use proper libc APIs that are internally backed by NSS (libc's Name Service Switch, not Mozilla's Network Security Services). APIs such as getpwnam have been defined and standardized since POSIX.1-2001. The pwd and grp modules automatically work with any configured user and group provider, even LDAP, IdM or Active Directory.

Fun fact: Golang programs are usually statically compiled and don't even use libc. However Go's os/user package requires CGO and libc because it has to interface with libc and NSS to acquire user and group information.

The spwd module is a different story. It's a direct interface to /etc/shadow using Linux-only APIs. The shadow DB API requires root permission. I think it even circumvents system security policies and identity providers.

tl;dr
pwd + grp == good, required
crypt + spwd == bad

Regards,
Christian

(*) Most Linux distros never had blowfish in libc anyway.

From tismer at stackless.com Sat Feb 3 04:55:37 2018
From: tismer at stackless.com (Christian Tismer)
Date: Sat, 3 Feb 2018 10:55:37 +0100
Subject: [Python-Dev] A minimal Python interpreter written in Python for experimenting with language changes
In-Reply-To: <20180203071242.GQ26553@ando.pearwood.info>
References: <20180203071242.GQ26553@ando.pearwood.info>
Message-ID:

On 03.02.18 08:12, Steven D'Aprano wrote:
> On Sat, Feb 03, 2018 at 12:50:11AM -0500, Terry Reedy wrote:
>> On 2/2/2018 7:01 PM, asrp asrp wrote:
>>> I don't know if this is the right place to post this. Please redirect as needed.
>>
>> This list is for development *of* cpython. Development *with* python in general belongs on python-list.
>
> This list is for development of Python the language, not just CPython the interpreter. It seems to me that announcing a new Python interpreter, especially one designed for the purpose of allowing rapid experimentation with the language, is on topic for this list.

Well spoken!

--
Christian Tismer-Sperling    :^)   tismer at stackless.com
Software Consulting          :     http://www.stackless.com/
Karl-Liebknecht-Str. 121     :     https://github.com/PySide
14482 Potsdam                :     GPG key -> 0xFB7BEE0E
phone +49 173 24 18 776  fax +49 (30) 700143-0023

From christian at python.org Sat Feb 3 06:16:55 2018
From: christian at python.org (Christian Heimes)
Date: Sat, 3 Feb 2018 12:16:55 +0100
Subject: [Python-Dev] Deprecate crypt module and revert PR 3854
In-Reply-To: <20180202213118.4143777d@fsol>
References: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org> <20180202213118.4143777d@fsol>
Message-ID:

On 2018-02-02 21:31, Antoine Pitrou wrote:
> On Fri, 2 Feb 2018 16:23:20 +0100
> Christian Heimes wrote:
>> Hi,
>>
>> in PR 3854 [1] Serhiy added blowfish, extended DES and NT-Hash to Python's crypt module. I vetoed the addition of the APIs because all these hashing algorithms are not state of the art. Their quality ranges from old to horribly, horribly broken beyond any repair.
>>
>> Shortly after the PR landed, I was made aware that glibc has deprecated the crypt(3) API [2] in favor of an external library called libxcrypt [3] from OpenWall Linux. I have patched Python 3.7 [4] to support libxcrypt.
>>
>> In light of the deprecation of the crypt(3) glibc function and the bad quality of the hashing algorithms, I'd like to raise the motion to revert 3854 and deprecate the crypt module.
>
> Those are two separate proposals.
>
> On the topic of reverting PR #3854, I don't see the point. Is Blowfish more fragile than the other algorithms? If not, it sounds ok to add it.

The Blowfish-based password hashing algorithm (also known as bcrypt) hasn't been state of the art for a long time.
Here is an old blog post from 2012 that explains some of the issues with Blowfish: http://www.unlimitednovelty.com/2012/03/dont-use-bcrypt.html

> On the topic of deprecating the crypt module, that doesn't sound like a good idea right now. People may need to generate crypt()-compatible output for various reasons, such as being able to automate system administration tasks.

IMO legacy support is not a good argument to keep the crypt module. The passlib [1] package provides an excellent selection of legacy and modern password derivation and hashing algorithms. As an admin you cannot rely on the crypt module. libc's crypt() function usually provides only a subset of the algorithms. For example, most Linux libc implementations do not provide blowfish based hashing.

Christian

[1] https://pythonhosted.org/passlib/

From christian at python.org Sat Feb 3 06:27:29 2018
From: christian at python.org (Christian Heimes)
Date: Sat, 3 Feb 2018 12:27:29 +0100
Subject: [Python-Dev] Deprecate crypt module and revert PR 3854
In-Reply-To:
References: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org>
Message-ID:

On 2018-02-02 21:21, Nathaniel Smith wrote:
> On Feb 2, 2018 7:24 AM, "Christian Heimes" wrote:
>
>> Shortly after the PR landed, I was made aware that glibc has deprecated the crypt(3) API [2] in favor of an external library called libxcrypt [3] from OpenWall Linux. I have patched Python 3.7 [4] to support libxcrypt.
>>
>> In light of the deprecation of the crypt(3) glibc function and the bad quality of the hashing algorithms, I'd like to raise the motion to revert 3854 and deprecate the crypt module. The whole module should rather be moved into a 3rd party library that wraps xcrypt.
>
> If the crypt module can just as well use libxcrypt, and that's how the distros are going to build it, then what's the advantage of deprecating it? Won't it just continue to work indefinitely?

libxcrypt is now an external library that must be installed separately.
It bloats CPython's dependency tree and the size of Python container images for a legacy feature. For Python 4.0 I'd like to drop some of the old, dead batteries and include useful batteries.

Christian

From tismer at stackless.com Sat Feb 3 06:29:19 2018
From: tismer at stackless.com (Christian Tismer)
Date: Sat, 3 Feb 2018 12:29:19 +0100
Subject: [Python-Dev] A minimal Python interpreter written in Python for experimenting with language changes
In-Reply-To:
References:
Message-ID: <9aa48a73-a02d-634c-dbae-c3c9144c0c06@stackless.com>

Hi user with no real name yet,

On 03.02.18 01:01, asrp asrp wrote:
> Hello,
>
> I don't know if this is the right place to post this. Please redirect as needed.
>
> I've made a small Python interpreter in Python with runtime AST node semantics and edit-and-continue. I thought it could make prototyping language changes more easily and visualize usage before writing them in C.
>
> It's here: https://github.com/asrp/python_terp
>
> So, for example, redefining the for_stmt function in the right scope changes the behaviour of future for loops at runtime.
>
> Although from discussion I've read in PEPs, actual implementation always looks like a non-issue (which seems like magic to me) so maybe no-one here actually needs this.
>
> (I really needed edit-and-continue for one of my projects but of course, running it in this extra interpreter is much too slow.)
>
> asrp

In the readme to python_terp you say:

"""
python_terp is intended to make language modification to Python easier to preview changes more quickly and is not intended for full CPython compatibility. However, a large subset of Python is already included. In particular, enough to run the first stage of its parser.
"""

This needs clarification.
What do you mean by subset? A real subset or also things that are different and will stay different?
To what extent are you planning to stay compatible, and where do you plan to deviate?

The reason that I'm asking is that by compatible I mean the compatibility of PyPy. If you can reach that, and be it just by a subset, then it makes sense to speak of Python.

Cheers - Chris

--
Christian Tismer-Sperling    :^)   tismer at stackless.com
Software Consulting          :     http://www.stackless.com/
Karl-Liebknecht-Str. 121     :     https://github.com/PySide
14482 Potsdam                :     GPG key -> 0xFB7BEE0E
phone +49 173 24 18 776  fax +49 (30) 700143-0023

From christian at python.org Sat Feb 3 07:07:17 2018
From: christian at python.org (Christian Heimes)
Date: Sat, 3 Feb 2018 13:07:17 +0100
Subject: [Python-Dev] Deprecate crypt module and revert PR 3854
In-Reply-To:
References: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org>
Message-ID:

On 2018-02-02 17:18, Guido van Rossum wrote:
> I'm all for nudging people in the direction of xcrypt. I assume we can't just switch the C-level crypt with xcrypt and leave the Python API unchanged?
>
> However until a usable solution exists (either in the stdlib or as 3rd party) I don't think we should deprecate anything (deprecating things before the replacement is ready is stressful for everyone involved).
>
> I'm also not sure I agree with removing support for old hashes. By all means put in the docs that they are unsafe. But if someone has a database full of old hashes it would be nice to be able to at least read/verify it, right?
>
> Was a release already made with blowfish, extended DES and NT-Hash? (And what's so bad with blowfish? It's mentioned in the heading of the xcrypt project too.)

I answered some of your questions in other replies and will answer the remaining concerns on Monday.

You suggested a 3rd party module.
I have cloned the crypt module with Serhiy's improvements and turned it into a standalone module with a ctypes interface, https://github.com/tiran/legacycrypt . I'll release the package as soon as I find time to polish the documentation and give Serhiy his well deserved credit for his work.

Christian

From asrp at email.com Sat Feb 3 05:45:15 2018
From: asrp at email.com (asrp)
Date: Sat, 3 Feb 2018 11:45:15 +0100
Subject: [Python-Dev] A minimal Python interpreter written in Python for experimenting with language changes
In-Reply-To: <09cc01d39c96$9cb28910$d6179b30$@sdamon.com>
References: <09cc01d39c96$9cb28910$d6179b30$@sdamon.com>
Message-ID:

> Are you aware of pypy?

Yes, but I do not know it well. It seems to be more focused on JIT compiling with many of its functions written for special cases and I did not find a way to make experimental changes easily.

I even tried (briefly) to have the main loop of this interpreter run in RPython but didn't find enough information on which subset of Python RPython handles.

> Can you give an example of how you would do that? I don't mean the mechanism used, I mean how would a developer implement a new syntactic feature. Suppose I wanted to add a new clause to for...else, let's say:
>
>     for ... :
>         block
>     otherwise:
>         # runs only if the for-loop was empty
>
> How would I do that?

Here's one way to do that. Sorry for the lack of doc for the moment.

To alter the language's grammar, you'd create a string with the new grammar, parse it and assign the result to pyterp.parser. Then, depending on how the grammar was changed (which affects what AST is created), you'd change the corresponding function to handle the new semantics.

I didn't quite understand what you mean by "was empty". However, I noticed that "else" blocks after for are ignored in the current implementation. Here's an example session adding "else".
If you tell me a bit more about the intended behaviour of "otherwise", I'd be happy to do an example with that clause.

    $ ipython -i test/python_repl.py
    p>> simport simple_ast
    p>> for i in [1, 2]:
    ...     print i
    ...
    1
    2
    p>> for i in [1, 2, 3]:
    ...     print i
    ... else:
    ...     print 100
    ...
    1
    2
    3
    p>> ^D

In the above for loop, the else block is ignored! I press Control-D to exit the interpreter.

    In [1]: grammar = python_grammar.full_definition + python_grammar.extra
    In [2]: grammar += r"""
       ...: for_stmt = "for" {exprlist} "in" {testlist} ":" {suite} {(SAME_INDENT "else" ":" {suite}) | void=pass_stmt}
       ...: """
    In [3]: pyterp.parser = python.Interpreter(i3.parse("grammar", grammar))
    In [4]: pyterp.repl()

Now edit for_stmt in simple_ast.py

             except StopIteration:
    +            evaluate(else_block)
                 return

(If else_block was not already an (ignored) parameter, we'd have to change the `def for_stmt...` line too.)

And run in the REPL we just restarted.

    p>> simple_ast.reload_module(simple_ast)
    p>> for i in [1, 2, 3]:
    ...     print i
    ... else:
    ...     print 100
    ...
    1
    2
    3
    100
    p>> for i in [1, 2, 3]:
    ...     print i
    ...     break
    ... else:
    ...     print 100
    ...
    1
    p>>

Some notes:

1. I'm using the host Python interpreter to change the grammar here but that's not strictly necessary if we expose pyterp itself in the global scope.

2. I'm editing simple_ast.py and reloading because it's fewer changes, but defining a new function and setting simple_ast.for_stmt (in other words, monkey-patching the change) should also work.

If we wanted an "otherwise" clause on top of an "else" clause, we'd probably want to make some provision for the runtime to distinguish between the two, so I'd edit the grammar with something like

    for_stmt = "for" {exprlist} "in" {testlist} ":" {suite} {((SAME_INDENT "else" ":" {suite}) | void=pass_stmt)=else_block ((SAME_INDENT "otherwise" ":" {suite}) | void=pass_stmt)=otherwise_block}

(In this case, "otherwise" blocks have to come after the "else" block if both are present.)
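
For comparison, one reading of the proposed "otherwise" clause can be emulated in ordinary Python. This is only a sketch under the assumption that "was empty" means the iterable produced no items at all; for_with_otherwise, trace and their parameters are made-up names for illustration and are not part of python_terp.

```python
def for_with_otherwise(iterable, body, else_block=None, otherwise_block=None):
    """Run body(item) for each item; body returns True to request a break.

    else_block mirrors Python's existing for/else: it runs when the loop
    finishes without a break (which includes the empty case).
    otherwise_block is the hypothetical new clause: it runs only when the
    iterable produced no items at all.
    """
    saw_item = False
    for item in iterable:
        saw_item = True
        if body(item):
            break
    else:
        if else_block is not None:
            else_block()
    if not saw_item and otherwise_block is not None:
        otherwise_block()


def trace(items, break_on=None):
    """Return the sequence of events recorded for one emulated loop."""
    events = []

    def body(item):
        events.append(item)
        # Returning True asks for_with_otherwise to break out of the loop.
        return item == break_on

    for_with_otherwise(
        items,
        body,
        else_block=lambda: events.append("else"),
        otherwise_block=lambda: events.append("otherwise"),
    )
    return events


no_break = trace([1, 2, 3])
with_break = trace([1, 2, 3], break_on=1)
empty = trace([])
```

Under this reading an empty iterable triggers both clauses, "else" first and then "otherwise", which matches the ordering the grammar above would impose. Whether that is the intended behaviour is exactly the open question.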

asrp

From asrp at email.com Sat Feb 3 09:17:47 2018
From: asrp at email.com (asrp)
Date: Sat, 3 Feb 2018 15:17:47 +0100
Subject: [Python-Dev] A minimal Python interpreter written in Python for experimenting with language changes
In-Reply-To: <9aa48a73-a02d-634c-dbae-c3c9144c0c06@stackless.com>
References: <9aa48a73-a02d-634c-dbae-c3c9144c0c06@stackless.com>
Message-ID:

Hi,

(My other reply to Alex and Steven doesn't seem to appear, at least not on pipermail, so I hope these don't come in the wrong order.)

> In the readme to python_terp you say:
>
> """
> python_terp is intended to make language modification to Python easier to preview changes more quickly and is not intended for full CPython compatibility. However, a large subset of Python is already included. In particular, enough to run the first stage of its parser.
> """
>
> This needs clarification.
> What do you mean by subset? A real subset or also things that are different and will stay different?
> To what extent are you planning to stay compatible, and where do you plan to deviate?
>
> The reason that I'm asking is that by compatible I mean the compatibility of PyPy. If you can reach that, and be it just by a subset, then it makes sense to speak of Python.

Good questions! There are two parts to what I meant by that passage.

One is a reflection of the current state of the interpreter and its capabilities. I put an example instead of listing all differences because it's easier to update but admittedly much less precise.

The other concerns the intention for the future. My mind is not completely made up but here's what I currently think it should/could be.

First, fidelity would be determined by how the interpreter is used (if it's used). More (potentially) drastic changes in the beginning than later on. Hopefully the deviations from Python (if any) for this reason do not grow too much from this. But since it's for testing language changes (to Python), I don't think it would stray too much, if at all.

Second, I'd like to keep the whole thing small (of low complexity) as much as possible so in some cases, the less featureful choice will be kept. For example, in an earlier version, I had more function parameter handling (defaults, *args, **kwargs, etc) but removed it because just setting up the initial local scope from the parameters passed and function definition took more than 100 lines. (Maybe I just went about this the wrong way...)

However, except for easier or simpler implementation of the interpreter or some of the initial AST nodes, I don't intend to make any additions not in the Python language. One example of this is the simple_for and single_if statements, which I explain a bit in this old post:

http://blog.asrpo.com/bootstrap_chicken_or_egg

Although those are mainly intended to bootstrap the other statements like if_stmt and for_stmt and are not used again later.

(Well, there might be some difference for class method lookup (late-binding instead of early-binding) that's needed for edit-and-continue. But the intention would be to list all of those differences.)

Feel free to ask again if I missed some aspect of your questions here.

asrp

From asrp at email.com Sat Feb 3 09:25:40 2018
From: asrp at email.com (asrp)
Date: Sat, 3 Feb 2018 15:25:40 +0100
Subject: [Python-Dev] A minimal Python interpreter written in Python for experimenting with language changes
References: <9aa48a73-a02d-634c-dbae-c3c9144c0c06@stackless.com>
Message-ID:

Oh and I forgot to mention that since everything is mostly done at run time, there could just be different modules for (slightly) different languages. I don't know if that'd be problematic for maintenance though.

asrp

From guido at python.org Sat Feb 3 11:13:26 2018
From: guido at python.org (Guido van Rossum)
Date: Sat, 3 Feb 2018 08:13:26 -0800
Subject: [Python-Dev] Deprecate crypt module and revert PR 3854
In-Reply-To:
References: <11437ad6-450f-58a2-94db-b8fe8619157e@python.org>
Message-ID: You can deprecate the crypt module, update the docs to explain it's outdated and point to other 3rd party solutions. A few years from now we may be able to delete it. (With an intermediate step of issuing a non-silent deprecation warning.) Until then just leave it be. Possibly that's what your desired outcome is anyway? On Sat, Feb 3, 2018 at 4:07 AM, Christian Heimes wrote: > On 2018-02-02 17:18, Guido van Rossum wrote: > > I'm all for nudging people in the direction of xcrypt. I assume we can't > > just switch the C-level crypt with xcrypt and leave the Python API > > unchanged? > > > > However until a usable solution exist (either in the stdlib or as 3rd > > party) I don't think we should deprecate anything (deprecating things > > before the replacement is ready is stressful for everyone involved). > > > > I'm also not sure I agree with removing support for old hashes. By all > > means put in the docs that they are unsafe. But if someone has a > > database full of old hashes it would be nice to be able to at least > > read/verify it, right? > > > > Was a release already made with blowfish, extended DES and NT-Hash? (And > > what's so bad with blowfish? It's mentioned in the heading of the xcrypt > > project too.) > > I answered some of your questions in other replies and will answer the > remaining concerns on Monday. > > You suggested a 3rd party module. I have cloned the crypt module with > Serhiy's improvements and turned it into a standalone module with a > ctypes interface, https://github.com/tiran/legacycrypt . I'll release > the package as soon as I find time to polish the documentation and give > Serhiy his will deserved credit for his work. > > Christian > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg at krypto.org Sat Feb 3 14:25:38 2018 From: greg at krypto.org (Gregory P. 
Smith) Date: Sat, 03 Feb 2018 19:25:38 +0000 Subject: [Python-Dev] "threading.Lock().locked()" is not documented In-Reply-To: <06cf0afb-27ec-793e-61d1-3c0efb8da149@jcea.es> References: <06cf0afb-27ec-793e-61d1-3c0efb8da149@jcea.es> Message-ID: On Wed, Jan 31, 2018 at 4:46 PM Jesus Cea wrote: > https://docs.python.org/3.6/library/threading.html doesn't document > "threading.Lock().locked()", and it is something quite useful. > > In fact, it is used in "threading.py" itself. For instance, lines 109, > 985, 1289. > > Is there any reason to not document it? > No good reason. It is a public method and part of the public API regardless of being documented or not. Always feel free to make a documentation update PR. The method does have a docstring. https://github.com/python/cpython/blob/master/Modules/_threadmodule.c#L193 -gps > > (I didn't investigate other objects in the module). > > -- > Jesús Cea Avión _/_/ _/_/_/ _/_/_/ > jcea at jcea.es - http://www.jcea.es/ _/_/ _/_/ _/_/ _/_/ _/_/ > Twitter: @jcea _/_/ _/_/ _/_/_/_/_/ > jabber / xmpp:jcea at jabber.org _/_/ _/_/ _/_/ _/_/ _/_/ > "Things are not so easy" _/_/ _/_/ _/_/ _/_/ _/_/ _/_/ > "My name is Dump, Core Dump" _/_/_/ _/_/_/ _/_/ _/_/ > "El amor es poner tu felicidad en la felicidad de otro" - Leibniz > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/greg%40krypto.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg at krypto.org Sat Feb 3 15:05:07 2018 From: greg at krypto.org (Gregory P.
Smith) Date: Sat, 03 Feb 2018 20:05:07 +0000 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <34493569.S535hZLbfa@klinga.prans.org> <0a81521e-74f2-4883-6ab0-96e6fb03a419@trueblade.com> <1937305.WBmin8xbao@klinga.prans.org> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> Message-ID: On Fri, Feb 2, 2018 at 10:25 PM Nick Coghlan wrote: > > > On 3 Feb. 2018 1:09 am, "Eric V. Smith" wrote: > > > The problem with dropping hash=True is: how would you write __hash__ > yourself? It seems like a bug magnet if you're adding fields to the class > and forget to update __hash__, especially in the presence of per-field > hash=False and eq=False settings. And you'd need to make sure it matches > the generated __eq__ (if 2 objects are equal, they need to have the same > hash value). > > > I think anyone that does this needs to think *very* carefully about how > they do it, and offering both "hash=True" and "frozen=True" is an > attractive nuisance that means people will write "hash=True" when what they > wanted was "frozen=True". > > In particular, having to work out how write a maintainable "__hash__" will > encourage folks to separate out the hashed fields as a separate frozen > subrecord or base class. > > If this proves to be an intolerable burden then the short hand spelling > could be added back in 3.8, but once we ship it we're going to be stuck > with explaining the interactions indefinitely. > +1 Nick put words to my chief concerns. It is easy for an author to see hash=True in existing code somewhere (cargo culting) and assume it does what they want, or to glance quickly at the API and see "hash=True" without taking the time to understand its implications, and so never notice that the parameter named "frozen" is the one they are supposed to want: it _safely_ makes their dataclass properly hashable, unlike the more attractive parameter named "hash", which enables dangerous behavior.
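A minimal sketch of the distinction drawn above, assuming the dataclasses module as shipped in Python 3.7. The Point and MutablePoint classes are hypothetical and exist only for illustration. With frozen=True (and the default eq=True) the decorator generates a __hash__ and blocks attribute assignment, whereas a plain mutable dataclass is left unhashable.

```python
from dataclasses import dataclass, FrozenInstanceError


@dataclass(frozen=True)
class Point:
    """Hypothetical immutable record: hashable because it cannot change."""
    x: int
    y: int


@dataclass
class MutablePoint:
    """Hypothetical mutable record: eq=True without frozen=True sets __hash__ to None."""
    x: int
    y: int


p = Point(1, 2)
labels = {p: "start"}

# An equal instance hashes the same way, so it finds the same dict entry.
found = labels[Point(1, 2)]

# Assigning to a field of a frozen instance raises FrozenInstanceError.
try:
    p.x = 10
    frozen_blocked = False
except FrozenInstanceError:
    frozen_blocked = True

# The mutable variant refuses to be hashed at all.
try:
    hash(MutablePoint(1, 2))
    mutable_hashable = True
except TypeError:
    mutable_hashable = False
```

The point of the sketch is only that "frozen" gives hashability as a side effect of immutability, which is the safe combination being argued for here.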
Forcing people who need a __hash__ method to write it explicitly sounds like a good thing to me. I am not at all worried about someone forgetting to update the __hash__ method when adding a new field; the fields and the __hash__ method are all defined in the same place in the code. I expect someone with a common need for always having a __hash__ method will produce a library on top of dataclasses that implements something like our current hash=True behavior. If that kind of thing turns out to be widely used, we can reintroduce the feature in dataclasses in 3.8 or later, informed by what we see practical uses actually doing. In my practical experience, people writing Python code do not need to learn and understand the intricacies of what it means to have a __hash__ method, be hashable, or "frozen". We intentionally warn people against writing dunder methods other than __init__ in their code as they are often power features with less obvious semantics than they may seem to have at first glance, making such code harder to maintain. Even calling the parameter "hash=" and saying it adds a __hash__ method as the PEP currently does seems to launder the danger, washing away the "dunder smell" that adding a special-considerations __hash__ method carries. The PEP (and presumably forthcoming dataclasses module documentation) says "This is a specialized use case and should be considered carefully" which I agree with. But any time we suggest that in an API, how about having the API name make it clear that this is special and not to be done lightly? I guess I'm arguing against using "hash=" as the arg name in favor of "danger_there_be_vorpal_rabbits_hash_me_maybe=" or something more usefully similar if we're going to have it. -gps -------------- next part -------------- An HTML attachment was scrubbed...
URL: From benjamin at python.org Sat Feb 3 16:08:29 2018 From: benjamin at python.org (Benjamin Peterson) Date: Sat, 03 Feb 2018 13:08:29 -0800 Subject: [Python-Dev] "threading.Lock().locked()" is not documented In-Reply-To: References: <06cf0afb-27ec-793e-61d1-3c0efb8da149@jcea.es> Message-ID: <1517692109.3300380.1258485616.38579E97@webmail.messagingengine.com> On Sat, Feb 3, 2018, at 11:25, Gregory P. Smith wrote: > On Wed, Jan 31, 2018 at 4:46 PM Jesus Cea wrote: > > > https://docs.python.org/3.6/library/threading.html doesn't document > > "threading.Lock().locked()", and it is something quite useful. > > > > In fact, it is used in "threading.py" itself. For instance, lines 109, > > 985, 1289. > > > > Is there any reason to not document it?. > > > > No good reason. It is a public method and part of the public API > regardless of being documented or not. While I agree about .locked's present publicness, use of locked predicates in concurrent programs is usually serious code smell. So, the documentation should come with a warning note. From ethan at stoneleaf.us Sat Feb 3 17:02:18 2018 From: ethan at stoneleaf.us (Ethan Furman) Date: Sat, 03 Feb 2018 14:02:18 -0800 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <34493569.S535hZLbfa@klinga.prans.org> <0a81521e-74f2-4883-6ab0-96e6fb03a419@trueblade.com> <1937305.WBmin8xbao@klinga.prans.org> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com>

Message-ID: <5A76316A.40807@stoneleaf.us> On 02/02/2018 10:44 PM, Guido van Rossum wrote: > It appears Eric and I are the only ones in favor of keeping the current behavior. But I still am not convinced by all > the worries about "attractive nuisances" and all the other bad names this feature has been called. We don't know that > any of the doomsday scenarios will happen. In my experience, usually once something is rolled out the set of issues that > are *actually* raised is entirely different from the things its designers were worried about. This may all be true, but consider how many times we have asked, "How does attrs handle that?" It would be wise to also ask, "What pitfalls have attrs discovered, and what would they do different if they could?" -- ~Ethan~ From tritium-list at sdamon.com Sat Feb 3 17:40:30 2018 From: tritium-list at sdamon.com (Alex Walters) Date: Sat, 3 Feb 2018 17:40:30 -0500 Subject: [Python-Dev] Is 4.0 a major breaking changes release? Message-ID: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> I am still working on porting code from 2.x to 3.x. As of late on the lists I've seen comments about making somewhat major changes in 4.0 - now I'm concerned that I should pause my porting effort until that is released. Is python 4 going to be another python 3? From nad at python.org Sat Feb 3 17:46:21 2018 From: nad at python.org (Ned Deily) Date: Sat, 3 Feb 2018 17:46:21 -0500 Subject: [Python-Dev] Is 4.0 a major breaking changes release? In-Reply-To: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> References: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> Message-ID: <64701CC4-9E49-461A-AC3A-336967ECCF29@python.org> On Feb 3, 2018, at 17:40, Alex Walters wrote: > I am still working on porting code from 2.x to 3.x. As of late on the lists > I've seen comments about making somewhat major changes in 4.0 - now I'm > concerned that I should pause my porting effort until that is released. Is > python 4 going to be another python 3? 
At this point, Python 4 is just a convenient tag for really big changes. There are no concrete plans or schedule for such a major undertaking. Port away to Python 3.x! -- Ned Deily nad at python.org -- [] From p.f.moore at gmail.com Sat Feb 3 17:48:59 2018 From: p.f.moore at gmail.com (Paul Moore) Date: Sat, 3 Feb 2018 22:48:59 +0000 Subject: [Python-Dev] Is 4.0 a major breaking changes release? In-Reply-To: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> References: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> Message-ID: On 3 February 2018 at 22:40, Alex Walters wrote: > I am still working on porting code from 2.x to 3.x. As of late on the lists > I've seen comments about making somewhat major changes in 4.0 - now I'm > concerned that I should pause my porting effort until that is released. Is > python 4 going to be another python 3? No. Guido has gone on record as saying this won't happen. Paul From zachary.ware+pydev at gmail.com Sat Feb 3 17:50:02 2018 From: zachary.ware+pydev at gmail.com (Zachary Ware) Date: Sat, 3 Feb 2018 16:50:02 -0600 Subject: [Python-Dev] Is 4.0 a major breaking changes release? In-Reply-To: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> References: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> Message-ID: On Sat, Feb 3, 2018 at 4:40 PM, Alex Walters wrote: > I am still working on porting code from 2.x to 3.x. As of late on the lists > I've seen comments about making somewhat major changes in 4.0 - now I'm > concerned that I should pause my porting effort until that is released. Is > python 4 going to be another python 3? Emphatically no. Anyone suggesting big breaking changes in 4.0 is exercising wishful thinking :) There may be some cleanup in 4.0, but only removing things that have been deprecated for a long time in 3.x but hadn't been removed to maintain compatibility with 2.7. 
-- Zach From drsalists at gmail.com Sat Feb 3 17:51:09 2018 From: drsalists at gmail.com (Dan Stromberg) Date: Sat, 3 Feb 2018 14:51:09 -0800 Subject: [Python-Dev] Is 4.0 a major breaking changes release? In-Reply-To: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> References: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> Message-ID: On Sat, Feb 3, 2018 at 2:40 PM, Alex Walters wrote: > I am still working on porting code from 2.x to 3.x. As of late on the lists > I've seen comments about making somewhat major changes in 4.0 - now I'm > concerned that I should pause my porting effort until that is released. Is > python 4 going to be another python 3? https://www.curiousefficiency.org/posts/2014/08/python-4000.html Python 0.x to 1.x was a small change. Python 1.x to 2.x was a small change. I doubt there'll be anything as important as str -> (bytes, unicode) to merit a breaking change. From tjreedy at udel.edu Sat Feb 3 20:48:08 2018 From: tjreedy at udel.edu (Terry Reedy) Date: Sat, 3 Feb 2018 20:48:08 -0500 Subject: [Python-Dev] Is 4.0 a major breaking changes release? In-Reply-To: References: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> Message-ID: On 2/3/2018 5:50 PM, Zachary Ware wrote: > On Sat, Feb 3, 2018 at 4:40 PM, Alex Walters wrote: >> I am still working on porting code from 2.x to 3.x. As of late on the lists >> I've seen comments about making somewhat major changes in 4.0 - now I'm >> concerned that I should pause my porting effort until that is released. Is >> python 4 going to be another python 3? > > Emphatically no. Anyone suggesting big breaking changes in 4.0 is > exercising wishful thinking :) > > There may be some cleanup in 4.0, but only removing things that have > been deprecated for a long time in 3.x but hadn't been removed to > maintain compatibility with 2.7. So do your porting with deprecation warnings on. 
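One way to follow this advice, sketched with the standard warnings module. The function old_api below is a hypothetical stand-in for any deprecated call in the code being ported; DeprecationWarning is hidden by default in most contexts, so it has to be switched on explicitly.

```python
import warnings


def old_api():
    # Hypothetical function standing in for any deprecated call.
    warnings.warn("old_api() is deprecated", DeprecationWarning, stacklevel=2)
    return 42


# The same effect is available from outside the program:
#   python -W default your_script.py
#   PYTHONWARNINGS=default python your_script.py
with warnings.catch_warnings(record=True) as caught:
    # Inside this block, report every DeprecationWarning instead of hiding it.
    warnings.simplefilter("always", DeprecationWarning)
    result = old_api()

# Collect the text of each recorded deprecation message.
messages = [
    str(w.message) for w in caught if issubclass(w.category, DeprecationWarning)
]
for message in messages:
    print("DeprecationWarning:", message)
```

During a port it can also help to run the test suite with -W error::DeprecationWarning, which turns each such warning into an exception so it cannot be overlooked.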
-- Terry Jan Reedy From steve at holdenweb.com Sat Feb 3 21:18:14 2018 From: steve at holdenweb.com (Steve Holden) Date: Sun, 4 Feb 2018 02:18:14 +0000 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <34493569.S535hZLbfa@klinga.prans.org> <0a81521e-74f2-4883-6ab0-96e6fb03a419@trueblade.com> <1937305.WBmin8xbao@klinga.prans.org> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com>

Message-ID: As a Bear of Relatively Little Brain, I've grown up understanding, and teaching, that mutable things aren't to be used as dict keys. I'm aware that immutability isn't strictly the required condition, but it for most people, that's the primary reason for using frozen sets and tuples, for example, and immutability serves as a practical and comprehensible first approximation. So I'm at a loss to understand why I am being offered a feature that (especially during maintenance by a different developer) might be prone to bizarre errors caused by a change in hash. I realise that this won't happen very often, but the difficulty of the debug task should surely merit at least some warning for us bears - you know, the ones that take your work and use it to do mundane things with. On a slightly tangential note, us bears are very glad that such questions are taken seriously and discussed in such depth. Thank you all. Steve Holden On Sat, Feb 3, 2018 at 6:44 AM, Guido van Rossum wrote: > It appears Eric and I are the only ones in favor of keeping the current > behavior. But I still am not convinced by all the worries about "attractive > nuisances" and all the other bad names this feature has been called. We > don't know that any of the doomsday scenarios will happen. In my > experience, usually once something is rolled out the set of issues that are > *actually* raised is entirely different from the things its designers were > worried about. > > Please don't commit a change to roll this back without checking in with > me; I have some misgivings about the problem being raised here that I still > need to think through more carefully. In the meantime, please try to use > dataclasses with 3.7b1! > > On Fri, Feb 2, 2018 at 10:25 PM, Nick Coghlan wrote: > >> >> >> On 3 Feb. 2018 1:09 am, "Eric V. Smith" wrote: >> >> >> The problem with dropping hash=True is: how would you write __hash__ >> yourself? 
It seems like a bug magnet if you're adding fields to the class >> and forget to update __hash__, especially in the presence of per-field >> hash=False and eq=False settings. And you'd need to make sure it matches >> the generated __eq__ (if 2 objects are equal, they need to have the same >> hash value). >> >> >> I think anyone that does this needs to think *very* carefully about how >> they do it, and offering both "hash=True" and "frozen=True" is an >> attractive nuisance that means people will write "hash=True" when what they >> wanted was "frozen=True". >> >> In particular, having to work out how write a maintainable "__hash__" >> will encourage folks to separate out the hashed fields as a separate frozen >> subrecord or base class. >> >> If this proves to be an intolerable burden then the short hand spelling >> could be added back in 3.8, but once we ship it we're going to be stuck >> with explaining the interactions indefinitely. >> >> Cheers, >> Nick. >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve at holdenweb.com Sat Feb 3 21:23:52 2018 From: steve at holdenweb.com (Steve Holden) Date: Sun, 4 Feb 2018 02:23:52 +0000 Subject: [Python-Dev] Is 4.0 a major breaking changes release? In-Reply-To: <64701CC4-9E49-461A-AC3A-336967ECCF29@python.org> References: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> <64701CC4-9E49-461A-AC3A-336967ECCF29@python.org> Message-ID: On Sat, Feb 3, 2018 at 10:46 PM, Ned Deily wrote: > On Feb 3, 2018, at 17:40, Alex Walters wrote: > > I am still working on porting code from 2.x to 3.x. As of late on the > lists > > I've seen comments about making somewhat major changes in 4.0 - now I'm > > concerned that I should pause my porting effort until that is released. > Is > > python 4 going to be another python 3? > > At this point, Python 4 is just a convenient tag for really big changes. > There are no concrete plans or schedule for such a major undertaking. Port > away to Python 3.x! 
> > "Py3K?" they said, "oh, no, we're just noodling around with a few ideas ..." :-) To be honest, and historically fair to Guido, he did warn for a long time that we should expect breaking changes in an eventual wart-removal release. It seemed to me the biggest disappointment was the team not having the resources to devote to a mooted but never really achieved reorganisation of the stdlib. Rectifying that omission would, I hope, be included as a priority in any Python 4 design. Since people rely on the stdlib hugely, automated translation of at least 98% of existing stdlib imports should be a goal. But that's just me. -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg.ewing at canterbury.ac.nz Sat Feb 3 23:51:12 2018 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sun, 04 Feb 2018 17:51:12 +1300 Subject: [Python-Dev] Is 4.0 a major breaking changes release? In-Reply-To: References: <0a9e01d39d3f$ff2e9620$fd8bc260$@sdamon.com> <64701CC4-9E49-461A-AC3A-336967ECCF29@python.org> Message-ID: <5A769140.6040302@canterbury.ac.nz> > On Sat, Feb 3, 2018 at 10:46 PM, Ned Deily > wrote: >> I've seen comments about making somewhat major changes in 4.0 - > now I'm >> concerned that I should pause my porting effort until that is > released. Is >> python 4 going to be another python 3? Guido has repeatedly promised that there will never be another upheaval as big as the 2-to-3 one, and that the change from Python 3.9 to 4.0 won't be anything special. Hopefully we can trust him on that.
-- Greg From ronaldoussoren at mac.com Sun Feb 4 04:44:34 2018 From: ronaldoussoren at mac.com (Ronald Oussoren) Date: Sun, 04 Feb 2018 10:44:34 +0100 Subject: [Python-Dev] OS-X builds for 3.7.0 In-Reply-To: References: Message-ID: <91C9C795-B485-4E89-AA46-45B01B0D5ACD@mac.com> > On 30 Jan 2018, at 18:42, Chris Barker wrote: > > Ned, > > It looks like you're still building OS-X the same way as in the past: > > Intel 32+64 bit, 10.6 compatibility > > Is that right? > > Might it be time for an update? > > Do we still need to support 32 bit? From: > > https://apple.stackexchange.com/questions/99640/how-old-are-macs-that-cannot-run-64-bit-applications > > There has not been a 32 bit-only Mac sold since 2006, and an out-of-the-box 32 bit OS since 2006 or 2007 > > I can't find out what the older OS version Apple supports, but I know my IT dept has been making me upgrade, so I'm going to guess 10.8 or newer? A binary with a newer deployment target than 10.6 would be nice because AFAIK the installers are still built on a system running that old version of OSX. This results in binaries that cannot access newer system APIs like openat (and hence don't support the "dir_fd" parameter in a number of functions in the os module). > > And maybe we could even get rid of the "Framework" builds?? Why? IMHO Framework builds are a nice way to get isolated side-by-side installations. Furthermore a number of Apple APIs (including the GUI libraries) don't work unless you're running from an application bundle, which the framework builds arrange for and normal unix builds don't. Ronald -------------- next part -------------- An HTML attachment was scrubbed...
URL: From ronaldoussoren at mac.com Sun Feb 4 06:02:17 2018 From: ronaldoussoren at mac.com (Ronald Oussoren) Date: Sun, 04 Feb 2018 12:02:17 +0100 Subject: [Python-Dev] [python-committers] [RELEASE] Python 3.7.0b1 is now available for testing In-Reply-To: <9425596C-A92F-4B10-A8B7-98F4E827E8D0@python.org> References: <9425596C-A92F-4B10-A8B7-98F4E827E8D0@python.org> Message-ID: > On 1 Feb 2018, at 02:34, Ned Deily wrote: > > [...] > > Attention macOS users: with 3.7.0b1, we are providing a choice of > two binary installers. The new variant provides a 64-bit-only > version for macOS 10.9 and later systems; this variant also now > includes its own built-in version of Tcl/Tk 8.6. We welcome your > feedback. > Why macOS 10.9 or later? macOS 10.10 introduced a number of useful APIs, in particular openat(2) and the like, which are exposed using the "dir_fd" parameter of functions in the posix module. That said, macOS 10.9 seems to be a fairly common minimal platform requirement these days for developers not tracking Apple's releases closely. Ronald From chris.barker at noaa.gov Sun Feb 4 14:59:27 2018 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Sun, 4 Feb 2018 11:59:27 -0800 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com> References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com> Message-ID: >> IMO, the danger of >> "@dataclass(hash=True)" far overweighs whatever convenience it might >> provide. Is there any reason specifying hash=True couldn't set frozen=True unless the user specifically sets frozen=False? Or is that already the case? I think the folks that are concerned about this issue are quite right: most Python users equate immutable and hashable, so the dataclass API should reflect that.
And this would still make it easy and clear to specify the unusual (and arguably dangerous) case of: hash=True, frozen=False -CHB > From guido at python.org Sun Feb 4 17:31:06 2018 From: guido at python.org (Guido van Rossum) Date: Sun, 4 Feb 2018 14:31:06 -0800 Subject: [Python-Dev] Immutability vs. hashability Message-ID: On Sun, Feb 4, 2018 at 11:59 AM, Chris Barker - NOAA Federal < chris.barker at noaa.gov> wrote: > I think the folks that are concerned about this issue are quite right: > most Python users equate immutable and hashable, so the dataclass API > should reflect that. > Since they are *not* equivalent (consider a tuple containing a list) I'm not at all convinced that any API in the core language should "reflect" this misconception, depending on how you meant that. -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From larry at hastings.org Sun Feb 4 19:35:03 2018 From: larry at hastings.org (Larry Hastings) Date: Sun, 4 Feb 2018 16:35:03 -0800 Subject: [Python-Dev] [RELEASED] Python 3.4.8 and Python 3.5.5 are now available Message-ID: <7b3f1a47-bdb1-8601-3a68-2b6c5b53b1ac@hastings.org> On behalf of the Python development community, I'm happy to announce the availability of Python 3.4.8 and Python 3.5.5. Both Python 3.4 and 3.5 are in "security fixes only" mode. Both versions only accept security fixes, not conventional bug fixes, and both releases are source-only. You can find Python 3.4.8 here: https://www.python.org/downloads/release/python-348/ And you can find Python 3.5.5 here: https://www.python.org/downloads/release/python-355/ Happy Pythoning, //arry/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Sun Feb 4 20:54:37 2018 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 5 Feb 2018 11:54:37 +1000 Subject: [Python-Dev] Immutability vs.
hashability In-Reply-To: References: Message-ID: On 5 February 2018 at 08:31, Guido van Rossum wrote: > On Sun, Feb 4, 2018 at 11:59 AM, Chris Barker - NOAA Federal > wrote: >> >> I think the folks that are concerned about this issue are quite right: >> most Python users equate immutable and hashable, so the dataclass API >> should reflect that. > > Since they are *not* equivalent (consider a tuple containing a list) I'm not > at all convinced that any API in the core language should "reflect" this > misconception, depending on how you meant that. Lists are themselves mutable, and hence inherently unhashable. Tuples are themselves immutable, and hence hashable if their contents are. I interpret Chris's comment as saying that data classes should behave the same way that the builtin container types do: * if the data class itself is mutable (frozen=False, comparable to list, dict, set), then it is *not* hashable (unless you explicitly implement __hash__) * if the data class itself is immutable (frozen=True, comparable to tuple or frozenset), then whether or not it is hashable depends on whether or not the field values are hashable. It's the ability to ask the interpreter to guess what you mean by "frozen=False, hash=True" that creates the likelihood of confusion. Whereas if we leave out the "hash=True" option entirely, then the most natural way to obtain a partially-mutable record, which has a fixed comparison key and selectively mutable state, would be through containment, where the mutable state is moved out to a subrecord that gets excluded from hashes and comparisons. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From jimjjewett at gmail.com Sun Feb 4 23:42:01 2018 From: jimjjewett at gmail.com (Jim J.
Jewett) Date: Sun, 4 Feb 2018 23:42:01 -0500 Subject: [Python-Dev] unfrozen dataclasses and __hash__ (subsets are OK) Message-ID: I understand auto-generating the __hash__ (and __eq__) for a frozen container; that is just convenient. But why is there any desire to autogenerate a __hash__ for something that isn't frozen? Like a list or dict, the normal case would be for it not to have a hash at all, and the author *should* write out any explicit exceptions. The objection to that seems to be that someone might forget to add another field to the hash during later maintenance -- but so what? __hash__ should reference a subset of the fields used for equality, and strict subsets are OK. It *should* ignore some fields if that will provide the right balance between quick calculation and sufficient dispersion. If the record is complicated enough that forgetting a field is a likely problem, then the hash is probably already sufficiently complex without those new fields. -jJ From guido at python.org Mon Feb 5 00:18:25 2018 From: guido at python.org (Guido van Rossum) Date: Sun, 4 Feb 2018 21:18:25 -0800 Subject: [Python-Dev] Immutability vs. hashability In-Reply-To: References:

Message-ID: That's a lot to read between the lines. I was unhappy that Chris took the statement that immutability and hashability are equivalent, claimed that most people think of it that way, and did not point out that it was false, thereby making the impression that he wasn't aware of the difference. The way I think of it generally is that immutability is a property of types, while hashability is a property of values. I don't want the original debate (about what to do with hash=True for dataclasses) to be spread across multiple threads so I'll reply separately there. On Sun, Feb 4, 2018 at 5:54 PM, Nick Coghlan wrote: > On 5 February 2018 at 08:31, Guido van Rossum wrote: > > On Sun, Feb 4, 2018 at 11:59 AM, Chris Barker - NOAA Federal > > wrote: > >> > >> I think the folks that are concerned about this issue are quite right > >> ? most Python users equate immutable and hashable?so the dataclass API > >> should reflect that. > > > > Since they are *not* equivalent (consider a tuple containing a list) I'm > not > > at all convinced that any API in the core language should "reflect" this > > misconception, depending on how you meant that. > > Lists are themselves mutable, and hence inherently unhashable. > > Tuples are themselves immutable, and hence hashable if their contents are. > > I interpret Chris's comment as saying that data classes should behave > the same way that the builtin container types do: > > * if the data class itself is mutable (frozen=False, comparable to > list, dict, set), then it is *not* hashable (unless you explicitly > implement __hash__) > > * if the data class itself is immutable (frozen=True, comparable to > tuple or frozenset), then whether or not it is hashable depends on > whether or not the field values are hashable. > > It's the ability to ask the interpreter to guess what you mean > "frozen=False, hash=True" that creates the likelihood of confusion. 
> > Whereas if we leave out the "hash=True" option entirely, then the most > natural way to obtain a partially-mutable record, which has a fixed > comparison key and selectively mutable state, then the recommended way > of handling that would be through containment, where the mutable state > is moved out to a subrecord that gets excluded from hashes and > comparisons. > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Mon Feb 5 00:49:50 2018 From: guido at python.org (Guido van Rossum) Date: Sun, 4 Feb 2018 21:49:50 -0800 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com> Message-ID: Looks like this is turning into a major flamewar regardless of what I say. :-( I really don't want to lose the ability to add a hash function to a mutable dataclass by flipping a flag in the decorator. I'll explain below. But I am fine if this flag has a name that clearly signals it's an unsafe thing to do. I propose to replace the existing (as of 3.7.0b1) hash= keyword for the @dataclass decorator with a simpler flag named unsafe_hash=. This would be a simple bool (not a tri-state flag like the current hash=None|False|True). The default would be False, and the behavior then would be to add a hash function automatically only if it's safe (using the same rules as for hash=None currently). With unsafe_hash=True, a hash function would always be generated that takes all fields into account except those declared using field(hash=False). 
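A sketch of how the proposed flag would read in practice, assuming it lands as described here (the dataclass decorator released in Python 3.7 does accept unsafe_hash=). The Node class and its fields are hypothetical, chosen to resemble the parse-tree case.

```python
from dataclasses import dataclass, field


@dataclass(unsafe_hash=True)
class Node:
    """Hypothetical parse-tree node: mutable, but given a generated __hash__."""
    name: str
    line: int
    # hash=False keeps this field out of the generated __hash__;
    # compare=False also keeps it out of the generated __eq__.
    children: list = field(default_factory=list, hash=False, compare=False)


a = Node("expr", 1)
b = Node("expr", 1)
b.children.append(Node("term", 2))

# children take no part in equality or hashing, so a and b still match.
equal_nodes = a == b
same_hash = hash(a) == hash(b)

# The "unsafe" part: changing a hashed field changes the hash value.
hash_before = hash(a)
a.line = 99
hash_changed = hash(a) != hash_before
```

If a had been placed in a set or used as a dict key before line was reassigned, later lookups would most likely miss it, which is the danger the unsafe_ prefix is meant to advertise.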
If there's already a `def __hash__` in the function I don't care what it does, maybe it should raise rather than quietly doing nothing or quietly overwriting it. Here's my use case. A frozen class requires a lot of discipline, since you have to compute the values of all fields before calling the constructor. A mutable class allows other initialization patterns, e.g. manually setting some fields after the instance has been constructed, or having a separate non-dunder init() method. There may be good reasons for using these patterns, e.g. the object may be part of a cycle (e.g. parent/child links in a tree). Or you may just use one of these patterns because you're a pretty casual coder. Or you're modeling something external. My point is that once you have one of those patterns in place, changing your code to avoid them may be difficult. And yet your code may treat the objects as essentially immutable after the initialization phase (e.g. a parse tree). So if you create a dataclass and start coding like that for a while, and much later you need to put one of these into a set or use it as a dict key, switching to frozen=True may not be a quick option. And writing a __hash__ method by hand may feel like a lot of busywork. So this is where [unsafe_]hash=True would come in handy. I think naming the flag unsafe_hash should take away most objections, since it will be clear that this is not a safe thing to do. People who don't understand the danger are likely to copy a worse solution from StackOverflow anyway. The docs can point to frozen=True and explain the danger. -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg at krypto.org Mon Feb 5 00:57:02 2018 From: greg at krypto.org (Gregory P. 
Smith) Date: Mon, 05 Feb 2018 05:57:02 +0000 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com>

Message-ID: +1 using unsafe_hash as a name addresses my concern. It's a good signal that there are caveats worth considering. -gps On Sun, Feb 4, 2018, 9:50 PM Guido van Rossum wrote: > Looks like this is turning into a major flamewar regardless of what I say. > :-( > > I really don't want to lose the ability to add a hash function to a > mutable dataclass by flipping a flag in the decorator. I'll explain below. > But I am fine if this flag has a name that clearly signals it's an unsafe > thing to do. > > I propose to replace the existing (as of 3.7.0b1) hash= keyword for the > @dataclass decorator with a simpler flag named unsafe_hash=. This would be > a simple bool (not a tri-state flag like the current hash=None|False|True). > The default would be False, and the behavior then would be to add a hash > function automatically only if it's safe (using the same rules as for > hash=None currently). With unsafe_hash=True, a hash function would always > be generated that takes all fields into account except those declared using > field(hash=False). If there's already a `def __hash__` in the function I > don't care what it does, maybe it should raise rather than quietly doing > nothing or quietly overwriting it. > > Here's my use case. > > A frozen class requires a lot of discipline, since you have to compute the > values of all fields before calling the constructor. A mutable class allows > other initialization patterns, e.g. manually setting some fields after the > instance has been constructed, or having a separate non-dunder init() > method. There may be good reasons for using these patterns, e.g. the object > may be part of a cycle (e.g. parent/child links in a tree). Or you may just > use one of these patterns because you're a pretty casual coder. Or you're > modeling something external. > > My point is that once you have one of those patterns in place, changing > your code to avoid them may be difficult. 
And yet your code may treat the > objects as essentially immutable after the initialization phase (e.g. a > parse tree). So if you create a dataclass and start coding like that for a > while, and much later you need to put one of these into a set or use it as > a dict key, switching to frozen=True may not be a quick option. And writing > a __hash__ method by hand may feel like a lot of busywork. So this is where > [unsafe_]hash=True would come in handy. > > I think naming the flag unsafe_hash should take away most objections, > since it will be clear that this is not a safe thing to do. People who > don't understand the danger are likely to copy a worse solution from > StackOverflow anyway. The docs can point to frozen=True and explain the > danger. > > -- > --Guido van Rossum (python.org/~guido) > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > https://mail.python.org/mailman/options/python-dev/greg%40krypto.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Mon Feb 5 01:09:52 2018 From: chris.barker at noaa.gov (Chris Barker) Date: Mon, 5 Feb 2018 00:09:52 -0600 Subject: [Python-Dev] Immutability vs. hashability In-Reply-To: References:

Message-ID:

On Sun, Feb 4, 2018 at 7:54 PM, Nick Coghlan wrote:

> On 5 February 2018 at 08:31, Guido van Rossum wrote:
> > On Sun, Feb 4, 2018 at 11:59 AM, Chris Barker - NOAA Federal
> > wrote:
> >>
> >> I think the folks that are concerned about this issue are quite right
> >> -- most Python users equate immutable and hashable -- so the dataclass API
> >> should reflect that.
> >
> > Since they are *not* equivalent (consider a tuple containing a list) I'm not
> > at all convinced that any API in the core language should "reflect" this
> > misconception, depending on how you meant that.
>
> Lists are themselves mutable, and hence inherently unhashable.
>
> Tuples are themselves immutable, and hence hashable if their contents are.
>
> I interpret Chris's comment as saying that data classes should behave
> the same way that the builtin container types do:

Pretty much, yes. But a bit more detail -- I'm commenting on the API, not the capability. That is, since users often equate hashability and immutability, they will expect that if they say hash=True, they will get an immutable object, and if they say frozen=True, they will get something hashable (as long as the fields are hashable, just like a tuple).

That is, even though these concepts are independent, the defaults shouldn't reflect that.

> It's the ability to ask the interpreter to guess what you mean
> "frozen=False, hash=True" that creates the likelihood of confusion.

Actually, I think if the user does explicitly specify "frozen=False, hash=True", then that's what they should get. It's a pretty fragile beast, but apparently there's enough of a use case for folks to want it, and I don't think it's a confusing API.

-CHB

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
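For illustration only, here is a minimal sketch of the "frozen=True behaves like a tuple" expectation discussed in this message. It assumes the dataclasses module as shipped in Python 3.7; the class names Point and Tagged and the helper is_hashable are made up for the example and do not come from the thread.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class Point:
    # Hypothetical example class: frozen, and every field is hashable.
    x: int
    y: int


@dataclass(frozen=True)
class Tagged:
    # Hypothetical example class: frozen, but one field holds a list.
    name: str
    tags: List[str] = field(default_factory=list)


def is_hashable(obj) -> bool:
    """Return True if hash(obj) succeeds, False if it raises TypeError."""
    try:
        hash(obj)
    except TypeError:
        return False
    return True


print(is_hashable((1, 2)))        # tuple of ints
print(is_hashable((1, [2])))      # tuple containing a list
print(is_hashable(Point(1, 2)))   # frozen dataclass, hashable fields
print(is_hashable(Tagged("a")))   # frozen dataclass, list-valued field
```

In both pairs the container type is immutable, yet only the instance whose contents are all hashable can be hashed, which is the distinction between immutability and hashability being argued over.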
URL: From chris.barker at noaa.gov Mon Feb 5 01:19:10 2018 From: chris.barker at noaa.gov (Chris Barker) Date: Mon, 5 Feb 2018 00:19:10 -0600 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com>

Message-ID: On Sun, Feb 4, 2018 at 11:57 PM, Gregory P. Smith wrote: > +1 using unsafe_hash as a name addresses my concern. > mine too -- anyone surprised by using this deserves what they get :-) -CHB On Sun, Feb 4, 2018, 9:50 PM Guido van Rossum wrote: > >> Looks like this is turning into a major flamewar regardless of what I >> say. :-( >> >> I really don't want to lose the ability to add a hash function to a >> mutable dataclass by flipping a flag in the decorator. I'll explain below. >> But I am fine if this flag has a name that clearly signals it's an unsafe >> thing to do. >> >> I propose to replace the existing (as of 3.7.0b1) hash= keyword for the >> @dataclass decorator with a simpler flag named unsafe_hash=. This would be >> a simple bool (not a tri-state flag like the current hash=None|False|True). >> The default would be False, and the behavior then would be to add a hash >> function automatically only if it's safe (using the same rules as for >> hash=None currently). With unsafe_hash=True, a hash function would always >> be generated that takes all fields into account except those declared using >> field(hash=False). If there's already a `def __hash__` in the function I >> don't care what it does, maybe it should raise rather than quietly doing >> nothing or quietly overwriting it. >> >> Here's my use case. >> >> A frozen class requires a lot of discipline, since you have to compute >> the values of all fields before calling the constructor. A mutable class >> allows other initialization patterns, e.g. manually setting some fields >> after the instance has been constructed, or having a separate non-dunder >> init() method. There may be good reasons for using these patterns, e.g. the >> object may be part of a cycle (e.g. parent/child links in a tree). Or you >> may just use one of these patterns because you're a pretty casual coder. Or >> you're modeling something external. 
>> >> My point is that once you have one of those patterns in place, changing >> your code to avoid them may be difficult. And yet your code may treat the >> objects as essentially immutable after the initialization phase (e.g. a >> parse tree). So if you create a dataclass and start coding like that for a >> while, and much later you need to put one of these into a set or use it as >> a dict key, switching to frozen=True may not be a quick option. And writing >> a __hash__ method by hand may feel like a lot of busywork. So this is where >> [unsafe_]hash=True would come in handy. >> >> I think naming the flag unsafe_hash should take away most objections, >> since it will be clear that this is not a safe thing to do. People who >> don't understand the danger are likely to copy a worse solution from >> StackOverflow anyway. The docs can point to frozen=True and explain the >> danger. >> >> -- >> --Guido van Rossum (python.org/~guido) >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: https://mail.python.org/mailman/options/python-dev/ >> greg%40krypto.org >> > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > chris.barker%40noaa.gov > > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... 
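For illustration only, the proposal quoted above can be sketched as follows. This assumes the flag is available as unsafe_hash= on the @dataclass decorator (as it is in the released Python 3.7 dataclasses module); the Node and Plain classes are hypothetical and exist only to show the behaviour and the hazard the name warns about.

```python
from dataclasses import dataclass, field


@dataclass(unsafe_hash=True)
class Node:
    # Hypothetical parse-tree node: mutable, but hashable on request.
    kind: str
    value: int
    # Excluded from the generated __hash__ (it is still compared by __eq__).
    note: str = field(default="", hash=False)


@dataclass
class Plain:
    # Default behaviour: eq=True and frozen=False, so no __hash__ is added.
    value: int


a = Node("num", 1)
b = Node("num", 1, note="ignored by hash")
same_hash = hash(a) == hash(b)   # note does not take part in the hash

seen = {a}
a.value = 2                      # mutating after insertion changes the hash
still_found = a in seen          # the set can no longer locate the object

try:
    hash(Plain(1))
    plain_hashable = True
except TypeError:
    plain_hashable = False

print(same_hash, still_found, plain_hashable)
```

The lookup failure after the mutation is the danger being discussed: the generated hash is only reliable if the object is treated as immutable once it has been used as a set member or dict key.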
URL: From v+python at g.nevcal.com Mon Feb 5 02:28:03 2018 From: v+python at g.nevcal.com (Glenn Linderman) Date: Sun, 4 Feb 2018 23:28:03 -0800 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com>

Message-ID:

On 2/4/2018 9:49 PM, Guido van Rossum wrote:
> A frozen class requires a lot of discipline, since you have to compute
> the values of all fields before calling the constructor. A mutable
> class allows other initialization patterns, e.g. manually setting some
> fields after the instance has been constructed, or having a separate
> non-dunder init() method. There may be good reasons for using these
> patterns, e.g. the object may be part of a cycle (e.g. parent/child
> links in a tree). Or you may just use one of these patterns because
> you're a pretty casual coder. Or you're modeling something external.
>
> My point is that once you have one of those patterns in place,
> changing your code to avoid them may be difficult. And yet your code
> may treat the objects as essentially immutable after the
> initialization phase (e.g. a parse tree). So if you create a dataclass
> and start coding like that for a while, and much later you need to put
> one of these into a set or use it as a dict key, switching to
> frozen=True may not be a quick option. And writing a __hash__ method
> by hand may feel like a lot of busywork. So this is where
> [unsafe_]hash=True would come in handy.
>
> I think naming the flag unsafe_hash should take away most objections,
> since it will be clear that this is not a safe thing to do. People who
> don't understand the danger are likely to copy a worse solution from
> StackOverflow anyway. The docs can point to frozen=True and explain
> the danger.

This is an interesting use case. I haven't got the internals knowledge to know just how different mutable and immutable classes and objects are under the hood. But this use case makes me wonder whether, even at the cost of some of the performance that "normal" immutable classes and objects might obtain, it would be possible to use the various undisciplined initialization patterns as desired, followed by a declaration "This OBJECT is now immutable", which would calculate its HASH value and prevent future mutations of the object.

Yes, I'm aware that the decision for immutability has historically been made at the class level, not the object level, but in my ignorance of the internals, I wonder if that is necessary, for performance or, more importantly, for other reasons. And perhaps the implementation is internally almost like two classes, one mutable and the other immutable, and the declaration would convert the object from one to the other.

But if I say more, I'd just be babbling.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
From njs at pobox.com Mon Feb 5 03:11:35 2018
From: njs at pobox.com (Nathaniel Smith)
Date: Mon, 5 Feb 2018 00:11:35 -0800
Subject: [Python-Dev] Dataclasses and correct hashability
In-Reply-To:
References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com>

Message-ID:

On Sun, Feb 4, 2018 at 11:28 PM, Glenn Linderman wrote:
> This is an interesting use case. I haven't got the internals knowledge to
> know just how just different mutable and immutable classes and objects are
> under the hood. But this use case makes me wonder if, even at the cost of
> some performance that "normal" immutable classes and objects might obtain,
> if it would be possible to use the various undisciplined initialization
> patterns as desired, followed by as declaration "This OBJECT is now
> immutable" which would calculate its HASH value, and prevent future
> mutations of the object?

It would be technically possible to support something like

    @dataclass(freezable=True)
    class Foo:
        blah: int

    foo = Foo()
    # Initially, object is mutable, and hash(foo) raises an error
    foo.blah = 1
    assertRaises(hash, foo)

    # This method is automatically generated for classes with freezable=True
    foo.freeze()

    # Now object is immutable, and hash(foo) is allowed
    assertRaises(foo.__setattr__, "blah", 2)
    hash(foo)

I don't know if it's worth the complexity, but I guess it would cover
at least some of the use cases Guido raised.

-n

--
Nathaniel J. Smith -- https://vorpus.org
From v+python at g.nevcal.com Mon Feb 5 03:54:49 2018
From: v+python at g.nevcal.com (Glenn Linderman)
Date: Mon, 5 Feb 2018 00:54:49 -0800
Subject: [Python-Dev] Dataclasses and correct hashability
In-Reply-To:
References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com>

Message-ID: On 2/5/2018 12:11 AM, Nathaniel Smith wrote: > On Sun, Feb 4, 2018 at 11:28 PM, Glenn Linderman wrote: >> This is an interesting use case. I haven't got the internals knowledge to >> know just how just different mutable and immutable classes and objects are >> under the hood. But this use case makes me wonder if, even at the cost of >> some performance that "normal" immutable classes and objects might obtain, >> if it would be possible to use the various undisciplined initialization >> patterns as desired, followed by as declaration "This OBJECT is now >> immutable" which would calculate its HASH value, and prevent future >> mutations of the object? > It would be technically possible to support something like > > @dataclass(freezable=True) > class Foo: > blah: int > > foo = Foo() > # Initially, object is mutable, and hash(foo) raises an error > foo.blah = 1 > assertRaises(hash, foo) > > # This method is automatically generated for classes with freezable=True > foo.freeze() > > # Now object is immutable, and hash(foo) is allowed > assertRaises(foo.__setattr__, "blah", 2) > hash(foo) > > I don't know if it's worth the complexity, but I guess it would cover > at least some of the use cases Guido raised. > > -n > Thanks, Nathaniel, for confirming that what I was suggesting is not impossible, even if it turns out to be undesirable for some reason, or unwanted by anyone else. But I have encountered a subset of the use cases Guido mentioned, and had to make a 2nd class to gather/hold the values of the eventual immutable class, before I could make it, because pieces of the data for the class values were obtained from different sources at different times. Once all collected, then the immutability could be obtained, the rest of the processing performed. Thrashes the allocator pretty well doing it that way, but the job got done. -------------- next part -------------- An HTML attachment was scrubbed... 
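For illustration only, the "mutable until frozen" idea discussed in this exchange can be approximated in plain Python today. This is a rough sketch, not a dataclasses feature: the Freezable mixin, its freeze() method and the Foo class are all hypothetical names, and no freezable= option is assumed to exist on @dataclass.

```python
class Freezable:
    """Hypothetical mixin: instances are mutable until freeze() is called."""

    _frozen = False  # class-level default; freeze() sets it per instance

    def freeze(self):
        # Bypass our own __setattr__ guard to flip the flag.
        object.__setattr__(self, "_frozen", True)

    def __setattr__(self, name, value):
        if self._frozen:
            raise AttributeError(f"cannot assign to {name!r}: object is frozen")
        object.__setattr__(self, name, value)

    def __delattr__(self, name):
        if self._frozen:
            raise AttributeError(f"cannot delete {name!r}: object is frozen")
        object.__delattr__(self, name)

    def _key(self):
        # Attribute names are unique, so sorting never compares the values.
        return tuple(sorted(
            (k, v) for k, v in vars(self).items() if k != "_frozen"
        ))

    def __eq__(self, other):
        if type(other) is not type(self):
            return NotImplemented
        return self._key() == other._key()

    def __hash__(self):
        if not self._frozen:
            raise TypeError(f"unhashable until frozen: {type(self).__name__!r}")
        return hash(self._key())


class Foo(Freezable):
    def __init__(self, blah=None):
        self.blah = blah


foo = Foo()
foo.blah = 1          # still mutable; hash(foo) would raise TypeError here
foo.freeze()
print(hash(foo) == hash(foo))   # hashing is allowed once frozen
```

As with the gather-then-build approach described above, this only protects against ordinary attribute assignment; code that reaches for object.__setattr__ directly can still mutate a frozen instance.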
URL: From tjreedy at udel.edu Mon Feb 5 04:25:28 2018 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 5 Feb 2018 04:25:28 -0500 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com>

Message-ID:

On 2/5/2018 2:28 AM, Glenn Linderman wrote:

> This is an interesting use case. I haven't got the internals knowledge
> to know just how just different mutable and immutable classes and
> objects are under the hood.

I believe there is no internal difference. An object is immutable if there is no way to mutate it with Python code that does not poke into internals, such as one can do with ctypes or 3rd party extensions. Numbers and strings have no mutation methods, including no .__init__. A tuple is a fixed sequence of objects and has no .__init__. But if any object in a tuple is mutable, then the tuple is. But the tuple does not know its status, and there is no 'is_mutable' function. However, tuple.__hash__ calls the .__hash__ method of each object, and if that is missing for one, tuple.__hash__ raises.

>>> hash((1, 'a', []))
Traceback (most recent call last):
  File "", line 1, in
    hash((1, 'a', []))
TypeError: unhashable type: 'list'

The built-in immutable objects are mutated from their initial blank values in the C code of their .__new__ methods. So they are only 'immutable' once constructed. Guido pointed out that users constructing objects in Python code might reasonably do so other than only with .__new__, but still want to treat the object as immutable once constructed.

In Lisp, for instance, lists are actually trees. To be immutable, they can only be singly linked and must be constructed from leaf nodes to the root (or head). Python programmers should be able to link in both directions and start from the root, and still treat the result as frozen and hashable.

> But this use case makes me wonder if, even
> at the cost of some performance that "normal" immutable classes and
> objects might obtain, if it would be possible to use the various
> undisciplined initialization patterns as desired, followed by as
> declaration "This OBJECT is now immutable" which would calculate its
> HASH value, and prevent future mutations of the object?

Something like this has been proposed, at least for dicts, and rejected.

--
Terry Jan Reedy

From solipsis at pitrou.net Mon Feb 5 08:48:37 2018
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Mon, 5 Feb 2018 14:48:37 +0100
Subject: [Python-Dev] Immutability vs. hashability
References:
Message-ID: <20180205144837.79afa08b@fsol>

On Sun, 4 Feb 2018 14:31:06 -0800
Guido van Rossum wrote:
> On Sun, Feb 4, 2018 at 11:59 AM, Chris Barker - NOAA Federal <
> chris.barker at noaa.gov> wrote:
>
> > I think the folks that are concerned about this issue are quite right
> > -- most Python users equate immutable and hashable -- so the dataclass API
> > should reflect that.
> >
>
> Since they are *not* equivalent (consider a tuple containing a list) I'm
> not at all convinced that any API in the core language should "reflect"
> this misconception, depending on how you meant that.

+1 from me.

Regards

Antoine.

From ncoghlan at gmail.com Mon Feb 5 09:28:25 2018
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 6 Feb 2018 00:28:25 +1000
Subject: [Python-Dev] Dataclasses and correct hashability
In-Reply-To:
References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com>

Message-ID:

On 5 February 2018 at 15:49, Guido van Rossum wrote:
> My point is that once you have one of those patterns in place, changing your
> code to avoid them may be difficult. And yet your code may treat the objects
> as essentially immutable after the initialization phase (e.g. a parse tree).
> So if you create a dataclass and start coding like that for a while, and
> much later you need to put one of these into a set or use it as a dict key,
> switching to frozen=True may not be a quick option. And writing a __hash__
> method by hand may feel like a lot of busywork. So this is where
> [unsafe_]hash=True would come in handy.
>
> I think naming the flag unsafe_hash should take away most objections, since
> it will be clear that this is not a safe thing to do. People who don't
> understand the danger are likely to copy a worse solution from StackOverflow
> anyway. The docs can point to frozen=True and explain the danger.

Aye, calling the flag unsafe_hash would convert me from -1 to -0.

The remaining -0 is because I think there's a different and more
robust way to tackle your example use case:

    # Mutable initialization phase
    >>> from dataclasses import dataclass
    >>> @dataclass
    ... class Example:
    ...     a: int
    ...     b: int
    ...
    >>> c = Example(None, None)
    >>> c
    Example(a=None, b=None)
    >>> c.a = 1
    >>> c.b = 2
    >>> c
    Example(a=1, b=2)

    # Frozen usage phase
    >>> @dataclass(frozen=True)
    ... class LockedExample(Example):
    ...     pass
    ...
    >>> c.__class__ = LockedExample
    >>> c.a = 1
    Traceback (most recent call last):
      File "", line 1, in
      File "/home/ncoghlan/devel/cpython/Lib/dataclasses.py", line 448, in _frozen_setattr
        raise FrozenInstanceError(f'cannot assign to field {name!r}')
    dataclasses.FrozenInstanceError: cannot assign to field 'a'
    >>> c.b = 2
    Traceback (most recent call last):
      File "", line 1, in
      File "/home/ncoghlan/devel/cpython/Lib/dataclasses.py", line 448, in _frozen_setattr
        raise FrozenInstanceError(f'cannot assign to field {name!r}')
    dataclasses.FrozenInstanceError: cannot assign to field 'b'
    >>> hash(c)
    3713081631934410656

The gist of that approach is to assume that there will be *somewhere*
in the code where it's possible to declare the construction of the
instance "complete", and flip the nominal class over to the frozen
subclass to make further mutation unlikely, even though the true
underlying type is still the mutable version.

That said, if we do provide "unsafe_hash", then the documentation for
that flag becomes a place where we can explicitly suggest using a
frozen subclass instead.

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From kirillbalunov at gmail.com Mon Feb 5 04:59:29 2018
From: kirillbalunov at gmail.com (Kirill Balunov)
Date: Mon, 5 Feb 2018 12:59:29 +0300
Subject: [Python-Dev] Dataclasses and correct hashability
Message-ID:

>
> On Sun, Feb 4, 2018, 9:50 PM Guido van Rossum
> wrote:
>
> Looks like this is turning into a major flamewar regardless of what I say.
> :-(
> I really don't want to lose the ability to add a hash function to a
> mutable dataclass by flipping a flag in the decorator. I'll explain below.
> But I am fine if this flag has a name that clearly signals it's an unsafe
> thing to do.
>
> I propose to replace the existing (as of 3.7.0b1) hash= keyword for the
> @dataclass decorator with a simpler flag named unsafe_hash=. This would be
> a simple bool (not a tri-state flag like the current hash=None|False|True).
> The default would be False, and the behavior then would be to add a hash
> function automatically only if it's safe (using the same rules as for
> hash=None currently). With unsafe_hash=True, a hash function would always
> be generated that takes all fields into account except those declared using
> field(hash=False). If there's already a `def __hash__` in the function I
> don't care what it does, maybe it should raise rather than quietly doing
> nothing or quietly overwriting it.
>
> Here's my use case.
>
>

Maybe it is better to provide a special-purpose function `make_unsafe_hash` in the dataclasses module that patches a dataclass, instead of cluttering the @dataclass API with arguments for a rather special case.

This `unsafe_hash` argument will constantly raise questions among ordinary users like me, and may be seen as non-obvious design: there is a public API, but it is somehow unsafe. On the other hand, with a function, when a user asks how to make a `frozen=False` dataclass hashable, you can suggest either using this `make_unsafe_hash` function, with all the cautions in its docs, or implementing __hash__ yourself.

Also, given Python's approach to backward compatibility, it is better to stick with a function for now and, if it proves useful, add an `unsafe_hash` argument in Python 3.8. It is easier to add later than to deprecate in the future.

With kind regards,
-gdg
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
From guido at python.org Mon Feb 5 12:46:05 2018
From: guido at python.org (Guido van Rossum)
Date: Mon, 5 Feb 2018 09:46:05 -0800
Subject: [Python-Dev] Dataclasses and correct hashability
In-Reply-To:
References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com>

Message-ID: I'm sorry, but a solution that requires __class__ assignment is way too fragile for my taste. On Mon, Feb 5, 2018 at 6:28 AM, Nick Coghlan wrote: > On 5 February 2018 at 15:49, Guido van Rossum wrote: > > My point is that once you have one of those patterns in place, changing > your > > code to avoid them may be difficult. And yet your code may treat the > objects > > as essentially immutable after the initialization phase (e.g. a parse > tree). > > So if you create a dataclass and start coding like that for a while, and > > much later you need to put one of these into a set or use it as a dict > key, > > switching to frozen=True may not be a quick option. And writing a > __hash__ > > method by hand may feel like a lot of busywork. So this is where > > [unsafe_]hash=True would come in handy. > > > > I think naming the flag unsafe_hash should take away most objections, > since > > it will be clear that this is not a safe thing to do. People who don't > > understand the danger are likely to copy a worse solution from > StackOverflow > > anyway. The docs can point to frozen=True and explain the danger. > > Aye, calling the flag unsafe_hash would convert me from -1 to -0. > > The remaining -0 is because I think there's a different and more > robust way to tackle your example use case: > > # Mutable initialization phase > >>> from dataclasses import dataclass > >>> @dataclass > ... class Example: > ... a: int > ... b: int > ... > >>> c = Example(None, None) > >>> c > Example(a=None, b=None) > >>> c.a = 1 > >>> c.b = 2 > >>> c > Example(a=1, b=2) > > > # Frozen usage phase > >>> @dataclass(frozen=True) > ... class LockedExample(Example): > ... pass > ... 
> >>> c.__class__ = LockedExample > >>> c.a = 1 > Traceback (most recent call last): > File "", line 1, in > File "/home/ncoghlan/devel/cpython/Lib/dataclasses.py", line 448, > in _frozen_setattr > raise FrozenInstanceError(f'cannot assign to field {name!r}') > dataclasses.FrozenInstanceError: cannot assign to field 'a' > >>> c.b = 2 > Traceback (most recent call last): > File "", line 1, in > File "/home/ncoghlan/devel/cpython/Lib/dataclasses.py", line 448, > in _frozen_setattr > raise FrozenInstanceError(f'cannot assign to field {name!r}') > dataclasses.FrozenInstanceError: cannot assign to field 'b' > >>> hash(c) > 3713081631934410656 > > The gist of that approach is to assume that there will be *somewhere* > in the code where it's possible to declare the construction of the > instance "complete", and flip the nominal class over to the frozen > subclass to make further mutation unlikely, even though the true > underlying type is still the mutable version. > > That said, if we do provide "unsafe_hash", then the documentation for > that flag becomes a place where we can explicitly suggest using a > frozen subclass instead. > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Mon Feb 5 12:47:09 2018 From: guido at python.org (Guido van Rossum) Date: Mon, 5 Feb 2018 09:47:09 -0800 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: Message-ID: If there's going to be an API for it, it should be in the class, not something that mutates the class afterwards. On Mon, Feb 5, 2018 at 1:59 AM, Kirill Balunov wrote: > On Sun, Feb 4, 2018, 9:50 PM Guido van Rossum > wrote: >> >> Looks like this is turning into a major flamewar regardless of what I say. 
>> :-( >> I really don't want to lose the ability to add a hash function to a >> mutable dataclass by flipping a flag in the decorator. I'll explain below. >> But I am fine if this flag has a name that clearly signals it's an unsafe >> thing to do. >> >> I propose to replace the existing (as of 3.7.0b1) hash= keyword for the >> @dataclass decorator with a simpler flag named unsafe_hash=. This would be >> a simple bool (not a tri-state flag like the current hash=None|False|True). >> The default would be False, and the behavior then would be to add a hash >> function automatically only if it's safe (using the same rules as for >> hash=None currently). With unsafe_hash=True, a hash function would always >> be generated that takes all fields into account except those declared using >> field(hash=False). If there's already a `def __hash__` in the function I >> don't care what it does, maybe it should raise rather than quietly doing >> nothing or quietly overwriting it. >> >> Here's my use case. >> >> > May be it is better to provide a special purpose function > `make_unsafe_hash` in > dataclass module which will patch a dataclass, instead of to clutter > @dataclass > API with arguments which are rather special case. > > This `unsafe_hash` argument will constantly raise questions among ordinary > users > like me, and will be possibly considered as a non-obvious design - there > is a > public API but it is somehow unsafe. On the other hand, with a function, > when > the user asks how to make a `frozen=False` dataclass hashable, you can > suggest > to use this `make_unsafe_hash` function with all its cautions in its docs > or to try to > implement __hash__ by yourself. > > Also taking into account the Python approach for backward compatibility it > is > better to stick with function and if it will be usefull to add a > `unsafe_hash` > argument in Python 3.8. It is easier to add later than to deprecate in the > future. 
> > With kind regards, > -gdg > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > guido%40python.org > > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From mertz at gnosis.cx Mon Feb 5 13:50:21 2018 From: mertz at gnosis.cx (David Mertz) Date: Mon, 5 Feb 2018 10:50:21 -0800 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: <3951276.IZdOgMHjlT@hammer.magicstack.net> <15d2818e-5c7b-ea5f-3a8f-e5edd482b5be@trueblade.com> <1906618.bnMMLoM6on@hammer.magicstack.net> <727029d5-3c0b-2242-001e-15a25253ca37@trueblade.com>

Message-ID: Absolutely I agree. 'unsafe_hash' as a name is clear warning to users. On Feb 4, 2018 10:43 PM, "Chris Barker" wrote: On Sun, Feb 4, 2018 at 11:57 PM, Gregory P. Smith wrote: > +1 using unsafe_hash as a name addresses my concern. > mine too -- anyone surprised by using this deserves what they get :-) -CHB On Sun, Feb 4, 2018, 9:50 PM Guido van Rossum wrote: > >> Looks like this is turning into a major flamewar regardless of what I >> say. :-( >> >> I really don't want to lose the ability to add a hash function to a >> mutable dataclass by flipping a flag in the decorator. I'll explain below. >> But I am fine if this flag has a name that clearly signals it's an unsafe >> thing to do. >> >> I propose to replace the existing (as of 3.7.0b1) hash= keyword for the >> @dataclass decorator with a simpler flag named unsafe_hash=. This would be >> a simple bool (not a tri-state flag like the current hash=None|False|True). >> The default would be False, and the behavior then would be to add a hash >> function automatically only if it's safe (using the same rules as for >> hash=None currently). With unsafe_hash=True, a hash function would always >> be generated that takes all fields into account except those declared using >> field(hash=False). If there's already a `def __hash__` in the function I >> don't care what it does, maybe it should raise rather than quietly doing >> nothing or quietly overwriting it. >> >> Here's my use case. >> >> A frozen class requires a lot of discipline, since you have to compute >> the values of all fields before calling the constructor. A mutable class >> allows other initialization patterns, e.g. manually setting some fields >> after the instance has been constructed, or having a separate non-dunder >> init() method. There may be good reasons for using these patterns, e.g. the >> object may be part of a cycle (e.g. parent/child links in a tree). Or you >> may just use one of these patterns because you're a pretty casual coder. 
Or >> you're modeling something external. >> >> My point is that once you have one of those patterns in place, changing >> your code to avoid them may be difficult. And yet your code may treat the >> objects as essentially immutable after the initialization phase (e.g. a >> parse tree). So if you create a dataclass and start coding like that for a >> while, and much later you need to put one of these into a set or use it as >> a dict key, switching to frozen=True may not be a quick option. And writing >> a __hash__ method by hand may feel like a lot of busywork. So this is where >> [unsafe_]hash=True would come in handy. >> >> I think naming the flag unsafe_hash should take away most objections, >> since it will be clear that this is not a safe thing to do. People who >> don't understand the danger are likely to copy a worse solution from >> StackOverflow anyway. The docs can point to frozen=True and explain the >> danger. >> >> -- >> --Guido van Rossum (python.org/~guido) >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> https://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: https://mail.python.org/mailman/options/python-dev/greg% >> 40krypto.org >> > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/chris. > barker%40noaa.gov > > -- Christopher Barker, Ph.D. 
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov _______________________________________________ Python-Dev mailing list Python-Dev at python.org https://mail.python.org/mailman/listinfo/python-dev Unsubscribe: https://mail.python.org/mailman/options/python-dev/ mertz%40gnosis.cx -------------- next part -------------- An HTML attachment was scrubbed... URL: From kirillbalunov at gmail.com Mon Feb 5 14:09:47 2018 From: kirillbalunov at gmail.com (Kirill Balunov) Date: Mon, 5 Feb 2018 22:09:47 +0300 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: Message-ID: 2018-02-05 20:47 GMT+03:00 Guido van Rossum : > If there's going to be an API for it, it should be in the class, not > something that mutates the class afterwards. > I apologize and don't want to make unnecessary noise. But the already selected design with decorator @dataclass implies that it will mutate the freshly created class (which in its turn already limits some possibilities), or I've missed something? If you meant that everything should be defined in one place, then I basically understand your desire as the least of two evils. With kind regards, -gdg -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Mon Feb 5 14:12:36 2018 From: guido at python.org (Guido van Rossum) Date: Mon, 5 Feb 2018 11:12:36 -0800 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: Message-ID: Yes, that's what I meant -- "afterwards" meaning after the @dataclass decorator is applied. On Mon, Feb 5, 2018 at 11:09 AM, Kirill Balunov wrote: > > 2018-02-05 20:47 GMT+03:00 Guido van Rossum : > >> If there's going to be an API for it, it should be in the class, not >> something that mutates the class afterwards. >> > > > I apologize and don't want to make unnecessary noise. 
But the already > selected design with decorator @dataclass implies that it will mutate > the freshly created class (which in its turn already limits some > possibilities), or I've missed something? If you meant that everything > should be defined in one place, then I basically understand your desire as > the least of two evils. > > With kind regards, > -gdg > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From python-dev at mgmiller.net Mon Feb 5 15:19:33 2018 From: python-dev at mgmiller.net (Mike Miller) Date: Mon, 5 Feb 2018 12:19:33 -0800 Subject: [Python-Dev] [RELEASE] Python 3.7.0b1 is now available for testing In-Reply-To: <9425596C-A92F-4B10-A8B7-98F4E827E8D0@python.org> References: <9425596C-A92F-4B10-A8B7-98F4E827E8D0@python.org> Message-ID: <8cdb5984-0148-1853-e9ac-9dfcbc6e76a4@mgmiller.net> On 2018-01-31 17:34, Ned Deily wrote: > Please see "What's New In Python 3.7" for more information. > Additional documentation for these features and for other changes > will be provided during the beta phase. > > https://docs.python.org/3.7/whatsnew/3.7.html I see that the new classmethod fromisoformat in datetime arrived (thanks all) but it hasn't yet received a blurb in the what's new page. https://docs.python.org/3.7/library/datetime.html#datetime.datetime.fromisoformat -Mike From paul at ganssle.io Mon Feb 5 16:06:17 2018 From: paul at ganssle.io (Paul G) Date: Mon, 5 Feb 2018 16:06:17 -0500 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: Message-ID: <94a8ddd0-e54f-8b6c-713f-de6974e386e7@ganssle.io> I don't think it matters so much whether you are stacking two decorators or a single decorator, but would an @add_unsafe_hash decorator be useful for anything *except* data classes? If not, then there's no point in having a *second* decorator that can *only* modify the first one - particularly considering @dataclass actually takes arguments.
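For concreteness, a minimal sketch of the single-decorator spelling discussed in this thread. It assumes the flag ends up named unsafe_hash= as proposed upthread and that field(hash=False) keeps its current meaning. The Node class, its fields, and the variable names are made up for illustration; this is not code from any of the posters.

```python
from dataclasses import dataclass, field
from typing import Optional


# Assumption: the decorator accepts unsafe_hash=True as proposed in this thread.
@dataclass(unsafe_hash=True)
class Node:
    # Hypothetical parse-tree node; names are illustrative only.
    label: str
    # Left out of hashing and comparison so parent/child cycles cannot recurse.
    parent: Optional["Node"] = field(
        default=None, hash=False, compare=False, repr=False
    )


root = Node("root")
child = Node("child")
child.parent = root  # set after construction, the pattern a frozen class forbids

# The mutable instance is usable as a dict key.
index = {child: "first visit"}

# An equal instance (same label) hashes the same and finds the entry.
same = Node("child")
found = index[same]

# The unsafe part: after this assignment hash(child) is computed from the new
# label, so lookups for an entry stored under the old hash will generally miss.
child.label = "renamed"
```

The hazard is confined to mutating a hashed field after the object has been put in a set or dict, which is why the name carries the warning and the docs can point to frozen=True as the safe alternative.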
On 02/05/2018 02:12 PM, Guido van Rossum wrote: > Yes, that's what I meant -- "afterwards" meaning after the @dataclass > decorator is applied. > > On Mon, Feb 5, 2018 at 11:09 AM, Kirill Balunov > wrote: > >> >> 2018-02-05 20:47 GMT+03:00 Guido van Rossum : >> >>> If there's going to be an API for it, it should be in the class, not >>> something that mutates the class afterwards. >>> >> >> >> I apologize and don't want to make unnecessary noise. But the already >> selected design with decorator @dataclass implies that it will mutate >> the freshly created class (which in its turn already limits some >> possibilities), or I've missed something? If you meant that everything >> should be defined in one place, then I basically understand your desire as >> the least of two evils. >> >> With kind regards, >> -gdg >> > > > > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/paul%40ganssle.io > -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: OpenPGP digital signature URL: From matt.priest at intel.com Mon Feb 5 17:41:31 2018 From: matt.priest at intel.com (Priest, Matt) Date: Mon, 5 Feb 2018 22:41:31 +0000 Subject: [Python-Dev] libxml2 installation/binding issue Message-ID: <68B69E4C07B04A488BA8528DF70CA165844150BC@fmsmsx116.amr.corp.intel.com> Hello, I am not sure if this is the correct place to post an issue/question like this, but here goes... I've successfully (?) installed Python 3.6.4 and libxml2, with the ultimate goal of installing GTK+ 3.22.0. However, I'm running into this error: python3 Python 3.6.4 (default, Feb 5 2018, 13:28:04) [GCC 4.7.2] on linux Type "help", "copyright", "credits" or "license" for more information. 
>>> import libxml2 Traceback (most recent call last): File "", line 1, in File "/nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/development/sfwr/lib/python3.6/site-packages/libxml2.py", line 1, in import libxml2mod ImportError: /nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/development/sfwr/lib/python3.6/site-packages/libxml2mod.so: undefined symbol: _PyVerify_fd Here are the details on the version, cflags, and ldflags. python3 --version ; Python 3.6.4 python3-config --cflags -I/nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/development/sfwr/include/python3.6m -I/nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/development/sfwr/include/python3.6m -Wno-unused-result -Wsign-compare -fPIC -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes python3-config -ldflags; -L/nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/development/sfwr/lib/python3.6/config-3.6m-x86_64-linux-gnu -L/nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/development/sfwr/lib -lpython3.6m -lpthread -ldl -lutil -lrt -lm -Xlinker -export-dynamic Any help or hint would be appreciated... Matt -------------- next part -------------- An HTML attachment was scrubbed... URL: From ethan at ethanhs.me Mon Feb 5 17:55:21 2018 From: ethan at ethanhs.me (Ethan Smith) Date: Mon, 5 Feb 2018 14:55:21 -0800 Subject: [Python-Dev] libxml2 installation/binding issue In-Reply-To: <68B69E4C07B04A488BA8528DF70CA165844150BC@fmsmsx116.amr.corp.intel.com> References: <68B69E4C07B04A488BA8528DF70CA165844150BC@fmsmsx116.amr.corp.intel.com> Message-ID: This list is for the discussion of development *of* Python. For discussion of development *with* Python, you want python-list. On Mon, Feb 5, 2018 at 2:41 PM, Priest, Matt wrote: > Hello, > > > > I am not sure if this is the correct place to post an issue/question like > this, but here goes... > > > > I've successfully (?) installed Python 3.6.4 and libxml2, with the > ultimate goal of installing GTK+ 3.22.0.
> > However, I'm running into this error: > > > > > > python3 > > Python 3.6.4 (default, Feb 5 2018, 13:28:04) > > [GCC 4.7.2] on linux > > Type "help", "copyright", "credits" or "license" for more information. > > >>> import libxml2 > > Traceback (most recent call last): > > File "", line 1, in > > File "/nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/ > development/sfwr/lib/python3.6/site-packages/libxml2.py", line 1, in > > > import libxml2mod > > ImportError: /nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/ > development/sfwr/lib/python3.6/site-packages/libxml2mod.so: undefined > symbol: _PyVerify_fd > > > > > > Here are the details on the version, cflags, and ldflags. > > python3 --version ; > > Python 3.6.4 > > python3-config --cflags > > -I/nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/ > development/sfwr/include/python3.6m > > -I/nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/ > development/sfwr/include/python3.6m > > -Wno-unused-result > > -Wsign-compare > > -fPIC -DNDEBUG > > -g > > -fwrapv > > -O3 > > -Wall > > -Wstrict-prototypes > > > > python3-config -ldflags; > > -L/nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/ > development/sfwr/lib/python3.6/config-3.6m-x86_64-linux-gnu > > -L/nfs/sc/disks/slx_1353/mlpriest/sl1/work_root/a0/development/sfwr/lib > > -lpython3.6m > > -lpthread > > -ldl > > -lutil > > -lrt > > -lm > > -Xlinker > > -export-dynamic > > > > Any help or hint would be appreciated... > > > > > > > > Matt > > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > https://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: https://mail.python.org/mailman/options/python-dev/ > ethan%40ethanhs.me > > -------------- next part -------------- An HTML attachment was scrubbed...
URL: From ncoghlan at gmail.com Mon Feb 5 17:56:40 2018 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 6 Feb 2018 08:56:40 +1000 Subject: [Python-Dev] Dataclasses and correct hashability In-Reply-To: References: Message-ID: