From me at louie.lu Tue Aug 1 09:01:30 2017
From: me at louie.lu (Louie Lu)
Date: Tue, 1 Aug 2017 21:01:30 +0800
Subject: [Python-ideas] "any" and "all" support multiple arguments
Message-ID:

Hi all,

The "min" and "max" built-in functions support two styles of args:

min(...)
    min(iterable, *[, default=obj, key=func]) -> value
    min(arg1, arg2, *args, *[, key=func]) -> value

But "any" and "all" only support an iterable:

all(iterable, /)
    Return True if bool(x) is True for all values x in the iterable.

I'm not sure if this has been discussed before, but can "any" and "all"
support the min/max-like "arg1, arg2, *args" style?

Thanks,
Louie.

From p.f.moore at gmail.com Tue Aug 1 09:24:47 2017
From: p.f.moore at gmail.com (Paul Moore)
Date: Tue, 1 Aug 2017 14:24:47 +0100
Subject: [Python-ideas] "any" and "all" support multiple arguments
In-Reply-To:
References:
Message-ID:

On 1 August 2017 at 14:01, Louie Lu wrote:
> I'm not sure if this has been discussed before, but can "any" and "all"
> support the min/max-like "arg1, arg2, *args" style?

I don't see any particular reason why not, but is there a specific use
case for this or is it just a matter of consistency? Unlike max and
min, we already have operators in this case (and/or). I'd imagine that
if I had a use for any(a, b, c) I'd write it as a or b or c, and for
all(a, b, c) I'd write a and b and c.

Paul

From markusmeskanen at gmail.com Tue Aug 1 09:32:38 2017
From: markusmeskanen at gmail.com (Markus Meskanen)
Date: Tue, 1 Aug 2017 16:32:38 +0300
Subject: [Python-ideas] "any" and "all" support multiple arguments
In-Reply-To:
References:
Message-ID:

I'd be more interested in supporting the "key" function:

any(users, key=User.is_admin)

As opposed to:

any(user.is_admin() for user in users)

1.8.2017 16.07 "Louie Lu" wrote:

Hi all,

The "min" and "max" built-in functions support two styles of args:

min(...)
    min(iterable, *[, default=obj, key=func]) -> value
    min(arg1, arg2, *args, *[, key=func]) -> value

But "any" and "all" only support an iterable:

all(iterable, /)
    Return True if bool(x) is True for all values x in the iterable.

I'm not sure if this has been discussed before, but can "any" and "all"
support the min/max-like "arg1, arg2, *args" style?

Thanks,
Louie.
_______________________________________________
Python-ideas mailing list
Python-ideas at python.org
https://mail.python.org/mailman/listinfo/python-ideas
Code of Conduct: http://python.org/psf/codeofconduct/
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From ned at nedbatchelder.com Tue Aug 1 09:43:04 2017
From: ned at nedbatchelder.com (Ned Batchelder)
Date: Tue, 1 Aug 2017 09:43:04 -0400
Subject: [Python-ideas] "any" and "all" support multiple arguments
In-Reply-To:
References:
Message-ID:

I find it frustrating that they always return booleans. It would be more
useful if any() returned the first true value it finds. This seems like a
backward-compatible-enough change to me... :)

--Ned.

On 8/1/17 9:32 AM, Markus Meskanen wrote:
> I'd be more interested in supporting the "key" function:
>
> any(users, key=User.is_admin)
>
> As opposed to:
>
> any(user.is_admin() for user in users)
>
> 1.8.2017 16.07 "Louie Lu" wrote:
>
> Hi all,
>
> The "min" and "max" built-in functions support two styles of args:
>
> min(...)
>     min(iterable, *[, default=obj, key=func]) -> value
>     min(arg1, arg2, *args, *[, key=func]) -> value
>
> But "any" and "all" only support an iterable:
>
> all(iterable, /)
>     Return True if bool(x) is True for all values x in the iterable.
>
> I'm not sure if this has been discussed before, but can "any" and "all"
> support the min/max-like "arg1, arg2, *args" style?
>
> Thanks,
> Louie.
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From ncoghlan at gmail.com Tue Aug 1 11:16:02 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 2 Aug 2017 01:16:02 +1000
Subject: [Python-ideas] "any" and "all" support multiple arguments
In-Reply-To:
References:
Message-ID:

On 1 August 2017 at 23:43, Ned Batchelder wrote:
> I find it frustrating that they always return booleans. It would be more
> useful if any() returned the first true value it finds. This seems like a
> backward-compatible-enough change to me... :)

While I'm not sure how to interpret that smiley, I figure it's worth
making it explicit that this is decidedly *not* true given
type-dependent serialisation protocols like JSON:

    >>> import json
    >>> class MyClass:
    ...     def __bool__(self):
    ...         return True
    ...
    >>> json.dumps(any([MyClass()]))
    'true'
    >>> json.dumps(MyClass())
    Traceback (most recent call last):
      ...
    TypeError: Object of type 'MyClass' is not JSON serializable

The idea of elevating first_true from its current status as an
itertools recipe to actually being an itertools module API has
certainly come up before, though.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From ncoghlan at gmail.com Tue Aug 1 11:28:24 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 2 Aug 2017 01:28:24 +1000
Subject: [Python-ideas] "any" and "all" support multiple arguments
In-Reply-To:
References:
Message-ID:

On 1 August 2017 at 23:24, Paul Moore wrote:
> On 1 August 2017 at 14:01, Louie Lu wrote:
>> I'm not sure if this has been discussed before, but can "any" and "all"
>> support the min/max-like "arg1, arg2, *args" style?
>
> I don't see any particular reason why not, but is there a specific use
> case for this or is it just a matter of consistency? Unlike max and
> min, we already have operators in this case (and/or). I'd imagine that
> if I had a use for any(a, b, c) I'd write it as a or b or c, and for
> all(a, b, c) I'd write a and b and c.

Right, the main correspondence here is with "sum()": folks can't write
"sum(a, b, c)", but they can write "a + b + c".

The various container constructors are also consistent in only taking
an iterable, with multiple explicit items being expected to use the
syntactic forms (e.g. [a, b, c], {a, b, c}, (a, b, c))

The same rationale holds for any() and all(): supporting multiple
positional arguments would be redundant with the existing binary
operator syntax, with no clear reason to ever prefer one option over
the other.

Cheers,
Nick.
-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From cpitclaudel at gmail.com Tue Aug 1 12:57:30 2017
From: cpitclaudel at gmail.com (Clément Pit-Claudel)
Date: Tue, 1 Aug 2017 18:57:30 +0200
Subject: [Python-ideas] "any" and "all" support multiple arguments
In-Reply-To:
References:
Message-ID:

On 2017-08-01 17:28, Nick Coghlan wrote:
> Right, the main correspondence here is with "sum()": folks can't write
> "sum(a, b, c)", but they can write "a + b + c".
>
> The various container constructors are also consistent in only taking
> an iterable, with multiple explicit items being expected to use the
> syntactic forms (e.g. [a, b, c], {a, b, c}, (a, b, c))
>
> The same rationale holds for any() and all(): supporting multiple
> positional arguments would be redundant with the existing binary
> operator syntax, with no clear reason to ever prefer one option over
> the other.

Isn't there a difference, though, insofar as we don't have a '+/sum' or
'and/all' equivalent of [a, b, *c]?
You need to write 1 + 3 + sum(xs), or a and b and all(ys). Or, of
course, any(chain([a], [b], c)), but that is not pretty.

Clément.

From lucas.wiman at gmail.com Tue Aug 1 13:22:03 2017
From: lucas.wiman at gmail.com (Lucas Wiman)
Date: Tue, 1 Aug 2017 10:22:03 -0700
Subject: [Python-ideas] "any" and "all" support multiple arguments
In-Reply-To:
References:
Message-ID:

On Tue, Aug 1, 2017 at 6:01 AM, Louie Lu wrote:

> [...]
> I'm not sure if this has been discussed before, but can "any" and "all"
> support the min/max-like "arg1, arg2, *args" style?

Can this be done consistently? For example consider x=[[]]. Then all(x)
where x is interpreted as an iterable should be False, but all(x) where x
is interpreted as a single argument should be True.

This inconsistency already exists for max:

>>> max({1, 2})
2
>>> max({1, 2}, {1})
set([1, 2])

However, it doesn't seem like there's a good reason to add an
inconsistency to the API for any/all.

- Lucas
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From tjreedy at udel.edu Tue Aug 1 16:51:41 2017
From: tjreedy at udel.edu (Terry Reedy)
Date: Tue, 1 Aug 2017 16:51:41 -0400
Subject: [Python-ideas] "any" and "all" support multiple arguments
In-Reply-To:
References:
Message-ID:

On 8/1/2017 9:01 AM, Louie Lu wrote:
> Hi all,
>
> The "min" and "max" built-in functions support two styles of args:
>
> min(...)
>     min(iterable, *[, default=obj, key=func]) -> value
>     min(arg1, arg2, *args, *[, key=func]) -> value

To me, two APIs is a nuisance. For one thing, default has to be keyword
only and not just optional. Compare with sum:

>>> sum((2,3,4),5)
14
>>> min((2,3,4),5)  # Py3
Traceback (most recent call last):
  File "", line 1, in
    min((2,3,4),5)
TypeError: '<' not supported between instances of 'int' and 'tuple'
>>> min((2,3,4),5)  # Py2
5
>>> min(5, (2,3,4))
5

I believe that a version of the second was in original Python (and at
least in 1.3) whereas the first was added later, likely with the new
iterator protocol (2.2). In any case, with *unpacking in displays, the
second is no longer needed.

>>> min(4,3, *[1,2])
1
>>> min((4,3, *[1,2]))
1

If I am correct, perhaps the doc for max and min in
https://docs.python.org/3/library/functions.html#max should mention
that the 2nd is derived from the original syntax, kept for back
compatibility (rather than a new innovation, to be imitated). I would
rather get rid of the exceptional case than emulate it.
> But "any" and "all" only support an iterable:
>
> all(iterable, /)
>     Return True if bool(x) is True for all values x in the iterable.

As Nick pointed out, this is standard now.

>>> list((1,2,3))
[1, 2, 3]
>>> list(1,2,3)
Traceback (most recent call last):
  File "", line 1, in
    list(1,2,3)
TypeError: list() takes at most 1 argument (3 given)

-- 
Terry Jan Reedy

From mistersheik at gmail.com Tue Aug 1 22:48:58 2017
From: mistersheik at gmail.com (Neil Girdhar)
Date: Tue, 1 Aug 2017 19:48:58 -0700 (PDT)
Subject: [Python-ideas] "any" and "all" support multiple arguments
In-Reply-To:
References:
Message-ID: <3288d6b5-ebc7-4837-96b2-311e4ad1e69f@googlegroups.com>

On Tuesday, August 1, 2017 at 12:58:24 PM UTC-4, Clément Pit-Claudel wrote:
>
> On 2017-08-01 17:28, Nick Coghlan wrote:
> > Right, the main correspondence here is with "sum()": folks can't write
> > "sum(a, b, c)", but they can write "a + b + c".
> >
> > The various container constructors are also consistent in only taking
> > an iterable, with multiple explicit items being expected to use the
> > syntactic forms (e.g. [a, b, c], {a, b, c}, (a, b, c))
> >
> > The same rationale holds for any() and all(): supporting multiple
> > positional arguments would be redundant with the existing binary
> > operator syntax, with no clear reason to ever prefer one option over
> > the other.
>
> Isn't there a difference, though, insofar as we don't have a '+/sum' or
> 'and/all' equivalent of [a, b, *c]?
> You need to write 1 + 3 + sum(xs), or a and b and all(ys). Or, of course,
> any(chain([a], [b], c)), but that is not pretty.
>

a or b or any(c) seems clear to me.

> Clément.
> _______________________________________________
> Python-ideas mailing list
> Python... at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From ncoghlan at gmail.com Wed Aug 2 11:06:22 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 3 Aug 2017 01:06:22 +1000
Subject: [Python-ideas] "any" and "all" support multiple arguments
In-Reply-To:
References:
Message-ID:

On 2 August 2017 at 02:57, Clément Pit-Claudel wrote:
> On 2017-08-01 17:28, Nick Coghlan wrote:
>> The same rationale holds for any() and all(): supporting multiple
>> positional arguments would be redundant with the existing binary
>> operator syntax, with no clear reason to ever prefer one option over
>> the other.
>
> Isn't there a difference, though, insofar as we don't have a '+/sum' or
> 'and/all' equivalent of [a, b, *c]?
> You need to write 1 + 3 + sum(xs), or a and b and all(ys). Or, of course,
> any(chain([a], [b], c)), but that is not pretty.

Function calls create an argument tuple anyway, so writing "any(a, b,
*ys)" wouldn't actually be significantly more efficient than the
current "any((a, b, *ys))" (note the doubled parentheses). You'd
potentially save the allocation of a single element tuple to hold the
full tuple, but single element tuples are pretty cheap in the grand
scheme of things, and Python interpreter implementations often attempt
to avoid creating one in the single-positional argument case (since
they'd just need to unpack it again to stick it in the corresponding
parameter slot).
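For example, the doubled-parentheses spelling already works today, but
note that the unpacking in the tuple display happens eagerly, before
any() sees a single value (a quick interactive sketch):

    >>> a, b = False, False
    >>> ys = iter([False, True])
    >>> any((a, b, *ys))    # one tuple argument, not three positional args
    True
    >>> next(ys, "exhausted")    # ys was consumed up front by the unpacking
    'exhausted'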
This means that in the case where what you actually want is lazy iteration over the trailing iterable, then you have to use the itertools.chain form: "any(chain((a, b), ys))" The chained binary operator forms also both seem clearer to me than either "sum(1, 3, *xs)" or "any(a, b, *ys)", as those formulations require that the reader know a Python-specific idiosyncratic concept and notation (iterable unpacking), while the binary operator based forms can be interpreted correctly based solely on knowledge of either arithmetic ("+", "sum") or logic ("and", "all"). So while this is an entirely reasonable design question to ask, it turns out there are a few good reasons not to actually make the change: - it doesn't add expressiveness to the language (the binary operator forms already exist, as does the double-parenthesis form) - it doesn't add readability to the language (the iterable unpacking form requires more assumed knowledge than the binary operator form) - it doesn't improve the efficiency of the language (iterable unpacking is an eager operation, not a lazy one, even in function calls) - min() and max() are actually the odd ones out here (for historical reasons), not any(), all() Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From paul_laos at outlook.com Fri Aug 4 03:39:56 2017 From: paul_laos at outlook.com (Paul Laos) Date: Fri, 4 Aug 2017 07:39:56 +0000 Subject: [Python-ideas] Pseudo methods Message-ID: Hi folks I was thinking about how sometimes, a function sometimes acts on classes, and behaves very much like a method. Adding new methods to classes existing classes is currently somewhat difficult, and having pseudo methods would make that easier. Code example: (The syntax can most likely be improved upon) def has_vowels(self: str): for vowel in ["a", "e,", "i", "o", "u"]: if vowel in self: return True This allows one to wring `string.has_vowels()` instead of `has_vowels(string)`, which would make it easier to read, and would make it easier to add functionality to existing classes, without having to extend them. This would be useful for builtins or imported libraries, so one can fill in "missing" methods. * Simple way to extend classes * Improves readability * Easy to understand ~Paul -------------- next part -------------- An HTML attachment was scrubbed... URL: From antoine.rozo at gmail.com Fri Aug 4 03:59:42 2017 From: antoine.rozo at gmail.com (Antoine Rozo) Date: Fri, 4 Aug 2017 09:59:42 +0200 Subject: [Python-ideas] Pseudo methods In-Reply-To: References: Message-ID: Hi, With this kind of feature, you never know which methods are included in the class (depending of which modules have been loaded). I don't think this is a good idea. 2017-08-04 9:39 GMT+02:00 Paul Laos : > Hi folks > I was thinking about how sometimes, a function sometimes acts on classes, > and > behaves very much like a method. Adding new methods to classes existing > classes > is currently somewhat difficult, and having pseudo methods would make that > easier. > > Code example: (The syntax can most likely be improved upon) > def has_vowels(self: str): > for vowel in ["a", "e,", "i", "o", "u"]: > if vowel in self: return True > > This allows one to wring `string.has_vowels()` instead of > `has_vowels(string)`, > which would make it easier to read, and would make it easier to add > functionality to existing classes, without having to extend them. This > would be > useful for builtins or imported libraries, so one can fill in "missing" > methods. 
> > * Simple way to extend classes > * Improves readability > * Easy to understand > > ~Paul > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > > -- Antoine Rozo -------------- next part -------------- An HTML attachment was scrubbed... URL: From p.f.moore at gmail.com Fri Aug 4 04:16:01 2017 From: p.f.moore at gmail.com (Paul Moore) Date: Fri, 4 Aug 2017 09:16:01 +0100 Subject: [Python-ideas] Pseudo methods In-Reply-To: References: Message-ID: On 4 August 2017 at 08:39, Paul Laos wrote: > Hi folks > I was thinking about how sometimes, a function sometimes acts on classes, > and behaves very much like a method. Adding new methods to classes existing > classes is currently somewhat difficult, and having pseudo methods would make that > easier. Adding new methods to classes is deliberately (somewhat) difficult, as it makes it harder to locate the definition of a method. If you need to see the code for a method, you'd expect to look in the class definition. Making it common for people to put method definitions outside the class definition harms supportability by breaking that assumption. > Code example: (The syntax can most likely be improved upon) > def has_vowels(self: str): > for vowel in ["a", "e,", "i", "o", "u"]: > if vowel in self: return True > > This allows one to wring `string.has_vowels()` instead of > `has_vowels(string)`, > which would make it easier to read, That's very much a subjective view. Personally, I don't see "string.has_vowels()" as being any easier to read - except in the sense that it tells me that I can find the definition of has_vowels in the class definition of str (and I can find its documentation in the documentation of the str type). And your proposal removes this advantage! > and would make it easier to add > functionality to existing classes, without having to extend them. This would > be useful for builtins or imported libraries, so one can fill in "missing" > methods. This is a common technique in other languages like Ruby, but is considered specialised and somewhat of an advanced technique (monkeypatching) in Python. As you say yourself, the syntax will make it *easier* to do this - it's already possible, so the change doesn't add any new capabilities. Adding new syntax to the language typically needs a much stronger justification (either in terms of enabling fundamentally new techniques, or providing a significantly more natural spelling of something that's widely used and acknowledged as a common programming idiom). Sorry, but I'm -1 on this change. It doesn't let people do anything they can't do now, on the contrary it makes it simpler to use a technique which has readability and supportability problems, which as a result will mean that people will be inclined to use the approach without properly considering the consequences. Paul From steve at pearwood.info Fri Aug 4 07:32:33 2017 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 4 Aug 2017 21:32:33 +1000 Subject: [Python-ideas] Pseudo methods In-Reply-To: References: Message-ID: <20170804113233.GS3149@ando.pearwood.info> Hi Paul, and welcome! On Fri, Aug 04, 2017 at 07:39:56AM +0000, Paul Laos wrote: > Hi folks > I was thinking about how sometimes, a function sometimes acts on classes, and > behaves very much like a method. I'm not really sure what you mean by "acts on classes". 
I can only think of a function which takes a class as a parameter, and modifies the class. Like a class decorator. Or possibly a classmethod. But that's not what you seem to mean below. So I'm not quite certain I understand your proposal. > Adding new methods to classes existing classes > is currently somewhat difficult, If the class is written in Python, it isn't difficult at all, it is trivially easy. First define your method: def method(self, arg): pass Then inject it onto the class using ordinary attribute assignment: TheClass.method = method And we're done! If the class is a built-in, or otherwise written in C, then "somewhat difficult" is an understatement. I think it can't be done at all. > and having pseudo methods would make that easier. I'm not sure that "easier" in this case would be better. > Code example: (The syntax can most likely be improved upon) > def has_vowels(self: str): > for vowel in ["a", "e,", "i", "o", "u"]: > if vowel in self: return True How does Python, and for that matter the human reader, know which class or classes that method is injected into? My guess is it looks at the annotation. But that's a big change: annotations are currently guaranteed to have no runtime semantics (apart from being stored in the function's __annotation__ attribute). I'm not saying that can't be done, but there may be consequences we haven't thought of. If we say dir(str), will "has_vowels" show up? How about vars(str)? How does this interact with metaclasses? > This allows one to wring `string.has_vowels()` instead of `has_vowels(string)`, > which would make it easier to read, Well that's one opinion. > and would make it easier to add > functionality to existing classes, without having to extend them. This would be > useful for builtins or imported libraries, so one can fill in "missing" methods. http://www.virtuouscode.com/2008/02/23/why-monkeypatching-is-destroying-ruby/ I think monkeypatching is great, so long as I'm the only one that does it. When other people do it, invariably they introduce bugs into my code by monkeypatching other things I didn't expect to be monkeypatched. > * Simple way to extend classes > * Improves readability > * Easy to understand I'll agree with the first one of those, if by "simple" you mean "somebody else did all the work to make this syntax do what I want it to do". The work behind the scenes is not likely to be simple: for starters, allowing monkeypatching of built-ins is likely going to require a rather big re-design of the Python interpreter. -- Steve From pobocks at gmail.com Fri Aug 4 09:00:15 2017 From: pobocks at gmail.com (David Mayo) Date: Fri, 4 Aug 2017 09:00:15 -0400 Subject: [Python-ideas] Collection type argument for argparse where nargs != None Message-ID: A friend of mine (@bcjbcjbcj on twitter) came up with an idea for an argparse improvement that I'd like to propose for inclusion. Currently, argparse with nargs= collects arguments into a list (or a list of lists in the case of action="append"). I would like to propose adding a "collection type" argument to the store and append actions and to add_argument, consisting of a callable that would be applied to the list of type-converted args before adding them to the Namespace. This would allow for alternate constructors (e.g. set), for modifying the list (e.g. with sorted), or to do checking of properties expected across all components of the argument at parse time. 
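For illustration, usage might look something like this (a sketch only -
"collection_type" is a placeholder for whatever keyword name we'd settle
on):

    import argparse

    parser = argparse.ArgumentParser()
    # proposed: gather the type-converted values into a set, not a list
    parser.add_argument("--tag", nargs="*", collection_type=set)
    # proposed: sort multi-value arguments at parse time
    parser.add_argument("--point", nargs=2, type=int, collection_type=sorted)

    args = parser.parse_args(["--tag", "a", "b", "a", "--point", "9", "3"])
    # args.tag == {"a", "b"}; args.point == [3, 9]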
I've worked up a set of examples in this gist: https://gist.github.com/ pobocks/bff0bea494f2b7ec7eba1e8ae281b888 And a rough implementation here: https://github.com/python/ cpython/compare/master...pobocks:argparse_colltype I think this would be genuinely useful, and would require very little change to argparse, which should be backwards compatible provided that the default for the collection type is list, or None with list specified if None. Thank you all for your time in considering this, - Dave Mayo @pobocks on twitter, github, various others -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsbueno at python.org.br Fri Aug 4 09:20:55 2017 From: jsbueno at python.org.br (Joao S. O. Bueno) Date: Fri, 4 Aug 2017 10:20:55 -0300 Subject: [Python-ideas] Pseudo methods In-Reply-To: <20170804113233.GS3149@ando.pearwood.info> References: <20170804113233.GS3149@ando.pearwood.info> Message-ID: Had not this been discussed here earlier this year? (And despite there being perceived dangers to readability in the long term, was accepted?) Here it is on an archive: https://mail.python.org/pipermail/python-ideas/2017-February/044551.html And anyway - along that discussion, despite dislikng the general idea, I got convinced that creating an outside method that makes "super" or "__class__" work was rather complicated. Maybe we could just have a decorator for that, that would properly create the __class__ cell? js -><- On 4 August 2017 at 08:32, Steven D'Aprano wrote: > Hi Paul, and welcome! > > > On Fri, Aug 04, 2017 at 07:39:56AM +0000, Paul Laos wrote: > > Hi folks > > I was thinking about how sometimes, a function sometimes acts on > classes, and > > behaves very much like a method. > > I'm not really sure what you mean by "acts on classes". I can only think > of a function which takes a class as a parameter, and modifies the > class. Like a class decorator. Or possibly a classmethod. But that's not > what you seem to mean below. So I'm not quite certain I understand your > proposal. > > > > Adding new methods to classes existing classes > > is currently somewhat difficult, > > If the class is written in Python, it isn't difficult at all, it is > trivially easy. First define your method: > > def method(self, arg): > pass > > > Then inject it onto the class using ordinary attribute assignment: > > TheClass.method = method > > And we're done! > > If the class is a built-in, or otherwise written in C, then "somewhat > difficult" is an understatement. I think it can't be done at all. > > > > and having pseudo methods would make that easier. > > I'm not sure that "easier" in this case would be better. > > > > Code example: (The syntax can most likely be improved upon) > > def has_vowels(self: str): > > for vowel in ["a", "e,", "i", "o", "u"]: > > if vowel in self: return True > > > How does Python, and for that matter the human reader, know which > class or classes that method is injected into? My guess is it looks at > the annotation. But that's a big change: annotations are currently > guaranteed to have no runtime semantics (apart from being stored in the > function's __annotation__ attribute). I'm not saying that can't be done, > but there may be consequences we haven't thought of. > > If we say dir(str), will "has_vowels" show up? > > How about vars(str)? > > How does this interact with metaclasses? > > > > > This allows one to wring `string.has_vowels()` instead of > `has_vowels(string)`, > > which would make it easier to read, > > Well that's one opinion. 
> > and would make it easier to add
> > functionality to existing classes, without having to extend them. This
> > would be useful for builtins or imported libraries, so one can fill in
> > "missing" methods.
>
> http://www.virtuouscode.com/2008/02/23/why-monkeypatching-is-destroying-ruby/
>
> I think monkeypatching is great, so long as I'm the only one that does
> it. When other people do it, invariably they introduce bugs into my code
> by monkeypatching other things I didn't expect to be monkeypatched.
>
> > * Simple way to extend classes
> > * Improves readability
> > * Easy to understand
>
> I'll agree with the first one of those, if by "simple" you mean
> "somebody else did all the work to make this syntax do
> what I want it to do".
>
> The work behind the scenes is not likely to be simple: for starters,
> allowing monkeypatching of built-ins is likely going to require a rather
> big re-design of the Python interpreter.
>
> --
> Steve
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From p.f.moore at gmail.com Fri Aug 4 09:31:48 2017
From: p.f.moore at gmail.com (Paul Moore)
Date: Fri, 4 Aug 2017 14:31:48 +0100
Subject: [Python-ideas] Pseudo methods
In-Reply-To:
References: <20170804113233.GS3149@ando.pearwood.info>
Message-ID:

On 4 August 2017 at 14:20, Joao S. O. Bueno wrote:
> Had not this been discussed here earlier this year?
>
> (And despite there being perceived dangers to readability in the long term,
> was accepted?)
>
> Here it is on an archive:
> https://mail.python.org/pipermail/python-ideas/2017-February/044551.html

From a very brief review of the end of that thread, it looks like it
was agreed that a PEP might be worthwhile - it was expected to be
rejected, though, and the PEP would simply document the discussion and
the fact that the idea was rejected. This agrees with my recollection
of the discussion, as well. But as far as I'm aware, no-one ever wrote
that PEP. (Not surprising, I guess, as it's hard to get enthusiastic
about proposing an idea you know in advance will be rejected).

Paul

From jsbueno at python.org.br Fri Aug 4 09:42:21 2017
From: jsbueno at python.org.br (Joao S. O. Bueno)
Date: Fri, 4 Aug 2017 10:42:21 -0300
Subject: [Python-ideas] Pseudo methods
In-Reply-To:
References: <20170804113233.GS3149@ando.pearwood.info>
Message-ID:

On 4 August 2017 at 10:31, Paul Moore wrote:
> On 4 August 2017 at 14:20, Joao S. O. Bueno wrote:
> > Had not this been discussed here earlier this year?
> >
> > (And despite there being perceived dangers to readability in the long
> > term, was accepted?)
> >
> > Here it is on an archive:
> > https://mail.python.org/pipermail/python-ideas/2017-February/044551.html
>
> From a very brief review of the end of that thread, it looks like it
> was agreed that a PEP might be worthwhile - it was expected to be
> rejected, though, and the PEP would simply document the discussion and
> the fact that the idea was rejected. This agrees with my recollection
> of the discussion, as well. But as far as I'm aware, no-one ever wrote
> that PEP. (Not surprising, I guess, as it's hard to get enthusiastic
> about proposing an idea you know in advance will be rejected).
>
Nonetheless, a third party module with some decorators to allow doing
that "the right way" might be useful.
If one is willing to write, or retrieve, a candidate for that.
(I don't think it is possible to inject the __class__ cell in a clean
way, though)

js
-><-

> Paul
> -------------- next part --------------
An HTML attachment was scrubbed...
URL:

From steve at pearwood.info Fri Aug 4 10:37:01 2017
From: steve at pearwood.info (Steven D'Aprano)
Date: Sat, 5 Aug 2017 00:37:01 +1000
Subject: [Python-ideas] Pseudo methods
In-Reply-To:
References: <20170804113233.GS3149@ando.pearwood.info>
Message-ID: <20170804143700.GU3149@ando.pearwood.info>

On Fri, Aug 04, 2017 at 10:20:55AM -0300, Joao S. O. Bueno wrote:
> Had not this been discussed here earlier this year?
>
> (And despite there being perceived dangers to readability in the long term,
> was accepted?)
>
> Here it is on an archive:
> https://mail.python.org/pipermail/python-ideas/2017-February/044551.html

I don't read this as the same proposal. For starters, I don't believe
that it was intended to allow monkey-patching of builtins. Another is
that the syntax is much more explicit about where the method is going:

def MyClass.method(self, arg): ...

is clearly a method of MyClass. There was, if I recall, some open
discussion of whether arbitrary assignment targets should be allowed:

def module.func(x or None)[23 + n].attr.__type__.method(self, arg): ...

or if we should intentionally limit the allowed syntax, like we do for
decorators. My vote is for intentionally limiting it to a single dotted
name, like MyClass.method.

> And anyway - along that discussion, despite dislikng the general idea, I
> got convinced that
> creating an outside method that makes "super" or "__class__" work was
> rather complicated.

Complicated is an understatement. It's horrid :-)

Here's the problem: we can successfully inject methods into a class:

# -----%<-----

class Parent:
    def spam(self):
        return "spam"

class Child(Parent):
    def food(self):
        return 'yummy ' + self.spam()

c = Child()
c.food()  # returns 'yummy spam' as expected

# inject a new method
def spam(self):
    return 'spam spam spam'

Child.spam = spam
c.food()  # returns 'yummy spam spam spam' as expected

# -----%<-----

But not if you use the zero-argument form of super():

# -----%<-----

del Child.spam  # revert to original

def spam(self):
    s = super().spam()
    return ' '.join([s]*3)

Child.spam = spam
c.food()

# -----%<-----

This raises:

RuntimeError: super(): __class__ cell not found

This is the simplest thing I've found that will fix it:

# -----%<-----

del Child.spam  # revert to original again

def outer():
    __class__ = Child
    def spam(self):
        s = super().spam()
        return ' '.join([s]*3)
    return spam

Child.spam = outer()
c.food()  # returns 'yummy spam spam spam' as expected

# -----%<-----

It's probably possible to wrap this up in a decorator that takes Child
as argument, but I expect it will probably require messing about with
the undocumented FunctionType constructor to build up a new closure
from the bits and pieces scavenged from the decorated function.

> Maybe we could just have a decorator for that, that would properly create
> the __class__ cell?

I expect it's possible. A challenge to somebody who wants to get their
hands dirty.
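In the meantime, here's an untested sketch of the easy way out: a
decorator that injects the method, but side-steps the missing cell
entirely by relying on the explicit two-argument form of super():

# -----%<-----

def inject(cls):
    """Bind the decorated function in as a method of cls."""
    def decorator(func):
        setattr(cls, func.__name__, func)
        return func
    return decorator

@inject(Child)
def spam(self):
    # the explicit two-argument super() needs no __class__ cell
    s = super(Child, self).spam()
    return ' '.join([s]*3)

# -----%<-----

That dodges the challenge rather than meeting it, of course.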
-- Steve From brett at python.org Fri Aug 4 12:33:49 2017 From: brett at python.org (Brett Cannon) Date: Fri, 04 Aug 2017 16:33:49 +0000 Subject: [Python-ideas] Collection type argument for argparse where nargs != None In-Reply-To: References: Message-ID: I'm not a heavy argparse user so take my opinion with a grain of salt (and I do appreciate the time you put into proposing this), but I'm not seeing the usefulness to classify this as so pragmatic as to outweigh adding one more thing to explain about argparse. Since you're proposing just having a callable to use after constructing the list couldn't you just do e.g. `args.stuff = frozenset(args.stuff)` instead and just be explicit about it? On Fri, Aug 4, 2017, 06:01 David Mayo, wrote: > A friend of mine (@bcjbcjbcj on twitter) came up with an idea for an > argparse improvement that I'd like to propose for inclusion. > > Currently, argparse with nargs= collects arguments into > a list (or a list of lists in the case of action="append"). I would like to > propose adding a "collection type" argument to the store and append actions > and to add_argument, consisting of a callable that would be applied to the > list of type-converted args before adding them to the Namespace. This would > allow for alternate constructors (e.g. set), for modifying the list (e.g. > with sorted), or to do checking of properties expected across all > components of the argument at parse time. > > I've worked up a set of examples in this gist: > https://gist.github.com/pobocks/bff0bea494f2b7ec7eba1e8ae281b888 > > And a rough implementation here: > https://github.com/python/cpython/compare/master...pobocks:argparse_colltype > > I think this would be genuinely useful, and would require very little > change to argparse, which should be backwards compatible provided that the > default for the collection type is list, or None with list specified if > None. > > Thank you all for your time in considering this, > > - Dave Mayo > @pobocks on twitter, github, various others > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pobocks at gmail.com Fri Aug 4 12:56:13 2017 From: pobocks at gmail.com (David Mayo) Date: Fri, 4 Aug 2017 12:56:13 -0400 Subject: [Python-ideas] Collection type argument for argparse where nargs != None In-Reply-To: References: Message-ID: I mean, it's definitely possible, but I'd argue that's actually not any more explicit - and, in fact, args.stuff = something(args.stuff) is arguably less explicit because it's just an arbitrary transform, rather than being called out as "this is the wrapper element for these args." The places where I see doing transforms after as substantially worse than this: 1. any case where a single parser is being used in multiple scripts, or being extended. Moving this kind of thing out of the parser means the logic has to be replicated outside the parser everywhere it's called. 2. validation of multiple arguments against each other - once you're out of the parser, you have to write separate error handling code instead of just throwing the right exception. 
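To make point 2 concrete, this is the sort of check I have in mind - a
callable that validates the parsed values against each other, which
could be handed straight to add_argument under the proposal (a sketch;
"collection_type" is again just a placeholder name):

    def ordered_bounds(values):
        low, high = values
        if low >= high:
            raise ValueError("LOW must be less than HIGH")
        return (low, high)

    # proposed: parser.add_argument("--bounds", nargs=2, type=int,
    #                               collection_type=ordered_bounds)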
- Dave On Fri, Aug 4, 2017 at 12:33 PM, Brett Cannon wrote: > I'm not a heavy argparse user so take my opinion with a grain of salt (and > I do appreciate the time you put into proposing this), but I'm not seeing > the usefulness to classify this as so pragmatic as to outweigh adding one > more thing to explain about argparse. Since you're proposing just having a > callable to use after constructing the list couldn't you just do e.g. > `args.stuff = frozenset(args.stuff)` instead and just be explicit about it? > > On Fri, Aug 4, 2017, 06:01 David Mayo, wrote: > >> A friend of mine (@bcjbcjbcj on twitter) came up with an idea for an >> argparse improvement that I'd like to propose for inclusion. >> >> Currently, argparse with nargs= collects arguments >> into a list (or a list of lists in the case of action="append"). I would >> like to propose adding a "collection type" argument to the store and append >> actions and to add_argument, consisting of a callable that would be applied >> to the list of type-converted args before adding them to the Namespace. >> This would allow for alternate constructors (e.g. set), for modifying the >> list (e.g. with sorted), or to do checking of properties expected across >> all components of the argument at parse time. >> >> I've worked up a set of examples in this gist: https://gist.github.com/ >> pobocks/bff0bea494f2b7ec7eba1e8ae281b888 >> >> And a rough implementation here: https://github.com/python/ >> cpython/compare/master...pobocks:argparse_colltype >> >> I think this would be genuinely useful, and would require very little >> change to argparse, which should be backwards compatible provided that the >> default for the collection type is list, or None with list specified if >> None. >> >> Thank you all for your time in considering this, >> >> - Dave Mayo >> @pobocks on twitter, github, various others >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> https://mail.python.org/mailman/listinfo/python-ideas >> Code of Conduct: http://python.org/psf/codeofconduct/ >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From barry at barrys-emacs.org Sat Aug 5 06:49:46 2017 From: barry at barrys-emacs.org (Barry) Date: Sat, 5 Aug 2017 11:49:46 +0100 Subject: [Python-ideas] HTTP compression support for http.server In-Reply-To: References: Message-ID: Does you code allow suporting more then gzip? For example Brotli compression is becoming inmportant for some web apps. Barry > On 24 Jul 2017, at 17:30, Chris Angelico wrote: > >> On Tue, Jul 25, 2017 at 2:20 AM, Chris Barker wrote: >> On Thu, Jul 20, 2017 at 12:15 AM, Pierre Quentel >> wrote: >>> - if so, should it be supported by default ? It is the case in the PR, >>> where a number of content types, eg text/html, are compressed if the user >>> agent accepts the gzip "encoding" >> >> >> I'm pretty wary of compression happening by default -- i.e. someone runs >> exactly the same code with a newer version of Python, and suddenly some >> content is getting compressed. > > FWIW I'm quite okay with that. HTTP already has a mechanism for > negotiating compression (Accept-Encoding), designed to be compatible > with servers that don't support it. Any time a server gains support > for something that clients already support, it's going to start > happening as soon as you upgrade. 
> > Obviously this kind of change won't be happening in a bugfix release > of Python, so it would be part of the regular checks when you upgrade > from 3.6 to 3.7 - it'll be in the NEWS file and so on, so you read up > on it before you upgrade. > > ChrisA > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > From turnbull.stephen.fw at u.tsukuba.ac.jp Mon Aug 7 02:36:58 2017 From: turnbull.stephen.fw at u.tsukuba.ac.jp (Stephen J. Turnbull) Date: Mon, 7 Aug 2017 15:36:58 +0900 Subject: [Python-ideas] Collection type argument for argparse where nargs != None In-Reply-To: References: Message-ID: <22920.2698.483865.970120@turnbull.sk.tsukuba.ac.jp> David Mayo writes: > I mean, it's definitely possible, but I'd argue that's actually not any > more explicit - and, in fact, args.stuff = something(args.stuff) is > arguably less explicit because it's just an arbitrary transform, rather > than being called out as "this is the wrapper element for these > args." The problem is third parties trying to read and work with the code, who now have to go read not only the definition of the parser, but the definition of the wrapper element (which is an arbitrary transform with a specified role). I think argparse is complex enough already. For both of your use cases (argparsers as reusable components and validation of collection arguments), I don't see why they can't be done in a subclass. This argument doesn't kill your proposal, and I'm just one rando, but FWIW I'm -0.5 on it for now. Steve From pierre.quentel at gmail.com Mon Aug 7 04:19:30 2017 From: pierre.quentel at gmail.com (Pierre Quentel) Date: Mon, 7 Aug 2017 10:19:30 +0200 Subject: [Python-ideas] HTTP compression support for http.server In-Reply-To: References: Message-ID: 2017-08-05 12:49 GMT+02:00 Barry : > Does you code allow suporting more then gzip? For example Brotli > compression is becoming inmportant for some web apps. > > Barry > In the latest version of the Pull Request, only gzip is supported. But your comment makes me think that the code should probably be more modular so that subclasses of SimpleHTTPRequestHandler could handle other algorithms. -------------- next part -------------- An HTML attachment was scrubbed... URL: From victor.stinner at gmail.com Mon Aug 7 04:48:45 2017 From: victor.stinner at gmail.com (Victor Stinner) Date: Mon, 7 Aug 2017 10:48:45 +0200 Subject: [Python-ideas] Pseudo methods In-Reply-To: References: Message-ID: Ruby provides this feature. A friend who is a long term user of Rails complained that Rails abuses this and it's a mess in practice. So I dislike this idea. Victor 2017-08-04 9:39 GMT+02:00 Paul Laos : > Hi folks > I was thinking about how sometimes, a function sometimes acts on classes, > and > behaves very much like a method. Adding new methods to classes existing > classes > is currently somewhat difficult, and having pseudo methods would make that > easier. > > Code example: (The syntax can most likely be improved upon) > def has_vowels(self: str): > for vowel in ["a", "e,", "i", "o", "u"]: > if vowel in self: return True > > This allows one to wring `string.has_vowels()` instead of > `has_vowels(string)`, > which would make it easier to read, and would make it easier to add > functionality to existing classes, without having to extend them. 
This would > be > useful for builtins or imported libraries, so one can fill in "missing" > methods. > > * Simple way to extend classes > * Improves readability > * Easy to understand > > ~Paul > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > From fakedme+py at gmail.com Mon Aug 7 15:30:05 2017 From: fakedme+py at gmail.com (Soni L.) Date: Mon, 7 Aug 2017 16:30:05 -0300 Subject: [Python-ideas] Generator syntax hooks? Message-ID: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> The generator syntax, (x for x in i if c), currently always creates a new generator. I find this quite inefficient: {x for x in integers if 1000 <= x < 1000000} # never completes, because it's trying to iterate over all integers What if, somehow, object `integers` could hook the generator and produce the equivalent of {x for x in range(1000, 1000000)}, which does complete? What if, (x for x in integers if 1000 <= x < 1000000), was syntax sugar for (x for x in range(1000, 1000000))? (I like mathy syntax. Do you like mathy syntax?) From rosuav at gmail.com Mon Aug 7 16:14:52 2017 From: rosuav at gmail.com (Chris Angelico) Date: Tue, 8 Aug 2017 06:14:52 +1000 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> Message-ID: On Tue, Aug 8, 2017 at 5:30 AM, Soni L. wrote: > The generator syntax, (x for x in i if c), currently always creates a new > generator. I find this quite inefficient: > > {x for x in integers if 1000 <= x < 1000000} # never completes, because it's > trying to iterate over all integers > > What if, somehow, object `integers` could hook the generator and produce the > equivalent of {x for x in range(1000, 1000000)}, which does complete? > > What if, (x for x in integers if 1000 <= x < 1000000), was syntax sugar for > (x for x in range(1000, 1000000))? > > (I like mathy syntax. Do you like mathy syntax?) I don't. I prefer to stick with the syntax we already have. The alternative is a more verbose way to identify a range, plus you need a new global "integers" which implies that you could iterate over "reals" the same way (after all, mathematics doesn't mind you working with a subset of reals the same way you'd work with a subset of ints). And good luck iterating over all the reals. :) ChrisA From chris.barker at noaa.gov Mon Aug 7 19:06:32 2017 From: chris.barker at noaa.gov (Chris Barker) Date: Mon, 7 Aug 2017 19:06:32 -0400 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> Message-ID: On Mon, Aug 7, 2017 at 4:14 PM, Chris Angelico wrote: > On Tue, Aug 8, 2017 at 5:30 AM, Soni L. wrote: > > The generator syntax, (x for x in i if c), currently always creates a new > > generator. that's what it's for -- I'm confused as to what the problem is. > > {x for x in integers if 1000 <= x < 1000000} # never completes, because > it's > > trying to iterate over all integers > this is a set comprehension -- but what is "integers"? is it a generator? in which case, it should take an argument so it knows when to end. Or if it's really that symple, that's what range() is for. However, similarly, I find that sometimes I want to iterate over a slice of a sequence, but do'nt want to actually make the slice first. 
So there is itertools.islice() If "integers" is a sequence: {x for x in integers[1000:10000]} makes an unneeded copy of that slice. {x for x in itertools.islice(integers, 1000, 10000)} will iterate on the fly, and not make any extra copies. It would be nice to have an easier access to an "slice iterator" though -- one of these days I may write up a proposal for that. -CHB -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve at pearwood.info Mon Aug 7 19:35:37 2017 From: steve at pearwood.info (Steven D'Aprano) Date: Tue, 8 Aug 2017 09:35:37 +1000 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> Message-ID: <20170807233537.GC3149@ando.pearwood.info> Hi Soni, and welcome! On Mon, Aug 07, 2017 at 04:30:05PM -0300, Soni L. wrote: > What if, (x for x in integers if 1000 <= x < 1000000), was syntax sugar > for (x for x in range(1000, 1000000))? If you want the integers from 1000 to 1000000, use: range(1000, 1000000) Don't waste your time slowing down the code with an unnecessary and pointless wrapper that does nothing but pass every value on unchanged: (x for x in range(1000, 1000000)) # waste of time and effort -- Steve From fakedme+py at gmail.com Mon Aug 7 19:56:20 2017 From: fakedme+py at gmail.com (Soni L.) Date: Mon, 7 Aug 2017 20:56:20 -0300 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: <20170807233537.GC3149@ando.pearwood.info> References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> <20170807233537.GC3149@ando.pearwood.info> Message-ID: <9e9978d7-1910-4453-e085-8f419fbc2dda@gmail.com> On 2017-08-07 08:35 PM, Steven D'Aprano wrote: > Hi Soni, and welcome! > > On Mon, Aug 07, 2017 at 04:30:05PM -0300, Soni L. wrote: > >> What if, (x for x in integers if 1000 <= x < 1000000), was syntax sugar >> for (x for x in range(1000, 1000000))? > If you want the integers from 1000 to 1000000, use: > > range(1000, 1000000) > > Don't waste your time slowing down the code with an unnecessary and > pointless wrapper that does nothing but pass every value on unchanged: > > (x for x in range(1000, 1000000)) # waste of time and effort > > > Actually, those have different semantics! >>> x = range(1, 10) >>> list(x) [1, 2, 3, 4, 5, 6, 7, 8, 9] >>> list(x) [1, 2, 3, 4, 5, 6, 7, 8, 9] >>> x = (x for x in range(1, 10)) >>> list(x) [1, 2, 3, 4, 5, 6, 7, 8, 9] >>> list(x) [] From stefan_ml at behnel.de Tue Aug 8 03:48:17 2017 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 8 Aug 2017 09:48:17 +0200 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: <9e9978d7-1910-4453-e085-8f419fbc2dda@gmail.com> References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> <20170807233537.GC3149@ando.pearwood.info> <9e9978d7-1910-4453-e085-8f419fbc2dda@gmail.com> Message-ID: Soni L. schrieb am 08.08.2017 um 01:56: > On 2017-08-07 08:35 PM, Steven D'Aprano wrote: >> Hi Soni, and welcome! >> >> On Mon, Aug 07, 2017 at 04:30:05PM -0300, Soni L. wrote: >> >>> What if, (x for x in integers if 1000 <= x < 1000000), was syntax sugar >>> for (x for x in range(1000, 1000000))? 
>> If you want the integers from 1000 to 1000000, use:
>>
>> range(1000, 1000000)
>>
>> Don't waste your time slowing down the code with an unnecessary and
>> pointless wrapper that does nothing but pass every value on unchanged:
>>
>> (x for x in range(1000, 1000000)) # waste of time and effort
>
> Actually, those have different semantics!
>
>>>> x = range(1, 10)
>>>> list(x)
> [1, 2, 3, 4, 5, 6, 7, 8, 9]
>>>> list(x)
> [1, 2, 3, 4, 5, 6, 7, 8, 9]
>
>>>> x = (x for x in range(1, 10))
>>>> list(x)
> [1, 2, 3, 4, 5, 6, 7, 8, 9]
>>>> list(x)
> []

In that case, use iter(range(1000, 1000000)).

range() creates an iterable, which is iterable more than once.
iter(range()) creates an iterator from that iterable, which has the
semantics that you apparently wanted.

Stefan

From turnbull.stephen.fw at u.tsukuba.ac.jp Tue Aug 8 14:45:31 2017
From: turnbull.stephen.fw at u.tsukuba.ac.jp (Stephen J. Turnbull)
Date: Wed, 9 Aug 2017 03:45:31 +0900
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: <9e9978d7-1910-4453-e085-8f419fbc2dda@gmail.com>
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
	<20170807233537.GC3149@ando.pearwood.info>
	<9e9978d7-1910-4453-e085-8f419fbc2dda@gmail.com>
Message-ID: <22922.1739.304051.790882@turnbull.sk.tsukuba.ac.jp>

>>>>> Soni L. writes:

> Steven d'Aprano writes:
> > range(1000, 1000000)
> > (x for x in range(1000, 1000000)) # waste of time and effort

> Actually, those have different semantics!

That's not really important. As Stefan Behnel points out, it's simple
(and efficient) to get iterator semantics by using iter().

The big issue here is that Python is not the kind of declarative
language where (x for x in int if 1_000 ≤ x ≤ 1_000_000)[1] is natural
to write, let alone easy to implement efficiently. Aside from the
problem of (x for x in float if 1_000 ≤ x ≤ 1_000_000) (where the answer
is "just don't do that"), I can't think of any unbounded collections in
Python that aren't iterables, except some types. That makes Steven's
criticism pretty compelling.

If you need to design a collection's __iter__ specially to allow it to
decide whether the subset that satisfies some condition is exhausted,
why not just subclass some appropriate existing collection with a more
appropriate __iter__?

Footnotes:
[1] See what I did there? ;-)

From ncoghlan at gmail.com Wed Aug 9 00:18:58 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 9 Aug 2017 14:18:58 +1000
Subject: [Python-ideas] Pseudo methods
In-Reply-To:
References:
Message-ID:

On 7 August 2017 at 18:48, Victor Stinner wrote:
> Ruby provides this feature. A friend who is a long term user of Rails
> complained that Rails abuses this and it's a mess in practice. So I
> dislike this idea.

Right, Python's opinionated design guidance is to clearly distinguish
between "data first" designs using methods on objects and "algorithm
first" designs using functools.singledispatch (or similar mechanisms),
since they place different constraints on how new implementations are
added, and where you should look for more information about how an
algorithm works.

Part of the intent behind this guidance is to better enable local
reasoning about a piece of code:

    from my_string_utils import has_vowels

    if has_vowels(input("Enter a word: ")):
        print("Contains vowels!")
    else:
        print("Does not contain vowels!")

Here, it is clear that if we want to know more about what "has_vowels"
does, or if we want to request changes to how it works, then
"my_string_utils" is where we need to go next.
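The "algorithm first" spelling preserves that property, since the
generic function is still just an ordinary name imported from an
ordinary module (a quick functools.singledispatch sketch):

    from functools import singledispatch

    @singledispatch
    def describe(obj):
        return "something of type " + type(obj).__name__

    @describe.register(str)
    def _(text):
        return "a string of length " + str(len(text))

Readers tracing the behaviour of describe() still end up at the module
that defines it.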
By contrast, that's significantly less clear if our string utils module were to implicitly modify the behaviour of input() or builtin strings: import my_string_utils if input("Enter a word: ").has_vowels(): print("Contains vowels!") else: print("Does not contain vowels!") To analyse and investigate this code, we need to "just know" that: - the result of "input()" doesn't normally have a "has_vowels()" method - therefore, importing "my_string_utils" must have either replaced the input builtin or mutated the str type - therefore, "my_string_utils" is probably the place to go for more information on "has_vowels" If our import line had instead looked like "import my_string_utils, my_other_utils", we'd have to go look at both of them to figure out where the "has_vowels()" method might be coming from (and hope it wasn't happening further down as a side effect of one of the modules *they* imported). Injecting methods rather than writing functions that dispatch on the type of their first argument also creates new opportunities for naming conflicts: while "my_string_utils.has_vowels" and "your_string_utils.has_vowels" can happily coexist in the same program without conflicts, there's only one "input" builtin, and only one "str" builtin. Can this level of explicitness be an obstacle at times? Yes, it can, especially for testing and interactive use, which is why Python offers features like wildcard imports, runtime support for monkeypatching of user-defined types, and runtime support for dynamically replacing builtins and module globals. However, the concerns around the difficulties of complexity management in the face of implicit action at a distance remain valid, so those features all fall into the category of "supported, but not encouraged, except in specific circumstances". Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ncoghlan at gmail.com Wed Aug 9 01:06:54 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 9 Aug 2017 15:06:54 +1000 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> Message-ID: On 8 August 2017 at 09:06, Chris Barker wrote: > It would be nice to have an easier access to an "slice iterator" though -- > one of these days I may write up a proposal for that. An idea I've occasionally toyed with [1] is some kind of "iterview" that wraps around an arbitrary iterable and produces lazy itertools based results rather than immediate views or copies. However, my experience is also that folks are *really* accustomed to syntactic operations on containers producing either full live views (e.g. memoryview or numpy slices, range as a dynamically computed container), or actual copies (builtin container types). Having them produce consumable iterators instead then gets confusing due to the number of operations that will implicitly consume them (including simple "x in y" checks). The OP's proposal doesn't fit into that category though: rather it's asking about the case where we have an infinite iterator (e.g. itertools.count(0)), and want to drop items until they start meeting some condition (i.e. itertools.dropwhile) and then terminate the iterator as soon as another condition is no longer met (i.e. itertools.takewhile). 
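Spelled out with the existing itertools building blocks, that
combination looks like this (a sketch):

    from itertools import count, dropwhile, takewhile

    candidates = dropwhile(lambda x: x < 1000, count(0))     # skip values below 1000
    in_range = takewhile(lambda x: x < 1000000, candidates)  # stop at the first x >= 1000000
    wanted = set(in_range)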
Right now, getting the "terminate when false" behaviour requires the
use of takewhile:

    set(itertools.takewhile(lambda x: x < 1000000, itertools.count(1000)))

In these cases, the standard generator expression syntax is an
attractive nuisance because it *looks* right from a mathematical
perspective, but hides an infinite loop:

    {x for x in itertools.count(0) if 1000 <= x < 1000000}

The most credible proposal to address this that I've seen is to borrow
the "while" keyword in its "if not x: break" interpretation to get:

    {x for x in itertools.count(0) if 1000 <= x while x < 1000000}

which would be compiled as equivalent to:

    result = set()
    for x in itertools.count(0):
        if 1000 <= x:
            result.add(x)
        if not x < 1000000:
            break

(and similarly for all of the other comprehension variants)

There aren't any technical barriers I'm aware of to implementing that,
with the main historical objection being that instead of the
comprehension level while clause mapping to a while loop directly the
way the for and if clauses map to their statement level counterparts,
it would instead map to the conditional break in the expanded
loop-and-a-half form:

    while True:
        if not condition:
            break

While it's taken me a long time to come around to the idea, "Make
subtle infinite loops in mathematical code easier to avoid" *is* a
pretty compelling user-focused justification for incurring that extra
complexity at the language design level.

Cheers,
Nick.

[1] https://mail.python.org/pipermail/python-ideas/2010-April/006983.html

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From guido at python.org Wed Aug 9 01:38:22 2017
From: guido at python.org (Guido van Rossum)
Date: Tue, 8 Aug 2017 22:38:22 -0700
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 
Message-ID: 

On Tue, Aug 8, 2017 at 10:06 PM, Nick Coghlan wrote:

> On 8 August 2017 at 09:06, Chris Barker wrote:
> > It would be nice to have an easier access to an "slice iterator" though
> --
> > one of these days I may write up a proposal for that.
>
> An idea I've occasionally toyed with [1] is some kind of "iterview"
> that wraps around an arbitrary iterable and produces lazy itertools
> based results rather than immediate views or copies.
>
> However, my experience is also that folks are *really* accustomed to
> syntactic operations on containers producing either full live views
> (e.g. memoryview or numpy slices, range as a dynamically computed
> container), or actual copies (builtin container types). Having them
> produce consumable iterators instead then gets confusing due to the
> number of operations that will implicitly consume them (including
> simple "x in y" checks).
>
> The OP's proposal doesn't fit into that category though: rather it's
> asking about the case where we have an infinite iterator (e.g.
> itertools.count(0)), and want to drop items until they start meeting
> some condition (i.e. itertools.dropwhile) and then terminate the
> iterator as soon as another condition is no longer met (i.e.
> itertools.takewhile).

I don't think that's what the OP meant. The original proposal seemed to
assume that it would be somehow reasonable for the input ("integers" in the
example) to be able to see and parse the condition in the generator
expression ("1000 <= x < 100000" in the example, with "x" somehow known to
be bound to the iteration value). That's at least what I think the remark "I
like mathy syntax" referred to.
> Right now, getting the "terminate when false" behaviour requires the
> use of takewhile:
>
>     set(itertools.takewhile(lambda x: x < 1000000, itertools.count(1000)))
>
> In these cases, the standard generator expression syntax is an
> attractive nuisance because it *looks* right from a mathematical
> perspective, but hides an infinite loop:
>
>     {x for x in itertools.count(0) if 1000 <= x < 1000000}
>
> The most credible proposal to address this that I've seen is to borrow
> the "while" keyword in its "if not x: break" interpretation to get:
>
>     {x for x in itertools.count(0) if 1000 <= x while x < 1000000}
>
> which would be compiled as equivalent to:
>
>     result = set()
>     for x in itertools.count(0):
>         if 1000 <= x:
>             result.add(x)
>         if not x < 1000000:
>             break
>
> (and similarly for all of the other comprehension variants)
>
> There aren't any technical barriers I'm aware of to implementing that,
> with the main historical objection being that instead of the
> comprehension level while clause mapping to a while loop directly the
> way the for and if clauses map to their statement level counterparts,
> it would instead map to the conditional break in the expanded
> loop-and-a-half form:
>
>     while True:
>         if not condition:
>             break
>
> While it's taken me a long time to come around to the idea, "Make
> subtle infinite loops in mathematical code easier to avoid" *is* a
> pretty compelling user-focused justification for incurring that extra
> complexity at the language design level.
>

I haven't come around to this yet. It looks like it will make explaining
comprehensions more complex, since the translation of "while X" into "if not
X: break" feels less direct than the translations of "for x in xs" or "if
pred(x)". (In particular, your proposal seems to require more experience
with mentally translating loops and conditions into jumps -- most regulars
of this forum do that for a living, but I doubt it's second nature for the
OP.)

--
--Guido van Rossum (python.org/~guido)

From tarek at ziade.org Wed Aug 9 03:56:20 2017
From: tarek at ziade.org (=?utf-8?Q?Tarek=20Ziad=C3=A9?=)
Date: Wed, 09 Aug 2017 09:56:20 +0200
Subject: [Python-ideas] Argparse argument deprecation
Message-ID: <1502265380.2026839.1067738128.60EEB727@webmail.messagingengine.com>

Hey,

I don't think there's any helper to deprecate an argument in argparse.

Let's say you have a --foo option in your CLI and want to deprecate it
in the next release before you completely remove it later.

My first thought on how to do this is by adding a new "deprecated" option to
https://docs.python.org/3/library/argparse.html#argparse.ArgumentParser.add_argument

"deprecated" would be a callable that is called after the argument has
been parsed by argparse,
so the developer can decide if they want to issue a deprecation warning,
use the parsed value or override it etc.

Another interesting approach suggested by Doug Hellman, which I like as
much, is a set of higher level options that
provide a deprecation workflow for arguments, see

https://github.com/openstack/oslo.config/blob/master/oslo_config/cfg.py#L441

What do you think?

Cheers
Tarek

--
Tarek Ziadé | coding: https://ziade.org | running: https://foule.es |
twitter: @tarek_ziade

From turnbull.stephen.fw at u.tsukuba.ac.jp Wed Aug 9 04:18:24 2017
From: turnbull.stephen.fw at u.tsukuba.ac.jp (Stephen J. Turnbull)
Date: Wed, 9 Aug 2017 17:45:31 +0900
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 
Message-ID: <22922.50512.758357.265110@turnbull.sk.tsukuba.ac.jp>

Nick Coghlan writes:

 > Right now, getting the "terminate when false" behaviour requires the
 > use of takewhile:
 >
 >     set(itertools.takewhile(lambda x: x < 1000000, itertools.count(1000)))

My objection to this interpretation is different from Guido's (I
think): if you're really thinking in terms of math, sets are
*unordered*, and therefore "takewhile" doesn't guarantee exhaustion of
the desired subset. Another way to put this is that in order to make
it harder to get bit by subtle infloops, you're going to give more
teeth to "Miller time came early"[1] bugs.

This may be a bigger issue than some may think, because sets and dicts
are iterable, and order of iteration is arbitrary (at best
history-dependent).

Footnotes:
[1] American beer commercial claiming that real men go to drink beer
after a full day's work.

From turnbull.stephen.fw at u.tsukuba.ac.jp Wed Aug 9 04:19:41 2017
From: turnbull.stephen.fw at u.tsukuba.ac.jp (Stephen J. Turnbull)
Date: Wed, 9 Aug 2017 17:19:41 +0900
Subject: [Python-ideas] Pseudo methods
In-Reply-To: 
References: 
Message-ID: <22922.50589.497352.85140@turnbull.sk.tsukuba.ac.jp>

Nick Coghlan writes:

 > To analyse and investigate this code, we need to "just know" that:

You can of course hope that help(input().has_vowels) will tell you
where to find it. If it doesn't, well, shame on you for depending on
source-unavailable software that you don't understand. ;-)

I'm with you on implementing this feature; I don't like it. But I
don't think the discoverability situation is as dire as you suggest.

From desmoulinmichel at gmail.com Wed Aug 9 05:23:45 2017
From: desmoulinmichel at gmail.com (Michel Desmoulin)
Date: Wed, 9 Aug 2017 11:23:45 +0200
Subject: [Python-ideas] Argparse argument deprecation
In-Reply-To: <1502265380.2026839.1067738128.60EEB727@webmail.messagingengine.com>
References: <1502265380.2026839.1067738128.60EEB727@webmail.messagingengine.com>
Message-ID: <4972cbcc-0972-4d6f-4b80-4e66a448e76a@gmail.com>

+1, but I would make "deprecated" either a warning, an exception or a
callable.

This way, to create a simple deprecation, you just provide
DeprecationWarning('This will be gone in the next release'), or
ValueError('This has been removed in 2.X, use "stuff" instead') if you
decide it's gone for good.

But if you need a custom behavior, you pass in a callable.

On 09/08/2017 at 09:56, Tarek Ziadé wrote:
> Hey,
>
> I don't think there's any helper to deprecate an argument in argparse.
>
> Let's say you have a --foo option in your CLI and want to deprecate it
> in the next release before you completely remove it later.
>
> My first thought on how to do this is by adding a new "deprecated" option to
> https://docs.python.org/3/library/argparse.html#argparse.ArgumentParser.add_argument
>
> "deprecated" would be a callable that is called after the argument has
> been parsed by argparse,
> so the developer can decide if they want to issue a deprecation warning,
> use the parsed value or override it etc.
>
> Another interesting approach suggested by Doug Hellman, which I like as
> much, is a set of higher level options that
> provide a deprecation workflow for arguments, see
>
> https://github.com/openstack/oslo.config/blob/master/oslo_config/cfg.py#L441
>
> What do you think?
>
> Cheers
> Tarek
>

From ned at nedbatchelder.com Wed Aug 9 05:50:47 2017
From: ned at nedbatchelder.com (Ned Batchelder)
Date: Wed, 9 Aug 2017 05:50:47 -0400
Subject: [Python-ideas] Argparse argument deprecation
In-Reply-To: <1502265380.2026839.1067738128.60EEB727@webmail.messagingengine.com>
References: <1502265380.2026839.1067738128.60EEB727@webmail.messagingengine.com>
Message-ID: 

On 8/9/17 3:56 AM, Tarek Ziadé wrote:
> Hey,
>
> I don't think there's any helper to deprecate an argument in argparse.
>
> Let's say you have a --foo option in your CLI and want to deprecate it
> in the next release before you completely remove it later.
>
> My first thought on how to do this is by adding a new "deprecated" option to
> https://docs.python.org/3/library/argparse.html#argparse.ArgumentParser.add_argument
>
> "deprecated" would be a callable that is called after the argument has
> been parsed by argparse,
> so the developer can decide if they want to issue a deprecation warning,
> use the parsed value or override it etc.

I don't see why this is something that argparse has to do. The
semantics of options is handled by the rest of the program. Why would
the parser be issuing these warnings? Let argparse parse the options,
then let other code deal with what they *mean*.

--Ned.

From desmoulinmichel at gmail.com Wed Aug 9 05:54:23 2017
From: desmoulinmichel at gmail.com (Michel Desmoulin)
Date: Wed, 9 Aug 2017 11:54:23 +0200
Subject: [Python-ideas] Argparse argument deprecation
In-Reply-To: 
References: <1502265380.2026839.1067738128.60EEB727@webmail.messagingengine.com>
 
Message-ID: <14d3d469-319a-7e70-a8bf-054de57197bc@gmail.com>

Argparse is not just about parsing, it's about providing convenient
tooling associated with parsing.

Otherwise you would not have an automatically generated "usage" message
or a "--help" command.

Following your definition, those are not parsing. But they are here,
because we all end up coding them anyway.

On 09/08/2017 at 11:50, Ned Batchelder wrote:
> On 8/9/17 3:56 AM, Tarek Ziadé wrote:
>> Hey,
>>
>> I don't think there's any helper to deprecate an argument in argparse.
>>
>> Let's say you have a --foo option in your CLI and want to deprecate it
>> in the next release before you completely remove it later.
>>
>> My first thought on how to do this is by adding a new "deprecated" option to
>> https://docs.python.org/3/library/argparse.html#argparse.ArgumentParser.add_argument
>>
>> "deprecated" would be a callable that is called after the argument has
>> been parsed by argparse,
>> so the developer can decide if they want to issue a deprecation warning,
>> use the parsed value or override it etc.
>
> I don't see why this is something that argparse has to do. The
> semantics of options is handled by the rest of the program. Why would
> the parser be issuing these warnings? Let argparse parse the options,
> then let other code deal with what they *mean*.
>
> --Ned.
From ned at nedbatchelder.com Wed Aug 9 06:59:18 2017
From: ned at nedbatchelder.com (Ned Batchelder)
Date: Wed, 9 Aug 2017 06:59:18 -0400
Subject: [Python-ideas] Argparse argument deprecation
In-Reply-To: <14d3d469-319a-7e70-a8bf-054de57197bc@gmail.com>
References: <1502265380.2026839.1067738128.60EEB727@webmail.messagingengine.com>
 <14d3d469-319a-7e70-a8bf-054de57197bc@gmail.com>
Message-ID: <491f4b14-8dbd-804e-c1e5-970640df954b@nedbatchelder.com>

OK, then on a more pragmatic note: why is it easier to write a callback
than to write a simple if statement after the parsing? Generating help
is complex, and a common task that is closely tied to the syntax of the
options, so it makes sense for argparse to do it. Deprecation is
neither complex, common, nor closely tied to the syntax of the options.

Another note about the proposal: calling it "deprecated" seems odd,
since the proposal is really just a general-purpose callback. argparse
isn't generating the warning, your callback function would be doing it.
Why name it "deprecated"? How is this different than the "action"
keyword argument that argparse already provides?

--Ned.

On 8/9/17 5:54 AM, Michel Desmoulin wrote:
> Argparse is not just about parsing, it's about providing convenient
> tooling associated with parsing.
>
> Otherwise you would not have an automatically generated "usage" message
> or a "--help" command.
>
> Following your definition, those are not parsing. But they are here,
> because we all end up coding them anyway.
>
> On 09/08/2017 at 11:50, Ned Batchelder wrote:
>> On 8/9/17 3:56 AM, Tarek Ziadé wrote:
>>> Hey,
>>>
>>> I don't think there's any helper to deprecate an argument in argparse.
>>>
>>> Let's say you have a --foo option in your CLI and want to deprecate it
>>> in the next release before you completely remove it later.
>>>
>>> My first thought on how to do this is by adding a new "deprecated" option to
>>> https://docs.python.org/3/library/argparse.html#argparse.ArgumentParser.add_argument
>>>
>>> "deprecated" would be a callable that is called after the argument has
>>> been parsed by argparse,
>>> so the developer can decide if they want to issue a deprecation warning,
>>> use the parsed value or override it etc.
>> I don't see why this is something that argparse has to do. The
>> semantics of options is handled by the rest of the program. Why would
>> the parser be issuing these warnings? Let argparse parse the options,
>> then let other code deal with what they *mean*.
>>
>> --Ned.
From desmoulinmichel at gmail.com Wed Aug 9 07:16:53 2017
From: desmoulinmichel at gmail.com (Michel Desmoulin)
Date: Wed, 9 Aug 2017 13:16:53 +0200
Subject: [Python-ideas] Argparse argument deprecation
In-Reply-To: <491f4b14-8dbd-804e-c1e5-970640df954b@nedbatchelder.com>
References: <1502265380.2026839.1067738128.60EEB727@webmail.messagingengine.com>
 <14d3d469-319a-7e70-a8bf-054de57197bc@gmail.com>
 <491f4b14-8dbd-804e-c1e5-970640df954b@nedbatchelder.com>
Message-ID: 

On 09/08/2017 at 12:59, Ned Batchelder wrote:
> OK, then on a more pragmatic note: why is it easier to write a callback
> than to write a simple if statement after the parsing? Generating help
> is complex, and a common task that is closely tied to the syntax of the
> options, so it makes sense for argparse to do it. Deprecation is
> neither complex, common, nor closely tied to the syntax of the options.
>
> Another note about the proposal: calling it "deprecated" seems odd,
> since the proposal is really just a general-purpose callback. argparse
> isn't generating the warning, your callback function would be doing it.
> Why name it "deprecated"? How is this different than the "action"
> keyword argument that argparse already provides?

I imagine something like:

    def deprecation_callback(warn, forbid):
        warn('This is deprecated')
        # or call forbid() instead to turn the option into an error

    parser.add_argument('--foo', on_deprecated=deprecation_callback)

This does:

- provide an easy way to warn, or transition to forbid
- allow introspection to list the deprecated options
- deprecated options can be marked as such in the generated --help
- create a complex dynamic deprecation message, or just pass a short
lambda

But indeed I'd like it to be able to do:

    parser.add_argument('--foo', on_deprecated=DeprecationWarning('meh'))
    parser.add_argument('--foo', on_deprecated=ValueError('meh'))

As a shortcut for simple use cases. I still don't know how to make the
distinction between deprecated and removed from the introspection point
of view.

All in all, I think it's an interesting proposal, but I'm not going to
fight over it. If it never happens, I can fit in a bunch of "if"s like
you said.

>
> --Ned.
>
>
> On 8/9/17 5:54 AM, Michel Desmoulin wrote:
>> Argparse is not just about parsing, it's about providing convenient
>> tooling associated with parsing.
>>
>> Otherwise you would not have an automatically generated "usage" message
>> or a "--help" command.
>>
>> Following your definition, those are not parsing. But they are here,
>> because we all end up coding them anyway.
>>
>> On 09/08/2017 at 11:50, Ned Batchelder wrote:
>>> On 8/9/17 3:56 AM, Tarek Ziadé wrote:
>>>> Hey,
>>>>
>>>> I don't think there's any helper to deprecate an argument in argparse.
>>>>
>>>> Let's say you have a --foo option in your CLI and want to deprecate it
>>>> in the next release before you completely remove it later.
>>>>
>>>> My first thought on how to do this is by adding a new "deprecated" option to
>>>> https://docs.python.org/3/library/argparse.html#argparse.ArgumentParser.add_argument
>>>>
>>>> "deprecated" would be a callable that is called after the argument has
>>>> been parsed by argparse,
>>>> so the developer can decide if they want to issue a deprecation warning,
>>>> use the parsed value or override it etc.
>>> I don't see why this is something that argparse has to do. The
>>> semantics of options is handled by the rest of the program. Why would
>>> the parser be issuing these warnings? Let argparse parse the options,
>>> then let other code deal with what they *mean*.
>>>
>>> --Ned.

From tarek at ziade.org Wed Aug 9 07:38:25 2017
From: tarek at ziade.org (=?utf-8?Q?Tarek=20Ziad=C3=A9?=)
Date: Wed, 09 Aug 2017 13:38:25 +0200
Subject: [Python-ideas] Argparse argument deprecation
In-Reply-To: <491f4b14-8dbd-804e-c1e5-970640df954b@nedbatchelder.com>
References: <1502265380.2026839.1067738128.60EEB727@webmail.messagingengine.com>
 <14d3d469-319a-7e70-a8bf-054de57197bc@gmail.com>
 <491f4b14-8dbd-804e-c1e5-970640df954b@nedbatchelder.com>
Message-ID: <1502278705.2072904.1067929320.33FBB792@webmail.messagingengine.com>

> Another note about the proposal: calling it "deprecated" seems odd,
> since the proposal is really just a general-purpose callback. argparse
> isn't generating the warning, your callback function would be doing it.
> Why name it "deprecated"? How is this different than the "action"
> keyword argument that argparse already provides?

That sounds right. Maybe a better approach would be to implement a
custom action by inheriting from argparse.Action
https://docs.python.org/3/library/argparse.html#action
and do all the warning/deprecation work there.
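Something along these lines, maybe (untested sketch, the option name
and message are made up):

    import argparse
    import warnings

    class DeprecateAction(argparse.Action):
        def __call__(self, parser, namespace, values, option_string=None):
            warnings.warn("%s is deprecated" % option_string,
                          DeprecationWarning)
            setattr(namespace, self.dest, values)

    parser = argparse.ArgumentParser()
    parser.add_argument("--foo", action=DeprecateAction)
    args = parser.parse_args(["--foo", "bar"])  # warns, then args.foo == "bar"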
I'll experiment with this idea on my side to see how it goes :)

Cheers
Tarek

From ncoghlan at gmail.com Wed Aug 9 10:54:57 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 10 Aug 2017 00:54:57 +1000
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 
Message-ID: 

On 9 August 2017 at 15:38, Guido van Rossum wrote:
> On Tue, Aug 8, 2017 at 10:06 PM, Nick Coghlan wrote:
>> The OP's proposal doesn't fit into that category though: rather it's
>> asking about the case where we have an infinite iterator (e.g.
>> itertools.count(0)), and want to drop items until they start meeting
>> some condition (i.e. itertools.dropwhile) and then terminate the
>> iterator as soon as another condition is no longer met (i.e.
>> itertools.takewhile).
>
> I don't think that's what the OP meant. The original proposal seemed to
> assume that it would be somehow reasonable for the input ("integers" in the
> example) to be able to see and parse the condition in the generator
> expression ("1000 <= x < 100000" in the example, with "x" somehow known to
> be bound to the iteration value). That's at least what I think the remark "I
> like mathy syntax" referred to.

Right, I was separating the original request to make "{x for x in
integers if 1000 <= x < 1000000}" work into the concrete proposal to
make exactly *that* syntax work (which I don't think is feasible), and
the slightly more general notion of offering a more math-like syntax
that allows finite sets to be built from infinite iterators by
defining a termination condition in addition to a filter condition.

>> There aren't any technical barriers I'm aware of to implementing that,
>> with the main historical objection being that instead of the
>> comprehension level while clause mapping to a while loop directly the
>> way the for and if clauses map to their statement level counterparts,
>> it would instead map to the conditional break in the expanded
>> loop-and-a-half form:
>>
>>     while True:
>>         if not condition:
>>             break
>>
>> While it's taken me a long time to come around to the idea, "Make
>> subtle infinite loops in mathematical code easier to avoid" *is* a
>> pretty compelling user-focused justification for incurring that extra
>> complexity at the language design level.
>
> I haven't come around to this yet. It looks like it will make explaining
> comprehensions more complex, since the translation of "while X" into "if not
> X: break" feels less direct than the translations of "for x in xs" or "if
> pred(x)". (In particular, your proposal seems to require more experience
> with mentally translating loops and conditions into jumps -- most regulars
> of this forum do that for a living, but I doubt it's second nature for the
> OP.)

Yeah, if we ever did add something like this, I suspect a translation
using takewhile would potentially be easier for at least some users to
understand than the one to a break condition:

    {x for x in itertools.count(0) if 1000 <= x while x < 1000000}

<=>

    result = set()
    for x in itertools.count(0):
        if 1000 <= x:
            result.add(x)
        # If you've never used the loop-and-a-half idiom, it's
        # not obvious why "while <cond>" means "if not <cond>: break"
        if not x < 1000000:
            break

is roughly

    {x for x in itertools.takewhile(itertools.count(0), lambda x: x <
1000000) if 1000 <= x}

<=>

    result = set()
    for x in takewhile(itertools.count(0), lambda x: x < 1000000):
        if 1000 <= x:
            result.add(x)

However, the break condition is the translation that would make sense
at a language *implementation* level (and would hence be the one that
determined the relative location of the while clause in the expression
form).

That discrepancy *still* sets off alarm bells for me (since it's a
clear sign that "how people would think this works" and "how it would
actually work" probably wouldn't match), and I'm also conscious of the
amount of syntactic noise that "takewhile" introduces vs the "while"
keyword.

The counter-argument (which remains valid even against my own change
of heart) is that adding a new comprehension clause doesn't actually
fix the "accidental infinite loop" problem: "{x for x in
itertools.count(0) if 1000 <= x < 1000000}" will still loop forever,
it would just have a nicer fix to get it to terminate (adding a
"while" clause to turn the second filter condition into a termination
condition).
So while I'm +0 where I used to be a firm -1, it's still only a +0 :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From fakedme+py at gmail.com Wed Aug 9 11:49:13 2017 From: fakedme+py at gmail.com (Soni L.) Date: Wed, 9 Aug 2017 12:49:13 -0300 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> Message-ID: On 2017-08-09 11:54 AM, Nick Coghlan wrote: > On 9 August 2017 at 15:38, Guido van Rossum wrote: >> On Tue, Aug 8, 2017 at 10:06 PM, Nick Coghlan wrote: >>> The OP's proposal doesn't fit into that category though: rather it's >>> asking about the case where we have an infinite iterator (e.g. >>> itertools.count(0)), and want to drop items until they start meeting >>> some condition (i.e. itertools.dropwhile) and then terminate the >>> iterator as soon as another condition is no longer met (i.e. >>> itertools.takewhile). >> I don't think that's what the OP meant. The original proposal seemed to >> assume that it would be somehow reasonable for the input ("integers" in the >> example) to be able to see and parse the condition in the generator >> expression ("1000 <= x < 100000" in the example, with "x" somehow known to >> be bound to the iteration value). That's at least what I think the remark "I >> like mathy syntax" referred to. > Right, I was separating the original request to make "{x for x in > integers if 1000 <= x < 1000000}" work into the concrete proposal to > make exactly *that* syntax work (which I don't think is feasible), and > the slightly more general notion of offering a more math-like syntax > that allows finite sets to be built from infinite iterators by > defining a termination condition in addition to a filter condition. Ok. A concrete proposal would give a read-only 'filter' argument to the iterator somehow, which represents some form of simplified AST of the condition. So e.g. {x for x in integers if (lambda v: 1000 <= v < 1000000)(x)} would never complete, but {x for x in integers if 1000 <= x < 1000000} would. (But perhaps lambda objects should include an AST attribute... Having it for normal functions would introduce too much overhead tho, and then it would no longer be a simplified AST, but rather a complete python AST, which we don't want.) > >>> There aren't any technical barriers I'm aware of to implementing that, >>> with the main historical objection being that instead of the >>> comprehension level while clause mapping to a while loop directly the >>> way the for and if clauses map to their statement level counterparts, >>> it would instead map to the conditional break in the expanded >>> loop-and-a-half form: >>> >>> while True: >>> if not condition: >>> break >>> >>> While it's taken me a long time to come around to the idea, "Make >>> subtle infinite loops in mathematical code easier to avoid" *is* a >>> pretty compelling user-focused justification for incurring that extra >>> complexity at the language design level. >> I haven't come around to this yet. It looks like it will make explaining >> comprehensions more complex, since the translation of "while X" into "if not >> X: break" feels less direct than the translations of "for x in xs" or "if >> pred(x)". (In particular, your proposal seems to require more experience >> with mentally translating loops and conditions into jumps -- most regulars >> of this forum do that for a living, but I doubt it's second nature for the >> OP.) 
> Yeah, if we ever did add something like this, I suspect a translation
> using takewhile would potentially be easier for at least some users to
> understand than the one to a break condition:
>
>     {x for x in itertools.count(0) if 1000 <= x while x < 1000000}
>
> <=>
>
>     result = set()
>     for x in itertools.count(0):
>         if 1000 <= x:
>             result.add(x)
>         # If you've never used the loop-and-a-half idiom, it's
>         # not obvious why "while <cond>" means "if not <cond>: break"
>         if not x < 1000000:
>             break
>
> is roughly
>
>     {x for x in itertools.takewhile(itertools.count(0), lambda x: x <
> 1000000) if 1000 <= x}
>
> <=>
>
>     result = set()
>     for x in takewhile(itertools.count(0), lambda x: x < 1000000):
>         if 1000 <= x:
>             result.add(x)
>
> However, the break condition is the translation that would make sense
> at a language *implementation* level (and would hence be the one that
> determined the relative location of the while clause in the expression
> form).
>
> That discrepancy *still* sets off alarm bells for me (since it's a
> clear sign that "how people would think this works" and "how it would
> actually work" probably wouldn't match), and I'm also conscious of the
> amount of syntactic noise that "takewhile" introduces vs the "while"
> keyword.
>
> The counter-argument (which remains valid even against my own change
> of heart) is that adding a new comprehension clause doesn't actually
> fix the "accidental infinite loop" problem: "{x for x in
> itertools.count(0) if 1000 <= x < 1000000}" will still loop forever,
> it would just have a nicer fix to get it to terminate (adding a
> "while" clause to turn the second filter condition into a termination
> condition).
>
> So while I'm +0 where I used to be a firm -1, it's still only a +0 :)
>
> Cheers,
> Nick.
>

From e4r7hbug at gmail.com Wed Aug 9 13:42:18 2017
From: e4r7hbug at gmail.com (Nate.)
Date: Wed, 09 Aug 2017 17:42:18 +0000
Subject: [Python-ideas] Mimetypes Include application/json
Message-ID: 

Hi,

A friend and I have hit a funny situation with the `mimetypes.py` library
guessing the type for a '.json' file. Is there a reason why '.json' hasn't
been added to the mapping?

Without `mailcap` installed:

    [root at de169da8cc46 /]# python3 -m mimetypes build.json
    I don't know anything about type build.json

With `mailcap` installed:

    [root at de169da8cc46 /]# python3 -m mimetypes build.json
    type: application/json encoding: None

We experimented with adding a mapping for '.json' to 'application/json' to
`mimetypes.py` and it seems to work fine for us. It looks like it has been
registered with IANA and everything.

Proposed diff:

ntangsurat at derigible ~/git/e4r7hbug.cpython/Lib master $ git diff
diff --git a/Lib/mimetypes.py b/Lib/mimetypes.py
index 3d68694864..5919b45a9b 100644
--- a/Lib/mimetypes.py
+++ b/Lib/mimetypes.py
@@ -439,6 +439,7 @@ def _default_mime_types():
     '.jpeg'   : 'image/jpeg',
     '.jpg'    : 'image/jpeg',
     '.js'     : 'application/javascript',
+    '.json'   : 'application/json',
     '.ksh'    : 'text/plain',
     '.latex'  : 'application/x-latex',
     '.m1v'    : 'video/mpeg',
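For completeness, the same mapping can also be added at runtime through
the public API, with no stdlib change required, e.g.:

    import mimetypes

    mimetypes.add_type('application/json', '.json')
    print(mimetypes.guess_type('build.json'))  # ('application/json', None)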
Nate.

From brett at python.org Wed Aug 9 14:17:59 2017
From: brett at python.org (Brett Cannon)
Date: Wed, 09 Aug 2017 18:17:59 +0000
Subject: [Python-ideas] Mimetypes Include application/json
In-Reply-To: 
References: 
Message-ID: 

On Wed, 9 Aug 2017 at 10:43 Nate. wrote:

> Hi,
>
> A friend and I have hit a funny situation with the `mimetypes.py` library
> guessing the type for a '.json' file. Is there a reason why '.json' hasn't
> been added to the mapping?
>

Probably no one thought about it since the module was added in 1997 which
is only 2 years after the creation of JavaScript itself. :)

>
> Without `mailcap` installed:
>
> [root at de169da8cc46 /]# python3 -m mimetypes build.json
> I don't know anything about type build.json
>
> With `mailcap` installed:
>
> [root at de169da8cc46 /]# python3 -m mimetypes build.json
> type: application/json encoding: None
>
> We experimented with adding a mapping for '.json' to 'application/json' to
> `mimetypes.py` and it seems to work fine for us. It looks like it has been
> registered with IANA and everything.
>
> Proposed diff:
>
> ntangsurat at derigible ~/git/e4r7hbug.cpython/Lib master $ git diff
> diff --git a/Lib/mimetypes.py b/Lib/mimetypes.py
> index 3d68694864..5919b45a9b 100644
> --- a/Lib/mimetypes.py
> +++ b/Lib/mimetypes.py
> @@ -439,6 +439,7 @@ def _default_mime_types():
>     '.jpeg' : 'image/jpeg',
>     '.jpg' : 'image/jpeg',
>     '.js' : 'application/javascript',
> +    '.json' : 'application/json',
>     '.ksh' : 'text/plain',
>     '.latex' : 'application/x-latex',
>     '.m1v' : 'video/mpeg',
>

Feel free to file a bug at bugs.python.org and if you aren't too bothered
then submit a PR to github.com/python/cpython (https://devguide.python.org/
has all the details).

From storchaka at gmail.com Wed Aug 9 14:24:49 2017
From: storchaka at gmail.com (Serhiy Storchaka)
Date: Wed, 9 Aug 2017 21:24:49 +0300
Subject: [Python-ideas] Mimetypes Include application/json
In-Reply-To: 
References: 
Message-ID: 

09.08.17 21:17, Brett Cannon wrote:
> On Wed, 9 Aug 2017 at 10:43 Nate. wrote:
> A friend and I have hit a funny situation with the `mimetypes.py`
> library
> guessing the type for a '.json' file. Is there a reason why '.json'
> hasn't been added to the mapping?
>
> Probably no one thought about it since the module was added in 1997
> which is only 2 years after the creation of JavaScript itself. :)

No one proposed a patch.

> Feel free to file a bug at bugs.python.org and
> if you aren't too bothered then submit a PR to github.com/python/cpython
> (https://devguide.python.org/ has all
> the details).

https://bugs.python.org/issue30824

From phd at phdru.name Wed Aug 9 14:25:55 2017
From: phd at phdru.name (Oleg Broytman)
Date: Wed, 9 Aug 2017 20:25:55 +0200
Subject: [Python-ideas] Mimetypes Include application/json
In-Reply-To: 
References: 
Message-ID: <20170809182555.GA4079@phdru.name>

On Wed, Aug 09, 2017 at 05:42:18PM +0000, "Nate." wrote:
> A friend and I have hit a funny situation with the `mimetypes.py` library
> guessing the type for a '.json' file. Is there a reason why '.json' hasn't
> been added to the mapping?

My guess is that nobody uses mimetypes without mailcap.

> Without `mailcap` installed:
>
> [root at de169da8cc46 /]# python3 -m mimetypes build.json
> I don't know anything about type build.json
>
> With `mailcap` installed:
>
> [root at de169da8cc46 /]# python3 -m mimetypes build.json
> type: application/json encoding: None
>
> We experimented with adding a mapping for '.json' to 'application/json' to
> `mimetypes.py` and it seems to work fine for us. It looks like it has been
> registered with IANA and everything.
>
> Proposed diff:

Patches should be published at the issue tracker.
> ntangsurat at derigible ~/git/e4r7hbug.cpython/Lib master $ git diff
> diff --git a/Lib/mimetypes.py b/Lib/mimetypes.py
> index 3d68694864..5919b45a9b 100644
> --- a/Lib/mimetypes.py
> +++ b/Lib/mimetypes.py
> @@ -439,6 +439,7 @@ def _default_mime_types():
>     '.jpeg' : 'image/jpeg',
>     '.jpg' : 'image/jpeg',
>     '.js' : 'application/javascript',
> +    '.json' : 'application/json',
>     '.ksh' : 'text/plain',
>     '.latex' : 'application/x-latex',
>     '.m1v' : 'video/mpeg',
>
> Nate.

Oleg.
--
Oleg Broytman http://phdru.name/ phd at phdru.name
Programmers don't die, they just GOSUB without RETURN.

From e4r7hbug at gmail.com Wed Aug 9 14:50:47 2017
From: e4r7hbug at gmail.com (Nate.)
Date: Wed, 09 Aug 2017 18:50:47 +0000
Subject: [Python-ideas] Mimetypes Include application/json
In-Reply-To: 
References: 
Message-ID: 

Oh, fun! Thank you for the guidance. I managed to find a Bug already
created, http://bugs.python.org/issue30824. I'll create a Pull Request
using that Bug.

On Wed, Aug 9, 2017 at 1:18 PM Brett Cannon wrote:

> On Wed, 9 Aug 2017 at 10:43 Nate. wrote:
>
>> Hi,
>>
>> A friend and I have hit a funny situation with the `mimetypes.py` library
>> guessing the type for a '.json' file. Is there a reason why '.json'
>> hasn't been added to the mapping?
>>
>
> Probably no one thought about it since the module was added in 1997 which
> is only 2 years after the creation of JavaScript itself. :)
>
>>
>> Without `mailcap` installed:
>>
>> [root at de169da8cc46 /]# python3 -m mimetypes build.json
>> I don't know anything about type build.json
>>
>> With `mailcap` installed:
>>
>> [root at de169da8cc46 /]# python3 -m mimetypes build.json
>> type: application/json encoding: None
>>
>> We experimented with adding a mapping for '.json' to 'application/json' to
>> `mimetypes.py` and it seems to work fine for us. It looks like it has been
>> registered with IANA and everything.
>>
>> Proposed diff:
>>
>> ntangsurat at derigible ~/git/e4r7hbug.cpython/Lib master $ git diff
>> diff --git a/Lib/mimetypes.py b/Lib/mimetypes.py
>> index 3d68694864..5919b45a9b 100644
>> --- a/Lib/mimetypes.py
>> +++ b/Lib/mimetypes.py
>> @@ -439,6 +439,7 @@ def _default_mime_types():
>>     '.jpeg' : 'image/jpeg',
>>     '.jpg' : 'image/jpeg',
>>     '.js' : 'application/javascript',
>> +    '.json' : 'application/json',
>>     '.ksh' : 'text/plain',
>>     '.latex' : 'application/x-latex',
>>     '.m1v' : 'video/mpeg',
>>
>
> Feel free to file a bug at bugs.python.org and if you aren't too bothered
> then submit a PR to github.com/python/cpython (
> https://devguide.python.org/ has all the details).
>

From chris.barker at noaa.gov Wed Aug 9 16:23:28 2017
From: chris.barker at noaa.gov (Chris Barker)
Date: Wed, 9 Aug 2017 13:23:28 -0700
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 
Message-ID: 

On Tue, Aug 8, 2017 at 10:06 PM, Nick Coghlan wrote:

> On 8 August 2017 at 09:06, Chris Barker wrote:
> > It would be nice to have an easier access to an "slice iterator" though
> --
> > one of these days I may write up a proposal for that.
>
> An idea I've occasionally toyed with [1] is some kind of "iterview"
> that wraps around an arbitrary iterable and produces lazy itertools
> based results rather than immediate views or copies.
>
> However, my experience is also that folks are *really* accustomed to
> syntactic operations on containers producing either full live views
> (e.g. memoryview or numpy slices, range as a dynamically computed
> container), or actual copies (builtin container types). Having them
> produce consumable iterators instead then gets confusing due to the
> number of operations that will implicitly consume them (including
> simple "x in y" checks).
>

I agree -- which is why I'm thinking of only adding a simple "iterable
slice", rather than changing the overall behavior of the container. It
would be quite clear what you are asking for.

> Right now, getting the "terminate when false" behaviour requires the
> use of takewhile:
>

I can't recall the use case(s) at the moment, but I have definitely wanted
a way to break out of a comprehension -- and not always with infinite
iterators.

After all, we have "break" in both for and while loops, so clearly there is
the use case...

If someone comes up with a clean and not confusing (and general purpose)
syntax, I think it would be very useful.

-CHB

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov

From tjreedy at udel.edu Wed Aug 9 17:22:40 2017
From: tjreedy at udel.edu (Terry Reedy)
Date: Wed, 9 Aug 2017 17:22:40 -0400
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 
Message-ID: 

On 8/9/2017 10:54 AM, Nick Coghlan wrote:
> On 9 August 2017 at 15:38, Guido van Rossum wrote:
>> On Tue, Aug 8, 2017 at 10:06 PM, Nick Coghlan wrote:
>>> The OP's proposal doesn't fit into that category though: rather it's
>>> asking about the case where we have an infinite iterator (e.g.
>>> itertools.count(0)), and want to drop items until they start meeting
>>> some condition (i.e. itertools.dropwhile) and then terminate the
>>> iterator as soon as another condition is no longer met (i.e.
>>> itertools.takewhile).
>>
>> I don't think that's what the OP meant. The original proposal seemed to
>> assume that it would be somehow reasonable for the input ("integers" in the
>> example) to be able to see and parse the condition in the generator
>> expression ("1000 <= x < 100000" in the example, with "x" somehow known to
>> be bound to the iteration value). That's at least what I think the remark "I
>> like mathy syntax" referred to.
>
> Right, I was separating the original request to make "{x for x in
> integers if 1000 <= x < 1000000}" work into the concrete proposal to
> make exactly *that* syntax work (which I don't think is feasible), and
> the slightly more general notion of offering a more math-like syntax
> that allows finite sets to be built from infinite iterators by
> defining a termination condition in addition to a filter condition.

We already have three nice one liners for that, one of which you gave.

x = set(filter(filter_condition, takewhile(continue_condition, source)))
x = set(x for x in takewhile(continue_condition, source) if filter_condition)
x = {x for x in takewhile(continue_condition, source) if filter_condition}

Replace takewhile with islice(source, max) if the continue condition is
(number seen < max). Add enumerate if the running count is needed
otherwise.

Terminating an infinite iterator and filtering the initial slice are
different operations. The operations are easily composed as they are,
in multiple ways. Trying to mix them together in one jumbled special
syntax is a bad idea to me.
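To make those concrete, with example conditions plugged in (my choices
here, just for illustration):

    from itertools import count, takewhile

    continue_condition = lambda x: x < 1000000
    filter_condition = lambda x: 1000 <= x

    x = {x for x in takewhile(continue_condition, count(0))
         if filter_condition(x)}
    assert x == set(range(1000, 1000000))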
>>> There aren't any technical barriers I'm aware of to implementing that,
>>> with the main historical objection being that instead of the
>>> comprehension level while clause mapping to a while loop directly the
>>> way the for and if clauses map to their statement level counterparts,
>>> it would instead map to the conditional break in the expanded
>>> loop-and-a-half form:
>>>
>>>     while True:
>>>         if not condition:
>>>             break

In other words, aside from other issues, you would have 'while' mean
'do...while' in this one special place. -1.

--
Terry Jan Reedy

From ncoghlan at gmail.com Wed Aug 9 23:24:45 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 10 Aug 2017 13:24:45 +1000
Subject: [Python-ideas] Pseudo methods
In-Reply-To: <22922.50589.497352.85140@turnbull.sk.tsukuba.ac.jp>
References: 
 <22922.50589.497352.85140@turnbull.sk.tsukuba.ac.jp>
Message-ID: 

On 9 August 2017 at 18:19, Stephen J. Turnbull wrote:
> Nick Coghlan writes:
>
> > To analyse and investigate this code, we need to "just know" that:
>
> You can of course hope that help(input().has_vowels) will tell you
> where to find it. If it doesn't, well, shame on you for depending on
> source-unavailable software that you don't understand. ;-)

We can't run "help" when we're reviewing a diff or otherwise reading
code in a situation where interactive help isn't available :)

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From ncoghlan at gmail.com Wed Aug 9 23:30:24 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 10 Aug 2017 13:30:24 +1000
Subject: [Python-ideas] Mimetypes Include application/json
In-Reply-To: 
References: 
Message-ID: 

On 10 August 2017 at 04:24, Serhiy Storchaka wrote:
> 09.08.17 21:17, Brett Cannon wrote:
>>
>> On Wed, 9 Aug 2017 at 10:43 Nate. wrote:
>> A friend and I have hit a funny situation with the `mimetypes.py`
>> library
>> guessing the type for a '.json' file. Is there a reason why '.json'
>> hasn't been added to the mapping?
>>
>> Probably no one thought about it since the module was added in 1997
>> which is only 2 years after the creation of JavaScript itself. :)
>
> No one proposed a patch.

That's not *quite* true - there was at least one proposal a few years
back to modernise the mimetypes list, but the one I was involved in
reviewing got intertwined with a proposal to completely rewrite the
mimetypes module, and the submitter wasn't interested in creating a
more minimalist patch that solved the specific problem (i.e. the list
was pretty out of date) without all the extraneous changes to how the
module actually worked :(

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From ncoghlan at gmail.com Thu Aug 10 00:11:48 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 10 Aug 2017 14:11:48 +1000
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 
Message-ID: 

On 10 August 2017 at 00:54, Nick Coghlan wrote:
> Yeah, if we ever did add something like this, I suspect a translation
> using takewhile would potentially be easier for at least some users to
> understand than the one to a break condition:
>
>     {x for x in itertools.count(0) if 1000 <= x while x < 1000000}
>
> <=>
>
>     result = set()
>     for x in itertools.count(0):
>         if 1000 <= x:
>             result.add(x)
>         # If you've never used the loop-and-a-half idiom, it's
>         # not obvious why "while <cond>" means "if not <cond>: break"
>         if not x < 1000000:
>             break
>
> is roughly
>
>     {x for x in itertools.takewhile(itertools.count(0), lambda x: x <
> 1000000) if 1000 <= x}
>
> <=>
>
>     result = set()
>     for x in takewhile(itertools.count(0), lambda x: x < 1000000):
>         if 1000 <= x:
>             result.add(x)

Ugh, this discrepancy is worse than I thought, since the translation
with that clause order is actually wrong (Terry mentioned this by
pointing out that the proposed syntactic translation implemented
"do...while" ordering). The takewhile example is also wrong, since it
has the arguments in the wrong order.

Fixing both of those issues gives the comparison:

    {x for x in itertools.count(0) while x < 1000000 if 1000 <= x}

<=>

    result = set()
    for x in itertools.count(0):
        # If you've never used the loop-and-a-half idiom, it's not
        # obvious why "while <cond>" means "if <cond>: ... else: break"
        if x < 1000000:
            if 1000 <= x:
                result.add(x)
        else:
            break

is roughly:

    {x for x in itertools.takewhile(lambda x: x < 1000000,
    itertools.count(0)) if 1000 <= x}

<=>

    result = set()
    for x in takewhile(lambda x: x < 1000000, itertools.count(0)):
        if 1000 <= x:
            result.add(x)

And I think that gets me back to pretty much where I was the last time
this came up: a while clause in comprehensions really only makes sense
in combination with a while clause on for loops, where:

    for x in itertools.count(0) while x < 1000000:
        ...

was roughly equivalent to:

    for x in itertools.count(0):
        if x < 1000000:
            ...
        else:
            break

(such that there's only one loop from the point of view of
break/continue/else, but the loop may terminate based on either
exhaustion of the underlying iterator *or* some specific condition
becoming false)

While I do think such a clause would be more readable for more people
than the dropwhile/takewhile equivalents (especially when the latter
end up needing to use lambda expressions), I'm still dubious that these
cases come up often enough to justify the addition of a for-while loop
as a composite construct (the old "dropwhile and takewhile aren't even
common enough to justify being builtins, why should they jump all the
way to syntactic support?" question applies).

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From lists at cooperlees.com Thu Aug 10 00:21:56 2017
From: lists at cooperlees.com (Cooper Ry Lees)
Date: Thu, 10 Aug 2017 12:21:56 +0800
Subject: [Python-ideas] PyPI JSON Metadata Standardization for Mirrors
Message-ID: 

Hi all,

First time emailer, so please be kind. Also, if this is not the right
mailing list for PyPA talk, I apologize. Please point me in the right
direction if so.

The main reason I have emailed here is I believe it may be PEP time to
standardize the JSON metadata that PyPI makes available, like what was
done for the `simple API` described in PEP503.
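For anyone who hasn't used it, fetching that per-package metadata looks
something like this (a sketch; the package name is just an example):

    import json
    from urllib.request import urlopen

    # The JSON endpoint PyPI exposes for each package
    with urlopen('https://pypi.python.org/pypi/requests/json') as response:
        metadata = json.loads(response.read().decode('utf-8'))

    print(sorted(metadata))  # e.g. ['info', 'releases', 'urls']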
I've been doing a bit of work on `bandersnatch` (I didn't name it),
which is a PEP 381 mirroring package, and I wanted to enhance it to
also mirror the handy JSON metadata PyPI generates and makes available
@ https://pypi.python.org/pypi/PKG_NAME/json.

I've done a PR on bandersnatch as a POC that mirrors both the PyPI
directory structure (URL/pypi/PKG_NAME/json) and creates a
standardizable URL/json/PKG_NAME that the former symlinks to (to be
served by NGINX / some other proxy). I'm also contemplating naming the
directory 'metadata' rather than JSON, so if some new hotness comes
along / we want to change the format down the line, we're not stuck
with json as the dirname. This PR can be found here:
https://bitbucket.org/pypa/bandersnatch/pull-requests/33/save-json-metadata-to-mirror

My main use case is to write a very simple async 'verifier' tool that
will crawl all the JSON files and then ensure the packages directory on
each of my internal mirrors (I have a mirror per region / datacenter)
has all the files it should. I sync centrally (to save resources on the
PyPI infrastructure) and then rsync out all the diffs to each region /
datacenter, and under some failure scenarios I could miss a file or
many. So I feel using JSON pulled down from the authoritative source
will allow an async job to verify the MD5 of all the package files on
each mirror.

What are people's thoughts here? Is it worth a PEP similar to PEP503
going forward? Can people enhance / share some thoughts on this idea?

Thanks,
Cooper Lees
me at cooperlees.com
https://cooperlees.com/

From ncoghlan at gmail.com Thu Aug 10 00:43:50 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 10 Aug 2017 14:43:50 +1000
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 
Message-ID: 

On 10 August 2017 at 01:49, Soni L. wrote:
> On 2017-08-09 11:54 AM, Nick Coghlan wrote:
>> Right, I was separating the original request to make "{x for x in
>> integers if 1000 <= x < 1000000}" work into the concrete proposal to
>> make exactly *that* syntax work (which I don't think is feasible), and
>> the slightly more general notion of offering a more math-like syntax
>> that allows finite sets to be built from infinite iterators by
>> defining a termination condition in addition to a filter condition.
>
> Ok. A concrete proposal would give a read-only 'filter' argument to the
> iterator somehow, which represents some form of simplified AST of the
> condition.
>
> So e.g. {x for x in integers if (lambda v: 1000 <= v < 1000000)(x)} would
> never complete, but {x for x in integers if 1000 <= x < 1000000} would. (But
> perhaps lambda objects should include an AST attribute... Having it for
> normal functions would introduce too much overhead tho, and then it would no
> longer be a simplified AST, but rather a complete python AST, which we don't
> want.)

There have been a variety of different "thunking" proposals over the
years, but they've all foundered on the question of what the
*primitive* quoted form should look like, and how the thunks should
subsequently be executed.

For cases like this, where integration with Python's name resolution
mechanism isn't actually required, folks have ended up just using
strings, where the only downside is the fact that syntax highlighters
and other static analysers don't know that the contents are supposed
to be valid Python code.
In a case like this, that might look like:

    {x for x in integers.build_set("1000 <= x < 1000000")}

As with regexes, the cost of dynamically parsing such strings can then
be amortised at runtime through the use of an appropriate caching
strategy.

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From steve at pearwood.info Thu Aug 10 09:42:33 2017
From: steve at pearwood.info (Steven D'Aprano)
Date: Thu, 10 Aug 2017 23:42:33 +1000
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 
Message-ID: <20170810134232.GE7395@ando.pearwood.info>

On Wed, Aug 09, 2017 at 01:23:28PM -0700, Chris Barker wrote:

> I can't recall the use case(s) at the moment, but I have definitely wanted
> a way to break out of a comprehension -- and not always with infinite
> iterators.
>
> After all, we have "break" in both for and while loops, so clearly there is
> the use case...

Indeed :-)

> If someone comes up with a clean and not confusing (and general purpose)
> syntax, I think it would be very useful.

We used to be able to (ab)use StopIteration to do this:

    def Break():
        raise StopIteration

    # generator expressions only, not list comprehensions
    result = (expression for x in sequence if condition or Break())

but I believe that loophole has been closed in 3.6.

Comprehensions in Clojure have this feature:

http://clojuredocs.org/clojure_core/clojure.core/for

Clojure uses "when" where Python uses "if", giving:

    ;; :when continues through the collection even if some have the
    ;; condition evaluate to false, like filter
    user=> (for [x (range 3 33 2) :when (prime? x)] x)
    (3 5 7 11 13 17 19 23 29 31)

    ;; :while stops at the first collection element that evaluates to
    ;; false, like take-while
    user=> (for [x (range 3 33 2) :while (prime? x)] x)
    (3 5 7)

Translating into Python:

    [x for x in range(3, 33, 2) if is_prime(x)]
    [x for x in range(3, 33, 2) while is_prime(x)]  # hypothetical syntax
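(Both behaviours are checkable today - the first directly, the second
via takewhile; the is_prime here is a naive stand-in:)

    from itertools import takewhile

    def is_prime(n):
        # naive trial division, purely for the demo
        return n > 1 and all(n % d for d in range(2, int(n ** 0.5) + 1))

    print([x for x in range(3, 33, 2) if is_prime(x)])
    # [3, 5, 7, 11, 13, 17, 19, 23, 29, 31]
    print(list(takewhile(is_prime, range(3, 33, 2))))
    # [3, 5, 7]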
> > Yeah, if we ever did add something like this, I suspect a translation > using takewhile would potentially be easier for at least some users to > understand than the one to a break condition: "Some users"? Sure, why not? There's probably somebody out there who understands takewhile, but if so, I don't know who they are :-) I always have to look at the docs for takewhile to remind myself whether it drops items ("takes them away") while the condition is true, or yields items ("gives items") while the condition is true. > {x for x in itertools.count(0) if 1000 <= x while x < 1000000} > > <=> > > x = set() > for x in itertools.count(0): > if 1000 <= x: > set.add(x) > # If you've never used the loop-and-a-half idiom, it's > # not obvious why "while " means "if not : break" > if not x < 1000000: > break I'd like to take issue with that "not obvious" comment. I think that anyone who knows while loops knows that the loop exits when the condition becomes false. That's exactly the behaviour we get for the (hypothetical) [expr for x in seq while condition] syntax: when the condition is false, the loop and hence the comprehension, exits. For such simple cases, there's no need to think about "loop and a half". The obvious explanation is that the loop exits when the while condition fails. Based on my experience with beginners on the tutor mailing list, and elsewhere, I think there's a definite learning "hump" to get over before people grok even the trivial case of [expression for x in sequence] but once they do, then adding an "if" clause is obvious, and I expect that the same will apply to "when". Once you move beyond the simple case of a single for and no more than a single if (or while), I don't think there's *anything* obvious about comprehension syntax at all, while clause or no while clause. Holding the while clause to a standard that comprehensions already fail (in my opinion) is unfair: [expression for x in seq1 for y in seq2 if pred1 for z in seq3 if pred2 if pred3 if pred4 for w in seq4 while condition for v in seq5] I don't think it's the "while" that tips that over the edge, readability-wise :-) In any case, I think we're all guessing whether or not people will understand the "while condition" syntax. So I've done an informal survey on the Python-Ideas list, and once folks have had a day or so to answer I'll report what they say. It's not a truly scientific UI test, but it's the best I can do. -- Steve From p.f.moore at gmail.com Thu Aug 10 11:39:32 2017 From: p.f.moore at gmail.com (Paul Moore) Date: Thu, 10 Aug 2017 16:39:32 +0100 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: <20170810134232.GE7395@ando.pearwood.info> References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> <20170810134232.GE7395@ando.pearwood.info> Message-ID: On 10 August 2017 at 14:42, Steven D'Aprano wrote: > I don't think it is confusing. Regardless of the implementation, the > meaning of: > > [expression for x in sequence while condition] > > should (I believe) be obvious to anyone who already groks comprehension > syntax. The mapping to a for-loop is admittedly a tad more complex: > > result = [] > for x in sequence: > if not condition: break > result.append(expression) > > but I'm yet to meet anyone who routinely and regularly reads > comprehensions by converting them to for loops like that. And if they > did, all they need do is mentally map "while condition" to "if not > condition: break" and it should all Just Work?. The hard part is the interaction between if and while. 
Consider (expr for var in seq if cond1 while cond2):

This means:

for var in seq:
    if cond1:
        if not cond2: break
        yield expr

Note that unlike all other comprehension clauses (for and if) while
doesn't introduce a new level of nesting. That's an inconsistency, and
while it's minor, it would need clarifying (my original draft of this
email was a mess, because I misinterpreted how if and while would
interact, precisely over this point).

Also, there's a potential issue here - consider

[expr for var in even_numbers() if is_odd(var) while var < 100]

This is an infinite loop, even though it has a finite termination
condition (var < 100), because we only test the termination condition
if var is odd, which it never will be.

Obviously, this is a contrived example. And certainly "don't do that,
then" is a valid response. But my instinct is that people are going to
get this wrong - *especially* in a maintenance environment. That
example could have started off being "for var in count(0)" and then
someone realised they could "optimise" it by omitting odd numbers,
introducing the bug in the process. (And I'm sure real life code could
come up with much subtler examples ;-))

Overall, I agree with Steven's point. It seems pretty obvious what the
intention is, and while it's probably possible to construct examples
that are somewhat unclear,

1. The mechanical rule gives an explicit meaning
2. People shouldn't be writing such complex comprehensions, so if the
rule doesn't give what they expect, they can always rewrite the code
with an explicit (and clearer) loop.

But while I think this says that the above interpretation of while is
the only sensible one, and in general other approaches are unlikely to
be as natural, I *don't* think that it unequivocally says that
allowing while is a good thing. It may still be better to omit it, and
force people to state their intent explicitly (albeit a bit more
verbosely).

Paul

From fakedme+py at gmail.com  Thu Aug 10 11:39:34 2017
From: fakedme+py at gmail.com (Soni L.)
Date: Thu, 10 Aug 2017 12:39:34 -0300
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
Message-ID: 

On 2017-08-10 01:43 AM, Nick Coghlan wrote:
> On 10 August 2017 at 01:49, Soni L. wrote:
>> On 2017-08-09 11:54 AM, Nick Coghlan wrote:
>>> Right, I was separating the original request to make "{x for x in
>>> integers if 1000 <= x < 1000000}" work into the concrete proposal to
>>> make exactly *that* syntax work (which I don't think is feasible), and
>>> the slightly more general notion of offering a more math-like syntax
>>> that allows finite sets to be built from infinite iterators by
>>> defining a termination condition in addition to a filter condition.
>> Ok. A concrete proposal would give a read-only 'filter' argument to the
>> iterator somehow, which represents some form of simplified AST of the
>> condition.
>>
>> So e.g. {x for x in integers if (lambda v: 1000 <= v < 1000000)(x)} would
>> never complete, but {x for x in integers if 1000 <= x < 1000000} would. (But
>> perhaps lambda objects should include an AST attribute... Having it for
>> normal functions would introduce too much overhead tho, and then it would no
>> longer be a simplified AST, but rather a complete python AST, which we don't
>> want.)
> There have been a variety of different "thunking" proposals over the
> years, but they've all foundered on the question of what the
> *primitive* quoted form should look like, and how the thunks should
> subsequently be executed.
> > For cases like this, where integration with Python's name resolution > mechanism isn't actually required, folks have ended up just using > strings, where the only downside is the fact that syntax highlighters > and other static analysers don't know that the contents are supposed > to be valid Python code. In a case like this, that might look like: > > {x for x in integers.build_set("1000 <= x < 1000000")} > > As with regexes, the cost of dynamically parsing such strings can then > be amortised at runtime through the use of an appropriate caching > strategy. I'm pretty sure I read somewhere that lambdas and generators share their syntax, and that syntax is already a subset of python syntax. Would it be too hard to expose that with a "simplified AST" API? > > Cheers, > Nick. > From steve at pearwood.info Thu Aug 10 12:11:42 2017 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 11 Aug 2017 02:11:42 +1000 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> Message-ID: <20170810161142.GH7395@ando.pearwood.info> On Thu, Aug 10, 2017 at 12:39:34PM -0300, Soni L. wrote: > I'm pretty sure I read somewhere that lambdas and generators share their > syntax, and that syntax is already a subset of python syntax. Would it > be too hard to expose that with a "simplified AST" API? I don't understand what you mean by this. The syntax for lambda is (roughly): lambda parameter-list : expression The syntax for generators is (again, roughly): def name ( parameter-list ) : suite-containing-yield Obviously the generator suite can contain expressions, and both have a parameter-list. What shared syntax are you referring to, and how is it relevant? Or are you referring to generator expressions, rather than generators? ( expression for target in expression ... ) Obviously a Python expression is a Python expression, wherever it is, so a lambda can contain generator expressions, and generator expressions can contain lambdas... And what do you mean by "simplified AST" API? I'm afraid your comment is too abstract for me to understand. -- Steve From fakedme+py at gmail.com Thu Aug 10 12:37:29 2017 From: fakedme+py at gmail.com (Soni L.) Date: Thu, 10 Aug 2017 13:37:29 -0300 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: <20170810161142.GH7395@ando.pearwood.info> References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> <20170810161142.GH7395@ando.pearwood.info> Message-ID: On 2017-08-10 01:11 PM, Steven D'Aprano wrote: > On Thu, Aug 10, 2017 at 12:39:34PM -0300, Soni L. wrote: > >> I'm pretty sure I read somewhere that lambdas and generators share their >> syntax, and that syntax is already a subset of python syntax. Would it >> be too hard to expose that with a "simplified AST" API? > I don't understand what you mean by this. > > The syntax for lambda is (roughly): > > lambda parameter-list : expression > > The syntax for generators is (again, roughly): > > def name ( parameter-list ) : > suite-containing-yield > > > Obviously the generator suite can contain expressions, and both have a > parameter-list. What shared syntax are you referring to, and how is it > relevant? > > Or are you referring to generator expressions, rather than generators? > > ( expression for target in expression ... ) > > Obviously a Python expression is a Python expression, wherever it is, so > a lambda can contain generator expressions, and generator expressions > can contain lambdas... > > And what do you mean by "simplified AST" API? 
I'm afraid your comment is
> too abstract for me to understand.
>
Yes, see, both are expressions. Expression AST is a subset of python
AST, so it's a simplified form of the python AST.

>

From brett at python.org  Thu Aug 10 15:09:34 2017
From: brett at python.org (Brett Cannon)
Date: Thu, 10 Aug 2017 19:09:34 +0000
Subject: [Python-ideas] PyPI JSON Metadata Standardization for Mirrors
In-Reply-To: 
References: 
Message-ID: 

The proper list for this would be distutils-sig as that's where
packaging-related discussions typically occur.

On Wed, 9 Aug 2017 at 21:22 Cooper Ry Lees wrote:

> Hi all,
>
> First time emailer, so please be kind. Also, if this is not the right
> mailing list for PyPA talk, I apologize. Please point me in the right
> direction if so. The main reason I have emailed here is I believe it may be
> PEP time to standardize the JSON metadata that PyPI makes available, like
> what was done for the `simple API` described in PEP503.
>
> I've been doing a bit of work on `bandersnatch` (I didn't name it), which
> is a PEP 381 mirroring package and wanted to enhance it to also mirror the
> handy JSON metadata PyPI generates and makes available @
> https://pypi.python.org/pypi/PKG_NAME/json.
>
> I've done a PR on bandersnatch as a POC that mirrors both the PyPI
> directory structure (URL/pypi/PKG_NAME/json) and created a standardizable
> URL/json/PKG_NAME that the former symlinks to (to be served by NGINX / some
> other proxy). I'm also contemplating naming the directory 'metadata' rather
> than JSON so if some new hotness / we want to change the format down the
> line we're not stuck with json as the dirname. This PR can be found here:
> https://bitbucket.org/pypa/bandersnatch/pull-requests/33/save-json-metadata-to-mirror
>
> My main use case is to write a very simple async 'verifier' tool that will
> crawl all the JSON files and then ensure the packages directory on each of
> my internal mirrors (I have a mirror per region / datacenter) have all the
> files they should. I sync centrally (to save resource on the PyPI
> infrastructure) and then rsync out all the diffs to each region /
> datacenter, and under some failure scenarios I could miss a file or many.
> So I feel using JSON pulled down from the authoritative source will allow
> an async job to verify the MD5 of all the package files on each mirror.
>
> What are people's thoughts here? Is it worth a PEP similar to PEP503 going
> forward? Can people enhance / share some thoughts on this idea.
>
> Thanks,
> Cooper Lees
> me at cooperlees.com
> https://cooperlees.com/
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From tjreedy at udel.edu  Thu Aug 10 16:03:42 2017
From: tjreedy at udel.edu (Terry Reedy)
Date: Thu, 10 Aug 2017 16:03:42 -0400
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: <20170810134232.GE7395@ando.pearwood.info>
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 <20170810134232.GE7395@ando.pearwood.info>
Message-ID: 

On 8/10/2017 9:42 AM, Steven D'Aprano wrote:
> On Wed, Aug 09, 2017 at 01:23:28PM -0700, Chris Barker wrote:
>
>> I can't recall the use case(s) at the moment, but I have definitely wanted
>> a way to break out of a comprehension -- and not always with infinite
>> iterators.
>> After all, we have "break" in both for and while loops, so clearly there is
>> the use case...

In both cases, we use 'break' to mean break.  If we want to break
comprehensions, I think we should continue to use 'break' to mean
break instead of twisting 'while' to mean 'break'.

> [expression for x in sequence while condition]
>
> should (I believe) be obvious to anyone who already groks comprehension
> syntax. The mapping to a for-loop is admittedly a tad more complex:
>
> result = []
> for x in sequence:
>     if not condition: break
>     result.append(expression)

This is the same as

result = []
for x in sequence:
    if condition:
        result.append(expression)
    else:
        break

which could be written

[expression for x in sequence if condition break]

--
Terry Jan Reedy

From chris.barker at noaa.gov  Thu Aug 10 16:25:24 2017
From: chris.barker at noaa.gov (Chris Barker)
Date: Thu, 10 Aug 2017 13:25:24 -0700
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 <20170810134232.GE7395@ando.pearwood.info>
Message-ID: 

On Thu, Aug 10, 2017 at 8:39 AM, Paul Moore wrote:

> Also, there's a potential issue
> here - consider
>
> [expr for var in even_numbers() if is_odd(var) while var < 100]
>
> This is an infinite loop, even though it has a finite termination
> condition (var < 100), because we only test the termination condition
> if var is odd, which it never will be.
>

why is the termination only tested if the if clause is True? Could they
not be processed in parallel? or the while first....

so maybe better to do:

[expr for var in even_numbers() while var < 100 if is_odd(var)]

Maybe it's just me, but I would certainly expect the while to have
precedence.

I guess I think of it like this:

"if" is providing a filtering mechanism

"while" is providing a termination mechanism

-- is there a use case anyone can think of when they would want the while
to be applied to the list AFTER filtering?

Obviously, this is a contrived example. And certainly "don't do that,
> then" is a valid response. But my instinct is that people are going to
> get this wrong - *especially* in a maintenance environment.

sure, but would there be an issue if the while were given precedence?

Overall, I agree with Steven's point. It seems pretty obvious what the
> intention is, and while it's probably possible to construct examples
> that are somewhat unclear,
>
> 1. The mechanical rule gives an explicit meaning
> 2. People shouldn't be writing such complex comprehensions, so if the
> rule doesn't give what they expect, they can always rewrite the code
> with an explicit (and clearer) loop.
>

me too -- a direct translation to a for loop isn't necessary to
understand how it works.

-CHB

--

Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From chris.barker at noaa.gov  Thu Aug 10 16:28:12 2017
From: chris.barker at noaa.gov (Chris Barker)
Date: Thu, 10 Aug 2017 13:28:12 -0700
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 <20170810134232.GE7395@ando.pearwood.info>
Message-ID: 

On Thu, Aug 10, 2017 at 1:03 PM, Terry Reedy wrote:

> After all, we have "break" in both for and while loops, so clearly there is
>>> >> > In both cases, we use 'break' to mean break. If we want to break > comprehensions, I think we should continue to use 'break' to mean break > instead of twisting 'while' to mean 'break'. I was thinking that too. >> [expression for x in sequence if condition break] hmm, but if you want to filter, also? [expression for x in sequence if condition if condition break] or [expression for x in sequence if condition break if condition ] both of those seem more confusing to me than while. -CHB -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From p.f.moore at gmail.com Thu Aug 10 16:52:51 2017 From: p.f.moore at gmail.com (Paul Moore) Date: Thu, 10 Aug 2017 21:52:51 +0100 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> <20170810134232.GE7395@ando.pearwood.info> Message-ID: On 10 August 2017 at 21:25, Chris Barker wrote: > On Thu, Aug 10, 2017 at 8:39 AM, Paul Moore wrote: > >> >> Also, there's a potential issue >> here - consider >> >> [expr for var in even_numbers() if is_odd(var) while var < 100] >> >> This is an infinite loop, even though it has a finite termination >> condition (var < 100), because we only test the termination condition >> if var is odd, which it never will be. > > > why is the termination only tested if teh if clause is True? Could then not > be processed in parallel? or the while first.... See? That's my point - the "obvious" interpretation stops being obvious pretty fast... > so maybe better to do: > > [expr for var in even_numbers() while var < 100 if is_odd(var)] That would work. But I bet people's intuition wouldn't immediately lead to that fix (or indeed, necessarily incline them to put the clauses in this order in the first place). > Maybe it's just me, but I would certainly expect the while to have > precedence. > > I guess I think of it like this: > > "if" is providing a filtering mechanism > > "while" is providing a termination mechanism > > -- is there a use case anyone can think of when they would want the while > to be applied to the list AFTER filtering? Probably not, but when you can have multiple FORs, WHILEs and IFs, in any order, explaining the behaviour precisely while still preserving some sense of "filtering comes after termination" is going to be pretty difficult. [expr for var1 in seq1 if cond1 for var2 in seq2 for var3 in seq3 if cond2 if cond3] is legal - stupid, but legal. Now add while clauses randomly in that, and define your expected semantics clearly so a user (and the compiler!) can determine what the resulting mess means. The main benefit of the current "works like a for loop" interpretation is that it's 100% explicit. Nothing will make a mess like the above good code, but at least it's well-defined. Paul From spencerb21 at live.com Thu Aug 10 16:53:18 2017 From: spencerb21 at live.com (Spencer Brown) Date: Thu, 10 Aug 2017 20:53:18 +0000 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> <20170810134232.GE7395@ando.pearwood.info> , Message-ID: The logical solution to me is to allow any order of while and if, and follow the same 'rule' as multiple for loops - just nest/test those in that order. Then you can have whatever priority you need. 
One question though is how this should handle multiple loops - break all of them, or just the current one? - Spencer Brown On 11 Aug 2017, at 6:27 am, Chris Barker > wrote: On Thu, Aug 10, 2017 at 8:39 AM, Paul Moore > wrote: Also, there's a potential issue here - consider [expr for var in even_numbers() if is_odd(var) while var < 100] This is an infinite loop, even though it has a finite termination condition (var < 100), because we only test the termination condition if var is odd, which it never will be. why is the termination only tested if teh if clause is True? Could then not be processed in parallel? or the while first.... so maybe better to do: [expr for var in even_numbers() while var < 100 if is_odd(var)] Maybe it's just me, but I would certainly expect the while to have precedence. I guess I think of it like this: "if" is providing a filtering mechanism "while" is providing a termination mechanism -- is there a use case anyone can think of when they would want the while to be applied to the list AFTER filtering? Obviously, this is a contrived example. And certainly "don't do that, then" is a valid response. But my instinct is that people are going to get this wrong - *especially* in a maintenance environment. sure, but would there be an issue if teh while were given precedence? Overall, I agree with Steven's point. It seems pretty obvious what the intention is, and while it's probably possible to construct examples that are somewhat unclear, 1. The mechanical rule gives an explicit meaning 2. People shouldn't be writing such complex comprehensions, so if the rule doesn't give what they expect, they can always rewrite the code with an explicit (and clearer) loop. me too -- a direct translation to a for loop isn't necessary to understand how it works. -CHB -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov _______________________________________________ Python-ideas mailing list Python-ideas at python.org https://mail.python.org/mailman/listinfo/python-ideas Code of Conduct: http://python.org/psf/codeofconduct/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Thu Aug 10 18:12:07 2017 From: chris.barker at noaa.gov (Chris Barker) Date: Thu, 10 Aug 2017 15:12:07 -0700 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> <20170810134232.GE7395@ando.pearwood.info> Message-ID: On Thu, Aug 10, 2017 at 1:53 PM, Spencer Brown wrote: > The logical solution to me is to allow any order of while and if, and > follow the same 'rule' as multiple for loops - just nest/test those in that > order. > Actually, I think it would be better to only allow one order, and have the "while" always teeted first -- which may mean it should be placed first for clarity. > Then you can have whatever priority you need. One question though is how > this should handle multiple loops - break all of them, or just the current > one? > just the current one, just like a "break", or for that matter, a nested while... 
-CHB

> - Spencer Brown
>
> On 11 Aug 2017, at 6:27 am, Chris Barker wrote:
>
> On Thu, Aug 10, 2017 at 8:39 AM, Paul Moore wrote:
>
>> Also, there's a potential issue
>> here - consider
>>
>> [expr for var in even_numbers() if is_odd(var) while var < 100]
>>
>> This is an infinite loop, even though it has a finite termination
>> condition (var < 100), because we only test the termination condition
>> if var is odd, which it never will be.
>
> why is the termination only tested if the if clause is True? Could they
> not be processed in parallel? or the while first....
>
> so maybe better to do:
>
> [expr for var in even_numbers() while var < 100 if is_odd(var)]
>
> Maybe it's just me, but I would certainly expect the while to have
> precedence.
>
> I guess I think of it like this:
>
> "if" is providing a filtering mechanism
>
> "while" is providing a termination mechanism
>
> -- is there a use case anyone can think of when they would want the while
> to be applied to the list AFTER filtering?
>
>> Obviously, this is a contrived example. And certainly "don't do that,
>> then" is a valid response. But my instinct is that people are going to
>> get this wrong - *especially* in a maintenance environment.
>
> sure, but would there be an issue if the while were given precedence?
>
>> Overall, I agree with Steven's point. It seems pretty obvious what the
>> intention is, and while it's probably possible to construct examples
>> that are somewhat unclear,
>>
>> 1. The mechanical rule gives an explicit meaning
>> 2. People shouldn't be writing such complex comprehensions, so if the
>> rule doesn't give what they expect, they can always rewrite the code
>> with an explicit (and clearer) loop.
>
> me too -- a direct translation to a for loop isn't necessary to understand
> how it works.
>
> -CHB
>
> --
>
> Christopher Barker, Ph.D.
> Oceanographer
>
> Emergency Response Division
> NOAA/NOS/OR&R            (206) 526-6959   voice
> 7600 Sand Point Way NE   (206) 526-6329   fax
> Seattle, WA  98115       (206) 526-6317   main reception
>
> Chris.Barker at noaa.gov
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/
>

--

Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From wes.turner at gmail.com  Thu Aug 10 22:46:05 2017
From: wes.turner at gmail.com (Wes Turner)
Date: Thu, 10 Aug 2017 21:46:05 -0500
Subject: [Python-ideas] PyPI JSON Metadata Standardization for Mirrors
In-Reply-To: 
References: 
Message-ID: 

On Wednesday, August 9, 2017, Cooper Ry Lees wrote:

> Hi all,
>
> First time emailer, so please be kind. Also, if this is not the right
> mailing list for PyPA talk, I apologize. Please point me in the right
> direction if so.
>

Here are some notes re: changing metadata:
https://github.com/pypa/interoperability-peps/issues/31
https://www.google.com/search?q=pep426jsonld

Towards JSONLD is the best approach, I think.
So, that means it would be best, if you need to add additional
metadata (?) and must key things, to also copy the key into an object:

{"thing1": {"@id": "thing1", "url": "..."}}

Instead of just:

{"thing1": {"url": "..."}}

https://github.com/pypa/interoperability-peps/issues/31#issuecomment-233195564

> The main reason I have emailed here is I believe it may be PEP time to
> standardize the JSON metadata that PyPI makes available, like what was done
> for the `simple API` described in PEP503.
>
> I've been doing a bit of work on `bandersnatch` (I didn't name it), which
> is a PEP 381 mirroring package and wanted to enhance it to also mirror the
> handy JSON metadata PyPI generates and makes available @
> https://pypi.python.org/pypi/PKG_NAME/json.
>
> I've done a PR on bandersnatch as a POC that mirrors both the PyPI
> directory structure (URL/pypi/PKG_NAME/json) and created a standardizable
> URL/json/PKG_NAME that the former symlinks to (to be served by NGINX / some
> other proxy). I'm also contemplating naming the directory 'metadata' rather
> than JSON so if some new hotness / we want to change the format down the
> line we're not stuck with json as the dirname. This PR can be found here:
> https://bitbucket.org/pypa/bandersnatch/pull-requests/33/save-json-metadata-to-mirror
>
> My main use case is to write a very simple async 'verifier' tool that will
> crawl all the JSON files and then ensure the packages directory on each of
> my internal mirrors (I have a mirror per region / datacenter) have all the
> files they should. I sync centrally (to save resource on the PyPI
> infrastructure) and then rsync out all the diffs to each region /
> datacenter, and under some failure scenarios I could miss a file or many.
> So I feel using JSON pulled down from the authoritative source will allow
> an async job to verify the MD5 of all the package files on each mirror.
>
> What are people's thoughts here? Is it worth a PEP similar to PEP503 going
> forward? Can people enhance / share some thoughts on this idea.
>

Here are some notes on making this more efficient:
"Add API endpoint to get latest version of all projects"
https://github.com/pypa/warehouse/issues/347

... To http://markmail.org/search/?q=list:org.python.distutils-sig .

>
> Thanks,
> Cooper Lees
> me at cooperlees.com
> https://cooperlees.com/
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ncoghlan at gmail.com  Thu Aug 10 23:54:05 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 11 Aug 2017 13:54:05 +1000
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
Message-ID: 

On 11 August 2017 at 01:39, Soni L. wrote:
> I'm pretty sure I read somewhere that lambdas and generators share their
> syntax, and that syntax is already a subset of python syntax. Would it be
> too hard to expose that with a "simplified AST" API?
We already do, via the "mode" argument to the compile builtin and to
ast.parse:

>>> ast.dump(ast.parse("1000 <= x < 1000000", mode="eval"))
"Expression(body=Compare(left=Num(n=1000), ops=[LtE(), Lt()],
comparators=[Name(id='x', ctx=Load()), Num(n=1000000)]))"

>>> ast.parse("import sys", mode="eval")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/lib64/python3.6/ast.py", line 35, in parse
    return compile(source, filename, mode, PyCF_ONLY_AST)
  File "<unknown>", line 1
    import sys
         ^
SyntaxError: invalid syntax

It's a large part of the reason why passing strings around has so far
qualified as "good enough" - providing dedicated syntax for it doesn't
actually increase the language's expressiveness all that much, it just
has the potential to make static analysis easier by eagerly rendering
to an AST rather than having that be handled by the function receiving
the argument.

Cheers,
Nick.

--
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From ncoghlan at gmail.com  Fri Aug 11 00:34:53 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 11 Aug 2017 14:34:53 +1000
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 <20170810134232.GE7395@ando.pearwood.info>
Message-ID: 

On 11 August 2017 at 01:39, Paul Moore wrote:
> On 10 August 2017 at 14:42, Steven D'Aprano wrote:
>> I don't think it is confusing. Regardless of the implementation, the
>> meaning of:
>>
>> [expression for x in sequence while condition]
>>
>> should (I believe) be obvious to anyone who already groks comprehension
>> syntax. The mapping to a for-loop is admittedly a tad more complex:
>>
>> result = []
>> for x in sequence:
>>     if not condition: break
>>     result.append(expression)
>>
>> but I'm yet to meet anyone who routinely and regularly reads
>> comprehensions by converting them to for loops like that. And if they
>> did, all they need do is mentally map "while condition" to "if not
>> condition: break" and it should all Just Work™.
>
> The hard part is the interaction between if and while.
>
> Consider (expr for var in seq if cond1 while cond2):
>
> This means:
>
> for var in seq:
>     if cond1:
>         if not cond2: break
>         yield expr
>
> Note that unlike all other comprehension clauses (for and if) while
> doesn't introduce a new level of nesting. That's an inconsistency, and
> while it's minor, it would need clarifying (my original draft of this
> email was a mess, because I misinterpreted how if and while would
> interact, precisely over this point).

This is actually how I came to the conclusion that if we were ever to
do something like this, the termination condition would need to go
*before* the filter condition:

(expr for var in seq while loop_cond if filter_cond)

<=>

for var in seq:
    if loop_cond:
        if filter_cond:
            yield expr
    else:
        break

With the clauses in that order, the "while" keyword effectively
operates as "if-else-break" the same way it does in a regular while
loop, and could potentially be introduced as a modifying clause on
regular for loops at the same time.

One of the neat things the latter would allow is to make it even
easier to introduce a diagnostic loop counter into while loops:

while condition:
    ...

could become:

for iteration in itertools.count(1) while condition:
    ...

rather than having to implement a manually incremented loop counter
the way you do today.
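For concreteness, that same "termination before filtering" ordering is
already expressible with itertools.takewhile, so the semantics above
can be checked against running code today (a small runnable sketch;
the bounds are shrunk from the earlier example just to keep it cheap):

from itertools import count, takewhile

# Approximates the hypothetical
#     {x for x in count(0) while x < 10000 if 1000 <= x}
# takewhile applies the termination condition before the filter is
# applied, matching the clause ordering described above.
result = {x for x in takewhile(lambda x: x < 10000, count(0)) if 1000 <= x}
assert len(result) == 9000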
> Also, there's a potential issue
> here - consider
>
> [expr for var in even_numbers() if is_odd(var) while var < 100]
>
> This is an infinite loop, even though it has a finite termination
> condition (var < 100), because we only test the termination condition
> if var is odd, which it never will be.

This is another good reason why a termination condition would need to
be checked before the filter condition rather than either after it, or
only when the filter condition was true.

Cheers,
Nick.

--
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From steve at pearwood.info  Fri Aug 11 00:49:10 2017
From: steve at pearwood.info (Steven D'Aprano)
Date: Fri, 11 Aug 2017 14:49:10 +1000
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 <20170810134232.GE7395@ando.pearwood.info>
Message-ID: <20170811044910.GI7395@ando.pearwood.info>

On Thu, Aug 10, 2017 at 01:25:24PM -0700, Chris Barker wrote:
> On Thu, Aug 10, 2017 at 8:39 AM, Paul Moore wrote:
>
> > Also, there's a potential issue
> > here - consider
> >
> > [expr for var in even_numbers() if is_odd(var) while var < 100]
> >
> > This is an infinite loop, even though it has a finite termination
> > condition (var < 100), because we only test the termination condition
> > if var is odd, which it never will be.

I'm not sure why Paul thinks this is an issue. There are plenty of
ways to accidentally write an infinite loop in a comprehension, or a
for loop, already:

[expr for var in even_numbers()]

will do it, if even_numbers is unexpectedly an infinite iterator. Or
you could write:

for num in even_numbers():
    if is_odd(num) and num > 100:
        break

No loop syntax, whether it is functional style (takewhile, map, etc.),
comprehension, or traditional style for loops, enables the programmer
to avoid thinking about what they write.

> why is the termination only tested if the if clause is True? Could they not
> be processed in parallel? or the while first....

Because we're following the standard Python rule of left-to-right
execution. The while clause is tested only if the if clause is true
because it follows the if clause.

I think that there's an argument to be made for the rule: We can have
`if` in a comprehension, or `while`, but not both, in order to limit
complexity. Analogy:

(1) we intentionally limit the decorator @ syntax to a subset of
expressions;

(2) likewise we intentionally allow (but don't encourage)
monkey-patching of Python classes only, not built-ins.

Just because we *can* allow arbitrary code combinations doesn't mean
we *must*. We have a choice to say: "No, you cannot mix `if` and
`while` in the same comprehension. Why? Because we say so. Because it
is confusing if you do." I'd be okay with that rule.

But if we decide to allow arbitrary combinations of for/if/while in
comprehensions, then I think we must keep the same left-to-right rule
we have now. Currently we process multiple for/if clauses
left-to-right:

[expr for x in a if cond for y in b]

is equivalent to:

for x in a:
    if cond:
        for y in b:
            expr

rather than moving the `if` to the end. If you want it at the end, put
it there yourself. Adding `while` shouldn't change that. It would be
crazy-complicated to have a rule: "the presence of a while means the
comprehension is processed in parallel" or "all the while clauses are
processed before (after?) the if clauses, regardless of their order of
appearance."
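As a concrete sanity check of that left-to-right rule, here's a tiny
runnable comparison (the names a and b and the conditions are
throwaway placeholders, not from anyone's proposal):

a = [0, 1, 2]
b = ['x', 'y']

comp = [(x, y) for x in a if x % 2 == 0 for y in b]

# The mechanical left-to-right expansion of the comprehension above:
expanded = []
for x in a:
    if x % 2 == 0:
        for y in b:
            expanded.append((x, y))

assert comp == expanded  # both give [(0, 'x'), (0, 'y'), (2, 'x'), (2, 'y')]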
> so maybe better to do:
>
> [expr for var in even_numbers() while var < 100 if is_odd(var)]

Well sure, that's the *correct* way to write the code:

for var in even_numbers():
    if not (var < 100):
        break
    if is_odd(var):
        results.append(expr)

(for some definition of "correct" -- this is clearly an expensive way
to generate an empty list.)

But in general one might wish to test the if or the while in either
order.

> Maybe it's just me, but I would certainly expect the while to have
> precedence.

Does that apply to these idioms as well?

while cond:
    if flag:
        ...

versus:

if flag:
    while cond:
        ...

I would not expect them to be the same, and nor would I expect these
to be the same:

[expr for x in seq if flag while cond]

[expr for x in seq while cond if flag]

> I guess I think of it like this:
>
> "if" is providing a filtering mechanism
>
> "while" is providing a termination mechanism
>
> -- is there a use case anyone can think of when they would want the while
> to be applied to the list AFTER filtering?

[process(n) for n in numbers while n > 0 if is_odd(n)]

Halt on the first zero or negative number, regardless of whether it is
even or odd, but process only odd numbers.

Paul:
> > Obviously, this is a contrived example. And certainly "don't do that,
> > then" is a valid response. But my instinct is that people are going to
> > get this wrong - *especially* in a maintenance environment.

That's the argument for limiting comprehensions to either `if` or
`while` but not both. And I actually would be okay with that --
especially if we leave open the possibility of relaxing the
prohibition in the future.

But personally, I think that's under-estimating the ability of
programmers to reason about loops. Of course a comprehension with
multiple for/if/while clauses is hard to reason about, and we
shouldn't *encourage* them, but we don't prohibit multiple for/if
clauses. Why should `while` be held to a higher standard?

If we allow people to shoot themselves in the foot by writing complex
list comprehensions with ten `for` loops and seven `if` clauses, why
should we baulk at allowing them a `while` clause as well?

--
Steve

From ncoghlan at gmail.com  Fri Aug 11 00:52:10 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 11 Aug 2017 14:52:10 +1000
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 <20170810134232.GE7395@ando.pearwood.info>
Message-ID: 

On 11 August 2017 at 06:53, Spencer Brown wrote:
> The logical solution to me is to allow any order of while and if, and follow
> the same 'rule' as multiple for loops - just nest/test those in that order.
> Then you can have whatever priority you need. One question though is how
> this should handle multiple loops - break all of them, or just the current
> one?

This is why I think a for-while construct in comprehensions would
really only make sense in combination with a *statement* level
for-while construct, as the problem we have is:

- a termination condition can't readily use "if" (even in combination
with "break") because that's visually and syntactically ambiguous with
a filter condition
- a naive translation of a "while" based syntax makes it look like a
nested *non-terminating* loop

Both of those problems may be resolved if a "for-while" loop exists as
a top level looping construct that can terminate based on *either* an
iterator being exhausted *or* a condition becoming false.
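To make the intended semantics concrete, such a two-way termination
rule can be prototyped today as a small generator (a rough sketch
only; "bounded" is just an illustrative helper name, not a proposed
API):

def bounded(iterable, condition):
    # Terminate on iterator exhaustion *or* on the condition
    # becoming false, whichever happens first.
    for item in iterable:
        if not condition(item):
            break
        yield item

for item in bounded(range(10), lambda x: x < 4):
    print(item)  # prints 0, 1, 2, 3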
The question then becomes whether or not a "for-while" loop is
actually useful enough to be added as a general construct, given that
we already have "if not condition: break" as a way of modeling a loop
ending early because a condition became false.

One way to gather evidence on that front would be to survey the
standard library for places where we use "break", and see if any of
them would be more readable given a for-while construct, whether as a
statement, or as part of the comprehension syntax.

(Note: I'm not interested enough in the idea to do that evidence
gathering myself, I'm just pointing it out in case anyone is curious
enough to take the time to collect those details)

Cheers,
Nick.

--
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From steve at pearwood.info  Fri Aug 11 01:13:36 2017
From: steve at pearwood.info (Steven D'Aprano)
Date: Fri, 11 Aug 2017 15:13:36 +1000
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 <20170810134232.GE7395@ando.pearwood.info>
Message-ID: <20170811051336.GJ7395@ando.pearwood.info>

On Fri, Aug 11, 2017 at 02:34:53PM +1000, Nick Coghlan wrote:
> This is actually how I came to the conclusion that if we were ever to
> do something like this, the termination condition would need to go
> *before* the filter condition:

What if you want to check the filter condition before the termination
condition?

I have an iterable of arbitrary objects. I want to ignore anything
that isn't a string, and halt if the string doesn't start with "A".
This is easy:

[expr for s in objects if isinstance(s, str) while s.startswith("A")]

Why should we prohibit expressing this, and instead write it as this?

[expr for s in objects while (s.startswith("A") if isinstance(s, str) else True) if isinstance(s, str)]

Or split into multiple comprehensions?

[expr for s in [obj for obj in objects if isinstance(obj, str)] while s.startswith("A")]

> (expr for var in seq while loop_cond if filter_cond)
>
> <=>
>
> for var in seq:
>     if loop_cond:
>         if filter_cond:
>             yield expr
>     else:
>         break

We can still expand the clauses if they are presented in the opposite
order:

(expr for var in seq if filter_cond while loop_cond)

<=>

for var in seq:
    if filter_cond:
        if loop_cond:
            yield expr
        else:
            break

There's no need to prohibit that. It is meaningful and useful and just
because somebody might accidentally fail to exit an infinite loop is
no reason to ban this.

> This is another good reason why a termination condition would need to
> be checked before the filter condition rather than either after it, or
> only when the filter condition was true.

Why is this a problem that needs solving? Who is to say that an
infinite generator expression isn't exactly what the programmer wants?

If the halting condition is not true, the generator expression will
either keep going until the iterator is exhausted, or it will be an
infinite generator just like the unprocessed, unfiltered source
iterator. This is not necessarily a problem.

--
Steve

From steve at pearwood.info  Fri Aug 11 01:18:34 2017
From: steve at pearwood.info (Steven D'Aprano)
Date: Fri, 11 Aug 2017 15:18:34 +1000
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: <20170811044910.GI7395@ando.pearwood.info> References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> <20170810134232.GE7395@ando.pearwood.info> <20170811044910.GI7395@ando.pearwood.info> Message-ID: <20170811051834.GK7395@ando.pearwood.info> On Fri, Aug 11, 2017 at 02:49:10PM +1000, Steven D'Aprano wrote: > On Thu, Aug 10, 2017 at 01:25:24PM -0700, Chris Barker wrote: > > I guess I think of it like this: > > > > "if" is providing a filtering mechanism > > > > "while" is providing a termination mechanism > > > > -- is there a use case anyone can think of when they would want the while > > to be applied to the list AFTER filtering? Oops, sorry I had a thinko and read your question in the opposite sense than it actually is. See my response to Nick for an example: I have an iterable of arbitrary objects. I want to ignore anything that isn't a string, and halt if the string doesn't start with "A". [expr for s in objects if isinstance(s, str) while s.startswith("A")] -- Steve From ncoghlan at gmail.com Fri Aug 11 01:28:05 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 11 Aug 2017 15:28:05 +1000 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: <20170811051336.GJ7395@ando.pearwood.info> References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> <20170810134232.GE7395@ando.pearwood.info> <20170811051336.GJ7395@ando.pearwood.info> Message-ID: On 11 August 2017 at 15:13, Steven D'Aprano wrote: > On Fri, Aug 11, 2017 at 02:34:53PM +1000, Nick Coghlan wrote: >> This is another good reason why a termination condition would need to >> be checked before the filter condition rather than either after it, or >> only when the filter condition was true. > > Why is this a problem that needs solving? Because the most obvious interpretation of a completely independent "while" clause in comprehensions would be as a nested loop inside the outer for loop, not as a nested if-else-break statement. As a result of that, I'm only personally prepared to support for-while comprehensions if they're syntactic sugar for a combined statement level for-while loop that makes it clear why only the "for" clauses in a comprehension create new loops. I *wouldn't* be prepared to support them if they could only be explained in terms of a direct mapping to an if statement and had no statement level counterpart that actually used the "while" keyword. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From p.f.moore at gmail.com Fri Aug 11 06:01:22 2017 From: p.f.moore at gmail.com (Paul Moore) Date: Fri, 11 Aug 2017 11:01:22 +0100 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: <20170811044910.GI7395@ando.pearwood.info> References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> <20170810134232.GE7395@ando.pearwood.info> <20170811044910.GI7395@ando.pearwood.info> Message-ID: On 11 August 2017 at 05:49, Steven D'Aprano wrote: > On Thu, Aug 10, 2017 at 01:25:24PM -0700, Chris Barker wrote: >> On Thu, Aug 10, 2017 at 8:39 AM, Paul Moore wrote: >> >> >> > Also, there's a potential issue >> > here - consider >> > >> > [expr for var in even_numbers() if is_odd(var) while var < 100] >> > >> > This is an infinite loop, even though it has a finite termination >> > condition (var < 100), because we only test the termination condition >> > if var is odd, which it never will be. > > I'm not sure why Paul thinks this is an issue. 
There are plenty of ways > to accidentally write an infinite loop in a comprehension, or a for > loop, already: Mostly because I work in a support and maintenance environment, where we routinely see code that *originally* made sense, but which was over time modified in ways that break things - usually precisely because coders who in theory understand how to write such things correctly, end up not taking the time to fully understand the constructs they are modifying. Of course that's wrong, but it's sadly all too common, and for that reason I'm always wary of constructs that need thinking through carefully to understand the implications. Nick's original {x for x in itertools.count(0) if 1000 <= x while x < 1000000} was like that. It was *sort of* obvious that it meant "numbers between 1_000 and 1_000_000, but the interaction between "if" and "while" wasn't clear to me. If I were asked to rush in a change to only pick odd numbers, {x for x in itertools.count(0) if 1000 <= x and is_odd(x) while x < 1000000} seems right to me, but quick - what about edge cases? It's not that I can't get it right, nor is it that I can't test that I *did* get it right, just that this sort of "quick fix" is very common in the sort of real-world coding I see regularly, and a huge advantage of Python is that it's hard to get in a situation where the obvious guess is wrong. Don't get me wrong - I'm not arguing that the sky is falling. Just that this construct isn't as easy to understand as it seems at first (and that hard-to-understand cases appear *before* you hit the point where it's obvious that the statement is too complex and should be refactored. Paul From contact at brice.xyz Fri Aug 11 08:53:11 2017 From: contact at brice.xyz (Brice Parent) Date: Fri, 11 Aug 2017 14:53:11 +0200 Subject: [Python-ideas] Generator syntax hooks? In-Reply-To: References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com> <20170810134232.GE7395@ando.pearwood.info> Message-ID: (re-posting here as I first mistakenly answered directly to Terry. Sorry about that!) Le 10/08/17 ? 22:03, Terry Reedy a ?crit : > [...] > result = [] > for x in sequence: > if condition: > result.append(expression) > else: > break > > which could be written > > [expression for x in sequence if condition break] > It's what I thought too. Adding a `while` clause here just overly complicates the understanding of the comprehension. The `break` keyword is already easily understandable and helps to map the comprehension with the plain for-loop (I like this mapping for its reverse counterpart, as I often start with plain for-loops to rewrite them later to comprehensions when it makes sense). I would probably suggest this instead of Terry's proposal, though: [expression for x in sequence if condition*or *break] (maybe it's what you meant?). I suggest this because it doesn't imply the execution of a statement inside the comprehension, but just to continue the evaluation as it's always done. I admit it feels a bit hacky, but maybe just until we get used to it? -Brice -------------- next part -------------- An HTML attachment was scrubbed... URL: From jhihn at gmx.com Fri Aug 11 10:57:46 2017 From: jhihn at gmx.com (Jason H) Date: Fri, 11 Aug 2017 16:57:46 +0200 Subject: [Python-ideas] Towards harmony with JavaScript? Message-ID: Before I done my firesuit, I'd like to say that I much prefer python and I rail on JS whenever I can. However these days it is quite common to be doing work in both Python and Javascript. 
Harmonizing the two would help JS developers pick up the language, as
well as people like me who are stuck working in JS.

TIOBE has Python at 5 and JS at 8 https://www.tiobe.com/tiobe-index/
Redmonk: 1 and 1, respectively
http://redmonk.com/sogrady/2017/06/08/language-rankings-6-17/
PYPL: 2 and 5 respectively http://pypl.github.io/PYPL.html

While JS is strongly for web (Node.JS, Browsers) and Python has a weak
showing (Tornado, Flask), Python is very popular on everything else on
the backend where JS isn't and isn't likely to be. The point I'm
making is not to choose a 'winner', but to make the observation that,
given the tight clustering of the two languages, there will be
considerable overlap. People like me are asked to do both quite
frequently. So I'd like a little more harmony to aid in my day-to-day.
I have just as many python files as JS files open in my editor at this
moment.

There are several annoyances that, if removed, would go a long way.
1. Object literals: JS: {a:1} vs Python: {'a':1}
Making my fingers dance on ' or " is not a good use of keystrokes, and
it decreases readability. However a counter argument here is what
about when the a is a variable? JS allows o[a] as a way to assign to a
property that is a variable. Python of course offers functions that do
this, but for simple objects, this would very much be appreciated.
The point here is this is

2. Join: JS: [].join(s) vs Python: s.join([])
I've read the justification for putting join on a string, and it makes
sense. But I think we should put it on the list too.

3. Allow C/C++/JS style comments: JS:[ //, /* ] vs Python #
This one is pretty self-explanatory.

Some might want even more harmony, but I don't know the repercussions
of all of that. I think the above could be implemented without
breaking anything. What I do know is that 85% of my friction would be
removed if the above were implemented.

From tritium-list at sdamon.com  Fri Aug 11 11:09:05 2017
From: tritium-list at sdamon.com (Alex Walters)
Date: Fri, 11 Aug 2017 11:09:05 -0400
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: 
Message-ID: <4b04501d312b3$c6c91830$545b4890$@sdamon.com>

> -----Original Message-----
> From: Python-ideas [mailto:python-ideas-bounces+tritium-
> list=sdamon.com at python.org] On Behalf Of Jason H
> Sent: Friday, August 11, 2017 10:58 AM
> To: python-ideas at python.org
> Subject: [Python-ideas] Towards harmony with JavaScript?
>
> Before I don my firesuit, I'd like to say that I much prefer python and I rail
> on JS whenever I can. However these days it is quite common to be doing
> work in both Python and Javascript. Harmonizing the two would help JS
> developers pick up the language, as well as people like me who are stuck
> working in JS.
>
> TIOBE has Python at 5 and JS at 8 https://www.tiobe.com/tiobe-index/
> Redmonk: 1 and 1, respectively
> http://redmonk.com/sogrady/2017/06/08/language-rankings-6-17/
> PYPL: 2 and 5 respectively http://pypl.github.io/PYPL.html
>
> While JS is strongly for web (Node.JS, Browsers) and Python has a weak
> showing (Tornado, Flask),

And Django and pyramid.  And don't forget youtube.  Python has NO
weakness on the web.

> Python is very popular on everything else on the
> backend where JS isn't and isn't likely to be. The point I'm making is not to
> choose a 'winner', but to make the observation that, given the tight
> clustering of the two languages, there will be considerable overlap. People
> like me are asked to do both quite frequently.
So I'd like a little more
> harmony to aid in my day-to-day. I have just as many python files as JS files
> open in my editor at this moment.
>
> There are several annoyances that, if removed, would go a long way.
> 1. Object literals: JS: {a:1} vs Python: {'a':1}
> Making my fingers dance on ' or " is not a good use of keystrokes, and it
> decreases readability. However a counter argument here is what about
> when the a is a variable? JS allows o[a] as a way to assign to a property that
> is a variable. Python of course offers functions that do this, but for simple
> objects, this would very much be appreciated.

Been discussed.  Python will not make the same design flaw as JS in
this case.  If you really want bare keys, do `dict(a=1)`

> The point here is this is
>
> 2. Join: JS: [].join(s) vs Python: s.join([])
> I've read the justification for putting join on a string, and it makes sense.
> But I think we should put it on the list too.

Again, design decision python actually got right - you don't have to
implement a join method, you just have to pass an iterable of strings
to the one join method.  There is no question as to whether an
iterable of strings has a join method - as long as it's iterable, it's
joinable.  This too has been discussed ad nauseum, and is not going to
change.

> 3. Allow C/C++/JS style comments: JS:[ //, /* ] vs Python #
> This one is pretty self-explanatory.

// is valid python syntax (for an operator) - that makes the parser a
pain to implement.  I don't actually see any value at all in unifying
the comment characters... it's not like this is Windows Batch, where
the comment character is `REM` - # is used in a metric ton of
languages.

> Some might want even more harmony, but I don't know the repercussions of
> all of that. I think the above could be implemented without breaking
> anything. What I do know is that 85% of my friction would be removed if the
> above were implemented.
>
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/

From rosuav at gmail.com  Fri Aug 11 11:15:46 2017
From: rosuav at gmail.com (Chris Angelico)
Date: Sat, 12 Aug 2017 01:15:46 +1000
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: 
Message-ID: 

On Sat, Aug 12, 2017 at 12:57 AM, Jason H wrote:
> Before I don my firesuit, I'd like to say that I much prefer python and
> I rail on JS whenever I can. However these days it is quite common to be
> doing work in both Python and Javascript. Harmonizing the two would help
> JS developers pick up the language, as well as people like me who are
> stuck working in JS.
>
> TIOBE has Python at 5 and JS at 8 https://www.tiobe.com/tiobe-index/
> Redmonk: 1 and 1, respectively http://redmonk.com/sogrady/2017/06/08/language-rankings-6-17/
> PYPL: 2 and 5 respectively http://pypl.github.io/PYPL.html
>
> While JS is strongly for web (Node.JS, Browsers) and Python has a weak
> showing (Tornado, Flask), Python is very popular on everything else on
> the backend where JS isn't and isn't likely to be. The point I'm making
> is not to choose a 'winner', but to make the observation that, given the
> tight clustering of the two languages, there will be considerable
> overlap. People like me are asked to do both quite frequently. So I'd
> like a little more harmony to aid in my day-to-day. I have just as many
> python files as JS files open in my editor at this moment.
>
Python has a number of strong web frameworks - Django is probably the
best known.

> There are several annoyances that, if removed, would go a long way.
> 1. Object literals: JS: {a:1} vs Python: {'a':1}
> Making my fingers dance on ' or " is not a good use of keystrokes, and
> it decreases readability. However a counter argument here is what about
> when the a is a variable? JS allows o[a] as a way to assign to a
> property that is a variable.
> The point here is this is
>

Disagreed. Python is both more consistent and more flexible than JS
here. More flexible in that dict keys can be any hashable type, where
JS object properties are always strings; and more consistent in that a
value is always represented the same way. Consider literals and
variables as dict keys in Python:

# Literal
d = {'a': 1}
print(d['a'])
d['a'] = 2

# Variable
key = 'a'
d = {key: 1}
print(d[key])
d[key] = 2

Contrast JS:

// Literal
d = {a: 1}
console.log(d.a)
d.a = 2

// Variable
key = 'a'
d = {[key]: 1}
console.log(d[key])
d[key] = 2

In Python, a literal string is always in quotes, and an unquoted
symbol is always a name lookup. In JS, you can use the shorthand of
dot notation for literals that are valid symbols, but to use a
variable, you need to switch syntax. (To be fair, this is simply
adding a shorthand that Python doesn't have; you could use square
brackets and string literals in JS too. But people don't do that, so a
programmer has to know to read it using dot notation primarily.)
Coupled with the increased flexibility in what you can have in a dict
key, Python's requirement to quote keys is a small price to pay for
consistency.

> 2. Join: JS: [].join(s) vs Python: s.join([])
> I've read the justification for putting join on a string, and it makes
> sense. But I think we should put it on the list too.

This might be safe to add; but it needs to be well worth adding, since
it's just a different spelling for the exact same thing. -0.

> 3. Allow C/C++/JS style comments: JS:[ //, /* ] vs Python #
> This one is pretty self-explanatory.

If you'd asked for this a few years ago, maybe, but since // is a
division operator, that part of it won't fly. Possibly /* comments */
could be added though.

That's about all that I'd support adding, though.

ChrisA

From jhihn at gmx.com  Fri Aug 11 11:35:29 2017
From: jhihn at gmx.com (Jason H)
Date: Fri, 11 Aug 2017 17:35:29 +0200
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: 
Message-ID: 

Thanks for all the feedback so far, even if it's not the most
enthusiastic response to the ideas.

One thing I missed, and I don't know how I could (total face-palm) is:
4. Other list methods: i.e. and specifically: [].push(item) vs [].append(item)

> Sent: Friday, August 11, 2017 at 10:57 AM
> From: "Jason H"
> To: python-ideas at python.org
> Subject: [Python-ideas] Towards harmony with JavaScript?
>
> Before I don my firesuit, I'd like to say that I much prefer python and
> I rail on JS whenever I can. However these days it is quite common to be
> doing work in both Python and Javascript. Harmonizing the two would help
> JS developers pick up the language, as well as people like me who are
> stuck working in JS.
>
> TIOBE has Python at 5 and JS at 8 https://www.tiobe.com/tiobe-index/
> Redmonk: 1 and 1, respectively http://redmonk.com/sogrady/2017/06/08/language-rankings-6-17/
> PYPL: 2 and 5 respectively http://pypl.github.io/PYPL.html
>
> While JS is strongly for web (Node.JS, Browsers) and Python has a weak showing (Tornado, Flask), Python is very popular on everything else on the backend where JS isn't and isn't likely to be. The point I'm making is not to choose a 'winner', but to make the observation that: given the tight clustering of the two languages, there will be considerable overlap. People like me are asked to do both quite frequently. So I'd like a little more harmony to aid in my day-to-day. I have just as many python files as JS files open in my editor at this moment.
>
> There are several annoyances that if removed, would go a long way.
> 1. Object literals: JS: {a:1} vs Python: {'a':1}
> Making my fingers dance on ' or " is not a good use of keystrokes, and it decreases readability. However a counter argument here is what about when the a is a variable? JS allows o[a] as a way to assign to a property that is a variable. Python of course offers functions that do this, but for simple objects, this would very much be appreciated.
> The point here is this is
>
> 2. Join: JS: [].join(s) vs Python: s.join([])
> I've read the justification for putting join on a string, and it makes sense. But I think we should put it on the list too.
>
> 3. Allow C/C++/JS style comments: JS:[ //, /* ] vs Python #
> This one is pretty self-explanatory.
>
> Some might want even more harmony, but I don't know the repercussions of all of that. I think the above could be implemented without breaking anything. What I do know is that 85% of my friction would be removed if the above were implemented.
>
>
>
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/
>

From rosuav at gmail.com  Fri Aug 11 11:49:39 2017
From: rosuav at gmail.com (Chris Angelico)
Date: Sat, 12 Aug 2017 01:49:39 +1000
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: 
Message-ID: 

On Sat, Aug 12, 2017 at 1:35 AM, Jason H  wrote:
> Thanks for all the feedback so far, even if it's not the most enthusiastic response to the ideas.
>
> One thing I missed, and I don't know how I could have (total face-palm), is:
> 4. Other list methods, specifically: [].push(item) vs [].append()
>

Like [].join, this is simply adding a duplicate spelling for something
that already exists. That means that everyone who reads Python code
would have to know both forms. That's not a good use of programmer
time. And unlike [].join, it's purely a duplicate name, not giving you
even the benefit of writing something in either order. So I'm
definitely -1 on this.

Have you considered making JS more like Python instead of the other
way around? You can mess with core data types in JS, adding methods to
them. For example:

String.prototype.join = function(arr) {
    return arr.join(this);
}

var strings = ["Hello", "world"];
console.log(" ".join(strings));

That doesn't require any core language changes, and will cover most of
your issues. (You can simply choose to always use quoted strings for
JS keys, for instance.) You can't implement #comment but you can do
all the rest.

Try that for a while, and then see how the people who collaborate with
you like it. That's really the key.
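(And if you want those spellings on the Python side of your codebase
instead, a quick sketch - the JSList name here is just something I made
up - needs no language change either:

class JSList(list):
    # illustrative only: JS-style spellings layered on top of list
    def join(self, sep):
        return sep.join(map(str, self))
    push = list.append

strings = JSList(["Hello", "world"])
print(strings.join(" "))   # Hello world
strings.push("again")      # same as .append("again")

Same caveat applies, of course: everyone reading it now has to know both
forms.)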
ChrisA

From alberto at metapensiero.it  Fri Aug 11 13:04:42 2017
From: alberto at metapensiero.it (Alberto Berti)
Date: Fri, 11 Aug 2017 19:04:42 +0200
Subject: [Python-ideas] Towards harmony with JavaScript?
References: 
Message-ID: <87shgykpcl.fsf@ender.lizardnet>

>>>>> "Jason" == Jason H  writes:

Jason> While JS is strongly for web (Node.JS, Browsers) and Python has a weak
Jason> showing (Tornado, Flask), Python is very popular on everything else on
Jason> the backend where JS isn't and isn't likely to be. The point I'm
Jason> making is not to choose a 'winner', but to make the observation that:
Jason> given the tight clustering of the two languages, there will be
Jason> considerable overlap. People like me are asked to do both quite
Jason> frequently. So I'd like a little more harmony to aid in my
Jason> day-to-day. I have just as many python files as JS files open in my
Jason> editor at this moment.

I too do much work on both "sides"

Jason> There are several annoyances that if removed, would go a long way.
Jason> 1. Object literals: JS: {a:1} vs Python: {'a':1}
Jason> Making my fingers dance on ' or " is not a good use of keystrokes,
Jason> and it decreases readability. However a counter argument here is what
Jason> about when the a is a variable? JS allows o[a] as a way to assign to
Jason> a property that is a variable. Python of course offers functions that
Jason> do this, but for simple objects, this would very much be appreciated.
Jason> The point here is this is

Jason> 2. Join: JS: [].join(s) vs Python: s.join([])
Jason> I've read the justification for putting join on a string, and it makes sense. But I think we should put it on the list too.

Jason> 3. Allow C/C++/JS style comments: JS:[ //, /* ] vs Python #
Jason> This one is pretty self-explanatory.

Other friction points to me are (just to name a few):

* when you have to check if a variable contains a string, you have to
  check for both "typeof foo == 'string'" and "foo instanceof String"

* you cannot use negative indexes on Array

* when you want to know the length of a sized object you have to know
  (and to remember) how to ask it: Array has .length, newer Map and Set
  objects have .size

For the goal of reducing the friction (the mind switching when working
with both the languages) I have created a tool (
https://github.com/azazel75/metapensiero.pj ) which allows me to write
valid Python and translates this to nice JS while taking care of most of
these nuances. At the same time it doesn't raise any barrier between the
translated code and any other JS library around (and I use them a lot).

When I created it, I wasn't sure if the goal was worthy, but after
developing some large library with it I must say that I'm quite happy
using it and that I had positive feedback from other developers.

I suggest you take a look at it.

cheers,

Alberto

From rosuav at gmail.com  Fri Aug 11 13:37:08 2017
From: rosuav at gmail.com (Chris Angelico)
Date: Sat, 12 Aug 2017 03:37:08 +1000
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: <87shgykpcl.fsf@ender.lizardnet>
References: <87shgykpcl.fsf@ender.lizardnet>
Message-ID: 

On Sat, Aug 12, 2017 at 3:04 AM, Alberto Berti  wrote:
> For the goal of reducing the friction (the mind switching when working
> with both the languages) I have created a tool (
> https://github.com/azazel75/metapensiero.pj ) which allows me to write
> valid Python and translates this to nice JS while taking care of most of
> these nuances.
> At the same time it doesn't raise any barrier between the
> translated code and any other JS library around (and I use them a lot).

What do you do about all the places where the languages have
significantly different semantics? For instance, a Python integer can
store more values than a Python float (which is broadly compatible
with a JS Number), but in JS, bitwise operations restrict the value to
32-bit. And subscripting or iterating over a string containing astral
(non-BMP) characters will do different things. Or when you use
non-string keys in a dictionary (notably integers). Transpiling is an
extremely dangerous thing to do a partial job of.

ChrisA

From brenbarn at brenbarn.net  Fri Aug 11 14:11:03 2017
From: brenbarn at brenbarn.net (Brendan Barnwell)
Date: Fri, 11 Aug 2017 11:11:03 -0700
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: 
Message-ID: <598DF337.2020605@brenbarn.net>

On 2017-08-11 07:57, Jason H wrote:
> Before I don my firesuit, I'd like to say that I much prefer python
> and I rail on JS whenever I can. However these days it is quite
> common to be doing work in both Python and Javascript. Harmonizing
> the two would help JS developers pick up the language as well as
> people like me that are stuck working in JS as well.

In general I am instinctively opposed to any changes aimed at making
Python more like JavaScript, because I think that overall Python is a
much better designed language than JavaScript, and JavaScript has
numerous profound flaws, so almost anything that makes Python more like
JavaScript is likely to make it worse.

In particular, all of the changes you propose are very minor things
which amount to adding some duplicate or more convenient or
almost-the-same way to do something that can already be done. This kind
of accumulation of confusing alternatives is exactly the kind of thing
that makes JS suck. You have == vs ===, for vs for..in vs for..of,
optional semicolons, and on and on and on. This is because people did
not think about the right way to do things the first time in JS, and
they don't want to break backward compatibility, so they just keep
adding new features to paper over the deeper problems. Happily, Python
avoids the most damaging cases of this, because Python has far fewer
deep problems, and small problems aren't worth the clutter of having
multiple ways to do the same thing.

>1. Object literals: JS: {a:1} vs Python: {'a':1} Making my fingers dance
> on ' or " is not a good use of keystrokes, and it decreases
> readability. However a counter argument here is what about when the a
> is a variable? JS allows o[a] as a way to assign to a property that
> is a variable. Python of course offers functions that do this, but
> for simple objects, this would very much be appreciated. The point
> here is this is

Was your message truncated here? "The point here is this is" what? In
any case, the objection you've already raised is enough to kill this
proposal for me. Being able to use a variable for a key is a huge and
very real difference in functionality between Python and JS. Being able
to not type quotes is a small advantage in comparison to that. You can
already do dict(a=1, b=2) if you really want to.

> 2. Join: JS: [].join(s) vs Python: s.join([]) I've read the
> justification for putting join on a string, and it makes sense. But I
> think we should put it on the list too.

I agree it is confusing at first. Once you know it, you know it.
Also, adding it to list still wouldn't make it available for tuples,
dicts, or any other iterables. (JavaScript "avoided" this problem by not
providing any way to define your own iterables until 2015, so everyone
was stuck using plain arrays.) I do think a case could be made for
designing a more comprehensive iterable class hierarchy that would
provide things like this, but just adding a single method to a single
type isn't worth it.

> 3. Allow C/C++/JS style comments: JS:[ //, /* ] vs Python # This one
> is pretty self-explanatory.

Again, the gain is tiny. Python is already quite a readable language. I
don't see "make it easily writable for people who don't know Python
without looking up how to write comments" as a useful goal. As with
.join(), once you learn that Python uses #, you know it, and it's not
really a problem. Also, as someone else mentioned, // is a valid
operator in Python, making its use as a comment marker potentially
ambiguous.

--
Brendan Barnwell
"Do not follow where the path may lead. Go, instead, where there is no
path, and leave a trail."
   --author unknown

From alberto at metapensiero.it  Fri Aug 11 15:13:33 2017
From: alberto at metapensiero.it (Alberto Berti)
Date: Fri, 11 Aug 2017 21:13:33 +0200
Subject: [Python-ideas] Towards harmony with JavaScript?
References: <87shgykpcl.fsf@ender.lizardnet>
Message-ID: <87k229lxya.fsf@ender.lizardnet>

>>>>> "Chris" == Chris Angelico  writes:

Chris> On Sat, Aug 12, 2017 at 3:04 AM, Alberto Berti  wrote:
>> For the goal of reducing the friction (the mind switching when working
>> with both the languages) I have created a tool (
>> https://github.com/azazel75/metapensiero.pj ) which allows me to write
>> valid Python and translates this to nice JS while taking care of most of
>> these nuances. At the same time it doesn't raise any barrier between the
>> translated code and any other JS library around (and I use them a lot).

Chris> What do you do about all the places where the languages have
Chris> significantly different semantics? For instance, a Python integer can
Chris> store more values than a Python float (which is broadly compatible
Chris> with a JS Number), but in JS, bitwise operations restrict the value to
Chris> 32-bit.

As of now, I do nothing. As I said, the goal of the tool is not to
shield you from JS, for this reason it's not meant for beginners (in
either JS or Python). You always manipulate JS objects, but it allows
you to be naive about all that plethora of JS idiosyncrasies (from a
Python POV, at least) that you have to think about when you frequently
switch from python to js.

Because such a list of idiosyncrasies may be subjective, I hope to add
to it a kind of "layered" translation where the user can add their own
set of rules and/or runtime checking or conversion. I've helped port
macropy ( https://github.com/azazel75/macropy ) especially for the
purpose of simplifying AST manipulation, but it's not done yet.

Chris> And subscripting or iterating over a string containing astral
Chris> (non-BMP) characters will do different things.

This is strange... I tested it with javascripthon's embedded interpreter
(which is ES5 compatible) and it indeed shows that for example the
string '?????' isn't correctly (in the python sense) iterated over, but
testing it on an ES6 compatible interpreter (more or less latest V8)
does the right thing. Something has changed between the two.

Chris> Or when you use
Chris> non-string keys in a dictionary (notably integers).
you should use a Map in such a case, as the tool doesn't reimplement
most of the Python data API. That would mean building some wrapping type
that might simplify converting existing Python code to run on JS, but
that would probably mean a more difficult interfacing with JS third
party libraries (React, Angular, name yours here) that I don't want to
reimplement... See here (
https://github.com/azazel75/metapensiero.pj/issues/19 ) for a brief
discussion on this matter.

Chris> Transpiling is an
Chris> extremely dangerous thing to do a partial job of.

Even breathing can be dangerous in some environments...
Bridging two different concepts together is always a partial job...

Again, the use case may seem minimal to you but I can assure that it
helps in the day-to-day work with JS.

From rosuav at gmail.com  Fri Aug 11 15:19:37 2017
From: rosuav at gmail.com (Chris Angelico)
Date: Sat, 12 Aug 2017 05:19:37 +1000
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: <87k229lxya.fsf@ender.lizardnet>
References: <87shgykpcl.fsf@ender.lizardnet>
 <87k229lxya.fsf@ender.lizardnet>
Message-ID: 

On Sat, Aug 12, 2017 at 5:13 AM, Alberto Berti  wrote:
> Chris> What do you do about all the places where the languages have
> Chris> significantly different semantics? For instance, a Python integer can
> Chris> store more values than a Python float (which is broadly compatible
> Chris> with a JS Number), but in JS, bitwise operations restrict the value to
> Chris> 32-bit.
>
> As of now, I do nothing.
> As I said, the goal of the tool is not to
> shield you from JS, for this reason it's not meant for beginners (in
> either JS or Python). You always manipulate JS objects, but it allows you
> to be naive about all that plethora of JS idiosyncrasies (from a Python
> POV, at least) that you have to think about when you frequently switch from
> python to js.

Speaking as someone whose day job is teaching Python and JavaScript, I
don't like the idea of this kind of thing. You're bringing (some)
Python syntax, but sticking to JS semantics. That means your source
code looks like Python, but runs like JS. You can't afford to ever run
it through a Python interpreter (the semantics will be wrong).

There's already plenty of confusion in the world. I don't want to add
more. It would be far better to base your language on JS syntax, since
it's using JS semantics; just add in a handful of Python features that
you really miss.

ChrisA

From jsbueno at python.org.br  Fri Aug 11 15:42:58 2017
From: jsbueno at python.org.br (Joao S. O. Bueno)
Date: Fri, 11 Aug 2017 16:42:58 -0300
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: <87shgykpcl.fsf@ender.lizardnet>
 <87k229lxya.fsf@ender.lizardnet>
Message-ID: 

On 11 August 2017 at 16:19, Chris Angelico  wrote:

> On Sat, Aug 12, 2017 at 5:13 AM, Alberto Berti 
> wrote:
> > Chris> What do you do about all the places where the languages have
> > Chris> significantly different semantics? For instance, a Python
> integer can
> > Chris> store more values than a Python float (which is broadly
> compatible
> > Chris> with a JS Number), but in JS, bitwise operations restrict the
> value to
> > Chris> 32-bit.
> >
> > As of now, I do nothing. As I said, the goal of the tool is not to
> > shield you from JS, for this reason it's not meant for beginners (in
> > either JS or Python). You always manipulate JS objects, but it allows
> > you to be naive about all that plethora of JS idiosyncrasies (from a
> > Python POV, at least) that you have to think about when you frequently
> > switch from python to js.
> >
> > Chris> Transpiling is an
> > Chris> extremely dangerous thing to do a partial job of.
> >
> > Even breathing can be dangerous in some environments...
> > Bridging two different concepts together is always a partial job...
> >
> > Again, the use case may seem minimal to you but I can assure that it
> > helps in the day-to-day work with JS.
>
> Speaking as someone whose day job is teaching Python and JavaScript, I
> don't like the idea of this kind of thing. You're bringing (some)
> Python syntax, but sticking to JS semantics. That means your source
> code looks like Python, but runs like JS. You can't afford to ever run
> it through a Python interpreter (the semantics will be wrong).
>
> There's already plenty of confusion in the world. I don't want to add
> more. It would be far better to base your language on JS syntax, since
> it's using JS semantics; just add in a handful of Python features that
> you really miss.
>

Well, I hope you both had at least skimmed over "brython" - it started a
couple of years ago
with somewhat the same "won't do full Python" purpose - but nowadays they
have a
very conformant implementation of Python3 that is transpiled client-side
into working javascript.

(It does use some JS third-party library to be able to implement Python
integers, for example -
but I think one can use a "pragma" like statement to use "native" numbers
for performance)

http://brython.info

ChrisA
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From rosuav at gmail.com  Fri Aug 11 15:47:05 2017
From: rosuav at gmail.com (Chris Angelico)
Date: Sat, 12 Aug 2017 05:47:05 +1000
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: <87shgykpcl.fsf@ender.lizardnet>
 <87k229lxya.fsf@ender.lizardnet>
Message-ID: 

On Sat, Aug 12, 2017 at 5:42 AM, Joao S. O. Bueno  wrote:
>
>
> Well, I hope you both had at least skimmed over "brython" - it started a
> couple of years ago
> with somewhat the same "won't do full Python" purpose - but nowadays they
> have a
> very conformant implementation of Python3 that is transpiled client-side
> into working javascript.
>
> (It does use some JS third-party library to be able to implement Python
> integers, for example -
> but I think one can use a "pragma" like statement to use "native" numbers
> for performance)
>
> http://brython.info

I'm aware of Brython; its purpose is not to let you use JS semantics
with Py syntax, but to let you run Python code in a web browser, with
full Python semantics. You'll also find PyPyJS, which does a similar
job - it uses JS code as a form of machine code, JIT compiling to JS.

Taking this off the list as it's no longer on topic.

ChrisA

From rosuav at gmail.com  Fri Aug 11 15:47:51 2017
From: rosuav at gmail.com (Chris Angelico)
Date: Sat, 12 Aug 2017 05:47:51 +1000
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: <87shgykpcl.fsf@ender.lizardnet>
 <87k229lxya.fsf@ender.lizardnet>
Message-ID: 

On Sat, Aug 12, 2017 at 5:47 AM, Chris Angelico  wrote:
>
> Taking this off the list as it's no longer on topic.

... at least, I *thought* I was taking it off list. Between me and
Gmail, some thoughts got crossed. Sorry!

ChrisA

From chris.barker at noaa.gov  Fri Aug 11 15:55:44 2017
From: chris.barker at noaa.gov (Chris Barker)
Date: Fri, 11 Aug 2017 12:55:44 -0700
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 <20170810134232.GE7395@ando.pearwood.info>
Message-ID: 

[...]

> result = []
> for x in sequence:
>     if condition:
>         result.append(expression)
>     else:
>         break
>
> which could be written
>
> [expression for x in sequence if condition break]
>
> It's what I thought too. Adding a `while` clause here just overly
> complicates the understanding of the comprehension. The `break` keyword is
> already easily understandable and helps to map the comprehension with the
> plain for-loop (I like this mapping for its reverse counterpart, as I often
> start with plain for-loops to rewrite them later to comprehensions when it
> makes sense).
>

having the "if condition" there seems confusing to me, particularly if
you want an if condition as a filter as well:

[expression for x in sequence if condition1 if condition2 break]

which makes me want:

[expression for x in sequence if condition1 breakif condition2]

adding another keyword is a pretty big deal though! would it be possible
to add a keyword ("breakif" in this case) that was ONLY legal in
comprehensions?

Though I still don't think using "while" would really introduce that much
confusion -- sure, it doesn't introduce a new loop, but, as someone
pointed out earlier in this thread, it really is only changing from a
"while do" to a "do while" construct -- so it means pretty much the same
thing.

I agree that scanning a code base to see if there really are many loops
in practice that could use this construct would be a good way to see if
there is any point.

And it would also be interesting to do a survey of "random" folks as to
how they would interpret such a construct -- it's pretty hard for a small
group to know what is and isn't "confusing".

-CHB

--

Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From markusmeskanen at gmail.com  Fri Aug 11 16:08:37 2017
From: markusmeskanen at gmail.com (Markus Meskanen)
Date: Fri, 11 Aug 2017 23:08:37 +0300
Subject: [Python-ideas] Generator syntax hooks?
In-Reply-To: 
References: <82672ca6-315a-d054-f039-d5c6c7c630b3@gmail.com>
 <20170810134232.GE7395@ando.pearwood.info>
Message-ID: 

Though I still don't think using "while" would really introduce that much
confusion -- sure, it doesn't introduce a new loop, but, as someone
pointed out earlier in this thread, it really is only changing from a
"while do" to a "do while" construct -- so it means pretty much the same
thing.

+1 for the "while" from me too, I don't think most people would find it
confusing (supposing they don't find current [x for x in foo if ...]
confusing either), and introducing a break there is just more of a mess.
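(FWIW, the closest spelling today is itertools.takewhile, which already
gives the proposed cut-off behaviour, just less prettily:

    from itertools import takewhile

    seq = [1, 3, 7, 2]
    result = [x * x for x in takewhile(lambda x: x < 5, seq)]
    # -> [1, 9]; iteration stops at 7, so the trailing 2 is never seen

so the proposal is really about nicer syntax, not new power.)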
To those who say that it might get ugly if you do something like:

[x for y in foo for x in y while x != y if x + y < 100]

Even this still isn't unbearable, and once it gets that hard, maybe you
should consider something else anyway.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From chris.barker at noaa.gov  Fri Aug 11 16:10:34 2017
From: chris.barker at noaa.gov (Chris Barker)
Date: Fri, 11 Aug 2017 13:10:34 -0700
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: <87shgykpcl.fsf@ender.lizardnet>
 <87k229lxya.fsf@ender.lizardnet>
Message-ID: 

>
> Taking this off the list as it's no longer on topic.
>

not totally -- I'm going to add my thoughts:

1) If you want a smoother transition between server-side Python and
in-browser code, maybe you're better off using one of the "python in the
browser" solutions -- there are at least a few viable ones.

2) A JavaScript "object" is quite a different beast than a Python dict,
despite similar syntax for a literal. Making the literals even more
similar would simply add confusion. A JS object is a bit more like a
types.SimpleNamespace in Python, actually.

Making Python look a bit more like JS is NOT a good goal!

-CHB

--

Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From alberto at metapensiero.it  Fri Aug 11 16:31:50 2017
From: alberto at metapensiero.it (Alberto Berti)
Date: Fri, 11 Aug 2017 22:31:50 +0200
Subject: [Python-ideas] Towards harmony with JavaScript?
References: <87shgykpcl.fsf@ender.lizardnet>
 <87k229lxya.fsf@ender.lizardnet>
Message-ID: <87fucxlubt.fsf@ender.lizardnet>

>>>>> "Chris" == Chris Angelico  writes:

Chris> Speaking as someone whose day job is teaching Python and
Chris> JavaScript,
From rosuav at gmail.com Fri Aug 11 16:58:21 2017 From: rosuav at gmail.com (Chris Angelico) Date: Sat, 12 Aug 2017 06:58:21 +1000 Subject: [Python-ideas] Towards harmony with JavaScript? In-Reply-To: <87fucxlubt.fsf@ender.lizardnet> References: <87shgykpcl.fsf@ender.lizardnet> <87k229lxya.fsf@ender.lizardnet> <87fucxlubt.fsf@ender.lizardnet> Message-ID: On Sat, Aug 12, 2017 at 6:31 AM, Alberto Berti wrote: > It's not really so confusing, most code I wrote with it it's perfectly > understandable Python code. For me, one thing is the language, one other > thing are the libraries or the builtin classes it's usually shipped > with. > > The tool reads valid Python and writes valid ES6 JavaScript. As the > documentation states, it allows you to retain most of Python language > semantics (like for example you can have a working > try...except...finally statement, instead of what vanilla JS gives you) > and some of the library semantics. nothing more, nothing less. Hold on. Make up your mind: > As of now, I do nothing. As I said, the goal of the tool is not to > shield you from JS, for this reason it's not meant for beginners (in > both JS or Python). You always manipulate JS objects, but allows you to > to be naive on all that plethora of JS idiosyncrasies (from a Python pow > at least) that you have to think about when you frequently switch from > python to js. Do you "retain most of Python language semantics", or do you "always manipulate JS objects"? As shown in a previous post, there are some subtle and very dangerous semantic differences between the languages. You can't have it both ways. ChrisA From carl.input at gmail.com Fri Aug 11 18:31:27 2017 From: carl.input at gmail.com (Carl Smith) Date: Fri, 11 Aug 2017 23:31:27 +0100 Subject: [Python-ideas] Towards harmony with JavaScript? In-Reply-To: References: <87shgykpcl.fsf@ender.lizardnet> <87k229lxya.fsf@ender.lizardnet> <87fucxlubt.fsf@ender.lizardnet> Message-ID: Python is not a good fit for the browser, in part, because of the syntax. JavaScript has issues, but its syntax is better suited to creating GUIs in the browser. For example, in browsers everything revolves around a single threaded event loop, so you have a lot of callbacks and event handlers, which makes function expressions really useful, but Python doesn't have expressions that contain blocks, because of significant indentation. As a result, ordinary JS, like this... $(function(){ $("spam").click(function(){ alert("spam clicked") }) }); ...ends up looking like this... def on_ready(): def click_handler(): alert("spam clicked") jQuery("spam").click(click_handler) jQuery(on_ready) JS semantics means JS libraries, which have APIs that assume JS syntax. Python library developers make heavy use of language specific features to define elegant, Pythonic APIs, which is a big part of what makes the language so nice to use. -- Carl Smith carl.input at gmail.com On 11 August 2017 at 21:58, Chris Angelico wrote: > On Sat, Aug 12, 2017 at 6:31 AM, Alberto Berti > wrote: > > It's not really so confusing, most code I wrote with it it's perfectly > > understandable Python code. For me, one thing is the language, one other > > thing are the libraries or the builtin classes it's usually shipped > > with. > > > > The tool reads valid Python and writes valid ES6 JavaScript. 
> > documentation states, it allows you to retain most of Python language
> > semantics (like for example you can have a working
> > try...except...finally statement, instead of what vanilla JS gives you)
> > and some of the library semantics. Nothing more, nothing less.
>
> Hold on. Make up your mind:
>
> > As of now, I do nothing. As I said, the goal of the tool is not to
> > shield you from JS, for this reason it's not meant for beginners (in
> > either JS or Python). You always manipulate JS objects, but it allows
> > you to be naive about all that plethora of JS idiosyncrasies (from a
> > Python POV, at least) that you have to think about when you frequently
> > switch from python to js.
>
> Do you "retain most of Python language semantics", or do you "always
> manipulate JS objects"? As shown in a previous post, there are some
> subtle and very dangerous semantic differences between the languages.
> You can't have it both ways.
>
> ChrisA
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From yselivanov.ml at gmail.com  Fri Aug 11 18:37:47 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Fri, 11 Aug 2017 18:37:47 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
Message-ID: 

Hi,

This is a new PEP to implement Execution Contexts in Python.

The PEP is in-flight to python.org, and in the meanwhile can
be read on GitHub:

https://github.com/python/peps/blob/master/pep-0550.rst

(it contains a few diagrams and charts, so please read it there.)

Thank you!
Yury


PEP: 550
Title: Execution Context
Version: $Revision$
Last-Modified: $Date$
Author: Yury Selivanov 
Status: Draft
Type: Standards Track
Content-Type: text/x-rst
Created: 11-Aug-2017
Python-Version: 3.7
Post-History: 11-Aug-2017


Abstract
========

This PEP proposes a new mechanism to manage execution state--the
logical environment in which a function, a thread, a generator,
or a coroutine executes.

A few examples of where reliable state storage is required:

* Context managers like decimal contexts, ``numpy.errstate``,
  and ``warnings.catch_warnings``;

* Storing request-related data such as security tokens and request
  data in web applications;

* Profiling, tracing, and logging in complex and large code bases.

The usual solution for storing state is to use a Thread-local
Storage (TLS), implemented in the standard library as
``threading.local()``. Unfortunately, TLS does not work for
isolating the state of generators or asynchronous code because such
code shares a single thread.


Rationale
=========

Traditionally, Thread-local Storage (TLS) is used for storing the
state. However, the major flaw of using TLS is that it works only
for multi-threaded code. It is not possible to reliably contain
the state within a generator or a coroutine. For example, consider
the following generator::

    def calculate(precision, ...):
        with decimal.localcontext() as ctx:
            # Set the precision for decimal calculations
            # inside this block
            ctx.prec = precision

            yield calculate_something()
            yield calculate_something_else()

Decimal context is using a TLS to store the state, and because
TLS is not aware of generators, the state can leak.
The above code will not work correctly if a user iterates over
the ``calculate()`` generator with different precisions in
parallel::

    g1 = calculate(100)
    g2 = calculate(50)

    items = list(zip(g1, g2))

    # items[0] will be a tuple of:
    #   first value from g1 calculated with 100 precision,
    #   first value from g2 calculated with 50 precision.
    #
    # items[1] will be a tuple of:
    #   second value from g1 calculated with 50 precision,
    #   second value from g2 calculated with 50 precision.

An even scarier example would be using decimals to represent money
in an async/await application: decimal calculations can suddenly
lose precision in the middle of processing a request. Currently,
bugs like this are extremely hard to find and fix.

Another common need for web applications is to have access to the
current request object, or security context, or, simply, the request
URL for logging or submitting performance tracing data::

    async def handle_http_request(request):
        context.current_http_request = request

        await ...
        # Invoke your framework code, render templates,
        # make DB queries, etc, and use the global
        # 'current_http_request' in that code.

        # This isn't currently possible to do reliably
        # in asyncio out of the box.

These examples are just a few out of many where a reliable way to
store context data is absolutely needed.

The inability to use TLS for asynchronous code has led to a
proliferation of ad-hoc solutions, limited to being supported only
by code that was explicitly enabled to work with them. The current
status quo is that any library, including the standard library,
that uses a TLS, will likely not work as expected in asynchronous
code or with generators (see [3]_ as an example issue.)

Some languages that have coroutines or generators recommend
manually passing a ``context`` object to every function, see [1]_
describing the pattern for Go. This approach, however, has limited
use for Python, where we have a huge ecosystem that was built to
work with a TLS-like context. Moreover, passing the context
explicitly does not work at all for libraries like ``decimal`` or
``numpy``, which use operator overloading.

The .NET runtime, which has support for async/await, has a generic
solution to this problem, called ``ExecutionContext`` (see [2]_).
On the surface, working with it is very similar to working with a
TLS, but the former explicitly supports asynchronous code.


Goals
=====

The goal of this PEP is to provide a more reliable alternative to
``threading.local()``. It should be explicitly designed to work
with the Python execution model, equally supporting threads,
generators, and coroutines.

An acceptable solution for Python should meet the following
requirements:

* Transparent support for code executing in threads, coroutines,
  and generators with an easy-to-use API.

* Negligible impact on the performance of the existing code or the
  code that will be using the new mechanism.

* Fast C API for packages like ``decimal`` and ``numpy``.

Explicit is still better than implicit, hence the new APIs should
only be used when there is no option to pass the state explicitly.
With this PEP implemented, it should be possible to update a
context manager like the one below::

    _local = threading.local()

    @contextmanager
    def context(x):
        old_x = getattr(_local, 'x', None)
        _local.x = x
        try:
            yield
        finally:
            _local.x = old_x

to a more robust version that can be reliably used in generators
and async/await code, with a simple transformation::

    @contextmanager
    def context(x):
        old_x = get_execution_context_item('x')
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            set_execution_context_item('x', old_x)


Specification
=============

This proposal introduces a new concept called Execution Context
(EC), along with a set of Python APIs and C APIs to interact
with it.

EC is implemented using an immutable mapping. Every modification
of the mapping produces a new copy of it. To illustrate what this
means, let's compare it to how we work with tuples in Python::

    a0 = ()
    a1 = a0 + (1,)
    a2 = a1 + (2,)

    # a0 is an empty tuple
    # a1 is (1,)
    # a2 is (1, 2)

Manipulating an EC object would be similar::

    a0 = EC()
    a1 = a0.set('foo', 'bar')
    a2 = a1.set('spam', 'ham')

    # a0 is an empty mapping
    # a1 is {'foo': 'bar'}
    # a2 is {'foo': 'bar', 'spam': 'ham'}

In CPython, every thread that can execute Python code has a
corresponding ``PyThreadState`` object. It encapsulates important
runtime information like a pointer to the current frame, and is
used extensively by the ceval loop. We add a new field to
``PyThreadState``, called ``exec_context``, which points to the
current EC object.

We also introduce a set of APIs to work with Execution Context.
In this section we will only cover two functions that are needed to
explain how Execution Context works. See the full list of new APIs
in the `New APIs`_ section.

* ``sys.get_execution_context_item(key, default=None)``: lookup
  ``key`` in the EC of the executing thread. If not found,
  return ``default``.

* ``sys.set_execution_context_item(key, value)``: get the current
  EC of the executing thread. Add a ``key``/``value`` item to it,
  which will produce a new EC object. Set the new object as the
  current one for the executing thread. In pseudo-code::

      tstate = PyThreadState_GET()
      ec = tstate.exec_context
      ec2 = ec.set(key, value)
      tstate.exec_context = ec2

Note that some important implementation details and optimizations
are omitted here, and will be covered in later sections of this PEP.

Now let's see how Execution Contexts work with regular
multi-threaded code, generators, and coroutines.


Regular & Multithreaded Code
----------------------------

For regular Python code, EC behaves just like a thread-local. Any
modification of the EC object produces a new one, which is
immediately set as the current one for the thread state.

.. figure:: pep-0550/functions.png
   :align: center
   :width: 90%

   Figure 1.  Execution Context flow in a thread.

As Figure 1 illustrates, if a function calls
``set_execution_context_item()``, the modification of the
execution context will be visible to all subsequent calls and to
the caller::

    def set_foo():
        set_execution_context_item('foo', 'spam')

    set_execution_context_item('foo', 'bar')
    print(get_execution_context_item('foo'))

    set_foo()
    print(get_execution_context_item('foo'))

    # will print:
    #   bar
    #   spam


Coroutines
----------

Python :pep:`492` coroutines are used to implement cooperative
multitasking. For a Python end-user they are similar to threads,
especially when it comes to sharing resources or modifying
the global state.

An event loop is needed to schedule coroutines. Coroutines that
are explicitly scheduled by the user are usually called Tasks.
When a coroutine is scheduled, it can schedule other coroutines using
an ``await`` expression. In the async/await world, awaiting a
coroutine can be viewed as a different calling convention: Tasks are
similar to threads, and awaiting on coroutines within a Task is
similar to calling functions within a thread.

By drawing a parallel between regular multithreaded code and
async/await, it becomes apparent that any modification of the
execution context within one Task should be visible to all coroutines
scheduled within it. Any execution context modifications, however,
must not be visible to other Tasks executing within the same thread.

To achieve this, a small set of modifications to the coroutine object
is needed:

* When a coroutine object is instantiated, it saves a reference to
  the current execution context object to its
  ``cr_execution_context`` attribute.

* Coroutine's ``.send()`` and ``.throw()`` methods are modified as
  follows (in pseudo-C)::

    if coro->cr_isolated_execution_context:
        # Save a reference to the current execution context
        old_context = tstate->execution_context

        # Set our saved execution context as the current
        # for the current thread.
        tstate->execution_context = coro->cr_execution_context

        try:
            # Perform the actual `Coroutine.send()` or
            # `Coroutine.throw()` call.
            return coro->send(...)
        finally:
            # Save a reference to the updated execution_context.
            # We will need it later, when `.send()` or `.throw()`
            # are called again.
            coro->cr_execution_context = tstate->execution_context

            # Restore thread's execution context to what it was before
            # invoking this coroutine.
            tstate->execution_context = old_context
    else:
        # Perform the actual `Coroutine.send()` or
        # `Coroutine.throw()` call.
        return coro->send(...)

* ``cr_isolated_execution_context`` is a new attribute on coroutine
  objects. Set to ``True`` by default, it makes any execution context
  modifications performed by the coroutine stay visible only to that
  coroutine.

  When the Python interpreter sees an ``await`` instruction, it flips
  ``cr_isolated_execution_context`` to ``False`` for the coroutine
  that is about to be awaited. This makes any changes to the
  execution context made by nested coroutine calls within a Task
  visible throughout the Task.

  Because the top-level coroutine (Task) cannot be scheduled with
  ``await`` (in asyncio you need to call ``loop.create_task()`` or
  ``asyncio.ensure_future()`` to schedule a Task), all execution
  context modifications are guaranteed to stay within the Task.

* We always work with ``tstate->exec_context``. We use
  ``coro->cr_execution_context`` only to store coroutine's execution
  context when it is not executing.

Figure 2 below illustrates how execution context mutations work with
coroutines.

.. figure:: pep-0550/coroutines.png
   :align: center
   :width: 90%

   Figure 2.  Execution Context flow in coroutines.

In the above diagram:

* When "coro1" is created, it saves a reference to the current
  execution context "2".

* If it makes any change to the context, it will have its own
  execution context branch "2.1".

* When it awaits on "coro2", any subsequent changes it does to
  the execution context are visible to "coro1", but not outside
  of it.
In code::

    async def inner_foo():
        print('inner_foo:', get_execution_context_item('key'))
        set_execution_context_item('key', 2)

    async def foo():
        print('foo:', get_execution_context_item('key'))

        set_execution_context_item('key', 1)
        await inner_foo()

        print('foo:', get_execution_context_item('key'))

    set_execution_context_item('key', 'spam')
    print('main:', get_execution_context_item('key'))

    asyncio.get_event_loop().run_until_complete(foo())
    print('main:', get_execution_context_item('key'))

which will output::

    main: spam
    foo: spam
    inner_foo: 1
    foo: 2
    main: spam

Generator-based coroutines (generators decorated with
``types.coroutine`` or ``asyncio.coroutine``) behave exactly like
native coroutines with regard to execution context management:
their ``yield from`` expression is semantically equivalent to
``await``.


Generators
----------

Generators in Python, while similar to coroutines, are used in a
fundamentally different way. They are producers of data, and they
use the ``yield`` expression to suspend/resume their execution.

A crucial difference between ``await coro`` and ``yield value`` is
that the former expression guarantees that the ``coro`` will be
executed to the end, while the latter produces ``value`` and
suspends the generator until it gets iterated again.

Generators share 99% of their implementation with coroutines, and
thus have similar new attributes ``gi_execution_context`` and
``gi_isolated_execution_context``. Similar to coroutines, generators
save a reference to the current execution context when they are
instantiated. They have the same implementation of ``.send()`` and
``.throw()`` methods.

The only difference is that ``gi_isolated_execution_context`` is
always set to ``True``, and is never modified by the interpreter.

The ``yield from o`` expression in regular generators that are not
decorated with ``types.coroutine`` is semantically equivalent to
``for v in o: yield v``.

.. figure:: pep-0550/generators.png
   :align: center
   :width: 90%

   Figure 3.  Execution Context flow in a generator.

In the above diagram:

* When "gen1" is created, it saves a reference to the current
  execution context "2".

* If it makes any change to the context, it will have its own
  execution context branch "2.1".

* When "gen2" is created, it saves a reference to the current
  execution context for it -- "2.1".

* Any subsequent execution context update in "gen2" will only
  be visible to "gen2".

* Likewise, any context changes that "gen1" makes after it created
  "gen2" will not be visible to "gen2".

In code::

    def inner_foo():
        for i in range(3):
            print('inner_foo:', get_execution_context_item('key'))
            set_execution_context_item('key', i)
            yield i

    def foo():
        set_execution_context_item('key', 'spam')
        print('foo:', get_execution_context_item('key'))

        inner = inner_foo()

        while True:
            val = next(inner, None)
            if val is None:
                break
            yield val
            print('foo:', get_execution_context_item('key'))

    set_execution_context_item('key', 'ham')
    print('main:', get_execution_context_item('key'))

    list(foo())

    print('main:', get_execution_context_item('key'))

which will output::

    main: ham
    foo: spam
    inner_foo: spam
    foo: spam
    inner_foo: 0
    foo: spam
    inner_foo: 1
    foo: spam
    main: ham

As we see, any modification of the execution context in a generator
is visible only to the generator itself.

There is one use-case where it is desired for generators to affect
the surrounding execution context: the ``contextlib.contextmanager``
decorator.
To make the following work::

    @contextmanager
    def context(x):
        old_x = get_execution_context_item('x')
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            set_execution_context_item('x', old_x)

we modified ``contextmanager`` to flip the
``gi_isolated_execution_context`` flag to ``False`` on its generator.


Greenlets
---------

Greenlet is an alternative implementation of cooperative
scheduling for Python. Although the greenlet package is not part of
CPython, popular frameworks like gevent rely on it, and it is
important that greenlet can be modified to support execution
contexts.

In a nutshell, greenlet design is very similar to the design of
generators. The main difference is that for generators, the stack
is managed by the Python interpreter. Greenlet works outside of the
Python interpreter, and manually saves some ``PyThreadState``
fields and pushes/pops the C-stack. Since Execution Context is
implemented on top of ``PyThreadState``, it's easy to add
transparent support for it to greenlet.


New APIs
========

Even though this PEP adds a number of new APIs, please keep in mind
that most Python users will likely only ever use two of them:
``sys.get_execution_context_item()`` and
``sys.set_execution_context_item()``.


Python
------

1. ``sys.get_execution_context_item(key, default=None)``: lookup
   ``key`` for the current Execution Context. If not found,
   return ``default``.

2. ``sys.set_execution_context_item(key, value)``: set
   ``key``/``value`` item for the current Execution Context.
   If ``value`` is ``None``, the item will be removed.

3. ``sys.get_execution_context()``: return the current Execution
   Context object: ``sys.ExecutionContext``.

4. ``sys.set_execution_context(ec)``: set the passed
   ``sys.ExecutionContext`` instance as the current one for the
   current thread.

5. ``sys.ExecutionContext`` object.

   Implementation detail: ``sys.ExecutionContext`` wraps a low-level
   ``PyExecContextData`` object. ``sys.ExecutionContext`` has a
   mutable mapping API, abstracting away the real immutable
   ``PyExecContextData``.

   * ``ExecutionContext()``: construct a new, empty, execution
     context.

   * ``ec.run(func, *args)`` method: run ``func(*args)`` in the
     ``ec`` execution context.

   * ``ec[key]``: lookup ``key`` in ``ec`` context.

   * ``ec[key] = value``: assign ``key``/``value`` item to the
     ``ec``.

   * ``ec.get()``, ``ec.items()``, ``ec.values()``, ``ec.keys()``,
     and ``ec.copy()`` are similar to that of ``dict`` object.


C API
-----

The C API is different from the Python one because it operates
directly on the low-level immutable ``PyExecContextData`` object.

1. New ``PyThreadState->exec_context`` field, pointing to a
   ``PyExecContextData`` object.

2. ``PyThreadState_SetExecContextItem`` and
   ``PyThreadState_GetExecContextItem`` similar to
   ``sys.set_execution_context_item()`` and
   ``sys.get_execution_context_item()``.

3. ``PyThreadState_GetExecContext``: similar to
   ``sys.get_execution_context()``. Always returns a
   ``PyExecContextData`` object. If ``PyThreadState->exec_context``
   is ``NULL``, a new and empty one will be created and assigned
   to ``PyThreadState->exec_context``.

4. ``PyThreadState_SetExecContext``: similar to
   ``sys.set_execution_context()``.

5. ``PyExecContext_New``: create a new empty ``PyExecContextData``
   object.

6. ``PyExecContext_SetItem`` and ``PyExecContext_GetItem``.

The exact layout of ``PyExecContextData`` is private, which allows
switching it to a different implementation later. More on that
in the `Implementation Details`_ section.
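To give a feel for the new Python APIs, here is an illustrative
sketch that uses only the functions and methods defined above, with
the semantics specified in this PEP::

    import sys

    # Thread-local-like usage:
    sys.set_execution_context_item('request_id', 42)
    print(sys.get_execution_context_item('request_id'))    # 42

    # Running a function within an explicitly constructed EC:
    ec = sys.ExecutionContext()    # new, empty execution context
    ec['request_id'] = 100

    def handler():
        print(sys.get_execution_context_item('request_id'))

    ec.run(handler)                # prints: 100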
Modifications in Standard Library
=================================

* ``contextlib.contextmanager`` was updated to flip the new
  ``gi_isolated_execution_context`` attribute on the generator.

* ``asyncio.events.Handle`` object now captures the current
  execution context when it is created, and uses the saved
  execution context to run the callback (with
  ``ExecutionContext.run()`` method.) This makes
  ``loop.call_soon()`` run callbacks in the execution context they
  were scheduled in.

No modifications in ``asyncio.Task`` or ``asyncio.Future`` were
necessary.

Some standard library modules like ``warnings`` and ``decimal``
can be updated to use new execution contexts. This will be
considered in separate issues if this PEP is accepted.


Backwards Compatibility
=======================

This proposal preserves 100% backwards compatibility.


Performance
===========

Implementation Details
----------------------

The new ``PyExecContextData`` object is wrapping a ``dict`` object.
Any modification requires creating a shallow copy of the dict.

While working on the reference implementation of this PEP, we were
able to optimize the ``dict.copy()`` operation **5.5x**, see [4]_ for
details.

.. figure:: pep-0550/dict_copy.png
   :align: center
   :width: 100%

   Figure 4.

Figure 4 shows that the performance of an immutable dict implemented
with shallow copying is expectedly O(n) for the ``set()`` operation.
However, this is tolerable until the dict has more than 100 items
(1 ``set()`` takes about a microsecond.)

Judging by the number of modules that need EC in the Standard
Library, it is likely that real-world Python applications will use
significantly less than 100 execution context variables.

The important point is that the cost of accessing a key in
Execution Context is always O(1).

If the ``set()`` operation performance is a major concern, we discuss
alternative approaches that have O(1) or near-O(1) ``set()``
performance in `Alternative Immutable Dict Implementation`_,
`Faster C API`_, and `Copy-on-write Execution Context`_ sections.


Generators and Coroutines
-------------------------

Using a microbenchmark for generators and coroutines from :pep:`492`
([12]_), it was possible to observe 0.5 to 1% performance degradation.

asyncio echoserver microbenchmarks from the uvloop project [13]_
showed 1-1.5% performance degradation for asyncio code.

asyncpg benchmarks [14]_, which execute more code and are closer to a
real-world application, did not exhibit any noticeable performance
change.


Overall Performance Impact
--------------------------

The total number of changed lines in the ceval loop is 2 -- in the
``YIELD_FROM`` opcode implementation. Only the performance of
generators and coroutines can be affected by the proposal.

This was confirmed by running the Python Performance Benchmark Suite
[15]_, which demonstrated that there is no difference between the
3.7 master branch and this PEP's reference implementation branch
(full benchmark results can be found here [16]_.)


Design Considerations
=====================

Alternative Immutable Dict Implementation
------------------------------------------

Languages like Clojure and Scala use Hash Array Mapped Tries (HAMT)
to implement high performance immutable collections [5]_, [6]_.

Immutable mappings implemented with HAMT have O(log\ :sub:`32`\ N)
performance for both ``set()`` and ``get()`` operations, which will
be essentially O(1) for relatively small mappings in EC.

To assess if HAMT can be used for Execution Context, we implemented
it in CPython [7]_.
.. figure:: pep-0550/hamt_vs_dict.png
   :align: center
   :width: 100%

   Figure 5.  Benchmark code can be found here: [9]_.

Figure 5 shows that HAMT indeed displays O(1) performance for all
benchmarked dictionary sizes. For dictionaries with less than 100
items, HAMT is a bit slower than Python dict/shallow copy.

.. figure:: pep-0550/lookup_hamt.png
   :align: center
   :width: 100%

   Figure 6.  Benchmark code can be found here: [10]_.

Figure 6 shows a comparison of lookup costs between Python dict
and an HAMT immutable mapping. HAMT lookup time is 30-40% worse
than Python dict lookups on average, which is a very good result,
considering how well Python dicts are optimized.

Note that, according to [8]_, the HAMT design can be further
improved.

The bottom line is that the current approach of implementing an
immutable mapping with a shallow-copied dict will likely perform
adequately in real-life applications. The HAMT solution is more
future proof, however.

The proposed API is designed in such a way that the underlying
implementation of the mapping can be changed completely without
affecting the Execution Context `Specification`_, which allows
us to switch to HAMT at some point if necessary.


Copy-on-write Execution Context
-------------------------------

The implementation of Execution Context in .NET is different from
this PEP. .NET uses a copy-on-write mechanism and a regular mutable
mapping.

One way to implement this in CPython would be to have two new
fields in ``PyThreadState``:

* ``exec_context`` pointing to the current Execution Context mapping;
* ``exec_context_copy_on_write`` flag, set to ``0`` initially.

The idea is that whenever we are modifying the EC, the copy-on-write
flag is checked, and if it is set to ``1``, the EC is copied.

Modifications to Coroutine and Generator ``.send()`` and ``.throw()``
methods described in the `Coroutines`_ section will be almost the
same, except that in addition to the ``gi_execution_context`` they
will have a ``gi_exec_context_copy_on_write`` flag. When a coroutine
or a generator starts, the flag will be set to ``1``. This will
ensure that any modification of the EC performed within a coroutine
or a generator will be isolated.

This approach has one advantage:

* For an Execution Context that contains a large number of items,
  copy-on-write is a more efficient solution than the shallow-copy
  dict approach.

However, we believe that the copy-on-write disadvantages are more
important to consider:

* Copy-on-write behaviour for generators and coroutines makes
  EC semantics less predictable.

  With the immutable EC approach, generators and coroutines always
  execute in the EC that was current at the moment of their creation.
  Any modifications to the outer EC while a generator or a coroutine
  is executing are not visible to them::

      def generator():
          yield 1
          print(get_execution_context_item('key'))
          yield 2

      set_execution_context_item('key', 'spam')
      gen = iter(generator())
      next(gen)
      set_execution_context_item('key', 'ham')
      next(gen)

  The above script will always print 'spam' with the immutable EC.

  With a copy-on-write approach, the above script will print 'ham'.
  Now, consider that ``generator()`` was refactored to call some
  library function that uses Execution Context::

      def generator():
          yield 1
          some_function_that_uses_decimal_context()
          print(get_execution_context_item('key'))
          yield 2

  Now, the script will print 'spam', because
  ``some_function_that_uses_decimal_context`` forced the EC to copy,
  and the ``set_execution_context_item('key', 'ham')`` line did not
  affect the ``generator()`` code after all.
Copy-on-write Execution Context
-------------------------------

The implementation of Execution Context in .NET is different from
this PEP.  .NET uses a copy-on-write mechanism and a regular mutable
mapping.

One way to implement this in CPython would be to have two new
fields in ``PyThreadState``:

* ``exec_context`` pointing to the current Execution Context mapping;
* ``exec_context_copy_on_write`` flag, set to ``0`` initially.

The idea is that whenever we are modifying the EC, the copy-on-write
flag is checked, and if it is set to ``1``, the EC is copied.

Modifications to the Coroutine and Generator ``.send()`` and
``.throw()`` methods described in the `Coroutines`_ section would be
almost the same, except that in addition to the
``gi_execution_context`` they would have a
``gi_exec_context_copy_on_write`` flag.  When a coroutine or a
generator starts, the flag would be set to ``1``.  This would ensure
that any modification of the EC performed within a coroutine or a
generator is isolated.

This approach has one advantage:

* For an Execution Context that contains a large number of items,
  copy-on-write is a more efficient solution than the shallow-copy
  dict approach.

However, we believe that the copy-on-write disadvantages are more
important to consider:

* Copy-on-write behaviour for generators and coroutines makes
  EC semantics less predictable.

  With the immutable EC approach, generators and coroutines always
  execute in the EC that was current at the moment of their
  creation.  Any modifications to the outer EC while a generator
  or a coroutine is executing are not visible to them::

      def generator():
          yield 1
          print(get_execution_context_item('key'))
          yield 2

      set_execution_context_item('key', 'spam')
      gen = iter(generator())
      next(gen)
      set_execution_context_item('key', 'ham')
      next(gen)

  The above script will always print 'spam' with an immutable EC.

  With a copy-on-write approach, the above script will print 'ham'.
  Now, consider that ``generator()`` was refactored to call some
  library function that uses Execution Context::

      def generator():
          yield 1
          some_function_that_uses_decimal_context()
          print(get_execution_context_item('key'))
          yield 2

  Now the script will print 'spam', because
  ``some_function_that_uses_decimal_context`` forced the EC to copy,
  and the ``set_execution_context_item('key', 'ham')`` line did not
  affect the ``generator()`` code after all.

* Similarly to the previous point, the ``sys.ExecutionContext.run()``
  method would also become less predictable, as
  ``sys.get_execution_context()`` would still return a reference to
  the current mutable EC.

  We can't modify ``sys.get_execution_context()`` to return a shallow
  copy of the current EC, because this would seriously harm the
  performance of ``loop.call_soon()`` and similar places, where
  it is important to propagate the Execution Context.

* Even though copy-on-write requires the execution context object to
  be shallow-copied less frequently, copying will still take place
  in coroutines and generators.  In that case, the HAMT approach
  will perform better for medium to large sized execution contexts.

All in all, we believe that the copy-on-write approach introduces
very subtle corner cases that could lead to bugs that are
exceptionally hard to discover and fix.

The immutable EC solution, in comparison, is always predictable and
easy to reason about.  Therefore we believe that any slight
performance gain that the copy-on-write solution might offer is not
worth it.


Faster C API
------------

Packages like numpy and standard library modules like decimal need
to frequently query the global state for some local context
configuration.  It is important that the APIs they use are as
fast as possible.

The proposed ``PyThreadState_SetExecContextItem`` and
``PyThreadState_GetExecContextItem`` functions need to get the
current thread state with ``PyThreadState_GET()`` (fast) and then
perform a hash lookup (relatively slow).  We can eliminate the hash
lookup by adding three additional C API functions:

* ``Py_ssize_t PyExecContext_RequestIndex(char *key_name)``:
  a function similar to the existing ``_PyEval_RequestCodeExtraIndex``
  introduced in :pep:`523`.  The idea is to request a unique index
  that can later be used to look up context items.

  The ``key_name`` can later be used by ``sys.ExecutionContext`` to
  introspect items added with this API.

* ``PyThreadState_SetExecContextIndexedItem(Py_ssize_t index, PyObject *val)``
  and ``PyThreadState_GetExecContextIndexedItem(Py_ssize_t index)``
  to access an item by its index, avoiding the cost of a hash lookup.


Why does setting a key to None remove the item?
-----------------------------------------------

Consider a context manager::

    @contextmanager
    def context(x):
        old_x = get_execution_context_item('x')
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            set_execution_context_item('x', old_x)

With the ``set_execution_context_item(key, None)`` call removing the
``key``, the user doesn't need to write additional code to remove
the ``key`` if it wasn't in the execution context already.

An alternative design with a ``del_execution_context_item()`` method
would look like the following::

    @contextmanager
    def context(x):
        not_there = object()
        old_x = get_execution_context_item('x', not_there)
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            if old_x is not_there:
                del_execution_context_item('x')
            else:
                set_execution_context_item('x', old_x)


Can we fix ``PyThreadState_GetDict()``?
---------------------------------------

``PyThreadState_GetDict`` is a TLS, and some of its existing users
might depend on it being just a TLS.  Changing its behaviour to follow
the Execution Context semantics would break backwards compatibility.
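A Python model of the indexed-lookup idea from the `Faster C API`_
section above may help: pay the hash lookup once when a module
reserves its index, then use plain list indexing on every access.
All names below are illustrative, not proposed APIs::

    _key_names = []               # index -> key name, for introspection

    def request_index(key_name):
        _key_names.append(key_name)
        return len(_key_names) - 1

    class IndexedContext:
        def __init__(self):
            self._items = [None] * len(_key_names)

        def set_indexed(self, index, value):
            self._items[index] = value    # O(1) list write, no hashing

        def get_indexed(self, index):
            return self._items[index]     # O(1) list read, no hashing

    # e.g. at module import time:
    DECIMAL_CTX_INDEX = request_index('decimal_context')

    ctx = IndexedContext()
    ctx.set_indexed(DECIMAL_CTX_INDEX, 'decimal state')
    assert ctx.get_indexed(DECIMAL_CTX_INDEX) == 'decimal state'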
PEP 521
-------

:pep:`521` proposes an alternative solution to the problem:
enhance the Context Manager Protocol with two new methods,
``__suspend__`` and ``__resume__``.  To make it compatible with
async/await, the Asynchronous Context Manager Protocol would also
need to be extended with ``__asuspend__`` and ``__aresume__``.

This makes it possible to implement context managers like decimal
context and ``numpy.errstate`` for generators and coroutines.

The following code::

    class Context:

        def __enter__(self):
            self.old_x = get_execution_context_item('x')
            set_execution_context_item('x', 'something')

        def __exit__(self, *err):
            set_execution_context_item('x', self.old_x)

would become this::

    class Context:

        def __enter__(self):
            self.old_x = get_execution_context_item('x')
            set_execution_context_item('x', 'something')

        def __suspend__(self):
            set_execution_context_item('x', self.old_x)

        def __resume__(self):
            set_execution_context_item('x', 'something')

        def __exit__(self, *err):
            set_execution_context_item('x', self.old_x)

Besides complicating the protocol, the implementation will likely
negatively impact the performance of coroutines, generators, and any
code that uses context managers, and will notably complicate the
interpreter implementation.  It also does not solve the leaking-state
problem for greenlet/gevent.

:pep:`521` also does not provide any mechanism to propagate state
in a local context, like storing a request object in an HTTP request
handler to have better logging.


Can Execution Context be implemented outside of CPython?
--------------------------------------------------------

Because async/await code needs an event loop to run it, an EC-like
solution can be implemented in a limited way for coroutines.

Generators, on the other hand, do not have an event loop or
trampoline, making it impossible to intercept their ``yield`` points
outside of the Python interpreter.


Reference Implementation
========================

The reference implementation can be found here: [11]_.


References
==========

.. [1] https://blog.golang.org/context

.. [2] https://msdn.microsoft.com/en-us/library/system.threading.executioncontext.aspx

.. [3] https://github.com/numpy/numpy/issues/9444

.. [4] http://bugs.python.org/issue31179

.. [5] https://en.wikipedia.org/wiki/Hash_array_mapped_trie

.. [6] http://blog.higher-order.net/2010/08/16/assoc-and-clojures-persistenthashmap-part-ii.html

.. [7] https://github.com/1st1/cpython/tree/hamt

.. [8] https://michael.steindorfer.name/publications/oopsla15.pdf

.. [9] https://gist.github.com/1st1/9004813d5576c96529527d44c5457dcd

.. [10] https://gist.github.com/1st1/dbe27f2e14c30cce6f0b5fddfc8c437e

.. [11] https://github.com/1st1/cpython/tree/pep550

.. [12] https://www.python.org/dev/peps/pep-0492/#async-await

.. [13] https://github.com/MagicStack/uvloop/blob/master/examples/bench/echoserver.py

.. [14] https://github.com/MagicStack/pgbench

.. [15] https://github.com/python/performance

.. [16] https://gist.github.com/1st1/6b7a614643f91ead3edf37c4451a6b4c


Copyright
=========

This document has been placed in the public domain.
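To illustrate the ``asyncio.events.Handle`` change described in the
Modifications in Standard Library section above, here is a
self-contained toy model.  ``ToyContext`` and ``Handle`` below are
simplifications invented for this sketch, not the actual asyncio or
``sys.ExecutionContext`` code::

    # Model of "loop.call_soon() runs callbacks in the execution
    # context in which they were scheduled".
    _current = {'ctx': {}}

    class ToyContext(dict):
        def run(self, func, *args):
            saved = _current['ctx']
            _current['ctx'] = self          # make this EC current
            try:
                return func(*args)
            finally:
                _current['ctx'] = saved     # restore the previous EC

    class Handle:
        def __init__(self, callback, args):
            self._callback = callback
            self._args = args
            # capture the EC that is current at scheduling time
            self._context = ToyContext(_current['ctx'])

        def _run(self):
            return self._context.run(self._callback, *self._args)

    _current['ctx'] = ToyContext(request_id=1)
    h = Handle(lambda: print(_current['ctx'].get('request_id')), ())
    _current['ctx'] = ToyContext(request_id=2)
    h._run()    # prints 1: the context captured at scheduling time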
From antoine.rozo at gmail.com  Fri Aug 11 19:46:29 2017
From: antoine.rozo at gmail.com (Antoine Rozo)
Date: Sat, 12 Aug 2017 01:46:29 +0200
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: 
References: 
Message-ID: 

Hi,

Is a new EC type really needed? Cannot this be done with
collections.ChainMap?

2017-08-12 0:37 GMT+02:00 Yury Selivanov :

> Hi,
>
> This is a new PEP to implement Execution Contexts in Python.
>
> The PEP is in-flight to python.org, and in the meanwhile can
> be read on GitHub:
>
> https://github.com/python/peps/blob/master/pep-0550.rst
>
> (it contains a few diagrams and charts, so please read it there.)
>
> Thank you!
> Yury

-- 
Antoine Rozo
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
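For reference, a sketch of the kind of design the question implies:
each task or generator would get a ``new_child()`` of the current
chain and write only to its own front map, leaving the parent
untouched.  This is illustrative code, not a worked-out proposal:

    from collections import ChainMap

    ec = ChainMap({'precision': 100})

    def isolated(ec):
        local = ec.new_child()      # writes go to the child map only
        local['precision'] = 50
        return local['precision'], ec['precision']

    print(isolated(ec))   # (50, 100) -- the parent context is unchanged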
From alberto at metapensiero.it  Fri Aug 11 19:46:50 2017
From: alberto at metapensiero.it (Alberto Berti)
Date: Sat, 12 Aug 2017 01:46:50 +0200
Subject: [Python-ideas] Towards harmony with JavaScript?
References: <87shgykpcl.fsf@ender.lizardnet> <87k229lxya.fsf@ender.lizardnet>
 <87fucxlubt.fsf@ender.lizardnet>
Message-ID: <87bmnlllat.fsf@ender.lizardnet>

>>>>> "Carl" == Carl Smith writes:

Carl> Python is not a good fit for the browser, in part, because of
Carl> the syntax. JavaScript has issues, but its syntax is better
Carl> suited to creating GUIs in the browser.

Just so?

Carl> For example, in browsers everything revolves around a single
Carl> threaded event loop, so you have a lot of callbacks and event
Carl> handlers,

You can write applications full of callbacks using libraries like
Twisted or even asyncio, and you can build entire applications
involving ajax and such without callbacks, as JS got async/await too
in ES8.

event handlers are written more or less the same in Python or
Javascript

Carl> which makes function expressions really useful, but Python
Carl> doesn't have expressions that contain blocks, because of
Carl> significant indentation.

yes, i agree that the difference between a lambda and an anonymous
function is very significant on the way you may think to write your
code.

Carl> As a result, ordinary JS, like this...

Carl> $(function(){ $("spam").click(function(){ alert("spam clicked") }) });

I don't think you mean this is real JS application code :-)

Carl> ...ends up looking like this...

Carl> def on_ready():
Carl>     def click_handler(): alert("spam clicked")
Carl>     jQuery("spam").click(click_handler)
Carl> jQuery(on_ready)

or just

jQuery(lambda: jQuery("spam").click(lambda: alert("spam clicked")))

Carl> JS semantics means JS libraries, which have APIs that assume JS
Carl> syntax. Python library developers make heavy use of language
Carl> specific features to define elegant, Pythonic APIs, which is a
Carl> big part of what makes the language so nice to use.

language specific features... like?

From yselivanov.ml at gmail.com  Fri Aug 11 20:12:34 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Fri, 11 Aug 2017 20:12:34 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: 
References: 
Message-ID: 

[duplicating my reply cc-ing python-ideas]

> Is a new EC type really needed? Cannot this be done with
> collections.ChainMap?

No, not really.

ChainMap will have O(N) lookup performance, where N is the number of
contexts you have in the chain.  This will degrade the performance of
lookups, which isn't acceptable for some potential EC users like
decimal/numpy/etc.  Inventing heuristics to manage the chain size is
harder than making an immutable dict (which is easy to reason about.)

Chaining contexts will also force them to reference each other,
creating cycles that the GC won't be able to break.

Besides just performance considerations, with a ChainMap design of
contexts it's not possible to properly isolate state changes inside
of generators or coroutines/tasks as it's done in the PEP.

All in all, I don't think that chaining can solve the problem.  It
will likely lead to a more complicated solution in the end (this was
my initial approach FWIW).

Yury
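The lookup cost is easy to demonstrate: a key that lives at the bottom
of a ChainMap gets slower to find as the chain grows.  A rough
micro-benchmark (numbers will vary by machine):

    import timeit
    from collections import ChainMap

    flat = {'key': 1}
    deep = ChainMap({'key': 1})
    for _ in range(100):
        deep = deep.new_child({})     # 100 empty child contexts on top

    print(timeit.timeit(lambda: flat['key']))   # plain dict lookup
    print(timeit.timeit(lambda: deep['key']))   # walks ~100 maps first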
From carl.input at gmail.com  Fri Aug 11 20:15:38 2017
From: carl.input at gmail.com (Carl Smith)
Date: Sat, 12 Aug 2017 01:15:38 +0100
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: <87bmnlllat.fsf@ender.lizardnet>
References: <87shgykpcl.fsf@ender.lizardnet> <87k229lxya.fsf@ender.lizardnet>
 <87fucxlubt.fsf@ender.lizardnet> <87bmnlllat.fsf@ender.lizardnet>
Message-ID: 

Using lambdas doesn't solve the problem. I just kept the example short,
but had I used more than one expression in each function, you'd be back
to square one. You took advantage of the brevity of the example, but
it's not realistic.

There are lots of language specific features that library authors use,
like operator overloading, ABCs etc...

Python is a great language, and I always opt for it when it's an
option, but I've used it to write front-end code, and it sucks.

-- Carl Smith
carl.input at gmail.com

On 12 August 2017 at 00:46, Alberto Berti wrote:

> >>>>> "Carl" == Carl Smith writes:
>
> Carl> Python is not a good fit for the browser, in part, because of
> Carl> the syntax. JavaScript has issues, but its syntax is better
> Carl> suited to creating GUIs in the browser.
>
> Just so?
>
> Carl> For example, in browsers everything revolves around a single
> Carl> threaded event loop, so you have a lot of callbacks and event
> Carl> handlers,
>
> You can write applications full of callbacks using libraries like
> Twisted or even asyncio, and you can build entire applications
> involving ajax and such without callbacks, as JS got async/await too
> in ES8.
>
> event handlers are written more or less the same in Python or
> Javascript
>
> Carl> which makes function expressions really useful, but Python
> Carl> doesn't have expressions that contain blocks, because of
> Carl> significant indentation.
>
> yes, i agree that the difference between a lambda and an anonymous
> function is very significant on the way you may think to write your
> code.
>
> Carl> As a result, ordinary JS, like this...
>
> Carl> $(function(){ $("spam").click(function(){ alert("spam clicked") }) });
>
> I don't think you mean this is real JS application code :-)
>
> Carl> ...ends up looking like this...
>
> Carl> def on_ready():
> Carl>     def click_handler(): alert("spam clicked")
> Carl>     jQuery("spam").click(click_handler)
> Carl> jQuery(on_ready)
>
> or just
>
> jQuery(lambda: jQuery("spam").click(lambda: alert("spam clicked")))
>
> Carl> JS semantics means JS libraries, which have APIs that assume JS
> Carl> syntax. Python library developers make heavy use of language
> Carl> specific features to define elegant, Pythonic APIs, which is a
> Carl> big part of what makes the language so nice to use.
>
> language specific features... like?
>
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From rosuav at gmail.com  Fri Aug 11 20:18:51 2017
From: rosuav at gmail.com (Chris Angelico)
Date: Sat, 12 Aug 2017 10:18:51 +1000
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: <87shgykpcl.fsf@ender.lizardnet> <87k229lxya.fsf@ender.lizardnet>
 <87fucxlubt.fsf@ender.lizardnet> <87bmnlllat.fsf@ender.lizardnet>
Message-ID: 

On Sat, Aug 12, 2017 at 10:15 AM, Carl Smith wrote:
> Using lambdas doesn't solve the problem. I just kept the example short,
> but had I used more than one expression in each function, you'd be back
> to square one. You took advantage of the brevity of the example, but
> it's not realistic.
>
> There are lots of language specific features that library authors use,
> like operator overloading, ABCs etc...
>
> Python is a great language, and I always opt for it when it's an
> option, but I've used it to write front-end code, and it sucks.

Code to its strengths. A few well-written function decorators will
solve your problems elegantly. If you start with JS code and then try
to port it, of course that won't look good - but if you start with
idiomatic Python, it looks great.

ChrisA
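For illustration, one shape such decorators can take -- a tiny handler
registry. The ``handlers`` dict, the ``on`` decorator, and the event
names are invented for this example; they are not a real browser
bridge or library API:

    handlers = {}

    def on(event):
        """Register the decorated function as the handler for *event*."""
        def register(func):
            handlers[event] = func
            return func
        return register

    @on('ready')
    def setup():
        print('document ready')

    @on('spam.click')
    def spam_clicked():
        print('spam clicked')

    # the event loop / bridge would then dispatch:
    handlers['ready']()
    handlers['spam.click']()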
From alberto at metapensiero.it  Fri Aug 11 21:27:23 2017
From: alberto at metapensiero.it (Alberto Berti)
Date: Sat, 12 Aug 2017 03:27:23 +0200
Subject: [Python-ideas] Towards harmony with JavaScript?
References: <87shgykpcl.fsf@ender.lizardnet> <87k229lxya.fsf@ender.lizardnet>
 <87fucxlubt.fsf@ender.lizardnet> <87bmnlllat.fsf@ender.lizardnet>
Message-ID: <877ey9lgn8.fsf@ender.lizardnet>

>>>>> "Carl" == Carl Smith writes:

Carl> Using lambdas doesn't solve the problem. I just kept the example
Carl> short, but had I used more than one expression in each function,
Carl> you'd be back to square one. You took advantage of the brevity
Carl> of the example, but it's not realistic.

I already told you that it wasn't real application code, it was your
example by the way.

Carl> There are lots of language specific features that library
Carl> authors use, like operator overloading, ABCs etc...

Those are features that I do not consider core Python, and probably
they have similar, already available implementations in some
javascript libraries. I'm not stating that JS hasn't its limitations,
we are all well aware of that. JavaScripthon just reduces the effort
of recontextualizing your mind when jumping between python and js
code, and in doing that it just solves some of the more evident
shortcomings of JS for you. But maybe it's just my impression, i've
done it for me anyway ;-) . It produces JS so uncluttered that it even
allows redistributing just the transpiled JS sources when necessary.

As I said before, it's not a reimplementation of Python's standard
library in JS: there are plenty of libraries in JS that cover the same
areas of Python's standard library, and more and more that deal with
things related to manipulating the dom and browsers. I do not intend
to replace those, even because sooner or later you will have to use
them (I'm talking about libraries like react, angular and so on)...
there's no point for me in trying to build your own "python in the
browser" ecosystem.

Carl> Python is a great language, and I always opt for it when it's an
Carl> option, but I've used it to write front-end code, and it sucks.

What have you used?

From steve at pearwood.info  Fri Aug 11 22:03:43 2017
From: steve at pearwood.info (Steven D'Aprano)
Date: Sat, 12 Aug 2017 12:03:43 +1000
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: 
Message-ID: <20170812020343.GM7395@ando.pearwood.info>

Hi Jason, and welcome!

On Fri, Aug 11, 2017 at 04:57:46PM +0200, Jason H wrote:

> Before I don my firesuit, I'd like to say that I much prefer python
> and I rail on JS whenever I can. However these days it is quite common
> to be doing work in both Python and Javascript. Harmonizing the two
> would help JS developers pick up the language as well as people like
> me that are stuck working in JS as well.

I must say you're a brave man for coming here with a plan which is
going to be read as "Let's make Python worse!".

Have you considered going to the Javascript forums and suggesting that
they harmonise their language to be more like Python? After all:

- Python came first (1991 versus 1995);

- Python is already one of the stated influences on Javascript;

- whereas most of Javascript's influence on Python (the language) can
  be summed up as "whew, we avoided making that silly mistake!"
  (the world of web frameworks may be more kind to JS);

- according to the "popularity" links you give (and others!) Python is
  more popular than Javascript.
(the world of web frameworks may be more kind to JS); - according to the "popularity" links you give (and others!) Python is more popular than Javascript. > There are several annoyances that if removed, would go a long way. > 1. Object literals: JS: {a:1} vs Python: {'a':1} > Making my fingers dance on ' or " is not a good use of keystrokes, and it decreases readability. I disagree -- it *increases* readability, because I can always tell the difference between a literal string key and a name. I don't have to try to do a whole-program analysis of the entire application in my head to work out whether {a: 1} refers to a name or the literal string 'a'. {address: "123 Main Street"} *always* refers to the variable address, while: {"address": "123 Main Street"} *always* refers to the literal string "address". Or worse... if you're suggesting that Python should make the backwards-incompatible change that {address: ...} should always be the literal string key, thus breaking millions of working programs, just to save two keystrokes, sorry, that isn't going to happen. If you wish to avoid typing quotes, and your keys are valid identifiers, you can use the dict constructor with keyword arguments: dict(address="123 Main Street") > 2. Join: JS: [].join(s) vs Python: s.join([]) > I've read the justification for putting join on a string, and it > makes sense. But I think we should put it on the list too. And tuples, and dicts, and sets, and deques, and iterators, and generators, and strings, and bytes, and bytearrays, and arrays, and every other iterable type, including people's custom ones. No thank you. We don't need that enormous amount of code duplication and documentation bloat just for the sake of OOP syntactic purity. Or worse... we add it *just* to lists, not other iterables, and then we're confused why values.join(sep) sometimes works and sometimes fails. The obvious fix is clearly: if isinstance(values, list): values.join(sep) else: sep.join(values) but that has an unnecessary isinstance check and can be re-written as: sep.join(values) guaranteed to work for any well-behaved iterable, regardless of whether it is a list or not. > 3. Allow C/C++/JS style comments: JS:[ //, /* ] vs Python # > This one is pretty self-explanatory. What's not self-explanatory is why on earth you would want to type two characters // instead of one # ? Besides, // is already valid syntax in Python. Consider: result = x // 2 + y // -2 > but I don't know the repercussions of all of that. I think the above > could be implemented without breaking anything. It would break lots. -- Steve From jelle.zijlstra at gmail.com Fri Aug 11 23:46:12 2017 From: jelle.zijlstra at gmail.com (Jelle Zijlstra) Date: Fri, 11 Aug 2017 20:46:12 -0700 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: This is exciting and I'm happy that you're addressing this problem. We've solved a similar problem in our asynchronous programming framework, asynq. Our solution (implemented at https://github.com/quora/asynq/blob/master/asynq/contexts.py) is similar to that in PEP 521: we enhance the context manager protocol with pause/resume methods instead of using an enhanced form of thread-local state. Some of our use cases can't be implemented using this PEP; notably, we use a timing context that times how long an asynchronous function takes by repeatedly pausing and resuming the timer. However, this timing context adds significant overhead because we have to call the pause/resume methods so often. 
Overall, your approach is almost certainly more performant. 2017-08-11 15:37 GMT-07:00 Yury Selivanov : > Hi, > > This is a new PEP to implement Execution Contexts in Python. > > The PEP is in-flight to python.org, and in the meanwhile can > be read on GitHub: > > https://github.com/python/peps/blob/master/pep-0550.rst > > (it contains a few diagrams and charts, so please read it there.) > > Thank you! > Yury > > > PEP: 550 > Title: Execution Context > Version: $Revision$ > Last-Modified: $Date$ > Author: Yury Selivanov > Status: Draft > Type: Standards Track > Content-Type: text/x-rst > Created: 11-Aug-2017 > Python-Version: 3.7 > Post-History: 11-Aug-2017 > > > Abstract > ======== > > This PEP proposes a new mechanism to manage execution state--the > logical environment in which a function, a thread, a generator, > or a coroutine executes in. > > A few examples of where having a reliable state storage is required: > > * Context managers like decimal contexts, ``numpy.errstate``, > and ``warnings.catch_warnings``; > > * Storing request-related data such as security tokens and request > data in web applications; > > * Profiling, tracing, and logging in complex and large code bases. > > The usual solution for storing state is to use a Thread-local Storage > (TLS), implemented in the standard library as ``threading.local()``. > Unfortunately, TLS does not work for isolating state of generators or > asynchronous code because such code shares a single thread. > > > Rationale > ========= > > Traditionally a Thread-local Storage (TLS) is used for storing the > state. However, the major flaw of using the TLS is that it works only > for multi-threaded code. It is not possible to reliably contain the > state within a generator or a coroutine. For example, consider > the following generator:: > > def calculate(precision, ...): > with decimal.localcontext() as ctx: > # Set the precision for decimal calculations > # inside this block > ctx.prec = precision > > yield calculate_something() > yield calculate_something_else() > > Decimal context is using a TLS to store the state, and because TLS is > not aware of generators, the state can leak. The above code will > not work correctly, if a user iterates over the ``calculate()`` > generator with different precisions in parallel:: > > g1 = calculate(100) > g2 = calculate(50) > > items = list(zip(g1, g2)) > > # items[0] will be a tuple of: > # first value from g1 calculated with 100 precision, > # first value from g2 calculated with 50 precision. > # > # items[1] will be a tuple of: > # second value from g1 calculated with 50 precision, > # second value from g2 calculated with 50 precision. > > An even scarier example would be using decimals to represent money > in an async/await application: decimal calculations can suddenly > lose precision in the middle of processing a request. Currently, > bugs like this are extremely hard to find and fix. > > Another common need for web applications is to have access to the > current request object, or security context, or, simply, the request > URL for logging or submitting performance tracing data:: > > async def handle_http_request(request): > context.current_http_request = request > > await ... > # Invoke your framework code, render templates, > # make DB queries, etc, and use the global > # 'current_http_request' in that code. > > # This isn't currently possible to do reliably > # in asyncio out of the box. 
>
> These examples are just a few out of many where a reliable way to
> store context data is absolutely needed.
>
> The inability to use TLS for asynchronous code has led to a
> proliferation of ad-hoc solutions, each supported only by code that
> was explicitly adapted to work with them.
>
> The current status quo is that any library, including the standard
> library, that uses a TLS will likely not work as expected in
> asynchronous code or with generators (see [3]_ as an example issue.)
>
> Some languages that have coroutines or generators recommend manually
> passing a ``context`` object to every function, see [1]_
> describing the pattern for Go.  This approach, however, has limited
> use for Python, where we have a huge ecosystem that was built to work
> with a TLS-like context.  Moreover, passing the context explicitly
> does not work at all for libraries like ``decimal`` or ``numpy``,
> which use operator overloading.
>
> The .NET runtime, which has support for async/await, has a generic
> solution to this problem, called ``ExecutionContext`` (see [2]_).
> On the surface, working with it is very similar to working with a
> TLS, but the former explicitly supports asynchronous code.
>
>
> Goals
> =====
>
> The goal of this PEP is to provide a more reliable alternative to
> ``threading.local()``.  It should be explicitly designed to work with
> the Python execution model, equally supporting threads, generators,
> and coroutines.
>
> An acceptable solution for Python should meet the following
> requirements:
>
> * Transparent support for code executing in threads, coroutines,
>   and generators with an easy-to-use API.
>
> * Negligible impact on the performance of the existing code or the
>   code that will be using the new mechanism.
>
> * Fast C API for packages like ``decimal`` and ``numpy``.
>
> Explicit is still better than implicit, hence the new APIs should
> only be used when there is no option to pass the state explicitly.
>
> With this PEP implemented, it should be possible to update a context
> manager like the one below::
>
>     _local = threading.local()
>
>     @contextmanager
>     def context(x):
>         old_x = getattr(_local, 'x', None)
>         _local.x = x
>         try:
>             yield
>         finally:
>             _local.x = old_x
>
> to a more robust version that can be reliably used in generators
> and async/await code, with a simple transformation::
>
>     @contextmanager
>     def context(x):
>         old_x = get_execution_context_item('x')
>         set_execution_context_item('x', x)
>         try:
>             yield
>         finally:
>             set_execution_context_item('x', old_x)
>
>
> Specification
> =============
>
> This proposal introduces a new concept called Execution Context (EC),
> along with a set of Python APIs and C APIs to interact with it.
>
> EC is implemented using an immutable mapping.  Every modification
> of the mapping produces a new copy of it.  To illustrate what this
> means, let's compare it to how we work with tuples in Python::
>
>     a0 = ()
>     a1 = a0 + (1,)
>     a2 = a1 + (2,)
>
>     # a0 is an empty tuple
>     # a1 is (1,)
>     # a2 is (1, 2)
>
> Manipulating an EC object would be similar::
>
>     a0 = EC()
>     a1 = a0.set('foo', 'bar')
>     a2 = a1.set('spam', 'ham')
>
>     # a0 is an empty mapping
>     # a1 is {'foo': 'bar'}
>     # a2 is {'foo': 'bar', 'spam': 'ham'}
>
> In CPython, every thread that can execute Python code has a
> corresponding ``PyThreadState`` object.  It encapsulates important
> runtime information like a pointer to the current frame, and is
> used extensively by the ceval loop.
We add a new field to > ``PyThreadState``, called ``exec_context``, which points to the > current EC object. > > We also introduce a set of APIs to work with Execution Context. > In this section we will only cover two functions that are needed to > explain how Execution Context works. See the full list of new APIs > in the `New APIs`_ section. > > * ``sys.get_execution_context_item(key, default=None)``: lookup > ``key`` in the EC of the executing thread. If not found, > return ``default``. > > * ``sys.set_execution_context_item(key, value)``: get the > current EC of the executing thread. Add a ``key``/``value`` > item to it, which will produce a new EC object. Set the > new object as the current one for the executing thread. > In pseudo-code:: > > tstate = PyThreadState_GET() > ec = tstate.exec_context > ec2 = ec.set(key, value) > tstate.exec_context = ec2 > > Note, that some important implementation details and optimizations > are omitted here, and will be covered in later sections of this PEP. > > Now let's see how Execution Contexts work with regular multi-threaded > code, generators, and coroutines. > > > Regular & Multithreaded Code > ---------------------------- > > For regular Python code, EC behaves just like a thread-local. Any > modification of the EC object produces a new one, which is immediately > set as the current one for the thread state. > > .. figure:: pep-0550/functions.png > :align: center > :width: 90% > > Figure 1. Execution Context flow in a thread. > > As Figure 1 illustrates, if a function calls > ``set_execution_context_item()``, the modification of the execution > context will be visible to all subsequent calls and to the caller:: > > def set_foo(): > set_execution_context_item('foo', 'spam') > > set_execution_context_item('foo', 'bar') > print(get_execution_context_item('foo')) > > set_foo() > print(get_execution_context_item('foo')) > > # will print: > # bar > # spam > > > Coroutines > ---------- > > Python :pep:`492` coroutines are used to implement cooperative > multitasking. For a Python end-user they are similar to threads, > especially when it comes to sharing resources or modifying > the global state. > > An event loop is needed to schedule coroutines. Coroutines that > are explicitly scheduled by the user are usually called Tasks. > When a coroutine is scheduled, it can schedule other coroutines using > an ``await`` expression. In async/await world, awaiting a coroutine > can be viewed as a different calling convention: Tasks are similar to > threads, and awaiting on coroutines within a Task is similar to > calling functions within a thread. > > By drawing a parallel between regular multithreaded code and > async/await, it becomes apparent that any modification of the > execution context within one Task should be visible to all coroutines > scheduled within it. Any execution context modifications, however, > must not be visible to other Tasks executing within the same thread. > > To achieve this, a small set of modifications to the coroutine object > is needed: > > * When a coroutine object is instantiated, it saves a reference to > the current execution context object to its ``cr_execution_context`` > attribute. > > * Coroutine's ``.send()`` and ``.throw()`` methods are modified as > follows (in pseudo-C):: > > if coro->cr_isolated_execution_context: > # Save a reference to the current execution context > old_context = tstate->execution_context > > # Set our saved execution context as the current > # for the current thread. 
> tstate->execution_context = coro->cr_execution_context > > try: > # Perform the actual `Coroutine.send()` or > # `Coroutine.throw()` call. > return coro->send(...) > finally: > # Save a reference to the updated execution_context. > # We will need it later, when `.send()` or `.throw()` > # are called again. > coro->cr_execution_context = tstate->execution_context > > # Restore thread's execution context to what it was before > # invoking this coroutine. > tstate->execution_context = old_context > else: > # Perform the actual `Coroutine.send()` or > # `Coroutine.throw()` call. > return coro->send(...) > > * ``cr_isolated_execution_context`` is a new attribute on coroutine > objects. Set to ``True`` by default, it makes any execution context > modifications performed by coroutine to stay visible only to that > coroutine. > > When Python interpreter sees an ``await`` instruction, it flips > ``cr_isolated_execution_context`` to ``False`` for the coroutine > that is about to be awaited. This makes any changes to execution > context made by nested coroutine calls within a Task to be visible > throughout the Task. > > Because the top-level coroutine (Task) cannot be scheduled with > ``await`` (in asyncio you need to call ``loop.create_task()`` or > ``asyncio.ensure_future()`` to schedule a Task), all execution > context modifications are guaranteed to stay within the Task. > > * We always work with ``tstate->exec_context``. We use > ``coro->cr_execution_context`` only to store coroutine's execution > context when it is not executing. > > Figure 2 below illustrates how execution context mutations work with > coroutines. > > .. figure:: pep-0550/coroutines.png > :align: center > :width: 90% > > Figure 2. Execution Context flow in coroutines. > > In the above diagram: > > * When "coro1" is created, it saves a reference to the current > execution context "2". > > * If it makes any change to the context, it will have its own > execution context branch "2.1". > > * When it awaits on "coro2", any subsequent changes it does to > the execution context are visible to "coro1", but not outside > of it. > > In code:: > > async def inner_foo(): > print('inner_foo:', get_execution_context_item('key')) > set_execution_context_item('key', 2) > > async def foo(): > print('foo:', get_execution_context_item('key')) > > set_execution_context_item('key', 1) > await inner_foo() > > print('foo:', get_execution_context_item('key')) > > > set_execution_context_item('key', 'spam') > print('main:', get_execution_context_item('key')) > > asyncio.get_event_loop().run_until_complete(foo()) > > print('main:', get_execution_context_item('key')) > > which will output:: > > main: spam > foo: spam > inner_foo: 1 > foo: 2 > main: spam > > Generator-based coroutines (generators decorated with > ``types.coroutine`` or ``asyncio.coroutine``) behave exactly as > native coroutines with regards to execution context management: > their ``yield from`` expression is semantically equivalent to > ``await``. > > > Generators > ---------- > > Generators in Python, while similar to Coroutines, are used in a > fundamentally different way. They are producers of data, and > they use ``yield`` expression to suspend/resume their execution. > > A crucial difference between ``await coro`` and ``yield value`` is > that the former expression guarantees that the ``coro`` will be > executed to the end, while the latter is producing ``value`` and > suspending the generator until it gets iterated again. 
>
> Generators share 99% of their implementation with coroutines, and
> thus have similar new attributes ``gi_execution_context`` and
> ``gi_isolated_execution_context``.  Similar to coroutines, generators
> save a reference to the current execution context when they are
> instantiated.  They have the same implementation of the ``.send()``
> and ``.throw()`` methods.
>
> The only difference is that
> ``gi_isolated_execution_context`` is always set to ``True``, and
> is never modified by the interpreter.  The ``yield from o`` expression
> in regular generators that are not decorated with ``types.coroutine``
> is semantically equivalent to ``for v in o: yield v``.
>
> .. figure:: pep-0550/generators.png
>    :align: center
>    :width: 90%
>
>    Figure 3.  Execution Context flow in a generator.
>
> In the above diagram:
>
> * When "gen1" is created, it saves a reference to the current
>   execution context "2".
>
> * If it makes any change to the context, it will have its own
>   execution context branch "2.1".
>
> * When "gen2" is created, it saves a reference to the current
>   execution context for it -- "2.1".
>
> * Any subsequent execution context updates in "gen2" will only
>   be visible to "gen2".
>
> * Likewise, any context changes that "gen1" makes after it
>   created "gen2" will not be visible to "gen2".
>
> In code::
>
>     def inner_foo():
>         for i in range(3):
>             print('inner_foo:', get_execution_context_item('key'))
>             set_execution_context_item('key', i)
>             yield i
>
>
>     def foo():
>         set_execution_context_item('key', 'spam')
>         print('foo:', get_execution_context_item('key'))
>
>         inner = inner_foo()
>
>         while True:
>             val = next(inner, None)
>             if val is None:
>                 break
>             yield val
>             print('foo:', get_execution_context_item('key'))
>
>     set_execution_context_item('key', 'ham')
>     print('main:', get_execution_context_item('key'))
>
>     list(foo())
>
>     print('main:', get_execution_context_item('key'))
>
> which will output::
>
>     main: ham
>     foo: spam
>     inner_foo: spam
>     foo: spam
>     inner_foo: 0
>     foo: spam
>     inner_foo: 1
>     foo: spam
>     main: ham
>
> As we see, any modification of the execution context in a generator
> is visible only to the generator itself.
>
> There is one use-case where it is desired for generators to affect
> the surrounding execution context: the ``contextlib.contextmanager``
> decorator.  To make the following work::
>
>     @contextmanager
>     def context(x):
>         old_x = get_execution_context_item('x')
>         set_execution_context_item('x', x)
>         try:
>             yield
>         finally:
>             set_execution_context_item('x', old_x)
>
> we modified ``contextmanager`` to flip the
> ``gi_isolated_execution_context`` flag to ``False`` on its generator.
>
>
> Greenlets
> ---------
>
> Greenlet is an alternative implementation of cooperative
> scheduling for Python.  Although the greenlet package is not part of
> CPython, popular frameworks like gevent rely on it, and it is
> important that greenlet can be modified to support execution
> contexts.
>
> In a nutshell, the greenlet design is very similar to the design of
> generators.  The main difference is that for generators, the stack
> is managed by the Python interpreter.  Greenlet works outside of the
> Python interpreter, and manually saves some ``PyThreadState``
> fields and pushes/pops the C-stack.  Since Execution Context is
> implemented on top of ``PyThreadState``, it's easy to add
> transparent support of it to greenlet.
>
> New APIs
> ========
>
> Even though this PEP adds a number of new APIs, please keep in mind
> that most Python users will likely only ever use two of them:
> ``sys.get_execution_context_item()`` and
> ``sys.set_execution_context_item()``.
>
>
> Python
> ------
>
> 1. ``sys.get_execution_context_item(key, default=None)``: look up
>    ``key`` in the current Execution Context.  If not found,
>    return ``default``.
>
> 2. ``sys.set_execution_context_item(key, value)``: set a
>    ``key``/``value`` item for the current Execution Context.
>    If ``value`` is ``None``, the item will be removed.
>
> 3. ``sys.get_execution_context()``: return the current Execution
>    Context object: ``sys.ExecutionContext``.
>
> 4. ``sys.set_execution_context(ec)``: set the passed
>    ``sys.ExecutionContext`` instance as the current one for the
>    current thread.
>
> 5. ``sys.ExecutionContext`` object.
>
>    Implementation detail: ``sys.ExecutionContext`` wraps a low-level
>    ``PyExecContextData`` object.  ``sys.ExecutionContext`` has a
>    mutable mapping API, abstracting away the real immutable
>    ``PyExecContextData``.
>
>    * ``ExecutionContext()``: construct a new, empty execution
>      context.
>
>    * ``ec.run(func, *args)`` method: run ``func(*args)`` in the
>      ``ec`` execution context.
>
>    * ``ec[key]``: look up ``key`` in the ``ec`` context.
>
>    * ``ec[key] = value``: assign a ``key``/``value`` item to the
>      ``ec``.
>
>    * ``ec.get()``, ``ec.items()``, ``ec.values()``, ``ec.keys()``, and
>      ``ec.copy()`` are similar to those of a ``dict`` object.
>
>
> C API
> -----
>
> The C API is different from the Python one because it operates
> directly on the low-level immutable ``PyExecContextData`` object.
>
> 1. New ``PyThreadState->exec_context`` field, pointing to a
>    ``PyExecContextData`` object.
>
> 2. ``PyThreadState_SetExecContextItem`` and
>    ``PyThreadState_GetExecContextItem``: similar to
>    ``sys.set_execution_context_item()`` and
>    ``sys.get_execution_context_item()``.
>
> 3. ``PyThreadState_GetExecContext``: similar to
>    ``sys.get_execution_context()``.  Always returns a
>    ``PyExecContextData`` object.  If ``PyThreadState->exec_context``
>    is ``NULL``, a new and empty one will be created and assigned
>    to ``PyThreadState->exec_context``.
>
> 4. ``PyThreadState_SetExecContext``: similar to
>    ``sys.set_execution_context()``.
>
> 5. ``PyExecContext_New``: create a new, empty ``PyExecContextData``
>    object.
>
> 6. ``PyExecContext_SetItem`` and ``PyExecContext_GetItem``.
>
> The exact layout of ``PyExecContextData`` is private, which allows
> switching it to a different implementation later.  More on that
> in the `Implementation Details`_ section.
>
>
> Modifications in Standard Library
> =================================
>
> * ``contextlib.contextmanager`` was updated to flip the new
>   ``gi_isolated_execution_context`` attribute on the generator.
>
> * The ``asyncio.events.Handle`` object now captures the current
>   execution context when it is created, and uses the saved
>   execution context to run the callback (with the
>   ``ExecutionContext.run()`` method.)  This makes
>   ``loop.call_soon()`` run callbacks in the execution context
>   they were scheduled in.
>
> No modifications in ``asyncio.Task`` or ``asyncio.Future`` were
> necessary.
>
> Some standard library modules like ``warnings`` and ``decimal``
> can be updated to use the new execution contexts.  This will be
> considered in separate issues if this PEP is accepted.
>
>
> Backwards Compatibility
> =======================
>
> This proposal preserves 100% backwards compatibility.
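To make the proposed Python-level API concrete, a short hypothetical
snippet (none of these functions exist yet; the behaviour shown simply
follows the specification quoted above):

    import sys

    def handler():
        # Lookups fall back to the provided default when the key
        # is not present in the current execution context.
        print(sys.get_execution_context_item('request_id', 'missing'))

    sys.set_execution_context_item('request_id', 42)
    handler()                    # prints: 42

    ec = sys.ExecutionContext()  # construct a new, empty EC
    ec.run(handler)              # prints: missing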
>
> Performance
> ===========
>
> Implementation Details
> ----------------------
>
> The new ``PyExecContextData`` object wraps a ``dict`` object.
> Any modification requires creating a shallow copy of the dict.
>
> While working on the reference implementation of this PEP, we were
> able to optimize the ``dict.copy()`` operation **5.5x**, see [4]_
> for details.
>
> .. figure:: pep-0550/dict_copy.png
>    :align: center
>    :width: 100%
>
>    Figure 4.
>
> Figure 4 shows that the performance of an immutable dict implemented
> with shallow copying is expectedly O(n) for the ``set()`` operation.
> However, this is tolerable until the dict has more than 100 items
> (1 ``set()`` takes about a microsecond.)
>
> Judging by the number of modules that need EC in the Standard
> Library, it is likely that real-world Python applications will use
> significantly fewer than 100 execution context variables.
>
> The important point is that the cost of accessing a key in
> Execution Context is always O(1).
>
> If the ``set()`` operation performance is a major concern, we discuss
> alternative approaches that have O(1) or close ``set()`` performance
> in the `Alternative Immutable Dict Implementation`_, `Faster C API`_,
> and `Copy-on-write Execution Context`_ sections.
>
>
> Generators and Coroutines
> -------------------------
>
> Using a microbenchmark for generators and coroutines from :pep:`492`
> ([12]_), it was possible to observe a 0.5 to 1% performance
> degradation.
>
> asyncio echoserver microbenchmarks from the uvloop project [13]_
> showed a 1-1.5% performance degradation for asyncio code.
>
> asyncpg benchmarks [14]_, which execute more code and are closer to a
> real-world application, did not exhibit any noticeable performance
> change.
>
>
> Overall Performance Impact
> --------------------------
>
> The total number of changed lines in the ceval loop is 2 -- in the
> ``YIELD_FROM`` opcode implementation.  Only the performance of
> generators and coroutines can be affected by the proposal.
>
> This was confirmed by running the Python Performance Benchmark Suite
> [15]_, which demonstrated that there is no difference between
> the 3.7 master branch and this PEP's reference implementation branch
> (full benchmark results can be found here [16]_.)
>
>
> Design Considerations
> =====================
>
> Alternative Immutable Dict Implementation
> -----------------------------------------
>
> Languages like Clojure and Scala use Hash Array Mapped Tries (HAMT)
> to implement high-performance immutable collections [5]_, [6]_.
>
> Immutable mappings implemented with HAMT have O(log\ :sub:`32`\ N)
> performance for both ``set()`` and ``get()`` operations, which will
> be essentially O(1) for relatively small mappings in EC.
>
> To assess if HAMT can be used for Execution Context, we implemented
> it in CPython [7]_.
>
> .. figure:: pep-0550/hamt_vs_dict.png
>    :align: center
>    :width: 100%
>
>    Figure 5.  Benchmark code can be found here: [9]_.
>
> Figure 5 shows that HAMT indeed displays O(1) performance for all
> benchmarked dictionary sizes.  For dictionaries with fewer than 100
> items, HAMT is a bit slower than Python dict/shallow copy.
>
> .. figure:: pep-0550/lookup_hamt.png
>    :align: center
>    :width: 100%
>
>    Figure 6.  Benchmark code can be found here: [10]_.
>
> Figure 6 shows a comparison of lookup costs between Python dict
> and an HAMT immutable mapping.  HAMT lookup time is 30-40% worse
> than Python dict lookups on average, which is a very good result,
> considering how well Python dicts are optimized.
> > Note, that according to [8]_, HAMT design can be further improved. > > The bottom line is that the current approach with implementing > an immutable mapping with shallow-copying dict will likely perform > adequately in real-life applications. The HAMT solution is more > future proof, however. > > The proposed API is designed in such a way that the underlying > implementation of the mapping can be changed completely without > affecting the Execution Context `Specification`_, which allows > us to switch to HAMT at some point if necessary. > > > Copy-on-write Execution Context > ------------------------------- > > The implementation of Execution Context in .NET is different from > this PEP. .NET uses copy-on-write mechanism and a regular mutable > mapping. > > One way to implement this in CPython would be to have two new > fields in ``PyThreadState``: > > * ``exec_context`` pointing to the current Execution Context mapping; > * ``exec_context_copy_on_write`` flag, set to ``0`` initially. > > The idea is that whenever we are modifying the EC, the copy-on-write > flag is checked, and if it is set to ``1``, the EC is copied. > > Modifications to Coroutine and Generator ``.send()`` and ``.throw()`` > methods described in the `Coroutines`_ section will be almost the > same, except that in addition to the ``gi_execution_context`` they > will have a ``gi_exec_context_copy_on_write`` flag. When a coroutine > or a generator starts, the flag will be set to ``1``. This will > ensure that any modification of the EC performed within a coroutine > or a generator will be isolated. > > This approach has one advantage: > > * For Execution Context that contains a large number of items, > copy-on-write is a more efficient solution than the shallow-copy > dict approach. > > However, we believe that copy-on-write disadvantages are more > important to consider: > > * Copy-on-write behaviour for generators and coroutines makes > EC semantics less predictable. > > With immutable EC approach, generators and coroutines always > execute in the EC that was current at the moment of their > creation. Any modifications to the outer EC while a generator > or a coroutine is executing are not visible to them:: > > def generator(): > yield 1 > print(get_execution_context_item('key')) > yield 2 > > set_execution_context_item('key', 'spam') > gen = iter(generator()) > next(gen) > set_execution_context_item('key', 'ham') > next(gen) > > The above script will always print 'spam' with immutable EC. > > With a copy-on-write approach, the above script will print 'ham'. > Now, consider that ``generator()`` was refactored to call some > library function, that uses Execution Context:: > > def generator(): > yield 1 > some_function_that_uses_decimal_context() > print(get_execution_context_item('key')) > yield 2 > > Now, the script will print 'spam', because > ``some_function_that_uses_decimal_context`` forced the EC to copy, > and ``set_execution_context_item('key', 'ham')`` line did not > affect the ``generator()`` code after all. > > * Similarly to the previous point, ``sys.ExecutionContext.run()`` > method will also become less predictable, as > ``sys.get_execution_context()`` would still return a reference to > the current mutable EC. > > We can't modify ``sys.get_execution_context()`` to return a shallow > copy of the current EC, because this would seriously harm > performance of ``asyncio.call_soon()`` and similar places, where > it is important to propagate the Execution Context. 
> * Even though copy-on-write means the execution context object has
>   to be shallow-copied less frequently, copying will still take place
>   in coroutines and generators.  In that case, the HAMT approach will
>   perform better for medium to large execution contexts.
>
> All in all, we believe that the copy-on-write approach introduces
> very subtle corner cases that could lead to bugs that are
> exceptionally hard to discover and fix.
>
> The immutable EC solution, in comparison, is always predictable and
> easy to reason about.  Therefore we believe that any slight
> performance gain that the copy-on-write solution might offer is not
> worth it.
>
>
> Faster C API
> ------------
>
> Packages like numpy and standard library modules like decimal need
> to frequently query the global state for some local context
> configuration.  It is important that the APIs they use are as
> fast as possible.
>
> The proposed ``PyThreadState_SetExecContextItem`` and
> ``PyThreadState_GetExecContextItem`` functions need to get the
> current thread state with ``PyThreadState_GET()`` (fast) and then
> perform a hash lookup (relatively slow).  We can eliminate the hash
> lookup by adding three additional C API functions:
>
> * ``Py_ssize_t PyExecContext_RequestIndex(char *key_name)``:
>   a function similar to the existing ``_PyEval_RequestCodeExtraIndex``
>   introduced in :pep:`523`.  The idea is to request a unique index
>   that can later be used to look up context items.
>
>   The ``key_name`` can later be used by ``sys.ExecutionContext`` to
>   introspect items added with this API.
>
> * ``PyThreadState_SetExecContextIndexedItem(Py_ssize_t index, PyObject *val)``
>   and ``PyThreadState_GetExecContextIndexedItem(Py_ssize_t index)``
>   to request an item by its index, avoiding the cost of a hash lookup.
>
>
> Why does setting a key to None remove the item?
> ------------------------------------------------
>
> Consider a context manager::
>
>     @contextmanager
>     def context(x):
>         old_x = get_execution_context_item('x')
>         set_execution_context_item('x', x)
>         try:
>             yield
>         finally:
>             set_execution_context_item('x', old_x)
>
> With the ``set_execution_context_item(key, None)`` call removing the
> ``key``, the user doesn't need to write additional code to remove
> the ``key`` if it wasn't in the execution context already.
>
> An alternative design with a ``del_execution_context_item()`` method
> would look like the following::
>
>     @contextmanager
>     def context(x):
>         not_there = object()
>         old_x = get_execution_context_item('x', not_there)
>         set_execution_context_item('x', x)
>         try:
>             yield
>         finally:
>             if old_x is not_there:
>                 del_execution_context_item('x')
>             else:
>                 set_execution_context_item('x', old_x)
>
>
> Can we fix ``PyThreadState_GetDict()``?
> ---------------------------------------
>
> ``PyThreadState_GetDict`` is a TLS, and some of its existing users
> might depend on it being just a TLS.  Changing its behaviour to follow
> the Execution Context semantics would break backwards compatibility.
>
>
> PEP 521
> -------
>
> :pep:`521` proposes an alternative solution to the problem:
> enhance the Context Manager Protocol with two new methods,
> ``__suspend__`` and ``__resume__``.  To make it compatible with
> async/await, the Asynchronous Context Manager Protocol will also
> need to be extended with ``__asuspend__`` and ``__aresume__``.
>
> This makes it possible to implement context managers like decimal
> context and ``numpy.errstate`` for generators and coroutines.
> > The following code:: > > class Context: > > def __enter__(self): > self.old_x = get_execution_context_item('x') > set_execution_context_item('x', 'something') > > def __exit__(self, *err): > set_execution_context_item('x', self.old_x) > > would become this:: > > class Context: > > def __enter__(self): > self.old_x = get_execution_context_item('x') > set_execution_context_item('x', 'something') > > def __suspend__(self): > set_execution_context_item('x', self.old_x) > > def __resume__(self): > set_execution_context_item('x', 'something') > > def __exit__(self, *err): > set_execution_context_item('x', self.old_x) > > Besides complicating the protocol, the implementation will likely > negatively impact performance of coroutines, generators, and any code > that uses context managers, and will notably complicate the > interpreter implementation. It also does not solve the leaking state > problem for greenlet/gevent. > > :pep:`521` also does not provide any mechanism to propagate state > in a local context, like storing a request object in an HTTP request > handler to have better logging. > > > Can Execution Context be implemented outside of CPython? > -------------------------------------------------------- > > Because async/await code needs an event loop to run it, an EC-like > solution can be implemented in a limited way for coroutines. > > Generators, on the other hand, do not have an event loop or > trampoline, making it impossible to intercept their ``yield`` points > outside of the Python interpreter. > > > Reference Implementation > ======================== > > The reference implementation can be found here: [11]_. > > > References > ========== > > .. [1] https://blog.golang.org/context > > .. [2] https://msdn.microsoft.com/en-us/library/system.threading. > executioncontext.aspx > > .. [3] https://github.com/numpy/numpy/issues/9444 > > .. [4] http://bugs.python.org/issue31179 > > .. [5] https://en.wikipedia.org/wiki/Hash_array_mapped_trie > > .. [6] http://blog.higher-order.net/2010/08/16/assoc-and-clojures- > persistenthashmap-part-ii.html > > .. [7] https://github.com/1st1/cpython/tree/hamt > > .. [8] https://michael.steindorfer.name/publications/oopsla15.pdf > > .. [9] https://gist.github.com/1st1/9004813d5576c96529527d44c5457dcd > > .. [10] https://gist.github.com/1st1/dbe27f2e14c30cce6f0b5fddfc8c437e > > .. [11] https://github.com/1st1/cpython/tree/pep550 > > .. [12] https://www.python.org/dev/peps/pep-0492/#async-await > > .. [13] https://github.com/MagicStack/uvloop/blob/master/examples/ > bench/echoserver.py > > .. [14] https://github.com/MagicStack/pgbench > > .. [15] https://github.com/python/performance > > .. [16] https://gist.github.com/1st1/6b7a614643f91ead3edf37c4451a6b4c > > > Copyright > ========= > > This document has been placed in the public domain. > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From yselivanov.ml at gmail.com Sat Aug 12 00:16:45 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Sat, 12 Aug 2017 00:16:45 -0400 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: > This is exciting and I'm happy that you're addressing this problem. Thank you! 
> Some of our use cases can't be implemented using this PEP; notably, we use a timing context that times how long an asynchronous function takes by repeatedly pausing and resuming the timer. Measuring performance of coroutines is a bit different kind of problem. With PEP 550 you will be able to decouple context management from collecting performance data. That would allow you to subclass asyncio.Task (let's call it InstrumentedTask) and implement all extra tracing functionality on it (by overriding its _send method for example). Then you could set a custom task factory that would use InstrumentedTask only for a fraction of requests. That would make it possible to collect performance metrics even in production (my 2c). Yury From ericsnowcurrently at gmail.com Sat Aug 12 01:02:18 2017 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Fri, 11 Aug 2017 23:02:18 -0600 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: On Aug 11, 2017 16:38, "Yury Selivanov" wrote: Hi, This is a new PEP to implement Execution Contexts in Python. Nice! I've had something like this on the back burner for a while as it helps solve some problems with encapsulating the import state (e.g. PEP 408). -eric -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Sat Aug 12 01:17:34 2017 From: guido at python.org (Guido van Rossum) Date: Fri, 11 Aug 2017 22:17:34 -0700 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: I may have missed this (I've just skimmed the doc), but what's the rationale for making the EC an *immutable* mapping? It's impressive that you managed to create a faster immutable dict, but why does the use case need one? -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Sat Aug 12 01:33:14 2017 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 11 Aug 2017 22:33:14 -0700 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: On Fri, Aug 11, 2017 at 10:17 PM, Guido van Rossum wrote: > I may have missed this (I've just skimmed the doc), but what's the rationale > for making the EC an *immutable* mapping? It's impressive that you managed > to create a faster immutable dict, but why does the use case need one? In this proposal, you have lots and lots of semantically distinct ECs. Potentially every stack frame has its own (at least in async code). So instead of copying the EC every time they create a new one, they want to copy it when it's written to. This is a win if writes are relatively rare compared to the creation of ECs. You could probably optimize it a bit more by checking the refcnt before writing, and skipping the copy if it's exactly 1. But even simpler is to just always copy and throw away the old version. -n -- Nathaniel J. Smith -- https://vorpus.org From yselivanov.ml at gmail.com Sat Aug 12 01:41:06 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Sat, 12 Aug 2017 01:41:06 -0400 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: [replying to the list] > I may have missed this (I've just skimmed the doc), but what's the rationale for making the EC an *immutable* mapping? 
It's possible to implement Execution Context with a mutable mapping and
copy-on-write (as it's done in .NET).  This is one of the approaches that
I tried, and I discovered that it causes a bunch of subtle
inconsistencies in contexts for generators and coroutines.  I've tried
to cover this here:
https://www.python.org/dev/peps/pep-0550/#copy-on-write-execution-context

All in all, I believe that the immutable mapping approach gives the most
predictable and easy-to-reason-about model.  If its performance on a
large number of items in the EC is a concern, I'll be happy to implement
it using HAMT (also covered in the PEP).

Yury

From yselivanov.ml at gmail.com  Sat Aug 12 01:43:33 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sat, 12 Aug 2017 01:43:33 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

> On Fri, Aug 11, 2017 at 10:17 PM, Guido van Rossum wrote:
> > I may have missed this (I've just skimmed the doc), but what's the rationale
> > for making the EC an *immutable* mapping? It's impressive that you managed
> > to create a faster immutable dict, but why does the use case need one?

> In this proposal, you have lots and lots of semantically distinct ECs.
> Potentially every stack frame has its own (at least in async code). So
> instead of copying the EC every time they create a new one, they want
> to copy it when it's written to. This is a win if writes are
> relatively rare compared to the creation of ECs.

Correct.  If we decide to use HAMT, the ratio of writes/reads becomes
less important though.

Yury

From yselivanov.ml at gmail.com  Sat Aug 12 01:45:10 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sat, 12 Aug 2017 01:45:10 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

Thanks Eric!

PEP 408 -- Standard library __preview__ package?

Yury

From njs at pobox.com  Sat Aug 12 03:54:03 2017
From: njs at pobox.com (Nathaniel Smith)
Date: Sat, 12 Aug 2017 00:54:03 -0700
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

Hi Yury,

This is really cool.  Some notes on a first read:

1. Excellent work on optimizing dict, that seems valuable independent
of the rest of the details here.

2. The text doesn't mention async generators at all.  I assume they
also have an agi_isolated_execution_context flag that can be set, to
enable @asynccontextmanager?

2a. Speaking of which I wonder if it's possible for async_generator to
emulate this flag... I don't know if this matters -- at this point the
main reason to use async_generator is for code that wants to support
PyPy.  If PyPy gains native async generator support before CPython 3.7
comes out then async_generator may be entirely irrelevant before
PEP 550 matters.  But right now async_generator is still quite handy...

2b. BTW, the contextmanager trick is quite nice -- I actually noticed
last week that PEP 521 had a problem here, but didn't think of a
solution :-).

3. You're right that numpy is *very* performance sensitive about
accessing the context -- the errstate object is needed extremely
frequently, even on trivial operations like adding two scalars, so a
dict lookup is very noticeable.  (Imagine adding a dict lookup to
float.__add__.)  Right now, the errstate object gets stored in the
threadstate dict, and then there are some dubious-looking hacks
involving a global (not thread-local) counter to let us skip the lookup
entirely if we think that no errstate object has been set.
Really what we ought to be doing (currently, in a non PEP 550 world) is storing the errstate in a __thread variable -- it'd certainly be worth it. Adopting PEP 550 would definitely be easier if we knew that it wasn't ruling out that level of optimization. 4. I'm worried that all of your examples use string keys. One of the great things about threading.local objects is that each one is a new namespace, which is a honking great idea -- here it prevents accidental collisions between unrelated libraries. And while it's possible to implement threading.local in terms of the threadstate dict (that's how they work now!), it requires some extremely finicky code to get the memory management right: https://github.com/python/cpython/blob/dadca480c5b7c5cf425d423316cd695bc5db3023/Modules/_threadmodule.c#L558-L595 It seems like you're imagining that this API will be used directly by user code? Is that true? ...Are you sure that's a good idea? Are we just assuming that not many keys will be used and the keys will generally be immortal anyway, so leaking entries is OK? Maybe this is nit-picking, but this is hooking into the language semantics in such a deep way that I sorta feel like it would be bad to end up with something where we can never get garbage collection right. The suggested index-based API for super fast C lookup also has this problem, but that would be such a low-level API -- and not part of the language definition -- that the right answer is probably just to document that there's no way to unallocate indices so any given C library should only allocate, like... 1 of them. Maybe provide an explicit API to release an index, if we really want to get fancy. 5. Is there some performance-related reason that the API for getting/setting isn't just sys.get_execution_context()[...] = ...? Or even sys.execution_context[...]? 5a. Speaking of which I'm not a big fan of the None-means-delete behavior. Not only does Python have a nice standard way to describe all the mapping operations without such hacks, but you're actually implementing that whole interface anyway. Why not use it? 6. Should Thread.start inherit the execution context from the spawning thread? 7. Compatibility: it does sort of break 3rd party contextmanager implementations (contextlib2, asyncio_extras's acontextmanager, trio's internal acontextmanager, ...). This is extremely minor though. 8. You discuss how this works for asyncio and gevent. Have you looked at how it will interact with tornado's context handling system? Can they use this? It's the most important extant context implementation I can think of (aside from thread local storage itself). 9. OK, my big question, about semantics. The PEP's design is based on the assumption that all context-local state is scalar-like, and contexts split but never join. But there are some cases where this isn't true, in particular for values that have "stack-like" semantics. These are terms I just made up, but let me give some examples. Python's sys.exc_info is one. Another I ran into recently is for trio's cancel scopes. So basically the background is, in trio you can wrap a context manager around any arbitrary chunk of code and then set a timeout or explicitly cancel that code. It's called a "cancel scope". These are fully nestable. Full details here: https://trio.readthedocs.io/en/latest/reference-core.html#cancellation-and-timeouts Currently, the implementation involves keeping a stack of cancel scopes in Task-local storage. 
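Roughly, the scheme looks like this (illustrative names only, this is
not trio's actual code):

    import threading

    _local = threading.local()  # in trio this is per-Task, not per-thread

    class CancelScope:
        def __enter__(self):
            stack = getattr(_local, 'cancel_stack', None)
            if stack is None:
                stack = _local.cancel_stack = []
            stack.append(self)
            return self

        def __exit__(self, *exc):
            popped = _local.cancel_stack.pop()
            # If a frame yields while a scope is on the stack, this
            # pop can happen out of order -- see the example below:
            assert popped is self, "cancel scope stack corrupted"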
This works fine for regular async code because when we switch Tasks, we also switch the cancel scope stack. But of course it falls apart for generators/async generators: async def agen(): with fail_after(10): # 10 second timeout for finishing this block await some_blocking_operation() yield await another_blocking_operation() async def caller(): with fail_after(20): ag = agen() await ag.__anext__() # now that cancel scope is on the stack, even though we're not # inside the context manager! this will not end well. await some_blocking_operation() # this might get cancelled when it shouldn't # even if it doesn't, we'll crash here when exiting the context manager # because we try to pop a cancel scope that isn't at the top of the stack So I was thinking about whether I could implement this using PEP 550. It requires some cleverness, but I could switch to representing the stack as a singly-linked list, and then snapshot it and pass it back to the coroutine runner every time I yield. That would fix the case above. But, I think there's another case that's kind of a showstopper. async def agen(): await some_blocking_operation() yield async def caller(): ag = agen() # context is captured here with fail_after(10): await ag.__anext__() Currently this case works correctly: the timeout is applied to the __anext__ call, as you'd expect. But with PEP 550, it wouldn't work: the generator's timeouts would all be fixed when it was instantiated, and we wouldn't be able to detect that the second call has a timeout imposed on it. So that's a pretty nasty footgun. Any time you have code that's supposed to have a timeout applied, but in fact has no timeout applied, then that's a really serious bug -- it can lead to hangs, trivial DoS, pagers going off, etc. Another problem is code like: async def caller(): with fail_after(10): ag = agen() # then exit the scope Can we clean up the cancel scope? (e.g., remove it from the global priority queue that tracks timeouts?) Normally yes, that's what __exit__ blocks are for, letting you know deterministically that an object can be cleaned up. But here it got captured by the async generator. I really don't want to have to rely on the GC, because on PyPy it means that we could leak an unbounded number of cancel scopes for a finite but unbounded number of time, and all those extra entries in the global timeout priority queue aren't free. (And sys.exc_info has had buggy behavior in analogous situations.) So, I'm wondering if you (or anyone) have any ideas how to fix this :-). Technically, PEP 521 is powerful enough to do it, but in practice the performance would be catastrophically bad. It's one thing to have some extra cost to yielding out of an np.errstate block, those are rare and yielding out of them is rare. But cancel scopes are different: essentially all code in trio runs inside one or more of them, so every coroutine suspend/resume would have to call all those suspend/resume hooks up and down the stack. OTOH PEP 550 is fast, but AFAICT its semantics are wrong for this use case. The basic invariant I want is: if at any given moment you stop and take a backtrace, and then look at the syntactic surroundings of each line in the backtrace and write down a list of all the 'with' blocks that the code *looks* like it's inside, then context lookups should give the same result as they would if you simply entered all of those with blocks in order. Generators make it tricky to maintain this invariant, because a generator frame's backtrace changes every time you call next(). 
But those are the semantics that make the most sense to me, and seem
least surprising in practice.  These are also IIUC the semantics that
exc_info is supposed to follow (though historically the interaction of
exc_info and generators has had lots of bugs, not sure if that's been
fixed or not).

...and now that I've written that down, I sort of feel like that might
be what you want for all the other sorts of context object too?  Like,
here's a convoluted example:

    def gen():
        a = decimal.Decimal("1.111")
        b = decimal.Decimal("2.222")
        print(a + b)
        yield
        print(a + b)

    def caller():
        # let's pretend this context manager exists,
        # the actual API is more complicated
        with decimal_context_precision(3):
            g = gen()
            with decimal_context_precision(2):
                next(g)
            with decimal_context_precision(1):
                next(g)

Currently, this will print "3.3 3", because when the generator is
resumed it inherits the context of the resuming site.  With PEP 550, it
would print "3.33 3.33" (or maybe "3.3 3.3"? it's not totally clear
from the text), because it inherits the context when the generator is
created and then ignores the calling context.  It's hard to get strong
intuitions, but I feel like the current behavior is actually more
sensible -- each time the generator gets resumed, the next bit of code
runs in the context of whoever called next(), and the generator is just
passively inheriting context, so ... that makes sense.

OTOH of course if you change the generator code to:

    def gen():
        a = decimal.Decimal("1.111")
        b = decimal.Decimal("2.222")
        with decimal_context_precision(4):
            print(a + b)
            yield
            print(a + b)

then it should print "3.333 3.333", because the generator is overriding
the caller -- now when we resume the frame we're re-entering the
decimal_context_precision(4) block, so it should take priority.

So ... maybe all context variables are "stack-like"?

-n

--
Nathaniel J. Smith -- https://vorpus.org

From stefan at bytereef.org  Sat Aug 12 06:33:39 2017
From: stefan at bytereef.org (Stefan Krah)
Date: Sat, 12 Aug 2017 12:33:39 +0200
Subject: [Python-ideas] New PEP 550: Execution Context
Message-ID: <20170812103339.GA2735@bytereef.org>

Yury Selivanov wrote:
> This is a new PEP to implement Execution Contexts in Python.

The idea is of course great!

A couple of issues for decimal:

> Moreover, passing the context explicitly does not work at all for
> libraries like ``decimal`` or ``numpy``, which use operator overloading.

Instead of "with localcontext() ...", each coroutine can create a new
Context() and use its methods, without any loss of functionality.  All
one loses is the inline operator syntax sugar.

I'm aware you know all this, but the entire decimal paragraph sounds a
bit as if this option did not exist.

> Fast C API for packages like ``decimal`` and ``numpy``.

_decimal relies on caching the most recently used thread-local context,
which gives a speedup of about 25% for inline operators:

https://github.com/python/cpython/blob/master/Modules/_decimal/_decimal.c#L1639

Can this speed be achieved with the execution contexts?  IOW, can the
lookup of an execution context be as fast as PyThreadState_GET()?

Stefan Krah

From alberto at metapensiero.it  Sat Aug 12 07:04:45 2017
From: alberto at metapensiero.it (Alberto Berti)
Date: Sat, 12 Aug 2017 13:04:45 +0200
Subject: [Python-ideas] Towards harmony with JavaScript?
References: <87shgykpcl.fsf@ender.lizardnet>
 <87k229lxya.fsf@ender.lizardnet> <87fucxlubt.fsf@ender.lizardnet>
Message-ID: <87tw1djbci.fsf@ender.lizardnet>

>>>>> "Chris" == Chris Angelico writes:

Chris> On Sat, Aug 12, 2017 at 6:31 AM, Alberto Berti wrote:
>> As of now, I do nothing. As I said, the goal of the tool is not to
>> shield you from JS, for this reason it's not meant for beginners (in
>> either JS or Python). You always manipulate JS objects, but it allows
>> you to be naive about all that plethora of JS idiosyncrasies (from a
>> Python PoV, at least) that you have to think about when you frequently
>> switch from Python to JS.

Chris> Do you "retain most of Python language semantics", or do you "always
Chris> manipulate JS objects"? As shown in a previous post, there are some
Chris> subtle and very dangerous semantic differences between the languages.
Chris> You can't have it both ways.

That's right, you can't have it both ways.  That's the difficult
decision to make, because as you add more and more Python APIs to those
supported, you'll probably end up creating your own "Python island in
JS", where you need to transform the objects you manipulate from/to JS
in the functions that are called by external JS code (either manually
or automatically).  And on the other hand, if you don't add any Pythonic
API, you will end up with ugly Python code that yes, is valid Python
code, but is nothing you would like to see.

JavaScripthon was and is an experiment to see how much of the "Pythonic
way of expressing algorithms" can be retained while adding as little
"runtime" as possible.  That's the reason why it targets ES6+
JavaScript: the "points of contact" between the two languages are much
greater in number.

As an example, let's take the following simple code:

    def test():
        a = 'foo'
        d = {a: 'bar'}
        return d[a]

one can naively translate it to:

    function test() {
        var a, d;
        a = 'foo';
        d = {a: 'bar'};
        return d[a];
    }

but it returns 'bar' in Python and undefined in JS.  Even if it's just a
simple case expressed in a four-line function, it's one of those things
that can slip through when coding in both languages at the same time (at
least for me).

So I asked myself if it was worthwhile to have a tool that:

* allows me to use Python syntax to write some amount of JS code.  I'm
  more accustomed to Python syntax and I like it more.  It's generally
  more terse and has fewer distractions (like variable declarations and
  line terminations);

* fixes as many of these things as possible automatically, without my
  having to precisely remember that this is a "corner case" in JS that
  must be handled with care (so it reduces the "context-switching"
  effort);

* produces good-looking JS code that's still possible to read and
  follow without much trouble.

How many "corner cases" like this are there in JS?  In my coding
experience, "thanks" to the fact that JS is much less "harmonious" than
Python (my opinion), I've encountered many of them, and there are also
many simple Python coding habits that are translatable in a simple way.

So what does the tool do in this case?

    $ pj -s -
    def test():
        a = 'foo'
        d = {a: 'bar'}
        return d[a]

    function test() {
        var a, d;
        a = "foo";
        d = {[a]: "bar"};
        return d[a];
    }

It turns out that ES6 has a special notation for what JS calls
"computed property names": keys in object literals that aren't literal
strings.  Does it evaluate the way a Python developer expects when run?
Let's see:

    $ pj -s - -e
    def test():
        a = 'foo'
        d = {a: 'bar'}
        return d[a]

    test()
    bar

From alberto at metapensiero.it  Sat Aug 12 07:41:46 2017
From: alberto at metapensiero.it (Alberto Berti)
Date: Sat, 12 Aug 2017 13:41:46 +0200
Subject: [Python-ideas] Towards harmony with JavaScript?
References: <87shgykpcl.fsf@ender.lizardnet>
 <87k229lxya.fsf@ender.lizardnet> <87fucxlubt.fsf@ender.lizardnet>
 <87bmnlllat.fsf@ender.lizardnet>
Message-ID: <87poc1j9mt.fsf@ender.lizardnet>

>>>>> "Carl" == Carl Smith writes:

Carl> Using lambdas doesn't solve the problem. I just kept the example short, but
Carl> had I used more than one expression in each function, you'd be back to
Carl> square one. You took advantage of the brevity of the example, but it's not
Carl> realistic.

Let me elaborate more on this... yes, I took "advantage" of the brevity
of your example, but there's another side to it.

In my JS coding I usually avoid non-trivial anonymous functions in real
applications.  The reason is that if an error happens inside an
anonymous function (maybe the last one in a series of anonymous
functions), the stack trace of that error will end up with references
like "in anonymous function at line xy of 'foo.js'", and that doesn't
allow me to get a first idea of what the code was doing when the error
was thrown.  That's why I don't like them, and why I don't have a great
opinion of large codebases making extensive use of them.

It also appears to me that the trend in some (relevant) parts of the JS
community is to refrain from using them when possible, in favor of a
more structured approach to coding that resembles class-based
componentization, like in React.

cheers,

Alberto

From ncoghlan at gmail.com  Sat Aug 12 10:12:04 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 13 Aug 2017 00:12:04 +1000
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

On 12 August 2017 at 08:37, Yury Selivanov wrote:
> Hi,
>
> This is a new PEP to implement Execution Contexts in Python.
>
> The PEP is in-flight to python.org, and in the meanwhile can
> be read on GitHub:
>
> https://github.com/python/peps/blob/master/pep-0550.rst
>
> (it contains a few diagrams and charts, so please read it there.)

The fully rendered version is also up now:
https://www.python.org/dev/peps/pep-0550/

Thanks for this!  The general approach looks good to me, so I just have
some questions about specifics of the API:

1. Are you sure you want to expose the CoW type to pure Python code?

The draft API looks fairly error-prone to me, as I'm not sure of the
intended differences in behaviour between the following:

    @contextmanager
    def context(x):
        old_x = sys.get_execution_context_item('x')
        sys.set_execution_context_item('x', x)
        try:
            yield
        finally:
            sys.set_execution_context_item('x', old_x)

    @contextmanager
    def context(x):
        old_x = sys.get_execution_context().get('x')
        sys.get_execution_context()['x'] = x
        try:
            yield
        finally:
            sys.get_execution_context()['x'] = old_x

    @contextmanager
    def context(x):
        ec = sys.get_execution_context()
        old_x = ec.get('x')
        ec['x'] = x
        try:
            yield
        finally:
            ec['x'] = old_x

It seems to me that everything would be a lot safer if the *only*
Python level API was a live dynamic view that completely hid the
copy-on-write behaviour behind an "ExecutionContextProxy" type, such
that the last two examples were functionally equivalent to each other
and to the current PEP's get/set functions (rendering the latter
redundant, and allowing it to be dropped from the PEP).
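Something along these lines, sketched on top of the PEP's proposed
get/set primitives purely to show the shape of the idea:

    import sys
    from collections.abc import MutableMapping

    _MISSING = object()

    class ExecutionContextProxy(MutableMapping):
        # Every read and write goes through the *current* execution
        # context, so the proxy is always live and never holds a
        # stale snapshot.
        def __getitem__(self, key):
            value = sys.get_execution_context_item(key, _MISSING)
            if value is _MISSING:
                raise KeyError(key)
            return value

        def __setitem__(self, key, value):
            sys.set_execution_context_item(key, value)

        def __delitem__(self, key):
            # the PEP spells removal as setting the key to None
            sys.set_execution_context_item(key, None)

        def __iter__(self):
            return iter(sys.get_execution_context().keys())

        def __len__(self):
            return len(list(sys.get_execution_context().keys()))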
If Python code wanted a snapshot of the current state, it would need to
call sys.get_execution_context().copy(), which would give it a plain
dictionary containing a shallow copy of the execution context at that
particular point in time.

If there's a genuine need to expose the raw copy-on-write machinery to
Python level code (e.g. for asyncio's benefit), then that could be more
clearly marked as "here be dragons" territory that most folks aren't
going to want to touch (e.g. "sys.get_raw_execution_context()").

2. Do we need an ag_isolated_execution_context for asynchronous
generators?  (Modify this question as needed for the answer to the next
question)

3. It bothers me that *_execution_context points to an actual execution
context, while *_isolated_execution_context is a boolean.  With names
that similar I'd expect them to point to the same kind of object.

Would it work to adjust that setting to say that rather than being an
"isolated/not isolated" boolean, we instead made it a cr_back reverse
pointer to the awaiting coroutine (akin to f_back in the frame stack),
such that we had a doubly-linked list that defined the coroutine call
stacks via their cr_await and cr_back attributes?

If we did that, we'd have:

Top-level Task: cr_back -> NULL (C) or None (Python)
Awaited coroutine: cr_back -> coroutine that awaited this one (which
would in turn have a cr_await reference back to here)

coroutine.send()/throw() would then save and restore the execution
context around the call if cr_back was NULL/None (equivalent to
isolated==True in the current PEP), and leave it alone otherwise
(equivalent to isolated==False).

For generators, gi_back would normally be NULL/None (since we don't
typically couple regular generators to a single managing object), but
could be set appropriately by types.coroutine when the generator-based
coroutine is awaited, and by contextlib.contextmanager before starting
the underlying generator.

(It may even make sense to break the naming symmetry for that
attribute, and call it something like "gi_owner", since generators
don't form a clean await-based logical call chain the way native
coroutines do).

Cheers, Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From ncoghlan at gmail.com  Sat Aug 12 10:20:28 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 13 Aug 2017 00:20:28 +1000
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

On 12 August 2017 at 15:45, Yury Selivanov wrote:
> Thanks Eric!
>
> PEP 408 -- Standard library __preview__ package?

Typo in the PEP number: PEP 406, which was an ultimately failed attempt
to get away from the reliance on process globals to manage the import
system by encapsulating the top level state as an "Import Engine":
https://www.python.org/dev/peps/pep-0406/

We still like the idea in principle (hence the Withdrawn status rather
than being Rejected), but someone needs to find time to take a run at
designing a new version of it atop the cleaner PEP 451 import plugin
API (hence why the *specific* proposal in PEP 406 has been withdrawn).

Cheers, Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From ncoghlan at gmail.com  Sat Aug 12 12:22:31 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 13 Aug 2017 02:22:31 +1000
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: <87shgykpcl.fsf@ender.lizardnet> <87k229lxya.fsf@ender.lizardnet>
Message-ID: 

On 12 August 2017 at 06:10, Chris Barker wrote:
>> Taking this off the list as it's no longer on topic.
>
> not totally -- I'm going to add my thoughts:
>
> 1) If you want a smoother transition between server-side Python and
> in-browser code, maybe you're better off using one of the "python in the
> browser" solutions -- there are at least a few viable ones.

More experimentally, there's also toga's "web" backend (which allows you to take an application you developed with the primary intention of running it as a rich client application on mobile or desktop devices, and instead publish it as a Django web application with a JavaScript frontend).

Essentially, the relationship we see between Python and JavaScript is similar to the one that exists between Python and C/C++/Rust/Go/etc, just on the side that sits between the Python code and the GUI, rather than between the Python code and the compute & storage systems.

As such, there are various libraries and transpilers that are designed to handle writing the JavaScript *for* you (bokeh, toga, JavaScripthon, etc), and the emergence of WASM as a frontend equivalent to machine code on the backend is only going to make the similarities in those dynamics more pronounced.

In that vein, it's highly *un*likely we'd add any redundant constructs to Python purely to make it easier for JS developers to use JS idioms in Python instead of Pythonic ones, but JavaScript *is* one of the languages we look at for syntactic consistency when considering potential new additions to Python.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From guido at python.org  Sat Aug 12 12:22:53 2017
From: guido at python.org (Guido van Rossum)
Date: Sat, 12 Aug 2017 09:22:53 -0700
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: 
References: 
Message-ID: 

Thanks for the explanation. Can you make sure this is explained in the PEP?

On Aug 11, 2017 10:43 PM, "Yury Selivanov" wrote:

> > On Fri, Aug 11, 2017 at 10:17 PM, Guido van Rossum wrote:
> > > I may have missed this (I've just skimmed the doc), but what's the
> > > rationale for making the EC an *immutable* mapping? It's impressive
> > > that you managed to create a faster immutable dict, but why does the
> > > use case need one?
> >
> > In this proposal, you have lots and lots of semantically distinct ECs.
> > Potentially every stack frame has its own (at least in async code). So
> > instead of copying the EC every time they create a new one, they want
> > to copy it when it's written to. This is a win if writes are
> > relatively rare compared to the creation of ECs.
>
> Correct. If we decide to use HAMT, the ratio of writes/reads becomes
> less important though.
>
> Yury

From ncoghlan at gmail.com  Sat Aug 12 13:09:54 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 13 Aug 2017 03:09:54 +1000
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: 
References: 
Message-ID: 

On 12 August 2017 at 17:54, Nathaniel Smith wrote:
> ...and now that I've written that down, I sort of feel like that might
> be what you want for all the other sorts of context object too? Like,
> here's a convoluted example:
>
>     def gen():
>         a = decimal.Decimal("1.111")
>         b = decimal.Decimal("2.222")
>         print(a + b)
>         yield
>         print(a + b)
>
>     def caller():
>         # let's pretend this context manager exists, the actual API is
>         # more complicated
>         with decimal_context_precision(3):
>             g = gen()
>         with decimal_context_precision(2):
>             next(g)
>         with decimal_context_precision(1):
>             next(g)
>
> Currently, this will print "3.3 3", because when the generator is
> resumed it inherits the context of the resuming site. With PEP 550, it
> would print "3.33 3.33" (or maybe "3.3 3.3"? it's not totally clear
> from the text), because it inherits the context when the generator is
> created and then ignores the calling context. It's hard to get strong
> intuitions, but I feel like the current behavior is actually more
> sensible -- each time the generator gets resumed, the next bit of code
> runs in the context of whoever called next(), and the generator is
> just passively inheriting context, so ... that makes sense.

Now that you raise this point, I think it means that generators need to retain their current context inheritance behaviour, simply for backwards compatibility purposes. This means that the case we need to enable is the one where the generator *doesn't* dynamically adjust its execution context to match that of the calling function.

One way that could work (using the cr_back/gi_back convention I suggested):

- generators start with gi_back not set
- if gi_back is NULL/None, gi.send() and gi.throw() set it to the calling frame for the duration of the synchronous call and *don't* adjust the execution context (i.e. the inverse of coroutine behaviour)
- if gi_back is already set, then gi.send() and gi.throw() *do* save and restore the execution context around synchronous calls in to the generator frame

To create an autonomous generator (i.e. one that didn't dynamically update its execution context), you'd use a decorator like:

    def autonomous_generator(gf):
        @functools.wraps(gf)
        def wrapper(*args, **kwds):
            gi = gf(*args, **kwds)
            gi.gi_back = gi.gi_frame
            return gi
        return wrapper

Asynchronous generators would then work like synchronous generators: ag_back would be NULL/None by default, and dynamically set for the duration of each __anext__ call. If you wanted to create an autonomous one, you'd make its back reference a circular reference to itself to disable the implicit dynamic updates.

When I put it in those terms though, I think the cr_back/gi_back/ag_back idea should actually be orthogonal to the "revert_context" flag (so you can record the link back to the caller even when maintaining an autonomous context).

Given that, you'd have the following initial states for "revert context" (currently called "isolated context" in the PEP):

* unawaited coroutines: true (same as PEP)
* awaited coroutines: false (same as PEP)
* generators (both sync & async): false (opposite of current PEP)
* autonomous generators: true (set "gi_revert_context" or "ag_revert_context" explicitly)

Open question: whether having "yield" inside a with statement implies the creation of an autonomous generator (synchronous or otherwise), or whether you'd need a decorator to get your context management right in such cases.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From carl.input at gmail.com  Sat Aug 12 13:44:41 2017
From: carl.input at gmail.com (Carl Smith)
Date: Sat, 12 Aug 2017 18:44:41 +0100
Subject: [Python-ideas] Towards harmony with JavaScript?
In-Reply-To: 
References: <87shgykpcl.fsf@ender.lizardnet> <87k229lxya.fsf@ender.lizardnet>
Message-ID: 

Alberto, CoffeeScript is a popular language that is widely considered to represent JavaScript's best bits, and it only has anonymous functions, so there's a large part of the JS community that disagrees with you there.

Browsers actually do identify anonymous functions, based on the variable/property names that reference them, so the following function would be identified as `square` in tracebacks:

    let square = function(x) { return x * x };

In any case, passing anonymous functions to higher order functions is commonplace in real-world JS. Chris may be right about using decorators as a Pythonic alternative [I haven't really considered that properly to be honest], but you can't just tell people not to do something that they see as elegant and idiomatic.

Best
-- Carl Smith
carl.input at gmail.com

On 12 August 2017 at 17:22, Nick Coghlan wrote:

[..]

From yselivanov.ml at gmail.com  Sat Aug 12 13:53:30 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sat, 12 Aug 2017 13:53:30 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: 
References: 
Message-ID: 

Nick, Nathaniel,

I'll be replying in full to your emails when I have time to do some experiments.
Now I just want to address one point that I think is important:

On Sat, Aug 12, 2017 at 1:09 PM, Nick Coghlan wrote:
> On 12 August 2017 at 17:54, Nathaniel Smith wrote:
>> ...and now that I've written that down, I sort of feel like that might
>> be what you want for all the other sorts of context object too? Like,
>> here's a convoluted example:
>>
>>     def gen():
>>         a = decimal.Decimal("1.111")
>>         b = decimal.Decimal("2.222")
>>         print(a + b)
>>         yield
>>         print(a + b)
>>
>>     def caller():
>>         # let's pretend this context manager exists, the actual API is
>>         # more complicated
>>         with decimal_context_precision(3):
>>             g = gen()
>>         with decimal_context_precision(2):
>>             next(g)
>>         with decimal_context_precision(1):
>>             next(g)
>>
>> Currently, this will print "3.3 3", because when the generator is
>> resumed it inherits the context of the resuming site. With PEP 550, it
>> would print "3.33 3.33" (or maybe "3.3 3.3"? it's not totally clear
>> from the text), because it inherits the context when the generator is
>> created and then ignores the calling context. It's hard to get strong
>> intuitions, but I feel like the current behavior is actually more
>> sensible -- each time the generator gets resumed, the next bit of code
>> runs in the context of whoever called next(), and the generator is
>> just passively inheriting context, so ... that makes sense.
>
> Now that you raise this point, I think it means that generators need
> to retain their current context inheritance behaviour, simply for
> backwards compatibility purposes. This means that the case we need to
> enable is the one where the generator *doesn't* dynamically adjust its
> execution context to match that of the calling function.

Nobody *intentionally* iterates a generator manually in different decimal contexts (or any other contexts). This is an extremely error-prone thing to do, because one refactoring of the generator -- rearranging yields -- would wreck your custom iteration/context logic. I don't think that any real code relies on this, and I don't think that we are breaking backwards compatibility here in any way. How many users actually need this?

If someone does need this, it's possible to flip `gi_isolated_execution_context` to `False` (as contextmanager does now) and get this behaviour. This might be needed for frameworks like Tornado which support coroutines via generators without 'yield from', but I'll have to verify this.

What I'm saying here is that any sort of context leaking *into* or *out of* a generator *while* it is iterating will likely cause only bugs or undefined behaviour. Take a look at the precision example in the Rationale section of the PEP.

Most of the time generators are created and iterated in the same spot; you rarely create generator closures.

One way the behaviour could be changed, however, is to capture the execution context when the generator is first iterated (as opposed to when it's instantiated), but I don't think it makes any real difference.

Another idea: in one of my initial PEP implementations, I exposed gen.gi_execution_context (same for coroutines) to Python as a read/write attribute. That allowed to (a) get the execution context out of a generator (for introspection or other purposes); (b) inject an execution context for event loops; for instance asyncio.Task could do that for some purpose. Maybe this would be useful for someone who wants to mess with generators and contexts.

[..]
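A toy illustration of that injection idea, since gi_execution_context doesn't exist today -- a wrapper class simulates the attribute, and a plain dict stands in for the EC (all names here are hypothetical):

    _current_ec = {}

    class ContextualGenerator:
        def __init__(self, genfunc, *args):
            self.gi_execution_context = {}   # simulated per-generator EC
            self._gen = genfunc(*args)

        def send(self, value):
            global _current_ec
            saved = _current_ec
            _current_ec = self.gi_execution_context   # activate our EC
            try:
                return self._gen.send(value)
            finally:
                # capture any changes, then restore the caller's EC
                self.gi_execution_context = _current_ec
                _current_ec = saved

    def gen():
        while True:
            yield _current_ec.get('key')

    g = ContextualGenerator(gen)
    g.send(None)                                   # prime the generator
    g.gi_execution_context = {'key': 'injected'}   # "event loop" injects an EC
    print(g.send(None))                            # prints: injected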
> def autonomous_generator(gf):
>     @functools.wraps(gf)
>     def wrapper(*args, **kwds):
>         gi = gf(*args, **kwds)
>         gi.gi_back = gi.gi_frame
>         return gi
>     return wrapper

Nick, I still have to fully grasp the idea of `gi_back`, but one quick thing: I specifically designed the PEP to avoid touching frames. The current design only needs TLS and a little help from the interpreter/core objects adjusting that TLS. It should be very straightforward to implement the PEP in any interpreter (with JIT or without) or compilers like Cython.

[..]

> Given that, you'd have the following initial states for "revert
> context" (currently called "isolated context" in the PEP):
>
> * unawaited coroutines: true (same as PEP)
> * awaited coroutines: false (same as PEP)
> * generators (both sync & async): false (opposite of current PEP)
> * autonomous generators: true (set "gi_revert_context" or
>   "ag_revert_context" explicitly)

If generators do not isolate their context, then the example in the Rationale section will not work as expected (or am I missing something?). Fixing generators' state leak was one of the main goals of the PEP.

Yury

From rymg19 at gmail.com  Sat Aug 12 14:28:14 2017
From: rymg19 at gmail.com (rymg19 at gmail.com)
Date: Sat, 12 Aug 2017 11:28:14 -0700
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: 
Message-ID: 

So, I'm hardly an expert when it comes to things like this, but there are two things about this that don't seem right to me. (Also, I'd love to respond inline, but that's kind of difficult from a mobile phone.)

The first is how set/get_execution_context_item take strings. Inevitably, people are going to do things like:

    CONTEXT_ITEM_NAME = 'foo-bar'
    ...
    sys.set_execution_context_item(CONTEXT_ITEM_NAME, 'stuff')

IMO it would be nicer if there could be a key object used instead, e.g.

    my_key = sys.execution_context_key('name-here-for-debugging-purposes')
    sys.set_execution_context_item(my_key, 'stuff')

The advantage here would be no need for string constants and no potential naming conflicts (the string passed to the key creator would be used just for debugging, kind of like Thread names).

Second thing is this:

    def context(x):
        old_x = get_execution_context_item('x')
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            set_execution_context_item('x', old_x)

If this would be done frequently, a context manager would be a *lot* more Pythonic, e.g.:

    with sys.temp_change_execution_context('x', new_x):
        # ...

-- 
Ryan
Yoko Shimomura, ryo (supercell/EGOIST), Hiroyuki Sawano >> everyone else
http://refi64.com

On Aug 11, 2017 at 5:38 PM, Yury Selivanov wrote:

Hi,

This is a new PEP to implement Execution Contexts in Python.

The PEP is in-flight to python.org, and in the meanwhile can be read on GitHub:

https://github.com/python/peps/blob/master/pep-0550.rst

(it contains a few diagrams and charts, so please read it there.)

Thank you!
Yury


PEP: 550
Title: Execution Context
Version: $Revision$
Last-Modified: $Date$
Author: Yury Selivanov
Status: Draft
Type: Standards Track
Content-Type: text/x-rst
Created: 11-Aug-2017
Python-Version: 3.7
Post-History: 11-Aug-2017


Abstract
========

This PEP proposes a new mechanism to manage execution state--the logical environment in which a function, a thread, a generator, or a coroutine executes in.
A few examples of where having a reliable state storage is required:

* Context managers like decimal contexts, ``numpy.errstate``, and ``warnings.catch_warnings``;

* Storing request-related data such as security tokens and request data in web applications;

* Profiling, tracing, and logging in complex and large code bases.

The usual solution for storing state is to use a Thread-local Storage (TLS), implemented in the standard library as ``threading.local()``. Unfortunately, TLS does not work for isolating state of generators or asynchronous code because such code shares a single thread.


Rationale
=========

Traditionally a Thread-local Storage (TLS) is used for storing the state. However, the major flaw of using the TLS is that it works only for multi-threaded code. It is not possible to reliably contain the state within a generator or a coroutine. For example, consider the following generator::

    def calculate(precision, ...):
        with decimal.localcontext() as ctx:
            # Set the precision for decimal calculations
            # inside this block
            ctx.prec = precision

            yield calculate_something()
            yield calculate_something_else()

Decimal context is using a TLS to store the state, and because TLS is not aware of generators, the state can leak. The above code will not work correctly if a user iterates over the ``calculate()`` generator with different precisions in parallel::

    g1 = calculate(100)
    g2 = calculate(50)

    items = list(zip(g1, g2))

    # items[0] will be a tuple of:
    #   first value from g1 calculated with 100 precision,
    #   first value from g2 calculated with 50 precision.
    #
    # items[1] will be a tuple of:
    #   second value from g1 calculated with 50 precision,
    #   second value from g2 calculated with 50 precision.

An even scarier example would be using decimals to represent money in an async/await application: decimal calculations can suddenly lose precision in the middle of processing a request. Currently, bugs like this are extremely hard to find and fix.

Another common need for web applications is to have access to the current request object, or security context, or, simply, the request URL for logging or submitting performance tracing data::

    async def handle_http_request(request):
        context.current_http_request = request

        await ...
        # Invoke your framework code, render templates,
        # make DB queries, etc, and use the global
        # 'current_http_request' in that code.

        # This isn't currently possible to do reliably
        # in asyncio out of the box.

These examples are just a few out of many, where a reliable way to store context data is absolutely needed.

The inability to use TLS for asynchronous code has led to the proliferation of ad-hoc solutions, limited to be supported only by code that was explicitly enabled to work with them. The current status quo is that any library, including the standard library, that uses a TLS, will likely not work as expected in asynchronous code or with generators (see [3]_ as an example issue.)

Some languages that have coroutines or generators recommend manually passing a ``context`` object to every function, see [1]_ describing the pattern for Go. This approach, however, has limited use for Python, where we have a huge ecosystem that was built to work with a TLS-like context. Moreover, passing the context explicitly does not work at all for libraries like ``decimal`` or ``numpy``, which use operator overloading.

.NET runtime, which has support for async/await, has a generic solution for this problem, called ``ExecutionContext`` (see [2]_).
On the surface, working with it is very similar to working with a TLS, but the former explicitly supports asynchronous code.


Goals
=====

The goal of this PEP is to provide a more reliable alternative to ``threading.local()``. It should be explicitly designed to work with the Python execution model, equally supporting threads, generators, and coroutines.

An acceptable solution for Python should meet the following requirements:

* Transparent support for code executing in threads, coroutines, and generators with an easy to use API.

* Negligible impact on the performance of the existing code or the code that will be using the new mechanism.

* Fast C API for packages like ``decimal`` and ``numpy``.

Explicit is still better than implicit, hence the new APIs should only be used when there is no option to pass the state explicitly.

With this PEP implemented, it should be possible to update a context manager like the below::

    _local = threading.local()

    @contextmanager
    def context(x):
        old_x = getattr(_local, 'x', None)
        _local.x = x
        try:
            yield
        finally:
            _local.x = old_x

to a more robust version that can be reliably used in generators and async/await code, with a simple transformation::

    @contextmanager
    def context(x):
        old_x = get_execution_context_item('x')
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            set_execution_context_item('x', old_x)


Specification
=============

This proposal introduces a new concept called Execution Context (EC), along with a set of Python APIs and C APIs to interact with it.

EC is implemented using an immutable mapping. Every modification of the mapping produces a new copy of it. To illustrate what it means let's compare it to how we work with tuples in Python::

    a0 = ()
    a1 = a0 + (1,)
    a2 = a1 + (2,)

    # a0 is an empty tuple
    # a1 is (1,)
    # a2 is (1, 2)

Manipulating an EC object would be similar::

    a0 = EC()
    a1 = a0.set('foo', 'bar')
    a2 = a1.set('spam', 'ham')

    # a0 is an empty mapping
    # a1 is {'foo': 'bar'}
    # a2 is {'foo': 'bar', 'spam': 'ham'}

In CPython, every thread that can execute Python code has a corresponding ``PyThreadState`` object. It encapsulates important runtime information like a pointer to the current frame, and is being used by the ceval loop extensively. We add a new field to ``PyThreadState``, called ``exec_context``, which points to the current EC object.

We also introduce a set of APIs to work with Execution Context. In this section we will only cover two functions that are needed to explain how Execution Context works. See the full list of new APIs in the `New APIs`_ section.

* ``sys.get_execution_context_item(key, default=None)``: lookup ``key`` in the EC of the executing thread. If not found, return ``default``.

* ``sys.set_execution_context_item(key, value)``: get the current EC of the executing thread. Add a ``key``/``value`` item to it, which will produce a new EC object. Set the new object as the current one for the executing thread. In pseudo-code::

      tstate = PyThreadState_GET()

      ec = tstate.exec_context
      ec2 = ec.set(key, value)
      tstate.exec_context = ec2

Note, that some important implementation details and optimizations are omitted here, and will be covered in later sections of this PEP.

Now let's see how Execution Contexts work with regular multi-threaded code, generators, and coroutines.


Regular & Multithreaded Code
----------------------------

For regular Python code, EC behaves just like a thread-local. Any modification of the EC object produces a new one, which is immediately set as the current one for the thread state.
.. figure:: pep-0550/functions.png
   :align: center
   :width: 90%

   Figure 1.  Execution Context flow in a thread.

As Figure 1 illustrates, if a function calls ``set_execution_context_item()``, the modification of the execution context will be visible to all subsequent calls and to the caller::

    def set_foo():
        set_execution_context_item('foo', 'spam')

    set_execution_context_item('foo', 'bar')
    print(get_execution_context_item('foo'))

    set_foo()
    print(get_execution_context_item('foo'))

    # will print:
    #   bar
    #   spam


Coroutines
----------

Python :pep:`492` coroutines are used to implement cooperative multitasking. For a Python end-user they are similar to threads, especially when it comes to sharing resources or modifying the global state.

An event loop is needed to schedule coroutines. Coroutines that are explicitly scheduled by the user are usually called Tasks. When a coroutine is scheduled, it can schedule other coroutines using an ``await`` expression.

In async/await world, awaiting a coroutine can be viewed as a different calling convention: Tasks are similar to threads, and awaiting on coroutines within a Task is similar to calling functions within a thread.

By drawing a parallel between regular multithreaded code and async/await, it becomes apparent that any modification of the execution context within one Task should be visible to all coroutines scheduled within it. Any execution context modifications, however, must not be visible to other Tasks executing within the same thread.

To achieve this, a small set of modifications to the coroutine object is needed:

* When a coroutine object is instantiated, it saves a reference to the current execution context object to its ``cr_execution_context`` attribute.

* Coroutine's ``.send()`` and ``.throw()`` methods are modified as follows (in pseudo-C)::

      if coro->cr_isolated_execution_context:
          # Save a reference to the current execution context
          old_context = tstate->execution_context

          # Set our saved execution context as the current
          # for the current thread.
          tstate->execution_context = coro->cr_execution_context

          try:
              # Perform the actual `Coroutine.send()` or
              # `Coroutine.throw()` call.
              return coro->send(...)
          finally:
              # Save a reference to the updated execution_context.
              # We will need it later, when `.send()` or `.throw()`
              # are called again.
              coro->cr_execution_context = tstate->execution_context

              # Restore thread's execution context to what it was before
              # invoking this coroutine.
              tstate->execution_context = old_context
      else:
          # Perform the actual `Coroutine.send()` or
          # `Coroutine.throw()` call.
          return coro->send(...)

* ``cr_isolated_execution_context`` is a new attribute on coroutine objects. Set to ``True`` by default, it makes any execution context modifications performed by a coroutine stay visible only to that coroutine.

  When the Python interpreter sees an ``await`` instruction, it flips ``cr_isolated_execution_context`` to ``False`` for the coroutine that is about to be awaited. This makes any changes to the execution context made by nested coroutine calls within a Task visible throughout the Task.

  Because the top-level coroutine (Task) cannot be scheduled with ``await`` (in asyncio you need to call ``loop.create_task()`` or ``asyncio.ensure_future()`` to schedule a Task), all execution context modifications are guaranteed to stay within the Task.

* We always work with ``tstate->exec_context``. We use ``coro->cr_execution_context`` only to store the coroutine's execution context when it is not executing.
Figure 2 below illustrates how execution context mutations work with coroutines.

.. figure:: pep-0550/coroutines.png
   :align: center
   :width: 90%

   Figure 2.  Execution Context flow in coroutines.

In the above diagram:

* When "coro1" is created, it saves a reference to the current execution context "2".

* If it makes any change to the context, it will have its own execution context branch "2.1".

* When it awaits on "coro2", any subsequent changes it does to the execution context are visible to "coro1", but not outside of it.

In code::

    async def inner_foo():
        print('inner_foo:', get_execution_context_item('key'))
        set_execution_context_item('key', 2)

    async def foo():
        print('foo:', get_execution_context_item('key'))

        set_execution_context_item('key', 1)
        await inner_foo()

        print('foo:', get_execution_context_item('key'))

    set_execution_context_item('key', 'spam')
    print('main:', get_execution_context_item('key'))

    asyncio.get_event_loop().run_until_complete(foo())
    print('main:', get_execution_context_item('key'))

which will output::

    main: spam
    foo: spam
    inner_foo: 1
    foo: 2
    main: spam

Generator-based coroutines (generators decorated with ``types.coroutine`` or ``asyncio.coroutine``) behave exactly as native coroutines with regards to execution context management: their ``yield from`` expression is semantically equivalent to ``await``.


Generators
----------

Generators in Python, while similar to Coroutines, are used in a fundamentally different way. They are producers of data, and they use the ``yield`` expression to suspend/resume their execution.

A crucial difference between ``await coro`` and ``yield value`` is that the former expression guarantees that the ``coro`` will be executed to the end, while the latter is producing ``value`` and suspending the generator until it gets iterated again.

Generators share 99% of their implementation with coroutines, and thus have similar new attributes ``gi_execution_context`` and ``gi_isolated_execution_context``. Similar to coroutines, generators save a reference to the current execution context when they are instantiated. They have the same implementation of the ``.send()`` and ``.throw()`` methods.

The only difference is that ``gi_isolated_execution_context`` is always set to ``True``, and is never modified by the interpreter. The ``yield from o`` expression in regular generators that are not decorated with ``types.coroutine`` is semantically equivalent to ``for v in o: yield v``.

.. figure:: pep-0550/generators.png
   :align: center
   :width: 90%

   Figure 3.  Execution Context flow in a generator.

In the above diagram:

* When "gen1" is created, it saves a reference to the current execution context "2".

* If it makes any change to the context, it will have its own execution context branch "2.1".

* When "gen2" is created, it saves a reference to the current execution context for it -- "2.1".

* Any subsequent execution context updates in "gen2" will only be visible to "gen2".

* Likewise, any context changes that "gen1" will do after it created "gen2" will not be visible to "gen2".
In code::

    def inner_foo():
        for i in range(3):
            print('inner_foo:', get_execution_context_item('key'))
            set_execution_context_item('key', i)
            yield i

    def foo():
        set_execution_context_item('key', 'spam')
        print('foo:', get_execution_context_item('key'))

        inner = inner_foo()

        while True:
            val = next(inner, None)
            if val is None:
                break
            yield val
            print('foo:', get_execution_context_item('key'))

    set_execution_context_item('key', 'ham')
    print('main:', get_execution_context_item('key'))

    list(foo())

    print('main:', get_execution_context_item('key'))

which will output::

    main: ham
    foo: spam
    inner_foo: spam
    foo: spam
    inner_foo: 0
    foo: spam
    inner_foo: 1
    foo: spam
    main: ham

As we see, any modification of the execution context in a generator is visible only to the generator itself.

There is one use-case where it is desired for generators to affect the surrounding execution context: the ``contextlib.contextmanager`` decorator. To make the following work::

    @contextmanager
    def context(x):
        old_x = get_execution_context_item('x')
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            set_execution_context_item('x', old_x)

we modified ``contextmanager`` to flip the ``gi_isolated_execution_context`` flag to ``False`` on its generator.


Greenlets
---------

Greenlet is an alternative implementation of cooperative scheduling for Python. Although the greenlet package is not part of CPython, popular frameworks like gevent rely on it, and it is important that greenlet can be modified to support execution contexts.

In a nutshell, greenlet design is very similar to the design of generators. The main difference is that for generators, the stack is managed by the Python interpreter. Greenlet works outside of the Python interpreter, and manually saves some ``PyThreadState`` fields and pushes/pops the C-stack. Since Execution Context is implemented on top of ``PyThreadState``, it's easy to add transparent support of it to greenlet.


New APIs
========

Even though this PEP adds a number of new APIs, please keep in mind that most Python users will likely ever use only two of them: ``sys.get_execution_context_item()`` and ``sys.set_execution_context_item()``.


Python
------

1. ``sys.get_execution_context_item(key, default=None)``: lookup ``key`` for the current Execution Context. If not found, return ``default``.

2. ``sys.set_execution_context_item(key, value)``: set a ``key``/``value`` item for the current Execution Context. If ``value`` is ``None``, the item will be removed.

3. ``sys.get_execution_context()``: return the current Execution Context object: ``sys.ExecutionContext``.

4. ``sys.set_execution_context(ec)``: set the passed ``sys.ExecutionContext`` instance as a current one for the current thread.

5. ``sys.ExecutionContext`` object.

   Implementation detail: ``sys.ExecutionContext`` wraps a low-level ``PyExecContextData`` object. ``sys.ExecutionContext`` has a mutable mapping API, abstracting away the real immutable ``PyExecContextData``.

   * ``ExecutionContext()``: construct a new, empty, execution context.

   * ``ec.run(func, *args)`` method: run ``func(*args)`` in the ``ec`` execution context.

   * ``ec[key]``: lookup ``key`` in the ``ec`` context.

   * ``ec[key] = value``: assign a ``key``/``value`` item to the ``ec``.

   * ``ec.get()``, ``ec.items()``, ``ec.values()``, ``ec.keys()``, and ``ec.copy()`` are similar to that of the ``dict`` object.


C API
-----

The C API is different from the Python one because it operates directly on the low-level immutable ``PyExecContextData`` object.
1. New ``PyThreadState->exec_context`` field, pointing to a ``PyExecContextData`` object.

2. ``PyThreadState_SetExecContextItem`` and ``PyThreadState_GetExecContextItem``, similar to ``sys.set_execution_context_item()`` and ``sys.get_execution_context_item()``.

3. ``PyThreadState_GetExecContext``: similar to ``sys.get_execution_context()``. Always returns a ``PyExecContextData`` object. If ``PyThreadState->exec_context`` is ``NULL``, a new and empty one will be created and assigned to ``PyThreadState->exec_context``.

4. ``PyThreadState_SetExecContext``: similar to ``sys.set_execution_context()``.

5. ``PyExecContext_New``: create a new empty ``PyExecContextData`` object.

6. ``PyExecContext_SetItem`` and ``PyExecContext_GetItem``.

The exact layout of ``PyExecContextData`` is private, which allows switching it to a different implementation later. More on that in the `Implementation Details`_ section.


Modifications in Standard Library
=================================

* ``contextlib.contextmanager`` was updated to flip the new ``gi_isolated_execution_context`` attribute on the generator.

* The ``asyncio.events.Handle`` object now captures the current execution context when it is created, and uses the saved execution context to run the callback (with the ``ExecutionContext.run()`` method.) This makes ``loop.call_soon()`` run callbacks in the execution context they were scheduled in.

  No modifications in ``asyncio.Task`` or ``asyncio.Future`` were necessary.

Some standard library modules like ``warnings`` and ``decimal`` can be updated to use new execution contexts. This will be considered in separate issues if this PEP is accepted.


Backwards Compatibility
=======================

This proposal preserves 100% backwards compatibility.


Performance
===========

Implementation Details
----------------------

The new ``PyExecContextData`` object wraps a ``dict`` object. Any modification requires creating a shallow copy of the dict.

While working on the reference implementation of this PEP, we were able to optimize the ``dict.copy()`` operation **5.5x**, see [4]_ for details.

.. figure:: pep-0550/dict_copy.png
   :align: center
   :width: 100%

   Figure 4.

Figure 4 shows that the performance of an immutable dict implemented with shallow copying is expectedly O(n) for the ``set()`` operation. However, this is tolerable until the dict has more than 100 items (1 ``set()`` takes about a microsecond.)

Judging by the number of modules that need EC in the Standard Library, it is likely that real world Python applications will use significantly less than 100 execution context variables.

The important point is that the cost of accessing a key in an Execution Context is always O(1).

If the ``set()`` operation performance is a major concern, we discuss alternative approaches that have O(1) or close ``set()`` performance in the `Alternative Immutable Dict Implementation`_, `Faster C API`_, and `Copy-on-write Execution Context`_ sections.


Generators and Coroutines
-------------------------

Using a microbenchmark for generators and coroutines from :pep:`492` ([12]_), it was possible to observe 0.5 to 1% performance degradation.

asyncio echoserver microbenchmarks from the uvloop project [13]_ showed 1-1.5% performance degradation for asyncio code.

asyncpg benchmarks [14]_, which execute more code and are closer to a real-world application, did not exhibit any noticeable performance change.


Overall Performance Impact
--------------------------

The total number of changed lines in the ceval loop is 2 -- in the ``YIELD_FROM`` opcode implementation.
Only performance of generators and coroutines can be affected by the proposal. This was confirmed by running the Python Performance Benchmark Suite [15]_, which demonstrated that there is no difference between the 3.7 master branch and this PEP's reference implementation branch (full benchmark results can be found here [16]_.)


Design Considerations
=====================

Alternative Immutable Dict Implementation
-----------------------------------------

Languages like Clojure and Scala use Hash Array Mapped Tries (HAMT) to implement high performance immutable collections [5]_, [6]_.

Immutable mappings implemented with HAMT have O(log\ :sub:`32`\ N) performance for both ``set()`` and ``get()`` operations, which will be essentially O(1) for relatively small mappings in EC.

To assess if HAMT can be used for Execution Context, we implemented it in CPython [7]_.

.. figure:: pep-0550/hamt_vs_dict.png
   :align: center
   :width: 100%

   Figure 5.  Benchmark code can be found here: [9]_.

Figure 5 shows that HAMT indeed displays O(1) performance for all benchmarked dictionary sizes. For dictionaries with less than 100 items, HAMT is a bit slower than Python dict/shallow copy.

.. figure:: pep-0550/lookup_hamt.png
   :align: center
   :width: 100%

   Figure 6.  Benchmark code can be found here: [10]_.

Figure 6 shows a comparison of lookup costs between Python dict and an HAMT immutable mapping. HAMT lookup time is 30-40% worse than Python dict lookups on average, which is a very good result, considering how well Python dicts are optimized.

Note, that according to [8]_, the HAMT design can be further improved.

The bottom line is that the current approach of implementing an immutable mapping with a shallow-copied dict will likely perform adequately in real-life applications. The HAMT solution is more future proof, however.

The proposed API is designed in such a way that the underlying implementation of the mapping can be changed completely without affecting the Execution Context `Specification`_, which allows us to switch to HAMT at some point if necessary.


Copy-on-write Execution Context
-------------------------------

The implementation of Execution Context in .NET is different from this PEP. .NET uses a copy-on-write mechanism and a regular mutable mapping.

One way to implement this in CPython would be to have two new fields in ``PyThreadState``:

* ``exec_context`` pointing to the current Execution Context mapping;

* ``exec_context_copy_on_write`` flag, set to ``0`` initially.

The idea is that whenever we are modifying the EC, the copy-on-write flag is checked, and if it is set to ``1``, the EC is copied.

Modifications to the Coroutine and Generator ``.send()`` and ``.throw()`` methods described in the `Coroutines`_ section will be almost the same, except that in addition to the ``gi_execution_context`` they will have a ``gi_exec_context_copy_on_write`` flag. When a coroutine or a generator starts, the flag will be set to ``1``. This will ensure that any modification of the EC performed within a coroutine or a generator will be isolated.

This approach has one advantage:

* For an Execution Context that contains a large number of items, copy-on-write is a more efficient solution than the shallow-copy dict approach.

However, we believe that the copy-on-write disadvantages are more important to consider:

* Copy-on-write behaviour for generators and coroutines makes EC semantics less predictable.

  With the immutable EC approach, generators and coroutines always execute in the EC that was current at the moment of their creation.
  Any modifications to the outer EC while a generator or a coroutine is executing are not visible to them::

      def generator():
          yield 1
          print(get_execution_context_item('key'))
          yield 2

      set_execution_context_item('key', 'spam')
      gen = iter(generator())
      next(gen)

      set_execution_context_item('key', 'ham')
      next(gen)

  The above script will always print 'spam' with the immutable EC. With a copy-on-write approach, the above script will print 'ham'.

  Now, consider that ``generator()`` was refactored to call some library function that uses Execution Context::

      def generator():
          yield 1
          some_function_that_uses_decimal_context()
          print(get_execution_context_item('key'))
          yield 2

  Now, the script will print 'spam', because ``some_function_that_uses_decimal_context`` forced the EC to copy, and the ``set_execution_context_item('key', 'ham')`` line did not affect the ``generator()`` code after all.

* Similarly to the previous point, the ``sys.ExecutionContext.run()`` method will also become less predictable, as ``sys.get_execution_context()`` would still return a reference to the current mutable EC.

  We can't modify ``sys.get_execution_context()`` to return a shallow copy of the current EC, because this would seriously harm performance of ``asyncio.call_soon()`` and similar places, where it is important to propagate the Execution Context.

* Even though copy-on-write requires shallow-copying the execution context object less frequently, copying will still take place in coroutines and generators. In which case, the HAMT approach will perform better for medium to large sized execution contexts.

All in all, we believe that the copy-on-write approach introduces very subtle corner cases that could lead to bugs that are exceptionally hard to discover and fix. The immutable EC solution in comparison is always predictable and easy to reason about. Therefore we believe that any slight performance gain that the copy-on-write solution might offer is not worth it.


Faster C API
------------

Packages like numpy and standard library modules like decimal need to frequently query the global state for some local context configuration. It is important that the APIs they use are as fast as possible.

The proposed ``PyThreadState_SetExecContextItem`` and ``PyThreadState_GetExecContextItem`` functions need to get the current thread state with ``PyThreadState_GET()`` (fast) and then perform a hash lookup (relatively slow). We can eliminate the hash lookup by adding three additional C API functions:

* ``Py_ssize_t PyExecContext_RequestIndex(char *key_name)``: a function similar to the existing ``_PyEval_RequestCodeExtraIndex`` introduced in :pep:`523`. The idea is to request a unique index that can later be used to lookup context items.

  The ``key_name`` can later be used by ``sys.ExecutionContext`` to introspect items added with this API.

* ``PyThreadState_SetExecContextIndexedItem(Py_ssize_t index, PyObject *val)`` and ``PyThreadState_GetExecContextIndexedItem(Py_ssize_t index)`` to request an item by its index, avoiding the cost of hash lookup.


Why does setting a key to None remove the item?
-----------------------------------------------

Consider a context manager::

    @contextmanager
    def context(x):
        old_x = get_execution_context_item('x')
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            set_execution_context_item('x', old_x)

With the ``set_execution_context_item(key, None)`` call removing the ``key``, the user doesn't need to write additional code to remove the ``key`` if it wasn't in the execution context already.
An alternative design with a ``del_execution_context_item()`` method would look like the following::

    @contextmanager
    def context(x):
        not_there = object()
        old_x = get_execution_context_item('x', not_there)
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            if old_x is not_there:
                del_execution_context_item('x')
            else:
                set_execution_context_item('x', old_x)


Can we fix ``PyThreadState_GetDict()``?
---------------------------------------

``PyThreadState_GetDict`` is a TLS, and some of its existing users might depend on it being just a TLS. Changing its behaviour to follow the Execution Context semantics would break backwards compatibility.


PEP 521
-------

:pep:`521` proposes an alternative solution to the problem: enhance the Context Manager Protocol with two new methods: ``__suspend__`` and ``__resume__``. To make it compatible with async/await, the Asynchronous Context Manager Protocol will also need to be extended with ``__asuspend__`` and ``__aresume__``.

This allows implementing context managers like decimal context and ``numpy.errstate`` for generators and coroutines.

The following code::

    class Context:
        def __enter__(self):
            self.old_x = get_execution_context_item('x')
            set_execution_context_item('x', 'something')

        def __exit__(self, *err):
            set_execution_context_item('x', self.old_x)

would become this::

    class Context:
        def __enter__(self):
            self.old_x = get_execution_context_item('x')
            set_execution_context_item('x', 'something')

        def __suspend__(self):
            set_execution_context_item('x', self.old_x)

        def __resume__(self):
            set_execution_context_item('x', 'something')

        def __exit__(self, *err):
            set_execution_context_item('x', self.old_x)

Besides complicating the protocol, the implementation will likely negatively impact performance of coroutines, generators, and any code that uses context managers, and will notably complicate the interpreter implementation. It also does not solve the leaking state problem for greenlet/gevent.

:pep:`521` also does not provide any mechanism to propagate state in a local context, like storing a request object in an HTTP request handler to have better logging.


Can Execution Context be implemented outside of CPython?
--------------------------------------------------------

Because async/await code needs an event loop to run it, an EC-like solution can be implemented in a limited way for coroutines.

Generators, on the other hand, do not have an event loop or trampoline, making it impossible to intercept their ``yield`` points outside of the Python interpreter.


Reference Implementation
========================

The reference implementation can be found here: [11]_.


References
==========

.. [1] https://blog.golang.org/context

.. [2] https://msdn.microsoft.com/en-us/library/system.threading.executioncontext.aspx

.. [3] https://github.com/numpy/numpy/issues/9444

.. [4] http://bugs.python.org/issue31179

.. [5] https://en.wikipedia.org/wiki/Hash_array_mapped_trie

.. [6] http://blog.higher-order.net/2010/08/16/assoc-and-clojures-persistenthashmap-part-ii.html

.. [7] https://github.com/1st1/cpython/tree/hamt

.. [8] https://michael.steindorfer.name/publications/oopsla15.pdf

.. [9] https://gist.github.com/1st1/9004813d5576c96529527d44c5457dcd

.. [10] https://gist.github.com/1st1/dbe27f2e14c30cce6f0b5fddfc8c437e

.. [11] https://github.com/1st1/cpython/tree/pep550

.. [12] https://www.python.org/dev/peps/pep-0492/#async-await

.. [13] https://github.com/MagicStack/uvloop/blob/master/examples/bench/echoserver.py

.. [14] https://github.com/MagicStack/pgbench
.. [15] https://github.com/python/performance

.. [16] https://gist.github.com/1st1/6b7a614643f91ead3edf37c4451a6b4c


Copyright
=========

This document has been placed in the public domain.

From yselivanov.ml at gmail.com  Sat Aug 12 14:55:30 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sat, 12 Aug 2017 14:55:30 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: 
References: 
Message-ID: 

Sure, I'll do.

Yury

From yselivanov.ml at gmail.com  Sat Aug 12 15:00:09 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sat, 12 Aug 2017 15:00:09 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: 
References: 
Message-ID: 

On Sat, Aug 12, 2017 at 2:28 PM, rymg19 at gmail.com wrote:
> So, I'm hardly an expert when it comes to things like this, but there are
> two things about this that don't seem right to me. (Also, I'd love to
> respond inline, but that's kind of difficult from a mobile phone.)
>
> The first is how set/get_execution_context_item take strings. Inevitably,
> people are going to do things like:

Yes, it accepts any hashable Python object as a key.

>     CONTEXT_ITEM_NAME = 'foo-bar'
>     ...
>     sys.set_execution_context_item(CONTEXT_ITEM_NAME, 'stuff')
>
> IMO it would be nicer if there could be a key object used instead, e.g.
>
>     my_key = sys.execution_context_key('name-here-for-debugging-purposes')
>     sys.set_execution_context_item(my_key, 'stuff')

I thought about this, and decided that this is something that can be easily designed on top of the PEP and put into the 'contextlib' module. In practice, this issue can be entirely addressed in the documentation, asking users to prefix their keys with their library/framework/program name.

> The advantage here would be no need for string constants and no potential
> naming conflicts (the string passed to the key creator would be used just
> for debugging, kind of like Thread names).
>
> Second thing is this:
>
>     def context(x):
>         old_x = get_execution_context_item('x')
>         set_execution_context_item('x', x)
>         try:
>             yield
>         finally:
>             set_execution_context_item('x', old_x)
>
> If this would be done frequently, a context manager would be a *lot* more
> Pythonic, e.g.:
>
>     with sys.temp_change_execution_context('x', new_x):
>         # ...

Yes, this is a neat idea and I think we can add such a helper to contextlib.

I want to focus the PEP 550 API on correctness, minimalism, and performance. Nice APIs can then be easily developed on top of it later.

Yury

From yselivanov.ml at gmail.com  Sat Aug 12 16:58:12 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sat, 12 Aug 2017 16:58:12 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: 
References: 
Message-ID: 

Nathaniel, Nick,

I'll reply only to point 9 in this email to split this thread into manageable sub-threads. I'll cover other points in later emails.

On Sat, Aug 12, 2017 at 3:54 AM, Nathaniel Smith wrote:
> 9. OK, my big question, about semantics.
FWIW it took me a good hour to fully understand what you are doing with "fail_after" and what you want from PEP 550, and the actual associated problems with generators :)

> The PEP's design is based on the assumption that all context-local
> state is scalar-like, and contexts split but never join. But there are
> some cases where this isn't true, in particular for values that have
> "stack-like" semantics. These are terms I just made up, but let me
> give some examples. Python's sys.exc_info is one. Another I ran into
> recently is for trio's cancel scopes.

As you yourself show below, it's easy to implement stacks with the proposed EC spec. A linked list will work well enough.

> So basically the background is, in trio you can wrap a context manager
> around any arbitrary chunk of code and then set a timeout or
> explicitly cancel that code. It's called a "cancel scope". These are
> fully nestable. Full details here:
> https://trio.readthedocs.io/en/latest/reference-core.html#cancellation-and-timeouts
>
> Currently, the implementation involves keeping a stack of cancel
> scopes in Task-local storage. This works fine for regular async code
> because when we switch Tasks, we also switch the cancel scope stack.
> But of course it falls apart for generators/async generators:
>
>     async def agen():
>         with fail_after(10):  # 10 second timeout for finishing this block
>             await some_blocking_operation()
>             yield
>             await another_blocking_operation()
>
>     async def caller():
>         with fail_after(20):
>             ag = agen()
>             await ag.__anext__()
>             # now that cancel scope is on the stack, even though we're not
>             # inside the context manager! this will not end well.
>             await some_blocking_operation()  # this might get cancelled
>             # when it shouldn't
>         # even if it doesn't, we'll crash here when exiting the context
>         # manager because we try to pop a cancel scope that isn't at the
>         # top of the stack

Right. So the task always knows the EC at the point of "yield". It can then get the latest timeout from it and act accordingly if that yield did not resume in time. This should work.

> That would fix the case above. But, I think there's another case that's
> kind of a showstopper.
>
>     async def agen():
>         await some_blocking_operation()
>         yield
>
>     async def caller():
>         ag = agen()  # context is captured here
>         with fail_after(10):
>             await ag.__anext__()
>
> Currently this case works correctly: the timeout is applied to the
> __anext__ call, as you'd expect. But with PEP 550, it wouldn't work:
> the generator's timeouts would all be fixed when it was instantiated,
> and we wouldn't be able to detect that the second call has a timeout
> imposed on it. So that's a pretty nasty footgun. Any time you have
> code that's supposed to have a timeout applied, but in fact has no
> timeout applied, then that's a really serious bug -- it can lead to
> hangs, trivial DoS, pagers going off, etc.

As I tried to explain in my last email, I generally don't believe that people would do this partial iteration with timeouts or other contexts around it. The only use case I can come up with so far is implementing some sort of receiver using an AG, and then "listening" on it through "__anext__" calls.
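Coming back to the linked-list point above, a rough sketch of how a cancel-scope stack with "stack-like" semantics could be stored in an EC -- a plain dict simulates the EC here, and tuples serve as immutable list nodes (every name is illustrative only):

    ec = {'cancel_scopes': None}         # None == empty stack

    def push_scope(scope):
        # (value, tail) node; the old list is never mutated, so any
        # snapshot taken earlier still sees its own, shorter stack.
        ec['cancel_scopes'] = (scope, ec['cancel_scopes'])

    def pop_scope():
        head, tail = ec['cancel_scopes']
        ec['cancel_scopes'] = tail
        return head

    push_scope('fail_after(20)')
    snapshot = dict(ec)                  # what a yield would hand back
    push_scope('fail_after(10)')
    print(pop_scope())                   # -> fail_after(10)
    print(snapshot['cancel_scopes'][0])  # snapshot still sees fail_after(20)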
But the case is interesting nevertheless, and maybe we can fix it without relaxing any guarantees of the PEP.

The idea that I have is to allow linking of ExecutionContext (this is similar in a way to what Nick proposed, but has a stricter semantics):

1. The internal ExecutionContext object will have a new "back" attribute.

2. For regular code and coroutines everything that is already in the PEP will stay the same.

3. For generators and asynchronous generators, when a generator is created, an empty ExecutionContext will be created for it, with its "back" attribute pointing to the current EC.

4. The lookup function will be adjusted to check the "EC.back" if the key is not found in the current EC.

5. The max level of the "back" chain will be 1.

6. When a generator is created inside another generator, it will inherit another generator's EC. Because contexts are immutable this should be OK.

7. When a coroutine is created inside an EC with a "back" link, it will merge EC and EC.back in one new EC. Merge can be done very efficiently for HAMT mappings, which I believe we will end up using for this anyways (an O(log32 N) operation).

An illustration of what it will allow:

    def gen():
        yield
        with context(key='spam'):
            yield
        yield

    g = gen()

    context(key=1)
    g.send(None)
    # The code around the first yield will see "key=1"

    context(key=2)
    g.send(None)
    # The code around the second yield will see "key=spam"

    context(key=3)
    g.send(None)
    # The code around the third yield will see "key=3"

Essentially, it makes generators "transparent" to outside context changes, but OTOH fully isolates their local context changes from the outside world. This should solve the "fail_after" over a generator case.

Nathaniel and Nick, what do you think?

Yury

From pfreixes at gmail.com  Sat Aug 12 17:03:16 2017
From: pfreixes at gmail.com (Pau Freixes)
Date: Sat, 12 Aug 2017 23:03:16 +0200
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: 
References: 
Message-ID: 

Good work Yury, going all in on one mechanism will help to avoid increasing the differences between the async and sync worlds in Python.

I really like the idea of the immutable dicts: it makes it easy to inherit the context between tasks/threads/whatever without putting consistency at risk when there are key collisions.

I've just taken a look at the asyncio modifications. Correct me if I'm wrong, but the handler strategy has a side effect: the work done to save and restore the context will be done twice in some situations. It would happen when the callback is in charge of executing a task step, once by the run-in-context method and again by the coroutine. Is that correct?

El 12/08/2017 00:38, "Yury Selivanov" escribió:

[..]
A few examples of where having reliable state storage is required:

* Context managers like decimal contexts, ``numpy.errstate``,
  and ``warnings.catch_warnings``;

* Storing request-related data such as security tokens and request
  data in web applications;

* Profiling, tracing, and logging in complex and large code bases.

The usual solution for storing state is to use Thread-local Storage
(TLS), implemented in the standard library as ``threading.local()``.
Unfortunately, TLS does not work for isolating state of generators or
asynchronous code, because such code shares a single thread.


Rationale
=========

Traditionally, Thread-local Storage (TLS) is used for storing state.
However, the major flaw of using TLS is that it works only for
multi-threaded code. It is not possible to reliably contain state
within a generator or a coroutine. For example, consider the following
generator::

    def calculate(precision, ...):
        with decimal.localcontext() as ctx:
            # Set the precision for decimal calculations
            # inside this block
            ctx.prec = precision

            yield calculate_something()
            yield calculate_something_else()

Decimal context uses TLS to store its state, and because TLS is not
aware of generators, the state can leak. The above code will not work
correctly if a user iterates over the ``calculate()`` generator with
different precisions in parallel::

    g1 = calculate(100)
    g2 = calculate(50)

    items = list(zip(g1, g2))

    # items[0] will be a tuple of:
    #   first value from g1 calculated with 100 precision,
    #   first value from g2 calculated with 50 precision.
    #
    # items[1] will be a tuple of:
    #   second value from g1 calculated with 50 precision,
    #   second value from g2 calculated with 50 precision.

An even scarier example would be using decimals to represent money in
an async/await application: decimal calculations can suddenly lose
precision in the middle of processing a request. Currently, bugs like
this are extremely hard to find and fix.

Another common need for web applications is to have access to the
current request object, or security context, or, simply, the request
URL for logging or submitting performance tracing data::

    async def handle_http_request(request):
        context.current_http_request = request

        await ...
        # Invoke your framework code, render templates,
        # make DB queries, etc, and use the global
        # 'current_http_request' in that code.

        # This isn't currently possible to do reliably
        # in asyncio out of the box.

These examples are just a few out of many where a reliable way to
store context data is absolutely needed.

The inability to use TLS for asynchronous code has led to a
proliferation of ad-hoc solutions, which are limited to being supported
only by code that was explicitly enabled to work with them.

The current status quo is that any library, including the standard
library, that uses TLS will likely not work as expected in asynchronous
code or with generators (see [3]_ as an example issue.)

Some languages that have coroutines or generators recommend manually
passing a ``context`` object to every function; see [1]_ describing the
pattern for Go. This approach, however, has limited use for Python,
where we have a huge ecosystem that was built to work with a TLS-like
context. Moreover, passing the context explicitly does not work at all
for libraries like ``decimal`` or ``numpy``, which use operator
overloading.

The .NET runtime, which has support for async/await, has a generic
solution to this problem, called ``ExecutionContext`` (see [2]_). On
the surface, working with it is very similar to working with a TLS, but
the former explicitly supports asynchronous code.
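The decimal example above depends on the internals of the ``decimal``
module; the same leak can be demonstrated today in a few self-contained,
runnable lines using ``threading.local()`` directly::

    import threading

    _local = threading.local()

    def calculate(tag):
        _local.tag = tag      # set "context" for this generator
        yield _local.tag      # fine on the first iteration...
        yield _local.tag      # ...but by now another generator on the
                              # same thread may have overwritten the value

    g1 = calculate('g1')
    g2 = calculate('g2')
    print(list(zip(g1, g2)))
    # prints [('g1', 'g2'), ('g2', 'g2')] --
    # g1's second value leaked from g2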
Goals
=====

The goal of this PEP is to provide a more reliable alternative to
``threading.local()``. It should be explicitly designed to work with
the Python execution model, equally supporting threads, generators,
and coroutines.

An acceptable solution for Python should meet the following
requirements:

* Transparent support for code executing in threads, coroutines,
  and generators, with an easy-to-use API.

* Negligible impact on the performance of the existing code or the
  code that will be using the new mechanism.

* Fast C API for packages like ``decimal`` and ``numpy``.

Explicit is still better than implicit, hence the new APIs should only
be used when there is no option to pass the state explicitly.

With this PEP implemented, it should be possible to update a context
manager like the one below::

    _local = threading.local()

    @contextmanager
    def context(x):
        old_x = getattr(_local, 'x', None)
        _local.x = x
        try:
            yield
        finally:
            _local.x = old_x

to a more robust version that can be reliably used in generators
and async/await code, with a simple transformation::

    @contextmanager
    def context(x):
        old_x = get_execution_context_item('x')
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            set_execution_context_item('x', old_x)


Specification
=============

This proposal introduces a new concept called Execution Context (EC),
along with a set of Python APIs and C APIs to interact with it.

EC is implemented using an immutable mapping. Every modification of
the mapping produces a new copy of it. To illustrate what this means,
let's compare it to how we work with tuples in Python::

    a0 = ()
    a1 = a0 + (1,)
    a2 = a1 + (2,)

    # a0 is an empty tuple
    # a1 is (1,)
    # a2 is (1, 2)

Manipulating an EC object would be similar::

    a0 = EC()
    a1 = a0.set('foo', 'bar')
    a2 = a1.set('spam', 'ham')

    # a0 is an empty mapping
    # a1 is {'foo': 'bar'}
    # a2 is {'foo': 'bar', 'spam': 'ham'}

In CPython, every thread that can execute Python code has a
corresponding ``PyThreadState`` object. It encapsulates important
runtime information like a pointer to the current frame, and is used
by the ceval loop extensively. We add a new field to ``PyThreadState``,
called ``exec_context``, which points to the current EC object.

We also introduce a set of APIs to work with Execution Context. In
this section we will only cover the two functions that are needed to
explain how Execution Context works. See the full list of new APIs in
the `New APIs`_ section.

* ``sys.get_execution_context_item(key, default=None)``: look up
  ``key`` in the EC of the executing thread. If not found, return
  ``default``.

* ``sys.set_execution_context_item(key, value)``: get the current EC
  of the executing thread. Add a ``key``/``value`` item to it, which
  will produce a new EC object. Set the new object as the current one
  for the executing thread. In pseudo-code::

      tstate = PyThreadState_GET()
      ec = tstate.exec_context
      ec2 = ec.set(key, value)
      tstate.exec_context = ec2

Note that some important implementation details and optimizations are
omitted here, and will be covered in later sections of this PEP.
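The tuple analogy above can be turned into a tiny runnable model of the
baseline implementation (a dict that is shallow-copied on every write).
This is only an illustration of the semantics, not the proposed API or
the C-level ``PyExecContextData``::

    class ImmutableDict:
        """Toy model of the PEP's immutable mapping."""

        __slots__ = ('_d',)

        def __init__(self, d=()):
            self._d = dict(d)

        def set(self, key, value):
            # Every write copies the underlying dict: O(n) writes,
            # O(1) reads, and old versions are never mutated.
            new = ImmutableDict(self._d)
            new._d[key] = value
            return new

        def get(self, key, default=None):
            return self._d.get(key, default)

    a0 = ImmutableDict()
    a1 = a0.set('foo', 'bar')
    a2 = a1.set('spam', 'ham')

    assert a0.get('foo') is None
    assert a2.get('foo') == 'bar' and a2.get('spam') == 'ham'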
Now let's see how Execution Contexts work with regular multi-threaded
code, generators, and coroutines.


Regular & Multithreaded Code
----------------------------

For regular Python code, EC behaves just like a thread-local. Any
modification of the EC object produces a new one, which is immediately
set as the current one for the thread state.

.. figure:: pep-0550/functions.png
   :align: center
   :width: 90%

   Figure 1. Execution Context flow in a thread.

As Figure 1 illustrates, if a function calls
``set_execution_context_item()``, the modification of the execution
context will be visible to all subsequent calls and to the caller::

    def set_foo():
        set_execution_context_item('foo', 'spam')

    set_execution_context_item('foo', 'bar')
    print(get_execution_context_item('foo'))

    set_foo()
    print(get_execution_context_item('foo'))

    # will print:
    #   bar
    #   spam


Coroutines
----------

Python :pep:`492` coroutines are used to implement cooperative
multitasking. For a Python end-user they are similar to threads,
especially when it comes to sharing resources or modifying the global
state.

An event loop is needed to schedule coroutines. Coroutines that are
explicitly scheduled by the user are usually called Tasks. When a
coroutine is scheduled, it can schedule other coroutines using an
``await`` expression. In the async/await world, awaiting a coroutine
can be viewed as a different calling convention: Tasks are similar to
threads, and awaiting on coroutines within a Task is similar to calling
functions within a thread.

By drawing a parallel between regular multithreaded code and
async/await, it becomes apparent that any modification of the execution
context within one Task should be visible to all coroutines scheduled
within it. Any execution context modifications, however, must not be
visible to other Tasks executing within the same thread.

To achieve this, a small set of modifications to the coroutine object
is needed:

* When a coroutine object is instantiated, it saves a reference to the
  current execution context object to its ``cr_execution_context``
  attribute.

* The coroutine's ``.send()`` and ``.throw()`` methods are modified as
  follows (in pseudo-C)::

      if coro->cr_isolated_execution_context:
          # Save a reference to the current execution context
          old_context = tstate->execution_context

          # Set our saved execution context as the current
          # for the current thread.
          tstate->execution_context = coro->cr_execution_context

          try:
              # Perform the actual `Coroutine.send()` or
              # `Coroutine.throw()` call.
              return coro->send(...)
          finally:
              # Save a reference to the updated execution_context.
              # We will need it later, when `.send()` or `.throw()`
              # are called again.
              coro->cr_execution_context = tstate->execution_context

              # Restore thread's execution context to what it was before
              # invoking this coroutine.
              tstate->execution_context = old_context
      else:
          # Perform the actual `Coroutine.send()` or
          # `Coroutine.throw()` call.
          return coro->send(...)

* ``cr_isolated_execution_context`` is a new attribute on coroutine
  objects. Set to ``True`` by default, it makes any execution context
  modifications performed by the coroutine stay visible only to that
  coroutine.

  When the Python interpreter sees an ``await`` instruction, it flips
  ``cr_isolated_execution_context`` to ``False`` for the coroutine that
  is about to be awaited. This makes any changes to the execution
  context made by nested coroutine calls within a Task visible
  throughout the Task.

  Because the top-level coroutine (Task) cannot be scheduled with
  ``await`` (in asyncio you need to call ``loop.create_task()`` or
  ``asyncio.ensure_future()`` to schedule a Task), all execution
  context modifications are guaranteed to stay within the Task.

* We always work with ``tstate->exec_context``. We use
  ``coro->cr_execution_context`` only to store the coroutine's
  execution context when it is not executing.
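To make the pseudo-C above concrete, here is a rough pure-Python
emulation of the same save/restore dance, with a module global standing
in for ``tstate->execution_context``. The real mechanism lives inside
the interpreter, so treat this strictly as an illustration::

    _current_ec = {}   # stand-in for tstate->execution_context; never mutated

    def ec_get(key, default=None):
        return _current_ec.get(key, default)

    def ec_set(key, value):
        global _current_ec
        _current_ec = {**_current_ec, key: value}   # "immutable" update

    class IsolatedStep:
        """Drive a generator so its EC changes stay private to it."""

        def __init__(self, gen):
            self.gen = gen
            self.ec = _current_ec      # captured at creation time

        def send(self, value=None):
            global _current_ec
            saved = _current_ec        # "old_context" in the pseudo-C
            _current_ec = self.ec
            try:
                return self.gen.send(value)
            finally:
                self.ec = _current_ec  # keep updates for the next step
                _current_ec = saved    # restore the caller's EC

    def gen():
        ec_set('key', 'inner')
        yield ec_get('key')

    step = IsolatedStep(gen())
    ec_set('key', 'outer')
    print(step.send())     # prints: inner
    print(ec_get('key'))   # prints: outer -- the caller's EC is untouched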
Figure 2 below illustrates how execution context mutations work with
coroutines.

.. figure:: pep-0550/coroutines.png
   :align: center
   :width: 90%

   Figure 2. Execution Context flow in coroutines.

In the above diagram:

* When "coro1" is created, it saves a reference to the current
  execution context "2".

* If it makes any change to the context, it will have its own
  execution context branch "2.1".

* When it awaits on "coro2", any subsequent changes it does to the
  execution context are visible to "coro1", but not outside of it.

In code::

    async def inner_foo():
        print('inner_foo:', get_execution_context_item('key'))
        set_execution_context_item('key', 2)

    async def foo():
        print('foo:', get_execution_context_item('key'))

        set_execution_context_item('key', 1)
        await inner_foo()

        print('foo:', get_execution_context_item('key'))

    set_execution_context_item('key', 'spam')
    print('main:', get_execution_context_item('key'))

    asyncio.get_event_loop().run_until_complete(foo())

    print('main:', get_execution_context_item('key'))

which will output::

    main: spam
    foo: spam
    inner_foo: 1
    foo: 2
    main: spam

Generator-based coroutines (generators decorated with
``types.coroutine`` or ``asyncio.coroutine``) behave exactly as native
coroutines with regards to execution context management: their
``yield from`` expression is semantically equivalent to ``await``.


Generators
----------

Generators in Python, while similar to coroutines, are used in a
fundamentally different way. They are producers of data, and they use
the ``yield`` expression to suspend/resume their execution.

A crucial difference between ``await coro`` and ``yield value`` is
that the former expression guarantees that the ``coro`` will be
executed to the end, while the latter produces ``value`` and suspends
the generator until it gets iterated again.

Generators share 99% of their implementation with coroutines, and thus
have similar new attributes ``gi_execution_context`` and
``gi_isolated_execution_context``. Similar to coroutines, generators
save a reference to the current execution context when they are
instantiated. They have the same implementation of the ``.send()`` and
``.throw()`` methods.

The only difference is that ``gi_isolated_execution_context`` is
always set to ``True``, and is never modified by the interpreter. The
``yield from o`` expression in regular generators that are not
decorated with ``types.coroutine`` is semantically equivalent to
``for v in o: yield v``.

.. figure:: pep-0550/generators.png
   :align: center
   :width: 90%

   Figure 3. Execution Context flow in a generator.

In the above diagram:

* When "gen1" is created, it saves a reference to the current
  execution context "2".

* If it makes any change to the context, it will have its own
  execution context branch "2.1".

* When "gen2" is created, it saves a reference to the current
  execution context for it -- "2.1".

* Any subsequent execution context updates in "gen2" will only be
  visible to "gen2".

* Likewise, any context changes that "gen1" will do after it created
  "gen2" will not be visible to "gen2".
In code::

    def inner_foo():
        for i in range(3):
            print('inner_foo:', get_execution_context_item('key'))
            set_execution_context_item('key', i)
            yield i

    def foo():
        set_execution_context_item('key', 'spam')
        print('foo:', get_execution_context_item('key'))

        inner = inner_foo()

        while True:
            val = next(inner, None)
            if val is None:
                break
            yield val
            print('foo:', get_execution_context_item('key'))

    set_execution_context_item('key', 'ham')
    print('main:', get_execution_context_item('key'))

    list(foo())

    print('main:', get_execution_context_item('key'))

which will output::

    main: ham
    foo: spam
    inner_foo: spam
    foo: spam
    inner_foo: 0
    foo: spam
    inner_foo: 1
    foo: spam
    main: ham

As we see, any modification of the execution context in a generator is
visible only to the generator itself.

There is one use case where it is desired for generators to affect the
surrounding execution context: the ``contextlib.contextmanager``
decorator. To make the following work::

    @contextmanager
    def context(x):
        old_x = get_execution_context_item('x')
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            set_execution_context_item('x', old_x)

we modified ``contextmanager`` to flip the
``gi_isolated_execution_context`` flag to ``False`` on its generator.


Greenlets
---------

Greenlet is an alternative implementation of cooperative scheduling
for Python. Although the greenlet package is not part of CPython,
popular frameworks like gevent rely on it, and it is important that
greenlet can be modified to support execution contexts.

In a nutshell, greenlet's design is very similar to the design of
generators. The main difference is that for generators, the stack is
managed by the Python interpreter. Greenlet works outside of the
Python interpreter, and manually saves some ``PyThreadState`` fields
and pushes/pops the C-stack. Since Execution Context is implemented on
top of ``PyThreadState``, it's easy to add transparent support for it
to greenlet.


New APIs
========

Even though this PEP adds a number of new APIs, please keep in mind
that most Python users will likely only ever use two of them:
``sys.get_execution_context_item()`` and
``sys.set_execution_context_item()``.


Python
------

1. ``sys.get_execution_context_item(key, default=None)``: look up
   ``key`` for the current Execution Context. If not found, return
   ``default``.

2. ``sys.set_execution_context_item(key, value)``: set a
   ``key``/``value`` item for the current Execution Context. If
   ``value`` is ``None``, the item will be removed.

3. ``sys.get_execution_context()``: return the current Execution
   Context object: ``sys.ExecutionContext``.

4. ``sys.set_execution_context(ec)``: set the passed
   ``sys.ExecutionContext`` instance as the current one for the
   current thread.

5. ``sys.ExecutionContext`` object (a usage sketch follows this list).

   Implementation detail: ``sys.ExecutionContext`` wraps a low-level
   ``PyExecContextData`` object. ``sys.ExecutionContext`` has a
   mutable mapping API, abstracting away the real immutable
   ``PyExecContextData``.

   * ``ExecutionContext()``: construct a new, empty, execution
     context.

   * ``ec.run(func, *args)`` method: run ``func(*args)`` in the ``ec``
     execution context.

   * ``ec[key]``: look up ``key`` in the ``ec`` context.

   * ``ec[key] = value``: assign a ``key``/``value`` item to the
     ``ec``.

   * ``ec.get()``, ``ec.items()``, ``ec.values()``, ``ec.keys()``, and
     ``ec.copy()`` are similar to those of the ``dict`` object.
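A short usage sketch of the ``sys.ExecutionContext`` object described
in item 5. All names are the PEP's proposed API, so this does not run
on any released Python, and the assumption that the calling thread's
own EC is unaffected by ``ec.run()`` follows from the immutability
guarantees above::

    import sys

    def report():
        print(sys.get_execution_context_item('user'))

    ec = sys.ExecutionContext()   # a new, empty execution context
    ec['user'] = 'alice'          # mutable-mapping facade over the
                                  # immutable low-level PyExecContextData

    ec.run(report)                # prints: alice
    report()                      # prints: None -- the thread's own EC
                                  # is not affected by ec.run()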
C API
-----

The C API is different from the Python one because it operates directly
on the low-level immutable ``PyExecContextData`` object.

1. A new ``PyThreadState->exec_context`` field, pointing to a
   ``PyExecContextData`` object.

2. ``PyThreadState_SetExecContextItem`` and
   ``PyThreadState_GetExecContextItem``: similar to
   ``sys.set_execution_context_item()`` and
   ``sys.get_execution_context_item()``.

3. ``PyThreadState_GetExecContext``: similar to
   ``sys.get_execution_context()``. Always returns a
   ``PyExecContextData`` object. If ``PyThreadState->exec_context`` is
   ``NULL``, a new and empty one will be created and assigned to
   ``PyThreadState->exec_context``.

4. ``PyThreadState_SetExecContext``: similar to
   ``sys.set_execution_context()``.

5. ``PyExecContext_New``: create a new empty ``PyExecContextData``
   object.

6. ``PyExecContext_SetItem`` and ``PyExecContext_GetItem``.

The exact layout of ``PyExecContextData`` is private, which allows
switching it to a different implementation later. More on that in the
`Implementation Details`_ section.


Modifications in Standard Library
=================================

* ``contextlib.contextmanager`` was updated to flip the new
  ``gi_isolated_execution_context`` attribute on the generator.

* The ``asyncio.events.Handle`` object now captures the current
  execution context when it is created, and uses the saved execution
  context to run the callback (with the ``ExecutionContext.run()``
  method.) This makes ``loop.call_soon()`` run callbacks in the
  execution context they were scheduled in (see the sketch after this
  section).

  No modifications in ``asyncio.Task`` or ``asyncio.Future`` were
  necessary.

Some standard library modules like ``warnings`` and ``decimal`` can be
updated to use the new execution contexts. This will be considered in
separate issues if this PEP is accepted.
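Here is a hedged sketch of what the ``asyncio.events.Handle`` change
described above might look like, assuming the PEP's
``sys.get_execution_context()`` and ``ExecutionContext.run()`` APIs.
It is a simplified stand-in, not the actual asyncio patch::

    import sys

    class Handle:
        """Simplified stand-in for asyncio.events.Handle."""

        def __init__(self, callback, args, loop):
            self._callback = callback
            self._args = args
            self._loop = loop
            # Capture the EC that is current when the callback is
            # scheduled (e.g. by loop.call_soon()).
            self._ec = sys.get_execution_context()

        def _run(self):
            # Run the callback in the captured EC, so it sees the
            # context it was scheduled in, not the loop's context.
            self._ec.run(self._callback, *self._args)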
Backwards Compatibility
=======================

This proposal preserves 100% backwards compatibility.


Performance
===========

Implementation Details
----------------------

The new ``PyExecContextData`` object wraps a ``dict`` object. Any
modification requires creating a shallow copy of the dict.

While working on the reference implementation of this PEP, we were
able to optimize the ``dict.copy()`` operation **5.5x**, see [4]_ for
details.

.. figure:: pep-0550/dict_copy.png
   :align: center
   :width: 100%

   Figure 4.

Figure 4 shows that the performance of an immutable dict implemented
with shallow copying is expectedly O(n) for the ``set()`` operation.
However, this is tolerable until the dict has more than 100 items (one
``set()`` takes about a microsecond.)

Judging by the number of modules that need EC in the Standard Library,
it is likely that real-world Python applications will use significantly
fewer than 100 execution context variables.

The important point is that the cost of accessing a key in an
Execution Context is always O(1).

If the ``set()`` operation performance is a major concern, we discuss
alternative approaches that have O(1) or close ``set()`` performance
in the `Alternative Immutable Dict Implementation`_, `Faster C API`_,
and `Copy-on-write Execution Context`_ sections.


Generators and Coroutines
-------------------------

Using a microbenchmark for generators and coroutines from :pep:`492`
([12]_), it was possible to observe 0.5 to 1% performance degradation.

asyncio echoserver microbenchmarks from the uvloop project [13]_
showed 1-1.5% performance degradation for asyncio code.

asyncpg benchmarks [14]_, which execute more code and are closer to a
real-world application, did not exhibit any noticeable performance
change.


Overall Performance Impact
--------------------------

The total number of changed lines in the ceval loop is 2 -- in the
``YIELD_FROM`` opcode implementation. Only the performance of
generators and coroutines can be affected by the proposal.

This was confirmed by running the Python Performance Benchmark Suite
[15]_, which demonstrated that there is no difference between the 3.7
master branch and this PEP's reference implementation branch (full
benchmark results can be found here [16]_.)
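The O(n) ``set()`` behaviour from Figure 4 is easy to reproduce with a
few lines of today's Python, using a dict merge as a stand-in for the
copy-then-insert step (absolute numbers will of course vary by machine
and interpreter version)::

    import timeit

    for n in (10, 100, 1000):
        d = {i: i for i in range(n)}
        # One simulated EC.set(): copy the mapping, then add one key.
        t = timeit.timeit(lambda: {**d, 'key': 'value'}, number=100_000)
        print(f'{n:5d} items: {t / 100_000 * 1e9:9.1f} ns per set()')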
Design Considerations
=====================

Alternative Immutable Dict Implementation
-----------------------------------------

Languages like Clojure and Scala use Hash Array Mapped Tries (HAMT) to
implement high performance immutable collections [5]_, [6]_.

Immutable mappings implemented with HAMT have O(log\ :sub:`32`\ N)
performance for both ``set()`` and ``get()`` operations, which will be
essentially O(1) for relatively small mappings in EC.

To assess if HAMT can be used for Execution Context, we implemented it
in CPython [7]_.

.. figure:: pep-0550/hamt_vs_dict.png
   :align: center
   :width: 100%

   Figure 5. Benchmark code can be found here: [9]_.

Figure 5 shows that HAMT indeed displays O(1) performance for all
benchmarked dictionary sizes. For dictionaries with less than 100
items, HAMT is a bit slower than Python dict/shallow copy.

.. figure:: pep-0550/lookup_hamt.png
   :align: center
   :width: 100%

   Figure 6. Benchmark code can be found here: [10]_.

Figure 6 shows a comparison of lookup costs between Python dict and an
HAMT immutable mapping. HAMT lookup time is 30-40% worse than Python
dict lookups on average, which is a very good result, considering how
well Python dicts are optimized.

Note that, according to [8]_, the HAMT design can be further improved.

The bottom line is that the current approach of implementing an
immutable mapping with a shallow-copying dict will likely perform
adequately in real-life applications. The HAMT solution is more future
proof, however.

The proposed API is designed in such a way that the underlying
implementation of the mapping can be changed completely without
affecting the Execution Context `Specification`_, which allows us to
switch to HAMT at some point if necessary.


Copy-on-write Execution Context
-------------------------------

The implementation of Execution Context in .NET is different from this
PEP. .NET uses a copy-on-write mechanism and a regular mutable
mapping.

One way to implement this in CPython would be to have two new fields
in ``PyThreadState``:

* ``exec_context`` pointing to the current Execution Context mapping;
* ``exec_context_copy_on_write`` flag, set to ``0`` initially.

The idea is that whenever we are modifying the EC, the copy-on-write
flag is checked, and if it is set to ``1``, the EC is copied.

Modifications to the Coroutine and Generator ``.send()`` and
``.throw()`` methods described in the `Coroutines`_ section will be
almost the same, except that in addition to the
``gi_execution_context`` they will have a
``gi_exec_context_copy_on_write`` flag. When a coroutine or a
generator starts, the flag will be set to ``1``. This will ensure that
any modification of the EC performed within a coroutine or a generator
will be isolated.

This approach has one advantage:

* For an Execution Context that contains a large number of items,
  copy-on-write is a more efficient solution than the shallow-copy
  dict approach.

However, we believe that the copy-on-write disadvantages are more
important to consider:

* Copy-on-write behaviour for generators and coroutines makes EC
  semantics less predictable.

  With the immutable EC approach, generators and coroutines always
  execute in the EC that was current at the moment of their creation.
  Any modifications to the outer EC while a generator or a coroutine
  is executing are not visible to them::

      def generator():
          yield 1
          print(get_execution_context_item('key'))
          yield 2

      set_execution_context_item('key', 'spam')
      gen = iter(generator())
      next(gen)
      set_execution_context_item('key', 'ham')
      next(gen)

  The above script will always print 'spam' with the immutable EC.

  With a copy-on-write approach, the above script will print 'ham'.
  Now, consider that ``generator()`` was refactored to call some
  library function that uses the Execution Context::

      def generator():
          yield 1
          some_function_that_uses_decimal_context()
          print(get_execution_context_item('key'))
          yield 2

  Now, the script will print 'spam', because
  ``some_function_that_uses_decimal_context`` forced the EC to copy,
  and the ``set_execution_context_item('key', 'ham')`` line did not
  affect the ``generator()`` code after all.

* Similarly to the previous point, the ``sys.ExecutionContext.run()``
  method will also become less predictable, as
  ``sys.get_execution_context()`` would still return a reference to
  the current mutable EC.

  We can't modify ``sys.get_execution_context()`` to return a shallow
  copy of the current EC, because this would seriously harm the
  performance of ``asyncio.call_soon()`` and similar places, where it
  is important to propagate the Execution Context.

* Even though copy-on-write requires shallow-copying the execution
  context object less frequently, copying will still take place in
  coroutines and generators. In that case, the HAMT approach will
  perform better for medium to large sized execution contexts.

All in all, we believe that the copy-on-write approach introduces very
subtle corner cases that could lead to bugs that are exceptionally
hard to discover and fix. The immutable EC solution in comparison is
always predictable and easy to reason about. Therefore we believe that
any slight performance gain that the copy-on-write solution might
offer is not worth it.


Faster C API
------------

Packages like numpy and standard library modules like decimal need to
frequently query the global state for some local context
configuration. It is important that the APIs they use are as fast as
possible.

The proposed ``PyThreadState_SetExecContextItem`` and
``PyThreadState_GetExecContextItem`` functions need to get the current
thread state with ``PyThreadState_GET()`` (fast) and then perform a
hash lookup (relatively slow). We can eliminate the hash lookup by
adding three additional C API functions:

* ``Py_ssize_t PyExecContext_RequestIndex(char *key_name)``: a
  function similar to the existing ``_PyEval_RequestCodeExtraIndex``
  introduced in :pep:`523`. The idea is to request a unique index that
  can later be used to look up context items.

  The ``key_name`` can later be used by ``sys.ExecutionContext`` to
  introspect items added with this API.

* ``PyThreadState_SetExecContextIndexedItem(Py_ssize_t index, PyObject *val)``
  and ``PyThreadState_GetExecContextIndexedItem(Py_ssize_t index)``
  to request an item by its index, avoiding the cost of hash lookup.


Why does setting a key to None remove the item?
-----------------------------------------------

Consider a context manager::

    @contextmanager
    def context(x):
        old_x = get_execution_context_item('x')
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            set_execution_context_item('x', old_x)

With the ``set_execution_context_item(key, None)`` call removing the
``key``, the user doesn't need to write additional code to remove the
``key`` if it wasn't in the execution context already.
An alternative design with a ``del_execution_context_item()`` method
would look like the following::

    @contextmanager
    def context(x):
        not_there = object()
        old_x = get_execution_context_item('x', not_there)
        set_execution_context_item('x', x)
        try:
            yield
        finally:
            if old_x is not_there:
                del_execution_context_item('x')
            else:
                set_execution_context_item('x', old_x)


Can we fix ``PyThreadState_GetDict()``?
---------------------------------------

``PyThreadState_GetDict`` is a TLS, and some of its existing users
might depend on it being just a TLS. Changing its behaviour to follow
the Execution Context semantics would break backwards compatibility.


PEP 521
-------

:pep:`521` proposes an alternative solution to the problem: enhance
the Context Manager Protocol with two new methods, ``__suspend__`` and
``__resume__``. To make it compatible with async/await, the
Asynchronous Context Manager Protocol would also need to be extended
with ``__asuspend__`` and ``__aresume__``.

This makes it possible to implement context managers like decimal
context and ``numpy.errstate`` for generators and coroutines.

The following code::

    class Context:

        def __enter__(self):
            self.old_x = get_execution_context_item('x')
            set_execution_context_item('x', 'something')

        def __exit__(self, *err):
            set_execution_context_item('x', self.old_x)

would become this::

    class Context:

        def __enter__(self):
            self.old_x = get_execution_context_item('x')
            set_execution_context_item('x', 'something')

        def __suspend__(self):
            set_execution_context_item('x', self.old_x)

        def __resume__(self):
            set_execution_context_item('x', 'something')

        def __exit__(self, *err):
            set_execution_context_item('x', self.old_x)

Besides complicating the protocol, the implementation will likely
negatively impact the performance of coroutines, generators, and any
code that uses context managers, and will notably complicate the
interpreter implementation. It also does not solve the leaking-state
problem for greenlet/gevent.

:pep:`521` also does not provide any mechanism to propagate state in a
local context, like storing a request object in an HTTP request
handler to have better logging.


Can Execution Context be implemented outside of CPython?
--------------------------------------------------------

Because async/await code needs an event loop to run it, an EC-like
solution can be implemented in a limited way for coroutines.
Generators, on the other hand, do not have an event loop or
trampoline, making it impossible to intercept their ``yield`` points
outside of the Python interpreter.


Reference Implementation
========================

The reference implementation can be found here: [11]_.


References
==========

.. [1] https://blog.golang.org/context
.. [2] https://msdn.microsoft.com/en-us/library/system.threading.executioncontext.aspx
.. [3] https://github.com/numpy/numpy/issues/9444
.. [4] http://bugs.python.org/issue31179
.. [5] https://en.wikipedia.org/wiki/Hash_array_mapped_trie
.. [6] http://blog.higher-order.net/2010/08/16/assoc-and-clojures-persistenthashmap-part-ii.html
.. [7] https://github.com/1st1/cpython/tree/hamt
.. [8] https://michael.steindorfer.name/publications/oopsla15.pdf
.. [9] https://gist.github.com/1st1/9004813d5576c96529527d44c5457dcd
.. [10] https://gist.github.com/1st1/dbe27f2e14c30cce6f0b5fddfc8c437e
.. [11] https://github.com/1st1/cpython/tree/pep550
.. [12] https://www.python.org/dev/peps/pep-0492/#async-await
.. [13] https://github.com/MagicStack/uvloop/blob/master/examples/bench/echoserver.py
.. [14] https://github.com/MagicStack/pgbench
.. [15] https://github.com/python/performance
.. [16] https://gist.github.com/1st1/6b7a614643f91ead3edf37c4451a6b4c


Copyright
=========

This document has been placed in the public domain.

_______________________________________________
Python-ideas mailing list
Python-ideas at python.org
https://mail.python.org/mailman/listinfo/python-ideas
Code of Conduct: http://python.org/psf/codeofconduct/

From njs at pobox.com Sat Aug 12 19:35:44 2017
From: njs at pobox.com (Nathaniel Smith)
Date: Sat, 12 Aug 2017 16:35:44 -0700
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To:
References:
Message-ID:

I had an idea for an alternative API that exposes the same
functionality/semantics as the current draft, but that might have some
advantages. It would look like:

    # a "context item" is an object that holds a context-sensitive value
    # each call to create_context_item creates a new one
    ci = sys.create_context_item()

    # Set the value of this item in the current context
    ci.set(value)

    # Get the value of this item in the current context
    value = ci.get()
    value = ci.get(default)

    # To support async libraries, we need some way to capture the whole
    # context.  But an opaque token representing "all context item
    # values" is enough.
    state_token = sys.current_context_state_token()
    sys.set_context_state_token(state_token)
    coro.cr_state_token = state_token
    # etc.

The advantages are:

- Eliminates the current PEP's issues with namespace collision; every
  context item is automatically distinct from all others.

- Eliminates the need for the None-means-del hack.

- Lets the interpreter hide the details of garbage collecting context
  values.

- Allows for more implementation flexibility. This could be
  implemented directly on top of Yury's current prototype. But it
  could also, for example, be implemented by storing the context
  values in a flat array, where each context item is assigned an index
  when it's allocated. In the current draft this is suggested as a
  possible extension for particularly performance-sensitive users, but
  this way we'd have the option of making everything fast without
  changing or extending the API.

As precedent, this is basically the API that low-level thread-local
storage implementations use; see e.g. pthread_key_create,
pthread_getspecific, pthread_setspecific. (And the
allocate-an-index-in-a-table is the implementation that fast
thread-local storage implementations use too.)

-n
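As a concrete illustration of the API proposed above, the PEP's
decimal-style context manager could be written without string keys.
``sys.create_context_item`` and the ``ci.set()``/``ci.get()`` methods
are exactly the names proposed in this message; nothing here exists
yet, so this is a sketch, not working code:

    import sys
    from contextlib import contextmanager

    # One distinct item per context variable -- no key collisions possible.
    _x = sys.create_context_item()

    @contextmanager
    def context(x):
        old_x = _x.get(None)   # remember the value in the current context
        _x.set(x)
        try:
            yield
        finally:
            _x.set(old_x)      # restore the previous value directly;
                               # no None-means-del trick required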
On Fri, Aug 11, 2017 at 3:37 PM, Yury Selivanov wrote:
> [The full PEP 550 draft is quoted here; most of the quote is snipped
> as a duplicate of the posting above.]
> > :pep:`521` also does not provide any mechanism to propagate state > in a local context, like storing a request object in an HTTP request > handler to have better logging. > > > Can Execution Context be implemented outside of CPython? > -------------------------------------------------------- > > Because async/await code needs an event loop to run it, an EC-like > solution can be implemented in a limited way for coroutines. > > Generators, on the other hand, do not have an event loop or > trampoline, making it impossible to intercept their ``yield`` points > outside of the Python interpreter. > > > Reference Implementation > ======================== > > The reference implementation can be found here: [11]_. > > > References > ========== > > .. [1] https://blog.golang.org/context > > .. [2] https://msdn.microsoft.com/en-us/library/system.threading.executioncontext.aspx > > .. [3] https://github.com/numpy/numpy/issues/9444 > > .. [4] http://bugs.python.org/issue31179 > > .. [5] https://en.wikipedia.org/wiki/Hash_array_mapped_trie > > .. [6] http://blog.higher-order.net/2010/08/16/assoc-and-clojures-persistenthashmap-part-ii.html > > .. [7] https://github.com/1st1/cpython/tree/hamt > > .. [8] https://michael.steindorfer.name/publications/oopsla15.pdf > > .. [9] https://gist.github.com/1st1/9004813d5576c96529527d44c5457dcd > > .. [10] https://gist.github.com/1st1/dbe27f2e14c30cce6f0b5fddfc8c437e > > .. [11] https://github.com/1st1/cpython/tree/pep550 > > .. [12] https://www.python.org/dev/peps/pep-0492/#async-await > > .. [13] https://github.com/MagicStack/uvloop/blob/master/examples/bench/echoserver.py > > .. [14] https://github.com/MagicStack/pgbench > > .. [15] https://github.com/python/performance > > .. [16] https://gist.github.com/1st1/6b7a614643f91ead3edf37c4451a6b4c > > > Copyright > ========= > > This document has been placed in the public domain. > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ -- Nathaniel J. Smith -- https://vorpus.org From yselivanov.ml at gmail.com Sat Aug 12 21:27:16 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Sat, 12 Aug 2017 21:27:16 -0400 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: Yes, I considered this idea myself, but ultimately rejected it because: 1. Current solution makes it easy to introspect things. Get the current EC and print it out. Although the context item idea could be extended to `sys.create_context_item('description')` to allow that. 2. What if we want to pickle the EC? If all items in it are pickleable, it's possible to dump the EC, send it over the network, and re-use in some other process. It's not something I want to consider in the PEP right now, but it's something that the current design theoretically allows. AFAIU, `ci = sys.create_context_item()` context item wouldn't be possible to pickle/unpickle correctly, no? Some more comments: On Sat, Aug 12, 2017 at 7:35 PM, Nathaniel Smith wrote: [..] > The advantages are: > - Eliminates the current PEP's issues with namespace collision; every > context item is automatically distinct from all others. TBH I think that the collision issue is slightly exaggerated. > - Eliminates the need for the None-means-del hack. I consider Execution Context to be an API, not a collection. 
It's an important distinction; if you view it that way, deletion on
None doesn't look that esoteric.

> - Lets the interpreter hide the details of garbage collecting context values.

I'm not sure I understand how the current PEP design is bad from the
GC standpoint. Or how this proposal can be different, FWIW.

> - Allows for more implementation flexibility. This could be
> implemented directly on top of Yury's current prototype. But it could
> also, for example, be implemented by storing the context values in a
> flat array, where each context item is assigned an index when it's
> allocated.

You still want to have this optimization only for *some* keys. So I
think a separate API is still needed.

Yury

From ncoghlan at gmail.com  Sat Aug 12 22:09:47 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 13 Aug 2017 12:09:47 +1000
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To:
References:
Message-ID:

On 13 August 2017 at 03:53, Yury Selivanov wrote:
> On Sat, Aug 12, 2017 at 1:09 PM, Nick Coghlan wrote:
>> Now that you raise this point, I think it means that generators need
>> to retain their current context inheritance behaviour, simply for
>> backwards compatibility purposes. This means that the case we need to
>> enable is the one where the generator *doesn't* dynamically adjust its
>> execution context to match that of the calling function.
>
> Nobody *intentionally* iterates a generator manually in different
> decimal contexts (or any other contexts). This is an extremely
> error-prone thing to do, because one refactoring of the generator --
> rearranging yields -- would wreck your custom iteration/context logic.
> I don't think that any real code relies on this, and I don't think
> that we are breaking backwards compatibility here in any way. How many
> users care about this?

I think this is a reasonable stance for the PEP to take, but the
hidden execution state around the "isolated or not" behaviour still
bothers me.

In some ways it reminds me of the way function parameters work: the
bound parameters are effectively a *shallow* copy of the passed
arguments, so callers can decide whether or not they want the callee
to be able to modify them based on the arguments' mutability (or lack
thereof).

The execution context proposal uses copy-on-write semantics for
runtime efficiency, but it's essentially the same shallow copy concept
applied to __next__(), send() and throw() operations (and perhaps
__anext__(), asend(), and athrow() - I haven't wrapped my head around
the implications for async generators and context managers yet).

That similarity makes me wonder whether the "isolated or not"
behaviour could be moved from the object being executed and directly
into the key/value pairs themselves based on whether or not the values
were mutable, as that's the way function calls work: if the argument
is immutable, the callee *can't* change it, while if it's mutable, the
callee can mutate it, but it still can't rebind it to refer to a
different object.

The way I'd see that working with an always-reverted copy-on-write
execution context:

1. If a parent context wants child contexts to be able to make
changes, then it should put a *mutable* object in the context (e.g. a
list or class instance)
2. If a parent context *does not* want child contexts to be able to
make changes, then it should put an *immutable* object in the context
(e.g. a tuple or number)
3. If a child context *wants* to share a context key with its parent,
then it should *mutate* it in place
4.
If a child context *does not* want to share a context key with its parent, then it should *rebind* it to a different object That way, instead of reverted-or-not-reverted being an all-or-nothing interpreter level decision, it can be made on a key-by-key basis by choosing whether or not to use a mutable value. To make that a little less abstract, consider a concrete example like setting a "my_web_framework.request" key: 1. The step of *setting* the key will *not* be shared with the parent context, as that modifies the underlying copy-on-write namespace, and will hence be reverted when control is passed back to the parent 2. Any *mutation* of the request object *will* be shared, since mutating the value doesn't have any effect on the copy-on-write namespace Nathaniel's example of wanting stack-like behaviour could be modeled using tuples as values: when the child context appends to the tuple, it will necessarily have to create a new tuple and rebind the corresponding key, causing the changes to be invisible to the parent context. The contextlib.contextmanager use case could then be modeled as a *separate* method that skipped the save/revert context management step (e.g. "send_with_shared_context", "throw_with_shared_context") > If someone does need this, it's possible to flip > `gi_isolated_execution_context` to `False` (as contextmanager does > now) and get this behaviour. This might be needed for frameworks like > Tornado which support coroutines via generators without 'yield from', > but I'll have to verify this. Working through this above, I think the key points that bother me about the stateful revert-or-not setting is that whether or not context reversion is desirable depends mainly on two things: - the specific key in question (indicated by mutable vs immutable values) - the intent of the code in the parent context (which could be indicated by calling different methods) It *doesn't* seem to be an inherent property of a given generator or coroutine, except insofar as there's a correlation between the code that creates generators & coroutines and the code that subsequently invokes them. > Another idea: in one of my initial PEP implementations, I exposed > gen.gi_execution_context (same for coroutines) to python as read/write > attribute. That allowed to > > (a) get the execution context out of generator (for introspection or > other purposes); > > (b) inject execution context for event loops; for instance > asyncio.Task could do that for some purpose. > > Maybe this would be useful for someone who wants to mess with > generators and contexts. Yeah, this would be useful, and could potentially avoid the need to expose a parallel set of "*_with_shared_context" methods - instead, contextlib.contextmanager could just invoke the underlying generator with an isolated context, and then set the parent context to the generator's one if it changed. > [..] >> >> def autonomous_generator(gf): >> @functools.wraps(gf) >> def wrapper(*args, **kwds): >> gi = genfunc(*args, **kwds) >> gi.gi_back = gi.gi_frame >> return gi >> return wrapper > > Nick, I still have to fully grasp the idea of `gi_back`, but one quick > thing: I specifically designed the PEP to avoid touching frames. The > current design only needs TLS and a little help from the > interpreter/core objects adjusting that TLS. It should be very > straightforward to implement the PEP in any interpreter (with JIT or > without) or compilers like Cython. 
I think you can just ignore that idea for now, as I've convinced myself it's orthogonal to the question of how we handle execution contexts. > [..] >> Given that, you'd have the following initial states for "revert >> context" (currently called "isolated context" in the PEP): >> >> * unawaited coroutines: true (same as PEP) >> * awaited coroutines: false (same as PEP) >> * generators (both sync & async): false (opposite of current PEP) >> * autonomous generators: true (set "gi_revert_context" or >> "ag_revert_context" explicitly) > > If generators do not isolate their context, then the example in the > Rationale section will not work as expected (or am I missing > something?). Fixing generators state leak was one of the main goals of > the PEP. Agreed - see above :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From njs at pobox.com Sat Aug 12 22:15:48 2017 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 12 Aug 2017 19:15:48 -0700 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: On Sat, Aug 12, 2017 at 6:27 PM, Yury Selivanov wrote: > Yes, I considered this idea myself, but ultimately rejected it because: > > 1. Current solution makes it easy to introspect things. Get the > current EC and print it out. Although the context item idea could be > extended to `sys.create_context_item('description')` to allow that. My first draft actually had the description argument :-). But then I deleted it on the grounds that there's also no way to introspect a list of all threading.local objects, and no-one seems to be bothered by that, so why should we bother here. Obviously it'd be trivial to add though, yeah; I don't really care either way. > 2. What if we want to pickle the EC? If all items in it are > pickleable, it's possible to dump the EC, send it over the network, > and re-use in some other process. It's not something I want to > consider in the PEP right now, but it's something that the current > design theoretically allows. AFAIU, `ci = sys.create_context_item()` > context item wouldn't be possible to pickle/unpickle correctly, no? That's true. In this API, supporting pickling would require some kind of opt-in on the part of EC users. But... pickling would actually need to be opt-in anyway. Remember, the set of all EC items is a piece of global shared state; we expect new entries to appear when random 3rd party libraries are imported. So we have no idea what is in there or what it's being used for. Blindly pickling the whole context will lead to bugs (when code unexpectedly ends up with context that wasn't designed to go across processes) and crashes (there's no guarantee that all the objects are even pickleable). If we do decide we want to support this in the future then we could add a generic opt-in mechanism something like: MY_CI = sys.create_context_item(__name__, "MY_CI", pickleable=True) But I'm not sure that it even make sense to have a global flag enabling pickle. Probably it's better to have separate flags to opt-in to different libraries that might want to pickle in different situations for different reasons: pickleable-by-dask, pickleable-by-curio.run_in_process, ... And that's doable without any special interpreter support. E.g. you could have curio.Local(pickle=True) coordinate with curio.run_in_process. > Some more comments: > > On Sat, Aug 12, 2017 at 7:35 PM, Nathaniel Smith wrote: > [..] 
>> The advantages are:
>> - Eliminates the current PEP's issues with namespace collision; every
>> context item is automatically distinct from all others.
>
> TBH I think that the collision issue is slightly exaggerated.
>
>> - Eliminates the need for the None-means-del hack.
>
> I consider Execution Context to be an API, not a collection. It's an
> important distinction; if you view it that way, deletion on None
> doesn't look that esoteric.

Deletion on None is still a special case that API users need to
remember, and it's a small footgun that you can't just take an
arbitrary Python object and round-trip it through the context.
Obviously these are both APIs and they can do anything that makes
sense, but all else being equal I prefer APIs that have fewer special
cases :-).

>> - Lets the interpreter hide the details of garbage collecting
>> context values.
>
> I'm not sure I understand how the current PEP design is bad from the
> GC standpoint. Or how this proposal can be different, FWIW.

When the ContextItem object becomes unreachable and is collected, then
the interpreter knows that all of the values associated with it in
different contexts are also unreachable and can be collected.

I mentioned this in my email yesterday -- look at the hoops
threading.local jumps through to avoid breaking garbage collection.

This is closely related to the previous point, actually -- AFAICT the
only reason why it *really* matters that None deletes the item is that
you need to be able to delete to free the item from the dictionary,
which only matters if you want to dynamically allocate keys and then
throw them away again. In the ContextItem approach, there's no need to
manually delete the entry; you can just drop your reference to the
ContextItem and let the garbage collector take care of it.

>> - Allows for more implementation flexibility. This could be
>> implemented directly on top of Yury's current prototype. But it could
>> also, for example, be implemented by storing the context values in a
>> flat array, where each context item is assigned an index when it's
>> allocated.
>
> You still want to have this optimization only for *some* keys. So I
> think a separate API is still needed.

Wait, why is it a requirement that some keys be slow? That seems like
a weird requirement :-).

-n

--
Nathaniel J. Smith -- https://vorpus.org

From kevinjacobconway at gmail.com  Sat Aug 12 22:26:40 2017
From: kevinjacobconway at gmail.com (Kevin Conway)
Date: Sun, 13 Aug 2017 02:26:40 +0000
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To:
References:
Message-ID:

As far as providing a thread-local-like surrogate for coroutine-based
systems in Python, we had to solve this for Twisted with
https://bitbucket.org/hipchat/txlocal. Because of the way the Twisted
threadpooling works, we also had to make a context system that was both
coroutine- and thread-safe at the same time.

We have a similar setup for asyncio, but it seems we haven't
open-sourced it. I'll ask around for it if this group feels that an
asyncio example would be beneficial. We implemented both of these in
plain-old Python so they should be compatible beyond CPython.

It's been over a year since I was directly involved with either of
these projects, but added memory and CPU consumption were stats we
watched closely, and we found a negligible increase in both as we
rolled out async context.
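For readers unfamiliar with the txlocal approach Kevin describes, the
general pattern can be sketched in a few lines of plain Python: keep a
mapping from the currently running asyncio Task to its context values,
and copy that mapping whenever a new Task is scheduled. All names below
are hypothetical; this is a rough sketch of the pattern, not the actual
hipchat/txlocal code::

    import asyncio

    _contexts = {}  # hypothetical: maps each Task to its context dict

    def set_item(key, value):
        task = asyncio.Task.current_task()
        _contexts.setdefault(task, {})[key] = value

    def get_item(key, default=None):
        task = asyncio.Task.current_task()
        return _contexts.get(task, {}).get(key, default)

    def spawn(coro):
        # Copy the parent task's context into the child task at
        # scheduling time, so later changes do not leak either way.
        parent = asyncio.Task.current_task()
        child = asyncio.ensure_future(coro)
        _contexts[child] = dict(_contexts.get(parent, {}))
        # Drop the mapping when the task finishes to avoid leaks.
        child.add_done_callback(lambda t: _contexts.pop(t, None))
        return child

The copy-at-spawn step is the part that a plain TLS cannot express, and
it is essentially what PEP 550 proposes to move into the interpreter.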
On Sat, Aug 12, 2017 at 9:16 PM Nathaniel Smith wrote: > On Sat, Aug 12, 2017 at 6:27 PM, Yury Selivanov > wrote: > > Yes, I considered this idea myself, but ultimately rejected it because: > > > > 1. Current solution makes it easy to introspect things. Get the > > current EC and print it out. Although the context item idea could be > > extended to `sys.create_context_item('description')` to allow that. > > My first draft actually had the description argument :-). But then I > deleted it on the grounds that there's also no way to introspect a > list of all threading.local objects, and no-one seems to be bothered > by that, so why should we bother here. Obviously it'd be trivial to > add though, yeah; I don't really care either way. > > > 2. What if we want to pickle the EC? If all items in it are > > pickleable, it's possible to dump the EC, send it over the network, > > and re-use in some other process. It's not something I want to > > consider in the PEP right now, but it's something that the current > > design theoretically allows. AFAIU, `ci = sys.create_context_item()` > > context item wouldn't be possible to pickle/unpickle correctly, no? > > That's true. In this API, supporting pickling would require some kind > of opt-in on the part of EC users. > > But... pickling would actually need to be opt-in anyway. Remember, the > set of all EC items is a piece of global shared state; we expect new > entries to appear when random 3rd party libraries are imported. So we > have no idea what is in there or what it's being used for. Blindly > pickling the whole context will lead to bugs (when code unexpectedly > ends up with context that wasn't designed to go across processes) and > crashes (there's no guarantee that all the objects are even > pickleable). > > If we do decide we want to support this in the future then we could > add a generic opt-in mechanism something like: > > MY_CI = sys.create_context_item(__name__, "MY_CI", pickleable=True) > > But I'm not sure that it even make sense to have a global flag > enabling pickle. Probably it's better to have separate flags to opt-in > to different libraries that might want to pickle in different > situations for different reasons: pickleable-by-dask, > pickleable-by-curio.run_in_process, ... And that's doable without any > special interpreter support. E.g. you could have > curio.Local(pickle=True) coordinate with curio.run_in_process. > > > Some more comments: > > > > On Sat, Aug 12, 2017 at 7:35 PM, Nathaniel Smith wrote: > > [..] > >> The advantages are: > >> - Eliminates the current PEP's issues with namespace collision; every > >> context item is automatically distinct from all others. > > > > TBH I think that the collision issue is slightly exaggerated. > > > >> - Eliminates the need for the None-means-del hack. > > > > I consider Execution Context to be an API, not a collection. It's an > > important distinction, If you view it that way, deletion on None is > > doesn't look that esoteric. > > Deletion on None is still a special case that API users need to > remember, and it's a small footgun that you can't just take an > arbitrary Python object and round-trip it through the context. > Obviously these are both APIs and they can do anything that makes > sense, but all else being equal I prefer APIs that have fewer special > cases :-). > > >> - Lets the interpreter hide the details of garbage collecting context > values. > > > > I'm not sure I understand how the current PEP design is bad from the > > GC standpoint. 
Or how this proposal can be different, FWIW. > > When the ContextItem object becomes unreachable and is collected, then > the interpreter knows that all of the values associated with it in > different contexts are also unreachable and can be collected. > > I mentioned this in my email yesterday -- look at the hoops > threading.local jumps through to avoid breaking garbage collection. > > This is closely related to the previous point, actually -- AFAICT the > only reason why it *really* matters that None deletes the item is that > you need to be able to delete to free the item from the dictionary, > which only matters if you want to dynamically allocate keys and then > throw them away again. In the ContextItem approach, there's no need to > manually delete the entry, you can just drop your reference to the > ContextItem and the the garbage collector take care of it. > > >> - Allows for more implementation flexibility. This could be > >> implemented directly on top of Yury's current prototype. But it could > >> also, for example, be implemented by storing the context values in a > >> flat array, where each context item is assigned an index when it's > >> allocated. > > > > You still want to have this optimization only for *some* keys. So I > > think a separate API is still needed. > > Wait, why is it a requirement that some keys be slow? That seems like > weird requirement :-). > > -n > > -- > Nathaniel J. Smith -- https://vorpus.org > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Sat Aug 12 22:56:20 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 13 Aug 2017 12:56:20 +1000 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: On 13 August 2017 at 11:27, Yury Selivanov wrote: > Yes, I considered this idea myself, but ultimately rejected it because: > > 1. Current solution makes it easy to introspect things. Get the > current EC and print it out. Although the context item idea could be > extended to `sys.create_context_item('description')` to allow that. I think the TLS/TSS precedent means we should seriously consider the ContextItem + ContextStateToken approach for the core low level API. We also have a long history of pain and quirks arising from the locals() builtin being defined as returning a mapping even though function locals are managed as a linear array, so if we can avoid that for the execution context, it will likely be beneficial for both end users (due to less quirky runtime behaviour, especially across implementations) and language implementation developers (due to a reduced need to make something behave like an ordinary mapping when it really isn't). If we decide we want a separate context introspection API (akin to inspect.getcouroutinelocals() and inspect.getgeneratorlocals()), then an otherwise opaque ContextStateToken would be sufficient to enable that. Even if we don't need it for any other reason, having such an API available would be desirable for the regression test suite. 
For example, if context items are hashable, we could have the following arrangement: # Create new context items sys.create_context_item(name) # Opaque token for the current execution context sys.get_context_token() # Switch the current execution context to the given one sys.set_context(context_token) # Snapshot mapping context items to their values in given context sys.get_context_items(context_token) As Nathaniel suggestion, getting/setting/deleting individual items in the current context would be implemented as methods on the ContextItem objects, allowing the return value of "get_context_items" to be a plain dictionary, rather than a special type that directly supported updates to the underlying context. > 2. What if we want to pickle the EC? If all items in it are > pickleable, it's possible to dump the EC, send it over the network, > and re-use in some other process. It's not something I want to > consider in the PEP right now, but it's something that the current > design theoretically allows. AFAIU, `ci = sys.create_context_item()` > context item wouldn't be possible to pickle/unpickle correctly, no? As Nathaniel notes, cooperative partial pickling will be possible regardless of how the low level API works, and starting with a simpler low level API still doesn't rule out adding features like this at a later date. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From yselivanov.ml at gmail.com Sat Aug 12 23:17:07 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Sat, 12 Aug 2017 23:17:07 -0400 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: [replying to list] On Sat, Aug 12, 2017 at 10:56 PM, Nick Coghlan wrote: > On 13 August 2017 at 11:27, Yury Selivanov wrote: >> Yes, I considered this idea myself, but ultimately rejected it because: >> >> 1. Current solution makes it easy to introspect things. Get the >> current EC and print it out. Although the context item idea could be >> extended to `sys.create_context_item('description')` to allow that. > > I think the TLS/TSS precedent means we should seriously consider the > ContextItem + ContextStateToken approach for the core low level API. I actually like the idea and am fully open to it. I'm also curious if it's possible to adapt the flat-array/fast access ideas that Nathaniel mentioned. Yury From ncoghlan at gmail.com Sun Aug 13 00:05:06 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 13 Aug 2017 14:05:06 +1000 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: On 13 August 2017 at 12:15, Nathaniel Smith wrote: > On Sat, Aug 12, 2017 at 6:27 PM, Yury Selivanov wrote: >> Yes, I considered this idea myself, but ultimately rejected it because: >> >> 1. Current solution makes it easy to introspect things. Get the >> current EC and print it out. Although the context item idea could be >> extended to `sys.create_context_item('description')` to allow that. > > My first draft actually had the description argument :-). But then I > deleted it on the grounds that there's also no way to introspect a > list of all threading.local objects, and no-one seems to be bothered > by that, so why should we bother here. 
In the TLS/TSS case, we have the design constraint of wanting to use the platform provided TLS/TSS implementation when available, and standard C APIs generally aren't designed to support rich runtime introspection from regular C code - instead, they expect the debugger, compiler, and standard library to be co-developed such that the debugger knows how to figure out where the latter two have put things at runtime. > Obviously it'd be trivial to > add though, yeah; I don't really care either way. As noted in my other email, I like the idea of making the context dependent state introspection API clearly distinct from the core context dependent state management API. That way the API implementation can focus on using the most efficient data structures for the purpose, rather than being limited to the most efficient data structures that can readily export a Python-style mapping interface. The latter can then be provided purely for introspection purposes. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From njs at pobox.com Sun Aug 13 00:14:38 2017 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 12 Aug 2017 21:14:38 -0700 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: On Sat, Aug 12, 2017 at 9:05 PM, Nick Coghlan wrote: > On 13 August 2017 at 12:15, Nathaniel Smith wrote: >> On Sat, Aug 12, 2017 at 6:27 PM, Yury Selivanov wrote: >>> Yes, I considered this idea myself, but ultimately rejected it because: >>> >>> 1. Current solution makes it easy to introspect things. Get the >>> current EC and print it out. Although the context item idea could be >>> extended to `sys.create_context_item('description')` to allow that. >> >> My first draft actually had the description argument :-). But then I >> deleted it on the grounds that there's also no way to introspect a >> list of all threading.local objects, and no-one seems to be bothered >> by that, so why should we bother here. > > In the TLS/TSS case, we have the design constraint of wanting to use > the platform provided TLS/TSS implementation when available, and > standard C APIs generally aren't designed to support rich runtime > introspection from regular C code - instead, they expect the debugger, > compiler, and standard library to be co-developed such that the > debugger knows how to figure out where the latter two have put things > at runtime. Excellent point. >> Obviously it'd be trivial to >> add though, yeah; I don't really care either way. > > As noted in my other email, I like the idea of making the context > dependent state introspection API clearly distinct from the core > context dependent state management API. > > That way the API implementation can focus on using the most efficient > data structures for the purpose, rather than being limited to the most > efficient data structures that can readily export a Python-style > mapping interface. The latter can then be provided purely for > introspection purposes. Also an excellent point :-). -n -- Nathaniel J. Smith -- https://vorpus.org From yselivanov.ml at gmail.com Sun Aug 13 02:15:27 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Sun, 13 Aug 2017 02:15:27 -0400 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: On Sat, Aug 12, 2017 at 10:56 PM, Nick Coghlan wrote: [..] 
> As Nathaniel suggestion, getting/setting/deleting individual items in > the current context would be implemented as methods on the ContextItem > objects, allowing the return value of "get_context_items" to be a > plain dictionary, rather than a special type that directly supported > updates to the underlying context. The current PEP 550 design returns a "snapshot" of the current EC with sys.get_execution_context(). I.e. if you do ec = sys.get_execution_context() ec['a'] = 'b' # sys.get_execution_context_item('a') will return None You did get a snapshot and you modified it -- but your modifications are not visible anywhere. You can run a function in that modified EC with `ec.run(function)` and that function will see that new 'a' key, but that's it. There's no "magical" updates to the underlying context. Yury From yselivanov.ml at gmail.com Sun Aug 13 02:16:09 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Sun, 13 Aug 2017 02:16:09 -0400 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: On Sat, Aug 12, 2017 at 10:12 AM, Nick Coghlan wrote: [..] > > 1. Are you sure you want to expose the CoW type to pure Python code? Ultimately, why not? The execution context object you get with sys.get_execution_context() is yours to change. Any change to it won't be propagated anywhere, unless you execute something in that context with ExecutionContext.run or set it as a current one. > > The draft API looks fairly error prone to me, as I'm not sure of the > intended differences in behaviour between the following: > > @contextmanager > def context(x): > old_x = sys.get_execution_context_item('x') > sys.set_execution_context_item('x', x) > try: > yield > finally: > sys.set_execution_context_item('x', old_x) > > @contextmanager > def context(x): > old_x = sys.get_execution_context().get('x') > sys.get_execution_context()['x'] = x > try: > yield > finally: > sys.get_execution_context()['x'] = old_x This one (the second example) won't do anything. > > @contextmanager > def context(x): > ec = sys.get_execution_context() > old_x = ec.get('x') > ec['x'] = x > try: > yield > finally: > ec['x'] = old_x This one (the third one) won't do anything either. You can do this: ec = sys.get_execution_context() ec['x'] = x ec.run(my_function) or `sys.set_execution_context(ec)` > > It seems to me that everything would be a lot safer if the *only* > Python level API was a live dynamic view that completely hid the > copy-on-write behaviour behind an "ExecutionContextProxy" type, such > that the last two examples were functionally equivalent to each other > and to the current PEP's get/set functions (rendering the latter > redundant, and allowing it to be dropped from the PEP). So there's no copy-on-write exposed to Python actually. What I am thinking about, though, is that we might not need the sys.set_execution_context() function. If you want to run something with a modified or empty execution context, do it through ExecutionContext.run method. > 2. Do we need an ag_isolated_execution_context for asynchronous > generators? (Modify this question as needed for the answer to the next > question) Yes, we'll need it for contextlib.asynccontextmanager at least. > > 3. It bothers me that *_execution_context points to an actual > execution context, while *_isolated_execution_context is a boolean. > With names that similar I'd expect them to point to the same kind of > object. I think we touched upon this in a parallel thread. 
But I think we can rename "gi_isolated_execution_context" to "gi_execution_context_isolated" or something more readable/obvious. Yury From jonathan at slenders.be Sun Aug 13 05:58:03 2017 From: jonathan at slenders.be (Jonathan Slenders) Date: Sun, 13 Aug 2017 11:58:03 +0200 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: For what it's worth, as part of prompt_toolkit 2.0, I implemented something very similar to Nathaniel's idea some time ago. It works pretty well, but I don't have a strong opinion against an alternative implementation. - The active context is stored as a monotonically increasing integer. - For each local, the actual values are stored in a dictionary that maps the context ID to the value. (Could cause a GC issue - I'm not sure.) - Every time when an executor is started, I have to wrap the callable in a context manager that applies the current context to that thread. - When a new 'Future' is created, I grab the context ID and apply it to the callbacks when the result is set. https://github.com/jonathanslenders/python-prompt-toolkit/blob/5c9ceb42ad9422a3c6a218a939843bdd2cc76f16/prompt_toolkit/eventloop/context.py https://github.com/jonathanslenders/python-prompt-toolkit/blob/5c9ceb42ad9422a3c6a218a939843bdd2cc76f16/prompt_toolkit/eventloop/future.py FYI: In my case, I did not want to pass the currently active "Application" object around all of the code. But when I started supporting telnet, multiple applications could be alive at once, each with a different I/O backend. Therefore the active application needed to be stored in a kind of executing context. When PEP550 gets approved I'll probably make this compatible. It should at least be possible to run prompt_toolkit on the asyncio event loop. Jonathan 2017-08-13 1:35 GMT+02:00 Nathaniel Smith : > I had an idea for an alternative API that exposes the same > functionality/semantics as the current draft, but that might have some > advantages. It would look like: > > # a "context item" is an object that holds a context-sensitive value > # each call to create_context_item creates a new one > ci = sys.create_context_item() > > # Set the value of this item in the current context > ci.set(value) > > # Get the value of this item in the current context > value = ci.get() > value = ci.get(default) > > # To support async libraries, we need some way to capture the whole context > # But an opaque token representing "all context item values" is enough > state_token = sys.current_context_state_token() > sys.set_context_state_token(state_token) > coro.cr_state_token = state_token > # etc. > > The advantages are: > - Eliminates the current PEP's issues with namespace collision; every > context item is automatically distinct from all others. > - Eliminates the need for the None-means-del hack. > - Lets the interpreter hide the details of garbage collecting context > values. > - Allows for more implementation flexibility. This could be > implemented directly on top of Yury's current prototype. But it could > also, for example, be implemented by storing the context values in a > flat array, where each context item is assigned an index when it's > allocated. In the current draft this is suggested as a possible > extension for particularly performance-sensitive users, but this way > we'd have the option of making everything fast without changing or > extending the API. > > As precedent, this is basically the API that low-level thread-local > storage implementations use; see e.g. 
pthread_key_create, > pthread_getspecific, pthread_setspecific. (And the > allocate-an-index-in-a-table is the implementation that fast > thread-local storage implementations use too.) > > -n > > On Fri, Aug 11, 2017 at 3:37 PM, Yury Selivanov > wrote: > > Hi, > > > > This is a new PEP to implement Execution Contexts in Python. > > > > The PEP is in-flight to python.org, and in the meanwhile can > > be read on GitHub: > > > > https://github.com/python/peps/blob/master/pep-0550.rst > > > > (it contains a few diagrams and charts, so please read it there.) > > > > Thank you! > > Yury > > > > > > PEP: 550 > > Title: Execution Context > > Version: $Revision$ > > Last-Modified: $Date$ > > Author: Yury Selivanov > > Status: Draft > > Type: Standards Track > > Content-Type: text/x-rst > > Created: 11-Aug-2017 > > Python-Version: 3.7 > > Post-History: 11-Aug-2017 > > > > > > Abstract > > ======== > > > > This PEP proposes a new mechanism to manage execution state--the > > logical environment in which a function, a thread, a generator, > > or a coroutine executes in. > > > > A few examples of where having a reliable state storage is required: > > > > * Context managers like decimal contexts, ``numpy.errstate``, > > and ``warnings.catch_warnings``; > > > > * Storing request-related data such as security tokens and request > > data in web applications; > > > > * Profiling, tracing, and logging in complex and large code bases. > > > > The usual solution for storing state is to use a Thread-local Storage > > (TLS), implemented in the standard library as ``threading.local()``. > > Unfortunately, TLS does not work for isolating state of generators or > > asynchronous code because such code shares a single thread. > > > > > > Rationale > > ========= > > > > Traditionally a Thread-local Storage (TLS) is used for storing the > > state. However, the major flaw of using the TLS is that it works only > > for multi-threaded code. It is not possible to reliably contain the > > state within a generator or a coroutine. For example, consider > > the following generator:: > > > > def calculate(precision, ...): > > with decimal.localcontext() as ctx: > > # Set the precision for decimal calculations > > # inside this block > > ctx.prec = precision > > > > yield calculate_something() > > yield calculate_something_else() > > > > Decimal context is using a TLS to store the state, and because TLS is > > not aware of generators, the state can leak. The above code will > > not work correctly, if a user iterates over the ``calculate()`` > > generator with different precisions in parallel:: > > > > g1 = calculate(100) > > g2 = calculate(50) > > > > items = list(zip(g1, g2)) > > > > # items[0] will be a tuple of: > > # first value from g1 calculated with 100 precision, > > # first value from g2 calculated with 50 precision. > > # > > # items[1] will be a tuple of: > > # second value from g1 calculated with 50 precision, > > # second value from g2 calculated with 50 precision. > > > > An even scarier example would be using decimals to represent money > > in an async/await application: decimal calculations can suddenly > > lose precision in the middle of processing a request. Currently, > > bugs like this are extremely hard to find and fix. 
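The state leak described above is easy to reproduce with the real
``decimal`` module; the following is a small runnable variant of the
``calculate()`` example (the digits shown assume CPython's default
decimal context)::

    import decimal

    def calculate(precision):
        with decimal.localcontext() as ctx:
            ctx.prec = precision
            yield decimal.Decimal(1) / decimal.Decimal(3)
            yield decimal.Decimal(1) / decimal.Decimal(7)

    g1 = calculate(12)
    g2 = calculate(4)

    print(next(g1))   # 0.333333333333 -- 12 digits, as requested
    print(next(g2))   # 0.3333         -- 4 digits, as requested
    print(next(g1))   # 0.1429 (!)     -- g1 now runs with g2's precision

Because ``decimal.localcontext()`` stores its state in a TLS, the
context set by ``g2`` is still active when ``g1`` is resumed, and the
second value from ``g1`` silently loses precision.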
> > > > Another common need for web applications is to have access to the > > current request object, or security context, or, simply, the request > > URL for logging or submitting performance tracing data:: > > > > async def handle_http_request(request): > > context.current_http_request = request > > > > await ... > > # Invoke your framework code, render templates, > > # make DB queries, etc, and use the global > > # 'current_http_request' in that code. > > > > # This isn't currently possible to do reliably > > # in asyncio out of the box. > > > > These examples are just a few out of many, where a reliable way to > > store context data is absolutely needed. > > > > The inability to use TLS for asynchronous code has lead to > > proliferation of ad-hoc solutions, limited to be supported only by > > code that was explicitly enabled to work with them. > > > > Current status quo is that any library, including the standard > > library, that uses a TLS, will likely not work as expected in > > asynchronous code or with generators (see [3]_ as an example issue.) > > > > Some languages that have coroutines or generators recommend to > > manually pass a ``context`` object to every function, see [1]_ > > describing the pattern for Go. This approach, however, has limited > > use for Python, where we have a huge ecosystem that was built to work > > with a TLS-like context. Moreover, passing the context explicitly > > does not work at all for libraries like ``decimal`` or ``numpy``, > > which use operator overloading. > > > > .NET runtime, which has support for async/await, has a generic > > solution of this problem, called ``ExecutionContext`` (see [2]_). > > On the surface, working with it is very similar to working with a TLS, > > but the former explicitly supports asynchronous code. > > > > > > Goals > > ===== > > > > The goal of this PEP is to provide a more reliable alternative to > > ``threading.local()``. It should be explicitly designed to work with > > Python execution model, equally supporting threads, generators, and > > coroutines. > > > > An acceptable solution for Python should meet the following > > requirements: > > > > * Transparent support for code executing in threads, coroutines, > > and generators with an easy to use API. > > > > * Negligible impact on the performance of the existing code or the > > code that will be using the new mechanism. > > > > * Fast C API for packages like ``decimal`` and ``numpy``. > > > > Explicit is still better than implicit, hence the new APIs should only > > be used when there is no option to pass the state explicitly. > > > > With this PEP implemented, it should be possible to update a context > > manager like the below:: > > > > _local = threading.local() > > > > @contextmanager > > def context(x): > > old_x = getattr(_local, 'x', None) > > _local.x = x > > try: > > yield > > finally: > > _local.x = old_x > > > > to a more robust version that can be reliably used in generators > > and async/await code, with a simple transformation:: > > > > @contextmanager > > def context(x): > > old_x = get_execution_context_item('x') > > set_execution_context_item('x', x) > > try: > > yield > > finally: > > set_execution_context_item('x', old_x) > > > > > > Specification > > ============= > > > > This proposal introduces a new concept called Execution Context (EC), > > along with a set of Python APIs and C APIs to interact with it. > > > > EC is implemented using an immutable mapping. Every modification > > of the mapping produces a new copy of it. 
To illustrate what it > > means let's compare it to how we work with tuples in Python:: > > > > a0 = () > > a1 = a0 + (1,) > > a2 = a1 + (2,) > > > > # a0 is an empty tuple > > # a1 is (1,) > > # a2 is (1, 2) > > > > Manipulating an EC object would be similar:: > > > > a0 = EC() > > a1 = a0.set('foo', 'bar') > > a2 = a1.set('spam', 'ham') > > > > # a0 is an empty mapping > > # a1 is {'foo': 'bar'} > > # a2 is {'foo': 'bar', 'spam': 'ham'} > > > > In CPython, every thread that can execute Python code has a > > corresponding ``PyThreadState`` object. It encapsulates important > > runtime information like a pointer to the current frame, and is > > being used by the ceval loop extensively. We add a new field to > > ``PyThreadState``, called ``exec_context``, which points to the > > current EC object. > > > > We also introduce a set of APIs to work with Execution Context. > > In this section we will only cover two functions that are needed to > > explain how Execution Context works. See the full list of new APIs > > in the `New APIs`_ section. > > > > * ``sys.get_execution_context_item(key, default=None)``: lookup > > ``key`` in the EC of the executing thread. If not found, > > return ``default``. > > > > * ``sys.set_execution_context_item(key, value)``: get the > > current EC of the executing thread. Add a ``key``/``value`` > > item to it, which will produce a new EC object. Set the > > new object as the current one for the executing thread. > > In pseudo-code:: > > > > tstate = PyThreadState_GET() > > ec = tstate.exec_context > > ec2 = ec.set(key, value) > > tstate.exec_context = ec2 > > > > Note, that some important implementation details and optimizations > > are omitted here, and will be covered in later sections of this PEP. > > > > Now let's see how Execution Contexts work with regular multi-threaded > > code, generators, and coroutines. > > > > > > Regular & Multithreaded Code > > ---------------------------- > > > > For regular Python code, EC behaves just like a thread-local. Any > > modification of the EC object produces a new one, which is immediately > > set as the current one for the thread state. > > > > .. figure:: pep-0550/functions.png > > :align: center > > :width: 90% > > > > Figure 1. Execution Context flow in a thread. > > > > As Figure 1 illustrates, if a function calls > > ``set_execution_context_item()``, the modification of the execution > > context will be visible to all subsequent calls and to the caller:: > > > > def set_foo(): > > set_execution_context_item('foo', 'spam') > > > > set_execution_context_item('foo', 'bar') > > print(get_execution_context_item('foo')) > > > > set_foo() > > print(get_execution_context_item('foo')) > > > > # will print: > > # bar > > # spam > > > > > > Coroutines > > ---------- > > > > Python :pep:`492` coroutines are used to implement cooperative > > multitasking. For a Python end-user they are similar to threads, > > especially when it comes to sharing resources or modifying > > the global state. > > > > An event loop is needed to schedule coroutines. Coroutines that > > are explicitly scheduled by the user are usually called Tasks. > > When a coroutine is scheduled, it can schedule other coroutines using > > an ``await`` expression. In async/await world, awaiting a coroutine > > can be viewed as a different calling convention: Tasks are similar to > > threads, and awaiting on coroutines within a Task is similar to > > calling functions within a thread. 
> > > > By drawing a parallel between regular multithreaded code and > > async/await, it becomes apparent that any modification of the > > execution context within one Task should be visible to all coroutines > > scheduled within it. Any execution context modifications, however, > > must not be visible to other Tasks executing within the same thread. > > > > To achieve this, a small set of modifications to the coroutine object > > is needed: > > > > * When a coroutine object is instantiated, it saves a reference to > > the current execution context object to its ``cr_execution_context`` > > attribute. > > > > * Coroutine's ``.send()`` and ``.throw()`` methods are modified as > > follows (in pseudo-C):: > > > > if coro->cr_isolated_execution_context: > > # Save a reference to the current execution context > > old_context = tstate->execution_context > > > > # Set our saved execution context as the current > > # for the current thread. > > tstate->execution_context = coro->cr_execution_context > > > > try: > > # Perform the actual `Coroutine.send()` or > > # `Coroutine.throw()` call. > > return coro->send(...) > > finally: > > # Save a reference to the updated execution_context. > > # We will need it later, when `.send()` or `.throw()` > > # are called again. > > coro->cr_execution_context = tstate->execution_context > > > > # Restore thread's execution context to what it was before > > # invoking this coroutine. > > tstate->execution_context = old_context > > else: > > # Perform the actual `Coroutine.send()` or > > # `Coroutine.throw()` call. > > return coro->send(...) > > > > * ``cr_isolated_execution_context`` is a new attribute on coroutine > > objects. Set to ``True`` by default, it makes any execution context > > modifications performed by coroutine to stay visible only to that > > coroutine. > > > > When Python interpreter sees an ``await`` instruction, it flips > > ``cr_isolated_execution_context`` to ``False`` for the coroutine > > that is about to be awaited. This makes any changes to execution > > context made by nested coroutine calls within a Task to be visible > > throughout the Task. > > > > Because the top-level coroutine (Task) cannot be scheduled with > > ``await`` (in asyncio you need to call ``loop.create_task()`` or > > ``asyncio.ensure_future()`` to schedule a Task), all execution > > context modifications are guaranteed to stay within the Task. > > > > * We always work with ``tstate->exec_context``. We use > > ``coro->cr_execution_context`` only to store coroutine's execution > > context when it is not executing. > > > > Figure 2 below illustrates how execution context mutations work with > > coroutines. > > > > .. figure:: pep-0550/coroutines.png > > :align: center > > :width: 90% > > > > Figure 2. Execution Context flow in coroutines. > > > > In the above diagram: > > > > * When "coro1" is created, it saves a reference to the current > > execution context "2". > > > > * If it makes any change to the context, it will have its own > > execution context branch "2.1". > > > > * When it awaits on "coro2", any subsequent changes it does to > > the execution context are visible to "coro1", but not outside > > of it. 
> > In code::
> >
> >     async def inner_foo():
> >         print('inner_foo:', get_execution_context_item('key'))
> >         set_execution_context_item('key', 2)
> >
> >     async def foo():
> >         print('foo:', get_execution_context_item('key'))
> >
> >         set_execution_context_item('key', 1)
> >         await inner_foo()
> >
> >         print('foo:', get_execution_context_item('key'))
> >
> >     set_execution_context_item('key', 'spam')
> >     print('main:', get_execution_context_item('key'))
> >
> >     asyncio.get_event_loop().run_until_complete(foo())
> >
> >     print('main:', get_execution_context_item('key'))
> >
> > which will output::
> >
> >     main: spam
> >     foo: spam
> >     inner_foo: 1
> >     foo: 2
> >     main: spam
> >
> > Generator-based coroutines (generators decorated with
> > ``types.coroutine`` or ``asyncio.coroutine``) behave exactly like
> > native coroutines with regards to execution context management:
> > their ``yield from`` expression is semantically equivalent to
> > ``await``.
> >
> >
> > Generators
> > ----------
> >
> > Generators in Python, while similar to coroutines, are used in a
> > fundamentally different way.  They are producers of data, and they
> > use the ``yield`` expression to suspend/resume their execution.
> >
> > A crucial difference between ``await coro`` and ``yield value`` is
> > that the former expression guarantees that ``coro`` will be
> > executed to the end, while the latter produces ``value`` and
> > suspends the generator until it gets iterated again.
> >
> > Generators share 99% of their implementation with coroutines, and
> > thus have similar new attributes ``gi_execution_context`` and
> > ``gi_isolated_execution_context``.  Like coroutines, generators
> > save a reference to the current execution context when they are
> > instantiated.  They have the same implementation of the ``.send()``
> > and ``.throw()`` methods.
> >
> > The only difference is that ``gi_isolated_execution_context`` is
> > always set to ``True`` and is never modified by the interpreter.
> > The ``yield from o`` expression in regular generators that are not
> > decorated with ``types.coroutine`` is semantically equivalent to
> > ``for v in o: yield v``.
> >
> > .. figure:: pep-0550/generators.png
> >    :align: center
> >    :width: 90%
> >
> >    Figure 3.  Execution Context flow in a generator.
> >
> > In the above diagram:
> >
> > * When "gen1" is created, it saves a reference to the current
> >   execution context "2".
> >
> > * If it makes any change to the context, it will have its own
> >   execution context branch "2.1".
> >
> > * When "gen2" is created, it saves a reference to the current
> >   execution context for it -- "2.1".
> >
> > * Any subsequent execution context updates in "gen2" will only
> >   be visible to "gen2".
> >
> > * Likewise, any context changes that "gen1" makes after it
> >   created "gen2" will not be visible to "gen2".
> > In code::
> >
> >     def inner_foo():
> >         for i in range(3):
> >             print('inner_foo:', get_execution_context_item('key'))
> >             set_execution_context_item('key', i)
> >             yield i
> >
> >     def foo():
> >         set_execution_context_item('key', 'spam')
> >         print('foo:', get_execution_context_item('key'))
> >
> >         inner = inner_foo()
> >
> >         while True:
> >             val = next(inner, None)
> >             if val is None:
> >                 break
> >             yield val
> >             print('foo:', get_execution_context_item('key'))
> >
> >     set_execution_context_item('key', 'ham')
> >     print('main:', get_execution_context_item('key'))
> >
> >     list(foo())
> >
> >     print('main:', get_execution_context_item('key'))
> >
> > which will output::
> >
> >     main: ham
> >     foo: spam
> >     inner_foo: spam
> >     foo: spam
> >     inner_foo: 0
> >     foo: spam
> >     inner_foo: 1
> >     foo: spam
> >     main: ham
> >
> > As we see, any modification of the execution context in a
> > generator is visible only to the generator itself.
> >
> > There is one use case where it is desirable for generators to
> > affect the surrounding execution context: the
> > ``contextlib.contextmanager`` decorator.  To make the following
> > work::
> >
> >     @contextmanager
> >     def context(x):
> >         old_x = get_execution_context_item('x')
> >         set_execution_context_item('x', x)
> >         try:
> >             yield
> >         finally:
> >             set_execution_context_item('x', old_x)
> >
> > we modified ``contextmanager`` to flip the
> > ``gi_isolated_execution_context`` flag to ``False`` on its
> > generator.
> >
> >
> > Greenlets
> > ---------
> >
> > Greenlet is an alternative implementation of cooperative
> > scheduling for Python.  Although the greenlet package is not part
> > of CPython, popular frameworks like gevent rely on it, and it is
> > important that greenlet can be modified to support execution
> > contexts.
> >
> > In a nutshell, the design of greenlet is very similar to that of
> > generators.  The main difference is that for generators, the stack
> > is managed by the Python interpreter.  Greenlet works outside of
> > the Python interpreter, and manually saves some ``PyThreadState``
> > fields and pushes/pops the C-stack.  Since the Execution Context
> > is implemented on top of ``PyThreadState``, it's easy to add
> > transparent support for it to greenlet.
> >
> >
> > New APIs
> > ========
> >
> > Even though this PEP adds a number of new APIs, please keep in
> > mind that most Python users will likely only ever use two of them:
> > ``sys.get_execution_context_item()`` and
> > ``sys.set_execution_context_item()``.
> >
> >
> > Python
> > ------
> >
> > 1. ``sys.get_execution_context_item(key, default=None)``: look up
> >    ``key`` in the current Execution Context.  If not found,
> >    return ``default``.
> >
> > 2. ``sys.set_execution_context_item(key, value)``: set a
> >    ``key``/``value`` item in the current Execution Context.
> >    If ``value`` is ``None``, the item will be removed.
> >
> > 3. ``sys.get_execution_context()``: return the current Execution
> >    Context object: ``sys.ExecutionContext``.
> >
> > 4. ``sys.set_execution_context(ec)``: set the passed
> >    ``sys.ExecutionContext`` instance as the current one for the
> >    current thread.
> >
> > 5. ``sys.ExecutionContext`` object.
> >
> >    Implementation detail: ``sys.ExecutionContext`` wraps a
> >    low-level ``PyExecContextData`` object.  ``sys.ExecutionContext``
> >    has a mutable mapping API, abstracting away the real immutable
> >    ``PyExecContextData``.
> >
> >    * ``ExecutionContext()``: construct a new, empty execution
> >      context.
> >
> >    * ``ec.run(func, *args)`` method: run ``func(*args)`` in the
> >      ``ec`` execution context.
> >
> >    * ``ec[key]``: look up ``key`` in the ``ec`` context.
> >
> >    * ``ec[key] = value``: assign a ``key``/``value`` item to the
> >      ``ec``.
> >
> >    * ``ec.get()``, ``ec.items()``, ``ec.values()``, ``ec.keys()``,
> >      and ``ec.copy()`` are similar to those of the ``dict`` object.
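> >
> > As a sketch of how these pieces fit together (hypothetical usage
> > of the proposed APIs; this assumes ``ec.run()`` returns the
> > function's result)::
> >
> >     import sys
> >
> >     def who():
> >         return sys.get_execution_context_item('user', 'anonymous')
> >
> >     sys.set_execution_context_item('user', 'alice')
> >
> >     ec = sys.get_execution_context()  # mutable snapshot wrapper
> >     ec['user'] = 'bob'                # modifies the snapshot only
> >
> >     print(who())        # 'alice' -- the thread's current EC intact
> >     print(ec.run(who))  # 'bob'   -- runs inside the 'ec' context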
> >
> >
> > C API
> > -----
> >
> > The C API is different from the Python one because it operates
> > directly on the low-level immutable ``PyExecContextData`` object.
> >
> > 1. A new ``PyThreadState->exec_context`` field, pointing to a
> >    ``PyExecContextData`` object.
> >
> > 2. ``PyThreadState_SetExecContextItem`` and
> >    ``PyThreadState_GetExecContextItem``, similar to
> >    ``sys.set_execution_context_item()`` and
> >    ``sys.get_execution_context_item()``.
> >
> > 3. ``PyThreadState_GetExecContext``: similar to
> >    ``sys.get_execution_context()``.  Always returns a
> >    ``PyExecContextData`` object.  If ``PyThreadState->exec_context``
> >    is ``NULL``, a new and empty one will be created and assigned
> >    to ``PyThreadState->exec_context``.
> >
> > 4. ``PyThreadState_SetExecContext``: similar to
> >    ``sys.set_execution_context()``.
> >
> > 5. ``PyExecContext_New``: create a new empty ``PyExecContextData``
> >    object.
> >
> > 6. ``PyExecContext_SetItem`` and ``PyExecContext_GetItem``.
> >
> > The exact layout of ``PyExecContextData`` is private, which allows
> > us to switch it to a different implementation later.  More on that
> > in the `Implementation Details`_ section.
> >
> >
> > Modifications in Standard Library
> > =================================
> >
> > * ``contextlib.contextmanager`` was updated to flip the new
> >   ``gi_isolated_execution_context`` attribute on the generator.
> >
> > * The ``asyncio.events.Handle`` object now captures the current
> >   execution context when it is created, and uses the saved
> >   execution context to run the callback (with the
> >   ``ExecutionContext.run()`` method).  This makes
> >   ``loop.call_soon()`` run callbacks in the execution context
> >   in which they were scheduled.
> >
> > No modifications in ``asyncio.Task`` or ``asyncio.Future`` were
> > necessary.
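> >
> > A heavily simplified sketch of the ``Handle`` change described
> > above (the real class also deals with cancellation and error
> > handling)::
> >
> >     class Handle:
> >         def __init__(self, callback, args, loop):
> >             self._callback = callback
> >             self._args = args
> >             self._loop = loop
> >             # Capture the EC at scheduling time (loop.call_soon()).
> >             self._exec_context = sys.get_execution_context()
> >
> >         def _run(self):
> >             # Run the callback in the captured EC, not in whatever
> >             # EC the event loop happens to be in when it fires.
> >             self._exec_context.run(self._callback, *self._args)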
> >
> > Some standard library modules like ``warnings`` and ``decimal``
> > can be updated to use the new execution contexts.  This will be
> > considered in separate issues if this PEP is accepted.
> >
> >
> > Backwards Compatibility
> > =======================
> >
> > This proposal preserves 100% backwards compatibility.
> >
> >
> > Performance
> > ===========
> >
> > Implementation Details
> > ----------------------
> >
> > The new ``PyExecContextData`` object wraps a ``dict`` object.
> > Any modification requires creating a shallow copy of the dict.
> >
> > While working on the reference implementation of this PEP, we were
> > able to optimize the ``dict.copy()`` operation **5.5x**, see [4]_
> > for details.
> >
> > .. figure:: pep-0550/dict_copy.png
> >    :align: center
> >    :width: 100%
> >
> >    Figure 4.
> >
> > Figure 4 shows that the performance of an immutable dict
> > implemented with shallow copying is, as expected, O(n) for the
> > ``set()`` operation.  However, this is tolerable until the dict
> > has more than 100 items (1 ``set()`` takes about a microsecond.)
> >
> > Judging by the number of modules that need the EC in the Standard
> > Library, it is likely that real-world Python applications will use
> > significantly fewer than 100 execution context variables.
> >
> > The important point is that the cost of accessing a key in the
> > Execution Context is always O(1).
> >
> > If the performance of the ``set()`` operation is a major concern,
> > we discuss alternative approaches that have O(1) or close to O(1)
> > ``set()`` performance in the `Alternative Immutable Dict
> > Implementation`_, `Faster C API`_, and `Copy-on-write Execution
> > Context`_ sections.
> >
> >
> > Generators and Coroutines
> > -------------------------
> >
> > Using a microbenchmark for generators and coroutines from
> > :pep:`492` ([12]_), it was possible to observe a 0.5 to 1%
> > performance degradation.
> >
> > asyncio echoserver microbenchmarks from the uvloop project [13]_
> > showed a 1-1.5% performance degradation for asyncio code.
> >
> > asyncpg benchmarks [14]_, which execute more code and are closer
> > to a real-world application, did not exhibit any noticeable
> > performance change.
> >
> >
> > Overall Performance Impact
> > --------------------------
> >
> > The total number of changed lines in the ceval loop is 2 -- in the
> > ``YIELD_FROM`` opcode implementation.  Only the performance of
> > generators and coroutines can be affected by the proposal.
> >
> > This was confirmed by running the Python Performance Benchmark
> > Suite [15]_, which demonstrated that there is no difference
> > between the 3.7 master branch and this PEP's reference
> > implementation branch (full benchmark results can be found
> > here [16]_.)
> >
> >
> > Design Considerations
> > =====================
> >
> > Alternative Immutable Dict Implementation
> > -----------------------------------------
> >
> > Languages like Clojure and Scala use Hash Array Mapped Tries
> > (HAMT) to implement high performance immutable collections
> > [5]_, [6]_.
> >
> > Immutable mappings implemented with HAMT have
> > O(log\ :sub:`32`\ N) performance for both ``set()`` and ``get()``
> > operations, which is essentially O(1) for the relatively small
> > mappings in the EC.
> >
> > To assess if HAMT can be used for the Execution Context, we
> > implemented it in CPython [7]_.
> >
> > .. figure:: pep-0550/hamt_vs_dict.png
> >    :align: center
> >    :width: 100%
> >
> >    Figure 5.  Benchmark code can be found here: [9]_.
> >
> > Figure 5 shows that HAMT indeed displays O(1) performance for all
> > benchmarked dictionary sizes.  For dictionaries with fewer than
> > 100 items, HAMT is a bit slower than the Python dict/shallow-copy
> > approach.
> >
> > .. figure:: pep-0550/lookup_hamt.png
> >    :align: center
> >    :width: 100%
> >
> >    Figure 6.  Benchmark code can be found here: [10]_.
> >
> > Figure 6 shows a comparison of lookup costs between the Python
> > dict and an HAMT immutable mapping.  HAMT lookup time is 30-40%
> > worse than Python dict lookups on average, which is a very good
> > result, considering how well Python dicts are optimized.
> >
> > Note that, according to [8]_, the HAMT design can be further
> > improved.
> >
> > The bottom line is that the current approach of implementing an
> > immutable mapping with a shallow-copied dict will likely perform
> > adequately in real-life applications.  The HAMT solution is more
> > future proof, however.
> >
> > The proposed API is designed in such a way that the underlying
> > implementation of the mapping can be changed completely without
> > affecting the Execution Context `Specification`_, which allows
> > us to switch to HAMT at some point if necessary.
> >
> >
> > Copy-on-write Execution Context
> > -------------------------------
> >
> > The implementation of the Execution Context in .NET is different
> > from this PEP.  .NET uses a copy-on-write mechanism and a regular
> > mutable mapping.
> >
> > One way to implement this in CPython would be to have two new
> > fields in ``PyThreadState``:
> >
> > * ``exec_context``, pointing to the current Execution Context
> >   mapping;
> > * an ``exec_context_copy_on_write`` flag, set to ``0`` initially.
> >
> > The idea is that whenever we are modifying the EC, the
> > copy-on-write flag is checked, and if it is set to ``1``, the EC
> > is copied.
> >
> > Modifications to the Coroutine and Generator ``.send()`` and
> > ``.throw()`` methods described in the `Coroutines`_ section will
> > be almost the same, except that in addition to
> > ``gi_execution_context`` they will have a
> > ``gi_exec_context_copy_on_write`` flag.  When a coroutine or a
> > generator starts, the flag will be set to ``1``.  This ensures
> > that any modification of the EC performed within a coroutine or a
> > generator will be isolated.
> >
> > This approach has one advantage:
> >
> > * For an Execution Context that contains a large number of items,
> >   copy-on-write is a more efficient solution than the shallow-copy
> >   dict approach.
> >
> > However, we believe that the copy-on-write disadvantages are more
> > important to consider:
> >
> > * Copy-on-write behaviour for generators and coroutines makes
> >   EC semantics less predictable.
> >
> >   With the immutable EC approach, generators and coroutines always
> >   execute in the EC that was current at the moment of their
> >   creation.  Any modifications to the outer EC while a generator
> >   or a coroutine is executing are not visible to them::
> >
> >       def generator():
> >           yield 1
> >           print(get_execution_context_item('key'))
> >           yield 2
> >
> >       set_execution_context_item('key', 'spam')
> >       gen = iter(generator())
> >       next(gen)
> >       set_execution_context_item('key', 'ham')
> >       next(gen)
> >
> >   The above script will always print 'spam' with the immutable EC.
> >
> >   With a copy-on-write approach, the above script will print
> >   'ham'.  Now, consider that ``generator()`` was refactored to
> >   call some library function that uses the Execution Context::
> >
> >       def generator():
> >           yield 1
> >           some_function_that_uses_decimal_context()
> >           print(get_execution_context_item('key'))
> >           yield 2
> >
> >   Now the script will print 'spam', because
> >   ``some_function_that_uses_decimal_context`` forced the EC to
> >   copy, and the ``set_execution_context_item('key', 'ham')`` line
> >   did not affect the ``generator()`` code after all.
> >
> > * Similarly to the previous point, the
> >   ``sys.ExecutionContext.run()`` method will also become less
> >   predictable, as ``sys.get_execution_context()`` would still
> >   return a reference to the current mutable EC.
> >
> >   We can't modify ``sys.get_execution_context()`` to return a
> >   shallow copy of the current EC, because this would seriously
> >   harm the performance of ``asyncio.call_soon()`` and similar
> >   places, where it is important to propagate the Execution
> >   Context.
> >
> > * Even though copy-on-write requires shallow copying the execution
> >   context object less frequently, copying will still take place
> >   in coroutines and generators, in which case the HAMT approach
> >   will perform better for medium to large sized execution
> >   contexts.
> >
> > All in all, we believe that the copy-on-write approach introduces
> > very subtle corner cases that could lead to bugs that are
> > exceptionally hard to discover and fix.
> >
> > The immutable EC solution, in comparison, is always predictable
> > and easy to reason about.
> > Therefore we believe that any slight performance gain that the
> > copy-on-write solution might offer is not worth it.
> >
> >
> > Faster C API
> > ------------
> >
> > Packages like numpy and standard library modules like decimal need
> > to frequently query the global state for some local context
> > configuration.  It is important that the APIs they use are as fast
> > as possible.
> >
> > The proposed ``PyThreadState_SetExecContextItem`` and
> > ``PyThreadState_GetExecContextItem`` functions need to get the
> > current thread state with ``PyThreadState_GET()`` (fast) and then
> > perform a hash lookup (relatively slow).  We can eliminate the
> > hash lookup by adding three additional C API functions:
> >
> > * ``Py_ssize_t PyExecContext_RequestIndex(char *key_name)``:
> >   a function similar to the existing
> >   ``_PyEval_RequestCodeExtraIndex`` introduced in :pep:`523`.  The
> >   idea is to request a unique index that can later be used to look
> >   up context items.
> >
> >   The ``key_name`` can later be used by ``sys.ExecutionContext``
> >   to introspect items added with this API.
> >
> > * ``PyThreadState_SetExecContextIndexedItem(Py_ssize_t index,
> >   PyObject *val)`` and
> >   ``PyThreadState_GetExecContextIndexedItem(Py_ssize_t index)``
> >   to set and request an item by its index, avoiding the cost of a
> >   hash lookup.
> >
> >
> > Why does setting a key to None remove the item?
> > -----------------------------------------------
> >
> > Consider a context manager::
> >
> >     @contextmanager
> >     def context(x):
> >         old_x = get_execution_context_item('x')
> >         set_execution_context_item('x', x)
> >         try:
> >             yield
> >         finally:
> >             set_execution_context_item('x', old_x)
> >
> > With the ``set_execution_context_item(key, None)`` call removing
> > the ``key``, the user doesn't need to write additional code to
> > remove the ``key`` if it wasn't in the execution context already.
> >
> > An alternative design with a ``del_execution_context_item()``
> > method would look like the following::
> >
> >     @contextmanager
> >     def context(x):
> >         not_there = object()
> >         old_x = get_execution_context_item('x', not_there)
> >         set_execution_context_item('x', x)
> >         try:
> >             yield
> >         finally:
> >             if old_x is not_there:
> >                 del_execution_context_item('x')
> >             else:
> >                 set_execution_context_item('x', old_x)
> >
> >
> > Can we fix ``PyThreadState_GetDict()``?
> > ---------------------------------------
> >
> > ``PyThreadState_GetDict`` is a TLS, and some of its existing users
> > might depend on it being just a TLS.  Changing its behaviour to
> > follow the Execution Context semantics would break backwards
> > compatibility.
> >
> >
> > PEP 521
> > -------
> >
> > :pep:`521` proposes an alternative solution to the problem:
> > enhance the Context Manager Protocol with two new methods,
> > ``__suspend__`` and ``__resume__``.  To make it compatible with
> > async/await, the Asynchronous Context Manager Protocol will also
> > need to be extended with ``__asuspend__`` and ``__aresume__``.
> >
> > This makes it possible to implement context managers like the
> > decimal context and ``numpy.errstate`` for generators and
> > coroutines.
> > The following code::
> >
> >     class Context:
> >
> >         def __enter__(self):
> >             self.old_x = get_execution_context_item('x')
> >             set_execution_context_item('x', 'something')
> >
> >         def __exit__(self, *err):
> >             set_execution_context_item('x', self.old_x)
> >
> > would become this::
> >
> >     class Context:
> >
> >         def __enter__(self):
> >             self.old_x = get_execution_context_item('x')
> >             set_execution_context_item('x', 'something')
> >
> >         def __suspend__(self):
> >             set_execution_context_item('x', self.old_x)
> >
> >         def __resume__(self):
> >             set_execution_context_item('x', 'something')
> >
> >         def __exit__(self, *err):
> >             set_execution_context_item('x', self.old_x)
> >
> > Besides complicating the protocol, the implementation will likely
> > negatively impact the performance of coroutines, generators, and
> > any code that uses context managers, and will notably complicate
> > the interpreter implementation.  It also does not solve the
> > leaking-state problem for greenlet/gevent.
> >
> > :pep:`521` also does not provide any mechanism to propagate state
> > in a local context, like storing a request object in an HTTP
> > request handler to have better logging.
> >
> >
> > Can Execution Context be implemented outside of CPython?
> > ---------------------------------------------------------
> >
> > Because async/await code needs an event loop to run it, an EC-like
> > solution can be implemented in a limited way for coroutines.
> >
> > Generators, on the other hand, do not have an event loop or
> > trampoline, making it impossible to intercept their ``yield``
> > points outside of the Python interpreter.
> >
> >
> > Reference Implementation
> > ========================
> >
> > The reference implementation can be found here: [11]_.
> >
> >
> > References
> > ==========
> >
> > .. [1] https://blog.golang.org/context
> >
> > .. [2] https://msdn.microsoft.com/en-us/library/system.threading.executioncontext.aspx
> >
> > .. [3] https://github.com/numpy/numpy/issues/9444
> >
> > .. [4] http://bugs.python.org/issue31179
> >
> > .. [5] https://en.wikipedia.org/wiki/Hash_array_mapped_trie
> >
> > .. [6] http://blog.higher-order.net/2010/08/16/assoc-and-clojures-persistenthashmap-part-ii.html
> >
> > .. [7] https://github.com/1st1/cpython/tree/hamt
> >
> > .. [8] https://michael.steindorfer.name/publications/oopsla15.pdf
> >
> > .. [9] https://gist.github.com/1st1/9004813d5576c96529527d44c5457dcd
> >
> > .. [10] https://gist.github.com/1st1/dbe27f2e14c30cce6f0b5fddfc8c437e
> >
> > .. [11] https://github.com/1st1/cpython/tree/pep550
> >
> > .. [12] https://www.python.org/dev/peps/pep-0492/#async-await
> >
> > .. [13] https://github.com/MagicStack/uvloop/blob/master/examples/bench/echoserver.py
> >
> > .. [14] https://github.com/MagicStack/pgbench
> >
> > .. [15] https://github.com/python/performance
> >
> > .. [16] https://gist.github.com/1st1/6b7a614643f91ead3edf37c4451a6b4c
> >
> >
> > Copyright
> > =========
> >
> > This document has been placed in the public domain.
> > _______________________________________________
> > Python-ideas mailing list
> > Python-ideas at python.org
> > https://mail.python.org/mailman/listinfo/python-ideas
> > Code of Conduct: http://python.org/psf/codeofconduct/
>
> --
> Nathaniel J. Smith -- https://vorpus.org
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/

From pfreixes at gmail.com  Sun Aug 13 06:18:12 2017
From: pfreixes at gmail.com (Pau Freixes)
Date: Sun, 13 Aug 2017 12:18:12 +0200
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To:
References:
Message-ID:

Finally got an almost decent internet connection.

Looking at the changes related to this PEP, I can confirm that the
context will be saved twice on every "task switch" in an asyncio
environment: once by the run-in-context call made by the Handle [1],
and immediately afterwards by the ``send()`` call [2] of the coroutine
that belongs to that Task.

As far as I understand, there is no use of the context in the asyncio
layer itself, at least nowadays.  Saving the context at the moment a
Task step is scheduled is, at first sight, useless and might have a
performance impact.

Don't you think that this case, which happens a lot, might somehow be
optimized?  Am I missing something?

[1] https://github.com/1st1/cpython/blob/pep550/Lib/asyncio/events.py#L124
[2] https://github.com/1st1/cpython/blob/pep550/Lib/asyncio/tasks.py#L176
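
Schematically (simplified pseudo-code; names are approximate, see the
linked lines for the real code):

    # One Task step under the pep550 branch, simplified:

    class Handle:
        def _run(self):
            # 1st EC save/restore: the callback (Task._step) runs
            # inside the EC captured at loop.call_soon() time.
            self._exec_context.run(self._callback, *self._args)

    class Task:
        def _step(self, exc=None):
            # 2nd EC save/restore: coro.send() swaps the EC again,
            # per the PEP's "Coroutines" machinery.
            self._coro.send(None)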

On Sat, Aug 12, 2017 at 11:03 PM, Pau Freixes wrote:
> Good work Yury, going all in one will help to not increase the
> differences between the async and the sync worlds in Python.
>
> I really like the idea of the immutable dicts: it makes it easy to
> inherit the context between tasks/threads/whatever without putting
> its consistency at risk if there are further key collisions.
>
> I've just taken a look at the asyncio modifications.  Correct me if
> I'm wrong, but the handler strategy has a side effect: the work done
> to save and restore the context will be done twice in some
> situations.  It would happen when the callback is in charge of
> executing a task step -- once by the run-in-context method and once
> by the coroutine.  Is that correct?
>
> El 12/08/2017 00:38, "Yury Selivanov" escribió:
>
> Hi,
>
> This is a new PEP to implement Execution Contexts in Python.
>
> The PEP is in-flight to python.org, and in the meanwhile can
> be read on GitHub:
>
> https://github.com/python/peps/blob/master/pep-0550.rst
>
> (it contains a few diagrams and charts, so please read it there.)
>
> Thank you!
> Yury
>
> [The full text of PEP 550 was quoted here; it is identical to the
> copy quoted earlier in this thread and has been snipped.]
> --------------------------------------------------------
>
> Because async/await code needs an event loop to run it, an EC-like solution can be implemented in a limited way for coroutines.
>
> Generators, on the other hand, do not have an event loop or trampoline, making it impossible to intercept their ``yield`` points outside of the Python interpreter.
>
>
> Reference Implementation
> ========================
>
> The reference implementation can be found here: [11]_.
>
>
> References
> ==========
>
> .. [1] https://blog.golang.org/context
>
> .. [2] https://msdn.microsoft.com/en-us/library/system.threading.executioncontext.aspx
>
> .. [3] https://github.com/numpy/numpy/issues/9444
>
> .. [4] http://bugs.python.org/issue31179
>
> .. [5] https://en.wikipedia.org/wiki/Hash_array_mapped_trie
>
> .. [6] http://blog.higher-order.net/2010/08/16/assoc-and-clojures-persistenthashmap-part-ii.html
>
> .. [7] https://github.com/1st1/cpython/tree/hamt
>
> .. [8] https://michael.steindorfer.name/publications/oopsla15.pdf
>
> .. [9] https://gist.github.com/1st1/9004813d5576c96529527d44c5457dcd
>
> .. [10] https://gist.github.com/1st1/dbe27f2e14c30cce6f0b5fddfc8c437e
>
> .. [11] https://github.com/1st1/cpython/tree/pep550
>
> .. [12] https://www.python.org/dev/peps/pep-0492/#async-await
>
> .. [13] https://github.com/MagicStack/uvloop/blob/master/examples/bench/echoserver.py
>
> .. [14] https://github.com/MagicStack/pgbench
>
> .. [15] https://github.com/python/performance
>
> .. [16] https://gist.github.com/1st1/6b7a614643f91ead3edf37c4451a6b4c
>
>
> Copyright
> =========
>
> This document has been placed in the public domain.
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/
>

--
--pau

From yselivanov.ml at gmail.com  Sun Aug 13 12:06:59 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sun, 13 Aug 2017 12:06:59 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

Hi Jonathan,

Thanks for the feedback. I'll update the PEP to use Nathaniel's idea of `sys.get_context_key`. It will be a pretty similar API to what you currently have in prompt_toolkit.

Yury

From yselivanov.ml at gmail.com  Sun Aug 13 12:10:49 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sun, 13 Aug 2017 12:10:49 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

Hi Pau,

Re string key collisions -- I decided to update the PEP to follow Nathaniel's suggestion to use a get_context_key API, which will eliminate this problem entirely.

Re call_soon in asyncio.Task -- yes, it does use ec.run() to invoke coroutine.send(). However, this has almost no visible effect, as ExecutionContext.run() is a very cheap operation (think 1-2 function calls). It's possible to add a new keyword arg to call_soon like "ignore_execution_context" to eliminate even this small overhead, but this is something we can easily do later.

Yury

From yselivanov.ml at gmail.com  Sun Aug 13 12:14:53 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sun, 13 Aug 2017 12:14:53 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: <20170812103339.GA2735@bytereef.org> References: <20170812103339.GA2735@bytereef.org> Message-ID:

>> This is a new PEP to implement Execution Contexts in Python.

> The idea is of course great!

Thanks!
> A couple of issues for decimal: > >> Moreover, passing the context explicitly does not work at all for >> libraries like ``decimal`` or ``numpy``, which use operator overloading. > > Instead of "with localcontext() ...", each coroutine can create a new > Context() and use its methods, without any loss of functionality. > > All one loses is the inline operator syntax sugar. > > I'm aware you know all this, but the entire decimal paragraph sounds a bit > as if this option did not exist. The problem is that almost everybody does use the Decimal type directly, as overloaded operators make it so convenient. It's not apparent that using the decimal this way has a dangerous flaw. > >> Fast C API for packages like ``decimal`` and ``numpy``. > > _decimal relies on caching the most recently used thread-local context, > which gives a speedup of about 25% for inline operators: > > https://github.com/python/cpython/blob/master/Modules/_decimal/_decimal.c#L1639 I've seen that, it's a clever trick! With the current PEP 550 semantics it's possible to replicate this trick, you just store a reference to the latest EC in your decimal context for cache invalidation. Because ECs are immutable, it's a safe thing to do. Yury From yselivanov.ml at gmail.com Sun Aug 13 12:33:51 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Sun, 13 Aug 2017 12:33:51 -0400 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: On Sat, Aug 12, 2017 at 10:09 PM, Nick Coghlan wrote: > On 13 August 2017 at 03:53, Yury Selivanov wrote: >> On Sat, Aug 12, 2017 at 1:09 PM, Nick Coghlan wrote: >>> Now that you raise this point, I think it means that generators need >>> to retain their current context inheritance behaviour, simply for >>> backwards compatibility purposes. This means that the case we need to >>> enable is the one where the generator *doesn't* dynamically adjust its >>> execution context to match that of the calling function. >> >> Nobody *intentionally* iterates a generator manually in different >> decimal contexts (or any other contexts). This is an extremely error >> prone thing to do, because one refactoring of generator -- rearranging >> yields -- would wreck your custom iteration/context logic. I don't >> think that any real code relies on this, and I don't think that we are >> breaking backwards compatibility here in any way. How many users need >> about this? > > I think this is a reasonable stance for the PEP to take, but the > hidden execution state around the "isolated or not" behaviour still > bothers me. > > In some ways it reminds me of the way function parameters work: the > bound parameters are effectively a *shallow* copy of the passed > arguments, so callers can decide whether or not they want the callee > to be able to modify them based on the arguments' mutability (or lack > thereof). Mutable default values for function arguments is one of the most confusing things to its users. I've seen numerous threads on StackOverflow/Reddit with people complaining about it. > That similarity makes me wonder whether the "isolated or not" > behaviour could be moved from the object being executed and directly > into the key/value pairs themselves based on whether or not the values > were mutable, as that's the way function calls work: if the argument > is immutable, the callee *can't* change it, while if it's mutable, the > callee can mutate it, but it still can't rebind it to refer to a > different object. 
I'm afraid that if we design the EC to behave differently for mutable and immutable values, it will be even harder for end users to understand.

> 1. If a parent context wants child contexts to be able to make changes, then it should put a *mutable* object in the context (e.g. a list or class instance)
> 2. If a parent context *does not* want child contexts to be able to make changes, then it should put an *immutable* object in the context (e.g. a tuple or number)
> 3. If a child context *wants* to share a context key with its parent, then it should *mutate* it in place
> 4. If a child context *does not* want to share a context key with its parent, then it should *rebind* it to a different object

It's possible to store mutable values even with the current PEP 550 API. The issue Nathaniel has with it is that he actually wants the API to behave exactly as it does, so that he can implement his timeout logic -- but there's a corner case where isolating generator state at the moment of creation doesn't work in his favor.

FWIW I believe that I now have a complete solution for the generator.send() problem that will make it possible for Nathaniel to implement his Trio APIs. The functional PoC is here:

    https://github.com/1st1/cpython/tree/pep550_gen

The key change is to make generators and asynchronous generators:

1. Have their own empty execution context when created. It will be used for whatever local modifications they do to it, ensuring that their state never escapes to the outside world (gi_isolated_execution_context flag is still here for contextmanager).

2. ExecutionContext has a new internal pointer called ec_back. In the Generator.send/throw method, ec_back is dynamically set to the current execution context.

3. This makes it possible for generators to see any outside changes in the execution context *and* have their own, where they can make *local* changes.

So (pseudo-code):

    def gen():
        print('1', context)
        yield
        print('2', context)
        with context(spam='ham'):
            yield
            print('3', context)
        yield
        print('4', context)
        yield

    g = gen()

    context(foo=1, spam='bar')
    next(g)

    context(foo=2)
    next(g)

    context(foo=3)
    next(g)

    context(foo=4)
    next(g)

will print:

    1 {foo=1, spam=bar}
    2 {foo=2, spam=bar}
    3 {foo=3, spam=ham}
    4 {foo=4, spam=bar}

There are some downsides to the approach, mainly from the performance standpoint, but in a common case they will be negligible, if detectable at all.

Yury

From yselivanov.ml at gmail.com  Sun Aug 13 12:57:20 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sun, 13 Aug 2017 12:57:20 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

[replying to the list]

On Sun, Aug 13, 2017 at 6:14 AM, Nick Coghlan wrote:
> On 13 August 2017 at 16:01, Yury Selivanov wrote:
>> On Sat, Aug 12, 2017 at 10:56 PM, Nick Coghlan wrote:
>> [..]
>>> As Nathaniel suggested, getting/setting/deleting individual items in the current context would be implemented as methods on the ContextItem objects, allowing the return value of "get_context_items" to be a plain dictionary, rather than a special type that directly supported updates to the underlying context.
>>
>> The current PEP 550 design returns a "snapshot" of the current EC with sys.get_execution_context().
>>
>> I.e. if you do
>>
>> ec = sys.get_execution_context()
>> ec['a'] = 'b'
>>
>> # sys.get_execution_context_item('a') will return None
>>
>> You did get a snapshot and you modified it -- but your modifications
>> are not visible anywhere.
You can run a function in that modified EC >> with `ec.run(function)` and that function will see that new 'a' key, >> but that's it. There's no "magical" updates to the underlying context. > > In that case, I think "get_execution_context()" is quite misleading as > a name, and is going to be prone to exactly the confusion we currently > have with the mapping returned by locals(), which is that regardless > of whether writes to it affect the target namespace or not, it's going > to be surprising in at least some situations. > > So despite being initially in favour of exposing a mapping-like API at > the Python level, I'm now coming around to Armin Ronacher's point of > view: the copy-on-write semantics for the active context are > sufficiently different from any other mapping type in Python that we > should just avoid the use of __setitem__ and __delitem__ as syntactic > sugar entirely. I agree. I'll be redesigning the PEP to use the following API (please ignore the naming peculiarities, there are so many proposals at this point that I'll just stick to something I have in my head): 1. sys.new_execution_context_key('description') -> sys.ContextItem (or maybe we should just expose the sys.ContextItem type and let people instantiate it?) A key (or "token") to use with the execution context. Besides eliminating the names collision issue, it'll also have a slightly better performance, because its __hash__ method will always return a constant. (Strings cache their __hash__, but other types don't). 2. ContextItem.has(), ContextItem.get(), ContextItem.set(), ContextItem.delete() -- pretty self-explanatory. 3. sys.get_active_context() -> sys.ExecutionContext -- an immutable object, has no methods to modify the context. 3a. sys.ExecutionContext.run(callable, *args) -- run a callable(*args) in some execution context. 3b. sys.ExecutionContext.items() -- an iterator of ContextItem -> value for introspection and debugging purposes. 4. No sys.set_execution_context() method. At this point I'm not sure it's a good idea to allow users to change the current execution context to something else entirely. For use cases like enabling concurrent.futures to run your function within the current EC, you just use the sys.get_active_context()/ExecutionContext.run combination. If anything, we can add this function later. > Instead, we'd lay out the essential primitive operations that *only* > the interpreter can provide and define procedural interfaces for > those, and if anyone wanted to build a higher level object-oriented > interface on top of those primitives, they'd be free to do so, with > the procedural API acting as the abstraction layer that decouples "how > interpreters actually implement it" (e.g. copy-on-write mappings) from > "how libraries and frameworks model it for their own use" (e.g. rich > application context objects). That way, each interpreter would also be > free to define their *internal* object model in whichever way made the > most sense for them, rather than enshrining a point-in-time snaphot of > CPython's preferred implementation model as part of the language > definition. I agree. I like that this idea gives us more flexibility with the exact implementation strategy. [..] > The essential capabilities for active context manipulation would then be: > > - get_active_context_token() > - set_active_context(context_token) As I mentioned above, at this point I'm not entirely sure that we even need "set_active_context". 
The only useful thing for it that I can imagine is creating a decorator that isolates any changes of the context, but the only usecase for this I see is unittests. But even for unittests, a better solution is to use a decorator that detects keys that were added but not deleted during the test (leaks). > - implicitly saving and reverting the active context around various operations Usually we need to save/revert one particular context item, not the whole context. > - accessing the active context id for suspended coroutines and > generators (so parent contexts can opt-in to seeing changes made in > child contexts) Yes, this might be useful, let's keep it. > > Running commands in a particular context *wouldn't* be a primitive > operation given those building blocks, since you can implement that > for yourself using the above primitives: > > def run_in_context(target_context_token, func, *args, **kwds): > old_context_token = get_active_context_token() > set_active_context(target_context_token) > try: > func(*args, **kwds) > finally: > set_active_context(old_context_token) I'd still prefer to implement this as part of the spec. There are some tricks that I want to use to make ExecutionContext.run() much faster than a pure Python version. This is a highly performance critical part of the PEP -- call_soon in asyncio is a VERY frequent thing. Besides, having ExecutionContext.run eliminates the need to sys.set_active_context() -- again, we need to discuss this, but I see less and less utility for it now. > > The public manipulation API here would be deliberately based on opaque > tokens to make it clear that creating and mutating execution contexts > is entirely within the realm of the interpreter implementation, and > user level code can only control *which* execution context is active > in the current thread, not create arbitrary new execution contexts of > its own (at least, not without writing a CPython-specific C > extension). > > For manipulation of values within the active context, looking at other > comparable APIs, I think the main prior art within the language would > be: > > 1. threading.local(), which uses the descriptor protocol to handle > arbitrary attributes > 2. Cell variable references in function `__closure__` attributes, > which also uses the descriptor protocol by way of the "cell_contents" > attribute > > In 3.7, those two examples are being brought closer by way of > `cell_contents` becoming a read/write attribute: > > >>> def f(i): > ... def g(): > ... nonlocal i > ... return i > ... return g > ... > >>> g = f(0) > >>> g() > 0 > >>> cell = g.__closure__[0] > >>> cell.cell_contents > 0 > >>> cell.cell_contents = 5 > >>> g() > 5 > >>> del cell.cell_contents > >>> g() > Traceback (most recent call last): > ... > NameError: free variable 'i' referenced before assignment in enclosing scope > >>> cell.cell_contents = 0 > >>> g() > 0 > > This is very similar to the way manipulation of entries within a > thread local namespace works, but with each cell containing exactly > one attribute. > > For context items, I agree with Nathaniel that the cell-style > one-value-per-item approach is likely to be the way to go. To > emphasise that changes to that attribute only affect the *active* > context, I think "active_value" would be a good name: > > >>> request_id = > sys.create_context_item("my_web_framework.request_id", "Request > identifier for my_web_framework") > >>> request_id.active_value > Traceback (most recent call last): > ... 
> RuntimeError: Context item "my_web_framework.request" not set in context
>
> >>> request_id.active_value = "12345"
> >>> request_id.active_value
> '12345'

I myself prefer a functional API to __getattr__. I don't like the "del local.x" syntax. I don't think we are forced to follow the threading.local() API here, are we?

Yury

> Finally, given opaque context tokens, and context items that worked like closure cells (only accessing the active context rather than lexically scoped variables), the one introspection primitive the *interpreter* would need to provide is either:
>
> 1. Given a context token, return a mapping from context items to their defined values in the given context
> 2. A way to get a listing of the context items defined in the active context
>
> Since either of those can be defined in terms of the other, my own preference goes to the first one, since using it to implement the second alternative just requires a simple `sys.get_active_context_token()` call, while implementing the first one in terms of the second one requires a helper like `run_in_context()` above to manipulate the active context in the current thread.
>
> The first one also makes it fairly straightforward to *diff* a given context against the active one - get the mappings for both contexts, check which keys they have in common, compare the values for the common keys, and then report on
>
> - keys that appear in one context but not the other
> - values which differ between them for common keys
> - (optionally) values which are the same for common keys
>
> Cheers,
> Nick.

From twshere at outlook.com  Sun Aug 13 08:49:45 2017
From: twshere at outlook.com (=?iso-2022-jp?B?GyRCMiZAaxsoQiA/?=)
Date: Sun, 13 Aug 2017 12:49:45 +0000
Subject: [Python-ideas] How do you think about these language extensions?
Message-ID:

Hi all,

I've just finished a language extension for CPython 3.6.x to support some additional grammars like Pattern Matching, and it's compatible with CPython. I'm looking for constructive advice, and I wonder if you will be interested in this one (the project address is https://github.com/thautwarm/flowpython).

Some examples here:

    # where syntax
    from math import pi
    r = 1   # the radius
    h = 10  # the height
    S = (2*S_top + S_side) where:
        S_top = pi*r**2
        S_side = C * h where:
            C = 2*pi*r

    # lambda & curry
    lambda x: lambda y: lambda z: ret where:
        ret = x+y
        ret -= z

    .x -> .y -> .z -> ret where:
        ret = x+y
        ret -= z

    as-with x def as y def as z def ret where:
        ret = x+y
        ret -= z

    # arrow transform (to avoid endless parentheses and try to be more readable)
    >> range(5) -> map(.x->x+2, _) -> list(_)
    >> [2,3,4,5,6]

    # pattern matching
    # "condic" is used as the keyword to avoid conflicts with the standard
    # library and third-party packages; "switch" and "match" both lead to
    # conflicts.
    condic+(type) 1:
        case a:int => assert a == 1 and type(a) == 1
        [>] case 0 => assert 1 > 0
        [is not] case 1 => assert 1 is not 1
        otherwise => print("nothing")

    condic+() [1,2,3]:
        case (a,*b)->b:list => sum(b)
        +[] case [] => print('empty list')
        +[==] case (a,b):(1,2) => print("the list is [1,2]")

The grammars with more details and examples can be found in https://github.com/thautwarm/flowpython/wiki

Does it interest you? If so, you can try it if you have CPython 3.6.x.
    pip install flowpython
    python -m flowpython -m enable/disable

Here is an example using flowpython, which gives the permutations of a sequence.

    from copy import deepcopy

    permutations = .seq -> seq_seq where:
        condic+[] seq:
            case (a, )  => seq_seq = [a,]
            case (a, b) => seq_seq = [[a,b],[b,a]]
            case (a,*b) =>
                seq_seq = permutations(b) -> map(.x -> insertAll(x, a), _) -> sum(_, []) where:
                    insertAll = . x, a -> ret where:
                        ret = [ deepcopy(x) -> _.insert(i, a) or _ for i in (len(x) -> range(_+1)) ]

Once ``permutations`` is defined, try this code in the console:

    >> range(3) -> permutations(_)
    >> [[0, 1, 2], [1, 0, 2], [1, 2, 0], [0, 2, 1], [2, 0, 1], [2, 1, 0]]

Does it seem interesting?

Thanks,
Thautwarm

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From yselivanov.ml at gmail.com  Sun Aug 13 14:44:24 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sun, 13 Aug 2017 14:44:24 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

I'll start a new thread to discuss if we want this specific semantics change soon (with some updates).

Yury

From njs at pobox.com  Sun Aug 13 15:14:07 2017
From: njs at pobox.com (Nathaniel Smith)
Date: Sun, 13 Aug 2017 12:14:07 -0700
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

On Sun, Aug 13, 2017 at 9:57 AM, Yury Selivanov wrote:
> 2. ContextItem.has(), ContextItem.get(), ContextItem.set(),
> ContextItem.delete() -- pretty self-explanatory.

It might make sense to simplify even further and declare that context items are initialized to None to start, and the only operations are set() and get(). And then get() can't fail, b/c there is no "value missing" state.

-n

--
Nathaniel J. Smith -- https://vorpus.org

From twshere at outlook.com  Sun Aug 13 15:46:57 2017
From: twshere at outlook.com (=?gb2312?B?zfXQ+yDV1A==?=)
Date: Sun, 13 Aug 2017 19:46:57 +0000
Subject: [Python-ideas] Python-ideas Digest, Vol 129, Issue 44
In-Reply-To: References: Message-ID:

Thank you for your consideration.

Get Outlook for Android

From: python-ideas-request at python.org
Sent: Monday, August 14, 03:14
Subject: Python-ideas Digest, Vol 129, Issue 44
To: python-ideas at python.org
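For reference, the simplification Nathaniel proposes above reduces a context item's surface to two methods. A minimal sketch of the intended semantics, written against the proposed (not yet existing) ``sys.new_context_item`` API, with illustrative values only:

    ci = sys.new_context_item(description='request')

    assert ci.get() is None      # items start out as None; get() never fails
    ci.set('GET /index')
    assert ci.get() == 'GET /index'

    # There is deliberately no has()/delete(): "not set" and "set to None"
    # are the same state, so ci.set(None) acts as removal.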
From yselivanov.ml at gmail.com  Sun Aug 13 16:54:23 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Sun, 13 Aug 2017 16:54:23 -0400
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

On Sun, Aug 13, 2017 at 3:14 PM, Nathaniel Smith wrote:
> On Sun, Aug 13, 2017 at 9:57 AM, Yury Selivanov wrote:
>> 2. ContextItem.has(), ContextItem.get(), ContextItem.set(),
>> ContextItem.delete() -- pretty self-explanatory.
>
> It might make sense to simplify even further and declare that context items are initialized to None to start, and the only operations are set() and get(). And then get() can't fail, b/c there is no "value missing" state.

I like this idea! It aligns with what I wanted to do in PEP 550 initially, but without the awkwardness of "delete on None". Will add this to the PEP.

Yury

From ncoghlan at gmail.com  Mon Aug 14 04:10:01 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 14 Aug 2017 18:10:01 +1000
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

On 14 August 2017 at 02:33, Yury Selivanov wrote:
> On Sat, Aug 12, 2017 at 10:09 PM, Nick Coghlan wrote:
>> That similarity makes me wonder whether the "isolated or not" behaviour could be moved from the object being executed and directly into the key/value pairs themselves based on whether or not the values were mutable, as that's the way function calls work: if the argument is immutable, the callee *can't* change it, while if it's mutable, the callee can mutate it, but it still can't rebind it to refer to a different object.
>
> I'm afraid that if we design the EC to behave differently for mutable and immutable values, it will be even harder for end users to understand.

There's nothing to design, as storing a list (or other mutable object) in an EC will necessarily be the same as storing one in a tuple: the fact you acquired the reference via an immutable container will do *nothing* to keep you from mutating the referenced object.

And for use cases like web requests, that's exactly the behaviour we want - changing the active web request is an EC level operation, but making changes to the state of the currently active request (e.g. in a middleware processor) won't require anything special.

[I'm going to snip the rest of the post, as it sounds pretty reasonable to me, and my questions about the interaction between sys.set_execution_context() and ec_back go away if sys.set_execution_context() doesn't exist as you're currently proposing]

> (gi_isolated_execution_context flag is still here for contextmanager).

This hidden flag variable on the types managing suspendable frames is still the piece of the proposal that strikes me as being the most potentially problematic, as it at least doubles the number of flows of control that need to be tested.

Essentially what we're aiming to model is:

1. Performing operations in a way that modifies the active execution context
2. Performing them in a way that saves & restores the execution context
For synchronous calls, this distinction is straightforward:

- plain calls may alter the active execution context via state mutation
- use ec.run() to save/restore the execution context around the operation

(The ec_back idea means we may also need an "ec.run()" variant that sets ec_back appropriately before making the call - for example, "ec.run()" could set ec_back, while a separate "ec.run_isolated()" could skip setting it. Alternatively, full isolation could be the default, and "ec.run_shared()" would set ec_back. If we go with the latter option, then "ec_shared" might be a better attribute name than "ec_back")

A function can be marked as always having its own private context using a decorator like so:

    def private_context(f):
        @functools.wraps(f)
        def wrapper(*args, **kwds):
            ec = sys.get_active_context()
            return ec.run(f, *args, **kwds)
        return wrapper

For next/send/throw and anext/asend/athrow, however, the proposal is to bake the save/restore into the *target objects*, rather than having to request it explicitly in the way those objects get called.

This means that unless we apply some implicit decorator magic to the affected slot definitions, there's now going to be a major behavioural difference between:

    some_state = sys.new_context_item()

    def local_state_changer(x):
        for i in range(x):
            some_state.set(i)
            yield i

    class ParentStateChanger:
        def __init__(self, x):
            self._itr = iter(range(x))
        def __iter__(self):
            return self
        def __next__(self):
            x = next(self._itr)
            some_state.set(x)
            return x

The latter would need the equivalent of `@private_context` on the `__next__` method definition to get the behaviour that generators would have by default (and similarly for __anext__ and asynchronous generators).

I haven't fully thought through the implications of this problem yet, but some initial unordered thoughts:

- implicit method decorators are always suspicious, but skipping them in this case feels like we'd be setting up developers of custom iterators for really subtle context management bugs
- contextlib's own helper classes would be fine, since they define __enter__ & __exit__, which wouldn't be affected by this
- for lru_cache, we rely on `__wrapped__` to get access to the underlying function without caching applied. Might it make sense to do something similar for these implicitly context-restoring methods? If so, should we use a dedicated name so that additional wrapper layers don't overwrite it?

Cheers,
Nick.

--
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From guido at python.org  Mon Aug 14 12:56:33 2017
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2017 09:56:33 -0700
Subject: [Python-ideas] New PEP 550: Execution Context
In-Reply-To: References: Message-ID:

Could someone (perhaps in a new thread?) summarize the current proposal, with some examples of how typical use cases would look? This is an important topic but the discussion is way too voluminous for me to follow while I'm on vacation with my family, and the PEP spends too many words on motivation and not enough on crisply explaining how the proposed feature works (what state is stored where, how it's accessed, and how it's manipulated behind the scenes).

--
--Guido van Rossum (python.org/~guido)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From yselivanov.ml at gmail.com Mon Aug 14 14:02:11 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Mon, 14 Aug 2017 14:02:11 -0400 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: On Mon, Aug 14, 2017 at 12:56 PM, Guido van Rossum wrote: > Could someone (perhaps in a new thread?) summarize the current proposal, > with some examples of how typical use cases would look? This is an important > topic but the discussion is way too voluminous for me to follow while I'm on > vacation with my family, and the PEP spends too many words on motivation and > not enough on crisply explaining how the proposed feature works (what state > is stored where how it's accessed, and how it's manipulated behind the > scenes). I'm working on it. Will start a new thread today. Yury From barry at python.org Mon Aug 14 14:09:41 2017 From: barry at python.org (Barry Warsaw) Date: Mon, 14 Aug 2017 11:09:41 -0700 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: Yury Selivanov wrote: > This is a new PEP to implement Execution Contexts in Python. It dawns on me that I might be able to use ECs to do a better job of implementing flufl.i18n's translation contexts. I think this is another example of what the PEP's abstract describes as "Context managers like decimal contexts, numpy.errstate, and warnings.catch_warnings;" The _ object maintains a stack of the language codes being used, and you can push a new code onto the stack (typically using `with` so they get automatically popped when exiting). The use case for this is translating say a notification to multiple recipients in the same request, one who speaks French, one who speaks German, and another that speaks English. The problem is that _ is usually a global in a typical application, so in an async environment, if one request is translating to 'fr', another might be translating to 'de', or even a deferred context (e.g. because you want to mark a string but not translate it until some later use). While I haven't used it in an async environment yet, the current approach probably doesn't work very well, or at all. I'd probably start by recommending a separate _ object in each thread, but that's less convenient to use in practice. It seems like it would be better to either attach an _ object to each EC, or to implement the stack of codes in the EC and let the global _ access that stack. It feels a lot like `let` in lisp, but without the implicit addition of the contextual keys into the local namespace. E.g. in a PEP 550 world, you'd have to explicitly retrieve the key/values from the EC rather than have them magically appear in the local namespace, the former of course being the Pythonic way to do it. Cheers, -Barry From yselivanov.ml at gmail.com Mon Aug 14 15:17:51 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Mon, 14 Aug 2017 15:17:51 -0400 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: Hi Barry, Yes, i18n is another use-case for execution context, and ec should be a perfect fit for it. Yury From yselivanov.ml at gmail.com Mon Aug 14 15:25:43 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Mon, 14 Aug 2017 15:25:43 -0400 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: Nick, you nailed it with your example. In short: current PEP 550 defines Execution Context in such a way, that generators and iterators will interact differently with it. 
That means that it won't be possible to refactor an iterator class to a generator and that's not acceptable. I'll be rewriting the whole specification section of the PEP today. Yury From ncoghlan at gmail.com Tue Aug 15 06:49:53 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 15 Aug 2017 20:49:53 +1000 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: On 15 August 2017 at 05:25, Yury Selivanov wrote: > Nick, you nailed it with your example. > > In short: current PEP 550 defines Execution Context in such a way, > that generators and iterators will interact differently with it. That > means that it won't be possible to refactor an iterator class to a > generator and that's not acceptable. > > I'll be rewriting the whole specification section of the PEP today. Trying to summarise something I thought of this morning regarding ec_back and implicitly isolating iterator contexts: With the notion of generators running with their own private context by default, that means the state needed to call __next__ on the generator is as follows: - current thread EC - generator's private EC (stored on the generator) - the generator's __next__ method This means that if the EC manipulation were to live in the next() builtin rather than in the individual __next__() methods, then this can be made a general context isolation protocol: - provide a `sys.create_execution_context()` interface - set `__private_context__` on your iterable if you want `next()` to use `ec.run()` (and update __private_context__ afterwards) - set `__private_context__ = None` if you want `next()` to just call `obj.__next__()` directly - generators have __private_context__ set by default, but wrappers like contextlib.contextmanager can clear it That would also suggest that ec.run() will need to return a 2-tuple: def run(self, f: Callable, *args, **kwds) -> Tuple[Any, ExecutionContext]: """Run the given function in this execution context Returns a 2-tuple containing the function result and the execution context that was active when the function returned. """ That way next(itr) will be able to update itr.__private_context__ appropriately if it was initially set and the call changes the active context. We could then give send(), throw() and their asynchronous counterparts the builtin+protocol method treatment, and put the EC manipulation in their builtins as well. Anyway, potentially a useful option to consider as you work on revising the proposal - I'll refrain from further comments until you have an updated draft available :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From yselivanov.ml at gmail.com Tue Aug 15 10:39:27 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Tue, 15 Aug 2017 10:39:27 -0400 Subject: [Python-ideas] New PEP 550: Execution Context In-Reply-To: References: Message-ID: Hi Nick, Thanks for writing this! You reminded me that it's crucial to have an ability to fully recreate generator behaviour in an iterator. Besides this being a requirement for a complete EC model, it is something that compilers like Cython absolutely need. I'm still working on a rewrite (which is now a completely different PEP), will probably finish it today. Yury From yselivanov.ml at gmail.com Tue Aug 15 19:55:45 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Tue, 15 Aug 2017 19:55:45 -0400 Subject: [Python-ideas] PEP 550 v2 Message-ID: Hi, Here's the PEP 550 version 2. 
Thanks to a very active and insightful discussion here on Python-ideas, we've discovered a number of problems with the first version of the PEP. This version is a complete rewrite (only Abstract, Rationale, and Goals sections were not updated).

The updated PEP is live on python.org: https://www.python.org/dev/peps/pep-0550/

There is no reference implementation at this point, but I'm confident that this version of the spec will have the same extremely low runtime overhead as the first version. Thanks to the new ContextItem design, accessing values in the context is even faster now.

Thank you!


PEP: 550
Title: Execution Context
Version: $Revision$
Last-Modified: $Date$
Author: Yury Selivanov
Status: Draft
Type: Standards Track
Content-Type: text/x-rst
Created: 11-Aug-2017
Python-Version: 3.7
Post-History: 11-Aug-2017, 15-Aug-2017


Abstract
========

This PEP proposes a new mechanism to manage execution state--the logical environment in which a function, a thread, a generator, or a coroutine executes.

A few examples of where reliable state storage is required:

* Context managers like decimal contexts, ``numpy.errstate``, and ``warnings.catch_warnings``;

* Storing request-related data such as security tokens and request data in web applications, implementing i18n;

* Profiling, tracing, and logging in complex and large code bases.

The usual solution for storing state is to use Thread-local Storage (TLS), implemented in the standard library as ``threading.local()``. Unfortunately, TLS does not work for the purpose of state isolation for generators or asynchronous code, because such code executes concurrently in a single thread.


Rationale
=========

Traditionally, Thread-local Storage (TLS) is used for storing state. However, the major flaw of TLS is that it works only for multi-threaded code. It is not possible to reliably contain state within a generator or a coroutine. For example, consider the following generator::

    def calculate(precision, ...):
        with decimal.localcontext() as ctx:
            # Set the precision for decimal calculations
            # inside this block
            ctx.prec = precision

            yield calculate_something()
            yield calculate_something_else()

The decimal context uses TLS to store its state, and because TLS is not aware of generators, the state can leak. If a user iterates over the ``calculate()`` generator with different precisions one by one using a ``zip()`` built-in, the above code will not work correctly. For example::

    g1 = calculate(precision=100)
    g2 = calculate(precision=50)

    items = list(zip(g1, g2))

    # items[0] will be a tuple of:
    #   first value from g1 calculated with 100 precision,
    #   first value from g2 calculated with 50 precision.
    #
    # items[1] will be a tuple of:
    #   second value from g1 calculated with 50 precision (!!!),
    #   second value from g2 calculated with 50 precision.
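The leak is easy to reproduce on current CPython; this self-contained variant of the above (with smaller, illustrative precisions) prints a 4-digit second value from ``g1``::

    import decimal

    def calculate(precision):
        with decimal.localcontext() as ctx:
            ctx.prec = precision
            yield decimal.Decimal(1) / 3
            yield decimal.Decimal(1) / 3

    g1 = calculate(precision=10)
    g2 = calculate(precision=4)
    print(list(zip(g1, g2)))

    # The second value produced by g1 comes out with 4 significant
    # digits instead of 10: g2's localcontext() was still active in
    # the thread-local decimal state when g1 resumed.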
An even scarier example would be using decimals to represent money in an async/await application: decimal calculations can suddenly lose precision in the middle of processing a request. Currently, bugs like this are extremely hard to find and fix.

Another common need for web applications is to have access to the current request object, or security context, or, simply, the request URL for logging or submitting performance tracing data::

    async def handle_http_request(request):
        context.current_http_request = request

        await ...
        # Invoke your framework code, render templates,
        # make DB queries, etc, and use the global
        # 'current_http_request' in that code.

        # This isn't currently possible to do reliably
        # in asyncio out of the box.

These examples are just a few out of many where a reliable way to store context data is absolutely needed.

The inability to use TLS for asynchronous code has led to a proliferation of ad-hoc solutions, which are limited in scope and do not support all required use cases.

The current status quo is that any library, including the standard library, that uses TLS will likely not work as expected in asynchronous code or with generators (see [3]_ as an example issue.)

Some languages that have coroutines or generators recommend manually passing a ``context`` object to every function; see [1]_, which describes the pattern for Go. This approach, however, has limited use for Python, where we have a huge ecosystem that was built to work with a TLS-like context. Moreover, passing the context explicitly does not work at all for libraries like ``decimal`` or ``numpy``, which use operator overloading.

The .NET runtime, which has support for async/await, has a generic solution to this problem, called ``ExecutionContext`` (see [2]_). On the surface, working with it is very similar to working with a TLS, but the former explicitly supports asynchronous code.


Goals
=====

The goal of this PEP is to provide a more reliable alternative to ``threading.local()``. It should be explicitly designed to work with the Python execution model, equally supporting threads, generators, and coroutines.

An acceptable solution for Python should meet the following requirements:

* Transparent support for code executing in threads, coroutines, and generators with an easy-to-use API.

* Negligible impact on the performance of the existing code or the code that will be using the new mechanism.

* Fast C API for packages like ``decimal`` and ``numpy``.

Explicit is still better than implicit, hence the new APIs should only be used when there is no acceptable way of passing the state explicitly.


Specification
=============

Execution Context is a mechanism for storing and accessing data specific to a logical thread of execution. We consider OS threads, generators, and chains of coroutines (such as ``asyncio.Task``) to be variants of a logical thread.

In this specification, we will use the following terminology:

* **Local Context**, or LC, is a key/value mapping that stores the context of a logical thread.

* **Execution Context**, or EC, is an OS-thread-specific dynamic stack of Local Contexts.

* **Context Item**, or CI, is an object used to set and get values from the Execution Context.

Please note that throughout the specification we use simple pseudo-code to illustrate how the EC machinery works. The actual algorithms and data structures that we will use to implement the PEP are discussed in the `Implementation Strategy`_ section.


Context Item Object
-------------------

The ``sys.new_context_item(description)`` function creates a new ``ContextItem`` object. The ``description`` parameter is a ``str``, explaining the nature of the context key for introspection and debugging purposes.

``ContextItem`` objects have the following methods and attributes:

* ``.description``: read-only description;

* ``.set(o)`` method: set the value to ``o`` for the context item in the execution context.

* ``.get()`` method: return the current EC value for the context item. Context items are initialized with ``None`` when created, so this method call never fails.
The below is an example of how context items can be used:: my_context = sys.new_context_item(description='mylib.context') my_context.set('spam') # Later, to access the value of my_context: print(my_context.get()) Thread State and Multi-threaded code ------------------------------------ Execution Context is implemented on top of Thread-local Storage. For every thread there is a separate stack of Local Contexts -- mappings of ``ContextItem`` objects to their values in the LC. New threads always start with an empty EC. For CPython:: PyThreadState: execution_context: ExecutionContext([ LocalContext({ci1: val1, ci2: val2, ...}), ... ]) The ``ContextItem.get()`` and ``.set()`` methods are defined as follows (in pseudo-code):: class ContextItem: def get(self): tstate = PyThreadState_Get() for local_context in reversed(tstate.execution_context): if self in local_context: return local_context[self] def set(self, value): tstate = PyThreadState_Get() if not tstate.execution_context: tstate.execution_context = [LocalContext()] tstate.execution_context[-1][self] = value With the semantics defined so far, the Execution Context can already be used as an alternative to ``threading.local()``:: def print_foo(): print(ci.get() or 'nothing') ci = sys.new_context_item(description='test') ci.set('foo') # Will print "foo": print_foo() # Will print "nothing": threading.Thread(target=print_foo).start() Manual Context Management ------------------------- Execution Context is generally managed by the Python interpreter, but sometimes it is desirable for the user to take the control over it. A few examples when this is needed: * running a computation in ``concurrent.futures.ThreadPoolExecutor`` with the current EC; * reimplementing generators with iterators (more on that later); * managing contexts in asynchronous frameworks (implement proper EC support in ``asyncio.Task`` and ``asyncio.loop.call_soon``.) For these purposes we add a set of new APIs (they will be used in later sections of this specification): * ``sys.new_local_context()``: create an empty ``LocalContext`` object. * ``sys.new_execution_context()``: create an empty ``ExecutionContext`` object. * Both ``LocalContext`` and ``ExecutionContext`` objects are opaque to Python code, and there are no APIs to modify them. * ``sys.get_execution_context()`` function. The function returns a copy of the current EC: an ``ExecutionContext`` instance. The runtime complexity of the actual implementation of this function can be O(1), but for the purposes of this section it is equivalent to:: def get_execution_context(): tstate = PyThreadState_Get() return copy(tstate.execution_context) * ``sys.run_with_execution_context(ec: ExecutionContext, func, *args, **kwargs)`` runs ``func(*args, **kwargs)`` in the provided execution context:: def run_with_execution_context(ec, func, *args, **kwargs): tstate = PyThreadState_Get() old_ec = tstate.execution_context tstate.execution_context = ExecutionContext( ec.local_contexts + [LocalContext()] ) try: return func(*args, **kwargs) finally: tstate.execution_context = old_ec Any changes to Local Context by ``func`` will be ignored. 
This allows to reuse one ``ExecutionContext`` object for multiple invocations of different functions, without them being able to affect each other's environment:: ci = sys.new_context_item('example') ci.set('spam') def func(): print(ci.get()) ci.set('ham') ec = sys.get_execution_context() sys.run_with_execution_context(ec, func) sys.run_with_execution_context(ec, func) # Will print: # spam # spam * ``sys.run_with_local_context(lc: LocalContext, func, *args, **kwargs)`` runs ``func(*args, **kwargs)`` in the current execution context using the specified local context. Any changes that ``func`` does to the local context will be persisted in ``lc``. This behaviour is different from the ``run_with_execution_context()`` function, which always creates a new throw-away local context. In pseudo-code:: def run_with_local_context(lc, func, *args, **kwargs): tstate = PyThreadState_Get() old_ec = tstate.execution_context tstate.execution_context = ExecutionContext( old_ec.local_contexts + [lc] ) try: return func(*args, **kwargs) finally: tstate.execution_context = old_ec Using the previous example:: ci = sys.new_context_item('example') ci.set('spam') def func(): print(ci.get()) ci.set('ham') ec = sys.get_execution_context() lc = sys.new_local_context() sys.run_with_local_context(lc, func) sys.run_with_local_context(lc, func) # Will print: # spam # ham As an example, let's make a subclass of ``concurrent.futures.ThreadPoolExecutor`` that preserves the execution context for scheduled functions:: class Executor(concurrent.futures.ThreadPoolExecutor): def submit(self, fn, *args, **kwargs): context = sys.get_execution_context() fn = functools.partial( sys.run_with_execution_context, context, fn, *args, **kwargs) return super().submit(fn) EC Semantics for Coroutines --------------------------- Python :pep:`492` coroutines are used to implement cooperative multitasking. For a Python end-user they are similar to threads, especially when it comes to sharing resources or modifying the global state. An event loop is needed to schedule coroutines. Coroutines that are explicitly scheduled by the user are usually called Tasks. When a coroutine is scheduled, it can schedule other coroutines using an ``await`` expression. In async/await world, awaiting a coroutine is equivalent to a regular function call in synchronous code. Thus, Tasks are similar to threads. By drawing a parallel between regular multithreaded code and async/await, it becomes apparent that any modification of the execution context within one Task should be visible to all coroutines scheduled within it. Any execution context modifications, however, must not be visible to other Tasks executing within the same OS thread. Coroutine Object Modifications ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ To achieve this, a small set of modifications to the coroutine object is needed: * New ``cr_local_context`` attribute. This attribute is readable and writable for Python code. * When a coroutine object is instantiated, its ``cr_local_context`` is initialized with an empty Local Context. * Coroutine's ``.send()`` and ``.throw()`` methods are modified as follows (in pseudo-C):: if coro.cr_local_context is not None: tstate = PyThreadState_Get() tstate.execution_context.push(coro.cr_local_context) try: # Perform the actual `Coroutine.send()` or # `Coroutine.throw()` call. return coro.send(...) finally: coro.cr_local_context = tstate.execution_context.pop() else: # Perform the actual `Coroutine.send()` or # `Coroutine.throw()` call. return coro.send(...) 
* When the Python interpreter sees an ``await`` instruction, it inspects the ``cr_local_context`` attribute of the coroutine that is about to be awaited. For ``await coro``:

  * If ``coro.cr_local_context`` is an empty ``LocalContext`` object that ``coro`` was created with, the interpreter will set ``coro.cr_local_context`` to ``None``.

  * If ``coro.cr_local_context`` was modified by Python code, the interpreter will leave it as is.

This makes any changes to the execution context made by nested coroutine calls within a Task visible throughout the Task::

    ci = sys.new_context_item('example')

    async def nested():
        ci.set('nested')

    async def main():
        ci.set('main')
        print('before:', ci.get())
        await nested()
        print('after:', ci.get())

    # Will print:
    #   before: main
    #   after: nested

Essentially, coroutines work with Execution Context items similarly to threads, and the ``await`` expression acts like a function call.

This mechanism also works for ``yield from`` in generators decorated with ``@types.coroutine`` or ``@asyncio.coroutine``, which are called "generator-based coroutines" according to :pep:`492`, and should be fully compatible with native async/await coroutines.


Tasks
^^^^^

In asynchronous frameworks like asyncio, coroutines are run by an event loop, and need to be explicitly scheduled (in asyncio coroutines are run by ``asyncio.Task``.)

With the currently defined semantics, the interpreter makes coroutines linked by an ``await`` expression share the same Local Context.

The interpreter, however, is not aware of the Task concept, and cannot help with ensuring that new Tasks started in coroutines use the correct EC::

    current_request = sys.new_context_item(description='request')

    async def child():
        print('current request:', repr(current_request.get()))

    async def handle_request(request):
        current_request.set(request)
        event_loop.create_task(child())

    run(top_coro())

    # Will print:
    #   current request: None

To enable correct Execution Context propagation into Tasks, the asynchronous framework needs to assist the interpreter:

* When ``create_task`` is called, it should capture the current execution context with ``sys.get_execution_context()`` and save it on the Task object.

* When the Task object runs its coroutine object, it should execute ``.send()`` and ``.throw()`` methods within the captured execution context, using the ``sys.run_with_execution_context()`` function.

With help from the asynchronous framework, the above snippet will run correctly, and the ``child()`` coroutine will be able to access the current request object through the ``current_request`` Context Item.


Event Loop Callbacks
^^^^^^^^^^^^^^^^^^^^

Similarly to Tasks, functions like asyncio's ``loop.call_soon()`` should capture the current execution context with ``sys.get_execution_context()`` and execute callbacks within it with ``sys.run_with_execution_context()``.

This way the following code will work::

    current_request = sys.new_context_item(description='request')

    def log():
        request = current_request.get()
        print(request)

    async def request_handler(request):
        current_request.set(request)
        get_event_loop().call_soon(log)
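To make that requirement concrete, here is a rough sketch of a helper an event loop could use, built only from the proposed ``sys`` functions above; the helper's name and the omitted error handling are assumptions of this illustration::

    def call_soon_with_ec(loop, callback, *args):
        # Capture the caller's execution context now, and re-enter it
        # when the loop invokes the callback later.
        ec = sys.get_execution_context()
        return loop.call_soon(
            sys.run_with_execution_context, ec, callback, *args)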
A crucial difference between ``await coro`` and ``yield value`` is
that the former expression guarantees that ``coro`` will be
executed fully, while the latter produces ``value`` and
suspends the generator until it gets iterated again.

Generators, similarly to coroutines, have a ``gi_local_context``
attribute, which is set to an empty Local Context when created.

Contrary to coroutines though, the ``yield from o`` expression in
generators (that are not generator-based coroutines) is semantically
equivalent to ``for v in o: yield v``, therefore the interpreter does
not attempt to control their ``gi_local_context``.


EC Semantics for Generators
^^^^^^^^^^^^^^^^^^^^^^^^^^^

Every generator object has its own Local Context that stores
only its own local modifications of the context.  When a generator
is being iterated, its local context will be put on the EC stack
of the current thread.  This means that the generator will be able
to access items from the surrounding context::

    local = sys.new_context_item("local")
    glob = sys.new_context_item("global")

    def generator():
        local.set('inside gen:')
        while True:
            print(local.get(), glob.get())
            yield

    g = generator()

    local.set('hello')
    glob.set('spam')
    next(g)

    local.set('world')
    glob.set('ham')
    next(g)

    # Will print:
    # inside gen: spam
    # inside gen: ham

Any changes to the EC in nested generators are invisible to the outer
generator::

    local = sys.new_context_item("local")

    def inner_gen():
        local.set('spam')
        yield

    def outer_gen():
        local.set('ham')
        yield from inner_gen()
        print(local.get())

    list(outer_gen())

    # Will print:
    # ham


Running generators without LC
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Similarly to coroutines, generators with ``gi_local_context``
set to ``None`` simply use the outer Local Context.

The ``@contextlib.contextmanager`` decorator uses this mechanism to
allow its generator to affect the EC::

    item = sys.new_context_item('test')

    @contextmanager
    def context(x):
        old = item.get()
        item.set(x)
        try:
            yield
        finally:
            item.set(old)

    with context('spam'):

        with context('ham'):
            print(1, item.get())

        print(2, item.get())

    # Will print:
    # 1 ham
    # 2 spam


Implementing Generators with Iterators
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

The Execution Context API makes it possible to fully replicate the EC
behaviour imposed on generators with a regular Python iterator
class::

    class Gen:

        def __init__(self):
            self.local_context = sys.new_local_context()

        def __iter__(self):
            return self

        def __next__(self):
            return sys.run_with_local_context(
                self.local_context, self._next_impl)

        def _next_impl(self):
            # Actual __next__ implementation.
            ...


Asynchronous Generators
-----------------------

Asynchronous Generators (AG) interact with the Execution Context
similarly to regular generators.

They have an ``ag_local_context`` attribute, which, similarly to
regular generators, can be set to ``None`` to make them use the outer
Local Context.  This is used by the new
``contextlib.asynccontextmanager`` decorator.

The EC support of the ``await`` expression is implemented using the
same approach as in coroutines, see the `Coroutine Object
Modifications`_ section.


Greenlets
---------

Greenlet is an alternative implementation of cooperative
scheduling for Python.  Although the greenlet package is not part of
CPython, popular frameworks like gevent rely on it, and it is
important that greenlet can be modified to support execution
contexts.

In a nutshell, greenlet design is very similar to the design of
generators.  The main difference is that for generators, the stack
is managed by the Python interpreter.
Greenlet works outside of the Python interpreter, and manually saves
some ``PyThreadState`` fields and pushes/pops the C-stack.  Thus the
``greenlet`` package can be easily updated to use the new low-level
`C API`_ to enable full support of EC.


New APIs
========

Python
------

The Python APIs were designed to completely hide the internal
implementation details, but at the same time provide enough control
over EC and LC to re-implement all of Python built-in objects
in pure Python.

1. ``sys.new_context_item(description='...')``: create a
   ``ContextItem`` object used to access/set values in EC.

2. ``ContextItem``:

   * ``.description``: read-only attribute.
   * ``.get()``: return the current value for the item.
   * ``.set(o)``: set the current value in the EC for the item.

3. ``sys.get_execution_context()``: return the current
   ``ExecutionContext``.

4. ``sys.new_execution_context()``: create a new empty
   ``ExecutionContext``.

5. ``sys.new_local_context()``: create a new empty ``LocalContext``.

6. ``sys.run_with_execution_context(ec: ExecutionContext,
   func, *args, **kwargs)``.

7. ``sys.run_with_local_context(lc: LocalContext,
   func, *args, **kwargs)``.


C API
-----

1. ``PyContextItem * PyContext_NewItem(char *desc)``: create a
   ``PyContextItem`` object.

2. ``PyObject * PyContext_GetItem(PyContextItem *)``: get the
   current value for the context item.

3. ``int PyContext_SetItem(PyContextItem *, PyObject *)``: set
   the current value for the context item.

4. ``PyLocalContext * PyLocalContext_New()``: create a new empty
   ``PyLocalContext``.

5. ``PyExecutionContext * PyExecutionContext_New()``: create a new
   empty ``PyExecutionContext``.

6. ``PyExecutionContext * PyExecutionContext_Get()``: get the
   EC for the active thread state.

7. ``int PyExecutionContext_Set(PyExecutionContext *)``: set the
   passed EC object as the current for the active thread state.

8. ``int PyExecutionContext_SetWithLocalContext(PyExecutionContext *,
   PyLocalContext *)``: makes it possible to implement the
   ``sys.run_with_local_context`` Python API.


Implementation Strategy
=======================

LocalContext is a Weak Key Mapping
----------------------------------

Using a weak key mapping for the ``LocalContext`` implementation
enables the following properties with regards to garbage
collection:

* ``ContextItem`` objects are strongly-referenced only from the
  application code, not from any of the Execution Context
  machinery or the values they point to.  This means that there
  are no reference cycles that could extend their lifespan
  longer than necessary, or prevent their garbage collection.

* Values put in the Execution Context are guaranteed to be kept
  alive while there is a ``ContextItem`` key referencing them in
  the thread.

* If a ``ContextItem`` is garbage collected, all of its values will
  be removed from all contexts, allowing them to be GCed if needed.

* If a thread has ended its execution, its thread state will be
  cleaned up along with its ``ExecutionContext``, cleaning
  up all values bound to all Context Items in the thread.


ContextItem.get() Cache
-----------------------

We can add three new fields to the ``PyThreadState`` and
``PyInterpreterState`` structs:

* ``uint64_t PyThreadState->unique_id``: a globally unique
  thread state identifier (we can add a counter to
  ``PyInterpreterState`` and increment it when a new thread state is
  created.)

* ``uint64_t PyInterpreterState->context_item_deallocs``: every time
  a ``ContextItem`` is GCed, all Execution Contexts in all threads
  will lose track of it.  ``context_item_deallocs`` will simply
  count all ``ContextItem`` deallocations.

* ``uint64_t PyThreadState->execution_context_ver``: every time
  a new item is set, or an existing item is updated, or the stack
  of execution contexts is changed in the thread, we increment this
  counter.

These three fields allow implementing a fast cache path in
``ContextItem.get()``, in pseudo-code::

    class ContextItem:

        def get(self):
            tstate = PyThreadState_Get()

            if (self.last_tstate_id == tstate.unique_id and
                    self.last_ver == tstate.execution_context_ver and
                    self.last_deallocs ==
                        tstate.interp.context_item_deallocs):
                return self.last_value

            value = None
            for mapping in reversed(tstate.execution_context):
                if self in mapping:
                    value = mapping[self]
                    break

            self.last_value = value
            self.last_tstate_id = tstate.unique_id
            self.last_ver = tstate.execution_context_ver
            self.last_deallocs = tstate.interp.context_item_deallocs

            return value

This is similar to the trick that the decimal C implementation uses
for caching the current decimal context, and will have the same
performance characteristics, but available to all
Execution Context users.


Approach #1: Use a dict for LocalContext
----------------------------------------

The straightforward way of implementing the proposed EC
mechanisms is to create a ``WeakKeyDict`` on top of the Python
``dict`` type.

To implement the ``ExecutionContext`` type we can use a Python
``list`` (or a custom stack implementation with some
pre-allocation optimizations).

This approach will have the following runtime complexity:

* O(M) for ``ContextItem.get()``, where ``M`` is the number of
  Local Contexts in the stack.

  It is important to note that ``ContextItem.get()`` will implement
  a cache making the operation O(1) for packages like ``decimal``
  and ``numpy``.

* O(1) for ``ContextItem.set()``.

* O(N) for ``sys.get_execution_context()``, where ``N`` is the
  total number of items in the current **execution** context.


Approach #2: Use HAMT for LocalContext
--------------------------------------

Languages like Clojure and Scala use Hash Array Mapped Tries (HAMT)
to implement high performance immutable collections [5]_, [6]_.

Immutable mappings implemented with HAMT have O(log\ :sub:`32`\ N)
performance for ``set()``, ``get()``, and ``merge()`` operations,
which is essentially O(1) for relatively small mappings
(read about HAMT performance in CPython in the
`Appendix: HAMT Performance`_ section.)

In this approach we use the same design of the ``ExecutionContext``
as in Approach #1, but with an HAMT backed weak key Local Context
implementation.  With that we will have the following runtime
complexity:

* O(M * log\ :sub:`32`\ N) for ``ContextItem.get()``,
  where ``M`` is the number of Local Contexts in the stack,
  and ``N`` is the number of items in the EC.  The operation will
  essentially be O(M), because execution contexts are normally not
  expected to have more than a few dozen items.

  (``ContextItem.get()`` will have the same caching mechanism as in
  Approach #1.)

* O(log\ :sub:`32`\ N) for ``ContextItem.set()``, where ``N`` is the
  number of items in the current **local** context.  This will
  essentially be an O(1) operation most of the time.

* O(log\ :sub:`32`\ N) for ``sys.get_execution_context()``, where
  ``N`` is the total number of items in the current **execution**
  context.

Essentially, using HAMT for Local Contexts instead of Python dicts
brings the complexity of ``sys.get_execution_context()`` down
from O(N) to O(log\ :sub:`32`\ N) because of the more efficient
merge algorithm.
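
To make the difference concrete, here is a toy pure-Python model of
why immutability matters for ``sys.get_execution_context()``.  It is
illustrative only: a plain ``dict`` copy stands in for the
O(log\ :sub:`32`\ N) HAMT update, and weak references are ignored::

    class ImmutableLC:
        """Toy stand-in for an HAMT-backed Local Context."""

        def __init__(self, data=None):
            self._data = dict(data or {})

        def set(self, key, value):
            # Persistent-style "mutation": return a *new* mapping;
            # the old one is never changed, so existing snapshots
            # stay valid.  (O(N) with a dict copy here; O(log32 N)
            # with a real HAMT.)
            new_data = dict(self._data)
            new_data[key] = value
            return ImmutableLC(new_data)

        def get(self, key, default=None):
            return self._data.get(key, default)

    def get_execution_context(lc_stack):
        # With mutable dict-based LCs (Approach #1), every mapping
        # would have to be copied -- O(N) in the total number of
        # items.  With immutable LCs it is enough to copy the stack
        # itself -- O(M) -- or, as in Approach #3 below, simply to
        # reference its head node -- O(1).
        return list(lc_stack)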
Approach #3: Use HAMT and Immutable Linked List
-----------------------------------------------

We can make an alternative ``ExecutionContext`` design by using
a linked list.  Each ``LocalContext`` in the ``ExecutionContext``
object will be wrapped in a linked-list node.

``LocalContext`` objects will use the HAMT backed weak key
implementation described in Approach #2.

Every modification to the current ``LocalContext`` will produce a
new version of it, which will be wrapped in a **new linked list
node**.  Essentially this means that ``ExecutionContext`` is an
immutable forest of ``LocalContext`` objects, and can be safely
copied by reference in ``sys.get_execution_context()`` (eliminating
the expensive "merge" operation.)

With this approach, ``sys.get_execution_context()`` will be an
**O(1) operation**.


Summary
-------

We believe that approach #3 enables an efficient and complete
Execution Context implementation, with excellent runtime performance.

`ContextItem.get() Cache`_ enables fast retrieval of context items
for performance critical libraries like decimal and numpy.

A fast ``sys.get_execution_context()`` enables efficient management
of execution contexts in asynchronous libraries like asyncio.


Design Considerations
=====================

Can we fix ``PyThreadState_GetDict()``?
---------------------------------------

``PyThreadState_GetDict`` is a TLS, and some of its existing users
might depend on it being just a TLS.  Changing its behaviour to follow
the Execution Context semantics would break backwards compatibility.


PEP 521
-------

:pep:`521` proposes an alternative solution to the problem:
enhance the Context Manager Protocol with two new methods,
``__suspend__`` and ``__resume__``.  To make it compatible with
async/await, the Asynchronous Context Manager Protocol will also need
to be extended with ``__asuspend__`` and ``__aresume__``.

This makes it possible to implement context managers like decimal
context and ``numpy.errstate`` for generators and coroutines.

The following code::

    class Context:

        def __enter__(self):
            self.old_x = get_execution_context_item('x')
            set_execution_context_item('x', 'something')

        def __exit__(self, *err):
            set_execution_context_item('x', self.old_x)

would become this::

    local = threading.local()

    class Context:

        def __enter__(self):
            self.old_x = getattr(local, 'x', None)
            local.x = 'something'

        def __suspend__(self):
            local.x = self.old_x

        def __resume__(self):
            local.x = 'something'

        def __exit__(self, *err):
            local.x = self.old_x

Besides complicating the protocol, the implementation will likely
negatively impact performance of coroutines, generators, and any code
that uses context managers, and will notably complicate the
interpreter implementation.

:pep:`521` also does not provide any mechanism to propagate state
in a local context, like storing a request object in an HTTP request
handler to have better logging.  Nor does it solve the leaking state
problem for greenlet/gevent.


Can Execution Context be implemented outside of CPython?
--------------------------------------------------------

Because async/await code needs an event loop to run it, an EC-like
solution can be implemented in a limited way for coroutines.

Generators, on the other hand, do not have an event loop or
trampoline, making it impossible to intercept their ``yield`` points
outside of the Python interpreter.


Backwards Compatibility
=======================

This proposal preserves 100% backwards compatibility.
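
For illustration, porting existing ``threading.local()`` code to the
proposed APIs is mechanical.  The sketch below uses only the functions
specified in this PEP; the ``_tls``/``_precision`` names and the
``'myapp.precision'`` description are made up for the example::

    # Existing TLS-based code -- unaffected by this PEP:
    import threading

    _tls = threading.local()

    def set_precision(p):
        _tls.precision = p

    def get_precision():
        return getattr(_tls, 'precision', None)

    # The same state on top of the Execution Context; unlike the
    # TLS version, it is correctly isolated between generators,
    # coroutines, and Tasks:
    import sys

    _precision = sys.new_context_item(description='myapp.precision')

    def set_precision(p):
        _precision.set(p)

    def get_precision():
        return _precision.get()    # None if never set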
Appendix: HAMT Performance
==========================

To assess if HAMT can be used for Execution Context, we implemented
it in CPython [7]_.

.. figure:: pep-0550-hamt_vs_dict.png
   :align: center
   :width: 100%

   Figure 1.  Benchmark code can be found here: [9]_.

Figure 1 shows that HAMT indeed displays O(1) performance for all
benchmarked dictionary sizes.  For dictionaries with less than 100
items, HAMT is a bit slower than Python dict/shallow copy.

.. figure:: pep-0550-lookup_hamt.png
   :align: center
   :width: 100%

   Figure 2.  Benchmark code can be found here: [10]_.

Figure 2 shows a comparison of lookup costs between Python dict
and an HAMT immutable mapping.  HAMT lookup time is 30-40% worse
than Python dict lookups on average, which is a very good result,
considering how well Python dicts are optimized.

Note that, according to [8]_, the HAMT design can be further improved.


Acknowledgments
===============

I thank Elvis Pranskevichus and Victor Petrovykh for countless
discussions around the topic and PEP proofreading and edits.

Thanks to Nathaniel Smith for proposing the ``ContextItem`` design
[17]_ [18]_, for pushing the PEP towards a more complete design, and
for coming up with the idea of having a stack of contexts in the
thread state.

Thanks to Nick Coghlan for numerous suggestions and ideas on the
mailing list, and for coming up with a case that caused the complete
rewrite of the initial PEP version [19]_.


References
==========

.. [1] https://blog.golang.org/context

.. [2] https://msdn.microsoft.com/en-us/library/system.threading.executioncontext.aspx

.. [3] https://github.com/numpy/numpy/issues/9444

.. [4] http://bugs.python.org/issue31179

.. [5] https://en.wikipedia.org/wiki/Hash_array_mapped_trie

.. [6] http://blog.higher-order.net/2010/08/16/assoc-and-clojures-persistenthashmap-part-ii.html

.. [7] https://github.com/1st1/cpython/tree/hamt

.. [8] https://michael.steindorfer.name/publications/oopsla15.pdf

.. [9] https://gist.github.com/1st1/9004813d5576c96529527d44c5457dcd

.. [10] https://gist.github.com/1st1/dbe27f2e14c30cce6f0b5fddfc8c437e

.. [11] https://github.com/1st1/cpython/tree/pep550

.. [12] https://www.python.org/dev/peps/pep-0492/#async-await

.. [13] https://github.com/MagicStack/uvloop/blob/master/examples/bench/echoserver.py

.. [14] https://github.com/MagicStack/pgbench

.. [15] https://github.com/python/performance

.. [16] https://gist.github.com/1st1/6b7a614643f91ead3edf37c4451a6b4c

.. [17] https://mail.python.org/pipermail/python-ideas/2017-August/046752.html

.. [18] https://mail.python.org/pipermail/python-ideas/2017-August/046772.html

.. [19] https://mail.python.org/pipermail/python-ideas/2017-August/046780.html


Copyright
=========

This document has been placed in the public domain.

From arj.python at gmail.com  Wed Aug 16 00:41:56 2017
From: arj.python at gmail.com (Abdur-Rahmaan Janhangeer)
Date: Wed, 16 Aug 2017 08:41:56 +0400
Subject: [Python-ideas] DOM syntax guide
In-Reply-To:
References:
Message-ID:

greetings all,

i like python a lot and would like to use it everywhere ... up to on the
web (not django type). For python js-compiled versions (for makers) can
you provide some syntax guidelines for dom access ?

Abdur-Rahmaan Janhangeer,
Mauritius
abdurrahmaanjanhangeer.wordpress.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From jelle.zijlstra at gmail.com Wed Aug 16 02:53:04 2017 From: jelle.zijlstra at gmail.com (Jelle Zijlstra) Date: Wed, 16 Aug 2017 08:53:04 +0200 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: 2017-08-16 1:55 GMT+02:00 Yury Selivanov : > Hi, > > Here's the PEP 550 version 2. Thanks to a very active and insightful > discussion here on Python-ideas, we've discovered a number of > problems with the first version of the PEP. This version is a complete > rewrite (only Abstract, Rationale, and Goals sections were not updated). > > The updated PEP is live on python.org: > https://www.python.org/dev/peps/pep-0550/ > > There is no reference implementation at this point, but I'm confident > that this version of the spec will have the same extremely low > runtime overhead as the first version. Thanks to the new ContextItem > design, accessing values in the context is even faster now. > > Thank you! > > > PEP: 550 > Title: Execution Context > Version: $Revision$ > Last-Modified: $Date$ > Author: Yury Selivanov > Status: Draft > Type: Standards Track > Content-Type: text/x-rst > Created: 11-Aug-2017 > Python-Version: 3.7 > Post-History: 11-Aug-2017, 15-Aug-2017 > > > Abstract > ======== > > This PEP proposes a new mechanism to manage execution state--the > logical environment in which a function, a thread, a generator, > or a coroutine executes in. > > A few examples of where having a reliable state storage is required: > > * Context managers like decimal contexts, ``numpy.errstate``, > and ``warnings.catch_warnings``; > > * Storing request-related data such as security tokens and request > data in web applications, implementing i18n; > > * Profiling, tracing, and logging in complex and large code bases. > > The usual solution for storing state is to use a Thread-local Storage > (TLS), implemented in the standard library as ``threading.local()``. > Unfortunately, TLS does not work for the purpose of state isolation > for generators or asynchronous code, because such code executes > concurrently in a single thread. > > > Rationale > ========= > > Traditionally, a Thread-local Storage (TLS) is used for storing the > state. However, the major flaw of using the TLS is that it works only > for multi-threaded code. It is not possible to reliably contain the > state within a generator or a coroutine. For example, consider > the following generator:: > > def calculate(precision, ...): > with decimal.localcontext() as ctx: > # Set the precision for decimal calculations > # inside this block > ctx.prec = precision > > yield calculate_something() > yield calculate_something_else() > > Decimal context is using a TLS to store the state, and because TLS is > not aware of generators, the state can leak. If a user iterates over > the ``calculate()`` generator with different precisions one by one > using a ``zip()`` built-in, the above code will not work correctly. > For example:: > > g1 = calculate(precision=100) > g2 = calculate(precision=50) > > items = list(zip(g1, g2)) > > # items[0] will be a tuple of: > # first value from g1 calculated with 100 precision, > # first value from g2 calculated with 50 precision. > # > # items[1] will be a tuple of: > # second value from g1 calculated with 50 precision (!!!), > # second value from g2 calculated with 50 precision. > > An even scarier example would be using decimals to represent money > in an async/await application: decimal calculations can suddenly > lose precision in the middle of processing a request. 
Currently, > bugs like this are extremely hard to find and fix. > > Another common need for web applications is to have access to the > current request object, or security context, or, simply, the request > URL for logging or submitting performance tracing data:: > > async def handle_http_request(request): > context.current_http_request = request > > await ... > # Invoke your framework code, render templates, > # make DB queries, etc, and use the global > # 'current_http_request' in that code. > > # This isn't currently possible to do reliably > # in asyncio out of the box. > > These examples are just a few out of many, where a reliable way to > store context data is absolutely needed. > > The inability to use TLS for asynchronous code has lead to > proliferation of ad-hoc solutions, which are limited in scope and > do not support all required use cases. > > Current status quo is that any library, including the standard > library, that uses a TLS, will likely not work as expected in > asynchronous code or with generators (see [3]_ as an example issue.) > > Some languages that have coroutines or generators recommend to > manually pass a ``context`` object to every function, see [1]_ > describing the pattern for Go. This approach, however, has limited > use for Python, where we have a huge ecosystem that was built to work > with a TLS-like context. Moreover, passing the context explicitly > does not work at all for libraries like ``decimal`` or ``numpy``, > which use operator overloading. > > .NET runtime, which has support for async/await, has a generic > solution of this problem, called ``ExecutionContext`` (see [2]_). > On the surface, working with it is very similar to working with a TLS, > but the former explicitly supports asynchronous code. > > > Goals > ===== > > The goal of this PEP is to provide a more reliable alternative to > ``threading.local()``. It should be explicitly designed to work with > Python execution model, equally supporting threads, generators, and > coroutines. > > An acceptable solution for Python should meet the following > requirements: > > * Transparent support for code executing in threads, coroutines, > and generators with an easy to use API. > > * Negligible impact on the performance of the existing code or the > code that will be using the new mechanism. > > * Fast C API for packages like ``decimal`` and ``numpy``. > > Explicit is still better than implicit, hence the new APIs should only > be used when there is no acceptable way of passing the state > explicitly. > > > Specification > ============= > > Execution Context is a mechanism of storing and accessing data specific > to a logical thread of execution. We consider OS threads, > generators, and chains of coroutines (such as ``asyncio.Task``) > to be variants of a logical thread. > > In this specification, we will use the following terminology: > > * **Local Context**, or LC, is a key/value mapping that stores the > context of a logical thread. > > * **Execution Context**, or EC, is an OS-thread-specific dynamic > stack of Local Contexts. > > * **Context Item**, or CI, is an object used to set and get values > from the Execution Context. > > Please note that throughout the specification we use simple > pseudo-code to illustrate how the EC machinery works. The actual > algorithms and data structures that we will use to implement the PEP > are discussed in the `Implementation Strategy`_ section. 
> > > Context Item Object > ------------------- > > The ``sys.new_context_item(description)`` function creates a > new ``ContextItem`` object. The ``description`` parameter is a > ``str``, explaining the nature of the context key for introspection > and debugging purposes. > > ``ContextItem`` objects have the following methods and attributes: > > * ``.description``: read-only description; > > * ``.set(o)`` method: set the value to ``o`` for the context item > in the execution context. > > * ``.get()`` method: return the current EC value for the context item. > Context items are initialized with ``None`` when created, so > this method call never fails. > > The below is an example of how context items can be used:: > > my_context = sys.new_context_item(description='mylib.context') > my_context.set('spam') > Minor suggestion: Could we allow something like `sys.set_new_context_item(description='mylib.context', initial_value='spam')`? That would make it easier for type checkers to infer the type of a ContextItem, and it would save a line of code in the common case. With this modification, the type of new_context_item would be @overload def new_context_item(*, description: str, initial_value: T) -> ContextItem[T]: ... @overload def new_context_item(*, description: str) -> ContextItem[Any]: ... If we only allow the second variant, type checkers would need some sort of special casing to figure out that after .set(), .get() will return the same type. > # Later, to access the value of my_context: > print(my_context.get()) > > > Thread State and Multi-threaded code > ------------------------------------ > > Execution Context is implemented on top of Thread-local Storage. > For every thread there is a separate stack of Local Contexts -- > mappings of ``ContextItem`` objects to their values in the LC. > New threads always start with an empty EC. > > For CPython:: > > PyThreadState: > execution_context: ExecutionContext([ > LocalContext({ci1: val1, ci2: val2, ...}), > ... > ]) > > The ``ContextItem.get()`` and ``.set()`` methods are defined as > follows (in pseudo-code):: > > class ContextItem: > > def get(self): > tstate = PyThreadState_Get() > > for local_context in reversed(tstate.execution_context): > if self in local_context: > return local_context[self] > > def set(self, value): > tstate = PyThreadState_Get() > > if not tstate.execution_context: > tstate.execution_context = [LocalContext()] > > tstate.execution_context[-1][self] = value > > With the semantics defined so far, the Execution Context can already > be used as an alternative to ``threading.local()``:: > > def print_foo(): > print(ci.get() or 'nothing') > > ci = sys.new_context_item(description='test') > ci.set('foo') > > # Will print "foo": > print_foo() > > # Will print "nothing": > threading.Thread(target=print_foo).start() > > > Manual Context Management > ------------------------- > > Execution Context is generally managed by the Python interpreter, > but sometimes it is desirable for the user to take the control > over it. A few examples when this is needed: > > * running a computation in ``concurrent.futures.ThreadPoolExecutor`` > with the current EC; > > * reimplementing generators with iterators (more on that later); > > * managing contexts in asynchronous frameworks (implement proper > EC support in ``asyncio.Task`` and ``asyncio.loop.call_soon``.) 
> > For these purposes we add a set of new APIs (they will be used in > later sections of this specification): > > * ``sys.new_local_context()``: create an empty ``LocalContext`` > object. > > * ``sys.new_execution_context()``: create an empty > ``ExecutionContext`` object. > > * Both ``LocalContext`` and ``ExecutionContext`` objects are opaque > to Python code, and there are no APIs to modify them. > > * ``sys.get_execution_context()`` function. The function returns a > copy of the current EC: an ``ExecutionContext`` instance. > > The runtime complexity of the actual implementation of this function > can be O(1), but for the purposes of this section it is equivalent > to:: > > def get_execution_context(): > tstate = PyThreadState_Get() > return copy(tstate.execution_context) > > * ``sys.run_with_execution_context(ec: ExecutionContext, func, *args, > **kwargs)`` runs ``func(*args, **kwargs)`` in the provided execution > context:: > > def run_with_execution_context(ec, func, *args, **kwargs): > tstate = PyThreadState_Get() > > old_ec = tstate.execution_context > > tstate.execution_context = ExecutionContext( > ec.local_contexts + [LocalContext()] > ) > > try: > return func(*args, **kwargs) > finally: > tstate.execution_context = old_ec > > Any changes to Local Context by ``func`` will be ignored. > This allows to reuse one ``ExecutionContext`` object for multiple > invocations of different functions, without them being able to > affect each other's environment:: > > ci = sys.new_context_item('example') > ci.set('spam') > > def func(): > print(ci.get()) > ci.set('ham') > > ec = sys.get_execution_context() > > sys.run_with_execution_context(ec, func) > sys.run_with_execution_context(ec, func) > > # Will print: > # spam > # spam > > * ``sys.run_with_local_context(lc: LocalContext, func, *args, > **kwargs)`` runs ``func(*args, **kwargs)`` in the current execution > context using the specified local context. > > Any changes that ``func`` does to the local context will be > persisted in ``lc``. This behaviour is different from the > ``run_with_execution_context()`` function, which always creates > a new throw-away local context. > > In pseudo-code:: > > def run_with_local_context(lc, func, *args, **kwargs): > tstate = PyThreadState_Get() > > old_ec = tstate.execution_context > > tstate.execution_context = ExecutionContext( > old_ec.local_contexts + [lc] > ) > > try: > return func(*args, **kwargs) > finally: > tstate.execution_context = old_ec > > Using the previous example:: > > ci = sys.new_context_item('example') > ci.set('spam') > > def func(): > print(ci.get()) > ci.set('ham') > > ec = sys.get_execution_context() > lc = sys.new_local_context() > > sys.run_with_local_context(lc, func) > sys.run_with_local_context(lc, func) > > # Will print: > # spam > # ham > > As an example, let's make a subclass of > ``concurrent.futures.ThreadPoolExecutor`` that preserves the execution > context for scheduled functions:: > > class Executor(concurrent.futures.ThreadPoolExecutor): > > def submit(self, fn, *args, **kwargs): > context = sys.get_execution_context() > > fn = functools.partial( > sys.run_with_execution_context, context, > fn, *args, **kwargs) > > return super().submit(fn) > > > EC Semantics for Coroutines > --------------------------- > > Python :pep:`492` coroutines are used to implement cooperative > multitasking. For a Python end-user they are similar to threads, > especially when it comes to sharing resources or modifying > the global state. > > An event loop is needed to schedule coroutines. 
Coroutines that > are explicitly scheduled by the user are usually called Tasks. > When a coroutine is scheduled, it can schedule other coroutines using > an ``await`` expression. In async/await world, awaiting a coroutine > is equivalent to a regular function call in synchronous code. Thus, > Tasks are similar to threads. > > By drawing a parallel between regular multithreaded code and > async/await, it becomes apparent that any modification of the > execution context within one Task should be visible to all coroutines > scheduled within it. Any execution context modifications, however, > must not be visible to other Tasks executing within the same OS > thread. > > > Coroutine Object Modifications > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > To achieve this, a small set of modifications to the coroutine object > is needed: > > * New ``cr_local_context`` attribute. This attribute is readable > and writable for Python code. > > * When a coroutine object is instantiated, its ``cr_local_context`` > is initialized with an empty Local Context. > > * Coroutine's ``.send()`` and ``.throw()`` methods are modified as > follows (in pseudo-C):: > > if coro.cr_local_context is not None: > tstate = PyThreadState_Get() > > tstate.execution_context.push(coro.cr_local_context) > > try: > # Perform the actual `Coroutine.send()` or > # `Coroutine.throw()` call. > return coro.send(...) > finally: > coro.cr_local_context = tstate.execution_context.pop() > else: > # Perform the actual `Coroutine.send()` or > # `Coroutine.throw()` call. > return coro.send(...) > > * When Python interpreter sees an ``await`` instruction, it inspects > the ``cr_local_context`` attribute of the coroutine that is about > to be awaited. For ``await coro``: > > * If ``coro.cr_local_context`` is an empty ``LocalContext`` object > that ``coro`` was created with, the interpreter will set > ``coro.cr_local_context`` to ``None``. > > * If ``coro.cr_local_context`` was modified by Python code, the > interpreter will leave it as is. > > This makes any changes to execution context made by nested coroutine > calls within a Task to be visible throughout the Task:: > > ci = sys.new_context_item('example') > > async def nested(): > ci.set('nested') > > asynd def main(): > ci.set('main') > print('before:', ci.get()) > await nested() > print('after:', ci.get()) > > # Will print: > # before: main > # after: nested > > Essentially, coroutines work with Execution Context items similarly > to threads, and ``await`` expression acts like a function call. > > This mechanism also works for ``yield from`` in generators decorated > with ``@types.coroutine`` or ``@asyncio.coroutine``, which are > called "generator-based coroutines" according to :pep:`492`, > and should be fully compatible with native async/await coroutines. > > > Tasks > ^^^^^ > > In asynchronous frameworks like asyncio, coroutines are run by > an event loop, and need to be explicitly scheduled (in asyncio > coroutines are run by ``asyncio.Task``.) > > With the currently defined semantics, the interpreter makes > coroutines linked by an ``await`` expression share the same > Local Context. 
> > The interpreter, however, is not aware of the Task concept, and > cannot help with ensuring that new Tasks started in coroutines, > use the correct EC:: > > current_request = sys.new_context_item(description='request') > > async def child(): > print('current request:', repr(current_request.get())) > > async def handle_request(request): > current_request.set(request) > event_loop.create_task(child) > > run(top_coro()) > > # Will print: > # current_request: None > > To enable correct Execution Context propagation into Tasks, the > asynchronous framework needs to assist the interpreter: > > * When ``create_task`` is called, it should capture the current > execution context with ``sys.get_execution_context()`` and save it > on the Task object. > > * When the Task object runs its coroutine object, it should execute > ``.send()`` and ``.throw()`` methods within the captured > execution context, using the ``sys.run_with_execution_context()`` > function. > > With help from the asynchronous framework, the above snippet will > run correctly, and the ``child()`` coroutine will be able to access > the current request object through the ``current_request`` > Context Item. > > > Event Loop Callbacks > ^^^^^^^^^^^^^^^^^^^^ > > Similarly to Tasks, functions like asyncio's ``loop.call_soon()`` > should capture the current execution context with > ``sys.get_execution_context()`` and execute callbacks > within it with ``sys.run_with_execution_context()``. > > This way the following code will work:: > > current_request = sys.new_context_item(description='request') > > def log(): > request = current_request.get() > print(request) > > async def request_handler(request): > current_request.set(request) > get_event_loop.call_soon(log) > > > Generators > ---------- > > Generators in Python, while similar to Coroutines, are used in a > fundamentally different way. They are producers of data, and > they use ``yield`` expression to suspend/resume their execution. > > A crucial difference between ``await coro`` and ``yield value`` is > that the former expression guarantees that the ``coro`` will be > executed fully, while the latter is producing ``value`` and > suspending the generator until it gets iterated again. > > Generators, similarly to coroutines, have a ``gi_local_context`` > attribute, which is set to an empty Local Context when created. > > Contrary to coroutines though, ``yield from o`` expression in > generators (that are not generator-based coroutines) is semantically > equivalent to ``for v in o: yield v``, therefore the interpreter does > not attempt to control their ``gi_local_context``. > > > EC Semantics for Generators > ^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > Every generator object has its own Local Context that stores > only its own local modifications of the context. When a generator > is being iterated, its local context will be put in the EC stack > of the current thread. 
This means that the generator will be able > to see access items from the surrounding context:: > > local = sys.new_context_item("local") > global = sys.new_context_item("global") > > def generator(): > local.set('inside gen:') > while True: > print(local.get(), global.get()) > yield > > g = gen() > > local.set('hello') > global.set('spam') > next(g) > > local.set('world') > global.set('ham') > next(g) > > # Will print: > # inside gen: spam > # inside gen: ham > > Any changes to the EC in nested generators are invisible to the outer > generator:: > > local = sys.new_context_item("local") > > def inner_gen(): > local.set('spam') > yield > > def outer_gen(): > local.set('ham') > yield from gen() > print(local.get()) > > list(outer_gen()) > > # Will print: > # ham > > > Running generators without LC > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > Similarly to coroutines, generators with ``gi_local_context`` > set to ``None`` simply use the outer Local Context. > > The ``@contextlib.contextmanager`` decorator uses this mechanism to > allow its generator to affect the EC:: > > item = sys.new_context_item('test') > > @contextmanager > def context(x): > old = item.get() > item.set('x') > try: > yield > finally: > item.set(old) > > with context('spam'): > > with context('ham'): > print(1, item.get()) > > print(2, item.get()) > > # Will print: > # 1 ham > # 2 spam > > > Implementing Generators with Iterators > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > The Execution Context API allows to fully replicate EC behaviour > imposed on generators with a regular Python iterator class:: > > class Gen: > > def __init__(self): > self.local_context = sys.new_local_context() > > def __iter__(self): > return self > > def __next__(self): > return sys.run_with_local_context( > self.local_context, self._next_impl) > > def _next_impl(self): > # Actual __next__ implementation. > ... > > > Asynchronous Generators > ----------------------- > > Asynchronous Generators (AG) interact with the Execution Context > similarly to regular generators. > > They have an ``ag_local_context`` attribute, which, similarly to > regular generators, can be set to ``None`` to make them use the outer > Local Context. This is used by the new > ``contextlib.asynccontextmanager`` decorator. > > The EC support of ``await`` expression is implemented using the same > approach as in coroutines, see the `Coroutine Object Modifications`_ > section. > > > Greenlets > --------- > > Greenlet is an alternative implementation of cooperative > scheduling for Python. Although greenlet package is not part of > CPython, popular frameworks like gevent rely on it, and it is > important that greenlet can be modified to support execution > contexts. > > In a nutshell, greenlet design is very similar to design of > generators. The main difference is that for generators, the stack > is managed by the Python interpreter. Greenlet works outside of the > Python interpreter, and manually saves some ``PyThreadState`` > fields and pushes/pops the C-stack. Thus the ``greenlet`` package > can be easily updated to use the new low-level `C API`_ to enable > full support of EC. > > > New APIs > ======== > > Python > ------ > > Python APIs were designed to completely hide the internal > implementation details, but at the same time provide enough control > over EC and LC to re-implement all of Python built-in objects > in pure Python. > > 1. ``sys.new_context_item(description='...')``: create a > ``ContextItem`` object used to access/set values in EC. > > 2. 
``ContextItem``: > > * ``.description``: read-only attribute. > * ``.get()``: return the current value for the item. > * ``.set(o)``: set the current value in the EC for the item. > > 3. ``sys.get_execution_context()``: return the current > ``ExecutionContext``. > > 4. ``sys.new_execution_context()``: create a new empty > ``ExecutionContext``. > > 5. ``sys.new_local_context()``: create a new empty ``LocalContext``. > > 6. ``sys.run_with_execution_context(ec: ExecutionContext, > func, *args, **kwargs)``. > > 7. ``sys.run_with_local_context(lc:LocalContext, > func, *args, **kwargs)``. > > > C API > ----- > > 1. ``PyContextItem * PyContext_NewItem(char *desc)``: create a > ``PyContextItem`` object. > > 2. ``PyObject * PyContext_GetItem(PyContextItem *)``: get the > current value for the context item. > > 3. ``int PyContext_SetItem(PyContextItem *, PyObject *)``: set > the current value for the context item. > > 4. ``PyLocalContext * PyLocalContext_New()``: create a new empty > ``PyLocalContext``. > > 5. ``PyLocalContext * PyExecutionContext_New()``: create a new empty > ``PyExecutionContext``. > > 6. ``PyExecutionContext * PyExecutionContext_Get()``: get the > EC for the active thread state. > > 7. ``int PyExecutionContext_Set(PyExecutionContext *)``: set the > passed EC object as the current for the active thread state. > > 8. ``int PyExecutionContext_SetWithLocalContext(PyExecutionContext *, > PyLocalContext *)``: allows to implement > ``sys.run_with_local_context`` Python API. > > > Implementation Strategy > ======================= > > LocalContext is a Weak Key Mapping > ---------------------------------- > > Using a weak key mapping for ``LocalContext`` implementation > enables the following properties with regards to garbage > collection: > > * ``ContextItem`` objects are strongly-referenced only from the > application code, not from any of the Execution Context > machinery or values they point to. This means that there > are no reference cycles that could extend their lifespan > longer than necessary, or prevent their garbage collection. > > * Values put in the Execution Context are guaranteed to be kept > alive while there is a ``ContextItem`` key referencing them in > the thread. > > * If a ``ContextItem`` is garbage collected, all of its values will > be removed from all contexts, allowing them to be GCed if needed. > > * If a thread has ended its execution, its thread state will be > cleaned up along with its ``ExecutionContext``, cleaning > up all values bound to all Context Items in the thread. > > > ContextItem.get() Cache > ----------------------- > > We can add three new fields to ``PyThreadState`` and > ``PyInterpreterState`` structs: > > * ``uint64_t PyThreadState->unique_id``: a globally unique > thread state identifier (we can add a counter to > ``PyInterpreterState`` and increment it when a new thread state is > created.) > > * ``uint64_t PyInterpreterState->context_item_deallocs``: every time > a ``ContextItem`` is GCed, all Execution Contexts in all threads > will lose track of it. ``context_item_deallocs`` will simply > count all ``ContextItem`` deallocations. > > * ``uint64_t PyThreadState->execution_context_ver``: every time > a new item is set, or an existing item is updated, or the stack > of execution contexts is changed in the thread, we increment this > counter. 
> > The above two fields allow implementing a fast cache path in > ``ContextItem.get()``, in pseudo-code:: > > class ContextItem: > > def get(self): > tstate = PyThreadState_Get() > > if (self.last_tstate_id == tstate.unique_id and > self.last_ver == tstate.execution_context_ver > self.last_deallocs == > tstate.iterp.context_item_deallocs): > return self.last_value > > value = None > for mapping in reversed(tstate.execution_context): > if self in mapping: > value = mapping[self] > break > > self.last_value = value > self.last_tstate_id = tstate.unique_id > self.last_ver = tstate.execution_context_ver > self.last_deallocs = tstate.interp.context_item_deallocs > > return value > > This is similar to the trick that decimal C implementation uses > for caching the current decimal context, and will have the same > performance characteristics, but available to all > Execution Context users. > > > Approach #1: Use a dict for LocalContext > ---------------------------------------- > > The straightforward way of implementing the proposed EC > mechanisms is to create a ``WeakKeyDict`` on top of Python > ``dict`` type. > > To implement the ``ExecutionContext`` type we can use Python > ``list`` (or a custom stack implementation with some > pre-allocation optimizations). > > This approach will have the following runtime complexity: > > * O(M) for ``ContextItem.get()``, where ``M`` is the number of > Local Contexts in the stack. > > It is important to note that ``ContextItem.get()`` will implement > a cache making the operation O(1) for packages like ``decimal`` > and ``numpy``. > > * O(1) for ``ContextItem.set()``. > > * O(N) for ``sys.get_execution_context()``, where ``N`` is the > total number of items in the current **execution** context. > > > Approach #2: Use HAMT for LocalContext > -------------------------------------- > > Languages like Clojure and Scala use Hash Array Mapped Tries (HAMT) > to implement high performance immutable collections [5]_, [6]_. > > Immutable mappings implemented with HAMT have O(log\ :sub:`32`\ N) > performance for both ``set()``, ``get()``, and ``merge()`` operations, > which is essentially O(1) for relatively small mappings > (read about HAMT performance in CPython in the > `Appendix: HAMT Performance`_ section.) > > In this approach we use the same design of the ``ExecutionContext`` > as in Approach #1, but we will use HAMT backed weak key Local Context > implementation. With that we will have the following runtime > complexity: > > * O(M * log\ :sub:`32`\ N) for ``ContextItem.get()``, > where ``M`` is the number of Local Contexts in the stack, > and ``N`` is the number of items in the EC. The operation will > essentially be O(M), because execution contexts are normally not > expected to have more than a few dozen of items. > > (``ContextItem.get()`` will have the same caching mechanism as in > Approach #1.) > > * O(log\ :sub:`32`\ N) for ``ContextItem.set()`` where ``N`` is the > number of items in the current **local** context. This will > essentially be an O(1) operation most of the time. > > * O(log\ :sub:`32`\ N) for ``sys.get_execution_context()``, where > ``N`` is the total number of items in the current **execution** > context. > > Essentially, using HAMT for Local Contexts instead of Python dicts, > allows to bring down the complexity of ``sys.get_execution_context()`` > from O(N) to O(log\ :sub:`32`\ N) because of the more efficient > merge algorithm. 
> > > Approach #3: Use HAMT and Immutable Linked List > ----------------------------------------------- > > We can make an alternative ``ExecutionContext`` design by using > a linked list. Each ``LocalContext`` in the ``ExecutionContext`` > object will be wrapped in a linked-list node. > > ``LocalContext`` objects will use an HAMT backed weak key > implementation described in the Approach #2. > > Every modification to the current ``LocalContext`` will produce a > new version of it, which will be wrapped in a **new linked list > node**. Essentially this means, that ``ExecutionContext`` is an > immutable forest of ``LocalContext`` objects, and can be safely > copied by reference in ``sys.get_execution_context()`` (eliminating > the expensive "merge" operation.) > > With this approach, ``sys.get_execution_context()`` will be an > **O(1) operation**. > > > Summary > ------- > > We believe that approach #3 enables an efficient and complete > Execution Context implementation, with excellent runtime performance. > > `ContextItem.get() Cache`_ enables fast retrieval of context items > for performance critical libraries like decimal and numpy. > > Fast ``sys.get_execution_context()`` enables efficient management > of execution contexts in asynchronous libraries like asyncio. > > > Design Considerations > ===================== > > Can we fix ``PyThreadState_GetDict()``? > --------------------------------------- > > ``PyThreadState_GetDict`` is a TLS, and some of its existing users > might depend on it being just a TLS. Changing its behaviour to follow > the Execution Context semantics would break backwards compatibility. > > > PEP 521 > ------- > > :pep:`521` proposes an alternative solution to the problem: > enhance Context Manager Protocol with two new methods: ``__suspend__`` > and ``__resume__``. To make it compatible with async/await, > the Asynchronous Context Manager Protocol will also need to be > extended with ``__asuspend__`` and ``__aresume__``. > > This allows to implement context managers like decimal context and > ``numpy.errstate`` for generators and coroutines. > > The following code:: > > class Context: > > def __enter__(self): > self.old_x = get_execution_context_item('x') > set_execution_context_item('x', 'something') > > def __exit__(self, *err): > set_execution_context_item('x', self.old_x) > > would become this:: > > local = threading.local() > > class Context: > > def __enter__(self): > self.old_x = getattr(local, 'x', None) > local.x = 'something' > > def __suspend__(self): > local.x = self.old_x > > def __resume__(self): > local.x = 'something' > > def __exit__(self, *err): > local.x = self.old_x > > Besides complicating the protocol, the implementation will likely > negatively impact performance of coroutines, generators, and any code > that uses context managers, and will notably complicate the > interpreter implementation. > > :pep:`521` also does not provide any mechanism to propagate state > in a local context, like storing a request object in an HTTP request > handler to have better logging. Nor does it solve the leaking state > problem for greenlet/gevent. > > > Can Execution Context be implemented outside of CPython? > -------------------------------------------------------- > > Because async/await code needs an event loop to run it, an EC-like > solution can be implemented in a limited way for coroutines. 
> > Generators, on the other hand, do not have an event loop or > trampoline, making it impossible to intercept their ``yield`` points > outside of the Python interpreter. > > > Backwards Compatibility > ======================= > > This proposal preserves 100% backwards compatibility. > > > Appendix: HAMT Performance > ========================== > > To assess if HAMT can be used for Execution Context, we implemented > it in CPython [7]_. > > .. figure:: pep-0550-hamt_vs_dict.png > :align: center > :width: 100% > > Figure 1. Benchmark code can be found here: [9]_. > > Figure 1 shows that HAMT indeed displays O(1) performance for all > benchmarked dictionary sizes. For dictionaries with less than 100 > items, HAMT is a bit slower than Python dict/shallow copy. > > .. figure:: pep-0550-lookup_hamt.png > :align: center > :width: 100% > > Figure 2. Benchmark code can be found here: [10]_. > > Figure 2 shows comparison of lookup costs between Python dict > and an HAMT immutable mapping. HAMT lookup time is 30-40% worse > than Python dict lookups on average, which is a very good result, > considering how well Python dicts are optimized. > > Note, that according to [8]_, HAMT design can be further improved. > > > Acknowledgments > =============== > > I thank Elvis Pranskevichus and Victor Petrovykh for countless > discussions around the topic and PEP proof reading and edits. > > Thanks to Nathaniel Smith for proposing the ``ContextItem`` design > [17]_ [18]_, for pushing the PEP towards a more complete design, and > coming up with the idea of having a stack of contexts in the thread > state. > > Thanks to Nick Coghlan for numerous suggestions and ideas on the > mailing list, and for coming up with a case that cause the complete > rewrite of the initial PEP version [19]_. > > > References > ========== > > .. [1] https://blog.golang.org/context > > .. [2] https://msdn.microsoft.com/en-us/library/system.threading. > executioncontext.aspx > > .. [3] https://github.com/numpy/numpy/issues/9444 > > .. [4] http://bugs.python.org/issue31179 > > .. [5] https://en.wikipedia.org/wiki/Hash_array_mapped_trie > > .. [6] http://blog.higher-order.net/2010/08/16/assoc-and-clojures- > persistenthashmap-part-ii.html > > .. [7] https://github.com/1st1/cpython/tree/hamt > > .. [8] https://michael.steindorfer.name/publications/oopsla15.pdf > > .. [9] https://gist.github.com/1st1/9004813d5576c96529527d44c5457dcd > > .. [10] https://gist.github.com/1st1/dbe27f2e14c30cce6f0b5fddfc8c437e > > .. [11] https://github.com/1st1/cpython/tree/pep550 > > .. [12] https://www.python.org/dev/peps/pep-0492/#async-await > > .. [13] https://github.com/MagicStack/uvloop/blob/master/examples/ > bench/echoserver.py > > .. [14] https://github.com/MagicStack/pgbench > > .. [15] https://github.com/python/performance > > .. [16] https://gist.github.com/1st1/6b7a614643f91ead3edf37c4451a6b4c > > .. [17] https://mail.python.org/pipermail/python-ideas/2017- > August/046752.html > > .. [18] https://mail.python.org/pipermail/python-ideas/2017- > August/046772.html > > .. [19] https://mail.python.org/pipermail/python-ideas/2017- > August/046780.html > > > Copyright > ========= > > This document has been placed in the public domain. > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From njs at pobox.com Wed Aug 16 03:18:23 2017 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 16 Aug 2017 00:18:23 -0700 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On Tue, Aug 15, 2017 at 4:55 PM, Yury Selivanov wrote: > Hi, > > Here's the PEP 550 version 2. Awesome! Some of the changes from v1 to v2 might be a bit confusing -- in particular the thing where ExecutionContext is now a stack of LocalContext objects instead of just being a mapping. So here's the big picture as I understand it: In discussions on the mailing list and off-line, we realized that the main reason people use "thread locals" is to implement fake dynamic scoping. Of course, generators/async/await mean that currently it's impossible to *really* fake dynamic scoping in Python -- that's what PEP 550 is trying to fix. So PEP 550 v1 essentially added "generator locals" as a refinement of "thread locals". But... it turns out that "generator locals" aren't enough to properly implement dynamic scoping either! So the goal in PEP 550 v2 is to provide semantics strong enough to *really* get this right. I wrote up some notes on what I mean by dynamic scoping, and why neither thread-locals nor generator-locals can fake it: https://github.com/njsmith/pep-550-notes/blob/master/dynamic-scope.ipynb > Specification > ============= > > Execution Context is a mechanism of storing and accessing data specific > to a logical thread of execution. We consider OS threads, > generators, and chains of coroutines (such as ``asyncio.Task``) > to be variants of a logical thread. > > In this specification, we will use the following terminology: > > * **Local Context**, or LC, is a key/value mapping that stores the > context of a logical thread. If you're more familiar with dynamic scoping, then you can think of an LC as a single dynamic scope... > * **Execution Context**, or EC, is an OS-thread-specific dynamic > stack of Local Contexts. ...and an EC as a stack of scopes. Looking up a ContextItem in an EC proceeds by checking the first LC (innermost scope), then if it doesn't find what it's looking for it checks the second LC (the next-innermost scope), etc. > ``ContextItem`` objects have the following methods and attributes: > > * ``.description``: read-only description; > > * ``.set(o)`` method: set the value to ``o`` for the context item > in the execution context. > > * ``.get()`` method: return the current EC value for the context item. > Context items are initialized with ``None`` when created, so > this method call never fails. Two issues here, that both require some expansion of this API to reveal a *bit* more information about the EC structure. 1) For trio's cancel scope use case I described in the last, I actually need some way to read out all the values on the LocalContext stack. (It would also be helpful if there were some fast way to check the depth of the ExecutionContext stack -- or at least tell whether it's 1 deep or more-than-1 deep. I know that any cancel scopes that are in the bottommost LC will always be attached to the given Task, so I can set up the scope->task mapping once and re-use it indefinitely. OTOH for scopes that are stored in higher LCs, I have to check at every yield whether they're currently in effect. And I want to minimize the per-yield workload as much as possible.) 2) For classic decimal.localcontext context managers, the idea is still that you save/restore the value, so that you can nest multiple context managers without having to push/pop LCs all the time. 
But the above API is not actually sufficient to implement a proper save/restore, for a subtle reason: if you do ci.set(ci.get()) then you just (potentially) moved the value from a lower LC up to the top LC. Here's an example of a case where this can produce user-visible effects: https://github.com/njsmith/pep-550-notes/blob/master/dynamic-scope-on-top-of-pep-550-draft-2.py There are probably a bunch of options for fixing this. But basically we need some API that makes it possible to temporarily set a value in the top LC, and then restore that value to what it was before (either the previous value, or 'unset' to unshadow a value in a lower LC). One simple option would be to make the idiom be something like: @contextmanager def local_value(new_value): state = ci.get_local_state() ci.set(new_value) try: yield finally: ci.set_local_state(state) where 'state' is something like a tuple (ci in EC[-1], EC[-1].get(ci)). A downside with this is that it's a bit error-prone (very easy for an unwary user to accidentally use get/set instead of get_local_state/set_local_state). But I'm sure we can come up with something. > Manual Context Management > ------------------------- > > Execution Context is generally managed by the Python interpreter, > but sometimes it is desirable for the user to take the control > over it. A few examples when this is needed: > > * running a computation in ``concurrent.futures.ThreadPoolExecutor`` > with the current EC; > > * reimplementing generators with iterators (more on that later); > > * managing contexts in asynchronous frameworks (implement proper > EC support in ``asyncio.Task`` and ``asyncio.loop.call_soon``.) > > For these purposes we add a set of new APIs (they will be used in > later sections of this specification): > > * ``sys.new_local_context()``: create an empty ``LocalContext`` > object. > > * ``sys.new_execution_context()``: create an empty > ``ExecutionContext`` object. > > * Both ``LocalContext`` and ``ExecutionContext`` objects are opaque > to Python code, and there are no APIs to modify them. > > * ``sys.get_execution_context()`` function. The function returns a > copy of the current EC: an ``ExecutionContext`` instance. If there are enough of these functions then it might make sense to stick them in their own module instead of adding more stuff to sys. I guess worrying about that can wait until the API details are more firm though. > * If ``coro.cr_local_context`` is an empty ``LocalContext`` object > that ``coro`` was created with, the interpreter will set > ``coro.cr_local_context`` to ``None``. I like all the ideas in this section, but this specific point feels a bit weird. Coroutine objects need a second hidden field somewhere to keep track of whether the object they end up with is the same one they were created with? If I set cr_local_context to something else, and then set it back to the original value, does that trigger the magic await behavior or not? What if I take the initial LocalContext off of one coroutine and attach it to another, does that trigger the magic await behavior? Maybe it would make more sense to have two sentinel values: UNINITIALIZED and INHERIT? > To enable correct Execution Context propagation into Tasks, the > asynchronous framework needs to assist the interpreter: > > * When ``create_task`` is called, it should capture the current > execution context with ``sys.get_execution_context()`` and save it > on the Task object. 
I wonder if it would be useful to have an option to squash this execution context down into a single LocalContext, since we know we'll be using it for a while and once we've copied an ExecutionContext it becomes impossible to tell the difference between one that has lots of internal LocalContexts and one that doesn't. This could also be handy for trio/curio's semantics where they initialize a new task's context to be a shallow copy of the parent task: you could do new_task_coro.cr_local_context = get_current_context().squash() and then skip having to wrap every send() call in a run_in_context. > Generators > ---------- > > Generators in Python, while similar to Coroutines, are used in a > fundamentally different way. They are producers of data, and > they use ``yield`` expression to suspend/resume their execution. > > A crucial difference between ``await coro`` and ``yield value`` is > that the former expression guarantees that the ``coro`` will be > executed fully, while the latter is producing ``value`` and > suspending the generator until it gets iterated again. > > Generators, similarly to coroutines, have a ``gi_local_context`` > attribute, which is set to an empty Local Context when created. > > Contrary to coroutines though, ``yield from o`` expression in > generators (that are not generator-based coroutines) is semantically > equivalent to ``for v in o: yield v``, therefore the interpreter does > not attempt to control their ``gi_local_context``. Hmm. I assume you're simplifying for expository purposes, but 'yield from' isn't the same as 'for v in o: yield v'. In fact PEP 380 says: "Motivation: [...] a piece of code containing a yield cannot be factored out and put into a separate function in the same way as other code. [...] If yielding of values is the only concern, this can be performed without much difficulty using a loop such as 'for v in g: yield v'. However, if the subgenerator is to interact properly with the caller in the case of calls to send(), throw() and close(), things become considerably more difficult. As will be seen later, the necessary code is very complicated, and it is tricky to handle all the corner cases correctly." So it seems to me that the whole idea of 'yield from' is that it's supposed to handle all the tricky bits needed to guarantee that if you take some code out of a generator and refactor it into a subgenerator, then everything works the same as before. This suggests that 'yield from' should do the same magic as 'await', where by default the subgenerator shares the same LocalContext as the parent generator. (And as a bonus it makes things simpler if 'yield from' and 'await' work the same.) > Asynchronous Generators > ----------------------- > > Asynchronous Generators (AG) interact with the Execution Context > similarly to regular generators. > > They have an ``ag_local_context`` attribute, which, similarly to > regular generators, can be set to ``None`` to make them use the outer > Local Context. This is used by the new > ``contextlib.asynccontextmanager`` decorator. > > The EC support of ``await`` expression is implemented using the same > approach as in coroutines, see the `Coroutine Object Modifications`_ > section. You showed how to make an iterator that acts like a generator. Is it also possible to make an async iterator that acts like an async generator? It's not immediately obvious, because you need to make sure that the local context gets restored each time you re-enter the __anext__ generator. 
I think it's something like: class AIter: def __init__(self): self._local_context = ... # Note: intentionally not async def __anext__(self): coro = self._real_anext() coro.cr_local_context = self._local_context return coro async def _real_anext(self): ... Does that look right? > ContextItem.get() Cache > ----------------------- > > We can add three new fields to ``PyThreadState`` and > ``PyInterpreterState`` structs: > > * ``uint64_t PyThreadState->unique_id``: a globally unique > thread state identifier (we can add a counter to > ``PyInterpreterState`` and increment it when a new thread state is > created.) > > * ``uint64_t PyInterpreterState->context_item_deallocs``: every time > a ``ContextItem`` is GCed, all Execution Contexts in all threads > will lose track of it. ``context_item_deallocs`` will simply > count all ``ContextItem`` deallocations. > > * ``uint64_t PyThreadState->execution_context_ver``: every time > a new item is set, or an existing item is updated, or the stack > of execution contexts is changed in the thread, we increment this > counter. I think this can be refined further (and I don't understand context_item_deallocs -- maybe it's a mistake?). AFAICT the things that invalidate a ContextItem's cache are: 1) switching threadstates 2) popping or pushing a non-empty LocalContext off the current threadstate's ExecutionContext 3) calling ContextItem.set() on *that* context item So I'd suggest tracking the thread state id, a counter of how many non-empty LocalContexts have been pushed/popped on this thread state, and a *per ContextItem* counter of how many times set() has been called. > Backwards Compatibility > ======================= > > This proposal preserves 100% backwards compatibility. While this is mostly true in the strict sense, in practice this PEP is useless if existing thread-local users like decimal and numpy can't migrate to it without breaking backcompat. So maybe this section should discuss that? (For example, one constraint on the design is that we can't provide only a pure push/pop API, even though that's what would be most convenient context managers like decimal.localcontext or numpy.errstate, because we also need to provide some backcompat story for legacy functions like decimal.setcontext and numpy.seterr.) -n -- Nathaniel J. Smith -- https://vorpus.org From ncoghlan at gmail.com Wed Aug 16 04:07:59 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 16 Aug 2017 18:07:59 +1000 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: TLDR: I really like this version, and the tweaks I suggest below are just cosmetic. I figure if there are any major technical traps lurking, you'll find them as you work through updating the reference implementation. On 16 August 2017 at 09:55, Yury Selivanov wrote: > Context Item Object > ------------------- > > The ``sys.new_context_item(description)`` function creates a > new ``ContextItem`` object. The ``description`` parameter is a > ``str``, explaining the nature of the context key for introspection > and debugging purposes. > > ``ContextItem`` objects have the following methods and attributes: > > * ``.description``: read-only description; It may be worth having separate "name" and "description" attributes, similar to __name__ and __doc__ being separate on things like functions. That way, error messages can just show "name", while debuggers and other introspection tools can include a more detailed description. 
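To spell out the cache-invalidation refinement suggested earlier in
the thread (track the thread state id, an LC push/pop counter, and a
per-item set() counter), here is a rough pure-Python sketch -- the
names are hypothetical, and the real version would live in C on the
thread state:

    class ContextItem:
        def __init__(self):
            self._set_count = 0      # bumped by every .set() on this item
            self._cache_key = None   # (ts_id, lc_stack_version, set_count)
            self._cached_value = None

        def set(self, value, ec):
            self._set_count += 1
            ec.set(self, value)

        def get(self, ec):
            key = (ec.ts_id, ec.lc_stack_version, self._set_count)
            if key == self._cache_key:
                return self._cached_value   # fast path: nothing changed
            value = ec.lookup(self)         # slow path: walk the LC stack
            self._cache_key, self._cached_value = key, value
            return value

Here ec.ts_id stands for the globally unique thread state id, and
ec.lc_stack_version for a counter bumped whenever a non-empty LC is
pushed or popped on that thread state; each of the three events that
can invalidate the cache changes one component of the key.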
> Coroutine Object Modifications > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > To achieve this, a small set of modifications to the coroutine object > is needed: > > * New ``cr_local_context`` attribute. This attribute is readable > and writable for Python code. For ease of introspection, it's probably worth using a common `__local_context__` attribute name across all the different types that support one, and encouraging other object implementations to do the same. This isn't like cr_await and gi_yieldfrom, where we wanted to use different names because they refer to different kinds of objects. > Acknowledgments > =============== [snip] > Thanks to Nick Coghlan for numerous suggestions and ideas on the > mailing list, and for coming up with a case that cause the complete > rewrite of the initial PEP version [19]_. [snip] > .. [19] https://mail.python.org/pipermail/python-ideas/2017-August/046780.html The threading in pipermail makes it difficult to get from your reply back to my original comment, so it may be better to link directly to the latter: https://mail.python.org/pipermail/python-ideas/2017-August/046775.html And to be completely explicit about: I like your proposed approach of leaving it up to iterator developers to decide whether or not to run with a local context or not. If they don't manipulate any context items, it won't matter, and if they do, it's straightforward to add a suitable call to sys.run_in_local_context(). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From njs at pobox.com Wed Aug 16 04:37:58 2017 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 16 Aug 2017 01:37:58 -0700 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On Tue, Aug 15, 2017 at 11:53 PM, Jelle Zijlstra wrote: > Minor suggestion: Could we allow something like > `sys.set_new_context_item(description='mylib.context', > initial_value='spam')`? That would make it easier for type checkers to infer > the type of a ContextItem, and it would save a line of code in the common > case. This is a really handy feature in general, actually! In fact all of asyncio's thread-locals define initial values (using a trick involving subclassing threading.local), and I recently added this feature to trio.TaskLocal as well just because it's so convenient. However, something that you realize almost immediately when trying to use this is that in many cases, what you actually want is an initial value *factory*. Like, if you write new_context_item(initial_value=[]) then you're going to have a bad time. So, should we support something like new_context_item(initializer=lambda: [])? The semantics are a little bit subtle. I guess it would be something like: if ci.get() goes to find the value and fails at all levels, then we call the factory function and assign its return value to the *deepest* LC, EC[0]. The idea being that we're pretending that the value was there all along in the outermost scope, you just didn't notice before now. > With this modification, the type of new_context_item would be > > @overload > def new_context_item(*, description: str, initial_value: T) -> > ContextItem[T]: ... > @overload > def new_context_item(*, description: str) -> ContextItem[Any]: ... > > If we only allow the second variant, type checkers would need some sort of > special casing to figure out that after .set(), .get() will return the same > type. I'm not super familiar with PEP 484. Would using a factory function instead of an initial value break this type inference? 
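For what it's worth, the lazy-initializer semantics sketched above are
easy to model on a toy list-of-dicts EC (hypothetical helper; `stack`
is a list of per-LC mappings, innermost last):

    _MISSING = object()   # sentinel: distinguishes "unset" from None

    def get_with_factory(stack, item, factory):
        for mapping in reversed(stack):        # innermost LC first
            value = mapping.get(item, _MISSING)
            if value is not _MISSING:
                return value
        value = factory()         # e.g. factory=list for a fresh []
        stack[0][item] = value    # materialize it in the deepest LC,
        return value              # as if it had been there all along

A factory avoids the shared-mutable-default trap of
new_context_item(initial_value=[]): each execution context that
misses the lookup gets its own fresh list instead of all of them
aliasing a single one.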
If you want to automatically infer that whatever type I use to initialize the value is the only type it can ever have, is there a way for users to easily override that? Like could I write something like my_ci: ContextItem[int, str] = new_context_item(initial_value=0) ? -n -- Nathaniel J. Smith -- https://vorpus.org From jelle.zijlstra at gmail.com Wed Aug 16 04:40:54 2017 From: jelle.zijlstra at gmail.com (Jelle Zijlstra) Date: Wed, 16 Aug 2017 10:40:54 +0200 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: 2017-08-16 10:37 GMT+02:00 Nathaniel Smith : > On Tue, Aug 15, 2017 at 11:53 PM, Jelle Zijlstra > wrote: > > Minor suggestion: Could we allow something like > > `sys.set_new_context_item(description='mylib.context', > > initial_value='spam')`? That would make it easier for type checkers to > infer > > the type of a ContextItem, and it would save a line of code in the common > > case. > > This is a really handy feature in general, actually! In fact all of > asyncio's thread-locals define initial values (using a trick involving > subclassing threading.local), and I recently added this feature to > trio.TaskLocal as well just because it's so convenient. > > However, something that you realize almost immediately when trying to > use this is that in many cases, what you actually want is an initial > value *factory*. Like, if you write new_context_item(initial_value=[]) > then you're going to have a bad time. So, should we support something > like new_context_item(initializer=lambda: [])? > > The semantics are a little bit subtle. I guess it would be something > like: if ci.get() goes to find the value and fails at all levels, then > we call the factory function and assign its return value to the > *deepest* LC, EC[0]. The idea being that we're pretending that the > value was there all along in the outermost scope, you just didn't > notice before now. > > > With this modification, the type of new_context_item would be > > > > @overload > > def new_context_item(*, description: str, initial_value: T) -> > > ContextItem[T]: ... > > @overload > > def new_context_item(*, description: str) -> ContextItem[Any]: ... > > > > If we only allow the second variant, type checkers would need some sort > of > > special casing to figure out that after .set(), .get() will return the > same > > type. > > I'm not super familiar with PEP 484. > > Would using a factory function instead of an initial value break this > type inference? > > If you want to automatically infer that whatever type I use to > initialize the value is the only type it can ever have, is there a way > for users to easily override that? Like could I write something like > > my_ci: ContextItem[int, str] = new_context_item(initial_value=0) > > It would be `ContextItem[Union[int, str]]`, but yes, that should work. > ? > > -n > > -- > Nathaniel J. Smith -- https://vorpus.org > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Wed Aug 16 05:36:03 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 16 Aug 2017 19:36:03 +1000 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On 16 August 2017 at 17:18, Nathaniel Smith wrote: [Yury wrote] >> For these purposes we add a set of new APIs (they will be used in >> later sections of this specification): >> >> * ``sys.new_local_context()``: create an empty ``LocalContext`` >> object. >> >> * ``sys.new_execution_context()``: create an empty >> ``ExecutionContext`` object. 
>> >> * Both ``LocalContext`` and ``ExecutionContext`` objects are opaque >> to Python code, and there are no APIs to modify them. >> >> * ``sys.get_execution_context()`` function. The function returns a >> copy of the current EC: an ``ExecutionContext`` instance. > > If there are enough of these functions then it might make sense to > stick them in their own module instead of adding more stuff to sys. I > guess worrying about that can wait until the API details are more firm > though. I'm actually wondering if it may be worth defining a _contextlib module (to export the interpreter level APIs to Python code), and making contextlib the official home of the user facing API. That we we can use contextlib2 to at least attempt to polyfill the coroutine parts of the proposal for 3.5+, even if the implicit generator changes are restricted to 3.7+ . >> * If ``coro.cr_local_context`` is an empty ``LocalContext`` object >> that ``coro`` was created with, the interpreter will set >> ``coro.cr_local_context`` to ``None``. > > I like all the ideas in this section, but this specific point feels a > bit weird. Coroutine objects need a second hidden field somewhere to > keep track of whether the object they end up with is the same one they > were created with? It feels odd to me as well, and I'm wondering if we can actually simplify this by saying: 1. Generator contexts (both sync and async) are isolated by default (__local_context__ = LocalContext()) 2. Coroutine contexts are *not* isolated by default (__local_context__ = None) Running top level task coroutines in separate execution contexts then becomes the responsibility of the event loop, which the PEP already lists as a required change in 3rd party libraries to get this all to work properly. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ncoghlan at gmail.com Wed Aug 16 05:56:20 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 16 Aug 2017 19:56:20 +1000 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On 16 August 2017 at 18:37, Nathaniel Smith wrote: > On Tue, Aug 15, 2017 at 11:53 PM, Jelle Zijlstra > wrote: >> Minor suggestion: Could we allow something like >> `sys.set_new_context_item(description='mylib.context', >> initial_value='spam')`? That would make it easier for type checkers to infer >> the type of a ContextItem, and it would save a line of code in the common >> case. > > This is a really handy feature in general, actually! In fact all of > asyncio's thread-locals define initial values (using a trick involving > subclassing threading.local), and I recently added this feature to > trio.TaskLocal as well just because it's so convenient. > > However, something that you realize almost immediately when trying to > use this is that in many cases, what you actually want is an initial > value *factory*. Like, if you write new_context_item(initial_value=[]) > then you're going to have a bad time. So, should we support something > like new_context_item(initializer=lambda: [])? > > The semantics are a little bit subtle. I guess it would be something > like: if ci.get() goes to find the value and fails at all levels, then > we call the factory function and assign its return value to the > *deepest* LC, EC[0]. The idea being that we're pretending that the > value was there all along in the outermost scope, you just didn't > notice before now. 
I actually wondered about this in the context of the PEP saying that "context items are set to None by default", as it isn't clear what that means for the behaviour of sys.new_execution_context(). The PEP states that the latter API creates an "empty" execution context, but the notion of a fresh EC being truly empty conflicts with the notion of all defined config items having a default value of None. I think your idea resolves that nicely: if context_item.get() failed to find a suitable context entry, it would do: base_context = ec.local_contexts[0] default_value = sys.run_with_local_context(base_context, self.default_factory) sys.run_with_local_context(base_context, self.set, default_value) The default setting for default_factory could then be to raise RuntimeError complaining that the context item isn't set in the current context. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From arj.python at gmail.com Wed Aug 16 08:06:23 2017 From: arj.python at gmail.com (Abdur-Rahmaan Janhangeer) Date: Wed, 16 Aug 2017 16:06:23 +0400 Subject: [Python-ideas] DOM syntax guide In-Reply-To: References: Message-ID: hum i'm saying that if i write a compiler for python based on the js language, is there any guideline as how to make the syntax more pythonic? Abdur-Rahmaan Janhangeer, Mauritius abdurrahmaanjanhangeer.wordpress.com On 16 Aug 2017 08:41, "Abdur-Rahmaan Janhangeer" wrote: > greetings all, > > i like python and lot and would like to use it everywhere ... upto on the > web (not django type). > > For python js-compiled versions (for makers) can you provide some syntax > guidelines for dom access ? > > > > Abdur-Rahmaan Janhangeer, > Mauritius > abdurrahmaanjanhangeer.wordpress.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From rosuav at gmail.com Wed Aug 16 08:13:16 2017 From: rosuav at gmail.com (Chris Angelico) Date: Wed, 16 Aug 2017 22:13:16 +1000 Subject: [Python-ideas] DOM syntax guide In-Reply-To: References: Message-ID: On Wed, Aug 16, 2017 at 10:06 PM, Abdur-Rahmaan Janhangeer wrote: > hum i'm saying that if i write a compiler for python based on the js > language, is there any guideline as how to make the syntax more pythonic? You may want to look at prior art, including PyPyJS and Brython. https://github.com/pypyjs/pypyjs-examples https://www.brython.info/static_doc/en/dom_api.html ChrisA From stefan at bytereef.org Wed Aug 16 10:25:53 2017 From: stefan at bytereef.org (Stefan Krah) Date: Wed, 16 Aug 2017 16:25:53 +0200 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: <20170816142553.GA2837@bytereef.org> On Wed, Aug 16, 2017 at 12:18:23AM -0700, Nathaniel Smith wrote: > > Here's the PEP 550 version 2. > > Awesome! > > Some of the changes from v1 to v2 might be a bit confusing -- in > particular the thing where ExecutionContext is now a stack of > LocalContext objects instead of just being a mapping. So here's the > big picture as I understand it: I'm still trying to digest this with very little time for it. It *is* slightly confusing. Perhaps it would be possible to name the data structures by their functionality. E.g. if ExecutionContext is a stack, use ExecutionStack? Or if the dynamic scope angle should be highlighted, perhaps ExecutionScope or even DynamicScope. This sounds like bikeshedding, but I find it difficult to have ExecutionContext, ContextItem, LocalContext in addition to the actual decimal.localcontext() and PyDecContext. For example, should PyDecContext inherit from ContextItem? 
I don't fully understand. :-/ Stefan Krah From wes.turner at gmail.com Wed Aug 16 10:31:45 2017 From: wes.turner at gmail.com (Wes Turner) Date: Wed, 16 Aug 2017 09:31:45 -0500 Subject: [Python-ideas] DOM syntax guide In-Reply-To: References: Message-ID: On Wednesday, August 16, 2017, Chris Angelico wrote: > On Wed, Aug 16, 2017 at 10:06 PM, Abdur-Rahmaan Janhangeer > > wrote: > > hum i'm saying that if i write a compiler for python based on the js > > language, is there any guideline as how to make the syntax more pythonic? > > You may want to look at prior art, including PyPyJS and Brython. > > https://github.com/pypyjs/pypyjs-examples > https://www.brython.info/static_doc/en/dom_api.html https://github.com/Knio/dominate - #examples - nested context managers -------------- next part -------------- An HTML attachment was scrubbed... URL: From wes.turner at gmail.com Wed Aug 16 10:35:41 2017 From: wes.turner at gmail.com (Wes Turner) Date: Wed, 16 Aug 2017 09:35:41 -0500 Subject: [Python-ideas] DOM syntax guide In-Reply-To: References: Message-ID: On Wednesday, August 16, 2017, Wes Turner wrote: > > > On Wednesday, August 16, 2017, Chris Angelico > wrote: > >> On Wed, Aug 16, 2017 at 10:06 PM, Abdur-Rahmaan Janhangeer >> wrote: >> > hum i'm saying that if i write a compiler for python based on the js >> > language, is there any guideline as how to make the syntax more >> pythonic? >> >> You may want to look at prior art, including PyPyJS and Brython. >> >> https://github.com/pypyjs/pypyjs-examples >> https://www.brython.info/static_doc/en/dom_api.html > > > https://github.com/Knio/dominate > > - #examples > - nested context managers > > https://pyquery.readthedocs.io/en/latest/ https://pyquery.readthedocs.io/en/latest/api.html - PyQuery supports jQuery-like Pythonic DOM traversal -------------- next part -------------- An HTML attachment was scrubbed... URL: From yselivanov.ml at gmail.com Wed Aug 16 10:48:50 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Wed, 16 Aug 2017 10:48:50 -0400 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On Wed, Aug 16, 2017 at 2:53 AM, Jelle Zijlstra wrote: [..] >> >> The below is an example of how context items can be used:: >> >> my_context = sys.new_context_item(description='mylib.context') >> my_context.set('spam') > > > Minor suggestion: Could we allow something like > `sys.set_new_context_item(description='mylib.context', > initial_value='spam')`? That would make it easier for type checkers to infer > the type of a ContextItem, and it would save a line of code in the common > case. > > With this modification, the type of new_context_item would be > > @overload > def new_context_item(*, description: str, initial_value: T) -> > ContextItem[T]: ... > @overload > def new_context_item(*, description: str) -> ContextItem[Any]: ... > > If we only allow the second variant, type checkers would need some sort of > special casing to figure out that after .set(), .get() will return the same > type. I think that trying to infer the type of CI values by its default value is not the way to go: ci = sys.ContextItem(default=1) Is CI an int? Likely. Can it be set to None? Maybe, for some use-cases it might be what you want. The correct way IMO is to extend the typing module: ci1: typing.ContextItem[int] = sys.ContextItem(default=1) # ci1: is an int, and can't be anything else. ci2: typing.ContextItem[typing.Optional[int]] = sys.ContextItem(default=42) # ci2 is 42 by default, but can be reset to None. 
ci3: typing.ContextItem[typing.Union[int, str]] = sys.ContextItem(default='spam') # ci3 can be an int or str, can't be None. This is also forward compatible with proposals to add a `default_factory` or `initializer` parameter to ContextItems. Yury From yselivanov.ml at gmail.com Wed Aug 16 11:00:43 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Wed, 16 Aug 2017 11:00:43 -0400 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: <20170816142553.GA2837@bytereef.org> References: <20170816142553.GA2837@bytereef.org> Message-ID: On Wed, Aug 16, 2017 at 10:25 AM, Stefan Krah wrote: > On Wed, Aug 16, 2017 at 12:18:23AM -0700, Nathaniel Smith wrote: >> > Here's the PEP 550 version 2. >> >> Awesome! >> >> Some of the changes from v1 to v2 might be a bit confusing -- in >> particular the thing where ExecutionContext is now a stack of >> LocalContext objects instead of just being a mapping. So here's the >> big picture as I understand it: > > I'm still trying to digest this with very little time for it. It *is* > slightly confusing. > > > Perhaps it would be possible to name the data structures by their functionality. > E.g. if ExecutionContext is a stack, use ExecutionStack? > > Or if the dynamic scope angle should be highlighted, perhaps ExecutionScope > or even DynamicScope. I'm -1 on calling this thing a "scope" or "dynamic scope", as I think it will be even more confusing to Python users. When I think of "scoping" I usually think about Python name scopes -- locals, globals, nonlocals, etc. I'm afraid that adding another dimension to this vocabulary won't help anyone. "Context" is an established term for what PEP 550 tries to accomplish. It's used in multiple languages and runtimes, and while researching this topic I didn't see anybody confused with the concept on StackOverflow/etc. > This sounds like bikeshedding, but I find it difficult to have ExecutionContext, > ContextItem, LocalContext in addition to the actual decimal.localcontext() > and PyDecContext. > > > For example, should PyDecContext inherit from ContextItem? I don't fully > understand. :-/ No, you wouldn't be able to extend ContextItem type. The way for decimal it so simply do the following: In Python: _current_ctx = sys.ContextItem('decimal context') # later when you set decimal context _current_ctx.set(DecimalContext) # whenever you need to get the current context dc = _current_ctx.get() In C: PyContextItem * _current_ctx = PyContext_NewItem("decimal context"); if (_current_ctx == NULL) { /* error */ } # later when you set decimal context PyDecContextObject *ctx; ... if (PyContext_SetItem(_current_ctx, (PyObject*)ctx)) { /* error */ } # whenever you need to get the current context PyDecContextObject *ctx = PyContext_GetItem(_current_ctx); if (ctx == NULL) { /* error */ } if (ctx == Py_None) { /* not initialized, nothing is there */ } We didn't really discuss C APIs at this point, and it's very likely that they will be adjusted, but the general idea should stay the same. All in all, the complexity of _decimal.c will only decrease with PEP 550, while getting better support for generators/async. Yury From ncoghlan at gmail.com Wed Aug 16 11:03:21 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 17 Aug 2017 01:03:21 +1000 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: <20170816142553.GA2837@bytereef.org> References: <20170816142553.GA2837@bytereef.org> Message-ID: On 17 August 2017 at 00:25, Stefan Krah wrote: > Perhaps it would be possible to name the data structures by their functionality. > E.g. 
if ExecutionContext is a stack, use ExecutionStack? > > Or if the dynamic scope angle should be highlighted, perhaps ExecutionScope > or even DynamicScope. > > This sounds like bikeshedding, but I find it difficult to have ExecutionContext, > ContextItem, LocalContext in addition to the actual decimal.localcontext() > and PyDecContext. > > For example, should PyDecContext inherit from ContextItem? I don't fully > understand. :-/ Agreed, I don't think we have the terminology quite right yet. For "ContextItem" for example, we may actually be better off calling it "ContextKey", and have the methods be "ck.get_value()" and "ck.set_value()". That would get us closer to the POSIX TSS terminology, and emphasises that the objects themselves are best seen as opaque references to a key that lets you get and set the corresponding value in the active execution context. I do think we should stick with "context" rather than bringing dynamic scopes into the mix - while dynamic scoping *is* an accurate term for what we're doing at a computer science level, Python itself tends to reserve the term scoping for the way the compiler resolves names, which we're deliberately *not* touching here. Avoiding a naming collision with decimal.localcontext() would also be desirable. Yury, what do you think about moving the ExecutionContext name to what the PEP currently calls LocalContext, and renaming the current ExecutionContext type to ExecutionContextChain? The latter name then hints at the collections.ChainMap style behaviour of ck.get_value() lookups, without making any particular claims about what the internal implementation data structures actually are. The run methods could then be sys.run_with_context_chain() (to ignore the current context entirely and use a completely separate context chain) and sys.run_with_active_context() (to append a single execution context onto the end of the current context chain) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From yselivanov.ml at gmail.com Wed Aug 16 11:22:21 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Wed, 16 Aug 2017 11:22:21 -0400 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On Wed, Aug 16, 2017 at 4:07 AM, Nick Coghlan wrote: > TLDR: I really like this version, and the tweaks I suggest below are > just cosmetic. Thanks, Nick! > I figure if there are any major technical traps > lurking, you'll find them as you work through updating the reference > implementation. FWIW I've implemented 3-5 different variations of PEP 550 (along with HAMT) and I'm fairly confident that datastructures and optimizations will work, so no major traps there are really expected. The risk that we need to manage now is getting the API design "right". > > On 16 August 2017 at 09:55, Yury Selivanov wrote: >> Context Item Object >> ------------------- >> >> The ``sys.new_context_item(description)`` function creates a >> new ``ContextItem`` object. The ``description`` parameter is a >> ``str``, explaining the nature of the context key for introspection >> and debugging purposes. >> >> ``ContextItem`` objects have the following methods and attributes: >> >> * ``.description``: read-only description; > > It may be worth having separate "name" and "description" attributes, > similar to __name__ and __doc__ being separate on things like > functions. That way, error messages can just show "name", while > debuggers and other introspection tools can include a more detailed > description. 
Initially I wanted to have "sys.new_context_item(name)" signature, but then I thought that some users might be confused what "name" actually means. In some contexts you might say that the "name" of the CI is the name of the variable it is bound to, IOW, for "foo = CI(name="bar")' the name is "foo". But some users might think that it's "bar". OTOH, PEP 550 doesn't have any introspection APIs at this point, and the final version of it will have to have them. If we add something like "sys.get_execution_context_as_dict()", then it would be preferable for CIs to have short name-like descriptions, as opposed to multiline docstrings. So in the end, I think that we should adopt a namedtuple solution, and just make the first "ContextItem" parameter a positional-only "name": ContextItem(name: str, /) > >> Coroutine Object Modifications >> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ >> >> To achieve this, a small set of modifications to the coroutine object >> is needed: >> >> * New ``cr_local_context`` attribute. This attribute is readable >> and writable for Python code. > > For ease of introspection, it's probably worth using a common > `__local_context__` attribute name across all the different types that > support one, and encouraging other object implementations to do the > same. > > This isn't like cr_await and gi_yieldfrom, where we wanted to use > different names because they refer to different kinds of objects. We also have cr_code and gi_code, which are used for introspection purposes but refer to CodeObject. I myself don't like the mess the C-style convention created for our Python code (think of what the "dis" and "inspect" modules have to go through), so I'm +0 for having "__local_context__". > >> Acknowledgments >> =============== > [snip] > >> Thanks to Nick Coghlan for numerous suggestions and ideas on the >> mailing list, and for coming up with a case that cause the complete >> rewrite of the initial PEP version [19]_. > [snip] > >> .. [19] https://mail.python.org/pipermail/python-ideas/2017-August/046780.html > > The threading in pipermail makes it difficult to get from your reply > back to my original comment, so it may be better to link directly to > the latter: https://mail.python.org/pipermail/python-ideas/2017-August/046775.html > > And to be completely explicit about: I like your proposed approach of > leaving it up to iterator developers to decide whether or not to run > with a local context or not. If they don't manipulate any context > items, it won't matter, and if they do, it's straightforward to add a > suitable call to sys.run_in_local_context(). Fixed the link, and will update the Acknowledgments section with your paragraph (thanks!) Yury From yselivanov.ml at gmail.com Wed Aug 16 11:43:24 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Wed, 16 Aug 2017 11:43:24 -0400 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: <20170816142553.GA2837@bytereef.org> Message-ID: On Wed, Aug 16, 2017 at 11:03 AM, Nick Coghlan wrote: > On 17 August 2017 at 00:25, Stefan Krah wrote: >> Perhaps it would be possible to name the data structures by their functionality. >> E.g. if ExecutionContext is a stack, use ExecutionStack? >> >> Or if the dynamic scope angle should be highlighted, perhaps ExecutionScope >> or even DynamicScope. >> >> This sounds like bikeshedding, but I find it difficult to have ExecutionContext, >> ContextItem, LocalContext in addition to the actual decimal.localcontext() >> and PyDecContext. >> >> For example, should PyDecContext inherit from ContextItem? 
I don't fully >> understand. :-/ > > Agreed, I don't think we have the terminology quite right yet. > > For "ContextItem" for example, we may actually be better off calling > it "ContextKey", and have the methods be "ck.get_value()" and > "ck.set_value()". That would get us closer to the POSIX TSS > terminology, and emphasises that the objects themselves are best seen > as opaque references to a key that lets you get and set the > corresponding value in the active execution context. With the confusion of what "empty ExecutionContext" and "ContextItem is set to None by default", I tend to agree that "ContextKey" might be a better name. A default for "ContextKey" means something that will be returned if the lookup failed, plain and simple. > > I do think we should stick with "context" rather than bringing dynamic > scopes into the mix - while dynamic scoping *is* an accurate term for > what we're doing at a computer science level, Python itself tends to > reserve the term scoping for the way the compiler resolves names, > which we're deliberately *not* touching here. +1, I feel the same about this. > > Avoiding a naming collision with decimal.localcontext() would also be desirable. The ContextItem (or ContextKey) that decimal will be using will be an implementation detail, and it must not be exposed to the public API of the module. > > Yury, what do you think about moving the ExecutionContext name to what > the PEP currently calls LocalContext, and renaming the current > ExecutionContext type to ExecutionContextChain? While I think that the naming issue is important, the API that will be used most of the time is ContextItem. That's the name in the spotlight. > > The latter name then hints at the collections.ChainMap style behaviour > of ck.get_value() lookups, without making any particular claims about > what the internal implementation data structures actually are. > > The run methods could then be sys.run_with_context_chain() (to ignore > the current context entirely and use a completely separate context > chain) and sys.run_with_active_context() (to append a single execution > context onto the end of the current context chain) sys.run_with_context_chain and sys.run_with_active_context sound *really* confusing to me. Maybe it's because I spent too much time thinking about the current PEP 550 naming. To be honest, I really like Execution Context and Local Context names. I'm curious if other people are confused with them. Yury From stefan at bytereef.org Wed Aug 16 12:08:56 2017 From: stefan at bytereef.org (Stefan Krah) Date: Wed, 16 Aug 2017 18:08:56 +0200 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: <20170816142553.GA2837@bytereef.org> Message-ID: <20170816160856.GA2672@bytereef.org> On Wed, Aug 16, 2017 at 11:00:43AM -0400, Yury Selivanov wrote: > "Context" is an established term for what PEP 550 tries to accomplish. > It's used in multiple languages and runtimes, and while researching > this topic I didn't see anybody confused with the concept on > StackOverflow/etc. For me a context is a "single thing" that is usually used to thread state through functions. I guess I'd call "environment" what you call "context". > In C: > > PyContextItem * _current_ctx = PyContext_NewItem("decimal context"); > if (_current_ctx == NULL) { /* error */ } > > # later when you set decimal context > PyDecContextObject *ctx; > ... 
> if (PyContext_SetItem(_current_ctx, (PyObject*)ctx)) { /* error */ } > > # whenever you need to get the current context > PyDecContextObject *ctx = PyContext_GetItem(_current_ctx); > if (ctx == NULL) { /* error */ } > if (ctx == Py_None) { /* not initialized, nothing is there */ } Thanks! This makes it a lot clearer. I'd probably use (stealing Nick's key suggestion): PyEnvKey *_current_context_key = PyEnv_NewKey("___DECIMAL_CONTEXT__"); ... PyDecContextObject *ctx = PyEnv_GetItem(_current_ctx_key); Stefan Krah From stefan at bytereef.org Wed Aug 16 12:12:40 2017 From: stefan at bytereef.org (Stefan Krah) Date: Wed, 16 Aug 2017 18:12:40 +0200 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: <20170816142553.GA2837@bytereef.org> Message-ID: <20170816161240.GB2672@bytereef.org> On Thu, Aug 17, 2017 at 01:03:21AM +1000, Nick Coghlan wrote: > For "ContextItem" for example, we may actually be better off calling > it "ContextKey", and have the methods be "ck.get_value()" and > "ck.set_value()". That would get us closer to the POSIX TSS > terminology, and emphasises that the objects themselves are best seen > as opaque references to a key that lets you get and set the > corresponding value in the active execution context. +1 for "key". One is using a key to look up an item. > Avoiding a naming collision with decimal.localcontext() would also be desirable. > > Yury, what do you think about moving the ExecutionContext name to what > the PEP currently calls LocalContext, and renaming the current > ExecutionContext type to ExecutionContextChain? For me this is already a lot clearer. Otherwise I'd call it ExecutionEnvironment. Stefan Krah From yselivanov.ml at gmail.com Wed Aug 16 12:36:24 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Wed, 16 Aug 2017 12:36:24 -0400 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On Wed, Aug 16, 2017 at 3:18 AM, Nathaniel Smith wrote: > On Tue, Aug 15, 2017 at 4:55 PM, Yury Selivanov wrote: >> Hi, >> >> Here's the PEP 550 version 2. > > Awesome! Thanks! [..] >> >> * **Local Context**, or LC, is a key/value mapping that stores the >> context of a logical thread. > > If you're more familiar with dynamic scoping, then you can think of an > LC as a single dynamic scope... > >> * **Execution Context**, or EC, is an OS-thread-specific dynamic >> stack of Local Contexts. > > ...and an EC as a stack of scopes. Looking up a ContextItem in an EC > proceeds by checking the first LC (innermost scope), then if it > doesn't find what it's looking for it checks the second LC (the > next-innermost scope), etc. Yes. We touched upon this topic in parallel threads, so I'll just briefly mention this here: I deliberately avoided using "scope" in PEP 550 naming, as "scoping" in Python is usually associated with names/globals/locals/nonlocals etc. Adding another "level" of scoping will be very confusing for users (IMO). > >> ``ContextItem`` objects have the following methods and attributes: >> >> * ``.description``: read-only description; >> >> * ``.set(o)`` method: set the value to ``o`` for the context item >> in the execution context. >> >> * ``.get()`` method: return the current EC value for the context item. >> Context items are initialized with ``None`` when created, so >> this method call never fails. > > Two issues here, that both require some expansion of this API to > reveal a *bit* more information about the EC structure. 
> > 1) For trio's cancel scope use case I described in the last, I > actually need some way to read out all the values on the LocalContext > stack. (It would also be helpful if there were some fast way to check > the depth of the ExecutionContext stack -- or at least tell whether > it's 1 deep or more-than-1 deep. I know that any cancel scopes that > are in the bottommost LC will always be attached to the given Task, so > I can set up the scope->task mapping once and re-use it indefinitely. > OTOH for scopes that are stored in higher LCs, I have to check at > every yield whether they're currently in effect. And I want to > minimize the per-yield workload as much as possible.) We can add an API for returning the full stack of values for a CI: ContextItem.iter_stack() -> Iterator # or ContextItem.get_stack() -> List Because some of the LC will be empty, what you'll get is a list with some None values in it, like: [None, val1, None, None, val2] The length of the list will tell you how deep the stack is. > > 2) For classic decimal.localcontext context managers, the idea is > still that you save/restore the value, so that you can nest multiple > context managers without having to push/pop LCs all the time. But the > above API is not actually sufficient to implement a proper > save/restore, for a subtle reason: if you do > > ci.set(ci.get()) > > then you just (potentially) moved the value from a lower LC up to the top LC. > > Here's an example of a case where this can produce user-visible effects: > > https://github.com/njsmith/pep-550-notes/blob/master/dynamic-scope-on-top-of-pep-550-draft-2.py > > There are probably a bunch of options for fixing this. But basically > we need some API that makes it possible to temporarily set a value in > the top LC, and then restore that value to what it was before (either > the previous value, or 'unset' to unshadow a value in a lower LC). One > simple option would be to make the idiom be something like: > > @contextmanager > def local_value(new_value): > state = ci.get_local_state() > ci.set(new_value) > try: > yield > finally: > ci.set_local_state(state) > > where 'state' is something like a tuple (ci in EC[-1], > EC[-1].get(ci)). A downside with this is that it's a bit error-prone > (very easy for an unwary user to accidentally use get/set instead of > get_local_state/set_local_state). But I'm sure we can come up with > something. Yeah, this is tricky. The main issue is indeed the confusion of what methods you need to call -- "get/set" or "get_local_state/set_local_state". On some level the problem is very similar to regular Python scoping rules: 1. we have local hames 2. we have global names 3. we nave 'nonlocal' modifier IOW scoping isn't easy, and you need to be conscious of what you do. It's just that we are so used to these scoping rules that they have a low cognitive effort for us. One of the ideas that I have in mind is to add another level of indirection to separate "global get" from "local set/get": 1. Rename ContextItem to ContextKey (reasoning for that in parallel thread) 2. Remove ContextKey.set() method 3. Add a new ContextKey.value() -> ContextValue ck = ContextKey() with ck.value() as val: val.set(spam) yield or val = ck.value() val.set(spam) try: yield finally: val.clear() Essentially ContextValue will be the only API to set values in execution context. ContextKey.get() will be used to get them. Nathaniel, Nick, what do you guys think? [..] >> * ``sys.get_execution_context()`` function. 
The function returns a >> copy of the current EC: an ``ExecutionContext`` instance. > > If there are enough of these functions then it might make sense to > stick them in their own module instead of adding more stuff to sys. I > guess worrying about that can wait until the API details are more firm > though. I'm OK with this idea -- pystate.c becomes way too crowded. Maybe we should just put this stuff in _contextlib.c and expose in the contextlib module. > >> * If ``coro.cr_local_context`` is an empty ``LocalContext`` object >> that ``coro`` was created with, the interpreter will set >> ``coro.cr_local_context`` to ``None``. > > I like all the ideas in this section, but this specific point feels a > bit weird. Coroutine objects need a second hidden field somewhere to > keep track of whether the object they end up with is the same one they > were created with? Yes, I planned to have a second hidden field, as Coroutines will have their cr_local_context set to NULL, and that will be their empty LC. So a second internal field is needed to disambiguate NULL -- meaning an "empty context" and NULL meaning "use outside local context". I omitted this from the PEP to make it a bit easier to digest, as this seemed to be a low-level implementation detail. > > If I set cr_local_context to something else, and then set it back to > the original value, does that trigger the magic await behavior or not? > What if I take the initial LocalContext off of one coroutine and > attach it to another, does that trigger the magic await behavior? > > Maybe it would make more sense to have two sentinel values: > UNINITIALIZED and INHERIT? All good questions. I don't like sentinels in general, I'd be more OK with a "gi_isolated_local_context" flag (we're back to square one here). But I don't think we should add it. My thinking is that once you start writing to "gi_local_context" -- all bets are off, and you manage this from now on (meaning that some internal coroutine flag will be set to 1, and the interpreter will never touch local_context of this coroutine): 1. If you write None -- it means that the generator/coroutine will not have its own LC. 2. If you write you own LC object -- the generator/coroutine will use it. > >> To enable correct Execution Context propagation into Tasks, the >> asynchronous framework needs to assist the interpreter: >> >> * When ``create_task`` is called, it should capture the current >> execution context with ``sys.get_execution_context()`` and save it >> on the Task object. > > I wonder if it would be useful to have an option to squash this > execution context down into a single LocalContext, since we know we'll > be using it for a while and once we've copied an ExecutionContext it > becomes impossible to tell the difference between one that has lots of > internal LocalContexts and one that doesn't. This could also be handy > for trio/curio's semantics where they initialize a new task's context > to be a shallow copy of the parent task: you could do > > new_task_coro.cr_local_context = get_current_context().squash() I think this would be a bit too low-level. I'd prefer to defer solving the "squashing" problem until I have a reference implementation and we can test this. Essentially, this is an optimization problem--the EC implementation can just squash the chain itself, when the chain is longer than 5 LCs. Or something like this. But exposing this to Python level would be like letting a program to tinker GCC -O flags after it's compiled IMO. [..] 
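In the toy list-of-dicts model, the internal squashing described above
is just a fold of the chain into a single mapping (sketch only; the
point being made here is that this stays a hidden optimization rather
than a Python-level API):

    def squash(stack):
        merged = {}
        for mapping in stack:     # outermost first, so inner LCs win
            merged.update(mapping)
        return [merged]           # one LC answering every lookup the
                                  # original chain could answer

    # e.g. the EC could transparently do:
    #     if len(stack) > 5:
    #         stack = squash(stack)

One observable difference is that stack depth is lost -- the very
information the trio cancel-scope discussion earlier in the thread
wanted to inspect -- which is another argument for keeping squashing
internal to the EC implementation.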
>> Contrary to coroutines though, ``yield from o`` expression in >> generators (that are not generator-based coroutines) is semantically >> equivalent to ``for v in o: yield v``, therefore the interpreter does >> not attempt to control their ``gi_local_context``. > > Hmm. I assume you're simplifying for expository purposes, but 'yield > from' isn't the same as 'for v in o: yield v'. In fact PEP 380 says: > "Motivation: [...] a piece of code containing a yield cannot be > factored out and put into a separate function in the same way as other > code. [...] If yielding of values is the only concern, this can be > performed without much difficulty using a loop such as 'for v in g: > yield v'. However, if the subgenerator is to interact properly with > the caller in the case of calls to send(), throw() and close(), things > become considerably more difficult. As will be seen later, the > necessary code is very complicated, and it is tricky to handle all the > corner cases correctly." > > So it seems to me that the whole idea of 'yield from' is that it's > supposed to handle all the tricky bits needed to guarantee that if you > take some code out of a generator and refactor it into a subgenerator, > then everything works the same as before. This suggests that 'yield > from' should do the same magic as 'await', where by default the > subgenerator shares the same LocalContext as the parent generator. > (And as a bonus it makes things simpler if 'yield from' and 'await' > work the same.) I see what you are saying here, but 'yield from' for generators is still different from awaits, as you can partially iterate the generator and *then* "yield from" from it: def foo(): g = gen() val1 = next(g) val2 = next(g) # do some computation? yield from g ... def gen(): # messing with EC between yields In general, I still think that 'yield from g' is semantically equivalent to 'for i in g: yield i' for most users. > >> Asynchronous Generators >> ----------------------- >> >> Asynchronous Generators (AG) interact with the Execution Context >> similarly to regular generators. >> >> They have an ``ag_local_context`` attribute, which, similarly to >> regular generators, can be set to ``None`` to make them use the outer >> Local Context. This is used by the new >> ``contextlib.asynccontextmanager`` decorator. >> >> The EC support of ``await`` expression is implemented using the same >> approach as in coroutines, see the `Coroutine Object Modifications`_ >> section. > > You showed how to make an iterator that acts like a generator. Is it > also possible to make an async iterator that acts like an async > generator? It's not immediately obvious, because you need to make sure > that the local context gets restored each time you re-enter the > __anext__ generator. I think it's something like: > > class AIter: > def __init__(self): > self._local_context = ... > > # Note: intentionally not async > def __anext__(self): > coro = self._real_anext() > coro.cr_local_context = self._local_context > return coro > > async def _real_anext(self): > ... > > Does that look right? Yes, seems to be correct. > >> ContextItem.get() Cache >> ----------------------- >> >> We can add three new fields to ``PyThreadState`` and >> ``PyInterpreterState`` structs: >> >> * ``uint64_t PyThreadState->unique_id``: a globally unique >> thread state identifier (we can add a counter to >> ``PyInterpreterState`` and increment it when a new thread state is >> created.) 
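Stepping back for a moment: the interleaving bug that this generator
machinery guards against is reproducible on current Python with
decimal alone -- essentially the PEP's motivating decimal problem in
runnable form:

    import decimal

    def fractions(precision, x, y):
        with decimal.localcontext() as ctx:
            ctx.prec = precision
            yield decimal.Decimal(x) / decimal.Decimal(y)
            yield decimal.Decimal(x) / decimal.Decimal(y ** 2)

    g1 = fractions(6, 1, 3)    # wants 6 digits of precision
    g2 = fractions(50, 1, 3)   # wants 50 digits

    print(next(g1))   # 6 digits, as requested
    print(next(g2))   # 50 digits, as requested
    print(next(g1))   # 50 digits! g1 resumed under g2's context,
                      # because the thread-local decimal context
                      # leaked across the suspended yield

With PEP 550 semantics, each generator's context manipulations stay in
its own local context, and the third print shows 6 digits again.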
>> >> * ``uint64_t PyInterpreterState->context_item_deallocs``: every time >> a ``ContextItem`` is GCed, all Execution Contexts in all threads >> will lose track of it. ``context_item_deallocs`` will simply >> count all ``ContextItem`` deallocations. >> >> * ``uint64_t PyThreadState->execution_context_ver``: every time >> a new item is set, or an existing item is updated, or the stack >> of execution contexts is changed in the thread, we increment this >> counter. > > I think this can be refined further (and I don't understand > context_item_deallocs -- maybe it's a mistake?). Now that you highlighted the deallocs counter and I thought about it a bit more I don't think it's needed :) I'll remove it. > AFAICT the things > that invalidate a ContextItem's cache are: > > 1) switching threadstates > 2) popping or pushing a non-empty LocalContext off the current > threadstate's ExecutionContext > 3) calling ContextItem.set() on *that* context item > > So I'd suggest tracking the thread state id, a counter of how many > non-empty LocalContexts have been pushed/popped on this thread state, > and a *per ContextItem* counter of how many times set() has been > called. Excellent idea, will be in the next version of the PEP. > >> Backwards Compatibility >> ======================= >> >> This proposal preserves 100% backwards compatibility. > > While this is mostly true in the strict sense, in practice this PEP is > useless if existing thread-local users like decimal and numpy can't > migrate to it without breaking backcompat. So maybe this section > should discuss that? The main purpose of this section is to tell if some parts of the PEP are breaking some existing code/patterns or if it imposes a significant performance penalty. PEP 550 does neither of these things. If decimal/numpy simply switch to using new APIs, everything should work as expected for them, with the exception that assigning a new decimal context (without a context manager) will be isolated in generators. Which I'd consider as a bug fix. We can add a new section to discuss the specifics. Yury From yselivanov.ml at gmail.com Wed Aug 16 12:40:26 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Wed, 16 Aug 2017 12:40:26 -0400 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: <20170816160856.GA2672@bytereef.org> References: <20170816142553.GA2837@bytereef.org> <20170816160856.GA2672@bytereef.org> Message-ID: On Wed, Aug 16, 2017 at 12:08 PM, Stefan Krah wrote: > On Wed, Aug 16, 2017 at 11:00:43AM -0400, Yury Selivanov wrote: >> "Context" is an established term for what PEP 550 tries to accomplish. >> It's used in multiple languages and runtimes, and while researching >> this topic I didn't see anybody confused with the concept on >> StackOverflow/etc. > > For me a context is a "single thing" that is usually used to thread state > through functions. > > I guess I'd call "environment" what you call "context". "environment" is also an overloaded term, and when I hear it I usually think about os.getenv(). Yury From yselivanov.ml at gmail.com Wed Aug 16 12:51:03 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Wed, 16 Aug 2017 12:51:03 -0400 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On Wed, Aug 16, 2017 at 5:36 AM, Nick Coghlan wrote: > On 16 August 2017 at 17:18, Nathaniel Smith wrote: > [Yury wrote] [..] >>> * If ``coro.cr_local_context`` is an empty ``LocalContext`` object >>> that ``coro`` was created with, the interpreter will set >>> ``coro.cr_local_context`` to ``None``. 
>>
>> I like all the ideas in this section, but this specific point feels a
>> bit weird. Coroutine objects need a second hidden field somewhere to
>> keep track of whether the object they end up with is the same one they
>> were created with?
>
> It feels odd to me as well, and I'm wondering if we can actually
> simplify this by saying:
>
> 1. Generator contexts (both sync and async) are isolated by default
>    (__local_context__ = LocalContext())
> 2. Coroutine contexts are *not* isolated by default
>    (__local_context__ = None)
>
> Running top level task coroutines in separate execution contexts then
> becomes the responsibility of the event loop, which the PEP already
> lists as a required change in 3rd party libraries to get this all to
> work properly.

This is an interesting twist, and I like it.

This will change asyncio.Task from:

    class Task:

        def __init__(self, coro):
            ...
            self.exec_context = sys.get_execution_context()

        def step(self):
            sys.run_with_execution_context(self.exec_context,
                                           self.coro.send)

to:

    class Task:

        def __init__(self, coro):
            ...
            self.local_context = sys.new_local_context()

        def step(self):
            sys.run_with_local_context(self.local_context,
                                       self.coro.send)

And we don't need ceval to do anything for "await", which means that
with this approach we won't touch ceval.c at all.

Yury

From yselivanov.ml at gmail.com  Wed Aug 16 12:55:55 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Wed, 16 Aug 2017 12:55:55 -0400
Subject: [Python-ideas] PEP 550 v2
In-Reply-To:
References:
Message-ID:

On Wed, Aug 16, 2017 at 12:51 PM, Yury Selivanov wrote:
> On Wed, Aug 16, 2017 at 5:36 AM, Nick Coghlan wrote:
>> On 16 August 2017 at 17:18, Nathaniel Smith wrote:
>> [Yury wrote]
> [..]
>>>> * If ``coro.cr_local_context`` is an empty ``LocalContext`` object
>>>>   that ``coro`` was created with, the interpreter will set
>>>>   ``coro.cr_local_context`` to ``None``.
>>>
>>> I like all the ideas in this section, but this specific point feels a
>>> bit weird. Coroutine objects need a second hidden field somewhere to
>>> keep track of whether the object they end up with is the same one they
>>> were created with?
>>
>> It feels odd to me as well, and I'm wondering if we can actually
>> simplify this by saying:
>>
>> 1. Generator contexts (both sync and async) are isolated by default
>>    (__local_context__ = LocalContext())
>> 2. Coroutine contexts are *not* isolated by default
>>    (__local_context__ = None)
>>
>> Running top level task coroutines in separate execution contexts then
>> becomes the responsibility of the event loop, which the PEP already
>> lists as a required change in 3rd party libraries to get this all to
>> work properly.
>
> This is an interesting twist, and I like it.
>
> This will change asyncio.Task from:
>
>     class Task:
>
>         def __init__(self, coro):
>             ...
>             self.exec_context = sys.get_execution_context()
>
>         def step(self):
>             sys.run_with_execution_context(self.exec_context,
>                                            self.coro.send)
>
> to:
>
>     class Task:
>
>         def __init__(self, coro):
>             ...
>             self.local_context = sys.new_local_context()
>
>         def step(self):
>             sys.run_with_local_context(self.local_context,
>                                        self.coro.send)
>
> And we don't need ceval to do anything for "await", which means that
> with this approach we won't touch ceval.c at all.

And immediately after I hit "send" I realized that this is a bit more
complicated.

In order for Tasks to remember the full execution context of where
they were created, we need a new method that would allow to run with
*both* exec and local contexts:

    class Task:

        def __init__(self, coro):
            ...
            self.local_context = sys.new_local_context()
            self.exec_context = sys.get_execution_context()

        def step(self):
            sys.run_with_contexts(self.exec_context, self.local_context,
                                  self.coro.send)

This is needed for the following PEP example to work properly:

    current_request = sys.new_context_item(description='request')

    async def child():
        print('current request:', repr(current_request.get()))

    async def handle_request(request):
        current_request.set(request)
        event_loop.create_task(child())

    run(top_coro())

See https://www.python.org/dev/peps/pep-0550/#tasks

Yury

From stefan at bytereef.org  Wed Aug 16 13:13:04 2017
From: stefan at bytereef.org (Stefan Krah)
Date: Wed, 16 Aug 2017 19:13:04 +0200
Subject: [Python-ideas] PEP 550 v2
In-Reply-To:
References: <20170816142553.GA2837@bytereef.org>
 <20170816160856.GA2672@bytereef.org>
Message-ID: <20170816171304.GA3261@bytereef.org>

On Wed, Aug 16, 2017 at 12:40:26PM -0400, Yury Selivanov wrote:
> On Wed, Aug 16, 2017 at 12:08 PM, Stefan Krah wrote:
> > On Wed, Aug 16, 2017 at 11:00:43AM -0400, Yury Selivanov wrote:
> >> "Context" is an established term for what PEP 550 tries to accomplish.
> >> It's used in multiple languages and runtimes, and while researching
> >> this topic I didn't see anybody confused with the concept on
> >> StackOverflow/etc.
> >
> > For me a context is a "single thing" that is usually used to thread state
> > through functions.
> >
> > I guess I'd call "environment" what you call "context".
>
> "environment" is also an overloaded term, and when I hear it I usually
> think about os.getenv().

Yeah, I usually think about symbol tables.  FWIW, I find this terminology
quite reasonable:

    https://hackernoon.com/execution-context-in-javascript-319dd72e8e2c

The main points are ExecutionContextStack/FunctionalExecutionContext

vs. ExecutionContext/LocalContext.

Stefan Krah

From yselivanov.ml at gmail.com  Wed Aug 16 14:38:12 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Wed, 16 Aug 2017 14:38:12 -0400
Subject: [Python-ideas] PEP 550 v2
In-Reply-To: <20170816171304.GA3261@bytereef.org>
References: <20170816142553.GA2837@bytereef.org>
 <20170816160856.GA2672@bytereef.org>
 <20170816171304.GA3261@bytereef.org>
Message-ID:

On Wed, Aug 16, 2017 at 1:13 PM, Stefan Krah wrote:
> On Wed, Aug 16, 2017 at 12:40:26PM -0400, Yury Selivanov wrote:
>> "environment" is also an overloaded term, and when I hear it I usually
>> think about os.getenv().
>
> Yeah, I usually think about symbol tables.  FWIW, I find this terminology
> quite reasonable:
>
>     https://hackernoon.com/execution-context-in-javascript-319dd72e8e2c

Thanks for the link!  I think it actually explains the JS language
spec wrt how scoping of regular variables is implemented.

> The main points are ExecutionContextStack/FunctionalExecutionContext
>
> vs. ExecutionContext/LocalContext.

While I'm trying to avoid using scoping terminology for PEP 550,
there's one parallel -- as with regular Python scoping you have global
variables and you have local variables.

You can use locals() to access your local scope, and you can use
globals() to access your global scope.  Similarly in PEP 550, you have
your LocalContext and ExecutionContext.
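
To make the parallel concrete, here is a toy sketch using the PEP's
proposed API (untested, and assuming the default generator isolation
discussed earlier in this thread):

    example = sys.new_context_item(description='example')

    def gen():
        example.set('inner')            # goes to the generator's LC
        yield
        assert example.get() == 'inner'

    example.set('outer')                # goes to the thread's LC
    g = gen()
    next(g)
    assert example.get() == 'outer'     # the generator's write didn't leak
    next(g, None)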
We don't want to call ExecutionContext a "Global Context" because it
is fundamentally OS-thread-specific (contrary to Python globals).
LocalContexts are created for threads, generators, coroutines and are
really similar to local scoping.  Adding more names for local contexts
like CoroutineLocalContext or GeneratorLocalContext won't solve
anything either.  All in all, Local Context is what its name stands
for -- it's a local context for your current logical scope, be it a
coroutine or a generator.

At this point PEP 550 is very different from ExecutionContext in .NET,
but there are still many similarities.  That's a +1 to keep its
current name.

ExecutionContextStack and ExecutionContextChain reflect the
implementation of PEP 550 on some level, but for most Python users
they won't mean anything.  If they want to learn how the EC works,
they just need to read the PEP (or the documentation).  Otherwise they
will just use the ContextKey API and it should just work for them.

So IMO, ExecutionContext and LocalContext are really the best names of
all that were proposed so far.

Yury

From antoine at python.org  Wed Aug 16 16:12:22 2017
From: antoine at python.org (Antoine Pitrou)
Date: Wed, 16 Aug 2017 22:12:22 +0200
Subject: [Python-ideas] PEP 550 v2
In-Reply-To:
References:
Message-ID: <2b104dac-8cce-9616-1876-c386e743a5ed@python.org>

Hi,

> * ``sys.get_execution_context()`` function.  The function returns a
>   copy of the current EC: an ``ExecutionContext`` instance.

Can you explain the requirement for it being a copy?  What do you call
a copy exactly?  Does it shallow-copy the stack or does it deep copy
the context items?

> * ``uint64_t PyThreadState->unique_id``: a globally unique
>   thread state identifier (we can add a counter to
>   ``PyInterpreterState`` and increment it when a new thread state is
>   created.)

How does this interact with sub-interpreters? (same question for rest
of the PEP :-))

> * O(N) for ``sys.get_execution_context()``, where ``N`` is the
>   total number of items in the current **execution** context.

Right... but if this is a simple list copy, we are talking about an
extremely fast O(N):

>>> l = [None] * 1000
>>> %timeit l.copy()
3.76 µs ± 17.5 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

(what is "number of items"? number of local contexts? number of
individual context items?)

> We believe that approach #3 enables an efficient and complete
> Execution Context implementation, with excellent runtime performance.

What about the maintenance and debugging cost, though?

> Immutable mappings implemented with HAMT have O(log32 N) performance
> for both set(), get(), and merge() operations, which is essentially
> O(1) for relatively small mappings

But, for relatively small mappings, regular dicts would also be fast
enough, right?

It would be helpful for the PEP to estimate reasonable parameter sizes:
- reasonable number of context items in a local context
- reasonable number of local contexts in an execution stack

Regards

Antoine.
From yselivanov.ml at gmail.com  Wed Aug 16 17:07:41 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Wed, 16 Aug 2017 17:07:41 -0400
Subject: [Python-ideas] PEP 550 v2
In-Reply-To: <2b104dac-8cce-9616-1876-c386e743a5ed@python.org>
References: <2b104dac-8cce-9616-1876-c386e743a5ed@python.org>
Message-ID:

On Wed, Aug 16, 2017 at 4:12 PM, Antoine Pitrou wrote:
>
> Hi,
>
>> * ``sys.get_execution_context()`` function.  The function returns a
>>   copy of the current EC: an ``ExecutionContext`` instance.
>
> Can you explain the requirement for it being a copy?

When the execution context is used to schedule a function call in a
thread, or an asyncio callback in a Future, we want to take a snapshot
of all items in the EC.  In general the recommendation will be to
store immutable data in the context (same as in the .NET EC
implementation, or whenever you have some potentially shared state).

> What do you call a copy exactly?  Does it shallow-copy the stack or
> does it deep copy the context items?

Execution Context is conceptually a stack of Local Contexts.  Each
local context is a weak key mapping.  We need a shallow copy of the
EC, which is semantically equivalent to the below snippet:

    new_lc = {}
    for lc in execution_context:
        new_lc.update(lc)
    return ExecutionContext(new_lc)

>
>> * ``uint64_t PyThreadState->unique_id``: a globally unique
>>   thread state identifier (we can add a counter to
>>   ``PyInterpreterState`` and increment it when a new thread state is
>>   created.)
>
> How does this interact with sub-interpreters? (same question for rest
> of the PEP :-))

As long as PyThreadState_Get() works with sub-interpreters, all of the
PEP machinery will work too.

>
>> * O(N) for ``sys.get_execution_context()``, where ``N`` is the
>>   total number of items in the current **execution** context.
>
> Right... but if this is a simple list copy, we are talking about an
> extremely fast O(N):
>
>>>> l = [None] * 1000
>>>> %timeit l.copy()
> 3.76 µs ± 17.5 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
>
> (what is "number of items"? number of local contexts? number of
> individual context items?)

"Number of items in the current **execution** context" =

    sum(len(local_context) for local_context in current_execution_context)

Yes, even though making a new list + merging all LCs is a relatively
fast operation, it will need to be performed on *every*
asyncio.call_soon and create_task.  The immutable stack/mappings
solution simply eliminates the problem because you can just copy by
reference, which is fast.

The #3 approach is implementable with regular dicts + copy() too, it
will just be slower in some cases (explained below).

>
>> We believe that approach #3 enables an efficient and complete
>> Execution Context implementation, with excellent runtime performance.
>
> What about the maintenance and debugging cost, though?

Contrary to Python dicts, the implementation scope for a HAMT mapping
is much smaller -- we only need get, set, and merge operations.  No
split dicts, no ordering, etc.  With the help of fuzz-testing and our
ref-counting test mode I hope that we'll be able to catch most of the
bugs.

Any solution adds to the total debugging and maintenance cost, but I
believe that in this specific case, the benefits outweigh that cost:

1. Sometimes we'll need to merge many dicts in places like
   asyncio.call_soon or async Task objects (sketched below).

2. A "set" operation might resize the dict, making it slower.

3. The "dict.copy()" optimization that the PEP mentions won't always
   be able to help us, as we will likely need to resize the dict
   often.
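
To illustrate the cost in (1), here is roughly what capturing the EC
means if LCs are plain dicts (an illustrative sketch only, with
hypothetical names):

    def capture_execution_context(lc_stack):
        # Every call_soon/create_task pays O(total number of items)
        # to merge the whole chain of LCs into one mapping.
        merged = {}
        for lc in lc_stack:
            merged.update(lc)
        return merged

With an immutable HAMT-based mapping, capturing the EC is just taking
a reference to the current (immutable) stack, and the individual set()
and merge() operations cost O(log32 N) when they do happen.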
The "dict.copy()" optimization that the PEP mentions won't be able to always help us, as we will likely need to often resize the dict. > >> Immutable mappings implemented with HAMT have O(log32N) performance > for both set(), get(), and merge() operations, which is essentially O(1) > for relatively small mappings > > But, for relatively small mappings, regular dicts would also be fast > enough, right? If all mappings are relatively small than the answer is close to "yes". We might want to periodically "squash" (or merge or compact) the chain of Local Contexts, in which case merging dicts will be more expensive than merging hamt. > > It would be helpful for the PEP to estimate reasonable parameter sizes: > - reasonable number of context items in a local context I assume that the number of context items will be relatively low. It's hard for me to imagine having more than a thousand of them. > - reasonable number of local contexts in an execution stack In a simple multi-threaded code we will only have one local context per execution context. Every time you run a generator or an asynchronous task you push a local context to the stack. Generators will have an optimization -- they will push NULL to the stack and it will be a NULL until a generator writes to its local context. It's possible to imagine a degenerative case when a generator recurses in, say, a 'decimal context' with block, which can potentially create a long chain of LCs. Long chains of LCs are not a problem in general -- once the generator is done, it pops its LCs, thus decreasing the stack size. Long chains of LCs might become a problem if, deep into recursion, a generator needs to capture the execution context (say it makes an asyncio.call_soon() call). In which case the solution is simple -- we squash chains that are longer than 5-10-some-predefined-number. In general, though, EC is something that is there and you can't really control it. If you have a thousand decimal libraries in your next YouTube-killer website, you will have large numbers of items in your Execution Context. You will inevitably start experiencing slowdowns of your code that you can't even fix (or maybe even explain). In this case, HAMT is a safer bet -- it's a guarantee that you will always have O(log32) performance for LC-stack-squashing or set operations. This is the strongest argument in favour of HAMT mapping - we implement it and it should work for all use-cases, even the for the unlikely ones. Yury From yselivanov.ml at gmail.com Wed Aug 16 22:15:20 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Wed, 16 Aug 2017 22:15:20 -0400 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On Wed, Aug 16, 2017 at 12:55 PM, Yury Selivanov [..] > And immediately after I hit "send" I realized that this is a bit more > complicated. > > In order for Tasks to remember the full execution context of where > they were created, we need a new method that would allow to run with > *both* exec and local contexts: Never mind, the actual implementation would be as simple as: class Task: def __init__(self, coro): ... coro.cr_local_context = sys.new_local_context() self.exec_context = sys.get_execution_context() def step(): sys.run_with_execution_context(self.exec_contex , self.coro.send) No need for another "run_with_context" function. 
Yury

From arj.python at gmail.com  Wed Aug 16 22:59:55 2017
From: arj.python at gmail.com (Abdur-Rahmaan Janhangeer)
Date: Thu, 17 Aug 2017 06:59:55 +0400
Subject: [Python-ideas] DOM syntax guide
In-Reply-To:
References:
Message-ID:

Thanks all for the links!  Will look at them.  I intend making that
compiler as a fun project ^^

Abdur-Rahmaan Janhangeer,
Mauritius
abdurrahmaanjanhangeer.wordpress.com

On 16 Aug 2017 08:41, "Abdur-Rahmaan Janhangeer" wrote:

> greetings all,
>
> I like Python a lot and would like to use it everywhere ... up to on
> the web (not django type).
>
> For Python js-compiled versions (for makers), can you provide some
> syntax guidelines for dom access?
>
> Abdur-Rahmaan Janhangeer,
> Mauritius
> abdurrahmaanjanhangeer.wordpress.com

From ncoghlan at gmail.com  Thu Aug 17 05:18:50 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 17 Aug 2017 19:18:50 +1000
Subject: [Python-ideas] PEP 550 v2
In-Reply-To:
References:
Message-ID:

On 17 August 2017 at 02:36, Yury Selivanov wrote:
> Yeah, this is tricky.  The main issue is indeed the confusion of what
> methods you need to call -- "get/set" or
> "get_local_state/set_local_state".
>
> On some level the problem is very similar to regular Python scoping rules:
>
> 1. we have local names
> 2. we have global names
> 3. we have the 'nonlocal' modifier
>
> IOW scoping isn't easy, and you need to be conscious of what you do.
> It's just that we are so used to these scoping rules that they have a
> low cognitive effort for us.
>
> One of the ideas that I have in mind is to add another level of
> indirection to separate "global get" from "local set/get":
>
> 1. Rename ContextItem to ContextKey (reasoning for that in a parallel
>    thread)
>
> 2. Remove the ContextKey.set() method
>
> 3. Add a new ContextKey.value() -> ContextValue
>
>     ck = ContextKey()
>
>     with ck.value() as val:
>         val.set(spam)
>         yield
>
> or
>
>     val = ck.value()
>     val.set(spam)
>     try:
>         yield
>     finally:
>         val.clear()
>
> Essentially ContextValue will be the only API to set values in the
> execution context.  ContextKey.get() will be used to get them.
>
> Nathaniel, Nick, what do you guys think?

I think I don't want to have to try to explain to anyone what happens
if I get a context value in my current execution environment and then
send that value reference into a different execution context :)

So I'd prefer my earlier proposal of:

    # Resolve key in current execution environment
    ck.get_value()

    # Assign to key in current execution context
    ck.set_value(value)

    # Assign to key in specific execution context
    sys.run_with_active_context(ec, ck.set_value, value)

One suggestion I do like is Stefan's one of using "ExecutionContext"
to refer to the namespace that ck.set_value() writes to, and then
"ExecutionEnvironment" for the whole chain that ck.get_value() reads.

Similar to "generator" and "package", we'd still end up with "context"
being inherently ambiguous when used without qualification:

- PEP 550 execution context
- exception handling context (for chained exceptions)
- with statement context
- various context objects, like the decimal context

But we wouldn't have two different kinds of context within PEP 550
itself.  Instead, we'd have to start disambiguating the word
environment:

- PEP 550 execution environment
- process environment (i.e.
  os.environ)

The analogy between process environments and execution environments
wouldn't be exact (since the key-value pairs in process environments
are copied eagerly rather than via lazily chained lookups), but once
you account for that, the parallels between an operating system level
process environment tree and a Python level execution environment tree
as proposed in PEP 550 seem like they would be helpful rather than
confusing.

> [..]
>>> * ``sys.get_execution_context()`` function.  The function returns a
>>> copy of the current EC: an ``ExecutionContext`` instance.
>>
>> If there are enough of these functions then it might make sense to
>> stick them in their own module instead of adding more stuff to sys. I
>> guess worrying about that can wait until the API details are more firm
>> though.
>
> I'm OK with this idea -- pystate.c becomes way too crowded.
>
> Maybe we should just put this stuff in _contextlib.c and expose it in
> the contextlib module.

Yeah, I'd be OK with that - if we're going to reuse the word, it makes
sense to reuse the module to expose the related machinery.

That said, if we do go that way *and* we decide to offer a
coroutine-only backport, I see an offer of contextlib2
co-maintainership in your future ;)

>>> * If ``coro.cr_local_context`` is an empty ``LocalContext`` object
>>>   that ``coro`` was created with, the interpreter will set
>>>   ``coro.cr_local_context`` to ``None``.
>>
>> I like all the ideas in this section, but this specific point feels a
>> bit weird. Coroutine objects need a second hidden field somewhere to
>> keep track of whether the object they end up with is the same one they
>> were created with?
>
> Yes, I planned to have a second hidden field, as Coroutines will have
> their cr_local_context set to NULL, and that will be their empty LC.
> So a second internal field is needed to disambiguate between NULL
> meaning an "empty context" and NULL meaning "use the outer local
> context".
>
> I omitted this from the PEP to make it a bit easier to digest, as this
> seemed to be a low-level implementation detail.

Given that the field is writable, I think it makes more sense to just
choose a suitable default, and then rely on other code changing that
default when it's not right.

For generators: set it to an empty context by default, and have
contextlib.contextmanager (and similar wrappers) clear it.

For coroutines: set it to None by default, and have async task
managers give top level coroutines their own private context.

No hidden flags, no magic value adjustments, just different defaults
for coroutines and generators (including async generators).

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From ncoghlan at gmail.com  Thu Aug 17 05:40:45 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 17 Aug 2017 19:40:45 +1000
Subject: [Python-ideas] PEP 550 v2
In-Reply-To:
References: <20170816142553.GA2837@bytereef.org>
 <20170816160856.GA2672@bytereef.org>
 <20170816171304.GA3261@bytereef.org>
Message-ID:

On 17 August 2017 at 04:38, Yury Selivanov wrote:
> On Wed, Aug 16, 2017 at 1:13 PM, Stefan Krah wrote:
> While I'm trying to avoid using scoping terminology for PEP 550,
> there's one parallel -- as with regular Python scoping you have global
> variables and you have local variables.
>
> You can use locals() to access your local scope, and you can use
> globals() to access your global scope.
To be honest, the difference between LocalContext and ExecutionContext
feels more like the difference between locals() and lexical closure
variables than it does the difference between locals() and globals().
It's just that where the scoping rules are a compile time thing
related to lexical closures, PEP 550 is about defining a dynamic
context.

> Similarly in PEP 550, you have your LocalContext and ExecutionContext.
> We don't want to call ExecutionContext a "Global Context" because
> it is fundamentally OS-thread-specific (contrary to Python globals).

In addition to it being different from the way the decimal module
already uses the phrase, one of the reasons I don't want to call it a
LocalContext is because doing so brings in the suggestion that it is
somehow connected to the locals() scope, and it isn't - there are
plenty of things (most notably, function calls) that will change the
active local namespace, but *won't* change the active execution
context.

> LocalContexts are created for threads, generators, coroutines and are
> really similar to local scoping.  Adding more names for local contexts
> like CoroutineLocalContext or GeneratorLocalContext won't solve
> anything either.  All in all, Local Context is what its name stands
> for -- it's a local context for your current logical scope, be it a
> coroutine or a generator.

But unlike locals() itself, it *isn't* linked to a specific frame of
execution - it's deliberately designed to be shared *between* frames.

If you don't like either of the ExecutionContext/ExecutionEnvironment
or ExecutionContext/ExecutionContextChain combinations, how would you
feel about ExecutionContext + DynamicContext?  Saying that
"ck.set_value(value) sets the value corresponding to the given context
key in the currently active execution context" is still my preferred
terminology for setting values, and I think the following would work
well for reading values:

ck.get_value() attempts to look up the value for that key in the
currently active execution context.  If it doesn't find one, it then
tries each of the execution contexts in the currently active dynamic
context.  If it *still* doesn't find one, then it will set the default
value in the outermost execution context and then return that value.

One thing I like about that phrasing is that we'd be using the word
dynamic in exactly the same sense that dynamic scoping uses it, and
the dynamic context mechanism would become PEP 550's counterpart to
the lexical closure support in Python's normal scoping rules.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From ncoghlan at gmail.com  Thu Aug 17 05:46:58 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 17 Aug 2017 19:46:58 +1000
Subject: [Python-ideas] PEP 550 v2
In-Reply-To:
References:
Message-ID:

On 17 August 2017 at 02:55, Yury Selivanov wrote:
> And immediately after I hit "send" I realized that this is a bit more
> complicated.
>
> In order for Tasks to remember the full execution context of where
> they were created, we need a new method that would allow to run with
> *both* exec and local contexts:
>
>     class Task:
>
>         def __init__(self, coro):
>             ...
>             self.local_context = sys.new_local_context()
>             self.exec_context = sys.get_execution_context()
>
>         def step(self):
>             sys.run_with_contexts(self.exec_context, self.local_context,
>                                   self.coro.send)

I don't think that's entirely true, since you can nest the calls even
without a combined API:

    sys.run_with_execution_context(self.exec_context,
        sys.run_with_local_context, self.local_context, self.coro.send)

Offering a combined API may still make sense for usability and
efficiency reasons, but it isn't strictly necessary.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From ncoghlan at gmail.com  Fri Aug 18 01:09:51 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 18 Aug 2017 15:09:51 +1000
Subject: [Python-ideas] PEP 550 v2
In-Reply-To:
References:
Message-ID:

On 17 August 2017 at 01:22, Yury Selivanov wrote:
> On Wed, Aug 16, 2017 at 4:07 AM, Nick Coghlan wrote:
>>> Coroutine Object Modifications
>>> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
>>>
>>> To achieve this, a small set of modifications to the coroutine object
>>> is needed:
>>>
>>> * New ``cr_local_context`` attribute.  This attribute is readable
>>>   and writable for Python code.
>>
>> For ease of introspection, it's probably worth using a common
>> `__local_context__` attribute name across all the different types that
>> support one, and encouraging other object implementations to do the
>> same.
>>
>> This isn't like cr_await and gi_yieldfrom, where we wanted to use
>> different names because they refer to different kinds of objects.
>
> We also have cr_code and gi_code, which are used for introspection
> purposes but refer to CodeObject.

Right, hence https://bugs.python.org/issue31230 :)

(That suggestion is prompted by the fact that if we'd migrated gi_code
to __code__ in 3.0, the same way we migrated func_code, then cr_code
and ag_code would almost certainly have followed the same
dunder-naming convention, and
https://github.com/python/cpython/pull/3077 would never have been
necessary)

> I myself don't like the mess the C-style convention created for our
> Python code (think of what the "dis" and "inspect" modules have to go
> through), so I'm +0 for having "__local_context__".

I'm starting to think this should be __private_context__ (to convey
the *intent* of the attribute), rather than naming it after the type
that it's expected to store.

Thinking about this particular attribute name did prompt the question
of how we want PEP 550 to interact with the exec builtin, though, as
well as raising some questions around a number of other code execution
cases:

1. What is the execution context for top level code in a module?
2. What is the execution context for the import machinery in an import
   statement?
3. What is the execution context for the import machinery when invoked
   via importlib?
4. What is the execution context for the import machinery when invoked
   via the C API?
5. What is the execution context for the import machinery when invoked
   via the runpy module?
6. What is the execution context for things like the timeit module,
   templating engines, etc?
7. What is the execution context for codecs and codec error handlers?
8. What is the execution context for __del__ methods and weakref
   callbacks?
9. What is the execution context for trace hooks and other really low
   level machinery?
10. What is the execution context for displayhook and excepthook?
I think a number of those (top level module code executed via the
import system, the timeit module, templating engines) can be addressed
by saying that the exec builtin always creates a completely fresh
execution context by default (with no access to the parent's execution
context), and will gain a new keyword-only parameter that allows you
to specify an execution context to use.  That way, exec'ed code will
be independent by default, but users of exec() will be able to opt in
to handling it like a normal function call by passing in the current
context.

The default REPL, the code module and the IDLE shell window would need
to be updated so that they use a shared context for evaluating the
user supplied code snippets, while keeping their own context separate.

While top-level code would always run in a completely fresh context
for imports, the runpy module would expose the same setting as the
exec builtin, so the executed code would be isolated by default, but
you could opt in to using a particular execution context if you wanted
to.

Codecs and codec error handlers I think will be best handled in a way
similar to generators, where they have their own private context (so
they can't alter the caller's context), but can *read* the caller's
context (so the context can be used as a way of providing
context-dependent codec settings).

That "read-only" access model also feels like the right option for the
import machinery - regardless of whether it's accessed via the import
statement, importlib, the C API, or the runpy module, the import
machinery should be able to *read* the dynamic context, but not make
persistent changes to it.

Since they can be executed at arbitrary points in the code, it feels
to me that __del__ methods and weakref callbacks should *always* be
executed in a completely pristine execution context, with no access
whatsoever to any thread's dynamic context.

I think we should leave the execution context alone for the really low
level hooks, and simply point out that yes, these have the ability to
do weird things to the execution context, just as they have the power
to do weird things to local variables, so they need to be handled with
care.

For displayhook and excepthook, I don't have a particularly strong
intuition, so my default recommendation would be the read-only access
proposed for generators, codecs, and the import machinery.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From stefan_ml at behnel.de  Fri Aug 18 02:12:40 2017
From: stefan_ml at behnel.de (Stefan Behnel)
Date: Fri, 18 Aug 2017 08:12:40 +0200
Subject: [Python-ideas] PEP 550 v2
In-Reply-To:
References:
Message-ID:

Nathaniel Smith wrote on 16.08.2017 at 09:18:
> On Tue, Aug 15, 2017 at 4:55 PM, Yury Selivanov wrote:
>> Here's the PEP 550 version 2.
> Awesome!

+1

>> Backwards Compatibility
>> =======================
>>
>> This proposal preserves 100% backwards compatibility.
>
> While this is mostly true in the strict sense, in practice this PEP is
> useless if existing thread-local users like decimal and numpy can't
> migrate to it without breaking backcompat. So maybe this section
> should discuss that?
>
> (For example, one constraint on the design is that we can't provide
> only a pure push/pop API, even though that's what would be most
> convenient for context managers like decimal.localcontext or
> numpy.errstate, because we also need to provide some backcompat story
> for legacy functions like decimal.setcontext and numpy.seterr.)
I agree with Nathaniel that many projects that can benefit from this
feature will need to keep supporting older Python versions as well. In
the case of Cython, that's Py2.6+.  We already have the problem that
the asynchronous finalisation of async generators cannot be supported
in older Python versions ("old" as in Py3.5 and before), so we end up
with a language feature that people can use in Py2.6, but not
completely/safely.

I can't say yet how difficult it will be to integrate the new
infrastructure that this PEP proposes into a backwards compatible code
base, but if there's something we can think of now in order to help
projects keep supporting older Python versions in the same code base,
given the constraints of their existing APIs and semantics - that
would be great.

Stefan

From ncoghlan at gmail.com  Fri Aug 18 04:50:11 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 18 Aug 2017 18:50:11 +1000
Subject: [Python-ideas] PEP 550 v2
In-Reply-To:
References:
Message-ID:

On 18 August 2017 at 16:12, Stefan Behnel wrote:
> Nathaniel Smith wrote on 16.08.2017 at 09:18:
>> On Tue, Aug 15, 2017 at 4:55 PM, Yury Selivanov wrote:
>>> Backwards Compatibility
>>> =======================
>>>
>>> This proposal preserves 100% backwards compatibility.
>>
>> While this is mostly true in the strict sense, in practice this PEP is
>> useless if existing thread-local users like decimal and numpy can't
>> migrate to it without breaking backcompat. So maybe this section
>> should discuss that?
>>
>> (For example, one constraint on the design is that we can't provide
>> only a pure push/pop API, even though that's what would be most
>> convenient for context managers like decimal.localcontext or
>> numpy.errstate, because we also need to provide some backcompat story
>> for legacy functions like decimal.setcontext and numpy.seterr.)
>
> I agree with Nathaniel that many projects that can benefit from this
> feature will need to keep supporting older Python versions as well. In
> the case of Cython, that's Py2.6+.  We already have the problem that
> the asynchronous finalisation of async generators cannot be supported
> in older Python versions ("old" as in Py3.5 and before), so we end up
> with a language feature that people can use in Py2.6, but not
> completely/safely.
>
> I can't say yet how difficult it will be to integrate the new
> infrastructure that this PEP proposes into a backwards compatible code
> base, but if there's something we can think of now in order to help
> projects keep supporting older Python versions in the same code base,
> given the constraints of their existing APIs and semantics - that
> would be great.

One aspect of this that we're considering is to put the Python level
API in contextlib rather than in sys.  That has the pragmatic benefit
that contextlib2 then becomes the natural home for an API backport,
and we should be able to get the full *explicit* API working on older
versions (even if it means introducing an optional C extension module
as a dependency to get that part of the API working fully).

To backport the isolation of generators, we'd likely be able to
provide a decorator that explicitly isolated generators, but it
wouldn't be feasible to backport implicit isolation.

The same would go for the various other proposals for implicit
isolation - when running on older versions, the general principle
would be "if you (or a library/framework you're using) didn't
explicitly isolate the execution context, assume it's not isolated".

Cheers,
Nick.
-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From steve at pearwood.info  Fri Aug 18 08:06:10 2017
From: steve at pearwood.info (Steven D'Aprano)
Date: Fri, 18 Aug 2017 22:06:10 +1000
Subject: [Python-ideas] How do you think about these language extensions?
In-Reply-To:
References:
Message-ID: <20170818120609.GS7395@ando.pearwood.info>

Hello Thautwarm, and welcome!

Sorry for the delay in responding, but this has been a very busy week
for me personally, and an even busier week for my inbox, and so I
missed your post until now.

On Sun, Aug 13, 2017 at 12:49:45PM +0000, Thautwarm wrote:
>
> Hi all,
>
> I've just finished a language extension for CPython 3.6.x to support
> some additional grammars like Pattern Matching. And it's compatible
> with CPython.

It is really good to see some actual practical experiments for these
features, rather than just talking about them. Thank you!

[...]
> # where syntax
>
> from math import pi
> r = 1 # the radius
> h = 10 # the height
> S = (2*S_top + S_side) where:
>     S_top = pi*r**2
>     S_side = C * h where:
>         C = 2*pi*r

This has been suggested a few times. The first time, I disliked it,
but I've come around to seeing its value. I like it.

I wonder: could we make the "where" clause delay evaluation until the
entire block was compiled, so that we could write something like this:

    S = (2*S_top + S_side) where:
        S_top = pi*r**2
        S_side = C * h  # C is defined further on
        C = 2*pi*r

That's more how "where" is used mathematically.

> # lambda&curry :
>
> lambda x: lambda y: lambda z: ret where:
>     ret = x+y
>     ret -= z
> .x -> .y -> .z -> ret where:
>     ret = x+y
>     ret -= z
> as-with x def as y def as z def ret where:
>     ret = x+y
>     ret -= z

I'm afraid I can't make heads or tails of that. Apart from guessing
that it creates a function, I have no idea what it would do.

> # arrow transform (to avoid endless parentheses and try to be more
> # readable.)
>
> >> range(5) -> map(.x->x+2, _) -> list(_)
> >> [2,3,4,5,6]

I like the idea of chained function calls, like pipes in shell
languages such as bash. I've written a proof-of-concept for that:

http://code.activestate.com/recipes/580625-collection-pipeline-in-python/

I prefer | to -> but that's just a personal preference.

I don't like the use of _ in there. Underscore already has a number of
special meanings, such as:

- a convention for "don't care"

- in the interactive interpreter, the last value calculated

- used for internationalisation

I don't think that giving _ yet another special meaning, and this one
built in to the language, is a good idea.

> # pattern matching use "condic" as keyword is for avoiding the
> # conflictions against the standard libraries and packages from third
> # party. "switch" and "match" both lead to conflictions.

This is a hard problem to deal with, but "condic" sounds awful. What is
it supposed to mean? Short for "condition"?

> condic+(type) 1:
>     case a:int => assert a == 1 and type(a) == 1
>     [>]
>     case 0 => assert 1 > 0
>     [is not]
>     case 1 => assert 1 is not 1
>     otherwise => print("nothing")
>
> condic+() [1,2,3]:
>     case (a,*b)->b:list => sum(b)
>     +[]
>     case [] => print('empty list')
>     +[==]
>     case (a,b):(1,2) => print("the list is [1,2]")

I don't know how to read those.

[...]
> Here is an example to use flowpython, which gives the permutations of
> a sequence.
>
> from copy import deepcopy
> permutations = .seq -> seq_seq where:
>     condic+[] seq:
>         case (a, )  => seq_seq = [a,]
>         case (a, b) => seq_seq = [[a,b],[b,a]]
>         case (a,*b) =>
>             seq_seq = permutations(b) -> map(.x -> insertAll(x, a), _) -> sum(_, []) where:
>                 insertAll = . x, a -> ret where:
>                     ret = [ deepcopy(x) -> _.insert(i, a) or _ for i in (len(x) -> range(_+1)) ]

I find that almost unreadable. Too many new features all at once, it's
like trying to read a completely unfamiliar language. How would you
translate that into regular Python?

Thanks for your experiments!

-- 
Steve

From python at mrabarnett.plus.com  Fri Aug 18 09:16:14 2017
From: python at mrabarnett.plus.com (MRAB)
Date: Fri, 18 Aug 2017 14:16:14 +0100
Subject: [Python-ideas] How do you think about these language extensions?
In-Reply-To: <20170818120609.GS7395@ando.pearwood.info>
References: <20170818120609.GS7395@ando.pearwood.info>
Message-ID: <55b24282-ca1a-92bf-d7ad-d3e4256bc606@mrabarnett.plus.com>

On 2017-08-18 13:06, Steven D'Aprano wrote:
> Hello Thautwarm, and welcome!
[snip]

>> # pattern matching use "condic" as keyword is for avoiding the
>> # conflictions against the standard libraries and packages from third
>> # party. "switch" and "match" both lead to conflictions.
>
> This is a hard problem to deal with, but "condic" sounds awful. What is
> it supposed to mean? Short for "condition"?
>
FWIW, Lisp has COND.

[snip]

From rosuav at gmail.com  Fri Aug 18 11:06:17 2017
From: rosuav at gmail.com (Chris Angelico)
Date: Sat, 19 Aug 2017 01:06:17 +1000
Subject: [Python-ideas] How do you think about these language extensions?
In-Reply-To: <20170818120609.GS7395@ando.pearwood.info>
References: <20170818120609.GS7395@ando.pearwood.info>
Message-ID:

On Fri, Aug 18, 2017 at 10:06 PM, Steven D'Aprano wrote:
>> # arrow transform (to avoid endless parentheses and try to be more
>> # readable.)
>>
>> >> range(5) -> map(.x->x+2, _) -> list(_)
>> >> [2,3,4,5,6]
>
> I like the idea of chained function calls, like pipes in shell
> languages such as bash. I've written a proof-of-concept for that:
>
> http://code.activestate.com/recipes/580625-collection-pipeline-in-python/
>
> I prefer | to -> but that's just a personal preference.
>
> I don't like the use of _ in there. Underscore already has a number of
> special meanings, such as:
>
> - a convention for "don't care"
>
> - in the interactive interpreter, the last value calculated
>
> - used for internationalisation
>
> I don't think that giving _ yet another special meaning, and this one
> built in to the language, is a good idea.

AIUI it's not a new meaning, but another variant of the second of those
examples: it means "the last value calculated". However, I'd prefer to
see it done with something that's otherwise illegal syntax - so unless
the expression is to the right of a "->", you cannot use that symbol in
that way.

I'm on the fence as to whether it'd be better to allow an implicit last
argument (or implicit first argument), so you can say "-> list()"
without the symbol.

ChrisA

From bagrat at aznauryan.org  Fri Aug 18 11:09:12 2017
From: bagrat at aznauryan.org (Bagrat Aznauryan)
Date: Fri, 18 Aug 2017 15:09:12 +0000
Subject: [Python-ideas] More Metadata for Variable Annotations
Message-ID:

# Abstract

Before the holy PEP-526 the only option for type hints was comments.
And before PEP-484 the docstrings were the main place where variable
metadata would go. That variable metadata would include:

* the type
* the human-readable description
* some value constraints (e.g.
  a range for an integer variable)

PEP-526 introduced the awesome syntax sugar, which made the first part
of the metadata -- the type -- easily introspectable at runtime.
However, if you still need to add the description and the value
constraints to the variable metadata, you still need to fall back to
the docstring option.

The idea is to make it possible to include all of the mentioned
metadata in the variable annotations.

# Rationale

Having the type specified using the supported annotation syntax and
the rest of the metadata in the docstrings adds duplication and
complexity for further maintenance. Moreover, if you need the
docstring-contained metadata to be used at runtime, you need to
implement a parser or pick one from the existing ones, which adds
another dependency to your application.

The need for the metadata other than the type might prove to be
common. A typical example is generating the JSON Schema for a class,
e.g. to be used for the OpenAPI definition of your API.

# Possible Solutions

## A wrapper

The proposal is to introduce a new wrapper (probably a function), that
will accept the type as the first positional argument and additional
keyword arguments for metadata. The wrapper will map the keyword
arguments to the type object as attributes and return it. The code
would look like this:

```
foo: wrapper(
    int,
    description="bar",
    minimum=0,
    maximum=100
)
```

Later, the metadata can be accessed as the annotation attributes, like
e.g.:

```
__annotations__['foo'].description
```

## Annotation as a tuple

This solution does not require any code change in Python, but will
force other tools to change their parsing (e.g. mypy). The proposal is
that when the annotation is a tuple instance, the first element is
used as the type of the variable, and the rest is ignored or treated
as additional metadata. This will make it possible to add the metadata
into a separate dictionary as the second element of the annotation
tuple. For example:

```
foo: (
    int,
    {
        "description": "bar",
        "minimum": 0,
        "maximum": 100
    }
)
```

The annotation will be stored as is, so to access the metadata at
runtime, one would need to explicitly access the second item of the
annotation tuple.

# Summary

This option would help to have well annotated code which will be
self-descriptive and provide the ability to generate schemas and other
definitions (e.g. OpenAPI) automatically and without duplication.

The proposed solutions are definitely not perfect and not the main
point of this email. The target is to describe the idea and motivation
and start a discussion.

From yselivanov.ml at gmail.com  Fri Aug 18 11:52:47 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Fri, 18 Aug 2017 11:52:47 -0400
Subject: [Python-ideas] PEP 550 v2
In-Reply-To:
References:
Message-ID:

On Fri, Aug 18, 2017 at 2:12 AM, Stefan Behnel wrote:
> Nathaniel Smith wrote on 16.08.2017 at 09:18:
>> On Tue, Aug 15, 2017 at 4:55 PM, Yury Selivanov wrote:
>>> Here's the PEP 550 version 2.
>> Awesome!
>
> +1
>
>>> Backwards Compatibility
>>> =======================
>>>
>>> This proposal preserves 100% backwards compatibility.
>>
>> While this is mostly true in the strict sense, in practice this PEP is
>> useless if existing thread-local users like decimal and numpy can't
>> migrate to it without breaking backcompat. So maybe this section
>> should discuss that?
>>
>> (For example, one constraint on the design is that we can't provide
>> only a pure push/pop API, even though that's what would be most
>> convenient for context managers like decimal.localcontext or
>> numpy.errstate, because we also need to provide some backcompat story
>> for legacy functions like decimal.setcontext and numpy.seterr.)
>
> I agree with Nathaniel that many projects that can benefit from this
> feature will need to keep supporting older Python versions as well. In
> the case of Cython, that's Py2.6+.  We already have the problem that
> the asynchronous finalisation of async generators cannot be supported
> in older Python versions ("old" as in Py3.5 and before), so we end up
> with a language feature that people can use in Py2.6, but not
> completely/safely.
>
> I can't say yet how difficult it will be to integrate the new
> infrastructure that this PEP proposes into a backwards compatible code
> base, but if there's something we can think of now in order to help
> projects keep supporting older Python versions in the same code base,
> given the constraints of their existing APIs and semantics - that
> would be great.

I think it's Cython's quest to try to backport support of all new
Python 3.x language features to be 2.6-compatible, which sometimes can
be questionable.  You can add support for PEP 550 semantics to code
that was compiled with Cython, but pure Python code won't be able to
support it.  This, in my opinion, could cause more confusion than
benefit, so for Cython I think the solution is to do nothing in this
case.

We'll (maybe) backport some functionality to contextlib2.  In my
opinion, any code that uses contextlib2 in Python should work exactly
the same when it's compiled with Cython.

Yury

From yselivanov.ml at gmail.com  Fri Aug 18 12:17:11 2017
From: yselivanov.ml at gmail.com (Yury Selivanov)
Date: Fri, 18 Aug 2017 12:17:11 -0400
Subject: [Python-ideas] PEP 550 v2
In-Reply-To:
References:
Message-ID:

On Fri, Aug 18, 2017 at 1:09 AM, Nick Coghlan wrote:
> On 17 August 2017 at 01:22, Yury Selivanov wrote:
>> On Wed, Aug 16, 2017 at 4:07 AM, Nick Coghlan wrote:
>>>> Coroutine Object Modifications
>>>> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
>>>>
>>>> To achieve this, a small set of modifications to the coroutine object
>>>> is needed:
>>>>
>>>> * New ``cr_local_context`` attribute.  This attribute is readable
>>>>   and writable for Python code.
>>>
>>> For ease of introspection, it's probably worth using a common
>>> `__local_context__` attribute name across all the different types that
>>> support one, and encouraging other object implementations to do the
>>> same.
>>>
>>> This isn't like cr_await and gi_yieldfrom, where we wanted to use
>>> different names because they refer to different kinds of objects.
>>
>> We also have cr_code and gi_code, which are used for introspection
>> purposes but refer to CodeObject.
>
> Right, hence https://bugs.python.org/issue31230 :)
>
> (That suggestion is prompted by the fact that if we'd migrated gi_code
> to __code__ in 3.0, the same way we migrated func_code, then cr_code
> and ag_code would almost certainly have followed the same
> dunder-naming convention, and
> https://github.com/python/cpython/pull/3077 would never have been
> necessary)
>
>> I myself don't like the mess the C-style convention created for our
>> Python code (think of what the "dis" and "inspect" modules have to go
>> through), so I'm +0 for having "__local_context__".
>
> I'm starting to think this should be __private_context__ (to convey
> the *intent* of the attribute), rather than naming it after the type
> that it's expected to store.

I've been thinking a lot about the terminology, and I have another
variant to consider: ExecutionContext is a stack of LogicalContexts.
Coroutines/generators will thus have a __logical_context__ attribute.
I think that the "logical" term better conveys the meaning than
"private" or "dynamic".

>
> Thinking about this particular attribute name did prompt the question
> of how we want PEP 550 to interact with the exec builtin, though, as
> well as raising some questions around a number of other code execution
> cases:
>
> 1. What is the execution context for top level code in a module?

Whatever the execution context of the current thread that is importing
the code is.  Which would usually be the main thread.

> 2. What is the execution context for the import machinery in an import
>    statement?
> 3. What is the execution context for the import machinery when invoked
>    via importlib?

Whatever the execution context is that invoked the import machinery,
be it "__import__()", an "import" statement, or
"importlib.load_module".

> 4. What is the execution context for the import machinery when invoked
>    via the C API?
> 5. What is the execution context for the import machinery when invoked
>    via the runpy module?
> 6. What is the execution context for things like the timeit module,
>    templating engines, etc?
> 7. What is the execution context for codecs and codec error handlers?
> 8. What is the execution context for __del__ methods and weakref
>    callbacks?

In general, the EC behaves just like TLS in all these cases; there's
literally no difference.

> 9. What is the execution context for trace hooks and other really low
>    level machinery?
> 10. What is the execution context for displayhook and excepthook?

Speaking of sys.displayhook and the sys.std* streams -- these APIs are
fundamentally incompatible with PEP 550 or any possible context
isolation.  These things are essentially *global* variables in the sys
module, and there's tons of code out there that *expects* them to
behave like globals.  If a user changes displayhook they expect it to
work across all threads.  If we want to make displayhook or the
sys.std* streams context-aware, we will need new APIs for them with
new properties/expectations.  Simply forcing them to use the execution
context would be backwards incompatible.

PEP 550 won't try to change how displayhooks, excepthooks, trace
functions, sys.stdout etc. work -- this is out of its scope.  We can't
refactor half of the sys module as part of one PEP.

>
> I think a number of those (top level module code executed via the
> import system, the timeit module, templating engines) can be addressed
> by saying that the exec builtin always creates a completely fresh
> execution context by default (with no access to the parent's execution
> context), and will gain a new keyword-only parameter that allows you
> to specify an execution context to use.  That way, exec'ed code will
> be independent by default, but users of exec() will be able to opt in
> to handling it like a normal function call by passing in the current
> context.

"exec" uses the outer globals/locals if you don't pass them explicitly
-- the code isn't isolated by default.  Isolation for "exec" is
opt-in:

    ]]] a = 1
    ]]] exec('print(a); b = 2')
    1
    ]]] b
    2

Therefore, with regards to PEP 550, it should execute the code with
the current EC/LC.  We should also add new keyword arguments to
provide a custom LC and EC (same as we do for locals/globals).
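
Something along these lines (a hypothetical signature sketch, not part
of the PEP yet; the parameter names are placeholders):

    exec(code, globals=None, locals=None, *,
         logical_context=None, execution_context=None)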
> The default REPL, the code module and the IDLE shell window
> would need to be updated so that they use a shared context for
> evaluating the user supplied code snippets, while keeping their own
> context separate.
>
> While top-level code would always run in a completely fresh context
> for imports, the runpy module would expose the same setting as the
> exec builtin, so the executed code would be isolated by default, but
> you could opt in to using a particular execution context if you wanted
> to.
>
> Codecs and codec error handlers I think will be best handled in a way
> similar to generators, where they have their own private context (so
> they can't alter the caller's context), but can *read* the caller's
> context (so the context can be used as a way of providing
> context-dependent codec settings).
>
> That "read-only" access model also feels like the right option for the
> import machinery - regardless of whether it's accessed via the import
> statement, importlib, the C API, or the runpy module, the import
> machinery should be able to *read* the dynamic context, but not make
> persistent changes to it.
>
> Since they can be executed at arbitrary points in the code, it feels
> to me that __del__ methods and weakref callbacks should *always* be
> executed in a completely pristine execution context, with no access
> whatsoever to any thread's dynamic context.
>
> I think we should leave the execution context alone for the really low
> level hooks, and simply point out that yes, these have the ability to
> do weird things to the execution context, just as they have the power
> to do weird things to local variables, so they need to be handled with
> care.
>
> For displayhook and excepthook, I don't have a particularly strong
> intuition, so my default recommendation would be the read-only access
> proposed for generators, codecs, and the import machinery.

I really think that in 3.7 we should just implement PEP 550 with its
current scope, and defer system refactorings to 3.8.  Many such
refactorings will probably deserve their own PEP, as, for example,
changing sys.stdout semantics is a really complex topic.  At this
point we are trying to solve the problem of making a replacement for
TLS that supports generators and async.

Yury

From levkivskyi at gmail.com  Fri Aug 18 13:48:19 2017
From: levkivskyi at gmail.com (Ivan Levkivskyi)
Date: Fri, 18 Aug 2017 19:48:19 +0200
Subject: [Python-ideas] More Metadata for Variable Annotations
In-Reply-To:
References:
Message-ID:

Hi Bagrat,

Thanks for a detailed proposal! Indeed, some projects might want to
have some additional metadata attached to a variable/argument besides
its type.  However, I think it would be more productive to first
discuss this on a more specialized forum like
https://github.com/python/typing/issues

Note that similar proposals have been discussed and rejected before,
see for example
https://www.python.org/dev/peps/pep-0484/#what-about-existing-uses-of-annotations
so you would need to have a strong argument, for example some popular
projects that would benefit from your proposal.

-- 
Ivan

On 18 August 2017 at 17:09, Bagrat Aznauryan wrote:

> # Abstract
>
> Before the holy PEP-526 the only option for type hints was comments.
> And before PEP-484 the docstrings were the main place where variable
> metadata would go.
> That variable metadata would include:
>
> * the type
> * the human-readable description
> * some value constraints (e.g. a range for an integer variable)
>
> PEP-526 introduced the awesome syntactic sugar, which made the first
> part of the metadata -- the type -- easily introspectable at runtime.
> However, if you still need to add the description and the value
> constraints to the variable metadata, you still need to fall back to
> the docstring option.
>
> The idea is to make it possible to include all of the mentioned
> metadata in the variable annotations.
>
> # Rationale
>
> Having the type specified using the supported annotation syntax and
> the rest of the metadata in the docstrings adds duplication and
> complexity for further maintenance. Moreover, if you need the
> docstring-contained metadata to be used at runtime, you need to
> implement a parser or pick one from the existing ones, which adds
> another dependency to your application.
>
> The need for metadata other than the type may well prove to be
> common. A typical example is generating the JSON Schema for a class,
> e.g. to be used for the OpenAPI definition of your API.
>
> # Possible Solutions
>
> ## A wrapper
>
> The proposal is to introduce a new wrapper (probably a function),
> that will accept the type as the first positional argument and
> additional keyword arguments for metadata. The wrapper will map the
> keyword arguments to the type object as attributes and return it.
> The code would look like this:
>
> ```
> foo: wrapper(
>     int,
>     description="bar",
>     minimum=0,
>     maximum=100
> )
> ```
>
> Later, the metadata can be accessed as the annotation attributes,
> like e.g.:
>
> ```
> __annotations__['foo'].description
> ```
>
> ## Annotation as a tuple
>
> This solution does not require any code change in Python, but will
> force other tools to change their parsing (e.g. mypy). The proposal
> is that when the annotation is a tuple instance, the first element is
> used as the type of the variable, and the rest is either ignored or
> treated as additional metadata. This makes it possible to add the
> metadata as a separate dictionary in the second element of the
> annotation tuple. For example:
>
> ```
> foo: (
>     int,
>     {
>         "description": "bar",
>         "minimum": 0,
>         "maximum": 100
>     }
> )
> ```
>
> The annotation will be stored as is, so to access the metadata at
> runtime, one would need to explicitly access the second item of the
> annotation tuple.
>
> # Summary
>
> This option would help to have well-annotated code which is
> self-descriptive, and make it possible to generate schemas and other
> definitions (e.g. OpenAPI) automatically and without duplication.
>
> The proposed solutions are definitely not perfect and not the main
> point of this email. The target is to describe the idea and the
> motivation and start a discussion.
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From chris.barker at noaa.gov Fri Aug 18 14:47:40 2017
From: chris.barker at noaa.gov (Chris Barker)
Date: Fri, 18 Aug 2017 11:47:40 -0700
Subject: [Python-ideas] How do you think about these language extensions?
In-Reply-To: References: <20170818120609.GS7395@ando.pearwood.info>
Message-ID:

>> # arrow transform (to avoid endless parentheses and try to be more
>> readable.)
>> >> range(5) -> map(.x->x+2, _) -> list(_)
>> >> [2,3,4,5,6]

> I like the idea of chained function calls

Parentheses aren't that bad, and as far as I can tell, this is just
another way to call a function on the results of a function. The above
is now spelled:

list(map(lambda x: x+2, range(5)))

which seems fine with me -- the only improvement I see is a more
compact way to spell lambda. (Though really, a list comp is considered
more "pythonic" these days, yes?)

[x+2 for x in range(5)]

Nicely, we have list comps and generator expressions, so we can avoid
the list() call. I know this was a simple example for demonstration's
sake, but it doesn't look like an improvement to me. Of course, in
this case, it's chaining iterations, not "ordinary" functions, so
maybe it would make more sense in other contexts.

Also, we need to remember that functions can take *args, **kwargs,
etc, and can return a tuple of just about anything -- not sure how
well that maps to the "pipe" model.

-CHB

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From guido at python.org Fri Aug 18 16:34:51 2017
From: guido at python.org (Guido van Rossum)
Date: Fri, 18 Aug 2017 13:34:51 -0700
Subject: [Python-ideas] More Metadata for Variable Annotations
In-Reply-To: References: Message-ID:

A similar approach (though only for class/instance variables) is taken
by the 'attrs' package and by the proposal currently code-named
"dataclasses" ( https://github.com/ericvsmith/dataclasses).

On Fri, Aug 18, 2017 at 10:48 AM, Ivan Levkivskyi wrote:

> Hi Bagrat,
>
> Thanks for a detailed proposal! Indeed, some projects might want to
> have some additional metadata attached to a variable/argument besides
> its type. However, I think it would be more productive to first
> discuss this on a more specialized forum like
> https://github.com/python/typing/issues
> [...]

--
--Guido van Rossum (python.org/~guido)

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From wes.turner at gmail.com Fri Aug 18 17:23:50 2017
From: wes.turner at gmail.com (Wes Turner)
Date: Fri, 18 Aug 2017 16:23:50 -0500
Subject: [Python-ideas] More Metadata for Variable Annotations
In-Reply-To: References: Message-ID:

PyContracts supports things like numpy array constraints
https://andreacensi.github.io/contracts/reference.html#contracts-language-reference

> You can specify that the value must be a list, and specify optional
> constraints for its length and for its elements. ...

You mentioned JSON Schema. For RDF (e.g. JSONLD, #CSVW), there are a
number of relevant variable annotations that should be useful as
schema:

> A data table with 7 metadata header rows (column label, property URI
> path, DataType, unit, accuracy, precision, significant figures):

https://wrdrd.github.io/docs/consulting/linkedreproducibility#csv-csvw-and-metadata-rows

On Friday, August 18, 2017, Bagrat Aznauryan wrote:

> # Abstract
>
> Before the holy PEP-526 the only option for type hints was comments.
> [...]

-------------- next part --------------
An HTML attachment was scrubbed...
URL: From ethan at stoneleaf.us Fri Aug 18 21:25:46 2017 From: ethan at stoneleaf.us (Ethan Furman) Date: Fri, 18 Aug 2017 18:25:46 -0700 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: <20170816142553.GA2837@bytereef.org> <20170816160856.GA2672@bytereef.org> <20170816171304.GA3261@bytereef.org> Message-ID: <5997939A.1090404@stoneleaf.us> On 08/17/2017 02:40 AM, Nick Coghlan wrote: > On 17 August 2017 at 04:38, Yury Selivanov wrote: > ck.get_value() attempts to look up the value for that key in the > currently active execution context. > If it doesn't find one, it then tries each of the execution > contexts in the currently active dynamic context. > If it *still* doesn't find one, then it will set the default value > in the outermost execution context and then return that value. For what it's worth, I find the term DynamicContext much easier to understand with relation to these concepts. -- ~Ethan~ From ethan at stoneleaf.us Fri Aug 18 21:41:34 2017 From: ethan at stoneleaf.us (Ethan Furman) Date: Fri, 18 Aug 2017 18:41:34 -0700 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: <20170816142553.GA2837@bytereef.org> Message-ID: <5997974E.9010801@stoneleaf.us> On 08/16/2017 08:43 AM, Yury Selivanov wrote: > To be honest, I really like Execution Context and Local Context names. > I'm curious if other people are confused with them. +1 confused :/ -- ~Ethan~ From guido at python.org Fri Aug 18 22:26:15 2017 From: guido at python.org (Guido van Rossum) Date: Fri, 18 Aug 2017 19:26:15 -0700 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: <5997974E.9010801@stoneleaf.us> References: <20170816142553.GA2837@bytereef.org> <5997974E.9010801@stoneleaf.us> Message-ID: I'm also confused by these, because they share the noun part of their name, but their use and meaning is quite different. The PEP defines an EC as a stack of LCs, and (apart from strings :-) it's usually not a good idea to use the same term for a container and its items. On Fri, Aug 18, 2017 at 6:41 PM, Ethan Furman wrote: > On 08/16/2017 08:43 AM, Yury Selivanov wrote: > > To be honest, I really like Execution Context and Local Context names. >> I'm curious if other people are confused with them. >> > > +1 confused :/ > > -- > ~Ethan~ > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve at pearwood.info Fri Aug 18 23:57:10 2017 From: steve at pearwood.info (Steven D'Aprano) Date: Sat, 19 Aug 2017 13:57:10 +1000 Subject: [Python-ideas] How do you think about these language extensions? In-Reply-To: References: <20170818120609.GS7395@ando.pearwood.info> Message-ID: <20170819035710.GU7395@ando.pearwood.info> On Fri, Aug 18, 2017 at 11:47:40AM -0700, Chris Barker wrote: > >> # arrow transform (to avoid endless parentheses and try to be more > readable. > > > >> > > >> >> range(5) -> map(.x->x+2, _) -> list(_) > > >> >> [2,3,4,5,6] > > > > > > I like the idea of chained function calls > > > parentheses aren't that bad, and as far as I can tell, this is just another > way to call a function on the results of a function. I wouldn't say that parens are evil, but they're pretty noisy and distracting. 
I remember an old joke that claimed to prove that the US Defence
Department was using Lisp for the SDI ("Star Wars") software: somebody
had found a page covered completely edge to edge in nothing but
closing brackets:

))))))))))))))))))))))))))))))))))))))))
))))))))))))))))))))))))))))))))))))))))
)))))))))))))))))))))
... etc

Your example has a fairly short pipeline of calls:

> list(map(lambda x: x+2, range(5)))

But even this has two clear problems:

- the trailing brackets ))) are just noise, like the SDI joke above;

- you have to read it backwards, right to left, to make sense of it.

Imagine if you had a chain of ten or twenty calls:

)))))))))) ... you get the picture

But ultimately that's a relatively minor nuisance rather than a major
problem. The thing that makes long chains of function calls painful is
that you have to read them backwards:

- first range() is called;
- then map;
- finally list

even though we write them in the opposite order. When we reason about
the code, say to write it in the first place, or to read the
expression and understand it, I would guess that most people reason
something like this:

- start with our input data, range()
- call map on it to generate new values;
- call list to generate a list.

When writing code like this, I frequently find myself having to work
backwards compared to how we write the order of function calls:

range(5)
# move editor insertion point backwards
map(...)
# move editor insertion point backwards
list(...)

Half of my key presses are moving backwards over code I've just
written to insert a function call which is executed *after* what I
wrote, but needs to be written *before* what I just wrote.

For a short example like this, where we can easily keep the three
function calls in short-term memory, it isn't so bad, but short-term
memory is very limited ("magic number seven, plus or minus two") and
if you're already thinking about a couple of previous operations on
earlier lines of code, you don't have a lot of stack space left for a
long chain of operations.

And that's why we often fall back to temporary variables and an
imperative style:

data = range(5)
data = map(..., data)
data = list(data)

Perhaps not in such a short example, but for longer ones, very
frequently. We can write the code in the same order that it is
executed with a pipeline and avoid needing to push functions into our
short-term memory when either reading or writing:

range(5) -> map(lambda...) -> list

This way of thinking combines the strengths of postfix notation and
function call notation, without the disadvantages of either. This is
very successful in shell scripting languages like bash. I don't want
to oversell it as a panacea that solves everything, but it really is a
powerful (and underused) software paradigm.

> which seems fine with me -- the only improvement I see is a more compact
> way to spell lambda. (Though really, a list comp is considered more
> "pythonic" these days, yes?)
>
> [x+2 for x in range(5)]

Aye, for such a short example.
But consider a longer one: find the earliest date in a bunch of lines
of text:

result = (myfile.readlines()
          -> map(str.strip)
          -> filter( lambda s: not s.startswith('#') )
          -> sorted
          -> collapse  # collapse runs of identical lines
          -> extract_dates
          -> map(date_to_seconds)
          -> min
          )

(I've assumed that the functions map and filter have some sort of
automatic currying, like in Haskell; if you don't like that, then just
pretend I spelled them Map and Filter instead :-)

That's nice and easy to read and write: I wrote down exactly the steps
I would have taken to solve the problem, in the same order that they
need to be taken. Formatting is a breeze: the hardest decision was how
far to indent subsequent lines.

Compare it to this:

result = min(map(date_to_seconds, extract_dates(collapse(sorted(
    filter(lambda s: not s.startswith('#'),
    map(str.strip, myfile.readlines())))))))

You have to read all the way to the end to find out the most important
part, namely what data you are operating on! And then you have to read
backwards to understand what is done to the data. And finally you have
to be prepared for a whole lot of arguments from your co-workers about
how to format it :-)

# Either the ugliest thing ever, or the One True Way
result = min(
    map(
        date_to_seconds,
        extract_dates(
            collapse(
                sorted(
                    filter(
                        lambda s: not s.startswith('#'),
                        map(
                            str.strip,
                            myfile.readlines()
                        )
                    )
                )
            )
        )
    )
)

[...]

> Also, we need to remember that functions can take *args, **kwargs, etc,
> and can return a tuple of just about anything -- not sure how well that
> maps to the "pipe" model.

Not everything maps well to the function pipeline model. But enough
things do that I believe it is a powerful tool in the programmer's
toolkit.

--
Steve

From mertz at gnosis.cx Sat Aug 19 01:33:40 2017
From: mertz at gnosis.cx (David Mertz)
Date: Fri, 18 Aug 2017 22:33:40 -0700
Subject: [Python-ideas] How do you think about these language extensions?
In-Reply-To: <20170819035710.GU7395@ando.pearwood.info>
References: <20170818120609.GS7395@ando.pearwood.info>
<20170819035710.GU7395@ando.pearwood.info>
Message-ID:

This is pretty easy to write without any syntax changes, just using a
higher-order function `compose()` (possible implementation at foot).
Again, I'll assume auto-currying like the map/filter versions of those
functions in toolz, as Steven does:

> result = (myfile.readlines()
>           -> map(str.strip)
>           -> filter( lambda s: not s.startswith('#') )
>           -> sorted
>           -> collapse  # collapse runs of identical lines
>           -> extract_dates
>           -> map(date_to_seconds)
>           -> min
>           )

result = compose(map(str.strip),
                 filter(lambda s: not s.startswith('#')),
                 sorted,
                 collapse,
                 extract_dates,
                 map(date_to_seconds),
                 min
                 )(myfile.readlines())

Pretty much exactly the same thing with just a utility HOF. There's
one that behaves right in `toolz`/`cytoolz`, or I've used this one in
some publications and teaching material:

def compose(*funcs):
    """Return a new function s.t.
    compose(f,g,...)(x) == f(g(...(x)))
    """
    def inner(data, funcs=funcs):
        result = data
        for f in reversed(funcs):
            result = f(result)
        return result
    return inner

--
Keeping medicines from the bloodstreams of the sick; food from the
bellies of the hungry; books from the hands of the uneducated;
technology from the underdeveloped; and putting advocates of freedom
in prisons. Intellectual property is to the 21st century what the
slave trade was to the 16th.

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From steve at pearwood.info Sat Aug 19 06:42:03 2017
From: steve at pearwood.info (Steven D'Aprano)
Date: Sat, 19 Aug 2017 20:42:03 +1000
Subject: [Python-ideas] How do you think about these language extensions?
In-Reply-To: References: <20170818120609.GS7395@ando.pearwood.info>
<20170819035710.GU7395@ando.pearwood.info>
Message-ID: <20170819104202.GV7395@ando.pearwood.info>

On Fri, Aug 18, 2017 at 10:33:40PM -0700, David Mertz wrote:

> This is pretty easy to write without any syntax changes, just using a
> higher-order function `compose()` (possible implementation at foot).
> Again, I'll assume auto-currying like the map/filter versions of those
> functions in toolz, as Steven does:
[...]
> result = compose(map(str.strip),
>                  filter(lambda s: not s.startswith('#')),
>                  sorted,
>                  collapse,
>                  extract_dates,
>                  map(date_to_seconds),
>                  min
>                  )(myfile.readlines())

A ~~slight~~ major nit: given the implementation of compose you quote
below, this applies the functions in the wrong order. min() is called
first, and map(str.strip) last.

But apart from being completely wrong *wink* that's not too bad :-)

Now we start bike-shedding the aesthetics of what looks better and
reads more nicely. Your version is pretty good, except:

1) The order of function composition is backwards to that normally
expected (more on this below);

2) there's that unfortunate call to "compose" which isn't actually
part of the algorithm, it's just scaffolding to make it work;

3) the data being operated on is still at the far end of the chain,
instead of the start;

4) and I believe that teaching a chain of function calls is easier
than teaching higher order function composition. Much easier.

The standard mathematical definition of function composition operates
left to right:

(f∘g∘h)(x) = f(g(h(x)))

http://mathworld.wolfram.com/Composition.html

And that's precisely what your implementation does. Given your
implementation quoted below:

py> def add_one(x): return x + 1
...
py> def double(x): return 2*x
...
py> def take_one(x): return x - 1
...
py>
py> compose(add_one,
...         double,
...         take_one)(10)
19
py>
py> add_one(double(take_one(10)))
19

which is the mathematically expected behaviour. But for chaining, we
want the operations in the opposite order:

10 -> add_one -> double -> take_one

which is equivalent to:

take_one(double(add_one(10)))

So to use composition for chaining, we need:

- a non-standard implementation of composition, which operates in the
reverse to what mathematicians and functional programmers expect;

- AND remember to use this rcompose() instead of compose()

- stick to the standard compose(), but put the functions in the
reverse order to what we want;

- or use the standard compose, but use even more scaffolding to
make it work:

result = compose(*reversed(
    ( map(str.strip),
      filter(lambda s: not s.startswith('#')),
      sorted,
      collapse,
      extract_dates,
      map(date_to_seconds),
      min
    )))(myfile.readlines())

> def compose(*funcs):
>     """Return a new function s.t.
>     compose(f,g,...)(x) == f(g(...(x)))
>     """
>     def inner(data, funcs=funcs):
>         result = data
>         for f in reversed(funcs):
>             result = f(result)
>         return result
>     return inner

--
Steve

From twshere at outlook.com Sat Aug 19 06:34:16 2017
From: twshere at outlook.com (Thautwarm)
Date: Sat, 19 Aug 2017 10:34:16 +0000
Subject: [Python-ideas] How do you think about these language extensions?(Thautwarm)
Message-ID:

Hi, all! I want to reply to many people, and it might annoy you if I
write multiple replies... As a result, I write them all in one post.
----------------------------------------------------------------------------------
To Christopher Barker, Ph.D.
----------------------------------------------------------------------------------

Hi, Dr. Christopher Barker,

Just as you said,

> Parentheses aren't that bad, and as far as I can tell, this is just
> another way to call a function on the results of a function.
> The above is now spelled:
> list(map(lambda x: x+2, range(5)))
> which seems fine with me -- the only improvement I see is a more compact
> way to spell lambda. (Though really, a list comp is considered more
> "pythonic" these days, yes?)
> [x+2 for x in range(5)]
> Nicely, we have list comps and generator expressions, so we can avoid
> the list() call.

I'll try to say something about why I think we need this grammar; the
reasons are not just to remove parentheses.

Consider this way to define a variable:

>> var = expr() -> g1(_) if f(_) else g2(_)

which equals

>> test = expr()
>> var = g1(test) if f(test) else g2(test)

which means that we have to use a temporary variable "test" to define
"var". I think the second example is a bit lengthy, isn't it?

The reason why I take this kind of grammar is that I can "flatten the
programming logic". In other words, I can clearly state what I mean to
say in the order of my thinking. For example,

>> lambda x: f(g(x)) -> map(_, range(100))

The code above means that I'm stressing what (an action) I'm going to
do to the object "range(100)". However, sometimes the actions are not
important, so if we want to stress what we're operating on, we write
this code:

>> range(100) -> map( lambda x:f(g(x)), _ )

Additionally, coding with chaining expressions makes me feel like
writing a poem (although it's a little difficult for me :)

What do you think about writing the following code?

>> someone -> dosomething( _, options=options) \
>>         -> is_meeting_some_conditions( _ ) \
>>         -> result1() if _ else result2()
>>     where:
>>         options = ...
>>         result1 = lambda: ...
>>         result2 = lambda: ...
>>         def dosomething(obj, options) -> Any: ...
>>         def is_meeting_some_conditions( event : Any ) -> bool : ...

In my opinion, it's quite readable and "smooth". To be honest, I think
we could totally do coding like chatting, and it can be quite
enjoyable.

However, I'm not sure whether '->' is a good choice; it didn't lead to
any conflicts at all when I compiled the CPython source code.
Moreover, it can be easily changed in Grammar/Grammar, so I don't
think it's crucial.

Finally,

> Also, we need to remember that functions can take *args, **kwargs, etc,
> and can return a tuple of just about anything -- not sure how well that
> maps to the "pipe" model.

I don't think the "pipe" model is the right way to look at this. We
don't need to worry about this problem with the grammar I've
implemented :)

>> (lambda x: (x%5, x) ) -> max( range(99), key = _)
>> 94

>> def max_from_seq(*args): return max(args)
>> [1,2,3] -> max_from_seq(*_)
>> 3

Thautwarm

----------------------------------------------------------------------------------
To David Mertz
----------------------------------------------------------------------------------

I think what you said is partially correct, but auto-currying alone is
not flexible enough. For sure you can do this with a "compose"
function:

> ... -> ... -> ... -> map(lambda x:x+1, _)

However, the evaluated result can be used as any argument of map and
other callable objects:

> ... -> ... -> ... -> map(_ , range(100))
> ... -> ... -> ... -> min([1,2,3], key = _ )
Thautwarm

----------------------------------------------------------------------------------
To Chris Angelico
----------------------------------------------------------------------------------

To be honest, I'm in favor of the grammar you prefer, like
"expr1 | expr2 | expr3". However, it might be annoying that I'd first
have to define a lot of kinds of pipeline operators like Map, Reduce
and so on.

As for whether to allow an implicit first/last argument, it seems to
be a good idea, but two points are in the way:

1. We need to change almost all the C functions related to expressions
in the source code located at Python/ast.c, while implementing the
grammar I'm using now needs nothing more than adding a new C function
there.

2. Implicit arguments make it impossible to use expressions like the
following.

> ... -> func(some_var, some_var, _, some_eval(), some_key = _)

In other words, implicit arguments weaken the grammar; we need one
more "where syntax" to do the same thing:

> some -> new_func where:
>     new_func = lambda x: func(some_var, some_var, x, some_eval(), some_key = x)

Emmm... I'm not sure about that; what do you think?

Thautwarm

----------------------------------------------------------------------------------
To Steven D'Aprano
----------------------------------------------------------------------------------

Thank you very much for your reply; it encouraged me a lot. I've just
read your most recent post, and it seems that you've suffered a lot
from the parentheses, and so did I.

> Half of my key presses are moving backwards over code I've just written
> to insert a function call which is executed *after* what I wrote, but
> needs to be written *before* what I just wrote.

I couldn't agree more with what you've said here!!! My opinions about
"chaining and pipeline" can be found in my reply to Chris Barker;
sorry that I cannot repeat myself in the same post.

>> # where syntax
>>
>> from math import pi
>> r = 1 # the radius
>> h = 10 # the height
>> S = (2*S_top + S_side) where:
>>     S_top = pi*r**2
>>     S_side = C * h where:
>>         C = 2*pi*r

> This has been suggested a few times. The first time, I disliked it, but
> I've come around to seeing its value. I like it.

> I wonder: could we make the "where" clause delay evaluation until the
> entire block was compiled, so that we could write something like this:
>
> S = (2*S_top + S_side) where:
>     S_top = pi*r**2
>     S_side = C * h  # C is defined further on
>     C = 2*pi*r
>
> That's more how "where" is used mathematically.

To be honest, I'm not sure what to say about that idea. The grammar
you've just considered is quite Haskell-like, I think. And the reason
why I want to use the "where syntax" is to divide the programming
logic clearly into different layers.

For example, sometimes we just need to know that the surface area of a
cylinder is

    2*S_top + S_side

If someone sees the code, they may not need to know how S_top and
S_side are evaluated; knowing what they mean is enough. And if you
want to know more about how S_side and S_top are evaluated, just look
at the nested "where syntax" and find the answers.

Here is another example, about forward propagation in neural networks.

    # input_layer[i] : "np.ndarray[:]"    = np.array( ... )
    # weight[i]      : "np.ndarray[:][:]" = np.array( ... )

    output_layer[i] = activate(input_layer[i]) where:

        """ logic layer 1 """

        def activate( layer ):
            ...
            return activation[i](layer)  # for example, activation[i] = lambda x:x
    input_layer[i] = forward(weight[i-1], output_layer[i-1].T) where:

        """ logic layer 2 """

        def forward(weight, output):
            ...
            # if it's a normal multi-layer perceptron.
            return np.matmul(weight, output.T)

For some people, their work just requires them to know that forward
propagation of a neural network means that the output layer is
generated from the input layer by some transformation. Those who want
to know what the transformation is can go to the next "where syntax"
and find the definition of the transformation, which is named
"activate". And those who want to know how a neural network works with
multiple layers can see that each input layer is defined by the last
output_layer and the last weight matrix -- that is how the NN goes
forward.

I think using "where syntax" in this way to deconstruct the
programming logic can strengthen readability a lot!

Next, I'm going to say something about pattern matching, and transform
the example into regular Python to make it clear to understand.

>> Here is an example to use flowpython, which gives the permutations of a sequence.
>>
>> from copy import deepcopy
>> permutations = .seq -> seq_seq where:
>>     condic+[] seq:
>>         case (a, )  => seq_seq = [a,]
>>         case (a, b) => seq_seq = [[a,b],[b,a]]
>>         case (a,*b) =>
>>             seq_seq = permutations(b) -> map(.x -> insertAll(x, a), _) -> sum(_, []) where:
>>                 insertAll = . x, a -> ret where:
>>                     ret = [ deepcopy(x) -> _.insert(i, a) or _ for i in (len(x) -> range(_+1)) ]

> I find that almost unreadable. Too many new features all at once, it's
> like trying to read a completely unfamiliar language.

> How would you translate that into regular Python?

This algorithm can be fixed a little, because the second case is
redundant. And here is the regular Python code transformed from the
code above.

from copy import deepcopy

def permutations(seq):
    try:
        # the first case
        (a, ) = seq
        # return a list of permutations: [[a]] rather than [a],
        # so that insertAll below always receives a list
        return [[a]]
    except:
        try:
            # the third case (the second case is redundant)
            def insertAll(x, a):
                # insertAll([1,2,3], 0) ->
                # [[0, 1, 2, 3], [1, 0, 2, 3], [1, 2, 0, 3], [1, 2, 3, 0]]
                ret = []
                for i in range( len(x) + 1 ):
                    tmp = deepcopy(x)
                    tmp.insert(i, a)
                    ret.append(tmp)
                return ret

            (a, *b) = seq
            tmp = permutations(b)
            tmp = map(lambda x : insertAll(x, a) , tmp)
            # sum([[1,2,3], [-1,-2,-3]], []) -> [1,2,3,-1,-2,-3]
            return sum(tmp, [])
        except:
            # no otherwise!
            pass

To be continued... (sorry for my lack of time)

Thautwarm

----------------------------------------------------------------------------------

I'm sorry that I have to do some other work now and haven't finished
writing down all I want to say. I'd like to continue replying to the
posts tomorrow; it's quite a pleasure to discuss these topics with you
all!!!

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From mertz at gnosis.cx Sat Aug 19 12:05:36 2017
From: mertz at gnosis.cx (David Mertz)
Date: Sat, 19 Aug 2017 09:05:36 -0700
Subject: [Python-ideas] How do you think about these language extensions?
In-Reply-To: <20170819104202.GV7395@ando.pearwood.info>
References: <20170818120609.GS7395@ando.pearwood.info>
<20170819035710.GU7395@ando.pearwood.info>
<20170819104202.GV7395@ando.pearwood.info>
Message-ID:

You are right, of course. Mine does the order wrong. But an
'rcompose()' or 'pipe()' or 'funchain()' is easy enough to put in the
right order.
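For instance, here is a minimal sketch under the same assumptions as
before (the name 'rcompose' is mine; the only change from the
compose() given earlier is dropping the reversed() call, so the
functions apply left to right, pipeline style):

    def rcompose(*funcs):
        """Return a new function s.t.
        rcompose(f, g, ...)(x) applies f first, then g, etc."""
        def inner(data, funcs=funcs):
            result = data
            for f in funcs:  # no reversed(): left-to-right order
                result = f(result)
            return result
        return inner

    # rcompose(take_one, double, add_one)(10)
    # == add_one(double(take_one(10))) == 19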
On Aug 19, 2017 3:44 AM, "Steven D'Aprano" wrote:

> On Fri, Aug 18, 2017 at 10:33:40PM -0700, David Mertz wrote:
>
> > This is pretty easy to write without any syntax changes, just using a
> > higher-order function `compose()` (possible implementation at foot).
>
> A ~~slight~~ major nit: given the implementation of compose you quote
> below, this applies the functions in the wrong order. min() is called
> first, and map(str.strip) last.
>
> [...]

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From mertz at gnosis.cx Sat Aug 19 17:13:35 2017
From: mertz at gnosis.cx (David Mertz)
Date: Sat, 19 Aug 2017 14:13:35 -0700
Subject: [Python-ideas] How do you think about these language extensions?
In-Reply-To: <20170819104202.GV7395@ando.pearwood.info>
References: <20170818120609.GS7395@ando.pearwood.info>
<20170819035710.GU7395@ando.pearwood.info>
<20170819104202.GV7395@ando.pearwood.info>
Message-ID:

On Aug 19, 2017 3:44 AM, "Steven D'Aprano" wrote:

> 2) there's that unfortunate call to "compose" which isn't actually
> part of the algorithm, it's just scaffolding to make it work;

I see this as an ADVANTAGE, actually. We can save the composed
function under another name before applying it to various data later.
Or 'rcomposed' or whatever name.

Moreover, composition is associative:

op1 = compose(a, b, c)
op2 = compose(d, e, f)
op3 = compose(op1, op2)

This is useful for creating compound operations that might be useful
in themselves. The pipe operator doesn't lend itself nearly as well to
this scenario.

FWIW, while I think using a different function name is better, you
could use a 'reversed=True' keyword argument on a compose() function.

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From njs at pobox.com Sat Aug 19 17:33:53 2017
From: njs at pobox.com (Nathaniel Smith)
Date: Sat, 19 Aug 2017 14:33:53 -0700
Subject: [Python-ideas] PEP 550 v2
In-Reply-To: <50759657-9920-405a-8c08-7618024176e4@googlegroups.com>
References: <50759657-9920-405a-8c08-7618024176e4@googlegroups.com>
Message-ID:

On Sat, Aug 19, 2017 at 12:09 PM, Neil Girdhar wrote:
> Cool to see this on python-ideas. I'm really looking forward to this PEP
> 550 or 521.
>
> On Wednesday, August 16, 2017 at 3:19:29 AM UTC-4, Nathaniel Smith wrote:
>> 2) For classic decimal.localcontext context managers, the idea is
>> still that you save/restore the value, so that you can nest multiple
>> context managers without having to push/pop LCs all the time. But the
>> above API is not actually sufficient to implement a proper
>> save/restore, for a subtle reason: if you do
>>
>> ci.set(ci.get())
>>
>> then you just (potentially) moved the value from a lower LC up to the top
>> LC.
>
> I agree with Nathaniel that this is an issue with the current API. I don't
> think it's a good idea to have set and get methods. It would be much better
> to reflect the underlying ExecutionContext *stack* in the API by exposing a
> mutating *context manager* on the Context Key object instead of set. For
> example,
>
> my_context = sys.new_context_key('my_context')
>
> options = my_context.get()
> options.some_mutating_method()
>
> with my_context.mutate(options):
>     # Do whatever you want with the mutated context
> # Now, the context is reverted.
>
> Similarly, instead of
>
> my_context.set('spam')
>
> you would do
>
> with my_context.mutate('spam'):
>     # Do whatever you want with the mutated context
> # Now, the context is reverted.

Unfortunately, I don't think we can eliminate the set() operation
entirely, because the libraries we want to migrate to using this --
like decimal and numpy -- generally provide set() operations in their
public API. (See: decimal.setcontext, numpy.seterr, ...) They're
generally not recommended for use in new code, but they do exist and
are covered by compatibility guarantees, so we need some way to
implement them using the PEP 550 API.

OTOH we can certainly provide a context manager like this and make it
the obvious convenient thing to use (and which also happens to do the
right thing). We could potentially also give the 'set' primitive an
ugly name to remind people that it has this pitfall, like make it
'set_in_top_context' or something.

-n

--
Nathaniel J.
Smith -- https://vorpus.org From njs at pobox.com Sat Aug 19 17:42:21 2017 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 19 Aug 2017 14:42:21 -0700 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: <5997939A.1090404@stoneleaf.us> References: <20170816142553.GA2837@bytereef.org> <20170816160856.GA2672@bytereef.org> <20170816171304.GA3261@bytereef.org> <5997939A.1090404@stoneleaf.us> Message-ID: On Fri, Aug 18, 2017 at 6:25 PM, Ethan Furman wrote: > On 08/17/2017 02:40 AM, Nick Coghlan wrote: >> >> On 17 August 2017 at 04:38, Yury Selivanov wrote: > > >> ck.get_value() attempts to look up the value for that key in the >> currently active execution context. >> If it doesn't find one, it then tries each of the execution >> contexts in the currently active dynamic context. >> If it *still* doesn't find one, then it will set the default value >> in the outermost execution context and then return that value. > > > For what it's worth, I find the term DynamicContext much easier to > understand with relation to these concepts. I really like DynamicContext -- if you know the classic dynamic/static terminology in language design then it works as a precise technical description, but it also makes sense as plain non-technical English. And it avoids the confusingly overloaded word "scope". Apropos Guido's point about container naming, how about DynamicContext and DynamicContextStack? That's only 3 letters longer than ExecutionContext. -n -- Nathaniel J. Smith -- https://vorpus.org From mistersheik at gmail.com Sat Aug 19 15:09:30 2017 From: mistersheik at gmail.com (Neil Girdhar) Date: Sat, 19 Aug 2017 12:09:30 -0700 (PDT) Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: <50759657-9920-405a-8c08-7618024176e4@googlegroups.com> Cool to see this on python-ideas. I'm really looking forward to this PEP 550 or 521. On Wednesday, August 16, 2017 at 3:19:29 AM UTC-4, Nathaniel Smith wrote: > > On Tue, Aug 15, 2017 at 4:55 PM, Yury Selivanov > wrote: > > Hi, > > > > Here's the PEP 550 version 2. > > Awesome! > > Some of the changes from v1 to v2 might be a bit confusing -- in > particular the thing where ExecutionContext is now a stack of > LocalContext objects instead of just being a mapping. So here's the > big picture as I understand it: > > In discussions on the mailing list and off-line, we realized that the > main reason people use "thread locals" is to implement fake dynamic > scoping. Of course, generators/async/await mean that currently it's > impossible to *really* fake dynamic scoping in Python -- that's what > PEP 550 is trying to fix. So PEP 550 v1 essentially added "generator > locals" as a refinement of "thread locals". But... it turns out that > "generator locals" aren't enough to properly implement dynamic scoping > either! So the goal in PEP 550 v2 is to provide semantics strong > enough to *really* get this right. > > I wrote up some notes on what I mean by dynamic scoping, and why > neither thread-locals nor generator-locals can fake it: > > > https://github.com/njsmith/pep-550-notes/blob/master/dynamic-scope.ipynb > > > Specification > > ============= > > > > Execution Context is a mechanism of storing and accessing data specific > > to a logical thread of execution. We consider OS threads, > > generators, and chains of coroutines (such as ``asyncio.Task``) > > to be variants of a logical thread. 
> > > > In this specification, we will use the following terminology: > > > > * **Local Context**, or LC, is a key/value mapping that stores the > > context of a logical thread. > > If you're more familiar with dynamic scoping, then you can think of an > LC as a single dynamic scope... > > > * **Execution Context**, or EC, is an OS-thread-specific dynamic > > stack of Local Contexts. > > ...and an EC as a stack of scopes. Looking up a ContextItem in an EC > proceeds by checking the first LC (innermost scope), then if it > doesn't find what it's looking for it checks the second LC (the > next-innermost scope), etc. > > > ``ContextItem`` objects have the following methods and attributes: > > > > * ``.description``: read-only description; > > > > * ``.set(o)`` method: set the value to ``o`` for the context item > > in the execution context. > > > > * ``.get()`` method: return the current EC value for the context item. > > Context items are initialized with ``None`` when created, so > > this method call never fails. > > Two issues here, that both require some expansion of this API to > reveal a *bit* more information about the EC structure. > > 1) For trio's cancel scope use case I described in the last, I > actually need some way to read out all the values on the LocalContext > stack. (It would also be helpful if there were some fast way to check > the depth of the ExecutionContext stack -- or at least tell whether > it's 1 deep or more-than-1 deep. I know that any cancel scopes that > are in the bottommost LC will always be attached to the given Task, so > I can set up the scope->task mapping once and re-use it indefinitely. > OTOH for scopes that are stored in higher LCs, I have to check at > every yield whether they're currently in effect. And I want to > minimize the per-yield workload as much as possible.) > > 2) For classic decimal.localcontext context managers, the idea is > still that you save/restore the value, so that you can nest multiple > context managers without having to push/pop LCs all the time. But the > above API is not actually sufficient to implement a proper > save/restore, for a subtle reason: if you do > > ci.set(ci.get()) > > then you just (potentially) moved the value from a lower LC up to the top > LC. > I agree with Nathaniel that this is an issue with the current API. I don't think it's a good idea to have set and get methods. It would be much better to reflect the underlying ExecutionContext *stack* in the API by exposing a mutating *context manager* on the Context Key object instead of set. For example, my_context = sys.new_context_key('my_context') options = my_context.get() options.some_mutating_method() with my_context.mutate(options): # Do whatever you want with the mutated context # Now, the context is reverted. Similarly, instead of my_context.set('spam') you would do with my_context.mutate('spam'): # Do whatever you want with the mutated context # Now, the context is reverted. > > Here's an example of a case where this can produce user-visible effects: > > > https://github.com/njsmith/pep-550-notes/blob/master/dynamic-scope-on-top-of-pep-550-draft-2.py > > There are probably a bunch of options for fixing this. But basically > we need some API that makes it possible to temporarily set a value in > the top LC, and then restore that value to what it was before (either > the previous value, or 'unset' to unshadow a value in a lower LC). 
One > simple option would be to make the idiom be something like: > > @contextmanager > def local_value(new_value): > state = ci.get_local_state() > ci.set(new_value) > try: > yield > finally: > ci.set_local_state(state) > > where 'state' is something like a tuple (ci in EC[-1], > EC[-1].get(ci)). A downside with this is that it's a bit error-prone > (very easy for an unwary user to accidentally use get/set instead of > get_local_state/set_local_state). But I'm sure we can come up with > something. > > > Manual Context Management > > ------------------------- > > > > Execution Context is generally managed by the Python interpreter, > > but sometimes it is desirable for the user to take the control > > over it. A few examples when this is needed: > > > > * running a computation in ``concurrent.futures.ThreadPoolExecutor`` > > with the current EC; > > > > * reimplementing generators with iterators (more on that later); > > > > * managing contexts in asynchronous frameworks (implement proper > > EC support in ``asyncio.Task`` and ``asyncio.loop.call_soon``.) > > > > For these purposes we add a set of new APIs (they will be used in > > later sections of this specification): > > > > * ``sys.new_local_context()``: create an empty ``LocalContext`` > > object. > > > > * ``sys.new_execution_context()``: create an empty > > ``ExecutionContext`` object. > > > > * Both ``LocalContext`` and ``ExecutionContext`` objects are opaque > > to Python code, and there are no APIs to modify them. > > > > * ``sys.get_execution_context()`` function. The function returns a > > copy of the current EC: an ``ExecutionContext`` instance. > > If there are enough of these functions then it might make sense to > stick them in their own module instead of adding more stuff to sys. I > guess worrying about that can wait until the API details are more firm > though. > > > * If ``coro.cr_local_context`` is an empty ``LocalContext`` object > > that ``coro`` was created with, the interpreter will set > > ``coro.cr_local_context`` to ``None``. > > I like all the ideas in this section, but this specific point feels a > bit weird. Coroutine objects need a second hidden field somewhere to > keep track of whether the object they end up with is the same one they > were created with? > > If I set cr_local_context to something else, and then set it back to > the original value, does that trigger the magic await behavior or not? > What if I take the initial LocalContext off of one coroutine and > attach it to another, does that trigger the magic await behavior? > > Maybe it would make more sense to have two sentinel values: > UNINITIALIZED and INHERIT? > > > To enable correct Execution Context propagation into Tasks, the > > asynchronous framework needs to assist the interpreter: > > > > * When ``create_task`` is called, it should capture the current > > execution context with ``sys.get_execution_context()`` and save it > > on the Task object. > > I wonder if it would be useful to have an option to squash this > execution context down into a single LocalContext, since we know we'll > be using it for a while and once we've copied an ExecutionContext it > becomes impossible to tell the difference between one that has lots of > internal LocalContexts and one that doesn't. 
This could also be handy > for trio/curio's semantics where they initialize a new task's context > to be a shallow copy of the parent task: you could do > > new_task_coro.cr_local_context = get_current_context().squash() > > and then skip having to wrap every send() call in a run_in_context. > > > Generators > > ---------- > > > > Generators in Python, while similar to Coroutines, are used in a > > fundamentally different way. They are producers of data, and > > they use ``yield`` expression to suspend/resume their execution. > > > > A crucial difference between ``await coro`` and ``yield value`` is > > that the former expression guarantees that the ``coro`` will be > > executed fully, while the latter is producing ``value`` and > > suspending the generator until it gets iterated again. > > > > Generators, similarly to coroutines, have a ``gi_local_context`` > > attribute, which is set to an empty Local Context when created. > > > > Contrary to coroutines though, ``yield from o`` expression in > > generators (that are not generator-based coroutines) is semantically > > equivalent to ``for v in o: yield v``, therefore the interpreter does > > not attempt to control their ``gi_local_context``. > > Hmm. I assume you're simplifying for expository purposes, but 'yield > from' isn't the same as 'for v in o: yield v'. In fact PEP 380 says: > "Motivation: [...] a piece of code containing a yield cannot be > factored out and put into a separate function in the same way as other > code. [...] If yielding of values is the only concern, this can be > performed without much difficulty using a loop such as 'for v in g: > yield v'. However, if the subgenerator is to interact properly with > the caller in the case of calls to send(), throw() and close(), things > become considerably more difficult. As will be seen later, the > necessary code is very complicated, and it is tricky to handle all the > corner cases correctly." > > So it seems to me that the whole idea of 'yield from' is that it's > supposed to handle all the tricky bits needed to guarantee that if you > take some code out of a generator and refactor it into a subgenerator, > then everything works the same as before. This suggests that 'yield > from' should do the same magic as 'await', where by default the > subgenerator shares the same LocalContext as the parent generator. > (And as a bonus it makes things simpler if 'yield from' and 'await' > work the same.) > > > Asynchronous Generators > > ----------------------- > > > > Asynchronous Generators (AG) interact with the Execution Context > > similarly to regular generators. > > > > They have an ``ag_local_context`` attribute, which, similarly to > > regular generators, can be set to ``None`` to make them use the outer > > Local Context. This is used by the new > > ``contextlib.asynccontextmanager`` decorator. > > > > The EC support of ``await`` expression is implemented using the same > > approach as in coroutines, see the `Coroutine Object Modifications`_ > > section. > > You showed how to make an iterator that acts like a generator. Is it > also possible to make an async iterator that acts like an async > generator? It's not immediately obvious, because you need to make sure > that the local context gets restored each time you re-enter the > __anext__ generator. I think it's something like: > > class AIter: > def __init__(self): > self._local_context = ... 
> > # Note: intentionally not async > def __anext__(self): > coro = self._real_anext() > coro.cr_local_context = self._local_context > return coro > > async def _real_anext(self): > ... > > Does that look right? > > > ContextItem.get() Cache > > ----------------------- > > > > We can add three new fields to ``PyThreadState`` and > > ``PyInterpreterState`` structs: > > > > * ``uint64_t PyThreadState->unique_id``: a globally unique > > thread state identifier (we can add a counter to > > ``PyInterpreterState`` and increment it when a new thread state is > > created.) > > > > * ``uint64_t PyInterpreterState->context_item_deallocs``: every time > > a ``ContextItem`` is GCed, all Execution Contexts in all threads > > will lose track of it. ``context_item_deallocs`` will simply > > count all ``ContextItem`` deallocations. > > > > * ``uint64_t PyThreadState->execution_context_ver``: every time > > a new item is set, or an existing item is updated, or the stack > > of execution contexts is changed in the thread, we increment this > > counter. > > I think this can be refined further (and I don't understand > context_item_deallocs -- maybe it's a mistake?). AFAICT the things > that invalidate a ContextItem's cache are: > > 1) switching threadstates > 2) popping or pushing a non-empty LocalContext off the current > threadstate's ExecutionContext > 3) calling ContextItem.set() on *that* context item > > So I'd suggest tracking the thread state id, a counter of how many > non-empty LocalContexts have been pushed/popped on this thread state, > and a *per ContextItem* counter of how many times set() has been > called. > > > Backwards Compatibility > > ======================= > > > > This proposal preserves 100% backwards compatibility. > > While this is mostly true in the strict sense, in practice this PEP is > useless if existing thread-local users like decimal and numpy can't > migrate to it without breaking backcompat. So maybe this section > should discuss that? > > (For example, one constraint on the design is that we can't provide > only a pure push/pop API, even though that's what would be most > convenient for context managers like decimal.localcontext or > numpy.errstate, because we also need to provide some backcompat story > for legacy functions like decimal.setcontext and numpy.seterr.) > > -n > > -- > Nathaniel J. Smith -- https://vorpus.org > _______________________________________________ > Python-ideas mailing list > Python... at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve at pearwood.info Sat Aug 19 20:45:11 2017 From: steve at pearwood.info (Steven D'Aprano) Date: Sun, 20 Aug 2017 10:45:11 +1000 Subject: [Python-ideas] How do you think about these language extensions? In-Reply-To: References: <20170818120609.GS7395@ando.pearwood.info> <20170819035710.GU7395@ando.pearwood.info> <20170819104202.GV7395@ando.pearwood.info> Message-ID: <20170820004510.GW7395@ando.pearwood.info> On Sat, Aug 19, 2017 at 09:05:36AM -0700, David Mertz wrote: > You are right, of course. Mine does the order wrong. But an 'rcompose()' or > 'pipe()' or 'funchain()' is easy enough to put in the right order. Indeed. I said earlier that your solution (corrected for its error) was a pretty neat solution, and it was mostly down to a sense of aesthetics which we might prefer.
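And David is right that the helper itself is trivial. Something like
this is all it takes (a sketch only -- 'rcompose' is David's name for
it, the body below is my guess at what he means):

    def rcompose(*functions):
        # rcompose(f, g, h)(x) computes h(g(f(x))):
        # the functions are applied in the order they are given.
        def composed(value):
            for function in functions:
                value = function(value)
            return value
        return composed

    process = rcompose(str.strip, str.upper)
    process('  spam  ')  # returns 'SPAM'

So the case for dedicated syntax is not that the helper is hard to
write.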
I think a pipe or arrow is aesthetically nicer, and speaks much more closely to the intent. Analogy: We don't need operators + - * / etc, since it's trivial to get the same effect using the functions in the operator module. But operators look nicer and are closer to the way people think of arithmetic. I think that function composition is a neat and powerful tool for those who already think functionally, but higher order functions are harder to teach and even experts can mess them up. (The lesson here is that the pipe operator | is like a postfix version of the composition operator ∘.) -- Steve From barry at barrys-emacs.org Sun Aug 20 17:01:06 2017 From: barry at barrys-emacs.org (Barry) Date: Sun, 20 Aug 2017 22:01:06 +0100 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: I'm not clear why there is a new_context_key which seems not to be a key. It seems that the object is a container for a single value. Key.set( value ) does not feel right. Container.set( value ) is fine. Barry > On 16 Aug 2017, at 00:55, Yury Selivanov wrote: > > Hi, > > Here's the PEP 550 version 2. Thanks to a very active and insightful > discussion here on Python-ideas, we've discovered a number of > problems with the first version of the PEP. This version is a complete > rewrite (only Abstract, Rationale, and Goals sections were not updated). > > The updated PEP is live on python.org: > https://www.python.org/dev/peps/pep-0550/ > > There is no reference implementation at this point, but I'm confident > that this version of the spec will have the same extremely low > runtime overhead as the first version. Thanks to the new ContextItem > design, accessing values in the context is even faster now. > > Thank you! > > > PEP: 550 > Title: Execution Context > Version: $Revision$ > Last-Modified: $Date$ > Author: Yury Selivanov > Status: Draft > Type: Standards Track > Content-Type: text/x-rst > Created: 11-Aug-2017 > Python-Version: 3.7 > Post-History: 11-Aug-2017, 15-Aug-2017 > > > Abstract > ======== > > This PEP proposes a new mechanism to manage execution state--the > logical environment in which a function, a thread, a generator, > or a coroutine executes. > > A few examples of where having a reliable state storage is required: > > * Context managers like decimal contexts, ``numpy.errstate``, > and ``warnings.catch_warnings``; > > * Storing request-related data such as security tokens and request > data in web applications, implementing i18n; > > * Profiling, tracing, and logging in complex and large code bases. > > The usual solution for storing state is to use a Thread-local Storage > (TLS), implemented in the standard library as ``threading.local()``. > Unfortunately, TLS does not work for the purpose of state isolation > for generators or asynchronous code, because such code executes > concurrently in a single thread. > > > Rationale > ========= > > Traditionally, a Thread-local Storage (TLS) is used for storing the > state. However, the major flaw of using the TLS is that it works only > for multi-threaded code. It is not possible to reliably contain the > state within a generator or a coroutine.
For example, consider > the following generator:: > > def calculate(precision, ...): > with decimal.localcontext() as ctx: > # Set the precision for decimal calculations > # inside this block > ctx.prec = precision > > yield calculate_something() > yield calculate_something_else() > > Decimal context is using a TLS to store the state, and because TLS is > not aware of generators, the state can leak. If a user iterates over > the ``calculate()`` generator with different precisions one by one > using a ``zip()`` built-in, the above code will not work correctly. > For example:: > > g1 = calculate(precision=100) > g2 = calculate(precision=50) > > items = list(zip(g1, g2)) > > # items[0] will be a tuple of: > # first value from g1 calculated with 100 precision, > # first value from g2 calculated with 50 precision. > # > # items[1] will be a tuple of: > # second value from g1 calculated with 50 precision (!!!), > # second value from g2 calculated with 50 precision. > > An even scarier example would be using decimals to represent money > in an async/await application: decimal calculations can suddenly > lose precision in the middle of processing a request. Currently, > bugs like this are extremely hard to find and fix. > > Another common need for web applications is to have access to the > current request object, or security context, or, simply, the request > URL for logging or submitting performance tracing data:: > > async def handle_http_request(request): > context.current_http_request = request > > await ... > # Invoke your framework code, render templates, > # make DB queries, etc, and use the global > # 'current_http_request' in that code. > > # This isn't currently possible to do reliably > # in asyncio out of the box. > > These examples are just a few out of many, where a reliable way to > store context data is absolutely needed. > > The inability to use TLS for asynchronous code has lead to > proliferation of ad-hoc solutions, which are limited in scope and > do not support all required use cases. > > Current status quo is that any library, including the standard > library, that uses a TLS, will likely not work as expected in > asynchronous code or with generators (see [3]_ as an example issue.) > > Some languages that have coroutines or generators recommend to > manually pass a ``context`` object to every function, see [1]_ > describing the pattern for Go. This approach, however, has limited > use for Python, where we have a huge ecosystem that was built to work > with a TLS-like context. Moreover, passing the context explicitly > does not work at all for libraries like ``decimal`` or ``numpy``, > which use operator overloading. > > .NET runtime, which has support for async/await, has a generic > solution of this problem, called ``ExecutionContext`` (see [2]_). > On the surface, working with it is very similar to working with a TLS, > but the former explicitly supports asynchronous code. > > > Goals > ===== > > The goal of this PEP is to provide a more reliable alternative to > ``threading.local()``. It should be explicitly designed to work with > Python execution model, equally supporting threads, generators, and > coroutines. > > An acceptable solution for Python should meet the following > requirements: > > * Transparent support for code executing in threads, coroutines, > and generators with an easy to use API. > > * Negligible impact on the performance of the existing code or the > code that will be using the new mechanism. 
> > * Fast C API for packages like ``decimal`` and ``numpy``. > > Explicit is still better than implicit, hence the new APIs should only > be used when there is no acceptable way of passing the state > explicitly. > > > Specification > ============= > > Execution Context is a mechanism of storing and accessing data specific > to a logical thread of execution. We consider OS threads, > generators, and chains of coroutines (such as ``asyncio.Task``) > to be variants of a logical thread. > > In this specification, we will use the following terminology: > > * **Local Context**, or LC, is a key/value mapping that stores the > context of a logical thread. > > * **Execution Context**, or EC, is an OS-thread-specific dynamic > stack of Local Contexts. > > * **Context Item**, or CI, is an object used to set and get values > from the Execution Context. > > Please note that throughout the specification we use simple > pseudo-code to illustrate how the EC machinery works. The actual > algorithms and data structures that we will use to implement the PEP > are discussed in the `Implementation Strategy`_ section. > > > Context Item Object > ------------------- > > The ``sys.new_context_item(description)`` function creates a > new ``ContextItem`` object. The ``description`` parameter is a > ``str``, explaining the nature of the context key for introspection > and debugging purposes. > > ``ContextItem`` objects have the following methods and attributes: > > * ``.description``: read-only description; > > * ``.set(o)`` method: set the value to ``o`` for the context item > in the execution context. > > * ``.get()`` method: return the current EC value for the context item. > Context items are initialized with ``None`` when created, so > this method call never fails. > > The below is an example of how context items can be used:: > > my_context = sys.new_context_item(description='mylib.context') > my_context.set('spam') > > # Later, to access the value of my_context: > print(my_context.get()) > > > Thread State and Multi-threaded code > ------------------------------------ > > Execution Context is implemented on top of Thread-local Storage. > For every thread there is a separate stack of Local Contexts -- > mappings of ``ContextItem`` objects to their values in the LC. > New threads always start with an empty EC. > > For CPython:: > > PyThreadState: > execution_context: ExecutionContext([ > LocalContext({ci1: val1, ci2: val2, ...}), > ... > ]) > > The ``ContextItem.get()`` and ``.set()`` methods are defined as > follows (in pseudo-code):: > > class ContextItem: > > def get(self): > tstate = PyThreadState_Get() > > for local_context in reversed(tstate.execution_context): > if self in local_context: > return local_context[self] > > def set(self, value): > tstate = PyThreadState_Get() > > if not tstate.execution_context: > tstate.execution_context = [LocalContext()] > > tstate.execution_context[-1][self] = value > > With the semantics defined so far, the Execution Context can already > be used as an alternative to ``threading.local()``:: > > def print_foo(): > print(ci.get() or 'nothing') > > ci = sys.new_context_item(description='test') > ci.set('foo') > > # Will print "foo": > print_foo() > > # Will print "nothing": > threading.Thread(target=print_foo).start() > > > Manual Context Management > ------------------------- > > Execution Context is generally managed by the Python interpreter, > but sometimes it is desirable for the user to take the control > over it. 
A few examples when this is needed: > > * running a computation in ``concurrent.futures.ThreadPoolExecutor`` > with the current EC; > > * reimplementing generators with iterators (more on that later); > > * managing contexts in asynchronous frameworks (implement proper > EC support in ``asyncio.Task`` and ``asyncio.loop.call_soon``.) > > For these purposes we add a set of new APIs (they will be used in > later sections of this specification): > > * ``sys.new_local_context()``: create an empty ``LocalContext`` > object. > > * ``sys.new_execution_context()``: create an empty > ``ExecutionContext`` object. > > * Both ``LocalContext`` and ``ExecutionContext`` objects are opaque > to Python code, and there are no APIs to modify them. > > * ``sys.get_execution_context()`` function. The function returns a > copy of the current EC: an ``ExecutionContext`` instance. > > The runtime complexity of the actual implementation of this function > can be O(1), but for the purposes of this section it is equivalent > to:: > > def get_execution_context(): > tstate = PyThreadState_Get() > return copy(tstate.execution_context) > > * ``sys.run_with_execution_context(ec: ExecutionContext, func, *args, > **kwargs)`` runs ``func(*args, **kwargs)`` in the provided execution > context:: > > def run_with_execution_context(ec, func, *args, **kwargs): > tstate = PyThreadState_Get() > > old_ec = tstate.execution_context > > tstate.execution_context = ExecutionContext( > ec.local_contexts + [LocalContext()] > ) > > try: > return func(*args, **kwargs) > finally: > tstate.execution_context = old_ec > > Any changes to Local Context by ``func`` will be ignored. > This allows to reuse one ``ExecutionContext`` object for multiple > invocations of different functions, without them being able to > affect each other's environment:: > > ci = sys.new_context_item('example') > ci.set('spam') > > def func(): > print(ci.get()) > ci.set('ham') > > ec = sys.get_execution_context() > > sys.run_with_execution_context(ec, func) > sys.run_with_execution_context(ec, func) > > # Will print: > # spam > # spam > > * ``sys.run_with_local_context(lc: LocalContext, func, *args, > **kwargs)`` runs ``func(*args, **kwargs)`` in the current execution > context using the specified local context. > > Any changes that ``func`` does to the local context will be > persisted in ``lc``. This behaviour is different from the > ``run_with_execution_context()`` function, which always creates > a new throw-away local context. 
> > In pseudo-code:: > > def run_with_local_context(lc, func, *args, **kwargs): > tstate = PyThreadState_Get() > > old_ec = tstate.execution_context > > tstate.execution_context = ExecutionContext( > old_ec.local_contexts + [lc] > ) > > try: > return func(*args, **kwargs) > finally: > tstate.execution_context = old_ec > > Using the previous example:: > > ci = sys.new_context_item('example') > ci.set('spam') > > def func(): > print(ci.get()) > ci.set('ham') > > ec = sys.get_execution_context() > lc = sys.new_local_context() > > sys.run_with_local_context(lc, func) > sys.run_with_local_context(lc, func) > > # Will print: > # spam > # ham > > As an example, let's make a subclass of > ``concurrent.futures.ThreadPoolExecutor`` that preserves the execution > context for scheduled functions:: > > class Executor(concurrent.futures.ThreadPoolExecutor): > > def submit(self, fn, *args, **kwargs): > context = sys.get_execution_context() > > fn = functools.partial( > sys.run_with_execution_context, context, > fn, *args, **kwargs) > > return super().submit(fn) > > > EC Semantics for Coroutines > --------------------------- > > Python :pep:`492` coroutines are used to implement cooperative > multitasking. For a Python end-user they are similar to threads, > especially when it comes to sharing resources or modifying > the global state. > > An event loop is needed to schedule coroutines. Coroutines that > are explicitly scheduled by the user are usually called Tasks. > When a coroutine is scheduled, it can schedule other coroutines using > an ``await`` expression. In async/await world, awaiting a coroutine > is equivalent to a regular function call in synchronous code. Thus, > Tasks are similar to threads. > > By drawing a parallel between regular multithreaded code and > async/await, it becomes apparent that any modification of the > execution context within one Task should be visible to all coroutines > scheduled within it. Any execution context modifications, however, > must not be visible to other Tasks executing within the same OS > thread. > > > Coroutine Object Modifications > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > To achieve this, a small set of modifications to the coroutine object > is needed: > > * New ``cr_local_context`` attribute. This attribute is readable > and writable for Python code. > > * When a coroutine object is instantiated, its ``cr_local_context`` > is initialized with an empty Local Context. > > * Coroutine's ``.send()`` and ``.throw()`` methods are modified as > follows (in pseudo-C):: > > if coro.cr_local_context is not None: > tstate = PyThreadState_Get() > > tstate.execution_context.push(coro.cr_local_context) > > try: > # Perform the actual `Coroutine.send()` or > # `Coroutine.throw()` call. > return coro.send(...) > finally: > coro.cr_local_context = tstate.execution_context.pop() > else: > # Perform the actual `Coroutine.send()` or > # `Coroutine.throw()` call. > return coro.send(...) > > * When Python interpreter sees an ``await`` instruction, it inspects > the ``cr_local_context`` attribute of the coroutine that is about > to be awaited. For ``await coro``: > > * If ``coro.cr_local_context`` is an empty ``LocalContext`` object > that ``coro`` was created with, the interpreter will set > ``coro.cr_local_context`` to ``None``. > > * If ``coro.cr_local_context`` was modified by Python code, the > interpreter will leave it as is. 
> > This makes any changes to execution context made by nested coroutine > calls within a Task to be visible throughout the Task:: > > ci = sys.new_context_item('example') > > async def nested(): > ci.set('nested') > > async def main(): > ci.set('main') > print('before:', ci.get()) > await nested() > print('after:', ci.get()) > > # Will print: > # before: main > # after: nested > > Essentially, coroutines work with Execution Context items similarly > to threads, and ``await`` expression acts like a function call. > > This mechanism also works for ``yield from`` in generators decorated > with ``@types.coroutine`` or ``@asyncio.coroutine``, which are > called "generator-based coroutines" according to :pep:`492`, > and should be fully compatible with native async/await coroutines. > > > Tasks > ^^^^^ > > In asynchronous frameworks like asyncio, coroutines are run by > an event loop, and need to be explicitly scheduled (in asyncio > coroutines are run by ``asyncio.Task``.) > > With the currently defined semantics, the interpreter makes > coroutines linked by an ``await`` expression share the same > Local Context. > > The interpreter, however, is not aware of the Task concept, and > cannot help with ensuring that new Tasks started in coroutines > use the correct EC:: > > current_request = sys.new_context_item(description='request') > > async def child(): > print('current request:', repr(current_request.get())) > > async def handle_request(request): > current_request.set(request) > event_loop.create_task(child) > > run(top_coro()) > > # Will print: > # current request: None > > To enable correct Execution Context propagation into Tasks, the > asynchronous framework needs to assist the interpreter: > > * When ``create_task`` is called, it should capture the current > execution context with ``sys.get_execution_context()`` and save it > on the Task object. > > * When the Task object runs its coroutine object, it should execute > ``.send()`` and ``.throw()`` methods within the captured > execution context, using the ``sys.run_with_execution_context()`` > function. > > With help from the asynchronous framework, the above snippet will > run correctly, and the ``child()`` coroutine will be able to access > the current request object through the ``current_request`` > Context Item. > > > Event Loop Callbacks > ^^^^^^^^^^^^^^^^^^^^ > > Similarly to Tasks, functions like asyncio's ``loop.call_soon()`` > should capture the current execution context with > ``sys.get_execution_context()`` and execute callbacks > within it with ``sys.run_with_execution_context()``. > > This way the following code will work:: > > current_request = sys.new_context_item(description='request') > > def log(): > request = current_request.get() > print(request) > > async def request_handler(request): > current_request.set(request) > get_event_loop().call_soon(log) > > > Generators > ---------- > > Generators in Python, while similar to Coroutines, are used in a > fundamentally different way. They are producers of data, and > they use ``yield`` expression to suspend/resume their execution. > > A crucial difference between ``await coro`` and ``yield value`` is > that the former expression guarantees that the ``coro`` will be > executed fully, while the latter is producing ``value`` and > suspending the generator until it gets iterated again. > > Generators, similarly to coroutines, have a ``gi_local_context`` > attribute, which is set to an empty Local Context when created.
> > Contrary to coroutines though, ``yield from o`` expression in > generators (that are not generator-based coroutines) is semantically > equivalent to ``for v in o: yield v``, therefore the interpreter does > not attempt to control their ``gi_local_context``. > > > EC Semantics for Generators > ^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > Every generator object has its own Local Context that stores > only its own local modifications of the context. When a generator > is being iterated, its local context will be put in the EC stack > of the current thread. This means that the generator will be able > to access items from the surrounding context:: > > local = sys.new_context_item("local") > glob = sys.new_context_item("global") > > def generator(): > local.set('inside gen:') > while True: > print(local.get(), glob.get()) > yield > > g = generator() > > local.set('hello') > glob.set('spam') > next(g) > > local.set('world') > glob.set('ham') > next(g) > > # Will print: > # inside gen: spam > # inside gen: ham > > Any changes to the EC in nested generators are invisible to the outer > generator:: > > local = sys.new_context_item("local") > > def inner_gen(): > local.set('spam') > yield > > def outer_gen(): > local.set('ham') > yield from inner_gen() > print(local.get()) > > list(outer_gen()) > > # Will print: > # ham > > > Running generators without LC > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > Similarly to coroutines, generators with ``gi_local_context`` > set to ``None`` simply use the outer Local Context. > > The ``@contextlib.contextmanager`` decorator uses this mechanism to > allow its generator to affect the EC:: > > item = sys.new_context_item('test') > > @contextmanager > def context(x): > old = item.get() > item.set(x) > try: > yield > finally: > item.set(old) > > with context('spam'): > > with context('ham'): > print(1, item.get()) > > print(2, item.get()) > > # Will print: > # 1 ham > # 2 spam > > > Implementing Generators with Iterators > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > The Execution Context API allows to fully replicate EC behaviour > imposed on generators with a regular Python iterator class:: > > class Gen: > > def __init__(self): > self.local_context = sys.new_local_context() > > def __iter__(self): > return self > > def __next__(self): > return sys.run_with_local_context( > self.local_context, self._next_impl) > > def _next_impl(self): > # Actual __next__ implementation. > ... > > > Asynchronous Generators > ----------------------- > > Asynchronous Generators (AG) interact with the Execution Context > similarly to regular generators. > > They have an ``ag_local_context`` attribute, which, similarly to > regular generators, can be set to ``None`` to make them use the outer > Local Context. This is used by the new > ``contextlib.asynccontextmanager`` decorator. > > The EC support of ``await`` expression is implemented using the same > approach as in coroutines, see the `Coroutine Object Modifications`_ > section. > > > Greenlets > --------- > > Greenlet is an alternative implementation of cooperative > scheduling for Python. Although greenlet package is not part of > CPython, popular frameworks like gevent rely on it, and it is > important that greenlet can be modified to support execution > contexts. > > In a nutshell, greenlet design is very similar to design of > generators. The main difference is that for generators, the stack > is managed by the Python interpreter.
Greenlet works outside of the > Python interpreter, and manually saves some ``PyThreadState`` > fields and pushes/pops the C-stack. Thus the ``greenlet`` package > can be easily updated to use the new low-level `C API`_ to enable > full support of EC. > > > New APIs > ======== > > Python > ------ > > Python APIs were designed to completely hide the internal > implementation details, but at the same time provide enough control > over EC and LC to re-implement all of Python built-in objects > in pure Python. > > 1. ``sys.new_context_item(description='...')``: create a > ``ContextItem`` object used to access/set values in EC. > > 2. ``ContextItem``: > > * ``.description``: read-only attribute. > * ``.get()``: return the current value for the item. > * ``.set(o)``: set the current value in the EC for the item. > > 3. ``sys.get_execution_context()``: return the current > ``ExecutionContext``. > > 4. ``sys.new_execution_context()``: create a new empty > ``ExecutionContext``. > > 5. ``sys.new_local_context()``: create a new empty ``LocalContext``. > > 6. ``sys.run_with_execution_context(ec: ExecutionContext, > func, *args, **kwargs)``. > > 7. ``sys.run_with_local_context(lc: LocalContext, > func, *args, **kwargs)``. > > > C API > ----- > > 1. ``PyContextItem * PyContext_NewItem(char *desc)``: create a > ``PyContextItem`` object. > > 2. ``PyObject * PyContext_GetItem(PyContextItem *)``: get the > current value for the context item. > > 3. ``int PyContext_SetItem(PyContextItem *, PyObject *)``: set > the current value for the context item. > > 4. ``PyLocalContext * PyLocalContext_New()``: create a new empty > ``PyLocalContext``. > > 5. ``PyExecutionContext * PyExecutionContext_New()``: create a new empty > ``PyExecutionContext``. > > 6. ``PyExecutionContext * PyExecutionContext_Get()``: get the > EC for the active thread state. > > 7. ``int PyExecutionContext_Set(PyExecutionContext *)``: set the > passed EC object as the current for the active thread state. > > 8. ``int PyExecutionContext_SetWithLocalContext(PyExecutionContext *, > PyLocalContext *)``: allows to implement > ``sys.run_with_local_context`` Python API. > > > Implementation Strategy > ======================= > > LocalContext is a Weak Key Mapping > ---------------------------------- > > Using a weak key mapping for ``LocalContext`` implementation > enables the following properties with regards to garbage > collection: > > * ``ContextItem`` objects are strongly-referenced only from the > application code, not from any of the Execution Context > machinery or values they point to. This means that there > are no reference cycles that could extend their lifespan > longer than necessary, or prevent their garbage collection. > > * Values put in the Execution Context are guaranteed to be kept > alive while there is a ``ContextItem`` key referencing them in > the thread. > > * If a ``ContextItem`` is garbage collected, all of its values will > be removed from all contexts, allowing them to be GCed if needed. > > * If a thread has ended its execution, its thread state will be > cleaned up along with its ``ExecutionContext``, cleaning > up all values bound to all Context Items in the thread. > > > ContextItem.get() Cache > ----------------------- > > We can add three new fields to ``PyThreadState`` and > ``PyInterpreterState`` structs: > > * ``uint64_t PyThreadState->unique_id``: a globally unique > thread state identifier (we can add a counter to > ``PyInterpreterState`` and increment it when a new thread state is > created.)
> > * ``uint64_t PyInterpreterState->context_item_deallocs``: every time > a ``ContextItem`` is GCed, all Execution Contexts in all threads > will lose track of it. ``context_item_deallocs`` will simply > count all ``ContextItem`` deallocations. > > * ``uint64_t PyThreadState->execution_context_ver``: every time > a new item is set, or an existing item is updated, or the stack > of execution contexts is changed in the thread, we increment this > counter. > > The above three fields allow implementing a fast cache path in > ``ContextItem.get()``, in pseudo-code:: > > class ContextItem: > > def get(self): > tstate = PyThreadState_Get() > > if (self.last_tstate_id == tstate.unique_id and > self.last_ver == tstate.execution_context_ver and > self.last_deallocs == > tstate.interp.context_item_deallocs): > return self.last_value > > value = None > for mapping in reversed(tstate.execution_context): > if self in mapping: > value = mapping[self] > break > > self.last_value = value > self.last_tstate_id = tstate.unique_id > self.last_ver = tstate.execution_context_ver > self.last_deallocs = tstate.interp.context_item_deallocs > > return value > > This is similar to the trick that the decimal C implementation uses > for caching the current decimal context, and will have the same > performance characteristics, but available to all > Execution Context users. > > > Approach #1: Use a dict for LocalContext > ---------------------------------------- > > The straightforward way of implementing the proposed EC > mechanisms is to create a ``WeakKeyDict`` on top of Python > ``dict`` type. > > To implement the ``ExecutionContext`` type we can use Python > ``list`` (or a custom stack implementation with some > pre-allocation optimizations). > > This approach will have the following runtime complexity: > > * O(M) for ``ContextItem.get()``, where ``M`` is the number of > Local Contexts in the stack. > > It is important to note that ``ContextItem.get()`` will implement > a cache making the operation O(1) for packages like ``decimal`` > and ``numpy``. > > * O(1) for ``ContextItem.set()``. > > * O(N) for ``sys.get_execution_context()``, where ``N`` is the > total number of items in the current **execution** context. > > > Approach #2: Use HAMT for LocalContext > -------------------------------------- > > Languages like Clojure and Scala use Hash Array Mapped Tries (HAMT) > to implement high performance immutable collections [5]_, [6]_. > > Immutable mappings implemented with HAMT have O(log\ :sub:`32`\ N) > performance for both ``set()``, ``get()``, and ``merge()`` operations, > which is essentially O(1) for relatively small mappings > (read about HAMT performance in CPython in the > `Appendix: HAMT Performance`_ section.) > > In this approach we use the same design of the ``ExecutionContext`` > as in Approach #1, but we will use HAMT backed weak key Local Context > implementation. With that we will have the following runtime > complexity: > > * O(M * log\ :sub:`32`\ N) for ``ContextItem.get()``, > where ``M`` is the number of Local Contexts in the stack, > and ``N`` is the number of items in the EC. The operation will > essentially be O(M), because execution contexts are normally not > expected to have more than a few dozen items. > > (``ContextItem.get()`` will have the same caching mechanism as in > Approach #1.) > > * O(log\ :sub:`32`\ N) for ``ContextItem.set()`` where ``N`` is the > number of items in the current **local** context. This will > essentially be an O(1) operation most of the time.
> > * O(log\ :sub:`32`\ N) for ``sys.get_execution_context()``, where > ``N`` is the total number of items in the current **execution** > context. > > Essentially, using HAMT for Local Contexts instead of Python dicts, > allows to bring down the complexity of ``sys.get_execution_context()`` > from O(N) to O(log\ :sub:`32`\ N) because of the more efficient > merge algorithm. > > > Approach #3: Use HAMT and Immutable Linked List > ----------------------------------------------- > > We can make an alternative ``ExecutionContext`` design by using > a linked list. Each ``LocalContext`` in the ``ExecutionContext`` > object will be wrapped in a linked-list node. > > ``LocalContext`` objects will use an HAMT backed weak key > implementation described in the Approach #2. > > Every modification to the current ``LocalContext`` will produce a > new version of it, which will be wrapped in a **new linked list > node**. Essentially this means, that ``ExecutionContext`` is an > immutable forest of ``LocalContext`` objects, and can be safely > copied by reference in ``sys.get_execution_context()`` (eliminating > the expensive "merge" operation.) > > With this approach, ``sys.get_execution_context()`` will be an > **O(1) operation**. > > > Summary > ------- > > We believe that approach #3 enables an efficient and complete > Execution Context implementation, with excellent runtime performance. > > `ContextItem.get() Cache`_ enables fast retrieval of context items > for performance critical libraries like decimal and numpy. > > Fast ``sys.get_execution_context()`` enables efficient management > of execution contexts in asynchronous libraries like asyncio. > > > Design Considerations > ===================== > > Can we fix ``PyThreadState_GetDict()``? > --------------------------------------- > > ``PyThreadState_GetDict`` is a TLS, and some of its existing users > might depend on it being just a TLS. Changing its behaviour to follow > the Execution Context semantics would break backwards compatibility. > > > PEP 521 > ------- > > :pep:`521` proposes an alternative solution to the problem: > enhance Context Manager Protocol with two new methods: ``__suspend__`` > and ``__resume__``. To make it compatible with async/await, > the Asynchronous Context Manager Protocol will also need to be > extended with ``__asuspend__`` and ``__aresume__``. > > This allows to implement context managers like decimal context and > ``numpy.errstate`` for generators and coroutines. > > The following code:: > > class Context: > > def __enter__(self): > self.old_x = get_execution_context_item('x') > set_execution_context_item('x', 'something') > > def __exit__(self, *err): > set_execution_context_item('x', self.old_x) > > would become this:: > > local = threading.local() > > class Context: > > def __enter__(self): > self.old_x = getattr(local, 'x', None) > local.x = 'something' > > def __suspend__(self): > local.x = self.old_x > > def __resume__(self): > local.x = 'something' > > def __exit__(self, *err): > local.x = self.old_x > > Besides complicating the protocol, the implementation will likely > negatively impact performance of coroutines, generators, and any code > that uses context managers, and will notably complicate the > interpreter implementation. > > :pep:`521` also does not provide any mechanism to propagate state > in a local context, like storing a request object in an HTTP request > handler to have better logging. Nor does it solve the leaking state > problem for greenlet/gevent. 
> > > Can Execution Context be implemented outside of CPython? > -------------------------------------------------------- > > Because async/await code needs an event loop to run it, an EC-like > solution can be implemented in a limited way for coroutines. > > Generators, on the other hand, do not have an event loop or > trampoline, making it impossible to intercept their ``yield`` points > outside of the Python interpreter. > > > Backwards Compatibility > ======================= > > This proposal preserves 100% backwards compatibility. > > > Appendix: HAMT Performance > ========================== > > To assess if HAMT can be used for Execution Context, we implemented > it in CPython [7]_. > > .. figure:: pep-0550-hamt_vs_dict.png > :align: center > :width: 100% > > Figure 1. Benchmark code can be found here: [9]_. > > Figure 1 shows that HAMT indeed displays O(1) performance for all > benchmarked dictionary sizes. For dictionaries with less than 100 > items, HAMT is a bit slower than Python dict/shallow copy. > > .. figure:: pep-0550-lookup_hamt.png > :align: center > :width: 100% > > Figure 2. Benchmark code can be found here: [10]_. > > Figure 2 shows comparison of lookup costs between Python dict > and an HAMT immutable mapping. HAMT lookup time is 30-40% worse > than Python dict lookups on average, which is a very good result, > considering how well Python dicts are optimized. > > Note, that according to [8]_, HAMT design can be further improved. > > > Acknowledgments > =============== > > I thank Elvis Pranskevichus and Victor Petrovykh for countless > discussions around the topic and PEP proof reading and edits. > > Thanks to Nathaniel Smith for proposing the ``ContextItem`` design > [17]_ [18]_, for pushing the PEP towards a more complete design, and > coming up with the idea of having a stack of contexts in the thread > state. > > Thanks to Nick Coghlan for numerous suggestions and ideas on the > mailing list, and for coming up with a case that cause the complete > rewrite of the initial PEP version [19]_. > > > References > ========== > > .. [1] https://blog.golang.org/context > > .. [2] https://msdn.microsoft.com/en-us/library/system.threading.executioncontext.aspx > > .. [3] https://github.com/numpy/numpy/issues/9444 > > .. [4] http://bugs.python.org/issue31179 > > .. [5] https://en.wikipedia.org/wiki/Hash_array_mapped_trie > > .. [6] http://blog.higher-order.net/2010/08/16/assoc-and-clojures-persistenthashmap-part-ii.html > > .. [7] https://github.com/1st1/cpython/tree/hamt > > .. [8] https://michael.steindorfer.name/publications/oopsla15.pdf > > .. [9] https://gist.github.com/1st1/9004813d5576c96529527d44c5457dcd > > .. [10] https://gist.github.com/1st1/dbe27f2e14c30cce6f0b5fddfc8c437e > > .. [11] https://github.com/1st1/cpython/tree/pep550 > > .. [12] https://www.python.org/dev/peps/pep-0492/#async-await > > .. [13] https://github.com/MagicStack/uvloop/blob/master/examples/bench/echoserver.py > > .. [14] https://github.com/MagicStack/pgbench > > .. [15] https://github.com/python/performance > > .. [16] https://gist.github.com/1st1/6b7a614643f91ead3edf37c4451a6b4c > > .. [17] https://mail.python.org/pipermail/python-ideas/2017-August/046752.html > > .. [18] https://mail.python.org/pipermail/python-ideas/2017-August/046772.html > > .. [19] https://mail.python.org/pipermail/python-ideas/2017-August/046780.html > > > Copyright > ========= > > This document has been placed in the public domain. 
> _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > From mistersheik at gmail.com Sun Aug 20 21:32:19 2017 From: mistersheik at gmail.com (Neil Girdhar) Date: Sun, 20 Aug 2017 18:32:19 -0700 (PDT) Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase Message-ID: This question describes an example of the problem: https://stackoverflow.com/questions/8416208/in-python-is-there-a-good-idiom-for-using-context-managers-in-setup-teardown. You want to invoke a context manager in your setup/teardown, but the easiest way to do that is to override run, which seems ugly. Why not add two methods to unittest.TestCase whose default implementations are given below: class TestCase: @contextmanager def method_context(self): self.setUp() try: yield finally: self.tearDown() @contextmanager def class_context(self): self.setUpClass() try: yield finally: self.tearDownClass() Then, if for example someone wants to use a context manager in setUp, they can do so: class SomeTest(TestCase): @contextmanager def method_context(self): with np.errstate(all='raise'): with super().method_context(): yield Best, Neil -------------- next part -------------- An HTML attachment was scrubbed... URL: From ned at nedbatchelder.com Mon Aug 21 12:02:17 2017 From: ned at nedbatchelder.com (Ned Batchelder) Date: Mon, 21 Aug 2017 12:02:17 -0400 Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase In-Reply-To: References: Message-ID: On 8/20/17 9:32 PM, Neil Girdhar wrote: > This question describes an example of the > problem: https://stackoverflow.com/questions/8416208/in-python-is-there-a-good-idiom-for-using-context-managers-in-setup-teardown. > You want to invoke a context manager in your setup/teardown, but > the easiest way to do that is to override run, which seems ugly. > > Why not add two methods to unittest.TestCase whose default > implementations are given below: > > class TestCase: > > @contextmanager > def method_context(self): > self.setUp() > try: > yield > finally: > self.tearDown() > > @contextmanager > def class_context(self): > self.setUpClass() > try: > yield > finally: > self.tearDownClass() > > > Then, if for example someone wants to use a context manager in setUp, > they can do so: > > class SomeTest(TestCase): > > @contextmanager > def method_context(self): > with np.errstate(all='raise'): > with super().method_context(): > yield > > Best, > > Neil > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ I've achieved a similar effect with this: def setup_with_context_manager(testcase, cm): """Use a contextmanager to setUp a test case. If you have a context manager you like:: with ctxmgr(a, b, c) as v: # do something with v and you want to have that effect for a test case, call this function from your setUp, and it will start the context manager for your test, and end it when the test is done:: def setUp(self): self.v = setup_with_context_manager(self, ctxmgr(a, b, c)) def test_foo(self): # do something with self.v """ val = cm.__enter__() testcase.addCleanup(cm.__exit__, None, None, None) return val
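By the way, if a test needs several context managers at once, the same
trick generalizes with contextlib.ExitStack (a sketch along the same
lines -- this helper is not an existing unittest API):

    import contextlib

    def setup_with_context_managers(testcase, *cms):
        # Enter each context manager for the duration of the test.
        # The ExitStack unwinds them in reverse order at cleanup time.
        stack = contextlib.ExitStack()
        testcase.addCleanup(stack.close)
        return [stack.enter_context(cm) for cm in cms]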
I think the use is easier than yours, which needs too much super and @contextmanager boilerplate. --Ned. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jimjjewett at gmail.com Mon Aug 21 17:05:46 2017 From: jimjjewett at gmail.com (Jim J. Jewett) Date: Mon, 21 Aug 2017 17:05:46 -0400 Subject: [Python-ideas] PEP 550 dumbed down Message-ID: I know I'm not the only one who is confused by at least some of the alternative terminology choices. I suspect I'm not the only one who sometimes missed part of the argument because I was distracted figuring out what the objects were, and forgot to verify what was being done and why. I also suspect that it could be much simpler to follow if the API were designed in the abstract, with the implementation left for later. So is the following API missing anything important? (1) Get the current (writable) context. Currently proposed as a sys.* call, but I think injecting into __builtins__ or globals would work as well. (2) Get a value from the current context, by string key. Currently proposed as key.get, rather than env.__getitem__ (3) Write a value to the current context, by string key. Currently proposed as key.set, rather than env.__setitem__ (4) Create a new (writable) empty context. (5) Create a copy of the current context, so that changes can be isolated. The copy will not be able to change anything in the current context, though it can shadow keys. (6) Choose which context to use when calling another function/generator/iterator/etc. At this point, it looks an awful lot like a subset of ChainMap, except that: (A) The current mapping is available through a series of sys.* calls. (why not a builtin? Or at least a global, injected when a different environment is needed?) (B) Concurrency APIs are supposed to ensure that each process/thread/Task/worker is using its own private context, unless the call explicitly requests a shared or otherwise different context. (C) The current API requires users to initialize every key before it can be added to a context. This is presumably to support limits of the proposed implementation. If the semantics are right, and collections.ChainMap is rejected only for efficiency, please say so in the PEP. If the semantics are wrong, please explain how they differ. Sample code: olduser = env["username"] env["reason"] = "Spanish Inquisition" with env.copy(): env["username"] = "secret admin" foo() print("debugging", env["foodebug"]) bar() with env.empty(): assert "username" not in env assert env["username"] is olduser
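To make the comparison concrete, here is roughly the model I have in
mind, written on top of collections.ChainMap (a sketch only, to pin
down the semantics -- the helper names are invented, and efficiency is
deliberately ignored):

    from collections import ChainMap
    from contextlib import contextmanager

    env = ChainMap({})  # current context; writes go to the front map

    @contextmanager
    def copied_context():
        # Isolate changes: reads fall through to the old maps, while
        # writes land in a fresh front map and vanish on exit.
        global env
        old, env = env, env.new_child()
        try:
            yield env
        finally:
            env = old

    @contextmanager
    def empty_context():
        # Run with a completely fresh, empty context.
        global env
        old, env = env, ChainMap({})
        try:
            yield env
        finally:
            env = old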
-jJ From chris.barker at noaa.gov Mon Aug 21 18:44:52 2017 From: chris.barker at noaa.gov (Chris Barker) Date: Mon, 21 Aug 2017 15:44:52 -0700 Subject: [Python-ideas] How do you think about these language extensions?(Thautwarm) In-Reply-To: References: Message-ID: On Sat, Aug 19, 2017 at 3:34 AM, ?? ? wrote: > Could you please think this way to define a variable: > > >> var = expr() -> g1(_) if f(_) else g2(_) > > which equals > > >> test = f(expr()) > >> var = g1(test) if f(test) else g2(test) > OK, I do see this as a nice way to avoid as many "temp" variables, though in this example, I am confused: in the above version, it seems to me that the equivalent wordy version is: temp = expr() var = g1(temp) if f(temp) else g(temp) rather than the f(expr()) -- i.e. you seem to have called f() on expr an extra time? Maybe just a typo. or, of course: var = g1(expr()) if f(expr()) else g(expr()) which I can see would be bad if expr() is expensive (or even worse, has side effects) so I'm coming around to this, though you could currently write that as: _ = expr(); g1(_) if f(_) else g(_) not that different! Also, there is something to be said for giving a name to expr() -- it may make the code more readable. > In another words,I can clearly state what I mean to say in order of my > thinking. > well, in the above case, particularly if you use a meaningful name, rather than "temp" or "test", then you are still writing it in the order of meaning. (though elsewhere in this thread there are better examples of how the current nested function call syntax does reverse the logical order of operations) > For example, > > >> lambda x: f(g(x)) -> map(_, range(100)) > > The codes above means that I'm stressing on what(an action) I'm going to > do on an object "range(100)". > This is still putting the range(100) at the end of the expression, rather than making it clear that you are starting with it. and putting that much logic in a lambda can be confusing -- in fact, I'm still not sure what that does! (I guess I am still not sure if the order of operations is the lambda expression (f(g(x))) or the whole thing? if not the whole thing, then: is it the same as ?: (f(g(x)) for x in range(100)) I'm also seeing a nested function there -- f(g(x)) which is what I thought you were trying to avoid -- maybe: lambda x: (g(x) -> f(_)) -> map(_, range(100)) ??? In general, much of this seems to be trying to make map cleaner or more clear -- but python has comprehensions, which so far work better, and are more compact and clear for the examples you have provided. Granted, deeply nested comprehensions can be pretty ugly -- maybe this will be clearer for those??
So still having a hard time comeing up with an example that's notable better... >> someone -> dosomething( _, options=options) \ > -> is_meeting_some_conditions( _ ) \ > -> result1() if _ else result2() where: > options = ... > result1 = lambda: ... > result2 = lambda: ... > def dosomething(obj, options) -> Any: > ... > > def is_meeting_some_conditions( event : Any ) -> bool : > ... > again with the lambdas -- this is all making me think that this is about making Python a better functional language, which I'm not sure is a goal of Python... but anyway, the real extra there is the where: clause But that seems to be doing the opposite -- putting the definitions of what you are actually doing AFTER the logic> I'm going to chain all this logic together and by the way, this is what that logic is... If we really wanted to have a kind of context like that, maybe something more like a context manager on the fly: with: options = ... result1 = lambda: ... result2 = lambda: ... def dosomething(obj, options) -> Any: ... def is_meeting_some_conditions( event : Any ) -> bool : ... do: (result1() if is_meeting_some_conditions( dosomething( someone, options=options)) else result2() > Also, we need to remember that functions can take *args, **kwargs, etc, > > and can return a tuple of just about anything -- not sure how well that > > maps to the "pipe" model. > > I think that using "pipe" model cannot be the right choice. > > We don't need to worry about this problem if we use the grammar I've > implemented yet :) > > >> (lambda x: (x%5, x) ) -> max( range(99), key = _) > >> 94 > > >> def max_from_seq(*args): return max(args) > >> [1,2,3] -> max_from_seq(*_) > >> 3 > this gets uglier if we have both *args and **kwargs..... Which maybe is OK -- don't use it with complex structures like that. For example, sometimes we just need to know that surface area of a > cylinder is > > 2*S_top + S_side > > If someone see the codes, he may not need to know how S_top and S_side > are evaluated,getting > a knowledge of what it means to is enough. > And if you want to get more about how to evaluate S_side and S_top, just > see > the next "where syntax" and find the answers. > how is that clearer than: S_topo = something S_side = something else surface_area = 2*S_top + S_side ??? (Or, of course, defining a function) Sure, we see the: some expression..."where" some definitions structure a lot in technical papers, but frankly: I'd probably rather see the definitions first and/or the definitions are often only there to support you if you don't already know the nomenclature -- when you go back to read the paper again, you may not need the where. Coding is different, I'd rather see stuff defined BEFORE it is used. >> Here is an example to use flowpython, which gives the permutations of a > sequence. > >> > >> from copy import deepcopy > >> permutations = .seq -> seq_seq where: > >> condic+[] seq: > >> case (a, ) => seq_seq = [a,] > >> case (a, b) => seq_seq = [[a,b],[b,a]] > >> case (a,*b) => > >> seq_seq = permutations(b) -> map(.x -> insertAll(x, a), > _) -> sum(_, []) where: > >> insertAll = . x, a -> ret where: > >> ret = [ deepcopy(x) -> _.insert(i, a) or _ for > i in (len(x) -> range(_+1)) ] > > > I find that almost unreadable. > me too. > Too many new features all at once, it's > > like trying to read a completely unfamiliar language. > exactly -- this seems to be an effort to make Python a different language! This algorithm can be fixed a little because the second case is redundant. 
> And here is the regular Python codes transformed > from the codes above. > looks like we lost indenting, so I'm going to try to fix that: from copy import deepcopy def permutations(seq): try: # the first case (a, ) = seq return [a ,] except: try: # the third case (the second case is redundant) def insertAll(x, a): # insertAll([1,2,3], 0) -> [[0, 1, 2, 3], [1, 0, 2, 3], [1, 2, 0, 3], [1, 2, 3, 0]] ret = [] for i in range( len(x) + 1 ): tmp = deepcopy(x) tmp.insert(i, a) ret.append(tmp) return ret (a, *b) = seq tmp = permutations(b) tmp = map(lambda x : insertAll(x, a) , tmp) return sum(tmp, []) # sum([[1,2,3], [-1,-2,-3]], []) -> [1,2,3,-1,-2,-3] except: # no otherwise! pass Have I got that right? but anyway, there has GOT to be a more pythonic way to write that! And I say that because this feels to me like trying to write functional code in Python in an unnatural-for-python way, then saying we need to add features to python to make that natural. SoL I think the challenge is: find some nice compeling examples write them in a nice pythonic way show us that that these new features would allow a cleaner, more readable solution. Steven did have a nice example of that: result = (myfile.readlines() -> map(str.strip) -> filter( lambda s: not s.startwith('#') ) -> sorted -> collapse # collapse runs of identical lines -> extract_dates -> map(date_to_seconds) -> min ) Though IIUC, the proposal would make that: result = (myfile.readlines() -> map(str.strip, _) -> filter( lambda s: not s.startwith('#'), _ ) -> sorted( _ ) -> collapse( _ ) # collapse runs of identical lines -> extract_dates( _ ) -> map(date_to_seconds, _) -> min(_) ) The current Python for that might be: result = min((date_to_seconds(d) for d in extract_dates( collapse( sorted([s for s in (s.strip() for line in myfile.readlines) if not s.startswith] ))))) Which really does make the point that nesting comprehension gets ugly fast! So "don't do that": lines = collapse(sorted((l.strip().split("#")[0] for l in myfile.readlines()))) dates = min((date_to_seconds(extract_date(l)) for l in lines)) or any number of other ways -- clearer, less clear?? -CHB -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From yselivanov.ml at gmail.com Mon Aug 21 19:56:57 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Mon, 21 Aug 2017 19:56:57 -0400 Subject: [Python-ideas] PEP 550 dumbed down In-Reply-To: References: Message-ID: Hi Jim, In short, yes, we can "dumb down" PEP 550 to a chain of maps. PEP 550 does the following on top of that dumbed down version: 0. Adds execution_context "chain" root to PyThreadState. 1. Extends (async-)generator objects to support this chaining -- each generator has its own "env" to accumulate its changes. 2. ContextKey is an object that we use to work with EC. Compared to using strings, using an object allows us to implement caching (important for numpy and decimal-like libs) and avoids name clashes. 3. Yes, efficiency is important. If you start an asyncio.Task, or schedule an asyncio callback, or want to run some code in a separate OS thread, you need to capture the current EC -- make a shallow copy of all LCs in it. 
That's expensive, and the PEP solves this problem by using special
data structures (a), and providing just enough APIs to work with the EC
so that those data structures are not exposed to the end user (b).

4. Provides common APIs that will be used by asyncio, decimal, numpy, etc.

> (A) The current mapping is available through a series of sys.* calls.
> (why not a builtin? Or at least a global, injected when a different
> environment is needed?)

This was never proposed :) I decided to put the new APIs in the sys
module, as we are usually conservative about adding new globals, and the
feature is low-level (like working with frames).

> If the semantics are right, and collections.ChainMap is rejected only
> for efficiency, please say so in the PEP.

`collections.ChainMap` on its own is not a solution, it's one of the
possible implementations. Efficiency is indeed the reason why using
ChainMap is not an option (see (3) above).

This whole "capturing of execution context" topic is not covered well
enough in the PEP, and is something that we'll fix in the next version
(soon).

Yury

From greg at krypto.org Mon Aug 21 20:38:32 2017
From: greg at krypto.org (Gregory P. Smith)
Date: Tue, 22 Aug 2017 00:38:32 +0000
Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase
In-Reply-To: References: Message-ID:

Neil, you might also bring this up on the
http://lists.idyll.org/listinfo/testing-in-python list, as I suspect
people there have opinions on this topic.

-gps

On Mon, Aug 21, 2017 at 9:07 AM Ned Batchelder wrote:

> On 8/20/17 9:32 PM, Neil Girdhar wrote:
>
> This question describes an example of the problem:
> https://stackoverflow.com/questions/8416208/in-python-is-there-a-good-idiom-for-using-context-managers-in-setup-teardown.
> You want to invoke a context manager in your setup/tearing-down, but the
> easiest way to do that is to override run, which seems ugly.
>
> Why not add two methods to unittest.TestCase whose default implementations
> are given below:
>
>     class TestCase:
>
>         @contextmanager
>         def method_context(self):
>             self.setUp()
>             try:
>                 yield
>             finally:
>                 self.tearDown()
>
>         @contextmanager
>         def class_context(self):
>             self.setUpClass()
>             try:
>                 yield
>             finally:
>                 self.tearDownClass()
>
> Then, if for example someone wants to use a context manager in setUp, they
> can do so:
>
>     class SomeTest(TestCase):
>
>         @contextmanager
>         def method_context(self):
>             with np.errstate(all='raise'):
>                 with super().method_context():
>                     yield
>
> Best,
>
> Neil
>
> I've achieved a similar effect with this:
>
>     def setup_with_context_manager(testcase, cm):
>         """Use a contextmanager to setUp a test case.
>
>         If you have a context manager you like::
>
>             with ctxmgr(a, b, c) as v:
>                 # do something with v
>
>         and you want to have that effect for a test case, call this
>         function from your setUp, and it will start the context manager
>         for your test, and end it when the test is done::
>
>             def setUp(self):
>                 self.v = setup_with_context_manager(self, ctxmgr(a, b, c))
>
>             def test_foo(self):
>                 # do something with self.v
>
>         """
>         val = cm.__enter__()
>         testcase.addCleanup(cm.__exit__, None, None, None)
>         return val
>
> I think the use is easier than yours, which needs too much super and
> @contextmanager boilerplate.
>
> --Ned.
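For concreteness, a minimal self-contained run of that helper (the
TemporaryDirectory here is just a stand-in resource):

    import os
    import tempfile
    import unittest

    def setup_with_context_manager(testcase, cm):
        # enter the context manager now, schedule its exit as a cleanup
        val = cm.__enter__()
        testcase.addCleanup(cm.__exit__, None, None, None)
        return val

    class ExampleTest(unittest.TestCase):
        def setUp(self):
            self.tmpdir = setup_with_context_manager(
                self, tempfile.TemporaryDirectory())

        def test_tmpdir_exists(self):
            self.assertTrue(os.path.isdir(self.tmpdir))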
> > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From wes.turner at gmail.com Mon Aug 21 23:53:17 2017 From: wes.turner at gmail.com (Wes Turner) Date: Mon, 21 Aug 2017 22:53:17 -0500 Subject: [Python-ideas] How do you think about these language extensions?(Thautwarm) In-Reply-To: References: Message-ID: https://github.com/kachayev/fn.py/blob/master/README.rst#scala-style-lambdas-definition On Monday, August 21, 2017, Chris Barker wrote: > > > On Sat, Aug 19, 2017 at 3:34 AM, ?? ? > wrote: > >> Could you please think this way to define a variable: >> >> >> var = expr() -> g1(_) if f(_) else g2(_) >> >> which equals >> >> >> test = f(expr()) >> >> var = g1(test) if f(test) else g2(test) >> > > OK, I do see this as a nice way to avoid as many "temp" variables, though > in this example, I am confused: > > in the above version, It seems to me that the equivelent wordy version is: > > temp = expr() > var = g1(temp) if f(temp) else g(temp) > > rather than the f(expr()) -- i.e. you seem to have called f() on expr and > extra time? Maybe just a typo. > > or, of course: > > var = g1(expr()) if f(expr()) else g(expr()) > > which I can see would be bad if expr() is expensive (or even worse, has > side effects) > > so I'm coming around this this, though you could currently write that as: > > _ = expr(); g1(_) if f(_) else g(_) > > not that different! > > Also, there is something to be said for giving a name to expr() -- it may > make the code more readable. > > > In another words,I can clearly state what I mean to say in order of my >> thinking. >> > > well, in the above case, particularly if you use a meaningful name, rather > than "temp" or "test", then you are still writing in in the order of > meaning. > > (though elsewhere in this thread there are better examples of how the > current nested function call syntax does reverse the logical order of > operations) > > >> For example, >> >> >> lambda x: f(g(x)) -> map(_, range(100)) >> >> The codes above means that I'm stressing on what(an action) I'm going to >> do on an object "range(100)". >> > > This is still putting the range(100) at the end of the expression, rather > than making it clear that you are starting with it. > > and putting that much logic in a lambda can be confusing -- in fact, I'm > still not sure what that does! (I guess I am still not sure of the order of > operations is the lambda expression (f(g(x))) or the whole thing? if not > the whole thing, then: > > is it the same as ?: > > (f(g(x)) for x in range(100)) > > I'm also seeing a nested function there -- f(g(x)) which is what I > thought you were trying to avoid -- maybe: > > lambda x: (g(x) -> f(_)) -> map(_, range(100)) > > ??? > > In general, much of this seems to be trying to make map cleaner or more > clear -- but python has comprehensions, which so far work better, and are > more compact and clear for the examples you have provided. > > granted, deeply nested comprehensions can be pretty ugly -- maybe this > will be clearer for those?? 
> > > However, sometimes the actions are not important, so if we want to stress >> on what we're going to do something on, we write this codes: >> >> >> range(100) -> map( lambda x:f(g(x)), _ ) >> > > OK, so THAT makes more sense to me -- start with the "source data", then > go to the action on it. > > but again, is that really clearer than the comprehension (generator > expression - why don't we call that a generator comprehension?): > > (f(g(x)) for x in range(100)) > > maybe this would be better: > > range(100) -> (f(g(x)) for x in _) > > it does put the source data up front -- and could be nicer for nested > comprehensions. > > Hmm, maybe this is an example of the kind of thing I've needed to do is > illustrative: > > > [s.upper() for s in > (s.replace('"','') for s in > (s.strip() for s in > line.split()))] > > would be better as: > > line.split() -> (s.strip() for s in _) -> (s.replace('"','') for s in _) > -> [s.upper() for s in _] > > though, actually, really best as: > > [s.strip().replace('"','').upper() for s in line.split()] > > (which only works for methods, not general functions) > > but for functions: > > [fun3(fun2(fun1(x))) for x in an_iterable] > > > so, backwards logic, but that's it for the benefit. > > So still having a hard time comeing up with an example that's notable > better... > > >> someone -> dosomething( _, options=options) \ >> -> is_meeting_some_conditions( _ ) \ >> -> result1() if _ else result2() where: >> options = ... >> result1 = lambda: ... >> result2 = lambda: ... >> def dosomething(obj, options) -> Any: >> ... >> >> def is_meeting_some_conditions( event : Any ) -> bool : >> ... >> > > again with the lambdas -- this is all making me think that this is about > making Python a better functional language, which I'm not sure is a goal of > Python... > > but anyway, the real extra there is the where: clause > > But that seems to be doing the opposite -- putting the definitions of what > you are actually doing AFTER the logic> > > I'm going to chain all this logic together > and by the way, this is what that logic is... > > If we really wanted to have a kind of context like that, maybe something > more like a context manager on the fly: > > with: > options = ... > result1 = lambda: ... > result2 = lambda: ... > def dosomething(obj, options) -> Any: > ... > > def is_meeting_some_conditions( event : Any ) -> bool : > ... > do: > (result1() if is_meeting_some_conditions( > dosomething( someone, options=options)) > else result2() > > > Also, we need to remember that functions can take *args, **kwargs, etc, >> > and can return a tuple of just about anything -- not sure how well that >> > maps to the "pipe" model. >> >> I think that using "pipe" model cannot be the right choice. >> >> We don't need to worry about this problem if we use the grammar I've >> implemented yet :) >> >> >> (lambda x: (x%5, x) ) -> max( range(99), key = _) >> >> 94 >> >> >> def max_from_seq(*args): return max(args) >> >> [1,2,3] -> max_from_seq(*_) >> >> 3 >> > > this gets uglier if we have both *args and **kwargs..... > > Which maybe is OK -- don't use it with complex structures like that. > > For example, sometimes we just need to know that surface area of a >> cylinder is >> >> 2*S_top + S_side >> >> If someone see the codes, he may not need to know how S_top and S_side >> are evaluated,getting >> a knowledge of what it means to is enough. >> And if you want to get more about how to evaluate S_side and S_top, just >> see >> the next "where syntax" and find the answers. 
>> > > how is that clearer than: > > S_topo = something > S_side = something else > surface_area = 2*S_top + S_side > > ??? > (Or, of course, defining a function) > > Sure, we see the: some expression..."where" some definitions structure a > lot in technical papers, but frankly: > > I'd probably rather see the definitions first > > and/or > > the definitions are often only there to support you if you don't already > know the nomenclature -- when you go back to read the paper again, you may > not need the where. Coding is different, I'd rather see stuff defined > BEFORE it is used. > > > >> Here is an example to use flowpython, which gives the permutations of a >> sequence. >> >> >> >> from copy import deepcopy >> >> permutations = .seq -> seq_seq where: >> >> condic+[] seq: >> >> case (a, ) => seq_seq = [a,] >> >> case (a, b) => seq_seq = [[a,b],[b,a]] >> >> case (a,*b) => >> >> seq_seq = permutations(b) -> map(.x -> insertAll(x, >> a), _) -> sum(_, []) where: >> >> insertAll = . x, a -> ret where: >> >> ret = [ deepcopy(x) -> _.insert(i, a) or _ >> for i in (len(x) -> range(_+1)) ] >> >> > I find that almost unreadable. >> > > me too. > > >> Too many new features all at once, it's >> > like trying to read a completely unfamiliar language. >> > > exactly -- this seems to be an effort to make Python a different language! > > This algorithm can be fixed a little because the second case is redundant. >> And here is the regular Python codes transformed >> from the codes above. >> > > looks like we lost indenting, so I'm going to try to fix that: > > from copy import deepcopy > > def permutations(seq): > try: > # the first case > (a, ) = seq > return [a ,] > except: > try: > # the third case (the second case is redundant) > def insertAll(x, a): > # insertAll([1,2,3], 0) -> [[0, 1, 2, 3], [1, 0, 2, 3], > [1, 2, 0, 3], [1, 2, 3, 0]] > ret = [] > for i in range( len(x) + 1 ): > tmp = deepcopy(x) > tmp.insert(i, a) > ret.append(tmp) > return ret > > (a, *b) = seq > tmp = permutations(b) > tmp = map(lambda x : insertAll(x, a) , tmp) > > return sum(tmp, []) # sum([[1,2,3], [-1,-2,-3]], []) -> > [1,2,3,-1,-2,-3] > except: > # no otherwise! > pass > > Have I got that right? but anyway, there has GOT to be a more pythonic way > to write that! And I say that because this feels to me like trying to write > functional code in Python in an unnatural-for-python way, then saying we > need to add features to python to make that natural. > > SoL I think the challenge is: > > find some nice compeling examples > write them in a nice pythonic way > show us that that these new features would allow a cleaner, more readable > solution. > > Steven did have a nice example of that: > > result = (myfile.readlines() > -> map(str.strip) > -> filter( lambda s: not s.startwith('#') ) > -> sorted > -> collapse # collapse runs of identical lines > -> extract_dates > -> map(date_to_seconds) > -> min > ) > > Though IIUC, the proposal would make that: > > result = (myfile.readlines() > -> map(str.strip, _) > -> filter( lambda s: not s.startwith('#'), _ ) > -> sorted( _ ) > -> collapse( _ ) # collapse runs of identical lines > -> extract_dates( _ ) > -> map(date_to_seconds, _) > -> min(_) > ) > > > The current Python for that might be: > > result = min((date_to_seconds(d) for d in > extract_dates( > collapse( > sorted([s for s in > (s.strip() for line in myfile.readlines) > if not s.startswith] > ))))) > > Which really does make the point that nesting comprehension gets ugly fast! 
> > So "don't do that": > > lines = collapse(sorted((l.strip().split("#")[0] for l in > myfile.readlines()))) > dates = min((date_to_seconds(extract_date(l)) for l in lines)) > > or any number of other ways -- clearer, less clear?? > > -CHB > > -- > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Tue Aug 22 01:34:52 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 22 Aug 2017 15:34:52 +1000 Subject: [Python-ideas] Fwd: Please consider adding context manager versions of setUp/tearDown to unittest.TestCase In-Reply-To: References: Message-ID: Folks, this has come up before, but: please don't post through Google Groups, as it breaks everyone else's ability to easily reply to the entire mailing list. ---------- Forwarded message ---------- From: Nick Coghlan Date: 22 August 2017 at 15:32 Subject: Re: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase To: Neil Girdhar Cc: python-ideas On 21 August 2017 at 11:32, Neil Girdhar wrote: > This question describes an example of the problem: > https://stackoverflow.com/questions/8416208/in-python-is-there-a-good-idiom-for-using-context-managers-in-setup-teardown. > You want to invoke a context manager in your setup/tearing-down, but the > easiest way to do that is to override run, which seems ugly. Using context managers when you can't use a with statement is one of the main use cases for contextlib.ExitStack(): def setUp(self): self._resource_stack = stack = contextlib.ExitStack() self._resource = stack.enter_context(MyResource()) def tearDown(self): self._resource_stack.close() I posted that as an additional answer to the question: https://stackoverflow.com/questions/8416208/in-python-is-there-a-good-idiom-for-using-context-managers-in-setup-teardown/45809502#45809502 Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From k7hoven at gmail.com Tue Aug 22 02:47:07 2017 From: k7hoven at gmail.com (Koos Zevenhoven) Date: Tue, 22 Aug 2017 09:47:07 +0300 Subject: [Python-ideas] PEP 550 dumbed down In-Reply-To: References: Message-ID: On Tue, Aug 22, 2017 at 2:56 AM, Yury Selivanov wrote: > Hi Jim, > > In short, yes, we can "dumb down" PEP 550 to a chain of maps. > > I think it's also good to think about the actual problem(s) that are being solved, without going too deeply into the implementation. It might be useful to look at all the motivating use cases and to make sure this is really the best way to provide a solution to them. > PEP 550 does the following on top of that dumbed down version: > > ?[...]? > 2. ContextKey is an object that we use to work with EC. Compared to > using strings, using an object allows us to implement caching > (important for numpy and decimal-like libs) and avoids name clashes. > > ?How exactly is caching dependent on the proposed ContextKey thing? To avoid a dict-lookup or similar to get the cached value? But now we need to look up the key object from somewhere? [...] > > 4. Provides common APIs that will be used by asyncio, decimal, numpy, etc. > > Which APIs? The C API you mean? Something that is not in Jim's list? Something that is (not) in the PEP? 
People need to get a clear picture of what is being proposed.

-- Koos

--
+ Koos Zevenhoven + http://twitter.com/k7hoven +
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From ncoghlan at gmail.com Tue Aug 22 08:42:02 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 22 Aug 2017 22:42:02 +1000
Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase
In-Reply-To: References: Message-ID:

On 22 August 2017 at 15:34, Nick Coghlan wrote:
> On 21 August 2017 at 11:32, Neil Girdhar wrote:
>> This question describes an example of the problem:
>> https://stackoverflow.com/questions/8416208/in-python-is-there-a-good-idiom-for-using-context-managers-in-setup-teardown.
>> You want to invoke a context manager in your setup/tearing-down, but the
>> easiest way to do that is to override run, which seems ugly.
>
> Using context managers when you can't use a with statement is one of
> the main use cases for contextlib.ExitStack():
>
>     def setUp(self):
>         self._resource_stack = stack = contextlib.ExitStack()
>         self._resource = stack.enter_context(MyResource())
>
>     def tearDown(self):
>         self._resource_stack.close()
>
> I posted that as an additional answer to the question:
> https://stackoverflow.com/questions/8416208/in-python-is-there-a-good-idiom-for-using-context-managers-in-setup-teardown/45809502#45809502

Sjoerd pointed out off-list that this doesn't cover the case where
you're acquiring multiple resources and one of the later acquisitions
fails, so I added the ExitStack idiom that covers that case (using
stack.pop_all() as the last operation in a with statement):

    def setUp(self):
        with contextlib.ExitStack() as stack:
            self._resource1 = stack.enter_context(GetResource())
            self._resource2 = stack.enter_context(GetOtherResource())
            # Failures before here -> immediate cleanup
            self.addCleanup(stack.pop_all().close)
            # Now cleanup won't happen until the cleanup functions run

I also remember that using addCleanup lets you avoid defining tearDown
entirely.

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From prometheus235 at gmail.com Tue Aug 22 14:21:54 2017
From: prometheus235 at gmail.com (Nick Timkovich)
Date: Tue, 22 Aug 2017 13:21:54 -0500
Subject: [Python-ideas] Fwd: Please consider adding context manager versions of setUp/tearDown to unittest.TestCase
In-Reply-To: References: Message-ID:

On Tue, Aug 22, 2017 at 12:34 AM, Nick Coghlan wrote:

> Folks, this has come up before, but: please don't post through Google
> Groups, as it breaks everyone else's ability to easily reply to the
> entire mailing list.
>

Mentioning this is probably going to do nothing, especially for new,
future users. Can you block python-ideas at googlegroups.com (or if it's
CC'd or whatever) from posting if you just want the Groups page to be a
read-only thing?
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From chris.barker at noaa.gov Tue Aug 22 18:08:07 2017
From: chris.barker at noaa.gov (Chris Barker)
Date: Tue, 22 Aug 2017 15:08:07 -0700
Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase
In-Reply-To: References: Message-ID:

** Caution: cranky curmudgeonly opinionated comment ahead: **

unittest is such an ugly Java-esque static mess of an API that there's
really no point in trying to clean it up and make it more pythonic -- go
off and use pytest and be happier.
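For a taste of the difference, a minimal sketch (the pytest version needs
no class, no setUp, and uses plain asserts):

    # unittest style
    import unittest

    class TestData(unittest.TestCase):
        def setUp(self):
            self.data = [1, 2, 3]

        def test_sum(self):
            self.assertEqual(sum(self.data), 6)

    # pytest style: a fixture is injected by argument name
    import pytest

    @pytest.fixture
    def data():
        return [1, 2, 3]

    def test_sum(data):
        assert sum(data) == 6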
-CHB On Tue, Aug 22, 2017 at 5:42 AM, Nick Coghlan wrote: > On 22 August 2017 at 15:34, Nick Coghlan wrote: > > On 21 August 2017 at 11:32, Neil Girdhar wrote: > >> This question describes an example of the problem: > >> https://stackoverflow.com/questions/8416208/in-python- > is-there-a-good-idiom-for-using-context-managers-in-setup-teardown. > >> You want to invoke a context manager in your setup/tearing-down, but the > >> easiest way to do that is to override run, which seems ugly. > > > > Using context managers when you can't use a with statement is one of > > the main use cases for contextlib.ExitStack(): > > > > def setUp(self): > > self._resource_stack = stack = contextlib.ExitStack() > > self._resource = stack.enter_context(MyResource()) > > > > def tearDown(self): > > self._resource_stack.close() > > > > I posted that as an additional answer to the question: > > https://stackoverflow.com/questions/8416208/in-python- > is-there-a-good-idiom-for-using-context-managers-in- > setup-teardown/45809502#45809502 > > Sjoerd pointed out off-list that this doesn't cover the case where > you're acquiring multiple resources and one of the later acquisitions > fails, so I added the ExitStack idiom that covers that case (using > stack.pop_all() as the last operation in a with statement): > > def setUp(self): > with contextlib.ExitStack() as stack: > self._resource1 = stack.enter_context(GetResource()) > self._resource2 = stack.enter_context(GetOtherResource()) > # Failures before here -> immediate cleanup > self.addCleanup(stack.pop_all().close) > # Now cleanup won't happen until the cleanup functions run > > I also remember that using addCleanup lets you avoid defining tearDown > entirely. > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From rymg19 at gmail.com Tue Aug 22 18:20:50 2017 From: rymg19 at gmail.com (rymg19 at gmail.com) Date: Tue, 22 Aug 2017 18:20:50 -0400 Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase In-Reply-To: <> <> References: <> <> <> <> Message-ID: TBH you're completely right. Every time I see someone using unittest andItsHorriblyUnpythonicNames, I want to kill a camel. Sometimes, though, I feel like part of the struggle is the alternative. If you dislike unittest, but pytest is too "magical" for you, what do you use? Many Python testing tools like nose are just test *runners*, so you still need something else. In the end, many just end up back at unittest, maybe with nose on top. As much as I hate JavaScript, their testing libraries are leagues above what Python has. -- Ryan (????) Yoko Shimomura, ryo (supercell/EGOIST), Hiroyuki Sawano >> everyone elsehttp://refi64.com On Aug 22, 2017 at 5:09 PM, > wrote: ** Caution: cranky curmudgeonly opinionated comment ahead: ** unitest is such an ugly Java-esque static mess of an API that there's really no point in trying to clean it up and make it more pythonic -- go off and use pytest and be happier. 
-CHB On Tue, Aug 22, 2017 at 5:42 AM, Nick Coghlan wrote: > On 22 August 2017 at 15:34, Nick Coghlan wrote: > > On 21 August 2017 at 11:32, Neil Girdhar wrote: > >> This question describes an example of the problem: > >> https://stackoverflow.com/questions/8416208/in-python- > is-there-a-good-idiom-for-using-context-managers-in-setup-teardown. > >> You want to invoke a context manager in your setup/tearing-down, but the > >> easiest way to do that is to override run, which seems ugly. > > > > Using context managers when you can't use a with statement is one of > > the main use cases for contextlib.ExitStack(): > > > > def setUp(self): > > self._resource_stack = stack = contextlib.ExitStack() > > self._resource = stack.enter_context(MyResource()) > > > > def tearDown(self): > > self._resource_stack.close() > > > > I posted that as an additional answer to the question: > > https://stackoverflow.com/questions/8416208/in-python- > is-there-a-good-idiom-for-using-context-managers-in- > setup-teardown/45809502#45809502 > > Sjoerd pointed out off-list that this doesn't cover the case where > you're acquiring multiple resources and one of the later acquisitions > fails, so I added the ExitStack idiom that covers that case (using > stack.pop_all() as the last operation in a with statement): > > def setUp(self): > with contextlib.ExitStack() as stack: > self._resource1 = stack.enter_context(GetResource()) > self._resource2 = stack.enter_context(GetOtherResource()) > # Failures before here -> immediate cleanup > self.addCleanup(stack.pop_all().close) > # Now cleanup won't happen until the cleanup functions run > > I also remember that using addCleanup lets you avoid defining tearDown > entirely. > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov _______________________________________________ Python-ideas mailing list Python-ideas at python.org https://mail.python.org/mailman/listinfo/python-ideas Code of Conduct: http://python.org/psf/codeofconduct/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From prometheus235 at gmail.com Tue Aug 22 19:14:21 2017 From: prometheus235 at gmail.com (Nick Timkovich) Date: Tue, 22 Aug 2017 18:14:21 -0500 Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase In-Reply-To: References: Message-ID: Knowing nothing about the JavaScript ecosystem (other than that leftpad is apparently not a joke and everything needs more jQuery), what are the leagues-above testing libraries? On Tue, Aug 22, 2017 at 5:20 PM, rymg19 at gmail.com wrote: > TBH you're completely right. Every time I see someone using unittest > andItsHorriblyUnpythonicNames, I want to kill a camel. > > Sometimes, though, I feel like part of the struggle is the alternative. If > you dislike unittest, but pytest is too "magical" for you, what do you use? > Many Python testing tools like nose are just test *runners*, so you still > need something else. In the end, many just end up back at unittest, maybe > with nose on top. 
> > As much as I hate JavaScript, their testing libraries are leagues above > what Python has. > > -- > Ryan (????) > Yoko Shimomura, ryo (supercell/EGOIST), Hiroyuki Sawano >> everyone elsehttp://refi64.com > > On Aug 22, 2017 at 5:09 PM, > wrote: > > ** Caution: cranky curmudgeonly opinionated comment ahead: ** > > > unitest is such an ugly Java-esque static mess of an API that there's > really no point in trying to clean it up and make it more pythonic -- go > off and use pytest and be happier. > > -CHB > > > > On Tue, Aug 22, 2017 at 5:42 AM, Nick Coghlan wrote: > >> On 22 August 2017 at 15:34, Nick Coghlan wrote: >> > On 21 August 2017 at 11:32, Neil Girdhar wrote: >> >> This question describes an example of the problem: >> >> https://stackoverflow.com/questions/8416208/in-python-is- >> there-a-good-idiom-for-using-context-managers-in-setup-teardown. >> >> You want to invoke a context manager in your setup/tearing-down, but >> the >> >> easiest way to do that is to override run, which seems ugly. >> > >> > Using context managers when you can't use a with statement is one of >> > the main use cases for contextlib.ExitStack(): >> > >> > def setUp(self): >> > self._resource_stack = stack = contextlib.ExitStack() >> > self._resource = stack.enter_context(MyResource()) >> > >> > def tearDown(self): >> > self._resource_stack.close() >> > >> > I posted that as an additional answer to the question: >> > https://stackoverflow.com/questions/8416208/in-python-is- >> there-a-good-idiom-for-using-context-managers-in-setup- >> teardown/45809502#45809502 >> >> Sjoerd pointed out off-list that this doesn't cover the case where >> you're acquiring multiple resources and one of the later acquisitions >> fails, so I added the ExitStack idiom that covers that case (using >> stack.pop_all() as the last operation in a with statement): >> >> def setUp(self): >> with contextlib.ExitStack() as stack: >> self._resource1 = stack.enter_context(GetResource()) >> self._resource2 = stack.enter_context(GetOtherResource()) >> # Failures before here -> immediate cleanup >> self.addCleanup(stack.pop_all().close) >> # Now cleanup won't happen until the cleanup functions run >> >> I also remember that using addCleanup lets you avoid defining tearDown >> entirely. >> >> Cheers, >> Nick. >> >> -- >> Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> https://mail.python.org/mailman/listinfo/python-ideas >> Code of Conduct: http://python.org/psf/codeofconduct/ >> >> > > > -- > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > _______________________________________________ Python-ideas mailing list > Python-ideas at python.org https://mail.python.org/ > mailman/listinfo/python-ideas Code of Conduct: http://python.org/psf/ > codeofconduct/ > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL:

From steve at pearwood.info Tue Aug 22 19:37:56 2017
From: steve at pearwood.info (Steven D'Aprano)
Date: Wed, 23 Aug 2017 09:37:56 +1000
Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase
In-Reply-To: References: <> <> <> <>
Message-ID: <20170822233755.GB7395@ando.pearwood.info>

On Tue, Aug 22, 2017 at 06:20:50PM -0400, rymg19 at gmail.com wrote:

> TBH you're completely right. Every time I see someone using unittest
> andItsHorriblyUnpythonicNames, I want to kill a camel.

If your only complaint about unittest is that
you_miss_writing_underscores_between_all_the_words, then unittest must
be pretty good.

--
Steve

From chris.barker at noaa.gov Tue Aug 22 20:24:49 2017
From: chris.barker at noaa.gov (Chris Barker)
Date: Tue, 22 Aug 2017 17:24:49 -0700
Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase
In-Reply-To: References: Message-ID:

On Tue, Aug 22, 2017 at 5:19 PM, Chris Barker wrote:

> anyway, that's enough ranting.....

Got carried away with the ranting, and didn't flesh out my point.

My point is that unittest is a very static, not very pythonic framework --
if you are productive with it, great, but I don't think it's worth trying
to add more pythonic niceties to it. Chances are pytest (or nose2?) may
already have them, or, if not, the simpler structure of pytest tests makes
them easier to write yourself.

-CHB

--

Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From chris.barker at noaa.gov Tue Aug 22 20:19:33 2017
From: chris.barker at noaa.gov (Chris Barker)
Date: Tue, 22 Aug 2017 17:19:33 -0700
Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase
In-Reply-To: References: Message-ID:

Getting kind of OT, but:

> ... pytest is too "magical" for you,

I do get confused a bit sometimes, but for the most part, I simply don't
use the magic -- pytest does a great job of making the simple things
simple.

> what do you use? Many Python testing tools like nose are just test
> *runners*, so you still need something else.

nose did provide a number of utilities to make testing friendly, but it is
apparently dead, and AFAICT, nose2 is mostly a test runner for unittest2 :-(

I converted to pytest a while back, mostly inspired by its wonderful
reporting of the details of test failures.

> If your only complaint about unittest is that
> you_miss_writing_underscores_between_all_the_words, then unittest must
> be pretty good.

For my part, I kinda liked StudlyCaps before I drank the pep8 kool-aid.

What I dislike about unittest is that it is a pile of almost completely
worthless boilerplate that you have to write. What the heck are all those
assertThis methods for? I always thought they were ridiculous, but then I
went in to write a new one (for math.isclose(), which was rejected, and
one of these days I may add it to assertAlmostEqual ... and
assertNotAlmostEqual ! ) -- lo and behold, the entire purpose of the
assert methods is to create a nice message when the test fails. really!
This in a dynamic language with wonderful introspection capabilities.

So that's most of the code in unittest -- completely worthless boilerplate
that just makes you have to type more.
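To be concrete about that point, a tiny illustration -- the method exists
almost entirely to build the failure message:

    import unittest

    class T(unittest.TestCase):
        def test_eq(self):
            # fails with "AssertionError: 2 != 3" -- that message is
            # essentially all that assertEqual adds over a bare assert
            self.assertEqual(1 + 1, 3)

    # pytest recovers the same information by introspecting a plain assert:
    def test_eq():
        assert 1 + 1 == 3   # reported as: assert (1 + 1) == 3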
Then there is the fixture stuff -- not too bad, but still a lot klunkier than pytest fixtures. And no parameterized testing -- that's a killer feature (that nose provided as well) anyway, that's enough ranting..... -CHB -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From mistersheik at gmail.com Tue Aug 22 22:05:17 2017 From: mistersheik at gmail.com (Neil Girdhar) Date: Wed, 23 Aug 2017 02:05:17 +0000 Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase In-Reply-To: References: Message-ID: Like you, I used nose and then switched to pytest. The reason I proposed this for unittest is because pytest and nose and (I think) most of the other testing frameworks inherit from unittest, so improving unittest has downstream benefits. I may nevertheless propose this to the pytest people if this doesn't make it into unittest. On Tue, Aug 22, 2017 at 8:26 PM Chris Barker wrote: > On Tue, Aug 22, 2017 at 5:19 PM, Chris Barker > wrote: > >> anyway, that's enough ranting..... >> > > Got carried away with the ranting, and didn't flesh out my point. > > My point is that unittest is a very static, not very pythonic framework -- > if you are productive with it, great, but I don't think it's worth trying > to add more pythonic niceties to. Chances are pytest (Or nose2?) may > already have them, or, if not, the simpler structure of pytest tests make > them easier to write yourself. > > -CHB > > -- > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > > -- > > --- > You received this message because you are subscribed to a topic in the > Google Groups "python-ideas" group. > To unsubscribe from this topic, visit > https://groups.google.com/d/topic/python-ideas/cF_4IlJq698/unsubscribe. > To unsubscribe from this group and all its topics, send an email to > python-ideas+unsubscribe at googlegroups.com. > For more options, visit https://groups.google.com/d/optout. > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > > -- > > --- > You received this message because you are subscribed to a topic in the > Google Groups "python-ideas" group. > To unsubscribe from this topic, visit > https://groups.google.com/d/topic/python-ideas/cF_4IlJq698/unsubscribe. > To unsubscribe from this group and all its topics, send an email to > python-ideas+unsubscribe at googlegroups.com. > For more options, visit https://groups.google.com/d/optout. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Wed Aug 23 03:30:46 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 23 Aug 2017 17:30:46 +1000 Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase In-Reply-To: References: Message-ID: On 23 August 2017 at 08:20, rymg19 at gmail.com wrote: > TBH you're completely right. Every time I see someone using unittest > andItsHorriblyUnpythonicNames, I want to kill a camel. 
> > Sometimes, though, I feel like part of the struggle is the alternative. If > you dislike unittest, but pytest is too "magical" for you, what do you use? > Many Python testing tools like nose are just test *runners*, so you still > need something else. In the end, many just end up back at unittest, maybe > with nose on top. A snake_case helper API for unittest that I personally like is hamcrest, since that also separates out the definition of testing assertions from being part of a test case: https://pypi.python.org/pypi/PyHamcrest Introducing such a split natively into unittest is definitely attractive, but would currently be difficult due to the way that some features like self.maxDiff and self.subTest work. However, PEP 550's execution contexts may provide a way to track the test state reliably that's independent of being a method on a test case instance, in which case it would become feasible to offer a more procedural interface in addition to the current visibly object-oriented one. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ncoghlan at gmail.com Wed Aug 23 05:00:56 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 23 Aug 2017 19:00:56 +1000 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On 21 August 2017 at 07:01, Barry wrote: > I'm not clear why there is a new_context_key which seems not to be a key. > It seems that the object is a container for a single value. > > Key.set( value ) does not feel right. It's basically borrowed from procedural thread local APIs, which tend to use APIs like "tss_set(key, value)". That said, in a separate discussion, Caleb Hattingh mentioned C#'s AsyncLocal API, and it occurred to me that "context local" might work well as the name of the context access API: my_implicit_state = sys.new_context_local('my_state') my_implicit_state.set('spam') # Later, to access the value of my_implicit_state: print(my_implicit_state.get()) That way, we'd have 3 clearly defined kinds of local variables: * frame locals (the regular kind) * thread locals (threading.locals() et al) * context locals (PEP 550) The fact contexts can be nested, and a failed lookup in the active implicit context may then query outer namespaces in the current execution context would then be directly analogous to the way name lookups are resolved for frame locals. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From guido at python.org Wed Aug 23 11:41:06 2017 From: guido at python.org (Guido van Rossum) Date: Wed, 23 Aug 2017 08:41:06 -0700 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On Wed, Aug 23, 2017 at 2:00 AM, Nick Coghlan wrote: > On 21 August 2017 at 07:01, Barry wrote: > > I'm not clear why there is a new_context_key which seems not to be a key. > > It seems that the object is a container for a single value. > > > > Key.set( value ) does not feel right. > > It's basically borrowed from procedural thread local APIs, which tend > to use APIs like "tss_set(key, value)". 
> > That said, in a separate discussion, Caleb Hattingh mentioned C#'s > AsyncLocal API, and it occurred to me that "context local" might work > well as the name of the context access API: > > my_implicit_state = sys.new_context_local('my_state') > my_implicit_state.set('spam') > > # Later, to access the value of my_implicit_state: > print(my_implicit_state.get()) > > That way, we'd have 3 clearly defined kinds of local variables: > > * frame locals (the regular kind) > * thread locals (threading.locals() et al) > * context locals (PEP 550) > > The fact contexts can be nested, and a failed lookup in the active > implicit context may then query outer namespaces in the current > execution context would then be directly analogous to the way name > lookups are resolved for frame locals. If we're extending the analogy with thread-locals we should at least consider making each instantiation return a namespace rather than something holding a single value. We have log_state = threading.local() log_state.verbose = False def action(x): if log_state.verbose: print(x) def make_verbose(): log_state.verbose = True It would be nice if we could upgrade this to make it PEP 550-aware so that only the first line needs to change: log_state = sys.AsyncLocal("log state") # The rest is the same We might even support the alternative notation where you can provide default values and suggest a schema, similar to to threading.local: class LogState(threading.local): verbose = False log_state = LogState() (I think that for calls that construct empty instances of various types we should just use the class name rather than some factory function. I also think none of this should live in sys but that's separate.) -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From ethan at stoneleaf.us Wed Aug 23 12:40:16 2017 From: ethan at stoneleaf.us (Ethan Furman) Date: Wed, 23 Aug 2017 09:40:16 -0700 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: <599DAFF0.4030506@stoneleaf.us> On 08/23/2017 08:41 AM, Guido van Rossum wrote: > If we're extending the analogy with thread-locals we should at least consider making each instantiation return a > namespace rather than something holding a single value. +1 -- ~Ethan~ From john.torakis at gmail.com Wed Aug 23 12:55:00 2017 From: john.torakis at gmail.com (John Torakis) Date: Wed, 23 Aug 2017 19:55:00 +0300 Subject: [Python-ideas] Remote package/module imports through HTTP/S Message-ID: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> Hello all! Today I opened an issue in bugs.python.org (http://bugs.python.org/issue31264) proposing a module I created for remote package/module imports through standard HTTP/S. The concept is that, if a directory is served through HTTP/S (the way SimpleHTTPServer module serves directories), a Finder/Loader object can fetch Python files from that directory using HTTP requests, and finally load them as modules (or packages) in the running namespace. The repo containing a primitive (but working) version of the Finder/Loader, also contains self explanatory examples (in the README.md): https://github.com/operatorequals/httpimport My proposal is that this module can become a core Python feature, providing a way to load modules even from Github.com repositories, without the need to "git clone - setup.py install" them. Other languages, like golang, provide this functionality from their early days (day one?). 
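To make the mechanics concrete, here is a stripped-down sketch of such a
Finder/Loader -- not the actual httpimport code, top-level modules only,
and strictly opt-in:

    import importlib.abc
    import importlib.util
    import urllib.request

    class HTTPFinder(importlib.abc.MetaPathFinder, importlib.abc.Loader):
        def __init__(self, base_url):
            self.base_url = base_url.rstrip('/')

        def _fetch(self, fullname):
            # fetch "<base_url>/<module>.py", or None if it isn't there
            try:
                return urllib.request.urlopen(
                    '%s/%s.py' % (self.base_url, fullname)).read()
            except Exception:
                return None

        def find_spec(self, fullname, path=None, target=None):
            if self._fetch(fullname) is None:
                return None
            return importlib.util.spec_from_loader(fullname, self)

        def exec_module(self, module):
            source = self._fetch(module.__name__)
            exec(compile(source, module.__name__, 'exec'), module.__dict__)

    # nothing touches the network unless you install the finder yourself:
    # import sys
    # sys.meta_path.append(HTTPFinder('https://example.com/modules'))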
Python development can be greatly improved if a "try before pip installing" mechanism gets in place, as it will add a lot to the REPL nature of the testing/experimenting process. Thank you for your time, John Torakis, IT Security Researcher P.S: It is my first time in this mailing list and generally Python contribution. Please be tolerant! From rosuav at gmail.com Wed Aug 23 13:17:17 2017 From: rosuav at gmail.com (Chris Angelico) Date: Thu, 24 Aug 2017 03:17:17 +1000 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> Message-ID: On Thu, Aug 24, 2017 at 2:55 AM, John Torakis wrote: > Hello all! > > Today I opened an issue in bugs.python.org > (http://bugs.python.org/issue31264) proposing a module I created for > remote package/module imports through standard HTTP/S. > > The concept is that, if a directory is served through HTTP/S (the way > SimpleHTTPServer module serves directories), a Finder/Loader object can > fetch Python files from that directory using HTTP requests, and finally > load them as modules (or packages) in the running namespace. > > The repo containing a primitive (but working) version of the > Finder/Loader, also contains self explanatory examples (in the README.md): > > https://github.com/operatorequals/httpimport > > > My proposal is that this module can become a core Python feature, > providing a way to load modules even from Github.com repositories, > without the need to "git clone - setup.py install" them. > > > Other languages, like golang, provide this functionality from their > early days (day one?). Python development can be greatly improved if a > "try before pip installing" mechanism gets in place, as it will add a > lot to the REPL nature of the testing/experimenting process. As a core feature? No no no no no no no no. Absolutely do NOT WANT THIS. This is a security bug magnet; can you imagine trying to ensure that malicious code is not executed, in an arbitrary execution context? As an explicitly-enabled feature, it's a lot less hairy than a permanently-active one (can you IMAGINE how terrifying that would be?), but even so, trying to prove that addRemoteRepo (not a PEP8-compliant name, btw) is getting the correct code is not going to be easy. You have to (a) drop HTTP altogether and mandate SSL and (b) be absolutely sure that your certificate chains are 100% dependable, which - as we've seen recently - is a nontrivial task. The easiest way to add remote code is pip. For most packages, that's what you want to be using: pip install requests will make "import requests" functional. I don't see pip mentioned anywhere in your README, but you do mention the testing of pull requests, so at very least, this wants some explanatory screed. But I'm not entirely sure I want to support this. You're explicitly talking about using this with the creation of backdoors... in what, exactly? What are you actually getting at here? ChrisA From phd at phdru.name Wed Aug 23 13:13:04 2017 From: phd at phdru.name (Oleg Broytman) Date: Wed, 23 Aug 2017 19:13:04 +0200 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> Message-ID: <20170823171304.GA29208@phdru.name> Hi! On Wed, Aug 23, 2017 at 07:55:00PM +0300, John Torakis wrote: > Hello all! 
>
> Today I opened an issue in bugs.python.org
> (http://bugs.python.org/issue31264) proposing a module I created for
> remote package/module imports through standard HTTP/S.

The issue is so big IMO it requires a PEP, not just an issue. Anyway I'm
-1000 for reasons of security, connectivity (not all hosts are
connected), traffic cost and speed.

> The concept is that, if a directory is served through HTTP/S (the way
> SimpleHTTPServer module serves directories), a Finder/Loader object can
> fetch Python files from that directory using HTTP requests, and finally
> load them as modules (or packages) in the running namespace.
>
> The repo containing a primitive (but working) version of the
> Finder/Loader also contains self-explanatory examples (in the README.md):
>
> https://github.com/operatorequals/httpimport
>
> My proposal is that this module can become a core Python feature,
> providing a way to load modules even from Github.com repositories,
> without the need to "git clone - setup.py install" them.
>
> Other languages, like golang, provide this functionality from their

AFAIK Go downloads modules at compile time, not run time. This is a major
distinction from Python.

> early days (day one?). Python development can be greatly improved if a
> "try before pip installing" mechanism gets in place, as it will add a
> lot to the REPL nature of the testing/experimenting process.
>
> Thank you for your time,
>
> John Torakis, IT Security Researcher
>
> P.S: It is my first time in this mailing list and generally Python
> contribution. Please be tolerant!

Oleg.
--
Oleg Broytman http://phdru.name/ phd at phdru.name
Programmers don't die, they just GOSUB without RETURN.

From john.torakis at gmail.com Wed Aug 23 13:37:11 2017
From: john.torakis at gmail.com (John Torakis)
Date: Wed, 23 Aug 2017 20:37:11 +0300
Subject: [Python-ideas] Remote package/module imports through HTTP/S
In-Reply-To: <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com>
References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com>
Message-ID: <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com>

On 23/08/2017 20:36, John Torakis wrote:
> Yeah, I am a security researcher, I am keen on backdoor programming and
> staging and all that! It is my official job and research topic! I go to
> the office and code such stuff! I am not a blackhat, nor a security
> enthusiast, it is my job.
>
> First of all, let's all agree that if someone can run Python code on
> your computer you are 100% hacked! It is irrelevant whether "httpimport"
> is a core python feature or not in that case.
>
> Now, I agree that this can be exploited if used under plain HTTP; it is
> a MiTM -> remote code execution case. I admit that this is not bright.
> But I mention that this can be used in testing.
>
> On the topic of HTTPS, man-in-the-middle is not possible without a
> previous Trusted Certificate compromise. Github can be trusted 100%
> percent for example. A certificate check has to take place in the HTTPS
> remote loading for sure!
>
> When I said a "core feature" I meant that the "httpimport" module would
> deliver with the core modules. Not that the Finder/Loader has to be in
> the list of Finders/Loaders that are used by default! For god's sake, I
> wouldn't like my PC to start probing for modules just because I mistyped
> an import line!
>
> I know that pip works nicely, especially when paired with virtual
> environments, but ad-hoc importing is another thing.
It isn't > meant for delivering real projects. Just for testing modules without the > need to download them, maybe install them, and all. > > > Thank you for your time, > John Torakis > > > On 23/08/2017 20:17, Chris Angelico wrote: >> On Thu, Aug 24, 2017 at 2:55 AM, John Torakis wrote: >>> Hello all! >>> >>> Today I opened an issue in bugs.python.org >>> (http://bugs.python.org/issue31264) proposing a module I created for >>> remote package/module imports through standard HTTP/S. >>> >>> The concept is that, if a directory is served through HTTP/S (the way >>> SimpleHTTPServer module serves directories), a Finder/Loader object can >>> fetch Python files from that directory using HTTP requests, and finally >>> load them as modules (or packages) in the running namespace. >>> >>> The repo containing a primitive (but working) version of the >>> Finder/Loader, also contains self explanatory examples (in the README.md): >>> >>> https://github.com/operatorequals/httpimport >>> >>> >>> My proposal is that this module can become a core Python feature, >>> providing a way to load modules even from Github.com repositories, >>> without the need to "git clone - setup.py install" them. >>> >>> >>> Other languages, like golang, provide this functionality from their >>> early days (day one?). Python development can be greatly improved if a >>> "try before pip installing" mechanism gets in place, as it will add a >>> lot to the REPL nature of the testing/experimenting process. >> As a core feature? No no no no no no no no. Absolutely do NOT WANT >> THIS. This is a security bug magnet; can you imagine trying to ensure >> that malicious code is not executed, in an arbitrary execution >> context? As an explicitly-enabled feature, it's a lot less hairy than >> a permanently-active one (can you IMAGINE how terrifying that would >> be?), but even so, trying to prove that addRemoteRepo (not a >> PEP8-compliant name, btw) is getting the correct code is not going to >> be easy. You have to (a) drop HTTP altogether and mandate SSL and (b) >> be absolutely sure that your certificate chains are 100% dependable, >> which - as we've seen recently - is a nontrivial task. >> >> The easiest way to add remote code is pip. For most packages, that's >> what you want to be using: >> >> pip install requests >> >> will make "import requests" functional. I don't see pip mentioned >> anywhere in your README, but you do mention the testing of pull >> requests, so at very least, this wants some explanatory screed. >> >> But I'm not entirely sure I want to support this. You're explicitly >> talking about using this with the creation of backdoors... in what, >> exactly? What are you actually getting at here? 
>> >> ChrisA >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> https://mail.python.org/mailman/listinfo/python-ideas >> Code of Conduct: http://python.org/psf/codeofconduct/ > From john.torakis at gmail.com Wed Aug 23 13:49:12 2017 From: john.torakis at gmail.com (John Torakis) Date: Wed, 23 Aug 2017 20:49:12 +0300 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> References: <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> Message-ID: Bounced back on list -------- Forwarded Message -------- ????: Re: [Python-ideas] Remote package/module imports through HTTP/S ??????????: Wed, 23 Aug 2017 20:36:19 +0300 ???: John Torakis ????: Chris Angelico Yeah, I am a security researcher, I am keen on backdoor programming and staging and all that! It is my official job and research topic! I go to the office and code such stuff! I am not a blackhat, nor a security enthusiast, it is my job. First of all, let's all agree that if someone can run Python code in your computer you are 100% hacked! It is irrelevant if "httpimport" is a core python feature or not in that case. Now, I agree that this can be exploited if used under plain HTTP, it is a MiTM -> Remote code execution case. I admit that this is not bright. But I mention that this can be used in testing. On the topic of HTTPS, man-in-the-middle is not possible without previous Trusted Certificate compromise. Github can be trusted 100% percent for example. A certificate check has to take place in the HTTPS remote loading for sure! When I said a "core feature" I meant that the "httpimport" module would deliver with the core modules. Not that the Finder/Loader has to be in the list of Finders/Loaders that are used by default! For god sake, I wouldn't like my PC to start probing for modules just because I mistyped an import line! I know that pip works nicely, especially when paired with virtual environments, but ad-hoc importing is another another thing. It isn't meant for delivering real projects. Just for testing modules without the need to download them, maybe install them, and all. Thank you for your time, John Torakis On 23/08/2017 20:17, Chris Angelico wrote: > On Thu, Aug 24, 2017 at 2:55 AM, John Torakis wrote: >> Hello all! >> >> Today I opened an issue in bugs.python.org >> (http://bugs.python.org/issue31264) proposing a module I created for >> remote package/module imports through standard HTTP/S. >> >> The concept is that, if a directory is served through HTTP/S (the way >> SimpleHTTPServer module serves directories), a Finder/Loader object can >> fetch Python files from that directory using HTTP requests, and finally >> load them as modules (or packages) in the running namespace. >> >> The repo containing a primitive (but working) version of the >> Finder/Loader, also contains self explanatory examples (in the README.md): >> >> https://github.com/operatorequals/httpimport >> >> >> My proposal is that this module can become a core Python feature, >> providing a way to load modules even from Github.com repositories, >> without the need to "git clone - setup.py install" them. >> >> >> Other languages, like golang, provide this functionality from their >> early days (day one?). Python development can be greatly improved if a >> "try before pip installing" mechanism gets in place, as it will add a >> lot to the REPL nature of the testing/experimenting process. > As a core feature? No no no no no no no no. 
Absolutely do NOT WANT > THIS. This is a security bug magnet; can you imagine trying to ensure > that malicious code is not executed, in an arbitrary execution > context? As an explicitly-enabled feature, it's a lot less hairy than > a permanently-active one (can you IMAGINE how terrifying that would > be?), but even so, trying to prove that addRemoteRepo (not a > PEP8-compliant name, btw) is getting the correct code is not going to > be easy. You have to (a) drop HTTP altogether and mandate SSL and (b) > be absolutely sure that your certificate chains are 100% dependable, > which - as we've seen recently - is a nontrivial task. > > The easiest way to add remote code is pip. For most packages, that's > what you want to be using: > > pip install requests > > will make "import requests" functional. I don't see pip mentioned > anywhere in your README, but you do mention the testing of pull > requests, so at very least, this wants some explanatory screed. > > But I'm not entirely sure I want to support this. You're explicitly > talking about using this with the creation of backdoors... in what, > exactly? What are you actually getting at here? > > ChrisA > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From rosuav at gmail.com Wed Aug 23 13:49:17 2017 From: rosuav at gmail.com (Chris Angelico) Date: Thu, 24 Aug 2017 03:49:17 +1000 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: On Thu, Aug 24, 2017 at 3:37 AM, John Torakis wrote: > > > On 23/08/2017 20:36, John Torakis wrote: >> Yeah, I am a security researcher, I am keen on backdoor programming and >> staging and all that! It is my official job and research topic! I go to >> the office and code such stuff! I am not a blackhat, nor a security >> enthusiast, it is my job. >> >> >> First of all, let's all agree that if someone can run Python code in >> your computer you are 100% hacked! It is irrelevant if "httpimport" is a >> core python feature or not in that case. >> >> Now, I agree that this can be exploited if used under plain HTTP, it is >> a MiTM -> Remote code execution case. I admit that this is not bright. >> But I mention that this can be used in testing. >> >> On the topic of HTTPS, man-in-the-middle is not possible without >> previous Trusted Certificate compromise. Github can be trusted 100% >> percent for example. A certificate check has to take place in the HTTPS >> remote loading for sure! Right, but that just pushes the problem one level further out: you need to have a 100% dependable certificate chain. And that means absolutely completely trusting all of your root certificates, and it also means either not needing to add any _more_ root certificates, or being able to configure the cert store. As we've seen elsewhere, this is nontrivial. >> When I said a "core feature" I meant that the "httpimport" module would >> deliver with the core modules. Not that the Finder/Loader has to be in >> the list of Finders/Loaders that are used by default! 
For god sake, I >> wouldn't like my PC to start probing for modules just because I mistyped >> an import line! Glad we agree about that! I have seen people wanting all sorts of things to become core features (usually for the sake of interactive work), and a lot of it is MUCH better handled as a non-core feature. Though a lot of what you're saying here - especially this: >> I know that pip works nicely, especially when paired with virtual >> environments, but ad-hoc importing is another another thing. It isn't >> meant for delivering real projects. Just for testing modules without the >> need to download them, maybe install them, and all. could be equally well handled by pip-installing httpimport itself, and using that to bootstrap your testing procedures. Unless, of course, you're wanting to httpimport httpimport, in which case you're going to run into bootstrapping problems whichever way you do it :) I think we're on the same page here, but it definitely needs some more text in the README to explain this - particularly how this is not a replacement for pip. For example, my first thought on seeing this was "wow, that's going to be abysmally slow unless it has a cache", but the answer to that is: if you need a cache, you probably should be using pip to install things properly. Still -1 on this becoming a stdlib package, as there's nothing I've yet seen that can't be done as a third-party package. But it's less scary than I thought it was :) ChrisA From bruce at leban.us Wed Aug 23 14:04:41 2017 From: bruce at leban.us (Bruce Leban) Date: Wed, 23 Aug 2017 11:04:41 -0700 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: On Wed, Aug 23, 2017 at 10:37 AM, John Torakis wrote: > > Github can be trusted 100% percent for example. This isn't even remotely close to true. While I'd agree with the statement that the SSL cert on github is reasonably trustworthy, the *content* on github is NOT trustworthy and that's where the security risk is. I agree that this is a useful feature and there is no way it should be on by default. The right way IMHO to do this is to have a command line option something like this: python --http-import somelib=https://github.com/someuser/somelib which then redefines the import somelib command to import from that source. Along with your scenario, it allows people, for example, to replace a library with a different version without modifying source or installing a different version. That's pretty useful. --- Bruce -------------- next part -------------- An HTML attachment was scrubbed... URL: From rosuav at gmail.com Wed Aug 23 14:11:26 2017 From: rosuav at gmail.com (Chris Angelico) Date: Thu, 24 Aug 2017 04:11:26 +1000 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: On Thu, Aug 24, 2017 at 4:04 AM, Bruce Leban wrote: > > On Wed, Aug 23, 2017 at 10:37 AM, John Torakis > wrote: >> >> >> Github can be trusted 100% percent for example. > > > This isn't even remotely close to true. 
While I'd agree with the statement > that the SSL cert on github is reasonably trustworthy, the *content* on > github is NOT trustworthy and that's where the security risk is. > > I agree that this is a useful feature and there is no way it should be on by > default. The right way IMHO to do this is to have a command line option > something like this: > > python --http-import somelib=https://github.com/someuser/somelib If you read his README, it's pretty explicit about URLs; the risk is that "https://github.com/someuser/somelib" can be intercepted, not that "someuser" is malicious. If you're worried about the latter, don't use httpimport. ChrisA From john.torakis at gmail.com Wed Aug 23 14:11:32 2017 From: john.torakis at gmail.com (John Torakis) Date: Wed, 23 Aug 2017 21:11:32 +0300 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: <941db7b3-7037-1144-5cdb-5485c30ccb3f@gmail.com> On 23/08/2017 21:04, Bruce Leban wrote: > > On Wed, Aug 23, 2017 at 10:37 AM, John Torakis > wrote: > > > Github can be trusted 100% percent for example. > > > This isn't even remotely close to true. While I'd agree with the > statement that the SSL cert on github is reasonably trustworthy, the > *content* on github is NOT trustworthy and that's where the security > risk is. Do we trust code on github? Do we trust code on PyPI? This is why I **don't** want it ON by default. You have to explicitly point the Finder/Loader to a repo that you created or you trust. And provide a list of available modules/packages to import from that URL too. If the developer isn't sure about the code she/he is importing then it is her/his fault... Same goes for pip installing though... > > I agree that this is a useful feature and there is no way it should be > on by default. The right way IMHO to do this is to have a command line > option something like this: > > python --http-import somelib=https://github.com/someuser/somelib > > > which then redefines the import somelib command to import from that > source. Along with your scenario, it allows people, for example, to > replace a library with a different version without modifying source or > installing a different version. That's pretty useful. That's what I am thinking too! just provide the module so someone can "python -m" it, or start a REPL in the context that some packages/modules are available from a URL. > > --- Bruce John Torakis -------------- next part -------------- An HTML attachment was scrubbed... URL: From john.torakis at gmail.com Wed Aug 23 14:13:33 2017 From: john.torakis at gmail.com (John Torakis) Date: Wed, 23 Aug 2017 21:13:33 +0300 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: On 23/08/2017 20:49, Chris Angelico wrote: > On Thu, Aug 24, 2017 at 3:37 AM, John Torakis wrote: >> >> On 23/08/2017 20:36, John Torakis wrote: >>> Yeah, I am a security researcher, I am keen on backdoor programming and >>> staging and all that! It is my official job and research topic! I go to >>> the office and code such stuff! I am not a blackhat, nor a security >>> enthusiast, it is my job. 
>>> >>> First of all, let's all agree that if someone can run Python code in >>> your computer you are 100% hacked! It is irrelevant if "httpimport" is a >>> core python feature or not in that case. >>> >>> Now, I agree that this can be exploited if used under plain HTTP, it is >>> a MiTM -> Remote code execution case. I admit that this is not bright. >>> But I mention that this can be used in testing. >>> >>> On the topic of HTTPS, man-in-the-middle is not possible without >>> previous Trusted Certificate compromise. Github can be trusted 100% >>> percent for example. A certificate check has to take place in the HTTPS >>> remote loading for sure! > Right, but that just pushes the problem one level further out: you > need to have a 100% dependable certificate chain. And that means > absolutely completely trusting all of your root certificates, and it > also means either not needing to add any _more_ root certificates, or > being able to configure the cert store. As we've seen elsewhere, this > is nontrivial. The centralized PKI as we know it is a pain altogether. Please let me reference this XKCD strip here: https://xkcd.com/1200/ Running code on your computer is among the low-impact things that can happen to you if you have a compromised certificate store. Trust me on that! In other words, if you can't trust your certificate store now, and you are afraid of remote code execution through HTTPS, stop using pip altogether: https://github.com/pypa/pip/issues/1168 And most package managers anyway. >>> When I said a "core feature" I meant that the "httpimport" module would >>> deliver with the core modules. Not that the Finder/Loader has to be in >>> the list of Finders/Loaders that are used by default! For god sake, I >>> wouldn't like my PC to start probing for modules just because I mistyped >>> an import line! > Glad we agree about that! I have seen people wanting all sorts of > things to become core features (usually for the sake of interactive > work), and a lot of it is MUCH better handled as a non-core feature. > > Though a lot of what you're saying here - especially this: > >>> I know that pip works nicely, especially when paired with virtual >>> environments, but ad-hoc importing is another another thing. It isn't >>> meant for delivering real projects. Just for testing modules without the >>> need to download them, maybe install them, and all. > could be equally well handled by pip-installing httpimport itself, and > using that to bootstrap your testing procedures. Unless, of course, > you're wanting to httpimport httpimport, in which case you're going to > run into bootstrapping problems whichever way you do it :) I will never open an issue for 'httpimporting the httpimport itself'. It is a promise! > > I think we're on the same page here, but it definitely needs some more > text in the README to explain this - particularly how this is not a > replacement for pip. For example, my first thought on seeing this was > "wow, that's going to be abysmally slow unless it has a cache", but > the answer to that is: if you need a cache, you probably should be > using pip to install things properly. > > Still -1 on this becoming a stdlib package, as there's nothing I've > yet seen that can't be done as a third-party package. But it's less > scary than I thought it was :) The reason I thought that this could serve greatly as a stdlib package is that it will broaden the horizon for importing arbitrary stuff just to see if it works as expected. "Testing" is the word!
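To make the mechanism being debated concrete, here is a minimal sketch of the kind of Finder/Loader John describes: a meta path finder that fetches module source over HTTP/S and executes it. This is only an illustration of the general importlib approach, not httpimport's actual code; the HTTPFinder name, the example URL, and the module allow-list are invented for the sketch.

```
import sys
import urllib.request
import importlib.abc
import importlib.util


class HTTPFinder(importlib.abc.MetaPathFinder, importlib.abc.Loader):
    """Load pure-Python modules from a directory served over HTTP/S."""

    def __init__(self, base_url, allowed):
        self.base_url = base_url.rstrip("/")
        self.allowed = set(allowed)  # explicit allow-list; never probe blindly

    def find_spec(self, fullname, path=None, target=None):
        if fullname not in self.allowed:
            return None  # defer to the normal finders for everything else
        return importlib.util.spec_from_loader(fullname, self)

    def create_module(self, spec):
        return None  # use the interpreter's default module creation

    def exec_module(self, module):
        url = "%s/%s.py" % (self.base_url, module.__name__.replace(".", "/"))
        source = urllib.request.urlopen(url).read()
        # Executing fetched code is exactly the risk discussed in this
        # thread, which is why registration is explicit and opt-in.
        exec(compile(source, url, "exec"), module.__dict__)


# Nothing is fetched until the finder is explicitly registered:
sys.meta_path.append(HTTPFinder("https://example.com/pymodules", {"mymodule"}))
# "import mymodule" would now fetch https://example.com/pymodules/mymodule.py
```

Served over HTTPS with an explicit allow-list, this is the opt-in behaviour the thread keeps coming back to: no remote import can happen unless the finder is installed on sys.meta_path by hand.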
> ChrisA > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ From john.torakis at gmail.com Wed Aug 23 14:15:29 2017 From: john.torakis at gmail.com (John Torakis) Date: Wed, 23 Aug 2017 21:15:29 +0300 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: On 23/08/2017 21:11, Chris Angelico wrote: > On Thu, Aug 24, 2017 at 4:04 AM, Bruce Leban wrote: >> On Wed, Aug 23, 2017 at 10:37 AM, John Torakis >> wrote: >>> >>> Github can be trusted 100% percent for example. >> >> This isn't even remotely close to true. While I'd agree with the statement >> that the SSL cert on github is reasonably trustworthy, the *content* on >> github is NOT trustworthy and that's where the security risk is. >> >> I agree that this is a useful feature and there is no way it should be on by >> default. The right way IMHO to do this is to have a command line option >> something like this: >> >> python --http-import somelib=https://github.com/someuser/somelib > If you read his README, it's pretty explicit about URLs; the risk is > that "https://github.com/someuser/somelib" can be intercepted, not > that "someuser" is malicious. If you're worried about the latter, > don't use httpimport. Again, if https://github.com/someuser/somelib can be intercepted, https://pypi.python.org/pypi can too. If HTTPS is intercepted so easily (when not used from browsers) we are f**ed... > > ChrisA > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ From p.f.moore at gmail.com Wed Aug 23 14:24:40 2017 From: p.f.moore at gmail.com (Paul Moore) Date: Wed, 23 Aug 2017 19:24:40 +0100 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: On 23 August 2017 at 18:49, Chris Angelico wrote: > Still -1 on this becoming a stdlib package, as there's nothing I've > yet seen that can't be done as a third-party package. But it's less > scary than I thought it was :) IMO, this would make a great 3rd party package (I note that it's not yet published on PyPI). It's possible that it would end up being extremely popular, and recognised as sufficiently secure - at which point it may be worth considering for core inclusion. But it's also possible that it remains niche, and/or people aren't willing to take the security risks that it implies, in which case it's still useful to those who do like it. One aspect that hasn't been mentioned yet - as a 3rd party module, the user (or the organisation's security team) can control whether or not the ability to import over the web is available by controlling whether the module is allowed to be installed - whereas with a core module, it's there, like it or not, and *all* Python code has to be audited on the assumption that it might be used. I could easily imagine cases where the httpimport module was allowed on development machines and CI servers, but forbidden on production (and pre-production) systems. 
That option simply isn't available if the feature is in the core. Paul From guido at python.org Wed Aug 23 14:41:26 2017 From: guido at python.org (Guido van Rossum) Date: Wed, 23 Aug 2017 11:41:26 -0700 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: This isn't ever going to be a standard feature. It's available as a third-party package and that's fine. I'd like to add a historic note -- this was first proposed around 1995 by Michael McLay. (Sorry, I don't have an email sitting around, but I'm sure he brought this up at or around the first Python workshop at NIST in 1995 -- I was his guest at NIST for several months at the time.) -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From john.torakis at gmail.com Wed Aug 23 14:41:57 2017 From: john.torakis at gmail.com (John Torakis) Date: Wed, 23 Aug 2017 21:41:57 +0300 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: <292e25ca-ca97-ee3b-04c9-d812cc43fd60@gmail.com> On 23/08/2017 21:24, Paul Moore wrote: > On 23 August 2017 at 18:49, Chris Angelico wrote: >> Still -1 on this becoming a stdlib package, as there's nothing I've >> yet seen that can't be done as a third-party package. But it's less >> scary than I thought it was :) > IMO, this would make a great 3rd party package (I note that it's not > yet published on PyPI). It's possible that it would end up being > extremely popular, and recognised as sufficiently secure - at which > point it may be worth considering for core inclusion. But it's also > possible that it remains niche, and/or people aren't willing to take > the security risks that it implies, in which case it's still useful to > those who do like it. PyPI upload is scheduled when some more testing and commenting takes place. > One aspect that hasn't been mentioned yet - as a 3rd party module, the > user (or the organisation's security team) can control whether or not > the ability to import over the web is available by controlling whether > the module is allowed to be installed - whereas with a core module, > it's there, like it or not, and *all* Python code has to be audited on > the assumption that it might be used. True! But you can urlopen()->exec() anything out there anyway! A ">>>" prompt is all you need. > I could easily imagine cases > where the httpimport module was allowed on development machines and CI > servers, but forbidden on production (and pre-production) systems. > That option simply isn't available if the feature is in the core. I agree that there are circumstances that this module should not be used (regardless of security implications). In a released product for example. Depending on the UP-ness of a remote repository (e.g github), not to even mention the API backward-compatibility of an upstream package, is just **BAD** for a ready-released-deliverable product! This is why we have virtual environments! But it remains an option to use it or not! I, for example, find myself REPLing more than scripting. When REPLing for something you plan to implement sometime-somehow, this module is really what you need! 
But when I finally create a script, I won't disable its offline functionality just to use httpimport. That would be suicidal! When I finally come up with a working thing, I will land the used packages on disk and in a virtual environment. My argument is that this module will add greatly to Python's ad-hoc testing capabilities! I find it elegant for such a feature to be in the stdlib of a language. I don't doubt that it can survive as a 3rd party module, though. > > Paul > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ From john.torakis at gmail.com Wed Aug 23 14:44:15 2017 From: john.torakis at gmail.com (John Torakis) Date: Wed, 23 Aug 2017 21:44:15 +0300 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: <9dddff31-5d87-2be0-1d97-a662d5a1a784@gmail.com> On 23/08/2017 21:41, Guido van Rossum wrote: > This isn't ever going to be a standard feature. It's available as a > third-party package and that's fine. > > I'd like to add a historic note -- this was first proposed around 1995 > by Michael McLay. (Sorry, I don't have an email sitting around, but > I'm sure he brought this up at or around the first Python workshop at > NIST in 1995 -- I was his guest at NIST for several months at the time.) > Woah! I was 2 years old at that time! Little did I know! Can I ask why it got rejected the first time?
> -- > --Guido van Rossum (python.org/~guido ) > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Wed Aug 23 14:48:57 2017 From: guido at python.org (Guido van Rossum) Date: Wed, 23 Aug 2017 11:48:57 -0700 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: <9dddff31-5d87-2be0-1d97-a662d5a1a784@gmail.com> References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> <9dddff31-5d87-2be0-1d97-a662d5a1a784@gmail.com> Message-ID: For security reasons. AFAIK HTTPS wasn't even invented at the time. On Wed, Aug 23, 2017 at 11:44 AM, John Torakis wrote: > > > On 23/08/2017 21:41, Guido van Rossum wrote: > > This isn't ever going to be a standard feature. It's available as a > third-party package and that's fine. > > I'd like to add a historic note -- this was first proposed around 1995 by > Michael McLay. (Sorry, I don't have an email sitting around, but I'm sure > he brought this up at or around the first Python workshop at NIST in 1995 > -- I was his guest at NIST for several months at the time.) > > Woah! I was 2 years old at that time! Little did I know! > Can I ask why it got rejected the first time? > > -- > --Guido van Rossum (python.org/~guido ) > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From john.torakis at gmail.com Wed Aug 23 15:04:37 2017 From: john.torakis at gmail.com (John Torakis) Date: Wed, 23 Aug 2017 22:04:37 +0300 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> <9dddff31-5d87-2be0-1d97-a662d5a1a784@gmail.com> Message-ID: Dark times... So is it a "case closed", or is there any improvement that will make it worth it to be a stdlib module? I mean, times have changed from 1995, and I am not referring to the invention of HTTPS. This is the reason that makes httpimport just tolerable security-wise. I'm talking about the need to rapidly test public code. I insist that testing code available on Github (or other repos), without the venv/clone/install hassle is a major improvement in my (and most sec researchers' I know) Python workflow. It makes REPL prototyping a million times smoother. We all have created small scripts that auto load modules from URLs anyway. That's why I thought that this module falls under the second category of 20.2.1 in https://docs.python.org/devguide/stdlibchanges.html (I did my homework before mailing this list). So, if there is something that would make this module acceptable for stdlib, please let me know! I'd more than happily reform it and make it comply with Python stdlib requirements. John Torakis On 23/08/2017 21:48, Guido van Rossum wrote: > For security reasons. AFAIK HTTPS wasn't even invented at the time. > > On Wed, Aug 23, 2017 at 11:44 AM, John Torakis > wrote: > > > > On 23/08/2017 21:41, Guido van Rossum wrote: >> This isn't ever going to be a standard feature. It's available as >> a third-party package and that's fine. >> >> I'd like to add a historic note -- this was first proposed around >> 1995 by Michael McLay. (Sorry, I don't have an email sitting >> around, but I'm sure he brought this up at or around the first >> Python workshop at NIST in 1995 -- I was his guest at NIST for >> several months at the time.) >> > Woah! I was 2 years old at that time! Little did I know! > Can I ask why it got rejected the first time? >> -- >> --Guido van Rossum (python.org/~guido ) >> >> >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> https://mail.python.org/mailman/listinfo/python-ideas >> >> Code of Conduct: http://python.org/psf/codeofconduct/ >> > _______________________________________________ Python-ideas > mailing list Python-ideas at python.org > > https://mail.python.org/mailman/listinfo/python-ideas > Code of > Conduct: http://python.org/psf/codeofconduct/ > > > -- > --Guido van Rossum (python.org/~guido ) -------------- next part -------------- An HTML attachment was scrubbed...
URL: From rosuav at gmail.com Wed Aug 23 15:06:37 2017 From: rosuav at gmail.com (Chris Angelico) Date: Thu, 24 Aug 2017 05:06:37 +1000 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> <9dddff31-5d87-2be0-1d97-a662d5a1a784@gmail.com> Message-ID: On Thu, Aug 24, 2017 at 5:04 AM, John Torakis wrote: > Dark times... > > So is it a "case closed", or is there any improvement that will make it > worth it to be an stdlib module? > > I mean, times have changed from 1995, and I am not referring to HTTPS > invention. This is the reason that makes httpimport just tolerable > security-wise. > > I'm talking about the need to rapidly test public code. I insist that > testing code available on Github (or other repos), without the > venv/clone/install hassle is a major improvement in my (and most sec > researchers' I know) Python workflow. It makes REPL prototyping million > times smoother. > We all have created small scripts that auto load modules from URLs anyway. > That's why I thought that this modules falls under the second category of > 20.2.1 in https://docs.python.org/devguide/stdlibchanges.html (I did my > homework before getting to mail in this list). > > So, if there is something that would make this module acceptable for stdlib, > please let me know! I'd more than happily reform it and make it comply with > Python stdlib requirements. Why can't people just "pip install httpimport" to make use of it? Why does it need to be in the stdlib? ChrisA From john.torakis at gmail.com Wed Aug 23 15:21:18 2017 From: john.torakis at gmail.com (John Torakis) Date: Wed, 23 Aug 2017 22:21:18 +0300 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> <9dddff31-5d87-2be0-1d97-a662d5a1a784@gmail.com> Message-ID: <6cf767cc-e42f-23c5-3932-653aa285a4df@gmail.com> On 23/08/2017 22:06, Chris Angelico wrote: > On Thu, Aug 24, 2017 at 5:04 AM, John Torakis wrote: >> Dark times... >> >> So is it a "case closed", or is there any improvement that will make it >> worth it to be an stdlib module? >> >> I mean, times have changed from 1995, and I am not referring to HTTPS >> invention. This is the reason that makes httpimport just tolerable >> security-wise. >> >> I'm talking about the need to rapidly test public code. I insist that >> testing code available on Github (or other repos), without the >> venv/clone/install hassle is a major improvement in my (and most sec >> researchers' I know) Python workflow. It makes REPL prototyping million >> times smoother. >> We all have created small scripts that auto load modules from URLs anyway. >> That's why I thought that this modules falls under the second category of >> 20.2.1 in https://docs.python.org/devguide/stdlibchanges.html (I did my >> homework before getting to mail in this list). >> >> So, if there is something that would make this module acceptable for stdlib, >> please let me know! I'd more than happily reform it and make it comply with >> Python stdlib requirements. > Why can't people just "pip install httpimport" to make use of it? Why > does it need to be in the stdlib? It doesn't, strictly speaking, *need* to be in stdlib. Of course it doesn't! Python is good enough without it. 
But, as it seems like it is a very big feature (to me at least), it feels right to be "officially" a Python feature. It feels right to be officially supported and not just another module, as it extends core import functionality. Just like zipimport does. Zipimport could just be a 3rd party module too. But it is in the core, and I can see why. Anyway, I will post it to PyPI when I finalize Github support and extend the testing a little bit. I will then shoot a mail again and repropose the module when it reaches full maturity. > > ChrisA > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ Thank you all for your time! John Torakis From bruce at leban.us Wed Aug 23 17:19:48 2017 From: bruce at leban.us (Bruce Leban) Date: Wed, 23 Aug 2017 14:19:48 -0700 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: On Wed, Aug 23, 2017 at 11:11 AM, Chris Angelico wrote: > > > If you read his README, it's pretty explicit about URLs; the risk is > that "https://github.com/someuser/somelib" can be intercepted, not > that "someuser" is malicious. If you're worried about the latter, > don't use httpimport. I don't see the word "security" or "risk" in the readme. The risk is not just that someuser is malicious but the risk that they, their github credentials or their code have been compromised. The reason that if this feature were to be implemented, I would want it outside the source code (command line option) is that that puts the control in the hands of the person running the code. This is appropriate for the stated scenarios. There's no possibility of a hidden live github dependency. --- Bruce -------------- next part -------------- An HTML attachment was scrubbed... URL: From mistersheik at gmail.com Wed Aug 23 18:20:48 2017 From: mistersheik at gmail.com (Neil Girdhar) Date: Wed, 23 Aug 2017 22:20:48 +0000 Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase In-Reply-To: References: Message-ID: On Wed, Aug 23, 2017 at 3:31 AM Nick Coghlan wrote: > On 23 August 2017 at 08:20, rymg19 at gmail.com wrote: > > TBH you're completely right. Every time I see someone using unittest > > andItsHorriblyUnpythonicNames, I want to kill a camel. > > > > Sometimes, though, I feel like part of the struggle is the alternative. > If > > you dislike unittest, but pytest is too "magical" for you, what do you > use? > > Many Python testing tools like nose are just test *runners*, so you still > > need something else. In the end, many just end up back at unittest, maybe > > with nose on top. > > A snake_case helper API for unittest that I personally like is > hamcrest, since that also separates out the definition of testing > assertions from being part of a test case: > https://pypi.python.org/pypi/PyHamcrest > > Introducing such a split natively into unittest is definitely > attractive, but would currently be difficult due to the way that some > features like self.maxDiff and self.subTest work. 
> > However, PEP 550's execution contexts may provide a way to track the > test state reliably that's independent of being a method on a test > case instance, in which case it would become feasible to offer a more > procedural interface in addition to the current visibly > object-oriented one. > If you have time, could you expand on that a little bit? > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > > -- > > --- > You received this message because you are subscribed to a topic in the > Google Groups "python-ideas" group. > To unsubscribe from this topic, visit > https://groups.google.com/d/topic/python-ideas/cF_4IlJq698/unsubscribe. > To unsubscribe from this group and all its topics, send an email to > python-ideas+unsubscribe at googlegroups.com. > For more options, visit https://groups.google.com/d/optout. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Wed Aug 23 20:14:53 2017 From: chris.barker at noaa.gov (Chris Barker) Date: Wed, 23 Aug 2017 17:14:53 -0700 Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase In-Reply-To: References: Message-ID: On Tue, Aug 22, 2017 at 7:05 PM, Neil Girdhar wrote: > Like you, I used nose and then switched to pytest. The reason I proposed > this for unittest is because pytest and nose and (I think) most of the > other testing frameworks inherit from unittest, not really -- they extend unittest -- in the sense that their test runners can be used with unittest TestCases -- but they don't depend on unittest. > so improving unittest has downstream benefits. only to those using unittest -- a lot of folks do use pytest or nose primarily as a test runner, so those folks would benefit. > I may nevertheless propose this to the pytest people if this doesn't make > it into unittest. Anyway, I'm just being a curmudgeon -- if folks think it would be useful and not disruptive, then why not? -CHB > On Tue, Aug 22, 2017 at 8:26 PM Chris Barker > wrote: >> On Tue, Aug 22, 2017 at 5:19 PM, Chris Barker >> wrote: >>> anyway, that's enough ranting..... >>> >> >> Got carried away with the ranting, and didn't flesh out my point. >> >> My point is that unittest is a very static, not very pythonic framework >> -- if you are productive with it, great, but I don't think it's worth >> trying to add more pythonic niceties to. Chances are pytest (Or nose2?) may >> already have them, or, if not, the simpler structure of pytest tests make >> them easier to write yourself.
>> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> https://mail.python.org/mailman/listinfo/python-ideas >> Code of Conduct: http://python.org/psf/codeofconduct/ >> >> -- >> >> --- >> You received this message because you are subscribed to a topic in the >> Google Groups "python-ideas" group. >> To unsubscribe from this topic, visit https://groups.google.com/d/ >> topic/python-ideas/cF_4IlJq698/unsubscribe. >> To unsubscribe from this group and all its topics, send an email to >> python-ideas+unsubscribe at googlegroups.com. >> For more options, visit https://groups.google.com/d/optout. >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> https://mail.python.org/mailman/listinfo/python-ideas >> Code of Conduct: http://python.org/psf/codeofconduct/ >> >> -- >> >> --- >> You received this message because you are subscribed to a topic in the >> Google Groups "python-ideas" group. >> To unsubscribe from this topic, visit https://groups.google.com/d/ >> topic/python-ideas/cF_4IlJq698/unsubscribe. >> To unsubscribe from this group and all its topics, send an email to >> python-ideas+unsubscribe at googlegroups.com. >> For more options, visit https://groups.google.com/d/optout. >> > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > https://mail.python.org/mailman/listinfo/python-ideas > Code of Conduct: http://python.org/psf/codeofconduct/ > > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Aug 23 20:26:19 2017 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 23 Aug 2017 17:26:19 -0700 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: On Wed, Aug 23, 2017 at 8:41 AM, Guido van Rossum wrote:
> If we're extending the analogy with thread-locals we should at least
> consider making each instantiation return a namespace rather than something
> holding a single value. We have
>
> log_state = threading.local()
> log_state.verbose = False
>
> def action(x):
>     if log_state.verbose:
>         print(x)
>
> def make_verbose():
>     log_state.verbose = True
>
> It would be nice if we could upgrade this to make it PEP 550-aware so that
> only the first line needs to change:
>
> log_state = sys.AsyncLocal("log state")
> # The rest is the same

You can mostly implement this on top of the current PEP 550. Something like:

_tombstone = object()

class AsyncLocal:
    def __getattribute__(self, name):
        # if this raises AttributeError, we let it propagate
        key = object.__getattribute__(self, name)
        value = key.get()
        if value is _tombstone:
            raise AttributeError(name)
        return value

    def __setattr__(self, name, value):
        try:
            key = object.__getattribute__(self, name)
        except AttributeError:
            with some_lock:
                # double-checked locking pattern
                try:
                    key = object.__getattribute__(self, name)
                except AttributeError:
                    key = new_context_key()
                    object.__setattr__(self, name, key)
        key.set(value)

    def __delattr__(self, name):
        self.__setattr__(name, _tombstone)

    def __dir__(self):
        # filter out tombstoned values
        return [name for name in object.__dir__(self)
                if hasattr(self, name)]

Issues: Minor problem: On threading.local you can use .__dict__ to get the dict. That doesn't work here. But this could be done by returning a mapping proxy type, or maybe it's better not to support at all -- I don't think it's a big issue. Major problem: An attribute setting/getting API doesn't give any way to solve the save/restore problem [1]. PEP 550 v3 doesn't have a solution to this yet either, but we know we can do it by adding some methods to context-key. Supporting this in AsyncLocal is kinda awkward, since you can't use methods on the object -- I guess you could have some staticmethods, like AsyncLocal.save_state(my_async_local, name) and AsyncLocal.restore_state(my_async_local, name, value)?
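A minimal sketch of what those staticmethods could look like, added here for concreteness (this is not part of Nathaniel's message; new_context_key() is the draft PEP 550 API his code assumes, so a trivial single-slot stub stands in for it below purely so the example runs):

```
class _StubContextKey:
    # Stand-in for a draft PEP 550 ContextKey: a single get()/set() slot.
    # The real object would resolve values against the current execution
    # context; this placeholder exists only to make the sketch runnable.
    def __init__(self):
        self._value = None

    def get(self):
        return self._value

    def set(self, value):
        self._value = value


def new_context_key():
    return _StubContextKey()


class AsyncLocalWithSaveRestore:
    # One ContextKey per attribute, as in the AsyncLocal sketch above.
    def __setattr__(self, name, value):
        try:
            key = object.__getattribute__(self, name)
        except AttributeError:
            key = new_context_key()
            object.__setattr__(self, name, key)
        key.set(value)

    def __getattribute__(self, name):
        return object.__getattribute__(self, name).get()

    # staticmethods, since instance attribute access is intercepted by
    # __getattribute__ and would return stored values, not bound methods
    @staticmethod
    def save_state(local, name):
        return object.__getattribute__(local, name).get()

    @staticmethod
    def restore_state(local, name, value):
        object.__getattribute__(local, name).set(value)


log_state = AsyncLocalWithSaveRestore()
log_state.verbose = True
saved = AsyncLocalWithSaveRestore.save_state(log_state, "verbose")
log_state.verbose = False
AsyncLocalWithSaveRestore.restore_state(log_state, "verbose", saved)
assert log_state.verbose is True
```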
In any case this kinda spoils the sense of like "oh it's just an object with attributes, I already know how this works". Major problem: There are two obvious implementations. The above uses a separate ContextKey for each entry in the dict; the other way would be to have a single ContextKey that holds a dict. They have subtly different semantics. Suppose you have a generator and inside it you assign to my_async_local.a but not to my_async_local.b, then yield, and then the caller assigns to my_async_local.b. Is this visible inside the generator? In the ContextKey-holds-an-attribute approach, the answer is "yes": each AsyncLocal is a bag of independent attributes. In the ContextKey-holds-a-dict approach, the answer is "no": each AsyncLocal is a single container holding a single piece of (complex) state. It isn't obvious to me which of these semantics is preferable -- maybe it is if you're Dutch :-). But there's a danger that either option leaves a bunch of people confused. (Tangent: in the ContextKey-holds-a-dict approach, currently you have to copy the dict before mutating it every time, b/c PEP 550 currently doesn't provide a way to tell whether the value returned by get() came from the top of the stack, and thus is private to you and can be mutated in place, or somewhere deeper, and thus is shared and shouldn't be mutated. But we should fix that anyway, and anyway copy-then-mutate is a viable approach.) Observation: I don't think there's any simpler way to implement AsyncLocal other than to start with machinery like what PEP 550 already proposes, and then layer something like the above on top of it. We could potentially hide the layers inside the interpreter and only expose AsyncLocal, but I don't think it really simplifies the implementation any. Observation: I feel like many users of threading.local -- possibly the majority -- only put a single attribute on each object anyway, so for those users a raw ContextKey API is actually more natural and faster. For example, looking through the core django repo, I see thread locals in
- django.utils.timezone._active
- django.utils.translation.trans_real._active
- django.urls.base._prefixes
- django.urls.base._urlconfs
- django.core.cache._caches
- django.urls.resolvers.RegexURLResolver._local
- django.contrib.gis.geos.prototypes.threadsafe.thread_context
- django.contrib.gis.geos.prototypes.io.thread_context
- django.db.utils.ConnectionHandler._connections
Of these 9 thread-local objects, 7 of them have only a single attribute; only the last 2 use multiple attributes. For the first 4, that attribute is even called "value", which seems like a pretty clear indication that the authors found the whole local-as-namespace thing a nuisance to work around rather than something helpful. I also looked at asyncio; it has 2 threading.locals, and they each contain 2 attributes. But the two attributes are always read/written together; to me it would feel more natural to model this as a single ContextKey holding a small dict or tuple instead of something like AsyncLocal. So tl;dr: I think PEP 550 should just focus on a single object per key, and the subgroup of users who want to convert that to a more threading.local-style interface can do that themselves as efficiently as we could, once they've decided how they want to resolve the semantic issues. -n [1] https://github.com/njsmith/pep-550-notes/blob/master/dynamic-scope-on-top-of-pep-550-draft-2.py -- Nathaniel J.
Smith -- https://vorpus.org From yselivanov.ml at gmail.com Wed Aug 23 20:36:38 2017 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Wed, 23 Aug 2017 20:36:38 -0400 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: There's another "major" problem with threading.local()-like API for PEP 550: C API. threading.local() in C right now is PyThreadState_GetDict(), which returns a dictionary for the current thread, that can be queried/modified with PyDict_* functions. For PEP 550 this would not work. The advantage of the current ContextKey solution is that the Python API and C API are essentially the same: [1] Another advantage is that ContextKey implements better caching, because it can have only one value cached in it, see [2] for details. [1] https://www.python.org/dev/peps/pep-0550/#new-apis [2] https://www.python.org/dev/peps/pep-0550/#contextkey-get-cache Yury From turnbull.stephen.fw at u.tsukuba.ac.jp Wed Aug 23 22:13:53 2017 From: turnbull.stephen.fw at u.tsukuba.ac.jp (Stephen J. Turnbull) Date: Thu, 24 Aug 2017 11:13:53 +0900 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> Message-ID: <22942.13921.914647.971532@turnbull.sk.tsukuba.ac.jp> Chris Angelico writes: > If you're worried about the latter, don't use httpimport. I guarantee you that in my (university) environment, if httpimport is in the stdlib, its use will be rampant (and not just by students, but by security-oblivious faculty). I want to be able to walk up to a student, say "may I?" and type "python -m httpimport" to determine if that particular risky behavior is a worry. Because *I'm* liable for my students' PCs' behavior on the network. Personally speaking, +1 on PyPI, -100 on stdlib. Steve From rosuav at gmail.com Wed Aug 23 22:23:31 2017 From: rosuav at gmail.com (Chris Angelico) Date: Thu, 24 Aug 2017 12:23:31 +1000 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: <22942.13921.914647.971532@turnbull.sk.tsukuba.ac.jp> References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> <22942.13921.914647.971532@turnbull.sk.tsukuba.ac.jp> Message-ID: On Thu, Aug 24, 2017 at 12:13 PM, Stephen J. Turnbull wrote: > Chris Angelico writes: > > > If you're worried about the latter, don't use httpimport. > > I guarantee you that in my (university) environment, if httpimport is > in the stdlib, its use will be rampant (and not just by students, but > by security-oblivious faculty). I want to be able to walk up to a > student, say "may I?" and type "python -m httpimport" to determine if > that particular risky behavior is a worry. Because *I'm* liable for > my students' PCs' behavior on the network. > > Personally speaking, +1 on PyPI, -100 on stdlib. Agreed, and a VERY good reason for this to be an explicitly-installed package. By its nature, it won't be a dependency of other packages, so keeping it out of the stdlib pretty much guarantees that it'll only be available if it's been called for by name. ChrisA From turnbull.stephen.fw at u.tsukuba.ac.jp Wed Aug 23 22:26:26 2017 From: turnbull.stephen.fw at u.tsukuba.ac.jp (Stephen J.
Turnbull) Date: Thu, 24 Aug 2017 11:26:26 +0900 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: <6cf767cc-e42f-23c5-3932-653aa285a4df@gmail.com> References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> <9dddff31-5d87-2be0-1d97-a662d5a1a784@gmail.com> <6cf767cc-e42f-23c5-3932-653aa285a4df@gmail.com> Message-ID: <22942.14674.486061.623216@turnbull.sk.tsukuba.ac.jp> John Torakis writes: > But, as it seems like it is a very big feature (to me at least), And "pip install httpimport" seems like it is a very small burden (to me at least). I agree with Paul Moore. Putting this in the stdlib seems both unnecessary, given pip, and an attractive nuisance for naive users. From the point of view of the blue team, checking for mere presence of httpimport in the environment is indicative of danger if it's pip-able, useless if it's in the stdlib. With respect to "it just makes exec(urlopen()) easier", any code must be audited for application of exec() to user input anyway, regardless of whether it fetches stuff off the Internet. Adding httpimport use to the checklist adds a little bit of complexity to *every* security check, and a fair amount of danger in security-oblivious environments such as many university labs, and I would imagine many corporate development groups as well. YMMV, but from the point of view of the larger, security-conscious organization, I would say -1. It's an attractive nuisance unless you're a security person, and then pip is not a big deal. Steve From guido at python.org Wed Aug 23 23:42:03 2017 From: guido at python.org (Guido van Rossum) Date: Wed, 23 Aug 2017 20:42:03 -0700 Subject: [Python-ideas] PEP 550 v2 In-Reply-To: References: Message-ID: OK, I get it now. I really liked the analysis of existing uses in Django. So no worries about this. On Wed, Aug 23, 2017 at 5:36 PM, Yury Selivanov wrote: > There's another "major" problem with threading.local()-like API for PEP > 550: C API. > > threading.local() in C right now is PyThreadState_GetDict(), which > returns a dictionary for the current thread, that can be > queried/modified with PyDict_* functions. For PEP 550 this would not > work. > > The advantage of the current ContextKey solution is that the Python > API and C API are essentially the same: [1] > > Another advantage is that ContextKey implements better caching, > because it can have only one value cached in it, see [2] for details. > > [1] https://www.python.org/dev/peps/pep-0550/#new-apis > [2] https://www.python.org/dev/peps/pep-0550/#contextkey-get-cache > > Yury > -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Thu Aug 24 05:20:48 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 24 Aug 2017 19:20:48 +1000 Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase In-Reply-To: References: Message-ID: On 24 August 2017 at 08:20, Neil Girdhar wrote: > On Wed, Aug 23, 2017 at 3:31 AM Nick Coghlan wrote: >> However, PEP 550's execution contexts may provide a way to track the >> test state reliably that's independent of being a method on a test >> case instance, in which case it would become feasible to offer a more >> procedural interface in addition to the current visibly >> object-oriented one. > > If you have time, could you expand on that a little bit?
unittest.TestCase provides a few different "config setting" type attributes that affect how failures are reported:
- self.maxDiff (length limit for rich diffs)
- self.failureException (exception used to report errors)
- self.longMessage (whether custom messages replace or supplement the default ones)
There are also introspection methods about the currently running test:
- self.id() (currently running test ID)
- self.shortDescription() (test description)
And some stateful utility functions:
- self.addSubTest() (tracks subtest results)
- self.addCleanup() (tracks resource cleanup requests)
At the moment, these are all passed in to test methods as a piece of explicit context (the "self" attribute), and that's what makes it hard to refactor unittest to support standalone top-level test functions and standalone assertion functions: there's currently no way to make those settings and operations available implicitly instead. That all changes if there's a robust way for the unittest module to track the "active test case" that owns the currently running test method without passing the test case reference around explicitly:
- existing assertion & helper methods can be wrapped with independently importable snake_case functions that look for the currently active test case and call the relevant methods on it
- new assertion functions can be added to separate modules rather than adding yet more methods to TestCase (see https://bugs.python.org/issue18054 for some discussion of that)
- given the above enhancements, the default test loader could usefully gain support for top level function definitions (by wrapping them in autogenerated test case instances)
Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ncoghlan at gmail.com Thu Aug 24 05:47:24 2017 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 24 Aug 2017 19:47:24 +1000 Subject: [Python-ideas] Remote package/module imports through HTTP/S In-Reply-To: References: <58b95461-9c9e-23bf-3167-548545584fdf@gmail.com> <801b5892-8e74-2cd8-93a9-4f082e2cfe3b@gmail.com> <572c0d31-1cfe-ac01-cfd4-0a64dadf88f9@gmail.com> <9dddff31-5d87-2be0-1d97-a662d5a1a784@gmail.com> Message-ID: On 24 August 2017 at 05:04, John Torakis wrote: > Dark times... > > So is it a "case closed", or is there any improvement that will make it > worth it to be a stdlib module? Not really, as even aside from the security concerns, there are simply too many ways that it can fail that are outside of our control, but would potentially lead to folks filing bug reports against CPython without realising that the problem actually lies somewhere else (e.g. with their network configuration). For a third party module, that's not a problem:
- folks have to find out httpimport exists
- folks have to decide "I want this"
- folks have to explicitly install & enable it
- folks still get to keep all the very shiny pieces when it breaks unexpectedly, but they also already know where to go for help :)
Being a third party utility means you can also update it on your own timeline, rather than being limited to the standard library's relatively slow update and rollout cycles.
From a compatibility point of view, we also *like* having sophisticated import system plugins like httpimport out in the wild, as it means:
- it actually makes sense to define & maintain the import plugin APIs that make it possible
- there's additional integration testing of those APIs happening beyond our own test suite
Putting away my import system co-maintainer hat and donning my commercial redistributor hat: it already bothers some of our (and our customers') security folks that we ship package installation tools that access unfiltered third party package repositories by default (e.g. pip defaulting to querying PyPI). As a result, I'm pretty sure that even if upstream said "httpimport is in the Python standard library now!", we'd get explicit requests asking us to take it out of our redistributed version and make it at most an optional install (similar to what we do with IDLE and Tcl/Tk support in general). Cheers, Nick. P.S. As a potentially useful point of reference: "it's hard to debug when it breaks" is the main reason we resisted adding native lazy import support for so long, and that's just a matter of moving import errors away from the import statement and instead raising them as a side effect of an attribute access. It's also why we moved reload() *out* of the builtins in the move to Python 3: while module reloading is a fully supported operation, it also has a lot of subtleties that make it easy to get wrong. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From mistersheik at gmail.com Thu Aug 24 05:50:05 2017 From: mistersheik at gmail.com (Neil Girdhar) Date: Thu, 24 Aug 2017 09:50:05 +0000 Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase In-Reply-To: References: Message-ID: Makes sense. Thanks! On Thu, Aug 24, 2017 at 5:20 AM Nick Coghlan wrote: > On 24 August 2017 at 08:20, Neil Girdhar wrote: > > On Wed, Aug 23, 2017 at 3:31 AM Nick Coghlan wrote: > >> However, PEP 550's execution contexts may provide a way to track the > >> test state reliably that's independent of being a method on a test > >> case instance, in which case it would become feasible to offer a more > >> procedural interface in addition to the current visibly > >> object-oriented one. > > > > If you have time, could you expand on that a little bit? > > unittest.TestCase provides a few different "config setting" type > attributes that affect how failures are reported: > > - self.maxDiff (length limit for rich diffs) > - self.failureException (exception used to report errors) > - self.longMessage (whether custom messages replace or supplement the > default ones) > > There are also introspection methods about the currently running test: > > - self.id() (currently running test ID) > - self.shortDescription() (test description) > > And some stateful utility functions: > > - self.addSubTest() (tracks subtest results) > - self.addCleanup() (tracks resource cleanup requests) > > At the moment, these are all passed in to test methods as a piece of > explicit context (the "self" attribute), and that's what makes it hard > to refactor unittest to support standalone top-level test functions > and standalone assertion functions: there's currently no way to > make those settings and operations available implicitly > instead. > > That all changes if there's a robust way for the unittest module to > track the "active test case" that owns the currently running test > method without passing the test case reference around explicitly: > > - existing assertion & helper methods can be wrapped with > independently importable snake_case functions that look for the > currently active test case and call the relevant methods on it > - new assertion functions can be added to separate modules rather than > adding yet more methods to TestCase (see > https://bugs.python.org/issue18054 for some discussion of that) > - given the above enhancements, the default test loader could usefully > gain support for top level function definitions (by wrapping them in > autogenerated test case instances) > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > -------------- next part -------------- An HTML attachment was scrubbed... URL: From barry at python.org Thu Aug 24 10:04:58 2017 From: barry at python.org (Barry Warsaw) Date: Thu, 24 Aug 2017 10:04:58 -0400 Subject: [Python-ideas] PEP 550 dumbed down In-Reply-To: References: Message-ID: Jim J. Jewett wrote: > I know I'm not the only one who is confused by at least some of the > alternative terminology choices. I suspect I'm not the only one who > sometimes missed part of the argument because I was distracted > figuring out what the objects were, and forgot to verify what was > being done and why. I also suspect that it could be much simpler to > follow if the API were designed in the abstract, with the > implementation left for later. You're definitely not alone! I think I get the gist of the proposal, and its motivation, but I'm definitely confused by the terminology. As
>
> That all changes if there's a robust way for the unittest module to
> track the "active test case" that owns the currently running test
> method without passing the test case reference around explicitly:
>
> - existing assertion & helper methods can be wrapped with
> independently importable snake_case functions that look for the
> currently active test case and call the relevant methods on it
> - new assertion functions can be added to separate modules rather than
> adding yet more methods to TestCase (see
> https://bugs.python.org/issue18054 for some discussion of that)
> - given the above enhancements, the default test loader could usefully
> gain support for top level function definitions (by wrapping them in
> autogenerated test case instances)
>
> Cheers,
> Nick.
>
> --
> Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From barry at python.org  Thu Aug 24 10:04:58 2017
From: barry at python.org (Barry Warsaw)
Date: Thu, 24 Aug 2017 10:04:58 -0400
Subject: [Python-ideas] PEP 550 dumbed down
In-Reply-To: 
References: 
Message-ID: 

Jim J. Jewett wrote:
> I know I'm not the only one who is confused by at least some of the
> alternative terminology choices.  I suspect I'm not the only one who
> sometimes missed part of the argument because I was distracted
> figuring out what the objects were, and forgot to verify what was
> being done and why.  I also suspect that it could be much simpler to
> follow if the API were designed in the abstract, with the
> implementation left for later.

You're definitely not alone!  I think I get the gist of the proposal,
and its motivation, but I'm definitely confused by the terminology.  As
I stated elsewhere, the word "context" has a well-established meaning
in Python, with context managers, their protocols, and contextlib.
When talking with another Pythonista three years from now, I don't
want to have to resolve which context they're talking about based on
context. ;)

I think you have a point too about designing the abstract behavior and
API first, and then worrying about implementation details (in fact,
maybe take implementation discussions out of the PEP for now, and hash
that out in a PR).

I also think you're on to something when you suggest that sys may not
be the best place for these new APIs.  sys is already a mishmash of
lots of random stuff, and the concepts defined in PEP 550 are advanced
enough that many Python developers will never need to worry about
them.  Putting them in sys leads to cognitive overload.  I'm not sure
I'd put them in builtins either, but a new module makes a lot of sense
to me.  Plus, it means that we can choose more natural names for the
APIs since they'll be namespaced away in a separate module.

Cheers,
-Barry

From stefan at bytereef.org  Thu Aug 24 10:53:16 2017
From: stefan at bytereef.org (Stefan Krah)
Date: Thu, 24 Aug 2017 16:53:16 +0200
Subject: [Python-ideas] PEP 550 dumbed down
In-Reply-To: 
References: 
Message-ID: <20170824145316.GA2654@bytereef.org>

On Thu, Aug 24, 2017 at 10:04:58AM -0400, Barry Warsaw wrote:
> Jim J. Jewett wrote:
> > I know I'm not the only one who is confused by at least some of the
> > alternative terminology choices.  I suspect I'm not the only one who
> > sometimes missed part of the argument because I was distracted
> > figuring out what the objects were, and forgot to verify what was
> > being done and why.
> > I also suspect that it could be much simpler to
> > follow if the API were designed in the abstract, with the
> > implementation left for later.
>
> You're definitely not alone!  I think I get the gist of the proposal,
> and its motivation, but I'm definitely confused by the terminology.  As
> I stated elsewhere, the word "context" has a well-established meaning in
> Python, with context managers, their protocols, and contextlib.  When
> talking with another Pythonista three years from now, I don't want to
> have to resolve which context they're talking about based on context. ;)

I'm not happy about "context" either. I'd prefer something more
pedantic, like: TaskLocalStorage, TaskLocalStorageStack, even when
generators aren't tasks. At least that's what people are used to from
ThreadLocalStorage.

The .NET terminology is explained here:

https://blogs.msdn.microsoft.com/pfxteam/2012/06/15/executioncontext-vs-synchronizationcontext/

But that is more of an OO approach --- there are more "subclasses" of
ExecutionContexts like SecurityContext, HostExecutionContext,
CallContext, and there's colorful terminology like "flowing the
Execution Context".

Stefan Krah

From robertc at robertcollins.net  Fri Aug 25 01:28:22 2017
From: robertc at robertcollins.net (Robert Collins)
Date: Fri, 25 Aug 2017 17:28:22 +1200
Subject: [Python-ideas] Please consider adding context manager versions of setUp/tearDown to unittest.TestCase
In-Reply-To: 
References: 
Message-ID: 

So (wearing my maintainer hat for unittest) - very happy to consider
proposals and patches; I'd very much like to fix some structural APIs
in unittest, but I don't have the bandwidth to do so myself at this
point. And what you're asking about is largely a structural issue,
because of the interactions with test reporting and with class/module
setup.

As Ned says though, the specific question asked is best solved by
using the context manager protocol and manually entering and exiting:
addCleanup is ideal (literally designed for this) for managing that
(see also the plain-unittest sketch of that pattern below). The
fixtures library uses this to make use of fixtures (which are merely
enhanced context managers) trivial. We should add an adapter there I
think. If I get time I'll put this on stackexchange but:

```
import unittest
import fixtures

class ContextFixture(fixtures.Fixture):
    def __init__(self, cm):
        super().__init__()
        self._cm = cm

    def _setUp(self):
        self.addCleanup(self._cm.__exit__, None, None, None)
        self._cm.__enter__()

class Example(fixtures.TestWithFixtures):
    def setUp(self):
        super().setUp()
        self._cm_reference_if_I_need_it = self.useFixture(
            ContextFixture(MyContextManager()))

    def test_fred(self):
        1/0
```

should (I haven't tested it :P) do the right thing.

I've written about maintainability in unittest previously [1] [2], and
those experiments have worked very well. Your post has reminded me of
the stalled work in this space. In particular, avoiding inheritance
for code reuse has much better maintenance properties. I think we
learnt enough to sensibly propose it as an evolution for core
unittest, though some discussion is needed: for instance, the MIME
attachment aspect weirds some folk out, though it's very, very useful
in the cases where it matters, and pretty ignorable in the cases where
it doesn't.
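For completeness, the same idea without the fixtures dependency - a
minimal sketch using only unittest's own addCleanup (MyContextManager
is again a stand-in for whatever context manager you want held open
across a test; untested):

```
import unittest

class Example(unittest.TestCase):
    def setUp(self):
        cm = MyContextManager()  # stand-in for any context manager
        # Enter now, and guarantee __exit__ runs even if setUp itself
        # fails later on - this is exactly what addCleanup is for.
        self.resource = cm.__enter__()
        self.addCleanup(cm.__exit__, None, None, None)

    def test_uses_resource(self):
        self.assertIsNotNone(self.resource)
```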
Another related thing is getting testresources' awkward bits fixed so
that it becomes a joy to use - it's a much better approach than class
and module setup, if for no other reason than that it is concurrency
friendly [partition and execute], whereas what was put into unittest
isn't unless you also isolate the modules and class instances, which
effectively requires processes.

Lastly, the broad overarching refactor I'd like to do is twofold:

- completely separate the different users of TestCase: the test
executor, the report, and the test author are in the same namespace
today and it's super awkward. Adding a new executor-only interface at
e.g. case._executor would allow test authors to have much more freedom
about what they override and don't, without worrying about
interactions with the test running framework. Moving all the reporting
back up to the executor as a thunk would decouple the reporting logic
from the internals of the test case, allowing for the elimination of
placeholder objects as glue between different test systems.

- tweaking the existing pseudo-streaming contracts for the executor to
be more purely forward-flow only, aware of concurrency, and more
detailed - e.g. providing a means for tests to emit metrics like
"setting up this database took 10 seconds" and have that
discarded-or-captured-if-the-reporter-supports-it would be very useful
in larger test systems. Right now everyone that does this does it in a
bespoke fashion.

re: hamcrest - love it. That's what testtools.matchers were inspired
by. But we go a bit further, I think, in useful ways.

Lastly, pytest - it's beautiful, great community, some bits that I
will never see eye to eye on :). Use it and enjoy, or not - whatever
works for you :)

-Rob

1: https://rbtcollins.wordpress.com/2010/05/10/maintainable-pyunit-test-suites/
2: https://rbtcollins.wordpress.com/2010/09/18/maintainable-pyunit-test-suites-fixtures/

On 24 August 2017 at 21:50, Neil Girdhar wrote:
> Makes sense.  Thanks!
>
> On Thu, Aug 24, 2017 at 5:20 AM Nick Coghlan wrote:
>>
>> On 24 August 2017 at 08:20, Neil Girdhar wrote:
>> > On Wed, Aug 23, 2017 at 3:31 AM Nick Coghlan wrote:
>> >> However, PEP 550's execution contexts may provide a way to track the
>> >> test state reliably that's independent of being a method on a test
>> >> case instance, in which case it would become feasible to offer a more
>> >> procedural interface in addition to the current visibly
>> >> object-oriented one.
>> >
>> > If you have time, could you expand on that a little bit?
>>
>> unittest.TestCase provides a few different "config setting" type
>> attributes that affect how failures are reported:
>>
>> - self.maxDiff (length limit for rich diffs)
>> - self.failureException (exception used to report errors)
>> - self.longMessage (whether custom messages replace or supplement the
>> default ones)
>>
>> There are also introspection methods about the currently running test:
>>
>> - self.id() (currently running test ID)
>> - self.shortDescription() (test description)
>>
>> And some stateful utility functions:
>>
>> - self.addSubTest() (tracks subtest results)
>> - self.addCleanup() (tracks resource cleanup requests)
>>
>> At the moment, these are all passed in to test methods as a piece of
>> explicit context (the "self" attribute), and that's what makes it hard
>> to refactor unittest to support standalone top-level test functions
>> and standalone assertion functions: there's currently no way to make
>> those settings and operations available implicitly instead.
>>
>> That all changes if there's a robust way for the unittest module to
>> track the "active test case" that owns the currently running test
>> method without passing the test case reference around explicitly:
>>
>> - existing assertion & helper methods can be wrapped with
>> independently importable snake_case functions that look for the
>> currently active test case and call the relevant methods on it
>> - new assertion functions can be added to separate modules rather than
>> adding yet more methods to TestCase (see
>> https://bugs.python.org/issue18054 for some discussion of that)
>> - given the above enhancements, the default test loader could usefully
>> gain support for top level function definitions (by wrapping them in
>> autogenerated test case instances)
>>
>> Cheers,
>> Nick.
>>
>> --
>> Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> https://mail.python.org/mailman/listinfo/python-ideas
> Code of Conduct: http://python.org/psf/codeofconduct/

From francismb at email.de  Sat Aug 26 14:25:41 2017
From: francismb at email.de (francismb)
Date: Sat, 26 Aug 2017 20:25:41 +0200
Subject: [Python-ideas] Unittest error message failure context lazy creation
Message-ID: <154eb60a-1a97-8ddd-e6ee-fd3f6d72b997@email.de>

Hi all,
while using `unittest` I see the pattern of creating an error message
with the test context for the case where one of the `assert...`
methods fails (so as to get a good error message). Along the lines of:

class Test...(unittest.TestCase):

    longMessage = True

    def test_(self):
        ...
        for a, b, c, ... in zip(A, B, C, ...):
            # call the function under test and get the result
            msg = "Some headline: {}{} ...".format(a, b, c, ...)
            self.assert...(..., msg)

The `msg` is just used in case the assert fails, but its creation
takes time and adds up.

What is the best practice/pattern you use here? Or are there ideas for
a lazy mechanism that avoids the creation and only incurs it in the
case where the assert failed?

Thanks in advance!
--francis

From rosuav at gmail.com  Sat Aug 26 22:03:57 2017
From: rosuav at gmail.com (Chris Angelico)
Date: Sun, 27 Aug 2017 12:03:57 +1000
Subject: [Python-ideas] Unittest error message failure context lazy creation
In-Reply-To: <154eb60a-1a97-8ddd-e6ee-fd3f6d72b997@email.de>
References: <154eb60a-1a97-8ddd-e6ee-fd3f6d72b997@email.de>
Message-ID: 

On Sun, Aug 27, 2017 at 4:25 AM, francismb wrote:
> Hi all,
> while using `unittest` I see the pattern of creating an error message
> with the test context for the case where one of the `assert...`
> methods fails (so as to get a good error message). Along the lines of:
>
> class Test...(unittest.TestCase):
>
>     longMessage = True
>
>     def test_(self):
>         ...
>         for a, b, c, ... in zip(A, B, C, ...):
>             # call the function under test and get the result
>             msg = "Some headline: {}{} ...".format(a, b, c, ...)
>             self.assert...(..., msg)
>
> The `msg` is just used in case the assert fails, but its creation
> takes time and adds up.

Have you measured it, eg by replacing the message with a constant? By
what percentage does it speed up a successful test run?
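Something along these lines would give a ballpark figure (an
illustrative micro-benchmark only; the format string and values are
stand-ins for whatever your real tests build):

    import timeit

    setup = "a, b, c = 12345, 'spam', 3.14159"
    built = timeit.timeit(
        "msg = 'Some headline: {} {} {}'.format(a, b, c)",
        setup=setup, number=1000000)
    const = timeit.timeit(
        "msg = 'Some headline'",
        setup=setup, number=1000000)
    # The difference between these two bounds the possible win.
    print("built: %.3fs  constant: %.3fs" % (built, const))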
ChrisA

From steve at pearwood.info  Sat Aug 26 22:19:12 2017
From: steve at pearwood.info (Steven D'Aprano)
Date: Sun, 27 Aug 2017 12:19:12 +1000
Subject: [Python-ideas] Unittest error message failure context lazy creation
In-Reply-To: <154eb60a-1a97-8ddd-e6ee-fd3f6d72b997@email.de>
References: <154eb60a-1a97-8ddd-e6ee-fd3f6d72b997@email.de>
Message-ID: <20170827021911.GC9671@ando.pearwood.info>

On Sat, Aug 26, 2017 at 08:25:41PM +0200, francismb wrote:
> Hi all,
> while using `unittest` I see the pattern of creating an error message
> with the test context for the case where one of the `assert...`
> methods fails (so as to get a good error message). Along the lines of:
[...]
> The `msg` is just used in case the assert fails, but its creation
> takes time and adds up.
>
> What is the best practice/pattern you use here?

I think the best practice here is:

    The Rules of Optimization are simple. Rule 1: Don't do it. Rule 2
    (for experts only): Don't do it yet.
    -- Michael A. Jackson, "Principles of Program Design"

Personally, I doubt that the time spent creating the error message
will be anything more than an insignificant fraction of the total
time. Perhaps as much as 0.1%? But I've just plucked that number out
of thin air, so it's probably wrong. If you want to profile unittest
and see just how much time is spent creating error messages for tests
which pass, go right ahead and I'll be happy to be proven wrong.

Until somebody actually profiles the tests and demonstrates that
delaying creation of the error messages has the potential to speed up
unit testing by, oh, at least 5%, I'm sticking with "don't do it yet".

--
Steve

From francismb at email.de  Sun Aug 27 08:28:09 2017
From: francismb at email.de (francismb)
Date: Sun, 27 Aug 2017 14:28:09 +0200
Subject: [Python-ideas] Unittest error message failure context lazy creation
In-Reply-To: <154eb60a-1a97-8ddd-e6ee-fd3f6d72b997@email.de>
References: <154eb60a-1a97-8ddd-e6ee-fd3f6d72b997@email.de>
Message-ID: 

Hi Chris, Hi Steven,

> Have you measured it, eg by replacing the message with a constant? By
> what percentage does it speed up a successful test run?
How much counts as significant for `one` test? Now replace the zip
call with `itertools.product`, and build some of the messages by
concatenation (using ''.join()) over several levels of test statements
(contexts). It just adds up. But I understand your question/claim.

>> The Rules of Optimization are simple. Rule 1: Don't do it. Rule 2
>> (for experts only): Don't do it yet.
>> -- Michael A. Jackson, "Principles of Program Design"
I already follow those rules and measure first ;-) and yes, maybe I
should have formulated the question in a more general way:

What is the current status quo for lazy evaluation in the language,
and what are the current ideas for avoiding this type of case? Why
should one calculate something that is not going to be needed? Is
there a way to somehow "mark" a calculation plus its context to tell
the interpreter "not now"?

The only thing that comes to mind is to create some class that
captures the values, whose `__str__` method then does the calculation
for that case, but of course that makes the situation more complex.

Thanks for your feedback!
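P.S. For the record, a minimal sketch of that `__str__` idea (a
hypothetical `LazyMessage` helper; untested, and the formatting cost
still has to be paid if the assert ever fails):

    class LazyMessage:
        """Defer building an assertion message until it is shown."""
        def __init__(self, fmt, *args):
            self.fmt = fmt
            self.args = args

        def __str__(self):
            # Only runs if/when the message is actually rendered,
            # i.e. on assertion failure.
            return self.fmt.format(*self.args)

    # usage inside a test method:
    #   self.assertEqual(result, expected,
    #                    LazyMessage("Some headline: {} {}", a, b))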
--francis

From robert at shajil.de  Tue Aug 29 04:38:58 2017
From: robert at shajil.de (Robert Schindler)
Date: Tue, 29 Aug 2017 10:38:58 +0200
Subject: [Python-ideas] argparse.ArgumentParser: include arguments from files with relative paths
Message-ID: <20170829083858.wcznqngks4dfbkzj@efficiosoft.com>

Hello,

I often use ArgumentParser in my projects, as well as its ability to
read argument lists from files. However, the problem is that nested
includes of such argument files have to specify paths relative to
os.getcwd(), no matter where the file containing the include statement
is located.

Currently, this can be circumvented by always using absolute paths,
but imho that is not a practical solution, due to the obvious
portability issues it causes.

I suggest adding a new parameter to argparse.ArgumentParser that
controls the behaviour:

* fromfile_parent_relative - Whether to treat paths of included
  argument files as relative to the location of the file they are
  specified in (``True``) or to the current working directory
  (``False``) (default: ``False``)

Doing so would allow users to choose between the two different
strategies while keeping backwards compatibility.

I made a pull request [1] which adds the functionality + docs to
demonstrate a possible solution.

What do you think about this enhancement?

Please note this is my first contribution to cpython. I now know that
I should have presented it to python-ideas before starting a pull
request. Sorry for doing it the wrong way around.

Best regards
Robert

[1] https://github.com/python/cpython/pull/1698
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: not available
URL: 

From blaine.w.rogers at gmail.com  Tue Aug 29 04:58:51 2017
From: blaine.w.rogers at gmail.com (Blaine Rogers)
Date: Tue, 29 Aug 2017 09:58:51 +0100
Subject: [Python-ideas] Signature Literals
Message-ID: 

The current syntax for Callable types is unwieldy, particularly when
extended to include varargs and keyword args as in
http://mypy.readthedocs.io/en/latest/kinds_of_types.html#extended-callable-types.
Why not introduce a signature literal?
Proposed syntax:

>>> from inspect import Signature, Parameter
>>> () ->
Signature()
>>> (arg0, arg1, arg2=None, arg3=None) ->
Signature(
    [Parameter('arg0', Parameter.POSITIONAL_OR_KEYWORD),
     Parameter('arg1', Parameter.POSITIONAL_OR_KEYWORD),
     Parameter('arg2', Parameter.POSITIONAL_OR_KEYWORD, default=None),
     Parameter('arg3', Parameter.POSITIONAL_OR_KEYWORD, default=None)]
)
>>> (arg0, arg1: int, arg2=None, arg3: float=None) -> str
Signature(
    [Parameter('arg0', Parameter.POSITIONAL_OR_KEYWORD),
     Parameter('arg1', Parameter.POSITIONAL_OR_KEYWORD, annotation=int),
     Parameter('arg2', Parameter.POSITIONAL_OR_KEYWORD, default=None),
     Parameter('arg3', Parameter.POSITIONAL_OR_KEYWORD, annotation=float,
               default=None)],
    return_annotation=str
)
>>> (:, :, :, arg1, *, arg2) ->
Signature(
    [Parameter('', Parameter.POSITIONAL_ONLY),
     Parameter('', Parameter.POSITIONAL_ONLY),
     Parameter('', Parameter.POSITIONAL_ONLY),
     Parameter('arg1', Parameter.POSITIONAL_OR_KEYWORD),
     Parameter('arg2', Parameter.KEYWORD_ONLY)]
)
>>> (:int, :float, *, keyword: complex) -> str
Signature(
    [Parameter('', Parameter.POSITIONAL_ONLY, annotation=int),
     Parameter('', Parameter.POSITIONAL_ONLY, annotation=float),
     Parameter('keyword', Parameter.KEYWORD_ONLY, annotation=complex)],
    return_annotation=str
)

Compare the above to their equivalents using Callable (and the
experimental extension to Mypy):

>>> Callable[[], Any]
>>> Callable[[Arg(Any, 'arg0'), Arg(int, 'arg1'), DefaultArg(Any, 'arg2'),
              DefaultArg(float, 'arg3')], str]
>>> Callable[[Arg(), Arg(), Arg(), Arg(Any, 'arg1'),
              NamedArg(Any, 'arg2')], Any]
>>> Callable[[int, float, NamedArg(complex, 'keyword')], Any]

The proposed signature literal syntax is shorter, just as clear, and
imo nicer to read. Here is what it looks like in annotations:

from typing import TypeVar, Callable

A = TypeVar('A')
def apply_successor(func: Callable[[A], A], init: A,
                    n_applications: int) -> A: ...
def apply_successor(func: (:A) -> A, init: A,
                    n_applications: int) -> A: ...

import tensorflow as tf
import numpy as np

def run(policy: Callable[[np.ndarray,
                          Arg(Dict[tf.Tensor, np.ndarray],
                              'updated_feeds')],
                         np.ndarray]) -> bool: ...
def run(policy: (:np.ndarray, updated_feeds: Dict[tf.Tensor, np.ndarray])
                -> np.ndarray) -> bool: ...
# If Mypy accepted literals for container types (dict, set, list,
# tuple, etc.) this would be nicer still
def run(policy: (:np.ndarray, updated_feeds: {tf.Tensor: np.ndarray})
                -> np.ndarray) -> bool: ...

Initial thoughts:

- () -> is ugly, but the -> would be necessary to distinguish it from
the empty tuple (). Actually, it can be difficult to tell the
difference between the proposed signature literals and tuples,
especially for long signatures with no annotations or defaults. An
alternative would be to prefix the arguments with an @ or other
uncommon symbol (maybe &). () -> becomes @(), and it is clear from the
start that you're reading a signature.
- Supposing the syntax for function definitions was changed to match
the proposed signature literals, one could make something like the
following possible:

>>> def add(:, :):
...     arg0, arg1 = __call_signature__.args
...     return arg0 + arg1
>>> add(1, 2)
3
>>> add('hello', 'world')
'helloworld'

Where __call_signature__ is a magic name that evaluates to an
inspect.BoundArguments instance representing the signature of the
function call.
I'm not sure why you'd want functions with positional-only arguments,
but now you could have them.
- You could further extend the function definition syntax to allow an
expression that evaluates to a signature instead of a literal:

>>> signature = (:, :) ->
>>> def add signature:
...     arg0, arg1 = __call_signature__.args
...     return arg0 + arg1

Again, not sure how useful this would be.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ncoghlan at gmail.com  Tue Aug 29 06:06:43 2017
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 29 Aug 2017 20:06:43 +1000
Subject: [Python-ideas] Signature Literals
In-Reply-To: 
References: 
Message-ID: 

On 29 August 2017 at 18:58, Blaine Rogers wrote:
> The current syntax for Callable types is unwieldy, particularly when
> extended to include varargs and keyword args as in
> http://mypy.readthedocs.io/en/latest/kinds_of_types.html#extended-callable-types.
> Why not introduce a signature literal?

While a more concise spelling for that is desirable, it doesn't need
to be a literal, as it can be handled by updating the Callable item
lookup to accept a string literal that static type checkers know how
to parse.

The standard library already contains a runtime introspection variant
of this to pass function signature details from Argument Clinic up to
inspect.Signature as __text_signature__ attributes:
https://github.com/python/cpython/blob/master/Lib/inspect.py#L1938

The main reason "create signature object from text string" isn't a
public API yet is that it includes support for positional-only and
variable signatures that aren't supported by pure Python function
definitions (while https://www.python.org/dev/peps/pep-0457/ covers
the details of how that works, my recollection is that Guido was wary
of accepting an approved syntax in the absence of actual syntactic
support for them in function definitions):

from inspect import Signature, _signature_fromstr

def str_signature(sig):
    parsed_sig = _signature_fromstr(Signature, (lambda: None), sig)
    for param in parsed_sig.parameters.values():
        print("{}: {}".format(param, param.kind.name))

>>> str_signature("(a, b, /, c, d)")
a: POSITIONAL_ONLY
b: POSITIONAL_ONLY
c: POSITIONAL_OR_KEYWORD
d: POSITIONAL_OR_KEYWORD

Using PEP 457 syntax and a string literal, the MyPy extended callable
syntax would look like:

def func(__a: int,  # This convention is for nameless arguments
         b: int,
         c: int = 0,
         *args: int,
         d: int,
         e: int = 0,
         **kwargs: int) -> int:
    ...

F = Callable[
    """(
        a: int, /,
        b: int, c: int = 0, *args: int,
        d: int, e: int = 0, **kwargs: int
    )""",
    int]

Ideally, the runtime implementation of that would *skip* parsing the
signature, and instead just keep the string around for on-demand
parsing.

Cheers,
Nick.

--
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From levkivskyi at gmail.com  Wed Aug 30 07:38:55 2017
From: levkivskyi at gmail.com (Ivan Levkivskyi)
Date: Wed, 30 Aug 2017 13:38:55 +0200
Subject: [Python-ideas] Signature Literals
In-Reply-To: 
References: 
Message-ID: 

Hi Blaine,

A similar idea has been discussed at the typing tracker, see
https://github.com/python/typing/issues/239, but finally we went with
the current syntax. It has several advantages, such as:

* It does not require new syntax, i.e. it can be backported to older
Python versions
* Possibility to define generic aliases without too much metaclass
magic (see the sketch after this list)
* Easier to search and ask questions on stackoverflow etc.
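To illustrate the generic-alias point, a small sketch (the Reducer
alias and fold function are made up for illustration, and this matches
my understanding of typing's subscription behaviour, so treat it as an
assumption rather than gospel):

    from typing import Callable, TypeVar

    T = TypeVar('T')

    # A generic alias built with the existing subscription syntax; no
    # new grammar is needed, so it also works on older Python versions.
    Reducer = Callable[[T, T], T]

    def fold(combine: Reducer[int], values, start: int) -> int:
        total = start
        for value in values:
            total = combine(total, value)
        return total

    print(fold(lambda x, y: x + y, [1, 2, 3], 0))  # prints 6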
In general, there is quite a high bar for modifying Python syntax, so
if other options are available, they will be preferred.

--
Ivan
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From k7hoven at gmail.com  Wed Aug 30 08:31:05 2017
From: k7hoven at gmail.com (Koos Zevenhoven)
Date: Wed, 30 Aug 2017 15:31:05 +0300
Subject: [Python-ideas] Signature Literals
In-Reply-To: 
References: 
Message-ID: 

On Tue, Aug 29, 2017 at 1:06 PM, Nick Coghlan wrote:

> On 29 August 2017 at 18:58, Blaine Rogers wrote:
> > The current syntax for Callable types is unwieldy, particularly when
> > extended to include varargs and keyword args as in
> > http://mypy.readthedocs.io/en/latest/kinds_of_types.html#extended-callable-types.
> > Why not introduce a signature literal?
>
> While a more concise spelling for that is desirable, it doesn't need
> to be a literal, as it can be handled by updating the Callable item
> lookup to accept a string literal that static type checkers know how
> to parse.
>
> The standard library already contains a runtime introspection variant
> of this to pass function signature details from Argument Clinic up to
> inspect.Signature as __text_signature__ attributes:
> https://github.com/python/cpython/blob/master/Lib/inspect.py#L1938
>
> The main reason "create signature object from text string" isn't a
> public API yet is that it includes support for positional-only and
> variable signatures that aren't supported by pure Python function
> definitions (while https://www.python.org/dev/peps/pep-0457/ covers
> the details of how that works, my recollection is that Guido was wary
> of accepting an approved syntax in the absence of actual syntactic
> support for them in function definitions)
>

How about: def func(a: int?, b: str) -> float: ...   ? ;-)

-- Koos

> from inspect import Signature, _signature_fromstr
>
> def str_signature(sig):
>     parsed_sig = _signature_fromstr(Signature, (lambda: None), sig)
>     for param in parsed_sig.parameters.values():
>         print("{}: {}".format(param, param.kind.name))
>
> >>> str_signature("(a, b, /, c, d)")
> a: POSITIONAL_ONLY
> b: POSITIONAL_ONLY
> c: POSITIONAL_OR_KEYWORD
> d: POSITIONAL_OR_KEYWORD
>
> Using PEP 457 syntax and a string literal, the MyPy extended callable
> syntax would look like:
>
> def func(__a: int,  # This convention is for nameless arguments
>          b: int,
>          c: int = 0,
>          *args: int,
>          d: int,
>          e: int = 0,
>          **kwargs: int) -> int:
>     ...
>
> F = Callable[
>     """(
>         a: int, /,
>         b: int, c: int = 0, *args: int,
>         d: int, e: int = 0, **kwargs: int
>     )""",
>     int]
>
> Ideally, the runtime implementation of that would *skip* parsing the
> signature, and instead just keep the string around for on-demand
> parsing.
>
> Cheers,
> Nick.
>

--
+ Koos Zevenhoven + http://twitter.com/k7hoven +
-------------- next part --------------
An HTML attachment was scrubbed...
URL: