From mwm at mired.org  Tue Nov  1 00:19:01 2011
From: mwm at mired.org (Mike Meyer)
Date: Mon, 31 Oct 2011 16:19:01 -0700
Subject: [Python-ideas] Concurrent safety?
In-Reply-To:
References: <20111030201143.481fdca2@bhuda.mired.org>
Message-ID:

On Mon, Oct 31, 2011 at 3:58 PM, Bruce Leban wrote:
> On Sun, Oct 30, 2011 at 8:11 PM, Mike Meyer wrote:
>
>> Any attempt to mutate an object that isn't currently locked will raise
>> an exception. Possibly ValueError, possibly a new exception class just
>> for this purpose. This includes rebinding attributes of objects that
>> aren't locked.
>
> Do you mean that at any time attempting to mutate an unlocked object
> throws an exception?

Yes, that's the idea. There are some exceptions, but you have to
explicitly work around that restriction.

> That would mean that all of my current code is broken.

Pretty much, yes. It's like adding garbage collection and removing
alloc*/free. It's going to break a *lot* of code. That's why I said "not
in 3.x and possibly never in CPython."

> Do you mean that, inside the control of 'locking', you can't mutate an
> unlocked object? That still breaks lots of code that is safe. You can't
> use itertools.cycle anymore until that's updated in a completely
> unnecessary way:
>
>     def cycle(iterable):
>         # cycle('ABCD') --> A B C D A B C D A B C D ...
>         saved = []
>         for element in iterable:
>             yield element
>             saved.append(element)  # throws an exception when called on a locked iterable
>         while saved:
>             for element in saved:
>                 yield element

According to what I wrote, yes, it does. Since the list being mutated is
only visible inside the function, it doesn't need to be. It might be
possible to figure out that this is the case at compile time and thus
allow the code to run unmodified, but that's 1) hard, 2) will miss some
cases, and 3) seems like a corner case. This proposal would break enough
code that not breaking this case doesn't seem to be worth the effort -
though whether it is worth the effort is a question that needs to be
answered.

> I think the semantics of this need to be tightened up.

That's why I brought it up. I'm trying to get more eyes on the issue.

> Furthermore, merely *reading* an object that isn't locked can cause
> problems. This code is not thread-safe:
>
>     if element in dictionary: return dictionary[element]
>
> so you have to decide how much safety you want and what cost we're
> willing to pay for this.

You're right - it's not thread-safe. However, it also doesn't suffer
from the problem I'm trying to deal with, where you mutate an object in
a way that leaves things broken, but won't be detected at that point. If
it breaks because someone mutates the object underneath it, it'll throw
an exception at that point. I know you can construct cases where that
isn't so. Maybe we need two types of locking - one that allows readers,
and one that doesn't. I could live with that, as you'd still have to
consider the issue where you mutate the object.
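To make that concrete, here is a minimal sketch of the behaviour being
proposed. Everything named here is hypothetical: the Guarded wrapper and
the locking() context manager merely stand in for whatever language
support would actually enforce "mutating an unlocked object raises an
exception", and this toy only guards attribute rebinding, not container
mutation.

    import threading
    from contextlib import contextmanager

    class Guarded:
        # Hypothetical stand-in: attribute writes raise unless the
        # object is currently locked by the writing thread.
        def __init__(self, **attrs):
            object.__setattr__(self, '_lock', threading.RLock())
            object.__setattr__(self, '_held', False)
            for name, value in attrs.items():
                object.__setattr__(self, name, value)

        def __setattr__(self, name, value):
            if not self._held:
                raise RuntimeError("mutation of unlocked object")
            object.__setattr__(self, name, value)

    @contextmanager
    def locking(obj):
        # Hypothetical stand-in for the proposed 'locking' construct.
        with obj._lock:
            object.__setattr__(obj, '_held', True)
            try:
                yield obj
            finally:
                object.__setattr__(obj, '_held', False)

    account = Guarded(balance=100)
    with locking(account):
        account.balance += 50   # allowed: the object is locked here
    # account.balance = 0       # outside the block this would raise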
From ron3200 at gmail.com  Tue Nov  1 01:22:57 2011
From: ron3200 at gmail.com (Ron Adam)
Date: Mon, 31 Oct 2011 19:22:57 -0500
Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol
In-Reply-To: <4EAF0DEE.1020500@canterbury.ac.nz>
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<4EA94C53.2060209@pearwood.info>
	<20111027183208.GH20970@pantoffel-wg.de>
	<4EA9AB03.8070302@stoneleaf.us> <4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
Message-ID: <1320106977.6637.50.camel@Gutsy>

On Tue, 2011-11-01 at 10:06 +1300, Greg Ewing wrote:
> Ron Adam wrote:
>> If we put some strict requirements on the idea.
>>
>>     1. Only have a *SINGLE* exception type as being resumable.
>>
>>     2. That exception should *NEVER* occur naturally.
>>
>>     3. Only allow continuing after it's *EXPLICITLY RAISED* by a
>>        raise statement.
>>
>> All of the problem issues go away with those requirements in place, and
>> you only have the issue of how to actually write the patch. Earlier
>> discussions indicated it might not be that hard to do.
>
> I'm not familiar with these earlier discussions. Did they go
> as far as sketching a feasible implementation? It's all very
> well to propose things like this, but the devil is very much
> in the details.

Yeah, there isn't very much about the details, but I think it is worth
looking into, as it pretty much does exactly what is needed. (IMHO)

Here are some of the things I was able to find. As I said, the
discussions didn't get very far. I'm hoping that the idea has more merit
in the smaller 'restricted' context of coroutines than for general
exception handling.

Here is a very old 1994 thread that is interesting reading. (Long but
worthwhile.)

http://groups.google.com/group/comp.lang.python/browse_thread/thread/674a821ed7003b69/2aa10cabcbcc4acb?q=python+continuations+restarts#2aa10cabcbcc4acb

There was a suggestion there to allow 'raise' to send out a reference
rather than an exception. That may be an interesting way to do this.

Some more that didn't go anywhere...

http://bytes.com/topic/python/answers/46053-resume-after-exception
http://bytes.com/topic/python/answers/36650-exception-feature-creep-entering-normal-flow-after-exception-raised
http://mail.python.org/pipermail/python-list/2010-December/1261919.html

I can't seem to find where I found the "it wouldn't be too hard to do"
part. But if a final restriction of only working with generators at
first is added, it may make it easier, as they can already be suspended.

Here is a Python implementation of Lisp-style restarts. I haven't
studied it yet, but it may show a way. (I'm going to look at this in
more detail tonight.)

http://pypi.python.org/pypi/withrestart/0.2.6

Not sure if these would help, but they may be of interest on a more
theoretical level.

http://okmij.org/ftp/continuations/generators.html
http://lambda-the-ultimate.org/node/1544

Wikipedia has this on its exception-handling page...

"""
Restarts separate mechanism from policy

Condition handling moreover provides a separation of mechanism from
policy. Restarts provide various possible mechanisms for recovering from
error, but do not select which mechanism is appropriate in a given
situation. That is the province of the condition handler, which (since
it is located in higher-level code) has access to a broader view.
""" In the case of coroutines, the error's are the suspension points, and the error handler is the scheduler that switches between them. The context is a bit different but I believe the concept is still applicable. Hope some of this is helpful. Cheers, Ron From bruce at leapyear.org Tue Nov 1 02:07:19 2011 From: bruce at leapyear.org (Bruce Leban) Date: Mon, 31 Oct 2011 18:07:19 -0700 Subject: [Python-ideas] Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> Message-ID: On Mon, Oct 31, 2011 at 4:19 PM, Mike Meyer wrote: > On Mon, Oct 31, 2011 at 3:58 PM, Bruce Leban wrote: > >> On Sun, Oct 30, 2011 at 8:11 PM, Mike Meyer wrote: >> >>> Any attempt to mutate an object that isn't currently locked will raise >>> an exception. Possibly ValueError, possibly a new exception class just >>> for this purpose. This includes rebinding attributes of objects that >>> aren't locked. >>> >> >> Do you mean that at any time attempting to mutate an unlocked object >> throws an exception? > > > Yes, that's the idea. There are some exceptions, but you have to > explicitly work around that restriction. > > >> That would mean that all of my current code is broken. > > > Pretty much, yes. It's like adding garbage collection and removing > alloc*/free. It's going to break a *lot* of code. That's why I said "not in > 3. and possibly never in cPython." > In order to make concurrent code slightly safer, you're going to break all existing programs that don't use concurrency. That seems to me like a new language, not Python. You've been on this list long enough to see the attention that's paid to backward compatibility. > Furthermore, merely *reading* an object that isn't locked can cause >> problems. This code is not thread-safe: >> >> if element in dictionary: return dictionary[element] >> >> so you have to decide how much safety you want and what cost we're >> willing to pay for this. >> > > You're right - it's not thread safe. However, it also doesn't suffer from > the problem I'm trying to deal with, where you mutate an object in a way > that leaves things broken, but won't be detected at that point. If it > breaks because someone mutates the object underneath it, it'll throw an > exception at that point. I know you can construct cases where that isn't > so. > I think the cases where non-thread-safe code won't throw an exception are numerous, for example, the equally trivial: if element not in dictionary: dictionary[element] = 0 heck even this is not safe: dictionary[element] +=1 If you're going to tackle thread safety, it should address more of the problem. These bugs are in many ways worse than mutating "an object in a way that leaves things broken, but won't be detected at that point." The above bugs may *never* be detected. I've come across bugs like that that were in code for many years before I found them (and I'm sure that's happened to others on this list as well). The first thing to do is identify the problems you want to solve and make sure that the problems are well understood. Then design some solutions. Starting with a bad solution to a fraction of the problem isn't a good start. --- Bruce -------------- next part -------------- An HTML attachment was scrubbed... 
From ncoghlan at gmail.com  Tue Nov  1 02:14:44 2011
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 1 Nov 2011 11:14:44 +1000
Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol
In-Reply-To: <1320106977.6637.50.camel@Gutsy>
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<4EA94C53.2060209@pearwood.info>
	<20111027183208.GH20970@pantoffel-wg.de>
	<4EA9AB03.8070302@stoneleaf.us> <4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
Message-ID:

On Tue, Nov 1, 2011 at 10:22 AM, Ron Adam wrote:
> On Tue, 2011-11-01 at 10:06 +1300, Greg Ewing wrote:
>> Ron Adam wrote:
>>> If we put some strict requirements on the idea.
>>>
>>>     1. Only have a *SINGLE* exception type as being resumable.
>>>
>>>     2. That exception should *NEVER* occur naturally.
>>>
>>>     3. Only allow continuing after it's *EXPLICITLY RAISED* by a
>>>        raise statement.
>>>
>>> All of the problem issues go away with those requirements in place,
>>> and you only have the issue of how to actually write the patch.
>>> Earlier discussions indicated it might not be that hard to do.
>>
>> I'm not familiar with these earlier discussions. Did they go
>> as far as sketching a feasible implementation? It's all very
>> well to propose things like this, but the devil is very much
>> in the details.
>
> Yeah, there isn't very much about the details, but I think it is worth
> looking into, as it pretty much does exactly what is needed. (IMHO)

It gave me another thought on an existing utility worth exploring in
this context: pdb's post-mortem capabilities.

Now, those *don't* implement coroutines (when you do a post-mortem, you
end up in an emulation of the eval loop, not the eval loop itself).
However, that exception instance *does* contain the full frame stack,
all the way down to where the exception was thrown. Figuring out what
hooks you would need in the core eval loop in order to reinstate an
exception's frame stack as the "real" frame stack might be an
interesting exercise.

Cheers,
Nick.

--
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From anacrolix at gmail.com  Tue Nov  1 04:47:53 2011
From: anacrolix at gmail.com (Matt Joiner)
Date: Tue, 1 Nov 2011 14:47:53 +1100
Subject: [Python-ideas] Concurrent safety?
In-Reply-To:
References: <20111030201143.481fdca2@bhuda.mired.org>
Message-ID:

I wouldn't say it necessarily has to be a new language. All too many new
languages have arisen when the problem was actually with some specific
part of an implementation, or could have been fixed with a careful
iteration.

It might be slightly off-topic, but one of Python's strengths in my eyes
has been very careful incremental improvements. Prior to 2.5, Python was
of no interest to me due to various concerns that have now been fixed.
Indeed the language is significantly more consistent and usable than it
was in the past.

The last thing I (and many others) would want to see is yet another
language created because an existing excellent language's implementation
couldn't keep up with the times, or surmount an obstacle. Innovations
like PyPy, multiprocessing, and the currently debated coroutines are
what give Python a chance in the future. I might be beating a dead
horse, but the last thing Python should do is roll over and die because
"it's too hard".

Just wanted to point out that language != implementation.
On Tue, Nov 1, 2011 at 12:07 PM, Bruce Leban wrote:
>
> On Mon, Oct 31, 2011 at 4:19 PM, Mike Meyer wrote:
>>
>> On Mon, Oct 31, 2011 at 3:58 PM, Bruce Leban wrote:
>>>
>>> On Sun, Oct 30, 2011 at 8:11 PM, Mike Meyer wrote:
>>>>
>>>> Any attempt to mutate an object that isn't currently locked will
>>>> raise an exception. Possibly ValueError, possibly a new exception
>>>> class just for this purpose. This includes rebinding attributes of
>>>> objects that aren't locked.
>>>
>>> Do you mean that at any time attempting to mutate an unlocked object
>>> throws an exception?
>>
>> Yes, that's the idea. There are some exceptions, but you have to
>> explicitly work around that restriction.
>>
>>> That would mean that all of my current code is broken.
>>
>> Pretty much, yes. It's like adding garbage collection and removing
>> alloc*/free. It's going to break a *lot* of code. That's why I said
>> "not in 3.x and possibly never in CPython."
>
> In order to make concurrent code slightly safer, you're going to break
> all existing programs that don't use concurrency. That seems to me like
> a new language, not Python. You've been on this list long enough to see
> the attention that's paid to backward compatibility.
>
>>> Furthermore, merely *reading* an object that isn't locked can cause
>>> problems. This code is not thread-safe:
>>>
>>>     if element in dictionary: return dictionary[element]
>>>
>>> so you have to decide how much safety you want and what cost we're
>>> willing to pay for this.
>>
>> You're right - it's not thread-safe. However, it also doesn't suffer
>> from the problem I'm trying to deal with, where you mutate an object
>> in a way that leaves things broken, but won't be detected at that
>> point. If it breaks because someone mutates the object underneath it,
>> it'll throw an exception at that point. I know you can construct cases
>> where that isn't so.
>
> I think the cases where non-thread-safe code won't throw an exception
> are numerous, for example, the equally trivial:
>
>     if element not in dictionary: dictionary[element] = 0
>
> heck, even this is not safe:
>
>     dictionary[element] += 1
>
> If you're going to tackle thread safety, it should address more of the
> problem. These bugs are in many ways worse than mutating "an object in
> a way that leaves things broken, but won't be detected at that point."
> The above bugs may *never* be detected. I've come across bugs like that
> that were in code for many years before I found them (and I'm sure
> that's happened to others on this list as well).
>
> The first thing to do is identify the problems you want to solve and
> make sure that the problems are well understood. Then design some
> solutions. Starting with a bad solution to a fraction of the problem
> isn't a good start.
> --- Bruce

From ron3200 at gmail.com  Tue Nov  1 05:58:26 2011
From: ron3200 at gmail.com (Ron Adam)
Date: Mon, 31 Oct 2011 23:58:26 -0500
Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol
In-Reply-To:
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<4EA94C53.2060209@pearwood.info>
	<20111027183208.GH20970@pantoffel-wg.de>
	<4EA9AB03.8070302@stoneleaf.us> <4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
Message-ID: <1320123506.9456.57.camel@Gutsy>

On Tue, 2011-11-01 at 11:14 +1000, Nick Coghlan wrote:
> On Tue, Nov 1, 2011 at 10:22 AM, Ron Adam wrote:
>> On Tue, 2011-11-01 at 10:06 +1300, Greg Ewing wrote:
>>> Ron Adam wrote:
>>>> If we put some strict requirements on the idea.
>>>>
>>>>     1. Only have a *SINGLE* exception type as being resumable.
>>>>
>>>>     2. That exception should *NEVER* occur naturally.
>>>>
>>>>     3. Only allow continuing after it's *EXPLICITLY RAISED* by a
>>>>        raise statement.
>>>>
>>>> All of the problem issues go away with those requirements in place,
>>>> and you only have the issue of how to actually write the patch.
>>>> Earlier discussions indicated it might not be that hard to do.
>>>
>>> I'm not familiar with these earlier discussions. Did they go
>>> as far as sketching a feasible implementation? It's all very
>>> well to propose things like this, but the devil is very much
>>> in the details.
>>
>> Yeah, there isn't very much about the details, but I think it is worth
>> looking into, as it pretty much does exactly what is needed. (IMHO)
>
> It gave me another thought on an existing utility worth exploring in
> this context: pdb's post-mortem capabilities.
>
> Now, those *don't* implement coroutines (when you do a postmortem, you
> end up in an emulation of the eval loop, not the eval loop itself).
> However, that exception instance *does* contain the full frame stack,
> all the way down to where the exception was thrown. Figuring out what
> hooks you would need in the core eval loop in order to reinstate an
> exception's frame stack as the "real" frame stack might be an
> interesting exercise.

Poking around a bit, it looks like 'raise' does most of the work, and
the exception is just an envelope for whatever 'raise' puts in it. Is
that right?

I'd like to be able to make this work:

    class Suspend:
        def __init__(self, source):
            self.source = source
            self.suspend = True

        def __next__(self):
            nonlocal self.suspend
            if self.suspend:
                self.suspend = False
                raise SuspendException
            self.suspend = True
            return next(self.source)

There are two issues with it...

The "self.suspend = False" doesn't seem to work. The __next__ seems to
get its own copies of the attributes at the time the generator is
created.

And after the SuspendException is raised, a StopIteration is issued on
the next next() call. The StopIteration is from the whole chain. The
only reason the scheduler doesn't stop is that it catches the
SuspendException.

I want to be able to stick something like this in the generator chained
pipe example below.

Cheers,
Ron

*This is broken down into finer steps than you would normally use, in
order to test how it behaves.
""" Basic scheduler test -- co-pipes version """ from co_pipes import * def Person(args): name, count = args p = Producer(lambda:name) # call function each time p = Limit(p, count) # exit after count yields. p = Enumerate(p) # -> (n, data) #p = Suspend(p) # suspend doesn't work. p = Apply(p, "{0[0]}: {0[1]} ".format) p = Apply(p, print) # consumer for _ in p: yield # pull data from here. def main(data): p = Source(data) # take an iterable p = Apply(p, Person) # apply a callable to data p = Collect(p) # collect items in a list p = Scheduler(p) # take a list of generators. next(p) # start the scheduler. if __name__ == "__main__": data = [("John", 2), ("Micheal", 3), ("Terry", 4)] main(data) Prints... 1: John 1: Micheal 1: Terry 2: John 2: Micheal 2: Terry 3: Micheal 3: Terry 4: Terry From ron3200 at gmail.com Tue Nov 1 06:10:46 2011 From: ron3200 at gmail.com (Ron Adam) Date: Tue, 01 Nov 2011 00:10:46 -0500 Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol In-Reply-To: <1320123506.9456.57.camel@Gutsy> References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA94C53.2060209@pearwood.info> <20111027183208.GH20970@pantoffel-wg.de> <4EA9AB03.8070302@stoneleaf.us> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> Message-ID: <1320124246.9456.62.camel@Gutsy> On Mon, 2011-10-31 at 23:58 -0500, Ron Adam wrote: > On Tue, 2011-11-01 at 11:14 +1000, Nick Coghlan wrote: > > On Tue, Nov 1, 2011 at 10:22 AM, Ron Adam wrote: > > > On Tue, 2011-11-01 at 10:06 +1300, Greg Ewing wrote: > > >> Ron Adam wrote: > > >> > If we put some strict requirements on the idea. > > >> > > > >> > 1. Only have a *SINGLE* exception type as being resumable. > > >> > > > >> > 2. That exception should *NEVER* occur naturally. > > >> > > > >> > 3. Only allow continuing after it's *EXPLICITLY RAISED* by a > > >> > raised statement. > > >> > > > >> > All of the problem issues go away with those requirements in place, and > > >> > you only have the issue of how to actually write the patch. Earlier > > >> > discussions indicated, it might not be that hard to do. > > >> > > >> I'm not familiar with these earlier discussions. Did they go > > >> as far as sketching a feasible implementation? It's all very > > >> well to propose things like this, but the devil is very much > > >> in the details. > > > > > > Yeah, there isn't very much about the details, but I think it is worth > > > looking into as it would pretty much does exactly what is needed. (IMHO) > > > > It gave me another thought on an existing utility worth exploring in > > this context: pdb's post-mortem capabilities. > > > > Now, those *don't* implement coroutines (when you do a postmortem, you > > end up in an emulation of the eval loop, not the eval loop itself). > > However, that exception instance *does* contain the full frame stack, > > all the way down to where the exception was thrown. Figuring out what > > hooks you would need in the core eval loop in order to reinstate an > > exception's frame stack as the "real" frame stack might be an > > interesting exercise. > > Poking around a bit, it looks like 'raise' does most of the work and the > exception is just an envelope for what ever 'raise' puts in it. Is that > right? > > > > I'd like to be able to make this work. 
>     class Suspend:
>         def __init__(self, source):
>             self.source = source
>             self.suspend = True
>
>         def __next__(self):
>             nonlocal self.suspend
>             if self.suspend:
>                 self.suspend = False
>                 raise SuspendException
>             self.suspend = True
>             return next(self.source)

LOL... Need to recheck my cut and copy between edits. Remove the
"nonlocal self.suspend". It was just to see if anything acted
differently, and I forgot to remove it.

> There are two issues with it...
>
> The "self.suspend = False" doesn't seem to work. The __next__ seems to
> get its own copies of the attributes at the time the generator is
> created.

Ok, the self.suspend reference does work as it should. I was just not
putting my print statement in the right place. Time to call it a night.
The rest is ok.

Cheers,
Ron

> And after the SuspendException is raised, a StopIteration is issued on
> the next next() call. The StopIteration is from the whole chain. The
> only reason the scheduler doesn't stop is that it catches the
> SuspendException.
>
> I want to be able to stick something like this in the generator chained
> pipe example below.
>
> Cheers,
> Ron
>
> *This is broken down into finer steps than you would normally use, in
> order to test how it behaves.
>
>     """
>     Basic scheduler test -- co-pipes version
>     """
>     from co_pipes import *
>
>     def Person(args):
>         name, count = args
>         p = Producer(lambda: name)  # call function each time
>         p = Limit(p, count)         # stop after count yields.
>         p = Enumerate(p)            # -> (n, data)
>         #p = Suspend(p)             # suspend doesn't work.
>         p = Apply(p, "{0[0]}: {0[1]} ".format)
>         p = Apply(p, print)         # consumer
>         for _ in p:
>             yield                   # pull data from here.
>
>     def main(data):
>         p = Source(data)            # take an iterable
>         p = Apply(p, Person)        # apply a callable to data
>         p = Collect(p)              # collect items in a list
>         p = Scheduler(p)            # take a list of generators.
>         next(p)                     # start the scheduler.
>
>     if __name__ == "__main__":
>         data = [("John", 2), ("Micheal", 3), ("Terry", 4)]
>         main(data)
>
> Prints...
>
>     1: John
>     1: Micheal
>     1: Terry
>     2: John
>     2: Micheal
>     2: Terry
>     3: Micheal
>     3: Terry
>     4: Terry

From mwm at mired.org  Tue Nov  1 06:32:59 2011
From: mwm at mired.org (Mike Meyer)
Date: Mon, 31 Oct 2011 22:32:59 -0700
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: <20111031223048.6e5d2798@bhuda.mired.org>
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
Message-ID:

Sigh. Sent this only to Bruce by accident. Sorry about the duplicates,
Bruce.

---------- Forwarded message ----------
From: Mike Meyer
Date: Mon, Oct 31, 2011 at 10:30 PM
Subject: Re: [Python-ideas] Concurrent safety?
To: Bruce Leban

On Mon, 31 Oct 2011 18:07:19 -0700 Bruce Leban wrote:
> On Mon, Oct 31, 2011 at 4:19 PM, Mike Meyer wrote:
>>> That would mean that all of my current code is broken.
>> Pretty much, yes. It's like adding garbage collection and removing
>> alloc*/free. It's going to break a *lot* of code. That's why I said
>> "not in 3.x and possibly never in CPython."
> In order to make concurrent code slightly safer, you're going to break
> all existing programs that don't use concurrency. That seems to me like
> a new language, not Python. You've been on this list long enough to see
> the attention that's paid to backward compatibility.

The way adding automatic memory management just made pointer arithmetic
slightly safer. But yeah, the first thing I said was "not in 3.x,
possibly never in CPython."
I've been on this list long enough to know that, while the community
pays a lot of attention to backwards compatibility, it is willing to
throw it out when there's enough benefit. As you point out, this is a
hard problem. I know I haven't covered all the issues. That's why the
second thing I said was that I'm hoping to get people smarter than me to
look at things.

The CPU manufacturers have switched to improving performance by adding
more cores instead of cranking clock speeds. As time goes by, more and
more programmers are going to want to leverage that in serious ways. I
already do, and find that Python makes some design choices unavailable
or very fragile(*). I'd like to make those choices available, and help
Python get ready for the time when that desire is the norm instead of
the exception.

>>> Furthermore, merely *reading* an object that isn't locked can cause
>>> problems. This code is not thread-safe:
>>>
>>>     if element in dictionary: return dictionary[element]
>>>
>>> so you have to decide how much safety you want and what cost we're
>>> willing to pay for this.
>> You're right - it's not thread-safe. However, it also doesn't suffer
>> from the problem I'm trying to deal with, where you mutate an object
>> in a way that leaves things broken, but won't be detected at that
>> point. If it breaks because someone mutates the object underneath it,
>> it'll throw an exception at that point. I know you can construct cases
>> where that isn't so.
> I think the cases where non-thread-safe code won't throw an exception
> are numerous, for example, the equally trivial:

Again, I said such cases can be built. I *didn't* say they were
exceptions; I proposed a change to deal with them.

> If you're going to tackle thread safety, it should address more of the
> problem. These bugs are in many ways worse than mutating "an object in
> a way that leaves things broken, but won't be detected at that point."
> The above bugs may *never* be detected. I've come across bugs like that
> that were in code for many years before I found them (and I'm sure
> that's happened to others on this list as well).

Like me. That's part of why I want to get the interpreter to help find
them.

> The first thing to do is identify the problems you want to solve and
> make sure that the problems are well understood. Then design some
> solutions. Starting with a bad solution to a fraction of the problem
> isn't a good start.

I've identified the problem I want to solve: I want to make concurrent
use of Python objects "safe by default", so that doing unsafe things
causes the programmer to have to do something explicit about making
things safe. I believe this can be done at the mutation points (again,
Clojure shows that it can be done). I also want to preserve as much of
Python's existing code as possible. It may be that Python's existing
data structures mean my belief about mutation points is wrong. This may
be the wrong solution. It may be that such a change is too large to be
acceptable. But the only way to find out is to investigate it.

This discussion has already generated significant changes in the
original proposal, plus some implementation ideas.

http://www.mired.org/
Independent Software developer/SCM consultant, email for more information.
O< ascii ribbon campaign - stop html mail - www.asciiribbon.org
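The failure mode under discussion - an unsynchronized write whose error
surfaces somewhere else entirely - can be reproduced with nothing more
than a dict and two threads. This contrived sketch (not code from the
thread, and timing-dependent, so it may not fail on every run) shows the
exception landing in the reader, far from the write that caused it:

    import threading
    import time

    data = {i: i for i in range(1000)}

    def mutator():
        time.sleep(0.01)
        data[1000] = 0           # the real bug: an unguarded write...

    def reader():
        try:
            for key in data:     # ...but the error surfaces here
                time.sleep(0.0001)
        except RuntimeError as exc:
            # CPython raises "dictionary changed size during iteration"
            # in the reading thread, not at the offending write.
            print("reader failed:", exc)

    t1 = threading.Thread(target=mutator)
    t2 = threading.Thread(target=reader)
    t2.start(); t1.start()
    t1.join(); t2.join()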
From greg.ewing at canterbury.ac.nz  Tue Nov  1 07:00:15 2011
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Tue, 01 Nov 2011 19:00:15 +1300
Subject: [Python-ideas] Concurrent safety?
In-Reply-To:
References: <20111030201143.481fdca2@bhuda.mired.org>
Message-ID: <4EAF8AEF.5050804@canterbury.ac.nz>

Mike Meyer wrote:

> The goal here is to move from where we are to a place similar to where
> handling files is, so that failing to properly deal with the
> possibility of concurrent access causes an error when it happens, not
> at a point distant in both time and space.

I don't think what you're suggesting would achieve this, though. The
locking required for correctness often involves more than one object or
more than one operation on an object. Consider

    new_balance = balance + deposit
    lock(balance)
    balance = new_balance
    unlock(balance)

This wouldn't trigger any of your alarms, but it would still be wrong.

--
Greg

From tjreedy at udel.edu  Tue Nov  1 07:27:55 2011
From: tjreedy at udel.edu (Terry Reedy)
Date: Tue, 01 Nov 2011 02:27:55 -0400
Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol
In-Reply-To: <1320123506.9456.57.camel@Gutsy>
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<20111027183208.GH20970@pantoffel-wg.de>
	<4EA9AB03.8070302@stoneleaf.us> <4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
	<1320123506.9456.57.camel@Gutsy>
Message-ID:

On 11/1/2011 12:58 AM, Ron Adam wrote:

> Poking around a bit, it looks like 'raise' does most of the work, and
> the exception is just an envelope for whatever 'raise' puts in it. Is
> that right?

I believe raise just instantiates the indicated exception. I expect that
Exception.__new__ or .__init__ captures the traceback info. Subclasses
can add more. A SuspendExecution exception should be able to grab as
much as is needed for a resume. A C API call could be added if needed.

I hope you keep looking at this idea. Function calls stop execution and
pass control 'down', to be resumed by return. yield stops execution and
passes control 'up', to be resumed by next (or .send). Exceptions pass
control 'up' (or 'out') without the possibility of resuming. All that is
lacking is something to suspend and pass control 'sideways', to a
specific target. A special exception makes some sense in that exceptions
already get the call stack needed to resume after suspension.

--
Terry Jan Reedy

From tjreedy at udel.edu  Tue Nov  1 07:55:23 2011
From: tjreedy at udel.edu (Terry Reedy)
Date: Tue, 01 Nov 2011 02:55:23 -0400
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To:
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
Message-ID:

On 11/1/2011 1:32 AM, Mike Meyer wrote:

> As you point out, this is a hard problem. I know I haven't covered all
> the issues. That's why the second thing I said was that I'm hoping to
> get people smarter than me to look at things.

This is one of the hard problems that keep getting swept under the rug
while we do easier things. Well, we have overhauled unicode and
packaging for 3.3, so maybe concurrency can get some attention.

I keep thinking that CPython's design of allowing C coded modules either
outside or inside the stdlib should allow some progress.
Would it be helpful, for instance, to have truly immutable restricted
tuples and frozensets, whose __new__ methods only allowed true
immutables (None, booleans, numbers, strings, other restricted tuples
and frozensets) as members?

How about a metaclass, say 'immutable', that made the instances of a
user class truly immutable? (I don't know how to do this, but let's
assume it can be done -- perhaps with a new frozendict.) If such were
possible, instances of instances of such a metaclass could be added to
the list above.

Could a metaclass automatically add fine-grained locks around attribute
mutations?

--
Terry Jan Reedy

From stephen at xemacs.org  Tue Nov  1 09:01:19 2011
From: stephen at xemacs.org (Stephen J. Turnbull)
Date: Tue, 01 Nov 2011 17:01:19 +0900
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To:
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
Message-ID: <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>

Mike Meyer writes:

> The CPU manufacturers have switched to improving performance by adding
> more cores instead of cranking clock speeds. As time goes by, more and
> more programmers are going to want to leverage that in serious ways.

Please, everybody is aware of that. Anybody against improved support for
concurrency is probably in favor of criminalizing motherhood and apple
pie, too. What you need to justify is the apparently expensive approach
you propose, and you need to resolve the apparent contradiction between
that expense and the basic argument for threads, which is precisely how
expensive they *aren't*.

> I've identified the problem I want to solve: I want to make
> concurrent use of Python objects "safe by default",

But that's not what you've proposed, AIUI. You've proposed making
concurrent use *safer*, but not yet *safe*. That's quite different from
the analogy with automatic memory management, where the programmer can't
do anything dangerous with pointers (because they can't do anything at
all). The analogous model for concurrency is processes, it seems to me.
(I don't have a desperate need for high-performance concurrency, so I
take no position on processes + message passing vs. threads + shared
resources.)

> so that doing unsafe things causes the programmer to have to do
> something explicit about making things safe.

This is un-Pythonic, IMO[1]. Python generally permits dangerous (and
even ugly) things when done by "consenting adults", on the theory that
the programmer knows more about her problem than Python does. It seems
to me that a more Pythonic approach to this would be to provide
something like STM as a metaclass, mixin class, or decorator. (Don't ask
me how.)

> I believe this can be done at the mutation points (again, Clojure
> shows that it can be done).

But Clojure is a Lisp-derived language. Lisp was designed as a pure
functional language (although AFAIK it pretty much immediately acquired
"set"), and a very large number of Lisp algorithms are designed around
conses, which are (like Python tuples) basically immutable (yes, I know
about setcar and setcdr, but use of those functions is generally
considered a bug). Whether that orientation toward immutable objects
continues in Clojure I don't know, but if it does, the problem of
designing a "write barrier" for mutations may be (a) simpler and (b)
have less performance impact than the analogous task applied to Python.
While Python-the-language does have some immutable objects like tuples
and strings, it's really kind of hard to avoid use of containers like
lists and dictionaries and classes with mutable objects.

Footnotes:
[1] But I've been clobbered for expressing my opinion on Pythonicity in
the past, so don't put too much weight on that.

From ncoghlan at gmail.com  Tue Nov  1 09:15:47 2011
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 1 Nov 2011 18:15:47 +1000
Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol
In-Reply-To:
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<20111027183208.GH20970@pantoffel-wg.de>
	<4EA9AB03.8070302@stoneleaf.us> <4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
	<1320123506.9456.57.camel@Gutsy>
Message-ID:

On Tue, Nov 1, 2011 at 4:27 PM, Terry Reedy wrote:
> I believe raise just instantiates the indicated exception. I expect
> that Exception.__new__ or .__init__ captures the traceback info.
> Subclasses can add more. A SuspendExecution exception should be able to
> grab as much as is needed for a resume. A C API call could be added if
> needed.

No, the traceback info is added by the eval loop itself. Remember that
when you raise an exception *type* (rather than an instance), the
exception doesn't get instantiated until it gets caught somewhere - the
eval loop maintains the unwinding stack for the traceback as part of the
thread state until it is time to attach it to the exception object.

This is all at the tail end of the eval loop in CPython, but be warned
it's fairly brain-bending stuff that depends on various internal details
of the eval loop:
http://hg.python.org/cpython/file/default/Python/ceval.c#l2879

> I hope you keep looking at this idea. Function calls stop execution and
> pass control 'down', to be resumed by return. yield stops execution and
> passes control 'up', to be resumed by next (or .send). Exceptions pass
> control 'up' (or 'out') without the possibility of resuming. All that
> is lacking is something to suspend and pass control 'sideways', to a
> specific target. A special exception makes some sense in that
> exceptions already get the call stack needed to resume after
> suspension.

That's not actually true - due to the need to process exception handling
clauses and finally blocks (including the implicit ones inside with
statements), the internal state of those frames is potentially no longer
valid for resumption (they've moved on beyond the point where the
internal function was called).

I'll also note that it isn't necessary to pass control sideways, since
there are two different flavours of coroutine design (the PDF article in
the other thread describes this well). The Lua version is "asymmetric
coroutines", and they only allow you to return to the point that first
invoked the coroutine (this model is a fairly close fit with Python's
generators and exception handling). The greenlet version is "symmetric"
coroutines, and those let you switch directly to any other coroutine.

Both models have their pros and cons, but the main advantage of
asymmetric coroutines is that you can just say "suspend this thread"
without having to say *where* you want to switch to. Of course, you can
implement much the same API with symmetric coroutines as well, so long
as you can look up your parent coroutine easily.

Ultimately, I expect the symmetric vs. asymmetric decision will be
driven more by implementation details than by philosophical preferences
one way or the other. I will note that Ron's suggestion to leverage the
existing eval loop stack collection provided by the exception handling
machinery does heavily favour the asymmetric approach.

Having a quick look to refresh my memory of some of the details of
CPython's exception handling, I've come to the following tentative
conclusions:

- an ordinary exception won't do, since you don't want to trigger
  except and finally blocks in outer frames (ceval.c#2903)
Ultimately, I expect the symmetric vs asymmetric decision will be driven more by implementation details than by philosophical preferences one way or the other. I will note that Ron's suggestion to leverage the existing eval loop stack collection provided by the exception handling machinery does heavily favour the asymmetric approach. Having a quick look to refresh my memory of some of the details of CPython's exception handling, I've come to the following tentative conclusions: - an ordinary exception won't do, since you don't want to trigger except and finally blocks in outer frames (ceval.c#2903) - in CPython, a new "why = WHY_SUSPEND" at the eval loop layer is likely a better approach, since it would allow the frame stack to be collected without triggering exception handling - the stack unwinding would then end when a "SETUP_COCALL" block was encountered on the block stack (just as SETUP_EXCEPT and SETUP_FINALLY can stop the stack unwinding following an exception - with the block stacks within the individual frames preserved, the collected stack should be in a fit state for later restoration - the "fast_yield" code and the generator resumption code should also provide useful insight There's nothing too magical there - once we disclaim the ability to suspend coroutines while inside a C function (even one that has called back in via the C/Python API), it should boil down to a combination of the existing mechanics for generators and exception handling. So, even though the above description is (highly) CPython specific, it should be feasible for other implementations to come up with something similar (although perhaps not easy: http://lua-users.org/lists/lua-l/2007-07/msg00002.html). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ncoghlan at gmail.com Tue Nov 1 09:31:24 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 1 Nov 2011 18:31:24 +1000 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Tue, Nov 1, 2011 at 6:01 PM, Stephen J. Turnbull wrote: > ?> I've identified the problem I want to solve: I want to make > ?> concurrent use of python objects "safe by default", > > But that's not what you've proposed, AIUI. ?You've proposed making > concurrent use *safer*, but not yet *safe*. ?That's quite different > from the analogy with automatic memory management, where the > programmer can't do anything dangerous with pointers (because they > can't do anything at all). ?The analogous model for concurrency is > processes, it seems to me. ?(I don't have a desperate need for high- > performance concurrency, so I take no position on processes + message > passing vs. threads + shared resources.) Guido and python-dev in general *have* effectively taken a position on that, though (mainly due to Global Interpreter Lock discussions). 1. Even for threads, the recommended approach is to use queue.Queue to avoid the common concurrency issues (such as race conditions and deadlock) associated with explicit locking 2. In Python 3, concurrent.futures offers an even *safer* interface and higher level interface for many concurrent workloads 3. If you use multiple processes and serialised messages, or higher level APIs like concurrent.futures, you can not only scale to multiple cores, but also to multiple *machines*. 
This has led to a quite deserved reputation for being intolerant of
changes that claim to make multithreaded development "better", but only
at the expense of making single-threaded development worse.

That's actually one of the more interesting aspects of PyPy's
experiments with software transactional memory - the sophistication of
their toolchain means that they can make the STM feature optional at the
translation stage, without resorting to ugly #ifdef hackery the way we
would need to in order to make such a feature similarly optional in
CPython.

Cheers,
Nick.

--
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia

From stephen at xemacs.org  Tue Nov  1 10:13:01 2011
From: stephen at xemacs.org (Stephen J. Turnbull)
Date: Tue, 01 Nov 2011 18:13:01 +0900
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To:
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: <87hb2op5nm.fsf@uwakimon.sk.tsukuba.ac.jp>

Nick Coghlan writes:

> sjt wrote:
>> (I don't have a desperate need for high-performance concurrency, so I
>> take no position on processes + message passing vs. threads + shared
>> resources.)
>
> Guido and python-dev in general *have* effectively taken a position on
> that, though (mainly due to Global Interpreter Lock discussions).

Sure, as a matter of "development politics" that's pretty clear. I'm
sure Mike understands that, too. (And is frustrated by it!) My point is
that Mike's approach of trying to make *everything* safe for concurrency
seems to point in the direction of processes + message passing, but I
don't claim that this proves that processes are in any sense technically
superior, just that his approach needs justification beyond "hey, we
need to do something about concurrency in Python!"

From greg.ewing at canterbury.ac.nz  Tue Nov  1 11:24:44 2011
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Tue, 01 Nov 2011 23:24:44 +1300
Subject: [Python-ideas] Cofunctions - A New Protocol
In-Reply-To:
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
	<1320123506.9456.57.camel@Gutsy>
Message-ID: <4EAFC8EC.4080601@canterbury.ac.nz>

A Coroutine Protocol
====================

Here are some thoughts on the design of a new protocol to support
lightweight threads using a mechanism similar to, but distinct from,
generators and yield-from. Separating the two protocols will make it
much easier to support suspendable generators, something that is not
possible using the cofunction mechanism as currently specified in PEP
3152.

The protocol to be described is similar in many ways to the generator
protocol, and in what follows, analogies will be drawn between the two
protocols where it may aid understanding.

API
---

This section describes the outward appearance of the coroutine
mechanism to the programmer.

A coroutine is created using the following constructor::

    coroutine(f, *args, **kwds)

where ``f`` is an object obeying the "coroutine protocol" to be
described below. Syntactic support will be provided for creating such an
object using a special form of Python function definition, analogous to
a generator.

The result is a "coroutine object" having the following methods:

``resume(value = None)``
    Resumes execution of the coroutine at the point where it was last
    suspended.
    The value, if any, is passed into the coroutine and becomes the
    return value of the operation that caused the suspension. The
    coroutine executes until its next suspension point, at which time
    the ``resume`` call returns with the value passed into the
    suspension operation.

    (Note: This is analogous to calling next() or send() on a
    generator-iterator. Suspension of a coroutine is analogous to a
    generator executing a ``yield`` operation.)

    If the coroutine has been freshly created, the passed-in value is
    ignored and the coroutine executes up to its first suspension point.

    If the top level of the coroutine finishes execution without
    encountering any further suspension points, a ``CoReturn`` exception
    is raised. This exception has a ``value`` attribute containing the
    return value from the coroutine. (Note: ``CoReturn`` is analogous to
    the ``StopIteration`` exception raised by an exhausted iterator or
    generator.)

``throw(exception)``
    Causes the given exception to be raised in the coroutine at its
    current suspension point.

``close()``
    Requests that the coroutine shut down and clean itself up. This is
    achieved by throwing in a ``CoExit`` exception (analogous to
    ``GeneratorExit``).

It is expected that programmers will not write code that deals directly
with coroutine objects very often; rather, some kind of driver or
scheduler will be used that takes care of making ``resume()`` calls and
handling ``CoReturn`` exceptions.

Cofunctions
-----------

There will be a special form of Python function called a "cofunction",
defined using the new keyword ``codef`` in place of ``def``. A
cofunction provides a convenient way of creating an object obeying the
coroutine protocol. (This is similar to how a generator provides a
convenient way of creating an object obeying the iterator protocol.)

Suspension of a cofunction is achieved using the expression::

    coyield [value]

This is analogous to a ``yield`` expression in a generator, and like
``yield``, it can both provide and receive a value. However, unlike
``yield``, it is *not* restricted to communicating with the immediate
caller. It communicates directly with the ``resume`` method of the
coroutine, however deep the nesting of calls is between the ``resume``
call and the ``coyield``.

There are some restrictions, however:

* A ``coyield`` is only allowed in the body of a cofunction (a function
  defined with ``codef``), not in any other context.

* A cofunction can only be called from the body of another cofunction,
  not in any other context.

Exceptions are raised if any of these restrictions are violated.

As a consequence, there must be an unbroken chain of cofunctions (or
other objects obeying the cofunction protocol, see below) making up the
call stack from the ``resume`` method down to the suspension point. A
cofunction may call an ordinary function, but that function or anything
called by it will not be able to suspend the coroutine.

Note that the class of "ordinary functions" includes most functions and
methods written in C. However, it is possible for an object implemented
in C to participate in a coroutine stack by implementing the coroutine
protocol below explicitly.

Coroutine Protocol
------------------

As well as the coroutine object, the coroutine protocol involves three
other kinds of objects: "cocallable objects", "coframe objects" and
"coiterator objects".

A cocallable object has the following method:

``__cocall__(*args, **kwds)``
    Initiates a suspendable computation. Returns a coframe object.
    (This is analogous to the __iter__ method of an iterable object.)

    May return NotImplemented to signal that the object does not support
    the coroutine protocol. This enables wrapper objects such as bound
    methods to reflect whether or not the wrapped object supports the
    coroutine protocol.

A coframe object has the following methods:

``__resume__(costack, value)``
    There are two purposes for which this method is called: to continue
    execution from a suspension point, and to pass in the return value
    resulting from a nested call to another cocallable object.

    In both cases, the ``__resume__`` method is expected to continue
    execution until the next suspension point, and return the value
    produced by it. If the computation finishes before reaching another
    suspension point, ``CoReturn(retval)`` must be raised, where
    ``retval`` is the return value of the computation.

    (This method is analogous to the ``send`` method of a
    generator-iterator. With a value of None, it is analogous to the
    __next__ method of an iterator.)

    The currently-executing coroutine object is passed in as the
    ``costack`` parameter. The ``__resume__`` method can make a nested
    call to another cocallable object ``sub`` by performing:

        ``return costack.call(sub, *args, **kwds)``

    No further calls to this coframe will be made until ``sub``
    finishes. When it does, the ``__resume__`` method of this coframe is
    called with the return value from ``sub``.

    It is the responsibility of the coframe object to keep track of
    whether the previous call to its ``__resume__`` method resulted in a
    suspension or a nested call, and make use of the ``value`` parameter
    accordingly.

``__throw__(costack, exception)``
    Called to throw an exception into the computation. The coframe may
    choose to absorb the exception and continue executing, in which case
    ``__throw__`` should return the value produced by the next
    suspension point or raise ``CoReturn`` as for ``__resume__``.
    Alternatively it may allow the same or a different exception to
    propagate out.

    Implementation of this method is optional. If it is not present, the
    behaviour is as if a trivial ``__throw__`` method were present that
    simply re-raises the exception.

A coiterator is an iterator that permits iteration to be carried out in
a suspendable manner. A coiterator object has the following method:

``__conext__()``
    Returns a coframe for computing the next item from the iteration.
    This is the coroutine equivalent of an iterator's ``__next__``
    method, and behaves accordingly: its ``__resume__`` method must
    return an item by raising ``CoReturn(item)``. To finish the
    iteration, it raises ``StopIteration`` as usual.

To support coiteration, whenever a "next" operation is invoked by a
cofunction (whether implicitly by means of a for-loop or explicitly by
calling ``next()``), a ``__conext__`` method is first looked for, and if
found, the operation is carried out suspendably. Otherwise a normal call
is made to the ``__next__`` method.

Formal Semantics
----------------

The semantics of the coroutine object are defined by the following
Python implementation.
::

    class coroutine(object):

        # Public methods

        def __init__(self, main, *args, **kwds):
            self._stack = []
            self._push(_cocall(main, *args, **kwds))

        def resume(self, value = None):
            return self._run(value, None)

        def throw(self, exc):
            return self._run(None, exc)

        def close(self):
            try:
                self.throw(CoExit)
            except (CoExit, CoReturn):
                pass

        def call(self, subroutine, *args, **kwds):
            meth = getattr(subroutine, '__cocall__', None)
            if meth is not None:
                frame = meth(*args, **kwds)
                if frame is not NotImplemented:
                    self._push(frame)
                    return self._run(None, None)
            return CoReturn(subroutine(*args, **kwds))

        # Private methods

        def _run(self, value, exc):
            while True:
                try:
                    frame = self._top()
                    if exc is None:
                        return frame.__resume__(self, value)
                    else:
                        meth = getattr(frame, '__throw__', None)
                        if meth is not None:
                            return meth(self, exc)
                        else:
                            raise exc
                except BaseException as exc:
                    if self._pop():
                        if isinstance(exc, CoReturn):
                            value = exc.value
                            exc = None
                    else:
                        raise

        def _push(self, frame):
            self._stack.append(frame)

        def _pop(self):
            if len(self._stack) > 0:
                del self._stack[-1]
                return True
            else:
                return False

        def _top(self):
            return self._stack[-1]

--
Greg

From mwm at mired.org  Tue Nov  1 16:25:08 2011
From: mwm at mired.org (Mike Meyer)
Date: Tue, 1 Nov 2011 08:25:08 -0700
Subject: [Python-ideas] Concurrent safety?
In-Reply-To: <4EAF8AEF.5050804@canterbury.ac.nz>
References: <20111030201143.481fdca2@bhuda.mired.org>
	<4EAF8AEF.5050804@canterbury.ac.nz>
Message-ID:

On Mon, Oct 31, 2011 at 11:00 PM, Greg Ewing wrote:
> Mike Meyer wrote:
>
>> The goal here is to move from where we are to a place similar to where
>> handling files is, so that failing to properly deal with the
>> possibility of concurrent access causes an error when it happens, not
>> at a point distant in both time and space.
>
> I don't think what you're suggesting would achieve this, though. The
> locking required for correctness often involves more than one object or
> more than one operation on an object. Consider
>
>     new_balance = balance + deposit
>     lock(balance)
>     balance = new_balance
>     unlock(balance)
>
> This wouldn't trigger any of your alarms, but it would still be wrong.

You're right - I chose my words poorly. As stated, solving it would
involve solving the halting problem. Replace the word "properly" with
"at all". I.e. - if you don't think about a concurrent access and should
have, it'll cause an error. If you think about it and get it wrong -
well, nothing will prevent all bugs. Partially automated resource
allocation doesn't prevent the programmer from writing bad code, and
this is in that category.

From mwm at mired.org  Tue Nov  1 16:38:20 2011
From: mwm at mired.org (Mike Meyer)
Date: Tue, 1 Nov 2011 08:38:20 -0700
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID:

On Tue, Nov 1, 2011 at 1:01 AM, Stephen J. Turnbull wrote:
> Mike Meyer writes:
>> I've identified the problem I want to solve: I want to make
>> concurrent use of Python objects "safe by default",
>
> But that's not what you've proposed, AIUI. You've proposed making
> concurrent use *safer*, but not yet *safe*. That's quite different
> from the analogy with automatic memory management, where the
> programmer can't do anything dangerous with pointers (because they
> can't do anything at all). The analogous model for concurrency is
> processes, it seems to me.
(I don't have a desperate need for high-
> performance concurrency, so I take no position on processes + message
> passing vs. threads + shared resources.)

No, the proposal does make things "safe by default". The default
behavior disallows all mutation. You have to do something explicit to
allow it - because "explicit is better than implicit."

> > so that doing unsafe things causes the programmer to have to do
> > something explicit about making things safe.

> This is un-Pythonic, IMO[1]. Python generally permits dangerous (and
> even ugly) things when done by "consenting adults", on the theory that
> the programmer knows more about her problem than Python does. It
> seems to me that a more Pythonic approach to this would be to provide
> something like STM as a metaclass, mixin class, or decorator. (Don't
> ask me how.)

Adding STM would make concurrency easier to deal with, but wouldn't
address the fundamental problem. The proposed change doesn't prevent
users from doing dangerous (and even ugly) things. It just forces them
to *think* about what they're doing beforehand. I can even see allowing
immutable objects to change their attributes, with the caveat that this
shouldn't change the externally visible behavior of the object.

> > I believe this can be done at the mutation points (again, clojure
> > shows that it can be done).

> But clojure is a Lisp-derived language. Lisp was designed as a pure
> functional language (although AFAIK it pretty much immediately
> acquired "set"), and a very large number of Lisp algorithms are
> designed around conses which are (like Python tuples) basically
> immutable (yes, I know about setcar and setcdr, but use of those
> functions is generally considered a bug). Whether that orientation
> toward immutable objects continues in Clojure I don't know, but if it
> does, the problem of designing a "write barrier" for mutations may be
> (a) simpler and (b) have less performance impact than the analogous
> task applied to Python.

Um, yeah, I did point out later in the paragraph that preserving
Python's data types may make this assumption false.

> While Python-the-language does have some immutable objects like tuples
> and strings, it's really kind of hard to avoid use of containers like
> lists and dictionaries and classes with mutable objects.

And I also pointed out that this may be too much of a change to be
palatable to Python users. For that matter, if it requires losing
Python's primitive container types, it's probably too much of a change
to be palatable to me.

From mwm at mired.org Tue Nov 1 16:54:58 2011
From: mwm at mired.org (Mike Meyer)
Date: Tue, 1 Nov 2011 08:54:58 -0700
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: 
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: 

On Tue, Nov 1, 2011 at 1:31 AM, Nick Coghlan wrote:
> On Tue, Nov 1, 2011 at 6:01 PM, Stephen J. Turnbull wrote:
> > > I've identified the problem I want to solve: I want to make
> > > concurrent use of Python objects "safe by default",
> >
> > But that's not what you've proposed, AIUI. You've proposed making
> > concurrent use *safer*, but not yet *safe*. That's quite different
> > from the analogy with automatic memory management, where the
> > programmer can't do anything dangerous with pointers (because they
> > can't do anything at all). The analogous model for concurrency is
> > processes, it seems to me. 
(I don't have a desperate need for high-
> > performance concurrency, so I take no position on processes + message
> > passing vs. threads + shared resources.)
>
> Guido and python-dev in general *have* effectively taken a position on
> that, though (mainly due to Global Interpreter Lock discussions).
>
> 1. Even for threads, the recommended approach is to use queue.Queue to
> avoid the common concurrency issues (such as race conditions and
> deadlock) associated with explicit locking
> 2. In Python 3, concurrent.futures offers an even *safer* and higher
> level interface for many concurrent workloads
> 3. If you use multiple processes and serialised messages, or higher
> level APIs like concurrent.futures, you can not only scale to multiple
> cores, but also to multiple *machines*.

I am aware of all this. I've written large systems using Queue.Queue
and the multiple process/serialized messages model. I've dealt with
code that tried to mix the two (*not* a good idea). The process model
works really well - if you can use it. The problem is, if you can't,
you lose all the protection it provides. That's the area I'm trying to
address.

Also, the process model doesn't prevent these concurrency issues, it
just moves them to external objects. I figure that's an even harder
problem, since it can involve multiple machines. An improvement in the
shared storage case might shed some light on it.

> This has led to a quite deserved reputation for being intolerant of
> changes that claim to make multithreaded development "better", but
> only at the expense of making single-threaded development worse.

I think I've found a way to implement the proposal without having a
serious impact on single-threaded code - at least in terms of
performance and having to change the code.

From p.f.moore at gmail.com Tue Nov 1 17:36:36 2011
From: p.f.moore at gmail.com (Paul Moore)
Date: Tue, 1 Nov 2011 16:36:36 +0000
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: 
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: 

On 1 November 2011 15:38, Mike Meyer wrote:
> No, the proposal does make things "safe by default". The default
> behavior disallows all mutation. You have to do something explicit to
> allow it - because "explicit is better than implicit."
[...]
> Um, yeah, I did point out later in the paragraph that preserving
> Python's data types may make this assumption false.
[...]
> And I also pointed out that this may be too much of a change to be
> palatable to Python users. For that matter, if it requires losing
> Python's primitive container types, it's probably too much of a change
> to be palatable to me.

I don't know if you've considered this already, but a for-loop in
Python creates an iterator and then mutates it (by calling next()) on
each run through the loop. I can't see any way this could be a
concurrency problem in itself, but you'd likely need to either
reimplement the for loop to avoid relying on mutable iterators, or
you'd need to add some sort of exclusion for iterators in for loops.

It'll be details like this that will be the hardest to thrash out, I
suspect...

Paul.
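PS For concreteness, here's roughly what the for loop does under the
hood (an untested sketch - the hidden name _it is made up). It's this
internal iterator object that gets mutated on every pass:

    l = [1, 2, 3]

    # for i in l: print(i)  -- is roughly equivalent to:
    _it = iter(l)           # hidden iterator created by the for loop
    while True:
        try:
            i = next(_it)   # mutates _it each time around
        except StopIteration:
            break
        print(i)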
From van.lindberg at gmail.com Tue Nov 1 18:08:22 2011
From: van.lindberg at gmail.com (VanL)
Date: Tue, 01 Nov 2011 12:08:22 -0500
Subject: [Python-ideas] Draft PEP for the regularization of Python install layouts
In-Reply-To: 
References: 
Message-ID: 

Hi Jim,

On Oct 30, 2011 5:58 PM, "Jim Jewett" wrote:
> Is this something that Python even *can* reasonably control,
> particularly on the various Linux distributions?

In my experience, the location for the Python environment changes a
bit, but the internal layout is generally consistent with what is set
out in sysconfig and distutils.command.install. The most unique layout
I have seen is for Ubuntu, which adds a vendor-packages directory. I
would love to be corrected in this regard.

> What might be helpful would be a few more symbolics (if any are
> actually missing) and a few test environments that use something
> unexpected for each value, so that you *will* notice if you have
> hardcoded assumptions specific to your own setup.

The suggested values are taken from the stdlib, not from my own setup,
so if someone is generating or using different values, they are not
coming from the stdlib.

Thanks,
Van

From phd at phdru.name Tue Nov 1 18:29:37 2011
From: phd at phdru.name (Oleg Broytman)
Date: Tue, 1 Nov 2011 21:29:37 +0400
Subject: [Python-ideas] Draft PEP for the regularization of Python install layouts
In-Reply-To: 
References: 
Message-ID: <20111101172937.GA30350@iskra.aviel.ru>

On Tue, Nov 01, 2011 at 12:08:22PM -0500, VanL wrote:
> On Oct 30, 2011 5:58 PM, "Jim Jewett" wrote:
> > Is this something that Python even *can* reasonably control,
> > particularly on the various Linux distributions?
>
> In my experience, the location for the Python environment changes a
> bit, but the internal layout is generally consistent with what is set
> out in sysconfig and distutils.command.install. The most unique
> layout I have seen is for Ubuntu, which adds a vendor-packages
> directory.

Debian 6 "squeeze" added dist-packages for 3rd-party modules installed
via apt/dpkg.

Oleg.
-- 
Oleg Broytman http://phdru.name/ phd at phdru.name
Programmers don't die, they just GOSUB without RETURN.

From stephen at xemacs.org Tue Nov 1 19:05:06 2011
From: stephen at xemacs.org (Stephen J. Turnbull)
Date: Wed, 02 Nov 2011 03:05:06 +0900
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: 
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: <87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp>

Mike Meyer writes:

> The proposed change doesn't prevent users from doing dangerous (and even
> ugly) things.

I didn't say it did, it "merely" imposes substantial inconvenience in
hope that:

> It just forces them to *think* about what they're doing beforehand.

which I believe to be un-Pythonic.

But you say that you have an approach in mind which is reasonably
performant and doesn't change things too much for single-threaded
apps, which would make the discussion moot. So let's see how that
works out.

From mwm at mired.org Tue Nov 1 22:09:33 2011
From: mwm at mired.org (Mike Meyer)
Date: Tue, 1 Nov 2011 14:09:33 -0700
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: 
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: 

On Tue, Nov 1, 2011 at 9:36 AM, Paul Moore wrote:
> On 1 November 2011 15:38, Mike Meyer wrote:
> > No, the proposal does make things "safe by default". 
The default behavior
> > disallows all mutation. You have to do something explicit to allow it -
> > because "explicit is better than implicit."
[...]
> > Um, yeah, I did point out later in the paragraph that preserving
> > Python's data types may make this assumption false.
[...]
> > And I also pointed out that this may be too much of a change to be
> > palatable to Python users. For that matter, if it requires losing
> > Python's primitive container types, it's probably too much of a change
> > to be palatable to me.
>
> I don't know if you've considered this already, but a for-loop in
> Python creates an iterator and then mutates it (by calling next()) on
> each run through the loop. I can't see any way this could be a
> concurrency problem in itself, but you'd likely need to either
> reimplement the for loop to avoid relying on mutable iterators, or
> you'd need to add some sort of exclusion for iterators in for loops.

How about a third option? Iterators have to be locked to do a next in
general, as they can be bound and thus shared between execution
threads. On the other hand, locking & unlocking should be the major
performance hit, so you don't want to do that on something that's going
to be happening a lot, so the caller should be allowed to do something
to indicate that it's not required. Locking the iterator should do
that. So the next method needs to add a test to see if self is locked,
and if not, lock and then unlock self.

> It'll be details like this that will be the hardest to thrash out, I
> suspect...

Yup.

Thanks,

From mwm at mired.org Tue Nov 1 22:13:07 2011
From: mwm at mired.org (Mike Meyer)
Date: Tue, 1 Nov 2011 14:13:07 -0700
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: <87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp>
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: 

On Tue, Nov 1, 2011 at 11:05 AM, Stephen J. Turnbull wrote:
> Mike Meyer writes:
> > The proposed change doesn't prevent users from doing dangerous (and even
> > ugly) things.
> I didn't say it did, it "merely" imposes substantial inconvenience in
> hope that:
> > It just forces them to *think* about what they're doing beforehand.
> which I believe to be un-Pythonic.

Really? Thinking is unpythonic?

> But you say that you have an approach in mind which is reasonably
> performant and doesn't change things too much for single-threaded
> apps, which would make the discussion moot. So let's see how that
> works out.

If all you want to do is get the old semantics back in a
single-threaded application, you could do something like turning:

    if __name__ == '__main__':
        main()

into:

    if __name__ == '__main__':
        locking:
            main()

Actually, that achieves my goal - you hopefully thought about this long
enough to realize that this was safe before doing it.

From mwm at mired.org Wed Nov 2 00:49:50 2011
From: mwm at mired.org (Mike Meyer)
Date: Tue, 1 Nov 2011 16:49:50 -0700
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: 
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
Message-ID: 

On Mon, Oct 31, 2011 at 11:55 PM, Terry Reedy wrote:
> On 11/1/2011 1:32 AM, Mike Meyer wrote:
>> As you point out, this is a hard problem. I know I haven't covered all
>> the issues. That's why the second thing I said was that I'm hoping to
>> get people smarter than me to look at things.
>
> This is one of the hard problems that keep getting swept under the rug
> while we do easier things. Well, we have overhauled unicode and
> packaging for 3.3, so maybe concurrency can get some attention.

Hey, it worked!

>
> I keep thinking that CPython's design of allowing C coded modules
> either outside or inside the stdlib should allow some progress.
>
> Would it be helpful, for instance, to have truly immutable restricted
> tuples and frozensets, whose __new__ methods only allowed true
> immutables (None, booleans, numbers, strings, other restricted tuples
> and frozensets) as members?

Possibly. However, so long as the mutations they make don't change the
externally visible behavior, then for the purposes of this discussion
they already are immutable. Or is it possible that concurrent updates
of that not-externally-visible state could cause things to break?

> How about a metaclass, say 'immutable', that made the instances of a
> user class truly immutable? (I don't know how to do this, but let's
> assume it can be done -- perhaps with a new frozendict.) If such were
> possible, instances of instances of such a metaclass could be added to
> the list above.

Well, on the basis that we're all adults, I'm willing to accept that a
programmer saying "I want instances of this class to be immutable"
means they'll only subvert whatever mechanism is used to do this when
it's safe to do so (i.e. "not externally visible"). So catching casual
attempts - assignments to attributes - will do, and we can do that by
providing a __setattr__ method that always throws an exception.

Actually, I think that's the key to implementing this efficiently.
__setattr__ on objects that aren't locked throws an exception (or
triggers locking inside an STM). Locking them changes __setattr__ to
something that works appropriately. Builtin types will need more
extensive tweaking along those lines. An immutable type doesn't need
the working variant of __setattr__.

> Could a metaclass automatically add fine-grained locks around
> attribute mutations?

Wouldn't that be another variation on the __setattr__ method, that did:

    locking self.__dict__:
        self.__dict__[name] = value

I can see that that would be useful, but would expect most objects
would want to change more than one attribute in a consistent method, so
they'd have a method that locked self and made all those changes.

From: tjreedy at udel.edu (Terry Reedy)
Subject: [Python-ideas] Cofunctions - A New Protocol
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
	<1320123506.9456.57.camel@Gutsy>
	<4EAFC8EC.4080601@canterbury.ac.nz>
Message-ID: 

On 11/1/2011 6:24 AM, Greg Ewing wrote:
...
> Cofunctions
> -----------
>
> There will be a special form of Python function called a "cofunction",
> defined using the new keyword ``codef`` in place of ``def``.

Is this really needed? The presence of 'coyield' signals 'cofunction',
just as 'yield' signals 'generator'. Does a cofunction without a
suspend point make sense? (And if it did, 'if False: coyield' or
'coyield' after 'return' could serve as a signal.)

> A cofunction provides a
> convenient way of creating an object obeying the coroutine protocol.
> (This is similar to how a generator provides a convenient way of
> creating an object obeying the iterator protocol).
>
> Suspension of a cofunction is achieved using the expression
>
> ::
>
>     ``coyield`` [value]
>
> This is analogous to a ``yield`` expression in a generator, and like
> ``yield``, it can both provide and receive a value. However, unlike
> ``yield``, it is *not* restricted to communicating with the immediate
> caller. It communicates directly with the ``resume`` method of the
> coroutine, however deep the nesting of calls is between the ``resume``
> call and the ``coyield``.
>
> There are some restrictions, however:
>
> * A ``coyield`` is only allowed in the body of a cofunction (a function
>   defined with ``codef``), not in any other context.
>
> * A cofunction can only be called from the body of another cofunction,
>   not in any other context.
>
> Exceptions are raised if any of these restrictions are violated.

Except that an initial 'call' from a coroutine.resume is needed to get
the first cofunction started ;-).

-- 
Terry Jan Reedy

From ncoghlan at gmail.com Wed Nov 2 01:45:05 2011
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 2 Nov 2011 10:45:05 +1000
Subject: [Python-ideas] Cofunctions - A New Protocol
In-Reply-To: 
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
	<1320123506.9456.57.camel@Gutsy>
	<4EAFC8EC.4080601@canterbury.ac.nz>
Message-ID: 

On Wed, Nov 2, 2011 at 10:05 AM, Terry Reedy wrote:
>> Cofunctions
>> -----------
>>
>> There will be a special form of Python function called a "cofunction",
>> defined using the new keyword ``codef`` in place of ``def``.
>
> Is this really needed? The presence of 'coyield' signals 'cofunction', just
> as 'yield' signals 'generator'. Does a cofunction without a suspend point
> make sense? (And if it did, 'if False: coyield' or 'coyield' after 'return'
> could serve as a signal.)

Something is needed, since there probably won't *be* an explicit
coyield in the top level function (instead, it would call async I/O
operations that used coyield internally).

However, as per the previous thread, I don't believe this needs to be
embedded in the bytecode by the compiler - it could instead be a
runtime switch in the eval loop, changing the way function calls and
iteration are handled.

Cheers,
Nick.

-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From stephen at xemacs.org Wed Nov 2 02:45:42 2011
From: stephen at xemacs.org (Stephen J. Turnbull)
Date: Wed, 02 Nov 2011 10:45:42 +0900
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: 
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: <87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp>

Mike Meyer writes:
> On Tue, Nov 1, 2011 at 11:05 AM, Stephen J. Turnbull wrote:
> > I didn't say it did, it "merely" imposes substantial inconvenience in
> > hope that:
> > > It just forces them to *think* about what they're doing beforehand.
> > which I believe to be un-Pythonic.
>
> Really? Thinking is unpythonic?

No, "forcing" is. Consenting adults and all that.
> If all you want to do is get the old semantics back in a single-threaded
> application, you could do something like turning:
>
>     if __name__ == '__main__':
>         main()
>
> into:
>
>     if __name__ == '__main__':
>         locking:
>             main()
>
> Actually, that achieves my goal - you hopefully thought about this long
> enough to realize that this was safe before doing it.

Anybody who does that is simply shutting off the warning/errors, and
clearly is not thinking about their app at all.

But this is revealing: you say *your* goal is making *me* think. That's
what I consider un-Pythonic. A Pythonic approach would allow me to
worry about it when *I* think it necessary. Maybe we don't have that
choice, maybe concurrency is too hard to solve without some annoying
constraints. But that's not at all clear to me, and I'd rather make
gradual progress toward safety in a language that's fun and profitable
to use, rather than have safety in a language that is a pain in the
neck to use.

From mwm at mired.org Wed Nov 2 06:53:41 2011
From: mwm at mired.org (Mike Meyer)
Date: Tue, 1 Nov 2011 22:53:41 -0700
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: <87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp>
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: <20111101225341.51801e1b@bhuda.mired.org>

On Wed, 02 Nov 2011 10:45:42 +0900
"Stephen J. Turnbull" wrote:
> Mike Meyer writes:
> > On Tue, Nov 1, 2011 at 11:05 AM, Stephen J. Turnbull wrote:
> > > I didn't say it did, it "merely" imposes substantial inconvenience in
> > > hope that:
> > > > It just forces them to *think* about what they're doing beforehand.
> > > which I believe to be un-Pythonic.
> > Really? Thinking is unpythonic?
> No, "forcing" is. Consenting adults and all that.

But you yourself admit that this isn't forcing you to think:

> > If all you want to do is get the old semantics back in a single-threaded
> > application, you could do something like turning:
> >
> >     if __name__ == '__main__':
> >         main()
> >
> > into:
> >
> >     if __name__ == '__main__':
> >         locking:
> >             main()
> >
> > Actually, that achieves my goal - you hopefully thought about this long
> > enough to realize that this was safe before doing it.
>
> Anybody who does that is simply shutting off the warning/errors, and
> clearly is not thinking about their app at all.

So you admit this doesn't force you to think. It just makes you add a
statement to shut up the warnings. Pretty much the same thing as using
a bare except clause. Me, I'd think about it long enough to convince
myself that the app really was single-threaded.

> But this is revealing: you say *your* goal is making *me* think.

Only if I may wind up maintaining the code you wrote. But that's a
driving factor in a *lot* of the design decisions when it comes to
extending Python.

> That's what I consider un-Pythonic.

I feel just the opposite. Python doesn't allow errors to silently
pass, or guess what the programmer wanted to do, or make inferences
about things - it raises exceptions. That forces the programmer to
think about the exception and handle it properly. Or they can not
think about it, and just use a bare except clause. I think that's very
pythonic.
In fact, getting tired of chasing down such bugs in Perl code was why I
switched from Perl to Python, and then cut my rates in order to
convince my clients to let me write in what was then a strange new
language.

This proposal builds on that base: it catches errors of a type that are
currently ignored and raises an exception. It also adds a new statement
for *dealing* with those errors, because handling them with exceptions
won't really work. There's even an analog for the bare except if you
want to use it. And it comes about for much the same reason: I'm
getting tired of chasing down bugs in concurrent code. There are
languages that offer that. Some even run in environments I like, and
are fun to write when they're applicable. But I find myself wishing for
Python's features when I write in them.

> A Pythonic approach would allow me to worry about it when *I* think it
> necessary. Maybe we don't have that choice, maybe concurrency is too
> hard to solve without some annoying constraints. But that's not at
> all clear to me, and I'd rather make gradual progress toward safety in
> a language that's fun and profitable to use, rather than have safety
> in a language that is a pain in the neck to use.

Based on my experience, your second sentence is true. If that were all
it were, the Queue module would be most of a solution, and there are
STM modules available if that's not good enough. But they only solve
half the problem - they make it easier to get things right once you
decide the data is shared. People are as likely to miss that data is
shared as they are to screw up the locking. In other words, if we do it
your way, it'll deal with less than half of what's bugging me.

It may be that Python's data structures will make this unworkable. It
may be that a workable solution will suck the performance out of
non-concurrent applications. It may be that anything that fixes both of
those will be unpalatable for other reasons. There's no way to find out
except by trying. And I'd rather try that than start trying to convince
people to let me write in some strange new language again.

http://www.mired.org/
Independent Software developer/SCM consultant, email for more information.

O< ascii ribbon campaign - stop html mail - www.asciiribbon.org

From ncoghlan at gmail.com Wed Nov 2 07:12:06 2011
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 2 Nov 2011 16:12:06 +1000
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: <20111101225341.51801e1b@bhuda.mired.org>
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp>
	<20111101225341.51801e1b@bhuda.mired.org>
Message-ID: 

On Wed, Nov 2, 2011 at 3:53 PM, Mike Meyer wrote:
> On Wed, 02 Nov 2011 10:45:42 +0900
> "Stephen J. Turnbull" wrote:
>> Mike Meyer writes:
>> > On Tue, Nov 1, 2011 at 11:05 AM, Stephen J. Turnbull wrote:
>> > > I didn't say it did, it "merely" imposes substantial inconvenience in
>> > > hope that:
>> > > > It just forces them to *think* about what they're doing beforehand.
>> > > which I believe to be un-Pythonic.
>> > Really? Thinking is unpythonic?
>> No, "forcing" is. Consenting adults and all that.
>
> But you yourself admit that this isn't forcing you to think:

It's forcing you to think the way Java's checked exceptions force you
to think - they make you think "Gee, it's tedious having to write all
this boilerplate to get the compiler/interpreter to STFU and let me get
on with doing my job".

"safe by default" is an excellent design principle, but so is "stay out
of the way". The two are often in tension, and this is one of those
times.

Cheers,
Nick.

-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From stephen at xemacs.org Wed Nov 2 09:49:48 2011
From: stephen at xemacs.org (Stephen J. Turnbull)
Date: Wed, 02 Nov 2011 17:49:48 +0900
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: <20111101225341.51801e1b@bhuda.mired.org>
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp>
	<87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp>
	<20111101225341.51801e1b@bhuda.mired.org>
Message-ID: <87aa8eq577.fsf@uwakimon.sk.tsukuba.ac.jp>

Mike Meyer writes:

> > No, "forcing" is. Consenting adults and all that.
>
> But you yourself admit that this isn't forcing you to think:

Nice try, but I didn't say it forces me to think. It forces me to do
something to shut up the language. That's ugly.

> It just makes you add a statement to shut up the warnings. Pretty
> much the same thing as using a bare except clause.

The bare except clause is optional; I can (and often do) simply let the
exception terminate the process *if* it ever happens. My understanding
is that that isn't good enough for you (because concurrency errors
usually lead to silent data corruption rather than a spectacular and
immediate crash).

> And it comes about for much the same reason: I'm getting tired of
> chasing down bugs in concurrent code. There are languages that
> offer that.

Well, if you want help chasing down bugs in concurrent code, I would
think that you would want to focus on concurrent code. First, AFAICS
ordinary function calls don't expose additional objects to concurrency
(they may access exposed objects, of course, but they were passed in
from above by a task, or are globals). So basically every object
exposed to concurrency is in either args or kwargs in a call to
threading.Thread (or thread.start_new_thread), no?

Wouldn't it be possible to wrap those objects (and only those objects)
such that the wrapper intercepts attempts to access the wrapped
objects, and "does something" (warn, raise, dance on the head of a pin)
if the access is unlocked or whatever? Then only concurrent code and
the objects exposed to it pay the cost. If it's really feasible to do
it via wrapper, you could write a decorator or something that could
easily be turned into a no-op for tested code ready to go into
production.

> People are as likely to miss that data is shared as they are to
> screw up the locking. In other words, if we do it your way, it'll
> deal with less than half of what's bugging me.
[...]
> There's no way to find out except by trying.

Well, no, it's not about doing it my way; I'm perfectly happy with
processes and message-passing in my applications, and aside from wacky
ideas like the above, that I don't really know how to implement myself,
I don't have a lot of suggestions for concurrency by threading.
Rather, it's that my guess is that if you don't make the costs of
safe(r) concurrency look more reasonable you won't be getting much help
here.
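P.S. To make the wrapper idea slightly more concrete, something along
these lines is what I'm imagining - completely untested, and the names
(GuardedProxy, guarded_thread) are invented for illustration; note that
it leans on the private RLock._is_owned() method:

    import threading

    class GuardedProxy:
        """Sketch: intercept mutation of an object shared with a thread."""
        def __init__(self, obj):
            object.__setattr__(self, '_obj', obj)
            object.__setattr__(self, '_lock', threading.RLock())

        def __enter__(self):
            object.__getattribute__(self, '_lock').acquire()
            return self

        def __exit__(self, *exc_info):
            object.__getattribute__(self, '_lock').release()

        def __getattr__(self, name):
            # reads pass through to the wrapped object
            return getattr(object.__getattribute__(self, '_obj'), name)

        def __setattr__(self, name, value):
            # writes are refused unless the proxy's lock is held
            lock = object.__getattribute__(self, '_lock')
            if not lock._is_owned():
                raise RuntimeError('unlocked mutation of shared object')
            setattr(object.__getattribute__(self, '_obj'), name, value)

    def guarded_thread(target, *args):
        # wrap every argument before exposing it to the new thread
        return threading.Thread(target=target,
                                args=[GuardedProxy(a) for a in args])

Making guarded_thread pass its arguments through unwrapped would be the
"no-op for production" switch.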
From p.f.moore at gmail.com Wed Nov 2 10:27:51 2011
From: p.f.moore at gmail.com (Paul Moore)
Date: Wed, 2 Nov 2011 09:27:51 +0000
Subject: [Python-ideas] Fwd: Concurrent safety?
In-Reply-To: 
References: <20111030201143.481fdca2@bhuda.mired.org>
	<20111031223048.6e5d2798@bhuda.mired.org>
	<87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp>
Message-ID: 

On 1 November 2011 21:09, Mike Meyer wrote:
> On Tue, Nov 1, 2011 at 9:36 AM, Paul Moore wrote:
>> I don't know if you've considered this already, but a for-loop in
>> Python creates an iterator and then mutates it (by calling next()) on
>> each run through the loop. I can't see any way this could be a
>> concurrency problem in itself, but you'd likely need to either
>> reimplement the for loop to avoid relying on mutable iterators, or
>> you'd need to add some sort of exclusion for iterators in for loops.
>
> How about a third option? Iterators have to be locked to do a next in
> general, as they can be bound and thus shared between execution
> threads. On the other hand, locking & unlocking should be the major
> performance hit, so you don't want to do that on something that's
> going to be happening a lot, so the caller should be allowed to do
> something to indicate that it's not required. Locking the iterator
> should do that. So the next method needs to add a test to see if self
> is locked, and if not, lock and then unlock self.

I'm not sure what you mean here. Suppose I have

    l = [1,2,3]
    for i in l:
        print(i)

Here, the thing you need to lock is not l, as it's not being mutated,
but the temporary iterator generated by the for loop. That's not
exposed to the user, so you can't lock it manually. Should it be
locked? It can never be seen from another thread. But how do you code
that exception to the rule?

What about

    l = iter([1,2,3])
    for i in l:
        print(i)

Here the for loop generates iter(l) - which, simply because of the
implementation of __iter__ for iterators, returns l. So should I lock l
here? It *is* exposed to other threads, potentially. How does the
compiler detect the difference between this and the previous example?

This seems to me to be a recipe for having users scatter arbitrary
locks around their code "just to shut the interpreter up". It's not at
all clear that it helps people think, in that there's no easy mental
model people can acquire to help them reason about what is going on.
Just a load of exceptions that need to be silenced somehow.

Paul.

From greg.ewing at canterbury.ac.nz Wed Nov 2 10:44:28 2011
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 02 Nov 2011 22:44:28 +1300
Subject: [Python-ideas] Cofunctions - A New Protocol
In-Reply-To: 
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
	<1320123506.9456.57.camel@Gutsy>
	<4EAFC8EC.4080601@canterbury.ac.nz>
Message-ID: <4EB110FC.1040108@canterbury.ac.nz>

Terry Reedy wrote:

> Is this really needed? The presence of 'coyield' signals 'cofunction',
> just as 'yield' signals 'generator'.

The 'coyield' doesn't have to be directly in that function, it could be
in something called by that function, any number of levels deep.

However, it's since occurred to me that 'coyield' doesn't have to be a
keyword, it could be a built-in cofunction.

>> * A cofunction can only be called from the body of another cofunction,
>> not in
>> any other context.
>
> Except that an initial 'call' from a coroutine.resume is needed to get
> the first cofunction started ;-).

Yes, but that's not done using the normal call syntax, which is what
I'm talking about there (that could perhaps be made clearer).

-- 
Greg

From greg.ewing at canterbury.ac.nz Wed Nov 2 10:54:56 2011
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 02 Nov 2011 22:54:56 +1300
Subject: [Python-ideas] Cofunctions - A New Protocol
In-Reply-To: 
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
	<1320123506.9456.57.camel@Gutsy>
	<4EAFC8EC.4080601@canterbury.ac.nz>
Message-ID: <4EB11370.8060205@canterbury.ac.nz>

Nick Coghlan wrote:

> However, as per the previous thread, I don't believe this needs to be
> embedded in the bytecode by the compiler - it could instead be a
> runtime switch in the eval loop, changing the way function calls and
> iteration are handled.

Yes, but I'm no longer sure whether it's such a good idea to have no
special syntax at all to mark a cofunction, seeing as cofunctionness
won't be able to propagate through C calls, special methods, etc.

By having cofunctions declared in a distinctive way, you can look at
the source and see exactly where the boundary is between cofunction and
non-cofunction code. Without such markers, when you get an exception
because you tried to suspend in a non-coroutine zone, it may not be
obvious at which point along the call chain you made a mistake.

-- 
Greg

From anacrolix at gmail.com Wed Nov 2 11:07:08 2011
From: anacrolix at gmail.com (Matt Joiner)
Date: Wed, 2 Nov 2011 21:07:08 +1100
Subject: [Python-ideas] Cofunctions - A New Protocol
In-Reply-To: <4EAFC8EC.4080601@canterbury.ac.nz>
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
	<1320123506.9456.57.camel@Gutsy>
	<4EAFC8EC.4080601@canterbury.ac.nz>
Message-ID: 

I don't think new keywords should be necessary. A module should be
sufficient. Also why CoExit when you have GeneratorExit? Might as well
make it CoroutineExit.

On Tue, Nov 1, 2011 at 9:24 PM, Greg Ewing wrote:
> A Coroutine Protocol
> ====================
>
> [Greg's proposal, quoted here in full in the original message, is
> snipped - see the earlier post for the complete text.]
From jh at improva.dk Wed Nov 2 12:54:07 2011
From: jh at improva.dk (Jacob Holm)
Date: Wed, 02 Nov 2011 12:54:07 +0100
Subject: [Python-ideas] Cofunctions - A New Protocol
In-Reply-To: <4EB11370.8060205@canterbury.ac.nz>
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
	<1320123506.9456.57.camel@Gutsy>
	<4EAFC8EC.4080601@canterbury.ac.nz>
	<4EB11370.8060205@canterbury.ac.nz>
Message-ID: <4EB12F5F.4080109@improva.dk>

On 2011-11-02 10:54, Greg Ewing wrote:
> Nick Coghlan wrote:
>
>> However, as per the previous thread, I don't believe this needs to be
>> embedded in the bytecode by the compiler - it could instead be a
>> runtime switch in the eval loop, changing the way function calls and
>> iteration are handled.
>
> Yes, but I'm no longer sure whether it's such a good idea
> to have no special syntax at all to mark a cofunction,
> seeing as cofunctionness won't be able to propagate through
> C calls, special methods, etc.
>
> By having cofunctions declared in a distinctive way, you
> can look at the source and see exactly where the boundary is
> between cofunction and non-cofunction code. Without such
> markers, when you get an exception because you tried to
> suspend in a non-coroutine zone, it may not be obvious
> at which point along the call chain you made a mistake.
>

If the switch Nick describes is available as a flag on the frame
objects, it would be easy to extend the traceback to show *exactly*
where you entered the no-coroutine zone that you are now failing to
suspend. I don't think the additional syntax is helpful, and it would
be quite annoying to need to have two versions of every wrapper
function/decorator to make it useable in both contexts.

- Jacob

From ron3200 at gmail.com Wed Nov 2 17:21:12 2011
From: ron3200 at gmail.com (Ron Adam)
Date: Wed, 02 Nov 2011 11:21:12 -0500
Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol
In-Reply-To: 
References: <4EA8BD66.6010807@canterbury.ac.nz>
	<20111027183208.GH20970@pantoffel-wg.de>
	<4EA9AB03.8070302@stoneleaf.us>
	<4EA9FED3.6050505@pearwood.info>
	<4EADBEA7.9000608@canterbury.ac.nz>
	<4EAE5F83.9040305@canterbury.ac.nz>
	<1320083850.5984.115.camel@Gutsy>
	<4EAF0DEE.1020500@canterbury.ac.nz>
	<1320106977.6637.50.camel@Gutsy>
	<1320123506.9456.57.camel@Gutsy>
Message-ID: <1320250872.14285.65.camel@Gutsy>

On Tue, 2011-11-01 at 18:15 +1000, Nick Coghlan wrote:
> On Tue, Nov 1, 2011 at 4:27 PM, Terry Reedy wrote:
> > I believe raise just instantiates the indicated exception. I expect that
> > Exception.__new__ or .__init__ captures the traceback info. Subclasses can
> > add more. A SuspendExecution exception should be able to grab as much as is
> > needed for a resume. A CAPI call could be added if needed.
>
> No, the traceback info is added by the eval loop itself. Remember that
> when you raise an exception *type* (rather than an instance), the
> exception doesn't get instantiated until it gets caught somewhere -
> the eval loop maintains the unwinding stack for the traceback as part
> of the thread state until it is time to attach it to the exception
> object.
>
> This is all at the tail end of the eval loop in CPython, but be warned
> it's fairly brain bending stuff that depends on various internal
> details of the eval loop:
> http://hg.python.org/cpython/file/default/Python/ceval.c#l2879

Thanks for the link. I've been trying to get my brain bent around it,
but yes, it is hard to understand how it all ties together.

This morning I had a thought that may lead somewhere...

Would it be possible to rewrite the 'yield' internals so they work in
the following way...

    # a = yield b
    try:
        raise SuspendException(b, _self=_self)
    except ContinueException as exc:
        a = exc.args

    # b = gen.send(a)
    def send(gen, a=None):
        try:
            gen.throw(ContinueException(a))
        except SuspendException as exc:
            (gen, *b) = exc.args
            return b

The two requirements for this to work are...

* A SuspendException needs to be able to pass out of the generator
  without causing it to stop.

* A throw needs to be able to work where the SuspendException was
  raised.

The next issue after that is how to allow a subclass of
SuspendException to get past the next() or .send() caller. A
subclassed SuspendException would still be caught by 'except
SuspendException as exc'. This is needed as a scheduler or other outer
framework sits outside the scope the generator is *called* in.
Exceptions work in the caller's frame rather than the defining scope.
That's an important feature as it will allow coroutines much more
freedom to be used in different contexts.

What this does is give the non-local semantics you mentioned earlier.

Cheers,
Ron

> > I hope you keep looking at this idea. Function calls stop execution
> > and pass control 'down', to be resumed by return. yield stops
> > execution and passes control 'up', to be resumed by next (or .send).
> > Exceptions pass control 'up' (or 'out') without the possibility of
> > resuming. All that is lacking is something to suspend and pass
> > control 'sideways', to a specific target. A special exception makes
> > some sense in that exceptions already get the call stack needed to
> > resume after suspension.
>
> That's not actually true - due to the need to process exception
> handling clauses and finally blocks (including the implicit ones
> inside with statements), the internal state of those frames is
> potentially no longer valid for resumption (they've moved on beyond
> the point where the internal function was called).
>
> I'll also note that it isn't necessary to pass control sideways, since
> there are two different flavours of coroutine design (the PDF article
> in the other thread describes this well). The Lua version is
> "asymmetric coroutines", and they only allow you to return to the
> point that first invoked the coroutine (this model is a fairly close
> fit with Python's generators and exception handling). The greenlet
> version is "symmetric" coroutines, and those let you switch directly
> to any other coroutine.
>
> Both models have their pros and cons, but the main advantage of
> asymmetric coroutines is that you can just say "suspend this thread"
> without having to say *where* you want to switch to. Of course, you
> can implement much the same API with symmetric coroutines as well, so
> long as you can look up your parent coroutine easily. Ultimately, I
> expect the symmetric vs asymmetric decision will be driven more by
> implementation details than by philosophical preferences one way or
> the other.
>
> I will note that Ron's suggestion to leverage the existing eval loop
> stack collection provided by the exception handling machinery does
> heavily favour the asymmetric approach. Having a quick look to refresh
> my memory of some of the details of CPython's exception handling, I've
> come to the following tentative conclusions:
>
> - an ordinary exception won't do, since you don't want to trigger
> except and finally blocks in outer frames (ceval.c#2903)
> - in CPython, a new "why = WHY_SUSPEND" at the eval loop layer is
> likely a better approach, since it would allow the frame stack to be
> collected without triggering exception handling
> - the stack unwinding would then end when a "SETUP_COCALL" block was
> encountered on the block stack (just as SETUP_EXCEPT and SETUP_FINALLY
> can stop the stack unwinding following an exception)
> - with the block stacks within the individual frames preserved, the
> collected stack should be in a fit state for later restoration
> - the "fast_yield" code and the generator resumption code should also
> provide useful insight
>
> There's nothing too magical there - once we disclaim the ability to
> suspend coroutines while inside a C function (even one that has called
> back in via the C/Python API), it should boil down to a combination of
> the existing mechanics for generators and exception handling. So, even
> though the above description is (highly) CPython specific, it should
> be feasible for other implementations to come up with something
> similar (although perhaps not easy:
> http://lua-users.org/lists/lua-l/2007-07/msg00002.html).
>
> Cheers,
> Nick.

From guido at python.org Wed Nov 2 18:26:20 2011
From: guido at python.org (Guido van Rossum)
Date: Wed, 2 Nov 2011 10:26:20 -0700
Subject: [Python-ideas] Changing str(someclass) to return only the class name
In-Reply-To: 
References: <4EA18598.9060602@netwok.org>
	<4EA1AFB0.4080000@pearwood.info>
	<4EA27189.8010002@pearwood.info>
	<4EA32507.7010900@pearwood.info>
	<4EAAD51A.9030608@netwok.org>
Message-ID: 

-1. I'd like it so that (given a suitable set of imports) if you typed
back to the interpreter what it printed at you, you get the same thing
back again.

>>> print(x)
42
>>> print(42)
42
>>> print(y)
None
>>> print(None)
None
>>> print(z)
foo
>>> print(foo)
foo

Ping's proposal would go against this:

>>> print(x)
foo()
>>> print(foo())
42
>>> print(Splat)
class Splat
>>> print(class Splat)
SyntaxError: invalid syntax

I'm expecting that in most cases there is enough redundancy in the name
that you'll know what kind of thing it is. And if you want to know for
sure, continue to use repr() -- or, at the interactive prompt, just
omit the print() call, since the interactive interpreter automatically
calls repr() on your expression.

--Guido

On Fri, Oct 28, 2011 at 3:00 PM, Ka-Ping Yee wrote:
> Hi there,
>
> I get that repr() is supposed to be the precise representation
> and str() is intended more to be friendly than precise. My
> concern with the proposal is just that this:
>
>    >>> print x
>    foo
>
> ...doesn't actually feel that friendly to me. I want to know
> that it's *probably* a function or *probably* a class, the same
> way that today, when I see:
>
>    >>> print x
>    biscuit
>
>    >>> print y
>    [1, 2, 3]
>
> I can guess that x is *probably* a string and y is *probably*
> a list (e.g. because I know I'm not working with any custom
> objects whose __str__ returns those things).
> It would create a slightly higher mental burden (or slightly higher probability of human error) if, when I see:
>
>    >>> print x
>    Splat
>
> ...I have to remember that x might be a string or a function or a class. I'd just like some kind of visual hint as to what it is. Like:
>
>    >>> print x
>    foo()
>
> or:
>
>    >>> print x
>    function foo
>
> or:
>
>    >>> print x
>    function foo(a, b)
>
> or:
>
>    >>> print x
>    class Bar
>
> In fact "function foo(a, b)" would actually be rather useful in a lot of situations, and I would argue, friendlier than "foo".
>
> --Ping

-- --Guido van Rossum (python.org/~guido) From mwm at mired.org Wed Nov 2 19:10:48 2011 From: mwm at mired.org (Mike Meyer) Date: Wed, 2 Nov 2011 11:10:48 -0700 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Wed, Nov 2, 2011 at 2:27 AM, Paul Moore wrote: > On 1 November 2011 21:09, Mike Meyer wrote: >> On Tue, Nov 1, 2011 at 9:36 AM, Paul Moore wrote: >>> I don't know if you've considered this already, but a for-loop in >>> Python creates an iterator and then mutates it (by calling next()) on >>> each run through the loop. I can't see any way this could be a >>> concurrency problem in itself, but you'd likely need to either >>> reimplement the for loop to avoid relying on mutable iterators, or >>> you'd need to add some sort of exclusion for iterators in for loops. >> >> How about a third option? Iterators have to be locked to do a next in >> general, as they can be bound and thus shared between execution threads. On >> the other hand, locking & unlocking should be the major performance hit, so >> you don't want to do that on something that's going to be happening a lot, >> so the caller should be allowed to do something to indicate that it's not >> required. Locking the iterator should do that. So the next method needs to >> add a test to see if self is locked, and if not lock and then unlock self. > > I'm not sure what you mean here. Suppose I have
>
> l = [1,2,3]
> for i in l:
>     print(i)
>
> Here, the thing you need to lock is not l, as it's not being mutated, > but the temporary iterator generated by the for loop. That's not > exposed to the user, so you can't lock it manually. Should it be > locked? It can never be seen from another thread. But how do you code > that exception to the rule? You don't have to do anything. Iterators need to lock themselves to be safe in concurrent use, so this will work fine, with the temporary iterator doing whatever locking is needed. > What about
>
> l = iter([1,2,3])
> for i in l:
>     print(i)
>
This will work the same as the above. However, since you have the iterator in hand, you have the option to lock it before entering the for loop, which will cause it to *not* do its own internal locking. This brings up an interesting point, though - look for mail with the subject "Concurrency survey..." References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> <87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp> <87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp> <20111101225341.51801e1b@bhuda.mired.org> <87aa8eq577.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Wed, Nov 2, 2011 at 1:49 AM, Stephen J.
Turnbull wrote: > Mike Meyer writes: > Well, if you want help chasing down bugs in concurrent code, I would > think that you would want to focus on concurrent code. First, AFAICS > ordinary function calls don't expose additional objects to concurrency > (they may access exposed objects, of course, but they were passed in > from above by a task, or are globals). So basically every object > exposed to concurrency is in either args or kwargs in a call to > threading.Thread (or thread.start_new_thread), no? No. You missed two things. First, all the objects that can be accessed - however indirectly - through those objects are exposed to concurrency. Also, any globals and anything that can be accessed through them are exposed to concurrency. No, make that three things: a wrapped C library that has callbacks to Python code and uses threads internally can expose anything it's passed (and anything accessible from those objects) to concurrency, without ever using the Python threading code. I mentioned seeing a bug yesterday. My client's response means I can't in good faith try and fix it (it's in a testing framework, so doesn't affect the product, so they don't care). So this is a guess, but here's what I think is going on: 1) We're using a test framework that creates a mock time module for some reason. At some point, the mock object has the value None. 2) The application being tested uses a database module that uses threads as part of managing a connection pool. The concurrency unsafe code in the test framework (which is clearly and obviously single-threaded, right?) managed to get the None-valued mock inserted in sys.modules due to a concurrency bug. So when I then use the time module in testing, I get an exception trying to access its attributes. This does show a bigger problem. Look for my next mail... References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On 2 November 2011 18:10, Mike Meyer wrote: > You don't have to do anything. Iterators need to lock themselves to be > safe in concurrent use, so this will work fine, with the temporary > iterator doing whatever locking is needed. So all iterators automatically lock themselves? Sounds like potentially quite an overhead. But you've obviously thought about it (even if I don't follow all the details) so I'll leave it at that. Paul. From mwm at mired.org Wed Nov 2 20:36:26 2011 From: mwm at mired.org (Mike Meyer) Date: Wed, 2 Nov 2011 12:36:26 -0700 Subject: [Python-ideas] A concurrency survey of sorts Message-ID: In order to get a better idea of where things stand, I'd like to get answers to a few questions. This isn't a traditional broadbased survey, but an attempt to get answers from a few people who might know or have good ideas. This is probably where I should have started, but better late than never.

1) How much of the Python standard library is known to be thread safe?

2) How many packages in PyPI are known to be thread safe?

3) Can you suggest another approach to getting safe high-performance shared data in concurrent operation? I've already considered:

a) I proposed making actions that mutate data require locked objects, because I've seen that work in other languages. I recognize that doesn't mean it will work in Python, but it's more than I can say about the alternatives I knew about then.

b) Bertrand Meyer's SCOOPS system, designed for Eiffel.
It has two major strikes against it: 1) it is based on type attributes on *variables*, and I couldn't figure out how to translate that to a language where variables aren't typed. 2) I don't know that there's a working implementation.

4) Can you suggest a minor change that would move things toward safer concurrent code with high-performance shared data? I can see two possibilities:

a) Audit any parts of the standard library that aren't already known to be thread safe, and flag those that aren't. Fixing them may want to wait on a better mechanism than posix locks.

b) Add a high-level, high-performance shared object facility to the multiprocess package.

Thanks, References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Wed, Nov 2, 2011 at 12:31 PM, Paul Moore wrote: > On 2 November 2011 18:10, Mike Meyer wrote: >> You don't have to do anything. Iterators need to lock themselves to be >> safe in concurrent use, so this will work fine, with the temporary >> iterator doing whatever locking is needed. > So all iterators automatically lock themselves? Sounds like > potentially quite an overhead. No, all iterators are written to be thread safe. This is pretty much a requirement if you want to use them in a threaded environment. Some iterators may be able to do this without locking. I suspect most won't. This makes me wonder about something. Is there a high-performance threading world that 1) doesn't assume that threading is the norm and thus doesn't worry about single-threaded performance (this is the impression I get about Java, but I may well be wrong) and 2) doesn't require knowing at either build (C/C++) or launch (haskell) time that it's going to be threaded? I haven't found such. References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On 11/2/2011 5:27 AM, Paul Moore wrote: > I'm not sure what you mean here. Suppose I have
>
> l = [1,2,3]
> for i in l:
>     print(i)
>
> Here, the thing you need to lock is not l, as it's not being mutated, If l is exposed to another thread, it can be mutated, and the hidden iterator in the for loop will work, but with indeterminate, or at least unexpected, results. Were you implicitly excluding such exposure? (A list can also be mutated within its iteration loop. There is even a use case for deleting an item while iterating in reverse.) Dicts *are* locked for iteration because mutating a hash array during iteration could have more drastic effects and there is no good use case. A built-in subclass of list could use the same mechanism as dict for locking during iteration. > but the temporary iterator generated by the for loop. That's not > exposed to the user, so you can't lock it manually. Should it be > locked? It can never be seen from another thread. So no need to lock *it*.
-- Terry Jan Reedy From tjreedy at udel.edu Wed Nov 2 23:34:55 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 02 Nov 2011 18:34:55 -0400 Subject: [Python-ideas] Cofunctions - A New Protocol In-Reply-To: <4EB110FC.1040108@canterbury.ac.nz> References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <4EAFC8EC.4080601@canterbury.ac.nz> <4EB110FC.1040108@canterbury.ac.nz> Message-ID: On 11/2/2011 5:44 AM, Greg Ewing wrote: > Terry Reedy wrote: > >> Is this really needed? The presence of 'coyield' signals 'cofunction', >> just as 'yield' signals 'generator'. > > The 'coyield' doesn't have to be directly in that function, > it could be in something called by that function, any number > of levels deep. > > However, it's since occurred to me that 'coyield' doesn't have > to be a keyword, it could be a built-in cofunction. My *feeling* is that 'codef' and 'coyield' are a bit ugly, so that a) they may hardly be used in extant code, and would be 'safe' to use as keywords, but b) I would not like to see them as keywords ;-). This is aside from thinking that the relative smallness of Python's keyword list is a *feature*. >>> * A cofunction can only be called from the body of another cofunction, >>> not in >>> any other context. >> >> Except that an initial 'call' from a coroutine.resume is needed to get >> the first cofunction started ;-). > > Yes, but that's not done using the normal call syntax, which > is what I'm talking about there I understood that, > (that could perhaps be made clearer). but the sentence, taken in isolation, does raise the bootstrap issue. So from an editorial viewpoint, I suggested the short addition. -- Terry Jan Reedy From tjreedy at udel.edu Thu Nov 3 00:01:38 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 02 Nov 2011 19:01:38 -0400 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> Message-ID: On 11/1/2011 7:49 PM, Mike Meyer wrote: > On Mon, Oct 31, 2011 at 11:55 PM, Terry > Reedy wrote: >> This is one of the hard problems that keep getting swept under the >> rug while we do easier things. Well, we have overhauled unicode and >> packaging for 3.3, so maybe concurrency can get some attention. > > Hey, it worked! Yes. Coroutines, which are a form of in-thread concurrency, are another 'under the rug' subject, which Greg has re-exposed. We need some possibly initially off-the-wall ideas for both. >> Would it be helpful, for instance, to have truly immutable >> restricted tuples and frozensets, whose __new__ methods only >> allowed true immutables (None, booleans, numbers, strings, other >> restricted tuples and frozensets) as members? > > Possibly. However, so long as the mutations they make don't change > the externally visible behavior, then for the purposes of this > discussion, they already are immutable. Or is it possible that > concurrent updates of that not-externally-visible state could cause > things to break? It would seem to me that a list within a tuple or frozenset is just as 'dangerous' as a list that is exposed directly. Correspondingly, a safe_list that locked all appropriate operations should be equally safe within or without.
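A minimal sketch of what such a safe_list might look like -- hypothetical code, locking only a few of the mutating operations to show the shape of the idea:

    import threading

    class safe_list(list):
        """List subclass that serialises its mutating operations.

        A sketch only: reads stay unlocked, and a real version would
        also need to cover sort(), slice assignment, __iadd__, etc.
        """
        def __init__(self, *args):
            super().__init__(*args)
            self._lock = threading.Lock()

        def append(self, item):
            with self._lock:
                super().append(item)

        def extend(self, items):
            with self._lock:
                super().extend(items)

        def pop(self, index=-1):
            with self._lock:
                return super().pop(index)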
Tuples containing mutables cannot be dict keys because they cannot be hashed because (truthful) mutables do not have a hash method returning an int. ({}.__hash__() and [].__hash__() are both None.) This suggests that one implementation for a safe(r) tuple class would be to try to hash a tuple upon creation (and might as well stash it away). >> How about a metaclass, say 'immutable', that made the instances of >> a user class truly immutable? (I don't know how to do this, but >> let's assume it can be done -- perhaps with a new frozendict.) If >> such were possible, instances of instances of such a metaclass >> could be added to the list above. > > Well, on the basis that we're all adults, I'm willing to accept that > a programmer saying "I want instances of this class to be immutable" > means they'll only subvert whatever mechanism is used to do this > when it's safe to do so (i.e. - "not externally visible"), so > catching casual attempts - assignments to attributes - to do so will > do, then we can do this by providing a __setattr__ method that always > throws an exception. I know about that, but there is no way for a safe_tuple class to know what a __setattr__ method does. But my new idea here is just to call hash(). That is not completely safe, but perhaps within 'consenting adults' territory. (In other words, if someone adds a __hash__ method to a class with mutable instances, puts such instances into a safe_tuple shared with multiple threads, mutates from multiple threads, and has a problem, ... too bad. And I hope you would never have to debug such ;-). -- Terry Jan Reedy From mwm at mired.org Thu Nov 3 00:13:49 2011 From: mwm at mired.org (Mike Meyer) Date: Wed, 2 Nov 2011 16:13:49 -0700 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> Message-ID: On Wed, Nov 2, 2011 at 4:01 PM, Terry Reedy wrote: > On 11/1/2011 7:49 PM, Mike Meyer wrote: >> On Mon, Oct 31, 2011 at 11:55 PM, Terry >> Reedy wrote: >>> This is one of the hard problems that keep getting swept under the >>> rug while we do easier things. Well, we have overhauled unicode and >>> packaging for 3.3, so maybe concurrency can get some attention. >> Hey, it worked! > Yes. Coroutines, which are a form of in-thread concurrency, are another > 'under the rug' subject, which Greg has re-exposed. We need some possibly > initially off-the-wall ideas for both. I've been avoiding pointing out that connection, but that's one of the reasons I kept saying "thread of execution" instead of just "thread" - at least until recently. There are more ways to get concurrency than threading and processes. >>> Would it be helpful, for instance, to have truly immutable >>> restricted tuples and frozensets, whose __new__ methods only >>> allowed true immutables (None, booleans, numbers, strings, other >>> restricted tuples and frozensets) as members? >> Possibly. However, so long as the mutations they make don't change >> the externally visible behavior, then for the purposes of this >> discussion, they already are immutable. Or is it possible that >> concurrent updates of that not-externally-visible state could cause >> things to break? > It would seem to me that a list within a tuple or frozenset is just as > 'dangerous' as a list that is exposed directly. Correspondingly, a safe_list > that locked all appropriate operations should be equally safe within or > without. Right.
However, what needs to be made safe for concurrency here is the list, not the tuple. As you point out, that doesn't help for things like dict keys. This discussion was more about objects that compute and then cache a value based on their contents - hash values being one of them. >>> How about a metaclass, say 'immutable', that made the instances of >>> a user class truly immutable? (I don't know how to do this, but >>> let's assume it can be done -- perhaps with a new frozendict.) If >>> such were possible, instances of instances of such a metaclass >>> could be added to the list above. >> >> Well, on the basis that we're all adults, I'm willing to accept that >> a programmer saying "I want instances of this class to be immutable" >> means they'll only subvert whatever mechanism is used to do this >> when it's safe to do so (i.e. - "not externally visible"), so >> catching casual attempts - assignments to attributes - to do so will >> do, then we can do this by providing a __setattr__ method that always >> throws an exception. > I know about that, but there is no way for a safe_tuple class to know what a > __setattr__ method does. But my new idea here is just to call hash(). That > is not completely safe, but perhaps within 'consenting adults' territory. Presumably, there'd be a better way to ask that question. Something like "isinstance(object, Immutable)" where Immutable adds the appropriate __setattr__ method (though that has other problems). > (In other words, if someone adds a __hash__ method to a class with mutable > instances, puts such instances into a safe_tuple shared with multiple > threads, mutates from multiple threads, and has a problem, ... too bad. And > I hope you would never have to debug such ;-). Doesn't sound much worse than most concurrency bugs to me. They tend to crop up in places where it's "obvious" things aren't concurrent. Those have replaced stray pointer bugs as the nastiest bugs to deal with, at least for me. References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> <87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp> <87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp> <20111101225341.51801e1b@bhuda.mired.org> Message-ID: On 11/2/2011 1:53 AM, Mike Meyer wrote: > On Wed, 02 Nov 2011 10:45:42 +0900 > "Stephen J. Turnbull" wrote: >> But this is revealing: you say *your* goal is making *me* think. > > Only if I may wind up maintaining the code you wrote. But that's a > driving factor in a *lot* of the design decisions when it comes to > extending python. > >> That's what I consider un-Pythonic. > > I feel just the opposite. Python doesn't allow errors to silently > pass, or guess what the programmer wanted to do, or make inferences > about things - it raises exceptions. It is true that Python actually does a lot to protect programmers (from their own ignorance and foolishness -- but with ctypes wiping out all protections ;-). One easily overlooked example is the impossibility by design of buffer over-runs, a common security problem. But it does so without feeling like a straightjacket. Breaking a large fraction of existing code is too much of a sledgehammer approach. If Python had been designed from the beginning for multi-threading as the default, with single-threading as the exception, the case would be different. But even now, many are happy with single thread in multiple processes or multiple task objects within a single thread for concurrency. So burdening single thread code will not be popular.
-- Terry Jan Reedy From greg.ewing at canterbury.ac.nz Thu Nov 3 01:31:04 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 03 Nov 2011 13:31:04 +1300 Subject: [Python-ideas] Cofunctions - A New Protocol In-Reply-To: References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <4EAFC8EC.4080601@canterbury.ac.nz> Message-ID: <4EB1E0C8.5040707@canterbury.ac.nz> On 02/11/11 23:07, Matt Joiner wrote: > Also why CoExit when you have GeneratorExit? Might as well make it > CoroutineExit. The exact names of things are up for negotiation. -- Greg From greg.ewing at canterbury.ac.nz Thu Nov 3 01:40:23 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 03 Nov 2011 13:40:23 +1300 Subject: [Python-ideas] Cofunctions - A New Protocol In-Reply-To: <4EB12F5F.4080109@improva.dk> References: <4EA8BD66.6010807@canterbury.ac.nz> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <4EAFC8EC.4080601@canterbury.ac.nz> <4EB11370.8060205@canterbury.ac.nz> <4EB12F5F.4080109@improva.dk> Message-ID: <4EB1E2F7.2030404@canterbury.ac.nz> On 03/11/11 00:54, Jacob Holm wrote: > If the switch Nick describes is available as a flag on the frame > objects, it would be easy to extend the traceback to show *exactly* > where you entered the no-coroutine zone Yes, the same thing occurred to me shortly after posting that. > I don't think the additional syntax is helpful, It would help with auditing by making it possible to see whether the rules are being followed by statically examining the text, instead of having to wait for a run-time failure. However, that needs to be weighed against the two-versions problem, which I acknowledge is fairly serious. -- Greg From victor.stinner at haypocalc.com Thu Nov 3 01:44:33 2011 From: victor.stinner at haypocalc.com (Victor Stinner) Date: Thu, 3 Nov 2011 01:44:33 +0100 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <20111030001801.2f52ceb2@pitrou.net> References: <20111030001801.2f52ceb2@pitrou.net> Message-ID: <201111030144.33941.victor.stinner@haypocalc.com>

> >>> class C:
> ...     def f(): pass
> ...     class D:
> ...         def g(): pass
> ...
> >>> C.D.__qname__
> 'C.D'
> >>> C.D.g.__qname__
> 'C.D.g'
> >>> def f():
> ...     def g(): pass
> ...     return g
> ...
> >>> f().__qname__
> 'f.g'

Oh, +1 just for these examples. It would help debugging if the __repr__ methods of common objects were patched to use it.
Victor From greg.ewing at canterbury.ac.nz Thu Nov 3 02:47:41 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 03 Nov 2011 14:47:41 +1300 Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol In-Reply-To: <1320250872.14285.65.camel@Gutsy> References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> Message-ID: <4EB1F2BD.7070202@canterbury.ac.nz> On 03/11/11 05:21, Ron Adam wrote: > Would it be possible to rewrite the 'yield' internals so they work in > the following way...
>
>     # a = yield b
>     try:
>         raise SuspendException(b, _self=_self)
>     except ContinueException as exc:
>         a = exc.args
>
I've just been thinking about something like that, while pondering whether there is a middle ground somewhere between the yield-from mechanism and a completely new coroutine protocol. The problem I had with building all of it on yield-from was that there was no way to distinguish a 'yield' being used for the purpose of suspending a coroutine from one being used to return a value to a next() call. However, if something other than 'yield' is used for coroutine suspension -- such as a 'coyield' keyword or coyield() function -- then I think this problem becomes solvable. In a cogenerator (i.e. a generator running in coroutine mode), 'coyield' would do what 'yield' does in normal mode (simply suspend the frame), and 'yield(value)' would raise StopIteration(value). (If the latter seems unintuitive, I sympathise. It arises because we're effectively making a cocall to the __next__ method of the generator, and in the yield-from universe, the way a cocall returns a value is by raising StopIteration.) But now we need a different way for the cogenerator to signal that it has finished iterating! I think what will need to happen is that a cogenerator raises CoStopIteration instead of StopIteration when it falls off the end, and the cocaller of the cogenerator catches that and turns it into a normal StopIteration. Confused enough yet? I had better finish my cosandwich and get some more cocoffee before trying to think about this any more... -- Greg From ncoghlan at gmail.com Thu Nov 3 02:51:10 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 3 Nov 2011 11:51:10 +1000 Subject: [Python-ideas] Cofunctions - A New Protocol In-Reply-To: <4EB12F5F.4080109@improva.dk> References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <4EAFC8EC.4080601@canterbury.ac.nz> <4EB11370.8060205@canterbury.ac.nz> <4EB12F5F.4080109@improva.dk> Message-ID: On Wed, Nov 2, 2011 at 9:54 PM, Jacob Holm wrote: > If the switch Nick describes is available as a flag on the frame > objects, it would be easy to extend the traceback to show *exactly* > where you entered the no-coroutine zone that you are now failing to > suspend. I don't think the additional syntax is helpful, and it would > be quite annoying to need to have two versions of every wrapper > function/decorator to make it usable in both contexts.
Indeed, this is an error reporting problem, not something to be solved by maintaining the bifurcation of the language into "coroutine-friendly" and "normal". Again, I go back to the (lack of) contrast between code being run in the main process thread and code being run as part of a separate OS level thread. There, the "boundary" between the two is covered in the way the exception is reported:

    >>> f()
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "<stdin>", line 2, in f
    Exception
    >>> threading.Thread(target=f).start()
    >>> Exception in thread Thread-1:
    Traceback (most recent call last):
      File "/usr/lib64/python3.2/threading.py", line 736, in _bootstrap_inner
        self.run()
      File "/usr/lib64/python3.2/threading.py", line 689, in run
        self._target(*self._args, **self._kwargs)
      File "<stdin>", line 2, in f
    Exception

Heck, Thread + input Queue + output Queue may be an easier to understand conceptual model for coroutine programming than generators. While the internals would be completely different (since the whole point of coroutines is to avoid the OS level thread overhead), it may help to give people the correct mental model of what is going on.

    >>> def f():
    ...     print("Started")
    ...     data = cothread.suspend(1)
    ...     print("Resumed:", data)
    ...     return 42
    ...
    >>> cf, data = cothread.cocall(f)
    Started
    >>> data
    1
    >>> cf.resume()
    Resumed: None
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "cothread.py", line 80, in resume
        return self._wait_for_output()
      File "cothread.py", line 66, in _wait_for_output
        raise CoroutineReturn(data.args[0])
    cothread.CoroutineReturn: 42
    >>> cf, data = cothread.cocall(f)
    Started
    >>> cf.throw(Exception)
    Traceback (most recent call last):
      File "cothread.py", line 34, in run
        result = self._target(*self._args, **self._kwargs)
      File "<stdin>", line 3, in f
      File "cothread.py", line 6, in suspend
        return current.suspend(*args, **kwds)
      File "cothread.py", line 98, in suspend
        raise exc
    Exception

    The above exception was the direct cause of the following exception:

    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "cothread.py", line 89, in throw
        return self._wait_for_output()
      File "cothread.py", line 70, in _wait_for_output
        raise type(exc)(*exc.args) from exc
    Exception

-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia -------------- next part -------------- A non-text attachment was scrubbed... Name: cothread.py Type: text/x-python Size: 2982 bytes Desc: not available URL: From ncoghlan at gmail.com Thu Nov 3 03:31:34 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 3 Nov 2011 12:31:34 +1000 Subject: [Python-ideas] Cofunctions - A New Protocol In-Reply-To: References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <4EAFC8EC.4080601@canterbury.ac.nz> <4EB11370.8060205@canterbury.ac.nz> <4EB12F5F.4080109@improva.dk> Message-ID: On Thu, Nov 3, 2011 at 11:51 AM, Nick Coghlan wrote:

> >>> def f():
> ...     print("Started")
> ...     data = cothread.suspend(1)
> ...     print("Resumed:", data)
> ...     return 42
> ...
> >>> cf, data = cothread.cocall(f)
> Started

An easier-to-read copy of the cothread module that was attached to my previous email: https://bitbucket.org/ncoghlan/misc/src/default/cothread.py Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com |
Brisbane, Australia From stephen at xemacs.org Thu Nov 3 05:35:15 2011 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Thu, 03 Nov 2011 13:35:15 +0900 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> <87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp> <87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp> <20111101225341.51801e1b@bhuda.mired.org> <87aa8eq577.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: <87zkgdombg.fsf@uwakimon.sk.tsukuba.ac.jp> Mike Meyer writes: > No. You missed two things. Actually, I didn't. Containers contain; I considered that sufficiently obvious that I neglected to mention it. And I did mention globals as "exposed to concurrency". Both just expand the list of objects needing protection, but don't change the basic principle AFAICS. I point this out because I hope to convince you to concentrate on arguing *for* your proposal, instead of wasting time scoring points *against* those who question it. Please understand: *nobody* is against better support for developing concurrent programs, including those using threads to implement concurrency. But nobody is willing to pay arbitrarily high cost for arbitrarily small improvements. As for "wrapped C libraries", I'm having trouble imagining what you're talking about. Wrapped libraries won't be accessing data via Python protocols anyway, so any accesses that your framework could intercept will happen in the marshaling code. If that creates threads, I don't see why it wouldn't use threading.Thread. Ditto for Python modules implemented in C. > So this is a guess, but here's what I think is going on: > > 1) We're using a test framework that creates a mock time module for > some reason. At some point, the mock object has the value None. > 2) The application being tested uses a database module that uses > threads as part of managing a connection pool. > > The concurrency unsafe code in the test framework (which is clearly > and obviously single-threaded, right?) Again, I don't understand. In the von Neumann model of computation, all *code* is single-threaded pretty much by definition. With preemptively scheduled threads, the process executing that code is single-threaded if and only if it blocks until all other threads exit. The test framework is "obviously" multiple-threaded in the scenario you describe. > managed to get the None-valued mock inserted in sys.modules due to > a concurrency bug. I don't see your point. You claim your proposal is supposed to help identify which objects are shared. Taking that literally, this example is useless as an argument for your framework -- you already know which object is the problem. Of course, I suppose you actually meant to get more precise information in cases like this, specifically about in what execution contexts the object is mutated. But in this case, you probably already know precisely where in the code the offending mutation happens. The bug is that execution of the framework code resumes "unexpectedly" before the "subordinate" thread restores the problem object's state to status quo ante. Well, these are *threads*, so you can't say anything about when to expect them to resume, you have to expect resumption at any point. *Now* what does your framework buy you? If the answer is "nothing," you need to find better use cases to motivate your proposal.
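A minimal sketch of the kind of "resumption at any point" hazard under discussion -- deliberately unsafe code, using nothing but the standard library:

    import threading

    counter = 0

    def bump(n):
        global counter
        for _ in range(n):
            counter += 1   # read-modify-write: the thread can be suspended
                           # between the read and the write, losing updates

    threads = [threading.Thread(target=bump, args=(100000,)) for _ in range(4)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    print(counter)  # may well print less than 400000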
From ncoghlan at gmail.com Thu Nov 3 05:48:56 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 3 Nov 2011 14:48:56 +1000 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: <87zkgdombg.fsf@uwakimon.sk.tsukuba.ac.jp> References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> <87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp> <87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp> <20111101225341.51801e1b@bhuda.mired.org> <87aa8eq577.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkgdombg.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Thu, Nov 3, 2011 at 2:35 PM, Stephen J. Turnbull wrote: > Mike Meyer writes: > > managed to get the None-valued mock inserted in sys.modules due to > > a concurrency bug. > > I don't see your point. You claim your proposal is supposed to help > identify which objects are shared. Taking that literally, this > example is useless as an argument for your framework -- you already > know which object is the problem. Of course, I suppose you actually > meant to get more precise information in cases like this, specifically > about in what execution contexts the object is mutated. But in this > case, you probably already know precisely where in the code the > offending mutation happens. The bug is that execution of the > framework code resumes "unexpectedly" before the "subordinate" thread > restores the problem object's state to status quo ante. Well, these > are *threads*, so you can't say anything about when to expect them to > resume, you have to expect resumption at any point. This discussion is actually starting to sound a bit like the Python security model discussions to me - ensuring that only certain kinds of objects are reachable from a given point in the code, and controlling the operations that can be performed on them. Victor Stinner eventually managed to create his pysandbox module based on those discussions, which does a fairly good job of locking things down, even though I'd personally still be inclined to back it up with additional OS level defences. Still, like the GIL vs free threading discussions, there are unlimited numbers of things that *could* be done, the question is what is *worth* doing. And that's not going to be figured out in a mailing list discussion - it's going to take real code and plenty of experimentation, and it's going to have to be done by people that don't already believe the "right" answer is to use message queues and a "no concurrent access to mutable objects" architecture at the application level (i.e. where what threads buy you is the fact that you can hand over an object just by putting the reference in a queue and dropping yours, without any serialisation overhead). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com |
Brisbane, Australia From ron3200 at gmail.com Thu Nov 3 07:15:30 2011 From: ron3200 at gmail.com (Ron Adam) Date: Thu, 03 Nov 2011 01:15:30 -0500 Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol In-Reply-To: <4EB1F2BD.7070202@canterbury.ac.nz> References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> <4EB1F2BD.7070202@canterbury.ac.nz> Message-ID: <1320300930.10353.78.camel@Gutsy> On Thu, 2011-11-03 at 14:47 +1300, Greg Ewing wrote: > On 03/11/11 05:21, Ron Adam wrote: > > > Would it be possible to rewrite the 'yield' internals so they work in > > the following way...
> >
> >     # a = yield b
> >     try:
> >         raise SuspendException(b, _self=_self)
> >     except ContinueException as exc:
> >         a = exc.args
> >
> I've just been thinking about something like that, while pondering > whether there is a middle ground somewhere between the yield-from > mechanism and a completely new coroutine protocol. I was thinking if we could substitute an alternative spelling like that, then it gives us a bit more control over how to interact with other outer frameworks. If the internals can be done that way, then it may open up more options in python generators. So instead of a completely new coroutine protocol, we have better tools for others to create their own frameworks and protocols. > The problem I had with building all of it on yield-from was that > there was no way to distinguish a 'yield' being used for the purpose > of suspending a coroutine from one being used to return a value > to a next() call. Right. The obvious way is just to add a second of everything: next2(), .send2(), .throw2(), yield2. I don't think that is the best way. > However, if something other than 'yield' is used for coroutine > suspension -- such as a 'coyield' keyword or coyield() function -- > then I think this problem becomes solvable. In a cogenerator > (i.e. a generator running in coroutine mode), 'coyield' would > do what 'yield' does in normal mode (simply suspend the frame), > and 'yield(value)' would raise StopIteration(value). Well, it sounds reasonable, but how would that actually work? What if the coroutine is paused at coyield, and you need to do a next rather than a conext? And also in the case of it being the other way round. > (If the latter seems unintuitive, I sympathise. It arises because > we're effectively making a cocall to the __next__ method of the > generator, and in the yield-from universe, the way a cocall > returns a value is by raising StopIteration.) > But now we need a different way for the cogenerator to signal > that it has finished iterating! I think what will need to happen > is that a cogenerator raises CoStopIteration instead of > StopIteration when it falls off the end, and the cocaller of > the cogenerator catches that and turns it into a normal > StopIteration. > > Confused enough yet? I had better finish my cosandwich and get > some more cocoffee before trying to think about this any more... And that is the whole problem... trying to make this all un_coconfusing to the average python programmer. If it's coconfusing to us, they don't have a chance. ;-) Hmm... I have a craving for some hot co_co now. I've been poking around in genobject.c, frameobject.c, and ceval.c, to try to get a handle on just how it all fits together.
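One way to experiment with the two kinds of yield discussed above, without touching ceval.c, is to tag what a generator yields so a driver can tell a suspension apart from a produced value (all names here are hypothetical, just to make the distinction concrete):

    class Suspend:
        """Marker yielded by a coroutine that only wants to be resumed later."""

    def adder(a, b):
        yield Suspend()   # a pure suspension point
        yield a + b       # an ordinary value yield

    def run(gen):
        # Tiny driver: resume through suspensions, collect real values.
        results = []
        for item in gen:
            if isinstance(item, Suspend):
                continue  # just a suspension; resume on the next pass
            results.append(item)
        return results

    print(run(adder(2, 3)))   # prints [5]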
One of the odd things is that a throw() done before a generator is started raises an exception at the first line, where it has no chance to act on it. And it also doesn't propagate back out. So the first thing I'm trying (to help me learn the C code better/again) is to see if I can get it to ignore an exception thrown in at that state. And then see if I can make that specific to just g.throw(ContinueException). That way I don't have to pre_start the generators if I'm using throw(ContinueException) as my scheduler interface. It's a start. ;-) Cheers, Ron From stephen at xemacs.org Thu Nov 3 08:39:45 2011 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Thu, 03 Nov 2011 16:39:45 +0900 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> <87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp> <87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp> <20111101225341.51801e1b@bhuda.mired.org> <87aa8eq577.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkgdombg.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: <87vcr1odry.fsf@uwakimon.sk.tsukuba.ac.jp> Nick Coghlan writes: > This discussion is actually starting to sound a bit like the Python > security model discussions to me - ensuring that only certain kinds of > objects are reachable from a given point in the code, and controlling > the operations that can be performed on them. That's where I'm going, yes. I don't think that's what Mike has in mind, though, and I'd like to understand why he thinks it's insufficient. From bruce at leapyear.org Thu Nov 3 08:39:17 2011 From: bruce at leapyear.org (Bruce Leban) Date: Thu, 3 Nov 2011 00:39:17 -0700 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> Message-ID: Here are issues to think about. You wrote:

    'locking' value [',' value]* ':' suite

The list of values are the objects that can be mutated in this lock. An immutable object showing up in the list of values is a TypeError. However: (1) Greg Ewing gave an example sort of like this:

    new_balance = balance + deposit
    lock(balance)
    balance = new_balance
    unlock(balance)

and pointed out that the locks don't help. He was talking about the race condition on reading balance before writing. But it's worse than that. If these are numbers, then they're immutable and locking them does nothing and this isn't any better:

    lock(balance)
    new_balance = balance + deposit
    balance = new_balance
    unlock(balance)

Consider this operating on lists:

    locking stuff:  #1
        stuff += added_stuff
    locking stuff:  #2
        stuff = stuff + added_stuff

In #2, locking is completely useless. Sure the values are mutable but I'm not mutating them. Is it obvious that #1 works and #2 doesn't? You don't want to lock the *value* of stuff, you want to lock the *variable*, i.e., locking globals() or wherever the value of stuff is found. Locking globals() seems like a bad idea. Which brings me to... (2) How does locking something like a dictionary work? Does it lock the dictionary and all the values in it, walking the entire data structure to lock it? You suggest that's the case and that seems like a performance nightmare. Given

    x = [1, 2]
    d = {3: x, 4: 5}

How do I lock d[3]? That is, I want to lock the dictionary slot where the value of d[3] is stored so another thread can't do d[3] = 0. If I write

    locking d[3]: # do stuff

that's going to lock the value of x.
Another thread won't be able to do x.append(0) but they would be able to do x = 0 or d[3] = 0. If I have to write

    locking d: # do stuff

that hampers concurrency. If I can lock the dictionary slot d[3], can I lock list slots? After all, the compiler doesn't know the type of d. How do I lock just an attribute without locking the whole object? (3) What happens if one thread does:

    locking d, x: # stuff

and another does

    locking x, d: # stuff

I think I know the answer and it's "deadlock". Avoiding deadlock is an important problem and by forcing me to lock every single object before I mutate it the important locks (the ones for objects that are actually shared) will be lost among the many that are unnecessary, making it very difficult to find bad lock ordering. (4) What if the lock can't be acquired right away? Maybe there's other stuff my thread can do while it's waiting for the lock. You don't consider that. Maybe I have a whole bunch of things all of which can be done in any order. (5) You haven't thought about read vs. write locks. If I'm passed an iterable or a sequence in a concurrent program, I want to read lock it so no one else changes it while I'm working with it. But you prohibit locking immutable objects, so I first have to find out if it's immutable, which is something else you'll have to add to the language. On Mon, Oct 31, 2011 at 10:32 PM, Mike Meyer wrote: > I've identified the problem I want to solve: I want to make concurrent > use of python objects "safe by default", ... To me it looks like this proposal deals with a small part of the problem with the equivalent of a sledgehammer. When I said identify the problem, the above issues are more on the lines of what I was talking about, not a general statement "make concurrency better." But as an overall goal, I think this is better: "make it as easy as possible to write error-free concurrent code." I would think that "without making it hard to write non-concurrent code" is implicit but I'll spell that out since I've heard that explicit is better than implicit. And here are some things people writing concurrent programs want to do (in no particular order):

(1) lock an object (even when the object doesn't have any code to support locking)
(2) lock part of an object (a slot in list or dictionary, or an attribute) - consider that databases support row and range locking because if the only thing you can do is lock entire tables, you're going to get poor performance
(3) lock multiple things at the same time
(4) queue operations to be performed on an object when it can be locked, which requires ...
(5) wait until a queued operation completes, which requires ...
(6) avoid starvation
(7) avoid deadlock
(8) avoid other concurrency bugs
(9) avoid debugging (not avoid it when it's necessary but do things to avoid it being necessary)

> ... so that doing unsafe things > causes the programmer to have to do something explicit about making > things safe. I believe this can be done at the mutation points > (again, clojure shows that it can be done). I also want to preserve as > much of Python's existing code as possible. It may be that Python's > existing data structures mean my belief about mutation points is > wrong. This may be the wrong solution. It may be that such a change is > too large to be acceptable. But the only way to find out is to > investigate it. Clojure is very different from Python. Core data structures are immutable in Clojure, so adding to a list creates a new, also immutable, list.
Changing Python's core data structures to be immutable would be a bigger compatibility break than 2 to 3. --- Bruce -------------- next part -------------- An HTML attachment was scrubbed... URL: From ubershmekel at gmail.com Thu Nov 3 11:02:20 2011 From: ubershmekel at gmail.com (Yuval Greenfield) Date: Thu, 3 Nov 2011 12:02:20 +0200 Subject: [Python-ideas] Testing with numbers Message-ID: Working on algorithms that don't have just the boolean states of "success" or "failure", I find myself wanting to have regular unit tests adjacent to where I put my scalar result of how well the algorithm worked. E.g. the percentage of test samples that failed, an x minus y squared sum, etc. As I understand it unittest doesn't have the facilities for such test cases or results. One could use assertGreater to test for passing a threshold, but usually the acceptable numbers are rapidly changing. When running a test, you'd want to see the resulting number instead of an F/E/. result. Another use case is pypy speed benchmarks. Are there others here who would/could use this? --Yuval Greenfield From ubershmekel at gmail.com Thu Nov 3 14:10:23 2011 From: ubershmekel at gmail.com (Yuval Greenfield) Date: Thu, 3 Nov 2011 15:10:23 +0200 Subject: [Python-ideas] A concurrency survey of sorts In-Reply-To: References: Message-ID: turtle isn't thread safe. http://bugs.python.org/issue1702036 Also, here's just a random exception:

    Python 3.2 (r32:88445, Feb 20 2011, 21:29:02) [MSC v.1500 32 bit (Intel)] on win32
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import turtle
    >>> turtle.forward(10)
    >>> import turtle
    >>> from threading import Thread
    >>> class walker(Thread):
    ...     def run(self):
    ...         for i in range(100):
    ...             turtle.forward(10)
    ...             turtle.left(10)
    ...
    >>> [walker().start() for i in range(5)]
    [None, None, None, None, None]
    >>> Exception in thread Thread-2:
    Traceback (most recent call last):
      File "c:\python32\lib\threading.py", line 736, in _bootstrap_inner
        self.run()
      File "<stdin>", line 4, in run
      File "<string>", line 1, in forward
      File "c:\python32\lib\turtle.py", line 1637, in forward
        self._go(distance)
      File "c:\python32\lib\turtle.py", line 1605, in _go
        self._goto(ende)
      File "c:\python32\lib\turtle.py", line 3159, in _goto
        screen._pointlist(self.currentLineItem),
      File "c:\python32\lib\turtle.py", line 755, in _pointlist
        cl = self.cv.coords(item)
      File "<string>", line 1, in coords
      File "c:\python32\lib\tkinter\__init__.py", line 2162, in coords
        self.tk.call((self._w, 'coords') + args))]
      File "c:\python32\lib\tkinter\__init__.py", line 2160, in <listcomp>
        return [getdouble(x) for x in
    ValueError: could not convert string to float: 'itemconfigure'

On Wed, Nov 2, 2011 at 9:36 PM, Mike Meyer wrote: > In order to get a better idea of where things stand, I'd like to get > answers to a few questions. This isn't a traditional broadbased > survey, but an attempt to get answers from a few people who might know > or have good ideas. This is probably where I should have started, but > better late than never. > > 1) How much of the Python standard library is known to be thread safe? > > 2) How many packages in PyPI are known to be thread safe? > > 3) Can you suggest another approach to getting safe high-performance > shared data in concurrent operation? I've already considered: > > a) I proposed making actions that mutate data require locked objects, > because I've seen that work in other languages.
I recognize that > doesn't mean it will work in Python, but it's more than I can say > about the alternatives I knew about then. > > b) Bertrand Meyer's SCOOPS system, designed for Eiffel. It has two > major strikes against it: 1) it is based on type attributes on > *variables*, and I couldn't figure out how to translate that to a language > where variables aren't typed. 2) I don't know that there's a working > implementation. > > 4) Can you suggest a minor change that would move things toward safer > concurrent code with high-performance shared data? I can see two > possibilities: > > a) Audit any parts of the standard library that aren't already known > to be thread safe, and flag those that aren't. Fixing them may want > to wait on a better mechanism than posix locks. > > b) Add a high-level, high-performance shared object facility to the > multiprocess package. > > Thanks, > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From solipsis at pitrou.net Thu Nov 3 14:39:27 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Thu, 3 Nov 2011 14:39:27 +0100 Subject: [Python-ideas] A concurrency survey of sorts References: Message-ID: <20111103143927.3a9bfe0a@pitrou.net> On Wed, 2 Nov 2011 12:36:26 -0700 Mike Meyer wrote: > In order to get a better idea of where things stand, I'd like to get > answers to a few questions. This isn't a traditional broadbased > survey, but an attempt to get answers from a few people who might know > or have good ideas. This is probably where I should have started, but > better late than never. > > 1) How much of the Python standard library is known to be thread safe? It depends what the thread safety assumptions are. I'd say not much of it, but not much of it *needs* to either. For example, if you mutate the same XML tree concurrently, my opinion is that the problem is with your code, not the stdlib :-) (on the other hand, if mutating *different* XML trees concurrently produces errors, then it's a stdlib bug) Buffered binary file objects are known to be thread-safe. Text file objects are not, except perhaps for writing (I think we did the latter, because of print() and logging; I'm not sure it's well tested, though). Raw file objects are not (they are "raw" after all: they simply expose the OS' behaviour). As a separate issue, binary file objects forbid reentrant accesses from signal handlers. Therefore I would advocate against using print() in a signal handler. (see http://bugs.python.org/issue10478) > b) Add a high-level, high-performance shared object facility to the > multiprocess package. It will be difficult (IMHO: very difficult) to devise such a thing. multiprocessing already has shared memory facilities, though - but they are very low-level. Regards Antoine. From mwm at mired.org Thu Nov 3 17:38:26 2011 From: mwm at mired.org (Mike Meyer) Date: Thu, 3 Nov 2011 09:38:26 -0700 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> Message-ID: On Thu, Nov 3, 2011 at 12:39 AM, Bruce Leban wrote: > Here are issues to think about. You wrote:
>
>     'locking' value [',' value]* ':' suite
>
> The list of values are the objects that can be mutated in this lock. An > immutable object showing up in the list of values is a TypeError.
> > However: > (1) Greg Ewing gave an example sort of like this:
>
>     new_balance = balance + deposit
>     lock(balance)
>     balance = new_balance
>     unlock(balance)
>
> and pointed out that the locks don't help. Yup. You can't keep people from writing code that's just wrong. But they can't get it right if they don't add any locking at all.

>     lock(balance)
>     new_balance = balance + deposit
>     balance = new_balance
>     unlock(balance)

This case would still throw an exception, because what needs to be locked isn't balance, but whatever balance is an attribute of. Unless it's a local variable, in which case it doesn't need to be locked. Given the code as is, balance needing to be locked would make it global, so you'd lock the module. A more realistic implementation would be if balance were self.balance, in which case you'd lock self. > Consider this operating on lists:
>
>     locking stuff:  #1
>         stuff += added_stuff
>     locking stuff:  #2
>         stuff = stuff + added_stuff
>
> In #2, locking is completely useless. Sure the values are mutable but I'm > not mutating them. Is it obvious that #1 works and #2 doesn't? You don't > want to lock the *value* of stuff, you want to lock the *variable*, i.e., > locking globals() or wherever the value of stuff is found. Right. Possibly "value" was the wrong choice for a placeholder in the original description, because what you're actually locking is a container of some sort. > Locking globals() seems like a bad idea. Which brings me to... > (2) How does locking something like a dictionary work? Does it lock the > dictionary and all the values in it, walking the entire data structure to > lock it? You suggest that's the case and that seems like a performance > nightmare. If I suggested that, then it was unintentional. Yes, it's a performance nightmare. Locking a container just locks for changes to the set of contained objects. It's analogous to tuples being immutable: you can't change the set of objects in the tuple, but if those objects are mutable, you can change them. If you want to change an object in a list/dictionary/etc - then you have to lock that object, but not the dictionary. Given
>
>     x = [1, 2]
>     d = {3: x, 4: 5}
>
> How do I lock d[3]? That is, I want to lock the dictionary slot where the > value of d[3] is stored so another thread can't do d[3] = 0. If I write
>
>     locking d[3]: # do stuff
>
> that's going to lock the value of x. Another thread won't be able to do > x.append(0) but they would be able to do x = 0 or d[3] = 0. If I have to > write
>
>     locking d: # do stuff
>
> that hampers concurrency. Yes, it does. But so does any form of locking. This proposal locks objects. Changes that don't change the object - even if they might change its value by changing a contained object - aren't locked, because of the expense you pointed out. If there's some different level of granularity for locking that makes sense, I don't know what it is. > If I can lock the dictionary slot d[3], can I lock > list slots? After all, the compiler doesn't know the type of d. How do I > lock just an attribute without locking the whole object? Attributes are names. You don't lock names, you lock objects. To prevent binding of an attribute, you lock the object it's an attribute of. If you want to prevent mutating the value of the object bound to the attribute, you lock that object. > (3) What happens if one thread does:
>
>     locking d, x: # stuff
>
> and another does
>
>     locking x, d: # stuff
>
That one I dealt with in the specification.
There's an implementation requirement that if two locking statements share objects, the shared objects have to be locked in the same order. So one of the two will lock things in the opposite of the order they appear in the statement. > I think I know the answer and it's "deadlock". No, avoiding deadlock is one of the goals. That's why that requirement exists, and there's a further requirement that you can only nest locking statements if the inner one locks a subset of the outer one. I have as yet to work out how an automatic STM version (this wasn't in the original proposal) would interact here. References: Message-ID: On Wed, Nov 2, 2011 at 3:36 PM, Mike Meyer wrote: > 1) How much of the Python standard library is known to be thread safe? > > 2) How many packages in PyPI are known to be thread safe? "Thread safe" isn't nearly as well-defined as many people act, and certainly doesn't mean it's safe to use something with threads. When people try to use the very, very, very few things that are thread safe without their own synchronization, they almost always end up with buggy code. It's also worth noting that many of the most important concurrency-supporting packages in PyPI don't use multithreading at all. > 3) Can you suggest another approach to getting safe high-performance > shared data in concurrent operation? I've already considered: > > a) I proposed making actions that mutate data require locked objects, > because I've seen that work in other languages. I recognize that > doesn't mean it will work in Python, but it's more than I can say > about the alternatives I knew about then., I don't see how this is feasible or makes Python a better language. This would add complication that doesn't benefit lots of people, would slow down normal cases, and wouldn't solve the datasharing problem for important cases that aren't just sharing memory between threads. > b) Bertrand Meyer's SCOOPS system, designed for Eiffel. It has two > major strikes against it: 1) it is based on type attributes on > *variables*, andI could figure out how to translate that to a language > where variables aren't typed. 2) I don't know that there's a working > implementation. I don't mean to be rude, but I don't understand how this is an idea at all. We already have lot of tools for sharing data predictably among threads, concurrent tasks, processes, and machines: Queue.Queue, thread locks, callbacks, MPI, message queues, and databases to name a few. Each of these has disadvantages and most of these have advantages. > 4) Can you suggest a minor change that would move things toward safer > concurrent code with high-performance shared data? I can see two > possibilities: > > a) Audit any parts of the standard library that aren't already known > to be thread safe, and flag those that aren't. Fixing them may want > to wait on a better mechanism than posix locks. I am not convinced adding this at a language level is net good at all. Flagging things as "thread unsafe" is silly, as practically everything is thread unsafe. Flagging things as "thread safe" is seldom useful, because you should still be handling synchronization in your code. Creating locks on everything in the stdlib would make Python bigger, more complex, and slower and still not solve concurrency problems for users ? indeed, it could make them less apparent. And none of this goes to address concurrency that isn't based on multithreading, which is important and in many, many applications preferable. 
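The existing-tools point above is easy to make concrete. A minimal producer/consumer sketch using only the stdlib queue module (Queue.Queue on Python 2, queue.Queue on Python 3) keeps all shared state inside one object that is already synchronized, so no further locking is needed:

import threading
try:
    import queue            # Python 3
except ImportError:
    import Queue as queue   # Python 2

def worker(jobs, results):
    # Each worker owns the items it pops; the queues are the only
    # shared objects, and they synchronize themselves.
    while True:
        item = jobs.get()
        if item is None:    # sentinel: shut down
            break
        results.put(item * item)
        jobs.task_done()

jobs, results = queue.Queue(), queue.Queue()
threads = [threading.Thread(target=worker, args=(jobs, results))
           for _ in range(4)]
for t in threads:
    t.start()
for n in range(10):
    jobs.put(n)
for _ in threads:
    jobs.put(None)
for t in threads:
    t.join()
print(sorted(results.get() for _ in range(10)))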
> b) Add a high-level, high-performance shared object facility to the > multiprocess package. The multiprocessing module already provides means to pass data which are fairly implicit. Trying to encapsulate the shared state as a Python object would be even more troublesome. Mike From greg.ewing at canterbury.ac.nz Thu Nov 3 22:35:47 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 04 Nov 2011 10:35:47 +1300 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> Message-ID: <4EB30933.7090808@canterbury.ac.nz> Mike Meyer wrote: > This case would still throw an exception, because what needs to be > locked isn't balance, but whatever balance is an attribute of. All right, suppose we have an "account" object that holds the balance. Now we can do locking account: account.balance = account.balance + deposit That's okay. But what if there are two accounts involved, and we want to do account1.balance = account1.balance - amount account2.balance = account2.balance + amount These two operations need to be done atomically, but there is no single object to lock, so we have to lock both of them -- and then there is an opportunity for deadlock if two threads do this in different orders. > and there's a further requirement that you can only nest > locking statements if the inner one locks a subset of the outer one. Then it's *impossible* to solve the above problem, because neither of the objects needing to be locked is contained in the other. The notion of containment is not even well defined in general, because Python objects can form arbitrary graphs. -- Greg From mwm at mired.org Thu Nov 3 22:57:36 2011 From: mwm at mired.org (Mike Meyer) Date: Thu, 3 Nov 2011 14:57:36 -0700 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: <4EB30933.7090808@canterbury.ac.nz> References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <4EB30933.7090808@canterbury.ac.nz> Message-ID: On Thu, Nov 3, 2011 at 2:35 PM, Greg Ewing wrote: > Mike Meyer wrote: > >> This case would still throw an exception, because what needs to be >> locked isn't balance, but whatever balance is an attribute of. > > All right, suppose we have an "account" object that > holds the balance. Now we can do > > ? locking account: > ? ? ?account.balance = account.balance + deposit > > That's okay. But what if there are two accounts > involved, and we want to do > > ? account1.balance = account1.balance - amount > ? account2.balance = account2.balance + amount > > These two operations need to be done atomically, > but there is no single object to lock, so we have > to lock both of them -- and then there is an > opportunity for deadlock if two threads do this > in different orders. That's why the proposal specifies that locking statements determine the order of the locking, and require that any two locking statements lock all objects they have in common in the same order - at least during one run of the application. So the above is done by: locking account1, account: # stuff If a second locking statement does "locking account2, foobar, account1", then account1 and account2 will be locked in the same order by both statements. >> and there's a further requirement that you can only nest >> locking statements if the inner one locks a subset of the outer one. > Then it's *impossible* to solve the above problem, > because neither of the objects needing to be locked > is contained in the other. 
I wasn't clear enough. The sets I'm talking about are the objects being locked, not anything they may contained, in order to prevent deadlocks. So if you start by doing: locking account1, account2, foobar: and then later on - but with those locks still held - do locking account1, account2: things work fine (the second locking is a nop). However, if you do (under the same condition): locking account1, aardvark: you get an exception - the outer lock has to lock aardvark as well. And yes, the implications still need to be worked out. > The notion of containment is not even well defined > in general, because Python objects can form arbitrary > graphs Containment has a very specific meaning for this specification, in that an object only contains things it has a direct reference to. That should be spelled out, or maybe I need to use a better word. References: Message-ID: On 11/3/2011 6:02 AM, Yuval Greenfield wrote: > Working on algorithms that don't have just the boolean states of > "success" or "failure", I find myself wanting to have regular unit > tests adjacent to where I put my scalar result of how well the > algorithm worked. E.g. the percentage of test samples failed, an x > minus y squared sum, etc. If I were using unittest for such, where I did not want to make each test sample an individual test but wanted to report a collective result, I would report the percentage (or fraction) and test equality to 100 (or 1). The test would always fail (until it passed ;-), but the actual fraction would be printed, and could be compared to the current target. -- Terry Jan Reedy From bruce at leapyear.org Fri Nov 4 02:51:40 2011 From: bruce at leapyear.org (Bruce Leban) Date: Thu, 3 Nov 2011 18:51:40 -0700 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> Message-ID: On Thu, Nov 3, 2011 at 9:38 AM, Mike Meyer wrote: > Attributes are names. You don't lock names, you lock objects. To > prevent binding of an attribute, you lock the object it's an attribute > of. If you want to prevent mutating the value of the object bound to > the attribute, you lock that object. You conveniently ignore my comment about the importance of database row locking. :-) > > (3) What happens if one thread does: > > > > locking d, x: # stuff > > > > and another does > > > > locking x, d: # stuff > > That one I dealt with in the specification. There's an implementation > requirement that if two locking statements share objects, the shared > objects have to be locked in the same order. So one of the two will > lock things in the opposite of the order they appear in the statement. > You're handwaving. How would that work? There's no way to know at compile time what the correct order is. So locking just got a whole lot more complicated. But it's worse than complicated: it's unworkable. No, avoiding deadlock is one of the goals. That's why that requirement > exists, and there's a further requirement that you can only nest > locking statements if the inner one locks a subset of the outer one. > In other words, this code is not allowed: def f(): locking x: for y in x: locking y: y.append(1) And neither is this: def f(x): locking x: x.append(1) if it is called inside code that has something locked. Which virtually all code will do since it's required everywhere. So your proposal is that I must lock everything before I change it but I can't lock it unless it's already locked. That's ridiculous. 
You might as well have a single semaphore to lock everything. We're wasting our time. You said above "You don't lock names, you lock objects." Why not? Because that was your original plan and your mind is closed to other ideas? You want to ignore legitimate issues that aren't convenient to your proposal. I hate to say things like "this idea sucks," but sometimes that's the only way to put it. I think that this proposal is a terrible idea. It's pointless to argue about the details of something this bad. A healthy discussion about how to make concurrency better might be interesting but as long as all you want to do is defend your proposal, this thread isn't sparking useful discussion. Sorry to be so blunt but I don't see any other way to get the message across. --- Bruce -------------- next part -------------- An HTML attachment was scrubbed... URL: From python at mrabarnett.plus.com Fri Nov 4 03:23:02 2011 From: python at mrabarnett.plus.com (MRAB) Date: Fri, 04 Nov 2011 02:23:02 +0000 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> Message-ID: <4EB34C86.10404@mrabarnett.plus.com> On 04/11/2011 01:51, Bruce Leban wrote: > > On Thu, Nov 3, 2011 at 9:38 AM, Mike Meyer > wrote: > > Attributes are names. You don't lock names, you lock objects. To > prevent binding of an attribute, you lock the object it's an attribute > of. If you want to prevent mutating the value of the object bound to > the attribute, you lock that object. > > > You conveniently ignore my comment about the importance of database row > locking. :-) > > > (3) What happens if one thread does: > > > > locking d, x: # stuff > > > > and another does > > > > locking x, d: # stuff > > That one I dealt with in the specification. There's an implementation > requirement that if two locking statements share objects, the shared > objects have to be locked in the same order. So one of the two will > lock things in the opposite of the order they appear in the statement. > > > You're handwaving. How would that work? There's no way to know at > compile time what the correct order is. So locking just got a whole lot > more complicated. But it's worse than complicated: it's unworkable. > [snip] If they're threads running in the same address space in CPython then they could lock always in order of increasing address. In an implementation of Python which uses references they could lock always in increasing numeric value of reference (if that's possible). This would not, of course, protect against general nested locks. From jimjjewett at gmail.com Fri Nov 4 03:30:18 2011 From: jimjjewett at gmail.com (Jim Jewett) Date: Thu, 3 Nov 2011 22:30:18 -0400 Subject: [Python-ideas] A concurrency survey of sorts In-Reply-To: References: Message-ID: On Wed, Nov 2, 2011 at 3:36 PM, Mike Meyer wrote: > 1) How much of the Python standard library is known to be thread safe? None. Though our confidence in the threading library is fairly high (except when the underlying C library is broken). Not so long ago, there were a series of changes to the regression tests that boiled down getting rid of spurious failures caused by tests running serially, but in an unusual order. If that level of separation was still new, then finer-grained parallelism can't really be expected to work either. That said, test cases relied far more on global state than a typical module itself does, so problems are far more likely to occur in user code than in the library. 
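The kind of user-code problem described above is easy to reproduce. The classic lost-update race below needs nothing but a shared global and a few threads; the exact output varies with interpreter and timing:

import threading

counter = 0

def bump(n):
    global counter
    for _ in range(n):
        counter += 1   # read-modify-write: not atomic

threads = [threading.Thread(target=bump, args=(100000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
# Expected 400000; on many interpreters this prints less, because
# updates are lost when threads interleave between the read and the write.
print(counter)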
> ?a) I proposed making actions that mutate data require locked objects, > because I've seen that work in other languages. I recognize that > doesn't mean it will work in Python, but it's more than I can say > about the alternatives I knew about then., If you really want to do this, you should probably make the changes at the level of "object" (or "type") and inherit them everywhere. And it may simplify things to also change the memory allocation. There are a few projects for remote objects that already use a different memory model to enforce locking; you could start there. > ?b) Bertrand Meyer's SCOOPS system, designed for Eiffel. It has two > major strikes against it: 1) it is based on type attributes on > *variables*, andI could figure out how to translate that to a language > where variables aren't typed. Actually, that isn't so bad. Common Lisp doesn't normally type variables at the source code level, but (a) You can explicitly add typing information if you want to, and (b) The compiler can often infer types If you want this to mesh with python, the constraints are similar; not only does the locking and safety marking have to be unobtrusive, it probably has to be optional. And there is existing (if largely superseded by Pypy) work on type inference for variables. -jJ From stephen at xemacs.org Fri Nov 4 04:53:34 2011 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Fri, 04 Nov 2011 12:53:34 +0900 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: <4EB34C86.10404@mrabarnett.plus.com> References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <4EB34C86.10404@mrabarnett.plus.com> Message-ID: <87pqh8o85d.fsf@uwakimon.sk.tsukuba.ac.jp> MRAB writes: > On 04/11/2011 01:51, Bruce Leban wrote: > > > (3) What happens if one thread does: > > > > > > locking d, x: # stuff > > > > > > and another does > > > > > > locking x, d: # stuff > > > > That one I dealt with in the specification. There's an implementation > > requirement that if two locking statements share objects, the shared > > objects have to be locked in the same order. So one of the two will > > lock things in the opposite of the order they appear in the statement. > > > > You're handwaving. How would that work? > > If they're threads running in the same address space in CPython then > they could lock always in order of increasing address. OK, I think that works. > In an implementation of Python which uses references they could > lock always in increasing numeric value of reference (if that's > possible). That doesn't make sense to me; if it's not an address, how do you know it's unique (you know the object is unique, of course, but there may be other references)? If the reference is not unique, how do you know that some other thread hasn't locked it via a different reference? I think for this to be workable you would need to use something like a lock tick that would increase with each lock taken, and store that in the object. Then you would lock the objects in order of their ticks, with unlocked objects coming last. (You'd have to be careful about the wraparound, though, and that could double or triple the performance hit for incrementing the tick and determining lock order.) From jimjjewett at gmail.com Fri Nov 4 06:18:52 2011 From: jimjjewett at gmail.com (Jim Jewett) Date: Fri, 4 Nov 2011 01:18:52 -0400 Subject: [Python-ideas] Fwd: Concurrent safety? 
In-Reply-To: <87zkgdombg.fsf@uwakimon.sk.tsukuba.ac.jp> References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> <87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp> <87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp> <20111101225341.51801e1b@bhuda.mired.org> <87aa8eq577.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkgdombg.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Thu, Nov 3, 2011 at 12:35 AM, Stephen J. Turnbull wrote: > As for "wrapped C libraries", I'm having trouble imagining what you're > talking about. C code (currently) can create or modify python objects using a C pointer, instead of python access. That is the biggest barrier to a tracing (as opposed to reference-counting) garbage collector. It is also a problem for security sandboxes (though they can just ban C extensions). Most relevant here, C extensions can also bypass any locks or protections put in place for concurrency. -jJ From adam.jorgensen.za at gmail.com Fri Nov 4 08:03:11 2011 From: adam.jorgensen.za at gmail.com (Adam Jorgensen) Date: Fri, 4 Nov 2011 09:03:11 +0200 Subject: [Python-ideas] A concurrency survey of sorts In-Reply-To: References: Message-ID: Threads are unsafe, period. Personally, I think the threading packages should be removed from Python entirely. The GIL makes them pseudo-pointless in CPython anyway and the headaches arising from threading are very frustrating. Personally I would rather see an Actors library... On 4 November 2011 04:30, Jim Jewett wrote: > On Wed, Nov 2, 2011 at 3:36 PM, Mike Meyer wrote: > > > 1) How much of the Python standard library is known to be thread safe? > > None. Though our confidence in the threading library is fairly high > (except when the underlying C library is broken). > > Not so long ago, there were a series of changes to the regression > tests that boiled down getting rid of spurious failures caused by > tests running serially, but in an unusual order. If that level of > separation was still new, then finer-grained parallelism can't really > be expected to work either. > > That said, test cases relied far more on global state than a typical > module itself does, so problems are far more likely to occur in user > code than in the library. > > > > a) I proposed making actions that mutate data require locked objects, > > because I've seen that work in other languages. I recognize that > > doesn't mean it will work in Python, but it's more than I can say > > about the alternatives I knew about then., > > If you really want to do this, you should probably make the changes at > the level of "object" (or "type") and inherit them everywhere. And it > may simplify things to also change the memory allocation. > > There are a few projects for remote objects that already use a > different memory model to enforce locking; you could start there. > > > b) Bertrand Meyer's SCOOPS system, designed for Eiffel. It has two > > major strikes against it: 1) it is based on type attributes on > > *variables*, andI could figure out how to translate that to a language > > where variables aren't typed. > > Actually, that isn't so bad. Common Lisp doesn't normally type > variables at the source code level, but > (a) You can explicitly add typing information if you want to, and > (b) The compiler can often infer types > > If you want this to mesh with python, the constraints are similar; not > only does the locking and safety marking have to be unobtrusive, it > probably has to be optional. 
And there is existing (if largely > superseded by Pypy) work on type inference for variables. > > -jJ > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Fri Nov 4 08:41:42 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 4 Nov 2011 17:41:42 +1000 Subject: [Python-ideas] A concurrency survey of sorts In-Reply-To: References: Message-ID: On Fri, Nov 4, 2011 at 5:03 PM, Adam Jorgensen wrote: > The GIL makes them pseudo-pointless in CPython anyway and the headaches > arising from > threading are very frustrating. This is just plain false. Threads are still an excellent way to take a synchronous operation and make it asynchronous. Take a look at concurrent.futures in 3.2, which makes it trivial to take independent blocking tasks and run them in parallel. The *only* time the GIL causes problems is when you have CPU bound threads written in pure Python. That's only a fraction of all of the Python apps out there, many of which are either calling out to calculations in C or FORTRAN (scientific community, financial community) or else performing IO bound tasks (most everyone else with a network connection). People need to remember that *concurrency is a hard problem*. That's why we layer abstractions on top of it. The threading and multiprocessing modules are both fairly low level, so they offer lots of ways to shoot yourself in the foot, but also a lot of power and flexibility. The concurrent.futures model is a higher level abstraction that's much easier to get right. > Personally I would rather see an Actors library... And what is an actors library going to use as its concurrency mechanism if the threading and multiprocessing modules aren't there under the hood? Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From adam.jorgensen.za at gmail.com Fri Nov 4 08:53:17 2011 From: adam.jorgensen.za at gmail.com (Adam Jorgensen) Date: Fri, 4 Nov 2011 09:53:17 +0200 Subject: [Python-ideas] A concurrency survey of sorts In-Reply-To: References: Message-ID: On 4 November 2011 09:41, Nick Coghlan wrote: > On Fri, Nov 4, 2011 at 5:03 PM, Adam Jorgensen > wrote: > > The GIL makes them pseudo-pointless in CPython anyway and the headaches > > arising from > > threading are very frustrating. > > This is just plain false. Threads are still an excellent way to take a > synchronous operation and make it asynchronous. Take a look at > concurrent.futures in 3.2, which makes it trivial to take independent > blocking tasks and run them in parallel. The *only* time the GIL > causes problems is when you have CPU bound threads written in pure > Python. That's only a fraction of all of the Python apps out there, > many of which are either calling out to calculations in C or FORTRAN > (scientific community, financial community) or else performing IO > bound tasks (most everyone else with a network connection). > I would love to seem some actual stats on this? How many multi-threaded apps are hitting the GIL barrier, etc... Anyway, I consider myself refuted... > > People need to remember that *concurrency is a hard problem*. That's > why we layer abstractions on top of it. The threading and > multiprocessing modules are both fairly low level, so they offer lots > of ways to shoot yourself in the foot, but also a lot of power and > flexibility. 
> > The concurrent.futures model is a higher level abstraction that's much > easier to get right. > > > Personally I would rather see an Actors library... > > And what is an actors library going to use as its concurrency > mechanism if the threading and multiprocessing modules aren't there > under the hood? > I said nothing about removing the multi-processing module, although it would be nice if it spawned child processes didn't randomly zombify for no good reason. Regardless, I still think the GIL should be fixed or the threading module removed. It's disingenuous to have a threading module when it doesn't work as advertised due to an interpreter "feature" of dubious merit anyway. > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan_ml at behnel.de Fri Nov 4 10:16:26 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 04 Nov 2011 10:16:26 +0100 Subject: [Python-ideas] A concurrency survey of sorts In-Reply-To: References: Message-ID: Adam Jorgensen, 04.11.2011 08:53: > On 4 November 2011 09:41, Nick Coghlan wrote: >> On Fri, Nov 4, 2011 at 5:03 PM, Adam Jorgensen wrote: >>> The GIL makes them pseudo-pointless in CPython anyway and the headaches >>> arising from threading are very frustrating. >> >> This is just plain false. The first part, yes. The second - depends. Threading, especially when applied to the wrong task, is a very good way to give you headaches. >> Threads are still an excellent way to take a >> synchronous operation and make it asynchronous. Take a look at >> concurrent.futures in 3.2, which makes it trivial to take independent >> blocking tasks and run them in parallel. The *only* time the GIL >> causes problems is when you have CPU bound threads written in pure >> Python. That's only a fraction of all of the Python apps out there, >> many of which are either calling out to calculations in C or FORTRAN >> (scientific community, financial community) or else performing IO >> bound tasks (most everyone else with a network connection). > > I would love to seem some actual stats on this? How many multi-threaded > apps are hitting the GIL barrier, etc... In the numerics corner, multi-threaded CPU bound code is surprisingly common. In multi-server setups, people commonly employ MPI&friends, but especially on multi-core machines (usually less than 64 cores), threading is quite widely used. Proof? Cython just gained support for OpenMP based parallel loops, due to popular request. However, computational code usually doesn't hit the "GIL barrier", as you call it, because the really heavy computations don't run in the interpreter but straight on the iron. So, no, neither I/O bound tasks nor CPU bound numerics tasks get in conflict with the GIL, unless you do something wrong. That's the main theme, BTW. If you're a frequent reader of c.l.py, you'll quickly notice that those who complain most loudly about the GIL usually just do so because they do something wrong. Threading isn't the silver bullet that you shoot at the task at hand. It's *one* way to solve *some* kinds of concurrency problems, and certainly not a trivial one. Stefan From stephen at xemacs.org Fri Nov 4 10:36:08 2011 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Fri, 04 Nov 2011 18:36:08 +0900 Subject: [Python-ideas] Fwd: Concurrent safety? 
In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> <87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp> <87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp> <20111101225341.51801e1b@bhuda.mired.org> <87aa8eq577.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkgdombg.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: <87ipn0nsaf.fsf@uwakimon.sk.tsukuba.ac.jp> Jim Jewett writes: > On Thu, Nov 3, 2011 at 12:35 AM, Stephen J. Turnbull wrote: > > > As for "wrapped C libraries", I'm having trouble imagining what you're > > talking about. > > C code (currently) can create or modify python objects using a C > pointer, instead of python access. Right, and that means they complete bypass Mike's proposal, too. So his reference to them was a non sequitur in context, because AFAICS his proposal couldn't do anything more about them than mine could. Can it? From jimjjewett at gmail.com Fri Nov 4 15:38:29 2011 From: jimjjewett at gmail.com (Jim Jewett) Date: Fri, 4 Nov 2011 10:38:29 -0400 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: <87ipn0nsaf.fsf@uwakimon.sk.tsukuba.ac.jp> References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> <87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp> <87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp> <20111101225341.51801e1b@bhuda.mired.org> <87aa8eq577.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkgdombg.fsf@uwakimon.sk.tsukuba.ac.jp> <87ipn0nsaf.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Fri, Nov 4, 2011 at 5:36 AM, Stephen J. Turnbull wrote: > Jim Jewett writes: > ?> On Thu, Nov 3, 2011 at 12:35 AM, Stephen J. Turnbull wrote: > ?> > As for "wrapped C libraries", I'm having trouble imagining what you're > ?> > talking about. > ?> C code (currently) can create or modify python objects using a C > ?> pointer, instead of python access. > Right, and that means they complete bypass Mike's proposal, too. ?So > his reference to them was a non sequitur in context, because AFAICS > his proposal couldn't do anything more about them than mine could. > Can it? Not if it is just an additional compile-time restriction. If the implementation is prepared to enforce the restriction at run-time, that will almost certainly require some changes to memory allocation. If done right, that might also be able to lock out rogue C code, which should also allow a tracing GC, stronger security guarantees, and GIL removal. I don't personally see it happening without a currently unacceptable slowdown for all memory access, but if he is looking 5-10 years out, it is plausible. (I would still bet against it for mainstream CPython, but it is plausible.) -jJ From mwm at mired.org Fri Nov 4 17:46:28 2011 From: mwm at mired.org (Mike Meyer) Date: Fri, 4 Nov 2011 09:46:28 -0700 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <87ipn4p8z4.fsf@uwakimon.sk.tsukuba.ac.jp> <87ehxrpvl9.fsf@uwakimon.sk.tsukuba.ac.jp> <87bosvpa9l.fsf@uwakimon.sk.tsukuba.ac.jp> <20111101225341.51801e1b@bhuda.mired.org> <87aa8eq577.fsf@uwakimon.sk.tsukuba.ac.jp> <87zkgdombg.fsf@uwakimon.sk.tsukuba.ac.jp> <87ipn0nsaf.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Fri, Nov 4, 2011 at 7:38 AM, Jim Jewett wrote: > On Fri, Nov 4, 2011 at 5:36 AM, Stephen J. Turnbull wrote: >> Jim Jewett writes: >> ?> On Thu, Nov 3, 2011 at 12:35 AM, Stephen J. 
Turnbull wrote: > >> ?> > As for "wrapped C libraries", I'm having trouble imagining what you're >> ?> > talking about. > >> ?> C code (currently) can create or modify python objects using a C >> ?> pointer, instead of python access. > >> Right, and that means they complete bypass Mike's proposal, too. ?So >> his reference to them was a non sequitur in context, because AFAICS >> his proposal couldn't do anything more about them than mine could. > >> Can it? > > Not if it is just an additional compile-time restriction. Which is why it calls for raising exceptions at run-time. > If the implementation is prepared to enforce the restriction at > run-time, that will almost certainly require some changes to memory > allocation. ?If done right, that might also be able to lock out rogue > C code, which should also allow a tracing GC, stronger security > guarantees, and GIL removal. I'm not really interested in locking out rogue C code - or at least not malicious code. This is very much in the "we're all adults here" vein, in that if you want to go out of your way to defeat it, it won't stand in your way. > I don't personally see it happening without a currently unacceptable > slowdown for all memory access, but if he is looking 5-10 years out, > it is plausible. ?(I would still bet against it for mainstream > CPython, but it is plausible.) I'm just trying to get the ball rolling. It clearly won't be in Python 3, so I'm looking to at least Python 4. What I expect to come out of this is an information PEP summarizing the discussion, and possibly some code that might be useful now (i.e. - a new library, or an automated "concurrency safe" auditor, or ...). References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> Message-ID: On Thu, Nov 3, 2011 at 6:51 PM, Bruce Leban wrote: > > On Thu, Nov 3, 2011 at 9:38 AM, Mike Meyer wrote: >> >> Attributes are names. You don't lock names, you lock objects. To >> prevent binding of an attribute, you lock the object it's an attribute >> of. If you want to prevent mutating the value of the object bound to >> the attribute, you lock that object. > > You conveniently ignore my comment about the importance of database row > locking. :-) > I assure you it was unintentional, but there's been a lot of text flowing here. I believe I've answered every comment, though I know some of them were answered when first brought up, and not every time thereafter. And some of the answers were "Yes, that's an issue" or "Yes, but we can't keep people from writing bad code". I don't recall anything specifically about database row locking, but if you feel it's important enough to reiterate it, I'll treat it as equally important. >> >> > (3) What happens if one thread does: >> > >> > locking d, x: # stuff >> > >> > and another does >> > >> > locking x, d: # stuff >> >> That one I dealt with in the specification. There's an implementation >> requirement that if two locking statements share objects, the shared >> objects have to be locked in the same order. So one of the two will >> lock things in the opposite of the order they appear in the statement. > You're handwaving. How would that work? There's no way to know at compile > time what the correct order is. So locking just got a whole lot more > complicated. But it's worse than complicated: it's unworkable. How is a requirement just handwaving? There is no "correct" order. Given two objects, any locking statement will always lock them in the same order. That's sufficient to prevent deadlocks. 
Sorting the list passed to each value with a key of id will do the trick, but may not be the best way to do it. >> No, avoiding deadlock is one of the goals. That's why that requirement >> exists, and there's a further requirement that you can only nest >> locking statements if the inner one locks a subset of the outer one. > > In other words, this code is not allowed: > > def f(): > ? ? locking x: > ? ? ? ? for y in x: > ? ? ? ? ? ? locking y: > ? ? ? ? ? ? ? ? y.append(1) > > And neither is this: > > def f(x): > ? ? locking x: > ? ? ? ? x.append(1) > > if it is called inside code that has something locked. Which virtually all > code will do since it's required everywhere. So your proposal is that I must > lock everything before I change it but I can't lock it unless it's already > locked. That's ridiculous. You might as well have a single semaphore to lock > everything. Yup, it's pretty bad. For that reason - among others - the proposal has mutated. The original required listing objects, and either started an STM transaction, or did explicit locks (undecided). Someone pointed out that the STM variant couldn't deal with IO, causing me to note a change to the proposal wherein listing objecs caused explicit locking, and having an empty list started an STM transaction. This is not only easier to use, but solves several problems - including this one. The STM variant hasn't gotten a lot of attention. > We're wasting our time.?You said above "You don't lock names, you lock > objects." Why not? Because that was your original plan and your mind is > closed to other ideas? No, because I couldn't think of a way to make locking names work. You presented a problem with locking names. Since the proposal doesn't lock names, I figured I wasn't clear enough in the first place, and tried fix that. I've repeatedly said this wasn't meant to be a finished idea, and asked for alternatives. If you have a proposal that works by locking names, by all means tell us about it! > You want to ignore legitimate issues that aren't > convenient to your proposal. No, I don't. I want to collect them all. But I'm not perfect - sometimes two comments may look similar when they in fact aren't. > I hate to say things like "this idea sucks," > but sometimes that's the only way to put it. I think that this proposal is a > terrible idea. That may well be the case, and wouldn't particularly disappoint me. > A healthy discussion about how to make concurrency better might be > interesting but as long as all you want to do is defend your proposal, this > thread isn't sparking useful discussion I'm doing a lot more than defending the proposal. I'm tweaking this proposal to solve problems people point out, collecting ideas for alternative approaches, implementations, and tools that might help - which are what make this discussion interesting and useful. I do defend the proposal, especially when people point out issues that were dealt with in the original post, which isn't interesting, but the alternative (ignoring them) seems less useful. References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> Message-ID: On Fri, Nov 4, 2011 at 1:11 PM, Mike Meyer wrote: > On Thu, Nov 3, 2011 at 6:51 PM, Bruce Leban wrote: >> >> On Thu, Nov 3, 2011 at 9:38 AM, Mike Meyer wrote: >> We're wasting our time.?You said above "You don't lock names, you lock >> objects." Why not? Because that was your original plan and your mind is >> closed to other ideas? 
> No, because I couldn't think of a way to make locking names work. Replace dicts (or at least globals and attribute dicts) with something that enforces your policy -- whether that is by a mutex, a mutex per entry, returning only copies of the values, etc... -jJ From clockworksaint at gmail.com Fri Nov 4 18:57:11 2011 From: clockworksaint at gmail.com (Weeble) Date: Fri, 4 Nov 2011 17:57:11 +0000 Subject: [Python-ideas] A concurrency survey of sorts Message-ID: > Regardless, I still think the GIL should be fixed or the threading module > removed. It's disingenuous to have a threading module when it doesn't work > as advertised due to an interpreter "feature" of dubious merit anyway. Is this a serious proposal? Without threads, is there any reasonable cross-platform way to communicate interactively with multiple subprocesses over pipes? Windows doesn't have select. I don't *like* using threads, but as far as I can tell they're the only available way to do this. -------------- next part -------------- An HTML attachment was scrubbed... URL: From python at mrabarnett.plus.com Fri Nov 4 18:58:00 2011 From: python at mrabarnett.plus.com (MRAB) Date: Fri, 04 Nov 2011 17:58:00 +0000 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: <87pqh8o85d.fsf@uwakimon.sk.tsukuba.ac.jp> References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <4EB34C86.10404@mrabarnett.plus.com> <87pqh8o85d.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: <4EB427A8.8060704@mrabarnett.plus.com> On 04/11/2011 03:53, Stephen J. Turnbull wrote: > MRAB writes: > > On 04/11/2011 01:51, Bruce Leban wrote: > > > > > (3) What happens if one thread does: > > > > > > > > locking d, x: # stuff > > > > > > > > and another does > > > > > > > > locking x, d: # stuff > > > > > > That one I dealt with in the specification. There's an implementation > > > requirement that if two locking statements share objects, the shared > > > objects have to be locked in the same order. So one of the two will > > > lock things in the opposite of the order they appear in the statement. > > > > > > You're handwaving. How would that work? > > > > If they're threads running in the same address space in CPython then > > they could lock always in order of increasing address. > > OK, I think that works. > > > In an implementation of Python which uses references they could > > lock always in increasing numeric value of reference (if that's > > possible). > > That doesn't make sense to me; if it's not an address, how do you know > it's unique (you know the object is unique, of course, but there may > be other references)? If the reference is not unique, how do you know > that some other thread hasn't locked it via a different reference? > In CPython the implementation refers to an object by its address, but my understanding is that in an implementation which uses references the reference is actually an index into a table which contains the addresses of the objects, therefore you would be locking in order of increasing index. > I think for this to be workable you would need to use something like a > lock tick that would increase with each lock taken, and store that in > the object. Then you would lock the objects in order of their ticks, > with unlocked objects coming last. (You'd have to be careful about > the wraparound, though, and that could double or triple the > performance hit for incrementing the tick and determining lock order.) 
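The lock-tick idea quoted above can be sketched directly. The Lockable wrapper and helper names below are invented for illustration - a real design would hang the tick and lock off every object, not off a wrapper - and Python's unbounded integers conveniently sidestep the wraparound worry. Two call sites that name the same objects in different orders still acquire the shared locks in the same order:

import itertools
import threading

_ticket = itertools.count()
_ticket_guard = threading.Lock()

class Lockable:
    # Hypothetical wrapper carrying a per-object lock and tick.
    def __init__(self, value):
        self.value = value
        self.lock = threading.Lock()
        self.tick = None

    def ensure_tick(self):
        with _ticket_guard:
            if self.tick is None:
                self.tick = next(_ticket)

def lock_in_order(*objs):
    # Every call site acquires shared locks in tick order, so
    # "locking d, x" and "locking x, d" cannot deadlock each other.
    for o in objs:
        o.ensure_tick()
    ordered = sorted(objs, key=lambda o: o.tick)
    for o in ordered:
        o.lock.acquire()
    return ordered

def unlock(ordered):
    for o in reversed(ordered):
        o.lock.release()

# The two-account transfer from earlier in the thread:
a, b = Lockable(100), Lockable(100)
held = lock_in_order(b, a)   # acquisition order ignores call order
try:
    a.value -= 25
    b.value += 25
finally:
    unlock(held)

On CPython, id() (an address) would give the same kind of total order for free, which is MRAB's increasing-address variant of the same discipline.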
> From amauryfa at gmail.com Fri Nov 4 19:13:29 2011 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Fri, 4 Nov 2011 19:13:29 +0100 Subject: [Python-ideas] A concurrency survey of sorts In-Reply-To: References: Message-ID: 2011/11/4 Weeble > > Regardless, I still think the GIL should be fixed or the threading module > > removed. It's disingenuous to have a threading module when it doesn't > work > > as advertised due to an interpreter "feature" of dubious merit anyway. > > Is this a serious proposal? Without threads, is there any reasonable > cross-platform way to communicate interactively with multiple subprocesses > over pipes? Windows doesn't have select. I don't *like* using threads, but > as far as I can tell they're the only available way to do this. > Windows does have a select(), even if this only works for sockets, (you can use WaitForSingleObject for any win32 handle) and the Twisted reactor has a spawnProcess() function that works without any additional thread. -- Amaury Forgeot d'Arc -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwm at mired.org Fri Nov 4 19:20:49 2011 From: mwm at mired.org (Mike Meyer) Date: Fri, 4 Nov 2011 11:20:49 -0700 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> Message-ID: On Fri, Nov 4, 2011 at 10:49 AM, Jim Jewett wrote: > On Fri, Nov 4, 2011 at 1:11 PM, Mike Meyer wrote: >> On Thu, Nov 3, 2011 at 6:51 PM, Bruce Leban wrote: >>> >>> On Thu, Nov 3, 2011 at 9:38 AM, Mike Meyer wrote: >>> We're wasting our time.?You said above "You don't lock names, you lock >>> objects." Why not? Because that was your original plan and your mind is >>> closed to other ideas? > >> No, because I couldn't think of a way to make locking names work. > > Replace dicts (or at least globals and attribute dicts) with something > that enforces your policy -- whether that is by a mutex, a mutex per > entry, returning only copies of the values, etc... Sorry, I wasn't sufficiently detailed. I never worried about how to implement locking names, because I never could find a way to let people use it. I'd like fine-grained locking (i.e. - specific attributes of objects, the value at an index in a key or dictionary, etc.), you still need to be able to lock objects. They ways I came up with for specifying "here's a name we can't rebind" were all very complicated - among other things, what's a "name" is itself complicated - and not worth the gain you got over object-level locking. Especially if you have an STM option available. As always, if you can think of a way to get all this working together, please share! Thanks, References: Message-ID: > 2011/11/4 Weeble >> Is this a serious proposal? Without threads, is there any reasonable >> cross-platform way to communicate interactively with multiple subprocesses >> over pipes? Windows doesn't have select. I don't *like* using threads, but >> as far as I can tell they're the only available way to do this. On Fri, Nov 4, 2011 at 6:13 PM, Amaury Forgeot d'Arc wrote: > Windows does have a select(), even if this only works for sockets, > (you can use WaitForSingleObject for any win32 handle) > and the Twisted reactor has a spawnProcess() function that works without any > additional thread. select: Yes. WaitForSingleObject: Really? The documentation seems to say only on events, mutexes, semaphores, processes and threads. No mention of pipes or files. 
http://msdn.microsoft.com/en-us/library/bb202783.aspx Also, presumably it would be WaitForMultipleObjects? Regardless, I'm sure there's some way to do this. I just don't think it's obvious or easy enough to be considered a reasonable alternative to using threads. Twisted: That actually looks quite nice, but pulling in Twisted isn't always going to be a reasonable option. I just want to be sure - the idea of removing the threading module sounds ludicrous. It's not getting serious consideration, is it? From solipsis at pitrou.net Fri Nov 4 19:49:50 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Fri, 4 Nov 2011 19:49:50 +0100 Subject: [Python-ideas] A concurrency survey of sorts References: Message-ID: <20111104194950.7127bd0d@pitrou.net> On Fri, 4 Nov 2011 18:42:25 +0000 Weeble wrote: > > select: Yes. > > WaitForSingleObject: Really? The documentation seems to say only on > events, mutexes, semaphores, processes and threads. No mention of > pipes or files. http://msdn.microsoft.com/en-us/library/bb202783.aspx > Also, presumably it would be WaitForMultipleObjects? Regardless, I'm > sure there's some way to do this. I just don't think it's obvious or > easy enough to be considered a reasonable alternative to using > threads. It works with pipes for sure, it's used in multiprocessing (in 3.3). However, the pipes have to be set non-blocking, which implies using overlapped I/O (or another form of non-blocking Windows I/O) which is quite a pain. > I just want to be sure - the idea of removing the threading module > sounds ludicrous. It's not getting serious consideration, is it? No :-) Regards Antoine. From mwm at mired.org Fri Nov 4 20:02:06 2011 From: mwm at mired.org (Mike Meyer) Date: Fri, 4 Nov 2011 12:02:06 -0700 Subject: [Python-ideas] A concurrency survey of sorts In-Reply-To: References: Message-ID: On Fri, Nov 4, 2011 at 11:42 AM, Weeble wrote: > I just want to be sure - the idea of removing the threading module > sounds ludicrous. It's not getting serious consideration, is it? Removing the threading module - as part of a change that includes providing some other concurrent processing facility - seems like a reasonable enough idea that I'll be including it in the sequel. Rewriting it to use those facilities might be reasonable for backwards compatibility, but the new facilities should have enough advantages that new code would just use them. References: Message-ID: <20111104190936.GA12074@idyll.org> On Fri, Nov 04, 2011 at 12:02:06PM -0700, Mike Meyer wrote: > On Fri, Nov 4, 2011 at 11:42 AM, Weeble wrote: > > I just want to be sure - the idea of removing the threading module > > sounds ludicrous. It's not getting serious consideration, is it? > > Removing the threading module - as part of a change that includes > providing some other concurrent processing facility - seems like a > reasonable enough idea that I'll be including it in the sequel. > Rewriting it to use those facilities might be reasonable for backwards > compatibility, but the new facilities should have enough advantages > that new code would just use them. I obviously haven't been paying enough attention to this conversation, because this strikes me as a horrible idea. In particular, I write C extensions that release the GIL and scale just fine across multiple processors. Is the idea to force those kind of applications to use a new communications model? cheers, --titus -- C. 
Titus Brown, ctb at msu.edu

From mwm at mired.org  Fri Nov  4 20:20:55 2011
From: mwm at mired.org (Mike Meyer)
Date: Fri, 4 Nov 2011 12:20:55 -0700
Subject: [Python-ideas] A concurrency survey of sorts
In-Reply-To: <20111104190936.GA12074@idyll.org>
References: <20111104190936.GA12074@idyll.org>
Message-ID:

On Fri, Nov 4, 2011 at 12:09 PM, C. Titus Brown wrote:
> On Fri, Nov 04, 2011 at 12:02:06PM -0700, Mike Meyer wrote:
>> On Fri, Nov 4, 2011 at 11:42 AM, Weeble wrote:
>> > I just want to be sure - the idea of removing the threading module
>> > sounds ludicrous. It's not getting serious consideration, is it?
>>
>> Removing the threading module - as part of a change that includes
>> providing some other concurrent processing facility - seems like a
>> reasonable enough idea that I'll be including it in the sequel.
>> Rewriting it to use those facilities might be reasonable for backwards
>> compatibility, but the new facilities should have enough advantages
>> that new code would just use them.
>
> I obviously haven't been paying enough attention to this conversation,
> because this strikes me as a horrible idea. In particular, I write
> C extensions that release the GIL and scale just fine across multiple
> processors. Is the idea to force those kind of applications to use
> a new communications model?

That's one of the ideas. The goal is to get the Python processor to
help locate concurrency bugs. Since the threading module
communications model is "everything is shared by default", I'm pretty
sure it's going to break code that depends on it. But that's why it's
a proposal for Python 4 or something even further away.

From solipsis at pitrou.net  Fri Nov  4 20:38:42 2011
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Fri, 4 Nov 2011 20:38:42 +0100
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
References: <20111030001801.2f52ceb2@pitrou.net>
Message-ID: <20111104203842.7de86289@pitrou.net>

On Sun, 30 Oct 2011 09:52:15 +1000
Nick Coghlan wrote:
> > Example with nested functions
> > -----------------------------
> >
> >>>> def f():
> > ...   def g(): pass
> > ...   return g
> > ...
> >>>> f.__qname__
> > 'f'
> >>>> f().__qname__
> > 'f.g'
>
> For nested functions, I suggest adding something to the qname to
> directly indicate that the scope is hidden. Adding parentheses to the
> name of the outer function would probably work:
>
> f().g

I don't know, I find the "()" a bit too smart. I'd like Guido's
advice on the matter.

Regards

Antoine.

From guido at python.org  Fri Nov  4 21:49:23 2011
From: guido at python.org (Guido van Rossum)
Date: Fri, 4 Nov 2011 13:49:23 -0700
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
In-Reply-To: <20111104203842.7de86289@pitrou.net>
References: <20111030001801.2f52ceb2@pitrou.net>
 <20111104203842.7de86289@pitrou.net>
Message-ID:

On Fri, Nov 4, 2011 at 12:38 PM, Antoine Pitrou wrote:
> On Sun, 30 Oct 2011 09:52:15 +1000
> Nick Coghlan wrote:
>> > Example with nested functions
>> > -----------------------------
>> >
>> >>>> def f():
>> > ...   def g(): pass
>> > ...   return g
>> > ...
>> >>>> f.__qname__
>> > 'f'
>> >>>> f().__qname__
>> > 'f.g'
>>
>> For nested functions, I suggest adding something to the qname to
>> directly indicate that the scope is hidden. Adding parentheses to the
>> name of the outer function would probably work:
>>
>> f().g
>
> I don't know, I find the "()" a bit too smart. I'd like Guido's
> advice on the matter.

Hm. Both 'f.g' and 'f().g' for f().__qname__ are misleading, since both
look like valid expressions but neither will actually retrieve the
intended object. I'd be tempted to return something like '<locals of f>.g'
where 'f' would be f.__qname__. And yes, if f.__qname__ were
'<locals of xyzzy>.f', then f().__qname__ should be
'<locals of <locals of xyzzy>.f>.g'.
Alternatively perhaps I could live with 'f.<locals>.g' and
'xyzzy.<locals>.f.<locals>.g'.

In either case, the use of <...> makes it plain that this should not be
taken literally as an expression -- unlike the case of nested classes,
where generally 'C.D.E' works to access E, if you have class C containing
class D which contains class E.

--
--Guido van Rossum (python.org/~guido)

From solipsis at pitrou.net  Fri Nov  4 22:53:06 2011
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Fri, 4 Nov 2011 22:53:06 +0100
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
References: <20111030001801.2f52ceb2@pitrou.net>
 <20111104203842.7de86289@pitrou.net>
Message-ID: <20111104225306.44131581@pitrou.net>

On Fri, 4 Nov 2011 13:49:23 -0700
Guido van Rossum wrote:
>
> Hm. Both 'f.g' and 'f().g' for f().__qname__ are misleading, since both
> look like valid expressions but neither will actually retrieve the
> intended object.

Agreed.

> Alternatively perhaps I could live with 'f.<locals>.g' and
> 'xyzzy.<locals>.f.<locals>.g'.

Ah, thanks! I like this one.

Regards

Antoine.

From zuo at chopin.edu.pl  Sat Nov  5 00:15:23 2011
From: zuo at chopin.edu.pl (Jan Kaliszewski)
Date: Sat, 5 Nov 2011 00:15:23 +0100
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
In-Reply-To:
References: <20111030001801.2f52ceb2@pitrou.net>
 <20111104203842.7de86289@pitrou.net>
Message-ID: <20111104231523.GB2306@chopin.edu.pl>

Guido van Rossum dixit (2011-11-04, 13:49):

> On Fri, Nov 4, 2011 at 12:38 PM, Antoine Pitrou wrote:
> > On Sun, 30 Oct 2011 09:52:15 +1000
> > Nick Coghlan wrote:
[snip]
> > >> For nested functions, I suggest adding something to the qname to
> > >> directly indicate that the scope is hidden. Adding parentheses to the
> > >> name of the outer function would probably work:
> > >>
> > >> f().g
> > >
> > > I don't know, I find the "()" a bit too smart. I'd like Guido's
> > > advice on the matter.
> >
> > Hm. Both 'f.g' and 'f().g' for f().__qname__ are misleading, since both
> > look like valid expressions but neither will actually retrieve the
> > intended object.
> > I'd be tempted to return something like '<locals of f>.g' where 'f'
> > would be f.__qname__. And yes, if f.__qname__ were '<locals of xyzzy>.f',
> > then f().__qname__ should be '<locals of <locals of xyzzy>.f>.g'.
> >
> > Alternatively perhaps I could live with 'f.<locals>.g' and
> > 'xyzzy.<locals>.f.<locals>.g'.
>
> And what about:
>
> <f locals>.g
> and
> <<xyzzy locals>.f locals>.g

In the end I like the non-nested version better.

--
--Guido van Rossum (python.org/~guido)

From joshua.landau.ws at gmail.com  Sat Nov  5 02:43:40 2011
From: joshua.landau.ws at gmail.com (Joshua Landau)
Date: Sat, 5 Nov 2011 01:43:40 +0000
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
In-Reply-To:
References:
Message-ID:

First of all - hello python-ideas. I liked the look of the archives, so I
thought I'd chip in, if it's OK.

> And what about:
>>
>> <f locals>.g
>> and
>> <<xyzzy locals>.f locals>.g
>>
> In the end I like the non-nested version better.

xyzzy.. or, preferably, xyzzy..

It's non-nested, shows the locality and maps the locality to the variable.
If not showing function instances is disliked, maybe drop the period
between the function and "<locals>" from yours:

xyzzy.f.g or, again, xyzzy.f.g

I think this is just cleaner than:

xyzzy.<locals>.f.<locals>.g

because it doesn't imply that xyzzy has a single set of locals -
especially using <locals>.

From ron3200 at gmail.com  Sat Nov  5 05:52:59 2011
From: ron3200 at gmail.com (Ron Adam)
Date: Fri, 04 Nov 2011 23:52:59 -0500
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
In-Reply-To: <20111104231523.GB2306@chopin.edu.pl>
References: <20111030001801.2f52ceb2@pitrou.net>
 <20111104203842.7de86289@pitrou.net>
 <20111104231523.GB2306@chopin.edu.pl>
Message-ID: <1320468779.1911.39.camel@Gutsy>

On Sat, 2011-11-05 at 00:15 +0100, Jan Kaliszewski wrote:
> Guido van Rossum dixit (2011-11-04, 13:49):
>
> > On Fri, Nov 4, 2011 at 12:38 PM, Antoine Pitrou wrote:
> > > On Sun, 30 Oct 2011 09:52:15 +1000
> > > Nick Coghlan wrote:
[snip]
> > >> For nested functions, I suggest adding something to the qname to
> > >> directly indicate that the scope is hidden. Adding parentheses to the
> > >> name of the outer function would probably work:
> > >>
> > >> f().g
> > >
> > > I don't know, I find the "()" a bit too smart. I'd like Guido's
> > > advice on the matter.
> >
> > Hm. Both 'f.g' and 'f().g' for f().__qname__ are misleading, since both
> > look like valid expressions but neither will actually retrieve the
> > intended object. I'd be tempted to return something like
> > '<locals of f>.g' where 'f' would be f.__qname__. And yes, if
> > f.__qname__ were '<locals of xyzzy>.f', then f().__qname__ should be
> > '<locals of <locals of xyzzy>.f>.g'.

Too many dots I think.

> And what about:
>
> <f locals>.g
> and
> <<xyzzy locals>.f locals>.g

Better, but I don't think you need to nest it to indicate that
relationship. I would use a colon instead of a dot to indicate a
namespace relationship.

    f.g     g is an attribute of f
    f[g]    g is in the container f
    f:g     g is in the namespace f

So I think I'd one of... ::g

A slice is used in a completely different context.

Cheers,
Ron

From ncoghlan at gmail.com  Sat Nov  5 07:34:06 2011
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 5 Nov 2011 16:34:06 +1000
Subject: [Python-ideas] A concurrency survey of sorts
In-Reply-To:
References: <20111104190936.GA12074@idyll.org>
Message-ID:

On Sat, Nov 5, 2011 at 5:20 AM, Mike Meyer wrote:
> That's one of the ideas.
The goal is to get the Python processor to > help locate concurrency bugs. Since the threading module > communications model is "everything is shared by default", I'm pretty > sure it's going to break code that depends on it. But that's why it's > a proposal for Python 4 or something even further away. I'd suggest it's actually a proposal for a language that *isn't Python*. There's no way an idea like this would ever be accepted into Python itself without real world experimentation in another context. It's going to be hard to get anyone interested enough to seriously try it out, but I'd suggest going the "Python-based language" route (ala Boo and Cython) rather than claiming the idea is Python as such. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ncoghlan at gmail.com Sat Nov 5 07:36:29 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 5 Nov 2011 16:36:29 +1000 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <20111104225306.44131581@pitrou.net> References: <20111030001801.2f52ceb2@pitrou.net> <20111104203842.7de86289@pitrou.net> <20111104225306.44131581@pitrou.net> Message-ID: On Sat, Nov 5, 2011 at 7:53 AM, Antoine Pitrou wrote: > On Fri, 4 Nov 2011 13:49:23 -0700 > Guido van Rossum wrote: >> Alternatively perhaps I could live with 'f.<locals>.g' and >> 'xyzzy.<locals>.f.<locals>.g'. > > Ah, thanks! I like this one. Yep, I like that scheme, too. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From stephen at xemacs.org Sat Nov 5 14:05:09 2011 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Sat, 05 Nov 2011 22:05:09 +0900 Subject: [Python-ideas] Fwd: Concurrent safety? In-Reply-To: <4EB427A8.8060704@mrabarnett.plus.com> References: <20111030201143.481fdca2@bhuda.mired.org> <20111031223048.6e5d2798@bhuda.mired.org> <4EB34C86.10404@mrabarnett.plus.com> <87pqh8o85d.fsf@uwakimon.sk.tsukuba.ac.jp> <4EB427A8.8060704@mrabarnett.plus.com> Message-ID: <87fwi2oh2y.fsf@uwakimon.sk.tsukuba.ac.jp> MRAB writes: > my understanding is that in an implementation which uses references the > reference is actually an index into a table which contains the > addresses of the objects, therefore you would be locking in order of > increasing index. Ah, OK. So a reference is actually an object in the sense of an identifiable region of memory, but it's not a Python object. When I see "reference" I think of C++ references or git refs, which are just names, and an object can have many references. But the scheme you describe is like a C stdio file descriptor, and presumably is an indirection which allows relocating Python objects in memory. You can even relocate the table in such a scheme. From ram.rachum at gmail.com Sat Nov 5 15:07:22 2011 From: ram.rachum at gmail.com (Ram Rachum) Date: Sat, 5 Nov 2011 07:07:22 -0700 (PDT) Subject: [Python-ideas] Need way to check if a `datetime.timedelta` is positive Message-ID: <28750278.2158.1320502042916.JavaMail.geo-discussion-forums@yqhd1> (Sorry for possible double-post, Google Groups was doing weird things.) Hey, I'm currently writing code that's manipulating `datetime.timedelta` objects. I get one and I need to check whether it's positive or negative. I can't really think of an elegant way to do this. Here's one: my_timedelta >= datetime.timedelta(0) Another one: my_timedelta.total_seconds() >= 0 Perhaps we should allow a more elegant way? Possibly `my_timedelta >= 0` for 0 only? Thanks, Ram.
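For concreteness, a minimal sketch comparing the checks being discussed (the name `td` is just illustrative); timedelta normalizes its fields on construction so that the sign always ends up in `days`:

    import datetime

    td = datetime.timedelta(microseconds=-1)
    # normalized internally to days=-1, seconds=86399, microseconds=999999
    print(td >= datetime.timedelta(0))  # False
    print(td.total_seconds() >= 0)      # False (total_seconds() is -1e-06)
    print(td.days >= 0)                 # False -- the sign lives in .days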
-------------- next part -------------- An HTML attachment was scrubbed... URL: From arnodel at gmail.com Sat Nov 5 15:22:29 2011 From: arnodel at gmail.com (Arnaud Delobelle) Date: Sat, 5 Nov 2011 14:22:29 +0000 Subject: [Python-ideas] Need way to check if a `datetime.timedelta` is positive In-Reply-To: <28750278.2158.1320502042916.JavaMail.geo-discussion-forums@yqhd1> References: <28750278.2158.1320502042916.JavaMail.geo-discussion-forums@yqhd1> Message-ID: On 5 November 2011 14:07, Ram Rachum wrote: > (Sorry for possible double-post, Google Groups was doing weird things.) > Hey, > I'm currently writing code that's manipulating `datetime.timedelta` objects. > I get one and I need to check whether it's positive or negative. > I can't really think of an elegant way to do this. > Here's one: >     my_timedelta >= datetime.timedelta(0) > Another one: >     my_timedelta.total_seconds() >= 0 > > Perhaps we should allow a more elegant way? Possibly `my_timedelta >= 0` for > 0 only? > > Thanks, > Ram. > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > > (That's more of a question for c.l.py) Simpler: my_timedelta.days >= 0 -- Arnaud From ram.rachum at gmail.com Sat Nov 5 15:26:00 2011 From: ram.rachum at gmail.com (cool-RR) Date: Sat, 5 Nov 2011 16:26:00 +0200 Subject: [Python-ideas] Need way to check if a `datetime.timedelta` is positive In-Reply-To: References: <28750278.2158.1320502042916.JavaMail.geo-discussion-forums@yqhd1> Message-ID: On Sat, Nov 5, 2011 at 4:22 PM, Arnaud Delobelle wrote: > On 5 November 2011 14:07, Ram Rachum wrote: > > (Sorry for possible double-post, Google Groups was doing weird things.) > > Hey, > > I'm currently writing code that's manipulating `datetime.timedelta` > objects. > > I get one and I need to check whether it's positive or negative. > > I can't really think of an elegant way to do this. > > Here's one: > > my_timedelta >= datetime.timedelta(0) > > Another one: > > my_timedelta.total_seconds() >= 0 > > > > Perhaps we should allow a more elegant way? Possibly `my_timedelta >= 0` > for > > 0 only? > > > > Thanks, > > Ram. > > _______________________________________________ > > Python-ideas mailing list > > Python-ideas at python.org > > http://mail.python.org/mailman/listinfo/python-ideas > > > > > > (That's more of a question for c.l.py) > > Simpler: > > my_timedelta.days >= 0 > > -- > Arnaud > Shorter, yeah, but still weird. I'm making a calculation involving microseconds and suddenly I deal with days? Pretty weird. -- Sincerely, Ram Rachum -------------- next part -------------- An HTML attachment was scrubbed... URL: From merwok at netwok.org Sat Nov 5 17:32:58 2011 From: merwok at netwok.org (Éric Araujo) Date: Sat, 05 Nov 2011 17:32:58 +0100 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <20111030001801.2f52ceb2@pitrou.net> References: <20111030001801.2f52ceb2@pitrou.net> Message-ID: <4EB5653A.4000302@netwok.org> Hi, > I would like to propose the following PEP for discussion and, if > possible, acceptance. I think the proposal shouldn't be too > controversial (I find it quite simple and straightforward myself :-)). > [snip PEP] +1. For nested functions, I too think that 'f.<locals>.g' has too many dots; I like '<locals of f>.g' or '<f locals>.g'.
polka-dots-ly yours From ericsnowcurrently at gmail.com Sat Nov 5 18:04:47 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Sat, 5 Nov 2011 11:04:47 -0600 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <4EB5653A.4000302@netwok.org> References: <20111030001801.2f52ceb2@pitrou.net> <4EB5653A.4000302@netwok.org> Message-ID: On Sat, Nov 5, 2011 at 10:32 AM, Éric Araujo wrote: > Hi, > >> I would like to propose the following PEP for discussion and, if >> possible, acceptance. I think the proposal shouldn't be too >> controversial (I find it quite simple and straightforward myself :-)). >> [snip PEP] > > +1. > > For nested functions, I too think that 'f.<locals>.g' has too many dots; > I like '<locals of f>.g' or '<f locals>.g'. I like it too but don't think it's too many dots. The function from which the locals came _could_ be rolled into the brackets. However, in the context of some object (like the class X to which f belongs), 'X.f.<locals>.g' makes more sense in that case than 'X.<f locals>.g', since the locals is related to f and not X. But, then the f is sort of redundant, so you go back to 'X.f.<locals>.g', and '<locals>' is still sort of unambiguous. The disconnect is that <locals> is an externally anonymous namespace resulting from a call, rather than bound to any external namespace (like an object). Perhaps it would be appropriate to use 'X.f().<locals>.g' to make that clear. -eric > > polka-dots-ly yours > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From ron3200 at gmail.com Sat Nov 5 19:01:58 2011 From: ron3200 at gmail.com (Ron Adam) Date: Sat, 05 Nov 2011 13:01:58 -0500 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: References: <20111030001801.2f52ceb2@pitrou.net> <4EB5653A.4000302@netwok.org> Message-ID: <1320516118.14236.11.camel@Gutsy> On Sat, 2011-11-05 at 11:04 -0600, Eric Snow wrote: > On Sat, Nov 5, 2011 at 10:32 AM, Éric Araujo wrote: > > Hi, > > > >> I would like to propose the following PEP for discussion and, if > >> possible, acceptance. I think the proposal shouldn't be too > >> controversial (I find it quite simple and straightforward myself :-)). > >> [snip PEP] > > > > +1. > > > > For nested functions, I too think that 'f.<locals>.g' has too many dots; > > I like '<locals of f>.g' or '<f locals>.g'. > > I like it too but don't think it's too many dots. > > The function from which the locals came _could_ be rolled into the > brackets. However, in the context of some object (like the class X to > which f belongs), 'X.f.<locals>.g' makes more sense in that case > than 'X.<f locals>.g', since the locals is related to f and not X. > But, then the f is sort of redundant, so you go back to > 'X.f.<locals>.g', and '<locals>' is still sort of unambiguous. > > The disconnect is that <locals> is an externally anonymous namespace > resulting from a call, rather than bound to any external namespace > (like an object). Perhaps it would be appropriate to use > 'X.f().<locals>.g' to make that clear. I think if you consider locals in f as an implementation detail of f's name space rather than a sub item of f, it's not as confusing. It's better to think of locals as being part of f, rather than in f. That is why <f locals> makes more sense than f.<locals>. For example locals is in f's frame object, so if you follow that reasoning you get f.<frame>.<locals>, but I don't think we need all that. Hmmm... I think it actually should be spelled... f.<g> Following a pattern of... x object x x.f f in object x x.f.<g> local g in f in x That's both clear and concise.
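To make the competing spellings concrete, here is a minimal sketch of the nesting being named; the comments assume the flat 'f.<locals>.g' style Guido floated earlier in the thread (no such attribute existed yet at the time):

    class X:
        def f(self):        # qualified name: 'X.f'
            def g():        # qualified name: 'X.f.<locals>.g'
                pass
            return g

    def f():
        def g():            # qualified name: 'f.<locals>.g'
            pass
        return g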
Cheers, Ron From tjreedy at udel.edu Sat Nov 5 19:51:34 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Sat, 05 Nov 2011 14:51:34 -0400 Subject: [Python-ideas] Need way to check if a `datetime.timedelta` is positive In-Reply-To: <28750278.2158.1320502042916.JavaMail.geo-discussion-forums@yqhd1> References: <28750278.2158.1320502042916.JavaMail.geo-discussion-forums@yqhd1> Message-ID: On 11/5/2011 10:07 AM, Ram Rachum wrote: > my_timedelta >= datetime.timedelta(0) IMHO, this *is* the proper way. If doing repeatedly, put TD0 = datetime.timedelta(0) at the top. -- Terry Jan Reedy From ncoghlan at gmail.com Sat Nov 5 23:31:17 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 6 Nov 2011 08:31:17 +1000 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <1320516118.14236.11.camel@Gutsy> References: <20111030001801.2f52ceb2@pitrou.net> <4EB5653A.4000302@netwok.org> <1320516118.14236.11.camel@Gutsy> Message-ID: I think you're all overthinking this. We want a notation that: 1. Makes it clear attribute access won't work 2. Indicates where the nested definition really lives 3. Gives a reasonable error message if you do try to evaluate it "f.<locals>.g" ticks all those boxes, so it's fine. -- Nick Coghlan (via Gmail on Android, so likely to be more terse than usual) On Nov 6, 2011 4:02 AM, "Ron Adam" wrote: > On Sat, 2011-11-05 at 11:04 -0600, Eric Snow wrote: > > On Sat, Nov 5, 2011 at 10:32 AM, Éric Araujo wrote: > > > Hi, > > > > > >> I would like to propose the following PEP for discussion and, if > > >> possible, acceptance. I think the proposal shouldn't be too > > >> controversial (I find it quite simple and straightforward myself :-)). > > >> [snip PEP] > > > > > > +1. > > > > > > For nested functions, I too think that 'f.<locals>.g' has too many > dots; > > > I like '<locals of f>.g' or '<f locals>.g'. > > > > I like it too but don't think it's too many dots. > > > > The function from which the locals came _could_ be rolled into the > > brackets. However, in the context of some object (like the class X to > > which f belongs), 'X.f.<locals>.g' makes more sense in that case > > than 'X.<f locals>.g', since the locals is related to f and not X. > > But, then the f is sort of redundant, so you go back to > > 'X.f.<locals>.g', and '<locals>' is still sort of unambiguous. > > > > The disconnect is that <locals> is an externally anonymous namespace > > resulting from a call, rather than bound to any external namespace > > (like an object). Perhaps it would be appropriate to use > > 'X.f().<locals>.g' to make that clear. > > I think if you consider locals in f as an implementation detail of f's > name space rather than a sub item of f, it's not as confusing. It's > better to think of locals as being part of f, rather than in f. That is > why <f locals> makes more sense than f.<locals>. For example locals is > in f's frame object, so if you follow that reasoning you get > f.<frame>.<locals>, but I don't think we need all that. > > Hmmm... I think it actually should be spelled... > > f.<g> > > Following a pattern of... > > x object x > x.f f in object x > x.f.<g> local g in f in x > > That's both clear and concise. > > Cheers, > Ron > > > > > > > > > > > > > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed...
URL: From greg.ewing at canterbury.ac.nz Sun Nov 6 01:09:39 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sun, 06 Nov 2011 13:09:39 +1300 Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol In-Reply-To: <1320300930.10353.78.camel@Gutsy> References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> <4EB1F2BD.7070202@canterbury.ac.nz> <1320300930.10353.78.camel@Gutsy> Message-ID: <4EB5D043.8050106@canterbury.ac.nz> Ron Adam wrote: > On Thu, 2011-11-03 at 14:47 +1300, Greg Ewing wrote: > >>However, if something other than 'yield' is used for coroutine >>suspension -- such as a 'coyield' keyword or coyield() function -- >>then I think this problem becomes solvable. In a cogenerator >>(i.e. a generator running in coroutine mode), 'coyield' would >>do what 'yield' does in normal mode (simply suspend the frame), >>and 'yield(value)' would raise StopIteration(value). > > Well it sounds reasonable, but how would that actually work? What if > the coroutine is paused at coyield, and you need to do a next rather > than a conext? That situation shouldn't occur, because if a generator is suspended at a coyield, it's already in the middle of one next() call, and you shouldn't be trying to start another one until the first one is finished. If you try, you should get an exception, just as happens now if you try to invoke a generator's next() method reentrantly: Python 2.7 (r27:82500, Oct 15 2010, 21:14:33) [GCC 4.2.1 (Apple Inc. build 5664)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> def g(): ... G.next() ... yield ... >>> G = g() >>> G.next() Traceback (most recent call last): File "<stdin>", line 1, in <module> File "<stdin>", line 2, in g ValueError: generator already executing > And that is the whole problem... trying to make this all un_coconfusing > to the average python programmer. If it's coconfusing to us, they don't > have a chance. ;-) Yes, I'm leaning back towards a completely separate protocol now. We seem to need a number of new protocol features in any case, and allowing the two protocols to overlap is just creating needless confusion, I think. -- Greg From guido at python.org Sun Nov 6 02:37:14 2011 From: guido at python.org (Guido van Rossum) Date: Sat, 5 Nov 2011 18:37:14 -0700 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: References: <20111030001801.2f52ceb2@pitrou.net> <4EB5653A.4000302@netwok.org> <1320516118.14236.11.camel@Gutsy> Message-ID: On Sat, Nov 5, 2011 at 3:31 PM, Nick Coghlan wrote: > I think you're all overthinking this. We want a notation that: > 1. Makes it clear attribute access won't work > 2. Indicates where the nested definition really lives > 3. Gives a reasonable error message if you do try to evaluate it > > "f.<locals>.g" ticks all those boxes, so it's fine. > +1. Let's stop the bikeshedding. -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed...
URL: From mwm at mired.org Sun Nov 6 06:03:26 2011 From: mwm at mired.org (Mike Meyer) Date: Sat, 5 Nov 2011 22:03:26 -0700 Subject: [Python-ideas] A concurrency survey of sorts In-Reply-To: References: <20111104190936.GA12074@idyll.org> Message-ID: <20111105220326.2f937942@bhuda.mired.org> On Sat, 5 Nov 2011 16:34:06 +1000 Nick Coghlan wrote: > On Sat, Nov 5, 2011 at 5:20 AM, Mike Meyer wrote: > > That's one of the ideas. The goal is to get the Python processor to > > help locate concurrency bugs. Since the threading module > > communications model is "everything is shared by default", I'm pretty > > sure it's going to break code that depends on it. But that's why it's > > a proposal for Python 4 or something even further away. > I'd suggest it's actually a proposal for a language that *isn't > Python*. There's no way an idea like this would ever be accepted into > Python itself without real world experimentation in another context. > It's going to be hard to get anyone interested enough to seriously try > it out, but I'd suggest going the "Python-based language" route (ala > Boo and Cython) rather than claiming the idea is Python as such. That's been obvious from the start. But this does raise an interesting question: assuming that this change could be made in a way that the effect on most single-threaded code is minor (which is assuming a *lot*), is the fact that it will break almost all concurrent code enough to mean that it's really for a language other than Python? Thanks, http://www.mired.org/ Independent Software developer/SCM consultant, email for more information. O< ascii ribbon campaign - stop html mail - www.asciiribbon.org From ron3200 at gmail.com Sun Nov 6 08:20:31 2011 From: ron3200 at gmail.com (Ron Adam) Date: Sun, 06 Nov 2011 01:20:31 -0600 Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol In-Reply-To: <4EB5D043.8050106@canterbury.ac.nz> References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> <4EB1F2BD.7070202@canterbury.ac.nz> <1320300930.10353.78.camel@Gutsy> <4EB5D043.8050106@canterbury.ac.nz> Message-ID: <1320564031.30197.100.camel@Gutsy> On Sun, 2011-11-06 at 13:09 +1300, Greg Ewing wrote: > Ron Adam wrote: > > On Thu, 2011-11-03 at 14:47 +1300, Greg Ewing wrote: > > > >>However, if something other than 'yield' is used for coroutine > >>suspension -- such as a 'coyield' keyword or coyield() function -- > >>then I think this problem becomes solvable. In a cogenerator > >>(i.e. a generator running in coroutine mode), 'coyield' would > >>do what 'yield' does in normal mode (simply suspend the frame), > >>and 'yield(value)' would raise StopIteration(value). > > > > Well it sounds reasonable, but how would that actually work? What if > > the coroutine is paused at coyield, and you need to do a next rather > > than a conext? > > That situation shouldn't occur, because if a generator is > suspended at a coyield, it's already in the middle of one > next() call, and you shouldn't be trying to start another > one until the first one is finished. Yes, I figured that out just a little before you posted this. A coyield suspends it in between normal yield statements, and you don't want to steal the value from the next next(), .send(), or .throw() call.
Which is why those won't work for continuing it, either. What is needed is a different path out (and back) to the coroutine that doesn't interfere with the standard yield behavior. Or as you describe it here. > Yes, I'm leaning back towards a completely separate protocol > now. We seem to need a number of new protocol features in > any case, and allowing the two protocols to overlap is just > creating needless confusion, I think. I think adding a .resume() method to all generators would be good. That keeps it simple. It would just raise an exception in the case of a non-suspended generator. Which may be useful in a loop as well. Looking at ceval.c, I think Nick's suggestion of adding a new WHY_SUSPEND would be good. And along with it, a TARGET(GEN_SUSPEND) block that sets up a SuspendIteration exception, and returns WHY_SUSPEND to the generator object. At that point it could save anything it needs before raising the exception, and when the resume method is called, restore what it needs before calling the frame eval loop. I still haven't quite worked out how to get back to the original next(), .send() or .throw() call. Cheers, Ron > > And that is the whole problem... trying to make this all un_coconfusing > > to the average python programmer. If it's coconfusing to us, they don't > > have a chance. ;-) > From ubershmekel at gmail.com Sun Nov 6 09:56:36 2011 From: ubershmekel at gmail.com (Yuval Greenfield) Date: Sun, 6 Nov 2011 10:56:36 +0200 Subject: [Python-ideas] Renaming the "test" package Message-ID: For python4, the "test" package should actually be named __pythontest__ or something similar. There are 2 million results for ""test.py"" on google. What do you think? --Yuval From pyideas at rebertia.com Sun Nov 6 10:16:02 2011 From: pyideas at rebertia.com (Chris Rebert) Date: Sun, 6 Nov 2011 01:16:02 -0800 Subject: [Python-ideas] Renaming the "test" package In-Reply-To: References: Message-ID: On Sun, Nov 6, 2011 at 1:56 AM, Yuval Greenfield wrote: > For python4, the "test" package should actually be named > __pythontest__ or something similar. There are 2 million results for > ""test.py"" on google. I don't get what problem you're suggesting there is that such a rename would solve. Python has these honking great things called namespaces (in this case, packages); they make the existence of multiple Python files with the same name a non-issue (just don't use non-absolute imports). Cheers, Chris -- http://rebertia.com From ncoghlan at gmail.com Sun Nov 6 13:21:01 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 6 Nov 2011 22:21:01 +1000 Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol In-Reply-To: <1320564031.30197.100.camel@Gutsy> References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> <4EB1F2BD.7070202@canterbury.ac.nz> <1320300930.10353.78.camel@Gutsy> <4EB5D043.8050106@canterbury.ac.nz> <1320564031.30197.100.camel@Gutsy> Message-ID: On Sun, Nov 6, 2011 at 5:20 PM, Ron Adam wrote: > What is needed is a different path out (and back) to the coroutine that > doesn't interfere with the standard yield behavior. Or as you describe > it here. Did my post of the thread+queue based implementation of a coroutine API design concept not go through?
It demonstrated exactly the need for a separate I/O channel independent of the call/return and next/send/throw/yield channels. >> Yes, I'm leaning back towards a completely separate protocol >> now. We seem to need a number of new protocol features in >> any case, and allowing the two protocols to overlap is just >> creating needless confusion, I think. > > I think adding a .resume() method to all generators would be good. That > keeps it simple. It would just raise an exception in the case of a > non-suspended generator. Which may be useful in a loop as well. > > Looking at ceval.c, I think Nick's suggestion of adding a new > WHY_SUSPEND would be good. And along with it, a TARGET(GEN_SUSPEND) > block that sets up a SuspendIteration exception, and returns WHY_SUSPEND > to the generator object. The whole point of a separate WHY_SUSPEND is that you *wouldn't* unwind the stack at all - you'd just leave it in place and return to the point that called into the coroutine in the first place. Exceptions are too destructive to the stack to be usable for this task (as soon as you hit an except or finally clause, the execution state in the frame gets modified to invoke them, thus meaning you can no longer resume that stack correctly) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ron3200 at gmail.com Sun Nov 6 16:03:04 2011 From: ron3200 at gmail.com (Ron Adam) Date: Sun, 06 Nov 2011 09:03:04 -0600 Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol In-Reply-To: References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> <4EB1F2BD.7070202@canterbury.ac.nz> <1320300930.10353.78.camel@Gutsy> <4EB5D043.8050106@canterbury.ac.nz> <1320564031.30197.100.camel@Gutsy> Message-ID: <1320591784.5049.10.camel@Gutsy> On Sun, 2011-11-06 at 22:21 +1000, Nick Coghlan wrote: > On Sun, Nov 6, 2011 at 5:20 PM, Ron Adam wrote: > > What is needed is a different path out (and back) to the coroutine that > > doesn't interfere with the standard yield behavior. Or as you describe > > it here. > > Did my post of the thread+queue based implementation of a coroutine > API design concept not go through? It demonstrated exactly the need > for a separate I/O channel independent of the call/return and > next/send/throw/yield channels. Ok, I found it and the link to the implementation. Thanks for the reminder, I'll check it out tonight when I get back. > >> Yes, I'm leaning back towards a completely separate protocol > >> now. We seem to need a number of new protocol features in > >> any case, and allowing the two protocols to overlap is just > >> creating needless confusion, I think. > > > > I think adding a .resume() method to all generators would be good. That > > keeps it simple. It would just raise an exception in the case of a > > non-suspended generator. Which may be useful in a loop as well. > > > > Looking at ceval.c, I think Nick's suggestion of adding a new > > WHY_SUSPEND would be good. And along with it, a TARGET(GEN_SUSPEND) > > block that sets up a SuspendIteration exception, and returns WHY_SUSPEND > > to the generator object.
> > The whole point of a separate WHY_SUSPEND is that you *wouldn't* > unwind the stack at all - you'd just leave it in place and return to > the point that called into the coroutine in the first place. Well, that was what I was trying for. In any event, I'm learning a lot more by trying to actually do it, than just speculate about it. I'm at the point of learning just how exceptions interact with the frames. And I'm pretty sure I'll just confirm what you say below, but with a much better understanding of how it all works. Cheers, Ron > Exceptions are too destructive to the stack to be usable for this task > (as soon as you hit an except or finally clause, the execution state > in the frame gets modified to invoke them, thus meaning you can no > longer resume that stack correctly) > > Cheers, > Nick. > From greg.ewing at canterbury.ac.nz Sun Nov 6 21:43:17 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 07 Nov 2011 09:43:17 +1300 Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol In-Reply-To: <1320564031.30197.100.camel@Gutsy> References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> <4EB1F2BD.7070202@canterbury.ac.nz> <1320300930.10353.78.camel@Gutsy> <4EB5D043.8050106@canterbury.ac.nz> <1320564031.30197.100.camel@Gutsy> Message-ID: <4EB6F165.7050800@canterbury.ac.nz> Ron Adam wrote: > I still haven't quite worked out how to get back to the original > next(), .send() or .throw() call. Keeping the protocols fully separated requires more than just adding methods to the generator. It also requires either its own version of the yield-from chain or some other way of keeping track of the stack of active generators. Once you have that, it becomes clearer how to get back to the right place. -- Greg From ethan at stoneleaf.us Sun Nov 6 22:22:29 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Sun, 06 Nov 2011 13:22:29 -0800 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <1320468779.1911.39.camel@Gutsy> References: <20111030001801.2f52ceb2@pitrou.net> <20111104203842.7de86289@pitrou.net> <20111104231523.GB2306@chopin.edu.pl> <1320468779.1911.39.camel@Gutsy> Message-ID: <4EB6FA95.10801@stoneleaf.us> Ron Adam wrote: > Better, but I don't think you need to nest it to indicate that > relationship. I would use a colon instead of a dot to indicate a > namespace relationship. > > f.g g is an attribute of f Being an attribute of an object /is/ being in that object's namespace (functions being the obvious exception). ~Ethan~ From ethan at stoneleaf.us Sun Nov 6 22:18:38 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Sun, 06 Nov 2011 13:18:38 -0800 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: References: <20111030001801.2f52ceb2@pitrou.net> <20111104203842.7de86289@pitrou.net> <20111104231523.GB2306@chopin.edu.pl> Message-ID: <4EB6F9AE.7000605@stoneleaf.us> Guido van Rossum wrote: > In the end I like the non-nested version better. 
+1 ~Ethan~ From solipsis at pitrou.net Mon Nov 7 01:04:35 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 7 Nov 2011 01:04:35 +0100 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions References: <20111030001801.2f52ceb2@pitrou.net> <4EB5653A.4000302@netwok.org> <1320516118.14236.11.camel@Gutsy> Message-ID: <20111107010435.7d6be84a@pitrou.net> On Sat, 5 Nov 2011 18:37:14 -0700 Guido van Rossum wrote: > On Sat, Nov 5, 2011 at 3:31 PM, Nick Coghlan wrote: > > > I think you're all overthinking this. We want a notation that: > > 1. Makes it clear attribute access won't work > > 2. Indicates where the nested definition really lives > > 3. Gives a reasonable error message if you do try to evaluate it > > > > "f.<locals>.g" ticks all those boxes, so it's fine. > > > > +1. Let's stop the bikeshedding. I've now updated the PEP and the implementation for "f.<locals>.g" to be the __qname__ value in that case. Regards Antoine. From greg at krypto.org Mon Nov 7 02:38:00 2011 From: greg at krypto.org (Gregory P. Smith) Date: Sun, 6 Nov 2011 17:38:00 -0800 Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught Message-ID: On Tue, Nov 1, 2011 at 1:15 AM, Nick Coghlan wrote: > > No, the traceback info is added by the eval loop itself. Remember that > when you raise an exception *type* (rather than an instance), the > exception doesn't get instantiated until it gets caught somewhere - > the eval loop maintains the unwinding stack for the traceback as part > of the thread state until it is time to attach it to the exception > object. > I did not know that. Is there a good reason for doing this? It seems unnecessarily complicated. -gps -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Mon Nov 7 02:46:43 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 7 Nov 2011 11:46:43 +1000 Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught In-Reply-To: References: Message-ID: On Mon, Nov 7, 2011 at 11:38 AM, Gregory P. Smith wrote: > > On Tue, Nov 1, 2011 at 1:15 AM, Nick Coghlan wrote: >> >> No, the traceback info is added by the eval loop itself.
Remember that > >> when you raise an exception *type* (rather than an instance), the > >> exception doesn't get instantiated until it gets caught somewhere - > >> the eval loop maintains the unwinding stack for the traceback as part > >> of the thread state until it is time to attach it to the exception > >> object. > > I did not know that. Is there a good reason for doing this? It seems > unnecessarily complicated. You'll have to ask Guido that one - it's been like that since long before I got involved in hacking on the interpreter. It's possibly a lingering artifact of the old "exceptions are just strings" design, since traceback storage didn't get added to exception instances until Py3k. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From guido at python.org Mon Nov 7 02:52:14 2011 From: guido at python.org (Guido van Rossum) Date: Sun, 6 Nov 2011 17:52:14 -0800 Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught In-Reply-To: References: Message-ID: On Sun, Nov 6, 2011 at 5:46 PM, Nick Coghlan wrote: > On Mon, Nov 7, 2011 at 11:38 AM, Gregory P. Smith wrote: > > > > On Tue, Nov 1, 2011 at 1:15 AM, Nick Coghlan wrote: > >> > >> No, the traceback info is added by the eval loop itself. Remember that > >> when you raise an exception *type* (rather than an instance), the > >> exception doesn't get instantiated until it gets caught somewhere - > >> the eval loop maintains the unwinding stack for the traceback as part > >> of the thread state until it is time to attach it to the exception > >> object. > > > > I did not know that. Is there a good reason for doing this? It seems > > unnecessarily complicated. > > You'll have to ask Guido that one - it's been like that since long > before I got involved in hacking on the interpreter. > > It's possibly a lingering artifact of the old "exceptions are just > strings" design, since traceback storage didn't get added to exception > instances until Py3k. > No, it was actually introduced when exceptions became classes. It is an optimization that we deemed important at the time: to avoid instantiating the class when it's going to be caught by C code that doesn't care about the instance. A common example is StopIteration, but there are probably plenty of other situations like it. We may even have benchmarked for-loops with and without this -- while the exception only happens once per for-loop, there are a lot of for-loops, many of which iterate over small sequences, and it adds up. I'm not sure that it's still that important -- in fact I'm not sure Python 3 still has this behavior. -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Mon Nov 7 03:03:46 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 7 Nov 2011 12:03:46 +1000 Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught In-Reply-To: References: Message-ID: On Mon, Nov 7, 2011 at 11:52 AM, Guido van Rossum wrote: > No, it was actually introduced when exceptions became classes. It is an > optimization that we deemed important at the time: to avoid instantiating > the class when it's going to be caught by C code that doesn't care about the > instance. A common example is StopIteration, but there are probably plenty > of other situations like it. We may even have benchmarked for-loops with and > without this -- while the exception only happens once per for-loop, there > are a lot of for-loops, many of which iterate over small sequences, and it > adds up. Ah, thanks - interesting to know. > I'm not sure that it's still that important -- in fact I'm not sure Python 3 > still has this behavior.
> > I'm fairly sure it does (it was the 3.x version of ceval that I was > reading to remind myself of how the stack unwinding process actually > works and the call to PyErr_NormalizeException() is still in there). > It's just that without the multiple argument forms of the raise > statement, pure Python code can only exploit it for exceptions without > any arguments (so it still works for the StopIteration optimisation). It's too late for me to go and take a look, but I'm not sure it does. Unless the performance loss is significant, I would be in favour of simplifying all this and always instantiating exceptions. Exception handling in the eval loop and exception "normalization" is a nasty area of the interpreter. Regards Antoine. From cs at zip.com.au Mon Nov 7 08:52:55 2011 From: cs at zip.com.au (Cameron Simpson) Date: Mon, 7 Nov 2011 18:52:55 +1100 Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught In-Reply-To: <20111107040453.594180e6@pitrou.net> References: <20111107040453.594180e6@pitrou.net> Message-ID: <20111107075255.GA16482@cskk.homeip.net> On 07Nov2011 04:04, Antoine Pitrou wrote: | On Mon, 7 Nov 2011 12:03:46 +1000 | Nick Coghlan wrote: | > > I'm not sure that it's still that important -- in fact I'm not sure Python 3 | > > still has this behavior. | > | > I'm fairly sure it does (it was the 3.x version of ceval that I was | > reading to remind myself of how the stack unwinding process actually | > works and the call to PyErr_NormalizeException() is still in there). | > It's just that without the multiple argument forms of the raise | > statement, pure Python code can only exploit it for exceptions without | > any arguments (so it still works for the StopIteration optimisation). | | It's too late for me to go and take a look, but I'm not sure it does. | Unless the performance loss is significant, I would be in favour of | simplifying all this and always instantiating exceptions. Exception | handling in the eval loop and exception "normalization" is a nasty area | of the interpreter. I presume StopIteration would get instantiated to a singleton, like NoneType to None? Just asking. -- Cameron Simpson DoD#743 http://www.cskk.ezoshosting.com/cs/ Login incorrect. Only perfect spellers may enter this system. - Haiku Error Messages http://www.salonmagazine.com/21st/chal/1998/02/10chal2.html From steve at pearwood.info Mon Nov 7 11:15:33 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Mon, 07 Nov 2011 21:15:33 +1100 Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught In-Reply-To: <20111107075255.GA16482@cskk.homeip.net> References: <20111107040453.594180e6@pitrou.net> <20111107075255.GA16482@cskk.homeip.net> Message-ID: <4EB7AFC5.3040509@pearwood.info> Cameron Simpson wrote: > I presume StopIteration would get instantiated to a singleton, like > NoneType to None? Just asking. Are you asking about what it should be, or what it is? 
Either way: >>> a = StopIteration('spam') >>> b = StopIteration('ham') >>> a is b False -- Steven From solipsis at pitrou.net Mon Nov 7 11:35:44 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 7 Nov 2011 11:35:44 +0100 Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught References: <20111107040453.594180e6@pitrou.net> <20111107075255.GA16482@cskk.homeip.net> Message-ID: <20111107113544.6a413544@pitrou.net> On Mon, 7 Nov 2011 18:52:55 +1100 Cameron Simpson wrote: > On 07Nov2011 04:04, Antoine Pitrou wrote: > | On Mon, 7 Nov 2011 12:03:46 +1000 > | Nick Coghlan wrote: > | > > I'm not sure that it's still that important -- in fact I'm not sure Python 3 > | > > still has this behavior. > | > > | > I'm fairly sure it does (it was the 3.x version of ceval that I was > | > reading to remind myself of how the stack unwinding process actually > | > works and the call to PyErr_NormalizeException() is still in there). > | > It's just that without the multiple argument forms of the raise > | > statement, pure Python code can only exploit it for exceptions without > | > any arguments (so it still works for the StopIteration optimisation). > | > > | It's too late for me to go and take a look, but I'm not sure it does. > | Unless the performance loss is significant, I would be in favour of > | simplifying all this and always instantiating exceptions. Exception > | handling in the eval loop and exception "normalization" is a nasty area > | of the interpreter. > > I presume StopIteration would get instantiated to a singleton, like > NoneType to None? Just asking. It is impossible to use singletons for exception instances now that the traceback is stored on them. (there was a bug with MemoryError which was a singleton: a raised memory error would make its traceback immortal, making memory pressure even worse) Regards Antoine. From ncoghlan at gmail.com Mon Nov 7 13:01:24 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 7 Nov 2011 22:01:24 +1000 Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught In-Reply-To: <20111107075255.GA16482@cskk.homeip.net> References: <20111107040453.594180e6@pitrou.net> <20111107075255.GA16482@cskk.homeip.net> Message-ID: On Mon, Nov 7, 2011 at 5:52 PM, Cameron Simpson wrote: > > I presume StopIteration would get instantiated to a singleton, like > NoneType to None? Just asking. Even without the traceback issue Antoine mentioned, it's already the case that StopIteration isn't a singleton in 2.x. Various pieces of code (e.g. contextlib.contextmanager) rely on being able to tell whether they're getting a specific StopIteration instance back or a new one. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From barry at python.org Mon Nov 7 16:34:24 2011 From: barry at python.org (Barry Warsaw) Date: Mon, 7 Nov 2011 10:34:24 -0500 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions References: <20111030001801.2f52ceb2@pitrou.net> Message-ID: <20111107103424.4c2e2ef1@resist.wooz.org> On Oct 30, 2011, at 12:18 AM, Antoine Pitrou wrote: >I would like to propose the following PEP for discussion and, if >possible, acceptance. I think the proposal shouldn't be too >controversial (I find it quite simple and straightforward myself :-)). Nice PEP, and +1 for the concept (with Guido's preferred format). However, "qname" is pretty obscure and I only guessed what the "q" stood for by reading the title of the PEP.
It seems to me that this attribute represents the dotted path from module globals to the object. You have to be careful not to confuse this with a file system path, so something like __dotted_name__, __dotted_path__, or __full_name__ perhaps. I don't much care, but I do think cryptic abbreviations should be avoided. Cheers, -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: not available URL: From barry at python.org Mon Nov 7 16:40:34 2011 From: barry at python.org (Barry Warsaw) Date: Mon, 7 Nov 2011 10:40:34 -0500 Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught References: Message-ID: <20111107104034.1a808263@resist.wooz.org> On Nov 06, 2011, at 05:52 PM, Guido van Rossum wrote: >No, it was actually introduced when exceptions became classes. It is an >optimization that we deemed important at the time: to avoid instantiating >the class when it's going to be caught by C code that doesn't care about >the instance. A common example is StopIteration, but there are probably >plenty of other situations like it. We may even have benchmarked for-loops >with and without this -- while the exception only happens once per >for-loop, there are a lot of for-loops, many of which iterate over small >sequences, and it adds up. > >I'm not sure that it's still that important -- in fact I'm not sure Python >3 still has this behavior. I did some performance testing back when this was introduced. The numbers aren't relevant any more, but the basic idea was to time things like dictionary access with .get() vs. __getitem__() both when the key was in the dictionary and when it was missing. There were some other timing tests IIRC. IIRC, instantiating the exception in every case was a fairly significant hit. It would be good to see some updated tests and numbers before anything was changed in the interpreter. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: not available URL: From merwok at netwok.org Mon Nov 7 17:19:34 2011 From: merwok at netwok.org (Éric Araujo) Date: Mon, 07 Nov 2011 17:19:34 +0100 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <20111107103424.4c2e2ef1@resist.wooz.org> References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> Message-ID: <4EB80516.8060500@netwok.org> Le 07/11/2011 16:34, Barry Warsaw a écrit : > Nice PEP, and +1 for the concept (with Guido's preferred format). However, > "qname" is pretty obscure and I only guessed what the "q" stood for by reading > the title of the PEP. OTOH, qname will be easily found in search engine (one of my griefs with the "packaging" module name). > It seems to me that this attribute represents the dotted path from module > globals to the object. You have to be careful not to confuse this with a file > system path, so something like __dotted_name__, __dotted_path__, or > __full_name__ perhaps. I don't much care, but I do think cryptic > abbreviations should be avoided. -1 on these propositions: A dotted name or path includes the module name for me (or for things like packaging.util.resolve_name). I'm less affirmative about "full name"; it could mean "full name in one module", so that __module__ + __fullname__ == the fully qualified name.
This terminology is not terrific: I don't think it would be good to have a distinction between "full name" (a.b) and "fully qualified name" (module.a.b). I think "qualified name" and "fully qualified name" are better terms: "qualified" has IMO less assumptions than full (you ask yourself "in what way is it qualified?" and go to the docs, instead of thinking "ah, it's the full name including the module" or "ah, it's like __name__ but better, and without the module"), and "fully qualified" builds on "qualified". (I'm trying to be constructive, not bikeshedding; tell me if I fail.) __qualname__ is less cryptic and not too long; __qualifiedname__ is even less cryptic. Cheers From barry at python.org Mon Nov 7 17:25:53 2011 From: barry at python.org (Barry Warsaw) Date: Mon, 7 Nov 2011 11:25:53 -0500 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <4EB80516.8060500@netwok.org> References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <4EB80516.8060500@netwok.org> Message-ID: <20111107112553.35c7fffd@resist.wooz.org> On Nov 07, 2011, at 05:19 PM, Éric Araujo wrote: >OTOH, qname will be easily found in search engine (one of my griefs with >the "packaging" module name). Are you sure about that? I.e. have you tried it? >-1 on these propositions: A dotted name or path includes the module name >for me (or for things like packaging.util.resolve_name). I'm less >affirmative about "full name"; it could mean "full name in one module", >so that __module__ + __fullname__ == the fully qualified name.
This > >terminology is not terrific: I don't think it would be good to have a > >distinction between "full name" (a.b) and "fully qualified name" > >(module.a.b). I think "qualified name" and "fully qualified name" are > >better terms: "qualified" has IMO less assumptions than full (you ask > >yourself "in what way is it qualified?" and go to the docs, instead of > >thinking "ah, it's the full name including the module" or "ah, it's like > >__name__ but better, and without the module"), and "fully qualified" > >builds on "qualified". > > > >(I'm trying to be constructive, not bikeshedding; tell me if I fail.) > > > >__qualname__ is less cryptic and not too long; __qualifiedname__ is even > >less cryptic. "scoped" could also be used instead of "qualified". -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: not available URL: From solipsis at pitrou.net Mon Nov 7 17:23:06 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 7 Nov 2011 17:23:06 +0100 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <4EB80516.8060500@netwok.org> <20111107112553.35c7fffd@resist.wooz.org> Message-ID: <20111107172306.25854c71@pitrou.net> On Mon, 7 Nov 2011 11:25:53 -0500 Barry Warsaw wrote: > On Nov 07, 2011, at 05:19 PM, Éric Araujo wrote: > > >OTOH, qname will be easily found in search engine (one of my griefs with > >the "packaging" module name). > > Are you sure about that? I.e. have you tried it? > > >-1 on these propositions: A dotted name or path includes the module name > >for me (or for things like packaging.util.resolve_name). I'm less > >affirmative about "full name"; it could mean "full name in one module", > >so that __module__ + __fullname__ == the fully qualified name. This > >terminology is not terrific: I don't think it would be good to have a > >distinction between "full name" (a.b) and "fully qualified name" > >(module.a.b). I think "qualified name" and "fully qualified name" are > >better terms: "qualified" has IMO less assumptions than full (you ask > >yourself "in what way is it qualified?" and go to the docs, instead of > >thinking "ah, it's the full name including the module" or "ah, it's like > >__name__ but better, and without the module"), and "fully qualified" > >builds on "qualified". > > > >(I'm trying to be constructive, not bikeshedding; tell me if I fail.) > > > >__qualname__ is less cryptic and not too long; __qualifiedname__ is even > >less cryptic. > > "scoped" could also be used instead of "qualified". How about __sname__? Ok, sorry :-) Antoine. From merwok at netwok.org Mon Nov 7 18:00:57 2011 From: merwok at netwok.org (Éric Araujo) Date: Mon, 07 Nov 2011 18:00:57 +0100 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <20111107112553.35c7fffd@resist.wooz.org> References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <4EB80516.8060500@netwok.org> <20111107112553.35c7fffd@resist.wooz.org> Message-ID: <4EB80EC9.4080505@netwok.org> Le 07/11/2011 17:25, Barry Warsaw a écrit : > On Nov 07, 2011, at 05:19 PM, Éric Araujo wrote: >> OTOH, qname will be easily found in search engine > Are you sure about that? I.e. have you tried it? I did, and it made me remember why I didn't find "qname" cryptic: It is used by XML specs (Antoine, you could mention that in the PEP). Top Google results for "python qname" are XML-related. More: http://en.wikipedia.org/wiki/QName http://en.wikipedia.org/wiki/Fully_qualified_name (#include ) > "scoped" could also be used instead of "qualified". You truly are a great name generator! +1 to either. Cheers From cs at zip.com.au Mon Nov 7 21:05:29 2011 From: cs at zip.com.au (Cameron Simpson) Date: Tue, 8 Nov 2011 07:05:29 +1100 Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught In-Reply-To: <4EB7AFC5.3040509@pearwood.info> <20111107113544.6a413544@pitrou.net> References: <4EB7AFC5.3040509@pearwood.info> <20111107113544.6a413544@pitrou.net> Message-ID: <20111107200529.GA10386@cskk.homeip.net> I wrote, naively: | > I presume StopIteration would get instantiated to a singleton, like | > NoneType to None? Just asking. On 07Nov2011 22:01, Nick Coghlan wrote: | Even without the traceback issue Antoine mentioned, it's already the | case that StopIteration isn't a singleton in 2.x. Various pieces of | code (e.g. contextlib.contextmanager) rely on being able to tell | whether they're getting a specific StopIteration instance back or a | new one. Interesting. Off topic digression: I've been slightly uncomfortable about exceptions as control flow for a while, basically when writing code like this: try: x = G.next() except StopIteration: # G is empty! in that I don't entirely know that the StopIteration came from G or from some buggy code deeper inside G that let a StopIteration out, eg by mangling a try/except like the above. In most circumstances with other exceptions, while you might _expect_ the exception to come from the source you expect you don't care so much because it will indicate failure of the operation anyway. Report or die, you don't proceed as if the op was good.
But with StopIteration one is reading "G is empty" directly into the situation and acting as though it is normal (exit the event loop or whatever it may imply). On 07Nov2011 11:35, Antoine Pitrou wrote: | It is impossible to use singletons for exception instances now that the | traceback is stored on them. Ah. I had somehow thought the exception itself and the traceback were distinct items, presented in a tuple. On 07Nov2011 21:15, Steven D'Aprano wrote: | Are you asking about what it should be, or what it is? The former. | Either way: | >>> a = StopIteration('spam') | >>> b = StopIteration('ham') | >>> a is b | False Since my question was about the proposed new behaviour when just a type was raised, the above test wouldn't educate me. Though it does address the behaviour of the type instantiation in general. Cheers, -- Cameron Simpson DoD#743 http://www.cskk.ezoshosting.com/cs/ Carpe Datum - John Sloan From greg at krypto.org Mon Nov 7 22:43:55 2011 From: greg at krypto.org (Gregory P. Smith) Date: Mon, 7 Nov 2011 13:43:55 -0800 Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught In-Reply-To: <20111107200529.GA10386@cskk.homeip.net> References: <4EB7AFC5.3040509@pearwood.info> <20111107113544.6a413544@pitrou.net> <20111107200529.GA10386@cskk.homeip.net> Message-ID: On Mon, Nov 7, 2011 at 12:05 PM, Cameron Simpson wrote: > I wrote, naively: > | > I presume StopIteration would get instantiated to a singleton, like > | > NoneType to None? Just asking. > > On 07Nov2011 22:01, Nick Coghlan wrote: > | Even without the traceback issue Antoine mentioned, it's already the > | case that StopIteration isn't a singleton in 2.x. Various pieces of > | code (e.g. contextlib.contextmanager) rely on being able to tell > | whether they're getting a specific StopIteration instance back or a > | new one. > > Interesting. > > Off topic digression: > > I've been slightly uncomfortable about exceptions as control flow for a > while, basically when writing code like this: > > try: > x = G.next() > except StopIteration: > # G is empty! > > in that I don't entirely know that the StopIteration came from G or from > some buggy code deeper inside G that let a StopIteration out, eg by > mangling a try/except like the above. In most circumstances with other > exceptions, while you might _expect_ the exception to come from the > source you expect you don't care so much because it will indicate > failure of the operation anyway. Report or die, you don't proceed as if > the op was good. But with StopIteration one is reading "G is empty" > directly into the situation and acting as though it is normal (exit the > event loop or whatever it may imply). Agreed. Use of exceptions for this in the language feels like it was a convenient way to do it but as the conditions aren't really *exceptional* at all it'd be nice if there were a lighter weight mechanism that could skip the unneeded parts of the exception raising and handling mechanism for the implementation. We don't need the traceback to be stored in these situations. This existing logic to instantiate and associate the traceback with it only if caught is one way to implement doing exactly that. Any other ideas? Hackish things like a class attribute on classes being raised as an exception, or a LightweightException class being part of its class hierarchy used to signify if that exception should take the full path or the fast path come to mind but could be considered equally surprising.
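A minimal sketch of the hazard described above -- a StopIteration escaping from buggy code inside a generator is indistinguishable from normal exhaustion (the helper name is illustrative):

    def buggy_helper():
        raise StopIteration  # control-flow exception leaking by accident

    def gen():
        yield 1
        buggy_helper()  # the leak silently ends iteration here
        yield 2

    print(list(gen()))  # prints [1] -- the second value is lost, no error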
I'm not sure any of this is worth it but it would simplify the eval loop. We're speaking implementation details of CPython here, not an actual change to the language itself. (*)

-gps

(*) Please beat anybody who writes code that depends on this somewhat odd exception instantiation timing behavior side effect over the head with a frozen herring.

From zuo at chopin.edu.pl Mon Nov 7 22:48:16 2011
From: zuo at chopin.edu.pl (Jan Kaliszewski)
Date: Mon, 7 Nov 2011 22:48:16 +0100
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
In-Reply-To: <4EB80EC9.4080505@netwok.org>
References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <4EB80516.8060500@netwok.org> <20111107112553.35c7fffd@resist.wooz.org> <4EB80EC9.4080505@netwok.org>
Message-ID: <20111107214816.GC2332@chopin.edu.pl>

Éric Araujo dixit (2011-11-07, 18:00):
> Le 07/11/2011 17:25, Barry Warsaw a écrit :
> > On Nov 07, 2011, at 05:19 PM, Éric Araujo wrote:
> >> OTOH, qname will be easily found in search engine
> > Are you sure about that? I.e. have you tried it?
> I did, and it made me remember why I didn't find "qname" cryptic: It is
> used by XML specs (Antoine, you could mention that in the PEP). Top
> Google results for "python qname" are XML-related.
>
> More:
> http://en.wikipedia.org/wiki/QName
> http://en.wikipedia.org/wiki/Fully_qualified_name
>
> (#include )
>
> > "scoped" could also be used instead of "qualified".
> You truly are a great name generator! +1 to either.

But without the underscore in the middle, please. We have __getattribute__ and __deepcopy__, not __get_attribute__ and __deep_copy__.

Cheers.
*j

From anacrolix at gmail.com Mon Nov 7 23:01:24 2011
From: anacrolix at gmail.com (Matt Joiner)
Date: Tue, 8 Nov 2011 09:01:24 +1100
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
In-Reply-To: <20111107214816.GC2332@chopin.edu.pl>
References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <4EB80516.8060500@netwok.org> <20111107112553.35c7fffd@resist.wooz.org> <4EB80EC9.4080505@netwok.org> <20111107214816.GC2332@chopin.edu.pl>
Message-ID:

+1 qualname
From fuzzyman at gmail.com Tue Nov 8 00:05:21 2011
From: fuzzyman at gmail.com (Michael Foord)
Date: Mon, 7 Nov 2011 23:05:21 +0000
Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught
In-Reply-To: References: <4EB7AFC5.3040509@pearwood.info> <20111107113544.6a413544@pitrou.net> <20111107200529.GA10386@cskk.homeip.net>
Message-ID:

On 7 November 2011 21:43, Gregory P. Smith wrote:
> This existing logic to instantiate and associate the traceback with it
> only if caught is one way to implement doing exactly that. Any other ideas?
> [...]
> (*) Please beat anybody who writes code that depends on this somewhat odd
> exception instantiation timing behavior side effect over the head with a
> frozen herring.

Having the interpreter instantiate the exception for you allows you to do wonderful things like this:

>>> class Foo(Exception):
...     def __new__(cls, *args):
...         return object()
...
>>> try:
...     raise Foo
... except Exception as e:
...     print (e)
...

(I know this has nothing to do with the topic being debated but for some reason this code tickles me. Plus it used to segfault Python 3...)

All the best,

Michael

--
http://www.voidspace.org.uk/

May you do good and not evil
May you find forgiveness for yourself and forgive others
May you share freely, never taking more than you give.
-- the sqlite blessing http://www.sqlite.org/different.html

From fuzzyman at gmail.com Tue Nov 8 00:06:58 2011
From: fuzzyman at gmail.com (Michael Foord)
Date: Mon, 7 Nov 2011 23:06:58 +0000
Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught
In-Reply-To: References: <4EB7AFC5.3040509@pearwood.info> <20111107113544.6a413544@pitrou.net> <20111107200529.GA10386@cskk.homeip.net>
Message-ID:

On 7 November 2011 23:05, Michael Foord wrote:
> Having the interpreter instantiate the exception for you allows you to do
> wonderful things like this: [...]
> (I know this has nothing to do with the topic being debated but for some
> reason this code tickles me. Plus it used to segfault Python 3...)

Ooh... this one segfaults Python 3.2 - I wonder if that's been fixed yet.

>>> class Foo(Exception):
...     def __new__(cls, *args):
...         return 'string exception'
...
>>> try:
...     raise Foo
... except Exception as e:
...     print (e)
...
Segmentation fault: 11

All the best,

Michael Foord

From fuzzyman at gmail.com Tue Nov 8 00:20:38 2011
From: fuzzyman at gmail.com (Michael Foord)
Date: Mon, 7 Nov 2011 23:20:38 +0000
Subject: [Python-ideas] raising an exception type doesn't instantiate it until it gets caught
In-Reply-To: References: <4EB7AFC5.3040509@pearwood.info> <20111107113544.6a413544@pitrou.net> <20111107200529.GA10386@cskk.homeip.net>
Message-ID:

On 7 November 2011 23:06, Michael Foord wrote:
> Ooh... this one segfaults Python 3.2 - I wonder if that's been fixed yet.
> [...]
> Segmentation fault: 11

Yeah, fixed on 3.2 branch and trunk. Sorry for all the noise. I finally managed to get Python head to compile on OS X Lion, yay!

Michael

From ncoghlan at gmail.com Tue Nov 8 02:16:42 2011
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 8 Nov 2011 11:16:42 +1000
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
In-Reply-To: <20111107103424.4c2e2ef1@resist.wooz.org>
References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org>
Message-ID:

On Tue, Nov 8, 2011 at 1:34 AM, Barry Warsaw wrote:
> Nice PEP, and +1 for the concept (with Guido's preferred format). However,
> "qname" is pretty obscure and I only guessed what the "q" stood for by reading
> the title of the PEP.
>
> It seems to me that this attribute represents the dotted path from module
> globals to the object. You have to be careful not to confuse this with a file
> system path, so something like __dotted_name__, __dotted_path__, or
> __full_name__ perhaps. I don't much care, but I do think cryptic
> abbreviations should be avoided.

For both this PEP and PEP 395 (module aliasing), we need a new term for a name that is "usually the same as __name__, but different in some cases where __name__ lacks necessary detail". "Qualified name" meets that criteria in both cases by covering all situations where __name__ underspecifies things.

For classes and functions, __name__ leaves out the nesting information, so you can't reliably locate the definition based solely on __module__ and __name__. In the case of modules, we occasionally alter __name__ for other purposes (such as indicating the main module, or to ensure objects get serialised with the correct module information when the file layout doesn't match the nominal module layout). The new qualified names will address all those cases without causing backwards compatibility issues for existing uses of __name__.

Hence, -1 on terms other than "qualified", because they don't fit the module aliasing use case (which involves names that are neither scoped nor necessarily dotted) and would require me to come up with yet another term for what is essentially the same concept.

The question then is whether or not to introduce "qname" as an officially documented abbreviation for "qualified name". And to that I say, "Hell, yes".

1. "qualifiedname" and "qualified_name" are just too damn long. I want an abbreviation.
2. "qualname" is too hard to say
3. People are going to come up with an abbreviation anyway, so we may as well make it an official one so we get to benefit as well

Being able to talk about "names and qnames" without ambiguity will make all of the affected code much easier to update and discuss than if we insist on using the full "qualified name" term every time.

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com |
Brisbane, Australia

From wuwei23 at gmail.com Tue Nov 8 03:44:12 2011
From: wuwei23 at gmail.com (alex23)
Date: Mon, 7 Nov 2011 18:44:12 -0800 (PST)
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
In-Reply-To: References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org>
Message-ID: <60c446b4-f8e3-4f72-a597-7eef7a1dc20e@z15g2000prn.googlegroups.com>

On Nov 8, 11:16 am, Nick Coghlan wrote:
> For both this PEP and PEP 395 (module aliasing), we need a new term
> for a name that is "usually the same as __name__, but different in
> some cases where __name__ lacks necessary detail".

__aka__ ? :)

From ron3200 at gmail.com Tue Nov 8 06:30:53 2011
From: ron3200 at gmail.com (Ron Adam)
Date: Mon, 07 Nov 2011 23:30:53 -0600
Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol
In-Reply-To: References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> <4EB1F2BD.7070202@canterbury.ac.nz> <1320300930.10353.78.camel@Gutsy> <4EB5D043.8050106@canterbury.ac.nz> <1320564031.30197.100.camel@Gutsy>
Message-ID: <1320730253.22449.39.camel@Gutsy>

On Sun, 2011-11-06 at 22:21 +1000, Nick Coghlan wrote:
> On Sun, Nov 6, 2011 at 5:20 PM, Ron Adam wrote:
> > What is needed is a different path out (and back) to the coroutine that
> > doesn't interfere with the standard yield behavior. Or as you describe
> > it here.
>
> Did my post of the thread+queue based implementation of a coroutine
> API design concept not go through? It demonstrated exactly the need
> for a separate I/O channel independent of the call/return and
> next/send/throw/yield channels.

I'm looking at it now. BTW the resume method is missing the 'return data' at the end.

Yes, I think it will be useful as a way to figure out the best API. It looks like the API is almost the same as the generator interface, with different spellings.

What do you think about an inverted generator API? You almost have that, but the throw is in the thread object, and not reachable from the thread.

Generator API.

   Outside generator  <-->  Inside generator

      next()                   yield
      .send()
      .throw()

Inverted generator API

   Outside cothread   <-->  Inside cothread

      .resume()                suspend()
                               throw()

Where resume works like yield (yield to cothread), and suspend() works like .send(). Throw() raises an exception at the resume() call, like .throw() raises an exception at the yield in a generator.

Cheers,
Ron

From ncoghlan at gmail.com Tue Nov 8 06:46:31 2011
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 8 Nov 2011 15:46:31 +1000
Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol
In-Reply-To: <1320730253.22449.39.camel@Gutsy>
References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> <4EB1F2BD.7070202@canterbury.ac.nz> <1320300930.10353.78.camel@Gutsy> <4EB5D043.8050106@canterbury.ac.nz> <1320564031.30197.100.camel@Gutsy> <1320730253.22449.39.camel@Gutsy>
Message-ID:

On Tue, Nov 8, 2011 at 3:30 PM, Ron Adam wrote:
> Generator API.
>
>    Outside generator  <-->  Inside generator
>
>       next()                   yield
>       .send()
>       .throw()
>
> Inverted generator API
>
>    Outside cothread   <-->  Inside cothread
>
>       .resume()                suspend()
>                                throw()
>
> Where resume works like yield (yield to cothread), and suspend() works
> like .send(). Throw() raises an exception at the resume() call,
> like .throw() raises an exception at the yield in a generator.

No, that doesn't make any sense. When the coroutine throws an exception internally it's done - we don't *want* to preserve the stack any more, because something broke and we won't be resuming it. Instead, we let the exception bubble up the stack and if nothing handles it, we pass it back to the thread that called resume().

The reason we need an explicit throw() is that the data request (or whatever it was we suspended to wait for) might fail - in that case, the thread calling resume() needs to be able to indicate this to the cothread by resuming with an exception.

The flow control parallels are like this:

Inside the generator/cothread:
    yield  -> cothread.suspend()  # Wait to be resumed
    return -> return              # Finish normally
    raise  -> raise               # Bail out with an error

Outside the generator/cothread:
    send()  -> resume()  # Resume execution normally (optionally providing data)
    throw() -> throw()   # Resume execution with an exception

Don't worry about next() in this context, since it's just an alternate spelling for send(None).

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From ron3200 at gmail.com Tue Nov 8 09:24:49 2011
From: ron3200 at gmail.com (Ron Adam)
Date: Tue, 08 Nov 2011 02:24:49 -0600
Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol
In-Reply-To: References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> <4EB1F2BD.7070202@canterbury.ac.nz> <1320300930.10353.78.camel@Gutsy> <4EB5D043.8050106@canterbury.ac.nz> <1320564031.30197.100.camel@Gutsy> <1320730253.22449.39.camel@Gutsy>
Message-ID: <1320740689.25015.101.camel@Gutsy>
That could work with generators as well. Wish it did. The reason that makes sense to do in coroutines is we most likely already have a try except structure in the coroutine handler to catch the exit and return status exceptions, so why not take advantage of that and make it possibly for the coroutines to send out exceptions for other things. with a throw() from inside the coroutine. (and not unwind the stack like a raise would.) For the same reason you throw() an exception into a generator, you could throw an exception out of a coroutine. You don't *need* to do that with either of them. The alternative is to pass through the normal data channel and parse, test, and/or filter it out once it gets to the other side. An try-except around a data input can be very efficient at doing that with a lot less work. > Instead, we let the exception bubble up the stack and if nothing > handles it, we pass it back to the thread that called resume(). Right, and we can't resume from there in that case. > The reason we need an explicit throw() is that the data request (or > whatever it was we suspended to wait for) might fail - in that case, > the thread calling resume() needs to be able to indicate this to the > cothread by resuming with an exception. Yes and no... Yes, that works, and no because it could work just as well the other way around. If generators had a throw keyword... def adder(count): exc = None total = n = 0 while n < count: try: if exc is None: x = yield else: x = throw exc # reraise exc in .send(), not here. # suspends here, and waits for new x. total += x # <-- error may be here. exc = None n += 1 except Exception as e: exc = e yield total In this case, the exception wouldn't bubble out, but be reraised at the .send() where it can be handled. I think that generators not being able to handle these types of things gracefully is a reason not to use them with coroutines. A program based on generator coroutines that you get an unexpected exception from needs to be completely restarted. That makes sense for small iterators, but not for larger programs. > The flow control parallels are like this: > > Inside the generator/cothread: > yield -> cothread.suspend() # Wait to be resumed > return -> return # Finish normally > raise -> raise # Bail out with an error > > Outside the generator/cothread > send() -> resume() # Resume execution normally (optionally providing data) > throw() -> throw() # Resume execution with an exception > > Don't worry about next() in this context, since it's just an alternate > spelling for send(None). Yes, about the next. And yes, this is the design in your cothread module example. 
:)

Cheers,
Ron

From ncoghlan at gmail.com Tue Nov 8 11:43:51 2011
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 8 Nov 2011 20:43:51 +1000
Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol
In-Reply-To: <1320740689.25015.101.camel@Gutsy>
References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> <4EB1F2BD.7070202@canterbury.ac.nz> <1320300930.10353.78.camel@Gutsy> <4EB5D043.8050106@canterbury.ac.nz> <1320564031.30197.100.camel@Gutsy> <1320730253.22449.39.camel@Gutsy> <1320740689.25015.101.camel@Gutsy>
Message-ID:

On Tue, Nov 8, 2011 at 6:24 PM, Ron Adam wrote:
> On Tue, 2011-11-08 at 15:46 +1000, Nick Coghlan wrote:
>> No, that doesn't make any sense.
>
> Probably because I didn't explain it well enough.

No, it doesn't make any sense because you're taking a symmetric coroutine concept and attempting to apply it to an asymmetric coroutine API design.

*If* this were a symmetric coroutine design (like greenlets), then *yes* it would make sense to offer resume() and throw() as your control flow APIs (there wouldn't be a suspend() at all in a symmetric design, except perhaps as a convenience wrapper around invoking resume() on the coroutine that called resume() or throw() on you).

With an asymmetric design, you only have 3 sensible options: terminate naturally, terminate with an exception or suspend execution. Look at the API design again - like the corresponding generator methods, cothread.resume() and cothread.throw() both use "returned normally" to indicate that the coroutine is suspended and still has more to do. If they raise an exception, it means the coroutine is done, either because it failed or because it finished normally (with the latter case being distinguished by a specific exception type, just as it is for generators). You're suggesting it would be reasonable to raise an exception *and* still expect the coroutine to eventually be resumed again. How is the caller of resume() or throw() meant to distinguish that new case from the two terminating cases?

Cheers,
Nick.

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From barry at python.org Tue Nov 8 16:55:40 2011
From: barry at python.org (Barry Warsaw)
Date: Tue, 8 Nov 2011 10:55:40 -0500
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org>
Message-ID: <20111108105540.6ca5d579@resist.wooz.org>

In general, one problem with abbreviations is that they are more difficult for non-native English speakers to understand and use. I've often heard such complaints from acquaintances for whom English is not their primary language. The other problem is that while *you* know what the 'q' stands for because you derived it from the underlying concept, someone who stumbles over it in the opposite direction will not know what it means. Maybe they'll search for it, but otherwise, it'll just be a meaningless combination of characters.

Python has always valued readability over writing convenience, and I think this is one of Guido's founding brilliant insights: code is read far more often than it is written. And yet, he managed to find elegant ways of expressing code clearly without being overly verbose.
For these reasons, I strongly believe that this attribute should not be abbreviated.

If the spelled out name is too long, find another one that conveys the same information in fewer characters. Several have been proposed and it's not hard to find others, e.g. __name_details__. (I personally do not have a problem with underscores separating the words. Again, it makes the name clearer, whereas run-together words can be difficult to scan.)

Cheers,
-Barry

From matt at whoosh.ca Tue Nov 8 16:57:45 2011
From: matt at whoosh.ca (Matt Chaput)
Date: Tue, 08 Nov 2011 10:57:45 -0500
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
In-Reply-To: <20111108105540.6ca5d579@resist.wooz.org>
References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111108105540.6ca5d579@resist.wooz.org>
Message-ID: <4EB95179.1030004@whoosh.ca>

On 08/11/2011 10:55 AM, Barry Warsaw wrote:
> If the spelled out name is too long, find another one that conveys the same
> information in fewer characters. Several have been proposed and it's not hard
> to find others, e.g. __name_details__.

Sorry I haven't been paying attention to this thread, but has __fullname__ been considered?

Cheers,

Matt

From solipsis at pitrou.net Tue Nov 8 17:14:14 2011
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Tue, 8 Nov 2011 17:14:14 +0100
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111108105540.6ca5d579@resist.wooz.org>
Message-ID: <20111108171414.2f222781@pitrou.net>

On Tue, 8 Nov 2011 10:55:40 -0500 Barry Warsaw wrote:
> For these reasons, I strongly believe that this attribute should not be
> abbreviated.
>
> If the spelled out name is too long, find another one that conveys the same
> information in fewer characters.

If we go that way, I'd still prefer __qualname__ (but I'm fine with __qname__ :-)).

Regards

Antoine.
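A minimal sketch of the semantics being named here, shown with the __qualname__ spelling (the exact rendering for nested functions was still being settled in the PEP):

    class C:
        def f(self):
            pass

    def g():
        def h():
            pass
        return h

    print(C.f.__name__)      # 'f'   -- no hint that f is defined inside C
    print(C.f.__qualname__)  # 'C.f' -- the dotted path from module level
    print(g().__qualname__)  # qualifies h relative to g, unlike __name__

Whatever the attribute ends up being called, it is this nesting information that __name__ and __module__ alone cannot provide.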
From ron3200 at gmail.com Tue Nov 8 17:56:13 2011
From: ron3200 at gmail.com (Ron Adam)
Date: Tue, 08 Nov 2011 10:56:13 -0600
Subject: [Python-ideas] Cofunctions - Getting away from the iterator protocol
In-Reply-To: References: <4EA8BD66.6010807@canterbury.ac.nz> <4EA9FED3.6050505@pearwood.info> <4EADBEA7.9000608@canterbury.ac.nz> <4EAE5F83.9040305@canterbury.ac.nz> <1320083850.5984.115.camel@Gutsy> <4EAF0DEE.1020500@canterbury.ac.nz> <1320106977.6637.50.camel@Gutsy> <1320123506.9456.57.camel@Gutsy> <1320250872.14285.65.camel@Gutsy> <4EB1F2BD.7070202@canterbury.ac.nz> <1320300930.10353.78.camel@Gutsy> <4EB5D043.8050106@canterbury.ac.nz> <1320564031.30197.100.camel@Gutsy> <1320730253.22449.39.camel@Gutsy> <1320740689.25015.101.camel@Gutsy>
Message-ID: <1320771373.26276.60.camel@Gutsy>

On Tue, 2011-11-08 at 20:43 +1000, Nick Coghlan wrote:
> On Tue, Nov 8, 2011 at 6:24 PM, Ron Adam wrote:
> > On Tue, 2011-11-08 at 15:46 +1000, Nick Coghlan wrote:
> >> No, that doesn't make any sense.
> >
> > Probably because I didn't explain it well enough.
>
> No, it doesn't make any sense because you're taking a symmetric
> coroutine concept and attempting to apply it to an asymmetric
> coroutine API design.

I'm not really thinking that way. I'm just looking at what I want to do, and what I'd like to have to do it nicely in Python.

> *If* this were a symmetric coroutine design (like greenlets), then
> *yes* it would make sense to offer resume() and throw() as your
> control flow APIs

Yes.

> (there wouldn't be a suspend() at all in a symmetric
> design, except perhaps as a convenience wrapper around invoking
> resume() on the coroutine that called resume() or throw() on you).

You lost me here with suspend() not being needed except as a wrapper around resume(). I think the reason you don't want to do that is because you may want to .resume() multiple sub-coroutines, but you only suspend() the one you're in.

The way I see it, coroutines are a bit like little boxes where there is a boundary we can't cross in the way we normally would with exceptions. Which is where the throw() method comes in. A throw keyword would go the other direction (out instead of in). It wouldn't raise the exception and let it bubble out as in the case of an unexpected error. It would handle expected exception conditions, so continuing from there does make sense.

My feeling is the reason we don't already have that has less to do with asymmetric/symmetric design principles, and more to do with not having a strong enough need (possibly until now) to justify a new keyword. An API design that doesn't require new keywords wouldn't have that limitation. So we could offer both and let the programmer design his coroutines as symmetrical or asymmetrical.

> With an asymmetric design, you only have 3 sensible options: terminate
> naturally, terminate with an exception or suspend execution. Look at
> the API design again - like the corresponding generator methods,
> cothread.resume() and cothread.throw() both use "returned normally" to
> indicate that the coroutine is suspended and still has more to do. If
> they raise an exception, it means the coroutine is done, either
> because it failed or because it finished normally (with the latter
> case being distinguished by a specific exception type, just as it is
> for generators). You're suggesting it would be reasonable to raise an
> exception *and* still expect the coroutine to eventually be resumed
> again.
> How is the caller of resume() or throw() meant to distinguish
> that new case from the two terminating cases?

"Asymmetric" and "symmetric" are descriptions of how it's used. We don't necessarily need to limit Python programmers to one or the other exclusively.

Let me come up with some working examples. I'm not too far from that now. Then we can discuss and experiment with it in a much more practical way.

Cheers,
Ron

From g.rodola at gmail.com Tue Nov 8 19:20:45 2011
From: g.rodola at gmail.com (Giampaolo Rodolà)
Date: Tue, 8 Nov 2011 19:20:45 +0100
Subject: [Python-ideas] shutil.tail(file, lines)
Message-ID:

This is something I need to do every once in a while and I think it would be a good addition to the shutil module.
Here:
http://stackoverflow.com/questions/136168/get-last-n-lines-of-a-file-with-python-similar-to-tail
...is a nice implementation which appears to be a good compromise in terms of speed and memory consumption (it reads the file in chunks, no more than 1024 bytes per read).

What do you think?

--- Giampaolo
http://code.google.com/p/pyftpdlib/
http://code.google.com/p/psutil/

From g.rodola at gmail.com Tue Nov 8 19:27:31 2011
From: g.rodola at gmail.com (Giampaolo Rodolà)
Date: Tue, 8 Nov 2011 19:27:31 +0100
Subject: [Python-ideas] shutil.tail(file, lines)
In-Reply-To: References: Message-ID:

Edit: after a better look, it seems the data gets accumulated in memory as it goes. We can see whether that is fixable somehow, though.

--- Giampaolo
http://code.google.com/p/pyftpdlib/
http://code.google.com/p/psutil/

2011/11/8 Giampaolo Rodolà :
> This is something I need to do every once in a while and I think it
> would be a good addition to the shutil module.
> Here:
> http://stackoverflow.com/questions/136168/get-last-n-lines-of-a-file-with-python-similar-to-tail
> ...is a nice implementation which appears to be a good compromise in
> terms of speed and memory consumption (it reads the file in chunks, no
> more than 1024 bytes per read).
>
> What do you think?

From jxo6948 at rit.edu Tue Nov 8 19:30:16 2011
From: jxo6948 at rit.edu (John O'Connor)
Date: Tue, 8 Nov 2011 13:30:16 -0500
Subject: [Python-ideas] shutil.tail(file, lines)
In-Reply-To: References: Message-ID:

There is also a more general solution: the reverse file iterator, started here but never finished: http://bugs.python.org/issue1677872

- John
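For concreteness, a rough bounded-memory sketch of the read-backwards-in-chunks idea (a hypothetical helper, not the StackOverflow recipe or the tracker patch; binary mode, and lines is assumed to be >= 1):

    def tail(path, lines, chunk_size=1024):
        """Return the last `lines` lines of the file at `path`, as bytes."""
        with open(path, 'rb') as f:  # binary mode sidesteps encoding issues
            f.seek(0, 2)             # 2 == os.SEEK_END
            end = f.tell()
            data = b''
            # Walk backwards one chunk at a time until enough newlines are
            # buffered (or the start of the file is reached), so the buffer
            # stays close to the size of the result.
            while end > 0 and data.count(b'\n') <= lines:
                start = max(0, end - chunk_size)
                f.seek(start)
                data = f.read(end - start) + data
                end = start
        return data.splitlines()[-lines:]

Unlike the accumulate-as-you-go variant Giampaolo describes above, this only ever holds the trailing chunks it still needs.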
From g.rodola at gmail.com Tue Nov 8 19:43:28 2011
From: g.rodola at gmail.com (Giampaolo Rodolà)
Date: Tue, 8 Nov 2011 19:43:28 +0100
Subject: [Python-ideas] shutil.tail(file, lines)
In-Reply-To: References: Message-ID:

2011/11/8 John O'Connor :
> There is also a more general solution: the reverse file iterator, started
> here but never finished: http://bugs.python.org/issue1677872
> - John

Oh, nice!
That would certainly be better.

--- Giampaolo
http://code.google.com/p/pyftpdlib/
http://code.google.com/p/psutil/

From anacrolix at gmail.com Tue Nov 8 22:24:26 2011
From: anacrolix at gmail.com (Matt Joiner)
Date: Wed, 9 Nov 2011 08:24:26 +1100
Subject: [Python-ideas] shutil.tail(file, lines)
In-Reply-To: References: Message-ID:

+1 This is an excellent idea. I reimplement this very often.

From anacrolix at gmail.com Tue Nov 8 22:26:13 2011
From: anacrolix at gmail.com (Matt Joiner)
Date: Wed, 9 Nov 2011 08:26:13 +1100
Subject: [Python-ideas] shutil.tail(file, lines)
In-Reply-To: References: Message-ID:

Addendum: I implement the form that continues tailing, waiting for new data. I think the naive form "last X lines" is a little trivial to put in the stdlib.

From ncoghlan at gmail.com Tue Nov 8 23:09:57 2011
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 9 Nov 2011 08:09:57 +1000
Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions
In-Reply-To: <4EB95179.1030004@whoosh.ca>
References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111108105540.6ca5d579@resist.wooz.org> <4EB95179.1030004@whoosh.ca>
Message-ID:
The problem with it is that it is wrong (since the full name would include the module info) -- Nick Coghlan (via Gmail on Android, so likely to be more terse than usual) On Nov 9, 2011 2:00 AM, "Matt Chaput" wrote: > On 08/11/2011 10:55 AM, Barry Warsaw wrote: > >> If the spelled out name is too long, find another one that conveys the >> same >> information in fewer characters. Several have been proposed and it's not >> hard >> to find others. E.g. __name_details__. >> > > Sorry I haven't been paying attention to this thread, but has __fullname__ > been considered? > > Cheers, > > Matt > > ______________________________**_________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/**mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From solipsis at pitrou.net Tue Nov 8 23:14:07 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 8 Nov 2011 23:14:07 +0100 Subject: [Python-ideas] shutil.tail(file, lines) References: Message-ID: <20111108231407.6ab1d503@pitrou.net> On Tue, 8 Nov 2011 19:20:45 +0100 Giampaolo Rodol? wrote: > This is something I need to do every once in a while and I think it > would be a good addition for shutil module. > Here: > http://stackoverflow.com/questions/136168/get-last-n-lines-of-a-file-with-python-similar-to-tail > ...is a nice implementation which appears to be a good compromise in > terms of speed and memory consumption (it reads the file in chunks, no > more than 1024 bytes per-read). Well, is it supposed to be a text file or a binary file? With a binary file the above approach is ok (you can use an adaptative average line length if you want to be smarter). With a text file and a generic encoding (possible weird or nasty) you have no other solution than reading the file from the start. Regards Antoine. From casevh at gmail.com Wed Nov 9 07:17:26 2011 From: casevh at gmail.com (Case Van Horsen) Date: Tue, 8 Nov 2011 22:17:26 -0800 Subject: [Python-ideas] Add additional special method lookups to math module Message-ID: Currently the functions round(), math.ceil(), math.floor(), and math.trunc() all check for the existence of a special method (__round__, __ceil__, __floor__, and __trunc__). Would it be possible to enhance the math and cmath modules to check for the existence of a special method for (almost) functions? For example, math.sin(obj) would first check for obj.__sin__. Rationale I'm in the final stages of adding support for the MPFR (multiple-precision floating point) and MPC (multiple-precision complex) libraries to the next-generation of gmpy, currently known as gmpy2. If the special method checks are added to the math and cmath modules, then the new mpfr() and mpc() types can easily substitute for the existing float/complex types in code that uses the math or cmath module. Thoughts? From jimjjewett at gmail.com Wed Nov 9 20:32:09 2011 From: jimjjewett at gmail.com (Jim Jewett) Date: Wed, 9 Nov 2011 14:32:09 -0500 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <20111108105540.6ca5d579@resist.wooz.org> References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111108105540.6ca5d579@resist.wooz.org> Message-ID: On 11/8/11, Barry Warsaw wrote: > ... one problem with abbreviations is that they are more difficult for > non-native English speakers to understand and use. qname seems to be well established for XML. 
Do you fear that the q doesn't look enough like a prefix, and they won't recognize it as a type of name, or only that they won't know what makes this type of name special?

> Python has always valued readability over writing convenience, and I think
> this is one of Guido's founding brilliant insights: code is read far more
> often than it is written. And yet, he managed to find elegant ways of
> expressing code clearly without being overly verbose.

Frankly, I wouldn't know precisely what a "qualified name" is, and I can only guess based on my painful experience with other systems -- of which XML is by far the least ugly. I'm not sure a standard abbreviation makes things any worse. ("Fully Qualified Name" isn't as bad, but clearly runs afoul of succinctness.)

-jJ

From jimjjewett at gmail.com Wed Nov 9 20:43:41 2011
From: jimjjewett at gmail.com (Jim Jewett)
Date: Wed, 9 Nov 2011 14:43:41 -0500
Subject: [Python-ideas] Add additional special method lookups to math module
In-Reply-To: References: Message-ID:

I like the idea, but there is a question of namespace size, particularly if there may be other meanings for some of the names. I would therefore prefer __math_sin__, __math_arctan__, etc. (Does it need to be even more specific than math?)

-jJ

On 11/9/11, Case Van Horsen wrote:
> Currently the functions round(), math.ceil(), math.floor(), and
> math.trunc() all check for the existence of a special method
> (__round__, __ceil__, __floor__, and __trunc__). Would it be possible
> to enhance the math and cmath modules to check for the existence of a
> special method for (almost) all functions? For example, math.sin(obj)
> would first check for obj.__sin__.

From jeremy at jeremysanders.net Wed Nov 9 21:06:46 2011
From: jeremy at jeremysanders.net (Jeremy Sanders)
Date: Wed, 09 Nov 2011 20:06:46 +0000
Subject: [Python-ideas] Add additional special method lookups to math module
References: Message-ID:

Case Van Horsen wrote:
> Currently the functions round(), math.ceil(), math.floor(), and
> math.trunc() all check for the existence of a special method
> (__round__, __ceil__, __floor__, and __trunc__). Would it be possible
> to enhance the math and cmath modules to check for the existence of a
> special method for (almost) all functions? For example, math.sin(obj)
> would first check for obj.__sin__.

Wouldn't duck typing the functions in math and cmath be equivalent, without slowing down normal users?

However, I'm not convinced that it's very nice changing the behaviour of built-in modules depending on the presence of another module anyway; I'd just supply another module with the same API - people could always mess around with the import path if they really want to override the default.
import math

def mysin(x):
    xxxx

math.sin = mysin

Jeremy From masklinn at masklinn.net Wed Nov 9 21:15:25 2011 From: masklinn at masklinn.net (Masklinn) Date: Wed, 9 Nov 2011 21:15:25 +0100 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: References: Message-ID: On 2011-11-09, at 21:06 , Jeremy Sanders wrote: > Case Van Horsen wrote: >> Currently the functions round(), math.ceil(), math.floor(), and >> math.trunc() all check for the existence of a special method >> (__round__, __ceil__, __floor__, and __trunc__). Would it be possible >> to enhance the math and cmath modules to check for the existence of a >> special method for (almost) all functions? For example, math.sin(obj) >> would first check for obj.__sin__. > > Wouldn't duct typing the functions in math and cmath be equivalent and > wouldn't slow down normal users? > > However, I'm not convinced that it's very nice changing the behaviour of > built-in modules depending on the presence of another module anyway, I'd just > supply another module with the same api - people could always mess around > with the import path if they really want to override the default.
>
> import math
>
> def mysin(x):
>     xxxx
>
> math.sin = mysin
>
> Jeremy
>
This would mean custom numerical types wouldn't be drop-in compatible with existing numerical *code*, which I am under the impression is what Case Van Horsen wants (and is a desirable outcome). Furthermore, your second paragraph is not correct: Case does not propose "changing the behavior of built-in modules depending on the presence of another module", Case proposes adding *method hooks* to existing math and cmath functions. These would be protocols allowing custom numerical types to implement `math`/`cmath`'s operations in a sensible manner, as is already possible for the four methods he mentions, as well as a number of other Python operations[0], including prominent numerical ones[1]. [0] http://docs.python.org/reference/datamodel.html [1] http://docs.python.org/reference/datamodel.html#emulating-numeric-types From jeremy at jeremysanders.net Wed Nov 9 21:32:20 2011 From: jeremy at jeremysanders.net (Jeremy Sanders) Date: Wed, 09 Nov 2011 20:32:20 +0000 Subject: [Python-ideas] Add additional special method lookups to math module References: Message-ID: Masklinn wrote: > This would mean custom numerical types wouldn't be drop-in compatible with > existing numerical *code*, which I am under the impression is what Case > Van Horsen wants (and is a desirable outcome). I think it looks too much like hidden magic to me. Explicit is better than implicit. If I were doing this, I wouldn't change the current behaviour by default. I'd include a function which would change the built-in functions, if required. Jeremy From g.brandl at gmx.net Wed Nov 9 21:35:23 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Wed, 09 Nov 2011 21:35:23 +0100 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: References: Message-ID: On 09.11.2011 07:17, Case Van Horsen wrote: > Currently the functions round(), math.ceil(), math.floor(), and > math.trunc() all check for the existence of a special method > (__round__, __ceil__, __floor__, and __trunc__). Would it be possible > to enhance the math and cmath modules to check for the existence of a > special method for (almost) all functions? For example, math.sin(obj) > would first check for obj.__sin__.
> > Rationale > > I'm in the final stages of adding support for the MPFR > (multiple-precision floating point) and MPC (multiple-precision > complex) libraries to the next-generation of gmpy, currently known as > gmpy2. If the special method checks are added to the math and cmath > modules, then the new mpfr() and mpc() types can easily substitute for > the existing float/complex types in code that uses the math or cmath > module. > > Thoughts? Just a data point: numpy's ufuncs like sin() look for an attribute of the same name ("sin") if you give them objects that aren't arrays or Python/numpy numbers. Georg From masklinn at masklinn.net Wed Nov 9 21:37:32 2011 From: masklinn at masklinn.net (Masklinn) Date: Wed, 9 Nov 2011 21:37:32 +0100 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: References: Message-ID: On 2011-11-09, at 21:32 , Jeremy Sanders wrote: > Masklinn wrote: >> This would mean custom numerical types wouldn't be drop-in compatible with >> existing numerical *code*, which I am under the impression is what Case >> Van Horsen wants (and is a desirable outcome). > > I think it looks too much like hidden magic to me. Explicit is better than > implicit. > It does not look any more like hidden magic than overriding "__add__" does. > If I were doing this, I wouldn't change the current behaviour by default. > I'd include a function which would change the built-in functions, if > required. That makes no sense. From arnodel at gmail.com Wed Nov 9 21:47:17 2011 From: arnodel at gmail.com (Arnaud Delobelle) Date: Wed, 9 Nov 2011 20:47:17 +0000 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: References: Message-ID: On 9 November 2011 06:17, Case Van Horsen wrote: > Currently the functions round(), math.ceil(), math.floor(), and > math.trunc() all check for the existence of a special method > (__round__, __ceil__, __floor__, and __trunc__). Would it be possible > to enhance the math and cmath modules to check for the existence of a > special method for (almost) functions? For example, math.sin(obj) > would first check for obj.__sin__. > > Rationale > > I'm in the final stages of adding support for the MPFR > (multiple-precision floating point) and MPC (multiple-precision > complex) libraries to the next-generation of gmpy, currently known as > gmpy2. If the special method checks are added to the math and cmath > modules, then the new mpfr() and mpc() types can easily substitute for > the existing float/complex types in code that uses the math or cmath > module. > > Thoughts? I have been faced with a very similar situation recently. I am adding Python scripting capability to a Dynamic Geometry application called GeoGebra (www.geogebra.org). GeoGebra has its own numeric types. I have wrapped them in Python classes so that all arithmetic operations work correctly on them but it would be a big improvement if the standard analytic functions in the math module could work on them as well. So this would be a welcome addition for me (although, as GeoGebra is a Java application, I am using Jython, so I would have to wait a while to see this coming my way :). 
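To make the problem concrete, here is a minimal mock-up of the situation (GNumber and its internals are invented for illustration - the real GeoGebra wrapper is more involved):

import math

class GNumber:
    # Toy stand-in for a wrapped GeoGebra numeric value.
    def __init__(self, value):
        self.value = value

    def __add__(self, other):
        if isinstance(other, GNumber):
            other = other.value
        return GNumber(self.value + other)

x = GNumber(0.5)
y = x + 1          # works: the arithmetic protocol consults __add__
z = math.sin(x)    # TypeError: math.sin has no protocol to consult

Unless I add a (lossy) __float__ method, the math module has no way in.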
-- Arnaud From p.f.moore at gmail.com Wed Nov 9 21:51:20 2011 From: p.f.moore at gmail.com (Paul Moore) Date: Wed, 9 Nov 2011 20:51:20 +0000 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: References: Message-ID: On 9 November 2011 20:15, Masklinn wrote: > This would mean custom numerical types wouldn't be drop-in compatible with existing numerical *code*, > which I am under the impression is what Case Van Horsen wants (and is a desirable outcome). > > Furthermore, your second paragraph is not correct: Case does not propose "changing the behavior of > built-in modules depending on the presence of another module", Case proposes adding *method hooks* > to existing math and cmath functions. These would be protocols allowing custom numerical types to > implement `math`/`cmath`'s operations in a sensible manner, as is already possible for the four methods he > mentions, as well as a number of other Python operations[0], including prominent numerical ones[1]. I like the idea of being able to extend the math functions to allow user-defined numeric types to behave like built-in ones. (In fact, if this were possible, complex could use it and a large proportion of the cmath module would be unneeded). However, I have to say that for me this is a purely theoretical issue - I've never had the need for the functionality myself in real code, and I doubt I ever will. Given the proliferation of special methods that would be needed, plus the runtime overhead of extra checking, I think the cost is likely too high. It's also worth noting the preamble text in the math module documentation - the functions there deliberately only handle floats, and are thin wrappers over the C standard library. To get any sort of generic behaviour, users need to use cmath, by design. So as stated the proposal is unlikely to get much support. Some alternative suggestions: 1. As already noted, you could include a gmpmath module with the relevant functions, and users could monkeypatch the math module if they wanted to do so. 2. To make such monkeypatching easier, publish a module that exposes a context manager to do the job:

    with monkeypatch(math, 'sin', gmpmath.sin):
        your code here...

3. Write a genericmath module that provides the "generic" versions you're proposing, and see how much use it gets - if it's sufficiently popular, you have a better case than for folding the functionality into at least cmath. 4. As Georg mentioned, numpy's ufuncs have a feature like this, so support that and people can use your types with numpy. That may be enough (depending on your expected user base). Overall, I think the idea of being able to use user-defined types interchangeably with built-in ones is a good one, but it's not something Python goes out of its way to support. If the idea of generic functions had taken off, then this is an area where they would have fit nicely - but once again, there wasn't much enthusiasm from the core developers for addressing the types of problem they were aimed at. Paul. From jeremy at jeremysanders.net Wed Nov 9 22:04:40 2011 From: jeremy at jeremysanders.net (Jeremy Sanders) Date: Wed, 09 Nov 2011 21:04:40 +0000 Subject: [Python-ideas] Add additional special method lookups to math module References: Message-ID: Jeremy Sanders wrote: > Wouldn't duct typing the functions in math and cmath be equivalent and > wouldn't slow down normal users? s/duct typing/monkey patching/ - I think I was thinking of animals and duck came into my mind rather than monkey.
Jeremy From jeremy at jeremysanders.net Wed Nov 9 22:10:19 2011 From: jeremy at jeremysanders.net (Jeremy Sanders) Date: Wed, 09 Nov 2011 21:10:19 +0000 Subject: [Python-ideas] Add additional special method lookups to math module References: Message-ID: Masklinn wrote: > That makes no sense. I think it does. I could write code relying on the math.XXX failing for types other than the built in numeric types, or for particular exceptions for particular calls. Also, if you override a standard module as a quick way to get existing code working, you can get lots of existing code paths which might fail in interesting ways, especially if there weren't unit tests for these cases. It would make more sense if the possibility was documented in the math documentation. If you go and change the behaviour of built-in functions just by importing a module it could lead to some weird bugs. I know monkey patching can do this currently, but doing it by default seems dangerous to me. Jeremy From masklinn at masklinn.net Wed Nov 9 22:32:07 2011 From: masklinn at masklinn.net (Masklinn) Date: Wed, 9 Nov 2011 22:32:07 +0100 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: References: Message-ID: <73289562-327E-4771-9D10-4D6ED251477E@masklinn.net> On 2011-11-09, at 22:10 , Jeremy Sanders wrote: > Masklinn wrote: >> That makes no sense. > I think it does. No, you're objecting to a local, scoped and type-based customization of operations (something already common in Python) by asking for a global functional reimplementation in user code instead. > I could write code relying on the math.XXX failing for > types other than the built in numeric types, or for particular exceptions > for particular calls. I fail to see how that's an argument against local customization and for global reimplementation. > Also, if you override a standard module as a quick way > to get existing code working, you can get lots of existing code paths which > might fail in interesting ways, especially if there weren't unit tests for > these cases. It would make more sense if the possibility was documented in > the math documentation. > > If you go and change the behaviour of built-in functions just by importing a > module it could lead to some weird bugs. I know monkey patching can do this > currently, but doing it by default seems dangerous to me. Nothing in these two paragraphs makes any sense in the context of the current discussion. *Nobody* (but you) argues for "changing the behavior of built-in functions just by importing a module", or for "overriding a standard module". I think you have misunderstood what Case Van Horsen is suggesting. From ncoghlan at gmail.com Wed Nov 9 22:33:04 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 10 Nov 2011 07:33:04 +1000 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: References: Message-ID: The big problem with generics in the past has been clearly articulating use cases. A genericmath and updated pprint could help a lot on that front. -- Nick Coghlan (via Gmail on Android, so likely to be more terse than usual) On Nov 10, 2011 6:52 AM, "Paul Moore" wrote: > On 9 November 2011 20:15, Masklinn wrote: > > This would mean custom numerical types wouldn't be drop-in compatible > with existing numerical *code*, > > which I am under the impression is what Case Van Horsen wants (and is a > desirable outcome). 
> > Furthermore, your second paragraph is not correct: Case does not propose "changing the behavior of > > built-in modules depending on the presence of another module", Case proposes adding *method hooks* > > to existing math and cmath functions. These would be protocols allowing custom numerical types to > > implement `math`/`cmath`'s operations in a sensible manner, as is already possible for the four methods he > > mentions, as well as a number of other Python operations[0], including prominent numerical ones[1]. > I like the idea of being able to extend the math functions to allow > user-defined numeric types to behave like built-in ones. (In fact, if > this were possible, complex could use it and a large proportion of the > cmath module would be unneeded). However, I have to say that for me > this is a purely theoretical issue - I've never had the need for the > functionality myself in real code, and I doubt I ever will. Given the > proliferation of special methods that would be needed, plus the > runtime overhead of extra checking, I think the cost is likely too > high. > > It's also worth noting the preamble text in the math module > documentation - the functions there deliberately only handle floats, > and are thin wrappers over the C standard library. To get any sort of > generic behaviour, users need to use cmath, by design. So as stated > the proposal is unlikely to get much support. > > Some alternative suggestions: > > 1. As already noted, you could include a gmpmath module with the > relevant functions, and users could monkeypatch the math module if > they wanted to do so. > > 2. To make such monkeypatching easier, publish a module that exposes a > context manager to do the job:
>     with monkeypatch(math, 'sin', gmpmath.sin):
>         your code here...
> > 3. Write a genericmath module that provides the "generic" versions > you're proposing, and see how much use it gets - if it's sufficiently > popular, you have a better case than for folding the functionality > into at least cmath. > > 4. As Georg mentioned, numpy's ufuncs have a feature like this, so > support that and people can use your types with numpy. That may be > enough (depending on your expected user base). > > Overall, I think the idea of being able to use user-defined types > interchangeably with built-in ones is a good one, but it's not > something Python goes out of its way to support. If the idea of > generic functions had taken off, then this is an area where they would > have fit nicely - but once again, there wasn't much enthusiasm from > the core developers for addressing the types of problem they were > aimed at. > > Paul. > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From casevh at gmail.com Wed Nov 9 22:42:26 2011 From: casevh at gmail.com (Case Van Horsen) Date: Wed, 9 Nov 2011 13:42:26 -0800 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: References: Message-ID: On Wed, Nov 9, 2011 at 11:43 AM, Jim Jewett wrote: > I like the idea, but there is a question of namespace size, > particularly if there may be other meanings for some of the names. > > I would therefore prefer __math_sin__, __math_arctan__, etc. (Does it > need to be even more specific than math?) > > -jJ > Good idea.
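In pure Python terms, the behaviour I'm after would be roughly this (a sketch only - the real change would live in mathmodule.c, and __math_sin__ is just the hypothetical hook name from above):

import math as _math

def sin(x):
    # Proposed lookup: prefer the type's hook if it defines one...
    hook = getattr(type(x), '__math_sin__', None)
    if hook is not None:
        result = hook(x)
        if result is not NotImplemented:
            return result
    # ...otherwise fall back to today's behaviour and coerce to float.
    return _math.sin(float(x))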
Creating a naming convention for functions that return real or complex values would be needed. math.sin(obj) would look for obj.__math_sin__. cmath.sin(obj) would look for obj.__cmath_sin__. gmpy2 supports integer, rational, real, and complex numbers. The integer, rational, and real objects would need both the __math_XXX__ and __cmath_XXX__ methods. The complex object would only support the __cmath_XXX__ methods. There is ambiguity with functions that take more than one argument: atan2, copysign, fmod, hypot, and pow. If only one of the arguments supports the special method, then that special method should be used. If both of the arguments support the special method AND both arguments are of the same type, then the special method should be used. But what if both arguments are different types? I think it should try the special method associated with the first argument and if that returns NotImplemented, then it should try the special method associated with the second argument. If all the special methods return NotImplemented, then the normal math/cmath handling should make a final attempt. casevh From barry at python.org Wed Nov 9 23:04:38 2011 From: barry at python.org (Barry Warsaw) Date: Wed, 9 Nov 2011 17:04:38 -0500 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111108105540.6ca5d579@resist.wooz.org> Message-ID: <20111109170438.0626d40b@limelight.wooz.org> On Nov 09, 2011, at 02:32 PM, Jim Jewett wrote: >qname seems to be well established for XML. Do you fear that the q >doesn't look enough like a prefix, and they won't recognize it as a >type of name, or only that they won't know what makes this type of >name special? I think it will be easy to misread __qname__ as __name__ and it won't be obviously clear what the difference between them is. >> Python has always valued readability over writing convenience, and I think >> this is one of Guido's founding brilliant insights: code is read far more >> often than it is written. And yet, he managed to find elegant ways of >> expressing code clearly without being overly verbose. > >Frankly, I wouldn't know precisely what a "qualified name" is, and I >can only guess based on my painful experience with other systems -- of >which XML is by far the least ugly. I'm not sure a standard >abbreviation makes things any worse. ("Fully Qualified Name" isn't as >bad, but clearly runs afoul of succinctness.) Isn't that a problem with the basic terminology then? If you don't know what a "qualified name" is you probably won't know what a "qname" is, and you definitely won't make that connection. I think that's more reason to find the right terminology and spell it out. -Barry From casevh at gmail.com Wed Nov 9 23:31:02 2011 From: casevh at gmail.com (Case Van Horsen) Date: Wed, 9 Nov 2011 14:31:02 -0800 Subject: [Python-ideas] Add additional special method lookups to math module Message-ID: Paul Moore wrote: > On 9 November 2011 20:15, Masklinn wrote: >> This would mean custom numerical types wouldn't be drop-in compatible with existing numerical *code*, >> which I am under the impression is what Case Van Horsen wants (and is a desirable outcome).
>> >> Furthermore, your second paragraph is not correct: Case does not propose >> "changing the behavior of >> built-in modules depending on the presence of another module", Case proposes adding *method hooks* >> to existing math and cmath functions. These would be protocols allowing custom numerical types to >> implement `math`/`cmath`'s operations in a sensible manner, as is already possible for the four methods he >> mentions, as well as a number of other Python operations[0], including prominent numerical ones[1]. Correct. If a type does not implement one of the special methods, then there would be no change to the behaviour of the math/cmath modules. > I like the idea of being able to extend the math functions to allow > user-defined numeric types to behave like built-in ones. (In fact, if > this were possible, complex could use it and a large proportion of the > cmath module would be unneeded). However, I have to say that for me > this is a purely theoretical issue - I've never had the need for the > functionality myself in real code, and I doubt I ever will. Given the > proliferation of special methods that would be needed, plus the > runtime overhead of extra checking, I think the cost is likely too > high. > > It's also worth noting the preamble text in the math module > documentation - the functions there deliberately only handle floats, > and are thin wrappers over the C standard library. To get any sort of > generic behaviour, users need to use cmath, by design. So as stated > the proposal is unlikely to get much support. > > Some alternative suggestions: > > 1. As already noted, you could include a gmpmath module with the > relevant functions, and users could monkeypatch the math module if > they wanted to do so. The gmpy2 module provides a superset of the functions in math/cmath. I'm trying to avoid name collisions if someone replaces "from math import *" with "from gmpy2 import *". I know they shouldn't do that.... > 2. To make such monkeypatching easier, publish a module that exposes a > context manager to do the job:
>     with monkeypatch(math, 'sin', gmpmath.sin):
>         your code here...
> > 3. Write a genericmath module that provides the "generic" versions > you're proposing, and see how much use it gets - if it's sufficiently > popular, you have a better case than for folding the functionality > into at least cmath. I'll try to put a prototype together. One issue with merging math and cmath is that math functions raise an exception instead of returning a complex result. If it becomes a single module, should it return complex values instead of raising an exception? That could be a major change in someone's code. (gmpy2 uses a context manager to enable/disable the return of a complex result from a real function.) > > 4. As Georg mentioned, numpy's ufuncs have a feature like this, so > support that and people can use your types with numpy. That may be > enough (depending on your expected user base). I will look at numpy's ufuncs. > Overall, I think the idea of being able to use user-defined types > interchangeably with built-in ones is a good one, but it's not > something Python goes out of its way to support. If the idea of > generic functions had taken off, then this is an area where they would > have fit nicely - but once again, there wasn't much enthusiasm from > the core developers for addressing the types of problem they were > aimed at. > > Paul.
> casevh From steve at pearwood.info Thu Nov 10 00:02:29 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Thu, 10 Nov 2011 10:02:29 +1100 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: References: Message-ID: <4EBB0685.4030601@pearwood.info> Case Van Horsen wrote: > Currently the functions round(), math.ceil(), math.floor(), and > math.trunc() all check for the existence of a special method > (__round__, __ceil__, __floor__, and __trunc__). Would it be possible > to enhance the math and cmath modules to check for the existence of a > special method for (almost) functions? For example, math.sin(obj) > would first check for obj.__sin__. I would not object to this. The only function I can honestly say I have had a concrete use-case for is math.sqrt. This comes up from time to time, e.g.: http://bytes.com/topic/python/answers/463861-how-overload-sqrt-module http://permalink.gmane.org/gmane.comp.python.general/694849 However, how far should we go? Does every function in the math module require a dunder method, e.g. __degrees__ ? What happens if we add more functions, say math.bessel? Do we really expect that all numeric types must support a __bessel__ method? I suspect that this proposal is actually bigger than it seems at first glance. We can: * Do nothing (the status quo). If you write a numeric type, you can support a small number of mathematical operations, such as + and math.floor, but not others, such as math.sqrt or math.sin. * Officially recommend that people monkey-patch the math module if they want to write a drop-in replacement for numeric types. I consider this unspeakable, but mention it for completeness since others have raised the possibility. * Add support for dunder methods in an ad hoc manner, when and as requested, without making any promises about any other functions. * Add support for dunder methods in a systematic way. This would require distinguishing between fundamental operations that should support dunder methods, and those that shouldn't (if any). This will probably need a PEP. * Instead of having to decide what operations should be supported ahead of time, perhaps there is a way for types to register themselves with the math module, e.g. say "I support sin, but not sinh". Somewhat akin to the way ABCs work, at least conceptually. One advantage of that may be that numeric classes won't have to use dunder methods for supporting the math module, e.g. MyNumber might register sin rather than __sin__. -- Steven From ncoghlan at gmail.com Thu Nov 10 01:24:18 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 10 Nov 2011 10:24:18 +1000 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <20111109170438.0626d40b@limelight.wooz.org> References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111108105540.6ca5d579@resist.wooz.org> <20111109170438.0626d40b@limelight.wooz.org> Message-ID: On Thu, Nov 10, 2011 at 8:04 AM, Barry Warsaw wrote: > Isn't that a problem with the basic terminology then? ?If you don't know what > a "qualified name" is you probably won't know what a "qname" is, and you > definitely won't make that connection. ?I think that's more reason to find the > right terminology and spell it out. There may not *be* 'right terminology' for what a qname is going to be, since we're basically inventing the concept to cover some gaps in the meaning of __name__. 
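To make those gaps concrete (everything below is current semantics - only the comments describe what the new attribute would add):

class C:
    class D:
        pass

def f():
    def g():
        pass
    return g

print(C.D.__name__)    # 'D' - the containing class C is invisible
print(f().__name__)    # 'g' - no sign that it was defined inside f

The proposed attribute would preserve that missing context, along the lines of 'C.D' and 'f.g', while still leaving the module information on __module__.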
I'm setting the mnemonic bar at "once you know what a Python qname is, is the name suggestive enough to let you remember it?", not at the much higher "if you encounter the Python qname term without knowing what it means, are you going to be able to guess correctly without looking it up?". I'm also considering the reverse problem of "if you encounter the Python qname term without knowing what it means, are you going to *think* you know what it means without looking it up and be incorrect?". For me, it's that last point that rules out ideas like "full name" and "fully qualified" name - the proposed names deliberately *aren't* fully qualified in the class and function cases (since they still leave out the module information, storing that on a separate attribute). For the former considerations, I think the differences between existing __name__ attributes and the new attributes are too subtle and obscure to ever allow completely intuitive terminology. For classes and functions, __name__ refers to the name given to the relevant constructor (either directly or syntactically). This is insufficient to correctly identify classes and functions that are not defined at the top level in their respective modules, so the new attribute addresses that. For modules, __name__ normally refers to the name that was used to import the module, *unless* it has been altered for some reason (either to mark the main module or else to redirect serialisation operations to a public API instead of an implementation module). The new attribute then allows those alterations of __name__ to take place without losing the information about which module was actually imported to create the module namespace. "qname" and "qualified name" are nice, neutral terminology that suggest something closely related to "name" without being identical. In all proposed use cases, they preserve information that would otherwise have been lost. That said, in earlier drafts of PEP 395 (module aliasing), I did suggest the term "implementation name". As in the current PEP, "__name__" would retain its current semantics, while "__impl_name__" would be what was actually imported when creating the module. Logic that needed to figure out an object's full name would then look something like:

    mod_name = obj.__module__
    if mod_name == "__main__":
        mod_name = sys.modules[mod_name].__impl_name__
    full_name = mod_name + "." + obj.__impl_name__

Introspection logic would use either or both, depending on the context (e.g. pydoc would likely favour using __name__, while inspect would prefer __impl_name__ for things like getsource()). It seems to me that "implementation name" would work as a term for Antoine's PEP as well, since it's about providing enough information to locate the actual implementation of the class or function relative to the top level of the module. The term "implementation name" is also a bit more suggestive of the intended purpose of these attributes than the completely neutral "qualified name". That means my current order of naming preference is: - "implementation name" & "__impl_name__" - "qualified name" & "__qname__" - "qualified name" & "__qualname__" - "qualified name" & "__qual_name__" Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com |
Brisbane, Australia From greg.ewing at canterbury.ac.nz Thu Nov 10 01:29:13 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 10 Nov 2011 13:29:13 +1300 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: <4EBB0685.4030601@pearwood.info> References: <4EBB0685.4030601@pearwood.info> Message-ID: <4EBB1AD9.5080907@canterbury.ac.nz> On 10/11/11 12:02, Steven D'Aprano wrote: > However, how far should we go? Does every function in the math module require > a dunder method, e.g. __degrees__ ? What happens if we add more functions, say > math.bessel? Another approach to all this would be to provide generic implementations of the math functions in terms of the basic arithmetic operations. While this wouldn't always be as fast as using type-specific methods, it would at least work with all number-like types and would extend easily to any new functions we might add without having to update every existing numeric type. It might even be possible to support type-specific accelerations of these generic implementations using a __polynomial__ method that takes a list of coefficients, or maybe a function for generating the coefficients. -- Greg From casevh at gmail.com Thu Nov 10 01:55:16 2011 From: casevh at gmail.com (Case Van Horsen) Date: Wed, 9 Nov 2011 16:55:16 -0800 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: <4EBB0685.4030601@pearwood.info> References: <4EBB0685.4030601@pearwood.info> Message-ID: On Wed, Nov 9, 2011 at 3:02 PM, Steven D'Aprano wrote: > Case Van Horsen wrote: >> >> Currently the functions round(), math.ceil(), math.floor(), and >> math.trunc() all check for the existence of a special method >> (__round__, __ceil__, __floor__, and __trunc__). Would it be possible >> to enhance the math and cmath modules to check for the existence of a >> special method for (almost) all functions? For example, math.sin(obj) >> would first check for obj.__sin__. > > I would not object to this. > > The only function I can honestly say I have had a concrete use-case for is > math.sqrt. This comes up from time to time, e.g.: > > http://bytes.com/topic/python/answers/463861-how-overload-sqrt-module > http://permalink.gmane.org/gmane.comp.python.general/694849 > > However, how far should we go? Does every function in the math module > require a dunder method, e.g. __degrees__ ? What happens if we add more > functions, say math.bessel? Do we really expect that all numeric types must > support a __bessel__ method? I suspect that this proposal is actually bigger > than it seems at first glance. It would be completely optional for a numeric type to support these methods. If they're not supported, the numeric type is converted to a float and then the math function proceeds as it currently does. > > We can: > > * Do nothing (the status quo). If you write a numeric type, you can support > a small number of mathematical operations, such as + and math.floor, but not > others, such as math.sqrt or math.sin. > > * Officially recommend that people monkey-patch the math module if they want > to write a drop-in replacement for numeric types. I consider this > unspeakable, but mention it for completeness since others have raised the > possibility. > > * Add support for dunder methods in an ad hoc manner, when and as requested, > without making any promises about any other functions. I hacked the mathmodule.c FUNC1 macro to perform the lookup for many of the math module functions.
It was only about 15 lines of code (but it doesn't check if NotImplemented is returned.) Unfortunately, it increases the running time of math.sin, for example, by 15%. I need to look at the Identifier API to see if that helps the performance but I don't think 15% is a penalty everyone should pay. > > * Add support for dunder methods in a systematic way. This would require > distinguishing between fundamental operations that should support dunder > methods, and those that shouldn't (if any). This will probably need a PEP. > Especially true if new nb_slots need to be created to avoid the performance impact. I was hoping the performance impact of the special method lookup was negligible so it could be a low impact change. > * Instead of having to decide what operations should be supported ahead of > time, perhaps there is a way for types to register themselves with the math > module, e.g. say "I support sin, but not sinh". Somewhat akin to the way > ABCs work, at least conceptually. One advantage of that may be that numeric > classes won't have to use dunder methods for supporting the math module, > e.g. MyNumber might register sin rather than __sin__. As long as they support __float__, they'll continue to work with the math module. > > > -- > Steven > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From solipsis at pitrou.net Thu Nov 10 02:00:42 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Thu, 10 Nov 2011 02:00:42 +0100 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111108105540.6ca5d579@resist.wooz.org> <20111109170438.0626d40b@limelight.wooz.org> Message-ID: <20111110020042.20580ba3@pitrou.net> On Thu, 10 Nov 2011 10:24:18 +1000 Nick Coghlan wrote: > > It seems to me that "implementation name" would work as a term for > Antoine's PEP as well, since it's about providing enough information > to locate the actual implementation of the class or function relative > to the top level of the module. I don't really follow this reasoning. There is no other object than the "actual implementation"; __name__ and __q[ual]name__ denote the same thing. "Implementation name" sounds like a misnomer to me. If __qname__ is too elliptic, let's settle on __qualname__? Regards Antoine. From solipsis at pitrou.net Thu Nov 10 02:01:52 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Thu, 10 Nov 2011 02:01:52 +0100 Subject: [Python-ideas] Add additional special method lookups to math module References: <4EBB0685.4030601@pearwood.info> Message-ID: <20111110020152.55c39c18@pitrou.net> On Wed, 9 Nov 2011 16:55:16 -0800 Case Van Horsen wrote: > On Wed, Nov 9, 2011 at 3:02 PM, Steven D'Aprano wrote: > > Case Van Horsen wrote: > >> > >> Currently the functions round(), math.ceil(), math.floor(), and > >> math.trunc() all check for the existence of a special method > >> (__round__, __ceil__, __floor__, and __trunc__). Would it be possible > >> to enhance the math and cmath modules to check for the existence of a > >> special method for (almost) functions? For example, math.sin(obj) > >> would first check for obj.__sin__. > > > > I would not object to this. > > > > The only function I can honestly say I have had a concrete use-case for is > > math.sqrt. 
This comes up from time to time, e.g.: > > > > http://bytes.com/topic/python/answers/463861-how-overload-sqrt-module > > http://permalink.gmane.org/gmane.comp.python.general/694849 > > > > However, how far should we go? Does every function in the math module > > require a dunder method, e.g. __degrees__ ? What happens if we add more > > functions, say math.bessel? Do we really expect that all numeric types must > > support a __bessel__ method? I suspect that this proposal is actually bigger > > than it seems at first glance. > > It would be completely optional for a numeric type to support these > methods. If they're not supported, the numeric type is converted to a > float and then math.function proceeds as it currently does. > > > > > We can: > > > > * Do nothing (the status quo). If you write a numeric type, you can support > > a small number of mathematical operations, such as + and math.floor, but not > > others, such as math.sqrt or math.sin. > > > > * Officially recommend that people monkey-patch the math module if they want > > to write a drop-in replacement for numeric types. I consider this > > unspeakable, but mention it for completeness since others have raised the > > possibility. > > > > * Add support for dunder methods in an ad hoc manner, when and as requested, > > without making any promises about any other functions. > > I hacked mathmodule.c FUNC1 macro to perform the lookup the many of > math module functions. It was only about 15 lines of code (but it > doesn't check if NotImplemented is returned.) Unfortunately, it > increases the running time of math.sin, for example, by 15%. I need to > look at the Identifier API to see if that helps the performance but I > don't think 15% is a penalty everyone should pay. You can special-case floats using PyFloat_CheckExact and see if that helps. Regards Antoine. From tjreedy at udel.edu Thu Nov 10 02:32:56 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 09 Nov 2011 20:32:56 -0500 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: <4EBB1AD9.5080907@canterbury.ac.nz> References: <4EBB0685.4030601@pearwood.info> <4EBB1AD9.5080907@canterbury.ac.nz> Message-ID: On 11/9/2011 7:29 PM, Greg Ewing wrote: > On 10/11/11 12:02, Steven D'Aprano wrote: > >> However, how far should we go? Does every function in the math module >> require >> a dunder method, e.g. __degrees__ ? What happens if we add more >> functions, say >> math.bessel? > > Another approach to all this would be to provide generic implementations > of the math functions in terms of the basic arithmetic operations. Perhaps in a gmath (generic math) module. > While > this wouldn't always be as fast as using type-specific methods, it would > at least work with all number-like types and would extend easily to any > new functions we might add without having to update every existing > numeric type. > > It might even be possible to support type-specific accelerations of these > generic implementations using a __polynomial__ method that takes a list > of coefficients, or maybe a function for generating the coefficients. 
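For instance, a generic square root needs nothing beyond arithmetic and comparison, so a naive sketch (gsqrt is an invented name; exact types such as Fraction would additionally need a tolerance argument to terminate in reasonable time) could be:

def gsqrt(x):
    # Newton's method using only the type's own +, / and == operations.
    # Assumes x >= 0; works as-is for float and Decimal.
    if not x:
        return x
    guess = x
    for _ in range(50):               # crude cap instead of a tolerance
        better = (guess + x / guess) / 2
        if better == guess:           # fixed point reached
            return better
        guess = better
    return guess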
-- Terry Jan Reedy From ncoghlan at gmail.com Thu Nov 10 02:38:31 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 10 Nov 2011 11:38:31 +1000 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <20111110020042.20580ba3@pitrou.net> References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111108105540.6ca5d579@resist.wooz.org> <20111109170438.0626d40b@limelight.wooz.org> <20111110020042.20580ba3@pitrou.net> Message-ID: On Thu, Nov 10, 2011 at 11:00 AM, Antoine Pitrou wrote: > On Thu, 10 Nov 2011 10:24:18 +1000 > Nick Coghlan wrote: >> >> It seems to me that "implementation name" would work as a term for >> Antoine's PEP as well, since it's about providing enough information >> to locate the actual implementation of the class or function relative >> to the top level of the module. > > I don't really follow this reasoning. There is no other object than the > "actual implementation"; __name__ and __q[ual]name__ denote the same > thing. "Implementation name" sounds like a misnomer to me. Yeah, on further reflection, I agree that the connotations suggesting a separate implementation object may be too strong for that to be a reasonable term. That's probably why I dropped it in the first place. > If __qname__ is too elliptic, let's settle on __qualname__? "q name" is easy to say, "qual name" is relatively hard to say - if we're going to abbreviate, it should be to something pronounceable. I'd say try to summarise this naming discussion into the PEP, but otherwise stick to the "qualified name" and "__qname__" proposal (unless/until Guido says otherwise). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From casevh at gmail.com Thu Nov 10 02:39:37 2011 From: casevh at gmail.com (Case Van Horsen) Date: Wed, 9 Nov 2011 17:39:37 -0800 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: <20111110020152.55c39c18@pitrou.net> References: <4EBB0685.4030601@pearwood.info> <20111110020152.55c39c18@pitrou.net> Message-ID: On Wed, Nov 9, 2011 at 5:01 PM, Antoine Pitrou wrote: > On Wed, 9 Nov 2011 16:55:16 -0800 > Case Van Horsen wrote: > >> On Wed, Nov 9, 2011 at 3:02 PM, Steven D'Aprano wrote: >> > Case Van Horsen wrote: >> >> >> >> Currently the functions round(), math.ceil(), math.floor(), and >> >> math.trunc() all check for the existence of a special method >> >> (__round__, __ceil__, __floor__, and __trunc__). Would it be possible >> >> to enhance the math and cmath modules to check for the existence of a >> >> special method for (almost) all functions? For example, math.sin(obj) >> >> would first check for obj.__sin__. >> > >> > I would not object to this. >> > >> > The only function I can honestly say I have had a concrete use-case for is >> > math.sqrt. This comes up from time to time, e.g.: >> > >> > http://bytes.com/topic/python/answers/463861-how-overload-sqrt-module >> > http://permalink.gmane.org/gmane.comp.python.general/694849 >> > >> > However, how far should we go? Does every function in the math module >> > require a dunder method, e.g. __degrees__ ? What happens if we add more >> > functions, say math.bessel? Do we really expect that all numeric types must >> > support a __bessel__ method? I suspect that this proposal is actually bigger >> > than it seems at first glance. >> >> It would be completely optional for a numeric type to support these >> methods.
>> If they're not supported, the numeric type is converted to a >> float and then the math function proceeds as it currently does. >> >> > >> > We can: >> > >> > * Do nothing (the status quo). If you write a numeric type, you can support >> > a small number of mathematical operations, such as + and math.floor, but not >> > others, such as math.sqrt or math.sin. >> > >> > * Officially recommend that people monkey-patch the math module if they want >> > to write a drop-in replacement for numeric types. I consider this >> > unspeakable, but mention it for completeness since others have raised the >> > possibility. >> > >> > * Add support for dunder methods in an ad hoc manner, when and as requested, >> > without making any promises about any other functions. >> >> I hacked the mathmodule.c FUNC1 macro to perform the lookup for many of >> the math module functions. It was only about 15 lines of code (but it >> doesn't check if NotImplemented is returned.) Unfortunately, it >> increases the running time of math.sin, for example, by 15%. I need to >> look at the Identifier API to see if that helps the performance but I >> don't think 15% is a penalty everyone should pay. > > You can special-case floats using PyFloat_CheckExact and see if that > helps. Thanks. That decreases the penalty to about 1.5%. I'll finish hacking mathmodule.c and gmpy2 and run some additional tests. casevh > > Regards > > Antoine. > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From solipsis at pitrou.net Thu Nov 10 02:36:49 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Thu, 10 Nov 2011 02:36:49 +0100 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111108105540.6ca5d579@resist.wooz.org> <20111109170438.0626d40b@limelight.wooz.org> <20111110020042.20580ba3@pitrou.net> Message-ID: <1320889009.3370.8.camel@localhost.localdomain> On Thursday 10 November 2011 at 11:38 +1000, Nick Coghlan wrote: > On Thu, Nov 10, 2011 at 11:00 AM, Antoine Pitrou wrote: > > On Thu, 10 Nov 2011 10:24:18 +1000 > > Nick Coghlan wrote: > >> > >> It seems to me that "implementation name" would work as a term for > >> Antoine's PEP as well, since it's about providing enough information > >> to locate the actual implementation of the class or function relative > >> to the top level of the module. > > > > I don't really follow this reasoning. There is no other object than the > > "actual implementation"; __name__ and __q[ual]name__ denote the same > > thing. "Implementation name" sounds like a misnomer to me. > > Yeah, on further reflection, I agree that the connotations suggesting > a separate implementation object may be too strong for that to be a > reasonable term. That's probably why I dropped it in the first place. > > > If __qname__ is too elliptic, let's settle on __qualname__? > > "q name" is easy to say, "qual name" is relatively hard to say - if > we're going to abbreviate, it should be to something pronounceable. This must depend where you come from. I have no problem pronouncing "qualname" with a gross French accent, and I'm sure Victor would understand me :-) Regards Antoine.
From jimjjewett at gmail.com Thu Nov 10 03:17:53 2011 From: jimjjewett at gmail.com (Jim Jewett) Date: Wed, 9 Nov 2011 21:17:53 -0500 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: <73289562-327E-4771-9D10-4D6ED251477E@masklinn.net> References: <73289562-327E-4771-9D10-4D6ED251477E@masklinn.net> Message-ID: On 11/9/11, Masklinn wrote: > On 2011-11-09, at 22:10 , Jeremy Sanders wrote: >> Masklinn wrote: >> If you go and change the behaviour of built-in functions just by importing >> a module it could lead to some weird bugs. I know monkey patching can >> do this currently, but doing it by default seems dangerous to me. > Nothing in these two paragraphs makes any sense in the context of the > current discussion. *Nobody* (but you) argues for "changing the behavior of > built-in functions just by importing a module", or for "overriding a > standard module". Several people have suggested that the standard library module not be changed, but that users be encouraged to monkeypatch instead. In other words, if I import funkymathtype, it should (as part of the import process) monkeypatch math (and cmath) to replace the builtin functions with something that handles its own types as well. Short of generic functions, I happen to think that is a bad idea, but it was suggested. And frankly, "import funkymathtype.math as math" probably is the right answer, in the short run, but that is still pretty fragile against other modules having imported math themselves. -jJ -jJ From jimjjewett at gmail.com Thu Nov 10 03:45:38 2011 From: jimjjewett at gmail.com (Jim Jewett) Date: Wed, 9 Nov 2011 21:45:38 -0500 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111108105540.6ca5d579@resist.wooz.org> <20111109170438.0626d40b@limelight.wooz.org> Message-ID: On 11/9/11, Nick Coghlan wrote: > On Thu, Nov 10, 2011 at 8:04 AM, Barry Warsaw wrote: >> Isn't that a problem with the basic terminology then? If you don't know >> what a "qualified name" is you probably won't know what a "qname" is, >> and you definitely won't make that connection. I think that's more >> reason to find the right terminology and spell it out. I know that a "qualified name" is a special kind of name, and that if I have to worry about it, then I should expect subtle bugs with either false matches, false mismatches, or both. I cannot imagine a more specific definition that is both true and sufficiently simple that I don't have to look it up. But frankly, that seems to be true of the proposed names as well (usually but not always equal to __name__, often but not always sufficient to figure out where the object was defined, etc). The only cognitive grief is about figuring out when I *do* need to use a qname instead of a regular name; I don't see a rule of thumb like "Handles user-generated strings" or "Multiple namespaces are being used". > There may not *be* 'right terminology' for what a qname is going to > be, since we're basically inventing the concept to cover some gaps in > the meaning of __name__. And so I suspect there isn't a simple term. (There might be several simple terms that each describe some but not all of the use cases.) 
> I'm setting the mnemonic bar at "once you > know what a Python qname is, is the name suggestive enough to let you > remember it?", not at the much higher "if you encounter the Python > qname term without knowing what it means, are you going to be able to > guess correctly without looking it up?". I would say that all of __qname__, __q_name__, __qualname__, __qualifiedname__, etc meet that bar, as do __extendedname__ or __impl_name__. > I'm also considering the > reverse problem of "if you encounter the Python qname term without > knowing what it means, are you going to *think* you know what it means > without looking it up and be incorrect?". I wouldn't, but I'm probably a bad judge on that. -jJ From greg.ewing at canterbury.ac.nz Thu Nov 10 04:11:23 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 10 Nov 2011 16:11:23 +1300 Subject: [Python-ideas] Add additional special method lookups to math module In-Reply-To: References: <73289562-327E-4771-9D10-4D6ED251477E@masklinn.net> Message-ID: <4EBB40DB.5070700@canterbury.ac.nz> On 10/11/11 15:17, Jim Jewett wrote: > And frankly, "import funkymathtype.math as math" probably is the right > answer, in the short run, but that is still pretty fragile against > other modules having imported math themselves. The monkeypatching idea doesn't entirely avoid that problem either, since you would have to make sure the module doing the monkeypatching was imported before any other module imported stuff from math. -- Greg From ncoghlan at gmail.com Thu Nov 10 04:22:27 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 10 Nov 2011 13:22:27 +1000 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <1320889009.3370.8.camel@localhost.localdomain> References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111108105540.6ca5d579@resist.wooz.org> <20111109170438.0626d40b@limelight.wooz.org> <20111110020042.20580ba3@pitrou.net> <1320889009.3370.8.camel@localhost.localdomain> Message-ID: On Thu, Nov 10, 2011 at 11:36 AM, Antoine Pitrou wrote: >> "q name" is easy to say, "qual name" is relatively hard to say - if >> we're going to abbreviate, it should be to something pronounceable. > > This must depend where you come from. I have no problem pronouncing > "qualname" with a gross French accent, and I'm sure Victor would > understand me :-) It isn't so much a matter of finding qualname hard to say, as finding qname really easy to say, so I don't actually mind if you do decide to change it in the PEP :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From anacrolix at gmail.com Thu Nov 10 09:10:04 2011 From: anacrolix at gmail.com (Matt Joiner) Date: Thu, 10 Nov 2011 19:10:04 +1100 Subject: [Python-ideas] try-else without except or finally Message-ID: Hi Ideas, I frequently find myself in the following situations: 1) I wish to do something if no exception is thrown, for instance:

try:
    logger.debug('Fiber starting: %s', self)
    try:
        self._result = self._routine()
    finally:
        logger.debug('Fiber finished: %s', self)
except:
    raise
else:
    raise FiberExit(self)
finally:
    self._finished.set()
    unregister_fiber(self)

Here it's sufficient that if an exception is already present, I don't need to raise another. The except clause is clearly pointless. 2) I'm experimenting with catching various exceptions and removing the last except clause. I need to put a 'finally: pass' to avoid having to restructure all my code, since this is currently the only way to maintain the try-except-else-finally statement without catching dummy exceptions. I propose that the except clause be optional. Cheers, Matt From rob.cliffe at btinternet.com Thu Nov 10 09:38:14 2011 From: rob.cliffe at btinternet.com (Rob Cliffe) Date: Thu, 10 Nov 2011 08:38:14 +0000 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: Message-ID: <4EBB8D76.1080903@btinternet.com> Omitting the except clause would make the code more obscure. 'else' means the second of two alternatives. Omitting the first alternative altogether obfuscates the code and makes it look as if the first alternative is the try block, i.e. it suggests if anything that the code following 'else:' is executed when there is an exception rather than when there isn't. -1 from me Rob Cliffe On 10/11/2011 08:10, Matt Joiner wrote: > Hi Ideas, > > I frequently find myself in the following situations: > > 1) I wish to do something if no exception is thrown, for instance:
>
> try:
>     logger.debug('Fiber starting: %s', self)
>     try:
>         self._result = self._routine()
>     finally:
>         logger.debug('Fiber finished: %s', self)
> except:
>     raise
> else:
>     raise FiberExit(self)
> finally:
>     self._finished.set()
>     unregister_fiber(self)
>
> Here it's sufficient that if an exception is already present, I > don't need to raise another. The except clause is clearly pointless. > 2) I'm experimenting with catching various exceptions and removing the > last except clause. I need to put a 'finally: pass' to avoid having to > restructure all my code, since this is currently the only way to > maintain the try-except-else-finally statement without catching dummy > exceptions. > > I propose that the except clause be optional. > > Cheers, > Matt > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From dennis.kaarsemaker at booking.com Thu Nov 10 10:12:59 2011 From: dennis.kaarsemaker at booking.com (Dennis Kaarsemaker) Date: Thu, 10 Nov 2011 10:12:59 +0100 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: Message-ID: <1320916379.10012.69.camel@seahawk> On Thu, 2011-11-10 at 19:10 +1100, Matt Joiner wrote: > Hi Ideas, > > I frequently find myself in the following situations: > > 1) I wish to do something if no exception is thrown, for instance: How is this...
> try:
>     logger.debug('Fiber starting: %s', self)
>     try:
>         self._result = self._routine()
>     finally:
>         logger.debug('Fiber finished: %s', self)
> except:
>     raise
> else:
>     raise FiberExit(self)
> finally:
>     self._finished.set()
>     unregister_fiber(self)
different from this...

try:
    logger.debug('Fiber starting: %s', self)
    try:
        self._result = self._routine()
    finally:
        logger.debug('Fiber finished: %s', self)
    raise FiberExit(self)
finally:
    self._finished.set()
    unregister_fiber(self)

That should do exactly what you want, shouldn't it?
-- 
Dennis Kaarsemaker
Senior Unix System Administrator
Amsterdam, The Netherlands
dennis.kaarsemaker at booking.com
http://www.booking.com
tel external +3120715 3409
tel internal (7207)3409

From p.f.moore at gmail.com Thu Nov 10 12:23:01 2011
From: p.f.moore at gmail.com (Paul Moore)
Date: Thu, 10 Nov 2011 11:23:01 +0000
Subject: [Python-ideas] Add additional special method lookups to math module
In-Reply-To: <4EBB0685.4030601@pearwood.info>
References: <4EBB0685.4030601@pearwood.info>
Message-ID: 

On 9 November 2011 23:02, Steven D'Aprano wrote:
> * Instead of having to decide what operations should be supported ahead of
> time, perhaps there is a way for types to register themselves with the math
> module, e.g. say "I support sin, but not sinh". Somewhat akin to the way
> ABCs work, at least conceptually. One advantage of that may be that numeric
> classes won't have to use dunder methods for supporting the math module,
> e.g. MyNumber might register sin rather than __sin__.

I haven't checked the details, but isn't this *exactly* how ABCs work?
So math could first check for exact floats (for performance), then
check for whether the object is an instance of the "TranscendentalMaths"
ABC, and finally fall back to converting to float.
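Something like this minimal sketch, say (every name in it is
hypothetical -- "TranscendentalMaths", "MyNumber" and the module-level
sin() wrapper are only there to illustrate the lookup order, not to
propose an API):

import math
from abc import ABCMeta

class TranscendentalMaths(metaclass=ABCMeta):
    """Marker ABC: registered types promise to provide a sin() method."""

def sin(x):
    if type(x) is float:                    # fast path for exact floats
        return math.sin(x)
    if isinstance(x, TranscendentalMaths):  # registered types handle it
        return x.sin()
    return math.sin(float(x))               # fall back to float conversion

class MyNumber:
    def __init__(self, value):
        self.value = value
    def sin(self):
        return math.sin(self.value)  # stand-in for a smarter implementation
    def __float__(self):
        return float(self.value)

TranscendentalMaths.register(MyNumber)
print(sin(0.5), sin(MyNumber(0.5)))  # both take the intended path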
All of this is ignoring the question of whether it's acceptable to
change the documented contract of math to only work on actual floats,
of course...

Paul.

From steve at pearwood.info Thu Nov 10 14:04:34 2011
From: steve at pearwood.info (Steven D'Aprano)
Date: Fri, 11 Nov 2011 00:04:34 +1100
Subject: [Python-ideas] try-else without except or finally
In-Reply-To: 
References: 
Message-ID: <4EBBCBE2.7090409@pearwood.info>

Matt Joiner wrote:
> Hi Ideas,
>
> I frequently find myself in the following situations:
>
> 1) I wish to do something if no exception is thrown, for instance:
[...]
> I propose that the except clause be optional.

I too have been in the situation of wanting an else clause and a
finally clause, but no except clause.

+1 on allowing else without an except.

-- 
Steven

From ubershmekel at gmail.com Thu Nov 10 14:13:53 2011
From: ubershmekel at gmail.com (Yuval Greenfield)
Date: Thu, 10 Nov 2011 15:13:53 +0200
Subject: [Python-ideas] try-else without except or finally
In-Reply-To: <4EBBCBE2.7090409@pearwood.info>
References: <4EBBCBE2.7090409@pearwood.info>
Message-ID: 

On Nov 10, 2011 3:05 PM, "Steven D'Aprano" wrote:
>
> Matt Joiner wrote:
>>
>> Hi Ideas,
>>
>> I frequently find myself in the following situations:
>>
>> 1) I wish to do something if no exception is thrown, for instance:
>
> [...]
>
>> I propose that the except clause be optional.
>
> I too have been in the situation of wanting an else clause and a
> finally clause, but no except clause.
>

"Try...else..." doesn't make any sense without the except in my eyes.
The "else" has to conjugate something.

-1

From steve at pearwood.info Thu Nov 10 14:28:50 2011
From: steve at pearwood.info (Steven D'Aprano)
Date: Fri, 11 Nov 2011 00:28:50 +1100
Subject: [Python-ideas] try-else without except or finally
In-Reply-To: 
References: <4EBBCBE2.7090409@pearwood.info>
Message-ID: <4EBBD192.3020901@pearwood.info>

Yuval Greenfield wrote:

> "Try...else..." doesn't make any sense without the except in my eyes.
> The "else" has to conjugate something.

We already have while...else and for...else blocks in Python. The
principle is the same:

while condition():
    ...
else:
    # Run the else block unless we exit the while block with a break.
    ...

try:
    ...
else:
    # Run the else block unless we exit the try block with an exception.
    ...
finally:
    ...

This currently doesn't work without a pointless "except: raise" clause
to satisfy the compiler. I try to avoid writing pointless code.

-- 
Steven

From rob.cliffe at btinternet.com Thu Nov 10 14:35:57 2011
From: rob.cliffe at btinternet.com (Rob Cliffe)
Date: Thu, 10 Nov 2011 13:35:57 +0000
Subject: [Python-ideas] try-else without except or finally
In-Reply-To: <4EBBD192.3020901@pearwood.info>
References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info>
Message-ID: <4EBBD33D.7080908@btinternet.com>

On 10/11/2011 13:28, Steven D'Aprano wrote:
> Yuval Greenfield wrote:
>
>> "Try...else..." doesn't make any sense without the except in my eyes.
>> The "else" has to conjugate something.
>
> We already have while...else and for...else blocks in Python. The
> principle is the same:
>
> while condition():
>     ...
> else:
>     # Run the else block unless we exit the while block with a break.
>     ...
>
> try:
>     ...
> else:
>     # Run the else block unless we exit the try block with an exception.
>     ...
> finally:
>     ...
>
> This currently doesn't work without a pointless "except: raise" clause
> to satisfy the compiler. I try to avoid writing pointless code.

"except: raise" makes it explicit that an exception is to be propagated.
Far from being pointless it makes the code much more readable. "Explicit
is better than implicit."
Rob Cliffe

From steve at pearwood.info Thu Nov 10 15:24:57 2011
From: steve at pearwood.info (Steven D'Aprano)
Date: Fri, 11 Nov 2011 01:24:57 +1100
Subject: [Python-ideas] try-else without except or finally
In-Reply-To: <4EBBD33D.7080908@btinternet.com>
References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com>
Message-ID: <4EBBDEB9.2010904@pearwood.info>

Rob Cliffe wrote:

> "except: raise" makes it explicit that an exception is to be
> propagated. Far from being pointless it makes the code much more
> readable. "Explicit is better than implicit."

Look, I love the Zen as much as the next guy, but sometimes they get
used as thought-terminating cliches. Catching an exception only to
unconditionally raise it again unchanged is a waste of everybody's
time: don't catch exceptions you have no intention of dealing with.

Python has allowed try...finally without an except clause since
version 1.5, if not earlier.

Virtually every line of Python code may implicitly raise an exception.
Despite the Zen, we don't go around writing

try:
    n = len(x)
except:
    raise

just to satisfy the urge to make it explicit that any exception that
occurs will be propagated. We don't need an explicit reminder that any
exceptions will be propagated because, absent an except clause to
explicitly silence it, that's what exceptions do: they propagate.

Errors should never pass silently.
Unless explicitly silenced.

(see, I can quote the Zen too).
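For anyone who wants to check the current rule for themselves, here is
a small self-contained demonstration (compile() is enough, nothing has
to run):

# try/finally with no except clause: accepted.
compile("try:\n    pass\nfinally:\n    pass\n", "<test>", "exec")

# try/else/finally with no except clause: rejected by the compiler.
try:
    compile("try:\n    pass\nelse:\n    pass\nfinally:\n    pass\n",
            "<test>", "exec")
except SyntaxError as err:
    print("SyntaxError:", err.msg)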
-- 
Steven

From rob.cliffe at btinternet.com Thu Nov 10 15:31:04 2011
From: rob.cliffe at btinternet.com (Rob Cliffe)
Date: Thu, 10 Nov 2011 14:31:04 +0000
Subject: [Python-ideas] try-else without except or finally
In-Reply-To: <4EBBDEB9.2010904@pearwood.info>
References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <4EBBDEB9.2010904@pearwood.info>
Message-ID: <4EBBE028.3070304@btinternet.com>

Bear in mind new and inexperienced users, or perhaps someone who
doesn't know Python at all but is trying to figure out what a Python
program does.

try:
    ...
else:
    ...

is not at all intuitive.
If we want to allow that syntax, it would be better if 'else' were
replaced by something more meaningful like 'ifnoexception'. Maybe
someone can think of something meaningful but more concise.

On 10/11/2011 14:24, Steven D'Aprano wrote:
> Rob Cliffe wrote:
>
>> "except: raise" makes it explicit that an exception is to be
>> propagated. Far from being pointless it makes the code much more
>> readable. "Explicit is better than implicit."
>
> Look, I love the Zen as much as the next guy, but sometimes they get
> used as thought-terminating cliches. Catching an exception only to
> unconditionally raise it again unchanged is a waste of everybody's
> time: don't catch exceptions you have no intention of dealing with.
>
> Python has allowed try...finally without an except clause since
> version 1.5, if not earlier.
>
> Virtually every line of Python code may implicitly raise an exception.
> Despite the Zen, we don't go around writing
>
> try:
>     n = len(x)
> except:
>     raise
>
> just to satisfy the urge to make it explicit that any exception that
> occurs will be propagated. We don't need an explicit reminder that any
> exceptions will be propagated because, absent an except clause to
> explicitly silence it, that's what exceptions do: they propagate.
>
> Errors should never pass silently.
> Unless explicitly silenced.
>
> (see, I can quote the Zen too).

From arnodel at gmail.com Thu Nov 10 15:43:18 2011
From: arnodel at gmail.com (Arnaud Delobelle)
Date: Thu, 10 Nov 2011 14:43:18 +0000
Subject: [Python-ideas] try-else without except or finally
In-Reply-To: <4EBBE028.3070304@btinternet.com>
References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <4EBBDEB9.2010904@pearwood.info> <4EBBE028.3070304@btinternet.com>
Message-ID: 

On 10 November 2011 14:31, Rob Cliffe wrote:
> Bear in mind new and inexperienced users, or perhaps someone who doesn't
> know Python at all but is trying to figure out what a Python program does.
>
>     try:
>         ...
>     else:
>         ...
>
> is not at all intuitive.
> If we want to allow that syntax, it would be better if 'else' were replaced
> by something more meaningful like 'ifnoexception'. Maybe someone can think
> of something meaningful but more concise.

"noexcept". However, I think "else" is fine. It's similar to how
it's used in loops. We have:

- if / else
- for / break / else
- while / break / else
- try / except / else

It makes sense to me. Also note that "else" is allowed after loops
even if they have no "break". So why not allow "else" after a try
without an except?
-- Arnaud From guido at python.org Thu Nov 10 16:53:25 2011 From: guido at python.org (Guido van Rossum) Date: Thu, 10 Nov 2011 07:53:25 -0800 Subject: [Python-ideas] try-else without except or finally In-Reply-To: <4EBBD33D.7080908@btinternet.com> References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> Message-ID: On Thu, Nov 10, 2011 at 5:35 AM, Rob Cliffe wrote: > "except: raise" makes it explicit that an exception is to be propogated. > ?Far from being pointless it makes the code much more readable. ?"Explicit > is better than implicit." +1. Nobody's going to understand try...else... on first reading. -- --Guido van Rossum (python.org/~guido) From python at mrabarnett.plus.com Thu Nov 10 19:00:12 2011 From: python at mrabarnett.plus.com (MRAB) Date: Thu, 10 Nov 2011 18:00:12 +0000 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <4EBBDEB9.2010904@pearwood.info> <4EBBE028.3070304@btinternet.com> Message-ID: <4EBC112C.1030604@mrabarnett.plus.com> On 10/11/2011 14:43, Arnaud Delobelle wrote: > On 10 November 2011 14:31, Rob Cliffe wrote: >> Bear in mind new and inexperienced users, or perhaps someone who doesn't >> know Python at all but is trying to figure out what a Python program does. >> >> try: >> ... >> else: >> ... >> >> is not at all intuitive. >> If we want to allow that syntax, it would be better if 'else' were replaced >> by something more meaningful like 'ifnoexception'. Maybe someone can think >> of something meaningful but more concise. > > "noexcept". However, I think "else" is fine. It's similar to how > it's used in loops. We have: > > - if / else > - for / break / else > - while / break / else > - try / except / else > > It makes sense to me. Also note that "else" is allowed after loops > even if they have no "break". So why not allow "else" after a try > without an except? > The case of 'else' with the for-loops and while-loops is slightly different because the body of the loop is repeated and you don't want the body of the 'else' suite to be repeated. With the 'try' statement that's not a problem. -1 From ubershmekel at gmail.com Thu Nov 10 20:00:54 2011 From: ubershmekel at gmail.com (Yuval Greenfield) Date: Thu, 10 Nov 2011 21:00:54 +0200 Subject: [Python-ideas] try-else without except or finally In-Reply-To: <4EBBD192.3020901@pearwood.info> References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> Message-ID: On Thu, Nov 10, 2011 at 3:28 PM, Steven D'Aprano wrote: > Yuval Greenfield wrote: > > "Try...else..." doesn't make any sense without the except jn my eyes. The >> "else" has to conjugate something. >> > > > We already have while...else and for...else blocks in Python. The > principle is the same: > > > while condition(): > ... > else: > # Run the else block unless we exit the while block with a break. > ... > > I'd rename or remove for/else and while/else from the language. Guido has said "we probably should not do more of these". http://mail.python.org/pipermail/python-ideas/2009-October/006083.html Though these constructs are technically useful, even experts who read the documentation misunderstand or misuse them. People who are new to the language will be even worse. --Yuval -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ethan at stoneleaf.us Thu Nov 10 20:05:09 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Thu, 10 Nov 2011 11:05:09 -0800 Subject: [Python-ideas] try-else without except or finally In-Reply-To: <4EBBD192.3020901@pearwood.info> References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> Message-ID: <4EBC2065.9040900@stoneleaf.us> Steven D'Aprano wrote: > Yuval Greenfield wrote: > >> "Try...else..." doesn't make any sense without the except jn my eyes. The >> "else" has to conjugate something. > > > We already have while...else and for...else blocks in Python. The > principle is the same: > > > while condition(): > ... > else: > # Run the else block unless we exit the while block with a break. > ... > > > try: > ... > else: > # Run the else block unless we exit the try block with an exception. > ... > finally: > ... -1 to skipping the except while/else and for/else are more along the lines of 'when condition is False, run the else suite -- how would you have a false condition for the try? You don't, you have a false condition for the except. ~Ethan~ From tjreedy at udel.edu Thu Nov 10 20:55:34 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 10 Nov 2011 14:55:34 -0500 Subject: [Python-ideas] try-else without except or finally In-Reply-To: <4EBBD192.3020901@pearwood.info> References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> Message-ID: On 11/10/2011 8:28 AM, Steven D'Aprano wrote: > try: > ... > else: > # Run the else block unless we exit the try block with an exception. > ... As Dennis Kaarsemaker already pointed out in his response to the OP, the 'else:' is not needed and this is the same (unless we are missing something) as try: ... ... In other words, *every* statement in a try block after the first is preceded by an implicit 'run this unless we already exited with an exception'. -- Terry Jan Reedy From p.f.moore at gmail.com Thu Nov 10 21:13:31 2011 From: p.f.moore at gmail.com (Paul Moore) Date: Thu, 10 Nov 2011 20:13:31 +0000 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> Message-ID: On 10 November 2011 19:55, Terry Reedy wrote: > On 11/10/2011 8:28 AM, Steven D'Aprano wrote: > >> try: >> ... >> else: >> # Run the else block unless we exit the try block with an exception. >> ... > > As Dennis Kaarsemaker already pointed out in his response to the OP, the > 'else:' is not needed and this is the same (unless we are missing something) > as > > try: > ?... > ?... > > In other words, *every* statement in a try block after the first is preceded > by an implicit 'run this unless we already exited with an exception'. Ah! I noted the post by Dennis Kaarsemaker pointing out that the original code could be rewritten without needing the dummy except or the else, but hadn't spotted that the argument generalised to *all* uses of an else without an except. Paul. From digitalxero at gmail.com Thu Nov 10 21:14:05 2011 From: digitalxero at gmail.com (Dj Gilcrease) Date: Thu, 10 Nov 2011 15:14:05 -0500 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> Message-ID: On Thu, Nov 10, 2011 at 10:53 AM, Guido van Rossum wrote: > On Thu, Nov 10, 2011 at 5:35 AM, Rob Cliffe wrote: >> "except: raise" makes it explicit that an exception is to be propogated. 
>> ?Far from being pointless it makes the code much more readable. ?"Explicit >> is better than implicit." > > +1. Nobody's going to understand try...else... on first reading. > > -- > --Guido van Rossum (python.org/~guido) I agree, try...else... when reading it would be try this block of code, else if it fails do this block of code From anacrolix at gmail.com Thu Nov 10 22:38:46 2011 From: anacrolix at gmail.com (Matt Joiner) Date: Fri, 11 Nov 2011 08:38:46 +1100 Subject: [Python-ideas] try-else without except or finally In-Reply-To: <4EBBD33D.7080908@btinternet.com> References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> Message-ID: "Explicit is better than implicit." does not apply here at all. It's already known that an uncaught exception will not be caught and the rule was not applied there. except: raise is as redundant as: for stuff in things: do_the_things() else: pass On Fri, Nov 11, 2011 at 12:35 AM, Rob Cliffe wrote: > > > On 10/11/2011 13:28, Steven D'Aprano wrote: >> >> Yuval Greenfield wrote: >> >>> "Try...else..." doesn't make any sense without the except jn my eyes. The >>> "else" has to conjugate something. >> >> >> We already have while...else and for...else blocks in Python. The >> principle is the same: >> >> >> while condition(): >> ? ?... >> else: >> ? ?# Run the else block unless we exit the while block with a break. >> ? ?... >> >> >> try: >> ? ?... >> else: >> ? ?# Run the else block unless we exit the try block with an exception. >> ? ?... >> finally: >> ? ?... >> >> >> This currently doesn't work without a pointless "except: raise" clause to >> satisfy the compiler. I try to avoid writing pointless code. >> >> >> > "except: raise" makes it explicit that an exception is to be propogated. > ?Far from being pointless it makes the code much more readable. ?"Explicit > is better than implicit." > Rob Cliffe > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From tjreedy at udel.edu Thu Nov 10 23:10:24 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 10 Nov 2011 17:10:24 -0500 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> Message-ID: > On Thu, Nov 10, 2011 at 10:53 AM, Guido van Rossum wrote: >> On Thu, Nov 10, 2011 at 5:35 AM, Rob Cliffe wrote: >>> "except: raise" makes it explicit that an exception is to be propogated. >>> Far from being pointless it makes the code much more readable. "Explicit >>> is better than implicit." >> >> +1. Nobody's going to understand try...else... on first reading. Perhaps because an else: in this context, without being subordinate to anything, would be irrelevant, equivalent to 'pass' or, more exactly, 'if True:'. Else is always subordinate to some conditional clause$. The 'if' clause for try statements is the first except clause, not the try header: 'except x:' means something like 'if raised(x):' (since the corresponding try setup statement). The else clause is actually an elif clause: 'else:' means 'elif nothing_raised():'. 
To illustrate, the C equivalent of

try:
    a = b/c
except ZeroDivisionError:
    print('bah')
else:
    print('hooray')

would be (minus the punctuation that I have forgotten) something like

errno = 0  /* setup error detection mechanism, just as try: does */
a = b/c
if errno == EZERODIV  /* or whatever it is */
    printf('bah')
elif errno == 0  /* ie, no error since setup */
    printf('hooray')

I suspect Python *could* have been defined to require 'except
NOEXCEPTION:' instead of 'else:' by initializing an internal exception
variable (equivalent to C's errno) with NOEXCEPTION. However, 'else:'
is definitely nicer (just as normal 'else:' is equivalent to and nicer
than 'elif True:').

$ A for loop is equivalent to a while loop with condition "iterator
(initially iter(iterable)) is not empty". Break and continue have the
same meaning as in while loops.

Any while loop, such as

while c:
    x
else:
    y

is equivalent to an if statement with goto. In
'Python-with-label-and-goto' the above would be the same as

label start_loop
if c:
    x
    goto start_loop
else:
    y

So: in Python (as elsewhere), 'else' clauses always follow one or more
headers that specify conditional execution and only trigger when the
previous condition or conditions are false. Their use in various
statement types is consistent and logically correct.

-- 
Terry Jan Reedy

From anacrolix at gmail.com Thu Nov 10 23:21:45 2011
From: anacrolix at gmail.com (Matt Joiner)
Date: Fri, 11 Nov 2011 09:21:45 +1100
Subject: [Python-ideas] try-else without except or finally
In-Reply-To: 
References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com>
Message-ID: 

Your examples of else with loops are missing the extra behaviour on
break? This is where the real value is in this usage.

On Nov 11, 2011 9:11 AM, "Terry Reedy" wrote:

>> On Thu, Nov 10, 2011 at 10:53 AM, Guido van Rossum wrote:
>>> On Thu, Nov 10, 2011 at 5:35 AM, Rob Cliffe wrote:
>>>> "except: raise" makes it explicit that an exception is to be propagated.
>>>> Far from being pointless it makes the code much more readable.
>>>> "Explicit is better than implicit."
>>>
>>> +1. Nobody's going to understand try...else... on first reading.
>
> Perhaps because an else: in this context, without being subordinate to
> anything, would be irrelevant, equivalent to 'pass' or, more exactly, 'if
> True:'.
>
> Else is always subordinate to some conditional clause$. The 'if' clause
> for try statements is the first except clause, not the try header: 'except
> x:' means something like 'if raised(x):' (since the corresponding try setup
> statement). The else clause is actually an elif clause: 'else:' means 'elif
> nothing_raised():'.
>
> To illustrate, the C equivalent of
>
> try:
>     a = b/c
> except ZeroDivisionError:
>     print('bah')
> else:
>     print('hooray')
>
> would be (minus the punctuation that I have forgotten) something like
>
> errno = 0  /* setup error detection mechanism, just as try: does */
> a = b/c
> if errno == EZERODIV  /* or whatever it is */
>     printf('bah')
> elif errno == 0  /* ie, no error since setup */
>     printf('hooray')
>
> I suspect Python *could* have been defined to require 'except
> NOEXCEPTION:' instead of 'else:' by initializing an internal exception
> variable (equivalent to C's errno) with NOEXCEPTION. However, 'else:' is
> definitely nicer (just as normal 'else:' is equivalent to and nicer than
> 'elif True:').
>
> $ A for loop is equivalent to a while loop with condition "iterator
> (initially iter(iterable)) is not empty".
> Break and continue have the same meaning as in while loops.
>
> Any while loop, such as
>
> while c:
>     x
> else:
>     y
>
> is equivalent to an if statement with goto. In
> 'Python-with-label-and-goto' the above would be the same as
>
> label start_loop
> if c:
>     x
>     goto start_loop
> else:
>     y
>
> So: in Python (as elsewhere), 'else' clauses always follow one or more
> headers that specify conditional execution and only trigger when the
> previous condition or conditions are false. Their use in various
> statement types is consistent and logically correct.
>
> -- 
> Terry Jan Reedy
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas

From tjreedy at udel.edu Fri Nov 11 00:26:47 2011
From: tjreedy at udel.edu (Terry Reedy)
Date: Thu, 10 Nov 2011 18:26:47 -0500
Subject: [Python-ideas] try-else without except or finally
In-Reply-To: 
References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com>
Message-ID: 

On 11/10/2011 5:21 PM, Matt Joiner wrote:
> Your examples of else with loops are missing the extra behaviour on
> break?

I omitted 'break' as not relevant to the particular points I was aiming
at. To add it in:

while c:
    x
    if d: break
else:
    y

in PythonGTL could be

label start_loop
if c:
    x
    if d: goto after_loop
    goto start_loop
else:
    y
label after_loop

> This is where the real value is in this usage.

Yes, 'break' is the main motivation for 'while...else'.
Return and raise (explicit or otherwise) also break the loop. But since
they raise stylistic preference issues, I won't go into them here.

-- 
Terry Jan Reedy

From steve at pearwood.info Fri Nov 11 00:43:12 2011
From: steve at pearwood.info (Steven D'Aprano)
Date: Fri, 11 Nov 2011 10:43:12 +1100
Subject: [Python-ideas] try-else without except or finally
In-Reply-To: 
References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com>
Message-ID: <4EBC6190.6060104@pearwood.info>

Guido van Rossum wrote:
> On Thu, Nov 10, 2011 at 5:35 AM, Rob Cliffe wrote:
>> "except: raise" makes it explicit that an exception is to be propagated.
>> Far from being pointless it makes the code much more readable. "Explicit
>> is better than implicit."
>
> +1. Nobody's going to understand try...else... on first reading.

Nobody's going to understand try...finally on first reading either, or
for...else. And I can tell you, the first time I saw Python code, I had
no idea how to translate "for x in something" into idioms I understood.
The only for loops I had ever seen before then had been "for i = 1 to
100" Pascal/C style loops, and I was completely mystified.

It seems somewhat bizarre to me that in another thread we are talking
about adding "cofunctions" to a language that already has threads,
generators, coroutines and metaclasses, and then on the other hand
we're worried that users won't be able to generalise
try...else...finally from try...except...else...finally.

However, Terry's explanation of Dennis' observation that any
hypothetical try...else...finally block can be re-written as
try...finally satisfies me. That's a good enough reason to keep the
status quo.

Downgrading my vote from +1 to +0.
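For the record, the mechanical rewrite in question is just this (a
sketch with stand-in callables a, b and c):

def rewritten(a, b, c):
    # The hypothetical try/a()/else/b()/finally/c() form behaves
    # exactly like this legal form:
    try:
        a()
        b()   # the would-be 'else' body; runs only if a() didn't raise
    finally:
        c()   # cleanup runs whether or not an exception occurred

rewritten(lambda: print('a'),
          lambda: print('b'),
          lambda: print('c'))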
-- Steven From anacrolix at gmail.com Fri Nov 11 03:48:07 2011 From: anacrolix at gmail.com (Matt Joiner) Date: Fri, 11 Nov 2011 13:48:07 +1100 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> Message-ID: Thanks for clarifying this. If anything, as mentioned earlier, the meaning of else isn't entirely obvious on the loops, nor is it obvious on try/except/else/finally clauses. The first time I encountered for/else I was dumbfounded, then I realised how handy it was. I wouldn't want to see these constructs removed. They remove the need for really nasty patterns, I'm sure this was discussed when they were added. On Fri, Nov 11, 2011 at 10:26 AM, Terry Reedy wrote: > On 11/10/2011 5:21 PM, Matt Joiner wrote: >> >> Your examples of else with loops are missing the extra behaviour on >> break? > > I omitted 'break' as not relevant to the particular points I was aiming at. > To add it in: > > while c: > ?x > ?if d: break > else: > ?y > > in PythonGTL could be > > label start_loop > if c: > ?x > ?if d: goto after_loop > ?goto start_loop > else: > ?y > label after_loop > >> This is where the real value is in this usage. > > Yes, 'break' is the main motivation for 'while...else'. > Return and raise (explicit or otherwise) also break the loop. But since they > raise stylistic preference issues, I won't go into them here. > > -- > Terry Jan Reedy > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From ncoghlan at gmail.com Fri Nov 11 04:41:29 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 11 Nov 2011 13:41:29 +1000 Subject: [Python-ideas] try-else without except or finally In-Reply-To: <4EBC6190.6060104@pearwood.info> References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <4EBC6190.6060104@pearwood.info> Message-ID: On Fri, Nov 11, 2011 at 9:43 AM, Steven D'Aprano wrote: > However, Terry's explanation of Dennis' observation that any hypothetical > try...else...finally block can be re-written as try...finally satisfies me. > That's a good enough reason to keep the status quo. > > Downgrading my vote from +1 to +0. A full -1 from me, for the reason already given: it's completely redundant. The "else:" clause on try statements exists specifically to cover the following situation: 1. You have code in a try block that may trigger associated exception handlers 2. One or more of those exception handlers may suppress the exception, allowing execution to resume after the try statement 3. You have some code that you want to run *only* if the try block doesn't throw an exception (i.e. if an exception is thrown and suppressed, you don't want to run this section of the code) If none of your exception handlers can suppress the exception and there's either no "finally:" close or it's OK if the additional code runs after the cleanup code, then "else:" is not really needed - you can just write the additional code after the whole try statement. If there are no exception handlers at all, then either don't use a try statement at all, or use a simple try-finally statement if you need some guaranteed cleanup. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? 
Brisbane, Australia

From guido at python.org Fri Nov 11 04:50:45 2011
From: guido at python.org (Guido van Rossum)
Date: Thu, 10 Nov 2011 19:50:45 -0800
Subject: [Python-ideas] try-else without except or finally
In-Reply-To: 
References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <4EBC6190.6060104@pearwood.info>
Message-ID: 

On Thu, Nov 10, 2011 at 7:41 PM, Nick Coghlan wrote:
> On Fri, Nov 11, 2011 at 9:43 AM, Steven D'Aprano wrote:
>> However, Terry's explanation of Dennis' observation that any hypothetical
>> try...else...finally block can be re-written as try...finally satisfies me.
>> That's a good enough reason to keep the status quo.
>>
>> Downgrading my vote from +1 to +0.
>
> A full -1 from me, for the reason already given: it's completely
> redundant. The "else:" clause on try statements exists specifically to
> cover the following situation:
>
> 1. You have code in a try block that may trigger associated exception handlers
> 2. One or more of those exception handlers may suppress the exception,
> allowing execution to resume after the try statement
> 3. You have some code that you want to run *only* if the try block
> doesn't throw an exception (i.e. if an exception is thrown and
> suppressed, you don't want to run this section of the code)

4. You don't want exceptions in the else-block to be handled by any of
the handlers. (Otherwise you could just put the else-block's code at
the end of the try-block.)

> If none of your exception handlers can suppress the exception and
> there's either no "finally:" close or it's OK if the additional code
> runs after the cleanup code, then "else:" is not really needed - you
> can just write the additional code after the whole try statement.
>
> If there are no exception handlers at all, then either don't use a try
> statement at all, or use a simple try-finally statement if you need
> some guaranteed cleanup.

-- 
--Guido van Rossum (python.org/~guido)

From ron3200 at gmail.com Fri Nov 11 23:38:52 2011
From: ron3200 at gmail.com (Ron Adam)
Date: Fri, 11 Nov 2011 16:38:52 -0600
Subject: [Python-ideas] Throwing exceptions from inside coroutines.
Message-ID: <1321051132.21755.36.camel@Gutsy>

Idea... Exceptions should be thrown out of coroutines rather than
raised inside them and allowed to propagate out.

There are several benefits.

* It doesn't stop or leave the coroutine in a state that can't be
continued.

* It maintains the boundary between coroutines and other code in such a
way that they may be replaced or wrapped in threads or processes more
easily.

* It creates a new way to communicate to the outside framework, which
simplifies creating schedulers.

* It would work with yield-from expressions to almost completely
eliminate any support framework like trampoline runners. (and the
cothread.py module here.)

My first thought on this was to allow a resumable exception type, but
after thinking about it more, I don't think it would be a good idea.

Rather than try to explain this in words, here is a version of Nick's
example cothread.py module with a throws() function that can be used to
throw out an exception in a clean way. This isn't meant to be included
in the library, but is a way to experiment with the API used to write
coroutines as Nick suggested.

Also here are two modules from Greg's examples that show some context
in which these features are used.

Cheers,
Ron

-------------- next part --------------
A non-text attachment was scrubbed...
Name: cothread.py Type: text/x-python Size: 3046 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: sched_test.py Type: text/x-python Size: 1774 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: philosophers.py Type: text/x-python Size: 3426 bytes Desc: not available URL: From wuwei23 at gmail.com Mon Nov 14 03:36:59 2011 From: wuwei23 at gmail.com (alex23) Date: Sun, 13 Nov 2011 18:36:59 -0800 (PST) Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> Message-ID: <54a37430-5629-47ba-bd67-991e17b46cc2@w20g2000prc.googlegroups.com> On Nov 11, 1:53?am, Guido van Rossum wrote: > +1. Nobody's going to understand try...else... on first reading. On the other hand, no one properly understands for...else... when first encountering it either. From g.rodola at gmail.com Mon Nov 14 15:25:05 2011 From: g.rodola at gmail.com (=?ISO-8859-1?Q?Giampaolo_Rodol=E0?=) Date: Mon, 14 Nov 2011 15:25:05 +0100 Subject: [Python-ideas] try-else without except or finally In-Reply-To: <54a37430-5629-47ba-bd67-991e17b46cc2@w20g2000prc.googlegroups.com> References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <54a37430-5629-47ba-bd67-991e17b46cc2@w20g2000prc.googlegroups.com> Message-ID: 2011/11/14 alex23 : > On Nov 11, 1:53?am, Guido van Rossum wrote: >> +1. Nobody's going to understand try...else... on first reading. > > On the other hand, no one properly understands for...else... when > first encountering it either. I see no value in adding another hardly understandable construct. Plus, in my mind try/else appears to be logically different than for/else (which I find very useful). -1 --- Giampaolo http://code.google.com/p/pyftpdlib/ http://code.google.com/p/psutil/ From anacrolix at gmail.com Mon Nov 14 20:34:23 2011 From: anacrolix at gmail.com (Matt Joiner) Date: Tue, 15 Nov 2011 06:34:23 +1100 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <54a37430-5629-47ba-bd67-991e17b46cc2@w20g2000prc.googlegroups.com> Message-ID: To be fair it's not a new construct, it's loosening the constraints on an existing one. else has the same use as in the for and while compound statements: It's executed if the first block completes "successfully". An exception being thrown in a try block, cancels execution of the "else" block, the same way as "break" in the for and while statements. I have a problem with the idea now in that: try: a else: b is the same as: try: a b else: pass and even: try: pass else: a b in every possible interpretation. This isn't possible with if, for, while, try/except, and try/finally compound statements, all of which alter the meaning in *some* way. So I maintain that the else block could be made allowable even without an except block, but it would be purely for convenience. I don't think it's worth it anymore. Cheers all for consideration. On Tue, Nov 15, 2011 at 1:25 AM, Giampaolo Rodol? wrote: > 2011/11/14 alex23 : >> On Nov 11, 1:53?am, Guido van Rossum wrote: >>> +1. Nobody's going to understand try...else... on first reading. >> >> On the other hand, no one properly understands for...else... when >> first encountering it either. 
> > I see no value in adding another hardly understandable construct. > Plus, in my mind try/else appears to be logically different than > for/else (which I find very useful). > -1 > > --- Giampaolo > http://code.google.com/p/pyftpdlib/ > http://code.google.com/p/psutil/ > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From ckaynor at zindagigames.com Mon Nov 14 21:04:30 2011 From: ckaynor at zindagigames.com (Chris Kaynor) Date: Mon, 14 Nov 2011 12:04:30 -0800 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <54a37430-5629-47ba-bd67-991e17b46cc2@w20g2000prc.googlegroups.com> Message-ID: On Mon, Nov 14, 2011 at 11:34 AM, Matt Joiner wrote: > To be fair it's not a new construct, it's loosening the constraints on > an existing one. > > else has the same use as in the for and while compound statements: > It's executed if the first block completes "successfully". An > exception being thrown in a try block, cancels execution of the "else" > block, the same way as "break" in the for and while statements. > > I have a problem with the idea now in that: > > try: > ? ?a > else: > ? ?b > > is the same as: > > try: > ? ?a > ? ?b > else: > ? ?pass > > and even: > > try: > ? ?pass > else: > ? ?a > ? ?b > > in every possible interpretation. This isn't possible with if, for, > while, try/except, and try/finally compound statements, all of which > alter the meaning in *some* way. If you add finally, the else clause still maintains the same semantics in a try statement - only except statements modify the equivalence of else: try: a else: b finally: c is the same as: try: a b else: pass finally: c and try: pass else: a b finally: c What changes with the finally is that you cannot move the "b" statement out of the try completely. Without the finally clause, the following are the same: try: a else: b try: a b a b Similar behavior can be shown with for: else and while: else, if there is no break (I'm only showing a while statement below, however it converts exactly to a for statement): while a: b else: c is the same as: while a: b c so long as "b" does not include a break statement. > > So I maintain that the else block could be made allowable even without > an except block, but it would be purely for convenience. I don't think > it's worth it anymore. While it is redundant to allow else without except, I would agree that it should be allowed for the same reason that else is allowed on for and while without a break. The alternative would be to make the later illegal. Either seems reasonable, however the former maintains backwards compatibility, while the later breaks it. > > Cheers all for consideration. From ethan at stoneleaf.us Mon Nov 14 21:23:02 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Mon, 14 Nov 2011 12:23:02 -0800 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <54a37430-5629-47ba-bd67-991e17b46cc2@w20g2000prc.googlegroups.com> Message-ID: <4EC178A6.4020002@stoneleaf.us> Chris Kaynor wrote: > While it is redundant to allow else without except, I would agree that > it should be allowed for the same reason that else is allowed on for > and while without a break. 
The alternative would be to make the later > illegal. Either seems reasonable, however the former maintains > backwards compatibility, while the later breaks it. if / else: if the 'if' condition is False, run the 'else' for / else: when the 'for' condition is False, run the 'else' while / else: when the 'while' condition is False, run the 'else' try /except / else: when the 'except' condition is False, run the 'else' try / else: when the 'try' condition is False, run the 'else' ?? To be consistent, the 'else' in a try/else would basically be an 'except Exception' -- it completely reverses the meaning of the 'else' in a try/except/else block. ~Ethan~ From anacrolix at gmail.com Mon Nov 14 22:21:23 2011 From: anacrolix at gmail.com (Matt Joiner) Date: Tue, 15 Nov 2011 08:21:23 +1100 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <54a37430-5629-47ba-bd67-991e17b46cc2@w20g2000prc.googlegroups.com> Message-ID: > While it is redundant to allow else without except, I would agree that yeah that nails it. > it should be allowed for the same reason that else is allowed on for> and while without a break. The alternative would be to make the later that's a really good point, i hadn't considered this. similar things also occur when you "drop" out of some construct because you can predict the control flow. for example when certain blocks are guaranteed to throw exceptions, it becomes unnecessary to put alternative behaviour in an "else" block. try: a except: b raise c In the above, c does not get put in an else because it's redundant. The same applies here: if a: raise b c > illegal. Either seems reasonable, however the former maintains> backwards compatibility, while the later breaks it. Yes for clarification you're saying: while a: b else: c Is redundant if b does not break. I think this puts the idea of else without except into the "why not?" camp for me. On Tue, Nov 15, 2011 at 7:04 AM, Chris Kaynor wrote: > On Mon, Nov 14, 2011 at 11:34 AM, Matt Joiner wrote: >> To be fair it's not a new construct, it's loosening the constraints on >> an existing one. >> >> else has the same use as in the for and while compound statements: >> It's executed if the first block completes "successfully". An >> exception being thrown in a try block, cancels execution of the "else" >> block, the same way as "break" in the for and while statements. >> >> I have a problem with the idea now in that: >> >> try: >> ? ?a >> else: >> ? ?b >> >> is the same as: >> >> try: >> ? ?a >> ? ?b >> else: >> ? ?pass >> >> and even: >> >> try: >> ? ?pass >> else: >> ? ?a >> ? ?b >> >> in every possible interpretation. This isn't possible with if, for, >> while, try/except, and try/finally compound statements, all of which >> alter the meaning in *some* way. > > If you add finally, the else clause still maintains the same semantics > in a try statement - only except statements modify the equivalence of > else: > try: > ?a > else: > ?b > finally: > ?c > > is the same as: > try: > ?a > ?b > else: > ?pass > finally: > ?c > > and > > try: > ?pass > else: > ?a > ?b > finally: > ?c > > What changes with the finally is that you cannot move the "b" > statement out of the try completely. 
Without the finally clause, the > following are the same: > > try: > ?a > else: > ?b > > try: > ?a > b > > a > b > > > > > Similar behavior can be shown with for: else and while: else, if there > is no break (I'm only showing a while statement below, however it > converts exactly to a for statement): > while a: > ?b > else: > ?c > > is the same as: > while a: > ?b > c > > so long as "b" does not include a break statement. > > > >> >> So I maintain that the else block could be made allowable even without >> an except block, but it would be purely for convenience. I don't think >> it's worth it anymore. > > While it is redundant to allow else without except, I would agree that > it should be allowed for the same reason that else is allowed on for > and while without a break. The alternative would be to make the later > illegal. Either seems reasonable, however the former maintains > backwards compatibility, while the later breaks it. > >> >> Cheers all for consideration. > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From anacrolix at gmail.com Mon Nov 14 22:24:06 2011 From: anacrolix at gmail.com (Matt Joiner) Date: Tue, 15 Nov 2011 08:24:06 +1100 Subject: [Python-ideas] try-else without except or finally In-Reply-To: <4EC178A6.4020002@stoneleaf.us> References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <54a37430-5629-47ba-bd67-991e17b46cc2@w20g2000prc.googlegroups.com> <4EC178A6.4020002@stoneleaf.us> Message-ID: well for all cases bar the if/else, "else" is used more like a "then". for/then, while/then, try/except/then/finally On Tue, Nov 15, 2011 at 7:23 AM, Ethan Furman wrote: > Chris Kaynor wrote: >> >> While it is redundant to allow else without except, I would agree that >> it should be allowed for the same reason that else is allowed on for >> and while without a break. The alternative would be to make the later >> illegal. Either seems reasonable, however the former maintains >> backwards compatibility, while the later breaks it. > > if / else: if the 'if' condition is False, run the 'else' > > for / else: when the 'for' condition is False, run the 'else' > > while / else: when the 'while' condition is False, run the 'else' > > try /except / else: when the 'except' condition is False, run the 'else' > > try / else: when the 'try' condition is False, run the 'else' ?? > > To be consistent, the 'else' in a try/else would basically be an 'except > Exception' -- it completely reverses the meaning of the 'else' in a > try/except/else block. > > ~Ethan~ > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From grosser.meister.morti at gmx.net Mon Nov 14 22:37:47 2011 From: grosser.meister.morti at gmx.net (=?ISO-8859-1?Q?Mathias_Panzenb=F6ck?=) Date: Mon, 14 Nov 2011 22:37:47 +0100 Subject: [Python-ideas] try-else without except or finally In-Reply-To: <4EBBD192.3020901@pearwood.info> References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> Message-ID: <4EC18A2B.4010700@gmx.net> On 11/10/2011 02:28 PM, Steven D'Aprano wrote: > > try: > ... > else: > # Run the else block unless we exit the try block with an exception. > ... > finally: > ... > To me this would intuitively read as: Do the try block if you can, else (if you can't) do the else block. And you can't if an exception is raised. 
This is exactly the opposite of the intended meaning. The except is needed because the else refers to it. Without the except the else would refer to the try and one would expect behaviour as with an "except:" block. Also when ever would you need an else block without an except block? Just append it to the try block and you get the same behaviour. As for the not immediately intuitive meaning of the finally block: Yes, maybe "ensure" would have been better. But it's to late for that to change. -1 from me -panzi From ncoghlan at gmail.com Mon Nov 14 23:38:40 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 15 Nov 2011 08:38:40 +1000 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <54a37430-5629-47ba-bd67-991e17b46cc2@w20g2000prc.googlegroups.com> Message-ID: On 11/15/11, Chris Kaynor wrote: > On Mon, Nov 14, 2011 at 11:34 AM, Matt Joiner wrote: >> So I maintain that the else block could be made allowable even without >> an except block, but it would be purely for convenience. I don't think >> it's worth it anymore. > > While it is redundant to allow else without except, I would agree that > it should be allowed for the same reason that else is allowed on for > and while without a break. The alternative would be to make the later > illegal. Either seems reasonable, however the former maintains > backwards compatibility, while the later breaks it. This change is *not going to happen*. Guido said drop it. Every core developer in the conversation said drop it. Please let the thread die already. The fact that 'else:' is allowed on loops without a break statement (despite such a construct being pointless) is just a way to simplify the language compilation process. While it obviously could be detected and prevented if we really wanted to (since we basically do that to allow 'yield' to alter the meaning of a function definition), reaching inside a suite to detect that kind of thing is a pain and simply not worth the hassle in this specific case. If it really bothers you, propose a patch to pyflakes (etc) that issues a warning for this kind of error (assuming they don't detect it already). In no way, shape or form does the fact we allow a particular redundant construct because detecting it is a pain to implement justify relaxing the parsing rules to allow an equally redundant construct that all compliant Python implementations *already* reject (and is much easier to detect in the first place, since the required precursor is a peer suite rather than a statement embedded inside such a suite). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From solipsis at pitrou.net Tue Nov 15 00:18:18 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 15 Nov 2011 00:18:18 +0100 Subject: [Python-ideas] try-else without except or finally References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <54a37430-5629-47ba-bd67-991e17b46cc2@w20g2000prc.googlegroups.com> Message-ID: <20111115001818.639be7bd@pitrou.net> On Tue, 15 Nov 2011 08:38:40 +1000 Nick Coghlan wrote: > On 11/15/11, Chris Kaynor wrote: > > On Mon, Nov 14, 2011 at 11:34 AM, Matt Joiner wrote: > >> So I maintain that the else block could be made allowable even without > >> an except block, but it would be purely for convenience. I don't think > >> it's worth it anymore. 
> > > > While it is redundant to allow else without except, I would agree that > > it should be allowed for the same reason that else is allowed on for > > and while without a break. The alternative would be to make the later > > illegal. Either seems reasonable, however the former maintains > > backwards compatibility, while the later breaks it. > > This change is *not going to happen*. Guido said drop it. Every core > developer in the conversation said drop it. Please let the thread die > already. What Nick said! Regards Antoine. From tjreedy at udel.edu Tue Nov 15 01:53:05 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 14 Nov 2011 19:53:05 -0500 Subject: [Python-ideas] try-else without except or finally In-Reply-To: References: <4EBBCBE2.7090409@pearwood.info> <4EBBD192.3020901@pearwood.info> <4EBBD33D.7080908@btinternet.com> <54a37430-5629-47ba-bd67-991e17b46cc2@w20g2000prc.googlegroups.com> Message-ID: On 11/14/2011 3:04 PM, Chris Kaynor wrote: > While it is redundant to allow else without except, It is as redundant as allowing an 'else' without 'if', which is also not allowed. If fact, the exactly meaning of 'else' depends on what it is subordinate to -- 'if', 'while', 'for', or 'except'. So a bare 'else' would be ambiguous as to exact meaning, though the effect of 'pass' would be the same. The construct analogous to if...elif...else is except...except...else. After the first 'except', 'except' is an abbreviation for 'else-except', just as 'elif' is an abbreviation for 'else-if'. Try-else is *not* analogous to if-else. 'Try' is not a conditional. The next line *always* executes (successfully or not). 'Try' only means 'prepare a context for special treatment of exceptions'. One special treatment is to execute code only if there is an exception (after it is caught). The other is to execute code even if there is an exception. > I would agree that it should be allowed for the same reason > that else is allowed on for and while without a break. One is also allowed to write dubious code in numerous other ways: if : body if : body while : break; body while : body; break a=1; a=2 ... So what? The fact that a language spec, especially one limited to a context-free LL(1) grammar, allows one to write dead or irrelevant code in several ways is *not* a reason to add another, especially one that is logically incoherent and *always* useless, not just sometimes. The CPython parser and compiler are written to run reasonably fast. Third-party code checkers raise warnings for things they can catch with more extended analysis. Anyway, there is no analogy between 'try' and 'while' or 'for'. Both of the latter are 'if' statements augmented with a hidden goto to produce looping. The 'else' is subordinate to the 'if' part. Both can be terminated early by 'break', which is only allowed in loops. None of this has anything to do with 'try'. > The alternative would be to make the later illegal. Try writing a Python-legal grammar that does that;-). -- Terry Jan Reedy From mikegraham at gmail.com Wed Nov 16 14:30:50 2011 From: mikegraham at gmail.com (Mike Graham) Date: Wed, 16 Nov 2011 08:30:50 -0500 Subject: [Python-ideas] Renaming the "test" package In-Reply-To: References: Message-ID: On Sun, Nov 6, 2011 at 3:56 AM, Yuval Greenfield wrote: > For python4, the "test" package should actually be named > __pythontest__ or something similar. There are 2 million results for > ""test.py"" on google. > > What do you think? 
> > --Yuval The test package isn't really meant for normal users anyhow, so shadowing "test" isn't a big deal. I see your point and don't disagree with the concept, but don't think it's a big enough deal to do anything about. Mike From ubershmekel at gmail.com Wed Nov 16 16:55:21 2011 From: ubershmekel at gmail.com (Yuval Greenfield) Date: Wed, 16 Nov 2011 17:55:21 +0200 Subject: [Python-ideas] Renaming the "test" package In-Reply-To: References: Message-ID: On Wed, Nov 16, 2011 at 3:30 PM, Mike Graham wrote: > On Sun, Nov 6, 2011 at 3:56 AM, Yuval Greenfield > wrote: > ... > Mike > I should have explained. A file of mine was named "test.py" and one day I got some bizarre "AttributeError: 'module' object has no attribute..." because my test.py was moved elsewhere. So instead of a failed import which would have been more obvious I got that. I agree there's no direct damage done by this "shadowing". Maybe the better word is "ghosting" because it appears to give something though it doesn't. --Yuval -------------- next part -------------- An HTML attachment was scrubbed... URL: From solipsis at pitrou.net Fri Nov 18 21:01:46 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Fri, 18 Nov 2011 21:01:46 +0100 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> Message-ID: <20111118210146.435b13f8@pitrou.net> On Mon, 7 Nov 2011 10:34:24 -0500 Barry Warsaw wrote: > On Oct 30, 2011, at 12:18 AM, Antoine Pitrou wrote: > > >I would like to propose the following PEP for discussion and, if > >possible, acceptance. I think the proposal shouldn't be too > >controversial (I find it quite simple and straightforward myself :-)). > > Nice PEP, and +1 for the concept (with Guido's preferred format). However, > "qname" is pretty obscure and I only guessed what the "q" stood for by reading > the title of the PEP. > > It seems to me that this attribute represents the dotted path from module > globals to the object. You have to be careful not to confuse this with a file > system path, so something like __dotted_name__, __dotted_path__, or > __full_name__ perhaps. I don't much care, but I do think cryptic > abbreviations should be avoided. Based on the feedback received, I've finally changed the attribute name to __qualname__. Pronunciation issues notwithstanding ;-), it is a reasonable compromise between explicitness and shortness/easy-of-typing. Regards Antoine. From ncoghlan at gmail.com Fri Nov 18 23:49:21 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 19 Nov 2011 08:49:21 +1000 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <20111118210146.435b13f8@pitrou.net> References: <20111030001801.2f52ceb2@pitrou.net> <20111107103424.4c2e2ef1@resist.wooz.org> <20111118210146.435b13f8@pitrou.net> Message-ID: On Sat, Nov 19, 2011 at 6:01 AM, Antoine Pitrou wrote: > Based on the feedback received, I've finally changed the attribute name > to __qualname__. ?Pronunciation issues notwithstanding ;-), it is a > reasonable compromise between explicitness and shortness/easy-of-typing. OK, I'll update PEP 395 accordingly (and given some discussions on import-sig this week, the title of that PEP is probably going to change to "Qualified names for modules") I can't promise not to shorten it back to q-name if I ever have occasion to say it out loud, though :) Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? 
Brisbane, Australia


From ron3200 at gmail.com  Sat Nov 19 07:24:39 2011
From: ron3200 at gmail.com (Ron Adam)
Date: Sat, 19 Nov 2011 00:24:39 -0600
Subject: [Python-ideas] Exceptions thrown from generators.. patch.
Message-ID: <1321683879.22715.88.camel@Gutsy>

I was able to create a patch for testing this idea. The hard part was
in getting to know cpython well enough to do it. :-)

To get it to work, I made the following change in ceval.c so that the
main loop will accept a pointer rather than an int for the throwflag.
That allows an opcode to set it before it yields an exception. ie... a
reverse throw.

    PyObject *
    PyEval_EvalFrameEx(PyFrameObject *f, int throwflag) {
        return PyEval_EvalFrame_Ex(f, &throwflag);
    }

    PyObject *
    PyEval_EvalFrame_Ex(PyFrameObject *f, int *throwflag)
    {
        ...
        TARGET(YIELD_EXCEPT)
            *throwflag = 1;
            retval = POP();
            f->f_stacktop = stack_pointer;
            why = WHY_YIELD;
            goto fast_yield;
        ...

The genobject gen_send_ex() function checks the throwflag value after a
send to see if it got a thrown-out exception back. (Rather than one
yielded out.)

A new keyword 'throws' was needed to go with the YIELD_EXCEPT opcode. I
didn't see any way to do it with a function or method. It should be
possible to set the exception in the ceval loop rather than yielding it
out, but I think that would be more complex.

Because the exception isn't raised inside the generator code object, but
is yielded out first, the generator can be continued as if it was a
regular yielded value. No magic required, and no fiddling with the
exception stack was needed. It doesn't affect unexpected exceptions, or
exceptions raised with raise. Those will terminate a generator as
always.

    Python 3.3.0a0 (qbase qtip tip yield_except:92ac2848438f+, Nov 18 2011, 21:59:57)
    [GCC 4.6.1] on linux
    Type "help", "copyright", "credits" or "license" for more information.
    >>> def G():
    ...     yield 1
    ...     throws ValueError
    ...     yield 2
    ...
    >>> g = G()
    >>> next(g)
    1
    >>> next(g)
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
    ValueError
    >>> next(g)
    2
    >>> next(g)
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
    StopIteration

The three main benefits of being able to do this are...

* To use switch-like exception structures for flow control in schedulers
and coroutines.

* For consumer-type coroutines to be able to reject and re-request data
in a nice way without terminating. ie.. the reverse of throwing an
exception in, in order to change what a generator does.

* It creates alternative channels for data input and output by being
able to both throw exceptions in and out of generators. Those signals
can carry objects in and out and not burden the fast yield data path
with testing for special wrapper objects.

Here's an example of it being used in a simple scheduler.
-----------

class Suspend(Exception): pass

def Person(name, count, mode):
    n = 0
    while n < count:
        if mode == 0:
            # The normal data path.
            yield name, count
        else:
            # Use an exception as an alternative data path.
            throws Suspend(name, count)
        n += 1
    # return
    raise StopIteration(name, n)

def main(data, mode):
    stack = [Person(*(args + (mode,))) for args in data]
    results = []
    while stack:
        done = []
        for ct in stack:
            try:
                print('yield', *next(ct))        # from yield
            except Suspend as exc:
                print('throws', *exc.args)       # from throws
            except StopIteration as exc:
                results.append(exc.args)
                continue
            done.append(ct)
        stack = done
    print(results)
    return results

if __name__ == "__main__":
    data = [("John", 2), ("Micheal", 3), ("Terry", 4)]
    results1 = main(data, 0)
    results2 = main(data, 1)
    assert(results1 == results2 == data)

-------------
The output looks like...

yield John 2
yield Micheal 3
yield Terry 4
yield John 2
yield Micheal 3
yield Terry 4
yield Micheal 3
yield Terry 4
yield Terry 4
[('John', 2), ('Micheal', 3), ('Terry', 4)]
throws John 2
throws Micheal 3
throws Terry 4
throws John 2
throws Micheal 3
throws Terry 4
throws Micheal 3
throws Terry 4
throws Terry 4
[('John', 2), ('Micheal', 3), ('Terry', 4)]

This shows that 'throws' works a lot like 'yield'.


Open issues:

* A better name than 'throws' might be good.

* Should it get the object sent in.

      <value> = throws <exception>

  Or should it be ...

      throws <exception>

* What would be the best argument form.. Should it take the same
arguments as raise or just a single expression.

Python's test suite passes as this doesn't change anything that already
works.

I haven't tested it with the yield-from patch yet, but I think if it can
throw out exceptions in the same way yield-from yields out, that it will
make some things easier and nicer to do.

If anyone is interested, I can create a tracker item and put the patch
there where it can be improved further.

Cheers,
Ron


From greg at krypto.org  Sat Nov 19 08:55:18 2011
From: greg at krypto.org (Gregory P. Smith)
Date: Fri, 18 Nov 2011 23:55:18 -0800
Subject: [Python-ideas] Exceptions thrown from generators.. patch.
In-Reply-To: <1321683879.22715.88.camel@Gutsy>
References: <1321683879.22715.88.camel@Gutsy>
Message-ID: 

On Fri, Nov 18, 2011 at 10:24 PM, Ron Adam wrote:
>
> [snip patch description, example session and scheduler example]
>
> This shows that 'throws' works a lot like 'yield'.

neat!

> Open issues:
>
> * A better name than 'throws' might be good.

I don't like adding another keyword or confusing things by adding the
"throw" verb to a language that already firmly uses the verb "raise"
when speaking of exceptions.

A double word syntax might make sense here. It'd be good to keep
'yield' in it to make it clear that this is offering the exception out
in a non-terminal manner.

    yield raise MyException(x,y,z)

or if you've caught an exception from something and want to pass that
on to the caller without killing the iteration the natural base form of
that would be:

    yield raise

to unset the existing exception and yield it out instead, mirroring
what a bare raise within an except: clause does.

> * Should it get the object sent in.
>
>       <value> = throws <exception>
>
>   Or should it be ...
>
>       throws <exception>
>
> * What would be the best argument form.. Should it take the same
> arguments as raise or just a single expression.
>
> [snip]
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ncoghlan at gmail.com  Sat Nov 19 09:25:17 2011
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 19 Nov 2011 18:25:17 +1000
Subject: [Python-ideas] Exceptions thrown from generators.. patch.
In-Reply-To: 
References: <1321683879.22715.88.camel@Gutsy>
Message-ID: 

On Sat, Nov 19, 2011 at 5:55 PM, Gregory P. Smith wrote:
> A double word syntax might make sense here. It'd be good to keep 'yield' in
> it to make it clear that this is offering the exception out in a
> non-terminal manner.
>   yield raise MyException(x,y,z)
> or if you've caught an exception from something and want to pass that on to
> the caller without killing the iteration the natural base form of that would
> be:
>   yield raise
> to unset the existing exception and yield it out instead, mirroring what a
> bare raise within an except: clause does.

Indeed, "yield raise" would be quite appropriate terminology for this
new channel of communication.

However, there's still a potential stack unwinding problem here. For
asymmetric coroutines, we need to be able to communicate an arbitrary
distance up the stack. This idea, as it stands, doesn't provide that -
like an ordinary yielded value, it can only yield control one level
out. So even if the innermost generator is left in a resumable state,
any *external* iterators are still going to be terminated.

It potentially becomes more useful in combination with 'yield from',
since yielded exceptions would be passed up the stack, the same as
yielded values.

So, while it's a neat trick and very cool that it could be implemented
with such a small change, I still don't see how it helps substantially
with the challenge of allowing *any* Python frame on a coroutine
stack, not just generator frames.

Cheers,
Nick.

-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia


From ron3200 at gmail.com  Sat Nov 19 18:48:59 2011
From: ron3200 at gmail.com (Ron Adam)
Date: Sat, 19 Nov 2011 11:48:59 -0600
Subject: [Python-ideas] Exceptions thrown from generators.. patch.
In-Reply-To: 
References: <1321683879.22715.88.camel@Gutsy>
Message-ID: <1321724939.24665.56.camel@Gutsy>

On Sat, 2011-11-19 at 18:25 +1000, Nick Coghlan wrote:
> On Sat, Nov 19, 2011 at 5:55 PM, Gregory P. Smith wrote:
> > [snip 'yield raise' suggestion]
>
> Indeed, "yield raise" would be quite appropriate terminology for this
> new channel of communication.

I'll see if I can get that to work.
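[Editor's note: without the patch, the closest stock-Python approximation
of the behaviour discussed in this thread is to pass a wrapper object out
through the normal yield channel and type-test it in the consumer. The
sketch below is illustrative only -- the ExcSignal and worker names are
invented here -- and the per-item isinstance() test on the fast yield
path is exactly the overhead Ron's third bullet point wants to avoid.]

    class ExcSignal(Exception):
        """An exception offered out of a generator without ending it."""

    def worker():
        yield 1
        yield ExcSignal("re-request")   # offered out; worker stays resumable
        yield 2

    for item in worker():
        if isinstance(item, ExcSignal):  # the per-item test 'throws' avoids
            print("thrown out:", *item.args)
        else:
            print("yielded:", item)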
> However, there's still a potential stack unwinding problem here. For
> asymmetric coroutines, we need to be able to communicate an arbitrary
> distance up the stack. This idea, as it stands, doesn't provide that -
> like an ordinary yielded value, it can only yield control one level
> out. So even if the innermost generator is left in a resumable state,
> any *external* iterators are still going to be terminated.

It's really no different than gen.throw() only going one deep. It
doesn't go all the way to the bottom "implicitly" and then unwind back
out.

> It potentially becomes more useful in combination with 'yield from',
> since yielded exceptions would be passed up the stack, the same as
> yielded values.

Yes, I think it would work nicely with yield-from.

> So, while it's a neat trick and very cool that it could be implemented
> with such a small change, I still don't see how it helps substantially
> with the challenge of allowing *any* Python frame on a coroutine
> stack, not just generator frames.

I think *any* needs to be qualified in some way. The frame inside needs
to have some mechanism to be suspended or to pass control to someplace
else. (or through it) That mechanism needs to be explicitly spelled in
some way. We really need a good example of what that would/should look
like. What exactly is the behavior you are looking for?

Cheers,
Ron


From ron3200 at gmail.com  Sat Nov 19 23:57:51 2011
From: ron3200 at gmail.com (Ron Adam)
Date: Sat, 19 Nov 2011 16:57:51 -0600
Subject: [Python-ideas] Exceptions thrown from generators.. patch.
In-Reply-To: 
References: <1321683879.22715.88.camel@Gutsy>
Message-ID: <1321743471.25369.35.camel@Gutsy>

On Sat, 2011-11-19 at 18:25 +1000, Nick Coghlan wrote:
> So, while it's a neat trick and very cool that it could be implemented
> with such a small change, I still don't see how it helps substantially
> with the challenge of allowing *any* Python frame on a coroutine
> stack, not just generator frames.

I was thinking about this today and a few things occurred to me.

You don't want to suspend a regular function. What would happen if
another function tried to call it in its suspended state? To make that
work, each invocation would need its own frame. That is why generators
return a generator object instance.

To do that with functions would require making a function call into a
called-function-instance. To be able to suspend and resume that
instance, you will need a reference to the instance rather than the
function. So then you get something that is very much like a generator.
The only difference is it doesn't have a yield. But in order for that
to suspend, it would need some mechanism to suspend itself. Like a
yield. And then we are right back to generators.

The other approach is to use thread objects, which create instances
that can be suspended. The advantage of using threads is that the
thread manager can suspend and resume threads independently of what the
thread is doing. But that requires more resources and they may not be
as efficient in tight loops.

The two different approaches are completely separate. So I don't see
how this affects either yield-from or a possible yield-raise feature.

I'm probably missing something, but I can't put my finger on it.

Cheers,
Ron


From cmjohnson.mailinglist at gmail.com  Sun Nov 20 00:43:16 2011
From: cmjohnson.mailinglist at gmail.com (Carl M.
Johnson) Date: Sat, 19 Nov 2011 13:43:16 -1000 Subject: [Python-ideas] Except block for with-statement Message-ID: <424342F7-00E7-4C6D-AA06-3B612B37E563@gmail.com> I was looking through the "What's New for Python 3.3" file and I saw this example code: try: with open("document.txt") as f: content = f.read() except FileNotFoundError: print("document.txt file is missing") except PermissionError: print("You are not allowed to read document.txt") and I thought it would be more elegant if you could drop the try. with open("document.txt") as f: content = f.read() except FileNotFoundError: print("document.txt file is missing") except PermissionError: print("You are not allowed to read document.txt") I assume that this has already been proposed and rejected. Does anyone have a good explanation of why? ISTM that the with-statement is mostly just a way of abstracting out try-except blocks, so it might be nice to have a way of handling errors that pop up during the initialization phase instead of just the code-block phase. I guess one obvious counter argument is that naive programmers might think that the "except" applies to things in the block instead of just things in the initializer. But naive programmers think lots of wrong things. ;-) From anacrolix at gmail.com Sun Nov 20 03:40:41 2011 From: anacrolix at gmail.com (Matt Joiner) Date: Sun, 20 Nov 2011 13:40:41 +1100 Subject: [Python-ideas] Except block for with-statement In-Reply-To: <424342F7-00E7-4C6D-AA06-3B612B37E563@gmail.com> References: <424342F7-00E7-4C6D-AA06-3B612B37E563@gmail.com> Message-ID: That's a fantastic idea. As it stands context objects need to be done manually if exception handling is desired during enter. The except here could apply to the with statement only, since it's not possible to differentiate at present. Either way +1 to some consideration of Carl's proposal. On Nov 20, 2011 10:43 AM, "Carl M. Johnson" wrote: > I was looking through the "What's New for Python 3.3" file and I saw this > example code: > > try: > with open("document.txt") as f: > content = f.read() > except FileNotFoundError: > print("document.txt file is missing") > except PermissionError: > print("You are not allowed to read document.txt") > > and I thought it would be more elegant if you could drop the try. > > with open("document.txt") as f: > content = f.read() > except FileNotFoundError: > print("document.txt file is missing") > except PermissionError: > print("You are not allowed to read document.txt") > > I assume that this has already been proposed and rejected. Does anyone > have a good explanation of why? ISTM that the with-statement is mostly just > a way of abstracting out try-except blocks, so it might be nice to have a > way of handling errors that pop up during the initialization phase instead > of just the code-block phase. > > I guess one obvious counter argument is that naive programmers might think > that the "except" applies to things in the block instead of just things in > the initializer. But naive programmers think lots of wrong things. ;-) > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From anacrolix at gmail.com Sun Nov 20 03:43:05 2011 From: anacrolix at gmail.com (Matt Joiner) Date: Sun, 20 Nov 2011 13:43:05 +1100 Subject: [Python-ideas] Except block for with-statement In-Reply-To: References: <424342F7-00E7-4C6D-AA06-3B612B37E563@gmail.com> Message-ID: Oops, just read the rest of the proposal which said exactly this. On Nov 20, 2011 1:40 PM, "Matt Joiner" wrote: > That's a fantastic idea. As it stands context objects need to be done > manually if exception handling is desired during enter. The except here > could apply to the with statement only, since it's not possible to > differentiate at present. Either way +1 to some consideration of Carl's > proposal. > On Nov 20, 2011 10:43 AM, "Carl M. Johnson" < > cmjohnson.mailinglist at gmail.com> wrote: > >> I was looking through the "What's New for Python 3.3" file and I saw this >> example code: >> >> try: >> with open("document.txt") as f: >> content = f.read() >> except FileNotFoundError: >> print("document.txt file is missing") >> except PermissionError: >> print("You are not allowed to read document.txt") >> >> and I thought it would be more elegant if you could drop the try. >> >> with open("document.txt") as f: >> content = f.read() >> except FileNotFoundError: >> print("document.txt file is missing") >> except PermissionError: >> print("You are not allowed to read document.txt") >> >> I assume that this has already been proposed and rejected. Does anyone >> have a good explanation of why? ISTM that the with-statement is mostly just >> a way of abstracting out try-except blocks, so it might be nice to have a >> way of handling errors that pop up during the initialization phase instead >> of just the code-block phase. >> >> I guess one obvious counter argument is that naive programmers might >> think that the "except" applies to things in the block instead of just >> things in the initializer. But naive programmers think lots of wrong >> things. ;-) >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Sun Nov 20 05:09:12 2011 From: guido at python.org (Guido van Rossum) Date: Sat, 19 Nov 2011 20:09:12 -0800 Subject: [Python-ideas] Except block for with-statement In-Reply-To: <424342F7-00E7-4C6D-AA06-3B612B37E563@gmail.com> References: <424342F7-00E7-4C6D-AA06-3B612B37E563@gmail.com> Message-ID: On Sat, Nov 19, 2011 at 3:43 PM, Carl M. Johnson wrote: > I was looking through the "What's New for Python 3.3" file and I saw this example code: > > try: > ? ?with open("document.txt") as f: > ? ? ? ?content = f.read() > except FileNotFoundError: > ? ?print("document.txt file is missing") > except PermissionError: > ? ?print("You are not allowed to read document.txt") > > and I thought it would be more elegant if you could drop the try. > > with open("document.txt") as f: > ? ?content = f.read() > except FileNotFoundError: > ? ?print("document.txt file is missing") > except PermissionError: > ? ?print("You are not allowed to read document.txt") Eew. No please. There seems to be a trend in proposing more random variants of the existing compound statements. Please don't do this. The existing complement of compound statements is quite sufficient and the new proposals do nothing but complicate the parser, the documentation, and the learning process for occasional users. 
> I assume that this has already been proposed and rejected. Does anyone
> have a good explanation of why?

I suppose "because I say so" is not a good explanation? :-)

> ISTM that the with-statement is mostly just a way of abstracting out
> try-except blocks, so it might be nice to have a way of handling errors
> that pop up during the initialization phase instead of just the
> code-block phase.

Wrong. The with-statement abstracts out a try-*finally* block. A
try-except cannot be replaced by a with-statement (or it would have to
be a very crooked context manager).

> I guess one obvious counter argument is that naive programmers might
> think that the "except" applies to things in the block instead of just
> things in the initializer. But naive programmers think lots of wrong
> things. ;-)

That's actually a deal killer right there. (If it wasn't born dead,
that is. :-) Your proposal above, through its "this could be equivalent
to that" example, makes it seem like the except block applies to the
entire with-statement. But here you appear to say that an error in the
body of the with-block would not be covered? And your example use of
exceptions that are only raised by open() strengthens this. If a
proposal manages to confuse even its proposer, perhaps that is enough
to reject it?

-- 
--Guido van Rossum (python.org/~guido)


From tjreedy at udel.edu  Sun Nov 20 09:05:24 2011
From: tjreedy at udel.edu (Terry Reedy)
Date: Sun, 20 Nov 2011 03:05:24 -0500
Subject: [Python-ideas] Except block for with-statement
In-Reply-To: <424342F7-00E7-4C6D-AA06-3B612B37E563@gmail.com>
References: <424342F7-00E7-4C6D-AA06-3B612B37E563@gmail.com>
Message-ID: 

On 11/19/2011 6:43 PM, Carl M. Johnson wrote:
> I was looking through the "What's New for Python 3.3" file and I saw
> this example code:
>
> try:
>     with open("document.txt") as f:
>         content = f.read()
> except FileNotFoundError:
>     print("document.txt file is missing")
> except PermissionError:
>     print("You are not allowed to read document.txt")
>
> and I thought it would be more elegant if you could drop the try.
>
> with open("document.txt") as f:
>     content = f.read()
> except FileNotFoundError:
>     print("document.txt file is missing")
> except PermissionError:
>     print("You are not allowed to read document.txt")

This saves one short line. That is insufficient reason for a puzzling
construct.

-- 
Terry Jan Reedy


From ncoghlan at gmail.com  Sun Nov 20 13:07:39 2011
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 20 Nov 2011 22:07:39 +1000
Subject: [Python-ideas] Exceptions thrown from generators.. patch.
In-Reply-To: <1321743471.25369.35.camel@Gutsy>
References: <1321683879.22715.88.camel@Gutsy>
 <1321743471.25369.35.camel@Gutsy>
Message-ID: 

On Sun, Nov 20, 2011 at 8:57 AM, Ron Adam wrote:
> I'm probably missing something, but I can't put my finger on it.

Re-read the last discussion of Greg's coroutine PEP in the list
archives. If every frame on the stack has to be a generator frame,
you're effectively bifurcating Python into two languages - "normal
Python" (which uses both functions and generators) and "coroutine
Python" (which uses generators for everything).

That's a bad idea, and the reason so many people prefer the
thread-style model of greenlet based programming to the Twisted-style
"inside out" model of event driven programming.
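[Editor's note: a minimal illustration of that "generators all the way
down" constraint, with names invented for the example. Suspension only
propagates through frames that are themselves generators, so a single
ordinary function in the middle of the call stack silently swallows it.]

    def leaf():
        yield "pause"           # the suspension point

    def plain_middle():
        # An ordinary function cannot re-yield, so the pause is
        # absorbed right here instead of reaching the scheduler.
        for value in leaf():
            pass
        return "done"

    def generator_middle():
        # To stay suspendable, *every* level must itself be a
        # generator and re-yield what it receives.
        for value in leaf():
            yield value

    print(plain_middle())            # done -- the pause never escaped
    print(list(generator_middle()))  # ['pause'] -- it propagated up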
This is a post where I highlight some of the issues with bifurcating the language, as well as the fact that "generators-all-the-way-down" *does* lead to bifurcation: http://mail.python.org/pipermail/python-ideas/2011-October/012570.html Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ncoghlan at gmail.com Sun Nov 20 13:22:00 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 20 Nov 2011 22:22:00 +1000 Subject: [Python-ideas] Except block for with-statement In-Reply-To: <424342F7-00E7-4C6D-AA06-3B612B37E563@gmail.com> References: <424342F7-00E7-4C6D-AA06-3B612B37E563@gmail.com> Message-ID: On Sun, Nov 20, 2011 at 9:43 AM, Carl M. Johnson wrote: > I was looking through the "What's New for Python 3.3" file and I saw this example code: > > try: > ? ?with open("document.txt") as f: > ? ? ? ?content = f.read() > except FileNotFoundError: > ? ?print("document.txt file is missing") > except PermissionError: > ? ?print("You are not allowed to read document.txt") Hmm, this code is dubious, as the scope of the try block is too broad. It should really be written so it only covers the creation of the file object: try: f = open("document.txt") except FileNotFoundError: print("document.txt file is missing") except PermissionError: print("You are not allowed to read document.txt") else: with f: content = f.read() At that point, it's hopefully clearer why this proposal doesn't make sense. (A tracker issue suggesting that the What's New example be updated wouldn't hurt, though). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From mwm at mired.org Wed Nov 23 00:09:34 2011 From: mwm at mired.org (Mike Meyer) Date: Tue, 22 Nov 2011 15:09:34 -0800 Subject: [Python-ideas] proto-pep capturing the concurrency discusion Message-ID: I tried to capture all the ideas generated during the concurrency discussion early this month in the form of an informational PEP. I believe that information is worth preserving where others can find it, and a PEP seems like a logical place. Since it doesn't make a specific proposal, it's informational PEP. Possibly this is an abuse of the PEP mechanism, in which case I'll find another forum for it. The point this time is not to debate the ideas proposed, but to check for errors and omissions. If there's an alternative I missed, let me know. If there's a problem with some alternative I missed, let me know. If I think it's already in the document, I'll probably point it out and ask for suggestions on how to make it more obvious. Thanks, From ncoghlan at gmail.com Wed Nov 23 01:30:07 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 23 Nov 2011 10:30:07 +1000 Subject: [Python-ideas] proto-pep capturing the concurrency discusion In-Reply-To: References: Message-ID: On Wed, Nov 23, 2011 at 9:09 AM, Mike Meyer wrote: > I tried to capture all the ideas generated during the concurrency > discussion early this month in the form of an informational PEP. I > believe that information is worth preserving where others can find it, > and a PEP seems like a logical place. Since it doesn't make a specific > proposal, it's informational PEP. Possibly this is an abuse of the PEP > mechanism, in which case I'll find another forum for it. 
The Python wiki is a fairly common location for pre-PEPs that haven't really made it to the point of proposing specific changes: http://wiki.python.org/moin/PythonEnhancementProposals It also uses ReST for formatting, so you can just drop your proto-PEP into a new page and go from there. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From someuniquename at gmail.com Sun Nov 27 22:33:48 2011 From: someuniquename at gmail.com (Roman Evstifeev) Date: Mon, 28 Nov 2011 00:33:48 +0300 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: <1320516118.14236.11.camel@Gutsy> References: <20111030001801.2f52ceb2@pitrou.net> <4EB5653A.4000302@netwok.org> <1320516118.14236.11.camel@Gutsy> Message-ID: On Sat, Nov 5, 2011 at 9:01 PM, Ron Adam wrote: > On Sat, 2011-11-05 at 11:04 -0600, Eric Snow wrote: >> On Sat, Nov 5, 2011 at 10:32 AM, ?ric Araujo wrote: >> > Hi, >> > >> >> I would like to propose the following PEP for discussion and, if >> >> possible, acceptance. I think the proposal shouldn't be too >> >> controversial (I find it quite simple and straightforward myself :-)). >> >> [snip PEP] >> > >> > +1. >> > >> > For nested functions, I too think that 'f..g' has too many dots; >> > I like '.g' or '.g'. >> >> I like it too but don't think it's too many dots. >> >> The function from which the locals came _could_ be rolled into the >> brackets. ?However, in the context of some object (like the class X to >> which f belongs), 'X.f..g' makes more sense in that case >> than 'X..g', since the locals is related to f and not X. >> But, then the f is sort of redundant, so you go back to >> 'X.f..g', and '' is still sort of unambiguous. >> >> The disconnect is that is an externally anonymous namespace >> resulting from a call, rather than bound to any external namespace >> (like an object). ?Perhaps it would be appropriate to use >> 'X.f()..g' to make that clear. > > I think if you consider locals in f as an implementation detail of f's > name space rather than a sub item of f, it's not as confusing. ?It's > better to think of locals as being part of f, rather than in f. ?That is > why makes more sense than f.. ?For example locals is > in f's frame object, so if you follow that reasoning you get. > f.., ?but I don't think we need all that. > > Hmmm... ?I think it actually should be spelled... > > ? ? ?f. > > Following a pattern of... > > ? ? ?x ? ? ? ? ? ? ? ? object x > ? ? ?x.f ? ? ? ? ? ? ? f in object x > ? ? ?x.f. ? ? local g in f in x > > That's both clear and concise. > > Cheers, > ? Ron > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > May i bikeshed a bit more please? +1 to Ron 1) C.f..g 2) C..g 3) C.f. I think less verbose and confusing is variant 3 From ncoghlan at gmail.com Sun Nov 27 23:05:49 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 28 Nov 2011 08:05:49 +1000 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: References: <20111030001801.2f52ceb2@pitrou.net> <4EB5653A.4000302@netwok.org> <1320516118.14236.11.camel@Gutsy> Message-ID: No, Guido already accepted the PEP with the existing spelling. 
-- Nick Coghlan (via Gmail on Android, so likely to be more terse than usual) On Nov 28, 2011 7:34 AM, "Roman Evstifeev" wrote: > On Sat, Nov 5, 2011 at 9:01 PM, Ron Adam wrote: > > On Sat, 2011-11-05 at 11:04 -0600, Eric Snow wrote: > >> On Sat, Nov 5, 2011 at 10:32 AM, ?ric Araujo wrote: > >> > Hi, > >> > > >> >> I would like to propose the following PEP for discussion and, if > >> >> possible, acceptance. I think the proposal shouldn't be too > >> >> controversial (I find it quite simple and straightforward myself > :-)). > >> >> [snip PEP] > >> > > >> > +1. > >> > > >> > For nested functions, I too think that 'f..g' has too many > dots; > >> > I like '.g' or '.g'. > >> > >> I like it too but don't think it's too many dots. > >> > >> The function from which the locals came _could_ be rolled into the > >> brackets. However, in the context of some object (like the class X to > >> which f belongs), 'X.f..g' makes more sense in that case > >> than 'X..g', since the locals is related to f and not X. > >> But, then the f is sort of redundant, so you go back to > >> 'X.f..g', and '' is still sort of unambiguous. > >> > >> The disconnect is that is an externally anonymous namespace > >> resulting from a call, rather than bound to any external namespace > >> (like an object). Perhaps it would be appropriate to use > >> 'X.f()..g' to make that clear. > > > > I think if you consider locals in f as an implementation detail of f's > > name space rather than a sub item of f, it's not as confusing. It's > > better to think of locals as being part of f, rather than in f. That is > > why makes more sense than f.. For example locals is > > in f's frame object, so if you follow that reasoning you get. > > f.., but I don't think we need all that. > > > > Hmmm... I think it actually should be spelled... > > > > f. > > > > Following a pattern of... > > > > x object x > > x.f f in object x > > x.f. local g in f in x > > > > That's both clear and concise. > > > > Cheers, > > Ron > > _______________________________________________ > > Python-ideas mailing list > > Python-ideas at python.org > > http://mail.python.org/mailman/listinfo/python-ideas > > > > May i bikeshed a bit more please? > > +1 to Ron > > 1) C.f..g > 2) C..g > 3) C.f. > I think less verbose and confusing is variant 3 > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Mon Nov 28 00:35:36 2011 From: guido at python.org (Guido van Rossum) Date: Sun, 27 Nov 2011 15:35:36 -0800 Subject: [Python-ideas] PEP 3155 - Qualified name for classes and functions In-Reply-To: References: <20111030001801.2f52ceb2@pitrou.net> <4EB5653A.4000302@netwok.org> <1320516118.14236.11.camel@Gutsy> Message-ID: And, lest this becomes proof by authority, let me point out that the and notations are inconsistent, since at other positions in tha actual name is between the dots (or at least has a dot or the end of the string on each side). --Guido On Sun, Nov 27, 2011 at 2:05 PM, Nick Coghlan wrote: > No, Guido already accepted the PEP with the existing spelling. 
> > -- > Nick Coghlan (via Gmail on Android, so likely to be more terse than usual) > > On Nov 28, 2011 7:34 AM, "Roman Evstifeev" wrote: >> >> On Sat, Nov 5, 2011 at 9:01 PM, Ron Adam wrote: >> > On Sat, 2011-11-05 at 11:04 -0600, Eric Snow wrote: >> >> On Sat, Nov 5, 2011 at 10:32 AM, ?ric Araujo wrote: >> >> > Hi, >> >> > >> >> >> I would like to propose the following PEP for discussion and, if >> >> >> possible, acceptance. I think the proposal shouldn't be too >> >> >> controversial (I find it quite simple and straightforward myself >> >> >> :-)). >> >> >> [snip PEP] >> >> > >> >> > +1. >> >> > >> >> > For nested functions, I too think that 'f..g' has too many >> >> > dots; >> >> > I like '.g' or '.g'. >> >> >> >> I like it too but don't think it's too many dots. >> >> >> >> The function from which the locals came _could_ be rolled into the >> >> brackets. ?However, in the context of some object (like the class X to >> >> which f belongs), 'X.f..g' makes more sense in that case >> >> than 'X..g', since the locals is related to f and not X. >> >> But, then the f is sort of redundant, so you go back to >> >> 'X.f..g', and '' is still sort of unambiguous. >> >> >> >> The disconnect is that is an externally anonymous namespace >> >> resulting from a call, rather than bound to any external namespace >> >> (like an object). ?Perhaps it would be appropriate to use >> >> 'X.f()..g' to make that clear. >> > >> > I think if you consider locals in f as an implementation detail of f's >> > name space rather than a sub item of f, it's not as confusing. ?It's >> > better to think of locals as being part of f, rather than in f. ?That is >> > why makes more sense than f.. ?For example locals is >> > in f's frame object, so if you follow that reasoning you get. >> > f.., ?but I don't think we need all that. >> > >> > Hmmm... ?I think it actually should be spelled... >> > >> > ? ? ?f. >> > >> > Following a pattern of... >> > >> > ? ? ?x ? ? ? ? ? ? ? ? object x >> > ? ? ?x.f ? ? ? ? ? ? ? f in object x >> > ? ? ?x.f. ? ? local g in f in x >> > >> > That's both clear and concise. >> > >> > Cheers, >> > ? Ron >> > _______________________________________________ >> > Python-ideas mailing list >> > Python-ideas at python.org >> > http://mail.python.org/mailman/listinfo/python-ideas >> > >> >> May i bikeshed a bit more please? >> >> +1 to Ron >> >> 1) ? C.f..g >> 2) ? C..g >> 3) ? C.f. >> I think less verbose and confusing is variant 3 >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > > -- --Guido van Rossum (python.org/~guido) From g.rodola at gmail.com Wed Nov 30 14:32:24 2011 From: g.rodola at gmail.com (=?ISO-8859-1?Q?Giampaolo_Rodol=E0?=) Date: Wed, 30 Nov 2011 14:32:24 +0100 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? Message-ID: This is problably too late and I'm probably missing something but given amount of generators/iterators introduced in python 3.X (http://docs.python.org/release/3.0.1/whatsnew/3.0.html#views-and-iterators-instead-of-lists) file.readlines() seems a good case where an iterator can be more appropriate than a list. 
I realized it while writing this recipe: http://code.activestate.com/recipes/577968-log-watcher-tail-f-log/ In this specific case, having readlines() yield a single line at a time would save a lot of memory. Maybe we can introduce a new parameter to do this? Regards, --- Giampaolo http://code.google.com/p/pyftpdlib/ http://code.google.com/p/psutil/ From eliben at gmail.com Wed Nov 30 14:52:52 2011 From: eliben at gmail.com (Eli Bendersky) Date: Wed, 30 Nov 2011 15:52:52 +0200 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? In-Reply-To: References: Message-ID: On Wed, Nov 30, 2011 at 15:32, Giampaolo Rodol? wrote: > This is problably too late and I'm probably missing something but > given amount of generators/iterators introduced in python 3.X > ( > http://docs.python.org/release/3.0.1/whatsnew/3.0.html#views-and-iterators-instead-of-lists > ) > file.readlines() seems a good case where an iterator can be more > appropriate than a list. > I realized it while writing this recipe: > http://code.activestate.com/recipes/577968-log-watcher-tail-f-log/ > In this specific case, having readlines() yield a single line at a > time would save a lot of memory. > Maybe we can introduce a new parameter to do this? > Giampaolo, perhaps I'm missing something, but what's bad about having a generator instead: linesgen = (line for line in file) You'd basically now do with linesgen whatever you wanted to do with an iterator-returning file.readlines() Eli -------------- next part -------------- An HTML attachment was scrubbed... URL: From amauryfa at gmail.com Wed Nov 30 14:54:30 2011 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Wed, 30 Nov 2011 14:54:30 +0100 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? In-Reply-To: References: Message-ID: 2011/11/30 Eli Bendersky > linesgen = (line for line in file) > Shorter:: iter(file) -- Amaury Forgeot d'Arc -------------- next part -------------- An HTML attachment was scrubbed... URL: From masklinn at masklinn.net Wed Nov 30 14:55:46 2011 From: masklinn at masklinn.net (Masklinn) Date: Wed, 30 Nov 2011 14:55:46 +0100 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? In-Reply-To: References: Message-ID: On 2011-11-30, at 14:32 , Giampaolo Rodol? wrote: > This is problably too late and I'm probably missing something but > given amount of generators/iterators introduced in python 3.X > (http://docs.python.org/release/3.0.1/whatsnew/3.0.html#views-and-iterators-instead-of-lists) > file.readlines() seems a good case where an iterator can be more > appropriate than a list. Iterating on the file itself already iterates by line, what would be the value of doing it *again* for readlines()? The sizehint? Note how file objects *used* to have a `xreadlines()` method which was deprecated (and removed in Python3) in favor of iterating the file object directly. From masklinn at masklinn.net Wed Nov 30 14:57:16 2011 From: masklinn at masklinn.net (Masklinn) Date: Wed, 30 Nov 2011 14:57:16 +0100 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? 
In-Reply-To: References: Message-ID: <39647018-84A4-4190-883C-9E73A1E4A414@masklinn.net> On 2011-11-30, at 14:54 , Amaury Forgeot d'Arc wrote: > 2011/11/30 Eli Bendersky >> linesgen = (line for line in file) >> > > Shorter:: > iter(file) Even shorter:: file (iter(file) just returns the file object in Python2, and the BufferedReader in Python3, so it's a useless indirection) From __peter__ at web.de Wed Nov 30 14:59:30 2011 From: __peter__ at web.de (Peter Otten) Date: Wed, 30 Nov 2011 14:59:30 +0100 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? References: Message-ID: Giampaolo Rodol? wrote: > This is problably too late and I'm probably missing something but > given amount of generators/iterators introduced in python 3.X > (http://docs.python.org/release/3.0.1/whatsnew/3.0.html#views-and- iterators-instead-of-lists) > file.readlines() seems a good case where an iterator can be more > appropriate than a list. > I realized it while writing this recipe: > http://code.activestate.com/recipes/577968-log-watcher-tail-f-log/ > In this specific case, having readlines() yield a single line at a > time would save a lot of memory. > Maybe we can introduce a new parameter to do this? You already have an iterator -- the file itself. Why would you need another one? My observation on the Tutor mailing list is that there are no valid uses of readlines(). It's just easier to discover the readlines() method than to find out that you can iterate over the file directly. So my counter-proposal is to remove the readlines method but to leave the corresponding entry in the documentation with a a description of the "right way" to iterate over the lines of a file. Backwards compatibility be damned ;) From ubershmekel at gmail.com Wed Nov 30 15:31:14 2011 From: ubershmekel at gmail.com (Yuval Greenfield) Date: Wed, 30 Nov 2011 16:31:14 +0200 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? In-Reply-To: References: Message-ID: Since we marginally brushed the subject of backwards compatibility damning... I'm kind of surprised that files do that. Now that I check I'm surprised that StringIO does that. text = 'asdf\nword' for item in StringIO(text): != for item in text: I'm not sure if I like that. --Yuval -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.rodola at gmail.com Wed Nov 30 16:42:16 2011 From: g.rodola at gmail.com (=?ISO-8859-1?Q?Giampaolo_Rodol=E0?=) Date: Wed, 30 Nov 2011 16:42:16 +0100 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? In-Reply-To: References: Message-ID: Il 30 novembre 2011 14:59, Peter Otten <__peter__ at web.de> ha scritto: > Giampaolo Rodol? wrote: > >> This is problably too late and I'm probably missing something but >> given amount of generators/iterators introduced in python 3.X >> (http://docs.python.org/release/3.0.1/whatsnew/3.0.html#views-and- > iterators-instead-of-lists) >> file.readlines() seems a good case where an iterator can be more >> appropriate than a list. >> I realized it while writing this recipe: >> http://code.activestate.com/recipes/577968-log-watcher-tail-f-log/ >> In this specific case, having readlines() yield a single line at a >> time would save a lot of memory. >> Maybe we can introduce a new parameter to do this? > > You already have an iterator -- the file itself. Why would you need another > one? 
> > My observation on the Tutor mailing list is that there are no valid uses of > readlines(). It's just easier to discover the readlines() method than to > find out that you can iterate over the file directly. > > So my counter-proposal is to remove the readlines method but to leave the > corresponding entry in the documentation with a a description of the "right > way" to iterate over the lines of a file. Backwards compatibility be damned > ;) Yes, you're right. Now that you pointed this out I realize that my request isn't actually necessary. To be fair, readlines() is different than an iterated file object in that it provides the sizehint parameter, so it does cover a use case. Maybe the doc should just mention that unless sizehint parameter is used the recommended way to get file lines is to iterate against the file object itself. Regards, --- Giampaolo http://code.google.com/p/pyftpdlib/ http://code.google.com/p/psutil/ From merwok at netwok.org Wed Nov 30 16:51:27 2011 From: merwok at netwok.org (=?UTF-8?B?w4lyaWMgQXJhdWpv?=) Date: Wed, 30 Nov 2011 16:51:27 +0100 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? In-Reply-To: References: Message-ID: <4ED650FF.9020705@netwok.org> Le 30/11/2011 14:59, Peter Otten a ?crit : > My observation on the Tutor mailing list is that there are no valid > uses of readlines(). I disagree :) You don?t always read big files, or you may prefer to close the file handle early at the cost of memory, or you need a list to do some processing that works with lists but not iterators. FWIW, I use readlines a lot to get list of lines for unit tests, for example (in case of failure, lists make better diffs than strings). > So my counter-proposal is to remove the readlines method but to leave the > corresponding entry in the documentation with a a description of the "right > way" to iterate over the lines of a file. A doc entry for a non-existing function would be very confusing. > Backwards compatibility be damned ;) Not sure what the smiley means; compatibility is taken seriously, and removing things has a very high bar. Cheers From guido at python.org Wed Nov 30 16:54:11 2011 From: guido at python.org (Guido van Rossum) Date: Wed, 30 Nov 2011 07:54:11 -0800 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? In-Reply-To: References: Message-ID: On Wed, Nov 30, 2011 at 6:31 AM, Yuval Greenfield wrote: > Since we marginally brushed the subject of backwards compatibility > damning... > > I'm kind of surprised that files do that. Now that I check I'm surprised > that StringIO does that. > > text = 'asdf\nword' > for item in StringIO(text): != for item in text: > > I'm not sure if I like that. > We thought about that. Really. We concluded that while iterating over the characters of a file would be more "pure" but that it would rarely be useful, while iterating over the files would be incredibly useful. That was probably about 10 years ago, and I still think we made the right decision. -- --Guido van Rossum (python.org/~guido) -------------- next part -------------- An HTML attachment was scrubbed... URL: From masklinn at masklinn.net Wed Nov 30 17:18:30 2011 From: masklinn at masklinn.net (Masklinn) Date: Wed, 30 Nov 2011 17:18:30 +0100 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? 
In-Reply-To: <4ED650FF.9020705@netwok.org> References: <4ED650FF.9020705@netwok.org> Message-ID: <16CD1DD8-82FC-4DB5-A162-05FC6292E543@masklinn.net> On 2011-11-30, at 16:51 , ?ric Araujo wrote: > Le 30/11/2011 14:59, Peter Otten a ?crit : >> My observation on the Tutor mailing list is that there are no valid >> uses of readlines(). > I disagree :) You don?t always read big files, or you may prefer to > close the file handle early at the cost of memory, or you need a list to > do some processing that works with lists but not iterators. FWIW, I use > readlines a lot to get list of lines for unit tests, for example (in > case of failure, lists make better diffs than strings). Then again, the same process should be used here than the one recommended for map, filter, dict.(keys|values|items) in Python 3: `list(file)` will yield a list of lines. From merwok at netwok.org Wed Nov 30 17:23:16 2011 From: merwok at netwok.org (=?UTF-8?B?w4lyaWMgQXJhdWpv?=) Date: Wed, 30 Nov 2011 17:23:16 +0100 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? In-Reply-To: <16CD1DD8-82FC-4DB5-A162-05FC6292E543@masklinn.net> References: <4ED650FF.9020705@netwok.org> <16CD1DD8-82FC-4DB5-A162-05FC6292E543@masklinn.net> Message-ID: <4ED65874.2040201@netwok.org> Le 30/11/2011 17:18, Masklinn a ?crit : > On 2011-11-30, at 16:51 , ?ric Araujo wrote: >> You don?t always read big files, or you may prefer to >> close the file handle early at the cost of memory, or you need a list to >> do some processing that works with lists but not iterators. FWIW, I use >> readlines a lot to get list of lines for unit tests, for example (in >> case of failure, lists make better diffs than strings). > Then again, the same process should be used here than the one recommended > for map, filter, dict.(keys|values|items) in Python 3: `list(file)` will > yield a list of lines. Good point. readlines strips the end-of-line characters though, so to use list(fp) I?d have to change all my expected outputs to add \n. Cheers From __peter__ at web.de Wed Nov 30 17:38:29 2011 From: __peter__ at web.de (Peter Otten) Date: Wed, 30 Nov 2011 17:38:29 +0100 Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator? References: <4ED650FF.9020705@netwok.org> Message-ID: ?ric Araujo wrote: > Le 30/11/2011 14:59, Peter Otten a ?crit : >> My observation on the Tutor mailing list is that there are no valid >> uses of readlines(). > I disagree :) Not really, I'm relaying observations made on a list mostly frequented by newbies rather than giving my opinion. > You don?t always read big files, or you may prefer to > close the file handle early at the cost of memory, or you need a list to > do some processing that works with lists but not iterators. FWIW, I use > readlines a lot to get list of lines for unit tests, for example (in > case of failure, lists make better diffs than strings). If there were no readlines() method, would you lobby its introduction or go with list(file)? >> So my counter-proposal is to remove the readlines method but to leave the >> corresponding entry in the documentation with a a description of the >> "right way" to iterate over the lines of a file. > A doc entry for a non-existing function would be very confusing. > >> Backwards compatibility be damned ;) > Not sure what the smiley means; compatibility is taken seriously, and > removing things has a very high bar. 
I wanted to express that the change would be good for newbies, but that
I don't expect it to happen.


From merwok at netwok.org  Wed Nov 30 17:46:46 2011
From: merwok at netwok.org (Éric Araujo)
Date: Wed, 30 Nov 2011 17:46:46 +0100
Subject: [Python-ideas] Is there a reason why file.readlines()
 doesn't/can't return an iterator?
In-Reply-To: 
References: <4ED650FF.9020705@netwok.org>
Message-ID: <4ED65DF6.3060900@netwok.org>

Hi Peter,

Le 30/11/2011 17:38, Peter Otten a écrit :
> Éric Araujo wrote:
>> Le 30/11/2011 14:59, Peter Otten a écrit :
>>> My observation on the Tutor mailing list is that there are no valid
>>> uses of readlines().
>> I disagree :)
> Not really, I'm relaying observations made on a list mostly frequented by
> newbies rather than giving my opinion.

Ah, I misunderstood.

>> You don't always read big files, or you may prefer to
>> close the file handle early at the cost of memory, or you need a list to
>> do some processing that works with lists but not iterators. FWIW, I use
>> readlines a lot to get list of lines for unit tests, for example (in
>> case of failure, lists make better diffs than strings).
> If there were no readlines() method, would you lobby its introduction or go
> with list(file)?

I'd use list, if only because I need to support old Python versions.

>>> Backwards compatibility be damned ;)
>> Not sure what the smiley means; compatibility is taken seriously, and
>> removing things has a very high bar.
> I wanted to express that the change would be good for newbies, but that I
> don't expect it to happen.

Okay. You can open a report on bugs.python.org to ask that the doc for
readlines mention list(fp) as an alternative.

Regards


From __peter__ at web.de  Wed Nov 30 18:08:56 2011
From: __peter__ at web.de (Peter Otten)
Date: Wed, 30 Nov 2011 18:08:56 +0100
Subject: [Python-ideas] Is there a reason why file.readlines()
 doesn't/can't return an iterator?
References: <4ED650FF.9020705@netwok.org>
 <16CD1DD8-82FC-4DB5-A162-05FC6292E543@masklinn.net>
 <4ED65874.2040201@netwok.org>
Message-ID: 

Éric Araujo wrote:

> Good point. readlines strips the end-of-line characters though, so to
> use list(fp) I'd have to change all my expected outputs to add \n.

No, you seem to be confusing what_used_to_be_file.readlines() with
str.splitlines().

>>> open("tmp.txt").readlines()
['alpha\n', 'beta\n', 'gamma\n']
>>> open("tmp.txt").read().splitlines()
['alpha', 'beta', 'gamma']


From solipsis at pitrou.net  Wed Nov 30 18:10:10 2011
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Wed, 30 Nov 2011 18:10:10 +0100
Subject: [Python-ideas] Is there a reason why file.readlines()
 doesn't/can't return an iterator?
References: 
Message-ID: <20111130181010.005c8eb2@pitrou.net>

On Wed, 30 Nov 2011 16:42:16 +0100
Giampaolo Rodolà wrote:
> To be fair, readlines() is different than an iterated file object in
> that it provides the sizehint parameter, so it does cover a use case.
> Maybe the doc should just mention that unless sizehint parameter is
> used the recommended way to get file lines is to iterate against the
> file object itself.

For the record, I don't think I have ever used the sizehint parameter.
It probably serves a purpose but the use cases must be quite
specialized.

Regards

Antoine.
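[Editor's note: the one job the sizehint parameter does do is
bounded-memory batch processing while still handing back whole lines.
A minimal sketch, in which the file name and the process() helper are
placeholders invented for the example:]

    def process(lines):
        print(len(lines), "lines")

    with open("big.log") as f:
        while True:
            # Read whole lines until roughly 64 KB has been collected;
            # an empty list means end of file.
            batch = f.readlines(65536)
            if not batch:
                break
            process(batch)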
From jkbbwr at gmail.com  Wed Nov 30 18:30:43 2011
From: jkbbwr at gmail.com (Jakob Bowyer)
Date: Wed, 30 Nov 2011 17:30:43 +0000
Subject: [Python-ideas] Making min and max behave more like any and all
Message-ID: 

I just had something pointed out to me in #python that min and max accept
*args; I know for a fact that any and all only take a single iterable
argument.
Shouldn't we adopt one of the two ideas?

Either max/min take only iterable arguments.
OR
Allow any/all to use *args

In the second case this becomes legal

any(1 in a, 2 in b, 3 in c)

In the first case this becomes illegal

max(1,2,3,4,5)

From __peter__ at web.de  Wed Nov 30 18:53:56 2011
From: __peter__ at web.de (Peter Otten)
Date: Wed, 30 Nov 2011 18:53:56 +0100
Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator?
References: <4ED650FF.9020705@netwok.org> <4ED65DF6.3060900@netwok.org>
Message-ID: 

Éric Araujo wrote:
> Okay.  You can open a report on bugs.python.org to ask that the doc for
> readlines mention list(fp) as an alternative.

That would make sense to me only if readlines() were deprecated. I did
something slightly different and proposed a hint that the user should
avoid readlines() in a for-loop; see http://bugs.python.org/issue13510

From guido at python.org  Wed Nov 30 19:04:01 2011
From: guido at python.org (Guido van Rossum)
Date: Wed, 30 Nov 2011 10:04:01 -0800
Subject: [Python-ideas] Is there a reason why file.readlines() doesn't/can't return an iterator?
In-Reply-To: <4ED65874.2040201@netwok.org>
References: <4ED650FF.9020705@netwok.org> <16CD1DD8-82FC-4DB5-A162-05FC6292E543@masklinn.net> <4ED65874.2040201@netwok.org>
Message-ID: 

On Wed, Nov 30, 2011 at 8:23 AM, Éric Araujo wrote:
> Good point.  readlines strips the end-of-line characters though,

No it doesn't.

--
--Guido van Rossum (python.org/~guido)

From ubershmekel at gmail.com  Wed Nov 30 19:06:39 2011
From: ubershmekel at gmail.com (Yuval Greenfield)
Date: Wed, 30 Nov 2011 20:06:39 +0200
Subject: [Python-ideas] Making min and max behave more like any and all
In-Reply-To: 
References: 
Message-ID: 

On Wed, Nov 30, 2011 at 7:30 PM, Jakob Bowyer wrote:
> I just had something pointed out to me in #python that min and max accept
> *args; I know for a fact that any and all only take a single iterable
> argument.
> Shouldn't we adopt one of the two ideas?
>

any(True, False, True)
all(False, True, True)
max(1,2,3,4)
min(1,2,3,4)
sum(1,2,3,4)
list(1,2,3,4)
set(1,2,3,4)

TOOWTDI would mean to allow only one argument in all of the above.
Currently only max/min break the rule. I'm +1 for making Python
consistent, and specifically would be +0 for allowing it all.

--Yuval

Btw, even enumerate could have been a candidate, though it already has a
second argument.

From arnodel at gmail.com  Wed Nov 30 19:11:50 2011
From: arnodel at gmail.com (Arnaud Delobelle)
Date: Wed, 30 Nov 2011 18:11:50 +0000
Subject: [Python-ideas] Making min and max behave more like any and all
In-Reply-To: 
References: 
Message-ID: 

On Nov 30, 2011 5:31 PM, "Jakob Bowyer" wrote:
>
> I just had something pointed out to me in #python that min and max accept
> *args; I know for a fact that any and all only take a single iterable
> argument.
> Shouldn't we adopt one of the two ideas?
>
> Either max/min take only iterable arguments.
> OR
> Allow any/all to use *args
>
> In the second case this becomes legal
>
> any(1 in a, 2 in b, 3 in c)

I'd use 'or' here. For all, I'd use 'and'.

--
Arnaud

From guido at python.org  Wed Nov 30 19:15:14 2011
From: guido at python.org (Guido van Rossum)
Date: Wed, 30 Nov 2011 10:15:14 -0800
Subject: [Python-ideas] Making min and max behave more like any and all
In-Reply-To: 
References: 
Message-ID: 

On Wed, Nov 30, 2011 at 9:30 AM, Jakob Bowyer wrote:
> I just had something pointed out to me in #python that min and max accept
> *args; I know for a fact that any and all only take a single iterable
> argument.
> Shouldn't we adopt one of the two ideas?
>
> Either max/min take only iterable arguments.
> OR
> Allow any/all to use *args
>

This seems like consistency for consistency's sake, not for any
particularly good reason. min/max are quite different from any/all, e.g.
min() or max() without arguments makes no sense, while any() / all() on an
empty list has a perfectly fine meaning (False / True, respectively).

> In the second case this becomes legal
>
> any(1 in a, 2 in b, 3 in c)
>

But that's just alternative syntax for

1 in a or 2 in b or 3 in c

except that it doesn't have shortcut semantics, so it would be an
attractive nuisance. Note that any() / all() *do* have shortcut semantics,
in the sense that they stop iterating over the argument iterable as soon
as they have an answer that forces the outcome (True in the case of any(),
False in the case of all()). This doesn't apply to min() or max() -- you
always have to look at all values before you can decide which is the
largest. (It would be different if we had a general infinity, but we don't
-- only floating point numbers support infinity.)

> In the first case this becomes illegal
>
> max(1,2,3,4,5)
>

Which I presume is rarely needed but if necessary can easily be written as
max((1, 2, 3, 4, 5)).

--
--Guido van Rossum (python.org/~guido)

From masklinn at masklinn.net  Wed Nov 30 19:22:19 2011
From: masklinn at masklinn.net (Masklinn)
Date: Wed, 30 Nov 2011 19:22:19 +0100
Subject: [Python-ideas] Making min and max behave more like any and all
In-Reply-To: 
References: 
Message-ID: <6DA42FE6-124E-44FC-A3F0-2A56EF0E69F2@masklinn.net>

On 2011-11-30, at 19:15 , Guido van Rossum wrote:
>> In the first case this becomes illegal
>> max(1,2,3,4,5)
>
> Which I presume is rarely needed but if necessary can easily be written as
> max((1, 2, 3, 4, 5)).

The *args form is rather nice for the (pretty common, I'd think) task of
min/maxing a pair of values, one known (default) and one unknown, for
instance.

It is also a common interface for min/max functions.

From guido at python.org  Wed Nov 30 19:26:21 2011
From: guido at python.org (Guido van Rossum)
Date: Wed, 30 Nov 2011 10:26:21 -0800
Subject: [Python-ideas] Making min and max behave more like any and all
In-Reply-To: <6DA42FE6-124E-44FC-A3F0-2A56EF0E69F2@masklinn.net>
References: <6DA42FE6-124E-44FC-A3F0-2A56EF0E69F2@masklinn.net>
Message-ID: 

On Wed, Nov 30, 2011 at 10:22 AM, Masklinn wrote:
> On 2011-11-30, at 19:15 , Guido van Rossum wrote:
>>> In the first case this becomes illegal
>>> max(1,2,3,4,5)
>>
>> Which I presume is rarely needed but if necessary can easily be written
>> as max((1, 2, 3, 4, 5)).
>
> The *args form is rather nice for the (pretty common, I'd think) task of
> min/maxing a pair of values, one known (default) and one unknown, for
> instance.
>
> It is also a common interface for min/max functions.

Of course. IMO the status quo is optimal -- max(a, b) is very useful. Not
so for any(a, b) though.

--
--Guido van Rossum (python.org/~guido)
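To close with a sketch of the shortcut semantics Guido describes (the flag
helper is invented here purely for tracing): any() stops consuming its
iterable at the first true value, while max() must inspect every element
before it can answer:

>>> def flag(value):
...     print("evaluating", value)  # show which elements are actually examined
...     return value
...
>>> any(flag(x) for x in [True, False, True])  # short-circuits after one element
evaluating True
True
>>> max(flag(x) for x in [1, 3, 2])            # consumes the whole iterable
evaluating 1
evaluating 3
evaluating 2
3

And the proposed any(1 in a, 2 in b, 3 in c) would evaluate all three
membership tests before the call even starts, which is exactly the
attractive nuisance described above; the short-circuiting spelling remains
1 in a or 2 in b or 3 in c.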