From dalcinl at gmail.com Fri Apr 1 03:11:19 2011 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Thu, 31 Mar 2011 22:11:19 -0300 Subject: [Cython] Cannot profile nogil function. Error or Warn? In-Reply-To: References: Message-ID: On 29 March 2011 21:26, Lisandro Dalcin wrote: > Error compiling Cython file: > ------------------------------------------------------------ > ... > > cdef int PyMPE_Raise(int ierr) except -1 with gil: > ? ?__Pyx_Raise(RuntimeError, "MPE logging error [code: %d]" % ierr, NULL) > ? ?return 0 > > cdef inline int CHKERR(int ierr) nogil except -1: > ? ?^ > ------------------------------------------------------------ > > /home/dalcinl/Devel/mpi4py-dev/src/MPE/helpers.pxi:22:5: Cannot > profile nogil function. > > > Do we REALLY want this to be an error? Why not just a warning? > OK, I pushed a fix. Without this, using -X profile=True cannot work with any pyx source that has nogil functions. Enabling profiling should not force users to change source code. -- Lisandro Dalcin --------------- CIMEC (INTEC/CONICET-UNL) Predio CONICET-Santa Fe Colectora RN 168 Km 472, Paraje El Pozo 3000 Santa Fe, Argentina Tel: +54-342-4511594 (ext 1011) Tel/Fax: +54-342-4511169 From stefan_ml at behnel.de Fri Apr 1 10:11:56 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 01 Apr 2011 10:11:56 +0200 Subject: [Cython] Interest in contributing to the project In-Reply-To: References: Message-ID: <4D9588CC.6000303@behnel.de> Arthur de Souza Ribeiro, 29.03.2011 09:11: > Hello everybody, > > My name is Arthur de Souza Ribeiro and I'm a fourth-year student of Computer > Science in Federal University of Campina Grande, Brazil. I'm a python > programmer and have knowledge of other languages too, like Java, C, C++, Qt, > Grails and ActionScript (used in Flex framework of Adobe). > > I saw Cython project and got really interested in contributing to it. By the > way, I saw that the project is trying to participate of GSoC under Python > Software Foundation umbrella. I know the student application period have > already started, but, I'd really enjoy to participate of GSoC 2011 as a > Cython's student. Until day 8 I could work really hard to show you that I > can be selected as a GSoC student for Cython. I looked for an Ideas Page of > the project but didn't find it, Is there any idea that you have to submit a > project in GSoC? > > If possible, please tell me things that I can start doing to help the > project. Hi Arthur, sorry for the late response and thank you for your application. We are always happy about contributions. The Cython project is currently running a workshop that may yield further possible GSoC tasks, but the one we already have identified is IMHO quite a nice and self-contained one. The goal is to rewrite modules in CPython's standard library in Cython that are currently written in C. The intention is a) to simplify the implementation to make it easier for CPython developers to maintain their code base and b) to try to make the modules even faster than they are to show off Cython's optimisation capabilities (in that order, I think). A related task could be to take existing Python modules in the stdlib, to profile them, and to add external type annotations to optimise them when being compiled with Cython. Both the task of showing Cython's ability to efficiently (and compatibly) implement or compile parts of the stdlib, and the resulting testing of Cython (and bug reporting/fixing) against real world Python code would be very valuable to our project. 
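To give the annotation idea a concrete shape, here is a minimal sketch (bisect_left is just an arbitrary illustrative pick, not one of the modules meant above): the function stays plain Python and runs unchanged in CPython, while Cython's pure mode picks up the type hints when the file is compiled.

    import cython

    # Plain Python, as it could appear in the stdlib.  The decorator is a
    # no-op when running uncompiled (via Cython's shadow module) and a
    # typing hint when the module is compiled with Cython.
    @cython.locals(lo=cython.Py_ssize_t, hi=cython.Py_ssize_t, mid=cython.Py_ssize_t)
    def bisect_left(a, x, lo=0, hi=None):
        if hi is None:
            hi = len(a)
        while lo < hi:
            mid = (lo + hi) // 2
            if a[mid] < x:
                lo = mid + 1
            else:
                hi = mid
        return lo

The same annotations could equally live in an external .pxd file next to the unmodified .py module, which is what "external type annotations" means above.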
If you're interested, you could start by writing a short proposal including the modules that you would like to rewrite and what makes them interesting. Both "itertools" and "math" are certainly hot candidates, but there are definitely others, and your interest may change the priorities. If you think that's not a good project for you, please bug us again, we may be able to come up with other projects as well. Stefan From dalcinl at gmail.com Fri Apr 1 18:57:08 2011 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Fri, 1 Apr 2011 13:57:08 -0300 Subject: [Cython] implementation of cdef functions with default arguments Message-ID: Perhaps I'm missing something, but why we need the intermediate struct? Why not generate a regular C function with all the args, and then generate the call providing arguments? We could even extend this to support kwargs for calling functions in Cython, 1 - The implementation would be cleaner, IMHO. 2 - These functions cannot be easily used in external C code (or course, C code should provide all the args) 3 - We could define default args for "cdef extern" C functions, Cython would provide the arg values on call. 4 - We could add support to pass values as kwargs (well, we could do that with the current implementation). 5 - Faster code? Comments? -- Lisandro Dalcin --------------- CIMEC (INTEC/CONICET-UNL) Predio CONICET-Santa Fe Colectora RN 168 Km 472, Paraje El Pozo 3000 Santa Fe, Argentina Tel: +54-342-4511594 (ext 1011) Tel/Fax: +54-342-4511169 From arthurdesribeiro at gmail.com Sat Apr 2 03:52:40 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Fri, 1 Apr 2011 22:52:40 -0300 Subject: [Cython] Interest in contributing to the project In-Reply-To: <4D9588CC.6000303@behnel.de> References: <4D9588CC.6000303@behnel.de> Message-ID: HI Stefan, thank you very much for responding my e-mail to cython's list. About the proposal, I'd be very happy in helping the cython community doing the task 'rewrite modules in CPython's standard library in Cython that are currently written in C'. I didn't think about any special modules, but I'm going to start doing it, in my opinion, both modules you've mentioned are really good examples. I think this project could be very important, but, I don't know CPython very well, are there any examples you could suggest me to understand CPython better? I think I could do a good effort to understand this as fast as I can and we discuss more the proposal. Waiting for your reply... Best Regards.. []s Arthur 2011/4/1 Stefan Behnel > Arthur de Souza Ribeiro, 29.03.2011 09:11: > > Hello everybody, >> >> My name is Arthur de Souza Ribeiro and I'm a fourth-year student of >> Computer >> Science in Federal University of Campina Grande, Brazil. I'm a python >> programmer and have knowledge of other languages too, like Java, C, C++, >> Qt, >> Grails and ActionScript (used in Flex framework of Adobe). >> >> I saw Cython project and got really interested in contributing to it. By >> the >> way, I saw that the project is trying to participate of GSoC under Python >> Software Foundation umbrella. I know the student application period have >> already started, but, I'd really enjoy to participate of GSoC 2011 as a >> Cython's student. Until day 8 I could work really hard to show you that I >> can be selected as a GSoC student for Cython. I looked for an Ideas Page >> of >> the project but didn't find it, Is there any idea that you have to submit >> a >> project in GSoC? 
>> >> If possible, please tell me things that I can start doing to help the >> project. >> > > Hi Arthur, > > sorry for the late response and thank you for your application. We are > always happy about contributions. > > The Cython project is currently running a workshop that may yield further > possible GSoC tasks, but the one we already have identified is IMHO quite a > nice and self-contained one. The goal is to rewrite modules in CPython's > standard library in Cython that are currently written in C. The intention is > a) to simplify the implementation to make it easier for CPython developers > to maintain their code base and b) to try to make the modules even faster > than they are to show off Cython's optimisation capabilities (in that order, > I think). > > A related task could be to take existing Python modules in the stdlib, to > profile them, and to add external type annotations to optimise them when > being compiled with Cython. > > Both the task of showing Cython's ability to efficiently (and compatibly) > implement or compile parts of the stdlib, and the resulting testing of > Cython (and bug reporting/fixing) against real world Python code would be > very valuable to our project. > > If you're interested, you could start by writing a short proposal including > the modules that you would like to rewrite and what makes them interesting. > Both "itertools" and "math" are certainly hot candidates, but there are > definitely others, and your interest may change the priorities. > > If you think that's not a good project for you, please bug us again, we may > be able to come up with other projects as well. > > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan_ml at behnel.de Sat Apr 2 08:50:38 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sat, 02 Apr 2011 08:50:38 +0200 Subject: [Cython] Interest in contributing to the project In-Reply-To: References: <4D9588CC.6000303@behnel.de> Message-ID: <4D96C73E.4080600@behnel.de> Hi Arthur, Arthur de Souza Ribeiro, 02.04.2011 03:52: > HI Stefan, thank you very much for responding my e-mail to cython's list. > > About the proposal, I'd be very happy in helping the cython community doing > the task 'rewrite modules in CPython's standard library in Cython that are > currently written in C'. I didn't think about any special modules, but I'm > going to start doing it, in my opinion, both modules you've mentioned are > really good examples. Cool. > I think this project could be very important, but, I don't know CPython very > well, are there any examples you could suggest me to understand CPython > better? I think I could do a good effort to understand this as fast as I can > and we discuss more the proposal. The nice thing about this task is that you don't have to be an expert of CPython's C-API, nor a core developer of Cython. You will have to read the C code of the modules, and you will have to look up and understand what the C-API calls in the code are doing, but most of them have rather understandable names. However, you will have to program efficiently in Cython, and write fast code in it. Writing Cython code that is easy to read and maintain, and at the same time fast enough to replace the existing manually tuned C code is the challenging bit here. 
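To make that comparison concrete, here is a rough, hypothetical illustration (the C side is paraphrased from the usual C-API pattern, not copied from any actual module):

    # A hand-written C module function typically looks roughly like:
    #
    #   static PyObject *math_degrees(PyObject *self, PyObject *args) {
    #       double x;
    #       if (!PyArg_ParseTuple(args, "d:degrees", &x))
    #           return NULL;
    #       return PyFloat_FromDouble(x * (180.0 / Py_MATH_PI));
    #   }
    #
    # In Cython, the typed signature takes over the argument parsing and
    # the return value conversion:

    cdef double _RAD_TO_DEG = 180.0 / 3.14159265358979323846

    def degrees(double x):
        """Convert angle x from radians to degrees (cf. math.degrees)."""
        return x * _RAD_TO_DEG

The C that Cython generates for the def function contains essentially the same C-API calls, just written for you, which is what keeps the Cython source short and readable.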
So my advice would be to get going in Cython programming (take a look through our tutorials), and to start reading the source code of a couple of CPython stdlib modules to get an idea of what you need to translate. It would certainly help your application if you could reimplement one reasonably sized and self-contained function in a stdlib C module of your choice, and present that on the cython-users mailing list to get feedback. A couple of benchmark or profiling results comparing it to the original CPython function would round this up very nicely. Stefan From arthurdesribeiro at gmail.com Sun Apr 3 04:17:37 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Sat, 2 Apr 2011 23:17:37 -0300 Subject: [Cython] Interest in contributing to the project In-Reply-To: <4D96C73E.4080600@behnel.de> References: <4D9588CC.6000303@behnel.de> <4D96C73E.4080600@behnel.de> Message-ID: Hi Stefan, well, i took a look at CPython's source code and as you said, if we use Cython in there we could get a very more readable code without losing performance (I suppose). I took a look especially in cmathmodule.c that composes Python 3.2 source code (more recent stable version). As you said, depending on how fast I create the Cython code, we could add more modules (like socket one, for example). An example on how code would be more readable (in my opinion, please correct me if I'm wrong) is that we wouldn't have to have configuration code about what functions would compose the module, for example, in math module we have: static PyMethodDef cmath_methods[] = { {"acos", cmath_acos, METH_VARARGS, c_acos_doc}, {"acosh", cmath_acosh, METH_VARARGS, c_acosh_doc}, {"asin", cmath_asin, METH_VARARGS, c_asin_doc}, {"asinh", cmath_asinh, METH_VARARGS, c_asinh_doc}, {"atan", cmath_atan, METH_VARARGS, c_atan_doc}, {"atanh", cmath_atanh, METH_VARARGS, c_atanh_doc}, {"cos", cmath_cos, METH_VARARGS, c_cos_doc}, {"cosh", cmath_cosh, METH_VARARGS, c_cosh_doc}, {"exp", cmath_exp, METH_VARARGS, c_exp_doc}, {"isfinite", cmath_isfinite, METH_VARARGS, cmath_isfinite_doc}, {"isinf", cmath_isinf, METH_VARARGS, cmath_isinf_doc}, {"isnan", cmath_isnan, METH_VARARGS, cmath_isnan_doc}, {"log", cmath_log, METH_VARARGS, cmath_log_doc}, {"log10", cmath_log10, METH_VARARGS, c_log10_doc}, {"phase", cmath_phase, METH_VARARGS, cmath_phase_doc}, {"polar", cmath_polar, METH_VARARGS, cmath_polar_doc}, {"rect", cmath_rect, METH_VARARGS, cmath_rect_doc}, {"sin", cmath_sin, METH_VARARGS, c_sin_doc}, {"sinh", cmath_sinh, METH_VARARGS, c_sinh_doc}, {"sqrt", cmath_sqrt, METH_VARARGS, c_sqrt_doc}, {"tan", cmath_tan, METH_VARARGS, c_tan_doc}, {"tanh", cmath_tanh, METH_VARARGS, c_tanh_doc}, {NULL, NULL} /* sentinel */ }; static struct PyModuleDef cmathmodule = { PyModuleDef_HEAD_INIT, "cmath", module_doc, -1, cmath_methods, NULL, NULL, NULL, NULL }; And the init function after it, as I saw in cython (and implemented some examples), we would just have to implement the functions that the compilation would generate the object files that would be imported. But, I noticed a problem that may likely appears that is the configuration part. I mean, Cython code is compiled differently than CPython's one right? If yes, would you have an idea on how we could work on this? Another stuff that I'm getting in trouble in this initial part is how we would translate functions like PyArg_ParseTuple, any clue? I'm studing ways to replace too. 
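So far, for experimenting outside of CPython's own build system, I have been using the usual distutils route; a minimal sketch of what I build with (module and file names here are just placeholders):

    # setup.py -- build a Cython reimplementation as an ordinary extension module
    from distutils.core import setup
    from distutils.extension import Extension
    from Cython.Distutils import build_ext

    setup(
        name="cymath",
        cmdclass={"build_ext": build_ext},
        ext_modules=[Extension("cymath", ["cymath.pyx"])],
    )

I don't know yet how this would be wired into CPython's own build -- maybe by running cython once to produce the .c file and compiling that like any other C source in the tree? That part is still an open question for me.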
As you suggested, I'm practicing Cython to create functions and get more and more familiar with the language, so that I can create a very efficient cython code that would meet our expectations. I'm also reading CPython's code to see where cython can be applied. Thank you. Best Regards. []s Arthur 2011/4/2 Stefan Behnel > Hi Arthur, > > Arthur de Souza Ribeiro, 02.04.2011 03:52: > > HI Stefan, thank you very much for responding my e-mail to cython's list. >> >> About the proposal, I'd be very happy in helping the cython community >> doing >> the task 'rewrite modules in CPython's standard library in Cython that are >> currently written in C'. I didn't think about any special modules, but I'm >> going to start doing it, in my opinion, both modules you've mentioned are >> really good examples. >> > > Cool. > > > > I think this project could be very important, but, I don't know CPython >> very >> well, are there any examples you could suggest me to understand CPython >> better? I think I could do a good effort to understand this as fast as I >> can >> and we discuss more the proposal. >> > > The nice thing about this task is that you don't have to be an expert of > CPython's C-API, nor a core developer of Cython. You will have to read the C > code of the modules, and you will have to look up and understand what the > C-API calls in the code are doing, but most of them have rather > understandable names. > > However, you will have to program efficiently in Cython, and write fast > code in it. Writing Cython code that is easy to read and maintain, and at > the same time fast enough to replace the existing manually tuned C code is > the challenging bit here. > > So my advice would be to get going in Cython programming (take a look > through our tutorials), and to start reading the source code of a couple of > CPython stdlib modules to get an idea of what you need to translate. > > It would certainly help your application if you could reimplement one > reasonably sized and self-contained function in a stdlib C module of your > choice, and present that on the cython-users mailing list to get feedback. A > couple of benchmark or profiling results comparing it to the original > CPython function would round this up very nicely. > > Stefan > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From sturla at molden.no Mon Apr 4 01:49:37 2011 From: sturla at molden.no (Sturla Molden) Date: Mon, 04 Apr 2011 01:49:37 +0200 Subject: [Cython] Interest in contributing to the project In-Reply-To: References: <4D9588CC.6000303@behnel.de> <4D96C73E.4080600@behnel.de> Message-ID: <4D990791.6080301@molden.no> Den 03.04.2011 04:17, skrev Arthur de Souza Ribeiro: > > static PyMethodDef cmath_methods[] = { > {"acos", cmath_acos, METH_VARARGS, c_acos_doc}, > {"acosh", cmath_acosh, METH_VARARGS, c_acosh_doc}, > {"asin", cmath_asin, METH_VARARGS, c_asin_doc}, > {"asinh", cmath_asinh, METH_VARARGS, c_asinh_doc}, > {"atan", cmath_atan, METH_VARARGS, c_atan_doc}, > {"atanh", cmath_atanh, METH_VARARGS, c_atanh_doc}, > {"cos", cmath_cos, METH_VARARGS, c_cos_doc}, > {"cosh", cmath_cosh, METH_VARARGS, c_cosh_doc}, > {"exp", cmath_exp, METH_VARARGS, c_exp_doc}, > {"isfinite", cmath_isfinite, METH_VARARGS, cmath_isfinite_doc}, > {"isinf", cmath_isinf, METH_VARARGS, cmath_isinf_doc}, > {"isnan", cmath_isnan, METH_VARARGS, cmath_isnan_doc}, > {"log", cmath_log, METH_VARARGS, cmath_log_doc}, > {"log10", cmath_log10, METH_VARARGS, c_log10_doc}, > {"phase", cmath_phase, METH_VARARGS, cmath_phase_doc}, > {"polar", cmath_polar, METH_VARARGS, cmath_polar_doc}, > {"rect", cmath_rect, METH_VARARGS, cmath_rect_doc}, > {"sin", cmath_sin, METH_VARARGS, c_sin_doc}, > {"sinh", cmath_sinh, METH_VARARGS, c_sinh_doc}, > {"sqrt", cmath_sqrt, METH_VARARGS, c_sqrt_doc}, > {"tan", cmath_tan, METH_VARARGS, c_tan_doc}, > {"tanh", cmath_tanh, METH_VARARGS, c_tanh_doc}, > {NULL, NULL} /* sentinel */ > }; > > > static struct PyModuleDef cmathmodule = { > PyModuleDef_HEAD_INIT, > "cmath", > module_doc, > -1, > cmath_methods, > NULL, > NULL, > NULL, > NULL > }; > Cython will make this, do not care about it. You don't have to set up jump tables to make Python get the right function from Cython generated C code. If you have 22 Python-callable functions (i.e. declared def or cpdef), Cython will make a jump table for those as above. > Another stuff that I'm getting in trouble in this initial part is how > we would translate functions like PyArg_ParseTuple, any clue? I'm > studing ways to replace too. > Do not care about PyArg_ParseTuple either. It's what C Python needs to parse function call arguments from a tuple into C primitives. Cython will do this, which is some of the raison d'etre for using Cython. Also observe that any initialisation done in PyInit_cmathshould go as a module level function call in Cython, i.e. PyInit_cmathis called on import just like module level Python and Cython code. I don't have time to implement all of cmathmodule.c, but here is a starter (not tested & not complete). It might actually be that we should use "cdef complex z" instead of "cdef double complex z". There might be a distinction in the generated C/C++ code between those types, e.g. Py_complex for complex and "double _Complex" or "std::complex" for double complex, even though they are binary equivalent. I'm not sure about the state of Cython with respect to complex numbers, so just try it and see which works better :-) Also observe that we do not release the GIL here. That is not because these functions are not thread-safe, they are, but yielding the GIL will slow things terribly. Sturla cimport math cimport stdlib cdef extern from "_math.h": int Py_IS_FINITE(double) int Py_IS_NAN(double) double copysign(double,double) cdef enum special_types: ST_NINF # 0, negative infinity ST_NEG # 1, negative finite number (nonzero) ST_NZERO # 2, -0. 
ST_PZERO # 3, +0. ST_POS # 4, positive finite number (nonzero) ST_PINF # 5, positive infinity ST_NAN # 6, Not a Number cdef inline special_types special_type(double d): if (Py_IS_FINITE(d)): if (d != 0): if (copysign(1., d) == 1.): return ST_POS else return ST_NEG else: if (copysign(1., d) == 1.): return ST_PZERO else return ST_NZERO if (Py_IS_NAN(d)): return ST_NAN if (copysign(1., d) == 1.): return ST_PINF else: return ST_NINF cdef void INIT_SPECIAL_VALUES( double complex *table, double complex arc[][7]): stdlib.memcpy(table, &(src[0][0]), 7*7*sizeof(double complex)) cdef inline double complex *SPECIAL_VALUE(double complex z, double complex table[][7]): if (not Py_IS_FINITE(z.real)) or (not Py_IS_FINITE(z.imag)): errno = 0 return &(table[special_type(z.real)][special_type(z.imag)]) else: return NULL cdef double complex acosh_special_values[7][7] INIT_SPECIAL_VALUES( acos_special_values, { C(P34,INF) C(P,INF) C(P,INF) C(P,-INF) C(P,-INF) C(P34,-INF) C(N,INF) C(P12,INF) C(U,U) C(U,U) C(U,U) C(U,U) C(P12,-INF) C(N,N) C(P12,INF) C(U,U) C(P12,0.) C(P12,-0.) C(U,U) C(P12,-INF) C(P12,N) C(P12,INF) C(U,U) C(P12,0.) C(P12,-0.) C(U,U) C(P12,-INF) C(P12,N) C(P12,INF) C(U,U) C(U,U) C(U,U) C(U,U) C(P12,-INF) C(N,N) C(P14,INF) C(0.,INF) C(0.,INF) C(0.,-INF) C(0.,-INF) C(P14,-INF) C(N,INF) C(N,INF) C(N,N) C(N,N) C(N,N) C(N,N) C(N,-INF) C(N,N) }) cdef double complex cmath_acosh(double complex z): cdef double complex s1, s2, r, *psv psv = SPECIAL_VALUE(z, acosh_special_values) if (psv != NULL): return psv[0] if (math.fabs(z.real) > CM_LARGE_DOUBLE or math.fabs(z.imag) > CM_LARGE_DOUBLE): # avoid unnecessary overflow for large arguments r.real = math.log(hypot(z.real/2., z.imag/2.)) + M_LN2*2. r.imag = math.atan2(z.imag, z.real) else: s1.real = z.real - 1. s1.imag = z.imag s1 = cmath_sqrt(s1) # cdef double complex cmath_sqrt(double complex z) s2.real = z.real + 1. s2.imag = z.imag s2 = cmath_sqrt(s2) r.real = math.asinh(s1.real*s2.real + s1.imag*s2.imag) r.imag = 2.*math.atan2(s1.imag, s2.real) errno = 0 return r def acosh(object arg): """acos(x) Return the arc cosine of x.""" double complex z z = cmath_acosh( arg) return complex(z) From sturla at molden.no Mon Apr 4 01:53:57 2011 From: sturla at molden.no (Sturla Molden) Date: Mon, 04 Apr 2011 01:53:57 +0200 Subject: [Cython] Interest in contributing to the project In-Reply-To: <4D990791.6080301@molden.no> References: <4D9588CC.6000303@behnel.de> <4D96C73E.4080600@behnel.de> <4D990791.6080301@molden.no> Message-ID: <4D990895.60609@molden.no> Den 04.04.2011 01:49, skrev Sturla Molden: > Also observe that we do not release the GIL here. That is not because > these functions are not thread-safe, they are, but yielding the GIL > will slow things terribly. Oh, actually they are not thread-safe because we set errno... Sorry. Sturla From d.s.seljebotn at astro.uio.no Mon Apr 4 12:17:56 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 04 Apr 2011 12:17:56 +0200 Subject: [Cython] CEP: prange for parallel loops Message-ID: <4D999AD4.8080609@astro.uio.no> CEP up at http://wiki.cython.org/enhancements/prange """ This spec is the result of a number of discussions at Cython workshop 1. Quite a few different ways of expressing parallelism was looked at, and finally we decided to split the problem in two: * A simple and friendly solution that covers, perhaps, 80% of the cases, based on simply replacing range with prange. * Less friendly solutions for the remaining cases. 
These cases may well not even require language support in Cython, or only in indirect ways (e.g., cdef closures if normal closures are too expensive). This document focuses exclusively on the former solution and does not intend to cover all use-cases for parallel programming, only the most common ones. """ Note that me and Mark talked some more on the way to the airport, and also I got a couple of more ideas afterwards, so everybody interested should probably take a read even if you were there for discussions. Main post-workshop changes: * cython.parallel.firstiteration()/lastiteration # for in-loop if-test for thread setup/teardown blocks * An idea for how to implement numthreads(), so that we can drop the rather complex Context idea. * More thoughts on firstprivate/lastprivate Dag Sverre From d.s.seljebotn at astro.uio.no Mon Apr 4 11:43:37 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 04 Apr 2011 11:43:37 +0200 Subject: [Cython] CEP: prange for parallel loops Message-ID: <4D9992C9.4060605@student.matnat.uio.no> CEP up at http://wiki.cython.org/enhancements/prange """ This spec is the result of a number of discussions at Cython workshop 1. Quite a few different ways of expressing parallelism was looked at, and finally we decided to split the problem in two: * A simple and friendly solution that covers, perhaps, 80% of the cases, based on simply replacing range with prange. * Less friendly solutions for the remaining cases. These cases may well not even require language support in Cython, or only in indirect ways (e.g., cdef closures if normal closures are too expensive). This document focuses exclusively on the former solution and does not intend to cover all use-cases for parallel programming, only the most common ones. """ Note that me and Mark talked some more on the way to the airport, and also I got a couple of more ideas afterwards, so everybody interested should probably take a read even if you were there for discussions. Dag Sverre From d.s.seljebotn at astro.uio.no Mon Apr 4 11:47:06 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 04 Apr 2011 11:47:06 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D9992C9.4060605@student.matnat.uio.no> References: <4D9992C9.4060605@student.matnat.uio.no> Message-ID: <4D99939A.5050001@student.matnat.uio.no> On 04/04/2011 11:43 AM, Dag Sverre Seljebotn wrote: > CEP up at http://wiki.cython.org/enhancements/prange > > """ > This spec is the result of a number of discussions at Cython workshop > 1. Quite a few different ways of expressing parallelism was looked at, > and finally we decided to split the problem in two: > > * A simple and friendly solution that covers, perhaps, 80% of the > cases, based on simply replacing range with prange. > > * Less friendly solutions for the remaining cases. These cases may > well not even require language support in Cython, or only in indirect > ways (e.g., cdef closures if normal closures are too expensive). > > This document focuses exclusively on the former solution and does not > intend to cover all use-cases for parallel programming, only the most > common ones. > """ > > Note that me and Mark talked some more on the way to the airport, and > also I got a couple of more ideas afterwards, so everybody interested > should probably take a read even if you were there for discussions. 
To be more specific, here's the main post-workshop changes: * if cython.parallel.firstthreaditer()/lastthreaditer() # Use if-test in loop for thread setup/teardown * An idea for implementing threadnum() in a way so that we can drop the rather complex Context idea. * More thoughts on firstprivate/lastprivate Dag Sverre From stefan_ml at behnel.de Mon Apr 4 13:23:50 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 04 Apr 2011 13:23:50 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D999AD4.8080609@astro.uio.no> References: <4D999AD4.8080609@astro.uio.no> Message-ID: <4D99AA46.7020600@behnel.de> Dag Sverre Seljebotn, 04.04.2011 12:17: > CEP up at http://wiki.cython.org/enhancements/prange """ Variable handling Rather than explicit declaration of shared/private variables we rely on conventions: * Thread-shared: Variables that are only read and not written in the loop body are shared across threads. Variables that are only used in the else block are considered shared as well. * Thread-private: Variables that are assigned to in the loop body are thread-private. Obviously, the iteration counter is thread-private as well. * Reduction: Variables that only used on the LHS of an inplace operator, such as s above, are marked as targets for reduction. If the variable is also used in other ways (LHS of assignment or in an expression) it does instead turn into a thread-private variable. Note: This means that if one, e.g., inserts printf(... s) above, s is turned into a thread-local variable. OTOH, there is simply no way to correctly emulate the effect printf(... s) would have in a sequential loop, so such code must be discouraged anyway. """ What about simply (ab-)using Python semantics and creating a new inner scope for the prange loop body? That would basically make the loop behave like a closure function, but with the looping header at the 'right' place rather than after the closure. Also, in the example, the local variable declaration of "tmp" outside of the loop looks somewhat misplaced, although it's precedented by comprehensions (which also have their own local scope in Cython). Stefan From d.s.seljebotn at astro.uio.no Mon Apr 4 13:53:16 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 04 Apr 2011 13:53:16 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D99AA46.7020600@behnel.de> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> Message-ID: <4D99B12C.10201@astro.uio.no> On 04/04/2011 01:23 PM, Stefan Behnel wrote: > Dag Sverre Seljebotn, 04.04.2011 12:17: >> CEP up at http://wiki.cython.org/enhancements/prange > > """ > Variable handling > > Rather than explicit declaration of shared/private variables we rely > on conventions: > > * Thread-shared: Variables that are only read and not written in > the loop body are shared across threads. Variables that are only used > in the else block are considered shared as well. > > * Thread-private: Variables that are assigned to in the loop body > are thread-private. Obviously, the iteration counter is thread-private > as well. > > * Reduction: Variables that only used on the LHS of an inplace > operator, such as s above, are marked as targets for reduction. If the > variable is also used in other ways (LHS of assignment or in an > expression) it does instead turn into a thread-private variable. Note: > This means that if one, e.g., inserts printf(... s) above, s is turned > into a thread-local variable. 
OTOH, there is simply no way to > correctly emulate the effect printf(... s) would have in a sequential > loop, so such code must be discouraged anyway. > """ > > What about simply (ab-)using Python semantics and creating a new inner > scope for the prange loop body? That would basically make the loop > behave like a closure function, but with the looping header at the > 'right' place rather than after the closure. I'm not quite sure what the concrete changes to the CEP this would lead to (assuming you mean this as a proposal for alternative semantics, and not an implementation detail). How would we treat reduction variables? They need to be supported, and there's nothing in Python semantics to support reduction variables, they are a rather special case everywhere. I suppose keeping the reduction clause above, or use the "nonlocal" keyword in the loop body... Also there's the else:-block, although we could make that part of the scope. And the "lastprivate" functionality, although that could be dropped without much loss. > > Also, in the example, the local variable declaration of "tmp" outside > of the loop looks somewhat misplaced, although it's precedented by > comprehensions (which also have their own local scope in Cython). Well, depending on the decision of lastprivate, the declaration would need to be outside; I really like the idea of moving "cdef", and am prepared to drop lastprivate for this. Being explicit about thread-local variables does make things a lot safer to use. (One problem is that switching between serial and parallel one needs to move variable declarations. But that only happens once, and one can use "nthreads=1" to disable parallel after that.) An example would then be: def f(np.ndarray[double] x, double alpha): cdef double s = 0, globtmp with nogil: for i in prange(x.shape[0]): cdef double tmp # thread-private tmp = alpha * i # alpha available from global scope s += x[i] * tmp # still automatic reduction for inplace operators # printf(...s) -> now leads to error, since s is not declared thread-private but is read else: # tmp still available here...looks a bit strange, but useful s += tmp * 10 globtmp = tmp # we save tmp for later # tmp not available here, globtmp is return s Or, we just drop support for the else block on these loops. Dag Sverre From stefan_ml at behnel.de Mon Apr 4 15:04:11 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 04 Apr 2011 15:04:11 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D99B12C.10201@astro.uio.no> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> Message-ID: <4D99C1CB.2060400@behnel.de> Dag Sverre Seljebotn, 04.04.2011 13:53: > On 04/04/2011 01:23 PM, Stefan Behnel wrote: >> Dag Sverre Seljebotn, 04.04.2011 12:17: >>> CEP up at http://wiki.cython.org/enhancements/prange >> >> """ >> Variable handling >> >> Rather than explicit declaration of shared/private variables we rely on >> conventions: >> >> * Thread-shared: Variables that are only read and not written in the loop >> body are shared across threads. Variables that are only used in the else >> block are considered shared as well. >> >> * Thread-private: Variables that are assigned to in the loop body are >> thread-private. Obviously, the iteration counter is thread-private as well. >> >> * Reduction: Variables that only used on the LHS of an inplace operator, >> such as s above, are marked as targets for reduction. 
If the variable is >> also used in other ways (LHS of assignment or in an expression) it does >> instead turn into a thread-private variable. Note: This means that if >> one, e.g., inserts printf(... s) above, s is turned into a thread-local >> variable. OTOH, there is simply no way to correctly emulate the effect >> printf(... s) would have in a sequential loop, so such code must be >> discouraged anyway. >> """ >> >> What about simply (ab-)using Python semantics and creating a new inner >> scope for the prange loop body? That would basically make the loop behave >> like a closure function, but with the looping header at the 'right' place >> rather than after the closure. > > I'm not quite sure what the concrete changes to the CEP this would lead to > (assuming you mean this as a proposal for alternative semantics, and not an > implementation detail). What I would like to avoid is having to tell users "and now for something completely different". It looks like a loop, but then there's a whole page of new semantics for it. And this also cannot be used in plain Python code due to the differing scoping behaviour. > How would we treat reduction variables? They need to be supported, and > there's nothing in Python semantics to support reduction variables, they > are a rather special case everywhere. I suppose keeping the reduction > clause above, or use the "nonlocal" keyword in the loop body... That's what I thought, yes. It looks unexpected, sure. That's the clear advantage of using inner functions, which do not add anything new at all. But if we want to add something that looks more like a loop, we should at least make it behave like something that's easy to explain. Sorry for not taking the opportunity to articulate my scepticism in the workshop discussion. Skipping through the CEP now, I think this feature adds quite some complexity to the language, and I'm not sure it's worth that when compared to the existing closures. The equivalent closure+decorator syntax is certainly easier to explain, and could translate into exactly the same code. But with the clear advantage that the scope of local, nonlocal and thread-configuring variables is immediately obvious. Basically, your example would become def f(np.ndarray[double] x, double alpha): cdef double s = 0 with cython.nogil: @cython.run_parallel_for_loop( range(x.shape[0]) ) cdef threaded_loop(i): # 'nogil' is inherited cdef double tmp = alpha * i nonlocal s s += x[i] * tmp s += alpha * (x.shape[0] - 1) return s We likely agree that this is not beautiful. It's also harder to implement than a "simple" for-in-prange loop. But I find it at least easier to explain and semantically 'obvious'. And it would allow us to write a pure mode implementation for this based on the threading module. > Also there's the else:-block, although we could make that part of the > scope. Since that's supposed to run single-threaded anyway, it can be written after the loop, right? Or is there really a use case where one of the threads has to do something in parallel, especially based on its local thread state, that the others don't do? > And the "lastprivate" functionality, although that could be dropped > without much loss. I'm not sure how the "else" block and "lastprivate" could be integrated into the closures approach. 
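For the pure mode idea, a very rough sketch of what a threading-based stand-in for the hypothetical run_parallel_for_loop decorator could look like (semantics only: under the GIL this gives no speedup, and a real version would have to handle the reduction on s with a lock or per-thread partial sums rather than a bare nonlocal):

    import threading

    def run_parallel_for_loop(iterable, numthreads=4):
        indices = list(iterable)
        def decorate(loop_body):
            def worker(chunk):
                for i in chunk:
                    loop_body(i)
            # Split the iteration space and run the loop body immediately,
            # mirroring how the decorated closure above executes in place.
            chunks = [indices[t::numthreads] for t in range(numthreads)]
            threads = [threading.Thread(target=worker, args=(c,)) for c in chunks]
            for t in threads:
                t.start()
            for t in threads:
                t.join()
            return loop_body
        return decorate

At least it shows that the closure form degrades gracefully to plain Python.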
Stefan From njs at pobox.com Mon Apr 4 15:27:50 2011 From: njs at pobox.com (Nathaniel Smith) Date: Mon, 4 Apr 2011 06:27:50 -0700 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D999AD4.8080609@astro.uio.no> References: <4D999AD4.8080609@astro.uio.no> Message-ID: On Mon, Apr 4, 2011 at 3:17 AM, Dag Sverre Seljebotn wrote: > ?* A simple and friendly solution that covers, perhaps, 80% of the cases, > based on simply replacing range with prange. This is a "merely" aesthetic objection, while remaining agnostic on the larger discussion, but -- 'for i in prange(...)' looks Just Wrong. This is not a regular loop over a funny range, it's a funny loop over a regular range. Surely it should be 'pfor i in range(...)'. Or better yet, spell it 'parallel_for'. -- Nathaniel From d.s.seljebotn at astro.uio.no Mon Apr 4 15:33:02 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 04 Apr 2011 15:33:02 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D99C1CB.2060400@behnel.de> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99C1CB.2060400@behnel.de> Message-ID: <4D99C88E.6030004@astro.uio.no> On 04/04/2011 03:04 PM, Stefan Behnel wrote: > Dag Sverre Seljebotn, 04.04.2011 13:53: >> On 04/04/2011 01:23 PM, Stefan Behnel wrote: >>> Dag Sverre Seljebotn, 04.04.2011 12:17: >>>> CEP up at http://wiki.cython.org/enhancements/prange >>> >>> """ >>> Variable handling >>> >>> Rather than explicit declaration of shared/private variables we rely on >>> conventions: >>> >>> * Thread-shared: Variables that are only read and not written in the >>> loop >>> body are shared across threads. Variables that are only used in the >>> else >>> block are considered shared as well. >>> >>> * Thread-private: Variables that are assigned to in the loop body are >>> thread-private. Obviously, the iteration counter is thread-private >>> as well. >>> >>> * Reduction: Variables that only used on the LHS of an inplace >>> operator, >>> such as s above, are marked as targets for reduction. If the >>> variable is >>> also used in other ways (LHS of assignment or in an expression) it does >>> instead turn into a thread-private variable. Note: This means that if >>> one, e.g., inserts printf(... s) above, s is turned into a thread-local >>> variable. OTOH, there is simply no way to correctly emulate the effect >>> printf(... s) would have in a sequential loop, so such code must be >>> discouraged anyway. >>> """ >>> >>> What about simply (ab-)using Python semantics and creating a new inner >>> scope for the prange loop body? That would basically make the loop >>> behave >>> like a closure function, but with the looping header at the 'right' >>> place >>> rather than after the closure. >> >> I'm not quite sure what the concrete changes to the CEP this would >> lead to >> (assuming you mean this as a proposal for alternative semantics, and >> not an >> implementation detail). > > What I would like to avoid is having to tell users "and now for > something completely different". It looks like a loop, but then > there's a whole page of new semantics for it. And this also cannot be > used in plain Python code due to the differing scoping behaviour. Well, at least it's better than the 300 pages of semantics for OpenMP :-) > > >> How would we treat reduction variables? They need to be supported, and >> there's nothing in Python semantics to support reduction variables, they >> are a rather special case everywhere. 
I suppose keeping the reduction >> clause above, or use the "nonlocal" keyword in the loop body... > > That's what I thought, yes. It looks unexpected, sure. That's the > clear advantage of using inner functions, which do not add anything > new at all. But if we want to add something that looks more like a > loop, we should at least make it behave like something that's easy to > explain. > > Sorry for not taking the opportunity to articulate my scepticism in > the workshop discussion. I like the idea of considering cdef/nonlocal in the prange blocks. But, yes, I do feel that opposing a parallel loop construct in general is rather late, or at least could have been done at a more convenient time... All I know and care about is that a decorator-and-closure solution will be a lot more obscure among non-CS people who have no clue what a closure or decorator is, and those are exactly the people who need this kind of simple 80%-solution. You and me don't really need any support from Cython at all to write multithreaded apps (leaving aesthetics and number of keystrokes to the side). It'd be good to hear Robert's and Mark's opinions before going further, let's economise this thread a bit. Dag Sverre From sturla at molden.no Mon Apr 4 15:33:52 2011 From: sturla at molden.no (Sturla Molden) Date: Mon, 04 Apr 2011 15:33:52 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D99C1CB.2060400@behnel.de> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99C1CB.2060400@behnel.de> Message-ID: <4D99C8C0.2020700@molden.no> Den 04.04.2011 15:04, skrev Stefan Behnel: > > What I would like to avoid is having to tell users "and now for > something completely different". It looks like a loop, but then > there's a whole page of new semantics for it. And this also cannot be > used in plain Python code due to the differing scoping behaviour. > I've been working on something similar, which does not involve any changes to Cython, and will work from Python as well. It's been discussed before, basically it involves wrapping a loop in a closure, and then normal Python scoping rules applies. cdef int n @parallel def _parallel_loop(parallel_env): cdef int i, s0, s1 for s0,s1 in parallel_env.range(n): for i in range(s0,s1): pass I am not happy about the verbosity of the wrapper compared to for i in prange(n): pass but this is the best I can do without changing the compiler. Notice e.g. that the loop becomes two nested loops, which is required for efficient work scheduling. Progress is mainly limited by lack of time and personal need. If I ned parallel computing I use Fortran or an optimized LAPACK library (e.g. ACML). Sturla From d.s.seljebotn at astro.uio.no Mon Apr 4 15:51:23 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 04 Apr 2011 15:51:23 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> Message-ID: <4D99CCDB.7000406@astro.uio.no> On 04/04/2011 03:27 PM, Nathaniel Smith wrote: > On Mon, Apr 4, 2011 at 3:17 AM, Dag Sverre Seljebotn > wrote: >> * A simple and friendly solution that covers, perhaps, 80% of the cases, >> based on simply replacing range with prange. > This is a "merely" aesthetic objection, while remaining agnostic on > the larger discussion, but -- 'for i in prange(...)' looks Just Wrong. > This is not a regular loop over a funny range, it's a funny loop over > a regular range. Surely it should be 'pfor i in range(...)'. 
Or better > yet, spell it 'parallel_for'. I don't mind calling it "parallel_for" myself, if only a good place to provide scheduling parameters (numthreads, dynamic vs. static scheduling, chunksize) can be found. That would make it more obvious that scoping rules are different too. No sense in discussing this further until the higher-level discussion on whether to do it or not has completed though. Dag Sverre From markflorisson88 at gmail.com Mon Apr 4 17:22:20 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 4 Apr 2011 17:22:20 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D99B12C.10201@astro.uio.no> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> Message-ID: On 4 April 2011 13:53, Dag Sverre Seljebotn wrote: > On 04/04/2011 01:23 PM, Stefan Behnel wrote: >> >> Dag Sverre Seljebotn, 04.04.2011 12:17: >>> >>> CEP up at http://wiki.cython.org/enhancements/prange >> >> """ >> Variable handling >> >> Rather than explicit declaration of shared/private variables we rely on >> conventions: >> >> ? ?* Thread-shared: Variables that are only read and not written in the >> loop body are shared across threads. Variables that are only used in the >> else block are considered shared as well. >> >> ? ?* Thread-private: Variables that are assigned to in the loop body are >> thread-private. Obviously, the iteration counter is thread-private as well. >> >> ? ?* Reduction: Variables that only used on the LHS of an inplace >> operator, such as s above, are marked as targets for reduction. If the >> variable is also used in other ways (LHS of assignment or in an expression) >> it does instead turn into a thread-private variable. Note: This means that >> if one, e.g., inserts printf(... s) above, s is turned into a thread-local >> variable. OTOH, there is simply no way to correctly emulate the effect >> printf(... s) would have in a sequential loop, so such code must be >> discouraged anyway. >> """ >> >> What about simply (ab-)using Python semantics and creating a new inner >> scope for the prange loop body? That would basically make the loop behave >> like a closure function, but with the looping header at the 'right' place >> rather than after the closure. > > I'm not quite sure what the concrete changes to the CEP this would lead to > (assuming you mean this as a proposal for alternative semantics, and not an > implementation detail). > > How would we treat reduction variables? They need to be supported, and > there's nothing in Python semantics to support reduction variables, they are > a rather special case everywhere. I suppose keeping the reduction clause > above, or use the "nonlocal" keyword in the loop body... > > Also there's the else:-block, although we could make that part of the scope. > And the "lastprivate" functionality, although that could be dropped without > much loss. > >> >> Also, in the example, the local variable declaration of "tmp" outside of >> the loop looks somewhat misplaced, although it's precedented by >> comprehensions (which also have their own local scope in Cython). > > Well, depending on the decision of lastprivate, the declaration would need > to be outside; I really like the idea of moving "cdef", and am prepared to > drop lastprivate for this. > > Being explicit about thread-local variables does make things a lot safer to > use. > > (One problem is that switching between serial and parallel one needs to move > variable declarations. 
But that only happens once, and one can use > "nthreads=1" to disable parallel after that.) > > An example would then be: > > def f(np.ndarray[double] x, double alpha): > ? ?cdef double s = 0, globtmp > ? ?with nogil: > ? ? ? ?for i in prange(x.shape[0]): > ? ? ? ? ? ?cdef double tmp # thread-private > ? ? ? ? ? ?tmp = alpha * i # alpha available from global scope > ? ? ? ? ? ?s += x[i] * tmp # still automatic reduction for inplace operators > ? ? ? ? ? ?# printf(...s) -> now leads to error, since s is not declared > thread-private but is read > ? ? ? ?else: > ? ? ? ? ? ?# tmp still available here...looks a bit strange, but useful > ? ? ? ? ? ?s += tmp * 10 > ? ? ? ? ? ?globtmp = tmp # we save tmp for later > ? ? ? ?# tmp not available here, globtmp is > ? ?return s > > Or, we just drop support for the else block on these loops. I think since we are disallowing break (yet) we shouldn't support the else clause. Basically, I think we can make the CEP a tad more simple. I think we could declare everything outside of the prange body. Then, in the prange loop body: if a variable is assigned to anywhere -> make it lastprivate - if a variable is read before assigned to -> make it firstprivate in addition to lastprivate (raise compiler error if the variable is not initialized outside of the loop body) if a variable is only ever read -> make it shared (the default for OpenMP) if a variable has an inplace operator -> make it a reduction There is really no reason to disallow reading of the reduction variable (in e.g. a printf). The reduction should also be initialized outside of the prange body. Then prange() could be implemented in pure mode as simply the sequential version, i.e. range() which some more arguments. For any scratch space buffers etc, I'd prefer something like with cython.parallel: cdef char *buf = malloc(100) for i in prange(n): use buf free(buf) At least it fits my brain pretty well :) (this code does however assume that malloc is thread-safe). Anyway, I'm not sure I just covered all cases, but what do you think? > Dag Sverre > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From d.s.seljebotn at astro.uio.no Mon Apr 4 19:01:46 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 04 Apr 2011 19:01:46 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D99C1CB.2060400@behnel.de> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99C1CB.2060400@behnel.de> Message-ID: <4D99F97A.70003@astro.uio.no> On 04/04/2011 03:04 PM, Stefan Behnel wrote: > > That's what I thought, yes. It looks unexpected, sure. That's the > clear advantage of using inner functions, which do not add anything > new at all. But if we want to add something that looks more like a > loop, we should at least make it behave like something that's easy to > explain. > > Sorry for not taking the opportunity to articulate my scepticism in > the workshop discussion. Skipping through the CEP now, I think this > feature adds quite some complexity to the language, and I'm not sure > it's worth that when compared to the existing closures. The equivalent > closure+decorator syntax is certainly easier to explain, and could > translate into exactly the same code. But with the clear advantage > that the scope of local, nonlocal and thread-configuring variables is > immediately obvious. 
> > Basically, your example would become > > def f(np.ndarray[double] x, double alpha): > cdef double s = 0 > > with cython.nogil: > @cython.run_parallel_for_loop( range(x.shape[0]) ) > cdef threaded_loop(i): # 'nogil' is inherited > cdef double tmp = alpha * i > nonlocal s > s += x[i] * tmp > s += alpha * (x.shape[0] - 1) > return s > > We likely agree that this is not beautiful. It's also harder to > implement than a "simple" for-in-prange loop. But I find it at least > easier to explain and semantically 'obvious'. And it would allow us to > write a pure mode implementation for this based on the threading module. Short clarification on this example: There is still magic going on here in the reduction variable -- one must have a version of "s" for each thread, and then reduce at the end. (Stefan: I realize that you may know this, I'm just making sure everything is stated clearly in this discussion.) Dag Sverre From d.s.seljebotn at astro.uio.no Mon Apr 4 19:18:40 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 04 Apr 2011 19:18:40 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> Message-ID: <4D99FD70.6010408@astro.uio.no> On 04/04/2011 05:22 PM, mark florisson wrote: > On 4 April 2011 13:53, Dag Sverre Seljebotn wrote: >> On 04/04/2011 01:23 PM, Stefan Behnel wrote: >>> Dag Sverre Seljebotn, 04.04.2011 12:17: >>>> CEP up at http://wiki.cython.org/enhancements/prange >>> """ >>> Variable handling >>> >>> Rather than explicit declaration of shared/private variables we rely on >>> conventions: >>> >>> * Thread-shared: Variables that are only read and not written in the >>> loop body are shared across threads. Variables that are only used in the >>> else block are considered shared as well. >>> >>> * Thread-private: Variables that are assigned to in the loop body are >>> thread-private. Obviously, the iteration counter is thread-private as well. >>> >>> * Reduction: Variables that only used on the LHS of an inplace >>> operator, such as s above, are marked as targets for reduction. If the >>> variable is also used in other ways (LHS of assignment or in an expression) >>> it does instead turn into a thread-private variable. Note: This means that >>> if one, e.g., inserts printf(... s) above, s is turned into a thread-local >>> variable. OTOH, there is simply no way to correctly emulate the effect >>> printf(... s) would have in a sequential loop, so such code must be >>> discouraged anyway. >>> """ >>> >>> What about simply (ab-)using Python semantics and creating a new inner >>> scope for the prange loop body? That would basically make the loop behave >>> like a closure function, but with the looping header at the 'right' place >>> rather than after the closure. >> I'm not quite sure what the concrete changes to the CEP this would lead to >> (assuming you mean this as a proposal for alternative semantics, and not an >> implementation detail). >> >> How would we treat reduction variables? They need to be supported, and >> there's nothing in Python semantics to support reduction variables, they are >> a rather special case everywhere. I suppose keeping the reduction clause >> above, or use the "nonlocal" keyword in the loop body... >> >> Also there's the else:-block, although we could make that part of the scope. >> And the "lastprivate" functionality, although that could be dropped without >> much loss. 
>> >>> Also, in the example, the local variable declaration of "tmp" outside of >>> the loop looks somewhat misplaced, although it's precedented by >>> comprehensions (which also have their own local scope in Cython). >> Well, depending on the decision of lastprivate, the declaration would need >> to be outside; I really like the idea of moving "cdef", and am prepared to >> drop lastprivate for this. >> >> Being explicit about thread-local variables does make things a lot safer to >> use. >> >> (One problem is that switching between serial and parallel one needs to move >> variable declarations. But that only happens once, and one can use >> "nthreads=1" to disable parallel after that.) >> >> An example would then be: >> >> def f(np.ndarray[double] x, double alpha): >> cdef double s = 0, globtmp >> with nogil: >> for i in prange(x.shape[0]): >> cdef double tmp # thread-private >> tmp = alpha * i # alpha available from global scope >> s += x[i] * tmp # still automatic reduction for inplace operators >> # printf(...s) -> now leads to error, since s is not declared >> thread-private but is read >> else: >> # tmp still available here...looks a bit strange, but useful >> s += tmp * 10 >> globtmp = tmp # we save tmp for later >> # tmp not available here, globtmp is >> return s >> >> Or, we just drop support for the else block on these loops. > I think since we are disallowing break (yet) we shouldn't support the > else clause. Basically, I think we can make the CEP a tad more simple. > > I think we could declare everything outside of the prange body. Then, > in the prange loop body: > > if a variable is assigned to anywhere -> make it lastprivate > - if a variable is read before assigned to -> make it > firstprivate in addition to lastprivate (raise compiler error if the > variable is not initialized outside of the loop body) > > if a variable is only ever read -> make it shared (the default for OpenMP) > > if a variable has an inplace operator -> make it a reduction > > There is really no reason to disallow reading of the reduction > variable (in e.g. a printf). The reduction should also be initialized > outside of the prange body. The reason for disallowing reading the reduction variable is that otherwise you have a contradiction above, since a reduction variable may also be a thread-local variable. Or, you disable inplace operators for thread-local variables? (ugh) That's the main reason I'm leaning towards explicit declaring local variables using "cdef". If we're reducing complexity BTW, I'd rather remove firstprivate/lastprivate alltogether, see below. > Then prange() could be implemented in pure mode as simply the > sequential version, i.e. range() which some more arguments. > > For any scratch space buffers etc, I'd prefer something like > > > with cython.parallel: > cdef char *buf = malloc(100) > > for i in prange(n): > use buf > > free(buf) > > At least it fits my brain pretty well :) (this code does however > assume that malloc is thread-safe). Yes...perhaps a cython.parellel block will make everybody happy: - It's more obvious that we create a new scope, which at least answers some of Stefan's complaints - We can use normal "for i in range", and put scheduling params on parallel(), which makes Nathaniel happy In this case I'd say we simply do not support firstprivate, all thread-local variables must be declared in the block, and for firstprivate behaviour you just initialize them yourself which is more explicit and Pythonic. 
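Spelled out, that could look roughly like this (a pure sketch of the proposed semantics, nothing decided -- the parameter names on parallel() and declaring cdef inside a block are exactly the parts still under discussion; malloc/free would come from libc.stdlib, and x, alpha, s, i are as in the earlier example):

    with nogil, cython.parallel(numthreads=4, schedule='static'):
        cdef double tmp                      # thread-private, declared inside the block
        cdef char *buf = <char*>malloc(100)  # per-thread scratch space
        for i in range(x.shape[0]):          # plain range; scheduling params live on parallel()
            tmp = alpha * i
            s += x[i] * tmp                  # inplace operator -> reduction into the shared s
        free(buf)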
The "else:"-block on loops is still useful for lastprivate behaviour -- the point of executing the else block in one of the threads is that you can then copy thread-local variables of the "last" thread into shared variables to get lastprivate behaviour (again, more explicit and Python). If we allow "with cython.nogil, cython.parallel" we can keep the same number of indentation levels in some cases. Also, I think there's still a use for my num_threads_that_would_spawn(), so that the malloc can be moved out to a GIL-holding section if one wants to -- I may want to allocate with a NumPy array instead of malloc. Dag Sverre From markflorisson88 at gmail.com Mon Apr 4 21:26:34 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 4 Apr 2011 21:26:34 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D99FD70.6010408@astro.uio.no> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> Message-ID: On 4 April 2011 19:18, Dag Sverre Seljebotn wrote: > On 04/04/2011 05:22 PM, mark florisson wrote: >> >> On 4 April 2011 13:53, Dag Sverre Seljebotn >> ?wrote: >>> >>> On 04/04/2011 01:23 PM, Stefan Behnel wrote: >>>> >>>> Dag Sverre Seljebotn, 04.04.2011 12:17: >>>>> >>>>> CEP up at http://wiki.cython.org/enhancements/prange >>>> >>>> """ >>>> Variable handling >>>> >>>> Rather than explicit declaration of shared/private variables we rely on >>>> conventions: >>>> >>>> ? ?* Thread-shared: Variables that are only read and not written in the >>>> loop body are shared across threads. Variables that are only used in the >>>> else block are considered shared as well. >>>> >>>> ? ?* Thread-private: Variables that are assigned to in the loop body are >>>> thread-private. Obviously, the iteration counter is thread-private as >>>> well. >>>> >>>> ? ?* Reduction: Variables that only used on the LHS of an inplace >>>> operator, such as s above, are marked as targets for reduction. If the >>>> variable is also used in other ways (LHS of assignment or in an >>>> expression) >>>> it does instead turn into a thread-private variable. Note: This means >>>> that >>>> if one, e.g., inserts printf(... s) above, s is turned into a >>>> thread-local >>>> variable. OTOH, there is simply no way to correctly emulate the effect >>>> printf(... s) would have in a sequential loop, so such code must be >>>> discouraged anyway. >>>> """ >>>> >>>> What about simply (ab-)using Python semantics and creating a new inner >>>> scope for the prange loop body? That would basically make the loop >>>> behave >>>> like a closure function, but with the looping header at the 'right' >>>> place >>>> rather than after the closure. >>> >>> I'm not quite sure what the concrete changes to the CEP this would lead >>> to >>> (assuming you mean this as a proposal for alternative semantics, and not >>> an >>> implementation detail). >>> >>> How would we treat reduction variables? They need to be supported, and >>> there's nothing in Python semantics to support reduction variables, they >>> are >>> a rather special case everywhere. I suppose keeping the reduction clause >>> above, or use the "nonlocal" keyword in the loop body... >>> >>> Also there's the else:-block, although we could make that part of the >>> scope. >>> And the "lastprivate" functionality, although that could be dropped >>> without >>> much loss. 
>>> >>>> Also, in the example, the local variable declaration of "tmp" outside of >>>> the loop looks somewhat misplaced, although it's precedented by >>>> comprehensions (which also have their own local scope in Cython). >>> >>> Well, depending on the decision of lastprivate, the declaration would >>> need >>> to be outside; I really like the idea of moving "cdef", and am prepared >>> to >>> drop lastprivate for this. >>> >>> Being explicit about thread-local variables does make things a lot safer >>> to >>> use. >>> >>> (One problem is that switching between serial and parallel one needs to >>> move >>> variable declarations. But that only happens once, and one can use >>> "nthreads=1" to disable parallel after that.) >>> >>> An example would then be: >>> >>> def f(np.ndarray[double] x, double alpha): >>> ? ?cdef double s = 0, globtmp >>> ? ?with nogil: >>> ? ? ? ?for i in prange(x.shape[0]): >>> ? ? ? ? ? ?cdef double tmp # thread-private >>> ? ? ? ? ? ?tmp = alpha * i # alpha available from global scope >>> ? ? ? ? ? ?s += x[i] * tmp # still automatic reduction for inplace >>> operators >>> ? ? ? ? ? ?# printf(...s) -> ?now leads to error, since s is not declared >>> thread-private but is read >>> ? ? ? ?else: >>> ? ? ? ? ? ?# tmp still available here...looks a bit strange, but useful >>> ? ? ? ? ? ?s += tmp * 10 >>> ? ? ? ? ? ?globtmp = tmp # we save tmp for later >>> ? ? ? ?# tmp not available here, globtmp is >>> ? ?return s >>> >>> Or, we just drop support for the else block on these loops. >> >> I think since we are disallowing break (yet) we shouldn't support the >> else clause. Basically, I think we can make the CEP a tad more simple. >> >> I think we could declare everything outside of the prange body. Then, >> in the prange loop body: >> >> ? ? if a variable is assigned to anywhere -> ?make it lastprivate >> ? ? ? ? - if a variable is read before assigned to -> ?make it >> firstprivate in addition to lastprivate (raise compiler error if the >> variable is not initialized outside of the loop body) >> >> ? ? if a variable is only ever read -> ?make it shared (the default for >> OpenMP) >> >> ? ? if a variable has an inplace operator -> ?make it a reduction >> >> There is really no reason to disallow reading of the reduction >> variable (in e.g. a printf). The reduction should also be initialized >> outside of the prange body. > > The reason for disallowing reading the reduction variable is that otherwise > you have a contradiction above, since a reduction variable may also be a > thread-local variable. Or, you disable inplace operators for thread-local > variables? (ugh) Yes, an inplace operator would make it a reduction variable, just like assigning something makes it lastprivate, only reading makes it shared and reading before writing makes it firstprivate in addition to lastprivate. This is all implicit. Alternatively, if you want it more explicit, then instead of the inplace operator you could allow something like sum = cython.parallel.reduction('+', sum) + var1 * var2 instead of sum += var1 * var2 > That's the main reason I'm leaning towards explicit declaring local > variables using "cdef". > > If we're reducing complexity BTW, I'd rather remove firstprivate/lastprivate > alltogether, see below. >> Then prange() could be implemented in pure mode as simply the >> sequential version, i.e. range() which some more arguments. >> >> For any scratch space buffers etc, I'd prefer something like >> >> >> with cython.parallel: >> ? ? cdef char *buf = malloc(100) >> >> ? ? 
for i in prange(n): >> ? ? ? ? use buf >> >> ? ? free(buf) >> >> At least it fits my brain pretty well :) (this code does however >> assume that malloc is thread-safe). > > Yes...perhaps a cython.parellel block will make everybody happy: > > ?- It's more obvious that we create a new scope, which at least answers some > of Stefan's complaints > > ?- We can use normal "for i in range", and put scheduling params on > parallel(), which makes Nathaniel happy That doesn't sound intuitive, as the scheduling pertains to the worksharing 'for' construct, and not the entire parallel region. So scheduling parameters should be provided to e.g. cython.parallel.range() (or cython.prange, cython.parallel_range, whatever). Then if cython.parallel.range() is in a 'with cython.parallel' block, it would have '#pragma omp for' semantics (considering OpenMP), whereas it would be a '#pragma omp parallel for' if not closely nested in such a block. > In this case I'd say we simply do not support firstprivate, all thread-local > variables must be declared in the block, and for firstprivate behaviour you > just initialize them yourself which is more explicit and Pythonic. The > "else:"-block on loops is still useful for lastprivate behaviour -- the > point of executing the else block in one of the threads is that you can then > copy thread-local variables of the "last" thread into shared variables to > get lastprivate behaviour (again, more explicit and Python). Why? They are entirely implicit in my proposal, and intuitively so. Having the parallel range match the sequential range semantics in this way feel much more Pythonic than having to copy things over in an else block and having to declare and define simple variables in a special place. So basically you keep your options open: a simple and very concise way to do a parallel range, and a slightly more convoluted way if you need to initialize some thread-local buffers. And the good thing is, you can move back to the sequential range by simply renaming cython.parallel.range to range. > If we allow "with cython.nogil, cython.parallel" we can keep the same number > of indentation levels in some cases. Yeah that would be nice. We could also make cython.parallel implicitly nogil, but your approach is more flexible if we want to allow this construct with the gil in the future. > Also, I think there's still a use for my num_threads_that_would_spawn(), so > that the malloc can be moved out to a GIL-holding section if one wants to -- > I may want to allocate with a NumPy array instead of malloc. Yeah we can keep that in for full flexibility. 
> Dag Sverre > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > For clarity, I'll add an example: def f(np.ndarray[double] x, double alpha): cdef double s = 0 cdef double tmp = 2 cdef double other = 6.6 with nogil: for i in prange(x.shape[0]): # reading 'tmp' makes it firstprivate in addition to lastprivate # 'other' is only ever read, so it's shared printf("%lf %lf %lf\n", tmp, s, other) # assigning 'tmp' makes it lastprivate tmp = alpha * i # using += on 's' makes it a reduction variable with operator '+' s += x[i] * tmp # at this point, all variables s, tmp and other are well defined return s NOTE: any variable that is determined firstprivate, shared or reduction must be defined, so there is no place for implicit behaviour biting you in the behind From vitja.makarov at gmail.com Mon Apr 4 22:06:41 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Mon, 4 Apr 2011 22:06:41 +0200 Subject: [Cython] problem building master with python3 In-Reply-To: References: Message-ID: 2011/4/4 Darren Dale : > On Mon, Apr 4, 2011 at 3:32 PM, Darren Dale wrote: >> I'm attempting to install cython from the git repository to benefit >> from this fix: http://trac.cython.org/cython_trac/ticket/597 . When I >> run "python3 setup.py install --user", I get an error: >> >> cythoning /Users/darren/Projects/cython/Cython/Compiler/Code.py to >> /Users/darren/Projects/cython/Cython/Compiler/Code.c >> >> Error compiling Cython file: >> ------------------------------------------------------------ >> ... >> ? ? ? ?self.cname = cname >> ? ? ? ?self.text = text >> ? ? ? ?self.escaped_value = StringEncoding.escape_byte_string(byte_string) >> ? ? ? ?self.py_strings = None >> >> ? ?def get_py_string_const(self, encoding, identifier=None, is_str=False): >> ? ^ >> ------------------------------------------------------------ >> >> Cython/Compiler/Code.py:320:4: Signature not compatible with previous >> declaration >> >> Error compiling Cython file: >> ------------------------------------------------------------ >> ... >> ? ?cdef public object text >> ? ?cdef public object escaped_value >> ? ?cdef public dict py_strings >> >> ? ?@cython.locals(intern=bint, is_str=bint, is_unicode=bint) >> ? ?cpdef get_py_string_const(self, encoding, identifier=*, is_str=*) >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ^ >> ------------------------------------------------------------ >> >> Cython/Compiler/Code.pxd:64:30: Previous declaration is here >> building 'Cython.Compiler.Code' extension >> /usr/bin/gcc-4.2 -fno-strict-aliasing -fno-common -dynamic -DNDEBUG -g >> -fwrapv -O3 -Wall -Wstrict-prototypes -O2 >> -I/opt/local/Library/Frameworks/Python.framework/Versions/3.2/include/python3.2m >> -c /Users/darren/Projects/cython/Cython/Compiler/Code.c -o >> build/temp.macosx-10.6-x86_64-3.2/Users/darren/Projects/cython/Cython/Compiler/Code.o >> /Users/darren/Projects/cython/Cython/Compiler/Code.c:1:2: error: >> #error Do not use this file, it is the result of a failed Cython >> compilation. >> error: command '/usr/bin/gcc-4.2' failed with exit status 1 >> > > Actually, I get this same error when I try to build with python-2.7 as well. > > Darren This one fails too :( Generators branch is okay. But upstream after merge isn't :( vitja at vitja-laptop:~/work/cython.git$ cat ttt.py def foo(is_str=False): pass vitja at vitja-laptop:~/work/cython.git$ cat ttt.pxd cimport cython @cython.locals(is_str=cython.bint) cdef foo(is_str=*) -- vitja. 
From d.s.seljebotn at astro.uio.no Mon Apr 4 22:43:11 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 04 Apr 2011 22:43:11 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> Message-ID: <4D9A2D5F.60503@astro.uio.no> On 04/04/2011 09:26 PM, mark florisson wrote: > On 4 April 2011 19:18, Dag Sverre Seljebotn wrote: >> On 04/04/2011 05:22 PM, mark florisson wrote: >>> On 4 April 2011 13:53, Dag Sverre Seljebotn >>> wrote: >>>> On 04/04/2011 01:23 PM, Stefan Behnel wrote: >>>>> Dag Sverre Seljebotn, 04.04.2011 12:17: >>>>>> CEP up at http://wiki.cython.org/enhancements/prange >>>>> """ >>>>> Variable handling >>>>> >>>>> Rather than explicit declaration of shared/private variables we rely on >>>>> conventions: >>>>> >>>>> * Thread-shared: Variables that are only read and not written in the >>>>> loop body are shared across threads. Variables that are only used in the >>>>> else block are considered shared as well. >>>>> >>>>> * Thread-private: Variables that are assigned to in the loop body are >>>>> thread-private. Obviously, the iteration counter is thread-private as >>>>> well. >>>>> >>>>> * Reduction: Variables that only used on the LHS of an inplace >>>>> operator, such as s above, are marked as targets for reduction. If the >>>>> variable is also used in other ways (LHS of assignment or in an >>>>> expression) >>>>> it does instead turn into a thread-private variable. Note: This means >>>>> that >>>>> if one, e.g., inserts printf(... s) above, s is turned into a >>>>> thread-local >>>>> variable. OTOH, there is simply no way to correctly emulate the effect >>>>> printf(... s) would have in a sequential loop, so such code must be >>>>> discouraged anyway. >>>>> """ >>>>> >>>>> What about simply (ab-)using Python semantics and creating a new inner >>>>> scope for the prange loop body? That would basically make the loop >>>>> behave >>>>> like a closure function, but with the looping header at the 'right' >>>>> place >>>>> rather than after the closure. >>>> I'm not quite sure what the concrete changes to the CEP this would lead >>>> to >>>> (assuming you mean this as a proposal for alternative semantics, and not >>>> an >>>> implementation detail). >>>> >>>> How would we treat reduction variables? They need to be supported, and >>>> there's nothing in Python semantics to support reduction variables, they >>>> are >>>> a rather special case everywhere. I suppose keeping the reduction clause >>>> above, or use the "nonlocal" keyword in the loop body... >>>> >>>> Also there's the else:-block, although we could make that part of the >>>> scope. >>>> And the "lastprivate" functionality, although that could be dropped >>>> without >>>> much loss. >>>> >>>>> Also, in the example, the local variable declaration of "tmp" outside of >>>>> the loop looks somewhat misplaced, although it's precedented by >>>>> comprehensions (which also have their own local scope in Cython). >>>> Well, depending on the decision of lastprivate, the declaration would >>>> need >>>> to be outside; I really like the idea of moving "cdef", and am prepared >>>> to >>>> drop lastprivate for this. >>>> >>>> Being explicit about thread-local variables does make things a lot safer >>>> to >>>> use. >>>> >>>> (One problem is that switching between serial and parallel one needs to >>>> move >>>> variable declarations. 
But that only happens once, and one can use >>>> "nthreads=1" to disable parallel after that.) >>>> >>>> An example would then be: >>>> >>>> def f(np.ndarray[double] x, double alpha): >>>> cdef double s = 0, globtmp >>>> with nogil: >>>> for i in prange(x.shape[0]): >>>> cdef double tmp # thread-private >>>> tmp = alpha * i # alpha available from global scope >>>> s += x[i] * tmp # still automatic reduction for inplace >>>> operators >>>> # printf(...s) -> now leads to error, since s is not declared >>>> thread-private but is read >>>> else: >>>> # tmp still available here...looks a bit strange, but useful >>>> s += tmp * 10 >>>> globtmp = tmp # we save tmp for later >>>> # tmp not available here, globtmp is >>>> return s >>>> >>>> Or, we just drop support for the else block on these loops. >>> I think since we are disallowing break (yet) we shouldn't support the >>> else clause. Basically, I think we can make the CEP a tad more simple. >>> >>> I think we could declare everything outside of the prange body. Then, >>> in the prange loop body: >>> >>> if a variable is assigned to anywhere -> make it lastprivate >>> - if a variable is read before assigned to -> make it >>> firstprivate in addition to lastprivate (raise compiler error if the >>> variable is not initialized outside of the loop body) >>> >>> if a variable is only ever read -> make it shared (the default for >>> OpenMP) >>> >>> if a variable has an inplace operator -> make it a reduction >>> >>> There is really no reason to disallow reading of the reduction >>> variable (in e.g. a printf). The reduction should also be initialized >>> outside of the prange body. >> The reason for disallowing reading the reduction variable is that otherwise >> you have a contradiction above, since a reduction variable may also be a >> thread-local variable. Or, you disable inplace operators for thread-local >> variables? (ugh) > Yes, an inplace operator would make it a reduction variable, just like > assigning something makes it lastprivate, only reading makes it shared > and reading before writing makes it firstprivate in addition to > lastprivate. This is all implicit. > > Alternatively, if you want it more explicit, then instead of the > inplace operator you could allow something like > > sum = cython.parallel.reduction('+', sum) + var1 * var2 > > instead of > > sum += var1 * var2 > >> That's the main reason I'm leaning towards explicit declaring local >> variables using "cdef". >> >> If we're reducing complexity BTW, I'd rather remove firstprivate/lastprivate >> alltogether, see below. >>> Then prange() could be implemented in pure mode as simply the >>> sequential version, i.e. range() which some more arguments. >>> >>> For any scratch space buffers etc, I'd prefer something like >>> >>> >>> with cython.parallel: >>> cdef char *buf = malloc(100) >>> >>> for i in prange(n): >>> use buf >>> >>> free(buf) >>> >>> At least it fits my brain pretty well :) (this code does however >>> assume that malloc is thread-safe). >> Yes...perhaps a cython.parellel block will make everybody happy: >> >> - It's more obvious that we create a new scope, which at least answers some >> of Stefan's complaints >> >> - We can use normal "for i in range", and put scheduling params on >> parallel(), which makes Nathaniel happy > That doesn't sound intuitive, as the scheduling pertains to the > worksharing 'for' construct, and not the entire parallel region. So > scheduling parameters should be provided to e.g. 
> cython.parallel.range() (or cython.prange, cython.parallel_range, > whatever). > > Then if cython.parallel.range() is in a 'with cython.parallel' block, > it would have '#pragma omp for' semantics (considering OpenMP), > whereas it would be a '#pragma omp parallel for' if not closely nested > in such a block. > >> In this case I'd say we simply do not support firstprivate, all thread-local >> variables must be declared in the block, and for firstprivate behaviour you >> just initialize them yourself which is more explicit and Pythonic. The >> "else:"-block on loops is still useful for lastprivate behaviour -- the >> point of executing the else block in one of the threads is that you can then >> copy thread-local variables of the "last" thread into shared variables to >> get lastprivate behaviour (again, more explicit and Python). > Why? They are entirely implicit in my proposal, and intuitively so. > Having the parallel range match the sequential range semantics in this > way feel much more Pythonic than having to copy things over in an else > block and having to declare and define simple variables in a special > place. I'm just afraid the risk of creating bugs is too high with inplace operators being as magic as you propose. We must not only judge the convenience when done right, but also the chance of using it in the wrong way. "import this" says "Explicit is better than implicit". I'll likely draft a different CEP tomorrow just to explore more -- in the end I may still well favour your approach. Dag Sverre From greg.ewing at canterbury.ac.nz Mon Apr 4 23:18:03 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 05 Apr 2011 09:18:03 +1200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> Message-ID: <4D9A358B.7040302@canterbury.ac.nz> Nathaniel Smith wrote: > Surely it should be 'pfor i in range(...)'. Or 'pfhor', just to let you know it's really something out of this world. http://marathongame.wikia.com/wiki/Pfhor_%28Race%29 -- Greg From robertwb at math.washington.edu Tue Apr 5 07:05:54 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Mon, 4 Apr 2011 22:05:54 -0700 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D99C1CB.2060400@behnel.de> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99C1CB.2060400@behnel.de> Message-ID: On Mon, Apr 4, 2011 at 6:04 AM, Stefan Behnel wrote: > Dag Sverre Seljebotn, 04.04.2011 13:53: >> >> On 04/04/2011 01:23 PM, Stefan Behnel wrote: >>> >>> Dag Sverre Seljebotn, 04.04.2011 12:17: >>>> >>>> CEP up at http://wiki.cython.org/enhancements/prange >>> >>> """ >>> Variable handling >>> >>> Rather than explicit declaration of shared/private variables we rely on >>> conventions: >>> >>> * Thread-shared: Variables that are only read and not written in the loop >>> body are shared across threads. Variables that are only used in the else >>> block are considered shared as well. >>> >>> * Thread-private: Variables that are assigned to in the loop body are >>> thread-private. Obviously, the iteration counter is thread-private as >>> well. >>> >>> * Reduction: Variables that only used on the LHS of an inplace operator, >>> such as s above, are marked as targets for reduction. If the variable is >>> also used in other ways (LHS of assignment or in an expression) it does >>> instead turn into a thread-private variable. Note: This means that if >>> one, e.g., inserts printf(... 
s) above, s is turned into a thread-local >>> variable. OTOH, there is simply no way to correctly emulate the effect >>> printf(... s) would have in a sequential loop, so such code must be >>> discouraged anyway. >>> """ >>> >>> What about simply (ab-)using Python semantics and creating a new inner >>> scope for the prange loop body? That would basically make the loop behave >>> like a closure function, but with the looping header at the 'right' place >>> rather than after the closure. >> >> I'm not quite sure what the concrete changes to the CEP this would lead to >> (assuming you mean this as a proposal for alternative semantics, and not >> an >> implementation detail). > > What I would like to avoid is having to tell users "and now for something > completely different". It looks like a loop, but then there's a whole page > of new semantics for it. And this also cannot be used in plain Python code > due to the differing scoping behaviour. The same could be said of OpenMP--it looks exactly like a loop except for a couple of pragmas. The proposed (as I'm reading the CEP now) semantics of what's shared and first/last private and reduction would give it the semantics of a normal, sequential loop (and if your final result changes based on how many threads were involved then you've got incorrect code). Perhaps reading of the reduction variable could be fine (though obviously ill-defined, suitable only for debugging). >> How would we treat reduction variables? They need to be supported, and >> there's nothing in Python semantics to support reduction variables, they >> are a rather special case everywhere. I suppose keeping the reduction >> clause above, or use the "nonlocal" keyword in the loop body... > > That's what I thought, yes. It looks unexpected, sure. That's the clear > advantage of using inner functions, which do not add anything new at all. > But if we want to add something that looks more like a loop, we should at > least make it behave like something that's easy to explain. > > Sorry for not taking the opportunity to articulate my scepticism in the > workshop discussion. Skipping through the CEP now, I think this feature adds > quite some complexity to the language, and I'm not sure it's worth that when > compared to the existing closures. The equivalent closure+decorator syntax > is certainly easier to explain, and could translate into exactly the same > code. But with the clear advantage that the scope of local, nonlocal and > thread-configuring variables is immediately obvious. > > Basically, your example would become > > def f(np.ndarray[double] x, double alpha): > ? ?cdef double s = 0 > > ? ?with cython.nogil: > ? ? ? ?@cython.run_parallel_for_loop( range(x.shape[0]) ) > ? ? ? ?cdef threaded_loop(i): ? ?# 'nogil' is inherited > ? ? ? ? ? ?cdef double tmp = alpha * i > ? ? ? ? ? ?nonlocal s > ? ? ? ? ? ?s += x[i] * tmp > ? ? ? ?s += alpha * (x.shape[0] - 1) > ? ?return s > > We likely agree that this is not beautiful. It's also harder to implement > than a "simple" for-in-prange loop. But I find it at least easier to explain > and semantically 'obvious'. And it would allow us to write a pure mode > implementation for this based on the threading module. I'm not opposed to having something like this, it's a whole lot of code and extra refactoring for the basic usecase. I think a nice, clean syntax is worthwhile and requires at lest some level of language support. 
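On the point above about reads of the reduction variable being ill-defined: with a '+' reduction each thread accumulates into its own private copy, and the copies are folded into the shared value only after the loop, so a read inside the loop body sees at best one thread's partial sum. A rough serial model of that, in plain Python (the names and the two-way split are illustrative only):

    x = [1.0, 2.0, 3.0, 4.0]
    n = len(x)
    s = 0.0                                # shared value before the loop
    private_s = [0.0, 0.0]                 # one private accumulator per "thread"
    chunks = (range(0, n // 2), range(n // 2, n))
    for t, chunk in enumerate(chunks):     # the two halves run concurrently
        for i in chunk:
            private_s[t] += x[i]           # never touches the shared 's'
    s += private_s[0] + private_s[1]       # the actual reduction step
    assert s == 10.0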
In some ways it's like buffer support--what goes on under the hood does take some explaining, but most of the time it works as expected (i.e. as if you hadn't declared the type), just faster. The inner workings of prange may be a bit magical, but the intent is not, and the latter is what users care about. - Robert From arthurdesribeiro at gmail.com Tue Apr 5 07:54:33 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Tue, 5 Apr 2011 02:54:33 -0300 Subject: [Cython] Interest in contributing to the project In-Reply-To: <4D990895.60609@molden.no> References: <4D9588CC.6000303@behnel.de> <4D96C73E.4080600@behnel.de> <4D990791.6080301@molden.no> <4D990895.60609@molden.no> Message-ID: Thanks for clarification Sturla, that's just the way I was thinking about some things... I realized that you used some C code that is in _math.h header file, I mean, I was thinking that in the project I should rewrite code that belongs to this file too right? I started coding but I got stucked in functions like Py_Is_Infinite and Py_Is_NaN... I saw Sturla e-mail but I thought this would be wrote in a different way, was I wrong? I also started to write a proposal for this project and hope to publish it here tomorrow for your evaluation. Another point that I'm thinking about is how the profile results should be organized. Is there any template for this? Best Regards []s Arthur 2011/4/3 Sturla Molden > Den 04.04.2011 01:49, skrev Sturla Molden: > > Also observe that we do not release the GIL here. That is not because >> these functions are not thread-safe, they are, but yielding the GIL will >> slow things terribly. >> > > Oh, actually they are not thread-safe because we set errno... Sorry. > > Sturla > > > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.s.seljebotn at astro.uio.no Tue Apr 5 08:16:59 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Tue, 05 Apr 2011 08:16:59 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99C1CB.2060400@behnel.de> Message-ID: <4D9AB3DB.7090706@astro.uio.no> On 04/05/2011 07:05 AM, Robert Bradshaw wrote: > On Mon, Apr 4, 2011 at 6:04 AM, Stefan Behnel wrote: >> Dag Sverre Seljebotn, 04.04.2011 13:53: >>> On 04/04/2011 01:23 PM, Stefan Behnel wrote: >>>> Dag Sverre Seljebotn, 04.04.2011 12:17: >>>>> CEP up at http://wiki.cython.org/enhancements/prange >>>> """ >>>> Variable handling >>>> >>>> Rather than explicit declaration of shared/private variables we rely on >>>> conventions: >>>> >>>> * Thread-shared: Variables that are only read and not written in the loop >>>> body are shared across threads. Variables that are only used in the else >>>> block are considered shared as well. >>>> >>>> * Thread-private: Variables that are assigned to in the loop body are >>>> thread-private. Obviously, the iteration counter is thread-private as >>>> well. >>>> >>>> * Reduction: Variables that only used on the LHS of an inplace operator, >>>> such as s above, are marked as targets for reduction. If the variable is >>>> also used in other ways (LHS of assignment or in an expression) it does >>>> instead turn into a thread-private variable. Note: This means that if >>>> one, e.g., inserts printf(... 
s) above, s is turned into a thread-local >>>> variable. OTOH, there is simply no way to correctly emulate the effect >>>> printf(... s) would have in a sequential loop, so such code must be >>>> discouraged anyway. >>>> """ >>>> >>>> What about simply (ab-)using Python semantics and creating a new inner >>>> scope for the prange loop body? That would basically make the loop behave >>>> like a closure function, but with the looping header at the 'right' place >>>> rather than after the closure. >>> I'm not quite sure what the concrete changes to the CEP this would lead to >>> (assuming you mean this as a proposal for alternative semantics, and not >>> an >>> implementation detail). >> What I would like to avoid is having to tell users "and now for something >> completely different". It looks like a loop, but then there's a whole page >> of new semantics for it. And this also cannot be used in plain Python code >> due to the differing scoping behaviour. > The same could be said of OpenMP--it looks exactly like a loop except > for a couple of pragmas. > > The proposed (as I'm reading the CEP now) semantics of what's shared > and first/last private and reduction would give it the semantics of a > normal, sequential loop (and if your final result changes based on how > many threads were involved then you've got incorrect code). Perhaps > reading of the reduction variable could be fine (though obviously > ill-defined, suitable only for debugging). So would you disable inplace operators for thread-private variables? Otherwise a variable could be both a reduction variable and thread-private... There's a reason I disabled reading the reduction variable (which I should have written down). Dag Sverre From d.s.seljebotn at astro.uio.no Tue Apr 5 09:21:43 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Tue, 05 Apr 2011 09:21:43 +0200 Subject: [Cython] Another CEP: Parallel block Message-ID: <4D9AC307.4010006@astro.uio.no> There's a (much shorter) proposal for a more explicit parallelism construct at http://wiki.cython.org/enhancements/parallelblock This is a little more verbose for the simplest case, but makes the medium-cases that needs work buffers much simpler, and is also more explicit and difficult to get wrong. I am not sure myself which one I prefer of this and prange. Justification for Cython-specific syntax: This is something that is really only useful if you can release the GIL *outside* of the loop. So I feel this is an area where a custom Cython solution is natural, sort of like "cdef extern", and the buffer access. Since a similar pure-Python solution is rather useless, I also think there's less incentive for making something that works well in pure-Python mode. Dag Sverre From markflorisson88 at gmail.com Tue Apr 5 10:26:40 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 5 Apr 2011 10:26:40 +0200 Subject: [Cython] Another CEP: Parallel block In-Reply-To: <4D9AC307.4010006@astro.uio.no> References: <4D9AC307.4010006@astro.uio.no> Message-ID: On 5 April 2011 09:21, Dag Sverre Seljebotn wrote: > There's a (much shorter) proposal for a more explicit parallelism construct > at > > http://wiki.cython.org/enhancements/parallelblock > > This is a little more verbose for the simplest case, but makes the > medium-cases that needs work buffers much simpler, and is also more explicit > and difficult to get wrong. I actually think your else block really complicates matters. 
In this example even your index variable is not well-defined right after the loop, because it's not "declared lastprivate through the else block". There is really no reason to make variables private instead of lastprivate (and additionally firstprivate if needed) by default. I think we should allow at least both options, so if the variables are declared in the parallel nogil block they can only be used inside that block (but are still lastprivate, as the first loop may be followed by other code). But the user will also still be able to declare and define stuff outside of the block and omit the with parallel block entirely. And again, you will want something like cython.parallel.range instead of just range, as you will want to pass scheduling parameters to the range(), and not the parallel. So e.g. you can still write something like this: cdef Py_ssize_t i for i in cython.parallel.range(..., schedule='dynamic', nogil=True): do something print i # i is well-defined here My point is, implicit first- and lastprivate can be implicit because it works the exact same way as the sequential python version does. The only remaining pitfall is the in-place operator which declares a reduction. > I am not sure myself which one I prefer of this and prange. > > Justification for Cython-specific syntax: This is something that is really > only useful if you can release the GIL *outside* of the loop. So I feel this > is an area where a custom Cython solution is natural, sort of like "cdef > extern", and the buffer access. > > Since a similar pure-Python solution is rather useless, I also think there's > less incentive for making something that works well in pure-Python mode. Which feature is Cython specific here? The 'with a, b as c:' thing? > Dag Sverre > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From stefan_ml at behnel.de Tue Apr 5 10:34:19 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 05 Apr 2011 10:34:19 +0200 Subject: [Cython] Another CEP: Parallel block In-Reply-To: References: <4D9AC307.4010006@astro.uio.no> Message-ID: <4D9AD40B.20407@behnel.de> mark florisson, 05.04.2011 10:26: > On 5 April 2011 09:21, Dag Sverre Seljebotn wrote: >> Justification for Cython-specific syntax: This is something that is really >> only useful if you can release the GIL *outside* of the loop. So I feel this >> is an area where a custom Cython solution is natural, sort of like "cdef >> extern", and the buffer access. >> >> Since a similar pure-Python solution is rather useless, I also think there's >> less incentive for making something that works well in pure-Python mode. > > Which feature is Cython specific here? The 'with a, b as c:' thing? No, the syntax is just Python. It's the scoping that's Cython specific, including the local variable declarations inside of the "with" block. Stefan From markflorisson88 at gmail.com Tue Apr 5 10:44:41 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 5 Apr 2011 10:44:41 +0200 Subject: [Cython] Another CEP: Parallel block In-Reply-To: <4D9AD40B.20407@behnel.de> References: <4D9AC307.4010006@astro.uio.no> <4D9AD40B.20407@behnel.de> Message-ID: On 5 April 2011 10:34, Stefan Behnel wrote: > mark florisson, 05.04.2011 10:26: >> >> On 5 April 2011 09:21, Dag Sverre Seljebotn wrote: >>> >>> Justification for Cython-specific syntax: This is something that is >>> really >>> only useful if you can release the GIL *outside* of the loop. 
So I feel >>> this >>> is an area where a custom Cython solution is natural, sort of like "cdef >>> extern", and the buffer access. >>> >>> Since a similar pure-Python solution is rather useless, I also think >>> there's >>> less incentive for making something that works well in pure-Python mode. >> >> Which feature is Cython specific here? The 'with a, b as c:' thing? > > No, the syntax is just Python. It's the scoping that's Cython specific, > including the local variable declarations inside of the "with" block. Hmm, but you can use cython.declare() for that, no? > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Tue Apr 5 10:45:32 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 5 Apr 2011 10:45:32 +0200 Subject: [Cython] Another CEP: Parallel block In-Reply-To: References: <4D9AC307.4010006@astro.uio.no> <4D9AD40B.20407@behnel.de> Message-ID: On 5 April 2011 10:44, mark florisson wrote: > On 5 April 2011 10:34, Stefan Behnel wrote: >> mark florisson, 05.04.2011 10:26: >>> >>> On 5 April 2011 09:21, Dag Sverre Seljebotn wrote: >>>> >>>> Justification for Cython-specific syntax: This is something that is >>>> really >>>> only useful if you can release the GIL *outside* of the loop. So I feel >>>> this >>>> is an area where a custom Cython solution is natural, sort of like "cdef >>>> extern", and the buffer access. >>>> >>>> Since a similar pure-Python solution is rather useless, I also think >>>> there's >>>> less incentive for making something that works well in pure-Python mode. >>> >>> Which feature is Cython specific here? The 'with a, b as c:' thing? >> >> No, the syntax is just Python. It's the scoping that's Cython specific, >> including the local variable declarations inside of the "with" block. > > Hmm, but you can use cython.declare() for that, no? (disregarding the malloc() and pointer arithmetic, of course :) >> Stefan >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > From stefan_ml at behnel.de Tue Apr 5 11:01:02 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 05 Apr 2011 11:01:02 +0200 Subject: [Cython] Another CEP: Parallel block In-Reply-To: References: <4D9AC307.4010006@astro.uio.no> <4D9AD40B.20407@behnel.de> Message-ID: <4D9ADA4E.4050005@behnel.de> mark florisson, 05.04.2011 10:44: > On 5 April 2011 10:34, Stefan Behnel wrote: >> mark florisson, 05.04.2011 10:26: >>> >>> On 5 April 2011 09:21, Dag Sverre Seljebotn wrote: >>>> >>>> Justification for Cython-specific syntax: This is something that is >>>> really >>>> only useful if you can release the GIL *outside* of the loop. So I feel >>>> this >>>> is an area where a custom Cython solution is natural, sort of like "cdef >>>> extern", and the buffer access. >>>> >>>> Since a similar pure-Python solution is rather useless, I also think >>>> there's >>>> less incentive for making something that works well in pure-Python mode. >>> >>> Which feature is Cython specific here? The 'with a, b as c:' thing? >> >> No, the syntax is just Python. It's the scoping that's Cython specific, >> including the local variable declarations inside of the "with" block. > > Hmm, but you can use cython.declare() for that, no? cython.declare() is a no-op (or just a plain assignment) in Python. 
But the thread-local scoping of these variables cannot be emulated in Python. So this would be a feature that cannot be used in pure Python mode, unlike closures. Stefan From markflorisson88 at gmail.com Tue Apr 5 11:05:26 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 5 Apr 2011 11:05:26 +0200 Subject: [Cython] Another CEP: Parallel block In-Reply-To: <4D9ADA4E.4050005@behnel.de> References: <4D9AC307.4010006@astro.uio.no> <4D9AD40B.20407@behnel.de> <4D9ADA4E.4050005@behnel.de> Message-ID: On 5 April 2011 11:01, Stefan Behnel wrote: > mark florisson, 05.04.2011 10:44: >> >> On 5 April 2011 10:34, Stefan Behnel wrote: >>> >>> mark florisson, 05.04.2011 10:26: >>>> >>>> On 5 April 2011 09:21, Dag Sverre Seljebotn wrote: >>>>> >>>>> Justification for Cython-specific syntax: This is something that is >>>>> really >>>>> only useful if you can release the GIL *outside* of the loop. So I feel >>>>> this >>>>> is an area where a custom Cython solution is natural, sort of like >>>>> "cdef >>>>> extern", and the buffer access. >>>>> >>>>> Since a similar pure-Python solution is rather useless, I also think >>>>> there's >>>>> less incentive for making something that works well in pure-Python >>>>> mode. >>>> >>>> Which feature is Cython specific here? The 'with a, b as c:' thing? >>> >>> No, the syntax is just Python. It's the scoping that's Cython specific, >>> including the local variable declarations inside of the "with" block. >> >> Hmm, but you can use cython.declare() for that, no? > > cython.declare() is a no-op (or just a plain assignment) in Python. But the > thread-local scoping of these variables cannot be emulated in Python. So > this would be a feature that cannot be used in pure Python mode, unlike > closures. Sure, but the Python version would just be serial, it wouldn't use threads at all. That's the great thing about OpenMP's philosophy is that it can be either serial or parallel, the only difference is speed. If you want speed, use Cython. > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From d.s.seljebotn at astro.uio.no Tue Apr 5 11:08:43 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Tue, 05 Apr 2011 11:08:43 +0200 Subject: [Cython] Another CEP: Parallel block In-Reply-To: <4D9ADA4E.4050005@behnel.de> References: <4D9AC307.4010006@astro.uio.no> <4D9AD40B.20407@behnel.de> <4D9ADA4E.4050005@behnel.de> Message-ID: <4D9ADC1B.30302@astro.uio.no> On 04/05/2011 11:01 AM, Stefan Behnel wrote: > mark florisson, 05.04.2011 10:44: >> On 5 April 2011 10:34, Stefan Behnel wrote: >>> mark florisson, 05.04.2011 10:26: >>>> >>>> On 5 April 2011 09:21, Dag Sverre Seljebotn wrote: >>>>> >>>>> Justification for Cython-specific syntax: This is something that is >>>>> really >>>>> only useful if you can release the GIL *outside* of the loop. So I >>>>> feel >>>>> this >>>>> is an area where a custom Cython solution is natural, sort of like >>>>> "cdef >>>>> extern", and the buffer access. >>>>> >>>>> Since a similar pure-Python solution is rather useless, I also think >>>>> there's >>>>> less incentive for making something that works well in pure-Python >>>>> mode. >>>> >>>> Which feature is Cython specific here? The 'with a, b as c:' thing? >>> >>> No, the syntax is just Python. It's the scoping that's Cython specific, >>> including the local variable declarations inside of the "with" block. 
>> >> Hmm, but you can use cython.declare() for that, no? > > cython.declare() is a no-op (or just a plain assignment) in Python. > But the thread-local scoping of these variables cannot be emulated in > Python. So this would be a feature that cannot be used in pure Python > mode, unlike closures. The intention of prange was certainly to fall back to a normal single-threaded range in Python mode. Because of the GIL there would rarely be any benefit in running the loop in parallel -- only if you immediately dispatch to a long-running task that itself releases the GIL, but in those cases you should rather stick to pure Python in the first place and not bother with prange. I think the chance of seeing real-life code that both requires prange to run optimally in Cython, and that would not be made slower by more than one thread in Python, is pretty close to zero. Dag Sverre From stefan_ml at behnel.de Tue Apr 5 12:51:33 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 05 Apr 2011 12:51:33 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> Message-ID: <4D9AF435.5080905@behnel.de> mark florisson, 04.04.2011 21:26: > For clarity, I'll add an example: > > def f(np.ndarray[double] x, double alpha): > cdef double s = 0 > cdef double tmp = 2 > cdef double other = 6.6 > > with nogil: > for i in prange(x.shape[0]): > # reading 'tmp' makes it firstprivate in addition to lastprivate > # 'other' is only ever read, so it's shared > printf("%lf %lf %lf\n", tmp, s, other) So, adding a printf() to your code can change the semantics of your variables? That sounds like a really bad design to me. Stefan From markflorisson88 at gmail.com Tue Apr 5 13:52:20 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 5 Apr 2011 13:52:20 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D9AF435.5080905@behnel.de> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> <4D9AF435.5080905@behnel.de> Message-ID: On 5 April 2011 12:51, Stefan Behnel wrote: > mark florisson, 04.04.2011 21:26: >> >> For clarity, I'll add an example: >> >> def f(np.ndarray[double] x, double alpha): >> ? ? cdef double s = 0 >> ? ? cdef double tmp = 2 >> ? ? cdef double other = 6.6 >> >> ? ? with nogil: >> ? ? ? ? for i in prange(x.shape[0]): >> ? ? ? ? ? ? # reading 'tmp' makes it firstprivate in addition to >> lastprivate >> ? ? ? ? ? ? # 'other' is only ever read, so it's shared >> ? ? ? ? ? ? printf("%lf %lf %lf\n", tmp, s, other) > > So, adding a printf() to your code can change the semantics of your > variables? That sounds like a really bad design to me. I agree, I think we should refrain from the firstprivate() entirely, as it wouldn't have the same semantics as serial execution (as 'tmp' would have the original value with parallel execution and the value from previous iterations with serial execution). So basically we should allow reading of private variables only after they are assigned to in the loop body. 
> Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From pav at iki.fi Tue Apr 5 14:55:36 2011 From: pav at iki.fi (Pauli Virtanen) Date: Tue, 5 Apr 2011 12:55:36 +0000 (UTC) Subject: [Cython] CEP: prange for parallel loops References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> Message-ID: Mon, 04 Apr 2011 21:26:34 +0200, mark florisson wrote: [clip] > For clarity, I'll add an example: [clip] How about making all the special declarations explicit? The automatic inference of variables has a problem in that a small change in a part of the code can have somewhat unintuitive non-local effects, as the private/ shared/reduction status of the variable changes in the whole function scope (if Python scoping is retained). Like so with explicit declarations: def f(np.ndarray[double] x, double alpha): cdef double alpha = 6.6 cdef char *ptr = something() # Parallel variables are declared beforehand; # the exact syntax could also be something else cdef cython.parallel.private[int] tmp = 2, tmp2 cdef cython.parallel.reduction[int] s = 0 # Act like ordinary cdef outside prange(); in the prange they are # firstprivate if initialized or written to outside the loop anywhere # in the scope. Or, they could be firstprivate always, if this # has a negligible performance impact. tmp = 3 with nogil: s = 9 for i in prange(x.shape[0]): if cython.parallel.first_iteration(i): # whatever initialization; Cython is in principle allowed # to move this outside the loop, at least if it is # the first thing here pass # tmp2 is not firstprivate, as it's not written to outside # the loop body; also, it's also not lastprivate as it's not # read outside the loop tmp2 = 99 # Increment a private variable tmp += 2*tmp # Add stuff to reduction s += alpha*i # The following raise a compilation error -- the reduction # variable cannot be assigned to, and can be only operated on # with only a single reduction operation inside prange s *= 9 s = 8 # It can be read, however, provided openmp supports this tmp = s # Assignment to non-private variables causes a compile-time # error; this avoids common mistakes, such as forgetting to # declare the reduction variable. alpha += 42 alpha123 = 9 ptr = 94 # These, however, need to be allowed: # the users are on their own to make sure they don't clobber # non-local variables x[i] = 123 (ptr + i)[0] = 123 some_routine(x, ptr, i) else: # private variables are lastprivate if read outside the loop foo = tmp # The else: block can be added, but actually has no effect # as it is always executed --- the code here could as well # be written after the for loop foo = tmp # <- same result with nogil: # Suppose Cython allowed cdef inside blocks with usual scoping # rules cdef cython.parallel.reduction[double] r = 0 # the same variables can be used again in a second parallel loop for i in prange(x.shape[0]): r += 1.5 s -= i tmp = 9 # also the iteration variable is available after the loop count = i # As per usual Cython scoping rules return r, s What did I miss here? As far as I see, the above would have the same semantics and scoping as a single-threaded Python implementation. The only change required to make things parallel is replacing range() by prange() and adding the variable declarations. 
-- Pauli Virtanen From pav at iki.fi Tue Apr 5 15:10:55 2011 From: pav at iki.fi (Pauli Virtanen) Date: Tue, 5 Apr 2011 13:10:55 +0000 (UTC) Subject: [Cython] CEP: prange for parallel loops References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> Message-ID: Tue, 05 Apr 2011 12:55:36 +0000, Pauli Virtanen wrote: [clip] > # Assignment to non-private variables causes a compile-time > # error; this avoids common mistakes, such as forgetting to > # declare the reduction variable. > alpha += 42 > alpha123 = 9 > ptr = 94 Actually, I'm not sure this is absolutely necessary -- life is tough, especially if you are programming in parallel, and there are limits to hand-holding. However, an explicit declaration could be added for turning the error off for the (rare) cases where this makes sense (e.g. setting a shared flag) cdef cython.parallel.shared[double] some_flag -- Pauli Virtanen From markflorisson88 at gmail.com Tue Apr 5 15:56:55 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 5 Apr 2011 15:56:55 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> Message-ID: On 5 April 2011 14:55, Pauli Virtanen wrote: > > Mon, 04 Apr 2011 21:26:34 +0200, mark florisson wrote: > [clip] > > For clarity, I'll add an example: > [clip] > > How about making all the special declarations explicit? The automatic > inference of variables has a problem in that a small change in a part of > the code can have somewhat unintuitive non-local effects, as the private/ > shared/reduction status of the variable changes in the whole function > scope (if Python scoping is retained). > > Like so with explicit declarations: > > def f(np.ndarray[double] x, double alpha): > ? ?cdef double alpha = 6.6 > ? ?cdef char *ptr = something() > > ? ?# Parallel variables are declared beforehand; > ? ?# the exact syntax could also be something else > ? ?cdef cython.parallel.private[int] tmp = 2, tmp2 > ? ?cdef cython.parallel.reduction[int] s = 0 > > ? ?# Act like ordinary cdef outside prange(); in the prange they are > ? ?# firstprivate if initialized or written to outside the loop anywhere > ? ?# in the scope. Or, they could be firstprivate always, if this > ? ?# has a negligible performance impact. > ? ?tmp = 3 The problem with firstprivate() is that it doesn't give you the same semantics as in the sequential version. That's why I think it would be best to forget about firstprivate entirely and allow reading of private variables only after they are assigned to in the loop body. > > ? ?with nogil: > ? ? ? ?s = 9 > > ? ? ? ?for i in prange(x.shape[0]): > ? ? ? ? ? ?if cython.parallel.first_iteration(i): > ? ? ? ? ? ? ? ?# whatever initialization; Cython is in principle allowed > ? ? ? ? ? ? ? ?# to move this outside the loop, at least if it is > ? ? ? ? ? ? ? ?# the first thing here > ? ? ? ? ? ? ? ?pass For this I prefer the aforementioned 'with cython.parallel:' block. > > ? ? ? ? ? ?# tmp2 is not firstprivate, as it's not written to outside > ? ? ? ? ? ?# the loop body; also, it's also not lastprivate as it's not > ? ? ? ? ? ?# read outside the loop > ? ? ? ? ? ?tmp2 = 99 > > ? ? ? ? ? ?# Increment a private variable > ? ? ? ? ? ?tmp += 2*tmp > > ? ? ? ? ? ?# Add stuff to reduction > ? ? ? ? ? ?s += alpha*i > > ? ? ? ? ? ?# The following raise a compilation error -- the reduction > ? ? ? ? ? 
?# variable cannot be assigned to, and can be only operated on > ? ? ? ? ? ?# with only a single reduction operation inside prange > ? ? ? ? ? ?s *= 9 > ? ? ? ? ? ?s = 8 I think OpenMP allows arbitrary assignments and expressions to the reduction variable, all the spec says "usually it will be of the form 'x = ...'". > > ? ? ? ? ? ?# It can be read, however, provided openmp supports this > ? ? ? ? ? ?tmp = s > > ? ? ? ? ? ?# Assignment to non-private variables causes a compile-time > ? ? ? ? ? ?# error; this avoids common mistakes, such as forgetting to > ? ? ? ? ? ?# declare the reduction variable. > ? ? ? ? ? ?alpha += 42 > ? ? ? ? ? ?alpha123 = 9 > ? ? ? ? ? ?ptr = 94 > > ? ? ? ? ? ?# These, however, need to be allowed: > ? ? ? ? ? ?# the users are on their own to make sure they don't clobber > ? ? ? ? ? ?# non-local variables > ? ? ? ? ? ?x[i] = 123 > ? ? ? ? ? ?(ptr + i)[0] = 123 > ? ? ? ? ? ?some_routine(x, ptr, i) Indeed. They could be either shared or firstprivate (as the pointer would be firstprivate, and not the entire array, unless it was declared as a C array of certain size). > ? ? ? ?else: > ? ? ? ? ? ?# private variables are lastprivate if read outside the loop > ? ? ? ? ? ?foo = tmp > > ? ? ? ?# The else: block can be added, but actually has no effect > ? ? ? ?# as it is always executed --- the code here could as well > ? ? ? ?# be written after the for loop > ? ? ? ?foo = tmp ?# <- same result > > ? ?with nogil: > ? ? ? ?# Suppose Cython allowed cdef inside blocks with usual scoping > ? ? ? ?# rules > ? ? ? ?cdef cython.parallel.reduction[double] r = 0 > > ? ? ? ?# the same variables can be used again in a second parallel loop > ? ? ? ?for i in prange(x.shape[0]): > ? ? ? ? ? ?r += 1.5 > ? ? ? ? ? ?s -= i > ? ? ? ? ? ?tmp = 9 > > ? ? ? ?# also the iteration variable is available after the loop > ? ? ? ?count = i > > ? ?# As per usual Cython scoping rules > ? ?return r, s > > What did I miss here? As far as I see, the above would have the same > semantics and scoping as a single-threaded Python implementation. > > The only change required to make things parallel is replacing range() by > prange() and adding the variable declarations. Basically, I like your approach. It's only slightly more verbose as the implicit way, as you need to declare the type of each variable anyway. I also still like the implicit way, but it has a couple of problems: - inplace operators suddenly declare a reduction - assigning to a variable has implicit (last)private semantics, whereas assigning to an element in a buffer has shared semantics Your explicit version solves both these problems. So I'm +1. > -- > Pauli Virtanen > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel From markflorisson88 at gmail.com Tue Apr 5 15:57:50 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 5 Apr 2011 15:57:50 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> Message-ID: On 5 April 2011 15:10, Pauli Virtanen wrote: > Tue, 05 Apr 2011 12:55:36 +0000, Pauli Virtanen wrote: > [clip] >> ? ? ? ? ? ? # Assignment to non-private variables causes a compile-time >> ? ? ? ? ? ? # error; this avoids common mistakes, such as forgetting to >> ? ? ? ? ? ? # declare the reduction variable. >> ? ? ? ? ? ? alpha += 42 >> ? ? ? ? ? ? alpha123 = 9 >> ? ? ? 
? ? ? ptr = 94 > > Actually, I'm not sure this is absolutely necessary -- life is tough, > especially if you are programming in parallel, and there are limits to > hand-holding. > > However, an explicit declaration could be added for turning the error off > for the (rare) cases where this makes sense (e.g. setting a shared flag) > > ? ? ? ?cdef cython.parallel.shared[double] some_flag I think that unless we add support for critical, single or master sections, or the atomic construct, we should also disallow assigning to shared variables entirely. > -- > Pauli Virtanen > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From robertwb at math.washington.edu Tue Apr 5 16:53:33 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 5 Apr 2011 07:53:33 -0700 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D9AF435.5080905@behnel.de> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> <4D9AF435.5080905@behnel.de> Message-ID: On Tue, Apr 5, 2011 at 3:51 AM, Stefan Behnel wrote: > mark florisson, 04.04.2011 21:26: >> >> For clarity, I'll add an example: >> >> def f(np.ndarray[double] x, double alpha): >> ? ? cdef double s = 0 >> ? ? cdef double tmp = 2 >> ? ? cdef double other = 6.6 >> >> ? ? with nogil: >> ? ? ? ? for i in prange(x.shape[0]): >> ? ? ? ? ? ? # reading 'tmp' makes it firstprivate in addition to >> lastprivate >> ? ? ? ? ? ? # 'other' is only ever read, so it's shared >> ? ? ? ? ? ? printf("%lf %lf %lf\n", tmp, s, other) > > So, adding a printf() to your code can change the semantics of your > variables? That sounds like a really bad design to me. That's what I was thinking. Basically, if you do an inlace operation, then it's a reduction variable, no matter what else you do to it (including possibly a direct assignment, though we could make that a compile-time error). - Robert From d.s.seljebotn at astro.uio.no Tue Apr 5 16:58:01 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Tue, 05 Apr 2011 16:58:01 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> <4D9AF435.5080905@behnel.de> Message-ID: <4D9B2DF9.8040305@astro.uio.no> On 04/05/2011 04:53 PM, Robert Bradshaw wrote: > On Tue, Apr 5, 2011 at 3:51 AM, Stefan Behnel wrote: >> mark florisson, 04.04.2011 21:26: >>> For clarity, I'll add an example: >>> >>> def f(np.ndarray[double] x, double alpha): >>> cdef double s = 0 >>> cdef double tmp = 2 >>> cdef double other = 6.6 >>> >>> with nogil: >>> for i in prange(x.shape[0]): >>> # reading 'tmp' makes it firstprivate in addition to >>> lastprivate >>> # 'other' is only ever read, so it's shared >>> printf("%lf %lf %lf\n", tmp, s, other) >> So, adding a printf() to your code can change the semantics of your >> variables? That sounds like a really bad design to me. > That's what I was thinking. Basically, if you do an inlace operation, > then it's a reduction variable, no matter what else you do to it > (including possibly a direct assignment, though we could make that a > compile-time error). -1, I think that's too obscure. Not being able to use inplace operators for certain variables will be at the very least be nagging. I think we need to explicitly declare something. 
Either a simple prange(..., reduce="s:+"), or all-out declaration of thread-local variables. Reduction isn't *that* common, so perhaps that is what should be explicit, unlike my other proposal... Dag From robertwb at math.washington.edu Tue Apr 5 17:01:16 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 5 Apr 2011 08:01:16 -0700 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> <4D9AF435.5080905@behnel.de> Message-ID: On Tue, Apr 5, 2011 at 4:52 AM, mark florisson wrote: > On 5 April 2011 12:51, Stefan Behnel wrote: >> mark florisson, 04.04.2011 21:26: >>> >>> For clarity, I'll add an example: >>> >>> def f(np.ndarray[double] x, double alpha): >>> ? ? cdef double s = 0 >>> ? ? cdef double tmp = 2 >>> ? ? cdef double other = 6.6 >>> >>> ? ? with nogil: >>> ? ? ? ? for i in prange(x.shape[0]): >>> ? ? ? ? ? ? # reading 'tmp' makes it firstprivate in addition to >>> lastprivate >>> ? ? ? ? ? ? # 'other' is only ever read, so it's shared >>> ? ? ? ? ? ? printf("%lf %lf %lf\n", tmp, s, other) >> >> So, adding a printf() to your code can change the semantics of your >> variables? That sounds like a really bad design to me. > > I agree, I think we should refrain from the firstprivate() entirely, > as it wouldn't have the same semantics as serial execution (as 'tmp' > would have the original value with parallel execution and the value > from previous iterations with serial execution). So basically we > should allow reading of private variables only after they are assigned > to in the loop body. Unless I'm miss-understanding the meaning of firstprivate (it's initialized per-thread, not per-iteration), for single-threaded execution, it would have exactly the same semantics as serial execution. As I mentioned before, if your code functions differently for single or multiple threads, then it's incorrect. I think it's natural that a parallel loop would behave like tmp = global_value if fork(): # do first half of the loop, with tmp starting as global_value else: # do last half of the loop, with tmp starting as global_value # reduction magic - Robert From d.s.seljebotn at astro.uio.no Tue Apr 5 17:02:26 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Tue, 05 Apr 2011 17:02:26 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D9B2DF9.8040305@astro.uio.no> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> <4D9AF435.5080905@behnel.de> <4D9B2DF9.8040305@astro.uio.no> Message-ID: <4D9B2F02.5050301@astro.uio.no> On 04/05/2011 04:58 PM, Dag Sverre Seljebotn wrote: > On 04/05/2011 04:53 PM, Robert Bradshaw wrote: >> On Tue, Apr 5, 2011 at 3:51 AM, Stefan Behnel >> wrote: >>> mark florisson, 04.04.2011 21:26: >>>> For clarity, I'll add an example: >>>> >>>> def f(np.ndarray[double] x, double alpha): >>>> cdef double s = 0 >>>> cdef double tmp = 2 >>>> cdef double other = 6.6 >>>> >>>> with nogil: >>>> for i in prange(x.shape[0]): >>>> # reading 'tmp' makes it firstprivate in addition to >>>> lastprivate >>>> # 'other' is only ever read, so it's shared >>>> printf("%lf %lf %lf\n", tmp, s, other) >>> So, adding a printf() to your code can change the semantics of your >>> variables? That sounds like a really bad design to me. >> That's what I was thinking. 
Basically, if you do an inlace operation, >> then it's a reduction variable, no matter what else you do to it >> (including possibly a direct assignment, though we could make that a >> compile-time error). > > -1, I think that's too obscure. Not being able to use inplace > operators for certain variables will be at the very least be nagging. > > I think we need to explicitly declare something. Either a simple > prange(..., reduce="s:+"), or all-out declaration of thread-local > variables. Sorry: prange(..., reduce="s"), or perhaps &s or cython.address(s). The + is of course still specified in code. Dag Sverre From robertwb at math.washington.edu Tue Apr 5 17:14:38 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 5 Apr 2011 08:14:38 -0700 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> Message-ID: On Tue, Apr 5, 2011 at 5:55 AM, Pauli Virtanen wrote: > Mon, 04 Apr 2011 21:26:34 +0200, mark florisson wrote: > [clip] >> For clarity, I'll add an example: > [clip] > > How about making all the special declarations explicit? The automatic > inference of variables has a problem in that a small change in a part of > the code can have somewhat unintuitive non-local effects, as the private/ > shared/reduction status of the variable changes in the whole function > scope (if Python scoping is retained). > > Like so with explicit declarations: That's an interesting idea. It's a bit odd specifying the scope as part of the type, but may work. However, I'm still not convinced that we can't safely infer this information. > def f(np.ndarray[double] x, double alpha): > ? ?cdef double alpha = 6.6 > ? ?cdef char *ptr = something() > > ? ?# Parallel variables are declared beforehand; > ? ?# the exact syntax could also be something else > ? ?cdef cython.parallel.private[int] tmp = 2, tmp2 > ? ?cdef cython.parallel.reduction[int] s = 0 > > ? ?# Act like ordinary cdef outside prange(); in the prange they are > ? ?# firstprivate if initialized or written to outside the loop anywhere > ? ?# in the scope. Or, they could be firstprivate always, if this > ? ?# has a negligible performance impact. > ? ?tmp = 3 > > ? ?with nogil: > ? ? ? ?s = 9 > > ? ? ? ?for i in prange(x.shape[0]): > ? ? ? ? ? ?if cython.parallel.first_iteration(i): > ? ? ? ? ? ? ? ?# whatever initialization; Cython is in principle allowed > ? ? ? ? ? ? ? ?# to move this outside the loop, at least if it is > ? ? ? ? ? ? ? ?# the first thing here > ? ? ? ? ? ? ? ?pass > > ? ? ? ? ? ?# tmp2 is not firstprivate, as it's not written to outside > ? ? ? ? ? ?# the loop body; also, it's also not lastprivate as it's not > ? ? ? ? ? ?# read outside the loop > ? ? ? ? ? ?tmp2 = 99 > > ? ? ? ? ? ?# Increment a private variable > ? ? ? ? ? ?tmp += 2*tmp > > ? ? ? ? ? ?# Add stuff to reduction > ? ? ? ? ? ?s += alpha*i > > ? ? ? ? ? ?# The following raise a compilation error -- the reduction > ? ? ? ? ? ?# variable cannot be assigned to, and can be only operated on > ? ? ? ? ? ?# with only a single reduction operation inside prange > ? ? ? ? ? ?s *= 9 > ? ? ? ? ? ?s = 8 > > ? ? ? ? ? ?# It can be read, however, provided openmp supports this > ? ? ? ? ? ?tmp = s > > ? ? ? ? ? ?# Assignment to non-private variables causes a compile-time > ? ? ? ? ? ?# error; this avoids common mistakes, such as forgetting to > ? ? ? ? ? ?# declare the reduction variable. > ? ? ? ? ? ?alpha += 42 > ? ? ? ? ? 
?alpha123 = 9 > ? ? ? ? ? ?ptr = 94 > > ? ? ? ? ? ?# These, however, need to be allowed: > ? ? ? ? ? ?# the users are on their own to make sure they don't clobber > ? ? ? ? ? ?# non-local variables > ? ? ? ? ? ?x[i] = 123 > ? ? ? ? ? ?(ptr + i)[0] = 123 > ? ? ? ? ? ?some_routine(x, ptr, i) > ? ? ? ?else: > ? ? ? ? ? ?# private variables are lastprivate if read outside the loop > ? ? ? ? ? ?foo = tmp > > ? ? ? ?# The else: block can be added, but actually has no effect > ? ? ? ?# as it is always executed --- the code here could as well > ? ? ? ?# be written after the for loop > ? ? ? ?foo = tmp ?# <- same result > > ? ?with nogil: > ? ? ? ?# Suppose Cython allowed cdef inside blocks with usual scoping > ? ? ? ?# rules > ? ? ? ?cdef cython.parallel.reduction[double] r = 0 > > ? ? ? ?# the same variables can be used again in a second parallel loop > ? ? ? ?for i in prange(x.shape[0]): > ? ? ? ? ? ?r += 1.5 > ? ? ? ? ? ?s -= i > ? ? ? ? ? ?tmp = 9 > > ? ? ? ?# also the iteration variable is available after the loop > ? ? ? ?count = i > > ? ?# As per usual Cython scoping rules > ? ?return r, s > > What did I miss here? As far as I see, the above would have the same > semantics and scoping as a single-threaded Python implementation. One thing is that it's forcing the scope of the variable to be consistant throughout the entire function body, so, for example, a reduction variable in one loop could not be used as a shared in another (without having to declare a new variable), which is a different form of non-locality. > The only change required to make things parallel is replacing range() by > prange() and adding the variable declarations. > > -- > Pauli Virtanen > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From robertwb at math.washington.edu Tue Apr 5 17:26:24 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 5 Apr 2011 08:26:24 -0700 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D9B2F02.5050301@astro.uio.no> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> <4D9AF435.5080905@behnel.de> <4D9B2DF9.8040305@astro.uio.no> <4D9B2F02.5050301@astro.uio.no> Message-ID: On Tue, Apr 5, 2011 at 8:02 AM, Dag Sverre Seljebotn wrote: > On 04/05/2011 04:58 PM, Dag Sverre Seljebotn wrote: >> >> On 04/05/2011 04:53 PM, Robert Bradshaw wrote: >>> >>> On Tue, Apr 5, 2011 at 3:51 AM, Stefan Behnel >>> ?wrote: >>>> >>>> mark florisson, 04.04.2011 21:26: >>>>> >>>>> For clarity, I'll add an example: >>>>> >>>>> def f(np.ndarray[double] x, double alpha): >>>>> ? ? cdef double s = 0 >>>>> ? ? cdef double tmp = 2 >>>>> ? ? cdef double other = 6.6 >>>>> >>>>> ? ? with nogil: >>>>> ? ? ? ? for i in prange(x.shape[0]): >>>>> ? ? ? ? ? ? # reading 'tmp' makes it firstprivate in addition to >>>>> lastprivate >>>>> ? ? ? ? ? ? # 'other' is only ever read, so it's shared >>>>> ? ? ? ? ? ? printf("%lf %lf %lf\n", tmp, s, other) >>>> >>>> So, adding a printf() to your code can change the semantics of your >>>> variables? That sounds like a really bad design to me. >>> >>> That's what I was thinking. Basically, if you do an inlace operation, >>> then it's a reduction variable, no matter what else you do to it >>> (including possibly a direct assignment, though we could make that a >>> compile-time error). >> >> -1, I think that's too obscure. 
Not being able to use inplace operators >> for certain variables will be at the very least be nagging. You could still use inplace operators to your hearts content--just don't bother using the reduced variable outside the loop. (I guess I'm assuming reducing a variable has negligible performance overhead, which it should.) For the rare cases that you want the non-aggregated private, make an assignment to another variable, or use non-inplace operations. Not being able to mix inplace operators might be an annoyance. We could also allow explicit declarations, as per Pauli's suggestion, but not require them. Essentially, as long as we have 1) Sequential behavior == one thread scheduled (by semantics) 2) one thread scheduled == multiple threads scheduled (user's responsibility, as it must be) then I think we should be fine. >> I think we need to explicitly declare something. Either a simple >> prange(..., reduce="s:+"), or all-out declaration of thread-local variables. > > Sorry: prange(..., reduce="s"), or perhaps &s or cython.address(s). The + is > of course still specified in code. > > Dag Sverre > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From d.s.seljebotn at astro.uio.no Tue Apr 5 18:32:25 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Tue, 05 Apr 2011 18:32:25 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> <4D9AF435.5080905@behnel.de> <4D9B2DF9.8040305@astro.uio.no> <4D9B2F02.5050301@astro.uio.no> Message-ID: <4D9B4419.5030804@astro.uio.no> On 04/05/2011 05:26 PM, Robert Bradshaw wrote: > On Tue, Apr 5, 2011 at 8:02 AM, Dag Sverre Seljebotn > wrote: >> On 04/05/2011 04:58 PM, Dag Sverre Seljebotn wrote: >>> On 04/05/2011 04:53 PM, Robert Bradshaw wrote: >>>> On Tue, Apr 5, 2011 at 3:51 AM, Stefan Behnel >>>> wrote: >>>>> mark florisson, 04.04.2011 21:26: >>>>>> For clarity, I'll add an example: >>>>>> >>>>>> def f(np.ndarray[double] x, double alpha): >>>>>> cdef double s = 0 >>>>>> cdef double tmp = 2 >>>>>> cdef double other = 6.6 >>>>>> >>>>>> with nogil: >>>>>> for i in prange(x.shape[0]): >>>>>> # reading 'tmp' makes it firstprivate in addition to >>>>>> lastprivate >>>>>> # 'other' is only ever read, so it's shared >>>>>> printf("%lf %lf %lf\n", tmp, s, other) >>>>> So, adding a printf() to your code can change the semantics of your >>>>> variables? That sounds like a really bad design to me. >>>> That's what I was thinking. Basically, if you do an inlace operation, >>>> then it's a reduction variable, no matter what else you do to it >>>> (including possibly a direct assignment, though we could make that a >>>> compile-time error). >>> -1, I think that's too obscure. Not being able to use inplace operators >>> for certain variables will be at the very least be nagging. > You could still use inplace operators to your hearts content--just > don't bother using the reduced variable outside the loop. (I guess I'm > assuming reducing a variable has negligible performance overhead, > which it should.) For the rare cases that you want the non-aggregated > private, make an assignment to another variable, or use non-inplace > operations. Ahh! Of course! 
With some control flow analysis we could even eliminate the reduction if the variable isn't used after the loop, although I agree the cost should be trivial. > Not being able to mix inplace operators might be an annoyance. We > could also allow explicit declarations, as per Pauli's suggestion, but > not require them. Essentially, as long as we have I think you should be able to mix them, but if you do a reduction doesn't happen. This is slightly uncomfortable, but I believe control flow analysis and disabling firstprivate can solve it, see below. I believe I'm back in the implicit-camp. And the CEP can probably be simplified a bit too, I'll try to do that tomorrow. Two things: * It'd still be nice with something like a parallel block for thread setup/teardown rather than "if firstthreaditeration():". So, a prange for the 50% simplest cases, followed by a parallel-block for the next 30%. * Control flow analysis can help us tight it up a bit: For loops where you actually depend on values of thread-private variables computed in the previous iteration (beyond reduction), it'd be nice to raise a warning unless the variable is explicitly declared thread-local or similar. There are uses for such variables but they'd be rather rare, and such a hint could be very helpful. I'm still not sure if we want firstprivate, even if we can do it. It'd be good to see a usecase for it. I'd rather have NaN and 0x7FFFFFFF personally, as relying on the firstprivate value is likely a bug -- yes, it makes the sequential case work, but that is exactly in the case where parallelizing the sequential case would be wrong!! Grepping through 30000 lines of heavily OpenMP-ified Fortran code here there's no mention of firstprivate or lastprivate (although we certainly want lastprivate to align with the sequential case). Dag Sverre From markflorisson88 at gmail.com Tue Apr 5 19:33:42 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 5 Apr 2011 19:33:42 +0200 Subject: [Cython] CEP: prange for parallel loops In-Reply-To: <4D9B4419.5030804@astro.uio.no> References: <4D999AD4.8080609@astro.uio.no> <4D99AA46.7020600@behnel.de> <4D99B12C.10201@astro.uio.no> <4D99FD70.6010408@astro.uio.no> <4D9AF435.5080905@behnel.de> <4D9B2DF9.8040305@astro.uio.no> <4D9B2F02.5050301@astro.uio.no> <4D9B4419.5030804@astro.uio.no> Message-ID: On 5 April 2011 18:32, Dag Sverre Seljebotn wrote: > On 04/05/2011 05:26 PM, Robert Bradshaw wrote: >> >> On Tue, Apr 5, 2011 at 8:02 AM, Dag Sverre Seljebotn >> ?wrote: >>> >>> On 04/05/2011 04:58 PM, Dag Sverre Seljebotn wrote: >>>> >>>> On 04/05/2011 04:53 PM, Robert Bradshaw wrote: >>>>> >>>>> On Tue, Apr 5, 2011 at 3:51 AM, Stefan Behnel >>>>> ?wrote: >>>>>> >>>>>> mark florisson, 04.04.2011 21:26: >>>>>>> >>>>>>> For clarity, I'll add an example: >>>>>>> >>>>>>> def f(np.ndarray[double] x, double alpha): >>>>>>> ? ? cdef double s = 0 >>>>>>> ? ? cdef double tmp = 2 >>>>>>> ? ? cdef double other = 6.6 >>>>>>> >>>>>>> ? ? with nogil: >>>>>>> ? ? ? ? for i in prange(x.shape[0]): >>>>>>> ? ? ? ? ? ? # reading 'tmp' makes it firstprivate in addition to >>>>>>> lastprivate >>>>>>> ? ? ? ? ? ? # 'other' is only ever read, so it's shared >>>>>>> ? ? ? ? ? ? printf("%lf %lf %lf\n", tmp, s, other) >>>>>> >>>>>> So, adding a printf() to your code can change the semantics of your >>>>>> variables? That sounds like a really bad design to me. >>>>> >>>>> That's what I was thinking. 
Basically, if you do an inlace operation, >>>>> then it's a reduction variable, no matter what else you do to it >>>>> (including possibly a direct assignment, though we could make that a >>>>> compile-time error). >>>> >>>> -1, I think that's too obscure. Not being able to use inplace operators >>>> for certain variables will be at the very least be nagging. >> >> You could still use inplace operators to your hearts content--just >> don't bother using the reduced variable outside the loop. (I guess I'm >> assuming reducing a variable has negligible performance overhead, >> which it should.) For the rare cases that you want the non-aggregated >> private, make an assignment to another variable, or use non-inplace >> operations. > > Ahh! Of course! With some control flow analysis we could even eliminate the > reduction if the variable isn't used after the loop, although I agree the > cost should be trivial. > > >> Not being able to mix inplace operators might be an annoyance. We >> could also allow explicit declarations, as per Pauli's suggestion, but >> not require them. Essentially, as long as we have > > I think you should be able to mix them, but if you do a reduction doesn't > happen. This is slightly uncomfortable, but I believe control flow analysis > and disabling firstprivate can solve it, see below. > > I believe I'm back in the implicit-camp. And the CEP can probably be > simplified a bit too, I'll try to do that tomorrow. > > Two things: > > ?* It'd still be nice with something like a parallel block for thread > setup/teardown rather than "if firstthreaditeration():". So, a prange for > the 50% simplest cases, followed by a parallel-block for the next 30%. Definitely, I think it could also make way for things such as sections etc, but I'll bring that up later :) > ?* Control flow analysis can help us tight it up a bit: For loops where you > actually depend on values of thread-private variables computed in the > previous iteration (beyond reduction), it'd be nice to raise a warning > unless the variable is explicitly declared thread-local or similar. There > are uses for such variables but they'd be rather rare, and such a hint could > be very helpful. > > I'm still not sure if we want firstprivate, even if we can do it. It'd be > good to see a usecase for it. I'd rather have NaN and 0x7FFFFFFF personally, > as relying on the firstprivate value is likely a bug -- yes, it makes the > sequential case work, but that is exactly in the case where parallelizing > the sequential case would be wrong!! Yeah, I think if we go the implicit route then firstprivate might be quite a surprise for users. > Grepping through 30000 lines of heavily OpenMP-ified Fortran code here > there's no mention of firstprivate or lastprivate (although we certainly > want lastprivate to align with the sequential case). > > Dag Sverre > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > Basically I'm fine with either implicit or explicit, although I think the explicit case would be easier to understand for people that have used OpenMP. In either case it would be nice to give prange a 'nogil' option. So to be clear, when we assign to a variable it will be lastprivate, and when we assign to the subscript of a variable we make that variable shared (unless it is declared inside the parallel with block), right? 
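To make the semantics being debated here concrete, the following is a minimal sketch of the kind of loop the CEP is aiming at, written against the prange() name it proposes and the inferred sharing rules summarised above: an inplace operator turns the variable into a reduction, a plain assignment makes it thread-private and lastprivate, and writes through a buffer stay shared. Since those rules are exactly what this thread is still negotiating, treat it as an illustration of the intent rather than of a finished API; the typed memoryview argument and the boundscheck directive are only conveniences for the sketch.

    # cython: boundscheck=False, wraparound=False
    # Sketch only: assumes the prange() construct from the CEP and the
    # inferred private/reduction/shared rules discussed in this thread.
    from cython.parallel import prange

    def scale_and_sum(double[:] x, double alpha):
        cdef double s = 0    # only updated in place -> reduction(+:s)
        cdef double tmp = 0  # assigned each iteration -> thread-private, lastprivate
        cdef Py_ssize_t i
        for i in prange(x.shape[0], nogil=True):
            tmp = alpha * x[i]   # private: no other thread sees this value
            x[i] = tmp           # buffer element stays shared; avoiding races is up to the user
            s += tmp             # accumulated per thread, combined after the loop
        return s, tmp            # tmp holds the value from the last iteration

Run serially (one thread), this does exactly what the equivalent range() loop would do, which is the invariant the discussion above keeps coming back to.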
From d.s.seljebotn at astro.uio.no Tue Apr 5 22:29:29 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Tue, 05 Apr 2011 22:29:29 +0200 Subject: [Cython] prange CEP updated Message-ID: <4D9B7BA9.2060509@astro.uio.no> I've done a pretty major revision to the prange CEP, bringing in a lot of the feedback. Thread-private variables are now split in two cases: i) The safe cases, which really require very little technical knowledge -> automatically inferred ii) As an advanced feature, unsafe cases that requires some knowledge of threading -> must be explicitly declared I think this split simplifies things a great deal. I'm rather excited over this now; this could turn out to be a really user-friendly and safe feature that would not only allow us to support OpenMP-like threading, but be more convenient to use in a range of common cases. http://wiki.cython.org/enhancements/prange Dag Sverre -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.s.seljebotn at astro.uio.no Tue Apr 5 22:47:41 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Tue, 05 Apr 2011 22:47:41 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4D9B7BA9.2060509@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> Message-ID: <4D9B7FED.4080500@astro.uio.no> On 04/05/2011 10:29 PM, Dag Sverre Seljebotn wrote: > I've done a pretty major revision to the prange CEP, bringing in a lot > of the feedback. > > Thread-private variables are now split in two cases: > > i) The safe cases, which really require very little technical > knowledge -> automatically inferred > > ii) As an advanced feature, unsafe cases that requires some knowledge > of threading -> must be explicitly declared > > I think this split simplifies things a great deal. > > I'm rather excited over this now; this could turn out to be a really > user-friendly and safe feature that would not only allow us to support > OpenMP-like threading, but be more convenient to use in a range of > common cases. > > http://wiki.cython.org/enhancements/prange > As a digression: threadlocal(int)-variables could also be supported elsewhere as syntax candy for the pythread.h Thread Local Storage, which would work well for fast TLS for any kind of threads (e.g., when using threading module). Dag Sverre (Sorry about the previous HTML-mail.) From d.s.seljebotn at astro.uio.no Wed Apr 6 09:34:06 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Wed, 06 Apr 2011 09:34:06 +0200 Subject: [Cython] Cython paper in Computing in Science & Engineering Message-ID: <4D9C176E.80406@astro.uio.no> I just wanted to make everybody aware that there's a paper on Cython in this month's CiSE (http://cise.aip.org/). http://dx.doi.org/10.1109/MCSE.2010.118 (paywall) Researchers: Please consider citing this paper if Cython helps your research in non-trivial ways. Dag Sverre From dalcinl at gmail.com Wed Apr 6 19:14:48 2011 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Wed, 6 Apr 2011 14:14:48 -0300 Subject: [Cython] cython broken Message-ID: Since the commit below, Cython fails to compile itself. That fix requires further work and definitely more tests. If that is impossible right now, I would ask the guilty parties to revert the change and continue working on this the bug tracker and repo clones. Please try to keep cython-dev repo clean. 
commit 3069c3e516fc7336b003861881623f30e168849e Author: Haoyu Bai Date: Thu Mar 31 04:19:14 2011 +0800 fix T477 by refactor FuncDefNode, so cython decorators can applied to cdef function See yourself: $ git checkout 3069c3e516fc7336b003861881623f30e168849e Note: checking out '3069c3e516fc7336b003861881623f30e168849e'. You are in 'detached HEAD' state. You can look around, make experimental changes and commit them, and you can discard any commits you make in this state without impacting any branches by performing another checkout. If you want to create a new branch to retain commits you create, you may do so (now or later) by using -b with the checkout command again. Example: git checkout -b new_branch_name HEAD is now at 3069c3e... fix T477 by refactor FuncDefNode, so cython decorators can applied to cdef function [dalcinl at trantor cython-dev]$ python setup.py --name Compiling module Cython.Compiler.Scanning ... Compiling module Cython.Compiler.Parsing ... Compiling module Cython.Compiler.Code ... Error compiling Cython file: ------------------------------------------------------------ ... self.cname = cname self.text = text self.escaped_value = StringEncoding.escape_byte_string(byte_string) self.py_strings = None def get_py_string_const(self, encoding, identifier=None, is_str=False): ^ ------------------------------------------------------------ Cython/Compiler/Code.py:316:4: Signature not compatible with previous declaration Error compiling Cython file: ------------------------------------------------------------ ... cdef public object text cdef public object escaped_value cdef public dict py_strings @cython.locals(intern=bint, is_str=bint, is_unicode=bint) cpdef get_py_string_const(self, encoding, identifier=*, is_str=*) ^ ------------------------------------------------------------ Cython/Compiler/Code.pxd:62:30: Previous declaration is here Compilation failed Cython -- Lisandro Dalcin --------------- CIMEC (INTEC/CONICET-UNL) Predio CONICET-Santa Fe Colectora RN 168 Km 472, Paraje El Pozo 3000 Santa Fe, Argentina Tel: +54-342-4511594 (ext 1011) Tel/Fax: +54-342-4511169 From thomas.e.keller at gmail.com Thu Apr 7 00:50:44 2011 From: thomas.e.keller at gmail.com (Thomas Keller) Date: Wed, 6 Apr 2011 17:50:44 -0500 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: <4D9C176E.80406@astro.uio.no> References: <4D9C176E.80406@astro.uio.no> Message-ID: This article was well written and informative. I thank the six of you for writing it. Cheers, TEK On Wed, Apr 6, 2011 at 2:34 AM, Dag Sverre Seljebotn < d.s.seljebotn at astro.uio.no> wrote: > I just wanted to make everybody aware that there's a paper on Cython in > this month's CiSE (http://cise.aip.org/). > > http://dx.doi.org/10.1109/MCSE.2010.118 (paywall) > > Researchers: Please consider citing this paper if Cython helps your > research in non-trivial ways. > > Dag Sverre > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jason-sage at creativetrax.com Thu Apr 7 00:56:29 2011 From: jason-sage at creativetrax.com (Jason Grout) Date: Wed, 06 Apr 2011 17:56:29 -0500 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: <4D9C176E.80406@astro.uio.no> References: <4D9C176E.80406@astro.uio.no> Message-ID: <4D9CEF9D.401@creativetrax.com> On 4/6/11 2:34 AM, Dag Sverre Seljebotn wrote: > > Researchers: Please consider citing this paper if Cython helps your > research in non-trivial ways. Is this the canonical citation reference for Cython now? If so, can this be mentioned on the Cython webpage somewhere that is prominent enough to be found? Thanks, Jason From zstone at gmail.com Thu Apr 7 01:40:19 2011 From: zstone at gmail.com (Zak Stone) Date: Wed, 6 Apr 2011 19:40:19 -0400 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: <4D9CEF9D.401@creativetrax.com> References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com> Message-ID: >> Researchers: Please consider citing this paper if Cython helps your >> research in non-trivial ways. > > Is this the canonical citation reference for Cython now? ?If so, can this be > mentioned on the Cython webpage somewhere that is prominent enough to be > found? On a related note, would it be possible to post a preprint somewhere that isn't behind a paywall? If that's allowed, I would be delighted to share the preprint with friends to introduce them to Cython. Thanks, Zak From robertwb at math.washington.edu Thu Apr 7 02:12:48 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Wed, 6 Apr 2011 17:12:48 -0700 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com> Message-ID: On Wed, Apr 6, 2011 at 4:40 PM, Zak Stone wrote: >>> Researchers: Please consider citing this paper if Cython helps your >>> research in non-trivial ways. >> >> Is this the canonical citation reference for Cython now? ?If so, can this be >> mentioned on the Cython webpage somewhere that is prominent enough to be >> found? > > On a related note, would it be possible to post a preprint somewhere > that isn't behind a paywall? If that's allowed, I would be delighted > to share the preprint with friends to introduce them to Cython. Yes, I think we can post the pre-print, though I'm opposed to making this the "canonical citation" just because of this paywall. - Robert From zstone at gmail.com Thu Apr 7 02:20:37 2011 From: zstone at gmail.com (Zak Stone) Date: Wed, 6 Apr 2011 20:20:37 -0400 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com> Message-ID: >>>> Researchers: Please consider citing this paper if Cython helps your >>>> research in non-trivial ways. >>> >>> Is this the canonical citation reference for Cython now? ?If so, can this be >>> mentioned on the Cython webpage somewhere that is prominent enough to be >>> found? >> >> On a related note, would it be possible to post a preprint somewhere >> that isn't behind a paywall? If that's allowed, I would be delighted >> to share the preprint with friends to introduce them to Cython. > > Yes, I think we can post the pre-print, though I'm opposed to making > this the "canonical citation" just because of this paywall. Agreed. Perhaps you could post the desired BibTeX citation text for the official version and a link to the official version right next to the preprint? 
Zak From baihaoyu at gmail.com Thu Apr 7 07:22:16 2011 From: baihaoyu at gmail.com (Haoyu Bai) Date: Thu, 7 Apr 2011 13:22:16 +0800 Subject: [Cython] cython broken In-Reply-To: References: Message-ID: On Thu, Apr 7, 2011 at 1:14 AM, Lisandro Dalcin wrote: > Since the commit below, Cython fails to compile itself. That fix > requires further work and definitely more tests. If that is impossible > right now, I would ask the guilty parties to revert the change and > continue working on this the bug tracker and repo clones. Please try > to keep cython-dev repo clean. > > I'm investigating this. For now, please revert this. Meanwhile, I'll try to get it fixed. -- Haoyu BAI School of Computing, National University of Singapore. From arthurdesribeiro at gmail.com Thu Apr 7 07:46:31 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Thu, 7 Apr 2011 02:46:31 -0300 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. Message-ID: I've wrote a proposal to the project: Reimplement C modules in CPython's standard library in Cython. I'd be glad if you could take a look a it and give me your feedback. the link for the proposal is: http://wiki.cython.org/arthursribeiro Thank you. Best Regards Arthur -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.s.seljebotn at astro.uio.no Thu Apr 7 07:54:31 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 07 Apr 2011 07:54:31 +0200 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com> Message-ID: <4D9D5197.3000206@astro.uio.no> On 04/07/2011 02:12 AM, Robert Bradshaw wrote: > On Wed, Apr 6, 2011 at 4:40 PM, Zak Stone wrote: >>>> Researchers: Please consider citing this paper if Cython helps your >>>> research in non-trivial ways. >>> Is this the canonical citation reference for Cython now? If so, can this be >>> mentioned on the Cython webpage somewhere that is prominent enough to be >>> found? >> On a related note, would it be possible to post a preprint somewhere >> that isn't behind a paywall? If that's allowed, I would be delighted >> to share the preprint with friends to introduce them to Cython. > Yes, I think we can post the pre-print, though I'm opposed to making > this the "canonical citation" just because of this paywall. Is this for ideological or practical reasons? This is probably the only paper in a "real" journal for some time, and citations are going to boost the authors' citation counts. Nobody would actually look up the citation anyway simply to learn about Cython, they'd just Google it. So unless we're trying to hide the existence of the paper, I think we should make it the default citation until there's something better. Next time we've got anything to share in a paper, let's do it here: http://www.openresearchcomputation.com/ Although that wasn't around when we started writing the paper. Posting the pre-print is a matter of making the necesarry references within it and formatting it. http://www.sherpa.ac.uk/romeo/search.php?jrule=ISSN&search=1521-9615 I'll fix it and post a link later today. Dag Sverre From r.rex at tu-bs.de Thu Apr 7 09:12:38 2011 From: r.rex at tu-bs.de (=?ISO-8859-1?Q?Ren=E9_Rex?=) Date: Thu, 7 Apr 2011 09:12:38 +0200 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com> Message-ID: Hello > Agreed. 
Perhaps you could post the desired BibTeX citation text for
> the official version and a link to the official version right next to
> the preprint?

BibTeX entry for your convenience:

@article{bradshaw2010cython,
  title={{CYTHON: THE BEST OF BOTH WORLDS}},
  author={Bradshaw, R. and Citro, C. and Seljebotn, D.S.},
  journal={CiSE 2011 Special Python Issue},
  pages={25},
  year={2010}
}

- René

From stefan_ml at behnel.de Thu Apr 7 09:21:28 2011
From: stefan_ml at behnel.de (Stefan Behnel)
Date: Thu, 07 Apr 2011 09:21:28 +0200
Subject: [Cython] Cython paper in Computing in Science & Engineering
In-Reply-To:
References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com>
Message-ID: <4D9D65F8.8080503@behnel.de>

René Rex, 07.04.2011 09:12:
>> Agreed. Perhaps you could post the desired BibTeX citation text for
>> the official version and a link to the official version right next to
>> the preprint?
>
> BibTeX entry for your convenience:
>
> @article{bradshaw2010cython,
>   title={{CYTHON: THE BEST OF BOTH WORLDS}},
>   author={Bradshaw, R. and Citro, C. and Seljebotn, D.S.},
>   journal={CiSE 2011 Special Python Issue},
>   pages={25},
>   year={2010}
> }

Looks rather incomplete to me.

Stefan

From r.rex at tu-bs.de Thu Apr 7 09:34:59 2011
From: r.rex at tu-bs.de (René Rex)
Date: Thu, 7 Apr 2011 09:34:59 +0200
Subject: [Cython] Cython paper in Computing in Science & Engineering
In-Reply-To: <4D9D65F8.8080503@behnel.de>
References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com>
	<4D9D65F8.8080503@behnel.de>
Message-ID:

> Looks rather incomplete to me.

You are right. I got the wrong document. It is from the Sage website and
has the same title...
http://sage.math.washington.edu/tmp/stein-cise-comments-may22.pdf#page=29

This should be the correct entry:

@article{behnel2010cython,
  title={{Cython: The best of both worlds}},
  author={Behnel, S. and Bradshaw, R. and Citro, C. and Dalcin, L. and
          Seljebotn, D.S. and Smith, K.},
  journal={Computing in Science and Engineering},
  issn={1521-9615},
  year={2010},
  publisher={IEEE Computer Society}
}

- René

From stefan_ml at behnel.de Thu Apr 7 10:00:41 2011
From: stefan_ml at behnel.de (Stefan Behnel)
Date: Thu, 07 Apr 2011 10:00:41 +0200
Subject: [Cython] Cython paper in Computing in Science & Engineering
In-Reply-To: <4D9D5197.3000206@astro.uio.no>
References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com>
	<4D9D5197.3000206@astro.uio.no>
Message-ID: <4D9D6F29.8080104@behnel.de>

Dag Sverre Seljebotn, 07.04.2011 07:54:
> On 04/07/2011 02:12 AM, Robert Bradshaw wrote:
>> On Wed, Apr 6, 2011 at 4:40 PM, Zak Stone wrote:
>>>>> Researchers: Please consider citing this paper if Cython helps your
>>>>> research in non-trivial ways.
>>>> Is this the canonical citation reference for Cython now? If so, can
>>>> this be mentioned on the Cython webpage somewhere that is prominent
>>>> enough to be found?
>>> On a related note, would it be possible to post a preprint somewhere
>>> that isn't behind a paywall? If that's allowed, I would be delighted
>>> to share the preprint with friends to introduce them to Cython.
>> Yes, I think we can post the pre-print, though I'm opposed to making
>> this the "canonical citation" just because of this paywall.
>
> Is this for ideological or practical reasons?

Both.

> This is probably the only paper in a "real" journal for some time, and
> citations are going to boost the authors' citation counts.
Nobody would > actually look up the citation anyway simply to learn about Cython, they'd > just Google it. Depends on the reference. If it's just cited as "you know, Cython", people will either look for "Cython" directly and be happy, or they may look up the paper, see that it's paid, and keep searching, either for the paper or for the project. If it's cited as "in that paper, you can read about doing X with Cython", then people will try even harder to get at the paper. In either case, chances are that they need to invest more time because of the reference, compared to a plain link in a footnote. So citing this article is likely to be an inconvenience for interested readers of papers that cite it. > So unless we're trying to hide the existence of the paper, > I think we should make it the default citation until there's something better. We should then at least get a PDF preprint version ready that contains the relevant metadata and put the exact same file up on everyone's homepage (so that search engines don't get confused and can simply hash-compare them). > Next time we've got anything to share in a paper, let's do it here: > > http://www.openresearchcomputation.com/ Looks good to me (even though PyCon 2011 isn't really an "upcoming conference" ;). > Posting the pre-print is a matter of making the necesarry references within > it and formatting it. > > http://www.sherpa.ac.uk/romeo/search.php?jrule=ISSN&search=1521-9615 > > > I'll fix it and post a link later today. Great, thanks! Stefan From robertwb at math.washington.edu Thu Apr 7 10:01:06 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 7 Apr 2011 01:01:06 -0700 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: <4D9D5197.3000206@astro.uio.no> References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com> <4D9D5197.3000206@astro.uio.no> Message-ID: On Wed, Apr 6, 2011 at 10:54 PM, Dag Sverre Seljebotn wrote: > On 04/07/2011 02:12 AM, Robert Bradshaw wrote: >> >> On Wed, Apr 6, 2011 at 4:40 PM, Zak Stone ?wrote: >>>>> >>>>> Researchers: Please consider citing this paper if Cython helps your >>>>> research in non-trivial ways. >>>> >>>> Is this the canonical citation reference for Cython now? ?If so, can >>>> this be >>>> mentioned on the Cython webpage somewhere that is prominent enough to be >>>> found? >>> >>> On a related note, would it be possible to post a preprint somewhere >>> that isn't behind a paywall? If that's allowed, I would be delighted >>> to share the preprint with friends to introduce them to Cython. >> >> Yes, I think we can post the pre-print, though I'm opposed to making >> this the "canonical citation" just because of this paywall. > > Is this for ideological or practical reasons? Both. Actually, opposed is probably too strong of a word here. I'm disinclined, but there isn't really a better option. Currently, people usually just cite the website, for whatever that's worth. http://scholar.google.com/scholar?q=cython > This is probably the only paper in a "real" journal for some time, and > citations are going to boost the authors' citation counts. Nobody would > actually look up the citation anyway simply to learn about Cython, they'd > just Google it. So unless we're trying to hide the existence of the paper, I > think we should make it the default citation until there's something better. 
> > Next time we've got anything to share in a paper, let's do it here: > > http://www.openresearchcomputation.com/ > > Although that wasn't around when we started writing the paper. Or at least look into this more carefully. Some of CiSE's papers are open access, I (naively) thought ours wouldn't be hard to get to either. It is a nice paper though and I think it'll hit a nice audience (who primarily won't even be aware that they're paying through it indirectly through university overhead and monolithic library subscriptions). > Posting the pre-print is a matter of making the necesarry references within > it and formatting it. > > http://www.sherpa.ac.uk/romeo/search.php?jrule=ISSN&search=1521-9615 > > > I'll fix it and post a link later today. Thanks. - Robert From d.s.seljebotn at astro.uio.no Thu Apr 7 10:08:35 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 07 Apr 2011 10:08:35 +0200 Subject: [Cython] CiSE Cython paper: Preprint up Message-ID: <4D9D7103.7060506@astro.uio.no> I should have put up this right away, sorry: http://folk.uio.no/dagss/cython_cise.pdf It is actually post-review, so it contains most things but some stylistic improvements and layout. Not sure about posting this on cython.org, but we could perhaps link to my webpage (http://folk.uio.no/dagss/) and say it is there... The repo is here: https://github.com/dagss/cython-cise-postprint If only the world could move to open access a bit quicker... Dag Sverre From d.s.seljebotn at astro.uio.no Thu Apr 7 10:33:08 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 07 Apr 2011 10:33:08 +0200 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: <4D9D6F29.8080104@behnel.de> References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com> <4D9D5197.3000206@astro.uio.no> <4D9D6F29.8080104@behnel.de> Message-ID: <4D9D76C4.5070900@astro.uio.no> On 04/07/2011 10:00 AM, Stefan Behnel wrote: > Dag Sverre Seljebotn, 07.04.2011 07:54: >> On 04/07/2011 02:12 AM, Robert Bradshaw wrote: >>> On Wed, Apr 6, 2011 at 4:40 PM, Zak Stone wrote: >>>>>> Researchers: Please consider citing this paper if Cython helps your >>>>>> research in non-trivial ways. >>>>> Is this the canonical citation reference for Cython now? If so, can >>>>> this be >>>>> mentioned on the Cython webpage somewhere that is prominent enough >>>>> to be >>>>> found? >>>> On a related note, would it be possible to post a preprint somewhere >>>> that isn't behind a paywall? If that's allowed, I would be delighted >>>> to share the preprint with friends to introduce them to Cython. >>> Yes, I think we can post the pre-print, though I'm opposed to making >>> this the "canonical citation" just because of this paywall. >> >> Is this for ideological or practical reasons? > > Both. > > >> This is probably the only paper in a "real" journal for some time, and >> citations are going to boost the authors' citation counts. Nobody would >> actually look up the citation anyway simply to learn about Cython, >> they'd >> just Google it. > > Depends on the reference. If it's just cited as "you know, Cython", > people will either look for "Cython" directly and be happy, or they > may look up the paper, see that it's paid, and keep searching, either > for the paper or for the project. If it's cited as "in that paper, you > can read about doing X with Cython", then people will try even harder > to get at the paper. 
In either case, chances are that they need to > invest more time because of the reference, compared to a plain link in > a footnote. So citing this article is likely to be an inconvenience > for interested readers of papers that cite it. I guess this depends on the paper and reader in question then. Myself I'd never bother with the paper but go right to the website. Citing is just "paying the authors of the software through improving their citation stats". Then again my field is unfortunately very much pyramid-scheme-inflicted. I definitely think we should encourage giving a footnote as well. How about just presenting the situation as it is in a "Citing Cython" section, and leave the decision up to who's citing Cython? ("If you don't like to cite a paywall paper, a website reference is OK. At any rate, please link to the website in a footnote the first time you mention Cython.") Really, I hate the current situation as much as you do. But I see moving the world towards open access as the task of those whose already got a bit up the food chain; I'm just at the start of my PhD. (And it should be obvious I'm arguing with my own interests in mind here.) Dag Sverre From stefan_ml at behnel.de Thu Apr 7 10:40:50 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Thu, 07 Apr 2011 10:40:50 +0200 Subject: [Cython] [cython-users] CiSE Cython paper: Preprint up In-Reply-To: <4D9D7103.7060506@astro.uio.no> References: <4D9D7103.7060506@astro.uio.no> Message-ID: <4D9D7892.7030203@behnel.de> Dag Sverre Seljebotn, 07.04.2011 10:08: > The repo is here: https://github.com/dagss/cython-cise-postprint To include metadata, you can change the hyperref setup near the top as follows: """ \RequirePackage[colorlinks,breaklinks,pdftex, linkcolor=InnerLinkColor,filecolor=OuterLinkColor, menucolor=OuterLinkColor,urlcolor=OuterLinkColor, citecolor=InnerLinkColor, pdfauthor={Stefan Behnel, Robert Bradshaw, Craig Citro, Lisandro Dalcin, Dag Sverre Seljebotn, Kurt Smith}, pdftitle={Cython: The best of both worlds}, pdfkeywords={Cython language, Cython programming, NumPy} ]{hyperref} """ Any more keywords to add? There's also pdfsubject (and pdfcreator and pdfproducer), but that doesn't really apply here. Stefan From robertwb at math.washington.edu Thu Apr 7 10:45:14 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 7 Apr 2011 01:45:14 -0700 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: <4D9D76C4.5070900@astro.uio.no> References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com> <4D9D5197.3000206@astro.uio.no> <4D9D6F29.8080104@behnel.de> <4D9D76C4.5070900@astro.uio.no> Message-ID: On Thu, Apr 7, 2011 at 1:33 AM, Dag Sverre Seljebotn wrote: > On 04/07/2011 10:00 AM, Stefan Behnel wrote: >> >> Dag Sverre Seljebotn, 07.04.2011 07:54: >>> >>> On 04/07/2011 02:12 AM, Robert Bradshaw wrote: >>>> >>>> On Wed, Apr 6, 2011 at 4:40 PM, Zak Stone wrote: >>>>>>> >>>>>>> Researchers: Please consider citing this paper if Cython helps your >>>>>>> research in non-trivial ways. >>>>>> >>>>>> Is this the canonical citation reference for Cython now? If so, can >>>>>> this be >>>>>> mentioned on the Cython webpage somewhere that is prominent enough to >>>>>> be >>>>>> found? >>>>> >>>>> On a related note, would it be possible to post a preprint somewhere >>>>> that isn't behind a paywall? If that's allowed, I would be delighted >>>>> to share the preprint with friends to introduce them to Cython. 
>>>> >>>> Yes, I think we can post the pre-print, though I'm opposed to making >>>> this the "canonical citation" just because of this paywall. >>> >>> Is this for ideological or practical reasons? >> >> Both. >> >> >>> This is probably the only paper in a "real" journal for some time, and >>> citations are going to boost the authors' citation counts. Nobody would >>> actually look up the citation anyway simply to learn about Cython, they'd >>> just Google it. >> >> Depends on the reference. If it's just cited as "you know, Cython", people >> will either look for "Cython" directly and be happy, or they may look up the >> paper, see that it's paid, and keep searching, either for the paper or for >> the project. If it's cited as "in that paper, you can read about doing X >> with Cython", then people will try even harder to get at the paper. In >> either case, chances are that they need to invest more time because of the >> reference, compared to a plain link in a footnote. So citing this article is >> likely to be an inconvenience for interested readers of papers that cite it. > > I guess this depends on the paper and reader in question then. Myself I'd > never bother with the paper but go right to the website. Citing is just > "paying the authors of the software through improving their citation stats". > Then again my field is unfortunately very much pyramid-scheme-inflicted. > > I definitely think we should encourage giving a footnote as well. > > How about just presenting the situation as it is in a "Citing Cython" > section, and leave the decision up to who's citing Cython? ("If you don't > like to cite a paywall paper, a website reference is OK. At any rate, please > link to the website in a footnote the first time you mention Cython.") Of course eventually it'd be nice if people just wrote "we coded this up in Cython" and a reference felt as out of place there as if they had provided a reference for Fortran or C :-). We're a long way from there though. > Really, I hate the current situation as much as you do. But I see moving the > world towards open access as the task of those whose already got a bit up > the food chain; I'm just at the start of my PhD. (And it should be obvious > I'm arguing with my own interests in mind here.) > > Dag Sverre > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From d.s.seljebotn at astro.uio.no Thu Apr 7 10:56:39 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 07 Apr 2011 10:56:39 +0200 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com> <4D9D5197.3000206@astro.uio.no> Message-ID: <4D9D7C47.9050501@astro.uio.no> On 04/07/2011 10:01 AM, Robert Bradshaw wrote: > On Wed, Apr 6, 2011 at 10:54 PM, Dag Sverre Seljebotn > wrote: >> On 04/07/2011 02:12 AM, Robert Bradshaw wrote: >>> On Wed, Apr 6, 2011 at 4:40 PM, Zak Stone wrote: >>>>>> Researchers: Please consider citing this paper if Cython helps your >>>>>> research in non-trivial ways. >>>>> Is this the canonical citation reference for Cython now? If so, can >>>>> this be >>>>> mentioned on the Cython webpage somewhere that is prominent enough to be >>>>> found? >>>> On a related note, would it be possible to post a preprint somewhere >>>> that isn't behind a paywall? 
If that's allowed, I would be delighted >>>> to share the preprint with friends to introduce them to Cython. >>> Yes, I think we can post the pre-print, though I'm opposed to making >>> this the "canonical citation" just because of this paywall. >> Is this for ideological or practical reasons? > Both. > > Actually, opposed is probably too strong of a word here. I'm > disinclined, but there isn't really a better option. Currently, people > usually just cite the website, for whatever that's worth. > http://scholar.google.com/scholar?q=cython And I don't think that's worth very much. To me it's really looking like CiSE citation or no citation at all. >> Next time we've got anything to share in a paper, let's do it here: >> >> http://www.openresearchcomputation.com/ >> >> Although that wasn't around when we started writing the paper. > Or at least look into this more carefully. Some of CiSE's papers are > open access, I (naively) thought ours wouldn't be hard to get to > either. It is a nice paper though and I think it'll hit a nice > audience (who primarily won't even be aware that they're paying > through it indirectly through university overhead and monolithic > library subscriptions). I did the same mistake, because I couldn't see the paywall myself. At the time I actually had a hard time finding an internet connection that wouldn't transparently serve me the PDFs. And once I learned I figured it was a bit too late to back out. I've learned a lot since then. DS From r.rex at tu-bs.de Thu Apr 7 11:37:05 2011 From: r.rex at tu-bs.de (=?ISO-8859-1?Q?Ren=E9_Rex?=) Date: Thu, 7 Apr 2011 11:37:05 +0200 Subject: [Cython] [cython-users] CiSE Cython paper: Preprint up In-Reply-To: <4D9D7892.7030203@behnel.de> References: <4D9D7103.7060506@astro.uio.no> <4D9D7892.7030203@behnel.de> Message-ID: > Any more keywords to add? What about "Python"? ;) - Ren? From d.s.seljebotn at astro.uio.no Thu Apr 7 13:08:19 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 07 Apr 2011 13:08:19 +0200 Subject: [Cython] [cython-users] CiSE Cython paper: Preprint up In-Reply-To: References: <4D9D7103.7060506@astro.uio.no> <4D9D7892.7030203@behnel.de> Message-ID: <4D9D9B23.4090900@astro.uio.no> On 04/07/2011 11:37 AM, Ren? Rex wrote: >> Any more keywords to add? > What about "Python"? ;) > Done and done. Thanks for the patches. DS From d.s.seljebotn at astro.uio.no Thu Apr 7 13:26:50 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 07 Apr 2011 13:26:50 +0200 Subject: [Cython] Cython paper in Computing in Science & Engineering In-Reply-To: References: <4D9C176E.80406@astro.uio.no> <4D9CEF9D.401@creativetrax.com> <4D9D5197.3000206@astro.uio.no> Message-ID: <4D9D9F7A.9040600@astro.uio.no> On 04/07/2011 10:01 AM, Robert Bradshaw wrote: > On Wed, Apr 6, 2011 at 10:54 PM, Dag Sverre Seljebotn > wrote: >> On 04/07/2011 02:12 AM, Robert Bradshaw wrote: >>> On Wed, Apr 6, 2011 at 4:40 PM, Zak Stone wrote: >>>>>> Researchers: Please consider citing this paper if Cython helps your >>>>>> research in non-trivial ways. >>>>> Is this the canonical citation reference for Cython now? If so, can >>>>> this be >>>>> mentioned on the Cython webpage somewhere that is prominent enough to be >>>>> found? >>>> On a related note, would it be possible to post a preprint somewhere >>>> that isn't behind a paywall? If that's allowed, I would be delighted >>>> to share the preprint with friends to introduce them to Cython. 
>>> Yes, I think we can post the pre-print, though I'm opposed to making >>> this the "canonical citation" just because of this paywall. >> Is this for ideological or practical reasons? > Both. > > Actually, opposed is probably too strong of a word here. I'm > disinclined, but there isn't really a better option. Currently, people > usually just cite the website, for whatever that's worth. > http://scholar.google.com/scholar?q=cython OK, I wrote this: http://wiki.cython.org/FAQ#HowdoIciteCythoninanacademicpaper.3F If any of you can think of something better that that, just do it -- I won't start an edit war :-) Dag Sverre From stefan_ml at behnel.de Thu Apr 7 13:46:08 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Thu, 07 Apr 2011 13:46:08 +0200 Subject: [Cython] Hudson pyregr testing takes too long Message-ID: <4D9DA400.4060105@behnel.de> Hi, I just noticed that the CPython pyregr tests have jumped up from ~14 minutes for a run to ~4 hours when we added generator support. https://sage.math.washington.edu:8091/hudson/view/cython-devel/job/cython-devel-tests-pyregr-py26-c/buildTimeTrend I currently have no idea why that is (well, it's likely because we compile more tests now, but Vitja's branch ran the tests in ~30 minutes). It would be great if someone could find the time to analyse this problem. The current run time makes it basically impossible to keep these tests enabled. Stefan From stefan_ml at behnel.de Thu Apr 7 13:52:00 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Thu, 07 Apr 2011 13:52:00 +0200 Subject: [Cython] Hudson pyregr testing takes too long In-Reply-To: <4D9DA400.4060105@behnel.de> References: <4D9DA400.4060105@behnel.de> Message-ID: <4D9DA560.8070505@behnel.de> Stefan Behnel, 07.04.2011 13:46: > I just noticed that the CPython pyregr tests have jumped up from ~14 > minutes for a run to ~4 hours when we added generator support. > > https://sage.math.washington.edu:8091/hudson/view/cython-devel/job/cython-devel-tests-pyregr-py26-c/buildTimeTrend > > > I currently have no idea why that is (well, it's likely because we compile > more tests now, but Vitja's branch ran the tests in ~30 minutes). It would > be great if someone could find the time to analyse this problem. The > current run time makes it basically impossible to keep these tests enabled. Ok, it looks like this is mostly an issue with the Py2.6 tests. The Py2.7 tests take 30-45 minutes, which is very long, but not completely out of bounds. I've disabled the Py2.6 pyregr tests for now. Stefan From baihaoyu at gmail.com Thu Apr 7 15:04:03 2011 From: baihaoyu at gmail.com (Haoyu Bai) Date: Thu, 7 Apr 2011 21:04:03 +0800 Subject: [Cython] cython broken In-Reply-To: References: Message-ID: On Thu, Apr 7, 2011 at 1:22 PM, Haoyu Bai wrote: > On Thu, Apr 7, 2011 at 1:14 AM, Lisandro Dalcin wrote: >> Since the commit below, Cython fails to compile itself. That fix >> requires further work and definitely more tests. If that is impossible >> right now, I would ask the guilty parties to revert the change and >> continue working on this the bug tracker and repo clones. Please try >> to keep cython-dev repo clean. >> >> > > I'm investigating this. For now, please revert this. Meanwhile, I'll > try to get it fixed. > I just started a pull request that fix the current compiling fail: https://github.com/cython/cython/pull/21 Thanks! -- Haoyu BAI School of Computing, National University of Singapore. 
From romain.py at gmail.com Thu Apr 7 17:01:11 2011 From: romain.py at gmail.com (Romain Guillebert) Date: Thu, 7 Apr 2011 16:01:11 +0100 Subject: [Cython] [GSoC] Python backend for Cython using PyPy's FFI Message-ID: <20110407150110.GA13395@ubuntu> Hi I proposed the Summer of Code project regarding the Python backend for Cython. As I said in my proposal this would translate Cython code to Python + FFI code (I don't know yet if it will use ctypes or something specific to PyPy). PyPy's ctypes is now really fast and this will allow people to port their Cython code to PyPy. For the moment I've been mostly in touch with the PyPy people and they seem happy with my proposal. Of course I'm available for questions. Cheers Romain From d.s.seljebotn at astro.uio.no Thu Apr 7 18:06:31 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 07 Apr 2011 18:06:31 +0200 Subject: [Cython] [GSoC] Python backend for Cython using PyPy's FFI In-Reply-To: <20110407150110.GA13395@ubuntu> References: <20110407150110.GA13395@ubuntu> Message-ID: <4D9DE107.9040301@astro.uio.no> On 04/07/2011 05:01 PM, Romain Guillebert wrote: > Hi > > I proposed the Summer of Code project regarding the Python backend for > Cython. > > As I said in my proposal this would translate Cython code to Python + > FFI code (I don't know yet if it will use ctypes or something specific > to PyPy). PyPy's ctypes is now really fast and this will allow people to > port their Cython code to PyPy. > > For the moment I've been mostly in touch with the PyPy people and they > seem happy with my proposal. > > Of course I'm available for questions. Disclaimer: I haven't read the proposal (don't have access yet but will soon). So perhaps the below is redundant. This seems similar to Carl Witty's port of Cython to .NET/IronPython. An important insight from that project is that Cython code does NOT specify an ABI, only an API which requires a C compiler to make sense. That is; many wrapped C libraries have plenty of macros, we only require partial definition of struct, we only require approximate typedef's, and so on. In the .NET port, the consequence was that rather than the original idea of generating C# code (with FFI specifications) was dropped, and one instead went with C++/CLR (which is a proper C++ compiler that really understands the C side on an API level, in addition to giving access to the .NET runtime). There are two ways around this: a) In addition to Python code, generate C code that can take (the friendlest) APIs and probe for the ABIs (such as, for instance, getting the offset of each struct field from the base pointer). Of course, this must really be rerun for each platform/build of the wrapped library. Essentially, you'd use Cython to generate C code that, in a target build, would generate Python code... b) Create a subset of the Cython language ("RCython" :-)), where you require explicit ABIs (essentially this means either disallowing "cdef extern from ...", or creating some new form of it). Most Cython extensions I know about would not work with this though, so there would need to be porting in each case. Ideally one should then have a similar mode for Cython+CPython so that one can debug with CPython as well. 
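To make the API-versus-ABI point concrete, here is a minimal hypothetical sketch (the write_char function and its use of stdio are made up for illustration, not taken from any real module). A C compiler resolves all of this trivially, but none of it tells a ctypes-style backend what it actually needs to know:

cdef extern from "stdio.h":
    ctypedef struct FILE:
        pass                     # partial/opaque struct: size and layout unknown
    int EOF                      # really a macro constant, has no address to bind to
    int putc(int c, FILE *f)     # allowed to be a macro expanding to code
    FILE *fopen(char *path, char *mode)
    int fclose(FILE *f)

def write_char(bytes path, char c):
    cdef FILE *f = fopen(path, b"w")
    if f == NULL:
        raise IOError("could not open file")
    if putc(c, f) == EOF:
        fclose(f)
        raise IOError("write failed")
    fclose(f)

A probe in the spirit of a) would have to discover, per platform, what EOF and putc() actually expand to and how FILE is laid out -- exactly the information the C compiler normally supplies for free.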
Dag Sverre From carl.witty at gmail.com Thu Apr 7 18:53:24 2011 From: carl.witty at gmail.com (Carl Witty) Date: Thu, 7 Apr 2011 09:53:24 -0700 Subject: [Cython] [GSoC] Python backend for Cython using PyPy's FFI In-Reply-To: <4D9DE107.9040301@astro.uio.no> References: <20110407150110.GA13395@ubuntu> <4D9DE107.9040301@astro.uio.no> Message-ID: On Thu, Apr 7, 2011 at 9:06 AM, Dag Sverre Seljebotn wrote: > On 04/07/2011 05:01 PM, Romain Guillebert wrote: >> >> Hi >> >> I proposed the Summer of Code project regarding the Python backend for >> Cython. >> >> As I said in my proposal this would translate Cython code to Python + >> FFI code (I don't know yet if it will use ctypes or something specific >> to PyPy). PyPy's ctypes is now really fast and this will allow people to >> port their Cython code to PyPy. >> >> For the moment I've been mostly in touch with the PyPy people and they >> seem happy with my proposal. >> >> Of course I'm available for questions. > > Disclaimer: I haven't read the proposal (don't have access yet but will > soon). So perhaps the below is redundant. > > This seems similar to Carl Witty's port of Cython to .NET/IronPython. An > important insight from that project is that Cython code does NOT specify an > ABI, only an API which requires a C compiler to make sense. That is; many > wrapped C libraries have plenty of macros, we only require partial > definition of struct, we only require approximate typedef's, and so on. > > In the .NET port, the consequence was that rather than the original idea of > generating C# code (with FFI specifications) was dropped, and one instead > went with C++/CLR (which is a proper C++ compiler that really understands > the C side on an API level, in addition to giving access to the .NET > runtime). > > There are two ways around this: > > a) In addition to Python code, generate C code that can take (the > friendlest) APIs and probe for the ABIs (such as, for instance, getting the > offset of each struct field from the base pointer). Of course, this must > really be rerun for each platform/build of the wrapped library. > > Essentially, you'd use Cython to generate C code that, in a target build, > would generate Python code... > > b) Create a subset of the Cython language ("RCython" :-)), where you > require explicit ABIs (essentially this means either disallowing "cdef > extern from ...", or creating some new form of it). Most Cython extensions I > know about would not work with this though, so there would need to be > porting in each case. Ideally one should then have a similar mode for > Cython+CPython so that one can debug with CPython as well.

Note that a) is not sufficient in general -- it doesn't handle macros that expand into code, like errno and putc().

There's another option I considered,

c) Given the API specification in the Cython file, generate C code that wraps that API with a known ABI. So for:

cdef extern from "<errno.h>":
    int errno

you would generate a C file something like:

#include <errno.h>

void _write_errno(int newval) { errno = newval; }
int _read_errno() { return errno; }

and for

cdef extern from "<sys/types.h>":
    ctypedef int ino_t

cdef extern from "<sys/stat.h>":
    cdef struct stat:
        ino_t st_ino

you would generate (in part):

#include <sys/types.h>
#include <sys/stat.h>

long long _read_struct_stat_st_ino(struct stat *ptr) { return ptr->st_ino; }
void _write_struct_stat_st_ino(struct stat *ptr, long long newval) { ptr->st_ino = newval; }

(Of course, you'd want to add more name mangling to these examples.)
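On the Python+FFI side of option c), those wrappers would be all the backend needs to know about; a rough ctypes sketch (the library name "_abi_shim.so" is a placeholder, and the function names are taken from the generated code above):

import ctypes

shim = ctypes.CDLL("./_abi_shim.so")   # the compiled wrapper file

shim._read_errno.restype = ctypes.c_int
shim._write_errno.argtypes = [ctypes.c_int]

shim._write_errno(0)
print(shim._read_errno())              # -> 0

# Struct fields go through wrappers too, so the Python side never needs
# sizeof(struct stat) or the offset of st_ino, only the wrapper's address:
shim._read_struct_stat_st_ino.restype = ctypes.c_longlong
shim._read_struct_stat_st_ino.argtypes = [ctypes.c_void_p]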
Note that I use "long long" for st_ino even though the Cython code claimed that st_ino was int; that's because Cython generates code that would work even if st_ino were "long long", and probably some modules would break if you used the types declared in Cython. Also, you could combine a), b), and c). For example, use a) to determine struct sizes, type sizes, and field offsets; use b) when you're not worried about macros; and use c) (perhaps triggered by a new annotation in the Cython source) when you want to handle arbitrary API's that may be implemented with macros. Carl From carl.witty at gmail.com Thu Apr 7 20:09:37 2011 From: carl.witty at gmail.com (Carl Witty) Date: Thu, 7 Apr 2011 11:09:37 -0700 Subject: [Cython] [GSoC] Python backend for Cython using PyPy's FFI In-Reply-To: <4D9DE107.9040301@astro.uio.no> References: <20110407150110.GA13395@ubuntu> <4D9DE107.9040301@astro.uio.no> Message-ID: On Thu, Apr 7, 2011 at 9:06 AM, Dag Sverre Seljebotn wrote: > This seems similar to Carl Witty's port of Cython to .NET/IronPython. An > important insight from that project is that Cython code does NOT specify an > ABI, only an API which requires a C compiler to make sense. That is; many > wrapped C libraries have plenty of macros, we only require partial > definition of struct, we only require approximate typedef's, and so on. > > In the .NET port, the consequence was that rather than the original idea of > generating C# code (with FFI specifications) was dropped, and one instead > went with C++/CLR (which is a proper C++ compiler that really understands > the C side on an API level, in addition to giving access to the .NET > runtime). Let me just add that a way to deal with the API vs. ABI issue would be useful for other potential Cython targets as well, such as IronPython using C# and Jython. (A C# port for IronPython would be more valuable than my C++/CLI port because it would work under Mono -- Mono doesn't have a C++/CLI compiler and probably never will.) Carl From romain.py at gmail.com Thu Apr 7 22:36:35 2011 From: romain.py at gmail.com (Romain) Date: Thu, 7 Apr 2011 21:36:35 +0100 Subject: [Cython] [GSoC] Python backend for Cython using PyPy's FFI In-Reply-To: References: <20110407150110.GA13395@ubuntu> <4D9DE107.9040301@astro.uio.no> Message-ID: Hi again PyPy has functions to parse C headers to get macros and constants so I could create C functions to wrap the macros and probably inline constants in the Python part of the wrapper. This doesn't solve the problem of ifdefs but this is a start. Cheers Romain 2011/4/7 Carl Witty > On Thu, Apr 7, 2011 at 9:06 AM, Dag Sverre Seljebotn > wrote: > > This seems similar to Carl Witty's port of Cython to .NET/IronPython. An > > important insight from that project is that Cython code does NOT specify > an > > ABI, only an API which requires a C compiler to make sense. That is; many > > wrapped C libraries have plenty of macros, we only require partial > > definition of struct, we only require approximate typedef's, and so on. > > > > In the .NET port, the consequence was that rather than the original idea > of > > generating C# code (with FFI specifications) was dropped, and one instead > > went with C++/CLR (which is a proper C++ compiler that really understands > > the C side on an API level, in addition to giving access to the .NET > > runtime). > > Let me just add that a way to deal with the API vs. ABI issue would be > useful for other potential Cython targets as well, such as IronPython > using C# and Jython. 
(A C# port for IronPython would be more valuable > than my C++/CLI port because it would work under Mono -- Mono doesn't > have a C++/CLI compiler and probably never will.) > > Carl > -------------- next part -------------- An HTML attachment was scrubbed... URL: From arthurdesribeiro at gmail.com Thu Apr 7 23:31:00 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Thu, 7 Apr 2011 18:31:00 -0300 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: Message-ID: I've submitted to google the link is: http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# It would be really important if you could give me a feedback to my proposal... Thank you Best Regards Arthur 2011/4/7 Arthur de Souza Ribeiro > I've wrote a proposal to the project: Reimplement C modules in CPython's > standard library in Cython. > > I'd be glad if you could take a look a it and give me your feedback. > > the link for the proposal is: http://wiki.cython.org/arthursribeiro > > Thank you. > > Best Regards > > Arthur > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan_ml at behnel.de Fri Apr 8 01:08:50 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 08 Apr 2011 01:08:50 +0200 Subject: [Cython] [GSoC] Python backend for Cython using PyPy's FFI In-Reply-To: References: <20110407150110.GA13395@ubuntu> <4D9DE107.9040301@astro.uio.no> Message-ID: <4D9E4402.7040004@behnel.de> [fixed up the citation order] Romain, 07.04.2011 22:36: > 2011/4/7 Carl Witty > >> On Thu, Apr 7, 2011 at 9:06 AM, Dag Sverre Seljebotn wrote: >>> This seems similar to Carl Witty's port of Cython to .NET/IronPython. An >>> important insight from that project is that Cython code does NOT specify >> an >>> ABI, only an API which requires a C compiler to make sense. That is; many >>> wrapped C libraries have plenty of macros, we only require partial >>> definition of struct, we only require approximate typedef's, and so on. >>> >>> In the .NET port, the consequence was that rather than the original idea >> of >>> generating C# code (with FFI specifications) was dropped, and one instead >>> went with C++/CLR (which is a proper C++ compiler that really understands >>> the C side on an API level, in addition to giving access to the .NET >>> runtime). >> >> Let me just add that a way to deal with the API vs. ABI issue would be >> useful for other potential Cython targets as well, such as IronPython >> using C# and Jython. (A C# port for IronPython would be more valuable >> than my C++/CLI port because it would work under Mono -- Mono doesn't >> have a C++/CLI compiler and probably never will.) > > PyPy has functions to parse C headers to get macros and constants so I could > create C functions to wrap the macros and probably inline constants in the > Python part of the wrapper. This doesn't solve the problem of ifdefs but > this is a start. Yes, I think this is the only way this can be handled. In the worst case, you'd have to additionally fire up a real C preprocessor and let it parse the referenced header files in order to get at the platform specific declarations, which should then usually be good enough to figure out the ABI. Stefan From robertwb at math.washington.edu Fri Apr 8 01:08:59 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 7 Apr 2011 16:08:59 -0700 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. 
In-Reply-To: References: Message-ID: On Thu, Apr 7, 2011 at 2:31 PM, Arthur de Souza Ribeiro wrote: > I've submitted to google the link > is:?http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# > It would be really important if you could give me a feedback to my > proposal... > Thank you > Best Regards > Arthur Some quick points: - Python ships with extensive regression tests--use (and possibly augment) those to test your work rather than writing your own. - Three modules for a whole summer seems a bit weak, especially for someone who already knows Cython. Target at least one module/week seems like a good pace; some will be quickies, others might take 40+ hours. And I guarantee you'll get better and faster with practice. - Now that generators are supported, it could also be interesting to look at compiling all the non-C modules and fixing exposed bugs if any, but that might be out of scope. What I'd like to see is an implementation of a single simple but not entirely trivial (e.g. not math) module, passing regression tests with comprable if not better speed than the current C version (though I think it'd probably make sense to start out with the Python version and optimize that). E.g. http://docs.python.org/library/json.html looks like a good candidate. That should only take 8 hours or so, maybe two days at most, given your background. I'm not expecting anything before the application deadline, but if you could whip something like this out in the next week to point to that would help your application out immensely. In fact, one of the Python foundation's requirements is that students submit a patch before being accepted, and this would knock out that requirement and give you a chance to prove yourself. Create an account on https://github.com and commit your code into a new repository there. Hope that helps. - Robert > 2011/4/7 Arthur de Souza Ribeiro >> >> I've wrote a proposal to the project:?Reimplement C modules in CPython's >> standard library in Cython. >> I'd be glad if you could take a look a it and give me your feedback. >> the link for the proposal is:?http://wiki.cython.org/arthursribeiro >> Thank you. >> Best Regards >> Arthur > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > > From arthurdesribeiro at gmail.com Fri Apr 8 02:43:29 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Thu, 7 Apr 2011 21:43:29 -0300 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: Message-ID: 2011/4/7 Robert Bradshaw > On Thu, Apr 7, 2011 at 2:31 PM, Arthur de Souza Ribeiro > wrote: > > I've submitted to google the link > > is: > http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# > > It would be really important if you could give me a feedback to my > > proposal... > > Thank you > > Best Regards > > Arthur > > Some quick points: > > - Python ships with extensive regression tests--use (and possibly > augment) those to test your work rather than writing your own. > Thank you for that information Robert, I didn't realize this. > - Three modules for a whole summer seems a bit weak, especially for > someone who already knows Cython. Target at least one module/week > seems like a good pace; some will be quickies, others might take 40+ > hours. And I guarantee you'll get better and faster with practice. 
I'm going to refactor this Robert, as soon as I remake my project's roadmap I'll send to you again. > - Now that generators are supported, it could also be interesting to > look at compiling all the non-C modules and fixing exposed bugs if > any, but that might be out of scope. > I will try to take a look at this after implementing some cython code to a the module you suggested. > > What I'd like to see is an implementation of a single simple but not > entirely trivial (e.g. not math) module, passing regression tests with > comprable if not better speed than the current C version (though I > think it'd probably make sense to start out with the Python version > and optimize that). E.g. http://docs.python.org/library/json.html > looks like a good candidate. That should only take 8 hours or so, > maybe two days at most, given your background. I'm not expecting > anything before the application deadline, but if you could whip > something like this out in the next week to point to that would help > your application out immensely. In fact, one of the Python > foundation's requirements is that students submit a patch before being > accepted, and this would knock out that requirement and give you a > chance to prove yourself. Create an account on https://github.com and > commit your code into a new repository there. > > I will start the implementation of json module right now. I created my github account and as soon as I have code implemented I will send repository link. Thanks for all the points you listed, I will work on all of them and send an e-mail. Best Regards. []s Arthur Hope that helps. > > - Robert > > > > 2011/4/7 Arthur de Souza Ribeiro > >> > >> I've wrote a proposal to the project: Reimplement C modules in CPython's > >> standard library in Cython. > >> I'd be glad if you could take a look a it and give me your feedback. > >> the link for the proposal is: http://wiki.cython.org/arthursribeiro > >> Thank you. > >> Best Regards > >> Arthur > > > > _______________________________________________ > > cython-devel mailing list > > cython-devel at python.org > > http://mail.python.org/mailman/listinfo/cython-devel > > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From arthurdesribeiro at gmail.com Fri Apr 8 08:38:46 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Fri, 8 Apr 2011 03:38:46 -0300 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: Message-ID: I've made some changes to my proposal, as you said, I changed the number of modules I'm going to reimplement, I jumped from three to twelve modules, what modules are these and when I want to implement is described at: http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# and http://wiki.cython.org/arthursribeiro If you could take another look I would appreciate a lot. Best Regards. []s Arthur 2011/4/7 Arthur de Souza Ribeiro > > > 2011/4/7 Robert Bradshaw > >> On Thu, Apr 7, 2011 at 2:31 PM, Arthur de Souza Ribeiro >> wrote: >> > I've submitted to google the link >> > is: >> http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# >> > It would be really important if you could give me a feedback to my >> > proposal... >> > Thank you >> > Best Regards >> > Arthur >> >> Some quick points: >> >> - Python ships with extensive regression tests--use (and possibly >> augment) those to test your work rather than writing your own. 
>> > > Thank you for that information Robert, I didn't realize this. > > >> - Three modules for a whole summer seems a bit weak, especially for >> someone who already knows Cython. Target at least one module/week >> seems like a good pace; some will be quickies, others might take 40+ >> hours. And I guarantee you'll get better and faster with practice. > > > I'm going to refactor this Robert, as soon as I remake my project's > roadmap I'll send to you again. > > >> - Now that generators are supported, it could also be interesting to >> look at compiling all the non-C modules and fixing exposed bugs if >> any, but that might be out of scope. >> > > I will try to take a look at this after implementing some cython code to a > the module you suggested. > > >> >> What I'd like to see is an implementation of a single simple but not >> entirely trivial (e.g. not math) module, passing regression tests with >> comprable if not better speed than the current C version (though I >> think it'd probably make sense to start out with the Python version >> and optimize that). E.g. http://docs.python.org/library/json.html >> looks like a good candidate. That should only take 8 hours or so, >> maybe two days at most, given your background. I'm not expecting >> anything before the application deadline, but if you could whip >> something like this out in the next week to point to that would help >> your application out immensely. In fact, one of the Python >> foundation's requirements is that students submit a patch before being >> accepted, and this would knock out that requirement and give you a >> chance to prove yourself. Create an account on https://github.com and >> commit your code into a new repository there. >> >> > I will start the implementation of json module right now. I created my > github account and as soon as I have code implemented I will send repository > link. > > Thanks for all the points you listed, I will work on all of them and send > an e-mail. > > Best Regards. > > []s > > Arthur > > > Hope that helps. >> >> - Robert >> >> >> > 2011/4/7 Arthur de Souza Ribeiro >> >> >> >> I've wrote a proposal to the project: Reimplement C modules in >> CPython's >> >> standard library in Cython. >> >> I'd be glad if you could take a look a it and give me your feedback. >> >> the link for the proposal is: http://wiki.cython.org/arthursribeiro >> >> Thank you. >> >> Best Regards >> >> Arthur >> > >> > _______________________________________________ >> > cython-devel mailing list >> > cython-devel at python.org >> > http://mail.python.org/mailman/listinfo/cython-devel >> > >> > >> > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robertwb at math.washington.edu Fri Apr 8 10:50:45 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Fri, 8 Apr 2011 01:50:45 -0700 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: Message-ID: Looking better. I would add some details about how you plan to compare/profile your new implementations, and also at least add a note that compatibility will be ensured with the Python regression tests. It may make sense to let the exact list of modules be somewhat flexible, for example, based on feedback from the Python and Cython community on what would be the most worthwhile to Cythonize. Maybe the final milestone would be "re-implement several additional modules as chosen by the Python community to provide maximum value" or something like that. 
- Robert On Thu, Apr 7, 2011 at 11:38 PM, Arthur de Souza Ribeiro wrote: > I've made some changes to my proposal, as you said, I changed the number of > modules I'm going to reimplement, I jumped from three to twelve modules, > what modules are these and when I want to implement is described at: > http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# > and > http://wiki.cython.org/arthursribeiro > If you could take another look I would appreciate a lot. > Best Regards. > []s > Arthur > > 2011/4/7 Arthur de Souza Ribeiro >> >> >> 2011/4/7 Robert Bradshaw >>> >>> On Thu, Apr 7, 2011 at 2:31 PM, Arthur de Souza Ribeiro >>> wrote: >>> > I've submitted to google the link >>> > >>> > is:?http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# >>> > It would be really important if you could give me a feedback to my >>> > proposal... >>> > Thank you >>> > Best Regards >>> > Arthur >>> >>> Some quick points: >>> >>> - Python ships with extensive regression tests--use (and possibly >>> augment) those to test your work rather than writing your own. >> >> Thank you for that information Robert, I didn't realize this. >> >>> >>> - Three modules for a whole summer seems a bit weak, especially for >>> someone who already knows Cython. Target at least one module/week >>> seems like a good pace; some will be quickies, others might take 40+ >>> hours. And I guarantee you'll get better and faster with practice. >> >> I'm going to refactor this Robert, as soon as I ?remake my project's >> roadmap I'll send to you again. >> >>> >>> - Now that generators are supported, it could also be interesting to >>> look at compiling all the non-C modules and fixing exposed bugs if >>> any, but that might be out of scope. >> >> I will try to take a look at this after implementing some cython code to a >> the module you suggested. >> >>> >>> What I'd like to see is an implementation of a single simple but not >>> entirely trivial (e.g. not math) module, passing regression tests with >>> comprable if not better speed than the current C version (though I >>> think it'd probably make sense to start out with the Python version >>> and optimize that). E.g. http://docs.python.org/library/json.html >>> looks like a good candidate. That should only take 8 hours or so, >>> maybe two days at most, given your background. I'm not expecting >>> anything before the application deadline, but if you could whip >>> something like this out in the next week to point to that would help >>> your application out immensely. In fact, one of the Python >>> foundation's requirements is that students submit a patch before being >>> accepted, and this would knock out that requirement and give you a >>> chance to prove yourself. Create an account on https://github.com and >>> commit your code into a new repository there. >>> >> >> I will start the implementation of json module right now. I created my >> github account and as soon as I have code implemented I will send repository >> link. >> Thanks for all the points you listed, I will work on all of them and send >> an e-mail. >> Best Regards. >> []s >> Arthur >> >>> Hope that helps. >>> >>> - Robert >>> >>> >>> > 2011/4/7 Arthur de Souza Ribeiro >>> >> >>> >> I've wrote a proposal to the project:?Reimplement C modules in >>> >> CPython's >>> >> standard library in Cython. >>> >> I'd be glad if you could take a look a it and give me your feedback. >>> >> the link for the proposal is:?http://wiki.cython.org/arthursribeiro >>> >> Thank you. 
>>> >> Best Regards >>> >> Arthur >>> > >>> > _______________________________________________ >>> > cython-devel mailing list >>> > cython-devel at python.org >>> > http://mail.python.org/mailman/listinfo/cython-devel >>> > >>> > >> > > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > > From stefan_ml at behnel.de Fri Apr 8 08:50:35 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 08 Apr 2011 08:50:35 +0200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: Message-ID: <4D9EB03B.4020909@behnel.de> Robert Bradshaw, 08.04.2011 01:08: > On Thu, Apr 7, 2011 at 2:31 PM, Arthur de Souza Ribeiro wrote: >> I've submitted to google the link >> is: http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# >> It would be really important if you could give me a feedback to my >> proposal... >> Thank you >> Best Regards >> Arthur > > Some quick points: > > - Python ships with extensive regression tests--use (and possibly > augment) those to test your work rather than writing your own. > - Three modules for a whole summer seems a bit weak, especially for > someone who already knows Cython. Target at least one module/week > seems like a good pace; some will be quickies, others might take 40+ > hours. And I guarantee you'll get better and faster with practice. Absolutely. There certainly are tricky parts in the C code, and optimising the Cython/Python code won't come for free, either, but after a little bit of exercise this should run quite fluently. > - Now that generators are supported, it could also be interesting to > look at compiling all the non-C modules and fixing exposed bugs if > any, but that might be out of scope. > > What I'd like to see is an implementation of a single simple but not > entirely trivial (e.g. not math) module, passing regression tests with > comprable if not better speed than the current C version (though I > think it'd probably make sense to start out with the Python version > and optimize that). E.g. http://docs.python.org/library/json.html > looks like a good candidate. Right, that's a good one. Clearly more maintenance critical than purely time critical, and likely a good candidate for making it both more Python compatible (function argument handling?) and maybe even faster than the original. And if it can be implemented/optimised in Python syntax, that'd drop the maintenance overhead of the binary module by some 99%. > That should only take 8 hours or so, > maybe two days at most, given your background. I'm not expecting > anything before the application deadline, but if you could whip > something like this out in the next week to point to that would help > your application out immensely. +1 > In fact, one of the Python > foundation's requirements is that students submit a patch before being > accepted, and this would knock out that requirement and give you a > chance to prove yourself. Create an account on https://github.com and > commit your code into a new repository there. Maybe even clone it from CPython's own stdlib repository in hg. Stefan From arthurdesribeiro at gmail.com Fri Apr 8 19:31:12 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Fri, 8 Apr 2011 14:31:12 -0300 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. 
In-Reply-To: References: Message-ID: 2011/4/8 Robert Bradshaw > Looking better. I would add some details about how you plan to > compare/profile your new implementations, and also at least add a note > that compatibility will be ensured with the Python regression tests. > I'm going to update this on the wiki page, it's just because the application period in google-melange ends today and it's not recommended to put details os how you plan to make the project in there. About the comparison, I wonder talk to the community a little more about it, because I see in cython's web page and in the tutorials numbers telling how much more efficient cython is against python, so I planned to use the same strategy that is used to show the numbers that are there. And about the tests, I'm studying how can I check compatibility and do these tasks to put that on wiki too. > > It may make sense to let the exact list of modules be somewhat > flexible, for example, based on feedback from the Python and Cython > community on what would be the most worthwhile to Cythonize. Maybe the > final milestone would be "re-implement several additional modules as > chosen by the Python community to provide maximum value" or something > like that. > You mean just to tell how many modules I'm going to re-implement, but not telling what modules are these? Re-implementing by community demand? Thank you very much again. Best Regards. []s Arthur > > - Robert > > On Thu, Apr 7, 2011 at 11:38 PM, Arthur de Souza Ribeiro > wrote: > > I've made some changes to my proposal, as you said, I changed the number > of > > modules I'm going to reimplement, I jumped from three to twelve modules, > > what modules are these and when I want to implement is described at: > > > http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# > > and > > http://wiki.cython.org/arthursribeiro > > If you could take another look I would appreciate a lot. > > Best Regards. > > []s > > Arthur > > > > 2011/4/7 Arthur de Souza Ribeiro > >> > >> > >> 2011/4/7 Robert Bradshaw > >>> > >>> On Thu, Apr 7, 2011 at 2:31 PM, Arthur de Souza Ribeiro > >>> wrote: > >>> > I've submitted to google the link > >>> > > >>> > is: > http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# > >>> > It would be really important if you could give me a feedback to my > >>> > proposal... > >>> > Thank you > >>> > Best Regards > >>> > Arthur > >>> > >>> Some quick points: > >>> > >>> - Python ships with extensive regression tests--use (and possibly > >>> augment) those to test your work rather than writing your own. > >> > >> Thank you for that information Robert, I didn't realize this. > >> > >>> > >>> - Three modules for a whole summer seems a bit weak, especially for > >>> someone who already knows Cython. Target at least one module/week > >>> seems like a good pace; some will be quickies, others might take 40+ > >>> hours. And I guarantee you'll get better and faster with practice. > >> > >> I'm going to refactor this Robert, as soon as I remake my project's > >> roadmap I'll send to you again. > >> > >>> > >>> - Now that generators are supported, it could also be interesting to > >>> look at compiling all the non-C modules and fixing exposed bugs if > >>> any, but that might be out of scope. > >> > >> I will try to take a look at this after implementing some cython code to > a > >> the module you suggested. > >> > >>> > >>> What I'd like to see is an implementation of a single simple but not > >>> entirely trivial (e.g. 
not math) module, passing regression tests with > >>> comprable if not better speed than the current C version (though I > >>> think it'd probably make sense to start out with the Python version > >>> and optimize that). E.g. http://docs.python.org/library/json.html > >>> looks like a good candidate. That should only take 8 hours or so, > >>> maybe two days at most, given your background. I'm not expecting > >>> anything before the application deadline, but if you could whip > >>> something like this out in the next week to point to that would help > >>> your application out immensely. In fact, one of the Python > >>> foundation's requirements is that students submit a patch before being > >>> accepted, and this would knock out that requirement and give you a > >>> chance to prove yourself. Create an account on https://github.com and > >>> commit your code into a new repository there. > >>> > >> > >> I will start the implementation of json module right now. I created my > >> github account and as soon as I have code implemented I will send > repository > >> link. > >> Thanks for all the points you listed, I will work on all of them and > send > >> an e-mail. > >> Best Regards. > >> []s > >> Arthur > >> > >>> Hope that helps. > >>> > >>> - Robert > >>> > >>> > >>> > 2011/4/7 Arthur de Souza Ribeiro > >>> >> > >>> >> I've wrote a proposal to the project: Reimplement C modules in > >>> >> CPython's > >>> >> standard library in Cython. > >>> >> I'd be glad if you could take a look a it and give me your feedback. > >>> >> the link for the proposal is: http://wiki.cython.org/arthursribeiro > >>> >> Thank you. > >>> >> Best Regards > >>> >> Arthur > >>> > > >>> > _______________________________________________ > >>> > cython-devel mailing list > >>> > cython-devel at python.org > >>> > http://mail.python.org/mailman/listinfo/cython-devel > >>> > > >>> > > >> > > > > > > _______________________________________________ > > cython-devel mailing list > > cython-devel at python.org > > http://mail.python.org/mailman/listinfo/cython-devel > > > > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robertwb at math.washington.edu Fri Apr 8 19:40:55 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Fri, 8 Apr 2011 10:40:55 -0700 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: Message-ID: On Fri, Apr 8, 2011 at 10:31 AM, Arthur de Souza Ribeiro wrote: > > > 2011/4/8 Robert Bradshaw >> >> Looking better. I would add some details about how you plan to >> compare/profile your new implementations, and also at least add a note >> that compatibility will be ensured with the Python regression tests. > > I'm going to update this on the wiki page, it's just because the application > period in google-melange ends today and it's not recommended to put details > os how you plan to make the project in there. About the comparison, I wonder > talk to the community a little more about it, because I see in cython's web > page and in the tutorials numbers telling how much more efficient cython is > against python, so I planned to use the same strategy that is used to show > the numbers that are there. And about the tests, I'm studying how can I > check compatibility and do these tasks to put that on wiki too. 
> >> >> It may make sense to let the exact list of modules be somewhat >> flexible, for example, based on feedback from the Python and Cython >> community on what would be the most worthwhile to Cythonize. Maybe the >> final milestone would be "re-implement several additional modules as >> chosen by the Python community to provide maximum value" or something >> like that. > > You mean just to tell how many modules I'm going to re-implement, but not > telling what modules are these? Re-implementing by community demand? Yes, exactly (for the last milestone, I think it's good to have more direction at the start as well as have something to point to when soliciting feedback. Maybe say "at least three" as it might be some big ones or a handful of little ones. You, I, and everyone else will have a better idea of what'll be most profitable at this point. > Thank you very much again. No problem, thanks for your interest. > Best Regards. > []s > Arthur > >> >> - Robert >> >> On Thu, Apr 7, 2011 at 11:38 PM, Arthur de Souza Ribeiro >> wrote: >> > I've made some changes to my proposal, as you said, I changed the number >> > of >> > modules I'm going to reimplement, I jumped from three to twelve modules, >> > what modules are these and when I want to implement is described at: >> > >> > http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# >> > and >> > http://wiki.cython.org/arthursribeiro >> > If you could take another look I would appreciate a lot. >> > Best Regards. >> > []s >> > Arthur >> > >> > 2011/4/7 Arthur de Souza Ribeiro >> >> >> >> >> >> 2011/4/7 Robert Bradshaw >> >>> >> >>> On Thu, Apr 7, 2011 at 2:31 PM, Arthur de Souza Ribeiro >> >>> wrote: >> >>> > I've submitted to google the link >> >>> > >> >>> > >> >>> > is:?http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# >> >>> > It would be really important if you could give me a feedback to my >> >>> > proposal... >> >>> > Thank you >> >>> > Best Regards >> >>> > Arthur >> >>> >> >>> Some quick points: >> >>> >> >>> - Python ships with extensive regression tests--use (and possibly >> >>> augment) those to test your work rather than writing your own. >> >> >> >> Thank you for that information Robert, I didn't realize this. >> >> >> >>> >> >>> - Three modules for a whole summer seems a bit weak, especially for >> >>> someone who already knows Cython. Target at least one module/week >> >>> seems like a good pace; some will be quickies, others might take 40+ >> >>> hours. And I guarantee you'll get better and faster with practice. >> >> >> >> I'm going to refactor this Robert, as soon as I ?remake my project's >> >> roadmap I'll send to you again. >> >> >> >>> >> >>> - Now that generators are supported, it could also be interesting to >> >>> look at compiling all the non-C modules and fixing exposed bugs if >> >>> any, but that might be out of scope. >> >> >> >> I will try to take a look at this after implementing some cython code >> >> to a >> >> the module you suggested. >> >> >> >>> >> >>> What I'd like to see is an implementation of a single simple but not >> >>> entirely trivial (e.g. not math) module, passing regression tests with >> >>> comprable if not better speed than the current C version (though I >> >>> think it'd probably make sense to start out with the Python version >> >>> and optimize that). E.g. http://docs.python.org/library/json.html >> >>> looks like a good candidate. That should only take 8 hours or so, >> >>> maybe two days at most, given your background. 
I'm not expecting >> >>> anything before the application deadline, but if you could whip >> >>> something like this out in the next week to point to that would help >> >>> your application out immensely. In fact, one of the Python >> >>> foundation's requirements is that students submit a patch before being >> >>> accepted, and this would knock out that requirement and give you a >> >>> chance to prove yourself. Create an account on https://github.com and >> >>> commit your code into a new repository there. >> >>> >> >> >> >> I will start the implementation of json module right now. I created my >> >> github account and as soon as I have code implemented I will send >> >> repository >> >> link. >> >> Thanks for all the points you listed, I will work on all of them and >> >> send >> >> an e-mail. >> >> Best Regards. >> >> []s >> >> Arthur >> >> >> >>> Hope that helps. >> >>> >> >>> - Robert >> >>> >> >>> >> >>> > 2011/4/7 Arthur de Souza Ribeiro >> >>> >> >> >>> >> I've wrote a proposal to the project:?Reimplement C modules in >> >>> >> CPython's >> >>> >> standard library in Cython. >> >>> >> I'd be glad if you could take a look a it and give me your >> >>> >> feedback. >> >>> >> the link for the proposal is:?http://wiki.cython.org/arthursribeiro >> >>> >> Thank you. >> >>> >> Best Regards >> >>> >> Arthur >> >>> > >> >>> > _______________________________________________ >> >>> > cython-devel mailing list >> >>> > cython-devel at python.org >> >>> > http://mail.python.org/mailman/listinfo/cython-devel >> >>> > >> >>> > >> >> >> > >> > >> > _______________________________________________ >> > cython-devel mailing list >> > cython-devel at python.org >> > http://mail.python.org/mailman/listinfo/cython-devel >> > >> > >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel > > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > > From arthurdesribeiro at gmail.com Fri Apr 8 19:59:36 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Fri, 8 Apr 2011 14:59:36 -0300 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: Message-ID: The moduels suggested for the first two milestones you think are ok? Best Regards.. []s Arthur 2011/4/8 Robert Bradshaw > On Fri, Apr 8, 2011 at 10:31 AM, Arthur de Souza Ribeiro > wrote: > > > > > > 2011/4/8 Robert Bradshaw > >> > >> Looking better. I would add some details about how you plan to > >> compare/profile your new implementations, and also at least add a note > >> that compatibility will be ensured with the Python regression tests. > > > > I'm going to update this on the wiki page, it's just because the > application > > period in google-melange ends today and it's not recommended to put > details > > os how you plan to make the project in there. About the comparison, I > wonder > > talk to the community a little more about it, because I see in cython's > web > > page and in the tutorials numbers telling how much more efficient cython > is > > against python, so I planned to use the same strategy that is used to > show > > the numbers that are there. And about the tests, I'm studying how can I > > check compatibility and do these tasks to put that on wiki too. 
> > > >> > >> It may make sense to let the exact list of modules be somewhat > >> flexible, for example, based on feedback from the Python and Cython > >> community on what would be the most worthwhile to Cythonize. Maybe the > >> final milestone would be "re-implement several additional modules as > >> chosen by the Python community to provide maximum value" or something > >> like that. > > > > You mean just to tell how many modules I'm going to re-implement, but not > > telling what modules are these? Re-implementing by community demand? > > Yes, exactly (for the last milestone, I think it's good to have more > direction at the start as well as have something to point to when > soliciting feedback. Maybe say "at least three" as it might be some > big ones or a handful of little ones. You, I, and everyone else will > have a better idea of what'll be most profitable at this point. > > > Thank you very much again. > > No problem, thanks for your interest. > > > Best Regards. > > []s > > Arthur > > > >> > >> - Robert > >> > >> On Thu, Apr 7, 2011 at 11:38 PM, Arthur de Souza Ribeiro > >> wrote: > >> > I've made some changes to my proposal, as you said, I changed the > number > >> > of > >> > modules I'm going to reimplement, I jumped from three to twelve > modules, > >> > what modules are these and when I want to implement is described at: > >> > > >> > > http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# > >> > and > >> > http://wiki.cython.org/arthursribeiro > >> > If you could take another look I would appreciate a lot. > >> > Best Regards. > >> > []s > >> > Arthur > >> > > >> > 2011/4/7 Arthur de Souza Ribeiro > >> >> > >> >> > >> >> 2011/4/7 Robert Bradshaw > >> >>> > >> >>> On Thu, Apr 7, 2011 at 2:31 PM, Arthur de Souza Ribeiro > >> >>> wrote: > >> >>> > I've submitted to google the link > >> >>> > > >> >>> > > >> >>> > is: > http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/arthur_sr/1# > >> >>> > It would be really important if you could give me a feedback to my > >> >>> > proposal... > >> >>> > Thank you > >> >>> > Best Regards > >> >>> > Arthur > >> >>> > >> >>> Some quick points: > >> >>> > >> >>> - Python ships with extensive regression tests--use (and possibly > >> >>> augment) those to test your work rather than writing your own. > >> >> > >> >> Thank you for that information Robert, I didn't realize this. > >> >> > >> >>> > >> >>> - Three modules for a whole summer seems a bit weak, especially for > >> >>> someone who already knows Cython. Target at least one module/week > >> >>> seems like a good pace; some will be quickies, others might take 40+ > >> >>> hours. And I guarantee you'll get better and faster with practice. > >> >> > >> >> I'm going to refactor this Robert, as soon as I remake my project's > >> >> roadmap I'll send to you again. > >> >> > >> >>> > >> >>> - Now that generators are supported, it could also be interesting to > >> >>> look at compiling all the non-C modules and fixing exposed bugs if > >> >>> any, but that might be out of scope. > >> >> > >> >> I will try to take a look at this after implementing some cython code > >> >> to a > >> >> the module you suggested. > >> >> > >> >>> > >> >>> What I'd like to see is an implementation of a single simple but not > >> >>> entirely trivial (e.g. 
not math) module, passing regression tests > with > >> >>> comprable if not better speed than the current C version (though I > >> >>> think it'd probably make sense to start out with the Python version > >> >>> and optimize that). E.g. http://docs.python.org/library/json.html > >> >>> looks like a good candidate. That should only take 8 hours or so, > >> >>> maybe two days at most, given your background. I'm not expecting > >> >>> anything before the application deadline, but if you could whip > >> >>> something like this out in the next week to point to that would help > >> >>> your application out immensely. In fact, one of the Python > >> >>> foundation's requirements is that students submit a patch before > being > >> >>> accepted, and this would knock out that requirement and give you a > >> >>> chance to prove yourself. Create an account on https://github.comand > >> >>> commit your code into a new repository there. > >> >>> > >> >> > >> >> I will start the implementation of json module right now. I created > my > >> >> github account and as soon as I have code implemented I will send > >> >> repository > >> >> link. > >> >> Thanks for all the points you listed, I will work on all of them and > >> >> send > >> >> an e-mail. > >> >> Best Regards. > >> >> []s > >> >> Arthur > >> >> > >> >>> Hope that helps. > >> >>> > >> >>> - Robert > >> >>> > >> >>> > >> >>> > 2011/4/7 Arthur de Souza Ribeiro > >> >>> >> > >> >>> >> I've wrote a proposal to the project: Reimplement C modules in > >> >>> >> CPython's > >> >>> >> standard library in Cython. > >> >>> >> I'd be glad if you could take a look a it and give me your > >> >>> >> feedback. > >> >>> >> the link for the proposal is: > http://wiki.cython.org/arthursribeiro > >> >>> >> Thank you. > >> >>> >> Best Regards > >> >>> >> Arthur > >> >>> > > >> >>> > _______________________________________________ > >> >>> > cython-devel mailing list > >> >>> > cython-devel at python.org > >> >>> > http://mail.python.org/mailman/listinfo/cython-devel > >> >>> > > >> >>> > > >> >> > >> > > >> > > >> > _______________________________________________ > >> > cython-devel mailing list > >> > cython-devel at python.org > >> > http://mail.python.org/mailman/listinfo/cython-devel > >> > > >> > > >> _______________________________________________ > >> cython-devel mailing list > >> cython-devel at python.org > >> http://mail.python.org/mailman/listinfo/cython-devel > > > > > > _______________________________________________ > > cython-devel mailing list > > cython-devel at python.org > > http://mail.python.org/mailman/listinfo/cython-devel > > > > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > -------------- next part -------------- An HTML attachment was scrubbed... URL: From markflorisson88 at gmail.com Fri Apr 8 20:03:40 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Fri, 8 Apr 2011 20:03:40 +0200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: Message-ID: On 8 April 2011 19:59, Arthur de Souza Ribeiro wrote: > The moduels suggested for the first two milestones you think are ok? > Best Regards.. > []s > Arthur You mention the 'dis' module, but isn't that one (and 'opcode' too) entirely written in Python? 
From arthurdesribeiro at gmail.com Fri Apr 8 20:12:00 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Fri, 8 Apr 2011 15:12:00 -0300 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: Message-ID: My mistake, I just typed wrong, I was talking about the nis one... By the way, I changed this module to the array one... Best Regards. Arthur 2011/4/8 mark florisson > On 8 April 2011 19:59, Arthur de Souza Ribeiro > wrote: > > The moduels suggested for the first two milestones you think are ok? > > Best Regards.. > > []s > > Arthur > > You mention the 'dis' module, but isn't that one (and 'opcode' too) > entirely written in Python? > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > -------------- next part -------------- An HTML attachment was scrubbed... URL: From markflorisson88 at gmail.com Fri Apr 8 23:38:56 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Fri, 8 Apr 2011 23:38:56 +0200 Subject: [Cython] GSoC Proposal for Supporting Parallelism, Fused Types and Typed Views on Memory Message-ID: Hey, My GSoC proposal can be found here: http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/markflorisson88/1 . It's about implementing the prange CEP (524), Fused Types (522) and Typed Memory Views (517). I really hope to participate this year, but due to time constraints I may not find the required time to actually complete the GSoC, so next week I will decide whether I'll chicken out or not. Cheers, Mark From jason-sage at creativetrax.com Sat Apr 9 13:49:51 2011 From: jason-sage at creativetrax.com (Jason Grout) Date: Sat, 09 Apr 2011 06:49:51 -0500 Subject: [Cython] cython-docs repository Message-ID: <4DA047DF.9040000@creativetrax.com> What is the relationship between the cython-docs repository and the docs/ subdirectory of the cython repository? I see a recent commit [1] that seems to indicate that cython-docs has been merged into the main cython repository (+1 from me, for what it's worth). Is my interpretation correct, and is cython-docs now deprecated? I submitted a pull request for some typos [2], but I'm not sure if I should have submitted that pull request to the cython-docs repository. Thanks, Jason [1] https://github.com/cython/cython/commit/2bcb14fff262a9a9c7b50bacb360bd983e6a92ee [2] https://github.com/cython/cython/pull/22 -- Jason Grout From robertwb at math.washington.edu Sat Apr 9 19:02:52 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Sat, 9 Apr 2011 10:02:52 -0700 Subject: [Cython] cython-docs repository In-Reply-To: <4DA047DF.9040000@creativetrax.com> References: <4DA047DF.9040000@creativetrax.com> Message-ID: On Sat, Apr 9, 2011 at 4:49 AM, Jason Grout wrote: > What is the relationship between the cython-docs repository and the docs/ > subdirectory of the cython repository? ?I see a recent commit [1] that seems > to indicate that cython-docs has been merged into the main cython repository > (+1 from me, for what it's worth). ?Is my interpretation correct, and is > cython-docs now deprecated? Yep, we did that during the workshop. I thought I had sent out an announcement, but I guess not. > I submitted a pull request for some typos [2], but I'm not sure if I should > have submitted that pull request to the cython-docs repository. Thanks, I'll take a look. 
- Robert From jason-sage at creativetrax.com Sat Apr 9 19:13:35 2011 From: jason-sage at creativetrax.com (Jason Grout) Date: Sat, 09 Apr 2011 12:13:35 -0500 Subject: [Cython] cython-docs repository In-Reply-To: References: <4DA047DF.9040000@creativetrax.com> Message-ID: <4DA093BF.7050306@creativetrax.com> On 4/9/11 12:02 PM, Robert Bradshaw wrote: > Yep, we did that during the workshop. I thought I had sent out an > announcement, but I guess not. Is there a summary anywhere of the exciting things that happened in the workshop? For example, it seems that generators are finally in, if I read the commit logs correctly. Is that true? If so, fantastic! Any idea of a timeline for that to make it into an official release? Thanks, Jason From markflorisson88 at gmail.com Mon Apr 11 10:45:34 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 11 Apr 2011 10:45:34 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4D9B7BA9.2060509@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> Message-ID: On 5 April 2011 22:29, Dag Sverre Seljebotn wrote: > I've done a pretty major revision to the prange CEP, bringing in a lot of > the feedback. > > Thread-private variables are now split in two cases: > > ?i) The safe cases, which really require very little technical knowledge -> > automatically inferred > > ?ii) As an advanced feature, unsafe cases that requires some knowledge of > threading -> must be explicitly declared > > I think this split simplifies things a great deal. Can't we obsolete the declaration entirely by assigning to variables that need to have firstprivate behaviour inside the with parallel block? Basically in the same way the scratch space is used. The only problem with that is that it won't be lastprivate, so the value will be undefined after the parallel block (but not after the worksharing loop). cdef int myvariable with nogil, parallel: myvariable = 2 for i in prange(...): use myvariable maybe assign to myvariable # myvariable is well-defined here # myvariable is not well-defined here If you still desperately want lastprivate behaviour you can simply assign myvariable to another variable in the loop body. > I'm rather excited over this now; this could turn out to be a really > user-friendly and safe feature that would not only allow us to support > OpenMP-like threading, but be more convenient to use in a range of common > cases. > > http://wiki.cython.org/enhancements/prange > > Dag Sverre > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > > From d.s.seljebotn at astro.uio.no Mon Apr 11 11:10:52 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 11 Apr 2011 11:10:52 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> Message-ID: <4DA2C59C.8080703@astro.uio.no> On 04/11/2011 10:45 AM, mark florisson wrote: > On 5 April 2011 22:29, Dag Sverre Seljebotn wrote: >> I've done a pretty major revision to the prange CEP, bringing in a lot of >> the feedback. >> >> Thread-private variables are now split in two cases: >> >> i) The safe cases, which really require very little technical knowledge -> >> automatically inferred >> >> ii) As an advanced feature, unsafe cases that requires some knowledge of >> threading -> must be explicitly declared >> >> I think this split simplifies things a great deal. 
> > Can't we obsolete the declaration entirely by assigning to variables > that need to have firstprivate behaviour inside the with parallel > block? Basically in the same way the scratch space is used. The only > problem with that is that it won't be lastprivate, so the value will > be undefined after the parallel block (but not after the worksharing > loop). > > cdef int myvariable > > with nogil, parallel: > myvariable = 2 > for i in prange(...): > use myvariable > maybe assign to myvariable > > # myvariable is well-defined here > > # myvariable is not well-defined here > > If you still desperately want lastprivate behaviour you can simply > assign myvariable to another variable in the loop body. I don't care about lastprivate, I don't think that is an issue, as you say. My problem with this is that it means going into an area where possibly tricky things are implicit rather than explicit. I also see this as a rather special case that will be seldomly used, and implicit behaviour is more difficult to justify because of that. (The other instance of thread-local variables I feel is still explicit: You use prange instead of range, which means that you declare that values created in the iteration does not leak to the next iteration. The rest is just optimization from there.) As Robert said in his recent talk: A lot of languages are easy to write. The advantage of Python is that it is easy to *read*. That's what I feel is wrong with the proposal above: An assignment to a variable changes the semantics of it. Granted, it happens in a way so that it will almost always be correct, but I feel that reading the code, I'd spend some extra cycles to go "ah, so this variable is thread-local and therefore its values survive across a loop iteration". If I even knew about the feature in the first place. In seeing "threadprivate" spelled out, it is either obvious what it means, or obvious that I should look up the docs. There's *a lot* of things that can be made implicit in a programming language; Python/Cython simply usually leans towards the explicit side. Oh, and we may want to support writable shared variables (and flush) eventually too, and the above doesn't easily differentiate there? That's just my opinion, I'm happy to be overruled here. Dag Sverre From markflorisson88 at gmail.com Mon Apr 11 11:41:14 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 11 Apr 2011 11:41:14 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4DA2C59C.8080703@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA2C59C.8080703@astro.uio.no> Message-ID: On 11 April 2011 11:10, Dag Sverre Seljebotn wrote: > On 04/11/2011 10:45 AM, mark florisson wrote: >> >> On 5 April 2011 22:29, Dag Sverre Seljebotn >> ?wrote: >>> >>> I've done a pretty major revision to the prange CEP, bringing in a lot of >>> the feedback. >>> >>> Thread-private variables are now split in two cases: >>> >>> ?i) The safe cases, which really require very little technical knowledge >>> -> >>> automatically inferred >>> >>> ?ii) As an advanced feature, unsafe cases that requires some knowledge of >>> threading -> ?must be explicitly declared >>> >>> I think this split simplifies things a great deal. >> >> Can't we obsolete the declaration entirely by assigning to variables >> that need to have firstprivate behaviour inside the with parallel >> block? Basically in the same way the scratch space is used. 
The only >> problem with that is that it won't be lastprivate, so the value will >> be undefined after the parallel block (but not after the worksharing >> loop). >> >> cdef int myvariable >> >> with nogil, parallel: >> ? ? myvariable = 2 >> ? ? for i in prange(...): >> ? ? ? ? use myvariable >> ? ? ? ? maybe assign to myvariable >> >> ? ? # myvariable is well-defined here >> >> # myvariable is not well-defined here >> >> If you still desperately want lastprivate behaviour you can simply >> assign myvariable to another variable in the loop body. > > I don't care about lastprivate, I don't think that is an issue, as you say. > > My problem with this is that it means going into an area where possibly > tricky things are implicit rather than explicit. I also see this as a rather > special case that will be seldomly used, and implicit behaviour is more > difficult to justify because of that. Indeed, I actually considered if we should support firstprivate at all, as it's really about "being firstprivate and lastprivate". Without any declaration, you can have firstprivate or lastprivate, but not both :) So I agree that supporting such a (probably) uncommon case is better left explicit. On the other hand it seems silly to have support for such a weird case. > (The other instance of thread-local variables I feel is still explicit: You > use prange instead of range, which means that you declare that values > created in the iteration does not leak to the next iteration. The rest is > just optimization from there.) > > As Robert said in his recent talk: A lot of languages are easy to write. The > advantage of Python is that it is easy to *read*. That's what I feel is > wrong with the proposal above: An assignment to a variable changes the > semantics of it. Granted, it happens in a way so that it will almost always > be correct, but I feel that reading the code, I'd spend some extra cycles to > go "ah, so this variable is thread-local and therefore its values survive > across a loop iteration". > > If I even knew about the feature in the first place. In seeing > "threadprivate" spelled out, it is either obvious what it means, or obvious > that I should look up the docs. > > There's *a lot* of things that can be made implicit in a programming > language; Python/Cython simply usually leans towards the explicit side. > > Oh, and we may want to support writable shared variables (and flush) > eventually too, and the above doesn't easily differentiate there? Right, everything is implicit. So I guess it'll be good to introduce it anyway as you say, so we can later declare stuff shared with similar syntax. I suppose that's the point where I'm convinced. > That's just my opinion, I'm happy to be overruled here. 
> > Dag Sverre > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From d.s.seljebotn at astro.uio.no Mon Apr 11 12:08:11 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 11 Apr 2011 12:08:11 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA2C59C.8080703@astro.uio.no> Message-ID: <4DA2D30B.8040209@astro.uio.no> On 04/11/2011 11:41 AM, mark florisson wrote: > On 11 April 2011 11:10, Dag Sverre Seljebotn wrote: >> On 04/11/2011 10:45 AM, mark florisson wrote: >>> >>> On 5 April 2011 22:29, Dag Sverre Seljebotn >>> wrote: >>>> >>>> I've done a pretty major revision to the prange CEP, bringing in a lot of >>>> the feedback. >>>> >>>> Thread-private variables are now split in two cases: >>>> >>>> i) The safe cases, which really require very little technical knowledge >>>> -> >>>> automatically inferred >>>> >>>> ii) As an advanced feature, unsafe cases that requires some knowledge of >>>> threading -> must be explicitly declared >>>> >>>> I think this split simplifies things a great deal. >>> >>> Can't we obsolete the declaration entirely by assigning to variables >>> that need to have firstprivate behaviour inside the with parallel >>> block? Basically in the same way the scratch space is used. The only >>> problem with that is that it won't be lastprivate, so the value will >>> be undefined after the parallel block (but not after the worksharing >>> loop). >>> >>> cdef int myvariable >>> >>> with nogil, parallel: >>> myvariable = 2 >>> for i in prange(...): >>> use myvariable >>> maybe assign to myvariable >>> >>> # myvariable is well-defined here >>> >>> # myvariable is not well-defined here >>> >>> If you still desperately want lastprivate behaviour you can simply >>> assign myvariable to another variable in the loop body. >> >> I don't care about lastprivate, I don't think that is an issue, as you say. >> >> My problem with this is that it means going into an area where possibly >> tricky things are implicit rather than explicit. I also see this as a rather >> special case that will be seldomly used, and implicit behaviour is more >> difficult to justify because of that. > > Indeed, I actually considered if we should support firstprivate at > all, as it's really about "being firstprivate and lastprivate". > Without any declaration, you can have firstprivate or lastprivate, but > not both :) So I agree that supporting such a (probably) uncommon case > is better left explicit. On the other hand it seems silly to have > support for such a weird case. Well, I actually need to do the per-thread cache thing I described in the CEP in my own codes, so it's not *that* special; it'd be nice to support it. OTOH I *could* work around it by having an array of scalars cdef int[:] old_ell = int[:numthreads]() ... if old_ell[threadid()] != ell: ... So I guess, it's at least on the bottom of list of priorities in that CEP. 
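To spell that out a bit -- purely a sketch against the CEP's proposed numthreads/threadid and memoryview spellings, none of which is implemented yet, and with ell_of(), compute_alpha(), use(), n and out as placeholder names:

    cdef int[:] old_ell = int[:numthreads]()
    cdef double[:] alpha = double[:numthreads]()
    cdef int i, ell
    old_ell[:] = -1          # sentinel: nothing cached for this thread yet

    with nogil:
        for i in prange(n):
            ell = ell_of(i)                            # placeholder
            if old_ell[threadid()] != ell:
                old_ell[threadid()] = ell
                alpha[threadid()] = compute_alpha(ell) # the expensive part
            out[i] = use(i, alpha[threadid()])         # placeholder

The point is only that the expensive recomputation happens when ell actually changes within a given thread, and nothing has to survive past the loop.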
Dag Sverre From markflorisson88 at gmail.com Mon Apr 11 12:14:36 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 11 Apr 2011 12:14:36 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4DA2D30B.8040209@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA2C59C.8080703@astro.uio.no> <4DA2D30B.8040209@astro.uio.no> Message-ID: On 11 April 2011 12:08, Dag Sverre Seljebotn wrote: > On 04/11/2011 11:41 AM, mark florisson wrote: >> >> On 11 April 2011 11:10, Dag Sverre Seljebotn >> ?wrote: >>> >>> On 04/11/2011 10:45 AM, mark florisson wrote: >>>> >>>> On 5 April 2011 22:29, Dag Sverre Seljebotn >>>> ?wrote: >>>>> >>>>> I've done a pretty major revision to the prange CEP, bringing in a lot >>>>> of >>>>> the feedback. >>>>> >>>>> Thread-private variables are now split in two cases: >>>>> >>>>> ?i) The safe cases, which really require very little technical >>>>> knowledge >>>>> -> >>>>> automatically inferred >>>>> >>>>> ?ii) As an advanced feature, unsafe cases that requires some knowledge >>>>> of >>>>> threading -> ? ?must be explicitly declared >>>>> >>>>> I think this split simplifies things a great deal. >>>> >>>> Can't we obsolete the declaration entirely by assigning to variables >>>> that need to have firstprivate behaviour inside the with parallel >>>> block? Basically in the same way the scratch space is used. The only >>>> problem with that is that it won't be lastprivate, so the value will >>>> be undefined after the parallel block (but not after the worksharing >>>> loop). >>>> >>>> cdef int myvariable >>>> >>>> with nogil, parallel: >>>> ? ? myvariable = 2 >>>> ? ? for i in prange(...): >>>> ? ? ? ? use myvariable >>>> ? ? ? ? maybe assign to myvariable >>>> >>>> ? ? # myvariable is well-defined here >>>> >>>> # myvariable is not well-defined here >>>> >>>> If you still desperately want lastprivate behaviour you can simply >>>> assign myvariable to another variable in the loop body. >>> >>> I don't care about lastprivate, I don't think that is an issue, as you >>> say. >>> >>> My problem with this is that it means going into an area where possibly >>> tricky things are implicit rather than explicit. I also see this as a >>> rather >>> special case that will be seldomly used, and implicit behaviour is more >>> difficult to justify because of that. >> >> Indeed, I actually considered if we should support firstprivate at >> all, as it's really about "being firstprivate and lastprivate". >> Without any declaration, you can have firstprivate or lastprivate, but >> not both :) So I agree that supporting such a (probably) uncommon case >> is better left explicit. On the other hand it seems silly to have >> support for such a weird case. > > Well, I actually need to do the per-thread cache thing I described in the > CEP in my own codes, so it's not *that* special; it'd be nice to support it. You need 'old_ell' and 'alpha' after the loop? > OTOH I *could* work around it by having an array of scalars > > cdef int[:] old_ell = int[:numthreads]() > > ... > ? ?if old_ell[threadid()] != ell: ... > > > So I guess, it's at least on the bottom of list of priorities in that CEP. 
> > Dag Sverre > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Mon Apr 11 12:26:03 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 11 Apr 2011 12:26:03 +0200 Subject: [Cython] Test runner Message-ID: Can we select tests in the tests directory selectively? I see the -T or --ticket option, but it doens't seem to find the test tagged with # ticket: . I can select unit tests using python runtests.py Cython.SubPackage.Tests.SomeTest, but I can't seem to do the same thing for tests in the tests directory. Running the entire suite takes rather long. From stefan_ml at behnel.de Mon Apr 11 12:45:28 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 11 Apr 2011 12:45:28 +0200 Subject: [Cython] Test runner In-Reply-To: References: Message-ID: <4DA2DBC8.1010007@behnel.de> mark florisson, 11.04.2011 12:26: > Can we select tests in the tests directory selectively? I see the -T > or --ticket option, but it doens't seem to find the test tagged with # > ticket:. > > I can select unit tests using python runtests.py > Cython.SubPackage.Tests.SomeTest, but I can't seem to do the same > thing for tests in the tests directory. Running the entire suite takes > rather long. You can still select them by name using a regex, e.g. runtests.py 'run\.empty_builtin_constructors' Stefan From markflorisson88 at gmail.com Mon Apr 11 12:53:49 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 11 Apr 2011 12:53:49 +0200 Subject: [Cython] Test runner In-Reply-To: <4DA2DBC8.1010007@behnel.de> References: <4DA2DBC8.1010007@behnel.de> Message-ID: On 11 April 2011 12:45, Stefan Behnel wrote: > mark florisson, 11.04.2011 12:26: >> >> Can we select tests in the tests directory selectively? I see the -T >> or --ticket option, but it doens't seem to find the test tagged with # >> ticket:. >> >> I can select unit tests using python runtests.py >> Cython.SubPackage.Tests.SomeTest, but I can't seem to do the same >> thing for tests in the tests directory. Running the entire suite takes >> rather long. > > You can still select them by name using a regex, e.g. > > ? runtests.py 'run\.empty_builtin_constructors' > > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > Great, thanks! I'll update the hackerguide wiki. From markflorisson88 at gmail.com Mon Apr 11 12:56:13 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 11 Apr 2011 12:56:13 +0200 Subject: [Cython] Test runner In-Reply-To: References: <4DA2DBC8.1010007@behnel.de> Message-ID: On 11 April 2011 12:53, mark florisson wrote: > On 11 April 2011 12:45, Stefan Behnel wrote: >> mark florisson, 11.04.2011 12:26: >>> >>> Can we select tests in the tests directory selectively? I see the -T >>> or --ticket option, but it doens't seem to find the test tagged with # >>> ticket:. >>> >>> I can select unit tests using python runtests.py >>> Cython.SubPackage.Tests.SomeTest, but I can't seem to do the same >>> thing for tests in the tests directory. Running the entire suite takes >>> rather long. >> >> You can still select them by name using a regex, e.g. >> >> ? 
runtests.py 'run\.empty_builtin_constructors' >> >> Stefan >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > > Great, thanks! I'll update the hackerguide wiki. > I see now that it is briefly mentioned there, apologies. From d.s.seljebotn at astro.uio.no Mon Apr 11 13:02:10 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 11 Apr 2011 13:02:10 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA2C59C.8080703@astro.uio.no> <4DA2D30B.8040209@astro.uio.no> Message-ID: <4DA2DFB2.1030302@astro.uio.no> On 04/11/2011 12:14 PM, mark florisson wrote: > On 11 April 2011 12:08, Dag Sverre Seljebotn wrote: >> On 04/11/2011 11:41 AM, mark florisson wrote: >>> >>> On 11 April 2011 11:10, Dag Sverre Seljebotn >>> wrote: >>>> >>>> On 04/11/2011 10:45 AM, mark florisson wrote: >>>>> >>>>> On 5 April 2011 22:29, Dag Sverre Seljebotn >>>>> wrote: >>>>>> >>>>>> I've done a pretty major revision to the prange CEP, bringing in a lot >>>>>> of >>>>>> the feedback. >>>>>> >>>>>> Thread-private variables are now split in two cases: >>>>>> >>>>>> i) The safe cases, which really require very little technical >>>>>> knowledge >>>>>> -> >>>>>> automatically inferred >>>>>> >>>>>> ii) As an advanced feature, unsafe cases that requires some knowledge >>>>>> of >>>>>> threading -> must be explicitly declared >>>>>> >>>>>> I think this split simplifies things a great deal. >>>>> >>>>> Can't we obsolete the declaration entirely by assigning to variables >>>>> that need to have firstprivate behaviour inside the with parallel >>>>> block? Basically in the same way the scratch space is used. The only >>>>> problem with that is that it won't be lastprivate, so the value will >>>>> be undefined after the parallel block (but not after the worksharing >>>>> loop). >>>>> >>>>> cdef int myvariable >>>>> >>>>> with nogil, parallel: >>>>> myvariable = 2 >>>>> for i in prange(...): >>>>> use myvariable >>>>> maybe assign to myvariable >>>>> >>>>> # myvariable is well-defined here >>>>> >>>>> # myvariable is not well-defined here >>>>> >>>>> If you still desperately want lastprivate behaviour you can simply >>>>> assign myvariable to another variable in the loop body. >>>> >>>> I don't care about lastprivate, I don't think that is an issue, as you >>>> say. >>>> >>>> My problem with this is that it means going into an area where possibly >>>> tricky things are implicit rather than explicit. I also see this as a >>>> rather >>>> special case that will be seldomly used, and implicit behaviour is more >>>> difficult to justify because of that. >>> >>> Indeed, I actually considered if we should support firstprivate at >>> all, as it's really about "being firstprivate and lastprivate". >>> Without any declaration, you can have firstprivate or lastprivate, but >>> not both :) So I agree that supporting such a (probably) uncommon case >>> is better left explicit. On the other hand it seems silly to have >>> support for such a weird case. >> >> Well, I actually need to do the per-thread cache thing I described in the >> CEP in my own codes, so it's not *that* special; it'd be nice to support it. > > You need 'old_ell' and 'alpha' after the loop? No...but I need the values to not be blanked out at the beginning of each loop iteration! 
Note that in the CEP, the implicitly thread-local variables are *not available* before the first assignment in the loop. That is, code such as this is NOT allowed: cdef double x ... for i in prange(10): print x x = f(x) We raise a compiler error in such cases if we can: The code above is violating the contract that the order of execution of loop bodies should not matter. In cases where we can't raise an error (because we didn't bother or because it is not possible with a proof), we still initialize the variables to invalid values (NaN for double) at the beginning of the for-loop just to be sure the contract is satisfied. This was added to answer Stefan's objection to new types of implicit scopes (and I agree with his concern). Dag Sverre From d.s.seljebotn at astro.uio.no Mon Apr 11 13:03:37 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 11 Apr 2011 13:03:37 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4DA2DFB2.1030302@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA2C59C.8080703@astro.uio.no> <4DA2D30B.8040209@astro.uio.no> <4DA2DFB2.1030302@astro.uio.no> Message-ID: <4DA2E009.2080704@astro.uio.no> On 04/11/2011 01:02 PM, Dag Sverre Seljebotn wrote: > On 04/11/2011 12:14 PM, mark florisson wrote: >> On 11 April 2011 12:08, Dag Sverre >> Seljebotn wrote: >>> On 04/11/2011 11:41 AM, mark florisson wrote: >>>> >>>> On 11 April 2011 11:10, Dag Sverre >>>> Seljebotn >>>> wrote: >>>>> >>>>> On 04/11/2011 10:45 AM, mark florisson wrote: >>>>>> >>>>>> On 5 April 2011 22:29, Dag Sverre >>>>>> Seljebotn >>>>>> wrote: >>>>>>> >>>>>>> I've done a pretty major revision to the prange CEP, bringing in >>>>>>> a lot >>>>>>> of >>>>>>> the feedback. >>>>>>> >>>>>>> Thread-private variables are now split in two cases: >>>>>>> >>>>>>> i) The safe cases, which really require very little technical >>>>>>> knowledge >>>>>>> -> >>>>>>> automatically inferred >>>>>>> >>>>>>> ii) As an advanced feature, unsafe cases that requires some >>>>>>> knowledge >>>>>>> of >>>>>>> threading -> must be explicitly declared >>>>>>> >>>>>>> I think this split simplifies things a great deal. >>>>>> >>>>>> Can't we obsolete the declaration entirely by assigning to variables >>>>>> that need to have firstprivate behaviour inside the with parallel >>>>>> block? Basically in the same way the scratch space is used. The only >>>>>> problem with that is that it won't be lastprivate, so the value will >>>>>> be undefined after the parallel block (but not after the worksharing >>>>>> loop). >>>>>> >>>>>> cdef int myvariable >>>>>> >>>>>> with nogil, parallel: >>>>>> myvariable = 2 >>>>>> for i in prange(...): >>>>>> use myvariable >>>>>> maybe assign to myvariable >>>>>> >>>>>> # myvariable is well-defined here >>>>>> >>>>>> # myvariable is not well-defined here >>>>>> >>>>>> If you still desperately want lastprivate behaviour you can simply >>>>>> assign myvariable to another variable in the loop body. >>>>> >>>>> I don't care about lastprivate, I don't think that is an issue, as you >>>>> say. >>>>> >>>>> My problem with this is that it means going into an area where >>>>> possibly >>>>> tricky things are implicit rather than explicit. I also see this as a >>>>> rather >>>>> special case that will be seldomly used, and implicit behaviour is >>>>> more >>>>> difficult to justify because of that. >>>> >>>> Indeed, I actually considered if we should support firstprivate at >>>> all, as it's really about "being firstprivate and lastprivate". 
>>>> Without any declaration, you can have firstprivate or lastprivate, but >>>> not both :) So I agree that supporting such a (probably) uncommon case >>>> is better left explicit. On the other hand it seems silly to have >>>> support for such a weird case. >>> >>> Well, I actually need to do the per-thread cache thing I described in >>> the >>> CEP in my own codes, so it's not *that* special; it'd be nice to >>> support it. >> >> You need 'old_ell' and 'alpha' after the loop? > > > No...but I need the values to not be blanked out at the beginning of > each loop iteration! Sorry, I now realize that re-reading your email I may have misunderstood you. Anyway, no, I don't need lastprivate at all anywhere. Dag Sverre > > Note that in the CEP, the implicitly thread-local variables are *not > available* before the first assignment in the loop. That is, code such > as this is NOT allowed: > > cdef double x > ... > for i in prange(10): > print x > x = f(x) > > We raise a compiler error in such cases if we can: The code above is > violating the contract that the order of execution of loop bodies should > not matter. > > In cases where we can't raise an error (because we didn't bother or > because it is not possible with a proof), we still initialize the > variables to invalid values (NaN for double) at the beginning of the > for-loop just to be sure the contract is satisfied. > > This was added to answer Stefan's objection to new types of implicit > scopes (and I agree with his concern). > > Dag Sverre From markflorisson88 at gmail.com Mon Apr 11 13:12:27 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 11 Apr 2011 13:12:27 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4DA2E009.2080704@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA2C59C.8080703@astro.uio.no> <4DA2D30B.8040209@astro.uio.no> <4DA2DFB2.1030302@astro.uio.no> <4DA2E009.2080704@astro.uio.no> Message-ID: On 11 April 2011 13:03, Dag Sverre Seljebotn wrote: > On 04/11/2011 01:02 PM, Dag Sverre Seljebotn wrote: >> >> On 04/11/2011 12:14 PM, mark florisson wrote: >>> >>> On 11 April 2011 12:08, Dag Sverre >>> Seljebotn wrote: >>>> >>>> On 04/11/2011 11:41 AM, mark florisson wrote: >>>>> >>>>> On 11 April 2011 11:10, Dag Sverre >>>>> Seljebotn >>>>> wrote: >>>>>> >>>>>> On 04/11/2011 10:45 AM, mark florisson wrote: >>>>>>> >>>>>>> On 5 April 2011 22:29, Dag Sverre >>>>>>> Seljebotn >>>>>>> wrote: >>>>>>>> >>>>>>>> I've done a pretty major revision to the prange CEP, bringing in >>>>>>>> a lot >>>>>>>> of >>>>>>>> the feedback. >>>>>>>> >>>>>>>> Thread-private variables are now split in two cases: >>>>>>>> >>>>>>>> i) The safe cases, which really require very little technical >>>>>>>> knowledge >>>>>>>> -> >>>>>>>> automatically inferred >>>>>>>> >>>>>>>> ii) As an advanced feature, unsafe cases that requires some >>>>>>>> knowledge >>>>>>>> of >>>>>>>> threading -> must be explicitly declared >>>>>>>> >>>>>>>> I think this split simplifies things a great deal. >>>>>>> >>>>>>> Can't we obsolete the declaration entirely by assigning to variables >>>>>>> that need to have firstprivate behaviour inside the with parallel >>>>>>> block? Basically in the same way the scratch space is used. The only >>>>>>> problem with that is that it won't be lastprivate, so the value will >>>>>>> be undefined after the parallel block (but not after the worksharing >>>>>>> loop). 
>>>>>>> >>>>>>> cdef int myvariable >>>>>>> >>>>>>> with nogil, parallel: >>>>>>> myvariable = 2 >>>>>>> for i in prange(...): >>>>>>> use myvariable >>>>>>> maybe assign to myvariable >>>>>>> >>>>>>> # myvariable is well-defined here >>>>>>> >>>>>>> # myvariable is not well-defined here >>>>>>> >>>>>>> If you still desperately want lastprivate behaviour you can simply >>>>>>> assign myvariable to another variable in the loop body. >>>>>> >>>>>> I don't care about lastprivate, I don't think that is an issue, as you >>>>>> say. >>>>>> >>>>>> My problem with this is that it means going into an area where >>>>>> possibly >>>>>> tricky things are implicit rather than explicit. I also see this as a >>>>>> rather >>>>>> special case that will be seldomly used, and implicit behaviour is >>>>>> more >>>>>> difficult to justify because of that. >>>>> >>>>> Indeed, I actually considered if we should support firstprivate at >>>>> all, as it's really about "being firstprivate and lastprivate". >>>>> Without any declaration, you can have firstprivate or lastprivate, but >>>>> not both :) So I agree that supporting such a (probably) uncommon case >>>>> is better left explicit. On the other hand it seems silly to have >>>>> support for such a weird case. >>>> >>>> Well, I actually need to do the per-thread cache thing I described in >>>> the >>>> CEP in my own codes, so it's not *that* special; it'd be nice to >>>> support it. >>> >>> You need 'old_ell' and 'alpha' after the loop? >> >> >> No...but I need the values to not be blanked out at the beginning of >> each loop iteration! > > Sorry, I now realize that re-reading your email I may have misunderstood > you. Anyway, no, I don't need lastprivate at all anywhere. Right, so basically you can rewrite your example by introducing the parallel block (which doesn't add an indentation level as you're already using nogil) and assigning to your variables that need to be firstprivate there. The only thing you miss out on is lastprivate behaviour. So basically, the question is, do we want explicit syntax for such a rare case (firstprivate + lastprivate)? I must say, I found your previous argument of future shared declarations persuasive enough to introduce explicit syntax. > Dag Sverre > >> >> Note that in the CEP, the implicitly thread-local variables are *not >> available* before the first assignment in the loop. That is, code such >> as this is NOT allowed: >> >> cdef double x >> ... >> for i in prange(10): >> print x >> x = f(x) >> >> We raise a compiler error in such cases if we can: The code above is >> violating the contract that the order of execution of loop bodies should >> not matter. >> >> In cases where we can't raise an error (because we didn't bother or >> because it is not possible with a proof), we still initialize the >> variables to invalid values (NaN for double) at the beginning of the >> for-loop just to be sure the contract is satisfied. >> >> This was added to answer Stefan's objection to new types of implicit >> scopes (and I agree with his concern). 
>> >> Dag Sverre > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From d.s.seljebotn at astro.uio.no Mon Apr 11 13:26:19 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 11 Apr 2011 13:26:19 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA2C59C.8080703@astro.uio.no> <4DA2D30B.8040209@astro.uio.no> <4DA2DFB2.1030302@astro.uio.no> <4DA2E009.2080704@astro.uio.no> Message-ID: <4DA2E55B.5060702@astro.uio.no> On 04/11/2011 01:12 PM, mark florisson wrote: > On 11 April 2011 13:03, Dag Sverre Seljebotn wrote: >> On 04/11/2011 01:02 PM, Dag Sverre Seljebotn wrote: >>> >>> On 04/11/2011 12:14 PM, mark florisson wrote: >>>> >>>> On 11 April 2011 12:08, Dag Sverre >>>> Seljebotn wrote: >>>>> >>>>> On 04/11/2011 11:41 AM, mark florisson wrote: >>>>>> >>>>>> On 11 April 2011 11:10, Dag Sverre >>>>>> Seljebotn >>>>>> wrote: >>>>>>> >>>>>>> On 04/11/2011 10:45 AM, mark florisson wrote: >>>>>>>> >>>>>>>> On 5 April 2011 22:29, Dag Sverre >>>>>>>> Seljebotn >>>>>>>> wrote: >>>>>>>>> >>>>>>>>> I've done a pretty major revision to the prange CEP, bringing in >>>>>>>>> a lot >>>>>>>>> of >>>>>>>>> the feedback. >>>>>>>>> >>>>>>>>> Thread-private variables are now split in two cases: >>>>>>>>> >>>>>>>>> i) The safe cases, which really require very little technical >>>>>>>>> knowledge >>>>>>>>> -> >>>>>>>>> automatically inferred >>>>>>>>> >>>>>>>>> ii) As an advanced feature, unsafe cases that requires some >>>>>>>>> knowledge >>>>>>>>> of >>>>>>>>> threading -> must be explicitly declared >>>>>>>>> >>>>>>>>> I think this split simplifies things a great deal. >>>>>>>> >>>>>>>> Can't we obsolete the declaration entirely by assigning to variables >>>>>>>> that need to have firstprivate behaviour inside the with parallel >>>>>>>> block? Basically in the same way the scratch space is used. The only >>>>>>>> problem with that is that it won't be lastprivate, so the value will >>>>>>>> be undefined after the parallel block (but not after the worksharing >>>>>>>> loop). >>>>>>>> >>>>>>>> cdef int myvariable >>>>>>>> >>>>>>>> with nogil, parallel: >>>>>>>> myvariable = 2 >>>>>>>> for i in prange(...): >>>>>>>> use myvariable >>>>>>>> maybe assign to myvariable >>>>>>>> >>>>>>>> # myvariable is well-defined here >>>>>>>> >>>>>>>> # myvariable is not well-defined here >>>>>>>> >>>>>>>> If you still desperately want lastprivate behaviour you can simply >>>>>>>> assign myvariable to another variable in the loop body. >>>>>>> >>>>>>> I don't care about lastprivate, I don't think that is an issue, as you >>>>>>> say. >>>>>>> >>>>>>> My problem with this is that it means going into an area where >>>>>>> possibly >>>>>>> tricky things are implicit rather than explicit. I also see this as a >>>>>>> rather >>>>>>> special case that will be seldomly used, and implicit behaviour is >>>>>>> more >>>>>>> difficult to justify because of that. >>>>>> >>>>>> Indeed, I actually considered if we should support firstprivate at >>>>>> all, as it's really about "being firstprivate and lastprivate". >>>>>> Without any declaration, you can have firstprivate or lastprivate, but >>>>>> not both :) So I agree that supporting such a (probably) uncommon case >>>>>> is better left explicit. On the other hand it seems silly to have >>>>>> support for such a weird case. 
>>>>> >>>>> Well, I actually need to do the per-thread cache thing I described in >>>>> the >>>>> CEP in my own codes, so it's not *that* special; it'd be nice to >>>>> support it. >>>> >>>> You need 'old_ell' and 'alpha' after the loop? >>> >>> >>> No...but I need the values to not be blanked out at the beginning of >>> each loop iteration! >> >> Sorry, I now realize that re-reading your email I may have misunderstood >> you. Anyway, no, I don't need lastprivate at all anywhere. > > Right, so basically you can rewrite your example by introducing the > parallel block (which doesn't add an indentation level as you're > already using nogil) and assigning to your variables that need to be > firstprivate there. The only thing you miss out on is lastprivate > behaviour. So basically, the question is, do we want explicit syntax > for such a rare case (firstprivate + lastprivate)? OK, we're on the same page here. > I must say, I found your previous argument of future shared > declarations persuasive enough to introduce explicit syntax. OK, lets leave it at this then, we don't have to agree for the same reasons :-) Dag Sverre From stefan_ml at behnel.de Mon Apr 11 15:08:46 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 11 Apr 2011 15:08:46 +0200 Subject: [Cython] speed.pypy.org Message-ID: <4DA2FD5E.5090704@behnel.de> Hi, I'm currently discussing with Maciej Fijalkowski (PyPy) how to get Cython running on speed.pypy.org (that's what I wrote "cythonrun" for). If it works out well, we may have it up in a couple of days. I would expect that Cython won't be a big winner in this game, given that it will only compile plain untyped Python code. It's also going to fail entirely in some of the benchmarks. But I think it's worth having it up there, simply as a way for us to see where we are performance-wise and to get quick (nightly) feed-back about optimisations we try. The benchmark suite is also a nice set of real-world Python code that will allow us to find compliance issues. Stefan From vitja.makarov at gmail.com Mon Apr 11 15:14:17 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Mon, 11 Apr 2011 17:14:17 +0400 Subject: [Cython] speed.pypy.org In-Reply-To: <4DA2FD5E.5090704@behnel.de> References: <4DA2FD5E.5090704@behnel.de> Message-ID: 2011/4/11 Stefan Behnel : > Hi, > > I'm currently discussing with Maciej Fijalkowski (PyPy) how to get Cython > running on speed.pypy.org (that's what I wrote "cythonrun" for). If it works > out well, we may have it up in a couple of days. > > I would expect that Cython won't be a big winner in this game, given that it > will only compile plain untyped Python code. It's also going to fail > entirely in some of the benchmarks. But I think it's worth having it up > there, simply as a way for us to see where we are performance-wise and to > get quick (nightly) feed-back about optimisations we try. The benchmark > suite is also a nice set of real-world Python code that will allow us to > find compliance issues. > > Stefan Cool, that would be nice! -- vitja. From faltet at pytables.org Mon Apr 11 23:35:55 2011 From: faltet at pytables.org (Francesc Alted) Date: Mon, 11 Apr 2011 23:35:55 +0200 Subject: [Cython] "Cython's Users Guide" In-Reply-To: References: Message-ID: 2011/4/11 William Stein > Hi, > > I'm teaching Cython in my Sage course yet again, and noticed that > again there are some very confusing aspects of the Cython > documentation organization, which could probably be improved by a few > simple changes. > > 1. 
At http://cython.org/ there is a big link in the middle of the > page labeled "Cython Users Guide" which goes to > http://docs.cython.org/. However, http://docs.cython.org/ is *not* > the users guide -- it is "Cython?s Documentation". In fact, the > Users Guide is Chapter 3 of the documentation. > > 2. Looking at http://docs.cython.org, we see that Chapter 2 is > "Tutorials". But then looking down to Chapter 3 we see that it is > "Cython Users Guide". Of course, that's what one is after having just > clicked a link called "Cython Users Guide". So we click on "Cython > Users Guide" again. > > 3. We arrive at a page that again has "Tutorial" as Chapter 2. For > some reason this makes me feel even more confused. > > Recommend changes: > > 1. Change the link on the main page from "Cython Users Guide" to > "Documentation" or put a direct link into the Users Guide, or have > two links. > > 2. At http://docs.cython.org/ rename the "Cython Users Guide" to > "Users Guide", since it is obviously the Cython Users Guide at this > point and "Cython documentation" is in the upper left of the page > everywhere. > > 3. Possibly rename the tutorial in chapter 2 of the users guide to > something like "First Steps" or "Basic Tutorial" or something. > Yeah, that's something that we discussed in the past workshop in Munich (BTW, many thanks for providing the means for making this happen!). The basic idea is to completely remove the Chapter 3 (Cython Users Guide) by moving its parts to either Chapter 2 (Tutorials), or either to Chapter 4 (Reference Guide). During the meeting we agreed that the doc repository should be moved (and has been moved indeed) into the source repo, so that modifications that affect to code and docs can be put in the same commit/branch. Also, the wiki has a lot of information that can be better consolidated and integrated into the User's Guide. In fact, I already started some job in this direction and created a couple of pull requests during the workshop (that they have been already integrated). I plan to continue this job, but unfortunately I'm pretty busy lately, so I don't think I can progress a lot in the next weeks, so if anybody is interested in joining the effort for improving Cython's documentation, she will be very welcome indeed! -- Francesc Alted -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan_ml at behnel.de Tue Apr 12 08:42:29 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 12 Apr 2011 08:42:29 +0200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: Message-ID: <4DA3F455.8070006@behnel.de> Arthur de Souza Ribeiro, 08.04.2011 02:43: > 2011/4/7 Robert Bradshaw >> What I'd like to see is an implementation of a single simple but not >> entirely trivial (e.g. not math) module, passing regression tests with >> comprable if not better speed than the current C version (though I >> think it'd probably make sense to start out with the Python version >> and optimize that). E.g. http://docs.python.org/library/json.html >> looks like a good candidate. That should only take 8 hours or so, >> maybe two days at most, given your background. I'm not expecting >> anything before the application deadline, but if you could whip >> something like this out in the next week to point to that would help >> your application out immensely. 
In fact, one of the Python >> foundation's requirements is that students submit a patch before being >> accepted, and this would knock out that requirement and give you a >> chance to prove yourself. Create an account on https://github.com and >> commit your code into a new repository there. > > I will start the implementation of json module right now. I created my > github account and as soon as I have code implemented I will send repository > link. Any news on this? We're currently discussing which of the Cython-related projects to mentor. It's likely that not all of our projects will get accepted, so if you could get us a good initial idea about your work, we'd have a stronger incentive to value yours over the others. Stefan From arthurdesribeiro at gmail.com Tue Apr 12 14:59:11 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Tue, 12 Apr 2011 09:59:11 -0300 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: <4DA3F455.8070006@behnel.de> References: <4DA3F455.8070006@behnel.de> Message-ID: Hi Stefan, yes, I'm working on this, in fact I'm trying to recompile json module (http://docs.python.org/library/json.html) adding some type definitions and cython things o get the code faster. I'm getting in trouble with some things too, I'm going to enumerate here so that, you could give me some tips about how to solve them. 1 - Compile package modules - json module is inside a package (files: __init__.py, decoder.py, encoder.py, decoder.py) is there a way to generate the cython modules just like its get generated by cython? 2 - Because I'm getting in trouble with issue #1, I'm running the tests manually, I go to %Python-dir%/Lib/tests/json_tests, get the files corresponding to the tests python make and run manually. 3 - To get the performance of the module, I'm thinking about to use the timeit function in the unit tests for the project. I think a good number of executions would be made and it would be possible to compare each time. 4 - I didn't create the .pxd files, some problems are happening, it tells methods are not defined, but, they are defined, I will try to investigate this better The code is in this repository: https://github.com/arthursribeiro/JSON-module your feedback would be very important, so that I could improve my skills to get more and more able to work sooner in the project. I think some things implemented in this rewriting process are going to be useful when doing this with C modules... Thank you very much. Best Regards. []s Arthur 2011/4/12 Stefan Behnel > Arthur de Souza Ribeiro, 08.04.2011 02:43: > >> 2011/4/7 Robert Bradshaw >> >>> What I'd like to see is an implementation of a single simple but not >>> >>> entirely trivial (e.g. not math) module, passing regression tests with >>> comprable if not better speed than the current C version (though I >>> think it'd probably make sense to start out with the Python version >>> and optimize that). E.g. http://docs.python.org/library/json.html >>> looks like a good candidate. That should only take 8 hours or so, >>> maybe two days at most, given your background. I'm not expecting >>> anything before the application deadline, but if you could whip >>> something like this out in the next week to point to that would help >>> your application out immensely. In fact, one of the Python >>> foundation's requirements is that students submit a patch before being >>> accepted, and this would knock out that requirement and give you a >>> chance to prove yourself. 
Create an account on https://github.com and >>> commit your code into a new repository there. >>> >> >> I will start the implementation of json module right now. I created my >> github account and as soon as I have code implemented I will send >> repository >> link. >> > > Any news on this? We're currently discussing which of the Cython-related > projects to mentor. It's likely that not all of our projects will get > accepted, so if you could get us a good initial idea about your work, we'd > have a stronger incentive to value yours over the others. > > Stefan > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robertwb at math.washington.edu Tue Apr 12 15:10:05 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 12 Apr 2011 06:10:05 -0700 Subject: [Cython] Test runner In-Reply-To: References: <4DA2DBC8.1010007@behnel.de> Message-ID: On Mon, Apr 11, 2011 at 3:56 AM, mark florisson wrote: > On 11 April 2011 12:53, mark florisson wrote: >> On 11 April 2011 12:45, Stefan Behnel wrote: >>> mark florisson, 11.04.2011 12:26: >>>> >>>> Can we select tests in the tests directory selectively? I see the -T >>>> or --ticket option, but it doens't seem to find the test tagged with # >>>> ticket:. >>>> >>>> I can select unit tests using python runtests.py >>>> Cython.SubPackage.Tests.SomeTest, but I can't seem to do the same >>>> thing for tests in the tests directory. Running the entire suite takes >>>> rather long. >>> >>> You can still select them by name using a regex, e.g. >>> >>> ? runtests.py 'run\.empty_builtin_constructors' >>> >>> Stefan >>> _______________________________________________ >>> cython-devel mailing list >>> cython-devel at python.org >>> http://mail.python.org/mailman/listinfo/cython-devel >>> >> >> Great, thanks! I'll update the hackerguide wiki. >> > I see now that it is briefly mentioned there, apologies. I've added a note there about tags as well, and fixed the -T to look at the ticket tag. Note that "mode:run" is the default, so you don't need to explicitly tag mode except for compile/error tests. - Robert From robertwb at math.washington.edu Tue Apr 12 15:25:14 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 12 Apr 2011 06:25:14 -0700 Subject: [Cython] cython-docs repository In-Reply-To: <4DA093BF.7050306@creativetrax.com> References: <4DA047DF.9040000@creativetrax.com> <4DA093BF.7050306@creativetrax.com> Message-ID: On Sat, Apr 9, 2011 at 10:13 AM, Jason Grout wrote: > On 4/9/11 12:02 PM, Robert Bradshaw wrote: >> >> Yep, we did that during the workshop. I thought I had sent out an >> announcement, but I guess not. > > Is there a summary anywhere of the exciting things that happened in the > workshop? Not yet, but I'll post as soon as I have a writeup. > For example, it seems that generators are finally in, if I read > the commit logs correctly. ?Is that true? ?If so, fantastic! Yep! > Any idea of a timeline for that to make it into an official release? There's still a some fallout and more testing to do, but hopefully it won't be too long before a release. The biggest step back seems to be the disabling of inline generators, now they're full-fledged generators, which is an optimization regression but not a feature regression. 
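Concretely, a plain generator function now compiles and runs as a real generator, e.g. (just an illustration, not a test case):

    def fib():
        a, b = 0, 1
        while True:
            yield a
            a, b = b, a + b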
- Robert From robertwb at math.washington.edu Tue Apr 12 15:42:39 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 12 Apr 2011 06:42:39 -0700 Subject: [Cython] "Cython's Users Guide" In-Reply-To: References: Message-ID: On Mon, Apr 11, 2011 at 2:35 PM, Francesc Alted wrote: > 2011/4/11 William Stein >> >> Hi, >> >> I'm teaching Cython in my Sage course yet again, and noticed that >> again there are some very confusing aspects of the Cython >> documentation organization, which could probably be improved by a few >> simple changes. >> >> ?1. At http://cython.org/ there is a big link in the middle of the >> page labeled "Cython Users Guide" which goes to >> http://docs.cython.org/. ? However, http://docs.cython.org/ is *not* >> the users guide -- it is "Cython?s Documentation". ? ? In fact, the >> Users Guide is Chapter 3 of the documentation. >> >> ?2. Looking at http://docs.cython.org, we see that Chapter 2 is >> "Tutorials". ?But then looking down to Chapter 3 we see that it is >> "Cython Users Guide". ?Of course, that's what one is after having just >> clicked a link called "Cython Users Guide". ?So we click on "Cython >> Users Guide" again. >> >> ?3. We arrive at a page that again has "Tutorial" as Chapter 2. ? For >> some reason this makes me feel even more confused. >> >> Recommend changes: >> >> ?1. Change the link on the main page from "Cython Users Guide" to >> "Documentation" ?or put a direct link into the Users Guide, or have >> two links. >> >> ?2. At http://docs.cython.org/ rename the "Cython Users Guide" to >> "Users Guide", since it is obviously the Cython Users Guide at this >> point and "Cython documentation" is in the upper left of the page >> everywhere. >> >> ?3. Possibly rename the tutorial in chapter 2 of the users guide to >> something like "First Steps" or "Basic Tutorial" or something. Thanks for the suggestions. Done. > Yeah, that's something that we discussed in the past workshop in Munich > (BTW, many thanks for providing the means for making this happen!). ?The > basic idea is to completely remove the Chapter 3 (Cython Users Guide) by > moving its parts to either Chapter 2 (Tutorials), or either to?Chapter 4 > (Reference Guide). ?During the meeting we agreed that the doc repository > should be moved (and has been moved indeed) into the source repo, so that > modifications that affect to code and docs can be put in the same > commit/branch. Also, the wiki has a lot of information that can be better > consolidated and integrated into the User's Guide. > In fact, I already started some job in this direction and created a couple > of pull requests during the workshop (that they have been already > integrated). ?I plan to continue this job, but unfortunately I'm pretty busy > lately, so I don't think I can progress a lot in the next weeks, so if > anybody is interested in joining the effort for improving Cython's > documentation, she will be very welcome indeed! +1, and thanks, Fransesc, for the work you've started in moving us in this direction. - Robert From chris.lasher at gmail.com Tue Apr 12 16:53:40 2011 From: chris.lasher at gmail.com (Chris Lasher) Date: Tue, 12 Apr 2011 10:53:40 -0400 Subject: [Cython] Code examples missing in Cython User's Guide Message-ID: My apologies for cross-posting this from cython-users, but I realized I should have sent this bug report to cython-devel. Several code examples are missing from the User's Guide due to the source code files being moved or deleted. 
See for example the Tutorial page on the User's Guide http://docs.cython.org/src/userguide/tutorial.html The code for both fib.pyx and primes.pyx (and their setup.py files) is absent from the document. I looked at the ReST files and they are trying to source the files from an "examples" directory. Looking through the git repository, I wasn't able to locate this directory. Has it been moved or deleted? -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla at molden.no Tue Apr 12 17:33:49 2011 From: sturla at molden.no (Sturla Molden) Date: Tue, 12 Apr 2011 17:33:49 +0200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: <4DA3F455.8070006@behnel.de> Message-ID: <4DA470DD.3050601@molden.no> Den 12.04.2011 14:59, skrev Arthur de Souza Ribeiro: > > 1 - Compile package modules - json module is inside a package (files: > __init__.py, decoder.py, encoder.py, decoder.py) is there a way to > generate the cython modules just like its get generated by cython? > I'll propose these 10 guidelines: 1. The major concern is to replace the manual use of Python C API with Cython. We aim to improve correctness and readability, not speed. 2. Replacing plain C with Cython for readability is less important, sometimes even discourged. If you do, it's ok to leverage on Python container types if it makes the code concise and readable, even if it will sacrifice some speed. 3. Use exceptions instead of C style error checks: It's better to ask forgiveness than permission. 4. Use exceptions correctly. All resourse C allocation belongs in __cinit__. All C resource deallocation belongs in __dealloc__. Remember that exceptions can cause resource leaks if you don't. Wrap all resource allocation in an extension type. Never use functions like malloc or fopen directly in your Cython code, except in a __cinit__ method. 5. We should keep as much of the code in Python as we can. Replacing Python with Cython for speed is less important. Only the parts that will really benefit from static typing should be changed to Cython. 6. Leave the __init__.py file as it is. A Python package is allowed contain a mix of Python source files and Cython extension libraries. 7. Be careful to release the GIL whenever appropriate, and never release it otherwise. Don't yield the GIL just because you can, it does not come for free, even with a single thread. 8. Use the Python and C standard libraries whenever you can. Don't re-invent the wheel. Don't use system dependent APIs when the standard libraries declare a common interface. Callbacks to Python are ok. 9. Write code that will work correctly on 32 and 64 bit systems, big- or little-endian. Know your C: Py_intptr_t can contain a pointer. Py_ssize_t can represent the largest array size allowed. Py_intptr_t and Py_ssize_t can have different size. The native array offset can be different from Py_ssize_t, for which a common example is AMD64. 10. Don't clutter the namespace, use pxd includes. Short source files are preferred to long. Simple is better than complex. Keep the source nice and tidy. Sturla From arthurdesribeiro at gmail.com Tue Apr 12 19:39:51 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Tue, 12 Apr 2011 14:39:51 -0300 Subject: [Cython] Code examples missing in Cython User's Guide In-Reply-To: References: Message-ID: Hey Chris, the code for primes and fib examples, are in the directory 'Demos' of the repository... Best Regards. 
[]s Arthur 2011/4/12 Chris Lasher > My apologies for cross-posting this from cython-users, but I realized I > should have sent this bug report to cython-devel. > > Several code examples are missing from the User's Guide due to the source > code files being moved or deleted. See for example the Tutorial page on the > User's Guide http://docs.cython.org/src/userguide/tutorial.html The code > for both fib.pyx and primes.pyx (and their setup.py files) is absent from > the document. > > I looked at the ReST files and they are trying to source the files from an > "examples" directory. Looking through the git repository, I wasn't able to > locate this directory. Has it been moved or deleted? > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan_ml at behnel.de Tue Apr 12 20:22:05 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 12 Apr 2011 20:22:05 +0200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: <4DA3F455.8070006@behnel.de> Message-ID: <4DA4984D.3020007@behnel.de> Arthur de Souza Ribeiro, 12.04.2011 14:59: > Hi Stefan, yes, I'm working on this, in fact I'm trying to recompile json > module (http://docs.python.org/library/json.html) adding some type > definitions and cython things o get the code faster. Cool. > I'm getting in trouble with some things too, I'm going to enumerate here so > that, you could give me some tips about how to solve them. > > 1 - Compile package modules - json module is inside a package (files: > __init__.py, decoder.py, encoder.py, decoder.py) is there a way to generate > the cython modules just like its get generated by cython? The __init__.py doesn't really look performance critical. It's better to leave that modules in plain Python, that improves readability by reducing surprises and simplifies reuse by other implementations. That being said, you can compile each module separately, just use the "cython" command line tool for that, or write a little distutils script as in http://docs.cython.org/src/quickstart/build.html#building-a-cython-module-using-distutils Don't worry too much about a build integration for now. > 2 - Because I'm getting in trouble with issue #1, I'm running the tests > manually, I go to %Python-dir%/Lib/tests/json_tests, get the files > corresponding to the tests python make and run manually. That's fine. > 3 - To get the performance of the module, I'm thinking about to use the > timeit function in the unit tests for the project. I think a good number of > executions would be made and it would be possible to compare each time. That's ok for a start, artificial benchmarks are good to test specific functionality. However, unit tests tend to be short running with a lot of overhead, so later on, you will need to use real code to benchmark the modules. I would expect that there are benchmarks for JSON implementations around, and you can just generate a large JSON file and run loads and dumps on it. 
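A rough benchmark along those lines can be as simple as the following sketch; the json_cy name for the compiled package and the generated document are made up for illustration:

    import json, timeit

    # build a reasonably large document once, outside the timed code
    data = {"values": list(range(10000)),
            "items": [{"id": i, "name": "item%d" % i} for i in range(1000)]}
    text = json.dumps(data)

    def bench(mod, repeat=20):
        dumps = timeit.timeit(lambda: mod.dumps(data), number=repeat)
        loads = timeit.timeit(lambda: mod.loads(text), number=repeat)
        return dumps, loads

    print("stdlib json: dumps=%.3fs loads=%.3fs" % bench(json))
    # import json_cy   # hypothetical name of the Cython-compiled version
    # print("cython json: dumps=%.3fs loads=%.3fs" % bench(json_cy))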
> 4 - I didn't create the .pxd files, some problems are happening, it tells > methods are not defined, but, they are defined, I will try to investigate > this better When reporting usage related problems (preferably on the cython-users mailing list), it's best to present the exact error messages and the relevant code snippets, so that others can quickly understand what's going on and/or reproduce the problem. > The code is in this repository: > https://github.com/arthursribeiro/JSON-module your feedback would be very > important, so that I could improve my skills to get more and more able to > work sooner in the project. I'd strongly suggest implementing this in pure Python (.py files instead of .pyx files), with externally provided static types for performance. A single code base is very advantageous for a large project like CPython, much more than the ultimate 5% better performance. > I think some things implemented in this rewriting process are going to be > useful when doing this with C modules... Well, if you can get the existing Python implementation up to mostly comparable speed as the C implementation, then there is no need to care about the C module anymore. Even if you can get only 90% of a module to run at comparable speed, and need to keep 10% in plain C, that's already a huge improvement in terms of maintainability. Stefan From robertwb at math.washington.edu Tue Apr 12 22:42:02 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 12 Apr 2011 13:42:02 -0700 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: <4DA4984D.3020007@behnel.de> References: <4DA3F455.8070006@behnel.de> <4DA4984D.3020007@behnel.de> Message-ID: On Tue, Apr 12, 2011 at 11:22 AM, Stefan Behnel wrote: > Arthur de Souza Ribeiro, 12.04.2011 14:59: >> >> Hi Stefan, yes, I'm working on this, in fact I'm trying to recompile json >> module (http://docs.python.org/library/json.html) adding some type >> definitions and cython things o get the code faster. > > Cool. > > >> I'm getting in trouble with some things too, I'm going to enumerate here >> so >> that, you could give me some tips about how to solve them. >> >> 1 - Compile package modules - json module is inside a package (files: >> __init__.py, decoder.py, encoder.py, decoder.py) is there a way to >> generate >> the cython modules just like its get generated by cython? > > The __init__.py doesn't really look performance critical. It's better to > leave that modules in plain Python, that improves readability by reducing > surprises and simplifies reuse by other implementations. > > That being said, you can compile each module separately, just use the > "cython" command line tool for that, or write a little distutils script as > in > > http://docs.cython.org/src/quickstart/build.html#building-a-cython-module-using-distutils > > Don't worry too much about a build integration for now. > > >> 2 - Because I'm getting in trouble with issue #1, I'm running the tests >> manually, I go to %Python-dir%/Lib/tests/json_tests, get the files >> corresponding to the tests python make and run manually. > > That's fine. > > >> 3 - To get the performance of the module, I'm thinking about to use the >> timeit function in ?the unit tests for the project. I think a good number >> of >> executions would be made and it would be possible to compare each time. > > That's ok for a start, artificial benchmarks are good to test specific > functionality. 
However, unit tests tend to be short running with a lot of > overhead, so later on, you will need to use real code to benchmark the > modules. I would expect that there are benchmarks for JSON implementations > around, and you can just generate a large JSON file and run loads and dumps > on it. > > >> 4 - I didn't create the .pxd files, some problems are happening, it tells >> methods are not defined, but, they are defined, I will try to investigate >> this better > > When reporting usage related problems (preferably on the cython-users > mailing list), it's best to present the exact error messages and the > relevant code snippets, so that others can quickly understand what's going > on and/or reproduce the problem. > > >> The code is in this repository: >> https://github.com/arthursribeiro/JSON-module your feedback would be very >> important, so that I could improve my skills to get more and more able to >> work sooner in the project. > > I'd strongly suggest implementing this in pure Python (.py files instead of > .pyx files), with externally provided static types for performance. A single > code base is very advantageous for a large project like CPython, much more > than the ultimate 5% better performance. While this is advantageous for the final product, it may not be the easiest to get up and running with. >> I think some things implemented in this rewriting process are going to be >> useful when doing this with C modules... > > Well, if you can get the existing Python implementation up to mostly > comparable speed as the C implementation, then there is no need to care > about the C module anymore. Even if you can get only 90% of a module to run > at comparable speed, and need to keep 10% in plain C, that's already a huge > improvement in terms of maintainability. > > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From robertwb at math.washington.edu Tue Apr 12 22:55:01 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 12 Apr 2011 13:55:01 -0700 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: <4DA470DD.3050601@molden.no> References: <4DA3F455.8070006@behnel.de> <4DA470DD.3050601@molden.no> Message-ID: On Tue, Apr 12, 2011 at 8:33 AM, Sturla Molden wrote: > Den 12.04.2011 14:59, skrev Arthur de Souza Ribeiro: >> >> 1 - Compile package modules - json module is inside a package (files: >> __init__.py, decoder.py, encoder.py, decoder.py) is there a way to generate >> the cython modules just like its get generated by cython? >> > > > I'll propose these 10 guidelines: > > 1. The major concern is to replace the manual use of Python C API with > Cython. ?We aim to improve correctness and readability, not speed. Speed is a concern, otherwise many of these modules wouldn't have been written in C in the first place (at least, not the ones with a pure Python counterpart). Of course some of them are just wrapping C libraries where speed doesn't matter as much. > 2. Replacing plain C with Cython for readability is less important, > sometimes even discourged. Huh? I'd say this is a big point of the project. Maybe less so than manual dependance on the C API, but certainly not discouraged. > If you do, it's ok to leverage on Python > container types if it makes the code concise and readable, even if it will > sacrifice some speed. That's true. > 3. 
Use exceptions instead of C style error checks: It's better to ask > forgiveness than permission. Yep, this is natural in Cython. > 4. Use exceptions correctly. All resourse C allocation belongs in __cinit__. > All C resource deallocation belongs in __dealloc__. Remember that exceptions > can cause resource leaks if you don't. Wrap all resource allocation in an > extension type. Never use functions like malloc or fopen directly in your > Cython code, except in a __cinit__ method. This can be useful advice, but is not strictly necessary. Try..finally can fill this need as well. > 5. We should keep as much of the code in Python as we can. Replacing Python > with Cython for speed is less important. Only the parts that will really > benefit from static typing should be changed to Cython. True. Of course, compiling the (unchanged) pure Python files with Cython could also yield interesting results, but that's not part of the project. > 6. Leave the __init__.py file as it is. A Python package is allowed contain > a mix of Python source files and Cython extension libraries. > > 7. Be careful to release the GIL whenever appropriate, and never release it > otherwise. Don't yield the GIL just because you can, it does not come for > free, even with a single thread. > > 8. Use the Python and C standard libraries whenever you can. ?Don't > re-invent the wheel. Don't use system dependent APIs when the standard > libraries declare a common interface. Callbacks to Python are ok. > > 9. Write code that will work correctly on 32 and 64 bit systems, big- or > little-endian. Know your C: Py_intptr_t can contain a pointer. Py_ssize_t > can represent the largest array size allowed. Py_intptr_t and Py_ssize_t can > have different size. The native array offset can be different from > Py_ssize_t, for which a common example is AMD64. It's rare to have to do pointer arithmetic in Cython, and rarer still to have to store the pointer as an integer. > 10. Don't clutter the namespace, use pxd includes. Short source files are > preferred to long. Simple is better than complex. Keep the source nice and > tidy. Not sure what you mean by "pxd includes," but yes, you should use pxd files and cimport just as you would in Python to keep things manageable and modular. - Robert From markflorisson88 at gmail.com Wed Apr 13 13:07:54 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Wed, 13 Apr 2011 13:07:54 +0200 Subject: [Cython] Test runner In-Reply-To: References: <4DA2DBC8.1010007@behnel.de> Message-ID: Another, different but related issue: how can we get useful output from the test runner? e.g. I'm running my test with a '@cython.test_assert_path_exists("...")' and I get this error output: ====================================================================== ERROR: runTest (__main__.CythonRunTestCase) compiling (c) and running parallel ---------------------------------------------------------------------- Traceback (most recent call last): File "runtests.py", line 555, in run self.runCompileTest() File "runtests.py", line 386, in runCompileTest self.test_directory, self.expect_errors, self.annotate) File "runtests.py", line 532, in compile self.assertEquals(None, unexpected_error) AssertionError: None != u'9:0: Compiler crash in TreeAssertVisitor' So I'm seeing a traceback from the test runner (which I'm not really interested in :), but the actual traceback is not displayed. Can I also specify special link and compiler flags for a certain test, like in http://wiki.cython.org/enhancements/distutils_preprocessing ? 
Or do I have to export LDFLAGS and CFLAGS in my environment? From robertwb at math.washington.edu Wed Apr 13 15:17:32 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Wed, 13 Apr 2011 06:17:32 -0700 Subject: [Cython] Test runner In-Reply-To: References: <4DA2DBC8.1010007@behnel.de> Message-ID: On Wed, Apr 13, 2011 at 4:07 AM, mark florisson wrote: > Another, different but related issue: how can we get useful output > from the test runner? e.g. I'm running my test with a > '@cython.test_assert_path_exists("...")' and I get this error output: > > ====================================================================== > ERROR: runTest (__main__.CythonRunTestCase) > compiling (c) and running parallel > ---------------------------------------------------------------------- > Traceback (most recent call last): > ?File "runtests.py", line 555, in run > ? ?self.runCompileTest() > ?File "runtests.py", line 386, in runCompileTest > ? ?self.test_directory, self.expect_errors, self.annotate) > ?File "runtests.py", line 532, in compile > ? ?self.assertEquals(None, unexpected_error) > AssertionError: None != u'9:0: Compiler crash in TreeAssertVisitor' > > So I'm seeing a traceback from the test runner (which I'm not really > interested in :), but the actual traceback is not displayed. I agree this could be improved, but I'm not sure the best way to do it. > Can I also specify special link and compiler flags for a certain test, > like in http://wiki.cython.org/enhancements/distutils_preprocessing ? > Or do I have to export LDFLAGS and CFLAGS in my environment? You can't right now, but it would probably be worth adding. I'm not sure how we would handle missing dependancies in that case though. - Robert From stefan_ml at behnel.de Wed Apr 13 18:58:12 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 13 Apr 2011 18:58:12 +0200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: <4DA3F455.8070006@behnel.de> <4DA4984D.3020007@behnel.de> Message-ID: <4DA5D624.50609@behnel.de> Robert Bradshaw, 12.04.2011 22:42: > On Tue, Apr 12, 2011 at 11:22 AM, Stefan Behnel wrote: >> Arthur de Souza Ribeiro, 12.04.2011 14:59: >>> The code is in this repository: >>> https://github.com/arthursribeiro/JSON-module your feedback would be very >>> important, so that I could improve my skills to get more and more able to >>> work sooner in the project. >> >> I'd strongly suggest implementing this in pure Python (.py files instead of >> .pyx files), with externally provided static types for performance. A single >> code base is very advantageous for a large project like CPython, much more >> than the ultimate 5% better performance. > > While this is advantageous for the final product, it may not be the > easiest to get up and running with. Agreed. Arthur, it's fine if you write Cython code in a .pyx file to get started. You can just extract the declarations later. Stefan From markflorisson88 at gmail.com Wed Apr 13 21:31:46 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Wed, 13 Apr 2011 21:31:46 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4D9B7BA9.2060509@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> Message-ID: On 5 April 2011 22:29, Dag Sverre Seljebotn wrote: > I've done a pretty major revision to the prange CEP, bringing in a lot of > the feedback. 
> > Thread-private variables are now split in two cases: > > ?i) The safe cases, which really require very little technical knowledge -> > automatically inferred > > ?ii) As an advanced feature, unsafe cases that requires some knowledge of > threading -> must be explicitly declared > > I think this split simplifies things a great deal. > > I'm rather excited over this now; this could turn out to be a really > user-friendly and safe feature that would not only allow us to support > OpenMP-like threading, but be more convenient to use in a range of common > cases. > > http://wiki.cython.org/enhancements/prange > > Dag Sverre > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > > If we want to support cython.parallel.threadsavailable outside of parallel regions (which does not depend on the schedule used for worksharing constructs!), then we have to disable dynamic scheduling. For instance, if OpenMP sees some OpenMP threads are already busy, then with dynamic scheduling it dynamically establishes how many threads to use for any parallel region. So basically, if you put omp_get_num_threads() in a parallel region, you have a race when you depend on that result in a subsequent parallel region, because the number of busy OpenMP threads may have changed. So basically, to make threadsavailable() work outside parallel regions, we'd have to disable dynamic scheduling (omp_set_dynamic(0)). Of course, when OpenMP cannot request the amount of threads desired (because they are bounded by a configurable thread limit (and the OS of course)), the behaviour will be implementation defined. So then we could just put a warning in the docs for that, and users can check for this in the parallel region using threadsavailable() if it's really important. Does that sound like a good idea? And should I update the CEP? From d.s.seljebotn at astro.uio.no Wed Apr 13 21:57:07 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Wed, 13 Apr 2011 21:57:07 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> Message-ID: <4DA60013.6060901@astro.uio.no> On 04/13/2011 09:31 PM, mark florisson wrote: > On 5 April 2011 22:29, Dag Sverre Seljebotn wrote: >> I've done a pretty major revision to the prange CEP, bringing in a lot of >> the feedback. >> >> Thread-private variables are now split in two cases: >> >> i) The safe cases, which really require very little technical knowledge -> >> automatically inferred >> >> ii) As an advanced feature, unsafe cases that requires some knowledge of >> threading -> must be explicitly declared >> >> I think this split simplifies things a great deal. >> >> I'm rather excited over this now; this could turn out to be a really >> user-friendly and safe feature that would not only allow us to support >> OpenMP-like threading, but be more convenient to use in a range of common >> cases. >> >> http://wiki.cython.org/enhancements/prange >> >> Dag Sverre >> >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> >> > > If we want to support cython.parallel.threadsavailable outside of > parallel regions (which does not depend on the schedule used for > worksharing constructs!), then we have to disable dynamic scheduling. 
> For instance, if OpenMP sees some OpenMP threads are already busy, > then with dynamic scheduling it dynamically establishes how many > threads to use for any parallel region. > So basically, if you put omp_get_num_threads() in a parallel region, > you have a race when you depend on that result in a subsequent > parallel region, because the number of busy OpenMP threads may have > changed. Ah, I don't know why I thought there wouldn't be a race condition. I wonder if the whole threadsavailable() idea should just be ditched and that we should think of something else. It's not a very common usecase. Starting to disable some forms of scheduling just to, essentially, shoehorn in one particular syntax, doesn't seem like the way to go. Perhaps this calls for support for the critical(?) block then, after all. I'm at least +1 on dropping threadsavailable() and instead require that you call numthreads() in a critical block: with parallel: with critical: # call numthreads() and allocate global buffer # calling threadid() not allowed, if we can manage that # get buffer slice for each thread > So basically, to make threadsavailable() work outside parallel > regions, we'd have to disable dynamic scheduling (omp_set_dynamic(0)). > Of course, when OpenMP cannot request the amount of threads desired > (because they are bounded by a configurable thread limit (and the OS > of course)), the behaviour will be implementation defined. So then we > could just put a warning in the docs for that, and users can check for > this in the parallel region using threadsavailable() if it's really > important. Do you have any experience with what actually happen with, say, GNU OpenMP? I blindly assumed from the specs that it was an error condition ("flag an error any way you like"), but I guess that may be wrong. Just curious, I think we can just fall back to OpenMP behaviour; unless it terminates the interpreter in an error condition, in which case we should look into how expensive it is to check for the condition up front... Dag Sverre From markflorisson88 at gmail.com Wed Apr 13 22:53:03 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Wed, 13 Apr 2011 22:53:03 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4DA60013.6060901@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> Message-ID: On 13 April 2011 21:57, Dag Sverre Seljebotn wrote: > On 04/13/2011 09:31 PM, mark florisson wrote: >> >> On 5 April 2011 22:29, Dag Sverre Seljebotn >> ?wrote: >>> >>> I've done a pretty major revision to the prange CEP, bringing in a lot of >>> the feedback. >>> >>> Thread-private variables are now split in two cases: >>> >>> ?i) The safe cases, which really require very little technical knowledge >>> -> >>> automatically inferred >>> >>> ?ii) As an advanced feature, unsafe cases that requires some knowledge of >>> threading -> ?must be explicitly declared >>> >>> I think this split simplifies things a great deal. >>> >>> I'm rather excited over this now; this could turn out to be a really >>> user-friendly and safe feature that would not only allow us to support >>> OpenMP-like threading, but be more convenient to use in a range of common >>> cases. 
>>> >>> http://wiki.cython.org/enhancements/prange >>> >>> Dag Sverre >>> >>> _______________________________________________ >>> cython-devel mailing list >>> cython-devel at python.org >>> http://mail.python.org/mailman/listinfo/cython-devel >>> >>> >> >> If we want to support cython.parallel.threadsavailable outside of >> parallel regions (which does not depend on the schedule used for >> worksharing constructs!), then we have to disable dynamic scheduling. >> For instance, if OpenMP sees some OpenMP threads are already busy, >> then with dynamic scheduling it dynamically establishes how many >> threads to use for any parallel region. >> So basically, if you put omp_get_num_threads() in a parallel region, >> you have a race when you depend on that result in a subsequent >> parallel region, because the number of busy OpenMP threads may have >> changed. > > Ah, I don't know why I thought there wouldn't be a race condition. I wonder > if the whole threadsavailable() idea should just be ditched and that we > should think of something else. It's not a very common usecase. Starting to > disable some forms of scheduling just to, essentially, shoehorn in one > particular syntax, doesn't seem like the way to go. > > Perhaps this calls for support for the critical(?) block then, after all. > I'm at least +1 on dropping threadsavailable() and instead require that you > call numthreads() in a critical block: > > with parallel: > ? ?with critical: > ? ? ? ?# call numthreads() and allocate global buffer > ? ? ? ?# calling threadid() not allowed, if we can manage that > ? ?# get buffer slice for each thread In that case I think you'd want single + a barrier. 'critical' means that all threads execute the section, but exclusively. I think you usually want to allocate either a shared worksharing buffer, or a private thread-local buffer. In the former case you can allocate your buffer outside any parallel section, in the latter case within the parallel section. It the latter case the buffer will just not be available outside of the parallel section. We can still support any write-back to shared variables that are explicitly declared later on (supposing we'd also support single and barriers. Then the code would read as follows cdef shared(void *) buf cdef void *localbuf with nogil, parallel: with single: buf = malloc(n * numthreads()) barrier() localbuf = buf + n * threadid() # localbuf undefined here # buf is well-defined here However, I don't believe it's very common to want to use private buffers after the loop. If you have a buffer in terms of your loop size, you want it shared, but I can't imagine a case where you want to examine buffers that were allocated specifically for each thread after the parallel section. So I'm +1 on dropping threadsavailable outside parallel sections, but currently -1 on supporting this case, because we can solve it later on with support for explicitly declared variables + single + barriers. >> So basically, to make threadsavailable() work outside parallel >> regions, we'd have to disable dynamic scheduling (omp_set_dynamic(0)). >> Of course, when OpenMP cannot request the amount of threads desired >> (because they are bounded by a configurable thread limit (and the OS >> of course)), the behaviour will be implementation defined. So then we >> could just put a warning in the docs for that, and users can check for >> this in the parallel region using threadsavailable() if it's really >> important. > > Do you have any experience with what actually happen with, say, GNU OpenMP? 
> I blindly assumed from the specs that it was an error condition ("flag an > error any way you like"), but I guess that may be wrong. > > Just curious, I think we can just fall back to OpenMP behaviour; unless it > terminates the interpreter in an error condition, in which case we should > look into how expensive it is to check for the condition up front... With libgomp you just get the maximum amount of available threads, up to the number requested. So this code

    #include <stdio.h>
    #include <omp.h>

    int main(void) {
        printf("The thread limit is: %d\n", omp_get_thread_limit());
        #pragma omp parallel num_threads(4)
        {
            #pragma omp single
            printf("We have %d threads in the thread team\n", omp_get_num_threads());
        }
        return 0;
    }

requests 4 threads, but it gets only 2:

    [0] [22:28] ~/code/openmp $ OMP_THREAD_LIMIT=2 ./testomp
    The thread limit is: 2
    We have 2 threads in the thread team

> > Dag Sverre > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Wed Apr 13 23:13:56 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Wed, 13 Apr 2011 23:13:56 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> Message-ID: On 13 April 2011 22:53, mark florisson wrote: > On 13 April 2011 21:57, Dag Sverre Seljebotn wrote: >> On 04/13/2011 09:31 PM, mark florisson wrote: >>> >>> On 5 April 2011 22:29, Dag Sverre Seljebotn >>> wrote: >>>> >>>> I've done a pretty major revision to the prange CEP, bringing in a lot of >>>> the feedback. >>>> >>>> Thread-private variables are now split in two cases: >>>> >>>> i) The safe cases, which really require very little technical knowledge >>>> -> >>>> automatically inferred >>>> >>>> ii) As an advanced feature, unsafe cases that requires some knowledge of >>>> threading -> must be explicitly declared >>>> >>>> I think this split simplifies things a great deal. >>>> >>>> I'm rather excited over this now; this could turn out to be a really >>>> user-friendly and safe feature that would not only allow us to support >>>> OpenMP-like threading, but be more convenient to use in a range of common >>>> cases. >>>> >>>> http://wiki.cython.org/enhancements/prange >>>> >>>> Dag Sverre >>>> >>>> _______________________________________________ >>>> cython-devel mailing list >>>> cython-devel at python.org >>>> http://mail.python.org/mailman/listinfo/cython-devel >>>> >>>> >>> >>> If we want to support cython.parallel.threadsavailable outside of >>> parallel regions (which does not depend on the schedule used for >>> worksharing constructs!), then we have to disable dynamic scheduling. >>> For instance, if OpenMP sees some OpenMP threads are already busy, >>> then with dynamic scheduling it dynamically establishes how many >>> threads to use for any parallel region. >>> So basically, if you put omp_get_num_threads() in a parallel region, >>> you have a race when you depend on that result in a subsequent >>> parallel region, because the number of busy OpenMP threads may have >>> changed. >> >> Ah, I don't know why I thought there wouldn't be a race condition. I wonder >> if the whole threadsavailable() idea should just be ditched and that we >> should think of something else. It's not a very common usecase. Starting to >> disable some forms of scheduling just to, essentially, shoehorn in one >> particular syntax, doesn't seem like the way to go. 
>> >> Perhaps this calls for support for the critical(?) block then, after all. >> I'm at least +1 on dropping threadsavailable() and instead require that you >> call numthreads() in a critical block: >> >> with parallel: >> ? ?with critical: >> ? ? ? ?# call numthreads() and allocate global buffer >> ? ? ? ?# calling threadid() not allowed, if we can manage that >> ? ?# get buffer slice for each thread > > In that case I think you'd want single + a barrier. 'critical' means > that all threads execute the section, but exclusively. I think you > usually want to allocate either a shared worksharing buffer, or a > private thread-local buffer. In the former case you can allocate your > buffer outside any parallel section, in the latter case within the > parallel section. It the latter case the buffer will just not be > available outside of the parallel section. > > We can still support any write-back to shared variables that are > explicitly declared later on (supposing we'd also support single and > barriers. Then the code would read as follows > > cdef shared(void *) buf > cdef void *localbuf > > with nogil, parallel: > ? ?with single: > ? ? ? ?buf = malloc(n * numthreads()) > > ? ?barrier() > > ? ?localbuf = buf + n * threadid() > ? ? > > # localbuf undefined here > # buf is well-defined here > > However, I don't believe it's very common to want to use private > buffers after the loop. If you have a buffer in terms of your loop > size, you want it shared, but I can't imagine a case where you want to > examine buffers that were allocated specifically for each thread after > the parallel section. So I'm +1 on dropping threadsavailable outside > parallel sections, but currently -1 on supporting this case, because > we can solve it later on with support for explicitly declared > variables + single + barriers. > >>> So basically, to make threadsavailable() work outside parallel >>> regions, we'd have to disable dynamic scheduling (omp_set_dynamic(0)). >>> Of course, when OpenMP cannot request the amount of threads desired >>> (because they are bounded by a configurable thread limit (and the OS >>> of course)), the behaviour will be implementation defined. So then we >>> could just put a warning in the docs for that, and users can check for >>> this in the parallel region using threadsavailable() if it's really >>> important. >> >> Do you have any experience with what actually happen with, say, GNU OpenMP? >> I blindly assumed from the specs that it was an error condition ("flag an >> error any way you like"), but I guess that may be wrong. >> >> Just curious, I think we can just fall back to OpenMP behaviour; unless it >> terminates the interpreter in an error condition, in which case we should >> look into how expensive it is to check for the condition up front... > > With libgomp you just get the maximum amount of available threads, up > to the number requested. So this code > > ?1 #include > ?2 #include > ?3 > ?4 int main(void) { > ?5 ? ? printf("The thread limit is: %d\n", omp_get_thread_limit()); > ?6 ? ? #pragma omp parallel num_threads(4) > ?7 ? ? { > ?8 ? ? ? ? #pragma omp single > ?9 ? ? ? ? printf("We have %d threads in the thread team\n", > omp_get_num_threads()); > ?10 ? ? } > ?11 ? ? return 0; > ?12 } > > requests 4 threads, but it gets only 2: > > [0] [22:28] ~/code/openmp ?? 
OMP_THREAD_LIMIT=2 ./testomp > The thread limit is: 2 > We have 2 threads in the thread team > >> >> Dag Sverre >> >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > Although there is omp_get_max_threads(): "The omp_get_max_threads routine returns an upper bound on the number of threads that could be used to form a new team if a parallel region without a num_threads clause were encountered after execution returns from this routine." So we could have threadsvailable() evaluate to that if encountered outside a parallel region. Inside, it would evaluate to omp_get_num_threads(). At worst, people would over-allocate a bit. From vitja.makarov at gmail.com Thu Apr 14 09:27:56 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Thu, 14 Apr 2011 11:27:56 +0400 Subject: [Cython] Control flow graph In-Reply-To: References: <4D5A2D6E.9060406@behnel.de> <4D641C26.9070705@behnel.de> Message-ID: Can I use cython-generators project on hudson for control-flow tests? So I'll move cf branch development to my master branch. -- vitja. From stefan_ml at behnel.de Thu Apr 14 09:34:18 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Thu, 14 Apr 2011 09:34:18 +0200 Subject: [Cython] Control flow graph In-Reply-To: References: <4D5A2D6E.9060406@behnel.de> <4D641C26.9070705@behnel.de> Message-ID: <4DA6A37A.8020209@behnel.de> Vitja Makarov, 14.04.2011 09:27: > Can I use cython-generators project on hudson for control-flow tests? > So I'll move cf branch development to my master branch. Sure. I renamed the tab to "cython-vitek". It's your repo, use it as you see fit. Stefan From d.s.seljebotn at astro.uio.no Thu Apr 14 20:29:54 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 14 Apr 2011 20:29:54 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> Message-ID: <4DA73D22.9050605@astro.uio.no> On 04/13/2011 11:13 PM, mark florisson wrote: > > Although there is omp_get_max_threads(): > > "The omp_get_max_threads routine returns an upper bound on the number > of threads that could be used to form a new team if a parallel region > without a num_threads clause were encountered after execution returns > from this routine." > > So we could have threadsvailable() evaluate to that if encountered > outside a parallel region. Inside, it would evaluate to > omp_get_num_threads(). At worst, people would over-allocate a bit. Well, over-allocating could well mean 1 GB, which could well mean getting an unecesarry MemoryError (or, like in my case, if I'm not careful to set ulimit, getting a SIGKILL sent to you 2 minutes after the fact by the cluster patrol process...) But even ignoring this, we also have to plan for people misusing the feature. If we put it in there, somebody somewhere *will* write code like this: nthreads = threadsavailable() with parallel: for i in prange(nthreads): for j in range(100*i, 100*(i+1)): [...] (Yes, they shouldn't. Yes, they will.) Combined with a race condition that will only very seldomly trigger, this starts to sound like a very bad idea indeed. So I agree with you that we should just leave it for now, and do single/barrier later. 
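For contrast with the nthreads = threadsavailable() misuse above, the usage the CEP is actually aiming at leaves the chunking to the runtime entirely; a sketch using the draft syntax from the CEP (none of this was implemented at the time of the discussion):

    from cython.parallel import prange

    def sum_squares(int n):
        cdef int i
        cdef double s = 0
        with nogil:
            # prange decides how many threads to use and how to split the
            # iteration space; s only appears in an inplace add, so it is
            # inferred to be a thread-safe reduction variable
            for i in prange(n):
                s += i * i
        return s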
DS From markflorisson88 at gmail.com Thu Apr 14 20:39:56 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 14 Apr 2011 20:39:56 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4DA73D22.9050605@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> Message-ID: On 14 April 2011 20:29, Dag Sverre Seljebotn wrote: > On 04/13/2011 11:13 PM, mark florisson wrote: >> >> Although there is omp_get_max_threads(): >> >> "The omp_get_max_threads routine returns an upper bound on the number >> of threads that could be used to form a new team if a parallel region >> without a num_threads clause were encountered after execution returns >> from this routine." >> >> So we could have threadsvailable() evaluate to that if encountered >> outside a parallel region. Inside, it would evaluate to >> omp_get_num_threads(). At worst, people would over-allocate a bit. > > Well, over-allocating could well mean 1 GB, which could well mean getting an > unecesarry MemoryError (or, like in my case, if I'm not careful to set > ulimit, getting a SIGKILL sent to you 2 minutes after the fact by the > cluster patrol process...) The upper bound is not "however many threads you think you can start", but rather "how many threads are considered useful for your machine". So if you use omp_set_num_threads(), it will return the value you set there. Otherwise, if you have e.g. a quadcore, it will return 4. The spec says: "Note ? The return value of the omp_get_max_threads routine can be used to dynamically allocate sufficient storage for all threads in the team formed at the subsequent active parallel region." So this sounds like a viable option. > But even ignoring this, we also have to plan for people misusing the > feature. If we put it in there, somebody somewhere *will* write code like > this: > > nthreads = threadsavailable() > with parallel: > ? ?for i in prange(nthreads): > ? ? ? ?for j in range(100*i, 100*(i+1)): [...] > > (Yes, they shouldn't. Yes, they will.) > > Combined with a race condition that will only very seldomly trigger, this > starts to sound like a very bad idea indeed. > > So I agree with you that we should just leave it for now, and do > single/barrier later. > > DS > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Thu Apr 14 20:42:13 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 14 Apr 2011 20:42:13 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4DA73D22.9050605@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> Message-ID: On 14 April 2011 20:29, Dag Sverre Seljebotn wrote: > On 04/13/2011 11:13 PM, mark florisson wrote: >> >> Although there is omp_get_max_threads(): >> >> "The omp_get_max_threads routine returns an upper bound on the number >> of threads that could be used to form a new team if a parallel region >> without a num_threads clause were encountered after execution returns >> from this routine." >> >> So we could have threadsvailable() evaluate to that if encountered >> outside a parallel region. Inside, it would evaluate to >> omp_get_num_threads(). At worst, people would over-allocate a bit. 
> > Well, over-allocating could well mean 1 GB, which could well mean getting an > unecesarry MemoryError (or, like in my case, if I'm not careful to set > ulimit, getting a SIGKILL sent to you 2 minutes after the fact by the > cluster patrol process...) > > But even ignoring this, we also have to plan for people misusing the > feature. If we put it in there, somebody somewhere *will* write code like > this: > > nthreads = threadsavailable() > with parallel: > ? ?for i in prange(nthreads): > ? ? ? ?for j in range(100*i, 100*(i+1)): [...] > > (Yes, they shouldn't. Yes, they will.) > > Combined with a race condition that will only very seldomly trigger, this > starts to sound like a very bad idea indeed. > > So I agree with you that we should just leave it for now, and do > single/barrier later. omp_get_max_threads() doesn't have a race, as it returns the upper bound. So e.g. if between your call and your parallel section less OpenMP threads become available, then you might get less threads, but never more. > DS > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From d.s.seljebotn at astro.uio.no Thu Apr 14 20:55:41 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 14 Apr 2011 20:55:41 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> Message-ID: <4DA7432D.7080207@astro.uio.no> On 04/14/2011 08:39 PM, mark florisson wrote: > On 14 April 2011 20:29, Dag Sverre Seljebotn wrote: >> On 04/13/2011 11:13 PM, mark florisson wrote: >>> >>> Although there is omp_get_max_threads(): >>> >>> "The omp_get_max_threads routine returns an upper bound on the number >>> of threads that could be used to form a new team if a parallel region >>> without a num_threads clause were encountered after execution returns >>> from this routine." >>> >>> So we could have threadsvailable() evaluate to that if encountered >>> outside a parallel region. Inside, it would evaluate to >>> omp_get_num_threads(). At worst, people would over-allocate a bit. >> >> Well, over-allocating could well mean 1 GB, which could well mean getting an >> unecesarry MemoryError (or, like in my case, if I'm not careful to set >> ulimit, getting a SIGKILL sent to you 2 minutes after the fact by the >> cluster patrol process...) > > The upper bound is not "however many threads you think you can start", > but rather "how many threads are considered useful for your machine". > So if you use omp_set_num_threads(), it will return the value you set > there. Otherwise, if you have e.g. a quadcore, it will return 4. The > spec says: > > "Note ? The return value of the omp_get_max_threads routine can be > used to dynamically allocate sufficient storage for all threads in the > team formed at the subsequent active parallel region." > > So this sounds like a viable option. What would happen here: We have 8 cores. Some code has an OpenMP parallel section with maxthreads=2, and inside the section another function is called. That called function uses threadsavailable(), and has a parallel block that wants as many threads as it can get. I don't know the details as well as you do, but my uninformed guess is that in this case it'd be quite possible with a race where omp_get_max_threads would return 7 in each case, then the first one to the parallel would get the 7 threads. 
The remaining thread then has allocated storage for 7 threads but only has 1 thread running. BTW, I'm not sure what the difference is between the original idea and omp_get_max_threads -- in the absence of such races as above, my original idea with entering a parallel section (with the same scheduling parameters) just to see how many threads we got, would work as well? DS From d.s.seljebotn at astro.uio.no Thu Apr 14 20:58:55 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 14 Apr 2011 20:58:55 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> Message-ID: <4DA743EF.7080200@astro.uio.no> On 04/14/2011 08:42 PM, mark florisson wrote: > On 14 April 2011 20:29, Dag Sverre Seljebotn wrote: >> On 04/13/2011 11:13 PM, mark florisson wrote: >>> >>> Although there is omp_get_max_threads(): >>> >>> "The omp_get_max_threads routine returns an upper bound on the number >>> of threads that could be used to form a new team if a parallel region >>> without a num_threads clause were encountered after execution returns >>> from this routine." >>> >>> So we could have threadsvailable() evaluate to that if encountered >>> outside a parallel region. Inside, it would evaluate to >>> omp_get_num_threads(). At worst, people would over-allocate a bit. >> >> Well, over-allocating could well mean 1 GB, which could well mean getting an >> unecesarry MemoryError (or, like in my case, if I'm not careful to set >> ulimit, getting a SIGKILL sent to you 2 minutes after the fact by the >> cluster patrol process...) >> >> But even ignoring this, we also have to plan for people misusing the >> feature. If we put it in there, somebody somewhere *will* write code like >> this: >> >> nthreads = threadsavailable() >> with parallel: >> for i in prange(nthreads): >> for j in range(100*i, 100*(i+1)): [...] >> >> (Yes, they shouldn't. Yes, they will.) >> >> Combined with a race condition that will only very seldomly trigger, this >> starts to sound like a very bad idea indeed. >> >> So I agree with you that we should just leave it for now, and do >> single/barrier later. > > omp_get_max_threads() doesn't have a race, as it returns the upper > bound. So e.g. if between your call and your parallel section less > OpenMP threads become available, then you might get less threads, but > never more. Oh, now I'm following you. Well, my argument was that I think erroring in that direction is pretty bad as well. Also, even if we're not making it available in cython.parallel, we're not stopping people from calling omp_get_max_threads directly themselves, which should be OK for the people who know enough to do this safely... Dag Sverre From chris.lasher at gmail.com Thu Apr 14 21:05:07 2011 From: chris.lasher at gmail.com (Chris Lasher) Date: Thu, 14 Apr 2011 15:05:07 -0400 Subject: [Cython] Code examples missing in Cython User's Guide In-Reply-To: References: Message-ID: Thanks Arthur. I actually found the code examples in a grandparent directory of the User's Guide documentation source directory. I have submitted a pull request on GitHub which corrects the User's Guide Tutorial documentation so that it now includes the code. https://github.com/cython/cython/pull/24 Could a Cython dev please review and approve this change and then build and update the docs on the Cython website for the sake of other Cython newbies? Thanks, Chris L. 
On Tue, Apr 12, 2011 at 1:39 PM, Arthur de Souza Ribeiro < arthurdesribeiro at gmail.com> wrote: > Hey Chris, the code for primes and fib examples, are in the directory > 'Demos' of the repository... > > Best Regards. > > []s > > Arthur > > 2011/4/12 Chris Lasher > >> My apologies for cross-posting this from cython-users, but I realized I >> should have sent this bug report to cython-devel. >> >> Several code examples are missing from the User's Guide due to the source >> code files being moved or deleted. See for example the Tutorial page on the >> User's Guide http://docs.cython.org/src/userguide/tutorial.html The code >> for both fib.pyx and primes.pyx (and their setup.py files) is absent from >> the document. >> >> I looked at the ReST files and they are trying to source the files from an >> "examples" directory. Looking through the git repository, I wasn't able to >> locate this directory. Has it been moved or deleted? >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From markflorisson88 at gmail.com Thu Apr 14 21:08:39 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 14 Apr 2011 21:08:39 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4DA743EF.7080200@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> Message-ID: On 14 April 2011 20:58, Dag Sverre Seljebotn wrote: > On 04/14/2011 08:42 PM, mark florisson wrote: >> >> On 14 April 2011 20:29, Dag Sverre Seljebotn >> ?wrote: >>> >>> On 04/13/2011 11:13 PM, mark florisson wrote: >>>> >>>> Although there is omp_get_max_threads(): >>>> >>>> "The omp_get_max_threads routine returns an upper bound on the number >>>> of threads that could be used to form a new team if a parallel region >>>> without a num_threads clause were encountered after execution returns >>>> from this routine." >>>> >>>> So we could have threadsvailable() evaluate to that if encountered >>>> outside a parallel region. Inside, it would evaluate to >>>> omp_get_num_threads(). At worst, people would over-allocate a bit. >>> >>> Well, over-allocating could well mean 1 GB, which could well mean getting >>> an >>> unecesarry MemoryError (or, like in my case, if I'm not careful to set >>> ulimit, getting a SIGKILL sent to you 2 minutes after the fact by the >>> cluster patrol process...) >>> >>> But even ignoring this, we also have to plan for people misusing the >>> feature. If we put it in there, somebody somewhere *will* write code like >>> this: >>> >>> nthreads = threadsavailable() >>> with parallel: >>> ? ?for i in prange(nthreads): >>> ? ? ? ?for j in range(100*i, 100*(i+1)): [...] >>> >>> (Yes, they shouldn't. Yes, they will.) >>> >>> Combined with a race condition that will only very seldomly trigger, this >>> starts to sound like a very bad idea indeed. >>> >>> So I agree with you that we should just leave it for now, and do >>> single/barrier later. >> >> omp_get_max_threads() doesn't have a race, as it returns the upper >> bound. So e.g. if between your call and your parallel section less >> OpenMP threads become available, then you might get less threads, but >> never more. > > Oh, now I'm following you. > > Well, my argument was that I think erroring in that direction is pretty bad > as well. 
> > Also, even if we're not making it available in cython.parallel, we're not > stopping people from calling omp_get_max_threads directly themselves, which > should be OK for the people who know enough to do this safely... True, but it wouldn't be as easy to wrap in a #ifdef _OPENMP. In any event, we could just put a warning in the docs stating that using threadsavailable outside parallel sections returns an upper bound on the actual number of threads in a subsequent parallel section. > Dag Sverre > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From d.s.seljebotn at astro.uio.no Thu Apr 14 21:37:16 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 14 Apr 2011 21:37:16 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> Message-ID: <4DA74CEC.2000609@astro.uio.no> On 04/14/2011 09:08 PM, mark florisson wrote: > On 14 April 2011 20:58, Dag Sverre Seljebotn wrote: >> On 04/14/2011 08:42 PM, mark florisson wrote: >>> >>> On 14 April 2011 20:29, Dag Sverre Seljebotn >>> wrote: >>>> >>>> On 04/13/2011 11:13 PM, mark florisson wrote: >>>>> >>>>> Although there is omp_get_max_threads(): >>>>> >>>>> "The omp_get_max_threads routine returns an upper bound on the number >>>>> of threads that could be used to form a new team if a parallel region >>>>> without a num_threads clause were encountered after execution returns >>>>> from this routine." >>>>> >>>>> So we could have threadsvailable() evaluate to that if encountered >>>>> outside a parallel region. Inside, it would evaluate to >>>>> omp_get_num_threads(). At worst, people would over-allocate a bit. >>>> >>>> Well, over-allocating could well mean 1 GB, which could well mean getting >>>> an >>>> unecesarry MemoryError (or, like in my case, if I'm not careful to set >>>> ulimit, getting a SIGKILL sent to you 2 minutes after the fact by the >>>> cluster patrol process...) >>>> >>>> But even ignoring this, we also have to plan for people misusing the >>>> feature. If we put it in there, somebody somewhere *will* write code like >>>> this: >>>> >>>> nthreads = threadsavailable() >>>> with parallel: >>>> for i in prange(nthreads): >>>> for j in range(100*i, 100*(i+1)): [...] >>>> >>>> (Yes, they shouldn't. Yes, they will.) >>>> >>>> Combined with a race condition that will only very seldomly trigger, this >>>> starts to sound like a very bad idea indeed. >>>> >>>> So I agree with you that we should just leave it for now, and do >>>> single/barrier later. >>> >>> omp_get_max_threads() doesn't have a race, as it returns the upper >>> bound. So e.g. if between your call and your parallel section less >>> OpenMP threads become available, then you might get less threads, but >>> never more. >> >> Oh, now I'm following you. >> >> Well, my argument was that I think erroring in that direction is pretty bad >> as well. >> >> Also, even if we're not making it available in cython.parallel, we're not >> stopping people from calling omp_get_max_threads directly themselves, which >> should be OK for the people who know enough to do this safely... > > True, but it wouldn't be as easy to wrap in a #ifdef _OPENMP. 
In any > event, we could just put a warning in the docs stating that using > threadsavailable outside parallel sections returns an upper bound on > the actual number of threads in a subsequent parallel section. I don't think outside or within makes a difference -- what about nested parallel sections? At least my intention in the CEP was that threadsavailable was always for the next section (so often it would be 1 after entering the section). Perhaps just calling it "maxthreads" instead solves the issue. (Still, I favour just dropping threadsavailable/maxthreads for the time being. It is much simpler to add something later, when we've had some time to use it and reflect about it, than to remove something that shouldn't have been added.) Dag Sverre From markflorisson88 at gmail.com Thu Apr 14 21:58:52 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 14 Apr 2011 21:58:52 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4DA74CEC.2000609@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> Message-ID: On 14 April 2011 21:37, Dag Sverre Seljebotn wrote: > On 04/14/2011 09:08 PM, mark florisson wrote: >> >> On 14 April 2011 20:58, Dag Sverre Seljebotn >> ?wrote: >>> >>> On 04/14/2011 08:42 PM, mark florisson wrote: >>>> >>>> On 14 April 2011 20:29, Dag Sverre Seljebotn >>>> ?wrote: >>>>> >>>>> On 04/13/2011 11:13 PM, mark florisson wrote: >>>>>> >>>>>> Although there is omp_get_max_threads(): >>>>>> >>>>>> "The omp_get_max_threads routine returns an upper bound on the number >>>>>> of threads that could be used to form a new team if a parallel region >>>>>> without a num_threads clause were encountered after execution returns >>>>>> from this routine." >>>>>> >>>>>> So we could have threadsvailable() evaluate to that if encountered >>>>>> outside a parallel region. Inside, it would evaluate to >>>>>> omp_get_num_threads(). At worst, people would over-allocate a bit. >>>>> >>>>> Well, over-allocating could well mean 1 GB, which could well mean >>>>> getting >>>>> an >>>>> unecesarry MemoryError (or, like in my case, if I'm not careful to set >>>>> ulimit, getting a SIGKILL sent to you 2 minutes after the fact by the >>>>> cluster patrol process...) >>>>> >>>>> But even ignoring this, we also have to plan for people misusing the >>>>> feature. If we put it in there, somebody somewhere *will* write code >>>>> like >>>>> this: >>>>> >>>>> nthreads = threadsavailable() >>>>> with parallel: >>>>> ? ?for i in prange(nthreads): >>>>> ? ? ? ?for j in range(100*i, 100*(i+1)): [...] >>>>> >>>>> (Yes, they shouldn't. Yes, they will.) >>>>> >>>>> Combined with a race condition that will only very seldomly trigger, >>>>> this >>>>> starts to sound like a very bad idea indeed. >>>>> >>>>> So I agree with you that we should just leave it for now, and do >>>>> single/barrier later. >>>> >>>> omp_get_max_threads() doesn't have a race, as it returns the upper >>>> bound. So e.g. if between your call and your parallel section less >>>> OpenMP threads become available, then you might get less threads, but >>>> never more. >>> >>> Oh, now I'm following you. >>> >>> Well, my argument was that I think erroring in that direction is pretty >>> bad >>> as well. 
>>> >>> Also, even if we're not making it available in cython.parallel, we're not >>> stopping people from calling omp_get_max_threads directly themselves, >>> which >>> should be OK for the people who know enough to do this safely... >> >> True, but it wouldn't be as easy to wrap in a #ifdef _OPENMP. In any >> event, we could just put a warning in the docs stating that using >> threadsavailable outside parallel sections returns an upper bound on >> the actual number of threads in a subsequent parallel section. > > I don't think outside or within makes a difference -- what about nested > parallel sections? At least my intention in the CEP was that > threadsavailable was always for the next section (so often it would be 1 > after entering the section). > > Perhaps just calling it "maxthreads" instead solves the issue. > > (Still, I favour just dropping threadsavailable/maxthreads for the time > being. It is much simpler to add something later, when we've had some time > to use it and reflect about it, than to remove something that shouldn't have > been added.) > > Dag Sverre > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > Definitely true, I'll disable it for now. From arthurdesribeiro at gmail.com Fri Apr 15 04:31:09 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Thu, 14 Apr 2011 23:31:09 -0300 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: <4DA5D624.50609@behnel.de> References: <4DA3F455.8070006@behnel.de> <4DA4984D.3020007@behnel.de> <4DA5D624.50609@behnel.de> Message-ID: I've created the .pyx files and it passed in all python tests. To test them, as I said, I copied the .py test files to my project directory, generated the .so files, import them instead of python modules and run. I run every test file and it passed in all of them. To run the tests, run the file 'run-tests.sh' I used just .pyx in this module, should I reimplement it using pxd with the normal .py? I'll still see the performance, and tell here... The code is updated in repository... Best Regards []s Arthur 2011/4/13 Stefan Behnel > Robert Bradshaw, 12.04.2011 22:42: > >> On Tue, Apr 12, 2011 at 11:22 AM, Stefan Behnel wrote: >> >>> Arthur de Souza Ribeiro, 12.04.2011 14:59: >>> >>>> The code is in this repository: >>>> >>>> https://github.com/arthursribeiro/JSON-module your feedback would be >>>> very >>>> important, so that I could improve my skills to get more and more able >>>> to >>>> work sooner in the project. >>>> >>> >>> I'd strongly suggest implementing this in pure Python (.py files instead >>> of >>> .pyx files), with externally provided static types for performance. A >>> single >>> code base is very advantageous for a large project like CPython, much >>> more >>> than the ultimate 5% better performance. >>> >> >> While this is advantageous for the final product, it may not be the >> easiest to get up and running with. >> > > Agreed. Arthur, it's fine if you write Cython code in a .pyx file to get > started. You can just extract the declarations later. > > > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From stefan_ml at behnel.de Fri Apr 15 08:31:47 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 15 Apr 2011 08:31:47 +0200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: <4DA3F455.8070006@behnel.de> <4DA4984D.3020007@behnel.de> <4DA5D624.50609@behnel.de> Message-ID: <4DA7E653.4060801@behnel.de> [please avoid top-posting] Arthur de Souza Ribeiro, 15.04.2011 04:31: > I've created the .pyx files and it passed in all python tests. Fine. As far as I can see, you only added static types in some places. Did you test if they are actually required (maybe using "cython -a")? Some of them look rather counterproductive and should lead to a major slow-down. I added comments to your initial commit. Note that it's not obvious from your initial commit what you actually changed. It would have been better to import the original file first, rename it to .pyx, and then commit your changes. It appears that you accidentally added your .c and .so files to your repo. https://github.com/arthursribeiro/JSON-module > To test them, as I said, I copied the .py test files to my project > directory, generated the .so files, import them instead of python modules > and run. I run every test file and it passed in all of them. To run the > tests, run the file 'run-tests.sh' > > I used just .pyx in this module, should I reimplement it using pxd with the > normal .py? Not at this point. I think it's more important to get some performance numbers to see how your module behaves compared to the C accelerator module (_json.c). I think the best approach to this project would actually be to start with profiling the Python implementation to see where performance problems occur (or to look through _json.c to see what the CPython developers considered performance critical), and then put the focus on trying to speed up only those parts of the Python implementation, by adding static types and potentially even rewriting them in a way that Cython can optimise them better. Stefan From yury at shurup.com Fri Apr 15 16:20:09 2011 From: yury at shurup.com (Yury V. Zaytsev) Date: Fri, 15 Apr 2011 16:20:09 +0200 Subject: [Cython] Incompatibility with numpy-1.5.1 - fixed in master?! Message-ID: <1302877209.6733.61.camel@mypride> Hi folks! I have just ran into buffer protocol incompatibility problem with numpy-1.5.1, which lead me to discover the following ticket (discussed back in December 2010 on this list): http://trac.cython.org/cython_trac/ticket/630 In despair, I was about to try to see if there is anything I can do to fix it myself. To this end I cloned your git repository and set up my Python environment to use the latest bleeding edge version from there. To my surprise I discovered that my code started working and I don't have the buffer interface problem that I was facing before anymore. So my suggestion would be to maybe test it more thoroughly and if it is indeed the case, close the ticket. I tried to subscribe to it or leave a comment, but I need an account which I can't register on my own. Still, I have the following question: which risks am I running by using the bleeding edge version and is there any chance this could inflict cancer on me? Do you have plans to make a new point release which includes this very important improvement anytime soon? Thanks! P.S. I've met Stefan Behnel at FACETS CodeJam the year before last. Back then I didn't realize how much you rock guys and how I will need Cython in the future for my research... 
-- Sincerely yours, Yury V. Zaytsev From yury at shurup.com Fri Apr 15 16:48:47 2011 From: yury at shurup.com (Yury V. Zaytsev) Date: Fri, 15 Apr 2011 16:48:47 +0200 Subject: [Cython] Incompatibility with numpy-1.5.1 - fixed in master?! In-Reply-To: <1302877209.6733.61.camel@mypride> References: <1302877209.6733.61.camel@mypride> Message-ID: <1302878927.6733.65.camel@mypride> On Fri, 2011-04-15 at 16:20 +0200, Yury V. Zaytsev wrote: > To my surprise I discovered that my code started working and I don't > have the buffer interface problem that I was facing before anymore. I am under impression that the culprit was this commit: Commit SHA1: dd3da6c283a0750c9c6514991be719ac064e79b4 Commit message: PEP 3118: fix for NULL Py_buffer arg Committer: Lisandro Dalcin 2011-04-08 00:25:45 -- Sincerely yours, Yury V. Zaytsev From dalcinl at gmail.com Fri Apr 15 18:45:31 2011 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Fri, 15 Apr 2011 13:45:31 -0300 Subject: [Cython] Incompatibility with numpy-1.5.1 - fixed in master?! In-Reply-To: <1302877209.6733.61.camel@mypride> References: <1302877209.6733.61.camel@mypride> Message-ID: On 15 April 2011 11:20, Yury V. Zaytsev wrote: > Hi folks! > > I have just ran into buffer protocol incompatibility problem with > numpy-1.5.1, which lead me to discover the following ticket (discussed > back in December 2010 on this list): > > http://trac.cython.org/cython_trac/ticket/630 > > In despair, I was about to try to see if there is anything I can do to > fix it myself. To this end I cloned your git repository and set up my > Python environment to use the latest bleeding edge version from there. > > To my surprise I discovered that my code started working and I don't > have the buffer interface problem that I was facing before anymore. > > So my suggestion would be to maybe test it more thoroughly and if it is > indeed the case, close the ticket. I tried to subscribe to it or leave a > comment, but I need an account which I can't register on my own. > I'm opposed to close the ticket. Cython cannot currently parse format strings according to the full spec. It worked for you just because of some quick fixes I've pushed for simple cases. > Still, I have the following question: which risks am I running by using > the bleeding edge version and is there any chance this could inflict > cancer on me? > Well, usually the bleeding edge is more featured and bug-free than latest release. However, right now, generator support was merged and you could have some performance regressions. But I'm using the bleeding edge for every day development work of mpi4py/petsc4py. > > Do you have plans to make a new point release which includes this very > important improvement anytime soon? > Not sure, we still have some (performance) regressions to fix, all of them related to the new generators support. -- Lisandro Dalcin --------------- CIMEC (INTEC/CONICET-UNL) Predio CONICET-Santa Fe Colectora RN 168 Km 472, Paraje El Pozo 3000 Santa Fe, Argentina Tel: +54-342-4511594 (ext 1011) Tel/Fax: +54-342-4511169 From dalcinl at gmail.com Fri Apr 15 18:52:29 2011 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Fri, 15 Apr 2011 13:52:29 -0300 Subject: [Cython] Incompatibility with numpy-1.5.1 - fixed in master?! In-Reply-To: <1302878927.6733.65.camel@mypride> References: <1302877209.6733.61.camel@mypride> <1302878927.6733.65.camel@mypride> Message-ID: On 15 April 2011 11:48, Yury V. Zaytsev wrote: > On Fri, 2011-04-15 at 16:20 +0200, Yury V. 
Zaytsev wrote: > >> To my surprise I discovered that my code started working and I don't >> have the buffer interface problem that I was facing before anymore. > > I am under impression that the culprit was this commit: > > Commit SHA1: dd3da6c283a0750c9c6514991be719ac064e79b4 > Commit message: PEP 3118: fix for NULL Py_buffer arg > Committer: Lisandro Dalcin ?2011-04-08 00:25:45 > This commit should be easy to cherry-pick and backport to latest release. However, there is a potential risk that user code implementing __getbuffer__ in extension classes do not make the check "if info == NULL: return", and then you get a SEGFAULT. AFAIK, all this business with Py_buffer pointer being NULL was added because of some obscure bug in Python==3.0.x, but I would expect people out there to be using Python >= 3.1.x, so not a big deal. -- Lisandro Dalcin --------------- CIMEC (INTEC/CONICET-UNL) Predio CONICET-Santa Fe Colectora RN 168 Km 472, Paraje El Pozo 3000 Santa Fe, Argentina Tel: +54-342-4511594 (ext 1011) Tel/Fax: +54-342-4511169 From d.s.seljebotn at astro.uio.no Fri Apr 15 19:57:21 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Fri, 15 Apr 2011 19:57:21 +0200 Subject: [Cython] Incompatibility with numpy-1.5.1 - fixed in master?! In-Reply-To: References: <1302877209.6733.61.camel@mypride> Message-ID: <4DA88701.5050506@astro.uio.no> On 04/15/2011 06:45 PM, Lisandro Dalcin wrote: > On 15 April 2011 11:20, Yury V. Zaytsev wrote: >> Hi folks! >> >> I have just ran into buffer protocol incompatibility problem with >> numpy-1.5.1, which lead me to discover the following ticket (discussed >> back in December 2010 on this list): >> >> http://trac.cython.org/cython_trac/ticket/630 >> >> In despair, I was about to try to see if there is anything I can do to >> fix it myself. To this end I cloned your git repository and set up my >> Python environment to use the latest bleeding edge version from there. >> >> To my surprise I discovered that my code started working and I don't >> have the buffer interface problem that I was facing before anymore. >> >> So my suggestion would be to maybe test it more thoroughly and if it is >> indeed the case, close the ticket. I tried to subscribe to it or leave a >> comment, but I need an account which I can't register on my own. >> > > I'm opposed to close the ticket. Cython cannot currently parse format > strings according to the full spec. It worked for you just because of > some quick fixes I've pushed for simple cases. Pauli Virtanen fixed this and there's a pull request here: https://github.com/cython/cython/pull/17 I'll get to it in a couple of days if nobody beats me to it. BTW, did anyone figure out how to get emailed on pull requests? (Yes, I checked off all the "send me email" boxes etc., we were talking about this during the workshop -- it seems that org admins can't subscribe...) DS From d.s.seljebotn at astro.uio.no Fri Apr 15 20:00:32 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Fri, 15 Apr 2011 20:00:32 +0200 Subject: [Cython] Incompatibility with numpy-1.5.1 - fixed in master?! In-Reply-To: <4DA88701.5050506@astro.uio.no> References: <1302877209.6733.61.camel@mypride> <4DA88701.5050506@astro.uio.no> Message-ID: <4DA887C0.3000505@astro.uio.no> On 04/15/2011 07:57 PM, Dag Sverre Seljebotn wrote: > On 04/15/2011 06:45 PM, Lisandro Dalcin wrote: >> On 15 April 2011 11:20, Yury V. Zaytsev wrote: >>> Hi folks! 
>>> >>> I have just ran into buffer protocol incompatibility problem with >>> numpy-1.5.1, which lead me to discover the following ticket (discussed >>> back in December 2010 on this list): >>> >>> http://trac.cython.org/cython_trac/ticket/630 >>> >>> In despair, I was about to try to see if there is anything I can do to >>> fix it myself. To this end I cloned your git repository and set up my >>> Python environment to use the latest bleeding edge version from there. >>> >>> To my surprise I discovered that my code started working and I don't >>> have the buffer interface problem that I was facing before anymore. >>> >>> So my suggestion would be to maybe test it more thoroughly and if it is >>> indeed the case, close the ticket. I tried to subscribe to it or leave a >>> comment, but I need an account which I can't register on my own. >>> >> >> I'm opposed to close the ticket. Cython cannot currently parse format >> strings according to the full spec. It worked for you just because of >> some quick fixes I've pushed for simple cases. > > Pauli Virtanen fixed this and there's a pull request here: > > https://github.com/cython/cython/pull/17 > > I'll get to it in a couple of days if nobody beats me to it. > > BTW, did anyone figure out how to get emailed on pull requests? (Yes, I > checked off all the "send me email" boxes etc., we were talking about > this during the workshop -- it seems that org admins can't subscribe...) And BTW, the Cython test suite exposed a bug in NumPy as well -- you can see it in the test case comments. So there may be NumPy builds (and releases?) out there that fail the Cython test suite because of a bug in NumPy. It only affect unpacked structs though; I believe you're good with packed structs (and few use NumPy with unpacked structs). DS From yury at shurup.com Fri Apr 15 20:21:38 2011 From: yury at shurup.com (Yury V. Zaytsev) Date: Fri, 15 Apr 2011 20:21:38 +0200 Subject: [Cython] Incompatibility with numpy-1.5.1 - fixed in master?! In-Reply-To: <4DA88701.5050506@astro.uio.no> References: <1302877209.6733.61.camel@mypride> <4DA88701.5050506@astro.uio.no> Message-ID: <1302891698.6733.72.camel@mypride> On Fri, 2011-04-15 at 19:57 +0200, Dag Sverre Seljebotn wrote: > BTW, did anyone figure out how to get emailed on pull requests? (Yes, I > checked off all the "send me email" boxes etc., we were talking about > this during the workshop -- it seems that org admins can't subscribe...) That's strange, I'm getting notifications, when I get pull requests for the projects I have in my organization. I don't remember ticking any extra boxes other than in the Account Settings -> Notification Center though. Maybe you are not actually watching the main repository? I would try to set the watch flag and see if it works... Thanks for you your hard work on Cython by the way! -- Sincerely yours, Yury V. Zaytsev From yury at shurup.com Fri Apr 15 20:23:15 2011 From: yury at shurup.com (Yury V. Zaytsev) Date: Fri, 15 Apr 2011 20:23:15 +0200 Subject: [Cython] Incompatibility with numpy-1.5.1 - fixed in master?! In-Reply-To: References: <1302877209.6733.61.camel@mypride> Message-ID: <1302891795.6733.75.camel@mypride> On Fri, 2011-04-15 at 13:45 -0300, Lisandro Dalcin wrote: > I'm opposed to close the ticket. Cython cannot currently parse format > strings according to the full spec. It worked for you just because of > some quick fixes I've pushed for simple cases. I don't insist on that, especially now that I didn't have the full picture. 
Maybe if you have access to the track just add a note that most of the problems have been solved in the latest git master and it will be fixed for good before the next release? Thanks, Lisandro! -- Sincerely yours, Yury V. Zaytsev From stefan_ml at behnel.de Fri Apr 15 22:20:51 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 15 Apr 2011 22:20:51 +0200 Subject: [Cython] speed.pypy.org In-Reply-To: <4DA2FD5E.5090704@behnel.de> References: <4DA2FD5E.5090704@behnel.de> Message-ID: <4DA8A8A3.6080600@behnel.de> Stefan Behnel, 11.04.2011 15:08: > I'm currently discussing with Maciej Fijalkowski (PyPy) how to get Cython > running on speed.pypy.org (that's what I wrote "cythonrun" for). If it > works out well, we may have it up in a couple of days. ... or maybe not. It may take a little longer due to lack of time on his side. > I would expect that Cython won't be a big winner in this game, given that > it will only compile plain untyped Python code. It's also going to fail > entirely in some of the benchmarks. But I think it's worth having it up > there, simply as a way for us to see where we are performance-wise and to > get quick (nightly) feed-back about optimisations we try. The benchmark > suite is also a nice set of real-world Python code that will allow us to > find compliance issues. Ok, here's what I have so far. I fixed a couple of bugs in Cython and got at least some of the benchmarks running. Note that they are actually simple ones, only a single module. Basically all complex benchmarks fail due to known bugs, such as Cython def functions not accepting attribute assignments (e.g. on wrapping). There's also a problem with code that uses platform specific names conditionally, such as WindowsError when running on Windows. Cython complains about non-builtin names here. I'm considering to turn that into a visible warning instead of an error, so that the name would instead be looked up dynamically to let the code fail at runtime *iff* it reaches the name lookup. Anyway, here are the numbers. I got them with "auto_cpdef" enabled, although that doesn't even seem to make that a big difference. The baseline is a self-compiled Python 2.7.1+ (about a month old). 
Stefan ### ai ### Min: 0.402407 -> 0.362190: 1.1110x faster Avg: 0.408784 -> 0.366898: 1.1142x faster Significant (t=10.017195, a=0.95) Stddev: 0.00824 -> 0.00442: 1.8668x smaller ### chaos ### (with a bug fixed in the benchmark itself) Min: 0.393362 -> 0.231932: 1.6960x faster Avg: 0.401941 -> 0.234089: 1.7170x faster Significant (t=36.128709, a=0.95) Stddev: 0.01004 -> 0.00267: 3.7538x smaller ### crypto_pyaes ### Min: 2.629560 -> 1.276433: 2.0601x faster Avg: 2.639409 -> 1.277742: 2.0657x faster Significant (t=368.652396, a=0.95) Stddev: 0.00812 -> 0.00153: 5.3215x smaller ### fannkuch ### Min: 1.512630 -> 0.853309: 1.7727x faster Avg: 1.522860 -> 0.860237: 1.7703x faster Significant (t=118.573908, a=0.95) Stddev: 0.00880 -> 0.00887: 1.0073x larger ### float ### Min: 0.452620 -> 0.343341: 1.3183x faster Avg: 0.475137 -> 0.349356: 1.3600x faster Significant (t=9.575876, a=0.95) Stddev: 0.02838 -> 0.00757: 3.7489x smaller ### go ### Min: 0.758998 -> 0.491929: 1.5429x faster Avg: 0.764110 -> 0.496518: 1.5389x faster Significant (t=90.848407, a=0.95) Stddev: 0.00400 -> 0.00523: 1.3096x larger ### nbody_modified ### Min: 0.399168 -> 0.197931: 2.0167x faster Avg: 0.401379 -> 0.203112: 1.9762x faster Significant (t=42.377829, a=0.95) Stddev: 0.00293 -> 0.01004: 3.4337x larger ### raytracesimple ### (module renamed from "raytrace-simple") Min: 2.016425 -> 1.182970: 1.7045x faster Avg: 2.030030 -> 1.192164: 1.7028x faster Significant (t=78.219481, a=0.95) Stddev: 0.02184 -> 0.00983: 2.2211x smaller ### richards ### Min: 0.286723 -> 0.162430: 1.7652x faster Avg: 0.289933 -> 0.165193: 1.7551x faster Significant (t=52.898468, a=0.95) Stddev: 0.00392 -> 0.00352: 1.1127x smaller From robertwb at math.washington.edu Sat Apr 16 08:53:46 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Fri, 15 Apr 2011 23:53:46 -0700 Subject: [Cython] speed.pypy.org In-Reply-To: <4DA8A8A3.6080600@behnel.de> References: <4DA2FD5E.5090704@behnel.de> <4DA8A8A3.6080600@behnel.de> Message-ID: On Fri, Apr 15, 2011 at 1:20 PM, Stefan Behnel wrote: > Stefan Behnel, 11.04.2011 15:08: >> >> I'm currently discussing with Maciej Fijalkowski (PyPy) how to get Cython >> running on speed.pypy.org (that's what I wrote "cythonrun" for). If it >> works out well, we may have it up in a couple of days. > > ... or maybe not. It may take a little longer due to lack of time on his > side. > > >> I would expect that Cython won't be a big winner in this game, given that >> it will only compile plain untyped Python code. It's also going to fail >> entirely in some of the benchmarks. But I think it's worth having it up >> there, simply as a way for us to see where we are performance-wise and to >> get quick (nightly) feed-back about optimisations we try. The benchmark >> suite is also a nice set of real-world Python code that will allow us to >> find compliance issues. > > Ok, here's what I have so far. I fixed a couple of bugs in Cython and got at > least some of the benchmarks running. Note that they are actually simple > ones, only a single module. Basically all complex benchmarks fail due to > known bugs, such as Cython def functions not accepting attribute assignments > (e.g. on wrapping). There's also a problem with code that uses platform > specific names conditionally, such as WindowsError when running on Windows. > Cython complains about non-builtin names here. 
I'm considering to turn that > into a visible warning instead of an error, so that the name would instead > be looked up dynamically to let the code fail at runtime *iff* it reaches > the name lookup. Given the usefulness of the error, and the (relative) lack of issues with it so far, I'd rather not turn it into only a warning by default (though an option might be nice). Another option would be to whitelist the presumably small, finite set of names that are platform-dependent. > Anyway, here are the numbers. I got them with "auto_cpdef" enabled, although > that doesn't even seem to make that a big difference. The baseline is a > self-compiled Python 2.7.1+ (about a month old). Cool. So basically everything is faster, usually somewhere between a 50-100% improvement. There's lots of room for improvement, though a JIT has a significant advantage that we don't get for untyped code. - Robert > ### ai ### > Min: 0.402407 -> 0.362190: 1.1110x faster > Avg: 0.408784 -> 0.366898: 1.1142x faster > Significant (t=10.017195, a=0.95) > Stddev: 0.00824 -> 0.00442: 1.8668x smaller > > > ### chaos ### ?(with a bug fixed in the benchmark itself) > Min: 0.393362 -> 0.231932: 1.6960x faster > Avg: 0.401941 -> 0.234089: 1.7170x faster > Significant (t=36.128709, a=0.95) > Stddev: 0.01004 -> 0.00267: 3.7538x smaller > > > ### crypto_pyaes ### > Min: 2.629560 -> 1.276433: 2.0601x faster > Avg: 2.639409 -> 1.277742: 2.0657x faster > Significant (t=368.652396, a=0.95) > Stddev: 0.00812 -> 0.00153: 5.3215x smaller > > > ### fannkuch ### > Min: 1.512630 -> 0.853309: 1.7727x faster > Avg: 1.522860 -> 0.860237: 1.7703x faster > Significant (t=118.573908, a=0.95) > Stddev: 0.00880 -> 0.00887: 1.0073x larger > > > ### float ### > Min: 0.452620 -> 0.343341: 1.3183x faster > Avg: 0.475137 -> 0.349356: 1.3600x faster > Significant (t=9.575876, a=0.95) > Stddev: 0.02838 -> 0.00757: 3.7489x smaller > > > ### go ### > Min: 0.758998 -> 0.491929: 1.5429x faster > Avg: 0.764110 -> 0.496518: 1.5389x faster > Significant (t=90.848407, a=0.95) > Stddev: 0.00400 -> 0.00523: 1.3096x larger > > > ### nbody_modified ### > Min: 0.399168 -> 0.197931: 2.0167x faster > Avg: 0.401379 -> 0.203112: 1.9762x faster > Significant (t=42.377829, a=0.95) > Stddev: 0.00293 -> 0.01004: 3.4337x larger > > > ### raytracesimple ### ? (module renamed from "raytrace-simple") > Min: 2.016425 -> 1.182970: 1.7045x faster > Avg: 2.030030 -> 1.192164: 1.7028x faster > Significant (t=78.219481, a=0.95) > Stddev: 0.02184 -> 0.00983: 2.2211x smaller > > > ### richards ### > Min: 0.286723 -> 0.162430: 1.7652x faster > Avg: 0.289933 -> 0.165193: 1.7551x faster > Significant (t=52.898468, a=0.95) > Stddev: 0.00392 -> 0.00352: 1.1127x smaller > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From stefan_ml at behnel.de Sat Apr 16 10:20:12 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sat, 16 Apr 2011 10:20:12 +0200 Subject: [Cython] speed.pypy.org In-Reply-To: References: <4DA2FD5E.5090704@behnel.de> <4DA8A8A3.6080600@behnel.de> Message-ID: <4DA9513C.1000503@behnel.de> Robert Bradshaw, 16.04.2011 08:53: > On Fri, Apr 15, 2011 at 1:20 PM, Stefan Behnel wrote: >> Stefan Behnel, 11.04.2011 15:08: >>> >>> I'm currently discussing with Maciej Fijalkowski (PyPy) how to get Cython >>> running on speed.pypy.org (that's what I wrote "cythonrun" for). If it >>> works out well, we may have it up in a couple of days. 
>>> >>> I would expect that Cython won't be a big winner in this game, given that >>> it will only compile plain untyped Python code. It's also going to fail >>> entirely in some of the benchmarks. But I think it's worth having it up >>> there, simply as a way for us to see where we are performance-wise and to >>> get quick (nightly) feed-back about optimisations we try. The benchmark >>> suite is also a nice set of real-world Python code that will allow us to >>> find compliance issues. >> >> Ok, here's what I have so far. I fixed a couple of bugs in Cython and got at >> least some of the benchmarks running. Note that they are actually simple >> ones, only a single module. Basically all complex benchmarks fail due to >> known bugs, such as Cython def functions not accepting attribute assignments >> (e.g. on wrapping). There's also a problem with code that uses platform >> specific names conditionally, such as WindowsError when running on Windows. >> Cython complains about non-builtin names here. I'm considering to turn that >> into a visible warning instead of an error, so that the name would instead >> be looked up dynamically to let the code fail at runtime *iff* it reaches >> the name lookup. > > Given the usefulness of the error, and the (relative) lack of issues > with it so far, I'd rather not turn it into only a warning by default > (though an option might be nice). Another option would be to whitelist > the presumably small, finite set of names that are platform-dependent. I agree, this has caught countless bugs in the past. I think a whitelist makes sense, but note that this does not obey Python semantics, either. In Python, any unknown name is just fine as long as it's not being looked up. Even though the use cases for this are clearly less common than the cases where it bites users. I'm currently changing the builtins caching support to simply not cache unknown names, so that they will be looked up at runtime at the point where they are used (even though, of cause, they are compile time errors by default). In combination with a whitelist and with an option to make unknown builtins a warning instead of an error, this will give us a pretty good default trade-off between Python semantics, safety and performance, with an easy option for better Python compatibility. >> Anyway, here are the numbers. I got them with "auto_cpdef" enabled, although >> that doesn't even seem to make that a big difference. The baseline is a >> self-compiled Python 2.7.1+ (about a month old). > > Cool. So basically everything is faster, usually somewhere between a > 50-100% improvement. There's lots of room for improvement, though a > JIT has a significant advantage that we don't get for untyped code. Sure, we won't be as fast as PyPy for plain untyped Python code. But the benchmark suite gives us a clear target, both in terms of performance and compatibility. 
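To make the point about unknown names concrete (in CPython a name is only an error if the lookup is actually executed), here is a small pure-Python illustration of the WindowsError pattern mentioned earlier in this thread; the helper name is made up, and as discussed above, Cython at this point still rejects the unknown name at compile time rather than deferring to runtime:

    import sys

    def platform_error_type():
        if sys.platform == "win32":
            # This lookup only happens on Windows, so plain CPython imports
            # and runs this module fine everywhere; the conditionally used
            # name is what currently trips the compile-time check.
            return WindowsError
        return OSError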
Stefan From robertwb at math.washington.edu Sat Apr 16 10:30:31 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Sat, 16 Apr 2011 01:30:31 -0700 Subject: [Cython] speed.pypy.org In-Reply-To: <4DA9513C.1000503@behnel.de> References: <4DA2FD5E.5090704@behnel.de> <4DA8A8A3.6080600@behnel.de> <4DA9513C.1000503@behnel.de> Message-ID: On Sat, Apr 16, 2011 at 1:20 AM, Stefan Behnel wrote: > Robert Bradshaw, 16.04.2011 08:53: >> >> On Fri, Apr 15, 2011 at 1:20 PM, Stefan Behnel wrote: >>> >>> Stefan Behnel, 11.04.2011 15:08: >>>> >>>> I'm currently discussing with Maciej Fijalkowski (PyPy) how to get >>>> Cython >>>> running on speed.pypy.org (that's what I wrote "cythonrun" for). If it >>>> works out well, we may have it up in a couple of days. >>>> >>>> I would expect that Cython won't be a big winner in this game, given >>>> that >>>> it will only compile plain untyped Python code. It's also going to fail >>>> entirely in some of the benchmarks. But I think it's worth having it up >>>> there, simply as a way for us to see where we are performance-wise and >>>> to >>>> get quick (nightly) feed-back about optimisations we try. The benchmark >>>> suite is also a nice set of real-world Python code that will allow us to >>>> find compliance issues. >>> >>> Ok, here's what I have so far. I fixed a couple of bugs in Cython and got >>> at >>> least some of the benchmarks running. Note that they are actually simple >>> ones, only a single module. Basically all complex benchmarks fail due to >>> known bugs, such as Cython def functions not accepting attribute >>> assignments >>> (e.g. on wrapping). There's also a problem with code that uses platform >>> specific names conditionally, such as WindowsError when running on >>> Windows. >>> Cython complains about non-builtin names here. I'm considering to turn >>> that >>> into a visible warning instead of an error, so that the name would >>> instead >>> be looked up dynamically to let the code fail at runtime *iff* it reaches >>> the name lookup. >> >> Given the usefulness of the error, and the (relative) lack of issues >> with it so far, I'd rather not turn it into only a warning by default >> (though an option might be nice). Another option would be to whitelist >> the presumably small, finite set of names that are platform-dependent. > > I agree, this has caught countless bugs in the past. I think a whitelist > makes sense, but note that this does not obey Python semantics, either. In > Python, any unknown name is just fine as long as it's not being looked up. > Even though the use cases for this are clearly less common than the cases > where it bites users. Well, we certainly want to provide a way for users to disable it, and would trigger it with the -pedantic flag. > I'm currently changing the builtins caching support to simply not cache > unknown names, so that they will be looked up at runtime at the point where > they are used (even though, of cause, they are compile time errors by > default). In combination with a whitelist and with an option to make unknown > builtins a warning instead of an error, this will give us a pretty good > default trade-off between Python semantics, safety and performance, with an > easy option for better Python compatibility. +1 >>> Anyway, here are the numbers. I got them with "auto_cpdef" enabled, >>> although >>> that doesn't even seem to make that a big difference. The baseline is a >>> self-compiled Python 2.7.1+ (about a month old). >> >> Cool. 
?So basically everything is faster, usually somewhere between a >> 50-100% improvement. There's lots of room for improvement, though a >> JIT has a significant advantage that we don't get for untyped code. > > Sure, we won't be as fast as PyPy for plain untyped Python code. But the > benchmark suite gives us a clear target, both in terms of performance and > compatibility. My thoughts as well. With good control flow in place, we can also look into generating multiple branches for code (e.g. the likely, fast one that assumes no overflows, etc. and bailing to the slow, safe one.) - Robert From robertwb at math.washington.edu Sat Apr 16 10:51:39 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Sat, 16 Apr 2011 01:51:39 -0700 Subject: [Cython] Code examples missing in Cython User's Guide In-Reply-To: References: Message-ID: On Thu, Apr 14, 2011 at 12:05 PM, Chris Lasher wrote: > Thanks Arthur. I actually found the code examples in a grandparent directory > of the User's Guide documentation source directory. I have submitted a pull > request on GitHub which corrects the User's Guide Tutorial documentation so > that it now includes the code. > > https://github.com/cython/cython/pull/24 > > Could a Cython dev please review and approve this change Done. > and then build and > update the docs on the Cython website for the sake of other Cython newbies? This will be easier to do once we have a separate maintenance branch, as we'd rather the documentation match the current, not next, release. I don't think we (yet) have any new features mentioned in the docs, so I'll go ahead an re-generate them (there's been a lot of good cleanup in there, mostly thanks to Francesc Alted). The lastest docs can be found at https://sage.math.washington.edu:8091/hudson/job/cython-docs/doclinks/1/. - Robert > On Tue, Apr 12, 2011 at 1:39 PM, Arthur de Souza Ribeiro > wrote: >> >> Hey Chris, the code for primes and fib examples, are in the directory >> 'Demos' of the repository... >> Best Regards. >> []s >> Arthur >> 2011/4/12 Chris Lasher >>> >>> My apologies for cross-posting this from cython-users, but I realized I >>> should have sent this bug report to cython-devel. >>> >>> Several code examples are missing from the User's Guide due to the source >>> code files being moved or deleted. See for example the Tutorial page on the >>> User's Guide http://docs.cython.org/src/userguide/tutorial.html The code for >>> both fib.pyx and primes.pyx (and their setup.py files) is absent from the >>> document. >>> >>> I looked at the ReST files and they are trying to source the files from >>> an "examples" directory. Looking through the git repository, I wasn't able >>> to locate this directory. Has it been moved or deleted? 
>>> _______________________________________________ >>> cython-devel mailing list >>> cython-devel at python.org >>> http://mail.python.org/mailman/listinfo/cython-devel >>> >> > > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > > From d.s.seljebotn at astro.uio.no Sat Apr 16 18:42:07 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Sat, 16 Apr 2011 18:42:07 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> Message-ID: <4DA9C6DF.1060406@astro.uio.no> (Moving discussion from http://markflorisson.wordpress.com/, where Mark said:) """ Started a new branch https://github.com/markflorisson88/cython/tree/openmp . Now the question is whether sharing attributes should be propagated outwards. e.g. if you do for i in prange(m): for j in prange(n): sum += i * j then ?sum? is a reduction for the inner parallel loop, but not for the outer one. So the user would currently have to rewrite this to for i in prange(m): for j in prange(n): sum += i * j sum += 0 which seems a bit silly . Of course, we could just disable nested parallelism, or tell the users to use a prange and a ?for from? in such cases. """ Dag: Interesting. The first one is definitely the behaviour we want, as long as it doesn't cause unintended consequences. I don't really think it will -- the important thing is that that the order of loop iteration evaluation must be unimportant. And that is still true (for the outer loop, as well as for the inner) in your first example. Question: When you have nested pranges, what will happen is that two nested OpenMP parallel blocks are used, right? And do you know if there is complete freedom/"reentrancy" in that variables that are thread-private in an outer parallel block and be shared in an inner one, and vice versa? If so I'd think that this algorithm should work and feel natural: - In each prange, for the purposes of variable private/shared/reduction inference, consider all internal "prange" just as if they had been "range"; no special treatment. - Recurse to children pranges. DS From stefan_ml at behnel.de Sat Apr 16 21:13:10 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sat, 16 Apr 2011 21:13:10 +0200 Subject: [Cython] Interpretting cython -a In-Reply-To: <10374.32454.qm@web112112.mail.gq1.yahoo.com> References: <10374.32454.qm@web112112.mail.gq1.yahoo.com> Message-ID: <4DA9EA46.9000800@behnel.de> Stuart Axon, 16.04.2011 19:44: > Do I want things to be yellower or whiter? White is C, yellow is Python. > If I add an annotation and the colours don't change then I guess it's doing > nothing, right. You also want to click on the line in question and check the C code before an afterwards. If it doesn't change due to a type annotation, it's possible that Cython already inferred that type anyway, or that the annotation simply doesn't change the code, e.g. due to some other types it interacts with. Stefan From vitja.makarov at gmail.com Sun Apr 17 17:57:28 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Sun, 17 Apr 2011 19:57:28 +0400 Subject: [Cython] Recent bugs in generators Message-ID: Hi! 1. 
Lambda-generator: Previous implementation was inspired by Python2.6: >>> list((lambda:((yield 1), (yield 2)))()) [1, 2, (None, None)] things changed in python2.7: >>> list((lambda:((yield 1), (yield 2)))()) [1, 2] 2. GeneratorExit is initialized to StopIteration when running generators_py doctests This is strange behaviour of doctest module, try this: x = __builtins__ def foo(): """ >>> type(x) """ 3. check_yield_in_exception() Cython calls __Pyx_ExceptionReset when except block is done, so when yield is there no exception reset is called. I'm not sure how to fix this. import sys def foo(): """ >>> list(foo()) [, None] """ try: raise ValueError except ValueError: yield sys.exc_info()[0] yield sys.exc_info()[0] # exc_info is lost here Here is quick fix for 1 and 2 https://github.com/cython/cython/pull/25 -- vitja. From arthurdesribeiro at gmail.com Sun Apr 17 20:07:38 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Sun, 17 Apr 2011 15:07:38 -0300 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: <4DA7E653.4060801@behnel.de> References: <4DA3F455.8070006@behnel.de> <4DA4984D.3020007@behnel.de> <4DA5D624.50609@behnel.de> <4DA7E653.4060801@behnel.de> Message-ID: 2011/4/15 Stefan Behnel > [please avoid top-posting] > > Arthur de Souza Ribeiro, 15.04.2011 04:31: > > I've created the .pyx files and it passed in all python tests. >> > > Fine. > > As far as I can see, you only added static types in some places. Did you test if they are actually required (maybe using "cython -a")? Some > of them look rather counterproductive and should lead to a major slow-down. In fact, I didn't, but, after you told me to do that, I run cython -a and removed some unnecessary types. > I added comments to your initial commit. > Hi Stefan, about your first comment : "And it's better to let Cython know that this name refers to a function." in line 69 of encoder.pyx file I didn't understand well what does that mean, can you explain more this comment? About the other comments, I think I solved them all, any problem with them or other ones, please tell me. I'll try to fix. > Note that it's not obvious from your initial commit what you actually > changed. It would have been better to import the original file first, rename > it to .pyx, and then commit your changes. > I created a directory named 'Diff files' where I put the files generated by 'diff' command that i run in my computer, if you think it still be better if I commit and then change, there is no problem for me... > > It appears that you accidentally added your .c and .so files to your repo. > > > https://github.com/arthursribeiro/JSON-module > > Removed them. > > To test them, as I said, I copied the .py test files to my project >> directory, generated the .so files, import them instead of python modules >> and run. I run every test file and it passed in all of them. To run the >> tests, run the file 'run-tests.sh' >> >> I used just .pyx in this module, should I reimplement it using pxd with >> the >> normal .py? >> > > Not at this point. I think it's more important to get some performance > numbers to see how your module behaves compared to the C accelerator module > (_json.c). 
I think the best approach to this project would actually be to > start with profiling the Python implementation to see where performance > problems occur (or to look through _json.c to see what the CPython > developers considered performance critical), and then put the focus on > trying to speed up only those parts of the Python implementation, by adding > static types and potentially even rewriting them in a way that Cython can > optimise them better. > I've profilled the module I created and the module that is in Python 3.2, the result is that the cython module spent about 73% less time then python's one, the output to the module was like this (blue for cython, red for python): The behavior between my module and python's one seems to be the same I think that's the way it should be. JSONModule nested_dict 10004 function calls in 0.268 seconds Ordered by: standard name ncalls tottime percall cumtime percall filename:lineno(function) 10000 0.196 0.000 0.196 0.000 :0(dumps) 1 0.000 0.000 0.268 0.268 :0(exec) 1 0.000 0.000 0.000 0.000 :0(setprofile) 1 0.072 0.072 0.268 0.268 :1() 1 0.000 0.000 0.268 0.268 profile:0(for ii in range(10000): fun(thing)) 0 0.000 0.000 profile:0(profiler) json nested_dict 60004 function calls in 1.016 seconds Ordered by: standard name ncalls tottime percall cumtime percall filename:lineno(function) 1 0.000 0.000 1.016 1.016 :0(exec) 20000 0.136 0.000 0.136 0.000 :0(isinstance) 10000 0.120 0.000 0.120 0.000 :0(join) 1 0.000 0.000 0.000 0.000 :0(setprofile) 1 0.088 0.088 1.016 1.016 :1() 10000 0.136 0.000 0.928 0.000 __init__.py:180(dumps) 10000 0.308 0.000 0.792 0.000 encoder.py:172(encode) 10000 0.228 0.000 0.228 0.000 encoder.py:193(iterencode) 1 0.000 0.000 1.016 1.016 profile:0(for ii in range(10000): fun(thing)) 0 0.000 0.000 profile:0(profiler) JSONModule ustring 10004 function calls in 0.140 seconds Ordered by: standard name ncalls tottime percall cumtime percall filename:lineno(function) 10000 0.072 0.000 0.072 0.000 :0(dumps) 1 0.000 0.000 0.140 0.140 :0(exec) 1 0.000 0.000 0.000 0.000 :0(setprofile) 1 0.068 0.068 0.140 0.140 :1() 1 0.000 0.000 0.140 0.140 profile:0(for ii in range(10000): fun(thing)) 0 0.000 0.000 profile:0(profiler) json ustring 40004 function calls in 0.580 seconds Ordered by: standard name ncalls tottime percall cumtime percall filename:lineno(function) 10000 0.092 0.000 0.092 0.000 :0(encode_basestring_ascii) 1 0.004 0.004 0.580 0.580 :0(exec) 10000 0.060 0.000 0.060 0.000 :0(isinstance) 1 0.000 0.000 0.000 0.000 :0(setprofile) 1 0.100 0.100 0.576 0.576 :1() 10000 0.152 0.000 0.476 0.000 __init__.py:180(dumps) 10000 0.172 0.000 0.324 0.000 encoder.py:172(encode) 1 0.000 0.000 0.580 0.580 profile:0(for ii in range(10000): fun(thing)) 0 0.000 0.000 profile:0(profiler) The code is upated in repository, any comments that you might have, please, let me know. Thank you very much for your feedback. Best Regards. []s Arthur > > Stefan > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla at molden.no Sun Apr 17 20:24:11 2011 From: sturla at molden.no (Sturla Molden) Date: Sun, 17 Apr 2011 20:24:11 +0200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. 
In-Reply-To: References: <4DA3F455.8070006@behnel.de> <4DA4984D.3020007@behnel.de> <4DA5D624.50609@behnel.de> <4DA7E653.4060801@behnel.de> Message-ID: <4DAB304B.4010801@molden.no> Den 17.04.2011 20:07, skrev Arthur de Souza Ribeiro: > > I've profilled the module I created and the module that is in Python > 3.2, the result is that the cython module spent about 73% less time > then python's one, the output to the module was like this (blue for > cython, red for python): > > The number of function calls are different. For nested_dict, you have 37320 calls per second for Cython and 59059 calls per second for Python. I am not convinced that is better. Sturla From stefan_ml at behnel.de Sun Apr 17 21:16:39 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sun, 17 Apr 2011 21:16:39 +0200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: <4DAB304B.4010801@molden.no> References: <4DA3F455.8070006@behnel.de> <4DA4984D.3020007@behnel.de> <4DA5D624.50609@behnel.de> <4DA7E653.4060801@behnel.de> <4DAB304B.4010801@molden.no> Message-ID: <4DAB3C97.6070208@behnel.de> Sturla Molden, 17.04.2011 20:24: > Den 17.04.2011 20:07, skrev Arthur de Souza Ribeiro: >> I've profilled the module I created and the module that is in Python 3.2, >> the result is that the cython module spent about 73% less time then >> python's one, the output to the module was like this (blue for cython, >> red for python): > > The number of function calls are different. For nested_dict, you have 37320 > calls per second for Cython and 59059 calls per second for Python. I am not > convinced that is better. Note that there are 20000 calls to isinstance(), which Cython handles internally. The profiler cannot see those. However, the different number of functions calls also makes the profiling results less comparable, since there are fewer calls into the profiler. This leads to a lower performance penalty for Cython in the absolute timings, and consequently to an unfair comparison. Stefan From sturla at molden.no Sun Apr 17 22:11:19 2011 From: sturla at molden.no (Sturla Molden) Date: Sun, 17 Apr 2011 22:11:19 +0200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: <4DAB3C97.6070208@behnel.de> References: <4DA3F455.8070006@behnel.de> <4DA4984D.3020007@behnel.de> <4DA5D624.50609@behnel.de> <4DA7E653.4060801@behnel.de> <4DAB304B.4010801@molden.no> <4DAB3C97.6070208@behnel.de> Message-ID: <4DAB4967.5020507@molden.no> Den 17.04.2011 21:16, skrev Stefan Behnel: > > However, the different number of functions calls also makes the > profiling results less comparable, since there are fewer calls into > the profiler. This leads to a lower performance penalty for Cython in > the absolute timings, and consequently to an unfair comparison. > As I understand it, the profiler will give a profile of a module. To measure absolute performance, one should use timeit or just time.clock. Sturla From stefan_ml at behnel.de Sun Apr 17 22:56:48 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sun, 17 Apr 2011 22:56:48 +0200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. 
In-Reply-To: References: <4DA3F455.8070006@behnel.de> <4DA4984D.3020007@behnel.de> <4DA5D624.50609@behnel.de> <4DA7E653.4060801@behnel.de> Message-ID: <4DAB5410.4050404@behnel.de> Arthur de Souza Ribeiro, 17.04.2011 20:07: > Hi Stefan, about your first comment : "And it's better to let Cython know > that this name refers to a function." in line 69 of encoder.pyx file I > didn't understand well what does that mean, can you explain more this > comment? Hmm, sorry, I think that was not so important. That code line is only used to override the Python implementation with the implementation from the external C accelerator module. To do that, it assigns either of the two functions to a name. So, when that name is called in the code, Cython cannot know that it actually is a function, and has to resort to Python calling, whereas a visible c(p)def function that is defined inside of the same module could be called faster. I missed the fact that this name isn't really used inside of the module, so whether Cython knows that it's a function or not isn't really all that important. I added another comment to this commit, though: https://github.com/arthursribeiro/JSON-module/commit/e2d80e0aeab6d39ff2d9b847843423ebdb9c57b7#diff-4 > About the other comments, I think I solved them all, any problem with them > or other ones, please tell me. I'll try to fix. It looks like you fixed a good deal of them. I actually tried to work with your code, but I'm not sure how you are building it. Could you give me a hint on that? Where did you actually take the original code from? Python 3.2? Or from Python's hg branch? >> Note that it's not obvious from your initial commit what you actually >> changed. It would have been better to import the original file first, rename >> it to .pyx, and then commit your changes. > > I created a directory named 'Diff files' where I put the files generated by > 'diff' command that i run in my computer, if you think it still be better if > I commit and then change, there is no problem for me... Diff only gives you the final outcome. Committing on top of the original files has the advantage of making the incremental changes visible separately. That makes it clearer what you tried, and a good commit comment will then make it clear why you did it. >> I think it's more important to get some performance >> numbers to see how your module behaves compared to the C accelerator module >> (_json.c). I think the best approach to this project would actually be to >> start with profiling the Python implementation to see where performance >> problems occur (or to look through _json.c to see what the CPython >> developers considered performance critical), and then put the focus on >> trying to speed up only those parts of the Python implementation, by adding >> static types and potentially even rewriting them in a way that Cython can >> optimise them better. > > I've profilled the module I created and the module that is in Python 3.2, > the result is that the cython module spent about 73% less time then python's That's a common mistake when profiling: the actual time it takes to run is not meaningful. Depending on how far the two profiled programs differ, they may interact with the profiler in more or less intensive ways (as is clearly the case here), so the total time it takes for the programs to run can differ quite heavily under profiling, even if the non-profiled programs run at exactly the same speed. Also, I don't think you have enabled profiling for the Cython code. 
You can do that by passing the "profile=True" directive to the compiler, or by putting it at the top of the source files. That will add module-inner function calls to the profiling output. Note, however, that enabling profiling will slow down the execution, so disable it when you measure absolute run times. http://docs.cython.org/src/tutorial/profiling_tutorial.html > (blue for cython, red for python): Colours tend to pass rather badly through mailing lists. Many people disable the HTML presentation of e-mails, and plain text does not have colours. But it was still obvious enough what you meant. > The behavior between my module and python's one seems to be the same I think > that's the way it should be. > > JSONModule nested_dict > 10004 function calls in 0.268 seconds > > Ordered by: standard name > > ncalls tottime percall cumtime percall filename:lineno(function) > 10000 0.196 0.000 0.196 0.000 :0(dumps) This is a pretty short list (I stripped the uninteresting parts). The profile right below shows a lot more entries in encoder.py. It would be good to see these calls in the Cython code as well. > json nested_dict > 60004 function calls in 1.016 seconds > > Ordered by: standard name > > ncalls tottime percall cumtime percall filename:lineno(function) > 1 0.000 0.000 1.016 1.016 :0(exec) > 20000 0.136 0.000 0.136 0.000 :0(isinstance) > 10000 0.120 0.000 0.120 0.000 :0(join) > 1 0.000 0.000 0.000 0.000 :0(setprofile) > 1 0.088 0.088 1.016 1.016:1() > 10000 0.136 0.000 0.928 0.000 __init__.py:180(dumps) > 10000 0.308 0.000 0.792 0.000 encoder.py:172(encode) > 10000 0.228 0.000 0.228 0.000 encoder.py:193(iterencode) > [...] > JSONModule ustring > 10004 function calls in 0.140 seconds > > Ordered by: standard name > > ncalls tottime percall cumtime percall filename:lineno(function) > 10000 0.072 0.000 0.072 0.000 :0(dumps) > [...] > > json ustring > 40004 function calls in 0.580 seconds > > Ordered by: standard name > > ncalls tottime percall cumtime percall filename:lineno(function) > 10000 0.092 0.000 0.092 0.000 :0(encode_basestring_ascii) > 1 0.004 0.004 0.580 0.580 :0(exec) > 10000 0.060 0.000 0.060 0.000 :0(isinstance) > 1 0.000 0.000 0.000 0.000 :0(setprofile) > 1 0.100 0.100 0.576 0.576:1() > 10000 0.152 0.000 0.476 0.000 __init__.py:180(dumps) > 10000 0.172 0.000 0.324 0.000 encoder.py:172(encode) > > The code is upated in repository, any comments that you might have, please, > let me know. Thank you very much for your feedback. Thank you for the numbers. Could you add absolute timings using timeit? And maybe also try with larger input data? ISTM that a lot of overhead comes from calls that Cython can easily optimise all by itself: isinstance() and (bytes|unicode).join(). That's the kind of observation that previously let me suggest to start by benchmarking and profiling in the first place. Cython compiled code has quite different performance characteristics from code executing in CPython's interpreter, so it's important to start by getting an idea of how the code behaves when compiled, and then optimising it in the places where it still needs to run faster. Optimisation is an incremental process, and you will often end up reverting changes along the way when you see that they did not improve the performance, or maybe just made it so slightly faster that the speed improvement is not worth the code degradation of the optimisation change in question. 
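To make the measurement setup concrete, a rough sketch (module name and test data are made up): put "# cython: profile=True" at the top of the .pyx while hunting for hot spots, as described in the linked tutorial, and switch it off again when taking absolute timings, e.g. with timeit:

    # absolute timing sketch; "myjson" stands for whichever compiled
    # module variant is under test
    import timeit

    setup = "import myjson; thing = {'ab': {'cd': [1, 2, 3]}}"
    print(timeit.timeit("myjson.dumps(thing)", setup=setup, number=10000))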
Could you try to come up with a short list of important code changes you made that let this module run faster, backed by some timings that show the difference with and without each change? Stefan From stefan_ml at behnel.de Sun Apr 17 23:05:45 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sun, 17 Apr 2011 23:05:45 +0200 Subject: [Cython] Recent bugs in generators In-Reply-To: References: Message-ID: <4DAB5629.2020500@behnel.de> Vitja Makarov, 17.04.2011 17:57: > 3. check_yield_in_exception() I added this because I found a failing pyregr test that uses it (testing the @contextmanager decorator). > Cython calls __Pyx_ExceptionReset when except block is done, so when > yield is there no exception reset is called. > > I'm not sure how to fix this. I'm not completely sure either. > import sys > > def foo(): > """ > >>> list(foo()) > [, None] > """ > try: > raise ValueError > except ValueError: > yield sys.exc_info()[0] > yield sys.exc_info()[0] # exc_info is lost here I think (!), the difference here is that CPython actually keeps the exception in the generator frame. We don't have a frame, so we have to emulate it using the closure class. I guess we'll have to store away the exception into the closure when we yield while an exception is being handled, and restore it afterwards. Note: this is not the exception that is freshly *being* raised (the "_cur*" fields in the thread state), it's the exception that *was* raised and is now being handled, i.e. the thread state fields without the "_cur", that are reflected by sys.exc_info(). Stefan From vitja.makarov at gmail.com Mon Apr 18 06:38:20 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Mon, 18 Apr 2011 08:38:20 +0400 Subject: [Cython] Recent bugs in generators In-Reply-To: <4DAB5629.2020500@behnel.de> References: <4DAB5629.2020500@behnel.de> Message-ID: 2011/4/18 Stefan Behnel : > Vitja Makarov, 17.04.2011 17:57: >> >> 3. check_yield_in_exception() > > I added this because I found a failing pyregr test that uses it (testing the > @contextmanager decorator). > > >> Cython calls __Pyx_ExceptionReset when except block is done, so when >> yield is there no exception reset is called. >> >> I'm not sure how to fix this. > > I'm not completely sure either. > > >> import sys >> >> def foo(): >> ? ? """ >> ? ? >>> ?list(foo()) >> ? ? [, None] >> ? ? """ >> ? ? try: >> ? ? ? ? raise ValueError >> ? ? except ValueError: >> ? ? ? ? yield sys.exc_info()[0] >> ? ? ? ? yield sys.exc_info()[0] # exc_info is lost here > > I think (!), the difference here is that CPython actually keeps the > exception in the generator frame. We don't have a frame, so we have to > emulate it using the closure class. I guess we'll have to store away the > exception into the closure when we yield while an exception is being > handled, and restore it afterwards. Note: this is not the exception that is > freshly *being* raised (the "_cur*" fields in the thread state), it's the > exception that *was* raised and is now being handled, i.e. the thread state > fields without the "_cur", that are reflected by sys.exc_info(). > Interesting difference between py2 and py3: def foo(): try: raise ValueError except ValueError: yield raise list(foo()) File "xxx.py", line 7, in list(foo()) File "xxx.py", line 6, in foo raise TypeError: exceptions must be old-style classes or derived from BaseException, not NoneType It seems that exception info is completely lost (tried 2.6, 2.7) and seems to be fixed in python3. 
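For reference, this is what the earlier sys.exc_info() example gives on a recent CPython 3, i.e. the behaviour the generated code has to emulate by saving the handled exception along with the suspended generator state (the expected output is noted in the comment; not re-checked against every 3.x release):

    import sys

    def gen():
        try:
            raise ValueError
        except ValueError:
            yield sys.exc_info()[0]
            yield sys.exc_info()[0]

    # Prints [<class 'ValueError'>, <class 'ValueError'>] on CPython 3:
    # the exception being handled stays with the generator frame, so the
    # second yield still sees it after the generator is resumed.
    print(list(gen()))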
Btw exception info temps are already saved and restored between yields. -- vitja. From stefan_ml at behnel.de Mon Apr 18 06:57:27 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 18 Apr 2011 06:57:27 +0200 Subject: [Cython] Recent bugs in generators In-Reply-To: References: <4DAB5629.2020500@behnel.de> Message-ID: <4DABC4B7.8030904@behnel.de> Vitja Makarov, 18.04.2011 06:38: > 2011/4/18 Stefan Behnel: >> Vitja Makarov, 17.04.2011 17:57: >>> >>> 3. check_yield_in_exception() >> >> I added this because I found a failing pyregr test that uses it (testing the >> @contextmanager decorator). >> >> >>> Cython calls __Pyx_ExceptionReset when except block is done, so when >>> yield is there no exception reset is called. >>> >>> I'm not sure how to fix this. >> >> I'm not completely sure either. >> >> >>> import sys >>> >>> def foo(): >>> """ >>> >>> list(foo()) >>> [, None] >>> """ >>> try: >>> raise ValueError >>> except ValueError: >>> yield sys.exc_info()[0] >>> yield sys.exc_info()[0] # exc_info is lost here >> >> I think (!), the difference here is that CPython actually keeps the >> exception in the generator frame. We don't have a frame, so we have to >> emulate it using the closure class. I guess we'll have to store away the >> exception into the closure when we yield while an exception is being >> handled, and restore it afterwards. Note: this is not the exception that is >> freshly *being* raised (the "_cur*" fields in the thread state), it's the >> exception that *was* raised and is now being handled, i.e. the thread state >> fields without the "_cur", that are reflected by sys.exc_info(). > > Interesting difference between py2 and py3: > > def foo(): > try: > raise ValueError > except ValueError: > yield > raise > list(foo()) > > File "xxx.py", line 7, in > list(foo()) > File "xxx.py", line 6, in foo > raise > TypeError: exceptions must be old-style classes or derived from > BaseException, not NoneType > > It seems that exception info is completely lost (tried 2.6, 2.7) and > seems to be fixed in python3. Not surprising. The implementation is completely different in Py2 and Py3, both in CPython and in Cython. It's actually much simpler in Cython under Py3, due to better semantics and C-API support. That also implies that there's much less Cython can do wrong in that environment. ;-) > Btw exception info temps are already saved and restored between yields. Right, but the exc_info itself is not reset and recovered around the yield. As I said above, generators have their own lifetime frame in CPython, and exceptions don't leak from that. So, whenever it's the generator (or code called by it) that raises an exception, that must be kept local to the generator. Stefan From stefan_ml at behnel.de Mon Apr 18 11:19:10 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 18 Apr 2011 11:19:10 +0200 Subject: [Cython] [GSoC] Python backend for Cython using PyPy's FFI In-Reply-To: <20110407150110.GA13395@ubuntu> References: <20110407150110.GA13395@ubuntu> Message-ID: <4DAC020E.2010707@behnel.de> Romain Guillebert, 07.04.2011 17:01: > I proposed the Summer of Code project regarding the Python backend for > Cython. > > As I said in my proposal this would translate Cython code to Python + > FFI code (I don't know yet if it will use ctypes or something specific > to PyPy). PyPy's ctypes is now really fast and this will allow people to > port their Cython code to PyPy. > > For the moment I've been mostly in touch with the PyPy people and they > seem happy with my proposal. 
> > Of course I'm available for questions. Hi Romain, it's usually required for GSoC students to provide a patch for the project they will be participating in. It appears that you have provided patches for PyPy that were well accepted, but since you'd be working mostly on Cython, it would be helpful for us to get an idea about how well you know the Cython source code by now. So, here's a little exam. If you were to implement support for the globals() builtin in Cython, what would you consider the necessary parts of the implementation? How would you approach this task? Feel free to ask for any information or pointers you need to solve this. Stefan From markflorisson88 at gmail.com Mon Apr 18 13:06:19 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 18 Apr 2011 13:06:19 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4DA9C6DF.1060406@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> Message-ID: On 16 April 2011 18:42, Dag Sverre Seljebotn wrote: > (Moving discussion from http://markflorisson.wordpress.com/, where Mark > said:) Ok, sure, it was just an issue I was wondering about at that moment, but it's a tricky issue, so thanks. > """ > Started a new branch https://github.com/markflorisson88/cython/tree/openmp . > > Now the question is whether sharing attributes should be propagated > outwards. e.g. if you do > > for i in prange(m): > ? ?for j in prange(n): > ? ? ? ?sum += i * j > > then ?sum? is a reduction for the inner parallel loop, but not for the outer > one. So the user would currently have to rewrite this to > > for i in prange(m): > ? ?for j in prange(n): > ? ? ? ?sum += i * j > ? ?sum += 0 > > which seems a bit silly ?. Of course, we could just disable nested > parallelism, or tell the users to use a prange and a ?for from? in such > cases. > """ > > Dag: Interesting. The first one is definitely the behaviour we want, as long > as it doesn't cause unintended consequences. > > I don't really think it will -- the important thing is that that the order > of loop iteration evaluation must be unimportant. And that is still true > (for the outer loop, as well as for the inner) in your first example. > > Question: When you have nested pranges, what will happen is that two nested > OpenMP parallel blocks are used, right? And do you know if there is complete > freedom/"reentrancy" in that variables that are thread-private in an outer > parallel block and be shared in an inner one, and vice versa? An implementation may or may not support it, and if it is supported the behaviour can be configured through omp_set_nested(). So we should consider the case where it is supported and enabled. If you have a lastprivate or reduction, and after the loop these are (reduced and) assigned to the original variable. So if that happens inside a parallel construct which does not declare the variable private to the construct, you actually have a race. So e.g. the nested prange currently races in the outer parallel range. > If so I'd think that this algorithm should work and feel natural: > > ?- In each prange, for the purposes of variable private/shared/reduction > inference, consider all internal "prange" just as if they had been "range"; > no special treatment. > > ?- Recurse to children pranges. Right, that is most natural. 
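Spelled out in code, the racy case looks roughly like this (a sketch only, assuming the prange syntax from the CEP and the sharing inference as currently implemented in the branch):

    from cython.parallel import prange

    def racy(int m, int n):
        cdef int i, j, sum = 0
        for i in prange(m, nogil=True):
            # under the current inference 'sum' is not a reduction for this
            # outer loop, only for the inner one below
            for j in prange(n):
                sum += i * j   # the inner reduction result is written back to
                               # the shared 'sum' by every outer thread -> race
        return sum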
Algorithmically, reductions and lastprivates (as those can have races if placed in inner parallel constructs) propagate outwards towards the outermost parallel block, or up to the first parallel with block, or up to the first construct that already determined the sharing attribute. e.g. with parallel: with parallel: for i in prange(n): for j in prange(n): sum += i * j # sum is well-defined here # sum is undefined here Here 'sum' is a reduction for the two innermost loops. 'sum' is not private for the inner parallel with block, as a prange in a parallel with block is a worksharing loop that binds to that parallel with block. However, the outermost parallel with block declares sum (and i and j) private, so after that block all those variables become undefined. However, in the outermost parallel with block, sum will have to be initialized to 0 before anything else, or be declared firstprivate, otherwise 'sum' is undefined to begin with. Do you think declaring it firstprivate would be the way to go, or should we make it private and issue a warning or perhaps even an error? > DS > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From vitja.makarov at gmail.com Mon Apr 18 15:19:18 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Mon, 18 Apr 2011 17:19:18 +0400 Subject: [Cython] Recent bugs in generators In-Reply-To: <4DABC4B7.8030904@behnel.de> References: <4DAB5629.2020500@behnel.de> <4DABC4B7.8030904@behnel.de> Message-ID: 2011/4/18 Stefan Behnel : > Vitja Makarov, 18.04.2011 06:38: >> >> 2011/4/18 Stefan Behnel: >>> >>> Vitja Makarov, 17.04.2011 17:57: >>>> >>>> 3. check_yield_in_exception() >>> >>> I added this because I found a failing pyregr test that uses it (testing >>> the >>> @contextmanager decorator). >>> >>> >>>> Cython calls __Pyx_ExceptionReset when except block is done, so when >>>> yield is there no exception reset is called. >>>> >>>> I'm not sure how to fix this. >>> >>> I'm not completely sure either. >>> >>> >>>> import sys >>>> >>>> def foo(): >>>> ? ? """ >>>> ? ? >>> ? ?list(foo()) >>>> ? ? [, None] >>>> ? ? """ >>>> ? ? try: >>>> ? ? ? ? raise ValueError >>>> ? ? except ValueError: >>>> ? ? ? ? yield sys.exc_info()[0] >>>> ? ? ? ? yield sys.exc_info()[0] # exc_info is lost here >>> >>> I think (!), the difference here is that CPython actually keeps the >>> exception in the generator frame. We don't have a frame, so we have to >>> emulate it using the closure class. I guess we'll have to store away the >>> exception into the closure when we yield while an exception is being >>> handled, and restore it afterwards. Note: this is not the exception that >>> is >>> freshly *being* raised (the "_cur*" fields in the thread state), it's the >>> exception that *was* raised and is now being handled, i.e. the thread >>> state >>> fields without the "_cur", that are reflected by sys.exc_info(). >> >> Interesting difference between py2 and py3: >> >> def foo(): >> ? ? try: >> ? ? ? ? raise ValueError >> ? ? except ValueError: >> ? ? ? ? yield >> ? ? ? ? raise >> list(foo()) >> >> ? File "xxx.py", line 7, in >> ? ? list(foo()) >> ? File "xxx.py", line 6, in foo >> ? ? raise >> TypeError: exceptions must be old-style classes or derived from >> BaseException, not NoneType >> >> It seems that exception info is completely lost (tried 2.6, 2.7) and >> seems to be fixed in python3. > > Not surprising. 
The implementation is completely different in Py2 and Py3, > both in CPython and in Cython. It's actually much simpler in Cython under > Py3, due to better semantics and C-API support. That also implies that > there's much less Cython can do wrong in that environment. ;-) > > >> Btw exception info temps are already saved and restored between yields. > > Right, but the exc_info itself is not reset and recovered around the yield. > As I said above, generators have their own lifetime frame in CPython, and > exceptions don't leak from that. So, whenever it's the generator (or code > called by it) that raises an exception, that must be kept local to the > generator. > There is one more interesting thing: def test_yield_inside_genexp(): """ >>> o = test_yield_inside_genexp() >>> list(o) [0, None, 1, None, 2, None] """ return ((yield i) for i in range(3)) -- vitja. From markflorisson88 at gmail.com Mon Apr 18 15:57:30 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 18 Apr 2011 15:57:30 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> Message-ID: On 18 April 2011 13:06, mark florisson wrote: > On 16 April 2011 18:42, Dag Sverre Seljebotn wrote: >> (Moving discussion from http://markflorisson.wordpress.com/, where Mark >> said:) > > Ok, sure, it was just an issue I was wondering about at that moment, > but it's a tricky issue, so thanks. > >> """ >> Started a new branch https://github.com/markflorisson88/cython/tree/openmp . >> >> Now the question is whether sharing attributes should be propagated >> outwards. e.g. if you do >> >> for i in prange(m): >> ? ?for j in prange(n): >> ? ? ? ?sum += i * j >> >> then ?sum? is a reduction for the inner parallel loop, but not for the outer >> one. So the user would currently have to rewrite this to >> >> for i in prange(m): >> ? ?for j in prange(n): >> ? ? ? ?sum += i * j >> ? ?sum += 0 >> >> which seems a bit silly ?. Of course, we could just disable nested >> parallelism, or tell the users to use a prange and a ?for from? in such >> cases. >> """ >> >> Dag: Interesting. The first one is definitely the behaviour we want, as long >> as it doesn't cause unintended consequences. >> >> I don't really think it will -- the important thing is that that the order >> of loop iteration evaluation must be unimportant. And that is still true >> (for the outer loop, as well as for the inner) in your first example. >> >> Question: When you have nested pranges, what will happen is that two nested >> OpenMP parallel blocks are used, right? And do you know if there is complete >> freedom/"reentrancy" in that variables that are thread-private in an outer >> parallel block and be shared in an inner one, and vice versa? > > An implementation may or may not support it, and if it is supported > the behaviour can be configured through omp_set_nested(). So we should > consider the case where it is supported and enabled. > > If you have a lastprivate or reduction, and after the loop these are > (reduced and) assigned to the original variable. So if that happens > inside a parallel construct which does not declare the variable > private to the construct, you actually have a race. So e.g. the nested > prange currently races in the outer parallel range. 
> >> If so I'd think that this algorithm should work and feel natural: >> >> ?- In each prange, for the purposes of variable private/shared/reduction >> inference, consider all internal "prange" just as if they had been "range"; >> no special treatment. >> >> ?- Recurse to children pranges. > > Right, that is most natural. Algorithmically, reductions and > lastprivates (as those can have races if placed in inner parallel > constructs) propagate outwards towards the outermost parallel block, > or up to the first parallel with block, or up to the first construct > that already determined the sharing attribute. > > e.g. > > with parallel: > ? ? with parallel: > ? ? ? ?for i in prange(n): > ? ? ? ? ? ?for j in prange(n): > ? ? ? ? ? ? ? ?sum += i * j > ? ? # sum is well-defined here > # sum is undefined here > > Here 'sum' is a reduction for the two innermost loops. 'sum' is not > private for the inner parallel with block, as a prange in a parallel > with block is a worksharing loop that binds to that parallel with > block. However, the outermost parallel with block declares sum (and i > and j) private, so after that block all those variables become > undefined. > > However, in the outermost parallel with block, sum will have to be > initialized to 0 before anything else, or be declared firstprivate, > otherwise 'sum' is undefined to begin with. Do you think declaring it > firstprivate would be the way to go, or should we make it private and > issue a warning or perhaps even an error? > >> DS >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > Everything seems to be working, although now the user has to be careful with nested parallel blocks as variables can be private there (and not firstprivate), i.e., the user has to do initialization at the right place (e.g. in the outermost parallel block that determines it private). I'm thinking of adding a warning, as the C compiler does. Two issues are remaining: 1) explicit declarations of firstprivates Do we still want those? 2) buffer auxiliary vars When unpacking numpy buffers and using typed numpy arrays, can reassignment or updates of a buffer-related variable ever occur in nogil code sections? I'm thinking this is not possible and therefore all buffer variables may be shared in parallel (for) sections? From d.s.seljebotn at astro.uio.no Mon Apr 18 16:01:51 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 18 Apr 2011 16:01:51 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> Message-ID: (apologies for top post) This all seems to scream 'disallow' to me, in particular since some openmp implementations may not support it etc. At any rate I feel 'parallel/parallel/prange/prange' is going to far; so next step could be to only allowing 'parallel/prange/parallel/prange'. But really, my feeling is that if you really do need this then you can always write a seperate function for the inner loop (I honestly can't think of a usecase anyway...). So I'd really drop it; at least until the rest of the gsoc project is completed :) DS -- Sent from my Android phone with K-9 Mail. Please excuse my brevity. 
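For reference, the "write a separate function for the inner loop" workaround suggested above could look roughly like this (a sketch only, assuming the prange interface from the CEP; names and signatures are illustrative):

    from cython.parallel import prange

    cdef long inner_sum(Py_ssize_t i, Py_ssize_t n) nogil:
        # the inner parallel loop lives in its own function, so 's' is an
        # ordinary reduction with no enclosing parallel block to race against
        cdef long s = 0
        cdef Py_ssize_t j
        for j in prange(n):
            s += i * j
        return s

    def outer_sum(Py_ssize_t m, Py_ssize_t n):
        cdef long total = 0
        cdef Py_ssize_t i
        for i in prange(m, nogil=True):
            total += inner_sum(i, n)   # 'total' is inferred as a reduction
        return total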
mark florisson wrote: On 16 April 2011 18:42, Dag Sverre Seljebotn wrote: > (Moving discussion from http://markflorisson.wordpress.com/, where Mark > said:) Ok, sure, it was just an issue I was wondering about at that moment, but it's a tricky issue, so thanks. > """ > Started a new branch https://github.com/markflorisson88/cython/tree/openmp . > > Now the question is whether sharing attributes should be propagated > outwards. e.g. if you do > > for i in prange(m): > for j in prange(n): > sum += i * j > > then ?sum? is a reduction for the inner parallel loop, but not for the outer > one. So the user would currently have to rewrite this to > > for i in prange(m): > for j in prange(n): > sum += i * j > sum += 0 > > which seems a bit silly . Of course, we could just disable nested > parallelism, or tell the users to use a prange and a ?for from? in such > cases. > """ > > Dag: Interesting. The first one is definitely the behaviour we want, as long > as it doesn't cause unintended consequences. > > I don't really think it will -- the important thing is that that the order > of loop iteration evaluation must be unimportant. And that is still true > (for the outer loop, as well as for the inner) in your first example. > > Question: When you have nested pranges, what will happen is that two nested > OpenMP parallel blocks are used, right? And do you know if there is complete > freedom/"reentrancy" in that variables that are thread-private in an outer > parallel block and be shared in an inner one, and vice versa? An implementation may or may not support it, and if it is supported the behaviour can be configured through omp_set_nested(). So we should consider the case where it is supported and enabled. If you have a lastprivate or reduction, and after the loop these are (reduced and) assigned to the original variable. So if that happens inside a parallel construct which does not declare the variable private to the construct, you actually have a race. So e.g. the nested prange currently races in the outer parallel range. > If so I'd think that this algorithm should work and feel natural: > > - In each prange, for the purposes of variable private/shared/reduction > inference, consider all internal "prange" just as if they had been "range"; > no special treatment. > > - Recurse to children pranges. Right, that is most natural. Algorithmically, reductions and lastprivates (as those can have races if placed in inner parallel constructs) propagate outwards towards the outermost parallel block, or up to the first parallel with block, or up to the first construct that already determined the sharing attribute. e.g. with parallel: with parallel: for i in prange(n): for j in prange(n): sum += i * j # sum is well-defined here # sum is undefined here Here 'sum' is a reduction for the two innermost loops. 'sum' is not private for the inner parallel with block, as a prange in a parallel with block is a worksharing loop that binds to that parallel with block. However, the outermost parallel with block declares sum (and i and j) private, so after that block all those variables become undefined. However, in the outermost parallel with block, sum will have to be initialized to 0 before anything else, or be declared firstprivate, otherwise 'sum' is undefined to begin with. Do you think declaring it firstprivate would be the way to go, or should we make it private and issue a warning or perhaps even an error? 
> DS >_____________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel >_____________________________________________ cython-devel mailing list cython-devel at python.org http://mail.python.org/mailman/listinfo/cython-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: From markflorisson88 at gmail.com Mon Apr 18 16:05:44 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 18 Apr 2011 16:05:44 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> Message-ID: On 18 April 2011 16:01, Dag Sverre Seljebotn wrote: > (apologies for top post) No problem, it means I have to scroll less :) > This all seems to scream 'disallow' to me, in particular since some openmp > implementations may not support it etc. > > At any rate I feel 'parallel/parallel/prange/prange' is going to far; so > next step could be to only allowing 'parallel/prange/parallel/prange'. > > But really, my feeling is that if you really do need this then you can > always write a seperate function for the inner loop (I honestly can't think > of a usecase anyway...). So I'd really drop it; at least until the rest of > the gsoc project is completed :) Ok, sure, I'll disallow it. Then the user won't be able to make mistakes and I don't have to detect the case and issue a warning for inner reductions or lastprivates :). > DS > -- > Sent from my Android phone with K-9 Mail. Please excuse my brevity. > > mark florisson wrote: >> >> On 16 April 2011 18:42, Dag Sverre Seljebotn >> wrote: > (Moving discussion from http://markflorisson.wordpress.com/, where >> Mark > said:) Ok, sure, it was just an issue I was wondering about at that >> moment, but it's a tricky issue, so thanks. > """ > Started a new branch >> https://github.com/markflorisson88/cython/tree/openmp . > > Now the question >> is whether sharing attributes should be propagated > outwards. e.g. if you >> do > > for i in prange(m): > ? ?for j in prange(n): > ? ? ? ?sum += i * j > >> > then ?sum? is a reduction for the inner parallel loop, but not for the >> outer > one. So the user would currently have to rewrite this to > > for i >> in prange(m): > ? ?for j in prange(n): > ? ? ? ?sum += i * j > ? ?sum += 0 > >> > which seems a bit silly ?. Of course, we could just disable nested > >> parallelism, or tell the users to use a prange and a ?for from? in such > >> cases. > """ > > Dag: Interesting. The first one is definitely the behaviour >> we want, as long > as it doesn't cause unintended consequences. > > I don't >> really think it will -- the important thing is that that the order > of loop >> iteration evaluation must be unimportant. And that is still true > (for the >> outer loop, as well as for the inner) in your first example. > > Question: >> When you have nested pranges, what will happen is that two nested > OpenMP >> parallel blocks are used, right? And do you know if there is complete > >> freedom/"reentrancy" in that variables that are thread-private in an outer > >> parallel block and be shared in an inner one, and vice versa? An >> implementation may or may not support it, and if it is supported the >> behaviour can be configured through omp_set_nested(). So we should consider >> the case where it is supported and enabled. 
If you have a lastprivate or >> reduction, and after the loop these are (reduced and) assigned to the >> original variable. So if that happens inside a parallel construct which does >> not declare the variable private to the construct, you actually have a race. >> So e.g. the nested prange currently races in the outer parallel range. > If >> so I'd think that this algorithm should work and feel natural: > > ?- In >> each prange, for the purposes of variable private/shared/reduction > >> inference, consider all internal "prange" just as if they had been "range"; >> > no special treatment. > > ?- Recurse to children pranges. Right, that is >> most natural. Algorithmically, reductions and lastprivates (as those can >> have races if placed in inner parallel constructs) propagate outwards >> towards the outermost parallel block, or up to the first parallel with >> block, or up to the first construct that already determined the sharing >> attribute. e.g. with parallel: with parallel: for i in prange(n): for j in >> prange(n): sum += i * j # sum is well-defined here # sum is undefined here >> Here 'sum' is a reduction for the two innermost loops. 'sum' is not private >> for the inner parallel with block, as a prange in a parallel with block is a >> worksharing loop that binds to that parallel with block. However, the >> outermost parallel with block declares sum (and i and j) private, so after >> that block all those variables become undefined. However, in the outermost >> parallel with block, sum will have to be initialized to 0 before anything >> else, or be declared firstprivate, otherwise 'sum' is undefined to begin >> with. Do you think declaring it firstprivate would be the way to go, or >> should we make it private and issue a warning or perhaps even an error? > DS >> > >> ________________________________ >> > cython-devel mailing list > cython-devel at python.org > >> > http://mail.python.org/mailman/listinfo/cython-devel > >> ________________________________ >> cython-devel mailing list cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > > From d.s.seljebotn at astro.uio.no Mon Apr 18 16:41:47 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Mon, 18 Apr 2011 16:41:47 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> Message-ID: <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> Excellent! Sounds great! (as I won't have my laptop for some days I can't have a look yet but I will later) You're right about (the current) buffers and the gil. A testcase explicitly for them would be good. Firstprivate etc: i think it'd be nice myself, but it is probably better to take a break from it at this point so that we can think more about that and not do anything rash; perhaps open up a specific thread on them and ask for more general input. Perhaps you want to take a break or task-switch to something else (fused types?) until I can get around to review and merge what you have so far? You'll know best what works for you though. If you decide to implement explicit threadprivate variables because you've got the flow I certainly wom't object myself. 
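The buffer test case asked for above could be as small as this sketch (assuming the prange keyword arguments from the CEP; only raw buffer indexing happens inside the nogil loop, so the buffer auxiliary variables are never rebound by the parallel section):

    cimport cython
    cimport numpy as np
    from cython.parallel import prange

    @cython.boundscheck(False)
    @cython.wraparound(False)
    def scale(np.ndarray[double, ndim=1] a, double factor):
        cdef Py_ssize_t i
        for i in prange(a.shape[0], nogil=True):
            a[i] = a[i] * factor   # pure buffer access, no Python objects
        return a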
-- Sent from my Android phone with K-9 Mail. Please excuse my brevity. mark florisson wrote: On 18 April 2011 13:06, mark florisson wrote: > On 16 April 2011 18:42, Dag Sverre Seljebotn wrote: >> (Moving discussion from http://markflorisson.wordpress.com/, where Mark >> said:) > > Ok, sure, it was just an issue I was wondering about at that moment, > but it's a tricky issue, so thanks. > >> """ >> Started a new branch https://github.com/markflorisson88/cython/tree/openmp . >> >> Now the question is whether sharing attributes should be propagated >> outwards. e.g. if you do >> >> for i in prange(m): >> for j in prange(n): >> sum += i * j >> >> then ?sum? is a reduction for the inner parallel loop, but not for the outer >> one. So the user would currently have to rewrite this to >> >> for i in prange(m): >> for j in prange(n): >> sum += i * j >> sum += 0 >> >> which seems a bit silly . Of course, we could just disable nested >> parallelism, or tell the users to use a prange and a ?for from? in such >> cases. >> """ >> >> Dag: Interesting. The first one is definitely the behaviour we want, as long >> as it doesn't cause unintended consequences. >> >> I don't really think it will -- the important thing is that that the order >> of loop iteration evaluation must be unimportant. And that is still true >> (for the outer loop, as well as for the inner) in your first example. >> >> Question: When you have nested pranges, what will happen is that two nested >> OpenMP parallel blocks are used, right? And do you know if there is complete >> freedom/"reentrancy" in that variables that are thread-private in an outer >> parallel block and be shared in an inner one, and vice versa? > > An implementation may or may not support it, and if it is supported > the behaviour can be configured through omp_set_nested(). So we should > consider the case where it is supported and enabled. > > If you have a lastprivate or reduction, and after the loop these are > (reduced and) assigned to the original variable. So if that happens > inside a parallel construct which does not declare the variable > private to the construct, you actually have a race. So e.g. the nested > prange currently races in the outer parallel range. > >> If so I'd think that this algorithm should work and feel natural: >> >> - In each prange, for the purposes of variable private/shared/reduction >> inference, consider all internal "prange" just as if they had been "range"; >> no special treatment. >> >> - Recurse to children pranges. > > Right, that is most natural. Algorithmically, reductions and > lastprivates (as those can have races if placed in inner parallel > constructs) propagate outwards towards the outermost parallel block, > or up to the first parallel with block, or up to the first construct > that already determined the sharing attribute. > > e.g. > > with parallel: > with parallel: > for i in prange(n): > for j in prange(n): > sum += i * j > # sum is well-defined here > # sum is undefined here > > Here 'sum' is a reduction for the two innermost loops. 'sum' is not > private for the inner parallel with block, as a prange in a parallel > with block is a worksharing loop that binds to that parallel with > block. However, the outermost parallel with block declares sum (and i > and j) private, so after that block all those variables become > undefined. > > However, in the outermost parallel with block, sum will have to be > initialized to 0 before anything else, or be declared firstprivate, > otherwise 'sum' is undefined to begin with. 
Do you think declaring it > firstprivate would be the way to go, or should we make it private and > issue a warning or perhaps even an error? > >> DS >>_____________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > Everything seems to be working, although now the user has to be careful with nested parallel blocks as variables can be private there (and not firstprivate), i.e., the user has to do initialization at the right place (e.g. in the outermost parallel block that determines it private). I'm thinking of adding a warning, as the C compiler does. Two issues are remaining: 1) explicit declarations of firstprivates Do we still want those? 2) buffer auxiliary vars When unpacking numpy buffers and using typed numpy arrays, can reassignment or updates of a buffer-related variable ever occur in nogil code sections? I'm thinking this is not possible and therefore all buffer variables may be shared in parallel (for) sections?_____________________________________________ cython-devel mailing list cython-devel at python.org http://mail.python.org/mailman/listinfo/cython-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: From markflorisson88 at gmail.com Mon Apr 18 16:51:17 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 18 Apr 2011 16:51:17 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> Message-ID: On 18 April 2011 16:41, Dag Sverre Seljebotn wrote: > Excellent! Sounds great! (as I won't have my laptop for some days I can't > have a look yet but I will later) > > You're right about (the current) buffers and the gil. A testcase explicitly > for them would be good. > > Firstprivate etc: i think it'd be nice myself, but it is probably better to > take a break from it at this point so that we can think more about that and > not do anything rash; perhaps open up a specific thread on them and ask for > more general input. Perhaps you want to take a break or task-switch to > something else (fused types?) until I can get around to review and merge > what you have so far? You'll know best what works for you though. If you > decide to implement explicit threadprivate variables because you've got the > flow I certainly wom't object myself. > Ok, cool, I'll move on :) I already included a test with a prange and a numpy buffer with indexing. From romain.py at gmail.com Mon Apr 18 17:55:58 2011 From: romain.py at gmail.com (Romain Guillebert) Date: Mon, 18 Apr 2011 16:55:58 +0100 Subject: [Cython] [GSoC] Python backend for Cython using PyPy's FFI In-Reply-To: <4DAC020E.2010707@behnel.de> Message-ID: <20110418155558.GA8935@ubuntu> Hi I investigated the code produced by Cython, and I see 3 cases which should be handled : * A pure python variable which has a value assigned (including None) * A pure python variable which has no value assigned * A C variable (we can't test if they are set of not) The first and second one seem relatively easy to implement it can reuse __pyx_string_tab. 
The code will call __Pyx_GetName on each element of the array, if it returns NULL, don't put the variable in the dictionary, else put the name and it's value in the dictionary. For the last one, it will probably need the C name, C type and Python name of each C variable, add the python name in the dictionary and wrap the C value into a Python object. However this will only provide read access to the globals. To offer read/write access, making a proxy dictionary is probably the most straightforward implementation (it can be rather inefficient though since we need to do an if/else to compare the value we want to set with every C globals). Do you think I'm on the right way ? Thanks Romain From markflorisson88 at gmail.com Mon Apr 18 21:08:35 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 18 Apr 2011 21:08:35 +0200 Subject: [Cython] gilnanny Message-ID: Can I add a gilnanny to refnanny? I want to do a PyThreadState_Get() for every refnanny inc- and decref, so that it will issue a fatal error whenever reference counting is done without the gil, to make sure we never do any illegal things in nogil code blocks. While I'm at it, I also want to add a pystate.pxd to the cpython includes. From robertwb at math.washington.edu Mon Apr 18 22:26:08 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Mon, 18 Apr 2011 13:26:08 -0700 Subject: [Cython] gilnanny In-Reply-To: References: Message-ID: On Mon, Apr 18, 2011 at 12:08 PM, mark florisson wrote: > Can I add a gilnanny to refnanny? I want to do a PyThreadState_Get() > for every refnanny inc- and decref, so that it will issue a fatal > error whenever reference counting is done without the gil, to make > sure we never do any illegal things in nogil code blocks. Sounds like a good idea to me. > While I'm at it, I also want to add a pystate.pxd to the cpython includes. Sure, corresponding to pystate.h? Have you looked into how stable this API is (e.g. Py2 vs. Py3)? - Robert From sturla at molden.no Tue Apr 19 00:18:51 2011 From: sturla at molden.no (Sturla Molden) Date: Tue, 19 Apr 2011 00:18:51 +0200 Subject: [Cython] gilnanny In-Reply-To: References: Message-ID: <4DACB8CB.8030008@molden.no> Den 18.04.2011 22:26, skrev Robert Bradshaw: > On Mon, Apr 18, 2011 at 12:08 PM, mark florisson > wrote: >> Can I add a gilnanny to refnanny? I want to do a PyThreadState_Get() >> for every refnanny inc- and decref, so that it will issue a fatal >> error whenever reference counting is done without the gil, to make >> sure we never do any illegal things in nogil code blocks. > Sounds like a good idea to me. > Have you ever considered to allow a "with gil:" statement? It seems this could be implemented using the simplified GIL API, i.e. the same way ctypes synchronizes callbacks to Python. Usecases would e.g. be computational code that sometimes needs to touch Python objects. E.g. append something to a list, slice a NumPy array, unbox a buffer into local scope, etc. A "with gil" statement could allow us to grab the GIL back for that. Sturla From stefan_ml at behnel.de Tue Apr 19 07:08:13 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 19 Apr 2011 07:08:13 +0200 Subject: [Cython] gilnanny In-Reply-To: <4DACB8CB.8030008@molden.no> References: <4DACB8CB.8030008@molden.no> Message-ID: <4DAD18BD.6060306@behnel.de> Sturla Molden, 19.04.2011 00:18: > Den 18.04.2011 22:26, skrev Robert Bradshaw: >> On Mon, Apr 18, 2011 at 12:08 PM, mark florisson wrote: >>> Can I add a gilnanny to refnanny? 
I want to do a PyThreadState_Get() >>> for every refnanny inc- and decref, so that it will issue a fatal >>> error whenever reference counting is done without the gil, to make >>> sure we never do any illegal things in nogil code blocks. >> Sounds like a good idea to me. >> > > Have you ever considered to allow a "with gil:" statement? Yes, that's what this is all about. https://github.com/markflorisson88/cython/commits/master Stefan From stefan_ml at behnel.de Tue Apr 19 07:17:45 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 19 Apr 2011 07:17:45 +0200 Subject: [Cython] Recent bugs in generators In-Reply-To: References: <4DAB5629.2020500@behnel.de> <4DABC4B7.8030904@behnel.de> Message-ID: <4DAD1AF9.5030608@behnel.de> Vitja Makarov, 18.04.2011 15:19: > There is one more interesting thing: > > def test_yield_inside_genexp(): > """ > >>> o = test_yield_inside_genexp() > >>> list(o) > [0, None, 1, None, 2, None] > """ > return ((yield i) for i in range(3)) Impressively ugly. I guess we should start testing these things with other Python implementations than CPython, in order to see if these corner cases are really being accepted and knowingly being implemented, or if it's just something that no-one has really cared about so far and that's actually worth discussing. Stefan From vitja.makarov at gmail.com Tue Apr 19 07:49:19 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Tue, 19 Apr 2011 09:49:19 +0400 Subject: [Cython] Recent bugs in generators In-Reply-To: <4DAD1AF9.5030608@behnel.de> References: <4DAB5629.2020500@behnel.de> <4DABC4B7.8030904@behnel.de> <4DAD1AF9.5030608@behnel.de> Message-ID: 2011/4/19 Stefan Behnel : > Vitja Makarov, 18.04.2011 15:19: >> >> There is one more interesting thing: >> >> def test_yield_inside_genexp(): >> ? ? """ >> ? ? >>> ?o = test_yield_inside_genexp() >> ? ? >>> ?list(o) >> ? ? [0, None, 1, None, 2, None] >> ? ? """ >> ? ? return ((yield i) for i in range(3)) > > Impressively ugly. > > I guess we should start testing these things with other Python > implementations than CPython, in order to see if these corner cases are > really being accepted and knowingly being implemented, or if it's just > something that no-one has really cared about so far and that's actually > worth discussing. > The case above is yield inside genexp and it works now. Btw we've found one more funny thing this time with parser: 0lor(1) -- vitja. From stefan_ml at behnel.de Tue Apr 19 07:59:04 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 19 Apr 2011 07:59:04 +0200 Subject: [Cython] [GSoC] Python backend for Cython using PyPy's FFI In-Reply-To: <20110418155558.GA8935@ubuntu> References: <20110418155558.GA8935@ubuntu> Message-ID: <4DAD24A8.5060702@behnel.de> Romain Guillebert, 18.04.2011 17:55: > I investigated the code produced by Cython, and I see 3 cases which > should be handled : > > * A pure python variable which has a value assigned (including None) > * A pure python variable which has no value assigned > * A C variable (we can't test if they are set of not) > > The first and second one seem relatively easy to implement it can > reuse __pyx_string_tab. > The code will call __Pyx_GetName on each element of the array, if it > returns NULL, don't put the variable in the dictionary, else put the > name and it's value in the dictionary. > For the last one, it will probably need the C name, C type and Python > name of each C variable, add the python name in the dictionary and wrap > the C value into a Python object. 
> > However this will only provide read access to the globals. To offer > read/write access, making a proxy dictionary is probably the most > straightforward implementation (it can be rather inefficient though > since we need to do an if/else to compare the value we want to set with > every C globals). > > Do you think I'm on the right way ? Thanks, Romain. Certainly sounds "good enough" to me. Stefan From markflorisson88 at gmail.com Tue Apr 19 09:09:53 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 19 Apr 2011 09:09:53 +0200 Subject: [Cython] gilnanny In-Reply-To: References: Message-ID: On 18 April 2011 22:26, Robert Bradshaw wrote: > On Mon, Apr 18, 2011 at 12:08 PM, mark florisson > wrote: >> Can I add a gilnanny to refnanny? I want to do a PyThreadState_Get() >> for every refnanny inc- and decref, so that it will issue a fatal >> error whenever reference counting is done without the gil, to make >> sure we never do any illegal things in nogil code blocks. > > Sounds like a good idea to me. Ok, cool :) >> While I'm at it, I also want to add a pystate.pxd to the cpython includes. > > Sure, corresponding to pystate.h? Have you looked into how stable this > API is (e.g. Py2 vs. Py3)? I haven't looked at all the functions, but most of them are documented in both Python 2 and Python 3. I'll be vigilant. > - Robert > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Tue Apr 19 11:14:11 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 19 Apr 2011 11:14:11 +0200 Subject: [Cython] Hudson account Message-ID: Can I get a Hudson account so I can setup my branches there? I can't seem to login using my trac credentials. From stefan_ml at behnel.de Tue Apr 19 12:19:31 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 19 Apr 2011 12:19:31 +0200 Subject: [Cython] Hudson account In-Reply-To: References: Message-ID: <4DAD61B3.6090600@behnel.de> mark florisson, 19.04.2011 11:14: > Can I get a Hudson account so I can setup my branches there? I've created one for you. Please tell me when you have changed your password, so that I can give you write access. Please be careful not to break too much, and note that it's easier to work at the file system level if you want to do mass duplication/adaptation of jobs. If you tell me what you need, I can set it up for you. It may actually be best to start from Viteks configured jobs, copy them and adapt their github URL and branch name. Stefan From markflorisson88 at gmail.com Tue Apr 19 12:31:43 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 19 Apr 2011 12:31:43 +0200 Subject: [Cython] Hudson account In-Reply-To: <4DAD61B3.6090600@behnel.de> References: <4DAD61B3.6090600@behnel.de> Message-ID: On 19 April 2011 12:19, Stefan Behnel wrote: > mark florisson, 19.04.2011 11:14: >> >> Can I get a Hudson account so I can setup my branches there? > > I've created one for you. Please tell me when you have changed your > password, so that I can give you write access. Done. > Please be careful not to break too much, and note that it's easier to work > at the file system level if you want to do mass duplication/adaptation of > jobs. If you tell me what you need, I can set it up for you. All I want is to run the master branch (it has the 'with gil' statement') and the openmp branch. I also want the fusedtypes branch, which I'll create in a minute or two. 
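(The 'with gil' statement mentioned here is the feature Sturla asked about earlier in this digest. A minimal sketch of the intended usage, assuming the syntax from Mark's branch and using a made-up helper name:)

    cdef int report_progress(long i, long n) except -1 nogil:
        with gil:                                  # briefly re-acquire the GIL
            print("processed %d of %d" % (i, n))   # ...to touch Python objects
        return 0

Such a helper can be called from inside a prange loop or a "with nogil" block; only the body of the "with gil" block runs with the GIL held.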
> It may actually be best to start from Viteks configured jobs, copy them and > adapt their github URL and branch name. > > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Tue Apr 19 12:42:32 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 19 Apr 2011 12:42:32 +0200 Subject: [Cython] Hudson account In-Reply-To: References: <4DAD61B3.6090600@behnel.de> Message-ID: On 19 April 2011 12:31, mark florisson wrote: > On 19 April 2011 12:19, Stefan Behnel wrote: >> mark florisson, 19.04.2011 11:14: >>> >>> Can I get a Hudson account so I can setup my branches there? >> >> I've created one for you. Please tell me when you have changed your >> password, so that I can give you write access. > > Done. > >> Please be careful not to break too much, and note that it's easier to work >> at the file system level if you want to do mass duplication/adaptation of >> jobs. If you tell me what you need, I can set it up for you. > > All I want is to run the master branch (it has the 'with gil' > statement') and the openmp branch. I also want the fusedtypes branch, > which I'll create in a minute or two. Oh, as for the versions, I think at least 2.3, 2.7 and py3k would be a good idea. Does the hudson server have numpy installed? >> It may actually be best to start from Viteks configured jobs, copy them and >> adapt their github URL and branch name. >> >> Stefan >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > From stefan_ml at behnel.de Tue Apr 19 12:53:36 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 19 Apr 2011 12:53:36 +0200 Subject: [Cython] Hudson account In-Reply-To: References: <4DAD61B3.6090600@behnel.de> Message-ID: <4DAD69B0.1060400@behnel.de> mark florisson, 19.04.2011 12:31: > On 19 April 2011 12:19, Stefan Behnel wrote: >> mark florisson, 19.04.2011 11:14: >>> >>> Can I get a Hudson account so I can setup my branches there? >> >> I've created one for you. Please tell me when you have changed your >> password, so that I can give you write access. > > Done. Ok, you should have write rights now. >> Please be careful not to break too much, and note that it's easier to work >> at the file system level if you want to do mass duplication/adaptation of >> jobs. If you tell me what you need, I can set it up for you. > > All I want is to run the master branch (it has the 'with gil' > statement') and the openmp branch. I also want the fusedtypes branch, > which I'll create in a minute or two. That's three branches in total, plus the builds in three CPython versions (2.4, 2.7 and Py3k), plus 2xC/C++ testing each. So, at least 3+9+18=30 new jobs that will compete with the others. I would expect that the three branches would rarely be changed all at the same time (except when you update them from the main branch), but that may still result in quite some additional delays due to the longer build queue. Vitek currently has one set of build jobs for himself and can change the branches at need. Question to the others: should we continue to do this, or set up a full set of build jobs for each git branch we want to test? 
Stefan From markflorisson88 at gmail.com Tue Apr 19 13:09:00 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 19 Apr 2011 13:09:00 +0200 Subject: [Cython] Hudson account In-Reply-To: <4DAD69B0.1060400@behnel.de> References: <4DAD61B3.6090600@behnel.de> <4DAD69B0.1060400@behnel.de> Message-ID: On 19 April 2011 12:53, Stefan Behnel wrote: > mark florisson, 19.04.2011 12:31: >> >> On 19 April 2011 12:19, Stefan Behnel wrote: >>> >>> mark florisson, 19.04.2011 11:14: >>>> >>>> Can I get a Hudson account so I can setup my branches there? >>> >>> I've created one for you. Please tell me when you have changed your >>> password, so that I can give you write access. >> >> Done. > > Ok, you should have write rights now. > > >>> Please be careful not to break too much, and note that it's easier to >>> work >>> at the file system level if you want to do mass duplication/adaptation of >>> jobs. If you tell me what you need, I can set it up for you. >> >> All I want is to run the master branch (it has the 'with gil' >> statement') and the openmp branch. I also want the fusedtypes branch, >> which I'll create in a minute or two. > > That's three branches in total, plus the builds in three CPython versions > (2.4, 2.7 and Py3k), plus 2xC/C++ testing each. So, at least 3+9+18=30 new > jobs that will compete with the others. > > I would expect that the three branches would rarely be changed all at the > same time (except when you update them from the main branch), but that may > still result in quite some additional delays due to the longer build queue. Switching the branches would be fine with me, so 3 Python versions x 1 branch x 2xC/C++ = 6 jobs. Is it necessary to separate C from C++ testing? > Vitek currently has one set of build jobs for himself and can change the > branches at need. Question to the others: should we continue to do this, or > set up a full set of build jobs for each git branch we want to test? > > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From stefan_ml at behnel.de Tue Apr 19 13:36:53 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 19 Apr 2011 13:36:53 +0200 Subject: [Cython] Hudson account In-Reply-To: References: <4DAD61B3.6090600@behnel.de> <4DAD69B0.1060400@behnel.de> Message-ID: <4DAD73D5.5040202@behnel.de> mark florisson, 19.04.2011 13:09: > On 19 April 2011 12:53, Stefan Behnel wrote: >> mark florisson, 19.04.2011 12:31: >>> All I want is to run the master branch (it has the 'with gil' >>> statement') and the openmp branch. I also want the fusedtypes branch, >>> which I'll create in a minute or two. >> >> That's three branches in total, plus the builds in three CPython versions >> (2.4, 2.7 and Py3k), plus 2xC/C++ testing each. So, at least 3+9+18=30 new >> jobs that will compete with the others. >> >> I would expect that the three branches would rarely be changed all at the >> same time (except when you update them from the main branch), but that may >> still result in quite some additional delays due to the longer build queue. > > Switching the branches would be fine with me, so 3 Python versions x 1 > branch x 2xC/C++ = 6 jobs. 10 actually (1+3+6), but only the (6) test jobs are long running. The job tree first builds an sdist from github, then builds bdists from that in all CPython versions, then tests the result in the test jobs. > Is it necessary to separate C from C++ testing? 
Not necessary, it just gives quicker feedback, right after the first test job is finished. If you test both in one job, it just runs twice as long. The problem is not so much the sheer number of jobs (we have several hundred Hudson jobs up at work). It's the time it takes to run the ones that get triggered, and especially long running jobs that fill up the pipeline. If you can live with a longer response time (usually just a couple of minutes), I'd suggest merging the test jobs, so that only one of them fills up the pipeline per build, instead of two. Stefan From markflorisson88 at gmail.com Tue Apr 19 13:46:17 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 19 Apr 2011 13:46:17 +0200 Subject: [Cython] Hudson account In-Reply-To: <4DAD73D5.5040202@behnel.de> References: <4DAD61B3.6090600@behnel.de> <4DAD69B0.1060400@behnel.de> <4DAD73D5.5040202@behnel.de> Message-ID: On 19 April 2011 13:36, Stefan Behnel wrote: > mark florisson, 19.04.2011 13:09: >> >> On 19 April 2011 12:53, Stefan Behnel wrote: >>> >>> mark florisson, 19.04.2011 12:31: >>>> >>>> All I want is to run the master branch (it has the 'with gil' >>>> statement') and the openmp branch. I also want the fusedtypes branch, >>>> which I'll create in a minute or two. >>> >>> That's three branches in total, plus the builds in three CPython versions >>> (2.4, 2.7 and Py3k), plus 2xC/C++ testing each. So, at least 3+9+18=30 >>> new >>> jobs that will compete with the others. >>> >>> I would expect that the three branches would rarely be changed all at the >>> same time (except when you update them from the main branch), but that >>> may >>> still result in quite some additional delays due to the longer build >>> queue. >> >> Switching the branches would be fine with me, so 3 Python versions x 1 >> branch x 2xC/C++ = 6 jobs. > > 10 actually (1+3+6), but only the (6) test jobs are long running. > > The job tree first builds an sdist from github, then builds bdists from that > in all CPython versions, then tests the result in the test jobs. > I see, ok. >> Is it necessary to separate C from C++ testing? > > Not necessary, it just gives quicker feedback, right after the first test > job is finished. If you test both in one job, it just runs twice as long. > > The problem is not so much the sheer number of jobs (we have several hundred > Hudson jobs up at work). It's the time it takes to run the ones that get > triggered, and especially long running jobs that fill up the pipeline. > > If you can live with a longer response time (usually just a couple of > minutes), I'd suggest merging the test jobs, so that only one of them fills > up the pipeline per build, instead of two. Ok. So are you setting it up, or should I do it? > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From stefan_ml at behnel.de Tue Apr 19 18:27:27 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 19 Apr 2011 18:27:27 +0200 Subject: [Cython] Hudson account In-Reply-To: References: <4DAD61B3.6090600@behnel.de> <4DAD69B0.1060400@behnel.de> <4DAD73D5.5040202@behnel.de> Message-ID: <4DADB7EF.5090703@behnel.de> mark florisson, 19.04.2011 13:46: > So are you setting it up Done. You now have your first red Hudson jobs. 
;) Stefan From markflorisson88 at gmail.com Tue Apr 19 18:37:39 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 19 Apr 2011 18:37:39 +0200 Subject: [Cython] Hudson account In-Reply-To: <4DADB7EF.5090703@behnel.de> References: <4DAD61B3.6090600@behnel.de> <4DAD69B0.1060400@behnel.de> <4DAD73D5.5040202@behnel.de> <4DADB7EF.5090703@behnel.de> Message-ID: On 19 April 2011 18:27, Stefan Behnel wrote: > mark florisson, 19.04.2011 13:46: >> >> So are you setting it up > > Done. You now have your first red Hudson jobs. ;) Thanks a lot for setting it up! Red means good right? :) > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From sturla at molden.no Tue Apr 19 21:49:56 2011 From: sturla at molden.no (Sturla Molden) Date: Tue, 19 Apr 2011 21:49:56 +0200 Subject: [Cython] gilnanny In-Reply-To: <4DAD18BD.6060306@behnel.de> References: <4DACB8CB.8030008@molden.no> <4DAD18BD.6060306@behnel.de> Message-ID: <4DADE764.90905@molden.no> Den 19.04.2011 07:08, skrev Stefan Behnel: > > Yes, that's what this is all about. > > https://github.com/markflorisson88/cython/commits/master Great :) Sturla From bpederse at gmail.com Tue Apr 19 22:45:55 2011 From: bpederse at gmail.com (Brent Pedersen) Date: Tue, 19 Apr 2011 14:45:55 -0600 Subject: [Cython] libcpp.string operators Message-ID: hi, i have been using a stub for the c++ in a lot of my work. i decided to have a go at doing a proper string.pxd, pasted here: https://gist.github.com/929604 other than the operators, it's mostly implemented with tests. but when i try the operators with this test: def test_equal_operator(char *a, char *b): """ >>> test_equal_operator("asdf", "asdf") True """ cdef string s = string(a) cdef string t = string(b) cdef bint same = t == s return same and this declaration in the pxd: bint operator==(string&, string&) i get the error below. any ideas what i'm doing wrong? thanks, -brent === Got errors: === 152:23: Invalid types for '==' (string, string) ====================================================================== ERROR: runTest (__main__.CythonRunTestCase) compiling (cpp) and running cpp_stl_string ---------------------------------------------------------------------- Traceback (most recent call last): File "runtests.py", line 569, in run self.runCompileTest() File "runtests.py", line 400, in runCompileTest self.test_directory, self.expect_errors, self.annotate) File "runtests.py", line 546, in compile self.assertEquals(None, unexpected_error) AssertionError: None != u"152:23: Invalid types for '==' (string, string)" From bpederse at gmail.com Wed Apr 20 02:08:15 2011 From: bpederse at gmail.com (Brent Pedersen) Date: Tue, 19 Apr 2011 18:08:15 -0600 Subject: [Cython] libcpp.string operators In-Reply-To: References: Message-ID: On Tue, Apr 19, 2011 at 2:45 PM, Brent Pedersen wrote: > hi, i have been using a stub for the c++ in a lot of my work. > i decided to have a go at doing a proper string.pxd, pasted here: > > https://gist.github.com/929604 > > other than the operators, it's mostly implemented with tests. but when > i try the operators > with this test: > > def test_equal_operator(char *a, char *b): > ? ?""" > ? ?>>> test_equal_operator("asdf", "asdf") > ? ?True > ? ?""" > ? ?cdef string s = string(a) > ? ?cdef string t = string(b) > ? ?cdef bint same = t == s > ? ?return same > > and this declaration in the pxd: > > ? 
?bint operator==(string&, string&) it seems: bint operator==(string&) is the correct syntax. i had copied the stuff from vector.pxd > > i get the error below. any ideas what i'm doing wrong? > thanks, > -brent > > > > > > === Got errors: === > 152:23: Invalid types for '==' (string, string) > > > ====================================================================== > ERROR: runTest (__main__.CythonRunTestCase) > compiling (cpp) and running cpp_stl_string > ---------------------------------------------------------------------- > Traceback (most recent call last): > ?File "runtests.py", line 569, in run > ? ?self.runCompileTest() > ?File "runtests.py", line 400, in runCompileTest > ? ?self.test_directory, self.expect_errors, self.annotate) > ?File "runtests.py", line 546, in compile > ? ?self.assertEquals(None, unexpected_error) > AssertionError: None != u"152:23: Invalid types for '==' (string, string)" > From bpederse at gmail.com Wed Apr 20 02:22:14 2011 From: bpederse at gmail.com (Brent Pedersen) Date: Tue, 19 Apr 2011 18:22:14 -0600 Subject: [Cython] libcpp.string operators In-Reply-To: References: Message-ID: On Tue, Apr 19, 2011 at 6:08 PM, Brent Pedersen wrote: > On Tue, Apr 19, 2011 at 2:45 PM, Brent Pedersen wrote: >> hi, i have been using a stub for the c++ in a lot of my work. >> i decided to have a go at doing a proper string.pxd, pasted here: >> >> https://gist.github.com/929604 >> >> other than the operators, it's mostly implemented with tests. but when >> i try the operators >> with this test: >> >> def test_equal_operator(char *a, char *b): >> ? ?""" >> ? ?>>> test_equal_operator("asdf", "asdf") >> ? ?True >> ? ?""" >> ? ?cdef string s = string(a) >> ? ?cdef string t = string(b) >> ? ?cdef bint same = t == s >> ? ?return same >> >> and this declaration in the pxd: >> >> ? ?bint operator==(string&, string&) > > > it seems: > > ? ? ? ?bint operator==(string&) > > is the correct syntax. i had copied the stuff from vector.pxd > >> >> i get the error below. any ideas what i'm doing wrong? >> thanks, >> -brent >> >> >> >> >> >> === Got errors: === >> 152:23: Invalid types for '==' (string, string) >> >> >> ====================================================================== >> ERROR: runTest (__main__.CythonRunTestCase) >> compiling (cpp) and running cpp_stl_string >> ---------------------------------------------------------------------- >> Traceback (most recent call last): >> ?File "runtests.py", line 569, in run >> ? ?self.runCompileTest() >> ?File "runtests.py", line 400, in runCompileTest >> ? ?self.test_directory, self.expect_errors, self.annotate) >> ?File "runtests.py", line 546, in compile >> ? ?self.assertEquals(None, unexpected_error) >> AssertionError: None != u"152:23: Invalid types for '==' (string, string)" >> > i updated the gist with the operators, didn't do iterators. https://gist.github.com/929604 From arthurdesribeiro at gmail.com Wed Apr 20 05:04:25 2011 From: arthurdesribeiro at gmail.com (Arthur de Souza Ribeiro) Date: Wed, 20 Apr 2011 00:04:25 -0300 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: <4DAB5410.4050404@behnel.de> References: <4DA3F455.8070006@behnel.de> <4DA4984D.3020007@behnel.de> <4DA5D624.50609@behnel.de> <4DA7E653.4060801@behnel.de> <4DAB5410.4050404@behnel.de> Message-ID: 2011/4/17 Stefan Behnel > Arthur de Souza Ribeiro, 17.04.2011 20:07: > > Hi Stefan, about your first comment : "And it's better to let Cython know >> that this name refers to a function." 
in line 69 of encoder.pyx file I >> didn't understand well what does that mean, can you explain more this >> comment? >> > > Hmm, sorry, I think that was not so important. That code line is only used > to override the Python implementation with the implementation from the > external C accelerator module. To do that, it assigns either of the two > functions to a name. So, when that name is called in the code, Cython cannot > know that it actually is a function, and has to resort to Python calling, > whereas a visible c(p)def function that is defined inside of the same module > could be called faster. > > I missed the fact that this name isn't really used inside of the module, so > whether Cython knows that it's a function or not isn't really all that > important. > So, I don't have to be worried about this, right? > > I added another comment to this commit, though: > > > https://github.com/arthursribeiro/JSON-module/commit/e2d80e0aeab6d39ff2d9b847843423ebdb9c57b7#diff-4 > > > I saw your comment and what I understood of it is that the alias that are being attributed to the type names make code slower, I tried to compile in cython the same way that it was in python, but, there is something wrong with it. It says: Error compiling Cython file: ------------------------------------------------------------ ... def _make_iterencode(dict markers, _default, _encoder, _indent, _floatstr, _key_separator, _item_separator, bint _sort_keys, bint _skipkeys, bint _one_shot, ## HACK: hand-optimized bytecode; turn globals into locals ValueError=ValueError, dict=dict, float=float, ^ ------------------------------------------------------------ encoder.pyx:273:13: Empty declarator I turned that way because I think the user can maybe change what types are going to be used and cython do not allow do these things like python. (for reserved words) > > About the other comments, I think I solved them all, any problem with them >> or other ones, please tell me. I'll try to fix. >> > > It looks like you fixed a good deal of them. > > I actually tried to work with your code, but I'm not sure how you are > building it. Could you give me a hint on that? > I'm manually building them using setup.py files, for every module I create one and build manually, I don't think that's the best way to do it, but, to test things, that's the way I'm doing. > > Where did you actually take the original code from? Python 3.2? Or from > Python's hg branch? > > I took the original code from Python 3.2 > > > Note that it's not obvious from your initial commit what you actually >>> changed. It would have been better to import the original file first, >>> rename >>> it to .pyx, and then commit your changes. >>> >> >> I created a directory named 'Diff files' where I put the files generated >> by >> 'diff' command that i run in my computer, if you think it still be better >> if >> I commit and then change, there is no problem for me... >> > > Diff only gives you the final outcome. Committing on top of the original > files has the advantage of making the incremental changes visible > separately. That makes it clearer what you tried, and a good commit comment > will then make it clear why you did it. > > > > I think it's more important to get some performance >>> numbers to see how your module behaves compared to the C accelerator >>> module >>> (_json.c). 
I think the best approach to this project would actually be to >>> start with profiling the Python implementation to see where performance >>> problems occur (or to look through _json.c to see what the CPython >>> developers considered performance critical), and then put the focus on >>> trying to speed up only those parts of the Python implementation, by >>> adding >>> static types and potentially even rewriting them in a way that Cython can >>> optimise them better. >>> >> >> I've profilled the module I created and the module that is in Python 3.2, >> the result is that the cython module spent about 73% less time then >> python's >> > > That's a common mistake when profiling: the actual time it takes to run is > not meaningful. Depending on how far the two profiled programs differ, they > may interact with the profiler in more or less intensive ways (as is clearly > the case here), so the total time it takes for the programs to run can > differ quite heavily under profiling, even if the non-profiled programs run > at exactly the same speed. > > Also, I don't think you have enabled profiling for the Cython code. You can > do that by passing the "profile=True" directive to the compiler, or by > putting it at the top of the source files. That will add module-inner > function calls to the profiling output. Note, however, that enabling > profiling will slow down the execution, so disable it when you measure > absolute run times. > > http://docs.cython.org/src/tutorial/profiling_tutorial.html > > As you said, I didn't enable profiling for the Cython code; I did it now and got a bigger number of function calls compared to the old ones. I added a new test case for a list object and profiled the code; as you said, they differ exactly by the number of calls to the isinstance function. The result stayed like this:

------------------------------------ USING Profiler ------------------------------------

JSONModule nested_dict
Tue Apr 19 23:16:48 2011 Profile.prof
200003 function calls in 0.964 seconds
Random listing order was used
ncalls tottime percall cumtime percall filename:lineno(function)
1 0.000 0.000 0.964 0.964 {built-in method exec}
50000 0.114 0.000 0.804 0.000 __init__.pyx:179(dumps)
50000 0.217 0.000 0.690 0.000 encoder.pyx:193(encode)
50000 0.473 0.000 0.473 0.000 encoder.pyx:214(iterencode)
50000 0.089 0.000 0.893 0.000 {JSONModule.dumps}
1 0.071 0.071 0.964 0.964 <string>:1(<module>)
1 0.000 0.000 0.000 0.000 {method 'disable' of '_lsprof.Profiler' objects}

json nested_dict
Tue Apr 19 23:16:49 2011 Profile.prof
300003 function calls in 1.350 seconds
Random listing order was used
ncalls tottime percall cumtime percall filename:lineno(function)
1 0.000 0.000 1.350 1.350 {built-in method exec}
50000 0.157 0.000 1.255 0.000 __init__.py:180(dumps)
50000 0.115 0.000 0.115 0.000 {method 'join' of 'str' objects}
50000 0.558 0.000 0.558 0.000 encoder.py:193(iterencode)
50000 0.317 0.000 1.099 0.000 encoder.py:172(encode)
1 0.094 0.094 1.350 1.350 <string>:1(<module>)
100000 0.108 0.000 0.108 0.000 {built-in method isinstance}
1 0.000 0.000 0.000 0.000 {method 'disable' of '_lsprof.Profiler' objects}

JSONModule ustring
Tue Apr 19 23:16:49 2011 Profile.prof
150003 function calls in 0.297 seconds
Random listing order was used
ncalls tottime percall cumtime percall filename:lineno(function)
1 0.000 0.000 0.297 0.297 {built-in method exec}
50000 0.099 0.000 0.160 0.000 __init__.pyx:179(dumps)
50000 0.061 0.000 0.061 0.000 encoder.pyx:193(encode)
50000 0.082 0.000 0.242 0.000 {JSONModule.dumps}
1 0.055 0.055 0.297 0.297 <string>:1(<module>)
1 0.000 0.000 0.000 0.000 {method 'disable' of '_lsprof.Profiler' objects}

json ustring
Tue Apr 19 23:16:50 2011 Profile.prof
200003 function calls in 0.419 seconds
Random listing order was used
ncalls tottime percall cumtime percall filename:lineno(function)
1 0.000 0.000 0.419 0.419 {built-in method exec}
50000 0.118 0.000 0.346 0.000 __init__.py:180(dumps)
50000 0.052 0.000 0.052 0.000 {built-in method encode_basestring_ascii}
50000 0.138 0.000 0.228 0.000 encoder.py:172(encode)
1 0.072 0.072 0.419 0.419 <string>:1(<module>)
50000 0.038 0.000 0.038 0.000 {built-in method isinstance}
1 0.000 0.000 0.000 0.000 {method 'disable' of '_lsprof.Profiler' objects}

JSONModule xlist
Tue Apr 19 23:16:50 2011 Profile.prof
200003 function calls in 0.651 seconds
Random listing order was used
ncalls tottime percall cumtime percall filename:lineno(function)
1 0.000 0.000 0.651 0.651 {built-in method exec}
50000 0.108 0.000 0.500 0.000 __init__.pyx:179(dumps)
50000 0.154 0.000 0.392 0.000 encoder.pyx:193(encode)
50000 0.238 0.000 0.238 0.000 encoder.pyx:214(iterencode)
50000 0.086 0.000 0.585 0.000 {JSONModule.dumps}
1 0.065 0.065 0.651 0.651 <string>:1(<module>)
1 0.000 0.000 0.000 0.000 {method 'disable' of '_lsprof.Profiler' objects}

json xlist
Tue Apr 19 23:16:51 2011 Profile.prof
300003 function calls in 1.029 seconds
Random listing order was used
ncalls tottime percall cumtime percall filename:lineno(function)
1 0.000 0.000 1.029 1.029 {built-in method exec}
50000 0.145 0.000 0.940 0.000 __init__.py:180(dumps)
50000 0.062 0.000 0.062 0.000 {method 'join' of 'str' objects}
50000 0.323 0.000 0.323 0.000 encoder.py:193(iterencode)
50000 0.304 0.000 0.795 0.000 encoder.py:172(encode)
1 0.089 0.089 1.029 1.029 <string>:1(<module>)
100000 0.106 0.000 0.106 0.000 {built-in method isinstance}
1 0.000 0.000 0.000 0.000 {method 'disable' of '_lsprof.Profiler' objects}
----------------------------------------------------------------------------------------

> > > (blue for cython, red for python): >> > > Colours tend to pass rather badly through mailing lists. Many people > disable the HTML presentation of e-mails, and plain text does not have > colours. But it was still obvious enough what you meant. > > Sorry about this. > > > Thank you for the numbers. Could you add absolute timings using timeit? And > maybe also try with larger input data? > Using timeit, I got the following output:

------------------------------------ USING Timeit --------------------------------------
JSONModule nested_dict spent 11.39 usec/pass (cython)
JSONModule ustring spent 0.94 usec/pass (cython)
JSONModule xlist spent 5.71 usec/pass (cython)
json nested_dict spent 16.61 usec/pass
json ustring spent 1.88 usec/pass
json xlist spent 10.38 usec/pass
----------------------------------------------------------------------------------------
The test cases are the same as the profiling ones.

> > ISTM that a lot of overhead comes from calls that Cython can easily > optimise all by itself: isinstance() and (bytes|unicode).join(). That's the > kind of observation that previously let me suggest to start by benchmarking > and profiling in the first place. Cython compiled code has quite different > performance characteristics from code executing in CPython's interpreter, so > it's important to start by getting an idea of how the code behaves when > compiled, and then optimising it in the places where it still needs to run > faster. > > As you said, starting with profiling is a better approach, especially because every change made is reflected in the timings (whether measured with timeit or with the profiler).
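(For readers who want to reproduce per-call figures in this form: numbers like the "usec/pass" values above come out of the standard timeit pattern sketched below. The test object is a stand-in rather than Arthur's actual benchmark data, and "JSONModule" is simply the name used in this thread for the compiled package.)

    import timeit

    setup = "import json; obj = {'spam': {'eggs': [1, 2, 3]}, 'text': 'nested_dict'}"
    n = 50000  # same number of calls as in the profiler runs above
    total = timeit.Timer("json.dumps(obj)", setup).timeit(number=n)
    print("json nested_dict spent %.2f usec/pass" % (1e6 * total / n))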
> Optimisation is an incremental process, and you will often end up reverting > changes along the way when you see that they did not improve the > performance, or maybe just made it so slightly faster that the speed > improvement is not worth the code degradation of the optimisation change in > question. > > Could you try to come up with a short list of important code changes you > made that let this module run faster, backed by some timings that show the > difference with and without each change? > To let this module run faster, the bests changes were in classes definitions (I'm going to show numbers soon) using cinit and defining the variables made the code faster. Another changes that I made were in for loops. I created an int variable and made to loop in range of a number instead of a 'for x in ...' statement. The other ones were about to add static types specially to int and boolean types. > > Stefan > Thank you very much again. Best Regards. []s Arthur -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg.ewing at canterbury.ac.nz Wed Apr 20 05:34:18 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 20 Apr 2011 15:34:18 +1200 Subject: [Cython] GSoC Proposal - Reimplement C modules in CPython's standard library in Cython. In-Reply-To: References: <4DA3F455.8070006@behnel.de> <4DA4984D.3020007@behnel.de> <4DA5D624.50609@behnel.de> <4DA7E653.4060801@behnel.de> <4DAB5410.4050404@behnel.de> Message-ID: <4DAE543A.7080509@canterbury.ac.nz> Arthur de Souza Ribeiro wrote: > def _make_iterencode(dict markers, _default, _encoder, _indent, _floatstr, > _key_separator, _item_separator, bint _sort_keys, bint > _skipkeys, bint _one_shot, > ## HACK: hand-optimized bytecode; turn globals into locals > ValueError=ValueError, > dict=dict, > float=float, > ^ > ------------------------------------------------------------ > > encoder.pyx:273:13: Empty declarator You may need to choose something other than 'float' for the local name to avoid confusing the parser (it thinks you're about to declare a parameter of type 'float'). -- Greg From vitja.makarov at gmail.com Wed Apr 20 10:26:46 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Wed, 20 Apr 2011 12:26:46 +0400 Subject: [Cython] Recent bugs in generators In-Reply-To: <4DABC4B7.8030904@behnel.de> References: <4DAB5629.2020500@behnel.de> <4DABC4B7.8030904@behnel.de> Message-ID: 2011/4/18 Stefan Behnel : > Vitja Makarov, 18.04.2011 06:38: >> >> 2011/4/18 Stefan Behnel: >>> >>> Vitja Makarov, 17.04.2011 17:57: >>>> >>>> 3. check_yield_in_exception() >>> >>> I added this because I found a failing pyregr test that uses it (testing >>> the >>> @contextmanager decorator). >>> >>> >>>> Cython calls __Pyx_ExceptionReset when except block is done, so when >>>> yield is there no exception reset is called. >>>> >>>> I'm not sure how to fix this. >>> >>> I'm not completely sure either. >>> >>> >>>> import sys >>>> >>>> def foo(): >>>> ? ? """ >>>> ? ? >>> ? ?list(foo()) >>>> ? ? [, None] >>>> ? ? """ >>>> ? ? try: >>>> ? ? ? ? raise ValueError >>>> ? ? except ValueError: >>>> ? ? ? ? yield sys.exc_info()[0] >>>> ? ? ? ? yield sys.exc_info()[0] # exc_info is lost here >>> >>> I think (!), the difference here is that CPython actually keeps the >>> exception in the generator frame. We don't have a frame, so we have to >>> emulate it using the closure class. I guess we'll have to store away the >>> exception into the closure when we yield while an exception is being >>> handled, and restore it afterwards. 
Note: this is not the exception that >>> is >>> freshly *being* raised (the "_cur*" fields in the thread state), it's the >>> exception that *was* raised and is now being handled, i.e. the thread >>> state >>> fields without the "_cur", that are reflected by sys.exc_info(). >> >> Interesting difference between py2 and py3: >> >> def foo(): >> ? ? try: >> ? ? ? ? raise ValueError >> ? ? except ValueError: >> ? ? ? ? yield >> ? ? ? ? raise >> list(foo()) >> >> ? File "xxx.py", line 7, in >> ? ? list(foo()) >> ? File "xxx.py", line 6, in foo >> ? ? raise >> TypeError: exceptions must be old-style classes or derived from >> BaseException, not NoneType >> >> It seems that exception info is completely lost (tried 2.6, 2.7) and >> seems to be fixed in python3. > > Not surprising. The implementation is completely different in Py2 and Py3, > both in CPython and in Cython. It's actually much simpler in Cython under > Py3, due to better semantics and C-API support. That also implies that > there's much less Cython can do wrong in that environment. ;-) > > >> Btw exception info temps are already saved and restored between yields. > > Right, but the exc_info itself is not reset and recovered around the yield. > As I said above, generators have their own lifetime frame in CPython, and > exceptions don't leak from that. So, whenever it's the generator (or code > called by it) that raises an exception, that must be kept local to the > generator. > Please review: https://github.com/vitek/cython/commit/73014aaed10b82a3f632d7f86212f86280c55858 I've added __Pyx_Generator_SwapExceptions() method and call it right before resume switch and before return from yield. It swaps generator exception state with thread local. -- vitja. From stefan_ml at behnel.de Wed Apr 20 11:43:49 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 20 Apr 2011 11:43:49 +0200 Subject: [Cython] Recent bugs in generators In-Reply-To: References: <4DAB5629.2020500@behnel.de> <4DABC4B7.8030904@behnel.de> Message-ID: <4DAEAAD5.5080908@behnel.de> Vitja Makarov, 20.04.2011 10:26: > 2011/4/18 Stefan Behnel: >> Vitja Makarov, 18.04.2011 06:38: >>> >>> 2011/4/18 Stefan Behnel: >>>> >>>> Vitja Makarov, 17.04.2011 17:57: >>>>> >>>>> 3. check_yield_in_exception() >>>> >>>> I added this because I found a failing pyregr test that uses it (testing >>>> the >>>> @contextmanager decorator). >>>> >>>> >>>>> Cython calls __Pyx_ExceptionReset when except block is done, so when >>>>> yield is there no exception reset is called. >>>>> >>>>> I'm not sure how to fix this. >>>> >>>> I'm not completely sure either. >>>> >>>> >>>>> import sys >>>>> >>>>> def foo(): >>>>> """ >>>>> >>> list(foo()) >>>>> [, None] >>>>> """ >>>>> try: >>>>> raise ValueError >>>>> except ValueError: >>>>> yield sys.exc_info()[0] >>>>> yield sys.exc_info()[0] # exc_info is lost here >>>> >>>> I think (!), the difference here is that CPython actually keeps the >>>> exception in the generator frame. We don't have a frame, so we have to >>>> emulate it using the closure class. I guess we'll have to store away the >>>> exception into the closure when we yield while an exception is being >>>> handled, and restore it afterwards. Note: this is not the exception that >>>> is >>>> freshly *being* raised (the "_cur*" fields in the thread state), it's the >>>> exception that *was* raised and is now being handled, i.e. the thread >>>> state >>>> fields without the "_cur", that are reflected by sys.exc_info(). 
>>> >>> Interesting difference between py2 and py3: >>> >>> def foo(): >>> try: >>> raise ValueError >>> except ValueError: >>> yield >>> raise >>> list(foo()) >>> >>> File "xxx.py", line 7, in >>> list(foo()) >>> File "xxx.py", line 6, in foo >>> raise >>> TypeError: exceptions must be old-style classes or derived from >>> BaseException, not NoneType >>> >>> It seems that exception info is completely lost (tried 2.6, 2.7) and >>> seems to be fixed in python3. >> >> Not surprising. The implementation is completely different in Py2 and Py3, >> both in CPython and in Cython. It's actually much simpler in Cython under >> Py3, due to better semantics and C-API support. That also implies that >> there's much less Cython can do wrong in that environment. ;-) >> >> >>> Btw exception info temps are already saved and restored between yields. >> >> Right, but the exc_info itself is not reset and recovered around the yield. >> As I said above, generators have their own lifetime frame in CPython, and >> exceptions don't leak from that. So, whenever it's the generator (or code >> called by it) that raises an exception, that must be kept local to the >> generator. > > Please review: > https://github.com/vitek/cython/commit/73014aaed10b82a3f632d7f86212f86280c55858 > > I've added __Pyx_Generator_SwapExceptions() method and call it right > before resume switch and before return from yield. It swaps generator > exception state with thread local. Looks good to me. I assume this fixes the problem? Then please push it into mainline. Stefan From vitja.makarov at gmail.com Wed Apr 20 11:50:53 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Wed, 20 Apr 2011 13:50:53 +0400 Subject: [Cython] Recent bugs in generators In-Reply-To: <4DAEAAD5.5080908@behnel.de> References: <4DAB5629.2020500@behnel.de> <4DABC4B7.8030904@behnel.de> <4DAEAAD5.5080908@behnel.de> Message-ID: 2011/4/20 Stefan Behnel : > Vitja Makarov, 20.04.2011 10:26: >> >> 2011/4/18 Stefan Behnel: >>> >>> Vitja Makarov, 18.04.2011 06:38: >>>> >>>> 2011/4/18 Stefan Behnel: >>>>> >>>>> Vitja Makarov, 17.04.2011 17:57: >>>>>> >>>>>> 3. check_yield_in_exception() >>>>> >>>>> I added this because I found a failing pyregr test that uses it >>>>> (testing >>>>> the >>>>> @contextmanager decorator). >>>>> >>>>> >>>>>> Cython calls __Pyx_ExceptionReset when except block is done, so when >>>>>> yield is there no exception reset is called. >>>>>> >>>>>> I'm not sure how to fix this. >>>>> >>>>> I'm not completely sure either. >>>>> >>>>> >>>>>> import sys >>>>>> >>>>>> def foo(): >>>>>> ? ? """ >>>>>> ? ? >>> ? ? ?list(foo()) >>>>>> ? ? [, None] >>>>>> ? ? """ >>>>>> ? ? try: >>>>>> ? ? ? ? raise ValueError >>>>>> ? ? except ValueError: >>>>>> ? ? ? ? yield sys.exc_info()[0] >>>>>> ? ? ? ? yield sys.exc_info()[0] # exc_info is lost here >>>>> >>>>> I think (!), the difference here is that CPython actually keeps the >>>>> exception in the generator frame. We don't have a frame, so we have to >>>>> emulate it using the closure class. I guess we'll have to store away >>>>> the >>>>> exception into the closure when we yield while an exception is being >>>>> handled, and restore it afterwards. Note: this is not the exception >>>>> that >>>>> is >>>>> freshly *being* raised (the "_cur*" fields in the thread state), it's >>>>> the >>>>> exception that *was* raised and is now being handled, i.e. the thread >>>>> state >>>>> fields without the "_cur", that are reflected by sys.exc_info(). 
>>>> >>>> Interesting difference between py2 and py3: >>>> >>>> def foo(): >>>> ? ? try: >>>> ? ? ? ? raise ValueError >>>> ? ? except ValueError: >>>> ? ? ? ? yield >>>> ? ? ? ? raise >>>> list(foo()) >>>> >>>> ? File "xxx.py", line 7, in >>>> ? ? list(foo()) >>>> ? File "xxx.py", line 6, in foo >>>> ? ? raise >>>> TypeError: exceptions must be old-style classes or derived from >>>> BaseException, not NoneType >>>> >>>> It seems that exception info is completely lost (tried 2.6, 2.7) and >>>> seems to be fixed in python3. >>> >>> Not surprising. The implementation is completely different in Py2 and >>> Py3, >>> both in CPython and in Cython. It's actually much simpler in Cython under >>> Py3, due to better semantics and C-API support. That also implies that >>> there's much less Cython can do wrong in that environment. ;-) >>> >>> >>>> Btw exception info temps are already saved and restored between yields. >>> >>> Right, but the exc_info itself is not reset and recovered around the >>> yield. >>> As I said above, generators have their own lifetime frame in CPython, and >>> exceptions don't leak from that. So, whenever it's the generator (or code >>> called by it) that raises an exception, that must be kept local to the >>> generator. >> >> Please review: >> >> https://github.com/vitek/cython/commit/73014aaed10b82a3f632d7f86212f86280c55858 >> >> I've added __Pyx_Generator_SwapExceptions() method and call it right >> before resume switch and before return from yield. It swaps generator >> exception state with thread local. > > Looks good to me. I assume this fixes the problem? Then please push it into > mainline. > old pull request is still there ;) https://github.com/cython/cython/pull/25 Does __Pyx_ExceptionReset() steal references to args, so they should not be decrefed later? -- vitja. From stefan_ml at behnel.de Wed Apr 20 12:16:31 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 20 Apr 2011 12:16:31 +0200 Subject: [Cython] Recent bugs in generators In-Reply-To: References: <4DAB5629.2020500@behnel.de> <4DABC4B7.8030904@behnel.de> <4DAEAAD5.5080908@behnel.de> Message-ID: <4DAEB27F.1050404@behnel.de> Vitja Makarov, 20.04.2011 11:50: > 2011/4/20 Stefan Behnel: >> Vitja Makarov, 20.04.2011 10:26: >>> 2011/4/18 Stefan Behnel: >>>> generators have their own lifetime frame in CPython, and >>>> exceptions don't leak from that. So, whenever it's the generator (or code >>>> called by it) that raises an exception, that must be kept local to the >>>> generator. >>> >>> Please review: >>> >>> https://github.com/vitek/cython/commit/73014aaed10b82a3f632d7f86212f86280c55858 >>> >>> I've added __Pyx_Generator_SwapExceptions() method and call it right >>> before resume switch and before return from yield. It swaps generator >>> exception state with thread local. >> >> Looks good to me. I assume this fixes the problem? Then please push it into >> mainline. > > old pull request is still there ;) > > https://github.com/cython/cython/pull/25 > > Does __Pyx_ExceptionReset() steal references to args, so they should > not be decrefed later? Hmm, good call. The refcounting looks correct over the two function calls, but it would be nicer if it could be avoided instead. We are really just swapping around pointers here, no refcounting is needed. I think it's worth using a dedicated utility function for this. Oh, and one more thing: what should be done when exiting the generator normally? Would that just raise GeneratorExit anyway, so that we can swallow any original outer exception? 
I think there might be a problem with Python 3, where the GeneratorExit should be raised in the context of the original exception. Worth testing how CPython behaves here... Stefan From vitja.makarov at gmail.com Wed Apr 20 12:51:35 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Wed, 20 Apr 2011 14:51:35 +0400 Subject: [Cython] Recent bugs in generators In-Reply-To: <4DAEB27F.1050404@behnel.de> References: <4DAB5629.2020500@behnel.de> <4DABC4B7.8030904@behnel.de> <4DAEAAD5.5080908@behnel.de> <4DAEB27F.1050404@behnel.de> Message-ID: 2011/4/20 Stefan Behnel : > Vitja Makarov, 20.04.2011 11:50: >> >> 2011/4/20 Stefan Behnel: >>> >>> Vitja Makarov, 20.04.2011 10:26: >>>> >>>> 2011/4/18 Stefan Behnel: >>>>> >>>>> generators have their own lifetime frame in CPython, and >>>>> exceptions don't leak from that. So, whenever it's the generator (or >>>>> code >>>>> called by it) that raises an exception, that must be kept local to the >>>>> generator. >>>> >>>> Please review: >>>> >>>> >>>> https://github.com/vitek/cython/commit/73014aaed10b82a3f632d7f86212f86280c55858 >>>> >>>> I've added __Pyx_Generator_SwapExceptions() method and call it right >>>> before resume switch and before return from yield. It swaps generator >>>> exception state with thread local. >>> >>> Looks good to me. I assume this fixes the problem? Then please push it >>> into >>> mainline. >> >> old pull request is still there ;) >> >> https://github.com/cython/cython/pull/25 >> >> Does __Pyx_ExceptionReset() steal references to args, so they should >> not be decrefed later? > > Hmm, good call. The refcounting looks correct over the two function calls, > but it would be nicer if it could be avoided instead. We are really just > swapping around pointers here, no refcounting is needed. > > I think it's worth using a dedicated utility function for this. > Do you mean new utility call __Pyx_ExceptionSwap()? > Oh, and one more thing: what should be done when exiting the generator > normally? Would that just raise GeneratorExit anyway, so that we can swallow > any original outer exception? I think there might be a problem with Python > 3, where the GeneratorExit should be raised in the context of the original > exception. Worth testing how CPython behaves here... > Right! It's better to wrap call to generator: ExceptionSwap(); generator_body(); ExceptionSwap(); Ok, I'll take a look. -- vitja. From stefan_ml at behnel.de Wed Apr 20 13:45:02 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 20 Apr 2011 13:45:02 +0200 Subject: [Cython] Recent bugs in generators In-Reply-To: References: <4DAB5629.2020500@behnel.de> <4DABC4B7.8030904@behnel.de> <4DAEAAD5.5080908@behnel.de> <4DAEB27F.1050404@behnel.de> Message-ID: <4DAEC73E.2030906@behnel.de> Vitja Makarov, 20.04.2011 12:51: > 2011/4/20 Stefan Behnel: >> Vitja Makarov, 20.04.2011 11:50: >>> >>> 2011/4/20 Stefan Behnel: >>>> >>>> Vitja Makarov, 20.04.2011 10:26: >>>>> >>>>> 2011/4/18 Stefan Behnel: >>>>>> >>>>>> generators have their own lifetime frame in CPython, and >>>>>> exceptions don't leak from that. So, whenever it's the generator (or >>>>>> code >>>>>> called by it) that raises an exception, that must be kept local to the >>>>>> generator. >>>>> >>>>> Please review: >>>>> >>>>> >>>>> https://github.com/vitek/cython/commit/73014aaed10b82a3f632d7f86212f86280c55858 >>>>> >>>>> I've added __Pyx_Generator_SwapExceptions() method and call it right >>>>> before resume switch and before return from yield. It swaps generator >>>>> exception state with thread local. 
>>>> >>>> Looks good to me. I assume this fixes the problem? Then please push it >>>> into >>>> mainline. >>> >>> old pull request is still there ;) >>> >>> https://github.com/cython/cython/pull/25 >>> >>> Does __Pyx_ExceptionReset() steal references to args, so they should >>> not be decrefed later? >> >> Hmm, good call. The refcounting looks correct over the two function calls, >> but it would be nicer if it could be avoided instead. We are really just >> swapping around pointers here, no refcounting is needed. >> >> I think it's worth using a dedicated utility function for this. >> > > Do you mean new utility call __Pyx_ExceptionSwap()? Yes. I think it doesn't even have to be specific to the generator code. Just pass in "&gen->exc_type" etc. >> Oh, and one more thing: what should be done when exiting the generator >> normally? Would that just raise GeneratorExit anyway, so that we can swallow >> any original outer exception? I think there might be a problem with Python >> 3, where the GeneratorExit should be raised in the context of the original >> exception. Worth testing how CPython behaves here... >> > > > Right! It's better to wrap call to generator: > > ExceptionSwap(); > generator_body(); > ExceptionSwap(); Good idea. > Ok, I'll take a look. Cool. Thanks! Stefan From vitja.makarov at gmail.com Wed Apr 20 15:16:09 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Wed, 20 Apr 2011 17:16:09 +0400 Subject: [Cython] Recent bugs in generators In-Reply-To: <4DAEC73E.2030906@behnel.de> References: <4DAB5629.2020500@behnel.de> <4DABC4B7.8030904@behnel.de> <4DAEAAD5.5080908@behnel.de> <4DAEB27F.1050404@behnel.de> <4DAEC73E.2030906@behnel.de> Message-ID: 2011/4/20 Stefan Behnel : > Vitja Makarov, 20.04.2011 12:51: >> >> 2011/4/20 Stefan Behnel: >>> >>> Vitja Makarov, 20.04.2011 11:50: >>>> >>>> 2011/4/20 Stefan Behnel: >>>>> >>>>> Vitja Makarov, 20.04.2011 10:26: >>>>>> >>>>>> 2011/4/18 Stefan Behnel: >>>>>>> >>>>>>> generators have their own lifetime frame in CPython, and >>>>>>> exceptions don't leak from that. So, whenever it's the generator (or >>>>>>> code >>>>>>> called by it) that raises an exception, that must be kept local to >>>>>>> the >>>>>>> generator. >>>>>> >>>>>> Please review: >>>>>> >>>>>> >>>>>> >>>>>> https://github.com/vitek/cython/commit/73014aaed10b82a3f632d7f86212f86280c55858 >>>>>> >>>>>> I've added __Pyx_Generator_SwapExceptions() method and call it right >>>>>> before resume switch and before return from yield. It swaps generator >>>>>> exception state with thread local. >>>>> >>>>> Looks good to me. I assume this fixes the problem? Then please push it >>>>> into >>>>> mainline. >>>> >>>> old pull request is still there ;) >>>> >>>> https://github.com/cython/cython/pull/25 >>>> >>>> Does __Pyx_ExceptionReset() steal references to args, so they should >>>> not be decrefed later? >>> >>> Hmm, good call. The refcounting looks correct over the two function >>> calls, >>> but it would be nicer if it could be avoided instead. We are really just >>> swapping around pointers here, no refcounting is needed. >>> >>> I think it's worth using a dedicated utility function for this. >>> >> >> Do you mean new utility call __Pyx_ExceptionSwap()? > > Yes. I think it doesn't even have to be specific to the generator code. Just > pass in "&gen->exc_type" etc. > > >>> Oh, and one more thing: what should be done when exiting the generator >>> normally? Would that just raise GeneratorExit anyway, so that we can >>> swallow >>> any original outer exception? 
I think there might be a problem with >>> Python >>> 3, where the GeneratorExit should be raised in the context of the >>> original >>> exception. Worth testing how CPython behaves here... >>> >> >> >> Right! It's better to wrap call to generator: >> >> ExceptionSwap(); >> generator_body(); >> ExceptionSwap(); > > Good idea. > > >> Ok, I'll take a look. > > Cool. Thanks! > https://github.com/vitek/cython/compare/73014aaed1...01286645d0 Another one try ;) -- vitja. From dalcinl at gmail.com Wed Apr 20 16:09:54 2011 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Wed, 20 Apr 2011 11:09:54 -0300 Subject: [Cython] libcpp.string operators In-Reply-To: References: Message-ID: On 19 April 2011 21:22, Brent Pedersen wrote: > On Tue, Apr 19, 2011 at 6:08 PM, Brent Pedersen wrote: >> On Tue, Apr 19, 2011 at 2:45 PM, Brent Pedersen wrote: >>> hi, i have been using a stub for the c++ in a lot of my work. >>> i decided to have a go at doing a proper string.pxd, pasted here: >>> >>> https://gist.github.com/929604 >>> >>> other than the operators, it's mostly implemented with tests. but when >>> i try the operators >>> with this test: >>> >>> def test_equal_operator(char *a, char *b): >>> ? ?""" >>> ? ?>>> test_equal_operator("asdf", "asdf") >>> ? ?True >>> ? ?""" >>> ? ?cdef string s = string(a) >>> ? ?cdef string t = string(b) >>> ? ?cdef bint same = t == s >>> ? ?return same >>> >>> and this declaration in the pxd: >>> >>> ? ?bint operator==(string&, string&) >> >> >> it seems: >> >> ? ? ? ?bint operator==(string&) >> >> is the correct syntax. i had copied the stuff from vector.pxd >> >>> >>> i get the error below. any ideas what i'm doing wrong? >>> thanks, >>> -brent >>> >>> >>> >>> >>> >>> === Got errors: === >>> 152:23: Invalid types for '==' (string, string) >>> >>> >>> ====================================================================== >>> ERROR: runTest (__main__.CythonRunTestCase) >>> compiling (cpp) and running cpp_stl_string >>> ---------------------------------------------------------------------- >>> Traceback (most recent call last): >>> ?File "runtests.py", line 569, in run >>> ? ?self.runCompileTest() >>> ?File "runtests.py", line 400, in runCompileTest >>> ? ?self.test_directory, self.expect_errors, self.annotate) >>> ?File "runtests.py", line 546, in compile >>> ? ?self.assertEquals(None, unexpected_error) >>> AssertionError: None != u"152:23: Invalid types for '==' (string, string)" >>> >> > > i updated the gist with the operators, didn't do iterators. > https://gist.github.com/929604 > Looks pretty good. Could you fork the devel repo, add the pxd and the test at appropriate places and make a pull request? This is just in order to give you credits for your work. -- Lisandro Dalcin --------------- CIMEC (INTEC/CONICET-UNL) Predio CONICET-Santa Fe Colectora RN 168 Km 472, Paraje El Pozo 3000 Santa Fe, Argentina Tel: +54-342-4511594 (ext 1011) Tel/Fax: +54-342-4511169 From stefan_ml at behnel.de Wed Apr 20 17:58:01 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 20 Apr 2011 17:58:01 +0200 Subject: [Cython] libcpp.string operators In-Reply-To: References: Message-ID: <4DAF0289.2050306@behnel.de> Lisandro Dalcin, 20.04.2011 16:09: > On 19 April 2011 21:22, Brent Pedersen wrote: >> On Tue, Apr 19, 2011 at 6:08 PM, Brent Pedersen wrote: >>> On Tue, Apr 19, 2011 at 2:45 PM, Brent Pedersen wrote: >>>> hi, i have been using a stub for the c++ in a lot of my work. 
>>>> i decided to have a go at doing a proper string.pxd, pasted here: >>>> >>>> https://gist.github.com/929604 >>>> >>>> other than the operators, it's mostly implemented with tests. but when >>>> i try the operators >>>> with this test: >>>> >>>> def test_equal_operator(char *a, char *b): >>>> """ >>>> >>> test_equal_operator("asdf", "asdf") >>>> True >>>> """ >>>> cdef string s = string(a) >>>> cdef string t = string(b) >>>> cdef bint same = t == s >>>> return same >>>> >>>> and this declaration in the pxd: >>>> >>>> bint operator==(string&, string&) >>> >>> >>> it seems: >>> >>> bint operator==(string&) >>> >>> is the correct syntax. i had copied the stuff from vector.pxd >>> >>>> >>>> i get the error below. any ideas what i'm doing wrong? >>>> thanks, >>>> -brent >>>> >>>> >>>> >>>> >>>> >>>> === Got errors: === >>>> 152:23: Invalid types for '==' (string, string) >>>> >>>> >>>> ====================================================================== >>>> ERROR: runTest (__main__.CythonRunTestCase) >>>> compiling (cpp) and running cpp_stl_string >>>> ---------------------------------------------------------------------- >>>> Traceback (most recent call last): >>>> File "runtests.py", line 569, in run >>>> self.runCompileTest() >>>> File "runtests.py", line 400, in runCompileTest >>>> self.test_directory, self.expect_errors, self.annotate) >>>> File "runtests.py", line 546, in compile >>>> self.assertEquals(None, unexpected_error) >>>> AssertionError: None != u"152:23: Invalid types for '==' (string, string)" >>>> >>> >> >> i updated the gist with the operators, didn't do iterators. >> https://gist.github.com/929604 >> > > Looks pretty good. Could you fork the devel repo, add the pxd and the > test at appropriate places and make a pull request? This is just in > order to give you credits for your work. Looks like this wasn't tested with Python 3, there are 16 failing tests more now. https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-py3k-cpp/952/ Stefan From bpederse at gmail.com Wed Apr 20 18:29:09 2011 From: bpederse at gmail.com (Brent Pedersen) Date: Wed, 20 Apr 2011 10:29:09 -0600 Subject: [Cython] libcpp.string operators In-Reply-To: <4DAF0289.2050306@behnel.de> References: <4DAF0289.2050306@behnel.de> Message-ID: On Wed, Apr 20, 2011 at 9:58 AM, Stefan Behnel wrote: > Lisandro Dalcin, 20.04.2011 16:09: >> >> On 19 April 2011 21:22, Brent Pedersen ?wrote: >>> >>> On Tue, Apr 19, 2011 at 6:08 PM, Brent Pedersen >>> ?wrote: >>>> >>>> On Tue, Apr 19, 2011 at 2:45 PM, Brent Pedersen >>>> ?wrote: >>>>> >>>>> hi, i have been using a stub for the c++ ?in a lot of my work. >>>>> i decided to have a go at doing a proper string.pxd, pasted here: >>>>> >>>>> https://gist.github.com/929604 >>>>> >>>>> other than the operators, it's mostly implemented with tests. but when >>>>> i try the operators >>>>> with this test: >>>>> >>>>> def test_equal_operator(char *a, char *b): >>>>> ? ?""" >>>>> ? ?>>> ?test_equal_operator("asdf", "asdf") >>>>> ? ?True >>>>> ? ?""" >>>>> ? ?cdef string s = string(a) >>>>> ? ?cdef string t = string(b) >>>>> ? ?cdef bint same = t == s >>>>> ? ?return same >>>>> >>>>> and this declaration in the pxd: >>>>> >>>>> ? ?bint operator==(string&, string&) >>>> >>>> >>>> it seems: >>>> >>>> ? ? ? ?bint operator==(string&) >>>> >>>> is the correct syntax. i had copied the stuff from vector.pxd >>>> >>>>> >>>>> i get the error below. any ideas what i'm doing wrong? 
>>>>> thanks, >>>>> -brent >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> === Got errors: === >>>>> 152:23: Invalid types for '==' (string, string) >>>>> >>>>> >>>>> ====================================================================== >>>>> ERROR: runTest (__main__.CythonRunTestCase) >>>>> compiling (cpp) and running cpp_stl_string >>>>> ---------------------------------------------------------------------- >>>>> Traceback (most recent call last): >>>>> ?File "runtests.py", line 569, in run >>>>> ? ?self.runCompileTest() >>>>> ?File "runtests.py", line 400, in runCompileTest >>>>> ? ?self.test_directory, self.expect_errors, self.annotate) >>>>> ?File "runtests.py", line 546, in compile >>>>> ? ?self.assertEquals(None, unexpected_error) >>>>> AssertionError: None != u"152:23: Invalid types for '==' (string, >>>>> string)" >>>>> >>>> >>> >>> i updated the gist with the operators, didn't do iterators. >>> https://gist.github.com/929604 >>> >> >> Looks pretty good. Could you fork the devel repo, add the pxd and the >> test at appropriate places ?and make a pull request? This is just in >> order to give you credits for your work. > > Looks like this wasn't tested with Python 3, there are 16 failing tests more > now. > > https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-py3k-cpp/952/ > > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > is the solution to prefix the string literals with 'b'? e.g. >>> test_indexing(b"asdf") or something else? that passes in python 2.6, 2.7 and 3.2 for me. From stefan_ml at behnel.de Wed Apr 20 18:30:19 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 20 Apr 2011 18:30:19 +0200 Subject: [Cython] libcpp.string operators In-Reply-To: <4DAF0289.2050306@behnel.de> References: <4DAF0289.2050306@behnel.de> Message-ID: <4DAF0A1B.1080306@behnel.de> Stefan Behnel, 20.04.2011 17:58: > Lisandro Dalcin, 20.04.2011 16:09: >> On 19 April 2011 21:22, Brent Pedersen wrote: >>>>> hi, i have been using a stub for the c++ in a lot of my work. >>>>> i decided to have a go at doing a proper string.pxd, pasted here: >>>>> >>>>> https://gist.github.com/929604 >> >> Looks pretty good. Could you fork the devel repo, add the pxd and the >> test at appropriate places and make a pull request? This is just in >> order to give you credits for your work. > > Looks like this wasn't tested with Python 3, there are 16 failing tests > more now. > > https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-py3k-cpp/952/ I fixed it up. Stefan From stefan_ml at behnel.de Wed Apr 20 18:34:04 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 20 Apr 2011 18:34:04 +0200 Subject: [Cython] libcpp.string operators In-Reply-To: References: <4DAF0289.2050306@behnel.de> Message-ID: <4DAF0AFC.6040709@behnel.de> Brent Pedersen, 20.04.2011 18:29: > On Wed, Apr 20, 2011 at 9:58 AM, Stefan Behnel wrote: >> Lisandro Dalcin, 20.04.2011 16:09: >>>>> On Tue, Apr 19, 2011 at 2:45 PM, Brent Pedersen wrote: >>>>>> >>>>>> hi, i have been using a stub for the c++ in a lot of my work. >>>>>> i decided to have a go at doing a proper string.pxd, pasted here: >>>>>> >>>>>> https://gist.github.com/929604 >>>>>> >>> Looks pretty good. Could you fork the devel repo, add the pxd and the >>> test at appropriate places and make a pull request? This is just in >>> order to give you credits for your work. 
>> >> Looks like this wasn't tested with Python 3, there are 16 failing tests more >> now. >> >> https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-py3k-cpp/952/ >> > is the solution to prefix the string literals with 'b'? e.g. > > >>> test_indexing(b"asdf") > > or something else? that passes in python 2.6, 2.7 and 3.2 for me. ... but not in Py2.3-2.5. No, you need to either use byte strings created in Cython code, or use a hack that works on both Py2 and Py3, such as 'xyz'.encode('ASCII'). Stefan From bpederse at gmail.com Wed Apr 20 18:40:07 2011 From: bpederse at gmail.com (Brent Pedersen) Date: Wed, 20 Apr 2011 10:40:07 -0600 Subject: [Cython] libcpp.string operators In-Reply-To: <4DAF0AFC.6040709@behnel.de> References: <4DAF0289.2050306@behnel.de> <4DAF0AFC.6040709@behnel.de> Message-ID: On Wed, Apr 20, 2011 at 10:34 AM, Stefan Behnel wrote: > Brent Pedersen, 20.04.2011 18:29: >> >> On Wed, Apr 20, 2011 at 9:58 AM, Stefan Behnel wrote: >>> >>> Lisandro Dalcin, 20.04.2011 16:09: >>>>>> >>>>>> On Tue, Apr 19, 2011 at 2:45 PM, Brent Pedersen wrote: >>>>>>> >>>>>>> hi, i have been using a stub for the c++ ? ?in a lot of my >>>>>>> work. >>>>>>> i decided to have a go at doing a proper string.pxd, pasted here: >>>>>>> >>>>>>> https://gist.github.com/929604 >>>>>>> >>>> Looks pretty good. Could you fork the devel repo, add the pxd and the >>>> test at appropriate places ?and make a pull request? This is just in >>>> order to give you credits for your work. >>> >>> Looks like this wasn't tested with Python 3, there are 16 failing tests >>> more >>> now. >>> >>> >>> https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-py3k-cpp/952/ >>> >> is the solution to prefix the string literals with 'b'? e.g. >> >> ? ? >>> ?test_indexing(b"asdf") >> >> or something else? that passes in python 2.6, 2.7 and 3.2 for me. > > ... but not in Py2.3-2.5. No, you need to either use byte strings created in > Cython code, or use a hack that works on both Py2 and Py3, such as > 'xyz'.encode('ASCII'). > > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > good to know. thanks. From robertwb at math.washington.edu Wed Apr 20 21:08:20 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Wed, 20 Apr 2011 12:08:20 -0700 Subject: [Cython] On cdef extern from syntax In-Reply-To: References: Message-ID: On Wed, Apr 20, 2011 at 11:08 AM, Fabrizio Milo aka misto wrote: > Hi, > > I was wondering if has been ever discussed to implement the cython > cdef extern from syntax as a with statement: > > with cython.header_importer("") as cy: > ? ? ? cy.ctypedef(" unsigned int uint8 ") > ? ? ? cy.cfunc( " void * myfunc() )") > > or also > > with cython.header_importer("") as cy: > ? ? cy(""" > ? ? ctypdef ?... > > ? ? """) > > Maybe there is already something like that ? I don't think the with statement really makes sense here, as it is a list of global declarations, and the with statement is all about making something local to a block. - Robert From vitja.makarov at gmail.com Thu Apr 21 08:57:31 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Thu, 21 Apr 2011 10:57:31 +0400 Subject: [Cython] Build module script Message-ID: Now we have cythonrun build script, may be it's time to create script for easy module building? -- vitja. 
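(What Vitja asks for here needs very little machinery; a minimal sketch of such a one-shot build helper, using only the distutils integration that Cython already ships, could look like the following. The script name and layout are illustrative, not an actual Cython tool.)

    # cythonbuild.py -- hypothetical helper script, not part of Cython itself
    import sys
    from distutils.core import setup
    from distutils.extension import Extension
    from Cython.Distutils import build_ext

    def build_inplace(pyx_file):
        # strip the .pyx suffix and map path separators to a module name,
        # e.g. pkg/mod.pyx -> pkg.mod
        name = pyx_file[:-len('.pyx')].replace('/', '.')
        setup(script_args=['build_ext', '--inplace'],
              cmdclass={'build_ext': build_ext},
              ext_modules=[Extension(name, [pyx_file])])

    if __name__ == '__main__':
        build_inplace(sys.argv[1])

For interactive use, pyximport.install() followed by a plain import of the .pyx module already covers much of the same ground, which is what comes up in the reply below.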
From markflorisson88 at gmail.com Thu Apr 21 10:08:48 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 21 Apr 2011 10:08:48 +0200 Subject: [Cython] Build module script In-Reply-To: References: Message-ID: On 21 April 2011 08:57, Vitja Makarov wrote: > Now we have cythonrun build script, may be it's time to create script > for easy module building? > > -- > vitja. > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > We have cythonize(): http://wiki.cython.org/enhancements/distutils_preprocessing . Is something like that what you mean? Unfortunately I believe it doesn't parse sys.argv (yet). Perhaps a recursive option would also be useful. From vitja.makarov at gmail.com Thu Apr 21 10:23:45 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Thu, 21 Apr 2011 12:23:45 +0400 Subject: [Cython] Build module script In-Reply-To: References: Message-ID: 2011/4/21 mark florisson : > On 21 April 2011 08:57, Vitja Makarov wrote: >> Now we have cythonrun build script, may be it's time to create script >> for easy module building? >> >> -- >> vitja. >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > > We have cythonize(): > http://wiki.cython.org/enhancements/distutils_preprocessing . Is > something like that what you mean? Unfortunately I believe it doesn't > parse sys.argv (yet). Perhaps a recursive option would also be useful. Not exactly cythonize is for distutils and friends. I mean simple script that run cython compiler and then build shared python module. It's much better then manually run cython then gcc or write makefile for this. -- vitja. From robertwb at math.washington.edu Thu Apr 21 10:37:29 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 21 Apr 2011 01:37:29 -0700 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> Message-ID: On Mon, Apr 18, 2011 at 7:51 AM, mark florisson wrote: > On 18 April 2011 16:41, Dag Sverre Seljebotn wrote: >> Excellent! Sounds great! (as I won't have my laptop for some days I can't >> have a look yet but I will later) >> >> You're right about (the current) buffers and the gil. A testcase explicitly >> for them would be good. >> >> Firstprivate etc: i think it'd be nice myself, but it is probably better to >> take a break from it at this point so that we can think more about that and >> not do anything rash; perhaps open up a specific thread on them and ask for >> more general input. Perhaps you want to take a break or task-switch to >> something else (fused types?) until I can get around to review and merge >> what you have so far? You'll know best what works for you though. If you >> decide to implement explicit threadprivate variables because you've got the >> flow I certainly wom't object myself. >> > ?Ok, cool, I'll move on :) I already included a test with a prange and > a numpy buffer with indexing. Wow, you're just plowing away at this. Very cool. +1 to disallowing nested prange, that seems to get really messy with little benefit. 
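(For orientation, this is roughly the shape of loop the CEP is about, written with the syntax that eventually shipped in released Cython; the draft being debated in this thread still differed in details such as how thread-local variables would be declared.)

    from cython.parallel import prange

    def dot(double[:] a, double[:] b):
        cdef Py_ssize_t i
        cdef double s = 0
        # 'i' is the parallel loop index; the in-place '+=' marks 's' as a
        # reduction variable, so each thread accumulates privately and the
        # partial sums are combined when the loop finishes.
        for i in prange(a.shape[0], nogil=True, schedule='static'):
            s += a[i] * b[i]
        return s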
In terms of the CEP, I'm still unconvinced that firstprivate is not safe to infer, but lets leave the initial values undefined rather than specifying them to be NaNs (we can do that as an implementation if you want), which will give us flexibility to change later once we've had a chance to play around with it. The "cdef threadlocal(int) foo" declaration syntax feels odd to me... We also probably want some way of explicitly marking a variable as shared and still be able to assign to/flush/sync it. Perhaps the parallel context could be used for these declarations, i.e. with parallel(threadlocal=a, shared=(b,c)): ... which would be considered an "expert" usecase. For all the discussion of threadsavailable/threadid, the most common usecase I see is for allocating a large shared buffer and partitioning it. This seems better handled by allocating separate thread-local buffers, no? I still like the context idea, but everything in a parallel block before and after the loop(s) also seems like a natural place to put any setup/teardown code (though the context has the advantage that __exit__ is always called, even if exceptions are raised, which makes cleanup a lot easier to handle). - Robert From markflorisson88 at gmail.com Thu Apr 21 10:43:19 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 21 Apr 2011 10:43:19 +0200 Subject: [Cython] Build module script In-Reply-To: References: Message-ID: On 21 April 2011 10:23, Vitja Makarov wrote: > 2011/4/21 mark florisson : >> On 21 April 2011 08:57, Vitja Makarov wrote: >>> Now we have cythonrun build script, may be it's time to create script >>> for easy module building? >>> >>> -- >>> vitja. >>> _______________________________________________ >>> cython-devel mailing list >>> cython-devel at python.org >>> http://mail.python.org/mailman/listinfo/cython-devel >>> >> >> We have cythonize(): >> http://wiki.cython.org/enhancements/distutils_preprocessing . Is >> something like that what you mean? Unfortunately I believe it doesn't >> parse sys.argv (yet). Perhaps a recursive option would also be useful. > > Not exactly cythonize is for distutils and friends. > > I mean simple script that run cython compiler and then build shared > python module. > It's much better then manually run cython then gcc or write makefile for this. Ah, I see. Yeah that might be convenient. Although there is always pyximport. > -- > vitja. > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Thu Apr 21 10:59:25 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 21 Apr 2011 10:59:25 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> Message-ID: On 21 April 2011 10:37, Robert Bradshaw wrote: > On Mon, Apr 18, 2011 at 7:51 AM, mark florisson > wrote: >> On 18 April 2011 16:41, Dag Sverre Seljebotn wrote: >>> Excellent! Sounds great! (as I won't have my laptop for some days I can't >>> have a look yet but I will later) >>> >>> You're right about (the current) buffers and the gil. A testcase explicitly >>> for them would be good. 
>>> >>> Firstprivate etc: i think it'd be nice myself, but it is probably better to >>> take a break from it at this point so that we can think more about that and >>> not do anything rash; perhaps open up a specific thread on them and ask for >>> more general input. Perhaps you want to take a break or task-switch to >>> something else (fused types?) until I can get around to review and merge >>> what you have so far? You'll know best what works for you though. If you >>> decide to implement explicit threadprivate variables because you've got the >>> flow I certainly wom't object myself. >>> >> ?Ok, cool, I'll move on :) I already included a test with a prange and >> a numpy buffer with indexing. > > Wow, you're just plowing away at this. Very cool. > > +1 to disallowing nested prange, that seems to get really messy with > little benefit. > > In terms of the CEP, I'm still unconvinced that firstprivate is not > safe to infer, but lets leave the initial values undefined rather than > specifying them to be NaNs (we can do that as an implementation if you > want), which will give us flexibility to change later once we've had a > chance to play around with it. Yes, they are currently undefined (and not initialized to NaN etc). The thing is that without the control flow analysis (or perhaps not until runtime) you won't know whether a variable is initialized at all before the parallel section, so making it firstprivate might actually copy an undefined value (perhaps with a trap representation!) into the thread-private copy, which might invalidate valid code. e.g. consider x_is_initialized = False if condition: x = 1 x_is_initialized = True for i in prange(10, schedule='static'): if x_is_initialized: printf("%d\n", x) x = i > The "cdef threadlocal(int) foo" declaration syntax feels odd to me... > We also probably want some way of explicitly marking a variable as > shared and still be able to assign to/flush/sync it. Perhaps the > parallel context could be used for these declarations, i.e. > > ? ?with parallel(threadlocal=a, shared=(b,c)): > ? ? ? ?... > > which would be considered an "expert" usecase. Indeed, assigning to elements in an array instead doesn't seem very convenient :) > For all the discussion of threadsavailable/threadid, the most common > usecase I see is for allocating a large shared buffer and partitioning > it. This seems better handled by allocating separate thread-local > buffers, no? I still like the context idea, but everything in a > parallel block before and after the loop(s) also seems like a natural > place to put any setup/teardown code (though the context has the > advantage that __exit__ is always called, even if exceptions are > raised, which makes cleanup a lot easier to handle). Currently 'with gil' isn't merged into that branch, and if it will, it will be disallowed, as I'm not yet sure how (if at all) it could be handled with regard to exceptions. It seems a lot easier to disallow it and have the user write a 'with gil' function, from which nothing can propagate. 
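(A minimal sketch of the pattern described here, using the 'with gil' function syntax that appears elsewhere in this archive; the function names and the logging body are made up for illustration.)

    from cython.parallel import prange

    cdef void log_progress(Py_ssize_t i) with gil:
        # The GIL is acquired only for the duration of this call, and the
        # exception handling stays inside the function, so nothing has to
        # propagate back into the surrounding parallel loop.
        try:
            print("processed item %d" % i)
        except BaseException:
            pass

    def scale(double[:] data):
        cdef Py_ssize_t i
        for i in prange(data.shape[0], nogil=True):
            data[i] = 2.0 * data[i]
            if i % 4096 == 0:
                log_progress(i)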
> - Robert > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Thu Apr 21 11:21:18 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 21 Apr 2011 11:21:18 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> Message-ID: On 21 April 2011 10:59, mark florisson wrote: > On 21 April 2011 10:37, Robert Bradshaw wrote: >> On Mon, Apr 18, 2011 at 7:51 AM, mark florisson >> wrote: >>> On 18 April 2011 16:41, Dag Sverre Seljebotn wrote: >>>> Excellent! Sounds great! (as I won't have my laptop for some days I can't >>>> have a look yet but I will later) >>>> >>>> You're right about (the current) buffers and the gil. A testcase explicitly >>>> for them would be good. >>>> >>>> Firstprivate etc: i think it'd be nice myself, but it is probably better to >>>> take a break from it at this point so that we can think more about that and >>>> not do anything rash; perhaps open up a specific thread on them and ask for >>>> more general input. Perhaps you want to take a break or task-switch to >>>> something else (fused types?) until I can get around to review and merge >>>> what you have so far? You'll know best what works for you though. If you >>>> decide to implement explicit threadprivate variables because you've got the >>>> flow I certainly wom't object myself. >>>> >>> ?Ok, cool, I'll move on :) I already included a test with a prange and >>> a numpy buffer with indexing. >> >> Wow, you're just plowing away at this. Very cool. >> >> +1 to disallowing nested prange, that seems to get really messy with >> little benefit. >> >> In terms of the CEP, I'm still unconvinced that firstprivate is not >> safe to infer, but lets leave the initial values undefined rather than >> specifying them to be NaNs (we can do that as an implementation if you >> want), which will give us flexibility to change later once we've had a >> chance to play around with it. > > Yes, they are currently undefined (and not initialized to NaN etc). > The thing is that without the control flow analysis (or perhaps not > until runtime) you won't know whether a variable is initialized at all > before the parallel section, so making it firstprivate might actually > copy an undefined value (perhaps with a trap representation!) into the > thread-private copy, which might invalidate valid code. e.g. consider > > x_is_initialized = False > if condition: > ? ?x = 1 > ? ?x_is_initialized = True > > for i in prange(10, schedule='static'): > ? ?if x_is_initialized: > ? ? ? ?printf("%d\n", x) > ? ?x = i Erm, that snippet I posted is invalid in any case, as x will be private. So guess initializing things to NaN in such would have to occur in the parallel section that should enclose the for. So e.g. we'd have to do #pragma omp parallel private(x) { x = INT_MAX; #pragma omp for lastprivate(i) for (...) ... } Which would then mean that 'x' cannot be lastprivate anymore :). So it's either "uninitialized and undefined" or "firstprivate". I personally prefer the former for the implicit route. 
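To spell out the case being debated in Cython terms (a sketch that mirrors the x = f(x) example from earlier in the thread; f is a made-up nogil function and the prange import follows the CEP draft):

from cython.parallel import prange

cdef double f(double v) nogil:
    return v + 1.0                   # made-up workload

def demo():
    cdef int i
    cdef double x = 3.0              # assigned before the loop

    for i in prange(10, schedule='static'):
        # x is private inside the loop because it is assigned here; with
        # "uninitialized and undefined" the read in f(x) does not see 3.0,
        # while "firstprivate" would start each thread from a copy of 3.0
        x = f(x)

    # with lastprivate semantics, x here would hold the value written by
    # the last iteration
    return x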
I do like the threadlocal=a stuff to parallel, it's basically what I proposed a while back except that you don't make them strings, but better because most of your variables can be inferred, so the messiness is gone. >> The "cdef threadlocal(int) foo" declaration syntax feels odd to me... >> We also probably want some way of explicitly marking a variable as >> shared and still be able to assign to/flush/sync it. Perhaps the >> parallel context could be used for these declarations, i.e. >> >> ? ?with parallel(threadlocal=a, shared=(b,c)): >> ? ? ? ?... >> >> which would be considered an "expert" usecase. > > Indeed, assigning to elements in an array instead doesn't seem very > convenient :) > >> For all the discussion of threadsavailable/threadid, the most common >> usecase I see is for allocating a large shared buffer and partitioning >> it. This seems better handled by allocating separate thread-local >> buffers, no? I still like the context idea, but everything in a >> parallel block before and after the loop(s) also seems like a natural >> place to put any setup/teardown code (though the context has the >> advantage that __exit__ is always called, even if exceptions are >> raised, which makes cleanup a lot easier to handle). > > Currently 'with gil' isn't merged into that branch, and if it will, it > will be disallowed, as I'm not yet sure how (if at all) it could be > handled with regard to exceptions. It seems a lot easier to disallow > it and have the user write a 'with gil' function, from which nothing > can propagate. > >> - Robert >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > From robertwb at math.washington.edu Thu Apr 21 11:18:16 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 21 Apr 2011 02:18:16 -0700 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> Message-ID: On Thu, Apr 21, 2011 at 1:59 AM, mark florisson wrote: > On 21 April 2011 10:37, Robert Bradshaw wrote: >> On Mon, Apr 18, 2011 at 7:51 AM, mark florisson >> wrote: >>> On 18 April 2011 16:41, Dag Sverre Seljebotn wrote: >>>> Excellent! Sounds great! (as I won't have my laptop for some days I can't >>>> have a look yet but I will later) >>>> >>>> You're right about (the current) buffers and the gil. A testcase explicitly >>>> for them would be good. >>>> >>>> Firstprivate etc: i think it'd be nice myself, but it is probably better to >>>> take a break from it at this point so that we can think more about that and >>>> not do anything rash; perhaps open up a specific thread on them and ask for >>>> more general input. Perhaps you want to take a break or task-switch to >>>> something else (fused types?) until I can get around to review and merge >>>> what you have so far? You'll know best what works for you though. If you >>>> decide to implement explicit threadprivate variables because you've got the >>>> flow I certainly wom't object myself. >>>> >>> ?Ok, cool, I'll move on :) I already included a test with a prange and >>> a numpy buffer with indexing. >> >> Wow, you're just plowing away at this. Very cool. >> >> +1 to disallowing nested prange, that seems to get really messy with >> little benefit. 
>> >> In terms of the CEP, I'm still unconvinced that firstprivate is not >> safe to infer, but lets leave the initial values undefined rather than >> specifying them to be NaNs (we can do that as an implementation if you >> want), which will give us flexibility to change later once we've had a >> chance to play around with it. > > Yes, they are currently undefined (and not initialized to NaN etc). > The thing is that without the control flow analysis (or perhaps not > until runtime) you won't know whether a variable is initialized at all > before the parallel section, so making it firstprivate might actually > copy an undefined value (perhaps with a trap representation!) into the > thread-private copy, which might invalidate valid code. e.g. consider > > x_is_initialized = False > if condition: > ? ?x = 1 > ? ?x_is_initialized = True > > for i in prange(10, schedule='static'): > ? ?if x_is_initialized: > ? ? ? ?printf("%d\n", x) > ? ?x = i I'm still failing to see how this is a problem (or anything new, as opposed to this same example with an ordinary range). >> The "cdef threadlocal(int) foo" declaration syntax feels odd to me... >> We also probably want some way of explicitly marking a variable as >> shared and still be able to assign to/flush/sync it. Perhaps the >> parallel context could be used for these declarations, i.e. >> >> ? ?with parallel(threadlocal=a, shared=(b,c)): >> ? ? ? ?... >> >> which would be considered an "expert" usecase. > > Indeed, assigning to elements in an array instead doesn't seem very > convenient :) > >> For all the discussion of threadsavailable/threadid, the most common >> usecase I see is for allocating a large shared buffer and partitioning >> it. This seems better handled by allocating separate thread-local >> buffers, no? I still like the context idea, but everything in a >> parallel block before and after the loop(s) also seems like a natural >> place to put any setup/teardown code (though the context has the >> advantage that __exit__ is always called, even if exceptions are >> raised, which makes cleanup a lot easier to handle). > > Currently 'with gil' isn't merged into that branch, and if it will, it > will be disallowed, as I'm not yet sure how (if at all) it could be > handled with regard to exceptions. It seems a lot easier to disallow > it and have the user write a 'with gil' function, from which nothing > can propagate. Not being able to propagate exceptions is a pretty strong constraint--even if the implementation doesn't yet support it, it'd be nice to have an API that makes it possible as a future feature. - Robert From robertwb at math.washington.edu Thu Apr 21 11:37:56 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 21 Apr 2011 02:37:56 -0700 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> Message-ID: On Thu, Apr 21, 2011 at 2:21 AM, mark florisson wrote: > On 21 April 2011 10:59, mark florisson wrote: >> On 21 April 2011 10:37, Robert Bradshaw wrote: >>> On Mon, Apr 18, 2011 at 7:51 AM, mark florisson >>> wrote: >>>> On 18 April 2011 16:41, Dag Sverre Seljebotn wrote: >>>>> Excellent! Sounds great! (as I won't have my laptop for some days I can't >>>>> have a look yet but I will later) >>>>> >>>>> You're right about (the current) buffers and the gil. 
A testcase explicitly >>>>> for them would be good. >>>>> >>>>> Firstprivate etc: i think it'd be nice myself, but it is probably better to >>>>> take a break from it at this point so that we can think more about that and >>>>> not do anything rash; perhaps open up a specific thread on them and ask for >>>>> more general input. Perhaps you want to take a break or task-switch to >>>>> something else (fused types?) until I can get around to review and merge >>>>> what you have so far? You'll know best what works for you though. If you >>>>> decide to implement explicit threadprivate variables because you've got the >>>>> flow I certainly wom't object myself. >>>>> >>>> ?Ok, cool, I'll move on :) I already included a test with a prange and >>>> a numpy buffer with indexing. >>> >>> Wow, you're just plowing away at this. Very cool. >>> >>> +1 to disallowing nested prange, that seems to get really messy with >>> little benefit. >>> >>> In terms of the CEP, I'm still unconvinced that firstprivate is not >>> safe to infer, but lets leave the initial values undefined rather than >>> specifying them to be NaNs (we can do that as an implementation if you >>> want), which will give us flexibility to change later once we've had a >>> chance to play around with it. >> >> Yes, they are currently undefined (and not initialized to NaN etc). >> The thing is that without the control flow analysis (or perhaps not >> until runtime) you won't know whether a variable is initialized at all >> before the parallel section, so making it firstprivate might actually >> copy an undefined value (perhaps with a trap representation!) into the >> thread-private copy, which might invalidate valid code. e.g. consider >> >> x_is_initialized = False >> if condition: >> ? ?x = 1 >> ? ?x_is_initialized = True >> >> for i in prange(10, schedule='static'): >> ? ?if x_is_initialized: >> ? ? ? ?printf("%d\n", x) >> ? ?x = i > > Erm, that snippet I posted is invalid in any case, as x will be > private. So guess initializing things to NaN in such would have to > occur in the parallel section that should enclose the for. So e.g. > we'd have to do > > #pragma omp parallel private(x) > { > ? ?x = INT_MAX; > ? ?#pragma omp for lastprivate(i) > ? ?for (...) > ? ? ? ?... > } > > Which would then mean that 'x' cannot be lastprivate anymore :). So > it's either "uninitialized and undefined" or "firstprivate". I > personally prefer the former for the implicit route. A variable can't be both first and last private? In any case, as long as we don't promise anything about them now, we can decide later. > I do like the threadlocal=a stuff to parallel, it's basically what I > proposed a while back except that you don't make them strings, but > better because most of your variables can be inferred, so the > messiness is gone. Yep. 
- Robert From markflorisson88 at gmail.com Thu Apr 21 11:46:16 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 21 Apr 2011 11:46:16 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> Message-ID: On 21 April 2011 11:37, Robert Bradshaw wrote: > On Thu, Apr 21, 2011 at 2:21 AM, mark florisson > wrote: >> On 21 April 2011 10:59, mark florisson wrote: >>> On 21 April 2011 10:37, Robert Bradshaw wrote: >>>> On Mon, Apr 18, 2011 at 7:51 AM, mark florisson >>>> wrote: >>>>> On 18 April 2011 16:41, Dag Sverre Seljebotn wrote: >>>>>> Excellent! Sounds great! (as I won't have my laptop for some days I can't >>>>>> have a look yet but I will later) >>>>>> >>>>>> You're right about (the current) buffers and the gil. A testcase explicitly >>>>>> for them would be good. >>>>>> >>>>>> Firstprivate etc: i think it'd be nice myself, but it is probably better to >>>>>> take a break from it at this point so that we can think more about that and >>>>>> not do anything rash; perhaps open up a specific thread on them and ask for >>>>>> more general input. Perhaps you want to take a break or task-switch to >>>>>> something else (fused types?) until I can get around to review and merge >>>>>> what you have so far? You'll know best what works for you though. If you >>>>>> decide to implement explicit threadprivate variables because you've got the >>>>>> flow I certainly wom't object myself. >>>>>> >>>>> ?Ok, cool, I'll move on :) I already included a test with a prange and >>>>> a numpy buffer with indexing. >>>> >>>> Wow, you're just plowing away at this. Very cool. >>>> >>>> +1 to disallowing nested prange, that seems to get really messy with >>>> little benefit. >>>> >>>> In terms of the CEP, I'm still unconvinced that firstprivate is not >>>> safe to infer, but lets leave the initial values undefined rather than >>>> specifying them to be NaNs (we can do that as an implementation if you >>>> want), which will give us flexibility to change later once we've had a >>>> chance to play around with it. >>> >>> Yes, they are currently undefined (and not initialized to NaN etc). >>> The thing is that without the control flow analysis (or perhaps not >>> until runtime) you won't know whether a variable is initialized at all >>> before the parallel section, so making it firstprivate might actually >>> copy an undefined value (perhaps with a trap representation!) into the >>> thread-private copy, which might invalidate valid code. e.g. consider >>> >>> x_is_initialized = False >>> if condition: >>> ? ?x = 1 >>> ? ?x_is_initialized = True >>> >>> for i in prange(10, schedule='static'): >>> ? ?if x_is_initialized: >>> ? ? ? ?printf("%d\n", x) >>> ? ?x = i >> >> Erm, that snippet I posted is invalid in any case, as x will be >> private. So guess initializing things to NaN in such would have to >> occur in the parallel section that should enclose the for. So e.g. >> we'd have to do >> >> #pragma omp parallel private(x) >> { >> ? ?x = INT_MAX; >> ? ?#pragma omp for lastprivate(i) >> ? ?for (...) >> ? ? ? ?... >> } >> >> Which would then mean that 'x' cannot be lastprivate anymore :). So >> it's either "uninitialized and undefined" or "firstprivate". I >> personally prefer the former for the implicit route. 
> > A variable can't be both first and last private? In any case, as long > as we don't promise anything about them now, we can decide later. It can be, but not if the binding parallel region declares it private. So we wouldn't actually need the snippet above, we could just do x = INT_MAX; #pragma omp parallel for firstprivate(x) lastprivate(i, x) for (...) ... Yeah, that would work. >> I do like the threadlocal=a stuff to parallel, it's basically what I >> proposed a while back except that you don't make them strings, but >> better because most of your variables can be inferred, so the >> messiness is gone. > > Yep. > > - Robert > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Thu Apr 21 11:48:53 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 21 Apr 2011 11:48:53 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> Message-ID: On 21 April 2011 11:18, Robert Bradshaw wrote: > On Thu, Apr 21, 2011 at 1:59 AM, mark florisson > wrote: >> On 21 April 2011 10:37, Robert Bradshaw wrote: >>> On Mon, Apr 18, 2011 at 7:51 AM, mark florisson >>> wrote: >>>> On 18 April 2011 16:41, Dag Sverre Seljebotn wrote: >>>>> Excellent! Sounds great! (as I won't have my laptop for some days I can't >>>>> have a look yet but I will later) >>>>> >>>>> You're right about (the current) buffers and the gil. A testcase explicitly >>>>> for them would be good. >>>>> >>>>> Firstprivate etc: i think it'd be nice myself, but it is probably better to >>>>> take a break from it at this point so that we can think more about that and >>>>> not do anything rash; perhaps open up a specific thread on them and ask for >>>>> more general input. Perhaps you want to take a break or task-switch to >>>>> something else (fused types?) until I can get around to review and merge >>>>> what you have so far? You'll know best what works for you though. If you >>>>> decide to implement explicit threadprivate variables because you've got the >>>>> flow I certainly wom't object myself. >>>>> >>>> ?Ok, cool, I'll move on :) I already included a test with a prange and >>>> a numpy buffer with indexing. >>> >>> Wow, you're just plowing away at this. Very cool. >>> >>> +1 to disallowing nested prange, that seems to get really messy with >>> little benefit. >>> >>> In terms of the CEP, I'm still unconvinced that firstprivate is not >>> safe to infer, but lets leave the initial values undefined rather than >>> specifying them to be NaNs (we can do that as an implementation if you >>> want), which will give us flexibility to change later once we've had a >>> chance to play around with it. >> >> Yes, they are currently undefined (and not initialized to NaN etc). >> The thing is that without the control flow analysis (or perhaps not >> until runtime) you won't know whether a variable is initialized at all >> before the parallel section, so making it firstprivate might actually >> copy an undefined value (perhaps with a trap representation!) into the >> thread-private copy, which might invalidate valid code. e.g. consider >> >> x_is_initialized = False >> if condition: >> ? ?x = 1 >> ? 
?x_is_initialized = True >> >> for i in prange(10, schedule='static'): >> ? ?if x_is_initialized: >> ? ? ? ?printf("%d\n", x) >> ? ?x = i > > I'm still failing to see how this is a problem (or anything new, as > opposed to this same example with an ordinary range). >>> The "cdef threadlocal(int) foo" declaration syntax feels odd to me... >>> We also probably want some way of explicitly marking a variable as >>> shared and still be able to assign to/flush/sync it. Perhaps the >>> parallel context could be used for these declarations, i.e. >>> >>> ? ?with parallel(threadlocal=a, shared=(b,c)): >>> ? ? ? ?... >>> >>> which would be considered an "expert" usecase. >> >> Indeed, assigning to elements in an array instead doesn't seem very >> convenient :) >> >>> For all the discussion of threadsavailable/threadid, the most common >>> usecase I see is for allocating a large shared buffer and partitioning >>> it. This seems better handled by allocating separate thread-local >>> buffers, no? I still like the context idea, but everything in a >>> parallel block before and after the loop(s) also seems like a natural >>> place to put any setup/teardown code (though the context has the >>> advantage that __exit__ is always called, even if exceptions are >>> raised, which makes cleanup a lot easier to handle). >> >> Currently 'with gil' isn't merged into that branch, and if it will, it >> will be disallowed, as I'm not yet sure how (if at all) it could be >> handled with regard to exceptions. It seems a lot easier to disallow >> it and have the user write a 'with gil' function, from which nothing >> can propagate. > > Not being able to propagate exceptions is a pretty strong > constraint--even if the implementation doesn't yet support it, it'd be > nice to have an API that makes it possible as a future feature. It would be possible, with some modifications to try/finally. I think it'd be best to stabilize and merge with gil first. > - Robert > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From cb at mit.edu Thu Apr 21 15:59:49 2011 From: cb at mit.edu (Chuck Blake) Date: Thu, 21 Apr 2011 09:59:49 -0400 Subject: [Cython] Build module script In-Reply-To: Message-ID: <20110421135949.GA7638@pdos.lcs.mit.edu> >>> On 21 April 2011 08:57, Vitja Makarov wrote: >> I mean simple script that run cython compiler and then build shared >> python module. >> It's much better then manually run cython then gcc or write makefile for this. > >Ah, I see. Yeah that might be convenient. Although there is always pyximport. I have a script I can contribute called pycc that might be a bit more than "simple". pycc [-c] [pyrexc|cython opts] [-- C-compiler opts] [Clink opts] as in pycc -c foo.pyx to produce foo.so in the simple case, or maybe, just as an example pycc -c -- -fmudflap foo.pyx -lmudflap if you had gcc as a compiler and wanted to turn on mudflap. The script itself is a Python script which uses an exec(file(os.environ["HOME"] + "/.pyccrc").read()) to configure things in the Python language itself for what C compiler and what default compile flags, C & Python include paths, libraries, etc. It only works on Unix right now. I'm not sure how hard it would be to adapt it to work on both Unix and Windows. It actually also works for "programs" rather than "modules" if you drop the -c as in pycc -- -I/usr/include/X11 whatever.pyx -lX11 to produce a --embed generated program called "whatever". 
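Stripped of all the configuration handling, the module path of such a wrapper boils down to something like this (a rough Unix-only sketch; build_module is a made-up name and the compiler flags are illustrative, not what pycc actually passes):

from distutils import sysconfig
import subprocess

def build_module(pyx_file, cc="gcc"):
    base = pyx_file.rsplit(".", 1)[0]
    subprocess.check_call(["cython", pyx_file])               # foo.pyx -> foo.c
    subprocess.check_call([cc, "-shared", "-fPIC", "-O2",
                           "-I" + sysconfig.get_python_inc(),
                           base + ".c", "-o", base + ".so"])  # foo.c -> foo.so
    return base + ".so"

The program case adds --embed on the cython call and links the result against libpython instead of building a shared object.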
In all, it makes for a pycc that behaves a bit more like traditional compiler front ends such as 'gcc' that take source all the way to a variety of outputs usable by the host OS. It works well in traditional Makefiles or just from the command line as one-off invocations as you're doing initial writing and testing or even "just seeing" if Cython works/helps in some isolated situation with some module or script. -- There's also some complexity that dates to pre-embed days, but which I personally find useful (or Pyrex users might find useful since I believe Pyrex does not have --embed). If you do not have the "-c" and you do have a ##MAIN## comment in the source file, it will indent all code after ##MAIN## into a "Main()" function called at startup. If there's no special comment it falls back to --embed. (we could spell this some other way if anyone wants more like the compiler option comment syntax "# cython: MAIN"). The upside of this feature is that with almost no mucking about with your Python script other than adding a comment, it makes variables local to Main() which allows optimizations in the main script logic that potentially make things much faster. I see 2X speed-ups in a few of my own speed critical programs. It probably would speed up CPython interpretation, too, but not nearly as much as the Cython operation in my various experiments. The downside of the feature is that does fiddle with global/local scope or mixed statements + defines + statments + defines. Those are all "very uncommon" patterns in my usage. Also, it does have a big ##MAIN## warning marker to let a reader know something special might be going on semantics-wise. And, hey, 2X speed for a 1-line diff. :) So, you can probably see why I keep that feature around, but I could also see if people didn't want an auto-functionizing-of-Main feature. Having it around also doesn't seem harmful if the spelling of the comment marker is nigh impossible to occur by accident. If you wanted me to delete that from the script before submission, I could, but it'd require a bit work and testing. Let me know if you want it. -- An entirely alternate proposal that might be better, but would also be more involved would be to make the "cython" program grow functionality to make it more like a traditional compiler front end. As things stand, "cython" (inheriting use-concept from Pyrex) works more like "cc -E" where output needs to fed to another stage to be compiled/linked. From d.s.seljebotn at astro.uio.no Thu Apr 21 20:13:18 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Thu, 21 Apr 2011 20:13:18 +0200 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> Message-ID: <4DB073BE.7010909@astro.uio.no> On 04/21/2011 10:37 AM, Robert Bradshaw wrote: > On Mon, Apr 18, 2011 at 7:51 AM, mark florisson > wrote: >> On 18 April 2011 16:41, Dag Sverre Seljebotn wrote: >>> Excellent! Sounds great! (as I won't have my laptop for some days I can't >>> have a look yet but I will later) >>> >>> You're right about (the current) buffers and the gil. A testcase explicitly >>> for them would be good. 
>>> >>> Firstprivate etc: i think it'd be nice myself, but it is probably better to >>> take a break from it at this point so that we can think more about that and >>> not do anything rash; perhaps open up a specific thread on them and ask for >>> more general input. Perhaps you want to take a break or task-switch to >>> something else (fused types?) until I can get around to review and merge >>> what you have so far? You'll know best what works for you though. If you >>> decide to implement explicit threadprivate variables because you've got the >>> flow I certainly wom't object myself. >>> >> Ok, cool, I'll move on :) I already included a test with a prange and >> a numpy buffer with indexing. > > Wow, you're just plowing away at this. Very cool. > > +1 to disallowing nested prange, that seems to get really messy with > little benefit. > > In terms of the CEP, I'm still unconvinced that firstprivate is not > safe to infer, but lets leave the initial values undefined rather than > specifying them to be NaNs (we can do that as an implementation if you > want), which will give us flexibility to change later once we've had a > chance to play around with it. I don't see any technical issues with inferring firstprivate, the question is whether we want to. I suggest not inferring it in order to make this safer: One should be able to just try to change a loop from "range" to "prange", and either a) have things fail very hard, or b) just work correctly and be able to trust the results. Note that when I suggest using NaN, it is as initial values for EACH ITERATION, not per-thread initialization. It is not about "firstprivate" or not, but about disabling thread-private variables entirely in favor of "per-iteration" variables. I believe that by talking about "readonly" and "per-iteration" variables, rather than "thread-shared" and "thread-private" variables, this can be used much more safely and with virtually no knowledge of the details of threading. Again, what's in my mind are scientific programmers with (too) little training. In the end it's a matter of taste and what is most convenient to more users. But I believe the case of needing real thread-private variables that preserves per-thread values across iterations (and thus also can possibly benefit from firstprivate) is seldomly enough used that an explicit declaration is OK, in particular when it buys us so much in safety in the common case. To be very precise, cdef double x, z for i in prange(n): x = f(x) z = f(i) ... goes to cdef double x, z for i in prange(n): x = z = nan x = f(x) z = f(i) ... and we leave it to the C compiler to (trivially) optimize away "z = nan". And, yes, it is a stopgap solution until we've got control flow analysis so that we can outright disallow such uses of x (without threadprivate declaration, which also gives firstprivate behaviour). > > The "cdef threadlocal(int) foo" declaration syntax feels odd to me... > We also probably want some way of explicitly marking a variable as > shared and still be able to assign to/flush/sync it. Perhaps the > parallel context could be used for these declarations, i.e. > > with parallel(threadlocal=a, shared=(b,c)): > ... > > which would be considered an "expert" usecase. I'm not set on the syntax for threadlocal variables; although your proposal feels funny/very unpythonic, almost like a C macro. For some inspiration, here's the Python solution (with no obvious place to put the type): import threading mydata = threading.local() mydata.myvar = ... 
# value is threadprivate > For all the discussion of threadsavailable/threadid, the most common > usecase I see is for allocating a large shared buffer and partitioning > it. This seems better handled by allocating separate thread-local > buffers, no? I still like the context idea, but everything in a > parallel block before and after the loop(s) also seems like a natural > place to put any setup/teardown code (though the context has the > advantage that __exit__ is always called, even if exceptions are > raised, which makes cleanup a lot easier to handle). I'd *really* like to have try/finally available in cython.parallel block for this, although I realize that may have to wait for a while. A big part of our discussions at the workshop were about how to handle exceptions; I guess there'll be a "phase 2" of this where break/continue/raise is dealt with. Dag Sverre From robertwb at math.washington.edu Thu Apr 21 21:04:24 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 21 Apr 2011 12:04:24 -0700 Subject: [Cython] prange CEP updated In-Reply-To: <4DB073BE.7010909@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> <4DB073BE.7010909@astro.uio.no> Message-ID: On Thu, Apr 21, 2011 at 11:13 AM, Dag Sverre Seljebotn wrote: > On 04/21/2011 10:37 AM, Robert Bradshaw wrote: >> >> In terms of the CEP, I'm still unconvinced that firstprivate is not >> safe to infer, but lets leave the initial values undefined rather than >> specifying them to be NaNs (we can do that as an implementation if you >> want), which will give us flexibility to change later once we've had a >> chance to play around with it. > > I don't see any technical issues with inferring firstprivate, the question > is whether we want to. I suggest not inferring it in order to make this > safer: One should be able to just try to change a loop from "range" to > "prange", and either a) have things fail very hard, or b) just work > correctly and be able to trust the results. > > Note that when I suggest using NaN, it is as initial values for EACH > ITERATION, not per-thread initialization. It is not about "firstprivate" or > not, but about disabling thread-private variables entirely in favor of > "per-iteration" variables. > > I believe that by talking about "readonly" and "per-iteration" variables, > rather than "thread-shared" and "thread-private" variables, this can be used > much more safely and with virtually no knowledge of the details of > threading. Again, what's in my mind are scientific programmers with (too) > little training. > > In the end it's a matter of taste and what is most convenient to more users. > But I believe the case of needing real thread-private variables that > preserves per-thread values across iterations (and thus also can possibly > benefit from firstprivate) is seldomly enough used that an explicit > declaration is OK, in particular when it buys us so much in safety in the > common case. > > To be very precise, > > cdef double x, z > for i in prange(n): > ? ?x = f(x) > ? ?z = f(i) > ? ?... > > goes to > > cdef double x, z > for i in prange(n): > ? ?x = z = nan > ? ?x = f(x) > ? ?z = f(i) > ? ?... > > and we leave it to the C compiler to (trivially) optimize away "z = nan". 
> And, yes, it is a stopgap solution until we've got control flow analysis so > that we can outright disallow such uses of x (without threadprivate > declaration, which also gives firstprivate behaviour). OK, I had totally missed that these are per-iteration. In that case, it makes more sense. >> The "cdef threadlocal(int) foo" declaration syntax feels odd to me... >> We also probably want some way of explicitly marking a variable as >> shared and still be able to assign to/flush/sync it. Perhaps the >> parallel context could be used for these declarations, i.e. >> >> ? ? with parallel(threadlocal=a, shared=(b,c)): >> ? ? ? ? ... >> >> which would be considered an "expert" usecase. > > I'm not set on the syntax for threadlocal variables; although your proposal > feels funny/very unpythonic, almost like a C macro. For some inspiration, > here's the Python solution (with no obvious place to put the type): > > import threading > mydata = threading.local() > mydata.myvar = ... # value is threadprivate That's nice and Pythonic, though I'm not sure how we would handle typing and the passing of "mydata" around if we wanted to go that route. We have cython.locals, we could introduce cython.parallel.threadlocals(a=int), though this is a bit magical as well. >> For all the discussion of threadsavailable/threadid, the most common >> usecase I see is for allocating a large shared buffer and partitioning >> it. This seems better handled by allocating separate thread-local >> buffers, no? I still like the context idea, but everything in a >> parallel block before and after the loop(s) also seems like a natural >> place to put any setup/teardown code (though the context has the >> advantage that __exit__ is always called, even if exceptions are >> raised, which makes cleanup a lot easier to handle). > > I'd *really* like to have try/finally available in cython.parallel block for > this, although I realize that may have to wait for a while. A big part of > our discussions at the workshop were about how to handle exceptions; I guess > there'll be a "phase 2" of this where break/continue/raise is dealt with. Yeah, this is definitely an (important) second or third phase. - Robert From sturla at molden.no Fri Apr 22 23:40:55 2011 From: sturla at molden.no (Sturla Molden) Date: Fri, 22 Apr 2011 23:40:55 +0200 Subject: [Cython] [SciPy-User] Central File Exchange for Scipy In-Reply-To: References: <4DAD0411.6010405@creativetrax.com> <4DADCADA.1010905@creativetrax.com> <4DADD381.60206@creativetrax.com> <4DADF03C.3080804@creativetrax.com> <4DAE15F8.4090007@creativetrax.com> <4DAFE5C1.2090705@creativetrax.com> Message-ID: <4DB1F5E7.2050502@molden.no> Den 22.04.2011 23:22, skrev josef.pktd at gmail.com: > > after a facelift > > http://www.mathworks.com/matlabcentral/fileexchange/?sort=date_desc_updated&term= > > Josef > This is indeed something I miss for scientific Python as well. I also miss a similar file exchange for Cython (not limited to scientific computing). 
Here is another site for comparison (yes I know about the SciPy cookbook): http://code.activestate.com/recipes/ Sturla From stefan_ml at behnel.de Mon Apr 25 07:52:05 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 25 Apr 2011 07:52:05 +0200 Subject: [Cython] Hudson pyregr testing takes too long In-Reply-To: <4D9DA560.8070505@behnel.de> References: <4D9DA400.4060105@behnel.de> <4D9DA560.8070505@behnel.de> Message-ID: <4DB50C05.5060701@behnel.de> Stefan Behnel, 07.04.2011 13:52: > Stefan Behnel, 07.04.2011 13:46: >> I just noticed that the CPython pyregr tests have jumped up from ~14 >> minutes for a run to ~4 hours when we added generator support. >> >> https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-pyregr-py26-c/buildTimeTrend >> >> I currently have no idea why that is (well, it's likely because we compile >> more tests now, but Vitja's branch ran the tests in ~30 minutes). It would >> be great if someone could find the time to analyse this problem. The >> current run time makes it basically impossible to keep these tests enabled. > > Ok, it looks like this is mostly an issue with the Py2.6 tests. The Py2.7 > tests take 30-45 minutes, which is very long, but not completely out of > bounds. I've disabled the Py2.6 pyregr tests for now. There seems to be a huge memory leak which almost certainly accounts for this. The Python process that runs the pyregr suite ends up with about 50GB of memory at the end, also in the latest Py3k builds. I have no idea where it may be, but it started to show when we merged the generator support. That's where I noticed the instant jump in the runtime. Stefan From vitja.makarov at gmail.com Mon Apr 25 08:19:05 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Mon, 25 Apr 2011 10:19:05 +0400 Subject: [Cython] Hudson pyregr testing takes too long In-Reply-To: <4DB50C05.5060701@behnel.de> References: <4D9DA400.4060105@behnel.de> <4D9DA560.8070505@behnel.de> <4DB50C05.5060701@behnel.de> Message-ID: 2011/4/25 Stefan Behnel : > Stefan Behnel, 07.04.2011 13:52: >> >> Stefan Behnel, 07.04.2011 13:46: >>> >>> I just noticed that the CPython pyregr tests have jumped up from ~14 >>> minutes for a run to ~4 hours when we added generator support. >>> >>> >>> https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-pyregr-py26-c/buildTimeTrend >>> >>> I currently have no idea why that is (well, it's likely because we >>> compile >>> more tests now, but Vitja's branch ran the tests in ~30 minutes). It >>> would >>> be great if someone could find the time to analyse this problem. The >>> current run time makes it basically impossible to keep these tests >>> enabled. >> >> Ok, it looks like this is mostly an issue with the Py2.6 tests. The Py2.7 >> tests take 30-45 minutes, which is very long, but not completely out of >> bounds. I've disabled the Py2.6 pyregr tests for now. > > There seems to be a huge memory leak which almost certainly accounts for > this. The Python process that runs the pyregr suite ends up with about 50GB > of memory at the end, also in the latest Py3k builds. > > I have no idea where it may be, but it started to show when we merged the > generator support. That's where I noticed the instant jump in the runtime. > That's very strange for my branch it takes about 30 minutes that is ok. -- vitja. 
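One way to narrow such a leak down on a --with-pydebug build is to watch sys.gettotalrefcount() across repeated calls; a sketch, where generators/with_outer refer to the compiled generators.pyx test module and total_refs is a made-up helper:

import sys, gc
import generators as G

def total_refs():
    gc.collect()
    return sys.gettotalrefcount()       # only available in pydebug builds

list(G.with_outer(1, 2, 3, 4)())        # first call may set up caches
before = total_refs()
for _ in range(100):
    list(G.with_outer(1, 2, 3, 4)())
after = total_refs()
print(after - before)                   # should stay near zero if nothing leaks per call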
From stefan_ml at behnel.de Mon Apr 25 09:20:26 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 25 Apr 2011 09:20:26 +0200 Subject: [Cython] Hudson pyregr testing takes too long In-Reply-To: References: <4D9DA400.4060105@behnel.de> <4D9DA560.8070505@behnel.de> <4DB50C05.5060701@behnel.de> Message-ID: <4DB520BA.3060109@behnel.de> Vitja Makarov, 25.04.2011 08:19: > 2011/4/25 Stefan Behnel: >> Stefan Behnel, 07.04.2011 13:52: >>> >>> Stefan Behnel, 07.04.2011 13:46: >>>> >>>> I just noticed that the CPython pyregr tests have jumped up from ~14 >>>> minutes for a run to ~4 hours when we added generator support. >>>> >>>> >>>> https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-pyregr-py26-c/buildTimeTrend >>>> >>>> I currently have no idea why that is (well, it's likely because we >>>> compile >>>> more tests now, but Vitja's branch ran the tests in ~30 minutes). It >>>> would >>>> be great if someone could find the time to analyse this problem. The >>>> current run time makes it basically impossible to keep these tests >>>> enabled. >>> >>> Ok, it looks like this is mostly an issue with the Py2.6 tests. The Py2.7 >>> tests take 30-45 minutes, which is very long, but not completely out of >>> bounds. I've disabled the Py2.6 pyregr tests for now. >> >> There seems to be a huge memory leak which almost certainly accounts for >> this. The Python process that runs the pyregr suite ends up with about 50GB >> of memory at the end, also in the latest Py3k builds. >> >> I have no idea where it may be, but it started to show when we merged the >> generator support. That's where I noticed the instant jump in the runtime. > > That's very strange for my branch it takes about 30 minutes that is ok. Does your branch leak memory when you run a generator? Using a debug build of CPython, running a Cython generator for the first time persistently increases the reference counts for me (using the generators.pyx test): Python 2.7.1+ (2.7:c821d3d335e8, Apr 22 2011, 18:37:12) [GCC 4.4.3] on linux2 >>> import generators as G [17021 refs] >>> import gc [17462 refs] >>> gc.collect() 0 [17465 refs] >>> gc.collect() 0 [17465 refs] >>> list(G.with_outer(1,2,3,4)()) [1, 2, 3, 4] [17474 refs] >>> gc.collect() 0 [17470 refs] It seems like this leaked five references. And it seems to be related to the generator being inside of a closure itself: >>> list(G.test_first_assignment()) [5, 10, (5, 10)] [17475 refs] >>> gc.collect() 0 [17470 refs] back to the last value here, but: >>> list(G.generator_nonlocal()(5)) [2, 3, 4, 5, 6] [17481 refs] >>> gc.collect() 0 [17476 refs] Another six references leaked. And it's only the first time the generator is run, running it a second time doesn't change anything: >>> list(G.generator_nonlocal()(5)) [2, 3, 4, 5, 6] [17481 refs] >>> gc.collect() 0 [17476 refs] Stefan From stefan_ml at behnel.de Mon Apr 25 10:49:05 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 25 Apr 2011 10:49:05 +0200 Subject: [Cython] Hudson pyregr testing takes too long In-Reply-To: References: <4D9DA400.4060105@behnel.de> <4D9DA560.8070505@behnel.de> <4DB50C05.5060701@behnel.de> Message-ID: <4DB53581.5090505@behnel.de> Vitja Makarov, 25.04.2011 08:19: > 2011/4/25 Stefan Behnel: >> Stefan Behnel, 07.04.2011 13:52: >>> >>> Stefan Behnel, 07.04.2011 13:46: >>>> >>>> I just noticed that the CPython pyregr tests have jumped up from ~14 >>>> minutes for a run to ~4 hours when we added generator support. 
>>>> >>>> >>>> https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-pyregr-py26-c/buildTimeTrend >>>> >>>> I currently have no idea why that is (well, it's likely because we >>>> compile >>>> more tests now, but Vitja's branch ran the tests in ~30 minutes). It >>>> would >>>> be great if someone could find the time to analyse this problem. The >>>> current run time makes it basically impossible to keep these tests >>>> enabled. >>> >>> Ok, it looks like this is mostly an issue with the Py2.6 tests. The Py2.7 >>> tests take 30-45 minutes, which is very long, but not completely out of >>> bounds. I've disabled the Py2.6 pyregr tests for now. >> >> There seems to be a huge memory leak which almost certainly accounts for >> this. The Python process that runs the pyregr suite ends up with about 50GB >> of memory at the end, also in the latest Py3k builds. >> >> I have no idea where it may be, but it started to show when we merged the >> generator support. That's where I noticed the instant jump in the runtime. > > That's very strange for my branch it takes about 30 minutes that is ok. There's also a second path that's worth investigating. As part of the merge, there was another change that came in: the CythonPyregrTestCase implementation. This means that the regression tests are now being run differently than before. The massive memory consumption may simply be due to the mass of unit tests being loaded into memory. Stefan From vitja.makarov at gmail.com Mon Apr 25 11:04:48 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Mon, 25 Apr 2011 13:04:48 +0400 Subject: [Cython] Hudson pyregr testing takes too long In-Reply-To: <4DB53581.5090505@behnel.de> References: <4D9DA400.4060105@behnel.de> <4D9DA560.8070505@behnel.de> <4DB50C05.5060701@behnel.de> <4DB53581.5090505@behnel.de> Message-ID: 2011/4/25 Stefan Behnel : > Vitja Makarov, 25.04.2011 08:19: >> >> 2011/4/25 Stefan Behnel: >>> >>> Stefan Behnel, 07.04.2011 13:52: >>>> >>>> Stefan Behnel, 07.04.2011 13:46: >>>>> >>>>> I just noticed that the CPython pyregr tests have jumped up from ~14 >>>>> minutes for a run to ~4 hours when we added generator support. >>>>> >>>>> >>>>> >>>>> https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-pyregr-py26-c/buildTimeTrend >>>>> >>>>> I currently have no idea why that is (well, it's likely because we >>>>> compile >>>>> more tests now, but Vitja's branch ran the tests in ~30 minutes). It >>>>> would >>>>> be great if someone could find the time to analyse this problem. The >>>>> current run time makes it basically impossible to keep these tests >>>>> enabled. >>>> >>>> Ok, it looks like this is mostly an issue with the Py2.6 tests. The >>>> Py2.7 >>>> tests take 30-45 minutes, which is very long, but not completely out of >>>> bounds. I've disabled the Py2.6 pyregr tests for now. >>> >>> There seems to be a huge memory leak which almost certainly accounts for >>> this. The Python process that runs the pyregr suite ends up with about >>> 50GB >>> of memory at the end, also in the latest Py3k builds. >>> >>> I have no idea where it may be, but it started to show when we merged the >>> generator support. That's where I noticed the instant jump in the >>> runtime. >> >> That's very strange for my branch it takes about 30 minutes that is ok. > > There's also a second path that's worth investigating. As part of the merge, > there was another change that came in: the CythonPyregrTestCase > implementation. 
This means that the regression tests are now being run > differently than before. The massive memory consumption may simply be due to > the mass of unit tests being loaded into memory. > def run_test(): .................................. try: module = __import__(self.module) if hasattr(module, 'test_main'): module.test_main() except (unittest.SkipTest, support.ResourceDenied): result.addSkip(self, 'ok') It seems that all the modules stay loaded so may be they should be unloaded with del sys.modules[module_name]? -- vitja. From stefan_ml at behnel.de Mon Apr 25 11:25:18 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 25 Apr 2011 11:25:18 +0200 Subject: [Cython] Hudson pyregr testing takes too long In-Reply-To: References: <4D9DA400.4060105@behnel.de> <4D9DA560.8070505@behnel.de> <4DB50C05.5060701@behnel.de> <4DB53581.5090505@behnel.de> Message-ID: <4DB53DFE.9060601@behnel.de> Vitja Makarov, 25.04.2011 11:04: > 2011/4/25 Stefan Behnel: >> Vitja Makarov, 25.04.2011 08:19: >>> >>> 2011/4/25 Stefan Behnel: >>>> >>>> Stefan Behnel, 07.04.2011 13:52: >>>>> >>>>> Stefan Behnel, 07.04.2011 13:46: >>>>>> >>>>>> I just noticed that the CPython pyregr tests have jumped up from ~14 >>>>>> minutes for a run to ~4 hours when we added generator support. >>>>>> >>>>>> >>>>>> >>>>>> https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-pyregr-py26-c/buildTimeTrend >>>>>> >>>>>> I currently have no idea why that is (well, it's likely because we >>>>>> compile >>>>>> more tests now, but Vitja's branch ran the tests in ~30 minutes). It >>>>>> would >>>>>> be great if someone could find the time to analyse this problem. The >>>>>> current run time makes it basically impossible to keep these tests >>>>>> enabled. >>>>> >>>>> Ok, it looks like this is mostly an issue with the Py2.6 tests. The >>>>> Py2.7 >>>>> tests take 30-45 minutes, which is very long, but not completely out of >>>>> bounds. I've disabled the Py2.6 pyregr tests for now. >>>> >>>> There seems to be a huge memory leak which almost certainly accounts for >>>> this. The Python process that runs the pyregr suite ends up with about >>>> 50GB >>>> of memory at the end, also in the latest Py3k builds. >>>> >>>> I have no idea where it may be, but it started to show when we merged the >>>> generator support. That's where I noticed the instant jump in the >>>> runtime. >>> >>> That's very strange for my branch it takes about 30 minutes that is ok. >> >> There's also a second path that's worth investigating. As part of the merge, >> there was another change that came in: the CythonPyregrTestCase >> implementation. This means that the regression tests are now being run >> differently than before. The massive memory consumption may simply be due to >> the mass of unit tests being loaded into memory. > > def run_test(): > .................................. > try: > module = __import__(self.module) > if hasattr(module, 'test_main'): > module.test_main() > except (unittest.SkipTest, support.ResourceDenied): > result.addSkip(self, 'ok') > > > It seems that all the modules stay loaded so may be they should be > unloaded with del sys.modules[module_name]? (Binary) module unloading isn't really supported in CPython. There's PEP 3121 that has the potential to change it, but it's not completely implemented, neither in CPython nor in Cython. A major problem is that unloading a module deletes its globals but not necessarily the code that uses them. For example, instances of types defined in the module can still be alive at that point. 
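A tiny pure-Python illustration of why removing the sys.modules entry is not enough (fractions is used purely as a stand-in module here):

import sys, gc
import fractions

f = fractions.Fraction(1, 3)
OldFraction = fractions.Fraction

del sys.modules['fractions']               # "unload" the module
del fractions
gc.collect()

import fractions                           # re-executes the module from scratch
print(fractions.Fraction is OldFraction)   # False: there are two copies of the class now
print(isinstance(f, fractions.Fraction))   # False: old instances keep the old code alive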
The way runtests.py deals with this is forking before loading a module. However, this does not currently work with the "xmlrunner" which we use on Hudson, so we let all tests run in a single process there. Stefan From vitja.makarov at gmail.com Mon Apr 25 18:51:34 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Mon, 25 Apr 2011 20:51:34 +0400 Subject: [Cython] Hudson pyregr testing takes too long In-Reply-To: <4DB53DFE.9060601@behnel.de> References: <4D9DA400.4060105@behnel.de> <4D9DA560.8070505@behnel.de> <4DB50C05.5060701@behnel.de> <4DB53581.5090505@behnel.de> <4DB53DFE.9060601@behnel.de> Message-ID: 2011/4/25 Stefan Behnel : > Vitja Makarov, 25.04.2011 11:04: >> >> 2011/4/25 Stefan Behnel: >>> >>> Vitja Makarov, 25.04.2011 08:19: >>>> >>>> 2011/4/25 Stefan Behnel: >>>>> >>>>> Stefan Behnel, 07.04.2011 13:52: >>>>>> >>>>>> Stefan Behnel, 07.04.2011 13:46: >>>>>>> >>>>>>> I just noticed that the CPython pyregr tests have jumped up from ~14 >>>>>>> minutes for a run to ~4 hours when we added generator support. >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-pyregr-py26-c/buildTimeTrend >>>>>>> >>>>>>> I currently have no idea why that is (well, it's likely because we >>>>>>> compile >>>>>>> more tests now, but Vitja's branch ran the tests in ~30 minutes). It >>>>>>> would >>>>>>> be great if someone could find the time to analyse this problem. The >>>>>>> current run time makes it basically impossible to keep these tests >>>>>>> enabled. >>>>>> >>>>>> Ok, it looks like this is mostly an issue with the Py2.6 tests. The >>>>>> Py2.7 >>>>>> tests take 30-45 minutes, which is very long, but not completely out >>>>>> of >>>>>> bounds. I've disabled the Py2.6 pyregr tests for now. >>>>> >>>>> There seems to be a huge memory leak which almost certainly accounts >>>>> for >>>>> this. The Python process that runs the pyregr suite ends up with about >>>>> 50GB >>>>> of memory at the end, also in the latest Py3k builds. >>>>> >>>>> I have no idea where it may be, but it started to show when we merged >>>>> the >>>>> generator support. That's where I noticed the instant jump in the >>>>> runtime. >>>> >>>> That's very strange for my branch it takes about 30 minutes that is ok. >>> >>> There's also a second path that's worth investigating. As part of the >>> merge, >>> there was another change that came in: the CythonPyregrTestCase >>> implementation. This means that the regression tests are now being run >>> differently than before. The massive memory consumption may simply be due >>> to >>> the mass of unit tests being loaded into memory. >> >> ? ?def run_test(): >> .................................. >> ? ? ? ? try: >> ? ? ? ? ? ? module = __import__(self.module) >> ? ? ? ? ? ? if hasattr(module, 'test_main'): >> ? ? ? ? ? ? ? ? module.test_main() >> ? ? ? ? except (unittest.SkipTest, support.ResourceDenied): >> ? ? ? ? ? ? result.addSkip(self, 'ok') >> >> >> It seems that all the modules stay loaded so may be they should be >> unloaded with del sys.modules[module_name]? > > (Binary) module unloading isn't really supported in CPython. There's PEP > 3121 that has the potential to change it, but it's not completely > implemented, neither in CPython nor in Cython. A major problem is that > unloading a module deletes its globals but not necessarily the code that > uses them. For example, instances of types defined in the module can still > be alive at that point. 
> > The way runtests.py deals with this is forking before loading a module. > However, this does not currently work with the "xmlrunner" which we use on > Hudson, so we let all tests run in a single process there. > Btw when running plain python code with generators total ref counter doesn't get back to initial value. I tried to trace scope and generator destructors and they are run as expected. So I'm not sure about leaks in generators. -- vitja. From markflorisson88 at gmail.com Tue Apr 26 16:23:51 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 26 Apr 2011 16:23:51 +0200 Subject: [Cython] Fused Types Message-ID: Hey, I've been working a bit on fused types (http://wiki.cython.org/enhancements/fusedtypes), and I've got it to generate code for every permutation of specific types. Currently it only works for cdef functions. Now in SimpleCallNode I'm trying to use PyrexTypes.best_match() to find the function with the best signature, but it doesn't seem to take care of coercions, e.g. it calls 'assignable_from' on the dst_type (e.g. char *), but for BuiltinObjectType (a str) this returns false. Why is this the case, don't calls to overloaded C++ methods need to dispatch properly here also? Other issues are public and api declarations. So should we support that at all? We could define a macro that calls a function that does the dispatch. So e.g. for this ctypedef cython.fused_type(typeA, typeB) dtype cdef func(dtype x): ... we would get two generated functions, say, __pyx_typeA_func and __pyx_typeB_func. So we could have a macro get_func(dtype) or something that then substitutes __pyx_get_func(#dtype), where __pyx_get_func returns the pointer to the right function based on the type names. I'm not sure we should support it, right now I just put the mangled names in the header. At least the cdef functions will be sharable between Cython implementation files. I also noticed that for cdef functions with optional argument it gets a struct as argument, but this struct doesn't seem to end up in the header file when the function is declared public. I believe that also the typedefs for ctypedef-ed things in the .pyx file don't make it there when used to type a cdef function's arguments. Should that be fixed? Cheers, Mark From markflorisson88 at gmail.com Tue Apr 26 16:25:38 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 26 Apr 2011 16:25:38 +0200 Subject: [Cython] prange CEP updated In-Reply-To: <4DB073BE.7010909@astro.uio.no> References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> <4DB073BE.7010909@astro.uio.no> Message-ID: On 21 April 2011 20:13, Dag Sverre Seljebotn wrote: > On 04/21/2011 10:37 AM, Robert Bradshaw wrote: >> >> On Mon, Apr 18, 2011 at 7:51 AM, mark florisson >> ?wrote: >>> >>> On 18 April 2011 16:41, Dag Sverre Seljebotn >>> ?wrote: >>>> >>>> Excellent! Sounds great! (as I won't have my laptop for some days I >>>> can't >>>> have a look yet but I will later) >>>> >>>> You're right about (the current) buffers and the gil. A testcase >>>> explicitly >>>> for them would be good. 
>>>> >>>> Firstprivate etc: i think it'd be nice myself, but it is probably better >>>> to >>>> take a break from it at this point so that we can think more about that >>>> and >>>> not do anything rash; perhaps open up a specific thread on them and ask >>>> for >>>> more general input. Perhaps you want to take a break or task-switch to >>>> something else (fused types?) until I can get around to review and merge >>>> what you have so far? You'll know best what works for you though. If you >>>> decide to implement explicit threadprivate variables because you've got >>>> the >>>> flow I certainly wom't object myself. >>>> >>> ?Ok, cool, I'll move on :) I already included a test with a prange and >>> a numpy buffer with indexing. >> >> Wow, you're just plowing away at this. Very cool. >> >> +1 to disallowing nested prange, that seems to get really messy with >> little benefit. >> >> In terms of the CEP, I'm still unconvinced that firstprivate is not >> safe to infer, but lets leave the initial values undefined rather than >> specifying them to be NaNs (we can do that as an implementation if you >> want), which will give us flexibility to change later once we've had a >> chance to play around with it. > > I don't see any technical issues with inferring firstprivate, the question > is whether we want to. I suggest not inferring it in order to make this > safer: One should be able to just try to change a loop from "range" to > "prange", and either a) have things fail very hard, or b) just work > correctly and be able to trust the results. > > Note that when I suggest using NaN, it is as initial values for EACH > ITERATION, not per-thread initialization. It is not about "firstprivate" or > not, but about disabling thread-private variables entirely in favor of > "per-iteration" variables. > > I believe that by talking about "readonly" and "per-iteration" variables, > rather than "thread-shared" and "thread-private" variables, this can be used > much more safely and with virtually no knowledge of the details of > threading. Again, what's in my mind are scientific programmers with (too) > little training. > > In the end it's a matter of taste and what is most convenient to more users. > But I believe the case of needing real thread-private variables that > preserves per-thread values across iterations (and thus also can possibly > benefit from firstprivate) is seldomly enough used that an explicit > declaration is OK, in particular when it buys us so much in safety in the > common case. > > To be very precise, > > cdef double x, z > for i in prange(n): > ? ?x = f(x) > ? ?z = f(i) > ? ?... > > goes to > > cdef double x, z > for i in prange(n): > ? ?x = z = nan > ? ?x = f(x) > ? ?z = f(i) > ? ?... > > and we leave it to the C compiler to (trivially) optimize away "z = nan". > And, yes, it is a stopgap solution until we've got control flow analysis so > that we can outright disallow such uses of x (without threadprivate > declaration, which also gives firstprivate behaviour). > Ah, I see, sure, that sounds sensible. I'm currently working on fused types, so when I finish that up I'll return to that. > >> >> The "cdef threadlocal(int) foo" declaration syntax feels odd to me... >> We also probably want some way of explicitly marking a variable as >> shared and still be able to assign to/flush/sync it. Perhaps the >> parallel context could be used for these declarations, i.e. >> >> ? ? with parallel(threadlocal=a, shared=(b,c)): >> ? ? ? ? ... >> >> which would be considered an "expert" usecase. 
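Spelled out as (hypothetical) user code -- prange and its import path are taken from the CEP and are not a released feature yet; n and f() are placeholders:

from cython.parallel import prange   # import path as proposed in the CEP

cdef double x, z
cdef Py_ssize_t i
for i in prange(n):
    z = f(i)   # fine: z is assigned before it is read in every iteration
    x = f(x)   # reads a per-iteration value that was never assigned; the
               # implicit "x = z = nan" at the top of each iteration turns
               # this into visible NaNs rather than silently wrong results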
> > I'm not set on the syntax for threadlocal variables; although your proposal > feels funny/very unpythonic, almost like a C macro. For some inspiration, > here's the Python solution (with no obvious place to put the type): > > import threading > mydata = threading.local() > mydata.myvar = ... # value is threadprivate > >> For all the discussion of threadsavailable/threadid, the most common >> usecase I see is for allocating a large shared buffer and partitioning >> it. This seems better handled by allocating separate thread-local >> buffers, no? I still like the context idea, but everything in a >> parallel block before and after the loop(s) also seems like a natural >> place to put any setup/teardown code (though the context has the >> advantage that __exit__ is always called, even if exceptions are >> raised, which makes cleanup a lot easier to handle). > > I'd *really* like to have try/finally available in cython.parallel block for > this, although I realize that may have to wait for a while. A big part of > our discussions at the workshop were about how to handle exceptions; I guess > there'll be a "phase 2" of this where break/continue/raise is dealt with. I'll leave that until I finish fused types and the typed memory views. Before I'd start on that I'd first review the with gil block and ensure the tests pass in all python versions, and perhaps that should be merged first before I pull it into the parallel branch? Otherwise you're kind of forced to review both branches. > Dag Sverre > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From stefan_ml at behnel.de Tue Apr 26 16:43:06 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 26 Apr 2011 16:43:06 +0200 Subject: [Cython] Fused Types In-Reply-To: References: Message-ID: <4DB6D9FA.9030405@behnel.de> mark florisson, 26.04.2011 16:23: > I've been working a bit on fused types > (http://wiki.cython.org/enhancements/fusedtypes), and I've got it to > generate code for every permutation of specific types. Currently it > only works for cdef functions. Now in SimpleCallNode I'm trying to use > PyrexTypes.best_match() to find the function with the best signature, > but it doesn't seem to take care of coercions, e.g. it calls > 'assignable_from' on the dst_type (e.g. char *), but for > BuiltinObjectType (a str) this returns false. Which is correct. "char*" cannot coerce from/to "str". It can coerce to "bytes", though. http://wiki.cython.org/enhancements/stringliterals > Why is this the case, > don't calls to overloaded C++ methods need to dispatch properly here > also? If this doesn't work, I assume it just isn't implemented. > Other issues are public and api declarations. So should we support > that at all? We could define a macro that calls a function that does > the dispatch. So e.g. for this > > ctypedef cython.fused_type(typeA, typeB) dtype > > cdef func(dtype x): > ... > > we would get two generated functions, say, __pyx_typeA_func and > __pyx_typeB_func. So we could have a macro get_func(dtype) or > something that then substitutes __pyx_get_func(#dtype), where > __pyx_get_func returns the pointer to the right function based on the > type names. I'm not sure we should support it, right now I just put > the mangled names in the header. At least the cdef functions will be > sharable between Cython implementation files. I'm fine with explicitly forbidding this for now. 
It may eventually work for Python object types where we can properly dispatch, but it won't easily work for C types. It may work in C++, though. > I also noticed that for cdef functions with optional argument it gets > a struct as argument, but this struct doesn't seem to end up in the > header file when the function is declared public. I believe that also > the typedefs for ctypedef-ed things in the .pyx file don't make it > there when used to type a cdef function's arguments. Should that be > fixed? No, I think this should also become a compiler error. These functions are not meant to be called from C code. It's a Cython convenience feature. As long as it's not correctly supported on both ends of the publicly exported C-API, it's best to keep users from using it at all. Stefan From stefan_ml at behnel.de Tue Apr 26 16:50:55 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 26 Apr 2011 16:50:55 +0200 Subject: [Cython] speed.pypy.org In-Reply-To: <4DA8A8A3.6080600@behnel.de> References: <4DA2FD5E.5090704@behnel.de> <4DA8A8A3.6080600@behnel.de> Message-ID: <4DB6DBCF.2020301@behnel.de> Stefan Behnel, 15.04.2011 22:20: > Stefan Behnel, 11.04.2011 15:08: >> I'm currently discussing with Maciej Fijalkowski (PyPy) how to get Cython >> running on speed.pypy.org (that's what I wrote "cythonrun" for). If it >> works out well, we may have it up in a couple of days. > > ... or maybe not. It may take a little longer due to lack of time on his side. > > >> I would expect that Cython won't be a big winner in this game, given that >> it will only compile plain untyped Python code. It's also going to fail >> entirely in some of the benchmarks. But I think it's worth having it up >> there, simply as a way for us to see where we are performance-wise and to >> get quick (nightly) feed-back about optimisations we try. The benchmark >> suite is also a nice set of real-world Python code that will allow us to >> find compliance issues. > > Ok, here's what I have so far. I fixed a couple of bugs in Cython and got > at least some of the benchmarks running. Note that they are actually simple > ones, only a single module. Basically all complex benchmarks fail due to > known bugs, such as Cython def functions not accepting attribute > assignments (e.g. on wrapping). There's also a problem with code that uses > platform specific names conditionally, such as WindowsError when running on > Windows. Cython complains about non-builtin names here. I'm considering to > turn that into a visible warning instead of an error, so that the name > would instead be looked up dynamically to let the code fail at runtime > *iff* it reaches the name lookup. > > Anyway, here are the numbers. I got them with "auto_cpdef" enabled, > although that doesn't even seem to make that a big difference. The baseline > is a self-compiled Python 2.7.1+ (about a month old). 
[numbers stripped] And here's the shiny graph: https://sage.math.washington.edu:8091/hudson/job/cython-devel-benchmarks-py27/lastSuccessfulBuild/artifact/chart.html It gets automatically rebuilt by this Hudson job: https://sage.math.washington.edu:8091/hudson/job/cython-devel-benchmarks-py27/ Stefan From markflorisson88 at gmail.com Tue Apr 26 17:18:50 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 26 Apr 2011 17:18:50 +0200 Subject: [Cython] Fused Types In-Reply-To: <4DB6D9FA.9030405@behnel.de> References: <4DB6D9FA.9030405@behnel.de> Message-ID: On 26 April 2011 16:43, Stefan Behnel wrote: > mark florisson, 26.04.2011 16:23: >> >> I've been working a bit on fused types >> (http://wiki.cython.org/enhancements/fusedtypes), and I've got it to >> generate code for every permutation of specific types. Currently it >> only works for cdef functions. Now in SimpleCallNode I'm trying to use >> PyrexTypes.best_match() to find the function with the best signature, >> but it doesn't seem to take care of coercions, e.g. it calls >> 'assignable_from' on the dst_type (e.g. char *), but for >> BuiltinObjectType (a str) this returns false. > > Which is correct. "char*" cannot coerce from/to "str". It can coerce to > "bytes", though. > > http://wiki.cython.org/enhancements/stringliterals Right, I see, so the thing is that I was using string literals which should be inferred as the bytes type when they are assigned to a char *. The thing is that because the type is fused, the argument may be anything, so I was hoping best_match would figure it out for me. Apparently it doesn't, and indeed, this example doesn't work: cdef extern from "Foo.h": cdef cppclass Foo: Foo(char *) Foo(int) cdef char *foo = "foo" cdef Foo* foo = new Foo("foo") # <- this doesn't work ("no suitable method found") cdef Foo* bar = new Foo(foo) # <- this works So that's pretty lame, I think I should fix that. > >> Why is this the case, >> don't calls to overloaded C++ methods need to dispatch properly here >> also? > > If this doesn't work, I assume it just isn't implemented. > > >> Other issues are public and api declarations. So should we support >> that at all? We could define a macro that calls a function that does >> the dispatch. So e.g. for this >> >> ctypedef cython.fused_type(typeA, typeB) dtype >> >> cdef func(dtype x): >> ? ? ... >> >> we would get two generated functions, say, __pyx_typeA_func and >> __pyx_typeB_func. So we could have a macro get_func(dtype) or >> something that then substitutes __pyx_get_func(#dtype), where >> __pyx_get_func returns the pointer to the right function based on the >> type names. I'm not sure we should support it, right now I just put >> the mangled names in the header. At least the cdef functions will be >> sharable between Cython implementation files. > > I'm fine with explicitly forbidding this for now. It may eventually work for > Python object types where we can properly dispatch, but it won't easily work > for C types. It may work in C++, though. > Ok, will do. >> I also noticed that for cdef functions with optional argument it gets >> a struct as argument, but this struct doesn't seem to end up in the >> header file when the function is declared public. I believe that also >> the typedefs for ctypedef-ed things in the .pyx file don't make it >> there when used to type a cdef function's arguments. Should that be >> fixed? > > No, I think this should also become a compiler error. These functions are > not meant to be called from C code. 
It's a Cython convenience feature. As > long as it's not correctly supported on both ends of the publicly exported > C-API, it's best to keep users from using it at all. Ok, I'll try to make Cython issue an error for these cases. > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From robertwb at math.washington.edu Tue Apr 26 19:52:37 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 26 Apr 2011 10:52:37 -0700 Subject: [Cython] speed.pypy.org In-Reply-To: <4DB6DBCF.2020301@behnel.de> References: <4DA2FD5E.5090704@behnel.de> <4DA8A8A3.6080600@behnel.de> <4DB6DBCF.2020301@behnel.de> Message-ID: On Tue, Apr 26, 2011 at 7:50 AM, Stefan Behnel wrote: > Stefan Behnel, 15.04.2011 22:20: >> >> Stefan Behnel, 11.04.2011 15:08: >>> >>> I'm currently discussing with Maciej Fijalkowski (PyPy) how to get Cython >>> running on speed.pypy.org (that's what I wrote "cythonrun" for). If it >>> works out well, we may have it up in a couple of days. >> >> ... or maybe not. It may take a little longer due to lack of time on his >> side. >> >> >>> I would expect that Cython won't be a big winner in this game, given that >>> it will only compile plain untyped Python code. It's also going to fail >>> entirely in some of the benchmarks. But I think it's worth having it up >>> there, simply as a way for us to see where we are performance-wise and to >>> get quick (nightly) feed-back about optimisations we try. The benchmark >>> suite is also a nice set of real-world Python code that will allow us to >>> find compliance issues. >> >> Ok, here's what I have so far. I fixed a couple of bugs in Cython and got >> at least some of the benchmarks running. Note that they are actually >> simple >> ones, only a single module. Basically all complex benchmarks fail due to >> known bugs, such as Cython def functions not accepting attribute >> assignments (e.g. on wrapping). There's also a problem with code that uses >> platform specific names conditionally, such as WindowsError when running >> on >> Windows. Cython complains about non-builtin names here. I'm considering to >> turn that into a visible warning instead of an error, so that the name >> would instead be looked up dynamically to let the code fail at runtime >> *iff* it reaches the name lookup. >> >> Anyway, here are the numbers. I got them with "auto_cpdef" enabled, >> although that doesn't even seem to make that a big difference. The >> baseline >> is a self-compiled Python 2.7.1+ (about a month old). > > [numbers stripped] > > And here's the shiny graph: > > https://sage.math.washington.edu:8091/hudson/job/cython-devel-benchmarks-py27/lastSuccessfulBuild/artifact/chart.html > > It gets automatically rebuilt by this Hudson job: > > https://sage.math.washington.edu:8091/hudson/job/cython-devel-benchmarks-py27/ Cool. Any history stored/displayed? 
- Robert From robertwb at math.washington.edu Tue Apr 26 19:59:45 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 26 Apr 2011 10:59:45 -0700 Subject: [Cython] prange CEP updated In-Reply-To: References: <4D9B7BA9.2060509@astro.uio.no> <4DA60013.6060901@astro.uio.no> <4DA73D22.9050605@astro.uio.no> <4DA743EF.7080200@astro.uio.no> <4DA74CEC.2000609@astro.uio.no> <4DA9C6DF.1060406@astro.uio.no> <62739f06-a6e9-4cd9-99bd-e3af674fdc61@email.android.com> <4DB073BE.7010909@astro.uio.no> Message-ID: On Tue, Apr 26, 2011 at 7:25 AM, mark florisson wrote: > On 21 April 2011 20:13, Dag Sverre Seljebotn wrote: >> On 04/21/2011 10:37 AM, Robert Bradshaw wrote: >>> >>> On Mon, Apr 18, 2011 at 7:51 AM, mark florisson >>> ?wrote: >>>> >>>> On 18 April 2011 16:41, Dag Sverre Seljebotn >>>> ?wrote: >>>>> >>>>> Excellent! Sounds great! (as I won't have my laptop for some days I >>>>> can't >>>>> have a look yet but I will later) >>>>> >>>>> You're right about (the current) buffers and the gil. A testcase >>>>> explicitly >>>>> for them would be good. >>>>> >>>>> Firstprivate etc: i think it'd be nice myself, but it is probably better >>>>> to >>>>> take a break from it at this point so that we can think more about that >>>>> and >>>>> not do anything rash; perhaps open up a specific thread on them and ask >>>>> for >>>>> more general input. Perhaps you want to take a break or task-switch to >>>>> something else (fused types?) until I can get around to review and merge >>>>> what you have so far? You'll know best what works for you though. If you >>>>> decide to implement explicit threadprivate variables because you've got >>>>> the >>>>> flow I certainly wom't object myself. >>>>> >>>> ?Ok, cool, I'll move on :) I already included a test with a prange and >>>> a numpy buffer with indexing. >>> >>> Wow, you're just plowing away at this. Very cool. >>> >>> +1 to disallowing nested prange, that seems to get really messy with >>> little benefit. >>> >>> In terms of the CEP, I'm still unconvinced that firstprivate is not >>> safe to infer, but lets leave the initial values undefined rather than >>> specifying them to be NaNs (we can do that as an implementation if you >>> want), which will give us flexibility to change later once we've had a >>> chance to play around with it. >> >> I don't see any technical issues with inferring firstprivate, the question >> is whether we want to. I suggest not inferring it in order to make this >> safer: One should be able to just try to change a loop from "range" to >> "prange", and either a) have things fail very hard, or b) just work >> correctly and be able to trust the results. >> >> Note that when I suggest using NaN, it is as initial values for EACH >> ITERATION, not per-thread initialization. It is not about "firstprivate" or >> not, but about disabling thread-private variables entirely in favor of >> "per-iteration" variables. >> >> I believe that by talking about "readonly" and "per-iteration" variables, >> rather than "thread-shared" and "thread-private" variables, this can be used >> much more safely and with virtually no knowledge of the details of >> threading. Again, what's in my mind are scientific programmers with (too) >> little training. >> >> In the end it's a matter of taste and what is most convenient to more users. 
>> But I believe the case of needing real thread-private variables that >> preserves per-thread values across iterations (and thus also can possibly >> benefit from firstprivate) is seldomly enough used that an explicit >> declaration is OK, in particular when it buys us so much in safety in the >> common case. >> >> To be very precise, >> >> cdef double x, z >> for i in prange(n): >> ? ?x = f(x) >> ? ?z = f(i) >> ? ?... >> >> goes to >> >> cdef double x, z >> for i in prange(n): >> ? ?x = z = nan >> ? ?x = f(x) >> ? ?z = f(i) >> ? ?... >> >> and we leave it to the C compiler to (trivially) optimize away "z = nan". >> And, yes, it is a stopgap solution until we've got control flow analysis so >> that we can outright disallow such uses of x (without threadprivate >> declaration, which also gives firstprivate behaviour). >> > Ah, I see, sure, that sounds sensible. I'm currently working on fused > types, so when I finish that up I'll return to that. >> >>> >>> The "cdef threadlocal(int) foo" declaration syntax feels odd to me... >>> We also probably want some way of explicitly marking a variable as >>> shared and still be able to assign to/flush/sync it. Perhaps the >>> parallel context could be used for these declarations, i.e. >>> >>> ? ? with parallel(threadlocal=a, shared=(b,c)): >>> ? ? ? ? ... >>> >>> which would be considered an "expert" usecase. >> >> I'm not set on the syntax for threadlocal variables; although your proposal >> feels funny/very unpythonic, almost like a C macro. For some inspiration, >> here's the Python solution (with no obvious place to put the type): >> >> import threading >> mydata = threading.local() >> mydata.myvar = ... # value is threadprivate >> >>> For all the discussion of threadsavailable/threadid, the most common >>> usecase I see is for allocating a large shared buffer and partitioning >>> it. This seems better handled by allocating separate thread-local >>> buffers, no? I still like the context idea, but everything in a >>> parallel block before and after the loop(s) also seems like a natural >>> place to put any setup/teardown code (though the context has the >>> advantage that __exit__ is always called, even if exceptions are >>> raised, which makes cleanup a lot easier to handle). >> >> I'd *really* like to have try/finally available in cython.parallel block for >> this, although I realize that may have to wait for a while. A big part of >> our discussions at the workshop were about how to handle exceptions; I guess >> there'll be a "phase 2" of this where break/continue/raise is dealt with. > > I'll leave that until I finish fused types and the typed memory views. > Before I'd start on that I'd first review the with gil block and > ensure the tests pass in all python versions, and perhaps that should > be merged first before I pull it into the parallel branch? Otherwise > you're kind of forced to review both branches. Yes, that makes sense. - Robert From robertwb at math.washington.edu Tue Apr 26 20:05:48 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 26 Apr 2011 11:05:48 -0700 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On Tue, Apr 26, 2011 at 8:18 AM, mark florisson wrote: > On 26 April 2011 16:43, Stefan Behnel wrote: >> mark florisson, 26.04.2011 16:23: >>> >>> I've been working a bit on fused types >>> (http://wiki.cython.org/enhancements/fusedtypes), and I've got it to >>> generate code for every permutation of specific types. 
Currently it >>> only works for cdef functions. Now in SimpleCallNode I'm trying to use >>> PyrexTypes.best_match() to find the function with the best signature, >>> but it doesn't seem to take care of coercions, e.g. it calls >>> 'assignable_from' on the dst_type (e.g. char *), but for >>> BuiltinObjectType (a str) this returns false. >> >> Which is correct. "char*" cannot coerce from/to "str". It can coerce to >> "bytes", though. >> >> http://wiki.cython.org/enhancements/stringliterals > > Right, I see, so the thing is that I was using string literals which > should be inferred as the bytes type when they are assigned to a char > *. The thing is that because the type is fused, the argument may be > anything, so I was hoping best_match would figure it out for me. > Apparently it doesn't, and indeed, this example doesn't work: > > cdef extern from "Foo.h": > ? ?cdef cppclass Foo: > ? ? ? ?Foo(char *) > ? ? ? ?Foo(int) > > cdef char *foo = "foo" > cdef Foo* foo = new Foo("foo") # <- this doesn't work ("no suitable > method found") > cdef Foo* bar = new Foo(foo) ? # <- this works > > So that's pretty lame, I think I should fix that. Agreed. >>> Why is this the case, >>> don't calls to overloaded C++ methods need to dispatch properly here >>> also? >> >> If this doesn't work, I assume it just isn't implemented. >> >> >>> Other issues are public and api declarations. So should we support >>> that at all? We could define a macro that calls a function that does >>> the dispatch. So e.g. for this >>> >>> ctypedef cython.fused_type(typeA, typeB) dtype >>> >>> cdef func(dtype x): >>> ? ? ... >>> >>> we would get two generated functions, say, __pyx_typeA_func and >>> __pyx_typeB_func. So we could have a macro get_func(dtype) or >>> something that then substitutes __pyx_get_func(#dtype), where >>> __pyx_get_func returns the pointer to the right function based on the >>> type names. I'm not sure we should support it, right now I just put >>> the mangled names in the header. At least the cdef functions will be >>> sharable between Cython implementation files. >> >> I'm fine with explicitly forbidding this for now. It may eventually work for >> Python object types where we can properly dispatch, but it won't easily work >> for C types. It may work in C++, though. >> > > Ok, will do. For the moment, putting mangled names in the header should be fine. A macro might make sense in the long term. Somewhat orthogonal, it could makes sense to do some dispatching on type for cpdef functions. >>> I also noticed that for cdef functions with optional argument it gets >>> a struct as argument, but this struct doesn't seem to end up in the >>> header file when the function is declared public. I believe that also >>> the typedefs for ctypedef-ed things in the .pyx file don't make it >>> there when used to type a cdef function's arguments. Should that be >>> fixed? >> >> No, I think this should also become a compiler error. These functions are >> not meant to be called from C code. It's a Cython convenience feature. As >> long as it's not correctly supported on both ends of the publicly exported >> C-API, it's best to keep users from using it at all. > > Ok, I'll try to make Cython issue an error for these cases. 
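For reference, one of the cases this error would catch looks roughly like the following sketch (the exact rules are what is being decided here):

# A public cdef function with a default argument: the generated C signature
# takes a hidden options struct that never appears in the generated header,
# so C callers could not call it correctly anyway.
cdef public int scale(int x, int factor=2):
    return x * factor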
+1 - Robert From stefan_ml at behnel.de Tue Apr 26 22:02:24 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 26 Apr 2011 22:02:24 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: <4DB724D0.8090607@behnel.de> mark florisson, 26.04.2011 17:18: > On 26 April 2011 16:43, Stefan Behnel wrote: >> mark florisson, 26.04.2011 16:23: >>> >>> I've been working a bit on fused types >>> (http://wiki.cython.org/enhancements/fusedtypes), and I've got it to >>> generate code for every permutation of specific types. Currently it >>> only works for cdef functions. Now in SimpleCallNode I'm trying to use >>> PyrexTypes.best_match() to find the function with the best signature, >>> but it doesn't seem to take care of coercions, e.g. it calls >>> 'assignable_from' on the dst_type (e.g. char *), but for >>> BuiltinObjectType (a str) this returns false. >> >> Which is correct. "char*" cannot coerce from/to "str". It can coerce to >> "bytes", though. >> >> http://wiki.cython.org/enhancements/stringliterals > > Right, I see, so the thing is that I was using string literals which > should be inferred as the bytes type when they are assigned to a char > *. The thing is that because the type is fused, the argument may be > anything, so I was hoping best_match would figure it out for me. > Apparently it doesn't, and indeed, this example doesn't work: > > cdef extern from "Foo.h": > cdef cppclass Foo: > Foo(char *) > Foo(int) > > cdef char *foo = "foo" > cdef Foo* foo = new Foo("foo") #<- this doesn't work ("no suitable > method found") > cdef Foo* bar = new Foo(foo) #<- this works > > So that's pretty lame, I think I should fix that. Well, I assume it's less lame when you use b"foo". But given the current auto-coerce semantics for unprefixed string literals in Cython, I agree that it should also auto-coerce here. Stefan From stefan_ml at behnel.de Wed Apr 27 09:45:56 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 27 Apr 2011 09:45:56 +0200 Subject: [Cython] speed.pypy.org In-Reply-To: References: <4DA2FD5E.5090704@behnel.de> <4DA8A8A3.6080600@behnel.de> <4DB6DBCF.2020301@behnel.de> Message-ID: <4DB7C9B4.9040004@behnel.de> Robert Bradshaw, 26.04.2011 19:52: > On Tue, Apr 26, 2011 at 7:50 AM, Stefan Behnel wrote: >> Stefan Behnel, 15.04.2011 22:20: >>> >>> Stefan Behnel, 11.04.2011 15:08: >>>> >>>> I'm currently discussing with Maciej Fijalkowski (PyPy) how to get Cython >>>> running on speed.pypy.org (that's what I wrote "cythonrun" for). If it >>>> works out well, we may have it up in a couple of days. >>> >>> ... or maybe not. It may take a little longer due to lack of time on his >>> side. >>> >>> >>>> I would expect that Cython won't be a big winner in this game, given that >>>> it will only compile plain untyped Python code. It's also going to fail >>>> entirely in some of the benchmarks. But I think it's worth having it up >>>> there, simply as a way for us to see where we are performance-wise and to >>>> get quick (nightly) feed-back about optimisations we try. The benchmark >>>> suite is also a nice set of real-world Python code that will allow us to >>>> find compliance issues. >>> >>> Ok, here's what I have so far. I fixed a couple of bugs in Cython and got >>> at least some of the benchmarks running. Note that they are actually >>> simple >>> ones, only a single module. Basically all complex benchmarks fail due to >>> known bugs, such as Cython def functions not accepting attribute >>> assignments (e.g. on wrapping). 
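One common pattern that runs into the attribute-assignment limitation mentioned just above is decorator wrapping (a sketch; functools.wraps setattr()s __name__/__doc__/__module__ onto the wrapper and updates its __dict__, and "cached" is an invented name):

import functools

def cached(func):
    @functools.wraps(func)      # copies __name__/__doc__ onto the wrapper;
    def wrapper(*args):         # this is the step that fails when the wrapper
        return func(*args)      # is a compiled def function that rejects
    return wrapper              # attribute assignment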
There's also a problem with code that uses >>> platform specific names conditionally, such as WindowsError when running >>> on >>> Windows. Cython complains about non-builtin names here. I'm considering to >>> turn that into a visible warning instead of an error, so that the name >>> would instead be looked up dynamically to let the code fail at runtime >>> *iff* it reaches the name lookup. >>> >>> Anyway, here are the numbers. I got them with "auto_cpdef" enabled, >>> although that doesn't even seem to make that a big difference. The >>> baseline is a self-compiled Python 2.7.1+ (about a month old). >> >> [numbers stripped] >> >> And here's the shiny graph: >> >> https://sage.math.washington.edu:8091/hudson/job/cython-devel-benchmarks-py27/lastSuccessfulBuild/artifact/chart.html >> >> It gets automatically rebuilt by this Hudson job: >> >> https://sage.math.washington.edu:8091/hudson/job/cython-devel-benchmarks-py27/ > > Cool. Any history stored/displayed? No. Also, the variances are rather large depending on the load of the machine. Hudson/Jenkins and all its subprocesses run with a high CPU niceness and I/O niceness, so don't expect reproducible results. Actually, if we want a proper history, I'd suggest a separate codespeed installation somewhere. Stefan From mantihor at gmail.com Wed Apr 27 17:45:30 2011 From: mantihor at gmail.com (Bogdan Opanchuk) Date: Thu, 28 Apr 2011 01:45:30 +1000 Subject: [Cython] Compiler error when @staticmethod is used in cdef class Message-ID: Hello, I am using Cython 0.14.1 with Python 2.7.1 on OSX 10.6.7 The following code: --- cdef class Test: @staticmethod def func(): print "Static method" --- gives error when being cythoned: "Method func has wrong number of arguments (0 declared, 1 or more expected)" And if one adds some parameters to this method (like "def func(a, b):"), cython crashes with exception: --- ... long stack trace ... File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/Cython-0.14-py2.7-macosx-10.6-intel.egg/Cython/Compiler/TypeSlots.py", line 100, in fixed_arg_type return self.format_map[self.fixed_arg_format[i]] KeyError: 'T' --- So, is there a way to create static methods in cdef classes? This workaround seems to work, but it is a bit ugly: --- def func(a, b): print "Static method" cdef class Test: func = func --- Best regards, Bogdan From dalcinl at gmail.com Wed Apr 27 18:51:42 2011 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Wed, 27 Apr 2011 13:51:42 -0300 Subject: [Cython] Compiler error when @staticmethod is used in cdef class In-Reply-To: References: Message-ID: On 27 April 2011 12:45, Bogdan Opanchuk wrote: > Hello, > > I am using Cython 0.14.1 with Python 2.7.1 on OSX 10.6.7 > > The following code: > --- > cdef class Test: > ? ? ? ?@staticmethod > ? ? ? ?def func(): > ? ? ? ? ? ? ? ?print "Static method" > --- > gives error when being cythoned: "Method func has wrong number of > arguments (0 declared, 1 or more expected)" > > And if one adds some parameters to this method (like "def func(a, > b):"), cython crashes with exception: > --- > ... long stack trace ... > ?File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/Cython-0.14-py2.7-macosx-10.6-intel.egg/Cython/Compiler/TypeSlots.py", > line 100, in fixed_arg_type > ? ?return self.format_map[self.fixed_arg_format[i]] > KeyError: 'T' > --- > > So, is there a way to create static methods in cdef classes? This > workaround seems to work, but it is a bit ugly: > --- > def func(a, b): > ? ? ? 
?print "Static method" > > cdef class Test: > ? ? ? ?func = func > --- > > Best regards, > Bogdan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > This issue has bugged my from the fist day I've started to use Cython. However, never got time to look for a fix. Moreover, I think using @classmethod is better than @staticmethod. @classmethod gives you some extra context (the class) that could be useful in the case of inheritance, specially if your method is a factory function. So, as a workaround, try to use @classmethod, from an API point of view they are very similar, and you can take advantage of @classmethod extra context in the future. -- Lisandro Dalcin --------------- CIMEC (INTEC/CONICET-UNL) Predio CONICET-Santa Fe Colectora RN 168 Km 472, Paraje El Pozo 3000 Santa Fe, Argentina Tel: +54-342-4511594 (ext 1011) Tel/Fax: +54-342-4511169 From robertwb at math.washington.edu Wed Apr 27 19:08:44 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Wed, 27 Apr 2011 10:08:44 -0700 Subject: [Cython] speed.pypy.org In-Reply-To: <4DB7C9B4.9040004@behnel.de> References: <4DA2FD5E.5090704@behnel.de> <4DA8A8A3.6080600@behnel.de> <4DB6DBCF.2020301@behnel.de> <4DB7C9B4.9040004@behnel.de> Message-ID: On Wed, Apr 27, 2011 at 12:45 AM, Stefan Behnel wrote: > Robert Bradshaw, 26.04.2011 19:52: >> >> On Tue, Apr 26, 2011 at 7:50 AM, Stefan Behnel wrote: >>> >>> Stefan Behnel, 15.04.2011 22:20: >>>> >>>> Stefan Behnel, 11.04.2011 15:08: >>>>> >>>>> I'm currently discussing with Maciej Fijalkowski (PyPy) how to get >>>>> Cython >>>>> running on speed.pypy.org (that's what I wrote "cythonrun" for). If it >>>>> works out well, we may have it up in a couple of days. >>>> >>>> ... or maybe not. It may take a little longer due to lack of time on his >>>> side. >>>> >>>> >>>>> I would expect that Cython won't be a big winner in this game, given >>>>> that >>>>> it will only compile plain untyped Python code. It's also going to fail >>>>> entirely in some of the benchmarks. But I think it's worth having it up >>>>> there, simply as a way for us to see where we are performance-wise and >>>>> to >>>>> get quick (nightly) feed-back about optimisations we try. The benchmark >>>>> suite is also a nice set of real-world Python code that will allow us >>>>> to >>>>> find compliance issues. >>>> >>>> Ok, here's what I have so far. I fixed a couple of bugs in Cython and >>>> got >>>> at least some of the benchmarks running. Note that they are actually >>>> simple >>>> ones, only a single module. Basically all complex benchmarks fail due to >>>> known bugs, such as Cython def functions not accepting attribute >>>> assignments (e.g. on wrapping). There's also a problem with code that >>>> uses >>>> platform specific names conditionally, such as WindowsError when running >>>> on >>>> Windows. Cython complains about non-builtin names here. I'm considering >>>> to >>>> turn that into a visible warning instead of an error, so that the name >>>> would instead be looked up dynamically to let the code fail at runtime >>>> *iff* it reaches the name lookup. >>>> >>>> Anyway, here are the numbers. I got them with "auto_cpdef" enabled, >>>> although that doesn't even seem to make that a big difference. The >>>> baseline is a self-compiled Python 2.7.1+ (about a month old). 
>>> >>> [numbers stripped] >>> >>> And here's the shiny graph: >>> >>> >>> https://sage.math.washington.edu:8091/hudson/job/cython-devel-benchmarks-py27/lastSuccessfulBuild/artifact/chart.html >>> >>> It gets automatically rebuilt by this Hudson job: >>> >>> >>> https://sage.math.washington.edu:8091/hudson/job/cython-devel-benchmarks-py27/ >> >> Cool. Any history stored/displayed? > > No. Also, the variances are rather large depending on the load of the > machine. Of course, that would make the history rather than a snapshot all the more useful. > Hudson/Jenkins and all its subprocesses run with a high CPU > niceness and I/O niceness, so don't expect reproducible results. > > Actually, if we want a proper history, I'd suggest a separate codespeed > installation somewhere. Makes sense. How many CPU-hours does it take? If it's not to intensive, we could probably run it, say, daily as a normal-priority job. - Robert From stefan_ml at behnel.de Wed Apr 27 21:32:45 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 27 Apr 2011 21:32:45 +0200 Subject: [Cython] speed.pypy.org In-Reply-To: References: <4DA2FD5E.5090704@behnel.de> <4DA8A8A3.6080600@behnel.de> <4DB6DBCF.2020301@behnel.de> <4DB7C9B4.9040004@behnel.de> Message-ID: <4DB86F5D.2080603@behnel.de> Robert Bradshaw, 27.04.2011 19:08: > On Wed, Apr 27, 2011 at 12:45 AM, Stefan Behnel wrote: >> Actually, if we want a proper history, I'd suggest a separate codespeed >> installation somewhere. > > Makes sense. How many CPU-hours does it take? Including the Cython build, it's more like 25 minutes currently, given that we only support a smaller part of the benchmark suite. It will obviously take longer when we start supporting the larger benchmarks, such as Django templates or Twisted. > If it's not to > intensive, we could probably run it, say, daily as a normal-priority > job. We could certainly do that for now, and check again when we see that it starts running substantially longer. Stefan From stefan_ml at behnel.de Thu Apr 28 15:08:08 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Thu, 28 Apr 2011 15:08:08 +0200 Subject: [Cython] HTML Coverage reports Message-ID: <4DB966B8.2030009@behnel.de> Hi, I enabled HTML coverage report generation for the test suite. It's now available from the "HTML Coverage Reports" link of the coverage job: https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-py27-coverage/doclinks/1/ From a quick look, the test coverage seems not too bad and we are mostly lacking error tests. However, especially the branch coverage analysis hints at a couple of cases that are worth looking into. Compile time value calculation is an obvious area that is worth improving, and there are clearly some important untested code sections in ExprNodes.py: https://sage.math.washington.edu:8091/hudson/job/cython-devel-tests-py27-coverage/doclinks/1/Cython_Compiler_ExprNodes.html Stefan From markflorisson88 at gmail.com Thu Apr 28 21:48:13 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 28 Apr 2011 21:48:13 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On 26 April 2011 20:05, Robert Bradshaw wrote: > On Tue, Apr 26, 2011 at 8:18 AM, mark florisson > wrote: >> On 26 April 2011 16:43, Stefan Behnel wrote: >>> mark florisson, 26.04.2011 16:23: >>>> >>>> I've been working a bit on fused types >>>> (http://wiki.cython.org/enhancements/fusedtypes), and I've got it to >>>> generate code for every permutation of specific types. 
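As a reference for what that means in user code, a minimal sketch (fused_type syntax as in the wiki CEP; number_t and twice are invented names):

cimport cython

ctypedef cython.fused_type(int, double) number_t

cdef number_t twice(number_t x):
    # one C function is generated per specific type in number_t
    return x + x

cdef int i = 3
cdef double d = 1.5
twice(i)   # dispatches to the int specialization
twice(d)   # dispatches to the double specialization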
Currently it >>>> only works for cdef functions. Now in SimpleCallNode I'm trying to use >>>> PyrexTypes.best_match() to find the function with the best signature, >>>> but it doesn't seem to take care of coercions, e.g. it calls >>>> 'assignable_from' on the dst_type (e.g. char *), but for >>>> BuiltinObjectType (a str) this returns false. >>> >>> Which is correct. "char*" cannot coerce from/to "str". It can coerce to >>> "bytes", though. >>> >>> http://wiki.cython.org/enhancements/stringliterals >> >> Right, I see, so the thing is that I was using string literals which >> should be inferred as the bytes type when they are assigned to a char >> *. The thing is that because the type is fused, the argument may be >> anything, so I was hoping best_match would figure it out for me. >> Apparently it doesn't, and indeed, this example doesn't work: >> >> cdef extern from "Foo.h": >> ? ?cdef cppclass Foo: >> ? ? ? ?Foo(char *) >> ? ? ? ?Foo(int) >> >> cdef char *foo = "foo" >> cdef Foo* foo = new Foo("foo") # <- this doesn't work ("no suitable >> method found") >> cdef Foo* bar = new Foo(foo) ? # <- this works >> >> So that's pretty lame, I think I should fix that. > > Agreed. > >>>> Why is this the case, >>>> don't calls to overloaded C++ methods need to dispatch properly here >>>> also? >>> >>> If this doesn't work, I assume it just isn't implemented. >>> >>> >>>> Other issues are public and api declarations. So should we support >>>> that at all? We could define a macro that calls a function that does >>>> the dispatch. So e.g. for this >>>> >>>> ctypedef cython.fused_type(typeA, typeB) dtype >>>> >>>> cdef func(dtype x): >>>> ? ? ... >>>> >>>> we would get two generated functions, say, __pyx_typeA_func and >>>> __pyx_typeB_func. So we could have a macro get_func(dtype) or >>>> something that then substitutes __pyx_get_func(#dtype), where >>>> __pyx_get_func returns the pointer to the right function based on the >>>> type names. I'm not sure we should support it, right now I just put >>>> the mangled names in the header. At least the cdef functions will be >>>> sharable between Cython implementation files. >>> >>> I'm fine with explicitly forbidding this for now. It may eventually work for >>> Python object types where we can properly dispatch, but it won't easily work >>> for C types. It may work in C++, though. >>> >> >> Ok, will do. > > For the moment, putting mangled names in the header should be fine. A > macro might make sense in the long term. > > Somewhat orthogonal, it could makes sense to do some dispatching on > type for cpdef functions. > >>>> I also noticed that for cdef functions with optional argument it gets >>>> a struct as argument, but this struct doesn't seem to end up in the >>>> header file when the function is declared public. I believe that also >>>> the typedefs for ctypedef-ed things in the .pyx file don't make it >>>> there when used to type a cdef function's arguments. Should that be >>>> fixed? >>> >>> No, I think this should also become a compiler error. These functions are >>> not meant to be called from C code. It's a Cython convenience feature. As >>> long as it's not correctly supported on both ends of the publicly exported >>> C-API, it's best to keep users from using it at all. >> >> Ok, I'll try to make Cython issue an error for these cases. 
> > +1 > > - Robert > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > So I fixed all that, but I'm currently wondering about the proposed cython.typeof(). I believe it currently returns a string with the type name, and not the type itself. So I think it would be inconsistent to suddenly start allowing comparison with 'is' and 'isinstance' and such. I'm also wondering if it would be useful to allow actual type retrieval, which could be used in declarations and casts. For instance consider fusing two structs with the same attribute name but different attribute types. Perhaps in your code you want to introduce a variable compatible with such a type, e.g. consider this: ctypdef struct A: int attrib ctypedef struct B: char *attrib ctypedef cython.fused_type(A, B) struct_t cdef func(struct_t mystruct, int i): cdef cython.gettype(mystruct.attrib) var = mystruct.attrib + i ... What do you think? From stefan_ml at behnel.de Thu Apr 28 21:58:14 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Thu, 28 Apr 2011 21:58:14 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: <4DB9C6D6.6010401@behnel.de> mark florisson, 28.04.2011 21:48: > I'm currently wondering about the proposed > cython.typeof(). I believe it currently returns a string with the type > name, and not the type itself. So I think it would be inconsistent to > suddenly start allowing comparison with 'is' and 'isinstance' and > such. > > I'm also wondering if it would be useful to allow actual type > retrieval, which could be used in declarations and casts. For instance > consider fusing two structs with the same attribute name but different > attribute types. Perhaps in your code you want to introduce a variable > compatible with such a type, e.g. consider this: > > ctypdef struct A: > int attrib > > ctypedef struct B: > char *attrib > > ctypedef cython.fused_type(A, B) struct_t > > cdef func(struct_t mystruct, int i): > cdef cython.gettype(mystruct.attrib) var = mystruct.attrib + i What's wrong with type() ? Stefan From markflorisson88 at gmail.com Thu Apr 28 22:10:06 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 28 Apr 2011 22:10:06 +0200 Subject: [Cython] Fused Types In-Reply-To: <4DB9C6D6.6010401@behnel.de> References: <4DB6D9FA.9030405@behnel.de> <4DB9C6D6.6010401@behnel.de> Message-ID: On 28 April 2011 21:58, Stefan Behnel wrote: > mark florisson, 28.04.2011 21:48: >> >> I'm currently wondering about the proposed >> cython.typeof(). I believe it currently returns a string with the type >> name, and not the type itself. So I think it would be inconsistent to >> suddenly start allowing comparison with 'is' and 'isinstance' and >> such. >> >> I'm also wondering if it would be useful to allow actual type >> retrieval, which could be used in declarations and casts. For instance >> consider fusing two structs with the same attribute name but different >> attribute types. Perhaps in your code you want to introduce a variable >> compatible with such a type, e.g. consider this: >> >> ctypdef struct A: >> ? ? int attrib >> >> ctypedef struct B: >> ? ? char *attrib >> >> ctypedef cython.fused_type(A, B) struct_t >> >> cdef func(struct_t mystruct, int i): >> ? ? cdef cython.gettype(mystruct.attrib) var = mystruct.attrib + i > > What's wrong with type() ? 
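The difference at issue, in a small sketch (typeof() currently yields the static type as a string, while type() goes through a Python object):

cimport cython

cdef int x = 1
print cython.typeof(x)   # "int" -- the static Cython type, currently a string
print type(x)            # <type 'int'> -- x is first coerced to a Python
                         # object, and the builtin type() runs on that object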
> > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > When you call type you kind of expect it to call the Python builtin type(), which means it will coerce your C type to a Python object, on which it will call type(). So if you change those semantics suddenly for C types, I think we'd break people's code. From markflorisson88 at gmail.com Thu Apr 28 22:11:29 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 28 Apr 2011 22:11:29 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> <4DB9C6D6.6010401@behnel.de> Message-ID: On 28 April 2011 22:10, mark florisson wrote: > On 28 April 2011 21:58, Stefan Behnel wrote: >> mark florisson, 28.04.2011 21:48: >>> >>> I'm currently wondering about the proposed >>> cython.typeof(). I believe it currently returns a string with the type >>> name, and not the type itself. So I think it would be inconsistent to >>> suddenly start allowing comparison with 'is' and 'isinstance' and >>> such. >>> >>> I'm also wondering if it would be useful to allow actual type >>> retrieval, which could be used in declarations and casts. For instance >>> consider fusing two structs with the same attribute name but different >>> attribute types. Perhaps in your code you want to introduce a variable >>> compatible with such a type, e.g. consider this: >>> >>> ctypdef struct A: >>> ? ? int attrib >>> >>> ctypedef struct B: >>> ? ? char *attrib >>> >>> ctypedef cython.fused_type(A, B) struct_t >>> >>> cdef func(struct_t mystruct, int i): >>> ? ? cdef cython.gettype(mystruct.attrib) var = mystruct.attrib + i >> >> What's wrong with type() ? >> >> Stefan >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > > When you call type you kind of expect it to call the Python builtin > type(), which means it will coerce your C type to a Python object, on > which it will call type(). So if you change those semantics suddenly > for C types, I think we'd break people's code. > Also, you'd want this to work for Python types too, for instance for two differently typed extension type attributes From robertwb at math.washington.edu Thu Apr 28 22:12:08 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 28 Apr 2011 13:12:08 -0700 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On Thu, Apr 28, 2011 at 12:48 PM, mark florisson wrote: > So I fixed all that, but I'm currently wondering about the proposed > cython.typeof(). I believe it currently returns a string with the type > name, and not the type itself. Yes. This is just because there's not really anything better to return at this point. We should "fix" this at some point in the future. > So I think it would be inconsistent to > suddenly start allowing comparison with 'is' and 'isinstance' and > such. I'm open to other suggestions, but would like an expression that resolves at compile time to true/false (and we need to do branch pruning on it). Note that type() is not good enough, because it has different semantics, i.e. cdef object o = [] typeof(o), type(o) so lets not touch that one. > I'm also wondering if it would be useful to allow actual type > retrieval, which could be used in declarations and casts. 
For instance > consider fusing two structs with the same attribute name but different > attribute types. Perhaps in your code you want to introduce a variable > compatible with such a type, e.g. consider this: > > ctypdef struct A: > ? ?int attrib > > ctypedef struct B: > ? ?char *attrib > > ctypedef cython.fused_type(A, B) struct_t > > cdef func(struct_t mystruct, int i): > ? ?cdef cython.gettype(mystruct.attrib) var = mystruct.attrib + i > ? ?... > > What do you think? Yes, I was thinking that would be supported as well. - Robert From markflorisson88 at gmail.com Thu Apr 28 22:31:02 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 28 Apr 2011 22:31:02 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On 28 April 2011 22:12, Robert Bradshaw wrote: > On Thu, Apr 28, 2011 at 12:48 PM, mark florisson > wrote: > >> So I fixed all that, but I'm currently wondering about the proposed >> cython.typeof(). I believe it currently returns a string with the type >> name, and not the type itself. > > Yes. This is just because there's not really anything better to return > at this point. We should "fix" this at some point in the future. > >> So I think it would be inconsistent to >> suddenly start allowing comparison with 'is' and 'isinstance' and >> such. > > I'm open to other suggestions, but would like an expression that > resolves at compile time to true/false (and we need to do branch > pruning on it). Note that type() is not good enough, because it has > different semantics, i.e. > > ? ?cdef object o = [] > ? ?typeof(o), type(o) > > so lets not touch that one. Right, so for fused types I don't mind string comparison with cython.typeof(), but retrieval of the actual type for casts and declaration remains open. I'd be fine with something like cython.gettype(). On the other hand, the user could already do this thing by manually declaring another fused type that would perhaps be resolved based on its usage. e.g. for my original example: ctypedef char *string_t ctypedef cython.fused_type(int, string_t) attrib_t cdef func(struct_t mystruct, int i): cdef attrib_t var = mystruct.attrib + i Is this something that should be supported anyway? Currently fused types can only be used when appearing as argument types (in the function body and as return value). >> I'm also wondering if it would be useful to allow actual type >> retrieval, which could be used in declarations and casts. For instance >> consider fusing two structs with the same attribute name but different >> attribute types. Perhaps in your code you want to introduce a variable >> compatible with such a type, e.g. consider this: >> >> ctypdef struct A: >> ? ?int attrib >> >> ctypedef struct B: >> ? ?char *attrib >> >> ctypedef cython.fused_type(A, B) struct_t >> >> cdef func(struct_t mystruct, int i): >> ? ?cdef cython.gettype(mystruct.attrib) var = mystruct.attrib + i >> ? ?... >> >> What do you think? > > Yes, I was thinking that would be supported as well. 
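For comparison, the string-comparison style already on the table would look something like this (a sketch building on the A/B/struct_t definitions above; use_int and use_string are invented helpers, and whether such branches are pruned at compile time is part of what is being discussed):

cdef func(struct_t mystruct, int i):
    if cython.typeof(mystruct.attrib) == "int":
        # int branch: ideally only this branch survives in the A specialization
        use_int(mystruct.attrib + i)
    else:
        # char * branch for the B specialization
        use_string(mystruct.attrib)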
> > - Robert > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From robertwb at math.washington.edu Thu Apr 28 22:32:17 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 28 Apr 2011 13:32:17 -0700 Subject: [Cython] [cython-users] using generators In-Reply-To: <4DB90C5B.4030607@behnel.de> References: <3fcf23f2-2b27-4b5b-ad15-ee419ac9d4a8@v10g2000yqn.googlegroups.com> <4DB90C5B.4030607@behnel.de> Message-ID: On Wed, Apr 27, 2011 at 11:42 PM, Stefan Behnel wrote: > Robert Bradshaw, 27.04.2011 22:41: >> >> On Wed, Apr 27, 2011 at 1:19 PM, Mad wrote: >>> >>> Hi list, >>> i just read, that generators are supported in cython 0.15 - great news >>> and brw, manny thanks. but is it somehow possible to use them with >>> cython 0.14 too? >>> It would be nice, if there's a patch (or the patch of ?0.14 -> ?0.15 >>> could be applied to the 0.14 source) to get them working, ?If this is >>> not possible, >> >> No, that's not possible--there's a lot that went on between here and >> there (it's not something you can just patch in). Any reason you're >> trying to stick with 0.14? >> >>> is 0.15 in a usable state? and how could i get it? >> >> It hasn't been released yet, and the development branch (on github) is >> relatively stable but not quite ready yet. As for the state of >> generators, Vitja and Stefan can comment more than I can, but there >> are still some corner cases with exception throwing that don't quite >> work like Python, and there's also a regression with respect to inline >> generators. I'm not sure if either of these is enough to be a blocker >> for a release. > > Don't think they are. As you said, the exception handling is for corner > cases (which exceptions should always be anyway) and I consider the > behaviour "close enough" now to not be a major issue. Vitja can comment on > this, too. The only case where I know that it currently diverges is > exception chaining in Python 3. And that should be fixable. > > Inline generators will require some work to become usable again. > Specifically, the expression scope that I had previously implemented for > comprehensions (and reused for generator expressions) doesn't work with the > then already created function scope of the generator. This needs to be fixed > somehow, either by moving the declarations over to the expression scope or > by directly using the function scope instead. In the latter case, it would > also be nice to merge this with the scoped comprehensions as well. > > So, agreed that these are regressions, but certainly less important than > getting generators and coroutines out in the wild. OK, I've added the inline generator tests to the bug list so we can see where we stand. I'd like to get Sage back up and compiling as well. Any thoughts on a release timeline? Any other known instabilities? - Robert From markflorisson88 at gmail.com Thu Apr 28 23:29:06 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 28 Apr 2011 23:29:06 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On 28 April 2011 22:31, mark florisson wrote: > On 28 April 2011 22:12, Robert Bradshaw wrote: >> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson >> wrote: >> >>> So I fixed all that, but I'm currently wondering about the proposed >>> cython.typeof(). I believe it currently returns a string with the type >>> name, and not the type itself. >> >> Yes. 
This is just because there's not really anything better to return >> at this point. We should "fix" this at some point in the future. >> >>> So I think it would be inconsistent to >>> suddenly start allowing comparison with 'is' and 'isinstance' and >>> such. >> >> I'm open to other suggestions, but would like an expression that >> resolves at compile time to true/false (and we need to do branch >> pruning on it). Note that type() is not good enough, because it has >> different semantics, i.e. >> >> ? ?cdef object o = [] >> ? ?typeof(o), type(o) >> >> so lets not touch that one. > > Right, so for fused types I don't mind string comparison with > cython.typeof(), but retrieval of the actual type for casts and > declaration remains open. I'd be fine with something like > cython.gettype(). It seems that this isn't optimized yet, but it looks to me like it wouldn't be very hard to do so. At least == and != could be resolved at compile time if the other operand is a string literal. > On the other hand, the user could already do this thing by manually > declaring another fused type that would perhaps be resolved based on > its usage. e.g. for my original example: > > ctypedef char *string_t > ctypedef cython.fused_type(int, string_t) attrib_t > > cdef func(struct_t mystruct, int i): > ? ?cdef attrib_t var = mystruct.attrib + i > > Is this something that should be supported anyway? Currently fused > types can only be used when appearing as argument types (in the > function body and as return value). > >>> I'm also wondering if it would be useful to allow actual type >>> retrieval, which could be used in declarations and casts. For instance >>> consider fusing two structs with the same attribute name but different >>> attribute types. Perhaps in your code you want to introduce a variable >>> compatible with such a type, e.g. consider this: >>> >>> ctypdef struct A: >>> ? ?int attrib >>> >>> ctypedef struct B: >>> ? ?char *attrib >>> >>> ctypedef cython.fused_type(A, B) struct_t >>> >>> cdef func(struct_t mystruct, int i): >>> ? ?cdef cython.gettype(mystruct.attrib) var = mystruct.attrib + i >>> ? ?... >>> >>> What do you think? >> >> Yes, I was thinking that would be supported as well. >> >> - Robert >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > From robertwb at math.washington.edu Thu Apr 28 23:30:52 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 28 Apr 2011 14:30:52 -0700 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On Thu, Apr 28, 2011 at 1:31 PM, mark florisson wrote: > On 28 April 2011 22:12, Robert Bradshaw wrote: >> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson >> wrote: >> >>> So I fixed all that, but I'm currently wondering about the proposed >>> cython.typeof(). I believe it currently returns a string with the type >>> name, and not the type itself. >> >> Yes. This is just because there's not really anything better to return >> at this point. We should "fix" this at some point in the future. >> >>> So I think it would be inconsistent to >>> suddenly start allowing comparison with 'is' and 'isinstance' and >>> such. >> >> I'm open to other suggestions, but would like an expression that >> resolves at compile time to true/false (and we need to do branch >> pruning on it). Note that type() is not good enough, because it has >> different semantics, i.e. >> >> ? ?cdef object o = [] >> ? 
?typeof(o), type(o) >> >> so lets not touch that one. > > Right, so for fused types I don't mind string comparison with > cython.typeof(), but retrieval of the actual type for casts and > declaration remains open. I'd be fine with something like > cython.gettype(). I'd rather re-use typeof here than introduce yet another special method. On the other hand, allowing parentheses in a type declaration requires complicating the syntax even more. I'm not sure this is needed, couldn't one do cdef foo(fused_type a): cdef fused_type b = a + func() and all instances of fused_type get specialized in the body. > On the other hand, the user could already do this thing by manually > declaring another fused type that would perhaps be resolved based on > its usage. e.g. for my original example: > > ctypedef char *string_t > ctypedef cython.fused_type(int, string_t) attrib_t > > cdef func(struct_t mystruct, int i): > ? ?cdef attrib_t var = mystruct.attrib + i > > Is this something that should be supported anyway? OK, I take back what I said, I was looking at the RHS, not the LHS. If one needs to specialize in this manner, explicitly creating two branches should typically be enough. The same for casting. The one exception (perhaps) is "my_fused_type complex." Otherwise it's starting to feel too much like C++ template magic and complexity for little additional benefit. > Currently fused > types can only be used when appearing as argument types (in the > function body and as return value). >>> I'm also wondering if it would be useful to allow actual type >>> retrieval, which could be used in declarations and casts. For instance >>> consider fusing two structs with the same attribute name but different >>> attribute types. Perhaps in your code you want to introduce a variable >>> compatible with such a type, e.g. consider this: >>> >>> ctypdef struct A: >>> ? ?int attrib >>> >>> ctypedef struct B: >>> ? ?char *attrib >>> >>> ctypedef cython.fused_type(A, B) struct_t >>> >>> cdef func(struct_t mystruct, int i): >>> ? ?cdef cython.gettype(mystruct.attrib) var = mystruct.attrib + i >>> ? ?... >>> >>> What do you think? >> >> Yes, I was thinking that would be supported as well. >> >> - Robert >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From robertwb at math.washington.edu Thu Apr 28 23:59:13 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 28 Apr 2011 14:59:13 -0700 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On Thu, Apr 28, 2011 at 2:29 PM, mark florisson wrote: > On 28 April 2011 22:31, mark florisson wrote: >> On 28 April 2011 22:12, Robert Bradshaw wrote: >>> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson >>> wrote: >>> >>>> So I fixed all that, but I'm currently wondering about the proposed >>>> cython.typeof(). I believe it currently returns a string with the type >>>> name, and not the type itself. >>> >>> Yes. This is just because there's not really anything better to return >>> at this point. We should "fix" this at some point in the future. >>> >>>> So I think it would be inconsistent to >>>> suddenly start allowing comparison with 'is' and 'isinstance' and >>>> such. 
>>> >>> I'm open to other suggestions, but would like an expression that >>> resolves at compile time to true/false (and we need to do branch >>> pruning on it). Note that type() is not good enough, because it has >>> different semantics, i.e. >>> >>> ? ?cdef object o = [] >>> ? ?typeof(o), type(o) >>> >>> so lets not touch that one. >> >> Right, so for fused types I don't mind string comparison with >> cython.typeof(), but retrieval of the actual type for casts and >> declaration remains open. I'd be fine with something like >> cython.gettype(). > > It seems that this isn't optimized yet, but it looks to me like it > wouldn't be very hard to do so. At least == and != could be resolved > at compile time if the other operand is a string literal. Yes. We could consider supporting "typeof(x) is typeof(double)" or even "typeof(int*)" etc. as well to not tie ourselves to strings. Or perhaps some other syntax we haven't thought of. Alternatively, it would be nice if it involved the literal fused_type as that's what we're really branching on, e.g. ctypedef cython.fused_type(A, B) AorB cdef foo(AorB x): if AorB[A]: # or .A or something ... it's hard to come up with something that plays nicely with pointer types. - Robert From stefan_ml at behnel.de Fri Apr 29 04:09:56 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 29 Apr 2011 04:09:56 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: <4DBA1DF4.6010001@behnel.de> mark florisson, 28.04.2011 23:29: > On 28 April 2011 22:31, mark florisson wrote: >> On 28 April 2011 22:12, Robert Bradshaw wrote: >>> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson wrote: >>> >>>> So I fixed all that, but I'm currently wondering about the proposed >>>> cython.typeof(). I believe it currently returns a string with the type >>>> name, and not the type itself. >>> >>> Yes. This is just because there's not really anything better to return >>> at this point. We should "fix" this at some point in the future. >>> >>>> So I think it would be inconsistent to >>>> suddenly start allowing comparison with 'is' and 'isinstance' and >>>> such. >>> >>> I'm open to other suggestions, but would like an expression that >>> resolves at compile time to true/false (and we need to do branch >>> pruning on it). Note that type() is not good enough, because it has >>> different semantics, i.e. >>> >>> cdef object o = [] >>> typeof(o), type(o) >>> >>> so lets not touch that one. >> >> Right, so for fused types I don't mind string comparison with >> cython.typeof(), but retrieval of the actual type for casts and >> declaration remains open. I'd be fine with something like >> cython.gettype(). > > It seems that this isn't optimized yet, but it looks to me like it > wouldn't be very hard to do so. At least == and != could be resolved > at compile time if the other operand is a string literal. Well, the obvious place where this would happen for free would be constant folding. But that runs *way* before type analysis, so all it sees is the typeof() call, not the string. Optimising this would thus basically require a second (simpler?) constant folding step, or at least an additional hook during (or after) type analysis that does the comparison. I wouldn't mind too much, given that type analysis can end up injecting some rather handy information into the tree. So a second (even complete) constant folding step *after* type analysis may still introduce some nice optimisations. 
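(For concreteness, the user-level pattern whose folding is being discussed is roughly the following sketch; it relies only on the string-returning cython.typeof() described above, everything else is illustrative.)

    cimport cython

    def describe(double x):
        # cython.typeof() yields the type name as a string ("double" here),
        # so folding this comparison to a constant needs the result of type
        # analysis; plain constant folding runs too early to see it.
        if cython.typeof(x) == "double":
            return "a C double"
        return "something else"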
We may actually consider splitting constant folding into two steps - one that calculates the results as early as possible (and maybe does only safe&obvious tree replacements, e.g. boolean values, strings, etc.) and a second step that replaces subtrees by constant nodes once it knows the final type of the result. Stefan From robertwb at math.washington.edu Fri Apr 29 06:32:33 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 28 Apr 2011 21:32:33 -0700 Subject: [Cython] Fused Types In-Reply-To: <4DBA1DF4.6010001@behnel.de> References: <4DB6D9FA.9030405@behnel.de> <4DBA1DF4.6010001@behnel.de> Message-ID: On Thu, Apr 28, 2011 at 7:09 PM, Stefan Behnel wrote: > mark florisson, 28.04.2011 23:29: >> >> On 28 April 2011 22:31, mark florisson wrote: >>> >>> On 28 April 2011 22:12, Robert Bradshaw wrote: >>>> >>>> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson wrote: >>>> >>>>> So I fixed all that, but I'm currently wondering about the proposed >>>>> cython.typeof(). I believe it currently returns a string with the type >>>>> name, and not the type itself. >>>> >>>> Yes. This is just because there's not really anything better to return >>>> at this point. We should "fix" this at some point in the future. >>>> >>>>> So I think it would be inconsistent to >>>>> suddenly start allowing comparison with 'is' and 'isinstance' and >>>>> such. >>>> >>>> I'm open to other suggestions, but would like an expression that >>>> resolves at compile time to true/false (and we need to do branch >>>> pruning on it). Note that type() is not good enough, because it has >>>> different semantics, i.e. >>>> >>>> ? ?cdef object o = [] >>>> ? ?typeof(o), type(o) >>>> >>>> so lets not touch that one. >>> >>> Right, so for fused types I don't mind string comparison with >>> cython.typeof(), but retrieval of the actual type for casts and >>> declaration remains open. I'd be fine with something like >>> cython.gettype(). >> >> It seems that this isn't optimized yet, but it looks to me like it >> wouldn't be very hard to do so. At least == and != could be resolved >> at compile time if the other operand is a string literal. > > Well, the obvious place where this would happen for free would be constant > folding. But that runs *way* before type analysis, so all it sees is the > typeof() call, not the string. Actually, to support code like ctypedef cython.fused_type(object, double) my_fused_type cdef foo(my_fused_type x): if my_fused_type is object: print x.attr else: cdef my_fused_type *ptr = &x we need to resolve this branch and prune "dead code" before type analysis, so I'm thinking it may need to be a special phase, perhaps part of the fused-type-specialization phase. Here's another idea: cdef foo(numeric x): if numeric in floating: ... elif numeric is long: ... else: ... print numeric is double # after the specialization pass, this would be a literal True/False node. Again, this would all be resolved before type analysis. In terms of syntactically supporting pointers, we could either support "fused_type is typeof(void*)" or require typedefs for types not representable by an ExprNode. (This would make declaring fused types syntactically simpler as well...) I prefer the latter. - Robert > Optimising this would thus basically require a second (simpler?) constant > folding step, or at least an additional hook during (or after) type analysis > that does the comparison. I wouldn't mind too much, given that type analysis > can end up injecting some rather handy information into the tree. 
So a > second (even complete) constant folding step *after* type analysis may still > introduce some nice optimisations. > > We may actually consider splitting constant folding into two steps - one > that calculates the results as early as possible (and maybe does only > safe&obvious tree replacements, e.g. boolean values, strings, etc.) and a > second step that replaces subtrees by constant nodes once it knows the final > type of the result. > > Stefan > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Fri Apr 29 10:23:55 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Fri, 29 Apr 2011 10:23:55 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On 28 April 2011 23:30, Robert Bradshaw wrote: > On Thu, Apr 28, 2011 at 1:31 PM, mark florisson > wrote: >> On 28 April 2011 22:12, Robert Bradshaw wrote: >>> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson >>> wrote: >>> >>>> So I fixed all that, but I'm currently wondering about the proposed >>>> cython.typeof(). I believe it currently returns a string with the type >>>> name, and not the type itself. >>> >>> Yes. This is just because there's not really anything better to return >>> at this point. We should "fix" this at some point in the future. >>> >>>> So I think it would be inconsistent to >>>> suddenly start allowing comparison with 'is' and 'isinstance' and >>>> such. >>> >>> I'm open to other suggestions, but would like an expression that >>> resolves at compile time to true/false (and we need to do branch >>> pruning on it). Note that type() is not good enough, because it has >>> different semantics, i.e. >>> >>> ? ?cdef object o = [] >>> ? ?typeof(o), type(o) >>> >>> so lets not touch that one. >> >> Right, so for fused types I don't mind string comparison with >> cython.typeof(), but retrieval of the actual type for casts and >> declaration remains open. I'd be fine with something like >> cython.gettype(). > > I'd rather re-use typeof here than introduce yet another special > method. On the other hand, allowing parentheses in a type declaration > requires complicating the syntax even more. > > I'm not sure this is needed, couldn't one do > > ? ?cdef foo(fused_type a): > ? ? ? ?cdef fused_type b = a + func() > > and all instances of fused_type get specialized in the body. Right, that is supported already. >> On the other hand, the user could already do this thing by manually >> declaring another fused type that would perhaps be resolved based on >> its usage. e.g. for my original example: >> >> ctypedef char *string_t >> ctypedef cython.fused_type(int, string_t) attrib_t >> >> cdef func(struct_t mystruct, int i): >> ? ?cdef attrib_t var = mystruct.attrib + i >> >> Is this something that should be supported anyway? > > OK, I take back what I said, I was looking at the RHS, not the LHS. If > one needs to specialize in this manner, explicitly creating two > branches should typically be enough. The same for casting. The one > exception (perhaps) is "my_fused_type complex." Otherwise it's > starting to feel too much like C++ template magic and complexity for > little additional benefit. Ok, branching on the type sounds fine to me. It brings one problem though: because you cannot declare the variables of your variable type (the type of say, mystruct.attrib), you will need to do multiple declarations outside of your branches. 
So in my example: cdef func(struct_t mystruct, int i): ? ?cdef string_t string_var cdef int int_var if typeof(mystruct) is typeof(int): int_var = mystruct.attrib + i ... else: string_var = mystruct.attrib + i ... But this is probably not a common case, so likely not an issue. >> Currently fused >> types can only be used when appearing as argument types (in the >> function body and as return value). > >>>> I'm also wondering if it would be useful to allow actual type >>>> retrieval, which could be used in declarations and casts. For instance >>>> consider fusing two structs with the same attribute name but different >>>> attribute types. Perhaps in your code you want to introduce a variable >>>> compatible with such a type, e.g. consider this: >>>> >>>> ctypdef struct A: >>>> ? ?int attrib >>>> >>>> ctypedef struct B: >>>> ? ?char *attrib >>>> >>>> ctypedef cython.fused_type(A, B) struct_t >>>> >>>> cdef func(struct_t mystruct, int i): >>>> ? ?cdef cython.gettype(mystruct.attrib) var = mystruct.attrib + i >>>> ? ?... >>>> >>>> What do you think? >>> >>> Yes, I was thinking that would be supported as well. >>> >>> - Robert >>> _______________________________________________ >>> cython-devel mailing list >>> cython-devel at python.org >>> http://mail.python.org/mailman/listinfo/cython-devel >>> >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Fri Apr 29 10:24:13 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Fri, 29 Apr 2011 10:24:13 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On 28 April 2011 23:59, Robert Bradshaw wrote: > On Thu, Apr 28, 2011 at 2:29 PM, mark florisson > wrote: >> On 28 April 2011 22:31, mark florisson wrote: >>> On 28 April 2011 22:12, Robert Bradshaw wrote: >>>> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson >>>> wrote: >>>> >>>>> So I fixed all that, but I'm currently wondering about the proposed >>>>> cython.typeof(). I believe it currently returns a string with the type >>>>> name, and not the type itself. >>>> >>>> Yes. This is just because there's not really anything better to return >>>> at this point. We should "fix" this at some point in the future. >>>> >>>>> So I think it would be inconsistent to >>>>> suddenly start allowing comparison with 'is' and 'isinstance' and >>>>> such. >>>> >>>> I'm open to other suggestions, but would like an expression that >>>> resolves at compile time to true/false (and we need to do branch >>>> pruning on it). Note that type() is not good enough, because it has >>>> different semantics, i.e. >>>> >>>> ? ?cdef object o = [] >>>> ? ?typeof(o), type(o) >>>> >>>> so lets not touch that one. >>> >>> Right, so for fused types I don't mind string comparison with >>> cython.typeof(), but retrieval of the actual type for casts and >>> declaration remains open. I'd be fine with something like >>> cython.gettype(). >> >> It seems that this isn't optimized yet, but it looks to me like it >> wouldn't be very hard to do so. At least == and != could be resolved >> at compile time if the other operand is a string literal. > > Yes. We could consider supporting "typeof(x) is typeof(double)" or > even "typeof(int*)" etc. as well to not tie ourselves to strings. 
Or > perhaps some other syntax we haven't thought of. Alternatively, it > would be nice if it involved the literal fused_type as that's what > we're really branching on, e.g. > > ctypedef cython.fused_type(A, B) AorB > > cdef foo(AorB x): > ? ?if AorB[A]: ? # or .A or something > ? ? ? ?... > > it's hard to come up with something that plays nicely with pointer types. I like the typeof(my_type_here). So if we support listing types in typeof(), then you could also call typeof() on AorB to specialize. > - Robert > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Fri Apr 29 10:24:24 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Fri, 29 Apr 2011 10:24:24 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> <4DBA1DF4.6010001@behnel.de> Message-ID: On 29 April 2011 06:32, Robert Bradshaw wrote: > On Thu, Apr 28, 2011 at 7:09 PM, Stefan Behnel wrote: >> mark florisson, 28.04.2011 23:29: >>> >>> On 28 April 2011 22:31, mark florisson wrote: >>>> >>>> On 28 April 2011 22:12, Robert Bradshaw wrote: >>>>> >>>>> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson wrote: >>>>> >>>>>> So I fixed all that, but I'm currently wondering about the proposed >>>>>> cython.typeof(). I believe it currently returns a string with the type >>>>>> name, and not the type itself. >>>>> >>>>> Yes. This is just because there's not really anything better to return >>>>> at this point. We should "fix" this at some point in the future. >>>>> >>>>>> So I think it would be inconsistent to >>>>>> suddenly start allowing comparison with 'is' and 'isinstance' and >>>>>> such. >>>>> >>>>> I'm open to other suggestions, but would like an expression that >>>>> resolves at compile time to true/false (and we need to do branch >>>>> pruning on it). Note that type() is not good enough, because it has >>>>> different semantics, i.e. >>>>> >>>>> ? ?cdef object o = [] >>>>> ? ?typeof(o), type(o) >>>>> >>>>> so lets not touch that one. >>>> >>>> Right, so for fused types I don't mind string comparison with >>>> cython.typeof(), but retrieval of the actual type for casts and >>>> declaration remains open. I'd be fine with something like >>>> cython.gettype(). >>> >>> It seems that this isn't optimized yet, but it looks to me like it >>> wouldn't be very hard to do so. At least == and != could be resolved >>> at compile time if the other operand is a string literal. >> >> Well, the obvious place where this would happen for free would be constant >> folding. But that runs *way* before type analysis, so all it sees is the >> typeof() call, not the string. > > Actually, to support code like > > ctypedef cython.fused_type(object, double) my_fused_type > > cdef foo(my_fused_type x): > ? ?if my_fused_type is object: > ? ? ? ?print x.attr > ? ?else: > ? ? ? ?cdef my_fused_type *ptr = &x > > we need to resolve this branch and prune "dead code" before type > analysis, so I'm thinking it may need to be a special phase, perhaps > part of the fused-type-specialization phase. Here's another idea: > > cdef foo(numeric x): > ? ?if numeric in floating: > ? ? ? ?... > ? ?elif numeric is long: > ? ? ? ?... > ? ?else: > ? ? ? ?... > ? ?print numeric is double ? # after the specialization pass, this > would be a literal True/False node. Hmm, that one looks pretty ok. 
We could also do typeof(numeric) is typeof(floating), and if any of the fused types are unresolved it works like 'in' instead of 'is'. Although in that case people could also write typeof(floating) is typeof(numeric) which would behave like 'in' with reversed operands :) > Again, this would all be resolved before type analysis. In terms of > syntactically supporting pointers, we could either support "fused_type > is typeof(void*)" or require typedefs for types not representable by > an ExprNode. (This would make declaring fused types syntactically > simpler as well...) I prefer the latter. Me too, cython.fused_type() currently just parses identifiers only. > - Robert > > >> Optimising this would thus basically require a second (simpler?) constant >> folding step, or at least an additional hook during (or after) type analysis >> that does the comparison. I wouldn't mind too much, given that type analysis >> can end up injecting some rather handy information into the tree. So a >> second (even complete) constant folding step *after* type analysis may still >> introduce some nice optimisations. >> >> We may actually consider splitting constant folding into two steps - one >> that calculates the results as early as possible (and maybe does only >> safe&obvious tree replacements, e.g. boolean values, strings, etc.) and a >> second step that replaces subtrees by constant nodes once it knows the final >> type of the result. >> >> Stefan >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From pav at iki.fi Fri Apr 29 11:03:57 2011 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 29 Apr 2011 09:03:57 +0000 (UTC) Subject: [Cython] Fused Types References: <4DB6D9FA.9030405@behnel.de> Message-ID: Fri, 29 Apr 2011 10:23:55 +0200, mark florisson wrote: [clip] > Ok, branching on the type sounds fine to me. It brings one problem > though: because you cannot declare the variables of your variable type > (the type of say, mystruct.attrib), you will need to do multiple > declarations outside of your branches. So in my example: > > cdef func(struct_t mystruct, int i): > ? ?cdef string_t string_var > cdef int int_var > > if typeof(mystruct) is typeof(int): > int_var = mystruct.attrib + i > ... > else: > string_var = mystruct.attrib + i > ... > > But this is probably not a common case, so likely not an issue. Are you planning to special-case the "real_t complex" syntax? Shooting from the sidelines, one more generic solution might be, e.g., ctypedef cython.fused_type(A, B) struct_t ctypedef cython.fused_type(float, double, paired=struct_t) real_t ctypedef cython.fused_type(int_t, string_t, paired=struct_t) var_t and just restrict the specialization to cases that make sense. -- Pauli Virtanen From markflorisson88 at gmail.com Fri Apr 29 11:30:19 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Fri, 29 Apr 2011 11:30:19 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On 29 April 2011 11:03, Pauli Virtanen wrote: > Fri, 29 Apr 2011 10:23:55 +0200, mark florisson wrote: > [clip] >> Ok, branching on the type sounds fine to me. 
It brings one problem
>> though: because you cannot declare the variables of your variable type
>> (the type of say, mystruct.attrib), you will need to do multiple
>> declarations outside of your branches. So in my example:
>>
>> cdef func(struct_t mystruct, int i):
>>     cdef string_t string_var
>>     cdef int int_var
>>
>>     if typeof(mystruct) is typeof(int):
>>         int_var = mystruct.attrib + i
>>         ...
>>     else:
>>         string_var = mystruct.attrib + i
>>         ...
>>
>> But this is probably not a common case, so likely not an issue.
>
> Are you planning to special-case the "real_t complex" syntax? Shooting
> from the sidelines, one more generic solution might be, e.g.,

I'm sorry, I'm not sure what syntax you are referring to. Are you
talking about actual complex numbers?

>         ctypedef cython.fused_type(A, B) struct_t
>         ctypedef cython.fused_type(float, double, paired=struct_t) real_t
>         ctypedef cython.fused_type(int_t, string_t, paired=struct_t) var_t
>
> and just restrict the specialization to cases that make sense.

The paired means you're declaring types of attributes?

> --
> Pauli Virtanen
>
> _______________________________________________
> cython-devel mailing list
> cython-devel at python.org
> http://mail.python.org/mailman/listinfo/cython-devel
>

From pav at iki.fi  Fri Apr 29 12:28:01 2011
From: pav at iki.fi (Pauli Virtanen)
Date: Fri, 29 Apr 2011 10:28:01 +0000 (UTC)
Subject: [Cython] Fused Types
References: <4DB6D9FA.9030405@behnel.de>
Message-ID: 

Fri, 29 Apr 2011 11:30:19 +0200, mark florisson wrote:
> On 29 April 2011 11:03, Pauli Virtanen wrote:
[clip]
>> Are you planning to special-case the "real_t complex" syntax? Shooting
>> from the sidelines, one more generic solution might be, e.g.,
>
> I'm sorry, I'm not sure what syntax you are referring to. Are you
> talking about actual complex numbers?

This:

On 28 April 2011 23:30, Robert Bradshaw wrote:
> OK, I take back what I said, I was looking at the RHS, not the LHS. If
> one needs to specialize in this manner, explicitly creating two
> branches should typically be enough. The same for casting. The one
> exception (perhaps) is "my_fused_type complex." Otherwise it's
> starting to feel too much like C++ template magic and complexity for
> little additional benefit.

That is, declaring a complex type matching a real one.

>>         ctypedef cython.fused_type(A, B) struct_t
>>         ctypedef cython.fused_type(float, double, paired=struct_t) real_t
>>         ctypedef cython.fused_type(int_t, string_t, paired=struct_t) var_t
>>
>> and just restrict the specialization to cases that make sense.
>
> The paired means you're declaring types of attributes?

No, just that real_t is specialized to float whenever struct_t is specialized
to A and to double when B. Or a more realistic example,

        ctypedef cython.fused_type(float, double) real_t
        ctypedef cython.fused_type(float complex, double complex) complex_t

        cdef real_plus_one(complex_t a):
            real_t b = a.real
            return b + 1

which I suppose would not be a very unusual thing in numerical codes.
This would also allow writing the case you had earlier as

        cdef cython.fused_type(string_t, int, paired=struct_t) attr_t

        cdef func(struct_t mystruct, int i):
            cdef attr_t var

            if typeof(mystruct) is typeof(int):
                var = mystruct.attrib + i
                ...
            else:
                var = mystruct.attrib + i
                ...

Things would need to be done explicitly instead of implicitly, though,
but it would remove the need for any special handling of
the "complex" keyword.
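(To spell out what the pairing would mean for real_plus_one: under the hypothetical paired= declarations above, the compiler would effectively emit the following two ordinary specializations. This is only a sketch of the proposal, and the specialized function names are made up.)

    # float complex pairs with float ...
    cdef real_plus_one_float(float complex a):
        cdef float b = a.real
        return b + 1

    # ... and double complex pairs with double.
    cdef real_plus_one_double(double complex a):
        cdef double b = a.real
        return b + 1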
-- Pauli Virtanen From markflorisson88 at gmail.com Fri Apr 29 12:53:06 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Fri, 29 Apr 2011 12:53:06 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On 29 April 2011 12:28, Pauli Virtanen wrote: > Fri, 29 Apr 2011 11:30:19 +0200, mark florisson wrote: >> On 29 April 2011 11:03, Pauli Virtanen wrote: > [clip] >>> Are you planning to special-case the "real_t complex" syntax? Shooting >>> from the sidelines, one more generic solution might be, e.g., >> >> I'm sorry, I'm not sure what syntax you are referring to. Are you >> talking about actual complex numbers? > > This: > > On 28 April 2011 23:30, Robert Bradshaw > wrote: >> OK, I take back what I said, I was looking at the RHS, not the LHS. If >> one needs to specialize in this manner, explicitly creating two >> branches should typically be enough. The same for casting. The one >> exception (perhaps) is "my_fused_type complex." Otherwise it's >> starting to feel too much like C++ template magic and complexity for >> little additional benefit. > > That is, declaring a complex type matching a real one. Ah, I see what you mean now. >>> ? ? ? ?ctypedef cython.fused_type(A, B) struct_t >>> ? ? ? ?ctypedef cython.fused_type(float, double, paired=struct_t) real_t >>> ? ? ? ?ctypedef cython.fused_type(int_t, string_t, paired=struct_t) var_t >>> >>> and just restrict the specialization to cases that make sense. >> >> The paired means you're declaring types of attributes? > > No, just that real_t is specialized to float whenever struct_t is specialized > to A and to double when B. Or a more realistic example, > > ? ? ? ?ctypedef cython.fused_type(float, double) real_t > ? ? ? ?ctypedef cython.fused_type(float complex, double complex) complex_t > > ? ? ? ?cdef real_plus_one(complex_t a): > ? ? ? ? ? ?real_t b = a.real > ? ? ? ? ? ?return b + 1 > > which I suppose would not be a very unusual thing in numerical codes. > This would also allow writing the case you had earlier as > > ? ? ? ?cdef cython.fused_type(string_t, int, paired=struct_t) attr_t > > ? ? ? ?cdef func(struct_t mystruct, int i): > ? ? ? ? ? ?cdef attr_t var > > ? ? ? ? ? ?if typeof(mystruct) is typeof(int): > ? ? ? ? ? ? ? ?var = mystruct.attrib + i > ? ? ? ? ? ? ? ?... > ? ? ? ? ? ?else: > ? ? ? ? ? ? ? ?var = mystruct.attrib + i > ? ? ? ? ? ? ? ?... > > Things would need to be done explicitly instead of implicitly, though, > but it would remove the need for any special handling of > the "complex" keyword. I see, so it's like a mapping. So, I didn't realize that you can't do this: def func(arbitrary_type complex x): ... But if we just allow that for fused types, then couldn't we simply do ctypedef cython.fused_type(float, double) real_t cdef real_plus_one(real_t complex a): real_t b = a.real return b + 1 ? Then you don't need to pair anything. Perhaps we could introduce real_t as a type, just like numeric and floating. So I guess special-casing complex sounds fine with me. Perhaps real_t should be builtin (not as an attribute of the Cython module), so the parser can just recognize it immediately? 
> -- > Pauli Virtanen > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From d.s.seljebotn at astro.uio.no Fri Apr 29 13:15:23 2011 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Fri, 29 Apr 2011 13:15:23 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: <4DBA9DCB.5090509@astro.uio.no> On 04/29/2011 12:53 PM, mark florisson wrote: > On 29 April 2011 12:28, Pauli Virtanen wrote: >> Fri, 29 Apr 2011 11:30:19 +0200, mark florisson wrote: >>> On 29 April 2011 11:03, Pauli Virtanen wrote: >> [clip] >>>> Are you planning to special-case the "real_t complex" syntax? Shooting >>>> from the sidelines, one more generic solution might be, e.g., >>> >>> I'm sorry, I'm not sure what syntax you are referring to. Are you >>> talking about actual complex numbers? >> >> This: >> >> On 28 April 2011 23:30, Robert Bradshaw >> wrote: >>> OK, I take back what I said, I was looking at the RHS, not the LHS. If >>> one needs to specialize in this manner, explicitly creating two >>> branches should typically be enough. The same for casting. The one >>> exception (perhaps) is "my_fused_type complex." Otherwise it's >>> starting to feel too much like C++ template magic and complexity for >>> little additional benefit. >> >> That is, declaring a complex type matching a real one. > > Ah, I see what you mean now. > >>>> ctypedef cython.fused_type(A, B) struct_t >>>> ctypedef cython.fused_type(float, double, paired=struct_t) real_t >>>> ctypedef cython.fused_type(int_t, string_t, paired=struct_t) var_t >>>> >>>> and just restrict the specialization to cases that make sense. >>> >>> The paired means you're declaring types of attributes? >> >> No, just that real_t is specialized to float whenever struct_t is specialized >> to A and to double when B. Or a more realistic example, >> >> ctypedef cython.fused_type(float, double) real_t >> ctypedef cython.fused_type(float complex, double complex) complex_t >> >> cdef real_plus_one(complex_t a): >> real_t b = a.real >> return b + 1 >> >> which I suppose would not be a very unusual thing in numerical codes. Did you mean ctypedef cython.fused_type(float complex, double complex, paired=real_t) complex_t ? >> This would also allow writing the case you had earlier as >> >> cdef cython.fused_type(string_t, int, paired=struct_t) attr_t >> >> cdef func(struct_t mystruct, int i): >> cdef attr_t var >> >> if typeof(mystruct) is typeof(int): >> var = mystruct.attrib + i >> ... >> else: >> var = mystruct.attrib + i >> ... >> >> Things would need to be done explicitly instead of implicitly, though, >> but it would remove the need for any special handling of >> the "complex" keyword. > > I see, so it's like a mapping. So, I didn't realize that you can't do this: > > def func(arbitrary_type complex x): > ... > We could support this, but I don't think it is powerful enough. I see some code that require pairings like this: ctypedef cython.fused_type(float, double, float complex, \ double complex) complex_or_float_t ctypedef cython.fused_type(float, double, float, \ double) only_float_t cdef func(complex_or_float_t x): cdef only_float_t y ... So IIUC, one could here add "paired=complex_or_float_t" to say that only_float_t links positionally to corresponding types in complex_or_float_t. Perhaps "pair_up_with="? "given_by="? 
Anyway, I'm wondering if the special case of complex could be handled by having magical built-in fused types for the floating point for these purposes, and that these would suffice *shrug*. Dag Sverre From pav at iki.fi Fri Apr 29 13:37:54 2011 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 29 Apr 2011 11:37:54 +0000 (UTC) Subject: [Cython] Fused Types References: <4DB6D9FA.9030405@behnel.de> Message-ID: Fri, 29 Apr 2011 12:53:06 +0200, mark florisson wrote: [clip] > But if we just allow that for fused types, then couldn't we simply do > > ctypedef cython.fused_type(float, double) real_t > > cdef real_plus_one(real_t complex a): > real_t b = a.real > return b + 1 > > ? Then you don't need to pair anything. Perhaps we could introduce > real_t as a type, just like numeric and floating. So I guess > special-casing complex sounds fine with me. Special-casing for complex sounds ugly to me; "complex float" is a type name in the same way as "long int" is, and writing "real_t complex" feels a bit like writing "int_t long". But of course, if this feature is really only needed for declaring complex variants of real types, then how it is done doesn't matter so much. But, IMHO, such a special casing would be a somewhat weird language feature. -- Pauli Virtanen From markflorisson88 at gmail.com Fri Apr 29 16:59:22 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Fri, 29 Apr 2011 16:59:22 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On 29 April 2011 13:37, Pauli Virtanen wrote: > Fri, 29 Apr 2011 12:53:06 +0200, mark florisson wrote: > [clip] >> But if we just allow that for fused types, then couldn't we simply do >> >> ctypedef cython.fused_type(float, double) real_t >> >> cdef real_plus_one(real_t complex a): >> ? ? real_t b = a.real >> ? ? return b + 1 >> >> ? Then you don't need to pair anything. Perhaps we could introduce >> real_t as a type, just like numeric and floating. So I guess >> special-casing complex sounds fine with me. > > Special-casing for complex sounds ugly to me; "complex float" > is a type name in the same way as "long int" is, and writing > "real_t complex" feels a bit like writing "int_t long". > > But of course, if this feature is really only needed for declaring > complex variants of real types, then how it is done doesn't matter > so much. But, IMHO, such a special casing would be a somewhat weird > language feature. Hmm, indeed, it's pretty weird. I'm fine with the pairing also, although I'm still not sure how common this case is, and if we really want to support it. Wouldn't good old C promotion work for this? e.g. if the type is either float or double, just declare your variable double? 
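(That is, something along these lines; a sketch using the cython.fused_type syntax from this thread, with the local simply declared at the widest precision.)

    cimport cython

    ctypedef cython.fused_type(float, double) real_t

    cdef double plus_one(real_t a):
        # Declare the local at the widest precision; C promotion converts
        # the float specialization up to double, so one declaration serves
        # both specializations.
        cdef double b = a
        return b + 1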
> -- > Pauli Virtanen > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From markflorisson88 at gmail.com Fri Apr 29 17:04:28 2011 From: markflorisson88 at gmail.com (mark florisson) Date: Fri, 29 Apr 2011 17:04:28 +0200 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> <4DBA1DF4.6010001@behnel.de> Message-ID: On 29 April 2011 06:32, Robert Bradshaw wrote: > On Thu, Apr 28, 2011 at 7:09 PM, Stefan Behnel wrote: >> mark florisson, 28.04.2011 23:29: >>> >>> On 28 April 2011 22:31, mark florisson wrote: >>>> >>>> On 28 April 2011 22:12, Robert Bradshaw wrote: >>>>> >>>>> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson wrote: >>>>> >>>>>> So I fixed all that, but I'm currently wondering about the proposed >>>>>> cython.typeof(). I believe it currently returns a string with the type >>>>>> name, and not the type itself. >>>>> >>>>> Yes. This is just because there's not really anything better to return >>>>> at this point. We should "fix" this at some point in the future. >>>>> >>>>>> So I think it would be inconsistent to >>>>>> suddenly start allowing comparison with 'is' and 'isinstance' and >>>>>> such. >>>>> >>>>> I'm open to other suggestions, but would like an expression that >>>>> resolves at compile time to true/false (and we need to do branch >>>>> pruning on it). Note that type() is not good enough, because it has >>>>> different semantics, i.e. >>>>> >>>>> ? ?cdef object o = [] >>>>> ? ?typeof(o), type(o) >>>>> >>>>> so lets not touch that one. >>>> >>>> Right, so for fused types I don't mind string comparison with >>>> cython.typeof(), but retrieval of the actual type for casts and >>>> declaration remains open. I'd be fine with something like >>>> cython.gettype(). >>> >>> It seems that this isn't optimized yet, but it looks to me like it >>> wouldn't be very hard to do so. At least == and != could be resolved >>> at compile time if the other operand is a string literal. >> >> Well, the obvious place where this would happen for free would be constant >> folding. But that runs *way* before type analysis, so all it sees is the >> typeof() call, not the string. > > Actually, to support code like > > ctypedef cython.fused_type(object, double) my_fused_type > > cdef foo(my_fused_type x): > ? ?if my_fused_type is object: > ? ? ? ?print x.attr > ? ?else: > ? ? ? ?cdef my_fused_type *ptr = &x > > we need to resolve this branch and prune "dead code" before type > analysis, so I'm thinking it may need to be a special phase, perhaps > part of the fused-type-specialization phase. Here's another idea: > > cdef foo(numeric x): > ? ?if numeric in floating: > ? ? ? ?... > ? ?elif numeric is long: > ? ? ? ?... > ? ?else: > ? ? ? ?... > ? ?print numeric is double ? # after the specialization pass, this > would be a literal True/False node. > > Again, this would all be resolved before type analysis. In terms of > syntactically supporting pointers, we could either support "fused_type > is typeof(void*)" or require typedefs for types not representable by > an ExprNode. (This would make declaring fused types syntactically > simpler as well...) I prefer the latter. > > - Robert > I made both things work, you can use both typeof() on expressions and you can compare types using 'is' and 'in'. However, with typeof() it doesn't remove branches, so if you're doing incompatible stuff you need to do the type matching. 
It could be supported, but currently it isn't because TransformBuiltinMethods runs after AnalyseDeclarationsTransform (for which the reason is not immediately apparent). With the type matching it matches on exactly 'if src_type is dst_type:' so you can't use 'and' and such... perhaps I should turn these expression into a node with the constant value first and then see if the result of the entire expression is known at compile time? From pav at iki.fi Fri Apr 29 17:57:53 2011 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 29 Apr 2011 15:57:53 +0000 (UTC) Subject: [Cython] Fused Types References: <4DB6D9FA.9030405@behnel.de> Message-ID: Fri, 29 Apr 2011 16:59:22 +0200, mark florisson wrote: [clip] > Hmm, indeed, it's pretty weird. I'm fine with the pairing also, although > I'm still not sure how common this case is, and if we really want to > support it. Wouldn't good old C promotion work for this? e.g. if the > type is either float or double, just declare your variable double? In practice mostly yes I guess; IIRC single-precision arithmetic is not any faster than double precision nowadays anyway. However, (playing the devil's advocate :) (i) Theoretically, someone might also want to write code that works both for "double" and for "long double complex". You probably wouldn't want to use "long double" for the double-precision specialization. (ii) One reason to use floats could be the 2x size advantage. In principle someone might want to deal with huge float and float complex arrays. (Sounds like a somewhat rare use case though.) (iii) If you want to wrap a library that already provides complex float functions in both precisions, having a specialized real type could come handy sometimes. But in practice, I guess the "fused_type(double, double complex)" would be the most common use case. Maybe it's best to wait until enough concrete examples accumulate before implementing anything more --- I guess e.g. the pairing feature wouldn't be too difficult to add if it turns out something like that is really needed. Pauli From greg.ewing at canterbury.ac.nz Sat Apr 30 01:59:55 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 30 Apr 2011 11:59:55 +1200 Subject: [Cython] What *is* a fused type? In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: <4DBB50FB.4070205@canterbury.ac.nz> I seem to have missed the beginning of the discussion about this fused type business. Is there a document somewhere describing what a "fused type" is and what it's meant to be used for? -- Greg From robertwb at math.washington.edu Sat Apr 30 06:09:13 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Fri, 29 Apr 2011 21:09:13 -0700 Subject: [Cython] What *is* a fused type? In-Reply-To: <4DBB50FB.4070205@canterbury.ac.nz> References: <4DB6D9FA.9030405@behnel.de> <4DBB50FB.4070205@canterbury.ac.nz> Message-ID: On Fri, Apr 29, 2011 at 4:59 PM, Greg Ewing wrote: > I seem to have missed the beginning of the discussion about this > fused type business. Is there a document somewhere describing > what a "fused type" is and what it's meant to be used for? http://wiki.cython.org/enhancements/fusedtypes In short, it's a very basic type parameterization. 
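(A minimal illustration, in the cython.fused_type syntax under discussion in this thread: one definition, and the compiler generates a separate C specialization of the function for each type that is actually used.)

    cimport cython

    # number_t stands for either int or double at each use site.
    ctypedef cython.fused_type(int, double) number_t

    cdef number_t twice(number_t x):
        return x + x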
- Robert From vitja.makarov at gmail.com Sat Apr 30 08:08:57 2011 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Sat, 30 Apr 2011 10:08:57 +0400 Subject: [Cython] speed.pypy.org In-Reply-To: <4DB86F5D.2080603@behnel.de> References: <4DA2FD5E.5090704@behnel.de> <4DA8A8A3.6080600@behnel.de> <4DB6DBCF.2020301@behnel.de> <4DB7C9B4.9040004@behnel.de> <4DB86F5D.2080603@behnel.de> Message-ID: 2011/4/27 Stefan Behnel : > Robert Bradshaw, 27.04.2011 19:08: >> >> On Wed, Apr 27, 2011 at 12:45 AM, Stefan Behnel wrote: >>> >>> Actually, if we want a proper history, I'd suggest a separate codespeed >>> installation somewhere. >> >> Makes sense. How many CPU-hours does it take? > > Including the Cython build, it's more like 25 minutes currently, given that > we only support a smaller part of the benchmark suite. It will obviously > take longer when we start supporting the larger benchmarks, such as Django > templates or Twisted. > > >> If it's not to >> intensive, we could probably run it, say, daily as a normal-priority >> job. > > We could certainly do that for now, and check again when we see that it > starts running substantially longer. > Nice thing! Can I run this tests at home? -- vitja. From robertwb at math.washington.edu Sat Apr 30 08:16:13 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Fri, 29 Apr 2011 23:16:13 -0700 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> <4DBA1DF4.6010001@behnel.de> Message-ID: On Fri, Apr 29, 2011 at 8:04 AM, mark florisson wrote: > On 29 April 2011 06:32, Robert Bradshaw wrote: >> On Thu, Apr 28, 2011 at 7:09 PM, Stefan Behnel wrote: >>> mark florisson, 28.04.2011 23:29: >>>> >>>> On 28 April 2011 22:31, mark florisson wrote: >>>>> >>>>> On 28 April 2011 22:12, Robert Bradshaw wrote: >>>>>> >>>>>> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson wrote: >>>>>> >>>>>>> So I fixed all that, but I'm currently wondering about the proposed >>>>>>> cython.typeof(). I believe it currently returns a string with the type >>>>>>> name, and not the type itself. >>>>>> >>>>>> Yes. This is just because there's not really anything better to return >>>>>> at this point. We should "fix" this at some point in the future. >>>>>> >>>>>>> So I think it would be inconsistent to >>>>>>> suddenly start allowing comparison with 'is' and 'isinstance' and >>>>>>> such. >>>>>> >>>>>> I'm open to other suggestions, but would like an expression that >>>>>> resolves at compile time to true/false (and we need to do branch >>>>>> pruning on it). Note that type() is not good enough, because it has >>>>>> different semantics, i.e. >>>>>> >>>>>> ? ?cdef object o = [] >>>>>> ? ?typeof(o), type(o) >>>>>> >>>>>> so lets not touch that one. >>>>> >>>>> Right, so for fused types I don't mind string comparison with >>>>> cython.typeof(), but retrieval of the actual type for casts and >>>>> declaration remains open. I'd be fine with something like >>>>> cython.gettype(). >>>> >>>> It seems that this isn't optimized yet, but it looks to me like it >>>> wouldn't be very hard to do so. At least == and != could be resolved >>>> at compile time if the other operand is a string literal. >>> >>> Well, the obvious place where this would happen for free would be constant >>> folding. But that runs *way* before type analysis, so all it sees is the >>> typeof() call, not the string. >> >> Actually, to support code like >> >> ctypedef cython.fused_type(object, double) my_fused_type >> >> cdef foo(my_fused_type x): >> ? ?if my_fused_type is object: >> ? ? 
? ?print x.attr >> ? ?else: >> ? ? ? ?cdef my_fused_type *ptr = &x >> >> we need to resolve this branch and prune "dead code" before type >> analysis, so I'm thinking it may need to be a special phase, perhaps >> part of the fused-type-specialization phase. Here's another idea: >> >> cdef foo(numeric x): >> ? ?if numeric in floating: >> ? ? ? ?... >> ? ?elif numeric is long: >> ? ? ? ?... >> ? ?else: >> ? ? ? ?... >> ? ?print numeric is double ? # after the specialization pass, this >> would be a literal True/False node. >> >> Again, this would all be resolved before type analysis. In terms of >> syntactically supporting pointers, we could either support "fused_type >> is typeof(void*)" or require typedefs for types not representable by >> an ExprNode. (This would make declaring fused types syntactically >> simpler as well...) I prefer the latter. >> >> - Robert >> > > I made both things work, you can use both typeof() on expressions and > you can compare types using 'is' and 'in'. Cool. > However, with typeof() it > doesn't remove branches, so if you're doing incompatible stuff you > need to do the type matching. It could be supported, but currently it > isn't because TransformBuiltinMethods runs after > AnalyseDeclarationsTransform (for which the reason is not immediately > apparent). For one thing, it's the declaration analysis that allows you to figure out what name nodes are built-ins. > With the type matching it matches on exactly 'if src_type is > dst_type:' so you can't use 'and' and such... perhaps I should turn > these expression into a node with the constant value first and then > see if the result of the entire expression is known at compile time? Yes, that's exactly what I was thinking you should do. - Robert From robertwb at math.washington.edu Sat Apr 30 08:39:58 2011 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Fri, 29 Apr 2011 23:39:58 -0700 Subject: [Cython] Fused Types In-Reply-To: References: <4DB6D9FA.9030405@behnel.de> Message-ID: On Fri, Apr 29, 2011 at 3:53 AM, mark florisson wrote: > On 29 April 2011 12:28, Pauli Virtanen wrote: >> Fri, 29 Apr 2011 11:30:19 +0200, mark florisson wrote: >>> On 29 April 2011 11:03, Pauli Virtanen wrote: >> [clip] >>>> Are you planning to special-case the "real_t complex" syntax? Shooting >>>> from the sidelines, one more generic solution might be, e.g., >>> >>> I'm sorry, I'm not sure what syntax you are referring to. Are you >>> talking about actual complex numbers? >> >> This: >> >> On 28 April 2011 23:30, Robert Bradshaw >> wrote: >>> OK, I take back what I said, I was looking at the RHS, not the LHS. If >>> one needs to specialize in this manner, explicitly creating two >>> branches should typically be enough. The same for casting. The one >>> exception (perhaps) is "my_fused_type complex." Otherwise it's >>> starting to feel too much like C++ template magic and complexity for >>> little additional benefit. >> >> That is, declaring a complex type matching a real one. > > Ah, I see what you mean now. > >>>> ? ? ? ?ctypedef cython.fused_type(A, B) struct_t >>>> ? ? ? ?ctypedef cython.fused_type(float, double, paired=struct_t) real_t >>>> ? ? ? ?ctypedef cython.fused_type(int_t, string_t, paired=struct_t) var_t >>>> >>>> and just restrict the specialization to cases that make sense. >>> >>> The paired means you're declaring types of attributes? >> >> No, just that real_t is specialized to float whenever struct_t is specialized >> to A and to double when B. Or a more realistic example, >> >> ? ? ? 
?ctypedef cython.fused_type(float, double) real_t >> ? ? ? ?ctypedef cython.fused_type(float complex, double complex) complex_t >> >> ? ? ? ?cdef real_plus_one(complex_t a): >> ? ? ? ? ? ?real_t b = a.real >> ? ? ? ? ? ?return b + 1 >> >> which I suppose would not be a very unusual thing in numerical codes. >> This would also allow writing the case you had earlier as >> >> ? ? ? ?cdef cython.fused_type(string_t, int, paired=struct_t) attr_t >> >> ? ? ? ?cdef func(struct_t mystruct, int i): >> ? ? ? ? ? ?cdef attr_t var >> >> ? ? ? ? ? ?if typeof(mystruct) is typeof(int): >> ? ? ? ? ? ? ? ?var = mystruct.attrib + i >> ? ? ? ? ? ? ? ?... >> ? ? ? ? ? ?else: >> ? ? ? ? ? ? ? ?var = mystruct.attrib + i >> ? ? ? ? ? ? ? ?... >> >> Things would need to be done explicitly instead of implicitly, though, >> but it would remove the need for any special handling of >> the "complex" keyword. If we're going to introduce pairing, another option would be ctypedef fused_type((double complex, double), (float complex, float)) (complex_t, real_t) though I'm not sure I like that either. We're not trying to create the all-powerful templating system here, and anything that can be done with pairing can be done (though less elegantly) via branching on the types, or, as Pauli mentions, using a wider type is often (but not always) a viable option. We talked of supporting fused types in classes, one could do the same for structs, thus cdef fused_type(int, double) int_or_double cdef my_struct: int_or_double attr cdef func1(int_or_double x): cdef my_struct my_struct.attr = x and cdef func2(my_struct s) cdef int_or_double x = s.attr could work. Maybe we'd require an explicit "my_struct[int_or_double]". This would solve a large number of the remaining cases, and complex is like a struct + native arithmetic/promotion. > I see, so it's like a mapping. So, I didn't realize that you can't do this: > > def func(arbitrary_type complex x): > ? ?... > > But if we just allow that for fused types, then couldn't we simply do > > ctypedef cython.fused_type(float, double) real_t > > cdef real_plus_one(real_t complex a): > ? ?real_t b = a.real > ? ?return b + 1 > > ? Then you don't need to pair anything. That's what I was thinking. It's a bit special, but on the other hand a very natural pairing that one would want to be able to do with a very natural syntax (no one reading that code would wonder what it means). > Perhaps we could introduce > real_t as a type, just like numeric and floating. So I guess > special-casing complex sounds fine with me. Perhaps real_t should be > builtin (not as an attribute of the Cython module), so the parser can > just recognize it immediately? I'd rather not add new keywords to the language, and I don't see what advantage letting real_t be known to the parser woud be, we can ship a pxd file with the common fused types. - Robert From stefan_ml at behnel.de Sat Apr 30 08:49:52 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sat, 30 Apr 2011 08:49:52 +0200 Subject: [Cython] Fwd: Proposal for a common benchmark suite Message-ID: <4DBBB110.6000501@behnel.de> [Seems like this was meant to go to this list but didn't make it there. Forwarding from python-dev] -------- Original-Message -------- Subject: Proposal for a common benchmark suite Date: Thu, 28 Apr 2011 20:55:19 +0200 From: DasIch Hello, As announced in my GSoC proposal I'd like to announce which benchmarks I'll use for the benchmark suite I will work on this summer. 
From stefan_ml at behnel.de Sat Apr 30 08:49:52 2011
From: stefan_ml at behnel.de (Stefan Behnel)
Date: Sat, 30 Apr 2011 08:49:52 +0200
Subject: [Cython] Fwd: Proposal for a common benchmark suite
Message-ID: <4DBBB110.6000501@behnel.de>

[Seems like this was meant to go to this list but didn't make it there.
Forwarding from python-dev]

-------- Original Message --------
Subject: Proposal for a common benchmark suite
Date: Thu, 28 Apr 2011 20:55:19 +0200
From: DasIch

Hello,
As announced in my GSoC proposal, I'd like to announce which benchmarks
I'll use for the benchmark suite I will work on this summer.

As of now there are two benchmark suites (that I know of) which receive
some sort of attention: the one developed as part of the PyPy project[1],
which is used for http://speed.pypy.org, and the one initially developed
for Unladen Swallow, which has been continued by CPython[2]. The PyPy
benchmarks contain a lot of interesting benchmarks, some of them
explicitly developed for that suite; the CPython benchmarks have an
extensive set of microbenchmarks in the pybench package as well as the
previously mentioned modifications made to the Unladen Swallow benchmarks.

I'd like to "simply" merge both suites so that no changes are lost.
However, I'd like to leave out the waf benchmark, which is part of the
PyPy suite; its removal was proposed on pypy-dev because of obvious
deficits[3]. It will be easier to add a better benchmark later than to
replace it at a later point.

Unless there is a major issue with this plan, I'd like to go forward
with it.

.. [1]: https://bitbucket.org/pypy/benchmarks
.. [2]: http://hg.python.org/benchmarks
.. [3]: http://mailrepository.com/pypy-dev.codespeak.net/msg/3627509/

From stefan_ml at behnel.de Sat Apr 30 09:09:24 2011
From: stefan_ml at behnel.de (Stefan Behnel)
Date: Sat, 30 Apr 2011 09:09:24 +0200
Subject: [Cython] speed.pypy.org
In-Reply-To:
References: <4DA2FD5E.5090704@behnel.de> <4DA8A8A3.6080600@behnel.de>
 <4DB6DBCF.2020301@behnel.de> <4DB7C9B4.9040004@behnel.de>
 <4DB86F5D.2080603@behnel.de>
Message-ID: <4DBBB5A4.30902@behnel.de>

Vitja Makarov, 30.04.2011 08:08:
> Can I run these tests at home?

Don't expect it to come shrink-wrapped. It took me a while to get the
setup running on Hudson/Jenkins, and the scripts that do it aren't
completely trivial. You can take a look at the config of the job that
runs them:

https://sage.math.washington.edu:8091/hudson/job/cython-devel-benchmarks-py27/

I'm currently using the PyPy test suite at
https://bitbucket.org/pypy/benchmarks/ instead of the future CPython one
at http://hg.python.org/benchmarks/ simply because the PyPy suite has
more benchmarks (that work for us ;).

The major problem with the test suite is that the Unladen Swallow people
who wrote the test runner somehow got the paranoid idea that they had to
control which environment variables were set during the run, so they
chose to drop the entire environment and use their own one. They clearly
never tried to run distutils in a benchmark, which requires some intimate
knowledge about the environment it runs in. So you basically need a
wrapper around cythonrun that properly sets up the environment for you.
Take a look at what I did in the build job: it generates a script that
sets up the environment and then passes that into the benchmark runner.

You also need a "usercustomize.py" script that installs pyximport (but
only for the Cython runs, not for plain CPython!), so that it's not only
the benchmark script that gets compiled but also its dependencies. I'm
currently blacklisting the stdlib modules ['threading', 'socket',
'gettext', 'locale'] in pyximport.PyxImporter.blocked_modules when I run
it locally, simply because they don't work but have a substantial impact
on the running code.

The basic command I'm using to run the suite is:

    python runner.py -a ",-Xauto_cpdef=True" \
        -p ./cythonrun.sh --baseline=/path/to/python \
        --fast -o results.json

The CPython test runner is a tad better here, because it allows you to
provide at least a whitelist of environment variables that are being
passed through.
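Such a "usercustomize.py" hook could look roughly like the sketch below.
pyximport.install() is the documented entry point; the pyimport=True switch
(for also compiling plain .py modules) and the blocked_modules handling
follow the description above and are written defensively, since the exact
attribute layout may differ between pyximport versions.

    # usercustomize.py -- put on the path for the Cython runs only,
    # not for the plain CPython baseline runs.
    import sys
    import pyximport

    # compile imported modules on the fly, not just the benchmark script
    pyximport.install(pyimport=True)

    # keep stdlib modules that don't compile cleanly out of the build
    for importer in sys.meta_path:
        blocked = getattr(importer, 'blocked_modules', None)
        if blocked is not None:
            for name in ('threading', 'socket', 'gettext', 'locale'):
                if name not in blocked:
                    blocked.append(name)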
I'm currently trying to set up a second benchmark job to run the
(smaller) CPython benchmark suite in Py3 mode. The command to run that
suite is more like

    python perf.py -a ',-Xauto_cpdef=true' \
        --inherit_env=PYTHONPATH,PYTHONHOME,PYTHON,CFLAGS,OPT,PATH \
        /path/to/python ./cythonrun.sh \
        --fast

and it doesn't seem to have JSON output. (Seriously, I can't believe
they even broke "PATH" in their runner script!)

There is also a GSoC project that aims to fix up the benchmark suite for
the different Python implementations in order to set up a
"speed.python.org" site that compares them. We should stay involved in
that.

Stefan

From d.s.seljebotn at astro.uio.no Sat Apr 30 09:51:20 2011
From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn)
Date: Sat, 30 Apr 2011 09:51:20 +0200
Subject: [Cython] Fused Types
In-Reply-To:
References: <4DB6D9FA.9030405@behnel.de>
Message-ID: <4DBBBF78.4030104@astro.uio.no>

On 04/30/2011 08:39 AM, Robert Bradshaw wrote:
> On Fri, Apr 29, 2011 at 3:53 AM, mark florisson wrote:
>> On 29 April 2011 12:28, Pauli Virtanen wrote:
>>> No, just that real_t is specialized to float whenever struct_t is
>>> specialized to A and to double when B. Or a more realistic example,
>>>
>>>         ctypedef cython.fused_type(float, double) real_t
>>>         ctypedef cython.fused_type(float complex, double complex) complex_t
>>>
>>>         cdef real_plus_one(complex_t a):
>>>             real_t b = a.real
>>>             return b + 1
>>>
>>> which I suppose would not be a very unusual thing in numerical codes.
>>> This would also allow writing the case you had earlier as
>>>
>>>         cdef cython.fused_type(string_t, int, paired=struct_t) attr_t
>>>
>>>         cdef func(struct_t mystruct, int i):
>>>             cdef attr_t var
>>>
>>>             if typeof(mystruct) is typeof(int):
>>>                 var = mystruct.attrib + i
>>>                 ...
>>>             else:
>>>                 var = mystruct.attrib + i
>>>                 ...
>>>
>>> Things would need to be done explicitly instead of implicitly, though,
>>> but it would remove the need for any special handling of
>>> the "complex" keyword.
>
> If we're going to introduce pairing, another option would be
>
>     ctypedef fused_type((double complex, double), (float complex, float)) (complex_t, real_t)
>
> though I'm not sure I like that either. We're not trying to create the
> all-powerful templating system here, and anything that can be done
> with pairing can be done (though less elegantly) via branching on the
> types, or, as Pauli mentions, using a wider type is often (but not
> always) a viable option.

Keeping the right balance is difficult. But, at least, there are some
cases of needing this in various codebases when interfacing with LAPACK.

Most uses of templating with Cython code I've seen so far do a similar
kind of "zip" as what you have above (as we discussed at the workshop).
So at least the usage pattern you write above is very common.

float32 is not about to disappear, it really is twice as fast when
you're memory-IO bound.

Using a wider type is actually quite often not possible; any time the
type is involved as the base type of an array it is not possible, and
that's a pretty common case. (With LAPACK you take the address of the
variable and pass it to Fortran, so using a wider type is not possible
there either, although I'll agree that's a more remote case.)

My proposal: Don't support either "real_t complex" or paired fused
types for the time being. Then see. But my vote is for paired fused
types instead of "real_t complex".

Dag Sverre
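To illustrate the LAPACK situation described above, where the Fortran
routine that gets called depends on the element type and a wider type is
not an option, here is a sketch in the cython.fused_type() notation from
this thread. The external declarations of LAPACK's sgetrf/dgetrf are
assumed and not shown, and the non-matching branch would be pruned in each
specialization.

    cimport cython

    ctypedef cython.fused_type(float, double) lapack_real_t

    cdef int lu_factor(lapack_real_t *a, int n, int lda, int *ipiv) nogil:
        # call the single- or double-precision routine depending on the
        # specialization; the other branch disappears after pruning
        cdef int info = 0
        if lapack_real_t is float:
            sgetrf(&n, &n, a, &lda, ipiv, &info)
        else:
            dgetrf(&n, &n, a, &lda, ipiv, &info)
        return info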
From stefan_ml at behnel.de Sat Apr 30 16:24:38 2011
From: stefan_ml at behnel.de (Stefan Behnel)
Date: Sat, 30 Apr 2011 16:24:38 +0200
Subject: [Cython] Fused Types
In-Reply-To:
References: <4DB6D9FA.9030405@behnel.de> <4DBA1DF4.6010001@behnel.de>
Message-ID: <4DBC1BA6.3030806@behnel.de>

Robert Bradshaw, 29.04.2011 06:32:
> On Thu, Apr 28, 2011 at 7:09 PM, Stefan Behnel wrote:
>> mark florisson, 28.04.2011 23:29:
>>>
>>> On 28 April 2011 22:31, mark florisson wrote:
>>>>
>>>> On 28 April 2011 22:12, Robert Bradshaw wrote:
>>>>>
>>>>> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson wrote:
>>>>>
>>>>>> So I fixed all that, but I'm currently wondering about the proposed
>>>>>> cython.typeof(). I believe it currently returns a string with the
>>>>>> type name, and not the type itself.
>>>>>
>>>>> Yes. This is just because there's not really anything better to return
>>>>> at this point. We should "fix" this at some point in the future.
>>>>>
>>>>>> So I think it would be inconsistent to
>>>>>> suddenly start allowing comparison with 'is' and 'isinstance' and
>>>>>> such.
>>>>>
>>>>> I'm open to other suggestions, but would like an expression that
>>>>> resolves at compile time to true/false (and we need to do branch
>>>>> pruning on it). Note that type() is not good enough, because it has
>>>>> different semantics, i.e.
>>>>>
>>>>>     cdef object o = []
>>>>>     typeof(o), type(o)
>>>>>
>>>>> so lets not touch that one.
>>>>
>>>> Right, so for fused types I don't mind string comparison with
>>>> cython.typeof(), but retrieval of the actual type for casts and
>>>> declaration remains open. I'd be fine with something like
>>>> cython.gettype().
>>>
>>> It seems that this isn't optimized yet, but it looks to me like it
>>> wouldn't be very hard to do so. At least == and != could be resolved
>>> at compile time if the other operand is a string literal.
>>
>> Well, the obvious place where this would happen for free would be
>> constant folding. But that runs *way* before type analysis, so all it
>> sees is the typeof() call, not the string.
>
> Actually, to support code like
>
> ctypedef cython.fused_type(object, double) my_fused_type
>
> cdef foo(my_fused_type x):
>     if my_fused_type is object:
>         print x.attr
>     else:
>         cdef my_fused_type *ptr = &x
>
> we need to resolve this branch and prune "dead code" before type
> analysis, so I'm thinking it may need to be a special phase, perhaps
> part of the fused-type-specialization phase. Here's another idea:
>
> cdef foo(numeric x):
>     if numeric in floating:
>         ...
>     elif numeric is long:
>         ...
>     else:
>         ...
>     print numeric is double  # after the specialization pass, this
> would be a literal True/False node.
>
> Again, this would all be resolved before type analysis.

Hmm, I wonder, if we ever want to start with whole program analysis,
and we find code like this:

    def func(arg):
        ...

    a = [1,2,3]
    func(a)

we could infer that "arg" is actually a "fused_type(object, list)". So,
I think, what we eventually want is an intermingling of type analysis
and specialisation. Meaning, we'd find a new type during type analysis
or inference, split off a new version of the function and then analyse
it. But maybe that's just a pie in the sky for now.

Stefan
From robertwb at math.washington.edu Sat Apr 30 19:05:09 2011
From: robertwb at math.washington.edu (Robert Bradshaw)
Date: Sat, 30 Apr 2011 10:05:09 -0700
Subject: [Cython] Fused Types
In-Reply-To: <4DBBBF78.4030104@astro.uio.no>
References: <4DB6D9FA.9030405@behnel.de> <4DBBBF78.4030104@astro.uio.no>
Message-ID:

On Sat, Apr 30, 2011 at 12:51 AM, Dag Sverre Seljebotn wrote:
> On 04/30/2011 08:39 AM, Robert Bradshaw wrote:
>>
>> On Fri, Apr 29, 2011 at 3:53 AM, mark florisson wrote:
>>>
>>> On 29 April 2011 12:28, Pauli Virtanen wrote:
>>>>
>>>> No, just that real_t is specialized to float whenever struct_t is
>>>> specialized to A and to double when B. Or a more realistic example,
>>>>
>>>>         ctypedef cython.fused_type(float, double) real_t
>>>>         ctypedef cython.fused_type(float complex, double complex) complex_t
>>>>
>>>>         cdef real_plus_one(complex_t a):
>>>>             real_t b = a.real
>>>>             return b + 1
>>>>
>>>> which I suppose would not be a very unusual thing in numerical codes.
>>>> This would also allow writing the case you had earlier as
>>>>
>>>>         cdef cython.fused_type(string_t, int, paired=struct_t) attr_t
>>>>
>>>>         cdef func(struct_t mystruct, int i):
>>>>             cdef attr_t var
>>>>
>>>>             if typeof(mystruct) is typeof(int):
>>>>                 var = mystruct.attrib + i
>>>>                 ...
>>>>             else:
>>>>                 var = mystruct.attrib + i
>>>>                 ...
>>>>
>>>> Things would need to be done explicitly instead of implicitly, though,
>>>> but it would remove the need for any special handling of
>>>> the "complex" keyword.
>>
>> If we're going to introduce pairing, another option would be
>>
>>     ctypedef fused_type((double complex, double), (float complex, float)) (complex_t, real_t)
>>
>> though I'm not sure I like that either. We're not trying to create the
>> all-powerful templating system here, and anything that can be done
>> with pairing can be done (though less elegantly) via branching on the
>> types, or, as Pauli mentions, using a wider type is often (but not
>> always) a viable option.
>
> Keeping the right balance is difficult. But, at least, there are some
> cases of needing this in various codebases when interfacing with LAPACK.
>
> Most uses of templating with Cython code I've seen so far do a similar
> kind of "zip" as what you have above (as we discussed at the workshop).
> So at least the usage pattern you write above is very common.
>
> float32 is not about to disappear, it really is twice as fast when
> you're memory-IO bound.
>
> Using a wider type is actually quite often not possible; any time the
> type is involved as the base type of an array it is not possible, and
> that's a pretty common case. (With LAPACK you take the address of the
> variable and pass it to Fortran, so using a wider type is not possible
> there either, although I'll agree that's a more remote case.)
>
> My proposal: Don't support either "real_t complex" or paired fused
> types for the time being. Then see.

+1, then we can address real need.

> But my vote is for paired fused types instead of "real_t complex".
>
> Dag Sverre
> _______________________________________________
> cython-devel mailing list
> cython-devel at python.org
> http://mail.python.org/mailman/listinfo/cython-devel
>
From robertwb at math.washington.edu Sat Apr 30 19:06:38 2011
From: robertwb at math.washington.edu (Robert Bradshaw)
Date: Sat, 30 Apr 2011 10:06:38 -0700
Subject: [Cython] Fused Types
In-Reply-To: <4DBC1BA6.3030806@behnel.de>
References: <4DB6D9FA.9030405@behnel.de> <4DBA1DF4.6010001@behnel.de>
 <4DBC1BA6.3030806@behnel.de>
Message-ID:

On Sat, Apr 30, 2011 at 7:24 AM, Stefan Behnel wrote:
> Robert Bradshaw, 29.04.2011 06:32:
>>
>> On Thu, Apr 28, 2011 at 7:09 PM, Stefan Behnel wrote:
>>>
>>> mark florisson, 28.04.2011 23:29:
>>>>
>>>> On 28 April 2011 22:31, mark florisson wrote:
>>>>>
>>>>> On 28 April 2011 22:12, Robert Bradshaw wrote:
>>>>>>
>>>>>> On Thu, Apr 28, 2011 at 12:48 PM, mark florisson wrote:
>>>>>>
>>>>>>> So I fixed all that, but I'm currently wondering about the proposed
>>>>>>> cython.typeof(). I believe it currently returns a string with the
>>>>>>> type name, and not the type itself.
>>>>>>
>>>>>> Yes. This is just because there's not really anything better to
>>>>>> return at this point. We should "fix" this at some point in the
>>>>>> future.
>>>>>>
>>>>>>> So I think it would be inconsistent to
>>>>>>> suddenly start allowing comparison with 'is' and 'isinstance' and
>>>>>>> such.
>>>>>>
>>>>>> I'm open to other suggestions, but would like an expression that
>>>>>> resolves at compile time to true/false (and we need to do branch
>>>>>> pruning on it). Note that type() is not good enough, because it has
>>>>>> different semantics, i.e.
>>>>>>
>>>>>>     cdef object o = []
>>>>>>     typeof(o), type(o)
>>>>>>
>>>>>> so lets not touch that one.
>>>>>
>>>>> Right, so for fused types I don't mind string comparison with
>>>>> cython.typeof(), but retrieval of the actual type for casts and
>>>>> declaration remains open. I'd be fine with something like
>>>>> cython.gettype().
>>>>
>>>> It seems that this isn't optimized yet, but it looks to me like it
>>>> wouldn't be very hard to do so. At least == and != could be resolved
>>>> at compile time if the other operand is a string literal.
>>>
>>> Well, the obvious place where this would happen for free would be
>>> constant folding. But that runs *way* before type analysis, so all it
>>> sees is the typeof() call, not the string.
>>
>> Actually, to support code like
>>
>> ctypedef cython.fused_type(object, double) my_fused_type
>>
>> cdef foo(my_fused_type x):
>>     if my_fused_type is object:
>>         print x.attr
>>     else:
>>         cdef my_fused_type *ptr = &x
>>
>> we need to resolve this branch and prune "dead code" before type
>> analysis, so I'm thinking it may need to be a special phase, perhaps
>> part of the fused-type-specialization phase. Here's another idea:
>>
>> cdef foo(numeric x):
>>     if numeric in floating:
>>         ...
>>     elif numeric is long:
>>         ...
>>     else:
>>         ...
>>     print numeric is double  # after the specialization pass, this
>> would be a literal True/False node.
>>
>> Again, this would all be resolved before type analysis.
>
> Hmm, I wonder, if we ever want to start with whole program analysis,
> and we find code like this:
>
>     def func(arg):
>         ...
>
>     a = [1,2,3]
>     func(a)
>
> we could infer that "arg" is actually a "fused_type(object, list)". So,
> I think, what we eventually want is an intermingling of type analysis
> and specialisation. Meaning, we'd find a new type during type analysis
> or inference, split off a new version of the function and then analyse
> it. But maybe that's just a pie in the sky for now.

This fits into the idea of creating two (or possibly more, but avoiding
combinatorial explosion) branches, a fast one and a safe one, and
bailing from one to the other if guards fail (all within the same
function).
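A purely conceptual sketch, in plain Python, of the guarded fast/safe split
described above; nothing like this is generated today, it only illustrates
the idea of a guard selecting the specialized branch and falling back to
the generic one within the same function.

    def func(arg):
        # guard derived from the inferred fused_type(object, list)
        if type(arg) is list:
            # fast branch: could be compiled with C-level list indexing
            total = 0
            for i in range(len(arg)):
                total += arg[i]
            return total
        # safe branch: fully generic object protocol
        total = 0
        for item in arg:
            total += item
        return total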
From d.s.seljebotn at astro.uio.no Sat Apr 30 23:35:59 2011
From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn)
Date: Sat, 30 Apr 2011 23:35:59 +0200
Subject: [Cython] Pull request emails
Message-ID: <4DBC80BF.4090707@astro.uio.no>

Finally think I figured out how to get pull request emails (thanks to
Gael V). From https://github.com/organizations/cython/teams/24445:

"""
Owners do not receive notifications for the organization's repos by
default. To receive notifications, create a team and add the owners and
repos for which notifications are desired.
"""

I created Reviewers and added me, Stefan & Robert for now.

https://github.com/organizations/cython/teams/54516

Dag Sverre

From robertwb at math.washington.edu Sat Apr 30 23:47:29 2011
From: robertwb at math.washington.edu (Robert Bradshaw)
Date: Sat, 30 Apr 2011 14:47:29 -0700
Subject: [Cython] Pull request emails
In-Reply-To: <4DBC80BF.4090707@astro.uio.no>
References: <4DBC80BF.4090707@astro.uio.no>
Message-ID:

Excellent, thanks.

On Sat, Apr 30, 2011 at 2:35 PM, Dag Sverre Seljebotn wrote:
> Finally think I figured out how to get pull request emails (thanks to
> Gael V). From https://github.com/organizations/cython/teams/24445:
>
> """
> Owners do not receive notifications for the organization's repos by
> default. To receive notifications, create a team and add the owners and
> repos for which notifications are desired.
> """
>
> I created Reviewers and added me, Stefan & Robert for now.
>
> https://github.com/organizations/cython/teams/54516
>
> Dag Sverre
> _______________________________________________
> cython-devel mailing list
> cython-devel at python.org
> http://mail.python.org/mailman/listinfo/cython-devel
>