From mansourmoufid at gmail.com Tue Jan 3 01:31:56 2012 From: mansourmoufid at gmail.com (Mansour Moufid) Date: Mon, 2 Jan 2012 19:31:56 -0500 Subject: [Cython] Fix integer width constant names in stdint.pxd Message-ID: Hello, Attached is a quick fix for some typos in stdint.pxd. Tested with Cython version 0.15.1. Mansour -------------- next part -------------- A non-text attachment was scrubbed... Name: 0001-Fix-integer-width-constant-names-in-stdint.pxd.patch Type: text/x-patch Size: 3155 bytes Desc: not available URL: From robertwb at math.washington.edu Tue Jan 3 01:56:13 2012 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Mon, 2 Jan 2012 16:56:13 -0800 Subject: [Cython] Fix integer width constant names in stdint.pxd In-Reply-To: References: Message-ID: Thanks. On Mon, Jan 2, 2012 at 4:31 PM, Mansour Moufid wrote: > Hello, > > Attached is a quick fix for some typos in stdint.pxd. > > Tested with Cython version 0.15.1. > > Mansour > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From mansourmoufid at gmail.com Tue Jan 3 02:37:34 2012 From: mansourmoufid at gmail.com (Mansour Moufid) Date: Mon, 2 Jan 2012 20:37:34 -0500 Subject: [Cython] Fix integer width constant names in stdint.pxd In-Reply-To: References:

Message-ID: Now my issue is as follows. (I CCed the cython-users list if this question is more appropriate there.) I have a simple file, int.pyx: from libc.stdint cimport * print long(UINT8_MAX) print long(UINT16_MAX) print long(UINT32_MAX) print long(UINT64_MAX) with the usual setup.py stuff. Compiling and running: $ python setup.py build_ext --inplace ... int.c:566:3: warning: overflow in implicit constant conversion [-Woverflow] ... $ python -c 'import int' 255 65535 -1 -1 So obviously there are overflows here. Checking int.c, I see: /* "int.pyx":2 * from libc.stdint cimport * * print long(UINT8_MAX) # <<<<<<<<<<<<<< * print long(UINT16_MAX) * print long(UINT32_MAX) */ __pyx_t_1 = PyInt_FromLong(UINT8_MAX); and so on... PyInt_FromLong is used for all these constants, regardless of signedness or width, so any argument larger than LONG_MAX overflows, *before* being converted to the arbitrary-size Python integer type. I don't know if this is a bug, or if I'm overlooking something. Is there a way for me to use these constants with Python's arbitrary-size integers? Thanks, Mansour From dalcinl at gmail.com Tue Jan 3 02:48:40 2012 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Mon, 2 Jan 2012 22:48:40 -0300 Subject: [Cython] Fix integer width constant names in stdint.pxd In-Reply-To: References:

Message-ID: On 2 January 2012 22:37, Mansour Moufid wrote: > Now my issue is as follows. > > (I CCed the cython-users list if this question is more appropriate there.) > > I have a simple file, int.pyx: > > from libc.stdint cimport * > print long(UINT8_MAX) > print long(UINT16_MAX) > print long(UINT32_MAX) > print long(UINT64_MAX) > > with the usual setup.py stuff. Compiling and running: > > $ python setup.py build_ext --inplace > ... > int.c:566:3: warning: overflow in implicit constant conversion [-Woverflow] > ... > $ python -c 'import int' > 255 > 65535 > -1 > -1 > > So obviously there are overflows here. Checking int.c, I see: > > ?/* "int.pyx":2 > ?* from libc.stdint cimport * > ?* print long(UINT8_MAX) ? ? ? ? ? ? # <<<<<<<<<<<<<< > ?* print long(UINT16_MAX) > ?* print long(UINT32_MAX) > ?*/ > ?__pyx_t_1 = PyInt_FromLong(UINT8_MAX); > > and so on... > > PyInt_FromLong is used for all these constants, regardless of > signedness or width, so any argument larger than LONG_MAX overflows, > *before* being converted to the arbitrary-size Python integer type. > > I don't know if this is a bug, or if I'm overlooking something. Is > there a way for me to use these constants with Python's arbitrary-size > integers? > All these constants are declared as "enum", so Cython promotes them to "int". Once again, Cython should have something like a "const" type qualifier to poperly declare these compile-time constants. As workaround, you could explicitly cast the constants like this "print long(UINT8_MAX)" -- Lisandro Dalcin --------------- CIMEC (INTEC/CONICET-UNL) Predio CONICET-Santa Fe Colectora RN 168 Km 472, Paraje El Pozo 3000 Santa Fe, Argentina Tel: +54-342-4511594 (ext 1011) Tel/Fax: +54-342-4511169 From mansourmoufid at gmail.com Tue Jan 3 02:53:54 2012 From: mansourmoufid at gmail.com (Mansour Moufid) Date: Mon, 2 Jan 2012 20:53:54 -0500 Subject: [Cython] Fix integer width constant names in stdint.pxd In-Reply-To: References:

Message-ID: On Mon, Jan 2, 2012 at 8:48 PM, Lisandro Dalcin wrote: > On 2 January 2012 22:37, Mansour Moufid wrote: >> Now my issue is as follows. >> >> (I CCed the cython-users list if this question is more appropriate there.) >> >> I have a simple file, int.pyx: >> >> from libc.stdint cimport * >> print long(UINT8_MAX) >> print long(UINT16_MAX) >> print long(UINT32_MAX) >> print long(UINT64_MAX) >> >> with the usual setup.py stuff. Compiling and running: >> >> $ python setup.py build_ext --inplace >> ... >> int.c:566:3: warning: overflow in implicit constant conversion [-Woverflow] >> ... >> $ python -c 'import int' >> 255 >> 65535 >> -1 >> -1 >> >> So obviously there are overflows here. Checking int.c, I see: >> >> ?/* "int.pyx":2 >> ?* from libc.stdint cimport * >> ?* print long(UINT8_MAX) ? ? ? ? ? ? # <<<<<<<<<<<<<< >> ?* print long(UINT16_MAX) >> ?* print long(UINT32_MAX) >> ?*/ >> ?__pyx_t_1 = PyInt_FromLong(UINT8_MAX); >> >> and so on... >> >> PyInt_FromLong is used for all these constants, regardless of >> signedness or width, so any argument larger than LONG_MAX overflows, >> *before* being converted to the arbitrary-size Python integer type. >> >> I don't know if this is a bug, or if I'm overlooking something. Is >> there a way for me to use these constants with Python's arbitrary-size >> integers? >> > > All these constants are declared as "enum", so Cython promotes them to > "int". Once again, Cython should have something like a "const" type > qualifier to poperly declare these compile-time constants. > > As workaround, you could explicitly cast the constants like this > "print long(UINT8_MAX)" This works great. Exactly what I was looking for, thanks. Mansour From robertwb at math.washington.edu Tue Jan 3 03:00:23 2012 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Mon, 2 Jan 2012 18:00:23 -0800 Subject: [Cython] Fix integer width constant names in stdint.pxd In-Reply-To: References:

Message-ID: On Mon, Jan 2, 2012 at 5:48 PM, Lisandro Dalcin wrote: > On 2 January 2012 22:37, Mansour Moufid wrote: >> Now my issue is as follows. >> >> (I CCed the cython-users list if this question is more appropriate there.) >> >> I have a simple file, int.pyx: >> >> from libc.stdint cimport * >> print long(UINT8_MAX) >> print long(UINT16_MAX) >> print long(UINT32_MAX) >> print long(UINT64_MAX) >> >> with the usual setup.py stuff. Compiling and running: >> >> $ python setup.py build_ext --inplace >> ... >> int.c:566:3: warning: overflow in implicit constant conversion [-Woverflow] >> ... >> $ python -c 'import int' >> 255 >> 65535 >> -1 >> -1 >> >> So obviously there are overflows here. Checking int.c, I see: >> >> ?/* "int.pyx":2 >> ?* from libc.stdint cimport * >> ?* print long(UINT8_MAX) ? ? ? ? ? ? # <<<<<<<<<<<<<< >> ?* print long(UINT16_MAX) >> ?* print long(UINT32_MAX) >> ?*/ >> ?__pyx_t_1 = PyInt_FromLong(UINT8_MAX); >> >> and so on... >> >> PyInt_FromLong is used for all these constants, regardless of >> signedness or width, so any argument larger than LONG_MAX overflows, >> *before* being converted to the arbitrary-size Python integer type. >> >> I don't know if this is a bug, or if I'm overlooking something. Is >> there a way for me to use these constants with Python's arbitrary-size >> integers? >> > > All these constants are declared as "enum", so Cython promotes them to > "int". Once again, Cython should have something like a "const" type > qualifier to poperly declare these compile-time constants. > > As workaround, you could explicitly cast the constants like this > "print long(UINT8_MAX)" I'm leaning towards declaring them as being the proper type to begin with; what's to be gained by declaring these extern values as enums (=const)? At least with the larger types we should do this to avoid patently incorrect behavior, and this way they would be consistant with the actual C for arithmetic promotion, etc. - Robert From stefan_ml at behnel.de Tue Jan 3 13:10:07 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Tue, 03 Jan 2012 13:10:07 +0100 Subject: [Cython] Fix integer width constant names in stdint.pxd In-Reply-To: References:

Message-ID: <4F02F01F.3090900@behnel.de> Robert Bradshaw, 03.01.2012 03:00: > On Mon, Jan 2, 2012 at 5:48 PM, Lisandro Dalcin wrote: >> On 2 January 2012 22:37, Mansour Moufid wrote: >>> Now my issue is as follows. >>> >>> (I CCed the cython-users list if this question is more appropriate there.) >>> >>> I have a simple file, int.pyx: >>> >>> from libc.stdint cimport * >>> print long(UINT8_MAX) >>> print long(UINT16_MAX) >>> print long(UINT32_MAX) >>> print long(UINT64_MAX) >>> >>> with the usual setup.py stuff. Compiling and running: >>> >>> $ python setup.py build_ext --inplace >>> ... >>> int.c:566:3: warning: overflow in implicit constant conversion [-Woverflow] >>> ... >>> $ python -c 'import int' >>> 255 >>> 65535 >>> -1 >>> -1 >>> >>> So obviously there are overflows here. Checking int.c, I see: >>> >>> /* "int.pyx":2 >>> * from libc.stdint cimport * >>> * print long(UINT8_MAX) #<<<<<<<<<<<<<< >>> * print long(UINT16_MAX) >>> * print long(UINT32_MAX) >>> */ >>> __pyx_t_1 = PyInt_FromLong(UINT8_MAX); >>> >>> and so on... >>> >>> PyInt_FromLong is used for all these constants, regardless of >>> signedness or width, so any argument larger than LONG_MAX overflows, >>> *before* being converted to the arbitrary-size Python integer type. >>> >>> I don't know if this is a bug, or if I'm overlooking something. Is >>> there a way for me to use these constants with Python's arbitrary-size >>> integers? >>> >> >> All these constants are declared as "enum", so Cython promotes them to >> "int". Once again, Cython should have something like a "const" type >> qualifier to poperly declare these compile-time constants. >> >> As workaround, you could explicitly cast the constants like this >> "print long(UINT8_MAX)" > > I'm leaning towards declaring them as being the proper type to begin > with; what's to be gained by declaring these extern values as enums > (=const)? At least with the larger types we should do this to avoid > patently incorrect behavior, and this way they would be consistant > with the actual C for arithmetic promotion, etc. +1 Stefan From dalcinl at gmail.com Thu Jan 5 00:52:12 2012 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Wed, 4 Jan 2012 20:52:12 -0300 Subject: [Cython] Fix integer width constant names in stdint.pxd In-Reply-To: References:

Message-ID: On 2 January 2012 23:00, Robert Bradshaw wrote: > On Mon, Jan 2, 2012 at 5:48 PM, Lisandro Dalcin wrote: >> On 2 January 2012 22:37, Mansour Moufid wrote: >>> Now my issue is as follows. >>> >>> (I CCed the cython-users list if this question is more appropriate there.) >>> >>> I have a simple file, int.pyx: >>> >>> from libc.stdint cimport * >>> print long(UINT8_MAX) >>> print long(UINT16_MAX) >>> print long(UINT32_MAX) >>> print long(UINT64_MAX) >>> >>> with the usual setup.py stuff. Compiling and running: >>> >>> $ python setup.py build_ext --inplace >>> ... >>> int.c:566:3: warning: overflow in implicit constant conversion [-Woverflow] >>> ... >>> $ python -c 'import int' >>> 255 >>> 65535 >>> -1 >>> -1 >>> >>> So obviously there are overflows here. Checking int.c, I see: >>> >>> ?/* "int.pyx":2 >>> ?* from libc.stdint cimport * >>> ?* print long(UINT8_MAX) ? ? ? ? ? ? # <<<<<<<<<<<<<< >>> ?* print long(UINT16_MAX) >>> ?* print long(UINT32_MAX) >>> ?*/ >>> ?__pyx_t_1 = PyInt_FromLong(UINT8_MAX); >>> >>> and so on... >>> >>> PyInt_FromLong is used for all these constants, regardless of >>> signedness or width, so any argument larger than LONG_MAX overflows, >>> *before* being converted to the arbitrary-size Python integer type. >>> >>> I don't know if this is a bug, or if I'm overlooking something. Is >>> there a way for me to use these constants with Python's arbitrary-size >>> integers? >>> >> >> All these constants are declared as "enum", so Cython promotes them to >> "int". Once again, Cython should have something like a "const" type >> qualifier to poperly declare these compile-time constants. >> >> As workaround, you could explicitly cast the constants like this >> "print long(UINT8_MAX)" > > I'm leaning towards declaring them as being the proper type to begin > with; what's to be gained by declaring these extern values as enums > (=const)? At least with the larger types we should do this to avoid > patently incorrect behavior, and this way they would be consistant > with the actual C for arithmetic promotion, etc. > Not sure about recent Cython releases, but in older ones you cannot do: cdef char buf[UINT8_MAX] unless UINT8_MAX was declared in Cython as a compile time constant. However, I do agree that for the case of stdint.h, using matching types instead of "enum" is way better. -- Lisandro Dalcin --------------- CIMEC (INTEC/CONICET-UNL) Predio CONICET-Santa Fe Colectora RN 168 Km 472, Paraje El Pozo 3000 Santa Fe, Argentina Tel: +54-342-4511594 (ext 1011) Tel/Fax: +54-342-4511169 From vitja.makarov at gmail.com Sat Jan 14 15:19:24 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Sat, 14 Jan 2012 18:19:24 +0400 Subject: [Cython] sage-tests failures Message-ID: I've recently merged my def-node-refactoring branch and found some bugs, thanks to sage-build. Then I've found that sage-tests has >100 failures. So I'm wondering does anybody know what's wrong with sage-tests? -- vitja. From robertwb at math.washington.edu Sat Jan 14 18:38:11 2012 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Sat, 14 Jan 2012 09:38:11 -0800 Subject: [Cython] sage-tests failures In-Reply-To: References: Message-ID: On Sat, Jan 14, 2012 at 6:19 AM, Vitja Makarov wrote: > I've recently merged my def-node-refactoring branch and found some > bugs, thanks to sage-build. > > Then I've found that sage-tests has >100 failures. > So I'm wondering does anybody know what's wrong with sage-tests? Yeah, sage-tests is a great stress-tester for Cython. There were a couple of spurious failures before, but nothing this bad. I blame https://github.com/cython/cython/commit/bce8b981a3e71378a164e8c9acca5f00bbbe32d7 which is causing variables to be undefined. Arguably, this is a bug in Sage, but it was a backwards incompatible change. - Robert From robertwb at math.washington.edu Sun Jan 15 01:08:39 2012 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Sat, 14 Jan 2012 16:08:39 -0800 Subject: [Cython] sage-tests failures In-Reply-To: References: Message-ID: OK, thinks are looking a lot better, but there's still quite a few random segfaults: https://sage.math.washington.edu:8091/hudson/view/ext-libs/job/sage-tests/674/console On Sat, Jan 14, 2012 at 9:38 AM, Robert Bradshaw wrote: > On Sat, Jan 14, 2012 at 6:19 AM, Vitja Makarov wrote: >> I've recently merged my def-node-refactoring branch and found some >> bugs, thanks to sage-build. >> >> Then I've found that sage-tests has >100 failures. >> So I'm wondering does anybody know what's wrong with sage-tests? > > Yeah, sage-tests is a great stress-tester for Cython. There were a > couple of spurious failures before, but nothing this bad. > > I blame https://github.com/cython/cython/commit/bce8b981a3e71378a164e8c9acca5f00bbbe32d7 > which is causing variables to be undefined. Arguably, this is a bug in > Sage, but it was a backwards incompatible change. > > - Robert From vitja.makarov at gmail.com Sun Jan 15 18:32:42 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Sun, 15 Jan 2012 21:32:42 +0400 Subject: [Cython] sage-tests failures In-Reply-To: References: Message-ID: 2012/1/15 Robert Bradshaw : > OK, thinks are looking a lot better, but there's still quite a few > random segfaults: > > https://sage.math.washington.edu:8091/hudson/view/ext-libs/job/sage-tests/674/console > Cool. Have you modified a private copy of sage? I've tried to reproduce segfaults at home but I was unable to compile sage due to incompatible changes in cython. > On Sat, Jan 14, 2012 at 9:38 AM, Robert Bradshaw > wrote: >> On Sat, Jan 14, 2012 at 6:19 AM, Vitja Makarov wrote: >>> I've recently merged my def-node-refactoring branch and found some >>> bugs, thanks to sage-build. >>> >>> Then I've found that sage-tests has >100 failures. >>> So I'm wondering does anybody know what's wrong with sage-tests? >> >> Yeah, sage-tests is a great stress-tester for Cython. There were a >> couple of spurious failures before, but nothing this bad. >> >> I blame https://github.com/cython/cython/commit/bce8b981a3e71378a164e8c9acca5f00bbbe32d7 >> which is causing variables to be undefined. Arguably, this is a bug in >> Sage, but it was a backwards incompatible change. >> >> - Robert > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel -- vitja. From dtcaciuc at gmail.com Sun Jan 15 19:01:43 2012 From: dtcaciuc at gmail.com (Dimitri Tcaciuc) Date: Sun, 15 Jan 2012 10:01:43 -0800 Subject: [Cython] Cannot assign type 'set &' to 'set' In-Reply-To: References: Message-ID: Hi folks, Since the original question, I've created a pull request with a failing test and tried to get some discussion going, but so far no answer. I'm a tad discouraged, since obviously there's movement on mail list and with pull requests. Is a pull request a proper way to do this? I completely understand if you guys don't have enough available time to deal with it right now, however at least some acknowledgement and feedback would be much appreciated. Thanks, Dimitri. On Sun, Dec 18, 2011 at 8:17 PM, Dimitri Tcaciuc wrote: > Hello everyone, > > Here's a small test case I'm trying to compile. I'm trying to pass a > STL set reference to a method in a template class. > > x.pyx: > > ? ?from libcpp.set cimport set as cpp_set > > ? ?cdef extern from "x.hh": > > ? ? ? ?cdef cppclass Foo [T]: > ? ? ? ? ? ?Foo() > ? ? ? ? ? ?void set_x(cpp_set[size_t] & x) > > ? ?cpdef func(): > ? ? ? ?cdef Foo[int] foo > > ? ? ? ?cdef cpp_set[size_t] x > ? ? ? ?cdef cpp_set[size_t] & xref = x > > ? ? ? ?foo.set_x(xref) > > x.hh: > > ? ?#include > > ? ?template > ? ?struct Foo { > ? ? ? ?void set_x(const std::set & x) { /* do nothing */ } > ? ?}; > > To compile, > > ? ?bash $ cython --cplus x.pyx > > Which results in > > ? ?foo.set_x(xref) > ? ? ? ? ? ? ? ? ^ > ------------------------------------------------------------ > x.pyx:15:18: Cannot assign type 'set &' to 'set' > > > However, if I remove the template parameter from Foo, everything works. > > > y.pyx: > > ? ?from libcpp.set cimport set as cpp_set > > ? ?cdef extern from "y.hh": > > ? ? ? ?cdef cppclass Foo: > ? ? ? ? ? ?Foo() > ? ? ? ? ? ?void set_x(cpp_set[size_t] & x) > > ? ?cpdef func(): > ? ? ? ?cdef Foo foo > > ? ? ? ?cdef cpp_set[size_t] x > ? ? ? ?cdef cpp_set[size_t] & xref = x > > ? ? ? ?foo.set_x(xref) > > y.hh: > > ? ?#include > > ? ?struct Foo { > ? ? ? ?void set_x(const std::set & x) { /* do nothing */ } > ? ?}; > > > From what I can tell, the CppClassType instance the CReferenceType is > pointing to has the correct name "set", however it's a > different class instance. The particular failing expression is in > `ExprNode.coerce_to` > > ? ?if not (str(src.type) == str(dst_type) or > dst_type.assignable_from(src_type)) > > > I wish I could suggest a patch, but unfortunately I'm a complete > newbie to Cython internals. Perhaps someone could give a few pointers > as to what should be done to fix this? > > Thanks, > > > Dimitri From markflorisson88 at gmail.com Mon Jan 16 19:09:47 2012 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 16 Jan 2012 18:09:47 +0000 Subject: [Cython] Cannot assign type 'set &' to 'set' In-Reply-To: References: Message-ID: Dear Dimitri, Sorry for the delay, many developers are busy with their lives. Thanks for the report, it think that this was broken in 464923673475879fedc103ef2ee0260ba88d1493, the culprit is https://github.com/cython/cython/blob/master/Cython/Compiler/ExprNodes.py#L603 , I think it should read 'if dst_type.is_reference and not self.type.is_reference:'. If you want you could substitute that line with this line and write a simple test and make a new pull request, we will merge it. Cheers, Mark On 15 January 2012 18:01, Dimitri Tcaciuc wrote: > Hi folks, > > Since the original question, I've created a pull request with a > failing test and tried to get some discussion going, but so far no > answer. I'm a tad discouraged, since obviously there's movement on > mail list and with pull requests. Is a pull request a proper way to do > this? I completely understand if you guys don't have enough available > time to deal with it right now, however at least some acknowledgement > and feedback would be much appreciated. > > Thanks, > > > Dimitri. > > On Sun, Dec 18, 2011 at 8:17 PM, Dimitri Tcaciuc wrote: >> Hello everyone, >> >> Here's a small test case I'm trying to compile. I'm trying to pass a >> STL set reference to a method in a template class. >> >> x.pyx: >> >> ? ?from libcpp.set cimport set as cpp_set >> >> ? ?cdef extern from "x.hh": >> >> ? ? ? ?cdef cppclass Foo [T]: >> ? ? ? ? ? ?Foo() >> ? ? ? ? ? ?void set_x(cpp_set[size_t] & x) >> >> ? ?cpdef func(): >> ? ? ? ?cdef Foo[int] foo >> >> ? ? ? ?cdef cpp_set[size_t] x >> ? ? ? ?cdef cpp_set[size_t] & xref = x >> >> ? ? ? ?foo.set_x(xref) >> >> x.hh: >> >> ? ?#include >> >> ? ?template >> ? ?struct Foo { >> ? ? ? ?void set_x(const std::set & x) { /* do nothing */ } >> ? ?}; >> >> To compile, >> >> ? ?bash $ cython --cplus x.pyx >> >> Which results in >> >> ? ?foo.set_x(xref) >> ? ? ? ? ? ? ? ? ^ >> ------------------------------------------------------------ >> x.pyx:15:18: Cannot assign type 'set &' to 'set' >> >> >> However, if I remove the template parameter from Foo, everything works. >> >> >> y.pyx: >> >> ? ?from libcpp.set cimport set as cpp_set >> >> ? ?cdef extern from "y.hh": >> >> ? ? ? ?cdef cppclass Foo: >> ? ? ? ? ? ?Foo() >> ? ? ? ? ? ?void set_x(cpp_set[size_t] & x) >> >> ? ?cpdef func(): >> ? ? ? ?cdef Foo foo >> >> ? ? ? ?cdef cpp_set[size_t] x >> ? ? ? ?cdef cpp_set[size_t] & xref = x >> >> ? ? ? ?foo.set_x(xref) >> >> y.hh: >> >> ? ?#include >> >> ? ?struct Foo { >> ? ? ? ?void set_x(const std::set & x) { /* do nothing */ } >> ? ?}; >> >> >> From what I can tell, the CppClassType instance the CReferenceType is >> pointing to has the correct name "set", however it's a >> different class instance. The particular failing expression is in >> `ExprNode.coerce_to` >> >> ? ?if not (str(src.type) == str(dst_type) or >> dst_type.assignable_from(src_type)) >> >> >> I wish I could suggest a patch, but unfortunately I'm a complete >> newbie to Cython internals. Perhaps someone could give a few pointers >> as to what should be done to fix this? >> >> Thanks, >> >> >> Dimitri > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel From vitja.makarov at gmail.com Wed Jan 18 21:30:43 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Thu, 19 Jan 2012 00:30:43 +0400 Subject: [Cython] Speedup module-level lookup Message-ID: I tried to optimize module lookups (__pyx_m) by caching internal PyDict state. In this example bar() is 1.6 time faster (500us against 842us): C = 123 def foo(a): ? ? return C * adef bar(): ? ? for i in range(10000):? ? ? ? foo(i) Here is proof of concept:https://github.com/vitek/cython/commit/1d134fe54a74e6fc6d39d09973db499680b2a8d9 So the question is: does it worth it? -- vitja. From robertwb at math.washington.edu Thu Jan 19 08:28:14 2012 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Wed, 18 Jan 2012 23:28:14 -0800 Subject: [Cython] Speedup module-level lookup In-Reply-To: References: Message-ID: I think the right thing to do here is make all module-level globals into "cdef public" attributes, i.e. C globals with getters and setters for Python space. I'm not sure whether this would best be done by creating a custom dict or module subclass, but it would probably be cleaner and afford much more than a 1.6x speedup. - Robert On Wed, Jan 18, 2012 at 12:30 PM, Vitja Makarov wrote: > I tried to optimize module lookups (__pyx_m) by caching internal PyDict state. > > In this example bar() is 1.6 time faster (500us against 842us): > > C = 123 > def foo(a): > ? ? return C * adef bar(): > ? ? for i in range(10000):? ? ? ? foo(i) > Here is proof of > concept:https://github.com/vitek/cython/commit/1d134fe54a74e6fc6d39d09973db499680b2a8d9 > > So the question is: does it worth it? > > -- > vitja. > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel From sccolbert at gmail.com Thu Jan 19 08:42:11 2012 From: sccolbert at gmail.com (Chris Colbert) Date: Thu, 19 Jan 2012 01:42:11 -0600 Subject: [Cython] Speedup module-level lookup In-Reply-To: References: Message-ID: AFAIK, a module's dict is readonly, so I don't believe a dict subclass will work there (I could be wrong) unless you hack up the module object from C. You can do it with descriptors on a ModuleType however, which should be plenty fast from Cython-land. In [16]: class AGetter(object): ....: def __get__(self, obj, cls): ....: return obj.a ....: def __set__(self, obj, val): ....: obj.a = val ....: In [17]: class MyMod(types.ModuleType): ....: b = AGetter() ....: In [18]: mmod = MyMod('my_mod') In [20]: mmod.__dict__['a'] = 42 In [21]: mmod.a Out[21]: 42 In [22]: mmod.b Out[22]: 42 In [23]: mmod.b = 87 In [24]: mmod.a Out[24]: 87 -------------- next part -------------- An HTML attachment was scrubbed... URL: From vitja.makarov at gmail.com Thu Jan 19 08:49:51 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Thu, 19 Jan 2012 11:49:51 +0400 Subject: [Cython] Speedup module-level lookup In-Reply-To: References: Message-ID: 2012/1/19 Robert Bradshaw : > I think the right thing to do here is make all module-level globals > into "cdef public" attributes, i.e. C globals with getters and setters > for Python space. I'm not sure whether this would best be done by > creating a custom dict or module subclass, but it would probably be > cleaner and afford much more than a 1.6x speedup. > > - Robert > > On Wed, Jan 18, 2012 at 12:30 PM, Vitja Makarov wrote: >> I tried to optimize module lookups (__pyx_m) by caching internal PyDict state. >> >> In this example bar() is 1.6 time faster (500us against 842us): >> >> C = 123 >> def foo(a): >> ? ? return C * adef bar(): >> ? ? for i in range(10000):? ? ? ? foo(i) >> Here is proof of >> concept:https://github.com/vitek/cython/commit/1d134fe54a74e6fc6d39d09973db499680b2a8d9 >> >> So the question is: does it worth it? >> Yes, nice idea. It's possible to subclass PyModuleObject and I didn't find any use of PyModule_CheckExact() in CPython's sources: import types import sys global_foo = 1234 class CustomModule(types.ModuleType): def __init__(self, name): types.ModuleType.__init__(self, name) sys.modules[name] = self @property def foo(self): return global_foo @foo.setter def foo(self, value): global global_foo global_foo = value CustomModule('foo') import foo print foo.foo -- vitja. From vitja.makarov at gmail.com Thu Jan 19 08:53:23 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Thu, 19 Jan 2012 11:53:23 +0400 Subject: [Cython] Speedup module-level lookup In-Reply-To: References: Message-ID: 2012/1/19 Vitja Makarov : > 2012/1/19 Robert Bradshaw : >> I think the right thing to do here is make all module-level globals >> into "cdef public" attributes, i.e. C globals with getters and setters >> for Python space. I'm not sure whether this would best be done by >> creating a custom dict or module subclass, but it would probably be >> cleaner and afford much more than a 1.6x speedup. >> >> - Robert >> >> On Wed, Jan 18, 2012 at 12:30 PM, Vitja Makarov wrote: >>> I tried to optimize module lookups (__pyx_m) by caching internal PyDict state. >>> >>> In this example bar() is 1.6 time faster (500us against 842us): >>> >>> C = 123 >>> def foo(a): >>> ? ? return C * adef bar(): >>> ? ? for i in range(10000):? ? ? ? foo(i) >>> Here is proof of >>> concept:https://github.com/vitek/cython/commit/1d134fe54a74e6fc6d39d09973db499680b2a8d9 >>> >>> So the question is: does it worth it? >>> > > Yes, nice idea. > It's possible to subclass PyModuleObject and I didn't find any use of > PyModule_CheckExact() in CPython's sources: > > import types > import sys > > global_foo = 1234 > > class CustomModule(types.ModuleType): > ? ?def __init__(self, name): > ? ? ? ?types.ModuleType.__init__(self, name) > ? ? ? ?sys.modules[name] = self > > ? ?@property > ? ?def foo(self): > ? ? ? ?return global_foo > > ? ?@foo.setter > ? ?def foo(self, value): > ? ? ? ?global global_foo > ? ? ? ?global_foo = value > > CustomModule('foo') > > import foo > print foo.foo > But this seems to break globals(). -- vitja. From robertwb at math.washington.edu Thu Jan 19 09:00:51 2012 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 19 Jan 2012 00:00:51 -0800 Subject: [Cython] Speedup module-level lookup In-Reply-To: References:

Message-ID: On Wed, Jan 18, 2012 at 11:53 PM, Vitja Makarov wrote: > 2012/1/19 Vitja Makarov : >> 2012/1/19 Robert Bradshaw : >>> I think the right thing to do here is make all module-level globals >>> into "cdef public" attributes, i.e. C globals with getters and setters >>> for Python space. I'm not sure whether this would best be done by >>> creating a custom dict or module subclass, but it would probably be >>> cleaner and afford much more than a 1.6x speedup. >>> >>> - Robert >>> >>> On Wed, Jan 18, 2012 at 12:30 PM, Vitja Makarov wrote: >>>> I tried to optimize module lookups (__pyx_m) by caching internal PyDict state. >>>> >>>> In this example bar() is 1.6 time faster (500us against 842us): >>>> >>>> C = 123 >>>> def foo(a): >>>> ? ? return C * adef bar(): >>>> ? ? for i in range(10000):? ? ? ? foo(i) >>>> Here is proof of >>>> concept:https://github.com/vitek/cython/commit/1d134fe54a74e6fc6d39d09973db499680b2a8d9 >>>> >>>> So the question is: does it worth it? >>>> >> >> Yes, nice idea. >> It's possible to subclass PyModuleObject and I didn't find any use of >> PyModule_CheckExact() in CPython's sources: >> >> import types >> import sys >> >> global_foo = 1234 >> >> class CustomModule(types.ModuleType): >> ? ?def __init__(self, name): >> ? ? ? ?types.ModuleType.__init__(self, name) >> ? ? ? ?sys.modules[name] = self >> >> ? ?@property >> ? ?def foo(self): >> ? ? ? ?return global_foo >> >> ? ?@foo.setter >> ? ?def foo(self, value): >> ? ? ? ?global global_foo >> ? ? ? ?global_foo = value >> >> CustomModule('foo') >> >> import foo >> print foo.foo >> > > But this seems to break globals(). How so? We have to hack globals() to get it to work for a Cython module anyways. (I wonder if this must return a dict, or would any mapping (or subclass of dict) be sufficient...) - Robert From markflorisson88 at gmail.com Thu Jan 19 09:04:30 2012 From: markflorisson88 at gmail.com (mark florisson) Date: Thu, 19 Jan 2012 08:04:30 +0000 Subject: [Cython] Speedup module-level lookup In-Reply-To: References:

Message-ID: On 19 January 2012 08:00, Robert Bradshaw wrote: > On Wed, Jan 18, 2012 at 11:53 PM, Vitja Makarov wrote: >> 2012/1/19 Vitja Makarov : >>> 2012/1/19 Robert Bradshaw : >>>> I think the right thing to do here is make all module-level globals >>>> into "cdef public" attributes, i.e. C globals with getters and setters >>>> for Python space. I'm not sure whether this would best be done by >>>> creating a custom dict or module subclass, but it would probably be >>>> cleaner and afford much more than a 1.6x speedup. >>>> >>>> - Robert >>>> >>>> On Wed, Jan 18, 2012 at 12:30 PM, Vitja Makarov wrote: >>>>> I tried to optimize module lookups (__pyx_m) by caching internal PyDict state. >>>>> >>>>> In this example bar() is 1.6 time faster (500us against 842us): >>>>> >>>>> C = 123 >>>>> def foo(a): >>>>> ? ? return C * adef bar(): >>>>> ? ? for i in range(10000):? ? ? ? foo(i) >>>>> Here is proof of >>>>> concept:https://github.com/vitek/cython/commit/1d134fe54a74e6fc6d39d09973db499680b2a8d9 >>>>> >>>>> So the question is: does it worth it? >>>>> >>> >>> Yes, nice idea. >>> It's possible to subclass PyModuleObject and I didn't find any use of >>> PyModule_CheckExact() in CPython's sources: >>> >>> import types >>> import sys >>> >>> global_foo = 1234 >>> >>> class CustomModule(types.ModuleType): >>> ? ?def __init__(self, name): >>> ? ? ? ?types.ModuleType.__init__(self, name) >>> ? ? ? ?sys.modules[name] = self >>> >>> ? ?@property >>> ? ?def foo(self): >>> ? ? ? ?return global_foo >>> >>> ? ?@foo.setter >>> ? ?def foo(self, value): >>> ? ? ? ?global global_foo >>> ? ? ? ?global_foo = value >>> >>> CustomModule('foo') >>> >>> import foo >>> print foo.foo >>> >> >> But this seems to break globals(). > > How so? We have to hack globals() to get it to work for a Cython > module anyways. (I wonder if this must return a dict, or would any > mapping (or subclass of dict) be sufficient...) > > - Robert > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel You'd also want this to work from python space using module.__dict__ or vars(module). I think the custom dict could solve this. Or would you make __dict__ a property as well? (I don't know if vars() would still break). From sccolbert at gmail.com Thu Jan 19 09:18:21 2012 From: sccolbert at gmail.com (Chris Colbert) Date: Thu, 19 Jan 2012 02:18:21 -0600 Subject: [Cython] Speedup module-level lookup In-Reply-To: References:

Message-ID: If it doesn't pass PyDict_CheckExact you won't be able to use it as the globals to eval or exec. That's assuming you hack the module type so you can change its __dict__. On Jan 19, 2012 2:01 AM, "Robert Bradshaw" wrote: > On Wed, Jan 18, 2012 at 11:53 PM, Vitja Makarov > wrote: > > 2012/1/19 Vitja Makarov : > >> 2012/1/19 Robert Bradshaw : > >>> I think the right thing to do here is make all module-level globals > >>> into "cdef public" attributes, i.e. C globals with getters and setters > >>> for Python space. I'm not sure whether this would best be done by > >>> creating a custom dict or module subclass, but it would probably be > >>> cleaner and afford much more than a 1.6x speedup. > >>> > >>> - Robert > >>> > >>> On Wed, Jan 18, 2012 at 12:30 PM, Vitja Makarov < > vitja.makarov at gmail.com> wrote: > >>>> I tried to optimize module lookups (__pyx_m) by caching internal > PyDict state. > >>>> > >>>> In this example bar() is 1.6 time faster (500us against 842us): > >>>> > >>>> C = 123 > >>>> def foo(a): > >>>> return C * adef bar(): > >>>> for i in range(10000): foo(i) > >>>> Here is proof of > >>>> concept: > https://github.com/vitek/cython/commit/1d134fe54a74e6fc6d39d09973db499680b2a8d9 > >>>> > >>>> So the question is: does it worth it? > >>>> > >> > >> Yes, nice idea. > >> It's possible to subclass PyModuleObject and I didn't find any use of > >> PyModule_CheckExact() in CPython's sources: > >> > >> import types > >> import sys > >> > >> global_foo = 1234 > >> > >> class CustomModule(types.ModuleType): > >> def __init__(self, name): > >> types.ModuleType.__init__(self, name) > >> sys.modules[name] = self > >> > >> @property > >> def foo(self): > >> return global_foo > >> > >> @foo.setter > >> def foo(self, value): > >> global global_foo > >> global_foo = value > >> > >> CustomModule('foo') > >> > >> import foo > >> print foo.foo > >> > > > > But this seems to break globals(). > > How so? We have to hack globals() to get it to work for a Cython > module anyways. (I wonder if this must return a dict, or would any > mapping (or subclass of dict) be sufficient...) > > - Robert > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robertwb at math.washington.edu Thu Jan 19 09:53:49 2012 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Thu, 19 Jan 2012 00:53:49 -0800 Subject: [Cython] Speedup module-level lookup In-Reply-To: References:

Message-ID: On Thu, Jan 19, 2012 at 12:18 AM, Chris Colbert wrote: > If it doesn't pass PyDict_CheckExact you won't be able to use it as the > globals to eval or exec. :(. I wonder how many other places have similar restrictions, perhaps even implicitly. In particular, this would mean that an eval statement modifying globals() would be difficult to efficiently detect. Still, if this can be done at all, the massive speedup for the common case could make it worth it. > That's assuming you hack the module type so you can > change its __dict__. I don't think that's near as big of a hurdle (from C). > On Jan 19, 2012 2:01 AM, "Robert Bradshaw" > wrote: >> >> On Wed, Jan 18, 2012 at 11:53 PM, Vitja Makarov >> wrote: >> > 2012/1/19 Vitja Makarov : >> >> 2012/1/19 Robert Bradshaw : >> >>> I think the right thing to do here is make all module-level globals >> >>> into "cdef public" attributes, i.e. C globals with getters and setters >> >>> for Python space. I'm not sure whether this would best be done by >> >>> creating a custom dict or module subclass, but it would probably be >> >>> cleaner and afford much more than a 1.6x speedup. >> >>> >> >>> - Robert >> >>> >> >>> On Wed, Jan 18, 2012 at 12:30 PM, Vitja Makarov >> >>> wrote: >> >>>> I tried to optimize module lookups (__pyx_m) by caching internal >> >>>> PyDict state. >> >>>> >> >>>> In this example bar() is 1.6 time faster (500us against 842us): >> >>>> >> >>>> C = 123 >> >>>> def foo(a): >> >>>> ? ? return C * adef bar(): >> >>>> ? ? for i in range(10000):? ? ? ? foo(i) >> >>>> Here is proof of >> >>>> >> >>>> concept:https://github.com/vitek/cython/commit/1d134fe54a74e6fc6d39d09973db499680b2a8d9 >> >>>> >> >>>> So the question is: does it worth it? >> >>>> >> >> >> >> Yes, nice idea. >> >> It's possible to subclass PyModuleObject and I didn't find any use of >> >> PyModule_CheckExact() in CPython's sources: >> >> >> >> import types >> >> import sys >> >> >> >> global_foo = 1234 >> >> >> >> class CustomModule(types.ModuleType): >> >> ? ?def __init__(self, name): >> >> ? ? ? ?types.ModuleType.__init__(self, name) >> >> ? ? ? ?sys.modules[name] = self >> >> >> >> ? ?@property >> >> ? ?def foo(self): >> >> ? ? ? ?return global_foo >> >> >> >> ? ?@foo.setter >> >> ? ?def foo(self, value): >> >> ? ? ? ?global global_foo >> >> ? ? ? ?global_foo = value >> >> >> >> CustomModule('foo') >> >> >> >> import foo >> >> print foo.foo >> >> >> > >> > But this seems to break globals(). >> >> How so? We have to hack globals() to get it to work for a Cython >> module anyways. (I wonder if this must return a dict, or would any >> mapping (or subclass of dict) be sufficient...) >> >> - Robert >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel > > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > From anders at embl.de Thu Jan 19 19:26:27 2012 From: anders at embl.de (Simon Anders) Date: Thu, 19 Jan 2012 19:26:27 +0100 Subject: [Cython] Cython crash: C++ class with missing default constructor Message-ID: <4F186053.1000706@embl.de> Hi, the following very short Cython code crashes the Cython compiler (v0.15.1): ---8<--- cdef extern from "foo.h": cdef cppclass foo: pass foo() ---8<--- The stack trace is attached. Best regards Simon -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: traceback URL: From anders at embl.de Thu Jan 19 21:10:47 2012 From: anders at embl.de (Simon Anders) Date: Thu, 19 Jan 2012 21:10:47 +0100 Subject: [Cython] non-template nested C++ classes Message-ID: <4F1878C7.1060705@embl.de> Hi, I'm currently experimenting with Cython's support for nested C++ classes and might havce encountered a bug. I have attempted to strip down the example from the documentation to its bare minimum. This here works fine: ---8<--- cdef extern from "foo": cdef cppclass outer[T]: cppclass inner: pass cdef outer[int].inner foo ---8<--- Next, I remove the template parameter as well. After all, not every outer class containing an inner class is a template class. ---8<--- cdef extern from "foo": cdef cppclass outer: cppclass inner: pass cdef outer.inner foo ---8<--- Now, I get this error message: 'outer' is not a cimported module It seems that without the square brackets, Cython no longer recognizes 'outer' as a class name and thinks it must be a module because it is followed by a dot. I suppose this is not what should happen, right? Best regards Simon From anders at embl.de Thu Jan 19 22:46:48 2012 From: anders at embl.de (Simon Anders) Date: Thu, 19 Jan 2012 22:46:48 +0100 Subject: [Cython] nested C++ classes, the third: default constructor Message-ID: <4F188F48.2080704@embl.de> Hi, sorry for spreading these issues into three mails, but I cannot quite figure out whether they are related or not. So, here is the third installment of my adventures with nested classes. Consider the following code which compiles fine: ---8<--- cdef extern from "foo": cdef cppclass outer[T]: outer( ) cppclass inner: pass cdef outer[int].inner bar ---8<--- If I change 'outer' to lose its default constructor, the last line, which instantiates 'inner', causes an error: ---8<--- cdef extern from "foo": cdef cppclass outer[T]: outer( int ) cppclass inner: pass cdef outer[int].inner bar ---8<--- The only change is that the constructor to 'outer' now takes an argument. Now, the last line causes this error: "C++ class must have a default constructor to be stack allocated". However, 'inner' does have a default constructor, only 'outer' does not. Could it be that Cython looks for the constructor for the outer class where it should look for the constructor for the inner class? Cheers Simon From mansourmoufid at gmail.com Fri Jan 20 20:13:28 2012 From: mansourmoufid at gmail.com (Mansour Moufid) Date: Fri, 20 Jan 2012 14:13:28 -0500 Subject: [Cython] More typed constants (was: Fix integer width constant names in stdint.pxd) Message-ID: Hello again, Attached is a patch that continues with the idea of declaring constants using their corresponding type. Great work on Cython, by the way. It's very useful. Mansour -------------- next part -------------- A non-text attachment was scrubbed... Name: 0001-Continue-defining-constants-using-corresponding-type.patch Type: text/x-patch Size: 3920 bytes Desc: not available URL: From stefan_ml at behnel.de Sat Jan 21 06:58:14 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sat, 21 Jan 2012 06:58:14 +0100 Subject: [Cython] Speedup module-level lookup In-Reply-To: References:

Message-ID: <4F1A53F6.2050906@behnel.de> Chris Colbert, 19.01.2012 09:18: > If it doesn't pass PyDict_CheckExact you won't be able to use it as the > globals to eval or exec. What makes you say that? I tried and it worked for me, all the way back to Python 2.4: -------------------- Python 2.4.6 (#2, Jan 21 2010, 23:45:25) [GCC 4.4.1] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> class MyDict(dict): pass >>> eval('1+1', MyDict()) 2 >>> exec '1+1' in MyDict() >>> -------------------- I only see a couple of calls to PyDict_CheckExact() in CPython's sources and they usually seem to be related to special casing for performance reasons. Nothing that should impact a module's globals. Besides, Cython controls its own language usages of eval and exec. Stefan From stefan_ml at behnel.de Sat Jan 21 07:00:00 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sat, 21 Jan 2012 07:00:00 +0100 Subject: [Cython] Speedup module-level lookup In-Reply-To: References: Message-ID: <4F1A5460.6090908@behnel.de> Vitja Makarov, 19.01.2012 08:49: > 2012/1/19 Robert Bradshaw: >> On Wed, Jan 18, 2012 at 12:30 PM, Vitja Makarov wrote: >>> I tried to optimize module lookups (__pyx_m) by caching internal PyDict state. >>> >>> In this example bar() is 1.6 time faster (500us against 842us): >>> >>> C = 123 >>> def foo(a): >>> return C * adef bar(): >>> for i in range(10000): foo(i) >>> Here is proof of >>> concept:https://github.com/vitek/cython/commit/1d134fe54a74e6fc6d39d09973db499680b2a8d9 >>> >>> So the question is: does it worth it? >> >> I think the right thing to do here is make all module-level globals >> into "cdef public" attributes, i.e. C globals with getters and setters >> for Python space. I'm not sure whether this would best be done by >> creating a custom dict or module subclass, but it would probably be >> cleaner and afford much more than a 1.6x speedup. > > Yes, nice idea. > It's possible to subclass PyModuleObject and I didn't find any use of > PyModule_CheckExact() in CPython's sources: > > import types > import sys > > global_foo = 1234 > > class CustomModule(types.ModuleType): > def __init__(self, name): > types.ModuleType.__init__(self, name) > sys.modules[name] = self > > @property > def foo(self): > return global_foo > > @foo.setter > def foo(self, value): > global global_foo > global_foo = value > > CustomModule('foo') > > import foo > print foo.foo The one thing I don't currently see is how to get the module subtype instantiated in a safe and portable way. The normal way to create the module in Python 2.x is a call to Py_InitModule*(), which internally does a PyImport_AddModule(). We may get away with creating and registering the module object before calling into Py_InitModule*(), so that PyImport_AddModule() finds it there. At least, the internal checks on modules seem to use PyModule_Check() and not PyModule_CheckExact(), so someone seems to have already thought about this. In Python 3.x, the situation is different. There is no lookup involved and the module is always newly instantiated. That may mean that we have to copy the module creation code into Cython. But that doesn't look like a huge drawback (except for compatibility to potential future changes), because we already do most of the module initialisation ourselves anyway, especially now that we have CyFunction. I start feeling a bit like Linus Torvalds when he broke his minix installation and went: "ok, what else do I need to add to this terminal emulator in order to make it an operating system?" Stefan From sccolbert at gmail.com Sat Jan 21 07:09:49 2012 From: sccolbert at gmail.com (Chris Colbert) Date: Sat, 21 Jan 2012 00:09:49 -0600 Subject: [Cython] Speedup module-level lookup In-Reply-To: <4F1A53F6.2050906@behnel.de> References:

<4F1A53F6.2050906@behnel.de> Message-ID: On Fri, Jan 20, 2012 at 11:58 PM, Stefan Behnel wrote: > Chris Colbert, 19.01.2012 09:18: > > If it doesn't pass PyDict_CheckExact you won't be able to use it as the > > globals to eval or exec. > > What makes you say that? I tried and it worked for me, all the way back to > Python 2.4: > > Ah, you're right. I was mixing up issues I'd dealt with recently. The issue with the eval/exec is that the globals must be a dict (or dict subclass) whereas the locals can be a mapping. However, if you subclass dict for your globals, any overridden __getitem__ or __setitem__ will not be called. There is this line for eval/exec in ceval.c if (!PyDict_Check(globals)) { PyErr_SetString(PyExc_TypeError, "exec: arg 2 must be a dictionary or None"); return -1; } This allows the eval loop to use PyDict_GetItem on the globals instead of PyObject_GetItem, so no subclass method overrides will be called. -------------- next part -------------- An HTML attachment was scrubbed... URL: From robertwb at math.washington.edu Sat Jan 21 07:21:37 2012 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Fri, 20 Jan 2012 22:21:37 -0800 Subject: [Cython] Speedup module-level lookup In-Reply-To: <4F1A5460.6090908@behnel.de> References: <4F1A5460.6090908@behnel.de> Message-ID: On Fri, Jan 20, 2012 at 10:00 PM, Stefan Behnel wrote: > Vitja Makarov, 19.01.2012 08:49: >> 2012/1/19 Robert Bradshaw: >>> On Wed, Jan 18, 2012 at 12:30 PM, Vitja Makarov wrote: >>>> I tried to optimize module lookups (__pyx_m) by caching internal PyDict state. >>>> >>>> In this example bar() is 1.6 time faster (500us against 842us): >>>> >>>> C = 123 >>>> def foo(a): >>>> ? ? return C * adef bar(): >>>> ? ? for i in range(10000): ? ? ? ?foo(i) >>>> Here is proof of >>>> concept:https://github.com/vitek/cython/commit/1d134fe54a74e6fc6d39d09973db499680b2a8d9 >>>> >>>> So the question is: does it worth it? >>> >>> I think the right thing to do here is make all module-level globals >>> into "cdef public" attributes, i.e. C globals with getters and setters >>> for Python space. I'm not sure whether this would best be done by >>> creating a custom dict or module subclass, but it would probably be >>> cleaner and afford much more than a 1.6x speedup. >> >> Yes, nice idea. >> It's possible to subclass PyModuleObject and I didn't find any use of >> PyModule_CheckExact() in CPython's sources: >> >> import types >> import sys >> >> global_foo = 1234 >> >> class CustomModule(types.ModuleType): >> ? ? def __init__(self, name): >> ? ? ? ? types.ModuleType.__init__(self, name) >> ? ? ? ? sys.modules[name] = self >> >> ? ? @property >> ? ? def foo(self): >> ? ? ? ? return global_foo >> >> ? ? @foo.setter >> ? ? def foo(self, value): >> ? ? ? ? global global_foo >> ? ? ? ? global_foo = value >> >> CustomModule('foo') >> >> import foo >> print foo.foo > > The one thing I don't currently see is how to get the module subtype > instantiated in a safe and portable way. > > The normal way to create the module in Python 2.x is a call to > Py_InitModule*(), which internally does a PyImport_AddModule(). We may get > away with creating and registering the module object before calling into > Py_InitModule*(), so that PyImport_AddModule() finds it there. At least, > the internal checks on modules seem to use PyModule_Check() and not > PyModule_CheckExact(), so someone seems to have already thought about this. > > In Python 3.x, the situation is different. There is no lookup involved and > the module is always newly instantiated. That may mean that we have to copy > the module creation code into Cython. But that doesn't look like a huge > drawback (except for compatibility to potential future changes), because we > already do most of the module initialisation ourselves anyway, especially > now that we have CyFunction. Or swap out its ob_type pointer after it's created... It's going to be messy unless we can directly add hooks into its __dict__ though. > I start feeling a bit like Linus Torvalds when he broke his minix > installation and went: "ok, what else do I need to add to this terminal > emulator in order to make it an operating system?" > > Stefan > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel From vitja.makarov at gmail.com Sat Jan 21 09:35:44 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Sat, 21 Jan 2012 12:35:44 +0400 Subject: [Cython] Speedup module-level lookup In-Reply-To: <4F1A53F6.2050906@behnel.de> References:

<4F1A53F6.2050906@behnel.de> Message-ID: 2012/1/21 Stefan Behnel : > Chris Colbert, 19.01.2012 09:18: >> If it doesn't pass PyDict_CheckExact you won't be able to use it as the >> globals to eval or exec. > > What makes you say that? I tried and it worked for me, all the way back to > Python 2.4: > > -------------------- > Python 2.4.6 (#2, Jan 21 2010, 23:45:25) > [GCC 4.4.1] on linux2 > Type "help", "copyright", "credits" or "license" for more information. >>>> class MyDict(dict): pass >>>> eval('1+1', MyDict()) > 2 >>>> exec '1+1' in MyDict() >>>> > -------------------- > > I only see a couple of calls to PyDict_CheckExact() in CPython's sources > and they usually seem to be related to special casing for performance > reasons. Nothing that should impact a module's globals. > > Besides, Cython controls its own language usages of eval and exec. > Cool! It seems that python internally uses PyObject_GetItem() for module level lookups and not PyDict_GetItem(). Btw we use __Pyx_GetName() that calls PyObject_GetAttr() that isn't exactly the same for module lookups: # Works in Cython and doesn't work in Python print __class__ So we can override __getitem__() and __setitem__(): class MyDict(dict): def __init__(self): self._dict = {} def __getitem__(self, key): print '__getitem__', key return self._dict[key] def __setitem__(self, key, value): print '__setitem__', key, value self._dict[key] = value def __getattr__(self, key): print '__getattr__' d = MyDict() exec('x = 1; print x', d) eval('x', d) $ python foo.py __setitem__ x 1 __getitem__ x 1 __getitem__ x So we can make globals() return special dict with custom __setitem__()/__getitem__(). But it seems that we'll have to override many dict's standard methods like values(), update() and so on. That would be hard. -- vitja. From vitja.makarov at gmail.com Sat Jan 21 09:40:21 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Sat, 21 Jan 2012 12:40:21 +0400 Subject: [Cython] Speedup module-level lookup In-Reply-To: <4F1A5460.6090908@behnel.de> References: <4F1A5460.6090908@behnel.de> Message-ID: 2012/1/21 Stefan Behnel : > Vitja Makarov, 19.01.2012 08:49: >> 2012/1/19 Robert Bradshaw: >>> On Wed, Jan 18, 2012 at 12:30 PM, Vitja Makarov wrote: >>>> I tried to optimize module lookups (__pyx_m) by caching internal PyDict state. >>>> >>>> In this example bar() is 1.6 time faster (500us against 842us): >>>> >>>> C = 123 >>>> def foo(a): >>>> ? ? return C * adef bar(): >>>> ? ? for i in range(10000): ? ? ? ?foo(i) >>>> Here is proof of >>>> concept:https://github.com/vitek/cython/commit/1d134fe54a74e6fc6d39d09973db499680b2a8d9 >>>> >>>> So the question is: does it worth it? >>> >>> I think the right thing to do here is make all module-level globals >>> into "cdef public" attributes, i.e. C globals with getters and setters >>> for Python space. I'm not sure whether this would best be done by >>> creating a custom dict or module subclass, but it would probably be >>> cleaner and afford much more than a 1.6x speedup. >> >> Yes, nice idea. >> It's possible to subclass PyModuleObject and I didn't find any use of >> PyModule_CheckExact() in CPython's sources: >> >> import types >> import sys >> >> global_foo = 1234 >> >> class CustomModule(types.ModuleType): >> ? ? def __init__(self, name): >> ? ? ? ? types.ModuleType.__init__(self, name) >> ? ? ? ? sys.modules[name] = self >> >> ? ? @property >> ? ? def foo(self): >> ? ? ? ? return global_foo >> >> ? ? @foo.setter >> ? ? def foo(self, value): >> ? ? ? ? global global_foo >> ? ? ? ? global_foo = value >> >> CustomModule('foo') >> >> import foo >> print foo.foo > > The one thing I don't currently see is how to get the module subtype > instantiated in a safe and portable way. > We can do the same as types module: ModuleType = type(sys) or type(__builtins__) since we already got it (__pyx_b) > The normal way to create the module in Python 2.x is a call to > Py_InitModule*(), which internally does a PyImport_AddModule(). We may get > away with creating and registering the module object before calling into > Py_InitModule*(), so that PyImport_AddModule() finds it there. At least, > the internal checks on modules seem to use PyModule_Check() and not > PyModule_CheckExact(), so someone seems to have already thought about this. > > In Python 3.x, the situation is different. There is no lookup involved and > the module is always newly instantiated. That may mean that we have to copy > the module creation code into Cython. But that doesn't look like a huge > drawback (except for compatibility to potential future changes), because we > already do most of the module initialisation ourselves anyway, especially > now that we have CyFunction. > > I start feeling a bit like Linus Torvalds when he broke his minix > installation and went: "ok, what else do I need to add to this terminal > emulator in order to make it an operating system?" > -- vitja. From sccolbert at gmail.com Sat Jan 21 19:08:26 2012 From: sccolbert at gmail.com (Chris Colbert) Date: Sat, 21 Jan 2012 12:08:26 -0600 Subject: [Cython] Speedup module-level lookup In-Reply-To: References:

<4F1A53F6.2050906@behnel.de> Message-ID: On Sat, Jan 21, 2012 at 2:35 AM, Vitja Makarov wrote: > 2012/1/21 Stefan Behnel : > > Chris Colbert, 19.01.2012 09:18: > >> If it doesn't pass PyDict_CheckExact you won't be able to use it as the > >> globals to eval or exec. > > > > What makes you say that? I tried and it worked for me, all the way back > to > > Python 2.4: > > > > -------------------- > > Python 2.4.6 (#2, Jan 21 2010, 23:45:25) > > [GCC 4.4.1] on linux2 > > Type "help", "copyright", "credits" or "license" for more information. > >>>> class MyDict(dict): pass > >>>> eval('1+1', MyDict()) > > 2 > >>>> exec '1+1' in MyDict() > >>>> > > -------------------- > > > > I only see a couple of calls to PyDict_CheckExact() in CPython's sources > > and they usually seem to be related to special casing for performance > > reasons. Nothing that should impact a module's globals. > > > > Besides, Cython controls its own language usages of eval and exec. > > > > Cool! > It seems that python internally uses PyObject_GetItem() for module > level lookups and not PyDict_GetItem(). > Btw we use __Pyx_GetName() that calls PyObject_GetAttr() that isn't > exactly the same for module lookups: > > # Works in Cython and doesn't work in Python > print __class__ > > So we can override __getitem__() and __setitem__(): > class MyDict(dict): > def __init__(self): > self._dict = {} > > def __getitem__(self, key): > print '__getitem__', key > return self._dict[key] > > def __setitem__(self, key, value): > print '__setitem__', key, value > self._dict[key] = value > > def __getattr__(self, key): > print '__getattr__' > > d = MyDict() > exec('x = 1; print x', d) > eval('x', d) > $ python foo.py > __setitem__ x 1 > __getitem__ x > 1 > __getitem__ x > > > So we can make globals() return special dict with custom > __setitem__()/__getitem__(). But it seems that we'll have to override > many dict's standard methods like values(), update() and so on. That > would be hard. > > > Be careful. That only works because your dict subclass is being used as the locals as well. The LOAD_NAME opcode does a PyDict_CheckExact on the locals and will call PyDict_GetItem if true, PyObject_GetItem if False: case LOAD_NAME: w = GETITEM(names, oparg); if ((v = f->f_locals) == NULL) { PyErr_Format(PyExc_SystemError, "no locals when loading %s", PyObject_REPR(w)); why = WHY_EXCEPTION; break; } if (PyDict_CheckExact(v)) { x = PyDict_GetItem(v, w); Py_XINCREF(x); } else { x = PyObject_GetItem(v, w); if (x == NULL && PyErr_Occurred()) { if (!PyErr_ExceptionMatches( PyExc_KeyError)) break; PyErr_Clear(); } } You can see that the dict subclassing breaks down when you pass an empty dict as the locals: In [1]: class Foo(dict): ...: def __getitem__(self, name): ...: print 'get', name ...: return super(Foo, self).__getitem__(name) ...: In [2]: f = Foo(a=42) In [3]: eval('a', f) get a Out[3]: 42 In [4]: eval('a', f, {}) Out[4]: 42 -------------- next part -------------- An HTML attachment was scrubbed... URL: From vitja.makarov at gmail.com Sat Jan 21 19:43:28 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Sat, 21 Jan 2012 22:43:28 +0400 Subject: [Cython] Speedup module-level lookup In-Reply-To: References:

<4F1A53F6.2050906@behnel.de>

Message-ID: 2012/1/21 Chris Colbert : > > > On Sat, Jan 21, 2012 at 2:35 AM, Vitja Makarov > wrote: >> >> 2012/1/21 Stefan Behnel : >> > Chris Colbert, 19.01.2012 09:18: >> >> If it doesn't pass PyDict_CheckExact you won't be able to use it as the >> >> globals to eval or exec. >> > >> > What makes you say that? I tried and it worked for me, all the way back >> > to >> > Python 2.4: >> > >> > -------------------- >> > Python 2.4.6 (#2, Jan 21 2010, 23:45:25) >> > [GCC 4.4.1] on linux2 >> > Type "help", "copyright", "credits" or "license" for more information. >> >>>> class MyDict(dict): pass >> >>>> eval('1+1', MyDict()) >> > 2 >> >>>> exec '1+1' in MyDict() >> >>>> >> > -------------------- >> > >> > I only see a couple of calls to PyDict_CheckExact() in CPython's sources >> > and they usually seem to be related to special casing for performance >> > reasons. Nothing that should impact a module's globals. >> > >> > Besides, Cython controls its own language usages of eval and exec. >> > >> >> Cool! >> It seems that python internally uses PyObject_GetItem() for module >> level lookups and not PyDict_GetItem(). >> Btw we use __Pyx_GetName() that calls PyObject_GetAttr() that isn't >> exactly the same for module lookups: >> >> # Works in Cython and doesn't work in Python >> print __class__ >> >> So we can override __getitem__() and __setitem__(): >> class MyDict(dict): >> ? ?def __init__(self): >> ? ? ? ?self._dict = {} >> >> ? ?def __getitem__(self, key): >> ? ? ? ?print '__getitem__', key >> ? ? ? ?return self._dict[key] >> >> ? ?def __setitem__(self, key, value): >> ? ? ? ?print '__setitem__', key, value >> ? ? ? ?self._dict[key] = value >> >> ? ?def __getattr__(self, key): >> ? ? ? ?print '__getattr__' >> >> d = MyDict() >> exec('x = 1; print x', d) >> eval('x', d) >> $ python foo.py >> __setitem__ x 1 >> __getitem__ x >> 1 >> __getitem__ x >> >> >> So we can make globals() return special dict with custom >> __setitem__()/__getitem__(). But it seems that we'll have to override >> many dict's standard methods like values(), update() and so on. That >> would be hard. >> >> > > Be careful. That only works because your dict subclass is being used as the > locals as well. The LOAD_NAME opcode does a PyDict_CheckExact on the locals > and will call PyDict_GetItem if true, PyObject_GetItem if False: > > case LOAD_NAME: > w = GETITEM(names, oparg); > if ((v = f->f_locals) == NULL) { > PyErr_Format(PyExc_SystemError, > "no locals when loading %s", > PyObject_REPR(w)); > why = WHY_EXCEPTION; > break; > } > if (PyDict_CheckExact(v)) { > x = PyDict_GetItem(v, w); > Py_XINCREF(x); > } > else { > x = PyObject_GetItem(v, w); > if (x == NULL && PyErr_Occurred()) { > if (!PyErr_ExceptionMatches( > PyExc_KeyError)) > break; > PyErr_Clear(); > } > > } > > > You can see that the dict subclassing breaks down when you pass an empty > dict as the locals: > > In [1]: class Foo(dict): ...: def __getitem__(self, name): ...: print 'get', > name ...: return super(Foo, self).__getitem__(name) ...: In [2]: f = > Foo(a=42) In [3]: eval('a', f) get a Out[3]: 42 In [4]: eval('a', f, {}) > Out[4]: 42 > > Nice catch! It seems that globals MUST be a real dict. >>> help(eval) eval(...) eval(source[, globals[, locals]]) -> value Evaluate the source in the context of globals and locals. The source may be a string representing a Python expression or a code object as returned by compile(). The globals must be a dictionary and locals can be any mapping, defaulting to the current globals and locals. If only globals is given, locals defaults to it. -- vitja. From stefan_ml at behnel.de Sat Jan 21 19:50:42 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sat, 21 Jan 2012 19:50:42 +0100 Subject: [Cython] AddTraceback() slows down generators Message-ID: <4F1B0902.1050903@behnel.de> Hi, I did some callgrind profiling on Cython's generators and was surprised to find that AddTraceback() represents a serious performance penalty for short running generators. I profiled a compiled Python implementation of itertools.groupby(), which yields (key, group) tuples where the group is an iterator again. I ran this code in Python for benchmarking: """ L = sorted(range(1000)*5) all(list(g) for k,g in groupby(L)) """ Groups tend to be rather short in real code, often just one or a couple of items, so unpacking the group iterator into a list will usually be a quick loop and then the generator raises StopIteration on termination and builds a traceback for it. According to callgrind (which, I should note, tends to overestimate the amount of time spent in memory allocation), the iteration during the group unpacking takes about 30% of the overall runtime of the all() loop, and the AddTraceback() call at the end of each group traversal takes up to 25% (!) on my side. That means that more than 80% of the group unpacking time goes into raising StopIteration from the generators. I attached the call graph with the relative timings. About half of the exception raising time is eaten by PyString_FromFormat() that builds the function-name + line-position string (which, I may note, is basically a convenience feature). This string is a constant for a generator's StopIteration exception, at least for each final return point in a generator, but here it is being recreated over and over again, for each exception that gets raised. Even if we keep creating a new frame instance each time (which should be ok because CPython has a frame instance cache already and we'd only create one during the generator lifetime), the whole code object could actually be cached after the first creation, preferably bound to the lifetime of the generator creator function/method. Or, more generally, one code object per generator termination point, which will be a single point in the majority of cases. For the specific code above, that should shave off almost 20% of the overall runtime of the all() loop. I think that's totally worth doing. Stefan -------------- next part -------------- A non-text attachment was scrubbed... Name: callgraph2.png Type: image/png Size: 22440 bytes Desc: not available URL: From d.s.seljebotn at astro.uio.no Sat Jan 21 22:16:20 2012 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Sat, 21 Jan 2012 22:16:20 +0100 Subject: [Cython] AddTraceback() slows down generators In-Reply-To: <4F1B0902.1050903@behnel.de> References: <4F1B0902.1050903@behnel.de> Message-ID: <4F1B2B24.8040302@astro.uio.no> On 01/21/2012 07:50 PM, Stefan Behnel wrote: > Hi, > > I did some callgrind profiling on Cython's generators and was surprised to > find that AddTraceback() represents a serious performance penalty for short > running generators. > > I profiled a compiled Python implementation of itertools.groupby(), which > yields (key, group) tuples where the group is an iterator again. I ran this > code in Python for benchmarking: > > """ > L = sorted(range(1000)*5) > > all(list(g) for k,g in groupby(L)) > """ > > Groups tend to be rather short in real code, often just one or a couple of > items, so unpacking the group iterator into a list will usually be a quick > loop and then the generator raises StopIteration on termination and builds > a traceback for it. According to callgrind (which, I should note, tends to > overestimate the amount of time spent in memory allocation), the iteration > during the group unpacking takes about 30% of the overall runtime of the > all() loop, and the AddTraceback() call at the end of each group traversal > takes up to 25% (!) on my side. That means that more than 80% of the group > unpacking time goes into raising StopIteration from the generators. I > attached the call graph with the relative timings. OT: Since you complain that callgrind is inaccurate; are you aware of sampling profilers, such as Google perftools? (I don't have experience with callgrind myself) http://google-perftools.googlecode.com/svn/trunk/doc/cpuprofile.html http://pypi.python.org/pypi/yep Dag > > About half of the exception raising time is eaten by PyString_FromFormat() > that builds the function-name + line-position string (which, I may note, is > basically a convenience feature). This string is a constant for a > generator's StopIteration exception, at least for each final return point > in a generator, but here it is being recreated over and over again, for > each exception that gets raised. > > Even if we keep creating a new frame instance each time (which should be ok > because CPython has a frame instance cache already and we'd only create one > during the generator lifetime), the whole code object could actually be > cached after the first creation, preferably bound to the lifetime of the > generator creator function/method. Or, more generally, one code object per > generator termination point, which will be a single point in the majority > of cases. For the specific code above, that should shave off almost 20% of > the overall runtime of the all() loop. > > I think that's totally worth doing. > > Stefan > > > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel From robertwb at math.washington.edu Sat Jan 21 23:09:40 2012 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Sat, 21 Jan 2012 14:09:40 -0800 Subject: [Cython] AddTraceback() slows down generators In-Reply-To: <4F1B0902.1050903@behnel.de> References: <4F1B0902.1050903@behnel.de> Message-ID: On Sat, Jan 21, 2012 at 10:50 AM, Stefan Behnel wrote: > Hi, > > I did some callgrind profiling on Cython's generators and was surprised to > find that AddTraceback() represents a serious performance penalty for short > running generators. > > I profiled a compiled Python implementation of itertools.groupby(), which > yields (key, group) tuples where the group is an iterator again. I ran this > code in Python for benchmarking: > > """ > L = sorted(range(1000)*5) > > all(list(g) for k,g in groupby(L)) > """ > > Groups tend to be rather short in real code, often just one or a couple of > items, so unpacking the group iterator into a list will usually be a quick > loop and then the generator raises StopIteration on termination and builds > a traceback for it. According to callgrind (which, I should note, tends to > overestimate the amount of time spent in memory allocation), the iteration > during the group unpacking takes about 30% of the overall runtime of the > all() loop, and the AddTraceback() call at the end of each group traversal > takes up to 25% (!) on my side. That means that more than 80% of the group > unpacking time goes into raising StopIteration from the generators. I > attached the call graph with the relative timings. > > About half of the exception raising time is eaten by PyString_FromFormat() > that builds the function-name + line-position string (which, I may note, is > basically a convenience feature). This string is a constant for a > generator's StopIteration exception, at least for each final return point > in a generator, but here it is being recreated over and over again, for > each exception that gets raised. > > Even if we keep creating a new frame instance each time (which should be ok > because CPython has a frame instance cache already and we'd only create one > during the generator lifetime), the whole code object could actually be > cached after the first creation, preferably bound to the lifetime of the > generator creator function/method. Or, more generally, one code object per > generator termination point, which will be a single point in the majority > of cases. For the specific code above, that should shave off almost 20% of > the overall runtime of the all() loop. > > I think that's totally worth doing. Makes sense to me. I did some caching like this for profiling. - Robert From markflorisson88 at gmail.com Mon Jan 23 11:27:12 2012 From: markflorisson88 at gmail.com (mark florisson) Date: Mon, 23 Jan 2012 10:27:12 +0000 Subject: [Cython] 0.16 release Message-ID: Hey, It's been almost three months since we talked about a 0.16 release, I think it's quite ready. It would already be a big release, it would be good to see how people like it, and to catch any issues etc before we pile on more features. Mark From konrad.hinsen at fastmail.net Tue Jan 24 12:37:50 2012 From: konrad.hinsen at fastmail.net (Konrad Hinsen) Date: Tue, 24 Jan 2012 12:37:50 +0100 Subject: [Cython] Bug in Cython producing incorrect C code Message-ID: <1327405070.15017.140661027320813@webmail.messagingengine.com> Compiling the attached Cython file produced the attached C file which has errors in lines 532-534: __pyx_v_self->xx = None; __pyx_v_self->yy = None; __pyx_v_self->zz = None; There is no C symbol "None", so this doesn't compile. I first noticed the bug in Cython 0.15, but it's still in the latest revision from Github. Konrad. -------------- next part -------------- A non-text attachment was scrubbed... Name: bug.pyx Type: application/octet-stream Size: 147 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: bug.c.gz Type: application/x-gzip Size: 10310 bytes Desc: not available URL: From markflorisson88 at gmail.com Tue Jan 24 14:53:20 2012 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 24 Jan 2012 13:53:20 +0000 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: <1327405070.15017.140661027320813@webmail.messagingengine.com> References: <1327405070.15017.140661027320813@webmail.messagingengine.com> Message-ID: On 24 January 2012 11:37, Konrad Hinsen wrote: > Compiling the attached Cython file produced the attached C file which > has errors in lines 532-534: > > ?__pyx_v_self->xx = None; > ?__pyx_v_self->yy = None; > ?__pyx_v_self->zz = None; > > There is no C symbol "None", so this doesn't compile. > > I first noticed the bug in Cython 0.15, but it's still in the latest > revision from Github. > > Konrad. > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel > Hm, it seems the problem is that the call to the builtin float results in SimpleCallNode being replaced with PythonCApiNode, which then generates the result code, but the list of coerced nodes are CloneNodes of the original rhs, and CloneNode does not generate the result code of the original rhs (i.e. allocate and assign to a temp), which results in a None result. Maybe CascadedAssignmentNode should replace CloneNode.arg with the latest self.rhs in generate_assignment_code? I'm not entirely sure. From vitja.makarov at gmail.com Tue Jan 24 15:09:04 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Tue, 24 Jan 2012 18:09:04 +0400 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: References: <1327405070.15017.140661027320813@webmail.messagingengine.com> Message-ID: 2012/1/24 mark florisson : > On 24 January 2012 11:37, Konrad Hinsen wrote: >> Compiling the attached Cython file produced the attached C file which >> has errors in lines 532-534: >> >> ?__pyx_v_self->xx = None; >> ?__pyx_v_self->yy = None; >> ?__pyx_v_self->zz = None; >> >> There is no C symbol "None", so this doesn't compile. >> >> I first noticed the bug in Cython 0.15, but it's still in the latest >> revision from Github. >> >> Konrad. >> >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel >> > > Hm, it seems the problem is that the call to the builtin float results > in SimpleCallNode being replaced with PythonCApiNode, which then > generates the result code, but the list of coerced nodes are > CloneNodes of the original rhs, and CloneNode does not generate the > result code of the original rhs (i.e. allocate and assign to a temp), > which results in a None result. > > Maybe CascadedAssignmentNode should replace CloneNode.arg with the > latest self.rhs in generate_assignment_code? I'm not entirely sure. May be it's better to run OptimizeBuiltinCalls before AnalyseExpressionsTransform? I have a patch that initializes NameNode's entry at ControlFlowAnalysis stage. -- vitja. From robertwb at math.washington.edu Tue Jan 24 18:36:31 2012 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 24 Jan 2012 09:36:31 -0800 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: References: <1327405070.15017.140661027320813@webmail.messagingengine.com> Message-ID: On Tue, Jan 24, 2012 at 6:09 AM, Vitja Makarov wrote: > 2012/1/24 mark florisson : >> On 24 January 2012 11:37, Konrad Hinsen wrote: >>> Compiling the attached Cython file produced the attached C file which >>> has errors in lines 532-534: >>> >>> ?__pyx_v_self->xx = None; >>> ?__pyx_v_self->yy = None; >>> ?__pyx_v_self->zz = None; >>> >>> There is no C symbol "None", so this doesn't compile. >>> >>> I first noticed the bug in Cython 0.15, but it's still in the latest >>> revision from Github. >>> >>> Konrad. >>> >>> _______________________________________________ >>> cython-devel mailing list >>> cython-devel at python.org >>> http://mail.python.org/mailman/listinfo/cython-devel >>> >> >> Hm, it seems the problem is that the call to the builtin float results >> in SimpleCallNode being replaced with PythonCApiNode, which then >> generates the result code, but the list of coerced nodes are >> CloneNodes of the original rhs, and CloneNode does not generate the >> result code of the original rhs (i.e. allocate and assign to a temp), >> which results in a None result. >> >> Maybe CascadedAssignmentNode should replace CloneNode.arg with the >> latest self.rhs in generate_assignment_code? I'm not entirely sure. > > > May be it's better to run OptimizeBuiltinCalls before > AnalyseExpressionsTransform? Doesn't OptimizeBuiltinCalls take advantage of type information? From vitja.makarov at gmail.com Tue Jan 24 19:30:43 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Tue, 24 Jan 2012 22:30:43 +0400 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: References: <1327405070.15017.140661027320813@webmail.messagingengine.com>

Message-ID: 2012/1/24 Robert Bradshaw : > On Tue, Jan 24, 2012 at 6:09 AM, Vitja Makarov wrote: >> 2012/1/24 mark florisson : >>> On 24 January 2012 11:37, Konrad Hinsen wrote: >>>> Compiling the attached Cython file produced the attached C file which >>>> has errors in lines 532-534: >>>> >>>> ?__pyx_v_self->xx = None; >>>> ?__pyx_v_self->yy = None; >>>> ?__pyx_v_self->zz = None; >>>> >>>> There is no C symbol "None", so this doesn't compile. >>>> >>>> I first noticed the bug in Cython 0.15, but it's still in the latest >>>> revision from Github. >>>> >>>> Konrad. >>>> >>>> _______________________________________________ >>>> cython-devel mailing list >>>> cython-devel at python.org >>>> http://mail.python.org/mailman/listinfo/cython-devel >>>> >>> >>> Hm, it seems the problem is that the call to the builtin float results >>> in SimpleCallNode being replaced with PythonCApiNode, which then >>> generates the result code, but the list of coerced nodes are >>> CloneNodes of the original rhs, and CloneNode does not generate the >>> result code of the original rhs (i.e. allocate and assign to a temp), >>> which results in a None result. >>> >>> Maybe CascadedAssignmentNode should replace CloneNode.arg with the >>> latest self.rhs in generate_assignment_code? I'm not entirely sure. Seems like a hack to me. >> >> >> May be it's better to run OptimizeBuiltinCalls before >> AnalyseExpressionsTransform? > > Doesn't OptimizeBuiltinCalls take advantage of type information? Yes, it does :( So as Mark said the problem is CascadedAssignmentNode.coerced_rhs_list is created before rhs is updated. -- vitja. From markflorisson88 at gmail.com Tue Jan 24 19:51:37 2012 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 24 Jan 2012 18:51:37 +0000 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: References: <1327405070.15017.140661027320813@webmail.messagingengine.com>

Message-ID: On 24 January 2012 18:30, Vitja Makarov wrote: > 2012/1/24 Robert Bradshaw : >> On Tue, Jan 24, 2012 at 6:09 AM, Vitja Makarov wrote: >>> 2012/1/24 mark florisson : >>>> On 24 January 2012 11:37, Konrad Hinsen wrote: >>>>> Compiling the attached Cython file produced the attached C file which >>>>> has errors in lines 532-534: >>>>> >>>>> ?__pyx_v_self->xx = None; >>>>> ?__pyx_v_self->yy = None; >>>>> ?__pyx_v_self->zz = None; >>>>> >>>>> There is no C symbol "None", so this doesn't compile. >>>>> >>>>> I first noticed the bug in Cython 0.15, but it's still in the latest >>>>> revision from Github. >>>>> >>>>> Konrad. >>>>> >>>>> _______________________________________________ >>>>> cython-devel mailing list >>>>> cython-devel at python.org >>>>> http://mail.python.org/mailman/listinfo/cython-devel >>>>> >>>> >>>> Hm, it seems the problem is that the call to the builtin float results >>>> in SimpleCallNode being replaced with PythonCApiNode, which then >>>> generates the result code, but the list of coerced nodes are >>>> CloneNodes of the original rhs, and CloneNode does not generate the >>>> result code of the original rhs (i.e. allocate and assign to a temp), >>>> which results in a None result. >>>> >>>> Maybe CascadedAssignmentNode should replace CloneNode.arg with the >>>> latest self.rhs in generate_assignment_code? I'm not entirely sure. > > Seems like a hack to me. > >>> >>> >>> May be it's better to run OptimizeBuiltinCalls before >>> AnalyseExpressionsTransform? >> >> Doesn't OptimizeBuiltinCalls take advantage of type information? > > Yes, it does :( > > So as Mark said the problem is CascadedAssignmentNode.coerced_rhs_list > is created before rhs is updated. > I think deferring the CloneNode creation to code generation time works (are there any known problem with doing type coercions at code generation time?). E.g. save 'env' during analyse_types and in generate_assignment_code do rhs = CloneNode(self.rhs).coerce_to(lhs.type, self.env) rhs.generate_evaluation_code(code) lhs.generate_assignment_code(rhs, code) Seems to work. > > -- > vitja. > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel From vitja.makarov at gmail.com Tue Jan 24 20:05:48 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Tue, 24 Jan 2012 23:05:48 +0400 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: References: <1327405070.15017.140661027320813@webmail.messagingengine.com>

Message-ID: 2012/1/24 mark florisson : > On 24 January 2012 18:30, Vitja Makarov wrote: >> 2012/1/24 Robert Bradshaw : >>> On Tue, Jan 24, 2012 at 6:09 AM, Vitja Makarov wrote: >>>> 2012/1/24 mark florisson : >>>>> On 24 January 2012 11:37, Konrad Hinsen wrote: >>>>>> Compiling the attached Cython file produced the attached C file which >>>>>> has errors in lines 532-534: >>>>>> >>>>>> ?__pyx_v_self->xx = None; >>>>>> ?__pyx_v_self->yy = None; >>>>>> ?__pyx_v_self->zz = None; >>>>>> >>>>>> There is no C symbol "None", so this doesn't compile. >>>>>> >>>>>> I first noticed the bug in Cython 0.15, but it's still in the latest >>>>>> revision from Github. >>>>>> >>>>>> Konrad. >>>>>> >>>>>> _______________________________________________ >>>>>> cython-devel mailing list >>>>>> cython-devel at python.org >>>>>> http://mail.python.org/mailman/listinfo/cython-devel >>>>>> >>>>> >>>>> Hm, it seems the problem is that the call to the builtin float results >>>>> in SimpleCallNode being replaced with PythonCApiNode, which then >>>>> generates the result code, but the list of coerced nodes are >>>>> CloneNodes of the original rhs, and CloneNode does not generate the >>>>> result code of the original rhs (i.e. allocate and assign to a temp), >>>>> which results in a None result. >>>>> >>>>> Maybe CascadedAssignmentNode should replace CloneNode.arg with the >>>>> latest self.rhs in generate_assignment_code? I'm not entirely sure. >> >> Seems like a hack to me. >> >>>> >>>> >>>> May be it's better to run OptimizeBuiltinCalls before >>>> AnalyseExpressionsTransform? >>> >>> Doesn't OptimizeBuiltinCalls take advantage of type information? >> >> Yes, it does :( >> >> So as Mark said the problem is CascadedAssignmentNode.coerced_rhs_list >> is created before rhs is updated. >> > > I think deferring the CloneNode creation to code generation time works > (are there any known problem with doing type coercions at code > generation time?). Coercion errors at code generation time? > E.g. save 'env' during analyse_types and in > generate_assignment_code do > > ? ?rhs = CloneNode(self.rhs).coerce_to(lhs.type, self.env) > ? ?rhs.generate_evaluation_code(code) > ? ?lhs.generate_assignment_code(rhs, code) > > Seems to work. > Yeah, that's better. -- vitja. From d.s.seljebotn at astro.uio.no Tue Jan 24 20:18:06 2012 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Tue, 24 Jan 2012 20:18:06 +0100 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: References: <1327405070.15017.140661027320813@webmail.messagingengine.com>

Message-ID: <4F1F03EE.5000504@astro.uio.no> On 01/24/2012 08:05 PM, Vitja Makarov wrote: > 2012/1/24 mark florisson: >> On 24 January 2012 18:30, Vitja Makarov wrote: >>> 2012/1/24 Robert Bradshaw: >>>> On Tue, Jan 24, 2012 at 6:09 AM, Vitja Makarov wrote: >>>>> 2012/1/24 mark florisson: >>>>>> On 24 January 2012 11:37, Konrad Hinsen wrote: >>>>>>> Compiling the attached Cython file produced the attached C file which >>>>>>> has errors in lines 532-534: >>>>>>> >>>>>>> __pyx_v_self->xx = None; >>>>>>> __pyx_v_self->yy = None; >>>>>>> __pyx_v_self->zz = None; >>>>>>> >>>>>>> There is no C symbol "None", so this doesn't compile. >>>>>>> >>>>>>> I first noticed the bug in Cython 0.15, but it's still in the latest >>>>>>> revision from Github. >>>>>>> >>>>>>> Konrad. >>>>>>> >>>>>>> _______________________________________________ >>>>>>> cython-devel mailing list >>>>>>> cython-devel at python.org >>>>>>> http://mail.python.org/mailman/listinfo/cython-devel >>>>>>> >>>>>> >>>>>> Hm, it seems the problem is that the call to the builtin float results >>>>>> in SimpleCallNode being replaced with PythonCApiNode, which then >>>>>> generates the result code, but the list of coerced nodes are >>>>>> CloneNodes of the original rhs, and CloneNode does not generate the >>>>>> result code of the original rhs (i.e. allocate and assign to a temp), >>>>>> which results in a None result. >>>>>> >>>>>> Maybe CascadedAssignmentNode should replace CloneNode.arg with the >>>>>> latest self.rhs in generate_assignment_code? I'm not entirely sure. >>> >>> Seems like a hack to me. >>> >>>>> >>>>> >>>>> May be it's better to run OptimizeBuiltinCalls before >>>>> AnalyseExpressionsTransform? >>>> >>>> Doesn't OptimizeBuiltinCalls take advantage of type information? >>> >>> Yes, it does :( >>> >>> So as Mark said the problem is CascadedAssignmentNode.coerced_rhs_list >>> is created before rhs is updated. >>> >> >> I think deferring the CloneNode creation to code generation time works >> (are there any known problem with doing type coercions at code >> generation time?). > > Coercion errors at code generation time? Apologies up front for raising my voice, as my knowledge of the internals are getting so rusty...take this with a grain of salt. I'm +1 on working towards having the code generation phase be pure code generation. I did some refactorings to take mini-steps towards that once upon a time, moving some error conditions to before code generation. My preferred approach would be to do away with CascadedAssignmentNode at the parse tree stage: a = b = c = expr goes to tmp = expr c = tmp b = tmp a = tmp and so on. Of course it gets messier; (expr1)[expr2] = (expr3).attr = expr4 But apart from getting the time of evaluating each expression right the transform should be straightforward. One of the tempnodes/"let"-nodes (I forgot which one, or if they've been consolidated) should be able to fix this. Takes some more work though than a quick hack though... Dag > >> E.g. save 'env' during analyse_types and in >> generate_assignment_code do >> >> rhs = CloneNode(self.rhs).coerce_to(lhs.type, self.env) >> rhs.generate_evaluation_code(code) >> lhs.generate_assignment_code(rhs, code) >> >> Seems to work. >> > > Yeah, that's better. > > From markflorisson88 at gmail.com Tue Jan 24 21:28:54 2012 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 24 Jan 2012 20:28:54 +0000 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: <4F1F03EE.5000504@astro.uio.no> References: <1327405070.15017.140661027320813@webmail.messagingengine.com>

<4F1F03EE.5000504@astro.uio.no> Message-ID: On 24 January 2012 19:18, Dag Sverre Seljebotn wrote: > On 01/24/2012 08:05 PM, Vitja Makarov wrote: >> >> 2012/1/24 mark florisson: >>> >>> On 24 January 2012 18:30, Vitja Makarov ?wrote: >>>> >>>> 2012/1/24 Robert Bradshaw: >>>>> >>>>> On Tue, Jan 24, 2012 at 6:09 AM, Vitja Makarov >>>>> ?wrote: >>>>>> >>>>>> 2012/1/24 mark florisson: >>>>>>> >>>>>>> On 24 January 2012 11:37, Konrad Hinsen >>>>>>> ?wrote: >>>>>>>> >>>>>>>> Compiling the attached Cython file produced the attached C file >>>>>>>> which >>>>>>>> has errors in lines 532-534: >>>>>>>> >>>>>>>> ?__pyx_v_self->xx = None; >>>>>>>> ?__pyx_v_self->yy = None; >>>>>>>> ?__pyx_v_self->zz = None; >>>>>>>> >>>>>>>> There is no C symbol "None", so this doesn't compile. >>>>>>>> >>>>>>>> I first noticed the bug in Cython 0.15, but it's still in the latest >>>>>>>> revision from Github. >>>>>>>> >>>>>>>> Konrad. >>>>>>>> >>>>>>>> _______________________________________________ >>>>>>>> cython-devel mailing list >>>>>>>> cython-devel at python.org >>>>>>>> http://mail.python.org/mailman/listinfo/cython-devel >>>>>>>> >>>>>>> >>>>>>> Hm, it seems the problem is that the call to the builtin float >>>>>>> results >>>>>>> in SimpleCallNode being replaced with PythonCApiNode, which then >>>>>>> generates the result code, but the list of coerced nodes are >>>>>>> CloneNodes of the original rhs, and CloneNode does not generate the >>>>>>> result code of the original rhs (i.e. allocate and assign to a temp), >>>>>>> which results in a None result. >>>>>>> >>>>>>> Maybe CascadedAssignmentNode should replace CloneNode.arg with the >>>>>>> latest self.rhs in generate_assignment_code? I'm not entirely sure. >>>> >>>> >>>> Seems like a hack to me. >>>> >>>>>> >>>>>> >>>>>> May be it's better to run OptimizeBuiltinCalls before >>>>>> AnalyseExpressionsTransform? >>>>> >>>>> >>>>> Doesn't OptimizeBuiltinCalls take advantage of type information? >>>> >>>> >>>> Yes, it does :( >>>> >>>> So as Mark said the problem is CascadedAssignmentNode.coerced_rhs_list >>>> is created before rhs is updated. >>>> >>> >>> I think deferring the CloneNode creation to code generation time works >>> (are there any known problem with doing type coercions at code >>> generation time?). >> >> >> Coercion errors at code generation time? > > > Apologies up front for raising my voice, as my knowledge of the internals > are getting so rusty...take this with a grain of salt. > > I'm +1 on working towards having the code generation phase be pure code > generation. I did some refactorings to take mini-steps towards that once > upon a time, moving some error conditions to before code generation. > > My preferred approach would be to do away with CascadedAssignmentNode at the > parse tree stage: > > a = b = c = expr > > goes to > > tmp = expr > c = tmp > b = tmp > a = tmp > > and so on. Of course it gets messier; > > (expr1)[expr2] = (expr3).attr = expr4 > > But apart from getting the time of evaluating each expression right the > transform should be straightforward. One of the tempnodes/"let"-nodes (I > forgot which one, or if they've been consolidated) should be able to fix > this. > > Takes some more work though than a quick hack though... > > Dag > In principle it was doing the same thing, apart from the actual rewrite. I suppose the replacement problem can also be circumvented by manually wrapping self.rhs in a CoerceToTempNode. The problem with coerce_to_temp is that it does not create this node if the result is already in a temp. Creating it manually does mean an extra useless assignment, but it is an easy fix which happens at analyse_types time. Instead we could also use another node that just proxies a few things like generate_result_code and the result method. I like the idea though, it would be nice to only handle things in SingleAssignmentNode. I recently added broadcasting (inserting leading dimensions) and scalar assignment to memoryviews, and you can only catch that at the assignment point. Currently it only supports single assignments as the functionality is only in SingleAssignmentNode. I must say though, the following would look a bit weird: a = b[:] = c[:, :] = d as you always expect a kind of "cascade", e.g. you expect c[:, :] to be assignable to b[:], or 'a', but none of that may be true at all. So I'm fine with disallowing that, I think people should only use cascaded assignment for variables. >> >>> E.g. save 'env' during analyse_types and in >>> generate_assignment_code do >>> >>> ? ?rhs = CloneNode(self.rhs).coerce_to(lhs.type, self.env) >>> ? ?rhs.generate_evaluation_code(code) >>> ? ?lhs.generate_assignment_code(rhs, code) >>> >>> Seems to work. >>> >> >> Yeah, that's better. >> >> > > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel From markflorisson88 at gmail.com Tue Jan 24 21:58:13 2012 From: markflorisson88 at gmail.com (mark florisson) Date: Tue, 24 Jan 2012 20:58:13 +0000 Subject: [Cython] inline defnode calls Message-ID: I just noticed the inline defnode call code. When I try to compile with 'cython -Xoptimize.inline_defnode_calls=True test.pyx' with the following code: def foo(x): print foo foo(10) I get Error compiling Cython file: ------------------------------------------------------------ ... def foo(x): print x foo(10) ^ ------------------------------------------------------------ test.pyx:4:3: Compiler crash in InlineDefNodeCalls ModuleNode.body = StatListNode(test.pyx:1:0) StatListNode.stats[2] = ExprStatNode(test.pyx:4:3) ExprStatNode.expr = SimpleCallNode(test.pyx:4:3, result_is_used = True, use_managed_ref = True) Compiler crash traceback from this point on: File "/Users/mark/cy/Cython/Compiler/Visitor.py", line 176, in _visitchild result = handler_method(child) File "/Users/mark/cy/Cython/Compiler/Optimize.py", line 1656, in visit_SimpleCallNode if not function_name.cf_state.is_single: AttributeError: 'NoneType' object has no attribute 'is_single' From robertwb at math.washington.edu Wed Jan 25 02:27:42 2012 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Tue, 24 Jan 2012 17:27:42 -0800 Subject: [Cython] 0.16 release In-Reply-To: References: Message-ID: On Mon, Jan 23, 2012 at 2:27 AM, mark florisson wrote: > Hey, > > It's been almost three months since we talked about a 0.16 release, I > think it's quite ready. It would already be a big release, it would be > good to see how people like it, and to catch any issues etc before we > pile on more features. I would love to do a release soon. Last time this came up, I think the big issue was (compilation) performance regression. Has this been adequately addressed? The other issue is that there are a couple of doctest failures with Sage. One source of problems is decorators due to the (ugly) disallowing of function re-declarations, I'll try look into this one. There are also a huge number of segfaults (see the bottom of https://sage.math.washington.edu:8091/hudson/view/ext-libs/job/sage-tests/lastSuccessfulBuild/artifact/log.txt ) which we need to get to the bottom of. - Robert From vitja.makarov at gmail.com Wed Jan 25 07:49:52 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Wed, 25 Jan 2012 10:49:52 +0400 Subject: [Cython] inline defnode calls In-Reply-To: References: Message-ID: 2012/1/25 mark florisson : > I just noticed the inline defnode call code. When I try to compile > with 'cython -Xoptimize.inline_defnode_calls=True test.pyx' with the > following code: > > def foo(x): print foo > foo(10) > > I get > > Error compiling Cython file: > ------------------------------------------------------------ > ... > def foo(x): > ? ?print x > > foo(10) > ?^ > ------------------------------------------------------------ > > test.pyx:4:3: Compiler crash in InlineDefNodeCalls > > ModuleNode.body = StatListNode(test.pyx:1:0) > StatListNode.stats[2] = ExprStatNode(test.pyx:4:3) > ExprStatNode.expr = SimpleCallNode(test.pyx:4:3, > ? ?result_is_used = True, > ? ?use_managed_ref = True) > > Compiler crash traceback from this point on: > ?File "/Users/mark/cy/Cython/Compiler/Visitor.py", line 176, in _visitchild > ? ?result = handler_method(child) > ?File "/Users/mark/cy/Cython/Compiler/Optimize.py", line 1656, in > visit_SimpleCallNode > ? ?if not function_name.cf_state.is_single: > AttributeError: 'NoneType' object has no attribute 'is_single' Thanks for the report! The feature is still experimental and by default is disabled. Anyway it wouldn't work for your example. It works when we know what exactly function is referred by the name so it's closure case: def foo(): def bar(): pass bar() -- vitja. From vitja.makarov at gmail.com Wed Jan 25 07:59:43 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Wed, 25 Jan 2012 10:59:43 +0400 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: References: <1327405070.15017.140661027320813@webmail.messagingengine.com>

<4F1F03EE.5000504@astro.uio.no> Message-ID: 2012/1/25 mark florisson : > On 24 January 2012 19:18, Dag Sverre Seljebotn > wrote: >> On 01/24/2012 08:05 PM, Vitja Makarov wrote: >>> >>> 2012/1/24 mark florisson: >>>> >>>> On 24 January 2012 18:30, Vitja Makarov ?wrote: >>>>> >>>>> 2012/1/24 Robert Bradshaw: >>>>>> >>>>>> On Tue, Jan 24, 2012 at 6:09 AM, Vitja Makarov >>>>>> ?wrote: >>>>>>> >>>>>>> 2012/1/24 mark florisson: >>>>>>>> >>>>>>>> On 24 January 2012 11:37, Konrad Hinsen >>>>>>>> ?wrote: >>>>>>>>> >>>>>>>>> Compiling the attached Cython file produced the attached C file >>>>>>>>> which >>>>>>>>> has errors in lines 532-534: >>>>>>>>> >>>>>>>>> ?__pyx_v_self->xx = None; >>>>>>>>> ?__pyx_v_self->yy = None; >>>>>>>>> ?__pyx_v_self->zz = None; >>>>>>>>> >>>>>>>>> There is no C symbol "None", so this doesn't compile. >>>>>>>>> >>>>>>>>> I first noticed the bug in Cython 0.15, but it's still in the latest >>>>>>>>> revision from Github. >>>>>>>>> >>>>>>>>> Konrad. >>>>>>>>> >>>>>>>>> _______________________________________________ >>>>>>>>> cython-devel mailing list >>>>>>>>> cython-devel at python.org >>>>>>>>> http://mail.python.org/mailman/listinfo/cython-devel >>>>>>>>> >>>>>>>> >>>>>>>> Hm, it seems the problem is that the call to the builtin float >>>>>>>> results >>>>>>>> in SimpleCallNode being replaced with PythonCApiNode, which then >>>>>>>> generates the result code, but the list of coerced nodes are >>>>>>>> CloneNodes of the original rhs, and CloneNode does not generate the >>>>>>>> result code of the original rhs (i.e. allocate and assign to a temp), >>>>>>>> which results in a None result. >>>>>>>> >>>>>>>> Maybe CascadedAssignmentNode should replace CloneNode.arg with the >>>>>>>> latest self.rhs in generate_assignment_code? I'm not entirely sure. >>>>> >>>>> >>>>> Seems like a hack to me. >>>>> >>>>>>> >>>>>>> >>>>>>> May be it's better to run OptimizeBuiltinCalls before >>>>>>> AnalyseExpressionsTransform? >>>>>> >>>>>> >>>>>> Doesn't OptimizeBuiltinCalls take advantage of type information? >>>>> >>>>> >>>>> Yes, it does :( >>>>> >>>>> So as Mark said the problem is CascadedAssignmentNode.coerced_rhs_list >>>>> is created before rhs is updated. >>>>> >>>> >>>> I think deferring the CloneNode creation to code generation time works >>>> (are there any known problem with doing type coercions at code >>>> generation time?). >>> >>> >>> Coercion errors at code generation time? >> >> >> Apologies up front for raising my voice, as my knowledge of the internals >> are getting so rusty...take this with a grain of salt. >> >> I'm +1 on working towards having the code generation phase be pure code >> generation. I did some refactorings to take mini-steps towards that once >> upon a time, moving some error conditions to before code generation. >> >> My preferred approach would be to do away with CascadedAssignmentNode at the >> parse tree stage: >> >> a = b = c = expr >> >> goes to >> >> tmp = expr >> c = tmp >> b = tmp >> a = tmp >> >> and so on. Of course it gets messier; >> >> (expr1)[expr2] = (expr3).attr = expr4 >> >> But apart from getting the time of evaluating each expression right the >> transform should be straightforward. One of the tempnodes/"let"-nodes (I >> forgot which one, or if they've been consolidated) should be able to fix >> this. >> >> Takes some more work though than a quick hack though... >> >> Dag >> > > In principle it was doing the same thing, apart from the actual > rewrite. I suppose the replacement problem can also be circumvented by > manually wrapping self.rhs in a CoerceToTempNode. The problem with > coerce_to_temp is that it does not create this node if the result is > already in a temp. Creating it manually does mean an extra useless > assignment, but it is an easy fix which happens at analyse_types time. > ?Instead we could also use another node that just proxies a few things > like generate_result_code and the result method. > > I like the idea though, it would be nice to only handle things in > SingleAssignmentNode. I recently added broadcasting (inserting leading > dimensions) and scalar assignment to memoryviews, and you can only > catch that at the assignment point. Currently it only supports single > assignments as the functionality is only in SingleAssignmentNode. > > I must say though, the following would look a bit weird: > > ? ?a = b[:] = c[:, :] = d > > as you always expect a kind of "cascade", e.g. you expect c[:, :] to > be assignable to b[:], or 'a', but none of that may be true at all. So > I'm fine with disallowing that, I think people should only use > cascaded assignment for variables. > >>> >>>> E.g. save 'env' during analyse_types and in >>>> generate_assignment_code do >>>> >>>> ? ?rhs = CloneNode(self.rhs).coerce_to(lhs.type, self.env) >>>> ? ?rhs.generate_evaluation_code(code) >>>> ? ?lhs.generate_assignment_code(rhs, code) >>>> >>>> Seems to work. >>>> >>> >>> Yeah, that's better. >>> >>> >> I don't like idea of transforming cascade assignment into N single assignment since we might break some optimizations and loose CF info. I thought about playing with properties. We can make CloneNode.arg a property, e.g.: CloneNode(arg_getter=lambda:self.rhs) -- vitja. From stefan_ml at behnel.de Wed Jan 25 08:41:14 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 25 Jan 2012 08:41:14 +0100 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: References: <1327405070.15017.140661027320813@webmail.messagingengine.com> Message-ID: <4F1FB21A.9080407@behnel.de> mark florisson, 24.01.2012 14:53: > On 24 January 2012 11:37, Konrad Hinsen wrote: >> Compiling the attached Cython file produced the attached C file which >> has errors in lines 532-534: >> >> __pyx_v_self->xx = None; >> __pyx_v_self->yy = None; >> __pyx_v_self->zz = None; >> >> There is no C symbol "None", so this doesn't compile. >> >> I first noticed the bug in Cython 0.15, but it's still in the latest >> revision from Github. > > Hm, it seems the problem is that the call to the builtin float results > in SimpleCallNode being replaced with PythonCApiNode, which then > generates the result code, but the list of coerced nodes are > CloneNodes of the original rhs, and CloneNode does not generate the > result code of the original rhs (i.e. allocate and assign to a temp), > which results in a None result. Back to the old idea of separating the type analysis into 1) a basic typing, inference and entry creation step and 2) a proper type analysis, coercion, etc. step. The type driven optimisations would then run in between the two. That would simplify the optimisations (which would no longer have to unpack wrapped nodes) and improve the type analysis because it could work with the optimised types, e.g. return types of optimised builtin functions. I'm not entirely sure where the type inference should run. It may make more sense to move it after the tree optimisations to make use of optimised function calls. While we're at it, we should also replace the current type inference mechanism with a control flow based one. Sounds like a good topic for a Cython hacking workshop. Stefan From d.s.seljebotn at astro.uio.no Wed Jan 25 09:00:40 2012 From: d.s.seljebotn at astro.uio.no (Dag Sverre Seljebotn) Date: Wed, 25 Jan 2012 09:00:40 +0100 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: References: <1327405070.15017.140661027320813@webmail.messagingengine.com>

<4F1F03EE.5000504@astro.uio.no> Message-ID: <4F1FB6A8.8080103@astro.uio.no> On 01/25/2012 07:59 AM, Vitja Makarov wrote: > 2012/1/25 mark florisson: >> On 24 January 2012 19:18, Dag Sverre Seljebotn >> wrote: >>> On 01/24/2012 08:05 PM, Vitja Makarov wrote: >>>> >>>> 2012/1/24 mark florisson: >>>>> >>>>> On 24 January 2012 18:30, Vitja Makarov wrote: >>>>>> >>>>>> 2012/1/24 Robert Bradshaw: >>>>>>> >>>>>>> On Tue, Jan 24, 2012 at 6:09 AM, Vitja Makarov >>>>>>> wrote: >>>>>>>> >>>>>>>> 2012/1/24 mark florisson: >>>>>>>>> >>>>>>>>> On 24 January 2012 11:37, Konrad Hinsen >>>>>>>>> wrote: >>>>>>>>>> >>>>>>>>>> Compiling the attached Cython file produced the attached C file >>>>>>>>>> which >>>>>>>>>> has errors in lines 532-534: >>>>>>>>>> >>>>>>>>>> __pyx_v_self->xx = None; >>>>>>>>>> __pyx_v_self->yy = None; >>>>>>>>>> __pyx_v_self->zz = None; >>>>>>>>>> >>>>>>>>>> There is no C symbol "None", so this doesn't compile. >>>>>>>>>> >>>>>>>>>> I first noticed the bug in Cython 0.15, but it's still in the latest >>>>>>>>>> revision from Github. >>>>>>>>>> >>>>>>>>>> Konrad. >>>>>>>>>> >>>>>>>>>> _______________________________________________ >>>>>>>>>> cython-devel mailing list >>>>>>>>>> cython-devel at python.org >>>>>>>>>> http://mail.python.org/mailman/listinfo/cython-devel >>>>>>>>>> >>>>>>>>> >>>>>>>>> Hm, it seems the problem is that the call to the builtin float >>>>>>>>> results >>>>>>>>> in SimpleCallNode being replaced with PythonCApiNode, which then >>>>>>>>> generates the result code, but the list of coerced nodes are >>>>>>>>> CloneNodes of the original rhs, and CloneNode does not generate the >>>>>>>>> result code of the original rhs (i.e. allocate and assign to a temp), >>>>>>>>> which results in a None result. >>>>>>>>> >>>>>>>>> Maybe CascadedAssignmentNode should replace CloneNode.arg with the >>>>>>>>> latest self.rhs in generate_assignment_code? I'm not entirely sure. >>>>>> >>>>>> >>>>>> Seems like a hack to me. >>>>>> >>>>>>>> >>>>>>>> >>>>>>>> May be it's better to run OptimizeBuiltinCalls before >>>>>>>> AnalyseExpressionsTransform? >>>>>>> >>>>>>> >>>>>>> Doesn't OptimizeBuiltinCalls take advantage of type information? >>>>>> >>>>>> >>>>>> Yes, it does :( >>>>>> >>>>>> So as Mark said the problem is CascadedAssignmentNode.coerced_rhs_list >>>>>> is created before rhs is updated. >>>>>> >>>>> >>>>> I think deferring the CloneNode creation to code generation time works >>>>> (are there any known problem with doing type coercions at code >>>>> generation time?). >>>> >>>> >>>> Coercion errors at code generation time? >>> >>> >>> Apologies up front for raising my voice, as my knowledge of the internals >>> are getting so rusty...take this with a grain of salt. >>> >>> I'm +1 on working towards having the code generation phase be pure code >>> generation. I did some refactorings to take mini-steps towards that once >>> upon a time, moving some error conditions to before code generation. >>> >>> My preferred approach would be to do away with CascadedAssignmentNode at the >>> parse tree stage: >>> >>> a = b = c = expr >>> >>> goes to >>> >>> tmp = expr >>> c = tmp >>> b = tmp >>> a = tmp >>> >>> and so on. Of course it gets messier; >>> >>> (expr1)[expr2] = (expr3).attr = expr4 >>> >>> But apart from getting the time of evaluating each expression right the >>> transform should be straightforward. One of the tempnodes/"let"-nodes (I >>> forgot which one, or if they've been consolidated) should be able to fix >>> this. >>> >>> Takes some more work though than a quick hack though... >>> >>> Dag >>> >> >> In principle it was doing the same thing, apart from the actual >> rewrite. I suppose the replacement problem can also be circumvented by >> manually wrapping self.rhs in a CoerceToTempNode. The problem with >> coerce_to_temp is that it does not create this node if the result is >> already in a temp. Creating it manually does mean an extra useless >> assignment, but it is an easy fix which happens at analyse_types time. >> Instead we could also use another node that just proxies a few things >> like generate_result_code and the result method. >> >> I like the idea though, it would be nice to only handle things in >> SingleAssignmentNode. I recently added broadcasting (inserting leading >> dimensions) and scalar assignment to memoryviews, and you can only >> catch that at the assignment point. Currently it only supports single >> assignments as the functionality is only in SingleAssignmentNode. >> >> I must say though, the following would look a bit weird: >> >> a = b[:] = c[:, :] = d >> >> as you always expect a kind of "cascade", e.g. you expect c[:, :] to >> be assignable to b[:], or 'a', but none of that may be true at all. So >> I'm fine with disallowing that, I think people should only use >> cascaded assignment for variables. I don't think that is a problem myself; but that's perhaps just because I'm so used to it (and to "a = b.x = y" not invoking b.__getattr__, and so on). After all, that is what you get with Python and NumPy! This is, in a sense Python being a bit strange and us just following Python. So I'm +1 for supporting this if we can do it "by accident". >> >>>> >>>>> E.g. save 'env' during analyse_types and in >>>>> generate_assignment_code do >>>>> >>>>> rhs = CloneNode(self.rhs).coerce_to(lhs.type, self.env) >>>>> rhs.generate_evaluation_code(code) >>>>> lhs.generate_assignment_code(rhs, code) >>>>> >>>>> Seems to work. >>>>> >>>> >>>> Yeah, that's better. >>>> >>>> >>> > > I don't like idea of transforming cascade assignment into N single > assignment since we might break some optimizations and loose CF info. But what if the user decides to write tmp = expr a = tmp b = tmp manually? Shouldn't the same optimizations apply then? Consider that if we can get down to a single assignment node, we can then split it into SingleAssignmentNode into "AssignLocalNode", "AssignPythonModuleVarNode", "AssignAttributeNode", "AssignTypedVarNode" and so on, if we want to -- that should clean up some code... Dag Sverre From stefan_ml at behnel.de Wed Jan 25 09:04:02 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Wed, 25 Jan 2012 09:04:02 +0100 Subject: [Cython] Bug in Cython producing incorrect C code In-Reply-To: References: <1327405070.15017.140661027320813@webmail.messagingengine.com>

<4F1F03EE.5000504@astro.uio.no> Message-ID: <4F1FB772.1030906@behnel.de> mark florisson, 24.01.2012 21:28: > I must say though, the following would look a bit weird: > > a = b[:] = c[:, :] = d > > as you always expect a kind of "cascade", e.g. you expect c[:, :] to > be assignable to b[:], or 'a', but none of that may be true at all. That's normal for a typed language that has type auto-coercion. I consider this a major feature. It certainly makes the internals tricky, but when working on the assignment code, I always tried to keep the coercions and the eventual assignment code independent, even in the face of efficient tuple unpacking, because I considered it the expected behaviour. Stefan From markflorisson88 at gmail.com Wed Jan 25 11:43:35 2012 From: markflorisson88 at gmail.com (mark florisson) Date: Wed, 25 Jan 2012 10:43:35 +0000 Subject: [Cython] 0.16 release In-Reply-To: References: Message-ID: On 25 January 2012 01:27, Robert Bradshaw wrote: > On Mon, Jan 23, 2012 at 2:27 AM, mark florisson > wrote: >> Hey, >> >> It's been almost three months since we talked about a 0.16 release, I >> think it's quite ready. It would already be a big release, it would be >> good to see how people like it, and to catch any issues etc before we >> pile on more features. > > I would love to do a release soon. Last time this came up, I think the > big issue was (compilation) performance regression. Has this been > adequately addressed? Sort of. Basically if you don't use memoryviews it will be as fast as it used to be, otherwise there is about a 3 second constant time overhead (on my machine). > The other issue is that there are a couple of > doctest failures with Sage. One source of problems is decorators due > to the (ugly) disallowing of function re-declarations, I'll try look > into this one. There are also a huge number of segfaults (see the > bottom of https://sage.math.washington.edu:8091/hudson/view/ext-libs/job/sage-tests/lastSuccessfulBuild/artifact/log.txt > ) which we need to get to the bottom of. Oh I see. I suppose to try it out under a debugger one would have to compile the whole of sage from source? > - Robert > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel From markflorisson88 at gmail.com Wed Jan 25 11:44:40 2012 From: markflorisson88 at gmail.com (mark florisson) Date: Wed, 25 Jan 2012 10:44:40 +0000 Subject: [Cython] inline defnode calls In-Reply-To: References:

Message-ID: On 25 January 2012 06:49, Vitja Makarov wrote: > 2012/1/25 mark florisson : >> I just noticed the inline defnode call code. When I try to compile >> with 'cython -Xoptimize.inline_defnode_calls=True test.pyx' with the >> following code: >> >> def foo(x): print foo >> foo(10) >> >> I get >> >> Error compiling Cython file: >> ------------------------------------------------------------ >> ... >> def foo(x): >> ? ?print x >> >> foo(10) >> ?^ >> ------------------------------------------------------------ >> >> test.pyx:4:3: Compiler crash in InlineDefNodeCalls >> >> ModuleNode.body = StatListNode(test.pyx:1:0) >> StatListNode.stats[2] = ExprStatNode(test.pyx:4:3) >> ExprStatNode.expr = SimpleCallNode(test.pyx:4:3, >> ? ?result_is_used = True, >> ? ?use_managed_ref = True) >> >> Compiler crash traceback from this point on: >> ?File "/Users/mark/cy/Cython/Compiler/Visitor.py", line 176, in _visitchild >> ? ?result = handler_method(child) >> ?File "/Users/mark/cy/Cython/Compiler/Optimize.py", line 1656, in >> visit_SimpleCallNode >> ? ?if not function_name.cf_state.is_single: >> AttributeError: 'NoneType' object has no attribute 'is_single' > > > Thanks for the report! The feature is still experimental and by > default is disabled. > Anyway it wouldn't work for your example. It works when we know what > exactly function is referred by the name so it's closure case: > > def foo(): > ? ?def bar(): > ? ? ? ?pass > ? ?bar() > > -- > vitja. > _______________________________________________ > cython-devel mailing list > cython-devel at python.org > http://mail.python.org/mailman/listinfo/cython-devel Ah, neat. I thought it was perhaps also defying monkeypatching. From vitja.makarov at gmail.com Wed Jan 25 12:24:08 2012 From: vitja.makarov at gmail.com (Vitja Makarov) Date: Wed, 25 Jan 2012 15:24:08 +0400 Subject: [Cython] inline defnode calls In-Reply-To: References:

Message-ID: 2012/1/25 mark florisson : > On 25 January 2012 06:49, Vitja Makarov wrote: >> 2012/1/25 mark florisson : >>> I just noticed the inline defnode call code. When I try to compile >>> with 'cython -Xoptimize.inline_defnode_calls=True test.pyx' with the >>> following code: >>> >>> def foo(x): print foo >>> foo(10) >>> >>> I get >>> >>> Error compiling Cython file: >>> ------------------------------------------------------------ >>> ... >>> def foo(x): >>> ? ?print x >>> >>> foo(10) >>> ?^ >>> ------------------------------------------------------------ >>> >>> test.pyx:4:3: Compiler crash in InlineDefNodeCalls >>> >>> ModuleNode.body = StatListNode(test.pyx:1:0) >>> StatListNode.stats[2] = ExprStatNode(test.pyx:4:3) >>> ExprStatNode.expr = SimpleCallNode(test.pyx:4:3, >>> ? ?result_is_used = True, >>> ? ?use_managed_ref = True) >>> >>> Compiler crash traceback from this point on: >>> ?File "/Users/mark/cy/Cython/Compiler/Visitor.py", line 176, in _visitchild >>> ? ?result = handler_method(child) >>> ?File "/Users/mark/cy/Cython/Compiler/Optimize.py", line 1656, in >>> visit_SimpleCallNode >>> ? ?if not function_name.cf_state.is_single: >>> AttributeError: 'NoneType' object has no attribute 'is_single' >> >> >> Thanks for the report! The feature is still experimental and by >> default is disabled. >> Anyway it wouldn't work for your example. It works when we know what >> exactly function is referred by the name so it's closure case: >> >> def foo(): >> ? ?def bar(): >> ? ? ? ?pass >> ? ?bar() >> >> -- >> vitja. >> _______________________________________________ >> cython-devel mailing list >> cython-devel at python.org >> http://mail.python.org/mailman/listinfo/cython-devel > > Ah, neat. I thought it was perhaps also defying monkeypatching. I'm thinking about implementing "conditional inlining": depending on what function actually is it'll make direct call to C function or PyObject_Call(). -- vitja. From markflorisson88 at gmail.com Wed Jan 25 12:32:34 2012 From: markflorisson88 at gmail.com (mark florisson) Date: Wed, 25 Jan 2012 11:32:34 +0000 Subject: [Cython] inline defnode calls In-Reply-To: References:

Message-ID: On 25 January 2012 11:24, Vitja Makarov wrote: > 2012/1/25 mark florisson