From matthew.brett at gmail.com Wed May 1 00:16:38 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 30 Apr 2013 21:16:38 -0700 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: <20130501030849.GO5140@onerussian.com> References: <20130501030849.GO5140@onerussian.com> Message-ID: Hi, On Tue, Apr 30, 2013 at 8:08 PM, Yaroslav Halchenko wrote: > could anyone on 32bit system with fresh numpy (1.7.1) test following: > >> wget -nc http://www.onerussian.com/tmp/data.npy ; python -c 'import numpy as np; data1 = np.load("/tmp/data.npy"); print np.sum(data1[1,:,0,1]) - np.sum(data1, axis=1)[1,0,1]' > > 0.0 > > because unfortunately it seems on fresh ubuntu raring (in 32bit build only, > seems ok in 64 bit... also never ran into it on older numpy releases): > >> python -c 'import numpy as np; data1 = np.load("/tmp/data.npy"); print np.sum(data1[1,:,0,1]) - np.sum(data1, axis=1)[1,0,1]' > -1.11022302463e-16 > > PS detected by failed tests of pymvpa Reduced case on numpy 1.7.1, 32-bit Ubuntu 12.04.2 In [64]: data = np.array([[ 0.49505185, 0.47212842], [ 0.53529587, 0.04366172], [-0.13461665, -0.01664215]]) In [65]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] Out[65]: 1.1102230246251565e-16 No difference for single vector: In [4]: data1 = data[:, 0:1] In [5]: np.sum(data1[:, 0]) - np.sum(data1, axis=0)[0] Out[5]: 0.0 Puzzling to me... Cheers, Matthew From chris.barker at noaa.gov Wed May 1 01:13:53 2013 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Tue, 30 Apr 2013 22:13:53 -0700 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: Message-ID: <1888613198476891170@unknownmsgid> On Apr 30, 2013, at 6:37 PM, Benjamin Root wrote: > I can not think of any reason not to include these functions in v1.8. +1 > Of course, the documentation for discussed before: np.minmax(). My thinking is that it would return a 2xN array How about a tuple: (min, max)? -Chris From matthew.brett at gmail.com Wed May 1 01:20:33 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 30 Apr 2013 22:20:33 -0700 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: References: <20130501030849.GO5140@onerussian.com> Message-ID: Hi, On Tue, Apr 30, 2013 at 9:16 PM, Matthew Brett wrote: > Hi, > > On Tue, Apr 30, 2013 at 8:08 PM, Yaroslav Halchenko > wrote: >> could anyone on 32bit system with fresh numpy (1.7.1) test following: >> >>> wget -nc http://www.onerussian.com/tmp/data.npy ; python -c 'import numpy as np; data1 = np.load("/tmp/data.npy"); print np.sum(data1[1,:,0,1]) - np.sum(data1, axis=1)[1,0,1]' >> >> 0.0 >> >> because unfortunately it seems on fresh ubuntu raring (in 32bit build only, >> seems ok in 64 bit... 
also never ran into it on older numpy releases): >> >>> python -c 'import numpy as np; data1 = np.load("/tmp/data.npy"); print np.sum(data1[1,:,0,1]) - np.sum(data1, axis=1)[1,0,1]' >> -1.11022302463e-16 >> >> PS detected by failed tests of pymvpa > > Reduced case on numpy 1.7.1, 32-bit Ubuntu 12.04.2 > > In [64]: data = np.array([[ 0.49505185, 0.47212842], > [ 0.53529587, 0.04366172], > [-0.13461665, -0.01664215]]) > > In [65]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] > Out[65]: 1.1102230246251565e-16 > > No difference for single vector: > > In [4]: data1 = data[:, 0:1] > > In [5]: np.sum(data1[:, 0]) - np.sum(data1, axis=0)[0] > Out[5]: 0.0 Also true on current numpy trunk: In [2]: import numpy as np In [3]: np.__version__ Out[3]: '1.8.0.dev-a8805f6' In [4]: data = np.array([[ 0.49505185, 0.47212842], ....: [ 0.53529587, 0.04366172], ....: [-0.13461665, -0.01664215]]) In [5]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] Out[5]: 1.1102230246251565e-16 Not true on numpy 1.6.1: In [2]: np.__version__ Out[2]: '1.6.1' In [3]: data = np.array([[ 0.49505185, 0.47212842], ....: [ 0.53529587, 0.04366172], ....: [-0.13461665, -0.01664215]]) In [4]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] Out[4]: 0.0 Cheers, Matthew From brad.froehle at gmail.com Wed May 1 01:32:41 2013 From: brad.froehle at gmail.com (Bradley M. Froehle) Date: Tue, 30 Apr 2013 22:32:41 -0700 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: <20130501030849.GO5140@onerussian.com> References: <20130501030849.GO5140@onerussian.com> Message-ID: On Tue, Apr 30, 2013 at 8:08 PM, Yaroslav Halchenko wrote: > could anyone on 32bit system with fresh numpy (1.7.1) test following: > > > wget -nc http://www.onerussian.com/tmp/data.npy ; python -c 'import > numpy as np; data1 = np.load("/tmp/data.npy"); print > np.sum(data1[1,:,0,1]) - np.sum(data1, axis=1)[1,0,1]' > > 0.0 > > because unfortunately it seems on fresh ubuntu raring (in 32bit build only, > seems ok in 64 bit... also never ran into it on older numpy releases): > > > python -c 'import numpy as np; data1 = np.load("/tmp/data.npy"); print > np.sum(data1[1,:,0,1]) - np.sum(data1, axis=1)[1,0,1]' > -1.11022302463e-16 > Perhaps on the 32-bit system one call is using the 80-bit extended precision register for the summation and the other one is not? -Brad -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Wed May 1 05:00:13 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 01 May 2013 11:00:13 +0200 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: References: <20130501030849.GO5140@onerussian.com> Message-ID: <1367398813.2545.11.camel@sebastian-laptop> On Tue, 2013-04-30 at 22:20 -0700, Matthew Brett wrote: > Hi, > > On Tue, Apr 30, 2013 at 9:16 PM, Matthew Brett wrote: > > Hi, > > > > On Tue, Apr 30, 2013 at 8:08 PM, Yaroslav Halchenko > > wrote: > >> could anyone on 32bit system with fresh numpy (1.7.1) test following: > >> > >>> wget -nc http://www.onerussian.com/tmp/data.npy ; python -c 'import numpy as np; data1 = np.load("/tmp/data.npy"); print np.sum(data1[1,:,0,1]) - np.sum(data1, axis=1)[1,0,1]' > >> > >> 0.0 > >> > >> because unfortunately it seems on fresh ubuntu raring (in 32bit build only, > >> seems ok in 64 bit... 
also never ran into it on older numpy releases): > >> > >>> python -c 'import numpy as np; data1 = np.load("/tmp/data.npy"); print np.sum(data1[1,:,0,1]) - np.sum(data1, axis=1)[1,0,1]' > >> -1.11022302463e-16 > >> > >> PS detected by failed tests of pymvpa > > > > Reduced case on numpy 1.7.1, 32-bit Ubuntu 12.04.2 > > > > In [64]: data = np.array([[ 0.49505185, 0.47212842], > > [ 0.53529587, 0.04366172], > > [-0.13461665, -0.01664215]]) > > > > In [65]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] > > Out[65]: 1.1102230246251565e-16 > > > > No difference for single vector: > > > > In [4]: data1 = data[:, 0:1] > > > > In [5]: np.sum(data1[:, 0]) - np.sum(data1, axis=0)[0] > > Out[5]: 0.0 > > Also true on current numpy trunk: > > In [2]: import numpy as np > > In [3]: np.__version__ > Out[3]: '1.8.0.dev-a8805f6' > > In [4]: data = np.array([[ 0.49505185, 0.47212842], > ....: [ 0.53529587, 0.04366172], > ....: [-0.13461665, -0.01664215]]) > > In [5]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] > Out[5]: 1.1102230246251565e-16 > > Not true on numpy 1.6.1: > > In [2]: np.__version__ > Out[2]: '1.6.1' > > In [3]: data = np.array([[ 0.49505185, 0.47212842], > ....: [ 0.53529587, 0.04366172], > ....: [-0.13461665, -0.01664215]]) > > In [4]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] > Out[4]: 0.0 > Puzzles me, I didn't think calculation order was different in both cases and optimized for the reduction part. But maybe check the code, if it is optimized, it would calculate this more like `res += data[0]; res += data[1]; res += data[2]` (for faster memory access), which would probably kill the extended registers (I don't know this hardware stuff, so might be wrong). One simple try hinting that this may be going on would be to data fortran order. - Sebastian > Cheers, > > Matthew > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From sebastian at sipsolutions.net Wed May 1 05:08:15 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 01 May 2013 11:08:15 +0200 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: <1367398813.2545.11.camel@sebastian-laptop> References: <20130501030849.GO5140@onerussian.com> <1367398813.2545.11.camel@sebastian-laptop> Message-ID: <1367399295.2545.14.camel@sebastian-laptop> On Wed, 2013-05-01 at 11:00 +0200, Sebastian Berg wrote: > On Tue, 2013-04-30 at 22:20 -0700, Matthew Brett wrote: > > Hi, > > > > On Tue, Apr 30, 2013 at 9:16 PM, Matthew Brett wrote: > > > Hi, > > > > > > On Tue, Apr 30, 2013 at 8:08 PM, Yaroslav Halchenko > > > wrote: > > >> could anyone on 32bit system with fresh numpy (1.7.1) test following: > > >> > > >>> wget -nc http://www.onerussian.com/tmp/data.npy ; python -c 'import numpy as np; data1 = np.load("/tmp/data.npy"); print np.sum(data1[1,:,0,1]) - np.sum(data1, axis=1)[1,0,1]' > > >> > > >> 0.0 > > >> > > >> because unfortunately it seems on fresh ubuntu raring (in 32bit build only, > > >> seems ok in 64 bit... 
also never ran into it on older numpy releases): > > >> > > >>> python -c 'import numpy as np; data1 = np.load("/tmp/data.npy"); print np.sum(data1[1,:,0,1]) - np.sum(data1, axis=1)[1,0,1]' > > >> -1.11022302463e-16 > > >> > > >> PS detected by failed tests of pymvpa > > > > > > Reduced case on numpy 1.7.1, 32-bit Ubuntu 12.04.2 > > > > > > In [64]: data = np.array([[ 0.49505185, 0.47212842], > > > [ 0.53529587, 0.04366172], > > > [-0.13461665, -0.01664215]]) > > > > > > In [65]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] > > > Out[65]: 1.1102230246251565e-16 > > > > > > No difference for single vector: > > > > > > In [4]: data1 = data[:, 0:1] > > > > > > In [5]: np.sum(data1[:, 0]) - np.sum(data1, axis=0)[0] > > > Out[5]: 0.0 > > > > Also true on current numpy trunk: > > > > In [2]: import numpy as np > > > > In [3]: np.__version__ > > Out[3]: '1.8.0.dev-a8805f6' > > > > In [4]: data = np.array([[ 0.49505185, 0.47212842], > > ....: [ 0.53529587, 0.04366172], > > ....: [-0.13461665, -0.01664215]]) > > > > In [5]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] > > Out[5]: 1.1102230246251565e-16 > > > > Not true on numpy 1.6.1: > > > > In [2]: np.__version__ > > Out[2]: '1.6.1' > > > > In [3]: data = np.array([[ 0.49505185, 0.47212842], > > ....: [ 0.53529587, 0.04366172], > > ....: [-0.13461665, -0.01664215]]) > > > > In [4]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] > > Out[4]: 0.0 > > > > Puzzles me, I didn't think calculation order was different in both cases > and optimized for the reduction part. But maybe check the code, if it > is optimized, it would calculate this more like `res += data[0]; res += > data[1]; res += data[2]` (for faster memory access), which would > probably kill the extended registers (I don't know this hardware stuff, > so might be wrong). One simple try hinting that this may be going on > would be to data fortran order. > Well I guess it is optimized and the reason I thought it was not, was because it probably was not before 1.7. so I think this is probably the reason. > - Sebastian > > > Cheers, > > > > Matthew > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From lists at onerussian.com Wed May 1 08:49:33 2013 From: lists at onerussian.com (Yaroslav Halchenko) Date: Wed, 1 May 2013 08:49:33 -0400 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: <1367399295.2545.14.camel@sebastian-laptop> References: <20130501030849.GO5140@onerussian.com> <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> Message-ID: <20130501124933.GP5140@onerussian.com> Thanks everyone for the feedback. Is it worth me starting a bisection to catch where it was introduced? On Wed, 01 May 2013, Sebastian Berg wrote: > > so might be wrong). One simple try hinting that this may be going on > > would be to data fortran order. > Well I guess it is optimized and the reason I thought it was not, was > because it probably was not before 1.7. so I think this is probably the > reason. -- Yaroslav O. Halchenko, Ph.D. http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Senior Research Associate, Psychological and Brain Sciences Dept. 
Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From njs at pobox.com Wed May 1 08:55:56 2013 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 1 May 2013 08:55:56 -0400 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: <20130501124933.GP5140@onerussian.com> References: <20130501030849.GO5140@onerussian.com> <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> <20130501124933.GP5140@onerussian.com> Message-ID: On 1 May 2013 08:49, "Yaroslav Halchenko" wrote: > > Thanks everyone for the feedback. > > Is it worth me starting a bisection to catch where it was introduced? Is it a bug, or just typical fp rounding issues? Do we know which answer is correct? -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From lists at onerussian.com Wed May 1 09:01:25 2013 From: lists at onerussian.com (Yaroslav Halchenko) Date: Wed, 1 May 2013 09:01:25 -0400 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: References: <20130501030849.GO5140@onerussian.com> <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> <20130501124933.GP5140@onerussian.com> Message-ID: <20130501130125.GQ5140@onerussian.com> On Wed, 01 May 2013, Nathaniel Smith wrote: > > Thanks everyone for the feedback. > > Is it worth me starting a bisection to catch where it was introduced? > Is it a bug, or just typical fp rounding issues? Do we know which answer > is correct? to ignorant me, even without considering 'correctness', it is just a typical regression -- results changed from one release to another (and not to the better side). -- Yaroslav O. Halchenko, Ph.D. http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Senior Research Associate, Psychological and Brain Sciences Dept. Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From pav at iki.fi Wed May 1 09:12:08 2013 From: pav at iki.fi (Pauli Virtanen) Date: Wed, 01 May 2013 16:12:08 +0300 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: <20130501130125.GQ5140@onerussian.com> References: <20130501030849.GO5140@onerussian.com> <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> <20130501124933.GP5140@onerussian.com> <20130501130125.GQ5140@onerussian.com> Message-ID: 01.05.2013 16:01, Yaroslav Halchenko kirjoitti: [clip] > to ignorant me, even without considering 'correctness', it is just > a typical regression -- results changed from one release to another (and > not to the better side). To me this seems to be a consequence of performing additions in a different order than previously. Both results are IMHO correct, so I'm not sure there is anything to fix here. Third-party code relying on a certain outcome of rounding error is likely incorrect anyway. 
-- Pauli Virtanen From ben.root at ou.edu Wed May 1 09:52:13 2013 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 1 May 2013 09:52:13 -0400 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: <1888613198476891170@unknownmsgid> References: <1888613198476891170@unknownmsgid> Message-ID: On Wed, May 1, 2013 at 1:13 AM, Chris Barker - NOAA Federal < chris.barker at noaa.gov> wrote: > > Of course, the documentation for discussed before: np.minmax(). My > thinking is that it would return a 2xN array > > How about a tuple: (min, max)? > > I am not familiar enough with numpy internals to know which is the better approach to implement. I kind of feel that the 2xN array approach would be more flexible in case a user wants all of this information in a single array, while still allowing for unpacking as if it was a tuple. I would rather enable unforeseen use-cases rather than needlessly restricting them. Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed May 1 09:54:56 2013 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 1 May 2013 09:54:56 -0400 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: References: <20130501030849.GO5140@onerussian.com> <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> <20130501124933.GP5140@onerussian.com> <20130501130125.GQ5140@onerussian.com> Message-ID: On Wed, May 1, 2013 at 9:12 AM, Pauli Virtanen wrote: > 01.05.2013 16:01, Yaroslav Halchenko kirjoitti: > [clip] >> to ignorant me, even without considering 'correctness', it is just >> a typical regression -- results changed from one release to another (and >> not to the better side). > > To me this seems to be a consequence of performing additions in a > different order than previously. Both results are IMHO correct, so I'm > not sure there is anything to fix here. Third-party code relying on a > certain outcome of rounding error is likely incorrect anyway. Yeah, seems to just be the standard floating point indeterminism. Using Matthew's numbers and pure Python floats: In [9]: (0.49505185 + 0.53529587) + -0.13461665 Out[9]: 0.89573107 In [10]: 0.49505185 + (0.53529587 + -0.13461665) Out[10]: 0.8957310700000001 In [11]: _9 - _10 Out[11]: -1.1102230246251565e-16 Looks like a bug in pymvpa or its test suite to me. -n From davidmenhur at gmail.com Wed May 1 10:14:22 2013 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Wed, 1 May 2013 16:14:22 +0200 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: Message-ID: On 1 May 2013 03:36, Benjamin Root wrote: > Are there any other functions that others feel are "missing" from numpy and > would like to see for v1.8? Let's discuss them here. I would like to have sincos, to compute sin and cos of the same number faster. According to some benchmarks, it is barely slower than just computing one of them. On 1 May 2013 07:13, Chris Barker - NOAA Federal wrote: >> Of course, the documentation for discussed before: np.minmax(). My thinking is that it would return a 2xN array > > How about a tuple: (min, max)? Consider the case of np.minmax(matrix, axis=1), you will end up with a tuple of two arrays. In that scenario, you probably want to do computations with both numbers, so having them in an array seems more convenient. If there is enough reason, we could always add a "unpack=True" flag and then return a tuple. 
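For concreteness, a pure-Python sketch of the np.minmax() semantics under discussion -- the name, the "unpack" flag, and the 2xN return convention are all hypothetical at this point, not an existing numpy API, and a real implementation would do a single C-level pass:

    import numpy as np

    def minmax(a, axis=None, unpack=False):
        # Hypothetical reference version: it just stacks the two existing
        # reductions so the return conventions can be compared; the point
        # of a C implementation would be doing both in one pass.
        mn = np.min(a, axis=axis)
        mx = np.max(a, axis=axis)
        if unpack:
            return mn, mx              # tuple of two arrays
        return np.array([mn, mx])      # leading axis of length 2

Either convention supports "mn, mx = minmax(data, axis=0)", since a length-2 leading axis unpacks like a tuple; the array form additionally keeps both results in a single block, as Benjamin suggests.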
From juanlu001 at gmail.com Wed May 1 10:55:44 2013 From: juanlu001 at gmail.com (Juan Luis Cano) Date: Wed, 01 May 2013 16:55:44 +0200 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: Message-ID: <51812CF0.30004@gmail.com> On 05/01/2013 04:14 PM, Daπid wrote: > On 1 May 2013 03:36, Benjamin Root wrote: >> Are there any other functions that others feel are "missing" from numpy and >> would like to see for v1.8? Let's discuss them here. > I would like to have sincos, to compute sin and cos of the same number > faster. According to some benchmarks, it is barely slower than just > computing one of them. Where does this `sincos` function come from? From zploskey at gmail.com Wed May 1 11:06:17 2013 From: zploskey at gmail.com (Zachary Ploskey) Date: Wed, 1 May 2013 08:06:17 -0700 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: <51812CF0.30004@gmail.com> References: <51812CF0.30004@gmail.com> Message-ID: The sincos function is in the c standard library in math.h. On May 1, 2013 7:56 AM, "Juan Luis Cano" wrote: > On 05/01/2013 04:14 PM, Daπid wrote: > > On 1 May 2013 03:36, Benjamin Root wrote: > >> Are there any other functions that others feel are "missing" from numpy > and > >> would like to see for v1.8? Let's discuss them here. > > I would like to have sincos, to compute sin and cos of the same number > > faster. According to some benchmarks, it is barely slower than just > > computing one of them. > > Where does this `sincos` function come from? > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From toddrjen at gmail.com Wed May 1 11:13:49 2013 From: toddrjen at gmail.com (Todd) Date: Wed, 1 May 2013 17:13:49 +0200 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: Message-ID: On Wed, May 1, 2013 at 3:36 AM, Benjamin Root wrote: > > Are there any other functions that others feel are "missing" from numpy > and would like to see for v1.8? Let's discuss them here. > As I mentioned before, I think numpy should have some functions for dealing with n-dimensional vectors (which would also work with complex dtypes). This would include n-dimensional equivalents of np.abs and np.angle, as well as a function to go back to an n-dimensional vector from the length and angle. Considering how critical vector analysis is to signal processing, I am surprised these don't already exist. There aren't even functions that work with 2-dimensional vectors, you have to construct a complex array first (which isn't that easy to do). Speaking of which, I think there should be a function to construct a complex array out of two identically-shaped floating-point arrays, as well as perhaps an np.i class that converts a real array to an imaginary one (using __mul__ and such). -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Wed May 1 11:14:33 2013 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 1 May 2013 16:14:33 +0100 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <51812CF0.30004@gmail.com> Message-ID: On Wed, May 1, 2013 at 4:06 PM, Zachary Ploskey wrote: > The sincos function is in the c standard library in math.h.
I don't think it's part of the C99 standard. It appears to be provided in glibc as a non-standard extension. We would have to provide our own copy, but one is available in the Cephes library. -- Robert Kern From jslavin at cfa.harvard.edu Wed May 1 11:20:21 2013 From: jslavin at cfa.harvard.edu (Jonathan Slavin) Date: Wed, 01 May 2013 11:20:21 -0400 Subject: [Numpy-discussion] clip with None argument changes dtype Message-ID: <1367421621.4115.16.camel@shevek> Hi all, I'm wondering if you think the following behavior in numpy.clip is a bug (it certainly confused me for a while): >>> x = np.arange(5.) >>> xx = x.clip(None,3.) >>> xx array([0.0, 1.0, 2.0, 3.0, 3.0], dtype=object) Since xx now has the dtype of object, doing things like >>> np.exp(xx) AttributeError Traceback (most recent call last) in () ----> 1 np.exp(xx) Which, if you don't know about the change in the dtype, is a very confusing error message. It seems to me that either clip should give an error message when None is given for the a_min argument or, better, should not change the dtype of the input array as it does. This comes up because I want to only clip the maximum. Of course you can clip the minimum by simply omitting the second argument. The asymmetry of this is not good, I think. I suppose that using the maximum function is better in this situation (as I recently found out), though the docs make it seem that one needs to supply two arrays -- even though you don't. Jon -- ______________________________________________________________ Jonathan D. Slavin Harvard-Smithsonian CfA jslavin at cfa.harvard.edu 60 Garden Street, MS 83 phone: (617) 496-7981 Cambridge, MA 02138-1516 cell: (781) 363-0035 USA ______________________________________________________________ From davidmenhur at gmail.com Wed May 1 11:22:28 2013 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Wed, 1 May 2013 17:22:28 +0200 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: Message-ID: On 1 May 2013 17:13, Todd wrote: > Speaking of which, I think there should be a function to construct a complex > array out of two identically-shaped floating-point arrays, as well as > perhaps an np.i class that converts a real array to an imaginary one (using > __mul__ and such). np.i would be exactly the same as array * 1j, or am I missing anything? The same goes for constructing a complex, real + imag * 1j From toddrjen at gmail.com Wed May 1 11:34:34 2013 From: toddrjen at gmail.com (Todd) Date: Wed, 1 May 2013 17:34:34 +0200 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: Message-ID: On Wed, May 1, 2013 at 5:22 PM, Daπid wrote: > On 1 May 2013 17:13, Todd wrote: > > Speaking of which, I think there should be a function to construct a > complex > > array out of two identically-shaped floating-point arrays, as well as > > perhaps an np.i class that converts a real array to an imaginary one > (using > > __mul__ and such). > > np.i would be exactly the same as array * 1j, or am I missing anything? > > The same goes for constructing a complex, real + imag * 1j > > > it would always produce a numpy array. So array*1j and array*np.i (or np.j if you prefer, perhaps both) would be the same, but list*1j and list*np.i would not. The function version would also probably allow you to specify the dtype, which 1j does not. -------------- next part -------------- An HTML attachment was scrubbed... URL:
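As a point of reference for this sub-thread: with current numpy a complex array can already be built from two real arrays without the temporaries that "x + 1j*y" creates. A minimal sketch (an illustration only, not a proposed API):

    import numpy as np

    x = np.array([1.0, 2.0, 3.0])
    y = np.array([4.0, 5.0, 6.0])

    # Allocate the complex result once, then fill its components in
    # place; .real and .imag are writable views for complex arrays,
    # so no intermediate complex arrays are created.
    z = np.empty(x.shape, dtype=np.complex128)
    z.real = x
    z.imag = y

A dedicated constructor function would essentially package these three lines into a single call (and check that the two shapes match).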
From robert.kern at gmail.com Wed May 1 11:36:13 2013 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 1 May 2013 16:36:13 +0100 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: Message-ID: On Wed, May 1, 2013 at 4:22 PM, Daπid wrote: > On 1 May 2013 17:13, Todd wrote: >> Speaking of which, I think there should be a function to construct a complex >> array out of two identically-shaped floating-point arrays, as well as >> perhaps an np.i class that converts a real array to an imaginary one (using >> __mul__ and such). > > np.i would be exactly the same as array * 1j, or am I missing anything? I don't think we have a ufunc loop for multiply() that takes a float64 and a complex128 and returns a complex128. We just have a (complex128,complex128)->complex128. `x * 1j` first converts `x` to a complex128 array with the value in the real component, then multiplies that with 1j to move that value over to the imag component. A single operation that takes a float64 array and just makes a complex128 array with the values put in the imag component will reduce a temporary. > The same goes for constructing a complex, real + imag * 1j Similarly, we can eliminate two temporaries here. Both of the cases are probably best addressed by a single function. The syntactic sugar of an np.i object is unnecessary, IMO. imag = np.tocomplex(0.0, x) z = np.tocomplex(x, y) -- Robert Kern From lists at onerussian.com Wed May 1 12:09:48 2013 From: lists at onerussian.com (Yaroslav Halchenko) Date: Wed, 1 May 2013 12:09:48 -0400 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: References: <20130501030849.GO5140@onerussian.com> <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> <20130501124933.GP5140@onerussian.com> <20130501130125.GQ5140@onerussian.com> Message-ID: <20130501160948.GR5140@onerussian.com> On Wed, 01 May 2013, Nathaniel Smith wrote: > > not sure there is anything to fix here. Third-party code relying on a > > certain outcome of rounding error is likely incorrect anyway. > Yeah, seems to just be the standard floating point indeterminism. > Using Matthew's numbers and pure Python floats: > In [9]: (0.49505185 + 0.53529587) + -0.13461665 > Out[9]: 0.89573107 > In [10]: 0.49505185 + (0.53529587 + -0.13461665) > Out[10]: 0.8957310700000001 > In [11]: _9 - _10 > Out[11]: -1.1102230246251565e-16 > Looks like a bug in pymvpa or its test suite to me. well -- sure thing we will "fix" the unittest to not rely on precise correspondence any longer since released 1.7.1 is affected. So it is not a matter of me avoiding "fixing" pymvpa's "bug". I brought it to your attention because 1. from e.g. np.sum(data[:, 0]) - np.sum(data, axis=0)[0] which presumably should be the same order of additions for 0-th column it is not clear that they do not have to be identical. 2. so far they were identical across many numpy releases 3. they are identical on other architectures (e.g. amd64) -- Yaroslav O. Halchenko, Ph.D. http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Senior Research Associate, Psychological and Brain Sciences Dept.
Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From warren.weckesser at gmail.com Wed May 1 12:19:38 2013 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Wed, 1 May 2013 12:19:38 -0400 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: Message-ID: On Wed, May 1, 2013 at 10:14 AM, Da?id wrote: > On 1 May 2013 03:36, Benjamin Root wrote: > > Are there any other functions that others feel are "missing" from numpy > and > > would like to see for v1.8? Let's discuss them here. > > I would like to have sincos, to compute sin and cos of the same number > faster. According to some benchmarks, it is barely slower than just > computing one of them. > > +1 Warren > > On 1 May 2013 07:13, Chris Barker - NOAA Federal > wrote: > >> Of course, the documentation for discussed before: np.minmax(). My > thinking is that it would return a 2xN array > > > > How about a tuple: (min, max)? > > Consider the case of np.minmax(matrix, axis=1), you will end up with a > tuple of two arrays. In that scenario, you probably want to do > computations with both numbers, so having them in an array seems more > convenient. > > If there is enough reason, we could always add a "unpack=True" flag > and then return a tuple. > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Wed May 1 12:58:17 2013 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Wed, 1 May 2013 09:58:17 -0700 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: On Wed, May 1, 2013 at 6:52 AM, Benjamin Root wrote: > How about a tuple: (min, max)? > >> >> > I am not familiar enough with numpy internals to know which is the better > approach to implement. I kind of feel that the 2xN array approach would be > more flexible in case a user wants all of this information in a single > array, while still allowing for unpacking as if it was a tuple. > hmm, my thinking is that the min and max values really are two different results, so getting two separate arrays makes sense to me. however, you are right, Python's nifty generic sequence unpacking lets you use a (2X...) array similarly to a tuple of two arrays, so why not? Food for thought on one reason: min, max = np.minmax(arr) would result in two arrays, but they would be views on the same array, Not sure if that matters, though. -Chris -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Wed May 1 13:24:37 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 1 May 2013 10:24:37 -0700 Subject: [Numpy-discussion] could anyone check on a 32bit system? 
In-Reply-To: <20130501160948.GR5140@onerussian.com> References: <20130501030849.GO5140@onerussian.com> <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> <20130501124933.GP5140@onerussian.com> <20130501130125.GQ5140@onerussian.com> <20130501160948.GR5140@onerussian.com> Message-ID: Hi, On Wed, May 1, 2013 at 9:09 AM, Yaroslav Halchenko wrote: > > On Wed, 01 May 2013, Nathaniel Smith wrote: >> > not sure there is anything to fix here. Third-party code relying on a >> > certain outcome of rounding error is likely incorrect anyway. > >> Yeah, seems to just be the standard floating point indeterminism. >> Using Matthew's numbers and pure Python floats: > >> In [9]: (0.49505185 + 0.53529587) + -0.13461665 >> Out[9]: 0.89573107 > >> In [10]: 0.49505185 + (0.53529587 + -0.13461665) >> Out[10]: 0.8957310700000001 > >> In [11]: _9 - _10 >> Out[11]: -1.1102230246251565e-16 > >> Looks like a bug in pymvpa or its test suite to me. > > well -- sure thing we will "fix" the unittest to not rely on precise > correspondence any longer since released 1.7.1 is affected. So it is not > a matter of me avoiding "fixing" pymvpa's "bug". > > I brought it to your attention because > > 1. from e.g. > > np.sum(data[:, 0]) - np.sum(data, axis=0)[0] > > which presumably should be the same order of additions for 0-th column it is > not clear that they do not have to be identical. I agree it's surprising, but I guess it's reasonable for numpy to reserve the right to add these guys up in whatever order it chooses, and (in this case) maybe a different order for the axis=None, axis=X cases. Also, y'all may have noticed that it is the presence of the second vector in the array which causes the difference in the sums of the first (see my first email in this thread). If this is an order effect I guess this means that the order of operations in a sum(a, axis=X) operation depends on the shape of the array. And it looks like it depends on memory layout: In [24]: data = np.array([[ 0.49505185, 0], ....: [ 0.53529587, 0], ....: [-0.13461665, 0]]) In [25]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] Out[25]: 1.1102230246251565e-16 In [26]: data_F = np.array(data, order='F') In [27]: np.sum(data_F[:, 0]) - np.sum(data_F, axis=0)[0] Out[27]: 0.0 Do we allow the results to be different for different memory layout? > 2. so far they were identical across many numpy releases > > 3. they are identical on other architectures (e.g. amd64) To me that is surprising. I would have guessed that the order is the same on 32 and 64 bit, but something about the precision of intermediate operations is different. I don't know enough about amd64 to guess what that could be. Bradley's suggestion seems kind of reasonable but it's strange that numpy should use intel-80 bit intermediate values differently for 32 and 64 bit. Cheers, Matthew From ben.root at ou.edu Wed May 1 13:50:18 2013 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 1 May 2013 13:50:18 -0400 Subject: [Numpy-discussion] Proposal of new function: iteraxis() In-Reply-To: References: <20130426205002.GA4942@phare.normalesup.org> Message-ID: On Mon, Apr 29, 2013 at 2:10 PM, Andrew Giessel < andrew_giessel at hms.harvard.edu> wrote: > Matthew: Thanks for the link to array order discussion. > > Any more thoughts on Phil's slice() function? > > > I rather like Phil's solution. Just some caveats. Will it always return views or copies? It should be one or the other (I haven't looked closely enough to check), and it should be documented to that effect.
Plus, tests should be added to make sure it does that. Cheers! Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Wed May 1 14:07:14 2013 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 1 May 2013 19:07:14 +0100 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: References: <20130501030849.GO5140@onerussian.com> <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> <20130501124933.GP5140@onerussian.com> <20130501130125.GQ5140@onerussian.com> <20130501160948.GR5140@onerussian.com> Message-ID: On Wed, May 1, 2013 at 6:24 PM, Matthew Brett wrote: > HI, > > On Wed, May 1, 2013 at 9:09 AM, Yaroslav Halchenko wrote: >> 3. they are identical on other architectures (e.g. amd64) > > To me that is surprising. I would have guessed that the order is the > same on 32 and 64 bit, but something about the precision of > intermediate operations is different. I don't know enough about > amd64 to guess what that could be. Bradley's suggestion seems kind of > reasonable but it's strange that numpy should use intel-80 bit > intermediate values differently for 32 and 64 bit. "numpy" isn't doing anything different between the two. numpy generates the same C code. The C compiler may be generating different machine instructions for that code on different architectures, even closely related ones like i386 and amd64. Different optimization flags and compiler versions will probably also affect this, not just the target architecture. It's possible that those are actually the source of this observation. -- Robert Kern From blake.a.griffith at gmail.com Wed May 1 14:12:46 2013 From: blake.a.griffith at gmail.com (Blake Griffith) Date: Wed, 1 May 2013 13:12:46 -0500 Subject: [Numpy-discussion] GSoC proposal -- Numpy SciPy In-Reply-To: References: Message-ID: Oh wow, I just assumed that `dot` was a ufunc... However, it would still be useful to have ufuncs working well with the sparse package. I don't understand everything that is going on in https://github.com/numpy/numpy/blob/master/numpy/core/src/umath/ufunc_object.c But I assumed that I would be able to add the ability to check for something like _ufunc_override_. I'm not sure where this piece of logic should be inserted, or what the performance implications to NumPy would be... I'm trying to figure this out. But major optimizations to ufuncs is out of the scope of this GSoC. I will look into what can be done about the `dot` function. On Tue, Apr 30, 2013 at 6:53 PM, Nathaniel Smith wrote: > On Tue, Apr 30, 2013 at 4:02 PM, Pauli Virtanen wrote: > > 30.04.2013 22:37, Nathaniel Smith kirjoitti: > > [clip] > >> How do you plan to go about this? The obvious option of just calling > >> scipy.sparse.issparse() on ufunc entry raises some problems, since > >> numpy can't depend on or even import scipy, and we might be reluctant > >> to add such a special case for what's a rather more general problem. > >> OTOH it might be possible to solve the problem in general, e.g., see > >> the prototyped _ufunc_override_ special method in: > >> > >> https://github.com/njsmith/numpyNEP/blob/master/numpyNEP.py > >> > >> but I don't know if you want to get into such a debate within the > >> scope of your GSoC. What were you thinking? > > > > To me it seems that the right thing to do here is the general solution. > > > > Do you see immediate problems in e.g. just enabling something like your > > _ufunc_override_? 
> > Just that we might want to think a bit about the design space before > implementing something. E.g., apparently doing Python attribute lookup > is very expensive -- we recently had a patch to skip > __array_interface__ checks whenever possible -- is adding another such > per-operation overhead ok? I guess we could use similar checks (skip > checking for known types like int/float/ndarray), or only check for > _ufunc_override_ on the class (not the instance) and cache the result > per-class? > > > The easy thing is that there are no backward compatibility problems > > here, since if the magic is missing, the old logic is used. Currently, > > the numpy dot() and ufuncs also most of the time do nothing sensible > > with sparse matrix inputs even though they in some cases return values. > > Which then makes writing generic sparse/dense code more painful than > > just __mul__ being matrix multiplication. > > I agree, but, if the main target is 'dot' then the current > _ufunc_override_ design alone won't do it, since 'dot' is not a > ufunc... > > -n > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From davidmenhur at gmail.com Wed May 1 14:32:42 2013 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Wed, 1 May 2013 20:32:42 +0200 Subject: [Numpy-discussion] GSoC proposal -- Numpy SciPy In-Reply-To: References: Message-ID: On 1 May 2013 20:12, Blake Griffith wrote: > However, it would still be useful to have ufuncs working well with the > sparse package. How are you planning to deal with ufunc(0) != 0? cos(sparse) is actually dense. From raul at virtualmaterials.com Wed May 1 14:39:35 2013 From: raul at virtualmaterials.com (Raul Cota) Date: Wed, 01 May 2013 12:39:35 -0600 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: Message-ID: <51816167.6040607@virtualmaterials.com> An HTML attachment was scrubbed... URL: From blake.a.griffith at gmail.com Wed May 1 15:02:55 2013 From: blake.a.griffith at gmail.com (Blake Griffith) Date: Wed, 1 May 2013 14:02:55 -0500 Subject: [Numpy-discussion] GSoC proposal -- Numpy SciPy In-Reply-To: References: Message-ID: There are several situations where that comes up (Like comparing two sparse matrices A == B) There is a SparseEfficiancyWarning that can be thrown, but the way this should be implemented still needs to be discussed. I will be writing a specification on how ufuncs and ndarrays are handled by the sparse package, the spec can be found here https://github.com/cowlicks/scipy-sparse-ndarray-and-ufunc-spec/blob/master/Spec.markdown. In general, a unary ufunc operating on a sparse matrix should return a sparse matrix. If you really want to do cos(sparse) you will be able to. But if you are just interested in the initially non zero elements should probably do something like: sparse.data = np.cos(sparse.data) On Wed, May 1, 2013 at 1:32 PM, Da?id wrote: > On 1 May 2013 20:12, Blake Griffith wrote: > > However, it would still be useful to have ufuncs working well with the > > sparse package. > > How are you planning to deal with ufunc(0) != 0? cos(sparse) is actually > dense. 
> _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From lists at onerussian.com Wed May 1 15:29:07 2013 From: lists at onerussian.com (Yaroslav Halchenko) Date: Wed, 1 May 2013 15:29:07 -0400 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: References: <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> <20130501124933.GP5140@onerussian.com> <20130501130125.GQ5140@onerussian.com> <20130501160948.GR5140@onerussian.com> Message-ID: <20130501192907.GS5140@onerussian.com> just for completeness... I haven't yet double checked if I have done it correctly but here is the bisected commit: aed9925a9d5fe9a407d0ca2c65cb577116c4d0f1 is the first bad commit commit aed9925a9d5fe9a407d0ca2c65cb577116c4d0f1 Author: Mark Wiebe Date: Tue Aug 2 13:34:13 2011 -0500 ENH: ufunc: Rewrite PyUFunc_Reduce to be more general and easier to adapt to NA masks This generalizes the 'axis' parameter to accept None or a list of axes on which to do the reduction. :040000 040000 2bdd71a1ea60c0dbfe370c77f69724fab28038e1 44f54a15f480ccaf519d10e9c42032de86bd0dca M numpy bisect run success FWIW ( ;-) ): # git describe --tags aed9925a9d5fe9a407d0ca2c65cb577116c4d0f1 v0.3.0-7757-gaed9925 # git describe --tags --contains aed9925a9d5fe9a407d0ca2c65cb577116c4d0f1 v1.7.0b1~377^2~126 On Wed, 01 May 2013, Robert Kern wrote: > On Wed, May 1, 2013 at 6:24 PM, Matthew Brett wrote: > > HI, > > On Wed, May 1, 2013 at 9:09 AM, Yaroslav Halchenko wrote: > >> 3. they are identical on other architectures (e.g. amd64) > > To me that is surprising. I would have guessed that the order is the > > same on 32 and 64 bit, but something about the precision of > > intermediate operations is different. I don't know enough about > > amd64 to guess what that could be. Bradley's suggestion seems kind of > > reasonable but it's strange that numpy should use intel-80 bit > > intermediate values differently for 32 and 64 bit. > "numpy" isn't doing anything different between the two. numpy > generates the same C code. The C compiler may be generating different > machine instructions for that code on different architectures, even > closely related ones like i386 and amd64. Different optimization flags > and compiler versions will probably also affect this, not just the > target architecture. It's possible that those are actually the source > of this observation. -- Yaroslav O. Halchenko, Ph.D. http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Senior Research Associate, Psychological and Brain Sciences Dept. Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From josef.pktd at gmail.com Wed May 1 15:50:15 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 1 May 2013 15:50:15 -0400 Subject: [Numpy-discussion] could anyone check on a 32bit system? 
In-Reply-To: References: <20130501030849.GO5140@onerussian.com> <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> <20130501124933.GP5140@onerussian.com> <20130501130125.GQ5140@onerussian.com> <20130501160948.GR5140@onerussian.com> Message-ID: On Wed, May 1, 2013 at 1:24 PM, Matthew Brett wrote: > HI, > > On Wed, May 1, 2013 at 9:09 AM, Yaroslav Halchenko wrote: >> >> On Wed, 01 May 2013, Nathaniel Smith wrote: >>> > not sure there is anything to fix here. Third-party code relying on a >>> > certain outcome of rounding error is likely incorrect anyway. >> >>> Yeah, seems to just be the standard floating point indeterminism. >>> Using Matthew's numbers and pure Python floats: >> >>> In [9]: (0.49505185 + 0.53529587) + -0.13461665 >>> Out[9]: 0.89573107 >> >>> In [10]: 0.49505185 + (0.53529587 + -0.13461665) >>> Out[10]: 0.8957310700000001 >> >>> In [11]: _9 - _10 >>> Out[11]: -1.1102230246251565e-16 >> >>> Looks like a bug in pymvpa or its test suite to me. >> >> well -- sure thing we will "fix" the unittest to not rely on precise >> correspondence any longer since released 1.7.1 is effected. So it is not >> a matter of me avoiding "fixing" pymvpa's "bug". >> >> I brought it to your attention because >> >> 1. from e.g. >> >> np.sum(data[:, 0]) - np.sum(data, axis=0)[0] >> >> which presumably should be the same order of additions for 0-th column it is >> not clear that they do not have to be identical. > > I agree it's surprising, but I guess it's reasonable for numpy to > reserve the right to add these guys up in whatever order it chooses, > and (in this case) maybe a different order for the axis=None, axis=X > cases. > > Also, y'all may have noticed that it is the presence of the second > vector in the array which causes the difference in the sums of the > first (see my first email in this thread). If this is an order > effect I guess this means that the order of operations in an sum(a, > axis=X) operation depends on the shape of the array. And it looks > like it depends on memory layout: > > In [24]: data = np.array([[ 0.49505185, 0], > ....: [ 0.53529587, 0], > ....: [-0.13461665, 0]]) > > In [25]: np.sum(data[:, 0]) - np.sum(data, axis=0)[0] > Out[25]: 1.1102230246251565e-16 > > In [26]: data_F = np.array(data, order='F') > > In [27]: np.sum(data_F[:, 0]) - np.sum(data_F, axis=0)[0] > Out[27]: 0.0 > > Do we allow the results to be different for different memory layout? Wasn't this the point of some of Mark Wiebe's optimization? As far as I understand he got rid of some of the C bias, and made calculations faster for Fortran contiguous arrays. I rather have speed for my Fortran arrays, then relying on float precision issues, that bite anyway when running on many different kinds of machines. (I usually have to lower some unit test precision during Debian testing of statsmodels.) Josef > >> 2. so far they were identical across many numpy releases >> >> 3. they are identical on other architectures (e.g. amd64) > > To me that is surprising. I would have guessed that the order is the > same on 32 and 64 bit, but something about the precision of > intermediate operations is different. I don't know enough about > amd64 to guess what that could be. Bradley's suggestion seems kind of > reasonable but it's strange that numpy should use intel-80 bit > intermediate values differently for 32 and 64 bit. 
> > Cheers, > > Matthew > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From sebastian at sipsolutions.net Wed May 1 16:01:00 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 01 May 2013 22:01:00 +0200 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: <20130501192907.GS5140@onerussian.com> References: <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> <20130501124933.GP5140@onerussian.com> <20130501130125.GQ5140@onerussian.com> <20130501160948.GR5140@onerussian.com> <20130501192907.GS5140@onerussian.com> Message-ID: <1367438460.2545.22.camel@sebastian-laptop> On Wed, 2013-05-01 at 15:29 -0400, Yaroslav Halchenko wrote: > just for completeness... I haven't yet double checked if I have done it > correctly but here is the bisected commit: > > aed9925a9d5fe9a407d0ca2c65cb577116c4d0f1 is the first bad commit > commit aed9925a9d5fe9a407d0ca2c65cb577116c4d0f1 > Author: Mark Wiebe > Date: Tue Aug 2 13:34:13 2011 -0500 > > ENH: ufunc: Rewrite PyUFunc_Reduce to be more general and easier to adapt to NA masks > > This generalizes the 'axis' parameter to accept None or a list of > axes on which to do the reduction. > > :040000 040000 2bdd71a1ea60c0dbfe370c77f69724fab28038e1 44f54a15f480ccaf519d10e9c42032de86bd0dca M numpy > bisect run success > > FWIW ( ;-) ): > There really is no point discussing here, this has to do with numpy doing iteration order optimization, and you actually *want* this. Lets for a second assume that the old behavior was better, then the next guy is going to ask: "Why is np.add.reduce(array, axis=0) so much slower then reduce(array, np.add)?". This is huge speed improvement by Marks new iterator for reductions over the slow axes, so instead of trying to track "regressions" down, I think the right thing is to say kudos for doing this improvement :). Just my opinion, Sebastian > # git describe --tags aed9925a9d5fe9a407d0ca2c65cb577116c4d0f1 > v0.3.0-7757-gaed9925 > > # git describe --tags --contains aed9925a9d5fe9a407d0ca2c65cb577116c4d0f1 > v1.7.0b1~377^2~126 > > > On Wed, 01 May 2013, Robert Kern wrote: > > > On Wed, May 1, 2013 at 6:24 PM, Matthew Brett wrote: > > > HI, > > > > On Wed, May 1, 2013 at 9:09 AM, Yaroslav Halchenko wrote: > > > >> 3. they are identical on other architectures (e.g. amd64) > > > > To me that is surprising. I would have guessed that the order is the > > > same on 32 and 64 bit, but something about the precision of > > > intermediate operations is different. I don't know enough about > > > amd64 to guess what that could be. Bradley's suggestion seems kind of > > > reasonable but it's strange that numpy should use intel-80 bit > > > intermediate values differently for 32 and 64 bit. > > > "numpy" isn't doing anything different between the two. numpy > > generates the same C code. The C compiler may be generating different > > machine instructions for that code on different architectures, even > > closely related ones like i386 and amd64. Different optimization flags > > and compiler versions will probably also affect this, not just the > > target architecture. It's possible that those are actually the source > > of this observation. From matthew.brett at gmail.com Wed May 1 16:19:39 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 1 May 2013 13:19:39 -0700 Subject: [Numpy-discussion] could anyone check on a 32bit system? 
In-Reply-To: <1367438460.2545.22.camel@sebastian-laptop> References: <1367398813.2545.11.camel@sebastian-laptop> <1367399295.2545.14.camel@sebastian-laptop> <20130501124933.GP5140@onerussian.com> <20130501130125.GQ5140@onerussian.com> <20130501160948.GR5140@onerussian.com> <20130501192907.GS5140@onerussian.com> <1367438460.2545.22.camel@sebastian-laptop> Message-ID: Hi, On Wed, May 1, 2013 at 1:01 PM, Sebastian Berg wrote: > On Wed, 2013-05-01 at 15:29 -0400, Yaroslav Halchenko wrote: >> just for completeness... I haven't yet double checked if I have done it >> correctly but here is the bisected commit: >> >> aed9925a9d5fe9a407d0ca2c65cb577116c4d0f1 is the first bad commit >> commit aed9925a9d5fe9a407d0ca2c65cb577116c4d0f1 >> Author: Mark Wiebe >> Date: Tue Aug 2 13:34:13 2011 -0500 >> >> ENH: ufunc: Rewrite PyUFunc_Reduce to be more general and easier to adapt to NA masks >> >> This generalizes the 'axis' parameter to accept None or a list of >> axes on which to do the reduction. >> >> :040000 040000 2bdd71a1ea60c0dbfe370c77f69724fab28038e1 44f54a15f480ccaf519d10e9c42032de86bd0dca M numpy >> bisect run success >> >> FWIW ( ;-) ): >> > > There really is no point discussing here, this has to do with numpy > doing iteration order optimization, and you actually *want* this. Lets > for a second assume that the old behavior was better, then the next guy > is going to ask: "Why is np.add.reduce(array, axis=0) so much slower > then reduce(array, np.add)?". This is huge speed improvement by Marks > new iterator for reductions over the slow axes, so instead of trying to > track "regressions" down, I think the right thing is to say kudos for > doing this improvement :). I don't believe Yarick meant his bisection to be a criticism, but as an aid to full understanding. Is it an issue that Fortran and C contiguous arrays give different rounding error for the sums? Cheers, Matthew From lists at onerussian.com Wed May 1 16:33:27 2013 From: lists at onerussian.com (Yaroslav Halchenko) Date: Wed, 1 May 2013 16:33:27 -0400 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: References: <20130501130125.GQ5140@onerussian.com> <20130501160948.GR5140@onerussian.com> <20130501192907.GS5140@onerussian.com> <1367438460.2545.22.camel@sebastian-laptop> Message-ID: <20130501203327.GT5140@onerussian.com> On Wed, 01 May 2013, Matthew Brett wrote: > > There really is no point discussing here, this has to do with numpy > > doing iteration order optimization, and you actually *want* this. Lets > > for a second assume that the old behavior was better, then the next guy > > is going to ask: "Why is np.add.reduce(array, axis=0) so much slower > > then reduce(array, np.add)?". This is huge speed improvement by Marks > > new iterator for reductions over the slow axes, so instead of trying to > > track "regressions" down, I think the right thing is to say kudos for > > doing this improvement :). > I don't believe Yarick meant his bisection to be a criticism, but as > an aid to full understanding. Exactly right, Matthew -- thank you! And kudos to Mark! N.B. I am generally furry and kind, not fuzzy and evil -- Yaroslav O. Halchenko, Ph.D. http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Senior Research Associate, Psychological and Brain Sciences Dept. 
Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From lists at onerussian.com Wed May 1 16:37:07 2013 From: lists at onerussian.com (Yaroslav Halchenko) Date: Wed, 1 May 2013 16:37:07 -0400 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: <1367438460.2545.22.camel@sebastian-laptop> References: <20130501124933.GP5140@onerussian.com> <20130501130125.GQ5140@onerussian.com> <20130501160948.GR5140@onerussian.com> <20130501192907.GS5140@onerussian.com> <1367438460.2545.22.camel@sebastian-laptop> Message-ID: <20130501203707.GU5140@onerussian.com> On Wed, 01 May 2013, Sebastian Berg wrote: > There really is no point discussing here, this has to do with numpy > doing iteration order optimization, and you actually *want* this. Lets > for a second assume that the old behavior was better, then the next guy > is going to ask: "Why is np.add.reduce(array, axis=0) so much slower > then reduce(array, np.add)?". This is huge speed improvement by Marks > new iterator for reductions over the slow axes... btw -- is there something like panda's vbench for numpy? i.e. where it would be possible to track/visualize such performance improvements/hits? -- Yaroslav O. Halchenko, Ph.D. http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Senior Research Associate, Psychological and Brain Sciences Dept. Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From sebastian at sipsolutions.net Wed May 1 16:48:24 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 01 May 2013 22:48:24 +0200 Subject: [Numpy-discussion] could anyone check on a 32bit system? In-Reply-To: <20130501203707.GU5140@onerussian.com> References: <20130501124933.GP5140@onerussian.com> <20130501130125.GQ5140@onerussian.com> <20130501160948.GR5140@onerussian.com> <20130501192907.GS5140@onerussian.com> <1367438460.2545.22.camel@sebastian-laptop> <20130501203707.GU5140@onerussian.com> Message-ID: <1367441304.2545.27.camel@sebastian-laptop> On Wed, 2013-05-01 at 16:37 -0400, Yaroslav Halchenko wrote: > On Wed, 01 May 2013, Sebastian Berg wrote: > > > There really is no point discussing here, this has to do with numpy > > doing iteration order optimization, and you actually *want* this. Lets > > for a second assume that the old behavior was better, then the next guy > > is going to ask: "Why is np.add.reduce(array, axis=0) so much slower > > then reduce(array, np.add)?". This is huge speed improvement by Marks > > new iterator for reductions over the slow axes... > > btw -- is there something like panda's vbench for numpy? i.e. where > it would be possible to track/visualize such performance > improvements/hits? > Sorry if it seemed harsh, but I only skimmed mails and it seemed a bit like an obvious piece was missing... There are no benchmark tests I am aware of. You can try: a = np.random.random((1000, 1000)) and then time a.sum(1) and a.sum(0). On 1.7 the fast axis (1) is only slightly faster than the sum over the slow axis. On earlier numpy versions you will probably see something like half the speed for the slow axis (only got ancient or 1.7 numpy right now, so reluctant to give exact timings). - Sebastian
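To make the suggested check concrete, a sketch of that timing experiment (illustrative only -- the numbers depend on numpy version, compiler, and hardware, so none are quoted here):

    import timeit

    setup = "import numpy as np; a = np.random.random((1000, 1000))"
    # Reduction along the fast (contiguous) axis vs. the slow axis.
    fast = timeit.timeit("a.sum(1)", setup=setup, number=100)
    slow = timeit.timeit("a.sum(0)", setup=setup, number=100)
    print("sum over fast axis (1): %.4f s" % fast)
    print("sum over slow axis (0): %.4f s" % slow)

On a numpy with the rewritten reduction iterator the two times should come out close; on older versions the slow-axis sum is expected to lag noticeably.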
- Sebastian From ben.root at ou.edu Wed May 1 21:10:11 2013 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 1 May 2013 21:10:11 -0400 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: So, to summarize the thread so far: Consensus: np.nanmean() np.nanstd() np.minmax() np.argminmax() Vague Consensus: np.sincos() No Consensus (possibly out of scope for this topic): Better constructors for complex types I can probably whip up the PR for the nanmean() and nanstd(), and can certainly help out with the minmax funcs. Cheers! Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From mmick003 at ucr.edu Wed May 1 21:42:57 2013 From: mmick003 at ucr.edu (Mark Micklich) Date: Wed, 1 May 2013 18:42:57 -0700 Subject: [Numpy-discussion] numPy not imported into Python Message-ID: Hello -- After installing numPy, I'm getting the following error message when attempting to import numarray: ImportError: No module named numarray I do have numPy installed. I'm running under Lubuntu 12.10 and the Spyder 2.1.10 IDE. I'm fairly new to developing Python on Linux. I assume there is some path issue, but I'm not clear where to start. If numPy is installed, how to I point Spyder to the numPy modules so I can get numarray to work? Thanks, Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From chanley at gmail.com Wed May 1 21:47:37 2013 From: chanley at gmail.com (Christopher Hanley) Date: Wed, 1 May 2013 21:47:37 -0400 Subject: [Numpy-discussion] numPy not imported into Python In-Reply-To: References: Message-ID: Mark, Numpy is not numarray. Numarray is an older package that has long since been replaced by numpy. You should only use numpy in any development from now on. Chris On Wednesday, May 1, 2013, Mark Micklich wrote: > Hello -- After installing numPy, I'm getting the following error message > when attempting to import numarray: > > ImportError: No module named numarray > > I do have numPy installed. I'm running under Lubuntu 12.10 and the Spyder > 2.1.10 IDE. I'm fairly new to developing Python on Linux. I assume there > is some path issue, but I'm not clear where to start. If numPy is > installed, how to I point Spyder to the numPy modules so I can get numarray > to work? > > Thanks, > Mark > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mmick003 at ucr.edu Wed May 1 21:52:28 2013 From: mmick003 at ucr.edu (Mark Micklich) Date: Wed, 1 May 2013 18:52:28 -0700 Subject: [Numpy-discussion] numPy not imported into Python In-Reply-To: References: Message-ID: Oh! I imported numpy and that worked. I probably should have mentioned I'm working through the 2005 "*Numerical Methods in Engineering with Python*" textbook from the school library. The examples are still good, but the Import statement used in the book is obsolete. Thanks for the quick reply. Mark On Wed, May 1, 2013 at 6:47 PM, Christopher Hanley wrote: > Mark, > > Numpy is not numarray. Numarray is an older package that has long since > been replaced by numpy. You should only use numpy in any development from > now on. > > Chris > > > On Wednesday, May 1, 2013, Mark Micklich wrote: > >> Hello -- After installing numPy, I'm getting the following error message >> when attempting to import numarray: >> >> ImportError: No module named numarray >> >> I do have numPy installed. I'm running under Lubuntu 12.10 and the Spyder >> 2.1.10 IDE. 
I'm fairly new to developing Python on Linux. I assume there >> is some path issue, but I'm not clear where to start. If numPy is >> installed, how to I point Spyder to the numPy modules so I can get numarray >> to work? >> >> Thanks, >> Mark >> >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed May 1 23:12:31 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 1 May 2013 21:12:31 -0600 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: On Wed, May 1, 2013 at 7:10 PM, Benjamin Root wrote: > So, to summarize the thread so far: > > Consensus: > np.nanmean() > np.nanstd() > np.minmax() > np.argminmax() > > Vague Consensus: > np.sincos() > > If the return of sincos (cossin?) is an array, then it could be reshaped to be exp(1j*x), which together with exp(2*pi*1j*x) would cover some pretty common cases. No Consensus (possibly out of scope for this topic): > Better constructors for complex types > > I can probably whip up the PR for the nanmean() and nanstd(), and can > certainly help out with the minmax funcs. > > Cheers! > Ben Root > > Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed May 1 23:28:45 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 1 May 2013 21:28:45 -0600 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: Message-ID: On Tue, Apr 30, 2013 at 8:26 PM, Arink Verma wrote: > Hi all! > I have written my application[1] for *Performance parity between numpy > arrays and Python scalars[2]. *It would be a great help if you view it. > Does it look achievable and deliverable according to the project. > > [1] > http://www.google-melange.com/gsoc/proposal/review/google/gsoc2013/arinkverma/40001# > [2] http://projects.scipy.org/scipy/wiki/SummerofCodeIdeas > > > Hi Arink, Have you already done some profiling? That could be tricky at the C level. I'm also curious about the hash table, what gets hashed and where do you get the improved efficiency? Admittedly, the way in which ufuncs currently detect scalars is a bit heavy weight and a fast path for certain inputs values could help. Is that what you are doing? As to the schedule, I suspect that it may be a bit ambitious but I don't see that as fatal by any means. Identifying bottlenecks and experimenting with solutions would be useful work. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From arinkverma at iitrpr.ac.in Thu May 2 00:25:12 2013 From: arinkverma at iitrpr.ac.in (Arink Verma) Date: Thu, 2 May 2013 09:55:12 +0530 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: <51816167.6040607@virtualmaterials.com> References: <51816167.6040607@virtualmaterials.com> Message-ID: @Raul I will pull new version, and try to include that also. What is wrong with macros for inline function? 
Yes, the time for the ufunc is reduced to almost half. For the lookup table, I am generating a key from the argument types and returning the appropriate value.[1] @Chuck Yes, I did some profiling with oprofile for "python -m timeit -n 1000000 -s 'import numpy as np;x = np.asarray(1.0)' 'x+x'". See the data sheet.[2] Every time a ufunc is invoked, the code has to check every single possible data type (bool, int, double, etc.) until it finds the best match for the data that the operation is being performed on. For scalars, we can fetch the best match from a pre-populated table. At present the implementation is not well structured and supports only addition for int+int and float+float. [1] https://github.com/arinkverma/numpy/commit/e2d8de7e7b643c7a76ff92bc1219847f9328aad0 [2] https://docs.google.com/spreadsheet/ccc?key=0AnPqyp8kuQw0dG1hdjZiazE2dGtTY1JXVGFsWEEzbXc#gid=0 On Thu, May 2, 2013 at 12:09 AM, Raul Cota wrote: > > It is great that you are looking into this !! We are currently running on > a fork of numpy because we really need these performance improvements. > > > I noticed that, as suggested, you took from the pull request I posted a > while ago for the > PyObject_GetAttrString > PyObject_GetBuffer > > issues. > > ( https://github.com/raulcota/numpy ) > > > A couple of comments on that, > > - Seems like you did not grab the latest revisions of that code that I > posted that fix the style of the comments and 'attempt' to fix an issue > reported about Python 3. I say 'attempt' because I thought it was fixed > but someone mentioned this was not correct. > > - There was also some feedback from Nathaniel about not liking the macros > and siding with inline functions. I have not gotten around to it, but it > would be nice if you jump on that boat. > > On the hash lookup table, I haven't looked at the implementation but the > speed up is remarkable. > > > Cheers ! > > Raul > > > > On 30/04/2013 8:26 PM, Arink Verma wrote: > > Hi all! > I have written my application[1] for *Performance parity between numpy > arrays and Python scalars[2]. *It would be a great help if you view it. > Does it look achievable and deliverable according to the project. > > [1] > http://www.google-melange.com/gsoc/proposal/review/google/gsoc2013/arinkverma/40001# > [2] http://projects.scipy.org/scipy/wiki/SummerofCodeIdeas > > > -- > Arink > Computer Science and Engineering > Indian Institute of Technology Ropar > www.arinkverma.in > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Arink Computer Science and Engineering Indian Institute of Technology Ropar www.arinkverma.in -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Thu May 2 00:42:16 2013 From: cournape at gmail.com (David Cournapeau) Date: Thu, 2 May 2013 05:42:16 +0100 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: On Thu, May 2, 2013 at 5:25 AM, Arink Verma wrote: > @Raul > I will pull new version, and try to include that also. > What is wrong with macros for inline function?
> Yes, time for ufunc is reduced to almost half, for lookup table, I am > generating key from argument type and returning the appropriated value.[1] > > @Chuck > Yes I did some profiling with oprofiler for "python -m timeit -n 1000000 -s > 'import numpy as np;x = np.asarray(1.0)' 'x+x'". see data sheet.[2] > > As every time a ufunc is invoked, the code has to check every single data > type possible (bool, int, double, etc) until if finds the best match for the > data that the operation is being performed on. In scalar, we can send best > match, from pre-populated table. At present the implementation is not > well-structured and support only addition for int+int and float+float. [1] You are pointing out something that may well be the main difficulty: the code there is messy, and we need to ensure that optimisations don't preclude later extensions (especially with regard to new dtype addition). David From ben.root at ou.edu Thu May 2 01:01:14 2013 From: ben.root at ou.edu (Benjamin Root) Date: Thu, 2 May 2013 01:01:14 -0400 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: I have created a PR for the first two (and got np.nanvar() for free). https://github.com/numpy/numpy/pull/3297 Cheers! Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From arinkverma at iitrpr.ac.in Thu May 2 06:26:58 2013 From: arinkverma at iitrpr.ac.in (Arink Verma) Date: Thu, 2 May 2013 15:56:58 +0530 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: Yes, we need to ensure that.. Code generator can be made, which can create code for table of registered dtype during build time itself. Also at present there lot of duplicate code that attempts to work around these slow paths, simplification of that code is also required. On Thu, May 2, 2013 at 10:12 AM, David Cournapeau wrote: > On Thu, May 2, 2013 at 5:25 AM, Arink Verma > wrote: > > @Raul > > I will pull new version, and try to include that also. > > What is wrong with macros for inline function? > > Yes, time for ufunc is reduced to almost half, for lookup table, I am > > generating key from argument type and returning the appropriated > value.[1] > > > > @Chuck > > Yes I did some profiling with oprofiler for "python -m timeit -n 1000000 > -s > > 'import numpy as np;x = np.asarray(1.0)' 'x+x'". see data sheet.[2] > > > > As every time a ufunc is invoked, the code has to check every single data > > type possible (bool, int, double, etc) until if finds the best match for > the > > data that the operation is being performed on. In scalar, we can send > best > > match, from pre-populated table. At present the implementation is not > > well-structured and support only addition for int+int and float+float. > [1] > > You are pointing out something that may well be the main difficulty: > the code there is messy, and we need to ensure that optimisations > don't preclude later extensions (especially with regard to new dtype > addition). > > David > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Arink Computer Science and Engineering Indian Institute of Technology Ropar www.arinkverma.in -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From njs at pobox.com Thu May 2 07:03:45 2013 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 2 May 2013 07:03:45 -0400 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: On 1 May 2013 23:12, "Charles R Harris" wrote: > > > > On Wed, May 1, 2013 at 7:10 PM, Benjamin Root wrote: >> >> So, to summarize the thread so far: >> >> Consensus: >> np.nanmean() >> np.nanstd() >> np.minmax() >> np.argminmax() >> >> Vague Consensus: >> np.sincos() >> > > If the return of sincos (cossin?) is an array, then it could be reshaped to be exp(1j*x), which together with exp(2*pi*1j*x) would cover some pretty common cases. Ufuncs already have some convention for what to do with multiple output arguments, right? Presumably whatever they do is what sincos should do. (And minmax/argminmax likewise, for consistency, even if they aren't ufuncs. Though they could be generalized ufuncs, or minmax could be minimummaximum.reduce.) I haven't checked, but I assume that what multiple output argument ufuncs do is to return a tuple. You can't use a single array in the general case, because the multiple output types might not be homogenous. -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Thu May 2 07:15:47 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 02 May 2013 13:15:47 +0200 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: <1367493347.3046.3.camel@sebastian-laptop> On Thu, 2013-05-02 at 07:03 -0400, Nathaniel Smith wrote: > On 1 May 2013 23:12, "Charles R Harris" > wrote: > > > > > > > > On Wed, May 1, 2013 at 7:10 PM, Benjamin Root > wrote: > >> > >> So, to summarize the thread so far: > >> > >> Consensus: > >> np.nanmean() > >> np.nanstd() > >> np.minmax() > >> np.argminmax() > >> > >> Vague Consensus: > >> np.sincos() > >> > > > > If the return of sincos (cossin?) is an array, then it could be > reshaped to be exp(1j*x), which together with exp(2*pi*1j*x) would > cover some pretty common cases. > > Ufuncs already have some convention for what to do with multiple > output arguments, right? Presumably whatever they do is what sincos > should do. (And minmax/argminmax likewise, for consistency, even if > they aren't ufuncs. Though they could be generalized ufuncs, or minmax > could be minimummaximum.reduce.) > I think for sincos it makes sense, for an example a ufunc with multiple arguments is `np.modf`. But I doubt reductions are automatically defined for these, so minmax probably needs to be a generalized ufunc (can you have an axis argument with those?). - Sebastian > I haven't checked, but I assume that what multiple output argument > ufuncs do is to return a tuple. You can't use a single array in the > general case, because the multiple output types might not be > homogenous. 
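To illustrate the existing convention: a multi-output ufunc such as np.modf already returns a tuple of arrays, and a Python-level fallback for the proposed np.minmax (hypothetical here, and making two passes where the point of a C implementation would be a single pass) could look like:

import numpy as np

fracs, ints = np.modf(np.linspace(0, 1, 5))   # tuple of two arrays

def minmax(a, axis=None):
    # hypothetical np.minmax(); a real version would traverse a once
    return np.min(a, axis=axis), np.max(a, axis=axis)

lo, hi = minmax(np.arange(12).reshape(3, 4), axis=1)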
> > -n > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From cournape at gmail.com Thu May 2 08:42:13 2013 From: cournape at gmail.com (David Cournapeau) Date: Thu, 2 May 2013 13:42:13 +0100 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: On Thu, May 2, 2013 at 11:26 AM, Arink Verma wrote: > Yes, we need to ensure that.. > Code generator can be made, which can create code for table of registered > dtype during build time itself. So dtypes can be registered at runtime as well. In an ideal world, 'native' numpy types would not be special cases. This is too big for a GSoC, but we should make sure we don't make it worse. > > Also at present there lot of duplicate code that attempts to work around > these slow paths, simplification of that code is also required. That there is room for consolidation would be an understatement :) David > > On Thu, May 2, 2013 at 10:12 AM, David Cournapeau > wrote: >> >> On Thu, May 2, 2013 at 5:25 AM, Arink Verma >> wrote: >> > @Raul >> > I will pull new version, and try to include that also. >> > What is wrong with macros for inline function? >> > Yes, time for ufunc is reduced to almost half, for lookup table, I am >> > generating key from argument type and returning the appropriated >> > value.[1] >> > >> > @Chuck >> > Yes I did some profiling with oprofiler for "python -m timeit -n 1000000 >> > -s >> > 'import numpy as np;x = np.asarray(1.0)' 'x+x'". see data sheet.[2] >> > >> > As every time a ufunc is invoked, the code has to check every single >> > data >> > type possible (bool, int, double, etc) until if finds the best match for >> > the >> > data that the operation is being performed on. In scalar, we can send >> > best >> > match, from pre-populated table. At present the implementation is not >> > well-structured and support only addition for int+int and float+float. >> > [1] >> >> You are pointing out something that may well be the main difficulty: >> the code there is messy, and we need to ensure that optimisations >> don't preclude later extensions (especially with regard to new dtype >> addition). >> >> David >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > -- > Arink > Computer Science and Engineering > Indian Institute of Technology Ropar > www.arinkverma.in > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From njs at pobox.com Thu May 2 09:14:32 2013 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 2 May 2013 09:14:32 -0400 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: On Thu, May 2, 2013 at 6:26 AM, Arink Verma wrote: > Yes, we need to ensure that.. > Code generator can be made, which can create code for table of registered > dtype during build time itself. I'd probably just generate it at run-time on an as-needed basis. (I.e., use the full lookup logic the first time, then save the result.) New dtypes can be registered, which will mean the tables need to change size at runtime anyway. 
If someone does some strange thing like add float16's and float64's, we can do the lookup to determine that this should be handled by the float64/float64 loop, and then store that information so that the next time it's fast (but we probably don't want to be calculating all combinations at build-time, which would require running the full type resolution machinery, esp. since it wouldn't really bring any benefits that I can see). * Re: the profiling, I wrote a full oprofile->callgrind format script years ago: http://vorpus.org/~njs/op2calltree.py Haven't used it in years either but neither oprofile nor kcachegrind are terribly fast-moving projects so it's probably still working, or could be made so without much work. Or easier is to use the gperftools CPU profiler: https://gperftools.googlecode.com/svn/trunk/doc/cpuprofile.html Instead of linking to it at build time, you can just use ctypes: In [7]: profiler = ctypes.CDLL("libprofiler.so.0") In [8]: profiler.ProfilerStart("some-file-name-here") Out[8]: 1 In [9]: # do stuff here In [10]: profiler.ProfilerStop() PROFILE: interrupts/evictions/bytes = 2/0/592 Out[10]: 46 Then all the pprof analysis tools are available as described on that webpage. * Please don't trust those random suggestions for possible improvements I threw out when writing the original description. Probably it's true that FP flag checking and ufunc type lookup are expensive, but one should fix what the profile says to fix, not what someone guessed might be good to fix based on a few minutes thought. * Instead of making a giant table of everything that needs to be done to make stuff fast first, before writing any code, I'd suggest picking one operation, figuring out what change would be the biggest improvement for it, making that change, checking that it worked, and then repeat until that operation is really fast. Then if there's still time pick another operation. Producing a giant todo list isn't very productive by itself if there's no time then to actually do all the things on the list :-). * Did you notice this line on the requirements page? "Having your first pull request merged before the GSoC application deadline (May 3) is required for your application to be accepted." -n From cournape at gmail.com Thu May 2 09:25:28 2013 From: cournape at gmail.com (David Cournapeau) Date: Thu, 2 May 2013 14:25:28 +0100 Subject: [Numpy-discussion] Profiling (was GSoC : Performance parity between numpy arrays and Python scalars) Message-ID: > * Re: the profiling, I wrote a full oprofile->callgrind format script > years ago: http://vorpus.org/~njs/op2calltree.py > Haven't used it in years either but neither oprofile nor kcachegrind > are terribly fast-moving projects so it's probably still working, or > could be made so without much work. > Or easier is to use the gperftools CPU profiler: > https://gperftools.googlecode.com/svn/trunk/doc/cpuprofile.html I don't have experience with gperftools, but on recent linux kernels, you can also use perf, which can't be made easier to use (no runtime support needed), but you need a 'recent' kernel http://indico.cern.ch/getFile.py/access?contribId=20&sessionId=4&resId=0&materialId=slides&confId=141309 I am hoping to talk a bit about those for our diving into numpy c code tutorial in June, what's the + of gperf in your opinion ? 
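A Python-level sketch of the run-time lookup caching described above (purely illustrative, with made-up names; the real type resolution and dispatch live in C):

import numpy as np

_loop_cache = {}

def find_loop(op, *dtypes):
    # key on the operation plus the argument dtypes
    key = (op,) + tuple(np.dtype(d).char for d in dtypes)
    try:
        return _loop_cache[key]          # fast path once seen
    except KeyError:
        # slow path: full type resolution, e.g. float16 + float64
        # resolves to the float64/float64 loop
        loop = np.result_type(*dtypes)
        _loop_cache[key] = loop
        return loop

find_loop('add', np.float16, np.float64)  # resolves, then caches
find_loop('add', np.float16, np.float64)  # now just a dict hit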
David From robert.kern at gmail.com Thu May 2 09:28:55 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 2 May 2013 14:28:55 +0100 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: On Thu, May 2, 2013 at 12:03 PM, Nathaniel Smith wrote: > On 1 May 2013 23:12, "Charles R Harris" wrote: >> >> On Wed, May 1, 2013 at 7:10 PM, Benjamin Root wrote: >>> >>> So, to summarize the thread so far: >>> >>> Consensus: >>> np.nanmean() >>> np.nanstd() >>> np.minmax() >>> np.argminmax() >>> >>> Vague Consensus: >>> np.sincos() >>> >> >> If the return of sincos (cossin?) is an array, then it could be reshaped >> to be exp(1j*x), which together with exp(2*pi*1j*x) would cover some pretty >> common cases. It couldn't be a mere reshape, since the complex dtype requires the real and imag components to be adjacent to each other. They wouldn't be so if sincos's return type is an array (nor even the cossin alternative). It always requires a memory copy (except in the "who cares?" case of a scalar). Composition with an efficient np.tocomplex(real, imag) implementation would cover those use cases whether sincos returns tuples or arrays. > Ufuncs already have some convention for what to do with multiple output > arguments, right? Presumably whatever they do is what sincos should do. (And > minmax/argminmax likewise, for consistency, even if they aren't ufuncs. > Though they could be generalized ufuncs, or minmax could be > minimummaximum.reduce.) > > I haven't checked, but I assume that what multiple output argument ufuncs do > is to return a tuple. You can't use a single array in the general case, > because the multiple output types might not be homogenous. Correct. [~] |19> np.modf.nout 2 [~] |20> np.modf(np.linspace(0, 1, 5)) (array([ 0. , 0.25, 0.5 , 0.75, 0. ]), array([ 0., 0., 0., 0., 1.])) -- Robert Kern From charlesr.harris at gmail.com Thu May 2 09:38:55 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 2 May 2013 07:38:55 -0600 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: On Thu, May 2, 2013 at 7:28 AM, Robert Kern wrote: > On Thu, May 2, 2013 at 12:03 PM, Nathaniel Smith wrote: > > On 1 May 2013 23:12, "Charles R Harris" > wrote: > >> > >> On Wed, May 1, 2013 at 7:10 PM, Benjamin Root wrote: > >>> > >>> So, to summarize the thread so far: > >>> > >>> Consensus: > >>> np.nanmean() > >>> np.nanstd() > >>> np.minmax() > >>> np.argminmax() > >>> > >>> Vague Consensus: > >>> np.sincos() > >>> > >> > >> If the return of sincos (cossin?) is an array, then it could be reshaped > >> to be exp(1j*x), which together with exp(2*pi*1j*x) would cover some > pretty > >> common cases. > > It couldn't be a mere reshape, since the complex dtype requires the > real and imag components to be adjacent to each other. They wouldn't > be so if sincos's return type is an array (nor even the cossin > alternative). It always requires a memory copy (except in the "who > cares?" case of a scalar). Composition with an efficient > np.tocomplex(real, imag) implementation would cover those use cases > whether sincos returns tuples or arrays. > I would assume the basic return type would be complex, i.e., the cos/sin adjacent. The cos/sin parts would then be real/imag views into the array. > > > Ufuncs already have some convention for what to do with multiple output > > arguments, right? 
Presumably whatever they do is what sincos should do. > (And > > minmax/argminmax likewise, for consistency, even if they aren't ufuncs. > > Though they could be generalized ufuncs, or minmax could be > > minimummaximum.reduce.) > > > > I haven't checked, but I assume that what multiple output argument > ufuncs do > > is to return a tuple. You can't use a single array in the general case, > > because the multiple output types might not be homogenous. > > Correct. > > [~] > |19> np.modf.nout > 2 > > [~] > |20> np.modf(np.linspace(0, 1, 5)) > (array([ 0. , 0.25, 0.5 , 0.75, 0. ]), array([ 0., 0., 0., 0., > 1.])) > > -- > Robert Kern > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Thu May 2 09:47:05 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 2 May 2013 14:47:05 +0100 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: On Thu, May 2, 2013 at 2:38 PM, Charles R Harris wrote: > > On Thu, May 2, 2013 at 7:28 AM, Robert Kern wrote: >> >> On Thu, May 2, 2013 at 12:03 PM, Nathaniel Smith wrote: >> > On 1 May 2013 23:12, "Charles R Harris" >> > wrote: >> >> >> >> On Wed, May 1, 2013 at 7:10 PM, Benjamin Root wrote: >> >>> >> >>> So, to summarize the thread so far: >> >>> >> >>> Consensus: >> >>> np.nanmean() >> >>> np.nanstd() >> >>> np.minmax() >> >>> np.argminmax() >> >>> >> >>> Vague Consensus: >> >>> np.sincos() >> >>> >> >> >> >> If the return of sincos (cossin?) is an array, then it could be >> >> reshaped >> >> to be exp(1j*x), which together with exp(2*pi*1j*x) would cover some >> >> pretty >> >> common cases. >> >> It couldn't be a mere reshape, since the complex dtype requires the >> real and imag components to be adjacent to each other. They wouldn't >> be so if sincos's return type is an array (nor even the cossin >> alternative). It always requires a memory copy (except in the "who >> cares?" case of a scalar). Composition with an efficient >> np.tocomplex(real, imag) implementation would cover those use cases >> whether sincos returns tuples or arrays. > > I would assume the basic return type would be complex, i.e., the cos/sin > adjacent. The cos/sin parts would then be real/imag views into the array. You mean that the implementation of cossin (to make things easier on ourselves) would create an (N,2) contiguous array, fill it with the cos and sin results, then reshape it to return the expected (2,N) array (or 2-tuple)? How would the user then reconstitute the exp(1j*x) result efficiently? If the use case is that important, I would just make exp(1j*x) into its own ufunc and have it use the C sincos() function internally. 
-- Robert Kern From njs at pobox.com Thu May 2 09:58:52 2013 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 2 May 2013 09:58:52 -0400 Subject: [Numpy-discussion] Profiling (was GSoC : Performance parity between numpy arrays and Python scalars) In-Reply-To: References: Message-ID: On Thu, May 2, 2013 at 9:25 AM, David Cournapeau wrote: >> * Re: the profiling, I wrote a full oprofile->callgrind format script >> years ago: http://vorpus.org/~njs/op2calltree.py >> Haven't used it in years either but neither oprofile nor kcachegrind >> are terribly fast-moving projects so it's probably still working, or >> could be made so without much work. >> Or easier is to use the gperftools CPU profiler: >> https://gperftools.googlecode.com/svn/trunk/doc/cpuprofile.html > > I don't have experience with gperftools, but on recent linux kernels, > you can also use perf, which can't be made easier to use (no runtime > support needed), but you need a 'recent' kernel > > http://indico.cern.ch/getFile.py/access?contribId=20&sessionId=4&resId=0&materialId=slides&confId=141309 > > I am hoping to talk a bit about those for our diving into numpy c code > tutorial in June, what's the + of gperf in your opinion ? For what I've used profiling for, THE key feature is proper callgraph support ("show me the *total* time spent in each function, including callees"). Otherwise, silly example, let's say you have a bug where you wrote: func1() { for (i = 0; i < 10000000; i++) foo = add(foo, bar[0]) } Obviously this is a waste of time, since you're actually performing the same operation over and over. Many profilers, given this, will tell you that all the time is spent in 'add', which is useless, because you don't want to speed up 'add', you want to speed up 'func1' (probably by not calling 'add' so many times!). If you have relatively flat code like most kernel code this isn't an issue, but I generally don't... perf is a fabulous framework and doesn't have any way to get full callgraph information out so IME it's been useless. They have reporting modes that claim to (like some "fractal" thing?) but AFAI been able to tell from docs/googling/mailing lists, there is nobody who understands how to interpret this output except the people who wrote it. Really a shame that it falls down in the last mile like that, hopefully they will fix this soon. callgrind has the *fabulous* kcachegrind front-end, but it only measures memory access performance on a simulated machine, which is very useful sometimes (if you're trying to optimize cache locality), but there's no guarantee that the bottlenecks on its simulated machine are the same as the bottlenecks on your real machine. oprofile is getting long in the tooth (superseded by perf), and it's built-in reporting tools are merely ok, but it does have full callgraph information and with the script above you can get the output into kcachegrind. perftools don't have all the fancy features of the in-kernel options, but they're trivial to use, and their reporting options are genuinely useful (though not quite as awesome as kcachegrind). So while in theory it's the least whizz-bang awesome of all of these options, in practice I find it the most useful. (Also, beware of terminology collision, "gperf" is something else again...) 
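In cProfile terms, the inclusive-versus-self distinction above is the cumtime/tottime split; a toy version of the func1/add example (assuming nothing beyond the stdlib):

import cProfile

def add(a, b):
    return a + b

def func1():
    foo = 0
    for i in range(1000000):
        foo = add(foo, 1)
    return foo

# sort='cumulative' shows inclusive time: func1 carries the full cost
# even though most of the self time is spent inside add()
cProfile.run('func1()', sort='cumulative')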
-n From arinkverma at iitrpr.ac.in Thu May 2 10:18:39 2013 From: arinkverma at iitrpr.ac.in (Arink Verma) Date: Thu, 2 May 2013 19:48:39 +0530 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: Updating table at runtime, seems a good option. But then we have maintain separate file for caching and storing. I will see, op2calltree.py and gperftools both. >* Instead of making a giant table of everything that needs to be done >to make stuff fast first, before writing any code, I'd suggest picking >one operation, figuring out what change would be the biggest >improvement for it, making that change, checking that it worked, and >then repeat until that operation is really fast. Working like that only, firstly optimizing sum operation specifically for int scalar then will move to other. >* Did you notice this line on the requirements page? "Having your >first pull request merged before the GSoC application deadline (May 3) >is required for your application to be accepted." Thanks for reminding! I was too busy with my university exams, I forgot to do that. Does the merge has to be related to gsoc project, or any other improvement can be consider? On Thu, May 2, 2013 at 6:44 PM, Nathaniel Smith wrote: > On Thu, May 2, 2013 at 6:26 AM, Arink Verma > wrote: > > Yes, we need to ensure that.. > > Code generator can be made, which can create code for table of registered > > dtype during build time itself. > > I'd probably just generate it at run-time on an as-needed basis. > (I.e., use the full lookup logic the first time, then save the > result.) New dtypes can be registered, which will mean the tables need > to change size at runtime anyway. If someone does some strange thing > like add float16's and float64's, we can do the lookup to determine > that this should be handled by the float64/float64 loop, and then > store that information so that the next time it's fast (but we > probably don't want to be calculating all combinations at build-time, > which would require running the full type resolution machinery, esp. > since it wouldn't really bring any benefits that I can see). > > * Re: the profiling, I wrote a full oprofile->callgrind format script > years ago: http://vorpus.org/~njs/op2calltree.py > Haven't used it in years either but neither oprofile nor kcachegrind > are terribly fast-moving projects so it's probably still working, or > could be made so without much work. > Or easier is to use the gperftools CPU profiler: > https://gperftools.googlecode.com/svn/trunk/doc/cpuprofile.html > > Instead of linking to it at build time, you can just use ctypes: > > In [7]: profiler = ctypes.CDLL("libprofiler.so.0") > > In [8]: profiler.ProfilerStart("some-file-name-here") > Out[8]: 1 > > In [9]: # do stuff here > > In [10]: profiler.ProfilerStop() > PROFILE: interrupts/evictions/bytes = 2/0/592 > Out[10]: 46 > > Then all the pprof analysis tools are available as described on that > webpage. > > * Please don't trust those random suggestions for possible > improvements I threw out when writing the original description. > Probably it's true that FP flag checking and ufunc type lookup are > expensive, but one should fix what the profile says to fix, not what > someone guessed might be good to fix based on a few minutes thought. 
> > * Instead of making a giant table of everything that needs to be done > to make stuff fast first, before writing any code, I'd suggest picking > one operation, figuring out what change would be the biggest > improvement for it, making that change, checking that it worked, and > then repeat until that operation is really fast. Then if there's still > time pick another operation. Producing a giant todo list isn't very > productive by itself if there's no time then to actually do all the > things on the list :-). > > * Did you notice this line on the requirements page? "Having your > first pull request merged before the GSoC application deadline (May 3) > is required for your application to be accepted." > > -n > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Arink Computer Science and Engineering Indian Institute of Technology Ropar www.arinkverma.in -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu May 2 10:28:58 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 2 May 2013 08:28:58 -0600 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: On Thu, May 2, 2013 at 7:47 AM, Robert Kern wrote: > On Thu, May 2, 2013 at 2:38 PM, Charles R Harris > wrote: > > > > On Thu, May 2, 2013 at 7:28 AM, Robert Kern > wrote: > >> > >> On Thu, May 2, 2013 at 12:03 PM, Nathaniel Smith wrote: > >> > On 1 May 2013 23:12, "Charles R Harris" > >> > wrote: > >> >> > >> >> On Wed, May 1, 2013 at 7:10 PM, Benjamin Root > wrote: > >> >>> > >> >>> So, to summarize the thread so far: > >> >>> > >> >>> Consensus: > >> >>> np.nanmean() > >> >>> np.nanstd() > >> >>> np.minmax() > >> >>> np.argminmax() > >> >>> > >> >>> Vague Consensus: > >> >>> np.sincos() > >> >>> > >> >> > >> >> If the return of sincos (cossin?) is an array, then it could be > >> >> reshaped > >> >> to be exp(1j*x), which together with exp(2*pi*1j*x) would cover some > >> >> pretty > >> >> common cases. > >> > >> It couldn't be a mere reshape, since the complex dtype requires the > >> real and imag components to be adjacent to each other. They wouldn't > >> be so if sincos's return type is an array (nor even the cossin > >> alternative). It always requires a memory copy (except in the "who > >> cares?" case of a scalar). Composition with an efficient > >> np.tocomplex(real, imag) implementation would cover those use cases > >> whether sincos returns tuples or arrays. > > > > I would assume the basic return type would be complex, i.e., the cos/sin > > adjacent. The cos/sin parts would then be real/imag views into the array. > > You mean that the implementation of cossin (to make things easier on > ourselves) would create an (N,2) contiguous array, fill it with the > cos and sin results, then reshape it to return the expected (2,N) > Just return the transpose. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From robert.kern at gmail.com Thu May 2 10:40:43 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 2 May 2013 15:40:43 +0100 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: On Thu, May 2, 2013 at 3:28 PM, Charles R Harris wrote: > > > On Thu, May 2, 2013 at 7:47 AM, Robert Kern wrote: >> >> On Thu, May 2, 2013 at 2:38 PM, Charles R Harris >> wrote: >> > >> > On Thu, May 2, 2013 at 7:28 AM, Robert Kern >> > wrote: >> >> >> >> On Thu, May 2, 2013 at 12:03 PM, Nathaniel Smith wrote: >> >> > On 1 May 2013 23:12, "Charles R Harris" >> >> > wrote: >> >> >> >> >> >> On Wed, May 1, 2013 at 7:10 PM, Benjamin Root >> >> >> wrote: >> >> >>> >> >> >>> So, to summarize the thread so far: >> >> >>> >> >> >>> Consensus: >> >> >>> np.nanmean() >> >> >>> np.nanstd() >> >> >>> np.minmax() >> >> >>> np.argminmax() >> >> >>> >> >> >>> Vague Consensus: >> >> >>> np.sincos() >> >> >>> >> >> >> >> >> >> If the return of sincos (cossin?) is an array, then it could be >> >> >> reshaped >> >> >> to be exp(1j*x), which together with exp(2*pi*1j*x) would cover some >> >> >> pretty >> >> >> common cases. >> >> >> >> It couldn't be a mere reshape, since the complex dtype requires the >> >> real and imag components to be adjacent to each other. They wouldn't >> >> be so if sincos's return type is an array (nor even the cossin >> >> alternative). It always requires a memory copy (except in the "who >> >> cares?" case of a scalar). Composition with an efficient >> >> np.tocomplex(real, imag) implementation would cover those use cases >> >> whether sincos returns tuples or arrays. >> > >> > I would assume the basic return type would be complex, i.e., the cos/sin >> > adjacent. The cos/sin parts would then be real/imag views into the >> > array. >> >> You mean that the implementation of cossin (to make things easier on >> ourselves) would create an (N,2) contiguous array, fill it with the >> cos and sin results, then reshape it to return the expected (2,N) > > Just return the transpose. Yes, that's what I was getting at with that sentence. I don't doubt that that is possible. The problem I was pointing out was in the following sentence, which you snipped: "How would the user then reconstitute the exp(1j*x) result efficiently?" Please show me the code that the user would write to compute exp(1j*x) using np.cossin() without memory copies. My suspicion is that it will be non-intuitive enough that it should always be hidden away into a well-commented utility function. In that case, I think we should just provide an np.exp1j() ufunc that just uses the C sincos() function internally and let np.sincos()/np.cossin() do whatever is most natural and consistent with other nout>1 ufuncs freed from the constraints of this use case. 
-- Robert Kern From francesc at continuum.io Thu May 2 10:51:46 2013 From: francesc at continuum.io (Francesc Alted) Date: Thu, 02 May 2013 16:51:46 +0200 Subject: [Numpy-discussion] Profiling (was GSoC : Performance parity between numpy arrays and Python scalars) In-Reply-To: References: Message-ID: <51827D82.9030006@continuum.io> On 5/2/13 3:58 PM, Nathaniel Smith wrote: > callgrind has the *fabulous* kcachegrind front-end, but it only > measures memory access performance on a simulated machine, which is > very useful sometimes (if you're trying to optimize cache locality), > but there's no guarantee that the bottlenecks on its simulated machine > are the same as the bottlenecks on your real machine. Agreed, there is no guarantee, but my experience is that kcachegrind normally gives you a pretty decent view of cache faults and hence it can do pretty good predictions on how this affects your computations. I have used this feature extensively for optimizing parts of the Blosc compressor, and I cannot be more happier (to the point that, if it were not for Valgrind, I could not figure out many interesting memory access optimizations). -- Francesc Alted From charlesr.harris at gmail.com Thu May 2 10:57:33 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 2 May 2013 08:57:33 -0600 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: On Thu, May 2, 2013 at 8:40 AM, Robert Kern wrote: > On Thu, May 2, 2013 at 3:28 PM, Charles R Harris > wrote: > > > > > > On Thu, May 2, 2013 at 7:47 AM, Robert Kern > wrote: > >> > >> On Thu, May 2, 2013 at 2:38 PM, Charles R Harris > >> wrote: > >> > > >> > On Thu, May 2, 2013 at 7:28 AM, Robert Kern > >> > wrote: > >> >> > >> >> On Thu, May 2, 2013 at 12:03 PM, Nathaniel Smith > wrote: > >> >> > On 1 May 2013 23:12, "Charles R Harris" > > >> >> > wrote: > >> >> >> > >> >> >> On Wed, May 1, 2013 at 7:10 PM, Benjamin Root > >> >> >> wrote: > >> >> >>> > >> >> >>> So, to summarize the thread so far: > >> >> >>> > >> >> >>> Consensus: > >> >> >>> np.nanmean() > >> >> >>> np.nanstd() > >> >> >>> np.minmax() > >> >> >>> np.argminmax() > >> >> >>> > >> >> >>> Vague Consensus: > >> >> >>> np.sincos() > >> >> >>> > >> >> >> > >> >> >> If the return of sincos (cossin?) is an array, then it could be > >> >> >> reshaped > >> >> >> to be exp(1j*x), which together with exp(2*pi*1j*x) would cover > some > >> >> >> pretty > >> >> >> common cases. > >> >> > >> >> It couldn't be a mere reshape, since the complex dtype requires the > >> >> real and imag components to be adjacent to each other. They wouldn't > >> >> be so if sincos's return type is an array (nor even the cossin > >> >> alternative). It always requires a memory copy (except in the "who > >> >> cares?" case of a scalar). Composition with an efficient > >> >> np.tocomplex(real, imag) implementation would cover those use cases > >> >> whether sincos returns tuples or arrays. > >> > > >> > I would assume the basic return type would be complex, i.e., the > cos/sin > >> > adjacent. The cos/sin parts would then be real/imag views into the > >> > array. > >> > >> You mean that the implementation of cossin (to make things easier on > >> ourselves) would create an (N,2) contiguous array, fill it with the > >> cos and sin results, then reshape it to return the expected (2,N) > > > > Just return the transpose. > > Yes, that's what I was getting at with that sentence. 
I don't doubt > that that is possible. The problem I was pointing out was in the > following sentence, which you snipped: > > "How would the user then reconstitute the exp(1j*x) result efficiently?" > > Please show me the code that the user would write to compute exp(1j*x) > using np.cossin() without memory copies. My suspicion is that it will > be non-intuitive enough that it should always be hidden away into a > well-commented utility function. In that case, I think we should just > provide an np.exp1j() ufunc that just uses the C sincos() function > internally and let np.sincos()/np.cossin() do whatever is most natural > and consistent with other nout>1 ufuncs freed from the constraints of > this use case. > As you say, have two functions, one of which would use the other with a view/transpose, whatever. For instance, given exp1j(), have another function that returns the real/imag parts. The question is what the underlying function should be and for that I think exp1j() would be a good choice. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu May 2 11:36:30 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 2 May 2013 09:36:30 -0600 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: On Thu, May 2, 2013 at 7:14 AM, Nathaniel Smith wrote: > On Thu, May 2, 2013 at 6:26 AM, Arink Verma > wrote: > > Yes, we need to ensure that.. > > Code generator can be made, which can create code for table of registered > > dtype during build time itself. > > I'd probably just generate it at run-time on an as-needed basis. > (I.e., use the full lookup logic the first time, then save the > result.) New dtypes can be registered, which will mean the tables need > to change size at runtime anyway. If someone does some strange thing > like add float16's and float64's, we can do the lookup to determine > that this should be handled by the float64/float64 loop, and then > store that information so that the next time it's fast (but we > probably don't want to be calculating all combinations at build-time, > which would require running the full type resolution machinery, esp. > since it wouldn't really bring any benefits that I can see). > > * Re: the profiling, I wrote a full oprofile->callgrind format script > years ago: http://vorpus.org/~njs/op2calltree.py > Haven't used it in years either but neither oprofile nor kcachegrind > are terribly fast-moving projects so it's probably still working, or > could be made so without much work. > Or easier is to use the gperftools CPU profiler: > https://gperftools.googlecode.com/svn/trunk/doc/cpuprofile.html > > Instead of linking to it at build time, you can just use ctypes: > > In [7]: profiler = ctypes.CDLL("libprofiler.so.0") > > In [8]: profiler.ProfilerStart("some-file-name-here") > Out[8]: 1 > > In [9]: # do stuff here > > In [10]: profiler.ProfilerStop() > PROFILE: interrupts/evictions/bytes = 2/0/592 > Out[10]: 46 > > Then all the pprof analysis tools are available as described on that > webpage. > > * Please don't trust those random suggestions for possible > improvements I threw out when writing the original description. > Probably it's true that FP flag checking and ufunc type lookup are > expensive, but one should fix what the profile says to fix, not what > someone guessed might be good to fix based on a few minutes thought. 
> > * Instead of making a giant table of everything that needs to be done > to make stuff fast first, before writing any code, I'd suggest picking > one operation, figuring out what change would be the biggest > improvement for it, making that change, checking that it worked, and > then repeat until that operation is really fast. Then if there's still > time pick another operation. Producing a giant todo list isn't very > productive by itself if there's no time then to actually do all the > things on the list :-). > > * Did you notice this line on the requirements page? "Having your > first pull request merged before the GSoC application deadline (May 3) > is required for your application to be accepted." > Where is that last requirement? It seems out of line to me. Arink now has a pull request, but it looks intrusive enough and needs enough work that I don't think we can just put it in. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Thu May 2 11:43:41 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 2 May 2013 16:43:41 +0100 Subject: [Numpy-discussion] nanmean(), nanstd() and other "missing" functions for 1.8 In-Reply-To: References: <1888613198476891170@unknownmsgid> Message-ID: On Thu, May 2, 2013 at 3:57 PM, Charles R Harris wrote: > > On Thu, May 2, 2013 at 8:40 AM, Robert Kern wrote: >> >> On Thu, May 2, 2013 at 3:28 PM, Charles R Harris >> wrote: >> > >> > >> > On Thu, May 2, 2013 at 7:47 AM, Robert Kern >> > wrote: >> >> >> >> On Thu, May 2, 2013 at 2:38 PM, Charles R Harris >> >> wrote: >> >> > >> >> > On Thu, May 2, 2013 at 7:28 AM, Robert Kern >> >> > wrote: >> >> >> >> >> >> On Thu, May 2, 2013 at 12:03 PM, Nathaniel Smith >> >> >> wrote: >> >> >> > On 1 May 2013 23:12, "Charles R Harris" >> >> >> > >> >> >> > wrote: >> >> >> >> >> >> >> >> On Wed, May 1, 2013 at 7:10 PM, Benjamin Root >> >> >> >> wrote: >> >> >> >>> >> >> >> >>> So, to summarize the thread so far: >> >> >> >>> >> >> >> >>> Consensus: >> >> >> >>> np.nanmean() >> >> >> >>> np.nanstd() >> >> >> >>> np.minmax() >> >> >> >>> np.argminmax() >> >> >> >>> >> >> >> >>> Vague Consensus: >> >> >> >>> np.sincos() >> >> >> >>> >> >> >> >> >> >> >> >> If the return of sincos (cossin?) is an array, then it could be >> >> >> >> reshaped >> >> >> >> to be exp(1j*x), which together with exp(2*pi*1j*x) would cover >> >> >> >> some >> >> >> >> pretty >> >> >> >> common cases. >> >> >> >> >> >> It couldn't be a mere reshape, since the complex dtype requires the >> >> >> real and imag components to be adjacent to each other. They wouldn't >> >> >> be so if sincos's return type is an array (nor even the cossin >> >> >> alternative). It always requires a memory copy (except in the "who >> >> >> cares?" case of a scalar). Composition with an efficient >> >> >> np.tocomplex(real, imag) implementation would cover those use cases >> >> >> whether sincos returns tuples or arrays. >> >> > >> >> > I would assume the basic return type would be complex, i.e., the >> >> > cos/sin >> >> > adjacent. The cos/sin parts would then be real/imag views into the >> >> > array. >> >> >> >> You mean that the implementation of cossin (to make things easier on >> >> ourselves) would create an (N,2) contiguous array, fill it with the >> >> cos and sin results, then reshape it to return the expected (2,N) >> > >> > Just return the transpose. >> >> Yes, that's what I was getting at with that sentence. I don't doubt >> that that is possible. 
The problem I was pointing out was in the >> following sentence, which you snipped: >> >> "How would the user then reconstitute the exp(1j*x) result efficiently?" >> >> Please show me the code that the user would write to compute exp(1j*x) >> using np.cossin() without memory copies. My suspicion is that it will >> be non-intuitive enough that it should always be hidden away into a >> well-commented utility function. In that case, I think we should just >> provide an np.exp1j() ufunc that just uses the C sincos() function >> internally and let np.sincos()/np.cossin() do whatever is most natural >> and consistent with other nout>1 ufuncs freed from the constraints of >> this use case. > > As you say, have two functions, one of which would use the other with a > view/transpose, whatever. For instance, given exp1j(), have another function > that returns the real/imag parts. The question is what the underlying > function should be and for that I think exp1j() would be a good choice. I don't see why we would bother. Just implement them both as ufuncs that use a C sincos() function internally and be done with it. Implementing one in terms of the other requires that the other is not a ufunc (a minor irritation) and always returns non-contiguous arrays (a more substantial irritation). There isn't anything to be gained by implementing one in terms of the other. -- Robert Kern From njs at pobox.com Thu May 2 12:15:36 2013 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 2 May 2013 12:15:36 -0400 Subject: [Numpy-discussion] Profiling (was GSoC : Performance parity between numpy arrays and Python scalars) In-Reply-To: <51827D82.9030006@continuum.io> References: <51827D82.9030006@continuum.io> Message-ID: On Thu, May 2, 2013 at 10:51 AM, Francesc Alted wrote: > On 5/2/13 3:58 PM, Nathaniel Smith wrote: >> callgrind has the *fabulous* kcachegrind front-end, but it only >> measures memory access performance on a simulated machine, which is >> very useful sometimes (if you're trying to optimize cache locality), >> but there's no guarantee that the bottlenecks on its simulated machine >> are the same as the bottlenecks on your real machine. > > Agreed, there is no guarantee, but my experience is that kcachegrind > normally gives you a pretty decent view of cache faults and hence it can > do pretty good predictions on how this affects your computations. I > have used this feature extensively for optimizing parts of the Blosc > compressor, and I cannot be more happier (to the point that, if it were > not for Valgrind, I could not figure out many interesting memory access > optimizations). Right -- if you have code where you know that memory is the bottleneck (so esp. integer-heavy code), then callgrind is perfect. In fact it was originally written to make it easier to optimize the bzip2 compressor :-). My point isn't that it's not useful, just, it's a little more of a specialist tool, so I hesitate to recommend it as the first profiler for people to reach for. An extreme example would be, last time I played with this, I found that for numpy scalar float64 * float64, 50% of the total time was in fiddling with floating point control registers. But that time would be invisible to callgrind's measurements... 
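A small sketch of the floating point flag machinery in question; np.errstate is its user-visible face, and the checking it implies is the overhead being discussed:

import numpy as np

# every scalar op inspects the FP status flags afterwards so it can
# warn (or raise) on overflow/invalid; that bookkeeping is the cost
with np.errstate(over='warn'):
    np.float64(1e308) * np.float64(10.0)   # emits a RuntimeWarning
with np.errstate(over='ignore'):
    np.float64(1e308) * np.float64(10.0)   # flags still checked, no warning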
-n From raul at virtualmaterials.com Thu May 2 12:27:09 2013 From: raul at virtualmaterials.com (Raul Cota) Date: Thu, 02 May 2013 10:27:09 -0600 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: <518293DD.1010004@virtualmaterials.com> For the sake of completeness, I don't think I ever mentioned what I used to profile when I was working on speeding up the scalars. I used AQTime 7. It is commercial and only for Windows (as far as I know). It works great and it gave me fairly accurate timings and all sorts of visual navigation features. I do have to mock around with the numpy code every time I want to compile it to get it to play nicely with Visual Studio to generate the proper bindings for the profiler. Raul On 02/05/2013 7:14 AM, Nathaniel Smith wrote: > On Thu, May 2, 2013 at 6:26 AM, Arink Verma wrote: >> Yes, we need to ensure that.. >> Code generator can be made, which can create code for table of registered >> dtype during build time itself. > I'd probably just generate it at run-time on an as-needed basis. > (I.e., use the full lookup logic the first time, then save the > result.) New dtypes can be registered, which will mean the tables need > to change size at runtime anyway. If someone does some strange thing > like add float16's and float64's, we can do the lookup to determine > that this should be handled by the float64/float64 loop, and then > store that information so that the next time it's fast (but we > probably don't want to be calculating all combinations at build-time, > which would require running the full type resolution machinery, esp. > since it wouldn't really bring any benefits that I can see). > > * Re: the profiling, I wrote a full oprofile->callgrind format script > years ago: http://vorpus.org/~njs/op2calltree.py > Haven't used it in years either but neither oprofile nor kcachegrind > are terribly fast-moving projects so it's probably still working, or > could be made so without much work. > Or easier is to use the gperftools CPU profiler: > https://gperftools.googlecode.com/svn/trunk/doc/cpuprofile.html > > Instead of linking to it at build time, you can just use ctypes: > > In [7]: profiler = ctypes.CDLL("libprofiler.so.0") > > In [8]: profiler.ProfilerStart("some-file-name-here") > Out[8]: 1 > > In [9]: # do stuff here > > In [10]: profiler.ProfilerStop() > PROFILE: interrupts/evictions/bytes = 2/0/592 > Out[10]: 46 > > Then all the pprof analysis tools are available as described on that webpage. > > * Please don't trust those random suggestions for possible > improvements I threw out when writing the original description. > Probably it's true that FP flag checking and ufunc type lookup are > expensive, but one should fix what the profile says to fix, not what > someone guessed might be good to fix based on a few minutes thought. > > * Instead of making a giant table of everything that needs to be done > to make stuff fast first, before writing any code, I'd suggest picking > one operation, figuring out what change would be the biggest > improvement for it, making that change, checking that it worked, and > then repeat until that operation is really fast. Then if there's still > time pick another operation. Producing a giant todo list isn't very > productive by itself if there's no time then to actually do all the > things on the list :-). > > * Did you notice this line on the requirements page? 
"Having your > first pull request merged before the GSoC application deadline (May 3) > is required for your application to be accepted." > > -n > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From chris.barker at noaa.gov Thu May 2 12:38:45 2013 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Thu, 2 May 2013 09:38:45 -0700 Subject: [Numpy-discussion] numPy not imported into Python In-Reply-To: References: Message-ID: On Wed, May 1, 2013 at 6:52 PM, Mark Micklich wrote: > Oh! I imported numpy and that worked. I probably should have mentioned > I'm working through the 2005 "*Numerical Methods in Engineering with > Python*" textbook from the school library. The examples are still good, > but the Import statement used in the book is obsolete. Thanks for the > quick reply. > > jsut so you know, there may be other subtle issues -- numpy has mostly the same API as numarray, but not exactly, so if somethign doesn't appear to be working as the book suggests, make sure to check out the current numpy docs. Note that there seems to be a new addition of the book: Numerical Methods in Engineering with Python 3 I assume it uses a recent numpy version. (too bad there isn't a py2, recent numpy version, but there you go...) -Chris > Mark > > On Wed, May 1, 2013 at 6:47 PM, Christopher Hanley wrote: > >> Mark, >> >> Numpy is not numarray. Numarray is an older package that has long since >> been replaced by numpy. You should only use numpy in any development from >> now on. >> >> Chris >> >> >> On Wednesday, May 1, 2013, Mark Micklich wrote: >> >>> Hello -- After installing numPy, I'm getting the following error message >>> when attempting to import numarray: >>> >>> ImportError: No module named numarray >>> >>> I do have numPy installed. I'm running under Lubuntu 12.10 and the Spyder >>> 2.1.10 IDE. I'm fairly new to developing Python on Linux. I assume there >>> is some path issue, but I'm not clear where to start. If numPy is >>> installed, how to I point Spyder to the numPy modules so I can get numarray >>> to work? >>> >>> Thanks, >>> Mark >>> >>> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Thu May 2 12:45:25 2013 From: pav at iki.fi (Pauli Virtanen) Date: Thu, 2 May 2013 16:45:25 +0000 (UTC) Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars References: <51816167.6040607@virtualmaterials.com> Message-ID: Charles R Harris gmail.com> writes: [clip] > * Did you notice this line on the requirements page? "Having your > first pull request merged before the GSoC application deadline (May 3) > is required for your application to be accepted." > > Where is that last requirement? It seems out of line to me. 
> Arink now has a pull request, but it looks intrusive enough > and needs enough work that I don't think we can just put it in. Well, we wrote so here: http://projects.scipy.org/scipy/wiki/SummerofCodeIdeas? but that's maybe just a mistake -- PSF states exactly the opposite: http://wiki.python.org/moin/SummerOfCode/ApplicationTemplate2013 -- Pauli Virtanen From ralf.gommers at gmail.com Thu May 2 13:49:26 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 2 May 2013 19:49:26 +0200 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: On Thu, May 2, 2013 at 6:45 PM, Pauli Virtanen wrote: > Charles R Harris gmail.com> writes: > [clip] > > * Did you notice this line on the requirements page? "Having your > > first pull request merged before the GSoC application deadline (May 3) > > is required for your application to be accepted." > > > > Where is that last requirement? It seems out of line to me. > > Arink now has a pull request, but it looks intrusive enough > > and needs enough work that I don't think we can just put it in. > > Well, we wrote so here: > > http://projects.scipy.org/scipy/wiki/SummerofCodeIdeas > > but that's maybe just a mistake -- PSF states exactly the opposite: > > http://wiki.python.org/moin/SummerOfCode/ApplicationTemplate2013 > It wasn't a mistake - the part of a PR process that is most interesting in the context of evaluating GSoC applications is the dialogue and how the submitter deals with feedback. I forgot to add on that page (although I think it was in one of my emails) that the patch shouldn't be completely trivial - fixing a typo doesn't really tell us all that much. But in this case Chuck's suggestion on the PR of how to get something merged looks fine. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu May 2 15:54:15 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 2 May 2013 13:54:15 -0600 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: On Thu, May 2, 2013 at 11:49 AM, Ralf Gommers wrote: > > > > On Thu, May 2, 2013 at 6:45 PM, Pauli Virtanen wrote: > >> Charles R Harris gmail.com> writes: >> [clip] >> > * Did you notice this line on the requirements page? "Having your >> > first pull request merged before the GSoC application deadline (May 3) >> > is required for your application to be accepted." >> > >> > Where is that last requirement? It seems out of line to me. >> > Arink now has a pull request, but it looks intrusive enough >> > and needs enough work that I don't think we can just put it in. >> >> Well, we wrote so here: >> >> http://projects.scipy.org/scipy/wiki/SummerofCodeIdeas >> >> but that's maybe just a mistake -- PSF states exactly the opposite: >> >> http://wiki.python.org/moin/SummerOfCode/ApplicationTemplate2013 >> > > It wasn't a mistake - the part of a PR process that is most interesting in > the context of evaluating GSoC applications is the dialogue and how the > submitter deals with feedback. > > I forgot to add on that page (although I think it was in one of my emails) > that the patch shouldn't be completely trivial - fixing a typo doesn't > really tell us all that much. But in this case Chuck's suggestion on the PR > of how to get something merged looks fine. 
> > My feeling is that learning to work with the community is part of the process after acceptance and one of the reasons there are mentors. You might get some bad choices skipping the submission/acceptance bit, but you might also close the door on people who are new to the whole thing. Ideally, the applicants would already have involved themselves with the community, practically that may often not be the case. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL:
From ralf.gommers at gmail.com Thu May 2 18:29:35 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 3 May 2013 00:29:35 +0200 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: On Thu, May 2, 2013 at 9:54 PM, Charles R Harris < charlesr.harris at gmail.com> wrote: > > > On Thu, May 2, 2013 at 11:49 AM, Ralf Gommers wrote: > >> >> >> >> On Thu, May 2, 2013 at 6:45 PM, Pauli Virtanen wrote: >> >>> Charles R Harris gmail.com> writes: >>> [clip] >>> > * Did you notice this line on the requirements page? "Having your >>> > first pull request merged before the GSoC application deadline (May 3) >>> > is required for your application to be accepted." >>> > >>> > Where is that last requirement? It seems out of line to me. >>> > Arink now has a pull request, but it looks intrusive enough >>> > and needs enough work that I don't think we can just put it in. >>> >>> Well, we wrote so here: >>> >>> http://projects.scipy.org/scipy/wiki/SummerofCodeIdeas >>> >>> but that's maybe just a mistake -- PSF states exactly the opposite: >>> >>> http://wiki.python.org/moin/SummerOfCode/ApplicationTemplate2013 >>> >> >> It wasn't a mistake - the part of a PR process that is most interesting >> in the context of evaluating GSoC applications is the dialogue and how the >> submitter deals with feedback. >> >> I forgot to add on that page (although I think it was in one of my >> emails) that the patch shouldn't be completely trivial - fixing a typo >> doesn't really tell us all that much. But in this case Chuck's suggestion >> on the PR of how to get something merged looks fine. >> >> > My feeling is that learning to work with the > community is part of the > process after acceptance and one of the reasons there are mentors. You > might > get some bad choices skipping the submission/acceptance bit, but you might > also close the door on people who are new to the whole thing. Ideally, the > applicants would already have involved themselves with the community, > practically that may often not be the case. > You may be right in all of that, but since there's a good chance that there are more applicants than slots I'd rather not make those bad choices if they're acceptable. Right now we have three solid proposals, from Arink, Blake and Surya. If we're lucky we'll get three slots, but if not then we'll have a tough choice to make. The application deadline is tomorrow, so now is the time for final tweaks to the proposals. After that of course the plan can still be worked out more, but it can't be edited on Melange anymore. Ralf -------------- next part -------------- An HTML attachment was scrubbed...
URL: From ralf.gommers at gmail.com Thu May 2 18:30:52 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 3 May 2013 00:30:52 +0200 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: On Fri, May 3, 2013 at 12:29 AM, Ralf Gommers wrote: > > > > On Thu, May 2, 2013 at 9:54 PM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> >> >> On Thu, May 2, 2013 at 11:49 AM, Ralf Gommers wrote: >> >>> >>> >>> >>> On Thu, May 2, 2013 at 6:45 PM, Pauli Virtanen wrote: >>> >>>> Charles R Harris gmail.com> writes: >>>> [clip] >>>> > * Did you notice this line on the requirements page? "Having your >>>> > first pull request merged before the GSoC application deadline (May 3) >>>> > is required for your application to be accepted." >>>> > >>>> > Where is that last requirement? It seems out of line to me. >>>> > Arink now has a pull request, but it looks intrusive enough >>>> > and needs enough work that I don't think we can just put it in. >>>> >>>> Well, we wrote so here: >>>> >>>> http://projects.scipy.org/scipy/wiki/SummerofCodeIdeas >>>> >>>> but that's maybe just a mistake -- PSF states exactly the opposite: >>>> >>>> http://wiki.python.org/moin/SummerOfCode/ApplicationTemplate2013 >>>> >>> >>> It wasn't a mistake - the part of a PR process that is most interesting >>> in the context of evaluating GSoC applications is the dialogue and how the >>> submitter deals with feedback. >>> >>> I forgot to add on that page (although I think it was in one of my >>> emails) that the patch shouldn't be completely trivial - fixing a typo >>> doesn't really tell us all that much. But in this case Chuck's suggestion >>> on the PR of how to get something merged looks fine. >>> >>> >> My feeling is that learning to work with the community is part of the >> process after acceptance and one of the reasons there are mentors. You >> might get some bad choices skipping the submission/acceptance bit, but you >> might also close the door on people who are new to the whole thing. >> Ideally, the applicants would already have involved themselves with the >> community, practically that may often not be ths case. >> > > You may be right in all of that, but since there's a good chance that > there are more applicants than slots I'd rather not make those bad choices > if they're acceptable. > acceptable --> avoidable > Right now we have three solid proposals, from Arink, Blake and Surya. If > we're lucky we'll get three slots, but if not then we'll have a tough > choice to make. > > The application deadline is tomorrow, so now is the time for final tweaks > to the proposals. After that of course the plan can still be worked out > more, but it can't be edited on Melange anymore. > > Ralf > > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From josef.pktd at gmail.com Thu May 2 20:47:21 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 2 May 2013 20:47:21 -0400 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: On Thu, May 2, 2013 at 6:30 PM, Ralf Gommers wrote: > > > > On Fri, May 3, 2013 at 12:29 AM, Ralf Gommers > wrote: >> >> >> >> >> On Thu, May 2, 2013 at 9:54 PM, Charles R Harris >> wrote: >>> >>> >>> >>> On Thu, May 2, 2013 at 11:49 AM, Ralf Gommers >>> wrote: >>>> >>>> >>>> >>>> >>>> On Thu, May 2, 2013 at 6:45 PM, Pauli Virtanen wrote: >>>>> >>>>> Charles R Harris gmail.com> writes: >>>>> [clip] >>>>> > * Did you notice this line on the requirements page? "Having your >>>>> > first pull request merged before the GSoC application deadline (May >>>>> > 3) >>>>> > is required for your application to be accepted." >>>>> > >>>>> > Where is that last requirement? It seems out of line to me. >>>>> > Arink now has a pull request, but it looks intrusive enough >>>>> > and needs enough work that I don't think we can just put it in. >>>>> >>>>> Well, we wrote so here: >>>>> >>>>> http://projects.scipy.org/scipy/wiki/SummerofCodeIdeas >>>>> >>>>> but that's maybe just a mistake -- PSF states exactly the opposite: >>>>> >>>>> http://wiki.python.org/moin/SummerOfCode/ApplicationTemplate2013 >>>> >>>> >>>> It wasn't a mistake - the part of a PR process that is most interesting >>>> in the context of evaluating GSoC applications is the dialogue and how the >>>> submitter deals with feedback. >>>> >>>> I forgot to add on that page (although I think it was in one of my >>>> emails) that the patch shouldn't be completely trivial - fixing a typo >>>> doesn't really tell us all that much. But in this case Chuck's suggestion on >>>> the PR of how to get something merged looks fine. >>>> >>> >>> My feeling is that learning to work with the community is part of the >>> process after acceptance and one of the reasons there are mentors. You might >>> get some bad choices skipping the submission/acceptance bit, but you might >>> also close the door on people who are new to the whole thing. Ideally, the >>> applicants would already have involved themselves with the community, >>> practically that may often not be ths case. >> >> >> You may be right in all of that, but since there's a good chance that >> there are more applicants than slots I'd rather not make those bad choices >> if they're acceptable. > > > acceptable --> avoidable > >> >> Right now we have three solid proposals, from Arink, Blake and Surya. If >> we're lucky we'll get three slots, but if not then we'll have a tough choice >> to make. >> >> The application deadline is tomorrow, so now is the time for final tweaks >> to the proposals. After that of course the plan can still be worked out >> more, but it can't be edited on Melange anymore. Terri can still make it editable on Melange if necessary. 
Josef >> >> Ralf >> >> > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From charlesr.harris at gmail.com Fri May 3 13:22:37 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 3 May 2013 11:22:37 -0600 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: On Thu, May 2, 2013 at 6:47 PM, wrote: > On Thu, May 2, 2013 at 6:30 PM, Ralf Gommers > wrote: > > > > > > > > On Fri, May 3, 2013 at 12:29 AM, Ralf Gommers > > wrote: > >> > >> > >> > >> > >> On Thu, May 2, 2013 at 9:54 PM, Charles R Harris > >> wrote: > >>> > >>> > >>> > >>> On Thu, May 2, 2013 at 11:49 AM, Ralf Gommers > >>> wrote: > >>>> > >>>> > >>>> > >>>> > >>>> On Thu, May 2, 2013 at 6:45 PM, Pauli Virtanen wrote: > >>>>> > >>>>> Charles R Harris gmail.com> writes: > >>>>> [clip] > >>>>> > * Did you notice this line on the requirements page? "Having your > >>>>> > first pull request merged before the GSoC application deadline (May > >>>>> > 3) > >>>>> > is required for your application to be accepted." > >>>>> > > >>>>> > Where is that last requirement? It seems out of line to me. > >>>>> > Arink now has a pull request, but it looks intrusive enough > >>>>> > and needs enough work that I don't think we can just put it in. > >>>>> > >>>>> Well, we wrote so here: > >>>>> > >>>>> http://projects.scipy.org/scipy/wiki/SummerofCodeIdeas > >>>>> > >>>>> but that's maybe just a mistake -- PSF states exactly the opposite: > >>>>> > >>>>> http://wiki.python.org/moin/SummerOfCode/ApplicationTemplate2013 > >>>> > >>>> > >>>> It wasn't a mistake - the part of a PR process that is most > interesting > >>>> in the context of evaluating GSoC applications is the dialogue and > how the > >>>> submitter deals with feedback. > >>>> > >>>> I forgot to add on that page (although I think it was in one of my > >>>> emails) that the patch shouldn't be completely trivial - fixing a typo > >>>> doesn't really tell us all that much. But in this case Chuck's > suggestion on > >>>> the PR of how to get something merged looks fine. > >>>> > >>> > >>> My feeling is that learning to work with the community is part of the > >>> process after acceptance and one of the reasons there are mentors. You > might > >>> get some bad choices skipping the submission/acceptance bit, but you > might > >>> also close the door on people who are new to the whole thing. Ideally, > the > >>> applicants would already have involved themselves with the community, > >>> practically that may often not be ths case. > >> > >> > >> You may be right in all of that, but since there's a good chance that > >> there are more applicants than slots I'd rather not make those bad > choices > >> if they're acceptable. > > > > > > acceptable --> avoidable > > > >> > >> Right now we have three solid proposals, from Arink, Blake and Surya. If > >> we're lucky we'll get three slots, but if not then we'll have a tough > choice > >> to make. > >> > >> The application deadline is tomorrow, so now is the time for final > tweaks > >> to the proposals. After that of course the plan can still be worked out > >> more, but it can't be edited on Melange anymore. > > Terri can still make it editable on Melange if necessary. > > Arink, you still have work to do for a PR. Chuck. -------------- next part -------------- An HTML attachment was scrubbed... 
URL:
From arinkverma at iitrpr.ac.in Fri May 3 13:59:46 2013 From: arinkverma at iitrpr.ac.in (Arink Verma) Date: Fri, 3 May 2013 23:29:46 +0530 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: I could hardly find anything to improve or correct -- not even a typo in the docs. Where do we need to avoid the version checks? On Fri, May 3, 2013 at 10:52 PM, Charles R Harris < charlesr.harris at gmail.com> wrote: > > > On Thu, May 2, 2013 at 6:47 PM, wrote: > >> On Thu, May 2, 2013 at 6:30 PM, Ralf Gommers >> wrote: >> > >> > >> > >> > On Fri, May 3, 2013 at 12:29 AM, Ralf Gommers >> > wrote: >> >> >> >> >> >> >> >> >> >> On Thu, May 2, 2013 at 9:54 PM, Charles R Harris >> >> wrote: >> >>> >> >>> >> >>> >> >>> On Thu, May 2, 2013 at 11:49 AM, Ralf Gommers > > >> >>> wrote: >> >>>> >> >>>> >> >>>> >> >>>> >> >>>> On Thu, May 2, 2013 at 6:45 PM, Pauli Virtanen wrote: >> >>>>> >> >>>>> Charles R Harris gmail.com> writes: >> >>>>> [clip] >> >>>>> > * Did you notice this line on the requirements page? "Having your >> >>>>> > first pull request merged before the GSoC application deadline >> (May >> >>>>> > 3) >> >>>>> > is required for your application to be accepted." >> >>>>> > >> >>>>> > Where is that last requirement? It seems out of line to me. >> >>>>> > Arink now has a pull request, but it looks intrusive enough >> >>>>> > and needs enough work that I don't think we can just put it in. >> >>>>> >> >>>>> Well, we wrote so here: >> >>>>> >> >>>>> http://projects.scipy.org/scipy/wiki/SummerofCodeIdeas >> >>>>> >> >>>>> but that's maybe just a mistake -- PSF states exactly the opposite: >> >>>>> >> >>>>> >> http://wiki.python.org/moin/SummerOfCode/ApplicationTemplate2013 >> >>>> >> >>>> >> >>>> It wasn't a mistake - the part of a PR process that is most >> interesting >> >>>> in the context of evaluating GSoC applications is the dialogue and >> how the >> >>>> submitter deals with feedback. >> >>>> >> >>>> I forgot to add on that page (although I think it was in one of my >> >>>> emails) that the patch shouldn't be completely trivial - fixing a >> typo >> >>>> doesn't really tell us all that much. But in this case Chuck's >> suggestion on >> >>>> the PR of how to get something merged looks fine. >> >>>> >> >>> >> >>> My feeling is that learning to work with the community is part of the >> >>> process after acceptance and one of the reasons there are mentors. >> You might >> >>> get some bad choices skipping the submission/acceptance bit, but you >> might >> >>> also close the door on people who are new to the whole thing. >> Ideally, the >> >>> applicants would already have involved themselves with the community, >> >>> practically that may often not be the case. >> >> >> >> >> >> You may be right in all of that, but since there's a good chance that >> >> there are more applicants than slots I'd rather not make those bad >> choices >> >> if they're acceptable. >> > >> > >> > acceptable --> avoidable >> > >> >> >> >> Right now we have three solid proposals, from Arink, Blake and Surya. >> If >> >> we're lucky we'll get three slots, but if not then we'll have a tough >> choice >> >> to make. >> >> >> >> The application deadline is tomorrow, so now is the time for final >> tweaks >> >> to the proposals. After that of course the plan can still be worked >> out >> >> more, but it can't be edited on Melange anymore. >> >> Terri can still make it editable on Melange if necessary. >> >> > Arink, you still have work to do for a PR.
> > Chuck. > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Arink Computer Science and Engineering Indian Institute of Technology Ropar www.arinkverma.in -------------- next part -------------- An HTML attachment was scrubbed... URL: From arinkverma at iitrpr.ac.in Fri May 3 14:13:18 2013 From: arinkverma at iitrpr.ac.in (Arink Verma) Date: Fri, 3 May 2013 23:43:18 +0530 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: I have created a new PR, have removed one irrelevant version check. https://github.com/numpy/numpy/pull/3304/files On Fri, May 3, 2013 at 11:29 PM, Arink Verma wrote: > I hardly found, any thing to improve and correct.. not even typo in docs? > Where we need to avoid the version checks? > > > > On Fri, May 3, 2013 at 10:52 PM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> >> >> On Thu, May 2, 2013 at 6:47 PM, wrote: >> >>> On Thu, May 2, 2013 at 6:30 PM, Ralf Gommers >>> wrote: >>> > >>> > >>> > >>> > On Fri, May 3, 2013 at 12:29 AM, Ralf Gommers >>> > wrote: >>> >> >>> >> >>> >> >>> >> >>> >> On Thu, May 2, 2013 at 9:54 PM, Charles R Harris >>> >> wrote: >>> >>> >>> >>> >>> >>> >>> >>> On Thu, May 2, 2013 at 11:49 AM, Ralf Gommers < >>> ralf.gommers at gmail.com> >>> >>> wrote: >>> >>>> >>> >>>> >>> >>>> >>> >>>> >>> >>>> On Thu, May 2, 2013 at 6:45 PM, Pauli Virtanen wrote: >>> >>>>> >>> >>>>> Charles R Harris gmail.com> writes: >>> >>>>> [clip] >>> >>>>> > * Did you notice this line on the requirements page? "Having your >>> >>>>> > first pull request merged before the GSoC application deadline >>> (May >>> >>>>> > 3) >>> >>>>> > is required for your application to be accepted." >>> >>>>> > >>> >>>>> > Where is that last requirement? It seems out of line to me. >>> >>>>> > Arink now has a pull request, but it looks intrusive enough >>> >>>>> > and needs enough work that I don't think we can just put it in. >>> >>>>> >>> >>>>> Well, we wrote so here: >>> >>>>> >>> >>>>> http://projects.scipy.org/scipy/wiki/SummerofCodeIdeas >>> >>>>> >>> >>>>> but that's maybe just a mistake -- PSF states exactly the opposite: >>> >>>>> >>> >>>>> >>> http://wiki.python.org/moin/SummerOfCode/ApplicationTemplate2013 >>> >>>> >>> >>>> >>> >>>> It wasn't a mistake - the part of a PR process that is most >>> interesting >>> >>>> in the context of evaluating GSoC applications is the dialogue and >>> how the >>> >>>> submitter deals with feedback. >>> >>>> >>> >>>> I forgot to add on that page (although I think it was in one of my >>> >>>> emails) that the patch shouldn't be completely trivial - fixing a >>> typo >>> >>>> doesn't really tell us all that much. But in this case Chuck's >>> suggestion on >>> >>>> the PR of how to get something merged looks fine. >>> >>>> >>> >>> >>> >>> My feeling is that learning to work with the community is part of the >>> >>> process after acceptance and one of the reasons there are mentors. >>> You might >>> >>> get some bad choices skipping the submission/acceptance bit, but you >>> might >>> >>> also close the door on people who are new to the whole thing. >>> Ideally, the >>> >>> applicants would already have involved themselves with the community, >>> >>> practically that may often not be ths case. 
>>> >> >>> >> >>> >> You may be right in all of that, but since there's a good chance that >>> >> there are more applicants than slots I'd rather not make those bad >>> choices >>> >> if they're acceptable. >>> > >>> > >>> > acceptable --> avoidable >>> > >>> >> >>> >> Right now we have three solid proposals, from Arink, Blake and Surya. >>> If >>> >> we're lucky we'll get three slots, but if not then we'll have a tough >>> choice >>> >> to make. >>> >> >>> >> The application deadline is tomorrow, so now is the time for final >>> tweaks >>> >> to the proposals. After that of course the plan can still be worked >>> out >>> >> more, but it can't be edited on Melange anymore. >>> >>> Terri can still make it editable on Melange if necessary. >>> >>> >> Arink, you still have work to do for a PR. >> >> Chuck. >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > > -- > Arink > Computer Science and Engineering > Indian Institute of Technology Ropar > www.arinkverma.in > -- Arink Computer Science and Engineering Indian Institute of Technology Ropar www.arinkverma.in -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Fri May 3 14:46:06 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 3 May 2013 12:46:06 -0600 Subject: [Numpy-discussion] GSoC : Performance parity between numpy arrays and Python scalars In-Reply-To: References: <51816167.6040607@virtualmaterials.com> Message-ID: On Fri, May 3, 2013 at 12:13 PM, Arink Verma wrote: > I have created a new PR, have removed one irrelevant version check. > https://github.com/numpy/numpy/pull/3304/files > I made some remarks on the PR. The convention on numpy-discussion is bottom posting so you should do that to avoid future complaints. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From pierre.raybaut at gmail.com Fri May 3 15:48:25 2013 From: pierre.raybaut at gmail.com (Pierre Raybaut) Date: Fri, 3 May 2013 21:48:25 +0200 Subject: [Numpy-discussion] ANN: New WinPython with Python 2.7.4 and 3.3.1 (32/64bit) Message-ID: Hi all, I am pleased to announce that four new versions of WinPython have been released yesterday with Python 2.7.4 and 3.3.1, 32 and 64 bits. Many packages have been added or upgraded (see the automatically-generated changelogs). Special thanks to Christoph Gohlke for building most of the binary packages bundled in WinPython. WinPython is a free open-source portable distribution of Python for Windows, designed for scientists. 
It is a full-featured (see http://code.google.com/p/winpython/wiki/PackageIndex) Python-based scientific environment:
* Designed for scientists (thanks to the integrated libraries NumPy, SciPy, Matplotlib, guiqwt, etc.):
* Regular *scientific users*: interactive data processing and visualization using Python with Spyder
* *Advanced scientific users and software developers*: Python applications development with Spyder, version control with Mercurial and other development tools (like gettext)
* *Portable*: preconfigured, it should run out of the box on any machine under Windows (without any installation requirements) and the folder containing WinPython can be moved to any location (local, network or removable drive)
* *Flexible*: one can install (or should I write "use" as it's portable) as many WinPython versions as necessary (like isolated and self-consistent environments), even if those versions are running different versions of Python (2.7, 3.3) or different architectures (32bit or 64bit) on the same machine
* *Customizable*: using the integrated package manager (wppm, as WinPython Package Manager), it's possible to install, uninstall or upgrade Python packages (see http://code.google.com/p/winpython/wiki/WPPM for more details on supported package formats).
*WinPython is not an attempt to replace Python(x,y)*, this is just something different (see http://code.google.com/p/winpython/wiki/Roadmap): more flexible, easier to maintain, movable and less invasive for the OS, but certainly less user-friendly, with fewer packages/contents and without any integration into Windows explorer [*]. [*] Actually there is an optional integration into Windows explorer, providing the same features as the official Python installer regarding file associations and context menu entry (this option may be activated through the WinPython Control Panel), and adding shortcuts to the Windows Start menu. Enjoy! -------------- next part -------------- An HTML attachment was scrubbed... URL:
From pav at iki.fi Fri May 3 19:43:32 2013 From: pav at iki.fi (Pauli Virtanen) Date: Sat, 04 May 2013 02:43:32 +0300 Subject: [Numpy-discussion] numpy.scipy.org page 404s In-Reply-To: References: <517A9D98.70003@hilboll.de> <517AA0CF.3000801@hilboll.de> Message-ID: On 26.04.2013 18:51, Robert Kern wrote: [clip] > Right now, the recurring cost is kicking the www.scipy.org wiki every > once in a while under the deluge of spam. It's dying a slow death again. How about restricting editing pages to people in EditorGroup? There should be an option in Moin config for that. I don't think there is any other solution to the spam with the Moin wiki at present. -- Pauli Virtanen
From fpacchiani at hotmail.com Sat May 4 01:05:29 2013 From: fpacchiani at hotmail.com (Francesco Pacchiani) Date: Sat, 4 May 2013 07:05:29 +0200 Subject: [Numpy-discussion] Error installing numpy 1.7.1 Message-ID: Hello, I want to install numpy 1.7.1 for use with python 3.2.1 and have been blocked by an error during the build process. My OS is Mandriva 2011. I'm installing numpy from source because python2.7 is already installed and seems to be the default; the rpm's don't install anything in python3.2/site-packages. The command used is "python3 setup.py build" and below I have posted the end of the output, which contains the error message. I understand it is the compilation of _capi.c that poses problems, but have not been able to solve it.
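Note: every unresolved symbol in the link errors below (log, sqrt, pow, log10, ceil, floor, and the fetestexcept/feclearexcept/feraiseexcept trio) normally comes from libm, and the failing gcc command shown in the output passes -Wl,--no-undefined but no -lm. Unresolved libm references in a shared object are usually harmless at build time, because they resolve when the module is loaded; --no-undefined turns them into hard link failures. A minimal sketch that reproduces the pattern outside of numpy (the file names and flags here are a hypothetical test, not part of the numpy build; it assumes gcc and a glibc where these symbols live in libm):

import os
import subprocess
import tempfile

# A tiny shared object that calls libm functions, linked with and
# without -lm under -Wl,--no-undefined.
src = '#include <math.h>\ndouble f(double x) { return log(x) + sqrt(x); }\n'
tmpdir = tempfile.mkdtemp()
cfile = os.path.join(tmpdir, 'uses_libm.c')
with open(cfile, 'w') as f:
    f.write(src)

for extra in ([], ['-lm']):
    cmd = ['gcc', '-shared', '-fPIC', '-Wl,--no-undefined',
           cfile, '-o', os.path.join(tmpdir, 'uses_libm.so')] + extra
    rc = subprocess.call(cmd)
    print('%s -> exit status %d' % (' '.join(cmd), rc))

If that reproduces, adding -lm to the link command (or building without the distribution's -Wl,--no-undefined LDFLAGS) would be the thing to try.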
The fenv.h file is installed in different directories /usr/include /usr/local/src/numpy-1.7.1/numpy/core/include/numpy/fenv/fenv.h and /usr/local/src/numpy-1.7.1/build/py3k/numpy/core/include/numpy/fenv/fenv.h Using strace, one realizes that there is a call for "build/lib.linux-x86_64-3.2/numpy/numarray/_capi.cpython-32mu.so" that generates a "No such file or directory". The end of the output of the "strace python3 setup.py build" command is posted below as well. Indeed no such library exists, should it? Does anybody understand the problem or have a hint to give? Thanks. Francesco Output after executing: python3 setup.py build creating build/temp.linux-x86_64-3.2/numpy/lib creating build/temp.linux-x86_64-3.2/numpy/lib/src compile options: '-Inumpy/core/include -Ibuild/src.linux-x86_64-3.2/numpy/core/include/numpy -Inumpy/core/src/private -Inumpy/core/src -Inumpy/core -Inumpy/core/src/npymath -Inumpy/core/src/multiarray -Inumpy/core/src/umath -Inumpy/core/src/npysort -Inumpy/core/include -I/usr/include/python3.2mu -Ibuild/src.linux-x86_64-3.2/numpy/core/src/multiarray -Ibuild/src.linux-x86_64-3.2/numpy/core/src/umath -c' gcc: numpy/lib/src/_compiled_base.c gcc -pthread -shared -Wl,--as-needed -Wl,--no-undefined -Wl,-z,relro -Wl,-O1 -Wl,--build-id -Wl,--enable-new-dtags build/temp.linux-x86_64-3.2/numpy/lib/src/_compiled_base.o -L/usr/lib64 -Lbuild/temp.linux-x86_64-3.2 -lpython3.2mu -o build/lib.linux-x86_64-3.2/numpy/lib/_compiled_base.cpython-32mu.so building 'numpy.numarray._capi' extension compiling C sources C compiler: gcc -pthread -DDYNAMIC_ANNOTATIONS_ENABLED=1 -DNDEBUG -O2 -g -frecord-gcc-switches -Wstrict-aliasing=2 -pipe -Wformat -Werror=format-security -Wp,-D_FORTIFY_SOURCE=2 -fstack-protector --param=ssp-buffer-size=4 -fPIC -g -O2 -g -frecord-gcc-switches -Wstrict-aliasing=2 -pipe -Wformat -Werror=format-security -Wp,-D_FORTIFY_SOURCE=2 -fstack-protector --param=ssp-buffer-size=4 -fPIC -fPIC creating build/temp.linux-x86_64-3.2/numpy/numarray compile options: '-Inumpy/core/include -Ibuild/src.linux-x86_64-3.2/numpy/core/include/numpy -Inumpy/core/src/private -Inumpy/core/src -Inumpy/core -Inumpy/core/src/npymath -Inumpy/core/src/multiarray -Inumpy/core/src/umath -Inumpy/core/src/npysort -Inumpy/core/include -I/usr/include/python3.2mu -Ibuild/src.linux-x86_64-3.2/numpy/core/src/multiarray -Ibuild/src.linux-x86_64-3.2/numpy/core/src/umath -c' gcc: numpy/numarray/_capi.c numpy/numarray/_capi.c: In function 'deferred_libnumarray_init': numpy/numarray/_capi.c:31:5: warning: dereferencing type-punned pointer might break strict-aliasing rules [-Wstrict-aliasing] numpy/numarray/_capi.c: In function 'NA_callCUFuncCore': numpy/numarray/_capi.c:579:5: warning: dereferencing type-punned pointer might break strict-aliasing rules [-Wstrict-aliasing] numpy/numarray/_capi.c: In function 'NA_new_cfunc': numpy/numarray/_capi.c:1018:5: warning: dereferencing type-punned pointer might break strict-aliasing rules [-Wstrict-aliasing] numpy/numarray/_capi.c: In function 'NA_OutputArray': numpy/numarray/_capi.c:1104:5: warning: passing argument 2 of '(int (*)(struct PyArrayObject *, struct PyArrayObject *))*(PyArray_API + 2296u)' from incompatible pointer type [enabled by default] numpy/numarray/_capi.c:1104:5: note: expected 'struct PyArrayObject *' but argument is of type 'struct PyObject *'
numpy/numarray/_capi.c: In function 'NA_callStrideConvCFuncCore': numpy/numarray/_capi.c:2644:5: warning: dereferencing type-punned pointer might break strict-aliasing rules [-Wstrict-aliasing] numpy/numarray/_capi.c: In function 'NA_IeeeMask32': numpy/numarray/_capi.c:3073:5: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing] numpy/numarray/_capi.c: In function 'NA_IeeeMask64': numpy/numarray/_capi.c:3117:5: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing] gcc -pthread -shared -Wl,--as-needed -Wl,--no-undefined -Wl,-z,relro -Wl,-O1 -Wl,--build-id -Wl,--enable-new-dtags build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o -L/usr/lib64 -Lbuild/temp.linux-x86_64-3.2 -lpython3.2mu -o build/lib.linux-x86_64-3.2/numpy/numarray/_capi.cpython-32mu.so build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `NA_checkFPErrors': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:3019: undefined reference to `fetestexcept' /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:3026: undefined reference to `feclearexcept' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `int_overflow_error': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:263: undefined reference to `feraiseexcept' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_atanh':
/usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:233: undefined reference to `log' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_asinh': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:228: undefined reference to `log' /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:228: undefined reference to `sqrt' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_acosh': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:213: undefined reference to `sqrt' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_log': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:190: undefined reference to `log' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_acosh': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:213: undefined reference to `log' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_pow': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:207: undefined reference to `pow' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_log10': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:198: undefined reference to `log10' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_round': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:239: undefined reference to `ceil' /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:239: undefined reference to `floor' collect2: ld returned 1 exit status error: Command "gcc -pthread -shared -Wl,--as-needed -Wl,--no-undefined -Wl,-z,relro -Wl,-O1 -Wl,--build-id -Wl,--enable-new-dtags build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o -L/usr/lib64 -Lbuild/temp.linux-x86_64-3.2 -lpython3.2mu -o build/lib.linux-x86_64-3.2/numpy/numarray/_capi.cpython-32mu.so" failed with exit status 1 Output of "strace python3 setup.py build" stat("build/lib.linux-x86_64-3.2/numpy/core/umath_tests.cpython-32mu.so", {st_mode=S_IFREG|0755, st_size=54991, ...}) = 0 stat("build/lib.linux-x86_64-3.2/numpy/core/umath_tests.cpython-32mu.so", {st_mode=S_IFREG|0755, st_size=54991, ...}) = 0 stat("build/src.linux-x86_64-3.2/numpy/core/src/umath/umath_tests.c", {st_mode=S_IFREG|0644, st_size=14253, ...}) = 0 stat("build/src.linux-x86_64-3.2/numpy/core/src/umath/umath_tests.c", {st_mode=S_IFREG|0644, st_size=14253, ...}) = 0 stat("build/lib.linux-x86_64-3.2/numpy/core/multiarray_tests.cpython-32mu.so", {st_mode=S_IFREG|0755, st_size=56661, ...}) = 0 stat("build/lib.linux-x86_64-3.2/numpy/core/multiarray_tests.cpython-32mu.so", {st_mode=S_IFREG|0755, st_size=56661, ...}) = 0 stat("build/src.linux-x86_64-3.2/numpy/core/src/multiarray/multiarray_tests.c", {st_mode=S_IFREG|0644, st_size=14087, ...}) = 0 stat("build/src.linux-x86_64-3.2/numpy/core/src/multiarray/multiarray_tests.c", {st_mode=S_IFREG|0644, st_size=14087, ...}) = 0 stat("build/lib.linux-x86_64-3.2/numpy/lib/_compiled_base.cpython-32mu.so", {st_mode=S_IFREG|0755, st_size=100309, ...}) = 0 stat("build/lib.linux-x86_64-3.2/numpy/lib/_compiled_base.cpython-32mu.so", {st_mode=S_IFREG|0755, st_size=100309, ...}) = 0 stat("numpy/lib/src/_compiled_base.c", {st_mode=S_IFREG|0644, st_size=50102, ...}) = 0 stat("numpy/lib/src/_compiled_base.c", {st_mode=S_IFREG|0644, st_size=50102, ...}) = 0 stat("build/lib.linux-x86_64-3.2/numpy/numarray/_capi.cpython-32mu.so", 0x7fff78c15920) = -1 ENOENT (No such file or directory) write(1, "\33[39mbuilding 'numpy.numarray._c"..., 51building 'numpy.numarray._capi' extension ) = 51 write(1, 
"\33[39mcompiling C sources\33[0m\n", 29compiling C sources ) = 29 write(1, "\33[39mC compiler: gcc -pthread -D"..., 408C compiler: gcc -pthread -DDYNAMIC_ANNOTATIONS_ENABLED=1 -DNDEBUG -O2 -g -frecord-gcc-switches -Wstrict-aliasing=2 -pipe -Wformat -Werror=format-security -Wp,-D_FORTIFY_SOURCE=2 -fstack-protector --param=ssp-buffer-size=4 -fPIC -g -O2 -g -frecord-gcc-switches -Wstrict-aliasing=2 -pipe -Wformat -Werror=format-security -Wp,-D_FORTIFY_SOURCE=2 -fstack-protector --param=ssp-buffer-size=4 -fPIC -fPIC ) = 408 write(1, "\n", 1 ) = 1 stat("build/temp.linux-x86_64-3.2/numpy/numarray", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0 write(1, "\33[39mcompile options: '-Inumpy/c"..., 415compile options: '-Inumpy/core/include -Ibuild/src.linux-x86_64-3.2/numpy/core/include/numpy -Inumpy/core/src/private -Inumpy/core/src -Inumpy/core -Inumpy/core/src/npymath -Inumpy/core/src/multiarray -Inumpy/core/src/umath -Inumpy/core/src/npysort -Inumpy/core/include -I/usr/include/python3.2mu -Ibuild/src.linux-x86_64-3.2/numpy/core/src/multiarray -Ibuild/src.linux-x86_64-3.2/numpy/core/src/umath -c' ) = 415 write(1, "\33[39mgcc: numpy/numarray/_capi.c"..., 37gcc: numpy/numarray/_capi.c ) = 37 getcwd("/usr/local/src/numpy-1.7.1/build/py3k", 1026) = 38 getcwd("/usr/local/src/numpy-1.7.1/build/py3k", 1026) = 38 open("/tmp/tmpicyzqn/npoyom", O_RDWR|O_CREAT|O_EXCL|O_NOFOLLOW, 0600) = 3 fcntl(3, F_GETFD) = 0 fcntl(3, F_SETFD, FD_CLOEXEC) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=0, ...}) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c13e48) = -1 ENOTTY (Inappropriate ioctl for device) fstat(3, {st_mode=S_IFREG|0600, st_size=0, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c13bf8) = -1 ENOTTY (Inappropriate ioctl for device) lseek(3, 0, SEEK_CUR) = 0 lseek(3, 0, SEEK_CUR) = 0 close(3) = 0 open("/tmp/tmpicyzqn/w8jsoc", O_RDWR|O_CREAT|O_EXCL|O_NOFOLLOW, 0600) = 3 fcntl(3, F_GETFD) = 0 fcntl(3, F_SETFD, FD_CLOEXEC) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=0, ...}) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c13e48) = -1 ENOTTY (Inappropriate ioctl for device) fstat(3, {st_mode=S_IFREG|0600, st_size=0, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c13bf8) = -1 ENOTTY (Inappropriate ioctl for device) lseek(3, 0, SEEK_CUR) = 0 lseek(3, 0, SEEK_CUR) = 0 close(3) = 0 rt_sigaction(SIGINT, {SIG_IGN, [], SA_RESTORER, 0x7f6f35d55250}, {0x7f6f363dfe70, [], SA_RESTORER, 0x7f6f35d55250}, 8) = 0 rt_sigaction(SIGQUIT, {SIG_IGN, [], SA_RESTORER, 0x7f6f35d55250}, {SIG_DFL, [], SA_RESTORER, 0x7f6f35d55250}, 8) = 0 rt_sigprocmask(SIG_BLOCK, [CHLD], [], 8) = 0 clone(child_stack=0, flags=CLONE_PARENT_SETTID|SIGCHLD, parent_tidptr=0x7fff78c1475c) = 24562 wait4(24562, numpy/numarray/_capi.c: In function ?deferred_libnumarray_init?: numpy/numarray/_capi.c:31:5: warning: dereferencing type-punned pointer might break strict-aliasing rules [-Wstrict-aliasing] numpy/numarray/_capi.c: In function ?NA_callCUFuncCore?: numpy/numarray/_capi.c:579:5: warning: dereferencing type-punned pointer might break strict-aliasing rules [-Wstrict-aliasing] numpy/numarray/_capi.c: In function ?NA_new_cfunc?: numpy/numarray/_capi.c:1018:5: warning: dereferencing type-punned pointer might break strict-aliasing rules [-Wstrict-aliasing] numpy/numarray/_capi.c: In function ?NA_OutputArray?: numpy/numarray/_capi.c:1104:5: warning: 
passing argument 2 of ?(int (*)(struct PyArrayObject *, struct PyArrayObject *))*(PyArray_API + 2296u)? from incompatible pointer type [enabled by default] numpy/numarray/_capi.c:1104:5: note: expected ?struct PyArrayObject *? but argument is of type ?struct PyObject *? numpy/numarray/_capi.c: In function ?NA_callStrideConvCFuncCore?: numpy/numarray/_capi.c:2644:5: warning: dereferencing type-punned pointer might break strict-aliasing rules [-Wstrict-aliasing] numpy/numarray/_capi.c: In function ?NA_IeeeMask32?: numpy/numarray/_capi.c:3073:5: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing] numpy/numarray/_capi.c: In function ?NA_IeeeMask64?: numpy/numarray/_capi.c:3117:5: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing] [{WIFEXITED(s) && WEXITSTATUS(s) == 0}], 0, NULL) = 24562 rt_sigaction(SIGINT, {0x7f6f363dfe70, [], SA_RESTORER, 0x7f6f35d55250}, NULL, 8) = 0 rt_sigaction(SIGQUIT, {SIG_DFL, [], SA_RESTORER, 0x7f6f35d55250}, NULL, 8) = 0 rt_sigprocmask(SIG_SETMASK, [], NULL, 8) = 0 --- {si_signo=SIGCHLD, si_code=CLD_EXITED, si_pid=24562, si_status=0, si_utime=0, si_stime=0} (Child exited) --- open("/tmp/tmpicyzqn/w8jsoc", O_RDONLY) = 3 fstat(3, {st_mode=S_IFREG|0600, st_size=2, ...}) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c141c8) = -1 ENOTTY (Inappropriate ioctl for device) fstat(3, {st_mode=S_IFREG|0600, st_size=2, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 lseek(3, 0, SEEK_CUR) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=2, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 read(3, "0\n", 8192) = 2 fstat(3, {st_mode=S_IFREG|0600, st_size=2, ...}) = 0 lseek(3, 0, SEEK_CUR) = 2 read(3, "", 8192) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=2, ...}) = 0 lseek(3, 0, SEEK_CUR) = 2 read(3, "", 8192) = 0 close(3) = 0 unlink("/tmp/tmpicyzqn/w8jsoc") = 0 open("/tmp/tmpicyzqn/npoyom", O_RDONLY) = 3 fstat(3, {st_mode=S_IFREG|0600, st_size=1514, ...}) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c141c8) = -1 ENOTTY (Inappropriate ioctl for device) fstat(3, {st_mode=S_IFREG|0600, st_size=1514, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 lseek(3, 0, SEEK_CUR) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=1514, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 read(3, "numpy/numarray/_capi.c: In funct"..., 8192) = 1514 fstat(3, {st_mode=S_IFREG|0600, st_size=1514, ...}) = 0 lseek(3, 0, SEEK_CUR) = 1514 read(3, "", 8192) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=1514, ...}) = 0 lseek(3, 0, SEEK_CUR) = 1514 read(3, "", 8192) = 0 close(3) = 0 unlink("/tmp/tmpicyzqn/npoyom") = 0 stat("build/lib.linux-x86_64-3.2/numpy/numarray/_capi.cpython-32mu.so", 0x7fff78c15290) = -1 ENOENT (No such file or directory) stat("build/lib.linux-x86_64-3.2/numpy/numarray", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0 write(1, "\33[39mgcc -pthread -shared -Wl,--"..., 299gcc -pthread -shared -Wl,--as-needed -Wl,--no-undefined -Wl,-z,relro -Wl,-O1 -Wl,--build-id -Wl,--enable-new-dtags build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o -L/usr/lib64 -Lbuild/temp.linux-x86_64-3.2 -lpython3.2mu -o build/lib.linux-x86_64-3.2/numpy/numarray/_capi.cpython-32mu.so ) = 299 getcwd("/usr/local/src/numpy-1.7.1/build/py3k", 1026) = 38 getcwd("/usr/local/src/numpy-1.7.1/build/py3k", 1026) = 38 open("/tmp/tmpicyzqn/pur_c5", O_RDWR|O_CREAT|O_EXCL|O_NOFOLLOW, 0600) = 3 fcntl(3, F_GETFD) = 0 fcntl(3, F_SETFD, FD_CLOEXEC) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=0, ...}) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or 
SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c14468) = -1 ENOTTY (Inappropriate ioctl for device) fstat(3, {st_mode=S_IFREG|0600, st_size=0, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c14218) = -1 ENOTTY (Inappropriate ioctl for device) lseek(3, 0, SEEK_CUR) = 0 lseek(3, 0, SEEK_CUR) = 0 close(3) = 0 open("/tmp/tmpicyzqn/vscch3", O_RDWR|O_CREAT|O_EXCL|O_NOFOLLOW, 0600) = 3 fcntl(3, F_GETFD) = 0 fcntl(3, F_SETFD, FD_CLOEXEC) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=0, ...}) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c14468) = -1 ENOTTY (Inappropriate ioctl for device) fstat(3, {st_mode=S_IFREG|0600, st_size=0, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c14218) = -1 ENOTTY (Inappropriate ioctl for device) lseek(3, 0, SEEK_CUR) = 0 lseek(3, 0, SEEK_CUR) = 0 close(3) = 0 rt_sigaction(SIGINT, {SIG_IGN, [], SA_RESTORER, 0x7f6f35d55250}, {0x7f6f363dfe70, [], SA_RESTORER, 0x7f6f35d55250}, 8) = 0 rt_sigaction(SIGQUIT, {SIG_IGN, [], SA_RESTORER, 0x7f6f35d55250}, {SIG_DFL, [], SA_RESTORER, 0x7f6f35d55250}, 8) = 0 rt_sigprocmask(SIG_BLOCK, [CHLD], [], 8) = 0 clone(child_stack=0, flags=CLONE_PARENT_SETTID|SIGCHLD, parent_tidptr=0x7fff78c14d7c) = 24570 wait4(24570, build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `NA_checkFPErrors': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:3019: undefined reference to `fetestexcept' /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:3026: undefined reference to `feclearexcept' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `int_overflow_error': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:263: undefined reference to `feraiseexcept' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_atanh': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:233: undefined reference to `log' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_asinh': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:228: undefined reference to `log' /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:228: undefined reference to `sqrt' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_acosh': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:213: undefined reference to `sqrt' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_log': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:190: undefined reference to `log' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_acosh': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:213: undefined reference to `log' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_pow': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:207: undefined reference to `pow' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_log10': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:198: undefined reference to `log10' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_round': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:239: undefined reference to `ceil' /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:239: undefined reference to `floor' collect2: ld returned 1 exit status [{WIFEXITED(s) && WEXITSTATUS(s) == 0}], 0, NULL) = 24570 rt_sigaction(SIGINT, {0x7f6f363dfe70, 
[], SA_RESTORER, 0x7f6f35d55250}, NULL, 8) = 0 rt_sigaction(SIGQUIT, {SIG_DFL, [], SA_RESTORER, 0x7f6f35d55250}, NULL, 8) = 0 rt_sigprocmask(SIG_SETMASK, [], NULL, 8) = 0 --- {si_signo=SIGCHLD, si_code=CLD_EXITED, si_pid=24570, si_status=0, si_utime=0, si_stime=0} (Child exited) --- open("/tmp/tmpicyzqn/vscch3", O_RDONLY) = 3 fstat(3, {st_mode=S_IFREG|0600, st_size=2, ...}) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c147e8) = -1 ENOTTY (Inappropriate ioctl for device) fstat(3, {st_mode=S_IFREG|0600, st_size=2, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 lseek(3, 0, SEEK_CUR) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=2, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 read(3, "1\n", 8192) = 2 fstat(3, {st_mode=S_IFREG|0600, st_size=2, ...}) = 0 lseek(3, 0, SEEK_CUR) = 2 read(3, "", 8192) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=2, ...}) = 0 lseek(3, 0, SEEK_CUR) = 2 read(3, "", 8192) = 0 close(3) = 0 unlink("/tmp/tmpicyzqn/vscch3") = 0 open("/tmp/tmpicyzqn/pur_c5", O_RDONLY) = 3 fstat(3, {st_mode=S_IFREG|0600, st_size=2091, ...}) = 0 ioctl(3, SNDCTL_TMR_TIMEBASE or SNDRV_TIMER_IOCTL_NEXT_DEVICE or TCGETS, 0x7fff78c147e8) = -1 ENOTTY (Inappropriate ioctl for device) fstat(3, {st_mode=S_IFREG|0600, st_size=2091, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 lseek(3, 0, SEEK_CUR) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=2091, ...}) = 0 lseek(3, 0, SEEK_CUR) = 0 read(3, "build/temp.linux-x86_64-3.2/nump"..., 8192) = 2091 fstat(3, {st_mode=S_IFREG|0600, st_size=2091, ...}) = 0 lseek(3, 0, SEEK_CUR) = 2091 read(3, "", 8192) = 0 fstat(3, {st_mode=S_IFREG|0600, st_size=2091, ...}) = 0 lseek(3, 0, SEEK_CUR) = 2091 read(3, "", 8192) = 0 close(3) = 0 unlink("/tmp/tmpicyzqn/pur_c5") = 0 write(1, "build/temp.linux-x86_64-3.2/nump"..., 2090build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `NA_checkFPErrors': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:3019: undefined reference to `fetestexcept' /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:3026: undefined reference to `feclearexcept' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `int_overflow_error': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:263: undefined reference to `feraiseexcept' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_atanh': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:233: undefined reference to `log' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_asinh': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:228: undefined reference to `log' /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:228: undefined reference to `sqrt' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_acosh': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:213: undefined reference to `sqrt' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_log': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:190: undefined reference to `log' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_acosh': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:213: undefined reference to `log' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_pow': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:207: undefined reference to `pow' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_log10': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:198: undefined reference to 
`log10' build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o: In function `num_round': /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:239: undefined reference to `ceil' /usr/local/src/numpy-1.7.1/build/py3k/numpy/numarray/_capi.c:239: undefined reference to `floor' collect2: ld returned 1 exit status) = 2090 write(1, "\n", 1 ) = 1 chdir("/usr/local/src/numpy-1.7.1") = 0 write(2, "error: Command \"gcc -pthread -sh"..., 333error: Command "gcc -pthread -shared -Wl,--as-needed -Wl,--no-undefined -Wl,-z,relro -Wl,-O1 -Wl,--build-id -Wl,--enable-new-dtags build/temp.linux-x86_64-3.2/numpy/numarray/_capi.o -L/usr/lib64 -Lbuild/temp.linux-x86_64-3.2 -lpython3.2mu -o build/lib.linux-x86_64-3.2/numpy/numarray/_capi.cpython-32mu.so" failed with exit status 1 ) = 333 lstat("/tmp/tmpicyzqn", {st_mode=S_IFDIR|0700, st_size=4096, ...}) = 0 open("/tmp/tmpicyzqn", O_RDONLY|O_NONBLOCK|O_DIRECTORY|O_CLOEXEC) = 3 getdents(3, /* 2 entries */, 32768) = 48 getdents(3, /* 0 entries */, 32768) = 0 close(3) = 0 rmdir("/tmp/tmpicyzqn") = 0 rt_sigaction(SIGINT, {SIG_DFL, [], SA_RESTORER, 0x7f6f360bc960}, {0x7f6f363dfe70, [], SA_RESTORER, 0x7f6f35d55250}, 8) = 0 exit_group(1) -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Sat May 4 07:12:36 2013 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 4 May 2013 12:12:36 +0100 Subject: [Numpy-discussion] numpy.scipy.org page 404s In-Reply-To: References: <517A9D98.70003@hilboll.de> <517AA0CF.3000801@hilboll.de> Message-ID: On Sat, May 4, 2013 at 12:43 AM, Pauli Virtanen wrote: > 26.04.2013 18:51, Robert Kern kirjoitti: > [clip] >> Right now, the recurring cost is kicking the www.scipy.org wiki every >> once in a while under the deluge of spam. > > It's dying a slow death again. > > How about restricting editing pages to people in EditorGroup? There > should be an option in Moin config for that. I don't think there is any > other solution to the spam with the Moin wiki at present. Done. -- Robert Kern From bakhtiyor_zokhidov at mail.ru Sat May 4 14:18:09 2013 From: bakhtiyor_zokhidov at mail.ru (Bakhtiyor Zokhidov) Date: Sat, 04 May 2013 22:18:09 +0400 Subject: [Numpy-discussion] "Lists" and "Join" function needed Message-ID: <1367691489.128936894@f226.mail.ru> Hi, I have the following code, which represents the intersected points of each cell between the given two points, A(x0,y0) and B(x1,y1):

from numpy import arange, ceil, floor

def intersected_points(x0, x1, y0, y1):
    # slope
    m = (y1 - y0 )/( x1 - x0)
    # Boundary of the selected points
    x_ceil = ceil( min (x0, x1 ))
    x_floor = floor( max(x0, x1))
    y_ceil = ceil( min(y0, y1))
    y_floor = floor( max(y0, y1))
    # calculate all intersected x coordinates
    ax = []
    for x in arange(x_ceil, x_floor + 1, 1):
        ax.append([ x, m * ( x - x0 ) + y0 ])
    # calculate all intersected y coordinates
    for y in arange(y_ceil, y_floor + 1, 1):
        ax.append([ ( y - y0 ) * ( 1./m ) + x0, y ])
    return ax

Sample values: intersected_points(1.5,4.4,0.5,4.1) Output: [[2.0, 1.1206896551724137], [3.0, 2.3620689655172411], [4.0, 3.6034482758620685], [1.9027777777777779, 1.0], [2.7083333333333335, 2.0], [3.5138888888888893, 3.0], [4.3194444444444446, 4.0]] The output I got is unsorted values, so, for each cell, the coordinates where the line crosses are out of order. BUT, the result I want to get should be something in increasing order like: (x0,y0), (x1,y1), (x2,y2), (x3,y3), where x0,y0 - initial, x1,y1 - final point. Other values are intersected line coordinates! Any answers will be appreciated, -- Bakhtiyor Zokhidov -------------- next part -------------- An HTML attachment was scrubbed... URL:
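One way to get that increasing order is to sort the intersection points by their distance from the line's start point (x0, y0). A minimal sketch of the idea (the helper name is illustrative, not something from this thread):

import numpy as np

def sort_points_along_line(points, x0, y0):
    # order [x, y] pairs by their distance from the start point (x0, y0)
    pts = np.asarray(points, dtype=float)
    d = np.hypot(pts[:, 0] - x0, pts[:, 1] - y0)
    return pts[np.argsort(d)]

# e.g., including the end points of the line themselves:
# sort_points_along_line(intersected_points(1.5, 4.4, 0.5, 4.1)
#                        + [[1.5, 0.5], [4.4, 4.1]], 1.5, 0.5)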
From warren.weckesser at gmail.com Sat May 4 21:24:01 2013 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Sat, 4 May 2013 21:24:01 -0400 Subject: [Numpy-discussion] "Lists" and "Join" function needed In-Reply-To: <1367691489.128936894@f226.mail.ru> References: <1367691489.128936894@f226.mail.ru> Message-ID: On 5/4/13, Bakhtiyor Zokhidov wrote:
>
> Hi,
> I have the following code, which represents the intersected points of each cell between
> the given two points, A(x0,y0) and B(x1,y1):
>
> def intersected_points(x0, x1, y0, y1):
>     # slope
>     m = (y1 - y0 )/( x1 - x0)
>     # Boundary of the selected points
>     x_ceil = ceil( min (x0, x1 ))
>     x_floor = floor( max(x0, x1))
>     y_ceil = ceil( min(y0, y1))
>     y_floor = floor( max(y0, y1))
>     # calculate all intersected x coordinates
>     ax = []
>     for x in arange(x_ceil, x_floor + 1, 1):
>         ax.append([ x, m * ( x - x0 ) + y0 ])
>     # calculate all intersected y coordinates
>     for y in arange(y_ceil, y_floor + 1, 1):
>         ax.append([ ( y - y0 ) * ( 1./m ) + x0, y ])
>     return ax
>
> Sample values: intersected_points(1.5,4.4,0.5,4.1)
> Output: [[2.0, 1.1206896551724137], [3.0, 2.3620689655172411], [4.0, 3.6034482758620685], [1.9027777777777779, 1.0], [2.7083333333333335, 2.0], [3.5138888888888893, 3.0], [4.3194444444444446, 4.0]]
>
> The output I got is unsorted values, so, for each cell, the coordinates where the line crosses are out of order. BUT, the result I want to get should be something in increasing order like: (x0,y0), (x1,y1), (x2,y2), (x3,y3), where x0,y0 - initial, x1,y1 - final point. Other values are intersected line coordinates!
>
> Any answers will be appreciated,
>
> -- Bakhtiyor Zokhidov

You also asked this question on stackoverflow (http://stackoverflow.com/questions/16377826/distance-for-each-intersected-points-of-a-line-in-increased-order-in-2d-coordina). I've posted an answer there. Warren From bakhtiyor_zokhidov at mail.ru Sun May 5 07:39:28 2013 From: bakhtiyor_zokhidov at mail.ru (Bakhtiyor Zokhidov) Date: Sun, 05 May 2013 15:39:28 +0400 Subject: [Numpy-discussion] "Lists" and "Join" function needed In-Reply-To: References: <1367691489.128936894@f226.mail.ru> Message-ID: <1367753968.5695169@f169.mail.ru> Thanks for the answer on StackOverflow. I checked it out. Saturday, 4 May 2013, 21:24 -04:00, from Warren Weckesser : >On 5/4/13, Bakhtiyor Zokhidov < bakhtiyor_zokhidov at mail.ru > wrote: >> >> Hi, >> I have the following code, which represents the intersected points of each cell between >> the given two points, A(x0,y0) and B(x1,y1):
>>
>> def intersected_points(x0, x1, y0, y1):
>>     # slope
>>     m = (y1 - y0 )/( x1 - x0)
>>     # Boundary of the selected points
>>     x_ceil = ceil( min (x0, x1 ))
>>     x_floor = floor( max(x0, x1))
>>     y_ceil = ceil( min(y0, y1))
>>     y_floor = floor( max(y0, y1))
>>     # calculate all intersected x coordinates
>>     ax = []
>>     for x in arange(x_ceil, x_floor + 1, 1):
>>         ax.append([ x, m * ( x - x0 ) + y0 ])
>>     # calculate all intersected y coordinates
>>     for y in arange(y_ceil, y_floor + 1, 1):
>>         ax.append([ ( y - y0 ) * ( 1./m ) + x0, y ])
>>     return ax
>>
>> Sample values: intersected_points(1.5,4.4,0.5,4.1)
>> Output: [[2.0, 1.1206896551724137], [3.0, 2.3620689655172411], [4.0, 3.6034482758620685], [1.9027777777777779, 1.0], [2.7083333333333335, 2.0], [3.5138888888888893, 3.0], [4.3194444444444446, 4.0]]
>>
>> The output I got is unsorted values, so, for each cell, the coordinates where the line crosses are out of order. BUT, the result I want to get should be something in increasing order like: (x0,y0), (x1,y1), (x2,y2), (x3,y3), where x0,y0 - initial, x1,y1 - final point. Other values are intersected line coordinates!
>>
>> Any answers will be appreciated,
>>
>> -- Bakhtiyor Zokhidov

>You also asked this question on stackoverflow >( http://stackoverflow.com/questions/16377826/distance-for-each-intersected-points-of-a-line-in-increased-order-in-2d-coordina ). > I've posted an answer there. > >Warren >_______________________________________________ >NumPy-Discussion mailing list >NumPy-Discussion at scipy.org >http://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Sun May 5 08:34:53 2013 From: pav at iki.fi (Pauli Virtanen) Date: Sun, 05 May 2013 15:34:53 +0300 Subject: [Numpy-discussion] numpy.scipy.org page 404s In-Reply-To: References: <517A9D98.70003@hilboll.de> <517AA0CF.3000801@hilboll.de> Message-ID: 04.05.2013 14:12, Robert Kern kirjoitti: [clip] >> How about restricting editing pages to people in EditorGroup? There >> should be an option in Moin config for that. I don't think there is any >> other solution to the spam with the Moin wiki at present. > > Done. How do we go about removing the spam pages? They seem to be hurting the site's performance quite a lot --- for instance, http://scipy.org/Cookbook doesn't load currently. A good assumption is probably that new pages created in the past couple of weeks are all spam... -- Pauli Virtanen From robert.kern at gmail.com Sun May 5 08:42:26 2013 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 5 May 2013 13:42:26 +0100 Subject: [Numpy-discussion] numpy.scipy.org page 404s In-Reply-To: References: <517A9D98.70003@hilboll.de> <517AA0CF.3000801@hilboll.de> Message-ID: On Sun, May 5, 2013 at 1:34 PM, Pauli Virtanen wrote: > 04.05.2013 14:12, Robert Kern kirjoitti: > [clip] >>> How about restricting editing pages to people in EditorGroup? There >>> should be an option in Moin config for that. I don't think there is any >>> other solution to the spam with the Moin wiki at present. >> >> Done. > > How do we go about removing the spam pages? They seem to be hurting the > site's performance quite a lot --- for instance, > http://scipy.org/Cookbook doesn't load currently. > > A good assumption is probably that new pages created in the past couple > of weeks are all spam... Manually. I have a whitelist of known-good pages that helps narrow it down. I'll take care of it in a few hours.
-- Robert Kern From pav at iki.fi Sun May 5 08:49:58 2013 From: pav at iki.fi (Pauli Virtanen) Date: Sun, 05 May 2013 15:49:58 +0300 Subject: [Numpy-discussion] numpy.scipy.org page 404s In-Reply-To: References: <517A9D98.70003@hilboll.de> <517AA0CF.3000801@hilboll.de> Message-ID: 05.05.2013 15:42, Robert Kern kirjoitti: [clip: spam] > Manually. I have a whitelist of known-good pages that helps narrow it > down. I'll take care of it in a few hours. Thanks a lot! Cheers, Pauli From robert.kern at gmail.com Sun May 5 11:01:26 2013 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 5 May 2013 16:01:26 +0100 Subject: [Numpy-discussion] numpy.scipy.org page 404s In-Reply-To: References: <517A9D98.70003@hilboll.de> <517AA0CF.3000801@hilboll.de> Message-ID: On Sun, May 5, 2013 at 1:49 PM, Pauli Virtanen wrote: > 05.05.2013 15:42, Robert Kern kirjoitti: > [clip: spam] >> Manually. I have a whitelist of known-good pages that helps narrow it >> down. I'll take care of it in a few hours. > > Thanks a lot! Done. -- Robert Kern From cournape at gmail.com Sun May 5 17:57:45 2013 From: cournape at gmail.com (David Cournapeau) Date: Sun, 5 May 2013 22:57:45 +0100 Subject: [Numpy-discussion] Profiling (was GSoC : Performance parity between numpy arrays and Python scalars) In-Reply-To: References: Message-ID: On Thu, May 2, 2013 at 2:58 PM, Nathaniel Smith wrote: > On Thu, May 2, 2013 at 9:25 AM, David Cournapeau wrote: >>> * Re: the profiling, I wrote a full oprofile->callgrind format script >>> years ago: http://vorpus.org/~njs/op2calltree.py >>> Haven't used it in years either but neither oprofile nor kcachegrind >>> are terribly fast-moving projects so it's probably still working, or >>> could be made so without much work. >>> Or easier is to use the gperftools CPU profiler: >>> https://gperftools.googlecode.com/svn/trunk/doc/cpuprofile.html >> >> I don't have experience with gperftools, but on recent linux kernels, >> you can also use perf, which can't be made easier to use (no runtime >> support needed), but you need a 'recent' kernel >> >> http://indico.cern.ch/getFile.py/access?contribId=20&sessionId=4&resId=0&materialId=slides&confId=141309 >> >> I am hoping to talk a bit about those for our diving into numpy c code >> tutorial in June, what's the + of gperf in your opinion ? > > For what I've used profiling for, THE key feature is proper callgraph > support ("show me the *total* time spent in each function, including > callees"). Otherwise, silly example, let's say you have a bug where > you wrote: > > func1() { > for (i = 0; i < 10000000; i++) > foo = add(foo, bar[0]) > } > > Obviously this is a waste of time, since you're actually performing > the same operation over and over. Many profilers, given this, will > tell you that all the time is spent in 'add', which is useless, > because you don't want to speed up 'add', you want to speed up 'func1' > (probably by not calling 'add' so many times!). If you have relatively > flat code like most kernel code this isn't an issue, but I generally > don't... > > perf is a fabulous framework and doesn't have any way to get full > callgraph information out so IME it's been useless. They have > reporting modes that claim to (like some "fractal" thing?) but AFAI > been able to tell from docs/googling/mailing lists, there is nobody > who understands how to interpret this output except the people who > wrote it. Really a shame that it falls down in the last mile like > that, hopefully they will fix this soon. 
Perf doc is written for Vulcan, but it does what I think you want, say:

void work(int n) {
    volatile int i=0; //don't optimize away
    while(i++ < n);
}
void easy() { work(1000 * 1000 * 50); }
void hard() { work(1000*1000*1000); }
int main() { easy(); hard(); }

compile with gcc -g -O0, and then:

perf record -g -a -- ./a.out
perf report -g -a --stdio

gives me

 95.22% a.out a.out
 [.] work
 |
 --- work
    |
    |--89.84%-- hard
    |          main
    |          __libc_start_main
    |
     --5.38%-- easy
               main
               __libc_start_main

or maybe even better with the -G option

 95.22% a.out a.out
 [.] work
 |
 --- __libc_start_main
     main
    |
    |--94.35%-- hard
    |          work
    |
     --5.65%-- easy
               work

I agree it would be better with a kcachegrind-like GUI, but it looks like it is coming. I don't really understand the ncurses frontend to the call graph. (example taken/adapted from the great Yosefk blog: http://www.yosefk.com/blog/how-profilers-lie-the-cases-of-gprof-and-kcachegrind.html) cheers, David From njs at pobox.com Sun May 5 19:26:25 2013 From: njs at pobox.com (Nathaniel Smith) Date: Sun, 5 May 2013 19:26:25 -0400 Subject: [Numpy-discussion] Profiling (was GSoC : Performance parity between numpy arrays and Python scalars) In-Reply-To: References: Message-ID: On Sun, May 5, 2013 at 5:57 PM, David Cournapeau wrote: >> perf is a fabulous framework and doesn't have any way to get full >> callgraph information out so IME it's been useless. They have >> reporting modes that claim to (like some "fractal" thing?) but AFAI >> been able to tell from docs/googling/mailing lists, there is nobody >> who understands how to interpret this output except the people who >> wrote it. Really a shame that it falls down in the last mile like >> that, hopefully they will fix this soon.
>
> Perf doc is written for Vulcan, but it does what I think you want, say:
>
> void work(int n) {
>     volatile int i=0; //don't optimize away
>     while(i++ < n);
> }
> void easy() { work(1000 * 1000 * 50); }
> void hard() { work(1000*1000*1000); }
> int main() { easy(); hard(); }
>
> compile with gcc -g -O0, and then:
>
> perf record -g -a -- ./a.out
> perf report -g -a --stdio
>
> gives me
>
>  95.22% a.out a.out
>  [.] work
>  |
>  --- work
>     |
>     |--89.84%-- hard
>     |          main
>     |          __libc_start_main
>     |
>      --5.38%-- easy
>                main
>                __libc_start_main
>
> or maybe even better with the -G option
>
>  95.22% a.out a.out
>  [.] work
>  |
>  --- __libc_start_main
>      main
>     |
>     |--94.35%-- hard
>     |          work
>     |
>      --5.65%-- easy
>                work
>
Yeah I've seen these displays before and I can see the information is there, and (knowing the code you ran) that somehow the first number has to do with the time spent under 'hard' and the second to do with time spent under 'easy', but I have no idea how to generalize this to arbitrary samples of these output formats. That's what I meant. -n From daniele at grinta.net Mon May 6 04:51:21 2013 From: daniele at grinta.net (Daniele Nicolodi) Date: Mon, 06 May 2013 10:51:21 +0200 Subject: [Numpy-discussion] numpy ring buffer Message-ID: <51876F09.4090608@grinta.net> Hello, numpy arrays are great for interfacing python with libraries that expect continuous memory buffers for data passing. However, libraries interfacing to data acquisition hardware often use those buffers as ring buffers where, once the buffer has been filled with data, new data will be written overwriting the data at the beginning of the buffer. The application is supposed to have read this data meanwhile. Efficiently processing the data in the ring buffer requires addressing efficiently the content of the ring buffer.
In C or Cython this is very easy, just compute the wrap around when accessing the single elements of the array:

buffer = np.empty(size)
adc.setup(buffer)
start = 0
for n in adc.read():
    # data processing
    for i in range(start, start + n):
        element = buffer[i % size]
        ....
    start += n

My current approach to do the same thing in Python is to use the np.roll() function to "linearize" the buffer access:

buffer = np.empty(size)
adc.setup(buffer)
start = 0
for n in adc.read():
    data = np.roll(buffer, -start)[:n]
    start += n
    # data processing
    process(data)

Since np.roll() returns a view on the array I suppose this is very efficient. Does anyone have a better idea on how to do this? Thank you. Cheers, Daniele From robert.kern at gmail.com Mon May 6 05:01:37 2013 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 6 May 2013 10:01:37 +0100 Subject: [Numpy-discussion] numpy ring buffer In-Reply-To: <51876F09.4090608@grinta.net> References: <51876F09.4090608@grinta.net> Message-ID: On Mon, May 6, 2013 at 9:51 AM, Daniele Nicolodi wrote: > Hello, > > numpy arrays are great for interfacing python with libraries that expect > continuous memory buffers for data passing. However, libraries > interfacing to data acquisition hardware often use those buffers as ring > buffers where, once the buffer has been filled with data, new data will > be written overwriting the data at the beginning of the buffer. The > application is supposed to have read this data meanwhile. > > Efficiently processing the data in the ring buffer requires addressing > efficiently the content of the ring buffer. In C or Cython this is very > easy, just compute the wrap around when accessing the single elements of > the array:
>
> buffer = np.empty(size)
> adc.setup(buffer)
> start = 0
> for n in adc.read():
>     # data processing
>     for i in range(start, start + n):
>         element = buffer[i % size]
>         ....
>     start += n
>
> My current approach to do the same thing in Python is to use the > np.roll() function to "linearize" the buffer access:
>
> buffer = np.empty(size)
> adc.setup(buffer)
> start = 0
> for n in adc.read():
>     data = np.roll(buffer, -start)[:n]
>     start += n
>     # data processing
>     process(data)
>
> Since np.roll() returns a view on the array I suppose this is very > efficient. Does anyone have a better idea on how to do this? np.roll() copies all of the data every time. It does not return a view. Try a function like this instead:

[~]
|5> def ring_window(buffer, start, n):
..>     length = len(buffer)
..>     start %= length
..>     end = start + n
..>     if end <= length:
..>         window = buffer[start:end]
..>     else:
..>         end %= length
..>         window = np.concatenate((buffer[start:length], buffer[0:end]))
..>     return end, window
..>
[~]
|6> buffer = np.arange(20)
[~]
|7> start = 0
[~]
|8> for i in range(5):
..>     start, window = ring_window(buffer, start, 6)
..>     print start, window
..>
6 [0 1 2 3 4 5]
12 [ 6 7 8 9 10 11]
18 [12 13 14 15 16 17]
4 [18 19 0 1 2 3]
10 [4 5 6 7 8 9]

-- Robert Kern From cournape at gmail.com Mon May 6 05:28:02 2013 From: cournape at gmail.com (David Cournapeau) Date: Mon, 6 May 2013 10:28:02 +0100 Subject: [Numpy-discussion] Any plans for windows 64-bit installer for 1.7? In-Reply-To: References: <51119007.6090806@uci.edu> <5113399F.3090803@astro.uio.no> Message-ID: On Thu, Feb 7, 2013 at 6:21 AM, Ondřej Čertík wrote: > On Wed, Feb 6, 2013 at 9:20 PM, Dag Sverre Seljebotn > wrote: >> On 02/07/2013 12:16 AM, Matthew Brett wrote: > [...] >>> Can you clarify the people you think will get stuck?
I think I'm >>> right in saying that anyone with a C extension should be able to build >>> them against numpy, by installing the free (as-in-beer) MS tools? So >>> do you just mean people needing a Fortran compiler? That's a small >>> constituency, I think. >> >> Off the top of my head there's SciPy and pymc... >> >> Anyway, I'm butting in because I wish this discussion could separate >> between the user perspective and the developer perspective. >> >> FWIW, >> >> 1) From a user's perspective, I don't understand this either. If you are >> already using a closed source, not-free-as-in-beer operating system, why >> would you not use (or buy!) a closed source, not-free-as-in-beer Fortran >> compiler? > > Indeed. Though I really have no clue on the Windows use cases. Maybe > most Windows users don't want to compile anything, just > use numpy and scipy from Python? > >> >> 2) BUT, the argument I've seen that I can at least understand is that >> the release manager should be able to do a release using only open >> source tools (even using Wine instead of Windows) and not rely on a >> limited number of licenses. And that the release manager must be able to >> perform all the official builds directly. > > As the release manager, I really only have two requirements: > > * I want to ssh in there from my Ubuntu > * I want to automate the whole process > > For Mac, linux and Wine I can do that. So I have just spend few hours > browsing the net and it looks like that the combination of Windows > PowerShell 2.0: > > http://en.wikipedia.org/wiki/Windows_PowerShell > > and some SSH server, there are quite a few, one commercial but free > for one user one connection (perfect for me!): > > http://www.powershellinside.com/powershell/ssh/ > > So if I understand the pages correctly, I can login there from linux, > and then I use the PowerShell commands to script anything. It looks > like I can even use my Fabric fabfiles with powershell: > > https://gist.github.com/diyan/2850866 > > I can also use git with PowerShell: > > http://windows.github.com/ > http://haacked.com/archive/2011/12/13/better-git-with-powershell.aspx Ondrej, you may be interested in some hack I've done to use winrm from fabric: https://github.com/fabric/fabric/pull/872 It gives a new winrm_run function where you can put any batch command. While the code is a hack, it works pretty well in practice. This works from mac os x and linux, without the need for wine, or ssh on windows. David From daniele at grinta.net Mon May 6 05:39:15 2013 From: daniele at grinta.net (Daniele Nicolodi) Date: Mon, 06 May 2013 11:39:15 +0200 Subject: [Numpy-discussion] numpy ring buffer In-Reply-To: References: <51876F09.4090608@grinta.net> Message-ID: <51877A43.5010601@grinta.net> On 06/05/2013 11:01, Robert Kern wrote: > np.roll() copies all of the data every time. It does not return a > view. Are you sure about that? 
Either I'm missing something, or it returns a view in my testing (with a fairly old numpy, though):

In [209]: np.__version__
Out[209]: '1.6.2'

In [210]: v1 = np.arange(10)

In [211]: v1.flags['OWNDATA']
Out[211]: True

In [212]: v2 = np.roll(v1, -1)

In [213]: v2.flags['OWNDATA']
Out[213]: False

Cheers, Daniele From sebastian at sipsolutions.net Mon May 6 05:50:29 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 06 May 2013 11:50:29 +0200 Subject: [Numpy-discussion] numpy ring buffer In-Reply-To: <51877A43.5010601@grinta.net> References: <51876F09.4090608@grinta.net> <51877A43.5010601@grinta.net> Message-ID: <1367833829.2506.5.camel@sebastian-laptop> On Mon, 2013-05-06 at 11:39 +0200, Daniele Nicolodi wrote: > On 06/05/2013 11:01, Robert Kern wrote: > > np.roll() copies all of the data every time. It does not return a > > view. > > Are you sure about that? Either I'm missing something, or it returns a > view in my testing (with a fairly old numpy, though):
>
> In [209]: np.__version__
> Out[209]: '1.6.2'
>
> In [210]: v1 = np.arange(10)
>
> In [211]: v1.flags['OWNDATA']
> Out[211]: True
>
> In [212]: v2 = np.roll(v1, -1)
>
> In [213]: v2.flags['OWNDATA']
> Out[213]: False
>
Don't trust owndata in that regard... since it returns a view, but a view into a copy. For example if you have subclasses that can be very common. Try np.may_share_memory(v1, v2) for example. - Sebastian > > Cheers, > Daniele > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion >
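Sebastian's check settles the question in a couple of lines. A minimal sketch of it:

import numpy as np

v1 = np.arange(10)
v2 = np.roll(v1, -1)
# OWNDATA is misleading here; may_share_memory shows that
# v2 does not alias v1, so np.roll made a copy
print(np.may_share_memory(v1, v2))   # False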
From daniele at grinta.net Mon May 6 05:52:08 2013 From: daniele at grinta.net (Daniele Nicolodi) Date: Mon, 06 May 2013 11:52:08 +0200 Subject: [Numpy-discussion] numpy ring buffer In-Reply-To: <51877A43.5010601@grinta.net> References: <51876F09.4090608@grinta.net> <51877A43.5010601@grinta.net> Message-ID: <51877D48.5000406@grinta.net> On 06/05/2013 11:39, Daniele Nicolodi wrote: > On 06/05/2013 11:01, Robert Kern wrote: >> np.roll() copies all of the data every time. It does not return a >> view. > > Are you sure about that? Either I'm missing something, or it returns a > view in my testing (with a fairly old numpy, though): Ops... Yes, I missed something: np.roll() returns a view, but not a view on the original array, which is indeed copied. Your method however also copies the data in a temporary buffer for operating on it as a single chunk of data. It just reduces the copy to the interesting region of the buffer array. It looks like there is no way to use numpy to operate on a ring buffer without copying the data, and in my use case I would like to avoid the copying. I'll write it in Cython... Cheers, Daniele From robert.kern at gmail.com Mon May 6 05:52:10 2013 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 6 May 2013 10:52:10 +0100 Subject: [Numpy-discussion] numpy ring buffer In-Reply-To: <51877A43.5010601@grinta.net> References: <51876F09.4090608@grinta.net> <51877A43.5010601@grinta.net> Message-ID: On Mon, May 6, 2013 at 10:39 AM, Daniele Nicolodi wrote: > On 06/05/2013 11:01, Robert Kern wrote: >> np.roll() copies all of the data every time. It does not return a >> view. > > Are you sure about that? Either I'm missing something, or it returns a > view in my testing (with a fairly old numpy, though):
>
> In [209]: np.__version__
> Out[209]: '1.6.2'
>
> In [210]: v1 = np.arange(10)
>
> In [211]: v1.flags['OWNDATA']
> Out[211]: True
>
> In [212]: v2 = np.roll(v1, -1)
>
> In [213]: v2.flags['OWNDATA']
> Out[213]: False

It may return a view on something, but it isn't a view on the original array. It can't be, because the rolled result cannot be represented as uniformly-strided memory access on the original data. Check the source. https://github.com/numpy/numpy/blob/master/numpy/core/numeric.py#L1173 The reason you see a "view" is that final `reshape()` call. But it first copies the original data using `take()`. -- Robert Kern From robert.kern at gmail.com Mon May 6 05:57:20 2013 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 6 May 2013 10:57:20 +0100 Subject: [Numpy-discussion] numpy ring buffer In-Reply-To: <51877D48.5000406@grinta.net> References: <51876F09.4090608@grinta.net> <51877A43.5010601@grinta.net> <51877D48.5000406@grinta.net> Message-ID: On Mon, May 6, 2013 at 10:52 AM, Daniele Nicolodi wrote: > On 06/05/2013 11:39, Daniele Nicolodi wrote: >> On 06/05/2013 11:01, Robert Kern wrote: >>> np.roll() copies all of the data every time. It does not return a >>> view. >> >> Are you sure about that? Either I'm missing something, or it returns a >> view in my testing (with a fairly old numpy, though): > > Ops... Yes, I missed something: np.roll() returns a view, but not a view > on the original array, which is indeed copied. > > Your method however also copies the data in a temporary buffer for > operating on it as a single chunk of data. It just reduces the copy to > the interesting region of the buffer array. Yes, but only in the one case where the window overlaps past the end. The tradeoff depends on the window size and the buffer size. Alternately, you can write your processing code to work with one or two chunks, then you don't need to do any copying. -- Robert Kern
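For completeness, one well-known way to avoid the copy altogether, not proposed in this thread and only a sketch, is to allocate a buffer of twice the size and write every sample into both halves; any window of up to `size` samples is then a contiguous view:

import numpy as np

class MirroredRing(object):
    """Sketch: ring buffer backed by a doubled array (names illustrative)."""
    def __init__(self, size, dtype=float):
        self.size = size
        self._buf = np.zeros(2 * size, dtype=dtype)
        self._pos = 0
    def append(self, value):
        # write each sample into both halves to keep them mirrored
        self._buf[self._pos] = value
        self._buf[self._pos + self.size] = value
        self._pos = (self._pos + 1) % self.size
    def window(self, n):
        # contiguous, copy-free view of the last n samples (n <= size;
        # assumes at least n samples have been written)
        start = (self._pos - n) % self.size
        return self._buf[start:start + n]

The price is one extra write per sample; every read is then a plain slice, with no concatenation and no copy.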
From bakhtiyor_zokhidov at mail.ru Mon May 6 08:24:44 2013 From: bakhtiyor_zokhidov at mail.ru (Bakhtiyor Zokhidov) Date: Mon, 06 May 2013 16:24:44 +0400 Subject: [Numpy-discussion] Matching line points with grid coordinates numerically Message-ID: <1367843084.653834182@f334.mail.ru> Dear Python users, I am having difficulty numerically matching line coordinates to grid cell coordinates. I want to calculate the following function: F = distance_of_crossed_line x intersected_cell_value The problem here is that the crossed_line_length I calculate in line coordinates does not match the grid coordinates, which are another x,y system with a step of, e.g., dx=dy=2.5 per grid cell. I want to do a numerical calculation, say, an F(distance, intersected_grid_value) function where intersected_grid_value - values in the intersected grid, distance - intersected_line_length (given below, or see http://stackoverflow.com/questions/16377826/distance-for-each-intersected-points-of-a-line-in-increased-order-in-2d-coordina)

import numpy as np
import scipy as sp

def distance_of_crossed_line(x0, x1, y0, y1):
    # slope
    m = (y1 - y0) / (x1 - x0)
    # Boundary of the selected points
    x_ceil = np.ceil(min(x0, x1))
    x_floor = np.floor(max(x0, x1))
    y_ceil = np.ceil(min(y0, y1))
    y_floor = np.floor(max(y0, y1))
    # calculate all intersected x coordinates
    x = np.arange(x_ceil, x_floor + 1)
    y = m * (x - x0) + y0
    ax = zip(x, y)
    # calculate all intersected y coordinates
    y = np.arange(y_ceil, y_floor + 1)
    x = (y - y0) / m + x0
    ax.extend(zip(x, y))
    ax.append((x0, y0))
    ax.append((x1, y1))
    ax.sort()
    # Transpose
    ax = np.array(ax).T
    # Calculate difference of intersections in X
    dist_x = np.diff(ax[0])
    # Calculate difference of intersections in Y
    dist_y = np.diff(ax[1])
    return np.sqrt(dist_x**2 + dist_y**2)

# PLEASE note that the line points are in a different system than the 2D array axes;
# they should be matched with each other.
# 2D array.
d_array = np.array([[4.5, 4.5, 4.5, 3.4, 2.5],
                    [3.9, 4.5, 5.2, 4.5, 3.4],
                    [3.9, 3.9, 2.5, 2.2, 1.9]])
# Two sample points as a line
x = np.array([ -80, -40 ])
y = np.array([ 60, 55 ])
# The problem: F = intersected_line_length * array_grid_values_where_line_crossed_area

* It is not necessary for me to overlay lines onto grid cells properly, JUST, I need to calculate a numerically accurate F function Thanks for the answer and guidance in advance, -- Bakhtiyor Zokhidov -------------- next part -------------- An HTML attachment was scrubbed... URL:
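Assuming a regular grid with spacing (dx, dy) and origin (x_orig, y_orig), which the post leaves open, the F above can be assembled from the segment lengths and the cells containing each segment's midpoint. A sketch with illustrative names only:

import numpy as np

def line_cell_weighted_sum(points, values, x_orig, y_orig, dx, dy):
    # points: intersection points sorted along the line, shape (N, 2)
    pts = np.asarray(points, dtype=float)
    seg = np.diff(pts, axis=0)
    lengths = np.hypot(seg[:, 0], seg[:, 1])
    mid = 0.5 * (pts[1:] + pts[:-1])
    # index of the grid cell containing each segment midpoint
    j = ((mid[:, 0] - x_orig) // dx).astype(int)
    i = ((mid[:, 1] - y_orig) // dy).astype(int)
    # F = sum over segments of (length in cell) * (cell value)
    return np.sum(lengths * values[i, j])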
From funkydevnull at gmail.com Mon May 6 08:52:31 2013 From: funkydevnull at gmail.com (Funky Dev) Date: Mon, 06 May 2013 14:52:31 +0200 Subject: [Numpy-discussion] mpfr integration into numpy? Message-ID: <5187A78F.7070801@gmail.com> Hi, I've got a project in which it turns out we need much higher precision than even __float128 and playing around with a few alternatives mpfr seems to be the highest performing possibility. So I've started writing a cythonized class mpfr_array which provides array-like functionality but with mpfr_t as a "primitive" type. While this is doable it feels very much like re-inventing the wheel as I'm basically rewriting some part of numpy's functionality. So I was wondering, as I think this is a very standard use case, is there any interest in adding mpfr_t as a "primitive" dtype to numpy? I know one can define new dtypes but I don't want it to be a python object since then there will be a large python overhead for all operations (i.e. dot products will happen in python, not C). mpfr_t is a very natural dtype to add as it's fast, C-based and supports general precision. I have to admit complete ignorance of numpy's internals but as I'm writing my own version of such a class I would be happy to work with anyone more versed in numpy than myself to extend numpy with built-in mpfr_t support. On a related note, if this was done, would it automatically work with functionality such as numpy.linalg.inv(), etc...? In principle such functions could have been written with macros to be more type-flexible (i.e. an ADD(a,b,c) mapping to e.g. a=b+c for floats or to mpfr_add(a, b, c) for mpfr_t) but I suspect this is not the case. thanks, Sheer From njs at pobox.com Mon May 6 09:57:30 2013 From: njs at pobox.com (Nathaniel Smith) Date: Mon, 6 May 2013 09:57:30 -0400 Subject: [Numpy-discussion] mpfr integration into numpy? In-Reply-To: <5187A78F.7070801@gmail.com> References: <5187A78F.7070801@gmail.com> Message-ID: On Mon, May 6, 2013 at 8:52 AM, Funky Dev wrote: > Hi, > > I've got a project in which it turns out we need much higher precision > than even __float128 and playing around with a few alternatives mpfr > seems to be the highest performing possibility. So I've started > writing a cythonized class mpfr_array which provides array-like > functionality but with mpfr_t as a "primitive" type. > > While this is doable it feels very much like re-inventing the wheel as > I'm basically rewriting some part of numpy's functionality. So I was > wondering, as I think this is a very standard use case, is there any > interest in adding mpfr_t as a "primitive" dtype to numpy? I know one > can define new dtypes but I don't want it to be a python object since > then there will be a large python overhead for all operations (i.e. dot > products will happen in python, not C). mpfr_t is a very natural dtype > to add as it's fast, C-based and supports general precision. You actually can define new dtypes implemented in C code -- in fact, right now, that's the only way to do it. There are some examples of how to do this from third-party code here: https://github.com/numpy/numpy-dtypes Since mpfr is GPLed, it couldn't be included in core numpy in any case, but I don't see any reason you couldn't implement it this way and distribute it as a third-party extension. There would be some subtleties to work out regarding memory management -- I guess mpfr_t objects have a fixed width (which is a requirement for dtypes), but internally malloc some buffers, so you have to be careful with copying them etc.? Should be doable with some care. > > On a related note, if this was done, would it automatically work with > functionality such as numpy.linalg.inv(), etc...? In principle such > functions could have been written with macros to be more type-flexible > (i.e. an ADD(a,b,c) mapping to e.g. a=b+c for floats or to mpfr_add(a, > b, c) for mpfr_t) but I suspect this is not the case. No, it wouldn't work automatically. But in numpy 1.8 you can register new implementations of 'inv' and friends when your library is loaded, which np.linalg.inv() will then automatically use if it sees your new dtype. -n From lists at onerussian.com Mon May 6 10:32:41 2013 From: lists at onerussian.com (Yaroslav Halchenko) Date: Mon, 6 May 2013 10:32:41 -0400 Subject: [Numpy-discussion] Really cruel draft of vbench setup for NumPy (.add.reduce benchmarks since 2011) Message-ID: <20130506143241.GV5140@onerussian.com> On Wed, 01 May 2013, Sebastian Berg wrote: > > btw -- is there something like panda's vbench for numpy? i.e. where > > it would be possible to track/visualize such performance > > improvements/hits? > Sorry if it seemed harsh, but only skimmed mails and it seemed a bit > like the an obvious piece was missing... There are no benchmark tests I > am aware of. You can try: > a = np.random.random((1000, 1000)) > and then time a.sum(1) and a.sum(0), on 1.7. the fast axis (1), is only > slightly faster then the sum over the slow axis. On earlier numpy > versions you will probably see something like half the speed for the > slow axis (only got ancient or 1.7 numpy right now, so reluctant to give > exact timings). FWIW -- just as a cruel first attempt look at http://www.onerussian.com/tmp/numpy-vbench-20130506/vb_vb_reduce.html why is the float16 case so special? I have pushed this really coarse setup (based on some elderly copy of pandas' vbench) to https://github.com/yarikoptic/numpy-vbench if you care to tune it up/extend and then I could fire it up again on that box (which doesn't do anything else ATM AFAIK). Since majority of time is spent actually building it (did it with ccache though) it would be neat if you come up with more of benchmarks to run which you might think could be interesting/important. -- Yaroslav O. Halchenko, Ph.D.
http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Senior Research Associate, Psychological and Brain Sciences Dept. Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From josef.pktd at gmail.com Mon May 6 11:56:54 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 6 May 2013 11:56:54 -0400 Subject: [Numpy-discussion] Really cruel draft of vbench setup for NumPy (.add.reduce benchmarks since 2011) In-Reply-To: <20130506143241.GV5140@onerussian.com> References: <20130506143241.GV5140@onerussian.com> Message-ID: On Mon, May 6, 2013 at 10:32 AM, Yaroslav Halchenko wrote: > > On Wed, 01 May 2013, Sebastian Berg wrote: >> > btw -- is there something like panda's vbench for numpy? i.e. where >> > it would be possible to track/visualize such performance >> > improvements/hits? > > >> Sorry if it seemed harsh, but only skimmed mails and it seemed a bit >> like the an obvious piece was missing... There are no benchmark tests I >> am aware of. You can try: > >> a = np.random.random((1000, 1000)) > >> and then time a.sum(1) and a.sum(0), on 1.7. the fast axis (1), is only >> slightly faster then the sum over the slow axis. On earlier numpy >> versions you will probably see something like half the speed for the >> slow axis (only got ancient or 1.7 numpy right now, so reluctant to give >> exact timings). > > FWIW -- just as a cruel first attempt look at > > http://www.onerussian.com/tmp/numpy-vbench-20130506/vb_vb_reduce.html > > why float16 case is so special? > > I have pushed this really coarse setup (based on some elderly copy of > pandas' vbench) to > https://github.com/yarikoptic/numpy-vbench > > if you care to tune it up/extend and then I could fire it up again on > that box (which doesn't do anything else ATM AFAIK). Since majority of > time is spent actually building it (did it with ccache though) it would > be neat if you come up with more of benchmarks to run which you might > think could be interesting/important. nice results Thanks Yaroslav, Josef my default: axis=0 > > -- > Yaroslav O. Halchenko, Ph.D. > http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org > Senior Research Associate, Psychological and Brain Sciences Dept. > Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 > Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 > WWW: http://www.linkedin.com/in/yarik > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From sebastian at sipsolutions.net Mon May 6 12:03:52 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 06 May 2013 18:03:52 +0200 Subject: [Numpy-discussion] Really cruel draft of vbench setup for NumPy (.add.reduce benchmarks since 2011) In-Reply-To: <20130506143241.GV5140@onerussian.com> References: <20130506143241.GV5140@onerussian.com> Message-ID: <1367856232.2506.31.camel@sebastian-laptop> On Mon, 2013-05-06 at 10:32 -0400, Yaroslav Halchenko wrote: > On Wed, 01 May 2013, Sebastian Berg wrote: > > > btw -- is there something like panda's vbench for numpy? i.e. where > > > it would be possible to track/visualize such performance > > > improvements/hits? > > > > Sorry if it seemed harsh, but only skimmed mails and it seemed a bit > > like the an obvious piece was missing... There are no benchmark tests I > > am aware of. 
You can try:
>
> > a = np.random.random((1000, 1000))
>
> > and then time a.sum(1) and a.sum(0), on 1.7. the fast axis (1), is only
> > slightly faster then the sum over the slow axis. On earlier numpy
> > versions you will probably see something like half the speed for the
> > slow axis (only got ancient or 1.7 numpy right now, so reluctant to give
> > exact timings).
>
> FWIW -- just as a cruel first attempt look at
>
> http://www.onerussian.com/tmp/numpy-vbench-20130506/vb_vb_reduce.html
>
> why is the float16 case so special?

Float16 is special, it is cpu-bound -- not memory bound as most reductions -- because it is not a native type. First thought it was weird, but it actually makes sense: if you have a and b as float16, a + b is actually more like (I believe...):

float16(float32(a) + float32(b))

This means there is type casting going on *inside* the ufunc! Normally casting is handled outside the ufunc (by the buffered iterator). Now I did not check, but when the iteration order is not optimized, the ufunc *can* simplify this to something similar to this (along the reduction axis):

result = float32(a[0])
for x in a[1:]:
    result += float32(x)
return float16(result)

While for "optimized" iteration order, this cannot happen because the intermediate result is always written back. This means for optimized iteration order a single conversion to float is necessary (in the inner loop), while for unoptimized iteration order two conversions to float and one back are done. Since this conversion is costly, the memory throughput is actually not important (no gain from buffering). This leads to the visible slowdown. This is of course a bit annoying, but I am not sure how you would solve it (Have the dtype signal that it doesn't even want iteration order optimization? Try to move those weird float16 conversions from the ufunc to the iterator somehow?).
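A quick way to see the effect described above on any machine (a sketch; the numbers themselves are machine- and version-dependent):

import timeit
import numpy as np

for dt in ('float64', 'float16'):
    a = np.random.random((1000, 1000)).astype(dt)
    for axis in (0, 1):
        t = timeit.timeit(lambda: a.sum(axis=axis), number=20)
        print('%s axis=%d: %.4f s' % (dt, axis, t))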
> I have pushed this really coarse setup (based on some elderly copy of
> pandas' vbench) to
> https://github.com/yarikoptic/numpy-vbench
>
> if you care to tune it up/extend and then I could fire it up again on
> that box (which doesn't do anything else ATM AFAIK). Since majority of
> time is spent actually building it (did it with ccache though) it would
> be neat if you come up with more of benchmarks to run which you might
> think could be interesting/important.
>
I think this is pretty cool! Probably would be a while until there are many tests, but if you or someone could set such a thing up it could slowly grow when larger code changes are done? Regards, Sebastian From lists at onerussian.com Mon May 6 12:11:53 2013 From: lists at onerussian.com (Yaroslav Halchenko) Date: Mon, 6 May 2013 12:11:53 -0400 Subject: [Numpy-discussion] Really cruel draft of vbench setup for NumPy (.add.reduce benchmarks since 2011) In-Reply-To: <1367856232.2506.31.camel@sebastian-laptop> References: <20130506143241.GV5140@onerussian.com> <1367856232.2506.31.camel@sebastian-laptop> Message-ID: <20130506161153.GW5140@onerussian.com> On Mon, 06 May 2013, Sebastian Berg wrote: > > > if you care to tune it up/extend and then I could fire it up again on > > > that box (which doesn't do anything else ATM AFAIK). Since majority of > > > time is spent actually building it (did it with ccache though) it would > > > be neat if you come up with more of benchmarks to run which you might > > > think could be interesting/important. > > I think this is pretty cool! Probably would be a while until there are > many tests, but if you or someone could set such a thing up it could > slowly grow when larger code changes are done? that is the idea but it would be nice to gather such simple benchmark-tests. if you could hint at the numpy functionality you think is especially worth benchmarking (I know -- there are a lot of things which could be set to be benchmarked) -- that would be a nice starting point: just list functionality/functions you consider of primary interest. and either it is worth testing for different types or just a gross estimate (e.g. for the selection of types in a loop) As for myself -- I guess I will add fancy indexing and slicing tests. Adding them is quite easy: have a look at https://github.com/yarikoptic/numpy-vbench/blob/master/vb_reduce.py which is actually a bit more cumbersome because of running them for different types. This one is more obvious: https://github.com/yarikoptic/numpy-vbench/blob/master/vb_io.py -- Yaroslav O. Halchenko, Ph.D. http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Senior Research Associate, Psychological and Brain Sciences Dept. Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik
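For reference, an entry in those files looks roughly like this (a sketch that assumes vbench's Benchmark class as used in the numpy-vbench repository; the benchmark names are illustrative):

from vbench.benchmark import Benchmark

setup = """
import numpy as np
a = np.random.random((1000, 1000))
"""

vb_sum_axis0 = Benchmark('a.sum(axis=0)', setup, name='sum_axis0')
vb_sum_axis1 = Benchmark('a.sum(axis=1)', setup, name='sum_axis1')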
From charlesr.harris at gmail.com Mon May 6 13:05:54 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 6 May 2013 11:05:54 -0600 Subject: [Numpy-discussion] mpfr integration into numpy? In-Reply-To: References: <5187A78F.7070801@gmail.com> Message-ID: On Mon, May 6, 2013 at 7:57 AM, Nathaniel Smith wrote: > On Mon, May 6, 2013 at 8:52 AM, Funky Dev wrote: > > Hi, > > > > I've got a project in which it turns out we need much higher precision > > than even __float128 and playing around with a few alternatives mpfr > > seems to be the highest performing possibility. So I've started > > writing a cythonized class mpfr_array which provides array-like > > functionality but with mpfr_t as a "primitive" type. > > > > While this is doable it feels very much like re-inventing the wheel as > > I'm basically rewriting some part of numpy's functionality. So I was > > wondering, as I think this is a very standard use case, is there any > > interest in adding mpfr_t as a "primitive" dtype to numpy? I know one > > can define new dtypes but I don't want it to be a python object since > > then there will be a large python overhead for all operations (i.e. dot > > products will happen in python, not C). mpfr_t is a very natural dtype > > to add as it's fast, C-based and supports general precision. > > You actually can define new dtypes implemented in C code -- in fact, > right now, that's the only way to do it. There are some examples of > how to do this from third-party code here: > https://github.com/numpy/numpy-dtypes > Since mpfr is GPLed, it couldn't be included in core numpy in any > case, but I don't see any reason you couldn't implement it this way > and distribute it as a third-party extension. > The rational test class recently merged into numpy is more complete and fixed up than the version in numpy-dtypes, although I can't help thinking it could be streamlined even more. > > There would be some subtleties to work out regarding memory management > -- I guess mpfr_t objects have a fixed width (which is a requirement > for dtypes), but internally malloc some buffers, so you have to be > careful with copying them etc.? Should be doable with some care. > > On a related note, if this was done, would it automatically work with > > functionality such as numpy.linalg.inv(), etc...? In principle such > > functions could have been written with macros to be more type-flexible > > (i.e. an ADD(a,b,c) mapping to e.g. a=b+c for floats or to mpfr_add(a, > > b, c) for mpfr_t) but I suspect this is not the case. > > No, it wouldn't work automatically. But in numpy 1.8 you can register > new implementations of 'inv' and friends when your library is loaded, > which np.linalg.inv() will then automatically use if it sees your new > dtype. > > Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From mrbago at gmail.com Mon May 6 18:54:33 2013 From: mrbago at gmail.com (Bago) Date: Mon, 6 May 2013 15:54:33 -0700 Subject: [Numpy-discussion] searchsorted descending arrays Message-ID: I submitted a patch a little while ago, https://github.com/numpy/numpy/pull/3107, which gave the searchsorted function the ability to search arrays sorted in descending order. At the time my approach was to detect the sortorder of the array by comparing the first and last elements. This works pretty well in most cases, but fails in one notable case. After giving it some thought, I think the best way to add searching of descending arrays to numpy would be by adding a keyword to the searchsorted function. I wanted to know what you guys thought of this before updating the pr. I would like to add something like the following to numpy:

A = [10, 9, 2, 1]
np.searchsorted(A, 5, sortorder='descending')

the other option would be to auto-detect the order, but then this case might surprise some users:

A = [0, 0, 0]
A = np.sort(A)[::-1]
print np.searchsorted(A, [1, -1])
# [3, 0]

This might surprise a user who expects to be searching a descending array Bago -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Mon May 6 19:16:41 2013 From: njs at pobox.com (Nathaniel Smith) Date: Mon, 6 May 2013 19:16:41 -0400 Subject: [Numpy-discussion] searchsorted descending arrays In-Reply-To: References: Message-ID: On Mon, May 6, 2013 at 6:54 PM, Bago wrote: > I submitted a patch a little while ago, > https://github.com/numpy/numpy/pull/3107, which gave the searchsorted > function the ability to search arrays sorted in descending order. At the > time my approach was to detect the sortorder of the array by comparing the > first and last elements. This works pretty well in most cases, but fails in > one notable case. After giving it some thought, I think the best way to add > searching of descending arrays to numpy would be by adding a keyword to the > searchsorted function. I wanted to know what you guys thought of this before > updating the pr. > > I would like to add something like the following to numpy:
>
> A = [10, 9, 2, 1]
> np.searchsorted(A, 5, sortorder='descending')
>
> the other option would be to auto-detect the order, but then this case might > surprise some users:
>
> A = [0, 0, 0]
> A = np.sort(A)[::-1]
> print np.searchsorted(A, [1, -1])
> # [3, 0]
>
> This might surprise a user who expects to be searching a descending array

I agree, that result would not really be acceptable (could easily break various algorithms in very hard to notice cases), so a kwarg would be better. -n
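Until such a keyword exists, the usual workaround for numeric data is to negate both the array and the values, which turns a descending sort into an ascending one. A sketch, not part of the proposed patch:

import numpy as np

A = np.array([10, 9, 2, 1])    # sorted in descending order
v = 5
# insertion index of v in the descending A, via the negated (ascending) view
idx = np.searchsorted(-A, -v)
print(idx)                      # 2, i.e. between 9 and 2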
From sudheer.joseph at yahoo.com Tue May 7 05:14:46 2013 From: sudheer.joseph at yahoo.com (Sudheer Joseph) Date: Tue, 7 May 2013 17:14:46 +0800 (SGT) Subject: [Numpy-discussion] printing array in tabular form Message-ID: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> Dear experts, I need to print a few arrays in a tabular form. For example, the array IL below has 25 elements; is there an easy way to print this as a 5x5 comma-separated table in python?

IL=[]
for i in np.arange(1,bno+1):
    IL.append(i)
print(IL)

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% in fortran I could do it as below %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

      integer matrix(5,5)
      in=0
      do, k=1,5
      do, l=1,5
      in=in+1
      matrix(k,l)=in
      enddo
      enddo
      m=5
      n=5
      do, i=1,m
      write(*,"(5i5)") ( matrix(i,j), j=1,n )
      enddo
      end

*************************************************************** Sudheer Joseph Indian National Centre for Ocean Information Services Ministry of Earth Sciences, Govt. of India POST BOX NO: 21, IDA Jeedeemetla P.O. Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55 Tel:+91-40-23886047(O),Fax:+91-40-23895011(O), Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile) E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com Web- http://oppamthadathil.tripod.com *************************************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: From Nicolas.Rougier at inria.fr Tue May 7 05:54:38 2013 From: Nicolas.Rougier at inria.fr (Nicolas Rougier) Date: Tue, 7 May 2013 11:54:38 +0200 Subject: [Numpy-discussion] Numpy beginner tutorial Message-ID: <2160ED3D-5471-42A1-9514-3A5C07FCD042@inria.fr> Hello everybody, I've written a numpy beginner tutorial that is available from: http://www.loria.fr/~rougier/teaching/numpy/numpy.html It has been designed around cellular automata to try to make it fun. While writing it, I tried to compile a set of exercises and make them progressively harder. For advanced levels, I thought the easiest way would be to extract simple questions (but more importantly answers) from this very mailing list in order to gather them on a single page. The goal would be both to offer a quick reference for new (and old) users and to provide also a set of exercises for those who teach. However, it's a bit harder than I thought since the mailing list is huge. I made a separate page for this: http://www.loria.fr/~rougier/teaching/numpy.100/index.html (Sources are http://www.loria.fr/~rougier/teaching/numpy.100/index.rst) (The level names came from an old game: Dungeon Master) In order to extract questions/answers I would need some help, if you have some free time to spare... If you remember having asked or answered a (short) problem, could you send a link to the relevant post (the one with the answer), or better, write directly the formatted entry. Here is an example:

#. Find indices of non-zero elements from [1,2,0,0,4,0]

.. code:: python

   # Author: Somebody

   print np.nonzero([1,2,0,0,4,0])

If you can provide the (assumed) level of the answer, that would be even better.
Nicolas From sebastian at sipsolutions.net Tue May 7 07:47:18 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 07 May 2013 13:47:18 +0200 Subject: [Numpy-discussion] Really cruel draft of vbench setup for NumPy (.add.reduce benchmarks since 2011) In-Reply-To: <20130506161153.GW5140@onerussian.com> References: <20130506143241.GV5140@onerussian.com> <1367856232.2506.31.camel@sebastian-laptop> <20130506161153.GW5140@onerussian.com> Message-ID: <1367927238.23010.12.camel@sebastian-laptop> On Mon, 2013-05-06 at 12:11 -0400, Yaroslav Halchenko wrote: > > > if you care to tune it up/extend and then I could fire it up again on > > > that box (which doesn't do anything else ATM AFAIK). Since majority of > > > time is spent actually building it (did it with ccache though) it would > > > be neat if you come up with more of benchmarks to run which you might > > > think could be interesting/important. > > I think this is pretty cool! Probably would be a while until there are > > many tests, but if you or someone could set such a thing up it could > > slowly grow when larger code changes are done? > > that is the idea but it would be nice to gather such simple > benchmark-tests. if you could hint at the numpy functionality you think > is especially worth benchmarking (I know -- there are a lot of things > which could be set to be benchmarked) -- that would be a nice starting > point: just list functionality/functions you consider of primary > interest. and either it is worth testing for different types or just a > gross estimate (e.g. for the selection of types in a loop) > > As for myself -- I guess I will add fancy indexing and slicing tests. > Indexing/assignment was the first thing I thought of too (also because fancy indexing/assignment really could use some speedups...). Other than that maybe some timings for small arrays/scalar math, but that might be nice for that GSoC project. Maybe array creation functions, just to see if performance bugs sneak into something that central. But I can't think of anything else that isn't specific functionality. - Sebastian > Adding them is quite easy: have a look at > https://github.com/yarikoptic/numpy-vbench/blob/master/vb_reduce.py > which is actually a bit more cumbersome because of running them for > different types. > This one is more obvious: > https://github.com/yarikoptic/numpy-vbench/blob/master/vb_io.py > From davidmenhur at gmail.com Tue May 7 08:09:02 2013 From: davidmenhur at gmail.com (Daπid) Date: Tue, 7 May 2013 14:09:02 +0200 Subject: [Numpy-discussion] Really cruel draft of vbench setup for NumPy (.add.reduce benchmarks since 2011) In-Reply-To: <1367927238.23010.12.camel@sebastian-laptop> References: <20130506143241.GV5140@onerussian.com> <1367856232.2506.31.camel@sebastian-laptop> <20130506161153.GW5140@onerussian.com> <1367927238.23010.12.camel@sebastian-laptop> Message-ID: On 7 May 2013 13:47, Sebastian Berg wrote: > Indexing/assignment was the first thing I thought of too (also because > fancy indexing/assignment really could use some speedups...). Other than > that maybe some timings for small arrays/scalar math, but that might be > nice for that GSoC project. Why not go bigger? Ufunc operations on big arrays, CPU and memory bound. Also, what about interfacing with other packages? It may increase the compiling overhead, but I would like to see Cython in action (say, only the last version; maybe it can be fixed).
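A timing entry for one of the suggestions above (fancy indexing) could be as simple as this sketch, here written with plain timeit rather than vbench:

import timeit
import numpy as np

a = np.random.random(1000000)
idx = np.random.randint(0, a.size, size=100000)
print('fancy indexing: %.4f s' % timeit.timeit(lambda: a[idx], number=100))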
From derek at astro.physik.uni-goettingen.de  Tue May  7 09:11:19 2013
From: derek at astro.physik.uni-goettingen.de (Derek Homeier)
Date: Tue, 7 May 2013 15:11:19 +0200
Subject: [Numpy-discussion] printing array in tabular form
In-Reply-To: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com>
References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com>
Message-ID: <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de>

Dear Sudheer,

On 07.05.2013, at 11:14AM, Sudheer Joseph wrote:

> I need to print a few arrays in a tabular form. For example, the array IL below has 25 elements; is there an easy way to print this as a 5x5 comma-separated table in python?
>
> IL=[]
> for i in np.arange(1,bno+1):
>     IL.append(i)
> print(IL)
>

assuming you want this table printed to a file, savetxt does just what you need. In brief, for your case,

np.savetxt("file.txt", IL.reshape(-1,5), fmt='%5d', delimiter=',')

should print it in the requested form; you can refer to the savetxt documentation for further options.

HTH,
			Derek

From ben.root at ou.edu  Tue May  7 09:15:27 2013
From: ben.root at ou.edu (Benjamin Root)
Date: Tue, 7 May 2013 09:15:27 -0400
Subject: [Numpy-discussion] searchsorted descending arrays
In-Reply-To: 
References: 
Message-ID: 

On Mon, May 6, 2013 at 7:16 PM, Nathaniel Smith wrote:

> On Mon, May 6, 2013 at 6:54 PM, Bago wrote:
> > I submitted a patch a little while ago,
> > https://github.com/numpy/numpy/pull/3107, which gave the searchsorted
> > function the ability to search arrays sorted in descending order. At the
> > time my approach was to detect the sortorder of the array by comparing the
> > first and last elements. This works pretty well in most cases, but fails in
> > one notable case. After giving it some thought, I think the best way to add
> > searching of descending arrays to numpy would be by adding a keyword to the
> > searchsorted function. I wanted to know what you guys thought of this before
> > updating the pr.
> >
> > I would like to add something like the following to numpy:
> >
> > A = [10, 9, 2, 1]
> > np.searchsorted(A, 5, sortorder='descending')
> >
> > the other option would be to auto-detect the order, but then this case might
> > surprise some users:
> >
> > A = [0, 0, 0]
> > A = np.sort(A)[::-1]
> > print np.searchsorted(A, [1, -1])
> > # [3, 0]
> >
> > This might surprise a user who expects to be searching a descending array
>
> I agree, that result would not really be acceptable (could easily
> break various algorithms in very hard to notice cases), so a kwarg
> would be better.
>
> -n
>

+1 on kwarg approach.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ralf.gommers at gmail.com  Tue May  7 14:34:36 2013
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Tue, 7 May 2013 20:34:36 +0200
Subject: [Numpy-discussion] Numpy beginner tutorial
In-Reply-To: <2160ED3D-5471-42A1-9514-3A5C07FCD042@inria.fr>
References: <2160ED3D-5471-42A1-9514-3A5C07FCD042@inria.fr>
Message-ID: 

On Tue, May 7, 2013 at 11:54 AM, Nicolas Rougier wrote:

>
> Hello everybody,
>
> I've written a numpy beginner tutorial that is available from:
>
> http://www.loria.fr/~rougier/teaching/numpy/numpy.html
>
> It has been designed around cellular automata to try to make it fun.
>

Hi Nicolas, that looks good. You're linking to some other tutorials at the bottom, maybe you can add http://scipy-lectures.github.io/ (has both an intro and an advanced numpy tutorial).
Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From Nicolas.Rougier at inria.fr Tue May 7 14:44:56 2013 From: Nicolas.Rougier at inria.fr (Nicolas Rougier) Date: Tue, 7 May 2013 20:44:56 +0200 Subject: [Numpy-discussion] Numpy beginner tutorial In-Reply-To: References: <2160ED3D-5471-42A1-9514-3A5C07FCD042@inria.fr> Message-ID: Shame on me ! How did I forget this one... Thanks, just added it. Nicolas > > Hi Nicolas, that looks good. You're linking to some other tutorials at the bottom, maybe you can add http://scipy-lectures.github.io/ (has both an intro and an advanced numpy tutorial). > > Ralf From e.antero.tammi at gmail.com Tue May 7 15:03:16 2013 From: e.antero.tammi at gmail.com (eat) Date: Tue, 7 May 2013 22:03:16 +0300 Subject: [Numpy-discussion] Numpy beginner tutorial In-Reply-To: <2160ED3D-5471-42A1-9514-3A5C07FCD042@inria.fr> References: <2160ED3D-5471-42A1-9514-3A5C07FCD042@inria.fr> Message-ID: Hi, Looks nice tutorial, indeed. On Tue, May 7, 2013 at 12:54 PM, Nicolas Rougier wrote: > > > Hello everybody, > > > I've written a numpy beginner tutorial that is available from: > > http://www.loria.fr/~rougier/teaching/numpy/numpy.html > > It has been designed around cellular automata to try to make it fun. > Perhaps you could also link to http://www.scipy.org/Cookbook/GameOfLifeStrides (at least if you are planning to have exercises beyond Apprentice level). IMHO it just provides more natural view of the neighborhood via stride_tricks. > > > While writing it, I tried to compile a set of exercises and make them > progressively harder. For advanced levels, I thought the easiest way would > be to extract simple questions (but more importantly answers) from this > very mailing list in order to gather them on a single page. The goal would > be both to offer a quick reference for new (and old users) and to provide > also a set of exercices for those who teach. However, it's a bit harder > than I thought since the mailing list is huge. > > I made a separate page for this: > > http://www.loria.fr/~rougier/teaching/numpy.100/index.html > (Sources are http://www.loria.fr/~rougier/teaching/numpy.100/index.rst) > > (The level names came from an old-game: Dungeon Master) > > > In order to extract questions/answers and I would need some help, if you > have some free time to spare... > > If you remember having asked or answered a (short) problem, could you send > a link to the relevant post (the one with the answer), or better, write > directly the formated entry. Here is an example: > > > #. Find indices of non-zero elements from [1,2,0,0,4,0] > > .. code:: python > > # Author: Somebody > > print np.nonzero([1,2,0,0,4,0]) > > > If you can provide the (assumed) level of the answer, that would be even > better. My 2 cents, -eat > > Nicolas > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: 

From mhpc.edas at gmail.com  Tue May  7 18:33:42 2013
From: mhpc.edas at gmail.com (MHPC 2013)
Date: Wed, 8 May 2013 00:33:42 +0200
Subject: [Numpy-discussion] CfP 2013 Workshop on Middleware for HPC and Big Data Systems (MHPC'13)
Message-ID: 

we apologize if you receive multiple copies of this message

===================================================================
CALL FOR PAPERS

2013 Workshop on Middleware for HPC and Big Data Systems
MHPC '13

as part of Euro-Par 2013, Aachen, Germany
===================================================================

Date: August 27, 2013
Workshop URL: http://m-hpc.org

Springer LNCS

SUBMISSION DEADLINE:
May 31, 2013 - LNCS Full paper submission (rolling abstract submission)
June 28, 2013 - Lightning Talk abstracts

SCOPE

Extremely large, diverse, and complex data sets are generated from scientific applications, the Internet, social media and other applications. Data may be physically distributed and shared by an ever larger community. Collecting, aggregating, storing and analyzing large data volumes presents major challenges. Processing such amounts of data efficiently has been an obstacle to scientific discovery and technological advancement. In addition, making the data accessible, understandable and interoperable involves unsolved problems. Novel middleware architectures, algorithms, and application development frameworks are required.

In this workshop we are particularly interested in original work at the intersection of HPC and Big Data with regard to middleware handling and optimizations. The scope covers existing and proposed middleware for HPC and big data, including analytics libraries and frameworks.

The goal of this workshop is to bring together software architects, middleware and framework developers, data-intensive application developers as well as users from the scientific and engineering community to exchange their experience in processing large datasets and to report their scientific achievements and innovative ideas. The workshop also offers a dedicated forum for these researchers to access the state of the art, to discuss problems and requirements, to identify gaps in current and planned designs, and to collaborate in strategies for scalable data-intensive computing.

The workshop will be one day in length, composed of 20 min paper presentations, each followed by 10 min discussion sections. Presentations may be accompanied by interactive demonstrations.

TOPICS

Topics of interest include, but are not limited to:

- Middleware including: Hadoop, Apache Drill, YARN, Spark/Shark, Hive, Pig, Sqoop, HBase, HDFS, S4, CIEL, Oozie, Impala, Storm and Hyrack
- Data intensive middleware architecture
- Libraries/Frameworks including: Apache Mahout, Giraph, UIMA and GraphLab
- NG Databases including Apache Cassandra, MongoDB and CouchDB/Couchbase
- Schedulers including Cascading
- Middleware for optimized data locality/in-place data processing
- Data handling middleware for deployment in virtualized HPC environments
- Parallelization and distributed processing architectures at the middleware level
- Integration with cloud middleware and application servers
- Runtime environments and system level support for data-intensive computing
- Skeletons and patterns
- Checkpointing
- Programming models and languages
- Big Data ETL
- Stream processing middleware
- In-memory databases for HPC
- Scalability and interoperability
- Large-scale data storage and distributed file systems
- Content-centric addressing and networking
- Execution engines, languages and environments including CIEL/Skywriting
- Performance analysis, evaluation of data-intensive middleware
- In-depth analysis and performance optimizations in existing data-handling middleware, focusing on indexing/fast storing or retrieval between compute and storage nodes
- Highly scalable middleware optimized for minimum communication
- Use cases and experience for popular Big Data middleware
- Middleware security, privacy and trust architectures

DATES

Papers:
Rolling abstract submission
May 31, 2013 - Full paper submission
July 8, 2013 - Acceptance notification
October 3, 2013 - Camera-ready version due

Lightning Talks:
June 28, 2013 - Deadline for lightning talk abstracts
July 15, 2013 - Lightning talk notification

August 27, 2013 - Workshop Date

TPC

CHAIR

Michael Alexander (chair), TU Wien, Austria
Anastassios Nanos (co-chair), NTUA, Greece
Jie Tao (co-chair), Karlsruhe Institute of Technology, Germany
Lizhe Wang (co-chair), Chinese Academy of Sciences, China
Gianluigi Zanetti (co-chair), CRS4, Italy

PROGRAM COMMITTEE

Amitanand Aiyer, Facebook, USA
Costas Bekas, IBM, Switzerland
Jakob Blomer, CERN, Switzerland
William Gardner, University of Guelph, Canada
José Gracia, HPC Center of the University of Stuttgart, Germany
Zhenghua Guom, Indiana University, USA
Marcus Hardt, Karlsruhe Institute of Technology, Germany
Sverre Jarp, CERN, Switzerland
Christopher Jung, Karlsruhe Institute of Technology, Germany
Andreas Knüpfer - Technische Universität Dresden, Germany
Nectarios Koziris, National Technical University of Athens, Greece
Yan Ma, Chinese Academy of Sciences, China
Martin Schulz - Lawrence Livermore National Laboratory
Viral Shah, MIT Julia Group, USA
Dimitrios Tsoumakos, Ionian University, Greece
Zhifeng Yun, Louisiana State University, USA

PAPER PUBLICATION

Accepted full papers will be published in the Springer LNCS series. The best papers of the workshop -- after extension and revision -- will be published in a Special Issue of the Springer Journal of Scalable Computing.

PAPER SUBMISSION

Papers submitted to the workshop will be reviewed by at least two members of the program committee and external reviewers. Submissions should include abstract, key words, the e-mail address of the corresponding author, and must not exceed 10 pages, including tables and figures at a main font size no smaller than 11 point. Submission of a paper should be regarded as a commitment that, should the paper be accepted, at least one of the authors will register and attend the conference to present the work. The format must be according to the Springer LNCS Style. Initial submissions are in PDF; authors of accepted papers will be requested to provide source files.

Format Guidelines: http://www.springer.de/comp/lncs/authors.html
Style template: ftp://ftp.springer.de/pub/tex/latex/llncs/latex2e/llncs2e.zip

Abstract Registration - Submission Link: http://edas.info/newPaper.php?c=14763

LIGHTNING TALKS

Talks are strictly limited to 5 minutes. They can be used to gain early feedback on ongoing research, for demonstrations, to present research results, early research ideas, perspectives and positions of interest to the community. Lightning talks should spark discussion, with presenters making themselves available following the lightning talk track.

DURATION: Workshop Duration is one day.

GENERAL INFORMATION

The workshop will be held as part of Euro-Par 2013.

Euro-Par 2013: http://www.europar2013.org

From sudheer.joseph at yahoo.com  Tue May  7 22:13:25 2013
From: sudheer.joseph at yahoo.com (Sudheer Joseph)
Date: Wed, 8 May 2013 10:13:25 +0800 (SGT)
Subject: [Numpy-discussion] printing array in tabular form
In-Reply-To: <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de>
References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de>
Message-ID: <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com>

Thank you Derek,
          However I get the below error. Please tell me if there is anything I am missing.

  File "read_reg_grd.py", line 22, in <module>
    np.savetxt("file.txt", IL.reshape(-1,5), fmt='%5d', delimiter=',')
AttributeError: 'list' object has no attribute 'reshape'

with best regards,
Sudheer

***************************************************************
Sudheer Joseph
Indian National Centre for Ocean Information Services
Ministry of Earth Sciences, Govt. of India
POST BOX NO: 21, IDA Jeedeemetla P.O.
Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55
Tel:+91-40-23886047(O),Fax:+91-40-23895011(O),
Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile)
E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com
Web- http://oppamthadathil.tripod.com
***************************************************************

>________________________________
> From: Derek Homeier
>To: Discussion of Numerical Python
>Sent: Tuesday, 7 May 2013 6:41 PM
>Subject: Re: [Numpy-discussion] printing array in tabular form
>
>
>Dear Sudheer,
>
>On 07.05.2013, at 11:14AM, Sudheer Joseph wrote:
>
>> I need to print a few arrays in a tabular form. For example, the array IL below has 25 elements; is there an easy way to print this as a 5x5 comma-separated table in python?
>>
>> IL=[]
>> for i in np.arange(1,bno+1):
>>     IL.append(i)
>> print(IL)
>>
>assuming you want this table printed to a file, savetxt does just what you need. In brief, for your case,
>
>np.savetxt("file.txt", IL.reshape(-1,5), fmt='%5d', delimiter=',')
>
>should print it in the requested form; you can refer to the savetxt documentation for further options.
>
>HTH,
>			Derek
>
>_______________________________________________
>NumPy-Discussion mailing list
>NumPy-Discussion at scipy.org
>http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From heng at cantab.net  Wed May  8 02:53:45 2013
From: heng at cantab.net (Henry Gomersall)
Date: Wed, 08 May 2013 07:53:45 +0100
Subject: [Numpy-discussion] printing array in tabular form
In-Reply-To: <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com>
References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com>
Message-ID: <1367996025.32622.4.camel@farnsworth>

On Wed, 2013-05-08 at 10:13 +0800, Sudheer Joseph wrote:
> However I get the below error. Please tell me if there is anything I am missing.
>
>   File "read_reg_grd.py", line 22, in <module>
>     np.savetxt("file.txt", IL.reshape(-1,5), fmt='%5d', delimiter=',')
> AttributeError: 'list' object has no attribute 'reshape'

IL is a list, not a numpy array. You can either convert the list to an array after you've filled it, using np.array(IL), or you can pre-allocate the array and fill it directly in the loop.

Cheers,

Henry

From pierre.raybaut at gmail.com  Wed May  8 11:15:20 2013
From: pierre.raybaut at gmail.com (Pierre Raybaut)
Date: Wed, 8 May 2013 17:15:20 +0200
Subject: [Numpy-discussion] ANN: Spyder v2.2
Message-ID: 

Hi all,

On the behalf of Spyder's development team (http://code.google.com/p/spyderlib/people/list), I'm pleased to announce that Spyder v2.2 has been released and is available for Windows XP/Vista/7/8, GNU/Linux and MacOS X: http://code.google.com/p/spyderlib/.

This release represents 18 months of development since v2.1 and introduces major enhancements and new features:
* Full support for IPython v0.13, including the ability to attach to existing kernels
* New MacOS X application
* Much improved debugging experience
* Various editor improvements for code completion, zooming, auto insertion, and syntax highlighting
* Better looking and faster Object Inspector
* Single instance mode
* Spanish translation of the interface
* And many other changes: http://code.google.com/p/spyderlib/wiki/ChangeLog

This is the last release to support Python 2.5:
* Spyder 2.2 supports Python 2.5 to 2.7
* Spyder 2.3 will support Python 2.7 and Python 3
* (Spyder 2.1.14dev4 is a development release which already supports Python 3)
See also https://code.google.com/p/spyderlib/downloads/list.

Spyder is a free, open-source (MIT license) interactive development environment for the Python language with advanced editing, interactive testing, debugging and introspection features. Originally designed to provide MATLAB-like features (integrated help, interactive console, variable explorer with GUI-based editors for dictionaries, NumPy arrays, ...), it is strongly oriented towards scientific computing and software development. Thanks to the `spyderlib` library, Spyder also provides powerful ready-to-use widgets: embedded Python console (example: http://packages.python.org/guiqwt/_images/sift3.png), NumPy array editor (example: http://packages.python.org/guiqwt/_images/sift2.png), dictionary editor, source code editor, etc.

Description of key features with tasty screenshots can be found at: http://code.google.com/p/spyderlib/wiki/Features

Don't forget to follow Spyder updates/news:
* on the project website: http://code.google.com/p/spyderlib/
* and on our official blog: http://spyder-ide.blogspot.com/

Last, but not least, we welcome any contribution that helps making Spyder an efficient scientific development/computing environment. Join us to help create your favourite environment!
(http://code.google.com/p/spyderlib/wiki/NoteForContributors)

Enjoy!
-Pierre
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From pierre.raybaut at gmail.com  Wed May  8 11:26:50 2013
From: pierre.raybaut at gmail.com (Pierre Raybaut)
Date: Wed, 8 May 2013 17:26:50 +0200
Subject: [Numpy-discussion] ANN: WinPython 2.7.4.1 and 3.3.1.1, 32 and 64bit
Message-ID: 

Hi all,

I am pleased to announce that four new versions of WinPython have been released yesterday with Python 2.7.4 and 3.3.1, 32 and 64 bits. Many packages have been added or upgraded (see the automatically-generated changelogs):
* WinPython 2.7.4.1 32bit, 64bit -- including new Spyder v2.2: https://code.google.com/p/winpython/wiki/ChangeLog_27
* WinPython 3.3.1.1 32bit, 64bit: https://code.google.com/p/winpython/wiki/ChangeLog_33

Special thanks to Christoph Gohlke for building most of the binary packages bundled in WinPython.

WinPython is a free open-source portable distribution of Python for Windows, designed for scientists. It is a full-featured (see http://code.google.com/p/winpython/wiki/PackageIndex) Python-based scientific environment:
* Designed for scientists (thanks to the integrated libraries NumPy, SciPy, Matplotlib, guiqwt, etc.):
  * Regular *scientific users*: interactive data processing and visualization using Python with Spyder
  * *Advanced scientific users and software developers*: Python applications development with Spyder, version control with Mercurial and other development tools (like gettext)
* *Portable*: preconfigured, it should run out of the box on any machine under Windows (without any installation requirements) and the folder containing WinPython can be moved to any location (local, network or removable drive)
* *Flexible*: one can install (or should I write "use" as it's portable) as many WinPython versions as necessary (like isolated and self-consistent environments), even if those versions are running different versions of Python (2.7, 3.3) or different architectures (32bit or 64bit) on the same machine
* *Customizable*: using the integrated package manager (wppm, as WinPython Package Manager), it's possible to install, uninstall or upgrade Python packages (see http://code.google.com/p/winpython/wiki/WPPM for more details on supported package formats).

*WinPython is not an attempt to replace Python(x,y)*, this is just something different (see http://code.google.com/p/winpython/wiki/Roadmap): more flexible, easier to maintain, movable and less invasive for the OS, but certainly less user-friendly, with fewer packages/contents and without any integration to Windows explorer [*].

[*] Actually there is an optional integration into Windows explorer, providing the same features as the official Python installer regarding file associations and context menu entry (this option may be activated through the WinPython Control Panel), and adding shortcuts to Windows Start menu.

Enjoy!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From sylvain.corlay at gmail.com  Wed May  8 11:52:35 2013
From: sylvain.corlay at gmail.com (Sylvain Corlay)
Date: Wed, 8 May 2013 08:52:35 -0700 (PDT)
Subject: [Numpy-discussion] ANN: Spyder v2.2
In-Reply-To: 
References: 
Message-ID: <6865fbef-4be3-4f83-9e1c-3747369eea01@googlegroups.com>

Congratulations to Carlos, Jed and Pierre for this new release.

Best,
Sylvain

On Wednesday, May 8, 2013 11:15:20 AM UTC-4, Pierre Raybaut wrote:
>
> Hi all,
>
> On the behalf of Spyder's development team (http://code.google.com/p/spyderlib/people/list), I'm pleased to announce that Spyder v2.2 has been released and is available for Windows XP/Vista/7/8, GNU/Linux and MacOS X: http://code.google.com/p/spyderlib/.
>
> This release represents 18 months of development since v2.1 and introduces major enhancements and new features:
> * Full support for IPython v0.13, including the ability to attach to existing kernels
> * New MacOS X application
> * Much improved debugging experience
> * Various editor improvements for code completion, zooming, auto insertion, and syntax highlighting
> * Better looking and faster Object Inspector
> * Single instance mode
> * Spanish translation of the interface
> * And many other changes: http://code.google.com/p/spyderlib/wiki/ChangeLog
>
> This is the last release to support Python 2.5:
> * Spyder 2.2 supports Python 2.5 to 2.7
> * Spyder 2.3 will support Python 2.7 and Python 3
> * (Spyder 2.1.14dev4 is a development release which already supports Python 3)
> See also https://code.google.com/p/spyderlib/downloads/list.
>
> Spyder is a free, open-source (MIT license) interactive development environment for the Python language with advanced editing, interactive testing, debugging and introspection features. Originally designed to provide MATLAB-like features (integrated help, interactive console, variable explorer with GUI-based editors for dictionaries, NumPy arrays, ...), it is strongly oriented towards scientific computing and software development. Thanks to the `spyderlib` library, Spyder also provides powerful ready-to-use widgets: embedded Python console (example: http://packages.python.org/guiqwt/_images/sift3.png), NumPy array editor (example: http://packages.python.org/guiqwt/_images/sift2.png), dictionary editor, source code editor, etc.
>
> Description of key features with tasty screenshots can be found at: http://code.google.com/p/spyderlib/wiki/Features
>
> Don't forget to follow Spyder updates/news:
> * on the project website: http://code.google.com/p/spyderlib/
> * and on our official blog: http://spyder-ide.blogspot.com/
>
> Last, but not least, we welcome any contribution that helps making Spyder an efficient scientific development/computing environment. Join us to help create your favourite environment!
> (http://code.google.com/p/spyderlib/wiki/NoteForContributors)
>
> Enjoy!
> -Pierre
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From sylvain.corlay at gmail.com  Wed May  8 12:13:12 2013
From: sylvain.corlay at gmail.com (Sylvain Corlay)
Date: Wed, 8 May 2013 12:13:12 -0400
Subject: [Numpy-discussion] Consistency of numpy's behavior with arrays of dimension 0
Message-ID: 

Hello everyone,

0d arrays are very convenient because they allow us to write functions that are valid for both arrays and scalar-like arguments, thanks to Boolean indexing. However, most special functions in numpy (and scipy) and most Boolean operators, when called on 0d arrays, return scalars rather than 0d arrays, which is quite annoying.

For example, numpy.exp called on a 0d array containing a float number returns a float, rather than a 0d array, and if x is a 0d array, x > 0 returns a Boolean, rather than a 0d array containing a Boolean.

What I would expect is the following.
- If x is a 0d array containing a float, I expect numpy.exp(x) to return a 0d array, and x > 0 to return a 0d array containing a Boolean.
- If x is a scalar, numpy.exp(x) returns the expected scalar type, and x > 0 returns the expected Boolean.

Here is an example of a simple function that suffers from this issue (a corrected version is proposed later):

import numpy
from scipy.stats import norm

def normal_time_value(sig, m, strikes):
    """
    The three arguments are array-like and have the same shape.
    Consider a random variable G ~ N(m, sig^2).
    The function returns E[(G-K)+] - (E[G]-K)_+
    which is also equal to E[(K-G)+] - (K-E[G])_+
    """
    sig = numpy.array(sig)
    strikes = numpy.array(strikes)
    m = numpy.array(m)
    tv = numpy.zeros(strikes.shape)
    tv[sig < 0] = numpy.nan  # sig must be nonnegative
    non0 = sig > 0.0
    dev = numpy.where(non0, (m - strikes) / sig, numpy.nan)
    tv[non0] = numpy.where(strikes > m,
                           (m - strikes) * norm.cdf(dev) + sig * norm.pdf(dev),
                           (strikes - m) * norm.cdf(-dev) + sig * norm.pdf(dev))[non0]
    return tv

This function does not work with scalars or 0d arrays. To make it work, we need to modify it in the following fashion: reconvert intermediate results to 0d arrays to take advantage of the Boolean indexing.

import numpy
from scipy.stats import norm

def normal_time_value(sig, m, strikes):
    """
    The three arguments are array-like and have the same shape.
    Consider a random variable G ~ N(m, sig^2).
    The function returns E[(G-K)+] - (E[G]-K)_+
    which is also equal to E[(K-G)+] - (K-E[G])_+
    """
    sig = numpy.array(sig)
    strikes = numpy.array(strikes)
    m = numpy.array(m)
    tv = numpy.zeros(strikes.shape)
    tv[numpy.array(sig < 0)] = numpy.nan  # sig must be nonnegative
    non0 = numpy.array(sig > 0.0)
    dev = numpy.where(non0, (m - strikes) / sig, numpy.nan)
    tv[non0] = numpy.where(numpy.array(strikes > m),
                           (m - strikes) * norm.cdf(dev) + sig * norm.pdf(dev),
                           (strikes - m) * norm.cdf(-dev) + sig * norm.pdf(dev))[non0]
    return tv

This problem also affects functions like logical_and, logical_or and logical_not, which all return the numpy.bool_ type rather than a 0d array of dtype bool.

Travis Oliphant opened a ticket in Trac regarding the same issue some time ago: http://projects.scipy.org/numpy/ticket/823
It was transferred to github: https://github.com/numpy/numpy/issues/1421.

Best,
Sylvain
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From Andrew.MACKEITH at 3ds.com  Wed May  8 13:09:42 2013
From: Andrew.MACKEITH at 3ds.com (MACKEITH Andrew)
Date: Wed, 8 May 2013 17:09:42 +0000
Subject: [Numpy-discussion] References to Numerical Recipes in Numpy code
Message-ID: <2922D4459F182B41ABD8C7AE5010642094BB078B@AG-DCC-MBX13.dsone.3ds.com>

The numpy libraries (numpy/core/machar.py, numpy/fft/info.py, numpy/lib/function_base.py) contain references to "Numerical Recipes: The Art of Scientific Computing". The book contains a licensing section which states "... Unlicensed transfer of Numerical Recipes programs to any other format is strictly prohibited". From this we understand that use of the book as a reference is permitted, but copying of code for distribution, whether non-commercially or otherwise, is not permitted. We are wondering if anyone has done an analysis of this situation around numpy, and if so what conclusion they might have reached?

Andrew MacKeith
Python and Scripting Development
Dassault Systemes Simulia Corp.

This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email. For other languages, go to http://www.3ds.com/terms/email-disclaimer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From robert.kern at gmail.com  Wed May  8 13:18:48 2013
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 8 May 2013 18:18:48 +0100
Subject: [Numpy-discussion] References to Numerical Recipes in Numpy code
In-Reply-To: <2922D4459F182B41ABD8C7AE5010642094BB078B@AG-DCC-MBX13.dsone.3ds.com>
References: <2922D4459F182B41ABD8C7AE5010642094BB078B@AG-DCC-MBX13.dsone.3ds.com>
Message-ID: 

On Wed, May 8, 2013 at 6:09 PM, MACKEITH Andrew wrote:
> The numpy libraries (numpy/core/machar.py, numpy/fft/info.py,
> numpy/lib/function_base.py) contain references to "Numerical Recipes: The
> Art of Scientific Computing". The book contains a licensing section which
> states "... Unlicensed transfer of Numerical Recipes programs to any other
> format is strictly prohibited". From this we understand that use of the
> book as a reference is permitted, but copying of code for distribution,
> whether non-commercially or otherwise, is not permitted. We are wondering
> if anyone has done an analysis of this situation around numpy, and if so
> what conclusion they might have reached?

These are references to the book's text for the discussion of the algorithms and concepts in general, not copies of the code from the attached disk.

--
Robert Kern

From bin.arthur at gmail.com  Wed May  8 13:47:52 2013
From: bin.arthur at gmail.com (Bin Huang)
Date: Wed, 8 May 2013 11:47:52 -0600
Subject: [Numpy-discussion] Performance degradation when installing Numpy and Python not from packaging system?
Message-ID: 

Hi list,

I have observed a slowdown if Numpy and Python are installed manually (i.e. not from the packaging system). The slowdown can be significant when the problem size is large. I hope someone on this mailing list can give me some hints on this issue. I also post performance results and my source code here.

My platform is a 12-core dual-socket SMP machine running Ubuntu (Linux kernel 3.2.0). On this platform, I have two sets of installations. One is the pre-installed Python 2.7.3 and Numpy 1.6.1 installed with 'apt-get'. The other is Python 2.7.3 and Numpy 1.6.1 both compiled freshly from official tarball releases by myself.

In order to compare performance, I wrote a piece of code that simply performs matrix-matrix multiplication using one of Numpy's functions. The source code is listed as follows:

if __name__ == "__main__":
    import numpy, random, time
    import sys
    size = int(sys.argv[1])  # For simplicity, we only test square matrices
    matrix_a = numpy.matrix(numpy.random.randn(size,size))
    matrix_b = numpy.matrix(numpy.random.randn(size,size))
    start_time = time.time()
    result = numpy.dot(matrix_a, matrix_b)
    print '%0.3f ms' % ((time.time() - start_time)*1000.0)

I ran this code with both pre-installed Python and compiled Python. Here are the performance numbers I got:

Problem size    Pre-installed Python    Compiled Python/Numpy    Slowdown
  256x256               14.165 ms                44.538 ms           ~3X
  512x512              109.330 ms               347.143 ms           ~3X
1024x1024              867.936 ms              8975.016 ms          ~10X
2048x2048             7755.084 ms             84713.997 ms          ~10X

Here are the commands I used to compile Python and Numpy:

a) ~/python2.7.3$ ./configure --prefix=MY_LOCAL_DIR
b) ~/python2.7.3$ make
c) ~/python2.7.3$ make install
d) ~/numpy-1.6.1$ MY_LOCAL_DIR/bin/python setup.py build
e) ~/numpy-1.6.1$ MY_LOCAL_DIR/bin/python setup.py install

Any ideas? Thanks in advance.

Bin Huang
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From pav at iki.fi  Wed May  8 14:13:27 2013
From: pav at iki.fi (Pauli Virtanen)
Date: Wed, 08 May 2013 21:13:27 +0300
Subject: [Numpy-discussion] Performance degradation when installing Numpy and Python not from packaging system?
In-Reply-To: 
References: 
Message-ID: 

Hi,

08.05.2013 20:47, Bin Huang kirjoitti:
[clip]
> In order to compare performance, I wrote a piece of code that simply
> performs matrix-matrix multiplication using one of Numpy's functions.
> The source code is listed as follows:
[clip]

Your benchmark essentially measures the speed of the BLAS library Numpy is linked with. It appears that the one supplied by the system is linked with a high-performance BLAS, whereas the one you compiled yourself is not.

You should look at the output of the build command, and look for lines such as

-------8<-------------
lapack_opt_info:
lapack_mkl_info:
mkl_info:
    libraries mkl,vml,guide not found in ['/usr/local/lib', '/usr/lib', '/usr/lib/x86_64-linux-gnu']
    NOT AVAILABLE
  NOT AVAILABLE
atlas_threads_info:
Setting PTATLAS=ATLAS
    libraries ptf77blas,ptcblas,atlas not found in /usr/local/lib
    libraries lapack_atlas not found in /usr/local/lib
Setting PTATLAS=ATLAS
Setting PTATLAS=ATLAS
  FOUND:
    language = f77
    include_dirs = ['/usr/include/atlas']
    library_dirs = ['/usr/lib/atlas-base/atlas', '/usr/lib/atlas-base']
    libraries = ['lapack', 'ptf77blas', 'ptcblas', 'atlas']
    define_macros = [('ATLAS_INFO', '"\\"3.8.4\\""')]
  FOUND:
    language = f77
    include_dirs = ['/usr/include/atlas']
    library_dirs = ['/usr/lib/atlas-base/atlas', '/usr/lib/atlas-base']
    libraries = ['lapack', 'ptf77blas', 'ptcblas', 'atlas']
    define_macros = [('ATLAS_INFO', '"\\"3.8.4\\""')]
-------8<-------------

If it says "NOT AVAILABLE" for all the cases, then it does not find a good BLAS/LAPACK library, and uses one that is not optimized for specific processors. See here on how to tell it to use a specific BLAS/LAPACK combination: http://new.scipy.org/scipylib/building/linux.html#generic-instructions

--
Pauli Virtanen

From sudheer.joseph at yahoo.com  Thu May  9 04:06:48 2013
From: sudheer.joseph at yahoo.com (Sudheer Joseph)
Date: Thu, 9 May 2013 16:06:48 +0800 (SGT)
Subject: [Numpy-discussion] printing array in tabular form
In-Reply-To: <1367996025.32622.4.camel@farnsworth>
References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth>
Message-ID: <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com>

Thank you Gomersall,
          However writing a formatted output looks to be a bit tricky with python relative to other programming languages. For example,
if I wanted to print the text below to a file (for reading by another program), it looks to be not an easy job.... Hope new developments will come and a user-friendly formatted output method for python will evolve.

IL= 1,2,3,4,5
    5,5,6,8,0

JL= 1,2,3,4,5
    5,5,6,8,0

KL= 1,2,3,4,5
    5,5,6,8,0

with best regards,
Sudheer

***************************************************************
Sudheer Joseph
Indian National Centre for Ocean Information Services
Ministry of Earth Sciences, Govt. of India
POST BOX NO: 21, IDA Jeedeemetla P.O.
Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55
Tel:+91-40-23886047(O),Fax:+91-40-23895011(O),
Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile)
E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com
Web- http://oppamthadathil.tripod.com
***************************************************************

>________________________________
> From: Henry Gomersall
>To: Discussion of Numerical Python
>Sent: Wednesday, 8 May 2013 12:23 PM
>Subject: Re: [Numpy-discussion] printing array in tabular form
>
>
>On Wed, 2013-05-08 at 10:13 +0800, Sudheer Joseph wrote:
>> However I get the below error. Please tell me if there is anything I am missing.
>>
>>   File "read_reg_grd.py", line 22, in <module>
>>     np.savetxt("file.txt", IL.reshape(-1,5), fmt='%5d', delimiter=',')
>> AttributeError: 'list' object has no attribute 'reshape'
>
>IL is a list, not a numpy array. You can either convert the list to an
>array after you've filled it, using np.array(IL), or you can
>pre-allocate the array and fill it directly in the loop.
>
>Cheers,
>
>Henry
>
>_______________________________________________
>NumPy-Discussion mailing list
>NumPy-Discussion at scipy.org
>http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From davidmenhur at gmail.com  Thu May  9 04:59:37 2013
From: davidmenhur at gmail.com (Daπid)
Date: Thu, 9 May 2013 10:59:37 +0200
Subject: [Numpy-discussion] printing array in tabular form
In-Reply-To: <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com>
References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com>
Message-ID: 

On 9 May 2013 10:06, Sudheer Joseph wrote:
> However writing a formatted output looks to be a bit tricky with python
> relative to other programming languages.

If performance is not an issue, you could do it by hand, as you can always do in any programming language:

savefile = open('data.txt', 'w')
N = len(IL)

# step through the list five elements at a time
for start in xrange(0, N, 5):
    end = min(start + 5, N)
    print >> savefile, IL[start:end]

But this is actually more verbose, and once you get into the NumPy workflow, it is actually simple.
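For comparison, here is what the numpy route suggested earlier in the thread boils down to. A minimal sketch (the file name is made up, and IL is taken to be the 25-element example list from the original post, converted to an array as Henry suggested):

import numpy as np

IL = np.arange(1, 26)   # 1..25, as built in the original post
np.savetxt('data.txt', IL.reshape(-1, 5), fmt='%5d', delimiter=',')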
From heng at cantab.net Thu May 9 05:21:29 2013 From: heng at cantab.net (Henry Gomersall) Date: Thu, 09 May 2013 10:21:29 +0100 Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> Message-ID: <1368091289.31407.3.camel@farnsworth> On Thu, 2013-05-09 at 16:06 +0800, Sudheer Joseph wrote: > If I wanted to print below text in a file (for reading by another > program), it looks to be not an easy job....Hope new developments will > come and a userfriendly formatted out put method for pyton will > evolve. I don't understand what the problem is - this seems to be trivial to solve. You gave an example in Fortran; Is the transliteration to Python not acceptable? Is the output format specified by the receiving program? If not, there are loads of options for creating interoperable text files. Henry From robert.kern at gmail.com Thu May 9 06:21:30 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 9 May 2013 11:21:30 +0100 Subject: [Numpy-discussion] Integer type casting and OverflowError Message-ID: With master numpy (and back to 1.6.1, at least): [~] |1> np.int32(3054212286) -1240755010 It seems like at one time, this used to raise an OverflowError. We can see this in at least one place in scipy: https://github.com/scipy/scipy/blob/master/scipy/interpolate/fitpack.py#L912 Is this a regression in numpy, or should the code in scipy be fixed to use a different test? Even if it is technically a regression, it might be one of those "it's been broken so long, it's a feature now" regressions. -- Robert Kern From scott.sinclair.za at gmail.com Thu May 9 06:38:41 2013 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Thu, 9 May 2013 12:38:41 +0200 Subject: [Numpy-discussion] Integer type casting and OverflowError In-Reply-To: References: Message-ID: On 9 May 2013 12:21, Robert Kern wrote: > With master numpy (and back to 1.6.1, at least): > > [~] > |1> np.int32(3054212286) > -1240755010 > > > It seems like at one time, this used to raise an OverflowError. We can > see this in at least one place in scipy: > > https://github.com/scipy/scipy/blob/master/scipy/interpolate/fitpack.py#L912 No doubt I'm missing something, but isn't the OverflowError raised here https://github.com/scipy/scipy/blob/master/scipy/interpolate/fitpack.py#L40 and not in Numpy? Cheers, Scott From robert.kern at gmail.com Thu May 9 06:45:19 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 9 May 2013 11:45:19 +0100 Subject: [Numpy-discussion] Integer type casting and OverflowError In-Reply-To: References: Message-ID: On Thu, May 9, 2013 at 11:38 AM, Scott Sinclair wrote: > On 9 May 2013 12:21, Robert Kern wrote: >> With master numpy (and back to 1.6.1, at least): >> >> [~] >> |1> np.int32(3054212286) >> -1240755010 >> >> >> It seems like at one time, this used to raise an OverflowError. We can >> see this in at least one place in scipy: >> >> https://github.com/scipy/scipy/blob/master/scipy/interpolate/fitpack.py#L912 > > No doubt I'm missing something, but isn't the OverflowError raised > here https://github.com/scipy/scipy/blob/master/scipy/interpolate/fitpack.py#L40 > and not in Numpy? Heh. 
I wrote this email before I submitted the PR with that fix. :-)

Here is what the code used to look like:

https://github.com/scipy/scipy/blob/c1625b2cd026553cc9414ad12bb90c74a469e714/scipy/interpolate/fitpack.py#L912

--
Robert Kern

From scott.sinclair.za at gmail.com  Thu May  9 06:48:27 2013
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Thu, 9 May 2013 12:48:27 +0200
Subject: [Numpy-discussion] Integer type casting and OverflowError
In-Reply-To: 
References: 
Message-ID: 

On 9 May 2013 12:45, Robert Kern wrote:
> On Thu, May 9, 2013 at 11:38 AM, Scott Sinclair
> wrote:
>> On 9 May 2013 12:21, Robert Kern wrote:
>>> With master numpy (and back to 1.6.1, at least):
>>>
>>> [~]
>>> |1> np.int32(3054212286)
>>> -1240755010
>>>
>>>
>>> It seems like at one time, this used to raise an OverflowError. We can
>>> see this in at least one place in scipy:
>>>
>>> https://github.com/scipy/scipy/blob/master/scipy/interpolate/fitpack.py#L912
>>
>> No doubt I'm missing something, but isn't the OverflowError raised
>> here https://github.com/scipy/scipy/blob/master/scipy/interpolate/fitpack.py#L40
>> and not in Numpy?
>
> Heh. I wrote this email before I submitted the PR with that fix. :-)

Hah. I should have checked recent commits as well...

Cheers,
Scott

From pav at iki.fi  Thu May  9 06:50:28 2013
From: pav at iki.fi (Pauli Virtanen)
Date: Thu, 09 May 2013 13:50:28 +0300
Subject: [Numpy-discussion] Integer type casting and OverflowError
In-Reply-To: 
References: 
Message-ID: 

09.05.2013 13:21, Robert Kern kirjoitti:
> With master numpy (and back to 1.6.1, at least):
>
> [~]
> |1> np.int32(3054212286)
> -1240755010
>
> It seems like at one time, this used to raise an OverflowError. We can
> see this in at least one place in scipy:

It actually still raises an OverflowError if the integer is too big to fit into int64 (or maybe into int32 on 32-bit systems). The behavior is the same as in Numpy 1.2.0, so it has perhaps been like this forever.

So most likely, the code in Scipy was correct only for 32-bit systems.

--
Pauli Virtanen

From robert.kern at gmail.com  Thu May  9 07:24:37 2013
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 9 May 2013 12:24:37 +0100
Subject: [Numpy-discussion] Integer type casting and OverflowError
In-Reply-To: 
References: 
Message-ID: 

On Thu, May 9, 2013 at 11:50 AM, Pauli Virtanen wrote:
> 09.05.2013 13:21, Robert Kern kirjoitti:
>> With master numpy (and back to 1.6.1, at least):
>>
>> [~]
>> |1> np.int32(3054212286)
>> -1240755010
>>
>> It seems like at one time, this used to raise an OverflowError. We can
>> see this in at least one place in scipy:
>
> It actually still raises an OverflowError if the integer is too big to
> fit into int64 (or maybe into int32 on 32-bit systems). The behavior is
> the same as in Numpy 1.2.0, so it has perhaps been like this forever.
>
> So most likely, the code in Scipy was correct only for 32-bit systems.

Which does not surprise me in the slightest.

--
Robert Kern

From Tom.KACVINSKY at 3ds.com  Thu May  9 09:10:52 2013
From: Tom.KACVINSKY at 3ds.com (KACVINSKY Tom)
Date: Thu, 9 May 2013 13:10:52 +0000
Subject: [Numpy-discussion] www.scipy.org down?
Message-ID: 

I am looking for a release of numpy for Mac OS, but I can't reach the scipy web server. Is it down for maintenance?

Thanks,

Tom

This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged.
If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email. For other languages, go to http://www.3ds.com/terms/email-disclaimer -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Thu May 9 09:21:11 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 9 May 2013 14:21:11 +0100 Subject: [Numpy-discussion] www.scipy.org down? In-Reply-To: References: Message-ID: On Thu, May 9, 2013 at 2:10 PM, KACVINSKY Tom wrote: > I am looking for a release of numpy for Mac OS, but I can't reach the scipy > web server. IS it down for maintenance? It might have been. It's up now. The PyPI page is the best place to start looking for downloads: https://pypi.python.org/pypi/numpy/ The Mac binaries are hosted on SourceForge: http://sourceforge.net/projects/numpy/files/NumPy/ -- Robert Kern From Tom.KACVINSKY at 3ds.com Thu May 9 10:25:09 2013 From: Tom.KACVINSKY at 3ds.com (KACVINSKY Tom) Date: Thu, 9 May 2013 14:25:09 +0000 Subject: [Numpy-discussion] www.scipy.org down? In-Reply-To: References: Message-ID: Unfortunately, the Mac OS downloads won't work for us as they install into the system Python. We have a custom built Python (2.7.3) so I compiled from source. I noticed a few things: 1. The modules compiled from C source have an extension of .so, not .dylib 2. I installed nose so I could run the numpy tests, but 0 tests ran. 3. I configured numpy to use MKL as per the instructions on Intel's site, but the build still used the Accelerate framework provided by Apple. Given all of this, whom do I speak to regarding numpy builds on Mac OS? Thanks, Tom -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Robert Kern Sent: Thursday, May 09, 2013 9:21 AM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] www.scipy.org down? On Thu, May 9, 2013 at 2:10 PM, KACVINSKY Tom wrote: > I am looking for a release of numpy for Mac OS, but I can't reach the > scipy web server. IS it down for maintenance? It might have been. It's up now. The PyPI page is the best place to start looking for downloads: https://pypi.python.org/pypi/numpy/ The Mac binaries are hosted on SourceForge: http://sourceforge.net/projects/numpy/files/NumPy/ -- Robert Kern _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email. For other languages, go to http://www.3ds.com/terms/email-disclaimer From cournape at gmail.com Thu May 9 10:44:48 2013 From: cournape at gmail.com (David Cournapeau) Date: Thu, 9 May 2013 15:44:48 +0100 Subject: [Numpy-discussion] www.scipy.org down? 
In-Reply-To: 
References: 
Message-ID: 

On Thu, May 9, 2013 at 3:25 PM, KACVINSKY Tom wrote:
> Unfortunately, the Mac OS downloads won't work for us as they install into the system Python.

They actually install into a python installed from python.org, not the system one

> We have a custom built Python (2.7.3) so I compiled from source. I noticed a few things:
>
> 1. The modules compiled from C source have an extension of .so, not .dylib

.so is the usual extension for python extensions on mac os x

> 2. I installed nose so I could run the numpy tests, but 0 tests ran.

How did you run nose?

> 3. I configured numpy to use MKL as per the instructions on Intel's site, but the build still used the Accelerate framework provided by Apple.

Setting up the MKL is a bit tedious, but essentially, you need to create a site.cfg in the source tree that looks as follows:

[mkl]
library_dirs = "where the libraries are"
include_dirs = "where the headers are"
lapack_libs = mkl_lapack95
mkl_libs = mkl_intel,mkl_intel_thread, mkl_core, mkl_p4m, mkl_p4p

and (that's the undocumented/buggy part), set ATLAS=1 to disable accelerate.

David

From robert.kern at gmail.com  Thu May  9 10:46:25 2013
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 9 May 2013 15:46:25 +0100
Subject: [Numpy-discussion] www.scipy.org down?
In-Reply-To: 
References: 
Message-ID: 

On Thu, May 9, 2013 at 3:25 PM, KACVINSKY Tom wrote:
> Unfortunately, the Mac OS downloads won't work for us as they install into the system Python. We have a custom built Python (2.7.3) so I compiled from source. I noticed a few things:
>
> 1. The modules compiled from C source have an extension of .so, not .dylib

As do all Python extension modules.

> 2. I installed nose so I could run the numpy tests, but 0 tests ran.

Without more information about exactly what command you tried and in what context (like the current working directory), there is no way for us to help you except to point you to the relevant section of the README:

https://github.com/numpy/numpy/blob/master/README.txt#L17

> 3. I configured numpy to use MKL as per the instructions on Intel's site, but the build still used the Accelerate framework provided by Apple.
>
> Given all of this, whom do I speak to regarding numpy builds on Mac OS?

This mailing list.

--
Robert Kern

From Tom.KACVINSKY at 3ds.com  Thu May  9 11:01:09 2013
From: Tom.KACVINSKY at 3ds.com (KACVINSKY Tom)
Date: Thu, 9 May 2013 15:01:09 +0000
Subject: [Numpy-discussion] www.scipy.org down?
In-Reply-To: 
References: 
Message-ID: 

I ran the tests as per the instructions:

python -c 'import numpy; numpy.test()'

This is what I get for output:

tkacvins at macomsim> !python
python -Wd -c 'import numpy; numpy.test()'
Running unit tests for numpy
/rd/gen/tky/do_not_delete/Python/macpython27/lib/python2.7/site-packages/nose/util.py:14: DeprecationWarning: The compiler package is deprecated and removed in Python 3.x.
  from compiler.consts import CO_GENERATOR
NumPy version 1.6.2
NumPy is installed in /rd/gen/tky/do_not_delete/Python/macpython27/lib/python2.7/site-packages/numpy
Python version 2.7.3 (default, Oct 10 2012, 14:47:52) [GCC 4.2.1 (Based on Apple Inc. build 5658) (LLVM build 2336.9.00)]
nose version 0.11.4
----------------------------------------------------------------------
Ran 0 tests in 0.148s

OK

Tom

-----Original Message-----
From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Robert Kern
Sent: Thursday, May 09, 2013 10:46 AM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] www.scipy.org down?

On Thu, May 9, 2013 at 3:25 PM, KACVINSKY Tom wrote:
> Unfortunately, the Mac OS downloads won't work for us as they install into the system Python. We have a custom built Python (2.7.3) so I compiled from source. I noticed a few things:
>
> 1. The modules compiled from C source have an extension of .so, not .dylib

As do all Python extension modules.

> 2. I installed nose so I could run the numpy tests, but 0 tests ran.

Without more information about exactly what command you tried and in what context (like the current working directory), there is no way for us to help you except to point you to the relevant section of the README:

https://github.com/numpy/numpy/blob/master/README.txt#L17

> 3. I configured numpy to use MKL as per the instructions on Intel's site, but the build still used the Accelerate framework provided by Apple.
>
> Given all of this, whom do I speak to regarding numpy builds on Mac OS?

This mailing list.

--
Robert Kern
_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion at scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion

This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email. For other languages, go to http://www.3ds.com/terms/email-disclaimer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From Tom.KACVINSKY at 3ds.com  Thu May  9 11:06:57 2013
From: Tom.KACVINSKY at 3ds.com (KACVINSKY Tom)
Date: Thu, 9 May 2013 15:06:57 +0000
Subject: [Numpy-discussion] www.scipy.org down?
In-Reply-To: 
References: 
Message-ID: 

I assume this is a bottom post list, so my answers are below...

-----Original Message-----
From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of David Cournapeau
Sent: Thursday, May 09, 2013 10:45 AM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] www.scipy.org down?

On Thu, May 9, 2013 at 3:25 PM, KACVINSKY Tom wrote:
>> Unfortunately, the Mac OS downloads won't work for us as they install into the system Python.

>They actually install into a python installed from python.org, not the system one

>> We have a custom built Python (2.7.3) so I compiled from source. I noticed a few things:
>>
>> 1. The modules compiled from C source have an extension of .so, not .dylib

>.so is the usual extension for python extensions on mac os x

OK, this is good to know.

>> 2. I installed nose so I could run the numpy tests, but 0 tests ran.

>How did you run nose?

Per the instructions in the README file.

>> 3. I configured numpy to use MKL as per the instructions on Intel's site, but the build still used the Accelerate framework provided by Apple.
>Setting up the MKL is a bit tedious, but essentially, you need to create a site.cfg in the source tree that looks as follows:

>[mkl]
>library_dirs = "where the libraries are"
>include_dirs = "where the headers are"
>lapack_libs = mkl_lapack95
>mkl_libs = mkl_intel,mkl_intel_thread, mkl_core, mkl_p4m, mkl_p4p

>and (that's the undocumented/buggy part), set ATLAS=1 to disable accelerate.

This is what I have for my site.cfg file:

[mkl]
include_dirs = /u/users/tky/mklinc
library_dirs = /u/users/tky/mkllib
mkl_libs = mkl_sequential,mkl_intel_lp64,mkl_core
lapack_libs = mkl_sequential,mkl_intel_lp64,mkl_core

I am using MKL 11 update 3, which uses a different set of libraries for linking.

Where do I set ATLAS=1?

David
_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion at scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion

This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email. For other languages, go to http://www.3ds.com/terms/email-disclaimer

From chris.barker at noaa.gov  Fri May 10 00:14:47 2013
From: chris.barker at noaa.gov (Chris Barker - NOAA Federal)
Date: Thu, 9 May 2013 21:14:47 -0700
Subject: [Numpy-discussion] printing array in tabular form
In-Reply-To: <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com>
References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com>
Message-ID: 

On Thu, May 9, 2013 at 1:06 AM, Sudheer Joseph wrote:

> Thank you Gomersall,
>           However writing a formatted output looks to be a bit tricky with python relative to other programming languages.

this is just plain wrong -- working with text in python is as easy as, or easier than, in most languages.

Take a little time to learn a bit of python, and a bit of numpy, then come back if you can't figure it out.

But as a rule -- if it seems hard and/or awkward, you are probably doing it wrong!

-Chris

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ebressert at gmail.com  Fri May 10 01:47:56 2013
From: ebressert at gmail.com (Eli Bressert)
Date: Fri, 10 May 2013 15:47:56 +1000
Subject: [Numpy-discussion] Name change of the ptp() function
Message-ID: 

The peak-to-peak function, ptp(), is a simple and handy function to use, but its name runs against the PEP20 suggestions a bit. See this discussion for reference: http://d.pr/i/snXG

Based on the discussion, it may be a good idea to make the name more relevant. Some suggestions include span() or valuerange(). Among a few others who have discussed this on Github (https://github.com/numpy/numpy/issues/3292#issuecomment-17353454), valuerange() appears to be the most favored one.
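For anyone who hasn't come across it, ptp() simply gives the max - min spread along an axis, so whatever it ends up being called, the behaviour under discussion is:

>>> a = np.array([[4, 9, 2, 10],
...               [6, 9, 7, 12]])
>>> a.ptp(axis=1)   # per-row peak-to-peak
array([8, 6])
>>> a.ptp()         # same as a.max() - a.min()
10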
-Eli From sudheer.joseph at yahoo.com Fri May 10 05:12:03 2013 From: sudheer.joseph at yahoo.com (Sudheer Joseph) Date: Fri, 10 May 2013 17:12:03 +0800 (SGT) Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <1368091289.31407.3.camel@farnsworth> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> <1368091289.31407.3.camel@farnsworth> Message-ID: <1368177123.55356.YahooMailNeo@web193405.mail.sg3.yahoo.com> Thank you for the response. In fact the program which should read this file requires it in a specified format, which should look like

IL = 1,2,3,4,5
     1,2,3,4,5
     1,2,3,4,5
JL = 1,2,3,4,5
     1,2,3,4,5
     1,2,3,4,5

So what I followed from the mail was that np.savetxt can save it to a file, but there is no option to write a string, then leave space, and then write the array in the above fashion. I searched for a while but could not get any solution. with best regards, Sudheer *************************************************************** Sudheer Joseph Indian National Centre for Ocean Information Services Ministry of Earth Sciences, Govt. of India POST BOX NO: 21, IDA Jeedeemetla P.O. Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55 Tel:+91-40-23886047(O),Fax:+91-40-23895011(O), Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile) E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com Web- http://oppamthadathil.tripod.com *************************************************************** >________________________________ > From: Henry Gomersall >To: Discussion of Numerical Python >Sent: Thursday, 9 May 2013 2:51 PM >Subject: Re: [Numpy-discussion] printing array in tabular form > >On Thu, 2013-05-09 at 16:06 +0800, Sudheer Joseph wrote: >> If I wanted to print below text in a file (for reading by another >> program), it looks to be not an easy job....Hope new developments will >> come and a user-friendly formatted output method for Python will >> evolve. > >I don't understand what the problem is - this seems to be trivial to >solve. You gave an example in Fortran; is the transliteration to Python >not acceptable? > >Is the output format specified by the receiving program? If not, there >are loads of options for creating interoperable text files. > >Henry > >_______________________________________________ >NumPy-Discussion mailing list >NumPy-Discussion at scipy.org >http://mail.scipy.org/mailman/listinfo/numpy-discussion From sudheer.joseph at yahoo.com Fri May 10 05:14:37 2013 From: sudheer.joseph at yahoo.com (Sudheer Joseph) Date: Fri, 10 May 2013 17:14:37 +0800 (SGT) Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> Message-ID: <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> Thank you, but I was looking for a format statement like write(*,"(A,5F8.3)") with best regards, Sudheer
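For comparison, the nearest direct Python transliteration of that Fortran edit descriptor is a one-liner. A sketch only, with s and the x values as placeholders:

# Fortran: write(*,"(A,5F8.3)") s, x1, x2, x3, x4, x5
s = 'IL = '
xs = [1.0, 2.0, 3.0, 4.0, 5.0]   # placeholder values
print("%s" % s + "".join("%8.3f" % x for x in xs))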
*************************************************************** Sudheer Joseph Indian National Centre for Ocean Information Services Ministry of Earth Sciences, Govt. of India POST BOX NO: 21, IDA Jeedeemetla P.O. Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55 Tel:+91-40-23886047(O),Fax:+91-40-23895011(O), Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile) E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com Web- http://oppamthadathil.tripod.com *************************************************************** >________________________________ > From: Daπid >To: Discussion of Numerical Python >Sent: Thursday, 9 May 2013 2:29 PM >Subject: Re: [Numpy-discussion] printing array in tabular form > >On 9 May 2013 10:06, Sudheer Joseph wrote: >> However writing a formatted output looks to be a bit tricky with Python >> relative to other programming languages. > >If performance is not an issue, you could do it by hand, as you can >always do in any programming language: > >savefile = open('data.txt', 'w')
>N = len(IL)
>
>for start in xrange(0, N, 5):
>    end = min(start + 5, N)
>    print >> savefile, IL[start:end]
>
>But this is actually more verbose, and once you get into the NumPy >workflow, it is actually simple. >_______________________________________________ >NumPy-Discussion mailing list >NumPy-Discussion at scipy.org >http://mail.scipy.org/mailman/listinfo/numpy-discussion From daniele at grinta.net Fri May 10 05:42:46 2013 From: daniele at grinta.net (Daniele Nicolodi) Date: Fri, 10 May 2013 11:42:46 +0200 Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> Message-ID: <518CC116.2080307@grinta.net> On 10/05/2013 11:14, Sudheer Joseph wrote: > However writing a formatted output looks to be a bit tricky with > Python relative to other programming languages. ... > I was looking for a format statement like write(*,"(A,5F8.3)") Before denigrating a programming language I would make sure to have a basic understanding of it. Every language is going to be tricky if you approach it with the mindset of Fortran programming. The output format you are trying to obtain is easy in Fortran because it is how the default text output formatting is designed. Of course obtaining it with a different programming language / numerical library takes some more effort. But it is not tricky at all. Cheers, Daniele From thomas.robitaille at gmail.com Fri May 10 06:34:13 2013 From: thomas.robitaille at gmail.com (Thomas Robitaille) Date: Fri, 10 May 2013 11:34:13 +0100 Subject: [Numpy-discussion] Scalar output from sub-classed Numpy array Message-ID: Hi everyone, I am currently trying to write a sub-class of Numpy ndarray, but am running into issues for functions that return scalar results rather than array results.
For example, in the following case:

import numpy as np

class TestClass(np.ndarray):

    def __new__(cls, input_array, unit=None):
        return np.asarray(input_array).view(cls)

    def __array_finalize__(self, obj):
        if obj is None:
            return

    def __array_wrap__(self, out_arr, context=None):
        return np.ndarray.__array_wrap__(self, out_arr, context)

I get:

In [4]: a = TestClass([1,2,3])

In [5]: print type(np.dot(a,a))
<type 'numpy.int64'>

In [6]: a = TestClass([[1,2],[1,2]])

In [7]: print type(np.dot(a,a))
<class '__main__.TestClass'>

that is, in the case where the output is a scalar, it doesn't get wrapped, while in the case where the output is an array, it does. Could anyone explain this behavior to me, and most importantly, is there a way around this, to have the above example return a wrapped 0-d TestClass array instead of a Numpy int64? Thanks, Tom From sudheer.joseph at yahoo.com Fri May 10 07:20:07 2013 From: sudheer.joseph at yahoo.com (Sudheer Joseph) Date: Fri, 10 May 2013 19:20:07 +0800 (SGT) Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <518CC116.2080307@grinta.net> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> <518CC116.2080307@grinta.net> Message-ID: <1368184807.43447.YahooMailNeo@web193405.mail.sg3.yahoo.com> Hi, I am trying to learn Python after feeling its utility in coding and also reading a bit about its potential; only, please do not put words into my mouth like below. >> Before denigrating a programming language If someone has a quick way I would like to learn from them, or get a reference where the formatting part is described, which was my intention while posting here. As I have been using Fortran, I just tried to use it to explain my requirement. with best regards, Sudheer From: Daniele Nicolodi To: numpy-discussion at scipy.org >Sent: Friday, 10 May 2013 3:12 PM >Subject: Re: [Numpy-discussion] printing array in tabular form > >On 10/05/2013 11:14, Sudheer Joseph wrote: > >> However writing a formatted output looks to be a bit tricky with >> Python relative to other programming languages. > >... > >> I was looking for a format statement like write(*,"(A,5F8.3)") > >Before denigrating a programming language I would make sure to have a >basic understanding of it. Every language is going to be tricky if you >approach it with the mindset of Fortran programming. > >The output format you are trying to obtain is easy in Fortran because >it is how the default text output formatting is designed. Of course >obtaining it with a different programming language / numerical library >takes some more effort. But it is not tricky at all.
> > >Cheers, >Daniele > >_______________________________________________ >NumPy-Discussion mailing list >NumPy-Discussion at scipy.org >http://mail.scipy.org/mailman/listinfo/numpy-discussion From heng at cantab.net Fri May 10 07:59:07 2013 From: heng at cantab.net (Henry Gomersall) Date: Fri, 10 May 2013 12:59:07 +0100 Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> Message-ID: <1368187147.12442.5.camel@farnsworth> On Fri, 2013-05-10 at 17:14 +0800, Sudheer Joseph wrote: > Thank you, > but I was looking for a format statement like > write(*,"(A,5F8.3)") > with best regards, > Sudheer How about the following:

print('IL = ' + (('%d,' * 5)[:-1] + '\n    ') * 5 % tuple(IL))

If instead of a list IL you had some arbitrary 2D array, a, you could do (with 2 lines for clarity):

print_string = ('a  = ' + (('%d,' * a.shape[1])[:-1] + '\n    ') * a.shape[0]
                % tuple(a.T.ravel()))
print(print_string)

I'll leave it as an exercise for you to put that into a file. hen From derek at astro.physik.uni-goettingen.de Fri May 10 08:40:15 2013 From: derek at astro.physik.uni-goettingen.de (Derek Homeier) Date: Fri, 10 May 2013 14:40:15 +0200 Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <1368184807.43447.YahooMailNeo@web193405.mail.sg3.yahoo.com> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> <518CC116.2080307@grinta.net> <1368184807.43447.YahooMailNeo@web193405.mail.sg3.yahoo.com> Message-ID: <6C23BD92-9FFF-4E13-8FD1-8B194707C057@astro.physik.uni-goettingen.de> On 10.05.2013, at 1:20PM, Sudheer Joseph wrote: > If someone has a quick way I would like to learn from them, or get a reference > where the formatting part is described, which was > my intention while posting here. As I have been using Fortran, I just tried > to use it to explain my requirement > Admittedly the formatting options in Python can be confusing to beginners, precisely because they are much more powerful than in many other languages. As already pointed out, formats of the type '(5i5)' are very common to Fortran programs and thus readily supported by the language. np.savetxt is just a convenience function to support a number of similarly common output types, and it can create csv, tab-separated, or plenty of other outputs from a numpy array just out of the box. But you added to the confusion as you did not make it clear that you were not just requiring a plain csv file as your Fortran example would create (and the first version did not even have the commas); since this is a rather non-standard form you will just have to write a short loop yourself, whether you are using Fortran or Python.
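One such loop, as a sketch (the requested layout is quoted just below; IL here is assumed to be a flat list of integers):

IL = list(range(1, 16))   # placeholder data
label = 'IL = '
with open('data.txt', 'w') as out:
    for i in range(0, len(IL), 5):
        row = ','.join(str(v) for v in IL[i:i + 5])
        # label the first line, indent the continuation lines under it
        out.write((label if i == 0 else ' ' * len(label)) + row + '\n')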
> Infact the program which should read this file requires it in specified format which should look like > IL = 1,2,3,4,5 > 1,2,3,4,5 > 1,2,3,4,5 > The formats are all documented http://docs.python.org/2/library/string.html#format-specification-mini-language one important thing to know is that you can pretty much add (i.e. concatenate) them like strings: print(("%6s"+4*"%d,"+"%d\n") % (("IL = ",)+tuple(IL[:5]))) or, perhaps a bit clearer: fmt = "%6s"+4*"%d,"+"%d\n" print_t = ("IL = ",)+tuple(IL[:5]) print(fmt % print_t) The other important bit to keep in mind is that all arguments have to be passed as tuples. This should allow you to write a loop to print with a "header" or an empty header column for the subsequent lines as you see fit. Except for the string field which is explicitly formatted "%s" here, this is mostly equivalent to the example Henry just posted. HTH, Derek From pav at iki.fi Fri May 10 08:44:27 2013 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 10 May 2013 15:44:27 +0300 Subject: [Numpy-discussion] Name change of the ptp() function In-Reply-To: References: Message-ID: 10.05.2013 08:47, Eli Bressert kirjoitti: [clip: renaming ptp] > valuerange() appears to the best most favored one. range(), arange(), valuerange() I'm not really a big fan of changing the name of this function at this stage, as it seems to me that whether it's a gain or not is somewhat a matter of taste. -- Pauli Virtanen From jslavin at cfa.harvard.edu Fri May 10 08:45:01 2013 From: jslavin at cfa.harvard.edu (Jonathan Slavin) Date: Fri, 10 May 2013 08:45:01 -0400 Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> Message-ID: <1368189901.19261.16.camel@shevek> Sudheer, This is not really numpy specific. There are many options for output formatting in python. For the specific question you have, you could do: print '{0}{1:8.3f}{2:8.3f}{3:8.3f}{4:8.3f}{5:8.3f}'.format(s,x1,x2,x3,x4,x5) format is a built-in python string method (see python docs). The one thing that I will agree with you on is that, as far as I know, there is no repeat count mechanism. There are tricky ways around that, e.g. fmt = '{0}' + ''.join(['{'+str(i)+':8.3f}' for i in range(1,6)]) print fmt.format(s,x1,x2,x3,x4,x5) though not as simple as the fortran output statement. Jon On Fri, 2013-05-10 at 17:14 +0800, Sudheer Joseph wrote: > Thank you, > But I was looking for a format statement likw > write(*,"(A,5F8.3)") > with best regards, > Sudheer > > *************************************************************** > Sudheer Joseph > Indian National Centre for Ocean Information Services > Ministry of Earth Sciences, Govt. of India > POST BOX NO: 21, IDA Jeedeemetla P.O. 
> Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55 > Tel:+91-40-23886047(O),Fax:+91-40-23895011(O), > Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile) > E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com > Web- http://oppamthadathil.tripod.com > *************************************************************** > > ______________________________________________________________ > From: Da?id > To: Discussion of Numerical Python > > Sent: Thursday, 9 May 2013 2:29 PM > Subject: Re: [Numpy-discussion] printing array in tabular form > > > On 9 May 2013 10:06, Sudheer Joseph > wrote: > > However writing a formatted out put looks to be bit tricky > with python > > relative to other programing languages. > > If performance is not an issue, you could do it by hand, as > you can > always do in any programming language: > > > savefile = open('data.txt', 'w') > N = len(IL) > > for start in xrange(N/5): > if start+5 > N: > end = N > else: > end = start+5 > print >> savefile, IL[start : end] > > > But this is actually more verbose, and once you get into NumPy > workflow, it is actually simple. > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > -- ______________________________________________________________ Jonathan D. Slavin Harvard-Smithsonian CfA jslavin at cfa.harvard.edu 60 Garden Street, MS 83 phone: (617) 496-7981 Cambridge, MA 02138-1516 cell: (781) 363-0035 USA ______________________________________________________________ From daniele at grinta.net Fri May 10 08:51:55 2013 From: daniele at grinta.net (Daniele Nicolodi) Date: Fri, 10 May 2013 14:51:55 +0200 Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <1368184807.43447.YahooMailNeo@web193405.mail.sg3.yahoo.com> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> <518CC116.2080307@grinta.net> <1368184807.43447.YahooMailNeo@web193405.mail.sg3.yahoo.com> Message-ID: <518CED6B.8060105@grinta.net> On 10/05/2013 13:20, Sudheer Joseph wrote: > Hi, > I am trying to learn Python after feeling its utility in coding and > also reading a bit aboutits potential only, please do not put words > in to my mouth like below. I didn't put words in your mouth, I simply quoted emails you sent to the list and gave my interpretation of what you wrote. >>> Before denigrating a programming language > > If some one has a quick way I would like to learn from them or get a referecence > where the formatting part is described which was > my intention while posting here. As I have been using fortran I just tried > to use it to explain my requirement For references about string formatting in Python: http://docs.python.org/2/library/string.html#formatstrings http://docs.python.org/2/library/stdtypes.html#string-formatting for the numpy array to text formatting: http://docs.scipy.org/doc/numpy/reference/generated/numpy.savetxt.html writing a function to do what you ask is trivial. Unfortunately there is no "format the thing as I wish" function. 
If you wish to format numpy arrays preceding them with a variable name, the following is a possible solution that gives the same formatting as in your example:

import numpy as np
import sys

def format(out, v, name):
    header = "{} = ".format(name)
    out.write(header)
    np.savetxt(out, v, fmt="%d", delimiter=", ",
               newline="\n" + " " * len(header))
    out.write("\n")

IL = np.array([range(5), ] * 5)
format(sys.stdout, IL, "IL")

Cheers, Daniele From ebressert at gmail.com Fri May 10 09:04:35 2013 From: ebressert at gmail.com (Eli Bressert) Date: Fri, 10 May 2013 23:04:35 +1000 Subject: [Numpy-discussion] Name change of the ptp() function In-Reply-To: References: Message-ID: That's a good point regarding the range function names. But, I think the issue still stands on the readability of the ptp function. Regarding PEP20 it's stated that "readability counts." If you regard what ptp is supposed to replace, array.max() - array.min(), the aforementioned follows PEP20 better as it is more readable. If valuerange() is not an acceptable name, maybe span()? -Eli On Fri, May 10, 2013 at 10:44 PM, Pauli Virtanen wrote: > 10.05.2013 08:47, Eli Bressert wrote: > [clip: renaming ptp] >> valuerange() appears to be the most favored one. > > range(), arange(), valuerange() > > I'm not really a big fan of changing the name of this function at this > stage, as it seems to me that whether it's a gain or not is somewhat a > matter of taste. > > -- > Pauli Virtanen > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From pav at iki.fi Fri May 10 09:17:01 2013 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 10 May 2013 16:17:01 +0300 Subject: [Numpy-discussion] Name change of the ptp() function In-Reply-To: References: Message-ID: 10.05.2013 16:04, Eli Bressert wrote: > That's a good point regarding the range function names. But, I think > the issue still stands on the readability of the ptp function. > Regarding PEP20 it's stated that "readability counts." I think here it has to be kept in mind that this function has been called ptp() already in Numeric, i.e., for the last 10+ years. This is the first proposal to change it that I know of, so I think keeping the API the same weighs against changing it due to aesthetic reasons. -- Pauli Virtanen From robert.kern at gmail.com Fri May 10 09:17:51 2013 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 10 May 2013 14:17:51 +0100 Subject: [Numpy-discussion] Name change of the ptp() function In-Reply-To: References: Message-ID: On Fri, May 10, 2013 at 2:04 PM, Eli Bressert wrote: > That's a good point regarding the range function names. But, I think > the issue still stands on the readability of the ptp function. > Regarding PEP20 it's stated that "readability counts." > > If you regard what ptp is supposed to replace, array.max() - > array.min(), the aforementioned follows PEP20 better as it is more > readable. If valuerange() is not an acceptable name, maybe span()? Sure, it's probably more readable, and that would be the controlling factor if this were a new function. But that's not really the operative question here. Are the gains in readability worth the nontrivial costs of deprecating and removing the old name? I, for one, am generally not in favor of such deprecations.
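For scale, the mechanical side of a rename-with-alias is small; the nontrivial cost is the churn in downstream code and documentation, not the code itself. A sketch, with valuerange purely hypothetical:

import numpy as np

def valuerange(a, axis=None, out=None):
    # hypothetical new name; simply forwards to the existing function
    return np.ptp(a, axis=axis, out=out)

# keep the old spelling working while pointing users at the new one
ptp = np.deprecate(valuerange, old_name='ptp', new_name='valuerange')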
-- Robert Kern From ralf.gommers at gmail.com Fri May 10 09:27:07 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 10 May 2013 15:27:07 +0200 Subject: [Numpy-discussion] Name change of the ptp() function In-Reply-To: References: Message-ID: On Fri, May 10, 2013 at 3:17 PM, Robert Kern wrote: > On Fri, May 10, 2013 at 2:04 PM, Eli Bressert wrote: > > That's a good point regarding the range function names. But, I think > > the issue still stands on the readability of the ptp function. > > Regarding PEP20 it's stated that "readability counts." > > > > If you regard what ptp is supposed to replace, array.max() - > > array.min(), the aforementioned follows PEP20 better as it is more > > readable. If valuerange() is not an acceptable name, maybe span()? > > Sure, it's probably more readable, and that would be the controlling > factor if this were a new function. But that's not really the > operative question here. Are the gains in readability worth the > nontrivial costs of deprecating and removing the old name? I, for one, > am generally not in favor of such deprecations. > That's not the only option though. I'm -1 on deprecation, but +0 on renaming and keeping ptp as an alias. The function name is really quite poor. Ralf From derek at astro.physik.uni-goettingen.de Fri May 10 09:36:38 2013 From: derek at astro.physik.uni-goettingen.de (Derek Homeier) Date: Fri, 10 May 2013 15:36:38 +0200 Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <518CED6B.8060105@grinta.net> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> <518CC116.2080307@grinta.net> <1368184807.43447.YahooMailNeo@web193405.mail.sg3.yahoo.com> <518CED6B.8060105@grinta.net> Message-ID: <5D08B318-6614-4A78-A448-A89936B754A6@astro.physik.uni-goettingen.de> On 10.05.2013, at 2:51PM, Daniele Nicolodi wrote: > If you wish to format numpy arrays preceding them with a variable name, > the following is a possible solution that gives the same formatting as > in your example:
>
> import numpy as np
> import sys
>
> def format(out, v, name):
>     header = "{} = ".format(name)
>     out.write(header)
>     np.savetxt(out, v, fmt="%d", delimiter=", ",
>                newline="\n" + " " * len(header))
>     out.write("\n")
>
> IL = np.array([range(5), ] * 5)
> format(sys.stdout, IL, "IL")

That is a quite ingenious way to use savetxt functionality to write that extra column! Only two comments: Don't call that function "format", as it would mask the 'format' builtin! In the present version it will only work with a file handle; to print it to a file you would need to pass it as format(open(fname, 'a'), ...), or check for that case inside the function. Cheers, Derek From cournape at gmail.com Fri May 10 09:53:33 2013 From: cournape at gmail.com (David Cournapeau) Date: Fri, 10 May 2013 14:53:33 +0100 Subject: [Numpy-discussion] Name change of the ptp() function In-Reply-To: References: Message-ID: On Fri, May 10, 2013 at 2:27 PM, Ralf Gommers wrote: > On Fri, May 10, 2013 at 3:17 PM, Robert Kern wrote: >> On Fri, May 10, 2013 at 2:04 PM, Eli Bressert wrote: >> > That's a good point regarding the range function names. But, I think
>> > the issue still stands on the readability of the ptp function. >> > Regarding PEP20 it's stated that "readability counts." >> > >> > If you regard what ptp is supposed to replace, array.max() - >> > array.min(), the aforementioned follows PEP20 better as it is more >> > readable. If valuerange() is not an acceptable name, maybe span()? >> >> Sure, it's probably more readable, and that would be the controlling >> factor if this were a new function. But that's not really the >> operative question here. Are the gains in readability worth the >> nontrivial costs of deprecating and removing the old name? I, for one, >> am generally not in favor of such deprecations. > > That's not the only option though. I'm -1 on deprecation, but +0 on renaming > and keeping ptp as an alias. The function name is really quite poor. I think it is a matter of context. I don't know the history of that function, but coming from a signal processing background, its meaning was obvious to me. Peak-to-peak is a very common operation when dealing with audio files, for example (that's how most wave displays work, AFAIK). I am certainly -1 on the deprecation as well, and -0 on the alias. David From davidmenhur at gmail.com Fri May 10 09:55:31 2013 From: davidmenhur at gmail.com (Daπid) Date: Fri, 10 May 2013 15:55:31 +0200 Subject: [Numpy-discussion] Name change of the ptp() function In-Reply-To: References: Message-ID: On May 10, 2013 3:18 PM, "Robert Kern" wrote: > Sure, it's probably more readable I am not sure of it. I would have to check the docs to see what it means. The mathematical term is range, but it already has a meaning in Python, so it is not a good way to go, with valuerange perhaps being the compromise, but not really clear by itself. In some areas, nevertheless, ptp is the standard notation, as it is in electronics - and maybe that is why it made its way into Numeric. From nouiz at nouiz.org Fri May 10 12:08:35 2013 From: nouiz at nouiz.org (Frédéric Bastien) Date: Fri, 10 May 2013 12:08:35 -0400 Subject: [Numpy-discussion] __array_priority__ don't work for gt, lt, ... operator Message-ID: Hi, it popped up again on the Theano mailing list that this doesn't work: np.arange(10) <= a_theano_vector. The reason is that __array_priority__ isn't respected for that class of operations. This page explains the problem and gives a workaround: http://stackoverflow.com/questions/14619449/how-can-i-override-comparisons-between-numpys-ndarray-and-my-type The workaround is to make a Python function that will decide which version of the comparator to call and do the call, and then tell NumPy to use that function instead of its current one with: np.set_numeric_ops(...) But if we do that, when we import Theano we will slow down all normal numpy comparisons for the user: when <= is executed, first numpy C code runs, which calls the Python function to decide which version to use; then, if both operands are numpy ndarrays, it calls numpy C code again. That isn't a good solution. We could do the same override in C, but then Theano wouldn't work the same when there isn't a C++ compiler. That isn't nice. What do you think of changing them to check for __array_priority__ before doing the comparison? Frédéric
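For readers who don't want to chase the link, the Stack Overflow workaround boils down to something like the following sketch; only less_equal is shown, and the dispatch rule is simplified:

import numpy as np

def less_equal_with_priority(a, b, *args, **kwargs):
    # hand the comparison to the right operand when it claims higher priority
    if getattr(b, '__array_priority__', 0.0) > getattr(a, '__array_priority__', 0.0):
        return b >= a
    return default_ops['less_equal'](a, b, *args, **kwargs)

# install the dispatcher; set_numeric_ops returns the previous ops
default_ops = np.set_numeric_ops(less_equal=less_equal_with_priority)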
From arnaldorusso at gmail.com Fri May 10 13:32:42 2013 From: arnaldorusso at gmail.com (Arnaldo Russo) Date: Fri, 10 May 2013 14:32:42 -0300 Subject: [Numpy-discussion] pyhdf packaging In-Reply-To: References: Message-ID: Hi Andreas, This packaging would be very useful! How can I help with this? pyhdf is very important because HDF4-EOS files do not open with other packages, only with pyhdf and gdal. best, Arnaldo --- *Arnaldo D'Amaral Pereira Granja Russo* Lab. de Estudos dos Oceanos e Clima Instituto de Oceanografia - FURG 2012/11/6 Andreas Hilboll > Hi, > > I would like to package pyhdf for Ubuntu and make the package publicly > available. Since the license is not totally clear to me (I cannot find any > information in the sources, and the cheeseshop says "public", which > doesn't mean anything to me), I tried to contact the maintainer, Andre > Gosselin, however the email bounces, so I guess he's gone. > > Can anyone point me to how to proceed from here? > > Cheers, Andreas. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From charlesr.harris at gmail.com Fri May 10 13:34:26 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 10 May 2013 11:34:26 -0600 Subject: [Numpy-discussion] __array_priority__ don't work for gt, lt, ... operator In-Reply-To: References: Message-ID: On Fri, May 10, 2013 at 10:08 AM, Frédéric Bastien wrote: > Hi, > > it popped up again on the Theano mailing list that this doesn't work: > > np.arange(10) <= a_theano_vector. > > The reason is that __array_priority__ isn't respected for that class of > operations. > > This page explains the problem and gives a workaround: > > http://stackoverflow.com/questions/14619449/how-can-i-override-comparisons-between-numpys-ndarray-and-my-type > > The workaround is to make a Python function that will decide which > version of the comparator to call and do the call, and then tell NumPy to > use that function instead of its current one with: np.set_numeric_ops(...) > > But if we do that, when we import Theano we will slow down all normal > numpy comparisons for the user: when <= is executed, first numpy C code > runs, which calls the Python function to decide which version to use; > then, if both operands are numpy ndarrays, it calls numpy C code again. > > That isn't a good solution. We could do the same override in C, but then > Theano wouldn't work the same when there isn't a C++ compiler. That isn't nice. > > What do you think of changing them to check for __array_priority__ before > doing the comparison? > This looks like an oversight and should be fixed. Chuck From nouiz at nouiz.org Fri May 10 15:35:25 2013 From: nouiz at nouiz.org (Frédéric Bastien) Date: Fri, 10 May 2013 15:35:25 -0400 Subject: [Numpy-discussion] __array_priority__ don't work for gt, lt, ... operator In-Reply-To: References: Message-ID: I'm trying to do it, but each time I want to test something, it takes a long time to rebuild numpy. Is there a way to avoid recompiling everything for each test?
thanks Fred On Fri, May 10, 2013 at 1:34 PM, Charles R Harris wrote: > > > On Fri, May 10, 2013 at 10:08 AM, Fr?d?ric Bastien wrote: > >> Hi, >> >> it popped again on the Theano mailing list that this don't work: >> >> np.arange(10) <= a_theano_vector. >> >> The reason is that __array_priority__ isn't respected for that class of >> operation. >> >> This page explain the problem and give a work around: >> >> >> http://stackoverflow.com/questions/14619449/how-can-i-override-comparisons-between-numpys-ndarray-and-my-type >> >> The work around is to make a python function that will decide witch >> version of the comparator to call and do the call. Then we tell NumPy to >> use that function instead of its current function with: >> np.set_numeric_ops(...) >> >> But if we do that, when we import theano, we will slow down all normal >> numpy comparison for the user, as when <= is execute, first there will be >> numpy c code executed, that will call the python function to decide witch >> version to do, then if it is 2 numpy ndarray, it will call again numpy c >> code. >> >> That isn't a good solution. We could do the same override in C, but then >> theano work the same when there isn't a c++ compiler. That isn't nice. >> >> What do you think of changing them to check for __array_priority__ before >> doing the comparison? >> > > This looks like an oversight and should be fixed. > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Tom.KACVINSKY at 3ds.com Fri May 10 15:41:25 2013 From: Tom.KACVINSKY at 3ds.com (KACVINSKY Tom) Date: Fri, 10 May 2013 19:41:25 +0000 Subject: [Numpy-discussion] tests not running Message-ID: Here is my set up: Mac OS 10.7.5 Xcode 4.5.1 Intel Fortran 12.1 Python 2.7.3 built from source Numpy 1.6.2 built from source, using MKL 11.0 nose 0.11.4 installed I run the numpy tests as documented (python -c 'import numpy; numpy.test()'), but get this output: tkacvins at macomsim> python -c 'import numpy; numpy.test()' Running unit tests for numpy NumPy version 1.6.2 NumPy is installed in /rd/gen/tky/do_not_delete/Python/macpython27/lib/python2.7/site-packages/numpy Python version 2.7.3 (default, Oct 10 2012, 14:47:52) [GCC 4.2.1 (Based on Apple Inc. build 5658) (LLVM build 2336.9.00)] nose version 0.11.4 ---------------------------------------------------------------------- Ran 0 tests in 0.025s OK Any diagnostics or options I can pass to the tests to see what is going on? This is rather odd, I thought the tests would run and possibly fail. I didn't expect 0 tests to run. Thanks, Tom This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email. 
For other languages, go to http://www.3ds.com/terms/email-disclaimer From sebastian at sipsolutions.net Fri May 10 16:03:09 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Fri, 10 May 2013 22:03:09 +0200 Subject: [Numpy-discussion] __array_priority__ don't work for gt, lt, ... operator In-Reply-To: References: Message-ID: <1368216189.2567.10.camel@sebastian-laptop> On Fri, 2013-05-10 at 15:35 -0400, Fr?d?ric Bastien wrote: > I'm trying to do it, but each time I want to test something, it takes > a long time to rebuild numpy to test it. Is there a way to don't > recompile everything for each test? > Are you using current master? It defaults to use ENABLE_SEPARATE_COMPILATION enviroment variable, which, together with ccache, makes most changes in numpy compile fast for me. - Sebastian > thanks > > Fred > > > > On Fri, May 10, 2013 at 1:34 PM, Charles R Harris > wrote: > > > On Fri, May 10, 2013 at 10:08 AM, Fr?d?ric Bastien > wrote: > Hi, > > > it popped again on the Theano mailing list that this > don't work: > > > np.arange(10) <= a_theano_vector. > > > The reason is that __array_priority__ isn't respected > for that class of operation. > > > > This page explain the problem and give a work around: > > > http://stackoverflow.com/questions/14619449/how-can-i-override-comparisons-between-numpys-ndarray-and-my-type > > > The work around is to make a python function that will > decide witch version of the comparator to call and do > the call. Then we tell NumPy to use that function > instead of its current function with: > np.set_numeric_ops(...) > > But if we do that, when we import theano, we will slow > down all normal numpy comparison for the user, as when > <= is execute, first there will be numpy c code > executed, that will call the python function to decide > witch version to do, then if it is 2 numpy ndarray, it > will call again numpy c code. > > > That isn't a good solution. We could do the same > override in C, but then theano work the same when > there isn't a c++ compiler. That isn't nice. > > > What do you think of changing them to check for > __array_priority__ before doing the comparison? > > This looks like an oversight and should be fixed. > > Chuck > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From nouiz at nouiz.org Fri May 10 17:02:48 2013 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Fri, 10 May 2013 17:02:48 -0400 Subject: [Numpy-discussion] __array_priority__ don't work for gt, lt, ... operator In-Reply-To: <1368216189.2567.10.camel@sebastian-laptop> References: <1368216189.2567.10.camel@sebastian-laptop> Message-ID: thanks, I'll look at it. I made a PR: https://github.com/numpy/numpy/pull/3324 Where should I put the tests about this? thanks Fred On Fri, May 10, 2013 at 4:03 PM, Sebastian Berg wrote: > On Fri, 2013-05-10 at 15:35 -0400, Fr?d?ric Bastien wrote: > > I'm trying to do it, but each time I want to test something, it takes > > a long time to rebuild numpy to test it. Is there a way to don't > > recompile everything for each test? > > > Are you using current master? It defaults to use > ENABLE_SEPARATE_COMPILATION enviroment variable, which, together with > ccache, makes most changes in numpy compile fast for me. 
> > - Sebastian > > > > thanks > > > > Fred > > > > > > > > On Fri, May 10, 2013 at 1:34 PM, Charles R Harris > > wrote: > > > > > > On Fri, May 10, 2013 at 10:08 AM, Fr?d?ric Bastien > > wrote: > > Hi, > > > > > > it popped again on the Theano mailing list that this > > don't work: > > > > > > np.arange(10) <= a_theano_vector. > > > > > > The reason is that __array_priority__ isn't respected > > for that class of operation. > > > > > > > > This page explain the problem and give a work around: > > > > > > > http://stackoverflow.com/questions/14619449/how-can-i-override-comparisons-between-numpys-ndarray-and-my-type > > > > > > The work around is to make a python function that will > > decide witch version of the comparator to call and do > > the call. Then we tell NumPy to use that function > > instead of its current function with: > > np.set_numeric_ops(...) > > > > But if we do that, when we import theano, we will slow > > down all normal numpy comparison for the user, as when > > <= is execute, first there will be numpy c code > > executed, that will call the python function to decide > > witch version to do, then if it is 2 numpy ndarray, it > > will call again numpy c code. > > > > > > That isn't a good solution. We could do the same > > override in C, but then theano work the same when > > there isn't a c++ compiler. That isn't nice. > > > > > > What do you think of changing them to check for > > __array_priority__ before doing the comparison? > > > > This looks like an oversight and should be fixed. > > > > Chuck > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Fri May 10 17:24:07 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 10 May 2013 23:24:07 +0200 Subject: [Numpy-discussion] tests not running In-Reply-To: References: Message-ID: On Fri, May 10, 2013 at 9:41 PM, KACVINSKY Tom wrote: > Here is my set up: > > Mac OS 10.7.5 > Xcode 4.5.1 > Intel Fortran 12.1 > Python 2.7.3 built from source > Numpy 1.6.2 built from source, using MKL 11.0 > nose 0.11.4 installed > > I run the numpy tests as documented (python -c 'import numpy; > numpy.test()'), but get this output: > > tkacvins at macomsim> python -c 'import numpy; numpy.test()' > Running unit tests for numpy > NumPy version 1.6.2 > NumPy is installed in > /rd/gen/tky/do_not_delete/Python/macpython27/lib/python2.7/site-packages/numpy > Python version 2.7.3 (default, Oct 10 2012, 14:47:52) [GCC 4.2.1 (Based on > Apple Inc. build 5658) (LLVM build 2336.9.00)] > nose version 0.11.4 > > ---------------------------------------------------------------------- > Ran 0 tests in 0.025s > > OK > > Any diagnostics or options I can pass to the tests to see what is going > on? This is rather odd, I thought the tests would run and possibly fail. > I didn't expect 0 tests to run. > That's usually due to the executable bit being set on all files during install (setuptools does this, nose disapproves). 
Try `` np.test(extra_argv='--exe')``. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Fri May 10 19:36:00 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 10 May 2013 19:36 -0400 Subject: [Numpy-discussion] 0-dim arrays inconsistency Message-ID: np.array ((0,0)) Out[10]: array([0, 0]) <<< ok, it's 2 dimensional In [11]: np.array ((0,0)).shape Out[11]: (2,) <<< except, it isn't From njs at pobox.com Fri May 10 19:42:57 2013 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 10 May 2013 19:42:57 -0400 Subject: [Numpy-discussion] 0-dim arrays inconsistency In-Reply-To: References: Message-ID: Hi Neal, On Fri, May 10, 2013 at 7:36 PM, Neal Becker wrote: > np.array ((0,0)) > Out[10]: array([0, 0]) <<< ok, it's 2 dimensional Think you may have confused yourself :-). It's 1 dimensional with 2 elements... > In [11]: np.array ((0,0)).shape > Out[11]: (2,) <<< except, it isn't ...as per above. (Not sure where the 0-dim part comes in.) -n From ndbecker2 at gmail.com Fri May 10 19:40:29 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 10 May 2013 19:40:29 -0400 Subject: [Numpy-discussion] 0-dim arrays inconsistency References: Message-ID: Neal Becker wrote: > np.array ((0,0)) > Out[10]: array([0, 0]) <<< ok, it's 2 dimensional > > In [11]: np.array ((0,0)).shape > Out[11]: (2,) <<< except, it isn't Sorry for the stupid question - please ignore From ndbecker2 at gmail.com Fri May 10 19:57:21 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 10 May 2013 19:57:21 -0400 Subject: [Numpy-discussion] somewhat less stupid problem with 0-d arrays Message-ID: It would be convenient if in arithmetic 0-d arrays were just ignored - it would seem to me to be convenient in generic code where a degenerate array is treated as "nothing" np.zeros ((0,0)) + np.ones ((2,2)) --------------------------------------------------------------------------- ValueError Traceback (most recent call last) in () ----> 1 np.zeros ((0,0)) + np.ones ((2,2)) ValueError: operands could not be broadcast together with shapes (0,0) (2,2) From sudheer.joseph at yahoo.com Sat May 11 03:53:45 2013 From: sudheer.joseph at yahoo.com (Sudheer Joseph) Date: Sat, 11 May 2013 15:53:45 +0800 (SGT) Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <1368187147.12442.5.camel@farnsworth> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> <1368187147.12442.5.camel@farnsworth> Message-ID: <1368258825.43089.YahooMailNeo@web193404.mail.sg3.yahoo.com> Thank you very much, ????????????????????? These tips with some effort from me should do it for me. with best regards, Sudheer ? *************************************************************** Sudheer Joseph Indian National Centre for Ocean Information Services Ministry of Earth Sciences, Govt. of India POST BOX NO: 21, IDA Jeedeemetla P.O. 
Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55 Tel:+91-40-23886047(O),Fax:+91-40-23895011(O), Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile) E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com Web- http://oppamthadathil.tripod.com *************************************************************** ----- Original Message ----- > From: Henry Gomersall > To: Discussion of Numerical Python > Cc: > Sent: Friday, 10 May 2013 5:29 PM > Subject: Re: [Numpy-discussion] printing array in tabular form > > On Fri, 2013-05-10 at 17:14 +0800, Sudheer Joseph wrote: >> Thank you, >> ? ? ? ? ? ? ? But I was looking for? a format statement likw >> write(*,"(A,5F8.3)") >> with best regards, >> Sudheer > > How about the following: > > print('IL = ' + (('%d,' * 5)[:-1] + '\n? ? ') * 5 % > tuple(IL)) > > If instead of a list IL you had some arbitrary 2D array, a, you could do > (with 2 lines for clarity): > > print_string = 'a? = ' + (('%d,' * a.shape[1])[:-1] + > '\n? ? ') * > ? ? ? ? ? ? ? ? ? ? ? ? ? a.shape[0] % tuple(a.T.ravel()) > print(print_string) > > I'll leave it as an exercise for you to put that into a file. > > hen > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From sudheer.joseph at yahoo.com Sat May 11 03:55:01 2013 From: sudheer.joseph at yahoo.com (Sudheer Joseph) Date: Sat, 11 May 2013 15:55:01 +0800 (SGT) Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <6C23BD92-9FFF-4E13-8FD1-8B194707C057@astro.physik.uni-goettingen.de> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> <518CC116.2080307@grinta.net> <1368184807.43447.YahooMailNeo@web193405.mail.sg3.yahoo.com> <6C23BD92-9FFF-4E13-8FD1-8B194707C057@astro.physik.uni-goettingen.de> Message-ID: <1368258901.19204.YahooMailNeo@web193404.mail.sg3.yahoo.com> Thank you Very much, ???????????????????????? This will help me a lot. with best regards, Sudheer ?----- Original Message ----- > From: Derek Homeier > To: Discussion of Numerical Python > Cc: > Sent: Friday, 10 May 2013 6:10 PM > Subject: Re: [Numpy-discussion] printing array in tabular form > > On 10.05.2013, at 1:20PM, Sudheer Joseph wrote: > >> If some one has a quick way I would like to learn from them or get a > referecence >> where the formatting part is described which was >> my intention while posting here. As I have been using fortran I just tried >> to use it to explain my requirement >> > Admittedly the formatting options in Python can be confusing to beginners, > precisely > since they are much more powerful than for many other languages. As already > pointed > out, formats of the type '(5i5)' are very common to Fortran programs and > thus readily > supported by the language. np.savetxt is just a convenience function to support > a number > of similarly common output types, and it can create csv, tab-separated, or > plenty of other > outputs from a numpy array just out of the box. 
> But you added to the confusion as you did not make it clear that you were not > just requiring > a plain csv file as your Fortran example would create (and the first version did > not even > have the commas); since this is a rather non-standard form you will just have to > write a > short loop yourself, wether you are using Fortran or Python. > >> ? ? ? ? ? ? ? ? ? ? ? ? ? Infact the program which should read this file > requires it in specified format which should look like >> IL = 1,2,3,4,5 >> ? ? ? 1,2,3,4,5 >> ? ? ? 1,2,3,4,5 >> > The formats are all documented > http://docs.python.org/2/library/string.html#format-specification-mini-language > one important thing to know is that you can pretty much add (i.e. concatenate) > them like strings: > > print(("%6s"+4*"%d,"+"%d\n") % (("IL = > ",)+tuple(IL[:5]))) > > or, perhaps a bit clearer: > > fmt = "%6s"+4*"%d,"+"%d\n" > print_t = ("IL = ",)+tuple(IL[:5]) > print(fmt % print_t) > > The other important bit to keep in mind is that all arguments have to be passed > as tuples. > This should allow you to write a loop to print with a "header" or an > empty header column > for the subsequent lines as you see fit. > Except for the string field which is explicitly formatted "%s" here, > this is mostly equivalent > to the example Henry just posted. > > HTH, > ??? ??? ??? ??? ??? Derek > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From sebastian at sipsolutions.net Sat May 11 05:22:28 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Sat, 11 May 2013 11:22:28 +0200 Subject: [Numpy-discussion] somewhat less stupid problem with 0-d arrays In-Reply-To: References: Message-ID: <1368264148.2567.19.camel@sebastian-laptop> On Fri, 2013-05-10 at 19:57 -0400, Neal Becker wrote: > It would be convenient if in arithmetic 0-d arrays were just ignored - it would > seem to me to be convenient in generic code where a degenerate array is treated > as "nothing" > Small naming detail. A 0-d array is an array with exactly one element and no dimensions, i.e. np.array(0), and behaves mostly like a scalar. What you have is an empty array with no elements. > np.zeros ((0,0)) + np.ones ((2,2)) > --------------------------------------------------------------------------- > ValueError Traceback (most recent call last) > in () > ----> 1 np.zeros ((0,0)) + np.ones ((2,2)) > > ValueError: operands could not be broadcast together with shapes (0,0) (2,2) > > I am not sure in what general code you need that, it seems weird to me, since np.zeros((N, N)) + np.ones((2,2)) would also only work if N=1. And if N=1, it looks like it might be a reduction result. Empty arrays *do* support most reductions (making them not empty, like summing them gives 0). And they do broadcast under the normal broadcasting rules, such that np.zeros((0,0)) + np.zeros((10,1,1)) gives np.zeros((10,0,0)). For the most part, they are not a special case and just work the same as non-empty arrays, which seems right to me. 
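To make that concrete, a quick sketch of both behaviours (reductions and broadcasting with empty arrays):

import numpy as np

e = np.zeros((0, 0))
print(e.sum())                            # 0.0: reducing an empty array works
print((e + np.zeros((10, 1, 1))).shape)   # (10, 0, 0): ordinary broadcasting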
- Sebastian > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From sudheer.joseph at yahoo.com Sat May 11 07:31:43 2013 From: sudheer.joseph at yahoo.com (Sudheer Joseph) Date: Sat, 11 May 2013 19:31:43 +0800 (SGT) Subject: [Numpy-discussion] printing array in tabular form In-Reply-To: <518CED6B.8060105@grinta.net> References: <1367918086.35474.YahooMailNeo@web193404.mail.sg3.yahoo.com> <086DEFA5-2C1B-41F2-934F-4EDBE249C3D3@astro.physik.uni-goettingen.de> <1367979205.1211.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1367996025.32622.4.camel@farnsworth> <1368086808.92617.YahooMailNeo@web193403.mail.sg3.yahoo.com> <1368177277.55926.YahooMailNeo@web193405.mail.sg3.yahoo.com> <518CC116.2080307@grinta.net> <1368184807.43447.YahooMailNeo@web193405.mail.sg3.yahoo.com> <518CED6B.8060105@grinta.net> Message-ID: <1368271903.11263.YahooMailNeo@web193405.mail.sg3.yahoo.com> Thank youNicolodi, ??????????????????? I hope I did not hurt you by my little fight.!! with best regards, Sudheer ? ----- Original Message ----- > From: Daniele Nicolodi > To: numpy-discussion at scipy.org > Cc: > Sent: Friday, 10 May 2013 6:21 PM > Subject: Re: [Numpy-discussion] printing array in tabular form > > On 10/05/2013 13:20, Sudheer Joseph wrote: > >> Hi, >> I am trying to learn Python after feeling its utility in coding and >> also reading a bit aboutits potential only, please do not put words >> in to my mouth like below. > > I didn't put words in your mouth, I simply quoted emails you sent to the > list and gave my interpretation of what you wrote. > >>>> Before denigrating a programming language >> >> If some one has a quick way I would like to learn from them or get a > referecence >> where the formatting part is described which was >> my intention while posting here. As I have been using fortran I just tried >> to use it to explain my requirement > > For references about string formatting in Python: > > http://docs.python.org/2/library/string.html#formatstrings > http://docs.python.org/2/library/stdtypes.html#string-formatting > > for the numpy array to text formatting: > > http://docs.scipy.org/doc/numpy/reference/generated/numpy.savetxt.html > > writing a function to do what you ask is trivial. Unfortunately there is > no "format the thing as I wish" function. > > If you wish to format numpy arrays preceding them with a variable name, > the following is a possible solution that gives the same formatting as > in your example: > > import numpy as np > import sys > > def format(out, v, name): > ? ? header = "{} = ".format(name) > ? ? out.write(header) > ? ? np.savetxt(out, v, fmt="%d", delimiter=", ", > ? ? ? ? ? ? ? newline="\n" + " " * len(header)) > ? ? 
out.write("\n") > > IL = np.array([range(5), ] * 5) > format(sys.stdout, IL, "IL") > > > Cheers, > Daniele > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From ndbecker2 at gmail.com Sat May 11 08:30:21 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Sat, 11 May 2013 08:30:21 -0400 Subject: [Numpy-discussion] somewhat less stupid problem with 0-d arrays References: <1368264148.2567.19.camel@sebastian-laptop> Message-ID: Sebastian Berg wrote: > On Fri, 2013-05-10 at 19:57 -0400, Neal Becker wrote: >> It would be convenient if in arithmetic 0-d arrays were just ignored - it >> would seem to me to be convenient in generic code where a degenerate array is >> treated as "nothing" >> > > Small naming detail. A 0-d array is an array with exactly one element > and no dimensions, i.e. np.array(0), and behaves mostly like a scalar. > What you have is an empty array with no elements. > >> np.zeros ((0,0)) + np.ones ((2,2)) >> --------------------------------------------------------------------------- >> ValueError Traceback (most recent call last) >> in () >> ----> 1 np.zeros ((0,0)) + np.ones ((2,2)) >> >> ValueError: operands could not be broadcast together with shapes (0,0) (2,2) >> >> > > I am not sure in what general code you need that, it seems weird to me, > since np.zeros((N, N)) + np.ones((2,2)) would also only work if N=1. And > if N=1, it looks like it might be a reduction result. > Empty arrays *do* support most reductions (making them not empty, like > summing them gives 0). And they do broadcast under the normal > broadcasting rules, such that np.zeros((0,0)) + np.zeros((10,1,1)) gives > np.zeros((10,0,0)). For the most part, they are not a special case and > just work the same as non-empty arrays, which seems right to me. > > - Sebastian > OK, my code looks like this: results = np.dot (a, b) + np.dot (c, d) I have a case where I want to basically "turn off" that second dot product, and I thought if c and d were 0-size it should have that effect. From sebastian at sipsolutions.net Sat May 11 09:17:18 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Sat, 11 May 2013 15:17:18 +0200 Subject: [Numpy-discussion] somewhat less stupid problem with 0-d arrays In-Reply-To: References: <1368264148.2567.19.camel@sebastian-laptop> Message-ID: <1368278238.2567.31.camel@sebastian-laptop> On Sat, 2013-05-11 at 08:30 -0400, Neal Becker wrote: > Sebastian Berg wrote: > > > On Fri, 2013-05-10 at 19:57 -0400, Neal Becker wrote: > >> It would be convenient if in arithmetic 0-d arrays were just ignored - it > >> would seem to me to be convenient in generic code where a degenerate array is > >> treated as "nothing" > >> > > > > Small naming detail. A 0-d array is an array with exactly one element > > and no dimensions, i.e. np.array(0), and behaves mostly like a scalar. > > What you have is an empty array with no elements. > > > >> np.zeros ((0,0)) + np.ones ((2,2)) > >> --------------------------------------------------------------------------- > >> ValueError Traceback (most recent call last) > >> in () > >> ----> 1 np.zeros ((0,0)) + np.ones ((2,2)) > >> > >> ValueError: operands could not be broadcast together with shapes (0,0) (2,2) > >> > >> > > > > I am not sure in what general code you need that, it seems weird to me, > > since np.zeros((N, N)) + np.ones((2,2)) would also only work if N=1. And > > if N=1, it looks like it might be a reduction result. 
> > Empty arrays *do* support most reductions (making them not empty, like > > summing them gives 0). And they do broadcast under the normal > > broadcasting rules, such that np.zeros((0,0)) + np.zeros((10,1,1)) gives > > np.zeros((10,0,0)). For the most part, they are not a special case and > > just work the same as non-empty arrays, which seems right to me. > > > > - Sebastian > > > > OK, my code looks like this: > > results = np.dot (a, b) + np.dot (c, d) > > I have a case where I want to basically "turn off" that second dot product, and > I thought if c and d were 0-size it should have that effect. > OK, I wouldn't consider that a valid use case to be honest. You could maybe just set c = d = 0. Heck, you can even have c = np.empty((2, 0)), d = np.empty((0, 2)) which means that np.dot(c, d) is np.zeros((2,2)), which for the sake of addition works the same. But 0-size arrays have way to much meaning to make them identity operators. - Sebastian > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From robert.kern at gmail.com Sat May 11 10:00:31 2013 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 11 May 2013 15:00:31 +0100 Subject: [Numpy-discussion] somewhat less stupid problem with 0-d arrays In-Reply-To: References: Message-ID: On Sat, May 11, 2013 at 12:57 AM, Neal Becker wrote: > It would be convenient if in arithmetic 0-d arrays were just ignored - it would > seem to me to be convenient in generic code where a degenerate array is treated > as "nothing" > > np.zeros ((0,0)) + np.ones ((2,2)) > --------------------------------------------------------------------------- > ValueError Traceback (most recent call last) > in () > ----> 1 np.zeros ((0,0)) + np.ones ((2,2)) > > ValueError: operands could not be broadcast together with shapes (0,0) (2,2) What would be the result of the following? np.zeros((0,0)) - np.ones((2,2)) Would it have positive or negative values? I don't think we can properly interpret that expression, so I think that the current behavior is sound. -- Robert Kern From sebastian at sipsolutions.net Sat May 11 11:41:25 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Sat, 11 May 2013 17:41:25 +0200 Subject: [Numpy-discussion] slight MapIter change Message-ID: <1368286885.2567.34.camel@sebastian-laptop> Hey, (this is only interesting if you know what MapIter and actually use it) In case anyone already uses the newly exposed mapiter (it was never released yet). There is a tiny change, which only affects indexes that start with np.newaxis but otherwise just simplifies a tiny bit. The old block for swapping axes should be changed like this: if ((mit->subspace != NULL) && (mit->consec)) { - if (mit->iteraxes[0] > 0) { - PyArray_MapIterSwapAxes(mit, (PyArrayObject **)&arr, 0); - if (arr == NULL) { - return -1; - } + PyArray_MapIterSwapAxes(mit, (PyArrayObject **)&arr, 0); + if (arr == NULL) { + return -1; } } Regards, Sebastian From thomas.robitaille at gmail.com Sun May 12 03:59:22 2013 From: thomas.robitaille at gmail.com (Thomas Robitaille) Date: Sun, 12 May 2013 08:59:22 +0100 Subject: [Numpy-discussion] __array_priority__ don't work for gt, lt, ... 
operator In-Reply-To: References: Message-ID: I've also been having issues with __array_priority__ - the following code behaves differently for __mul__ and __rmul__: """ import numpy as np class TestClass(object): def __init__(self, input_array): self.array = input_array def __mul__(self, other): print "Called __mul__" def __rmul__(self, other): print "Called __rmul__" def __array_wrap__(self, out_arr, context=None): print "Called __array_wrap__" return TestClass(out_arr) def __array__(self): print "Called __array__" return np.array(self.array) """ with output: """ In [7]: a = TestClass([1,2,3]) In [8]: print type(np.array([1,2,3]) * a) Called __array__ Called __array_wrap__ In [9]: print type(a * np.array([1,2,3])) Called __mul__ """ Is this also an oversight? I opened a ticket for it a little while ago: https://github.com/numpy/numpy/issues/3164 Any ideas? Cheers, Tom On 10 May 2013 18:34, Charles R Harris wrote: > > > On Fri, May 10, 2013 at 10:08 AM, Fr?d?ric Bastien wrote: >> >> Hi, >> >> it popped again on the Theano mailing list that this don't work: >> >> np.arange(10) <= a_theano_vector. >> >> The reason is that __array_priority__ isn't respected for that class of >> operation. >> >> This page explain the problem and give a work around: >> >> >> http://stackoverflow.com/questions/14619449/how-can-i-override-comparisons-between-numpys-ndarray-and-my-type >> >> The work around is to make a python function that will decide witch >> version of the comparator to call and do the call. Then we tell NumPy to use >> that function instead of its current function with: np.set_numeric_ops(...) >> >> But if we do that, when we import theano, we will slow down all normal >> numpy comparison for the user, as when <= is execute, first there will be >> numpy c code executed, that will call the python function to decide witch >> version to do, then if it is 2 numpy ndarray, it will call again numpy c >> code. >> >> That isn't a good solution. We could do the same override in C, but then >> theano work the same when there isn't a c++ compiler. That isn't nice. >> >> What do you think of changing them to check for __array_priority__ before >> doing the comparison? > > > This looks like an oversight and should be fixed. > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From nouiz at nouiz.org Mon May 13 09:58:07 2013 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Mon, 13 May 2013 09:58:07 -0400 Subject: [Numpy-discussion] slight MapIter change In-Reply-To: <1368286885.2567.34.camel@sebastian-laptop> References: <1368286885.2567.34.camel@sebastian-laptop> Message-ID: Hi, this is used in Theano. What is the consequence of not doing this? There is people that use it, the question is how many. Is there a way to detect witch version need to be used? thanks Fred On Sat, May 11, 2013 at 11:41 AM, Sebastian Berg wrote: > Hey, > > (this is only interesting if you know what MapIter and actually use it) > > In case anyone already uses the newly exposed mapiter (it was never > released yet). There is a tiny change, which only affects indexes that > start with np.newaxis but otherwise just simplifies a tiny bit. 
The old > block for swapping axes should be changed like this: > > if ((mit->subspace != NULL) && (mit->consec)) { > - if (mit->iteraxes[0] > 0) { > - PyArray_MapIterSwapAxes(mit, (PyArrayObject **)&arr, 0); > - if (arr == NULL) { > - return -1; > - } > + PyArray_MapIterSwapAxes(mit, (PyArrayObject **)&arr, 0); > + if (arr == NULL) { > + return -1; > } > } > > Regards, > > Sebastian > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Mon May 13 10:19:07 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 13 May 2013 16:19:07 +0200 Subject: [Numpy-discussion] slight MapIter change In-Reply-To: References: <1368286885.2567.34.camel@sebastian-laptop> Message-ID: <1368454747.2035.15.camel@sebastian-laptop> On Mon, 2013-05-13 at 09:58 -0400, Fr?d?ric Bastien wrote: > Hi, > > this is used in Theano. What is the consequence of not doing this? > There is people that use it, the question is how many. > There are no consequences. Only if you use the equivalent to `array[np.newaxis, fancy_index, ]` the result would be wrong. But all code that used to work will continue to work, since such an index was not legal before anyway. > Is there a way to detect witch version need to be used? > There is no released version of numpy with the other definition. Also changing it should be safe even for someone who has an older NumPy development version, because as far as I can tell the check is only an optimization in the first place. So just remove that check and you are good. And even if someone uses a new numpy with an old thaeno development version, they would have to do weird things to run into problems. - Sebastian > thanks > > Fred > > > > On Sat, May 11, 2013 at 11:41 AM, Sebastian Berg > wrote: > Hey, > > (this is only interesting if you know what MapIter and > actually use it) > > In case anyone already uses the newly exposed mapiter (it was > never > released yet). There is a tiny change, which only affects > indexes that > start with np.newaxis but otherwise just simplifies a tiny > bit. The old > block for swapping axes should be changed like this: > > if ((mit->subspace != NULL) && (mit->consec)) { > - if (mit->iteraxes[0] > 0) { > - PyArray_MapIterSwapAxes(mit, (PyArrayObject > **)&arr, 0); > - if (arr == NULL) { > - return -1; > - } > + PyArray_MapIterSwapAxes(mit, (PyArrayObject **)&arr, > 0); > + if (arr == NULL) { > + return -1; > } > } > > Regards, > > Sebastian > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From nouiz at nouiz.org Mon May 13 13:10:31 2013 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Mon, 13 May 2013 13:10:31 -0400 Subject: [Numpy-discussion] slight MapIter change In-Reply-To: <1368454747.2035.15.camel@sebastian-laptop> References: <1368286885.2567.34.camel@sebastian-laptop> <1368454747.2035.15.camel@sebastian-laptop> Message-ID: ok, thanks for the associated PR. 
Fred On Mon, May 13, 2013 at 10:19 AM, Sebastian Berg wrote: > On Mon, 2013-05-13 at 09:58 -0400, Fr?d?ric Bastien wrote: > > Hi, > > > > this is used in Theano. What is the consequence of not doing this? > > There is people that use it, the question is how many. > > > There are no consequences. Only if you use the equivalent to > `array[np.newaxis, fancy_index, ]` the result would be > wrong. But all code that used to work will continue to work, since such > an index was not legal before anyway. > > > Is there a way to detect witch version need to be used? > > > There is no released version of numpy with the other definition. Also > changing it should be safe even for someone who has an older NumPy > development version, because as far as I can tell the check is only an > optimization in the first place. > > So just remove that check and you are good. And even if someone uses a > new numpy with an old thaeno development version, they would have to do > weird things to run into problems. > > - Sebastian > > > > thanks > > > > Fred > > > > > > > > On Sat, May 11, 2013 at 11:41 AM, Sebastian Berg > > wrote: > > Hey, > > > > (this is only interesting if you know what MapIter and > > actually use it) > > > > In case anyone already uses the newly exposed mapiter (it was > > never > > released yet). There is a tiny change, which only affects > > indexes that > > start with np.newaxis but otherwise just simplifies a tiny > > bit. The old > > block for swapping axes should be changed like this: > > > > if ((mit->subspace != NULL) && (mit->consec)) { > > - if (mit->iteraxes[0] > 0) { > > - PyArray_MapIterSwapAxes(mit, (PyArrayObject > > **)&arr, 0); > > - if (arr == NULL) { > > - return -1; > > - } > > + PyArray_MapIterSwapAxes(mit, (PyArrayObject **)&arr, > > 0); > > + if (arr == NULL) { > > + return -1; > > } > > } > > > > Regards, > > > > Sebastian > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From evgeny.toder at jpmorgan.com Tue May 14 13:26:13 2013 From: evgeny.toder at jpmorgan.com (Toder, Evgeny) Date: Tue, 14 May 2013 17:26:13 +0000 Subject: [Numpy-discussion] Integer overflow in test_einsum (1.7.1) Message-ID: Hello, One of the test cases in test_einsum causes integer overflow for i2 type. The test goes like this: >>> import numpy as np >>> dtype = 'i2' >>> n = 15 >>> a = np.arange(4*n, dtype=dtype).reshape(4,n) >>> b = np.arange(n*6, dtype=dtype).reshape(n,6) >>> c = np.arange(24, dtype=dtype).reshape(4,6) It then calculates AxB using einsum. 
The problem is that the values in the last row of the result do not fit into i2:

>>> np.einsum("ij,jk", a, b, dtype='f8', casting='unsafe')
array([[  6090.,   6195.,   6300.,   6405.,   6510.,   6615.],
       [ 15540.,  15870.,  16200.,  16530.,  16860.,  17190.],
       [ 24990.,  25545.,  26100.,  26655.,  27210.,  27765.],
       [ 34440.,  35220.,  36000.,  36780.,  37560.,  38340.]])

In my build this produces different results depending on whether out or
.astype is used:

>>> np.einsum("ij,jk", a, b, dtype='f8', casting='unsafe').astype(dtype)
array([[  6090,   6195,   6300,   6405,   6510,   6615],
       [ 15540,  15870,  16200,  16530,  16860,  17190],
       [ 24990,  25545,  26100,  26655,  27210,  27765],
       [-31096, -30316, -29536, -28756, -27976, -27196]], dtype=int16)

>>> np.einsum("ij,jk", a, b, out=c, dtype='f8', casting='unsafe')
array([[  6090,   6195,   6300,   6405,   6510,   6615],
       [ 15540,  15870,  16200,  16530,  16860,  17190],
       [ 24990,  25545,  26100,  26655,  27210,  27765],
       [-32768, -32768, -32768, -32768, -32768, -32768]], dtype=int16)

The test wants these (actually the same using numpy.dot) to be equal, so this
difference causes it to fail. Both ways to handle overflow seem reasonable to me.

Does numpy in general assign a defined behavior to integer overflow (e.g.
two's complement)?
Is this use of integer overflow in the test intentional and is expected to
work, or is my build broken?

Best regards,
Eugene

This email is confidential and subject to important disclaimers and conditions
including on offers for the purchase or sale of securities, accuracy and
completeness of information, viruses, confidentiality, legal privilege, and
legal entity disclaimers, available at
http://www.jpmorgan.com/pages/disclosures/email.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From charlesr.harris at gmail.com  Tue May 14 14:25:14 2013
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 14 May 2013 12:25:14 -0600
Subject: [Numpy-discussion] Integer overflow in test_einsum (1.7.1)
In-Reply-To: 
References: 
Message-ID: 

On Tue, May 14, 2013 at 11:26 AM, Toder, Evgeny wrote:

> Hello,
>
> One of the test cases in test_einsum causes integer overflow for i2 type.
> The test goes like this:
>
> >>> import numpy as np
> >>> dtype = 'i2'
> >>> n = 15
> >>> a = np.arange(4*n, dtype=dtype).reshape(4,n)
> >>> b = np.arange(n*6, dtype=dtype).reshape(n,6)
> >>> c = np.arange(24, dtype=dtype).reshape(4,6)
>
> It then calculates AxB using einsum. The problem is that the values in the
> last row of the result do not fit into i2:
>
> >>> np.einsum("ij,jk", a, b, dtype='f8', casting='unsafe')
> array([[  6090.,   6195.,   6300.,   6405.,   6510.,   6615.],
>        [ 15540.,  15870.,  16200.,  16530.,  16860.,  17190.],
>        [ 24990.,  25545.,  26100.,  26655.,  27210.,  27765.],
>        [ 34440.,  35220.,  36000.,  36780.,  37560.,  38340.]])
>
> In my build this produces different results depending on whether out or
> .astype is used:
>
> >>> np.einsum("ij,jk", a, b, dtype='f8', casting='unsafe').astype(dtype)
> array([[  6090,   6195,   6300,   6405,   6510,   6615],
>        [ 15540,  15870,  16200,  16530,  16860,  17190],
>        [ 24990,  25545,  26100,  26655,  27210,  27765],
>        [-31096, -30316, -29536, -28756, -27976, -27196]], dtype=int16)
>
> >>> np.einsum("ij,jk", a, b, out=c, dtype='f8', casting='unsafe')
> array([[  6090,   6195,   6300,   6405,   6510,   6615],
>        [ 15540,  15870,  16200,  16530,  16860,  17190],
>        [ 24990,  25545,  26100,  26655,  27210,  27765],
>        [-32768, -32768, -32768, -32768, -32768, -32768]], dtype=int16)
>
> The test wants these (actually the same using numpy.dot) to be equal, so
> this difference causes it to fail. Both ways to handle overflow seem
> reasonable to me.
>
> Does numpy in general assign a defined behavior to integer overflow (e.g.
> two's complement)?
> Is this use of integer overflow in the test intentional and is expected to
> work, or is my build broken?
>

I don't know if it was intended in the test, but numpy doesn't guard
against integer overflow. Rather, it takes the C approach and does modular
arithmetic. I don't know how to fix that or even if it could be at this
point given the historical lack of support. I suppose at some point there
could be user type that made such checks, but it would be slower.
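To make the modular behavior concrete, here is a small illustration (output
is what a typical build produces; as Pauli notes further down, the C standard
does not actually guarantee wrapping for signed types):

import numpy as np

a = np.array([30000, -30000], dtype=np.int16)
print(a + a)    # [-5536  5536]: 60000 and -60000 wrap modulo 2**16

Scalar operations such as np.int16(30000) + np.int16(30000) typically emit a
RuntimeWarning on overflow, while array operations like the one above wrap
silently.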
Chuck This email is confidential and subject to important disclaimers and conditions including on offers for the purchase or sale of securities, accuracy and completeness of information, viruses, confidentiality, legal privilege, and legal entity disclaimers, available at http://www.jpmorgan.com/pages/disclosures/email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Tue May 14 15:31:13 2013 From: pav at iki.fi (Pauli Virtanen) Date: Tue, 14 May 2013 22:31:13 +0300 Subject: [Numpy-discussion] Integer overflow in test_einsum (1.7.1) In-Reply-To: References: Message-ID: 14.05.2013 21:52, Toder, Evgeny kirjoitti: > So, does numpy implement modular (two?s complement) > arithmetic for signed types [clip] Numpy leaves integer arithmetic to the C compiler. Python and its C modules however by default specify -fwrapv for gcc and its ilk, so the behavior might seem consistent on these platforms. -- Pauli Virtanen From evgeny.toder at jpmorgan.com Tue May 14 15:53:24 2013 From: evgeny.toder at jpmorgan.com (Toder, Evgeny) Date: Tue, 14 May 2013 19:53:24 +0000 Subject: [Numpy-discussion] Integer overflow in test_einsum (1.7.1) In-Reply-To: References: Message-ID: Thank you, Pauli. I guess this means that the test needs to be fixed? Eugene -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Pauli Virtanen Sent: Tuesday, May 14, 2013 3:31 PM To: numpy-discussion at scipy.org Subject: Re: [Numpy-discussion] Integer overflow in test_einsum (1.7.1) 14.05.2013 21:52, Toder, Evgeny kirjoitti: > So, does numpy implement modular (two's complement) > arithmetic for signed types [clip] Numpy leaves integer arithmetic to the C compiler. Python and its C modules however by default specify -fwrapv for gcc and its ilk, so the behavior might seem consistent on these platforms. -- Pauli Virtanen _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion This email is confidential and subject to important disclaimers and conditions including on offers for the purchase or sale of securities, accuracy and completeness of information, viruses, confidentiality, legal privilege, and legal entity disclaimers, available at http://www.jpmorgan.com/pages/disclosures/email. From nouiz at nouiz.org Wed May 15 13:56:27 2013 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Wed, 15 May 2013 13:56:27 -0400 Subject: [Numpy-discussion] __array_priority__ don't work for gt, lt, ... operator In-Reply-To: References: Message-ID: This is a different issue then mine. Mine is that array_priority is not implemented for comparison. Your is that array_priority isn't used when __array__ is defined. Maybe you can make a new mailing list thread? Issue get more attention when there is an associated email. I looked rapidly in numpy code, but didn't find the problem. So you will need to find someone with more time/knowledge about this to look at it or take a look yourself. 
HTH Fred On Sun, May 12, 2013 at 3:59 AM, Thomas Robitaille < thomas.robitaille at gmail.com> wrote: > I've also been having issues with __array_priority__ - the following > code behaves differently for __mul__ and __rmul__: > > """ > import numpy as np > > > class TestClass(object): > > def __init__(self, input_array): > self.array = input_array > > def __mul__(self, other): > print "Called __mul__" > > def __rmul__(self, other): > print "Called __rmul__" > > def __array_wrap__(self, out_arr, context=None): > print "Called __array_wrap__" > return TestClass(out_arr) > > def __array__(self): > print "Called __array__" > return np.array(self.array) > """ > > with output: > > """ > In [7]: a = TestClass([1,2,3]) > > In [8]: print type(np.array([1,2,3]) * a) > Called __array__ > Called __array_wrap__ > > > In [9]: print type(a * np.array([1,2,3])) > Called __mul__ > > """ > > Is this also an oversight? I opened a ticket for it a little while ago: > > https://github.com/numpy/numpy/issues/3164 > > Any ideas? > > Cheers, > Tom > > > On 10 May 2013 18:34, Charles R Harris wrote: > > > > > > On Fri, May 10, 2013 at 10:08 AM, Fr?d?ric Bastien > wrote: > >> > >> Hi, > >> > >> it popped again on the Theano mailing list that this don't work: > >> > >> np.arange(10) <= a_theano_vector. > >> > >> The reason is that __array_priority__ isn't respected for that class of > >> operation. > >> > >> This page explain the problem and give a work around: > >> > >> > >> > http://stackoverflow.com/questions/14619449/how-can-i-override-comparisons-between-numpys-ndarray-and-my-type > >> > >> The work around is to make a python function that will decide witch > >> version of the comparator to call and do the call. Then we tell NumPy > to use > >> that function instead of its current function with: > np.set_numeric_ops(...) > >> > >> But if we do that, when we import theano, we will slow down all normal > >> numpy comparison for the user, as when <= is execute, first there will > be > >> numpy c code executed, that will call the python function to decide > witch > >> version to do, then if it is 2 numpy ndarray, it will call again numpy c > >> code. > >> > >> That isn't a good solution. We could do the same override in C, but then > >> theano work the same when there isn't a c++ compiler. That isn't nice. > >> > >> What do you think of changing them to check for __array_priority__ > before > >> doing the comparison? > > > > > > This looks like an oversight and should be fixed. > > > > Chuck > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From martin.raspaud at smhi.se Thu May 16 03:35:22 2013 From: martin.raspaud at smhi.se (Martin Raspaud) Date: Thu, 16 May 2013 09:35:22 +0200 Subject: [Numpy-discussion] Strange memory consumption in numpy? 
Message-ID: <51948C3A.6000404@smhi.se> Hi all, In the context of memory profiling an application (with memory_profiler module) we came up a strange behaviour in numpy, see for yourselves: Line # Mem usage Increment Line Contents ================================================ 29 @profile 30 23.832 MB 0.000 MB def main(): 31 46.730 MB 22.898 MB arr1 = np.random.rand(1000000, 3) 32 58.180 MB 11.449 MB arr1s = arr1.astype(np.float32) 33 35.289 MB -22.891 MB del arr1 34 35.289 MB 0.000 MB gc.collect() 35 58.059 MB 22.770 MB arr2 = np.random.rand(1000000, 3) 36 69.500 MB 11.441 MB arr2s = arr2.astype(np.float32) 37 69.500 MB 0.000 MB del arr2 38 69.500 MB 0.000 MB gc.collect() 39 69.500 MB 0.000 MB arr3 = np.random.rand(1000000, 3) 40 80.945 MB 11.445 MB arr3s = arr3.astype(np.float32) 41 80.945 MB 0.000 MB del arr3 42 80.945 MB 0.000 MB gc.collect() 43 80.945 MB 0.000 MB return arr1s, arr2s, arr3s The lines 31-34 are behaving as expected, but then we don't understand 35-38 (why is arr2 not garbage collected ?) and 39-42 (why doesn't the random allocate any memory ?). Can anyone give a reasonable explanation ? I attach the full script for reference. Best regards, Martin -------------- next part -------------- A non-text attachment was scrubbed... Name: testnumpymem.py Type: text/x-python Size: 1235 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: martin_raspaud.vcf Type: text/x-vcard Size: 303 bytes Desc: not available URL: From robert.kern at gmail.com Thu May 16 04:26:56 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 16 May 2013 09:26:56 +0100 Subject: [Numpy-discussion] Strange memory consumption in numpy? In-Reply-To: <51948C3A.6000404@smhi.se> References: <51948C3A.6000404@smhi.se> Message-ID: On Thu, May 16, 2013 at 8:35 AM, Martin Raspaud wrote: > Hi all, > > In the context of memory profiling an application (with memory_profiler > module) we came up a strange behaviour in numpy, see for yourselves: > > Line # Mem usage Increment Line Contents > ================================================ > 29 @profile > 30 23.832 MB 0.000 MB def main(): > 31 46.730 MB 22.898 MB arr1 = np.random.rand(1000000, 3) > 32 58.180 MB 11.449 MB arr1s = arr1.astype(np.float32) > 33 35.289 MB -22.891 MB del arr1 > 34 35.289 MB 0.000 MB gc.collect() > 35 58.059 MB 22.770 MB arr2 = np.random.rand(1000000, 3) > 36 69.500 MB 11.441 MB arr2s = arr2.astype(np.float32) > 37 69.500 MB 0.000 MB del arr2 > 38 69.500 MB 0.000 MB gc.collect() > 39 69.500 MB 0.000 MB arr3 = np.random.rand(1000000, 3) > 40 80.945 MB 11.445 MB arr3s = arr3.astype(np.float32) > 41 80.945 MB 0.000 MB del arr3 > 42 80.945 MB 0.000 MB gc.collect() > 43 80.945 MB 0.000 MB return arr1s, arr2s, arr3s > > > The lines 31-34 are behaving as expected, but then we don't understand > 35-38 (why is arr2 not garbage collected ?) and 39-42 (why doesn't the > random allocate any memory ?). > > Can anyone give a reasonable explanation ? memory_profiler only looks at the amount of memory that the OS has allocated to the Python process. It cannot measure the amount of memory actually given to living objects. Python does not always return memory back to the OS immediately when it frees the memory for an object. Your two observations are linked. Python freed the memory of arr2 immediately, but it did not return the memory to the OS, so memory_profiler could not notice it. 
When arr3 is allocated, it happened to fit into the block of memory that arr2 once owned, so Python's memory allocator just used that block again. Since Python did not have to go out to the OS to get more memory, memory_profiler could not notice that, either. -- Robert Kern From martin.raspaud at smhi.se Thu May 16 08:32:09 2013 From: martin.raspaud at smhi.se (Martin Raspaud) Date: Thu, 16 May 2013 14:32:09 +0200 Subject: [Numpy-discussion] Strange memory consumption in numpy? In-Reply-To: References: <51948C3A.6000404@smhi.se> Message-ID: <5194D1C9.1060900@smhi.se> On 16/05/13 10:26, Robert Kern wrote: >> Can anyone give a reasonable explanation ? > > memory_profiler only looks at the amount of memory that the OS has > allocated to the Python process. It cannot measure the amount of > memory actually given to living objects. Python does not always return > memory back to the OS immediately when it frees the memory for an > object. Your two observations are linked. Python freed the memory of > arr2 immediately, but it did not return the memory to the OS, so > memory_profiler could not notice it. When arr3 is allocated, it > happened to fit into the block of memory that arr2 once owned, so > Python's memory allocator just used that block again. Since Python did > not have to go out to the OS to get more memory, memory_profiler could > not notice that, either. Robert, Thanks a lot for the clear explanation, it makes perfect sense now. You're talking about living objects, but as I understand the few memory profilers I found around the web for python can't track numpy arrays. Any pointers on something that would work with numpy ? Best regards, Martin -------------- next part -------------- A non-text attachment was scrubbed... Name: martin_raspaud.vcf Type: text/x-vcard Size: 303 bytes Desc: not available URL: From robert.kern at gmail.com Thu May 16 08:44:57 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 16 May 2013 13:44:57 +0100 Subject: [Numpy-discussion] Strange memory consumption in numpy? In-Reply-To: <5194D1C9.1060900@smhi.se> References: <51948C3A.6000404@smhi.se> <5194D1C9.1060900@smhi.se> Message-ID: On Thu, May 16, 2013 at 1:32 PM, Martin Raspaud wrote: > On 16/05/13 10:26, Robert Kern wrote: > >>> Can anyone give a reasonable explanation ? >> >> memory_profiler only looks at the amount of memory that the OS has >> allocated to the Python process. It cannot measure the amount of >> memory actually given to living objects. Python does not always return >> memory back to the OS immediately when it frees the memory for an >> object. Your two observations are linked. Python freed the memory of >> arr2 immediately, but it did not return the memory to the OS, so >> memory_profiler could not notice it. When arr3 is allocated, it >> happened to fit into the block of memory that arr2 once owned, so >> Python's memory allocator just used that block again. Since Python did >> not have to go out to the OS to get more memory, memory_profiler could >> not notice that, either. > > Robert, > > Thanks a lot for the clear explanation, it makes perfect sense now. > > You're talking about living objects, but as I understand the few memory > profilers I found around the web for python can't track numpy arrays. > Any pointers on something that would work with numpy ? meliae has special support for numpy.ndarray objects. It's a little broken, in that it will double-count views, but you can provide a better specialization if you wish (look for the add_special_size() function). 
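As a quick complement to a full profiler, the block reuse described above can
be observed directly by comparing data pointers (a sketch; whether the address
actually matches is allocator-dependent, though it frequently does):

import numpy as np

def buf(a):
    # base address of the array's data buffer
    return a.__array_interface__['data'][0]

arr2 = np.random.rand(1000000, 3)
addr = buf(arr2)
del arr2
arr3 = np.random.rand(1000000, 3)
print(buf(arr3) == addr)    # often True: arr3 lands in arr2's old block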
https://launchpad.net/meliae -- Robert Kern From thomas.robitaille at gmail.com Thu May 16 09:19:51 2013 From: thomas.robitaille at gmail.com (Thomas Robitaille) Date: Thu, 16 May 2013 15:19:51 +0200 Subject: [Numpy-discussion] __array_priority__ ignored if __array__ is present Message-ID: Hi everyone, (this was posted as part of another topic, but since it was unrelated, I'm reposting as a separate thread) I've also been having issues with __array_priority__ - the following code behaves differently for __mul__ and __rmul__: """ import numpy as np class TestClass(object): def __init__(self, input_array): self.array = input_array def __mul__(self, other): print "Called __mul__" def __rmul__(self, other): print "Called __rmul__" def __array_wrap__(self, out_arr, context=None): print "Called __array_wrap__" return TestClass(out_arr) def __array__(self): print "Called __array__" return np.array(self.array) """ with output: """ In [7]: a = TestClass([1,2,3]) In [8]: print type(np.array([1,2,3]) * a) Called __array__ Called __array_wrap__ In [9]: print type(a * np.array([1,2,3])) Called __mul__ """ Is this also an oversight? I opened a ticket for it a little while ago: https://github.com/numpy/numpy/issues/3164 Any ideas? Thanks! Tom From nouiz at nouiz.org Thu May 16 09:58:12 2013 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Thu, 16 May 2013 09:58:12 -0400 Subject: [Numpy-discussion] __array_priority__ ignored if __array__ is present In-Reply-To: References: Message-ID: I looked yesterday rapidly in the code and didn't find the reason (I don't know it well, that is probably why). But last night I think of one possible cause. I found this code 2 times in the file core/src/umath/ufunc_object.c: if (nin == 2 && nout == 1 && dtypes[1]->type_num == NPY_OBJECT) { PyObject *_obj = PyTuple_GET_ITEM(args, 1); if (!PyArray_CheckExact(_obj)) { double self_prio, other_prio; self_prio = PyArray_GetPriority(PyTuple_GET_ITEM(args, 0), NPY_SCALAR_PRIORITY); other_prio = PyArray_GetPriority(_obj, NPY_SCALAR_PRIORITY); if (self_prio < other_prio && _has_reflected_op(_obj, ufunc_name)) { retval = -2; goto fail; } } } It is this code that will call _has_reflected_op() function. The if condition is: dtypes[1]->type_num == NPY_OBJECT I wouldn't be surprised if dtypes[1] isn't NPY_OBJECT when you implement __array__. dtypes is set with those line: retval = ufunc->type_resolver(ufunc, casting, op, type_tup, dtypes); HTH Fred On Thu, May 16, 2013 at 9:19 AM, Thomas Robitaille < thomas.robitaille at gmail.com> wrote: > Hi everyone, > > (this was posted as part of another topic, but since it was unrelated, > I'm reposting as a separate thread) > > I've also been having issues with __array_priority__ - the following > code behaves differently for __mul__ and __rmul__: > > """ > import numpy as np > > > class TestClass(object): > > def __init__(self, input_array): > self.array = input_array > > def __mul__(self, other): > print "Called __mul__" > > def __rmul__(self, other): > print "Called __rmul__" > > def __array_wrap__(self, out_arr, context=None): > print "Called __array_wrap__" > return TestClass(out_arr) > > def __array__(self): > print "Called __array__" > return np.array(self.array) > """ > > with output: > > """ > In [7]: a = TestClass([1,2,3]) > > In [8]: print type(np.array([1,2,3]) * a) > Called __array__ > Called __array_wrap__ > > > In [9]: print type(a * np.array([1,2,3])) > Called __mul__ > > """ > > Is this also an oversight? 
I opened a ticket for it a little while ago: > > https://github.com/numpy/numpy/issues/3164 > > Any ideas? > > Thanks! > Tom > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Thu May 16 13:42:13 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Thu, 16 May 2013 19:42:13 +0200 Subject: [Numpy-discussion] experiments with SSE vectorization Message-ID: <51951A75.3070105@googlemail.com> Hi, I have been experimenting a bit with how applicable SSE vectorization is to NumPy. In principle the core of NumPy mostly deals with memory bound operations, but it turns out on modern machines with large caches you can still get decent speed ups. The experiments are available on this fork: https://github.com/juliantaylor/numpy/tree/simd-experiments It includes a simple benchmark 'npbench.py' in the top level. No runtime detection is used, it is only enabled on amd64 systems(which always has SSE2). The simd-experiments branch vectorizes the sqrt, basic math operations and min/max reductions. For float32 operations you get speedups around 2 (simple ops) - 4 (sqrt). For double it is around 1.2 - 2, depending on the cpu. My Phenom(tm) II X4 955 retains a good speedup even for very large datasizes but on intel cpus (xeon and core2duo) you don't gain anything if the data is larger than the L3 cache. The vectorized version was never slower on phenom and xeon. But on a core2duo the normal addition with very large datasets got 10% slower. This can be compensated by using aligned load operations, but its not implemented yet. I'm interested in your results of npbench.py command on other cpus, so if you want to try it please send me the output (include /proc/cpuinfo) The code is a little rough, it can probably be cleaned up a bit by adapting the code generator used. Would this be something worth including in NumPy? Further vectorization targets on my todo list are things like std/var/mean, basically anything that has a high computation/memory ration, suggestions are welcome. Here the detailed results for my phenom: float32 datasize (2MB) operation: speedup np.float32 np.max(d) 3.04 np.float32 np.min(d) 3.1 np.float32 np.sum(d) 3.02 np.float32 np.prod(d) 3.04 np.float32 np.add(1, d) 1.44 np.float32 np.add(d, 1) 1.45 np.float32 np.divide(1, d) 3.41 np.float32 np.divide(d, 1) 3.41 np.float32 np.divide(d, d) 3.42 np.float32 np.add(d, d) 1.42 np.float32 np.multiply(d, d) 1.43 np.float32 np.sqrt(d) 4.26 float64 datasize (4MB) operation: speedup np.float64 np.max(d) 2 np.float64 np.min(d) 1.89 np.float64 np.sum(d) 1.62 np.float64 np.prod(d) 1.63 np.float64 np.add(1, d) 1.08 np.float64 np.add(d, 1) 0.993 np.float64 np.divide(1, d) 1.83 np.float64 np.divide(d, 1) 1.74 np.float64 np.divide(d, d) 1.8 np.float64 np.add(d, d) 1.02 np.float64 np.multiply(d, d) 1.05 np.float64 np.sqrt(d) 2.22 attached the results for intel cpus. -------------- next part -------------- A non-text attachment was scrubbed... 
Name: results.tar.gz Type: application/gzip Size: 8668 bytes Desc: not available URL: From ndbecker2 at gmail.com Thu May 16 14:42:07 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Thu, 16 May 2013 14:42:07 -0400 Subject: [Numpy-discussion] RuntimeWarning: divide by zero encountered in log Message-ID: Is there a way to get a traceback instead of just printing the line that triggered the error? From jjhelmus at gmail.com Thu May 16 14:47:56 2013 From: jjhelmus at gmail.com (Jonathan Helmus) Date: Thu, 16 May 2013 13:47:56 -0500 Subject: [Numpy-discussion] RuntimeWarning: divide by zero encountered in log In-Reply-To: References: Message-ID: <519529DC.8070905@gmail.com> On 05/16/2013 01:42 PM, Neal Becker wrote: > Is there a way to get a traceback instead of just printing the > line that triggered the error? > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion Neal, Look at the numpy.seterr function. You can use it to change how floating-point errors are handled, including raising a FloatingPointError with a traceback as opposed to printing a RuntimeWarning. Example $ cat foo.py import numpy as np np.seterr(divide='raise') a = np.array([1,1,1], dtype='float32') a / 0 $ python foo.py Traceback (most recent call last): File "test.py", line 6, in a / 0 FloatingPointError: divide by zero encountered in divide Cheers, - Jonathan Helmus From phillip.m.feldman at gmail.com Thu May 16 18:09:05 2013 From: phillip.m.feldman at gmail.com (Phillip Feldman) Date: Thu, 16 May 2013 15:09:05 -0700 Subject: [Numpy-discussion] numpy.nanmin, numpy.nanmax, and scipy.stats.nanmean Message-ID: It seems odd that `nanmin` and `nanmax` are in NumPy, while `nanmean` is in SciPy.stats. I'd like to propose that a `nanmean` function be added to NumPy. Phillip -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Thu May 16 21:04:15 2013 From: ben.root at ou.edu (Benjamin Root) Date: Thu, 16 May 2013 21:04:15 -0400 Subject: [Numpy-discussion] numpy.nanmin, numpy.nanmax, and scipy.stats.nanmean In-Reply-To: References: Message-ID: On Thu, May 16, 2013 at 6:09 PM, Phillip Feldman < phillip.m.feldman at gmail.com> wrote: > It seems odd that `nanmin` and `nanmax` are in NumPy, while `nanmean` is > in SciPy.stats. I'd like to propose that a `nanmean` function be added to > NumPy. > > Have no fear. There is already plans for its inclusion in the next release: https://github.com/numpy/numpy/pull/3297/files Cheers! Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From cjordan1 at uw.edu Fri May 17 00:18:55 2013 From: cjordan1 at uw.edu (Christopher Jordan-Squire) Date: Thu, 16 May 2013 21:18:55 -0700 Subject: [Numpy-discussion] experiments with SSE vectorization In-Reply-To: <51951A75.3070105@googlemail.com> References: <51951A75.3070105@googlemail.com> Message-ID: I'd been under the impression that the easiest way to get SSE support was to have numpy use an optimized blas/lapack. Is that not the case? On Thu, May 16, 2013 at 10:42 AM, Julian Taylor wrote: > Hi, > I have been experimenting a bit with how applicable SSE vectorization is > to NumPy. > In principle the core of NumPy mostly deals with memory bound > operations, but it turns out on modern machines with large caches you > can still get decent speed ups. 
> > The experiments are available on this fork: > https://github.com/juliantaylor/numpy/tree/simd-experiments > It includes a simple benchmark 'npbench.py' in the top level. > No runtime detection is used, it is only enabled on amd64 systems(which > always has SSE2). > > The simd-experiments branch vectorizes the sqrt, basic math operations > and min/max reductions. > For float32 operations you get speedups around 2 (simple ops) - 4 (sqrt). > For double it is around 1.2 - 2, depending on the cpu. > My Phenom(tm) II X4 955 retains a good speedup even for very large > datasizes but on intel cpus (xeon and core2duo) you don't gain anything > if the data is larger than the L3 cache. > The vectorized version was never slower on phenom and xeon. > But on a core2duo the normal addition with very large datasets got 10% > slower. This can be compensated by using aligned load operations, but > its not implemented yet. > I'm interested in your results of npbench.py command on other cpus, so > if you want to try it please send me the output (include /proc/cpuinfo) > > The code is a little rough, it can probably be cleaned up a bit by > adapting the code generator used. > Would this be something worth including in NumPy? > > Further vectorization targets on my todo list are things like > std/var/mean, basically anything that has a high computation/memory > ration, suggestions are welcome. > > > Here the detailed results for my phenom: > float32 datasize (2MB) > operation: speedup > np.float32 np.max(d) 3.04 > np.float32 np.min(d) 3.1 > np.float32 np.sum(d) 3.02 > np.float32 np.prod(d) 3.04 > np.float32 np.add(1, d) 1.44 > np.float32 np.add(d, 1) 1.45 > np.float32 np.divide(1, d) 3.41 > np.float32 np.divide(d, 1) 3.41 > np.float32 np.divide(d, d) 3.42 > np.float32 np.add(d, d) 1.42 > np.float32 np.multiply(d, d) 1.43 > np.float32 np.sqrt(d) 4.26 > > float64 datasize (4MB) > operation: speedup > np.float64 np.max(d) 2 > np.float64 np.min(d) 1.89 > np.float64 np.sum(d) 1.62 > np.float64 np.prod(d) 1.63 > np.float64 np.add(1, d) 1.08 > np.float64 np.add(d, 1) 0.993 > np.float64 np.divide(1, d) 1.83 > np.float64 np.divide(d, 1) 1.74 > np.float64 np.divide(d, d) 1.8 > np.float64 np.add(d, d) 1.02 > np.float64 np.multiply(d, d) 1.05 > np.float64 np.sqrt(d) 2.22 > > attached the results for intel cpus. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From njs at pobox.com Fri May 17 00:56:41 2013 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 17 May 2013 05:56:41 +0100 Subject: [Numpy-discussion] experiments with SSE vectorization In-Reply-To: References: <51951A75.3070105@googlemail.com> Message-ID: On 17 May 2013 05:19, "Christopher Jordan-Squire" wrote: > > I'd been under the impression that the easiest way to get SSE support > was to have numpy use an optimized blas/lapack. Is that not the case? Apples and oranges. That's the easiest (only) way to get SSE support for operations that go through blas/lapack, but there are also lots of operations in numpy that are implemented directly. -n -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From njs at pobox.com Fri May 17 01:06:06 2013 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 17 May 2013 06:06:06 +0100 Subject: [Numpy-discussion] RuntimeWarning: divide by zero encountered in log In-Reply-To: <519529DC.8070905@gmail.com> References: <519529DC.8070905@gmail.com> Message-ID: On 16 May 2013 19:48, "Jonathan Helmus" wrote: > > On 05/16/2013 01:42 PM, Neal Becker wrote: > > Is there a way to get a traceback instead of just printing the > > line that triggered the error? > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > Neal, > > Look at the numpy.seterr function. You can use it to change how > floating-point errors are handled, including raising a > FloatingPointError with a traceback as opposed to printing a > RuntimeWarning. > > Example > > $ cat foo.py > import numpy as np > > np.seterr(divide='raise') > > a = np.array([1,1,1], dtype='float32') > a / 0 > > $ python foo.py > Traceback (most recent call last): > File "test.py", line 6, in > a / 0 > FloatingPointError: divide by zero encountered in divide You also have the option of using Python's general ability to customize how any warning is handled - see the 'warnings' module and -W switch. If you just want a traceback printed without an exception then I think you can do that with np.seterr too (using np.seterrcall). -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Fri May 17 09:12:53 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 17 May 2013 09:12:53 -0400 Subject: [Numpy-discussion] RuntimeWarning: divide by zero encountered in log References: <519529DC.8070905@gmail.com> Message-ID: Nathaniel Smith wrote: > On 16 May 2013 19:48, "Jonathan Helmus" wrote: >> >> On 05/16/2013 01:42 PM, Neal Becker wrote: >> > Is there a way to get a traceback instead of just printing the >> > line that triggered the error? >> > >> > _______________________________________________ >> > NumPy-Discussion mailing list >> > NumPy-Discussion at scipy.org >> > http://mail.scipy.org/mailman/listinfo/numpy-discussion >> Neal, >> >> Look at the numpy.seterr function. You can use it to change how >> floating-point errors are handled, including raising a >> FloatingPointError with a traceback as opposed to printing a >> RuntimeWarning. >> >> Example >> >> $ cat foo.py >> import numpy as np >> >> np.seterr(divide='raise') >> >> a = np.array([1,1,1], dtype='float32') >> a / 0 >> >> $ python foo.py >> Traceback (most recent call last): >> File "test.py", line 6, in >> a / 0 >> FloatingPointError: divide by zero encountered in divide > > You also have the option of using Python's general ability to customize how > any warning is handled - see the 'warnings' module and -W switch. > > If you just want a traceback printed without an exception then I think you > can do that with np.seterr too (using np.seterrcall). > > -n I tried this: import traceback np.seterrcall (lambda a,b: traceback.print_stack) np.seterr (all='call') np.seterrcall (lambda a,b: traceback.print_stack) but it doesn't seem to do anything, I still see numpy warning as before. 
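The catch, as pointed out further down, is that the lambda returns
traceback.print_stack itself instead of calling it. Note also that a plain
Python expression like 1./0 raises ZeroDivisionError before numpy's error
machinery is involved at all, so it cannot exercise seterrcall. A working
sketch using a numpy operation:

import numpy as np
import traceback

def handler(kind, flag):
    # numpy invokes this with e.g. ('divide by zero', flag)
    traceback.print_stack()

np.seterrcall(handler)
np.seterr(all='call')
np.log(np.zeros(3))    # 'divide by zero encountered in log': prints a stack trace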
From ndbecker2 at gmail.com Fri May 17 09:14:35 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 17 May 2013 09:14:35 -0400 Subject: [Numpy-discussion] RuntimeWarning: divide by zero encountered in log References: <519529DC.8070905@gmail.com> Message-ID: Neal Becker wrote: > Nathaniel Smith wrote: > >> On 16 May 2013 19:48, "Jonathan Helmus" wrote: >>> >>> On 05/16/2013 01:42 PM, Neal Becker wrote: >>> > Is there a way to get a traceback instead of just printing the >>> > line that triggered the error? >>> > >>> > _______________________________________________ >>> > NumPy-Discussion mailing list >>> > NumPy-Discussion at scipy.org >>> > http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> Neal, >>> >>> Look at the numpy.seterr function. You can use it to change how >>> floating-point errors are handled, including raising a >>> FloatingPointError with a traceback as opposed to printing a >>> RuntimeWarning. >>> >>> Example >>> >>> $ cat foo.py >>> import numpy as np >>> >>> np.seterr(divide='raise') >>> >>> a = np.array([1,1,1], dtype='float32') >>> a / 0 >>> >>> $ python foo.py >>> Traceback (most recent call last): >>> File "test.py", line 6, in >>> a / 0 >>> FloatingPointError: divide by zero encountered in divide >> >> You also have the option of using Python's general ability to customize how >> any warning is handled - see the 'warnings' module and -W switch. >> >> If you just want a traceback printed without an exception then I think you >> can do that with np.seterr too (using np.seterrcall). >> >> -n > > I tried this: > > import traceback > > np.seterrcall (lambda a,b: traceback.print_stack) > np.seterr (all='call') > np.seterrcall (lambda a,b: traceback.print_stack) > > but it doesn't seem to do anything, I still see numpy warning as before. Here's a trivial test: import numpy as np import traceback np.seterrcall (lambda a,b: traceback.print_stack) np.seterr (all='call') np.seterrcall (lambda a,b: traceback.print_stack) print 1./0 From njs at pobox.com Fri May 17 09:20:04 2013 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 17 May 2013 14:20:04 +0100 Subject: [Numpy-discussion] RuntimeWarning: divide by zero encountered in log In-Reply-To: References: <519529DC.8070905@gmail.com> Message-ID: On 17 May 2013 14:17, "Neal Becker" wrote: > > Nathaniel Smith wrote: > > > On 16 May 2013 19:48, "Jonathan Helmus" wrote: > >> > >> On 05/16/2013 01:42 PM, Neal Becker wrote: > >> > Is there a way to get a traceback instead of just printing the > >> > line that triggered the error? > >> > > >> > _______________________________________________ > >> > NumPy-Discussion mailing list > >> > NumPy-Discussion at scipy.org > >> > http://mail.scipy.org/mailman/listinfo/numpy-discussion > >> Neal, > >> > >> Look at the numpy.seterr function. You can use it to change how > >> floating-point errors are handled, including raising a > >> FloatingPointError with a traceback as opposed to printing a > >> RuntimeWarning. > >> > >> Example > >> > >> $ cat foo.py > >> import numpy as np > >> > >> np.seterr(divide='raise') > >> > >> a = np.array([1,1,1], dtype='float32') > >> a / 0 > >> > >> $ python foo.py > >> Traceback (most recent call last): > >> File "test.py", line 6, in > >> a / 0 > >> FloatingPointError: divide by zero encountered in divide > > > > You also have the option of using Python's general ability to customize how > > any warning is handled - see the 'warnings' module and -W switch. 
> > > > If you just want a traceback printed without an exception then I think you > > can do that with np.seterr too (using np.seterrcall). > > > > -n > > I tried this: > > import traceback > > np.seterrcall (lambda a,b: traceback.print_stack) > np.seterr (all='call') Try adding some parens? np.seterrcall (lambda a,b: traceback.print_stack()) -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From rodrigo at rodrigobotafogo.com Fri May 17 09:20:47 2013 From: rodrigo at rodrigobotafogo.com (Rodrigo Botafogo) Date: Fri, 17 May 2013 11:20:47 -0200 Subject: [Numpy-discussion] [ANN] Multidimensional Array - MDArray (0.5.0) Message-ID: Although this is not directly connected to NumPy, I believe that it could be of interest to the NymPy community. If, by any reason it is inproper to post this type of announcement on this list, please let me know. I?m happy to announce a new version of MDArray... MDArray ======= MDArray is a multi dimensional array implemented for JRuby inspired by NumPy (www.numpy.org) and Narray (narray.rubyforge.org) by Masahiro Tanaka. MDArray stands on the shoulders of Java-NetCDF and Parallel Colt. NetCDF-Java Library is a Java interface to NetCDF files, as well as to many other types of scientific data formats. It is developed and distributed by Unidata ( http://www.unidata.ucar.edu). Parallel Colt (sites.google.com/site/piotrwendykier/software/parallelcolt) is a multithreaded version of Colt (http://acs.lbl.gov/software/colt/). Colt provides a set of Open Source Libraries for High Performance Scientific and Technical Computing in Java. Scientific and technical computing is characterized by demanding problem sizes and a need for high performance at reasonably small memory footprint. MDArray and SciRuby =================== MDArray subscribes fully to the SciRuby Manifesto (http://sciruby.com/). "Ruby has for some time had no equivalent to the beautifully constructed NumPy, SciPy, and matplotlib libraries for Python. We believe that the time for a Ruby science and visualization package has come. Sometimes when a solution of sugar and water becomes super-saturated, from it precipitates a pure, delicious, and diabetes-inducing crystal of sweetness, induced by no more than the tap of a finger. So is occurring now, we believe, with numeric and visualization libraries for Ruby." Main properties =============== + Homogeneous multidimensional array, a table of elements (usually numbers), all of the same type, indexed by a tuple of positive integers; + Easy calculation for large numerical multi dimensional arrays; + Basic types are: boolean, byte, short, int, long, float, double, string, structure; + Based on JRuby, which allows importing Java libraries; + Operator: +,-,*,/,%,**, >, >=, etc. + Functions: abs, ceil, floor, truncate, is_zero, square, cube, fourth; + Binary Operators: &, |, ^, ~ (binary_ones_complement), <<, >>; + Ruby Math functions: acos, acosh, asin, asinh, atan, atan2, atanh, cbrt, cos, erf, exp, gamma, hypot, ldexp, log, log10, log2, sin, sinh, sqrt, tan, tanh, neg; + Boolean operations on boolean arrays: and, or, not; + Fast descriptive statistics from Parallel Colt (complete list found bellow); + Easy manipulation of arrays: reshape, reduce dimension, permute, section, slice, etc. + Reading of two dimensional arrays from CSV files (mainly for debugging and simple testing purposes); + StatList: a list that can grow/shrink and that can compute Parallel Colt descriptive statistics. 
Descriptive statistics methods ============================== auto_correlation, correlation, covariance, durbin_watson, frequencies, geometric_mean, harmonic_mean, kurtosis, lag1, max, mean, mean_deviation, median, min, moment, moment3, moment4, pooled_mean, pooled_variance, product, quantile, quantile_inverse, rank_interpolated, rms, sample_covariance, sample_kurtosis, sample_kurtosis_standard_error, sample_skew, sample_skew_standard_error, sample_standard_deviation, sample_variance, sample_weighted_variance, skew, split, standard_deviation, standard_error, sum, sum_of_inversions, sum_of_logarithms, sum_of_powers, sum_of_power_deviations, sum_of_squares, sum_of_squared_deviations, trimmed_mean, variance, weighted_mean, weighted_rms, weighted_sums, winsorized_mean. Installation and download ========================= + Install Jruby + jruby -S gem install mdarray Contributors ============ + Contributors are welcome. Homepages ========= + http://rubygems.org/gems/mdarray + https://github.com/rbotafogo/mdarray/wiki HISTORY ======= + 16/05/2013: Version 0.5.0: All loops transfered to Java with over 50% performance improvement. Descriptive statistics from Parallel Colt. + 19/04/2013: Version 0.4.3: Fixes a simple (but fatal bug). No new features + 17/04/2013: Version 0.4.2: Adds simple statistics and boolean operators + 05/05/2013: Version 0.4.0: Initial release -- Rodrigo Botafogo -------------- next part -------------- An HTML attachment was scrubbed... URL: From francesc at continuum.io Fri May 17 14:36:51 2013 From: francesc at continuum.io (Francesc Alted) Date: Fri, 17 May 2013 20:36:51 +0200 Subject: [Numpy-discussion] ANN: python-blosc 1.1 RC1 available for testing Message-ID: <519678C3.9080207@continuum.io> ================================ Announcing python-blosc 1.1 RC1 ================================ What is it? =========== python-blosc (http://blosc.pydata.org) is a Python wrapper for the Blosc compression library. Blosc (http://blosc.org) is a high performance compressor optimized for binary data. It has been designed to transmit data to the processor cache faster than the traditional, non-compressed, direct memory fetch approach via a memcpy() OS call. Whether this is achieved or not depends of the data compressibility, the number of cores in the system, and other factors. See a series of benchmarks conducted for many different systems: http://blosc.org/trac/wiki/SyntheticBenchmarks. Blosc works well for compressing numerical arrays that contains data with relatively low entropy, like sparse data, time series, grids with regular-spaced values, etc. There is also a handy command line for Blosc called Bloscpack (https://github.com/esc/bloscpack) that allows you to compress large binary datafiles on-disk. Although the format for Bloscpack has not stabilized yet, it allows you to effectively use Blosc from your favorite shell. What is new? ============ - Added new `compress_ptr` and `decompress_ptr` functions that allows to compress and decompress from/to a data pointer. These are low level calls and user must make sure that the pointer data area is safe. - Since Blosc (the C library) already supports to be installed as an standalone library (via cmake), it is also possible to link python-blosc against a system Blosc library. - The Python calls to Blosc are now thread-safe (another consequence of recent Blosc library supporting this at C level). - Many checks on types and ranges of values have been added. Most of the calls will now complain when passed the wrong values. 
- Docstrings are much improved. Also, Sphinx-based docs are available now. Many thanks to Valentin H?nel for his impressive work for this release. For more info, you can see the release notes in: https://github.com/FrancescAlted/python-blosc/wiki/Release-notes More docs and examples are available in the documentation site: http://blosc.pydata.org Installing ========== python-blosc is in PyPI repository, so installing it is easy: $ pip install -U blosc # yes, you should omit the blosc- prefix Download sources ================ The sources are managed through github services at: http://github.com/FrancescAlted/python-blosc Documentation ============= There is Sphinx-based documentation site at: http://blosc.pydata.org/ Mailing list ============ There is an official mailing list for Blosc at: blosc at googlegroups.com http://groups.google.es/group/blosc Licenses ======== Both Blosc and its Python wrapper are distributed using the MIT license. See: https://github.com/FrancescAlted/python-blosc/blob/master/LICENSES for more details. -- Francesc Alted From joe13676 at comcast.net Sat May 18 01:11:16 2013 From: joe13676 at comcast.net (Joe Piccoli) Date: Fri, 17 May 2013 22:11:16 -0700 Subject: [Numpy-discussion] Newbie trying to install NumPy Message-ID: <015701ce5386$205a1260$610e3720$@comcast.net> Hello, I've been trying to install NumPy to run with Eclipse on Windows Vista. After installing (I thought) NumPy I was seeing: ImportError: Error importing numpy: you should not try to import numpy from its source directory; please exit the numpy source tree, and relaunch your python intepreter from there. I next tried to follow the instructions from the scipy.org website and downloaded and ran: numpy-1.7.1-win32-superpack-python27.exe This started up but I immediately saw the following dialog: --------------------------- Cannot install --------------------------- Python version 2.7 required, which was not found in the registry. --------------------------- OK --------------------------- The next dialog prompted for a Python installation to use but the list box was empty and it would not allow me to enter a path. Is it absolutely necessary to build NumPy myself or is there a working installation out there? I know I'm doing something wrong but I don't know what it is. Any assistance would be greatly appreciated :). Thanks, Joseph A. Piccoli joe13676 at comcast.net -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Sat May 18 02:12:45 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Sat, 18 May 2013 08:12:45 +0200 Subject: [Numpy-discussion] faster (selection based) median, 2013 edition Message-ID: <51971BDD.9000307@googlemail.com> hi, once again I want to bring up the median algorithm which is implemented in terms of sorting in numpy. median (and percentile and a couple more functions) can be more efficiently implemented in terms of a selection algorithm. The complexity can them be linear instead of linearithmic. I found numerous discussions of this in the list archives [1, 2, 3] but I did not find why those attempts failed, the threads all just seemed to stop. Did the previous attempts fail due to lack of time or was there a fundamental reason blocking this change? In the hope of the former, I went ahead and implemented a prototype of a partition function (similar to [3] but only one argument) and implemented median in terms of it. 
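In Python-level terms, median via selection looks roughly like the sketch below (editorial shorthand against the proposed np.partition interface described next; it assumes kth may also be a sequence of indices, and the real implementation is in C):

import numpy as np

def median_via_selection(a):
    # median through a selection (partition) primitive rather than a
    # full sort: average O(n) instead of O(n log n)
    a = np.asarray(a).ravel()
    n = a.shape[0]
    k = n // 2
    if n % 2:
        # odd length: the middle element alone is the median
        return np.partition(a, k)[k]
    # even length: mean of the two middle elements; partitioning
    # around both indices places each one in its sorted position
    p = np.partition(a, [k - 1, k])
    return 0.5 * (p[k - 1] + p[k])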
partition is not like C++'s std::partition; it's equivalent to nth_element in C++. Maybe it's better to name it nth_element?

The code is available here:
https://github.com/juliantaylor/numpy/tree/select-median

the partition interface is:
ndarray.partition(kth, axis=-1)
kth is an integer
The array is transformed so that the k-th element of the array is in its final sorted position; all elements below it are smaller, all elements above are greater, but their ordering is undefined.

Example:
In [1]: d = np.arange(10); np.random.shuffle(d)
In [2]: d
Out[2]: array([1, 7, 0, 2, 5, 6, 8, 9, 3, 4])
In [3]: np.partition(d, 3)
Out[3]: array([0, 1, 2, 3, 4, 6, 8, 9, 7, 5])
In [4]: _[3] == 3
Out[4]: True

the performance of median improves as expected:
old vs new, 5000, uniform shuffled, out of place: 100us vs 40us
old vs new, 50000, uniform shuffled, out of place: 1.12ms vs 0.265ms
old vs new, 500000, uniform shuffled, out of place: 14ms vs 2.81ms

The implementation is very much still a prototype: argpartition is not exposed (and partition is only implemented on top of a quicksort), and there is only one algorithm (quickselect). One could still add median of medians for better worst-case performance.

If no blockers appear I want to fix this up and file a pull request to have this in numpy 1.8. Guidance on details of the implementation in numpy's C API is highly appreciated; it's the first time I'm dealing with it.

Cheers,
Julian Taylor

[1] http://thread.gmane.org/gmane.comp.python.numeric.general/50931/focus=50941
[2] http://thread.gmane.org/gmane.comp.python.numeric.general/32507/focus=41716
[3] http://thread.gmane.org/gmane.comp.python.numeric.general/32341/focus=32348

From johann.cohentanugi at gmail.com Sat May 18 03:28:01 2013
From: johann.cohentanugi at gmail.com (Johann Cohen-Tanugi)
Date: Sat, 18 May 2013 09:28:01 +0200
Subject: [Numpy-discussion] Newbie trying to install NumPy
In-Reply-To: <015701ce5386$205a1260$610e3720$@comcast.net>
References: <015701ce5386$205a1260$610e3720$@comcast.net>
Message-ID: <51972D81.1080001@gmail.com>

Hello Joe, I am not familiar with Windows, but I believe that Python(x,y) is here to help you:
http://code.google.com/p/pythonxy/
HTH,
Johann

On 05/18/2013 07:11 AM, Joe Piccoli wrote:
> Hello,
>
> I've been trying to install NumPy to run with Eclipse on Windows Vista.
> After installing (I thought) NumPy I was seeing:
>
> ImportError: Error importing numpy: you should not try to import numpy from
> its source directory; please exit the numpy source tree, and relaunch
> your python interpreter from there.
>
> I next tried to follow the instructions from the scipy.org website and
> downloaded and ran:
>
> numpy-1.7.1-win32-superpack-python27.exe
>
> This started up but I immediately saw the following dialog:
>
> ---------------------------
> Cannot install
> ---------------------------
> Python version 2.7 required, which was not found in the registry.
> ---------------------------
> OK
> ---------------------------
>
> The next dialog prompted for a Python installation to use but the list
> box was empty and it would not allow me to enter a path.
>
> Is it absolutely necessary to build NumPy myself or is there a working
> installation out there? I know I'm doing something wrong but I don't
> know what it is. Any assistance would be greatly appreciated :).
>
> Thanks,
>
> Joseph A. Piccoli
> joe13676 at comcast.net
> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From davidmenhur at gmail.com Sat May 18 04:52:41 2013 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Sat, 18 May 2013 10:52:41 +0200 Subject: [Numpy-discussion] Newbie trying to install NumPy In-Reply-To: <015701ce5386$205a1260$610e3720$@comcast.net> References: <015701ce5386$205a1260$610e3720$@comcast.net> Message-ID: On 18 May 2013 07:11, Joe Piccoli wrote: > Python version 2.7 required, which was not found in the registry. You probably have a different version installed. Grab Python 2.7 from python.org and install it; then, run the numpy installer. It should just work out of the box. From josef.pktd at gmail.com Sat May 18 07:34:11 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 18 May 2013 07:34:11 -0400 Subject: [Numpy-discussion] Newbie trying to install NumPy In-Reply-To: <015701ce5386$205a1260$610e3720$@comcast.net> References: <015701ce5386$205a1260$610e3720$@comcast.net> Message-ID: On Sat, May 18, 2013 at 1:11 AM, Joe Piccoli wrote: > Hello,**** > > ** ** > > I've been trying to install NumPy to run with Eclipse on Windows Vista. > After installing (I thought) NumPy I was seeing:**** > > ** ** > > ImportError: Error importing numpy: you should not try to import numpy from > **** > > its source directory; please exit the numpy source tree, and > relaunch**** > > your python intepreter from there. > Did you try to do this (what it says here)? Josef > **** > > ** ** > > I next tried to follow the instructions from the scipy.org website and > downloaded and ran:**** > > ** ** > > numpy-1.7.1-win32-superpack-python27.exe**** > > ** ** > > This started up but I immediately saw the following dialog:**** > > ** ** > > ---------------------------**** > > Cannot install**** > > ---------------------------**** > > Python version 2.7 required, which was not found in the registry.**** > > ---------------------------**** > > OK **** > > ---------------------------**** > > ** ** > > The next dialog prompted for a Python installation to use but the list box > was empty and it would not allow me to enter a path.**** > > ** ** > > Is it absolutely necessary to build NumPy myself or is there a working > installation out there? I know I'm doing something wrong but I don't know > what it is. Any assistance would be greatly appreciated :).**** > > ** ** > > Thanks,**** > > Joseph A. Piccoli**** > > joe13676 at comcast.net**** > > ** ** > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Sun May 19 18:14:22 2013 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Sun, 19 May 2013 15:14:22 -0700 Subject: [Numpy-discussion] Newbie trying to install NumPy In-Reply-To: References: <015701ce5386$205a1260$610e3720$@comcast.net> Message-ID: <-8497636677018212631@unknownmsgid> On May 18, 2013, at 1:53 AM, "Da?id" wrote: > On 18 May 2013 07:11, Joe > You probably have a different version installed. Grab Python 2.7 from > python.org and install it; Make sure you match 32/64 bit. The message is a bit out of date, you'll get the same error if you try to install a 32 bit numpy and have a 64 bit python. -Chris > then, run the numpy installer. 
It should > just work out of the box. > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From bakhtiyor_zokhidov at mail.ru Mon May 20 11:21:50 2013 From: bakhtiyor_zokhidov at mail.ru (=?UTF-8?B?QmFraHRpeW9yIFpva2hpZG92?=) Date: Mon, 20 May 2013 19:21:50 +0400 Subject: [Numpy-discussion] =?utf-8?q?Equvalent_function_for_Ceil=28=29_an?= =?utf-8?b?ZCBGbG9vcigp?= Message-ID: <1369063310.820343342@f281.mail.ru> Hello, I am using ceil() and floor() function to get upper and lower value of some numbers. Let's say: import math x1 = 0.35 y1 = 4.46 >>>?math.ceil(x1) 1.0 >>> math.floor(y1) 4.0 The problem is that If I want to get upper and lower values for the certain step, for example, step = 0.25, ceil() function should give: new_ceil(x1, step) => 0.5 new_floor(y1, step) => 4.25 Because, the step is 0.25 Question: How I can I achieve those results by using ceil() and floor() function, or Is there any equvalent function for that? --? Bakhti -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Mon May 20 11:36:20 2013 From: njs at pobox.com (Nathaniel Smith) Date: Mon, 20 May 2013 16:36:20 +0100 Subject: [Numpy-discussion] Equvalent function for Ceil() and Floor() In-Reply-To: <1369063310.820343342@f281.mail.ru> References: <1369063310.820343342@f281.mail.ru> Message-ID: On Mon, May 20, 2013 at 4:21 PM, Bakhtiyor Zokhidov wrote: > Hello, > > I am using ceil() and floor() function to get upper and lower value of some > numbers. Let's say: > > import math > x1 = 0.35 > y1 = 4.46 >>>> math.ceil(x1) > 1.0 > >>>> math.floor(y1) > 4.0 > > The problem is that If I want to get upper and lower values for the certain > step, for example, step = 0.25, ceil() function should give: > new_ceil(x1, step) => 0.5 > new_floor(y1, step) => 4.25 > Because, the step is 0.25 > > Question: How I can I achieve those results by using ceil() and floor() > function, or Is there any equvalent function for that? def new_ceil(x, step): return np.ceil(x / step) * step ? -n From robert.kern at gmail.com Mon May 20 11:37:16 2013 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 20 May 2013 16:37:16 +0100 Subject: [Numpy-discussion] Equvalent function for Ceil() and Floor() In-Reply-To: <1369063310.820343342@f281.mail.ru> References: <1369063310.820343342@f281.mail.ru> Message-ID: On Mon, May 20, 2013 at 4:21 PM, Bakhtiyor Zokhidov wrote: > Hello, > > I am using ceil() and floor() function to get upper and lower value of some > numbers. Let's say: > > import math > x1 = 0.35 > y1 = 4.46 >>>> math.ceil(x1) > 1.0 > >>>> math.floor(y1) > 4.0 > > The problem is that If I want to get upper and lower values for the certain > step, for example, step = 0.25, ceil() function should give: > new_ceil(x1, step) => 0.5 > new_floor(y1, step) => 4.25 > Because, the step is 0.25 > > Question: How I can I achieve those results by using ceil() and floor() > function, or Is there any equvalent function for that? 
For most purposes, the following functions suffice: def new_ceil(x, step): return math.ceil(x / step) * step def new_floor(x, step): return math.floor(x / step) * step Alternately: def new_ceil(x, step): quotient = x // step remainder = x % step return (quotient + (remainder > 0)) * step def new_floor(x, step): quotient = x // step return quotient * step Floating point representation errors and accumulated floating point arithmetic inaccuracies may give you unexpected results in many cases, so be careful. -- Robert Kern From bakhtiyor_zokhidov at mail.ru Mon May 20 11:54:26 2013 From: bakhtiyor_zokhidov at mail.ru (=?UTF-8?B?QmFraHRpeW9yIFpva2hpZG92?=) Date: Mon, 20 May 2013 19:54:26 +0400 Subject: [Numpy-discussion] =?utf-8?q?Equvalent_function_for_Ceil=28=29_an?= =?utf-8?b?ZCBGbG9vcigp?= In-Reply-To: References: <1369063310.820343342@f281.mail.ru> Message-ID: <1369065266.537085267@f202.mail.ru> thanks a lot what about the following example: >>>new_ceil(-0.24, 0.25) -0.0 ??? thanks in advance for the reply ???????????, 20 ??? 2013, 16:37 +01:00 ?? Robert Kern : >On Mon, May 20, 2013 at 4:21 PM, Bakhtiyor Zokhidov >< bakhtiyor_zokhidov at mail.ru > wrote: >> Hello, >> >> I am using ceil() and floor() function to get upper and lower value of some >> numbers. Let's say: >> >> import math >> x1 = 0.35 >> y1 = 4.46 >>>>> math.ceil(x1) >> 1.0 >> >>>>> math.floor(y1) >> 4.0 >> >> The problem is that If I want to get upper and lower values for the certain >> step, for example, step = 0.25, ceil() function should give: >> new_ceil(x1, step) => 0.5 >> new_floor(y1, step) => 4.25 >> Because, the step is 0.25 >> >> Question: How I can I achieve those results by using ceil() and floor() >> function, or Is there any equvalent function for that? > >For most purposes, the following functions suffice: > >def new_ceil(x, step): >????return math.ceil(x / step) * step > >def new_floor(x, step): >????return math.floor(x / step) * step > > >Alternately: > >def new_ceil(x, step): >????quotient = x // step >????remainder = x % step >????return (quotient + (remainder > 0)) * step > >def new_floor(x, step): >????quotient = x // step >????return quotient * step > > >Floating point representation errors and accumulated floating point >arithmetic inaccuracies may give you unexpected results in many cases, >so be careful. > >-- >Robert Kern >_______________________________________________ >NumPy-Discussion mailing list >NumPy-Discussion at scipy.org >http://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Mon May 20 11:55:47 2013 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 20 May 2013 16:55:47 +0100 Subject: [Numpy-discussion] Equvalent function for Ceil() and Floor() In-Reply-To: <1369065266.537085267@f202.mail.ru> References: <1369063310.820343342@f281.mail.ru> <1369065266.537085267@f202.mail.ru> Message-ID: On Mon, May 20, 2013 at 4:54 PM, Bakhtiyor Zokhidov wrote: > thanks a lot > > what about the following example: >>>>new_ceil(-0.24, 0.25) > -0.0 > ??? > thanks in advance for the reply What about it? What were you expecting to get? What do you think is wrong about the answer you did get? 
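For reference, a compact runnable version of the functions above, with one way to avoid the -0.0 result that prompted this follow-up (the "+ 0.0" normalization is a suggested addition, not part of the answer above):

import math

def new_ceil(x, step):
    # smallest multiple of step >= x; adding 0.0 maps -0.0 to 0.0
    return math.ceil(x / step) * step + 0.0

def new_floor(x, step):
    # largest multiple of step <= x
    return math.floor(x / step) * step + 0.0

new_ceil(0.35, 0.25)    # -> 0.5
new_floor(4.46, 0.25)   # -> 4.25
new_ceil(-0.24, 0.25)   # -> 0.0 rather than -0.0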
-- 
Robert Kern

From ndbecker2 at gmail.com Mon May 20 12:00:31 2013
From: ndbecker2 at gmail.com (Neal Becker)
Date: Mon, 20 May 2013 12:00:31 -0400
Subject: [Numpy-discussion] another indexing question
Message-ID: 

I have a system that transmits signals for an alphabet of M symbols over an additive Gaussian noise channel. The receiver has a 1-d array of complex received values. I'd like to find the means of the received values according to the symbol that was transmitted.

So transmit symbol indexes might be:

x = [0, 1, 2, 1, 3, ...]

and receive output might be:

y = [(1+1j), (1-1j), ...]

Suppose the alphabet was M=4. Then I'd like to get an array of means m[0...3] that correspond to the values of y for each of the corresponding values of x.

I can't think of a better way than manually using loops. Any tricks here?

From robert.kern at gmail.com Mon May 20 12:03:55 2013
From: robert.kern at gmail.com (Robert Kern)
Date: Mon, 20 May 2013 17:03:55 +0100
Subject: [Numpy-discussion] another indexing question
In-Reply-To: 
References: 
Message-ID: 

On Mon, May 20, 2013 at 5:00 PM, Neal Becker wrote:
> I have a system that transmits signals for an alphabet of M symbols
> over an additive Gaussian noise channel. The receiver has a
> 1-d array of complex received values. I'd like to find the means
> of the received values according to the symbol that was transmitted.
>
> So transmit symbol indexes might be:
>
> x = [0, 1, 2, 1, 3, ...]
>
> and receive output might be:
>
> y = [(1+1j), (1-1j), ...]
>
> Suppose the alphabet was M=4. Then I'd like to get an array of means
> m[0...3] that correspond to the values of y for each of the corresponding
> values of x.
>
> I can't think of a better way than manually using loops. Any tricks here?

All you need is a single loop over the alphabet, which is usually not problematic.

means = np.empty(M, dtype=complex)  # the received values are complex, so the means are too
for i in range(M):
    means[i] = y[x == i].mean()

-- 
Robert Kern

From chris.barker at noaa.gov Mon May 20 12:03:53 2013
From: chris.barker at noaa.gov (Chris Barker - NOAA Federal)
Date: Mon, 20 May 2013 09:03:53 -0700
Subject: [Numpy-discussion] Equvalent function for Ceil() and Floor()
In-Reply-To: <1369065266.537085267@f202.mail.ru>
References: <1369063310.820343342@f281.mail.ru> <1369065266.537085267@f202.mail.ru>
Message-ID: 

On Mon, May 20, 2013 at 8:54 AM, Bakhtiyor Zokhidov wrote:

> what about the following example:
>>>> new_ceil(-0.24, 0.25)
> -0.0

ceil rounds toward +inf (and floor towards -inf) -- this is exactly what you want if you're doing what I think you are... (note that round() rounds towards and away from zero -- so different for negative numbers...)

the minus sign means "minus 0.0", which is a seemingly odd floating point thing -- but FP numbers have a sign bit, which, in this case, is set to negative, even though, mathematically speaking, 0.0 doesn't have a sign. But it will compare to 0.0 as you'd expect:

In [8]: -0.0 == 0.0
Out[8]: True

In [9]: -0.0 < 0.0
Out[9]: False

-Chris

-- 

Christopher Barker, Ph.D.
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From bakhtiyor_zokhidov at mail.ru Mon May 20 13:58:13 2013 From: bakhtiyor_zokhidov at mail.ru (=?UTF-8?B?QmFraHRpeW9yIFpva2hpZG92?=) Date: Mon, 20 May 2013 21:58:13 +0400 Subject: [Numpy-discussion] =?utf-8?q?Equvalent_function_for_Ceil=28=29_an?= =?utf-8?b?ZCBGbG9vcigp?= References: <1369063310.820343342@f281.mail.ru> <1369065266.537085267@f202.mail.ru> Message-ID: <1369072693.777993222@f82.mail.ru> ok... I think -0.0 is mathematically wrong but in a program it is true. What I suspect is that if -0.0 doesn't affect on result (e.g., 2*(-0.0 + 2) or (-0.0-2)*2 ) . If it does not affect results it would be good for me Thanks ???????????, 20 ??? 2013, 9:03 -07:00 ?? Chris Barker - NOAA Federal : >On Mon, May 20, 2013 at 8:54 AM, Bakhtiyor Zokhidov >< bakhtiyor_zokhidov at mail.ru > wrote: > >> what about the following example: >>>>>new_ceil(-0.24, 0.25) >> -0.0 > >ceil rounds toward +inf (and floor towards -inf) -- this is exactly >what you want if you're doing what I think you are...(note that >round() rounds towards and away from zero -- so different for negative >numbers...) > >the minus sign means "minus 0.0", which is a seemingly odd floating >point thing -- but FP numbers have a sign bit, which, in this case, is >set to negative, even though, mathematically speaking, 0.0 doesn't >have a sign. But it will compare to 0.0 as you'd expect: > >In [8]: -0.0 == 0.0 >Out[8]: True > >In [9]: -0.0 < 0.0 >Out[9]: False > >-Chris > >-- > >Christopher Barker, Ph.D. >Oceanographer > >Emergency Response Division >NOAA/NOS/OR&R (206) 526-6959 voice >7600 Sand Point Way NE (206) 526-6329 fax >Seattle, WA 98115 (206) 526-6317 main reception > >Chris.Barker at noaa.gov >_______________________________________________ >NumPy-Discussion mailing list >NumPy-Discussion at scipy.org >http://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Mon May 20 15:51:51 2013 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 20 May 2013 20:51:51 +0100 Subject: [Numpy-discussion] Equvalent function for Ceil() and Floor() In-Reply-To: <1369072693.777993222@f82.mail.ru> References: <1369063310.820343342@f281.mail.ru> <1369065266.537085267@f202.mail.ru> <1369072693.777993222@f82.mail.ru> Message-ID: On Mon, May 20, 2013 at 6:58 PM, Bakhtiyor Zokhidov wrote: > ok... I think -0.0 is mathematically wrong but in a program it is true. > > What I suspect is that if -0.0 doesn't affect on result (e.g., 2*(-0.0 + 2) > or (-0.0-2)*2 ) . If it does not affect results it would be good for me It doesn't affect those computations, no. It does have different results for division (1/-0.0 -> -inf), and it will often determine which branch gets evaluated for a complex function that has a branch point at the origin. These are usually considered good things and are the primary reason that floating point includes signed zeros. 
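A few concrete lines for reference (an illustrative aside: plain Python raises ZeroDivisionError on 1.0/-0.0, so the division behavior is shown here with numpy scalars):

import math
import numpy as np

-0.0 == 0.0                   # True: the two zeros compare equal
math.copysign(1.0, -0.0)      # -1.0: but the sign bit is visible
-0.0 + 0.0                    # 0.0: adding +0.0 normalizes the sign

with np.errstate(divide='ignore'):
    np.float64(1.0) / np.float64(-0.0)   # -inf
    np.float64(1.0) / np.float64(0.0)    # inf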
http://en.wikipedia.org/wiki/Signed_zero -- Robert Kern From mrbago at gmail.com Mon May 20 22:45:06 2013 From: mrbago at gmail.com (Bago) Date: Mon, 20 May 2013 19:45:06 -0700 Subject: [Numpy-discussion] another indexing question In-Reply-To: References: Message-ID: You could also try using bincount, (np.bincount(x, y.real) + 1j*np.bincount(x, y.imag)) / np.bincount(x) Bago On Mon, May 20, 2013 at 9:03 AM, Robert Kern wrote: > On Mon, May 20, 2013 at 5:00 PM, Neal Becker wrote: > > I have a system that transmits signals for an alphabet of M symbols > > over and additive Gaussian noise channel. The receiver has a > > 1-d array of complex received values. I'd like to find the means > > of the received values according to the symbol that was transmitted. > > > > So transmit symbol indexes might be: > > > > x = [0, 1, 2, 1, 3, ...] > > > > and receive output might be: > > > > y = [(1+1j), (1-1j), ...] > > > > Suppose the alphabet was M=4. Then I'd like to get an array of means > > > > m[0...3] that correspond to the values of y for each of the corresponding > > values of x. > > > > I can't think of a better way than manually using loops. Any tricks > here? > > All you need is a single loop over the alphabet, which is usually not > problematic. > > > means = np.empty([M]) > for i in range(M): > means[i] = y[x == i].mean() > > -- > Robert Kern > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Nicolas.Rougier at inria.fr Wed May 22 13:07:47 2013 From: Nicolas.Rougier at inria.fr (Nicolas Rougier) Date: Wed, 22 May 2013 19:07:47 +0200 Subject: [Numpy-discussion] Possible conversion bug with record array Message-ID: <37980CD6-23D4-4982-939C-721DB91E4C24@inria.fr> Hi all, I got a weird output from the following script: import numpy as np U = np.zeros(1, dtype=[('x', np.float32, (4,4))]) U[0] = np.eye(4) print U[0] # output: ([[0.0, 1.875, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.875], [0.0, 0.0, 0.0, 0.0]],) U[0] = np.eye(4, dtype=np.float32) print U[0] # output: ([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]],) The first output is obviously wrong. Can anyone confirm ? (using numpy 1.7.1 on osx 10.8.3) Nicolas From chris.barker at noaa.gov Wed May 22 13:39:37 2013 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Wed, 22 May 2013 10:39:37 -0700 Subject: [Numpy-discussion] Possible conversion bug with record array In-Reply-To: <37980CD6-23D4-4982-939C-721DB91E4C24@inria.fr> References: <37980CD6-23D4-4982-939C-721DB91E4C24@inria.fr> Message-ID: On Wed, May 22, 2013 at 10:07 AM, Nicolas Rougier > U = np.zeros(1, dtype=[('x', np.float32, (4,4))]) > > U[0] = np.eye(4) > print U[0] > # output: ([[0.0, 1.875, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.875], [0.0, 0.0, 0.0, 0.0]],) I get the same thing. Note: In [86]: U[0].shape Out[86]: () it's a rank-zero array, not a 4X4 array -- some odd tings can happen there... This works: In [94]: U = np.zeros(1, dtype=[('x', np.float32, (4,4))]) In [95]: U['x'][0] = np.eye(4) In [96]: U Out[96]: array([ ([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]],)], dtype=[('x', ' References: <37980CD6-23D4-4982-939C-721DB91E4C24@inria.fr> Message-ID: Hi, FWIW, apparently bug related to dtype of np.eye(.) 
On Wed, May 22, 2013 at 8:07 PM, Nicolas Rougier wrote:

> Hi all,
>
> I got a weird output from the following script:
>
> import numpy as np
>
> U = np.zeros(1, dtype=[('x', np.float32, (4,4))])
>
> U[0] = np.eye(4)
> print U[0]
> # output: ([[0.0, 1.875, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0,
> 1.875], [0.0, 0.0, 0.0, 0.0]],)
>
> U[0] = np.eye(4, dtype=np.float32)
> print U[0]
> # output: ([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0,
> 0.0], [0.0, 0.0, 0.0, 1.0]],)
>
> The first output is obviously wrong. Can anyone confirm ?
> (using numpy 1.7.1 on osx 10.8.3)

In []: sys.version
Out[]: '2.7.2 (default, Jun 12 2011, 15:08:59) [MSC v.1500 32 bit (Intel)]'
In []: np.__version__
Out[]: '1.6.0'
In []: U= np.zeros(1, dtype= [('x', np.float32, (4, 4))])

In []: U[0]= np.eye(4)
In []: U
Out[]:
array([ ([[0.0, 1.875, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.875], [0.0, 0.0, 0.0, 0.0]],)],
      dtype=[('x', '<f4', (4, 4))])

In []: U[0]= np.eye(4, dtype= np.float32)
In []: U
Out[]:
array([ ([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]],)],
      dtype=[('x', '<f4', (4, 4))])

and
In []: sys.version
Out[]: '3.3.0 (v3.3.0:bd8afb90ebf2, Sep 29 2012, 10:57:17) [MSC v.1600 64 bit (AMD64)]'
In []: np.__version__
Out[]: '1.7.0rc1'
In []: U= np.zeros(1, dtype= [('x', np.float32, (4, 4))])

In []: U[0]= np.eye(4)
In []: U
Out[17]:
array([ ([[0.0, 1.875, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.875], [0.0, 0.0, 0.0, 0.0]],)],
      dtype=[('x', '<f4', (4, 4))])

In []: U[0]= np.eye(4, dtype= np.float32)
In []: U
Out[]:
array([ ([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]],)],
      dtype=[('x', '<f4', (4, 4))])

My 2 cents,
-eat

> Nicolas
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From Nicolas.Rougier at inria.fr Wed May 22 14:30:21 2013
From: Nicolas.Rougier at inria.fr (Nicolas Rougier)
Date: Wed, 22 May 2013 20:30:21 +0200
Subject: [Numpy-discussion] Possible conversion bug with record array
In-Reply-To: 
References: <37980CD6-23D4-4982-939C-721DB91E4C24@inria.fr>
Message-ID: <1A159F81-3713-4BB6-B5D5-708D8BDD1A99@inria.fr>

Thanks, I filed a new issue on the bug tracker.

Nicolas

On May 22, 2013, at 8:15 PM, eat wrote:

> Hi,
>
> FWIW, apparently bug related to dtype of np.eye(.)
>
> On Wed, May 22, 2013 at 8:07 PM, Nicolas Rougier wrote:
>> Hi all,
>>
>> I got a weird output from the following script:
>>
>> import numpy as np
>>
>> U = np.zeros(1, dtype=[('x', np.float32, (4,4))])
>>
>> U[0] = np.eye(4)
>> print U[0]
>> # output: ([[0.0, 1.875, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0,
>> 1.875], [0.0, 0.0, 0.0, 0.0]],)
>>
>> U[0] = np.eye(4, dtype=np.float32)
>> print U[0]
>> # output: ([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0,
>> 0.0], [0.0, 0.0, 0.0, 1.0]],)
>>
>> The first output is obviously wrong. Can anyone confirm ?
>> (using numpy 1.7.1 on osx 10.8.3)
>
> In []: sys.version
> Out[]: '2.7.2 (default, Jun 12 2011, 15:08:59) [MSC v.1500 32 bit (Intel)]'
> In []: np.__version__
> Out[]: '1.6.0'
> In []: U= np.zeros(1, dtype= [('x', np.float32, (4, 4))])
>
> In []: U[0]= np.eye(4)
> In []: U
> Out[]:
> array([ ([[0.0, 1.875, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.875], [0.0, 0.0, 0.0, 0.0]],)],
>       dtype=[('x', '<f4', (4, 4))])
>
> In []: U[0]= np.eye(4, dtype= np.float32)
> In []: U
> Out[]:
> array([ ([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]],)],
>       dtype=[('x', '<f4', (4, 4))])
>
> and
> In []: sys.version
> Out[]: '3.3.0 (v3.3.0:bd8afb90ebf2, Sep 29 2012, 10:57:17) [MSC v.1600 64 bit (AMD64)]'
> In []: np.__version__
> Out[]: '1.7.0rc1'
> In []: U= np.zeros(1, dtype= [('x', np.float32, (4, 4))])
>
> In []: U[0]= np.eye(4)
> In []: U
> Out[17]:
> array([ ([[0.0, 1.875, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.875], [0.0, 0.0, 0.0, 0.0]],)],
>       dtype=[('x', '<f4', (4, 4))])
>
> In []: U[0]= np.eye(4, dtype= np.float32)
> In []: U
> Out[]:
> array([ ([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]],)],
>       dtype=[('x', '<f4', (4, 4))])
>
> My 2 cents,
> -eat
>
> Nicolas
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From chris.barker at noaa.gov Wed May 22 18:09:31 2013
From: chris.barker at noaa.gov (Chris Barker - NOAA Federal)
Date: Wed, 22 May 2013 15:09:31 -0700
Subject: [Numpy-discussion] Possible conversion bug with record array
In-Reply-To: 
References: <37980CD6-23D4-4982-939C-721DB91E4C24@inria.fr>
Message-ID: 

On Wed, May 22, 2013 at 11:15 AM, eat wrote:
> FWIW, apparently bug related to dtype of np.eye(.)

sort of -- the issue shows up when assigning a float64 array (the default for eye()) to a rank-0 array with a custom dtype that has a single field that is an array... numpy should do the right thing there.

If you pull the array out of the custom dtype object, it does do the right thing. See the example I posted earlier.

-Chris

> On Wed, May 22, 2013 at 8:07 PM, Nicolas Rougier wrote:
>> Hi all,
>>
>> I got a weird output from the following script:
>>
>> import numpy as np
>>
>> U = np.zeros(1, dtype=[('x', np.float32, (4,4))])
>>
>> U[0] = np.eye(4)
>> print U[0]
>> # output: ([[0.0, 1.875, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0,
>> 1.875], [0.0, 0.0, 0.0, 0.0]],)
>>
>> U[0] = np.eye(4, dtype=np.float32)
>> print U[0]
>> # output: ([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0,
>> 0.0], [0.0, 0.0, 0.0, 1.0]],)
>>
>> The first output is obviously wrong. Can anyone confirm ?
>> (using numpy 1.7.1 on osx 10.8.3)
>
> [eat's interpreter sessions, quoted above, snipped]
>
> My 2 cents,
> -eat

-- 
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959 voice
7600 Sand Point Way NE   (206) 526-6329 fax
Seattle, WA 98115        (206) 526-6317 main reception

Chris.Barker at noaa.gov

From guillaume.viry at gmail.com Wed May 22 21:12:09 2013
From: guillaume.viry at gmail.com (Guillaume VIRY)
Date: Thu, 23 May 2013 10:12:09 +0900
Subject: [Numpy-discussion] Behavior of finance.npv function
Message-ID: 

Good morning all

I have been using the financial functions for a while now, and the npv function has always bothered me. Indeed, neither the documentation nor the implementation seems to be correct, and they are not congruent with each other either.

As stated in the bug report https://github.com/numpy/numpy/issues/649: the documentation states that the summation is from 0 to M (which should be M-1 if the input data is of size M, by the way), which is the correct definition, but the implementation computes a summation from 1 to M, which is known to be wrong and a mimic of Excel's NPV behavior.

So basically, we would have the following:

RealNPV(data) = data[0] + CurrentNPV(data[1:])

Both behaviors are fine to me, as long as the documentation is clear about it. Ideally the implementation should be corrected, but in any case there should be a documentation fix at least.

I'd be glad to have your opinion about it.

From lists at hilboll.de Thu May 23 06:20:26 2013
From: lists at hilboll.de (Andreas Hilboll)
Date: Thu, 23 May 2013 12:20:26 +0200
Subject: [Numpy-discussion] pyhdf packaging
In-Reply-To: 
References: 
Message-ID: <519DED6A.6040800@hilboll.de>

On 10.05.2013 19:32, Arnaldo Russo wrote:
> Hi Andreas,
> This packaging would be very useful!
> How can I help with this?
> pyhdf is very important because HDF4-EOS does not open with other
> packages, only with pyhdf and gdal.
Hi Arnaldo,

I actually went ahead and put the package on my PPA:

https://launchpad.net/~andreas-h/+archive/scicomp-stable

Cheers, Andreas.

> ---
> *Arnaldo D'Amaral Pereira Granja Russo*
> Lab. de Estudos dos Oceanos e Clima
> Instituto de Oceanografia - FURG
>
> 2012/11/6 Andreas Hilboll
>
> Hi,
>
> I would like to package pyhdf for Ubuntu and make the package publicly
> available. Since the license is not totally clear to me (I cannot
> find any information in the sources, and the cheeseshop says "public", which
> doesn't mean anything to me), I tried to contact the maintainer, Andre
> Gosselin, however the email bounces, so I guess he's gone.
>
> Can anyone point me to how to proceed from here?
>
> Cheers, Andreas.
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From Nicolas.Rougier at inria.fr Thu May 23 09:45:45 2013
From: Nicolas.Rougier at inria.fr (Nicolas Rougier)
Date: Thu, 23 May 2013 15:45:45 +0200
Subject: [Numpy-discussion] np.dot and 'out' bug
Message-ID: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr>

Hi,

From the dot documentation, I tried something simple:

a = np.array([[1, 2], [3, 4]])
b = np.array([[1, 2], [3, 4]])
np.dot(a, b)

-> array([[ 7, 10],
          [15, 22]])

And I got the expected result, but if I use either a or b as output, the results are wrong (and nothing in the dot documentation prevents me from doing this):

a = np.array([[1, 2], [3, 4]])
b = np.array([[1, 2], [3, 4]])
np.dot(a,b,out=a)

-> array([[ 6, 20],
          [15, 46]])

a = np.array([[1, 2], [3, 4]])
b = np.array([[1, 2], [3, 4]])
np.dot(a,b,out=b)

-> array([[ 6, 10],
          [30, 46]])

Can anyone confirm this behavior ? (tested using numpy 1.7.1)

Nicolas

From pierre.haessig at crans.org Thu May 23 10:16:26 2013
From: pierre.haessig at crans.org (Pierre Haessig)
Date: Thu, 23 May 2013 16:16:26 +0200
Subject: [Numpy-discussion] np.dot and 'out' bug
In-Reply-To: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr>
References: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr>
Message-ID: <519E24BA.8090408@crans.org>

Hi Nicolas,

On 23/05/2013 15:45, Nicolas Rougier wrote:
> if I use either a or b as output, results are wrong (and nothing in the dot documentation prevents me from doing this):
>
> a = np.array([[1, 2], [3, 4]])
> b = np.array([[1, 2], [3, 4]])
> np.dot(a,b,out=a)
>
> -> array([[ 6, 20],
>           [15, 46]])
>
> Can anyone confirm this behavior ? (tested using numpy 1.7.1)
I just reproduced the same weird results with numpy 1.6.2

best,
Pierre

From matthieu.brucher at gmail.com Thu May 23 10:19:23 2013
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Thu, 23 May 2013 15:19:23 +0100
Subject: [Numpy-discussion] np.dot and 'out' bug
In-Reply-To: <519E24BA.8090408@crans.org>
References: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr> <519E24BA.8090408@crans.org>
Message-ID: 

Hi,

It's to be expected. You are overwriting one of your input vectors while it is still being used.
So not a numpy bug ;) Matthieu 2013/5/23 Pierre Haessig > Hi Nicolas, > > Le 23/05/2013 15:45, Nicolas Rougier a ?crit : > > if I use either a or b as output, results are wrong (and nothing in the > dot documentation prevents me from doing this): > > > > a = np.array([[1, 2], [3, 4]]) > > b = np.array([[1, 2], [3, 4]]) > > np.dot(a,b,out=a) > > > > -> array([[ 6, 20], > > [15, 46]]) > > > > > > Can anyone confirm this behavior ? (tested using numpy 1.7.1) > I just reproduced the same weird results with numpy 1.6.2 > > best, > Pierre > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Information System Engineer, Ph.D. Blog: http://matt.eifelle.com LinkedIn: http://www.linkedin.com/in/matthieubrucher Music band: http://liliejay.com/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Thu May 23 10:42:37 2013 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 23 May 2013 15:42:37 +0100 Subject: [Numpy-discussion] np.dot and 'out' bug In-Reply-To: References: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr> <519E24BA.8090408@crans.org> Message-ID: On Thu, May 23, 2013 at 3:19 PM, Matthieu Brucher wrote: > Hi, > > It's to be expected. You are overwritten one of your input vector while it > is still being used. > So not a numpy bug ;) Sure, that's clearly what's going on, but numpy shouldn't let you silently shoot yourself in the foot like that. Re-using input as output is a very common operation, and usually supported fine. Probably we should silently make a copy of any input(s) that overlap with the output? For high-dimensional dot, buffering temprary subspaces would still be more memory efficient than anything users could reasonably accomplish by hand. -n From Nicolas.Rougier at inria.fr Thu May 23 10:50:11 2013 From: Nicolas.Rougier at inria.fr (Nicolas Rougier) Date: Thu, 23 May 2013 16:50:11 +0200 Subject: [Numpy-discussion] np.dot and 'out' bug In-Reply-To: References: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr> <519E24BA.8090408@crans.org> Message-ID: <3EF41063-BD7E-4B82-9E48-325D6D28DE7E@inria.fr> > > Sure, that's clearly what's going on, but numpy shouldn't let you > silently shoot yourself in the foot like that. Re-using input as > output is a very common operation, and usually supported fine. > Probably we should silently make a copy of any input(s) that overlap > with the output? For high-dimensional dot, buffering temprary > subspaces would still be more memory efficient than anything users > could reasonably accomplish by hand. Also, from a user point of view it is difficult to sort out which functions currently allow 'out=a' or out=b' since nothing in the 'dot' documentation warned me about such problem. Nicolas From matthieu.brucher at gmail.com Thu May 23 10:57:08 2013 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Thu, 23 May 2013 15:57:08 +0100 Subject: [Numpy-discussion] np.dot and 'out' bug In-Reply-To: <3EF41063-BD7E-4B82-9E48-325D6D28DE7E@inria.fr> References: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr> <519E24BA.8090408@crans.org> <3EF41063-BD7E-4B82-9E48-325D6D28DE7E@inria.fr> Message-ID: In my point of view, you should never use an output argument equal to an input argument. It can impede a lot of optimizations. 
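To make the failure mode and the ordinary safe patterns concrete, a short illustration (the wrong values are those reported in this thread for the numpy versions discussed; the safe variants are plain usage, not a new API):

import numpy as np

a = np.array([[1, 2], [3, 4]])
b = a.copy()

# Unsafe: out aliases an input, so rows of a are overwritten while
# they are still needed by later dot products.
np.dot(a, b, out=a)          # -> [[ 6 20] [15 46]], not [[ 7 10] [15 22]]

# Safe: write into a distinct buffer...
a = np.array([[1, 2], [3, 4]])
out = np.empty_like(a)
np.dot(a, b, out=out)        # -> [[ 7 10] [15 22]]

# ...or pay for an explicit copy of the aliased operand.
a = np.array([[1, 2], [3, 4]])
np.dot(a.copy(), b, out=a)   # -> [[ 7 10] [15 22]]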
Matthieu 2013/5/23 Nicolas Rougier > > > > > Sure, that's clearly what's going on, but numpy shouldn't let you > > silently shoot yourself in the foot like that. Re-using input as > > output is a very common operation, and usually supported fine. > > Probably we should silently make a copy of any input(s) that overlap > > with the output? For high-dimensional dot, buffering temprary > > subspaces would still be more memory efficient than anything users > > could reasonably accomplish by hand. > > > > Also, from a user point of view it is difficult to sort out which > functions currently allow 'out=a' or out=b' since nothing in the 'dot' > documentation warned me about such problem. > > > Nicolas > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Information System Engineer, Ph.D. Blog: http://matt.eifelle.com LinkedIn: http://www.linkedin.com/in/matthieubrucher Music band: http://liliejay.com/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Thu May 23 11:14:41 2013 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 23 May 2013 16:14:41 +0100 Subject: [Numpy-discussion] np.dot and 'out' bug In-Reply-To: References: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr> <519E24BA.8090408@crans.org> <3EF41063-BD7E-4B82-9E48-325D6D28DE7E@inria.fr> Message-ID: On Thu, May 23, 2013 at 3:57 PM, Matthieu Brucher wrote: > In my point of view, you should never use an output argument equal to an > input argument. It can impede a lot of optimizations. This is a fine philosophy in some cases, but a non-starter in others. Python doesn't have optimizations in the first place, and in-place operations are often critical for managing memory usage. '+=' is an important operator, and in numpy it's just 'np.add(a, b, out=a)' under the hood. On Thu, May 23, 2013 at 3:50 PM, Nicolas Rougier wrote: > Also, from a user point of view it is difficult to sort out which functions currently allow 'out=a' or out=b' since nothing in the 'dot' documentation warned me about such problem. That's because AFAIK all functions allow out=a and out=b, except for those which contain bugs :-). Can you file a bug in the bug tracker so this won't get lost? -n From Nicolas.Rougier at inria.fr Thu May 23 11:19:03 2013 From: Nicolas.Rougier at inria.fr (Nicolas Rougier) Date: Thu, 23 May 2013 17:19:03 +0200 Subject: [Numpy-discussion] np.dot and 'out' bug In-Reply-To: References: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr> <519E24BA.8090408@crans.org> <3EF41063-BD7E-4B82-9E48-325D6D28DE7E@inria.fr> Message-ID: > Can you file a bug in the bug tracker so this won't get lost? Done. From josef.pktd at gmail.com Thu May 23 12:12:50 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 May 2013 12:12:50 -0400 Subject: [Numpy-discussion] np.dot and 'out' bug In-Reply-To: References: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr> <519E24BA.8090408@crans.org> <3EF41063-BD7E-4B82-9E48-325D6D28DE7E@inria.fr> Message-ID: On Thu, May 23, 2013 at 11:14 AM, Nathaniel Smith wrote: > On Thu, May 23, 2013 at 3:57 PM, Matthieu Brucher > wrote: >> In my point of view, you should never use an output argument equal to an >> input argument. It can impede a lot of optimizations. > > This is a fine philosophy in some cases, but a non-starter in others. 
> Python doesn't have optimizations in the first place, and in-place > operations are often critical for managing memory usage. '+=' is an > important operator, and in numpy it's just 'np.add(a, b, out=a)' under > the hood. Isn't dot handled by Lapack, and whatever optimized version is available? Josef > > On Thu, May 23, 2013 at 3:50 PM, Nicolas Rougier > wrote: >> Also, from a user point of view it is difficult to sort out which functions currently allow 'out=a' or out=b' since nothing in the 'dot' documentation warned me about such problem. > > That's because AFAIK all functions allow out=a and out=b, except for > those which contain bugs :-). > > Can you file a bug in the bug tracker so this won't get lost? > > -n > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From chris.barker at noaa.gov Thu May 23 14:29:16 2013 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Thu, 23 May 2013 11:29:16 -0700 Subject: [Numpy-discussion] np.dot and 'out' bug In-Reply-To: References: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr> <519E24BA.8090408@crans.org> Message-ID: On Thu, May 23, 2013 at 7:19 AM, Matthieu Brucher wrote: > It's to be expected. You are overwritten one of your input vector while it > is still being used. > So not a numpy bug ;) It's a doc bug, at least. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From sebastian at sipsolutions.net Thu May 23 15:14:54 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 23 May 2013 21:14:54 +0200 Subject: [Numpy-discussion] np.dot and 'out' bug In-Reply-To: References: <43445F94-1E61-4801-86FF-319AA6433F28@inria.fr> <519E24BA.8090408@crans.org> Message-ID: <1369336494.18396.38.camel@sebastian-laptop> On Thu, 2013-05-23 at 15:42 +0100, Nathaniel Smith wrote: > On Thu, May 23, 2013 at 3:19 PM, Matthieu Brucher > wrote: > > Hi, > > > > It's to be expected. You are overwritten one of your input vector while it > > is still being used. > > So not a numpy bug ;) > > Sure, that's clearly what's going on, but numpy shouldn't let you > silently shoot yourself in the foot like that. Re-using input as > output is a very common operation, and usually supported fine. > Probably we should silently make a copy of any input(s) that overlap > with the output? For high-dimensional dot, buffering temprary > subspaces would still be more memory efficient than anything users > could reasonably accomplish by hand. Not sure whether you can easily implement buffering of subspaces with the (g)ufunc machinery, but that shouldn't worry much. The other thing is, that this problem generally exists for all ufuncs, only that for normal ones it is more difficult to trigger. This is after all much like the old, common and completely warranted complaint that `a += a.T` is not well defined. Only that for np.dot fixing it is a bit more obvious since nobody would abuse such in-place behaviour. Personally I think detecting this and at least warning for now in the (g)ufunc machinery would make a lot of sense. We can just hope someone feels like attempting such a feature :). 
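In the meantime, a user-level guard along the lines suggested above can be sketched with np.may_share_memory, which is conservative and may copy more often than strictly necessary; this is an illustration, not the eventual fix inside the (g)ufunc machinery:

import numpy as np

def safe_dot(a, b, out=None):
    # Copy any operand that may overlap the output buffer before
    # delegating to np.dot, so aliasing cannot corrupt the result.
    if out is not None:
        if np.may_share_memory(a, out):
            a = a.copy()
        if np.may_share_memory(b, out):
            b = b.copy()
    return np.dot(a, b, out=out)

a = np.array([[1, 2], [3, 4]])
b = a.copy()
safe_dot(a, b, out=a)   # matches np.dot(a, b): [[ 7 10] [15 22]]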
- Sebastian > > -n > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From charlesr.harris at gmail.com Thu May 23 16:44:21 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 23 May 2013 14:44:21 -0600 Subject: [Numpy-discussion] stdint.h Message-ID: Hi all, Just seeking some info here. The file stdint.h was part of the C99 standard and has types for integers of specified width and thus could be used to simplify some of the numpy configuration. I'm curious as to which compilers might be a problem and what folks think of that possibility. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Thu May 23 18:17:17 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 24 May 2013 00:17:17 +0200 Subject: [Numpy-discussion] faster (selection based) median, 2013 edition In-Reply-To: <51971BDD.9000307@googlemail.com> References: <51971BDD.9000307@googlemail.com> Message-ID: <519E956D.4000501@googlemail.com> On 18.05.2013 08:12, Julian Taylor wrote: > hi, > > once again I want to bring up the median algorithm which is implemented > in terms of sorting in numpy. > median (and percentile and a couple more functions) can be more > efficiently implemented in terms of a selection algorithm. The > complexity can them be linear instead of linearithmic. > I filed a work in progress pull request with the code, it now includes argpartition and crude docstrings: https://github.com/numpy/numpy/pull/3360 please give it a try. From chris.barker at noaa.gov Thu May 23 18:44:44 2013 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Thu, 23 May 2013 15:44:44 -0700 Subject: [Numpy-discussion] stdint.h In-Reply-To: References: Message-ID: On Thu, May 23, 2013 at 1:44 PM, Charles R Harris wrote: > Just seeking some info here. The file stdint.h was part of the C99 standard > and has types for integers of specified width and thus could be used to > simplify some of the numpy configuration. I'm curious as to which compilers > might be a problem and what folks think of that possibility. MSVC 2008, which is the (still!) the standard compiler for Python <= 2.7, does not include that header. Newer MSVC versions do. However, I'm a big fan of stdint.h, so I think it should be used. I'll bet we can find a header file for MSVC 2008 that could be included with numpy. (OK -- not bet much, but worth looking for!) -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From cournape at gmail.com Thu May 23 21:29:29 2013 From: cournape at gmail.com (David Cournapeau) Date: Fri, 24 May 2013 02:29:29 +0100 Subject: [Numpy-discussion] stdint.h In-Reply-To: References: Message-ID: On Thu, May 23, 2013 at 11:44 PM, Chris Barker - NOAA Federal wrote: > On Thu, May 23, 2013 at 1:44 PM, Charles R Harris > wrote: > >> Just seeking some info here. The file stdint.h was part of the C99 standard >> and has types for integers of specified width and thus could be used to >> simplify some of the numpy configuration. I'm curious as to which compilers >> might be a problem and what folks think of that possibility. > > MSVC 2008, which is the (still!) the standard compiler for Python <= > 2.7, does not include that header. 
Newer MSVC versions do. There are stdint,h for msvc hanging around, like here: https://code.google.com/p/msinttypes/ David From emanuele at relativita.com Fri May 24 08:02:22 2013 From: emanuele at relativita.com (Emanuele Olivetti) Date: Fri, 24 May 2013 14:02:22 +0200 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument Message-ID: <519F56CE.6000606@relativita.com> Hi, I'm using NumPy v1.6.1 shipped with Ubuntu 12.04 (Python 2.7.3). I observed an odd behavior of the multivariate_normal function, which does not like int64 for the 'size' argument. Short example: """ import numpy as np print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1) print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1)) """ Which outputs: """ $ python2.7 mvn_bug.py [[ 0.28880655 0.43289446]] Traceback (most recent call last): File "mvn_bug.py", line 3, in print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1)) File "mtrand.pyx", line 3990, in mtrand.RandomState.multivariate_normal (numpy/random/mtrand/mtrand.c:16663) IndexError: invalid index to scalar variable. """ I had a brief look to the tracker but haven't found any mention of this issue. It might be already solved in the current NumPy (v1.7.0)... or not. I'd like to have your feedback before submitting this issue to the bug tracking system. Best, Emanuele From francesc at continuum.io Fri May 24 08:08:34 2013 From: francesc at continuum.io (Francesc Alted) Date: Fri, 24 May 2013 14:08:34 +0200 Subject: [Numpy-discussion] ANN: python-blosc 1.1 (final) released Message-ID: <519F5842.2020906@continuum.io> =========================== Announcing python-blosc 1.1 =========================== What is it? =========== python-blosc (http://blosc.pydata.org/) is a Python wrapper for the Blosc compression library. Blosc (http://blosc.org) is a high performance compressor optimized for binary data. It has been designed to transmit data to the processor cache faster than the traditional, non-compressed, direct memory fetch approach via a memcpy() OS call. Whether this is achieved or not depends of the data compressibility, the number of cores in the system, and other factors. See a series of benchmarks conducted for many different systems: http://blosc.org/trac/wiki/SyntheticBenchmarks. Blosc works well for compressing numerical arrays that contains data with relatively low entropy, like sparse data, time series, grids with regular-spaced values, etc. There is also a handy command line for Blosc called Bloscpack (https://github.com/esc/bloscpack) that allows you to compress large binary datafiles on-disk. Although the format for Bloscpack has not stabilized yet, it allows you to effectively use Blosc from your favorite shell. What is new? ============ - Added new `compress_ptr` and `decompress_ptr` functions that allows to compress and decompress from/to a data pointer, avoiding an itermediate copy for maximum speed. Be careful, as these are low level calls, and user must make sure that the pointer data area is safe. - Since Blosc (the C library) already supports to be installed as an standalone library (via cmake), it is also possible to link python-blosc against a system Blosc library. - The Python calls to Blosc are now thread-safe (another consequence of recent Blosc library supporting this at C level). - Many checks on types and ranges of values have been added. Most of the calls will now complain when passed the wrong values. - Docstrings are much improved. 
Also, Sphinx-based docs are available now. Many thanks to Valentin H?nel for his impressive work for this release. For more info, you can see the release notes in: https://github.com/FrancescAlted/python-blosc/wiki/Release-notes More docs and examples are available in the documentation site: http://blosc.pydata.org Installing ========== python-blosc is in PyPI repository, so installing it is easy: $ pip install -U blosc # yes, you should omit the python- prefix Download sources ================ The sources are managed through github services at: http://github.com/FrancescAlted/python-blosc Documentation ============= There is Sphinx-based documentation site at: http://blosc.pydata.org/ Mailing list ============ There is an official mailing list for Blosc at: blosc at googlegroups.com http://groups.google.es/group/blosc Licenses ======== Both Blosc and its Python wrapper are distributed using the MIT license. See: https://github.com/FrancescAlted/python-blosc/blob/master/LICENSES for more details. Enjoy! -- Francesc Alted From Nicolas.Rougier at inria.fr Fri May 24 08:09:07 2013 From: Nicolas.Rougier at inria.fr (Nicolas Rougier) Date: Fri, 24 May 2013 14:09:07 +0200 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: <519F56CE.6000606@relativita.com> References: <519F56CE.6000606@relativita.com> Message-ID: <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> Works for me (numpy 1.7.1, osx 10.8.3): >>> import numpy as np >>> print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1) [[-0.55854737 -1.82631485]] >>> print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1)) [[ 0.40274243 -0.33922682]] Nicolas On May 24, 2013, at 2:02 PM, Emanuele Olivetti wrote: > import numpy as np > print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1) > print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), > size=np.int64(1)) From emanuele at relativita.com Fri May 24 08:59:12 2013 From: emanuele at relativita.com (Emanuele Olivetti) Date: Fri, 24 May 2013 14:59:12 +0200 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> Message-ID: <519F6420.70609@relativita.com> Interesting. Anyone able to reproduce what I observe? 
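Until the version difference is pinned down, a workaround on the affected versions is to coerce the size argument to a builtin int before the call; a sketch:

import numpy as np

n = np.int64(1)  # e.g. a count produced by another numpy computation
sample = np.random.multivariate_normal(mean=np.zeros(2),
                                       cov=np.eye(2),
                                       size=int(n))  # int() avoids the IndexError
sample.shape  # -> (1, 2)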
From Nicolas.Rougier at inria.fr  Fri May 24 08:09:07 2013
From: Nicolas.Rougier at inria.fr (Nicolas Rougier)
Date: Fri, 24 May 2013 14:09:07 +0200
Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument
In-Reply-To: <519F56CE.6000606@relativita.com>
References: <519F56CE.6000606@relativita.com>
Message-ID: <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr>

Works for me (numpy 1.7.1, osx 10.8.3):

>>> import numpy as np
>>> print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)
[[-0.55854737 -1.82631485]]
>>> print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1))
[[ 0.40274243 -0.33922682]]

Nicolas

On May 24, 2013, at 2:02 PM, Emanuele Olivetti wrote:

> import numpy as np
> print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)
> print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2),
> size=np.int64(1))

From emanuele at relativita.com  Fri May 24 08:59:12 2013
From: emanuele at relativita.com (Emanuele Olivetti)
Date: Fri, 24 May 2013 14:59:12 +0200
Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument
In-Reply-To: <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr>
References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr>
Message-ID: <519F6420.70609@relativita.com>

Interesting. Anyone able to reproduce what I observe?

Emanuele

On 05/24/2013 02:09 PM, Nicolas Rougier wrote:
> Works for me (numpy 1.7.1, osx 10.8.3):
[...]

From warren.weckesser at gmail.com  Fri May 24 09:09:47 2013
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Fri, 24 May 2013 09:09:47 -0400
Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument
In-Reply-To: <519F6420.70609@relativita.com>
References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com>
Message-ID: 

On 5/24/13, Emanuele Olivetti wrote:
> Interesting. Anyone able to reproduce what I observe?

Yes.  I'm also using Ubuntu 12.04.  With numpy 1.6.1, I get the same
error, but it works fine with numpy 1.7.1.

Warren

From daniele at grinta.net  Fri May 24 09:11:42 2013
From: daniele at grinta.net (Daniele Nicolodi)
Date: Fri, 24 May 2013 15:11:42 +0200
Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument
In-Reply-To: <519F56CE.6000606@relativita.com>
References: <519F56CE.6000606@relativita.com>
Message-ID: <519F670E.5080506@grinta.net>

On 24/05/2013 14:02, Emanuele Olivetti wrote:
> Hi,
>
> I'm using NumPy v1.6.1 shipped with Ubuntu 12.04 (Python 2.7.3). I observed an
> odd behavior of the multivariate_normal function, which does not like int64 for
> the 'size' argument.
> Short example: > """ > import numpy as np > print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1) > print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), > size=np.int64(1)) > """ > > Which outputs: > > """ > $ python2.7 mvn_bug.py > [[ 0.28880655 0.43289446]] > Traceback (most recent call last): > File "mvn_bug.py", line 3, in > print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), > size=np.int64(1)) > File "mtrand.pyx", line 3990, in mtrand.RandomState.multivariate_normal > (numpy/random/mtrand/mtrand.c:16663) > IndexError: invalid index to scalar variable. > """ Hello Emanuele, it works for me with numpy 1.6.2 (python 2.7.3 on mac os x 10.6 via macports but is should not matter). Cheers, Daniele From p.j.a.cock at googlemail.com Fri May 24 09:12:32 2013 From: p.j.a.cock at googlemail.com (Peter Cock) Date: Fri, 24 May 2013 14:12:32 +0100 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: <519F6420.70609@relativita.com> References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com> Message-ID: On Fri, May 24, 2013 at 1:59 PM, Emanuele Olivetti wrote: > Interesting. Anyone able to reproduce what I observe? > > Emanuele Yes, I can reproduce this IndexError under Mac OS X: $ which python2.7 /usr/bin/python2.7 $ python2.7 Python 2.7.2 (default, Oct 11 2012, 20:14:37) [GCC 4.2.1 Compatible Apple Clang 4.0 (tags/Apple/clang-418.0.60)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> import numpy as np >>> print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1) [[ 0.68446902 1.84926031]] >>> print np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1)) Traceback (most recent call last): File "", line 1, in File "mtrand.pyx", line 3990, in mtrand.RandomState.multivariate_normal (numpy/random/mtrand/mtrand.c:16663) IndexError: invalid index to scalar variable. >>> np.__version__ '1.6.1' >>> quit() And on a more recent self-compiled Python and NumPy, $ which python3.3 /Users/pjcock/bin/python3.3 $ python3.3 Python 3.3.1 (default, Apr 8 2013, 17:54:08) [GCC 4.2.1 Compatible Apple Clang 4.0 ((tags/Apple/clang-421.0.57))] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> import numpy as np >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)) [[-0.57757621 1.09307893]] >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1))) Traceback (most recent call last): File "", line 1, in File "mtrand.pyx", line 4161, in mtrand.RandomState.multivariate_normal (numpy/random/mtrand/mtrand.c:19140) IndexError: invalid index to scalar variable. >>> np.__version__ '1.7.1' Peter From robert.kern at gmail.com Fri May 24 09:15:09 2013 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 24 May 2013 09:15:09 -0400 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com> Message-ID: On Fri, May 24, 2013 at 9:12 AM, Peter Cock wrote: > On Fri, May 24, 2013 at 1:59 PM, Emanuele Olivetti > wrote: >> Interesting. Anyone able to reproduce what I observe? 
>> >> Emanuele > > > Yes, I can reproduce this IndexError under Mac OS X: > > $ which python2.7 > /usr/bin/python2.7 > $ python2.7 > Python 2.7.2 (default, Oct 11 2012, 20:14:37) > [GCC 4.2.1 Compatible Apple Clang 4.0 (tags/Apple/clang-418.0.60)] on darwin > Type "help", "copyright", "credits" or "license" for more information. Can everyone please report whether they have a 32-bit build of Python or a 64-bit build? That's probably the most relevant factor. -- Robert Kern From daniele at grinta.net Fri May 24 09:27:22 2013 From: daniele at grinta.net (Daniele Nicolodi) Date: Fri, 24 May 2013 15:27:22 +0200 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: <519F670E.5080506@grinta.net> References: <519F56CE.6000606@relativita.com> <519F670E.5080506@grinta.net> Message-ID: <519F6ABA.1040009@grinta.net> On 24/05/2013 15:11, Daniele Nicolodi wrote: > Hello Emanuele, > > it works for me with numpy 1.6.2 (python 2.7.3 on mac os x 10.6 via > macports but is should not matter). 64-bit build Cheers, Daniele From p.j.a.cock at googlemail.com Fri May 24 09:36:30 2013 From: p.j.a.cock at googlemail.com (Peter Cock) Date: Fri, 24 May 2013 14:36:30 +0100 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com> Message-ID: On Fri, May 24, 2013 at 2:15 PM, Robert Kern wrote: > On Fri, May 24, 2013 at 9:12 AM, Peter Cock wrote: >> On Fri, May 24, 2013 at 1:59 PM, Emanuele Olivetti >> wrote: >>> Interesting. Anyone able to reproduce what I observe? >>> >>> Emanuele >> >> >> Yes, I can reproduce this IndexError under Mac OS X: >> >> $ which python2.7 >> /usr/bin/python2.7 >> $ python2.7 >> Python 2.7.2 (default, Oct 11 2012, 20:14:37) >> [GCC 4.2.1 Compatible Apple Clang 4.0 (tags/Apple/clang-418.0.60)] on darwin >> Type "help", "copyright", "credits" or "license" for more information. > > Can everyone please report whether they have a 32-bit build of Python > or a 64-bit build? That's probably the most relevant factor. It seems to affect all of 32 bit Windows XP, 64 bit Mac, 32 bit Linux, and 64 bit Linux for some versions of NumPy... Thus far the only non-failure I've seen is 64 bit Linux, Python 2.6.6 with NumPy 1.6.2 (other Python/NumPy installs on this machine do fail). Its a bit strange - I don't see any obvious pattern. Peter ------------------------------------------- Failures: My Python installs on this Mac all seem to be 64bit (and fail), $ python3.3 Python 3.3.1 (default, Apr 8 2013, 17:54:08) [GCC 4.2.1 Compatible Apple Clang 4.0 ((tags/Apple/clang-421.0.57))] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) 7fffffffffffffff True >>> import numpy as np >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)) [[ 1.80932387 0.85894164]] >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1))) Traceback (most recent call last): File "", line 1, in File "mtrand.pyx", line 4161, in mtrand.RandomState.multivariate_normal (numpy/random/mtrand/mtrand.c:19140) IndexError: invalid index to scalar variable. >>> np.__version__ '1.7.1' >>> quit() This also affects NumPy 1.5 so this isn't a recent regression: $ python3.2 Python 3.2 (r32:88445, Feb 28 2011, 17:04:33) [GCC 4.2.1 (Apple Inc. 
build 5664)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) 7fffffffffffffff True >>> import numpy as np >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)) [[ 1.11403341 -1.67856405]] >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1))) Traceback (most recent call last): File "", line 1, in File "mtrand.pyx", line 3954, in mtrand.RandomState.multivariate_normal (numpy/random/mtrand/mtrand.c:17234) IndexError: invalid index to scalar variable. >>> np.__version__ '1.5.0' $ python3.1 Python 3.1.2 (r312:79147, Nov 15 2010, 16:28:52) [GCC 4.2.1 (Apple Inc. build 5664)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) 7fffffffffffffff True >>> import numpy as np >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)) [[ 0.3834108 -0.31124203]] >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1))) Traceback (most recent call last): File "", line 1, in File "mtrand.pyx", line 3954, in mtrand.RandomState.multivariate_normal (numpy/random/mtrand/mtrand.c:17234) IndexError: invalid index to scalar variable. >>> np.__version__ '1.5.0' >>> quit() And on my 32 bit Windows XP box, Python 2.7 (r27:82525, Jul 4 2010, 09:01:59) [MSC v.1500 32 bit (Intel)] on win32 Type "help", "copyright", "credits" or "license" for more information. >>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) ('7fffffff', False) >>> import numpy as np >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)) [[-0.35072523 -0.58046885]] >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1))) Traceback (most recent call last): File "", line 1, in File "mtrand.pyx", line 3954, in mtrand.RandomState.multivariate_normal (numpy\random\mtrand\mtrand.c:17234) IndexError: invalid index to scalar variable. >>> np.__version__ '1.5.0' >>> Python 3.3.0 (v3.3.0:bd8afb90ebf2, Sep 29 2012, 10:55:48) [MSC v.1600 32 bit (Intel)] on win32 Type "help", "copyright", "credits" or "license" for more information. >>> import numpy as np >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)) [[-0.00453374 0.2210342 ]] >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1))) Traceback (most recent call last): File "", line 1, in File "mtrand.pyx", line 4142, in mtrand.RandomState.multivariate_normal (numpy\random\mtrand\mtrand.c:19128) IndexError: invalid index to scalar variable. >>> >>> np.__version__ '1.7.0rc2' Here's a couple of runs from an old 32 bit Linux machine which also shows the problem: $ python2.7 Python 2.7 (r27:82500, Nov 12 2010, 14:19:08) [GCC 4.1.2 20070626 (Red Hat 4.1.2-13)] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) ('7fffffff', False) >>> import numpy as np >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)) [[ 0.40887932 -0.09872295]] >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1))) Traceback (most recent call last): File "", line 1, in File "mtrand.pyx", line 3952, in mtrand.RandomState.multivariate_normal (numpy/random/mtrand/mtrand.c:17027) IndexError: invalid index to scalar variable. 
>>> np.__version__ '1.4.1' >>> quit() $ python3.3 Python 3.3.0 (default, Oct 31 2012, 13:19:08) [GCC 4.1.2 20070626 (Red Hat 4.1.2-13)] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) 7fffffff False >>> import numpy as np >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)) [[-0.16796654 -1.71312415]] >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1))) Traceback (most recent call last): File "", line 1, in File "mtrand.pyx", line 4127, in mtrand.RandomState.multivariate_normal (numpy/random/mtrand/mtrand.c:18657) IndexError: invalid index to scalar variable. >>> np.__version__ '1.7.0b2' >>> quit() And a more up to date 64 bit Linux machine where this fails on some versions of NumPy: $ python3.3 Python 3.3.1 (default, Apr 8 2013, 14:38:06) [GCC 4.4.6 20120305 (Red Hat 4.4.6-4)] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) 7fffffffffffffff True >>> import numpy as np >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)) [[-0.18862683 -0.92177665]] >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1))) Traceback (most recent call last): File "", line 1, in File "mtrand.pyx", line 4127, in mtrand.RandomState.multivariate_normal (numpy/random/mtrand/mtrand.c:18657) IndexError: invalid index to scalar variable. >>> np.__version__ '1.7.0b2' >>> quit() ------------------------------------------- Successes ------------------------------------------- 64 bit Linux: $ python2.6 Python 2.6.6 (r266:84292, Sep 11 2012, 08:34:23) [GCC 4.4.6 20120305 (Red Hat 4.4.6-4)] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) ('7fffffffffffffff', True) >>> import numpy as np >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=1)) [[-0.27469218 -2.12911784]] >>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), size=np.int64(1))) [[ 0.02609307 0.32485211]] >>> np.__version__ '1.6.2' >>> quit() Peter From warren.weckesser at gmail.com Fri May 24 09:47:30 2013 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Fri, 24 May 2013 09:47:30 -0400 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com> Message-ID: On 5/24/13, Peter Cock wrote: > On Fri, May 24, 2013 at 2:15 PM, Robert Kern wrote: >> On Fri, May 24, 2013 at 9:12 AM, Peter Cock >> wrote: >>> On Fri, May 24, 2013 at 1:59 PM, Emanuele Olivetti >>> wrote: >>>> Interesting. Anyone able to reproduce what I observe? >>>> >>>> Emanuele >>> >>> >>> Yes, I can reproduce this IndexError under Mac OS X: >>> >>> $ which python2.7 >>> /usr/bin/python2.7 >>> $ python2.7 >>> Python 2.7.2 (default, Oct 11 2012, 20:14:37) >>> [GCC 4.2.1 Compatible Apple Clang 4.0 (tags/Apple/clang-418.0.60)] on >>> darwin >>> Type "help", "copyright", "credits" or "license" for more information. >> >> Can everyone please report whether they have a 32-bit build of Python >> or a 64-bit build? That's probably the most relevant factor. > > It seems to affect all of 32 bit Windows XP, 64 bit Mac, 32 bit Linux, > and 64 bit Linux > for some versions of NumPy... 
Thus far the only non-failure I've seen > is 64 bit Linux, > Python 2.6.6 with NumPy 1.6.2 (other Python/NumPy installs on this > machine do fail). > > Its a bit strange - I don't see any obvious pattern. > > Peter > > ------------------------------------------- > > Failures: > > My Python installs on this Mac all seem to be 64bit (and fail), > > $ python3.3 > Python 3.3.1 (default, Apr 8 2013, 17:54:08) > [GCC 4.2.1 Compatible Apple Clang 4.0 ((tags/Apple/clang-421.0.57))] on > darwin > Type "help", "copyright", "credits" or "license" for more information. >>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) > 7fffffffffffffff True >>>> import numpy as np >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=1)) > [[ 1.80932387 0.85894164]] >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=np.int64(1))) > Traceback (most recent call last): > File "", line 1, in > File "mtrand.pyx", line 4161, in > mtrand.RandomState.multivariate_normal > (numpy/random/mtrand/mtrand.c:19140) > IndexError: invalid index to scalar variable. >>>> np.__version__ > '1.7.1' >>>> quit() > > This also affects NumPy 1.5 so this isn't a recent regression: > > $ python3.2 > Python 3.2 (r32:88445, Feb 28 2011, 17:04:33) > [GCC 4.2.1 (Apple Inc. build 5664)] on darwin > Type "help", "copyright", "credits" or "license" for more information. >>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) > 7fffffffffffffff True >>>> import numpy as np >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=1)) > [[ 1.11403341 -1.67856405]] >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=np.int64(1))) > Traceback (most recent call last): > File "", line 1, in > File "mtrand.pyx", line 3954, in > mtrand.RandomState.multivariate_normal > (numpy/random/mtrand/mtrand.c:17234) > IndexError: invalid index to scalar variable. >>>> np.__version__ > '1.5.0' > > $ python3.1 > Python 3.1.2 (r312:79147, Nov 15 2010, 16:28:52) > [GCC 4.2.1 (Apple Inc. build 5664)] on darwin > Type "help", "copyright", "credits" or "license" for more information. >>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) > 7fffffffffffffff True >>>> import numpy as np >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=1)) > [[ 0.3834108 -0.31124203]] >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=np.int64(1))) > Traceback (most recent call last): > File "", line 1, in > File "mtrand.pyx", line 3954, in > mtrand.RandomState.multivariate_normal > (numpy/random/mtrand/mtrand.c:17234) > IndexError: invalid index to scalar variable. >>>> np.__version__ > '1.5.0' >>>> quit() > > And on my 32 bit Windows XP box, > > Python 2.7 (r27:82525, Jul 4 2010, 09:01:59) [MSC v.1500 32 bit > (Intel)] on win32 > Type "help", "copyright", "credits" or "license" for more information. >>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) > ('7fffffff', False) >>>> import numpy as np >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=1)) > [[-0.35072523 -0.58046885]] >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=np.int64(1))) > Traceback (most recent call last): > File "", line 1, in > File "mtrand.pyx", line 3954, in > mtrand.RandomState.multivariate_normal > (numpy\random\mtrand\mtrand.c:17234) > IndexError: invalid index to scalar variable. 
>>>> np.__version__ > '1.5.0' >>>> > > Python 3.3.0 (v3.3.0:bd8afb90ebf2, Sep 29 2012, 10:55:48) [MSC v.1600 > 32 bit (Intel)] on win32 > Type "help", "copyright", "credits" or "license" for more information. >>>> import numpy as np >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=1)) > [[-0.00453374 0.2210342 ]] >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=np.int64(1))) > Traceback (most recent call last): > File "", line 1, in > File "mtrand.pyx", line 4142, in > mtrand.RandomState.multivariate_normal > (numpy\random\mtrand\mtrand.c:19128) > IndexError: invalid index to scalar variable. >>>> >>>> np.__version__ > '1.7.0rc2' > > Here's a couple of runs from an old 32 bit Linux machine which also > shows the problem: > > $ python2.7 > Python 2.7 (r27:82500, Nov 12 2010, 14:19:08) > [GCC 4.1.2 20070626 (Red Hat 4.1.2-13)] on linux2 > Type "help", "copyright", "credits" or "license" for more information. >>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) > ('7fffffff', False) >>>> import numpy as np >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=1)) > [[ 0.40887932 -0.09872295]] >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=np.int64(1))) > Traceback (most recent call last): > File "", line 1, in > File "mtrand.pyx", line 3952, in > mtrand.RandomState.multivariate_normal > (numpy/random/mtrand/mtrand.c:17027) > IndexError: invalid index to scalar variable. >>>> np.__version__ > '1.4.1' >>>> quit() > > $ python3.3 > Python 3.3.0 (default, Oct 31 2012, 13:19:08) > [GCC 4.1.2 20070626 (Red Hat 4.1.2-13)] on linux > Type "help", "copyright", "credits" or "license" for more information. >>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) > 7fffffff False >>>> import numpy as np >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=1)) > [[-0.16796654 -1.71312415]] >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=np.int64(1))) > Traceback (most recent call last): > File "", line 1, in > File "mtrand.pyx", line 4127, in > mtrand.RandomState.multivariate_normal > (numpy/random/mtrand/mtrand.c:18657) > IndexError: invalid index to scalar variable. >>>> np.__version__ > '1.7.0b2' >>>> quit() > > And a more up to date 64 bit Linux machine where this fails on > some versions of NumPy: > > $ python3.3 > Python 3.3.1 (default, Apr 8 2013, 14:38:06) > [GCC 4.4.6 20120305 (Red Hat 4.4.6-4)] on linux > Type "help", "copyright", "credits" or "license" for more information. >>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) > 7fffffffffffffff True >>>> import numpy as np >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=1)) > [[-0.18862683 -0.92177665]] >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=np.int64(1))) > Traceback (most recent call last): > File "", line 1, in > File "mtrand.pyx", line 4127, in > mtrand.RandomState.multivariate_normal > (numpy/random/mtrand/mtrand.c:18657) > IndexError: invalid index to scalar variable. >>>> np.__version__ > '1.7.0b2' >>>> quit() > > ------------------------------------------- > Successes > ------------------------------------------- > > 64 bit Linux: > > $ python2.6 > Python 2.6.6 (r266:84292, Sep 11 2012, 08:34:23) > [GCC 4.4.6 20120305 (Red Hat 4.4.6-4)] on linux2 > Type "help", "copyright", "credits" or "license" for more information. 
>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) > ('7fffffffffffffff', True) >>>> import numpy as np >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=1)) > [[-0.27469218 -2.12911784]] >>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>> size=np.int64(1))) > [[ 0.02609307 0.32485211]] >>>> np.__version__ > '1.6.2' >>>> quit() > > Peter: wow, that's a lot of tests! Two more data points: On Ubuntu 12.04, using 64 bit builds of Python 2.7.4 (from Anaconda 1.5.0), and numpy built from source: numpy 1.6.1 gives the error, but 1.6.2 does not. Warren > > Peter > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From p.j.a.cock at googlemail.com Fri May 24 09:56:28 2013 From: p.j.a.cock at googlemail.com (Peter Cock) Date: Fri, 24 May 2013 14:56:28 +0100 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com> Message-ID: On Fri, May 24, 2013 at 2:47 PM, Warren Weckesser wrote: > >Peter wrote: >> ------------------------------------------- >> Successes >> ------------------------------------------- >> >> 64 bit Linux: >> >> $ python2.6 >> Python 2.6.6 (r266:84292, Sep 11 2012, 08:34:23) >> [GCC 4.4.6 20120305 (Red Hat 4.4.6-4)] on linux2 >> Type "help", "copyright", "credits" or "license" for more information. >>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) >> ('7fffffffffffffff', True) >>>>> import numpy as np >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=1)) >> [[-0.27469218 -2.12911784]] >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=np.int64(1))) >> [[ 0.02609307 0.32485211]] >>>>> np.__version__ >> '1.6.2' >>>>> quit() >> > > Peter: wow, that's a lot of tests! I try to keep a broad range on hand for testing my own code. > Two more data points: > On Ubuntu 12.04, using 64 bit builds of Python 2.7.4 (from Anaconda > 1.5.0), and numpy built from source: numpy 1.6.1 gives the error, but > 1.6.2 does not. > > Warren That's interesting - and matches my only success being with NumPy 1.6.2 This suggests this was broken to up 1.6.1, but fixed in the 1.6.2 branch and not the 1.7 branch. Have anyone tried the current master branch? Peter From josef.pktd at gmail.com Fri May 24 10:00:47 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 24 May 2013 10:00:47 -0400 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com> Message-ID: On Fri, May 24, 2013 at 9:47 AM, Warren Weckesser wrote: > On 5/24/13, Peter Cock wrote: >> On Fri, May 24, 2013 at 2:15 PM, Robert Kern wrote: >>> On Fri, May 24, 2013 at 9:12 AM, Peter Cock >>> wrote: >>>> On Fri, May 24, 2013 at 1:59 PM, Emanuele Olivetti >>>> wrote: >>>>> Interesting. Anyone able to reproduce what I observe? 
>>>>> >>>>> Emanuele >>>> >>>> >>>> Yes, I can reproduce this IndexError under Mac OS X: >>>> >>>> $ which python2.7 >>>> /usr/bin/python2.7 >>>> $ python2.7 >>>> Python 2.7.2 (default, Oct 11 2012, 20:14:37) >>>> [GCC 4.2.1 Compatible Apple Clang 4.0 (tags/Apple/clang-418.0.60)] on >>>> darwin >>>> Type "help", "copyright", "credits" or "license" for more information. >>> >>> Can everyone please report whether they have a 32-bit build of Python >>> or a 64-bit build? That's probably the most relevant factor. >> >> It seems to affect all of 32 bit Windows XP, 64 bit Mac, 32 bit Linux, >> and 64 bit Linux >> for some versions of NumPy... Thus far the only non-failure I've seen >> is 64 bit Linux, >> Python 2.6.6 with NumPy 1.6.2 (other Python/NumPy installs on this >> machine do fail). >> >> Its a bit strange - I don't see any obvious pattern. >> >> Peter >> >> ------------------------------------------- >> >> Failures: >> >> My Python installs on this Mac all seem to be 64bit (and fail), >> >> $ python3.3 >> Python 3.3.1 (default, Apr 8 2013, 17:54:08) >> [GCC 4.2.1 Compatible Apple Clang 4.0 ((tags/Apple/clang-421.0.57))] on >> darwin >> Type "help", "copyright", "credits" or "license" for more information. >>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) >> 7fffffffffffffff True >>>>> import numpy as np >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=1)) >> [[ 1.80932387 0.85894164]] >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=np.int64(1))) >> Traceback (most recent call last): >> File "", line 1, in >> File "mtrand.pyx", line 4161, in >> mtrand.RandomState.multivariate_normal >> (numpy/random/mtrand/mtrand.c:19140) >> IndexError: invalid index to scalar variable. >>>>> np.__version__ >> '1.7.1' >>>>> quit() >> >> This also affects NumPy 1.5 so this isn't a recent regression: >> >> $ python3.2 >> Python 3.2 (r32:88445, Feb 28 2011, 17:04:33) >> [GCC 4.2.1 (Apple Inc. build 5664)] on darwin >> Type "help", "copyright", "credits" or "license" for more information. >>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) >> 7fffffffffffffff True >>>>> import numpy as np >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=1)) >> [[ 1.11403341 -1.67856405]] >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=np.int64(1))) >> Traceback (most recent call last): >> File "", line 1, in >> File "mtrand.pyx", line 3954, in >> mtrand.RandomState.multivariate_normal >> (numpy/random/mtrand/mtrand.c:17234) >> IndexError: invalid index to scalar variable. >>>>> np.__version__ >> '1.5.0' >> >> $ python3.1 >> Python 3.1.2 (r312:79147, Nov 15 2010, 16:28:52) >> [GCC 4.2.1 (Apple Inc. build 5664)] on darwin >> Type "help", "copyright", "credits" or "license" for more information. >>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) >> 7fffffffffffffff True >>>>> import numpy as np >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=1)) >> [[ 0.3834108 -0.31124203]] >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=np.int64(1))) >> Traceback (most recent call last): >> File "", line 1, in >> File "mtrand.pyx", line 3954, in >> mtrand.RandomState.multivariate_normal >> (numpy/random/mtrand/mtrand.c:17234) >> IndexError: invalid index to scalar variable. 
>>>>> np.__version__ >> '1.5.0' >>>>> quit() >> >> And on my 32 bit Windows XP box, >> >> Python 2.7 (r27:82525, Jul 4 2010, 09:01:59) [MSC v.1500 32 bit >> (Intel)] on win32 >> Type "help", "copyright", "credits" or "license" for more information. >>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) >> ('7fffffff', False) >>>>> import numpy as np >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=1)) >> [[-0.35072523 -0.58046885]] >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=np.int64(1))) >> Traceback (most recent call last): >> File "", line 1, in >> File "mtrand.pyx", line 3954, in >> mtrand.RandomState.multivariate_normal >> (numpy\random\mtrand\mtrand.c:17234) >> IndexError: invalid index to scalar variable. >>>>> np.__version__ >> '1.5.0' >>>>> >> >> Python 3.3.0 (v3.3.0:bd8afb90ebf2, Sep 29 2012, 10:55:48) [MSC v.1600 >> 32 bit (Intel)] on win32 >> Type "help", "copyright", "credits" or "license" for more information. >>>>> import numpy as np >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=1)) >> [[-0.00453374 0.2210342 ]] >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=np.int64(1))) >> Traceback (most recent call last): >> File "", line 1, in >> File "mtrand.pyx", line 4142, in >> mtrand.RandomState.multivariate_normal >> (numpy\random\mtrand\mtrand.c:19128) >> IndexError: invalid index to scalar variable. >>>>> >>>>> np.__version__ >> '1.7.0rc2' >> >> Here's a couple of runs from an old 32 bit Linux machine which also >> shows the problem: >> >> $ python2.7 >> Python 2.7 (r27:82500, Nov 12 2010, 14:19:08) >> [GCC 4.1.2 20070626 (Red Hat 4.1.2-13)] on linux2 >> Type "help", "copyright", "credits" or "license" for more information. >>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) >> ('7fffffff', False) >>>>> import numpy as np >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=1)) >> [[ 0.40887932 -0.09872295]] >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=np.int64(1))) >> Traceback (most recent call last): >> File "", line 1, in >> File "mtrand.pyx", line 3952, in >> mtrand.RandomState.multivariate_normal >> (numpy/random/mtrand/mtrand.c:17027) >> IndexError: invalid index to scalar variable. >>>>> np.__version__ >> '1.4.1' >>>>> quit() >> >> $ python3.3 >> Python 3.3.0 (default, Oct 31 2012, 13:19:08) >> [GCC 4.1.2 20070626 (Red Hat 4.1.2-13)] on linux >> Type "help", "copyright", "credits" or "license" for more information. >>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) >> 7fffffff False >>>>> import numpy as np >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=1)) >> [[-0.16796654 -1.71312415]] >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=np.int64(1))) >> Traceback (most recent call last): >> File "", line 1, in >> File "mtrand.pyx", line 4127, in >> mtrand.RandomState.multivariate_normal >> (numpy/random/mtrand/mtrand.c:18657) >> IndexError: invalid index to scalar variable. >>>>> np.__version__ >> '1.7.0b2' >>>>> quit() >> >> And a more up to date 64 bit Linux machine where this fails on >> some versions of NumPy: >> >> $ python3.3 >> Python 3.3.1 (default, Apr 8 2013, 14:38:06) >> [GCC 4.4.6 20120305 (Red Hat 4.4.6-4)] on linux >> Type "help", "copyright", "credits" or "license" for more information. 
>>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) >> 7fffffffffffffff True >>>>> import numpy as np >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=1)) >> [[-0.18862683 -0.92177665]] >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=np.int64(1))) >> Traceback (most recent call last): >> File "", line 1, in >> File "mtrand.pyx", line 4127, in >> mtrand.RandomState.multivariate_normal >> (numpy/random/mtrand/mtrand.c:18657) >> IndexError: invalid index to scalar variable. >>>>> np.__version__ >> '1.7.0b2' >>>>> quit() >> >> ------------------------------------------- >> Successes >> ------------------------------------------- >> >> 64 bit Linux: >> >> $ python2.6 >> Python 2.6.6 (r266:84292, Sep 11 2012, 08:34:23) >> [GCC 4.4.6 20120305 (Red Hat 4.4.6-4)] on linux2 >> Type "help", "copyright", "credits" or "license" for more information. >>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) >> ('7fffffffffffffff', True) >>>>> import numpy as np >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=1)) >> [[-0.27469218 -2.12911784]] >>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>> size=np.int64(1))) >> [[ 0.02609307 0.32485211]] >>>>> np.__version__ >> '1.6.2' >>>>> quit() >> >> > > > Peter: wow, that's a lot of tests! > > Two more data points: > On Ubuntu 12.04, using 64 bit builds of Python 2.7.4 (from Anaconda > 1.5.0), and numpy built from source: numpy 1.6.1 gives the error, but > 1.6.2 does not. I also get them on Windows 64 (gohlke binaries) python 3.2 with numpy 1.6.1 python 3.3 with numpy '1.7.0rc1' Josef > > Warren > > >> >> Peter >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From warren.weckesser at gmail.com Fri May 24 10:02:18 2013 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Fri, 24 May 2013 10:02:18 -0400 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com> Message-ID: On 5/24/13, Peter Cock wrote: > On Fri, May 24, 2013 at 2:47 PM, Warren Weckesser > wrote: >> >>Peter wrote: >>> ------------------------------------------- >>> Successes >>> ------------------------------------------- >>> >>> 64 bit Linux: >>> >>> $ python2.6 >>> Python 2.6.6 (r266:84292, Sep 11 2012, 08:34:23) >>> [GCC 4.4.6 20120305 (Red Hat 4.4.6-4)] on linux2 >>> Type "help", "copyright", "credits" or "license" for more information. >>>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) >>> ('7fffffffffffffff', True) >>>>>> import numpy as np >>>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>>> size=1)) >>> [[-0.27469218 -2.12911784]] >>>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>>> size=np.int64(1))) >>> [[ 0.02609307 0.32485211]] >>>>>> np.__version__ >>> '1.6.2' >>>>>> quit() >>> >> >> Peter: wow, that's a lot of tests! > > I try to keep a broad range on hand for testing my own code. 
> >> Two more data points: >> On Ubuntu 12.04, using 64 bit builds of Python 2.7.4 (from Anaconda >> 1.5.0), and numpy built from source: numpy 1.6.1 gives the error, but >> 1.6.2 does not. >> >> Warren > > That's interesting - and matches my only success being with NumPy 1.6.2 > > This suggests this was broken to up 1.6.1, but fixed in the 1.6.2 branch > and not the 1.7 branch. Have anyone tried the current master branch? > Sorry, I should have repeated my earlier report about 1.7.1. My current summary (all using 64 bit python 2.7.4 from Anaconda 1.5): numpy 1.6.1 (built from source) fails. numpy 1.6.2 (built from source) succeeds. numpy 1.7.1 (Anaconda package) succeeds. Warren > Peter > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From rhattersley at gmail.com Fri May 24 10:12:32 2013 From: rhattersley at gmail.com (Richard Hattersley) Date: Fri, 24 May 2013 15:12:32 +0100 Subject: [Numpy-discussion] Parameterised dtypes Message-ID: Hi all, I'm in the process of defining some new dtypes to handle non-physical calendars (such as the 360-day calendar used in the climate modelling world). This is all going fine[*] so far, but I'd like to know a little bit more about how much is ultimately possible. The PyArray_Descr members `metadata` and `c_metadata` allow run-time parametrisation, but is it possible to hook into the dtype('...') parsing mechanism to supply those parameters? Or is there some other dtype mechanism for supplying parameters? As an example, would it be possible to supply month lengths? >>> a = np.zeros(n, dtype='my_date[34,33,31,30,30,29,29,30,31,32,34,35]') Or is the intended use of parametrisation more like: >>> weird = my_stuff.make_dtype([34,33,31,30,30,29,29,30,31,32,34,35]) >>> a = np.zeros(n, dtype=weird) [*] The docs could do with updating, and the examples would benefit from standardising (or at least explaining the significance of the differences). I intend to post updates where possible. Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at gmail.com Fri May 24 10:33:14 2013 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Fri, 24 May 2013 10:33:14 -0400 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com> Message-ID: On 5/24/13, Warren Weckesser wrote: > On 5/24/13, Peter Cock wrote: >> On Fri, May 24, 2013 at 2:47 PM, Warren Weckesser >> wrote: >>> >>>Peter wrote: >>>> ------------------------------------------- >>>> Successes >>>> ------------------------------------------- >>>> >>>> 64 bit Linux: >>>> >>>> $ python2.6 >>>> Python 2.6.6 (r266:84292, Sep 11 2012, 08:34:23) >>>> [GCC 4.4.6 20120305 (Red Hat 4.4.6-4)] on linux2 >>>> Type "help", "copyright", "credits" or "license" for more information. >>>>>>> import sys;print("%x" % sys.maxsize, sys.maxsize > 2**32) >>>> ('7fffffffffffffff', True) >>>>>>> import numpy as np >>>>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>>>> size=1)) >>>> [[-0.27469218 -2.12911784]] >>>>>>> print(np.random.multivariate_normal(mean=np.zeros(2), cov=np.eye(2), >>>>>>> size=np.int64(1))) >>>> [[ 0.02609307 0.32485211]] >>>>>>> np.__version__ >>>> '1.6.2' >>>>>>> quit() >>>> >>> >>> Peter: wow, that's a lot of tests! 
>> >> I try to keep a broad range on hand for testing my own code. >> >>> Two more data points: >>> On Ubuntu 12.04, using 64 bit builds of Python 2.7.4 (from Anaconda >>> 1.5.0), and numpy built from source: numpy 1.6.1 gives the error, but >>> 1.6.2 does not. >>> >>> Warren >> >> That's interesting - and matches my only success being with NumPy 1.6.2 >> >> This suggests this was broken to up 1.6.1, but fixed in the 1.6.2 branch >> and not the 1.7 branch. Have anyone tried the current master branch? >> > > Sorry, I should have repeated my earlier report about 1.7.1. My current > summary > (all using 64 bit python 2.7.4 from Anaconda 1.5): > > numpy 1.6.1 (built from source) fails. > numpy 1.6.2 (built from source) succeeds. > numpy 1.7.1 (Anaconda package) succeeds. > Latest summary (all on 64 bit Ubuntu 12.04, and all Numpy packages are built from source) 64 bit Python 2.7.4 (from Anaconda 1.5.0): numpy 1.6.1 fails. numpy 1.6.2 succeeds. numpy 1.7.0 succeeds. numpy 1.7.1 succeeds. 64 bit Python 3.3.1 (built from source): numpy 1.7.1 fails. Warren > Warren > >> Peter >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > From p.j.a.cock at googlemail.com Fri May 24 10:47:39 2013 From: p.j.a.cock at googlemail.com (Peter Cock) Date: Fri, 24 May 2013 15:47:39 +0100 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com> Message-ID: On Fri, May 24, 2013 at 3:02 PM, Warren Weckesser wrote: > On 5/24/13, Peter Cock wrote: >>Warren wrote: >>> Two more data points: >>> On Ubuntu 12.04, using 64 bit builds of Python 2.7.4 (from Anaconda >>> 1.5.0), and numpy built from source: numpy 1.6.1 gives the error, but >>> 1.6.2 does not. >>> >>> Warren >> >> That's interesting - and matches my only success being with NumPy 1.6.2 >> >> This suggests this was broken to up 1.6.1, but fixed in the 1.6.2 branch >> and not the 1.7 branch. Have anyone tried the current master branch? >> > > Sorry, I should have repeated my earlier report about 1.7.1. My current summary > (all using 64 bit python 2.7.4 from Anaconda 1.5): > > numpy 1.6.1 (built from source) fails. > numpy 1.6.2 (built from source) succeeds. > numpy 1.7.1 (Anaconda package) succeeds. > > Warren Was this the same numpy 1.7.1 you used earlier, or a different setup? 64 bit Linux machine, Python 2.7.4 compiled from source (recently), numpy freshly compiled from source today, making sure to remove the old numpy installation fully between installs - note run in this order, but I have tested some of these multiple times: numpy 1.4.1 (built from source) succeeds. numpy 1.5.1 (built from source) succeeds. numpy 1.6.0 (built from source) fails. numpy 1.6.1 (built from source) fails. numpy 1.6.2 (built from source) succeeds. numpy 1.7.0b2 (built from source) succeeds. numpy 1.7.0 (built from source) succeeds. numpy 1.7.1 (built from source) succeeds. numpy 1.8.0.dev-e11cd9b (built from source) succeeds. That all looks nice and neat, but according to my notes (and the earlier email) the old numpy 1.4.1 on this machine was failing. (I've erased that install now, but it was likely built with a older Python 2.7.x and/or slightly older gcc). 
Peter

From warren.weckesser at gmail.com  Fri May 24 11:56:28 2013
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Fri, 24 May 2013 11:56:28 -0400
Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument
In-Reply-To: 
References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com>
Message-ID: 

On 5/24/13, Peter Cock wrote:
[...]
> Was this the same numpy 1.7.1 you used earlier, or a different setup?

With python 2.7.4, numpy 1.7.1 from Anaconda and built from source
succeeds.  But with python 3.3.1, numpy 1.7.1 built from source fails.

Warren

[...]

From rhattersley at gmail.com  Fri May 24 13:29:25 2013
From: rhattersley at gmail.com (Richard Hattersley)
Date: Fri, 24 May 2013 18:29:25 +0100
Subject: [Numpy-discussion] Parameterised dtypes
In-Reply-To: 
References: 
Message-ID: 

On 24 May 2013 15:12, Richard Hattersley wrote:

> Or is the intended use of parametrisation more like:
> >>> weird = my_stuff.make_dtype([34,33,31,30,30,29,29,30,31,32,34,35])
> >>> a = np.zeros(n, dtype=weird)

Or to put it another way .... I have a working `make_dtype` function
(which could easily be extended to do dtype caching), but is that the
right way to go about things?

Richard
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
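As a concrete illustration of the factory approach discussed above, a
minimal cached `make_dtype` might look as follows (an editorial sketch,
not Richard's actual implementation: the cache, and the use of NumPy's
largely undocumented `metadata` argument to parametrise a plain int64,
are assumptions; a real 'my_date' dtype would be registered through the
C API instead):

import numpy as np

_dtype_cache = {}

def make_dtype(month_lengths):
    # Key the cache on the (hashable) parameter tuple so repeated
    # requests for the same calendar return the identical dtype object.
    key = tuple(month_lengths)
    if key not in _dtype_cache:
        # Attach the calendar parameters to an ordinary int64 via the
        # dtype metadata dictionary.
        _dtype_cache[key] = np.dtype(np.int64,
                                     metadata={'month_lengths': key})
    return _dtype_cache[key]

weird = make_dtype([34, 33, 31, 30, 30, 29, 29, 30, 31, 32, 34, 35])
a = np.zeros(5, dtype=weird)
print(weird.metadata['month_lengths'])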
From cgohlke at uci.edu  Fri May 24 15:09:21 2013
From: cgohlke at uci.edu (Christoph Gohlke)
Date: Fri, 24 May 2013 12:09:21 -0700
Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument
In-Reply-To: 
References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com>
Message-ID: <519FBAE1.2080107@uci.edu>

On 5/24/2013 7:00 AM, josef.pktd at gmail.com wrote:
[...]
> > I also get them on Windows 64 (gohlke binaries)
> python 3.2 with numpy 1.6.1
> python 3.3 with numpy '1.7.0rc1'
>
> Josef

>>
>> Warren
>>
>>
>>>
>>> Peter

This patch works for me:

diff --git a/numpy/random/mtrand/mtrand.pyx b/numpy/random/mtrand/mtrand.pyx
index b0de560..233ff52 100644
--- a/numpy/random/mtrand/mtrand.pyx
+++ b/numpy/random/mtrand/mtrand.pyx
@@ -4156,7 +4156,7 @@ cdef class RandomState:
         if mean.shape[0] != cov.shape[0]:
             raise ValueError("mean and cov must have same length")
         # Compute shape of output
-        if isinstance(shape, int):
+        if isinstance(shape, (int, long, np.integer)):
             shape = [shape]
         final_shape = list(shape[:])
         final_shape.append(mean.shape[0])

--
Christoph

From sudheer.joseph at yahoo.com  Sat May 25 08:19:58 2013
From: sudheer.joseph at yahoo.com (Sudheer Joseph)
Date: Sat, 25 May 2013 20:19:58 +0800 (SGT)
Subject: [Numpy-discussion] array manupulation
Message-ID: <1369484398.86666.YahooMailNeo@web193406.mail.sg3.yahoo.com>

Dear experts,
I can do the below operation in matlab but I wanted to do the same with a
python numpy array. I tried np.vstack and np.concatenate but am not getting
the desired results. Please help. I wanted to add a row of zeros to the
beginning of a numpy array. What is the available quick way?

>> cc=[1:10;1:10]

cc =
     1     2     3     4     5     6     7     8     9    10
     1     2     3     4     5     6     7     8     9    10
>> cc=[cc(1,:)*0;cc]
cc =
     0     0     0     0     0     0     0     0     0     0
     1     2     3     4     5     6     7     8     9    10
     1     2     3     4     5     6     7     8     9    10

with best regards,
Sudheer

***************************************************************
Sudheer Joseph
Indian National Centre for Ocean Information Services
Ministry of Earth Sciences, Govt. of India
POST BOX NO: 21, IDA Jeedeemetla P.O.
Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55
Tel:+91-40-23886047(O),Fax:+91-40-23895011(O),
Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile)
E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com
Web- http://oppamthadathil.tripod.com
***************************************************************

From cournape at gmail.com  Sat May 25 10:23:25 2013
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 25 May 2013 15:23:25 +0100
Subject: [Numpy-discussion] NumPy sprints at Scipy 2013, Austin: call for topics and hands to help
Message-ID:

Hi there,

I agreed to help organising NumPy sprints during the scipy 2013
conference in Austin.

As some of you may know, Stéfan and me will present a tutorial on
NumPy C code, so if we do our job correctly, we should have a few new
people ready to help out during the sprints.

It would be good to:
 - have some focus topics for improvements
 - know who is going to be there at the sprint to work on things
   and/or help newcomers

Things I'd like to work on myself is looking into splitting things
from multiarray, think about a better internal API for dtype
registration/hooks (with the goal to remove any date dtype hardcoding
in both multiarray and ufunc machinery), but I am sure others have
more interesting ideas :)

thanks,
David

From pierre.raybaut at gmail.com  Sat May 25 10:31:11 2013
From: pierre.raybaut at gmail.com (Pierre Raybaut)
Date: Sat, 25 May 2013 16:31:11 +0200
Subject: [Numpy-discussion] ANN: New WinPython with Python 2.7.5 and 3.3.2 (32/64bit)
Message-ID:

Hi all,

I am pleased to announce that four new versions of WinPython have been
released yesterday with Python 2.7.5 and 3.3.2, 32 and 64 bits.
Many packages have been added or upgraded. Special thanks to Christoph
Gohlke for building most of the binary packages bundled in WinPython.

WinPython is a free open-source portable distribution of Python for
Windows, designed for scientists.

It is a full-featured (see
http://code.google.com/p/winpython/wiki/PackageIndex) Python-based
scientific environment:
  * Designed for scientists (thanks to the integrated libraries NumPy,
    SciPy, Matplotlib, guiqwt, etc.):
      * Regular *scientific users*: interactive data processing and
        visualization using Python with Spyder
      * *Advanced scientific users and software developers*: Python
        applications development with Spyder, version control with
        Mercurial and other development tools (like gettext)
  * *Portable*: preconfigured, it should run out of the box on any
    machine under Windows (without any installation requirements) and
    the folder containing WinPython can be moved to any location
    (local, network or removable drive)
  * *Flexible*: one can install (or should I write "use" as it's
    portable) as many WinPython versions as necessary (like isolated
    and self-consistent environments), even if those versions are
    running different versions of Python (2.7, 3.x in the near future)
    or different architectures (32bit or 64bit) on the same machine
  * *Customizable*: using the integrated package manager (wppm, as
    WinPython Package Manager), it's possible to install, uninstall or
    upgrade Python packages (see
    http://code.google.com/p/winpython/wiki/WPPM for more details on
    supported package formats).

*WinPython is not an attempt to replace Python(x,y)*, this is just
something different (see http://code.google.com/p/winpython/wiki/Roadmap):
more flexible, easier to maintain, movable and less invasive for the OS,
but certainly less user-friendly, with less packages/contents and without
any integration to Windows explorer [*].

[*] Actually there is an optional integration into Windows explorer,
providing the same features as the official Python installer regarding
file associations and context menu entry (this option may be activated
through the WinPython Control Panel), and adding shortcuts to Windows
Start menu.

Enjoy!
-Pierre
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From charlesr.harris at gmail.com  Sat May 25 11:19:33 2013
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 25 May 2013 09:19:33 -0600
Subject: [Numpy-discussion] NumPy sprints at Scipy 2013, Austin: call for topics and hands to help
In-Reply-To:
References:
Message-ID:

On Sat, May 25, 2013 at 8:23 AM, David Cournapeau wrote:

> Hi there,
>
> I agreed to help organising NumPy sprints during the scipy 2013
> conference in Austin.
>
> As some of you may know, Stéfan and me will present a tutorial on
> NumPy C code, so if we do our job correctly, we should have a few new
> people ready to help out during the sprints.
>
> It would be good to:
> - have some focus topics for improvements
> - know who is going to be there at the sprint to work on things
> and/or help newcomers
>
> Things I'd like to work on myself is looking into splitting things
> from multiarray, think about a better internal API for dtype
> registration/hooks (with the goal to remove any date dtype hardcoding
> in both multiarray and ufunc machinery), but I am sure others have
> more interesting ideas :)
>
>
I'd like to get a 1.8 beta out or at least get to the point where we can
make that leap.
There is a lot of new stuff that needs to be tested, PR's to go through, and I have a suspicion that a memory allocation error might have crept in somewhere. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Sat May 25 11:51:24 2013 From: cournape at gmail.com (David Cournapeau) Date: Sat, 25 May 2013 16:51:24 +0100 Subject: [Numpy-discussion] NumPy sprints at Scipy 2013, Austin: call for topics and hands to help In-Reply-To: References: Message-ID: On Sat, May 25, 2013 at 4:19 PM, Charles R Harris wrote: > > > On Sat, May 25, 2013 at 8:23 AM, David Cournapeau > wrote: >> >> Hi there, >> >> I agreed to help organising NumPy sprints during the scipy 2013 >> conference in Austin. >> >> As some of you may know, St?fan and me will present a tutorial on >> NumPy C code, so if we do our job correctly, we should have a few new >> people ready to help out during the sprints. >> >> It would be good to: >> - have some focus topics for improvements >> - know who is going to be there at the sprint to work on things >> and/or help newcomers >> >> Things I'd like to work on myself is looking into splitting things >> from multiarray, think about a better internal API for dtype >> registration/hooks (with the goal to remove any date dtype hardcoding >> in both multiarray and ufunc machinery), but I am sure others have >> more interesting ideas :) >> > > I'd like to get a 1.8 beta out or at least get to the point where we can > make that leap. Sure, I am fine doing this in a branch post 1.8.x, I am not in a hurry. > There is a lot of new stuff that needs to be tested, PR's to > go through, and I have a suspicion that a memory allocation error might have > crept in somewhere. Will you be there at the conference ? David From charlesr.harris at gmail.com Sat May 25 12:37:09 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 25 May 2013 10:37:09 -0600 Subject: [Numpy-discussion] NumPy sprints at Scipy 2013, Austin: call for topics and hands to help In-Reply-To: References: Message-ID: On Sat, May 25, 2013 at 9:51 AM, David Cournapeau wrote: > On Sat, May 25, 2013 at 4:19 PM, Charles R Harris > wrote: > > > > > > On Sat, May 25, 2013 at 8:23 AM, David Cournapeau > > wrote: > >> > >> Hi there, > >> > >> I agreed to help organising NumPy sprints during the scipy 2013 > >> conference in Austin. > >> > >> As some of you may know, St?fan and me will present a tutorial on > >> NumPy C code, so if we do our job correctly, we should have a few new > >> people ready to help out during the sprints. > >> > >> It would be good to: > >> - have some focus topics for improvements > >> - know who is going to be there at the sprint to work on things > >> and/or help newcomers > >> > >> Things I'd like to work on myself is looking into splitting things > >> from multiarray, think about a better internal API for dtype > >> registration/hooks (with the goal to remove any date dtype hardcoding > >> in both multiarray and ufunc machinery), but I am sure others have > >> more interesting ideas :) > >> > > > > I'd like to get a 1.8 beta out or at least get to the point where we can > > make that leap. > > Sure, I am fine doing this in a branch post 1.8.x, I am not in a hurry. > > > There is a lot of new stuff that needs to be tested, PR's to > > go through, and I have a suspicion that a memory allocation error might > have > > crept in somewhere. > > Will you be there at the conference ? > Yes. I'm not very good at sprinting though. 
I prefer to amble with a big screen, nice keyboard, and a cup of coffee ;)

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From bryanv at continuum.io  Sat May 25 14:24:06 2013
From: bryanv at continuum.io (Bryan Van de Ven)
Date: Sat, 25 May 2013 13:24:06 -0500
Subject: [Numpy-discussion] array manupulation
In-Reply-To: <1369484398.86666.YahooMailNeo@web193406.mail.sg3.yahoo.com>
References: <1369484398.86666.YahooMailNeo@web193406.mail.sg3.yahoo.com>
Message-ID:

I'm not sure what you tried, but vstack will do what you are seeking:

In [7]: cc = np.ones((2,10)) * np.arange(1, 11)

In [8]: cc
Out[8]:
array([[  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.],
       [  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.]])

In [9]: np.vstack((np.zeros(10), cc))
Out[9]:
array([[  0.,   0.,   0.,   0.,   0.,   0.,   0.,   0.,   0.,   0.],
       [  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.],
       [  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.]])

Bryan

On May 25, 2013, at 7:19 AM, Sudheer Joseph wrote:

> Dear experts,
> I can do the below operation in matlab but I wanted to do the same with a
> python numpy array.
> I tried np.vstack and np.concatenate but am not getting the desired results.
> Please help. I wanted to add a row of zeros to the beginning of a numpy
> array. What is the available quick way?
>
>>> cc=[1:10;1:10]
>
> cc =
>      1     2     3     4     5     6     7     8     9    10
>      1     2     3     4     5     6     7     8     9    10
>>> cc=[cc(1,:)*0;cc]
> cc =
>      0     0     0     0     0     0     0     0     0     0
>      1     2     3     4     5     6     7     8     9    10
>      1     2     3     4     5     6     7     8     9    10
>
> with best regards,
> Sudheer
> ***************************************************************
> Sudheer Joseph
> Indian National Centre for Ocean Information Services
> Ministry of Earth Sciences, Govt. of India
> POST BOX NO: 21, IDA Jeedeemetla P.O.
> Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55
> Tel:+91-40-23886047(O),Fax:+91-40-23895011(O),
> Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile)
> E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com
> Web- http://oppamthadathil.tripod.com
> ***************************************************************
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From sudheer.joseph at yahoo.com  Sat May 25 20:08:28 2013
From: sudheer.joseph at yahoo.com (Sudheer Joseph)
Date: Sun, 26 May 2013 08:08:28 +0800 (SGT)
Subject: [Numpy-discussion] array manupulation
In-Reply-To:
References: <1369484398.86666.YahooMailNeo@web193406.mail.sg3.yahoo.com>
Message-ID: <1369526908.58295.YahooMailNeo@web193405.mail.sg3.yahoo.com>

Thank you Bryan,
                 I was missing the outer brackets!

with best regards,
Sudheer

***************************************************************
Sudheer Joseph
Indian National Centre for Ocean Information Services
Ministry of Earth Sciences, Govt. of India
POST BOX NO: 21, IDA Jeedeemetla P.O.
Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55
Tel:+91-40-23886047(O),Fax:+91-40-23895011(O),
Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile)
E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com
Web- http://oppamthadathil.tripod.com
***************************************************************

----- Original Message -----
> From: Bryan Van de Ven
> To: Discussion of Numerical Python
> Cc:
> Sent: Saturday, 25 May 2013 11:54 PM
> Subject: Re: [Numpy-discussion] array manupulation
>
> I'm not sure what you tried, but vstack will do what you are seeking:
>
> In [7]: cc = np.ones((2,10)) * np.arange(1, 11)
>
> In [8]: cc
> Out[8]:
> array([[  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.],
>        [  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.]])
>
> In [9]: np.vstack((np.zeros(10), cc))
> Out[9]:
> array([[  0.,   0.,   0.,   0.,   0.,   0.,   0.,   0.,   0.,   0.],
>        [  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.],
>        [  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.]])
>
> Bryan
>
> On May 25, 2013, at 7:19 AM, Sudheer Joseph wrote:
>
>> Dear experts,
>> I can do the below operation in matlab but I wanted to do the same with a
>> python numpy array.
>> I tried np.vstack and np.concatenate but am not getting the desired results.
>> Please help. I wanted to add a row of zeros to the beginning of a numpy
>> array. What is the available quick way?
>>
>>>> cc=[1:10;1:10]
>>
>> cc =
>>      1     2     3     4     5     6     7     8     9    10
>>      1     2     3     4     5     6     7     8     9    10
>>>> cc=[cc(1,:)*0;cc]
>> cc =
>>      0     0     0     0     0     0     0     0     0     0
>>      1     2     3     4     5     6     7     8     9    10
>>      1     2     3     4     5     6     7     8     9    10
>>
>> with best regards,
>> Sudheer
>> ***************************************************************
>> Sudheer Joseph
>> Indian National Centre for Ocean Information Services
>> Ministry of Earth Sciences, Govt. of India
>> POST BOX NO: 21, IDA Jeedeemetla P.O.
>> Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55
>> Tel:+91-40-23886047(O),Fax:+91-40-23895011(O),
>> Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile)
>> E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com
>> Web- http://oppamthadathil.tripod.com
>> ***************************************************************
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at scipy.org
>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From sudheer.joseph at yahoo.com  Sat May 25 20:55:35 2013
From: sudheer.joseph at yahoo.com (Sudheer Joseph)
Date: Sun, 26 May 2013 08:55:35 +0800 (SGT)
Subject: [Numpy-discussion] array manupulation
In-Reply-To:
References: <1369484398.86666.YahooMailNeo@web193406.mail.sg3.yahoo.com>
Message-ID: <1369529735.86699.YahooMailNeo@web193405.mail.sg3.yahoo.com>

Thank you Bryan,
                 Though the simple example worked for me, there appears to
be some logic here which I did not capture fully.

I have an 11x5 array named p from which I get a cumulative sum by doing the
operations below. I tried transposing the zero array to match the dimension
(after the first attempt failed) but had no luck. It appears I am missing
some change that happens when I take the cumulative sum along an axis. Can
you please advise me on what I am missing here?

Out[84]: p.shape
Out[85]: (11, 5)
c=p.cumsum(axis=0)

In [86]: c.shape
Out[86]: (11, 5)
In [87]: xx=np.zeros(11)
In [88]: xx
Out[88]: array([ 0.,  0.,  0.,  0.,  0.,  0.,  0.,  0.,  0.,  0.,  0.])
np.vstack((xx,c))
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
/media/SJOITB/SST_VAL/ in ()
----> 1 np.vstack((xx,c))

/usr/local/lib/python2.7/dist-packages/numpy-1.7.0-py2.7-linux-x86_64.egg/numpy/core/shape_base.pyc in vstack(tup)
    224
    225     """
--> 226     return _nx.concatenate(map(atleast_2d,tup),0)
    227
228 def hstack(tup): ValueError: all the input array dimensions except for the concatenation axis must match exactly ----- Original Message ----- > From: Bryan Van de Ven > To: Discussion of Numerical Python > Cc: > Sent: Saturday, 25 May 2013 11:54 PM > Subject: Re: [Numpy-discussion] array manupulation > > I'm not sure what you tried, but stack will do what you are seeking: > > In [7]: cc = np.ones((2,10)) * np.arange(1, 11) > > In [8]: cc > Out[8]: > array([[? 1.,? 2.,? 3.,? 4.,? 5.,? 6.,? 7.,? 8.,? 9.,? 10.], > ? ? ? [? 1.,? 2.,? 3.,? 4.,? 5.,? 6.,? 7.,? 8.,? 9.,? 10.]]) > > In [9]: np.vstack((np.zeros(10), cc)) > Out[9]: > array([[? 0.,? 0.,? 0.,? 0.,? 0.,? 0.,? 0.,? 0.,? 0.,? 0.], > ? ? ? [? 1.,? 2.,? 3.,? 4.,? 5.,? 6.,? 7.,? 8.,? 9.,? 10.], > ? ? ? [? 1.,? 2.,? 3.,? 4.,? 5.,? 6.,? 7.,? 8.,? 9.,? 10.]]) > > Bryan > > > On May 25, 2013, at 7:19 AM, Sudheer Joseph > wrote: > >> Dear experts, >> I can do below operation in matlab but I wanted to do the same in python > numpy array. >> I tried np.vstak? and np.concatenate but not getting the desired results. > please help. I wanted to add a row of zeros to the beginning of a numpy array. > What is? the available quick way. >> >>>> cc=[1:10;1:10] >> >> cc = >> ? ? ? 1? ? 2? ? 3? ? 4? ? 5? ? 6? ? 7? ? 8? ? 9? ? 10 >> ? ? ? 1? ? 2? ? 3? ? 4? ? 5? ? 6? ? 7? ? 8? ? 9? ? 10 >>>> cc=[cc(1,:)*0;cc] >> cc = >> ? ? ? 0? ? 0? ? 0? ? 0? ? 0? ? 0? ? 0? ? 0? ? 0? ? 0 >> ? ? ? 1? ? 2? ? 3? ? 4? ? 5? ? 6? ? 7? ? 8? ? 9? ? 10 >> ? ? ? 1? ? 2? ? 3? ? 4? ? 5? ? 6? ? 7? ? 8? ? 9? ? 10 >> >> wit best regards, >> Sudheer >> *************************************************************** >> Sudheer Joseph >> Indian National Centre for Ocean Information Services >> Ministry of Earth Sciences, Govt. of India >> POST BOX NO: 21, IDA Jeedeemetla P.O. >> Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55 >> Tel:+91-40-23886047(O),Fax:+91-40-23895011(O), >> Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile) >> E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com >> Web- http://oppamthadathil.tripod.com >> *************************************************************** >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From ben.root at ou.edu Sat May 25 20:59:00 2013 From: ben.root at ou.edu (Benjamin Root) Date: Sat, 25 May 2013 20:59:00 -0400 Subject: [Numpy-discussion] NumPy sprints at Scipy 2013, Austin: call for topics and hands to help In-Reply-To: References: Message-ID: On Sat, May 25, 2013 at 12:37 PM, Charles R Harris < charlesr.harris at gmail.com> wrote: > > > On Sat, May 25, 2013 at 9:51 AM, David Cournapeau wrote: > >> On Sat, May 25, 2013 at 4:19 PM, Charles R Harris >> wrote: >> > >> > >> > On Sat, May 25, 2013 at 8:23 AM, David Cournapeau >> > wrote: >> >> >> >> Hi there, >> >> >> >> I agreed to help organising NumPy sprints during the scipy 2013 >> >> conference in Austin. >> >> >> >> As some of you may know, St?fan and me will present a tutorial on >> >> NumPy C code, so if we do our job correctly, we should have a few new >> >> people ready to help out during the sprints. 
>> >> >> >> It would be good to: >> >> - have some focus topics for improvements >> >> - know who is going to be there at the sprint to work on things >> >> and/or help newcomers >> >> >> >> Things I'd like to work on myself is looking into splitting things >> >> from multiarray, think about a better internal API for dtype >> >> registration/hooks (with the goal to remove any date dtype hardcoding >> >> in both multiarray and ufunc machinery), but I am sure others have >> >> more interesting ideas :) >> >> >> > >> > I'd like to get a 1.8 beta out or at least get to the point where we can >> > make that leap. >> >> Sure, I am fine doing this in a branch post 1.8.x, I am not in a hurry. >> >> > There is a lot of new stuff that needs to be tested, PR's to >> > go through, and I have a suspicion that a memory allocation error might >> have >> > crept in somewhere. >> >> Will you be there at the conference ? >> > > Yes. I'm not very good at sprinting though. I prefer to amble with a big > screen, nice keyboard, and a cup of coffee ;) > > Chuck > > Oh, I am sure we could get you set up with a projector screen and a nice bluetooth keyboard... Now, I think you just hit on something with the coffee. I don't recall previous sprints having coffee available. Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sat May 25 22:27:31 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 25 May 2013 20:27:31 -0600 Subject: [Numpy-discussion] NumPy sprints at Scipy 2013, Austin: call for topics and hands to help In-Reply-To: References: Message-ID: On Sat, May 25, 2013 at 6:59 PM, Benjamin Root wrote: > > On Sat, May 25, 2013 at 12:37 PM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> >> >> On Sat, May 25, 2013 at 9:51 AM, David Cournapeau wrote: >> >>> On Sat, May 25, 2013 at 4:19 PM, Charles R Harris >>> wrote: >>> > >>> > >>> > On Sat, May 25, 2013 at 8:23 AM, David Cournapeau >>> > wrote: >>> >> >>> >> Hi there, >>> >> >>> >> I agreed to help organising NumPy sprints during the scipy 2013 >>> >> conference in Austin. >>> >> >>> >> As some of you may know, St?fan and me will present a tutorial on >>> >> NumPy C code, so if we do our job correctly, we should have a few new >>> >> people ready to help out during the sprints. >>> >> >>> >> It would be good to: >>> >> - have some focus topics for improvements >>> >> - know who is going to be there at the sprint to work on things >>> >> and/or help newcomers >>> >> >>> >> Things I'd like to work on myself is looking into splitting things >>> >> from multiarray, think about a better internal API for dtype >>> >> registration/hooks (with the goal to remove any date dtype hardcoding >>> >> in both multiarray and ufunc machinery), but I am sure others have >>> >> more interesting ideas :) >>> >> >>> > >>> > I'd like to get a 1.8 beta out or at least get to the point where we >>> can >>> > make that leap. >>> >>> Sure, I am fine doing this in a branch post 1.8.x, I am not in a hurry. >>> >>> > There is a lot of new stuff that needs to be tested, PR's to >>> > go through, and I have a suspicion that a memory allocation error >>> might have >>> > crept in somewhere. >>> >>> Will you be there at the conference ? >>> >> >> Yes. I'm not very good at sprinting though. I prefer to amble with a big >> screen, nice keyboard, and a cup of coffee ;) >> >> Chuck >> >> > Oh, I am sure we could get you set up with a projector screen and a nice > bluetooth keyboard... 
Now, I think you just hit on something with the > coffee. I don't recall previous sprints having coffee available. > Arrggghhhh, everyone would see my lousy code. I'd be exposed as the naked emporer I am ;) Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From sudheer.joseph at yahoo.com Sat May 25 22:30:24 2013 From: sudheer.joseph at yahoo.com (Sudheer Joseph) Date: Sun, 26 May 2013 10:30:24 +0800 (SGT) Subject: [Numpy-discussion] array manupulation In-Reply-To: <1369529735.86699.YahooMailNeo@web193405.mail.sg3.yahoo.com> References: <1369484398.86666.YahooMailNeo@web193406.mail.sg3.yahoo.com> <1369529735.86699.YahooMailNeo@web193405.mail.sg3.yahoo.com> Message-ID: <1369535424.86470.YahooMailNeo@web193402.mail.sg3.yahoo.com> Dear Brian, ? ? ? ? ? ? ? ? I even tried below but no luck! In [138]: xx=np.zeros(11) In [139]: xx Out[139]: array([ 0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0.]) In [147]: xx.shape Out[147]: (11,) In [140]: xx=np.array(xx)[np.newaxis] In [141]: xx.shape Out[141]: (1, 11) In [142]: xx=xx.T In [143]: xx.shape Out[143]: (11, 1) In [144]: csum.shape Out[144]: (11, 5) In [145]: np.vstack((xx,csum)) --------------------------------------------------------------------------- ValueError ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?Traceback (most recent call last) /media/SJOITB/SST_VAL/ in () ----> 1 np.vstack((xx,csum)) /usr/local/lib/python2.7/dist-packages/numpy-1.7.0-py2.7-linux-x86_64.egg/numpy/core/shape_base.pyc in vstack(tup) ? ? 224? ? ? 225 ? ? """ --> 226 ? ? return _nx.concatenate(map(atleast_2d,tup),0) ? ? 227? ? ? 228 def hstack(tup): ValueError: all the input array dimensions except for the concatenation axis must match exactly ? *************************************************************** Sudheer Joseph Indian National Centre for Ocean Information Services Ministry of Earth Sciences, Govt. of India POST BOX NO: 21, IDA Jeedeemetla P.O. Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55 Tel:+91-40-23886047(O),Fax:+91-40-23895011(O), Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile) E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com Web- http://oppamthadathil.tripod.com *************************************************************** ----- Original Message ----- > From: Sudheer Joseph > To: Discussion of Numerical Python > Cc: > Sent: Sunday, 26 May 2013 6:25 AM > Subject: Re: [Numpy-discussion] array manupulation > >T hank you Brian, > ? ? ? ? ? ? ? ? ? ? ?Though the simple example worked for me, there appears to > the some logic which I did not capture fully. > > I have a 11x5 array named p from which I get a cumulative sum by doing below > operations. I tried transposing the zero array to match the dimension( after > first failed) but had no luck. It appears I am missing some change happening > when I take the cumulative sum along newaxis. Can you please advice me on what I > am missing here?? > > Out[84]:p.shape > Out[85]: (11, 5) > c=p.cumsum(axis=0) > > In [86]: c.shape > Out[86]: (11, 5) > In [87]: xx=np.zeros(11) > In [88]: xx > > Out[88]: array([ 0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0.]) > np.vstack((xx,c)) > > --------------------------------------------------------------------------- > ValueError ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?Traceback (most recent call last) > /media/SJOITB/SST_VAL/ in () > ----> 1 np.vstack((xx,c)) > > /usr/local/lib/python2.7/dist-packages/numpy-1.7.0-py2.7-linux-x86_64.egg/numpy/core/shape_base.pyc > in vstack(tup) > ? ? 224? > ? ? 225 ? ? """ > --> 226 ? ? 
return _nx.concatenate(map(atleast_2d,tup),0) > ? ? 227? > ? ? 228 def hstack(tup): > > ValueError: all the input array dimensions except for the concatenation axis > must match exactly > > > ----- Original Message ----- >> From: Bryan Van de Ven >> To: Discussion of Numerical Python >> Cc: >> Sent: Saturday, 25 May 2013 11:54 PM >> Subject: Re: [Numpy-discussion] array manupulation >> >> I'm not sure what you tried, but stack will do what you are seeking: >> >> In [7]: cc = np.ones((2,10)) * np.arange(1, 11) >> >> In [8]: cc >> Out[8]: >> array([[? 1.,?? 2.,?? 3.,?? 4.,?? 5.,?? 6.,?? 7.,?? 8.,?? 9.,? 10.], >> ? ? ?? [? 1.,?? 2.,?? 3.,?? 4.,?? 5.,?? 6.,?? 7.,?? 8.,?? 9.,? 10.]]) >> >> In [9]: np.vstack((np.zeros(10), cc)) >> Out[9]: >> array([[? 0.,?? 0.,?? 0.,?? 0.,?? 0.,?? 0.,?? 0.,?? 0.,?? 0.,?? 0.], >> ? ? ?? [? 1.,?? 2.,?? 3.,?? 4.,?? 5.,?? 6.,?? 7.,?? 8.,?? 9.,? 10.], >> ? ? ?? [? 1.,?? 2.,?? 3.,?? 4.,?? 5.,?? 6.,?? 7.,?? 8.,?? 9.,? 10.]]) >> >> Bryan >> >> >> On May 25, 2013, at 7:19 AM, Sudheer Joseph > >> wrote: >> >>> ? Dear experts, >>> ? I can do below operation in matlab but I wanted to do the same in > python >> numpy array. >>> ? I tried np.vstak? and np.concatenate but not getting the desired > results. >> please help. I wanted to add a row of zeros to the beginning of a numpy > array. >> What is? the available quick way. >>> >>>>> ? cc=[1:10;1:10] >>> >>> ? cc = >>> ? ? ? 1? ?? 2? ?? 3? ?? 4? ?? 5? ?? 6? ?? 7? ?? 8? ?? 9? ? 10 >>> ? ? ? 1? ?? 2? ?? 3? ?? 4? ?? 5? ?? 6? ?? 7? ?? 8? ?? 9? ? 10 >>>>> ? cc=[cc(1,:)*0;cc] >>> ? cc = >>> ? ? ? 0? ?? 0? ?? 0? ?? 0? ?? 0? ?? 0? ?? 0? ?? 0? ?? 0? ?? 0 >>> ? ? ? 1? ?? 2? ?? 3? ?? 4? ?? 5? ?? 6? ?? 7? ?? 8? ?? 9? ? 10 >>> ? ? ? 1? ?? 2? ?? 3? ?? 4? ?? 5? ?? 6? ?? 7? ?? 8? ?? 9? ? 10 >>> >>> ? wit best regards, >>> ? Sudheer >>> ? *************************************************************** >>> ? Sudheer Joseph >>> ? Indian National Centre for Ocean Information Services >>> ? Ministry of Earth Sciences, Govt. of India >>> ? POST BOX NO: 21, IDA Jeedeemetla P.O. >>> ? Via Pragathi Nagar,Kukatpally, Hyderabad; Pin:5000 55 >>> ? Tel:+91-40-23886047(O),Fax:+91-40-23895011(O), >>> ? Tel:+91-40-23044600(R),Tel:+91-40-9440832534(Mobile) >>> ? E-mail:sjo.India at gmail.com;sudheer.joseph at yahoo.com >>> ? Web- http://oppamthadathil.tripod.com >>> ? *************************************************************** >>> ? _______________________________________________ >>> ? NumPy-Discussion mailing list >>> ? NumPy-Discussion at scipy.org >>> ? http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From rhattersley at gmail.com Sun May 26 03:52:38 2013 From: rhattersley at gmail.com (Richard Hattersley) Date: Sun, 26 May 2013 08:52:38 +0100 Subject: [Numpy-discussion] NumPy sprints at Scipy 2013, Austin: call for topics and hands to help In-Reply-To: References: Message-ID: Hi David, On 25 May 2013 15:23, David Cournapeau wrote: > As some of you may know, St?fan and me will present a tutorial on > NumPy C code, so if we do our job correctly, we should have a few new > people ready to help out during the sprints. > Is there any chance you'll be repeating this at EuroSciPy? 
Things I'd like to work on myself is looking into splitting things > from multiarray, think about a better internal API for dtype > registration/hooks (with the goal to remove any date dtype hardcoding > in both multiarray and ufunc machinery), but I am sure others have > more interesting ideas :) > I'm not able to get to SciPy so I understand if my vote of support doesn't count ;-), but I'm very interested in the work on the dtype API. And if it was on the radar for EuroSciPy there's a good chance I'd be able to help out. (The combination of a NumPy C tutorial and dtype API work would make a pretty compelling case for my managers.) Regards, Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From aronne.merrelli at gmail.com Sun May 26 04:43:56 2013 From: aronne.merrelli at gmail.com (Aronne Merrelli) Date: Sun, 26 May 2013 10:43:56 +0200 Subject: [Numpy-discussion] array manupulation In-Reply-To: <1369535424.86470.YahooMailNeo@web193402.mail.sg3.yahoo.com> References: <1369484398.86666.YahooMailNeo@web193406.mail.sg3.yahoo.com> <1369529735.86699.YahooMailNeo@web193405.mail.sg3.yahoo.com> <1369535424.86470.YahooMailNeo@web193402.mail.sg3.yahoo.com> Message-ID: On Sun, May 26, 2013 at 4:30 AM, Sudheer Joseph wrote: > Dear Brian, > I even tried below but no luck! > In [138]: xx=np.zeros(11) > In [139]: xx > Out[139]: array([ 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.]) > > In [147]: xx.shape > Out[147]: (11,) > In [140]: xx=np.array(xx)[np.newaxis] > In [141]: xx.shape > Out[141]: (1, 11) > In [142]: xx=xx.T > In [143]: xx.shape > Out[143]: (11, 1) > In [144]: csum.shape > Out[144]: (11, 5) > In [145]: np.vstack((xx,csum)) > --------------------------------------------------------------------------- > ValueError Traceback (most recent call last) > /media/SJOITB/SST_VAL/ in () > ----> 1 np.vstack((xx,csum)) > > /usr/local/lib/python2.7/dist-packages/numpy-1.7.0-py2.7-linux-x86_64.egg/numpy/core/shape_base.pyc > in vstack(tup) > 224 > 225 """ > --> 226 return _nx.concatenate(map(atleast_2d,tup),0) > 227 > 228 def hstack(tup): > > ValueError: all the input array dimensions except for the concatenation > axis must match exactly > > > You've transposed the arrays, so now you need to stack the other way. So, you need to use hstack to concatenate arrays with the same column length (first axis), or vstack to concatenate arrays with the same row length (second axis). For example: In [110]: xx1 = np.zeros((1,7)); cc1 = np.ones((3,7)) In [111]: xx2 = np.zeros((7,1)); cc2 = np.ones((7,3)) In [112]: np.vstack((xx1, cc1)) Out[112]: array([[ 0., 0., 0., 0., 0., 0., 0.], [ 1., 1., 1., 1., 1., 1., 1.], [ 1., 1., 1., 1., 1., 1., 1.], [ 1., 1., 1., 1., 1., 1., 1.]]) In [113]: np.hstack((xx2, cc2)) Out[113]: array([[ 0., 1., 1., 1.], [ 0., 1., 1., 1.], [ 0., 1., 1., 1.], [ 0., 1., 1., 1.], [ 0., 1., 1., 1.], [ 0., 1., 1., 1.], [ 0., 1., 1., 1.]]) Also, I would highly recommend studying the NumPy for MATLAB users guide: http://www.scipy.org/NumPy_for_Matlab_Users These issues (any many more) are discussed there. Cheers, Aronne -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From sudheer.joseph at yahoo.com Sun May 26 06:55:47 2013 From: sudheer.joseph at yahoo.com (Sudheer Joseph) Date: Sun, 26 May 2013 18:55:47 +0800 (SGT) Subject: [Numpy-discussion] array manupulation In-Reply-To: References: <1369484398.86666.YahooMailNeo@web193406.mail.sg3.yahoo.com> <1369529735.86699.YahooMailNeo@web193405.mail.sg3.yahoo.com> <1369535424.86470.YahooMailNeo@web193402.mail.sg3.yahoo.com> Message-ID: <1369565747.32843.YahooMailNeo@web193401.mail.sg3.yahoo.com> Thank you Aronne for the helping hand, ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? I tried the transpose as a check when I could not get it correct other way. I could do it with test arrays, but it appears some thing strange happens when I do the cumsum. So I am attaching here the csum as csum.npy array, where I face problem if your time permits please see what happens with this strange array.! In [1]: csum=np.load('csum.npy')?should get the array to you. This ?array is obtained by doing a? csum=np.cumsum(prcnt), which apparently doing some thing which I am not able to visualize. with best regards, Sudheer. >From:Aronne Merrelli >To:Discussion of Numerical Python >Sent:Sunday, 26 May 2013 2:13 PM >Subject:Re: [Numpy-discussion] array manupulation > > > > > >On Sun, May 26, 2013 at 4:30 AM, Sudheer Joseph wrote: > >Dear Brian, >>? ? ? ? ? ? ? ? I even tried below but no luck! >>In [138]: xx=np.zeros(11) >>In [139]: xx >>Out[139]: array([ 0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0.]) >> >>In [147]: xx.shape >>Out[147]: (11,) >>In [140]: xx=np.array(xx)[np.newaxis] >>In [141]: xx.shape >>Out[141]: (1, 11) >>In [142]: xx=xx.T >>In [143]: xx.shape >>Out[143]: (11, 1) >>In [144]: csum.shape >>Out[144]: (11, 5) >>In [145]: np.vstack((xx,csum)) >> >>--------------------------------------------------------------------------- >>ValueError ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?Traceback (most recent call last) >>/media/SJOITB/SST_VAL/ in () >>----> 1 np.vstack((xx,csum)) >> >> >>/usr/local/lib/python2.7/dist-packages/numpy-1.7.0-py2.7-linux-x86_64.egg/numpy/core/shape_base.pyc in vstack(tup) >>? ? 224? >>? ? 225 ? ? """ >>--> 226 ? ? return _nx.concatenate(map(atleast_2d,tup),0) >>? ? 227? >>? ? 228 def hstack(tup): >> >>ValueError: all the input array dimensions except for the concatenation axis must match exactly >> >>?? >> > > >You've transposed the arrays, so now you need to stack the other way. So, you need to use hstack to concatenate arrays with the same column length (first axis), or vstack to concatenate arrays with the same row length (second axis). For example: > > >In [110]: xx1 = np.zeros((1,7)); cc1 = np.ones((3,7)) > > >In [111]: xx2 = np.zeros((7,1)); cc2 = np.ones((7,3)) > > >In [112]: np.vstack((xx1, cc1)) >Out[112]:? >array([[ 0., ?0., ?0., ?0., ?0., ?0., ?0.], >? ? ? ?[ 1., ?1., ?1., ?1., ?1., ?1., ?1.], >? ? ? ?[ 1., ?1., ?1., ?1., ?1., ?1., ?1.], >? ? ? ?[ 1., ?1., ?1., ?1., ?1., ?1., ?1.]]) > > >In [113]: np.hstack((xx2, cc2)) >Out[113]:? >array([[ 0., ?1., ?1., ?1.], >? ? ? ?[ 0., ?1., ?1., ?1.], >? ? ? ?[ 0., ?1., ?1., ?1.], >? ? ? ?[ 0., ?1., ?1., ?1.], >? ? ? ?[ 0., ?1., ?1., ?1.], >? ? ? ?[ 0., ?1., ?1., ?1.], >? ? ? ?[ 0., ?1., ?1., ?1.]]) > > > > >Also, I would highly recommend studying the NumPy for MATLAB users guide: > > >http://www.scipy.org/NumPy_for_Matlab_Users > > > >These issues (any many more) are discussed there. 
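The shape rule at work in this thread, condensed into a minimal sketch (the zeros arrays here are stand-ins; the (11, 5) shape mirrors the csum array under discussion):

import numpy as np

csum = np.zeros((11, 5))             # stand-in for the attached csum.npy
row = np.zeros((1, csum.shape[1]))   # 2-D: one row, same number of columns
col = np.zeros((csum.shape[0], 1))   # 2-D: one column, same number of rows

print(np.vstack((row, csum)).shape)  # (12, 5): rows stack when columns match
print(np.hstack((col, csum)).shape)  # (11, 6): columns stack when rows match

# A bare np.zeros(11) has shape (11,); atleast_2d promotes it to (1, 11),
# which matches neither axis of (11, 5), hence the ValueError tracebacks.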
> > > > >Cheers, >Aronne >_______________________________________________ >NumPy-Discussion mailing list >NumPy-Discussion at scipy.org >http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > -------------- next part -------------- A non-text attachment was scrubbed... Name: csum.npy Type: application/octet-stream Size: 520 bytes Desc: not available URL: From charlesr.harris at gmail.com Sun May 26 13:51:54 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 26 May 2013 11:51:54 -0600 Subject: [Numpy-discussion] multivariate_normal issue with 'size' argument In-Reply-To: <519FBAE1.2080107@uci.edu> References: <519F56CE.6000606@relativita.com> <68CB3562-2A48-459F-9FFD-C879E39070CA@inria.fr> <519F6420.70609@relativita.com> <519FBAE1.2080107@uci.edu> Message-ID: On Fri, May 24, 2013 at 1:09 PM, Christoph Gohlke wrote: > This patch works for me: > > diff --git a/numpy/random/mtrand/mtrand.pyx > b/numpy/random/mtrand/mtrand.pyx > index b0de560..233ff52 100644 > --- a/numpy/random/mtrand/mtrand.pyx > +++ b/numpy/random/mtrand/mtrand.pyx > @@ -4156,7 +4156,7 @@ cdef class RandomState: > if mean.shape[0] != cov.shape[0]: > raise ValueError("mean and cov must have same length") > # Compute shape of output > - if isinstance(shape, int): > + if isinstance(shape, (int, long, np.integer)): > shape = [shape] > final_shape = list(shape[:]) > final_shape.append(mean.shape[0]) > > Looks right. Can you make a PR for this, maybe with a test that might otherwise fail for some architectures? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From shish at keba.be Sun May 26 17:52:13 2013 From: shish at keba.be (Olivier Delalleau) Date: Sun, 26 May 2013 17:52:13 -0400 Subject: [Numpy-discussion] array manupulation In-Reply-To: <1369565747.32843.YahooMailNeo@web193401.mail.sg3.yahoo.com> References: <1369484398.86666.YahooMailNeo@web193406.mail.sg3.yahoo.com> <1369529735.86699.YahooMailNeo@web193405.mail.sg3.yahoo.com> <1369535424.86470.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1369565747.32843.YahooMailNeo@web193401.mail.sg3.yahoo.com> Message-ID: Your array doesn't seem strange, it looks like a perfectly normal (11 x 5) matrix of dtype float64. >>> x = np.load('csum.npy') >>> np.vstack((np.zeros((1, x.shape[1])), x)) array([[ 0. , 0. , 0. , 0. , 0. ], [ 31.82571459, 29.0629995 , 27.74400711, 26.6248159 , 25.73787976], [ 59.82231014, 54.27656749, 51.87813602, 50.00937323, 48.51771275], [ 80.03460893, 73.46862838, 70.55710765, 68.412796 , 66.64323907], [ 91.12613011, 85.96434025, 83.34633829, 81.36538282, 79.70197141], [ 96.11498624, 93.00049572, 91.13864656, 89.61535722, 88.27247424], [ 98.22403322, 96.55379518, 95.43277035, 94.39550817, 93.42804 ], [ 99.14200421, 98.27546395, 97.64792507, 97.00438205, 96.3689249 ], [ 99.55954577, 99.10418687, 98.76971791, 98.39724171, 98.00386825], [ 99.76081882, 99.51702755, 99.33960611, 99.13057243, 98.9007987 ], [ 99.8617198 , 99.72882047, 99.63273748, 99.51539561, 99.38460995], [ 100. , 100. , 100. , 100. , 100. ]]) -=- Olivier 2013/5/26 Sudheer Joseph > Thank you Aronne for the helping hand, > I tried the transpose as a check > when I could not get it correct other way. I could do it with test arrays, > but it appears some thing strange happens when I do the cumsum. So I am > attaching here the csum as csum.npy array, where I face problem if your > time permits please see what happens with this strange array.! > > > In [1]: csum=np.load('csum.npy') should get the array to you. 
> > This array is obtained by doing a > csum=np.cumsum(prcnt), which apparently doing some thing which I am not > able to visualize. > > with best regards, > Sudheer. > > >From:Aronne Merrelli > >To:Discussion of Numerical Python > >Sent:Sunday, 26 May 2013 2:13 PM > >Subject:Re: [Numpy-discussion] array manupulation > > > > > > > > > > > >On Sun, May 26, 2013 at 4:30 AM, Sudheer Joseph > wrote: > > > >Dear Brian, > >> I even tried below but no luck! > >>In [138]: xx=np.zeros(11) > >>In [139]: xx > >>Out[139]: array([ 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.]) > >> > >>In [147]: xx.shape > >>Out[147]: (11,) > >>In [140]: xx=np.array(xx)[np.newaxis] > >>In [141]: xx.shape > >>Out[141]: (1, 11) > >>In [142]: xx=xx.T > >>In [143]: xx.shape > >>Out[143]: (11, 1) > >>In [144]: csum.shape > >>Out[144]: (11, 5) > >>In [145]: np.vstack((xx,csum)) > >> > > >>--------------------------------------------------------------------------- > >>ValueError Traceback (most recent call > last) > >>/media/SJOITB/SST_VAL/ in () > >>----> 1 np.vstack((xx,csum)) > >> > >> > >>/usr/local/lib/python2.7/dist-packages/numpy-1.7.0-py2.7-linux-x86_64.egg/numpy/core/shape_base.pyc > in vstack(tup) > >> 224 > >> 225 """ > >>--> 226 return _nx.concatenate(map(atleast_2d,tup),0) > >> 227 > >> 228 def hstack(tup): > >> > >>ValueError: all the input array dimensions except for the concatenation > axis must match exactly > >> > >> > >> > > > > > >You've transposed the arrays, so now you need to stack the other way. So, > you need to use hstack to concatenate arrays with the same column length > (first axis), or vstack to concatenate arrays with the same row length > (second axis). For example: > > > > > >In [110]: xx1 = np.zeros((1,7)); cc1 = np.ones((3,7)) > > > > > >In [111]: xx2 = np.zeros((7,1)); cc2 = np.ones((7,3)) > > > > > >In [112]: np.vstack((xx1, cc1)) > >Out[112]: > >array([[ 0., 0., 0., 0., 0., 0., 0.], > > [ 1., 1., 1., 1., 1., 1., 1.], > > [ 1., 1., 1., 1., 1., 1., 1.], > > [ 1., 1., 1., 1., 1., 1., 1.]]) > > > > > >In [113]: np.hstack((xx2, cc2)) > >Out[113]: > >array([[ 0., 1., 1., 1.], > > [ 0., 1., 1., 1.], > > [ 0., 1., 1., 1.], > > [ 0., 1., 1., 1.], > > [ 0., 1., 1., 1.], > > [ 0., 1., 1., 1.], > > [ 0., 1., 1., 1.]]) > > > > > > > > > >Also, I would highly recommend studying the NumPy for MATLAB users guide: > > > > > >http://www.scipy.org/NumPy_for_Matlab_Users > > > > > > > >These issues (any many more) are discussed there. > > > > > > > > > >Cheers, > >Aronne > >_______________________________________________ > >NumPy-Discussion mailing list > >NumPy-Discussion at scipy.org > >http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sudheer.joseph at yahoo.com Sun May 26 20:02:14 2013 From: sudheer.joseph at yahoo.com (Sudheer Joseph) Date: Mon, 27 May 2013 08:02:14 +0800 (SGT) Subject: [Numpy-discussion] array manupulation In-Reply-To: References: <1369484398.86666.YahooMailNeo@web193406.mail.sg3.yahoo.com> <1369529735.86699.YahooMailNeo@web193405.mail.sg3.yahoo.com> <1369535424.86470.YahooMailNeo@web193402.mail.sg3.yahoo.com> <1369565747.32843.YahooMailNeo@web193401.mail.sg3.yahoo.com> Message-ID: <1369612934.5248.YahooMailNeo@web193401.mail.sg3.yahoo.com> Thank you very much Oliver, >? ? ? ? ? 
It did not occurred to me that this can be done so simple with size of original array itself! >Thanks a lot. >with best regards, >Sudheer >From: Olivier Delalleau >To: Discussion of Numerical Python >Sent: Monday, 27 May 2013 3:22 AM >Subject: Re: [Numpy-discussion] array manupulation > > > >Your array doesn't seem strange, it looks like a perfectly normal (11 x 5) matrix of dtype float64. > >>>> x = np.load('csum.npy') >>>> np.vstack((np.zeros((1, x.shape[1])), x)) >array([[?? 0.??????? ,??? 0.??????? ,??? 0.??????? ,??? 0.??????? ,??? 0.??????? ], >?????? [? 31.82571459,?? 29.0629995 ,?? 27.74400711,?? 26.6248159 , >????????? 25.73787976], >?????? [? 59.82231014,?? 54.27656749,?? 51.87813602,?? 50.00937323, >????????? 48.51771275], >?????? [? 80.03460893,?? 73.46862838,?? 70.55710765,?? 68.412796? , >????????? 66.64323907], >?????? [? 91.12613011,?? 85.96434025,?? 83.34633829,?? 81.36538282, >????????? 79.70197141], >?????? [? 96.11498624,?? 93.00049572,?? 91.13864656,?? 89.61535722, >????????? 88.27247424], >?????? [? 98.22403322,?? 96.55379518,?? 95.43277035,?? 94.39550817, >????????? 93.42804?? ], >?????? [? 99.14200421,?? 98.27546395,?? 97.64792507,?? 97.00438205, >????????? 96.3689249 ], >?????? [? 99.55954577,?? 99.10418687,?? 98.76971791,?? 98.39724171, >????????? 98.00386825], >?????? [? 99.76081882,?? 99.51702755,?? 99.33960611,?? 99.13057243, >????????? 98.9007987 ], >?????? [? 99.8617198 ,?? 99.72882047,?? 99.63273748,?? 99.51539561, >????????? 99.38460995], >?????? [ 100.??????? ,? 100.??????? ,? 100.??????? ,? 100.??????? ,? 100.??????? ]]) > >-=- Olivier > > > > >2013/5/26 Sudheer Joseph > >Thank you Aronne for the helping hand, >>? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? I tried the transpose as a check when I could not get it correct other way. I could do it with test arrays, but it appears some thing strange happens when I do the cumsum. So I am attaching here the csum as csum.npy array, where I face problem if your time permits please see what happens with this strange array.! >> >> >>In [1]: csum=np.load('csum.npy')?should get the array to you. >> >>This ?array is obtained by doing a? >>csum=np.cumsum(prcnt), which apparently doing some thing which I am not able to visualize. >> >>with best regards, >>Sudheer. >> >>>From:Aronne Merrelli >> >>>To:Discussion of Numerical Python >>>Sent:Sunday, 26 May 2013 2:13 PM >> >>>Subject:Re: [Numpy-discussion] array manupulation >>> >>> >>> >>> >>> >> >>>On Sun, May 26, 2013 at 4:30 AM, Sudheer Joseph wrote: >>> >>>Dear Brian, >>>>? ? ? ? ? ? ? ? I even tried below but no luck! >>>>In [138]: xx=np.zeros(11) >>>>In [139]: xx >>>>Out[139]: array([ 0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0., ?0.]) >>>> >>>>In [147]: xx.shape >>>>Out[147]: (11,) >>>>In [140]: xx=np.array(xx)[np.newaxis] >>>>In [141]: xx.shape >>>>Out[141]: (1, 11) >>>>In [142]: xx=xx.T >>>>In [143]: xx.shape >>>>Out[143]: (11, 1) >>>>In [144]: csum.shape >>>>Out[144]: (11, 5) >>>>In [145]: np.vstack((xx,csum)) >>>> >>>>--------------------------------------------------------------------------- >>>>ValueError ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?Traceback (most recent call last) >>>>/media/SJOITB/SST_VAL/ in () >>>>----> 1 np.vstack((xx,csum)) >>>> >>>> >>>>/usr/local/lib/python2.7/dist-packages/numpy-1.7.0-py2.7-linux-x86_64.egg/numpy/core/shape_base.pyc in vstack(tup) >>>>? ? 224? >>>>? ? 225 ? ? """ >>>>--> 226 ? ? return _nx.concatenate(map(atleast_2d,tup),0) >>>>? ? 227? >>>>? ? 
228 def hstack(tup): >>>> >>>>ValueError: all the input array dimensions except for the concatenation axis must match exactly >>>> >>>>?? >>>> >>> >>> >>>You've transposed the arrays, so now you need to stack the other way. So, you need to use hstack to concatenate arrays with the same column length (first axis), or vstack to concatenate arrays with the same row length (second axis). For example: >>> >>> >>>In [110]: xx1 = np.zeros((1,7)); cc1 = np.ones((3,7)) >>> >>> >>>In [111]: xx2 = np.zeros((7,1)); cc2 = np.ones((7,3)) >>> >>> >>>In [112]: np.vstack((xx1, cc1)) >>>Out[112]:? >>>array([[ 0., ?0., ?0., ?0., ?0., ?0., ?0.], >>>? ? ? ?[ 1., ?1., ?1., ?1., ?1., ?1., ?1.], >>>? ? ? ?[ 1., ?1., ?1., ?1., ?1., ?1., ?1.], >>>? ? ? ?[ 1., ?1., ?1., ?1., ?1., ?1., ?1.]]) >>> >>> >>>In [113]: np.hstack((xx2, cc2)) >>>Out[113]:? >>>array([[ 0., ?1., ?1., ?1.], >>>? ? ? ?[ 0., ?1., ?1., ?1.], >>>? ? ? ?[ 0., ?1., ?1., ?1.], >>>? ? ? ?[ 0., ?1., ?1., ?1.], >>>? ? ? ?[ 0., ?1., ?1., ?1.], >>>? ? ? ?[ 0., ?1., ?1., ?1.], >>>? ? ? ?[ 0., ?1., ?1., ?1.]]) >>> >>> >>> >>> >>>Also, I would highly recommend studying the NumPy for MATLAB users guide: >>> >>> >>>http://www.scipy.org/NumPy_for_Matlab_Users >>> >>> >>> >>>These issues (any many more) are discussed there. >>> >>> >>> >>> >>>Cheers, >>>Aronne >> >>>_______________________________________________ >>>NumPy-Discussion mailing list >>>NumPy-Discussion at scipy.org >>>http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >>> >>_______________________________________________ >>NumPy-Discussion mailing list >>NumPy-Discussion at scipy.org >>http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > >_______________________________________________ >NumPy-Discussion mailing list >NumPy-Discussion at scipy.org >http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dineshbvadhia at hotmail.com Tue May 28 08:08:41 2013 From: dineshbvadhia at hotmail.com (Dinesh B Vadhia) Date: Tue, 28 May 2013 05:08:41 -0700 Subject: [Numpy-discussion] Unable to building numpy with openblas using bento or distutils Message-ID: Maybe this is for the Scipy list but followed the instructions but the command: $ sudo python setup.py install to install Scipy failed because it couldn't find Lapack. However, the export commands were set as: $ export BLAS=/usr/local/lib/libopenblas.a $ export LAPACK=/usr/local/lib/libopenblas.a $ export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/lib/ Further, the site.cfg file in the Numpy folder was copied to the Scipy folder. Numpy built and installed successfully. Scipy built (with $ python setup.py build) successfully. Any thoughts? On Sat, Mar 23, 2013 at 8:44 PM, Skipper Seabold wrote: > On Sat, Mar 23, 2013 at 7:26 PM, Ake Sandgren wrote: >> On Sat, 2013-03-23 at 14:19 -0400, Skipper Seabold wrote: >>> Some help on this would be greatly appreciated. It's been recommended >>> to use OpenBlas over ATLAS, so I've been trying to build numpy with >>> openblas and have run into a few problems. >> >>> >>> To truly support OpenBlas, is it maybe necessary to make some >>> additions to numpy/distutils/system_info.py? >> >> Here is how. >> >> https://github.com/akesandgren/numpy/commit/363339dd3a9826f3e3e7dc4248c258d3c4dfcd7c >> > > > Thanks that works well for numpy. Test pass. I hope that makes it into > a pull request. My site.cfg looks like this. I don't know about the > lapack_opt section. It doesn't seem to work. 
> > [DEFAULT] > library_dirs = /home/skipper/.local/lib > include_dirs = /home/skipper/.local/include > > [openblas] > libraries = openblas > > [lapack_opt] > libraries = openblas > > Do you have any idea how to get scipy working too. I have a similar > site.cfg, but it does not find lapack, which is rolled into > libopenblas from what I understand. I can do > > export LAPACK=~/.local/lib/libopenblas.a > python setup.py build &> build.log > sudo -E python setup.py install > > There are no obvious failures in the build.log, but scipy is still > broken because it needs lapack from numpy I guess. The answer is to export BLAS=~/.local/lib/libopenblas.a export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:~/.local/lib/ before building and installing. Now everything works. Whew. Thanks a lot for the help. > >>>> import numpy as np >>>> np.show_config() > lapack_info: > NOT AVAILABLE > atlas_threads_info: > NOT AVAILABLE > blas_opt_info: > libraries = ['openblas', 'openblas'] > library_dirs = ['/home/skipper/.local/lib'] > language = f77 > lapack_src_info: > NOT AVAILABLE > openblas_info: > libraries = ['openblas', 'openblas'] > library_dirs = ['/home/skipper/.local/lib'] > language = f77 > lapack_opt_info: > NOT AVAILABLE > atlas_info: > NOT AVAILABLE > lapack_mkl_info: > NOT AVAILABLE > blas_mkl_info: > NOT AVAILABLE > mkl_info: > NOT AVAILABLE >>>> from scipy import stats > Traceback (most recent call last): > File "", line 1, in > File "/usr/local/lib/python2.7/dist-packages/scipy/stats/__init__.py", > line 320, in > from .stats import * > File "/usr/local/lib/python2.7/dist-packages/scipy/stats/stats.py", > line 242, in > import scipy.linalg as linalg > File "/usr/local/lib/python2.7/dist-packages/scipy/linalg/__init__.py", > line 147, in > from .misc import * > File "/usr/local/lib/python2.7/dist-packages/scipy/linalg/misc.py", > line 5, in > from . import blas > File "/usr/local/lib/python2.7/dist-packages/scipy/linalg/blas.py", > line 113, in > from scipy.linalg import _fblas > ImportError: libopenblas.so.0: cannot open shared object file: No such > file or directory > > Skipper -------------- next part -------------- An HTML attachment was scrubbed... URL: From andrew.collette at gmail.com Tue May 28 14:47:15 2013 From: andrew.collette at gmail.com (Andrew Collette) Date: Tue, 28 May 2013 12:47:15 -0600 Subject: [Numpy-discussion] Parameterised dtypes In-Reply-To: References: Message-ID: Hi Richard, > I'm in the process of defining some new dtypes to handle non-physical > calendars (such as the 360-day calendar used in the climate modelling > world). This is all going fine[*] so far, but I'd like to know a little bit > more about how much is ultimately possible. > > The PyArray_Descr members `metadata` and `c_metadata` allow run-time > parametrisation, but is it possible to hook into the dtype('...') parsing > mechanism to supply those parameters? Or is there some other dtype mechanism > for supplying parameters? I would also love to know the answer to this question. Right now h5py is using field titles to hold metadata, because it's the most straightforward way I could find (with my limited NumPy-internals experience) to attach metadata that's preserved when dtypes are passed around, copied, etc. It works but is awkward. If there's a non-C way to deal with this that would be great. > [*] The docs could do with updating, and the examples would benefit from > standardising (or at least explaining the significance of the differences). > I intend to post updates where possible. 
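The field-titles trick mentioned above, in a minimal sketch (the units string is an arbitrary illustration, not an h5py convention):

import numpy as np

# A structured-dtype field name may be a (title, name) pair; the title is
# an arbitrary object that travels with the dtype when it is copied around.
dt = np.dtype([(("units: days since 2000-01-01", "time"), np.float64)])
a = np.zeros(3, dtype=dt)

# The title is recoverable as the third element of the fields entry.
print(a.dtype.fields["time"])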
Maybe a stupid question, but do you know a reference I could look at
for the metadata and c_metadata fields you described?

Andrew

From njs at pobox.com  Tue May 28 16:06:18 2013
From: njs at pobox.com (Nathaniel Smith)
Date: Tue, 28 May 2013 21:06:18 +0100
Subject: [Numpy-discussion] Parameterised dtypes
In-Reply-To:
References:
Message-ID:

On Fri, May 24, 2013 at 3:12 PM, Richard Hattersley wrote:
> Hi all,
>
> I'm in the process of defining some new dtypes to handle non-physical
> calendars (such as the 360-day calendar used in the climate modelling
> world). This is all going fine[*] so far, but I'd like to know a little bit
> more about how much is ultimately possible.
>
> The PyArray_Descr members `metadata` and `c_metadata` allow run-time
> parametrisation, but is it possible to hook into the dtype('...') parsing
> mechanism to supply those parameters? Or is there some other dtype mechanism
> for supplying parameters?
>
> As an example, would it be possible to supply month lengths?
>>>> a = np.zeros(n, dtype='my_date[34,33,31,30,30,29,29,30,31,32,34,35]')
>
> Or is the intended use of parametrisation more like:
>>>> weird = my_stuff.make_dtype([34,33,31,30,30,29,29,30,31,32,34,35])
>>>> a = np.zeros(n, dtype=weird)

I don't think there's any "intended use" really. AFAICT numpy was
originally designed with the assumption that there was a fixed set of
stateless dtypes, and then the ability to add new dtypes, the ability
to add state (parametrize them), etc., have gradually been kluged in
as needed to solve whatever immediate problem someone faced. E.g.,
dtype callbacks and ufuncs don't in general get access to the dtype
object, so they can't access whatever parameters exist, and the
builtin dtypes that *are* parametrized (strings, structs, etc.) all
have special case code scattered all around numpy.

You don't even need 'metadata' or 'c_metadata' -- this is Python, we
already have a totally standard way to add new fields, just subclass
the dumb thing. Instead we have this baffling system of 'NpyAuxData'
which invents its own tiny little refcounted object system from
scratch, and 'issubdtype' which invents its own way of representing
inheritance hierarchies independent of Python's object system, and so
forth.

Anyway!

1) No, you can't hook into the dtype string parser. Though, are you
sure you really want to? Surely it's nicer to use Python syntax
instead of inventing a new syntax and then having to write a parser
for it from scratch?

2) I have some vague plans worked out to fix all this so dtypes are
just ordinary python objects, but haven't written it down yet due to a
combination of lack of time to do so, and lack of anyone with time to
actually implement the plan even if it were written down. I mention
this just in case someone wants to volunteer, which would move it up
my stack.

-n

From ralf.gommers at gmail.com  Tue May 28 17:10:43 2013
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Tue, 28 May 2013 23:10:43 +0200
Subject: [Numpy-discussion] Behavior or finance.npv function
In-Reply-To:
References:
Message-ID:

On Thu, May 23, 2013 at 3:12 AM, Guillaume VIRY wrote:

> Good morning all
>
> I have been using the financial functions for a while now, and the npv
> function has always bothered me.
>

All of them have always bothered me:)

> Indeed neither the documentation nor the implementation seem to be
> correct, and actually are not congruent with each other either.
> > As stated in the bug report https://github.com/numpy/numpy/issues/649 : > > the documentation states that the summation is from 0 to M (should be M-1, > if the input data is of size M by the way) which is the correct definition, > but the implementation computes a summation from 1 to M, which is known to > be wrong, and a mimic of Excel's NPV behavior. > > So basically, we would have the following : > > RealNPV(data) = data[0] + CurrentNPV(data[1:]) > > Both behaviors are fine to me, as long as the documentation is clear about > it. Ideally the implementation should be corrected, but in any case there > should be a documentation fix at least. > > I'd be glad to have your opinion about it. > Looks like the code needs to be be changed. The current sum starting at 1 is different from what I see on wikipedia and other definitions scattered over the internet. Unless someone complains, I think just considering this a bug and changing the code is OK. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Tue May 28 18:31:28 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 28 May 2013 16:31:28 -0600 Subject: [Numpy-discussion] quickselect Message-ID: Hi All, There is a PR adding quickselect to numpy as a function `partition`. Comments on name and exposure in the numpy API are welcome. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From guillaume.viry at gmail.com Tue May 28 20:49:52 2013 From: guillaume.viry at gmail.com (Guillaume VIRY) Date: Wed, 29 May 2013 09:49:52 +0900 Subject: [Numpy-discussion] Behavior or finance.npv function In-Reply-To: References: Message-ID: Glad to see I'm not the only one who cares :) I've actually made a pull request with the proper documentation / source / test update, so I guess I shall just wait for it to be accepted now ! -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Wed May 29 00:12:11 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 29 May 2013 00:12:11 -0400 Subject: [Numpy-discussion] quickselect In-Reply-To: References: Message-ID: On Tue, May 28, 2013 at 6:31 PM, Charles R Harris wrote: > Hi All, > > There is a PR adding quickselect to numpy as a function `partition`. > Comments on name and exposure in the numpy API are welcome. I think the name is fine. It's possible to get used to it. One possible use I can think of is if I want to sort afterwards only one of the tails (like largest 15%). Limitations: medium: like percentiles it would be nice to partition into more than two sets, if that can be done more efficiently than calling the function twice. major: it won't work to calculate the median or other quantiles since those interpolate np.median: "When N is even, it is the average of the two middle values of V_sorted" AFAICS based on the docstring (I closed my eyes when the c code scrolled by) Josef > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From rhattersley at gmail.com Wed May 29 05:47:10 2013 From: rhattersley at gmail.com (Richard Hattersley) Date: Wed, 29 May 2013 10:47:10 +0100 Subject: [Numpy-discussion] Parameterised dtypes In-Reply-To: References: Message-ID: Hi Nathaniel, Thanks for the useful feedback - it'll definitely save me some time chasing around the code base. 
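For reference, a minimal sketch of how the metadata mechanism surfaces at
the Python level (the `metadata` keyword of np.dtype is undocumented, so
treat this as an illustration rather than a stable API; the month lengths
are the hypothetical calendar from the original post):

>>> import numpy as np
>>> months = [34, 33, 31, 30, 30, 29, 29, 30, 31, 32, 34, 35]
>>> dt = np.dtype(np.int64, metadata={'months': months})
>>> dt.metadata['months']
[34, 33, 31, 30, 30, 29, 29, 30, 31, 32, 34, 35]
>>> np.zeros(3, dtype=dt).dtype.metadata['months']
[34, 33, 31, 30, 30, 29, 29, 30, 31, 32, 34, 35]

Whether the metadata survives casting, ufuncs, and the like is a separate
question, as the points below suggest.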
> dtype callbacks and ufuncs don't in general get access to the > dtype object, so they can't access whatever parameters exist Indeed - it is a little awkward. But I'm hoping I can use the `data` argument to supply this. > You don't even need 'metadata' or 'c_metadata' -- this is Python, we > already have a totally standard way to add new fields, just subclass > the dumb thing. That would be nice ... but Py_TPFLAGS_BASETYPE is not set for PyArrayDescr_Type so that class is "final". > 1) No, you can't hook into the dtype string parser. Though, are you > sure you really want to? Surely it's nicer to use Python syntax > instead of inventing a new syntax and then having to write a parser > for it from scratch? Thank you - that's good to know. As you say, I'd *far* rather avoid parsing, but I'd like my dtype to be a "good citizen" so I was asking out of completeness. (Off at a tangent: The blaze project is a good example of what happens if you do add more parsing. In my opinion it's not the way to go.) > 2) I have some vague plans worked out to fix all this so dtypes are > just ordinary python objects, but haven't written it down yet due to a > combination of lack of time to do so, and lack of anyone with time to > actually implement the plan even if it were written down. I mention > this just in case someone wants to volunteer, which would move it up > my stack. Would you have the time to sketch out the intended benefits? Richard From rhattersley at gmail.com Wed May 29 05:55:01 2013 From: rhattersley at gmail.com (Richard Hattersley) Date: Wed, 29 May 2013 10:55:01 +0100 Subject: [Numpy-discussion] Parameterised dtypes In-Reply-To: References: Message-ID: Hi Andrew, > Maybe a stupid question, but do you know a reference I could look at > for the metadata and c_metadata fields you described? Sorry ... no. I've not found anything. :-( If I remember correctly, I got wind of the metadata aspect from the mailing list discussions of datetime64. So for my current work I've just been scratching around in the datetime64 code looking for example usage. Regards, Richard From cournape at gmail.com Wed May 29 07:01:34 2013 From: cournape at gmail.com (David Cournapeau) Date: Wed, 29 May 2013 12:01:34 +0100 Subject: [Numpy-discussion] Parameterised dtypes In-Reply-To: References: Message-ID: On Tue, May 28, 2013 at 9:06 PM, Nathaniel Smith wrote: > On Fri, May 24, 2013 at 3:12 PM, Richard Hattersley > wrote: >> Hi all, >> >> I'm in the process of defining some new dtypes to handle non-physical >> calendars (such as the 360-day calendar used in the climate modelling >> world). This is all going fine[*] so far, but I'd like to know a little bit >> more about how much is ultimately possible. >> >> The PyArray_Descr members `metadata` and `c_metadata` allow run-time >> parametrisation, but is it possible to hook into the dtype('...') parsing >> mechanism to supply those parameters? Or is there some other dtype mechanism >> for supplying parameters? >> >> As an example, would it be possible to supply month lengths? >>>>> a = np.zeros(n, dtype='my_date[34,33,31,30,30,29,29,30,31,32,34,35]') >> >> Or is the intended use of parametrisation more like: >>>>> weird = my_stuff.make_dtype([34,33,31,30,30,29,29,30,31,32,34,35]) >>>>> a = np.zeros(n, dtype=weird) > > I don't think there's any "intended use" really. 
AFAICT numpy was
> originally designed with the assumption that there was a fixed set of
> stateless dtypes, and then the ability to add new dtypes, the
> ability to add state (parametrize them), etc., have gradually been
> kluged in as needed to solve whatever immediate problem someone faced.
> E.g., dtype callbacks and ufuncs don't in general get access to the
> dtype object, so they can't access whatever parameters exist, and the
> builtin dtypes that *are* parametrized (strings, structs, etc.) all
> have special case code scattered all around numpy.
>
> You don't even need 'metadata' or 'c_metadata' -- this is Python, we
> already have a totally standard way to add new fields, just subclass
> the dumb thing. Instead we have this baffling system of 'NpyAuxData'
> which invents its own tiny little refcounted object system from
> scratch, and 'issubdtype' which invents its own way of representing
> inheritance hierarchies independent of Python's object system, and so
> forth.
>
> Anyway!
>
> 1) No, you can't hook into the dtype string parser. Though, are you
> sure you really want to? Surely it's nicer to use Python syntax
> instead of inventing a new syntax and then having to write a parser
> for it from scratch?
>
> 2) I have some vague plans worked out to fix all this so dtypes are
> just ordinary python objects, but haven't written it down yet due to a
> combination of lack of time to do so, and lack of anyone with time to
> actually implement the plan even if it were written down. I mention
> this just in case someone wants to volunteer, which would move it up
> my stack.

Nathan, will you be at the scipy conference this year? That's something
I'd like to improve/refactor myself as well. I have not thought much
about making dtype an ordinary object yet, but is what you had in mind
'mostly' backward incompatible?

David

From jtaylor.debian at googlemail.com Wed May 29 12:25:51 2013
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Wed, 29 May 2013 18:25:51 +0200
Subject: [Numpy-discussion] quickselect
In-Reply-To:
References:
Message-ID: <51A62C0F.6040109@googlemail.com>

On 05/29/2013 06:12 AM, josef.pktd at gmail.com wrote:
> On Tue, May 28, 2013 at 6:31 PM, Charles R Harris
> wrote:
>> Hi All,
>>
>> There is a PR adding quickselect to numpy as a function `partition`.
>> Comments on name and exposure in the numpy API are welcome.
>
> I think the name is fine. It's possible to get used to it.
>
> One possible use I can think of is if I want to sort afterwards only
> one of the tails (like largest 15%).
>
> Limitations:
>
> medium: like percentiles it would be nice to partition into more than
> two sets, if that can be done more efficiently than calling the
> function twice.

I'm not aware of an algorithm that is more efficient at generating
multiple partitions than partitioning twice.
Additional partitions can work on subranges, so the smaller the
remaining range, the faster it gets.
I did a coarse benchmark with percentiles, partitioning multiple times
instead of sorting once.
Partition beats sorting even when you want to select 500 percentiles, so
we can probably change it completely.
If people frequently select even more, we could add a heuristic cutoff
to sort instead.
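For illustration, a rough sketch of the multiple-percentile case
(hypothetical data; this assumes the PR exposes the function as
np.partition with an np.sort-like interface where `kth` may be a
sequence, so several order statistics land in place with a single call):

>>> import numpy as np
>>> a = np.random.rand(10000)
>>> kth = [int(q * (a.size - 1)) for q in (0.25, 0.5, 0.75)]
>>> p = np.partition(a, kth)     # all kth positions are in sorted order
>>> [p[k] for k in kth]          # same values as [np.sort(a)[k] for k in kth]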
>
> major:
> it won't work to calculate the median or other quantiles since those
> interpolate
> np.median: "When N is even, it is the average of the two middle values
> of V_sorted"
>

This works fine: quickselect/partition moves the kth element into its
final sorted order, so to get the median of an even number of elements
you can just partition twice to get the two middle values, then take the
mean as usual.

This is already implemented in the pull request.
It includes the additional optimization that the second middle value is
just the minimum of the second partition.

From josef.pktd at gmail.com Wed May 29 15:19:57 2013
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 29 May 2013 15:19:57 -0400
Subject: [Numpy-discussion] quickselect
In-Reply-To: <51A62C0F.6040109@googlemail.com>
References: <51A62C0F.6040109@googlemail.com>
Message-ID:

On Wed, May 29, 2013 at 12:25 PM, Julian Taylor wrote:
> On 05/29/2013 06:12 AM, josef.pktd at gmail.com wrote:
>>
>> On Tue, May 28, 2013 at 6:31 PM, Charles R Harris
>> wrote:
>>>
>>> Hi All,
>>>
>>> There is a PR adding quickselect to numpy as a function `partition`.
>>> Comments on name and exposure in the numpy API are welcome.
>>
>>
>> I think the name is fine. It's possible to get used to it.
>>
>> One possible use I can think of is if I want to sort afterwards only
>> one of the tails (like largest 15%).
>>
>> Limitations:
>>
>> medium: like percentiles it would be nice to partition into more than
>> two sets, if that can be done more efficiently than calling the
>> function twice.
>
>
> I'm not aware of an algorithm that is more efficient at generating
> multiple partitions than partitioning twice.
> Additional partitions can work on subranges, so the smaller the
> remaining range, the faster it gets.
> I did a coarse benchmark with percentiles, partitioning multiple times
> instead of sorting once.
> Partition beats sorting even when you want to select 500 percentiles, so
> we can probably change it completely.
> If people frequently select even more, we could add a heuristic cutoff
> to sort instead.

I only know a few places in statistics where a medium-large number of
percentiles or partitions are used; most of the time it's either just a
few or a full sort. This would work well then.

Actually, I just thought about another application:
If I want to do a coarse quantization of an array by percentiles, then I
could partition it and assign categories, instead of doing a full sort.
The number of bins could be large in this case.

>
>> major:
>> it won't work to calculate the median or other quantiles since those
>> interpolate
>> np.median: "When N is even, it is the average of the two middle values
>> of V_sorted"
>>
>
> This works fine: quickselect/partition moves the kth element into its
> final sorted order, so to get the median of an even number of elements
> you can just partition twice to get the two middle values, then take the
> mean as usual.
>
> This is already implemented in the pull request.
> It includes the additional optimization that the second middle value is
> just the minimum of the second partition.
>

Thanks for the explanation. I was only looking for docstrings and didn't
see the implementation of median. Using min on the second partition looks
like a nice trick (one to remember).

As a future user, I think this sounds good, faster median and percentiles
and a nice new partition function.
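For reference, a minimal sketch of the even-length median trick described
above (hypothetical data; again assuming the function lands as
np.partition):

>>> import numpy as np
>>> a = np.random.rand(10)       # even number of elements
>>> k = a.size // 2
>>> p = np.partition(a, k - 1)   # lower middle value ends up at p[k-1]
>>> lower = p[k - 1]
>>> upper = p[k:].min()          # upper middle value: min of the second partition
>>> 0.5 * (lower + upper) == np.median(a)
True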
Josef From ndbecker2 at gmail.com Thu May 30 07:44:19 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Thu, 30 May 2013 07:44:19 -0400 Subject: [Numpy-discussion] forwarded article (embracing tensors) Message-ID: I thought the topic of this article might be of interest here: https://groups.google.com/forum/?fromgroups#!topic/julia-dev/GAdcYzmibyo From msuzen at gmail.com Thu May 30 08:37:09 2013 From: msuzen at gmail.com (Suzen, Mehmet) Date: Thu, 30 May 2013 14:37:09 +0200 Subject: [Numpy-discussion] forwarded article (embracing tensors) In-Reply-To: References: Message-ID: On 30 May 2013 13:44, Neal Becker wrote: > I thought the topic of this article might be of interest here: > > https://groups.google.com/forum/?fromgroups#!topic/julia-dev/GAdcYzmibyo > >"I think Julia has great potential, and can, unlike Matlab, rid itself of its tunnel vision on matrices and embrace tensors from the ground >up" Trying to discredit Matlab for tensors? I appreciate the enthusiasm but consider that there are high quality extensions before trying to discredit other people's work: http://www.sandia.gov/~tgkolda/TensorToolbox/index-2.5.html From thomas.robitaille at gmail.com Thu May 30 15:28:42 2013 From: thomas.robitaille at gmail.com (Thomas Robitaille) Date: Thu, 30 May 2013 21:28:42 +0200 Subject: [Numpy-discussion] __array_priority__ ignored if __array__ is present In-Reply-To: References: Message-ID: Hi Frederic, On 16 May 2013 15:58, Fr?d?ric Bastien wrote: > I looked yesterday rapidly in the code and didn't find the reason (I don't > know it well, that is probably why). > > But last night I think of one possible cause. I found this code 2 times in > the file core/src/umath/ufunc_object.c: > > if (nin == 2 && nout == 1 && dtypes[1]->type_num == NPY_OBJECT) { > PyObject *_obj = PyTuple_GET_ITEM(args, 1); > if (!PyArray_CheckExact(_obj)) { > double self_prio, other_prio; > self_prio = PyArray_GetPriority(PyTuple_GET_ITEM(args, 0), > > NPY_SCALAR_PRIORITY); > other_prio = PyArray_GetPriority(_obj, NPY_SCALAR_PRIORITY); > if (self_prio < other_prio && > _has_reflected_op(_obj, ufunc_name)) { > retval = -2; > goto fail; > } > } > } > > It is this code that will call _has_reflected_op() function. The if > condition is: > > dtypes[1]->type_num == NPY_OBJECT > > > I wouldn't be surprised if dtypes[1] isn't NPY_OBJECT when you implement > __array__. > > dtypes is set with those line: > > retval = ufunc->type_resolver(ufunc, casting, > op, type_tup, dtypes); Thanks for looking into this - should this be considered a bug? 
Tom > > > HTH > > Fred > > > > On Thu, May 16, 2013 at 9:19 AM, Thomas Robitaille > wrote: >> >> Hi everyone, >> >> (this was posted as part of another topic, but since it was unrelated, >> I'm reposting as a separate thread) >> >> I've also been having issues with __array_priority__ - the following >> code behaves differently for __mul__ and __rmul__: >> >> """ >> import numpy as np >> >> >> class TestClass(object): >> >> def __init__(self, input_array): >> self.array = input_array >> >> def __mul__(self, other): >> print "Called __mul__" >> >> def __rmul__(self, other): >> print "Called __rmul__" >> >> def __array_wrap__(self, out_arr, context=None): >> print "Called __array_wrap__" >> return TestClass(out_arr) >> >> def __array__(self): >> print "Called __array__" >> return np.array(self.array) >> """ >> >> with output: >> >> """ >> In [7]: a = TestClass([1,2,3]) >> >> In [8]: print type(np.array([1,2,3]) * a) >> Called __array__ >> Called __array_wrap__ >> >> >> In [9]: print type(a * np.array([1,2,3])) >> Called __mul__ >> >> """ >> >> Is this also an oversight? I opened a ticket for it a little while ago: >> >> https://github.com/numpy/numpy/issues/3164 >> >> Any ideas? >> >> Thanks! >> Tom >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From aldcroft at head.cfa.harvard.edu Thu May 30 16:21:36 2013 From: aldcroft at head.cfa.harvard.edu (Aldcroft, Thomas) Date: Thu, 30 May 2013 16:21:36 -0400 Subject: [Numpy-discussion] __getitem__ and creating an array of objects Message-ID: I'm seeing some behavior that I can't understand when creating a numpy array of Python objects. Basically it seems that np.array() is calling the object __getitem__ method for one object class but not another class, and I can't understand the difference. Here is an example, starting with a simple class where __getitem__ does NOT get called: >>> class Foo(object): ... def __getitem__(self, item): ... return 1/0 ... def __len__(self, item): ... return 1 ... >>> f = Foo() >>> x = np.array([f]) >>> x array([<__main__.Foo object at 0x11a1ff10>], dtype=object) Now move to the complicated class (astropy.time.Time) which has a lot of stuff in it (including __new__, __init__, and __len__ methods, but initially no __getitem__), but is otherwise an ordinary class derived from object. This works as expected: >>> from astropy.time import Time >>> t = Time('2001-01-01', scale='utc') >>> x = np.array([t]) >>> x array([2001-01-01 00:00:00.000], dtype=object) Now inject a __getitem__ that will fail and try again: >>> Time.__getitem__ = lambda self, item: 1 / 0 >>> x = np.array([t]) ERROR: ZeroDivisionError: integer division or modulo by zero Any ideas on what is driving this difference in behavior? BTW, the value of item in the latter case is 0. Thanks, Tom -------------- next part -------------- An HTML attachment was scrubbed... URL: From nouiz at nouiz.org Thu May 30 16:26:35 2013 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Thu, 30 May 2013 16:26:35 -0400 Subject: [Numpy-discussion] __array_priority__ ignored if __array__ is present In-Reply-To: References: Message-ID: I think so. 
Changing the order between "np.array([1,2,3]) * a" and "a * np.array([1,2,3])" should return the same type I think, specificaly when array_priority is defined. Fred On Thu, May 30, 2013 at 3:28 PM, Thomas Robitaille < thomas.robitaille at gmail.com> wrote: > Hi Frederic, > > On 16 May 2013 15:58, Fr?d?ric Bastien wrote: > > I looked yesterday rapidly in the code and didn't find the reason (I > don't > > know it well, that is probably why). > > > > But last night I think of one possible cause. I found this code 2 times > in > > the file core/src/umath/ufunc_object.c: > > > > if (nin == 2 && nout == 1 && dtypes[1]->type_num == NPY_OBJECT) { > > PyObject *_obj = PyTuple_GET_ITEM(args, 1); > > if (!PyArray_CheckExact(_obj)) { > > double self_prio, other_prio; > > self_prio = PyArray_GetPriority(PyTuple_GET_ITEM(args, 0), > > > > NPY_SCALAR_PRIORITY); > > other_prio = PyArray_GetPriority(_obj, NPY_SCALAR_PRIORITY); > > if (self_prio < other_prio && > > _has_reflected_op(_obj, ufunc_name)) { > > retval = -2; > > goto fail; > > } > > } > > } > > > > It is this code that will call _has_reflected_op() function. The if > > condition is: > > > > dtypes[1]->type_num == NPY_OBJECT > > > > > > I wouldn't be surprised if dtypes[1] isn't NPY_OBJECT when you implement > > __array__. > > > > dtypes is set with those line: > > > > retval = ufunc->type_resolver(ufunc, casting, > > op, type_tup, dtypes); > > Thanks for looking into this - should this be considered a bug? > > Tom > > > > > > > HTH > > > > Fred > > > > > > > > On Thu, May 16, 2013 at 9:19 AM, Thomas Robitaille > > wrote: > >> > >> Hi everyone, > >> > >> (this was posted as part of another topic, but since it was unrelated, > >> I'm reposting as a separate thread) > >> > >> I've also been having issues with __array_priority__ - the following > >> code behaves differently for __mul__ and __rmul__: > >> > >> """ > >> import numpy as np > >> > >> > >> class TestClass(object): > >> > >> def __init__(self, input_array): > >> self.array = input_array > >> > >> def __mul__(self, other): > >> print "Called __mul__" > >> > >> def __rmul__(self, other): > >> print "Called __rmul__" > >> > >> def __array_wrap__(self, out_arr, context=None): > >> print "Called __array_wrap__" > >> return TestClass(out_arr) > >> > >> def __array__(self): > >> print "Called __array__" > >> return np.array(self.array) > >> """ > >> > >> with output: > >> > >> """ > >> In [7]: a = TestClass([1,2,3]) > >> > >> In [8]: print type(np.array([1,2,3]) * a) > >> Called __array__ > >> Called __array_wrap__ > >> > >> > >> In [9]: print type(a * np.array([1,2,3])) > >> Called __mul__ > >> > >> """ > >> > >> Is this also an oversight? I opened a ticket for it a little while ago: > >> > >> https://github.com/numpy/numpy/issues/3164 > >> > >> Any ideas? > >> > >> Thanks! > >> Tom > >> _______________________________________________ > >> NumPy-Discussion mailing list > >> NumPy-Discussion at scipy.org > >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From robert.kern at gmail.com Thu May 30 16:27:20 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 30 May 2013 21:27:20 +0100 Subject: [Numpy-discussion] __getitem__ and creating an array of objects In-Reply-To: References: Message-ID: On Thu, May 30, 2013 at 9:21 PM, Aldcroft, Thomas wrote: > I'm seeing some behavior that I can't understand when creating a numpy array > of Python objects. Basically it seems that np.array() is calling the object > __getitem__ method for one object class but not another class, and I can't > understand the difference. > > Here is an example, starting with a simple class where __getitem__ does NOT > get called: > >>>> class Foo(object): > ... def __getitem__(self, item): > ... return 1/0 > ... def __len__(self, item): > ... return 1 > ... >>>> f = Foo() >>>> x = np.array([f]) >>>> x > array([<__main__.Foo object at 0x11a1ff10>], dtype=object) > > Now move to the complicated class (astropy.time.Time) which has a lot of > stuff in it (including __new__, __init__, and __len__ methods, but initially > no __getitem__), but is otherwise an ordinary class derived from object. > This works as expected: > >>>> from astropy.time import Time >>>> t = Time('2001-01-01', scale='utc') >>>> x = np.array([t]) >>>> x > array([2001-01-01 00:00:00.000], dtype=object) > > Now inject a __getitem__ that will fail and try again: > >>>> Time.__getitem__ = lambda self, item: 1 / 0 >>>> x = np.array([t]) > ERROR: ZeroDivisionError: integer division or modulo by zero > > Any ideas on what is driving this difference in behavior? BTW, the value of > item in the latter case is 0. What is len(t) in the latter case? -- Robert Kern From aldcroft at head.cfa.harvard.edu Thu May 30 16:58:54 2013 From: aldcroft at head.cfa.harvard.edu (Aldcroft, Thomas) Date: Thu, 30 May 2013 16:58:54 -0400 Subject: [Numpy-discussion] __getitem__ and creating an array of objects In-Reply-To: References: Message-ID: On Thu, May 30, 2013 at 4:27 PM, Robert Kern wrote: > On Thu, May 30, 2013 at 9:21 PM, Aldcroft, Thomas > wrote: > > I'm seeing some behavior that I can't understand when creating a numpy > array > > of Python objects. Basically it seems that np.array() is calling the > object > > __getitem__ method for one object class but not another class, and I > can't > > understand the difference. > > > > Here is an example, starting with a simple class where __getitem__ does > NOT > > get called: > > > >>>> class Foo(object): > > ... def __getitem__(self, item): > > ... return 1/0 > > ... def __len__(self, item): > > ... return 1 > > ... > >>>> f = Foo() > >>>> x = np.array([f]) > >>>> x > > array([<__main__.Foo object at 0x11a1ff10>], dtype=object) > > > > Now move to the complicated class (astropy.time.Time) which has a lot of > > stuff in it (including __new__, __init__, and __len__ methods, but > initially > > no __getitem__), but is otherwise an ordinary class derived from object. > > This works as expected: > > > >>>> from astropy.time import Time > >>>> t = Time('2001-01-01', scale='utc') > >>>> x = np.array([t]) > >>>> x > > array([2001-01-01 00:00:00.000], dtype=object) > > > > Now inject a __getitem__ that will fail and try again: > > > >>>> Time.__getitem__ = lambda self, item: 1 / 0 > >>>> x = np.array([t]) > > ERROR: ZeroDivisionError: integer division or modulo by zero > > > > Any ideas on what is driving this difference in behavior? BTW, the > value of > > item in the latter case is 0. > > What is len(t) in the latter case? 
> >>> len(t) 1 Prompted by this question I looked again at the Foo() definition and see that I had copy/pasted the definition of __len__ from __getitem__ and forgot to remove the `item` arg. As written `len(Foo())` would fail. Interestingly, once `Foo.__len__` is valid and succeeds, then `f.__getitem__` does indeed get called and `np.array([f])` fails in the same way. Sorry for the noise, though this is still just slightly curious to me. I guess internally maybe there is try/except block that is trying to get a len and if that fails then it moves on. - Tom > > -- > Robert Kern > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From aldcroft at head.cfa.harvard.edu Thu May 30 17:31:49 2013 From: aldcroft at head.cfa.harvard.edu (Aldcroft, Thomas) Date: Thu, 30 May 2013 17:31:49 -0400 Subject: [Numpy-discussion] __getitem__ and creating an array of objects In-Reply-To: References: Message-ID: On Thu, May 30, 2013 at 4:58 PM, Aldcroft, Thomas < aldcroft at head.cfa.harvard.edu> wrote: > > > > On Thu, May 30, 2013 at 4:27 PM, Robert Kern wrote: > >> On Thu, May 30, 2013 at 9:21 PM, Aldcroft, Thomas >> wrote: >> > I'm seeing some behavior that I can't understand when creating a numpy >> array >> > of Python objects. Basically it seems that np.array() is calling the >> object >> > __getitem__ method for one object class but not another class, and I >> can't >> > understand the difference. >> > >> > Here is an example, starting with a simple class where __getitem__ does >> NOT >> > get called: >> > >> >>>> class Foo(object): >> > ... def __getitem__(self, item): >> > ... return 1/0 >> > ... def __len__(self, item): >> > ... return 1 >> > ... >> >>>> f = Foo() >> >>>> x = np.array([f]) >> >>>> x >> > array([<__main__.Foo object at 0x11a1ff10>], dtype=object) >> > >> > Now move to the complicated class (astropy.time.Time) which has a lot of >> > stuff in it (including __new__, __init__, and __len__ methods, but >> initially >> > no __getitem__), but is otherwise an ordinary class derived from object. >> > This works as expected: >> > >> >>>> from astropy.time import Time >> >>>> t = Time('2001-01-01', scale='utc') >> >>>> x = np.array([t]) >> >>>> x >> > array([2001-01-01 00:00:00.000], dtype=object) >> > >> > Now inject a __getitem__ that will fail and try again: >> > >> >>>> Time.__getitem__ = lambda self, item: 1 / 0 >> >>>> x = np.array([t]) >> > ERROR: ZeroDivisionError: integer division or modulo by zero >> > >> > Any ideas on what is driving this difference in behavior? BTW, the >> value of >> > item in the latter case is 0. >> >> What is len(t) in the latter case? >> > > >>> len(t) > 1 > > Prompted by this question I looked again at the Foo() definition and see > that I had copy/pasted the definition of __len__ from __getitem__ and > forgot to remove the `item` arg. As written `len(Foo())` would fail. > Interestingly, once `Foo.__len__` is valid and succeeds, then > `f.__getitem__` does indeed get called and `np.array([f])` fails in the > same way. > > Sorry for the noise, though this is still just slightly curious to me. I > guess internally maybe there is try/except block that is trying to get a > len and if that fails then it moves on. > Now I realize what seemed curious. 
Here is a related example which shows
that when initializing a numpy array of objects where __getitem__ and
__len__ exist, np.array introspects the object item values for item in
range(len(object)) and appears to replace the input object with an ndarray
of the object values.

>>> class Foo(object):
...     attr = 'hello'
...     def __getitem__(self, item):
...         return item
...     def __len__(self):
...         return 5

>>> f = Foo()
>>> f.attr
'hello'

>>> x = np.array([f, f], dtype=object)
>>> x
array([[0, 1, 2, 3, 4],
       [0, 1, 2, 3, 4]], dtype=object)
>>> x[0]
array([0, 1, 2, 3, 4], dtype=object)
>>> type(x[0])
<type 'numpy.ndarray'>
>>> x[0].attr
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
    x[0].attr
AttributeError: 'numpy.ndarray' object has no attribute 'attr'

The actual objects that I passed in seem to be lost, which is not the
behavior I expected.

Thanks,
Tom

> - Tom
>
>
>>
>> --
>> Robert Kern
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at scipy.org
>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From robert.kern at gmail.com Thu May 30 17:47:09 2013
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 30 May 2013 22:47:09 +0100
Subject: [Numpy-discussion] __getitem__ and creating an array of objects
In-Reply-To:
References:
Message-ID:

On Thu, May 30, 2013 at 10:31 PM, Aldcroft, Thomas
wrote:
> Now I realize what seemed curious. Here is a related example which shows
> that when initializing a numpy array of objects where __getitem__ and
> __len__ exist, np.array introspects the object item values for item in
> range(len(object)) and appears to replace the input object with an ndarray
> of the object values.
>
>>>> class Foo(object):
> ...     attr = 'hello'
> ...     def __getitem__(self, item):
> ...         return item
> ...     def __len__(self):
> ...         return 5
>
>>>> f = Foo()
>>>> f.attr
> 'hello'
>
>>>> x = np.array([f, f], dtype=object)
>>>> x
> array([[0, 1, 2, 3, 4],
>        [0, 1, 2, 3, 4]], dtype=object)
>>>> x[0]
> array([0, 1, 2, 3, 4], dtype=object)
>>>> type(x[0])
> <type 'numpy.ndarray'>
>>>> x[0].attr
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
>     x[0].attr
> AttributeError: 'numpy.ndarray' object has no attribute 'attr'
>
> The actual objects that I passed in seem to be lost, which is not the
> behavior I expected.

By defining __len__ and __getitem__, you are making your objects look
like containers that array() should traverse down into, like a list,
rather than atomic items like a float. array() has to do a lot of
magic to determine what shape of an array you actually want. If you
have objects that look like containers but you don't want them to be
the atomic items, then you should first construct an empty object
array of the right shape, then populate it using slice assignment.

--
Robert Kern
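For concreteness, a minimal sketch of the workaround Robert describes,
using the hypothetical Foo from above (element-by-element assignment into
a pre-shaped object array sidesteps array()'s shape introspection; plain
slice assignment from a list of such objects may re-trigger it):

>>> import numpy as np
>>> f = Foo()
>>> x = np.empty(2, dtype=object)    # fix the shape up front
>>> for i, obj in enumerate([f, f]):
...     x[i] = obj                   # stores the objects themselves
>>> type(x[0])
<class '__main__.Foo'>
>>> x[0].attr
'hello'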
From sebastian at sipsolutions.net Fri May 31 14:08:13 2013
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Fri, 31 May 2013 20:08:13 +0200
Subject: [Numpy-discussion] Effect of deprecating non-integers for integer function arguments
Message-ID: <1370023693.2584.25.camel@sebastian-laptop>

Hi,

the current numpy master has deprecated non-integers for the use of
indexing (not fancy indexing yet). However I think this should be moved
further down in the numpy machinery, which means that the conversion
utils provided by numpy would generally raise warnings for non-integers.

This means that for most numpy functions such as reshape, etc. the use
of non-integers for arguments that are naturally integers is deprecated,
which may also affect third party code in principle.

Testing this against current SciPy apparently causes 56 failures (with
deprecation warnings being raised). Are other projects similarly affected
by this change?

The branch implementing this can be found in the PR
https://github.com/numpy/numpy/pull/3243

Regards,

Sebastian

From josef.pktd at gmail.com Fri May 31 16:32:20 2013
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Fri, 31 May 2013 16:32:20 -0400
Subject: [Numpy-discussion] Effect of deprecating non-integers for integer function arguments
In-Reply-To: <1370023693.2584.25.camel@sebastian-laptop>
References: <1370023693.2584.25.camel@sebastian-laptop>
Message-ID:

On Fri, May 31, 2013 at 2:08 PM, Sebastian Berg wrote:
> Hi,
>
> the current numpy master has deprecated non-integers for the use of
> indexing (not fancy indexing yet). However I think this should be moved
> further down in the numpy machinery, which means that the conversion
> utils provided by numpy would generally raise warnings for non-integers.
>
> This means that for most numpy functions such as reshape, etc. the use
> of non-integers for arguments that are naturally integers is deprecated,
> which may also affect third party code in principle.
>
> Testing this against current SciPy apparently causes 56 failures (with
> deprecation warnings being raised). Are other projects similarly affected
> by this change?

I haven't checked for statsmodels, but I would consider it a bug on our
side if we have implicit conversion to integers. We might still have some
left in older code, but we have already changed most of them.

I just wrote code like this (upp are quantiles in (0, 1)):

m_upp = [x[:int(np.round(upp_ * nobs))].mean(0) for upp_ in upp]

I don't think we use non-integers for reshape.

Josef

>
> The branch implementing this can be found in the PR
> https://github.com/numpy/numpy/pull/3243
>
> Regards,
>
> Sebastian
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From albert.kottke at gmail.com Fri May 31 17:08:11 2013
From: albert.kottke at gmail.com (Albert Kottke)
Date: Fri, 31 May 2013 14:08:11 -0700
Subject: [Numpy-discussion] genfromtxt() skips comments
Message-ID:

I noticed that genfromtxt() did not skip comments if the keyword names is
not True. If names is True, then genfromtxt() would take the first line as
the names. I am proposing a fix to genfromtxt that skips all of the
comments in a file, potentially using the last comment line for names.
This will allow reading files with and without comments and/or names.

The difference is here:
https://github.com/arkottke/numpy/compare/my-genfromtxt

Albert

p.s. insert some disclaimer about my first pull request
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From ben.root at ou.edu Fri May 31 17:14:40 2013
From: ben.root at ou.edu (Benjamin Root)
Date: Fri, 31 May 2013 17:14:40 -0400
Subject: [Numpy-discussion] genfromtxt() skips comments
In-Reply-To:
References:
Message-ID:

On Fri, May 31, 2013 at 5:08 PM, Albert Kottke wrote:

> I noticed that genfromtxt() did not skip comments if the keyword names is
> not True. If names is True, then genfromtxt() would take the first line as
> the names. I am proposing a fix to genfromtxt that skips all of the
> comments in a file, potentially using the last comment line for names.
> This will allow reading files with and without comments and/or names.
>
> The difference is here:
> https://github.com/arkottke/numpy/compare/my-genfromtxt
>

Careful with semantics here. First off, using the last comment line as
the source for names might initially make sense, except when there are
comments within the data file. I would suggest going for "last comment
line before the first line of data". Second, sometimes the names come
from an un-commented first line, but comments are still used within the
file elsewhere.

Just some food for thought. I don't know if the current design is best
or not.

Ben Root

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From albert.kottke at gmail.com Fri May 31 17:30:31 2013
From: albert.kottke at gmail.com (Albert Kottke)
Date: Fri, 31 May 2013 14:30:31 -0700
Subject: [Numpy-discussion] genfromtxt() skips comments
In-Reply-To:
References:
Message-ID:

I agree that "last comment line before the first line of data" is more
descriptive.

Regarding the location of the names: I thought taking it from the last
comment line before the first line of data made sense because it would
permit reading of just the data with np.loadtxt(), but also permit
creating records with np.recfromtxt().

It would also be good to consider other implementations. For example,
pandas and R both use names without a comment character.

Albert

On Fri, May 31, 2013 at 2:14 PM, Benjamin Root wrote:

>
> On Fri, May 31, 2013 at 5:08 PM, Albert Kottke wrote:
>
>> I noticed that genfromtxt() did not skip comments if the keyword names is
>> not True. If names is True, then genfromtxt() would take the first line as
>> the names. I am proposing a fix to genfromtxt that skips all of the
>> comments in a file, potentially using the last comment line for names.
>> This will allow reading files with and without comments and/or names.
>>
>> The difference is here:
>> https://github.com/arkottke/numpy/compare/my-genfromtxt
>>
>
> Careful with semantics here. First off, using the last comment line as
> the source for names might initially make sense, except when there are
> comments within the data file. I would suggest going for "last comment
> line before the first line of data". Second, sometimes the names come
> from an un-commented first line, but comments are still used within the
> file elsewhere.
>
> Just some food for thought. I don't know if the current design is best
> or not.
>
> Ben Root
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From charlesr.harris at gmail.com Fri May 31 17:47:48 2013
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 31 May 2013 15:47:48 -0600
Subject: [Numpy-discussion] 1.8 release update
Message-ID:

Hi All,

Most of the PR's that were mentioned as desirable for the 1.8 release
have been merged, or look to be merged in the next week or two. The
current list of blockers doesn't look too severe to me, but I suspect
that it is incomplete.

The major outstanding issue I see is datetime and I'd like to get a small
group together to work that out. As a start I think such a group should
include Christopher Barker and Wes McKinney. Suggestions for other folks,
or even volunteers, to be part of such a group are welcome.
A lot of stuff has piled up over the last year for inclusion in the 1.8 release, and I'm sure some bugs and regressions have crept in along with the new code. On that account a lengthy wringing out period is probably going to be needed, so the sooner we can get the 1.8 branch tagged the better. I'd like to shoot for getting that done in 2-3 weeks. If there look to be difficult issues remaining, perhaps they can be worked out at scipy2013 if there is enough interest. Thoughts? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From pgmdevlist at gmail.com Fri May 31 18:02:19 2013 From: pgmdevlist at gmail.com (Pierre GM) Date: Sat, 1 Jun 2013 00:02:19 +0200 Subject: [Numpy-discussion] =?utf-8?q?_Re=3A__genfromtxt=28=29_skips_comme?= =?utf-8?q?nts?= In-Reply-To: References: Message-ID: On May 31, 2013 at 23:08:18 , Albert Kottke (albert.kottke at gmail.com) wrote: I noticed that genfromtxt() did not skip comments if the keyword names is not True. If names is True, then genfromtxt() would take the first line as the names. I am proposing a fix to genfromtxt that skips all of the comments in a file, and potentially using the last comment line for names. I'm quite surprised, as comments are already skipped in my standard numpy version (1.7.0). For example: >>> S=StringIO("!blah\n!blah\n!blah\n!A:B:C\n1:2:3\n4:5:6\n") >>> np.genfromtxt(S, delimiter=":", comments="!", names=("A","B","C")) Works as expected, even when using the default `names=None`. Comments are taken care of with the `split_line` function (an instance of `_iotools.LineSplitter`). From albert.kottke at gmail.com Fri May 31 18:24:39 2013 From: albert.kottke at gmail.com (Albert Kottke) Date: Fri, 31 May 2013 15:24:39 -0700 Subject: [Numpy-discussion] genfromtxt() skips comments In-Reply-To: References: Message-ID: Now try the same thing with np.recfromcsv(). I get the following (Python 3.3): >>> import io >>> b = io.BytesIO(b"!blah\n!blah\n!blah\n!A:B:C\n1:2:3\n4:5:6\n") >>> np.recfromcsv(b, delimiter=':', comments='!') ... ValueError: Some errors were detected ! Line #5 (got 3 columns instead of 1) Line #6 (got 3 columns instead of 1) On Fri, May 31, 2013 at 3:02 PM, Pierre GM wrote: > On May 31, 2013 at 23:08:18 , Albert Kottke (albert.kottke at gmail.com) > wrote: > I noticed that genfromtxt() did not skip comments if the keyword names is > not True. If names is True, then genfromtxt() would take the first line as > the names. I am proposing a fix to genfromtxt that skips all of the > comments in a file, and potentially using the last comment line for names. > I'm quite surprised, as comments are already skipped in my standard numpy > version (1.7.0). For example: > >>> S=StringIO("!blah\n!blah\n!blah\n!A:B:C\n1:2:3\n4:5:6\n") > >>> np.genfromtxt(S, delimiter=":", comments="!", names=("A","B","C")) > Works as expected, even when using the default `names=None`. > Comments are taken care of with the `split_line` function (an instance of > `_iotools.LineSplitter`). > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL:
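A possible user-side workaround in the meantime is to drop the comment
lines before the parser sees them (a rough sketch, assuming genfromtxt's
acceptance of an iterable of byte-string lines in place of a file; the
last comment line before the data is reused for the names):

>>> import numpy as np
>>> raw = b"!blah\n!blah\n!blah\n!A:B:C\n1:2:3\n4:5:6\n"
>>> lines = raw.splitlines()
>>> data = [l for l in lines if not l.startswith(b"!")]
>>> header = [l for l in lines if l.startswith(b"!")][-1]
>>> names = header.lstrip(b"!").decode().split(":")
>>> out = np.genfromtxt(data, delimiter=":", names=names)
>>> out["A"]  # structured array with fields A, B, C
array([ 1.,  4.])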