From enzomich at gmail.com Sat Jan 1 06:23:37 2011 From: enzomich at gmail.com (Enzo Michelangeli) Date: Sat, 1 Jan 2011 19:23:37 +0800 Subject: [Numpy-discussion] Optimization suggestion sought References: Message-ID: ----- Original Message ----- From: "Robert Bradshaw" Sent: Wednesday, December 29, 2010 4:47 PM [...] >> Regarding Justin's suggestion, before trying Cython (which, according to >> http://wiki.cython.org/tutorials/numpy , seems to require a bit of work >> to >> handle numpy arrays properly) > > Cython doesn't have to be that complicated. For your example, you just > have to unroll the vectorization (and account for the fact that the > result is mutated in place, which was your original goal). Thanks, but the full de-vectorization forces to give up any use of BLAS (I suppose that for array products numpy relies on its routines). In my tests, the performance in terms of speed is more or less the same as the original pure-numpy code (which may be made less memory-hungry with the chunking suggested by Josef). Instead, it would be nice to have a native function able to perform evaluation of arbitrary numpy expressions without converting the intermediate results in Python format (a sort of "better weave.blitz", able to understand slicing, broadcasting rules etc.). That would give us the best of both worlds: code execution at BLAS speeds, and savings in unnecessary conversions and temporary variable allocations. Such "numpy calculator" could also be a simple interpreter, avoiding the complexities and site dependencies deriving from the use of a C compiler: it should build temporary C data structures for the parameters in input, call the relevant C ATLAS/BLAS/LAPACK functions in the right order (possibly allocating temporary C arrays), and convert only the final result back to a Python object. Enzo From ralf.gommers at googlemail.com Sat Jan 1 06:40:58 2011 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Sat, 1 Jan 2011 19:40:58 +0800 Subject: [Numpy-discussion] OS X binaries. In-Reply-To: References: Message-ID: On Sat, Jan 1, 2011 at 5:44 AM, Gideon wrote: > I noticed that 1.5.1 was released, and sourceforge is suggesting I use > the package numpy-1.5.1-py2.6-python.org-macosx10.3.dmg. However, I > have an OS X 10.6 machine. > > Can/should I use this binary? > Yes you can. The naming scheme corresponds to the one used by Python itself on python.org. For 2.6 the ..macosx10.3.dmg works for all supported versions of OS X. For 2.7 you have the choice of 2 versions if you are on 10.6, depending on whether or not you want 32-bit or 64-bit. Cheers, Ralf > > Should I just compile from source? > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsalvati at u.washington.edu Sat Jan 1 14:23:22 2011 From: jsalvati at u.washington.edu (John Salvatier) Date: Sat, 1 Jan 2011 11:23:22 -0800 Subject: [Numpy-discussion] Giving numpy the ability to multi-iterate excluding an axis In-Reply-To: References: Message-ID: This thread is a bit old, but since it's not possible to use the C-API is possible to accomplish this same thing with the Python API? 
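For reference, the kind of loop I'd like to avoid writing by hand looks roughly like this in pure Python -- just a slow sketch built on broadcast_arrays and ndindex (the helper name is made up, and no new API is assumed):

import numpy as np

def iter_all_but_axis(arrs, axis):
    # broadcast the operands against each other, then visit every index
    # combination except `axis`, yielding 1-d views along `axis`
    bcast = np.broadcast_arrays(*arrs)
    outer_shape = list(bcast[0].shape)
    outer_shape[axis] = 1
    for idx in np.ndindex(*outer_shape):
        sl = list(idx)
        sl[axis] = slice(None)
        yield tuple(a[tuple(sl)] for a in bcast)

a = np.arange(2).reshape(2, 1)
b = np.arange(3).reshape(1, 3)
for va, vb in iter_all_but_axis([a, b], axis=1):
    print va, vb  # e.g. a moving-average kernel would run over va, vb here
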
On Tue, Dec 21, 2010 at 5:12 PM, Mark Wiebe wrote: > On Mon, Dec 20, 2010 at 1:42 PM, John Salvatier > wrote: > >> A while ago, I asked a whether it was possible to multi-iterate over >> several ndarrays but exclude a certain axis( >> http://www.mail-archive.com/numpy-discussion at scipy.org/msg29204.html), >> sort of a combination of PyArray_IterAllButAxis and PyArray_MultiIterNew. My >> goal was to allow creation of relatively complex ufuncs that can allow >> reduction or directionally dependent computation and still use broadcasting >> (for example a moving averaging ufunc that can have changing averaging >> parameters). I didn't get any solutions, which I take to mean that no one >> knew how to do this. >> >> I am thinking about trying to make a numpy patch with this functionality, >> and I have some questions: 1) How difficult would this kind of task be for >> someone with non-expert C knowledge and good numpy knowledge? 2) Does anyone >> have advice on how to do this kind of thing? >> > > You may be able to do what you would like with the new iterator I've > written. In particular, it supports nesting multiple iterators by providing > either pointers or offsets, and allowing you to specify any subset of the > axes to iterate. Here's how the code to do this in a simple 3D case might > look, for making axis 1 the inner loop: > > PyArrayObject *op[2] = {a,b}; > npy_intp axes_outer[2] = {0,2}}; > npy_intp *op_axes[2]; > npy_intp axis_inner = 1; > npy_int32 flags[2] = {NPY_ITER_READONLY, NPY_ITER_READONLY}; > NpyIter *outer, *inner; > NpyIter_IterNext_Fn oiternext, iiternext; > npy_intp *ooffsets; > char **idataptrs; > > op_axes[0] = op_axes[1] = axes_outer; > outer = NpyIter_MultiNew(2, op, NPY_ITER_OFFSETS, > NPY_KEEPORDER, NPY_NO_CASTING, flags, NULL, 2, > op_axes, 0); > op_axes[0] = op_axes[1] = &axis_inner; > inner = NpyIter_MultiNew(2, op, 0, NPY_KEEPORDER, NPY_NO_CASTING, flags, > NULL, 1, op_axes, 0); > > oiternext = NpyIter_GetIterNext(outer); > iiternext = NpyIter_GetIterNext(inner); > > ooffsets = (npy_intp *)NpyIter_GetDataPtrArray(outer); > idataptrs = NpyIter_GetDataPtrArray(inner); > > do { > do { > char *a_data = idataptrs[0] + ooffsets[0], *b_data = idataptrs[0] + > ooffsets[0]; > /* Do stuff with the data */ > } while(iiternext()); > NpyIter_Reset(inner); > } while(oiternext()); > > NpyIter_Deallocate(outer); > NpyIter_Deallocate(inner); > > Extending to more dimensions, or making both the inner and outer loops have > multiple dimensions, isn't too crazy. Is this along the lines of what you > need? > > If you check out my code, note that it currently isn't exposed as NumPy API > yet, but you can try a lot of things with the Python exposure. > > Cheers, > Mark > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From enzomich at gmail.com Sat Jan 1 20:42:02 2011 From: enzomich at gmail.com (Enzo Michelangeli) Date: Sun, 2 Jan 2011 09:42:02 +0800 Subject: [Numpy-discussion] Arrays with aliased elements? Message-ID: Is there any way, not involving compilation of C code, to define ndarrays where some rows or columns share the same data buffers? 
For example, something built by a hypothetical variant of the np.repeat() function, such that, if a = array([2,3]), calling: b = np.aliasedrepeat(x, [1, 2], axis=0) would return in b: array([[2, 3], [2, 3], [2, 3]]) ...with the understanding that the three rows would actually share the same data, so setting e.g.: b[0,1] = 5 ...would change b into: array([[2, 5], [2, 5], [2, 5]]) In other words, something with a behaviour similar to a list of lists: >>> a = [2,3] >>> b = [a,a,a] >>> b [[2, 3], [2, 3], [2, 3]] >>> b[0][1] = 5 >>> b [[2, 5], [2, 5], [2, 5]] This would save memory (and time spent in unnecessary copying) in some applications with large arrays, and would allow to cope with the current inability of weave.blitz to understand broadcasting rules, e.g. for calculating outer products (I mentioned this in a previous thread). Enzo From zachary.pincus at yale.edu Sat Jan 1 20:53:13 2011 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Sat, 1 Jan 2011 20:53:13 -0500 Subject: [Numpy-discussion] Arrays with aliased elements? In-Reply-To: References: Message-ID: <7DBBF9B7-9AE2-44C4-B126-5039F1B5AC67@yale.edu> def repeat(arr, num): arr = numpy.asarray(arr) return numpy.ndarray(arr.shape+(num,), dtype=arr.dtype, buffer=arr, strides=arr.strides+(0,)) There are limits to what these sort of stride tricks can accomplish, but repeating as above, or similar, is feasible. On Jan 1, 2011, at 8:42 PM, Enzo Michelangeli wrote: > Is there any way, not involving compilation of C code, to define > ndarrays > where some rows or columns share the same data buffers? For example, > something built by a hypothetical variant of the np.repeat() > function, such > that, if a = array([2,3]), calling: > > b = np.aliasedrepeat(x, [1, 2], axis=0) > > would return in b: > > array([[2, 3], > [2, 3], > [2, 3]]) > > ...with the understanding that the three rows would actually share > the same > data, so setting e.g.: > > b[0,1] = 5 > > ...would change b into: > > array([[2, 5], > [2, 5], > [2, 5]]) > > In other words, something with a behaviour similar to a list of lists: > >>>> a = [2,3] >>>> b = [a,a,a] >>>> b > [[2, 3], [2, 3], [2, 3]] >>>> b[0][1] = 5 >>>> b > [[2, 5], [2, 5], [2, 5]] > > This would save memory (and time spent in unnecessary copying) in some > applications with large arrays, and would allow to cope with the > current > inability of weave.blitz to understand broadcasting rules, e.g. for > calculating outer products (I mentioned this in a previous thread). > > Enzo > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From robert.kern at gmail.com Sat Jan 1 21:08:05 2011 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 1 Jan 2011 20:08:05 -0600 Subject: [Numpy-discussion] Arrays with aliased elements? In-Reply-To: References: Message-ID: On Sat, Jan 1, 2011 at 19:42, Enzo Michelangeli wrote: > Is there any way, not involving compilation of C code, to define ndarrays > where some rows or columns share the same data buffers? For example, > something built by a hypothetical variant of the np.repeat() function, such > that, if a = array([2,3]), calling: > > ? b = np.aliasedrepeat(x, [1, 2], axis=0) > > would return in b: > > ? array([[2, 3], > ? ? ? ? ?[2, 3], > ? ? ? ? ?[2, 3]]) > > ...with the understanding that the three rows would actually share the same > data, so setting e.g.: > > ? b[0,1] = 5 > > ...would change b into: > > ? array([[2, 5], > ? ? ? ? ?[2, 5], > ? ? ? ? 
?[2, 5]]) > > In other words, something with a behaviour similar to a list of lists: > >>>> a = [2,3] >>>> b = [a,a,a] >>>> b > [[2, 3], [2, 3], [2, 3]] >>>> b[0][1] = 5 >>>> b > [[2, 5], [2, 5], [2, 5]] > > This would save memory (and time spent in unnecessary copying) in some > applications with large arrays, and would allow to cope with the current > inability of weave.blitz to understand broadcasting rules, e.g. for > calculating outer products (I mentioned this in a previous thread). See numpy.lib.stride_tricks for tools to do this, specifically the as_strided() function. See numpy.broadcast_arrays() for the latter functionality. http://docs.scipy.org/doc/numpy/reference/generated/numpy.broadcast_arrays.html -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." ? -- Umberto Eco From enzomich at gmail.com Sun Jan 2 02:01:24 2011 From: enzomich at gmail.com (Enzo Michelangeli) Date: Sun, 2 Jan 2011 15:01:24 +0800 Subject: [Numpy-discussion] Arrays with aliased elements? References: Message-ID: <63A2AB5B2FCE4426B473D469EE06ADC1@EMLT> Thanks. Meanwhile, I had arrived to a solution similar to the one suggested by Zachary: >>> a = array([2,3]) >>> ndarray((3,a.shape[0]), strides=(0,a.itemsize), buffer = a, offset=0, >>> dtype=a.dtype) array([[2, 3], [2, 3], [2, 3]]) ...but I'd say that numpy.broadcast_arrays is the cleanest way of obtaining pre-broadcasted views to pass to weave.blitz(). But alas, it appears that blitz doesn't work well with such non-contiguous views: tsb, pivb = broadcast_arrays(tableau[:,cand:cand+1], pivot) tableau = tableau - tsb * pivb ...works, but: tsb, pivb = broadcast_arrays(tableau[:,cand:cand+1], pivot) weave.blitz('tableau = tableau - tsb * pivb') ...returns wrong results. And, of course, converting them to contiguous through the array() function defeats the intended savings in memory and CPU cycles... Enzo ----- Original Message ----- From: "Robert Kern" To: "Discussion of Numerical Python" Sent: Sunday, January 02, 2011 10:08 AM Subject: Re: [Numpy-discussion] Arrays with aliased elements? > On Sat, Jan 1, 2011 at 19:42, Enzo Michelangeli > wrote: >> Is there any way, not involving compilation of C code, to define ndarrays >> where some rows or columns share the same data buffers? For example, >> something built by a hypothetical variant of the np.repeat() function, >> such >> that, if a = array([2,3]), calling: >> >> b = np.aliasedrepeat(x, [1, 2], axis=0) >> >> would return in b: >> >> array([[2, 3], >> [2, 3], >> [2, 3]]) >> >> ...with the understanding that the three rows would actually share the >> same >> data, so setting e.g.: >> >> b[0,1] = 5 >> >> ...would change b into: >> >> array([[2, 5], >> [2, 5], >> [2, 5]]) >> >> In other words, something with a behaviour similar to a list of lists: >> >>>>> a = [2,3] >>>>> b = [a,a,a] >>>>> b >> [[2, 3], [2, 3], [2, 3]] >>>>> b[0][1] = 5 >>>>> b >> [[2, 5], [2, 5], [2, 5]] >> >> This would save memory (and time spent in unnecessary copying) in some >> applications with large arrays, and would allow to cope with the current >> inability of weave.blitz to understand broadcasting rules, e.g. for >> calculating outer products (I mentioned this in a previous thread). > > See numpy.lib.stride_tricks for tools to do this, specifically the > as_strided() function. See numpy.broadcast_arrays() for the latter > functionality. 
> > http://docs.scipy.org/doc/numpy/reference/generated/numpy.broadcast_arrays.html > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From cournape at gmail.com Mon Jan 3 01:46:24 2011 From: cournape at gmail.com (David Cournapeau) Date: Mon, 3 Jan 2011 15:46:24 +0900 Subject: [Numpy-discussion] Prime size FFT: bluestein transform vs general chirp/z transform ? Message-ID: Hi, I finally took the time to clean up my code to speed up prime-size FFT (which use a O(N^2) algo in both numpy and scipy). The code is there: https://github.com/cournape/numpy/tree/bluestein (most of the code is tests, because numpy.fft had almost none). Bottom line: it is used only for prime numbers, and is faster than the current code for complex transforms > 500. Because of python + inherent bluestein overhead, this is mostly useful for "long" fft (where the speed up is significant - already 100x speed up for prime size ~ 50000). Several comments: - the overhead is pretty significant (on my machine, bluestein transfrom is slower for prime size < 500) - it could be used as such for real transforms, but the overhead would be even more significant (there is no bluestein transform for real transforms, so one needs to re-rexpress real transforms in term of complex ones, multiplying the overhead by 2x). There are several alternatives to make things faster (Rader-like transform, as used by fftw), but I think this would be quite hard to do in python without significant slowdown, because the code cannot be vectorized. - one could also decide to provide a chirp-z transform, of which Bluestein transform is a special case. Maybe this is more adapted to scipy ? - more generic code will require a few simple (but not trivial) arithmetic-like functions (find prime factors, find generator of Z/nZ groups with n prime, etc...). Where should I put those ? cheers, David From seb.haase at gmail.com Mon Jan 3 05:13:25 2011 From: seb.haase at gmail.com (Sebastian Haase) Date: Mon, 3 Jan 2011 11:13:25 +0100 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: References: Message-ID: Hi Erik, This is really neat ! Do I understand correctly, that you mean by "stride tricks", that your rolling_window is _not_ allocating any new memory ? IOW, If I have a large array using 500MB of memory, say of float32 of shape 125,1000,1000 and I want the last axis rolling of window size 11, what would the peak memory usage of that operation be ? How about renaming the option `window` to `window_size` (first I was thinking of things like hamming and hanning windows...)... ? Thanks, Sebastian Haase On Sat, Jan 1, 2011 at 5:29 AM, Erik Rigtorp wrote: > Hi, > > Implementing moving average, moving std and other functions working > over rolling windows using python for loops are slow. This is a > effective stride trick I learned from Keith Goodman's > Bottleneck code but generalized into arrays of > any dimension. This trick allows the loop to be performed in C code > and in the future hopefully using multiple cores. > > import numpy as np > > def rolling_window(a, window): > ? ?""" > ? ?Make an ndarray with a rolling window of the last dimension > > ? ?Parameters > ? 
?---------- > ? ?a : array_like > ? ? ? ?Array to add rolling window to > ? ?window : int > ? ? ? ?Size of rolling window > > ? ?Returns > ? ?------- > ? ?Array that is a view of the original array with a added dimension > ? ?of size w. > > ? ?Examples > ? ?-------- > ? ?>>> x=np.arange(10).reshape((2,5)) > ? ?>>> rolling_window(x, 3) > ? ?array([[[0, 1, 2], [1, 2, 3], [2, 3, 4]], > ? ? ? ? ? [[5, 6, 7], [6, 7, 8], [7, 8, 9]]]) > > ? ?Calculate rolling mean of last dimension: > ? ?>>> np.mean(rolling_window(x, 3), -1) > ? ?array([[ 1., ?2., ?3.], > ? ? ? ? ? [ 6., ?7., ?8.]]) > > ? ?""" > ? ?if window < 1: > ? ? ? ?raise ValueError, "`window` must be at least 1." > ? ?if window > a.shape[-1]: > ? ? ? ?raise ValueError, "`window` is too long." > ? ?shape = a.shape[:-1] + (a.shape[-1] - window + 1, window) > ? ?strides = a.strides + (a.strides[-1],) > ? ?return np.lib.stride_tricks.as_strided(a, shape=shape, strides=strides) > > > Using np.swapaxes(-1, axis) rolling aggregations over any axis can be computed. > > I submitted a pull request to add this to the stride_tricks module. > > Erik From erik at rigtorp.com Mon Jan 3 08:37:21 2011 From: erik at rigtorp.com (Erik Rigtorp) Date: Mon, 3 Jan 2011 08:37:21 -0500 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: References: Message-ID: On Mon, Jan 3, 2011 at 05:13, Sebastian Haase wrote: > Hi Erik, > This is really neat ! ?Do I understand correctly, that you mean by > "stride tricks", that your rolling_window is _not_ allocating any new > memory ? Yes, it's only a view. > IOW, If I have a large array using 500MB of memory, say of float32 of > shape 125,1000,1000 and I want the last axis rolling of window size > 11, what would the peak memory usage of that operation be ? It's only a view of the array, no copying is done. Though some operations like np.std() will copy the array, but that's more of a bug. In general It's hard to imagine any speedup gains by copying a 10GB array. > How about renaming the option `window` to `window_size` ?(first I was > thinking of things like hamming and hanning windows...)... ? Sounds fare. From kwgoodman at gmail.com Mon Jan 3 10:32:44 2011 From: kwgoodman at gmail.com (Keith Goodman) Date: Mon, 3 Jan 2011 07:32:44 -0800 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: References: Message-ID: On Fri, Dec 31, 2010 at 8:29 PM, Erik Rigtorp wrote: > Implementing moving average, moving std and other functions working > over rolling windows using python for loops are slow. This is a > effective stride trick I learned from Keith Goodman's > Bottleneck code but generalized into arrays of > any dimension. This trick allows the loop to be performed in C code > and in the future hopefully using multiple cores. I like using strides for moving window functions. 
The one downside I found is that it is slow when window * (arr.shape[axis] - window) is large: >> a = np.random.rand(1000000) >> b = rolling_window(a, 5000) >> import bottleneck as bn >> timeit bn.nanmean(b, axis=1) 1 loops, best of 3: 7.1 s per loop >> timeit bn.move_nanmean(a, window=5000, axis=0) 100 loops, best of 3: 7.99 ms per loop From kwgoodman at gmail.com Mon Jan 3 10:36:47 2011 From: kwgoodman at gmail.com (Keith Goodman) Date: Mon, 3 Jan 2011 07:36:47 -0800 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: References: Message-ID: On Mon, Jan 3, 2011 at 5:37 AM, Erik Rigtorp wrote: > It's only a view of the array, no copying is done. Though some > operations like np.std() ?will copy the array, but that's more of a > bug. In general It's hard to imagine any speedup gains by copying a > 10GB array. I don't think that np.std makes a copy of the input data if the input is an array. If the input is, for example, a list, then an array is created. From erik at rigtorp.com Mon Jan 3 10:41:11 2011 From: erik at rigtorp.com (Erik Rigtorp) Date: Mon, 3 Jan 2011 10:41:11 -0500 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: References: Message-ID: On Mon, Jan 3, 2011 at 10:36, Keith Goodman wrote: > On Mon, Jan 3, 2011 at 5:37 AM, Erik Rigtorp wrote: > >> It's only a view of the array, no copying is done. Though some >> operations like np.std() ?will copy the array, but that's more of a >> bug. In general It's hard to imagine any speedup gains by copying a >> 10GB array. > > I don't think that np.std makes a copy of the input data if the input > is an array. If the input is, for example, a list, then an array is > created. When I tried it on a big array, it tried to allocate a huge amount of memory. As I said it's probably a bug. From kwgoodman at gmail.com Mon Jan 3 10:52:28 2011 From: kwgoodman at gmail.com (Keith Goodman) Date: Mon, 3 Jan 2011 07:52:28 -0800 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: References: Message-ID: On Mon, Jan 3, 2011 at 7:41 AM, Erik Rigtorp wrote: > On Mon, Jan 3, 2011 at 10:36, Keith Goodman wrote: >> On Mon, Jan 3, 2011 at 5:37 AM, Erik Rigtorp wrote: >> >>> It's only a view of the array, no copying is done. Though some >>> operations like np.std() ?will copy the array, but that's more of a >>> bug. In general It's hard to imagine any speedup gains by copying a >>> 10GB array. >> >> I don't think that np.std makes a copy of the input data if the input >> is an array. If the input is, for example, a list, then an array is >> created. > > When I tried it on a big array, it tried to allocate a huge amount of > memory. As I said it's probably a bug. Yes, that would be a big bug. np.std does have to initialize the output array. If the window size is small compared to arr.shape[axis] then the memory taken by the output array is of the same order as that of the input array. Could that be what you are seeing? 
>> a = np.arange(10) Small window, output array shape (8,): >> rolling_window(a, 2) array([[0, 1], [1, 2], [2, 3], [3, 4], [4, 5], [5, 6], [6, 7], [7, 8], [8, 9]]) Big window, output array shape (2,): >> rolling_window(a, 9) array([[0, 1, 2, 3, 4, 5, 6, 7, 8], [1, 2, 3, 4, 5, 6, 7, 8, 9]]) From erik at rigtorp.com Mon Jan 3 10:55:32 2011 From: erik at rigtorp.com (Erik Rigtorp) Date: Mon, 3 Jan 2011 10:55:32 -0500 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: References: Message-ID: On Mon, Jan 3, 2011 at 10:52, Keith Goodman wrote: > On Mon, Jan 3, 2011 at 7:41 AM, Erik Rigtorp wrote: >> On Mon, Jan 3, 2011 at 10:36, Keith Goodman wrote: >>> On Mon, Jan 3, 2011 at 5:37 AM, Erik Rigtorp wrote: >>> >>>> It's only a view of the array, no copying is done. Though some >>>> operations like np.std() ?will copy the array, but that's more of a >>>> bug. In general It's hard to imagine any speedup gains by copying a >>>> 10GB array. >>> >>> I don't think that np.std makes a copy of the input data if the input >>> is an array. If the input is, for example, a list, then an array is >>> created. >> >> When I tried it on a big array, it tried to allocate a huge amount of >> memory. As I said it's probably a bug. > > Yes, that would be a big bug. > > np.std does have to initialize the output array. If the window size is > small compared to arr.shape[axis] then the memory taken by the output > array is of the same order as that of the input array. Could that be > what you are seeing? > >>> a = np.arange(10) > > Small window, output array shape (8,): > >>> rolling_window(a, 2) > array([[0, 1], > ? ? ? [1, 2], > ? ? ? [2, 3], > ? ? ? [3, 4], > ? ? ? [4, 5], > ? ? ? [5, 6], > ? ? ? [6, 7], > ? ? ? [7, 8], > ? ? ? [8, 9]]) > > Big window, output array shape (2,): > >>> rolling_window(a, 9) > array([[0, 1, 2, 3, 4, 5, 6, 7, 8], > ? ? ? [1, 2, 3, 4, 5, 6, 7, 8, 9]]) No the array was (500,2000) and i did np.std(rolling_window(a,252),-1) and it started to allocate > 2GB. From efiring at hawaii.edu Mon Jan 3 11:26:37 2011 From: efiring at hawaii.edu (Eric Firing) Date: Mon, 03 Jan 2011 06:26:37 -1000 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: References: Message-ID: <4D21F8BD.60003@hawaii.edu> On 12/31/2010 06:29 PM, Erik Rigtorp wrote: > Hi, > > Implementing moving average, moving std and other functions working > over rolling windows using python for loops are slow. This is a > effective stride trick I learned from Keith Goodman's > Bottleneck code but generalized into arrays of > any dimension. This trick allows the loop to be performed in C code > and in the future hopefully using multiple cores. > An alternative is to go straight to C, with a cython interface. If you look in the num/src subdirectory of http://currents.soest.hawaii.edu/hgstage/hgwebdir.cgi/pycurrents/ you will find this approach, labeled "ringbuf" and "runstats". See pycurrents/setup.py, and its driver, pycurrents/runsetup.py, to see how runstats is presently being built. Instead of calculating statistics independently each time the window is advanced one data point, the statistics are updated. I have not done any benchmarking, but I expect this approach to be quick. The code is old; I have not tried to update it to take advantage of cython's advances over pyrex. If I were writing it now, I might not bother with the C level at all; it could all be done in cython, probably with no speed penalty, and maybe even with reduced overhead. 
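The core idea, stripped of the ring buffer and C machinery, is just to update a running sum rather than recompute it at every window position. A rough pure-NumPy sketch for a moving mean (the function name is invented here; this is not the actual runstats code, which handles more statistics and edge cases):

import numpy as np

def move_mean_update(a, window):
    # keep a running sum; add the newest point and drop the oldest one
    # as the window advances, instead of re-summing `window` points
    a = np.asarray(a, dtype=float)
    n = a.shape[0] - window + 1
    out = np.empty(n)
    s = a[:window].sum()
    out[0] = s / window
    for i in range(1, n):
        s += a[i + window - 1] - a[i - 1]
        out[i] = s / window
    return out

The same add-one/drop-one update works for the sum of squares, so a moving std costs only a little more bookkeeping.
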
Eric From erik at rigtorp.com Mon Jan 3 11:32:07 2011 From: erik at rigtorp.com (Erik Rigtorp) Date: Mon, 3 Jan 2011 11:32:07 -0500 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: <4D21F8BD.60003@hawaii.edu> References: <4D21F8BD.60003@hawaii.edu> Message-ID: On Mon, Jan 3, 2011 at 11:26, Eric Firing wrote: > Instead of calculating statistics independently each time the window is > advanced one data point, the statistics are updated. ?I have not done > any benchmarking, but I expect this approach to be quick. This might accumulate numerical errors. But could be fine for many applications. > The code is old; I have not tried to update it to take advantage of > cython's advances over pyrex. ?If I were writing it now, I might not > bother with the C level at all; it could all be done in cython, probably > with no speed penalty, and maybe even with reduced overhead. > No doubt this would be faster, I just wanted to offer a general way to this in NumPy. From pivanov314 at gmail.com Mon Jan 3 16:44:09 2011 From: pivanov314 at gmail.com (Paul Ivanov) Date: Mon, 3 Jan 2011 13:44:09 -0800 Subject: [Numpy-discussion] numpy installation In-Reply-To: <962444.76273.qm@web29613.mail.ird.yahoo.com> References: <962444.76273.qm@web29613.mail.ird.yahoo.com> Message-ID: <20110103214409.GC17029@ykcyc> Waqar Rashid, on 2011-01-02 00:38, wrote: > Hi, > > trying to install numpy on MacOS with python 3.1 > > Having installation issues. Has anyone managed to install this on the Mac? > > regards > Waqar - you sent this to the IPython-User list, but I think you probably meant to send it to the numpy-discussion list, since your question does not pertain to IPython itself, so I'm forwarding your email there. Also, can you be more specific about what issues you are having? best, -- Paul Ivanov 314 address only used for lists, off-list direct email at: http://pirsquared.org | GPG/PGP key id: 0x0F3E28F7 -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 198 bytes Desc: Digital signature URL: From SSharma84 at slb.com Tue Jan 4 02:31:00 2011 From: SSharma84 at slb.com (Sachin Kumar Sharma) Date: Tue, 4 Jan 2011 07:31:00 +0000 Subject: [Numpy-discussion] newbie question (curve fitting Z=f(X,Y)) Message-ID: <75C2FED246299A478280FA1470EDA4430C9025E4@NL0230MBX06N1.DIR.slb.com> Hi, Absolute basic question I want to do following * Read data from excel sheet (three columns - X,Y,Z) * Curve fit a function Z=F(X,Y) * Plot X Vs Z (from data) and plot X Vs Z (from curve fit) Kindly advise me how to write a basic python script for the same. Thanks & Regards Sachin ************************************************************************ Sachin Kumar Sharma Senior Geomodeler -------------- next part -------------- An HTML attachment was scrubbed... URL: From jpscipy at gmail.com Tue Jan 4 03:30:24 2011 From: jpscipy at gmail.com (Justin Peel) Date: Tue, 4 Jan 2011 01:30:24 -0700 Subject: [Numpy-discussion] Submitting patches Message-ID: Hi all, I've been submitting some patches recently just by putting them on Trac. However, I noticed in the Numpy Developer Guide that it says: The recommended way to proceed is either to attach these files to an enhancement ticket in the Numpy Trac and send a mail about the enhancement to the NumPy mailing list. This line is rather confusing. Either the 'either' should be removed or 'and'->'or'. 
In other words, is it sufficient to submit the patch in Trac or should I also be emailing the Numpy mailing list about each patch I submit? Justin From cournape at gmail.com Tue Jan 4 07:34:19 2011 From: cournape at gmail.com (David Cournapeau) Date: Tue, 4 Jan 2011 21:34:19 +0900 Subject: [Numpy-discussion] NEP for faster ufuncs In-Reply-To: References: <4D115D2B.7070904@silveregg.co.jp> Message-ID: On Wed, Dec 22, 2010 at 11:20 AM, Mark Wiebe wrote: > On Tue, Dec 21, 2010 at 6:06 PM, David wrote: >> >> >> >> This looks pretty cool. I hope to be able to take a look at it during >> the christmas holidays. > > Thanks! Ok, I took some time to look into it, but I am far from understanding everything yet. I will need more time. One design issue which bothers me a bit is the dynamically created structure for the iterator - do you have some benchmarks which show that this design is significantly better than a plain old C data structure with a couple of dynamically allocated arrays ? Besides bypassing the compiler type checks, I am a bit worried about the ability to extend the iterator through "inheritence in C" like I did with neighborhood iterator, but maybe I should just try it. I think the code would benefit from smaller functions, too - 500+ lines functions is just too much IMO, it should be split up. To get a deeper understanding of the code, I am starting to implement several benchmarks to compare old and new iterator - do you already have some of them handy ? Thanks for the hard work, that's a really nice piece of code, David From brockp at umich.edu Tue Jan 4 10:40:08 2011 From: brockp at umich.edu (Brock Palen) Date: Tue, 4 Jan 2011 10:40:08 -0500 Subject: [Numpy-discussion] NumPy on HPC podcast Message-ID: <466AE4A6-89BF-40FE-8A95-3DC88D32C6ED@umich.edu> I host and HPC podcast with Jeff Squyres of OpenMPI fame: www.rce-cast.com We would like to have a developer or two from NumPy on the show to represent the project. We do this over phone or skype and takes about an hour. Feel free to contact me out of band. I hope to hear from you soon! Brock Palen www.umich.edu/~brockp Center for Advanced Computing brockp at umich.edu (734)936-1985 From seb.haase at gmail.com Tue Jan 4 11:06:05 2011 From: seb.haase at gmail.com (Sebastian Haase) Date: Tue, 4 Jan 2011 17:06:05 +0100 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: References: <4D21F8BD.60003@hawaii.edu> Message-ID: On Mon, Jan 3, 2011 at 5:32 PM, Erik Rigtorp wrote: > On Mon, Jan 3, 2011 at 11:26, Eric Firing wrote: >> Instead of calculating statistics independently each time the window is >> advanced one data point, the statistics are updated. ?I have not done >> any benchmarking, but I expect this approach to be quick. > > This might accumulate numerical errors. But could be fine for many applications. > >> The code is old; I have not tried to update it to take advantage of >> cython's advances over pyrex. ?If I were writing it now, I might not >> bother with the C level at all; it could all be done in cython, probably >> with no speed penalty, and maybe even with reduced overhead. >> > > No doubt this would be faster, I just wanted to offer a general way to > this in NumPy. > _______________________________________________ BTW, some of these operations can be done using scipy's ndimage - right ? Any comments ? How does the performance compare ? ndimage might have more options regarding edge handling, or ? 
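I'm thinking of something along these lines (just a guess at the right call, I haven't benchmarked or checked it carefully):

import numpy as np
from scipy import ndimage

a = np.random.rand(500, 2000)
# moving mean over a 252-point window along the last axis;
# `mode` picks the edge handling ('reflect', 'nearest', 'constant', ...);
# note the window is centered rather than trailing, unless `origin` is shifted
avg = ndimage.uniform_filter1d(a, size=252, axis=-1, mode='nearest')
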
Cheers, Sebastian Haase From kwgoodman at gmail.com Tue Jan 4 11:14:58 2011 From: kwgoodman at gmail.com (Keith Goodman) Date: Tue, 4 Jan 2011 08:14:58 -0800 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: References: <4D21F8BD.60003@hawaii.edu> Message-ID: On Tue, Jan 4, 2011 at 8:06 AM, Sebastian Haase wrote: > On Mon, Jan 3, 2011 at 5:32 PM, Erik Rigtorp wrote: >> On Mon, Jan 3, 2011 at 11:26, Eric Firing wrote: >>> Instead of calculating statistics independently each time the window is >>> advanced one data point, the statistics are updated. ?I have not done >>> any benchmarking, but I expect this approach to be quick. >> >> This might accumulate numerical errors. But could be fine for many applications. >> >>> The code is old; I have not tried to update it to take advantage of >>> cython's advances over pyrex. ?If I were writing it now, I might not >>> bother with the C level at all; it could all be done in cython, probably >>> with no speed penalty, and maybe even with reduced overhead. >>> >> >> No doubt this would be faster, I just wanted to offer a general way to >> this in NumPy. >> _______________________________________________ > > BTW, some of these operations can be done using scipy's ndimage ?- right ? > Any comments ? ?How does the performance compare ? > ndimage might have more options regarding edge handling, or ? Take a look at the moving window function in the development version of the la package: https://github.com/kwgoodman/la/blob/master/la/farray/mov.py Many of the moving window functions offer three calculation methods: filter (ndimage), strides (the strides trick discussed in this thread), and loop (a simple python loop). For example: >> a = np.random.rand(500,2000) >> timeit la.farray.mov_max(a, window=252, axis=-1, method='filter') 1 loops, best of 3: 336 ms per loop >> timeit la.farray.mov_max(a, window=252, axis=-1, method='strides') 1 loops, best of 3: 609 ms per loop >> timeit la.farray.mov_max(a, window=252, axis=-1, method='loop') 1 loops, best of 3: 638 ms per loop No one method is best for all situations. That is one of the reasons I started the Bottleneck package. I figured Cython could beat them all. From jpscipy at gmail.com Tue Jan 4 13:49:32 2011 From: jpscipy at gmail.com (Justin Peel) Date: Tue, 4 Jan 2011 11:49:32 -0700 Subject: [Numpy-discussion] Question regarding submitting patches Message-ID: Hi all, I've been submitting some patches recently just by putting them on Trac. However, I noticed in the Numpy Developer Guide that it says: The recommended way to proceed is either to attach these files to an enhancement ticket in the Numpy Trac and send a mail about the enhancement to the NumPy mailing list. This line is rather confusing. Either the 'either' should be removed or 'and'->'or'. In other words, is it sufficient to submit the patch in Trac or should I also be emailing the Numpy mailing list about each patch I submit? Justin From mwwiebe at gmail.com Tue Jan 4 15:04:34 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Tue, 4 Jan 2011 12:04:34 -0800 Subject: [Numpy-discussion] Giving numpy the ability to multi-iterate excluding an axis In-Reply-To: References: Message-ID: On Sat, Jan 1, 2011 at 11:23 AM, John Salvatier wrote: > This thread is a bit old, but since it's not possible to use the C-API is > possible to accomplish this same thing with the Python API? > I've committed Python exposure for nested iteration to the new_iterator branch. In doing so, I also changed the mechanism in C. 
I found that it was simpler to expose to Python if I added a Reset function which gives new base data pointers, and this also simplifies C code using nested iterators. The Python code a = arange(2).reshape(2,1) b = arange(3).reshape(1,3) i, j = np.nested_iters([a,b], [[0],[1]]) for x in i: print "inner:" for y in j: print y[0], y[1] gives inner: 0 0 0 1 0 2 inner: 1 0 1 1 1 2 and C code for nested iteration looks something like this: NpyIter *iter1, *iter1; NpyIter_IterNext_Fn iternext1, iternext2; char **dataptrs1; /* * With the exact same operands, no copies allowed, and * no axis in op_axes used both in iter1 and iter2. * Buffering may be enabled for iter2, but not for iter1. */ iter1 = ...; iter2 = ...; iternext1 = NpyIter_GetIterNext(iter1); iternext2 = NpyIter_GetIterNext(iter2); dataptrs1 = NpyIter_GetDataPtrArray(iter1); do { NpyIter_ResetBasePointers(iter2, dataptrs1); do { /* Use the iter2 values */ } while (iternext2(iter2)); } while (iternext1(iter1)); Cheers, Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsalvati at u.washington.edu Tue Jan 4 15:15:56 2011 From: jsalvati at u.washington.edu (John Salvatier) Date: Tue, 4 Jan 2011 12:15:56 -0800 Subject: [Numpy-discussion] Giving numpy the ability to multi-iterate excluding an axis In-Reply-To: References: Message-ID: Wow, great! I'm excited to try this. I think your patch significantly increases the extendability of numpy. Is the C-API exposed currently? Can you use the API from Cython (meaning is the numpy.pxd file updated)? On Tue, Jan 4, 2011 at 12:04 PM, Mark Wiebe wrote: > On Sat, Jan 1, 2011 at 11:23 AM, John Salvatier > wrote: > >> This thread is a bit old, but since it's not possible to use the C-API is >> possible to accomplish this same thing with the Python API? >> > > I've committed Python exposure for nested iteration to the new_iterator > branch. In doing so, I also changed the mechanism in C. I found that it > was simpler to expose to Python if I added a Reset function which gives new > base data pointers, and this also simplifies C code using nested iterators. > > The Python code > > a = arange(2).reshape(2,1) > b = arange(3).reshape(1,3) > > i, j = np.nested_iters([a,b], [[0],[1]]) > for x in i: > print "inner:" > for y in j: > print y[0], y[1] > > > gives > > inner: > 0 0 > 0 1 > 0 2 > inner: > 1 0 > 1 1 > 1 2 > > > and C code for nested iteration looks something like this: > > NpyIter *iter1, *iter1; > NpyIter_IterNext_Fn iternext1, iternext2; > char **dataptrs1; > > /* > * With the exact same operands, no copies allowed, and > * no axis in op_axes used both in iter1 and iter2. > * Buffering may be enabled for iter2, but not for iter1. > */ > iter1 = ...; iter2 = ...; > > iternext1 = NpyIter_GetIterNext(iter1); > iternext2 = NpyIter_GetIterNext(iter2); > dataptrs1 = NpyIter_GetDataPtrArray(iter1); > > do { > NpyIter_ResetBasePointers(iter2, dataptrs1); > do { > /* Use the iter2 values */ > } while (iternext2(iter2)); > } while (iternext1(iter1)); > > Cheers, > Mark > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From mwwiebe at gmail.com Tue Jan 4 15:59:48 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Tue, 4 Jan 2011 12:59:48 -0800 Subject: [Numpy-discussion] Giving numpy the ability to multi-iterate excluding an axis In-Reply-To: References: Message-ID: On Tue, Jan 4, 2011 at 12:15 PM, John Salvatier wrote: > Wow, great! I'm excited to try this. I think your patch significantly > increases the extendability of numpy. > > Is the C-API exposed currently? Can you use the API from Cython (meaning is > the numpy.pxd file updated)? > The C-API isn't exposed yet, but that won't be too difficult since it's mostly a matter of adding all the functions to the arrays in the python setup files. I thought I might do that and look at plugging it into numexpr at the same time, since to be able to use the iterator's buffering and numexpr's multithreading together will require some small additions to the iterator. Cheers, Mark On Tue, Jan 4, 2011 at 12:04 PM, Mark Wiebe wrote: > >> On Sat, Jan 1, 2011 at 11:23 AM, John Salvatier < >> jsalvati at u.washington.edu> wrote: >> >>> This thread is a bit old, but since it's not possible to use the C-API is >>> possible to accomplish this same thing with the Python API? >>> >> >> I've committed Python exposure for nested iteration to the new_iterator >> branch. In doing so, I also changed the mechanism in C. I found that it >> was simpler to expose to Python if I added a Reset function which gives new >> base data pointers, and this also simplifies C code using nested iterators. >> >> The Python code >> >> a = arange(2).reshape(2,1) >> b = arange(3).reshape(1,3) >> >> i, j = np.nested_iters([a,b], [[0],[1]]) >> for x in i: >> print "inner:" >> for y in j: >> print y[0], y[1] >> >> >> gives >> >> inner: >> 0 0 >> 0 1 >> 0 2 >> inner: >> 1 0 >> 1 1 >> 1 2 >> >> >> and C code for nested iteration looks something like this: >> >> NpyIter *iter1, *iter1; >> NpyIter_IterNext_Fn iternext1, iternext2; >> char **dataptrs1; >> >> /* >> * With the exact same operands, no copies allowed, and >> * no axis in op_axes used both in iter1 and iter2. >> * Buffering may be enabled for iter2, but not for iter1. >> */ >> iter1 = ...; iter2 = ...; >> >> iternext1 = NpyIter_GetIterNext(iter1); >> iternext2 = NpyIter_GetIterNext(iter2); >> dataptrs1 = NpyIter_GetDataPtrArray(iter1); >> >> do { >> NpyIter_ResetBasePointers(iter2, dataptrs1); >> do { >> /* Use the iter2 values */ >> } while (iternext2(iter2)); >> } while (iternext1(iter1)); >> >> Cheers, >> Mark >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwwiebe at gmail.com Tue Jan 4 16:01:44 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Tue, 4 Jan 2011 13:01:44 -0800 Subject: [Numpy-discussion] Giving numpy the ability to multi-iterate excluding an axis In-Reply-To: References: Message-ID: Oh, and I'm not sure about Cython, since I've never looked into its details. I imagine Cython will want to short circuit some of the Python exposure code, since accessing the iterator values creates new array objects. -Mark On Tue, Jan 4, 2011 at 12:59 PM, Mark Wiebe wrote: > On Tue, Jan 4, 2011 at 12:15 PM, John Salvatier > wrote: > >> Wow, great! 
I'm excited to try this. I think your patch significantly >> increases the extendability of numpy. >> >> Is the C-API exposed currently? Can you use the API from Cython (meaning >> is the numpy.pxd file updated)? >> > > The C-API isn't exposed yet, but that won't be too difficult since it's > mostly a matter of adding all the functions to the arrays in the python > setup files. I thought I might do that and look at plugging it into numexpr > at the same time, since to be able to use the iterator's buffering and > numexpr's multithreading together will require some small additions to the > iterator. > > Cheers, > Mark > > On Tue, Jan 4, 2011 at 12:04 PM, Mark Wiebe wrote: >> >>> On Sat, Jan 1, 2011 at 11:23 AM, John Salvatier < >>> jsalvati at u.washington.edu> wrote: >>> >>>> This thread is a bit old, but since it's not possible to use the C-API >>>> is possible to accomplish this same thing with the Python API? >>>> >>> >>> I've committed Python exposure for nested iteration to the new_iterator >>> branch. In doing so, I also changed the mechanism in C. I found that it >>> was simpler to expose to Python if I added a Reset function which gives new >>> base data pointers, and this also simplifies C code using nested iterators. >>> >>> The Python code >>> >>> a = arange(2).reshape(2,1) >>> b = arange(3).reshape(1,3) >>> >>> i, j = np.nested_iters([a,b], [[0],[1]]) >>> for x in i: >>> print "inner:" >>> for y in j: >>> print y[0], y[1] >>> >>> >>> gives >>> >>> inner: >>> 0 0 >>> 0 1 >>> 0 2 >>> inner: >>> 1 0 >>> 1 1 >>> 1 2 >>> >>> >>> and C code for nested iteration looks something like this: >>> >>> NpyIter *iter1, *iter1; >>> NpyIter_IterNext_Fn iternext1, iternext2; >>> char **dataptrs1; >>> >>> /* >>> * With the exact same operands, no copies allowed, and >>> * no axis in op_axes used both in iter1 and iter2. >>> * Buffering may be enabled for iter2, but not for iter1. >>> */ >>> iter1 = ...; iter2 = ...; >>> >>> iternext1 = NpyIter_GetIterNext(iter1); >>> iternext2 = NpyIter_GetIterNext(iter2); >>> dataptrs1 = NpyIter_GetDataPtrArray(iter1); >>> >>> do { >>> NpyIter_ResetBasePointers(iter2, dataptrs1); >>> do { >>> /* Use the iter2 values */ >>> } while (iternext2(iter2)); >>> } while (iternext1(iter1)); >>> >>> Cheers, >>> Mark >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsalvati at u.washington.edu Tue Jan 4 16:05:44 2011 From: jsalvati at u.washington.edu (John Salvatier) Date: Tue, 4 Jan 2011 13:05:44 -0800 Subject: [Numpy-discussion] Giving numpy the ability to multi-iterate excluding an axis In-Reply-To: References: Message-ID: Cython just has interfaces to the C-API, I think. On Tue, Jan 4, 2011 at 1:01 PM, Mark Wiebe wrote: > Oh, and I'm not sure about Cython, since I've never looked into its > details. I imagine Cython will want to short circuit some of the Python > exposure code, since accessing the iterator values creates new array > objects. > > -Mark > > > On Tue, Jan 4, 2011 at 12:59 PM, Mark Wiebe wrote: > >> On Tue, Jan 4, 2011 at 12:15 PM, John Salvatier < >> jsalvati at u.washington.edu> wrote: >> >>> Wow, great! I'm excited to try this. 
I think your patch significantly >>> increases the extendability of numpy. >>> >>> Is the C-API exposed currently? Can you use the API from Cython (meaning >>> is the numpy.pxd file updated)? >>> >> >> The C-API isn't exposed yet, but that won't be too difficult since it's >> mostly a matter of adding all the functions to the arrays in the python >> setup files. I thought I might do that and look at plugging it into numexpr >> at the same time, since to be able to use the iterator's buffering and >> numexpr's multithreading together will require some small additions to the >> iterator. >> >> Cheers, >> Mark >> >> On Tue, Jan 4, 2011 at 12:04 PM, Mark Wiebe wrote: >>> >>>> On Sat, Jan 1, 2011 at 11:23 AM, John Salvatier < >>>> jsalvati at u.washington.edu> wrote: >>>> >>>>> This thread is a bit old, but since it's not possible to use the C-API >>>>> is possible to accomplish this same thing with the Python API? >>>>> >>>> >>>> I've committed Python exposure for nested iteration to the new_iterator >>>> branch. In doing so, I also changed the mechanism in C. I found that it >>>> was simpler to expose to Python if I added a Reset function which gives new >>>> base data pointers, and this also simplifies C code using nested iterators. >>>> >>>> The Python code >>>> >>>> a = arange(2).reshape(2,1) >>>> b = arange(3).reshape(1,3) >>>> >>>> i, j = np.nested_iters([a,b], [[0],[1]]) >>>> for x in i: >>>> print "inner:" >>>> for y in j: >>>> print y[0], y[1] >>>> >>>> >>>> gives >>>> >>>> inner: >>>> 0 0 >>>> 0 1 >>>> 0 2 >>>> inner: >>>> 1 0 >>>> 1 1 >>>> 1 2 >>>> >>>> >>>> and C code for nested iteration looks something like this: >>>> >>>> NpyIter *iter1, *iter1; >>>> NpyIter_IterNext_Fn iternext1, iternext2; >>>> char **dataptrs1; >>>> >>>> /* >>>> * With the exact same operands, no copies allowed, and >>>> * no axis in op_axes used both in iter1 and iter2. >>>> * Buffering may be enabled for iter2, but not for iter1. >>>> */ >>>> iter1 = ...; iter2 = ...; >>>> >>>> iternext1 = NpyIter_GetIterNext(iter1); >>>> iternext2 = NpyIter_GetIterNext(iter2); >>>> dataptrs1 = NpyIter_GetDataPtrArray(iter1); >>>> >>>> do { >>>> NpyIter_ResetBasePointers(iter2, dataptrs1); >>>> do { >>>> /* Use the iter2 values */ >>>> } while (iternext2(iter2)); >>>> } while (iternext1(iter1)); >>>> >>>> Cheers, >>>> Mark >>>> >>>> _______________________________________________ >>>> NumPy-Discussion mailing list >>>> NumPy-Discussion at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>> >>>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwwiebe at gmail.com Tue Jan 4 16:37:01 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Tue, 4 Jan 2011 13:37:01 -0800 Subject: [Numpy-discussion] NEP for faster ufuncs In-Reply-To: References: <4D115D2B.7070904@silveregg.co.jp> Message-ID: On Tue, Jan 4, 2011 at 4:34 AM, David Cournapeau wrote: > > Ok, I took some time to look into it, but I am far from understanding > everything yet. I will need more time. > Yeah, it ended up being pretty large. I think the UFunc code will shrink substantially when it uses this iterator, which is something I was targeting. 
One design issue which bothers me a bit is the dynamically created > structure for the iterator - do you have some benchmarks which show > that this design is significantly better than a plain old C data > structure with a couple of dynamically allocated arrays ? Besides > bypassing the compiler type checks, I am a bit worried about the > ability to extend the iterator through "inheritence in C" like I did > with neighborhood iterator, but maybe I should just try it. > I know what you mean - if I could use C++ templates the implementation could probably have the best of both worlds, but seeing as NumPy is in C I tried to compromise mostly towards higher performance. I don't have benchmarks showing that the implementation is faster, but I did validate that the compiler does the optimizations I want it to do. For example, the specialized iternext function for 1 operand and 1 dimension, a common case because of dimension coalescing, looks like this on my machine: 0: 48 83 47 58 01 addq $0x1,0x58(%rdi) 5: 48 8b 47 60 mov 0x60(%rdi),%rax 9: 48 01 47 68 add %rax,0x68(%rdi) d: 48 8b 47 50 mov 0x50(%rdi),%rax 11: 48 39 47 58 cmp %rax,0x58(%rdi) 15: 0f 9c c0 setl %al 18: 0f b6 c0 movzbl %al,%eax 1b: c3 retq The function has no branches and all memory accesses are directly offset from the iter pointer %rdi, something I think is pretty good. If this data was in separately allocated arrays, I think it would hurt locality as well as add some more instructions. In the implementation, I tried to structure the data access macros so errors are easy to spot. Accessing the bufferdata and the axisdata isn't typed, but I can think of ways to do that. I was viewing the implementation as fully opaque to any non-iterator code, even within NumPy, do you think such access will be necessary? I think the code would benefit from smaller functions, too - 500+ > lines functions is just too much IMO, it should be split up. > I definitely agree, I've been splitting things up as they got large, but that's not finished. I also think the main iterator .c file is too large and needs splitting up. To get a deeper understanding of the code, I am starting to implement > several benchmarks to compare old and new iterator - do you already > have some of them handy ? > So far I've just done timing with the Python exposure, C-based benchmarking is welcome. Where possible, NPY_ITER_NO_INNER_ITERATION should be used, since it exposes the possibility of longer inner loops with no function calls. An example where this is not possible is when coordinates are required. I should probably put together a collection of copy/paste templates for typical use. Thanks for the hard work, that's a really nice piece of code, > Thanks for taking the time to look into it, Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From oliphant at enthought.com Tue Jan 4 17:21:43 2011 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 4 Jan 2011 16:21:43 -0600 Subject: [Numpy-discussion] NumPy on HPC podcast In-Reply-To: <466AE4A6-89BF-40FE-8A95-3DC88D32C6ED@umich.edu> References: <466AE4A6-89BF-40FE-8A95-3DC88D32C6ED@umich.edu> Message-ID: <1DA9923A-4AB9-444F-948D-8627EA5E014D@enthought.com> Hi Brock, I would be happy to participate if I can. When is it? -Travis On Jan 4, 2011, at 9:40 AM, Brock Palen wrote: > I host and HPC podcast with Jeff Squyres of OpenMPI fame: > > www.rce-cast.com > > We would like to have a developer or two from NumPy on the show to represent the project. 
We do this over phone or skype and takes about an hour. > > Feel free to contact me out of band. I hope to hear from you soon! > > Brock Palen > www.umich.edu/~brockp > Center for Advanced Computing > brockp at umich.edu > (734)936-1985 > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion --- Travis Oliphant Enthought, Inc. oliphant at enthought.com 1-512-536-1057 http://www.enthought.com From brockp at umich.edu Tue Jan 4 19:06:28 2011 From: brockp at umich.edu (Brock Palen) Date: Tue, 4 Jan 2011 19:06:28 -0500 Subject: [Numpy-discussion] NumPy on HPC podcast In-Reply-To: <1DA9923A-4AB9-444F-948D-8627EA5E014D@enthought.com> References: <466AE4A6-89BF-40FE-8A95-3DC88D32C6ED@umich.edu> <1DA9923A-4AB9-444F-948D-8627EA5E014D@enthought.com> Message-ID: We record the show in advance, edit and then release. We would hope to record in the next week or two. Brock Palen www.umich.edu/~brockp Center for Advanced Computing brockp at umich.edu (734)936-1985 On Jan 4, 2011, at 5:21 PM, Travis Oliphant wrote: > Hi Brock, > > I would be happy to participate if I can. When is it? > > -Travis > > On Jan 4, 2011, at 9:40 AM, Brock Palen wrote: > >> I host and HPC podcast with Jeff Squyres of OpenMPI fame: >> >> www.rce-cast.com >> >> We would like to have a developer or two from NumPy on the show to represent the project. We do this over phone or skype and takes about an hour. >> >> Feel free to contact me out of band. I hope to hear from you soon! >> >> Brock Palen >> www.umich.edu/~brockp >> Center for Advanced Computing >> brockp at umich.edu >> (734)936-1985 >> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > --- > Travis Oliphant > Enthought, Inc. > oliphant at enthought.com > 1-512-536-1057 > http://www.enthought.com > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > From faltet at pytables.org Wed Jan 5 07:26:08 2011 From: faltet at pytables.org (Francesc Alted) Date: Wed, 5 Jan 2011 13:26:08 +0100 Subject: [Numpy-discussion] NEP for faster ufuncs In-Reply-To: References: <4D115D2B.7070904@silveregg.co.jp> Message-ID: 2011/1/4, Mark Wiebe : >> To get a deeper understanding of the code, I am starting to implement >> several benchmarks to compare old and new iterator - do you already >> have some of them handy ? >> > > So far I've just done timing with the Python exposure, C-based benchmarking > is welcome. Where possible, NPY_ITER_NO_INNER_ITERATION should be used, > since it exposes the possibility of longer inner loops with no function > calls. An example where this is not possible is when coordinates are > required. I should probably put together a collection of copy/paste > templates for typical use. Sorry for the naive question, but I use the numpy.fromiter() iterator quite a few in my projects. and I'm curious on whether this new iterator would allow numpy.fromiter() to go faster (I mean, in Python space). Any hint? 
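The typical pattern I have in mind is just building an array straight from a generator, without a temporary list, e.g.:

import numpy as np

# passing `count` lets fromiter pre-allocate the output array
a = np.fromiter((i * i for i in xrange(1000000)), dtype=np.float64, count=1000000)
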
-- Francesc Alted From jsseabold at gmail.com Wed Jan 5 10:30:08 2011 From: jsseabold at gmail.com (Skipper Seabold) Date: Wed, 5 Jan 2011 10:30:08 -0500 Subject: [Numpy-discussion] Question regarding submitting patches In-Reply-To: References: Message-ID: On Tue, Jan 4, 2011 at 1:49 PM, Justin Peel wrote: > Hi all, > > I've been submitting some patches recently just by putting them on > Trac. However, I noticed in the Numpy Developer Guide that it says: > > ? The recommended way to proceed is either to attach these files to > an enhancement ticket in the Numpy Trac and send a mail about the > enhancement to the NumPy mailing list. > > This line is rather confusing. Either the 'either' should be removed > or 'and'->'or'. In other words, is it sufficient to submit the patch > in Trac or should I also be emailing the Numpy mailing list about each > patch I submit? > I am not positive, but I think that having them on Trac ensures that they're not lost and is sufficient. An e-mail serves to draw some attention (or not) for a review or speedier inclusion. Should this recommendation in the docs be changed or amended with the switch to git? Skipper From ben.root at ou.edu Wed Jan 5 11:07:13 2011 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 5 Jan 2011 10:07:13 -0600 Subject: [Numpy-discussion] Question regarding submitting patches In-Reply-To: References: Message-ID: On Wed, Jan 5, 2011 at 9:30 AM, Skipper Seabold wrote: > On Tue, Jan 4, 2011 at 1:49 PM, Justin Peel wrote: > > Hi all, > > > > I've been submitting some patches recently just by putting them on > > Trac. However, I noticed in the Numpy Developer Guide that it says: > > > > The recommended way to proceed is either to attach these files to > > an enhancement ticket in the Numpy Trac and send a mail about the > > enhancement to the NumPy mailing list. > > > > This line is rather confusing. Either the 'either' should be removed > > or 'and'->'or'. In other words, is it sufficient to submit the patch > > in Trac or should I also be emailing the Numpy mailing list about each > > patch I submit? > > > > I am not positive, but I think that having them on Trac ensures that > they're not lost and is sufficient. An e-mail serves to draw some > attention (or not) for a review or speedier inclusion. > > Should this recommendation in the docs be changed or amended with the > switch to git? > > Skipper > At the very least, the wording is grammatically incorrect and should be fixed. Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From jdh2358 at gmail.com Wed Jan 5 11:27:32 2011 From: jdh2358 at gmail.com (John Hunter) Date: Wed, 5 Jan 2011 10:27:32 -0600 Subject: [Numpy-discussion] segfault on complex array on solaris x86 Message-ID: johnh at udesktop253:~> gcc --version gcc (GCC) 3.4.3 (csl-sol210-3_4-branch+sol_rpath) Copyright (C) 2004 Free Software Foundation, Inc. This is free software; see the source for copying conditions. There is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 
johnh at udesktop253:~> uname -a SunOS udesktop253 5.10 Generic_142910-17 i86pc i386 i86pc johnh at udesktop253:~> cat test.py import numpy as np print np.__version__ fs = 1000 t = np.linspace(0, 0.3, 301) A = np.array([2, 8]).reshape(-1, 1) f = np.array([150, 140]).reshape(-1, 1) xn = (A * np.exp(2j * np.pi * f * t)).sum(axis=0) johnh at udesktop253:~> python test.py 2.0.0.dev-9451260 Segmentation Fault (core dumped) johnh at udesktop253:~> johnh at udesktop253:~> sudo pstack /var/core/core.python.957 core '/var/core/core.python.957' of 9397: python test.py febf1928 cexp (0, 0, 0, 0, 8060ab0, 84321ac) + 1b0 fe9657e0 npy_cexp (80458e0, 0, 0, 0, 0, 84e2530) + 30 fe95064f nc_exp (8045920, 84e72a0, 8045978, 8045920, 10, 10) + 3f fe937d5b PyUFunc_D_D (84e2530, 84e20f4, 84e25b0, fe950610, 1, 0) + 5b fe95e818 PyUFunc_GenericFunction (81e96e0, 807deac, 0, 80460b8, 2, 2) + 448 fe95fb10 ufunc_generic_call (81e96e0, 807deac, 0, fe98a820) + 70 feeb2d78 PyObject_Call (81e96e0, 807deac, 0, 80a24ec, 8061c08, 0) + 28 fef11900 PyEval_EvalFrame (80a2394, 81645a0, 8079824, 8079824) + 146c fef17708 PyEval_EvalCodeEx (81645a0, 8079824, 8079824, 0, 0, 0) + 620 fef178af PyEval_EvalCode (81645a0, 8079824, 8079824, 8061488, fef3d9ee, 0) + 2f fef3d095 PyRun_FileExFlags (feb91c98, 804687b, 101, 8079824, 8079824, 1) + 75 fef3d9ee PyRun_SimpleFileExFlags (feb91c98, 804687b, 1, 80465a8, fef454a1, 804687b) + 172 fef3e4fd PyRun_AnyFileExFlags (feb91c98, 804687b, 1, 80465a8) + 61 fef454a1 Py_Main (1, 80466b8, feb1cf35, fea935a1, 29, feb96750) + 9d9 08050862 main (2, 80466b8, 80466c4) + 22 08050758 _start (2, 8046874, 804687b, 0, 8046883, 80468ad) + 60 -------------- next part -------------- An HTML attachment was scrubbed... URL: From millman at berkeley.edu Wed Jan 5 12:50:16 2011 From: millman at berkeley.edu (Jarrod Millman) Date: Wed, 5 Jan 2011 18:50:16 +0100 Subject: [Numpy-discussion] Question regarding submitting patches In-Reply-To: References: Message-ID: On Tue, Jan 4, 2011 at 7:49 PM, Justin Peel wrote: > I've been submitting some patches recently just by putting them on > Trac. However, I noticed in the Numpy Developer Guide that it says: > > ? The recommended way to proceed is either to attach these files to > an enhancement ticket in the Numpy Trac and send a mail about the > enhancement to the NumPy mailing list. For now, we should just remove the 'either'. I will take a look at how to reintegrate the changes into the master gitwash document later tonight: https://github.com/matthew-brett/gitwash/blob/master/gitwash/patching.rst Thanks for pointing out the grammatical error. Jarrod PS. Just to be clear, the developer doc you are referring to is: http://docs.scipy.org/doc/numpy/dev/gitwash/patching.html I just want to make sure that there isn't some old wiki page somewhere that needs to be deleted. From mwwiebe at gmail.com Wed Jan 5 13:01:18 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Wed, 5 Jan 2011 10:01:18 -0800 Subject: [Numpy-discussion] NEP for faster ufuncs In-Reply-To: References: <4D115D2B.7070904@silveregg.co.jp> Message-ID: On Wed, Jan 5, 2011 at 4:26 AM, Francesc Alted wrote: > Sorry for the naive question, but I use the numpy.fromiter() iterator > quite a few in my projects. and I'm curious on whether this new > iterator would allow numpy.fromiter() to go faster (I mean, in Python > space). Any hint? 
> The new iterator doesn't offer any help to fromiter in general, but if the iterator being given to the function is a new iterator it would be possible to handle it specially and get a big speedup. -Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Wed Jan 5 21:06:18 2011 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 5 Jan 2011 20:06:18 -0600 Subject: [Numpy-discussion] newbie question (curve fitting Z=f(X,Y)) In-Reply-To: <75C2FED246299A478280FA1470EDA4430C9025E4@NL0230MBX06N1.DIR.slb.com> References: <75C2FED246299A478280FA1470EDA4430C9025E4@NL0230MBX06N1.DIR.slb.com> Message-ID: On Tue, Jan 4, 2011 at 1:31 AM, Sachin Kumar Sharma wrote: > Hi, > > > > Absolute basic question I want to do following > > > > ? Read data from excel sheet (three columns ? X,Y,Z) > > ? Curve fit a function Z=F(X,Y) > > ? Plot X Vs Z (from data) and plot X Vs Z (from curve fit) > > > > > > Kindly advise me how to write a basic python script for the same. > > > > Thanks & Regards > > > > Sachin > > Sachin, If you need to read data from excel files directly, there is a tool called python-excel: http://www.python-excel.org/ Although, personally, I would just simply recommend exporting the excel data into a csv file and then use numpy's loadtxt() function to read the text file. As for curve-fitting, you are likely want to use scipy's optimize toolkit: http://docs.scipy.org/doc/scipy/reference/optimize.html Finally, you use matplotlib for plotting. This is very high-level and doesn't go into much detail, but I am sure if you read up on how to use these tools, you will be able to get what you want. If you have any specific questions, then try asking either here in the numpy mailing list (for numpy-related issues), or the scipy-users mailing list (for scipy related issues) or the matplotlib-users mailing list (for plotting issues). I hope that helps! Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From faltet at pytables.org Thu Jan 6 04:48:18 2011 From: faltet at pytables.org (Francesc Alted) Date: Thu, 6 Jan 2011 10:48:18 +0100 Subject: [Numpy-discussion] NEP for faster ufuncs In-Reply-To: References: <4D115D2B.7070904@silveregg.co.jp> Message-ID: 2011/1/5, Mark Wiebe : > On Wed, Jan 5, 2011 at 4:26 AM, Francesc Alted wrote: > >> Sorry for the naive question, but I use the numpy.fromiter() iterator >> quite a few in my projects. and I'm curious on whether this new >> iterator would allow numpy.fromiter() to go faster (I mean, in Python >> space). Any hint? >> > > The new iterator doesn't offer any help to fromiter in general, but if the > iterator being given to the function is a new iterator it would be possible > to handle it specially and get a big speedup. Ah, that's what I thought. Thanks for the clarification. 
-- Francesc Alted From josef.pktd at gmail.com Thu Jan 6 05:14:15 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 6 Jan 2011 05:14:15 -0500 Subject: [Numpy-discussion] aa.astype(int) truncates and doesn't round Message-ID: just something I bumped into and wasn't aware of >>> aa array([ 1., 1., 1., 1., 1.]) >>> aa.astype(int) array([0, 1, 0, 0, 0]) >>> aa - 1 array([ -2.22044605e-16, 2.22044605e-16, -2.22044605e-16, -3.33066907e-16, -3.33066907e-16]) >>> np.round(aa).astype(int) array([1, 1, 1, 1, 1]) Josef From kwgoodman at gmail.com Thu Jan 6 10:40:57 2011 From: kwgoodman at gmail.com (Keith Goodman) Date: Thu, 6 Jan 2011 07:40:57 -0800 Subject: [Numpy-discussion] aa.astype(int) truncates and doesn't round In-Reply-To: References: Message-ID: On Thu, Jan 6, 2011 at 2:14 AM, wrote: > just something I bumped into and wasn't aware of > >>>> aa > array([ 1., ?1., ?1., ?1., ?1.]) >>>> aa.astype(int) > array([0, 1, 0, 0, 0]) >>>> aa - 1 > array([ -2.22044605e-16, ? 2.22044605e-16, ?-2.22044605e-16, > ? ? ? ?-3.33066907e-16, ?-3.33066907e-16]) >>>> np.round(aa).astype(int) > array([1, 1, 1, 1, 1]) >> a = np.ones(100) >> a.astype(int) array([1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]) My default numpy int is 64 bits. Try 32 bits: >> a = np.ones(100, np.int32) >> a.astype(int) array([1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]) From josef.pktd at gmail.com Thu Jan 6 12:04:56 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 6 Jan 2011 12:04:56 -0500 Subject: [Numpy-discussion] aa.astype(int) truncates and doesn't round In-Reply-To: References: Message-ID: On Thu, Jan 6, 2011 at 10:40 AM, Keith Goodman wrote: > On Thu, Jan 6, 2011 at 2:14 AM, ? wrote: >> just something I bumped into and wasn't aware of >> >>>>> aa >> array([ 1., ?1., ?1., ?1., ?1.]) >>>>> aa.astype(int) >> array([0, 1, 0, 0, 0]) >>>>> aa - 1 >> array([ -2.22044605e-16, ? 2.22044605e-16, ?-2.22044605e-16, >> ? ? ? ?-3.33066907e-16, ?-3.33066907e-16]) >>>>> np.round(aa).astype(int) >> array([1, 1, 1, 1, 1]) > >>> a = np.ones(100) >>> a.astype(int) > array([1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1]) > > My default numpy int is 64 bits. Try 32 bits: > >>> a = np.ones(100, np.int32) >>> a.astype(int) > > array([1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 
1, 1, 1, 1, 1, 1, 1, 1]) the full exercise includes some calculations first, so we are not precisely at 1 >>> d_ array([[ 1., 0., 0., 0., 0.], [ 1., -1., 0., 0., 0.], [ 1., 0., -1., 0., 0.], [ 1., 0., 0., -1., 0.], [ 1., 0., 0., 0., -1.]]) >>> np.set_printoptions(precision=2) >>> np.linalg.pinv(d_) array([[ 1.00e+00, -2.34e-16, -8.50e-17, -7.63e-17, -8.50e-17], [ 1.00e+00, -1.00e+00, -1.06e-16, 2.19e-16, -6.18e-17], [ 1.00e+00, -2.21e-16, -1.00e+00, -2.27e-16, 9.38e-18], [ 1.00e+00, -6.40e-17, -2.84e-17, -1.00e+00, -7.65e-17], [ 1.00e+00, -1.70e-16, -9.55e-17, -1.52e-17, -1.00e+00]]) >>> np.linalg.pinv(d_).astype(int) array([[ 0, 0, 0, 0, 0], [ 0, -1, 0, 0, 0], [ 0, 0, -1, 0, 0], [ 0, 0, 0, -1, 0], [ 0, 0, 0, 0, -1]]) >>> np.linalg.inv(d_).astype(int) array([[ 1, 0, 0, 0, 0], [ 1, -1, 0, 0, 0], [ 1, 0, -1, 0, 0], [ 1, 0, 0, -1, 0], [ 1, 0, 0, 0, -1]]) Josef > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From robert.kern at gmail.com Thu Jan 6 12:09:00 2011 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Jan 2011 11:09:00 -0600 Subject: [Numpy-discussion] aa.astype(int) truncates and doesn't round In-Reply-To: References: Message-ID: On Thu, Jan 6, 2011 at 09:40, Keith Goodman wrote: > On Thu, Jan 6, 2011 at 2:14 AM, ? wrote: >> just something I bumped into and wasn't aware of >> >>>>> aa >> array([ 1., ?1., ?1., ?1., ?1.]) >>>>> aa.astype(int) >> array([0, 1, 0, 0, 0]) >>>>> aa - 1 >> array([ -2.22044605e-16, ? 2.22044605e-16, ?-2.22044605e-16, >> ? ? ? ?-3.33066907e-16, ?-3.33066907e-16]) >>>>> np.round(aa).astype(int) >> array([1, 1, 1, 1, 1]) > >>> a = np.ones(100) >>> a.astype(int) > array([1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1]) > > My default numpy int is 64 bits. Try 32 bits: > >>> a = np.ones(100, np.int32) >>> a.astype(int) > > array([1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, > ? ? ? 1, 1, 1, 1, 1, 1, 1, 1]) He's not pointing out a bug. His array does not have 1s in them, but values very close to 1, some slightly above and some slightly below, such that numpy's default printing rounds them to 1. See the "aa - 1" line. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." ? -- Umberto Eco From robert.kern at gmail.com Thu Jan 6 12:11:20 2011 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Jan 2011 11:11:20 -0600 Subject: [Numpy-discussion] aa.astype(int) truncates and doesn't round In-Reply-To: References: Message-ID: On Thu, Jan 6, 2011 at 04:14, wrote: > just something I bumped into and wasn't aware of > >>>> aa > array([ 1., ?1., ?1., ?1., ?1.]) >>>> aa.astype(int) > array([0, 1, 0, 0, 0]) >>>> aa - 1 > array([ -2.22044605e-16, ? 2.22044605e-16, ?-2.22044605e-16, > ? ? ? 
?-3.33066907e-16, ?-3.33066907e-16]) >>>> np.round(aa).astype(int) > array([1, 1, 1, 1, 1]) This is behavior inherited from C and matches Python's behavior. int(aa[0]) == 0. Similarly, inside the C code, (int)(1.0 - 2.22e-16) == 0. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." ? -- Umberto Eco From brian.murphy at unitn.it Fri Jan 7 06:33:59 2011 From: brian.murphy at unitn.it (Brian Murphy) Date: Fri, 7 Jan 2011 12:33:59 +0100 Subject: [Numpy-discussion] code for multidimensional scaling? In-Reply-To: <4D26F96C.3070607@unitn.it> References: <4D26F96C.3070607@unitn.it> Message-ID: <4D26FA27.8000100@unitn.it> Hi, I'm new to the list, so I hope my question is appropriate (I've already sent the same posting to the SciPy Users list). I'm looking for code that implements multi-dimensional scaling (e.g. like Matlab's mdscale command) in Python. My best guess was that I would find it in the Scikit Learn package, but couldn't turn anything up. Any suggestions? thanks and regards, Brian -- Brian Murphy Post-Doctoral Researcher Language, Interaction and Computation Lab Centre for Mind/Brain Sciences University of Trento http://clic.cimec.unitn.it/brian/ From Thomas.EMMEL at 3ds.com Fri Jan 7 10:58:04 2011 From: Thomas.EMMEL at 3ds.com (EMMEL Thomas) Date: Fri, 7 Jan 2011 15:58:04 +0000 Subject: [Numpy-discussion] speed of numpy.ndarray compared to Numeric.array Message-ID: <3A0080EEBFB19C4993C24098DD0A78D108D12609@EU-DCC-MBX01.dsone.3ds.com> Hi, There are some discussions on the speed of numpy compared to Numeric in this list, however I have a topic I don't understand in detail, maybe someone can enlighten me... I use python 2.6 on a SuSE installation and test this: #Python 2.6 (r26:66714, Mar 30 2010, 00:29:28) #[GCC 4.3.2 [gcc-4_3-branch revision 141291]] on linux2 #Type "help", "copyright", "credits" or "license" for more information. import timeit #creation of arrays and tuples (timeit number=1000000 by default) timeit.Timer('a((1.,2.,3.))','from numpy import array as a').timeit() #8.2061841487884521 timeit.Timer('a((1.,2.,3.))','from Numeric import array as a').timeit() #9.6958281993865967 timeit.Timer('a((1.,2.,3.))','a=tuple').timeit() #0.13814711570739746 #Result: tuples - of course - are much faster than arrays and numpy is a bit faster in creating arrays than Numeric #working with arrays timeit.Timer('d=x1-x2;sum(d*d)','from Numeric import array as a; x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() #3.263314962387085 timeit.Timer('d=x1-x2;sum(d*d)','from numpy import array as a; x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() #9.7236979007720947 #Result: Numeric is three times faster than numpy! Why? #working with components: timeit.Timer('d0=x1[0]-x2[0];d1=x1[1]-x2[1];d2=x1[2]-x2[2];d0*d0+d1*d1+d2*d2','a=tuple; x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() #0.64785194396972656 timeit.Timer('d0=x1[0]-x2[0];d1=x1[1]-x2[1];d2=x1[2]-x2[2];d0*d0+d1*d1+d2*d2','from numpy import array as a; x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() #3.4181499481201172 timeit.Timer('d0=x1[0]-x2[0];d1=x1[1]-x2[1];d2=x1[2]-x2[2];d0*d0+d1*d1+d2*d2','from Numeric import array as a; x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() #0.97426199913024902 Result: tuples are again the fastest variant, Numeric is faster than numpy and both are faster than the variant above using the high-level functions! Why? 
For various reasons I need to use numpy in the future where I used Numeric before. Is there any better solution in numpy I missed? Kind regards and thanks in advance Thomas This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email.For other languages, go to http://www.3ds.com/terms/email-disclaimer. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsalvati at u.washington.edu Fri Jan 7 11:09:41 2011 From: jsalvati at u.washington.edu (John Salvatier) Date: Fri, 7 Jan 2011 08:09:41 -0800 Subject: [Numpy-discussion] speed of numpy.ndarray compared to Numeric.array In-Reply-To: <3A0080EEBFB19C4993C24098DD0A78D108D12609@EU-DCC-MBX01.dsone.3ds.com> References: <3A0080EEBFB19C4993C24098DD0A78D108D12609@EU-DCC-MBX01.dsone.3ds.com> Message-ID: Did you try larger arrays/tuples? I would guess that makes a significant difference. On Fri, Jan 7, 2011 at 7:58 AM, EMMEL Thomas wrote: > Hi, > > There are some discussions on the speed of numpy compared to Numeric in > this list, however I have a topic > I don't understand in detail, maybe someone can enlighten me... > I use python 2.6 on a SuSE installation and test this: > > #Python 2.6 (r26:66714, Mar 30 2010, 00:29:28) > #[GCC 4.3.2 [gcc-4_3-branch revision 141291]] on linux2 > #Type "help", "copyright", "credits" or "license" for more information. > > import timeit > > #creation of arrays and tuples (timeit number=1000000 by default) > > timeit.Timer('a((1.,2.,3.))','from numpy import array as a').timeit() > #8.2061841487884521 > timeit.Timer('a((1.,2.,3.))','from Numeric import array as a').timeit() > #9.6958281993865967 > timeit.Timer('a((1.,2.,3.))','a=tuple').timeit() > #0.13814711570739746 > > #Result: tuples - of course - are much faster than arrays and numpy is a > bit faster in creating arrays than Numeric > > #working with arrays > > timeit.Timer('d=x1-x2;sum(d*d)','from Numeric import array as a; > x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() > #3.263314962387085 > timeit.Timer('d=x1-x2;sum(d*d)','from numpy import array as a; > x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() > #9.7236979007720947 > > #Result: Numeric is three times faster than numpy! Why? > > #working with components: > > timeit.Timer('d0=x1[0]-x2[0];d1=x1[1]-x2[1];d2=x1[2]-x2[2];d0*d0+d1*d1+d2*d2','a=tuple; > x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() > #0.64785194396972656 > timeit.Timer('d0=x1[0]-x2[0];d1=x1[1]-x2[1];d2=x1[2]-x2[2];d0*d0+d1*d1+d2*d2','from > numpy import array as a; x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() > #3.4181499481201172 > timeit.Timer('d0=x1[0]-x2[0];d1=x1[1]-x2[1];d2=x1[2]-x2[2];d0*d0+d1*d1+d2*d2','from > Numeric import array as a; x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() > #0.97426199913024902 > > Result: tuples are again the fastest variant, Numeric is faster than numpy > and both are faster than the variant above using the high-level functions! > Why? > > For various reasons I need to use numpy in the future where I used Numeric > before. > Is there any better solution in numpy I missed? 
> > Kind regards and thanks in advance > > Thomas > > This email and any attachments are intended solely for the use of the > individual or entity to whom it is addressed and may be confidential and/or > privileged. > > If you are not one of the named recipients or have received this email in > error, > > (i) you should not read, disclose, or copy it, > > (ii) please notify sender of your receipt by reply email and delete this > email and all attachments, > > (iii) Dassault Systemes does not accept or assume any liability or > responsibility for any use of or reliance on this email. > > For other languages, Click Here > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Fri Jan 7 12:49:44 2011 From: ben.root at ou.edu (Benjamin Root) Date: Fri, 7 Jan 2011 11:49:44 -0600 Subject: [Numpy-discussion] speed of numpy.ndarray compared to Numeric.array In-Reply-To: <3A0080EEBFB19C4993C24098DD0A78D108D12609@EU-DCC-MBX01.dsone.3ds.com> References: <3A0080EEBFB19C4993C24098DD0A78D108D12609@EU-DCC-MBX01.dsone.3ds.com> Message-ID: On Fri, Jan 7, 2011 at 9:58 AM, EMMEL Thomas wrote: > Hi, > > There are some discussions on the speed of numpy compared to Numeric in > this list, however I have a topic > I don't understand in detail, maybe someone can enlighten me... > I use python 2.6 on a SuSE installation and test this: > > #Python 2.6 (r26:66714, Mar 30 2010, 00:29:28) > #[GCC 4.3.2 [gcc-4_3-branch revision 141291]] on linux2 > #Type "help", "copyright", "credits" or "license" for more information. > > import timeit > > #creation of arrays and tuples (timeit number=1000000 by default) > > timeit.Timer('a((1.,2.,3.))','from numpy import array as a').timeit() > #8.2061841487884521 > timeit.Timer('a((1.,2.,3.))','from Numeric import array as a').timeit() > #9.6958281993865967 > timeit.Timer('a((1.,2.,3.))','a=tuple').timeit() > #0.13814711570739746 > > #Result: tuples - of course - are much faster than arrays and numpy is a > bit faster in creating arrays than Numeric > > #working with arrays > > timeit.Timer('d=x1-x2;sum(d*d)','from Numeric import array as a; > x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() > #3.263314962387085 > timeit.Timer('d=x1-x2;sum(d*d)','from numpy import array as a; > x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() > #9.7236979007720947 > > #Result: Numeric is three times faster than numpy! Why? > > #working with components: > > timeit.Timer('d0=x1[0]-x2[0];d1=x1[1]-x2[1];d2=x1[2]-x2[2];d0*d0+d1*d1+d2*d2','a=tuple; > x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() > #0.64785194396972656 > timeit.Timer('d0=x1[0]-x2[0];d1=x1[1]-x2[1];d2=x1[2]-x2[2];d0*d0+d1*d1+d2*d2','from > numpy import array as a; x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() > #3.4181499481201172 > timeit.Timer('d0=x1[0]-x2[0];d1=x1[1]-x2[1];d2=x1[2]-x2[2];d0*d0+d1*d1+d2*d2','from > Numeric import array as a; x1=a((1.,2.,3.));x2=a((2.,4.,6.))').timeit() > #0.97426199913024902 > > Result: tuples are again the fastest variant, Numeric is faster than numpy > and both are faster than the variant above using the high-level functions! > Why? > > For various reasons I need to use numpy in the future where I used Numeric > before. > Is there any better solution in numpy I missed? 
> > Kind regards and thanks in advance > > Thomas > Don't know how much of an impact it would have, but those timeit statements for array creation include the import process, which are going to be different for each module and are probably not indicative of the speed of array creation. Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From friedrichromstedt at gmail.com Sat Jan 8 16:32:48 2011 From: friedrichromstedt at gmail.com (Friedrich Romstedt) Date: Sat, 8 Jan 2011 22:32:48 +0100 Subject: [Numpy-discussion] bincount question In-Reply-To: <4D10F7BB.3000905@gmail.com> References: <4D10F7BB.3000905@gmail.com> Message-ID: 2010/12/21 Alan G Isaac : > :: > > ? ? >>> np.bincount([]) > ? ? Traceback (most recent call last): > ? ? ? File "", line 1, in > ? ? ValueError: The first argument cannot be empty. > > Why not? > (I.e., why isn't an empty array the right answer?) >From the (i.e. "a", or, even more precise, "my") mathematical pov: Define the "bincount" sequence, which will mostly consist of trailing zeros for large indices. Then, the return value is the smallest sequence, s.t. there are no non-zero items left outside the return chunk of the sequence, and of course it must include the zeroth bincount sequence element. So, yes, [] would be the correct answer. >From the algorithmic point of view: Define the length of the sequence returned by the max() of the array handed in + 1. So, since max([]) is undefined, such is bincount in that case. I'm a bit in favour of the mathematical approach. But unfortunately, I cannot fix it, although I think it will break nothing because nothing should rely on this corner case yielding an Exception (but I might be proven wrong, I don't really know). In any case, it might be worth documenting this, by adding it to the ValueError section of the "Raises" part (http://docs.scipy.org/doc/numpy/reference/generated/numpy.bincount.html?highlight=bincount#numpy.bincount). This is something I might be able to do. Friedrich From mwwiebe at gmail.com Sun Jan 9 17:45:02 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Sun, 9 Jan 2011 14:45:02 -0800 Subject: [Numpy-discussion] numexpr with the new iterator Message-ID: As a benchmark of C-based iterator usage and to make it work properly in a multi-threaded context, I've updated numexpr to use the new iterator. In addition to some performance improvements, this also made it easy to add optional out= and order= parameters to the evaluate function. The numexpr repository with this update is available here: https://github.com/m-paradox/numexpr To use it, you need the new_iterator branch of NumPy from here: https://github.com/m-paradox/numpy In all cases tested, the iterator version of numexpr's evaluate function matches or beats the standard version. 
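As a usage sketch of the two new keywords (assuming the preallocated output already has the matching shape and dtype; the expression and array sizes are only illustrative):

import numpy as np
import numexpr as ne

a = np.arange(1e6)
b = np.arange(1e6)

# write the result into a preallocated array, forcing C memory order
res = np.empty_like(a)
ne.evaluate_iter("a**2 + b**2 + 2*a*b", out=res, order='C')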
The timing results are below, with some explanatory comments placed inline: -Mark In [1]: import numexpr as ne # numexpr front page example In [2]: a = np.arange(1e6) In [3]: b = np.arange(1e6) In [4]: timeit a**2 + b**2 + 2*a*b 1 loops, best of 3: 121 ms per loop In [5]: ne.set_num_threads(1) # iterator version performance matches standard version In [6]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") 10 loops, best of 3: 24.8 ms per loop In [7]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") 10 loops, best of 3: 24.3 ms per loop In [8]: ne.set_num_threads(2) # iterator version performance matches standard version In [9]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") 10 loops, best of 3: 21 ms per loop In [10]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") 10 loops, best of 3: 20.5 ms per loop # numexpr front page example with a 10x bigger array In [11]: a = np.arange(1e7) In [12]: b = np.arange(1e7) In [13]: ne.set_num_threads(2) # the iterator version performance improvement is due to # a small task scheduler tweak In [14]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") 1 loops, best of 3: 282 ms per loop In [15]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") 1 loops, best of 3: 255 ms per loop # numexpr front page example with a Fortran contiguous array In [16]: a = np.arange(1e7).reshape(10,100,100,100).T In [17]: b = np.arange(1e7).reshape(10,100,100,100).T In [18]: timeit a**2 + b**2 + 2*a*b 1 loops, best of 3: 3.22 s per loop In [19]: ne.set_num_threads(1) # even with a C-ordered output, the iterator version performs better In [20]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") 1 loops, best of 3: 3.74 s per loop In [21]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") 1 loops, best of 3: 379 ms per loop In [22]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b", order='C') 1 loops, best of 3: 2.03 s per loop In [23]: ne.set_num_threads(2) # the standard version just uses 1 thread here, I believe # the iterator version performs the same as for the flat 1e7-sized array In [24]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") 1 loops, best of 3: 3.92 s per loop In [25]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") 1 loops, best of 3: 254 ms per loop In [26]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b", order='C') 1 loops, best of 3: 1.74 s per loop -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsalvati at u.washington.edu Sun Jan 9 18:33:41 2011 From: jsalvati at u.washington.edu (John Salvatier) Date: Sun, 9 Jan 2011 15:33:41 -0800 Subject: [Numpy-discussion] numexpr with the new iterator In-Reply-To: References: Message-ID: Is evaluate_iter basically numpexpr but using your numpy branch or are there other changes? On Sun, Jan 9, 2011 at 2:45 PM, Mark Wiebe wrote: > As a benchmark of C-based iterator usage and to make it work properly in a > multi-threaded context, I've updated numexpr to use the new iterator. In > addition to some performance improvements, this also made it easy to add > optional out= and order= parameters to the evaluate function. The numexpr > repository with this update is available here: > > https://github.com/m-paradox/numexpr > > To use it, you need the new_iterator branch of NumPy from here: > > https://github.com/m-paradox/numpy > > In all cases tested, the iterator version of numexpr's evaluate function > matches or beats the standard version. 
The timing results are below, with > some explanatory comments placed inline: > > -Mark > > In [1]: import numexpr as ne > > # numexpr front page example > > In [2]: a = np.arange(1e6) > In [3]: b = np.arange(1e6) > > In [4]: timeit a**2 + b**2 + 2*a*b > 1 loops, best of 3: 121 ms per loop > > In [5]: ne.set_num_threads(1) > > # iterator version performance matches standard version > > In [6]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") > 10 loops, best of 3: 24.8 ms per loop > In [7]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") > 10 loops, best of 3: 24.3 ms per loop > > In [8]: ne.set_num_threads(2) > > # iterator version performance matches standard version > > In [9]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") > 10 loops, best of 3: 21 ms per loop > In [10]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") > 10 loops, best of 3: 20.5 ms per loop > > # numexpr front page example with a 10x bigger array > > In [11]: a = np.arange(1e7) > In [12]: b = np.arange(1e7) > > In [13]: ne.set_num_threads(2) > > # the iterator version performance improvement is due to > # a small task scheduler tweak > > In [14]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") > 1 loops, best of 3: 282 ms per loop > In [15]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") > 1 loops, best of 3: 255 ms per loop > > # numexpr front page example with a Fortran contiguous array > > In [16]: a = np.arange(1e7).reshape(10,100,100,100).T > In [17]: b = np.arange(1e7).reshape(10,100,100,100).T > > In [18]: timeit a**2 + b**2 + 2*a*b > 1 loops, best of 3: 3.22 s per loop > > In [19]: ne.set_num_threads(1) > > # even with a C-ordered output, the iterator version performs better > > In [20]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") > 1 loops, best of 3: 3.74 s per loop > In [21]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") > 1 loops, best of 3: 379 ms per loop > In [22]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b", order='C') > 1 loops, best of 3: 2.03 s per loop > > In [23]: ne.set_num_threads(2) > > # the standard version just uses 1 thread here, I believe > # the iterator version performs the same as for the flat 1e7-sized array > > In [24]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") > 1 loops, best of 3: 3.92 s per loop > In [25]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") > 1 loops, best of 3: 254 ms per loop > In [26]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b", order='C') > 1 loops, best of 3: 1.74 s per loop > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwwiebe at gmail.com Sun Jan 9 21:23:43 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Sun, 9 Jan 2011 18:23:43 -0800 Subject: [Numpy-discussion] numexpr with the new iterator In-Reply-To: References: Message-ID: That's right, essentially all I've done is replaced the code that handled preparing the arrays and producing blocks of values for the inner loops. There are three new parameters to evaluate_iter as well. It has an "out=" parameter just like ufuncs do, an "order=" parameter which controls the layout of the output if it's created by the function, and a "casting=" parameter which controls what kind of data conversions are permitted. -Mark On Sun, Jan 9, 2011 at 3:33 PM, John Salvatier wrote: > Is evaluate_iter basically numpexpr but using your numpy branch or are > there other changes? 
> > On Sun, Jan 9, 2011 at 2:45 PM, Mark Wiebe wrote: > >> As a benchmark of C-based iterator usage and to make it work properly in a >> multi-threaded context, I've updated numexpr to use the new iterator. In >> addition to some performance improvements, this also made it easy to add >> optional out= and order= parameters to the evaluate function. The numexpr >> repository with this update is available here: >> >> https://github.com/m-paradox/numexpr >> >> To use it, you need the new_iterator branch of NumPy from here: >> >> https://github.com/m-paradox/numpy >> >> In all cases tested, the iterator version of numexpr's evaluate function >> matches or beats the standard version. The timing results are below, with >> some explanatory comments placed inline: >> >> -Mark >> >> In [1]: import numexpr as ne >> >> # numexpr front page example >> >> In [2]: a = np.arange(1e6) >> In [3]: b = np.arange(1e6) >> >> In [4]: timeit a**2 + b**2 + 2*a*b >> 1 loops, best of 3: 121 ms per loop >> >> In [5]: ne.set_num_threads(1) >> >> # iterator version performance matches standard version >> >> In [6]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") >> 10 loops, best of 3: 24.8 ms per loop >> In [7]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") >> 10 loops, best of 3: 24.3 ms per loop >> >> In [8]: ne.set_num_threads(2) >> >> # iterator version performance matches standard version >> >> In [9]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") >> 10 loops, best of 3: 21 ms per loop >> In [10]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") >> 10 loops, best of 3: 20.5 ms per loop >> >> # numexpr front page example with a 10x bigger array >> >> In [11]: a = np.arange(1e7) >> In [12]: b = np.arange(1e7) >> >> In [13]: ne.set_num_threads(2) >> >> # the iterator version performance improvement is due to >> # a small task scheduler tweak >> >> In [14]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") >> 1 loops, best of 3: 282 ms per loop >> In [15]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") >> 1 loops, best of 3: 255 ms per loop >> >> # numexpr front page example with a Fortran contiguous array >> >> In [16]: a = np.arange(1e7).reshape(10,100,100,100).T >> In [17]: b = np.arange(1e7).reshape(10,100,100,100).T >> >> In [18]: timeit a**2 + b**2 + 2*a*b >> 1 loops, best of 3: 3.22 s per loop >> >> In [19]: ne.set_num_threads(1) >> >> # even with a C-ordered output, the iterator version performs better >> >> In [20]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") >> 1 loops, best of 3: 3.74 s per loop >> In [21]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") >> 1 loops, best of 3: 379 ms per loop >> In [22]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b", order='C') >> 1 loops, best of 3: 2.03 s per loop >> >> In [23]: ne.set_num_threads(2) >> >> # the standard version just uses 1 thread here, I believe >> # the iterator version performs the same as for the flat 1e7-sized array >> >> In [24]: timeit ne.evaluate("a**2 + b**2 + 2*a*b") >> 1 loops, best of 3: 3.92 s per loop >> In [25]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b") >> 1 loops, best of 3: 254 ms per loop >> In [26]: timeit ne.evaluate_iter("a**2 + b**2 + 2*a*b", order='C') >> 1 loops, best of 3: 1.74 s per loop >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > 
> -------------- next part -------------- An HTML attachment was scrubbed... URL: From Thomas.EMMEL at 3ds.com Mon Jan 10 03:09:17 2011 From: Thomas.EMMEL at 3ds.com (EMMEL Thomas) Date: Mon, 10 Jan 2011 08:09:17 +0000 Subject: [Numpy-discussion] speed of numpy.ndarray compared to Numeric.array Message-ID: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com> To John: > Did you try larger arrays/tuples? I would guess that makes a significant > difference. No I didn't, due to the fact that these values are coordinates in 3D (x,y,z). In fact I work with a list/array/tuple of arrays with 100000 to 1M of elements or more. What I need to do is to calculate the distance of each of these elements (coordinates) to a given coordinate and filter for the nearest. The brute force method would look like this: #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ def bruteForceSearch(points, point): minpt = min([(vec2Norm(pt, point), pt, i) for i, pt in enumerate(points)], key=itemgetter(0)) return sqrt(minpt[0]), minpt[1], minpt[2] #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ def vec2Norm(pt1,pt2): xDis = pt1[0]-pt2[0] yDis = pt1[1]-pt2[1] zDis = pt1[2]-pt2[2] return xDis*xDis+yDis*yDis+zDis*zDis I have a more clever method but it still takes a lot of time in the vec2norm-function. If you like I can attach a running example. To Ben: > Don't know how much of an impact it would have, but those timeit statements > for array creation include the import process, which are going to be > different for each module and are probably not indicative of the speed of > array creation. No, the timeit statements counts the time for the statement in the first argument only, the import-thing isn't included in the time. Thomas This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email.For other languages, go to http://www.3ds.com/terms/email-disclaimer. From cournape at gmail.com Mon Jan 10 03:53:01 2011 From: cournape at gmail.com (David Cournapeau) Date: Mon, 10 Jan 2011 17:53:01 +0900 Subject: [Numpy-discussion] speed of numpy.ndarray compared to Numeric.array In-Reply-To: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com> References: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com> Message-ID: On Mon, Jan 10, 2011 at 5:09 PM, EMMEL Thomas wrote: > To John: > >> Did you try larger arrays/tuples? I would guess that makes a significant >> difference. > > No I didn't, due to the fact that these values are coordinates in 3D (x,y,z). > In fact I work with a list/array/tuple of arrays with 100000 to 1M of elements or more. > What I need to do is to calculate the distance of each of these elements (coordinates) > to a given coordinate and filter for the nearest. Note that for this exact problem, there are much better methods than brute force (O(N^2) for N vectors), through e.g. kd-trees, which work very well in low-dimension. 
This will matter much more than numeric vs numpy cheers, David From Thomas.EMMEL at 3ds.com Mon Jan 10 04:04:48 2011 From: Thomas.EMMEL at 3ds.com (EMMEL Thomas) Date: Mon, 10 Jan 2011 09:04:48 +0000 Subject: [Numpy-discussion] speed of numpy.ndarray compared toNumeric.array In-Reply-To: References: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com> Message-ID: <3A0080EEBFB19C4993C24098DD0A78D108D1281D@EU-DCC-MBX01.dsone.3ds.com> > On Mon, Jan 10, 2011 at 5:09 PM, EMMEL Thomas > wrote: > > To John: > > > >> Did you try larger arrays/tuples? I would guess that makes a > significant > >> difference. > > > > No I didn't, due to the fact that these values are coordinates in 3D > (x,y,z). > > In fact I work with a list/array/tuple of arrays with 100000 to 1M of > elements or more. > > What I need to do is to calculate the distance of each of these > elements (coordinates) > > to a given coordinate and filter for the nearest. > > Note that for this exact problem, there are much better methods than > brute force (O(N^2) for N vectors), through e.g. kd-trees, which work > very well in low-dimension. This will matter much more than numeric vs > numpy > > cheers, > > David David, Yes, of course and my real implementation uses exactly these methods, but there are still issues with the arrays. Example: If I would use brute-force it will take ~5000s for a particular example to find all points in a list of points. Theoretically it should be possible to come to O(N*log(N)) with would mean ~2s in my case. My method need ~28s with tuples, but it takes ~30s with Numeric arrays and ~60s and more with numpy.ndarrays! I just use the brute-force method since it delivers the most reusable results for performance testing, the other methods are a bit dependent on the distribution of points in space. Thomas This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email.For other languages, go to http://www.3ds.com/terms/email-disclaimer. From cournape at gmail.com Mon Jan 10 04:14:38 2011 From: cournape at gmail.com (David Cournapeau) Date: Mon, 10 Jan 2011 18:14:38 +0900 Subject: [Numpy-discussion] speed of numpy.ndarray compared toNumeric.array In-Reply-To: <3A0080EEBFB19C4993C24098DD0A78D108D1281D@EU-DCC-MBX01.dsone.3ds.com> References: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com> <3A0080EEBFB19C4993C24098DD0A78D108D1281D@EU-DCC-MBX01.dsone.3ds.com> Message-ID: On Mon, Jan 10, 2011 at 6:04 PM, EMMEL Thomas wrote: > > Yes, of course and my real implementation uses exactly these methods, > but there are still issues with the arrays. Did you try kd-trees in scipy ? 
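Something along these lines, for instance (a minimal sketch with scipy.spatial.cKDTree; the random point cloud and the query coordinate are only placeholders):

import numpy as np
from scipy.spatial import cKDTree

points = np.random.rand(100000, 3)   # the searched 3D coordinates
point = np.array([0.1, 0.2, 0.3])    # the query coordinate

tree = cKDTree(points)               # built once, reused for every query
dist, idx = tree.query(point)        # nearest neighbour: distance and index
nearest = points[idx]

The tree construction cost is paid once; each query is then roughly logarithmic in the number of points.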
David From Thomas.EMMEL at 3ds.com Mon Jan 10 04:42:10 2011 From: Thomas.EMMEL at 3ds.com (EMMEL Thomas) Date: Mon, 10 Jan 2011 09:42:10 +0000 Subject: [Numpy-discussion] speed of numpy.ndarray comparedtoNumeric.array In-Reply-To: References: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com><3A0080EEBFB19C4993C24098DD0A78D108D1281D@EU-DCC-MBX01.dsone.3ds.com> Message-ID: <3A0080EEBFB19C4993C24098DD0A78D108D1284B@EU-DCC-MBX01.dsone.3ds.com> > -----Original Message----- > From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- > bounces at scipy.org] On Behalf Of David Cournapeau > Sent: Montag, 10. Januar 2011 10:15 > To: Discussion of Numerical Python > Subject: Re: [Numpy-discussion] speed of numpy.ndarray compared > toNumeric.array > > On Mon, Jan 10, 2011 at 6:04 PM, EMMEL Thomas > wrote: > > > > > Yes, of course and my real implementation uses exactly these methods, > > but there are still issues with the arrays. > > Did you try kd-trees in scipy ? > > David David, No, I didn't, however, my method is very similar and as far as I understood kd-trees, they need some time for pre-conditioning the search-area and this is the same as I did. In fact I think my method is more or less the same as a kd-tree. The problem remains that I need to calculate the distance of some points at a certain point in my code (when I am in a leaf of a kd-tree). For example when I use 100000 points I end up in a leaf of my kd-tree where I need to calculate the distance for only 100 points or less (depends on the tree). The problem still remains and I use cProfile to get into the details. Most of the time it takes is in vec2Norm, everything else is very short but I need to call it as often as I have points (again 100000) and this is why 100000*0.001s takes some time. For numpy.ndarray this is 0.002s-0.003s, for Numeric.array 0.001-0.002s and for tuple ~0.001s (values from cProfile). And, by the way, the same problem appears when I need to calculate the cross-product of several vectors. In this case I have a geometry in 3D with a surface of thousands of triangles and I need to calculate the normal of each of these triangles. Again, doing a loop over tuples is faster than arrays, although in this case numpy.cross is twice as fast as Numeric.cross_product. Thomas This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email.For other languages, go to http://www.3ds.com/terms/email-disclaimer. This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email.For other languages, go to http://www.3ds.com/terms/email-disclaimer. 
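A vectorized form of the normal computation described above avoids the per-triangle Python loop entirely; a rough sketch (the vertex arrays, their sizes and the random data are only placeholders):

import numpy as np

# three (N, 3) arrays holding the corner coordinates of N triangles
N = 10000
p0 = np.random.rand(N, 3)
p1 = np.random.rand(N, 3)
p2 = np.random.rand(N, 3)

# a single call computes all N cross products at once
normals = np.cross(p1 - p0, p2 - p0)
lengths = np.sqrt((normals * normals).sum(axis=1))
unit_normals = normals / lengths[:, np.newaxis]

With the triangles batched like this, the cost per normal is dominated by the array arithmetic rather than by Python-level call overhead.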
From faltet at pytables.org Mon Jan 10 05:05:27 2011 From: faltet at pytables.org (Francesc Alted) Date: Mon, 10 Jan 2011 11:05:27 +0100 Subject: [Numpy-discussion] numexpr with the new iterator In-Reply-To: References: Message-ID: <201101101105.27421.faltet@pytables.org> A Sunday 09 January 2011 23:45:02 Mark Wiebe escrigu?: > As a benchmark of C-based iterator usage and to make it work properly > in a multi-threaded context, I've updated numexpr to use the new > iterator. In addition to some performance improvements, this also > made it easy to add optional out= and order= parameters to the > evaluate function. The numexpr repository with this update is > available here: > > https://github.com/m-paradox/numexpr > > To use it, you need the new_iterator branch of NumPy from here: > > https://github.com/m-paradox/numpy > > In all cases tested, the iterator version of numexpr's evaluate > function matches or beats the standard version. The timing results > are below, with some explanatory comments placed inline: [clip] Your patch looks mostly fine to my eyes; good job! Unfortunately, I've been unable to compile your new_iterator branch of NumPy: numpy/core/src/multiarray/multiarraymodule.c:45:33: fatal error: new_iterator_pywrap.h: El fitxer o directori no existeix Apparently, you forgot to add the new_iterator_pywrap.h file. My idea would be to merge your patch in numexpr and make the new `evaluate_iter()` the default (i.e. make it `evaluate()`). However, by looking into the code, it seems to me that unaligned arrays (this is an important use case when operating with columns of structured arrays) may need more fine-tuning for Intel platforms. When I can compile the new_iterator branch, I'll give a try at unaligned data benchs. Also, I'd like to try out the new thread scheduling that you suggested to me privately (i.e. T0T1T0T1... vs T0T0...T1T1...). Thanks! -- Francesc Alted From sebastian at sipsolutions.net Mon Jan 10 06:22:50 2011 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 10 Jan 2011 12:22:50 +0100 Subject: [Numpy-discussion] speed of numpy.ndarray compared to Numeric.array In-Reply-To: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com> References: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com> Message-ID: <1294658570.2490.33.camel@sebastian> Hey, On Mon, 2011-01-10 at 08:09 +0000, EMMEL Thomas wrote: > #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ > def bruteForceSearch(points, point): > > minpt = min([(vec2Norm(pt, point), pt, i) > for i, pt in enumerate(points)], key=itemgetter(0)) > return sqrt(minpt[0]), minpt[1], minpt[2] > > #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ > def vec2Norm(pt1,pt2): > xDis = pt1[0]-pt2[0] > yDis = pt1[1]-pt2[1] > zDis = pt1[2]-pt2[2] > return xDis*xDis+yDis*yDis+zDis*zDis > > I have a more clever method but it still takes a lot of time in the vec2norm-function. > If you like I can attach a running example. > if you use the vec2Norm function as you wrote it there, this code is not vectorized at all, and as such of course numpy would be slowest as it has the most overhead and no advantages for non vectorized code, you simply can't write python code like that and expect it to be fast for these kind of calculations. 
Your function should look more like this: import numpy as np def bruteForceSearch(points, point): dists = points - point # that may need point[None,:] or such for broadcasting to work dists *= dists dists = dists.sum(1) I = np.argmin(dists) return sqrt(dists[I]), points[I], I If points is small, this may not help much (though compared to this exact code my guess is it probably would), if points is larger it should speed up things tremendously (unless you run into RAM problems). It may be that you need to fiddle around with axes, I did not check the code. If this is not good enough for you (you will need to port it (and maybe the next outer loop as well) to Cython or write it in C/C++ and make sure it can optimize things right. Also I think somewhere in scipy there were some distance tools that may be already in C and nice fast, but not sure. I hope I got this right and it helps, Sebastian From faltet at pytables.org Mon Jan 10 06:55:16 2011 From: faltet at pytables.org (Francesc Alted) Date: Mon, 10 Jan 2011 12:55:16 +0100 Subject: [Numpy-discussion] numexpr with the new iterator In-Reply-To: <201101101105.27421.faltet@pytables.org> References: <201101101105.27421.faltet@pytables.org> Message-ID: <201101101255.16286.faltet@pytables.org> A Monday 10 January 2011 11:05:27 Francesc Alted escrigu?: > Also, I'd like to try out the new thread scheduling that you > suggested to me privately (i.e. T0T1T0T1... vs T0T0...T1T1...). I've just implemented the new partition schema in numexpr (T0T0...T1T1..., being the original T0T1T0T1...). I'm attaching the patch for this. The results are a bit confusing. For example, using the attached benchmark (poly.py), I get these results for a common dual- core machine, non-NUMA machine: With the T0T1...T0T1... (original) schema: Computing: '((.25*x + .75)*x - 1.5)*x - 2' with 100000000 points Using numpy: *** Time elapsed: 3.497 Using numexpr: *** Time elapsed for 1 threads: 1.279000 *** Time elapsed for 2 threads: 0.688000 With the T0T0...T1T1... (new) schema: Computing: '((.25*x + .75)*x - 1.5)*x - 2' with 100000000 points Using numpy: *** Time elapsed: 3.454 Using numexpr: *** Time elapsed for 1 threads: 1.268000 *** Time elapsed for 2 threads: 0.754000 which is around a 10% slower (2 threads) than the original partition. The results are a bit different on a NUMA machine (8 physical cores, 16 logical cores via hyper-threading): With the T0T1...T0T1... (original) partition: Computing: '((.25*x + .75)*x - 1.5)*x - 2' with 100000000 points Using numpy: *** Time elapsed: 3.005 Using numexpr: *** Time elapsed for 1 threads: 1.109000 *** Time elapsed for 2 threads: 0.677000 *** Time elapsed for 3 threads: 0.496000 *** Time elapsed for 4 threads: 0.394000 *** Time elapsed for 5 threads: 0.324000 *** Time elapsed for 6 threads: 0.287000 *** Time elapsed for 7 threads: 0.247000 *** Time elapsed for 8 threads: 0.234000 *** Time elapsed for 9 threads: 0.242000 *** Time elapsed for 10 threads: 0.239000 *** Time elapsed for 11 threads: 0.241000 *** Time elapsed for 12 threads: 0.235000 *** Time elapsed for 13 threads: 0.226000 *** Time elapsed for 14 threads: 0.214000 *** Time elapsed for 15 threads: 0.235000 *** Time elapsed for 16 threads: 0.218000 With the T0T0...T1T1... 
(new) partition: Computing: '((.25*x + .75)*x - 1.5)*x - 2' with 100000000 points Using numpy: *** Time elapsed: 3.003 Using numexpr: *** Time elapsed for 1 threads: 1.106000 *** Time elapsed for 2 threads: 0.617000 *** Time elapsed for 3 threads: 0.442000 *** Time elapsed for 4 threads: 0.345000 *** Time elapsed for 5 threads: 0.296000 *** Time elapsed for 6 threads: 0.257000 *** Time elapsed for 7 threads: 0.237000 *** Time elapsed for 8 threads: 0.260000 *** Time elapsed for 9 threads: 0.245000 *** Time elapsed for 10 threads: 0.261000 *** Time elapsed for 11 threads: 0.238000 *** Time elapsed for 12 threads: 0.210000 *** Time elapsed for 13 threads: 0.218000 *** Time elapsed for 14 threads: 0.200000 *** Time elapsed for 15 threads: 0.235000 *** Time elapsed for 16 threads: 0.198000 In this case, the performance is similar, with perhaps a slight advantage for the new partition scheme, but I don't know if it is worth to make it the default (probably not, as this partition performs clearly worse on non-NUMA machines). At any rate, both partitions perform very close to the aggregated memory bandwidth of NUMA machines (around 10 GB/s in the above case). In general, I don't think there is much point in using Intel's TBB in numexpr because the existing implementation already hits memory bandwidth limits pretty early (around 10 threads in the latter example). -- Francesc Alted -------------- next part -------------- A non-text attachment was scrubbed... Name: new_partition.diff Type: text/x-patch Size: 3778 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: poly.py Type: text/x-python Size: 1620 bytes Desc: not available URL: From totonixsame at gmail.com Mon Jan 10 08:53:05 2011 From: totonixsame at gmail.com (totonixsame at gmail.com) Date: Mon, 10 Jan 2011 11:53:05 -0200 Subject: [Numpy-discussion] Drawing circles in a numpy array Message-ID: Hi all, I have this problem: Given some point draw a circle centered in this point with radius r. I'm doing that using numpy this way (Snippet code from here [1]): >>> # Create the initial black and white image >>> import numpy as np >>> from scipy import ndimage >>> a = np.zeros((512, 512)).astype(uint8) #unsigned integer type needed by watershed >>> y, x = np.ogrid[0:512, 0:512] >>> m1 = ((y-200)**2 + (x-100)**2 < 30**2) >>> m2 = ((y-350)**2 + (x-400)**2 < 20**2) >>> m3 = ((y-260)**2 + (x-200)**2 < 20**2) >>> a[m1+m2+m3]=1 >>> imshow(a, cmap = cm.gray)# left plot in the image above The problem is that it have to evaluate all values from 0 to image size (in snippet, from 0 to 512 in X and Y dimensions). There is a faster way of doing that? Without evaluate all that values? For example: only evaluate from 0 to 30, in a circle centered in (0, 0) with radius 30. Thanks! Thiago Franco de Moraes [1] - http://www.scipy.org/Cookbook/Watershed From Thomas.EMMEL at 3ds.com Mon Jan 10 08:54:50 2011 From: Thomas.EMMEL at 3ds.com (EMMEL Thomas) Date: Mon, 10 Jan 2011 13:54:50 +0000 Subject: [Numpy-discussion] speed of numpy.ndarray compared to Numeric.array In-Reply-To: <1294658570.2490.33.camel@sebastian> References: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com> <1294658570.2490.33.camel@sebastian> Message-ID: <3A0080EEBFB19C4993C24098DD0A78D121073232@EU-DCC-MBX02.dsone.3ds.com> Hey back... 
> > > #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ > ~ > ~~~ > > def bruteForceSearch(points, point): > > > > minpt = min([(vec2Norm(pt, point), pt, i) > > for i, pt in enumerate(points)], key=itemgetter(0)) > > return sqrt(minpt[0]), minpt[1], minpt[2] > > > > > #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ > ~ > ~~~~ > > def vec2Norm(pt1,pt2): > > xDis = pt1[0]-pt2[0] > > yDis = pt1[1]-pt2[1] > > zDis = pt1[2]-pt2[2] > > return xDis*xDis+yDis*yDis+zDis*zDis > > > > I have a more clever method but it still takes a lot of time in the > vec2norm-function. > > If you like I can attach a running example. > > > > if you use the vec2Norm function as you wrote it there, this code is > not vectorized at all, and as such of course numpy would be slowest as > it has the most overhead and no advantages for non vectorized code, > you simply can't write python code like that and expect it to be fast > for these kind of calculations. > > Your function should look more like this: > > import numpy as np > > def bruteForceSearch(points, point): > dists = points - point > # that may need point[None,:] or such for broadcasting to work > dists *= dists > dists = dists.sum(1) > I = np.argmin(dists) > return sqrt(dists[I]), points[I], I > > If points is small, this may not help much (though compared to this > exact code my guess is it probably would), if points is larger it > should speed up things tremendously (unless you run into RAM > problems). It may be that you need to fiddle around with axes, I did > not check the code. > If this is not good enough for you (you will need to port it (and > maybe the next outer loop as well) to Cython or write it in C/C++ and > make sure it can optimize things right. Also I think somewhere in > scipy there were some distance tools that may be already in C and nice > fast, but not sure. > > I hope I got this right and it helps, > > Sebastian > I see the point and it was very helpful to understand the behavior of the arrays a bit better. And your attempt improved the bruteForceSearch which is up to 6 times faster. But in case of a leaf in a kd-tree you end up with 50, 20, 10 or less points where the speed-up is reversed. In this particular case 34000 runs take 90s with your method and 50s with mine (not the bruteForce). I see now the limits of the arrays but of course I see the chances and - coming back to my original question - it seems that Numeric arrays were faster for my kind of application but they might be slower for larger amounts of data. Regards Thomas This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email.For other languages, go to http://www.3ds.com/terms/email-disclaimer. 
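A minimal sketch of the scipy.spatial route alluded to above, where the kd-tree itself is implemented in C so both the tree traversal and the per-leaf distance work stay out of Python. This assumes scipy >= 0.7 (which provides cKDTree); the array contents and sizes below are only illustrative:

    import numpy as np
    from scipy.spatial import cKDTree

    points = np.random.rand(100000, 3)   # illustrative cloud of (x, y, z) coordinates
    target = np.array([0.1, 0.2, 0.3])   # the coordinate to search from

    tree = cKDTree(points)               # build once (done in C)
    dist, idx = tree.query(target)       # nearest neighbour: Euclidean distance and index
    print dist, points[idx], idx

Building the tree costs something up front, but repeated queries against the same point set then avoid the small-leaf Python overhead discussed above.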
From pascal22p at parois.net Mon Jan 10 10:25:33 2011 From: pascal22p at parois.net (Pascal) Date: Mon, 10 Jan 2011 16:25:33 +0100 Subject: [Numpy-discussion] speed of numpy.ndarray compared to Numeric.array In-Reply-To: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com> References: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com> Message-ID: <4D2B24ED.10105@parois.net> Hi, On 01/10/2011 09:09 AM, EMMEL Thomas wrote: > > No I didn't, due to the fact that these values are coordinates in 3D (x,y,z). > In fact I work with a list/array/tuple of arrays with 100000 to 1M of elements or more. > What I need to do is to calculate the distance of each of these elements (coordinates) > to a given coordinate and filter for the nearest. > The brute force method would look like this: > > > #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ > def bruteForceSearch(points, point): > > minpt = min([(vec2Norm(pt, point), pt, i) > for i, pt in enumerate(points)], key=itemgetter(0)) > return sqrt(minpt[0]), minpt[1], minpt[2] > > #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ > def vec2Norm(pt1,pt2): > xDis = pt1[0]-pt2[0] > yDis = pt1[1]-pt2[1] > zDis = pt1[2]-pt2[2] > return xDis*xDis+yDis*yDis+zDis*zDis > I am not sure I understood the problem properly but here what I would use to calculate a distance from horizontally stacked vectors (big): ref=numpy.array([0.1,0.2,0.3]) big=numpy.random.randn(1000000, 3) big=numpy.add(big,-ref) distsquared=numpy.sum(big**2, axis=1) Pascal From n.becker at amolf.nl Mon Jan 10 11:08:38 2011 From: n.becker at amolf.nl (Nils Becker) Date: Mon, 10 Jan 2011 17:08:38 +0100 Subject: [Numpy-discussion] indexing of rank-0 structured arrays: why not? Message-ID: <4D2B2F06.6090100@amolf.nl> Hi, I noticed that I can index into a dtype when I take an element of a rank-1 array but not if I make a rank-0 array directly. This seems inconsistent. A bug? Nils In [76]: np.version.version Out[76]: '1.5.1' In [78]: dt = np.dtype([('x', ' in () IndexError: 0-d arrays can't be indexed In [87]: a_rank_0['x'] Out[87]: array(0.0) From renesd at gmail.com Mon Jan 10 11:23:05 2011 From: renesd at gmail.com (=?ISO-8859-1?Q?Ren=E9_Dudfield?=) Date: Mon, 10 Jan 2011 16:23:05 +0000 Subject: [Numpy-discussion] speed of numpy.ndarray compared to Numeric.array In-Reply-To: <4D2B24ED.10105@parois.net> References: <3A0080EEBFB19C4993C24098DD0A78D108D1275F@EU-DCC-MBX01.dsone.3ds.com> <4D2B24ED.10105@parois.net> Message-ID: Hi, Spatial hashes are the common solution. Another common optimization is using the distance squared for collision detection. Since you do not need the expensive sqrt for this calc. cu. On Mon, Jan 10, 2011 at 3:25 PM, Pascal wrote: > Hi, > > On 01/10/2011 09:09 AM, EMMEL Thomas wrote: >> >> No I didn't, due to the fact that these values are coordinates in 3D (x,y,z). >> In fact I work with a list/array/tuple of arrays with 100000 to 1M of elements or more. >> What I need to do is to calculate the distance of each of these elements (coordinates) >> to a given coordinate and filter for the nearest. >> The brute force method would look like this: >> >> >> #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ >> def bruteForceSearch(points, point): >> >> ? ? ?minpt = min([(vec2Norm(pt, point), pt, i) >> ? ? ? ? ? ? ? ? ? for i, pt in enumerate(points)], key=itemgetter(0)) >> ? ? 
?return sqrt(minpt[0]), minpt[1], minpt[2] >> >> #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ >> def vec2Norm(pt1,pt2): >> ? ? ?xDis = pt1[0]-pt2[0] >> ? ? ?yDis = pt1[1]-pt2[1] >> ? ? ?zDis = pt1[2]-pt2[2] >> ? ? ?return xDis*xDis+yDis*yDis+zDis*zDis >> > > I am not sure I understood the problem properly but here what I would > use to calculate a distance from horizontally stacked vectors (big): > > ref=numpy.array([0.1,0.2,0.3]) > big=numpy.random.randn(1000000, 3) > > big=numpy.add(big,-ref) > distsquared=numpy.sum(big**2, axis=1) > > Pascal > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From mwwiebe at gmail.com Mon Jan 10 11:54:16 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Mon, 10 Jan 2011 08:54:16 -0800 Subject: [Numpy-discussion] numexpr with the new iterator In-Reply-To: <201101101105.27421.faltet@pytables.org> References: <201101101105.27421.faltet@pytables.org> Message-ID: On Mon, Jan 10, 2011 at 2:05 AM, Francesc Alted wrote: > > > Your patch looks mostly fine to my eyes; good job! Unfortunately, I've > been unable to compile your new_iterator branch of NumPy: > > numpy/core/src/multiarray/multiarraymodule.c:45:33: fatal error: > new_iterator_pywrap.h: El fitxer o directori no existeix > > Apparently, you forgot to add the new_iterator_pywrap.h file. > Oops, that's added now. > My idea would be to merge your patch in numexpr and make the new > `evaluate_iter()` the default (i.e. make it `evaluate()`). However, by > looking into the code, it seems to me that unaligned arrays (this is an > important use case when operating with columns of structured arrays) may > need more fine-tuning for Intel platforms. When I can compile the > new_iterator branch, I'll give a try at unaligned data benchs. > The aligned case should just be a matter of conditionally removing the NPY_ITER_ALIGNED flag in two places. The new code also needs support for the reduce operation. I didn't look too closely at the code for that, but a nested iteration pattern is probably appropriate. If the inner loop is just allowed to be one dimension, it could be done without actually creating the inner iterator. -Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Mon Jan 10 12:05:45 2011 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 10 Jan 2011 11:05:45 -0600 Subject: [Numpy-discussion] indexing of rank-0 structured arrays: why not? In-Reply-To: <4D2B2F06.6090100@amolf.nl> References: <4D2B2F06.6090100@amolf.nl> Message-ID: On Mon, Jan 10, 2011 at 10:08, Nils Becker wrote: > Hi, > > I noticed that I can index into a dtype when I take an element > of a rank-1 array but not if I make a rank-0 array directly. This seems > inconsistent. A bug? Not a bug. Since there is no axis, you cannot use integers to index into a rank-0 array. Use an empty tuple instead. [~] |1> dt = np.dtype([('x', ' a_rank_0 = np.zeros((), dtype=dt) [~] |3> a_rank_0[()] (0.0, 0.0) -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." ? 
-- Umberto Eco From faltet at pytables.org Mon Jan 10 12:47:02 2011 From: faltet at pytables.org (Francesc Alted) Date: Mon, 10 Jan 2011 18:47:02 +0100 Subject: [Numpy-discussion] numexpr with the new iterator In-Reply-To: References: <201101101105.27421.faltet@pytables.org> Message-ID: <201101101847.02890.faltet@pytables.org> A Monday 10 January 2011 17:54:16 Mark Wiebe escrigu?: > > Apparently, you forgot to add the new_iterator_pywrap.h file. > > Oops, that's added now. Excellent. It works now. > The aligned case should just be a matter of conditionally removing > the NPY_ITER_ALIGNED flag in two places. Wow, the support for unaligned in current `evaluate_iter()` seems pretty nice already: $ python unaligned-simple.py -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=- Numexpr version: 1.5.dev NumPy version: 2.0.0.dev-ebc963d Python version: 2.6.1 (r261:67515, Feb 3 2009, 17:34:37) [GCC 4.3.2 [gcc-4_3-branch revision 141291]] Platform: linux2-x86_64 AMD/Intel CPU? True VML available? False Detected cores: 2 -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=- NumPy aligned: 0.658 s NumPy unaligned: 1.597 s Numexpr aligned: 0.59 s Numexpr aligned (new iter): 0.59 s Numexpr unaligned: 0.51 s Numexpr unaligned (new_iter): 0.528 s so, the new code is just < 5% slower. I suppose that removing the NPY_ITER_ALIGNED flag would give us a bit more performance, but that's great as it is now. How did you do that? Your new_iter branch in NumPy already deals with unaligned data, right? > The new code also needs support for the reduce operation. I didn't > look too closely at the code for that, but a nested iteration > pattern is probably appropriate. If the inner loop is just allowed > to be one dimension, it could be done without actually creating the > inner iterator. Well, if you can support reduce operations with your patch that would be extremely good news as I'm afraid that the current reduce code is a bit broken in Numexpr (at least, I vaguely remember seeing it working badly in some cases). -- Francesc Alted From n.becker at amolf.nl Mon Jan 10 13:15:13 2011 From: n.becker at amolf.nl (Nils Becker) Date: Mon, 10 Jan 2011 19:15:13 +0100 Subject: [Numpy-discussion] indexing of rank-0 structured arrays: why not? Message-ID: <4D2B4CB1.2090404@amolf.nl> Robert, your answer does work: after indexing with () I can then further index into the datatype. In [115]: a_rank_0[()][0] Out[115]: 0.0 I guess I just found the fact confusing that a_rank_1[0] and a_rank_0 compare and print equal but behave differently under indexing. More precisely if I do In [117]: b = a_rank_1[0] then In [118]: b.shape Out[118]: () and In [120]: a_rank_0 == b Out[120]: True but In [119]: b[0] Out[119]: 0.0 works but a_rank_0[0] doesn't. I thought b is a rank-0 array which it apparently is not since it can be indexed. So maybe b[0] should fail for consistency? N. From mwwiebe at gmail.com Mon Jan 10 13:29:33 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Mon, 10 Jan 2011 10:29:33 -0800 Subject: [Numpy-discussion] numexpr with the new iterator In-Reply-To: <201101101847.02890.faltet@pytables.org> References: <201101101105.27421.faltet@pytables.org> <201101101847.02890.faltet@pytables.org> Message-ID: On Mon, Jan 10, 2011 at 9:47 AM, Francesc Alted wrote: > > > so, the new code is just < 5% slower. I suppose that removing the > NPY_ITER_ALIGNED flag would give us a bit more performance, but that's > great as it is now. How did you do that? 
Your new_iter branch in NumPy > already deals with unaligned data, right? > Take a look at lowlevel_strided_loops.c.src. In this case, the buffering setup code calls PyArray_GetDTypeTransferFunction, which in turn calls PyArray_GetStridedCopyFn, which on an x86 platform returns _aligned_strided_to_contig_size8. This function has a simple loop of copies using a npy_uint64 data type. > The new code also needs support for the reduce operation. I didn't > > look too closely at the code for that, but a nested iteration > > pattern is probably appropriate. If the inner loop is just allowed > > to be one dimension, it could be done without actually creating the > > inner iterator. > > Well, if you can support reduce operations with your patch that would be > extremely good news as I'm afraid that the current reduce code is a bit > broken in Numexpr (at least, I vaguely remember seeing it working badly > in some cases). > Cool, I'll take a look at some point. I imagine with the most obvious implementation small reductions would perform poorly. -Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Mon Jan 10 14:16:02 2011 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 10 Jan 2011 13:16:02 -0600 Subject: [Numpy-discussion] indexing of rank-0 structured arrays: why not? In-Reply-To: <4D2B4CB1.2090404@amolf.nl> References: <4D2B4CB1.2090404@amolf.nl> Message-ID: On Mon, Jan 10, 2011 at 12:15, Nils Becker wrote: > Robert, > > your answer does work: after indexing with () I can then further index > into the datatype. > > In [115]: a_rank_0[()][0] > Out[115]: 0.0 > > I guess I just found the fact confusing that a_rank_1[0] and a_rank_0 > compare and print equal but behave differently under indexing. They do not print equal. Many things compare equal but do not behave the same. > More precisely if I do > In [117]: b = a_rank_1[0] > > then > > In [118]: b.shape > Out[118]: () > > and > > In [120]: a_rank_0 == b > Out[120]: True > > but > > In [119]: b[0] > Out[119]: 0.0 > > works but a_rank_0[0] doesn't. I thought b is a rank-0 array which it > apparently is not since it can be indexed. So maybe b[0] should fail for > consistency? No, b is a record scalar. It can be indexed because it is often convient to treat such records like tuples. This replaces the default indexing behavior of scalars (which is to simply disallow indexing). a_rank_0 is an array, so the array indexing semantics are the default, and we do not change them. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." ? -- Umberto Eco From mwwiebe at gmail.com Mon Jan 10 14:35:08 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Mon, 10 Jan 2011 11:35:08 -0800 Subject: [Numpy-discussion] numexpr with the new iterator In-Reply-To: <201101101255.16286.faltet@pytables.org> References: <201101101105.27421.faltet@pytables.org> <201101101255.16286.faltet@pytables.org> Message-ID: I'm a bit curious why the jump from 1 to 2 threads is scaling so poorly. Your timings have improvement factors of 1.85, 1.68, 1.64, and 1.79. Since the computation is trivial data parallelism, and I believe it's still pretty far off the memory bandwidth limit, I would expect a speedup of 1.95 or higher. One reason I suggest TBB is that it can produce a pretty good schedule while still adapting to load produced by other processes and threads. 
Numexpr currently does that well, but simply dividing the data into one piece per thread doesn't handle that case very well, and makes it possible that one thread spends a fair bit of time finishing up while the others idle at the end. Perhaps using Cilk would be a better option than TBB, since the code could remain in C. -Mark On Mon, Jan 10, 2011 at 3:55 AM, Francesc Alted wrote: > A Monday 10 January 2011 11:05:27 Francesc Alted escrigu?: > > Also, I'd like to try out the new thread scheduling that you > > suggested to me privately (i.e. T0T1T0T1... vs T0T0...T1T1...). > > I've just implemented the new partition schema in numexpr > (T0T0...T1T1..., being the original T0T1T0T1...). I'm attaching the > patch for this. The results are a bit confusing. For example, using > the attached benchmark (poly.py), I get these results for a common dual- > core machine, non-NUMA machine: > > With the T0T1...T0T1... (original) schema: > > Computing: '((.25*x + .75)*x - 1.5)*x - 2' with 100000000 points > Using numpy: > *** Time elapsed: 3.497 > Using numexpr: > *** Time elapsed for 1 threads: 1.279000 > *** Time elapsed for 2 threads: 0.688000 > > With the T0T0...T1T1... (new) schema: > > Computing: '((.25*x + .75)*x - 1.5)*x - 2' with 100000000 points > Using numpy: > *** Time elapsed: 3.454 > Using numexpr: > *** Time elapsed for 1 threads: 1.268000 > *** Time elapsed for 2 threads: 0.754000 > > which is around a 10% slower (2 threads) than the original partition. > > The results are a bit different on a NUMA machine (8 physical cores, 16 > logical cores via hyper-threading): > > With the T0T1...T0T1... (original) partition: > > Computing: '((.25*x + .75)*x - 1.5)*x - 2' with 100000000 points > Using numpy: > *** Time elapsed: 3.005 > Using numexpr: > *** Time elapsed for 1 threads: 1.109000 > *** Time elapsed for 2 threads: 0.677000 > *** Time elapsed for 3 threads: 0.496000 > *** Time elapsed for 4 threads: 0.394000 > *** Time elapsed for 5 threads: 0.324000 > *** Time elapsed for 6 threads: 0.287000 > *** Time elapsed for 7 threads: 0.247000 > *** Time elapsed for 8 threads: 0.234000 > *** Time elapsed for 9 threads: 0.242000 > *** Time elapsed for 10 threads: 0.239000 > *** Time elapsed for 11 threads: 0.241000 > *** Time elapsed for 12 threads: 0.235000 > *** Time elapsed for 13 threads: 0.226000 > *** Time elapsed for 14 threads: 0.214000 > *** Time elapsed for 15 threads: 0.235000 > *** Time elapsed for 16 threads: 0.218000 > > With the T0T0...T1T1... 
(new) partition: > > Computing: '((.25*x + .75)*x - 1.5)*x - 2' with 100000000 points > Using numpy: > *** Time elapsed: 3.003 > Using numexpr: > *** Time elapsed for 1 threads: 1.106000 > *** Time elapsed for 2 threads: 0.617000 > *** Time elapsed for 3 threads: 0.442000 > *** Time elapsed for 4 threads: 0.345000 > *** Time elapsed for 5 threads: 0.296000 > *** Time elapsed for 6 threads: 0.257000 > *** Time elapsed for 7 threads: 0.237000 > *** Time elapsed for 8 threads: 0.260000 > *** Time elapsed for 9 threads: 0.245000 > *** Time elapsed for 10 threads: 0.261000 > *** Time elapsed for 11 threads: 0.238000 > *** Time elapsed for 12 threads: 0.210000 > *** Time elapsed for 13 threads: 0.218000 > *** Time elapsed for 14 threads: 0.200000 > *** Time elapsed for 15 threads: 0.235000 > *** Time elapsed for 16 threads: 0.198000 > > In this case, the performance is similar, with perhaps a slight > advantage for the new partition scheme, but I don't know if it is worth > to make it the default (probably not, as this partition performs clearly > worse on non-NUMA machines). At any rate, both partitions perform very > close to the aggregated memory bandwidth of NUMA machines (around 10 > GB/s in the above case). > > In general, I don't think there is much point in using Intel's TBB in > numexpr because the existing implementation already hits memory > bandwidth limits pretty early (around 10 threads in the latter example). > > -- > Francesc Alted > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwwiebe at gmail.com Tue Jan 11 00:45:28 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Mon, 10 Jan 2011 21:45:28 -0800 Subject: [Numpy-discussion] numexpr with the new iterator In-Reply-To: References: <201101101105.27421.faltet@pytables.org> <201101101255.16286.faltet@pytables.org> Message-ID: On Mon, Jan 10, 2011 at 11:35 AM, Mark Wiebe wrote: > I'm a bit curious why the jump from 1 to 2 threads is scaling so poorly. > Your timings have improvement factors of 1.85, 1.68, 1.64, and 1.79. Since > the computation is trivial data parallelism, and I believe it's still pretty > far off the memory bandwidth limit, I would expect a speedup of 1.95 or > higher. It looks like it is the memory bandwidth which is limiting the scalability. The slower operations scale much better than faster ones. Below are some timings of successively faster operations. When the operation is slow enough, it scales like I was expecting... 
-Mark Computing: 'cos(x**1.1) + sin(x**1.3) + tan(x**2.3)' with 20000000 points Using numpy: *** Time elapsed: 14.47 Using numexpr: *** Time elapsed for 1 threads: 12.659000 *** Time elapsed for 2 threads: 6.357000 *** Ratio from 1 to 2 threads: 1.991348 Using numexpr_iter: *** Time elapsed for 1 threads: 12.573000 *** Time elapsed for 2 threads: 6.398000 *** Ratio from 1 to 2 threads: 1.965145 Computing: 'x**2.345' with 20000000 points Using numpy: *** Time elapsed: 3.506 Using numexpr: *** Time elapsed for 1 threads: 3.375000 *** Time elapsed for 2 threads: 1.747000 *** Ratio from 1 to 2 threads: 1.931883 Using numexpr_iter: *** Time elapsed for 1 threads: 3.266000 *** Time elapsed for 2 threads: 1.760000 *** Ratio from 1 to 2 threads: 1.855682 Computing: '1*x+2*x+3*x+4*x+5*x+6*x+7*x+8*x+9*x+10*x+11*x+12*x+13*x+14*x' with 20000000 points Using numpy: *** Time elapsed: 9.774 Using numexpr: *** Time elapsed for 1 threads: 1.314000 *** Time elapsed for 2 threads: 0.703000 *** Ratio from 1 to 2 threads: 1.869132 Using numexpr_iter: *** Time elapsed for 1 threads: 1.257000 *** Time elapsed for 2 threads: 0.683000 *** Ratio from 1 to 2 threads: 1.840410 Computing: 'x+2.345' with 20000000 points Using numpy: *** Time elapsed: 0.343 Using numexpr: *** Time elapsed for 1 threads: 0.348000 *** Time elapsed for 2 threads: 0.300000 *** Ratio from 1 to 2 threads: 1.160000 Using numexpr_iter: *** Time elapsed for 1 threads: 0.354000 *** Time elapsed for 2 threads: 0.293000 *** Ratio from 1 to 2 threads: 1.208191 -------------- next part -------------- An HTML attachment was scrubbed... URL: From faltet at pytables.org Tue Jan 11 05:44:15 2011 From: faltet at pytables.org (Francesc Alted) Date: Tue, 11 Jan 2011 11:44:15 +0100 Subject: [Numpy-discussion] numexpr with the new iterator In-Reply-To: References: <201101101847.02890.faltet@pytables.org> Message-ID: <201101111144.15649.faltet@pytables.org> A Monday 10 January 2011 19:29:33 Mark Wiebe escrigu?: > > so, the new code is just < 5% slower. I suppose that removing the > > NPY_ITER_ALIGNED flag would give us a bit more performance, but > > that's great as it is now. How did you do that? Your new_iter > > branch in NumPy already deals with unaligned data, right? > > Take a look at lowlevel_strided_loops.c.src. In this case, the > buffering setup code calls PyArray_GetDTypeTransferFunction, which > in turn calls PyArray_GetStridedCopyFn, which on an x86 platform > returns > _aligned_strided_to_contig_size8. This function has a simple loop of > copies using a npy_uint64 data type. I see. Brilliant! > > Well, if you can support reduce operations with your patch that > > would be extremely good news as I'm afraid that the current reduce > > code is a bit broken in Numexpr (at least, I vaguely remember > > seeing it working badly in some cases). > > Cool, I'll take a look at some point. I imagine with the most > obvious implementation small reductions would perform poorly. IMO, reductions like sum() or prod() are mainly limited my memory access, so my advise would be to not try to over-optimize here, and just make use of the new iterator. We can refine performance later on. 
-- Francesc Alted From faltet at pytables.org Tue Jan 11 06:58:27 2011 From: faltet at pytables.org (Francesc Alted) Date: Tue, 11 Jan 2011 12:58:27 +0100 Subject: [Numpy-discussion] numexpr with the new iterator In-Reply-To: References: Message-ID: <201101111258.27489.faltet@pytables.org> A Tuesday 11 January 2011 06:45:28 Mark Wiebe escrigu?: > On Mon, Jan 10, 2011 at 11:35 AM, Mark Wiebe wrote: > > I'm a bit curious why the jump from 1 to 2 threads is scaling so > > poorly. > > > > Your timings have improvement factors of 1.85, 1.68, 1.64, and > > 1.79. Since > > > > the computation is trivial data parallelism, and I believe it's > > still pretty far off the memory bandwidth limit, I would expect a > > speedup of 1.95 or higher. > > It looks like it is the memory bandwidth which is limiting the > scalability. Indeed, this is an increasingly important problem for modern computers. You may want to read: http://www.pytables.org/docs/CISE-12-2-ScientificPro.pdf ;-) > The slower operations scale much better than faster > ones. Below are some timings of successively faster operations. > When the operation is slow enough, it scales like I was expecting... [clip] Yeah, for another example on this with more threads, see: http://code.google.com/p/numexpr/wiki/MultiThreadVM OTOH, I was curious about the performance of the new iterator with Intel's VML, but it seems to work decently too: $ python bench/vml_timing.py (original numexpr, *no* VML support) *************** Numexpr vs NumPy speed-ups ******************* Contiguous case: 1.72 (mean), 0.92 (min), 3.07 (max) Strided case: 2.1 (mean), 0.98 (min), 3.52 (max) Unaligned case: 2.35 (mean), 1.35 (min), 3.31 (max) $ python bench/vml_timing.py (original numexpr, VML support) *************** Numexpr vs NumPy speed-ups ******************* Contiguous case: 3.83 (mean), 1.1 (min), 10.19 (max) Strided case: 3.21 (mean), 0.98 (min), 7.45 (max) Unaligned case: 3.6 (mean), 1.47 (min), 7.87 (max) $ python bench/vml_timing.py (new iter numexpr, VML support) *************** Numexpr vs NumPy speed-ups ******************* Contiguous case: 3.56 (mean), 1.12 (min), 7.38 (max) Strided case: 2.37 (mean), 0.09 (min), 7.63 (max) Unaligned case: 3.56 (mean), 2.08 (min), 5.88 (max) However, there a couple of quirks here. 1) The original Numexpr performs generally faster than the iter version. 2) The strided case is quite worse for the iter version. I've isolated the tests that performs worse for the iter version, and here are a couple of samples: *************** Expression: exp(f3) numpy: 0.0135 numpy strided: 0.0144 numpy unaligned: 0.0200 numexpr: 0.0020 Speed-up of numexpr over numpy: 6.6584 numexpr strided: 0.1495 Speed-up of numexpr over numpy: 0.0962 numexpr unaligned: 0.0049 Speed-up of numexpr over numpy: 4.0859 *************** Expression: sin(f3)>cos(f4) numpy: 0.0291 numpy strided: 0.0366 numpy unaligned: 0.0407 numexpr: 0.0166 Speed-up of numexpr over numpy: 1.7518 numexpr strided: 0.1551 Speed-up of numexpr over numpy: 0.2361 numexpr unaligned: 0.0175 Speed-up of numexpr over numpy: 2.3246 Maybe you can shed some light on what's going on here (shall we discuss this off-the-list so as to not bore people too much?). 
-- Francesc Alted From totonixsame at gmail.com Tue Jan 11 11:13:03 2011 From: totonixsame at gmail.com (totonixsame at gmail.com) Date: Tue, 11 Jan 2011 14:13:03 -0200 Subject: [Numpy-discussion] Drawing circles in a numpy array In-Reply-To: References: Message-ID: On Mon, Jan 10, 2011 at 11:53 AM, totonixsame at gmail.com wrote: > Hi all, > > I have this problem: Given some point draw a circle centered in this > point with radius r. I'm doing that using numpy this way (Snippet code > from here [1]): > >>>> # Create the initial black and white image >>>> import numpy as np >>>> from scipy import ndimage >>>> a = np.zeros((512, 512)).astype(uint8) #unsigned integer type needed by watershed >>>> y, x = np.ogrid[0:512, 0:512] >>>> m1 = ((y-200)**2 + (x-100)**2 < 30**2) >>>> m2 = ((y-350)**2 + (x-400)**2 < 20**2) >>>> m3 = ((y-260)**2 + (x-200)**2 < 20**2) >>>> a[m1+m2+m3]=1 >>>> imshow(a, cmap = cm.gray)# left plot in the image above > > The problem is that it have to evaluate all values from 0 to image > size (in snippet, from 0 to 512 in X and Y dimensions). There is a > faster way of doing that? Without evaluate all that values? For > example: only evaluate from 0 to 30, in a circle centered in (0, 0) > with radius 30. > > Thanks! > Thiago Franco de Moraes > > [1] - http://www.scipy.org/Cookbook/Watershed Hi, I've just seen I can do something like this: >>> radius = 10 >>> a = np.zeros((512, 512)).astype('uint8') >>> cx, cy = 100, 100 # The center of circle >>> y, x = np.ogrid[-radius: radius, -radius: radius] >>> index = x**2 + y**2 <= radius**2 >>> a[cy-radius:cy+radius, cx-radius:cx+radius][index] = 255 Numpy is very cool! Is there other way of doing that? Only to know ... Thanks! Thiago Franco de Moraes From kwgoodman at gmail.com Tue Jan 11 13:46:21 2011 From: kwgoodman at gmail.com (Keith Goodman) Date: Tue, 11 Jan 2011 10:46:21 -0800 Subject: [Numpy-discussion] Rolling window (moving average, moving std, and more) In-Reply-To: References: <4D21F8BD.60003@hawaii.edu> Message-ID: On Tue, Jan 4, 2011 at 8:14 AM, Keith Goodman wrote: > On Tue, Jan 4, 2011 at 8:06 AM, Sebastian Haase wrote: >> On Mon, Jan 3, 2011 at 5:32 PM, Erik Rigtorp wrote: >>> On Mon, Jan 3, 2011 at 11:26, Eric Firing wrote: >>>> Instead of calculating statistics independently each time the window is >>>> advanced one data point, the statistics are updated. ?I have not done >>>> any benchmarking, but I expect this approach to be quick. >>> >>> This might accumulate numerical errors. But could be fine for many applications. >>> >>>> The code is old; I have not tried to update it to take advantage of >>>> cython's advances over pyrex. ?If I were writing it now, I might not >>>> bother with the C level at all; it could all be done in cython, probably >>>> with no speed penalty, and maybe even with reduced overhead. >>>> >>> >>> No doubt this would be faster, I just wanted to offer a general way to >>> this in NumPy. >>> _______________________________________________ >> >> BTW, some of these operations can be done using scipy's ndimage ?- right ? >> Any comments ? ?How does the performance compare ? >> ndimage might have more options regarding edge handling, or ? > > Take a look at the moving window function in the development version > of the la package: > > https://github.com/kwgoodman/la/blob/master/la/farray/mov.py > > Many of the moving window functions offer three calculation methods: > filter (ndimage), strides (the strides trick discussed in this > thread), and loop (a simple python loop). 
> > For example: > >>> a = np.random.rand(500,2000) >>> timeit la.farray.mov_max(a, window=252, axis=-1, method='filter') > 1 loops, best of 3: 336 ms per loop >>> timeit la.farray.mov_max(a, window=252, axis=-1, method='strides') > 1 loops, best of 3: 609 ms per loop >>> timeit la.farray.mov_max(a, window=252, axis=-1, method='loop') > 1 loops, best of 3: 638 ms per loop > > No one method is best for all situations. That is one of the reasons I > started the Bottleneck package. I figured Cython could beat them all. I added four new function to Bottleneck: move_min, move_max, move_nanmin, move_nanmax. They are much faster than using SciPy's ndimage.maximum_filter1d or the strides trick: >> a = np.random.rand(500,2000) >> timeit la.farray.mov_max(a, window=252, axis=-1, method='filter') # ndimage 1 loops, best of 3: 336 ms per loop >> timeit bn.move_max(a, window=252, axis=-1) # bottleneck 100 loops, best of 3: 14.1 ms per loop That looks too good to be true. Are the outputs the same? >> a1 = la.farray.mov_max(a, window=252, axis=-1, method='filter') >> a2 = bn.move_max(a, window=252, axis=-1) >> np.testing.assert_array_almost_equal(a1, a2) >> Yes. From mfrank at ari.uni-heidelberg.de Tue Jan 11 14:21:40 2011 From: mfrank at ari.uni-heidelberg.de (Matthias Frank) Date: Tue, 11 Jan 2011 20:21:40 +0100 Subject: [Numpy-discussion] histogram2d and decreasing bin edges Message-ID: <4D2CADC4.1010100@ari.uni-heidelberg.de> Hi all, I've noticed a change in numpy.histogram2d between (possibly very much) older versions and the current one: The function can no longer handle the situation where bin edges decrease instead of increasing monotonically. The reason for this seems to be the handling of outliers histogramdd, see the output of minimal example below. If I understand correctly, this is the only place where histogramdd implicitly assumes monotonically increasing bin edges. If so, this could be fixed to work with increasing and decreasing bin edges by taking abs(dedges[i]).min() when calculating the rounding precision. If not, it might be more consistent, and produce a more meaningful error message, if histogram2d asserted that bin edges increase monotonically and otherwise raised an AttributeError as the 1-d histogram() function does in that case (see below) Matthias In [1]: import numpy In [2]: numpy.__version__ Out[2]: '1.5.1' In [3]: ascending=numpy.array([0,1]) In [4]: descending=numpy.array([1,0]) In [5]: numpy.histogram2d([0.5],[0.5],bins=(ascending,ascending)) Out[5]: (array([[ 1.]]), array([ 0., 1.]), array([ 0., 1.])) In [6]: numpy.histogram2d([0.5],[0.5],bins=(descending,descending)) Warning: invalid value encountered in log10 --------------------------------------------------------------------------- ValueError Traceback (most recent call last) /lib/python2.6/site-packages/numpy/lib/twodim_base.pyc in histogram2d(x, y, bins, range, normed, weights) 613 xedges = yedges = asarray(bins, float) 614 bins = [xedges, yedges] --> 615 hist, edges = histogramdd([x,y], bins, range, normed, weights) 616 return hist, edges[0], edges[1] 617 /lib/python2.6/site-packages/numpy/lib/function_base.pyc in histogramdd(sample, bins, range, normed, weights) 312 for i in arange(D): 313 # Rounding precision --> 314 decimal = int(-log10(dedges[i].min())) +6 315 # Find which points are on the rightmost edge. 
316 on_edge = where(around(sample[:,i], decimal) == around(edges[i][-1], ValueError: cannot convert float NaN to integer Behavior of the 1-d histogram() In [8]: numpy.histogram([0.5],bins=ascending) Out[8]: (array([1]), array([0, 1])) In [9]: numpy.histogram([0.5],bins=descending) --------------------------------------------------------------------------- AttributeError Traceback (most recent call last) /lib/python2.6/site-packages/numpy/lib/function_base.pyc in histogram(a, bins, range, normed, weights) 160 if (np.diff(bins) < 0).any(): 161 raise AttributeError( --> 162 'bins must increase monotonically.') 163 164 # Histogram is an integer or a float array depending on the weights. AttributeError: bins must increase monotonically. From bje at air.net.au Wed Jan 12 06:45:54 2011 From: bje at air.net.au (Ben Elliston) Date: Wed, 12 Jan 2011 22:45:54 +1100 Subject: [Numpy-discussion] mapping a function to a masked array Message-ID: <20110112114554.GA23259@air.net.au> I have a masked array of values that I would like to transform through a user-defined function. Naturally, I want to ignore any values that are masked in the initial array. The user-defined function examines other points around the value in question, so I need to use ndenumerate (or similar) to get the array index as I iterate over the array. So, I have two questions: how to make this run without looping in Python, and how to avoid masked values. Here is the clunky solution I have so far: result = ma.copy (data) for i, val in ndenumerate (data): if not data.mask[i]: result[i] = myfunc (data, i, val) Any suggestions? Thanks, Ben -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: Digital signature URL: From pgmdevlist at gmail.com Wed Jan 12 07:04:02 2011 From: pgmdevlist at gmail.com (Pierre GM) Date: Wed, 12 Jan 2011 13:04:02 +0100 Subject: [Numpy-discussion] mapping a function to a masked array In-Reply-To: <20110112114554.GA23259@air.net.au> References: <20110112114554.GA23259@air.net.au> Message-ID: <8528B7DB-F803-418F-93D1-DEDA86E8657B@gmail.com> On Jan 12, 2011, at 12:45 PM, Ben Elliston wrote: > I have a masked array of values that I would like to transform through > a user-defined function. Naturally, I want to ignore any values that > are masked in the initial array. > > The user-defined function examines other points around the value in > question, so I need to use ndenumerate (or similar) to get the array > index as I iterate over the array. Can your function accept arrays as input ? > > So, I have two questions: how to make this run without looping in > Python, and how to avoid masked values. Here is the clunky solution I > have so far: > > result = ma.copy (data) > for i, val in ndenumerate (data): > if not data.mask[i]: > result[i] = myfunc (data, i, val) `result` doesn't have to be a masked array, right ? result = np.empty_like(data) ndata = data.data for (i, (v, m)) in enumerate(zip(ndata, data.mask)): if not m: result[i] = myfunc(ndata, i, v) The main point is to avoid looping on the masked array itself. Instead, you loop on the `data` and `mask` attributes , that are regular ndarrays only. Should be far more efficient that way. Same thing for myfunc: don't call it on the masked array, just on the data part. About looping: well, if you can vectorize your function, you may avoid the loop. 
You may also wanna try a list comprehension: >>> result = [myfunc(ndata,i,v) for (i,(v,m)) in enumerate(zip(ndata,data.mask)) if not m] and retransform result to a ndarray afterwards. Or use fromiterator ? Let me know how it goes Cheers P. From dstaley at usgs.gov Wed Jan 12 10:31:31 2011 From: dstaley at usgs.gov (dstaley) Date: Wed, 12 Jan 2011 07:31:31 -0800 (PST) Subject: [Numpy-discussion] Variable in an array name? Message-ID: <30645276.post@talk.nabble.com> Is it possible to use a variable in an array name? I am looping through a bunch of calculations, and need to have each array as a separate entity. I'm pretty new to python and numpy, so forgive my ignorance. I'm sure there is a simple answer, but I can't seem to find it. let's say i have a variable 'i': i = 5 I would like my array to have the name array5 I know how I could do this manually, but not in a loop where i is redefined several times. any thoughts/comments/suggestions are appreciated. Thanks. -DS -- View this message in context: http://old.nabble.com/Variable-in-an-array-name--tp30645276p30645276.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From zachary.pincus at yale.edu Wed Jan 12 10:34:31 2011 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Wed, 12 Jan 2011 10:34:31 -0500 Subject: [Numpy-discussion] Variable in an array name? In-Reply-To: <30645276.post@talk.nabble.com> References: <30645276.post@talk.nabble.com> Message-ID: <5D4AFFDB-9E6C-414B-AE41-48EBDB827A3B@yale.edu> > Is it possible to use a variable in an array name? I am looping > through a > bunch of calculations, and need to have each array as a separate > entity. > I'm pretty new to python and numpy, so forgive my ignorance. I'm > sure there > is a simple answer, but I can't seem to find it. > > let's say i have a variable 'i': > > i = 5 > > I would like my array to have the name array5 > > I know how I could do this manually, but not in a loop where i is > redefined > several times. There are ways to do this, but what you likely actually want is just to put several arrays in a python list and then index into the list, instead of constructing numbered names. e.g.: array_list = [] for whatever: array_list.append(numpy.array(whatever)) for array in array_list: do_something(array) given_array = array_list[i] From dstaley at usgs.gov Wed Jan 12 10:40:51 2011 From: dstaley at usgs.gov (dstaley) Date: Wed, 12 Jan 2011 07:40:51 -0800 (PST) Subject: [Numpy-discussion] Variable in an array name? In-Reply-To: <5D4AFFDB-9E6C-414B-AE41-48EBDB827A3B@yale.edu> References: <30645276.post@talk.nabble.com> <5D4AFFDB-9E6C-414B-AE41-48EBDB827A3B@yale.edu> Message-ID: <30654306.post@talk.nabble.com> Zachary Pincus-2 wrote: > >> Is it possible to use a variable in an array name? I am looping >> through a >> bunch of calculations, and need to have each array as a separate >> entity. >> I'm pretty new to python and numpy, so forgive my ignorance. I'm >> sure there >> is a simple answer, but I can't seem to find it. >> >> let's say i have a variable 'i': >> >> i = 5 >> >> I would like my array to have the name array5 >> >> I know how I could do this manually, but not in a loop where i is >> redefined >> several times. > > There are ways to do this, but what you likely actually want is just > to put several arrays in a python list and then index into the list, > instead of constructing numbered names. 
> > e.g.: > > array_list = [] > > for whatever: > array_list.append(numpy.array(whatever)) > > for array in array_list: > do_something(array) > > given_array = array_list[i] > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > Thank you very much for the prompt response. I have already done what you have suggested, but there are a few cases where I do need to have an array named with a variable (looping through large numbers of unrelated files and calculations that need to be dumped into different analyses). It would be extraordinarily helpful if someone could post a solution to this problem, regardless of inefficiency of the method. Thanks a ton for any additional help. -- View this message in context: http://old.nabble.com/Variable-in-an-array-name--tp30645276p30654306.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From paul.anton.letnes at gmail.com Wed Jan 12 11:02:46 2011 From: paul.anton.letnes at gmail.com (Paul Anton Letnes) Date: Wed, 12 Jan 2011 17:02:46 +0100 Subject: [Numpy-discussion] Variable in an array name? In-Reply-To: <30654306.post@talk.nabble.com> References: <30645276.post@talk.nabble.com> <5D4AFFDB-9E6C-414B-AE41-48EBDB827A3B@yale.edu> <30654306.post@talk.nabble.com> Message-ID: <8EE54CE7-CD8A-4756-8E12-608022F8A1FC@gmail.com> On 12. jan. 2011, at 16.40, dstaley wrote: > > > Zachary Pincus-2 wrote: >> >>> Is it possible to use a variable in an array name? I am looping >>> through a >>> bunch of calculations, and need to have each array as a separate >>> entity. >>> I'm pretty new to python and numpy, so forgive my ignorance. I'm >>> sure there >>> is a simple answer, but I can't seem to find it. >>> >>> let's say i have a variable 'i': >>> >>> i = 5 >>> >>> I would like my array to have the name array5 >>> >>> I know how I could do this manually, but not in a loop where i is >>> redefined >>> several times. >> >> There are ways to do this, but what you likely actually want is just >> to put several arrays in a python list and then index into the list, >> instead of constructing numbered names. >> >> e.g.: >> >> array_list = [] >> >> for whatever: >> array_list.append(numpy.array(whatever)) >> >> for array in array_list: >> do_something(array) >> >> given_array = array_list[i] >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > > Thank you very much for the prompt response. I have already done what you > have suggested, but there are a few cases where I do need to have an array > named with a variable (looping through large numbers of unrelated files and > calculations that need to be dumped into different analyses). It would be > extraordinarily helpful if someone could post a solution to this problem, > regardless of inefficiency of the method. Thanks a ton for any additional > help. > -- This may be obvious, but I sometimes forget myself: have you tried python dicts? >>> from numpy import * >>> a = linspace(0,10) >>> b = a.copy() >>> d = {'array1':a, 'array2':b} >>> for key in d: ... dosomething(d[key]) That way, you can assign a name / key for each array variable, and use this name for file names, or whatever you need names for. Cheers Paul. 
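To spell out how that dict-based pattern looks inside the kind of loop described earlier in this thread (the file names and the loadtxt call here are only placeholders for whatever is actually being read and computed), a minimal sketch would be:

    import numpy as np

    arrays = {}
    for i, fname in enumerate(['run0.txt', 'run1.txt', 'run5.txt']):  # placeholder file list
        arrays['array%d' % i] = np.loadtxt(fname)   # the key plays the role of "array5"

    # later analyses look results up by name instead of by a generated variable
    a = arrays['array2']

The dict keys can just as well be the file names themselves, which keeps the association between each input file and its result explicit.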
From zachary.pincus at yale.edu Wed Jan 12 11:05:58 2011 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Wed, 12 Jan 2011 11:05:58 -0500 Subject: [Numpy-discussion] Variable in an array name? In-Reply-To: <30654306.post@talk.nabble.com> References: <30645276.post@talk.nabble.com> <5D4AFFDB-9E6C-414B-AE41-48EBDB827A3B@yale.edu> <30654306.post@talk.nabble.com> Message-ID: > Thank you very much for the prompt response. I have already done > what you > have suggested, but there are a few cases where I do need to have an > array > named with a variable (looping through large numbers of unrelated > files and > calculations that need to be dumped into different analyses). It > would be > extraordinarily helpful if someone could post a solution to this > problem, > regardless of inefficiency of the method. Thanks a ton for any > additional > help. You could store arrays associated with string names, or other identifiers, (as opposed to integer indices) in a python dict. Global and local namespaces are also just dicts that you can grab with globals() and locals(), if you really want to look up variable names algorithmically, but I promise you that this is really not what you want to be doing. Zach From bsouthey at gmail.com Wed Jan 12 11:20:39 2011 From: bsouthey at gmail.com (Bruce Southey) Date: Wed, 12 Jan 2011 10:20:39 -0600 Subject: [Numpy-discussion] Output dtype In-Reply-To: References: <4D067FF1.9090001@gmail.com> Message-ID: <4D2DD4D7.8020509@gmail.com> On 12/13/2010 04:53 PM, Keith Goodman wrote: > On Mon, Dec 13, 2010 at 12:20 PM, Bruce Southey wrote: > >> Unless something has changed since the docstring was written, this is >> probably an inherited 'bug' from np.mean() as the author expected that >> the docstring of mean was correct. For my 'old' 2.0 dev version: >> >> >>> np.mean( np.array([[0,1,2,3,4,5]], dtype='float32'), axis=1).dtype >> dtype('float32') >> >>> np.mean( np.array([[0,1,2,3,4,5]], dtype='float32')).dtype >> dtype('float64') > Are you saying the bug is in the doc string, the output, or both? I > think it is both; I expect the second result above to be float32. > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion Sorry as I filed a bug for this as 1710 http://projects.scipy.org/numpy/ticket/1710 but this is the same as ticket 518 that is listed as won't fix: http://projects.scipy.org/numpy/ticket/518 My expectation is that the internal and output dtypes should not depend on the axis argument. Related to this, I also think that internal dtypes should be the same as the output dtype (see ticket 465 regarding the internal precision http://projects.scipy.org/numpy/ticket/465). If the consensus is still won't fix then I or someone needs to edit the documentation to clearly reflect these situations. Bruce From josef.pktd at gmail.com Wed Jan 12 11:22:49 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 12 Jan 2011 11:22:49 -0500 Subject: [Numpy-discussion] Variable in an array name? In-Reply-To: References: <30645276.post@talk.nabble.com> <5D4AFFDB-9E6C-414B-AE41-48EBDB827A3B@yale.edu> <30654306.post@talk.nabble.com> Message-ID: On Wed, Jan 12, 2011 at 11:05 AM, Zachary Pincus wrote: >> Thank you very much for the prompt response. 
?I have already done >> what you >> have suggested, but there are a few cases where I do need to have an >> array >> named with a variable (looping through large numbers of unrelated >> files and >> calculations that need to be dumped into different analyses). ?It >> would be >> extraordinarily helpful if someone could post a solution to this >> problem, >> regardless of inefficiency of the method. ?Thanks a ton for any >> additional >> help. > > You could store arrays associated with string names, or other > identifiers, (as opposed to integer indices) in a python dict. > > Global and local namespaces are also just dicts that you can grab with > globals() and locals(), if you really want to look up variable names > algorithmically, but I promise you that this is really not what you > want to be doing. or (pretending to translate matlab) >>> a = 5 >>> for i in range(5): exec('var_%02d = np.array([%d])'%(i, a+i)) >>> [i for i in globals() if i[:3] == 'var'] ['var_00', 'var_01', 'var_02', 'var_03', 'var_04'] >>> var_00 array([5]) >>> var_01 array([6]) not very pythonic (?) Josef > > Zach > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From matthewturk at gmail.com Wed Jan 12 11:28:47 2011 From: matthewturk at gmail.com (Matthew Turk) Date: Wed, 12 Jan 2011 11:28:47 -0500 Subject: [Numpy-discussion] Autosummary using numpydoc Message-ID: Hi there, I've been trying to take the numpy docstring and apply the same methodology to a different project I work on, but there are a couple details that I think I'm unclear on, and I was hoping for some pointers or at least reassurances that it's working as intended, despite Sphinx's protests. The process of applying the numpy docstring method seems to take a few steps: 0) Have sphinx >=1.0 installed. 1) Write a numpydoc-compliant docstring in a class and its methods. 2) Make available to sphinx the numpydoc extension (in this case, I am using the current numpy git tip, which self-reports as 0.4) 3) Add the sphinx.ext.autosummary and numpydoc extensions to conf.py in the appropriate sphinx project 4) Copy the autosummary/class.rst template to the appropriate _template directory in the sphinx project 5) Set up the autosummary_generate variable in conf.py to contain all the files containing autosummary directives. 6) Build docs As a few other quick notes, my reading of the overridden class.rst template is that it comments out the individual method inclusions, using the HACK (comment) directive. The build mostly proceeds correctly when these steps have been taken, but a copious number of warnings are emitted. (The same character of warnings are emitted by the numpy documentation build.) These mostly show up, for instance, as: WARNING: toctree contains reference to nonexisting document 'reference/api/generated/add_phase_object' In this case, add_phase_object is a method hanging off an autosummary'ed class. It looks like a warning gets emitted for every method on every autosummary'ed class -- this can number into the thousands very easily. Additionally, warnings that look like: reference/api/generated/yt.visualization.api.PlotCollection.add_phase_object.rst:: WARNING: document isn't included in any toctree show up, where these reflect the full method name and class name. 
It looks like the toctree is gaining references to the method names without prefixing them with the class names, but the generated docs all have the full name resolution in their filenames. (There also seems to be a glitch in the output when using the ".. HACK" class.rst template, as within my code it appears to strip the one-line descriptions from the methods.) Is this behaving as expected, or have I perhaps gone through the steps wrong? Having so many warnings can confuse debugging other portions of the build. I appreciate any suggestions you might have -- thanks very much! Best, Matt From kwgoodman at gmail.com Wed Jan 12 12:28:05 2011 From: kwgoodman at gmail.com (Keith Goodman) Date: Wed, 12 Jan 2011 09:28:05 -0800 Subject: [Numpy-discussion] Output dtype In-Reply-To: <4D2DD4D7.8020509@gmail.com> References: <4D067FF1.9090001@gmail.com> <4D2DD4D7.8020509@gmail.com> Message-ID: On Wed, Jan 12, 2011 at 8:20 AM, Bruce Southey wrote: > On 12/13/2010 04:53 PM, Keith Goodman wrote: >> On Mon, Dec 13, 2010 at 12:20 PM, Bruce Southey ?wrote: >> >>> Unless something has changed since the docstring was written, this is >>> probably an inherited 'bug' from np.mean() as the author expected that >>> the docstring of mean was correct. For my 'old' 2.0 dev version: >>> >>> ? >>> ?np.mean( np.array([[0,1,2,3,4,5]], dtype='float32'), axis=1).dtype >>> dtype('float32') >>> ? >>> ?np.mean( np.array([[0,1,2,3,4,5]], dtype='float32')).dtype >>> dtype('float64') >> Are you saying the bug is in the doc string, the output, or both? I >> think it is both; I expect the second result above to be float32. >> >> > Sorry as I filed a bug for this as 1710 > http://projects.scipy.org/numpy/ticket/1710 > but this is the same as ticket 518 that is listed as won't fix: > http://projects.scipy.org/numpy/ticket/518 I fixed ticket 518 in bottleneck: >> a = np.array([1,2,3], dtype='float32') >> bn.median(a).dtype dtype('float32') >> np.median(a).dtype dtype('float64') Not sure I would have done that if I knew that numpy has a won't fix on it. From gregory.guyomarch at gmail.com Wed Jan 12 12:34:32 2011 From: gregory.guyomarch at gmail.com (=?utf-8?b?R3LDqWdvcnk=?= Guyomarc'h) Date: Wed, 12 Jan 2011 17:34:32 +0000 (UTC) Subject: [Numpy-discussion] Non-deterministic floating point behavior in numpy 1.5.1 ? Message-ID: Hello, I have noticed strange non-deterministic behaviours with numpy 1.5.1 when using floating point arrays. The following script were run on 6 different machines, all Intel Core i7: - 3 of them running numpy 1.5.1 with either Python 2.7.1 (x86) or 2.5.2 (x86) and, - 3 of them running numpy 1.3 and Python 2.5.2.(x86). import numpy x = numpy.array([[0.00010876945607980702], [0.22568137594619658], [5.6435218858623557]]) for i in range(10): m = numpy.array([[36.0 * 36.0, 36.0, 1.0] for j in range(6)]) y = (numpy.dot(m, x) - 13.90901663) * 1000.0 print y[0] The output on each machine running 1.5.1 are similar to this one: [ 5.00486230e-06] [ 5.00486408e-06] [ 5.00486230e-06] [ 5.00486408e-06] [ 5.00486230e-06] [ 5.00486408e-06] [ 5.00486230e-06] [ 5.00486408e-06] [ 5.00486230e-06] [ 5.00486408e-06] I cannot make sense of the changes of the least significant digits across different iterations of the for loop since its body is actually constant. Note that this behavior is hard to reproduce: on some machines I had to insert dummy print statements here and there to reproduce the bug or increase the number of iterations or the length of the arrays inside the loop. 
Also, I could not reproduce it with older versions of numpy such as 1.3.
Is this behavior expected? Is there a way to make sure the results of
numpy floating point computations remain the same for multiple runs?

Thanks,
Gregory.

From pav at iki.fi  Wed Jan 12 13:05:59 2011
From: pav at iki.fi (Pauli Virtanen)
Date: Wed, 12 Jan 2011 18:05:59 +0000 (UTC)
Subject: [Numpy-discussion] Non-deterministic floating point behavior in numpy 1.5.1 ?
References: 
Message-ID: 

Wed, 12 Jan 2011 17:34:32 +0000, Grégory Guyomarc'h wrote:
[clip]
> y = (numpy.dot(m, x) - 13.90901663) * 1000.0
> print y[0]
[clip]
> Also, I could not reproduce it with older versions of numpy such as 1.3.
> Is this behavior expected? Is there a way to make sure the results of
> numpy floating point computations remain the same for multiple runs?

There are essentially no changes in the dot() routine since 1.3.0 in
Numpy. The non-determinism is probably in the BLAS linear algebra
library you have linked Numpy with.

What platform are you using? (Windows? Linux? Where did you obtain
Numpy binaries?) What do you get if you replace `numpy.dot` with
`numpy.core.multiarray.dot` (which does not use BLAS)?

There's another thread on a similar issue here:
http://permalink.gmane.org/gmane.comp.python.scientific.user/27444

-- 
Pauli Virtanen

From davecortesi at gmail.com  Wed Jan 12 15:57:58 2011
From: davecortesi at gmail.com (David Cortesi)
Date: Wed, 12 Jan 2011 12:57:58 -0800
Subject: [Numpy-discussion] Numpy 1.5.1 - Mac - with Activestate Python 3
Message-ID: 

I have installed ActiveState's Python 3 packages on Mac OS X 10.6.6.
There exists:
/Library/Frameworks/Python.framework/Versions/Current/Python*

When I run the Mac OS installer it shows all disks as ineligible and
the error message, "numpy 1.5.1 can't be installed on this disk. numpy
requires System Python 2.6 to install."

What can I do to persuade numpy to install? Must I build it from source
to get it to use Python 3?

Sorry for the Noob question,

Dave Cortesi
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From Chris.Barker at noaa.gov  Wed Jan 12 21:32:00 2011
From: Chris.Barker at noaa.gov (Chris Barker)
Date: Wed, 12 Jan 2011 18:32:00 -0800
Subject: [Numpy-discussion] Numpy 1.5.1 - Mac - with Activestate Python 3
In-Reply-To: 
References: 
Message-ID: <4D2E6420.4000700@noaa.gov>

On 1/12/2011 12:57 PM, David Cortesi wrote:
> I have installed ActiveState's Python 3 packages on Mac OS X 10.6.6.

> When I run the Mac OS installer it shows all disks as ineligible and
> the error message, "numpy 1.5.1 can't be installed on this disk. numpy
> requires System Python 2.6 to install."

Sorry, that is a bad error message. What I'm pretty sure it means to
say is:

"numpy requires the Python 2.6 binary from python.org "

I looked at this error message ages ago, and it's less trivial to fix
than you'd think -- but I thought it had been fixed.

> What can I do to persuade numpy to install? Must I build it from source
> to get it to use Python 3?

You *may* need the python.org binary, rather than ActiveState, but it
looks like you're trying to install a numpy binary for 2.6 -- that's not
going to work on 3.* -- look for a binary for 3.* -- I'm not sure it
exists, though.

NOTE: if you're still confused, tell us exactly what file you are trying
to install from, and where you downloaded it from.

-Chris

-- 
Christopher Barker, Ph.D.
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From seb.haase at gmail.com Thu Jan 13 04:06:16 2011 From: seb.haase at gmail.com (Sebastian Haase) Date: Thu, 13 Jan 2011 10:06:16 +0100 Subject: [Numpy-discussion] Numpy 1.5.1 - Mac - with Activestate Python 3 In-Reply-To: <4D2E6420.4000700@noaa.gov> References: <4D2E6420.4000700@noaa.gov> Message-ID: On Thu, Jan 13, 2011 at 3:32 AM, Chris Barker wrote: > On 1/12/2011 12:57 PM, David Cortesi wrote: >> I have installed ActiveState's Python 3 packages on Mac OS X 10.6.6. > >> When I run the Mac OS installer it shows all disks as ineligible and >> the error message, "numpy 1.5.1 can't be installed on this disk. numpy >> requires System Python 2.6 to install." > > Sorry, that is is a bad error message. What I'm pretty sure it means to > say is: > > "numpy requires the Python 2.6 binary from python.org " > > I looked at this error message ages ago, and It's less trivial to fix > that you'd think -- but I thought it had been fixed. > >> What can I do to persuade numpy to install? Must I build it from source >> to get it to use Python 3? > > You *may* need the python.org binary, rather than ActiveState, but it > looks like you're trying to install a numpy binary for 2.6 -- that's not > going to work on 3.* -- look for a binary for 3.* -- I'm not sure it > exists, though. > > NOTE: if you're still confused, tell us exactly what file you are trying > to install from, and where you downloaded it from. > > -Chris > Hi David, the simple answer you might be looking for is: it's easier to stay with Python 2.x for a while... Can you deinstall the ActiveState 3 version ? Cheers, - Sebastian Haase From ralf.gommers at googlemail.com Thu Jan 13 05:09:41 2011 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Thu, 13 Jan 2011 18:09:41 +0800 Subject: [Numpy-discussion] Numpy 1.5.1 - Mac - with Activestate Python 3 In-Reply-To: References: <4D2E6420.4000700@noaa.gov> Message-ID: On Thu, Jan 13, 2011 at 5:06 PM, Sebastian Haase wrote: > On Thu, Jan 13, 2011 at 3:32 AM, Chris Barker > wrote: > > On 1/12/2011 12:57 PM, David Cortesi wrote: > >> I have installed ActiveState's Python 3 packages on Mac OS X 10.6.6. > > > >> When I run the Mac OS installer it shows all disks as ineligible and > >> the error message, "numpy 1.5.1 can't be installed on this disk. numpy > >> requires System Python 2.6 to install." > > > > Sorry, that is is a bad error message. What I'm pretty sure it means to > > say is: > > > > "numpy requires the Python 2.6 binary from python.org " > > > > I looked at this error message ages ago, and It's less trivial to fix > > that you'd think -- but I thought it had been fixed. > That message comes from bdist_mpkg. I fixed it on my machine, and a fix was also committed to the svn repo. However, I think there was no new bdist_mpkg release on pypi (I did get a "why don't you just use eggs instead?") and the 1.5.1 binaries were not made on my machine. So the problem returned. > > > >> What can I do to persuade numpy to install? Must I build it from source > >> to get it to use Python 3? > > Yes, there is no binary for Python 3 at the moment. But unless you have a specific need/desire to use 3.1/3.2 I'd suggest staying with 2.6 or 2.7 from python.org for now. With Activestate you have to compile yourself or use their (paid?) pypm repo. 
Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From Thomas.EMMEL at 3ds.com Thu Jan 13 11:04:33 2011 From: Thomas.EMMEL at 3ds.com (EMMEL Thomas) Date: Thu, 13 Jan 2011 16:04:33 +0000 Subject: [Numpy-discussion] Any idea to run the dot-product on many arrays Message-ID: <3A0080EEBFB19C4993C24098DD0A78D1226CC110@EU-DCC-MBX01.dsone.3ds.com> Hi, I need to rotate many vectors (x,y,z) with a given rotation matrix (3x3). I can always do for v in vectors: tv += np.dot(mat, v) where mat is my fixed matrix (or array of arrays) and v is a single array. Is there any efficient way to use an array of vectors to do the transfomation for all of these vectors at once? Kind regards Thomas This email and any attachments are intended solely for the use of the individual or entity to whom it is addressed and may be confidential and/or privileged. If you are not one of the named recipients or have received this email in error, (i) you should not read, disclose, or copy it, (ii) please notify sender of your receipt by reply email and delete this email and all attachments, (iii) Dassault Systemes does not accept or assume any liability or responsibility for any use of or reliance on this email.For other languages, go to http://www.3ds.com/terms/email-disclaimer. -------------- next part -------------- An HTML attachment was scrubbed... URL: From pascal22p at parois.net Thu Jan 13 11:13:24 2011 From: pascal22p at parois.net (Pascal) Date: Thu, 13 Jan 2011 17:13:24 +0100 Subject: [Numpy-discussion] Any idea to run the dot-product on many arrays In-Reply-To: <3A0080EEBFB19C4993C24098DD0A78D1226CC110@EU-DCC-MBX01.dsone.3ds.com> References: <3A0080EEBFB19C4993C24098DD0A78D1226CC110@EU-DCC-MBX01.dsone.3ds.com> Message-ID: <4D2F24A4.6050807@parois.net> On 01/13/2011 05:04 PM, EMMEL Thomas wrote: > Hi, > > I need to rotate many vectors (x,y,z) with a given rotation matrix (3x3). > I can always do > > for v in vectors: > tv += np.dot(mat, v) > > where mat is my fixed matrix (or array of arrays) and v is a single array. > Is there any efficient way to use an array of vectors to do the > transfomation > for all of these vectors at once? numpy.dot(rotationmatrix , coordinates.T).T Where coordinates is a n*3 matrix of n stacked vectors in rows. It works with vectors stacked in column without the the two transpose. It's even possible to apply a symmetry operation to a bunch of second rank tensors in one go. Pascal From davecortesi at gmail.com Thu Jan 13 13:49:17 2011 From: davecortesi at gmail.com (David Cortesi) Date: Thu, 13 Jan 2011 10:49:17 -0800 Subject: [Numpy-discussion] Is python 3 supported or not? Message-ID: I asked about getting numpy to install on OS X with Activestate Python 3. I got thoughtful & responsive replies from three of you, many thanks to all! I am sad that the consistent message was, "forget it." Chris said, "...look for a binary for 3.* -- I'm not sure it exists, though." Sebastian said, "the simple answer you might be looking for is: it's easier to stay with Python 2.x..." Ralf said, "?unless you have a specific need/desire to use 3.1/3.2 I'd suggest staying with 2.6 or 2.7 from python.org for now..." I would like to point out that the wikipedia article on numpy says, "The release version 1.5 of NumPy is compatible with Python versions 2.4?2.7 and Python 3," citing the release note of september 2010, which itself opens with the following lines: > Highlights > > Python 3 compatibility > > This is the first NumPy release which is compatible with Python 3. 
There is an obvious disconnect here. Is it or isn't it? This is an important question because of the large number of packages at PyPI that depend on numpy. Numpy is a major gateway, or bottleneck, on the way to Python 3. I came looking for numpy because I want to work with an audio package, and all the audio packages at PyPI seem to have numpy dependencies. Ditto the packages for dealing with FITS data format, etc. etc. As to using Activestate's versus python.org's distro, *regardless* of which I use, the package will end up located in /Library/Frameworks/Python.framework/*. It will not be installed in /System/Library/etc. as the Apple distribution is; but it WILL be located at a known location with an executable named Python under Versions/Current. Not that it matters, but the reason I'm using Activestate is because I also needed their up to date version of Tcl/Tk, and python.org python3 wouldn't work with that. As to why I'm using Python 3, it's because I'm starting a new project with no prior dependencies and want the current and future language -- which is now TWO FRAKKIN' YEARS OLD! -- but that's a rant for another time. Thanks again for your attention, Dave Cortesi From numpy-discussion at maubp.freeserve.co.uk Thu Jan 13 14:24:41 2011 From: numpy-discussion at maubp.freeserve.co.uk (Peter) Date: Thu, 13 Jan 2011 19:24:41 +0000 Subject: [Numpy-discussion] Is python 3 supported or not? In-Reply-To: References: Message-ID: On Thu, Jan 13, 2011 at 6:49 PM, David Cortesi wrote: > > I asked about getting numpy to install on OS X with Activestate Python > 3. I got thoughtful & responsive replies from three of you, many > thanks to all! I am sad that the consistent message was, "forget it." I thought the message was since there isn't the easy option of a binary installer provided for Python 3 (yet), you should just install NumPy from source if you really want to use Python 3. That works for me fine on Mac OS X 10.6 (using both Python 3.1 and the current beta of Python 3.2, both themselves compiled from source). Peter P.S. You forgot to reference the thread, for those that missed it see: http://mail.scipy.org/pipermail/numpy-discussion/2011-January/054486.html From pav at iki.fi Thu Jan 13 14:25:47 2011 From: pav at iki.fi (Pauli Virtanen) Date: Thu, 13 Jan 2011 19:25:47 +0000 (UTC) Subject: [Numpy-discussion] Is python 3 supported or not? References: Message-ID: On Thu, 13 Jan 2011 10:49:17 -0800, David Cortesi wrote: [clip] >> Highlights >> >> Python 3 compatibility >> >> This is the first NumPy release which is compatible with Python 3. > > There is an obvious disconnect here. Is it or isn't it? There is no disconnect. The fact just is that nobody has yet built easily redistributable binary packages for OSX. If you really want to run Python 3, just build it yourself from the sources. -- Pauli Virtanen From pav at iki.fi Thu Jan 13 14:27:12 2011 From: pav at iki.fi (Pauli Virtanen) Date: Thu, 13 Jan 2011 19:27:12 +0000 (UTC) Subject: [Numpy-discussion] Is python 3 supported or not? References: Message-ID: On Thu, 13 Jan 2011 19:25:47 +0000, Pauli Virtanen wrote: [clip] > If you really want to run Python 3, just build it yourself from the > sources. Of course, this should have been: "..., just build Numpy from the sources." From Chris.Barker at noaa.gov Thu Jan 13 17:20:17 2011 From: Chris.Barker at noaa.gov (Chris Barker) Date: Thu, 13 Jan 2011 14:20:17 -0800 Subject: [Numpy-discussion] Is python 3 supported or not? 
In-Reply-To: References: Message-ID: <4D2F7AA1.4010304@noaa.gov> On 1/13/2011 10:49 AM, David Cortesi wrote: > I would like to point out that the wikipedia article on numpy says, > "The release version 1.5 of NumPy is compatible with Python versions > 2.4?2.7 and Python 3," citing the release note of september 2010, > which itself opens with the following lines: > There is an obvious disconnect here. Is it or isn't it? Support is not an absolute thing -- Python 2 is certainly better supported at this point in many ways, but yes, numpy works with Python3 > the large number of packages at PyPI > that depend on numpy. Numpy is a major gateway, or bottleneck, on the > way to Python 3. yes, but I doubt that many (any) of the packages that require numpy don't work on 2.0. Indeed, many of them probably are not yet ported to 3. Personally, I can't move 'till PIL and wxPython are ported, and maybe Pylons (Pyramid), too. > I came looking for numpy because I want to work with > an audio package, and all the audio packages at PyPI seem to have > numpy dependencies. Ditto the packages for dealing with FITS data > format, etc. etc. I'd make darn sure EVERYTHING you think you'll need is py3 compatible. > As to using Activestate's versus python.org's distro, *regardless* of > which I use, the package will end up located in > /Library/Frameworks/Python.framework/*. It will not be installed in > /System/Library/etc. as the Apple distribution is; but it WILL be > located at a known location with an executable named Python under > Versions/Current. That's just a link -- I Hope the actuall package is not in exactly the same place as the python,org binary gets installed -- but maybe it is -- in the past, they have generally been pretty compatible. There are way, way too many ways to get Python on the Mac -- varierty is good, but it is very confusing for newbies, and difficult for anyone that wants to distribute binaries. In general, the community tried to build binaries for the python.org builds, so there are advantages there. > As to why I'm using Python 3, it's because I'm starting a new project > with no prior dependencies and want the current and future language -- > which is now TWO FRAKKIN' YEARS OLD! -- but that's a rant for another > time. umm isn't that amazing, py3 has only been around for two years, and numpy and many other packages already support it! Fabulous! How long has python been around? how long numpy (and numeric before it?) How much work have you done to port things to Py3? > you should just install > NumPy from source if you really want to use Python 3. That works > for me fine on Mac OS X 10.6 (using both Python 3.1 and the > current beta of Python 3.2, both themselves compiled from source). yup -- as it happens, Apple delivers LAPACK, and has a standard, and freely available compiler -- building numpy on OS-X is a piece of cake. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From gael.varoquaux at normalesup.org Fri Jan 14 03:26:11 2011 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Fri, 14 Jan 2011 09:26:11 +0100 Subject: [Numpy-discussion] Is python 3 supported or not? 
In-Reply-To: References: Message-ID: <20110114082611.GA19526@phare.normalesup.org> On Thu, Jan 13, 2011 at 10:49:17AM -0800, David Cortesi wrote: > As to why I'm using Python 3, it's because I'm starting a new project > with no prior dependencies and want the current and future language -- > which is now TWO FRAKKIN' YEARS OLD! -- but that's a rant for another > time. Oh, you're saying that you'd like to help with building and distributing Python 3 binaries of numpy? G :$ From seb.haase at gmail.com Fri Jan 14 03:47:58 2011 From: seb.haase at gmail.com (Sebastian Haase) Date: Fri, 14 Jan 2011 09:47:58 +0100 Subject: [Numpy-discussion] Is python 3 supported or not? In-Reply-To: <20110114082611.GA19526@phare.normalesup.org> References: <20110114082611.GA19526@phare.normalesup.org> Message-ID: On Fri, Jan 14, 2011 at 9:26 AM, Gael Varoquaux wrote: > On Thu, Jan 13, 2011 at 10:49:17AM -0800, David Cortesi wrote: >> As to why I'm using Python 3, it's because I'm starting a new project >> with no prior dependencies and want the current and future language -- >> which is now TWO FRAKKIN' YEARS OLD! -- but that's a rant for another >> time. > > Oh, you're saying that you'd like to help with building and distributing > Python 3 binaries of numpy? > > G :$ David, One of the greatest things about Python - I found - is that it doesn't change every year. The fact that 3.0 came out 2 years ago does not change the fact that everyone says they are still committed to support Python 2 for 10 more years to come. (I hope this is the right number, but it is certainly is > 5 yrs) Python 3 is somewhat of a "bigger change" and the various sub-project communities where reluctant to switch right away. Don't confuse the degree of change with "Perl 6" - for what I have heard, that "change" is rather a new language, .... while in Python - as example - 1/2 while now be .5 and you would have to write 1//2 to get the old results of 0 . My answer, I gave you few days ago, was kept as general as possible - since you didn't say at the time what your actual needs/plans were. The fact that Numpy is now ready for Python 3 does nowhere imply that everything you might likely want to use with it (SciPy) is also as stable and well tested with Python 3 as Numpy is. Finally - let me teach you some python: (take it with a grain of salt ;-) ) if you write in Python 2(!!) from __future__ import division from __future__ import print_function from __future__ importabsolute_import at the beginning of each module you(!) write you can essentially already use most (many) features of Python 3 in Python 2. This way you can use all packages as they are available for Python 2 and already write your new modules "the Python 3 way". [see also e.g. http://stackoverflow.com/questions/388069/python-graceful-future-feature-future-import ] I'm sorry to tell you that this is not the list for flame wars, but rather the list of the bunch of most helpful people I found. Cheers, Sebastian From joonpyro at gmail.com Fri Jan 14 15:03:16 2011 From: joonpyro at gmail.com (Joon Ro) Date: Fri, 14 Jan 2011 14:03:16 -0600 Subject: [Numpy-discussion] NaN value processing in weave.inline code Message-ID: Hi, I was wondering if it is possible to process (in if statement - check if the given value is NaN) numpy NaN value inside the weave.inline c code. 
testcode = ''' if (test(0)) { return_val = test(0); } ''' err = weave.inline(testcode, ['test'], type_converters = converters.blitz, force = 0, verbose = 1) with test(0) = nan returns err = nan correctly, but I don't know how to check the nan value inside the c inline c code. Is there any way I can get similar functionality as isnan? Thank you, Joon -- -------------- next part -------------- An HTML attachment was scrubbed... URL: From kwgoodman at gmail.com Fri Jan 14 15:06:42 2011 From: kwgoodman at gmail.com (Keith Goodman) Date: Fri, 14 Jan 2011 12:06:42 -0800 Subject: [Numpy-discussion] NaN value processing in weave.inline code In-Reply-To: References: Message-ID: On Fri, Jan 14, 2011 at 12:03 PM, Joon Ro wrote: > Hi, > I was wondering if it is possible to process (in if statement - check if the > given value is NaN) numpy NaN value inside the weave.inline c code. > > testcode = ''' > if (test(0)) { > ? ? ? return_val = test(0); > } > ''' > > err = weave.inline(testcode, > ?['test'], > type_converters = converters.blitz, force = 0, verbose = 1) > > > with test(0) = nan returns err = nan correctly, but I don't know how to > check the nan value inside the c inline c code. Is there any way I can get > similar functionality as isnan? To check if a scalar, x, is NaN: if x == x: # No, it is not a NaN else: # Yes, it is a NaN From joonpyro at gmail.com Fri Jan 14 15:13:43 2011 From: joonpyro at gmail.com (Joon Ro) Date: Fri, 14 Jan 2011 14:13:43 -0600 Subject: [Numpy-discussion] NaN value processing in weave.inline code In-Reply-To: References: Message-ID: Oops .. I guess isnan() inside the weave code just works fine. Should have tried this first. By the way, is there any speed lost doing this? Should I convert all NaN values into a integer and use it inside the weave inline c code? -Joon On Fri, 14 Jan 2011 14:03:16 -0600, Joon Ro wrote: > Hi, > > I was wondering if it is possible to process (in if statement - check if > the given value is NaN) numpy NaN value inside the weave.inline c code. > > > testcode = ''' > if (test(0)) { > return_val = test(0); > } > ''' > > err = weave.inline(testcode, > ['test'], > type_converters = converters.blitz, force = 0, verbose = 1) > > > with test(0) = nan returns err = nan correctly, but I don't know how to > check the nan value inside the c inline c code. Is there any way I can > get similar functionality as isnan? > > Thank you, > Joon > -- > -- -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Fri Jan 14 15:33:03 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 14 Jan 2011 15:33:03 -0500 Subject: [Numpy-discussion] isposinf returns array, isinf doesn't Message-ID: maybe just cosmetic, I just found this >>> stats.poisson.b 1.#INF >>> np.isinf(stats.poisson.b) True >>> np.isinf(-stats.poisson.b) True >>> np.isposinf(stats.poisson.b) array(True, dtype=bool) >>> np.isneginf(stats.poisson.b) array(False, dtype=bool) >>> np.isneginf(-stats.poisson.b) array(True, dtype=bool) but shape is the same >>> np.isneginf(stats.poisson.b).shape () >>> np.isinf(stats.poisson.b).shape () >>> type(np.isneginf(stats.poisson.b)) >>> type(np.isinf(stats.poisson.b)) Josef From dstaley at usgs.gov Fri Jan 14 17:52:45 2011 From: dstaley at usgs.gov (dstaley) Date: Fri, 14 Jan 2011 14:52:45 -0800 (PST) Subject: [Numpy-discussion] NOOB Alert: Looping Through Text Files... Message-ID: <30676099.post@talk.nabble.com> Warning, I am a python noob. 
Not only do I not know python, I really don't know anything about programming outside of ArcInfo and the ancient AML language. Regardless, here is my problem.... Let's say I have three text files (test1.txt, test2.txt and test3.txt). Each text file has 1 line of text in it "This is my text file", to which I want to add (append) a new line of text saying "I suck at Python and need help", and then save the file with a suffix attached (eg test1_modified.txt, test2_modified.txt, test3_modified.txt). I guess this is the equivalent of making a change in MS Word and using the "Save As..." command to maintain the integrity of the original file, and save the changes in a new file. But, I want to also do that in a loop (this is a simplified example of something I want to do with hundreds of text files). Now, I understand how to add this line to an existing text file: text_file = open("test1.txt", "a") text_file.write("\nI suck at Python and need help") text_file.close() While this adds a line of text, it saves the change to the original file (does not add the _modified.txt suffix to the file name), nor does it allow me to loop through all three of the files. I'm sure this is an easy thing to do, and is online in a million places. Unfortunately, I just cannot seem to find the answer. Here is my thought process: First i would define the list from which I would loop: textlist = ["test1.txt", "test2.txt", "test3.txt"] for i in textlist: text_file = open(textlist, "a") text_file.write("\nI suck at Python and need help") text_file.close() But, this doesn't work. It gives me the error: coercing to Unicode: need string or buffer, list found SO, I guess I need to do this from something other than a list? Even if it did work, it does not suit my needs as it does not create a new file and does not allow me to add the _modified.txt suffix, which will allow me to keep the original file intact. >From a responses to a previous post, this seems as if it may have something to do with a python dictionary, but I'm not sure. I'm probably totally off on how this should even be written, so any advice or suggestions would be greatly appreciated. Thanks in advance for your help! -DS -- View this message in context: http://old.nabble.com/NOOB-Alert%3A-Looping-Through-Text-Files...-tp30676099p30676099.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From dstaley at usgs.gov Fri Jan 14 17:57:10 2011 From: dstaley at usgs.gov (dstaley) Date: Fri, 14 Jan 2011 14:57:10 -0800 (PST) Subject: [Numpy-discussion] NOOB Alert: Looping Through Text Files... Message-ID: <30676099.post@talk.nabble.com> Warning, I am a python noob. Not only do I not know python, I really don't know anything about programming outside of ArcInfo and the ancient AML language. Regardless, here is my problem.... Let's say I have three text files (test1.txt, test2.txt and test3.txt). Each text file has 1 line of text in it "This is my text file", to which I want to add (append) a new line of text saying "I suck at Python and need help", and then save the file with a suffix attached (eg test1_modified.txt, test2_modified.txt, test3_modified.txt). I guess this is the equivalent of making a change in MS Word and using the "Save As..." command to maintain the integrity of the original file, and save the changes in a new file. But, I want to also do that in a loop (this is a simplified example of something I want to do with hundreds of text files). 
Now, I understand how to add this line to an existing text file: text_file = open("test1.txt", "a") text_file.write("\nI suck at Python and need help") text_file.close() While this adds a line of text, it saves the change to the original file (does not add the _modified.txt suffix to the file name), nor does it allow me to loop through all three of the files. I'm sure this is an easy thing to do, and is online in a million places. Unfortunately, I just cannot seem to find the answer. Here is my thought process: First i would define the list from which I would loop: textlist = ["test1.txt", "test2.txt", "test3.txt"] for i in textlist: text_file = open(textlist, "a") text_file.write("\nI suck at Python and need help") text_file.close() But, this doesn't work. It gives me the error: coercing to Unicode: need string or buffer, list found SO, I guess I need to do this from something other than a list? Even if it did work, it does not suit my needs as it does not create a new file and does not allow me to add the _modified.txt suffix, which will allow me to keep the original file intact. >From a responses to a previous post, this seems as if it may have something to do with a python dictionary, but I'm not sure. I'm probably totally off on how this should even be written, so any advice or suggestions would be greatly appreciated. Thanks in advance for your help! -DS EDIT: I posted this to the NUMPY list because my writing the new line of text is really a numpy function. I apologize if this is an improper forum for this posting. -- View this message in context: http://old.nabble.com/NOOB-Alert%3A-Looping-Through-Text-Files...-tp30676099p30676099.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From zachary.pincus at yale.edu Fri Jan 14 19:09:38 2011 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Fri, 14 Jan 2011 19:09:38 -0500 Subject: [Numpy-discussion] NOOB Alert: Looping Through Text Files... In-Reply-To: <30676099.post@talk.nabble.com> References: <30676099.post@talk.nabble.com> Message-ID: > textlist = ["test1.txt", "test2.txt", "test3.txt"] > > for i in textlist: > text_file = open(textlist, "a") > text_file.write("\nI suck at Python and need help") > text_file.close() > > But, this doesn't work. It gives me the error: > > coercing to Unicode: need string or buffer, list found Yeah, it's probably the wrong list; still, easy enough to answer... You want this: text_file = open(i, "a") to open the individual file, as opposed to: text_file = open(textlist, "a") which tries to open the whole list of filenames. Python cannot figure out how to turn the list into a single (unicode) filename, hence the error. As for your wanting to write files with new names: for txtfile in txtlist: f = open(txtfile, 'r') data = f.read() f.close() fnew = open(get_new_name(txtfile), 'w') fnew.write(data) fnew.write('\nHelp is on the way.') fnew.close() where get_new_name() or whatever is defined appropriately to transform the old name ('test1.txt', say) into 'test1_appended.txt'... Zach From josef.pktd at gmail.com Fri Jan 14 22:31:38 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 14 Jan 2011 22:31:38 -0500 Subject: [Numpy-discussion] special function xlogy with 0log0=0 Message-ID: I'm trying to fix again a case with x*log(y) which was a converted log(0**0), where x and y should broadcast. Is it possible to get a special function for this. It shows up very often, and any cheap fix, e.g. 
term = x*np.log(y) term[(x==0)*(y==0)] = 0 raises a warning (which I now also see) x*log(y+1e-300) would be easier, but I don't know whether there are no cases where numerical precision deteriorates. In another version, Skipper clipped y What's the best way? Josef From ben.root at ou.edu Sat Jan 15 00:16:02 2011 From: ben.root at ou.edu (Benjamin Root) Date: Fri, 14 Jan 2011 23:16:02 -0600 Subject: [Numpy-discussion] NOOB Alert: Looping Through Text Files... In-Reply-To: <30676099.post@talk.nabble.com> References: <30676099.post@talk.nabble.com> Message-ID: On Friday, January 14, 2011, dstaley wrote: > > Warning, I am a python noob. ?Not only do I not know python, I really don't > know anything about programming outside of ArcInfo and the ancient AML > language. ?Regardless, here is my problem.... > > Let's say I have three text files (test1.txt, test2.txt and test3.txt). > Each text file has 1 line of text in it "This is my text file", to which I > want to add (append) a new line of text saying "I suck at Python and need > help", and then save the file with a suffix attached (eg test1_modified.txt, > test2_modified.txt, test3_modified.txt). > > I guess this is the equivalent of making a change in MS Word and using the > "Save As..." command to maintain the integrity of the original file, and > save the changes in a new file. ?But, I want to also do that in a loop (this > is a simplified example of something I want to do with hundreds of text > files). > > Now, I understand how to add this line to an existing text file: > > text_file = open("test1.txt", "a") > text_file.write("\nI suck at Python and need help") > text_file.close() > > While this adds a line of text, it saves the change to the original file > (does not add the _modified.txt suffix to the file name), nor does it allow > me to loop through all three of the files. > > I'm sure this is an easy thing to do, and is online in a million places. > Unfortunately, I just cannot seem to find the answer. ?Here is my thought > process: > > First i would define the list from which I would loop: > > textlist = ["test1.txt", "test2.txt", "test3.txt"] > > for i in textlist: > ?? ? ? ?text_file = open(textlist, "a") > ?? ? ? ?text_file.write("\nI suck at Python and need help") > ?? ? ? ?text_file.close() > > But, this doesn't work. ?It gives me the error: > > coercing to Unicode: need string or buffer, list found > > SO, I guess I need to do this from something other than a list? > > Even if it did work, it does not suit my needs as it does not create a new > file and does not allow me to add the _modified.txt suffix, which will allow > me to keep the original file intact. > > >From a responses to a previous post, this seems as if it may have something > to do with a python dictionary, but I'm not sure. > > I'm probably totally off on how this should even be written, so any advice > or suggestions would be greatly appreciated. > > Thanks in advance for your help! > > -DS > DS, First, the problem with your loop is that you should pass 'i' not 'textlist' to the call to open. Second, to do what you want, you need to copy the file (maybe something in is.sys?) and then append the text to that copied file. I hope that helps > -- > View this message in context: http://old.nabble.com/NOOB-Alert%3A-Looping-Through-Text-Files...-tp30676099p30676099.html > Sent from the Numpy-discussion mailing list archive at Nabble.com. 
> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From imhexyl at gmail.com Sat Jan 15 04:15:07 2011 From: imhexyl at gmail.com (Hexyl Chan) Date: Sat, 15 Jan 2011 17:15:07 +0800 Subject: [Numpy-discussion] Is Savitzky-Golay filter provided by cookbook different from matlab's? Message-ID: I have used the Savitzky-Golay filter provided by cookbook, but the result is different from the one given by matlab function sgolayfilt(x,2,15). I attached the numpy code below: #! /usr/bin/env python # -*- coding:utf-8 -*- import numpy as np def savitzky_golay(y, window_size, order, deriv=0): r"""Smooth (and optionally differentiate) data with a Savitzky-Golay filter. The Savitzky-Golay filter removes high frequency noise from data. It has the advantage of preserving the original shape and features of the signal better than other types of filtering approaches, such as moving averages techhniques. Parameters ---------- y : array_like, shape (N,) the values of the time history of the signal. window_size : int the length of the window. Must be an odd integer number. order : int the order of the polynomial used in the filtering. Must be less then `window_size` - 1. deriv: int the order of the derivative to compute (default = 0 means only smoothing) Returns ------- ys : ndarray, shape (N) the smoothed signal (or it's n-th derivative). Notes ----- The Savitzky-Golay is a type of low-pass filter, particularly suited for smoothing noisy data. The main idea behind this approach is to make for each point a least-square fit with a polynomial of high order over a odd-sized window centered at the point. Examples -------- t = np.linspace(-4, 4, 500) y = np.exp( -t**2 ) + np.random.normal(0, 0.05, t.shape) ysg = savitzky_golay(y, window_size=31, order=4) import matplotlib.pyplot as plt plt.plot(t, y, label='Noisy signal') plt.plot(t, np.exp(-t**2), 'k', lw=1.5, label='Original signal') plt.plot(t, ysg, 'r', label='Filtered signal') plt.legend() plt.show() References ---------- .. [1] A. Savitzky, M. J. E. Golay, Smoothing and Differentiation of Data by Simplified Least Squares Procedures. Analytical Chemistry, 1964, 36 (8), pp 1627-1639. .. [2] Numerical Recipes 3rd Edition: The Art of Scientific Computing W.H. Press, S.A. Teukolsky, W.T. Vetterling, B.P. 
Flannery Cambridge University Press ISBN-13: 9780521880688 """ try: window_size = np.abs(np.int(window_size)) order = np.abs(np.int(order)) except ValueError, msg: raise ValueError("window_size and order have to be of type int") if window_size % 2 != 1 or window_size < 1: raise TypeError("window_size size must be a positive odd number") if window_size < order + 2: raise TypeError("window_size is too small for the polynomials order") order_range = range(order+1) half_window = (window_size -1) // 2 # precompute coefficients b = np.mat([[k**i for i in order_range] for k in range(-half_window, half_window+1)]) m = np.linalg.pinv(b).A[deriv] # pad the signal at the extremes with # values taken from the signal itself firstvals = y[0] - np.abs( y[1:half_window+1][::-1] - y[0] ) lastvals = y[-1] + np.abs(y[-half_window-1:-1][::-1] - y[-1]) y = np.concatenate((firstvals, y, lastvals)) return np.convolve( m, y, mode='valid') if __name__ == "__main__": rrstype = np.dtype({'names':['wavelenth', 'rrs'], 'formats':['f','f']}) record_list = [] record_file = open("200607NO1.txt") for line in record_file: item_list = line.strip().split('\t') record_list.append((item_list[0], item_list[1])) record_file.close() record_array = np.array(record_list,dtype=rrstype) rrs_array = record_array["rrs"] rrs_array_2 = savitzky_golay(rrs_array, 15, 2) rrs_array_2.tofile("savitzky-golay-result.txt", "\n") -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at googlemail.com Sat Jan 15 08:28:57 2011 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Sat, 15 Jan 2011 21:28:57 +0800 Subject: [Numpy-discussion] segfault on complex array on solaris x86 In-Reply-To: References: Message-ID: I've opened http://projects.scipy.org/numpy/ticket/1713 so this doesn't get lost. Ralf On Thu, Jan 6, 2011 at 12:27 AM, John Hunter wrote: > johnh at udesktop253:~> gcc --version > gcc (GCC) 3.4.3 (csl-sol210-3_4-branch+sol_rpath) > Copyright (C) 2004 Free Software Foundation, Inc. > This is free software; see the source for copying conditions. There is NO > warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 
> > johnh at udesktop253:~> uname -a > SunOS udesktop253 5.10 Generic_142910-17 i86pc i386 i86pc > > johnh at udesktop253:~> cat test.py > import numpy as np > print np.__version__ > fs = 1000 > t = np.linspace(0, 0.3, 301) > A = np.array([2, 8]).reshape(-1, 1) > f = np.array([150, 140]).reshape(-1, 1) > xn = (A * np.exp(2j * np.pi * f * t)).sum(axis=0) > > johnh at udesktop253:~> python test.py > 2.0.0.dev-9451260 > Segmentation Fault (core dumped) > johnh at udesktop253:~> > > johnh at udesktop253:~> sudo pstack /var/core/core.python.957 > core '/var/core/core.python.957' of 9397: python test.py > febf1928 cexp (0, 0, 0, 0, 8060ab0, 84321ac) + 1b0 > fe9657e0 npy_cexp (80458e0, 0, 0, 0, 0, 84e2530) + 30 > fe95064f nc_exp (8045920, 84e72a0, 8045978, 8045920, 10, 10) + 3f > fe937d5b PyUFunc_D_D (84e2530, 84e20f4, 84e25b0, fe950610, 1, 0) + 5b > fe95e818 PyUFunc_GenericFunction (81e96e0, 807deac, 0, 80460b8, 2, 2) + > 448 > fe95fb10 ufunc_generic_call (81e96e0, 807deac, 0, fe98a820) + 70 > feeb2d78 PyObject_Call (81e96e0, 807deac, 0, 80a24ec, 8061c08, 0) + 28 > fef11900 PyEval_EvalFrame (80a2394, 81645a0, 8079824, 8079824) + 146c > fef17708 PyEval_EvalCodeEx (81645a0, 8079824, 8079824, 0, 0, 0) + 620 > fef178af PyEval_EvalCode (81645a0, 8079824, 8079824, 8061488, fef3d9ee, 0) > + 2f > fef3d095 PyRun_FileExFlags (feb91c98, 804687b, 101, 8079824, 8079824, 1) + > 75 > fef3d9ee PyRun_SimpleFileExFlags (feb91c98, 804687b, 1, 80465a8, fef454a1, > 804687b) + 172 > fef3e4fd PyRun_AnyFileExFlags (feb91c98, 804687b, 1, 80465a8) + 61 > fef454a1 Py_Main (1, 80466b8, feb1cf35, fea935a1, 29, feb96750) + 9d9 > 08050862 main (2, 80466b8, 80466c4) + 22 > 08050758 _start (2, 8046874, 804687b, 0, 8046883, 80468ad) + 60 > > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Sat Jan 15 15:27:26 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 15 Jan 2011 15:27:26 -0500 Subject: [Numpy-discussion] compatibility for supporting more than 2 versions of numpy Message-ID: After upgrading to numpy 1.5.1 I got caught by some depreciated features. Given the depreciation policy of numpy, if we want to support more than two versions of numpy, then we need some conditional execution. Does anyone have any compatibility functions? I haven't looked at it carefully yet, but statsmodels might need things like the following if we want to support numpy 1.3 if np.__version__ < '1.5': freq,hsupp = np.histogram(rvs, histsupp, new=True) else: freq,hsupp = np.histogram(rvs,histsupp) matplotlib says it supports numpy >=1.1 but I didn't see any compatibility code that I could "borrow". Or do I worry for nothing? The compatibility.py in statsmodels is still almost empty. Josef From ralf.gommers at googlemail.com Sun Jan 16 00:22:02 2011 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Sun, 16 Jan 2011 13:22:02 +0800 Subject: [Numpy-discussion] Prime size FFT: bluestein transform vs general chirp/z transform ? In-Reply-To: References: Message-ID: On Mon, Jan 3, 2011 at 2:46 PM, David Cournapeau wrote: > Hi, > > I finally took the time to clean up my code to speed up prime-size FFT > (which use a O(N^2) algo in both numpy and scipy). The code is there: > https://github.com/cournape/numpy/tree/bluestein (most of the code is > tests, because numpy.fft had almost none). 
> Bottom line: it is used only for prime numbers, and is faster than the > current code for complex transforms > 500. Because of python + > inherent bluestein overhead, this is mostly useful for "long" fft > (where the speed up is significant - already 100x speed up for prime > size ~ 50000). > Very nice, works like a charm for me! > > Several comments: > - the overhead is pretty significant (on my machine, bluestein > transfrom is slower for prime size < 500) > - it could be used as such for real transforms, but the overhead > would be even more significant (there is no bluestein transform for > real transforms, so one needs to re-rexpress real transforms in term > of complex ones, multiplying the overhead by 2x). There are several > alternatives to make things faster (Rader-like transform, as used by > fftw), but I think this would be quite hard to do in python without > significant slowdown, because the code cannot be vectorized. > - one could also decide to provide a chirp-z transform, of which > Bluestein transform is a special case. Maybe this is more adapted to > scipy ? > This is just terminology, but according to Wikipedia the Bluestein transform is the chirp-z transform, which is a special case of the z-transform. Is that what you meant? A z-transform may also be useful for digital filter design and other applications, scipy seems like the right place for it. > - more generic code will require a few simple (but not trivial) > arithmetic-like functions (find prime factors, find generator of Z/nZ > groups with n prime, etc...). Where should I put those ? > > I'm guessing you are talking about code that allows you to use the Bluestein algorithm also for non-prime sizes where it makes sense, for example to speed up the second case of this: In [24]: x = np.random.random(5879) # a large prime In [25]: %timeit np.fft.fft(x) 100 loops, best of 3: 8.65 ms per loop In [26]: x = np.random.random(5879*2) # Bluestein not used In [27]: %timeit np.fft.fft(x) 1 loops, best of 3: 241 ms per loop Probably just keep it in fft/helper.py is it's not too much code? Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ater1980 at gmail.com Sun Jan 16 03:24:26 2011 From: ater1980 at gmail.com (Alex Ter-Sarkissov) Date: Sun, 16 Jan 2011 21:24:26 +1300 Subject: [Numpy-discussion] float conversion Message-ID: hi every1, I got the following issue: I wrote a function that converts binary strings into a decimal value (binary expansion). When I write type(x) to find out the type of the value I get NoneType. Therefore I can't convert it into anything else, such as float numbers, since command float(x) returns TypeError: float() argument must be a string or a number Any ideas what to do with this (e.g. convert to floating numbers)? cheers, Alex -------------- next part -------------- An HTML attachment was scrubbed... URL: From ckkart at hoc.net Sun Jan 16 06:05:24 2011 From: ckkart at hoc.net (Christian K.) Date: Sun, 16 Jan 2011 12:05:24 +0100 Subject: [Numpy-discussion] float conversion In-Reply-To: References: Message-ID: Am 16.01.11 09:24, schrieb Alex Ter-Sarkissov: > hi every1, > > I got the following issue: I wrote a function that converts binary > strings into a decimal value (binary expansion). When I write > > type(x) > > to find out the type of the value I get NoneType. Therefore I can't Your function most probably returns None. Show us your code and we will able to help. 
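A minimal sketch of the usual culprit (the function names below are made up, since we have not seen your code yet):

def bin2dec_broken(s):
    val = int(s, 2)
    print val          # prints the value but never returns it, so the caller gets None

def bin2dec(s):
    return int(s, 2)   # explicit return, so the caller gets an int

x = bin2dec('1011')
print type(x), float(x)   # <type 'int'> 11.0

If your function ends in a print (or simply falls off the end without a return statement), type(x) will be NoneType and float(x) will fail exactly as you describe.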
Regards, Christian From hector1618 at gmail.com Sun Jan 16 08:09:41 2011 From: hector1618 at gmail.com (Hector troy) Date: Sun, 16 Jan 2011 18:39:41 +0530 Subject: [Numpy-discussion] Getting a clone copy of the NumPy repository. Message-ID: Hello everyone, I am a newbie on this open source world, and sincerely trying to make contribution to the development of Numpy. I was trying to learn about making patches from http://docs.scipy.org/doc/numpy/dev/gitwash/patching.html but unable to get the clone of Numpy repository. In the terminal error massage shown is - $hector at hector:~$ sudo git clone git://github.com/numpy/numpy.git [sudo] password for hector: Initialized empty Git repository in /home/hector/numpy/.git/ github.com[0: 207.97.227.239]: errno=Connection timed out fatal: unable to connect a socket (Connection timed out) $hector at hector:~$ My internet is quite good at the moment but unable to understand why am I getting this error. Any help regarding this will be extremely helpful and encouraging for me. Thanking you in anticipation. -- -Regards Hector Whenever you think you can or you can't, in either way you are right. -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Sun Jan 16 10:01:29 2011 From: cournape at gmail.com (David Cournapeau) Date: Mon, 17 Jan 2011 00:01:29 +0900 Subject: [Numpy-discussion] Getting a clone copy of the NumPy repository. In-Reply-To: References: Message-ID: On Sun, Jan 16, 2011 at 10:09 PM, Hector troy wrote: > Hello everyone, I am a newbie on this open source world, and sincerely > trying to make contribution to the development of Numpy. I was trying to > learn about making patches from > http://docs.scipy.org/doc/numpy/dev/gitwash/patching.html? but unable to get > the clone of Numpy repository. > > In the terminal error massage shown is - > > ??? $hector at hector:~$ sudo git clone git://github.com/numpy/numpy.git > ??? [sudo] password for hector: > ??? Initialized empty Git repository in /home/hector/numpy/.git/ > ??? github.com[0: 207.97.227.239]: errno=Connection timed out > ??? fatal: unable to connect a socket (Connection timed out) > ??? $hector at hector:~$ You should try through http: it may be that your network blocks the git protocol/port: git clone http://github.com/numpy/numpy.git You should also avoid using sudo to clone repositories (although it is unlikely to be the cause of your issue). cheers, David From hector1618 at gmail.com Sun Jan 16 10:19:19 2011 From: hector1618 at gmail.com (Hector troy) Date: Sun, 16 Jan 2011 20:49:19 +0530 Subject: [Numpy-discussion] Getting a clone copy of the NumPy repository. In-Reply-To: References: Message-ID: On Sun, Jan 16, 2011 at 8:31 PM, David Cournapeau wrote: > On Sun, Jan 16, 2011 at 10:09 PM, Hector troy > wrote: > > Hello everyone, I am a newbie on this open source world, and sincerely > > trying to make contribution to the development of Numpy. I was trying to > > learn about making patches from > > http://docs.scipy.org/doc/numpy/dev/gitwash/patching.html but unable to > get > > the clone of Numpy repository. 
> > > > In the terminal error massage shown is - > > > > $hector at hector:~$ sudo git clone git://github.com/numpy/numpy.git > > [sudo] password for hector: > > Initialized empty Git repository in /home/hector/numpy/.git/ > > github.com[0: 207.97.227.239]: errno=Connection timed out > > fatal: unable to connect a socket (Connection timed out) > > $hector at hector:~$ > > You should try through http: it may be that your network blocks the > git protocol/port: > > git clone http://github.com/numpy/numpy.git > > You should also avoid using sudo to clone repositories (although it is > unlikely to be the cause of your issue). > > cheers, > > David > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > Mr. David, Thanks a lot, it worked. Hope you will help me with future quarries. Thanks again, -- -Regards Hector Whenever you think you can or you can't, in either way you are right. -------------- next part -------------- An HTML attachment was scrubbed... URL: From zauborg at yahoo.com Sun Jan 16 13:50:12 2011 From: zauborg at yahoo.com (zb) Date: Sun, 16 Jan 2011 10:50:12 -0800 (PST) Subject: [Numpy-discussion] defmatrix 1.3 versus 1.5 Message-ID: <962629.93144.qm@web121604.mail.ne1.yahoo.com> Hi I am trying to compile a py2exe app. It works with numpy 1.3 but if I try numpy 1.5 I get an error when loading the app.exe Traceback (most recent call last): File "artisan.pyw", line 109, in File "numpy\__init__.pyo", line 136, in File "numpy\add_newdocs.pyo", line 9, in File "numpy\lib\__init__.pyo", line 5, in File "numpy\lib\index_tricks.pyo", line 15, in ImportError: No module named defmatrix I've noticed that there is a change in the location of defmatrix from 1.3 to the newer 1.5. 1.3 numpy/core/defmatrix 1.5 numpy/matrixlib/defmatrix How could I make py2exe work? Any tips? Thanks From cycomanic at gmail.com Sun Jan 16 18:23:59 2011 From: cycomanic at gmail.com (=?ISO-8859-1?Q?Jochen_Schr=F6der?=) Date: Mon, 17 Jan 2011 10:23:59 +1100 Subject: [Numpy-discussion] NOOB Alert: Looping Through Text Files... In-Reply-To: <30676099.post@talk.nabble.com> References: <30676099.post@talk.nabble.com> Message-ID: <4D337E0F.3070206@gmail.com> On 15/01/11 09:52, dstaley wrote: > > Warning, I am a python noob. Not only do I not know python, I really don't > know anything about programming outside of ArcInfo and the ancient AML > language. Regardless, here is my problem.... > > Let's say I have three text files (test1.txt, test2.txt and test3.txt). > Each text file has 1 line of text in it "This is my text file", to which I > want to add (append) a new line of text saying "I suck at Python and need > help", and then save the file with a suffix attached (eg test1_modified.txt, > test2_modified.txt, test3_modified.txt). > > I guess this is the equivalent of making a change in MS Word and using the > "Save As..." command to maintain the integrity of the original file, and > save the changes in a new file. But, I want to also do that in a loop (this > is a simplified example of something I want to do with hundreds of text > files). 
> > Now, I understand how to add this line to an existing text file: > > text_file = open("test1.txt", "a") > text_file.write("\nI suck at Python and need help") > text_file.close() > > While this adds a line of text, it saves the change to the original file > (does not add the _modified.txt suffix to the file name), nor does it allow > me to loop through all three of the files. > > I'm sure this is an easy thing to do, and is online in a million places. > Unfortunately, I just cannot seem to find the answer. Here is my thought > process: > > First i would define the list from which I would loop: > > textlist = ["test1.txt", "test2.txt", "test3.txt"] > > for i in textlist: > text_file = open(textlist, "a") ^^^^^^^^ This is your problem. You create the textfile list, and then loop over the list. Now the is are your elements of the list, however you are passing the list to the open function. That is what the error says, it expects a string but found a list. The better way to do this would be: for filename in textlist: text_file = open(filename, "a") ... > text_file.write("\nI suck at Python and need help") > text_file.close() > > But, this doesn't work. It gives me the error: > > coercing to Unicode: need string or buffer, list found > > SO, I guess I need to do this from something other than a list? > > Even if it did work, it does not suit my needs as it does not create a new > file and does not allow me to add the _modified.txt suffix, which will allow > me to keep the original file intact. There are a number of ways to to do this and what the best way is might depend on your text files. If they are very short the easiest way is to just read the content of your text files and write the content to a different file. something like this: for fn in textlist: fp = open(fn, 'r') fp.close() s = fp.read() s += "I suck at Python and need help") fp_new = open(fn[:-4]+'_modified.txt','w') fp_new.write(s) fp_new.close() Just for the future this is the numpy list, which is for discussing issues relating to the numpy python module, the next time you might want to post a question like this to one of the python beginners lists. This link might get you started: http://wiki.python.org/moin/BeginnersGuide Cheers Jochen > >> From a responses to a previous post, this seems as if it may have something > to do with a python dictionary, but I'm not sure. > > I'm probably totally off on how this should even be written, so any advice > or suggestions would be greatly appreciated. > > Thanks in advance for your help! > > -DS > From ater1980 at gmail.com Mon Jan 17 02:25:45 2011 From: ater1980 at gmail.com (Alex Ter-Sarkissov) Date: Mon, 17 Jan 2011 20:25:45 +1300 Subject: [Numpy-discussion] http://mail.scipy.org/pipermail/numpy-discussion/2011-January/054512.html Message-ID: hi thanks I've sorted ou the issue Alex -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From tom.holderness at newcastle.ac.uk Mon Jan 17 07:35:56 2011 From: tom.holderness at newcastle.ac.uk (Tom Holderness) Date: Mon, 17 Jan 2011 12:35:56 +0000 Subject: [Numpy-discussion] find machine maximum numpy array size In-Reply-To: <8643A9A5465D9C4F8C6DB03185B82A277240F0687E@EXSAN01.campus.ncl.ac.uk> References: <8643A9A5465D9C4F8C6DB03185B82A277240F0687A@EXSAN01.campus.ncl.ac.uk> <8643A9A5465D9C4F8C6DB03185B82A277240F0687E@EXSAN01.campus.ncl.ac.uk> Message-ID: <8643A9A5465D9C4F8C6DB03185B82A277240F0687F@EXSAN01.campus.ncl.ac.uk> Hi, How do I find the maximum possible array size for a given data type on a given architecture? For example if I do the following on a 32-bit Windows machine: matrix = np.zeros((8873,9400),np.dtype('f8')) I get, Traceback (most recent call last): File "", line 1, in matrix = np.zeros((8873,9400),np.dtype('f8')) MemoryError If I reduce the matrix size then it works. However, if I run the original command on an equivalent 32-bit Linux machine this works fine (presumably some limit of memory allocation in the Windows kernel? I tested increasing the available RAM and it doesn't solve the problem). Is there a way I can find this limit? When distributing software to users (who all run different architectures) it would be great if we could check this before running the process and catch the error before the user hits "run". Many thanks in advance, Tom From numpy-discussion at maubp.freeserve.co.uk Mon Jan 17 09:05:41 2011 From: numpy-discussion at maubp.freeserve.co.uk (Peter) Date: Mon, 17 Jan 2011 14:05:41 +0000 Subject: [Numpy-discussion] 64 bit Windows installers for NumPy? Message-ID: Hi all, Are there plans to provide official 64bit Windows installers for NumPy? I'm aware that Christoph Gohlke had been able to do this, since he offers unofficial plain builds and MKL builds for NumPy here: http://www.lfd.uci.edu/~gohlke/pythonlibs/ Regards, Peter From robert.kern at gmail.com Mon Jan 17 10:29:27 2011 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 17 Jan 2011 09:29:27 -0600 Subject: [Numpy-discussion] find machine maximum numpy array size In-Reply-To: <8643A9A5465D9C4F8C6DB03185B82A277240F0687F@EXSAN01.campus.ncl.ac.uk> References: <8643A9A5465D9C4F8C6DB03185B82A277240F0687A@EXSAN01.campus.ncl.ac.uk> <8643A9A5465D9C4F8C6DB03185B82A277240F0687E@EXSAN01.campus.ncl.ac.uk> <8643A9A5465D9C4F8C6DB03185B82A277240F0687F@EXSAN01.campus.ncl.ac.uk> Message-ID: On Mon, Jan 17, 2011 at 06:35, Tom Holderness wrote: > Hi, > > How do I find the maximum possible array size for a given data type on a given architecture? > For example if I do the following on a 32-bit Windows machine: > > matrix = np.zeros((8873,9400),np.dtype('f8')) > > I get, > Traceback (most recent call last): > ?File "", line 1, in > ? ?matrix = np.zeros((8873,9400),np.dtype('f8')) > MemoryError > > If I reduce the matrix size then it works. > However, if I run the original command on an equivalent 32-bit Linux machine this works fine (presumably some limit of memory allocation in the Windows kernel? I tested increasing the available RAM and it doesn't solve the problem). > > Is there a way I can find this limit? When distributing software to users (who all run different architectures) it would be great if we could check this before running the process and catch the error before the user hits "run". No, there is no way to know a priori. It's not just dependent on the system, but also on where memory is currently allocated. 
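In practice the only reliable check is to attempt the allocation up front and handle the failure, rather than trying to predict it. A rough sketch (the shape and dtype are just the values from your example):

import numpy as np

def try_allocate(shape, dtype='f8'):
    # Attempt to reserve the address space; return None if it cannot be done.
    try:
        return np.empty(shape, dtype=dtype)
    except MemoryError:
        return None

matrix = try_allocate((8873, 9400))
if matrix is None:
    print "not enough contiguous address space for this array"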
You could in principle determine the maximum size of the address space from the system. However, if you have, say, 3 Gb of address space for your process, and you allocate two blocks of 1 Gb each, those allocations may not be right next to each other. If the blocks start on 0.5 Gb and 2.0 Gb respectively, there is 1 Gb of free address space, but broken up into two 0.5 Gb blocks. You could not allocate a third 1 Gb block because there is nowhere to put it. You can detect this problem early by doing an np.empty() of the right size early in the program. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." ? -- Umberto Eco From domors at gmx.net Mon Jan 17 11:02:43 2011 From: domors at gmx.net (Stefan Reiterer) Date: Mon, 17 Jan 2011 17:02:43 +0100 Subject: [Numpy-discussion] Strange behaviour with for loops + numpy arrays Message-ID: <20110117160243.164370@gmx.net> Hi all! I made some "performance" tests with numpy to compare numpy on one cpu with mpi on 4 processesors, and something appears quite strange to me: I have the following code: N = 2**10*4 K = 16000 x = numpy.random.randn(N).astype(numpy.float32) x *= 10**10 print "x:", x t1 = time.time() #do something... for k in xrange(K): x *= 0.99 print "altered x:", x t = time.time() - t1 print "# loops:", K, "time needed:", t, " s " # loops: 1000 time needed: 0.0134310722351 s # loops: 2000 time needed: 0.028107881546 s # loops: 4000 time needed: 0.0367569923401 s # loops: 8000 time needed: 0.075756072998 s # loops: 16000 time needed: 2.11396384239 s So for K = 16000 it didn't need twice the amount of time as expected, it took 20 x more time! After that jump it seem to "normalize" # loops: 32000 time needed: 8.25508499146 s # loops: 64000 time needed: 20.5365290642 s First I suspected xrange was the culprit, but if I tried k = 0 while k < K: x *= 0.99 it changed anything. When I tried simply a=0 for k in xrange(K): a = a+1 none of the effects above triggered, so I suspect that numpy has to be involved. My Hardware is 2.3 GHz Intel Dual Core, 2 GB Ram and Ubuntu 10.04. For my tests I tried it with Python 2.6, and Sage 4.6. (which uses 2.6 too) Also changing the size of arrays or changing the computer didn't help. Has anyone an Idea what had could happen? Kind regards, maldun -- Empfehlen Sie GMX DSL Ihren Freunden und Bekannten und wir belohnen Sie mit bis zu 50,- Euro! https://freundschaftswerbung.gmx.de From opossumnano at gmail.com Mon Jan 17 11:01:28 2011 From: opossumnano at gmail.com (Tiziano Zito) Date: Mon, 17 Jan 2011 17:01:28 +0100 Subject: [Numpy-discussion] MDP release 3.0 Message-ID: <20110117160128.GC25627@tulpenbaum.cognition.tu-berlin.de> We are glad to announce release 3.0 of the Modular toolkit for Data Processing (MDP). MDP is a Python library of widely used data processing algorithms that can be combined according to a pipeline analogy to build more complex data processing software. The base of available algorithms includes signal processing methods (Principal Component Analysis, Independent Component Analysis, Slow Feature Analysis), manifold learning methods ([Hessian] Locally Linear Embedding), several classifiers, probabilistic methods (Factor Analysis, RBM), data pre-processing methods, and many others. What's new in version 3.0? 
-------------------------- - Python 3 support - New extensions: caching and gradient - Automatically generated wrappers for scikits.learn algorithms - Shogun and libsvm wrappers - New algorithms: convolution, several classifiers and several user-contributed nodes - Several new examples on the homepage - Improved and expanded tutorial - Several improvements and bug fixes - New license: MDP goes BSD! Resources --------- Download: http://sourceforge.net/projects/mdp-toolkit/files Homepage: http://mdp-toolkit.sourceforge.net Mailing list: http://lists.sourceforge.net/mailman/listinfo/mdp-toolkit-users Acknowledgments --------------- We thank the contributors to this release: Sven D?hne, Alberto Escalante, Valentin Haenel, Yaroslav Halchenko, Sebastian H?fer, Michael Hull, Samuel John, Jos? Quesada, Ariel Rokem, Benjamin Schrauwen, David Verstraeten, Katharina Maria Zeiner. The MDP developers, Pietro Berkes Zbigniew J?drzejewski-Szmek Rike-Benjamin Schuppner Niko Wilbert Tiziano Zito From josef.pktd at gmail.com Mon Jan 17 11:28:56 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 17 Jan 2011 11:28:56 -0500 Subject: [Numpy-discussion] compatibility for supporting more than 2 versions of numpy In-Reply-To: References: Message-ID: On Sat, Jan 15, 2011 at 3:27 PM, wrote: > After upgrading to numpy 1.5.1 I got caught by some depreciated > features. Given the depreciation policy of numpy, if we want to > support more than two versions of numpy, then we need some conditional > execution. > > Does anyone have any compatibility functions? > > I haven't looked at it carefully yet, but statsmodels might need > things like the following if we want to support numpy 1.3 > > ? ?if np.__version__ < '1.5': > ? ? ? ?freq,hsupp = np.histogram(rvs, histsupp, new=True) > ? ?else: > ? ? ? ?freq,hsupp = np.histogram(rvs,histsupp) > > matplotlib says it supports numpy >=1.1 but I didn't see any > compatibility code that I could "borrow". > Or do I worry for nothing? The compatibility.py in statsmodels is > still almost empty. for scipy.linalg, in numdifftools, I changed this in core (in my copy) if numpy.__version__ < '1.5': [qromb,rromb] = linalg.qr(rmat, econ=True) else: [qromb,rromb] = linalg.qr(rmat, mode='economic') > > Josef > From josef.pktd at gmail.com Mon Jan 17 11:32:12 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 17 Jan 2011 11:32:12 -0500 Subject: [Numpy-discussion] compatibility for supporting more than 2 versions of numpy In-Reply-To: References: Message-ID: On Mon, Jan 17, 2011 at 11:28 AM, wrote: > On Sat, Jan 15, 2011 at 3:27 PM, ? wrote: >> After upgrading to numpy 1.5.1 I got caught by some depreciated >> features. Given the depreciation policy of numpy, if we want to >> support more than two versions of numpy, then we need some conditional >> execution. >> >> Does anyone have any compatibility functions? >> >> I haven't looked at it carefully yet, but statsmodels might need >> things like the following if we want to support numpy 1.3 >> >> ? ?if np.__version__ < '1.5': >> ? ? ? ?freq,hsupp = np.histogram(rvs, histsupp, new=True) >> ? ?else: >> ? ? ? ?freq,hsupp = np.histogram(rvs,histsupp) >> >> matplotlib says it supports numpy >=1.1 but I didn't see any >> compatibility code that I could "borrow". >> Or do I worry for nothing? The compatibility.py in statsmodels is >> still almost empty. > > > for scipy.linalg, in numdifftools, I changed this in core (in my copy) > > ? ? ? ?if numpy.__version__ < '1.5': > ? ? ? ? ? 
?[qromb,rromb] = linalg.qr(rmat, econ=True) > ? ? ? ?else: > ? ? ? ? ? ?[qromb,rromb] = linalg.qr(rmat, mode='economic') > which is of course silly, since this is for the scipy update >> >> Josef >> > From bsouthey at gmail.com Mon Jan 17 12:18:14 2011 From: bsouthey at gmail.com (Bruce Southey) Date: Mon, 17 Jan 2011 11:18:14 -0600 Subject: [Numpy-discussion] compatibility for supporting more than 2 versions of numpy In-Reply-To: References: Message-ID: <4D3479D6.5020803@gmail.com> On 01/17/2011 10:32 AM, josef.pktd at gmail.com wrote: > On Mon, Jan 17, 2011 at 11:28 AM, wrote: >> On Sat, Jan 15, 2011 at 3:27 PM, wrote: >>> After upgrading to numpy 1.5.1 I got caught by some depreciated >>> features. Given the depreciation policy of numpy, if we want to >>> support more than two versions of numpy, then we need some conditional >>> execution. >>> >>> Does anyone have any compatibility functions? >>> >>> I haven't looked at it carefully yet, but statsmodels might need >>> things like the following if we want to support numpy 1.3 >>> >>> if np.__version__< '1.5': >>> freq,hsupp = np.histogram(rvs, histsupp, new=True) >>> else: >>> freq,hsupp = np.histogram(rvs,histsupp) >>> >>> matplotlib says it supports numpy>=1.1 but I didn't see any >>> compatibility code that I could "borrow". >>> Or do I worry for nothing? The compatibility.py in statsmodels is >>> still almost empty. >> >> for scipy.linalg, in numdifftools, I changed this in core (in my copy) >> >> if numpy.__version__< '1.5': >> [qromb,rromb] = linalg.qr(rmat, econ=True) >> else: >> [qromb,rromb] = linalg.qr(rmat, mode='economic') >> > which is of course silly, since this is for the scipy update >>> Josef Scipy release notes usually state the supported numpy version eg from the current 0.8.0 release notes "This release requires Python 2.4 - 2.6 and NumPy 1.4.1 or greater." Consequently if you want to support different numpy versions, then you will need to maintain your own branch with that type of patch. That can get rather complex to maintain. It would be better that you change the code calling numpy/scipy functions rather than the functions themselves such as passing the appropriate *args and **kwargs to the function. I would expect that a try/except block would be more general as well as numpy.__version__ being a str. Bruce From faltet at pytables.org Mon Jan 17 12:22:17 2011 From: faltet at pytables.org (Francesc Alted) Date: Mon, 17 Jan 2011 18:22:17 +0100 Subject: [Numpy-discussion] Strange behaviour with for loops + numpy arrays In-Reply-To: <20110117160243.164370@gmx.net> References: <20110117160243.164370@gmx.net> Message-ID: <201101171822.17398.faltet@pytables.org> A Monday 17 January 2011 17:02:43 Stefan Reiterer escrigu?: > Hi all! > > I made some "performance" tests with numpy to compare numpy on one > cpu with mpi on 4 processesors, and something appears quite strange > to me: > > I have the following code: > > N = 2**10*4 > K = 16000 > > x = numpy.random.randn(N).astype(numpy.float32) > x *= 10**10 > print "x:", x > t1 = time.time() > > #do something... > for k in xrange(K): > x *= 0.99 > > print "altered x:", x > > t = time.time() - t1 > print "# loops:", K, "time needed:", t, " s " > > # loops: 1000 time needed: 0.0134310722351 s > # loops: 2000 time needed: 0.028107881546 s > # loops: 4000 time needed: 0.0367569923401 s > # loops: 8000 time needed: 0.075756072998 s > # loops: 16000 time needed: 2.11396384239 s > > So for K = 16000 it didn't need twice the amount of time as expected, > it took 20 x more time! 
After that jump it seem to "normalize" > # loops: 32000 time needed: 8.25508499146 s > # loops: 64000 time needed: 20.5365290642 s > > First I suspected xrange was the culprit, but if I tried > k = 0 > while k < K: > x *= 0.99 > > it changed anything. > When I tried simply > a=0 > for k in xrange(K): > a = a+1 > > none of the effects above triggered, so I suspect that numpy has to > be involved. My Hardware is 2.3 GHz Intel Dual Core, 2 GB Ram and > Ubuntu 10.04. For my tests I tried it with Python 2.6, and Sage 4.6. > (which uses 2.6 too) > > Also changing the size of arrays or changing the computer didn't > help. > > Has anyone an Idea what had could happen? You are generating denormalized numbers: http://en.wikipedia.org/wiki/Denormal_number Many processors cannot deal efficiently with these beasts in hardware. You may want to convert these numbers to zero if you want more speed. -- Francesc Alted From ndbecker2 at gmail.com Mon Jan 17 13:12:38 2011 From: ndbecker2 at gmail.com (Neal Becker) Date: Mon, 17 Jan 2011 13:12:38 -0500 Subject: [Numpy-discussion] Prime size FFT: bluestein transform vs general chirp/z transform ? References: Message-ID: I just took a look at http://www.katjaas.nl/chirpZ/chirpZ2.html I'm VERY interested in the zoom. Does the code https://github.com/cournape/numpy/tree/bluestein implement the zoom feature? From zauborg at yahoo.com Mon Jan 17 13:21:05 2011 From: zauborg at yahoo.com (zb) Date: Mon, 17 Jan 2011 10:21:05 -0800 (PST) Subject: [Numpy-discussion] defmatrix 1.3 versus 1.5 In-Reply-To: <962629.93144.qm@web121604.mail.ne1.yahoo.com> Message-ID: <577377.49969.qm@web121601.mail.ne1.yahoo.com> I resolved the problem by commenting out two lines in my setup.py #"optimize":1, #"bundle_files": 2, The defmatrix lib was inside \lib\library.zip. However, the program.exe could not find it. Cheers --- On Sun, 1/16/11, zb wrote: > From: zb > Subject: [Numpy-discussion] defmatrix 1.3 versus 1.5 > To: numpy-discussion at scipy.org > Date: Sunday, January 16, 2011, 1:50 PM > Hi > > I am trying to compile a py2exe app. It works with numpy > 1.3 but if I try numpy 1.5 I get an error when loading the > app.exe > > Traceback (most recent call last): > ? File "artisan.pyw", line 109, in > ? File "numpy\__init__.pyo", line 136, in > > ? File "numpy\add_newdocs.pyo", line 9, in > > ? File "numpy\lib\__init__.pyo", line 5, in > > ? File "numpy\lib\index_tricks.pyo", line 15, in > > ImportError: No module named defmatrix > > I've noticed that there is a change in the location of > defmatrix from 1.3 to the newer 1.5. > > 1.3 numpy/core/defmatrix > > 1.5 numpy/matrixlib/defmatrix > > > How could I make py2exe work? Any tips? > > Thanks > > > > ? ? ? > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From josef.pktd at gmail.com Mon Jan 17 13:27:08 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 17 Jan 2011 13:27:08 -0500 Subject: [Numpy-discussion] compatibility for supporting more than 2 versions of numpy In-Reply-To: <4D3479D6.5020803@gmail.com> References: <4D3479D6.5020803@gmail.com> Message-ID: On Mon, Jan 17, 2011 at 12:18 PM, Bruce Southey wrote: > On 01/17/2011 10:32 AM, josef.pktd at gmail.com wrote: >> On Mon, Jan 17, 2011 at 11:28 AM, ?wrote: >>> On Sat, Jan 15, 2011 at 3:27 PM, ?wrote: >>>> After upgrading to numpy 1.5.1 I got caught by some depreciated >>>> features. 
Given the depreciation policy of numpy, if we want to >>>> support more than two versions of numpy, then we need some conditional >>>> execution. >>>> >>>> Does anyone have any compatibility functions? >>>> >>>> I haven't looked at it carefully yet, but statsmodels might need >>>> things like the following if we want to support numpy 1.3 >>>> >>>> ? ? if np.__version__< ?'1.5': >>>> ? ? ? ? freq,hsupp = np.histogram(rvs, histsupp, new=True) >>>> ? ? else: >>>> ? ? ? ? freq,hsupp = np.histogram(rvs,histsupp) >>>> >>>> matplotlib says it supports numpy>=1.1 but I didn't see any >>>> compatibility code that I could "borrow". >>>> Or do I worry for nothing? The compatibility.py in statsmodels is >>>> still almost empty. >>> >>> for scipy.linalg, in numdifftools, I changed this in core (in my copy) >>> >>> ? ? ? ? if numpy.__version__< ?'1.5': >>> ? ? ? ? ? ? [qromb,rromb] = linalg.qr(rmat, econ=True) >>> ? ? ? ? else: >>> ? ? ? ? ? ? [qromb,rromb] = linalg.qr(rmat, mode='economic') >>> >> which is of course silly, since this is for the scipy update >>>> Josef > > Scipy release notes usually state the supported numpy version eg from > the current 0.8.0 release notes > "This release requires Python 2.4 - 2.6 and NumPy 1.4.1 or greater." > Consequently if you want to support different numpy versions, then you > will need to maintain your own branch with that type of patch. That can > get rather complex to maintain. I'm not doing the work of maintaining a scipy that conflicts with numpy. But *if* we want to support users that run numpy 1.3 with scipy 0.7, then we need to use different arguments for calls into numpy and scipy for depreciated and changed function arguments. > > It would be better that you change the code calling numpy/scipy > functions rather than the functions themselves such as passing the > appropriate *args and **kwargs to the function. > > I would expect that a try/except block would be more general as well as > numpy.__version__ being a str. Comparing strings is not a good idea, but I couldn't find anymore the function that parses a version string. As it might be obvious on the mailing list, I'm not a fan of frequent updates. With semi-annual releases, two versions only last a year. Josef > > > Bruce > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From mmueller at python-academy.de Mon Jan 17 15:03:10 2011 From: mmueller at python-academy.de (=?ISO-8859-1?Q?Mike_M=FCller?=) Date: Mon, 17 Jan 2011 21:03:10 +0100 Subject: [Numpy-discussion] Scientific tools tutorial at PyCon US, March 9, 2011 In-Reply-To: References: Message-ID: <4D34A07E.4040705@python-academy.de> Scientific Python Tools not only for Scientists and Engineers ============================================================= This is the title of my three-hour tutorial at PyCon US: http://us.pycon.org/2011/schedule/sessions/164/ It is a compressed version of my much longer course about: * NumPy * SciPy * matplotlib/IPython * extensions with C and Fortran So if your are new to these tools and go to PyCon, you might consider taking the tutorial. Also, if you know somebody who would likely be interested in this tutorial, please spread the word. Thanks. 
Mike -- Mike M?ller mmueller at python-academy.de From domors at gmx.net Mon Jan 17 16:35:24 2011 From: domors at gmx.net (Stefan Reiterer) Date: Mon, 17 Jan 2011 22:35:24 +0100 Subject: [Numpy-discussion] Strange behaviour with for loops + numpy arrays In-Reply-To: <201101171822.17398.faltet@pytables.org> References: <20110117160243.164370@gmx.net> <201101171822.17398.faltet@pytables.org> Message-ID: <20110117213524.152930@gmx.net> Thanks that was the problem! You never stop to learn =) -------- Original-Nachricht -------- > Datum: Mon, 17 Jan 2011 18:22:17 +0100 > Von: Francesc Alted > An: Discussion of Numerical Python > Betreff: Re: [Numpy-discussion] Strange behaviour with for loops + numpy arrays > A Monday 17 January 2011 17:02:43 Stefan Reiterer escrigu?: > > Hi all! > > > > I made some "performance" tests with numpy to compare numpy on one > > cpu with mpi on 4 processesors, and something appears quite strange > > to me: > > > > I have the following code: > > > > N = 2**10*4 > > K = 16000 > > > > x = numpy.random.randn(N).astype(numpy.float32) > > x *= 10**10 > > print "x:", x > > t1 = time.time() > > > > #do something... > > for k in xrange(K): > > x *= 0.99 > > > > print "altered x:", x > > > > t = time.time() - t1 > > print "# loops:", K, "time needed:", t, " s " > > > > # loops: 1000 time needed: 0.0134310722351 s > > # loops: 2000 time needed: 0.028107881546 s > > # loops: 4000 time needed: 0.0367569923401 s > > # loops: 8000 time needed: 0.075756072998 s > > # loops: 16000 time needed: 2.11396384239 s > > > > So for K = 16000 it didn't need twice the amount of time as expected, > > it took 20 x more time! After that jump it seem to "normalize" > > # loops: 32000 time needed: 8.25508499146 s > > # loops: 64000 time needed: 20.5365290642 s > > > > First I suspected xrange was the culprit, but if I tried > > k = 0 > > while k < K: > > x *= 0.99 > > > > it changed anything. > > When I tried simply > > a=0 > > for k in xrange(K): > > a = a+1 > > > > none of the effects above triggered, so I suspect that numpy has to > > be involved. My Hardware is 2.3 GHz Intel Dual Core, 2 GB Ram and > > Ubuntu 10.04. For my tests I tried it with Python 2.6, and Sage 4.6. > > (which uses 2.6 too) > > > > Also changing the size of arrays or changing the computer didn't > > help. > > > > Has anyone an Idea what had could happen? > > You are generating denormalized numbers: > > http://en.wikipedia.org/wiki/Denormal_number > > Many processors cannot deal efficiently with these beasts in hardware. > You may want to convert these numbers to zero if you want more speed. > > -- > Francesc Alted > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion -- GMX DSL Doppel-Flat ab 19,99 Euro/mtl.! Jetzt mit gratis Handy-Flat! http://portal.gmx.net/de/go/dsl From charlesr.harris at gmail.com Tue Jan 18 08:02:16 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 18 Jan 2011 06:02:16 -0700 Subject: [Numpy-discussion] 64 bit Windows installers for NumPy? In-Reply-To: References: Message-ID: I believe the problem has been been 64 bit fortran for ATLAS, the mingw version has/had problems. A plain build using the MS compilers works fine without ATLAS. Chuck On Mon, Jan 17, 2011 at 7:05 AM, Peter < numpy-discussion at maubp.freeserve.co.uk> wrote: > Hi all, > > Are there plans to provide official 64bit Windows installers for NumPy? 
> > I'm aware that Christoph Gohlke had been able to do this, since > he offers unofficial plain builds and MKL builds for NumPy here: > http://www.lfd.uci.edu/~gohlke/pythonlibs/ > > Regards, > > Peter > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at googlemail.com Tue Jan 18 10:09:45 2011 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Tue, 18 Jan 2011 23:09:45 +0800 Subject: [Numpy-discussion] compatibility for supporting more than 2 versions of numpy In-Reply-To: References: <4D3479D6.5020803@gmail.com> Message-ID: On Tue, Jan 18, 2011 at 2:27 AM, wrote: > On Mon, Jan 17, 2011 at 12:18 PM, Bruce Southey > wrote: > > > > Scipy release notes usually state the supported numpy version eg from > > the current 0.8.0 release notes > > "This release requires Python 2.4 - 2.6 and NumPy 1.4.1 or greater." > > Consequently if you want to support different numpy versions, then you > > will need to maintain your own branch with that type of patch. That can > > get rather complex to maintain. > > I'm not doing the work of maintaining a scipy that conflicts with numpy. > But *if* we want to support users that run numpy 1.3 with scipy 0.7, > then we need to use different arguments for calls into numpy and > scipy for depreciated and changed function arguments. > > > > > It would be better that you change the code calling numpy/scipy > > functions rather than the functions themselves such as passing the > > appropriate *args and **kwargs to the function. > > > > I would expect that a try/except block would be more general as well as > > numpy.__version__ being a str. > > Comparing strings is not a good idea, but I couldn't find anymore the > function that parses a version string. > There's parse_numpy_version in pavement.py. The relevant lines are: a = re.compile("^([0-9]+)\.([0-9]+)\.([0-9]+)") return tuple([int(i) for i in a.match(out).groups()[:3]]) > As it might be obvious on the mailing list, I'm not a fan of frequent > updates. With semi-annual releases, two versions only last a year. > > Maybe you don't like the deprecation policy, but how can frequent (if semi-annual can be called frequent) releases be a bad thing? No one likes to write code that doesn't get released for ages. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Tue Jan 18 10:22:58 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 18 Jan 2011 10:22:58 -0500 Subject: [Numpy-discussion] compatibility for supporting more than 2 versions of numpy In-Reply-To: References: <4D3479D6.5020803@gmail.com> Message-ID: On Tue, Jan 18, 2011 at 10:09 AM, Ralf Gommers wrote: > > > On Tue, Jan 18, 2011 at 2:27 AM, wrote: >> >> On Mon, Jan 17, 2011 at 12:18 PM, Bruce Southey >> wrote: >> > >> > Scipy release notes usually state the supported numpy version eg from >> > the current 0.8.0 release notes >> > "This release requires Python 2.4 - 2.6 and NumPy 1.4.1 or greater." >> > Consequently if you want to support different numpy versions, then you >> > will need to maintain your own branch with that type of patch. That can >> > get rather complex to maintain. >> >> I'm not doing the work of maintaining a scipy that conflicts with numpy. 
>> But *if* we want to support users that run numpy 1.3 with scipy 0.7, >> then we need to use different arguments for calls into numpy and >> scipy for depreciated and changed function arguments. >> >> > >> > It would be better that you change the code calling numpy/scipy >> > functions rather than the functions themselves such as passing the >> > appropriate *args and **kwargs to the function. >> > >> > I would expect that a try/except block would be more general as well as >> > numpy.__version__ being a str. >> >> Comparing strings is not a good idea, but I couldn't find anymore the >> function that parses a version string. > > There's parse_numpy_version in pavement.py. The relevant lines are: > ? a = re.compile("^([0-9]+)\.([0-9]+)\.([0-9]+)") > ? return tuple([int(i) for i in a.match(out).groups()[:3]]) > > >> >> As it might be obvious on the mailing list, I'm not a fan of frequent >> updates. With semi-annual releases, two versions only last a year. >> > Maybe you don't like the deprecation policy, but how can frequent (if > semi-annual can be called frequent) releases be a bad thing? No one likes to > write code that doesn't get released for ages. Sorry, this was an ambiguous phrasing. I meant I don't like to update *my* computer very often, because I never know how much time it will take to get everything compatible again. I'm not criticizing the release policy, and I think you are doing a very good job (much better than we do with statsmodels.) Josef > > Ralf > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > From jhtu at Princeton.EDU Tue Jan 18 11:39:16 2011 From: jhtu at Princeton.EDU (Jonathan Tu) Date: Tue, 18 Jan 2011 11:39:16 -0500 Subject: [Numpy-discussion] Installing numpy Message-ID: <91254948-7598-4793-9055-80EC9D427DD6@Princeton.EDU> Hi, I need to reinstall numpy because the cluster I am using was recently overhauled. I am wondering if numpy works with Python 2.7 now. Also, I would like numpy to run as fast as possible. The last time I did this, I was advised to install ATLAS by hand, as the one that comes with RHEL is not suitable. The first time I tried this, I kept running into problems that I think were due to mismatched fortran compilers. Is there a good resource for how to do this? I am fairly new to Linux. Thanks, Jonathan Tu From numpy-discussion at maubp.freeserve.co.uk Tue Jan 18 12:24:13 2011 From: numpy-discussion at maubp.freeserve.co.uk (Peter) Date: Tue, 18 Jan 2011 17:24:13 +0000 Subject: [Numpy-discussion] 64 bit Windows installers for NumPy? In-Reply-To: References: Message-ID: On Tue, Jan 18, 2011 at 1:02 PM, Charles R Harris wrote: > I believe the problem has been been 64 bit fortran for ATLAS, the mingw > version has/had problems. A plain build using the MS compilers works fine > without ATLAS. > > Chuck Do you think there would be interest/demand for official non-ATLAS binaries as a short term solution? I'm thinking also of 3rd party Python libraries that use NumPy, and if they/we can ship a win64 installer if NumPy doesn't. Peter From ischnell at enthought.com Tue Jan 18 15:28:26 2011 From: ischnell at enthought.com (Ilan Schnell) Date: Tue, 18 Jan 2011 14:28:26 -0600 Subject: [Numpy-discussion] Installing numpy In-Reply-To: <91254948-7598-4793-9055-80EC9D427DD6@Princeton.EDU> References: <91254948-7598-4793-9055-80EC9D427DD6@Princeton.EDU> Message-ID: Hello Jonathan, yes, numpy work fine under Python 2.7 now. 
I don't see why building numpy against the system ATLAS should not work, as long as you install the developer version with the header files, and make sure that you edit the site.cfg file correct. - Ilan On Tue, Jan 18, 2011 at 10:39 AM, Jonathan Tu wrote: > Hi, > > I need to reinstall numpy because the cluster I am using was recently > overhauled. ?I am wondering if numpy works with Python 2.7 now. > > Also, I would like numpy to run as fast as possible. ?The last time I > did this, I was advised to install ATLAS by hand, as the one that > comes with RHEL is not suitable. ?The first time I tried this, I kept > running into problems that I think were due to mismatched fortran > compilers. ?Is there a good resource for how to do this? ?I am fairly > new to Linux. > > > > Thanks, > > > > Jonathan Tu > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From jhtu at princeton.edu Tue Jan 18 15:31:21 2011 From: jhtu at princeton.edu (Jonathan Tu) Date: Tue, 18 Jan 2011 15:31:21 -0500 Subject: [Numpy-discussion] Installing numpy In-Reply-To: References: <91254948-7598-4793-9055-80EC9D427DD6@Princeton.EDU> Message-ID: <08A709B5-3F86-4B28-8E9E-99D711024A62@princeton.edu> Hi, I realized that my cluster has MKL installed. I've been trying to install against MKL, but am having trouble getting this to work. After it finishes, I do import numpy numpy.show_config() and nothing about the MKL libraries shows up. I have edited site.cfg to read like this: [mkl] library_dirs = /opt/intel/mkl/10.2.4.032/lib/em64t include_dirs = /opt/intel/mkl/10.2.4.032/include lapack_libs = mkl_lapack mkl_libs = mkl, guide My cluster is using Intel Xeon processors, and I edited cc_exe as follows cc_exe = 'icc -O2 -fPIC' Then I did python setup.py config --compiler=intel build_clib --compiler=intel build_ext --compiler=intel install --prefix=/home/jhtu/local Jonathan Tu On Jan 18, 2011, at 3:28 PM, Ilan Schnell wrote: > Hello Jonathan, > > yes, numpy work fine under Python 2.7 now. I don't see why building > numpy against the system ATLAS should not work, as long as you > install the developer version with the header files, and make sure > that > you edit the site.cfg file correct. > > - Ilan > > On Tue, Jan 18, 2011 at 10:39 AM, Jonathan Tu > wrote: >> Hi, >> >> I need to reinstall numpy because the cluster I am using was recently >> overhauled. I am wondering if numpy works with Python 2.7 now. >> >> Also, I would like numpy to run as fast as possible. The last time I >> did this, I was advised to install ATLAS by hand, as the one that >> comes with RHEL is not suitable. The first time I tried this, I kept >> running into problems that I think were due to mismatched fortran >> compilers. Is there a good resource for how to do this? I am fairly >> new to Linux. 
>> >> >> >> Thanks, >> >> >> >> Jonathan Tu >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From ischnell at enthought.com Tue Jan 18 15:39:16 2011 From: ischnell at enthought.com (Ilan Schnell) Date: Tue, 18 Jan 2011 14:39:16 -0600 Subject: [Numpy-discussion] Installing numpy In-Reply-To: <08A709B5-3F86-4B28-8E9E-99D711024A62@princeton.edu> References: <91254948-7598-4793-9055-80EC9D427DD6@Princeton.EDU> <08A709B5-3F86-4B28-8E9E-99D711024A62@princeton.edu> Message-ID: The MKL configuration looks right, except that I had to use: mkl_libs = mkl_intel_lp64, mkl_intel_thread, mkl_core, iomp5 During the build process, it should tell you what it is linking aginast. Look at the compiler options passed to icc. - Ilan On Tue, Jan 18, 2011 at 2:31 PM, Jonathan Tu wrote: > Hi, > > I realized that my cluster has MKL installed. ?I've been trying to > install against MKL, but am having trouble getting this to work. > After it finishes, I do > > import numpy > numpy.show_config() > > and nothing about the MKL libraries shows up. ?I have edited site.cfg > to read like this: > > [mkl] > library_dirs = /opt/intel/mkl/10.2.4.032/lib/em64t > include_dirs = /opt/intel/mkl/10.2.4.032/include > lapack_libs = mkl_lapack > mkl_libs = mkl, guide > > My cluster is using Intel Xeon processors, and I edited cc_exe as > follows > > cc_exe = 'icc -O2 -fPIC' > > Then I did > > python setup.py config --compiler=intel build_clib --compiler=intel > build_ext --compiler=intel install --prefix=/home/jhtu/local > > > > > Jonathan Tu > > > > > On Jan 18, 2011, at 3:28 PM, Ilan Schnell wrote: > >> Hello Jonathan, >> >> yes, numpy work fine under Python 2.7 now. ?I don't see why building >> numpy against the system ATLAS should not work, as long as you >> install the developer version with the header files, and make sure >> that >> you edit the site.cfg file correct. >> >> - Ilan >> >> On Tue, Jan 18, 2011 at 10:39 AM, Jonathan Tu >> wrote: >>> Hi, >>> >>> I need to reinstall numpy because the cluster I am using was recently >>> overhauled. ?I am wondering if numpy works with Python 2.7 now. >>> >>> Also, I would like numpy to run as fast as possible. ?The last time I >>> did this, I was advised to install ATLAS by hand, as the one that >>> comes with RHEL is not suitable. ?The first time I tried this, I kept >>> running into problems that I think were due to mismatched fortran >>> compilers. ?Is there a good resource for how to do this? ?I am fairly >>> new to Linux. 
>>> >>> >>> >>> Thanks, >>> >>> >>> >>> Jonathan Tu >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From jhtu at princeton.edu Tue Jan 18 16:29:14 2011 From: jhtu at princeton.edu (Jonathan Tu) Date: Tue, 18 Jan 2011 16:29:14 -0500 Subject: [Numpy-discussion] Installing numpy In-Reply-To: References: <91254948-7598-4793-9055-80EC9D427DD6@Princeton.EDU> <08A709B5-3F86-4B28-8E9E-99D711024A62@princeton.edu> Message-ID: Hi, I have installed numpy but the unit tests fail. When I ran them, I got Traceback (most recent call last): File "/home/jhtu/local/lib/python2.7/site-packages/numpy/testing/ decorators.py", line 215, in knownfailer return f(*args, **kwargs) File "/home/jhtu/local/lib/python2.7/site-packages/numpy/core/tests/ test_umath_complex.py", line 312, in test_special_values assert_almost_equal(np.log(np.conj(xa[i])), np.conj(np.log(xa[i]))) File "/home/jhtu/local/lib/python2.7/site-packages/numpy/testing/ utils.py", line 443, in assert_almost_equal raise AssertionError(msg) AssertionError: Arrays are not almost equal ACTUAL: array([-inf+3.14159265j]) DESIRED: array([-inf-3.14159265j]) This was with numpy built against MKL. To install I modified site.cfg to read [mkl] library_dirs = /opt/intel/mkl/10.2.4.032/lib/em64t include_dirs = /opt/intel/mkl/10.2.4.032/include lapack_libs = mkl_lapack mkl_libs = mkl_intel_lp64, mkl_intel_thread, mkl_core My cluster is using Intel Xeon processors, and I edited cc_exe as follows cc_exe = 'icc -O2 -fPIC' I installed using python setup.py config --compiler=intel build_clib --compiler=intel build_ext --compiler=intel install --prefix=/home/jhtu/local Jonathan Tu On Jan 18, 2011, at 3:39 PM, Ilan Schnell wrote: > The MKL configuration looks right, except that I had to use: > mkl_libs = mkl_intel_lp64, mkl_intel_thread, mkl_core, iomp5 > > During the build process, it should tell you what it is linking > aginast. Look at the compiler options passed to icc. > > - Ilan > > On Tue, Jan 18, 2011 at 2:31 PM, Jonathan Tu > wrote: >> Hi, >> >> I realized that my cluster has MKL installed. I've been trying to >> install against MKL, but am having trouble getting this to work. >> After it finishes, I do >> >> import numpy >> numpy.show_config() >> >> and nothing about the MKL libraries shows up. I have edited site.cfg >> to read like this: >> >> [mkl] >> library_dirs = /opt/intel/mkl/10.2.4.032/lib/em64t >> include_dirs = /opt/intel/mkl/10.2.4.032/include >> lapack_libs = mkl_lapack >> mkl_libs = mkl, guide >> >> My cluster is using Intel Xeon processors, and I edited cc_exe as >> follows >> >> cc_exe = 'icc -O2 -fPIC' >> >> Then I did >> >> python setup.py config --compiler=intel build_clib --compiler=intel >> build_ext --compiler=intel install --prefix=/home/jhtu/local >> >> >> >> >> Jonathan Tu >> >> >> >> >> On Jan 18, 2011, at 3:28 PM, Ilan Schnell wrote: >> >>> Hello Jonathan, >>> >>> yes, numpy work fine under Python 2.7 now. 
I don't see why building >>> numpy against the system ATLAS should not work, as long as you >>> install the developer version with the header files, and make sure >>> that >>> you edit the site.cfg file correct. >>> >>> - Ilan >>> >>> On Tue, Jan 18, 2011 at 10:39 AM, Jonathan Tu >>> wrote: >>>> Hi, >>>> >>>> I need to reinstall numpy because the cluster I am using was >>>> recently >>>> overhauled. I am wondering if numpy works with Python 2.7 now. >>>> >>>> Also, I would like numpy to run as fast as possible. The last >>>> time I >>>> did this, I was advised to install ATLAS by hand, as the one that >>>> comes with RHEL is not suitable. The first time I tried this, I >>>> kept >>>> running into problems that I think were due to mismatched fortran >>>> compilers. Is there a good resource for how to do this? I am >>>> fairly >>>> new to Linux. >>>> >>>> >>>> >>>> Thanks, >>>> >>>> >>>> >>>> Jonathan Tu >>>> _______________________________________________ >>>> NumPy-Discussion mailing list >>>> NumPy-Discussion at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From bergstrj at iro.umontreal.ca Tue Jan 18 17:19:28 2011 From: bergstrj at iro.umontreal.ca (James Bergstra) Date: Tue, 18 Jan 2011 17:19:28 -0500 Subject: [Numpy-discussion] numpy.max(0, 1e-6) == 0 (!?) In-Reply-To: References: Message-ID: I find that "numpy.max(0, 1e-6) == 0" is confusing, because it makes bugs hard to spot. The doc says that the second argument to max is an optional integer. My bad. But could the function raise an error if it is passed an invalid 'axis' argument? That would have helped. James -- http://www-etud.iro.umontreal.ca/~bergstrj -------------- next part -------------- An HTML attachment was scrubbed... URL: From vvoznesensky at gmail.com Wed Jan 19 06:02:28 2011 From: vvoznesensky at gmail.com (Vladimir Voznesensky) Date: Wed, 19 Jan 2011 14:02:28 +0300 Subject: [Numpy-discussion] OpenMP-ficated loops Message-ID: <4D36C4C4.6020506@gmail.com> Hello. I've hacked numpy/core/src/umath/loops.c.src to use OMP acceleration. I's a little bit hairy, some things are very suboptimal, but it helps me, so I've decided to share my work as is. Note, that with errstate(...) does not work with my stuff. Use -O2 or less for Intel compiler. Always run test suite after re-compilation. Please, found the hacked file in the attachement. Cheers, VV -------------- next part -------------- A non-text attachment was scrubbed... Name: loops.c.src Type: application/x-wais-source Size: 44812 bytes Desc: not available URL: From cournape at gmail.com Wed Jan 19 09:36:52 2011 From: cournape at gmail.com (David Cournapeau) Date: Wed, 19 Jan 2011 23:36:52 +0900 Subject: [Numpy-discussion] Prime size FFT: bluestein transform vs general chirp/z transform ? 
In-Reply-To: References: Message-ID: On Sun, Jan 16, 2011 at 2:22 PM, Ralf Gommers wrote: > > > On Mon, Jan 3, 2011 at 2:46 PM, David Cournapeau wrote: >> >> Hi, >> >> I finally took the time to clean up my code to speed up prime-size FFT >> (which use a O(N^2) algo in both numpy and scipy). The code is there: >> https://github.com/cournape/numpy/tree/bluestein (most of the code is >> tests, because numpy.fft had almost none). >> >> >> Bottom line: it is used only for prime numbers, and is faster than the >> current code for complex transforms > 500. Because of python + >> inherent bluestein overhead, this is mostly useful for "long" fft >> (where the speed up is significant - already 100x speed up for prime >> size ~ 50000). > > Very nice, works like a charm for me! > >> >> Several comments: >> ?- the overhead is pretty significant (on my machine, bluestein >> transfrom is slower for prime size < 500) >> ?- it could be used as such for real transforms, but the overhead >> would be even more significant (there is no bluestein transform for >> real transforms, so one needs to re-rexpress real transforms in term >> of complex ones, multiplying the overhead by 2x). There are several >> alternatives to make things faster (Rader-like transform, as used by >> fftw), but I think this would be quite hard to do in python without >> significant slowdown, because the code cannot be vectorized. >> ?- one could also decide to provide a chirp-z transform, of which >> Bluestein transform is a special case. Maybe this is more adapted to >> scipy ? > > This is just terminology, but according to Wikipedia the Bluestein transform > is the chirp-z transform, which is a special case of the z-transform. Is > that what you meant? Right, I meant z-transform. > I'm guessing you are talking about code that allows you to use the Bluestein > algorithm also for non-prime sizes where it makes sense, for example to > speed up the second case of this: > > In [24]: x = np.random.random(5879)? # a large prime This is for the real transform case, where the reindexing step requires to find a generator of Z/nZ. As for dealing with non prime sizes, I am not sure what to do: there is the speed issue, but the precision issue as well. I am currently doing some more thorough tests across various sizes to make sure bluestein transforms do not cause loss of precisions cheers, David From jsalvati at u.washington.edu Wed Jan 19 10:43:00 2011 From: jsalvati at u.washington.edu (John Salvatier) Date: Wed, 19 Jan 2011 07:43:00 -0800 Subject: [Numpy-discussion] adding numexpr user-provided ufunc evaluation Message-ID: I am thinking about building a patch to implement user-provided ufunc evaluation for numexpr as was discussed in this thread http://www.mail-archive.com/numpy-discussion at scipy.org/msg26292.html. Does anyone have any advice for learning/understanding the numexpr source and/or suggestions about implementation? All advice is helpful :) Francesc Alted mentioned that Mark Wiebe's other NEP https://github.com/m-paradox/numpy/blob/mw_neps/doc/neps/deferred-ufunc-evaluation.rst, would provide this feature (along with many other benefits). It looks really amazing, but I don't have a good sense of how developed it is. Best Regards, John Salvatier -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jhtu at princeton.edu Wed Jan 19 17:24:41 2011 From: jhtu at princeton.edu (Jonathan Tu) Date: Wed, 19 Jan 2011 17:24:41 -0500 Subject: [Numpy-discussion] MKL libraries can't be found In-Reply-To: References: <91254948-7598-4793-9055-80EC9D427DD6@Princeton.EDU> <08A709B5-3F86-4B28-8E9E-99D711024A62@princeton.edu> Message-ID: Hi, I am trying to install numpy with the MKL libraries available on my cluster. Most of the libraries are available in one directory, but the iomp5 library is in another. /opt/intel/Compiler/11.1/072/mkl/lib/em64t ---> mkl_intel_lp64, mkl_intel_thread, mkl_core, mkl_def, mkl_mc /opt/intel/Compiler/11.1/072/lib/intel64 ---> iomp5 Using an older MKL library that was available, I found that when all libraries are in one directory, the install went through fine. But in this case it says the libraries cannot be found, even if I list both under the library_dirs in site.cfg [mkl] library_dirs = /opt/intel/Compiler/11.1/072/mkl/lib/em64t:/opt/intel/Compiler/11.1/072/lib/intel64 include_dirs = /opt/intel/Compiler/11.1/072/mkl/include lapack_libs = mkl_lapack mkl_libs = mkl_intel_lp64, mkl_intel_thread, mkl_core, mkl_def, mkl_mc, iomp5 If I try to install without iomp5, then when I import numpy I get the following error /opt/intel/Compiler/11.1/072/mkl/lib/em64t/libmkl_intel_thread.so: undefined symbol: omp_in_parallel Any ideas? I tried to put symbolic links to both library directories in one place, but that didn't work either. I'm trying to avoid creating a directory of symbolic links to every necessary library. Jonathan Tu From jhtu at princeton.edu Wed Jan 19 18:29:59 2011 From: jhtu at princeton.edu (Jonathan Tu) Date: Wed, 19 Jan 2011 18:29:59 -0500 Subject: [Numpy-discussion] Tests Fail with MKL Installed In-Reply-To: References: <91254948-7598-4793-9055-80EC9D427DD6@Princeton.EDU> <08A709B5-3F86-4B28-8E9E-99D711024A62@princeton.edu> Message-ID: Hi, I installed numpy with MKL and found that the unit tests fail. In particular, I get the error message FAIL: test_special_values (test_umath_complex.TestClog) It says that this is a "known failure," specifically KNOWNFAIL=4. Is this ok? I saw from Googling that "The test failure indicates that your platform has a non-C99 compliant implementation of clog. Not fatal, but the test should be marked as a known failure on the platform." I'm not sure what this means. Would it be safer for my work to use a package w/o MKL that passes the tests? I'm currently benchmarking to see what the slowdown would really be. Jonathan Tu From cournape at gmail.com Wed Jan 19 19:59:12 2011 From: cournape at gmail.com (David Cournapeau) Date: Thu, 20 Jan 2011 09:59:12 +0900 Subject: [Numpy-discussion] Prime size FFT: bluestein transform vs general chirp/z transform ? In-Reply-To: References: Message-ID: On Tue, Jan 18, 2011 at 3:12 AM, Neal Becker wrote: > I just took a look at > > http://www.katjaas.nl/chirpZ/chirpZ2.html > > I'm VERY interested in the zoom. ?Does the code > > https://github.com/cournape/numpy/tree/bluestein > > implement the zoom feature? No - the current code is only an implementation optimization to deal with fft size which cannot be factorized in small factors. 
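For anyone curious about the underlying idea: the following is not the code
in the branch, just a generic pure-numpy sketch of how Bluestein/chirp-z
re-expresses an arbitrary-length DFT as a convolution that can be evaluated
with power-of-two FFTs. The helper name bluestein_fft, the padding choice
and the k^2 mod 2n reduction are illustrative assumptions, not taken from
the branch.

import numpy as np

def bluestein_fft(x):
    # DFT of arbitrary length n, rewritten as a convolution with a chirp
    # and evaluated with power-of-two FFTs (Bluestein / chirp-z idea).
    x = np.asarray(x, dtype=complex)
    n = len(x)
    k = np.arange(n)
    # chirp exp(-i*pi*k^2/n); k^2 is reduced mod 2n to limit rounding error
    w = np.exp(-1j * np.pi * (k * k % (2 * n)) / n)
    m = 1 << (2 * n - 2).bit_length()     # power of two >= 2*n - 1
    a = np.zeros(m, dtype=complex)
    a[:n] = x * w
    b = np.zeros(m, dtype=complex)
    b[:n] = np.conj(w)
    b[m - n + 1:] = np.conj(w[:0:-1])     # wrap-around part of the chirp
    conv = np.fft.ifft(np.fft.fft(a) * np.fft.fft(b))
    return w * conv[:n]

# quick check against numpy's own (direct, O(n^2)) path for a prime size
x = np.random.randn(17) + 1j * np.random.randn(17)
print np.allclose(bluestein_fft(x), np.fft.fft(x))

Because the padded length m only has to reach 2*n - 1, the whole transform
costs a few power-of-two FFTs instead of the O(n^2) direct sum, which is
where the speed-up for large prime sizes comes from.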
cheers, David From charlesr.harris at gmail.com Wed Jan 19 20:26:00 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 19 Jan 2011 18:26:00 -0700 Subject: [Numpy-discussion] Tests Fail with MKL Installed In-Reply-To: References: <91254948-7598-4793-9055-80EC9D427DD6@Princeton.EDU> <08A709B5-3F86-4B28-8E9E-99D711024A62@princeton.edu> Message-ID: On Wed, Jan 19, 2011 at 4:29 PM, Jonathan Tu wrote: > Hi, > > I installed numpy with MKL and found that the unit tests fail. In > particular, I get the error message > > FAIL: test_special_values (test_umath_complex.TestClog) > > It says that this is a "known failure," specifically KNOWNFAIL=4. Is this > ok? I saw from Googling that "The test failure indicates that your platform > has a non-C99 compliant implementation of clog. Not fatal, but the test > should be marked as a known failure on the platform." > > I'm not sure what this means. Would it be safer for my work to use a > package w/o MKL that passes the tests? I'm currently benchmarking to see > what the slowdown would really be. > > > Don't worry about it, the test that failed is a corner case. Few, if any, libraries are fully c99 compliant for corner cases. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From opossumnano at gmail.com Thu Jan 20 05:11:38 2011 From: opossumnano at gmail.com (Tiziano Zito) Date: Thu, 20 Jan 2011 11:11:38 +0100 Subject: [Numpy-discussion] [Ann] EuroScipy 2011 - Call for papers Message-ID: <20110120101138.GE31049@tulpenbaum.cognition.tu-berlin.de> ========================= Announcing EuroScipy 2011 ========================= --------------------------------------------- The 4th European meeting on Python in Science --------------------------------------------- **Paris, Ecole Normale Sup?rieure, August 25-28 2011** We are happy to announce the 4th EuroScipy meeting, in Paris, August 2011. The EuroSciPy meeting is a cross-disciplinary gathering focused on the use and development of the Python language in scientific research. This event strives to bring together both users and developers of scientific tools, as well as academic research and state of the art industry. Main topics =========== - Presentations of scientific tools and libraries using the Python language, including but not limited to: - vector and array manipulation - parallel computing - scientific visualization - scientific data flow and persistence - algorithms implemented or exposed in Python - web applications and portals for science and engineering. - Reports on the use of Python in scientific achievements or ongoing projects. - General-purpose Python tools that can be of special interest to the scientific community. Tutorials ========= There will be two tutorial tracks at the conference, an introductory one, to bring up to speed with the Python language as a scientific tool, and an advanced track, during which experts of the field will lecture on specific advanced topics such as advanced use of numpy, scientific visualization, software engineering... Keynote Speaker: Fernando Perez =============================== We are excited to welcome Fernando Perez (UC Berkeley, Helen Wills Neuroscience Institute, USA) as our keynote speaker. Fernando Perez is the original author of the enhanced interactive python shell IPython and a very active contributor to the Python for Science ecosystem. 
Important dates =============== Talk submission deadline: Sunday May 8 Program announced: Sunday May 29 Tutorials tracks: Thursday August 25 - Friday August 26 Conference track: Saturday August 27 - Sunday August 28 Call for papers =============== We are soliciting talks that discuss topics related to scientific computing using Python. These include applications, teaching, future development directions, and research. We welcome contributions from the industry as well as the academic world. Indeed, industrial research and development as well academic research face the challenge of mastering IT tools for exploration, modeling and analysis. We look forward to hearing your recent breakthroughs using Python! Submission guidelines ===================== - We solicit talk proposals in the form of a one-page long abstract. - Submissions whose main purpose is to promote a commercial product or service will be refused. - All accepted proposals must be presented at the EuroSciPy conference by at least one author. The one-page long abstracts are for conference planing and selection purposes only. We will later select papers for publication of post-proceedings in a peer-reviewed journal. How to submit an abstract ========================= To submit a talk to the EuroScipy conference follow the instructions here: http://www.euroscipy.org/card/euroscipy2011_call_for_papers Organizers ========== Chairs: - Ga?l Varoquaux (INSERM, Unicog team, and INRIA, Parietal team) - Nicolas Chauvat (Logilab) Local organization committee: - Emmanuelle Gouillart (Saint-Gobain Recherche) - Jean-Philippe Chauvat (Logilab) Tutorial chair: - Valentin Haenel (MKP, Technische Universit?t Berlin) Program committee: - Chair: Tiziano Zito (MKP, Technische Universit?t Berlin) - Romain Brette (ENS Paris, DEC) - Emmanuelle Gouillart (Saint-Gobain Recherche) - Eric Lebigot (Laboratoire Kastler Brossel, Universit? Pierre et Marie Curie) - Konrad Hinsen (Soleil Synchrotron, CNRS) - Hans Petter Langtangen (Simula laboratories) - Jarrod Millman (UC Berkeley, Helen Wills NeuroScience institute) - Mike M?ller (Python Academy) - Didrik Pinte (Enthought Inc) - Marc Poinot (ONERA) - Christophe Pradal (CIRAD/INRIA, Virtual Plantes team) - Andreas Schreiber (DLR) - St?fan van der Walt (University of Stellenbosch) Website ======= http://www.euroscipy.org/conference/euroscipy_2011 From dagss at student.matnat.uio.no Thu Jan 20 08:22:31 2011 From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn) Date: Thu, 20 Jan 2011 14:22:31 +0100 Subject: [Numpy-discussion] Cython workshop Message-ID: <4D383717.7060207@student.matnat.uio.no> As we have funding for it, we're talking about organizing a Cython workshop sometimes this year (possibly in Munich, Germany, though it's not decided yet). It's still not clear how user-centric vs. developer-centric the workshop will be, or how strong a role numerical computation will have vs. more general language features. We're just getting in touch with people potentially interested in joining the workshop, and then we'll take it from there. Respond on the wiki page or on cython-dev. http://wiki.cython.org/workshop1 Dag Sverre -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at googlemail.com Sat Jan 22 07:28:47 2011 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Sat, 22 Jan 2011 20:28:47 +0800 Subject: [Numpy-discussion] 64 bit Windows installers for NumPy? 
In-Reply-To: References: Message-ID: On Wed, Jan 19, 2011 at 1:24 AM, Peter < numpy-discussion at maubp.freeserve.co.uk> wrote: > On Tue, Jan 18, 2011 at 1:02 PM, Charles R Harris > wrote: > > I believe the problem has been been 64 bit fortran for ATLAS, the mingw > > version has/had problems. A plain build using the MS compilers works fine > > without ATLAS. > > > > Chuck > > Do you think there would be interest/demand for official non-ATLAS > binaries as a short term solution? > I doubt this would add much to what's currently available unofficially. > > I'm thinking also of 3rd party Python libraries that use NumPy, and if > they/we can ship a win64 installer if NumPy doesn't. > This is no problem of course. If I were you though, I would first consider if it's not better to refer your users to the Enthought version, or the builds provided by Christoph Gohlke for example. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From numpy-discussion at maubp.freeserve.co.uk Sat Jan 22 09:20:43 2011 From: numpy-discussion at maubp.freeserve.co.uk (Peter) Date: Sat, 22 Jan 2011 14:20:43 +0000 Subject: [Numpy-discussion] 64 bit Windows installers for NumPy? In-Reply-To: References: Message-ID: On Sat, Jan 22, 2011 at 12:28 PM, Ralf Gommers wrote: > > On Wed, Jan 19, 2011 at 1:24 AM, Peter > wrote: >> >> On Tue, Jan 18, 2011 at 1:02 PM, Charles R Harris >> wrote: >> > I believe the problem has been been 64 bit fortran for ATLAS, the mingw >> > version has/had problems. A plain build using the MS compilers works >> > fine without ATLAS. >> > >> > Chuck >> >> Do you think there would be interest/demand for official non-ATLAS >> binaries as a short term solution? > > I doubt this would add much to what's currently available unofficially. But it would be "official", which counts for something - especially in commercial setting. >> >> I'm thinking also of 3rd party Python libraries that use NumPy, and if >> they/we can ship a win64 installer if NumPy doesn't. > > This is no problem of course. If I were you though, I would first consider > if it's not better to refer your users to the Enthought version, or the > builds provided by Christoph Gohlke for example. We're currently pointing people on 64 bit Windows towards Christoph Gohlke's unofficial builds. I'd be quite happy if Christoph's 64bit NumPy installer was blessed as official and distributed via the NumPy website (but there may be technical issues I'm unaware of). Peter From ralf.gommers at googlemail.com Sat Jan 22 12:43:42 2011 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Sun, 23 Jan 2011 01:43:42 +0800 Subject: [Numpy-discussion] 64 bit Windows installers for NumPy? In-Reply-To: References: Message-ID: On Sat, Jan 22, 2011 at 10:20 PM, Peter < numpy-discussion at maubp.freeserve.co.uk> wrote: > On Sat, Jan 22, 2011 at 12:28 PM, Ralf Gommers > wrote: > > > > On Wed, Jan 19, 2011 at 1:24 AM, Peter > > wrote: > >> > >> On Tue, Jan 18, 2011 at 1:02 PM, Charles R Harris > >> wrote: > >> > I believe the problem has been been 64 bit fortran for ATLAS, the > mingw > >> > version has/had problems. A plain build using the MS compilers works > >> > fine without ATLAS. > >> > > >> > Chuck > >> > >> Do you think there would be interest/demand for official non-ATLAS > >> binaries as a short term solution? > > > > I doubt this would add much to what's currently available unofficially. > > But it would be "official", which counts for something - especially in > commercial setting. 
> > >> > >> I'm thinking also of 3rd party Python libraries that use NumPy, and if > >> they/we can ship a win64 installer if NumPy doesn't. > > > > This is no problem of course. If I were you though, I would first > consider > > if it's not better to refer your users to the Enthought version, or the > > builds provided by Christoph Gohlke for example. > > We're currently pointing people on 64 bit Windows towards Christoph > Gohlke's unofficial builds. I'd be quite happy if Christoph's 64bit NumPy > installer was blessed as official and distributed via the NumPy website > (but there may be technical issues I'm unaware of). > The plain builds don't work with scipy as I think you know, which IMHO means they should not be official. The MKL ones should not be official because they're non-free. That said, if others feel that plain official builds are useful *and* someone steps up to create and troubleshoot them, then of course that's fine with me. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From numpy-discussion at maubp.freeserve.co.uk Sat Jan 22 14:26:23 2011 From: numpy-discussion at maubp.freeserve.co.uk (Peter) Date: Sat, 22 Jan 2011 19:26:23 +0000 Subject: [Numpy-discussion] 64 bit Windows installers for NumPy? In-Reply-To: References: Message-ID: On Sat, Jan 22, 2011 at 5:43 PM, Ralf Gommers wrote: > > On Sat, Jan 22, 2011 at 10:20 PM, Peter wrote: >> We're currently pointing people on 64 bit Windows towards Christoph >> Gohlke's unofficial builds. I'd be quite happy if Christoph's 64bit NumPy >> installer was blessed as official and distributed via the NumPy website >> (but there may be technical issues I'm unaware of). > > The plain builds don't work with scipy as I think you know, which IMHO > means they should not be official. I see (on closer inspection) that Christoph only has 64bit SciPy for the Intel MKL version of 64bit NumPy (not the plain version), which is presumably what you are refering to? Is this down to some problem in SciPy or NumPy (or both)? I'm not familiar with the problem. > > The MKL ones should not be official because they're non-free. > Of course - Intel's licensing must be respected there. > That said, if others feel that plain official builds are useful *and* > someone steps up to create and troubleshoot them, then of > course that's fine with me. > > Cheers, > Ralf :) Peter From cournape at gmail.com Sat Jan 22 19:34:02 2011 From: cournape at gmail.com (David Cournapeau) Date: Sun, 23 Jan 2011 09:34:02 +0900 Subject: [Numpy-discussion] 64 bit Windows installers for NumPy? In-Reply-To: References: Message-ID: On Sun, Jan 23, 2011 at 4:26 AM, Peter wrote: > > I see (on closer inspection) that Christoph only has 64bit SciPy for > the Intel MKL version of 64bit NumPy (not the plain version), which > is presumably what you are refering to? Is this down to some problem > in SciPy or NumPy (or both)? I'm not familiar with the problem. The main issue is that there is no easy way to get a working implementation of blas/lapack for windows 64 bits, nor a working, freely available fortran compiler. This is not so much a limitation of numpy/scipy, but rather one of lack of open source tools maturity on that platform. As Intel license requires developers to get a license of their tools, that's not an acceptable solution for the official version of numpy/scipy. 
cheers, David From ischnell at enthought.com Sat Jan 22 20:19:12 2011 From: ischnell at enthought.com (Ilan Schnell) Date: Sat, 22 Jan 2011 19:19:12 -0600 Subject: [Numpy-discussion] 64 bit Windows installers for NumPy? In-Reply-To: References: Message-ID: But if you have an Intel license, you are allows to redistribute the MKL runtime. This is the reason why Enthought can distribute EPD which includes numpy, scipy and numexpr linked against MKL. The next EPD 7.0, will include Python 2.7.1 and numpy 1.5.1 and be released on February 8. - Ilan On Sat, Jan 22, 2011 at 6:34 PM, David Cournapeau wrote: > On Sun, Jan 23, 2011 at 4:26 AM, Peter > wrote: > >> >> I see (on closer inspection) that Christoph only has 64bit SciPy for >> the Intel MKL version of 64bit NumPy (not the plain version), which >> is presumably what you are refering to? Is this down to some problem >> in SciPy or NumPy (or both)? I'm not familiar with the problem. > > The main issue is that there is no easy way to get a working > implementation of blas/lapack for windows 64 bits, nor a working, > freely available fortran compiler. > > This is not so much a limitation of numpy/scipy, but rather one of > lack of open source tools maturity on that platform. As Intel license > requires developers to get a license of their tools, that's not an > acceptable solution for the official version of numpy/scipy. > > cheers, > > David > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From vvoznesensky at gmail.com Sun Jan 23 03:58:34 2011 From: vvoznesensky at gmail.com (Vladimir Voznesensky) Date: Sun, 23 Jan 2011 11:58:34 +0300 Subject: [Numpy-discussion] OpenMP-fication of loops. Message-ID: <4D3BEDBA.1040401@gmail.com> Hello. I've hacked loops.c.src to use OpenMP. Is anybody here interested in my hacks? Cheers. - VV From tmp50 at ukr.net Sun Jan 23 05:00:38 2011 From: tmp50 at ukr.net (Dmitrey) Date: Sun, 23 Jan 2011 12:00:38 +0200 Subject: [Numpy-discussion] Is numpy/scipy linux apt or PYPI installation linked with ACML? Message-ID: Hi all, I have AMD processor and I would like to get to know what's the easiest way to install numpy/scipy linked with ACML. Is it possible to link linux apt or PYPI installation linked with ACML? Answer for the same question about MKL also would be useful, however, AFAIK it has commercial license and thus can't be handled in the ways. Thank you in advance, D. -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Sun Jan 23 05:07:29 2011 From: cournape at gmail.com (David Cournapeau) Date: Sun, 23 Jan 2011 19:07:29 +0900 Subject: [Numpy-discussion] Is numpy/scipy linux apt or PYPI installation linked with ACML? In-Reply-To: References: Message-ID: 2011/1/23 Dmitrey : > Hi all, > I have AMD processor and I would like to get to know what's the easiest way > to install numpy/scipy linked with ACML. > Is it possible to link linux apt or PYPI installation linked with ACML? > Answer for the same question about MKL also would be useful, however, AFAIK > it has commercial license and thus can't be handled in the ways. For the MKL, the easiest solution is to get EPD, or to build numpy/scipy by yourself, although the later is not that easy. For ACML, I don't know how difficult it is, but I would be surprised if it worked out of the box. 
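A rough way to check what any given numpy build is actually linked against is numpy's own build-time record. A minimal sketch, assuming nothing beyond a standard numpy install (the exact sections printed depend on the build; for a from-source build the detection below is usually steered by a site.cfg file, e.g. an [mkl] section pointing at the MKL libraries):

    import numpy as np
    import numpy.distutils.system_info as sysinfo

    # BLAS/LAPACK information recorded when this numpy was built
    # (shows e.g. ATLAS, MKL, or the fallback lapack_lite).
    np.show_config()

    # What numpy.distutils can detect right now on this machine; this is
    # what a from-source build would pick up at compile time.
    print(sysinfo.get_info('blas_opt'))
    print(sysinfo.get_info('lapack_opt'))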
cheers, David

From tmp50 at ukr.net Sun Jan 23 05:27:35 2011
From: tmp50 at ukr.net (Dmitrey)
Date: Sun, 23 Jan 2011 12:27:35 +0200
Subject: [Numpy-discussion] Is numpy/scipy linux apt or PYPI installation linked with ACML?
In-Reply-To: References: Message-ID:

Are free EPD distributions linked with MKL and ACML?
Does anyone know whether SAGE or PythonXY is already linked with ACML or MKL?

Thanks, D.

--- Original message ---
From: "David Cournapeau"
To: "Discussion of Numerical Python"
Date: 23 January 2011, 12:07:29
Subject: Re: [Numpy-discussion] Is numpy/scipy linux apt or PYPI installation linked with ACML?

2011/1/23 Dmitrey < tmp50 at ukr.net >:
> Hi all,
> I have AMD processor and I would like to get to know what's the easiest way
> to install numpy/scipy linked with ACML.
> Is it possible to link linux apt or PYPI installation linked with ACML?
> Answer for the same question about MKL also would be useful, however, AFAIK
> it has commercial license and thus can't be handled in the ways.

For the MKL, the easiest solution is to get EPD, or to build numpy/scipy by yourself, although the later is not that easy. For ACML, I don't know how difficult it is, but I would be surprised if it worked out of the box.

cheers, David
_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion at scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion

From ecarlson at eng.ua.edu Sun Jan 23 09:54:22 2011
From: ecarlson at eng.ua.edu (Eric Carlson)
Date: Sun, 23 Jan 2011 08:54:22 -0600
Subject: [Numpy-discussion] OpenMP-fication of loops.
In-Reply-To: <4D3BEDBA.1040401@gmail.com> References: <4D3BEDBA.1040401@gmail.com> Message-ID:

As a user, I am very interested. That said, do you have any tests or examples or benchmarks that give a ballpark estimate of performance improvements?

From vvoznesensky at gmail.com Sun Jan 23 11:36:17 2011
From: vvoznesensky at gmail.com (Vladimir Voznesensky)
Date: Sun, 23 Jan 2011 19:36:17 +0300
Subject: [Numpy-discussion] OpenMP-fication of loops.
In-Reply-To: References: <4D3BEDBA.1040401@gmail.com> Message-ID: <4D3C5901.8000900@gmail.com>

My computer has 12 hyperthreaded cores.
My application uses dot multiplication from Intel MKL, that accelerated it by ~ 5 times.
After OpenMP-fication of loops.c.src, my app was accelerated by ~12-15 times.
So, it hardly depends on your computer ;) .
2ALL: How could I propagate the hacked file to the interested parties?

Eric Carlson writes:
> As a user, I am very interested. That said, do you have any tests or examples
> or benchmarks that give a ballpark estimate of performance improvements?
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From ischnell at enthought.com Sun Jan 23 11:41:48 2011
From: ischnell at enthought.com (Ilan Schnell)
Date: Sun, 23 Jan 2011 10:41:48 -0600
Subject: [Numpy-discussion] Is numpy/scipy linux apt or PYPI installation linked with ACML?
In-Reply-To: References: Message-ID:

Yes, also the free academic EPDs are linked to MKL.
I don't know about sage, but probably not.

- Ilan

2011/1/23 Dmitrey :
> Are free EPD distributions linked with MKL and ACML?
> Does anyone know whether SAGE or PythonXY is already linked with ACML or MKL?
>
> Thanks, D.
>
> --- ???????? ????????? ---
> ?? ????: "David Cournapeau"
> ????: "Discussion of Numerical Python"
> ????: 23 ??????
2011, 12:07:29 > ????: Re: [Numpy-discussion] Is numpy/scipy linux apt or PYPI installation > linked with ACML? > > > > 2011/1/23 Dmitrey : >> Hi all, >> I have AMD processor and I would like to get to know what's the easiest >> way >> to install numpy/scipy linked with ACML. >> Is it possible to link linux apt or PYPI installation linked with ACML? >> Answer for the same question about MKL also would be useful, however, >> AFAIK >> it has commercial license and thus can't be handled in the ways. > > For the MKL, the easiest solution is to get EPD, or to build > numpy/scipy by yourself, although the later is not that easy. For > ACML, I don't know how difficult it is, but I would be surprised if it > worked out of the box. > > cheers, > > David > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > From matthieu.brucher at gmail.com Sun Jan 23 11:49:05 2011 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Sun, 23 Jan 2011 17:49:05 +0100 Subject: [Numpy-discussion] Is numpy/scipy linux apt or PYPI installation linked with ACML? In-Reply-To: References: Message-ID: I think the main issue is that ACML didn't have an official CBLAS interface, so you have to check if they provide one now. If thy do, it should be almost easy to link against it. Matthieu 2011/1/23 David Cournapeau > 2011/1/23 Dmitrey : > > Hi all, > > I have AMD processor and I would like to get to know what's the easiest > way > > to install numpy/scipy linked with ACML. > > Is it possible to link linux apt or PYPI installation linked with ACML? > > Answer for the same question about MKL also would be useful, however, > AFAIK > > it has commercial license and thus can't be handled in the ways. > > For the MKL, the easiest solution is to get EPD, or to build > numpy/scipy by yourself, although the later is not that easy. For > ACML, I don't know how difficult it is, but I would be surprised if it > worked out of the box. > > cheers, > > David > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Information System Engineer, Ph.D. Blog: http://matt.eifelle.com LinkedIn: http://www.linkedin.com/in/matthieubrucher -------------- next part -------------- An HTML attachment was scrubbed... URL: From ecarlson at eng.ua.edu Sun Jan 23 15:52:55 2011 From: ecarlson at eng.ua.edu (Eric Carlson) Date: Sun, 23 Jan 2011 14:52:55 -0600 Subject: [Numpy-discussion] OpenMP-fication of loops. In-Reply-To: <4D3C5901.8000900@gmail.com> References: <4D3BEDBA.1040401@gmail.com> <4D3C5901.8000900@gmail.com> Message-ID: On 1/23/2011 10:36 AM, Vladimir Voznesensky wrote: > My computer has 12 hyperthreaded cores. > My application uses dot multiplication from Intel MKL, that accelerated > it by ~ 5 times. > After OpenMP-fication of loops.c.src, my app was accelerated by ~12-15 > times. > I was greatly disappointed in the parallel performance on a new workstation for some of my programs. I could not get better than about a factor of 5 on my dual xeon with 24 threads. 
Last Fall, I stumbled across this example of OpenMP with f2py, https://gist.github.com/226473 that I built on Ubuntu 10.04 x64 using (slightly different than the instructions): f2py -c -m deemingomp periodogram.f90 --f90flags="-fopenmp " -lgomp -lf77blas -lcblas -latlas On my machine for larger array sizes, I saw speed-ups of 20x over single thread in the example program. Indeed, the example serves as an excellent way to test the thermal stability of the workstation. I did not get a chance to follow this up yet, but if you can get 12x improvement with normal numpy codes, I am very interested... Cheers, EC From vvoznesensky at gmail.com Sun Jan 23 15:57:04 2011 From: vvoznesensky at gmail.com (Vladimir Voznesensky) Date: Sun, 23 Jan 2011 23:57:04 +0300 Subject: [Numpy-discussion] OpenMP-fication of loops. In-Reply-To: References: <4D3BEDBA.1040401@gmail.com> <4D3C5901.8000900@gmail.com> Message-ID: <4D3C9620.3060105@gmail.com> Dear Eric, Sure, I will give you my code, but who will "follow this up"? Eric Carlson ?????: > On 1/23/2011 10:36 AM, Vladimir Voznesensky wrote: > >> My computer has 12 hyperthreaded cores. >> My application uses dot multiplication from Intel MKL, that accelerated >> it by ~ 5 times. >> After OpenMP-fication of loops.c.src, my app was accelerated by ~12-15 >> times. >> >> > > I was greatly disappointed in the parallel performance on a new > workstation for some of my programs. I could not get better than about a > factor of 5 on my dual xeon with 24 threads. > > Last Fall, I stumbled across this example of OpenMP with f2py, > > https://gist.github.com/226473 > > that I built on Ubuntu 10.04 x64 using (slightly different than the > instructions): > > f2py -c -m deemingomp periodogram.f90 --f90flags="-fopenmp " -lgomp > -lf77blas -lcblas -latlas > > On my machine for larger array sizes, I saw speed-ups of 20x over single > thread in the example program. Indeed, the example serves as an > excellent way to test the thermal stability of the workstation. > > I did not get a chance to follow this up yet, but if you can get 12x > improvement with normal numpy codes, I am very interested... > > Cheers, > EC > > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ecarlson at eng.ua.edu Sun Jan 23 16:49:37 2011 From: ecarlson at eng.ua.edu (Eric Carlson) Date: Sun, 23 Jan 2011 15:49:37 -0600 Subject: [Numpy-discussion] OpenMP-fication of loops. In-Reply-To: <4D3C9620.3060105@gmail.com> References: <4D3BEDBA.1040401@gmail.com> <4D3C5901.8000900@gmail.com> <4D3C9620.3060105@gmail.com> Message-ID: >On 1/23/2011 2:57 PM, Vladimir Voznesensky wrote: > > Sure, I will give you my code, but who will "follow this up"? > Hey Vladimir, A good question. At this point, I am most curious about the difficulties of using this as a standard built into numpy. EC From charlesr.harris at gmail.com Sun Jan 23 19:48:54 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 23 Jan 2011 17:48:54 -0700 Subject: [Numpy-discussion] OpenMP-fication of loops. In-Reply-To: <4D3C9620.3060105@gmail.com> References: <4D3BEDBA.1040401@gmail.com> <4D3C5901.8000900@gmail.com> <4D3C9620.3060105@gmail.com> Message-ID: On Sun, Jan 23, 2011 at 1:57 PM, Vladimir Voznesensky < vvoznesensky at gmail.com> wrote: > Dear Eric, > > Sure, I will give you my code, but who will "follow this up"? 
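For anyone who just wants a ballpark figure for their own machine before diving into OpenMP builds, a crude timing harness along these lines (plain numpy only; the thread count is assumed to be set externally, e.g. via OMP_NUM_THREADS or MKL_NUM_THREADS, before starting Python) is one way to compare single- and multi-threaded runs:

    import timeit
    import numpy as np

    # Time a BLAS-backed matrix product at a few sizes; run once with
    # OMP_NUM_THREADS=1 (or MKL_NUM_THREADS=1) and once without the limit,
    # then compare the timings.
    for n in (500, 1000, 2000):
        a = np.random.randn(n, n)
        t = timeit.timeit(lambda: np.dot(a, a), number=10)
        print("n=%d: %.3f s for 10 dot products" % (n, t))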
> > I suggest you start with an account on github and put your modified code in a branch. You can then post a link, and if things go well, maybe you can post a pull request to numpy at some point. At the moment there is a fair amount of churn in the pipeline for ufuncs and so you might have to wait until mid summer, but I think a lot of folks will be interested in how to speed things up at the loop level. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sun Jan 23 20:09:11 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 23 Jan 2011 18:09:11 -0700 Subject: [Numpy-discussion] OpenMP-fication of loops. In-Reply-To: References: <4D3BEDBA.1040401@gmail.com> <4D3C5901.8000900@gmail.com> <4D3C9620.3060105@gmail.com> Message-ID: On Sun, Jan 23, 2011 at 5:48 PM, Charles R Harris wrote: > > > On Sun, Jan 23, 2011 at 1:57 PM, Vladimir Voznesensky < > vvoznesensky at gmail.com> wrote: > >> Dear Eric, >> >> Sure, I will give you my code, but who will "follow this up"? >> >> > I suggest you start with an account on github and put your modified code in > a branch. You can then post a link, and if things go well, maybe you can > post a pull request to numpy at some point. At the moment there is a fair > amount of churn in the pipeline for ufuncs and so you might have to wait > until mid summer, but I think a lot of folks will be interested in how to > speed things up at the loop level. > > > > More explicit instructions on setting up a github account are are here, look at the git for development section. Once you have things up on github you can post a link to the branch, that will make it easy for folks to download and test your code. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From vvoznesensky at gmail.com Sun Jan 23 23:55:15 2011 From: vvoznesensky at gmail.com (Vladimir Voznesensky) Date: Mon, 24 Jan 2011 07:55:15 +0300 Subject: [Numpy-discussion] OpenMP-fication of loops. In-Reply-To: References: <4D3BEDBA.1040401@gmail.com> <4D3C5901.8000900@gmail.com> <4D3C9620.3060105@gmail.com> Message-ID: <4D3D0633.6080205@gmail.com> Sorry, the cross is too heavy for me. I must feed my family. For people who would and could try my hack and do this git matter: please, write me a letter. Charles R Harris ?????: > > > On Sun, Jan 23, 2011 at 5:48 PM, Charles R Harris > > wrote: > > > > On Sun, Jan 23, 2011 at 1:57 PM, Vladimir Voznesensky > > wrote: > > Dear Eric, > > Sure, I will give you my code, but who will "follow this up"? > > > I suggest you start with an account on github and put your > modified code in a branch. You can then post a link, and if things > go well, maybe you can post a pull request to numpy at some point. > At the moment there is a fair amount of churn in the pipeline for > ufuncs and so you might have to wait until mid summer, but I think > a lot of folks will be interested in how to speed things up at the > loop level. > > > > > More explicit instructions on setting up a github account are are here > , look at the git for > development section. Once you have things up on github you can post a > link to the branch, that will make it easy for folks to download and > test your code. 
> > Chuck > > ------------------------------------------------------------------------ > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cool-rr at cool-rr.com Mon Jan 24 07:22:08 2011 From: cool-rr at cool-rr.com (cool-RR) Date: Mon, 24 Jan 2011 14:22:08 +0200 Subject: [Numpy-discussion] How can I install numpy on Python 3.1 in Ubuntu? Message-ID: Hello folks, I have Ubuntu 10.10 server on EC2. I installed Python 3.1, and now I want to install NumPy on it. How do I do it? I tried `easy_install-3.1 numpy` but got this error: [...] RefactoringTool: Refactored /tmp/easy_install-MiUli2/numpy-1.5.1/build/py3k/numpy/core/defchararray.py RefactoringTool: Files that were modified: RefactoringTool: /tmp/easy_install-MiUli2/numpy-1.5.1/build/py3k/numpy/compat/py3k.py RefactoringTool: /tmp/easy_install-MiUli2/numpy-1.5.1/build/py3k/numpy/core/defchararray.py Running from numpy source directory.Traceback (most recent call last): File "/usr/local/bin/easy_install-3.1", line 9, in load_entry_point('distribute==0.6.14', 'console_scripts', 'easy_install-3.1')() File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/command/easy_install.py", line 1855, in main with_ei_usage(lambda: File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/command/easy_install.py", line 1836, in with_ei_usage return f() File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/command/easy_install.py", line 1859, in distclass=DistributionWithoutHelpCommands, **kw File "/usr/lib/python3.1/distutils/core.py", line 149, in setup dist.run_commands() File "/usr/lib/python3.1/distutils/dist.py", line 919, in run_commands self.run_command(cmd) File "/usr/lib/python3.1/distutils/dist.py", line 938, in run_command cmd_obj.run() File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/command/easy_install.py", line 342, in run self.easy_install(spec, not self.no_deps) File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/command/easy_install.py", line 582, in easy_install return self.install_item(spec, dist.location, tmpdir, deps) File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/command/easy_install.py", line 612, in install_item dists = self.install_eggs(spec, download, tmpdir) File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/command/easy_install.py", line 802, in install_eggs return self.build_and_install(setup_script, setup_base) File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/command/easy_install.py", line 1079, in build_and_install self.run_setup(setup_script, setup_base, args) File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/command/easy_install.py", line 1068, in run_setup run_setup(setup_script, args) File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/sandbox.py", line 30, in run_setup lambda: exec(compile(open( File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/sandbox.py", line 71, in run return func() File "/usr/local/lib/python3.1/dist-packages/distribute-0.6.14-py3.1.egg/setuptools/sandbox.py", line 33, in {'__file__':setup_script, '__name__':'__main__'}) File "setup.py", 
line 211, in <module>
File "setup.py", line 204, in setup_package
File "/tmp/easy_install-MiUli2/numpy-1.5.1/build/py3k/numpy/distutils/core.py", line 152, in setup
File "setup.py", line 151, in configuration
File "/tmp/easy_install-MiUli2/numpy-1.5.1/build/py3k/numpy/distutils/misc_util.py", line 972, in add_subpackage
File "/tmp/easy_install-MiUli2/numpy-1.5.1/build/py3k/numpy/distutils/misc_util.py", line 941, in get_subpackage
File "/tmp/easy_install-MiUli2/numpy-1.5.1/build/py3k/numpy/distutils/misc_util.py", line 878, in _get_configuration_from_setup_py
File "numpy/setup.py", line 5, in configuration
File "/tmp/easy_install-MiUli2/numpy-1.5.1/build/py3k/numpy/distutils/misc_util.py", line 713, in __init__
ValueError: 'build/py3k/numpy' is not a directory

What can I do? Thanks, Ram.

From ralf.gommers at googlemail.com Mon Jan 24 08:23:29 2011
From: ralf.gommers at googlemail.com (Ralf Gommers)
Date: Mon, 24 Jan 2011 21:23:29 +0800
Subject: [Numpy-discussion] How can I install numpy on Python 3.1 in Ubuntu?
In-Reply-To: References: Message-ID:

On Mon, Jan 24, 2011 at 8:22 PM, cool-RR wrote:
> Hello folks,
>
> I have Ubuntu 10.10 server on EC2. I installed Python 3.1, and now I want
> to install NumPy on it. How do I do it? I tried `easy_install-3.1 numpy` but
> got this error:
>
Just do "python3.1 setup.py install". That's always a better idea for numpy/scipy than trying to use easy_install. Also you need to make sure some packages are installed first. From http://www.scipy.org/Installing_SciPy/Linux:

sudo apt-get install build-essential python-dev swig gfortran python-nose

Cheers, Ralf

From washakie at gmail.com Mon Jan 24 08:53:29 2011
From: washakie at gmail.com (John)
Date: Mon, 24 Jan 2011 14:53:29 +0100
Subject: [Numpy-discussion] need a better way to fill a grid
Message-ID:

Hello,

I'm trying to cycle over some vectors (lat,lon,emissions) of irregularly spaced lat/lon spots, and values. I need to sum the values each contributing to grid on a regular lat lon grid.

This is what I have presently, but it is too slow. Is there a more efficient way to do this? I would prefer not to create an external module (f2py, cython) unless there is really no way to make this more efficient... it's the looping through the grid I guess that takes so long.

Thanks, john

    def grid_emissions(lon,lat,emissions,grid.dx, grid.dy, grid.outlat0, grid.outlon0, grid.nxmax, grid.nymax):
        """ sample the emissions into a grid to fold into model output
        """

        dx = grid.dxout
        dy = grid.dyout

        # Generate a regular grid to fill with the sum of emissions
        xi = np.linspace(grid.outlon0, grid.outlon0+(grid.nxmax*grid.d), grid.nxmax)
        yi = np.linspace(grid.outlat0, grid.outlat0+(grid.nymax*grid.dy), grid.nymax)

        X, Y = np.meshgrid(yi, xi)
        Z = np.zeros(X.shape)

        for i,x in enumerate(xi):
            for j,y in enumerate(yi):
                Z[i,j] = np.sum( emissions[\
                         np.where(((lat>y-dy) & (lat<y+dy)) & ((lon>x-dx) & (lon<x+dx)))])

        return Z

References: Message-ID:

Hi John,

Since you have a regular grid, you should be able to find the x and y indices without np.where, ie something like

I = (lon-grid.outlon0 / grid.dx).astype(int)
J = (lat-grid.outlat0 / grid.dy).astype(int)

for i, j, e in zip(I, J, emissions):
    Z[i,j] += e

David

On Mon, Jan 24, 2011 at 8:53 AM, John wrote:
> Hello,
>
> I'm trying to cycle over some vectors (lat,lon,emissions) of
> irregularly spaced lat/lon spots, and values.
I need to sum the values > each contributing to grid on a regular lat lon grid. > > This is what I have presently, but it is too slow. Is there a more > efficient way to do this? I would prefer not to create an external > module (f2py, cython) unless there is really no way to make this more > efficient... it's the looping through the grid I guess that takes so > long. > > Thanks, > john > > > > ? ?def grid_emissions(lon,lat,emissions,grid.dx, grid.dy, > grid.outlat0, grid.outlon0, grid.nxmax, grid.nymax): > ? ? ? ?""" sample the emissions into a grid to fold into model output > ? ? ? ?""" > > ? ? ? ?dx = grid.dxout > ? ? ? ?dy = grid.dyout > > ? ? ? ?# Generate a regular grid to fill with the sum of emissions > ? ? ? ?xi = np.linspace(grid.outlon0, > grid.outlon0+(grid.nxmax*grid.d), grid.nxmax) > ? ? ? ?yi = np.linspace(grid.outlat0, > grid.outlat0+(grid.nymax*grid.dy), grid.nymax) > > ? ? ? ?X, Y = np.meshgrid(yi, xi) > ? ? ? ?Z = np.zeros(X.shape) > > ? ? ? ?for i,x in enumerate(xi): > ? ? ? ? ? ?for j,y in enumerate(yi): > ? ? ? ? ? ? ? ?Z[i,j] = np.sum( emissions[\ > ? ? ? ? ? ? ? ? ? ? ? ? np.where(((lat>y-dy) & (lat ((lon>x-dx) & (lon > ? ? ? ?return Z > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From schut at sarvision.nl Mon Jan 24 09:56:15 2011 From: schut at sarvision.nl (Vincent Schut) Date: Mon, 24 Jan 2011 15:56:15 +0100 Subject: [Numpy-discussion] need a better way to fill a grid In-Reply-To: References: Message-ID: On 01/24/2011 02:53 PM, John wrote: > Hello, > > I'm trying to cycle over some vectors (lat,lon,emissions) of > irregularly spaced lat/lon spots, and values. I need to sum the values > each contributing to grid on a regular lat lon grid. > > This is what I have presently, but it is too slow. Is there a more > efficient way to do this? I would prefer not to create an external > module (f2py, cython) unless there is really no way to make this more > efficient... it's the looping through the grid I guess that takes so > long. Use np.histogram2d with weights=emissions, and lat and lon as your x and y to histogram. Choose the bins to match your grid, and it will effectively sum the emission values for each grid cell. Vincent. > > Thanks, > john > > > > def grid_emissions(lon,lat,emissions,grid.dx, grid.dy, > grid.outlat0, grid.outlon0, grid.nxmax, grid.nymax): > """ sample the emissions into a grid to fold into model output > """ > > dx = grid.dxout > dy = grid.dyout > > # Generate a regular grid to fill with the sum of emissions > xi = np.linspace(grid.outlon0, > grid.outlon0+(grid.nxmax*grid.d), grid.nxmax) > yi = np.linspace(grid.outlat0, > grid.outlat0+(grid.nymax*grid.dy), grid.nymax) > > X, Y = np.meshgrid(yi, xi) > Z = np.zeros(X.shape) > > for i,x in enumerate(xi): > for j,y in enumerate(yi): > Z[i,j] = np.sum( emissions[\ > np.where(((lat>y-dy)& (lat ((lon>x-dx)& (lon > return Z From washakie at gmail.com Mon Jan 24 09:57:50 2011 From: washakie at gmail.com (John) Date: Mon, 24 Jan 2011 15:57:50 +0100 Subject: [Numpy-discussion] need a better way to fill a grid In-Reply-To: References: Message-ID: I know we're not supposed to 'broadcast' thanks, but Thanks! This works much more efficiently! 
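A minimal sketch of the histogram2d approach Vincent describes, with the grid parameters written out as plain arguments (stand-ins for the attributes of the `grid` object in the original post):

    import numpy as np

    def grid_emissions_hist(lon, lat, emissions, lon0, lat0, dx, dy, nx, ny):
        # Bin edges of a regular nx-by-ny grid whose lower-left corner is (lon0, lat0).
        lon_edges = lon0 + dx * np.arange(nx + 1)
        lat_edges = lat0 + dy * np.arange(ny + 1)
        # With weights=emissions, each cell receives the sum of the emission
        # values whose (lon, lat) falls inside it, which is the quantity the
        # double loop was computing.
        Z, _, _ = np.histogram2d(lon, lat, bins=[lon_edges, lat_edges],
                                 weights=emissions)
        return Z

For the integer-index variant suggested above, note that the subtraction has to happen before the division, i.e. ((lon - grid.outlon0) / grid.dx).astype(int); otherwise operator precedence divides outlon0 by dx first.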
On Mon, Jan 24, 2011 at 3:50 PM, David Huard wrote: > Hi John, > > Since you have a regular grid, you should be able to find the x and y > indices without np.where, ie something like > > I = (lon-grid.outlon0 / grid.dx).astype(int) > J = (lat-grid.outlat0 / grid.dy).astype(int) > > for i, j, e in zip(I, J, emissions): > ? ?Z[i,j] += e > > > David > > On Mon, Jan 24, 2011 at 8:53 AM, John wrote: >> Hello, >> >> I'm trying to cycle over some vectors (lat,lon,emissions) of >> irregularly spaced lat/lon spots, and values. I need to sum the values >> each contributing to grid on a regular lat lon grid. >> >> This is what I have presently, but it is too slow. Is there a more >> efficient way to do this? I would prefer not to create an external >> module (f2py, cython) unless there is really no way to make this more >> efficient... it's the looping through the grid I guess that takes so >> long. >> >> Thanks, >> john >> >> >> >> ? ?def grid_emissions(lon,lat,emissions,grid.dx, grid.dy, >> grid.outlat0, grid.outlon0, grid.nxmax, grid.nymax): >> ? ? ? ?""" sample the emissions into a grid to fold into model output >> ? ? ? ?""" >> >> ? ? ? ?dx = grid.dxout >> ? ? ? ?dy = grid.dyout >> >> ? ? ? ?# Generate a regular grid to fill with the sum of emissions >> ? ? ? ?xi = np.linspace(grid.outlon0, >> grid.outlon0+(grid.nxmax*grid.d), grid.nxmax) >> ? ? ? ?yi = np.linspace(grid.outlat0, >> grid.outlat0+(grid.nymax*grid.dy), grid.nymax) >> >> ? ? ? ?X, Y = np.meshgrid(yi, xi) >> ? ? ? ?Z = np.zeros(X.shape) >> >> ? ? ? ?for i,x in enumerate(xi): >> ? ? ? ? ? ?for j,y in enumerate(yi): >> ? ? ? ? ? ? ? ?Z[i,j] = np.sum( emissions[\ >> ? ? ? ? ? ? ? ? ? ? ? ? np.where(((lat>y-dy) & (lat> ((lon>x-dx) & (lon> >> ? ? ? ?return Z >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Configuration `````````````````````````` Plone 2.5.3-final, CMF-1.6.4, Zope (Zope 2.9.7-final, python 2.4.4, linux2), Python 2.6 PIL 1.1.6 Mailman 2.1.9 Postfix 2.4.5 Procmail v3.22 2001/09/10 Basemap: 1.0 Matplotlib: 1.0.0 From washakie at gmail.com Mon Jan 24 09:58:45 2011 From: washakie at gmail.com (John) Date: Mon, 24 Jan 2011 15:58:45 +0100 Subject: [Numpy-discussion] need a better way to fill a grid In-Reply-To: References: Message-ID: I will try this as well and report back with a timing... On Mon, Jan 24, 2011 at 3:56 PM, Vincent Schut wrote: > On 01/24/2011 02:53 PM, John wrote: >> Hello, >> >> I'm trying to cycle over some vectors (lat,lon,emissions) of >> irregularly spaced lat/lon spots, and values. I need to sum the values >> each contributing to grid on a regular lat lon grid. >> >> This is what I have presently, but it is too slow. Is there a more >> efficient way to do this? I would prefer not to create an external >> module (f2py, cython) unless there is really no way to make this more >> efficient... it's the looping through the grid I guess that takes so >> long. > > Use np.histogram2d with weights=emissions, and lat and lon as your x and > y to histogram. Choose the bins to match your grid, and it will > effectively sum the emission values for each grid cell. > > Vincent. > >> >> Thanks, >> john >> >> >> >> ? ? 
?def grid_emissions(lon,lat,emissions,grid.dx, grid.dy, >> grid.outlat0, grid.outlon0, grid.nxmax, grid.nymax): >> ? ? ? ? ?""" sample the emissions into a grid to fold into model output >> ? ? ? ? ?""" >> >> ? ? ? ? ?dx = grid.dxout >> ? ? ? ? ?dy = grid.dyout >> >> ? ? ? ? ?# Generate a regular grid to fill with the sum of emissions >> ? ? ? ? ?xi = np.linspace(grid.outlon0, >> grid.outlon0+(grid.nxmax*grid.d), grid.nxmax) >> ? ? ? ? ?yi = np.linspace(grid.outlat0, >> grid.outlat0+(grid.nymax*grid.dy), grid.nymax) >> >> ? ? ? ? ?X, Y = np.meshgrid(yi, xi) >> ? ? ? ? ?Z = np.zeros(X.shape) >> >> ? ? ? ? ?for i,x in enumerate(xi): >> ? ? ? ? ? ? ?for j,y in enumerate(yi): >> ? ? ? ? ? ? ? ? ?Z[i,j] = np.sum( emissions[\ >> ? ? ? ? ? ? ? ? ? ? ? ? ? np.where(((lat>y-dy)& ?(lat> ((lon>x-dx)& ?(lon> >> ? ? ? ? ?return Z > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Configuration `````````````````````````` Plone 2.5.3-final, CMF-1.6.4, Zope (Zope 2.9.7-final, python 2.4.4, linux2), Python 2.6 PIL 1.1.6 Mailman 2.1.9 Postfix 2.4.5 Procmail v3.22 2001/09/10 Basemap: 1.0 Matplotlib: 1.0.0 From washakie at gmail.com Mon Jan 24 10:46:56 2011 From: washakie at gmail.com (John) Date: Mon, 24 Jan 2011 16:46:56 +0100 Subject: [Numpy-discussion] need a better way to fill a grid In-Reply-To: References: Message-ID: Thanks again everyone, Just for completeness. First, there seems to be a problem with my original method, but it must have to do with indexing. Apart from being slow, it reports back maximum values a factor of two greater than the other two methods, so something is amiss there. The other two methods provide identical results in terms of the sums. Original method: ~ 13.3 seconds Pure Python per David: ~ 0.017 seconds Numpy histogramdd per Vincent: ~ 0.007 seconds Thanks, john From jsalvati at u.washington.edu Mon Jan 24 12:47:58 2011 From: jsalvati at u.washington.edu (John Salvatier) Date: Mon, 24 Jan 2011 09:47:58 -0800 Subject: [Numpy-discussion] Numexpr giving randomized results on arrays larger than 2047 elements Message-ID: Hello, I have discovered a strange bug with numexpr. numexpr.evaluate gives randomized results on arrays larger than 2047 elements. The following program demonstrates this: from numpy import * from numexpr import evaluate def func(x): return evaluate("sum(x, axis = 0)") x = zeros(2048)+.01 print evaluate("sum(x, axis = 0)") print evaluate("sum(x, axis = 0)") For me this prints different results each time, for example: 11.67 14.84 If we set the size to 2047 I get consistent results. 20.47 20.47 Interestingly, if I do not add .01 to x, it consistently sums to 0. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsalvati at u.washington.edu Mon Jan 24 13:19:19 2011 From: jsalvati at u.washington.edu (John Salvatier) Date: Mon, 24 Jan 2011 10:19:19 -0800 Subject: [Numpy-discussion] Numexpr giving randomized results on arrays larger than 2047 elements In-Reply-To: References: Message-ID: Forgot to mention that I am using numexpr 1.4.1 and numpy 1.5.1 On Mon, Jan 24, 2011 at 9:47 AM, John Salvatier wrote: > Hello, > > I have discovered a strange bug with numexpr. numexpr.evaluate gives > randomized results on arrays larger than 2047 elements. 
The following > program demonstrates this: > > from numpy import * > from numexpr import evaluate > > def func(x): > > return evaluate("sum(x, axis = 0)") > > > x = zeros(2048)+.01 > > print evaluate("sum(x, axis = 0)") > print evaluate("sum(x, axis = 0)") > > For me this prints different results each time, for example: > > 11.67 > 14.84 > > If we set the size to 2047 I get consistent results. > > 20.47 > 20.47 > > Interestingly, if I do not add .01 to x, it consistently sums to 0. -------------- next part -------------- An HTML attachment was scrubbed... URL: From totonixsame at gmail.com Mon Jan 24 13:23:06 2011 From: totonixsame at gmail.com (totonixsame at gmail.com) Date: Mon, 24 Jan 2011 16:23:06 -0200 Subject: [Numpy-discussion] Numexpr giving randomized results on arrays larger than 2047 elements In-Reply-To: References: Message-ID: I have the same problem here. I'm using numexpr 1.4.1 and numpy 1.3.0. On Mon, Jan 24, 2011 at 4:19 PM, John Salvatier wrote: > Forgot to mention that I am using numexpr 1.4.1 and numpy 1.5.1 > > On Mon, Jan 24, 2011 at 9:47 AM, John Salvatier > wrote: >> >> Hello, >> I have discovered a strange bug with numexpr. numexpr.evaluate gives >> randomized results on arrays larger than 2047 elements. The following >> program demonstrates this: >> >> from numpy import * >> from numexpr import evaluate >> def func(x): >> ?? ?return evaluate("sum(x, axis = 0)") >> >> x = zeros(2048)+.01 >> print evaluate("sum(x, axis = 0)") >> print evaluate("sum(x, axis = 0)") >> >> For me this prints different results each time, for example: >> >> 11.67 >> 14.84 >> >> If we set the size to 2047 I get consistent results. >> >> 20.47 >> 20.47 >> >> Interestingly, if I do not add .01 to x, it consistently sums to 0. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > From warren.weckesser at enthought.com Mon Jan 24 13:23:18 2011 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Mon, 24 Jan 2011 12:23:18 -0600 Subject: [Numpy-discussion] Numexpr giving randomized results on arrays larger than 2047 elements In-Reply-To: References: Message-ID: I see the same "randomness", but at a different array size: In [23]: numpy.__version__ Out[23]: '1.4.0' In [24]: import numexpr In [25]: numexpr.__version__ Out[25]: '1.4.1' In [26]: x = zeros(8192)+0.01 In [27]: print evaluate('sum(x, axis=0)') 72.97 In [28]: print evaluate('sum(x, axis=0)') 66.92 In [29]: print evaluate('sum(x, axis=0)') 67.9 In [30]: x = zeros(8193)+0.01 In [31]: print evaluate('sum(x, axis=0)') 72.63 In [32]: print evaluate('sum(x, axis=0)') 71.74 In [33]: print evaluate('sum(x, axis=0)') 81.93 In [34]: x = zeros(8191)+0.01 In [35]: print evaluate('sum(x, axis=0)') 81.91 In [36]: print evaluate('sum(x, axis=0)') 81.91 Warren On Mon, Jan 24, 2011 at 12:19 PM, John Salvatier wrote: > Forgot to mention that I am using numexpr 1.4.1 and numpy 1.5.1 > > > On Mon, Jan 24, 2011 at 9:47 AM, John Salvatier > wrote: > >> Hello, >> >> I have discovered a strange bug with numexpr. numexpr.evaluate gives >> randomized results on arrays larger than 2047 elements. 
The following >> program demonstrates this: >> >> from numpy import * >> from numexpr import evaluate >> >> def func(x): >> >> return evaluate("sum(x, axis = 0)") >> >> >> x = zeros(2048)+.01 >> >> print evaluate("sum(x, axis = 0)") >> print evaluate("sum(x, axis = 0)") >> >> For me this prints different results each time, for example: >> >> 11.67 >> 14.84 >> >> If we set the size to 2047 I get consistent results. >> >> 20.47 >> 20.47 >> >> Interestingly, if I do not add .01 to x, it consistently sums to 0. > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsalvati at u.washington.edu Mon Jan 24 13:29:41 2011 From: jsalvati at u.washington.edu (John Salvatier) Date: Mon, 24 Jan 2011 10:29:41 -0800 Subject: [Numpy-discussion] Numexpr giving randomized results on arrays larger than 2047 elements In-Reply-To: References: Message-ID: I also get the same issue with prod() On Mon, Jan 24, 2011 at 10:23 AM, Warren Weckesser < warren.weckesser at enthought.com> wrote: > I see the same "randomness", but at a different array size: > > In [23]: numpy.__version__ > Out[23]: '1.4.0' > > In [24]: import numexpr > > In [25]: numexpr.__version__ > Out[25]: '1.4.1' > > In [26]: x = zeros(8192)+0.01 > > In [27]: print evaluate('sum(x, axis=0)') > 72.97 > > In [28]: print evaluate('sum(x, axis=0)') > 66.92 > > In [29]: print evaluate('sum(x, axis=0)') > 67.9 > > In [30]: x = zeros(8193)+0.01 > > In [31]: print evaluate('sum(x, axis=0)') > 72.63 > > In [32]: print evaluate('sum(x, axis=0)') > 71.74 > > In [33]: print evaluate('sum(x, axis=0)') > 81.93 > > In [34]: x = zeros(8191)+0.01 > > In [35]: print evaluate('sum(x, axis=0)') > 81.91 > > In [36]: print evaluate('sum(x, axis=0)') > 81.91 > > > Warren > > > > On Mon, Jan 24, 2011 at 12:19 PM, John Salvatier < > jsalvati at u.washington.edu> wrote: > >> Forgot to mention that I am using numexpr 1.4.1 and numpy 1.5.1 >> >> >> On Mon, Jan 24, 2011 at 9:47 AM, John Salvatier < >> jsalvati at u.washington.edu> wrote: >> >>> Hello, >>> >>> I have discovered a strange bug with numexpr. numexpr.evaluate gives >>> randomized results on arrays larger than 2047 elements. The following >>> program demonstrates this: >>> >>> from numpy import * >>> from numexpr import evaluate >>> >>> def func(x): >>> >>> return evaluate("sum(x, axis = 0)") >>> >>> >>> x = zeros(2048)+.01 >>> >>> print evaluate("sum(x, axis = 0)") >>> print evaluate("sum(x, axis = 0)") >>> >>> For me this prints different results each time, for example: >>> >>> 11.67 >>> 14.84 >>> >>> If we set the size to 2047 I get consistent results. >>> >>> 20.47 >>> 20.47 >>> >>> Interestingly, if I do not add .01 to x, it consistently sums to 0. >> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jsalvati at u.washington.edu Mon Jan 24 14:13:43 2011 From: jsalvati at u.washington.edu (John Salvatier) Date: Mon, 24 Jan 2011 11:13:43 -0800 Subject: [Numpy-discussion] Numexpr giving randomized results on arrays larger than 2047 elements In-Reply-To: References: Message-ID: Looks like this is related to issue 41 ( http://code.google.com/p/numexpr/issues/detail?id=41&can=1). On Mon, Jan 24, 2011 at 10:29 AM, John Salvatier wrote: > I also get the same issue with prod() > > > On Mon, Jan 24, 2011 at 10:23 AM, Warren Weckesser < > warren.weckesser at enthought.com> wrote: > >> I see the same "randomness", but at a different array size: >> >> In [23]: numpy.__version__ >> Out[23]: '1.4.0' >> >> In [24]: import numexpr >> >> In [25]: numexpr.__version__ >> Out[25]: '1.4.1' >> >> In [26]: x = zeros(8192)+0.01 >> >> In [27]: print evaluate('sum(x, axis=0)') >> 72.97 >> >> In [28]: print evaluate('sum(x, axis=0)') >> 66.92 >> >> In [29]: print evaluate('sum(x, axis=0)') >> 67.9 >> >> In [30]: x = zeros(8193)+0.01 >> >> In [31]: print evaluate('sum(x, axis=0)') >> 72.63 >> >> In [32]: print evaluate('sum(x, axis=0)') >> 71.74 >> >> In [33]: print evaluate('sum(x, axis=0)') >> 81.93 >> >> In [34]: x = zeros(8191)+0.01 >> >> In [35]: print evaluate('sum(x, axis=0)') >> 81.91 >> >> In [36]: print evaluate('sum(x, axis=0)') >> 81.91 >> >> >> Warren >> >> >> >> On Mon, Jan 24, 2011 at 12:19 PM, John Salvatier < >> jsalvati at u.washington.edu> wrote: >> >>> Forgot to mention that I am using numexpr 1.4.1 and numpy 1.5.1 >>> >>> >>> On Mon, Jan 24, 2011 at 9:47 AM, John Salvatier < >>> jsalvati at u.washington.edu> wrote: >>> >>>> Hello, >>>> >>>> I have discovered a strange bug with numexpr. numexpr.evaluate gives >>>> randomized results on arrays larger than 2047 elements. The following >>>> program demonstrates this: >>>> >>>> from numpy import * >>>> from numexpr import evaluate >>>> >>>> def func(x): >>>> >>>> return evaluate("sum(x, axis = 0)") >>>> >>>> >>>> x = zeros(2048)+.01 >>>> >>>> print evaluate("sum(x, axis = 0)") >>>> print evaluate("sum(x, axis = 0)") >>>> >>>> For me this prints different results each time, for example: >>>> >>>> 11.67 >>>> 14.84 >>>> >>>> If we set the size to 2047 I get consistent results. >>>> >>>> 20.47 >>>> 20.47 >>>> >>>> Interestingly, if I do not add .01 to x, it consistently sums to 0. >>> >>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at enthought.com Mon Jan 24 14:35:55 2011 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Mon, 24 Jan 2011 13:35:55 -0600 Subject: [Numpy-discussion] Numexpr giving randomized results on arrays larger than 2047 elements In-Reply-To: References: Message-ID: On Mon, Jan 24, 2011 at 1:13 PM, John Salvatier wrote: > Looks like this is related to issue 41 ( > http://code.google.com/p/numexpr/issues/detail?id=41&can=1). That might not be the same issue. 
You can fix the "randomness" by setting the number of threads to 1, as in input [6] here: In [1]: import numexpr as ne In [2]: x = zeros(8192)+0.01 In [3]: ne.evaluate('sum(x, axis=0)') Out[3]: array(71.119999999999479) In [4]: ne.evaluate('sum(x, axis=0)') Out[4]: array(81.920000000005004) In [5]: ne.evaluate('sum(x, axis=0)') Out[5]: array(68.379999999998077) In [6]: ne.set_num_threads(1) In [7]: ne.evaluate('sum(x, axis=0)') Out[7]: array(81.920000000005004) In [8]: ne.evaluate('sum(x, axis=0)') Out[8]: array(81.920000000005004) In [9]: ne.evaluate('sum(x, axis=0)') Out[9]: array(81.920000000005004) Warren > > > On Mon, Jan 24, 2011 at 10:29 AM, John Salvatier < > jsalvati at u.washington.edu> wrote: > >> I also get the same issue with prod() >> >> >> On Mon, Jan 24, 2011 at 10:23 AM, Warren Weckesser < >> warren.weckesser at enthought.com> wrote: >> >>> I see the same "randomness", but at a different array size: >>> >>> In [23]: numpy.__version__ >>> Out[23]: '1.4.0' >>> >>> In [24]: import numexpr >>> >>> In [25]: numexpr.__version__ >>> Out[25]: '1.4.1' >>> >>> In [26]: x = zeros(8192)+0.01 >>> >>> In [27]: print evaluate('sum(x, axis=0)') >>> 72.97 >>> >>> In [28]: print evaluate('sum(x, axis=0)') >>> 66.92 >>> >>> In [29]: print evaluate('sum(x, axis=0)') >>> 67.9 >>> >>> In [30]: x = zeros(8193)+0.01 >>> >>> In [31]: print evaluate('sum(x, axis=0)') >>> 72.63 >>> >>> In [32]: print evaluate('sum(x, axis=0)') >>> 71.74 >>> >>> In [33]: print evaluate('sum(x, axis=0)') >>> 81.93 >>> >>> In [34]: x = zeros(8191)+0.01 >>> >>> In [35]: print evaluate('sum(x, axis=0)') >>> 81.91 >>> >>> In [36]: print evaluate('sum(x, axis=0)') >>> 81.91 >>> >>> >>> Warren >>> >>> >>> >>> On Mon, Jan 24, 2011 at 12:19 PM, John Salvatier < >>> jsalvati at u.washington.edu> wrote: >>> >>>> Forgot to mention that I am using numexpr 1.4.1 and numpy 1.5.1 >>>> >>>> >>>> On Mon, Jan 24, 2011 at 9:47 AM, John Salvatier < >>>> jsalvati at u.washington.edu> wrote: >>>> >>>>> Hello, >>>>> >>>>> I have discovered a strange bug with numexpr. numexpr.evaluate gives >>>>> randomized results on arrays larger than 2047 elements. The following >>>>> program demonstrates this: >>>>> >>>>> from numpy import * >>>>> from numexpr import evaluate >>>>> >>>>> def func(x): >>>>> >>>>> return evaluate("sum(x, axis = 0)") >>>>> >>>>> >>>>> x = zeros(2048)+.01 >>>>> >>>>> print evaluate("sum(x, axis = 0)") >>>>> print evaluate("sum(x, axis = 0)") >>>>> >>>>> For me this prints different results each time, for example: >>>>> >>>>> 11.67 >>>>> 14.84 >>>>> >>>>> If we set the size to 2047 I get consistent results. >>>>> >>>>> 20.47 >>>>> 20.47 >>>>> >>>>> Interestingly, if I do not add .01 to x, it consistently sums to 0. >>>> >>>> >>>> >>>> _______________________________________________ >>>> NumPy-Discussion mailing list >>>> NumPy-Discussion at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>> >>>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jsalvati at u.washington.edu Mon Jan 24 14:53:14 2011 From: jsalvati at u.washington.edu (John Salvatier) Date: Mon, 24 Jan 2011 11:53:14 -0800 Subject: [Numpy-discussion] Numexpr giving randomized results on arrays larger than 2047 elements In-Reply-To: References: Message-ID: You're right, I got the same behavior. Interesting. On Mon, Jan 24, 2011 at 11:35 AM, Warren Weckesser < warren.weckesser at enthought.com> wrote: > > > On Mon, Jan 24, 2011 at 1:13 PM, John Salvatier > wrote: > >> Looks like this is related to issue 41 ( >> http://code.google.com/p/numexpr/issues/detail?id=41&can=1). > > > > That might not be the same issue. > > You can fix the "randomness" by setting the number of threads to 1, as in > input [6] here: > > In [1]: import numexpr as ne > > In [2]: x = zeros(8192)+0.01 > > In [3]: ne.evaluate('sum(x, axis=0)') > Out[3]: array(71.119999999999479) > > In [4]: ne.evaluate('sum(x, axis=0)') > Out[4]: array(81.920000000005004) > > In [5]: ne.evaluate('sum(x, axis=0)') > Out[5]: array(68.379999999998077) > > In [6]: ne.set_num_threads(1) > > In [7]: ne.evaluate('sum(x, axis=0)') > Out[7]: array(81.920000000005004) > > In [8]: ne.evaluate('sum(x, axis=0)') > Out[8]: array(81.920000000005004) > > In [9]: ne.evaluate('sum(x, axis=0)') > Out[9]: array(81.920000000005004) > > > Warren > > > >> >> >> On Mon, Jan 24, 2011 at 10:29 AM, John Salvatier < >> jsalvati at u.washington.edu> wrote: >> >>> I also get the same issue with prod() >>> >>> >>> On Mon, Jan 24, 2011 at 10:23 AM, Warren Weckesser < >>> warren.weckesser at enthought.com> wrote: >>> >>>> I see the same "randomness", but at a different array size: >>>> >>>> In [23]: numpy.__version__ >>>> Out[23]: '1.4.0' >>>> >>>> In [24]: import numexpr >>>> >>>> In [25]: numexpr.__version__ >>>> Out[25]: '1.4.1' >>>> >>>> In [26]: x = zeros(8192)+0.01 >>>> >>>> In [27]: print evaluate('sum(x, axis=0)') >>>> 72.97 >>>> >>>> In [28]: print evaluate('sum(x, axis=0)') >>>> 66.92 >>>> >>>> In [29]: print evaluate('sum(x, axis=0)') >>>> 67.9 >>>> >>>> In [30]: x = zeros(8193)+0.01 >>>> >>>> In [31]: print evaluate('sum(x, axis=0)') >>>> 72.63 >>>> >>>> In [32]: print evaluate('sum(x, axis=0)') >>>> 71.74 >>>> >>>> In [33]: print evaluate('sum(x, axis=0)') >>>> 81.93 >>>> >>>> In [34]: x = zeros(8191)+0.01 >>>> >>>> In [35]: print evaluate('sum(x, axis=0)') >>>> 81.91 >>>> >>>> In [36]: print evaluate('sum(x, axis=0)') >>>> 81.91 >>>> >>>> >>>> Warren >>>> >>>> >>>> >>>> On Mon, Jan 24, 2011 at 12:19 PM, John Salvatier < >>>> jsalvati at u.washington.edu> wrote: >>>> >>>>> Forgot to mention that I am using numexpr 1.4.1 and numpy 1.5.1 >>>>> >>>>> >>>>> On Mon, Jan 24, 2011 at 9:47 AM, John Salvatier < >>>>> jsalvati at u.washington.edu> wrote: >>>>> >>>>>> Hello, >>>>>> >>>>>> I have discovered a strange bug with numexpr. numexpr.evaluate gives >>>>>> randomized results on arrays larger than 2047 elements. The following >>>>>> program demonstrates this: >>>>>> >>>>>> from numpy import * >>>>>> from numexpr import evaluate >>>>>> >>>>>> def func(x): >>>>>> >>>>>> return evaluate("sum(x, axis = 0)") >>>>>> >>>>>> >>>>>> x = zeros(2048)+.01 >>>>>> >>>>>> print evaluate("sum(x, axis = 0)") >>>>>> print evaluate("sum(x, axis = 0)") >>>>>> >>>>>> For me this prints different results each time, for example: >>>>>> >>>>>> 11.67 >>>>>> 14.84 >>>>>> >>>>>> If we set the size to 2047 I get consistent results. >>>>>> >>>>>> 20.47 >>>>>> 20.47 >>>>>> >>>>>> Interestingly, if I do not add .01 to x, it consistently sums to 0. 
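A small self-check built only from calls already shown in this thread is one way to tell whether a given numexpr install is affected, and to fall back to a single thread if it is:

    import numpy as np
    import numexpr as ne

    x = np.zeros(8192) + 0.01
    expected = x.sum()          # 81.92, what the reduction should return

    # Repeat the reduction a few times; an affected build returns varying values.
    results = []
    for _ in range(10):
        results.append(float(ne.evaluate('sum(x, axis=0)')))

    if max(abs(r - expected) for r in results) > 1e-6:
        ne.set_num_threads(1)   # workaround: force the single-threaded path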
>>>>> >>>>> >>>>> >>>>> _______________________________________________ >>>>> NumPy-Discussion mailing list >>>>> NumPy-Discussion at scipy.org >>>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>>> >>>>> >>>> >>>> _______________________________________________ >>>> NumPy-Discussion mailing list >>>> NumPy-Discussion at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>> >>>> >>> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From e.antero.tammi at gmail.com Mon Jan 24 16:29:19 2011 From: e.antero.tammi at gmail.com (eat) Date: Mon, 24 Jan 2011 21:29:19 +0000 (UTC) Subject: [Numpy-discussion] =?utf-8?q?How_to_improve_performance_of_slow_t?= =?utf-8?q?ri*=5Findices_calculations=3F?= Message-ID: Hi, Running on: In []: np.__version__ Out[]: '1.5.1' In []: sys.version Out[]: '2.7.1 (r271:86832, Nov 27 2010, 18:30:46) [MSC v.1500 32 bit (Intel)]' For the reference: In []: X= randn(10, 125) In []: timeit dot(X.T, X) 10000 loops, best of 3: 170 us per loop In []: X= randn(10, 250) In []: timeit dot(X.T, X) 1000 loops, best of 3: 671 us per loop In []: X= randn(10, 500) In []: timeit dot(X.T, X) 100 loops, best of 3: 5.15 ms per loop In []: X= randn(10, 1000) In []: timeit dot(X.T, X) 100 loops, best of 3: 20 ms per loop In []: X= randn(10, 2000) In []: timeit dot(X.T, X) 10 loops, best of 3: 80.7 ms per loop Performance of triu_indices: In []: timeit triu_indices(125) 1000 loops, best of 3: 662 us per loop In []: timeit triu_indices(250) 100 loops, best of 3: 2.55 ms per loop In []: timeit triu_indices(500) 100 loops, best of 3: 15 ms per loop In []: timeit triu_indices(1000) 10 loops, best of 3: 59.8 ms per loop In []: timeit triu_indices(2000) 1 loops, best of 3: 239 ms per loop So the tri*_indices calculations seems to be unreasonable slow compared to for example calculations of inner products. Now, just to compare for a very naive implementation of triu indices. In []: def iut(n): ..: r= np.empty(n* (n+ 1)/ 2, dtype= int) ..: c= r.copy() ..: a= np.arange(n) ..: m= 0 ..: for i in xrange(n): ..: ni= n- i ..: mni= m+ ni ..: r[m: mni]= i ..: c[m: mni]= a[i: n] ..: m+= ni ..: return (r, c) ..: Are we really calculating the same thing? In []: triu_indices(5) Out[]: (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) In []: iut(5) Out[]: (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) Seems so, and then its performance: In []: timeit iut(125) 1000 loops, best of 3: 992 us per loop In []: timeit iut(250) 100 loops, best of 3: 2.03 ms per loop In []: timeit iut(500) 100 loops, best of 3: 5.3 ms per loop In []: timeit iut(1000) 100 loops, best of 3: 13.9 ms per loop In []: timeit iut(2000) 10 loops, best of 3: 39.8 ms per loop Even the naive implementation is very slow, but allready outperforms triu_indices, when n is > 250! So finally my question is how one could substantially improve the performance of indices calculations? 
Regards, eat From jsseabold at gmail.com Mon Jan 24 17:47:59 2011 From: jsseabold at gmail.com (Skipper Seabold) Date: Mon, 24 Jan 2011 17:47:59 -0500 Subject: [Numpy-discussion] bug in genfromtxt with missing values? Message-ID: Am I misreading the docs or missing something? Consider the following adapted from here: http://docs.scipy.org/doc/numpy/user/basics.io.genfromtxt.html from StringIO import StringIO import numpy as np data = "1, 2, 3\n4, ,5" np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c", missing_values=" ", filling_values=0) array([(1.0, 2.0, 3.0), (4.0, nan, 5.0)], dtype=[('a', ' References: Message-ID: On Mon, Jan 24, 2011 at 4:29 PM, eat wrote: > Hi, > > Running on: > In []: np.__version__ > Out[]: '1.5.1' > In []: sys.version > Out[]: '2.7.1 (r271:86832, Nov 27 2010, 18:30:46) [MSC v.1500 32 bit (Intel)]' > > For the reference: > In []: X= randn(10, 125) > In []: timeit dot(X.T, X) > 10000 loops, best of 3: 170 us per loop > In []: X= randn(10, 250) > In []: timeit dot(X.T, X) > 1000 loops, best of 3: 671 us per loop > In []: X= randn(10, 500) > In []: timeit dot(X.T, X) > 100 loops, best of 3: 5.15 ms per loop > In []: X= randn(10, 1000) > In []: timeit dot(X.T, X) > 100 loops, best of 3: 20 ms per loop > In []: X= randn(10, 2000) > In []: timeit dot(X.T, X) > 10 loops, best of 3: 80.7 ms per loop > > Performance of triu_indices: > In []: timeit triu_indices(125) > 1000 loops, best of 3: 662 us per loop > In []: timeit triu_indices(250) > 100 loops, best of 3: 2.55 ms per loop > In []: timeit triu_indices(500) > 100 loops, best of 3: 15 ms per loop > In []: timeit triu_indices(1000) > 10 loops, best of 3: 59.8 ms per loop > In []: timeit triu_indices(2000) > 1 loops, best of 3: 239 ms per loop > > So the tri*_indices calculations seems to be unreasonable slow compared to for > example calculations of inner products. > > Now, just to compare for a very naive implementation of triu indices. > In []: def iut(n): > ? ..: ? ? r= np.empty(n* (n+ 1)/ 2, dtype= int) > ? ..: ? ? c= r.copy() > ? ..: ? ? a= np.arange(n) > ? ..: ? ? m= 0 > ? ..: ? ? for i in xrange(n): > ? ..: ? ? ? ? ni= n- i > ? ..: ? ? ? ? mni= m+ ni > ? ..: ? ? ? ? r[m: mni]= i > ? ..: ? ? ? ? c[m: mni]= a[i: n] > ? ..: ? ? ? ? m+= ni > ? ..: ? ? return (r, c) > ? ..: > > Are we really calculating the same thing? > In []: triu_indices(5) > Out[]: > (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), > ?array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) > In []: iut(5) > Out[]: > (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), > ?array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) > > Seems so, and then its performance: > In []: timeit iut(125) > 1000 loops, best of 3: 992 us per loop > In []: timeit iut(250) > 100 loops, best of 3: 2.03 ms per loop > In []: timeit iut(500) > 100 loops, best of 3: 5.3 ms per loop > In []: timeit iut(1000) > 100 loops, best of 3: 13.9 ms per loop > In []: timeit iut(2000) > 10 loops, best of 3: 39.8 ms per loop > > Even the naive implementation is very slow, but allready outperforms > triu_indices, when n is > 250! > > So finally my question is how one could substantially improve the performance > of indices calculations? What's the timing of this version (taken from nitime) ? 
it builds a full intermediate array m = np.ones((n,n),int) a = np.triu(m,k) np.where(a != 0) >>> n=5 >>> m = np.ones((n,n),int) >>> np.where(np.triu(m,0) != 0) (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) or maybe variations on it that all build a full intermediate matrix >>> np.where(1-np.tri(n,n,-1)) (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) >>> np.where(np.subtract.outer(np.arange(n), np.arange(n))<=0) (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) Josef > > > Regards, > eat > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From josef.pktd at gmail.com Mon Jan 24 19:01:16 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 24 Jan 2011 19:01:16 -0500 Subject: [Numpy-discussion] How to improve performance of slow tri*_indices calculations? In-Reply-To: References: Message-ID: On Mon, Jan 24, 2011 at 6:49 PM, wrote: > On Mon, Jan 24, 2011 at 4:29 PM, eat wrote: >> Hi, >> >> Running on: >> In []: np.__version__ >> Out[]: '1.5.1' >> In []: sys.version >> Out[]: '2.7.1 (r271:86832, Nov 27 2010, 18:30:46) [MSC v.1500 32 bit (Intel)]' >> >> For the reference: >> In []: X= randn(10, 125) >> In []: timeit dot(X.T, X) >> 10000 loops, best of 3: 170 us per loop >> In []: X= randn(10, 250) >> In []: timeit dot(X.T, X) >> 1000 loops, best of 3: 671 us per loop >> In []: X= randn(10, 500) >> In []: timeit dot(X.T, X) >> 100 loops, best of 3: 5.15 ms per loop >> In []: X= randn(10, 1000) >> In []: timeit dot(X.T, X) >> 100 loops, best of 3: 20 ms per loop >> In []: X= randn(10, 2000) >> In []: timeit dot(X.T, X) >> 10 loops, best of 3: 80.7 ms per loop >> >> Performance of triu_indices: >> In []: timeit triu_indices(125) >> 1000 loops, best of 3: 662 us per loop >> In []: timeit triu_indices(250) >> 100 loops, best of 3: 2.55 ms per loop >> In []: timeit triu_indices(500) >> 100 loops, best of 3: 15 ms per loop >> In []: timeit triu_indices(1000) >> 10 loops, best of 3: 59.8 ms per loop >> In []: timeit triu_indices(2000) >> 1 loops, best of 3: 239 ms per loop >> >> So the tri*_indices calculations seems to be unreasonable slow compared to for >> example calculations of inner products. >> >> Now, just to compare for a very naive implementation of triu indices. >> In []: def iut(n): >> ? ..: ? ? r= np.empty(n* (n+ 1)/ 2, dtype= int) >> ? ..: ? ? c= r.copy() >> ? ..: ? ? a= np.arange(n) >> ? ..: ? ? m= 0 >> ? ..: ? ? for i in xrange(n): >> ? ..: ? ? ? ? ni= n- i >> ? ..: ? ? ? ? mni= m+ ni >> ? ..: ? ? ? ? r[m: mni]= i >> ? ..: ? ? ? ? c[m: mni]= a[i: n] >> ? ..: ? ? ? ? m+= ni >> ? ..: ? ? return (r, c) >> ? ..: >> >> Are we really calculating the same thing? 
>> In []: triu_indices(5) >> Out[]: >> (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), >> ?array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) >> In []: iut(5) >> Out[]: >> (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), >> ?array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) >> >> Seems so, and then its performance: >> In []: timeit iut(125) >> 1000 loops, best of 3: 992 us per loop >> In []: timeit iut(250) >> 100 loops, best of 3: 2.03 ms per loop >> In []: timeit iut(500) >> 100 loops, best of 3: 5.3 ms per loop >> In []: timeit iut(1000) >> 100 loops, best of 3: 13.9 ms per loop >> In []: timeit iut(2000) >> 10 loops, best of 3: 39.8 ms per loop >> >> Even the naive implementation is very slow, but allready outperforms >> triu_indices, when n is > 250! >> >> So finally my question is how one could substantially improve the performance >> of indices calculations? > > What's the timing of this version (taken from nitime) ? it builds a > full intermediate array I should have checked the numpy source first, that's exactly the implementation of triu_indices in numpy 1.5.1 Josef > > ? ?m = np.ones((n,n),int) > ? ?a = np.triu(m,k) > ? ?np.where(a != 0) > >>>> n=5 >>>> m = np.ones((n,n),int) >>>> np.where(np.triu(m,0) != 0) > (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), > array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) > > or maybe variations on it that all build a full intermediate matrix > >>>> np.where(1-np.tri(n,n,-1)) > (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), > array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) > >>>> np.where(np.subtract.outer(np.arange(n), np.arange(n))<=0) > (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), > array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) > > Josef > >> >> >> Regards, >> eat >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > From e.antero.tammi at gmail.com Tue Jan 25 08:08:26 2011 From: e.antero.tammi at gmail.com (E. Antero Tammi) Date: Tue, 25 Jan 2011 15:08:26 +0200 Subject: [Numpy-discussion] How to improve performance of slow tri*_indices calculations? 
In-Reply-To: References: Message-ID: Hi, On Tue, Jan 25, 2011 at 1:49 AM, wrote: > On Mon, Jan 24, 2011 at 4:29 PM, eat wrote: > > Hi, > > > > Running on: > > In []: np.__version__ > > Out[]: '1.5.1' > > In []: sys.version > > Out[]: '2.7.1 (r271:86832, Nov 27 2010, 18:30:46) [MSC v.1500 32 bit > (Intel)]' > > > > For the reference: > > In []: X= randn(10, 125) > > In []: timeit dot(X.T, X) > > 10000 loops, best of 3: 170 us per loop > > In []: X= randn(10, 250) > > In []: timeit dot(X.T, X) > > 1000 loops, best of 3: 671 us per loop > > In []: X= randn(10, 500) > > In []: timeit dot(X.T, X) > > 100 loops, best of 3: 5.15 ms per loop > > In []: X= randn(10, 1000) > > In []: timeit dot(X.T, X) > > 100 loops, best of 3: 20 ms per loop > > In []: X= randn(10, 2000) > > In []: timeit dot(X.T, X) > > 10 loops, best of 3: 80.7 ms per loop > > > > Performance of triu_indices: > > In []: timeit triu_indices(125) > > 1000 loops, best of 3: 662 us per loop > > In []: timeit triu_indices(250) > > 100 loops, best of 3: 2.55 ms per loop > > In []: timeit triu_indices(500) > > 100 loops, best of 3: 15 ms per loop > > In []: timeit triu_indices(1000) > > 10 loops, best of 3: 59.8 ms per loop > > In []: timeit triu_indices(2000) > > 1 loops, best of 3: 239 ms per loop > > > > So the tri*_indices calculations seems to be unreasonable slow compared > to for > > example calculations of inner products. > > > > Now, just to compare for a very naive implementation of triu indices. > > In []: def iut(n): > > ..: r= np.empty(n* (n+ 1)/ 2, dtype= int) > > ..: c= r.copy() > > ..: a= np.arange(n) > > ..: m= 0 > > ..: for i in xrange(n): > > ..: ni= n- i > > ..: mni= m+ ni > > ..: r[m: mni]= i > > ..: c[m: mni]= a[i: n] > > ..: m+= ni > > ..: return (r, c) > > ..: > > > > Are we really calculating the same thing? > > In []: triu_indices(5) > > Out[]: > > (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), > > array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) > > In []: iut(5) > > Out[]: > > (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), > > array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) > > > > Seems so, and then its performance: > > In []: timeit iut(125) > > 1000 loops, best of 3: 992 us per loop > > In []: timeit iut(250) > > 100 loops, best of 3: 2.03 ms per loop > > In []: timeit iut(500) > > 100 loops, best of 3: 5.3 ms per loop > > In []: timeit iut(1000) > > 100 loops, best of 3: 13.9 ms per loop > > In []: timeit iut(2000) > > 10 loops, best of 3: 39.8 ms per loop > > > > Even the naive implementation is very slow, but allready outperforms > > triu_indices, when n is > 250! > > > > So finally my question is how one could substantially improve the > performance > > of indices calculations? > > What's the timing of this version (taken from nitime) ? it builds a > full intermediate array > > m = np.ones((n,n),int) > a = np.triu(m,k) > np.where(a != 0) > > >>> n=5 > >>> m = np.ones((n,n),int) > >>> np.where(np.triu(m,0) != 0) > (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), > array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) This is ~20% slower than triu_indices. > > or maybe variations on it that all build a full intermediate matrix > > >>> np.where(1-np.tri(n,n,-1)) > (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), > array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) This ~5% faster than triu_indicies. 
> > >>> np.where(np.subtract.outer(np.arange(n), np.arange(n))<=0) > (array([0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4]), > array([0, 1, 2, 3, 4, 1, 2, 3, 4, 2, 3, 4, 3, 4, 4])) Clever, and its 50% faster than triu_indices, but the naive implementaion is still almost 3x faster. However it seems that some 80% of time is spent in where. So simple subtract.outer(arange(n), arange(n))<= 0 and logical indenxing outperforms slightly the naive version. I was hoping to find a way to do this at least 10x faster than naive implementation, but meanwhile I'll stick with logical indexing. Thanks, eat > > Josef > > > > > > > Regards, > > eat > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From faltet at pytables.org Tue Jan 25 09:08:15 2011 From: faltet at pytables.org (Francesc Alted) Date: Tue, 25 Jan 2011 15:08:15 +0100 Subject: [Numpy-discussion] Numexpr giving randomized results on arrays larger than 2047 elements In-Reply-To: References: Message-ID: <201101251508.15074.faltet@pytables.org> A Monday 24 January 2011 18:47:58 John Salvatier escrigu?: > Hello, > > I have discovered a strange bug with numexpr. numexpr.evaluate gives > randomized results on arrays larger than 2047 elements. The following > program demonstrates this: > > from numpy import * > from numexpr import evaluate > > def func(x): > > return evaluate("sum(x, axis = 0)") > > > x = zeros(2048)+.01 > > print evaluate("sum(x, axis = 0)") > print evaluate("sum(x, axis = 0)") > > For me this prints different results each time, for example: > > 11.67 > 14.84 > > If we set the size to 2047 I get consistent results. > > 20.47 > 20.47 > > Interestingly, if I do not add .01 to x, it consistently sums to 0. I'm about to release Numexpr 1.4.2 that should fix this. Could you give it a try at the tarball in?: python setup.py sdist Thanks, -- Francesc Alted From pgmdevlist at gmail.com Tue Jan 25 11:17:55 2011 From: pgmdevlist at gmail.com (Pierre GM) Date: Tue, 25 Jan 2011 17:17:55 +0100 Subject: [Numpy-discussion] bug in genfromtxt with missing values? In-Reply-To: References: Message-ID: <66102999-E2CE-4666-8D76-1B2AC3ED9235@gmail.com> On Jan 24, 2011, at 11:47 PM, Skipper Seabold wrote: > Am I misreading the docs or missing something? Consider the following > adapted from here: > http://docs.scipy.org/doc/numpy/user/basics.io.genfromtxt.html > > from StringIO import StringIO > import numpy as np > > data = "1, 2, 3\n4, ,5" > > np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c", > missing_values=" ", filling_values=0) > array([(1.0, 2.0, 3.0), (4.0, nan, 5.0)], > dtype=[('a', ' > np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c", > missing_values={'b':" "}, filling_values={'b' : 0}) > array([(1.0, 2.0, 3.0), (4.0, 0.0, 5.0)], > dtype=[('a', ' > Unless I use the dict for missing_values, it doesn't fill them in. > It's probably a bug . Mind opening a ticket ? I'll try to care of it when I can. Thx in advance P. 
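In the meantime, a couple of workarounds seem to behave as expected here (the per-column NaN patch below is only a sketch, not what the eventual fix will look like):

from StringIO import StringIO
import numpy as np

data = "1, 2, 3\n4, ,5"

# the dict-based spec sidesteps the scalar filling_values problem
np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c",
              missing_values={'b': " "}, filling_values={'b': 0})

# or load first and patch the NaNs column by column afterwards
arr = np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c")
for name in arr.dtype.names:
    col = arr[name]            # field access returns a view
    col[np.isnan(col)] = 0.0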
From charlesr.harris at gmail.com Tue Jan 25 11:42:01 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 25 Jan 2011 09:42:01 -0700 Subject: [Numpy-discussion] Numpy 2.0 schedule Message-ID: Hi All, Just thought it was time to start discussing a release schedule for numpy 2.0 so we have something to aim at. I'm thinking sometime in the period April-June might be appropriate. There is a lot coming with the next release: the Enthought's numpy refactoring, Mark's float16 and iterator work, and support for IronPython. How do things look to the folks involved in those projects? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsseabold at gmail.com Tue Jan 25 11:56:55 2011 From: jsseabold at gmail.com (Skipper Seabold) Date: Tue, 25 Jan 2011 11:56:55 -0500 Subject: [Numpy-discussion] bug in genfromtxt with missing values? In-Reply-To: <66102999-E2CE-4666-8D76-1B2AC3ED9235@gmail.com> References: <66102999-E2CE-4666-8D76-1B2AC3ED9235@gmail.com> Message-ID: On Tue, Jan 25, 2011 at 11:17 AM, Pierre GM wrote: > > On Jan 24, 2011, at 11:47 PM, Skipper Seabold wrote: > >> Am I misreading the docs or missing something? ?Consider the following >> adapted from here: >> http://docs.scipy.org/doc/numpy/user/basics.io.genfromtxt.html >> >> from StringIO import StringIO >> import numpy as np >> >> data = "1, 2, 3\n4, ,5" >> >> np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c", >> missing_values=" ", filling_values=0) >> array([(1.0, 2.0, 3.0), (4.0, nan, 5.0)], >> ? ? ?dtype=[('a', '> >> np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c", >> missing_values={'b':" "}, filling_values={'b' : 0}) >> array([(1.0, 2.0, 3.0), (4.0, 0.0, 5.0)], >> ? ? ?dtype=[('a', '> >> Unless I use the dict for missing_values, it doesn't fill them in. >> > > It's probably a bug . Mind opening a ticket ? I'll try to care of it when I can. > Thx in advance > P. http://projects.scipy.org/numpy/ticket/1722 Forgot to use the code formatting, and it doesn't look like I can edit. Thanks, Skipper From faltet at pytables.org Tue Jan 25 12:39:11 2011 From: faltet at pytables.org (Francesc Alted) Date: Tue, 25 Jan 2011 18:39:11 +0100 Subject: [Numpy-discussion] ANN: Numexpr 1.4.2 released Message-ID: <201101251839.11824.faltet@pytables.org> ========================== Announcing Numexpr 1.4.2 ========================== Numexpr is a fast numerical expression evaluator for NumPy. With it, expressions that operate on arrays (like "3*a+4*b") are accelerated and use less memory than doing the same calculation in Python. What's new ========== This is a maintenance release. The most annying issues have been fixed (including the reduction bugs introduced in 1.4 series). Also, several performance enhancements are included too. In case you want to know more in detail what has changed in this version, see: http://code.google.com/p/numexpr/wiki/ReleaseNotes or have a look at RELEASE_NOTES.txt in the tarball. Where I can find Numexpr? ========================= The project is hosted at Google code in: http://code.google.com/p/numexpr/ You can get the packages from PyPI as well: http://pypi.python.org/pypi/numexpr Share your experience ===================== Let us know of any bugs, suggestions, gripes, kudos, etc. you may have. Enjoy! -- Francesc Alted From bsouthey at gmail.com Tue Jan 25 15:06:12 2011 From: bsouthey at gmail.com (Bruce Southey) Date: Tue, 25 Jan 2011 14:06:12 -0600 Subject: [Numpy-discussion] bug in genfromtxt with missing values? 
In-Reply-To: References: <66102999-E2CE-4666-8D76-1B2AC3ED9235@gmail.com> Message-ID: <4D3F2D34.4050304@gmail.com> On 01/25/2011 10:56 AM, Skipper Seabold wrote: > On Tue, Jan 25, 2011 at 11:17 AM, Pierre GM wrote: >> On Jan 24, 2011, at 11:47 PM, Skipper Seabold wrote: >> >>> Am I misreading the docs or missing something? Consider the following >>> adapted from here: >>> http://docs.scipy.org/doc/numpy/user/basics.io.genfromtxt.html >>> >>> from StringIO import StringIO >>> import numpy as np >>> >>> data = "1, 2, 3\n4, ,5" >>> >>> np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c", >>> missing_values=" ", filling_values=0) >>> array([(1.0, 2.0, 3.0), (4.0, nan, 5.0)], >>> dtype=[('a', '>> >>> np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c", >>> missing_values={'b':" "}, filling_values={'b' : 0}) >>> array([(1.0, 2.0, 3.0), (4.0, 0.0, 5.0)], >>> dtype=[('a', '>> >>> Unless I use the dict for missing_values, it doesn't fill them in. >>> >> It's probably a bug . Mind opening a ticket ? I'll try to care of it when I can. >> Thx in advance >> P. > http://projects.scipy.org/numpy/ticket/1722 > > Forgot to use the code formatting, and it doesn't look like I can edit. > > Thanks, > > Skipper Hi, Your filling_values is zero so there is this line (1295?) in the code: user_filling_values = filling_values or [] Which of cause presumes your filling_values is not something like 0 or [0]. Now it can be a code bug or just undocumented feature that filling_values can not a single zero. Thus something like these work: np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c", filling_values=-90) np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c", filling_values=[0,0]) np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c", filling_values=[0,0,0]) Bruce From oliphant at enthought.com Tue Jan 25 15:13:40 2011 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 25 Jan 2011 14:13:40 -0600 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: On Jan 25, 2011, at 10:42 AM, Charles R Harris wrote: > Hi All, > > Just thought it was time to start discussing a release schedule for numpy 2.0 so we have something to aim at. I'm thinking sometime in the period April-June might be appropriate. There is a lot coming with the next release: the Enthought's numpy refactoring, Mark's float16 and iterator work, and support for IronPython. How do things look to the folks involved in those projects? I would target June / July at this point ;-) I know I deserve a "I told you so" from Chuck --- I will take it. There is a bit of work that Mark is doing that would be good to include, also some modifications to the re-factoring that will support better small array performance. It may make sense for a NumPy 1.6 to come out in March / April in the interim. Thoughts? -Travis > > Chuck > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion --- Travis Oliphant Enthought, Inc. oliphant at enthought.com 1-512-536-1057 http://www.enthought.com From daniele at grinta.net Tue Jan 25 16:04:42 2011 From: daniele at grinta.net (Daniele Nicolodi) Date: Tue, 25 Jan 2011 22:04:42 +0100 Subject: [Numpy-discussion] Wiener filter / decorrelation Message-ID: <4D3F3AEA.9050600@grinta.net> Hello, I'm trying to write a numerical implementation of Wiener filtering / decorrelation (extraction of a signal from noisy time series). 
What I'm trying to do is the construction of the time domain filter from a measurement of the power spectrum of the noise and the shape of the signal. However I'm encountering some problems that are beyond my knowledge of the matter. Can someone suggest me a reference text book, or other resource? Thank you in advance. Cheers, -- Daniele From jrocher at enthought.com Tue Jan 25 16:12:59 2011 From: jrocher at enthought.com (Jonathan Rocher) Date: Tue, 25 Jan 2011 15:12:59 -0600 Subject: [Numpy-discussion] Wiener filter / decorrelation In-Reply-To: <4D3F3AEA.9050600@grinta.net> References: <4D3F3AEA.9050600@grinta.net> Message-ID: Hi Daniele, I would recommend the Numerical recipes in Fortran 77, obviously not for the language but for its mathematical sections and its discussions of coding algorithms efficiently. Section 13.3 is about wiener filtering with FFT. Hope this helps, Jonathan On Tue, Jan 25, 2011 at 3:04 PM, Daniele Nicolodi wrote: > Hello, > > I'm trying to write a numerical implementation of Wiener filtering / > decorrelation (extraction of a signal from noisy time series). What I'm > trying to do is the construction of the time domain filter from a > measurement of the power spectrum of the noise and the shape of the signal. > > However I'm encountering some problems that are beyond my knowledge of > the matter. Can someone suggest me a reference text book, or other > resource? > > Thank you in advance. Cheers, > -- > Daniele > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Jonathan Rocher, Enthought, Inc. jrocher at enthought.com 1-512-536-1057 http://www.enthought.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Tue Jan 25 16:20:02 2011 From: pav at iki.fi (Pauli Virtanen) Date: Tue, 25 Jan 2011 21:20:02 +0000 (UTC) Subject: [Numpy-discussion] Wiener filter / decorrelation References: <4D3F3AEA.9050600@grinta.net> Message-ID: Hi, On Tue, 25 Jan 2011 22:04:42 +0100, Daniele Nicolodi wrote: [clip] > However I'm encountering some problems that are beyond my knowledge of > the matter. Can someone suggest me a reference text book, or other > resource? Scipy-user list would be more appropriate for queries not directly involving Numpy. -- Pauli Virtanen From daniele at grinta.net Tue Jan 25 16:26:50 2011 From: daniele at grinta.net (Daniele Nicolodi) Date: Tue, 25 Jan 2011 22:26:50 +0100 Subject: [Numpy-discussion] Wiener filter / decorrelation In-Reply-To: References: <4D3F3AEA.9050600@grinta.net> Message-ID: <4D3F401A.3060203@grinta.net> On 25/01/11 22:12, Jonathan Rocher wrote: > I would recommend the Numerical recipes in Fortran 77, obviously not for > the language but for its mathematical sections and its discussions of > coding algorithms efficiently. Section 13.3 is about wiener filtering > with FFT. Thank you, Jonathan. I took at look at my university library catalog, and the Fortran 77 version of Numerical Recipes is not available (I would have to get it at the engineering faculty). There is available the Fortran 90, and the C editions, plus another edition whose title is simply "Numerical Recipes". Is the content of the different editions equivalent, or should I look for this specific edition? Thank you again. 
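Piecing things together so far, the frequency-domain recipe seems to boil down to something like the sketch below (signal_psd and noise_psd are assumed to be already estimated on the rfft frequency grid; this is exactly the part I am unsure about):

import numpy as np

def wiener_filter(measured, signal_psd, noise_psd):
    # optimal filter in the frequency domain: Phi = S / (S + N),
    # applied to the spectrum of the measured (signal + noise) series
    spectrum = np.fft.rfft(measured)
    phi = signal_psd / (signal_psd + noise_psd)
    return np.fft.irfft(spectrum * phi, n=len(measured))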
Cheers, -- Daniele From daniele at grinta.net Tue Jan 25 16:29:35 2011 From: daniele at grinta.net (Daniele Nicolodi) Date: Tue, 25 Jan 2011 22:29:35 +0100 Subject: [Numpy-discussion] Wiener filter / decorrelation In-Reply-To: References: <4D3F3AEA.9050600@grinta.net> Message-ID: <4D3F40BF.7090303@grinta.net> On 25/01/11 22:20, Pauli Virtanen wrote: >> However I'm encountering some problems that are beyond my knowledge of >> the matter. Can someone suggest me a reference text book, or other >> resource? > > Scipy-user list would be more appropriate for queries not directly > involving Numpy. Sorry. I thought that the -discussion suffix meant a broader topic than numpy only. I'll address other similar questions to scipy-user. Thank. Cheers, -- Daniele From pgmdevlist at gmail.com Tue Jan 25 16:58:37 2011 From: pgmdevlist at gmail.com (Pierre GM) Date: Tue, 25 Jan 2011 22:58:37 +0100 Subject: [Numpy-discussion] bug in genfromtxt with missing values? In-Reply-To: <4D3F2D34.4050304@gmail.com> References: <66102999-E2CE-4666-8D76-1B2AC3ED9235@gmail.com> <4D3F2D34.4050304@gmail.com> Message-ID: <54D5EE9F-788E-4CAF-BD3A-8449A510FD90@gmail.com> On Jan 25, 2011, at 9:06 PM, Bruce Southey wrote: > Your filling_values is zero so there is this line (1295?) in the code: > user_filling_values = filling_values or [] > > Which of cause presumes your filling_values is not something like 0 or [0]. That's the bug. I forgot that filling_values could be 0. (I was more thinking of None) so it should be if filling_values is None: filling_values = [] user_filling_values = filling_values. > Now it can be a code bug or just undocumented feature that > filling_values can not a single zero. Thus something like these work: You're too kind. That's just sloppy coding... If you correct it before i do, don't forget to add a test case... Thx again P. From charlesr.harris at gmail.com Tue Jan 25 17:00:15 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 25 Jan 2011 15:00:15 -0700 Subject: [Numpy-discussion] Wiener filter / decorrelation In-Reply-To: <4D3F401A.3060203@grinta.net> References: <4D3F3AEA.9050600@grinta.net> <4D3F401A.3060203@grinta.net> Message-ID: On Tue, Jan 25, 2011 at 2:26 PM, Daniele Nicolodi wrote: > On 25/01/11 22:12, Jonathan Rocher wrote: > > I would recommend the Numerical recipes in Fortran 77, obviously not for > > the language but for its mathematical sections and its discussions of > > coding algorithms efficiently. Section 13.3 is about wiener filtering > > with FFT. > > Thank you, Jonathan. > > I took at look at my university library catalog, and the Fortran 77 > version of Numerical Recipes is not available (I would have to get it at > the engineering faculty). There is available the Fortran 90, and the C > editions, plus another edition whose title is simply "Numerical > Recipes". Is the content of the different editions equivalent, or should > I look for this specific edition? > > The edition/lanquage doesn't matter much. The older editions are available online, just google numerical recipes. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jrocher at enthought.com Tue Jan 25 17:21:59 2011 From: jrocher at enthought.com (Jonathan Rocher) Date: Tue, 25 Jan 2011 16:21:59 -0600 Subject: [Numpy-discussion] Wiener filter / decorrelation In-Reply-To: References: <4D3F3AEA.9050600@grinta.net> <4D3F401A.3060203@grinta.net> Message-ID: Actually I believe the version does matter: I have seen a C version of num rec that doesn't contain all the algorithmic part but only the codes. I cannot remember exactly which ones are the light versions. If I had to guess, the F90 is also a light version and that's why I bought the F77 book. Jonathan On Tue, Jan 25, 2011 at 4:00 PM, Charles R Harris wrote: > > > On Tue, Jan 25, 2011 at 2:26 PM, Daniele Nicolodi wrote: > >> On 25/01/11 22:12, Jonathan Rocher wrote: >> > I would recommend the Numerical recipes in Fortran 77, obviously not for >> > the language but for its mathematical sections and its discussions of >> > coding algorithms efficiently. Section 13.3 is about wiener filtering >> > with FFT. >> >> Thank you, Jonathan. >> >> I took at look at my university library catalog, and the Fortran 77 >> version of Numerical Recipes is not available (I would have to get it at >> the engineering faculty). There is available the Fortran 90, and the C >> editions, plus another edition whose title is simply "Numerical >> Recipes". Is the content of the different editions equivalent, or should >> I look for this specific edition? >> >> > The edition/lanquage doesn't matter much. The older editions are available > online, just google numerical recipes. > > Chuck > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Jonathan Rocher, Enthought, Inc. jrocher at enthought.com 1-512-536-1057 http://www.enthought.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From david at silveregg.co.jp Tue Jan 25 20:05:17 2011 From: david at silveregg.co.jp (David) Date: Wed, 26 Jan 2011 10:05:17 +0900 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: <4D3F734D.3000902@silveregg.co.jp> On 01/26/2011 01:42 AM, Charles R Harris wrote: > Hi All, > > Just thought it was time to start discussing a release schedule for > numpy 2.0 so we have something to aim at. I'm thinking sometime in the > period April-June might be appropriate. There is a lot coming with the > next release: the Enthought's numpy refactoring, Mark's float16 and > iterator work, and support for IronPython. How do things look to the > folks involved in those projects? One thing which I was wondering about numpy 2.0: what's the story for the C-API compared to 1.x for extensions. Is it fundamentally different so that extensions will need to be rewritten ? I especially wonder about scipy and cython's codegen backend, cheers, David From charlesr.harris at gmail.com Tue Jan 25 20:12:16 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 25 Jan 2011 18:12:16 -0700 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: <4D3F734D.3000902@silveregg.co.jp> References: <4D3F734D.3000902@silveregg.co.jp> Message-ID: On Tue, Jan 25, 2011 at 6:05 PM, David wrote: > On 01/26/2011 01:42 AM, Charles R Harris wrote: > > Hi All, > > > > Just thought it was time to start discussing a release schedule for > > numpy 2.0 so we have something to aim at. I'm thinking sometime in the > > period April-June might be appropriate. 
There is a lot coming with the > > next release: the Enthought's numpy refactoring, Mark's float16 and > > iterator work, and support for IronPython. How do things look to the > > folks involved in those projects? > > One thing which I was wondering about numpy 2.0: what's the story for > the C-API compared to 1.x for extensions. Is it fundamentally different > so that extensions will need to be rewritten ? I especially wonder about > scipy and cython's codegen backend, > > The C-API looks the same but anything hard coded type numbers and such will have problems. I would like to see the initial parts of the merge go in as early as possible so we can start chasing down any problems that turn up. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Tue Jan 25 20:18:24 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 25 Jan 2011 18:18:24 -0700 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: On Tue, Jan 25, 2011 at 1:13 PM, Travis Oliphant wrote: > > On Jan 25, 2011, at 10:42 AM, Charles R Harris wrote: > > > Hi All, > > > > Just thought it was time to start discussing a release schedule for numpy > 2.0 so we have something to aim at. I'm thinking sometime in the period > April-June might be appropriate. There is a lot coming with the next > release: the Enthought's numpy refactoring, Mark's float16 and iterator > work, and support for IronPython. How do things look to the folks involved > in those projects? > > I would target June / July at this point ;-) I know I deserve a "I told > you so" from Chuck --- I will take it. > > How much remains to get done? > There is a bit of work that Mark is doing that would be good to include, > also some modifications to the re-factoring that will support better small > array performance. > > Not everything needs to go into first release as long as the following releases are backward compatible. So the ABI needs it's final form as soon as possible. Is it still in flux? > It may make sense for a NumPy 1.6 to come out in March / April in the > interim. > > Pulling out the changes to attain backward compatibility isn't getting any easier. I'd rather shoot for 2.0 in June. What can the rest of us do to help move things along? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwwiebe at gmail.com Tue Jan 25 23:28:37 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Tue, 25 Jan 2011 20:28:37 -0800 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: On Tue, Jan 25, 2011 at 5:18 PM, Charles R Harris wrote: > > > On Tue, Jan 25, 2011 at 1:13 PM, Travis Oliphant wrote: > >> >> On Jan 25, 2011, at 10:42 AM, Charles R Harris wrote: >> >> > Hi All, >> > >> > Just thought it was time to start discussing a release schedule for >> numpy 2.0 so we have something to aim at. I'm thinking sometime in the >> period April-June might be appropriate. There is a lot coming with the next >> release: the Enthought's numpy refactoring, Mark's float16 and iterator >> work, and support for IronPython. How do things look to the folks involved >> in those projects? >> > My suggestion is to do a 1.6 relatively soon, as the current trunk feels pretty stable to me, and it would be nice to release the features without having to go through the whole merging process. > >> I would target June / July at this point ;-) I know I deserve a "I told >> you so" from Chuck --- I will take it. 
>> >> > How much remains to get done? > My changes probably make merging the refactor more challenging too. > > >> There is a bit of work that Mark is doing that would be good to include, >> also some modifications to the re-factoring that will support better small >> array performance. >> >> > Not everything needs to go into first release as long as the following > releases are backward compatible. So the ABI needs it's final form as soon > as possible. Is it still in flux? > I would suggest it is - there are a number of things I think could be improved in it, and it would be nice to bake in the underlying support features to make lazy/deferred evaluation of array expressions possible. > It may make sense for a NumPy 1.6 to come out in March / April in the >> interim. >> >> > Pulling out the changes to attain backward compatibility isn't getting any > easier. I'd rather shoot for 2.0 in June. What can the rest of us do to help > move things along? > I took a shot at fixing the ABI compatibility, and if PyArray_ArrFunc was the main issue, then that might be done. An ABI compatible 1.6 with the datetime and half types should be doable, just some extensions might get confused if they encounter arrays made with the new data types. -Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From dagss at student.matnat.uio.no Wed Jan 26 04:47:28 2011 From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn) Date: Wed, 26 Jan 2011 10:47:28 +0100 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: <4D3F734D.3000902@silveregg.co.jp> References: <4D3F734D.3000902@silveregg.co.jp> Message-ID: <4D3FEDB0.5020700@student.matnat.uio.no> On 01/26/2011 02:05 AM, David wrote: > On 01/26/2011 01:42 AM, Charles R Harris wrote: > >> Hi All, >> >> Just thought it was time to start discussing a release schedule for >> numpy 2.0 so we have something to aim at. I'm thinking sometime in the >> period April-June might be appropriate. There is a lot coming with the >> next release: the Enthought's numpy refactoring, Mark's float16 and >> iterator work, and support for IronPython. How do things look to the >> folks involved in those projects? >> > One thing which I was wondering about numpy 2.0: what's the story for > the C-API compared to 1.x for extensions. Is it fundamentally different > so that extensions will need to be rewritten ? I especially wonder about > scipy and cython's codegen backend, > For CPython, my understanding is that extensions that access struct fields directly without accessor macros need to be changed, but not much else. There's a "backwards-compatability" PyArray_* API for CPython. That doesn't work for .NET, but neither does anything else in C extensions. So in the SciPy port to .NET there's my efforts to replace f2py with fwrap/Cython, and many SciPy C extensions will be rewritten in Cython. These will use the Npy_* interface (or backwards-compatability PyArray_* wrappers in numpy.pxd, but these only work in Cython under .NET, not in C, due to typing issues (what is "object" and so on)). 
Dag Sverre From ralf.gommers at googlemail.com Wed Jan 26 05:23:48 2011 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Wed, 26 Jan 2011 18:23:48 +0800 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 12:28 PM, Mark Wiebe wrote: > On Tue, Jan 25, 2011 at 5:18 PM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> >> >> On Tue, Jan 25, 2011 at 1:13 PM, Travis Oliphant wrote: >> >>> >>> It may make sense for a NumPy 1.6 to come out in March / April in the >>> interim. >>> >>> >> Pulling out the changes to attain backward compatibility isn't getting any >> easier. I'd rather shoot for 2.0 in June. What can the rest of us do to help >> move things along? >> > > Focusing on 2.0 makes sense to me too. Besides that, March/April is bad timing for me so someone else should volunteer to be the release manager if we go for a 1.6. I took a shot at fixing the ABI compatibility, and if PyArray_ArrFunc was > the main issue, then that might be done. An ABI compatible 1.6 with the > datetime and half types should be doable, just some extensions might get > confused if they encounter arrays made with the new data types. > > Even if you fixed the ABI incompatibility (I don't know enough about the issue to confirm that), I'm not sure how much value there is in a release with as main new feature two dtypes that are not going to work well with scipy/other binaries compiled against 1.5. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From e.antero.tammi at gmail.com Wed Jan 26 07:22:09 2011 From: e.antero.tammi at gmail.com (eat) Date: Wed, 26 Jan 2011 14:22:09 +0200 Subject: [Numpy-discussion] tril, triu, document/ implementation conflict Message-ID: Hi, I just noticed a document/ implementation conflict with tril and triu. According tril documentation it should return of same shape and data-type as called. But this is not the case at least with dtype bool. The input shape is referred as (M, N) in tril and triu, but as (N, M) in tri. Inconsistent? Also I'm not very happy with the performance, at least dtype bool can be accelerated as follows. In []: M= ones((2000, 3000), dtype= bool) In []: timeit triu(M) 10 loops, best of 3: 173 ms per loop In []: timeit triu_(M) 10 loops, best of 3: 107 ms per loop In []: M= asarray(M, dtype= int) In []: timeit triu(M) 10 loops, best of 3: 160 ms per loop In []: timeit triu_(M) 10 loops, best of 3: 163 ms per loop In []: M= asarray(M, dtype= float) In []: timeit triu(M) 10 loops, best of 3: 195 ms per loop In []: timeit triu_(M) 10 loops, best of 3: 157 ms per loop I have attached a crude 'fix' incase someone is interested. Regards, eat -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: twodim_base_fix.py Type: application/octet-stream Size: 3886 bytes Desc: not available URL: From josef.pktd at gmail.com Wed Jan 26 07:35:44 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 26 Jan 2011 07:35:44 -0500 Subject: [Numpy-discussion] tril, triu, document/ implementation conflict In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 7:22 AM, eat wrote: > Hi, > > I just noticed a document/ implementation conflict with tril and triu. > According tril documentation it should return of same shape and data-type as > called. But this is not the case at least with dtype bool. 
> > The input shape is referred as (M, N) in tril and triu, but as (N, M) in > tri. > Inconsistent? > > Also I'm not very happy with the performance, at least dtype bool can be > accelerated as follows. > > In []: M= ones((2000, 3000), dtype= bool) > In []: timeit triu(M) > 10 loops, best of 3: 173 ms per loop > In []: timeit triu_(M) > 10 loops, best of 3: 107 ms per loop > > In []: M= asarray(M, dtype= int) > In []: timeit triu(M) > 10 loops, best of 3: 160 ms per loop > In []: timeit triu_(M) > 10 loops, best of 3: 163 ms per loop > > In []: M= asarray(M, dtype= float) > In []: timeit triu(M) > 10 loops, best of 3: 195 ms per loop > In []: timeit triu_(M) > 10 loops, best of 3: 157 ms per loop > > I have attached a crude 'fix' incase someone is interested. You could open a ticket for this. just one comment: I don't think this is readable, especially if we only look at the source of the function with np.source out= mul(ge(so(ar(m.shape[0]), ar(m.shape[1])), -k), m) from np.source(np.tri) with numpy 1.5.1 m = greater_equal(subtract.outer(arange(N), arange(M)),-k) Josef > > Regards, > eat > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > From e.antero.tammi at gmail.com Wed Jan 26 08:12:46 2011 From: e.antero.tammi at gmail.com (eat) Date: Wed, 26 Jan 2011 15:12:46 +0200 Subject: [Numpy-discussion] tril, triu, document/ implementation conflict In-Reply-To: References: Message-ID: Hi, On Wed, Jan 26, 2011 at 2:35 PM, wrote: > On Wed, Jan 26, 2011 at 7:22 AM, eat wrote: > > Hi, > > > > I just noticed a document/ implementation conflict with tril and triu. > > According tril documentation it should return of same shape and data-type > as > > called. But this is not the case at least with dtype bool. > > > > The input shape is referred as (M, N) in tril and triu, but as (N, M) in > > tri. > > Inconsistent? > Any comments about the names for rows and cols. I prefer (M, N). > > > > Also I'm not very happy with the performance, at least dtype bool can be > > accelerated as follows. > > > > In []: M= ones((2000, 3000), dtype= bool) > > In []: timeit triu(M) > > 10 loops, best of 3: 173 ms per loop > > In []: timeit triu_(M) > > 10 loops, best of 3: 107 ms per loop > > > > In []: M= asarray(M, dtype= int) > > In []: timeit triu(M) > > 10 loops, best of 3: 160 ms per loop > > In []: timeit triu_(M) > > 10 loops, best of 3: 163 ms per loop > > > > In []: M= asarray(M, dtype= float) > > In []: timeit triu(M) > > 10 loops, best of 3: 195 ms per loop > > In []: timeit triu_(M) > > 10 loops, best of 3: 157 ms per loop > > > > I have attached a crude 'fix' incase someone is interested. > > You could open a ticket for this. > > just one comment: > I don't think this is readable, especially if we only look at the > source of the function with np.source > > out= mul(ge(so(ar(m.shape[0]), ar(m.shape[1])), -k), m) > > from np.source(np.tri) with numpy 1.5.1 > m = greater_equal(subtract.outer(arange(N), arange(M)),-k) I agree, thats why I called it crude. Before opening a ticket I'll try to figure out if there exists somewhere in numpy .astype functionality, but not copying if allready proper dtype. Also I'm afraid that I can't produce sufficient testing. 
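For the dtype issue alone, one possible sketch (only an illustration, not the attached file) would be:

def triu_keep_dtype(m, k=0):
    # zero (or False) out everything strictly below the k-th diagonal,
    # keeping m's dtype instead of upcasting through multiplication
    m = np.asarray(m)
    below = np.tri(m.shape[0], m.shape[1], k=k - 1, dtype=bool)
    out = m.copy()
    out[below] = 0
    return out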
Regards, eat > > Josef > > > > > Regards, > > eat > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Wed Jan 26 08:40:56 2011 From: cournape at gmail.com (David Cournapeau) Date: Wed, 26 Jan 2011 22:40:56 +0900 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: <4D3FEDB0.5020700@student.matnat.uio.no> References: <4D3F734D.3000902@silveregg.co.jp> <4D3FEDB0.5020700@student.matnat.uio.no> Message-ID: On Wed, Jan 26, 2011 at 6:47 PM, Dag Sverre Seljebotn wrote: > On 01/26/2011 02:05 AM, David wrote: >> On 01/26/2011 01:42 AM, Charles R Harris wrote: >> >>> Hi All, >>> >>> Just thought it was time to start discussing a release schedule for >>> numpy 2.0 so we have something to aim at. I'm thinking sometime in the >>> period April-June might be appropriate. There is a lot coming with the >>> next release: the Enthought's numpy refactoring, Mark's float16 and >>> iterator work, and support for IronPython. How do things look to the >>> folks involved in those projects? >>> >> One thing which I was wondering about numpy 2.0: what's the story for >> the C-API compared to 1.x for extensions. Is it fundamentally different >> so that extensions will need to be rewritten ? I especially wonder about >> scipy and cython's codegen backend, >> > > For CPython, my understanding is that extensions that access struct > fields directly without accessor macros need to be changed, but not much > else. There's a "backwards-compatability" PyArray_* API for CPython. > > That doesn't work for .NET, but neither does anything else in C > extensions. So in the SciPy port to .NET there's my efforts to replace > f2py with fwrap/Cython, and many SciPy C extensions will be rewritten in > Cython. These will use the Npy_* interface (or backwards-compatability > PyArray_* wrappers in numpy.pxd, but these only work in Cython under > .NET, not in C, due to typing issues (what is "object" and so on)). Ok, good to know. A good test would be to continuously build numpy + scipy on top of it ASAP. Do you think cython (or is it sage) could donate some CPU resources on the cython CI server for numpy ? I could spend some time to make that work, cheers, David > > Dag Sverre > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From scipy at SamuelJohn.de Wed Jan 26 09:08:32 2011 From: scipy at SamuelJohn.de (Samuel John) Date: Wed, 26 Jan 2011 15:08:32 +0100 Subject: [Numpy-discussion] How to tell if I succeeded to build numpy with amd, umfpack and lapack Message-ID: Hi there! I have successfully built numpy 1.5 on ubuntu lucid (32 for now). I think I got ATLAS/lapack/BLAS support, and if I > ldd linalg/lapack_lite.so I see that my libptf77blas.so etc. are successfully linked. :-) However, how to I find out, if (and where) libamd.a and libumfpack.a have been found and (statically) linked. As far as I understand, I they are not present, a fallback in pure python is used, right? Is there a recommended way, I can query against which libs numpy has been built? 
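The closest I have found so far is the sketch below, but I am not sure it really answers the question (np.show_config only prints whatever numpy.distutils recorded at build time, and get_info only reports what it can locate on this machine right now):

import numpy as np
np.show_config()       # blas/lapack/atlas sections recorded when numpy was built

from numpy.distutils.system_info import get_info
get_info('umfpack')    # what system_info can find locally, not necessarily
get_info('amd')        # what the installed numpy was actually linked against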
So I can be sure numpy uses my own compiled versions of libamd, lapack and so forth. And the fftw3 is no longer supported, I guess (even if it is still mentioned in the site.cfg.example) Bests, Samuel -- Dipl.-Inform. Samuel John - - - - - - - - - - - - - - - - - - - - - - - - - PhD student, CoR-Lab(.de) and Neuroinformatics Group, Faculty of Technology, D33594 Bielefeld in cooperation with the HONDA Research Institute Europe GmbH jabber: samueljohn at jabber.org - - - - - - - - - - - - - - - - - - - - - - - - - From bsouthey at gmail.com Wed Jan 26 11:51:02 2011 From: bsouthey at gmail.com (Bruce Southey) Date: Wed, 26 Jan 2011 10:51:02 -0600 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: <4D4050F6.6060603@gmail.com> On 01/25/2011 10:28 PM, Mark Wiebe wrote: > On Tue, Jan 25, 2011 at 5:18 PM, Charles R Harris > > wrote: > > > > On Tue, Jan 25, 2011 at 1:13 PM, Travis Oliphant > > wrote: > > > On Jan 25, 2011, at 10:42 AM, Charles R Harris wrote: > > > Hi All, > > > > Just thought it was time to start discussing a release > schedule for numpy 2.0 so we have something to aim at. I'm > thinking sometime in the period April-June might be > appropriate. There is a lot coming with the next release: the > Enthought's numpy refactoring, Mark's float16 and iterator > work, and support for IronPython. How do things look to the > folks involved in those projects? > > > My suggestion is to do a 1.6 relatively soon, as the current trunk > feels pretty stable to me, and it would be nice to release the > features without having to go through the whole merging process. > > > I would target June / July at this point ;-) I know I > deserve a "I told you so" from Chuck --- I will take it. > > > How much remains to get done? > > > My changes probably make merging the refactor more challenging too. > > > There is a bit of work that Mark is doing that would be good > to include, also some modifications to the re-factoring that > will support better small array performance. > > > Not everything needs to go into first release as long as the > following releases are backward compatible. So the ABI needs it's > final form as soon as possible. Is it still in flux? > > > I would suggest it is - there are a number of things I think could be > improved in it, and it would be nice to bake in the underlying support > features to make lazy/deferred evaluation of array expressions possible. > > It may make sense for a NumPy 1.6 to come out in March / April > in the interim. > > > Pulling out the changes to attain backward compatibility isn't > getting any easier. I'd rather shoot for 2.0 in June. What can the > rest of us do to help move things along? > > > I took a shot at fixing the ABI compatibility, and if PyArray_ArrFunc > was the main issue, then that might be done. An ABI compatible 1.6 > with the datetime and half types should be doable, just some > extensions might get confused if they encounter arrays made with the > new data types. > > -Mark > I do understand that it may take time for the 'dust to settle' but there is the opportunity to implement aspects that may require 'significant' notification or least start the process for any appropriate changes. So, would it be possible to start developing some strategic plan of the changes that will occur? The type of things I think are in terms of: 1) Notifying/warning users of the API changes that will occur. I agree with Chuck that other 'eyes' need to see it. 2) Add any desired depreciation warnings but I do not know of any. 
Perhaps the files in numpy/oldnumeric and numpy/numarray - if these are still important then these should have a better home since both have not had a release since mid 2006. 3) Changes or reorganization of the namespace. My personal one is my ticket 1051 (Renaming and removing NaN and related IEEE754 special cases): http://projects.scipy.org/numpy/ticket/1051 Hopefully some of it will be applied. 4) Changes in functions. Examples: Ticket 1262 (genfromtxt: dtype should be None by default) http://projects.scipy.org/numpy/ticket/1262 Tickets 465 and 518 related to the accumulator dtype argument issues because this topic keeps appearing on the list. http://projects.scipy.org/numpy/ticket/518 http://projects.scipy.org/numpy/ticket/465 For example, perhaps changing the default arguments of mean in numpy/core/fromnumeric.py as that allows the old behavior to remain by changing the dtype argument: Change: def mean(a, axis=None, dtype=None, out=None): To: def mean(a, axis=None, dtype=float, out=None): 5) Adding any enhancement patches like median of Ticket 1213 http://projects.scipy.org/numpy/ticket/1213 Bruce -------------- next part -------------- An HTML attachment was scrubbed... URL: From liuhuanjim013 at gmail.com Wed Jan 26 12:48:48 2011 From: liuhuanjim013 at gmail.com (Huan Liu) Date: Wed, 26 Jan 2011 17:48:48 +0000 (UTC) Subject: [Numpy-discussion] 3d plane to point cloud fitting using SVD References: <7f9779a7b1b4b8c8be07c5663ca74c50@mmb.pcb.ub.es> Message-ID: Hi, I just confirmed Stefan's answer on one of the examples in http://www.mathworks.co.jp/matlabcentral/newsreader/view_thread/262996 matlab: A = randn(100,2)*[2 0;3 0;-1 2]'; A = A + randn(size(A))/3; [U,S,V] = svd(A); X = V(:,end) python: from numpy import * A = random.randn(100,2)*mat([[2,3,-1],[0,0,2]]) A = A + random.randn(100,3)/3.0 u,s,vh = linalg.linalg.svd(A) v = vh.conj().transpose() print v[:,-1] It works! Thanks Peter for bringing this up and Stefan for answering! Huan From mwwiebe at gmail.com Wed Jan 26 15:10:32 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Wed, 26 Jan 2011 12:10:32 -0800 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 2:23 AM, Ralf Gommers wrote: > On Wed, Jan 26, 2011 at 12:28 PM, Mark Wiebe wrote: > >> On Tue, Jan 25, 2011 at 5:18 PM, Charles R Harris < >> charlesr.harris at gmail.com> wrote: >> >>> On Tue, Jan 25, 2011 at 1:13 PM, Travis Oliphant >> > wrote: >>> >>>> >>>> It may make sense for a NumPy 1.6 to come out in March / April in the >>>> interim. >>>> >>>> >>> Pulling out the changes to attain backward compatibility isn't getting >>> any easier. I'd rather shoot for 2.0 in June. What can the rest of us do to >>> help move things along? >>> >> >> > Focusing on 2.0 makes sense to me too. Besides that, March/April is bad > timing for me so someone else should volunteer to be the release manager if > we go for a 1.6. > I think sooner than March/April might be a possibility. I've gotten the ABI working so this succeeds on my machine: * Build SciPy against NumPy 1.5.1 * Build NumPy trunk * Run NumPy trunk with the 1.5.1-built SciPy - all tests pass except for one (PIL image resize, which tests all float types and half lacks the precisions necessary) I took a shot at fixing the ABI compatibility, and if PyArray_ArrFunc was >> the main issue, then that might be done. 
An ABI compatible 1.6 with the >> datetime and half types should be doable, just some extensions might get >> confused if they encounter arrays made with the new data types. >> >> Even if you fixed the ABI incompatibility (I don't know enough about the > issue to confirm that), I'm not sure how much value there is in a release > with as main new feature two dtypes that are not going to work well with > scipy/other binaries compiled against 1.5. > I've recently gotten the faster ufunc NEP implementation finished except for generalized ufuncs, and most things work the same or faster with it. Below are some timings of 1.5.1 vs the new_iterator branch. In particular, the overhead on small arrays hasn't gotten worse, but the output memory layout speeds up some operations by a lot. To exercise the iterator a bit, and try to come up with a better approach than the generalized ufuncs, I came up with a new function, 'einsum' for the Einstein summation convention. I'll send another email about it, but it for instance solves the problem discussed here: http://mail.scipy.org/pipermail/numpy-discussion/2006-May/020506.html as "c = np.einsum('rij,rjk->rik', a, b)" -Mark The timings: In [1]: import numpy as np In [2]: np.version.version Out[2]: '1.5.1' In [3]: a = np.arange(9.).reshape(3,3); b = a.copy() In [4]: timeit a + b 100000 loops, best of 3: 3.48 us per loop In [5]: timeit 2 * a 100000 loops, best of 3: 6.07 us per loop In [6]: timeit np.sum(a) 100000 loops, best of 3: 7.19 us per loop In [7]: a = np.arange(1000000).reshape(100,100,100); b = a.copy() In [8]: timeit a + b 100 loops, best of 3: 17.1 ms per loop In [9]: a = np.arange(1920*1080*3).reshape(1080,1920,3).swapaxes(0,1) In [10]: timeit a * a 1 loops, best of 3: 794 ms per loop In [1]: import numpy as np In [2]: np.version.version Out[2]: '2.0.0.dev-c97e9d5' In [3]: a = np.arange(9.).reshape(3,3); b = a.copy() In [4]: timeit a + b 100000 loops, best of 3: 3.24 us per loop In [5]: timeit 2 * a 100000 loops, best of 3: 6.12 us per loop In [6]: timeit np.sum(a) 100000 loops, best of 3: 6.6 us per loop In [7]: a = np.arange(1000000).reshape(100,100,100); b = a.copy() In [8]: timeit a + b 100 loops, best of 3: 17 ms per loop In [9]: a = np.arange(1920*1080*3).reshape(1080,1920,3).swapaxes(0,1) In [10]: timeit a * a 10 loops, best of 3: 116 ms per loop -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwwiebe at gmail.com Wed Jan 26 15:27:51 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Wed, 26 Jan 2011 12:27:51 -0800 Subject: [Numpy-discussion] einsum Message-ID: I wrote a new function, einsum, which implements Einstein summation notation, and I'd like comments/thoughts from people who might be interested in this kind of thing. In testing it, it is also faster than many of NumPy's built-in functions, except for dot and inner. At the bottom of this email you can find the documentation blurb I wrote for it, and here are some timings: In [1]: import numpy as np In [2]: a = np.arange(25).reshape(5,5) In [3]: timeit np.einsum('ii', a) 100000 loops, best of 3: 3.45 us per loop In [4]: timeit np.trace(a) 100000 loops, best of 3: 9.8 us per loop In [5]: timeit np.einsum('ii->i', a) 1000000 loops, best of 3: 1.19 us per loop In [6]: timeit np.diag(a) 100000 loops, best of 3: 7 us per loop In [7]: b = np.arange(30).reshape(5,6) In [8]: timeit np.einsum('ij,jk', a, b) 10000 loops, best of 3: 11.4 us per loop In [9]: timeit np.dot(a, b) 100000 loops, best of 3: 2.8 us per loop In [10]: a = np.arange(10000.) 
In [11]: timeit np.einsum('i->', a) 10000 loops, best of 3: 22.1 us per loop In [12]: timeit np.sum(a) 10000 loops, best of 3: 25.5 us per loop -Mark The documentation: einsum(subscripts, *operands, out=None, dtype=None, order='K', casting='safe') Evaluates the Einstein summation convention on the operands. Using the Einstein summation convention, many common multi-dimensional array operations can be represented in a simple fashion. This function provides a way to compute such summations. The best way to understand this function is to try the examples below, which show how many common NumPy functions can be implemented as calls to einsum. The subscripts string is a comma-separated list of subscript labels, where each label refers to a dimension of the corresponding operand. Repeated subscript labels in one operand take the diagonal. For example, ``np.einsum('ii', a)`` is equivalent to ``np.trace(a)``. Whenever a label is repeated, it is summed, so ``np.einsum('i,i', a, b)`` is equivalent to ``np.inner(a,b)``. If a label appears only once, it is not summed, so ``np.einsum('i', a)`` produces a view of ``a`` with no changes. The order of labels in the output is by default alphabetical. This means that ``np.einsum('ij', a)`` doesn't affect a 2D array, while ``np.einsum('ji', a)`` takes its transpose. The output can be controlled by specifying output subscript labels as well. This specifies the label order, and allows summing to be disallowed or forced when desired. The call ``np.einsum('i->', a)`` is equivalent to ``np.sum(a, axis=-1)``, and ``np.einsum('ii->i', a)`` is equivalent to ``np.diag(a)``. It is also possible to control how broadcasting occurs using an ellipsis. To take the trace along the first and last axes, you can do ``np.einsum('i...i', a)``, or to do a matrix-matrix product with the left-most indices instead of rightmost, you can do ``np.einsum('ij...,jk...->ik...', a, b)``. When there is only one operand, no axes are summed, and no output parameter is provided, a view into the operand is returned instead of a new array. Thus, taking the diagonal as ``np.einsum('ii->i', a)`` produces a view. Parameters ---------- subscripts : string Specifies the subscripts for summation. operands : list of array_like These are the arrays for the operation. out : None or array If provided, the calculation is done into this array. dtype : None or data type If provided, forces the calculation to use the data type specified. Note that you may have to also give a more liberal ``casting`` parameter to allow the conversions. order : 'C', 'F', 'A', or 'K' Controls the memory layout of the output. 'C' means it should be C contiguous, 'F' means it should be Fortran contiguous, 'A' means it should be 'F' if the inputs are all 'F', 'C' otherwise. 'K' means it should be as close to the layout of the inputs as is possible, including arbitrarily permuted axes. casting : 'no', 'equiv', 'safe', 'same_kind', 'unsafe' Controls what kind of data casting may occur. Setting this to 'unsafe' is not recommended, as it can adversely affect accumulations. 'no' means the data types should not be cast at all. 'equiv' means only byte-order changes are allowed. 'safe' means only casts which can preserve values are allowed. 'same_kind' means only safe casts or casts within a kind, like float64 to float32, are allowed. 'unsafe' means any data conversions may be done. Returns ------- output : ndarray The calculation based on the Einstein summation convention.
See Also -------- dot, inner, outer, tensordot Examples -------- >>> a = np.arange(25).reshape(5,5) >>> b = np.arange(5) >>> c = np.arange(6).reshape(2,3) >>> np.einsum('ii', a) 60 >>> np.trace(a) 60 >>> np.einsum('ii->i', a) array([ 0, 6, 12, 18, 24]) >>> np.diag(a) array([ 0, 6, 12, 18, 24]) >>> np.einsum('ij,j', a, b) array([ 30, 80, 130, 180, 230]) >>> np.dot(a, b) array([ 30, 80, 130, 180, 230]) >>> np.einsum('ji', c) array([[0, 3], [1, 4], [2, 5]]) >>> c.T array([[0, 3], [1, 4], [2, 5]]) >>> np.einsum(',', 3, c) array([[ 0, 3, 6], [ 9, 12, 15]]) >>> np.multiply(3, c) array([[ 0, 3, 6], [ 9, 12, 15]]) >>> np.einsum('i,i', b, b) 30 >>> np.inner(b,b) 30 >>> np.einsum('i,j', np.arange(2)+1, b) array([[0, 1, 2, 3, 4], [0, 2, 4, 6, 8]]) >>> np.outer(np.arange(2)+1, b) array([[0, 1, 2, 3, 4], [0, 2, 4, 6, 8]]) >>> np.einsum('i...->', a) array([50, 55, 60, 65, 70]) >>> np.sum(a, axis=0) array([50, 55, 60, 65, 70]) >>> a = np.arange(60.).reshape(3,4,5) >>> b = np.arange(24.).reshape(4,3,2) >>> np.einsum('ijk,jil->kl', a, b) array([[ 4400., 4730.], [ 4532., 4874.], [ 4664., 5018.], [ 4796., 5162.], [ 4928., 5306.]]) >>> np.tensordot(a,b, axes=([1,0],[0,1])) array([[ 4400., 4730.], [ 4532., 4874.], [ 4664., 5018.], [ 4796., 5162.], [ 4928., 5306.]]) -------------- next part -------------- An HTML attachment was scrubbed... URL: From josh.holbrook at gmail.com Wed Jan 26 16:36:45 2011 From: josh.holbrook at gmail.com (Joshua Holbrook) Date: Wed, 26 Jan 2011 12:36:45 -0900 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 11:27 AM, Mark Wiebe wrote: > I wrote a new function, einsum, which implements Einstein summation > notation, and I'd like comments/thoughts from people who might be interested > in this kind of thing. This sounds really cool! I've definitely considered doing something like this previously, but never really got around to seriously figuring out any sensible API. Do you have the source up somewhere? I'd love to try it out myself. --Josh From mwwiebe at gmail.com Wed Jan 26 16:48:24 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Wed, 26 Jan 2011 13:48:24 -0800 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 1:36 PM, Joshua Holbrook wrote: > On Wed, Jan 26, 2011 at 11:27 AM, Mark Wiebe wrote: > > I wrote a new function, einsum, which implements Einstein summation > > notation, and I'd like comments/thoughts from people who might be > interested > > in this kind of thing. > > This sounds really cool! I've definitely considered doing something > like this previously, but never really got around to seriously > figuring out any sensible API. > > Do you have the source up somewhere? I'd love to try it out myself. > You can check out the new_iterator branch from here: https://github.com/m-paradox/numpy $ git clone https://github.com/m-paradox/numpy.git Cloning into numpy... -Mark -------------- next part -------------- An HTML attachment was scrubbed... 
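A quick sanity check once the branch is built, assuming it exposes np.einsum as described above, is to compare it against the functions it generalizes:

>>> import numpy as np
>>> a = np.arange(25.).reshape(5,5)
>>> b = np.arange(30.).reshape(5,6)
>>> np.allclose(np.einsum('ij,jk', a, b), np.dot(a, b))
True
>>> np.einsum('ii', a) == np.trace(a)
True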
URL: From josh.holbrook at gmail.com Wed Jan 26 17:01:14 2011 From: josh.holbrook at gmail.com (Joshua Holbrook) Date: Wed, 26 Jan 2011 13:01:14 -0900 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 12:48 PM, Mark Wiebe wrote: > On Wed, Jan 26, 2011 at 1:36 PM, Joshua Holbrook > wrote: >> >> On Wed, Jan 26, 2011 at 11:27 AM, Mark Wiebe wrote: >> > I wrote a new function, einsum, which implements Einstein summation >> > notation, and I'd like comments/thoughts from people who might be >> > interested >> > in this kind of thing. >> >> This sounds really cool! I've definitely considered doing something >> like this previously, but never really got around to seriously >> figuring out any sensible API. >> >> Do you have the source up somewhere? I'd love to try it out myself. > > You can check out the new_iterator branch from here: > https://github.com/m-paradox/numpy > $ git clone https://github.com/m-paradox/numpy.git > Cloning into numpy... > -Mark > Thanks for the link! How closely coupled is this new code with numpy's internals? That is, could you factor it out into its own package? If so, then people could have immediate use out of it without having to integrate it into numpy proper. --Josh From mwwiebe at gmail.com Wed Jan 26 17:43:14 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Wed, 26 Jan 2011 14:43:14 -0800 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 2:01 PM, Joshua Holbrook wrote: > > How closely coupled is this new code with numpy's internals? That is, > could you factor it out into its own package? If so, then people could > have immediate use out of it without having to integrate it into numpy > proper. The code depends heavily on the iterator I wrote, and I think the idea itself depends on having a good dynamic multi-dimensional array library. When the numpy-refactor branch is complete, this would be part of libndarray, and could be used directly from C without depending on Python. -Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Wed Jan 26 17:48:24 2011 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 26 Jan 2011 16:48:24 -0600 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 16:43, Mark Wiebe wrote: > On Wed, Jan 26, 2011 at 2:01 PM, Joshua Holbrook > wrote: >> >> >> How closely coupled is this new code with numpy's internals? That is, >> could you factor it out into its own package? If so, then people could >> have immediate use out of it without having to integrate it into numpy >> proper. > > The code depends heavily on the iterator I wrote, and I think the idea > itself depends on having a good dynamic multi-dimensional array library. > ?When the numpy-refactor branch is complete, this would be part of > libndarray, and could be used directly from C without depending on Python. It think his real question is whether einsum() and the iterator stuff can live in a separate module that *uses* a released version of numpy rather than a development branch. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." ? 
-- Umberto Eco From josh.holbrook at gmail.com Wed Jan 26 18:05:33 2011 From: josh.holbrook at gmail.com (Joshua Holbrook) Date: Wed, 26 Jan 2011 14:05:33 -0900 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: > > It think his real question is whether einsum() and the iterator stuff > can live in a separate module that *uses* a released version of numpy > rather than a development branch. > > -- > Robert Kern > Indeed, I would like to be able to install and use einsum() without having to install another version of numpy. Even if it depends on features of a new numpy, it'd be nice to have it be a separate module. --Josh From klemm at phys.ethz.ch Wed Jan 26 18:18:30 2011 From: klemm at phys.ethz.ch (Hanno Klemm) Date: Thu, 27 Jan 2011 00:18:30 +0100 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: Mark, interesting idea. Given the fact that in 2-d euclidean metric, the Einstein summation conventions are only a way to write out conventional matrix multiplications, do you consider at some point to include a non-euclidean metric in this thing? (As you have in special relativity, for example) Something along the lines: eta = np.diag(-1,1,1,1) a = np.array(1,2,3,4) b = np.array(1,1,1,1) such that einsum('i,i', a,b, metric=eta) = -1 + 2 + 3 + 4 I don't know how useful it would be, just a thought, Hanno Am 26.01.2011 um 21:27 schrieb Mark Wiebe: > I wrote a new function, einsum, which implements Einstein summation > notation, and I'd like comments/thoughts from people who might be > interested in this kind of thing. From gael.varoquaux at normalesup.org Wed Jan 26 18:21:06 2011 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Thu, 27 Jan 2011 00:21:06 +0100 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: <20110126232106.GB32531@phare.normalesup.org> On Thu, Jan 27, 2011 at 12:18:30AM +0100, Hanno Klemm wrote: > interesting idea. Given the fact that in 2-d euclidean metric, the > Einstein summation conventions are only a way to write out > conventional matrix multiplications, do you consider at some point to > include a non-euclidean metric in this thing? (As you have in special > relativity, for example) In my experience, Einstein summation conventions are quite incomprehensible for people who haven't studies relativity (they aren't used much outside some narrow fields of physics). If you start adding metrics, you'll make it even harder for people to follow. My 2 cents, Ga?l From mwwiebe at gmail.com Wed Jan 26 18:21:55 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Wed, 26 Jan 2011 15:21:55 -0800 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 3:05 PM, Joshua Holbrook wrote: > > > > It think his real question is whether einsum() and the iterator stuff > > can live in a separate module that *uses* a released version of numpy > > rather than a development branch. > > > > -- > > Robert Kern > > > > Indeed, I would like to be able to install and use einsum() without > having to install another version of numpy. Even if it depends on > features of a new numpy, it'd be nice to have it be a separate module. > > --Josh > Ah, sorry for misunderstanding. That would actually be very difficult, as the iterator required a fair bit of fixes and adjustments to the core. The new_iterator branch should be 1.5 ABI compatible, if that helps. -Mark -------------- next part -------------- An HTML attachment was scrubbed... 
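For anyone who would rather stay on a released NumPy for now, most of the einsum examples in this thread can be approximated with existing functions; a rough sketch of the correspondence, using nothing from the new branch:

import numpy as np
a = np.arange(25).reshape(5,5)
b = np.arange(30).reshape(5,6)
np.tensordot(a, b, axes=([1], [0]))     # ~ np.einsum('ij,jk->ik', a, b), i.e. np.dot(a, b)
a.diagonal()                            # ~ np.einsum('ii->i', a)
a.trace()                               # ~ np.einsum('ii', a)
np.outer(np.arange(2)+1, np.arange(5))  # ~ np.einsum('i,j->ij', ...)

What the string notation adds is the more general contractions (and the no-copy view cases) in a single call.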
URL: From mwwiebe at gmail.com Wed Jan 26 18:29:28 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Wed, 26 Jan 2011 15:29:28 -0800 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 3:18 PM, Hanno Klemm wrote: > > Mark, > > interesting idea. Given the fact that in 2-d euclidean metric, the > Einstein summation conventions are only a way to write out > conventional matrix multiplications, do you consider at some point to > include a non-euclidean metric in this thing? (As you have in special > relativity, for example) > > Something along the lines: > > eta = np.diag(-1,1,1,1) > > a = np.array(1,2,3,4) > b = np.array(1,1,1,1) > > such that > > einsum('i,i', a,b, metric=eta) = -1 + 2 + 3 + 4 > This particular example is already doable as follows: >>> eta = np.diag([-1,1,1,1]) >>> eta array([[-1, 0, 0, 0], [ 0, 1, 0, 0], [ 0, 0, 1, 0], [ 0, 0, 0, 1]]) >>> a = np.array([1,2,3,4]) >>> b = np.array([1,1,1,1]) >>> np.einsum('i,j,ij', a, b, eta) 8 I think that's right, did I understand you correctly? Cheers, Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From klemm at phys.ethz.ch Wed Jan 26 18:48:44 2011 From: klemm at phys.ethz.ch (Hanno Klemm) Date: Thu, 27 Jan 2011 00:48:44 +0100 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: <120CEF14-89D1-4750-A309-8665DE30C44B@phys.ethz.ch> Am 27.01.2011 um 00:29 schrieb Mark Wiebe: > On Wed, Jan 26, 2011 at 3:18 PM, Hanno Klemm > wrote: > > Mark, > > interesting idea. Given the fact that in 2-d euclidean metric, the > Einstein summation conventions are only a way to write out > conventional matrix multiplications, do you consider at some point to > include a non-euclidean metric in this thing? (As you have in special > relativity, for example) > > Something along the lines: > > eta = np.diag(-1,1,1,1) > > a = np.array(1,2,3,4) > b = np.array(1,1,1,1) > > such that > > einsum('i,i', a,b, metric=eta) = -1 + 2 + 3 + 4 > > This particular example is already doable as follows: > > >>> eta = np.diag([-1,1,1,1]) > >>> eta > array([[-1, 0, 0, 0], > [ 0, 1, 0, 0], > [ 0, 0, 1, 0], > [ 0, 0, 0, 1]]) > >>> a = np.array([1,2,3,4]) > >>> b = np.array([1,1,1,1]) > >>> np.einsum('i,j,ij', a, b, eta) > 8 > > I think that's right, did I understand you correctly? > > Cheers, > Mark Yes, that's what I had in mind. Thanks. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Wed Jan 26 19:35:31 2011 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 26 Jan 2011 18:35:31 -0600 Subject: [Numpy-discussion] einsum In-Reply-To: <20110126232106.GB32531@phare.normalesup.org> References: <20110126232106.GB32531@phare.normalesup.org> Message-ID: On Wednesday, January 26, 2011, Gael Varoquaux wrote: > On Thu, Jan 27, 2011 at 12:18:30AM +0100, Hanno Klemm wrote: >> interesting idea. Given the fact that in 2-d euclidean metric, the >> Einstein summation conventions are only a way to write out >> conventional matrix multiplications, do you consider at some point to >> include a non-euclidean metric in this thing? (As you have in special >> relativity, for example) > > In my experience, Einstein summation conventions are quite > incomprehensible for people who haven't studies relativity (they aren't > used much outside some narrow fields of physics). If you start adding > metrics, you'll make it even harder for people to follow. 
> > My 2 cents, > > Ga?l > Just to dispel the notion that Einstein notation is only used in the study of relativity, I can personally attest that Einstein notation is used in the field of fluid dynamics and some aspects of meteorology. This is really a neat idea and I support the idea of packaging it as a separate module. Ben Root From josh.holbrook at gmail.com Wed Jan 26 20:02:40 2011 From: josh.holbrook at gmail.com (Joshua Holbrook) Date: Wed, 26 Jan 2011 16:02:40 -0900 Subject: [Numpy-discussion] einsum In-Reply-To: References: <20110126232106.GB32531@phare.normalesup.org> Message-ID: > Ah, sorry for misunderstanding. That would actually be very difficult, > as the iterator required a fair bit of fixes and adjustments to the core. > The new_iterator branch should be 1.5 ABI compatible, if that helps. I see. Perhaps the fixes and adjustments can/should be included with numpy standard, even if the Einstein notation package is made a separate module. > Just to dispel the notion that Einstein notation is only used in the > study of relativity, I can personally attest that Einstein notation is > used in the field of fluid dynamics and some aspects of meteorology. Einstein notation is also used in solid mechanics. --Josh From tjhnson at gmail.com Wed Jan 26 20:18:53 2011 From: tjhnson at gmail.com (T J) Date: Wed, 26 Jan 2011 17:18:53 -0800 Subject: [Numpy-discussion] einsum In-Reply-To: References: <20110126232106.GB32531@phare.normalesup.org> Message-ID: On Wed, Jan 26, 2011 at 5:02 PM, Joshua Holbrook wrote: >> Ah, sorry for misunderstanding. ?That would actually be very difficult, >> as the iterator required a fair bit of fixes and adjustments to the core. >> The new_iterator branch should be 1.5 ABI compatible, if that helps. > > I see. Perhaps the fixes and adjustments can/should be included with > numpy standard, even if the Einstein notation package is made a > separate module. > > Indeed, I would like to be able to install and use einsum() without > having to install another version of numpy. Even if it depends on > features of a new numpy, it'd be nice to have it be a separate module. I don't really understand the desire to have this single function exist in a separate package. If it requires the new version of NumPy, then you'll have to install/upgrade either way...and if it comes as part of that new NumPy, then you are already set. Doesn't a separate package complicate things unnecessarily? It make sense to me if einsum consisted of many functions (such as Bottleneck). From josef.pktd at gmail.com Wed Jan 26 20:23:01 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 26 Jan 2011 20:23:01 -0500 Subject: [Numpy-discussion] einsum In-Reply-To: References: <20110126232106.GB32531@phare.normalesup.org> Message-ID: On Wed, Jan 26, 2011 at 7:35 PM, Benjamin Root wrote: > On Wednesday, January 26, 2011, Gael Varoquaux > wrote: >> On Thu, Jan 27, 2011 at 12:18:30AM +0100, Hanno Klemm wrote: >>> interesting idea. Given the fact that in 2-d euclidean metric, the >>> Einstein summation conventions are only a way to write out >>> conventional matrix multiplications, do you consider at some point to >>> include a non-euclidean metric in this thing? (As you have in special >>> relativity, for example) >> >> In my experience, Einstein summation conventions are quite >> incomprehensible for people who haven't studies relativity (they aren't >> used much outside some narrow fields of physics). If you start adding >> metrics, you'll make it even harder for people to follow. 
>> >> My 2 cents, >> >> Ga?l >> > > Just to dispel the notion that Einstein notation is only used in the > study of relativity, I can personally attest that Einstein notation is > used in the field of fluid dynamics and some aspects of meteorology. > This is really a neat idea and I support the idea of packaging it as a > separate module. So, if I read the examples correctly we finally get dot along an axis np.einsum('ijk,ji->', a, b) np.einsum('ijk,jik->k', a, b) or something like this. the notation might require getting used to but it doesn't look worse than figuring out what tensordot does. The only disadvantage I see, is that choosing the axes to operate on in a program or function requires string manipulation. Josef > > Ben Root > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From jrocher at enthought.com Wed Jan 26 21:41:02 2011 From: jrocher at enthought.com (Jonathan Rocher) Date: Wed, 26 Jan 2011 20:41:02 -0600 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: Nice function, and wonderful that it speeds some tasks up. some feedback: the following notation is a little counter intuitive to me: >>> np.einsum('i...->', a) array([50, 55, 60, 65, 70]) >>> np.sum(a, axis=0) array([50, 55, 60, 65, 70]) Since there is nothing after the ->, I expected a scalar not a vector. I might suggest 'i...->...' Just noticed also a typo in the doc: order : 'C', 'F', 'A', or 'K' Controls the memory layout of the output. 'C' means it should be Fortran contiguous. 'F' means it should be Fortran contiguous, should be changed to order : 'C', 'F', 'A', or 'K' Controls the memory layout of the output. 'C' means it should be C contiguous. 'F' means it should be Fortran contiguous, Hope this helps, Jonathan On Wed, Jan 26, 2011 at 2:27 PM, Mark Wiebe wrote: > I wrote a new function, einsum, which implements Einstein summation > notation, and I'd like comments/thoughts from people who might be interested > in this kind of thing. > > In testing it, it is also faster than many of NumPy's built-in functions, > except for dot and inner. At the bottom of this email you can find the > documentation blurb I wrote for it, and here are some timings: > > In [1]: import numpy as np > In [2]: a = np.arange(25).reshape(5,5) > > In [3]: timeit np.einsum('ii', a) > 100000 loops, best of 3: 3.45 us per loop > In [4]: timeit np.trace(a) > 100000 loops, best of 3: 9.8 us per loop > > In [5]: timeit np.einsum('ii->i', a) > 1000000 loops, best of 3: 1.19 us per loop > In [6]: timeit np.diag(a) > 100000 loops, best of 3: 7 us per loop > > In [7]: b = np.arange(30).reshape(5,6) > > In [8]: timeit np.einsum('ij,jk', a, b) > 10000 loops, best of 3: 11.4 us per loop > In [9]: timeit np.dot(a, b) > 100000 loops, best of 3: 2.8 us per loop > > In [10]: a = np.arange(10000.) > > In [11]: timeit np.einsum('i->', a) > 10000 loops, best of 3: 22.1 us per loop > In [12]: timeit np.sum(a) > 10000 loops, best of 3: 25.5 us per loop > > -Mark > > The documentation: > > einsum(subscripts, *operands, out=None, dtype=None, order='K', > casting='safe') > > Evaluates the Einstein summation convention on the operands. > > Using the Einstein summation convention, many common multi-dimensional > array operations can be represented in a simple fashion. This function > provides a way compute such summations. 
> > The best way to understand this function is to try the examples below, > which show how many common NumPy functions can be implemented as > calls to einsum. > > The subscripts string is a comma-separated list of subscript labels, > where each label refers to a dimension of the corresponding operand. > Repeated subscripts labels in one operand take the diagonal. For > example, > ``np.einsum('ii', a)`` is equivalent to ``np.trace(a)``. > > Whenever a label is repeated, it is summed, so ``np.einsum('i,i', a, > b)`` > is equivalent to ``np.inner(a,b)``. If a label appears only once, > it is not summed, so ``np.einsum('i', a)`` produces a view of ``a`` > with no changes. > > The order of labels in the output is by default alphabetical. This > means that ``np.einsum('ij', a)`` doesn't affect a 2D array, while > ``np.einsum('ji', a)`` takes its transpose. > > The output can be controlled by specifying output subscript labels > as well. This specifies the label order, and allows summing to be > disallowed or forced when desired. The call ``np.einsum('i->', a)`` > is equivalent to ``np.sum(a, axis=-1)``, and > ``np.einsum('ii->i', a)`` is equivalent to ``np.diag(a)``. > > It is also possible to control how broadcasting occurs using > an ellipsis. To take the trace along the first and last axes, > you can do ``np.einsum('i...i', a)``, or to do a matrix-matrix > product with the left-most indices instead of rightmost, you can do > ``np.einsum('ij...,jk...->ik...', a, b)``. > > When there is only one operand, no axes are summed, and no output > parameter is provided, a view into the operand is returned instead > of a new array. Thus, taking the diagonal as ``np.einsum('ii->i', a)`` > produces a view. > > Parameters > ---------- > subscripts : string > Specifies the subscripts for summation. > operands : list of array_like > These are the arrays for the operation. > out : None or array > If provided, the calculation is done into this array. > dtype : None or data type > If provided, forces the calculation to use the data type specified. > Note that you may have to also give a more liberal ``casting`` > parameter to allow the conversions. > order : 'C', 'F', 'A', or 'K' > Controls the memory layout of the output. 'C' means it should > be Fortran contiguous. 'F' means it should be Fortran contiguous, > 'A' means it should be 'F' if the inputs are all 'F', 'C' > otherwise. > 'K' means it should be as close to the layout as the inputs as > is possible, including arbitrarily permuted axes. > casting : 'no', 'equiv', 'safe', 'same_kind', 'unsafe' > Controls what kind of data casting may occur. Setting this to > 'unsafe' is not recommended, as it can adversely affect > accumulations. > 'no' means the data types should not be cast at all. 'equiv' means > only byte-order changes are allowed. 'safe' means only casts > which can preserve values are allowed. 'same_kind' means only > safe casts or casts within a kind, like float64 to float32, are > allowed. 'unsafe' means any data conversions may be done. > > Returns > ------- > output : ndarray > The calculation based on the Einstein summation convention. 
> > See Also > -------- > dot, inner, outer, tensordot > > > Examples > -------- > > >>> a = np.arange(25).reshape(5,5) > >>> b = np.arange(5) > >>> c = np.arange(6).reshape(2,3) > > >>> np.einsum('ii', a) > 60 > >>> np.trace(a) > 60 > > >>> np.einsum('ii->i', a) > array([ 0, 6, 12, 18, 24]) > >>> np.diag(a) > array([ 0, 6, 12, 18, 24]) > > >>> np.einsum('ij,j', a, b) > array([ 30, 80, 130, 180, 230]) > >>> np.dot(a, b) > array([ 30, 80, 130, 180, 230]) > > >>> np.einsum('ji', c) > array([[0, 3], > [1, 4], > [2, 5]]) > >>> c.T > array([[0, 3], > [1, 4], > [2, 5]]) > > >>> np.einsum(',', 3, c) > array([[ 0, 3, 6], > [ 9, 12, 15]]) > >>> np.multiply(3, c) > array([[ 0, 3, 6], > [ 9, 12, 15]]) > > >>> np.einsum('i,i', b, b) > 30 > >>> np.inner(b,b) > 30 > > >>> np.einsum('i,j', np.arange(2)+1, b) > array([[0, 1, 2, 3, 4], > [0, 2, 4, 6, 8]]) > >>> np.outer(np.arange(2)+1, b) > array([[0, 1, 2, 3, 4], > [0, 2, 4, 6, 8]]) > > >>> np.einsum('i...->', a) > array([50, 55, 60, 65, 70]) > >>> np.sum(a, axis=0) > array([50, 55, 60, 65, 70]) > > >>> a = np.arange(60.).reshape(3,4,5) > >>> b = np.arange(24.).reshape(4,3,2) > >>> np.einsum('ijk,jil->kl', a, b) > array([[ 4400., 4730.], > [ 4532., 4874.], > [ 4664., 5018.], > [ 4796., 5162.], > [ 4928., 5306.]]) > >>> np.tensordot(a,b, axes=([1,0],[0,1])) > array([[ 4400., 4730.], > [ 4532., 4874.], > [ 4664., 5018.], > [ 4796., 5162.], > [ 4928., 5306.]]) > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Jonathan Rocher, Enthought, Inc. jrocher at enthought.com 1-512-536-1057 http://www.enthought.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Jan 26 22:09:39 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 26 Jan 2011 20:09:39 -0700 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 1:10 PM, Mark Wiebe wrote: > On Wed, Jan 26, 2011 at 2:23 AM, Ralf Gommers > wrote: > >> On Wed, Jan 26, 2011 at 12:28 PM, Mark Wiebe wrote: >> >>> On Tue, Jan 25, 2011 at 5:18 PM, Charles R Harris < >>> charlesr.harris at gmail.com> wrote: >>> >>>> On Tue, Jan 25, 2011 at 1:13 PM, Travis Oliphant < >>>> oliphant at enthought.com> wrote: >>>> >>>>> >>>>> It may make sense for a NumPy 1.6 to come out in March / April in the >>>>> interim. >>>>> >>>>> >>>> Pulling out the changes to attain backward compatibility isn't getting >>>> any easier. I'd rather shoot for 2.0 in June. What can the rest of us do to >>>> help move things along? >>>> >>> >>> >> Focusing on 2.0 makes sense to me too. Besides that, March/April is bad >> timing for me so someone else should volunteer to be the release manager if >> we go for a 1.6. >> > > I think sooner than March/April might be a possibility. I've gotten the > ABI working so this succeeds on my machine: > > If we go with a 1.6 I have some polynomial stuff I want to put in, probably a weekend or two of work, and there are tickets and pull requests to look through, so to me March-April looks like a good time. It sounds like Ralf has stuff scheduled for the rest of the spring after the scipy release. IIRC, there was at least one other person interested in managing a release when David left for Silveregg, do we have any volunteers for a 1.6? If we do go for 1.6 I would like to keep 2.0 in sight. 
If datetime, the new iterator, einsum, and float16 are in 1.6 then 2.0 looks more like a cleanup the library/inteface and support IronPython release and there isn't as much pressure to get it out soon. Also it is important to get the ABI right so we don't need to change it again soon and doing that might take a bit of trial and error. Does September seem reasonable? * Build SciPy against NumPy 1.5.1 > * Build NumPy trunk > * Run NumPy trunk with the 1.5.1-built SciPy - all tests pass except for > one (PIL image resize, which tests all float types and half lacks the > precisions necessary) > > I took a shot at fixing the ABI compatibility, and if PyArray_ArrFunc was >>> the main issue, then that might be done. An ABI compatible 1.6 with the >>> datetime and half types should be doable, just some extensions might get >>> confused if they encounter arrays made with the new data types. >>> >>> Even if you fixed the ABI incompatibility (I don't know enough about the >> issue to confirm that), I'm not sure how much value there is in a release >> with as main new feature two dtypes that are not going to work well with >> scipy/other binaries compiled against 1.5. >> > > I've recently gotten the faster ufunc NEP implementation finished except > for generalized ufuncs, and most things work the same or faster with > it. Below are some timings of 1.5.1 vs the new_iterator branch. In > particular, the overhead on small arrays hasn't gotten worse, but the output > memory layout speeds up some operations by a lot. > > Chuck > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwwiebe at gmail.com Wed Jan 26 22:54:15 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Wed, 26 Jan 2011 19:54:15 -0800 Subject: [Numpy-discussion] einsum In-Reply-To: References: Message-ID: On Wed, Jan 26, 2011 at 6:41 PM, Jonathan Rocher wrote: > Nice function, and wonderful that it speeds some tasks up. > > some feedback: the following notation is a little counter intuitive to me: > >>> np.einsum('i...->', a) > array([50, 55, 60, 65, 70]) > >>> np.sum(a, axis=0) > array([50, 55, 60, 65, 70]) > Since there is nothing after the ->, I expected a scalar not a vector. I > might suggest 'i...->...' > Hmm, the dimension that's left is a a broadcast dimension, and the dimension labeled 'i' did go away. I suppose disallowing the empty output string and forcing a '...' is reasonable. Would disallowing broadcasting by default be a good approach? Then, einsum('ii->i', a) would only except two dimensional inputs, and you would have to specify einsum('...ii->...i', a) to get the current default behavior for it. Just noticed also a typo in the doc: > > order : 'C', 'F', 'A', or 'K' > Controls the memory layout of the output. 'C' means it should > be Fortran contiguous. 'F' means it should be Fortran contiguous, > should be changed to > order : 'C', 'F', 'A', or 'K' > Controls the memory layout of the output. 'C' means it should > be C contiguous. 'F' means it should be Fortran contiguous, > > Thanks, Mark -------------- next part -------------- An HTML attachment was scrubbed... 
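To make the ellipsis behaviour under discussion concrete, a small sketch (assuming the branch keeps the broadcasting semantics from the docstring above, with the leading '...' standing for the axes that are broadcast rather than labeled):

a = np.arange(18).reshape(2,3,3)
np.einsum('...ii->...i', a)   # per-matrix diagonal: [[0, 4, 8], [9, 13, 17]]
np.einsum('...ii', a)         # per-matrix trace: [12, 39]

Under the proposal above, the plain 'ii->i' form would then be reserved for genuinely 2-D input.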
URL: From mwwiebe at gmail.com Wed Jan 26 23:06:49 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Wed, 26 Jan 2011 20:06:49 -0800 Subject: [Numpy-discussion] einsum In-Reply-To: References: <20110126232106.GB32531@phare.normalesup.org> Message-ID: On Wed, Jan 26, 2011 at 5:23 PM, wrote: > > So, if I read the examples correctly we finally get dot along an axis > > np.einsum('ijk,ji->', a, b) > np.einsum('ijk,jik->k', a, b) > > or something like this. > > the notation might require getting used to but it doesn't look worse > than figuring out what tensordot does. > I thought of various extensions to the notation, but the idea is tricky enough as is I think. Decoding a regex-like syntax probably wouldn't help. The only disadvantage I see, is that choosing the axes to operate on > in a program or function requires string manipulation. > One possibility would be for the Python exposure to accept lists or tuples of integers. The subscript 'ii' could be [(0,0)], and 'ij,jk->ik' could be [(0,1), (1,2), (0,2)]. Internally it would convert this directly to a C-string to pass to the API function. -Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From josh.holbrook at gmail.com Wed Jan 26 23:29:38 2011 From: josh.holbrook at gmail.com (Joshua Holbrook) Date: Wed, 26 Jan 2011 19:29:38 -0900 Subject: [Numpy-discussion] einsum In-Reply-To: References: <20110126232106.GB32531@phare.normalesup.org> Message-ID: >> >> The only disadvantage I see, is that choosing the axes to operate on >> in a program or function requires string manipulation. > > > One possibility would be for the Python exposure to accept lists or tuples > of integers. ?The subscript 'ii' could be [(0,0)], and 'ij,jk->ik' could be > [(0,1), (1,2), (0,2)]. ?Internally it would convert this directly to a > C-string to pass to the API function. > -Mark > What if you made objects i, j, etc. such that i*j = (0, 1) and etcetera? Maybe you could generate them with something like (i, j, k) = einstein((1, 2, 3)) . Feel free to disregard me since I haven't really thought too hard about things and might not even really understand what the problem is *anyway*. I'm just trying to help brainstorm. :) --Josh From mwwiebe at gmail.com Wed Jan 26 23:45:33 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Wed, 26 Jan 2011 20:45:33 -0800 Subject: [Numpy-discussion] einsum In-Reply-To: References: <20110126232106.GB32531@phare.normalesup.org> Message-ID: On Wed, Jan 26, 2011 at 8:29 PM, Joshua Holbrook wrote: > >> > >> The only disadvantage I see, is that choosing the axes to operate on > >> in a program or function requires string manipulation. > > > > > > One possibility would be for the Python exposure to accept lists or > tuples > > of integers. The subscript 'ii' could be [(0,0)], and 'ij,jk->ik' could > be > > [(0,1), (1,2), (0,2)]. Internally it would convert this directly to a > > C-string to pass to the API function. > > -Mark > > > > What if you made objects i, j, etc. such that i*j = (0, 1) and > etcetera? Maybe you could generate them with something like (i, j, k) > = einstein((1, 2, 3)) . > > Feel free to disregard me since I haven't really thought too hard > about things and might not even really understand what the problem is > *anyway*. I'm just trying to help brainstorm. :) > No worries. :) The problem is that someone will probably want to dynamically generate the axes to process in a loop, rather than having them hardcoded beforehand. For example, generalizing the diag function as follows. 
Within Python, creating lists and tuples is probably more natural. -Mark >>> def diagij(x, i, j): ... ss = "" ... so = "" ... # should error check i, j ... fill = ord('b') ... for k in range(x.ndim): ... if k in [i, j]: ... ss += 'a' ... else: ... ss += chr(fill) ... so += chr(fill) ... fill += 1 ... ss += '->' + so + 'a' ... return np.einsum(ss, x) ... >>> x = np.arange(3*3*3).reshape(3,3,3) >>> diagij(x, 0, 1) array([[ 0, 12, 24], [ 1, 13, 25], [ 2, 14, 26]]) >>> [np.diag(x[:,:,i]) for i in range(3)] [array([ 0, 12, 24]), array([ 1, 13, 25]), array([ 2, 14, 26])] >>> diagij(x, 1, 2) array([[ 0, 4, 8], [ 9, 13, 17], [18, 22, 26]]) -------------- next part -------------- An HTML attachment was scrubbed... URL: From pivanov314 at gmail.com Thu Jan 27 01:09:51 2011 From: pivanov314 at gmail.com (Paul Ivanov) Date: Wed, 26 Jan 2011 22:09:51 -0800 Subject: [Numpy-discussion] How to tell if I succeeded to build numpy with amd, umfpack and lapack In-Reply-To: References: Message-ID: <20110127060951.GB21623@ykcyc> Samuel John, on 2011-01-26 15:08, wrote: > Hi there! > > I have successfully built numpy 1.5 on ubuntu lucid (32 for now). > I think I got ATLAS/lapack/BLAS support, and if I > > ldd linalg/lapack_lite.so > I see that my libptf77blas.so etc. are successfully linked. :-) > > However, how to I find out, if (and where) libamd.a and libumfpack.a > have been found and (statically) linked. > As far as I understand, I they are not present, a fallback in pure > python is used, right? > > Is there a recommended way, I can query against which libs numpy has > been built? > So I can be sure numpy uses my own compiled versions of libamd, lapack > and so forth. Hi Samuel, take a look at numpy.show_config() and scipy.show_config() best, -- Paul Ivanov 314 address only used for lists, off-list direct email at: http://pirsquared.org | GPG/PGP key id: 0x0F3E28F7 -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 198 bytes Desc: Digital signature URL: From scipy at SamuelJohn.de Thu Jan 27 03:43:18 2011 From: scipy at SamuelJohn.de (Samuel John) Date: Thu, 27 Jan 2011 09:43:18 +0100 Subject: [Numpy-discussion] How to tell if I succeeded to build numpy with amd, umfpack and lapack In-Reply-To: <20110127060951.GB21623@ykcyc> References: <20110127060951.GB21623@ykcyc> Message-ID: Hi Paul, thanks for your answer! I was not aware of numpy.show_config(). However, it does not say anything about libamd.a and libumfpack.a, right? How do I know if they were successfully linked (statically)? Does anybody have a clue? greetings Samuel From gnurser at gmail.com Thu Jan 27 05:04:14 2011 From: gnurser at gmail.com (George Nurser) Date: Thu, 27 Jan 2011 10:04:14 +0000 Subject: [Numpy-discussion] Einstein summation convention Message-ID: Hi Mark, I was very interested to see that you had written an implementation of the Einstein summation convention for numpy. I'd thought about this last year, and wrote some notes on what I thought might be a reasonable interface. Unfortunately I was not in a position to actually implement it myself, so I left it. But I'll set them out here in case they are useful to you, or anyone else. The discussion about the dot product earlier last year e.g. 
http://mail.scipy.org/pipermail/numpy-discussion/2010-April/050160.html, http://mail.scipy.org/pipermail/numpy-discussion/2010-April/050202.html and the more recent work on tensor manipulation http://mail.scipy.org/pipermail/numpy-discussion/2010-June/050945.html and named arrays, http://projects.scipy.org/numpy/wiki/NdarrayWithNamedAxes suggested to me that there is a lot of interest in writing tensor and vector products more explicitly. Rather than defining a new binary operator for matrix multiplication, perhaps the most pressing need is for an easy, intuitive, notation to define the axes that are being summed over. My initial thought was that perhaps one could do something like a[:,:*]*b[:*,:] = dot(a,b) where the * would denote the axis that was being summed over. So a[:,:*]*b[:,:*] = dot(a,b.T) and perhaps allow more lables, e.g. & a[:&,:*]*b[:&,:*] = tensordot(a,b,axes=([0,1],[0,1])) I'm not sure whether this is possible to implement, though. Even if it was, I suppose it would require major changes to the python slicing mechanism. A rather more cumbersome, though more powerful idea, might be to somehow parse operations written in terms of the Einstein summation convention. So in this case a function einst might be implemented such that e.g. supposing a.shape = (M,N), b.shape = (N,P), c.shape = (M,N) d.shape = (M,), e.shape = (M,), f.shape = (M,M), g.shape = (M,N,N,P) from numpy import einst as E The dot product would be taken where the same letter is found in lowerscript and upperscript, but no sum would be taken where the same letter was in lowerscript both times. So E(d,'i',e,'I') = sum_i d_{i}*b_{i} = dot(a,b) E(a,'ik',b,'Kj') = sum_k a_{ik}*b_{kj} = dot(a,b) E(f,'ik',a,'IK') = sum_{ik} f_{ik}*a_{ik} = tensordot(a,b,axes=([0,1],[0,1])) E(f,'ki',a,'IK') = sum_{ik} f_{ki}*a_{ik} = tensordot(a,b,axes=([1,0],[0,1])) Contraction, diagonalization and outer product emerge naturally E(f,'iI') = sum_i a_{ii} = f.trace() E(f,'ii') = a_{ii} = f.diagonal() E(d,'i',e,'k') = outer(d,e) Multiplication of more than two tensors could be performed: E(d,'i',f,'Ik',e,'K') = sum_{ik} d_{i}*f_{ik}*e_{k} = dot(d,dot(f,e)) = d.dot(f.dot(e)) E(a,'ik',g,'IKlm',b,'LM') = sum_{iklm} a_{ik}*g_{iklm}*b{lm} = tensordot(a,tensordot(g,b,axes=([2,3],[0,1])),axes=([0,1],[0,1])) Multiplication term by term without summation is implied by repeated letters of the same case. Multiplication over unequal axis lengths would raise an error. E(d,'j',e,'j') = d_{j}*e_{j} = d*e E(a,'ij',c,'ij') = a_{ij}*c_{ij} = a*c E(a,'ij',f,'ij') undefined, since a.shape[0] != f.shape[0] E(f,'ij',f,'ji') = f_{ij}*f_{ji} = f*f.T Broadcasting would be explicit E(f,'ij',d,'j') =a_{ij}*d_j = a*d E(f,'ij',d,'i') =a_{ij}*d_i = a*d[:,None] If a definite order of precedence of sums were established (e.g. doing the rightmost multiplication first) dotting and term by term multiplication could be mixed E(d,'I',d,'i',e,'i') = sum_i d_i*d_i*e_i I hope this is of some use. --George Nurser From markbak at gmail.com Thu Jan 27 05:40:00 2011 From: markbak at gmail.com (Mark Bakker) Date: Thu, 27 Jan 2011 11:40:00 +0100 Subject: [Numpy-discussion] Error in tanh for large complex argument Message-ID: Hello list, When computing tanh for large complex argument I get unexpected nans: tanh works fine for large real values: In [84]: tanh(1000) Out[84]: 1.0 Not for large complex values: In [85]: tanh(1000+0j) Out[85]: (nan+nan*j) While the correct answer is: In [86]: (1.0-exp(-2.0*(1000+0j)))/(1.0+exp(-2.0*(1000+0j))) Out[86]: 1.0 Bug? 
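In the meantime, the rewrite in In [86] can be made overflow-free by switching on the sign of the real part, so that the exponential always stays bounded (just a sketch, not what numpy does internally):

from numpy import exp

def stable_tanh(z):
    # tanh(z) = (1 - exp(-2z))/(1 + exp(-2z)) = (exp(2z) - 1)/(exp(2z) + 1);
    # pick the form whose exponential cannot overflow
    if z.real >= 0:
        e = exp(-2.0*z)
        return (1.0 - e)/(1.0 + e)
    else:
        e = exp(2.0*z)
        return (e - 1.0)/(e + 1.0)

stable_tanh(1000+0j)    # -> (1+0j), matching tanh(1000)
stable_tanh(-1000+0j)   # -> (-1+0j)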
Thanks, Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From cool-rr at cool-rr.com Thu Jan 27 05:55:21 2011 From: cool-rr at cool-rr.com (cool-RR) Date: Thu, 27 Jan 2011 12:55:21 +0200 Subject: [Numpy-discussion] How can I install numpy on Python 3.1 in Ubuntu? In-Reply-To: References: Message-ID: On Mon, Jan 24, 2011 at 3:23 PM, Ralf Gommers wrote: > > > On Mon, Jan 24, 2011 at 8:22 PM, cool-RR wrote: > >> Hello folks, >> >> I have Ubuntu 10.10 server on EC2. I installed Python 3.1, and now I want >> to install NumPy on it. How do I do it? I tried `easy_install-3.1 numpy` but >> got this error: >> > > Just do "python3.1 setup.py install". That's always a better idea for > numpy/scipy than trying to use easy_install. Also you need to make sure some > packages are installed first. From > http://www.scipy.org/Installing_SciPy/Linux: > sudo apt-get install build-essential python-dev swig gfortran python-nose > > Cheers, > Ralf > > Thanks, I'll give it a try. Ram. -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Thu Jan 27 06:11:54 2011 From: pav at iki.fi (Pauli Virtanen) Date: Thu, 27 Jan 2011 11:11:54 +0000 (UTC) Subject: [Numpy-discussion] Error in tanh for large complex argument References: Message-ID: Thu, 27 Jan 2011 11:40:00 +0100, Mark Bakker wrote: [clip] > Not for large complex values: > > In [85]: tanh(1000+0j) > Out[85]: (nan+nan*j) Yep, it's a bug. Care to file a ticket? The implementation is just sinh/cosh, which overflows. The fix is to provide an asymptotic expansion (sgn Re z), although around the imaginary axis the switch is perhaps somewhat messy to handle. OTOH, the glibc-provided C99 function doesn't fare too well either: #include #include #include int main() { complex double z = 1000; double x, y; z = ctanh(z); x = creal(z); y = cimag(z); printf("%g %g\n", x, y); return 0; } ### -> Prints 0 0 on glibc 2.12.1 From nadavh at visionsense.com Thu Jan 27 06:37:33 2011 From: nadavh at visionsense.com (Nadav Horesh) Date: Thu, 27 Jan 2011 03:37:33 -0800 Subject: [Numpy-discussion] Error in tanh for large complex argument In-Reply-To: References: , Message-ID: <26FC23E7C398A64083C980D16001012D1AD93B941F@VA3DIAXVS361.RED001.local> The C code return the right result with glibc 2.12.2 (linux 64 + gcc 4.52). However I get the same nan+nan*j with python. Nadav ________________________________________ From: numpy-discussion-bounces at scipy.org [numpy-discussion-bounces at scipy.org] On Behalf Of Pauli Virtanen [pav at iki.fi] Sent: 27 January 2011 13:11 To: numpy-discussion at scipy.org Subject: Re: [Numpy-discussion] Error in tanh for large complex argument Thu, 27 Jan 2011 11:40:00 +0100, Mark Bakker wrote: [clip] > Not for large complex values: > > In [85]: tanh(1000+0j) > Out[85]: (nan+nan*j) Yep, it's a bug. Care to file a ticket? The implementation is just sinh/cosh, which overflows. The fix is to provide an asymptotic expansion (sgn Re z), although around the imaginary axis the switch is perhaps somewhat messy to handle. 
OTOH, the glibc-provided C99 function doesn't fare too well either: #include #include #include int main() { complex double z = 1000; double x, y; z = ctanh(z); x = creal(z); y = cimag(z); printf("%g %g\n", x, y); return 0; } ### -> Prints 0 0 on glibc 2.12.1 _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion From cournape at gmail.com Thu Jan 27 06:47:24 2011 From: cournape at gmail.com (David Cournapeau) Date: Thu, 27 Jan 2011 20:47:24 +0900 Subject: [Numpy-discussion] Error in tanh for large complex argument In-Reply-To: <26FC23E7C398A64083C980D16001012D1AD93B941F@VA3DIAXVS361.RED001.local> References: <26FC23E7C398A64083C980D16001012D1AD93B941F@VA3DIAXVS361.RED001.local> Message-ID: On Thu, Jan 27, 2011 at 8:37 PM, Nadav Horesh wrote: > The C code return the right result with glibc 2.12.2 (linux 64 + gcc 4.52). Same for me on mac os x (not sure which C library it is using, the freebsd one ?) for ppc, i386 and amd64, cheers, David From ralf.gommers at googlemail.com Thu Jan 27 10:09:29 2011 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Thu, 27 Jan 2011 23:09:29 +0800 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: On Thu, Jan 27, 2011 at 11:09 AM, Charles R Harris < charlesr.harris at gmail.com> wrote: > > > On Wed, Jan 26, 2011 at 1:10 PM, Mark Wiebe wrote: > >> On Wed, Jan 26, 2011 at 2:23 AM, Ralf Gommers < >> ralf.gommers at googlemail.com> wrote: >> >>> On Wed, Jan 26, 2011 at 12:28 PM, Mark Wiebe wrote: >>> >>>> On Tue, Jan 25, 2011 at 5:18 PM, Charles R Harris < >>>> charlesr.harris at gmail.com> wrote: >>>> >>>>> On Tue, Jan 25, 2011 at 1:13 PM, Travis Oliphant < >>>>> oliphant at enthought.com> wrote: >>>>> >>>>>> >>>>>> It may make sense for a NumPy 1.6 to come out in March / April in >>>>>> the interim. >>>>>> >>>>>> >>>>> Pulling out the changes to attain backward compatibility isn't getting >>>>> any easier. I'd rather shoot for 2.0 in June. What can the rest of us do to >>>>> help move things along? >>>>> >>>> >>>> >>> Focusing on 2.0 makes sense to me too. Besides that, March/April is bad >>> timing for me so someone else should volunteer to be the release manager if >>> we go for a 1.6. >>> >> >> I think sooner than March/April might be a possibility. I've gotten the >> ABI working so this succeeds on my machine: >> >> > If we go with a 1.6 I have some polynomial stuff I want to put in, probably > a weekend or two of work, and there are tickets and pull requests to look > through, so to me March-April looks like a good time. It sounds like Ralf > has stuff scheduled for the rest of the spring after the scipy release. > IIRC, there was at least one other person interested in managing a release > when David left for Silveregg, do we have any volunteers for a 1.6? > > If we do go for 1.6 I would like to keep 2.0 in sight. If datetime, the new > iterator, einsum, and float16 are in 1.6 then 2.0 looks more like a cleanup > the library/inteface and support IronPython release and there isn't as much > pressure to get it out soon. Also it is important to get the ABI right so we > don't need to change it again soon and doing that might take a bit of trial > and error. Does September seem reasonable? 
> > * Build SciPy against NumPy 1.5.1 >> * Build NumPy trunk >> * Run NumPy trunk with the 1.5.1-built SciPy - all tests pass except for >> one (PIL image resize, which tests all float types and half lacks the >> precisions necessary) >> > The PIL test can still be fixed before the final 0.9.0 release, it looks like we will need another RC anyway. Does anyone have time for this in the next few days? > >> I took a shot at fixing the ABI compatibility, and if PyArray_ArrFunc was >>>> the main issue, then that might be done. An ABI compatible 1.6 with the >>>> datetime and half types should be doable, just some extensions might get >>>> confused if they encounter arrays made with the new data types. >>>> >>>> Even if you fixed the ABI incompatibility (I don't know enough about the >>> issue to confirm that), I'm not sure how much value there is in a release >>> with as main new feature two dtypes that are not going to work well with >>> scipy/other binaries compiled against 1.5. >>> >> >> I've recently gotten the faster ufunc NEP implementation finished except >> for generalized ufuncs, and most things work the same or faster with >> it. Below are some timings of 1.5.1 vs the new_iterator branch. In >> particular, the overhead on small arrays hasn't gotten worse, but the output >> memory layout speeds up some operations by a lot. >> >> Your new additions indeed look quite promising. I tried your new_iterator branch but ran into a segfault immediately on running the tests on OS X. I opened a ticket for it, to not mix it into this discussion about releases too much: http://projects.scipy.org/numpy/ticket/1724. Before we decide on a 1.6 release I would suggest to do at least the following: - review of ABI fixes by someone very familiar with the problem that occurred in 1.4.0 (David, Pauli, Charles?) - test on Linux, OS X and Windows 32-bit and 64-bit. Also with an MSVC build on Windows, since that exposes more issues each release. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwwiebe at gmail.com Thu Jan 27 11:17:08 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Thu, 27 Jan 2011 08:17:08 -0800 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: On Thu, Jan 27, 2011 at 7:09 AM, Ralf Gommers wrote: > > The PIL test can still be fixed before the final 0.9.0 release, it looks > like we will need another RC anyway. Does anyone have time for this in the > next few days? > I've attached a patch which fixes it for me. > I took a shot at fixing the ABI compatibility, and if PyArray_ArrFunc was >>>>> the main issue, then that might be done. An ABI compatible 1.6 with the >>>>> datetime and half types should be doable, just some extensions might get >>>>> confused if they encounter arrays made with the new data types. >>>>> >>>>> Even if you fixed the ABI incompatibility (I don't know enough about >>>> the issue to confirm that), I'm not sure how much value there is in a >>>> release with as main new feature two dtypes that are not going to work well >>>> with scipy/other binaries compiled against 1.5. >>>> >>> >>> I've recently gotten the faster ufunc NEP implementation finished except >>> for generalized ufuncs, and most things work the same or faster with >>> it. Below are some timings of 1.5.1 vs the new_iterator branch. In >>> particular, the overhead on small arrays hasn't gotten worse, but the output >>> memory layout speeds up some operations by a lot. >>> >>> Your new additions indeed look quite promising. 
I tried your new_iterator > branch but ran into a segfault immediately on running the tests on OS X. I > opened a ticket for it, to not mix it into this discussion about releases > too much: http://projects.scipy.org/numpy/ticket/1724. > Is that a non-Intel platform? While I tried to get aligned access right, it's likely there's a bug in it somewhere. Before we decide on a 1.6 release I would suggest to do at least the > following: > - review of ABI fixes by someone very familiar with the problem that > occurred in 1.4.0 (David, Pauli, Charles?) > - test on Linux, OS X and Windows 32-bit and 64-bit. Also with an MSVC > build on Windows, since that exposes more issues each release. > All tests pass for me now, maybe it's a good time to merge the branch into the trunk so we can run it on the buildbot? -Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: scipy_piltest.patch Type: text/x-patch Size: 537 bytes Desc: not available URL: From f.pollastri at inrim.it Thu Jan 27 11:50:49 2011 From: f.pollastri at inrim.it (Fabrizio Pollastri) Date: Thu, 27 Jan 2011 16:50:49 +0000 (UTC) Subject: [Numpy-discussion] sort descending with NaNs Message-ID: Hello, when one has to find a given number of highest values in an array containing NaNs, the sort function (always ascending) is uncomfortable. Since numpy >= 1.4.0 NaNs are sorted to the end, so the searched values are just before the first NaN in a unpredictable position and one has to do another search for the first NaN position. Sorting descending will solve the problem, but there is no option with numpy sort. There is any other trick to avoid this second search? TIA, Fabrizio Pollastri From charlesr.harris at gmail.com Thu Jan 27 12:10:17 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 27 Jan 2011 10:10:17 -0700 Subject: [Numpy-discussion] sort descending with NaNs In-Reply-To: References: Message-ID: On Thu, Jan 27, 2011 at 9:50 AM, Fabrizio Pollastri wrote: > Hello, > > when one has to find a given number of highest values in an array > containing > NaNs, the sort function (always ascending) is uncomfortable. > > Since numpy >= 1.4.0 NaNs are sorted to the end, so the searched values are > just > before the first NaN in a unpredictable position and one has to do another > search for the first NaN position. > > Sorting descending will solve the problem, but there is no option with > numpy > sort. There is any other trick to avoid this second search? > If you just want to reverse the result, try a[::-1]. I think you may still need to find the boundaries of the nan's just to make sure they aren't included among the largest values. Searchsorted is pretty quick in any case. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu Jan 27 12:17:54 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 27 Jan 2011 10:17:54 -0700 Subject: [Numpy-discussion] sort descending with NaNs In-Reply-To: References: Message-ID: On Thu, Jan 27, 2011 at 10:10 AM, Charles R Harris < charlesr.harris at gmail.com> wrote: > > > On Thu, Jan 27, 2011 at 9:50 AM, Fabrizio Pollastri wrote: > >> Hello, >> >> when one has to find a given number of highest values in an array >> containing >> NaNs, the sort function (always ascending) is uncomfortable. 
>> >> Since numpy >= 1.4.0 NaNs are sorted to the end, so the searched values >> are just >> before the first NaN in a unpredictable position and one has to do another >> search for the first NaN position. >> >> Sorting descending will solve the problem, but there is no option with >> numpy >> sort. There is any other trick to avoid this second search? >> > > If you just want to reverse the result, try a[::-1]. I think you may still > need to find the boundaries of the nan's just to make sure they aren't > included among the largest values. Searchsorted is pretty quick in any case. > > To sort in descending order sort the negatives, i.e. In [1]: -sort(-array((0,1,2,3,4,nan))) Out[1]: array([ 4., 3., 2., 1., 0., nan]) I still think a.searchsorted(nan) would be faster. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu Jan 27 12:36:34 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 27 Jan 2011 10:36:34 -0700 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: On Thu, Jan 27, 2011 at 9:17 AM, Mark Wiebe wrote: > On Thu, Jan 27, 2011 at 7:09 AM, Ralf Gommers > wrote: > >> >> The PIL test can still be fixed before the final 0.9.0 release, it looks >> like we will need another RC anyway. Does anyone have time for this in the >> next few days? >> > > I've attached a patch which fixes it for me. > > >> I took a shot at fixing the ABI compatibility, and if PyArray_ArrFunc >>>>>> was the main issue, then that might be done. An ABI compatible 1.6 with the >>>>>> datetime and half types should be doable, just some extensions might get >>>>>> confused if they encounter arrays made with the new data types. >>>>>> >>>>>> Even if you fixed the ABI incompatibility (I don't know enough about >>>>> the issue to confirm that), I'm not sure how much value there is in a >>>>> release with as main new feature two dtypes that are not going to work well >>>>> with scipy/other binaries compiled against 1.5. >>>>> >>>> >>>> I've recently gotten the faster ufunc NEP implementation finished except >>>> for generalized ufuncs, and most things work the same or faster with >>>> it. Below are some timings of 1.5.1 vs the new_iterator branch. In >>>> particular, the overhead on small arrays hasn't gotten worse, but the output >>>> memory layout speeds up some operations by a lot. >>>> >>>> Your new additions indeed look quite promising. I tried your >> new_iterator branch but ran into a segfault immediately on running the tests >> on OS X. I opened a ticket for it, to not mix it into this discussion about >> releases too much: http://projects.scipy.org/numpy/ticket/1724. >> > > Is that a non-Intel platform? While I tried to get aligned access right, > it's likely there's a bug in it somewhere. > > Before we decide on a 1.6 release I would suggest to do at least the >> following: >> - review of ABI fixes by someone very familiar with the problem that >> occurred in 1.4.0 (David, Pauli, Charles?) >> - test on Linux, OS X and Windows 32-bit and 64-bit. Also with an MSVC >> build on Windows, since that exposes more issues each release. >> > > All tests pass for me now, maybe it's a good time to merge the branch into > the trunk so we can run it on the buildbot? > > Might be better to merge your unadulterated stuff into master, make a 1.6 branch, and add the compatibility fixes in the branch. You can test branches on the buildbot I think, at least that worked for svn, I haven't tried it with github. 
Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From dewald.pieterse at gmail.com Thu Jan 27 16:03:22 2011 From: dewald.pieterse at gmail.com (Dewald Pieterse) Date: Thu, 27 Jan 2011 16:03:22 -0500 Subject: [Numpy-discussion] numpy.append & numpy.where vs list.append and brute iterative for loop Message-ID: I am processing two csv files against another, my first implementation used python list of lists and list.append to generate a new list while looping all the data including the non-relevant data (can't determine location of specific data element in a list of list). So I re-implented the exact same code but using numpy.array's (2d arrays) using numpy.where to prevent looping over an entire dataset needlessly but the numpy.array based code is about 7.6 times slower? relevant list of list code: > starttime = time.clock() > #NI_data_list room_eqp_list > NI_data_list_new = [] > for NI_row in NI_data_list: > treelevel = NI_row[0] > elevation = NI_row[1] > locater = NI_row[2] > area = NI_row[3] > NIroom = NI_row[4] > #Write appropriate equipment models and drawing into new list > if NIroom != '': > #Write appropriate equipment models and drawing into new list > for row in room_eqp_list: > eqp_room = row[0] > if len(eqp_room) == 5: > eqp_drawing = row[1] > if NIroom == eqp_room: > newrow = > [int(treelevel)+1,elevation,locater,area,NIroom,eqp_drawing] > NI_data_list_new.append(newrow) > #Write appropriate piping info into the new list > for prow in unique_piping_list: > pipe_room = prow[0] > if len(pipe_room) == 5: > pipe_drawing = prow[1] > if pipe_room == NIroom: > piperow = > [int(treelevel)+1,elevation,locater,area,NIroom,pipe_drawing] > NI_data_list_new.append(piperow) > #Write appropriate equipment models and drawing into new list > if (locater != '' and NIroom == ''): > #Write appropriate equipment models and drawing into new list > for row in room_eqp_list: > eqp_locater = row[0] > if len(eqp_locater) == 4: > eqp_drawing = row[1] > if locater == eqp_locater: > newrow = > [int(treelevel)+1,elevation,eqp_locater,area,'',eqp_drawing] > NI_data_list_new.append(newrow) > #Write appropriate piping info into the new list > for prow in unique_piping_list: > pipe_locater = prow[0] > if len(pipe_locater) == 4: > pipe_drawing = prow[1] > if pipe_locater == locater: > piperow = > [int(treelevel)+1,elevation,pipe_locater,area,'',pipe_drawing] > NI_data_list_new.append(piperow) > #Rewrite NI_data to new list > if NIroom == '': > NI_data_list_new.append(NI_row) > > print (time.clock()-starttime) > relevant numpy.array code: > NI_data_write_url = reports_dir + 'NI_data_room2.csv' > NI_data_list_file = open(NI_data_write_url, 'wb') > NI_data_list_writer = csv.writer(NI_data_list_file, delimiter=',', > quotechar='"') > starttime = time.clock() > #NI_data_list room_eqp_list > NI_data_list_new = numpy.array([['TreeDepth', 'Elevation', > 'BuildingLocater', 'Area', 'Room', 'Item']]) > for NI_row in NI_data_list: > treelevel = NI_row[0] > elevation = NI_row[1] > locater = NI_row[2] > area = NI_row[3] > NIroom = NI_row[4] > #Write appropriate equipment models and drawing into new array > if NIroom != '': > #Write appropriate equipment models and drawing into new array > (rowtest, columntest) = numpy.where(room_eqp_list==NIroom) > for row_iter in rowtest: > eqp_room = room_eqp_list[row_iter,0] > if len(eqp_room) == 5: > eqp_drawing = room_eqp_list[row_iter,1] > if NIroom == eqp_room: > newrow = > numpy.array([[int(treelevel)+1,elevation,locater,area,NIroom,eqp_drawing]]) > 
NI_data_list_new = numpy.append(NI_data_list_new, > newrow, 0) > > #Write appropriate piping info into the new array > (rowtest, columntest) = > numpy.where(unique_room_piping_list==NIroom) > for row_iter in rowtest: #unique_room_piping_list > pipe_room = unique_room_piping_list[row_iter,0] > if len(pipe_room) == 5: > pipe_drawing = unique_room_piping_list[row_iter,1] > if pipe_room == NIroom: > piperow = > numpy.array([[int(treelevel)+1,elevation,locater,area,NIroom,pipe_drawing]]) > NI_data_list_new = numpy.append(NI_data_list_new, > piperow, 0) > #Write appropriate equipment models and drawing into new array > if (locater != '' and NIroom == ''): > #Write appropriate equipment models and drawing into new array > (rowtest, columntest) = numpy.where(room_eqp_list==locater) > for row_iter in rowtest: > eqp_locater = room_eqp_list[row_iter,0] > if len(eqp_locater) == 4: > eqp_drawing = room_eqp_list[row_iter,1] > if locater == eqp_locater: > newrow = > numpy.array([[int(treelevel)+1,elevation,eqp_locater,area,'',eqp_drawing]]) > NI_data_list_new = numpy.append(NI_data_list_new, > newrow, 0) > #Write appropriate piping info into the new array > (rowtest, columntest) = numpy.where(unique_room_eqp_list==locater) > for row_iter in rowtest: > pipe_locater = unique_room_piping_list[row_iter,0] > if len(pipe_locater) == 4: > pipe_drawing = unique_room_piping_list[row_iter,1] > if pipe_locater == locater: > piperow = > numpy.array([[int(treelevel)+1,elevation,pipe_locater,area,'',pipe_drawing]]) > NI_data_list_new = numpy.append(NI_data_list_new, > piperow, 0) > #Rewrite NI_data to new list > if NIroom == '': > NI_data_list_new = numpy.append(NI_data_list_new,[NI_row],0) > > print (time.clock()-starttime) > some relevant output > >>> print NI_data_list_new > [['TreeDepth' 'Elevation' 'BuildingLocater' 'Area' 'Room' 'Item'] > ['0' '' '1000' '' '' ''] > ['1' '' '1000' '' '' 'docname Rev 0'] > ..., > ['5' '6' '1164' '4' '' 'eqp11 RB, R. surname, 24-NOV-08'] > ['4' '6' '1164' '4' '' 'anotherdoc Rev A'] > ['0' '' '' '' '' '']] > Is numpy.append so slow? or is the culprit numpy.where? Dewald Pieterse "A democracy is nothing more than mob rule, where fifty-one percent of the people take away the rights of the other forty-nine." ~ Thomas Jefferson -------------- next part -------------- An HTML attachment was scrubbed... URL: From Chris.Barker at noaa.gov Thu Jan 27 16:19:43 2011 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 27 Jan 2011 13:19:43 -0800 Subject: [Numpy-discussion] numpy.append & numpy.where vs list.append and brute iterative for loop In-Reply-To: References: Message-ID: <4D41E16F.5060505@noaa.gov> On 1/27/11 1:03 PM, Dewald Pieterse wrote: > I am processing two csv files against another, my first implementation > used python list of lists and list.append to generate a new list while > looping all the data including the non-relevant data (can't determine > location of specific data element in a list of list). So I re-implented > the exact same code but using numpy.array's (2d arrays) using > numpy.where to prevent looping over an entire dataset needlessly but the > numpy.array based code is about 7.6 times slower? Didn't look at your code in any detail, but: numpy arrays are not designed to be re-sizable, so numpy.append actually creates a new array, and copies the old to the new, along with the new stuff. It's a convenience function, but it means you are re-allocating and copying all your data with each call. 
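(Very roughly -- and this is only an illustrative sketch of the cost, not what numpy actually does internally -- each numpy.append(NI_data_list_new, newrow, 0) call in your loop amounts to something like:

    old = NI_data_list_new
    NI_data_list_new = numpy.empty((old.shape[0] + 1, old.shape[1]), dtype=old.dtype)
    NI_data_list_new[:-1] = old     # copy every row accumulated so far
    NI_data_list_new[-1] = newrow   # then copy the one new row

so appending N rows one at a time ends up copying on the order of N**2 elements in total.)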
python lists, on the other hand, are designed to be re-sizable, so they pre-allocate extra room, so that appending can be fast. In general, the recommended solution in this sort of situation is to build up your data in a python list, then convert it to an array. If I'm right about what you're doing you could keep the "rows" as numpy arrays, but put them in a list while building it up. Also, a numpy array of strings isn't necessarily a great dats structure for this kind of data. YOu might want to look at structured arrays. I wrote an appendable numpy array class a while back, to address this. It has some advantages, though, as it it written, not as much as you'd think. It does have some benifits for structured arrays, though. Code enclosed -Chris > relevant list of list code: > > starttime = time.clock() > #NI_data_list room_eqp_list > NI_data_list_new = [] > for NI_row in NI_data_list: > treelevel = NI_row[0] > elevation = NI_row[1] > locater = NI_row[2] > area = NI_row[3] > NIroom = NI_row[4] > #Write appropriate equipment models and drawing into new list > if NIroom != '': > #Write appropriate equipment models and drawing into new list > for row in room_eqp_list: > eqp_room = row[0] > if len(eqp_room) == 5: > eqp_drawing = row[1] > if NIroom == eqp_room: > newrow = > [int(treelevel)+1,elevation,locater,area,NIroom,eqp_drawing] > NI_data_list_new.append(newrow) > #Write appropriate piping info into the new list > for prow in unique_piping_list: > pipe_room = prow[0] > if len(pipe_room) == 5: > pipe_drawing = prow[1] > if pipe_room == NIroom: > piperow = > [int(treelevel)+1,elevation,locater,area,NIroom,pipe_drawing] > NI_data_list_new.append(piperow) > #Write appropriate equipment models and drawing into new list > if (locater != '' and NIroom == ''): > #Write appropriate equipment models and drawing into new list > for row in room_eqp_list: > eqp_locater = row[0] > if len(eqp_locater) == 4: > eqp_drawing = row[1] > if locater == eqp_locater: > newrow = > [int(treelevel)+1,elevation,eqp_locater,area,'',eqp_drawing] > NI_data_list_new.append(newrow) > #Write appropriate piping info into the new list > for prow in unique_piping_list: > pipe_locater = prow[0] > if len(pipe_locater) == 4: > pipe_drawing = prow[1] > if pipe_locater == locater: > piperow = > [int(treelevel)+1,elevation,pipe_locater,area,'',pipe_drawing] > NI_data_list_new.append(piperow) > #Rewrite NI_data to new list > if NIroom == '': > NI_data_list_new.append(NI_row) > > print (time.clock()-starttime) > > > relevant numpy.array code: > > NI_data_write_url = reports_dir + 'NI_data_room2.csv' > NI_data_list_file = open(NI_data_write_url, 'wb') > NI_data_list_writer = csv.writer(NI_data_list_file, delimiter=',', > quotechar='"') > starttime = time.clock() > #NI_data_list room_eqp_list > NI_data_list_new = numpy.array([['TreeDepth', 'Elevation', > 'BuildingLocater', 'Area', 'Room', 'Item']]) > for NI_row in NI_data_list: > treelevel = NI_row[0] > elevation = NI_row[1] > locater = NI_row[2] > area = NI_row[3] > NIroom = NI_row[4] > #Write appropriate equipment models and drawing into new array > if NIroom != '': > #Write appropriate equipment models and drawing into new array > (rowtest, columntest) = numpy.where(room_eqp_list==NIroom) > for row_iter in rowtest: > eqp_room = room_eqp_list[row_iter,0] > if len(eqp_room) == 5: > eqp_drawing = room_eqp_list[row_iter,1] > if NIroom == eqp_room: > newrow = > numpy.array([[int(treelevel)+1,elevation,locater,area,NIroom,eqp_drawing]]) > NI_data_list_new = > 
numpy.append(NI_data_list_new, newrow, 0) > > #Write appropriate piping info into the new array > (rowtest, columntest) = > numpy.where(unique_room_piping_list==NIroom) > for row_iter in rowtest: #unique_room_piping_list > pipe_room = unique_room_piping_list[row_iter,0] > if len(pipe_room) == 5: > pipe_drawing = unique_room_piping_list[row_iter,1] > if pipe_room == NIroom: > piperow = > numpy.array([[int(treelevel)+1,elevation,locater,area,NIroom,pipe_drawing]]) > NI_data_list_new = > numpy.append(NI_data_list_new, piperow, 0) > #Write appropriate equipment models and drawing into new array > if (locater != '' and NIroom == ''): > #Write appropriate equipment models and drawing into new array > (rowtest, columntest) = numpy.where(room_eqp_list==locater) > for row_iter in rowtest: > eqp_locater = room_eqp_list[row_iter,0] > if len(eqp_locater) == 4: > eqp_drawing = room_eqp_list[row_iter,1] > if locater == eqp_locater: > newrow = > numpy.array([[int(treelevel)+1,elevation,eqp_locater,area,'',eqp_drawing]]) > NI_data_list_new = > numpy.append(NI_data_list_new, newrow, 0) > #Write appropriate piping info into the new array > (rowtest, columntest) = > numpy.where(unique_room_eqp_list==locater) > for row_iter in rowtest: > pipe_locater = unique_room_piping_list[row_iter,0] > if len(pipe_locater) == 4: > pipe_drawing = unique_room_piping_list[row_iter,1] > if pipe_locater == locater: > piperow = > numpy.array([[int(treelevel)+1,elevation,pipe_locater,area,'',pipe_drawing]]) > NI_data_list_new = > numpy.append(NI_data_list_new, piperow, 0) > #Rewrite NI_data to new list > if NIroom == '': > NI_data_list_new = numpy.append(NI_data_list_new,[NI_row],0) > > print (time.clock()-starttime) > > > some relevant output > > >>> print NI_data_list_new > [['TreeDepth' 'Elevation' 'BuildingLocater' 'Area' 'Room' 'Item'] > ['0' '' '1000' '' '' ''] > ['1' '' '1000' '' '' 'docname Rev 0'] > ..., > ['5' '6' '1164' '4' '' 'eqp11 RB, R. surname, 24-NOV-08'] > ['4' '6' '1164' '4' '' 'anotherdoc Rev A'] > ['0' '' '' '' '' '']] > > > Is numpy.append so slow? or is the culprit numpy.where? > > Dewald Pieterse > > "A democracy is nothing more than mob rule, where fifty-one percent of > the people take away the rights of the other forty-nine." ~ Thomas Jefferson > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- A non-text attachment was scrubbed... Name: Accumulator.zip Type: application/zip Size: 4703 bytes Desc: not available URL: From mwwiebe at gmail.com Thu Jan 27 12:56:50 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Thu, 27 Jan 2011 09:56:50 -0800 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: On Thu, Jan 27, 2011 at 9:36 AM, Charles R Harris wrote: > >> All tests pass for me now, maybe it's a good time to merge the branch into >> the trunk so we can run it on the buildbot? >> >> > Might be better to merge your unadulterated stuff into master, make a 1.6 > branch, and add the compatibility fixes in the branch. You can test branches > on the buildbot I think, at least that worked for svn, I haven't tried it > with github. 
> I'm inclined to put the ABI fixes in trunk as well for the time being. The two changes of note, moving the 'cast' array to the end of PyArray_ArrFuncs and making 'flags' in PyArray_Descr bigger, can be reapplied if the 2.0 refactor ends up needing them. I think for 2.0, more extensive future-proofing will be desirable anyway, so trunk may as well be ABI compatible until it's clear what changes are necessary. -Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From dewald.pieterse at gmail.com Thu Jan 27 16:33:43 2011 From: dewald.pieterse at gmail.com (Dewald Pieterse) Date: Thu, 27 Jan 2011 16:33:43 -0500 Subject: [Numpy-discussion] numpy.append & numpy.where vs list.append and brute iterative for loop In-Reply-To: <4D41E16F.5060505@noaa.gov> References: <4D41E16F.5060505@noaa.gov> Message-ID: On Thu, Jan 27, 2011 at 4:19 PM, Christopher Barker wrote: > On 1/27/11 1:03 PM, Dewald Pieterse wrote: > >> I am processing two csv files against another, my first implementation >> used python list of lists and list.append to generate a new list while >> looping all the data including the non-relevant data (can't determine >> location of specific data element in a list of list). So I re-implented >> the exact same code but using numpy.array's (2d arrays) using >> numpy.where to prevent looping over an entire dataset needlessly but the >> numpy.array based code is about 7.6 times slower? >> > > Didn't look at your code in any detail, but: > > numpy arrays are not designed to be re-sizable, so numpy.append actually > creates a new array, and copies the old to the new, along with the new > stuff. It's a convenience function, but it means you are re-allocating and > copying all your data with each call. > > python lists, on the other hand, are designed to be re-sizable, so they > pre-allocate extra room, so that appending can be fast. > > In general, the recommended solution in this sort of situation is to build > up your data in a python list, then convert it to an array. > > If I'm right about what you're doing you could keep the "rows" as numpy > arrays, but put them in a list while building it up. > Thanks Chris, I believe this is the problem then, I can continue to use the arrays as reference data but build list instead, the only reason I used the arrays was to be able to use numpy.where, I just use both data types, best of both worlds. As I already have row arrays I will do a build a list or arrays. > Also, a numpy array of strings isn't necessarily a great dats structure for > this kind of data. YOu might want to look at structured arrays. > Atm, I use : comit_eqp_reader = csv.reader(comit_eqp_file, delimiter=',', quotechar='"') comit_eqp_lt = numpy.array([[col for col in row] for row in comit_eqp_reader]) to setup the arrays, I will look at using structured arrays > > I wrote an appendable numpy array class a while back, to address this. It > has some advantages, though, as it it written, not as much as you'd think. > It does have some benifits for structured arrays, though. 
> > > Code enclosed > > -Chris > > > > relevant list of list code: >> >> starttime = time.clock() >> #NI_data_list room_eqp_list >> NI_data_list_new = [] >> for NI_row in NI_data_list: >> treelevel = NI_row[0] >> elevation = NI_row[1] >> locater = NI_row[2] >> area = NI_row[3] >> NIroom = NI_row[4] >> #Write appropriate equipment models and drawing into new list >> if NIroom != '': >> #Write appropriate equipment models and drawing into new list >> for row in room_eqp_list: >> eqp_room = row[0] >> if len(eqp_room) == 5: >> eqp_drawing = row[1] >> if NIroom == eqp_room: >> newrow = >> [int(treelevel)+1,elevation,locater,area,NIroom,eqp_drawing] >> NI_data_list_new.append(newrow) >> #Write appropriate piping info into the new list >> for prow in unique_piping_list: >> pipe_room = prow[0] >> if len(pipe_room) == 5: >> pipe_drawing = prow[1] >> if pipe_room == NIroom: >> piperow = >> [int(treelevel)+1,elevation,locater,area,NIroom,pipe_drawing] >> NI_data_list_new.append(piperow) >> #Write appropriate equipment models and drawing into new list >> if (locater != '' and NIroom == ''): >> #Write appropriate equipment models and drawing into new list >> for row in room_eqp_list: >> eqp_locater = row[0] >> if len(eqp_locater) == 4: >> eqp_drawing = row[1] >> if locater == eqp_locater: >> newrow = >> [int(treelevel)+1,elevation,eqp_locater,area,'',eqp_drawing] >> NI_data_list_new.append(newrow) >> #Write appropriate piping info into the new list >> for prow in unique_piping_list: >> pipe_locater = prow[0] >> if len(pipe_locater) == 4: >> pipe_drawing = prow[1] >> if pipe_locater == locater: >> piperow = >> [int(treelevel)+1,elevation,pipe_locater,area,'',pipe_drawing] >> NI_data_list_new.append(piperow) >> #Rewrite NI_data to new list >> if NIroom == '': >> NI_data_list_new.append(NI_row) >> >> print (time.clock()-starttime) >> >> >> relevant numpy.array code: >> >> NI_data_write_url = reports_dir + 'NI_data_room2.csv' >> NI_data_list_file = open(NI_data_write_url, 'wb') >> NI_data_list_writer = csv.writer(NI_data_list_file, delimiter=',', >> quotechar='"') >> starttime = time.clock() >> #NI_data_list room_eqp_list >> NI_data_list_new = numpy.array([['TreeDepth', 'Elevation', >> 'BuildingLocater', 'Area', 'Room', 'Item']]) >> for NI_row in NI_data_list: >> treelevel = NI_row[0] >> elevation = NI_row[1] >> locater = NI_row[2] >> area = NI_row[3] >> NIroom = NI_row[4] >> #Write appropriate equipment models and drawing into new array >> if NIroom != '': >> #Write appropriate equipment models and drawing into new array >> (rowtest, columntest) = numpy.where(room_eqp_list==NIroom) >> for row_iter in rowtest: >> eqp_room = room_eqp_list[row_iter,0] >> if len(eqp_room) == 5: >> eqp_drawing = room_eqp_list[row_iter,1] >> if NIroom == eqp_room: >> newrow = >> >> numpy.array([[int(treelevel)+1,elevation,locater,area,NIroom,eqp_drawing]]) >> NI_data_list_new = >> numpy.append(NI_data_list_new, newrow, 0) >> >> #Write appropriate piping info into the new array >> (rowtest, columntest) = >> numpy.where(unique_room_piping_list==NIroom) >> for row_iter in rowtest: #unique_room_piping_list >> pipe_room = unique_room_piping_list[row_iter,0] >> if len(pipe_room) == 5: >> pipe_drawing = unique_room_piping_list[row_iter,1] >> if pipe_room == NIroom: >> piperow = >> >> numpy.array([[int(treelevel)+1,elevation,locater,area,NIroom,pipe_drawing]]) >> NI_data_list_new = >> numpy.append(NI_data_list_new, piperow, 0) >> #Write appropriate equipment models and drawing into new array >> if (locater != '' and NIroom == ''): >> 
#Write appropriate equipment models and drawing into new array >> (rowtest, columntest) = numpy.where(room_eqp_list==locater) >> for row_iter in rowtest: >> eqp_locater = room_eqp_list[row_iter,0] >> if len(eqp_locater) == 4: >> eqp_drawing = room_eqp_list[row_iter,1] >> if locater == eqp_locater: >> newrow = >> >> numpy.array([[int(treelevel)+1,elevation,eqp_locater,area,'',eqp_drawing]]) >> NI_data_list_new = >> numpy.append(NI_data_list_new, newrow, 0) >> #Write appropriate piping info into the new array >> (rowtest, columntest) = >> numpy.where(unique_room_eqp_list==locater) >> for row_iter in rowtest: >> pipe_locater = unique_room_piping_list[row_iter,0] >> if len(pipe_locater) == 4: >> pipe_drawing = unique_room_piping_list[row_iter,1] >> if pipe_locater == locater: >> piperow = >> >> numpy.array([[int(treelevel)+1,elevation,pipe_locater,area,'',pipe_drawing]]) >> NI_data_list_new = >> numpy.append(NI_data_list_new, piperow, 0) >> #Rewrite NI_data to new list >> if NIroom == '': >> NI_data_list_new = numpy.append(NI_data_list_new,[NI_row],0) >> >> print (time.clock()-starttime) >> >> >> some relevant output >> >> >>> print NI_data_list_new >> [['TreeDepth' 'Elevation' 'BuildingLocater' 'Area' 'Room' 'Item'] >> ['0' '' '1000' '' '' ''] >> ['1' '' '1000' '' '' 'docname Rev 0'] >> ..., >> ['5' '6' '1164' '4' '' 'eqp11 RB, R. surname, 24-NOV-08'] >> ['4' '6' '1164' '4' '' 'anotherdoc Rev A'] >> ['0' '' '' '' '' '']] >> >> >> Is numpy.append so slow? or is the culprit numpy.where? >> >> Dewald Pieterse >> >> "A democracy is nothing more than mob rule, where fifty-one percent of >> the people take away the rights of the other forty-nine." ~ Thomas >> Jefferson >> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > > > -- > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Dewald Pieterse "A democracy is nothing more than mob rule, where fifty-one percent of the people take away the rights of the other forty-nine." ~ Thomas Jefferson -------------- next part -------------- An HTML attachment was scrubbed... URL: From dewald.pieterse at gmail.com Thu Jan 27 16:47:43 2011 From: dewald.pieterse at gmail.com (Dewald Pieterse) Date: Thu, 27 Jan 2011 16:47:43 -0500 Subject: [Numpy-discussion] numpy.append & numpy.where vs list.append and brute iterative for loop In-Reply-To: References: <4D41E16F.5060505@noaa.gov> Message-ID: On Thu, Jan 27, 2011 at 4:33 PM, Dewald Pieterse wrote: > > > On Thu, Jan 27, 2011 at 4:19 PM, Christopher Barker > wrote: > >> On 1/27/11 1:03 PM, Dewald Pieterse wrote: >> >>> I am processing two csv files against another, my first implementation >>> used python list of lists and list.append to generate a new list while >>> looping all the data including the non-relevant data (can't determine >>> location of specific data element in a list of list). So I re-implented >>> the exact same code but using numpy.array's (2d arrays) using >>> numpy.where to prevent looping over an entire dataset needlessly but the >>> numpy.array based code is about 7.6 times slower? 
>>> >> >> Didn't look at your code in any detail, but: >> >> numpy arrays are not designed to be re-sizable, so numpy.append actually >> creates a new array, and copies the old to the new, along with the new >> stuff. It's a convenience function, but it means you are re-allocating and >> copying all your data with each call. >> >> python lists, on the other hand, are designed to be re-sizable, so they >> pre-allocate extra room, so that appending can be fast. >> >> In general, the recommended solution in this sort of situation is to build >> up your data in a python list, then convert it to an array. >> >> If I'm right about what you're doing you could keep the "rows" as numpy >> arrays, but put them in a list while building it up. >> > > Thanks Chris, I believe this is the problem then, I can continue to use the > arrays as reference data but build list instead, the only reason I used the > arrays was to be able to use numpy.where, I just use both data types, best > of both worlds. As I already have row arrays I will do a build a list or > arrays. > Now my code is nearly 4 times faster than the list of lists implementation! Wonderful, thanks. > >> Also, a numpy array of strings isn't necessarily a great dats structure >> for this kind of data. YOu might want to look at structured arrays. >> > > Atm, I use : > comit_eqp_reader = csv.reader(comit_eqp_file, delimiter=',', quotechar='"') > comit_eqp_lt = numpy.array([[col for col in row] for row in > comit_eqp_reader]) > to setup the arrays, I will look at using structured arrays > >> >> I wrote an appendable numpy array class a while back, to address this. It >> has some advantages, though, as it it written, not as much as you'd think. >> It does have some benifits for structured arrays, though. >> >> >> Code enclosed >> >> -Chris >> >> >> >> relevant list of list code: >>> >>> starttime = time.clock() >>> #NI_data_list room_eqp_list >>> NI_data_list_new = [] >>> for NI_row in NI_data_list: >>> treelevel = NI_row[0] >>> elevation = NI_row[1] >>> locater = NI_row[2] >>> area = NI_row[3] >>> NIroom = NI_row[4] >>> #Write appropriate equipment models and drawing into new list >>> if NIroom != '': >>> #Write appropriate equipment models and drawing into new list >>> for row in room_eqp_list: >>> eqp_room = row[0] >>> if len(eqp_room) == 5: >>> eqp_drawing = row[1] >>> if NIroom == eqp_room: >>> newrow = >>> [int(treelevel)+1,elevation,locater,area,NIroom,eqp_drawing] >>> NI_data_list_new.append(newrow) >>> #Write appropriate piping info into the new list >>> for prow in unique_piping_list: >>> pipe_room = prow[0] >>> if len(pipe_room) == 5: >>> pipe_drawing = prow[1] >>> if pipe_room == NIroom: >>> piperow = >>> [int(treelevel)+1,elevation,locater,area,NIroom,pipe_drawing] >>> NI_data_list_new.append(piperow) >>> #Write appropriate equipment models and drawing into new list >>> if (locater != '' and NIroom == ''): >>> #Write appropriate equipment models and drawing into new list >>> for row in room_eqp_list: >>> eqp_locater = row[0] >>> if len(eqp_locater) == 4: >>> eqp_drawing = row[1] >>> if locater == eqp_locater: >>> newrow = >>> [int(treelevel)+1,elevation,eqp_locater,area,'',eqp_drawing] >>> NI_data_list_new.append(newrow) >>> #Write appropriate piping info into the new list >>> for prow in unique_piping_list: >>> pipe_locater = prow[0] >>> if len(pipe_locater) == 4: >>> pipe_drawing = prow[1] >>> if pipe_locater == locater: >>> piperow = >>> [int(treelevel)+1,elevation,pipe_locater,area,'',pipe_drawing] >>> 
NI_data_list_new.append(piperow) >>> #Rewrite NI_data to new list >>> if NIroom == '': >>> NI_data_list_new.append(NI_row) >>> >>> print (time.clock()-starttime) >>> >>> >>> relevant numpy.array code: >>> >>> NI_data_write_url = reports_dir + 'NI_data_room2.csv' >>> NI_data_list_file = open(NI_data_write_url, 'wb') >>> NI_data_list_writer = csv.writer(NI_data_list_file, delimiter=',', >>> quotechar='"') >>> starttime = time.clock() >>> #NI_data_list room_eqp_list >>> NI_data_list_new = numpy.array([['TreeDepth', 'Elevation', >>> 'BuildingLocater', 'Area', 'Room', 'Item']]) >>> for NI_row in NI_data_list: >>> treelevel = NI_row[0] >>> elevation = NI_row[1] >>> locater = NI_row[2] >>> area = NI_row[3] >>> NIroom = NI_row[4] >>> #Write appropriate equipment models and drawing into new array >>> if NIroom != '': >>> #Write appropriate equipment models and drawing into new >>> array >>> (rowtest, columntest) = numpy.where(room_eqp_list==NIroom) >>> for row_iter in rowtest: >>> eqp_room = room_eqp_list[row_iter,0] >>> if len(eqp_room) == 5: >>> eqp_drawing = room_eqp_list[row_iter,1] >>> if NIroom == eqp_room: >>> newrow = >>> >>> numpy.array([[int(treelevel)+1,elevation,locater,area,NIroom,eqp_drawing]]) >>> NI_data_list_new = >>> numpy.append(NI_data_list_new, newrow, 0) >>> >>> #Write appropriate piping info into the new array >>> (rowtest, columntest) = >>> numpy.where(unique_room_piping_list==NIroom) >>> for row_iter in rowtest: #unique_room_piping_list >>> pipe_room = unique_room_piping_list[row_iter,0] >>> if len(pipe_room) == 5: >>> pipe_drawing = unique_room_piping_list[row_iter,1] >>> if pipe_room == NIroom: >>> piperow = >>> >>> numpy.array([[int(treelevel)+1,elevation,locater,area,NIroom,pipe_drawing]]) >>> NI_data_list_new = >>> numpy.append(NI_data_list_new, piperow, 0) >>> #Write appropriate equipment models and drawing into new array >>> if (locater != '' and NIroom == ''): >>> #Write appropriate equipment models and drawing into new >>> array >>> (rowtest, columntest) = numpy.where(room_eqp_list==locater) >>> for row_iter in rowtest: >>> eqp_locater = room_eqp_list[row_iter,0] >>> if len(eqp_locater) == 4: >>> eqp_drawing = room_eqp_list[row_iter,1] >>> if locater == eqp_locater: >>> newrow = >>> >>> numpy.array([[int(treelevel)+1,elevation,eqp_locater,area,'',eqp_drawing]]) >>> NI_data_list_new = >>> numpy.append(NI_data_list_new, newrow, 0) >>> #Write appropriate piping info into the new array >>> (rowtest, columntest) = >>> numpy.where(unique_room_eqp_list==locater) >>> for row_iter in rowtest: >>> pipe_locater = unique_room_piping_list[row_iter,0] >>> if len(pipe_locater) == 4: >>> pipe_drawing = unique_room_piping_list[row_iter,1] >>> if pipe_locater == locater: >>> piperow = >>> >>> numpy.array([[int(treelevel)+1,elevation,pipe_locater,area,'',pipe_drawing]]) >>> NI_data_list_new = >>> numpy.append(NI_data_list_new, piperow, 0) >>> #Rewrite NI_data to new list >>> if NIroom == '': >>> NI_data_list_new = numpy.append(NI_data_list_new,[NI_row],0) >>> >>> print (time.clock()-starttime) >>> >>> >>> some relevant output >>> >>> >>> print NI_data_list_new >>> [['TreeDepth' 'Elevation' 'BuildingLocater' 'Area' 'Room' 'Item'] >>> ['0' '' '1000' '' '' ''] >>> ['1' '' '1000' '' '' 'docname Rev 0'] >>> ..., >>> ['5' '6' '1164' '4' '' 'eqp11 RB, R. surname, 24-NOV-08'] >>> ['4' '6' '1164' '4' '' 'anotherdoc Rev A'] >>> ['0' '' '' '' '' '']] >>> >>> >>> Is numpy.append so slow? or is the culprit numpy.where? 
>>> >>> Dewald Pieterse >>> >>> "A democracy is nothing more than mob rule, where fifty-one percent of >>> the people take away the rights of the other forty-nine." ~ Thomas >>> Jefferson >>> >>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >> >> >> -- >> Christopher Barker, Ph.D. >> Oceanographer >> >> Emergency Response Division >> NOAA/NOS/OR&R (206) 526-6959 voice >> 7600 Sand Point Way NE (206) 526-6329 fax >> Seattle, WA 98115 (206) 526-6317 main reception >> >> Chris.Barker at noaa.gov >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > > -- > Dewald Pieterse > > "A democracy is nothing more than mob rule, where fifty-one percent of the > people take away the rights of the other forty-nine." ~ Thomas Jefferson > -- Dewald Pieterse "A democracy is nothing more than mob rule, where fifty-one percent of the people take away the rights of the other forty-nine." ~ Thomas Jefferson -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla at molden.no Thu Jan 27 16:47:22 2011 From: sturla at molden.no (Sturla Molden) Date: Thu, 27 Jan 2011 22:47:22 +0100 Subject: [Numpy-discussion] numpy.append & numpy.where vs list.append and brute iterative for loop In-Reply-To: References: Message-ID: <4D41E7EA.6080603@molden.no> Den 27.01.2011 22:03, skrev Dewald Pieterse: > Is numpy.append so slow? or is the culprit numpy.where? Please observe that appending to a Python list is amortized O(1), whereas appending to a numpy array is O(N**2). Sturla From sturla at molden.no Thu Jan 27 16:53:40 2011 From: sturla at molden.no (Sturla Molden) Date: Thu, 27 Jan 2011 22:53:40 +0100 Subject: [Numpy-discussion] numpy.append & numpy.where vs list.append and brute iterative for loop In-Reply-To: <4D41E7EA.6080603@molden.no> References: <4D41E7EA.6080603@molden.no> Message-ID: <4D41E964.5030100@molden.no> Den 27.01.2011 22:47, skrev Sturla Molden: > > Please observe that appending to a Python list is amortized O(1), > whereas appending to a numpy array is O(N**2). > Sorry, one append to a numpy array is O(N). But N appends are O(N) for lists and O(N*N) for arrays. S.M. From sturla at molden.no Thu Jan 27 17:47:47 2011 From: sturla at molden.no (Sturla Molden) Date: Thu, 27 Jan 2011 23:47:47 +0100 Subject: [Numpy-discussion] Wiener filter / decorrelation In-Reply-To: References: <4D3F3AEA.9050600@grinta.net> <4D3F401A.3060203@grinta.net> Message-ID: <4D41F613.9080700@molden.no> Den 25.01.2011 23:21, skrev Jonathan Rocher: > Actually I believe the version does matter: I have seen a C version of > num rec that doesn't contain all the algorithmic part but only the > codes. I cannot remember exactly which ones are the light versions. If > I had to guess, the F90 is also a light version and that's why I > bought the F77 book. The F90 version is meant to be read in conjunction with the F77 version, not alone. It is very useful for NumPy programmers, as it is one of few text books that deals with vectorisation of algorithms. (F90 is an array-oriented language like Matlab and NumPy.) It is also the NR version with the "cleanest" source code examples. NR in C uses a nasty (and illegal) hack to get base-1 arrays in C. It is also notorious for numerically unstable code, and should never have been published. 
That is why the authors later published "NR in C++" to rescue their image. NR's third edition is utterly atrocious. It uses C++ OOP for code obfuscation, such as inheritance and functors (objects instead of functions), which is not instructive at all in explaining "numerical methods". They also play with methods and inheritance in structs, not just classes, which can confuse readers who do not know the dusty corners of C++. The text is also messier to read, less organized, and some of it is badly written compared to previous editions. But the scope is more extensive. It has many valuable details that should have been covered in previous versions, but it is presented in a way that makes me barf. Also beware of common NR pitfalls like unstable SVD, slow FFTs, bad PRNGs, etc. Always use proper libraries like LAPACK, BLAS, FFTW, et al. NR code is just for inspiration. :-) Sturla From sturla at molden.no Thu Jan 27 17:57:48 2011 From: sturla at molden.no (Sturla Molden) Date: Thu, 27 Jan 2011 23:57:48 +0100 Subject: [Numpy-discussion] Wiener filter / decorrelation In-Reply-To: <4D41F613.9080700@molden.no> References: <4D3F3AEA.9050600@grinta.net> <4D3F401A.3060203@grinta.net> <4D41F613.9080700@molden.no> Message-ID: <4D41F86C.40507@molden.no> Den 27.01.2011 23:47, skrev Sturla Molden: > The F90 version is meant to be read in conjunction with the F77 version, > not alone. It is very useful for NumPy programmers, as it is one of few > text books that deals with vectorisation of algorithms. (F90 is an > array-oriented language like Matlab and NumPy.) It is also the NR > version with the "cleanest" source code examples. BTW, they are available here: http://www.nrbook.com/a/bookfpdf.php http://www.nrbook.com/a/bookf90pdf.php Sturla From oliphant at enthought.com Thu Jan 27 18:15:13 2011 From: oliphant at enthought.com (Travis Oliphant) Date: Thu, 27 Jan 2011 17:15:13 -0600 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: <0A6387EF-1227-46C0-BB60-C5AA7A024D68@enthought.com> My $0.02 on the NumPy 2.0 schedule: NumPy 2.0 is for ABI-incompatible changes like datetime support and .NET support. It would be ideal if, at the same time, we could future-proof the ABI somewhat so that future changes can be made in an ABI-compatible way. I also think it would be a good idea to incorporate Mark's small-array improvements into the C-structure of NumPy arrays. If Mark has time to work on this, I have some hope we can get there. I have been wanting to propose a "generator array" for some time now, but have not had time to write it up. I have the outline of a design that overlaps but I think generalizes Mark's deferred arrays. Mark's deferred arrays would be a particular realization of the generator array, but other realizations are possible as well. There is much that has to be fleshed out for it to really work, and I think it will have to be in NumPy 2.0 because it will create ABI changes. I don't have the time to personally implement the design. If there are others out there that have the time, I would love to talk with them about it. However, I don't want to distract from this scheduling thread to discuss the ideas (I will post something else for that). The reason for a NumPy 1.6 suggestion is that Mark (and others, it would seem) have additional work and features that do not need to wait for the NumPy 2.0 ABI design to finalize in order to get out there. If someone is willing to manage the release of NumPy 1.6, then it sounds like a great idea to me.
-Travis Basically the reason for On Jan 27, 2011, at 11:56 AM, Mark Wiebe wrote: > On Thu, Jan 27, 2011 at 9:36 AM, Charles R Harris wrote: > > All tests pass for me now, maybe it's a good time to merge the branch into the trunk so we can run it on the buildbot? > > > Might be better to merge your unadulterated stuff into master, make a 1.6 branch, and add the compatibility fixes in the branch. You can test branches on the buildbot I think, at least that worked for svn, I haven't tried it with github. > > I'm inclined to put the ABI fixes in trunk as well for the time being. The two changes of note, moving the 'cast' array to the end of PyArray_ArrFuncs and making 'flags' in PyArray_Descr bigger, can be reapplied if the 2.0 refactor ends up needing them. I think for 2.0, more extensive future-proofing will be desirable anyway, so trunk may as well be ABI compatible until it's clear what changes are necessary. > > -Mark > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion --- Travis Oliphant Enthought, Inc. oliphant at enthought.com 1-512-536-1057 http://www.enthought.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From teoliphant at gmail.com Thu Jan 27 18:17:20 2011 From: teoliphant at gmail.com (Travis Oliphant) Date: Thu, 27 Jan 2011 17:17:20 -0600 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list Message-ID: Hey all, What is the thought about having two separate NumPy lists (one for development discussions and one for user discussions)? -Travis From robert.kern at gmail.com Thu Jan 27 18:23:00 2011 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 27 Jan 2011 17:23:00 -0600 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: References: Message-ID: On Thu, Jan 27, 2011 at 17:17, Travis Oliphant wrote: > > Hey all, > > What is the thought about having two separate NumPy lists (one for development discussions and one for user discussions)? We've resisted it for years. I don't think the split has done scipy much good. But that may just be my perspective because I'm subscribed to both and filter them both to the same folder. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." ? -- Umberto Eco From chanley at stsci.edu Thu Jan 27 18:24:42 2011 From: chanley at stsci.edu (Christopher Hanley) Date: Thu, 27 Jan 2011 18:24:42 -0500 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: References: Message-ID: On Thu, Jan 27, 2011 at 6:23 PM, Robert Kern wrote: > On Thu, Jan 27, 2011 at 17:17, Travis Oliphant > wrote: > > > > Hey all, > > > > What is the thought about having two separate NumPy lists (one for > development discussions and one for user discussions)? > > We've resisted it for years. I don't think the split has done scipy > much good. But that may just be my perspective because I'm subscribed > to both and filter them both to the same folder. > I do the same as Robert. I don't see much value in creating separate lists. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From charlesr.harris at gmail.com Thu Jan 27 18:32:22 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 27 Jan 2011 16:32:22 -0700 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: References: Message-ID: On Thu, Jan 27, 2011 at 4:23 PM, Robert Kern wrote: > On Thu, Jan 27, 2011 at 17:17, Travis Oliphant > wrote: > > > > Hey all, > > > > What is the thought about having two separate NumPy lists (one for > development discussions and one for user discussions)? > > We've resisted it for years. I don't think the split has done scipy > much good. But that may just be my perspective because I'm subscribed > to both and filter them both to the same folder. > > Me too. I don't think there is so much traffic that a distinction needs to be made. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From Chris.Barker at noaa.gov Thu Jan 27 18:33:36 2011 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 27 Jan 2011 15:33:36 -0800 Subject: [Numpy-discussion] numpy.append & numpy.where vs list.append and brute iterative for loop In-Reply-To: <4D41E964.5030100@molden.no> References: <4D41E7EA.6080603@molden.no> <4D41E964.5030100@molden.no> Message-ID: <4D4200D0.7070007@noaa.gov> On 1/27/11 1:53 PM, Sturla Molden wrote: > But N appends are O(N) for lists and O(N*N) for arrays. hmmm - that doesn't seem quite right -- lists still have to re-allocate and copy, they just do it every n times (where n grows with the list), so I wouldn't expect exactly O(N). But you never know 'till you profile. See the enclosed code and figures. Interestingly both appear to be pretty linear, though the constant is Much larger for numpy arrays. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- A non-text attachment was scrubbed... Name: append_time.py Type: application/x-python Size: 1006 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: append_timing.png Type: image/png Size: 40428 bytes Desc: not available URL: From oliphant at enthought.com Thu Jan 27 18:35:38 2011 From: oliphant at enthought.com (Travis Oliphant) Date: Thu, 27 Jan 2011 17:35:38 -0600 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: References: Message-ID: <6C754EFA-AD24-4AE1-86F7-EFD4321A5F71@enthought.com> I think for me, the trouble is I don't have time to read all the messages, but I want to see developer-centric discussions. Sometimes, I can tell that from the subject (but I miss it). I agree that traffic is probably not too heavy at this point (but it does create some difficulty in keeping up). I know we have resisted it for years. I appreciate the comments. -Travis On Jan 27, 2011, at 5:32 PM, Charles R Harris wrote: > > > On Thu, Jan 27, 2011 at 4:23 PM, Robert Kern wrote: > On Thu, Jan 27, 2011 at 17:17, Travis Oliphant wrote: > > > > Hey all, > > > > What is the thought about having two separate NumPy lists (one for development discussions and one for user discussions)? > > We've resisted it for years. I don't think the split has done scipy > much good. But that may just be my perspective because I'm subscribed > to both and filter them both to the same folder. > > > Me too. 
I don't think there is so much traffic that a distinction needs to be made. > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion --- Travis Oliphant Enthought, Inc. oliphant at enthought.com 1-512-536-1057 http://www.enthought.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla at molden.no Thu Jan 27 18:54:03 2011 From: sturla at molden.no (Sturla Molden) Date: Fri, 28 Jan 2011 00:54:03 +0100 Subject: [Numpy-discussion] numpy.append & numpy.where vs list.append and brute iterative for loop In-Reply-To: <4D4200D0.7070007@noaa.gov> References: <4D41E7EA.6080603@molden.no> <4D41E964.5030100@molden.no> <4D4200D0.7070007@noaa.gov> Message-ID: <4D42059B.5080106@molden.no> Den 28.01.2011 00:33, skrev Christopher Barker: > > hmmm - that doesn't seem quite right -- lists still have to > re-allocate and copy, they just do it every n times (where n grows > with the list), so I wouldn't expect exactly O(N). Lists allocate empty slots at their back, proportional to their size. So as lists grows, re-allocations become rarer and rarer. Then on average the complexity per append becomes O(1), which is the "amortised" complexity. Appending N items to a list thus has the amortized complexity O(N). The advantage of this implementation over linked lists is that indexing will be O(1) as well. NumPy arrays are designed to be fixed size, and not designed to amortize the complexity of appends. So if you want to use arrays as efficient re-sizeable containers, you must code this logic yourself. Sturla From oliphant at enthought.com Thu Jan 27 19:01:27 2011 From: oliphant at enthought.com (Travis Oliphant) Date: Thu, 27 Jan 2011 18:01:27 -0600 Subject: [Numpy-discussion] Generator arrays Message-ID: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> Just to start the conversation, and to find out who is interested, I would like to informally propose generator arrays for NumPy 2.0. This concept has as one use-case, the deferred arrays that Mark Wiebe has proposed. But, it also allows for "compressed arrays", on-the-fly computed arrays, and streamed or generated arrays. Basically, the modification I would like to make is to have an array flag (MEMORY) that when set means that the data attribute of a numpy array is a pointer to the address in memory where the data begins with the strides attribute pointing to a C-array of integers (in other words, all current arrays are MEMORY arrays) But, when the MEMORY flag is not set, the data attribute instead points to a length-2 C-array of pointers to functions [read(N, output_address, self->index_iter, self->extra), write(N, input_address, self->index_iter, self->extra)] Either of these could then be NULL (i.e. if write is NULL, then the array must be read-only). When the MEMORY flag is not set, the strides member of the ndarray structure is a pointer to the index_iter object (which could be anything that the particular read and write methods need it to be). The array structure should also get a member to hold the "extra" argument (which would hold any state that the array needed to hold on to in order to correctly perform the read or write operations --- i.e. it could hold an execution graph for deferred evaluation). The index_iter structure is anything that the read and write methods need to correctly identify *where* to write. 
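(As a purely illustrative toy -- a Python sketch with made-up names, and with a simplified per-element read(index, index_iter, extra) signature rather than the N-element read/write described above -- the flavor of the idea is something like:

    import numpy as np

    class GeneratorArray(object):
        """Sketch of an array-like object whose elements come from
        read/write callbacks instead of a memory buffer."""
        def __init__(self, shape, dtype, read, write=None,
                     index_iter=None, extra=None):
            self.shape = shape
            self.dtype = dtype
            self._read = read              # read(index, index_iter, extra) -> value
            self._write = write            # write(index, value, index_iter, extra), or None
            self._index_iter = index_iter  # state for computing *where* to read/write
            self._extra = extra            # state for computing *what* to read/write

        def __getitem__(self, index):
            return self.dtype(self._read(index, self._index_iter, self._extra))

        def __setitem__(self, index, value):
            if self._write is None:
                raise TypeError("this generator array is read-only")
            self._write(index, value, self._index_iter, self._extra)

    # e.g. a "computed" array of squares that never materializes its data:
    squares = GeneratorArray((10,), np.float64, read=lambda i, it, extra: i * i)
    print(squares[4])   # -> 16.0

Of course the real thing would live at the C level in the ndarray struct, not in Python.)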
Now, clearly, we could combine index_iter and extra into just one "structure" that holds all needed state for read and write to work correctly. The reason I propose two slots is because at least mentally in the use case of having these structures be calculation graphs, one of these structures is involved in "computing the location to read/write" and the other is involved in "computing what to read/write" The idea is fairly simple, but with some very interesting potential features: * lazy evaluation (of indexing, ufuncs, etc.) * fancy indexing as views instead of copies (really just another example of lazy evaluation) * compressed arrays * generated arrays (from computation or streamed data) * infinite arrays * computed arrays * missing-data arrays * ragged arrays (shape would be the bounding box --- which makes me think of ragged arrays as examples of masked arrays). * arrays that view PIL data. One could build an array with a (logically) infinite number of elements (we could use -2 in the shape tuple to indicate that). We don't need examples of all of these features for NumPy 2.0 to be released, because to really make this useful, we would need to modify all "calculation" code to produce a NON MEMORY array. What to do here still needs a lot of thought and experimentation. But, I can think about a situation where all NumPy calculations that produce arrays provide the option that when they are done inside of a particular context, a user-supplied behavior over-rides the default return. I want to study what Mark is proposing and understand his new iterator at a deeper level before providing more thoughts here. That's the gist of what I am thinking about. I would love feedback and comments. The other things I would like to see in NumPy 2.0 that have not been discussed lately (that could affect the ABI) are: * a geometry member to the data structure (that allows labels to dimensions and axes to be provided -- ala data_array) * small array performance improvements that Mark Wiebe has suggested (including the addition of an optional low-level loop that is used when you have contiguous data) * completed datetime implementation * pointer data-types (i.e. the memory location holds a pointer to another part of an ndarray) --- very useful for "join" - type arrays If anybody is interested in helping with any of these (and has time to do it, let me know). Some of this I could fund (especially if you are willing to come to Austin and be an intern for Enthought). Best regards, -Travis P.S. I hope to have more time this year to hang-out here on the numpy-discussion list (but we will see....) From sturla at molden.no Thu Jan 27 19:07:40 2011 From: sturla at molden.no (Sturla Molden) Date: Fri, 28 Jan 2011 01:07:40 +0100 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: References: Message-ID: <4D4208CC.60607@molden.no> Den 28.01.2011 00:23, skrev Robert Kern: > We've resisted it for years. I don't think the split has done scipy > much good. The scope of NumPy is narrower development-wise and wider user-wise. While SciPy does not benefit, as use and development are still quite entangled, this is not be the case for NumPy. Perhaps we could split the NumPy list and merge the SciPy lists? 
Sturla From brennan.williams at visualreservoir.com Thu Jan 27 19:12:40 2011 From: brennan.williams at visualreservoir.com (Brennan Williams) Date: Fri, 28 Jan 2011 13:12:40 +1300 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: <4D4208CC.60607@molden.no> References: <4D4208CC.60607@molden.no> Message-ID: <4D4209F8.4080901@visualreservoir.com> On 28/01/2011 1:07 p.m., Sturla Molden wrote: > Den 28.01.2011 00:23, skrev Robert Kern: >> We've resisted it for years. I don't think the split has done scipy >> much good. > The scope of NumPy is narrower development-wise and wider user-wise. > While SciPy does not benefit, as use and development are still quite > entangled, this is not be the case for NumPy. > > Perhaps we could split the NumPy list and merge the SciPy lists? As a user of NumPy and SciPy (and there are probably a lot of people who use both) why not have one developer list for both NumPy and SciPy and one user list for both NumPy and SciPy? It might not work from a developer point of view but I think it does from a user point of view - mind you I just put everything of interest from both mailing lists into one folder. Brennan From jlhouchin at gmail.com Thu Jan 27 19:34:48 2011 From: jlhouchin at gmail.com (Jimmie Houchin) Date: Thu, 27 Jan 2011 18:34:48 -0600 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: <6C754EFA-AD24-4AE1-86F7-EFD4321A5F71@enthought.com> References: <6C754EFA-AD24-4AE1-86F7-EFD4321A5F71@enthought.com> Message-ID: On 1/27/2011 5:35 PM, Travis Oliphant wrote: > I think for me, the trouble is I don't have time to read all the > messages, but I want to see developer-centric discussions. Sometimes, I > can tell that from the subject (but I miss it). > > I agree that traffic is probably not too heavy at this point (but it > does create some difficulty in keeping up). > > I know we have resisted it for years. I appreciate the comments. > > -Travis Maybe a convention can voluntarily be adopted using a DEV or some such subject prefix by the author when an author submits such a developer oriented message. It wouldn't be perfect but could possibly aid in scanning subjects of interest when pressed for time if the convention became sufficiently adopted. Possibly such a convention could be put into an FAQ on the website about the mailing list. Also included in a welcome message to subscribers. Just a thought. Jimmie From charlesr.harris at gmail.com Thu Jan 27 19:37:00 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 27 Jan 2011 17:37:00 -0700 Subject: [Numpy-discussion] Generator arrays In-Reply-To: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> References: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> Message-ID: On Thu, Jan 27, 2011 at 5:01 PM, Travis Oliphant wrote: > > Just to start the conversation, and to find out who is interested, I would > like to informally propose generator arrays for NumPy 2.0. This concept > has as one use-case, the deferred arrays that Mark Wiebe has proposed. But, > it also allows for "compressed arrays", on-the-fly computed arrays, and > streamed or generated arrays. 
> > Basically, the modification I would like to make is to have an array flag > (MEMORY) that when set means that the data attribute of a numpy array is a > pointer to the address in memory where the data begins with the strides > attribute pointing to a C-array of integers (in other words, all current > arrays are MEMORY arrays) > > But, when the MEMORY flag is not set, the data attribute instead points to > a length-2 C-array of pointers to functions > > [read(N, output_address, self->index_iter, self->extra), write(N, > input_address, self->index_iter, self->extra)] > > Either of these could then be NULL (i.e. if write is NULL, then the array > must be read-only). > > When the MEMORY flag is not set, the strides member of the ndarray > structure is a pointer to the index_iter object (which could be anything > that the particular read and write methods need it to be). > > The array structure should also get a member to hold the "extra" argument > (which would hold any state that the array needed to hold on to in order to > correctly perform the read or write operations --- i.e. it could hold an > execution graph for deferred evaluation). > > The index_iter structure is anything that the read and write methods need > to correctly identify *where* to write. Now, clearly, we could combine > index_iter and extra into just one "structure" that holds all needed state > for read and write to work correctly. The reason I propose two slots is > because at least mentally in the use case of having these structures be > calculation graphs, one of these structures is involved in "computing the > location to read/write" and the other is involved in "computing what to > read/write" > > The idea is fairly simple, but with some very interesting potential > features: > > * lazy evaluation (of indexing, ufuncs, etc.) > * fancy indexing as views instead of copies (really just another > example of lazy evaluation) > * compressed arrays > * generated arrays (from computation or streamed data) > * infinite arrays > * computed arrays > * missing-data arrays > * ragged arrays (shape would be the bounding box --- which makes me > think of ragged arrays as examples of masked arrays). > * arrays that view PIL data. > > One could build an array with a (logically) infinite number of elements (we > could use -2 in the shape tuple to indicate that). > > We don't need examples of all of these features for NumPy 2.0 to be > released, because to really make this useful, we would need to modify all > "calculation" code to produce a NON MEMORY array. What to do here still > needs a lot of thought and experimentation. > > But, I can think about a situation where all NumPy calculations that > produce arrays provide the option that when they are done inside of a > particular context, a user-supplied behavior over-rides the default return. > I want to study what Mark is proposing and understand his new iterator at > a deeper level before providing more thoughts here. > > That's the gist of what I am thinking about. I would love feedback and > comments. 
> > The other things I would like to see in NumPy 2.0 that have not been > discussed lately (that could affect the ABI) are: > > * a geometry member to the data structure (that allows labels to > dimensions and axes to be provided -- ala data_array) > * small array performance improvements that Mark Wiebe has suggested > (including the addition of an optional low-level loop that is used when you > have contiguous data) > * completed datetime implementation > * pointer data-types (i.e. the memory location holds a pointer to > another part of an ndarray) --- very useful for "join" - type arrays > > If anybody is interested in helping with any of these (and has time to do > it, let me know). Some of this I could fund (especially if you are willing > to come to Austin and be an intern for Enthought). > > Best regards, > > I'd kind of like to keep arrays simple, they are already pretty complex objects. Perhaps a higher level interface to lower level objects with a common API would be an easier way to go, that way functionality could be added piecewise as the need arises. I think would be good to stick to need driven additions as otherwise it is easy to get sucked into the quagmire of trying to design for every need and eventuality and projects like that never finish. What happens to the buffer API/persistence with all those additions? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From Chris.Barker at noaa.gov Thu Jan 27 19:42:40 2011 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 27 Jan 2011 16:42:40 -0800 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: <6C754EFA-AD24-4AE1-86F7-EFD4321A5F71@enthought.com> References: <6C754EFA-AD24-4AE1-86F7-EFD4321A5F71@enthought.com> Message-ID: <4D421100.5040503@noaa.gov> On 1/27/11 3:35 PM, Travis Oliphant wrote: >>> What is the thought about having two separate NumPy lists (one for development discussions and one for user discussions)? Speaking as someone who hasn't contributed code to numpy itself, I still really like to follow the development discussion, so I'll subscribe to both lists anyway, and filter them to the same place in my email. So it makes little difference to me. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From Chris.Barker at noaa.gov Thu Jan 27 19:46:13 2011 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 27 Jan 2011 16:46:13 -0800 Subject: [Numpy-discussion] numpy.append & numpy.where vs list.append and brute iterative for loop In-Reply-To: <4D42059B.5080106@molden.no> References: <4D41E7EA.6080603@molden.no> <4D41E964.5030100@molden.no> <4D4200D0.7070007@noaa.gov> <4D42059B.5080106@molden.no> Message-ID: <4D4211D5.3060704@noaa.gov> On 1/27/11 3:54 PM, Sturla Molden wrote: > Lists allocate empty slots at their back, proportional to their size. So > as lists grows, re-allocations become rarer and rarer. Then on average > the complexity per append becomes O(1), which is the "amortised" > complexity. Appending N items to a list thus has the amortized > complexity O(N). I think I get that now... > NumPy arrays are designed to be fixed size, and not designed to amortize > the complexity of appends. So if you want to use arrays as efficient > re-sizeable containers, you must code this logic yourself. And I do get that. 
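For concreteness, a minimal sketch of that "code it yourself" growth logic
(class and names made up, just to illustrate the amortized-doubling idea):

    import numpy as np

    class Growable(object):
        # appendable container backed by an ndarray that doubles its
        # capacity when full, so appends are amortized O(1)
        def __init__(self, dtype=np.float64):
            self._data = np.empty(4, dtype=dtype)
            self._n = 0
        def append(self, value):
            if self._n == len(self._data):
                bigger = np.empty(2 * len(self._data), dtype=self._data.dtype)
                bigger[:self._n] = self._data[:self._n]
                self._data = bigger
            self._data[self._n] = value
            self._n += 1
        def asarray(self):
            return self._data[:self._n]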
And yet, experimentally, appending numpy arrays (on that one simple example) appeared to be O(N). Granted, a much larger constant that for lists, but it sure looks linear to me. Should it be O(N^2)? Maybe I need to run it for larger N , but I got impatient as it is. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From ben.root at ou.edu Thu Jan 27 22:02:07 2011 From: ben.root at ou.edu (Benjamin Root) Date: Thu, 27 Jan 2011 21:02:07 -0600 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: <4D421100.5040503@noaa.gov> References: <6C754EFA-AD24-4AE1-86F7-EFD4321A5F71@enthought.com> <4D421100.5040503@noaa.gov> Message-ID: On Thursday, January 27, 2011, Christopher Barker wrote: > On 1/27/11 3:35 PM, Travis Oliphant wrote: > >>>> What is the thought about having two separate NumPy lists (one for development discussions and one for user discussions)? > > Speaking as someone who hasn't contributed code to numpy itself, I still > really like to follow the development discussion, so I'll subscribe to > both lists anyway, and filter them to the same place in my email. So it > makes little difference to me. > > -Chris > > -- > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R ? ? ? ? ? ?(206) 526-6959 ? voice > 7600 Sand Point Way NE ? (206) 526-6329 ? fax > Seattle, WA ?98115 ? ? ? (206) 526-6317 ? main reception > > Chris.Barker at noaa.gov > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > Personally, I like separating the two lists and I think it has served matplotlib well. In gmail, I can tag them separately and with different colors. It makes it very easy to spot which emails I feel like handling at the moment. But, as I have been told, I am weird... Ben Root From bsouthey at gmail.com Thu Jan 27 22:23:32 2011 From: bsouthey at gmail.com (Bruce Southey) Date: Thu, 27 Jan 2011 21:23:32 -0600 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: References: <6C754EFA-AD24-4AE1-86F7-EFD4321A5F71@enthought.com> <4D421100.5040503@noaa.gov> Message-ID: On Thu, Jan 27, 2011 at 9:02 PM, Benjamin Root wrote: > On Thursday, January 27, 2011, Christopher Barker wrote: >> On 1/27/11 3:35 PM, Travis Oliphant wrote: >> >>>>> What is the thought about having two separate NumPy lists (one for development discussions and one for user discussions)? >> >> Speaking as someone who hasn't contributed code to numpy itself, I still >> really like to follow the development discussion, so I'll subscribe to >> both lists anyway, and filter them to the same place in my email. So it >> makes little difference to me. >> >> -Chris >> >> -- >> Christopher Barker, Ph.D. >> Oceanographer >> >> Emergency Response Division >> NOAA/NOS/OR&R ? ? ? ? ? ?(206) 526-6959 ? voice >> 7600 Sand Point Way NE ? (206) 526-6329 ? fax >> Seattle, WA ?98115 ? ? ? (206) 526-6317 ? main reception >> >> Chris.Barker at noaa.gov >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > > Personally, I like separating the two lists and I think it has served > matplotlib well. 
?In gmail, I can tag them separately and with > different colors. ?It makes it very easy to spot which emails I feel > like handling at the moment. > > But, as I have been told, I am weird... > > Ben Root > You bring up a good point, for those people that only use numpy or scipy then it makes sense to have separate lists. I have no problem with the lists as they are now. I also tag them differently so I tend to focus first on numpy then scipy-dev and finally scipy-user. For the most part people do post to the correct list so usually there is no confusion. Usually scipy-user tends to be very different from the other two but I do not see many major numpy threads that would really be the same as scipy-user. Also I think scipy-user has it's own vibe as well as scipy-dev being different than numpy list. Bruce From charlesr.harris at gmail.com Thu Jan 27 22:46:22 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 27 Jan 2011 20:46:22 -0700 Subject: [Numpy-discussion] Should we make the master branch backward compatible. Message-ID: Hi All, Mark Wiebe has proposed making the master branch backward compatible with 1.5. The argument for doing this is that 1) removing the new bits for new releases is a chore as the refactor schedule slips and 2) the new ABI isn't settled and keeping the current code in won't help with the merge. Mark thinks it is possible to keep the datetime types along with the new half types while restoring backward compatibility, and if so we won't lose anything by making the change. I'm in favor of this change, but I may have overlooked something. Thoughts? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Thu Jan 27 23:58:12 2011 From: cournape at gmail.com (David Cournapeau) Date: Fri, 28 Jan 2011 13:58:12 +0900 Subject: [Numpy-discussion] Should we make the master branch backward compatible. In-Reply-To: References: Message-ID: On Fri, Jan 28, 2011 at 12:46 PM, Charles R Harris wrote: > Hi All, > > Mark Wiebe has proposed making the master branch backward compatible with > 1.5. The argument for doing this is that 1) removing the new bits for new > releases is a chore as the refactor schedule slips and 2) the new ABI isn't > settled and keeping the current code in won't help with the merge. Mark > thinks it is possible to keep the datetime types along with the new half > types while restoring backward compatibility, and if so we won't lose > anything by making the change. I'm in favor of this change, but I may have > overlooked something. Thoughts? I would be in favor too, but having not being able to code much in numpy the last few months, my opinion should not carry too much weight. I don't know how many people install numpy from github nowadays (which are the first "victims" when ABI breaks) cheers, David From oliphant at enthought.com Fri Jan 28 01:37:16 2011 From: oliphant at enthought.com (Travis Oliphant) Date: Fri, 28 Jan 2011 00:37:16 -0600 Subject: [Numpy-discussion] Generator arrays In-Reply-To: References: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> Message-ID: <57B6A8C8-7480-4B90-A45F-D28AE4B5D151@enthought.com> > > What happens to the buffer API/persistence with all those additions? I understand the desire to keep things simple, which is why I am only proposing a rather small change to the array object with *huge* implications --- encompassing the very cool deferred arrays that Mark Wiebe is proposing. 
As Einstein said, "everything should be as simple as possilbe, *but not simpler*". While now arrays have a data-pointer that always points to memory and an accompanying strides array, all I'm suggesting is that they allow for "indirect" or "computed arrays" in a fairly simple, but general-purpose way. Generators have been such a huge feature in Python, I really think we need to figure out how to have "generated arrays" in NumPy as well --- and it turns out to have huge features that right now are difficult with NumPy (including deferred evaluation). I guess it's debatable how complex the array object is. I actually see the array object itself as quite simple even with the changes. What is complicated is how calculations are done and scattered in an ad hoc fashion between ufuncs and other array functions. I like the idea of unifying the calculation framework using ideas like Mark's iterators and the generic functions that were added earlier to ufuncs. I don't like the data-types holding on to the "calculation structures". I think all calculations in NumPy should fit under a common rubric. To me this would be an important part of any change. Obviously the buffer API could only be implemented for MEMORY arrays (other arrays would raise an error). What to do with persistence is a good question, but resolvable I think. Initially, I would also raise an error for trying to pickle arrays that are not MEMORY arrays --- simply calling "copy" on an array gives you something that can be persisted. Having this kind of functionality on the base NumPy object would be transformational for NumPy use. Yes, you could do similar things with other approaches, but there is a lot of benefit of having a powerful fundamental object that is a shared-place to mange the expression of data calculations. Another approach is to introduce another object as you suggest which is the "generator array". This could work, especially if there were hooks in the calculation engine that allowed it to be produced by array operations (say in an appropriate context as described before). My main conerns are that in practice having a whole slew of different "array objects" (i.e. masked arrays, data arrays, labeled arrays, etc.) tends to cause code to be much bulkier to read in-practice (as you are doing a lot of conversions back and forth to take advantage of APIs that require one array or another. Having code that is written to a single object is unifying and really assists with code re-use and code readability. One of the things I see happening is a tool like Cython being used to generate the call-graphs or read-write functions that are being proposed. I could be convinced, though, that leaving array objects alone and creating a better calculation object (i.e. something like an array vector machine) embracing and extending ufuncs is a better way to go. But, I haven't seen that proposal. -Travis From markbak at gmail.com Fri Jan 28 05:25:19 2011 From: markbak at gmail.com (Mark Bakker) Date: Fri, 28 Jan 2011 11:25:19 +0100 Subject: [Numpy-discussion] Error in tanh for large complex argument Message-ID: I'll file a ticket. Incidentally, if tanh(z) is simply programmed as (1.0 - exp(-2.0*z)) / (1.0 + exp(-2.0*z)) the problem is fixed. Thanks, Mark [clip] > > Not for large complex values: > > > > In [85]: tanh(1000+0j) > > Out[85]: (nan+nan*j) > > Yep, it's a bug. Care to file a ticket? > > The implementation is just sinh/cosh, which overflows. 
> The fix is to provide an asymptotic expansion (sgn Re z), > although around the imaginary axis the switch is perhaps > somewhat messy to handle. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Fri Jan 28 05:26:27 2011 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 28 Jan 2011 10:26:27 +0000 (UTC) Subject: [Numpy-discussion] Should we make the master branch backward compatible. References: Message-ID: Thu, 27 Jan 2011 20:46:22 -0700, Charles R Harris wrote: > Mark Wiebe has proposed making the master branch backward compatible > with 1.5. The argument for doing this is that 1) removing the new bits > for new releases is a chore as the refactor schedule slips and 2) the > new ABI isn't settled and keeping the current code in won't help with > the merge. Mark thinks it is possible to keep the datetime types along > with the new half types while restoring backward compatibility, and if > so we won't lose anything by making the change. I'm in favor of this > change, but I may have overlooked something. Thoughts? +1 from me, if that is possible. Pauli From pav at iki.fi Fri Jan 28 05:30:00 2011 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 28 Jan 2011 10:30:00 +0000 (UTC) Subject: [Numpy-discussion] Error in tanh for large complex argument References: Message-ID: Fri, 28 Jan 2011 11:25:19 +0100, Mark Bakker wrote: > I'll file a ticket. > > Incidentally, if tanh(z) is simply programmed as > > (1.0 - exp(-2.0*z)) / (1.0 + exp(-2.0*z)) This will overflow as z -> -\infty. The solution is probably to use a different expression for Re(z) < 0, and to check how other libraries do this in case the above still misses something. Pauli From markbak at gmail.com Fri Jan 28 05:45:09 2011 From: markbak at gmail.com (Mark Bakker) Date: Fri, 28 Jan 2011 11:45:09 +0100 Subject: [Numpy-discussion] Error in tanh for large complex argument Message-ID: Good point, so we need a better solution that fixes all cases >> I'll file a ticket. >> >> Incidentally, if tanh(z) is simply programmed as >> >> (1.0 - exp(-2.0*z)) / (1.0 + exp(-2.0*z)) >This will overflow as z -> -\infty. The solution is probably to use a >different expression for Re(z) < 0, and to check how other libraries do >this in case the above still misses something. > > Pauli -------------- next part -------------- An HTML attachment was scrubbed... URL: From markbak at gmail.com Fri Jan 28 05:49:34 2011 From: markbak at gmail.com (Mark Bakker) Date: Fri, 28 Jan 2011 11:49:34 +0100 Subject: [Numpy-discussion] incorrect behavior when complex number with zero imaginary part is multiplied by inf Message-ID: When I multiply a complex number with inf, I get inf + inf*j: In [17]: inf * (1+1j) Out[17]: (inf+inf*j) Even when the imaginary part is really small: In [18]: inf * (1+1e-100j) Out[18]: (inf+inf*j) Yet when the imaginary part is zero (and it really is a real number), the imaginary part is nan: In [19]: inf * (1+0j) Out[19]: (inf+nan*j) That is not correct. It should really given (inf+0*j). (I know where it comes from, inf*0 is not defined, but in this case it is, as 1+0j is really a real number and inf is by definition real as well). If there is consensus I can file a ticket. Mark -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From dagss at student.matnat.uio.no Fri Jan 28 06:37:33 2011 From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn) Date: Fri, 28 Jan 2011 12:37:33 +0100 Subject: [Numpy-discussion] Generator arrays In-Reply-To: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> References: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> Message-ID: <4D42AA7D.1000304@student.matnat.uio.no> On 01/28/2011 01:01 AM, Travis Oliphant wrote: > Just to start the conversation, and to find out who is interested, I would like to informally propose generator arrays for NumPy 2.0. This concept has as one use-case, the deferred arrays that Mark Wiebe has proposed. But, it also allows for "compressed arrays", on-the-fly computed arrays, and streamed or generated arrays. > > Basically, the modification I would like to make is to have an array flag (MEMORY) that when set means that the data attribute of a numpy array is a pointer to the address in memory where the data begins with the strides attribute pointing to a C-array of integers (in other words, all current arrays are MEMORY arrays) > > But, when the MEMORY flag is not set, the data attribute instead points to a length-2 C-array of pointers to functions > > [read(N, output_address, self->index_iter, self->extra), write(N, input_address, self->index_iter, self->extra)] > > Either of these could then be NULL (i.e. if write is NULL, then the array must be read-only). > > When the MEMORY flag is not set, the strides member of the ndarray structure is a pointer to the index_iter object (which could be anything that the particular read and write methods need it to be). > > The array structure should also get a member to hold the "extra" argument (which would hold any state that the array needed to hold on to in order to correctly perform the read or write operations --- i.e. it could hold an execution graph for deferred evaluation). > > The index_iter structure is anything that the read and write methods need to correctly identify *where* to write. Now, clearly, we could combine index_iter and extra into just one "structure" that holds all needed state for read and write to work correctly. The reason I propose two slots is because at least mentally in the use case of having these structures be calculation graphs, one of these structures is involved in "computing the location to read/write" and the other is involved in "computing what to read/write" > > The idea is fairly simple, but with some very interesting potential features: > > * lazy evaluation (of indexing, ufuncs, etc.) > * fancy indexing as views instead of copies (really just another example of lazy evaluation) > * compressed arrays > * generated arrays (from computation or streamed data) > * infinite arrays > * computed arrays > * missing-data arrays > * ragged arrays (shape would be the bounding box --- which makes me think of ragged arrays as examples of masked arrays). > * arrays that view PIL data. > > One could build an array with a (logically) infinite number of elements (we could use -2 in the shape tuple to indicate that). > > We don't need examples of all of these features for NumPy 2.0 to be released, because to really make this useful, we would need to modify all "calculation" code to produce a NON MEMORY array. What to do here still needs a lot of thought and experimentation. 
> > But, I can think about a situation where all NumPy calculations that produce arrays provide the option that when they are done inside of a particular context, a user-supplied behavior over-rides the default return. I want to study what Mark is proposing and understand his new iterator at a deeper level before providing more thoughts here. > > That's the gist of what I am thinking about. I would love feedback and comments. > I guess my reaction is along the lines of Charles': Why can't "a + b", where a and b are NumPy arrays, simply return an object of a different type that is lazily evaluated? Why can't infinite arrays simply be yet another type? Of course, much useful functionality should then be refactored into a new "abstract array" class, and iterators etc. be given an API that works with more than one type. A special-case flag and function pointers seems a bit like reinventing OO to me, and OO is already provided by Python. Dag Sverre From dagss at student.matnat.uio.no Fri Jan 28 06:43:24 2011 From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn) Date: Fri, 28 Jan 2011 12:43:24 +0100 Subject: [Numpy-discussion] Generator arrays In-Reply-To: <4D42AA7D.1000304@student.matnat.uio.no> References: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> <4D42AA7D.1000304@student.matnat.uio.no> Message-ID: <4D42ABDC.9060508@student.matnat.uio.no> On 01/28/2011 12:37 PM, Dag Sverre Seljebotn wrote: > On 01/28/2011 01:01 AM, Travis Oliphant wrote: > >> Just to start the conversation, and to find out who is interested, I would like to informally propose generator arrays for NumPy 2.0. This concept has as one use-case, the deferred arrays that Mark Wiebe has proposed. But, it also allows for "compressed arrays", on-the-fly computed arrays, and streamed or generated arrays. >> >> Basically, the modification I would like to make is to have an array flag (MEMORY) that when set means that the data attribute of a numpy array is a pointer to the address in memory where the data begins with the strides attribute pointing to a C-array of integers (in other words, all current arrays are MEMORY arrays) >> >> But, when the MEMORY flag is not set, the data attribute instead points to a length-2 C-array of pointers to functions >> >> [read(N, output_address, self->index_iter, self->extra), write(N, input_address, self->index_iter, self->extra)] >> >> Either of these could then be NULL (i.e. if write is NULL, then the array must be read-only). >> >> When the MEMORY flag is not set, the strides member of the ndarray structure is a pointer to the index_iter object (which could be anything that the particular read and write methods need it to be). >> >> The array structure should also get a member to hold the "extra" argument (which would hold any state that the array needed to hold on to in order to correctly perform the read or write operations --- i.e. it could hold an execution graph for deferred evaluation). >> >> The index_iter structure is anything that the read and write methods need to correctly identify *where* to write. Now, clearly, we could combine index_iter and extra into just one "structure" that holds all needed state for read and write to work correctly. 
The reason I propose two slots is because at least mentally in the use case of having these structures be calculation graphs, one of these structures is involved in "computing the location to read/write" and the other is involved in "computing what to read/write" >> >> The idea is fairly simple, but with some very interesting potential features: >> >> * lazy evaluation (of indexing, ufuncs, etc.) >> * fancy indexing as views instead of copies (really just another example of lazy evaluation) >> * compressed arrays >> * generated arrays (from computation or streamed data) >> * infinite arrays >> * computed arrays >> * missing-data arrays >> * ragged arrays (shape would be the bounding box --- which makes me think of ragged arrays as examples of masked arrays). >> * arrays that view PIL data. >> >> One could build an array with a (logically) infinite number of elements (we could use -2 in the shape tuple to indicate that). >> >> We don't need examples of all of these features for NumPy 2.0 to be released, because to really make this useful, we would need to modify all "calculation" code to produce a NON MEMORY array. What to do here still needs a lot of thought and experimentation. >> >> But, I can think about a situation where all NumPy calculations that produce arrays provide the option that when they are done inside of a particular context, a user-supplied behavior over-rides the default return. I want to study what Mark is proposing and understand his new iterator at a deeper level before providing more thoughts here. >> >> That's the gist of what I am thinking about. I would love feedback and comments. >> >> > I guess my reaction is along the lines of Charles': Why can't "a + b", > where a and b are NumPy arrays, simply return an object of a different > type that is lazily evaluated? Why can't infinite arrays simply be yet > another type? > > Of course, much useful functionality should then be refactored into a > new "abstract array" class, and iterators etc. be given an API that > works with more than one type. > > A special-case flag and function pointers seems a bit like reinventing > OO to me, and OO is already provided by Python. > Whoops. I spend too much time with Cython. Cython provides this kind of (fast, C-level) OO, but not Python. Sorry! Dag Sverre From markbak at gmail.com Fri Jan 28 06:57:18 2011 From: markbak at gmail.com (Mark Bakker) Date: Fri, 28 Jan 2011 12:57:18 +0100 Subject: [Numpy-discussion] Error in tanh for large complex argument In-Reply-To: References: Message-ID: Follow up: The behavior is correct for real argument: In [20]: sinh(1000) Out[20]: inf In [21]: cosh(1000) Out[21]: inf In [22]: tanh(1000) Out[22]: 1.0 So maybe we should look there for good logic, Mark On Fri, Jan 28, 2011 at 11:45 AM, Mark Bakker wrote: > Good point, so we need a better solution that fixes all cases > > > >> I'll file a ticket. > >> > >> Incidentally, if tanh(z) is simply programmed as > >> > >> (1.0 - exp(-2.0*z)) / (1.0 + exp(-2.0*z)) > > >This will overflow as z -> -\infty. The solution is probably to use a > >different expression for Re(z) < 0, and to check how other libraries do > >this in case the above still misses something. > > > > > Pauli > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From pav at iki.fi Fri Jan 28 07:07:06 2011 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 28 Jan 2011 12:07:06 +0000 (UTC) Subject: [Numpy-discussion] Error in tanh for large complex argument References: Message-ID: Fri, 28 Jan 2011 12:57:18 +0100, Mark Bakker wrote: > Follow up: > > The behavior is correct for real argument: [clip] > So maybe we should look there for good logic, In the real case you can do "if (abs(z) > cutoff) return sgn(z)", which is not the right thing to do for complex numbers. Anyway, Python's cmath functions correctly, so I'm first going to look there, and then at the glibc/gfortran implementation. Pauli From ralf.gommers at googlemail.com Fri Jan 28 07:26:46 2011 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Fri, 28 Jan 2011 20:26:46 +0800 Subject: [Numpy-discussion] Numpy 2.0 schedule In-Reply-To: References: Message-ID: On Fri, Jan 28, 2011 at 1:36 AM, Charles R Harris wrote: > > > On Thu, Jan 27, 2011 at 9:17 AM, Mark Wiebe wrote: > >> On Thu, Jan 27, 2011 at 7:09 AM, Ralf Gommers < >> ralf.gommers at googlemail.com> wrote: >> >>> >>> The PIL test can still be fixed before the final 0.9.0 release, it looks >>> like we will need another RC anyway. Does anyone have time for this in the >>> next few days? >>> >> >> I've attached a patch which fixes it for me. >> > Thanks, I'll check and apply it. > >> >>> I took a shot at fixing the ABI compatibility, and if PyArray_ArrFunc >>>>>>> was the main issue, then that might be done. An ABI compatible 1.6 with the >>>>>>> datetime and half types should be doable, just some extensions might get >>>>>>> confused if they encounter arrays made with the new data types. >>>>>>> >>>>>>> Even if you fixed the ABI incompatibility (I don't know enough about >>>>>> the issue to confirm that), I'm not sure how much value there is in a >>>>>> release with as main new feature two dtypes that are not going to work well >>>>>> with scipy/other binaries compiled against 1.5. >>>>>> >>>>> >>>>> I've recently gotten the faster ufunc NEP implementation finished >>>>> except for generalized ufuncs, and most things work the same or faster with >>>>> it. Below are some timings of 1.5.1 vs the new_iterator branch. In >>>>> particular, the overhead on small arrays hasn't gotten worse, but the output >>>>> memory layout speeds up some operations by a lot. >>>>> >>>>> Your new additions indeed look quite promising. I tried your >>> new_iterator branch but ran into a segfault immediately on running the tests >>> on OS X. I opened a ticket for it, to not mix it into this discussion about >>> releases too much: http://projects.scipy.org/numpy/ticket/1724. >>> >> >> Is that a non-Intel platform? While I tried to get aligned access right, >> it's likely there's a bug in it somewhere. >> > No, standard Intel and i386 Python. > >> Before we decide on a 1.6 release I would suggest to do at least the >>> following: >>> - review of ABI fixes by someone very familiar with the problem that >>> occurred in 1.4.0 (David, Pauli, Charles?) >>> - test on Linux, OS X and Windows 32-bit and 64-bit. Also with an MSVC >>> build on Windows, since that exposes more issues each release. >>> >> >> All tests pass for me now, maybe it's a good time to merge the branch into >> the trunk so we can run it on the buildbot? >> >> > Might be better to merge your unadulterated stuff into master, make a 1.6 > branch, and add the compatibility fixes in the branch. 
You can test branches > on the buildbot I think, at least that worked for svn, I haven't tried it > with github. > > The buildbot is not working with github yet. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From jbednar at inf.ed.ac.uk Fri Jan 28 08:29:55 2011 From: jbednar at inf.ed.ac.uk (James A. Bednar) Date: Fri, 28 Jan 2011 13:29:55 +0000 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: References: Message-ID: <19778.50387.907465.111907@cortex.inf.ed.ac.uk> | Date: Thu, 27 Jan 2011 16:32:22 -0700 | From: Charles R Harris | | On Thu, Jan 27, 2011 at 4:23 PM, Robert Kern wrote: | | > On Thu, Jan 27, 2011 at 17:17, Travis Oliphant wrote: | > | > > Hey all, | > > | > > What is the thought about having two separate NumPy lists (one | > > for development discussions and one for user discussions)? | > | > We've resisted it for years. I don't think the split has done | > scipy much good. But that may just be my perspective because I'm | > subscribed to both and filter them both to the same folder. | | Me too. I don't think there is so much traffic that a distinction | needs to be made. I'm subscribed to the numpy digest, and I have 8 digest emails from yesterday (27 January), i.e. one single day, sitting in my inbox. These 8 digests represent who knows how many separate emails. If that is not heavy traffic, I really wouldn't know what is! As someone who uses numpy heavily (I manage a large numpy-based software project) but is not a numpy developer, I would very much appreciate having a separate user list. I can't bring myself to unsubscribe from the current list, for fear of not noticing some important new features, related packages, or serious issues, but sorting out those things from the rest of the posts does take significant work. None of my actual developers subscribe any more, as they found the volume of posts overwhelming, so I've sacrificed myself so that I can try to notice anything important and bring it to their attention. Anything that would help that would be greatly appreciated! Jim -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. From josef.pktd at gmail.com Fri Jan 28 08:56:05 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 28 Jan 2011 08:56:05 -0500 Subject: [Numpy-discussion] Developer NumPy list versus User NumPy list In-Reply-To: <19778.50387.907465.111907@cortex.inf.ed.ac.uk> References: <19778.50387.907465.111907@cortex.inf.ed.ac.uk> Message-ID: On Fri, Jan 28, 2011 at 8:29 AM, James A. Bednar wrote: > | ?Date: Thu, 27 Jan 2011 16:32:22 -0700 > | ?From: Charles R Harris > | > | ?On Thu, Jan 27, 2011 at 4:23 PM, Robert Kern wrote: > | > | ?> On Thu, Jan 27, 2011 at 17:17, Travis Oliphant wrote: > | ?> > | ?> > Hey all, > | ?> > > | ?> > What is the thought about having two separate NumPy lists (one > | ?> > for development discussions and one for user discussions)? > | ?> > | ?> We've resisted it for years. I don't think the split has done > | ?> scipy much good. But that may just be my perspective because I'm > | ?> subscribed to both and filter them both to the same folder. > | > | ?Me too. I don't think there is so much traffic that a distinction > | ?needs to be made. > > I'm subscribed to the numpy digest, and I have 8 digest emails from > yesterday (27 January), i.e. one single day, sitting in my inbox. > These 8 digests represent who knows how many separate emails. 
?If that > is not heavy traffic, I really wouldn't know what is! > > As someone who uses numpy heavily (I manage a large numpy-based > software project) but is not a numpy developer, I would very much > appreciate having a separate user list. ?I can't bring myself to > unsubscribe from the current list, for fear of not noticing some > important new features, related packages, or serious issues, but > sorting out those things from the rest of the posts does take > significant work. ?None of my actual developers subscribe any more, as > they found the volume of posts overwhelming, so I've sacrificed myself > so that I can try to notice anything important and bring it to their > attention. ?Anything that would help that would be greatly > appreciated! Maybe a digest is not the best way to screen the messages. In threaded view (in gmail reader or Thunderbird) I have 2 to 5 threads a day in the last half month from the numpy mailing list, so I find it easy to screen threads. I think quite a bit of user traffic for numpy has moved to http://stackoverflow.com/questions/tagged/numpy and to me it looks like the mailing list gets mostly the "heavier" questions. Josef > > Jim > > -- > The University of Edinburgh is a charitable body, registered in > Scotland, with registration number SC005336. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From asmi.capri at gmail.com Fri Jan 28 10:01:36 2011 From: asmi.capri at gmail.com (Asmi Shah) Date: Fri, 28 Jan 2011 16:01:36 +0100 Subject: [Numpy-discussion] create a numpy array of images Message-ID: Hi guys, I am using python for a while now and I have a requirement of creating a numpy array of microscopic tiff images ( this data is 3d, meaning there are 100 z slices of 512 X 512 pixels.) How can I create an array of images? i then would like to use visvis for visualizing this in 3D. any help is highly appreciated to get me started.. Thanks,.. -- Regards, Asmi Shah -------------- next part -------------- An HTML attachment was scrubbed... URL: From bsouthey at gmail.com Fri Jan 28 10:19:55 2011 From: bsouthey at gmail.com (Bruce Southey) Date: Fri, 28 Jan 2011 09:19:55 -0600 Subject: [Numpy-discussion] Should we make the master branch backward compatible. In-Reply-To: References: Message-ID: <4D42DE9B.8030403@gmail.com> On 01/27/2011 10:58 PM, David Cournapeau wrote: > On Fri, Jan 28, 2011 at 12:46 PM, Charles R Harris > wrote: >> Hi All, >> >> Mark Wiebe has proposed making the master branch backward compatible with >> 1.5. The argument for doing this is that 1) removing the new bits for new >> releases is a chore as the refactor schedule slips and 2) the new ABI isn't >> settled and keeping the current code in won't help with the merge. Mark >> thinks it is possible to keep the datetime types along with the new half >> types while restoring backward compatibility, and if so we won't lose >> anything by making the change. I'm in favor of this change, but I may have >> overlooked something. Thoughts? > I would be in favor too, but having not being able to code much in > numpy the last few months, my opinion should not carry too much > weight. 
I don't know how many people install numpy from github > nowadays (which are the first "victims" when ABI breaks) > > cheers, > > David It is important to hear from people like Keith that build upon numpy and those that build numpy binaries for distribution especially Windows and non-gcc stuff like Intel's compilers and MKL. So while I do count less but I am in favor of it provided that scipy can build and run correctly with this new numpy. Bruce From pav at iki.fi Fri Jan 28 10:23:09 2011 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 28 Jan 2011 15:23:09 +0000 (UTC) Subject: [Numpy-discussion] incorrect behavior when complex number with zero imaginary part is multiplied by inf References: Message-ID: Fri, 28 Jan 2011 11:49:34 +0100, Mark Bakker wrote: [clip] > Yet when the imaginary part is zero (and it really is a real number), > the imaginary part is nan: > > In [19]: inf * (1+0j) > Out[19]: (inf+nan*j) > > That is not correct. It should really given (inf+0*j). (I know where it > comes from, inf*0 is not defined, but in this case it is, as 1+0j is > really a real number and inf is by definition real as well). > > If there is consensus I can file a ticket. Both behaviors are accepted by the C99 standard: all combinations where one entry is `+-inf` are equivalent to the complex infinity. gcc itself returns `inf-1j*nan`. gfortran returns `inf+1j*nan`. A good rationale for the present behavior is that there is no way to know that 1+0j is supposed to be real; it could as well be a number too small to represent (eg. result from an underflow in the imaginary part), in which case `nan` is indeed the correct result. -- Pauli Virtanen From xscript at gmx.net Fri Jan 28 10:25:45 2011 From: xscript at gmx.net (=?utf-8?Q?Llu=C3=ADs?=) Date: Fri, 28 Jan 2011 16:25:45 +0100 Subject: [Numpy-discussion] Generator arrays In-Reply-To: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> (Travis Oliphant's message of "Thu, 27 Jan 2011 18:01:27 -0600") References: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> Message-ID: <87mxmldp8m.fsf@ginnungagap.bsc.es> Travis Oliphant writes: > This concept has as one use-case, the deferred arrays that Mark Wiebe > has proposed. Interesting, I didn't read about that. In fact, I was playing around with a proxy wrapper for ndarrays not long ago, in order to build a tree of deferred operations that can be later optimized through numexpr once __str__ or __repr__ is called on such a deferred object. The idea was to have something like: a = np.array(...) a = defereval(a) # returns a proxy wrapper for known methods of np.ndarray b = 10 + a ** 2 print a # here the tree of deferred operations is flattened # into a string that numpexpr can use I didn't play much with it, but proxying all methods but __str__ and __repr__ (thus iterating on the original a.__dict__) seemed to suffice. The benefits I see of building this into ndarray itself is that ndarray would then be the hourglass waist of the framework. Subclassing ndarray is moderately complex right now, so I think that having a way to move some of these subclasses below the hourglass waist and not having to deal with the overloading of ndarray's UI would be a big step forward towards extension code simplicity. 
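To make the defereval idea above a bit more concrete, here is a rough
sketch of such a proxy (only a couple of operators handled, scalar
operands only, and assuming numexpr is available; everything here is
purely illustrative):

    import numpy as np
    import numexpr as ne

    class Deferred(object):
        # wraps an ndarray and records operations as a numexpr string;
        # nothing is computed until the result is actually needed
        def __init__(self, arr, expr='a'):
            self._arr = arr
            self._expr = expr
        def __add__(self, other):
            return Deferred(self._arr, '(%s + %r)' % (self._expr, other))
        __radd__ = __add__
        def __pow__(self, other):
            return Deferred(self._arr, '(%s ** %r)' % (self._expr, other))
        def evaluate(self):
            return ne.evaluate(self._expr, local_dict={'a': self._arr})
        def __repr__(self):
            return repr(self.evaluate())

    a = Deferred(np.arange(1000000.0))
    b = 10 + a ** 2     # no work done yet
    print b             # "((a ** 2) + 10)" is handed to numexpr here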
So, having near-zero knowledge on the internals of numpy and all new features that have been discussed here, my naive view of what the stack should contain is: * ndarray subclasses Overload indexing (e.g., data_array's named dimension elements), translating any fancy indexing into ndarray's "native" indexing methods Overload user representation (e.g., show some extra info when printing an array) * ndarray slicing and numeric operations A central point for slicing/indexing (the output should be either views or copies) A central point to control the deferral of operations (both native and extensions - see below -). In fact, I see deferred operations as just a form of copy-on-write/evaluate-on-access views (COW must be used when one of the input operands of a deferred tree of operations is modified after capturing it into such a tree). * numeric operations extensions Numeric operations should be first-class if deferred operation evaluation is to be taken to its highest potential, and thus they should be aware of an "operation evaluation engine" (as well as the other way around). If they are not (and they should be able not to be), two things can happen: - for those based only on first-class operations, it is just the root of a subtree - if more complex operations are performed (explicit looping?), they simply diminish the range of possibilities of optimizing opearation evaluation (actually producing multiple evaluation trees, or maybe simply forcing evaluation). * operation evaluation engine This would take care of evaluating the operation tree, while performing optimizations on it. Fortunately, if a sensible interface is established between this and first-class numeric operations, a first implementation can provide just the naive evaluation, and further optimizations can be provided behind the scenes. Such optimizations would provide things like operation tree simplification/reorganization, blocking (a la numexpr) and parallellization of computations. * storage access extensions Slicing in ndarray should be aware of objects represented by means other than "plain strided memory buffers": e.g., the compressed array case (where decompression could be treated with a sliding window), or deferred operation evaluation itself. In fact, as you pointed of with the MEMORY flag, both storage and operation evaluation can be subject to the common concept of deferral (accessing a compressed array is just another form of accessing computed contents, like accessing elements on a deferred array). I just hope they're all not just obvious observations of what has already been said. Lluis PS: sorry for the unnecessarily long mail -- "And it's much the same thing with knowledge, for whenever you learn something new, the whole world becomes that much richer." -- The Princess of Pure Reason, as told by Norton Juster in The Phantom Tollbooth From friedrichromstedt at gmail.com Fri Jan 28 13:23:15 2011 From: friedrichromstedt at gmail.com (Friedrich Romstedt) Date: Fri, 28 Jan 2011 19:23:15 +0100 Subject: [Numpy-discussion] Strange behaviour of numpy.asarray() in corner case Message-ID: Python 2.6.6 (r266:84374, Aug 31 2010, 11:00:51) [GCC 4.0.1 (Apple Inc. build 5493)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> import numpy >>> numpy.__version__ '1.5.1' >>> class X: ... pass ... >>> numpy.asarray([X(), numpy.asarray([1, 1])]).shape (2,) >>> numpy.asarray([numpy.asarray([1, 1]), X()]).shape () >>> I would expect (2,) in the second case too. 
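Building the object array explicitly does behave as I would expect, so
the problem seems to be in asarray's shape discovery rather than in
object arrays themselves:

>>> a = numpy.empty(2, dtype=object)
>>> a[0] = numpy.asarray([1, 1])
>>> a[1] = X()
>>> a.shape
(2,)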
The following works ok: >>> numpy.asarray([1, X()]).shape (2,) >>> numpy.asarray([X(), 1]).shape (2,) Friedrich From nadavh at visionsense.com Fri Jan 28 13:44:07 2011 From: nadavh at visionsense.com (Nadav Horesh) Date: Fri, 28 Jan 2011 10:44:07 -0800 Subject: [Numpy-discussion] Error in tanh for large complex argument In-Reply-To: References: Message-ID: <26FC23E7C398A64083C980D16001012D1AD93B9421@VA3DIAXVS361.RED001.local> A brief history: I wrote the asinh and acosh functions for the math (or was it cmath?) for python 2.0. It fixed some problems of GVR implementation, but still it was far from perfect, and replaced shortly after. My 1/4 cent tip: Do not rush --- find a good code. Nadav ________________________________ From: numpy-discussion-bounces at scipy.org [numpy-discussion-bounces at scipy.org] On Behalf Of Mark Bakker [markbak at gmail.com] Sent: 28 January 2011 12:45 To: numpy-discussion at scipy.org Subject: Re: [Numpy-discussion] Error in tanh for large complex argument Good point, so we need a better solution that fixes all cases >> I'll file a ticket. >> >> Incidentally, if tanh(z) is simply programmed as >> >> (1.0 - exp(-2.0*z)) / (1.0 + exp(-2.0*z)) >This will overflow as z -> -\infty. The solution is probably to use a >different expression for Re(z) < 0, and to check how other libraries do >this in case the above still misses something. > > Pauli -------------- next part -------------- An HTML attachment was scrubbed... URL: From Chris.Barker at noaa.gov Fri Jan 28 13:57:43 2011 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 28 Jan 2011 10:57:43 -0800 Subject: [Numpy-discussion] create a numpy array of images In-Reply-To: References: Message-ID: <4D4311A7.20007@noaa.gov> On 1/28/11 7:01 AM, Asmi Shah wrote: > I am using python for a while now and I have a requirement of creating a > numpy array of microscopic tiff images ( this data is 3d, meaning there are > 100 z slices of 512 X 512 pixels.) How can I create an array of images? It's quite straightforward to create a 3-d array to hold this kind of data: image_block = np.empty((100, 512, 512), dtype=??) now you can load it up by using some lib (PIL, or ???) to load the tif images, and then: for i in images: image_block[i,:,:] = i note that I put dtype to ??? up there. What dtype you want is dependent on what's in the tiff images -- tiff can hold just about anything. So if they are say, 16 bit greyscale, you'd want: dtype=np.uint16 if they are 24 bit rgb, you might want a custom dtype (I don't think there is a 24 bit dtype built in): RGB_type = np.dtype([('r',np.uint8),('g',np.uint8),('b',np.uint8)]) for 32 bit rgba, you can use the same approach, or just a 32 bit integer. The cool thing is that you can make views of this array with different dtypes, depending on what's easiest for the given use case. You can even break out the rgb parts into different axis: image_block = np.empty((100, 512, 512), dtype=RGB_type) image_block_rgb=image_block.view(dtype=np.uint8).reshape((100,512,512,3)) The two arrays now share the same data block, but you can look at them differently. I think this a really cool feature of numpy. > i then would like to use visvis for visualizing this in 3D. you'll have to see what visvis is expecting in terms of data types, etc. HTH, -Chris -- Christopher Barker, Ph.D. 
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From kanterburiad at gmail.com Fri Jan 28 16:03:17 2011 From: kanterburiad at gmail.com (kant erburiad) Date: Fri, 28 Jan 2011 22:03:17 +0100 Subject: [Numpy-discussion] SWIG examples from the Cookbook Message-ID: Hi, I'd like to ask for your help regarding the use of SWIG with numpy. ** problem description ** While I can compile successfully the examples provided in ./numpy/doc/swig I can't compile the first example provided in the Cookbook. http://www.scipy.org/Cookbook/SWIG_NumPy_examples "A simple ARGOUT_ARRAY1 example" I have this error: ezrange_wrap.c: In function ?SWIG_AsVal_long?: ezrange_wrap.c:3227: error: initializer element is not constant ** hypothesis and ugly workaround ** Comparing the examples from ./numpy/doc/swig and from the Cookbook, I noticed the main difference lies in the fact the first are in C++ while the latter in plain C. I "converted" the example from the Coobook to look like a c++ project. Essentially I renamed the .c file with a .cxx extension and I modified setup.py accordingly. This time it compiles successfully and the module is usable in python. ** questions ** 1/ What should I do to compile the Cookbook example with a C compiler? 2/ If it appears swig is now only compatible with the C++ compiler, do you have a practical workaround to propose? (without renaming the C files with a .cxx extension) ? ** configuration ** debian testing i686 swig version 1.3.40 python-numpy version 1.5.1 (I also tried 1.4.1) python version 2.6 I hope I explained my problem clearly enough. Tell me if you need more details. Regards, KB -------------- next part -------------- An HTML attachment was scrubbed... URL: From oliphant at enthought.com Fri Jan 28 16:18:46 2011 From: oliphant at enthought.com (Travis Oliphant) Date: Fri, 28 Jan 2011 15:18:46 -0600 Subject: [Numpy-discussion] Generator arrays In-Reply-To: <87mxmldp8m.fsf@ginnungagap.bsc.es> References: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> <87mxmldp8m.fsf@ginnungagap.bsc.es> Message-ID: <1FD06257-33BD-4F8C-88A8-5BBB9A3B16F1@enthought.com> Thanks for the long email. I think there are a lot of thoughts around some of these ideas and it is good to get as many of them articulated as possible. I learn much from these kinds of discussions. I think others value them as well. I like your ideas about what kind of overloading hooks, subclasses of ndarray's should really be allowed to over-write. One thing I didn't talk about in my previous long email, was the re-organization of calculation functions that needs to happen. I really think that the ufunc concept needs to be broadened so that all function-pointers that are currently attached to data-type objects can be handled under the same calculation super-structure. This re-factoring would go a long way into cementing what kind of API is needed for different "array objects". I am persuaded that improving framework for vectorized calculations which allow for any array-like objects (objects satisfying a certain protocol or API) is a better approach then altering the nice map of ndarray to in-memory data. Then, deferred arrays, masked arrays, computed arrays, and other array-like objects could provide protocols and APIs (and callbacks) that satisfy this general calculation structure. This kind of generalization is probably more useful than changes to the array object itself. 
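Very roughly, and only as a Python-level toy of the kind of protocol I
have in mind (the real structure would of course live in C, and all the
names below are invented):

    import numpy as np

    def apply_unary(func, src, blocksize=4096):
        # works on anything exposing .shape, .dtype and .read(start, count),
        # whether the data lives in memory, is compressed, or is computed
        n = int(np.prod(src.shape))
        out = np.empty(n, dtype=src.dtype)
        for start in range(0, n, blocksize):
            count = min(blocksize, n - start)
            out[start:start + count] = func(src.read(start, count))
        return out.reshape(src.shape)

    class ComputedSquares(object):
        # a "computed array" satisfying the toy protocol
        shape = (1000000,)
        dtype = np.dtype(np.float64)
        def read(self, start, count):
            i = np.arange(start, start + count, dtype=np.float64)
            return i * i

    result = apply_unary(np.sqrt, ComputedSquares())

The point is that the calculation machinery only ever talks to the
protocol, never to a particular memory layout.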
But, it's also hard and I'm not entirely sure what that structure should be. I'm looking forward to thoughts in this direction and looking more closely at what Mark has done with writing ufuncs as wrappers around his iterators. I'm concerned that his iterators don't support the generalized ufunc interface that I really like and was hoping would provide the abstraction needed to allow all the functions currently attached to dtypes (searchsorted, etc.) to be incorporated in the generalized calculation structure. -Travis On Jan 28, 2011, at 9:25 AM, Llu?s wrote: > Travis Oliphant writes: > >> This concept has as one use-case, the deferred arrays that Mark Wiebe >> has proposed. > > Interesting, I didn't read about that. > > In fact, I was playing around with a proxy wrapper for ndarrays not long > ago, in order to build a tree of deferred operations that can be later > optimized through numexpr once __str__ or __repr__ is called on such a > deferred object. The idea was to have something like: > > a = np.array(...) > a = defereval(a) # returns a proxy wrapper for known methods of np.ndarray > b = 10 + a ** 2 > print a # here the tree of deferred operations is flattened > # into a string that numpexpr can use > > I didn't play much with it, but proxying all methods but __str__ and > __repr__ (thus iterating on the original a.__dict__) seemed to suffice. > > > The benefits I see of building this into ndarray itself is that ndarray > would then be the hourglass waist of the framework. > > Subclassing ndarray is moderately complex right now, so I think that > having a way to move some of these subclasses below the hourglass waist > and not having to deal with the overloading of ndarray's UI would be a > big step forward towards extension code simplicity. > > So, having near-zero knowledge on the internals of numpy and all new > features that have been discussed here, my naive view of what the stack > should contain is: > > * ndarray subclasses > > Overload indexing (e.g., data_array's named dimension elements), > translating any fancy indexing into ndarray's "native" indexing > methods > > Overload user representation (e.g., show some extra info when printing > an array) > > * ndarray slicing and numeric operations > > A central point for slicing/indexing (the output should be either > views or copies) > > A central point to control the deferral of operations (both native and > extensions - see below -). In fact, I see deferred operations as just > a form of copy-on-write/evaluate-on-access views (COW must be used > when one of the input operands of a deferred tree of operations is > modified after capturing it into such a tree). > > * numeric operations extensions > > Numeric operations should be first-class if deferred operation > evaluation is to be taken to its highest potential, and thus they > should be aware of an "operation evaluation engine" (as well as the > other way around). > > If they are not (and they should be able not to be), two things can > happen: > > - for those based only on first-class operations, it is just the root > of a subtree > > - if more complex operations are performed (explicit looping?), they > simply diminish the range of possibilities of optimizing opearation > evaluation (actually producing multiple evaluation trees, or maybe > simply forcing evaluation). > > * operation evaluation engine > > This would take care of evaluating the operation tree, while > performing optimizations on it. 
> > Fortunately, if a sensible interface is established between this and > first-class numeric operations, a first implementation can provide > just the naive evaluation, and further optimizations can be provided > behind the scenes. > > Such optimizations would provide things like operation tree > simplification/reorganization, blocking (a la numexpr) and > parallellization of computations. > > * storage access extensions > > Slicing in ndarray should be aware of objects represented by means > other than "plain strided memory buffers": e.g., the compressed array > case (where decompression could be treated with a sliding window), or > deferred operation evaluation itself. > > In fact, as you pointed of with the MEMORY flag, both storage and > operation evaluation can be subject to the common concept of deferral > (accessing a compressed array is just another form of accessing > computed contents, like accessing elements on a deferred array). > > > I just hope they're all not just obvious observations of what has > already been said. > > > Lluis > > PS: sorry for the unnecessarily long mail > > -- > "And it's much the same thing with knowledge, for whenever you learn > something new, the whole world becomes that much richer." > -- The Princess of Pure Reason, as told by Norton Juster in The Phantom > Tollbooth > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion --- Travis Oliphant Enthought, Inc. oliphant at enthought.com 1-512-536-1057 http://www.enthought.com From mwwiebe at gmail.com Fri Jan 28 17:40:33 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Fri, 28 Jan 2011 14:40:33 -0800 Subject: [Numpy-discussion] Generator arrays In-Reply-To: <1FD06257-33BD-4F8C-88A8-5BBB9A3B16F1@enthought.com> References: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> <87mxmldp8m.fsf@ginnungagap.bsc.es> <1FD06257-33BD-4F8C-88A8-5BBB9A3B16F1@enthought.com> Message-ID: On Fri, Jan 28, 2011 at 1:18 PM, Travis Oliphant wrote: > > Just to start the conversation, and to find out who is interested, I would > like to informally propose generator arrays for NumPy 2.0. This concept > has as one use-case, the deferred arrays that Mark Wiebe has proposed. But, > it also allows for "compressed arrays", on-the-fly computed arrays, and > streamed or generated arrays. > I like the idea, it could work very well. It's also a bit risky, in the sense that if the design isn't right in the end it could be overly complicated or perform poorly. The design will need time to bake. > Basically, the modification I would like to make is to have an array flag > (MEMORY) that when set means that the data attribute of a numpy array is a > pointer to the address in memory where the data begins with the strides > attribute pointing to a C-array of integers (in other words, all current > arrays are MEMORY arrays) > > But, when the MEMORY flag is not set, the data attribute instead points to > a length-2 C-array of pointers to functions > > [read(N, output_address, self->index_iter, self->extra), write(N, > input_address, self->index_iter, self->extra)] > > Either of these could then be NULL (i.e. if write is NULL, then the array > must be read-only). > > When the MEMORY flag is not set, the strides member of the ndarray > structure is a pointer to the index_iter object (which could be anything > that the particular read and write methods need it to be). 
> The details will have to be worked out, but one additional thing the deferred calculation needs is a way to re-enable writing to arrays that are referenced by a deferred calculation. For finite non-MEMORY arrays, a common operation will probably be to convert them in-place into MEMORY arrays. The array structure should also get a member to hold the "extra" argument > (which would hold any state that the array needed to hold on to in order to > correctly perform the read or write operations --- i.e. it could hold an > execution graph for deferred evaluation). > > The index_iter structure is anything that the read and write methods need > to correctly identify *where* to write. Now, clearly, we could combine > index_iter and extra into just one "structure" that holds all needed state > for read and write to work correctly. The reason I propose two slots is > because at least mentally in the use case of having these structures be > calculation graphs, one of these structures is involved in "computing the > location to read/write" and the other is involved in "computing what to > read/write" > There are many ways one may want values to be gotten or put - dense array data, values where a boolean mask has true values, a flat array with values from specified arbitrary coordinates. Some data sources may only support a subset of the possibilities, and others may be able to support very fast arbitrary access. There probably needs to be a hierarchy of access methods, similar to C++ STL iterators. The idea is fairly simple, but with some very interesting potential > features: > > * lazy evaluation (of indexing, ufuncs, etc.) > * fancy indexing as views instead of copies (really just another > example of lazy evaluation) > * compressed arrays > * generated arrays (from computation or streamed data) > * infinite arrays > * computed arrays > * missing-data arrays > * ragged arrays (shape would be the bounding box --- which makes me > think of ragged arrays as examples of masked arrays). > * arrays that view PIL data. > > One could build an array with a (logically) infinite number of elements (we > could use -2 in the shape tuple to indicate that). > The infinite shape reminds me of D's ranges. It's also getting into a territory where specifying a bounding box for the data makes more sense than just a shape. For just 1D data, you have [lower, upper], (-inf, upper], [lower, +inf), and (-inf, inf) cases. One thing I didn't talk about in my previous long email, was the > re-organization of calculation functions that needs to happen. I really > think that the ufunc concept needs to be broadened so that all > function-pointers that are currently attached to data-type objects can be > handled under the same calculation super-structure. > I like this approach since it's going in a more library-oriented direction. The generic style of programming popularized by STL can provide good guidance to design this. Even if C doesn't support templates, designing orthogonal interfaces for calculation and iteration is still possible. The design will also need layering. At it's simplest, one should be able to specify a ufunc with just a "double calc(double, double)" function and one simple object creation function call. At the same time, one should be able to specialize the calculation function to do inner contiguous loops, accumulation loops, or use SSE to get big speed improvements. > > This re-factoring would go a long way into cementing what kind of API is > needed for different "array objects". 
I am persuaded that improving > framework for vectorized calculations which allow for any array-like objects > (objects satisfying a certain protocol or API) is a better approach then > altering the nice map of ndarray to in-memory data. > The interface to the iterator already is general enough that, for example, a "compressed array" iterator could provide the same interface to client code. This could be generalized further as needed. Francesc noticed this when I modified the numexpr code. Then, deferred arrays, masked arrays, computed arrays, and other array-like > objects could provide protocols and APIs (and callbacks) that satisfy this > general calculation structure. This kind of generalization is probably > more useful than changes to the array object itself. > > But, it's also hard and I'm not entirely sure what that structure should > be. I'm looking forward to thoughts in this direction and looking more > closely at what Mark has done with writing ufuncs as wrappers around his > iterators. I'm concerned that his iterators don't support the generalized > ufunc interface that I really like and was hoping would provide the > abstraction needed to allow all the functions currently attached to dtypes > (searchsorted, etc.) to be incorporated in the generalized calculation > structure. > Think of the iterator as a tool just for accessing some or all of the elements in an array or arrays. The ufunc interfaces for generic functions, reductions, accumulations, and the generalized ufunc simply use the iterator to access the data, the iterator itself doesn't constrain them. Particularly the op_axes parameter turned out to be flexible in a way which made these multitude of different views of the data for each calculation possible. Cheers, Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwwiebe at gmail.com Fri Jan 28 18:37:20 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Fri, 28 Jan 2011 15:37:20 -0800 Subject: [Numpy-discussion] Should we make the master branch backward compatible. In-Reply-To: <4D42DE9B.8030403@gmail.com> References: <4D42DE9B.8030403@gmail.com> Message-ID: Does anyone have any objections to me merging the branch into the numpy trunk right now? Chuck suggested I try to split out the ABI changes, but they're kind of tangled with the other changes. In particular, they involve fixing the type promotion code to be enum order-independent, which depended on some changes done for the iterator buffering code. After the key ABI fixes, there are then changes to a fair bit of code to make both numpy and scipy pass their tests. Anyways, editing that history at all feels like a bit of a quagmire, so I'd sooner just go ahead and do the merge. Cheers, Mark On Fri, Jan 28, 2011 at 7:19 AM, Bruce Southey wrote: > On 01/27/2011 10:58 PM, David Cournapeau wrote: > > On Fri, Jan 28, 2011 at 12:46 PM, Charles R Harris > > wrote: > >> Hi All, > >> > >> Mark Wiebe has proposed making the master branch backward compatible > with > >> 1.5. The argument for doing this is that 1) removing the new bits for > new > >> releases is a chore as the refactor schedule slips and 2) the new ABI > isn't > >> settled and keeping the current code in won't help with the merge. Mark > >> thinks it is possible to keep the datetime types along with the new half > >> types while restoring backward compatibility, and if so we won't lose > >> anything by making the change. I'm in favor of this change, but I may > have > >> overlooked something. Thoughts? 
> > I would be in favor too, but having not being able to code much in > > numpy the last few months, my opinion should not carry too much > > weight. I don't know how many people install numpy from github > > nowadays (which are the first "victims" when ABI breaks) > > > > cheers, > > > > David > > It is important to hear from people like Keith that build upon numpy and > those that build numpy binaries for distribution especially Windows and > non-gcc stuff like Intel's compilers and MKL. > > So while I do count less but I am in favor of it provided that scipy can > build and run correctly with this new numpy. > > Bruce > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Fri Jan 28 19:14:23 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 28 Jan 2011 17:14:23 -0700 Subject: [Numpy-discussion] Should we make the master branch backward compatible. In-Reply-To: References: <4D42DE9B.8030403@gmail.com> Message-ID: On Fri, Jan 28, 2011 at 4:37 PM, Mark Wiebe wrote: > Does anyone have any objections to me merging the branch into the numpy > trunk right now? > > Chuck suggested I try to split out the ABI changes, but they're kind of > tangled with the other changes. In particular, they involve fixing the type > promotion code to be enum order-independent, which depended on some changes > done for the iterator buffering code. After the key ABI fixes, there are > then changes to a fair bit of code to make both numpy and scipy pass their > tests. Anyways, editing that history at all feels like a bit of a quagmire, > so I'd sooner just go ahead and do the merge. > > Go ahead and merge it in and we'll see how it goes. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwwiebe at gmail.com Fri Jan 28 19:46:51 2011 From: mwwiebe at gmail.com (Mark Wiebe) Date: Fri, 28 Jan 2011 16:46:51 -0800 Subject: [Numpy-discussion] Should we make the master branch backward compatible. In-Reply-To: References: <4D42DE9B.8030403@gmail.com> Message-ID: On Fri, Jan 28, 2011 at 4:14 PM, Charles R Harris wrote: > > > Go ahead and merge it in and we'll see how it goes. > > I did the merge, and tried to trigger the buildbot, but it looks like a github svn issue has reared its head: http://support.github.com/discussions/repos/3155-svn-checkout-error-200-ok-error -Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: From akabaila at pcug.org.au Sat Jan 29 06:40:05 2011 From: akabaila at pcug.org.au (Algis Kabaila) Date: Sat, 29 Jan 2011 22:40:05 +1100 Subject: [Numpy-discussion] Inversion of near singular matrices. Message-ID: <201101292240.05470.akabaila@pcug.org.au> Hi, I am interested in determining if a matrix is singular or "nearly singular" - very ill conditioned. The problem occurs in structural engineering applications. My OS is kubuntu 10.10 (32 bit) Python 2.6.6 numpy and numpy.linalg binaries from ubuntu repositories. The attached tar ball has a little CLI script that generates singular or near singular matrices (because of the inevitable roundoffs) for matrices with elements from sequence 1, 2, 3, 4 etc. The dimension of matrix nn can be passed as command line parameter via sys.argv[1] . If argv[1] does not exist, the 5x5 default matrix is used. 
for nn = 3 and 4 numpy does not raise an exception
for nn = 5 it does raise an exception
for nn = 6, 7 np does not raise an exception
for nn = 8 np does raise an exception
for nn = 9 np does not raise an exception
for higher nn values np mostly raises the exception, but for nn = 23 and
nn = 120 it does NOT raise the exception.

It is worth noting that in practical problems of engineering analysis the
ill conditioned matrix is not "exact" - there always are approximations
and roundoff errors.

So my question is: how can one reliably detect singularity (or near
singularity) and raise an exception?

Many thanks for your attention,

Al.
--
Algis
http://akabaila.pcug.org.au/StructuralAnalysis.pdf
-------------- next part --------------
A non-text attachment was scrubbed...
Name: inversion.tar.gz
Type: application/x-compressed-tar
Size: 1089 bytes
Desc: not available
URL:

From sdb at cloud9.net  Sat Jan 29 06:47:23 2011
From: sdb at cloud9.net (Stuart Brorson)
Date: Sat, 29 Jan 2011 06:47:23 -0500 (EST)
Subject: [Numpy-discussion] Inversion of near singular matrices.
In-Reply-To: <201101292240.05470.akabaila@pcug.org.au>
References: <201101292240.05470.akabaila@pcug.org.au>
Message-ID:

> So my question is: how can one reliably detect singularity (or
> near singularity) and raise an exception?

Matrix condition number:

http://docs.scipy.org/doc/numpy/reference/generated/numpy.linalg.cond.html
http://en.wikipedia.org/wiki/Condition_number

Stuart

From scheffer.nicolas at gmail.com  Sat Jan 29 16:01:45 2011
From: scheffer.nicolas at gmail.com (Nicolas SCHEFFER)
Date: Sat, 29 Jan 2011 13:01:45 -0800
Subject: [Numpy-discussion] Help in speeding up accumulation in a matrix
Message-ID:

Hi all,

First email to the list for me, I just want to say how grateful I am
to have python+numpy+ipython etc... for my day to day needs. Great
combination of software.

Anyway, I've been having this bottleneck in one of my algorithms that
has been bugging me for quite a while.
The objective is to speed this part up. I've been doing tons of
optimization and parallel processing around that piece of code to get
a decent run time.

The problem is easy. You want to accumulate in a matrix, a weighted
sum of other matrices. Let's call this function scale_and_add:

def scale_and_add_re(R,w,Ms):
    (nb_add,mdim,j)=np.shape(Ms)
    for i in range(nb_add):
        R+=w[i]*Ms[i]
    return R

This 'for' loop bugs me since I know this will slow things down.

But the dimension of these things are so large that any attempt to
vectorize this is slower and takes too much memory.
I typically work with 1000 weights and matrices, matrices of dimension
of several hundred.

My current config is:
In [2]: %timeit scale_and_add_re(R,w,Ms)
1 loops, best of 3: 392 ms per loop

In [3]: w.shape
Out[3]: (2000,)

In [4]: Ms.shape
Out[4]: (2000, 200, 200)

I'd like to be able to double these dimensions.

For instance I could use broadcasting by using a dot product
%timeit dot(Ms.T,w)
1 loops, best of 3: 1.77 s per loop
But this is i) slower ii) takes too much memory
(btw, I'd really need an inplace dot-product in numpy to avoid the
copy in memory, like the blas call in scipy.linalg. But that's for
another thread...)

The matrices are square and symmetric. I should be able to get
something out of this, but I never found anything related to this in
Numpy.

I also tried a Cython reimplementation
%timeit scale_and_add_reg(L1,w,Ms)
1 loops, best of 3: 393 ms per loop
It brought nothing in speed.
Here's the code @cython.boundscheck(False) def scale_and_add_reg(np.ndarray[float, ndim=2] R, np.ndarray[float, ndim=1] w, np.ndarray[float, ndim=3] Ms): return _scale_and_add_reg(R,w,Ms) @cython.boundscheck(False) cdef int _scale_and_add_reg(np.ndarray[float, ndim=2] R, np.ndarray[float, ndim=1] w, np.ndarray[float, ndim=3] Ms): cdef unsigned int mdim cdef int nb_add (nb_add,mdim,j)=np.shape(Ms) cdef unsigned int i for i from 0 <= i < nb_add: R+=w[i]*Ms[i] #for j in range(mdim): # for k in range(mdim): # R[j][k]+=w[i]*Ms[i][j][k] return 0 So here's my question if someone has time to answer it: Did I try anything possible? Should I move along and deal with this bottleneck? Or is there something else I didn't think of? Thanks for reading, keep up the great work! -n From ben.root at ou.edu Sat Jan 29 16:17:37 2011 From: ben.root at ou.edu (Benjamin Root) Date: Sat, 29 Jan 2011 15:17:37 -0600 Subject: [Numpy-discussion] Help in speeding up accumulation in a matrix In-Reply-To: References: Message-ID: On Saturday, January 29, 2011, Nicolas SCHEFFER wrote: > Hi all, > > First email to the list for me, I just want to say how grateful I am > to have python+numpy+ipython etc... for my day to day needs. Great > combination of software. > > Anyway, I've been having this bottleneck in one my algorithms that has > been bugging me for quite a while. > The objective is to speed this part up. I've been doing tons of > optimization and parallel processing around that piece of code to get > a decent run time. > > The problem is easy. You want to accumulate in a matrix, a weighted > sum of other matrices. Let's call this function scale_and_add: > def scale_and_add_re(R,w,Ms): > ? ?(nb_add,mdim,j)=np.shape(Ms) > ? ?for i in range(nb_add): > ? ? ? ?R+=w[i]*Ms[i] > ? ?return R > This 'for' loop bugs me since I know this will slow things down. > > But the dimension of these things are so large that any attempt to > vectorize this is slower and takes too much memory. > I typically work with 1000 weights and matrices, matrices of dimension > of several hundred. > > My current config is: > In [2]: %timeit scale_and_add_re(R,w,Ms) > 1 loops, best of 3: 392 ms per loop > > In [3]: w.shape > Out[3]: (2000,) > > In [4]: Ms.shape > Out[4]: (2000, 200, 200) > I'd like to be able to double these dimensions. > > For instance I could use broadcasting by using a dot product > %timeit dot(Ms.T,w) > 1 loops, best of 3: 1.77 s per loop > But this is i) slower ii) takes too much memory > (btw, I'd really need an inplace dot-product in numpy to avoid the > copy in memory, like the blas call in scipy.linalg. But that's for an > other thread...) > > The matrices are squared and symmetric. I should be able to get > something out of this, but I never found anything related to this in > Numpy. > > I also tried a Cython reimplementation > %timeit scale_and_add_reg(L1,w,Ms) > 1 loops, best of 3: 393 ms per loop > It brought nothing in speed. > > Here's the code > @cython.boundscheck(False) > def scale_and_add_reg(np.ndarray[float, ndim=2] R, np.ndarray[float, > ndim=1] w, np.ndarray[float, ndim=3] Ms): > ? ?return _scale_and_add_reg(R,w,Ms) > > @cython.boundscheck(False) > cdef int _scale_and_add_reg(np.ndarray[float, ndim=2] R, > np.ndarray[float, ndim=1] w, np.ndarray[float, ndim=3] Ms): > ? ?cdef unsigned int mdim > ? ?cdef int nb_add > ? ?(nb_add,mdim,j)=np.shape(Ms) > ? ?cdef unsigned int i > ? ?for i from 0 <= i < nb_add: > ? ? ? ?R+=w[i]*Ms[i] > ? ? ? ?#for j in range(mdim): > ? ? ? ?# ? ?for k in range(mdim): > ? ? ? ?# ? ? ? 
?R[j][k]+=w[i]*Ms[i][j][k] > > ? ?return 0 > > So here's my question if someone has time to answer it: Did I try > anything possible? Should I move along and deal with this bottleneck? > Or is there something else I didn't think of? > > Thanks for reading, keep up the great work! > > -n > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > Have you tried chunking the vectoriized version of the code? By chunking it, you gain the speed-ups of vectorizing, but still stay within manageable memory sizes. You would first pre-allocate you output array. Then you would create an array of indices starting at zero and ending with the size of your array. The values would be spaced so that you could use the index at j as the start of a slice and the index at j + 1 as the end of a slice. I typically use 4096 as my increment and allow the last chunk to be smaller. Your loop would loop through these indices, using your vectorized code in its body. The modification is that you will need to specify the slices that the vectorized code operates on. Let us know if that helps! Ben Root From scheffer.nicolas at gmail.com Sat Jan 29 17:03:43 2011 From: scheffer.nicolas at gmail.com (Nicolas SCHEFFER) Date: Sat, 29 Jan 2011 14:03:43 -0800 Subject: [Numpy-discussion] Help in speeding up accumulation in a matrix In-Reply-To: References: Message-ID: Thanks for the prompt reply! I quickly tried that and it actually helps compared to the full vectorized version. Depending on the dimensions, the chunk size has to be tuned (typically 100 or so) But I don't get any improvement w/r to the simple for loop (i can almost match the time though). My guess is that the dot product overhead is still too big for element by element multiplication of matrices. Can't I in anyway leverage the fact that the matrices are symmetric? I'm sure there might some slicing wizardry for such a problem ;) Can the += on the result be optimized by numpy.accumulate or similar? Or is it done in C anyway w/o coming back to the python VM? I also looked into the type of CONTIGUOUS array in memory but not much luck here either. -n >> Hi all, >> >> First email to the list for me, I just want to say how grateful I am >> to have python+numpy+ipython etc... for my day to day needs. Great >> combination of software. >> >> Anyway, I've been having this bottleneck in one my algorithms that has >> been bugging me for quite a while. >> The objective is to speed this part up. I've been doing tons of >> optimization and parallel processing around that piece of code to get >> a decent run time. >> >> The problem is easy. You want to accumulate in a matrix, a weighted >> sum of other matrices. Let's call this function scale_and_add: >> def scale_and_add_re(R,w,Ms): >> ?? ?(nb_add,mdim,j)=np.shape(Ms) >> ?? ?for i in range(nb_add): >> ?? ? ? ?R+=w[i]*Ms[i] >> ?? ?return R >> This 'for' loop bugs me since I know this will slow things down. >> >> But the dimension of these things are so large that any attempt to >> vectorize this is slower and takes too much memory. >> I typically work with 1000 weights and matrices, matrices of dimension >> of several hundred. >> >> My current config is: >> In [2]: %timeit scale_and_add_re(R,w,Ms) >> 1 loops, best of 3: 392 ms per loop >> >> In [3]: w.shape >> Out[3]: (2000,) >> >> In [4]: Ms.shape >> Out[4]: (2000, 200, 200) >> I'd like to be able to double these dimensions. 
>> >> For instance I could use broadcasting by using a dot product >> %timeit dot(Ms.T,w) >> 1 loops, best of 3: 1.77 s per loop >> But this is i) slower ii) takes too much memory >> (btw, I'd really need an inplace dot-product in numpy to avoid the >> copy in memory, like the blas call in scipy.linalg. But that's for an >> other thread...) >> >> The matrices are squared and symmetric. I should be able to get >> something out of this, but I never found anything related to this in >> Numpy. >> >> I also tried a Cython reimplementation >> %timeit scale_and_add_reg(L1,w,Ms) >> 1 loops, best of 3: 393 ms per loop >> It brought nothing in speed. >> >> Here's the code >> @cython.boundscheck(False) >> def scale_and_add_reg(np.ndarray[float, ndim=2] R, np.ndarray[float, >> ndim=1] w, np.ndarray[float, ndim=3] Ms): >> ?? ?return _scale_and_add_reg(R,w,Ms) >> >> @cython.boundscheck(False) >> cdef int _scale_and_add_reg(np.ndarray[float, ndim=2] R, >> np.ndarray[float, ndim=1] w, np.ndarray[float, ndim=3] Ms): >> ?? ?cdef unsigned int mdim >> ?? ?cdef int nb_add >> ?? ?(nb_add,mdim,j)=np.shape(Ms) >> ?? ?cdef unsigned int i >> ?? ?for i from 0 <= i < nb_add: >> ?? ? ? ?R+=w[i]*Ms[i] >> ?? ? ? ?#for j in range(mdim): >> ?? ? ? ?# ? ?for k in range(mdim): >> ?? ? ? ?# ? ? ? ?R[j][k]+=w[i]*Ms[i][j][k] >> >> ?? ?return 0 >> >> So here's my question if someone has time to answer it: Did I try >> anything possible? Should I move along and deal with this bottleneck? >> Or is there something else I didn't think of? >> >> Thanks for reading, keep up the great work! >> >> -n >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > > Have you tried chunking the vectoriized version of the code? ?By > chunking it, you gain the speed-ups of vectorizing, but still stay > within manageable memory sizes. > > You would first pre-allocate you output array. ?Then you would create > an array of indices starting at zero and ending with the size of your > array. ?The values would be spaced so that you could use the index at > j as the start of a slice and the index at j + 1 as the end of a > slice. ?I typically use 4096 as my increment and allow the last chunk > to be smaller. > > Your loop would loop through these indices, using your vectorized code > in its body. ?The modification is that you will need to specify the > slices that the vectorized code operates on. > > Let us know if that helps! > Ben Root > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From sturla at molden.no Sat Jan 29 17:10:30 2011 From: sturla at molden.no (Sturla Molden) Date: Sat, 29 Jan 2011 23:10:30 +0100 Subject: [Numpy-discussion] Inversion of near singular matrices. In-Reply-To: <201101292240.05470.akabaila@pcug.org.au> References: <201101292240.05470.akabaila@pcug.org.au> Message-ID: <4D449056.6090406@molden.no> Den 29.01.2011 12:40, skrev Algis Kabaila: > So my question is: how can one reliably detect singularity (or > near singularity) and raise an exception? Use an SVD, examine the singular values. One or more small singular values indicate ill-conditioning. (What constitutes a small singular value is a debate I'll avoid.) The SVD will also let you fix the problem when inverting the matrix, simply by truncating them to 0. SVD is slow, but the advantage is that it cannot fail. 
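A minimal sketch of that check (the 1e-10 cutoff below is purely illustrative,
since as noted what counts as a "small" singular value is problem dependent):

import numpy as np

a = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0],
              [7.0, 8.0, 9.0001]])    # nearly singular

s = np.linalg.svd(a, compute_uv=False)  # singular values, largest first
print(s[0] / s[-1])                     # 2-norm condition number
if s[-1] <= 1e-10 * s[0]:
    raise np.linalg.LinAlgError("matrix is singular to working precision")

The same ratio is what numpy.linalg.cond returns for the default 2-norm, so
comparing cond(a) against a threshold is an equivalent way to raise the
exception the original poster asked for.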
If SVD is deemed too slow, you might look for a more specialized solution. But always try to understand the problem, don't just use SVD as a universal solution to any ill-conditioned matrix problem, even if it's tempting. In statistics we sometimes see ill-conditioning of covariance matrices. Another way to deal with multicollinearity besides SVD/PCA is regularisation. Simply adding a small bias k*I to the diagonal might fix the problem (cf. ridge regression). In the Levenberg-Marquardt algorithm used to fit non-linear least squares models (cf. scipy.optimize.leastsq), the bias k to the diagonal of the Jacobian is changed adaptively. One might also know in advance if a covariance matrix could be ill-conditioned (the number of samples is small compared to the number of dimensions) or singular (less data than parameters). That is, sometimes we don't even need to look at the matrix to give the correct diagnosis. Another widely used strategy is to use Cholesky factorization on covariance matrices. It is always stable unless there is a singularity, for which it will fail (NumPy will raise a LinAlgError exception). Cholesky is therefore safer to use for inverting covariance matrices than LU (as well as faster). If Cholesky fails one might fallback to SVD or regularisation to correct the problem. Sturla From charlesr.harris at gmail.com Sat Jan 29 17:30:01 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 29 Jan 2011 15:30:01 -0700 Subject: [Numpy-discussion] Help in speeding up accumulation in a matrix In-Reply-To: References: Message-ID: On Sat, Jan 29, 2011 at 2:01 PM, Nicolas SCHEFFER < scheffer.nicolas at gmail.com> wrote: > Hi all, > > First email to the list for me, I just want to say how grateful I am > to have python+numpy+ipython etc... for my day to day needs. Great > combination of software. > > Anyway, I've been having this bottleneck in one my algorithms that has > been bugging me for quite a while. > The objective is to speed this part up. I've been doing tons of > optimization and parallel processing around that piece of code to get > a decent run time. > > The problem is easy. You want to accumulate in a matrix, a weighted > sum of other matrices. Let's call this function scale_and_add: > def scale_and_add_re(R,w,Ms): > (nb_add,mdim,j)=np.shape(Ms) > for i in range(nb_add): > R+=w[i]*Ms[i] > return R > This 'for' loop bugs me since I know this will slow things down. > > I'd put the flattened matrices in a stack, weight in place, sum on the first index, and reshape. Something like In [1]: m = array([eye(3)]*4).reshape(4,-1) In [2]: m Out[2]: array([[ 1., 0., 0., 0., 1., 0., 0., 0., 1.], [ 1., 0., 0., 0., 1., 0., 0., 0., 1.], [ 1., 0., 0., 0., 1., 0., 0., 0., 1.], [ 1., 0., 0., 0., 1., 0., 0., 0., 1.]]) In [3]: w = array([1.,2.,3.,4.]) In [4]: m *= w[:,None] In [5]: r = m.sum(0).reshape(3,3) In [6]: r Out[6]: array([[ 10., 0., 0.], [ 0., 10., 0.], [ 0., 0., 10.]]) This should fit in memory I think, depending of course on how much memory you have. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From sturla at molden.no Sat Jan 29 17:44:29 2011 From: sturla at molden.no (Sturla Molden) Date: Sat, 29 Jan 2011 23:44:29 +0100 Subject: [Numpy-discussion] Wiener filter / decorrelation In-Reply-To: <83b96e9e-6a8f-440d-8f14-45ecffd92d8c@e9g2000vbi.googlegroups.com> References: <4D3F3AEA.9050600@grinta.net> <4D3F401A.3060203@grinta.net> <4D41F613.9080700@molden.no> <83b96e9e-6a8f-440d-8f14-45ecffd92d8c@e9g2000vbi.googlegroups.com> Message-ID: <4D44984D.2080404@molden.no> Didn't I recommend the C++ and Fortran versions of 2nd edition? I particularly like the Fortran 90 edition as NumPy behaves like a vector machine as well. (The algorithms are explained in the Fortran 77 text, so they must be read together.) I'd like to warn against NR in C (all algorithms contain illegal C) and the third edition of NR (bloated and unorganised text, obfuscated C++), as well as known NR problems (e.g. bad PRNGs, unstable SVD code, slow FFTs, license). Another book I recommend is Golub and van Loan's text on matrix computations (1996). It explains the algorithms with array syntax (Matlab inspired), as well as citing LAPACK and BLAS routines relevant for the text. Sturla Den 28.01.2011 15:50, skrev denis: > Sturla, > what books do you like then, on any of NR's topics ? > Examples convince better than attacks. > Sure NR has its faults, but I like its style: de gustabus > cheers > -- denis > > On Jan 27, 11:47 pm, Sturla Molden wrote: > >> NR's third edition is utterly atrocious... >> extensive. It has many valuable details that should have been covered in >> previous versions, but it is presented in a way that makes me barf. From e.antero.tammi at gmail.com Sat Jan 29 17:56:10 2011 From: e.antero.tammi at gmail.com (eat) Date: Sun, 30 Jan 2011 00:56:10 +0200 Subject: [Numpy-discussion] Help in speeding up accumulation in a matrix In-Reply-To: References: Message-ID: Hi, On Sat, Jan 29, 2011 at 11:01 PM, Nicolas SCHEFFER < scheffer.nicolas at gmail.com> wrote: > Hi all, > > First email to the list for me, I just want to say how grateful I am > to have python+numpy+ipython etc... for my day to day needs. Great > combination of software. > > Anyway, I've been having this bottleneck in one my algorithms that has > been bugging me for quite a while. > The objective is to speed this part up. I've been doing tons of > optimization and parallel processing around that piece of code to get > a decent run time. > > The problem is easy. You want to accumulate in a matrix, a weighted > sum of other matrices. Let's call this function scale_and_add: > def scale_and_add_re(R,w,Ms): > (nb_add,mdim,j)=np.shape(Ms) > for i in range(nb_add): > R+=w[i]*Ms[i] > return R > This 'for' loop bugs me since I know this will slow things down. > > But the dimension of these things are so large that any attempt to > vectorize this is slower and takes too much memory. > I typically work with 1000 weights and matrices, matrices of dimension > of several hundred. > > My current config is: > In [2]: %timeit scale_and_add_re(R,w,Ms) > 1 loops, best of 3: 392 ms per loop > > In [3]: w.shape > Out[3]: (2000,) > > In [4]: Ms.shape > Out[4]: (2000, 200, 200) > I'd like to be able to double these dimensions. How this array Ms is created? Do you really need to have it in the memory as whole? Assuming it's created by (200, 200) 'chunks' at a time, then you could accumulate that right away to R. It still involves Python looping but that's not so much overhead. 
My 2 cents eat For instance I could use broadcasting by using a dot product %timeit dot(Ms.T,w) 1 loops, best of 3: 1.77 s per loop But this is i) slower ii) takes too much memory (btw, I'd really need an inplace dot-product in numpy to avoid the copy in memory, like the blas call in scipy.linalg. But that's for an other thread...) The matrices are squared and symmetric. I should be able to get something out of this, but I never found anything related to this in Numpy. I also tried a Cython reimplementation %timeit scale_and_add_reg(L1,w,Ms) 1 loops, best of 3: 393 ms per loop It brought nothing in speed. Here's the code @cython.boundscheck(False) def scale_and_add_reg(np.ndarray[float, ndim=2] R, np.ndarray[float, ndim=1] w, np.ndarray[float, ndim=3] Ms): return _scale_and_add_reg(R,w,Ms) @cython.boundscheck(False) cdef int _scale_and_add_reg(np.ndarray[float, ndim=2] R, np.ndarray[float, ndim=1] w, np.ndarray[float, ndim=3] Ms): cdef unsigned int mdim cdef int nb_add (nb_add,mdim,j)=np.shape(Ms) cdef unsigned int i for i from 0 <= i < nb_add: R+=w[i]*Ms[i] #for j in range(mdim): # for k in range(mdim): # R[j][k]+=w[i]*Ms[i][j][k] return 0 So here's my question if someone has time to answer it: Did I try anything possible? Should I move along and deal with this bottleneck? Or is there something else I didn't think of? Thanks for reading, keep up the great work! -n _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From jason-sage at creativetrax.com Sat Jan 29 20:58:33 2011 From: jason-sage at creativetrax.com (Jason Grout) Date: Sat, 29 Jan 2011 19:58:33 -0600 Subject: [Numpy-discussion] numpy.linalg.svd documentation Message-ID: <4D44C5C9.4060205@creativetrax.com> The SVD documentation seems a bit misleading. It says: Factors the matrix a as u * np.diag(s) * v, where u and v are unitary and s is a 1-d array of a?s singular values. However, that only is true (i.e., you just have to do np.diag(s) to get S) in general if full_matrices is False, which is not the default. Otherwise, you have to something like in the first example in the docstring. I'm not sure what the right fix is here. Changing the default for full_matrices seems too drastic. But then having u*np.diag(s)*v in the first line doesn't work if you have a rectangular matrix. Perhaps the first line could be changed to: Factors the matrix a as u * S * v, where u and v are unitary and S is a matrix with shape (a.shape[0], a.shape[1]) with np.diag(S)=s, where s is a 1-d array of a?s singular values. It sounds more confusing that way, but at least it's correct. Maybe even better would be to add a shape option to np.diag, and then just make the first line of the svd docstring say u*np.diag(s,shape=(a.shape[0],a.shape[1]))*v Jason From josef.pktd at gmail.com Sat Jan 29 21:46:50 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 29 Jan 2011 21:46:50 -0500 Subject: [Numpy-discussion] numpy.linalg.svd documentation In-Reply-To: <4D44C5C9.4060205@creativetrax.com> References: <4D44C5C9.4060205@creativetrax.com> Message-ID: On Sat, Jan 29, 2011 at 8:58 PM, Jason Grout wrote: > The SVD documentation seems a bit misleading. ?It says: > > Factors the matrix a as u * np.diag(s) * v, where u and v are unitary > and s is a 1-d array of a?s singular values. 
> > However, that only is true (i.e., you just have to do np.diag(s) to get > S) in general if full_matrices is False, which is not the default. > Otherwise, you have to something like in the first example in the docstring. > > I'm not sure what the right fix is here. ?Changing the default for > full_matrices seems too drastic. ?But then having u*np.diag(s)*v in the > first line doesn't work if you have a rectangular matrix. ?Perhaps the > first line could be changed to: > > Factors the matrix a as u * S * v, where u and v are unitary and S is a > matrix with shape (a.shape[0], a.shape[1]) with np.diag(S)=s, where s is > a 1-d array of a?s singular values. > > It sounds more confusing that way, but at least it's correct. > > Maybe even better would be to add a shape option to np.diag, and then > just make the first line of the svd docstring say > u*np.diag(s,shape=(a.shape[0],a.shape[1]))*v or move scipy's diagsvd to numpy scipy.linalg.diagsvd(s, M, N) I found the difference between full and not full matrices confusing when I tried to figure out how svd (in scipy) works. diagsvd was a big help for me. I think you could just edit it with the documentation editor. Any clarification is better even if it sounds a bit complicated. Josef > > > Jason > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From charlesr.harris at gmail.com Sat Jan 29 21:58:54 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 29 Jan 2011 19:58:54 -0700 Subject: [Numpy-discussion] numpy.linalg.svd documentation In-Reply-To: <4D44C5C9.4060205@creativetrax.com> References: <4D44C5C9.4060205@creativetrax.com> Message-ID: On Sat, Jan 29, 2011 at 6:58 PM, Jason Grout wrote: > The SVD documentation seems a bit misleading. It says: > > Factors the matrix a as u * np.diag(s) * v, where u and v are unitary > and s is a 1-d array of a?s singular values. > > However, that only is true (i.e., you just have to do np.diag(s) to get > S) in general if full_matrices is False, which is not the default. > Otherwise, you have to something like in the first example in the > docstring. > > I'm not sure what the right fix is here. Changing the default for > full_matrices seems too drastic. But then having u*np.diag(s)*v in the > first line doesn't work if you have a rectangular matrix. Perhaps the > first line could be changed to: > > I hate full_matrices as the default, it is almost never what I want and a horrible waste of time and space. Nothing is too drastic when it comes to full matrices. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Sat Jan 29 22:05:14 2011 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 29 Jan 2011 22:05:14 -0500 Subject: [Numpy-discussion] Help in speeding up accumulation in a matrix In-Reply-To: References: Message-ID: On Sat, Jan 29, 2011 at 5:30 PM, Charles R Harris wrote: > > > On Sat, Jan 29, 2011 at 2:01 PM, Nicolas SCHEFFER > wrote: >> >> Hi all, >> >> First email to the list for me, I just want to say how grateful I am >> to have python+numpy+ipython etc... for my day to day needs. Great >> combination of software. >> >> Anyway, I've been having this bottleneck in one my algorithms that has >> been bugging me for quite a while. >> The objective is to speed this part up. 
I've been doing tons of >> optimization and parallel processing around that piece of code to get >> a decent run time. >> >> The problem is easy. You want to accumulate in a matrix, a weighted >> sum of other matrices. Let's call this function scale_and_add: >> def scale_and_add_re(R,w,Ms): >> ? ?(nb_add,mdim,j)=np.shape(Ms) >> ? ?for i in range(nb_add): >> ? ? ? ?R+=w[i]*Ms[i] >> ? ?return R >> This 'for' loop bugs me since I know this will slow things down. >> > > I'd put the flattened matrices in a stack, weight in place, sum on the first > index, and reshape. Something like > > In [1]: m = array([eye(3)]*4).reshape(4,-1) using triu_indices instead of reshape would cut the memory consumption ind1, ind2 = np.triu_indices(?,?) m = m[:, ind1, ind2] I have no idea how expensive the indexing is in this case Josef > > In [2]: m > Out[2]: > array([[ 1.,? 0.,? 0.,? 0.,? 1.,? 0.,? 0.,? 0.,? 1.], > ?????? [ 1.,? 0.,? 0.,? 0.,? 1.,? 0.,? 0.,? 0.,? 1.], > ?????? [ 1.,? 0.,? 0.,? 0.,? 1.,? 0.,? 0.,? 0.,? 1.], > ?????? [ 1.,? 0.,? 0.,? 0.,? 1.,? 0.,? 0.,? 0.,? 1.]]) > > In [3]: w = array([1.,2.,3.,4.]) > > In [4]: m *= w[:,None] > > In [5]: r = m.sum(0).reshape(3,3) > > In [6]: r > Out[6]: > array([[ 10.,?? 0.,?? 0.], > ?????? [? 0.,? 10.,?? 0.], > ?????? [? 0.,?? 0.,? 10.]]) > > This should fit in memory I think, depending of course on how much memory > you have. > > > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > From akabaila at pcug.org.au Sat Jan 29 23:52:18 2011 From: akabaila at pcug.org.au (Algis Kabaila) Date: Sun, 30 Jan 2011 15:52:18 +1100 Subject: [Numpy-discussion] Inversion of near singular matrices. In-Reply-To: References: <201101292240.05470.akabaila@pcug.org.au> Message-ID: <201101301552.18874.akabaila@pcug.org.au> On Saturday 29 January 2011 22:47:23 Stuart Brorson wrote: > > So my question is: how can one reliably detect singularity > > (or near singularity) and raise an exception? > > Matrix condition number: > > http://docs.scipy.org/doc/numpy/reference/generated/numpy.lin > alg.cond.html http://en.wikipedia.org/wiki/Condition_number > > Stuart Stuart, Thank you for the wonderful pointer to good information. When I was writing my PhD in 1966, there was little if anything known about the condition mumbers. Your judiciously chosen references put me on the track to search some information about it. Internet is wonderful - there are some well written material there to keep me quietly reading for a while... Thank you, Al. (aka OldAl) -- Algis http://akabaila.pcug.org.au/StructuralAnalysis.pdf From akabaila at pcug.org.au Sun Jan 30 00:11:23 2011 From: akabaila at pcug.org.au (Algis Kabaila) Date: Sun, 30 Jan 2011 16:11:23 +1100 Subject: [Numpy-discussion] Inversion of near singular matrices. In-Reply-To: <4D449056.6090406@molden.no> References: <201101292240.05470.akabaila@pcug.org.au> <4D449056.6090406@molden.no> Message-ID: <201101301611.23488.akabaila@pcug.org.au> On Sunday 30 January 2011 09:10:30 Sturla Molden wrote: > Den 29.01.2011 12:40, skrev Algis Kabaila: > > So my question is: how can one reliably detect singularity > > (or near singularity) and raise an exception? > > Use an SVD, examine the singular values. I gather that SVD is the Singular Value Decomposition, but I have no idea how to perform such decomposition. Would you care to refer me to some simple source material? I have been advised to watch the condition numbers. 
No doubt, SVD and condition numbers are related. The references about condition numbers are very interesting and I intend to follow them in the first instance. > In statistics we sometimes see ill-conditioning of covariance > matrices. Another way to deal with multicollinearity besides > SVD/PCA is regularisation. Simply adding a small bias k*I to > the diagonal might fix the problem (cf. ridge regression). > In the Levenberg-Marquardt algorithm used to fit non-linear > least squares models (cf. > scipy.optimize.leastsq), the bias k to the diagonal of the > Jacobian is changed adaptively. One might also know in > advance if a covariance matrix could be ill-conditioned (the > number of samples is small compared to the number of > dimensions) or singular (less data than parameters). That > is, sometimes we don't even need to look at the matrix to > give the correct diagnosis. Another widely used strategy is > to use Cholesky factorization on covariance matrices. It is > always stable unless there is a singularity, for which it > will fail (NumPy will raise a LinAlgError exception). > Cholesky is therefore safer to use for inverting covariance > matrices than LU (as well as faster). If Cholesky fails one > might fallback to SVD or regularisation to correct the > problem. > > Sturla > My knowledge of statistics is rather limited, though our son Dr. Paul Kabaila is a specialist in that area. My interests lie in the area of Analysis of Engineering Structures - it saves my brain from falling to a permafrost like sleep :) Thank you for your reply - greatly appreciated. Al. PS: Paul, I thought there is a minuscule chance that this is of some interest to you. Tete. -- Algis http://akabaila.pcug.org.au/StructuralAnalysis.pdf From charlesr.harris at gmail.com Sun Jan 30 00:35:15 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 29 Jan 2011 22:35:15 -0700 Subject: [Numpy-discussion] Inversion of near singular matrices. In-Reply-To: <201101301611.23488.akabaila@pcug.org.au> References: <201101292240.05470.akabaila@pcug.org.au> <4D449056.6090406@molden.no> <201101301611.23488.akabaila@pcug.org.au> Message-ID: On Sat, Jan 29, 2011 at 10:11 PM, Algis Kabaila wrote: > On Sunday 30 January 2011 09:10:30 Sturla Molden wrote: > > Den 29.01.2011 12:40, skrev Algis Kabaila: > > > So my question is: how can one reliably detect singularity > > > (or near singularity) and raise an exception? > > > > Use an SVD, examine the singular values. > I gather that SVD is the Singular Value Decomposition, but I > have no idea how to perform such decomposition. Would you care > to refer me to some simple source material? I have been advised > to watch the condition numbers. No doubt, SVD and condition > numbers are related. The references about condition numbers are > very interesting and I intend to follow them in the first > instance. > > Use numpy.linalg.svd. The condition number is the ratio of the largest singular value to the smallest. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From akabaila at pcug.org.au Sun Jan 30 01:28:32 2011 From: akabaila at pcug.org.au (Algis Kabaila) Date: Sun, 30 Jan 2011 17:28:32 +1100 Subject: [Numpy-discussion] Inversion of near singular matrices. 
In-Reply-To: References: <201101292240.05470.akabaila@pcug.org.au> <201101301611.23488.akabaila@pcug.org.au> Message-ID: <201101301728.32665.akabaila@pcug.org.au> On Sunday 30 January 2011 16:35:15 Charles R Harris wrote: > On Sat, Jan 29, 2011 at 10:11 PM, Algis Kabaila wrote: > > On Sunday 30 January 2011 09:10:30 Sturla Molden wrote: > > > Den 29.01.2011 12:40, skrev Algis Kabaila: > > > > So my question is: how can one reliably detect > > > > singularity (or near singularity) and raise an > > > > exception? > > > > > > Use an SVD, examine the singular values. > > > > I gather that SVD is the Singular Value Decomposition, but > > I have no idea how to perform such decomposition. > Use numpy.linalg.svd. The condition number is the ratio of > the largest singular value to the smallest. > > > > Chuck Why not simply numply.linalg.cond? This gives the condition number directly (and presumably performs the inspection of sv's). Or do you think that sv's give more useful information? Thanks for writing - I find it all rather fascinating... Gratefully, Al. -- Algis http://akabaila.pcug.org.au/StructuralAnalysis.pdf From gael.varoquaux at normalesup.org Sun Jan 30 09:28:32 2011 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Sun, 30 Jan 2011 15:28:32 +0100 Subject: [Numpy-discussion] [ANN] FEMTEC: Trac on open source scientific software Message-ID: <20110130142832.GJ20649@phare.normalesup.org> Hi list, This is just a note that an extra track at FEMTEC, a conference for computational methods in engineering and science, is open for open source scientific software. The organisers have a taste for Python, so if you want to submit a paper on numerical methods with Python, this is an excellent venue. Abstract submission is open till end of February. To submit you need to create an account and edit you profile. Gael ________________________________________________________________________________ The 3rd International Conference on Finite Element Methods in Engineering and Science (FEMTEC 2011, http://hpfem.org/events/femtec-2011/) will have a track on Open-source projects and Python in scientific computing. FEMTEC 2011 is co-organized by the University of Nevada (Reno), Desert Reseach Institute (Reno), Idaho National Laboratory (Idaho Falls, Idaho), and U.S. Army Engineer Research and Development Center (Vicksburg, Mississippi). The objective of the meeting is to strengthen the interaction between researchers who develop new computational methods, and scientists and engineers from various fields who employ numerical methods in their research. Specific focus areas of FEMTEC 2011 include, but are not limited to, the following: * Computational methods in hydrology, atmospheric modeling, and other earth sciences. * Computational methods in nuclear, mechanical, civil, electrical, and other engineering fields. * Mesh generation and scientific visualization. * Open-source projects and Python in scientific computing. Part of the conference will be a software afternoon featuring open source projects of participants. Proceedings Proceedings of FEMTEC 2011 will appear as a special issue of Journal of Computational and Applied Mathematics (2008 SCI impact factor 1.292), and additional high-impact international journals as needed. From sturla at molden.no Sun Jan 30 10:15:34 2011 From: sturla at molden.no (Sturla Molden) Date: Sun, 30 Jan 2011 16:15:34 +0100 Subject: [Numpy-discussion] Inversion of near singular matrices. 
In-Reply-To: <201101301728.32665.akabaila@pcug.org.au> References: <201101292240.05470.akabaila@pcug.org.au> <201101301611.23488.akabaila@pcug.org.au> <201101301728.32665.akabaila@pcug.org.au> Message-ID: <4D458096.1000808@molden.no> Den 30.01.2011 07:28, skrev Algis Kabaila: > Why not simply numply.linalg.cond? This gives the condition > number directly (and presumably performs the inspection of > sv's). Or do you think that sv's give more useful information? You can use the singular value decomposition to invert the matrix, solve linear systems and solve least squares problems. Looking at the topic you don't just want to compute condition numbers, but invert the ill-conditioned (nearly singular) matrix. Say you want to do: invA = np.linalg.inv(a) With matrix a nearly singular, you can proceed like this: U, s, Vh = np.linalg.svd(a, full_matrices=False) Edit small singular values. This will add a small bias but reduce rounding error: singular = s < threshold invS = 1/s invS[singular] = 0 # truncate inf to 0 actually helps... Et voil?: invA = np.dot(np.dot(U,np.diag(invS)),Vh) I hope this helps :) There is a chapter on SVD in "Numerical Receipes" and almost any linear algebra textbook. Just to verify: >>> a = np.diag([1,2,3]) >>> np.linalg.inv(a) array([[ 1. , 0. , 0. ], [ 0. , 0.5 , 0. ], [ 0. , 0. , 0.33333333]]) >>> U, s, Vh = np.linalg.svd(a, full_matrices=False) >>> np.dot(np.dot(U,np.diag(1/s)),Vh) array([[ 1. , 0. , 0. ], [ 0. , 0.5 , 0. ], [ 0. , 0. , 0.33333333]]) Sturla From sturla at molden.no Sun Jan 30 10:25:42 2011 From: sturla at molden.no (Sturla Molden) Date: Sun, 30 Jan 2011 16:25:42 +0100 Subject: [Numpy-discussion] numpy.linalg.svd documentation In-Reply-To: <4D44C5C9.4060205@creativetrax.com> References: <4D44C5C9.4060205@creativetrax.com> Message-ID: <4D4582F6.1090309@molden.no> Den 30.01.2011 02:58, skrev Jason Grout: > Factors the matrix a as u * S * v, It actually returns the Hermitian of v, as almost any use of SVD will require v.H. And by the way, the documentation does not say that the factorization is u * S * v, but u * np.diag(s) * v.H. Sturla From bsouthey at gmail.com Sun Jan 30 10:40:10 2011 From: bsouthey at gmail.com (Bruce Southey) Date: Sun, 30 Jan 2011 09:40:10 -0600 Subject: [Numpy-discussion] Inversion of near singular matrices. In-Reply-To: <4D458096.1000808@molden.no> References: <201101292240.05470.akabaila@pcug.org.au> <201101301611.23488.akabaila@pcug.org.au> <201101301728.32665.akabaila@pcug.org.au> <4D458096.1000808@molden.no> Message-ID: On Sun, Jan 30, 2011 at 9:15 AM, Sturla Molden wrote: > Den 30.01.2011 07:28, skrev Algis Kabaila: >> Why not simply numply.linalg.cond? This gives the condition >> number directly (and presumably performs the inspection of >> sv's). Or do you think that sv's give more useful information? > > You can use the singular value decomposition to invert the matrix, solve > linear systems and solve least squares problems. Looking at the topic > you don't just want to compute condition numbers, but invert the > ill-conditioned (nearly singular) matrix. > > Say you want to do: > > ? ?invA = ?np.linalg.inv(a) > > With matrix a nearly singular, you can proceed like this: > > ? ?U, s, Vh = np.linalg.svd(a, full_matrices=False) > > Edit small singular values. This will add a small bias but reduce > rounding error: > > ? ?singular = s < threshold > ? ?invS = 1/s > ? ?invS[singular] = 0 # truncate inf to 0 actually helps... > > Et voil?: > > ? 
?invA = np.dot(np.dot(U,np.diag(invS)),Vh) > > I hope this helps :) > > There is a chapter on SVD in "Numerical Receipes" and almost any linear > algebra textbook. > > Just to verify: > > ?>>> a = np.diag([1,2,3]) > ?>>> np.linalg.inv(a) > array([[ 1. ? ? ? ?, ?0. ? ? ? ?, ?0. ? ? ? ?], > ? ? ? ?[ 0. ? ? ? ?, ?0.5 ? ? ? , ?0. ? ? ? ?], > ? ? ? ?[ 0. ? ? ? ?, ?0. ? ? ? ?, ?0.33333333]]) > ?>>> U, s, Vh = np.linalg.svd(a, full_matrices=False) > ?>>> np.dot(np.dot(U,np.diag(1/s)),Vh) > array([[ 1. ? ? ? ?, ?0. ? ? ? ?, ?0. ? ? ? ?], > ? ? ? ?[ 0. ? ? ? ?, ?0.5 ? ? ? , ?0. ? ? ? ?], > ? ? ? ?[ 0. ? ? ? ?, ?0. ? ? ? ?, ?0.33333333]]) > > > Sturla > > If the matrix is not full rank then it is not invertible (http://en.wikipedia.org/wiki/Matrix_inverse) and (matrix) rank (http://en.wikipedia.org/wiki/Matrix_rank) can be computed from the above code. You do have to beware that you can get a generalized inverse (numpy.linalg provides pinv) when your system has an infinite number of solutions. (A generalized inverse is not the problem, the problem is when you expect a unique solution and do not get it.) Bruce From charlesr.harris at gmail.com Sun Jan 30 11:04:08 2011 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 30 Jan 2011 09:04:08 -0700 Subject: [Numpy-discussion] numpy.linalg.svd documentation In-Reply-To: <4D4582F6.1090309@molden.no> References: <4D44C5C9.4060205@creativetrax.com> <4D4582F6.1090309@molden.no> Message-ID: On Sun, Jan 30, 2011 at 8:25 AM, Sturla Molden wrote: > Den 30.01.2011 02:58, skrev Jason Grout: > > Factors the matrix a as u * S * v, > > It actually returns the Hermitian of v, as almost any use of SVD will > require v.H. And by the way, the documentation does not say that the > factorization is u * S * v, but u * np.diag(s) * v.H. > > The v.H is the old, incorrect, version of the documentation. The current documentation is correct. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From gregor.thalhammer at gmail.com Sun Jan 30 11:12:50 2011 From: gregor.thalhammer at gmail.com (Gregor Thalhammer) Date: Sun, 30 Jan 2011 17:12:50 +0100 Subject: [Numpy-discussion] Help in speeding up accumulation in a matrix In-Reply-To: References: Message-ID: <69968386-FCEF-497C-9A4B-39DB368A3906@gmail.com> Am 29.1.2011 um 22:01 schrieb Nicolas SCHEFFER: > Hi all, > > First email to the list for me, I just want to say how grateful I am > to have python+numpy+ipython etc... for my day to day needs. Great > combination of software. > > Anyway, I've been having this bottleneck in one my algorithms that has > been bugging me for quite a while. > The objective is to speed this part up. I've been doing tons of > optimization and parallel processing around that piece of code to get > a decent run time. > > The problem is easy. You want to accumulate in a matrix, a weighted > sum of other matrices. Let's call this function scale_and_add: > def scale_and_add_re(R,w,Ms): > (nb_add,mdim,j)=np.shape(Ms) > for i in range(nb_add): > R+=w[i]*Ms[i] > return R > This 'for' loop bugs me since I know this will slow things down. > > But the dimension of these things are so large that any attempt to > vectorize this is slower and takes too much memory. > I typically work with 1000 weights and matrices, matrices of dimension > of several hundred. 
> > My current config is: > In [2]: %timeit scale_and_add_re(R,w,Ms) > 1 loops, best of 3: 392 ms per loop > > In [3]: w.shape > Out[3]: (2000,) > > In [4]: Ms.shape > Out[4]: (2000, 200, 200) > I'd like to be able to double these dimensions. > > For instance I could use broadcasting by using a dot product > %timeit dot(Ms.T,w) > 1 loops, best of 3: 1.77 s per loop > But this is i) slower ii) takes too much memory > (btw, I'd really need an inplace dot-product in numpy to avoid the > copy in memory, like the blas call in scipy.linalg. But that's for an > other thread...) If you use a different memory layout for your data, you can improve your performance with dot: MsT = Ms.T.copy() %timeit np.dot(M,w) 10 loops, best of 3: 107 ms per loop for comparison: In [32]: %timeit scale_and_add_re(R,w,Ms) 1 loops, best of 3: 245 ms per loop I don't think you can do much better than this. The above value gives a memory bandwidth for accessing Ms of 6 GB/s, I think the hardware limit for my system is about 10 Gb/s. Gregor From gael.varoquaux at normalesup.org Sun Jan 30 11:21:37 2011 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Sun, 30 Jan 2011 17:21:37 +0100 Subject: [Numpy-discussion] Inversion of near singular matrices. In-Reply-To: <4D458096.1000808@molden.no> References: <201101292240.05470.akabaila@pcug.org.au> <201101301611.23488.akabaila@pcug.org.au> <201101301728.32665.akabaila@pcug.org.au> <4D458096.1000808@molden.no> Message-ID: <20110130162137.GD14858@phare.normalesup.org> On Sun, Jan 30, 2011 at 04:15:34PM +0100, Sturla Molden wrote: > Den 30.01.2011 07:28, skrev Algis Kabaila: > > Why not simply numply.linalg.cond? This gives the condition > > number directly (and presumably performs the inspection of > > sv's). Or do you think that sv's give more useful information? > You can use the singular value decomposition to invert the matrix, solve > linear systems and solve least squares problems. Looking at the topic > you don't just want to compute condition numbers, but invert the > ill-conditioned (nearly singular) matrix. And if you are trying to solve a least-squares, I think that you should be using a ridge (or Tikhonov) regularisation: http://en.wikipedia.org/wiki/Tikhonov_regularization read in particular the paragraph above the table of content: you are most likely interested in Gamma = alpha identity, where you set alpha to be say 1% (or .1%) of the largest eigenvalue of A^t A. Gael From sturla at molden.no Sun Jan 30 12:35:56 2011 From: sturla at molden.no (Sturla Molden) Date: Sun, 30 Jan 2011 18:35:56 +0100 Subject: [Numpy-discussion] numpy.linalg.svd documentation In-Reply-To: References: <4D44C5C9.4060205@creativetrax.com> <4D4582F6.1090309@molden.no> Message-ID: <4D45A17C.1040007@molden.no> Den 30.01.2011 17:04, skrev Charles R Harris: > > The v.H is the old, incorrect, version of the documentation. The > current documentation is correct. !!! Was it just the documentation that was false, or did SVD return v.H before? Sturla -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From friedrichromstedt at gmail.com Sun Jan 30 14:29:16 2011 From: friedrichromstedt at gmail.com (Friedrich Romstedt) Date: Sun, 30 Jan 2011 20:29:16 +0100 Subject: [Numpy-discussion] create a numpy array of images In-Reply-To: <4D4311A7.20007@noaa.gov> References: <4D4311A7.20007@noaa.gov> Message-ID: 2011/1/28 Christopher Barker : > On 1/28/11 7:01 AM, Asmi Shah wrote: >> I am using python for a while now and I have a requirement of creating a >> numpy array of microscopic tiff images ( this data is 3d, meaning there are >> 100 z slices of 512 X 512 pixels.) How can I create an array of images? > > It's quite straightforward to create a 3-d array to hold this kind of data: > > image_block = np.empty((100, 512, 512), dtype=??) > > now you can load it up by using some lib (PIL, or ???) to load the tif > images, and then: > > for i in images: > ? ? image_block[i,:,:] = i Notice that since PIL 1.1.6, PIL Image objects support the numpy interface: http://effbot.org/zone/pil-changes-116.htm >>> import PIL.Image >>> im = PIL.Image.open('P1010102.JPG') >>> im >>> a = numpy.asarray(im) >>> a.shape (2448, 3264, 3) >>> a.dtype dtype('uint8') You can use the image just as any other ndarray: >>> stack = numpy.empty((5, 2488, 3264, 3)) >>> stack[0] = im and so on for 5 images in a stack, notice that the dtype of the initially empty ndarray is float! It works also vice-versa: >>> im_copy = PIL.Image.fromarray(a) but this seems to require integer-valued ndarrays as input, except when the ndarray is monochrome. This might be even simpler than the dtype proposed by Christopher. For more info on PIL: http://www.pythonware.com/library/pil/handbook/ Friedrich From scheffer.nicolas at gmail.com Sun Jan 30 14:37:07 2011 From: scheffer.nicolas at gmail.com (Nicolas SCHEFFER) Date: Sun, 30 Jan 2011 11:37:07 -0800 Subject: [Numpy-discussion] Help in speeding up accumulation in a matrix In-Reply-To: <69968386-FCEF-497C-9A4B-39DB368A3906@gmail.com> References: <69968386-FCEF-497C-9A4B-39DB368A3906@gmail.com> Message-ID: Hi all, Thanks for all of the answers, it gives me a lot of new ideas and new functions I didn't know of. @Charles: The reshape way is a great idea! It gives a great alternative to the for loop for your code to be vectorized. I tested it I get %timeit scale_and_add_reshape(R,w,Msr) 1 loops, best of 3: 1.35 s per loop Msr being already reshaped. So better than dot but far from the naive way. @eat: Yes I need these matrices in memory. I'm doing this operation tens of thousands of time using different weights (it's an ML algorithm where w is given by each input example). How would not having them in memory improve the speed? @josef So triu_indices is how you take care of symmetric matrices in Numpy? I read on the mailing lists that some of the implementations of these slicings might be slow though. I didn't have the chance to test that yet, but I sure will. @gregor Ok, I feel stupid ;) Thanks Gregor. I really thought I checked the compatibility of the array.flags before the dot product but it seems I didn't. This is the solution, making sure to be C_contiguous before doing the dot. That gives a 4x improvement over the naive implementation In [17]: %timeit scale_and_add_re(L1,w,Ms) 1 loops, best of 3: 392 ms per loop In [18]: %timeit dot(Ms.T,w) 1 loops, best of 3: 1.81 s per loop In [19]: %timeit dot(MsT,w) 10 loops, best of 3: 86.2 ms per loop That's a hell of a boost! I don't think we could do better than that. 
However it's doing almost twice the work it needs to do (symmetric matrices), so it probably can be done faster. Maybe the underlying BLAS code takes care of that w/o me knowing though. For now, I think I'll be fine with 4x improvement! Thanks much for the help! -n On Sun, Jan 30, 2011 at 8:12 AM, Gregor Thalhammer wrote: > > Am 29.1.2011 um 22:01 schrieb Nicolas SCHEFFER: > >> Hi all, >> >> First email to the list for me, I just want to say how grateful I am >> to have python+numpy+ipython etc... for my day to day needs. Great >> combination of software. >> >> Anyway, I've been having this bottleneck in one my algorithms that has >> been bugging me for quite a while. >> The objective is to speed this part up. I've been doing tons of >> optimization and parallel processing around that piece of code to get >> a decent run time. >> >> The problem is easy. You want to accumulate in a matrix, a weighted >> sum of other matrices. Let's call this function scale_and_add: >> def scale_and_add_re(R,w,Ms): >> ? ?(nb_add,mdim,j)=np.shape(Ms) >> ? ?for i in range(nb_add): >> ? ? ? ?R+=w[i]*Ms[i] >> ? ?return R >> This 'for' loop bugs me since I know this will slow things down. >> >> But the dimension of these things are so large that any attempt to >> vectorize this is slower and takes too much memory. >> I typically work with 1000 weights and matrices, matrices of dimension >> of several hundred. >> >> My current config is: >> In [2]: %timeit scale_and_add_re(R,w,Ms) >> 1 loops, best of 3: 392 ms per loop >> >> In [3]: w.shape >> Out[3]: (2000,) >> >> In [4]: Ms.shape >> Out[4]: (2000, 200, 200) >> I'd like to be able to double these dimensions. >> >> For instance I could use broadcasting by using a dot product >> %timeit dot(Ms.T,w) >> 1 loops, best of 3: 1.77 s per loop >> But this is i) slower ii) takes too much memory >> (btw, I'd really need an inplace dot-product in numpy to avoid the >> copy in memory, like the blas call in scipy.linalg. But that's for an >> other thread...) > > If you use a different memory layout for your data, you can improve your performance with dot: > > MsT = Ms.T.copy() > %timeit np.dot(M,w) > > 10 loops, best of 3: 107 ms per loop > > for comparison: > In [32]: %timeit scale_and_add_re(R,w,Ms) > 1 loops, best of 3: 245 ms per loop > > I don't think you can do much better than this. The above value gives a memory bandwidth for accessing Ms of 6 GB/s, I think the hardware limit for my system is about 10 Gb/s. > > Gregor > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From pav at iki.fi Sun Jan 30 15:11:42 2011 From: pav at iki.fi (Pauli Virtanen) Date: Sun, 30 Jan 2011 20:11:42 +0000 (UTC) Subject: [Numpy-discussion] numpy.linalg.svd documentation References: <4D44C5C9.4060205@creativetrax.com> <4D4582F6.1090309@molden.no> <4D45A17C.1040007@molden.no> Message-ID: On Sun, 30 Jan 2011 18:35:56 +0100, Sturla Molden wrote: > Den 30.01.2011 17:04, skrev Charles R Harris: >> The v.H is the old, incorrect, version of the documentation. The >> current documentation is correct. > > !!! > > Was it just the documentation that was false, or did SVD return v.H > before? The documentation only. Obviously, the behavior has not been changed. 
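A quick interactive check of the convention under discussion (an
illustrative addition, not part of the original exchange): the third value
returned by numpy.linalg.svd is the factor the current docstring calls v,
i.e. it already plays the role of the textbook V.H, so the documented
reconstruction A = dot(u * diag(s), v) holds with no further transpose, and
the inverse of a non-singular real matrix follows from the same factors:

>>> import numpy as np
>>> a = np.diag([1., 2., 3.])
>>> u, s, v = np.linalg.svd(a)
>>> np.allclose(a, np.dot(np.dot(u, np.diag(s)), v))
True
>>> np.allclose(np.linalg.inv(a), np.dot(v.T, np.dot(np.diag(1.0 / s), u.T)))
True

For complex input, v.conj().T and u.conj().T would replace v.T and u.T in
the last line.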
From charlesr.harris at gmail.com Sun Jan 30 15:40:15 2011
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sun, 30 Jan 2011 13:40:15 -0700
Subject: [Numpy-discussion] numpy.linalg.svd documentation
In-Reply-To: <4D45A17C.1040007@molden.no>
References: <4D44C5C9.4060205@creativetrax.com>
	<4D4582F6.1090309@molden.no>
	<4D45A17C.1040007@molden.no>
Message-ID: 

On Sun, Jan 30, 2011 at 10:35 AM, Sturla Molden wrote:
> Den 30.01.2011 17:04, skrev Charles R Harris:
>
> > The v.H is the old, incorrect, version of the documentation. The current
> documentation is correct.
>
>
> !!!
>
> Was it just the documentation that was false, or did SVD return v.H before?
>
>
>
Well, strictly speaking, both documentations say the same thing, but the
old version was somewhat obfuscated. Either svd returns v.H and A =
dot(u*d, v.H) or svd returns v and A = dot(u*d,v). I think the second is a
clearer statement of the return value and the resulting factorization, but
I suppose some may hold a different opinion.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From akabaila at pcug.org.au Sun Jan 30 21:05:56 2011
From: akabaila at pcug.org.au (Algis Kabaila)
Date: Mon, 31 Jan 2011 13:05:56 +1100
Subject: [Numpy-discussion] Inversion of near singular matrices.
In-Reply-To: <20110130162137.GD14858@phare.normalesup.org>
References: <201101292240.05470.akabaila@pcug.org.au>
	<4D458096.1000808@molden.no>
	<20110130162137.GD14858@phare.normalesup.org>
Message-ID: <201101311305.56481.akabaila@pcug.org.au>

>
> And if you are trying to solve a least-squares, I think that
> you should be using a ridge (or Tikhonov) regularisation:
> http://en.wikipedia.org/wiki/Tikhonov_regularization
> read in particular the paragraph above the table of content:
> you are most likely interested in Gamma = alpha identity,
> where you set alpha to be say 1% (or .1%) of the largest
> eigenvalue of A^t A.
>
> Gael

First of all I want to thank all who have contributed to this discussion.
It has been nothing less than inspiring! However, it has drifted to areas
in which I lack expertise and interest.

My interest is in structural analysis of engineering structures. The
structure response is generally characterised by a square matrix with real
elements. Actually, the structural engineer has no interest in trying to
invert a singular matrix. However he/she is interested (or should be
interested :) ) when the square response matrix might approach singularity,
for this would signal instability. He/She knows what the result of
instability would be - a disaster!

It is my fault not to have stated the problem with adequate clarity and I
intend to do that as soon as I can. Thank you again for all your valuable
contributions.

Al.
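On the practical question of spotting a response matrix that is approaching
singularity, a small sketch of the check already suggested in this thread
(numpy.linalg.cond and the singular values). The function name, the use of
K for the response matrix and the threshold are illustrative choices, not
values from the discussion; with its default norm, numpy.linalg.cond(K)
computes this same ratio of largest to smallest singular value:

import numpy as np

def near_singular(K, tol=1.0e12):
    # 2-norm condition number, sigma_max / sigma_min, obtained from the
    # singular values alone; np.linalg.cond(K) gives the same ratio with
    # its default norm.  The cutoff tol is an arbitrary example value.
    s = np.linalg.svd(K, compute_uv=False)
    if s[-1] == 0.0:
        return True          # exactly singular
    return s[0] / s[-1] > tol

# e.g. check the (hypothetical) square response matrix K before solving:
#     if near_singular(K):
#         ...  # warn: the system is close to instability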
-- Algis http://akabaila.pcug.org.au/StructuralAnalysis.pdf From totonixsame at gmail.com Mon Jan 31 06:19:57 2011 From: totonixsame at gmail.com (totonixsame at gmail.com) Date: Mon, 31 Jan 2011 09:19:57 -0200 Subject: [Numpy-discussion] create a numpy array of images In-Reply-To: References: <4D4311A7.20007@noaa.gov> Message-ID: I've been done that but with CT and MRI dicom files, and the cool thing is that with numpy I can do something like this: # getting axial slice axial = slices[n,:,:] # getting coronal slice coronal = slices[:, n, :] # getting sagital slice sagital = slices[:,:, n] On Sun, Jan 30, 2011 at 5:29 PM, Friedrich Romstedt wrote: > 2011/1/28 Christopher Barker : >> On 1/28/11 7:01 AM, Asmi Shah wrote: >>> I am using python for a while now and I have a requirement of creating a >>> numpy array of microscopic tiff images ( this data is 3d, meaning there are >>> 100 z slices of 512 X 512 pixels.) How can I create an array of images? >> >> It's quite straightforward to create a 3-d array to hold this kind of data: >> >> image_block = np.empty((100, 512, 512), dtype=??) >> >> now you can load it up by using some lib (PIL, or ???) to load the tif >> images, and then: >> >> for i in images: >> ? ? image_block[i,:,:] = i > > Notice that since PIL 1.1.6, PIL Image objects support the numpy > interface: http://effbot.org/zone/pil-changes-116.htm > >>>> import PIL.Image >>>> im = PIL.Image.open('P1010102.JPG') >>>> im > >>>> a = numpy.asarray(im) >>>> a.shape > (2448, 3264, 3) >>>> a.dtype > dtype('uint8') > > You can use the image just as any other ndarray: > >>>> stack = numpy.empty((5, 2488, 3264, 3)) >>>> stack[0] = im > and so on > > for 5 images in a stack, notice that the dtype of the initially empty > ndarray is float! > > It works also vice-versa: > >>>> im_copy = PIL.Image.fromarray(a) > > but this seems to require integer-valued ndarrays as input, except > when the ndarray is monochrome. > > This might be even simpler than the dtype proposed by Christopher. > > For more info on PIL: http://www.pythonware.com/library/pil/handbook/ > > Friedrich > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From ndbecker2 at gmail.com Mon Jan 31 07:02:20 2011 From: ndbecker2 at gmail.com (Neal Becker) Date: Mon, 31 Jan 2011 07:02:20 -0500 Subject: [Numpy-discussion] Generator arrays References: <82C3B34D-12D1-4EFD-B83E-61568AFCA692@enthought.com> <87mxmldp8m.fsf@ginnungagap.bsc.es> <1FD06257-33BD-4F8C-88A8-5BBB9A3B16F1@enthought.com> Message-ID: I'm not sure how it applies to this discussion, but I'd just like to mention that a lot of interest (in c++ and d communities) has moved away from using iterators as the fundamental interface to containers and to ranges as the interface. From cimrman3 at ntc.zcu.cz Mon Jan 31 07:39:11 2011 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Mon, 31 Jan 2011 13:39:11 +0100 (CET) Subject: [Numpy-discussion] using loadtxt() for given number of rows? 
Message-ID: Hi, I work with text files which contain several arrays separated by a few lines of other information, for example: POINTS 4 float -5.000000e-01 -5.000000e-01 0.000000e+00 5.000000e-01 -5.000000e-01 0.000000e+00 5.000000e-01 5.000000e-01 0.000000e+00 -5.000000e-01 5.000000e-01 0.000000e+00 CELLS 2 8 3 0 1 2 3 2 3 0 (yes, that's the legacy VTK format, but take it just as an example) I have used custom Python code with loops to read similar files, so the speed was not too good. Now I wonder if it would be possible to use the numpy.loadtxt() function for the "array-like" parts. It supports passing an open file object in, which is good, but it wants to read the entire file, which does not work in this case. It seems to me, that an additional parameter to loadtxt(), say "nrows" or "numrows", would do the job, so that the function does not try reading the entire file. Another possibility would be to raise an exception as it is now, but also to return the data succesfully read so far. What do you think? Is this worth a ticket? r. From DParker at chromalloy.com Mon Jan 31 10:15:59 2011 From: DParker at chromalloy.com (DParker at chromalloy.com) Date: Mon, 31 Jan 2011 10:15:59 -0500 Subject: [Numpy-discussion] Vectorize or rewrite function to work with array inputs? Message-ID: I have several functions like the example below that I would like to make compatible with array inputs. The problem is the conditional statements give a ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all(). I can use numpy.vectorize, but if possible I'd prefer to rewrite the function. Does anyone have any advice the best way to modify the code to accept array inputs? Thanks in advance for any assistance. NAN = float('nan') def air_gamma(t, far=0.0): """ Specific heat ratio (gamma) of Air/JP8 t - static temperature, Rankine [far] - fuel air ratio [- defaults to 0.0 (dry air)] air_gamma - specific heat ratio """ if far < 0.: return NAN elif far < 0.005: if t < 379. or t > 4731.: return NAN else: air_gamma = -3.472487e-22 * t ** 6. + 6.218811e-18 * t ** 5. - 4.428098e-14 * t ** 4. + 1.569889e-10 * t ** 3. - 0.0000002753524 * t ** 2. + 0.0001684666 * t + 1.368652 elif far < 0.069: if t < 699. or t > 4731.: return NAN else: a6 = 4.114808e-20 * far ** 3. - 1.644588e-20 * far ** 2. + 3.103507e-21 * far - 3.391308e-22 a5 = -6.819015e-16 * far ** 3. + 2.773945e-16 * far ** 2. - 5.469399e-17 * far + 6.058125e-18 a4 = 4.684637e-12 * far ** 3. - 1.887227e-12 * far ** 2. + 3.865306e-13 * far - 4.302534e-14 a3 = -0.00000001700602 * far ** 3. + 0.000000006593809 * far ** 2. - 0.000000001392629 * far + 1.520583e-10 a2 = 0.00003431136 * far ** 3. - 0.00001248285 * far ** 2. + 0.000002688007 * far - 0.0000002651616 a1 = -0.03792449 * far ** 3. + 0.01261025 * far ** 2. - 0.002676877 * far + 0.0001580424 a0 = 13.65379 * far ** 3. - 3.311225 * far ** 2. + 0.3573201 * far + 1.372714 air_gamma = a6 * t ** 6. + a5 * t ** 5. + a4 * t ** 4. + a3 * t ** 3. + a2 * t ** 2. + a1 * t + a0 elif far >= 0.069: return NAN else: return NAN return air_gamma David Parker Chromalloy - TDAG -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Mon Jan 31 10:24:03 2011 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 31 Jan 2011 16:24:03 +0100 Subject: [Numpy-discussion] Vectorize or rewrite function to work with array inputs? 
In-Reply-To: References: Message-ID: <1296487443.2529.4.camel@sebastian> Hello, On Mon, 2011-01-31 at 10:15 -0500, DParker at chromalloy.com wrote: > I have several functions like the example below that I would like to > make compatible with array inputs. The problem is the conditional > statements give a ValueError: The truth value of an array with more > than one element is ambiguous. Use a.any() or a.all(). I can use > numpy.vectorize, but if possible I'd prefer to rewrite the function. > Does anyone have any advice the best way to modify the code to accept > array inputs? Thanks in advance for any assistance. > You can use binary indexing instead of the if: condition = far < 0 values[condition] = np.nan # set to NaN wherever far < 0 is True. or if you like I suppose you could put it into cython, add some typing to avoid creating those binary arrays etc. all over to speed things up more. Regards, Sebastian > > NAN = float('nan') > > def air_gamma(t, far=0.0): > """ > Specific heat ratio (gamma) of Air/JP8 > t - static temperature, Rankine > [far] - fuel air ratio [- defaults to 0.0 (dry air)] > air_gamma - specific heat ratio > """ > if far < 0.: > return NAN > elif far < 0.005: > if t < 379. or t > 4731.: > return NAN > else: > air_gamma = -3.472487e-22 * t ** 6. + 6.218811e-18 * t ** > 5. - 4.428098e-14 * t ** 4. + 1.569889e-10 * t ** 3. - 0.0000002753524 > * t ** 2. + 0.0001684666 * t + 1.368652 > elif far < 0.069: > if t < 699. or t > 4731.: > return NAN > else: > a6 = 4.114808e-20 * far ** 3. - 1.644588e-20 * far ** 2. + > 3.103507e-21 * far - 3.391308e-22 > a5 = -6.819015e-16 * far ** 3. + 2.773945e-16 * far ** 2. > - 5.469399e-17 * far + 6.058125e-18 > a4 = 4.684637e-12 * far ** 3. - 1.887227e-12 * far ** 2. + > 3.865306e-13 * far - 4.302534e-14 > a3 = -0.00000001700602 * far ** 3. + 0.000000006593809 * > far ** 2. - 0.000000001392629 * far + 1.520583e-10 > a2 = 0.00003431136 * far ** 3. - 0.00001248285 * far ** 2. > + 0.000002688007 * far - 0.0000002651616 > a1 = -0.03792449 * far ** 3. + 0.01261025 * far ** 2. - > 0.002676877 * far + 0.0001580424 > a0 = 13.65379 * far ** 3. - 3.311225 * far ** 2. + > 0.3573201 * far + 1.372714 > air_gamma = a6 * t ** 6. + a5 * t ** 5. + a4 * t ** 4. + > a3 * t ** 3. + a2 * t ** 2. + a1 * t + a0 > elif far >= 0.069: > return NAN > else: > return NAN > return air_gamma > > David Parker > Chromalloy - TDAG > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From sturla at molden.no Mon Jan 31 11:27:22 2011 From: sturla at molden.no (Sturla Molden) Date: Mon, 31 Jan 2011 17:27:22 +0100 Subject: [Numpy-discussion] Inversion of near singular matrices. In-Reply-To: <201101311305.56481.akabaila@pcug.org.au> References: <201101292240.05470.akabaila@pcug.org.au> <4D458096.1000808@molden.no> <20110130162137.GD14858@phare.normalesup.org> <201101311305.56481.akabaila@pcug.org.au> Message-ID: <4D46E2EA.7080004@molden.no> Den 31.01.2011 03:05, skrev Algis Kabaila: > Actually, the structural engineer > has no interest in trying to invert a singular matrix. However > he/she is interested (or should be interested :) ) when the > square response matrix might approach singularity for this would > signal instability. I am sorry for having confused the issue by mentioning statistics. The mathematics (linear algebra) is of course the same. A singular matrix cannot be inverted by definition. 
The methods mentioned (SVD, Tikohonov regularization), as well as the transforms mentioned by Paul, will let you avoid numerical instability when matrices "approach singularity" (i.e. are very ill-conditioned). OT: I think I know what structural engineering is. Back in 1994 I had to take a class in "statikk" (not sure what that translates to in English), with a textbook by Fritjof Irgens. From what I remember we did vector calculus to ensure the forces in a construction summed to 0, so that Newton's first law of motion would apply. It's unhealthy to be inside a building otherwise ;-) Sturla Molden From e.antero.tammi at gmail.com Mon Jan 31 11:38:10 2011 From: e.antero.tammi at gmail.com (eat) Date: Mon, 31 Jan 2011 18:38:10 +0200 Subject: [Numpy-discussion] Vectorize or rewrite function to work with array inputs? In-Reply-To: References: Message-ID: Hi, On Mon, Jan 31, 2011 at 5:15 PM, wrote: > I have several functions like the example below that I would like to make > compatible with array inputs. The problem is the conditional statements give > a *ValueError: The truth value of an array with more than one element is > ambiguous. Use a.any() or a.all()*. I can use numpy.vectorize, but if > possible I'd prefer to rewrite the function. Does anyone have any advice the > best way to modify the code to accept array inputs? Thanks in advance for > any assistance. If I understod your question correctly, then air_gamma could be coded as: def air_gamma_0(t, far=0.0): """ Specific heat ratio (gamma) of Air/JP8 t - static temperature, Rankine [far] - fuel air ratio [- defaults to 0.0 (dry air)] air_gamma - specific heat ratio """ if far< 0.: return NAN elif far < 0.005: ag= air_gamma_1(t) ag[np.logical_or(t< 379., t> 4731.)]= NAN return ag elif far< 0.069: ag= air_gamma_2(t, far) ag[np.logical_or(t< 699., t> 4731.)]= NAN return ag else: return NAN Rest of the code is in the attachment. My two cents, eat > > > > NAN = float('nan') > > def air_gamma(t, far=0.0): > """ > Specific heat ratio (gamma) of Air/JP8 > t - static temperature, Rankine > [far] - fuel air ratio [- defaults to 0.0 (dry air)] > air_gamma - specific heat ratio > """ > if far < 0.: > return NAN > elif far < 0.005: > if t < 379. or t > 4731.: > return NAN > else: > air_gamma = -3.472487e-22 * t ** 6. + 6.218811e-18 * t ** 5. - > 4.428098e-14 * t ** 4. + 1.569889e-10 * t ** 3. - 0.0000002753524 * t ** 2. > + 0.0001684666 * t + 1.368652 > elif far < 0.069: > if t < 699. or t > 4731.: > return NAN > else: > a6 = 4.114808e-20 * far ** 3. - 1.644588e-20 * far ** 2. + > 3.103507e-21 * far - 3.391308e-22 > a5 = -6.819015e-16 * far ** 3. + 2.773945e-16 * far ** 2. - > 5.469399e-17 * far + 6.058125e-18 > a4 = 4.684637e-12 * far ** 3. - 1.887227e-12 * far ** 2. + > 3.865306e-13 * far - 4.302534e-14 > a3 = -0.00000001700602 * far ** 3. + 0.000000006593809 * far ** > 2. - 0.000000001392629 * far + 1.520583e-10 > a2 = 0.00003431136 * far ** 3. - 0.00001248285 * far ** 2. + > 0.000002688007 * far - 0.0000002651616 > a1 = -0.03792449 * far ** 3. + 0.01261025 * far ** 2. - > 0.002676877 * far + 0.0001580424 > a0 = 13.65379 * far ** 3. - 3.311225 * far ** 2. + 0.3573201 * > far + 1.372714 > air_gamma = a6 * t ** 6. + a5 * t ** 5. + a4 * t ** 4. + a3 * t > ** 3. + a2 * t ** 2. 
+ a1 * t + a0 > elif far >= 0.069: > return NAN > else: > return NAN > return air_gamma > > David Parker > Chromalloy - TDAG > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: air_gamma.py Type: application/octet-stream Size: 3676 bytes Desc: not available URL: From sturla at molden.no Mon Jan 31 12:35:11 2011 From: sturla at molden.no (Sturla Molden) Date: Mon, 31 Jan 2011 18:35:11 +0100 Subject: [Numpy-discussion] numpy.linalg.svd documentation In-Reply-To: References: <4D44C5C9.4060205@creativetrax.com> <4D4582F6.1090309@molden.no> <4D45A17C.1040007@molden.no> Message-ID: <4D46F2CF.90101@molden.no> Den 30.01.2011 21:40, skrev Charles R Harris: > Well, strictly speaking, both documentations say the same thing, but > the old version was somewhat obfuscated. Either svd returns v.H and A > = dot(u*d, v.H) or svd returns v and A = dot(u*d,v). I think the > second is a clearer statement of the return value and the resulting > factorization, but I suppose some may hold a different opinion. I agree that it is a clearer statement, since there is a difference between A = dot(u*d, v.H) and A = dot(u*d, vH), and we actually have the latter. Still, the common definition of SVD is u * s * v.H = x and not u * s * v = x This might be a bit confusing for those expecting the conjugate transpose of v in the decomposition. Be aware that Matlab's SVD is [u,s,v] = svd(x) so that u*s*v' = x. Clearly Matlab and NumPy differ on the definition of v here, with Matlab following the common convention. That is why I prefer the old notation u, s, vH = np.linalg.svd(x) v = vH.H as it leaves no room for confusion. As long as the behaviour of SVD has not changed, none of my SVD code will break. That was what worried me most :-) Sturla From akabaila at pcug.org.au Mon Jan 31 15:27:59 2011 From: akabaila at pcug.org.au (Algis Kabaila) Date: Tue, 1 Feb 2011 07:27:59 +1100 Subject: [Numpy-discussion] Inversion of near singular matrices. In-Reply-To: <4D46E2EA.7080004@molden.no> References: <201101292240.05470.akabaila@pcug.org.au> <201101311305.56481.akabaila@pcug.org.au> <4D46E2EA.7080004@molden.no> Message-ID: <201102010728.00282.akabaila@pcug.org.au> On Tuesday 01 February 2011 03:27:22 Sturla Molden wrote: > Den 31.01.2011 03:05, skrev Algis Kabaila: > > Actually, the structural engineer > > has no interest in trying to invert a singular matrix. > > However he/she is interested (or should be interested :) > > ) when the square response matrix might approach > > singularity for this would signal instability. > > I am sorry for having confused the issue by mentioning > statistics. The mathematics (linear algebra) is of course > the same. A singular matrix cannot be inverted by > definition. The methods mentioned (SVD, Tikohonov > regularization), as well as the transforms mentioned by > Paul, will let you avoid numerical instability when matrices > "approach singularity" (i.e. are very ill-conditioned). > > OT: I think I know what structural engineering is. Back in > 1994 I had to take a class in "statikk" (not sure what that > translates to in English), with a textbook by Fritjof > Irgens. From what I remember we did vector calculus to > ensure the forces in a construction summed to 0, so that > Newton's first law of motion would apply. 
It's unhealthy to > be inside a building otherwise ;-) > > Sturla Molden > I would guess that "statikk" is statics, the subject of conditions of equilibrium. Yes, teaching is not for the faint hearted... Particularly in "foreign" areas. Just to put your mind at ese - it is important to have some idea of statistics even in simplest engineering structures, such as those made up of statically determinate trusses. (A truss is made up of members that are pin jointed, or are imagined to be pin jointed. Because of the pin joints, each member can only be subjected to an axial force. My next code snippet will show the vagaries of analisis of statically determinate trusses). Before I can really ask my next question, I should know what matrix norms are used for the calculation of matrix condition number in numpy.linalg. You see, I tried to compare it with a condition number found in an undergraduate text book and got a totally different number. So if you know that and are able to explain it in simple terms so that even engineers can understand it, it will be greatly appreciated. Al. -- Algis http://akabaila.pcug.org.au/StructuralAnalysis.pdf From zachary.pincus at yale.edu Mon Jan 31 15:55:05 2011 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Mon, 31 Jan 2011 12:55:05 -0800 Subject: [Numpy-discussion] create a numpy array of images In-Reply-To: References: <4D4311A7.20007@noaa.gov> Message-ID: <0810160E-E830-4FA0-8129-FF31AF960DED@yale.edu> >>> I am using python for a while now and I have a requirement of >>> creating a >>> numpy array of microscopic tiff images ( this data is 3d, meaning >>> there are >>> 100 z slices of 512 X 512 pixels.) How can I create an array of >>> images? >> >> It's quite straightforward to create a 3-d array to hold this kind >> of data: >> >> image_block = np.empty((100, 512, 512), dtype=??) >> >> now you can load it up by using some lib (PIL, or ???) to load the >> tif >> images, and then: >> >> for i in images: >> image_block[i,:,:] = i > > Notice that since PIL 1.1.6, PIL Image objects support the numpy > interface: http://effbot.org/zone/pil-changes-116.htm For even longer than this, PIL has been somewhat broken with regard to 16-bit images (very common in microscopy); you may run into strange byte-ordering issues that scramble the data on reading or writing. Also, PIL's numpy interface is somewhat broken in similar ways. (Numerous people have provided patches to PIL, but these are never incorporated into any releases, as far as I can tell.) So try PIL, but if the images come out all wrong, you might want to check out the scikits.image package, which has hooks for various other image read/write tools. Zach From Chris.Barker at noaa.gov Mon Jan 31 18:13:31 2011 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Mon, 31 Jan 2011 15:13:31 -0800 Subject: [Numpy-discussion] using loadtxt() for given number of rows? In-Reply-To: References: Message-ID: <4D47421B.8070501@noaa.gov> On 1/31/11 4:39 AM, Robert Cimrman wrote: > I work with text files which contain several arrays separated by a few > lines of other information, for example: > > POINTS 4 float > -5.000000e-01 -5.000000e-01 0.000000e+00 > 5.000000e-01 -5.000000e-01 0.000000e+00 > 5.000000e-01 5.000000e-01 0.000000e+00 > -5.000000e-01 5.000000e-01 0.000000e+00 > > CELLS 2 8 > 3 0 1 2 > 3 2 3 0 > I have used custom Python code with loops to read similar files, so the > speed was not too good. Now I wonder if it would be possible to use the > numpy.loadtxt() function for the "array-like" parts. 
It supports passing > an open file object in, which is good, but it wants to read the entire > file, which does not work in this case. > > It seems to me, that an additional parameter to loadtxt(), say "nrows" or > "numrows", would do the job, I agree that that would be a useful feature. However, I'm not sure it would help performance much -- I think loadtxt is written in python as well. One option in the meantime. If you know how many rows, you presumable know how many items on each row. IN that case, you can use: np.fromfile(open_file, sep=' ', count=num_items_to_read) It'll only work for multi-line text if the separator is whitespace, which it was in your example. But if it does, it should be pretty fast. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From friedrichromstedt at gmail.com Mon Jan 31 21:32:29 2011 From: friedrichromstedt at gmail.com (Friedrich Romstedt) Date: Tue, 1 Feb 2011 03:32:29 +0100 Subject: [Numpy-discussion] Strange behaviour of numpy.asarray() in corner case In-Reply-To: References: Message-ID: 2011/1/28 Friedrich Romstedt : >>>> numpy.asarray([X(), numpy.asarray([1, 1])]).shape > (2,) >>>> numpy.asarray([numpy.asarray([1, 1]), X()]).shape > () Does noone have an opinion about this? Shall I file a ticket? Friedrich From warren.weckesser at enthought.com Mon Jan 31 22:04:48 2011 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Mon, 31 Jan 2011 21:04:48 -0600 Subject: [Numpy-discussion] Strange behaviour of numpy.asarray() in corner case In-Reply-To: References: Message-ID: On Mon, Jan 31, 2011 at 8:32 PM, Friedrich Romstedt < friedrichromstedt at gmail.com> wrote: > 2011/1/28 Friedrich Romstedt : > >>>> numpy.asarray([X(), numpy.asarray([1, 1])]).shape > > (2,) > >>>> numpy.asarray([numpy.asarray([1, 1]), X()]).shape > > () > > Does noone have an opinion about this? Looks wrong to me. Shall I file a ticket? > Yes. Warren > > Friedrich > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From friedrichromstedt at gmail.com Mon Jan 31 23:44:13 2011 From: friedrichromstedt at gmail.com (Friedrich Romstedt) Date: Tue, 1 Feb 2011 05:44:13 +0100 Subject: [Numpy-discussion] Strange behaviour of numpy.asarray() in corner case In-Reply-To: References: Message-ID: 2011/2/1 Warren Weckesser : >> ?Shall I file a ticket? > > Yes. Ok, #1730: http://projects.scipy.org/numpy/ticket/1730. Thanks, Friedrich
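Returning to the loadtxt question earlier in this digest: a minimal sketch
of the numpy.fromfile approach Christopher Barker describes for reading a
known number of whitespace-separated rows from an already-open file. The
helper name and signature are illustrative (NumPy has no such function),
and it assumes the non-array header lines are consumed from the same file
handle, e.g. with fh.readline():

import numpy as np

def read_rows(fh, nrows, ncols):
    # Read nrows * ncols whitespace-separated numbers starting at the
    # current position of fh, then reshape into a 2-D array.  This works
    # for blocks like the POINTS section of the VTK example because the
    # values are separated only by spaces and newlines.
    data = np.fromfile(fh, sep=' ', count=nrows * ncols)
    return data.reshape(nrows, ncols)

# e.g. after fh.readline() has returned "POINTS 4 float":
#     points = read_rows(fh, 4, 3)    # -> array of shape (4, 3)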