From cournape at gmail.com Sat Aug 1 00:14:30 2009 From: cournape at gmail.com (David Cournapeau) Date: Sat, 1 Aug 2009 13:14:30 +0900 Subject: [Numpy-discussion] ** On entry to ILAENV parameter number 2 had an illegal value In-Reply-To: <4A72D363.8090603@ar.media.kyoto-u.ac.jp> References: <4A72A656.7040802@ar.media.kyoto-u.ac.jp> <1249030460.336856@nntpgw.ncl.ac.uk> <4A72B40D.7080004@ar.media.kyoto-u.ac.jp> <1249033198.686107@nntpgw.ncl.ac.uk> <1249037111.723816@nntpgw.ncl.ac.uk> <4A72CD4E.7000606@ar.media.kyoto-u.ac.jp> <4A72D363.8090603@ar.media.kyoto-u.ac.jp> Message-ID: <5b8d13220907312114h7106bb8fi1cedc192e980ec2f@mail.gmail.com> On Fri, Jul 31, 2009 at 8:20 PM, David Cournapeau wrote: > Steven Coutts wrote: >> David Cournapeau ar.media.kyoto-u.ac.jp> writes: >> >> If you are willing to do >> >>> it, I would also be interested whether numpy works ok if linked against >>> BLAS/LAPACK instead of atlas (i.e. build numpy, again from scratch, with >>> ATLAS=None python setup.py build, and then run the test suite). >>> >>> >> >> Yes that appears to work fine, all tests run. >> > > So that's a problem with ATLAS. Maybe a gcc bug ? Another user contacted > me privately for my rpm repository, and got exactly the same problem > with the rpms, on CentOS 5.3 as well. I will try to look at it on a > CentOS VM if I have time this WE, Ok, I have installed CentOS 5.3 on my machine (kudos to vmware fusion which installs the OS automatically), and built numpy 1.3.0 with atlas 3.8.3 + lapack 3.1.1 on 64 bits. But I could not reproduce the bug, unfortunately. Are you using the threaded atlas ? cheers, David From cournape at gmail.com Sat Aug 1 00:15:51 2009 From: cournape at gmail.com (David Cournapeau) Date: Sat, 1 Aug 2009 13:15:51 +0900 Subject: [Numpy-discussion] ** On entry to ILAENV parameter number 2 had an illegal value In-Reply-To: <5b8d13220907312114h7106bb8fi1cedc192e980ec2f@mail.gmail.com> References: <4A72A656.7040802@ar.media.kyoto-u.ac.jp> <1249030460.336856@nntpgw.ncl.ac.uk> <4A72B40D.7080004@ar.media.kyoto-u.ac.jp> <1249033198.686107@nntpgw.ncl.ac.uk> <1249037111.723816@nntpgw.ncl.ac.uk> <4A72CD4E.7000606@ar.media.kyoto-u.ac.jp> <4A72D363.8090603@ar.media.kyoto-u.ac.jp> <5b8d13220907312114h7106bb8fi1cedc192e980ec2f@mail.gmail.com> Message-ID: <5b8d13220907312115y7af850afva4659559a84f15b8@mail.gmail.com> On Sat, Aug 1, 2009 at 1:14 PM, David Cournapeau wrote: > On Fri, Jul 31, 2009 at 8:20 PM, David > Cournapeau wrote: >> Steven Coutts wrote: >>> David Cournapeau ar.media.kyoto-u.ac.jp> writes: >>> >>> If you are willing to do >>> >>>> it, I would also be interested whether numpy works ok if linked against >>>> BLAS/LAPACK instead of atlas (i.e. build numpy, again from scratch, with >>>> ATLAS=None python setup.py build, and then run the test suite). >>>> >>>> >>> >>> Yes that appears to work fine, all tests run. >>> >> >> So that's a problem with ATLAS. Maybe a gcc bug ? Another user contacted >> me privately for my rpm repository, and got exactly the same problem >> with the rpms, on CentOS 5.3 as well. I will try to look at it on a >> CentOS VM if I have time this WE, > > Ok, I have installed CentOS 5.3 on my machine (kudos to vmware fusion > which installs the OS automatically), and built numpy 1.3.0 with atlas > 3.8.3 + lapack 3.1.1 on 64 bits. But I could not reproduce the bug, > unfortunately. Are you using the threaded atlas ? 
I forgot: another thing which would be helpful, since you can reproduce the bug, would be to build a debug version of numpy (python setup.py build_ext -g), and reproduce the bug under gdb to get a traceback. David From scott.sinclair.za at gmail.com Sat Aug 1 05:16:08 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Sat, 1 Aug 2009 11:16:08 +0200 Subject: [Numpy-discussion] Doc-editor internal error Message-ID: <6a17e9ee0908010216y411b2f0eve2dfe74f5ee68553@mail.gmail.com> Hi, I'm seeing "500 Internal Error" at http://docs.scipy.org/numpy/stats/ Cheers, Scott From scott.sinclair.za at gmail.com Sat Aug 1 05:33:06 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Sat, 1 Aug 2009 11:33:06 +0200 Subject: [Numpy-discussion] Doc-editor internal error In-Reply-To: <6a17e9ee0908010216y411b2f0eve2dfe74f5ee68553@mail.gmail.com> References: <6a17e9ee0908010216y411b2f0eve2dfe74f5ee68553@mail.gmail.com> Message-ID: <6a17e9ee0908010233m58e67a1ejef88333666d3c517@mail.gmail.com> Ignore the noise. Seems to be fixed now. 2009/8/1 Scott Sinclair : > Hi, > > I'm seeing "500 Internal Error" at http://docs.scipy.org/numpy/stats/ > > Cheers, > Scott > From gav451 at gmail.com Sun Aug 2 11:14:11 2009 From: gav451 at gmail.com (Gerard Vermeulen) Date: Sun, 2 Aug 2009 17:14:11 +0200 Subject: [Numpy-discussion] PyQwt-5.2.0 released Message-ID: <20090802171411.41ec88ad@jupiter.rozan.fr> What is PyQwt ( http://pyqwt.sourceforge.net ) ? - it is a set of Python bindings for the Qwt C++ class library which extends the Qt framework with widgets for scientific and engineering applications. It provides a 2-dimensional plotting widget and various widgets to display and control bounded or unbounded floating point values. - it requires and extends PyQt, a set of Python bindings for Qt. - it supports the use of PyQt, Qt, Qwt, and optionally NumPy or SciPy in a GUI Python application or in an interactive Python session. - it runs on POSIX, Mac OS X and Windows platforms (practically any platform supported by Qt and Python). - it plots fast: fairly good hardware allows a rate of 100,000 points/second. (PyQwt with Qt-3 is faster than with Qt-4). - it is licensed under the GPL with an exception to allow dynamic linking with non-free releases of Qt and PyQt. The most important new features of PyQwt v5.2.0 are: - support for Qwt v5.2.0 - support for PyQt4 up to v4.5.4, PyQt3 up to v3.18.1, and SIP up to v4.8.2. - switch to documentation generated by Sphinx. - provide a normal qwt plugin for the pyuic4 user interface compiler instead of the abnormal qwt plugin included in PyQt. The most important bug fixes in PyQwt-5.2.0 are: - fixed crashes in the QImage-array conversion functions. - fixed three transfer-of-ownership bugs. PyQwt-5.2.0 supports: 1. Python v2.6.x and v2.5.x. 2. PyQt v3.18.1 down to v3.17.5. 3. PyQt v4.5.x, v4.4.x. 4. SIP v4.8.x down to v4.7.3. 5. Qt v3.3.x. 6. Qt v4.5.x, v4.4.x, and v4.3.x. 7. Qwt v5.2.x, v5.1.x, and v5.0.x. 8. Recent versions of NumPy, numarray, and/or Numeric. Enjoy -- Gerard Vermeulen From dwf at cs.toronto.edu Mon Aug 3 02:12:29 2009 From: dwf at cs.toronto.edu (David Warde-Farley) Date: Mon, 3 Aug 2009 02:12:29 -0400 Subject: [Numpy-discussion] Differences Between Arrays and Matrices in Numpy In-Reply-To: References: Message-ID: <209B50BC-3867-4A38-B429-E4F3571B69D2@cs.toronto.edu> On 30-Jul-09, at 1:14 PM, Nanime Puloski wrote: > What are some differences between arrays and matrices using the Numpy > library? 
When would one want to use arrays instead of matrices and > vice versa? This is answered in the online documentation in several places: http://preview.tinyurl.com/n6of54 http://docs.scipy.org/doc/numpy/reference/arrays.classes.html#matrix-objects Regards, David From stevec at couttsnet.com Mon Aug 3 04:30:26 2009 From: stevec at couttsnet.com (Steven Coutts) Date: Mon, 3 Aug 2009 08:30:26 +0000 (UTC) Subject: [Numpy-discussion] ** On entry to ILAENV parameter number 2 had an illegal value References: <4A72A656.7040802@ar.media.kyoto-u.ac.jp> <1249030460.336856@nntpgw.ncl.ac.uk> <4A72B40D.7080004@ar.media.kyoto-u.ac.jp> <1249033198.686107@nntpgw.ncl.ac.uk> <1249037111.723816@nntpgw.ncl.ac.uk> <4A72CD4E.7000606@ar.media.kyoto-u.ac.jp> <4A72D363.8090603@ar.media.kyoto-u.ac.jp> <5b8d13220907312114h7106bb8fi1cedc192e980ec2f@mail.gmail.com> <5b8d13220907312115y7af850afva4659559a84f15b8@mail.gmail.com> Message-ID: David Cournapeau gmail.com> writes: > > I forgot: another thing which would be helpful since you can reproduce > the bug would be to build a debug version of numpy (python setup.py > build_ext -g), and reproduce the bug under gdb to have a traceback. > > David Ok I have rebuilt numpy-1.3.0 with debugging, and it segfaults as soon as I import numpy in python2.5 Backtrace: http://pastebin.com/d27fbd2a5 Regards From stevec at couttsnet.com Mon Aug 3 04:35:31 2009 From: stevec at couttsnet.com (Steven Coutts) Date: Mon, 3 Aug 2009 08:35:31 +0000 (UTC) Subject: [Numpy-discussion] ** On entry to ILAENV parameter number 2 had an illegal value References: <4A72A656.7040802@ar.media.kyoto-u.ac.jp> <1249030460.336856@nntpgw.ncl.ac.uk> <4A72B40D.7080004@ar.media.kyoto-u.ac.jp> <1249033198.686107@nntpgw.ncl.ac.uk> <1249037111.723816@nntpgw.ncl.ac.uk> <4A72CD4E.7000606@ar.media.kyoto-u.ac.jp> <4A72D363.8090603@ar.media.kyoto-u.ac.jp> <5b8d13220907312114h7106bb8fi1cedc192e980ec2f@mail.gmail.com> <5b8d13220907312115y7af850afva4659559a84f15b8@mail.gmail.com> Message-ID: Steven Coutts couttsnet.com> writes: > > Ok I have rebuilt numpy-1.3.0 with debugging, and it segfaults as soon as I > import numpy in python2.5 > > Backtrace: > http://pastebin.com/d27fbd2a5 > > Regards > Sorry, ignore this, I cleaned out numpy properly, re-installed 1.3.0 and the tests are all running now. Regards From david at ar.media.kyoto-u.ac.jp Mon Aug 3 04:26:58 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Mon, 03 Aug 2009 17:26:58 +0900 Subject: [Numpy-discussion] ** On entry to ILAENV parameter number 2 had an illegal value In-Reply-To: References: <4A72A656.7040802@ar.media.kyoto-u.ac.jp> <1249030460.336856@nntpgw.ncl.ac.uk> <4A72B40D.7080004@ar.media.kyoto-u.ac.jp> <1249033198.686107@nntpgw.ncl.ac.uk> <1249037111.723816@nntpgw.ncl.ac.uk> <4A72CD4E.7000606@ar.media.kyoto-u.ac.jp> <4A72D363.8090603@ar.media.kyoto-u.ac.jp> <5b8d13220907312114h7106bb8fi1cedc192e980ec2f@mail.gmail.com> <5b8d13220907312115y7af850afva4659559a84f15b8@mail.gmail.com> Message-ID: <4A769F52.7000705@ar.media.kyoto-u.ac.jp> Steven Coutts wrote: > > > Sorry, ignore this, I cleaned out numpy properly, re-installed 1.3.0 and the > tests are all running now. > Do you mean that if you build with debug information, everything else being equal, you cannot reproduce the crashes ? 
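(For reference, the gdb procedure I had in mind goes something along these lines -- the exact test invocation is only an illustration:

$ python setup.py build_ext -g
$ python setup.py install
$ gdb python
(gdb) run -c "import numpy; numpy.test()"
... wait for the segfault ...
(gdb) bt

The bt output is what would give us a usable traceback.)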
cheers, David From stevec at couttsnet.com Mon Aug 3 05:32:03 2009 From: stevec at couttsnet.com (Steven Coutts) Date: Mon, 03 Aug 2009 10:32:03 +0100 Subject: [Numpy-discussion] ** On entry to ILAENV parameter number 2 had an illegal value References: <4A72A656.7040802@ar.media.kyoto-u.ac.jp> <1249030460.336856@nntpgw.ncl.ac.uk> <4A72B40D.7080004@ar.media.kyoto-u.ac.jp> <1249033198.686107@nntpgw.ncl.ac.uk> <1249037111.723816@nntpgw.ncl.ac.uk> <4A72CD4E.7000606@ar.media.kyoto-u.ac.jp> <4A72D363.8090603@ar.media.kyoto-u.ac.jp> <5b8d13220907312114h7106bb8fi1cedc192e980ec2f@mail.gmail.com> <5b8d13220907312115y7af850afva4659559a84f15b8@mail.gmail.com> <4A769F52.7000705@ar.media.kyoto-u.ac.jp> Message-ID: <1249291923.686115@nntpgw.ncl.ac.uk> David Cournapeau wrote: > > Do you mean that if you build with debug information, everything else > being equal, you cannot reproduce the crashes ? > > cheers, > > David That does appear to be the case; SciPy 0.7.0 is now also running fine. Regards From cournape at gmail.com Mon Aug 3 09:25:18 2009 From: cournape at gmail.com (David Cournapeau) Date: Mon, 3 Aug 2009 22:25:18 +0900 Subject: [Numpy-discussion] ** On entry to ILAENV parameter number 2 had an illegal value In-Reply-To: <1249291923.686115@nntpgw.ncl.ac.uk> References: <4A72CD4E.7000606@ar.media.kyoto-u.ac.jp> <4A72D363.8090603@ar.media.kyoto-u.ac.jp> <5b8d13220907312114h7106bb8fi1cedc192e980ec2f@mail.gmail.com> <5b8d13220907312115y7af850afva4659559a84f15b8@mail.gmail.com> <4A769F52.7000705@ar.media.kyoto-u.ac.jp> <1249291923.686115@nntpgw.ncl.ac.uk> Message-ID: <5b8d13220908030625j55a77a3ak6d75582bc734971f@mail.gmail.com> On Mon, Aug 3, 2009 at 6:32 PM, Steven Coutts wrote: > David Cournapeau wrote: > >> >> Do you mean that if you build with debug information, everything else >> being equal, you cannot reproduce the crashes ? >> >> cheers, >> >> David > > That does appear to be the case; SciPy 0.7.0 is now also running fine. It is just getting weirder - the fact that numpy worked with bare BLAS/LAPACK and crashed with atlas led me to think that it was an atlas problem. But now, this smells more like a compiler problem. I would first really check that the only difference between crash vs. no crash is debug vs non debug (both with ATLAS), to avoid chasing wrong hints. Practically, I would advise you to "clone" the numpy sources (one debug, one non debug), and build from scratch with a script to do things in a repeatable manner. Then, if indeed you get a crash only with the non-debug build, I would recommend installing my project numscons, to be able to "play" with flags, and first try building with the exact same flags as a normal build, but adding the -g flag. For example: CFLAGS="-O2 -fno-strict-aliasing -g" python setupscons.py install --prefix=blabla Hopefully, you will be able to reproduce the crash and get a backtrace, cheers, David From afriedle at indiana.edu Mon Aug 3 09:32:57 2009 From: afriedle at indiana.edu (Andrew Friedley) Date: Mon, 03 Aug 2009 09:32:57 -0400 Subject: [Numpy-discussion] strange sin/cos performance Message-ID: <4A76E709.9090100@indiana.edu> While working on GSoC stuff I came across this weird performance behavior for sine and cosine -- using float32 is way slower than float64. On a 2ghz opteron: sin float32 1.12447786331 sin float64 0.133481025696 cos float32 1.14155912399 cos float64 0.131420135498 The times are in seconds, and are best of three runs of ten iterations of numpy.{sin,cos} over a 1000-element array (script attached). 
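For reference, the core of the attached script is just this pattern, repeated for each dtype and function:

import timeit
t = timeit.Timer("numpy.sin(a)",
                 "import numpy\n"
                 "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float32)")
print "sin float32", min(t.repeat(3, 10))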
I've produced similar results on a PS3 system also. The opteron is running Python 2.6.1 and NumPy 1.3.0, while the PS3 has Python 2.5.1 and NumPy 1.1.1. I haven't jumped into the code yet, but does anyone know why sin/cos are ~8.5x slower for 32-bit floats compared to 64-bit doubles? Side question: I see people in emails writing things like 'timeit foo(x)' and having it run some sort of standard benchmark, how exactly do I do that? Is that some environment other than a normal Python? Thanks, Andrew -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: cos.py URL: From afriedle at indiana.edu Mon Aug 3 09:38:46 2009 From: afriedle at indiana.edu (Andrew Friedley) Date: Mon, 03 Aug 2009 09:38:46 -0400 Subject: [Numpy-discussion] Add/multiply reduction confusion In-Reply-To: <20090705185700.GA8888@phare.normalesup.org> References: <4A48D2DB.10508@indiana.edu> <4A50CAAE.5060400@indiana.edu> <9457e7c80907050937u305f3508o1829a00f28d9f8f5@mail.gmail.com> <4A50F536.6040200@indiana.edu> <20090705185700.GA8888@phare.normalesup.org> Message-ID: <4A76E866.90003@indiana.edu> Gael Varoquaux wrote: > On Sun, Jul 05, 2009 at 02:47:18PM -0400, Andrew Friedley wrote: >> Stéfan van der Walt wrote: >>> 2009/7/5 Andrew Friedley : >>>> I found the check that does the type 'upcasting' in >>>> umath_ufunc_object.inc around line 3072 (NumPy 1.3.0). Turns out all I >>>> need to do is make sure my add and multiply ufuncs are actually named >>>> 'add' and 'multiply' and arrays will be upcasted appropriately. > >>> Would you please be so kind as to add your findings here: > >>> http://docs.scipy.org/numpy/docs/numpy-docs/reference/index.rst/#reference-index > >>> I haven't read through that document recently, so it may be in there already. > >> I created an account (afriedle) but looks like I don't have edit >> permissions. > > I have added you to the Editor list. Thanks and sorry about the delay; I went and added the comment I proposed. Andrew From cournape at gmail.com Mon Aug 3 09:44:28 2009 From: cournape at gmail.com (David Cournapeau) Date: Mon, 3 Aug 2009 22:44:28 +0900 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <4A76E709.9090100@indiana.edu> References: <4A76E709.9090100@indiana.edu> Message-ID: <5b8d13220908030644v55366931hffbc80267d998667@mail.gmail.com> On Mon, Aug 3, 2009 at 10:32 PM, Andrew Friedley wrote: > While working on GSoC stuff I came across this weird performance behavior > for sine and cosine -- using float32 is way slower than float64. On a 2ghz > opteron: > > sin float32 1.12447786331 > sin float64 0.133481025696 > cos float32 1.14155912399 > cos float64 0.131420135498 Which OS are you on ? FWIW, on Mac OS X, with recent svn checkout, I get expected results (float32 ~ twice faster). > > The times are in seconds, and are best of three runs of ten iterations of > numpy.{sin,cos} over a 1000-element array (script attached). I've produced > similar results on a PS3 system also. The opteron is running Python 2.6.1 > and NumPy 1.3.0, while the PS3 has Python 2.5.1 and NumPy 1.1.1. > > I haven't jumped into the code yet, but does anyone know why sin/cos are > ~8.5x slower for 32-bit floats compared to 64-bit doubles? My guess would be that you are on a platform where there is no sinf, and our sinf replacement is bad for some reason. > Side question: I see people in emails writing things like 'timeit foo(x)' > and having it run some sort of standard benchmark, how exactly do I do that? 
> Is that some environment other than a normal Python? Yes, that's in ipython. cheers, David From emmanuelle.gouillart at normalesup.org Mon Aug 3 09:45:56 2009 From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart) Date: Mon, 3 Aug 2009 15:45:56 +0200 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <4A76E709.9090100@indiana.edu> References: <4A76E709.9090100@indiana.edu> Message-ID: <20090803134556.GA31036@phare.normalesup.org> Hi Andrew, %timeit is an IPython magic command that uses the timeit module, see http://ipython.scipy.org/doc/stable/html/interactive/reference.html?highlight=timeit for more information about how to use it. So you were right to suppose that it is not a "normal Python". However, I was not able to reproduce your observations. >>> import numpy as np >>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) >>> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) >>> %timeit -n 10 np.sin(a) 10 loops, best of 3: 8.67 ms per loop >>> %timeit -n 10 np.sin(b) 10 loops, best of 3: 9.29 ms per loop Emmanuelle On Mon, Aug 03, 2009 at 09:32:57AM -0400, Andrew Friedley wrote: > While working on GSoC stuff I came across this weird performance > behavior for sine and cosine -- using float32 is way slower than > float64. On a 2ghz opteron: > > sin float32 1.12447786331 > sin float64 0.133481025696 > cos float32 1.14155912399 > cos float64 0.131420135498 > > The times are in seconds, and are best of three runs of ten iterations > of numpy.{sin,cos} over a 1000-element array (script attached). I've > produced similar results on a PS3 system also. The opteron is running > Python 2.6.1 and NumPy 1.3.0, while the PS3 has Python 2.5.1 and NumPy > 1.1.1. > > I haven't jumped into the code yet, but does anyone know why sin/cos are > ~8.5x slower for 32-bit floats compared to 64-bit doubles? > > Side question: I see people in emails writing things like 'timeit > foo(x)' and having it run some sort of standard benchmark, how exactly > do I do that? Is that some environment other than a normal Python? > > Thanks, > > Andrew > import timeit > t = timeit.Timer("numpy.sin(a)", > "import numpy\n" > "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float32)") > print "sin float32", min(t.repeat(3, 10)) > t = timeit.Timer("numpy.sin(a)", > "import numpy\n" > "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float64)") > print "sin float64", min(t.repeat(3, 10)) > t = timeit.Timer("numpy.cos(a)", > "import numpy\n" > "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float32)") > print "cos float32", min(t.repeat(3, 10)) > t = timeit.Timer("numpy.cos(a)", > "import numpy\n" > "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float64)") > print "cos float64", min(t.repeat(3, 10)) > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From afriedle at indiana.edu Mon Aug 3 10:08:42 2009 From: afriedle at indiana.edu (Andrew Friedley) Date: Mon, 03 Aug 2009 10:08:42 -0400 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <5b8d13220908030644v55366931hffbc80267d998667@mail.gmail.com> References: <4A76E709.9090100@indiana.edu> <5b8d13220908030644v55366931hffbc80267d998667@mail.gmail.com> Message-ID: <4A76EF6A.5060400@indiana.edu> Thanks for the quick responses. 
David Cournapeau wrote: > On Mon, Aug 3, 2009 at 10:32 PM, Andrew Friedley wrote: >> While working on GSoC stuff I came across this weird performance behavior >> for sine and cosine -- using float32 is way slower than float64. On a 2ghz >> opteron: >> >> sin float32 1.12447786331 >> sin float64 0.133481025696 >> cos float32 1.14155912399 >> cos float64 0.131420135498 > > Which OS are you on ? FWIW, on Mac OS X, with recent svn checkout, I > get expected results (float32 ~ twice faster). The numbers above are on linux, RHEL 5.2. The PS3 is running Fedora 9 I think. I just ran on a PPC OSX 10.5 system: sin float32 0.111793041229 sin float64 0.0902218818665 cos float32 0.112202882767 cos float64 0.0917768478394 Much more reasonable, but still not what I'd expect or what you seem to expect. >> The times are in seconds, and are best of three runs of ten iterations of >> numpy.{sin,cos} over a 1000-element array (script attached). I've produced >> similar results on a PS3 system also. The opteron is running Python 2.6.1 >> and NumPy 1.3.0, while the PS3 has Python 2.5.1 and NumPy 1.1.1. >> >> I haven't jumped into the code yet, but does anyone know why sin/cos are >> ~8.5x slower for 32-bit floats compared to 64-bit doubles? > > My guess would be that you are on a platform where there is no sinf, > and our sinf replacement is bad for some reason. I think linux has sinf, is there a quick/easy way to check if numpy is using it? >> Side question: I see people in emails writing things like 'timeit foo(x)' >> and having it run some sort of standard benchmark, how exactly do I do that? >> Is that some environment other than a normal Python? > > Yes, that's in ipython. Thanks for the pointer. Andrew From afriedle at indiana.edu Mon Aug 3 10:10:27 2009 From: afriedle at indiana.edu (Andrew Friedley) Date: Mon, 03 Aug 2009 10:10:27 -0400 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <20090803134556.GA31036@phare.normalesup.org> References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> Message-ID: <4A76EFD3.5010508@indiana.edu> Emmanuelle Gouillart wrote: > Hi Andrew, > > %timeit is an IPython magic command that uses the timeit module, > see > http://ipython.scipy.org/doc/stable/html/interactive/reference.html?highlight=timeit > for more information about how to use it. So you were right to suppose > that it is not a "normal Python". Thanks for the pointer, I'm not familiar with IPython at all, will check it out. > However, I was not able to reproduce your observations. > >>>> import numpy as np >>>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) >>>> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) >>>> %timeit -n 10 np.sin(a) > 10 loops, best of 3: 8.67 ms per loop >>>> %timeit -n 10 np.sin(b) > 10 loops, best of 3: 9.29 ms per loop OK, I'm curious, what OS/Python/Numpy are you using? 
Andrew From emmanuelle.gouillart at normalesup.org Mon Aug 3 10:21:12 2009 From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart) Date: Mon, 3 Aug 2009 16:21:12 +0200 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <4A76EFD3.5010508@indiana.edu> References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> Message-ID: <20090803142112.GA7495@phare.normalesup.org> > >>>> import numpy as np > >>>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) > >>>> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) > >>>> %timeit -n 10 np.sin(a) > > 10 loops, best of 3: 8.67 ms per loop > >>>> %timeit -n 10 np.sin(b) > > 10 loops, best of 3: 9.29 ms per loop > OK, I'm curious, what OS/Python/Numpy are you using? Sorry, I should have specified this information earlier: OS: Linux Ubuntu 9.04 (running a Dual Core Intel Pentium E5200 @ 2.50GHz) Python: 2.6.2 Numpy: 1.2.1 Emmanuelle From josef.pktd at gmail.com Mon Aug 3 10:44:51 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 3 Aug 2009 10:44:51 -0400 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <20090803142112.GA7495@phare.normalesup.org> References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> Message-ID: <1cd32cbb0908030744x7b924ee6g808cd9f85905660d@mail.gmail.com> On Mon, Aug 3, 2009 at 10:21 AM, Emmanuelle Gouillart wrote: >> >>>> import numpy as np >> >>>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) >> >>>> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) >> >>>> %timeit -n 10 np.sin(a) >> > 10 loops, best of 3: 8.67 ms per loop >> >>>> %timeit -n 10 np.sin(b) >> > 10 loops, best of 3: 9.29 ms per loop > >> OK, I'm curious, what OS/Python/Numpy are you using? > > Sorry, I should have specified this information earlier: > > OS: Linux Ubuntu 9.04 (running a Dual Core Intel Pentium E5200 @ > 2.50GHz) > Python: 2.6.2 > Numpy: 1.2.1 > > Emmanuelle > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > just for reference: on a plain single core WindowsXP (32bit) notebook with official numpy 1.3.0, I get with some variation sin float32 0.0963996820825 sin float64 0.164140135129 cos float32 0.124504371366 cos float64 0.149174266562 Josef From cournape at gmail.com Mon Aug 3 11:13:49 2009 From: cournape at gmail.com (David Cournapeau) Date: Tue, 4 Aug 2009 00:13:49 +0900 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <4A76EF6A.5060400@indiana.edu> References: <4A76E709.9090100@indiana.edu> <5b8d13220908030644v55366931hffbc80267d998667@mail.gmail.com> <4A76EF6A.5060400@indiana.edu> Message-ID: <5b8d13220908030813o7f00a975j43712cb458f728d5@mail.gmail.com> On Mon, Aug 3, 2009 at 11:08 PM, Andrew Friedley wrote: > Thanks for the quick responses. > > David Cournapeau wrote: >> On Mon, Aug 3, 2009 at 10:32 PM, Andrew Friedley wrote: >>> While working on GSoC stuff I came across this weird performance behavior >>> for sine and cosine -- using float32 is way slower than float64. On a 2ghz >>> opteron: >>> >>> sin float32 1.12447786331 >>> sin float64 0.133481025696 >>> cos float32 1.14155912399 >>> cos float64 0.131420135498 >> >> Which OS are you on ? 
FWIW, on Mac OS X, with recent svn checkout, I >> get expected results (float32 ~ twice faster). > > The numbers above are on linux, RHEL 5.2. The PS3 is running Fedora 9 I > think. I know next to nothing about the PS3 hardware, but I know that it is quite different compared to a conventional x86 CPU. Does it even have both 4 and 8 byte native floats ? > > Much more reasonable, but still not what I'd expect or what you seem to > expect. On an x86 system with sinf available in the math lib, I would expect the float32 to be faster than float64. Other than that, the exact ratio depends on too many factors (sse vs x87 usage, cache size, compiler, math library performance). One order of magnitude slower seems very strange in any case. > >>> The times are in seconds, and are best of three runs of ten iterations of >>> numpy.{sin,cos} over a 1000-element array (script attached). I've produced >>> similar results on a PS3 system also. The opteron is running Python 2.6.1 >>> and NumPy 1.3.0, while the PS3 has Python 2.5.1 and NumPy 1.1.1. >>> >>> I haven't jumped into the code yet, but does anyone know why sin/cos are >>> ~8.5x slower for 32-bit floats compared to 64-bit doubles? >> >> My guess would be that you are on a platform where there is no sinf, >> and our sinf replacement is bad for some reason. > > I think linux has sinf, is there a quick/easy way to check if numpy is > using it? You can look at the config.h in numpy/core/include/numpy, and see if there is a HAVE_SINF defined (for numpy >= 1.2.0 at least). cheers, David From kwgoodman at gmail.com Mon Aug 3 11:17:21 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Mon, 3 Aug 2009 08:17:21 -0700 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <20090803142112.GA7495@phare.normalesup.org> References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> Message-ID: On Mon, Aug 3, 2009 at 7:21 AM, Emmanuelle Gouillart wrote: >> >>>> import numpy as np >> >>>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) >> >>>> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) >> >>>> %timeit -n 10 np.sin(a) >> > 10 loops, best of 3: 8.67 ms per loop >> >>>> %timeit -n 10 np.sin(b) >> > 10 loops, best of 3: 9.29 ms per loop > >> OK, I'm curious, what OS/Python/Numpy are you using? > > Sorry, I should have specified this information earlier: > > OS: Linux Ubuntu 9.04 (running a Dual Core Intel Pentium E5200 @ > 2.50GHz) > Python: 2.6.2 > Numpy: 1.2.1 Why are my times so different from yours? 
>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) >> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) >> timeit -n 10 np.sin(a) 10 loops, best of 3: 46.8 ms per loop >> timeit -n 10 np.sin(b) 10 loops, best of 3: 7.43 ms per loop Ubuntu 9.04 on Core i7 920 (Quad 2.66GHz) Python 2.6.2 Numpy 1.3.0 And even though it is not used for this problem: ATLAS 3.8.3 (single threaded) From sccolbert at gmail.com Mon Aug 3 12:23:13 2009 From: sccolbert at gmail.com (Chris Colbert) Date: Mon, 3 Aug 2009 12:23:13 -0400 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> Message-ID: <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> I get similar results as the OP: In [1]: import numpy as np In [2]: a = np.arange(0.0, 1000, (2*3.14159) / 1000, dtype=np.float32) In [3]: b = np.arange(0.0, 1000, (2*3.14159) / 1000, dtype=np.float64) In [4]: %timeit -n 10 np.sin(a) 10 loops, best of 3: 63.8 ms per loop In [5]: %timeit -n 10 np.sin(b) 10 loops, best of 3: 10.8 ms per loop In [6]: %timeit np.sin(a) 10 loops, best of 3: 63.6 ms per loop In [7]: %timeit np.sin(b) 100 loops, best of 3: 8.85 ms per loop machine: ubuntu 9.04 AMD64 Intel Qx9300 @ 2.53GHz numpy 1.3 with Atlas 3.8.3 python 2.6.2 On Mon, Aug 3, 2009 at 11:17 AM, Keith Goodman wrote: > On Mon, Aug 3, 2009 at 7:21 AM, Emmanuelle > Gouillart wrote: >>> >>>> import numpy as np >>> >>>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) >>> >>>> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) >>> >>>> %timeit -n 10 np.sin(a) >>> > 10 loops, best of 3: 8.67 ms per loop >>> >>>> %timeit -n 10 np.sin(b) >>> > 10 loops, best of 3: 9.29 ms per loop >> >>> OK, I'm curious, what OS/Python/Numpy are you using? >> >> Sorry, I should have specified this information earlier: >> >> OS: Linux Ubuntu 9.04 (running a Dual Core Intel Pentium E5200 @ >> 2.50GHz) >> Python: 2.6.2 >> Numpy: 1.2.1 > > Why are my times so different from yours? 
> >>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) > >>> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) > >>> timeit -n 10 np.sin(a) > 10 loops, best of 3: 46.8 ms per loop > >>> timeit -n 10 np.sin(b) > 10 loops, best of 3: 7.43 ms per loop > > Ubuntu 9.04 on Core i7 920 (Quad 2.66GHz) > Python 2.6.2 > Numpy 1.3.0 > And even though it is not used for this problem: ATLAS 3.8.3 (single threaded) > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From emmanuelle.gouillart at normalesup.org Mon Aug 3 13:09:32 2009 From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart) Date: Mon, 3 Aug 2009 19:09:32 +0200 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> Message-ID: <20090803170932.GA23528@phare.normalesup.org> On Mon, Aug 03, 2009 at 08:17:21AM -0700, Keith Goodman wrote: > On Mon, Aug 3, 2009 at 7:21 AM, Emmanuelle > Gouillart wrote: > >> >>>> import numpy as np > >> >>>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) > >> >>>> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) > >> >>>> %timeit -n 10 np.sin(a) > >> > 10 loops, best of 3: 8.67 ms per loop > >> >>>> %timeit -n 10 np.sin(b) > >> > 10 loops, best of 3: 9.29 ms per loop > >> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) > >> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) > >> timeit -n 10 np.sin(a) > 10 loops, best of 3: 46.8 ms per loop > >> timeit -n 10 np.sin(b) > 10 loops, best of 3: 7.43 ms per loop > Why are my times so different from yours? No idea, sorry... All I can say is that I get similar results (around 11 and 12 ms per loop) with my other computer (which has the same Ubuntu/Python/Numpy configuration, and has 2 Intel T5600 @ 1.83GHz CPUs). 
Emmanuelle From charlesr.harris at gmail.com Mon Aug 3 13:38:14 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 3 Aug 2009 11:38:14 -0600 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> Message-ID: On Mon, Aug 3, 2009 at 10:23 AM, Chris Colbert wrote: > I get similar results as the OP: > > > In [1]: import numpy as np > > In [2]: a = np.arange(0.0, 1000, (2*3.14159) / 1000, dtype=np.float32) > > In [3]: b = np.arange(0.0, 1000, (2*3.14159) / 1000, dtype=np.float64) > > In [4]: %timeit -n 10 np.sin(a) > 10 loops, best of 3: 63.8 ms per loop > > In [5]: %timeit -n 10 np.sin(b) > 10 loops, best of 3: 10.8 ms per loop > > In [6]: %timeit np.sin(a) > 10 loops, best of 3: 63.6 ms per loop > > In [7]: %timeit np.sin(b) > 100 loops, best of 3: 8.85 ms per loop > > > machine: > > ubuntu 9.04 AMD64 > Intel Qx9300 @ 2.53 > numpy 1.3 with Atlas 3.8.3 > python 2.6.2 > > On Mon, Aug 3, 2009 at 11:17 AM, Keith Goodman wrote: > > On Mon, Aug 3, 2009 at 7:21 AM, Emmanuelle > > Gouillart wrote: > >>> >>>> import numpy as np > >>> >>>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) > >>> >>>> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) > >>> >>>> %timeit -n 10 np.sin(a) > >>> > 10 loops, best of 3: 8.67 ms per loop > >>> >>>> %timeit -n 10 np.sin(b) > >>> > 10 loops, best of 3: 9.29 ms per loop > >> > >>> OK, I'm curious, what OS/Python/Numpy are you using? > >> > >> Sorry, I should have specified these information earlier: > >> > >> OS: Linux Ubuntu 9.04 (running a Dual Core Intel Pentium E5200 @ > >> 2.50GHz) > >> Python: 2.6.2 > >> Numpy: 1.2.1 > > > > Why are my times so different from yours? > > > >>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) > >>> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) > >>> timeit -n 10 np.sin(a) > > 10 loops, best of 3: 46.8 ms per loop > >>> timeit -n 10 np.sin(b) > > 10 loops, best of 3: 7.43 ms per loop > > > > Ubuntu 9.04 on Core i7 920 (Quad 2.66GHz) > > Python 2.6.2 > > Numpy 1.3.0 > > And even though it is not used for this problem: ATLAS 3.8.3 (single > threaded) > What compiler versions are folks using? In the slow cases, what is the timing for converting to double, computing the sin, then casting back to single? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From afriedle at indiana.edu Mon Aug 3 13:39:39 2009 From: afriedle at indiana.edu (Andrew Friedley) Date: Mon, 03 Aug 2009 13:39:39 -0400 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <5b8d13220908030813o7f00a975j43712cb458f728d5@mail.gmail.com> References: <4A76E709.9090100@indiana.edu> <5b8d13220908030644v55366931hffbc80267d998667@mail.gmail.com> <4A76EF6A.5060400@indiana.edu> <5b8d13220908030813o7f00a975j43712cb458f728d5@mail.gmail.com> Message-ID: <4A7720DB.80005@indiana.edu> David Cournapeau wrote: >> David Cournapeau wrote: >>> On Mon, Aug 3, 2009 at 10:32 PM, Andrew Friedley wrote: >>>> While working on GSoC stuff I came across this weird performance behavior >>>> for sine and cosine -- using float32 is way slower than float64. 
On a 2ghz >>>> opteron: >>>> >>>> sin float32 1.12447786331 >>>> sin float64 0.133481025696 >>>> cos float32 1.14155912399 >>>> cos float64 0.131420135498 >>> Which OS are you on ? FWIW, on Mac OS X, with recent svn checkout, I >>> get expected results (float32 ~ twice faster). >> The numbers above are on linux, RHEL 5.2. The PS3 is running Fedora 9 I >> think. > > I know next to nothing about the PS3 hardware, but I know that it is > quite different compared to a conventional x86 CPU. Does it even have > both 4 and 8 byte native floats ? Yes. As far as this discussion is concerned, the PS3/Cell is just a slow PowerPC. Quite different from x86, but probably not as different as you think :) >> Much more reasonable, but still not what I'd expect or what you seem to >> expect. > > On an x86 system with sinf available in the math lib, I would expect > the float32 to be faster than float64. Other than that, the exact > ratio depends on too many factors (sse vs x87 usage, cache size, > compiler, math library performance). One order of magnitude slower seems > very strange in any case. OK. I'll probably investigate this a bit further, but I don't have anything that really depends on this issue. It does explain a large part of why my cos ufunc was so much faster. Since I'm observing this on both x86 and PPC (PS3), I don't think it's a hardware issue -- something in the software stack. And now there are two people reporting results with only different numpy versions. >>>> The times are in seconds, and are best of three runs of ten iterations of >>>> numpy.{sin,cos} over a 1000-element array (script attached). I've produced >>>> similar results on a PS3 system also. The opteron is running Python 2.6.1 >>>> and NumPy 1.3.0, while the PS3 has Python 2.5.1 and NumPy 1.1.1. >>>> >>>> I haven't jumped into the code yet, but does anyone know why sin/cos are >>>> ~8.5x slower for 32-bit floats compared to 64-bit doubles? >>> My guess would be that you are on a platform where there is no sinf, >>> and our sinf replacement is bad for some reason. >> I think linux has sinf, is there a quick/easy way to check if numpy is >> using it? > > You can look at the config.h in numpy/core/include/numpy, and see if > there is a HAVE_SINF defined (for numpy >= 1.2.0 at least). OK, I see HAVE_SINF (and HAVE_COSF) for my 1.3.0 build on the opteron system. I'm using the distro-provided packages on other systems, so I guess I can't check those. I don't think this matters -- numpy/core/src/npy_math.c just defines sinf as a function calling sin. So if HAVE_SINF wasn't set, I'd expect the performance difference to be very little, with floats still being slightly faster (less mem traffic). Also I just went and wrote a C program to do a similar benchmark, and I am unable to reproduce the issue there. Makes me think the problem is in NumPy, but I have no idea where to look. Suggestions welcome :) Andrew From afriedle at indiana.edu Mon Aug 3 13:51:36 2009 From: afriedle at indiana.edu (Andrew Friedley) Date: Mon, 03 Aug 2009 13:51:36 -0400 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> Message-ID: <4A7723A8.8010107@indiana.edu> Charles R Harris wrote: > What compiler versions are folks using? 
In the slow cases, what is the > timing for converting to double, computing the sin, then casting back to > single? I did this, is this the right way to do that? t = timeit.Timer("numpy.sin(a.astype(numpy.float64)).astype(numpy.float32)", "import numpy\n" "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float64)") print "sin converted float 32/64", min(t.repeat(3, 10)) Timings on my opteron system (2-socket 2-core 2GHz): sin float32 1.13407707214 sin float64 0.133460998535 sin converted float 32/64 0.18202996254 Not too surprising I guess. gcc --version shows: gcc (GCC) 4.1.2 20080704 (Red Hat 4.1.2-44) My compile flags for my Python 2.6.1/NumPy 1.3.0 builds: -Os -fomit-frame-pointer -pipe -s -march=k8 -m64 Andrew From charlesr.harris at gmail.com Mon Aug 3 14:09:49 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 3 Aug 2009 12:09:49 -0600 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <4A7723A8.8010107@indiana.edu> References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> <4A7723A8.8010107@indiana.edu> Message-ID: On Mon, Aug 3, 2009 at 11:51 AM, Andrew Friedley wrote: > Charles R Harris wrote: > > What compiler versions are folks using? In the slow cases, what is the > > timing for converting to double, computing the sin, then casting back to > > single? > > I did this, is this the right way to do that? > > t = > timeit.Timer("numpy.sin(a.astype(numpy.float64)).astype(numpy.float32)", > "import numpy\n" > "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, > dtype=numpy.float64)") > print "sin converted float 32/64", min(t.repeat(3, 10)) > > Timings on my opteron system (2-socket 2-core 2GHz): > > sin float32 1.13407707214 > sin float64 0.133460998535 > sin converted float 32/64 0.18202996254 > > Not too surprising I guess. > > gcc --version shows: > > gcc (GCC) 4.1.2 20080704 (Red Hat 4.1.2-44) > > My compile flags for my Python 2.6.1/NumPy 1.3.0 builds: > > -Os -fomit-frame-pointer -pipe -s -march=k8 -m64 > That looks right. When numpy doesn't find a *f version it basically does that conversion. This is beginning to look like a hardware/software implementation problem, maybe compiler related. That is, I suspect the fast times come from using a hardware implementation. What happens if you use -O2 instead of -Os? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From bsouthey at gmail.com Mon Aug 3 14:19:01 2009 From: bsouthey at gmail.com (Bruce Southey) Date: Mon, 03 Aug 2009 13:19:01 -0500 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <4A7723A8.8010107@indiana.edu> References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> <4A7723A8.8010107@indiana.edu> Message-ID: <4A772A15.8090407@gmail.com> On 08/03/2009 12:51 PM, Andrew Friedley wrote: > Charles R Harris wrote: > >> What compiler versions are folks using? In the slow cases, what is the >> timing for converting to double, computing the sin, then casting back to >> single? >> > > I did this, is this the right way to do that? 
> > t = timeit.Timer("numpy.sin(a.astype(numpy.float64)).astype(numpy.float32)", > "import numpy\n" > "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, > dtype=numpy.float64)") > print "sin converted float 32/64", min(t.repeat(3, 10)) > > Timings on my opteron system (2-socket 2-core 2GHz): > > sin float32 1.13407707214 > sin float64 0.133460998535 > sin converted float 32/64 0.18202996254 > > Not too surprising I guess. > > gcc --version shows: > > gcc (GCC) 4.1.2 20080704 (Red Hat 4.1.2-44) > > My compile flags for my Python 2.6.1/NumPy 1.3.0 builds: > > -Os -fomit-frame-pointer -pipe -s -march=k8 -m64 > > Andrew > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > Hi, Can you try these from the command line: python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000, (2*3.14159) / 1000, dtype=np.float32)" python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000, (2*3.14159) / 1000, dtype=np.float32); b=np.sin(a)" python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000, (2*3.14159) / 1000, dtype=np.float32); np.sin(a)" python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000, (2*3.14159) / 1000, dtype=np.float32)" "np.sin(a)" The first should be similar for different dtypes because it is just array creation. The second extends that by storing the sin into another array. I am not sure how to interpret the third but in the Python prompt it would print it to screen. The last causes Python to handle two arguments, which is slow using float32 but not for float64 and float128, suggesting a compiler issue such as not using SSE or similar. Bruce -------------- next part -------------- An HTML attachment was scrubbed... URL: From Chris.Barker at noaa.gov Mon Aug 3 14:47:55 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Mon, 03 Aug 2009 11:47:55 -0700 Subject: [Numpy-discussion] (newbie) How can I use NumPy to wrap my C++ class with 2-dimensional arrays? In-Reply-To: <4A72DAD0.7040601@zonnet.nl> References: <4A7140A8.2040305@zonnet.nl> <4A715F40.1060903@zonnet.nl> <4A7162C4.8030605@zonnet.nl> <4A7173A4.6070608@zonnet.nl> <4A72DAD0.7040601@zonnet.nl> Message-ID: <4A7730DB.3020907@noaa.gov> Raymond de Vries wrote: > Thanks for the explanation. After having looked at the documentation, I > decided to do my own plain Python c-api implementation. That is unlikely to be the best option these days -- it's simply too easy to make a type-checking and/or reference-counting error. If SWIG isn't your cup of tea, take a look at Cython or Ctypes -- lower level and more control, but still handle much of the bookkeeping for you. The Cython team is working on better C++ support, though I don't know where they are at with that. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From cekees at gmail.com Mon Aug 3 15:57:07 2009 From: cekees at gmail.com (Chris Kees) Date: Mon, 3 Aug 2009 14:57:07 -0500 Subject: [Numpy-discussion] PDE BoF at SciPy2009 Message-ID: <1963DA80-8CE5-4033-BCC8-EBEF05352AAB@gmail.com> Is there any interest in a BoF session on implementing numerical methods for partial differential equations using modules like numpy, cython, mpi4py, etc.? 
Regards, Chris From reedev at zonnet.nl Mon Aug 3 16:55:22 2009 From: reedev at zonnet.nl (Raymond de Vries) Date: Mon, 03 Aug 2009 22:55:22 +0200 Subject: [Numpy-discussion] (newbie) How can I use NumPy to wrap my C++ class with 2-dimensional arrays? In-Reply-To: <4A7730DB.3020907@noaa.gov> References: <4A7140A8.2040305@zonnet.nl> <4A715F40.1060903@zonnet.nl> <4A7162C4.8030605@zonnet.nl> <4A7173A4.6070608@zonnet.nl> <4A72DAD0.7040601@zonnet.nl> <4A7730DB.3020907@noaa.gov> Message-ID: <4A774EBA.6040208@zonnet.nl> Hi Chris, >> Thanks for the explanation. After having looked at the documentation, I >> decided to do my own plain Python c-api implementation. >> > > That is unlikely to be the best option these days -- it's simply too > easy to make a type-checking and/or reference-counting error. > > If SWIG isn't your cup of tea, take a look at Cython or Ctypes -- lower > level and more control, but still handle much of the bookkeeping for you. > > The Cython team is working on better C++ support, though I don't know > where they are at with that. > Oops, I guess I didn't express myself clearly enough: I have used the plain Python c-api (in my case a list of lists for my 2-dimensional arrays) for my typemaps. Sorry for being unclear. Actually, that's because NumPy is not my cup of tea... Especially because Matthieu suggested that I should convert my data into a contiguous array. So no matter what I use, either swig, cython, or.. I still have the NumPy issue. Do you, or someone else, see another possibility? regards Raymond > -Chris > > > From d_l_goldsmith at yahoo.com Mon Aug 3 17:26:17 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Mon, 3 Aug 2009 14:26:17 -0700 (PDT) Subject: [Numpy-discussion] PDE BoF at SciPy2009 Message-ID: <701667.11074.qm@web52111.mail.re2.yahoo.com> Please remind me: BoF = ? DG --- On Mon, 8/3/09, Chris Kees wrote: > From: Chris Kees > Subject: [Numpy-discussion] PDE BoF at SciPy2009 > To: "Discussion of Numerical Python" > Date: Monday, August 3, 2009, 12:57 PM > Is there any interest in a BoF > session on implementing numerical > methods for partial differential equations using modules > like numpy, > cython, mpi4py, etc.? > > Regards, > Chris > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From gael.varoquaux at normalesup.org Mon Aug 3 17:27:17 2009 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Mon, 3 Aug 2009 23:27:17 +0200 Subject: [Numpy-discussion] PDE BoF at SciPy2009 In-Reply-To: <701667.11074.qm@web52111.mail.re2.yahoo.com> References: <701667.11074.qm@web52111.mail.re2.yahoo.com> Message-ID: <20090803212717.GH32408@phare.normalesup.org> On Mon, Aug 03, 2009 at 02:26:17PM -0700, David Goldsmith wrote: > Please remind me: BoF = ? http://conference.scipy.org/bofs G. From Chris.Barker at noaa.gov Mon Aug 3 19:17:10 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Mon, 03 Aug 2009 16:17:10 -0700 Subject: [Numpy-discussion] (newbie) How can I use NumPy to wrap my C++ class with 2-dimensional arrays? 
In-Reply-To: <4A774EBA.6040208@zonnet.nl> References: <4A7140A8.2040305@zonnet.nl> <4A715F40.1060903@zonnet.nl> <4A7162C4.8030605@zonnet.nl> <4A7173A4.6070608@zonnet.nl> <4A72DAD0.7040601@zonnet.nl> <4A7730DB.3020907@noaa.gov> <4A774EBA.6040208@zonnet.nl> Message-ID: <4A776FF6.6080500@noaa.gov> Raymond de Vries wrote: > Oops, I guess I didn't express myself clearly enough: I have used the plain > Python c-api (in my case a list of lists for my 2-dimensional arrays) > for my typemaps. Sorry for being unclear. Actually, that's because NumPy is > not my cup of tea... Well, for almost any purpose, numpy arrays are a better fit for a 2-d array of numbers than a list of lists, so I'm not sure what kind of tea you like ;-) > Especially because Matthieu suggested that I should > convert my data into a contiguous array. a Python list can't share a pointer with your C++ data type anyway, so you'll have to copy regardless -- why not make a contiguous array? If you need a "ragged" array, then it's a different story, but a list of numpy arrays may be a better fit. > So no matter what I use, either swig, cython, or.. I still have the > NumPy issue. True, it's probably best to decide what sort of representation you want in Python, then decide how to build your wrappers. If you have a big 2-d array in C++, numpy is the obvious choice. Another choice is a wrapper around your C++ class -- don't convert to a python type at all. Depending on what you need to do, it may be OK to lose a lot of the functionality numpy gives you. This is how the SWIG C++ vector wrappers work, for instance. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From matthew.brett at gmail.com Mon Aug 3 19:35:35 2009 From: matthew.brett at gmail.com (Matthew Brett) Date: Mon, 3 Aug 2009 16:35:35 -0700 Subject: [Numpy-discussion] Is this a bug in numpy.distutils ? Message-ID: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> Hi, We are using numpy.distutils, and have run into this odd behavior on Windows: I have XP, MinGW, latest numpy SVN, python.org python 2.6. All the commands below I am running from within the 'numpy' root directory (where 'numpy' is a subdirectory). If I run python setup.py build I get the following expected error: ''' No module named msvccompiler in numpy.distutils; trying from distutils error: Unable to find vcvarsall.bat ''' because I don't have MSVC. If I run: python setup.py build -c mingw32 - that works. But running python setup.py build_ext -c mingw32 generates the same error as above. Similarly: python setup.py build_ext -c completely_unknown ignores the attempt to set the 'completely_unknown' compiler, whereas python setup.py build -c completely_unknown raises a sensible error. I conclude that the numpy.distutils build_ext command is ignoring at least the compiler options. Is that correct? Thanks a lot, Matthew From david at ar.media.kyoto-u.ac.jp Mon Aug 3 22:42:17 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Tue, 04 Aug 2009 11:42:17 +0900 Subject: [Numpy-discussion] Funded work on Numpy: proposed improvements and request for feedback Message-ID: <4A77A009.9060104@ar.media.kyoto-u.ac.jp> Hi All, I (David Cournapeau) and the people at Berkeley (Jarrod Millman, Fernando Perez, Matthew Brett) have been in discussion so that I could do some funded work on NumPy/SciPy. 
Although they are obviously interested in improvements that help their own projects, they are willing to make sure the work will impact numpy/scipy as a whole. As such we would like to get some feedback about the proposal. There are several areas we discussed, but the main 'vision' is to make more of the C code in numpy reusable by 3rd parties, in particular purely computational (fft, linear algebra, etc...) code. A first draft of the proposal is pasted below. Comments, requests for details, and objections are welcome. Thank you for your attention, The Berkeley team, Gael Varoquaux and David Cournapeau ================================== Proposal for improvements to numpy ================================== NumPy is a solid foundation for efficient numerical computation with the python programming language. It consists of a set of extensions to add a powerful multi-dimensional array object. SciPy is built upon NumPy to add more high level functionalities such as numerical integration, linear algebra, statistical functions, etc\.\.\. Although the numpy codebase is mature, and can be reused both at the C and Python levels, there are some limitations in the numpy codebase which prevent some functionalities from being reused by third parties. This means that users of numpy either need to reimplement the functionalities, or to use workarounds. The main goal of this proposal is to improve numpy to circumvent those limitations in a general manner. Reusable C libraries ==================== A lot of NumPy and SciPy code is in a compiled language (mostly C and Fortran). For computational code, it is generally advisable to split it into purely computational code and a wrapping part, marshalling python objects/structures back and forth into basic C types. For example, when computing the exponential of the items in an array, most of NumPy's job is to extract the data from the array into one of the basic C types (int, double, etc...), call the C function exp, and marshall the data back into python objects. Making the marshalling and purely computational code separate has several advantages: 1. The code is easier to follow. 2. The purely computational code could be reused by third parties. For example, even for simple C math functions, there is a vast difference in platform/toolchain support. NumPy makes sure that functions to handle special float values (NaN, Inf, etc...) work on every supported platform, in a consistent manner. Making those functions available to third parties would enable developers to reuse these portable functions, and stay consistent even on platforms they don't care about. 3. Working on optimizing the computational code is easier. 4. It would enable easier replacement of the purely computational code at runtime. For example, one could imagine loading SSE-enabled code if the CPU supports SSE extensions. 5. It would also help with py3k porting, as only the marshalling code would need to change. To make purely computational code available to third parties, two things are needed: 1. the code itself needs to make the split explicit. 2. there needs to be support so that reusing those functionalities is as painless as possible, from a build point of view (Note: this is almost done in the upcoming numpy 1.4.0 as long as static linking is OK). A short C sketch at the end of this proposal illustrates the kind of third-party usage this should enable. Splitting the code ------------------ The amount of work is directly proportional to the number of functions to be made available. The most obvious candidates are: 1. C99 math functions: a lot of this has already been done. 
In particular, math constants and special values support is already
   implemented. Almost every real function in numpy has a portable npy_
   implementation in C.
2. C99-like complex support: this naturally extends the previous series.
   The main difficulty is supporting platforms without C99 complex
   support, and the corresponding C99 complex functions.
3. FFT code: there is no support to reuse FFT at the C level at the
   moment.
4. Random: there is no support either.
5. Linalg: idem.

Build support
-------------

Once the code itself is split, some support is needed so that the code
can be reused by third parties. The following issues need to be solved:

1. Compile the computational code into a shared or static library.
2. Once built, make the libraries available to third parties (distutils
   issues). Ideally, it should work for installed builds, in-place
   builds, etc...
3. Versioning, ABI/API compatibility issues.

Iterators
=========

When dealing with multi-dimensional arrays, the best abstraction for
handling indexing in a dimension-independent way is the iterator. NumPy
already has some iterators to walk over every item of an array, or over
every item but one axis. More general iterators are useful for more
complicated cases, when one needs to walk over only a subset of the items
of the array. For example, for image processing, it is often necessary to
walk over a neighborhood of an array. Boundary conditions can be handled
automatically, so that padding is transparent to the user. More elaborate
iterators, e.g. with a mask (for morphological image processing), can be
considered as well.

Several packages in scipy implement those iterators (ndimage), or handle
boundary conditions manually in the algorithmic code (scipy.signal).
Implementing iterators in numpy would enable better code reuse.

Possible iterators
------------------

A neighborhood iterator is already available in numpy. It can handle
zero, one, constant, and mirror padding. Potential improvements can be
considered from a speed POV, in particular by splitting areas which need
boundary handling from the ones which do not.

A masked neighborhood iterator is not available. ITK is one toolkit which
implements such an iterator.

C code coverage and static numpy linking
========================================

The NumPy community has focused a lot on improving the test suite. We
went from a few hundred unit tests in 2006 to more than 2000 unit tests
for numpy 1.4. Although code coverage at the python level is relatively
easy to obtain using some nose plugins, C code coverage is not possible
ATM.

The traditional code coverage tool for C code is gprof, the GNU profiler.
Unfortunately, gprof cannot profile code which is dynamically linked, as
is the case for python extensions. One solution is thus to statically
link numpy to the python interpreter. This poses challenges at both the
build and code levels. Some preliminary work showed that the approach
works, but something which could be integrated upstream, and make numpy
easily linkable to the python interpreter, would be better.

Also, some people have expressed interest in distributing a python
interpreter with numpy statically linked (e.g. for easy distribution).

From david at ar.media.kyoto-u.ac.jp Mon Aug 3 23:11:29 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Tue, 04 Aug 2009 12:11:29 +0900
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
In-Reply-To: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com>
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com>
Message-ID: <4A77A6E1.1050403@ar.media.kyoto-u.ac.jp>

Matthew Brett wrote:
> Hi,
>
> We are using numpy.distutils, and have run into this odd behavior in windows:
>
> I have XP, Mingw, latest numpy SVN, python.org python 2.6. All the
> commands below I am running from within the 'numpy' root directory
> (where 'numpy' is a subdirectory).
>
> If I run
>
> python setup.py build
>
> I get the following expected error:
>
> '''
> No module named msvccompiler in numpy.distutils; trying from distutils
> error: Unable to find vcvarsall.bat
> '''
>
> because I don't have MSVC.
>
> If I run:
>
> python setup.py build -c mingw32
>
> - that works. But running
>
> python setup.py build_ext -c mingw32
>
> generates the same error as above. Similarly:
>
> python setup.py build_ext -c completely_unknown
>
> ignores the attempt to set the 'completely_unknown' compiler, whereas
>
> python setup.py build -c completely_unknown
>
> raises a sensible error. I conclude that the numpy.distutils
> build_ext command is ignoring at least the compiler options.
>
> Is that correct?

Short answer:

I am afraid it cannot work as you want. Basically, when you pass an
option to build_ext, it does not affect other distutils commands, which
are run before build_ext and need the compiler (config in this case, I
think). So you need to pass the -c option to every command affected by
the compiler (build_ext, build_clib and config IIRC).

Long answer:

The reason is linked to the single most annoying "feature" of distutils:
distutils fundamentally works by running some commands, one after the
other. Commands have subcommands. For Numpy, as far as compiled code is
concerned, it goes like this: config - build - build_clib - build_ext
(the build command calls all the build_* subcommands and config).

Now, each command's option set is independent of the others (build_ext
vs. config in this case), but if you pass an option to a command it
affects all its subcommands, I believe.

cheers,

David

From charlesr.harris at gmail.com Tue Aug 4 00:23:47 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Mon, 3 Aug 2009 22:23:47 -0600
Subject: [Numpy-discussion] Funded work on Numpy: proposed improvements and request for feedback
In-Reply-To: <4A77A009.9060104@ar.media.kyoto-u.ac.jp>
References: <4A77A009.9060104@ar.media.kyoto-u.ac.jp>
Message-ID:

On Mon, Aug 3, 2009 at 8:42 PM, David Cournapeau <
david at ar.media.kyoto-u.ac.jp> wrote:

> Hi All,
>
> I (David Cournapeau) and the people at Berkeley (Jarrod Millman,
> Fernando Perez, Matthew Brett) have been in discussion so that I could
> do some funded work on NumPy/SciPy. Although they are obviously
> interested in improvements that help their own projects, they are
> willing to make sure the work will impact numpy/scipy as a whole. As
> such, we would like to get some feedback about the proposal.
>
> There are several areas we discussed, but the main 'vision' is to make
> more of the C code in numpy reusable to 3rd parties, in particular
> purely computational (fft, linear algebra, etc...) code. A first draft
> of the proposal is pasted below.
>
> Comments, requests for details, and objections are welcome,
>
> Thank you for your attention,
>
> The Berkeley team, Gael Varoquaux and David Cournapeau
>
> ==================================
> Proposal for improvements to numpy
> ==================================
>
> NumPy is a solid foundation for efficient numerical computation with the
> python programming language. It consists of a set of extensions to add a
> powerful multi-dimensional array object. SciPy is built upon NumPy to add
> more high level functionalities such as numerical integration, linear
> algebra, statistical functions, etc... Although the numpy codebase is
> mature, and can be reused both at the C and Python levels, there are some
> limitations in the numpy codebase which prevent some functionalities from
> being reused by third parties. This means that users of numpy either need
> to reimplement the functionalities, or use workarounds. The main goal of
> this proposal is to improve numpy to circumvent those limitations in a
> general manner.
>
> Reusable C libraries
> ====================
>
> A lot of NumPy and SciPy code is in a compiled language (mostly C and
> Fortran). For computational code, it is generally advisable to split it
> into a purely computational part and a wrapping part, marshalling python
> objects/structures back and forth into basic C types. For example, when
> computing the exponential of the items in an array, most of NumPy's job
> is to extract the data from the array into one of the basic C types (int,
> double, etc...), call the C function exp, and marshal the data back into
> python objects. Keeping the marshalling code and the purely computational
> code separate has several advantages:
>
> 1. The code is easier to follow.
> 2. The purely computational code could be reused by third parties. For
> example, even for simple C math functions, there is a vast difference in
> platform/toolchain support. NumPy makes sure that functions to handle
> special float values (NaN, Inf, etc...) work on every supported platform,
> in a consistent manner. Making those functions available to third parties
> would enable developers to reuse these portable functions, and stay
> consistent even on platforms they don't care about.
> 3. Working on optimizing the computational code is easier.
> 4. It would enable easier replacement of the purely computational code at
> runtime. For example, one could imagine loading SSE-enabled code if the
> CPU supports SSE extensions.
> 5. It would also help with the py3k port, as only the marshalling code
> would need to change.
>
> To make purely computational code available to third parties, two things
> are needed:
>
> 1. the code itself needs to make the split explicit.
> 2. there needs to be support so that reusing those functionalities is as
> painless as possible, from a build point of view (Note: this is almost
> done in the upcoming numpy 1.4.0, as long as static linking is OK).
>

Ah, it itches. This is certainly a worthy goal, but are there third
parties who have expressed an interest in this? I mean, besides trying to
avoid duplicate bits of code in Scipy.

>
> Splitting the code
> ------------------
>
> The amount of work is directly proportional to the number of functions
> to be made available. The most obvious candidates are:
>
> 1. C99 math functions: a lot of this has already been done. In
> particular, math constants and special values support is already
> implemented.
> Almost every real function in numpy has a portable npy_ implementation
> in C.
> 2. C99-like complex support: this naturally extends the previous series.
> The main difficulty is supporting platforms without C99 complex support,
> and the corresponding C99 complex functions.
> 3. FFT code: there is no support to reuse FFT at the C level at the
> moment.
> 4. Random: there is no support either.
> 5. Linalg: idem.
>

This is good. I think it should go along with code reorganization. The
files are now broken up but I am not convinced that everything is yet
where it should be.

The complex support could be a major effort in its own right if we need
to rewrite all the current functions. That said, it would be nice if the
complex support was separated out like the current real support. Tests to
go along with it would be helpful. This also ties in with having build
support for many platforms.

>
> Build support
> -------------
>
> Once the code itself is split, some support is needed so that the code
> can be reused by third parties. The following issues need to be solved:
>
> 1. Compile the computational code into a shared or static library.
> 2. Once built, make the libraries available to third parties (distutils
> issues). Ideally, it should work for installed builds, in-place builds,
> etc...
> 3. Versioning, ABI/API compatibility issues.
>

Trying to break out the build support itself might be useful. It would be
good to get some feedback here from other projects that might be
interested. But this is a wheel that probably gets reinvented on a
regular basis.

>
> Iterators
> =========
>
> When dealing with multi-dimensional arrays, the best abstraction for
> handling indexing in a dimension-independent way is the iterator. NumPy
> already has some iterators to walk over every item of an array, or over
> every item but one axis. More general iterators are useful for more
> complicated cases, when one needs to walk over only a subset of the items
> of the array. For example, for image processing, it is often necessary to
> walk over a neighborhood of an array. Boundary conditions can be handled
> automatically, so that padding is transparent to the user. More elaborate
> iterators, e.g. with a mask (for morphological image processing), can be
> considered as well.
>
> Several packages in scipy implement those iterators (ndimage), or handle
> boundary conditions manually in the algorithmic code (scipy.signal).
> Implementing iterators in numpy would enable better code reuse.
>
> Possible iterators
> ------------------
>
> A neighborhood iterator is already available in numpy. It can handle
> zero, one, constant, and mirror padding. Potential improvements can be
> considered from a speed POV, in particular by splitting areas which need
> boundary handling from the ones which do not.
>
> A masked neighborhood iterator is not available. ITK is one toolkit
> which implements such an iterator.
>

I think this needs some thought. This would essentially be a C library of
iterator code. C++ is probably an easier language for such things as it
handles the classes and inlining automatically. Which is to say if I had
to deal with a lot of iterators I might choose a different language for
implementation.

>
> C code coverage and static numpy linking
> ========================================
>
> The NumPy community has focused a lot on improving the test suite. We
> went from a few hundred unit tests in 2006 to more than 2000 unit tests
> for numpy 1.4.
> Although code coverage at the python level is relatively easy to obtain
> using some nose plugins, C code coverage is not possible ATM.
>
> The traditional code coverage tool for C code is gprof, the GNU
> profiler. Unfortunately, gprof cannot profile code which is dynamically
> linked, as is the case for python extensions. One solution is thus to
> statically link numpy to the python interpreter. This poses challenges
> at both the build and code levels. Some preliminary work showed that the
> approach works, but something which could be integrated upstream, and
> make numpy easily linkable to the python interpreter, would be better.
>
> Also, some people have expressed interest in distributing a python
> interpreter with numpy statically linked (e.g. for easy distribution).
>

I don't have an opinion here.

As a side issue, it would be nice to have some infrastructure for
documenting the C code. That way, after I have worked my way through one
of the numpy functions, I could document it so that I wouldn't have to
repeat the whole process at some later date.

As to choosing a project, you should pick one that really interests you.
How would you rank your own interest in these various proposals?

Chuck

From dave.hirschfeld at gmail.com Tue Aug 4 03:56:31 2009
From: dave.hirschfeld at gmail.com (Dave)
Date: Tue, 4 Aug 2009 07:56:31 +0000 (UTC)
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77A6E1.1050403@ar.media.kyoto-u.ac.jp>
Message-ID:

David Cournapeau <david at ar.media.kyoto-u.ac.jp> writes:

>
> Matthew Brett wrote:
> > Hi,
> >
> > We are using numpy.distutils, and have run into this odd behavior in windows:
> >
>
> Short answer:
>
> I am afraid it cannot work as you want. Basically, when you pass an
> option to build_ext, it does not affect other distutils commands, which
> are run before build_ext and need the compiler (config in this case, I
> think). So you need to pass the -c option to every command affected by
> the compiler (build_ext, build_clib and config IIRC).
>
> cheers,
>
> David
>

I'm having the same problems! Running Windows XP, Python 2.5.4
(r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit (Intel)].

In my distutils.cfg I've got:

[build]
compiler=mingw32

[config]
compiler = mingw32

and previously a python setup.py bdist_wininst would create an .exe
installer; now I get the following error message:

error: Python was built with Visual Studio 2003; extensions must be
built with a compiler than can generate compatible binaries. Visual
Studio 2003 was not found on this system. If you have Cygwin installed,
you can try compiling with MingW32, by passing "-c mingw32" to setup.py.

python setup.py build build_ext --compiler=mingw32 appeared to work
(barring a warning: numpy\core\setup_common.py:81: MismatchCAPIWarning)
but then how do I create a .exe installer afterwards?

python setup.py bdist_wininst fails with the same error message as
before, and python setup.py bdist_wininst --compiler=mingw32 fails with
the message:

error: option --compiler not recognized

Is it still possible to create a .exe installer on Windows and, if so,
what are the commands we need to make it work?

Thanks in advance for any help/workarounds - it would be much
appreciated!
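Based on David's earlier reply I would guess that the per-command
equivalent in distutils.cfg is something like the following, although I
haven't verified that it gets around the error:

[build_ext]
compiler=mingw32

[build_clib]
compiler=mingw32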
Regards,
Dave

From david at ar.media.kyoto-u.ac.jp Tue Aug 4 03:37:06 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Tue, 04 Aug 2009 16:37:06 +0900
Subject: [Numpy-discussion] Funded work on Numpy: proposed improvements and request for feedback
In-Reply-To: 
References: <4A77A009.9060104@ar.media.kyoto-u.ac.jp>
Message-ID: <4A77E522.50300@ar.media.kyoto-u.ac.jp>

Hi Chuck,

Charles R Harris wrote:
>
> To make purely computational code available to third parties, two
> things are needed:
>
> 1. the code itself needs to make the split explicit.
> 2. there needs to be support so that reusing those functionalities is
> as painless as possible, from a build point of view (Note: this is
> almost done in the upcoming numpy 1.4.0, as long as static linking is
> OK).
>
> Ah, it itches. This is certainly a worthy goal, but are there third
> parties who have expressed an interest in this? I mean, besides trying
> to avoid duplicate bits of code in Scipy.

Actually, I think that's what interests the people around the Nipy
project the most. In particular, they need to reuse lapack and random
quite a bit, and for now they just duplicate the code, with all the
problems that brings (duplication, lack of reliability as far as cross
platform support is concerned, etc...).

> Splitting the code
> ------------------
>
> The amount of work is directly proportional to the number of functions
> to be made available. The most obvious candidates are:
>
> 1. C99 math functions: a lot of this has already been done. In
> particular, math constants and special values support is already
> implemented. Almost every real function in numpy has a portable npy_
> implementation in C.
> 2. C99-like complex support: this naturally extends the previous
> series. The main difficulty is supporting platforms without C99 complex
> support, and the corresponding C99 complex functions.
> 3. FFT code: there is no support to reuse FFT at the C level at the
> moment.
> 4. Random: there is no support either.
> 5. Linalg: idem.
>
> This is good. I think it should go along with code reorganization. The
> files are now broken up but I am not convinced that everything is yet
> where it should be.

Oh, definitely agreed. Another thing I would like in that spirit is to
split the numpy headers like in Python itself: ndarrayobject.h would
still pull in everything (for backward compatibility reasons), but
people could include only a few headers if they want to.

The rationale for me is when I work on numpy itself: it is kind of
stupid that every time I change the iterator structures, the whole numpy
core has to be rebuilt. That's quite wasteful and frustrating. Another
rationale is to be able to compile and test a very minimal core numpy
(the array object + a few things). I don't see the py3k port being
possible in the foreseeable future without this.

> The complex support could be a major effort in its own right if we
> need to rewrite all the current functions. That said, it would be nice
> if the complex support was separated out like the current real
> support. Tests to go along with it would be helpful. This also ties in
> with having build support for many platforms.

Pauli has worked on this a little, and I have actually worked on it
quite a bit myself, because I need minimal complex support for windows
64 bits (to fake libgfortran).
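As a rough illustration of the kind of special-value behaviour those
tests check (an untested python-level sketch -- the authoritative values
are the C99 Annex G tables):

import numpy as np

# The branch cut of sqrt runs along the negative real axis, so the sign
# of zero on the imaginary part should select the side of the cut:
print np.sqrt(np.complex128(complex(-1.0, 0.0)))   # expect 1j
print np.sqrt(np.complex128(complex(-1.0, -0.0)))  # expect -1j on a
                                                   # conforming platform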
I have already implemented around 10 core complex functions (cabs,
cangle, creal, cimag, cexp, cpow, csqrt, clog, ccos, csin, ctan), in
such a way that native C99 complex is used on platforms which support
it, and there is a quite thorough test suite which tests every special
value condition (negative zero, inf, nan) as specified in the C99
standard. It still lacks actual-value tests (!), FPU exception and
branch cut tests, and thorough tests on major platforms. And quite a few
other functions would be useful (hyperbolic trig).

>
> Build support
> -------------
>
> Once the code itself is split, some support is needed so that the code
> can be reused by third parties. The following issues need to be solved:
>
> 1. Compile the computational code into a shared or static library.
> 2. Once built, make the libraries available to third parties (distutils
> issues). Ideally, it should work for installed builds, in-place builds,
> etc...
> 3. Versioning, ABI/API compatibility issues.
>
> Trying to break out the build support itself might be useful.

What do you mean by break out, exactly? I have documented the already
implemented support:

http://docs.scipy.org/doc/numpy/reference/distutils.html#building-installable-c-libraries

> I think this needs some thought. This would essentially be a C library
> of iterator code. C++ is probably an easier language for such things as
> it handles the classes and inlining automatically. Which is to say if I
> had to deal with a lot of iterators I might choose a different language
> for implementation.

C++ is not an option for numpy (and if I had to choose another language
compared to C, I would rather take D, or a language which outputs C in
the spirit of vala :) ). I think handling iterators in C is OK: sure, it
is a bit messy, because of the lack of namespaces, templates and
operator overloading, but the increased portability and implementation
simplicity are worth it IMHO. When looking at ITK, I don't find it much
more readable/easy to use than our own. I also need to think more about
this after I finish reading the recent presentation from A. Alexandrescu
("Iterators Must Go"). Maybe there are some bits which could be applied
to the numpy iterator design.

> As to choosing a project, you should pick one that really interests
> you. How would you rank your own interest in these various proposals?

Well, it's not for me to decide what I work on exactly here :) I must
say that almost all of the above are things which are needed for NumPy,
things which I have thought about, and would enjoy working on. Maybe
that's masochism, but I spent so much time understanding the C code in
numpy that I actually enjoy working on it now :)

cheers,

David

From david at ar.media.kyoto-u.ac.jp Tue Aug 4 03:54:40 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Tue, 04 Aug 2009 16:54:40 +0900
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
In-Reply-To: 
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77A6E1.1050403@ar.media.kyoto-u.ac.jp>
Message-ID: <4A77E940.7070807@ar.media.kyoto-u.ac.jp>

Dave wrote:
> David Cournapeau <david at ar.media.kyoto-u.ac.jp> writes:
>
>> Matthew Brett wrote:
>>
>>> Hi,
>>>
>>> We are using numpy.distutils, and have run into this odd behavior in windows:
>>>
>> Short answer:
>>
>> I am afraid it cannot work as you want.
>> Basically, when you pass an option to build_ext, it does not affect
>> other distutils commands, which are run before build_ext and need the
>> compiler (config in this case, I think). So you need to pass the -c
>> option to every command affected by the compiler (build_ext,
>> build_clib and config IIRC).
>>
>> cheers,
>>
>> David
>>
>
> I'm having the same problems! Running Windows XP, Python 2.5.4
> (r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit (Intel)].
>
> In my distutils.cfg I've got:
>
> [build]
> compiler=mingw32
>
> [config]
> compiler = mingw32

Yes, config files are an alternative I did not mention. I never use
them because I prefer controlling the build on a per-package basis, and
the interaction between the command line and config files is not always
clear.

> python setup.py build build_ext --compiler=mingw32 appeared to work
> (barring a warning: numpy\core\setup_common.py:81: MismatchCAPIWarning)

The warning is harmless: it is just a reminder that before releasing
numpy 1.4.0, we will need to raise the C API version (to avoid problems
we had in the past with mismatched numpy versions). There is no point
updating it during dev time, I think.

> but then how do I create a .exe installer afterwards? python setup.py
> bdist_wininst fails with the same error message as before and python
> setup.py bdist_wininst --compiler=mingw32 fails with the message:
> error: option --compiler not recognized

You need to do as follows, if you want to control from the command line:

python setup.py build -c mingw32 bdist_wininst

That's how I build the official binaries.

cheers,

David

From dave.hirschfeld at gmail.com Tue Aug 4 04:34:44 2009
From: dave.hirschfeld at gmail.com (Dave)
Date: Tue, 4 Aug 2009 08:34:44 +0000 (UTC)
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77A6E1.1050403@ar.media.kyoto-u.ac.jp> <4A77E940.7070807@ar.media.kyoto-u.ac.jp>
Message-ID:

David Cournapeau <david at ar.media.kyoto-u.ac.jp> writes:
>
> You need to do as follows, if you want to control from the command line:
>
> python setup.py build -c mingw32 bdist_wininst
>
> That's how I build the official binaries.
>
> cheers,
>
> David
>

Running the command:

C:\dev\src\numpy>python setup.py build -c mingw32 bdist_wininst > build.txt

still gives me the error:

error: Python was built with Visual Studio 2003; extensions must be
built with a compiler than can generate compatible binaries. Visual
Studio 2003 was not found on this system. If you have Cygwin installed,
you can try compiling with MingW32, by passing "-c mingw32" to setup.py.

I tried without a distutils.cfg file and deleted the build directory
both times.

In case it helps, the build log should be available from
http://pastebin.com/m607992ba

Am I doing something wrong?

-Dave

From david at ar.media.kyoto-u.ac.jp Tue Aug 4 04:28:46 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Tue, 04 Aug 2009 17:28:46 +0900
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
In-Reply-To: 
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77A6E1.1050403@ar.media.kyoto-u.ac.jp> <4A77E940.7070807@ar.media.kyoto-u.ac.jp>
Message-ID: <4A77F13E.2020402@ar.media.kyoto-u.ac.jp>

Dave wrote:
> David Cournapeau <david at ar.media.kyoto-u.ac.jp> writes:
>
>> You need to do as follows, if you want to control from the command line:
>>
>> python setup.py build -c mingw32 bdist_wininst
>>
>> That's how I build the official binaries.
>> cheers,
>>
>> David
>>
>
> Running the command:
>
> C:\dev\src\numpy>python setup.py build -c mingw32 bdist_wininst > build.txt
>
> still gives me the error:
>
> error: Python was built with Visual Studio 2003; extensions must be
> built with a compiler than can generate compatible binaries. Visual
> Studio 2003 was not found on this system. If you have Cygwin installed,
> you can try compiling with MingW32, by passing "-c mingw32" to setup.py.
>
> I tried without a distutils.cfg file and deleted the build directory
> both times.
>
> In case it helps, the build log should be available from
> http://pastebin.com/m607992ba
>
> Am I doing something wrong?

No, I think you and Matthew actually found a bug in recent changes I
have done in distutils. I will fix it right away,

cheers,

David

From cournape at gmail.com Tue Aug 4 05:54:19 2009
From: cournape at gmail.com (David Cournapeau)
Date: Tue, 4 Aug 2009 18:54:19 +0900
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
In-Reply-To: <4A77F13E.2020402@ar.media.kyoto-u.ac.jp>
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77A6E1.1050403@ar.media.kyoto-u.ac.jp> <4A77E940.7070807@ar.media.kyoto-u.ac.jp> <4A77F13E.2020402@ar.media.kyoto-u.ac.jp>
Message-ID: <5b8d13220908040254n538c1e4flfe7ee2dd9aba96ef@mail.gmail.com>

On Tue, Aug 4, 2009 at 5:28 PM, David Cournapeau
<david at ar.media.kyoto-u.ac.jp> wrote:
>
> No, I think you and Matthew actually found a bug in recent changes I
> have done in distutils. I will fix it right away,

Ok, not right away, but could you check that r7280 fixed it for you?

cheers,

David

From dave.hirschfeld at gmail.com Tue Aug 4 06:03:27 2009
From: dave.hirschfeld at gmail.com (Dave)
Date: Tue, 4 Aug 2009 10:03:27 +0000 (UTC)
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77A6E1.1050403@ar.media.kyoto-u.ac.jp> <4A77E940.7070807@ar.media.kyoto-u.ac.jp> <4A77F13E.2020402@ar.media.kyoto-u.ac.jp> <5b8d13220908040254n538c1e4flfe7ee2dd9aba96ef@mail.gmail.com>
Message-ID:

David Cournapeau <cournape at gmail.com> writes:
>
> On Tue, Aug 4, 2009 at 5:28 PM, David
> Cournapeau <david at ar.media.kyoto-u.ac.jp> wrote:
>
> > No, I think you and Matthew actually found a bug in recent changes I
> > have done in distutils. I will fix it right away,
>
> Ok, not right away, but could you check that r7280 fixed it for you?
>
> cheers,
>
> David
>

Works for me.

adding 'SCRIPTS\f2py.py'
creating dist
removing 'build\bdist.win32\wininst' (and everything under it)

Thanks for the quick fix!

-Dave

From dave.hirschfeld at gmail.com Tue Aug 4 06:23:55 2009
From: dave.hirschfeld at gmail.com (Dave)
Date: Tue, 4 Aug 2009 10:23:55 +0000 (UTC)
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77A6E1.1050403@ar.media.kyoto-u.ac.jp> <4A77E940.7070807@ar.media.kyoto-u.ac.jp> <4A77F13E.2020402@ar.media.kyoto-u.ac.jp> <5b8d13220908040254n538c1e4flfe7ee2dd9aba96ef@mail.gmail.com>
Message-ID:

Dave <dave.hirschfeld at gmail.com> writes:
>
> Works for me.
>
> -Dave
>

Except now when trying to compile the latest scipy I get the following error:

C:\dev\src\scipy>svn up
Fetching external item into 'doc\sphinxext'
External at revision 7280.
At revision 5890.
C:\dev\src\scipy>python setup.py bdist_wininst
Traceback (most recent call last):
  File "setup.py", line 160, in <module>
    setup_package()
  File "setup.py", line 152, in setup_package
    configuration=configuration )
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\core.py", line 152, in setup
    config = configuration()
  File "setup.py", line 118, in configuration
    config.add_subpackage('scipy')
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\misc_util.py", line 890, in add_subpackage
    caller_level = 2)
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\misc_util.py", line 859, in get_subpackage
    caller_level = caller_level + 1)
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\misc_util.py", line 796, in _get_configuration_from_setup_py
    config = setup_module.configuration(*args)
  File "scipy\setup.py", line 20, in configuration
    config.add_subpackage('special')
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\misc_util.py", line 890, in add_subpackage
    caller_level = 2)
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\misc_util.py", line 859, in get_subpackage
    caller_level = caller_level + 1)
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\misc_util.py", line 796, in _get_configuration_from_setup_py
    config = setup_module.configuration(*args)
  File "scipy\special\setup.py", line 45, in configuration
    extra_info=get_info("npymath")
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\misc_util.py", line 1954, in get_info
    pkg_info = get_pkg_info(pkgname, dirs)
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\misc_util.py", line 1921, in get_pkg_info
    return read_config(pkgname, dirs)
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\npy_pkg_config.py", line 235, in read_config
    v = _read_config_imp(pkg_to_filename(pkgname), dirs)
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\npy_pkg_config.py", line 221, in _read_config_imp
    meta, vars, sections, reqs = _read_config(filenames)
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\npy_pkg_config.py", line 205, in _read_config
    meta, vars, sections, reqs = parse_config(f, dirs)
  File "C:\dev\bin\Python25\Lib\site-packages\numpy\distutils\npy_pkg_config.py", line 177, in parse_config
    raise PkgNotFound("Could not find file(s) %s" % str(filenames))
numpy.distutils.npy_pkg_config.PkgNotFound: Could not find file(s)
['C:\\dev\\bin\\Python25\\lib\\site-packages\\numpy\\core\\lib\\npy-pkg-config\\npymath.ini']

In the numpy\core\lib directory there is no npy-pkg-config
sub-directory, only a single file - libnpymath.a

Is this expected - has scipy not yet caught up with the numpy changes,
or is this a numpy issue?

-Dave

From david at ar.media.kyoto-u.ac.jp Tue Aug 4 06:07:45 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Tue, 04 Aug 2009 19:07:45 +0900
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
In-Reply-To: 
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77A6E1.1050403@ar.media.kyoto-u.ac.jp> <4A77E940.7070807@ar.media.kyoto-u.ac.jp> <4A77F13E.2020402@ar.media.kyoto-u.ac.jp> <5b8d13220908040254n538c1e4flfe7ee2dd9aba96ef@mail.gmail.com>
Message-ID: <4A780871.5070408@ar.media.kyoto-u.ac.jp>

Dave wrote:
> Dave <dave.hirschfeld at gmail.com> writes:
>
>> Works for me.
>>
>> -Dave
>>
>
> Except now when trying to compile the latest scipy I get the following error:
>

Was numpy installed from a bdist_wininst installer, or did you use the
install method directly?
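If you want to check the numpy install quickly, something like this
should either print the npymath info or raise the same PkgNotFound
error (untested, but get_info is the function scipy ends up calling in
your traceback):

python -c "from numpy.distutils.misc_util import get_info; print get_info('npymath')"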
David

From dave.hirschfeld at gmail.com Tue Aug 4 07:20:30 2009
From: dave.hirschfeld at gmail.com (Dave)
Date: Tue, 4 Aug 2009 11:20:30 +0000 (UTC)
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77A6E1.1050403@ar.media.kyoto-u.ac.jp> <4A77E940.7070807@ar.media.kyoto-u.ac.jp> <4A77F13E.2020402@ar.media.kyoto-u.ac.jp> <5b8d13220908040254n538c1e4flfe7ee2dd9aba96ef@mail.gmail.com> <4A780871.5070408@ar.media.kyoto-u.ac.jp>
Message-ID:

David Cournapeau <david at ar.media.kyoto-u.ac.jp> writes:

>
> Dave wrote:
> > Dave <dave.hirschfeld at gmail.com> writes:
> >
> >> Works for me.
> >>
> >> -Dave
> >>
> >
> > Except now when trying to compile the latest scipy I get the following error:
> >
>
> Was numpy installed from a bdist_wininst installer, or did you use the
> install method directly?
>
> David
>

Numpy was installed with the bdist_wininst installer.

In case it's relevant, the installer seemed to create 2 egg-info files:
numpy-1.4.0.dev7277-py2.5.egg-info
numpy-1.4.0.dev7280-py2.5.egg-info

Deleting the numpy directory and the egg-info files and re-installing
from the bdist_wininst installer gave the same result (with the above 2
egg-info files).

Installing numpy with python setup.py install seemed to work (at least
the npymath.ini file was now in the numpy\core\lib\npy-pkg-config
folder).

Compiling scipy got much further now, but still failed with the below
error message:

C:\dev\src\scipy>python setup.py bdist_wininst > build.txt
Warning: No configuration returned, assuming unavailable.
C:\dev\bin\Python25\lib\site-packages\numpy\distutils\command\config.py:394: DeprecationWarning:
+++++++++++++++++++++++++++++++++++++++++++++++++
Usage of get_output is deprecated: please do not
use it anymore, and avoid configuration checks
involving running executable on the target machine.
+++++++++++++++++++++++++++++++++++++++++++++++++
  DeprecationWarning)
C:\dev\bin\Python25\lib\site-packages\numpy\distutils\system_info.py:452: UserWarning:
    UMFPACK sparse solver (http://www.cise.ufl.edu/research/sparse/umfpack/)
    not found. Directories to search for the libraries can be specified in the
    numpy/distutils/site.cfg file (section [umfpack]) or by setting
    the UMFPACK environment variable.
  warnings.warn(self.notfounderror.__doc__)
error: Command "C:\dev\bin\mingw\bin\g77.exe -g -Wall -mno-cygwin -g -Wall
-mno-cygwin -shared build\temp.win32-2.5\Release\scipy\special\_cephesmodule.o
build\temp.win32-2.5\Release\scipy\special\amos_wrappers.o
build\temp.win32-2.5\Release\scipy\special\specfun_wrappers.o
build\temp.win32-2.5\Release\scipy\special\toms_wrappers.o
build\temp.win32-2.5\Release\scipy\special\cdf_wrappers.o
build\temp.win32-2.5\Release\scipy\special\ufunc_extras.o
-LC:\dein\Python25\Lib\site-packages -LC:\dev\bin\mingw\lib
-LC:\dev\bin\mingw\lib\gcc\mingw32\3.4.5 -LC:\dev\bin\Python25\libs
-LC:\dev\bin\Python25\PCBuild -Lbuild\temp.win32-2.5 -lsc_amos -lsc_toms
-lsc_c_misc -lsc_cephes -lsc_mach -lsc_cdf -lsc_specfun -lnpymath
-lpython25 -lg2c -o build\lib.win32-2.5\scipy\special\_cephes.pyd"
failed with exit status 1

The output of the build is available from http://pastebin.com/d3efe5650

Note the strange character on line 4600.
In my terminal window this is displayed as:

compile options: '-D_USE_MATH_DEFINES -D_USE_MATH_DEFINES
-IC:\dein\Python25\Lib\site-packages
-IC:\dev\bin\Python25\lib\site-packages\numpy\core\include
-IC:\dev\bin\Python25\include -IC:\dev\bin\Python25\PC -c'

HTH,
Dave

From markbak at gmail.com Tue Aug 4 07:23:08 2009
From: markbak at gmail.com (Mark Bakker)
Date: Tue, 4 Aug 2009 13:23:08 +0200
Subject: [Numpy-discussion] speed of atleast_1d and friends
Message-ID: <6946b9500908040423n5ed4beawd5c1b0ca21823d06@mail.gmail.com>

Hello all,

I am making a lot of use of atleast_1d and atleast_2d in my routines.

Does anybody know whether this will slow down my code significantly?

Thanks,

Mark

From david at ar.media.kyoto-u.ac.jp Tue Aug 4 07:13:59 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Tue, 04 Aug 2009 20:13:59 +0900
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
In-Reply-To: 
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77A6E1.1050403@ar.media.kyoto-u.ac.jp> <4A77E940.7070807@ar.media.kyoto-u.ac.jp> <4A77F13E.2020402@ar.media.kyoto-u.ac.jp> <5b8d13220908040254n538c1e4flfe7ee2dd9aba96ef@mail.gmail.com> <4A780871.5070408@ar.media.kyoto-u.ac.jp>
Message-ID: <4A7817F7.3@ar.media.kyoto-u.ac.jp>

Dave wrote:
> David Cournapeau <david at ar.media.kyoto-u.ac.jp> writes:
>
>> Dave wrote:
>>
>>> Dave <dave.hirschfeld at gmail.com> writes:
>>>
>>>> Works for me.
>>>>
>>>> -Dave
>>>>
>>> Except now when trying to compile the latest scipy I get the following error:
>>>
>> Was numpy installed from a bdist_wininst installer, or did you use the
>> install method directly?
>>
>> David
>>
>
> Numpy was installed with the bdist_wininst installer.
>
> In case it's relevant, the installer seemed to create 2 egg-info files:
> numpy-1.4.0.dev7277-py2.5.egg-info
> numpy-1.4.0.dev7280-py2.5.egg-info
>
> Deleting the numpy directory and the egg-info files and re-installing
> from the bdist_wininst installer gave the same result (with the above 2
> egg-info files).
>
> Installing numpy with python setup.py install seemed to work (at least
> the npymath.ini file was now in the numpy\core\lib\npy-pkg-config
> folder).
>

I think I understand the problem. Unfortunately, that looks tricky to
solve... I hate distutils.

David

From emmanuelle.gouillart at normalesup.org Tue Aug 4 09:03:41 2009
From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart)
Date: Tue, 4 Aug 2009 15:03:41 +0200
Subject: [Numpy-discussion] speed of atleast_1d and friends
In-Reply-To: <6946b9500908040423n5ed4beawd5c1b0ca21823d06@mail.gmail.com>
References: <6946b9500908040423n5ed4beawd5c1b0ca21823d06@mail.gmail.com>
Message-ID: <20090804130341.GD9488@phare.normalesup.org>

Hello,

> I am making a lot of use of atleast_1d and atleast_2d in my routines.
> Does anybody know whether this will slow down my code significantly?

If there is no need to make copies (i.e. if you take arrays as
parameters (?)), calls to atleast_1d and atleast_2d should be extremely
fast: it's just a question of creating a different view, I think. Did
you profile your code to check?
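A quick check would be something along these lines (I have not measured
this myself):

python -m timeit -s "import numpy as np; a = np.ones(1000)" "np.atleast_1d(a)"
python -m timeit -s "import numpy as np; a = np.ones(1000)" "np.atleast_2d(a)"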
Cheers,

Emmanuelle

From afriedle at indiana.edu Tue Aug 4 09:39:15 2009
From: afriedle at indiana.edu (Andrew Friedley)
Date: Tue, 04 Aug 2009 09:39:15 -0400
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: <4A772A15.8090407@gmail.com>
References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> <4A7723A8.8010107@indiana.edu> <4A772A15.8090407@gmail.com>
Message-ID: <4A783A03.9090709@indiana.edu>

Bruce Southey wrote:
> Hi,
> Can you try these from the command line:
> python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000,
> (2*3.14159) / 1000, dtype=np.float32)"
> python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000,
> (2*3.14159) / 1000, dtype=np.float32); b=np.sin(a)"
> python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000,
> (2*3.14159) / 1000, dtype=np.float32); np.sin(a)"
> python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000,
> (2*3.14159) / 1000, dtype=np.float32)" "np.sin(a)"
>
> The first should be similar for different dtypes because it is just
> array creation. The second extends that by storing the sin into another
> array. I am not sure how to interpret the third but in the Python prompt
> it would print it to screen. The last causes Python to handle two
> arguments which is slow using float32 but not for float64 and float128
> suggesting compiler issue such as not using SSE or similar.

Results:

$ python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000,
(2*3.14159) / 1000, dtype=np.float32)"
100 loops, best of 3: 0.0811 usec per loop
$ python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000,
(2*3.14159) / 1000, dtype=np.float32); b=np.sin(a)"
100 loops, best of 3: 0.11 usec per loop
$ python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000,
(2*3.14159) / 1000, dtype=np.float32); np.sin(a)"
100 loops, best of 3: 0.11 usec per loop
$ python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000,
(2*3.14159) / 1000, dtype=np.float32)" "np.sin(a)"
100 loops, best of 3: 112 msec per loop
$ python -m timeit -n 100 -s "import numpy as np; a = np.arange(0.0, 1000,
(2*3.14159) / 1000, dtype=np.float64)" "np.sin(a)"
100 loops, best of 3: 13.2 msec per loop

I think the second and third are effectively the same; both create an
array containing the result. The second assigns that array to a value,
while the third does not, so it should get garbage collected. The
fourth one is the only one that actually runs the sin in the timing
loop. I don't understand what you mean by causing Python to handle two
arguments? The fifth run I added uses float64 to compare (and
reproduces the problem).

Andrew

From kwgoodman at gmail.com Tue Aug 4 10:37:03 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Tue, 4 Aug 2009 07:37:03 -0700
Subject: [Numpy-discussion] speed of atleast_1d and friends
In-Reply-To: <6946b9500908040423n5ed4beawd5c1b0ca21823d06@mail.gmail.com>
References: <6946b9500908040423n5ed4beawd5c1b0ca21823d06@mail.gmail.com>
Message-ID:

On Tue, Aug 4, 2009 at 4:23 AM, Mark Bakker wrote:
> Hello all,
> I am making a lot of use of atleast_1d and atleast_2d in my routines.
> Does anybody know whether this will slow down my code significantly?
> Thanks,
> Mark

Here's atleast_1d:

def atleast_1d(*arys):
    res = []
    for ary in arys:
        res.append(array(ary,copy=False,subok=True,ndmin=1))
    if len(res) == 1:
        return res[0]
    else:
        return res

If you only pass in one array at a time, that reduces to:

def myatleast_1d(ary):
    return array(ary, copy=False, subok=True, ndmin=1)

That might save some time.

I'm always amazed at the solutions people come up with on this list. So
if you send an example, someone might be able to get rid of the need
for atleast_1d.

From gael.varoquaux at normalesup.org Tue Aug 4 10:41:01 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Tue, 4 Aug 2009 16:41:01 +0200
Subject: [Numpy-discussion] speed of atleast_1d and friends
In-Reply-To: 
References: <6946b9500908040423n5ed4beawd5c1b0ca21823d06@mail.gmail.com>
Message-ID: <20090804144101.GM17519@phare.normalesup.org>

On Tue, Aug 04, 2009 at 07:37:03AM -0700, Keith Goodman wrote:
> I'm always amazed at the solutions people come up with on this list.
> So if you send an example, someone might be able to get rid of the
> need for atleast_1d.

On the other hand, it costs almost no time, and makes your API more
robust (for instance it can be used with numbers as well as arrays). I
am all for abusive use of np.atleast_1d.

Gaël

From afriedle at indiana.edu Tue Aug 4 11:14:59 2009
From: afriedle at indiana.edu (Andrew Friedley)
Date: Tue, 04 Aug 2009 11:14:59 -0400
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: 
References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> <4A7723A8.8010107@indiana.edu>
Message-ID: <4A785073.9060009@indiana.edu>

Charles R Harris wrote:
> On Mon, Aug 3, 2009 at 11:51 AM, Andrew Friedley wrote:
>
>> Charles R Harris wrote:
>>> What compiler versions are folks using? In the slow cases, what is the
>>> timing for converting to double, computing the sin, then casting back to
>>> single?
>> I did this, is this the right way to do that?
>>
>> t = timeit.Timer("numpy.sin(a.astype(numpy.float64)).astype(numpy.float32)",
>>         "import numpy\n"
>>         "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float64)")
>> print "sin converted float 32/64", min(t.repeat(3, 10))
>>
>> Timings on my opteron system (2-socket 2-core 2GHz):
>>
>> sin float32 1.13407707214
>> sin float64 0.133460998535
>> sin converted float 32/64 0.18202996254
>>
>> Not too surprising I guess.
>>
>> gcc --version shows:
>>
>> gcc (GCC) 4.1.2 20080704 (Red Hat 4.1.2-44)
>>
>> My compile flags for my Python 2.6.1/NumPy 1.3.0 builds:
>>
>> -Os -fomit-frame-pointer -pipe -s -march=k8 -m64
>>
>
> That looks right. When numpy doesn't find a *f version it basically does
> that conversion. This is beginning to look like a hardware/software
> implementation problem, maybe compiler related. That is, I suspect the fast
> times come from using a hardware implementation. What happens if you use -O2
> instead of -Os?

Do you know where this conversion is, in the code? The impression I got
from my quick look at the code was that a wrapper sinf was defined that
just calls sin. I guess the typecast to float in there will do the
conversion, is that what you are referring to, or something at a higher
level?

I recompiled the same versions of Python/NumPy, using the same flags
except -O2 instead of -Os, the behavior is still the same.
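(If it is useful, the cost of the conversion alone can be isolated with
something like this -- I haven't timed it separately yet:

python -m timeit -s "import numpy; a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float32)" "a.astype(numpy.float64)"
)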
Andrew

From cournape at gmail.com Tue Aug 4 11:20:13 2009
From: cournape at gmail.com (David Cournapeau)
Date: Wed, 5 Aug 2009 00:20:13 +0900
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: <4A785073.9060009@indiana.edu>
References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> <4A7723A8.8010107@indiana.edu> <4A785073.9060009@indiana.edu>
Message-ID: <5b8d13220908040820u79b6d5felc3dd2e338cc55433@mail.gmail.com>

On Wed, Aug 5, 2009 at 12:14 AM, Andrew Friedley wrote:
> Do you know where this conversion is, in the code? The impression I got
> from my quick look at the code was that a wrapper sinf was defined that
> just calls sin. I guess the typecast to float in there will do the
> conversion

Exact. Given your CPU, compared to my macbook, it looks like the
float32 is the problem (i.e. the float64 is not particularly fast). I
really can't see what could cause such a slowdown: the range over which
you evaluate sin should not cause denormal numbers - just to be sure,
could you try the same benchmark but using a simple array of constant
values (say numpy.ones(1000))? Also, you may want to check what happens
if you force raising errors in case of FPU exceptions
(numpy.seterr(all='raise')).

cheers,

David

From cournape at gmail.com Tue Aug 4 12:31:04 2009
From: cournape at gmail.com (David Cournapeau)
Date: Wed, 5 Aug 2009 01:31:04 +0900
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
In-Reply-To: <4A7817F7.3@ar.media.kyoto-u.ac.jp>
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77E940.7070807@ar.media.kyoto-u.ac.jp> <4A77F13E.2020402@ar.media.kyoto-u.ac.jp> <5b8d13220908040254n538c1e4flfe7ee2dd9aba96ef@mail.gmail.com> <4A780871.5070408@ar.media.kyoto-u.ac.jp> <4A7817F7.3@ar.media.kyoto-u.ac.jp>
Message-ID: <5b8d13220908040931ide8259dn7c07ecf8e82e3099@mail.gmail.com>

On Tue, Aug 4, 2009 at 8:13 PM, David Cournapeau wrote:
> I think I understand the problem. Unfortunately, that looks tricky to
> solve... I hate distutils.

Ok - should be fixed in r7281.

David

From gokhansever at gmail.com Tue Aug 4 12:46:57 2009
From: gokhansever at gmail.com (Gökhan Sever)
Date: Tue, 4 Aug 2009 11:46:57 -0500
Subject: [Numpy-discussion] Why NaN?
Message-ID: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>

Hello,

I know this has to have a very simple answer, but I'm stuck at this
very moment and can't get a meaningful result out of np.mean()

In [121]: a = array([NaN, 4, NaN, 12])

In [122]: b = array([NaN, 2, NaN, 3])

In [123]: c = a/b

In [124]: mean(c)
Out[124]: nan

In [125]: mean a
--------> mean(a)
Out[125]: nan

Further when I tried:

In [138]: c
Out[138]: array([ NaN,   2.,  NaN,   4.])

In [139]: np.where(c==NaN)
Out[139]: (array([], dtype=int32),)

In [141]: mask = [c != NaN]

In [142]: mask
Out[142]: [array([ True,  True,  True,  True], dtype=bool)]

Any ideas?

--
Gökhan

From dave.hirschfeld at gmail.com Tue Aug 4 12:51:56 2009
From: dave.hirschfeld at gmail.com (Dave)
Date: Tue, 4 Aug 2009 16:51:56 +0000 (UTC)
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77E940.7070807@ar.media.kyoto-u.ac.jp> <4A77F13E.2020402@ar.media.kyoto-u.ac.jp> <5b8d13220908040254n538c1e4flfe7ee2dd9aba96ef@mail.gmail.com> <4A780871.5070408@ar.media.kyoto-u.ac.jp> <4A7817F7.3@ar.media.kyoto-u.ac.jp> <5b8d13220908040931ide8259dn7c07ecf8e82e3099@mail.gmail.com>
Message-ID:

David Cournapeau <cournape at gmail.com> writes:
>
> On Tue, Aug 4, 2009 at 8:13 PM, David
> Cournapeau <david at ar.media.kyoto-u.ac.jp> wrote:
>
> > I think I understand the problem. Unfortunately, that looks tricky to
> > solve... I hate distutils.
>
> Ok - should be fixed in r7281.
>
> David
>

Well, that seemed to fix the bdist_wininst issue.

The problem compiling scipy remains, but I assume that's probably
something I should take up on the scipy list?

FWIW running the full numpy test suite (verbose=10) I get 7 failures.
The results are available from http://pastebin.com/m5505d4b5

The "errors" seem to be related to the NaN handling.

Thanks for the help today!

-Dave

From robert.kern at gmail.com Tue Aug 4 12:51:56 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 4 Aug 2009 11:51:56 -0500
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
Message-ID: <3d375d730908040951u2090502cxc3a1704a0e1c906a@mail.gmail.com>

On Tue, Aug 4, 2009 at 11:46, Gökhan Sever wrote:
> Hello,
>
> I know this has to have a very simple answer, but I'm stuck at this
> very moment and can't get a meaningful result out of np.mean()
>
> In [121]: a = array([NaN, 4, NaN, 12])
>
> In [122]: b = array([NaN, 2, NaN, 3])
>
> In [123]: c = a/b
>
> In [124]: mean(c)
> Out[124]: nan
>
> In [125]: mean a
> --------> mean(a)
> Out[125]: nan
>
> Further when I tried:
>
> In [138]: c
> Out[138]: array([ NaN,   2.,  NaN,   4.])
>
> In [139]: np.where(c==NaN)
> Out[139]: (array([], dtype=int32),)
>
> In [141]: mask = [c != NaN]
>
> In [142]: mask
> Out[142]: [array([ True,  True,  True,  True], dtype=bool)]

Yeah, NaN != NaN. It's a feature, not a bug.

Use np.ma.masked_invalid(c).mean().

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From kwgoodman at gmail.com Tue Aug 4 12:54:15 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Tue, 4 Aug 2009 09:54:15 -0700
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
Message-ID:

On Tue, Aug 4, 2009 at 9:46 AM, Gökhan Sever wrote:
> Hello,
>
> I know this has to have a very simple answer, but I'm stuck at this
> very moment and can't get a meaningful result out of np.mean()
>
> In [121]: a = array([NaN, 4, NaN, 12])
>
> In [122]: b = array([NaN, 2, NaN, 3])
>
> In [123]: c = a/b
>
> In [124]: mean(c)
> Out[124]: nan
>
> In [125]: mean a
> --------> mean(a)
> Out[125]: nan
>
> Further when I tried:
>
> In [138]: c
> Out[138]: array([ NaN,   2.,  NaN,   4.])
>
> In [139]: np.where(c==NaN)
> Out[139]: (array([], dtype=int32),)
>
> In [141]: mask = [c != NaN]
>
> In [142]: mask
> Out[142]: [array([ True,  True,  True,  True], dtype=bool)]
>
> Any ideas?
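NaN compares unequal to everything, including itself, so c == NaN can
never match; isnan is the way to locate them, e.g.:

>> np.where(np.isnan(c))
   (array([0, 2]),)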
>> a = array([NaN, 4, NaN, 12])
>> b = array([NaN, 2, NaN, 3])
>> c = a/b
>> from scipy import stats
>> stats.nan [tab]
stats.nanmean    stats.nanmedian  stats.nanstd
>> stats.nanmean(c)
   3.0
>> stats.nanmean(a)
   8.0
>> c[isnan(c)]
   array([ NaN,  NaN])

From perry at stsci.edu Tue Aug 4 12:57:23 2009
From: perry at stsci.edu (Perry Greenfield)
Date: Tue, 4 Aug 2009 12:57:23 -0400
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
Message-ID:

Note that NaN generally contaminates sums and other net results (as it
should). You should filter them out (there is more than one way to do
that). But also note that the IEEE standard for floating point numbers
requires NaN != NaN. Thus any attempt to find NaNs that way is destined
to fail. Use the function isnan() instead to generate a mask.

Perry

On Aug 4, 2009, at 12:46 PM, Gökhan Sever wrote:

> Hello,
>
> I know this has to have a very simple answer, but I'm stuck at this
> very moment and can't get a meaningful result out of np.mean()
>
> In [121]: a = array([NaN, 4, NaN, 12])
>
> In [122]: b = array([NaN, 2, NaN, 3])
>
> In [123]: c = a/b
>
> In [124]: mean(c)
> Out[124]: nan
>
> In [125]: mean a
> --------> mean(a)
> Out[125]: nan
>
> Further when I tried:
>
> In [138]: c
> Out[138]: array([ NaN,   2.,  NaN,   4.])
>
> In [139]: np.where(c==NaN)
> Out[139]: (array([], dtype=int32),)
>
> In [141]: mask = [c != NaN]
>
> In [142]: mask
> Out[142]: [array([ True,  True,  True,  True], dtype=bool)]
>
> Any ideas?
>
> --
> Gökhan

From kwgoodman at gmail.com Tue Aug 4 12:59:06 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Tue, 4 Aug 2009 09:59:06 -0700
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: 
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
Message-ID:

On Tue, Aug 4, 2009 at 9:54 AM, Keith Goodman wrote:
> On Tue, Aug 4, 2009 at 9:46 AM, Gökhan Sever wrote:
>> Hello,
>>
>> I know this has to have a very simple answer, but I'm stuck at this
>> very moment and can't get a meaningful result out of np.mean()
>>
>> In [121]: a = array([NaN, 4, NaN, 12])
>>
>> In [122]: b = array([NaN, 2, NaN, 3])
>>
>> In [123]: c = a/b
>>
>> In [124]: mean(c)
>> Out[124]: nan
>>
>> In [125]: mean a
>> --------> mean(a)
>> Out[125]: nan
>>
>> Further when I tried:
>>
>> In [138]: c
>> Out[138]: array([ NaN,   2.,  NaN,   4.])
>>
>> In [139]: np.where(c==NaN)
>> Out[139]: (array([], dtype=int32),)
>>
>> In [141]: mask = [c != NaN]
>>
>> In [142]: mask
>> Out[142]: [array([ True,  True,  True,  True], dtype=bool)]
>>
>> Any ideas?
>
>>> a = array([NaN, 4, NaN, 12])
>>> b = array([NaN, 2, NaN, 3])
>>> c = a/b
>>> from scipy import stats
>>> stats.nan [tab]
> stats.nanmean    stats.nanmedian  stats.nanstd
>>> stats.nanmean(c)
>    3.0
>>> stats.nanmean(a)
>    8.0
>>> c[isnan(c)]
>    array([ NaN,  NaN])

One more:

>> c[isfinite(c)].mean()
   3.0

From josef.pktd at gmail.com Tue Aug 4 13:05:01 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 4 Aug 2009 13:05:01 -0400
Subject: [Numpy-discussion] Why NaN?
From josef.pktd at gmail.com Tue Aug 4 13:05:01 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 4 Aug 2009 13:05:01 -0400
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: 
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
Message-ID: <1cd32cbb0908041005m7405c9f8j5b75619c127c9180@mail.gmail.com>

On Tue, Aug 4, 2009 at 12:59 PM, Keith Goodman wrote:
> On Tue, Aug 4, 2009 at 9:54 AM, Keith Goodman wrote:
>> On Tue, Aug 4, 2009 at 9:46 AM, Gökhan Sever wrote:
>>> Hello,
>>>
>>> I know this has to have a very simple answer, but stuck at this very moment
>>> and can't get a meaningful result out of np.mean()
>>>
>>> In [121]: a = array([NaN, 4, NaN, 12])
>>>
>>> In [122]: b = array([NaN, 2, NaN, 3])
>>>
>>> In [123]: c = a/b
>>>
>>> In [124]: mean(c)
>>> Out[124]: nan
>>>
>>> In [125]: mean a
>>> --------> mean(a)
>>> Out[125]: nan
>>>
>>> Further when I tried:
>>>
>>> In [138]: c
>>> Out[138]: array([ NaN,   2.,  NaN,   4.])
>>>
>>> In [139]: np.where(c==NaN)
>>> Out[139]: (array([], dtype=int32),)
>>>
>>> In [141]: mask = [c != NaN]
>>>
>>> In [142]: mask
>>> Out[142]: [array([ True,  True,  True,  True], dtype=bool)]
>>>
>>> Any ideas?
>>
>>>> a = array([NaN, 4, NaN, 12])
>>>> b = array([NaN, 2, NaN, 3])
>>>> c = a/b
>>>> from scipy import stats
>>>> stats.nan [tab]
>> stats.nanmean    stats.nanmedian  stats.nanstd
>>>> stats.nanmean(c)
>>   3.0
>>>> stats.nanmean(a)
>>   8.0
>>>> c[isnan(c)]
>>   array([ NaN,  NaN])
>
> One more:
>
>>> c[isfinite(c)].mean()
>   3.0

What's going on with the response time here?

I cannot even finish reading the question and start python.

Josef

From robert.kern at gmail.com Tue Aug 4 13:08:36 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 4 Aug 2009 12:08:36 -0500
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <1cd32cbb0908041005m7405c9f8j5b75619c127c9180@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <1cd32cbb0908041005m7405c9f8j5b75619c127c9180@mail.gmail.com>
Message-ID: <3d375d730908041008k33d2161ayf9a35b04b0ab85ca@mail.gmail.com>

On Tue, Aug 4, 2009 at 12:05, wrote:

> What's going on with the response time here?
>
> I cannot even finish reading the question and start python.

Practice. :-)

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From kwgoodman at gmail.com Tue Aug 4 13:11:07 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Tue, 4 Aug 2009 10:11:07 -0700
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <1cd32cbb0908041005m7405c9f8j5b75619c127c9180@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <1cd32cbb0908041005m7405c9f8j5b75619c127c9180@mail.gmail.com>
Message-ID: 

On Tue, Aug 4, 2009 at 10:05 AM, wrote:
> What's going on with the response time here?
>
> I cannot even finish reading the question and start python.

The trick is to not read the entire question. I usually reply after
reading the subj line. Or just auto-reply with "x.sort() returns None"
which seems to be the most common question.
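Since it comes up so often, here is that most common answer as a quick
sketch (np assumed to be numpy; a Python list's sort() method behaves the
same way):

import numpy as np

a = np.array([3, 1, 2])
b = a.sort()      # sorts a in place...
print(a)          # [1 2 3]
print(b)          # None -- the in-place sort returns nothing
c = np.sort(a)    # np.sort() returns a sorted copy instead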
From charlesr.harris at gmail.com Tue Aug 4 13:16:28 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 4 Aug 2009 11:16:28 -0600
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
In-Reply-To: 
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77F13E.2020402@ar.media.kyoto-u.ac.jp> <5b8d13220908040254n538c1e4flfe7ee2dd9aba96ef@mail.gmail.com> <4A780871.5070408@ar.media.kyoto-u.ac.jp> <4A7817F7.3@ar.media.kyoto-u.ac.jp> <5b8d13220908040931ide8259dn7c07ecf8e82e3099@mail.gmail.com>
Message-ID: 

On Tue, Aug 4, 2009 at 10:51 AM, Dave wrote:

> David Cournapeau gmail.com> writes:
> >
> > On Tue, Aug 4, 2009 at 8:13 PM, David
> > Cournapeau ar.media.kyoto-u.ac.jp> wrote:
> >
> > > I think I understand the problem. Unfortunately, that's looks tricky to
> > > solve... I hate distutils.
> >
> > Ok - should be fixed in r7281.
> >
> > David
>
> Well, that seemed to fix the bdist_wininst issue.
>
> The problem compiling scipy remains, but I assume that's probably something
> I should take up on the scipy list?
>
> FWIW running the full numpy test suite (verbose=10) I get 7 failures. The
> results are available from http://pastebin.com/m5505d4b5
>
> The "errors" seem to be related to the NaN handling.

The nan problems come from these tests:

    # atan2(+-infinity, -infinity) returns +-3*pi/4.
    yield assert_almost_equal, ncu.arctan2( np.inf, -np.inf),  0.75 * np.pi
    yield assert_almost_equal, ncu.arctan2(-np.inf, -np.inf), -0.75 * np.pi

    # atan2(+-infinity, +infinity) returns +-pi/4.
    yield assert_almost_equal, ncu.arctan2( np.inf, np.inf),  0.25 * np.pi
    yield assert_almost_equal, ncu.arctan2(-np.inf, np.inf), -0.25 * np.pi

So the problem seems to be with the inf handling. Windows arctan2 is known
to be wtf-buggy and I suspect that is what is being tested.

Chuck
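For reference, the special values these tests expect can be checked
directly; on a platform with a conforming atan2 the following holds (a
quick sketch, not part of the original thread):

import numpy as np

print(np.arctan2(np.inf, -np.inf))    #  2.35619449019  ( 3*pi/4)
print(np.arctan2(-np.inf, -np.inf))   # -2.35619449019  (-3*pi/4)
print(np.arctan2(np.inf, np.inf))     #  0.785398163397 ( pi/4)
print(np.arctan2(-np.inf, np.inf))    # -0.785398163397 (-pi/4)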
From afriedle at indiana.edu Tue Aug 4 13:19:22 2009
From: afriedle at indiana.edu (Andrew Friedley)
Date: Tue, 04 Aug 2009 13:19:22 -0400
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: <5b8d13220908040820u79b6d5felc3dd2e338cc55433@mail.gmail.com>
References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> <4A76EFD3.5010508@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> <4A7723A8.8010107@indiana.edu> <4A785073.9060009@indiana.edu> <5b8d13220908040820u79b6d5felc3dd2e338cc55433@mail.gmail.com>
Message-ID: <4A786D9A.1040703@indiana.edu>

David Cournapeau wrote:
> On Wed, Aug 5, 2009 at 12:14 AM, Andrew Friedley wrote:
>
>> Do you know where this conversion is, in the code? The impression I got
>> from my quick look at the code was that a wrapper sinf was defined that
>> just calls sin. I guess the typecast to float in there will do the
>> conversion
>
> Exact. Given your CPU, compared to my macbook, it looks like the
> float32 is the problem (i.e. the float64 is not particularly fast). I
> really can't see what could cause such a slowdown: the range over
> which you evaluate sin should not cause denormal numbers - just to be
> sure, could you try the same benchmark but using a simple array of
> constant values (say numpy.ones(1000)) ? Also, you may want to check
> what happens if you force raising errors in case of FPU exceptions
> (numpy.seterr(raise="all")).

OK, have some interesting results. First, my array creation was not
doing what I thought it was. This (what I've been doing) creates an
array of 159161 elements:

numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float32)

Which isn't what I was after (1000 elements ranging from 0 to 2PI). So
the values in that array climb up to 999.999.

Running with numpy.ones() gives a much different timing (I did
numpy.ones(159161) to keep the array lengths the same):

sin float32 0.078202009201
sin float64 0.0767619609833
cos float32 0.0750858783722
cos float64 0.088515996933

Much better, but still a little strange, float32 should be relatively
faster yet. I tried with 1000 elements and got similar results.

So the performance has something to do with the input values. This is
believable, but I don't think it explains why float32 would behave that
way and not float64, unless there's something else I don't understand.

Also I assume you meant seterr(all='raise'). This didn't seem to do
anything, I don't have any exceptions thrown or other output.

Andrew

From robert.kern at gmail.com Tue Aug 4 13:24:15 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 4 Aug 2009 12:24:15 -0500
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: <4A786D9A.1040703@indiana.edu>
References: <4A76E709.9090100@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> <4A7723A8.8010107@indiana.edu> <4A785073.9060009@indiana.edu> <5b8d13220908040820u79b6d5felc3dd2e338cc55433@mail.gmail.com> <4A786D9A.1040703@indiana.edu>
Message-ID: <3d375d730908041024p616fae36k3ce5205b70ed0e48@mail.gmail.com>

On Tue, Aug 4, 2009 at 12:19, Andrew Friedley wrote:

> OK, have some interesting results. First, my array creation was not
> doing what I thought it was. This (what I've been doing) creates an
> array of 159161 elements:
>
> numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float32)
>
> Which isn't what I was after (1000 elements ranging from 0 to 2PI). So
> the values in that array climb up to 999.999.

One uses arange() like so: numpy.arange(start, stop, step), just like
the builtin range(). You want numpy.linspace(0.0, 2*numpy.pi, 1000).

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco
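To make the difference concrete, a short sketch of the two constructions
(values as in the thread):

import numpy as np

# arange(start, stop, step): the 1000 here is a stop value, not a count,
# so this yields the ~159 thousand tiny steps reported above, not 1000
# points between 0 and 2*pi
a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32)

# linspace(start, stop, num) yields exactly 1000 points from 0 to 2*pi,
# endpoint included; .astype is used since linspace of this era takes no
# dtype argument
b = np.linspace(0.0, 2 * np.pi, 1000).astype(np.float32)
print(b.shape, b[0], b[-1])   # (1000,) 0.0 6.2831855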
From d_l_goldsmith at yahoo.com Tue Aug 4 13:45:01 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 4 Aug 2009 10:45:01 -0700 (PDT)
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <3d375d730908041008k33d2161ayf9a35b04b0ab85ca@mail.gmail.com>
Message-ID: <145439.20090.qm@web52104.mail.re2.yahoo.com>

Actually, Robert's really a robot (indeed, the Kernel of all robot minds)
- no way a biologic is going to beat him. ;-)

DG

--- On Tue, 8/4/09, Robert Kern wrote:

> From: Robert Kern
> Subject: Re: [Numpy-discussion] Why NaN?
> To: "Discussion of Numerical Python"
> Date: Tuesday, August 4, 2009, 10:08 AM
> On Tue, Aug 4, 2009 at 12:05, wrote:
>
> > What's going on with the response time here?
> >
> > I cannot even finish reading the question and start python.
>
> Practice. :-)
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>   -- Umberto Eco

From charlesr.harris at gmail.com Tue Aug 4 13:48:34 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 4 Aug 2009 11:48:34 -0600
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: <4A786D9A.1040703@indiana.edu>
References: <4A76E709.9090100@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> <4A7723A8.8010107@indiana.edu> <4A785073.9060009@indiana.edu> <5b8d13220908040820u79b6d5felc3dd2e338cc55433@mail.gmail.com> <4A786D9A.1040703@indiana.edu>
Message-ID: 

On Tue, Aug 4, 2009 at 11:19 AM, Andrew Friedley wrote:

> David Cournapeau wrote:
> > On Wed, Aug 5, 2009 at 12:14 AM, Andrew Friedley wrote:
> >
> >> Do you know where this conversion is, in the code? The impression I got
> >> from my quick look at the code was that a wrapper sinf was defined that
> >> just calls sin. I guess the typecast to float in there will do the
> >> conversion
> >
> > Exact. Given your CPU, compared to my macbook, it looks like the
> > float32 is the problem (i.e. the float64 is not particularly fast). I
> > really can't see what could cause such a slowdown: the range over
> > which you evaluate sin should not cause denormal numbers - just to be
> > sure, could you try the same benchmark but using a simple array of
> > constant values (say numpy.ones(1000)) ? Also, you may want to check
> > what happens if you force raising errors in case of FPU exceptions
> > (numpy.seterr(raise="all")).
>
> OK, have some interesting results. First, my array creation was not
> doing what I thought it was. This (what I've been doing) creates an
> array of 159161 elements:
>
> numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float32)
>
> Which isn't what I was after (1000 elements ranging from 0 to 2PI). So
> the values in that array climb up to 999.999.
>
> Running with numpy.ones() gives a much different timing (I did
> numpy.ones(159161) to keep the array lengths the same):
>
> sin float32 0.078202009201
> sin float64 0.0767619609833
> cos float32 0.0750858783722
> cos float64 0.088515996933
>
> Much better, but still a little strange, float32 should be relatively
> faster yet. I tried with 1000 elements and got similar results.

Depends on the CPU, FPU and the compiler flags. The computations could very
well be done using double precision internally with conversions on
load/store.

Chuck

From afriedle at indiana.edu Tue Aug 4 13:57:05 2009
From: afriedle at indiana.edu (Andrew Friedley)
Date: Tue, 04 Aug 2009 13:57:05 -0400
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: 
References: <4A76E709.9090100@indiana.edu> <20090803142112.GA7495@phare.normalesup.org> <7f014ea60908030923s30b958a6meb2afbad38269052@mail.gmail.com> <4A7723A8.8010107@indiana.edu> <4A785073.9060009@indiana.edu> <5b8d13220908040820u79b6d5felc3dd2e338cc55433@mail.gmail.com> <4A786D9A.1040703@indiana.edu>
Message-ID: <4A787671.5060501@indiana.edu>

Charles R Harris wrote:
> Depends on the CPU, FPU and the compiler flags. The computations could very
> well be done using double precision internally with conversions on
> load/store.

Sure, but if this is the case, why is the performance blowing up on
larger input values for float32 but not float64? Both should blow up,
not just one or the other. In other words I think they are using
different implementations :) Am I missing something?

Andrew

From josef.pktd at gmail.com Tue Aug 4 14:11:57 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 4 Aug 2009 14:11:57 -0400
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <145439.20090.qm@web52104.mail.re2.yahoo.com>
References: <3d375d730908041008k33d2161ayf9a35b04b0ab85ca@mail.gmail.com> <145439.20090.qm@web52104.mail.re2.yahoo.com>
Message-ID: <1cd32cbb0908041111s3d6c167dv2a216d7c245c0045@mail.gmail.com>

On Tue, Aug 4, 2009 at 1:45 PM, David Goldsmith wrote:
>
> Actually, Robert's really a robot (indeed, the Kernel of all robot minds)
> - no way a biologic is going to beat him. ;-)

So, what is the conclusion, do we need more practice, or can we sit
back and let Robert take care of things?

Josef

> DG
>
> --- On Tue, 8/4/09, Robert Kern wrote:
>
>> From: Robert Kern
>> Subject: Re: [Numpy-discussion] Why NaN?
>> To: "Discussion of Numerical Python"
>> Date: Tuesday, August 4, 2009, 10:08 AM
>> On Tue, Aug 4, 2009 at 12:05, wrote:
>>
>> > What's going on with the response time here?
>> >
>> > I cannot even finish reading the question and start python.
>>
>> Practice. :-)
>>
>> --
>> Robert Kern
>>
>> "I have come to believe that the whole world is an enigma, a harmless
>> enigma that is made terrible by our own mad attempt to interpret it as
>> though it had an underlying truth."
>>   -- Umberto Eco
From d_l_goldsmith at yahoo.com Tue Aug 4 14:30:45 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 4 Aug 2009 11:30:45 -0700 (PDT)
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <1cd32cbb0908041111s3d6c167dv2a216d7c245c0045@mail.gmail.com>
Message-ID: <348270.25448.qm@web52105.mail.re2.yahoo.com>

Uh-oh, if my joke is going to promote wide-spread complacency, I take it
back, I take it back!

DG

--- On Tue, 8/4/09, josef.pktd at gmail.com wrote:

> From: josef.pktd at gmail.com
> Subject: Re: [Numpy-discussion] Why NaN?
> To: "Discussion of Numerical Python"
> Date: Tuesday, August 4, 2009, 11:11 AM
> On Tue, Aug 4, 2009 at 1:45 PM, David Goldsmith wrote:
> >
> > Actually, Robert's really a robot (indeed, the Kernel
> of all robot minds) - no way a biologic is going to beat
> him. ;-)
>
> So, what is the conclusion, do we need more practice, or
> can we sit back and let Robert take care of things?
>
> Josef
>
> > DG
> >
> > --- On Tue, 8/4/09, Robert Kern wrote:
> >
> >> From: Robert Kern
> >> Subject: Re: [Numpy-discussion] Why NaN?
> >> To: "Discussion of Numerical Python"
> >> Date: Tuesday, August 4, 2009, 10:08 AM
> >> On Tue, Aug 4, 2009 at 12:05, wrote:
> >>
> >> > What's going on with the response time here?
> >> >
> >> > I cannot even finish reading the question and start python.
> >>
> >> Practice. :-)
> >>
> >> --
> >> Robert Kern
> >>
> >> "I have come to believe that the whole world is an enigma,
> >> a harmless enigma that is made terrible by our own mad
> >> attempt to interpret it as though it had an underlying truth."
> >>   -- Umberto Eco

From gael.varoquaux at normalesup.org Tue Aug 4 14:34:52 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Tue, 4 Aug 2009 20:34:52 +0200
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <1cd32cbb0908041111s3d6c167dv2a216d7c245c0045@mail.gmail.com>
References: <3d375d730908041008k33d2161ayf9a35b04b0ab85ca@mail.gmail.com> <145439.20090.qm@web52104.mail.re2.yahoo.com> <1cd32cbb0908041111s3d6c167dv2a216d7c245c0045@mail.gmail.com>
Message-ID: <20090804183452.GA11772@phare.normalesup.org>

On Tue, Aug 04, 2009 at 02:11:57PM -0400, josef.pktd at gmail.com wrote:
> On Tue, Aug 4, 2009 at 1:45 PM, David Goldsmith wrote:
> > Actually, Robert's really a robot (indeed, the Kernel of all robot
> > minds) - no way a biologic is going to beat him. ;-)

> So, what is the conclusion, do we need more practice, or can we sit
> back and let Robert take care of things?

No, we need to get the master schematics of Robert and replicate him!

Robert, please?

Gaël

From gokhansever at gmail.com Tue Aug 4 14:40:31 2009
From: gokhansever at gmail.com (Gökhan Sever)
Date: Tue, 4 Aug 2009 13:40:31 -0500
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
Message-ID: <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>

This is the loveliest of all solutions:

c[isfinite(c)].mean()

You are all very helpful and funny. I am sure most of you spend more than 16
hours a day in front of or by your screens :)

On Tue, Aug 4, 2009 at 11:46 AM, Gökhan Sever wrote:

> Hello,
>
> I know this has to have a very simple answer, but stuck at this very moment
> and can't get a meaningful result out of np.mean()
>
> In [121]: a = array([NaN, 4, NaN, 12])
>
> In [122]: b = array([NaN, 2, NaN, 3])
>
> In [123]: c = a/b
>
> In [124]: mean(c)
> Out[124]: nan
>
> In [125]: mean a
> --------> mean(a)
> Out[125]: nan
>
> Further when I tried:
>
> In [138]: c
> Out[138]: array([ NaN,   2.,  NaN,   4.])
>
> In [139]: np.where(c==NaN)
> Out[139]: (array([], dtype=int32),)
>
> In [141]: mask = [c != NaN]
>
> In [142]: mask
> Out[142]: [array([ True,  True,  True,  True], dtype=bool)]
>
> Any ideas?
>
> --
> Gökhan

-- 
Gökhan

From robert.kern at gmail.com Tue Aug 4 14:43:54 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 4 Aug 2009 13:43:54 -0500
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
Message-ID: <3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com>

On Tue, Aug 4, 2009 at 13:40, Gökhan Sever wrote:
> This is the loveliest of all solutions:
>
> c[isfinite(c)].mean()

I kind of like c[c == c].mean(), but only because it's a bit mind-blowing. :-)

> You are all very helpful and funny. I am sure most of you spend more than 16
> hours a day in front of or by your screens :)

Hey! I resemble that remark!

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From meine at informatik.uni-hamburg.de Tue Aug 4 14:46:39 2009
From: meine at informatik.uni-hamburg.de (Hans Meine)
Date: Tue, 4 Aug 2009 20:46:39 +0200
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: <4A786D9A.1040703@indiana.edu>
References: <4A76E709.9090100@indiana.edu> <5b8d13220908040820u79b6d5felc3dd2e338cc55433@mail.gmail.com> <4A786D9A.1040703@indiana.edu>
Message-ID: <200908042046.39426.meine@informatik.uni-hamburg.de>

On Tuesday 04 August 2009 19:19:22 Andrew Friedley wrote:
> OK, have some interesting results. First, my array creation was not
> doing what I thought it was. This (what I've been doing) creates an
> array of 159161 elements:
>
> numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float32)

Aaaaah. And I wondered why taking the sin/cos of 1000 elements took so
long... ;-)
(actually, I would've used larger arrays for benchmarking to begin with)

Indeed, the value range fixes stuff here (Linux, GCC/amd64, Xeon X5450 @
3.00GHz, NumPy 1.3.0), too:

Before:
float64 10 loops, best of 3: 54.2 ms per loop
float32 10 loops, best of 3: 7.62 ms per loop

After:
float64 10 loops, best of 3: 6.03 ms per loop
float32 10 loops, best of 3: 3.81 ms per loop

Best,
  Hans

From gael.varoquaux at normalesup.org Tue Aug 4 14:48:24 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Tue, 4 Aug 2009 20:48:24 +0200
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com> <3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com>
Message-ID: <20090804184824.GB11772@phare.normalesup.org>

On Tue, Aug 04, 2009 at 01:43:54PM -0500, Robert Kern wrote:
> I kind of like c[c == c].mean(), but only because it's a bit mind-blowing. :-)

> > You are all very helpful and funny. I am sure most of you spend more
> > than 16 hours a day in front of or by your screens :)

> Hey! I resemble that remark!

Out of these 16 hours, 14 are spent staring at two terminals: one with
IPython on one side, and another with vim on the other.

Yeah baby!

Gaël

From gokhansever at gmail.com Tue Aug 4 14:54:49 2009
From: gokhansever at gmail.com (Gökhan Sever)
Date: Tue, 4 Aug 2009 13:54:49 -0500
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <20090804184824.GB11772@phare.normalesup.org>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com> <3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com> <20090804184824.GB11772@phare.normalesup.org>
Message-ID: <49d6b3500908041154g7383eed7w7cbd00ddea55f035@mail.gmail.com>

On Tue, Aug 4, 2009 at 1:48 PM, Gael Varoquaux <
gael.varoquaux at normalesup.org> wrote:

> On Tue, Aug 04, 2009 at 01:43:54PM -0500, Robert Kern wrote:
> > I kind of like c[c == c].mean(), but only because it's a bit
> mind-blowing. :-)
>
> > > You are all very helpful and funny. I am sure most of you spend more
> than 16
> > > hours a day in front of or by your screens :)
>
> > Hey! I resemble that remark!
>
> Out of these 16 hours, 14 are spent staring at two terminals: one with
> IPython on one side, and another with vim on the other.
>
> Yeah baby!
>
> Gaël

I see that you should have a browser embedding plugin for IPython which you
don't want to share with us :)

And do you only fix Mayavi issues in that not-included 2 hours?

-- 
Gökhan

From pgmdevlist at gmail.com Tue Aug 4 15:29:47 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Tue, 4 Aug 2009 15:29:47 -0400
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com> <3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com>
Message-ID: <95747771-B7B5-44F3-B12C-094B944A33CA@gmail.com>

On Aug 4, 2009, at 2:43 PM, Robert Kern wrote:

> On Tue, Aug 4, 2009 at 13:40, Gökhan Sever wrote:
>> This is the loveliest of all solutions:
>>
>> c[isfinite(c)].mean()
>
> I kind of like c[c == c].mean(), but only because it's a bit mind-
> blowing. :-)

But it doesn't give the same result as the previous one when there's
an inf...

From robert.kern at gmail.com Tue Aug 4 15:40:19 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 4 Aug 2009 14:40:19 -0500
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <95747771-B7B5-44F3-B12C-094B944A33CA@gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com> <3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com> <95747771-B7B5-44F3-B12C-094B944A33CA@gmail.com>
Message-ID: <3d375d730908041240g44fe6079obc03c4eae4a9968@mail.gmail.com>

On Tue, Aug 4, 2009 at 14:29, Pierre GM wrote:
>
> On Aug 4, 2009, at 2:43 PM, Robert Kern wrote:
>
>> On Tue, Aug 4, 2009 at 13:40, Gökhan Sever wrote:
>>> This is the loveliest of all solutions:
>>>
>>> c[isfinite(c)].mean()
>>
>> I kind of like c[c == c].mean(), but only because it's a bit mind-
>> blowing. :-)
>
> But it doesn't give the same result as the previous one when there's
> an inf...

NaNs might be markers of missing data, but I see infs as data.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco
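A short sketch of the distinction being made in this exchange (array
values invented for illustration):

import numpy as np

c = np.array([np.nan, 2.0, np.inf, 4.0])
print(c == c)                    # [False  True  True  True] -- only NaN
                                 # fails to equal itself, inf does not
print(c[c == c].mean())          # inf -- NaNs dropped, the inf kept as data
print(c[np.isfinite(c)].mean())  # 3.0 -- NaNs and infs both dropped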
From gael.varoquaux at normalesup.org Tue Aug 4 15:59:36 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Tue, 4 Aug 2009 21:59:36 +0200
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <49d6b3500908041154g7383eed7w7cbd00ddea55f035@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com> <3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com> <20090804184824.GB11772@phare.normalesup.org> <49d6b3500908041154g7383eed7w7cbd00ddea55f035@mail.gmail.com>
Message-ID: <20090804195936.GF11772@phare.normalesup.org>

On Tue, Aug 04, 2009 at 01:54:49PM -0500, Gökhan Sever wrote:
>    I see that you should have a browser embedding plugin for IPython which you
>    don't want to share with us :)

No, I answer e-mail using vim.

> And do you only fix Mayavi issues in that not-included 2 hours?

No, during the other hours, using IPython and vim, what else?

Gaël

From kwgoodman at gmail.com Tue Aug 4 16:06:38 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Tue, 4 Aug 2009 13:06:38 -0700
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <20090804195936.GF11772@phare.normalesup.org>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com> <3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com> <20090804184824.GB11772@phare.normalesup.org> <49d6b3500908041154g7383eed7w7cbd00ddea55f035@mail.gmail.com> <20090804195936.GF11772@phare.normalesup.org>
Message-ID: 

On Tue, Aug 4, 2009 at 12:59 PM, Gael Varoquaux wrote:
> On Tue, Aug 04, 2009 at 01:54:49PM -0500, Gökhan Sever wrote:
>>    I see that you should have a browser embedding plugin for IPython which you
>>    don't want to share with us :)
>
> No, I answer e-mail using vim.

Yeah, I'm trying that right now.

:wq
:q!
:dammit

From matthew.brett at gmail.com Tue Aug 4 16:09:13 2009
From: matthew.brett at gmail.com (Matthew Brett)
Date: Tue, 4 Aug 2009 13:09:13 -0700
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
In-Reply-To: <5b8d13220908040931ide8259dn7c07ecf8e82e3099@mail.gmail.com>
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77F13E.2020402@ar.media.kyoto-u.ac.jp> <5b8d13220908040254n538c1e4flfe7ee2dd9aba96ef@mail.gmail.com> <4A780871.5070408@ar.media.kyoto-u.ac.jp> <4A7817F7.3@ar.media.kyoto-u.ac.jp> <5b8d13220908040931ide8259dn7c07ecf8e82e3099@mail.gmail.com>
Message-ID: <1e2af89e0908041309s6d96d638o2a46d4526b57202@mail.gmail.com>

Hi,

On Tue, Aug 4, 2009 at 9:31 AM, David Cournapeau wrote:
> On Tue, Aug 4, 2009 at 8:13 PM, David
> Cournapeau wrote:
>
>> I think I understand the problem. Unfortunately, that's looks tricky to
>> solve... I hate distutils.
>
> Ok - should be fixed in r7281.

Just to clarify - it's still true I guess that this:

python setup.py build_ext --compiler=mingw32 --inplace

just can't work - because the --compiler flag does not get passed to
the build step?

I noticed, when I was trying to be fancy:

python setup.py build build_ext --inplace

this error:

  File "/home/mb312/usr/local/lib/python2.5/site-packages/numpy/distutils/command/build_ext.py",
line 74, in run
    self.library_dirs.append(build_clib.build_clib)
UnboundLocalError: local variable 'build_clib' referenced before assignment

because of the check for inplace builds above that, leaving build_clib
undefined. I'm afraid I wasn't quite sure what the right thing to do
was.

Thanks a lot,

Matthew

From robert.kern at gmail.com Tue Aug 4 16:23:36 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 4 Aug 2009 15:23:36 -0500
Subject: [Numpy-discussion] Is this a bug in numpy.distutils ?
In-Reply-To: <1e2af89e0908041309s6d96d638o2a46d4526b57202@mail.gmail.com>
References: <1e2af89e0908031635i676135afmde57bd8993a4ca82@mail.gmail.com> <4A77F13E.2020402@ar.media.kyoto-u.ac.jp> <5b8d13220908040254n538c1e4flfe7ee2dd9aba96ef@mail.gmail.com> <4A780871.5070408@ar.media.kyoto-u.ac.jp> <4A7817F7.3@ar.media.kyoto-u.ac.jp> <5b8d13220908040931ide8259dn7c07ecf8e82e3099@mail.gmail.com> <1e2af89e0908041309s6d96d638o2a46d4526b57202@mail.gmail.com>
Message-ID: <3d375d730908041323i20a155a3g37718104163db591@mail.gmail.com>

On Tue, Aug 4, 2009 at 15:09, Matthew Brett wrote:

>   File "/home/mb312/usr/local/lib/python2.5/site-packages/numpy/distutils/command/build_ext.py",
> line 74, in run
>     self.library_dirs.append(build_clib.build_clib)
> UnboundLocalError: local variable 'build_clib' referenced before assignment
>
> because of the check for inplace builds above that, leaving build_clib
> undefined. I'm afraid I wasn't quite sure what the right thing to do
> was.

Probably just

build_clib = self.distribution.get_command_obj('build_clib')

after the log.warn().

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco
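To illustrate the failure mode being discussed, a generic sketch of the
pattern (not the actual numpy.distutils code): a name bound on only one
branch raises UnboundLocalError on the other, which is why the suggested
fallback re-binds build_clib unconditionally.

def run(inplace):
    if not inplace:
        build_clib = 'build/temp'   # hypothetical value; bound only here
    return build_clib               # UnboundLocalError when inplace is True

Robert's one-liner, build_clib =
self.distribution.get_command_obj('build_clib'), uses a standard distutils
Distribution method to fetch (or create) the command object, so the name
is always bound before it is used.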
From npuloski at gmail.com Tue Aug 4 16:36:28 2009
From: npuloski at gmail.com (Nanime Puloski)
Date: Tue, 4 Aug 2009 16:36:28 -0400
Subject: [Numpy-discussion] Features in SciPy That are Absent in NumPy
Message-ID: 

What features does SciPy have that are absent in NumPy?

From nmb at wartburg.edu Tue Aug 4 16:40:55 2009
From: nmb at wartburg.edu (Neil Martinsen-Burrell)
Date: Tue, 04 Aug 2009 14:40:55 -0600
Subject: [Numpy-discussion] Features in SciPy That are Absent in NumPy
In-Reply-To: 
References: 
Message-ID: <4A789CD7.4020109@wartburg.edu>

On 2009-08-04 14:36 , Nanime Puloski wrote:
> What features does SciPy have that are absent in NumPy?

Many. SciPy includes algorithms for optimization, solving differential
equations, numerical integration among many others. NumPy primarily
provides a useful n-dimensional array container. While there are some
basic scientific features such as FFTs in NumPy, these appear in more
detail in SciPy. If you can give more specifics on what features you
would be interested in, we can offer more help about which package
contains those features.

-Neil

From bsouthey at gmail.com Tue Aug 4 16:53:00 2009
From: bsouthey at gmail.com (Bruce Southey)
Date: Tue, 4 Aug 2009 15:53:00 -0500
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
Message-ID: 

On Tue, Aug 4, 2009 at 1:40 PM, Gökhan Sever wrote:
> This is the loveliest of all solutions:
>
> c[isfinite(c)].mean()

This handling of nonfinite elements has come up before.
Please remember that this only works for a 1d or flattened array, so it
does not work in general, especially along an axis.

Bruce

From kwgoodman at gmail.com Tue Aug 4 17:05:18 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Tue, 4 Aug 2009 14:05:18 -0700
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: 
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
Message-ID: 

On Tue, Aug 4, 2009 at 1:53 PM, Bruce Southey wrote:
> On Tue, Aug 4, 2009 at 1:40 PM, Gökhan Sever wrote:
>> This is the loveliest of all solutions:
>>
>> c[isfinite(c)].mean()
>
> This handling of nonfinite elements has come up before.
> Please remember that this only works for a 1d or flattened array, so it
> does not work in general, especially along an axis.

If you don't want to use nanmean from scipy.stats you could use:

np.nansum(c, axis=0) / (~np.isnan(c)).sum(axis=0)

or

np.nansum(c, axis=0) / (c == c).sum(axis=0)

But if c contains ints then you'll run into trouble with the division,
so you'll need to protect against that.
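A worked 2-d sketch of Keith's formula (array values invented; the float
cast mirrors the integer-division caveat he mentions):

import numpy as np

c2 = np.array([[np.nan, 2.0],
               [4.0, np.nan],
               [6.0, 8.0]])
# count of non-NaN entries per column, cast so the division stays float
counts = (~np.isnan(c2)).sum(axis=0).astype(float)
print(np.nansum(c2, axis=0) / counts)   # [ 5.  5.] -- per-column mean,
                                        # NaNs ignored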
From bsouthey at gmail.com Tue Aug 4 17:24:35 2009
From: bsouthey at gmail.com (Bruce Southey)
Date: Tue, 4 Aug 2009 16:24:35 -0500
Subject: [Numpy-discussion] Funded work on Numpy: proposed improvements and request for feedback
In-Reply-To: <4A77A009.9060104@ar.media.kyoto-u.ac.jp>
References: <4A77A009.9060104@ar.media.kyoto-u.ac.jp>
Message-ID: 

On Mon, Aug 3, 2009 at 9:42 PM, David Cournapeau wrote:
> Hi All,
>
>     I (David Cournapeau) and the people at Berkeley (Jarrod Millman,
> Fernando Perez, Matthew Brett) have been in discussion so that I could
> do some funded work on NumPy/SciPy. Although they are obviously
> interested in improvements that help their own projects, they are
> willing to make sure the work will impact numpy/scipy as a whole. As
> such we would like to get some feedback about the proposal.
>
> There are several areas we discussed about, but the main 'vision' is to
> make more of the C code in numpy reusable to 3rd parties, in particular
> purely computational (fft, linear algebra, etc...) code. A first draft
> of the proposal is pasted below.
>
> Comments, request for details, objections are welcomed,
>
> Thank you for your attention,
>
> The Berkeley team, Gael Varoquaux and David Cournapeau

[snip]

Almost a year ago Travis sent an email: 'Report from SciPy'
http://mail.scipy.org/pipermail/numpy-discussion/2008-August/036909.html

Of importance was that
" * NumPy 2.0 will be a library and will not automagically import numpy.fft
 * We will suggest that other libraries use from numpy import fft
instead of import numpy as np; np.fft
"

I sort of see that the proposed work could help make numpy a library
as a whole but it is not clear that the work is heading towards that
goal. So if numpy 2.0 is still planned as a library then I would like
to see a clearer statement towards that goal.

Not really understanding the problems of C99, but I know that trying
to cover all the little details can be very time consuming when more
effort could be spent on things.
So if 'C99-like' is going to be the near term future, is there any
point in supporting non-C99 environments with this work?
That is, is the limitation in the compiler, operating system,
processor or some combination of these?

Anyhow, these are only my thoughts and pale in comparison to the work
you are doing so feel free to ignore them.

Thanks
Bruce

From d_l_goldsmith at yahoo.com Tue Aug 4 17:43:12 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 4 Aug 2009 14:43:12 -0700 (PDT)
Subject: [Numpy-discussion] Features in SciPy That are Absent in NumPy
In-Reply-To: <4A789CD7.4020109@wartburg.edu>
Message-ID: <74401.51536.qm@web52107.mail.re2.yahoo.com>

--- On Tue, 8/4/09, Neil Martinsen-Burrell wrote:

> > What features does SciPy have that are absent in
> > NumPy?
>
> Many.

And that's an understatement!

DG

> SciPy includes algorithms for optimization, solving differential
> equations, numerical integration among many others. NumPy primarily
> provides a useful n-dimensional array container. While there are some
> basic scientific features such as FFTs in NumPy, these appear in more
> detail in SciPy. If you can give more specifics on what features you
> would be interested in, we can offer more help about which package
> contains those features.
>
> -Neil

From charlesr.harris at gmail.com Tue Aug 4 17:43:28 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 4 Aug 2009 15:43:28 -0600
Subject: [Numpy-discussion] Funded work on Numpy: proposed improvements and request for feedback
In-Reply-To: 
References: <4A77A009.9060104@ar.media.kyoto-u.ac.jp>
Message-ID: 

On Tue, Aug 4, 2009 at 3:24 PM, Bruce Southey wrote:

> On Mon, Aug 3, 2009 at 9:42 PM, David Cournapeau wrote:
> > Hi All,
> >
> >     I (David Cournapeau) and the people at Berkeley (Jarrod Millman,
> > Fernando Perez, Matthew Brett) have been in discussion so that I could
> > do some funded work on NumPy/SciPy. Although they are obviously
> > interested in improvements that help their own projects, they are
> > willing to make sure the work will impact numpy/scipy as a whole. As
> > such we would like to get some feedback about the proposal.
> >
> > There are several areas we discussed about, but the main 'vision' is to
> > make more of the C code in numpy reusable to 3rd parties, in particular
> > purely computational (fft, linear algebra, etc...) code. A first draft
> > of the proposal is pasted below.
> >
> > Comments, request for details, objections are welcomed,
> >
> > Thank you for your attention,
> >
> > The Berkeley team, Gael Varoquaux and David Cournapeau
>
> [snip]
>
> Almost a year ago Travis sent an email: 'Report from SciPy'
> http://mail.scipy.org/pipermail/numpy-discussion/2008-August/036909.html
>
> Of importance was that
> " * NumPy 2.0 will be a library and will not automagically import numpy.fft
>  * We will suggest that other libraries use from numpy import fft
> instead of import numpy as np; np.fft
> "
>
> I sort of see that the proposed work could help make numpy a library
> as a whole but it is not clear that the work is heading towards that
> goal. So if numpy 2.0 is still planned as a library then I would like
> to see a clearer statement towards that goal.
>
> Not really understanding the problems of C99, but I know that trying
> to cover all the little details can be very time consuming when more
> effort could be spent on things.
> So if 'C99-like' is going to be the near term future, is there any
> point in supporting non-C99 environments with this work?

Windows? I don't know the status of the most recent MSVC compilers, but
they haven't been C99 compliant in the past and compliance doesn't seem
to be a priority. Other compilers are a mixed bag. This is the git
conundrum: support isn't sufficiently widespread on all platforms to make
the transition so we are stuck with the lowest common denominator.

Chuck

From d_l_goldsmith at yahoo.com Tue Aug 4 17:49:48 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 4 Aug 2009 14:49:48 -0700 (PDT)
Subject: [Numpy-discussion] Funded work on Numpy: proposed improvements and request for feedback
In-Reply-To: 
Message-ID: <999200.9547.qm@web52106.mail.re2.yahoo.com>

--- On Tue, 8/4/09, Bruce Southey wrote:

> [snip]
>
> Almost a year ago Travis sent an email: 'Report from SciPy'
> http://mail.scipy.org/pipermail/numpy-discussion/2008-August/036909.html
>
> Of importance was that
> " * NumPy 2.0 will be a library and will not automagically
> import numpy.fft

As someone who tends to think of "modules" as "libraries" (renamed for
Python for "branding" purposes), what's the difference?

DG

> * We will suggest that other libraries use from numpy import fft
> instead of import numpy as np; np.fft
> "
>
> I sort of see that the proposed work could help make numpy a library
> as a whole but it is not clear that the work is heading towards that
> goal. So if numpy 2.0 is still planned as a library then I would like
> to see a clearer statement towards that goal.
>
> Not really understanding the problems of C99, but I know that trying
> to cover all the little details can be very time consuming when more
> effort could be spent on things.
> So if 'C99-like' is going to be the near term future, is there any
> point in supporting non-C99 environments with this work?
> That is, is the limitation in the compiler, operating system,
> processor or some combination of these?
>
> Anyhow, these are only my thoughts and pale in comparison to the work
> you are doing so feel free to ignore them.
>
> Thanks
> Bruce

From robert.kern at gmail.com Tue Aug 4 17:53:37 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 4 Aug 2009 16:53:37 -0500
Subject: [Numpy-discussion] Funded work on Numpy: proposed improvements and request for feedback
In-Reply-To: <999200.9547.qm@web52106.mail.re2.yahoo.com>
References: <999200.9547.qm@web52106.mail.re2.yahoo.com>
Message-ID: <3d375d730908041453k3ffcd219p50482a5d53d683a@mail.gmail.com>

On Tue, Aug 4, 2009 at 16:49, David Goldsmith wrote:
>
> --- On Tue, 8/4/09, Bruce Southey wrote:
>
>> [snip]
>>
>> Almost a year ago Travis sent an email: 'Report from SciPy'
>> http://mail.scipy.org/pipermail/numpy-discussion/2008-August/036909.html
>>
>> Of importance was that
>> " * NumPy 2.0 will be a library and will not automagically
>> import numpy.fft
>
> As someone who tends to think of "modules" as "libraries" (renamed for
> Python for "branding" purposes), what's the difference?

Poor phrasing. I believe Travis meant something along the lines of
"NumPy 2.0 will be a [well-behaved] library and will not automagically
import numpy.fft." The informative part is the latter point.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From d_l_goldsmith at yahoo.com Tue Aug 4 17:57:02 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 4 Aug 2009 14:57:02 -0700 (PDT)
Subject: [Numpy-discussion] Funded work on Numpy: proposed improvements and request for feedback
In-Reply-To: <3d375d730908041453k3ffcd219p50482a5d53d683a@mail.gmail.com>
Message-ID: <233855.28363.qm@web52103.mail.re2.yahoo.com>

Gotchya, thanks!

DG

--- On Tue, 8/4/09, Robert Kern wrote:

> From: Robert Kern
> Subject: Re: [Numpy-discussion] Funded work on Numpy: proposed improvements and request for feedback
> To: "Discussion of Numerical Python"
> Date: Tuesday, August 4, 2009, 2:53 PM
> On Tue, Aug 4, 2009 at 16:49, David Goldsmith wrote:
> >
> > --- On Tue, 8/4/09, Bruce Southey wrote:
> >
> >> [snip]
> >>
> >> Almost a year ago Travis sent an email: 'Report from SciPy'
> >> http://mail.scipy.org/pipermail/numpy-discussion/2008-August/036909.html
> >>
> >> Of importance was that
> >> " * NumPy 2.0 will be a library and will not automagically
> >> import numpy.fft
> >
> > As someone who tends to think of "modules" as "libraries" (renamed for
> > Python for "branding" purposes), what's the difference?
>
> Poor phrasing. I believe Travis meant something along the lines of
> "NumPy 2.0 will be a [well-behaved] library and will not automagically
> import numpy.fft." The informative part is the latter point.
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>   -- Umberto Eco

From liukis at usc.edu Tue Aug 4 18:36:30 2009
From: liukis at usc.edu (Maria Liukis)
Date: Tue, 04 Aug 2009 15:36:30 -0700
Subject: [Numpy-discussion] scipy.stats.poisson.ppf raises "OverflowError: cannot convert float infinity to long"
Message-ID: 

Hello everybody,

I'm using the following versions of scipy and numpy:

>>> scipy.__version__
'0.6.0'
>>> import numpy
>>> numpy.__version__
'1.1.1'

Would anybody happen to know why I get an exception when calling
scipy.stats.poisson.ppf function:

>>> from scipy.stats import *
>>> poisson.ppf(0.9999, 4)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/lib64/python2.5/site-packages/scipy/stats/distributions.py",
line 3601, in ppf
    place(output,cond2,self.b)
  File "/usr/lib64/python2.5/site-packages/numpy/lib/function_base.py",
line 957, in place
    return _insert(arr, mask, vals)
OverflowError: cannot convert float infinity to long
>>>

Thanks a lot in advance,
Masha
--------------------
liukis at usc.edu

From josef.pktd at gmail.com Tue Aug 4 19:01:03 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 4 Aug 2009 19:01:03 -0400
Subject: [Numpy-discussion] scipy.stats.poisson.ppf raises "OverflowError: cannot convert float infinity to long"
In-Reply-To: 
References: 
Message-ID: <1cd32cbb0908041601s55216ffesa301443ff104054d@mail.gmail.com>

On Tue, Aug 4, 2009 at 6:36 PM, Maria Liukis wrote:
> Hello everybody,
> I'm using the following versions of scipy and numpy:
>>>> scipy.__version__
> '0.6.0'
>>>> import numpy
>>>> numpy.__version__
> '1.1.1'
> Would anybody happen to know why I get an exception when calling
> scipy.stats.poisson.ppf function:
>>>> from scipy.stats import *
>>>> poisson.ppf(0.9999, 4)
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
>   File "/usr/lib64/python2.5/site-packages/scipy/stats/distributions.py",
> line 3601, in ppf
>     place(output,cond2,self.b)
>   File "/usr/lib64/python2.5/site-packages/numpy/lib/function_base.py",
> line 957, in place
>     return _insert(arr, mask, vals)
> OverflowError: cannot convert float infinity to long

>>> stats.poisson.ppf(0.9999, 4)
13.0
>>> stats.poisson.cdf(13, 4)
0.99992367158465667

should be fixed since scipy 0.7.0

Josef

> Thanks a lot in advance,
> Masha
> --------------------
> liukis at usc.edu
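For anyone stuck on scipy 0.6.0 where ppf overflows, a possible workaround
is to invert the cdf by hand (a hedged sketch, not from the thread; fine
for moderate mu):

from scipy import stats

def poisson_ppf_fallback(q, mu):
    # smallest k with cdf(k, mu) >= q
    k = 0
    while stats.poisson.cdf(k, mu) < q:
        k += 1
    return k

print(poisson_ppf_fallback(0.9999, 4))   # 13, matching poisson.ppf on
                                         # scipy >= 0.7.0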
URL: From dwf at cs.toronto.edu Tue Aug 4 19:49:28 2009 From: dwf at cs.toronto.edu (David Warde-Farley) Date: Tue, 4 Aug 2009 19:49:28 -0400 Subject: [Numpy-discussion] Why NaN? In-Reply-To: <49d6b3500908041154g7383eed7w7cbd00ddea55f035@mail.gmail.com> References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com> <3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com> <20090804184824.GB11772@phare.normalesup.org> <49d6b3500908041154g7383eed7w7cbd00ddea55f035@mail.gmail.com> Message-ID: On 4-Aug-09, at 2:54 PM, G?khan Sever wrote: > I see that you should have a browser embedding plugin for Ipyhon > which you > don't want to share with us :) Ondrej's well on his way to fixing that: http://pythonnb.appspot.com/ David From josef.pktd at gmail.com Tue Aug 4 20:00:28 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 4 Aug 2009 20:00:28 -0400 Subject: [Numpy-discussion] scipy.stats.poisson.ppf raises "OverflowError: cannot convert float infinity to long" In-Reply-To: <8295F59A-5A15-443F-BB6E-ED31AC7054FF@usc.edu> References: <1cd32cbb0908041601s55216ffesa301443ff104054d@mail.gmail.com> <8295F59A-5A15-443F-BB6E-ED31AC7054FF@usc.edu> Message-ID: <1cd32cbb0908041700w6c041b0h9c0d4bd4fca05557@mail.gmail.com> On Tue, Aug 4, 2009 at 7:03 PM, Maria Liukis wrote: > Josef, > Thanks a bunch! > Masha You're welcome. Josef > -------------------- > liukis at usc.edu > > > On Aug 4, 2009, at 4:01 PM, josef.pktd at gmail.com wrote: > > On Tue, Aug 4, 2009 at 6:36 PM, Maria Liukis wrote: > > Hello everybody, > I'm using the following versions of scipy and numpy: > > scipy.__version__ > > '0.6.0' > > import numpy > numpy.__version__ > > '1.1.1' > Would anybody happen to know why I get an exception when calling > scipy.stats.poisson.ppf function: > > from scipy.stats import * > poisson.ppf(0.9999, 4) > > Traceback (most recent call last): > ??File "", line 1, in > ??File "/usr/lib64/python2.5/site-packages/scipy/stats/distributions.py", > line 3601, in ppf > ?? ?place(output,cond2,self.b) > ??File "/usr/lib64/python2.5/site-packages/numpy/lib/function_base.py", line > 957, in place > ?? ?return _insert(arr, mask, vals) > OverflowError: cannot convert float infinity to long > > stats.poisson.ppf(0.9999, 4) > > 13.0 > > stats.poisson.cdf(13, 4) > > 0.99992367158465667 > should be fixed since scipy 0.7.0 > Josef > > Thanks a lot in advance, > Masha > -------------------- > liukis at usc.edu > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > From gokhansever at gmail.com Tue Aug 4 20:03:43 2009 From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_Sever?=) Date: Tue, 4 Aug 2009 19:03:43 -0500 Subject: [Numpy-discussion] Why NaN? 
In-Reply-To: References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com> <49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com> <3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com> <20090804184824.GB11772@phare.normalesup.org> <49d6b3500908041154g7383eed7w7cbd00ddea55f035@mail.gmail.com> Message-ID: <49d6b3500908041703i55782893ice32802a58dcd2e3@mail.gmail.com> On Tue, Aug 4, 2009 at 6:49 PM, David Warde-Farley wrote: > On 4-Aug-09, at 2:54 PM, G?khan Sever wrote: > > > I see that you should have a browser embedding plugin for Ipyhon > > which you > > don't want to share with us :) > > Ondrej's well on his way to fixing that: http://pythonnb.appspot.com/ > > David > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > Hehe :) I would not be surprised if someone brings a real python snake into the conference then :) -- G?khan -------------- next part -------------- An HTML attachment was scrubbed... URL: From cycomanic at gmail.com Tue Aug 4 21:18:33 2009 From: cycomanic at gmail.com (Jochen) Date: Wed, 5 Aug 2009 11:18:33 +1000 Subject: [Numpy-discussion] strange sin/cos performance In-Reply-To: <20090803134556.GA31036@phare.normalesup.org> References: <4A76E709.9090100@indiana.edu> <20090803134556.GA31036@phare.normalesup.org> Message-ID: <20090805111833.645c6d93@cudos0803> Hi all, I see something similar on my system. OK I've just done a test. System is Ubuntu 9.04 AMD64 there seems to be a regression for float32 with high values: In [47]: a=np.random.rand(10000).astype(np.float32) In [48]: b=np.random.rand(10000).astype(np.float64) In [49]: c=1000*np.random.rand(10000).astype(np.float32) In [50]: d=1000*np.random.rand(1000).astype(np.float64) In [51]: %timeit -n 10 np.sin(a) 10 loops, best of 3: 251 ?s per loop In [52]: %timeit -n 10 np.sin(b) 10 loops, best of 3: 395 ?s per loop In [53]: %timeit -n 10 np.sin(c) 10 loops, best of 3: 5.65 ms per loop In [54]: %timeit -n 10 np.sin(d) 10 loops, best of 3: 87.7 ?s per loop In [55]: %timeit -n 10 np.sin(c.astype(np.float64)).astype(np.float32) 10 loops, best of 3: 891 ?s per loop Cheers Jochen On Mon, 3 Aug 2009 15:45:56 +0200 Emmanuelle Gouillart wrote: > Hi Andrew, > > %timeit is an Ipython magic command that uses the timeit > module, see > http://ipython.scipy.org/doc/stable/html/interactive/reference.html?highlight=timeit > for more information about how to use it. So you were right to suppose > that it is not a "normal Python". > > However, I was not able to reproduce your observations. > > >>> import numpy as np > >>> a = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32) > >>> b = np.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float64) > >>> %timeit -n 10 np.sin(a) > 10 loops, best of 3: 8.67 ms per loop > >>> %timeit -n 10 np.sin(b) > 10 loops, best of 3: 9.29 ms per loop > > Emmanuelle > > On Mon, Aug 03, 2009 at 09:32:57AM -0400, Andrew Friedley wrote: > > While working on GSoC stuff I came across this weird performance > > behavior for sine and cosine -- using float32 is way slower than > > float64. On a 2ghz opteron: > > > > sin float32 1.12447786331 > > sin float64 0.133481025696 > > cos float32 1.14155912399 > > cos float64 0.131420135498 > > > > The times are in seconds, and are best of three runs of ten > > iterations of numpy.{sin,cos} over a 1000-element array (script > > attached). I've produced similar results on a PS3 system also. 
> > The opteron is running Python 2.6.1 and NumPy 1.3.0, while the PS3
> > has Python 2.5.1 and NumPy 1.1.1.
> >
> > I haven't jumped into the code yet, but does anyone know why
> > sin/cos are ~8.5x slower for 32-bit floats compared to 64-bit
> > doubles?
> >
> > Side question: I see people in emails writing things like 'timeit
> > foo(x)' and having it run some sort of standard benchmark, how
> > exactly do I do that? Is that some environment other than a normal
> > Python?
> >
> > Thanks,
> >
> > Andrew

> > import timeit

> > t = timeit.Timer("numpy.sin(a)",
> >         "import numpy\n"
> >         "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float32)")
> > print "sin float32", min(t.repeat(3, 10))

> > t = timeit.Timer("numpy.sin(a)",
> >         "import numpy\n"
> >         "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float64)")
> > print "sin float64", min(t.repeat(3, 10))

> > t = timeit.Timer("numpy.cos(a)",
> >         "import numpy\n"
> >         "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float32)")
> > print "cos float32", min(t.repeat(3, 10))

> > t = timeit.Timer("numpy.cos(a)",
> >         "import numpy\n"
> >         "a = numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=numpy.float64)")
> > print "cos float64", min(t.repeat(3, 10))

> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion at scipy.org
> > http://mail.scipy.org/mailman/listinfo/numpy-discussion

> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From charlesr.harris at gmail.com Tue Aug 4 22:42:40 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 4 Aug 2009 20:42:40 -0600
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: <20090805111833.645c6d93@cudos0803>
References: <4A76E709.9090100@indiana.edu>
	<20090803134556.GA31036@phare.normalesup.org>
	<20090805111833.645c6d93@cudos0803>
Message-ID: 

On Tue, Aug 4, 2009 at 7:18 PM, Jochen wrote:

> Hi all,
> I see something similar on my system.
> OK, I've just done a test. The system is Ubuntu 9.04 AMD64; there seems
> to be a regression for float32 with large input values:
>
> In [47]: a=np.random.rand(10000).astype(np.float32)
>
> In [48]: b=np.random.rand(10000).astype(np.float64)
>
> In [49]: c=1000*np.random.rand(10000).astype(np.float32)
>
> In [50]: d=1000*np.random.rand(1000).astype(np.float64)
>
> In [51]: %timeit -n 10 np.sin(a)
> 10 loops, best of 3: 251 µs per loop
>
> In [52]: %timeit -n 10 np.sin(b)
> 10 loops, best of 3: 395 µs per loop
>
> In [53]: %timeit -n 10 np.sin(c)
> 10 loops, best of 3: 5.65 ms per loop
>
> In [54]: %timeit -n 10 np.sin(d)
> 10 loops, best of 3: 87.7 µs per loop
>
> In [55]: %timeit -n 10 np.sin(c.astype(np.float64)).astype(np.float32)
> 10 loops, best of 3: 891 µs per loop
>

Is anyone with this problem *not* running ubuntu?

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From gael.varoquaux at normalesup.org Wed Aug 5 01:22:40 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Wed, 5 Aug 2009 07:22:40 +0200
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <49d6b3500908041703i55782893ice32802a58dcd2e3@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
	<49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
	<3d375d730908041143k5ead2616y34df06748ad57289@mail.gmail.com>
	<20090804184824.GB11772@phare.normalesup.org>
	<49d6b3500908041154g7383eed7w7cbd00ddea55f035@mail.gmail.com>
	<49d6b3500908041703i55782893ice32802a58dcd2e3@mail.gmail.com>
Message-ID: <20090805052240.GA6038@phare.normalesup.org>

On Tue, Aug 04, 2009 at 07:03:43PM -0500, Gökhan Sever wrote:
> I would not be surprised if someone brings a real python snake into the
> conference then :)

http://picasaweb.google.com/ziade.tarek/PyconFR#slideshow/5342502528927090354

From fperez.net at gmail.com Wed Aug 5 02:48:07 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Tue, 4 Aug 2009 23:48:07 -0700
Subject: [Numpy-discussion] [ANN] IPython 0.10 is out.
Message-ID: 

Hi all,

on behalf of the IPython development team, I'm happy to announce that
we've just put out IPython 0.10 final. Many thanks to all those who
contributed ideas, bug reports and code.

You can download it from the usual location:

- http://ipython.scipy.org/moin/Download: direct links to various formats
- http://ipython.scipy.org/dist: all files are stored here.

The official documentation for this release can be found at:

- http://ipython.scipy.org/doc/rel-0.10/html: as HTML pages.
- http://ipython.scipy.org/doc/rel-0.10/ipython.pdf: as a single PDF.

In brief, this release gathers all recent work and in a sense closes a
cycle of the current useful-but-internally-messy structure of the
IPython code. We are now well into the work of a major internal cleanup
that will inevitably change some APIs and will likely take some time to
stabilize, so the 0.10 release should be used for a while until the
dust settles on the development branch.

The 0.10 release fixes many bugs, including some very problematic ones
(a major memory leak with repeated %run is closed), and also brings a
number of new features, stability improvements and improved
documentation. Some highlights:

- Improved WX-based ipythonx and ipython-wx tools, suitable for
embedding into other applications and standalone use.

- Better interactive demos with the IPython.demo module.

- Refactored ipcluster with support for local execution, MPI, PBS and
systems with SSH key access preconfigured.

- Integration with the TextMate editor in the %edit command.

The full release notes are available here with all the details:

http://ipython.scipy.org/doc/rel-0.10/html/changes.html#release-0-10

We hope you enjoy it, please report any problems as usual either on
the mailing list, or by filing a bug report at our Launchpad tracker:

https://bugs.launchpad.net/ipython

Cheers,

The IPython team.

From dave.hirschfeld at gmail.com Wed Aug 5 03:40:04 2009
From: dave.hirschfeld at gmail.com (Dave)
Date: Wed, 5 Aug 2009 07:40:04 +0000 (UTC)
Subject: [Numpy-discussion] strange sin/cos performance
References: <4A76E709.9090100@indiana.edu>
	<20090803134556.GA31036@phare.normalesup.org>
	<20090805111833.645c6d93@cudos0803>
Message-ID: 

Charles R Harris <charlesr.harris at gmail.com> writes:

>
>
> Is anyone with this problem *not* running ubuntu? Chuck
>

All I can say is that it (surprisingly?) doesn't appear to affect my
windoze (XP) box.
Python 2.5.4 (r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit (Intel)]

In [2]: a=np.random.rand(10000).astype(np.float32)

In [3]: b=np.random.rand(10000).astype(np.float64)

In [4]: c=1000*np.random.rand(10000).astype(np.float32)

In [5]: d=1000*np.random.rand(1000).astype(np.float64)

In [6]: timeit -n 10 np.sin(a)
10 loops, best of 3: 442 us per loop

In [7]: timeit -n 10 np.sin(b)
10 loops, best of 3: 513 us per loop

In [8]: timeit -n 10 np.sin(c)
10 loops, best of 3: 474 us per loop

In [9]: timeit -n 10 np.sin(d)
10 loops, best of 3: 63.1 us per loop

In [10]: timeit -n 10 np.sin(c.astype(np.float64)).astype(np.float32)
10 loops, best of 3: 587 us per loop

In [11]: !gcc --version
gcc (GCC) 3.4.5 (mingw-vista special r3)

From bsouthey at gmail.com Wed Aug 5 04:40:17 2009
From: bsouthey at gmail.com (Bruce Southey)
Date: Wed, 5 Aug 2009 03:40:17 -0500
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: 
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
	<49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
Message-ID: 

On Tue, Aug 4, 2009 at 4:05 PM, Keith Goodman wrote:
> On Tue, Aug 4, 2009 at 1:53 PM, Bruce Southey wrote:
>> On Tue, Aug 4, 2009 at 1:40 PM, Gökhan Sever wrote:
>>> This is the loveliest of all solutions:
>>>
>>> c[isfinite(c)].mean()
>>
>> This handling of nonfinite elements has come up before.
>> Please remember that this is only for 1d or flattened arrays, so it
>> does not work in general, especially along an axis.
>
> If you don't want to use nanmean from scipy.stats you could use:
>
> np.nansum(c, axis=0) / (~np.isnan(c)).sum(axis=0)
>
> or
>
> np.nansum(c, axis=0) / (c == c).sum(axis=0)
>
> But if c contains ints then you'll run into trouble with the division,
> so you'll need to protect against that.

That is not a problem because nan and infinity are only defined for
floating point numbers, not integers. So any array that has nonfinite
elements like nans and infinity must have a floating point dtype.

Bruce

From bsouthey at gmail.com Wed Aug 5 05:20:12 2009
From: bsouthey at gmail.com (Bruce Southey)
Date: Wed, 5 Aug 2009 04:20:12 -0500
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: 
References: <4A76E709.9090100@indiana.edu>
	<20090803134556.GA31036@phare.normalesup.org>
	<20090805111833.645c6d93@cudos0803>
Message-ID: 

On Tue, Aug 4, 2009 at 9:42 PM, Charles R Harris wrote:
>
>
> On Tue, Aug 4, 2009 at 7:18 PM, Jochen wrote:
>>
>> Hi all,
>> I see something similar on my system.
>> OK, I've just done a test. The system is Ubuntu 9.04 AMD64; there seems
>> to be a regression for float32 with large input values:
>>
>> In [47]: a=np.random.rand(10000).astype(np.float32)
>>
>> In [48]: b=np.random.rand(10000).astype(np.float64)
>>
>> In [49]: c=1000*np.random.rand(10000).astype(np.float32)
>>
>> In [50]: d=1000*np.random.rand(1000).astype(np.float64)
>>
>> In [51]: %timeit -n 10 np.sin(a)
>> 10 loops, best of 3: 251 µs per loop
>>
>> In [52]: %timeit -n 10 np.sin(b)
>> 10 loops, best of 3: 395 µs per loop
>>
>> In [53]: %timeit -n 10 np.sin(c)
>> 10 loops, best of 3: 5.65 ms per loop
>>
>> In [54]: %timeit -n 10 np.sin(d)
>> 10 loops, best of 3: 87.7 µs per loop
>>
>> In [55]: %timeit -n 10 np.sin(c.astype(np.float64)).astype(np.float32)
>> 10 loops, best of 3: 891 µs per loop
>
> Is anyone with this problem *not* running ubuntu?
>

Yes, but I do not consider it a 'problem'. While I am not an expert in
this, it looks to be related to 64-bit OSes running on 64-bit
processors, probably compiler related, and probably a feature of Python.
As I have tried to show, I do not think these timings are being
performed correctly, because when you pass a single argument to Python
at the command prompt you get comparable timings. The difference in
timing occurs when you pass two arguments to Python.

I do not use IPython so I am only guessing, but you need to include the
array construction in the timed expression. Probably something like:

%timeit -n 10 np.sin(numpy.arange(0.0, 1000, (2 * 3.14159) / 1000, dtype=np.float32))

Note there is most likely a penalty involved in type conversion that
needs to be addressed in any timings.

Bruce

From david at ar.media.kyoto-u.ac.jp Wed Aug 5 07:45:30 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Wed, 05 Aug 2009 20:45:30 +0900
Subject: [Numpy-discussion] Funded work on Numpy: proposed improvements and request for feedback
In-Reply-To: 
References: <4A77A009.9060104@ar.media.kyoto-u.ac.jp>
Message-ID: <4A7970DA.50302@ar.media.kyoto-u.ac.jp>

Bruce Southey wrote:
> So if 'C99-like' is going to be the near term future, is there any
> point in supporting non-C99 environments with this work?
>

There may be a misunderstanding: if the platform supports C99 complex,
then we will use it, and otherwise, we will do as today, that is define
our own type.

The advantages of reusing the C99 complex type if available:
    - if you yourself do not care about portability, you can use the
numpy complex typedef as a C99 complex, using addition, division, etc...
operators.
    - we can reuse the math library.

I also need some sort of proper C99 support for windows 64 (more
exactly, to reimplement a minimal libgfortran buildable by the MS
compiler).

> That is, is the limitation in the compiler, operating system,
> processor or some combination of these?
>

That's purely a compiler issue. Of course, the main culprit is the MS
compiler. MS explicitly stated they did not care about proper C support.

cheers,

David

From jdh2358 at gmail.com Wed Aug 5 09:44:34 2009
From: jdh2358 at gmail.com (John Hunter)
Date: Wed, 5 Aug 2009 08:44:34 -0500
Subject: [Numpy-discussion] yubnub and numpy examples
Message-ID: <88e473830908050644t71825829uc965ad213e652b3@mail.gmail.com>

yubnub is pretty cool -- it's a command line interface for the web.
You can enable it in firefox by typing "about:config" in the URL bar,
scrolling down to "keyword.URL", right-click on the line and choose
modify, and set the value to be

http://www.yubnub.org/parser/parse?default=g2&command=

Then, you can type yubnub commands in the URL bar, e.g., to see all
commands related to python, type "ls python" in the URL bar.

It's easy to create new commands; I just created a new command to load
the docs for a numpy function; just type in the URL bar:

  npfunc convolve

which takes you directly to
http://docs.scipy.org/doc/numpy/reference/generated/numpy.convolve.html

I was hoping to create a similar command for the numpy examples, but
the URL links in http://www.scipy.org/Numpy_Example_List_With_Doc are
some md5 gobbledy-gook. Is it possible to have nice URLs on this
page, so they can be more readily yubnub-ized?
JDH

From daniel.wheeler2 at gmail.com Wed Aug 5 10:20:14 2009
From: daniel.wheeler2 at gmail.com (Daniel Wheeler)
Date: Wed, 5 Aug 2009 10:20:14 -0400
Subject: [Numpy-discussion] PDE BoF at SciPy2009
In-Reply-To: <1963DA80-8CE5-4033-BCC8-EBEF05352AAB@gmail.com>
References: <1963DA80-8CE5-4033-BCC8-EBEF05352AAB@gmail.com>
Message-ID: <80b160a0908050720i11a147d0ibc6e40f4762fb5f3@mail.gmail.com>

On Mon, Aug 3, 2009 at 3:57 PM, Chris Kees wrote:
> Is there any interest in a BoF session on implementing numerical
> methods for partial differential equations using modules like numpy,
> cython, mpi4py, etc.?

Yes! My colleague, Jon Guyer, will be attending the meeting and
speaking on this subject. He isn't on this list. He will be there from
midday on the Wednesday of the conference. Is this BoF still of interest?

-- 
Daniel Wheeler

From kwgoodman at gmail.com Wed Aug 5 10:18:17 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Wed, 5 Aug 2009 07:18:17 -0700
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: 
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
	<49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
Message-ID: 

On Wed, Aug 5, 2009 at 1:40 AM, Bruce Southey wrote:
> On Tue, Aug 4, 2009 at 4:05 PM, Keith Goodman wrote:
>> On Tue, Aug 4, 2009 at 1:53 PM, Bruce Southey wrote:
>>> On Tue, Aug 4, 2009 at 1:40 PM, Gökhan Sever wrote:
>>>> This is the loveliest of all solutions:
>>>>
>>>> c[isfinite(c)].mean()
>>>
>>> This handling of nonfinite elements has come up before.
>>> Please remember that this is only for 1d or flattened arrays, so it
>>> does not work in general, especially along an axis.
>>
>> If you don't want to use nanmean from scipy.stats you could use:
>>
>> np.nansum(c, axis=0) / (~np.isnan(c)).sum(axis=0)
>>
>> or
>>
>> np.nansum(c, axis=0) / (c == c).sum(axis=0)
>>
>> But if c contains ints then you'll run into trouble with the division,
>> so you'll need to protect against that.
>
> That is not a problem because nan and infinity are only defined for
> floating point numbers, not integers. So any array that has nonfinite
> elements like nans and infinity must have a floating point dtype.

That is true. But I was thinking of this case (no nans or infs):

>> c
array([[1, 2, 3],
       [4, 5, 6]])
>> c.mean(0)
array([ 2.5,  3.5,  4.5])   <--- good
>> np.nansum(c, axis=0) / (c == c).sum(axis=0)
array([2, 3, 4])   <--- bad
>> np.nansum(c, axis=0) / (c == c).sum(axis=0, dtype=np.float)
array([ 2.5,  3.5,  4.5])   <--- good

From josef.pktd at gmail.com Wed Aug 5 10:30:55 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 5 Aug 2009 10:30:55 -0400
Subject: [Numpy-discussion] yubnub and numpy examples
In-Reply-To: <88e473830908050644t71825829uc965ad213e652b3@mail.gmail.com>
References: <88e473830908050644t71825829uc965ad213e652b3@mail.gmail.com>
Message-ID: <1cd32cbb0908050730h1ca6262y8d535f566295af9f@mail.gmail.com>

On Wed, Aug 5, 2009 at 9:44 AM, John Hunter wrote:
> yubnub is pretty cool -- it's a command line interface for the web.
> You can enable it in firefox by typing "about:config" in the URL bar,
> scrolling down to "keyword.URL", right-click on the line and choose
> modify, and set the value to be
>
> http://www.yubnub.org/parser/parse?default=g2&command=
>
> Then, you can type yubnub commands in the URL bar, e.g., to see all
> commands related to python, type "ls python" in the URL bar.
>
> It's easy to create new commands; I just created a new command to load
> the docs for a numpy function; just type in the URL bar:
>
>   npfunc convolve
>
> which takes you directly to
> http://docs.scipy.org/doc/numpy/reference/generated/numpy.convolve.html
>
> I was hoping to create a similar command for the numpy examples, but
> the URL links in http://www.scipy.org/Numpy_Example_List_With_Doc are
> some md5 gobbledy-gook. Is it possible to have nice URLs on this
> page, so they can be more readily yubnub-ized?
>
> JDH
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

looks pretty good, but I would recommend a safe install instead of
overwriting the keyword default. This requires typing one additional
letter, e.g. "y npfunc convolve" (and avoids invalidating the firefox
warranty, and you can do the same with other search/link shortcuts).

Josef

from http://www.yubnub.org/documentation/describe_installation

"""

Safe Firefox Installation. The safest way to install YubNub is to make
a Firefox keyword for it. If you're using the Firefox web browser:

    * Right-click the input box at the top of the page (the one under
the words "Type in a command")
    * Click "Add a Keyword for this Search"
    * For the Name, enter "YubNub", and for the Keyword, enter "y"
    * Press OK

Now you can use YubNub directly from the address bar. For example, try
typing "y gim porsche 911" into your address bar. Don't forget the "y"
in front!

You may have noticed that I said that this is the "safest way" to
install YubNub. Why safest? Because you must explicitly enter a "y"
before the YubNub command. This prevents "command spoofing".

For example, suppose someone made a "michael" command. If you typed
"michael jordan" into YubNub, intending to do a search, you would
instead go to the site of the person who made the "michael" command.
Rats! But if you installed YubNub into your Firefox address bar as
described above, typing "michael jordan" into your address bar would
do a search for "michael jordan", as you intended. The only way to get
to that other person's site would be to type "y michael".

If you like to live on the edge like me, you can try one of the other
installation methods, many of which do not require an initial keyword
like "y".

"""

From meine at informatik.uni-hamburg.de Wed Aug 5 10:41:21 2009
From: meine at informatik.uni-hamburg.de (Hans Meine)
Date: Wed, 5 Aug 2009 16:41:21 +0200
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: 
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
	<20090804195936.GF11772@phare.normalesup.org>
Message-ID: <200908051641.21306.meine@informatik.uni-hamburg.de>

On Tuesday 04 August 2009 22:06:38 Keith Goodman wrote:
> On Tue, Aug 4, 2009 at 12:59 PM, Gael Varoquaux wrote:
> > On Tue, Aug 04, 2009 at 01:54:49PM -0500, Gökhan Sever wrote:
> >> I see that you should have a browser embedding plugin for IPython
> >> which you don't want to share with us :)
> >
> > No, I answer e-mail using vim.
>
> Yeah, I'm trying that right now.
> :wq
> :q!
> :dammit

Vim? Isn't that the editor with the two modes, one which destroys your
text and one that beeps? ;-)

Have a nice day,
  Hans

PS: Yes, it's a free translation of a German chat (IRC/bash?) citation.
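To round off the NaN-handling subthread above, here is a self-contained
version of Keith Goodman's axis-aware mean with the float cast that
guards against integer division. This is an illustrative sketch (the
function name is ours, not from the thread), not scipy.stats.nanmean
itself:

import numpy as np

def mean_ignoring_nans(a, axis=None):
    # Count the non-NaN entries as float64 so the division below cannot
    # fall back to integer division when 'a' has an integer dtype.
    counts = (~np.isnan(a)).sum(axis=axis, dtype=np.float64)
    return np.nansum(a, axis=axis) / counts

c = np.array([[1.0, np.nan, 3.0],
              [4.0, 5.0,    np.nan]])
print mean_ignoring_nans(c, axis=0)   # -> [ 2.5  5.   3. ]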
From ralf.gommers at googlemail.com Wed Aug 5 10:49:43 2009
From: ralf.gommers at googlemail.com (Ralf Gommers)
Date: Wed, 5 Aug 2009 10:49:43 -0400
Subject: [Numpy-discussion] yubnub and numpy examples
In-Reply-To: <88e473830908050644t71825829uc965ad213e652b3@mail.gmail.com>
References: <88e473830908050644t71825829uc965ad213e652b3@mail.gmail.com>
Message-ID: 

On Wed, Aug 5, 2009 at 9:44 AM, John Hunter wrote:

> yubnub is pretty cool -- it's a command line interface for the web.
> You can enable it in firefox by typing "about:config" in the URL bar,
> scrolling down to "keyword.URL", right-click on the line and choose
> modify, and set the value to be
>
> http://www.yubnub.org/parser/parse?default=g2&command=
>
> Then, you can type yubnub commands in the URL bar, e.g., to see all
> commands related to python, type "ls python" in the URL bar.
>
> It's easy to create new commands; I just created a new command to load
> the docs for a numpy function; just type in the URL bar:
>
> npfunc convolve

very cool, thanks!

>
>
> which takes you directly to
> http://docs.scipy.org/doc/numpy/reference/generated/numpy.convolve.html
>
> I was hoping to create a similar command for the numpy examples, but
> the URL links in http://www.scipy.org/Numpy_Example_List_With_Doc are
> some md5 gobbledy-gook. Is it possible to have nice URLs on this
> page, so they can be more readily yubnub-ized?

Most of those examples have been integrated in the docstrings, and many
more have been written in the doc wiki. They also use "from numpy
import *" instead of the np namespace. So instead of spending time
fixing links, it might make more sense to generate a new version of
this page (with more useful links) from the docstrings themselves.

Cheers,
Ralf

>
> JDH
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From josef.pktd at gmail.com Wed Aug 5 10:52:43 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 5 Aug 2009 10:52:43 -0400
Subject: [Numpy-discussion] yubnub and numpy examples
In-Reply-To: <1cd32cbb0908050730h1ca6262y8d535f566295af9f@mail.gmail.com>
References: <88e473830908050644t71825829uc965ad213e652b3@mail.gmail.com>
	<1cd32cbb0908050730h1ca6262y8d535f566295af9f@mail.gmail.com>
Message-ID: <1cd32cbb0908050752rf8b7bf2n8cd330e345992da9@mail.gmail.com>

On Wed, Aug 5, 2009 at 10:30 AM, wrote:
> On Wed, Aug 5, 2009 at 9:44 AM, John Hunter wrote:
>> yubnub is pretty cool -- it's a command line interface for the web.
>> You can enable it in firefox by typing "about:config" in the URL bar,
>> scrolling down to "keyword.URL", right-click on the line and choose
>> modify, and set the value to be
>>
>> http://www.yubnub.org/parser/parse?default=g2&command=
>>
>> Then, you can type yubnub commands in the URL bar, e.g., to see all
>> commands related to python, type "ls python" in the URL bar.
>>
>> It's easy to create new commands; I just created a new command to load
>> the docs for a numpy function; just type in the URL bar:
>>
>> npfunc convolve
>>
>> which takes you directly to
>> http://docs.scipy.org/doc/numpy/reference/generated/numpy.convolve.html

Still, it is a lot slower than windows htmlhelp, which is available
for numpy and scipy but not for others.
"y mplcodex histogram" takes pretty long to load >> >> I was hoping to create a similar command for the numpy examples, but >> the URL links in http://www.scipy.org/Numpy_Example_List_With_Doc are >> some md5 gobbledy-gook. ?Is it possible to have nice URLs on this >> page, so they can be more readily yubnub-ized? my impression of the example list page: This page is not really maintained anymore, it is still at numpy 1.2.1 and mostly superseded by the new docs, with examples as part of the docstrings. (Also because of it's page size, I think it's more appropriate for browsing than for quick lookups.) Josef >> >> JDH >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > > looks pretty good, but I would recommend a safe install, instead of > overwriting the keyword default. This requires typing one additional > letter, e.g "y npfunc convolve", (and avoids invalidating the firefox > warranty, and you can do the same with other search/link shortcuts) > > Josef > > > from http://www.yubnub.org/documentation/describe_installation > > """ > > Safe Firefox Installation. The safest way to install YubNub is to make > a Firefox keyword for it. If you're using the Firefox web browser: > > ? ?* Right-click the input box at the top of the page (the one under > the words "Type in a command") > ? ?* Click "Add a Keyword for this Search" > ? ?* For the Name, enter "YubNub", and for the Keyword, enter "y" > ? ?* Press OK > > Now you can use YubNub directly from the address bar. For example, try > typing "y gim porsche 911" into your address bar. Don't forget the "y" > in front! > You may have noticed that I said that this is the "safest way" to > install YubNub. Why safest? Because you must explicitly enter a "y" > before the YubNub command. This prevents "command spoofing". > > For example, suppose someone made a "michael" command. If you typed > "michael jordan" into YubNub, intending to do a search, you would > instead go to the site of the person who made the "michael" command. > Rats! But if you installed YubNub into your Firefox address bar as > described above, typing "michael jordan" into your address bar would > do a search for "michael jordan", as you intended. The only way to get > to that other person's site would be to type "y michael". > > If you like to live on the edge like me, you can try one of the other > installation methods, many of which do not require an initial keyword > like "y". > > """ > From bsouthey at gmail.com Wed Aug 5 10:04:47 2009 From: bsouthey at gmail.com (Bruce Southey) Date: Wed, 05 Aug 2009 09:04:47 -0500 Subject: [Numpy-discussion] Funded work on Numpy: proposed improvements and request for feedback In-Reply-To: <4A7970DA.50302@ar.media.kyoto-u.ac.jp> References: <4A77A009.9060104@ar.media.kyoto-u.ac.jp> <4A7970DA.50302@ar.media.kyoto-u.ac.jp> Message-ID: <4A79917F.7020108@gmail.com> On 08/05/2009 06:45 AM, David Cournapeau wrote: > Bruce Southey wrote: > >> So if 'C99-like' is going to be the near term future, is there any >> point in supporting non-C99 environments with this work? >> >> > > There may be a misunderstanding: Really ignorance :-) > if the platform support C99 complex, > then we will use it, and otherwise, we will do as today, that is define > our own type. > Actually I did understand that much. 
> The advantages of reusing the C99 complex type if available:
>     - if you yourself do not care about portability, you can use the
> numpy complex typedef as a C99 complex, using addition, division, etc...
> operators.
>     - we can reuse the math library.
> I also need some sort of proper C99 support for windows 64 (more
> exactly, to reimplement a minimal libgfortran buildable by the MS
> compiler).
>
>> That is, is the limitation in the compiler, operating system,
>> processor or some combination of these?
>>
>
> That's purely a compiler issue. Of course, the main culprit is the MS
> compiler. MS explicitly stated they did not care about proper C support.
>

Obviously complicated by the distribution of the official Python MS
compiled binaries.

Ultimately, I am looking at long-term maintenance, for when people have
moved on and the code gets somewhat stale. Definitely your proposal
would help the long-term maintenance of Numpy on C99-supporting
compilers if included. So my concern is avoiding divergence of the code
base between Numpy and the library, so that there is no unnecessary code
duplication, no need to merge code in the future, and fixes (bugs or
enhancements) are made once and apply to both. Provided these aspects
are addressed, I have no problems with the proposal.

Bruce
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From afriedle at indiana.edu Wed Aug 5 11:19:52 2009
From: afriedle at indiana.edu (Andrew Friedley)
Date: Wed, 05 Aug 2009 11:19:52 -0400
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: 
References: <4A76E709.9090100@indiana.edu>
	<20090803134556.GA31036@phare.normalesup.org>
	<20090805111833.645c6d93@cudos0803>
Message-ID: <4A79A318.8050907@indiana.edu>

> Is anyone with this problem *not* running ubuntu?

Me - RHEL 5.2 opteron:

Python 2.6.1 (r261:67515, Jan  5 2009, 10:19:01)
[GCC 4.1.2 20071124 (Red Hat 4.1.2-42)] on linux2

Fedora 9 PS3/PPC:

Python 2.5.1 (r251:54863, Jul 17 2008, 13:25:23)
[GCC 4.3.1 20080708 (Red Hat 4.3.1-4)] on linux2

Actually I now have some interesting results that indicate the issue
isn't in Python or NumPy at all. I just wrote a C program to try to
reproduce the error, and was able to do so (actually the difference is
even larger).

Opteron:

float (32) time in usecs: 179698
double (64) time in usecs: 13795

PS3/PPC:

float (32) time in usecs: 614821
double (64) time in usecs: 37163

I've attached the code for others to review and/or try out. I guess
this is worth showing to the libc people?

Andrew
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: cos.c
URL: 

From bsouthey at gmail.com Wed Aug 5 11:20:43 2009
From: bsouthey at gmail.com (Bruce Southey)
Date: Wed, 05 Aug 2009 10:20:43 -0500
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: 
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
	<49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
Message-ID: <4A79A34B.8070102@gmail.com>

On 08/05/2009 09:18 AM, Keith Goodman wrote:
> On Wed, Aug 5, 2009 at 1:40 AM, Bruce Southey wrote:
>
>> On Tue, Aug 4, 2009 at 4:05 PM, Keith Goodman wrote:
>>
>>> On Tue, Aug 4, 2009 at 1:53 PM, Bruce Southey wrote:
>>>
>>>> On Tue, Aug 4, 2009 at 1:40 PM, Gökhan Sever wrote:
>>>>
>>>>> This is the loveliest of all solutions:
>>>>>
>>>>> c[isfinite(c)].mean()
>>>>>
>>>> This handling of nonfinite elements has come up before.
>>>> Please remember that this is only for 1d or flattened arrays, so it
>>>> does not work in general, especially along an axis.
>>>>
>>> If you don't want to use nanmean from scipy.stats you could use:
>>>
>>> np.nansum(c, axis=0) / (~np.isnan(c)).sum(axis=0)
>>>
>>> or
>>>
>>> np.nansum(c, axis=0) / (c == c).sum(axis=0)
>>>
>>> But if c contains ints then you'll run into trouble with the division,
>>> so you'll need to protect against that.
>>>
>> That is not a problem because nan and infinity are only defined for
>> floating point numbers, not integers. So any array that has nonfinite
>> elements like nans and infinity must have a floating point dtype.
>>
>
> That is true. But I was thinking of this case (no nans or infs):
>
>>> c
>>>
> array([[1, 2, 3],
>        [4, 5, 6]])
>
>>> c.mean(0)
>>>
> array([ 2.5,  3.5,  4.5])  <--- good
>
>>> np.nansum(c, axis=0) / (c == c).sum(axis=0)
>>>
> array([2, 3, 4])  <--- bad
>
>>> np.nansum(c, axis=0) / (c == c).sum(axis=0, dtype=np.float)
>>>
> array([ 2.5,  3.5,  4.5])  <--- good
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

Sure, but that is about ints versus floats and not about nans or infs.
Your 'good' examples are really about first converting an int array
into a float array, and your 'bad' example maintains the int dtype (you
get the same result if you cast the arrays from the 'good' approaches
back to an int dtype).

The correct answer depends on what you want the dtype to be. For example,
with floating point division:
np.mean(c/0.0, axis=0)

gives the expected floating point answer:
array([ Inf,  Inf,  Inf])

With integer division:
np.mean(c/0, axis=0)

gives the expected integer answer:
array([ 0.,  0.,  0.])

Note the default action of mean is to convert ints to float64, which is
why the output is a float instead of an int, although the numpy.mean
dtype argument does not appear to work for int dtypes.

Bruce
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From scott.sinclair.za at gmail.com Wed Aug 5 11:26:39 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Wed, 5 Aug 2009 17:26:39 +0200
Subject: [Numpy-discussion] strange sin/cos performance
In-Reply-To: <4A79A318.8050907@indiana.edu>
References: <4A76E709.9090100@indiana.edu>
	<20090803134556.GA31036@phare.normalesup.org>
	<20090805111833.645c6d93@cudos0803>
	<4A79A318.8050907@indiana.edu>
Message-ID: <6a17e9ee0908050826q595364a9pe9b2f53d8bd65482@mail.gmail.com>

> 2009/8/5 Andrew Friedley :
>
>> Is anyone with this problem *not* running ubuntu?
>
> Me - RHEL 5.2 opteron:
>
> Python 2.6.1 (r261:67515, Jan  5 2009, 10:19:01)
> [GCC 4.1.2 20071124 (Red Hat 4.1.2-42)] on linux2
>
> Fedora 9 PS3/PPC:
>
> Python 2.5.1 (r251:54863, Jul 17 2008, 13:25:23)
> [GCC 4.3.1 20080708 (Red Hat 4.3.1-4)] on linux2
>
> Actually I now have some interesting results that indicate the issue
> isn't in Python or NumPy at all. I just wrote a C program to try to
> reproduce the error, and was able to do so (actually the difference is
> even larger).
>
> Opteron:
>
> float (32) time in usecs: 179698
> double (64) time in usecs: 13795
>
> PS3/PPC:
>
> float (32) time in usecs: 614821
> double (64) time in usecs: 37163
>
> I've attached the code for others to review and/or try out. I guess
> this is worth showing to the libc people?

For whatever it's worth, not much difference on my machine
32-bit Ubuntu, GCC 4.3.3.
float (32) time in usecs: 13804
double (64) time in usecs: 15394

Cheers,
Scott

From d_l_goldsmith at yahoo.com Wed Aug 5 13:12:13 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 10:12:13 -0700 (PDT)
Subject: [Numpy-discussion] PDE BoF at SciPy2009
In-Reply-To: <80b160a0908050720i11a147d0ibc6e40f4762fb5f3@mail.gmail.com>
Message-ID: <487426.84589.qm@web52103.mail.re2.yahoo.com>

I already replied to OP, but I'll say publicly:

"+1", as long as it's not at the same time as the as-yet-potential BoF
on "the Future of SciPy".

DG

--- On Wed, 8/5/09, Daniel Wheeler wrote:

> From: Daniel Wheeler
> Subject: Re: [Numpy-discussion] PDE BoF at SciPy2009
> To: "Discussion of Numerical Python"
> Date: Wednesday, August 5, 2009, 7:20 AM
> On Mon, Aug 3, 2009 at 3:57 PM, Chris Kees wrote:
> > Is there any interest in a BoF session on implementing numerical
> > methods for partial differential equations using modules like numpy,
> > cython, mpi4py, etc.?
>
> Yes! My colleague, Jon Guyer, will be attending the meeting and
> speaking on this subject. He isn't on this list. He will be there from
> midday on the Wednesday of the conference. Is this BoF still of interest?
>
> --
> Daniel Wheeler
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From romain.brette at ens.fr Wed Aug 5 06:45:42 2009
From: romain.brette at ens.fr (Romain Brette)
Date: Wed, 5 Aug 2009 10:45:42 +0000 (UTC)
Subject: [Numpy-discussion] GPU Numpy
Message-ID: 

Hi everyone,

I was wondering if you had any plan to incorporate some GPU support
into numpy, or perhaps as a separate module. What I have in mind is
something that would mimic the syntax of numpy arrays, with a new dtype
(gpufloat), like this:

from gpunumpy import *
x=zeros(100,dtype='gpufloat') # Creates an array of 100 elements on the GPU
y=ones(100,dtype='gpufloat')
z=exp(2*x+y) # z is on the GPU, all operations on GPU with no transfer
z_cpu=array(z,dtype='float') # z is copied to the CPU
i=(z>2.3).nonzero()[0] # operation on GPU, returns a CPU integer array

I came across a paper about something like that but couldn't find any
public release:
http://www.tricity.wsu.edu/~bobl/personal/mypubs/2009_gpupy_toms.pdf

There is a library named GPULib (http://www.txcorp.com/products/GPULib/)
that does similar things, but unfortunately they don't support Python
(I think their main Python developer left).

I think this would be very useful for many people. For our project (a
neural network simulator, http://www.briansimulator.org) we use PyCuda
(http://mathema.tician.de/software/pycuda), which is great, but it is
mainly for low-level GPU programming.

Cheers
Romain

From cekees at gmail.com Wed Aug 5 14:23:40 2009
From: cekees at gmail.com (Chris Kees)
Date: Wed, 5 Aug 2009 13:23:40 -0500
Subject: [Numpy-discussion] PDE BoF at SciPy2009
In-Reply-To: <487426.84589.qm@web52103.mail.re2.yahoo.com>
References: <487426.84589.qm@web52103.mail.re2.yahoo.com>
Message-ID: <6B54DD7C-8B1C-4886-9822-C1E8210945CD@gmail.com>

OK. I contacted several attendees who are not on the numpy list, and
it looks like we've got six or seven people interested.

I've never been to the conference or organized a session like this.
Any guidance?

Chris

On Aug 5, 2009, at 12:12 PM, David Goldsmith wrote:

> I already replied to OP, but I'll say publicly:
>
> "+1", as long as it's not at the same time as the as-yet-potential
> BoF on "the Future of SciPy".
>
> DG
>
> --- On Wed, 8/5/09, Daniel Wheeler wrote:
>
>> From: Daniel Wheeler
>> Subject: Re: [Numpy-discussion] PDE BoF at SciPy2009
>> To: "Discussion of Numerical Python"
>> Date: Wednesday, August 5, 2009, 7:20 AM
>> On Mon, Aug 3, 2009 at 3:57 PM, Chris Kees wrote:
>>> Is there any interest in a BoF session on implementing numerical
>>> methods for partial differential equations using modules like numpy,
>>> cython, mpi4py, etc.?
>>
>> Yes! My colleague, Jon Guyer, will be attending the meeting and
>> speaking on this subject. He isn't on this list. He will be there from
>> midday on the Wednesday of the conference. Is this BoF still of
>> interest?
>>
>> --
>> Daniel Wheeler
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at scipy.org
>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>>
>
>
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From gael.varoquaux at normalesup.org Wed Aug 5 14:27:02 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Wed, 5 Aug 2009 20:27:02 +0200
Subject: [Numpy-discussion] PDE BoF at SciPy2009
In-Reply-To: <6B54DD7C-8B1C-4886-9822-C1E8210945CD@gmail.com>
References: <487426.84589.qm@web52103.mail.re2.yahoo.com>
	<6B54DD7C-8B1C-4886-9822-C1E8210945CD@gmail.com>
Message-ID: <20090805182702.GB26054@phare.normalesup.org>

On Wed, Aug 05, 2009 at 01:23:40PM -0500, Chris Kees wrote:
> OK. I contacted several attendees who are not on the numpy list, and
> it looks like we've got six or seven people interested.

> I've never been to the conference or organized a session like this.
> Any guidance?

Just contact one of the organisers during the conference (as early as
possible) and we'll sort out the room. It will happen on one of the two
evenings, preferably on Thursday.

Gaël

From charlesr.harris at gmail.com Wed Aug 5 14:34:27 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 5 Aug 2009 12:34:27 -0600
Subject: [Numpy-discussion] GPU Numpy
In-Reply-To: 
References: 
Message-ID: 

On Wed, Aug 5, 2009 at 4:45 AM, Romain Brette wrote:

> Hi everyone,
>
> I was wondering if you had any plan to incorporate some GPU support
> into numpy, or perhaps as a separate module. What I have in mind is
> something that would mimic the syntax of numpy arrays, with a new dtype
> (gpufloat), like this:
>
> from gpunumpy import *
> x=zeros(100,dtype='gpufloat') # Creates an array of 100 elements on the GPU
> y=ones(100,dtype='gpufloat')
> z=exp(2*x+y) # z is on the GPU, all operations on GPU with no transfer
> z_cpu=array(z,dtype='float') # z is copied to the CPU
> i=(z>2.3).nonzero()[0] # operation on GPU, returns a CPU integer array
>
> I came across a paper about something like that but couldn't find any
> public release:
> http://www.tricity.wsu.edu/~bobl/personal/mypubs/2009_gpupy_toms.pdf
>
> There is a library named GPULib (http://www.txcorp.com/products/GPULib/)
> that does similar things, but unfortunately they don't support Python
> (I think their main Python developer left).
> I think this would be very useful for many people. For our project (a
> neural network simulator, http://www.briansimulator.org) we use PyCuda
> (http://mathema.tician.de/software/pycuda), which is great, but it is
> mainly for low-level GPU programming.
>

What sort of functionality are you looking for?
It could be that you could slip in a small mod that would do what you
want. In the larger picture, the use of GPUs has been discussed on the
list several times going back at least a year. The main problems with
using GPUs were that CUDA was only available for nvidia video cards and
there didn't seem to be any hope for a CUDA version of LAPACK.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From d_l_goldsmith at yahoo.com Wed Aug 5 14:37:17 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 11:37:17 -0700 (PDT)
Subject: [Numpy-discussion] PDE BoF at SciPy2009
In-Reply-To: <6B54DD7C-8B1C-4886-9822-C1E8210945CD@gmail.com>
Message-ID: <802291.90959.qm@web52105.mail.re2.yahoo.com>

Lots of food and alcohol! (Just kidding.)

DG

--- On Wed, 8/5/09, Chris Kees wrote:

> From: Chris Kees
> Subject: Re: [Numpy-discussion] PDE BoF at SciPy2009
> To: "Discussion of Numerical Python"
> Date: Wednesday, August 5, 2009, 11:23 AM
> OK. I contacted several attendees who are not on the numpy list, and
> it looks like we've got six or seven people interested.
>
> I've never been to the conference or organized a session like this.
> Any guidance?
>
> Chris
>
> On Aug 5, 2009, at 12:12 PM, David Goldsmith wrote:
>
> > I already replied to OP, but I'll say publicly:
> >
> > "+1", as long as it's not at the same time as the as-yet-potential
> > BoF on "the Future of SciPy".
> >
> > DG
> >
> > --- On Wed, 8/5/09, Daniel Wheeler wrote:
> >
> >> From: Daniel Wheeler
> >> Subject: Re: [Numpy-discussion] PDE BoF at SciPy2009
> >> To: "Discussion of Numerical Python"
> >> Date: Wednesday, August 5, 2009, 7:20 AM
> >> On Mon, Aug 3, 2009 at 3:57 PM, Chris Kees wrote:
> >>> Is there any interest in a BoF session on implementing numerical
> >>> methods for partial differential equations using modules like
> >>> numpy, cython, mpi4py, etc.?
> >>
> >> Yes! My colleague, Jon Guyer, will be attending the meeting and
> >> speaking on this subject. He isn't on this list. He will be there
> >> from midday on the Wednesday of the conference. Is this BoF still
> >> of interest?
> >>
> >> --
> >> Daniel Wheeler
> >> _______________________________________________
> >> NumPy-Discussion mailing list
> >> NumPy-Discussion at scipy.org
> >> http://mail.scipy.org/mailman/listinfo/numpy-discussion
> >>
> >
> >
> >
> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion at scipy.org
> > http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From charlesr.harris at gmail.com Wed Aug 5 14:47:16 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 5 Aug 2009 12:47:16 -0600
Subject: [Numpy-discussion] BOF c coders.
Message-ID: 

Hi All,

At the present time David C. and I are doing most of the work in the
numpy c code base. I am wondering if there are more people out there
who might want to get involved in that end of things and if there are
ways we can help them get started. If folks are interested we could
have a BOF meeting at the SciPy conference.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From pgmdevlist at gmail.com Wed Aug 5 15:11:46 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Wed, 5 Aug 2009 15:11:46 -0400
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <4A79A34B.8070102@gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
	<49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
	<4A79A34B.8070102@gmail.com>
Message-ID: 

And, er... masked arrays anyone?

On Aug 5, 2009, at 11:20 AM, Bruce Southey wrote:

> On 08/05/2009 09:18 AM, Keith Goodman wrote:
>>
>> On Wed, Aug 5, 2009 at 1:40 AM, Bruce Southey wrote:
>>
>>> On Tue, Aug 4, 2009 at 4:05 PM, Keith Goodman wrote:
>>>
>>>> On Tue, Aug 4, 2009 at 1:53 PM, Bruce Southey wrote:
>>>>
>>>>> On Tue, Aug 4, 2009 at 1:40 PM, Gökhan Sever wrote:
>>>>>
>>>>>> This is the loveliest of all solutions:
>>>>>>
>>>>>> c[isfinite(c)].mean()
>>>>>>
>>>>> This handling of nonfinite elements has come up before.
>>>>> Please remember that this is only for 1d or flattened arrays, so
>>>>> it does not work in general, especially along an axis.
>>>>>
>>>> If you don't want to use nanmean from scipy.stats you could use:
>>>>
>>>> np.nansum(c, axis=0) / (~np.isnan(c)).sum(axis=0)
>>>>
>>>> or
>>>>
>>>> np.nansum(c, axis=0) / (c == c).sum(axis=0)
>>>>
>>>> But if c contains ints then you'll run into trouble with the
>>>> division, so you'll need to protect against that.
>>>>
>>> That is not a problem because nan and infinity are only defined for
>>> floating point numbers, not integers. So any array that has
>>> nonfinite elements like nans and infinity must have a floating
>>> point dtype.
>>>
>>
>> That is true. But I was thinking of this case (no nans or infs):
>>
>>>> c
>>>>
>> array([[1, 2, 3],
>>        [4, 5, 6]])
>>
>>>> c.mean(0)
>>>>
>> array([ 2.5,  3.5,  4.5])  <--- good
>>
>>>> np.nansum(c, axis=0) / (c == c).sum(axis=0)
>>>>
>> array([2, 3, 4])  <--- bad
>>
>>>> np.nansum(c, axis=0) / (c == c).sum(axis=0, dtype=np.float)
>>>>
>> array([ 2.5,  3.5,  4.5])  <--- good
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at scipy.org
>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>>
> Sure, but that is about ints versus floats and not about nans or
> infs. Your 'good' examples are really about first converting an int
> array into a float array, and your 'bad' example maintains the int
> dtype (you get the same result if you cast the arrays from the 'good'
> approaches back to an int dtype).
>
> The correct answer depends on what you want the dtype to be. For
> example, with floating point division:
> np.mean(c/0.0, axis=0)
>
> gives the expected floating point answer:
> array([ Inf,  Inf,  Inf])
>
> With integer division:
> np.mean(c/0, axis=0)
>
> gives the expected integer answer:
> array([ 0.,  0.,  0.])
>
> Note the default action of mean is to convert ints to float64, which
> is why the output is a float instead of an int, although the
> numpy.mean dtype argument does not appear to work for int dtypes.
>
>
> Bruce
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From robert.kern at gmail.com Wed Aug 5 15:14:28 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 5 Aug 2009 14:14:28 -0500
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: 
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
	<49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
	<4A79A34B.8070102@gmail.com>
Message-ID: <3d375d730908051214x4603fd7bve1864799b2dd4d5d@mail.gmail.com>

On Wed, Aug 5, 2009 at 14:11, Pierre GM wrote:
>
> And, er... masked arrays anyone?

That was what I suggested. The very first response, even.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From pgmdevlist at gmail.com Wed Aug 5 15:20:22 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Wed, 5 Aug 2009 15:20:22 -0400
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <3d375d730908051214x4603fd7bve1864799b2dd4d5d@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
	<49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
	<4A79A34B.8070102@gmail.com>
	<3d375d730908051214x4603fd7bve1864799b2dd4d5d@mail.gmail.com>
Message-ID: 

On Aug 5, 2009, at 3:14 PM, Robert Kern wrote:
> On Wed, Aug 5, 2009 at 14:11, Pierre GM wrote:
>>
>> And, er... masked arrays anyone?
>
> That was what I suggested. The very first response, even.

I know, Robert, and I thank you for that. My comment was intended for
the later posters...

From kwgoodman at gmail.com Wed Aug 5 15:20:31 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Wed, 5 Aug 2009 12:20:31 -0700
Subject: [Numpy-discussion] Why NaN?
In-Reply-To: <3d375d730908051214x4603fd7bve1864799b2dd4d5d@mail.gmail.com>
References: <49d6b3500908040946v2a06e615t7f77bffabf22e066@mail.gmail.com>
	<49d6b3500908041140g505e9a5csdffafb420b79b4ca@mail.gmail.com>
	<4A79A34B.8070102@gmail.com>
	<3d375d730908051214x4603fd7bve1864799b2dd4d5d@mail.gmail.com>
Message-ID: 

On Wed, Aug 5, 2009 at 12:14 PM, Robert Kern wrote:
> On Wed, Aug 5, 2009 at 14:11, Pierre GM wrote:
>>
>> And, er... masked arrays anyone?
>
> That was what I suggested. The very first response, even.
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>   -- Umberto Eco

Is the enigma in your sig enough to invoke Godwin's Law on this thread?

From d_l_goldsmith at yahoo.com Wed Aug 5 15:26:32 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 12:26:32 -0700 (PDT)
Subject: [Numpy-discussion] BOF c coders.
In-Reply-To: 
Message-ID: <890670.91013.qm@web52107.mail.re2.yahoo.com>

So far, no one's proposed a BoF I wouldn't be interested in attending.
:-) (except for the fact that at least some will have to overlap, yes?
:-( ).

DG

--- On Wed, 8/5/09, Charles R Harris wrote:

> From: Charles R Harris
> Subject: [Numpy-discussion] BOF c coders.
> To: "numpy-discussion"
> Date: Wednesday, August 5, 2009, 11:47 AM
> Hi All,
>
> At the present time David C. and I are doing most of the work in the
> numpy c code base. I am wondering if there are more people out there
> who might want to get involved in that end of things and if there are
> ways we can help them get started. If folks are interested we could
> have a BOF meeting at the SciPy conference.
>
> Chuck
>
> -----Inline Attachment Follows-----
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From geometrian at gmail.com Wed Aug 5 15:39:49 2009
From: geometrian at gmail.com (Ian Mallett)
Date: Wed, 5 Aug 2009 12:39:49 -0700
Subject: [Numpy-discussion] GPU Numpy
In-Reply-To: 
References: 
Message-ID: 

On Wed, Aug 5, 2009 at 11:34 AM, Charles R Harris wrote:

> It could be that you could slip in a small mod that would do what you
> want.

I'll help, if you want. I'm good with GPUs, and I'd appreciate the
numerical power it would afford.

> The main problems with using GPUs were that CUDA was only available for
> nvidia video cards and there didn't seem to be any hope for a CUDA version
> of LAPACK.

You don't have to use CUDA, although it would make it easier.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From pfeldman at verizon.net Wed Aug 5 15:57:43 2009
From: pfeldman at verizon.net (Dr. Phillip M. Feldman)
Date: Wed, 5 Aug 2009 12:57:43 -0700 (PDT)
Subject: [Numpy-discussion] maximum value and corresponding index
Message-ID: <24834930.post@talk.nabble.com>

With Python/NumPy, is there a way to get the maximum element of an array
and also the index of the element having that value, at a single shot?
(One can do this in Matlab via a statement like the following:
[x_max,ndx] = max(x).)
-- 
View this message in context: http://www.nabble.com/maximum-value-and-corresponding-index-tp24834930p24834930.html
Sent from the Numpy-discussion mailing list archive at Nabble.com.

From robert.kern at gmail.com Wed Aug 5 15:59:44 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 5 Aug 2009 14:59:44 -0500
Subject: [Numpy-discussion] maximum value and corresponding index
In-Reply-To: <24834930.post@talk.nabble.com>
References: <24834930.post@talk.nabble.com>
Message-ID: <3d375d730908051259u5fa67a68wdf9734f005148519@mail.gmail.com>

On Wed, Aug 5, 2009 at 14:57, Dr. Phillip M. Feldman wrote:
>
> With Python/NumPy, is there a way to get the maximum element of an array
> and also the index of the element having that value, at a single shot?

Not in one shot.

maxi = x.argmax()
maxv = x[maxi]

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From pfeldman at verizon.net Wed Aug 5 16:01:51 2009
From: pfeldman at verizon.net (Dr. Phillip M. Feldman)
Date: Wed, 5 Aug 2009 13:01:51 -0700 (PDT)
Subject: [Numpy-discussion] improved NumPy support for boolean arrays?
Message-ID: <24835199.post@talk.nabble.com>

Although I've used Matlab for many years and am quite new to Python, I'm
already convinced that the Python/NumPy combination is more powerful and
flexible than the Matlab base, and that it generally takes less Python
code to get the same job done. There is, however, at least one thing that
is much cleaner in Matlab -- operations on boolean arrays. If x and y are
numpy arrays of bools, I'd like to be able to create expressions like the
following:

not x (to invert each element of x)
x and y
x or y
x xor y
(not x) or y

The usual array broadcasting rules should apply. Is there any chance of
getting something like this into NumPy?
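(For reference, the replies that follow point out that elementwise
equivalents for all of these already exist; a minimal sketch, assuming
x and y are boolean numpy arrays:

import numpy as np

x = np.array([True, True, False, False])
y = np.array([True, False, True, False])

~x        # elementwise not -> [False False  True  True]
x & y     # elementwise and -> [ True False False False]
x | y     # elementwise or  -> [ True  True  True False]
x ^ y     # elementwise xor -> [False  True  True False]
(~x) | y  #                 -> [ True False  True  True]

The named ufuncs numpy.logical_not, logical_and, logical_or and
logical_xor behave the same way and broadcast in the usual fashion.)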
-- 
View this message in context: http://www.nabble.com/improved-NumPy-support-for-boolean-arrays--tp24835199p24835199.html
Sent from the Numpy-discussion mailing list archive at Nabble.com.

From robert.kern at gmail.com Wed Aug 5 16:04:10 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 5 Aug 2009 15:04:10 -0500
Subject: [Numpy-discussion] improved NumPy support for boolean arrays?
In-Reply-To: <24835199.post@talk.nabble.com>
References: <24835199.post@talk.nabble.com>
Message-ID: <3d375d730908051304r3d5a843g3e38005ba3e3824d@mail.gmail.com>

On Wed, Aug 5, 2009 at 15:01, Dr. Phillip M. Feldman wrote:
>
> Although I've used Matlab for many years and am quite new to Python, I'm
> already convinced that the Python/NumPy combination is more powerful and
> flexible than the Matlab base, and that it generally takes less Python
> code to get the same job done. There is, however, at least one thing that
> is much cleaner in Matlab -- operations on boolean arrays. If x and y are
> numpy arrays of bools, I'd like to be able to create expressions like the
> following:
>
> not x (to invert each element of x)

~x

> x and y

x & y

> x or y

x | y

> x xor y

x ^ y

> (not x) or y

(~x) | y

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From d_l_goldsmith at yahoo.com Wed Aug 5 16:06:03 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 13:06:03 -0700 (PDT)
Subject: [Numpy-discussion] maximum value and corresponding index
In-Reply-To: <3d375d730908051259u5fa67a68wdf9734f005148519@mail.gmail.com>
Message-ID: <658036.17782.qm@web52108.mail.re2.yahoo.com>

But you can "cheat" and put them on one line (if that's all you're after):

>>> x = np.array([1, 2, 3])
>>> maxi = x.argmax(); maxv = x[maxi]
>>> maxi, maxv
(2, 3)

DG

--- On Wed, 8/5/09, Robert Kern wrote:

> From: Robert Kern
> Subject: Re: [Numpy-discussion] maximum value and corresponding index
> To: "Discussion of Numerical Python"
> Date: Wednesday, August 5, 2009, 12:59 PM
> On Wed, Aug 5, 2009 at 14:57, Dr. Phillip M. Feldman wrote:
> >
> > With Python/NumPy, is there a way to get the maximum element of an
> > array and also the index of the element having that value, at a
> > single shot?
>
> Not in one shot.
>
> maxi = x.argmax()
> maxv = x[maxi]
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>   -- Umberto Eco
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From josef.pktd at gmail.com Wed Aug 5 16:09:25 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 5 Aug 2009 16:09:25 -0400
Subject: [Numpy-discussion] improved NumPy support for boolean arrays?
In-Reply-To: <3d375d730908051304r3d5a843g3e38005ba3e3824d@mail.gmail.com>
References: <24835199.post@talk.nabble.com>
	<3d375d730908051304r3d5a843g3e38005ba3e3824d@mail.gmail.com>
Message-ID: <1cd32cbb0908051309h2c78f746o5fb90e23f85c029b@mail.gmail.com>

On Wed, Aug 5, 2009 at 4:04 PM, Robert Kern wrote:
> On Wed, Aug 5, 2009 at 15:01, Dr. Phillip M.
> Feldman wrote:
>>
>> Although I've used Matlab for many years and am quite new to Python, I'm
>> already convinced that the Python/NumPy combination is more powerful and
>> flexible than the Matlab base, and that it generally takes less Python code
>> to get the same job done. There is, however, at least one thing that is much
>> cleaner in Matlab -- operations on boolean arrays. If x and y are numpy
>> arrays of bools, I'd like to be able to create expressions like the
>> following:
>>
>> not x (to invert each element of x)
>
> ~x
>
>> x and y
>
> x & y
>
>> x or y
>
> x | y
>
>> x xor y
>
> x ^ y
>
>> (not x) or y
>
> (~x) | y

See also logical_and, logical_or, logical_not, logical_xor

Josef

>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>  -- Umberto Eco
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From sturla at molden.no  Wed Aug  5 17:02:04 2009
From: sturla at molden.no (Sturla Molden)
Date: Wed, 05 Aug 2009 23:02:04 +0200
Subject: [Numpy-discussion] improved NumPy support for boolean arrays?
In-Reply-To: <24835199.post@talk.nabble.com>
References: <24835199.post@talk.nabble.com>
Message-ID: <4A79F34C.6010802@molden.no>

> If x and y are numpy
> arrays of bools, I'd like to be able to create expressions like the
> following:
>
> not x (to invert each element of x)
> x and y
> x or y
> x xor y
> (not x) or y
>
> The usual array broadcasting rules should apply. Is there any chance of
> getting something like this into NumPy?

There is a reason for this related to Python. In Python an object will
often have a boolean truth value. How would you cast an ndarray to bool?
If you write something like (x and y), the Python interpreter expects
this to evaluate to True or False. Thus it cannot evaluate to an ndarray
of booleans. NumPy cannot change the syntax of Python.

Another thing: An empty list evaluates to False in a boolean context,
whereas a non-empty list evaluates to True. ndarrays behave differently.
Why?

Sturla Molden

From robert.kern at gmail.com  Wed Aug  5 17:11:39 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 5 Aug 2009 16:11:39 -0500
Subject: [Numpy-discussion] improved NumPy support for boolean arrays?
In-Reply-To: <4A79F34C.6010802@molden.no>
References: <24835199.post@talk.nabble.com>
	<4A79F34C.6010802@molden.no>
Message-ID: <3d375d730908051411u3bbf9522u74ef71a287b09615@mail.gmail.com>

On Wed, Aug 5, 2009 at 16:02, Sturla Molden wrote:
>
>> If x and y are numpy
>> arrays of bools, I'd like to be able to create expressions like the
>> following:
>>
>> not x (to invert each element of x)
>> x and y
>> x or y
>> x xor y
>> (not x) or y
>>
>> The usual array broadcasting rules should apply. Is there any chance of
>> getting something like this into NumPy?
> There is a reason for this related to Python. In Python an object will
> often have a boolean truth value. How would you cast an ndarray to bool?
> If you write something like (x and y), the Python interpreter expects
> this to evaluate to True or False. Thus it cannot evaluate to an ndarray
> of booleans. NumPy cannot change the syntax of Python.
>
> Another thing: An empty list evaluates to False in a boolean context,
> whereas a non-empty list evaluates to True. ndarrays behave differently.
> Why?
Numeric used to evaluate bool(some_array) as True if any of the elements
were nonzero and False if all of them were zero. This confused some people
who expected bool(some_array) to be True iff *all* of the elements were
nonzero and False otherwise. People had bugs in their code for years
without realizing it. They would try one example, get their expected
result, and not test the other corner cases that would demonstrate that
their mental model of what was going on was incorrect.

By the time that numarray was being designed, the numarray team decided to
make array always raise an exception instead of returning any truth value.
numpy followed this decision.

There really aren't many use cases for following the list object's
semantics with arrays. Empty arrays aren't nearly as common as empty lists
or even tuples. I know of no case where it is useful to test specifically
for emptiness versus non-emptiness. In any case, directly checking the
.size or .shape attributes would be sufficient and far more clear because
there are other plausible interpretations of bool(some_array) like
Numeric's.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco

From trevor at notcows.com  Wed Aug  5 17:20:05 2009
From: trevor at notcows.com (Trevor Clarke)
Date: Wed, 5 Aug 2009 17:20:05 -0400
Subject: [Numpy-discussion] GPU Numpy
In-Reply-To:
References:
Message-ID: <7bde5d400908051420i75a29fdexe923b104f428449e@mail.gmail.com>

With OpenCL implementations making their way into the wild, that's
probably a better target than CUDA.

On Wed, Aug 5, 2009 at 3:39 PM, Ian Mallett wrote:
> On Wed, Aug 5, 2009 at 11:34 AM, Charles R Harris <
> charlesr.harris at gmail.com> wrote:
>
>> It could be you could slip in a small mod that would do what you want.
>
> I'll help, if you want. I'm good with GPUs, and I'd appreciate the
> numerical power it would afford.
>
>> The main problems with using GPUs were that CUDA was only available for
>> nvidia video cards and there didn't seem to be any hope for a CUDA version
>> of LAPACK.
>
> You don't have to use CUDA, although it would make it easier.
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From dwf at cs.toronto.edu  Wed Aug  5 18:13:59 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Wed, 5 Aug 2009 18:13:59 -0400
Subject: [Numpy-discussion] GPU Numpy
In-Reply-To:
References:
Message-ID: <351A1A80-FC2D-4410-95B0-955121A74CA5@cs.toronto.edu>

A friend of mine wrote a simple wrapper around CUBLAS using ctypes that
basically exposes a Python class that keeps a 2D array of single-precision
floats on the GPU for you, and lets you operate on it from Python.
I keep telling him to release it, but he thinks it's too hackish.

It did inspire some of our colleagues in Montreal to create this, though:

http://code.google.com/p/cuda-ndarray/

I gather it is VERY early in development, but I'm sure they'd love
contributions!

David

On 5-Aug-09, at 6:45 AM, Romain Brette wrote:
> Hi everyone,
>
> I was wondering if you had any plan to incorporate some GPU support
> to numpy, or perhaps as a separate module.
> What I have in mind is something that would mimick the syntax of numpy
> arrays, with a new dtype (gpufloat), like this:
>
> from gpunumpy import *
> x=zeros(100,dtype='gpufloat') # Creates an array of 100 elements on the GPU
> y=ones(100,dtype='gpufloat')
> z=exp(2*x+y) # z is on the GPU, all operations on GPU with no transfer
> z_cpu=array(z,dtype='float') # z is copied to the CPU
> i=(z>2.3).nonzero()[0] # operation on GPU, returns a CPU integer array
>
> There is a library named GPULib (http://www.txcorp.com/products/GPULib/)
> that does similar things, but unfortunately they don't support Python (I
> think their main Python developer left).
> I think this would be very useful for many people. For our project (a
> neural network simulator, http://www.briansimulator.org) we use PyCuda
> (http://mathema.tician.de/software/pycuda)

Neat project, though at first I was sure that was a typo :)

"He can't be simulating Brians...."

- David

From fperez.net at gmail.com  Wed Aug  5 18:20:06 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Wed, 5 Aug 2009 15:20:06 -0700
Subject: [Numpy-discussion] Wiki page: food options at the SciPy'09 conference
Message-ID:

Hi all,

this is a message mostly for those attending the conference who know
Caltech and its surroundings well. We've created a page to list
easy-to-access food options from the campus, but I don't really know
what to put there. Anyone who has some knowledge of local options is
welcome to add to this wiki page, and will earn the gratitude of all
attendees:

http://conference.scipy.org/food

Thanks!

Cheers,

f

From olivier.grisel at ensta.org  Wed Aug  5 19:42:46 2009
From: olivier.grisel at ensta.org (Olivier Grisel)
Date: Thu, 6 Aug 2009 01:42:46 +0200
Subject: [Numpy-discussion] GPU Numpy
In-Reply-To: <351A1A80-FC2D-4410-95B0-955121A74CA5@cs.toronto.edu>
References: <351A1A80-FC2D-4410-95B0-955121A74CA5@cs.toronto.edu>
Message-ID:

OpenCL is definitely the way to go for a cross platform solution, with
both nvidia and AMD having released beta runtimes to their respective
developer networks (free as in beer subscription required for the beta
download pages). Final public releases are to be expected around 2009 Q3.

OpenCL is an open, royalty free, standardized API and runtime
specification for heterogeneous platforms with a mix of CPU and GPU
cores. The nvidia implementation is based on the CUDA runtime, and
programming OpenCL is very similar to programming in C for CUDA. The
developer of PyCUDA is also working on PyOpenCL:

http://pypi.python.org/pypi/pyopencl/

Both nvidia and AMD use llvm to compile the OpenCL cross-platform kernel
sources into device specific binaries loaded at runtime.

Official OpenCL specs:
http://www.khronos.org/registry/cl/specs/opencl-1.0.29.pdf
Wikipedia page: http://en.wikipedia.org/wiki/OpenCL
nvidia runtime: http://www.nvidia.com/object/cuda_opencl.html
AMD runtime (only working with x86 and x86_64 with SSE3 for now):
http://developer.amd.com/GPU/ATISTREAMSDKBETAPROGRAM/Pages/default.aspx

Intel and IBM were also members of the standards committee, so we can
reasonably expect runtimes for their chips in the future (e.g. Larrabee
and Cell BE).
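To give a feel for the programming model, here is roughly what a trivial
elementwise kernel looks like through PyOpenCL. This is an untested sketch
in the style of the project's examples; the exact call signatures may
differ between the beta releases:

import numpy as np
import pyopencl as cl

a = np.random.rand(50000).astype(np.float32)

ctx = cl.create_some_context()   # picks whatever device the runtime offers
queue = cl.CommandQueue(ctx)
mf = cl.mem_flags
a_buf = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=a)
out_buf = cl.Buffer(ctx, mf.WRITE_ONLY, a.nbytes)

# The kernel source is compiled at runtime, for the device at hand.
prg = cl.Program(ctx, """
__kernel void twice(__global const float *a, __global float *out)
{
    int gid = get_global_id(0);
    out[gid] = 2.0f * a[gid];
}
""").build()

prg.twice(queue, a.shape, None, a_buf, out_buf)  # None: runtime picks group size
result = np.empty_like(a)
cl.enqueue_read_buffer(queue, out_buf, result).wait()
print np.allclose(result, 2 * a)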
--
Olivier
http://twitter.com/ogrisel - http://code.oliviergrisel.name

From david at ar.media.kyoto-u.ac.jp  Wed Aug  5 22:32:53 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 06 Aug 2009 11:32:53 +0900
Subject: [Numpy-discussion] GPU Numpy
In-Reply-To:
References: <351A1A80-FC2D-4410-95B0-955121A74CA5@cs.toronto.edu>
Message-ID: <4A7A40D5.7050308@ar.media.kyoto-u.ac.jp>

Olivier Grisel wrote:
> OpenCL is definitely the way to go for a cross platform solution, with
> both nvidia and AMD having released beta runtimes to their respective
> developer networks (free as in beer subscription required for the beta
> download pages). Final public releases are to be expected around 2009 Q3.
>

What's the status of opencl on windows ? Will MS have its own direct-x
specific implementation ?

cheers,

David

From olivier.grisel at ensta.org  Thu Aug  6 01:44:16 2009
From: olivier.grisel at ensta.org (Olivier Grisel)
Date: Thu, 6 Aug 2009 07:44:16 +0200
Subject: [Numpy-discussion] GPU Numpy
In-Reply-To: <4A7A40D5.7050308@ar.media.kyoto-u.ac.jp>
References: <351A1A80-FC2D-4410-95B0-955121A74CA5@cs.toronto.edu>
	<4A7A40D5.7050308@ar.media.kyoto-u.ac.jp>
Message-ID:

2009/8/6 David Cournapeau :
> Olivier Grisel wrote:
>> OpenCL is definitely the way to go for a cross platform solution, with
>> both nvidia and AMD having released beta runtimes to their respective
>> developer networks (free as in beer subscription required for the beta
>> download pages). Final public releases are to be expected around 2009 Q3.
>>
>
> What's the status of opencl on windows ? Will MS have its own direct-x
> specific implementation ?

As usual, MS reinvents the wheel with DirectX Compute, but vendors such
as AMD and nvidia propose both the OpenCL API + runtime binaries for
windows and their DirectX Compute counterpart, based on mostly the
same underlying implementation, e.g. CUDA in nvidia's case.

--
Olivier
http://twitter.com/ogrisel - http://code.oliviergrisel.name

From sturla at molden.no  Thu Aug  6 03:32:25 2009
From: sturla at molden.no (Sturla Molden)
Date: Thu, 06 Aug 2009 09:32:25 +0200
Subject: [Numpy-discussion] GPU Numpy
In-Reply-To:
References: <351A1A80-FC2D-4410-95B0-955121A74CA5@cs.toronto.edu>
	<4A7A40D5.7050308@ar.media.kyoto-u.ac.jp>
Message-ID: <4A7A8709.1070306@molden.no>

Olivier Grisel wrote:
> As usual, MS reinvents the wheel with DirectX Compute, but vendors such
> as AMD and nvidia propose both the OpenCL API + runtime binaries for
> windows and their DirectX Compute counterpart, based on mostly the
> same underlying implementation, e.g. CUDA in nvidia's case.
>

Here is a DirectX Compute tutorial I found:

http://www.gamedev.net/community/forums/topic.asp?topic_id=516043

It pretty much says all we need to know. I am not investing any of my
time learning that shitty API. Period. Let's just hope OpenCL makes it
to Windows without Microsoft breaking it for "security reasons" (as they
did with OpenGL).
Sturla

From meine at informatik.uni-hamburg.de  Thu Aug  6 04:21:58 2009
From: meine at informatik.uni-hamburg.de (Hans Meine)
Date: Thu, 6 Aug 2009 10:21:58 +0200
Subject: [Numpy-discussion] maximum value and corresponding index
In-Reply-To: <658036.17782.qm@web52108.mail.re2.yahoo.com>
References: <658036.17782.qm@web52108.mail.re2.yahoo.com>
Message-ID: <200908061021.58820.meine@informatik.uni-hamburg.de>

On Wednesday 05 August 2009 22:06:03 David Goldsmith wrote:
> But you can "cheat" and put them on one line (if that's all you're after):
> >>> x = np.array([1, 2, 3])
> >>> maxi = x.argmax(); maxv = x[maxi]

Is there any reason not to put this as a convenience function into numpy?
It is needed so frequently, and it's a shame that the shortest solution
traverses the array twice (the above is usually longer due to variable names,
and/or >1 dimensions).

def give_me_a_good_name(array):
    pos = array.argmax()
    val = array[unravel_index(pos, array.shape)]
    return pos, val

The name should not be too long, maybe "findmax"?  I don't see good
predecessors to learn from ATM.

OTOH, a minmax() would be more pressing, since there is no shortcut yet
AFAICS.

Ciao,
  Hans

From romain.brette at ens.fr  Thu Aug  6 04:32:31 2009
From: romain.brette at ens.fr (Romain Brette)
Date: Thu, 6 Aug 2009 08:32:31 +0000 (UTC)
Subject: [Numpy-discussion] GPU Numpy
References:
Message-ID:

Charles R Harris <charlesr.harris at gmail.com> writes:
>
> What sort of functionality are you looking for? It could be you could slip
> in a small mod that would do what you want. In the larger picture, the use
> of GPUs has been discussed on the list several times going back at least a
> year. The main problems with using GPUs were that CUDA was only available
> for nvidia video cards and there didn't seem to be any hope for a CUDA
> version of LAPACK. Chuck
>

So for our project what we need is:
* element-wise operations on vectors (arithmetical, exp/log, exponentiation)
* same but on views (x[2:7])
* assignment (x[:]=2*y)
* boolean operations on vectors (x>2.5) and the nonzero() method
* possibly, multiplying a N*M matrix by an M*M matrix, where N is large and M
is small (but this could be done with vector operations).
* random number generation would be great too (gpurand(N))

What is very important to me is that the syntax be the same as with normal
arrays, so that you could easily switch the GPU on/off (depending on whether a
GPU was detected).

Cheers
Romain

From romain.brette at ens.fr  Thu Aug  6 04:39:13 2009
From: romain.brette at ens.fr (Romain Brette)
Date: Thu, 6 Aug 2009 08:39:13 +0000 (UTC)
Subject: [Numpy-discussion] GPU Numpy
References:
Message-ID:

Ian Mallett <geometrian at gmail.com> writes:
>
> On Wed, Aug 5, 2009 at 11:34 AM, Charles R Harris
> <charlesr.harris at gmail.com> wrote:
>
> It could be you could slip in a small mod that would do what you want.
> I'll help, if you want. I'm good with GPUs, and I'd appreciate the
> numerical power it would afford.

That would be great actually if we could gather a little team!
Anyone else interested?
As Trevor said, OpenCL could be a better choice than Cuda.
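To make the on/off switching concrete, I picture something as simple as
this at import time (gpunumpy is hypothetical, of course):

try:
    from gpunumpy import *   # hypothetical drop-in module, same interface
    have_gpu = True
except ImportError:
    from numpy import *      # no GPU or no driver: fall back silently
    have_gpu = False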
By the way, there is a Matlab toolbox that seems to have similar functionality: http://www.accelereyes.com/ Cheers Romain From romain.brette at ens.fr Thu Aug 6 04:43:52 2009 From: romain.brette at ens.fr (Romain Brette) Date: Thu, 6 Aug 2009 08:43:52 +0000 (UTC) Subject: [Numpy-discussion] GPU Numpy References: <351A1A80-FC2D-4410-95B0-955121A74CA5@cs.toronto.edu> Message-ID: David Warde-Farley cs.toronto.edu> writes: > It did inspire some of our colleagues in Montreal to create this, > though: > > http://code.google.com/p/cuda-ndarray/ > > I gather it is VERY early in development, but I'm sure they'd love > contributions! > Hi David, That does look quite close to what I imagined, probably a good start then! Romain From robert.kern at gmail.com Thu Aug 6 11:27:51 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 10:27:51 -0500 Subject: [Numpy-discussion] maximum value and corresponding index In-Reply-To: <200908061021.58820.meine@informatik.uni-hamburg.de> References: <658036.17782.qm@web52108.mail.re2.yahoo.com> <200908061021.58820.meine@informatik.uni-hamburg.de> Message-ID: <3d375d730908060827t4fdf5a60h1675d2d70b175535@mail.gmail.com> 2009/8/6 Hans Meine : > On Wednesday 05 August 2009 22:06:03 David Goldsmith wrote: >> But you can "cheat" and put them on one line (if that's all you're after): >> >>> x = np.array([1, 2, 3]) >> >>> maxi = x.argmax(); maxv = x[maxi] > > Is there any reason not to put this as a convenience function into numpy? > It is needed so frequently, and it's a shame that the shortest solution > traverses the array twice (the above is usually longer due to variable names, > and/or >1 dimensions). The array is only traversed once. Indexing is O(1). I'm -1 on adding a function to do this. If you want to keep that convenience function in your own utilities, great. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From josef.pktd at gmail.com Thu Aug 6 11:55:58 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 6 Aug 2009 11:55:58 -0400 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) Message-ID: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> What's the best way of getting back the correct shape to be able to broadcast, mean, min,.. to the original array, that works for arbitrary dimension and axis? I thought I have seen some helper functions, but I don't find them anymore? Josef >>> a array([[1, 2, 3, 3, 0], [2, 2, 3, 2, 1]]) >>> a-a.max(0) array([[-1, 0, 0, 0, -1], [ 0, 0, 0, -1, 0]]) >>> a-a.max(1) Traceback (most recent call last): File "", line 1, in a-a.max(1) ValueError: shape mismatch: objects cannot be broadcast to a single shape >>> a-a.max(1)[:,None] array([[-2, -1, 0, 0, -3], [-1, -1, 0, -1, -2]]) From kwgoodman at gmail.com Thu Aug 6 12:03:46 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Thu, 6 Aug 2009 09:03:46 -0700 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) In-Reply-To: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> Message-ID: On Thu, Aug 6, 2009 at 8:55 AM, wrote: > What's the best way of getting back the correct shape to be able to > broadcast, mean, min,.. to the original array, that works for > arbitrary dimension and axis? 
> > I thought I have seen some helper functions, but I don't find them anymore? > > Josef > >>>> a > array([[1, 2, 3, 3, 0], > ? ? ? [2, 2, 3, 2, 1]]) >>>> a-a.max(0) > array([[-1, ?0, ?0, ?0, -1], > ? ? ? [ 0, ?0, ?0, -1, ?0]]) >>>> a-a.max(1) > Traceback (most recent call last): > ?File "", line 1, in > ? ?a-a.max(1) > ValueError: shape mismatch: objects cannot be broadcast to a single shape >>>> a-a.max(1)[:,None] > array([[-2, -1, ?0, ?0, -3], > ? ? ? [-1, -1, ?0, -1, -2]]) Would this do it? >> pylab.demean?? Type: function Base Class: String Form: Namespace: Interactive File: /usr/lib/python2.6/dist-packages/matplotlib/mlab.py Definition: pylab.demean(x, axis=0) Source: def demean(x, axis=0): "Return x minus its mean along the specified axis" x = np.asarray(x) if axis: ind = [slice(None)] * axis ind.append(np.newaxis) return x - x.mean(axis)[ind] return x - x.mean(axis) From meine at informatik.uni-hamburg.de Thu Aug 6 12:05:36 2009 From: meine at informatik.uni-hamburg.de (Hans Meine) Date: Thu, 6 Aug 2009 18:05:36 +0200 Subject: [Numpy-discussion] maximum value and corresponding index In-Reply-To: <3d375d730908060827t4fdf5a60h1675d2d70b175535@mail.gmail.com> References: <658036.17782.qm@web52108.mail.re2.yahoo.com> <200908061021.58820.meine@informatik.uni-hamburg.de> <3d375d730908060827t4fdf5a60h1675d2d70b175535@mail.gmail.com> Message-ID: <200908061805.36420.meine@informatik.uni-hamburg.de> On Thursday 06 August 2009 17:27:51 Robert Kern wrote: > 2009/8/6 Hans Meine : > > On Wednesday 05 August 2009 22:06:03 David Goldsmith wrote: > >> But you can "cheat" and put them on one line (if that's all you're after): > >> >>> x = np.array([1, 2, 3]) > >> >>> maxi = x.argmax(); maxv = x[maxi] > > > > Is there any reason not to put this as a convenience function into numpy? > > It is needed so frequently, and it's a shame that the shortest solution > > traverses the array twice [...] > > The array is only traversed once. Indexing is O(1). Yes, but I wrote "the shortest solution", which is to call .argmax() and .max(), since.. > > the above is usually longer due to variable names, and/or >1 dimensions which requires unravel_index() (note that flattening is also inefficient when the array is strided). Never mind, Hans From robert.kern at gmail.com Thu Aug 6 12:07:14 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 11:07:14 -0500 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) In-Reply-To: References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> Message-ID: <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> On Thu, Aug 6, 2009 at 11:03, Keith Goodman wrote: > On Thu, Aug 6, 2009 at 8:55 AM, wrote: >> What's the best way of getting back the correct shape to be able to >> broadcast, mean, min,.. to the original array, that works for >> arbitrary dimension and axis? >> >> I thought I have seen some helper functions, but I don't find them anymore? >> >> Josef >> >>>>> a >> array([[1, 2, 3, 3, 0], >> ? ? ? [2, 2, 3, 2, 1]]) >>>>> a-a.max(0) >> array([[-1, ?0, ?0, ?0, -1], >> ? ? ? [ 0, ?0, ?0, -1, ?0]]) >>>>> a-a.max(1) >> Traceback (most recent call last): >> ?File "", line 1, in >> ? ?a-a.max(1) >> ValueError: shape mismatch: objects cannot be broadcast to a single shape >>>>> a-a.max(1)[:,None] >> array([[-2, -1, ?0, ?0, -3], >> ? ? ? [-1, -1, ?0, -1, -2]]) > > Would this do it? > >>> pylab.demean?? > Type: ? ? ? ? ? function > Base Class: ? ? > String Form: ? ? > Namespace: ? ? ?Interactive > File: ? ? ? ? ? 
/usr/lib/python2.6/dist-packages/matplotlib/mlab.py > Definition: ? ? pylab.demean(x, axis=0) > Source: > def demean(x, axis=0): > ? ?"Return x minus its mean along the specified axis" > ? ?x = np.asarray(x) > ? ?if axis: > ? ? ? ?ind = [slice(None)] * axis > ? ? ? ?ind.append(np.newaxis) > ? ? ? ?return x - x.mean(axis)[ind] > ? ?return x - x.mean(axis) Ouch! That doesn't handle axis=-1. if axis != 0: ind = [slice(None)] * x.ndim ind[axis] = np.newaxis -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From kwgoodman at gmail.com Thu Aug 6 12:15:27 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Thu, 6 Aug 2009 09:15:27 -0700 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) In-Reply-To: <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> Message-ID: thanksOn Thu, Aug 6, 2009 at 9:07 AM, Robert Kern wrote: > On Thu, Aug 6, 2009 at 11:03, Keith Goodman wrote: >> On Thu, Aug 6, 2009 at 8:55 AM, wrote: >>> What's the best way of getting back the correct shape to be able to >>> broadcast, mean, min,.. to the original array, that works for >>> arbitrary dimension and axis? >>> >>> I thought I have seen some helper functions, but I don't find them anymore? >>> >>> Josef >>> >>>>>> a >>> array([[1, 2, 3, 3, 0], >>> ? ? ? [2, 2, 3, 2, 1]]) >>>>>> a-a.max(0) >>> array([[-1, ?0, ?0, ?0, -1], >>> ? ? ? [ 0, ?0, ?0, -1, ?0]]) >>>>>> a-a.max(1) >>> Traceback (most recent call last): >>> ?File "", line 1, in >>> ? ?a-a.max(1) >>> ValueError: shape mismatch: objects cannot be broadcast to a single shape >>>>>> a-a.max(1)[:,None] >>> array([[-2, -1, ?0, ?0, -3], >>> ? ? ? [-1, -1, ?0, -1, -2]]) >> >> Would this do it? >> >>>> pylab.demean?? >> Type: ? ? ? ? ? function >> Base Class: ? ? >> String Form: ? ? >> Namespace: ? ? ?Interactive >> File: ? ? ? ? ? /usr/lib/python2.6/dist-packages/matplotlib/mlab.py >> Definition: ? ? pylab.demean(x, axis=0) >> Source: >> def demean(x, axis=0): >> ? ?"Return x minus its mean along the specified axis" >> ? ?x = np.asarray(x) >> ? ?if axis: >> ? ? ? ?ind = [slice(None)] * axis >> ? ? ? ?ind.append(np.newaxis) >> ? ? ? ?return x - x.mean(axis)[ind] >> ? ?return x - x.mean(axis) > > Ouch! That doesn't handle axis=-1. > > if axis != 0: > ? ?ind = [slice(None)] * x.ndim > ? ?ind[axis] = np.newaxis Hey, didn't you warn us about the dangers of "if arr" the other day? From robert.kern at gmail.com Thu Aug 6 12:18:59 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 11:18:59 -0500 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) In-Reply-To: References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> Message-ID: <3d375d730908060918h209267bfua88ba499df1d5dcb@mail.gmail.com> On Thu, Aug 6, 2009 at 11:15, Keith Goodman wrote: > thanksOn Thu, Aug 6, 2009 at 9:07 AM, Robert Kern wrote: >> On Thu, Aug 6, 2009 at 11:03, Keith Goodman wrote: >>>>> pylab.demean?? >>> Type: ? ? ? ? ? function >>> Base Class: ? ? >>> String Form: ? ? >>> Namespace: ? ? ?Interactive >>> File: ? ? ? ? ? /usr/lib/python2.6/dist-packages/matplotlib/mlab.py >>> Definition: ? ? pylab.demean(x, axis=0) >>> Source: >>> def demean(x, axis=0): >>> ? 
?"Return x minus its mean along the specified axis" >>> ? ?x = np.asarray(x) >>> ? ?if axis: >>> ? ? ? ?ind = [slice(None)] * axis >>> ? ? ? ?ind.append(np.newaxis) >>> ? ? ? ?return x - x.mean(axis)[ind] >>> ? ?return x - x.mean(axis) >> >> Ouch! That doesn't handle axis=-1. >> >> if axis != 0: >> ? ?ind = [slice(None)] * x.ndim >> ? ?ind[axis] = np.newaxis > > Hey, didn't you warn us about the dangers of "if arr" the other day? Yes, but actually that wasn't quite the problem. "if axis:" would have been fine, if a bit obscure, as long as the body was fixed to not expect axis>0. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From josef.pktd at gmail.com Thu Aug 6 12:21:26 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 6 Aug 2009 12:21:26 -0400 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) In-Reply-To: <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> Message-ID: <1cd32cbb0908060921j57c6006dl789293b8d69ac62d@mail.gmail.com> On Thu, Aug 6, 2009 at 12:07 PM, Robert Kern wrote: > On Thu, Aug 6, 2009 at 11:03, Keith Goodman wrote: >> On Thu, Aug 6, 2009 at 8:55 AM, wrote: >>> What's the best way of getting back the correct shape to be able to >>> broadcast, mean, min,.. to the original array, that works for >>> arbitrary dimension and axis? >>> >>> I thought I have seen some helper functions, but I don't find them anymore? >>> >>> Josef >>> >>>>>> a >>> array([[1, 2, 3, 3, 0], >>> ? ? ? [2, 2, 3, 2, 1]]) >>>>>> a-a.max(0) >>> array([[-1, ?0, ?0, ?0, -1], >>> ? ? ? [ 0, ?0, ?0, -1, ?0]]) >>>>>> a-a.max(1) >>> Traceback (most recent call last): >>> ?File "", line 1, in >>> ? ?a-a.max(1) >>> ValueError: shape mismatch: objects cannot be broadcast to a single shape >>>>>> a-a.max(1)[:,None] >>> array([[-2, -1, ?0, ?0, -3], >>> ? ? ? [-1, -1, ?0, -1, -2]]) >> >> Would this do it? >> >>>> pylab.demean?? >> Type: ? ? ? ? ? function >> Base Class: ? ? >> String Form: ? ? >> Namespace: ? ? ?Interactive >> File: ? ? ? ? ? /usr/lib/python2.6/dist-packages/matplotlib/mlab.py >> Definition: ? ? pylab.demean(x, axis=0) >> Source: >> def demean(x, axis=0): >> ? ?"Return x minus its mean along the specified axis" >> ? ?x = np.asarray(x) >> ? ?if axis: >> ? ? ? ?ind = [slice(None)] * axis >> ? ? ? ?ind.append(np.newaxis) >> ? ? ? ?return x - x.mean(axis)[ind] >> ? ?return x - x.mean(axis) > > Ouch! That doesn't handle axis=-1. > > if axis != 0: > ? ?ind = [slice(None)] * x.ndim > ? ?ind[axis] = np.newaxis > Thanks, that's it. I have seen implementation of helper functions similar to this in other packages, but I thought there is already something in numpy. I think this should be a simple helper function in numpy to avoid mistakes and complicated implementation like the one in stats.nanstd even if it's only a few lines. Josef > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." 
> ?-- Umberto Eco > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From kwgoodman at gmail.com Thu Aug 6 12:22:55 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Thu, 6 Aug 2009 09:22:55 -0700 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) In-Reply-To: <3d375d730908060918h209267bfua88ba499df1d5dcb@mail.gmail.com> References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> <3d375d730908060918h209267bfua88ba499df1d5dcb@mail.gmail.com> Message-ID: On Thu, Aug 6, 2009 at 9:18 AM, Robert Kern wrote: > On Thu, Aug 6, 2009 at 11:15, Keith Goodman wrote: >> thanksOn Thu, Aug 6, 2009 at 9:07 AM, Robert Kern wrote: >>> On Thu, Aug 6, 2009 at 11:03, Keith Goodman wrote: > >>>>>> pylab.demean?? >>>> Type: ? ? ? ? ? function >>>> Base Class: ? ? >>>> String Form: ? ? >>>> Namespace: ? ? ?Interactive >>>> File: ? ? ? ? ? /usr/lib/python2.6/dist-packages/matplotlib/mlab.py >>>> Definition: ? ? pylab.demean(x, axis=0) >>>> Source: >>>> def demean(x, axis=0): >>>> ? ?"Return x minus its mean along the specified axis" >>>> ? ?x = np.asarray(x) >>>> ? ?if axis: >>>> ? ? ? ?ind = [slice(None)] * axis >>>> ? ? ? ?ind.append(np.newaxis) >>>> ? ? ? ?return x - x.mean(axis)[ind] >>>> ? ?return x - x.mean(axis) >>> >>> Ouch! That doesn't handle axis=-1. >>> >>> if axis != 0: >>> ? ?ind = [slice(None)] * x.ndim >>> ? ?ind[axis] = np.newaxis >> >> Hey, didn't you warn us about the dangers of "if arr" the other day? > > Yes, but actually that wasn't quite the problem. "if axis:" would have > been fine, if a bit obscure, as long as the body was fixed to not > expect axis>0. Oh, of course. Thanks for pointing that out. Now I can fix a bug in my own code. From pgmdevlist at gmail.com Thu Aug 6 12:27:09 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 6 Aug 2009 12:27:09 -0400 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) In-Reply-To: References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> <3d375d730908060918h209267bfua88ba499df1d5dcb@mail.gmail.com> Message-ID: <1939A33F-9AC4-4137-B331-6CBDBB36F072@gmail.com> On Aug 6, 2009, at 12:22 PM, Keith Goodman wrote: > On Thu, Aug 6, 2009 at 9:18 AM, Robert Kern > wrote: >> On Thu, Aug 6, 2009 at 11:15, Keith Goodman >> wrote: >>> thanksOn Thu, Aug 6, 2009 at 9:07 AM, Robert Kern>> > wrote: >>>> On Thu, Aug 6, 2009 at 11:03, Keith Goodman >>>> wrote: >> >>>>>>> pylab.demean?? >>>>> Type: function >>>>> Base Class: >>>>> String Form: >>>>> Namespace: Interactive >>>>> File: /usr/lib/python2.6/dist-packages/matplotlib/ >>>>> mlab.py >>>>> Definition: pylab.demean(x, axis=0) >>>>> Source: >>>>> def demean(x, axis=0): >>>>> "Return x minus its mean along the specified axis" >>>>> x = np.asarray(x) >>>>> if axis: >>>>> ind = [slice(None)] * axis >>>>> ind.append(np.newaxis) >>>>> return x - x.mean(axis)[ind] >>>>> return x - x.mean(axis) FYI, there's a "anom" method for MaskedArrays that does the same thing as demean (if you can't /don't want to import mpl) From robert.kern at gmail.com Thu Aug 6 12:26:51 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 11:26:51 -0500 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) 
In-Reply-To: <1cd32cbb0908060921j57c6006dl789293b8d69ac62d@mail.gmail.com> References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> <1cd32cbb0908060921j57c6006dl789293b8d69ac62d@mail.gmail.com> Message-ID: <3d375d730908060926x6523df1bl7163bf9a1475b8b9@mail.gmail.com> On Thu, Aug 6, 2009 at 11:21, wrote: > On Thu, Aug 6, 2009 at 12:07 PM, Robert Kern wrote: >> if axis != 0: >> ? ?ind = [slice(None)] * x.ndim >> ? ?ind[axis] = np.newaxis >> > > Thanks, that's it. > > I have seen implementation of helper functions similar to this in > other packages, but I thought there is already something in numpy. ?I > think this should be a simple helper function in numpy to avoid > mistakes and complicated implementation like the one in stats.nanstd > even if it's only a few lines. It would make a good contribution to numpy.lib.index_tricks. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From charlesr.harris at gmail.com Thu Aug 6 12:58:06 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 6 Aug 2009 10:58:06 -0600 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) In-Reply-To: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> Message-ID: On Thu, Aug 6, 2009 at 9:55 AM, wrote: > What's the best way of getting back the correct shape to be able to > broadcast, mean, min,.. to the original array, that works for > arbitrary dimension and axis? > > I thought I have seen some helper functions, but I don't find them anymore? > Adding a keyword to retain the number of dimensions has been mooted. It shouldn't be too difficult to implement and would allow things like: >>> scaled = a/a.max(1, reduce=0) I could do that for 1.4 if folks are interested. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From kwgoodman at gmail.com Thu Aug 6 13:07:13 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Thu, 6 Aug 2009 10:07:13 -0700 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) In-Reply-To: References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> Message-ID: On Thu, Aug 6, 2009 at 9:58 AM, Charles R Harris wrote: > > > On Thu, Aug 6, 2009 at 9:55 AM, wrote: >> >> What's the best way of getting back the correct shape to be able to >> broadcast, mean, min,.. to the original array, that works for >> arbitrary dimension and axis? >> >> I thought I have seen some helper functions, but I don't find them >> anymore? > > Adding a keyword to retain the number of dimensions has been mooted. It > shouldn't be too difficult to implement and would allow things like: > >>>> scaled = a/a.max(1, reduce=0) > > I could do that for 1.4 if folks are interested. I'd use that. It's better than what I usually do: scaled = a / a.max(1).reshape(-1,1) From bergstrj at iro.umontreal.ca Thu Aug 6 13:12:26 2009 From: bergstrj at iro.umontreal.ca (James Bergstra) Date: Thu, 6 Aug 2009 13:12:26 -0400 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: References: Message-ID: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> >David Warde-Farley cs.toronto.edu> writes: >> It did inspire some of our colleagues in Montreal to create this, >> though: >> >> ? ? 
http://code.google.com/p/cuda-ndarray/
>>
>> I gather it is VERY early in development, but I'm sure they'd love
>> contributions!
>>
>
> Hi David,
> That does look quite close to what I imagined, probably a good start then!
> Romain

Hi, I'm one of the devs for that project. Thanks David for the link. I put
some text on the homepage so it's a little more self-explanatory. We do
welcome contributions.

I feel like I must be reinventing the wheel on this, so I'd really
appreciate it if someone who knows of a similar project would let me
know about it. Otherwise we'll keep plugging away at replicating core
ndarray interface elements (operators, math.h-type functions, array
indexing, etc.)

http://code.google.com/p/cuda-ndarray/

James

From josef.pktd at gmail.com  Thu Aug  6 13:18:52 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Thu, 6 Aug 2009 13:18:52 -0400
Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...)
In-Reply-To:
References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com>
Message-ID: <1cd32cbb0908061018g4ec61669gb43cb18f0cb00a9e@mail.gmail.com>

On Thu, Aug 6, 2009 at 1:07 PM, Keith Goodman wrote:
> On Thu, Aug 6, 2009 at 9:58 AM, Charles R Harris wrote:
>>
>> On Thu, Aug 6, 2009 at 9:55 AM, wrote:
>>>
>>> What's the best way of getting back the correct shape to be able to
>>> broadcast, mean, min,.. to the original array, that works for
>>> arbitrary dimension and axis?
>>>
>>> I thought I have seen some helper functions, but I don't find them
>>> anymore?
>>
>> Adding a keyword to retain the number of dimensions has been mooted. It
>> shouldn't be too difficult to implement and would allow things like:
>>
>>>>> scaled = a/a.max(1, reduce=0)
>>
>> I could do that for 1.4 if folks are interested.
>
> I'd use that. It's better than what I usually do:
>
> scaled = a / a.max(1).reshape(-1,1)
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

An added keyword on the numpy reduce methods would be nice, but a helper
function is still useful for our own reduce operations. Something like
this function with an awful name:

Josef

import numpy as np

def addreducedaxis(x, axis=None):
    '''adds axis so that results of a reduce operation broadcast
    to the original array

    Parameters
    ----------
    x : array
        n-dim array that is the result of a reduce operation, e.g. mean, min
    axis : int
        axis that was removed in the reduce operation

    Returns
    -------
    y : array
        (n+1)-dim array with additional axis
    '''
    if axis != 0 and axis is not None:
        ind = [slice(None)] * (x.ndim+1)
        ind[axis] = np.newaxis
        return x[ind]
    else:
        return x

a = np.array([[1,2,3,3,0],[2,2,3,2,1]])
a3 = np.dstack((a,a))

print np.all((a3-a3.mean(1)[:,None,:]) == (a3 - addreducedaxis(a3.mean(1),1)))
print np.all((a3-a3.mean(-2)[:,None,:]) == (a3 - addreducedaxis(a3.mean(-2),-2)))
print np.all((a3-a3.mean()) == (a3 - addreducedaxis(a3.mean())))
print np.all((a3.ravel()-a3.mean(None)) == (a3.ravel() - addreducedaxis(a3.mean(None),None)))
print np.all((a3-a3.mean(None)) == (a3 - addreducedaxis(a3.mean(None),None)))

#example usage
from numpy.testing import assert_almost_equal
for axis in [None,0,1,2]:
    m = a3.mean(axis)
    v = ((a3 - addreducedaxis(m,axis))**2).mean(axis)
    assert_almost_equal(v, np.var(a3,axis), 15)

#normalize array along one axis
a3n = (a3 - addreducedaxis(np.mean(a3,1),1))/np.sqrt(addreducedaxis(np.var(a3,1),1))
print a3n.mean(1)
print a3n.var(1)

From charlesr.harris at gmail.com  Thu Aug  6 13:19:53 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 6 Aug 2009 11:19:53 -0600
Subject: [Numpy-discussion] Fwd: GPU Numpy
In-Reply-To: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com>
References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com>
Message-ID:

On Thu, Aug 6, 2009 at 11:12 AM, James Bergstra wrote:
> >David Warde-Farley <dwf at cs.toronto.edu> writes:
> >> It did inspire some of our colleagues in Montreal to create this,
> >> though:
> >>
> >>     http://code.google.com/p/cuda-ndarray/
> >>
> >> I gather it is VERY early in development, but I'm sure they'd love
> >> contributions!
> >>
> >
> >Hi David,
> >That does look quite close to what I imagined, probably a good start then!
> >Romain
>
> Hi, I'm one of the devs for that project. Thanks David for the link.
> I put some text on the homepage so it's a little more
> self-explanatory. We do welcome contributions.
>
> I feel like I must be reinventing the wheel on this, so I'd really
> appreciate it if someone who knows of a similar project would let me
> know about it. Otherwise we'll keep plugging away at replicating core
> ndarray interface elements (operators, math.h-type functions, array
> indexing, etc.)
>
> http://code.google.com/p/cuda-ndarray/
>

It almost looks like you are reimplementing numpy, in c++ no less. Is there
any reason why you aren't working with a numpy branch and just adding
ufuncs? I'm also curious if you have thoughts about how to use the GPU
pipelines in parallel.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From bsouthey at gmail.com  Thu Aug  6 13:25:21 2009
From: bsouthey at gmail.com (Bruce Southey)
Date: Thu, 06 Aug 2009 12:25:21 -0500
Subject: [Numpy-discussion] Bug or documentation omission with dtype
	parameter with numpy.mean
Message-ID: <4A7B1201.80308@gmail.com>

Hi,
Should numpy.mean() (and similar functions) maintain the original dtype
or the dtype used for the dtype option? I do understand the numerical
issues involved, but my question relates to whether this should be a bug
or needs clarification in the documentation.

According to the help, the dtype parameter to numpy.mean() says:
dtype : dtype, optional
    Type to use in computing the mean. For integer inputs, the default
    is float64; for floating point inputs, it is the same as the input
    dtype.
I interpret this as describing the type used in the internal calculations;
technically it does not say what the output dtype should be. But the dtype
option does change the output dtype, as the simple example below shows.
With Python 2.6 and numpy '1.4.0.dev7282' on Linux 64-bit Fedora 11:

>>> import numpy as np
>>> a=np.array([1,2,3])
>>> a.dtype
dtype('int64')
>>> a.mean().dtype
dtype('float64')
>>> a.mean(dtype=np.float32).dtype
dtype('float64')
>>> a.mean(dtype=np.float64).dtype
dtype('float64')
>>> a.mean(dtype=np.float128).dtype
dtype('float128')
>>> a.mean(dtype=np.int).dtype
dtype('float64')

Clearly the output dtype is float64 or higher, as determined by the input
dtype or the dtype parameter.

Bruce

From bergstrj at iro.umontreal.ca  Thu Aug  6 13:41:32 2009
From: bergstrj at iro.umontreal.ca (James Bergstra)
Date: Thu, 6 Aug 2009 13:41:32 -0400
Subject: [Numpy-discussion] Fwd: GPU Numpy
In-Reply-To:
References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com>
Message-ID: <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com>

On Thu, Aug 6, 2009 at 1:19 PM, Charles R Harris wrote:
> It almost looks like you are reimplementing numpy, in c++ no less. Is there
> any reason why you aren't working with a numpy branch and just adding
> ufuncs?

I don't know how that would work. The Ufuncs need a datatype to work
with, and AFAIK, it would break everything if a numpy ndarray pointed
to memory on the GPU. Could you explain what you mean a little more?

> I'm also curious if you have thoughts about how to use the GPU
> pipelines in parallel.

Current thinking for ufunc type computations:
1) divide up the tensors into subtensors whose dimensions have
power-of-two sizes (this permits a fast integer -> ndarray coordinate
computation using bit shifting),
2) launch a kernel for each subtensor in its own stream to use
parallel pipelines.
3) sync and return.

This is a pain to do without automatic code generation though.
Currently we're using macros, but that's not pretty.
C++ has templates, which we don't really use yet, but were planning on
using. These have some power to generate code.
The 'theano' project (www.pylearn.org/theano) for which cuda-ndarray
was created has a more powerful code generation mechanism similar to
weave. This algorithm is used in theano-cuda-ndarray.
Scipy.weave could be very useful for generating code for specific
shapes/ndims on demand, if weave could use nvcc.

James

From erik.tollerud at gmail.com  Thu Aug  6 14:54:52 2009
From: erik.tollerud at gmail.com (Erik Tollerud)
Date: Thu, 6 Aug 2009 11:54:52 -0700
Subject: [Numpy-discussion] Fwd: GPU Numpy
In-Reply-To: <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com>
References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com>
	<7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com>
Message-ID:

Note that this is from a "user" perspective, as I have no particular plan
of developing the details of this implementation, but I've thought for a
long time that GPU support could be great for numpy (I would also vote for
OpenCL support over CUDA, although conceptually they seem quite similar)...

But what exactly would the large-scale plan be? One of the advantages of
GPGPUs is that they are particularly suited to rather complicated
parallelizable algorithms, and the numpy-level basic operations are just
the simple arithmetic operations.
So while I'd love to see it working, it's unclear to me exactly how much is gained at the core numpy level, especially given that it's limited to single-precision on most GPUs. Now linear algebra or FFTs on a GPU would probably be a huge boon, I'll admit - especially if it's in the form of a drop-in replacement for the numpy or scipy versions. By the way, I noticed no one mentioned the GPUArray class in pycuda (and it looks like there's something similar in the pyopencl) - seems like that's already done a fair amount of the work... http://documen.tician.de/pycuda/array.html#pycuda.gpuarray.GPUArray On Thu, Aug 6, 2009 at 10:41 AM, James Bergstra wrote: > On Thu, Aug 6, 2009 at 1:19 PM, Charles R > Harris wrote: > > I almost looks like you are reimplementing numpy, in c++ no less. Is > there > > any reason why you aren't working with a numpy branch and just adding > > ufuncs? > > I don't know how that would work. The Ufuncs need a datatype to work > with, and AFAIK, it would break everything if a numpy ndarray pointed > to memory on the GPU. Could you explain what you mean a little more? > > > I'm also curious if you have thoughts about how to use the GPU > > pipelines in parallel. > > Current thinking for ufunc type computations: > 1) divide up the tensors into subtensors whose dimensions have > power-of-two sizes (this permits a fast integer -> ndarray coordinate > computation using bit shifting), > 2) launch a kernel for each subtensor in it's own stream to use > parallel pipelines. > 3) sync and return. > > This is a pain to do without automatic code generation though. > Currently we're using macros, but that's not pretty. > C++ has templates, which we don't really use yet, but were planning on > using. These have some power to generate code. > The 'theano' project (www.pylearn.org/theano) for which cuda-ndarray > was created has a more powerful code generation mechanism similar to > weave. This algorithm is used in theano-cuda-ndarray. > Scipy.weave could be very useful for generating code for specific > shapes/ndims on demand, if weave could use nvcc. > > James > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthieu.brucher at gmail.com Thu Aug 6 15:03:15 2009 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Thu, 6 Aug 2009 21:03:15 +0200 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> Message-ID: 2009/8/6 Erik Tollerud : > Note that this is from a "user" perspective, as I have no particular plan of > developing the details of this implementation, but I've thought for a long > time that GPU support could be great for numpy?(I would also vote for OpenCL > support over cuda, although conceptually they seem quite similar)... > But??what exactly would the large-scale plan be? ?One of the advantages of > GPGPUs is that they are particularly suited to rather complicated > paralellizable algorithms, You mean simple parallizable algorithms, I suppose? and the numpy-level basic operations are just the > simple arithmatic operations. ?So while I'd love to see it working, it's > unclear to me exactly how much is gained at the core numpy level, especially > given that it's limited to single-precision on most GPUs. 
> Now linear algebra or FFTs on a GPU would probably be a huge boon, I'll > admit - especially if it's in the form of a drop-in replacement for the > numpy or scipy versions. > By the way, I noticed no one mentioned the GPUArray class in pycuda (and it > looks like there's something similar in the pyopencl) - seems like that's > already done a fair amount of the work... > http://documen.tician.de/pycuda/array.html#pycuda.gpuarray.GPUArray > > > On Thu, Aug 6, 2009 at 10:41 AM, James Bergstra > wrote: >> >> On Thu, Aug 6, 2009 at 1:19 PM, Charles R >> Harris wrote: >> > I almost looks like you are reimplementing numpy, in c++ no less. Is >> > there >> > any reason why you aren't working with a numpy branch and just adding >> > ufuncs? >> >> I don't know how that would work. ?The Ufuncs need a datatype to work >> with, and AFAIK, it would break everything if a numpy ndarray pointed >> to memory on the GPU. ?Could you explain what you mean a little more? >> >> > I'm also curious if you have thoughts about how to use the GPU >> > pipelines in parallel. >> >> Current thinking for ufunc type computations: >> 1) divide up the tensors into subtensors whose dimensions have >> power-of-two sizes (this permits a fast integer -> ndarray coordinate >> computation using bit shifting), >> 2) launch a kernel for each subtensor in it's own stream to use >> parallel pipelines. >> 3) sync and return. >> >> This is a pain to do without automatic code generation though. >> Currently we're using macros, but that's not pretty. >> C++ has templates, which we don't really use yet, but were planning on >> using. ?These have some power to generate code. >> The 'theano' project (www.pylearn.org/theano) for which cuda-ndarray >> was created has a more powerful code generation mechanism similar to >> weave. ? This algorithm is used in theano-cuda-ndarray. >> Scipy.weave could be very useful for generating code for specific >> shapes/ndims on demand, if weave could use nvcc. >> >> James >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Information System Engineer, Ph.D. Website: http://matthieu-brucher.developpez.com/ Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 LinkedIn: http://www.linkedin.com/in/matthieubrucher From Chris.Barker at noaa.gov Thu Aug 6 16:16:25 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 06 Aug 2009 13:16:25 -0700 Subject: [Numpy-discussion] Optimized half-sizing of images? Message-ID: <4A7B3A19.7040800@noaa.gov> (second try on this message. the fist time I included a test PNG that made it too large) Hi folks, We have a need to to generate half-size version of RGB images as quickly as possible. PIL does a pretty good job, but it dawned on me that in the special case of a half-size, one might be able to do it faster with numpy, simply averaging the four pixels in the larger image to create one in the small. I'm doing tiling, and thus reducing 512x512 images to 256x256, so I imagine I'm making good use of cache (it does get pretty pokey with really large images!) 
What I have now is essentially this:

# a is a (h, w, 3) RGB array
a2 = a[0::2, 0::2, :].astype(np.uint16)
a2 += a[0::2, 1::2, :]
a2 += a[1::2, 0::2, :]
a2 += a[1::2, 1::2, :]
a2 /= 4
return a2.astype(np.uint8)

time: 67.2 ms per loop

I can speed it up a bit if I accumulate in a uint8 and divide as I go to
prevent overflow:

a2 = a[0::2, 0::2, :].astype(np.uint8) / 4
a2 += a[0::2, 1::2, :] / 4
a2 += a[1::2, 0::2, :] / 4
a2 += a[1::2, 1::2, :] / 4
return a2

time: 46.6 ms per loop

That does lose a touch of accuracy, I suppose, but nothing I can see.

Method 1 is about twice as slow as PIL's bilinear scaling. Can I do better?
It seems it should be faster if I can avoid so many separate loops through
the array. I figure there may be some way with filter or convolve or
ndimage, but they all seem to return an array the same size. Any ideas?

(Cython is another option, of course)

-Chris

Test code enclosed.

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov
-------------- next part --------------
A non-text attachment was scrubbed...
Name: numpy_resize.py
Type: application/x-python
Size: 3526 bytes
Desc: not available
URL:

From stefan at sun.ac.za  Thu Aug  6 16:23:55 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Thu, 6 Aug 2009 15:23:55 -0500
Subject: [Numpy-discussion] Optimized half-sizing of images?
In-Reply-To: <4A7B3A19.7040800@noaa.gov>
References: <4A7B3A19.7040800@noaa.gov>
Message-ID: <9457e7c80908061323g5aa82815oa6dc88bf7e80cc9c@mail.gmail.com>

Hi Chris

2009/8/6 Christopher Barker :
> Can I do better? It seems it should be faster if I can avoid so many
> separate loops through the array. I figure there may be some way with
> filter or convolve or ndimage, but they all seem to return an array the
> same size.

Are you willing to depend on SciPy? We've got pretty fast zooming
code in ndimage.

If speed is a big issue, I'd consider using the GPU, which was made
for this sort of down-sampling.

Cheers
Stéfan

From dwf at cs.toronto.edu  Thu Aug  6 16:49:56 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Thu, 6 Aug 2009 16:49:56 -0400
Subject: [Numpy-discussion] Fwd: GPU Numpy
In-Reply-To:
References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com>
	<7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com>
Message-ID: <9E3227F3-CD76-4594-B383-A1C5ED081F82@cs.toronto.edu>

On 6-Aug-09, at 2:54 PM, Erik Tollerud wrote:
> Now linear algebra or FFTs on a GPU would probably be a huge boon,
> I'll admit - especially if it's in the form of a drop-in replacement
> for the numpy or scipy versions.

The word I'm hearing from people in my direct acquaintance who are
using it is that if you have code that even just does lots of matrix
multiplies, never mind solving systems or anything like that, the
speedup is several orders of magnitude. Things that used to take weeks
now take a day or two. If you can deal with the loss of precision it's
really quite worth it.

> By the way, I noticed no one mentioned the GPUArray class in pycuda
> (and it looks like there's something similar in pyopencl) - seems like
> that's already done a fair amount of the work...
> http://documen.tician.de/pycuda/array.html#pycuda.gpuarray.GPUArray

This seems like a great start, I agree. The lack of any documentation
on 'dot' is worrying, though.
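From a quick look, basic usage is something like this (a from-memory
sketch, so treat the details as approximate):

import numpy as np
import pycuda.autoinit            # creates a CUDA context on import
import pycuda.gpuarray as gpuarray

a = gpuarray.to_gpu(np.random.randn(4, 4).astype(np.float32))
b = (2 * a + 1).get()             # arithmetic runs on the GPU; .get() copies back
print np.allclose(b, 2 * a.get() + 1)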
David

From sturla at molden.no  Thu Aug  6 16:57:50 2009
From: sturla at molden.no (Sturla Molden)
Date: Thu, 06 Aug 2009 22:57:50 +0200
Subject: [Numpy-discussion] Fwd: GPU Numpy
In-Reply-To:
References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com>
	<7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com>
Message-ID: <4A7B43CE.7050509@molden.no>

> Now linear algebra or FFTs on a GPU would probably be a huge boon,
> I'll admit - especially if it's in the form of a drop-in replacement
> for the numpy or scipy versions.

NumPy generates temporary arrays for expressions involving ndarrays. This
extra allocation and copying often takes more time than the computation.
With GPGPUs, we have to bus the data to and from VRAM as well. D. Knuth
quoted Hoare saying that "premature optimization is the root of all
evil." Optimizing computation when the bottleneck is memory is premature.

In order to improve on this, I think we have to add "lazy evaluation" to
NumPy. That is, an operator should not return a temporary array but a
symbolic expression. So if we have an expression like

   y = a*x + b

it should not evaluate a*x into a temporary array. Rather, the operators
would build up a "parse tree" like

   y = add(multiply(a,x),b)

and evaluate the whole expression later on. This would require two things:
First we need "dynamic code generation", which incidentally is what OpenCL
is all about. I.e. OpenCL is a dynamically invoked compiler; there is a
function clCreateProgramWithSource, which does just what it says. Second,
we need arrays to be immutable. This is very important. If arrays are not
immutable, code like this could fail:

   y = a*x + b
   x[0] = 1235512371235

With lazy evaluation, the memory overhead would be much smaller. The GPGPU
would also get more complex expressions to use as kernels. There should be
an option of running this on the CPU, possibly using OpenMP for
multi-threading. We could either depend on a compiler (C or Fortran) being
installed, or use opcodes for a dedicated virtual machine (cf. what
numexpr does).

In order to reduce the effect of immutable arrays, we could introduce a
context-manager. Inside the with statement, all arrays would be immutable.
Second, the __exit__ method could trigger the code generator and do all
the evaluation. So we would get something like this:

   # normal numpy here

   with numpy.accelerator():

       # arrays become immutable
       # lazy evaluation

   # code generation and evaluation on exit

   # normal numpy continues here

Thus, here is my plan:

1. a special context-manager class
2. immutable arrays inside with statement
3. lazy evaluation: expressions build up a parse tree
4. dynamic code generation
5. evaluation on exit

I guess it is possible to find ways to speed this up as well. If a context
manager would always generate the same OpenCL code, the with statement
would only need to execute once (we could raise an exception on enter to
jump directly to exit).

It is possible to create a superfast NumPy. But just plugging GPGPUs
into the current design would be premature. In NumPy's current state,
with mutable ndarrays and operators generating temporary arrays, there
is not much to gain from introducing GPGPUs. It would only be beneficial
in computationally demanding parts like FFTs and solvers for linear
algebra and differential equations. Ufuncs with transcendental functions
might also benefit. SciPy would certainly benefit more from GPGPUs than
NumPy.
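To make the parse-tree idea concrete, here is a toy sketch of steps 3 and
5 in pure Python (all names are invented; a real implementation would emit
OpenCL or C instead of calling NumPy inside eval):

import numpy as np

class Expr(object):
    """Toy parse-tree node: operators build the tree, eval() walks it."""
    def __init__(self, op, *args):
        self.op, self.args = op, args
    def __add__(self, other):
        return Expr(np.add, self, other)
    __radd__ = __add__
    def __mul__(self, other):
        return Expr(np.multiply, self, other)
    __rmul__ = __mul__
    def eval(self):
        # evaluate children first, then apply this node's operation
        args = [a.eval() if isinstance(a, Expr) else a for a in self.args]
        return self.op(*args)

class Leaf(Expr):
    """Wraps an (immutable) ndarray as a leaf of the tree."""
    def __init__(self, value):
        self.value = value
    def eval(self):
        return self.value

x = Leaf(np.arange(5.0))
y = 2.0*x + 3.0    # builds add(multiply(2.0,x),3.0); nothing is computed yet
print y.eval()     # evaluation happens here, in a single pass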
From Chris.Barker at noaa.gov Thu Aug 6 17:05:17 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 06 Aug 2009 14:05:17 -0700 Subject: [Numpy-discussion] Optimized half-sizing of images? In-Reply-To: <9457e7c80908061323g5aa82815oa6dc88bf7e80cc9c@mail.gmail.com> References: <4A7B3A19.7040800@noaa.gov> <9457e7c80908061323g5aa82815oa6dc88bf7e80cc9c@mail.gmail.com> Message-ID: <4A7B458D.9040005@noaa.gov>

Stéfan van der Walt wrote:
> Are you willing to depend on SciPy? We've got pretty fast zooming
> code in ndimage.

I looked there, and didn't see it -- didn't think to look for "zoom". Now I do, I'll give it a try. However, my thought is that for the special case of half-sizing, all that spline stuff could be unneeded and slower. I'll see what we get.

thanks,

-Chris

--
Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov

From robert.kern at gmail.com Thu Aug 6 17:17:36 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 16:17:36 -0500 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: <4A7B43CE.7050509@molden.no> References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> Message-ID: <3d375d730908061417k102177bem860c207272c168e7@mail.gmail.com>

On Thu, Aug 6, 2009 at 15:57, Sturla Molden wrote:
>
>> Now linear algebra or FFTs on a GPU would probably be a huge boon,
>> I'll admit - especially if it's in the form of a drop-in replacement
>> for the numpy or scipy versions.
>
> NumPy generates temporary arrays for expressions involving ndarrays. This
> extra allocation and copying often takes more time than the computation.
> With GPGPUs, we have to bus the data to and from VRAM as well. D. Knuth
> quoted Hoare saying that "premature optimization is the root of all
> evil." Optimizing computation when the bottleneck is memory is premature.
> It is possible to create a superfast NumPy. But just plugging GPGPUs
> into the current design would be premature. In NumPy's current state,
> with mutable ndarrays and operators generating temporary arrays, there
> is not much to gain from introducing GPGPUs. It would only be beneficial
> in computationally demanding parts like FFTs and solvers for linear
> algebra and differential equations.

I believe that is exactly the point that Erik is making. :-)

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From sturla at molden.no Thu Aug 6 17:23:12 2009 From: sturla at molden.no (Sturla Molden) Date: Thu, 06 Aug 2009 23:23:12 +0200 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: <3d375d730908061417k102177bem860c207272c168e7@mail.gmail.com> References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> <3d375d730908061417k102177bem860c207272c168e7@mail.gmail.com> Message-ID: <4A7B49C0.6000405@molden.no>

Robert Kern wrote:
> I believe that is exactly the point that Erik is making. :-)
>

I wasn't arguing against him, just suggesting a solution. :-) I have big hopes for lazy evaluation, if we can find a way to do it right.
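For what it's worth, numexpr (mentioned above) already implements a limited form of this today: it compiles a whole expression string for a small virtual machine and evaluates it in one pass, without the intermediate temporaries. A sketch, assuming numexpr is installed:

    import numpy as np
    import numexpr as ne

    n = 1000000
    a = np.random.rand(n)
    x = np.random.rand(n)
    b = np.random.rand(n)

    y1 = a * x + b                  # plain NumPy: a*x allocates a temporary
    y2 = ne.evaluate("a * x + b")   # a single fused pass over the data
    print np.allclose(y1, y2)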
Sturla From bergstrj at iro.umontreal.ca Thu Aug 6 17:29:07 2009 From: bergstrj at iro.umontreal.ca (James Bergstra) Date: Thu, 6 Aug 2009 17:29:07 -0400 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: <4A7B43CE.7050509@molden.no> References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> Message-ID: <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> On Thu, Aug 6, 2009 at 4:57 PM, Sturla Molden wrote: > >> Now linear algebra or FFTs on a GPU would probably be a huge boon, >> I'll admit - especially if it's in the form of a drop-in replacement >> for the numpy or scipy versions. > > > NumPy generate temporary arrays for expressions involving ndarrays. This > extra allocation and copying often takes more time than the computation. > With GPGPUs, we have to bus the data to and from VRAM as well. D. Knuth > quoted Hoare saying that "premature optimization is the root of all > evil." Optimizing computation when the bottleneck is memory is premature. > > In order to improve on this, I think we have to add "lazy evaluation" to > NumPy. That is, an operator should not return a temporary array but a > symbolic expression. So if we have an expression like > > ? ?y = a*x + b > > it should not evalute a*x into a temporary array. Rather, the operators > would build up a "parse tree" like > > ? ?y = add(multiply(a,x),b) > > and evalute the whole expression ?later on. [snip] > Regards, > Sturla Molden > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > Hi Sturla, The plan you describe is a good one, and Theano (www.pylearn.org/theano) almost exactly implements it. You should check it out. It does not use 'with' syntax at the moment, but it could provide the backend machinery for your mechanism if you want to go forward with that. Theano provides - symbolic expression building for a big subset of what numpy can do (and a few things that it doesn't) - expression optimization (for faster and more accurate computations) - dynamic code generation - cacheing of compiled functions to disk. Also, when you have a symbolic expression graph you can do cute stuff like automatic differentiation. We're currently working on the bridge between theano and cuda so that you declare certain inputs as residing on the GPU instead of the host memory, so you don't have to transfer things to and from host memory as much. James From dalcinl at gmail.com Thu Aug 6 17:48:45 2009 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Thu, 6 Aug 2009 18:48:45 -0300 Subject: [Numpy-discussion] FCompiler and runtime library dirs Message-ID: Hi, folks, Using NumPy 1.3.0 from Fedora 11, though this issue likely applies to current trunk (I've not actually tested, just taken a look at the sources) As numpy.distutils.FCompiler inherits from distutils.ccompiler.CCompiler, the method "runtime_library_dir_option()" fails with NotImplementedError. 
I had to add the monkeypatch pasted below to a setup.py script (full code at http://petsc.cs.iit.edu/petsc4py/petsc4py-dev/file/tip/demo/wrap-f2py/setup.py) in order to get things working (with GCC on Linux):

    from numpy.distutils.fcompiler import FCompiler
    from numpy.distutils.unixccompiler import UnixCCompiler

    FCompiler.runtime_library_dir_option = \
        UnixCCompiler.runtime_library_dir_option.im_func

Do any of you have an idea about how to properly fix this issue in numpy? I'm tempted to re-use the UnixCCompiler implementation in POSIX (including Mac OS X?), just to save some lines and not repeat that code in every FCompiler subclass... Comments?

--
Lisandro Dalcín --------------- Centro Internacional de Métodos Computacionales en Ingeniería (CIMEC) Instituto de Desarrollo Tecnológico para la Industria Química (INTEC) Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET) PTLC - Güemes 3450, (3000) Santa Fe, Argentina Tel/Fax: +54-(0)342-451.1594

From npuloski at gmail.com Thu Aug 6 18:01:27 2009 From: npuloski at gmail.com (Nanime Puloski) Date: Thu, 6 Aug 2009 18:01:27 -0400 Subject: [Numpy-discussion] Strange Error with NumPy Message-ID:

Can anyone explain to me why I receive an error (AttributeError) in NumPy when I do numpy.sin(2**64), but not when I do numpy.sin(2.0**64), numpy.sin(float(2**64)) or even numpy.sin(2)?

Where is the root problem in all of this?

-------------- next part -------------- An HTML attachment was scrubbed... URL:

From charlesr.harris at gmail.com Thu Aug 6 18:12:11 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 6 Aug 2009 16:12:11 -0600 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> Message-ID:

On Thu, Aug 6, 2009 at 3:29 PM, James Bergstra wrote:
> On Thu, Aug 6, 2009 at 4:57 PM, Sturla Molden wrote:
> >
> >> Now linear algebra or FFTs on a GPU would probably be a huge boon,
> >> I'll admit - especially if it's in the form of a drop-in replacement
> >> for the numpy or scipy versions.
> >
> > NumPy generates temporary arrays for expressions involving ndarrays. This
> > extra allocation and copying often takes more time than the computation.
> > With GPGPUs, we have to bus the data to and from VRAM as well. D. Knuth
> > quoted Hoare saying that "premature optimization is the root of all
> > evil." Optimizing computation when the bottleneck is memory is premature.
> >
> > In order to improve on this, I think we have to add "lazy evaluation" to
> > NumPy. That is, an operator should not return a temporary array but a
> > symbolic expression. So if we have an expression like
> >
> >     y = a*x + b
> >
> > it should not evaluate a*x into a temporary array. Rather, the operators
> > would build up a "parse tree" like
> >
> >     y = add(multiply(a,x),b)
> >
> > and evaluate the whole expression later on.
> [snip]
> > Regards,
> > Sturla Molden
> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion at scipy.org
> > http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
> Hi Sturla,
>
> The plan you describe is a good one, and Theano
> (www.pylearn.org/theano) almost exactly implements it. You should
> check it out.
> It does not use 'with' syntax at the moment, but it
> could provide the backend machinery for your mechanism if you want to
> go forward with that. Theano provides
> - symbolic expression building for a big subset of what numpy can do
> (and a few things that it doesn't)
> - expression optimization (for faster and more accurate computations)
> - dynamic code generation
> - caching of compiled functions to disk.
>
> Also, when you have a symbolic expression graph you can do cute stuff
> like automatic differentiation. We're currently working on the bridge
> between theano and cuda so that you declare certain inputs as residing
> on the GPU instead of the host memory, so you don't have to transfer
> things to and from host memory as much.
>

So what simple things could numpy implement that would help here? It almost sounds like numpy would mostly be an interface to python, and the gpu would execute specialized code written and compiled for specific problems. Whether the code that gets compiled is written using lazy evaluation (ala Sturla), or is expressed some other way seems like an independent issue. It sounds like one important thing would be having arrays that reside on the GPU.

Chuck

-------------- next part -------------- An HTML attachment was scrubbed... URL:

From robert.kern at gmail.com Thu Aug 6 18:15:14 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 17:15:14 -0500 Subject: [Numpy-discussion] Strange Error with NumPy In-Reply-To: References: Message-ID: <3d375d730908061515k25bfcc55n6893eaa56b363b09@mail.gmail.com>

On Thu, Aug 6, 2009 at 17:01, Nanime Puloski wrote:
> Can anyone explain to me why I receive an error (AttributeError) in NumPy
> when I do numpy.sin(2**64), but not when I do numpy.sin(2.0**64),
> numpy.sin(float(2**64)) or even numpy.sin(2)?
>
> Where is the root problem in all of this?

numpy deals with objects that can be natively represented by the C numerical types on your machine. On your machine 2**64 gives you a Python long object which cannot be converted to one of the native C numerical types, so numpy.sin() treats it as a regular Python object. The default implementation of all of the ufuncs when faced with a Python object it doesn't know about is to look for a method on the object of the same name. long.sin() does not exist, so you get an AttributeError.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From dwf at cs.toronto.edu Thu Aug 6 18:16:57 2009 From: dwf at cs.toronto.edu (David Warde-Farley) Date: Thu, 6 Aug 2009 18:16:57 -0400 Subject: [Numpy-discussion] Strange Error with NumPy In-Reply-To: References: Message-ID:

On 6-Aug-09, at 6:01 PM, Nanime Puloski wrote:
> Can anyone explain to me why I receive an error (AttributeError) in
> NumPy
> when I do numpy.sin(2**64), but not when I do numpy.sin(2.0**64),
> numpy.sin(float(2**64)) or even numpy.sin(2)?

    In [6]: type(2**64)
    Out[6]: <type 'long'>

    In [7]: type(2)
    Out[7]: <type 'int'>

    In [8]: type(2.0**64)
    Out[8]: <type 'float'>

Probably because 2**64 yields a long, which is an arbitrary precision type in Python, and numpy is having trouble casting it to something it can use (it's too big for numpy.int64). Notice that np.sin(2**63 - 1) works fine while np.sin(2**63) doesn't - 2**63 - 1 is the largest value that an int64 can hold.

You're right that it could be approximately represented as a float32 or float64, but numpy won't cast from exact types to inexact types without you telling it to do so. I agree that the error message is less than ideal.

David
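A quick demonstration of the boundary being described, written as a sketch for Python 2.x; the exact behaviour may differ across numpy versions:

    import numpy as np

    print np.sin(2**63 - 1)       # fits in int64, so it converts and works
    try:
        np.sin(2**63)             # too big for int64: treated as an object,
    except AttributeError:        # and long has no .sin() method
        print 'AttributeError, as described above'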
From Chris.Barker at noaa.gov Thu Aug 6 18:34:34 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 06 Aug 2009 15:34:34 -0700 Subject: [Numpy-discussion] Optimized half-sizing of images? In-Reply-To: <4A7B458D.9040005@noaa.gov> References: <4A7B3A19.7040800@noaa.gov> <9457e7c80908061323g5aa82815oa6dc88bf7e80cc9c@mail.gmail.com> <4A7B458D.9040005@noaa.gov> Message-ID: <4A7B5A7A.5000105@noaa.gov>

Christopher Barker wrote:
> Stéfan van der Walt wrote:
>> Are you willing to depend on SciPy? We've got pretty fast zooming
>> code in ndimage.
> However, my thought is that for the special case of half-sizing, all
> that spline stuff could be unneeded and slower. I'll see what we get.

I've given that a try. The docs are pretty sparse. I couldn't figure out a way to get it to do the whole image at once. Rather I had to do each band separately. It ended up about the same speed as my int accumulator method:

    def test5(a):
        """
        using ndimage

        tested on 512x512 RGB image:
        time: 40 ms per loop
        """
        h = a.shape[0]/2
        w = a.shape[1]/2
        a2 = np.empty((h, w, 3), dtype=np.uint8)
        a2[:,:,0] = ndi.zoom(a[:,:,0], 0.5, order=1)
        a2[:,:,1] = ndi.zoom(a[:,:,1], 0.5, order=1)
        a2[:,:,2] = ndi.zoom(a[:,:,2], 0.5, order=1)
        return a2

Is there a way to get it to do all three colorbands at once?

NOTE: if I didn't set order to 1, it was MUCH slower, as one might expect.

--
Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov

-------------- next part -------------- A non-text attachment was scrubbed... Name: numpy_resize.py Type: application/x-python Size: 4236 bytes Desc: not available URL:

From sturla at molden.no Thu Aug 6 18:36:54 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 07 Aug 2009 00:36:54 +0200 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> Message-ID: <4A7B5B06.2080909@molden.no>

Charles R Harris wrote:
> Whether the code that gets compiled is written using lazy evaluation
> (ala Sturla), or is expressed some other way seems like an independent
> issue. It sounds like one important thing would be having arrays that
> reside on the GPU.

Memory management is slow compared to computation. Operations like malloc, free and memcpy are not faster for VRAM than for RAM. There will be no benefit from the GPU if the bottleneck is memory. That is why we need to get rid of the creation of temporary arrays, hence lazy evaluation.

Having arrays reside in VRAM would reduce the communication between RAM and VRAM, but the problem with temporary arrays is still there. Also VRAM tends to be a limited resource.

Sturla
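The temporary-array cost being described can be seen, and partly avoided, with the explicit output argument that ufuncs already take. A small sketch:

    import numpy as np

    n = 1000000
    a, x, b = np.ones(n), np.ones(n), np.ones(n)

    y = a * x + b            # allocates a temporary for a*x, then another for + b

    y = np.empty(n)
    np.multiply(a, x, y)     # write a*x straight into y
    np.add(y, b, y)          # accumulate in place: no extra temporaries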
From sturla at molden.no Thu Aug 6 18:49:32 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 07 Aug 2009 00:49:32 +0200 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: <4A7B5B06.2080909@molden.no> References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> <4A7B5B06.2080909@molden.no> Message-ID: <4A7B5DFC.5050308@molden.no>

Sturla Molden wrote:
> Memory management is slow compared to computation. Operations like
> malloc, free and memcpy are not faster for VRAM than for RAM.

Actually it's not VRAM anymore, but whatever you call the memory dedicated to the GPU. It is cheap to put 8 GB of RAM into a computer, but graphics cards with more than 1 GB memory are expensive and uncommon on e.g. laptops. And this memory will be needed for other things as well, e.g. graphics.

Sturla

From charlesr.harris at gmail.com Thu Aug 6 18:50:20 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 6 Aug 2009 16:50:20 -0600 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: <4A7B5B06.2080909@molden.no> References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> <4A7B5B06.2080909@molden.no> Message-ID:

On Thu, Aug 6, 2009 at 4:36 PM, Sturla Molden wrote:
> Charles R Harris wrote:
> > Whether the code that gets compiled is written using lazy evaluation
> > (ala Sturla), or is expressed some other way seems like an independent
> > issue. It sounds like one important thing would be having arrays that
> > reside on the GPU.
> Memory management is slow compared to computation. Operations like
> malloc, free and memcpy are not faster for VRAM than for RAM. There will
> be no benefit from the GPU if the bottleneck is memory. That is why we
> need to get rid of the creation of temporary arrays, hence lazy evaluation.
>
> Having arrays reside in VRAM would reduce the communication between RAM
> and VRAM, but the problem with temporary arrays is still there.
>

I'm not arguing with that, but I regard it as a separate problem. One could, after all, simply use an expression-to-GPU compiler to generate modules. The question is what simple additions we can make to numpy so that it acts as a convenient io channel. I mean, once the computations are moved elsewhere numpy is basically a convenient way to address memory.

> Also VRAM tends to be a limited resource.
>

But getting less so. These days it comes in gigabytes and there is no reason why it shouldn't soon exceed what many folks have for main memory.

Chuck

-------------- next part -------------- An HTML attachment was scrubbed... URL:

From sturla at molden.no Thu Aug 6 19:10:52 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 07 Aug 2009 01:10:52 +0200 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> <4A7B5B06.2080909@molden.no> Message-ID: <4A7B62FC.6090504@molden.no>

Charles R Harris wrote:
> I mean, once the computations are moved elsewhere numpy is basically a
> convenient way to address memory.

That is how I mostly use NumPy, though.
Computations I often do in Fortran 95 or C. NumPy arrays on the GPU memory is an easy task. But then I would have to write the computation in OpenCL's dialect of C99? But I'd rather program everything in Python if I could. Details like GPU and OpenCL should be hidden away. Nice looking Python with NumPy is much easier to read and write. That is why I'd like to see a code generator (i.e. JIT compiler) for NumPy. Sturla From npuloski at gmail.com Thu Aug 6 19:26:10 2009 From: npuloski at gmail.com (Nanime Puloski) Date: Thu, 6 Aug 2009 19:26:10 -0400 Subject: [Numpy-discussion] Strange Error with NumPy Addendum Message-ID: Thank you for your responses so far. What I also do not understand is why sin(2**64) works with the standard Python math module, but fails to do so with NumPy? To Robert Kern: Can't 2^64 be represented in C as a long double? It seems to work well on my machine. -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla at molden.no Thu Aug 6 19:26:59 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 07 Aug 2009 01:26:59 +0200 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> Message-ID: <4A7B66C3.7070605@molden.no> James Bergstra wrote: > The plan you describe is a good one, and Theano > (www.pylearn.org/theano) almost exactly implements it. You should > check it out. It does not use 'with' syntax at the moment, but it > could provide the backend machinery for your mechanism if you want to > go forward with that. Theano provides > - symbolic expression building for a big subset of what numpy can do > (and a few things that it doesn't) > - expression optimization (for faster and more accurate computations) > - dynamic code generation > - cacheing of compiled functions to disk. Thank you James, theano looks great. :-D Sturla From charlesr.harris at gmail.com Thu Aug 6 19:27:29 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 6 Aug 2009 17:27:29 -0600 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: <4A7B62FC.6090504@molden.no> References: <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> <4A7B5B06.2080909@molden.no> <4A7B62FC.6090504@molden.no> Message-ID: On Thu, Aug 6, 2009 at 5:10 PM, Sturla Molden wrote: > Charles R Harris wrote: > > > I mean, once the computations are moved elsewhere numpy is basically a > > convenient way to address memory. > > That is how I mostly use NumPy, though. Computations I often do in > Fortran 95 or C. > > NumPy arrays on the GPU memory is an easy task. Glad to hear it. So maybe some way to specify and track where the memory is allocated would be helpful. Travis wants to add a dictionary to ndarrays and that might be useful here. But then I would have to > write the computation in OpenCL's dialect of C99? But I'd rather program > everything in Python if I could. Details like GPU and OpenCL should be > hidden away. Nice looking Python with NumPy is much easier to read and > write. That is why I'd like to see a code generator (i.e. JIT compiler) > for NumPy. > Yes, but that is a language/compiler problem. 
I'm thinking of what tools numpy can offer that would help people experimenting with different approaches to using GPUs. At some point we might want to adopt a working approach but now seems a bit early for that.

Chuck

-------------- next part -------------- An HTML attachment was scrubbed... URL:

From robert.kern at gmail.com Thu Aug 6 19:29:41 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 18:29:41 -0500 Subject: [Numpy-discussion] Strange Error with NumPy Addendum In-Reply-To: References: Message-ID: <3d375d730908061629s721052d6tc8246e4f81231d2c@mail.gmail.com>

On Thu, Aug 6, 2009 at 18:26, Nanime Puloski wrote:
> Thank you for your responses so far.
> What I also do not understand is why sin(2**64) works with
> the standard Python math module, but fails to do so with NumPy?

math.sin() always converts the argument to a float. We do not.

> To Robert Kern:
> Can't 2^64 be represented in C as a long double?

For that value, yes, but not for long objects in general. We don't look at the value itself, just the type.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From fperez.net at gmail.com Thu Aug 6 20:00:20 2009 From: fperez.net at gmail.com (Fernando Perez) Date: Thu, 6 Aug 2009 17:00:20 -0700 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: <4A7B43CE.7050509@molden.no> References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> Message-ID:

On Thu, Aug 6, 2009 at 1:57 PM, Sturla Molden wrote:
> In order to reduce the effect of immutable arrays, we could introduce a
> context-manager. Inside the with statement, all arrays would be
> immutable. Second, the __exit__ method could trigger the code generator
> and do all the evaluation. So we would get something like this:
>
>     # normal numpy here
>
>     with numpy.accelerator():
>
>         # arrays become immutable
>         # lazy evaluation
>
>         # code generation and evaluation on exit
>
>     # normal numpy continues here
>
> Thus, here is my plan:
>
> 1. a special context-manager class
> 2. immutable arrays inside with statement
> 3. lazy evaluation: expressions build up a parse tree
> 4. dynamic code generation
> 5. evaluation on exit

You will face one issue here: unless you raise a special exception inside the with block, the python interpreter will unconditionally execute that code without your control. I had a long talk about this with Alex Martelli last year at scipy, where I pitched the idea of allowing context managers to have an optional third method, __execute__, which would get the code block in the with statement for execution. He was fairly pessimistic about the possibility of this making its way into python, mostly (if I recall correctly) because of scoping issues: the with statement does not introduce a new scope, so you'd need to pass to this method the code plus the locals/globals of the entire enclosing scope, which felt messy. There was also the thorny question of how to pass the code block. Source? Bytecode? What? In many environments the source may not be available.

Last year I wrote a gross hack to do this, which you can find here:

http://bazaar.launchpad.net/~ipython-dev/ipython/0.10/annotate/head%3A/IPython/kernel/contexts.py

The idea is that it would be used by code like this (note, this doesn't actually work right now):

    def test_simple():

        # XXX - for now, we need a running cluster to be started separately.  The
        # daemon work is almost finished, and will make much of this unnecessary.
        from IPython.kernel import client
        mec = client.MultiEngineClient(('127.0.0.1',10105))
        try:
            mec.get_ids()
        except ConnectionRefusedError:
            import os, time
            os.system('ipcluster -n 2 &')
            time.sleep(2)
            mec = client.MultiEngineClient(('127.0.0.1',10105))

        mec.block = False

        parallel = RemoteMultiEngine(mec)

        mec.pushAll()

        with parallel as pr:
            # A comment
            remote()  # this means the code below only runs remotely
            print 'Hello remote world'
            x = range(10)
            # Comments are OK
            # Even misindented.
            y = x+1

        print pr.x + pr.y

###

The problem with my approach is that I find it brittle and ugly enough that I ultimately abandoned it. I'd love to see if you find a proper solution for this...

Cheers,

f
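A minimal sketch of the limitation under discussion: __enter__ and __exit__ only bracket the block, while the interpreter itself still executes the body, so the context manager never sees the code object. The class name here is invented for illustration:

    class Accelerator(object):
        def __enter__(self):
            # a real version could switch arrays into a lazy, immutable mode
            return self
        def __exit__(self, exc_type, exc_value, tb):
            # a real version could trigger code generation and evaluation
            return False          # do not swallow exceptions

    with Accelerator():
        print 'the body still runs normally, statement by statement'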
From zachary.pincus at yale.edu Thu Aug 6 21:46:03 2009 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Thu, 6 Aug 2009 21:46:03 -0400 Subject: [Numpy-discussion] Optimized half-sizing of images? In-Reply-To: <4A7B3A19.7040800@noaa.gov> References: <4A7B3A19.7040800@noaa.gov> Message-ID: <86638F54-75A9-4DBD-9FA5-0F5C08794CA1@yale.edu>

> We have a need to generate half-size version of RGB images as
> quickly as possible.

How good do these need to look? You could just throw away every other pixel... image[::2, ::2].

Failing that, you could also try using ndimage's convolve routines to run a 2x2 box filter over the image, and then throw away half of the pixels. But this would be slower than optimal, because the kernel would be convolved over every pixel, not just the ones you intend to keep.

Really though, I'd just bite the bullet and write a C extension (or cython, whatever, an extension to work for a defined-dimensionality, defined-dtype array is pretty simple), or as suggested before, do it on the GPU. (Though I find that readback from the GPU can be slow enough that C code can beat it in some cases.)

Zach

From robert.kern at gmail.com Thu Aug 6 22:01:02 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 21:01:02 -0500 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> Message-ID: <3d375d730908061901k6e5286b0n2b3d88f5ffc54182@mail.gmail.com>

On Thu, Aug 6, 2009 at 19:00, Fernando Perez wrote:
> On Thu, Aug 6, 2009 at 1:57 PM, Sturla Molden wrote:
>> In order to reduce the effect of immutable arrays, we could introduce a
>> context-manager. Inside the with statement, all arrays would be
>> immutable. Second, the __exit__ method could trigger the code generator
>> and do all the evaluation. So we would get something like this:
>>
>>     # normal numpy here
>>
>>     with numpy.accelerator():
>>
>>         # arrays become immutable
>>         # lazy evaluation
>>
>>         # code generation and evaluation on exit
>>
>>     # normal numpy continues here
>>
>> Thus, here is my plan:
>>
>> 1. a special context-manager class
>> 2. immutable arrays inside with statement
>> 3. lazy evaluation: expressions build up a parse tree
>> 4. dynamic code generation
>> 5.
evaluation on exit > > You will face one issue here: unless you raise a special exception > inside the with block, the python interpreter will unconditionally > execute that code without your control. ?I had a long talk about this > with Alex Martelli last year at scipy, where I pitched the idea of > allowing context managers to have an optional third method, > __execute__, which would get the code block in the with statement for > execution. ?He was fairly pessimistic about the possibility of this > making its way into python, mostly (if I recall correctly) because of > scoping issues: the with statement does not introduce a new scope, so > you'd need to pass to this method the code plus the locals/globals of > the entire enclosing scope, which felt messy. Sometimes, I fantasize about writing a python4ply grammar that repurposes the `` quotes to provide expression literals and ``` ``` triple quotes for multiline statement literals. They would be literals for _ast abstract syntax trees. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From charlesr.harris at gmail.com Thu Aug 6 23:32:28 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 6 Aug 2009 21:32:28 -0600 Subject: [Numpy-discussion] datetime code Message-ID: Travis, Robert, Is there any reason not to merge the c code in the datetime branch at this time? If not, I will do it. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From dwf at cs.toronto.edu Fri Aug 7 01:03:07 2009 From: dwf at cs.toronto.edu (David Warde-Farley) Date: Fri, 7 Aug 2009 01:03:07 -0400 Subject: [Numpy-discussion] Strange Error with NumPy Addendum In-Reply-To: <3d375d730908061629s721052d6tc8246e4f81231d2c@mail.gmail.com> References: <3d375d730908061629s721052d6tc8246e4f81231d2c@mail.gmail.com> Message-ID: <3FDEDA59-7E92-44EF-9506-4159A0AFDF3E@cs.toronto.edu> On 6-Aug-09, at 7:29 PM, Robert Kern wrote: > For that value, yes, but not for long objects in general. We don't > look at the value itself, just the type. Err, don't look at the value (of a long), except when it's representable with an integer dtype, right? Hence why 2**63 - 1 works. David From robert.kern at gmail.com Fri Aug 7 01:05:44 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 7 Aug 2009 00:05:44 -0500 Subject: [Numpy-discussion] Strange Error with NumPy Addendum In-Reply-To: <3FDEDA59-7E92-44EF-9506-4159A0AFDF3E@cs.toronto.edu> References: <3d375d730908061629s721052d6tc8246e4f81231d2c@mail.gmail.com> <3FDEDA59-7E92-44EF-9506-4159A0AFDF3E@cs.toronto.edu> Message-ID: <3d375d730908062205v5e9e3a96y1e5074cf25774d26@mail.gmail.com> On Fri, Aug 7, 2009 at 00:03, David Warde-Farley wrote: > On 6-Aug-09, at 7:29 PM, Robert Kern wrote: > >> For that value, yes, but not for long objects in general. We don't >> look at the value itself, just the type. > > Err, don't look at the value (of a long), except when it's > representable with an integer dtype, right? Hence why 2**63 - 1 works. Err, you may be right. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco

From seb.haase at gmail.com Fri Aug 7 03:23:56 2009 From: seb.haase at gmail.com (Sebastian Haase) Date: Fri, 7 Aug 2009 09:23:56 +0200 Subject: [Numpy-discussion] Optimized half-sizing of images? In-Reply-To: <86638F54-75A9-4DBD-9FA5-0F5C08794CA1@yale.edu> References: <4A7B3A19.7040800@noaa.gov> <86638F54-75A9-4DBD-9FA5-0F5C08794CA1@yale.edu> Message-ID:

On Fri, Aug 7, 2009 at 3:46 AM, Zachary Pincus wrote:
>> We have a need to generate half-size version of RGB images as
>> quickly as possible.
>
> How good do these need to look? You could just throw away every other
> pixel... image[::2, ::2].
>
> Failing that, you could also try using ndimage's convolve routines to
> run a 2x2 box filter over the image, and then throw away half of the
> pixels. But this would be slower than optimal, because the kernel
> would be convolved over every pixel, not just the ones you intend to
> keep.
>
> Really though, I'd just bite the bullet and write a C extension (or
> cython, whatever, an extension to work for a defined-dimensionality,
> defined-dtype array is pretty simple), or as suggested before, do it
> on the GPU. (Though I find that readback from the GPU can be slow
> enough that C code can beat it in some cases.)
>
> Zach

Chris,
regarding your concerns of doing too fancy interpolation at the cost of speed, I would guess the overall bottleneck is rather the memory access than the extra CPU cycles needed for interpolation. Regarding ndimage.zoom it should be able to "not zoom" the color-axis but the others in one call.

Cheers,

--
Sebastian Haase

From robertwb at math.washington.edu Fri Aug 7 03:34:17 2009 From: robertwb at math.washington.edu (Robert Bradshaw) Date: Fri, 7 Aug 2009 00:34:17 -0700 Subject: [Numpy-discussion] Optimized half-sizing of images? In-Reply-To: References: <4A7B3A19.7040800@noaa.gov> <86638F54-75A9-4DBD-9FA5-0F5C08794CA1@yale.edu> Message-ID: <12BEE90B-55D5-42ED-B202-B0F7D09040B4@math.washington.edu>

On Aug 7, 2009, at 12:23 AM, Sebastian Haase wrote:
> On Fri, Aug 7, 2009 at 3:46 AM, Zachary Pincus wrote:
>>> We have a need to generate half-size version of RGB images as
>>> quickly as possible.
>>
>> How good do these need to look? You could just throw away every other
>> pixel... image[::2, ::2].
>>
>> Failing that, you could also try using ndimage's convolve routines to
>> run a 2x2 box filter over the image, and then throw away half of the
>> pixels. But this would be slower than optimal, because the kernel
>> would be convolved over every pixel, not just the ones you intend to
>> keep.
>> Really though, I'd just bite the bullet

You say that as if it's painful to do so :)

-------------------------------------
import cython
import numpy as np
cimport numpy as np

@cython.boundscheck(False)
def halfsize_cython(np.ndarray[np.uint8_t, ndim=2, mode="c"] a):
    cdef unsigned int i, j, w, h
    w, h = a.shape[0], a.shape[1]
    cdef np.ndarray[np.uint8_t, ndim=2, mode="c"] a2 = np.ndarray((w/2, h/2), np.uint8)
    for i in range(w/2):
        for j in range(h/2):
            a2[i,j] = (a[2*i,2*j] + a[2*i+1,2*j] + a[2*i,2*j+1] + a[2*i+1,2*j+1])/4
    return a2

def halfsize_slicing(a):
    a2 = a[0::2, 0::2].astype(np.uint8) / 4
    a2 += a[0::2, 1::2] / 4
    a2 += a[1::2, 0::2] / 4
    a2 += a[1::2, 1::2] / 4
    return a2
-------------------------------------

sage: import numpy; from half_size import *
sage: a = numpy.ndarray((512, 512), numpy.uint8)
sage: timeit("halfsize_cython(a)")
625 loops, best of 3: 604 µs per loop
sage: timeit("halfsize_slicing(a)")
5 loops, best of 3: 2.72 ms per loop

>> and write a C extension (or cython, whatever, an extension to work
>> for a defined-dimensionality,
>> defined-dtype array is pretty simple), or as suggested before, do it
>> on the GPU. (Though I find that readback from the GPU can be slow
>> enough that C code can beat it in some cases.)
>>
>> Zach
>
> Chris,
> regarding your concerns of doing too fancy interpolation at the cost of
> speed, I would guess the overall bottleneck is rather the memory
> access than the extra CPU cycles needed for interpolation.
> Regarding ndimage.zoom it should be able to "not zoom" the color-axis
> but the others in one call.

I was about to say the same thing, it's probably the memory, not cycles, that's hurting you. Of course 512x512 is still small enough to fit in L2 of any modern computer.

- Robert

From romain.brette at ens.fr Fri Aug 7 06:06:36 2009 From: romain.brette at ens.fr (Romain Brette) Date: Fri, 07 Aug 2009 12:06:36 +0200 Subject: [Numpy-discussion] Fwd: GPU Numpy In-Reply-To: <4A7B43CE.7050509@molden.no> References: <7f1eaee30908061012x2bd69e6i1550787f10cd6aaf@mail.gmail.com> <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> Message-ID:

Sturla Molden a écrit :
> Thus, here is my plan:
>
> 1. a special context-manager class
> 2. immutable arrays inside with statement
> 3. lazy evaluation: expressions build up a parse tree
> 4. dynamic code generation
> 5. evaluation on exit
>

There seems to be some similarity with what we want to do to accelerate our neural simulations (briansimulator.org), as described here:

http://brian.svn.sourceforge.net/viewvc/brian/trunk/dev/BEPs/BEP-9-Automatic%20code%20generation.txt?view=markup

(by the way BEP is "Brian Enhancement Proposal")

The speed-up factor we got in our experimental code with GPU is very substantial when there are many neurons (= large vectors, e.g. 10 000 elements), even when operations are simple.

Romain

From npuloski at gmail.com Fri Aug 7 08:15:02 2009 From: npuloski at gmail.com (Nanime Puloski) Date: Fri, 7 Aug 2009 08:15:02 -0400 Subject: [Numpy-discussion] Strange Error with NumPy Addendum In-Reply-To: <3FDEDA59-7E92-44EF-9506-4159A0AFDF3E@cs.toronto.edu> References: <3d375d730908061629s721052d6tc8246e4f81231d2c@mail.gmail.com> <3FDEDA59-7E92-44EF-9506-4159A0AFDF3E@cs.toronto.edu> Message-ID:

But if it were an unsigned int64, it should be able to hold 2**64 or at least 2**64-1. Am I correct?

On Fri, Aug 7, 2009 at 1:03 AM, David Warde-Farley wrote:
> On 6-Aug-09, at 7:29 PM, Robert Kern wrote:
>
> > For that value, yes, but not for long objects in general. We don't
> > look at the value itself, just the type.
>
> Err, don't look at the value (of a long), except when it's
> representable with an integer dtype, right? Hence why 2**63 - 1 works.
>
> David
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

-------------- next part -------------- An HTML attachment was scrubbed... URL:
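Checking the range question above with numpy itself (a small sketch; iinfo reports the representable limits):

    import numpy as np

    print np.iinfo(np.uint64).max == 2**64 - 1   # True: the largest uint64
    print np.uint64(2**64 - 1)                   # representable
    # 2**64 itself is one past the top of uint64, so it stays a Python long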
From chanley at stsci.edu Fri Aug 7 10:46:48 2009 From: chanley at stsci.edu (Christopher Hanley) Date: Fri, 07 Aug 2009 10:46:48 -0400 Subject: [Numpy-discussion] Test failures for rev7299 Message-ID: <4A7C3E58.1000705@stsci.edu>

Hi,

I receive the following test errors after building numpy rev7229 from svn:

======================================================================
FAIL: test_simple_circular (test_multiarray.TestStackedNeighborhoodIter)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/Users/chanley/dev/site-packages/lib/python/numpy/core/tests/test_multiarray.py", line 1344, in test_simple_circular
    assert_array_equal(l, r)
  File "/Users/chanley/dev/site-packages/lib/python/numpy/testing/utils.py", line 639, in assert_array_equal
    verbose=verbose, header='Arrays are not equal')
  File "/Users/chanley/dev/site-packages/lib/python/numpy/testing/utils.py", line 571, in assert_array_compare
    raise AssertionError(msg)
AssertionError:
Arrays are not equal

(mismatch 6.66666666667%)
 x: array([[ 0., 1., 2.],
       [ 1., 2., 3.],
       [ 2., 3., 0.],...
 y: array([[ 0., 1., 2.],
       [ 1., 2., 3.],
       [ 2., 3., 0.],...

======================================================================
FAIL: test_simple_mirror (test_multiarray.TestStackedNeighborhoodIter)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/Users/chanley/dev/site-packages/lib/python/numpy/core/tests/test_multiarray.py", line 1296, in test_simple_mirror
    assert_array_equal(l, r)
  File "/Users/chanley/dev/site-packages/lib/python/numpy/testing/utils.py", line 639, in assert_array_equal
    verbose=verbose, header='Arrays are not equal')
  File "/Users/chanley/dev/site-packages/lib/python/numpy/testing/utils.py", line 571, in assert_array_compare
    raise AssertionError(msg)
AssertionError:
Arrays are not equal

(mismatch 6.66666666667%)
 x: array([[ 0., 1., 2.],
       [ 1., 2., 3.],
       [ 2., 3., 0.],...
 y: array([[ 0., 1., 2.],
       [ 1., 2., 3.],
       [ 2., 3., 0.],...

----------------------------------------------------------------------
Ran 2186 tests in 10.671s

FAILED (KNOWNFAIL=1, SKIP=3, failures=2)

I'm running on an Intel MacBook Pro running OS X 10.5.8. I am using Python 2.5.1.

Chris

--
Christopher Hanley Senior Systems Software Engineer Space Telescope Science Institute 3700 San Martin Drive Baltimore MD, 21218 (410) 338-4338
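(For reference, a run like the one above can be reproduced roughly as follows; this sketch assumes the nose test framework is installed.)

    import numpy
    print numpy.__version__
    numpy.test()        # prints the "Ran N tests ... FAILED/OK" summary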
From david at ar.media.kyoto-u.ac.jp Fri Aug 7 10:51:57 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Fri, 07 Aug 2009 23:51:57 +0900 Subject: [Numpy-discussion] Test failures for rev7299 In-Reply-To: <4A7C3E58.1000705@stsci.edu> References: <4A7C3E58.1000705@stsci.edu> Message-ID: <4A7C3F8D.20400@ar.media.kyoto-u.ac.jp>

Christopher Hanley wrote:
> Hi,
>
> I receive the following test errors after building numpy rev7229 from svn:
>

Yep, a bug slipped in the last commit, I am fixing it right now,

David

From cournape at gmail.com Fri Aug 7 11:29:28 2009 From: cournape at gmail.com (David Cournapeau) Date: Sat, 8 Aug 2009 00:29:28 +0900 Subject: [Numpy-discussion] Test failures for rev7299 In-Reply-To: <4A7C3F8D.20400@ar.media.kyoto-u.ac.jp> References: <4A7C3E58.1000705@stsci.edu> <4A7C3F8D.20400@ar.media.kyoto-u.ac.jp> Message-ID: <5b8d13220908070829v1a3c5ebco70e1deb0436c45e6@mail.gmail.com>

On Fri, Aug 7, 2009 at 11:51 PM, David Cournapeau wrote:
> Christopher Hanley wrote:
>> Hi,
>>
>> I receive the following test errors after building numpy rev7229 from svn:
>>
>
> Yep, a bug slipped in the last commit, I am fixing it right now,

Hm, the fix does not look so obvious, so I just reverted the faulty commit. The whole test suite passes (modulo the f2py tests, which have been broken for a while now),

cheers,

David

From Chris.Barker at noaa.gov Fri Aug 7 12:28:53 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 07 Aug 2009 09:28:53 -0700 Subject: [Numpy-discussion] Optimized half-sizing of images? In-Reply-To: <86638F54-75A9-4DBD-9FA5-0F5C08794CA1@yale.edu> References: <4A7B3A19.7040800@noaa.gov> <86638F54-75A9-4DBD-9FA5-0F5C08794CA1@yale.edu> Message-ID: <4A7C5645.6090204@noaa.gov>

Zachary Pincus wrote:
>> We have a need to generate half-size version of RGB images as
>> quickly as possible.
>
> How good do these need to look? You could just throw away every other
> pixel... image[::2, ::2].

I want as good quality as I can get. Throwing away pixels gets a bit ugly.

> Failing that, you could also try using ndimage's convolve routines to
> run a 2x2 box filter over the image, and then throw away half of the
> pixels. But this would be slower than optimal, because the kernel
> would be convolved over every pixel, not just the ones you intend to
> keep.

yup -- worth a try though.

> Really though, I'd just bite the bullet and write a C extension (or
> cython, whatever, an extension to work for a defined-dimensionality,
> defined-dtype array is pretty simple),

I was going to sit down and do that this morning, but...

> or as suggested before, do it
> on the GPU.

I have no idea how to do that, except maybe pyOpenGL, which is on our list to try.

Sebastian Haase wrote:
> regarding your concerns of doing too fancy interpolation at the cost of
> speed, I would guess the overall bottleneck is rather the memory
> access than the extra CPU cycles needed for interpolation.

well, could be, though I can't really know 'till I try. One example, though, is using ndimage.zoom -- order 1 interpolation is MUCH faster than order 2 or 3.

> Regarding ndimage.zoom it should be able to "not zoom" the color-axis
> but the others in one call.

well, that's what I thought, but I can't figure out how to do it. The docs are a bit sparse. Here's my offer: If someone tells me how to do it, I'll make a docs contribution to the SciPy docs explaining it.
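As it happens, zoom accepts a per-axis zoom factor, which would handle all three bands in one call. A sketch, assuming scipy is installed:

    import numpy as np
    import scipy.ndimage as ndi

    a = np.zeros((512, 512, 3), dtype=np.uint8)
    a2 = ndi.zoom(a, (0.5, 0.5, 1.0), order=1)  # halve rows and cols, keep bands
    print a2.shape                              # (256, 256, 3)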
> You say that as if it's painful to do so :)

wow! Thanks for doing my work for me. I thought this would be a good case to give Cython a try for the first time -- having a working example is great.

> sage: timeit("halfsize_cython(a)")
> 625 loops, best of 3: 604 µs per loop
> sage: timeit("halfsize_slicing(a)")
> 5 loops, best of 3: 2.72 ms per loop

and bingo! a 4.5 times speed-up -- I think that's enough to see in our app.

> I was about to say the same thing, it's probably the memory, not
> cycles, that's hurting you.

sure, but the slicing method pushes that memory around more than it needs to.

> Of course 512x512 is still small enough
> to fit in L2 of any modern computer.

I think so -- I do know that the slicing method slows down a lot with larger images. We're tiling anyway in this case, but if I did want to do a big image, I'd probably break it down into chunks to process it anyway.

thanks, all.

-Chris

--
Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov

From tjhnson at gmail.com Fri Aug 7 13:42:06 2009 From: tjhnson at gmail.com (T J) Date: Fri, 7 Aug 2009 10:42:06 -0700 Subject: [Numpy-discussion] Vectorize ufunc Message-ID:

I was wondering why vectorize doesn't make the ufunc available at the topmost level....

    >>> def a(x,y): return x + y
    >>> b = vectorize(a)
    >>> b.reduce

Instead, the ufunc is stored at b.ufunc.

Also, b.ufunc.reduce() doesn't seem to exist until I *use* the vectorized function at least once. Can this be changed so that it exists right away (and preferably at b.reduce instead of b.ufunc.reduce)?

From alan at ajackson.org Fri Aug 7 13:45:25 2009 From: alan at ajackson.org (alan at ajackson.org) Date: Fri, 7 Aug 2009 12:45:25 -0500 Subject: [Numpy-discussion] Power distribution Message-ID: <20090807124525.65dd3fcf@ajackson.org>

Documenting my way through the statistics modules in numpy, I ran into the Power Distribution.

Anyone know what that is? I Googled for it, and found a lot of stuff on electricity, but no reference for a statistical distribution of that name. Does it have a common alias?

--
-----------------------------------------------------------------------
| Alan K. Jackson    | To see a World in a Grain of Sand       |
| alan at ajackson.org  | And a Heaven in a Wild Flower,          |
| www.ajackson.org   | Hold Infinity in the palm of your hand  |
| Houston, Texas     | And Eternity in an hour.
- Blake |
-----------------------------------------------------------------------

From HAWRYLA at novachem.com Fri Aug 7 13:47:34 2009 From: HAWRYLA at novachem.com (Andrew Hawryluk) Date: Fri, 7 Aug 2009 11:47:34 -0600 Subject: [Numpy-discussion] Power distribution In-Reply-To: <20090807124525.65dd3fcf@ajackson.org> References: <20090807124525.65dd3fcf@ajackson.org> Message-ID: <48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com>

You might get better results for 'power-law distribution'
http://en.wikipedia.org/wiki/Power_law

Andrew

> -----Original Message-----
> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-
> bounces at scipy.org] On Behalf Of alan at ajackson.org
> Sent: 7 Aug 2009 11:45 AM
> To: Discussion of Numerical Python
> Subject: [Numpy-discussion] Power distribution
>
> Documenting my way through the statistics modules in numpy, I ran into
> the Power Distribution.
>
> Anyone know what that is? I Googled for it, and found a lot of stuff on
> electricity, but no reference for a statistical distribution of that
> name. Does it have a common alias?

From kuiper at jpl.nasa.gov Fri Aug 7 14:05:14 2009 From: kuiper at jpl.nasa.gov (Tom Kuiper) Date: Fri, 7 Aug 2009 11:05:14 -0700 Subject: [Numpy-discussion] memmap capability Message-ID: <4A7C6CDA.9050400@jpl.nasa.gov>

If this appears twice, forgive me. I sent it previously (7:13 am PDT) via a browser interface to JPL's Office Outlook. I have doubts about this system. This time, from Iceweasel through our SMTP server.

There are two things I'd like to do using memmap. I suspect that they are impossible but maybe I'm missing some subtlety.

1) I would like to append rows to a memmap array and have the modified array changed on disk also.

2) I would like to have the memory view of the array on disk change, i.e., modify the offset for an opened array. The only way I can think of involves opening and closing arrays repeatedly.

Regards

Tom Kuiper

-------------- next part -------------- An HTML attachment was scrubbed... URL:

From tjhnson at gmail.com Fri Aug 7 14:54:24 2009 From: tjhnson at gmail.com (T J) Date: Fri, 7 Aug 2009 11:54:24 -0700 Subject: [Numpy-discussion] reduce function of vectorize doesn't respect dtype? Message-ID:

The reduce function of the ufunc of a vectorized function doesn't seem to respect the dtype.

    >>> def a(x,y): return x+y
    >>> b = vectorize(a)
    >>> c = array([1,2])
    >>> b(c, c)  # use once to populate b.ufunc
    >>> d = b.ufunc.reduce(c)
    >>> c.dtype, type(d)
    (dtype('int32'), <type 'int'>)
    >>> c = array([[1,2,3],[4,5,6]])
    >>> b.ufunc.reduce(c)
    array([5, 7, 9], dtype=object)

My goal is to use the output of vectorize() as if it is actually a ufunc. So I'd really like to just type: b.reduce, b.accumulate, etc.
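A sketch of what is happening here: the ufunc that vectorize builds works in object mode, much like one made with np.frompyfunc, so reductions come back as object arrays until they are cast explicitly.

    import numpy as np

    def add(x, y):
        return x + y

    u = np.frompyfunc(add, 2, 1)          # an object-dtype ufunc, like b.ufunc
    c = np.array([[1, 2, 3], [4, 5, 6]])
    r = u.reduce(c)                       # array([5, 7, 9], dtype=object)
    print r.astype(np.int32)              # the cast has to be done by hand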
And I don't want to have to write (after using it once to populate b.ufunc): b.ufunc.reduce(c).astype(numpy.int32) From oliphant at enthought.com Fri Aug 7 15:35:02 2009 From: oliphant at enthought.com (Travis Oliphant) Date: Fri, 7 Aug 2009 13:35:02 -0600 Subject: [Numpy-discussion] Vectorize ufunc In-Reply-To: References: Message-ID: <1FFF112C-F6B0-4C1E-B391-F0710E6C4ADE@enthought.com> The short answer is that it was easier this way. The ufunc is created on the fly and it needs to know several things that are easy to get once the function is called. Sent from my iPhone On Aug 7, 2009, at 11:42 AM, T J wrote: > I was wondering why vectorize doesn't make the ufunc available at the > topmost level.... > >>>> def a(x,y): return x + y >>>> b = vectorize(a) >>>> b.reduce > > Instead, the ufunc is stored at b.ufunc. > > Also, b.ufunc.reduce() doesn't seem to exist until I *use* the > vectorized function at least once. Can this be changed so that it > exists right away (and preferably at b.reduce instead of > b.ufunc.reduce)? > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From alan at ajackson.org Fri Aug 7 16:49:06 2009 From: alan at ajackson.org (alan at ajackson.org) Date: Fri, 7 Aug 2009 15:49:06 -0500 Subject: [Numpy-discussion] Power distribution In-Reply-To: <48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com> References: <20090807124525.65dd3fcf@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com> Message-ID: <20090807154906.223fe780@ajackson.org> I don't think that is it, since the one in numpy has a range restricted to the interval 0-1. Try out hist(np.random.power(5, 1000000), bins=100) >You might get better results for 'power-law distribution' >http://en.wikipedia.org/wiki/Power_law > >Andrew > >> -----Original Message----- >> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >> bounces at scipy.org] On Behalf Of alan at ajackson.org >> Sent: 7 Aug 2009 11:45 AM >> To: Discussion of Numerical Python >> Subject: [Numpy-discussion] Power distribution >> >> Documenting my way through the statistics modules in numpy, I ran into >> the Power Distribution. >> >> Anyone know what that is? I Googled for it, and found a lot of stuff >on >> electricity, but no reference for a statistical distribution of that >> name. Does it have a common alias? >> >> -- >> >----------------------------------------------------------------------- >> | Alan K. Jackson | To see a World in a Grain of Sand >| >> | alan at ajackson.org | And a Heaven in a Wild Flower, >| >> | www.ajackson.org | Hold Infinity in the palm of your hand >| >> | Houston, Texas | And Eternity in an hour. - Blake >| >> >----------------------------------------------------------------------- >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion -- ----------------------------------------------------------------------- | Alan K. Jackson | To see a World in a Grain of Sand | | alan at ajackson.org | And a Heaven in a Wild Flower, | | www.ajackson.org | Hold Infinity in the palm of your hand | | Houston, Texas | And Eternity in an hour. 
- Blake | ----------------------------------------------------------------------- From tjhnson at gmail.com Fri Aug 7 17:19:16 2009 From: tjhnson at gmail.com (T J) Date: Fri, 7 Aug 2009 14:19:16 -0700 Subject: [Numpy-discussion] dot documentation Message-ID: Hi, the documentation for dot says that a value error is raised if: If the last dimension of a is not the same size as the second-to-last dimension of b. (http://docs.scipy.org/doc/numpy/reference/generated/numpy.dot.htm) This doesn't appear to be the case: >>> a = array([[1,2],[3,4]]) >>> b = array([1,2]) >>> dot(a,b) array([5,11]) I can see *how* 5,11 is obtained, but it seems this should have raised a ValueError since the 2 != 1. So the actual code must do something more involved. When I think about broadcasting, it seems that maybe b should have been broadcasted to: --> array([[1,2],[1,2]]) and then the multiplication done as normal (but this would give a 2x2 result). Can someone explain this to me? From tjhnson at gmail.com Fri Aug 7 17:24:24 2009 From: tjhnson at gmail.com (T J) Date: Fri, 7 Aug 2009 14:24:24 -0700 Subject: [Numpy-discussion] dot documentation In-Reply-To: References: Message-ID: Oh. b.shape = (2,). So I suppose the second to last dimension is, in fact, the last dimension...and 2 == 2. nvm On Fri, Aug 7, 2009 at 2:19 PM, T J wrote: > Hi, ?the documentation for dot says that a value error is raised if: > > ? ?If the last dimension of a is not the same size as the > second-to-last dimension of b. > > (http://docs.scipy.org/doc/numpy/reference/generated/numpy.dot.htm) > > This doesn't appear to be the case: > >>>> a = array([[1,2],[3,4]]) >>>> b = array([1,2]) >>>> dot(a,b) > array([5,11]) > > I can see *how* 5,11 is obtained, but it seems this should have raised > a ValueError since the 2 != 1. ?So the actual code must do something > more involved. ?When I think about broadcasting, it seems that maybe b > should have been broadcasted to: > > --> ?array([[1,2],[1,2]]) > > and then the multiplication done as normal (but this would give a 2x2 result). > > Can someone explain this to me? > From HAWRYLA at novachem.com Fri Aug 7 17:25:21 2009 From: HAWRYLA at novachem.com (Andrew Hawryluk) Date: Fri, 7 Aug 2009 15:25:21 -0600 Subject: [Numpy-discussion] Power distribution In-Reply-To: <20090807154906.223fe780@ajackson.org> References: <20090807124525.65dd3fcf@ajackson.org><48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com> <20090807154906.223fe780@ajackson.org> Message-ID: <48C01AE7354EC240A26F19CEB995E943033AF256@CHMAILMBX01.novachem.com> Hmm ... good point. It appears to give a probability distribution proportional to x**(a-1), but I see no good reason why the domain should be limited to [0,1]. def test(a): nums = plt.hist(np.random.power(a,100000),bins=100,ec='none',fc='#dddddd') x = np.linspace(0,1,200) plt.plot(x,nums[0][-1]*x**(a-1)) Andrew > -----Original Message----- > From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- > bounces at scipy.org] On Behalf Of alan at ajackson.org > Sent: 7 Aug 2009 2:49 PM > To: Discussion of Numerical Python > Subject: Re: [Numpy-discussion] Power distribution > > I don't think that is it, since the one in numpy has a range restricted > to the interval 0-1. 
> > Try out hist(np.random.power(5, 1000000), bins=100) > > >You might get better results for 'power-law distribution' > >http://en.wikipedia.org/wiki/Power_law > > > >Andrew > > > >> -----Original Message----- > >> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- > >> bounces at scipy.org] On Behalf Of alan at ajackson.org > >> Sent: 7 Aug 2009 11:45 AM > >> To: Discussion of Numerical Python > >> Subject: [Numpy-discussion] Power distribution > >> > >> Documenting my way through the statistics modules in numpy, I ran > >> into the Power Distribution. > >> > >> Anyone know what that is? I Googled for it, and found a lot of stuff > >on > >> electricity, but no reference for a statistical distribution of that > >> name. Does it have a common alias? > >> > >> -- > >> > >---------------------------------------------------------------------- > - > >> | Alan K. Jackson | To see a World in a Grain of Sand > >| > >> | alan at ajackson.org | And a Heaven in a Wild Flower, > >| > >> | www.ajackson.org | Hold Infinity in the palm of your > hand > >| > >> | Houston, Texas | And Eternity in an hour. - Blake > >| > >> > >---------------------------------------------------------------------- > - > >> _______________________________________________ > >> NumPy-Discussion mailing list > >> NumPy-Discussion at scipy.org > >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > -- > ----------------------------------------------------------------------- > | Alan K. Jackson | To see a World in a Grain of Sand | > | alan at ajackson.org | And a Heaven in a Wild Flower, | > | www.ajackson.org | Hold Infinity in the palm of your hand | > | Houston, Texas | And Eternity in an hour. - Blake | > ----------------------------------------------------------------------- > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From josef.pktd at gmail.com Fri Aug 7 17:42:19 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 7 Aug 2009 17:42:19 -0400 Subject: [Numpy-discussion] Power distribution In-Reply-To: <48C01AE7354EC240A26F19CEB995E943033AF256@CHMAILMBX01.novachem.com> References: <20090807124525.65dd3fcf@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com> <20090807154906.223fe780@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF256@CHMAILMBX01.novachem.com> Message-ID: <1cd32cbb0908071442va8eb05r785b9bde6819ab89@mail.gmail.com> On Fri, Aug 7, 2009 at 5:25 PM, Andrew Hawryluk wrote: > Hmm ... good point. > It appears to give a probability distribution proportional to x**(a-1), > but I see no good reason why the domain should be limited to [0,1]. > > def test(a): > ? ?nums = > plt.hist(np.random.power(a,100000),bins=100,ec='none',fc='#dddddd') > ? ?x = np.linspace(0,1,200) > ? ?plt.plot(x,nums[0][-1]*x**(a-1)) > > Andrew > > > >> -----Original Message----- >> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >> bounces at scipy.org] On Behalf Of alan at ajackson.org >> Sent: 7 Aug 2009 2:49 PM >> To: Discussion of Numerical Python >> Subject: Re: [Numpy-discussion] Power distribution >> >> I don't think that is it, since the one in numpy has a range > restricted >> to the interval 0-1. 
>> >> Try out hist(np.random.power(5, 1000000), bins=100) >> >> >You might get better results for 'power-law distribution' >> >http://en.wikipedia.org/wiki/Power_law >> > >> >Andrew >> > >> >> -----Original Message----- >> >> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >> >> bounces at scipy.org] On Behalf Of alan at ajackson.org >> >> Sent: 7 Aug 2009 11:45 AM >> >> To: Discussion of Numerical Python >> >> Subject: [Numpy-discussion] Power distribution >> >> >> >> Documenting my way through the statistics modules in numpy, I ran >> >> into the Power Distribution. >> >> >> >> Anyone know what that is? I Googled for it, and found a lot of > stuff >> >on >> >> electricity, but no reference for a statistical distribution of > that >> >> name. Does it have a common alias? >> >> >> >> -- same is in Travis' notes on the distribution and scipy.stats.distributions domain in [0,1], but I don't know anything about it either ## Power-function distribution ## Special case of beta dist. with d =1.0 class powerlaw_gen(rv_continuous): def _pdf(self, x, a): return a*x**(a-1.0) def _cdf(self, x, a): return x**(a*1.0) def _ppf(self, q, a): return pow(q, 1.0/a) def _stats(self, a): return a/(a+1.0), a*(a+2.0)/(a+1.0)**2, \ 2*(1.0-a)*sqrt((a+2.0)/(a*(a+3.0))), \ 6*polyval([1,-1,-6,2],a)/(a*(a+3.0)*(a+4)) def _entropy(self, a): return 1 - 1.0/a - log(a) powerlaw = powerlaw_gen(a=0.0, b=1.0, name="powerlaw", longname="A power-function", shapes="a", extradoc=""" Power-function distribution powerlaw.pdf(x,a) = a*x**(a-1) for 0 <= x <= 1, a > 0. """ ) From josef.pktd at gmail.com Fri Aug 7 18:13:20 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 7 Aug 2009 18:13:20 -0400 Subject: [Numpy-discussion] Power distribution In-Reply-To: <1cd32cbb0908071442va8eb05r785b9bde6819ab89@mail.gmail.com> References: <20090807124525.65dd3fcf@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com> <20090807154906.223fe780@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF256@CHMAILMBX01.novachem.com> <1cd32cbb0908071442va8eb05r785b9bde6819ab89@mail.gmail.com> Message-ID: <1cd32cbb0908071513l29fa7fbgd35cb7ba428642da@mail.gmail.com> On Fri, Aug 7, 2009 at 5:42 PM, wrote: > On Fri, Aug 7, 2009 at 5:25 PM, Andrew Hawryluk wrote: >> Hmm ... good point. >> It appears to give a probability distribution proportional to x**(a-1), >> but I see no good reason why the domain should be limited to [0,1]. >> >> def test(a): >> ? ?nums = >> plt.hist(np.random.power(a,100000),bins=100,ec='none',fc='#dddddd') >> ? ?x = np.linspace(0,1,200) >> ? ?plt.plot(x,nums[0][-1]*x**(a-1)) >> >> Andrew >> >> >> >>> -----Original Message----- >>> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>> bounces at scipy.org] On Behalf Of alan at ajackson.org >>> Sent: 7 Aug 2009 2:49 PM >>> To: Discussion of Numerical Python >>> Subject: Re: [Numpy-discussion] Power distribution >>> >>> I don't think that is it, since the one in numpy has a range >> restricted >>> to the interval 0-1. 
>>> >>> Try out hist(np.random.power(5, 1000000), bins=100) >>> >>> >You might get better results for 'power-law distribution' >>> >http://en.wikipedia.org/wiki/Power_law >>> > >>> >Andrew >>> > >>> >> -----Original Message----- >>> >> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>> >> bounces at scipy.org] On Behalf Of alan at ajackson.org >>> >> Sent: 7 Aug 2009 11:45 AM >>> >> To: Discussion of Numerical Python >>> >> Subject: [Numpy-discussion] Power distribution >>> >> >>> >> Documenting my way through the statistics modules in numpy, I ran >>> >> into the Power Distribution. >>> >> >>> >> Anyone know what that is? I Googled for it, and found a lot of >> stuff >>> >on >>> >> electricity, but no reference for a statistical distribution of >> that >>> >> name. Does it have a common alias? >>> >> >>> >> -- > > > same is in Travis' notes on the distribution and scipy.stats.distributions > domain in [0,1], but I don't know anything about it either > > ## Power-function distribution > ## ? Special case of beta dist. with d =1.0 > > class powerlaw_gen(rv_continuous): > ? ?def _pdf(self, x, a): > ? ? ? ?return a*x**(a-1.0) > ? ?def _cdf(self, x, a): > ? ? ? ?return x**(a*1.0) > ? ?def _ppf(self, q, a): > ? ? ? ?return pow(q, 1.0/a) > ? ?def _stats(self, a): > ? ? ? ?return a/(a+1.0), a*(a+2.0)/(a+1.0)**2, \ > ? ? ? ? ? ? ? 2*(1.0-a)*sqrt((a+2.0)/(a*(a+3.0))), \ > ? ? ? ? ? ? ? 6*polyval([1,-1,-6,2],a)/(a*(a+3.0)*(a+4)) > ? ?def _entropy(self, a): > ? ? ? ?return 1 - 1.0/a - log(a) > powerlaw = powerlaw_gen(a=0.0, b=1.0, name="powerlaw", > ? ? ? ? ? ? ? ? ? ? ? ?longname="A power-function", > ? ? ? ? ? ? ? ? ? ? ? ?shapes="a", extradoc=""" > > Power-function distribution > > powerlaw.pdf(x,a) = a*x**(a-1) > for 0 <= x <= 1, a > 0. > """ > ? ? ? ? ? ? ? ? ? ? ? ?) > it looks like it's the same distribution, even though it doesn't use the random numbers from the numpy function high p-values with Kolmogorov-Smirnov, see below I assume it is a truncated version of *a* powerlaw distribution, so that a can be large, which would be impossible in the open domain case. But a quick search, I only found powerlaw applications that refer to the tail behavior. Josef >>> rvs = np.random.power(5, 100000) >>> stats.kstest(rvs,'powerlaw',(5,)) (0.0021079715221341555, 0.76587118275752697) >>> rvs = np.random.power(5, 1000000) >>> stats.kstest(rvs,'powerlaw',(5,)) (0.00063983013407076239, 0.80757958281509501) >>> rvs = np.random.power(0.5, 1000000) >>> stats.kstest(rvs,'powerlaw',(0.5,)) (0.00081823148457027539, 0.51478478398950211) From charlesr.harris at gmail.com Fri Aug 7 18:50:32 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 7 Aug 2009 16:50:32 -0600 Subject: [Numpy-discussion] dot documentation In-Reply-To: References: Message-ID: On Fri, Aug 7, 2009 at 3:24 PM, T J wrote: > Oh. b.shape = (2,). So I suppose the second to last dimension is, in > fact, the last dimension...and 2 == 2. > > nvm > > On Fri, Aug 7, 2009 at 2:19 PM, T J wrote: > > Hi, the documentation for dot says that a value error is raised if: > > > > If the last dimension of a is not the same size as the > > second-to-last dimension of b. > > > > (http://docs.scipy.org/doc/numpy/reference/generated/numpy.dot.htm) > > > > This doesn't appear to be the case: > > > >>>> a = array([[1,2],[3,4]]) > >>>> b = array([1,2]) > >>>> dot(a,b) > > array([5,11]) > > > > I can see *how* 5,11 is obtained, but it seems this should have raised > > a ValueError since the 2 != 1. 
So the actual code must do something > > more involved. When I think about broadcasting, it seems that maybe b > > should have been broadcasted to: > > > > --> array([[1,2],[1,2]]) > > > > and then the multiplication done as normal (but this would give a 2x2 > result). > > > > Can someone explain this to me? > > > It looks like a bug in the documentation. Vectors, i.e., 1D arrays, are multiplied as if they were column/row vectors depending on what side of the product they occur. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Fri Aug 7 18:57:15 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 7 Aug 2009 18:57:15 -0400 Subject: [Numpy-discussion] Power distribution In-Reply-To: <1cd32cbb0908071513l29fa7fbgd35cb7ba428642da@mail.gmail.com> References: <20090807124525.65dd3fcf@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com> <20090807154906.223fe780@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF256@CHMAILMBX01.novachem.com> <1cd32cbb0908071442va8eb05r785b9bde6819ab89@mail.gmail.com> <1cd32cbb0908071513l29fa7fbgd35cb7ba428642da@mail.gmail.com> Message-ID: <1cd32cbb0908071557l605d98aboc3dbf8ec47241077@mail.gmail.com> On Fri, Aug 7, 2009 at 6:13 PM, wrote: > On Fri, Aug 7, 2009 at 5:42 PM, wrote: >> On Fri, Aug 7, 2009 at 5:25 PM, Andrew Hawryluk wrote: >>> Hmm ... good point. >>> It appears to give a probability distribution proportional to x**(a-1), >>> but I see no good reason why the domain should be limited to [0,1]. >>> >>> def test(a): >>> ? ?nums = >>> plt.hist(np.random.power(a,100000),bins=100,ec='none',fc='#dddddd') >>> ? ?x = np.linspace(0,1,200) >>> ? ?plt.plot(x,nums[0][-1]*x**(a-1)) >>> >>> Andrew >>> >>> >>> >>>> -----Original Message----- >>>> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>>> bounces at scipy.org] On Behalf Of alan at ajackson.org >>>> Sent: 7 Aug 2009 2:49 PM >>>> To: Discussion of Numerical Python >>>> Subject: Re: [Numpy-discussion] Power distribution >>>> >>>> I don't think that is it, since the one in numpy has a range >>> restricted >>>> to the interval 0-1. >>>> >>>> Try out hist(np.random.power(5, 1000000), bins=100) >>>> >>>> >You might get better results for 'power-law distribution' >>>> >http://en.wikipedia.org/wiki/Power_law >>>> > >>>> >Andrew >>>> > >>>> >> -----Original Message----- >>>> >> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>>> >> bounces at scipy.org] On Behalf Of alan at ajackson.org >>>> >> Sent: 7 Aug 2009 11:45 AM >>>> >> To: Discussion of Numerical Python >>>> >> Subject: [Numpy-discussion] Power distribution >>>> >> >>>> >> Documenting my way through the statistics modules in numpy, I ran >>>> >> into the Power Distribution. >>>> >> >>>> >> Anyone know what that is? I Googled for it, and found a lot of >>> stuff >>>> >on >>>> >> electricity, but no reference for a statistical distribution of >>> that >>>> >> name. Does it have a common alias? >>>> >> >>>> >> -- >> >> >> same is in Travis' notes on the distribution and scipy.stats.distributions >> domain in [0,1], but I don't know anything about it either >> >> ## Power-function distribution >> ## ? Special case of beta dist. with d =1.0 >> >> class powerlaw_gen(rv_continuous): >> ? ?def _pdf(self, x, a): >> ? ? ? ?return a*x**(a-1.0) >> ? ?def _cdf(self, x, a): >> ? ? ? ?return x**(a*1.0) >> ? ?def _ppf(self, q, a): >> ? ? ? ?return pow(q, 1.0/a) >> ? ?def _stats(self, a): >> ? ? ? 
?return a/(a+1.0), a*(a+2.0)/(a+1.0)**2, \ >> ? ? ? ? ? ? ? 2*(1.0-a)*sqrt((a+2.0)/(a*(a+3.0))), \ >> ? ? ? ? ? ? ? 6*polyval([1,-1,-6,2],a)/(a*(a+3.0)*(a+4)) >> ? ?def _entropy(self, a): >> ? ? ? ?return 1 - 1.0/a - log(a) >> powerlaw = powerlaw_gen(a=0.0, b=1.0, name="powerlaw", >> ? ? ? ? ? ? ? ? ? ? ? ?longname="A power-function", >> ? ? ? ? ? ? ? ? ? ? ? ?shapes="a", extradoc=""" >> >> Power-function distribution >> >> powerlaw.pdf(x,a) = a*x**(a-1) >> for 0 <= x <= 1, a > 0. >> """ >> ? ? ? ? ? ? ? ? ? ? ? ?) >> > > > it looks like it's the same distribution, even though it doesn't use > the random numbers from the numpy function > > high p-values with Kolmogorov-Smirnov, see below > > I assume it is a truncated version of *a* powerlaw distribution, so > that a can be large, which would be impossible in the open domain > case. But a quick search, I only found powerlaw applications that > refer to the tail behavior. > > Josef > >>>> rvs = np.random.power(5, 100000) >>>> stats.kstest(rvs,'powerlaw',(5,)) > (0.0021079715221341555, 0.76587118275752697) >>>> rvs = np.random.power(5, 1000000) >>>> stats.kstest(rvs,'powerlaw',(5,)) > (0.00063983013407076239, 0.80757958281509501) >>>> rvs = np.random.power(0.5, 1000000) >>>> stats.kstest(rvs,'powerlaw',(0.5,)) > (0.00081823148457027539, 0.51478478398950211) > I found a short reference in Johnson, Kotz, Balakrishnan vol. 1 where it is refered to as the "power-function" distribution. roughly: if X is pareto (which kind) distributed, then Y=X**(-1) is distributed according to the power-function distribution. JKB have an extra parameter in there and is a bit more general then the scipy version, or maybe it is just the scale parameter included in the density function. It is also in NIST data plot, but I didn't find the html reference page, but only the pdf http://docs.google.com/gview?a=v&q=cache%3AEgQ6bRkeJl8J%3Awww.itl.nist.gov%2Fdiv898%2Fsoftware%2Fdataplot%2Frefman2%2Fauxillar%2Fpowpdf.pdf+power-function+distribution&hl=en&gl=ca&pli=1 the pdf-files for powpdf and powcdf are here http://www.itl.nist.gov/div898/software/dataplot/refman2/auxillar/homepage.htm I can look some more a bit later tonight. Josef From Chris.Barker at noaa.gov Fri Aug 7 19:31:59 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 07 Aug 2009 16:31:59 -0700 Subject: [Numpy-discussion] concatenate and non-contiguous arrays? Message-ID: <4A7CB96F.9090607@noaa.gov> Hi all, I just noticed that np.concatenate does not necessarily produce contiguous arrays. I has figured that it was making a copy, so would produce a C-contiguous array, but not so: In [88]: a = np.arange(60).reshape((4,5,3)) In [89]: b = np.concatenate((a, a[:, -1:, :]), axis=1) In [90]: b.flags Out[90]: C_CONTIGUOUS : False F_CONTIGUOUS : False OWNDATA : False WRITEABLE : True ALIGNED : True UPDATEIFCOPY : False I'm also not sure why OWNDATA is false is it sharing data with somethign else? It doesn't look like it is with a. I'll toss a ascontiguous() in my code to take care of this, but I'd like to understand it. Explanation? -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From Chris.Barker at noaa.gov Fri Aug 7 20:03:25 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 07 Aug 2009 17:03:25 -0700 Subject: [Numpy-discussion] Optimized half-sizing of images? 
In-Reply-To: <12BEE90B-55D5-42ED-B202-B0F7D09040B4@math.washington.edu> References: <4A7B3A19.7040800@noaa.gov> <86638F54-75A9-4DBD-9FA5-0F5C08794CA1@yale.edu> <12BEE90B-55D5-42ED-B202-B0F7D09040B4@math.washington.edu> Message-ID: <4A7CC0CD.5010500@noaa.gov> To finish off the thread for posterity: Robert Bradshaw wrote: Robert's version operated on a 2-d array, so only one band at a time if you have RGB. So I edited it a bit: import cython import numpy as np cimport numpy as np @cython.boundscheck(False) def halfsize(np.ndarray[np.uint8_t, ndim=3, mode="c"] a): cdef unsigned int i, j, b, w, h, d w, h, d = a.shape[0], a.shape[1], a.shape[2] cdef np.ndarray[np.uint8_t, ndim=3, mode="c"] a2 = np.ndarray((w/2, h/2, 3), np.uint8) for i in range(w/2): for j in range(h/2): for b in range(d): # color band a2[i,j,b] = (a[2*i,2*j,b] + a[2*i+1,2*j,b] + a[2*i,2*j+1,b] + a[2*i+1,2*j+1,b])/4 return a2 This now does the whole RGB image at once, and is pretty snappy. Here are my timings for half-sizing a 512x512 RGB image: slicing, accumulating with a float32 time: 89 ms per loop slicing, accumulating with a uint16 time: 67 ms per loop slicing, all calculations in uint8 time: 47 ms per loop using ndimage, one band at a time, 3rd order spline. time: 280 ms per loop using ndimage, one band at a time, 1st order spline. time: 40 ms per loop using cython, one band at a time time: 11.6 ms per loop using cython, all bands at once time: 2.66 ms per loop using PIL BILNEAR interpolation time: 2.66 ms per loop So a ten times speed up over PIL, and a 17 times speed up over my fastest numpy version. If anyone has any suggestions on how to improve on the Cython version, I'd like to hear it, though I doubt it would make a practical difference. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From josef.pktd at gmail.com Fri Aug 7 20:54:49 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 7 Aug 2009 20:54:49 -0400 Subject: [Numpy-discussion] Power distribution In-Reply-To: <1cd32cbb0908071557l605d98aboc3dbf8ec47241077@mail.gmail.com> References: <20090807124525.65dd3fcf@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com> <20090807154906.223fe780@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF256@CHMAILMBX01.novachem.com> <1cd32cbb0908071442va8eb05r785b9bde6819ab89@mail.gmail.com> <1cd32cbb0908071513l29fa7fbgd35cb7ba428642da@mail.gmail.com> <1cd32cbb0908071557l605d98aboc3dbf8ec47241077@mail.gmail.com> Message-ID: <1cd32cbb0908071754w5406ea83v89d78ecce1a81d7f@mail.gmail.com> On Fri, Aug 7, 2009 at 6:57 PM, wrote: > On Fri, Aug 7, 2009 at 6:13 PM, wrote: >> On Fri, Aug 7, 2009 at 5:42 PM, wrote: >>> On Fri, Aug 7, 2009 at 5:25 PM, Andrew Hawryluk wrote: >>>> Hmm ... good point. >>>> It appears to give a probability distribution proportional to x**(a-1), >>>> but I see no good reason why the domain should be limited to [0,1]. >>>> >>>> def test(a): >>>> ? ?nums = >>>> plt.hist(np.random.power(a,100000),bins=100,ec='none',fc='#dddddd') >>>> ? ?x = np.linspace(0,1,200) >>>> ? 
?plt.plot(x,nums[0][-1]*x**(a-1)) >>>> >>>> Andrew >>>> >>>> >>>> >>>>> -----Original Message----- >>>>> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>>>> bounces at scipy.org] On Behalf Of alan at ajackson.org >>>>> Sent: 7 Aug 2009 2:49 PM >>>>> To: Discussion of Numerical Python >>>>> Subject: Re: [Numpy-discussion] Power distribution >>>>> >>>>> I don't think that is it, since the one in numpy has a range >>>> restricted >>>>> to the interval 0-1. >>>>> >>>>> Try out hist(np.random.power(5, 1000000), bins=100) >>>>> >>>>> >You might get better results for 'power-law distribution' >>>>> >http://en.wikipedia.org/wiki/Power_law >>>>> > >>>>> >Andrew >>>>> > >>>>> >> -----Original Message----- >>>>> >> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>>>> >> bounces at scipy.org] On Behalf Of alan at ajackson.org >>>>> >> Sent: 7 Aug 2009 11:45 AM >>>>> >> To: Discussion of Numerical Python >>>>> >> Subject: [Numpy-discussion] Power distribution >>>>> >> >>>>> >> Documenting my way through the statistics modules in numpy, I ran >>>>> >> into the Power Distribution. >>>>> >> >>>>> >> Anyone know what that is? I Googled for it, and found a lot of >>>> stuff >>>>> >on >>>>> >> electricity, but no reference for a statistical distribution of >>>> that >>>>> >> name. Does it have a common alias? >>>>> >> >>>>> >> -- >>> >>> >>> same is in Travis' notes on the distribution and scipy.stats.distributions >>> domain in [0,1], but I don't know anything about it either >>> >>> ## Power-function distribution >>> ## ? Special case of beta dist. with d =1.0 >>> >>> class powerlaw_gen(rv_continuous): >>> ? ?def _pdf(self, x, a): >>> ? ? ? ?return a*x**(a-1.0) >>> ? ?def _cdf(self, x, a): >>> ? ? ? ?return x**(a*1.0) >>> ? ?def _ppf(self, q, a): >>> ? ? ? ?return pow(q, 1.0/a) >>> ? ?def _stats(self, a): >>> ? ? ? ?return a/(a+1.0), a*(a+2.0)/(a+1.0)**2, \ >>> ? ? ? ? ? ? ? 2*(1.0-a)*sqrt((a+2.0)/(a*(a+3.0))), \ >>> ? ? ? ? ? ? ? 6*polyval([1,-1,-6,2],a)/(a*(a+3.0)*(a+4)) >>> ? ?def _entropy(self, a): >>> ? ? ? ?return 1 - 1.0/a - log(a) >>> powerlaw = powerlaw_gen(a=0.0, b=1.0, name="powerlaw", >>> ? ? ? ? ? ? ? ? ? ? ? ?longname="A power-function", >>> ? ? ? ? ? ? ? ? ? ? ? ?shapes="a", extradoc=""" >>> >>> Power-function distribution >>> >>> powerlaw.pdf(x,a) = a*x**(a-1) >>> for 0 <= x <= 1, a > 0. >>> """ >>> ? ? ? ? ? ? ? ? ? ? ? ?) >>> >> >> >> it looks like it's the same distribution, even though it doesn't use >> the random numbers from the numpy function >> >> high p-values with Kolmogorov-Smirnov, see below >> >> I assume it is a truncated version of *a* powerlaw distribution, so >> that a can be large, which would be impossible in the open domain >> case. But a quick search, I only found powerlaw applications that >> refer to the tail behavior. >> >> Josef >> >>>>> rvs = np.random.power(5, 100000) >>>>> stats.kstest(rvs,'powerlaw',(5,)) >> (0.0021079715221341555, 0.76587118275752697) >>>>> rvs = np.random.power(5, 1000000) >>>>> stats.kstest(rvs,'powerlaw',(5,)) >> (0.00063983013407076239, 0.80757958281509501) >>>>> rvs = np.random.power(0.5, 1000000) >>>>> stats.kstest(rvs,'powerlaw',(0.5,)) >> (0.00081823148457027539, 0.51478478398950211) >> > > I found a short reference in Johnson, Kotz, Balakrishnan vol. 1 where > it is refered to as the "power-function" distribution. > roughly: if X is pareto (which kind) distributed, then Y=X**(-1) is > distributed according to the power-function distribution. 
JKB have an > extra parameter in there and is a bit more general then the scipy > version, or maybe it is just the scale parameter included in the > density function. > > It is also in NIST data plot, but I didn't find the html reference > page, but only the pdf > > http://docs.google.com/gview?a=v&q=cache%3AEgQ6bRkeJl8J%3Awww.itl.nist.gov%2Fdiv898%2Fsoftware%2Fdataplot%2Frefman2%2Fauxillar%2Fpowpdf.pdf+power-function+distribution&hl=en&gl=ca&pli=1 > > the pdf-files for powpdf and powcdf ?are here > http://www.itl.nist.gov/div898/software/dataplot/refman2/auxillar/homepage.htm > > > I can look some more a bit later tonight. > > Josef > for the relationship to pareto, below are some kstests and graphs a reminder that numpy.random.pareto uses a non-standard 0 bound, instead of 1 ks tests don't show good numbers every once in a while, since they are random I checked the definitions in JKB (page 607) and my previous interpretation was correct. if X has a pareto distribution with lower bound at 1 and shape parameter a>0, then 1/X has a density function p(y) = a*y**(a-1), (0 References: <20090807124525.65dd3fcf@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com> <20090807154906.223fe780@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF256@CHMAILMBX01.novachem.com> <1cd32cbb0908071442va8eb05r785b9bde6819ab89@mail.gmail.com> <1cd32cbb0908071513l29fa7fbgd35cb7ba428642da@mail.gmail.com> <1cd32cbb0908071557l605d98aboc3dbf8ec47241077@mail.gmail.com> <1cd32cbb0908071754w5406ea83v89d78ecce1a81d7f@mail.gmail.com> Message-ID: <1cd32cbb0908071810p40ba9fa1hd08c935372bf65c3@mail.gmail.com> On Fri, Aug 7, 2009 at 8:54 PM, wrote: > On Fri, Aug 7, 2009 at 6:57 PM, wrote: >> On Fri, Aug 7, 2009 at 6:13 PM, wrote: >>> On Fri, Aug 7, 2009 at 5:42 PM, wrote: >>>> On Fri, Aug 7, 2009 at 5:25 PM, Andrew Hawryluk wrote: >>>>> Hmm ... good point. >>>>> It appears to give a probability distribution proportional to x**(a-1), >>>>> but I see no good reason why the domain should be limited to [0,1]. >>>>> >>>>> def test(a): >>>>> ? ?nums = >>>>> plt.hist(np.random.power(a,100000),bins=100,ec='none',fc='#dddddd') >>>>> ? ?x = np.linspace(0,1,200) >>>>> ? ?plt.plot(x,nums[0][-1]*x**(a-1)) >>>>> >>>>> Andrew >>>>> >>>>> >>>>> >>>>>> -----Original Message----- >>>>>> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>>>>> bounces at scipy.org] On Behalf Of alan at ajackson.org >>>>>> Sent: 7 Aug 2009 2:49 PM >>>>>> To: Discussion of Numerical Python >>>>>> Subject: Re: [Numpy-discussion] Power distribution >>>>>> >>>>>> I don't think that is it, since the one in numpy has a range >>>>> restricted >>>>>> to the interval 0-1. >>>>>> >>>>>> Try out hist(np.random.power(5, 1000000), bins=100) >>>>>> >>>>>> >You might get better results for 'power-law distribution' >>>>>> >http://en.wikipedia.org/wiki/Power_law >>>>>> > >>>>>> >Andrew >>>>>> > >>>>>> >> -----Original Message----- >>>>>> >> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>>>>> >> bounces at scipy.org] On Behalf Of alan at ajackson.org >>>>>> >> Sent: 7 Aug 2009 11:45 AM >>>>>> >> To: Discussion of Numerical Python >>>>>> >> Subject: [Numpy-discussion] Power distribution >>>>>> >> >>>>>> >> Documenting my way through the statistics modules in numpy, I ran >>>>>> >> into the Power Distribution. >>>>>> >> >>>>>> >> Anyone know what that is? 
I Googled for it, and found a lot of >>>>> stuff >>>>>> >on >>>>>> >> electricity, but no reference for a statistical distribution of >>>>> that >>>>>> >> name. Does it have a common alias? >>>>>> >> >>>>>> >> -- >>>> >>>> >>>> same is in Travis' notes on the distribution and scipy.stats.distributions >>>> domain in [0,1], but I don't know anything about it either >>>> >>>> ## Power-function distribution >>>> ## ? Special case of beta dist. with d =1.0 >>>> >>>> class powerlaw_gen(rv_continuous): >>>> ? ?def _pdf(self, x, a): >>>> ? ? ? ?return a*x**(a-1.0) >>>> ? ?def _cdf(self, x, a): >>>> ? ? ? ?return x**(a*1.0) >>>> ? ?def _ppf(self, q, a): >>>> ? ? ? ?return pow(q, 1.0/a) >>>> ? ?def _stats(self, a): >>>> ? ? ? ?return a/(a+1.0), a*(a+2.0)/(a+1.0)**2, \ >>>> ? ? ? ? ? ? ? 2*(1.0-a)*sqrt((a+2.0)/(a*(a+3.0))), \ >>>> ? ? ? ? ? ? ? 6*polyval([1,-1,-6,2],a)/(a*(a+3.0)*(a+4)) >>>> ? ?def _entropy(self, a): >>>> ? ? ? ?return 1 - 1.0/a - log(a) >>>> powerlaw = powerlaw_gen(a=0.0, b=1.0, name="powerlaw", >>>> ? ? ? ? ? ? ? ? ? ? ? ?longname="A power-function", >>>> ? ? ? ? ? ? ? ? ? ? ? ?shapes="a", extradoc=""" >>>> >>>> Power-function distribution >>>> >>>> powerlaw.pdf(x,a) = a*x**(a-1) >>>> for 0 <= x <= 1, a > 0. >>>> """ >>>> ? ? ? ? ? ? ? ? ? ? ? ?) >>>> >>> >>> >>> it looks like it's the same distribution, even though it doesn't use >>> the random numbers from the numpy function >>> >>> high p-values with Kolmogorov-Smirnov, see below >>> >>> I assume it is a truncated version of *a* powerlaw distribution, so >>> that a can be large, which would be impossible in the open domain >>> case. But a quick search, I only found powerlaw applications that >>> refer to the tail behavior. >>> >>> Josef >>> >>>>>> rvs = np.random.power(5, 100000) >>>>>> stats.kstest(rvs,'powerlaw',(5,)) >>> (0.0021079715221341555, 0.76587118275752697) >>>>>> rvs = np.random.power(5, 1000000) >>>>>> stats.kstest(rvs,'powerlaw',(5,)) >>> (0.00063983013407076239, 0.80757958281509501) >>>>>> rvs = np.random.power(0.5, 1000000) >>>>>> stats.kstest(rvs,'powerlaw',(0.5,)) >>> (0.00081823148457027539, 0.51478478398950211) >>> >> >> I found a short reference in Johnson, Kotz, Balakrishnan vol. 1 where >> it is refered to as the "power-function" distribution. >> roughly: if X is pareto (which kind) distributed, then Y=X**(-1) is >> distributed according to the power-function distribution. JKB have an >> extra parameter in there and is a bit more general then the scipy >> version, or maybe it is just the scale parameter included in the >> density function. >> >> It is also in NIST data plot, but I didn't find the html reference >> page, but only the pdf >> >> http://docs.google.com/gview?a=v&q=cache%3AEgQ6bRkeJl8J%3Awww.itl.nist.gov%2Fdiv898%2Fsoftware%2Fdataplot%2Frefman2%2Fauxillar%2Fpowpdf.pdf+power-function+distribution&hl=en&gl=ca&pli=1 >> >> the pdf-files for powpdf and powcdf ?are here >> http://www.itl.nist.gov/div898/software/dataplot/refman2/auxillar/homepage.htm >> >> >> I can look some more a bit later tonight. >> >> Josef >> > > > for the relationship to pareto, below are some kstests and graphs > a reminder that numpy.random.pareto uses a non-standard 0 bound, instead of 1 > ks tests don't show good numbers every once in a while, since they are random > > I checked the definitions in JKB (page 607) and my previous > interpretation was correct. 
> if X has a pareto distribution with lower bound at 1 and shape
> parameter a>0, then 1/X has a density function
> p(y) = a*y**(a-1),  (0 < y <= 1)
> weak inequality in JKB instead of strict as in scipy.stats.powerlaw docstring
> (the actual scipy.stats.powerlaw docstring has a typo, a**x**(a-1),
> which I will correct)
>
> Josef
>
>
> import numpy as np
> from scipy import stats
> import matplotlib.pyplot as plt
>
>
> rvs = np.random.power(5, 1000000)
> rvsp = np.random.pareto(5, 1000000)
> rvsps = stats.pareto.rvs(5, size=100)
>
> print "stats.kstest(1./rvsps,'powerlaw',(5,))"
> print stats.kstest(1./rvsps,'powerlaw',(5,))
>
> print "stats.kstest(1./(1+rvsp),'powerlaw',(5,))"
> print stats.kstest(1./(1+rvsp),'powerlaw',(5,))
>
> print "stats.kstest(rvs,'powerlaw',(5,))"
> print stats.kstest(rvs,'powerlaw',(5,))
>
> print "stats.ks_2samp(rvs,1./(rvsp+1))"
> print stats.ks_2samp(rvs,1./(rvsp+1))
> print "stats.ks_2samp(rvs,1./rvsps)"
> print stats.ks_2samp(rvs,1./rvsps)
> print "stats.ks_2samp(1+rvsp, rvsps)"
> print stats.ks_2samp(1+rvsp, rvsps)

Improvements to graphs, compare with theoretical pdf

Josef

xx = np.linspace(0,1,100)
powpdf = stats.powerlaw.pdf(xx,5)

plt.figure()
plt.hist(rvs, bins=50, normed=True)
plt.plot(xx,powpdf,'r-')
plt.title('np.random.power(5)')
plt.figure()
plt.hist(1./(1.+rvsp), bins=50, normed=True)
plt.plot(xx,powpdf,'r-')
plt.title('inverse of 1 + np.random.pareto(5)')
plt.figure()
plt.hist(1./(1.+rvsp), bins=50, normed=True)
plt.plot(xx,powpdf,'r-')
plt.title('inverse of stats.pareto(5)')

>
> plt.figure()
> plt.hist(rvs, bins=50)
> plt.title('np.random.power(5)')
> plt.figure()
> plt.hist(1./(1.+rvsp), bins=50)
> plt.title('inverse of 1 + np.random.pareto(5)')
> plt.figure()
> plt.hist(1./(1.+rvsp), bins=50)
> plt.title('inverse of stats.pareto(5)')
> #plt.show()
>

From alan at ajackson.org Fri Aug 7 22:17:38 2009
From: alan at ajackson.org (alan at ajackson.org)
Date: Fri, 7 Aug 2009 21:17:38 -0500
Subject: [Numpy-discussion] Power distribution
In-Reply-To: <1cd32cbb0908071810p40ba9fa1hd08c935372bf65c3@mail.gmail.com>
References: <20090807124525.65dd3fcf@ajackson.org>
	<48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com>
	<20090807154906.223fe780@ajackson.org>
	<48C01AE7354EC240A26F19CEB995E943033AF256@CHMAILMBX01.novachem.com>
	<1cd32cbb0908071442va8eb05r785b9bde6819ab89@mail.gmail.com>
	<1cd32cbb0908071513l29fa7fbgd35cb7ba428642da@mail.gmail.com>
	<1cd32cbb0908071557l605d98aboc3dbf8ec47241077@mail.gmail.com>
	<1cd32cbb0908071754w5406ea83v89d78ecce1a81d7f@mail.gmail.com>
	<1cd32cbb0908071810p40ba9fa1hd08c935372bf65c3@mail.gmail.com>
Message-ID: <20090807211738.6a7ce10c@ajackson.org>

Thanks! That helps a lot.

>On Fri, Aug 7, 2009 at 8:54 PM, wrote:
>> On Fri, Aug 7, 2009 at 6:57 PM, wrote:
>>> On Fri, Aug 7, 2009 at 6:13 PM, wrote:
>>>> On Fri, Aug 7, 2009 at 5:42 PM, wrote:
>>>>> On Fri, Aug 7, 2009 at 5:25 PM, Andrew Hawryluk wrote:
>>>>>> Hmm ... good point.
>>>>>> It appears to give a probability distribution proportional to x**(a-1),
>>>>>> but I see no good reason why the domain should be limited to [0,1].
>>>>>>
>>>>>> def test(a):
>>>>>>     nums =
>>>>>> plt.hist(np.random.power(a,100000),bins=100,ec='none',fc='#dddddd')
>>>>>>     x = np.linspace(0,1,200)
>>>>>>
?plt.plot(x,nums[0][-1]*x**(a-1)) >>>>>> >>>>>> Andrew >>>>>> >>>>>> >>>>>> >>>>>>> -----Original Message----- >>>>>>> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>>>>>> bounces at scipy.org] On Behalf Of alan at ajackson.org >>>>>>> Sent: 7 Aug 2009 2:49 PM >>>>>>> To: Discussion of Numerical Python >>>>>>> Subject: Re: [Numpy-discussion] Power distribution >>>>>>> >>>>>>> I don't think that is it, since the one in numpy has a range >>>>>> restricted >>>>>>> to the interval 0-1. >>>>>>> >>>>>>> Try out hist(np.random.power(5, 1000000), bins=100) >>>>>>> >>>>>>> >You might get better results for 'power-law distribution' >>>>>>> >http://en.wikipedia.org/wiki/Power_law >>>>>>> > >>>>>>> >Andrew >>>>>>> > >>>>>>> >> -----Original Message----- >>>>>>> >> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>>>>>> >> bounces at scipy.org] On Behalf Of alan at ajackson.org >>>>>>> >> Sent: 7 Aug 2009 11:45 AM >>>>>>> >> To: Discussion of Numerical Python >>>>>>> >> Subject: [Numpy-discussion] Power distribution >>>>>>> >> >>>>>>> >> Documenting my way through the statistics modules in numpy, I ran >>>>>>> >> into the Power Distribution. >>>>>>> >> >>>>>>> >> Anyone know what that is? I Googled for it, and found a lot of >>>>>> stuff >>>>>>> >on >>>>>>> >> electricity, but no reference for a statistical distribution of >>>>>> that >>>>>>> >> name. Does it have a common alias? >>>>>>> >> >>>>>>> >> -- >>>>> >>>>> >>>>> same is in Travis' notes on the distribution and scipy.stats.distributions >>>>> domain in [0,1], but I don't know anything about it either >>>>> >>>>> ## Power-function distribution >>>>> ## ? Special case of beta dist. with d =1.0 >>>>> >>>>> class powerlaw_gen(rv_continuous): >>>>> ? ?def _pdf(self, x, a): >>>>> ? ? ? ?return a*x**(a-1.0) >>>>> ? ?def _cdf(self, x, a): >>>>> ? ? ? ?return x**(a*1.0) >>>>> ? ?def _ppf(self, q, a): >>>>> ? ? ? ?return pow(q, 1.0/a) >>>>> ? ?def _stats(self, a): >>>>> ? ? ? ?return a/(a+1.0), a*(a+2.0)/(a+1.0)**2, \ >>>>> ? ? ? ? ? ? ? 2*(1.0-a)*sqrt((a+2.0)/(a*(a+3.0))), \ >>>>> ? ? ? ? ? ? ? 6*polyval([1,-1,-6,2],a)/(a*(a+3.0)*(a+4)) >>>>> ? ?def _entropy(self, a): >>>>> ? ? ? ?return 1 - 1.0/a - log(a) >>>>> powerlaw = powerlaw_gen(a=0.0, b=1.0, name="powerlaw", >>>>> ? ? ? ? ? ? ? ? ? ? ? ?longname="A power-function", >>>>> ? ? ? ? ? ? ? ? ? ? ? ?shapes="a", extradoc=""" >>>>> >>>>> Power-function distribution >>>>> >>>>> powerlaw.pdf(x,a) = a*x**(a-1) >>>>> for 0 <= x <= 1, a > 0. >>>>> """ >>>>> ? ? ? ? ? ? ? ? ? ? ? ?) >>>>> >>>> >>>> >>>> it looks like it's the same distribution, even though it doesn't use >>>> the random numbers from the numpy function >>>> >>>> high p-values with Kolmogorov-Smirnov, see below >>>> >>>> I assume it is a truncated version of *a* powerlaw distribution, so >>>> that a can be large, which would be impossible in the open domain >>>> case. But a quick search, I only found powerlaw applications that >>>> refer to the tail behavior. >>>> >>>> Josef >>>> >>>>>>> rvs = np.random.power(5, 100000) >>>>>>> stats.kstest(rvs,'powerlaw',(5,)) >>>> (0.0021079715221341555, 0.76587118275752697) >>>>>>> rvs = np.random.power(5, 1000000) >>>>>>> stats.kstest(rvs,'powerlaw',(5,)) >>>> (0.00063983013407076239, 0.80757958281509501) >>>>>>> rvs = np.random.power(0.5, 1000000) >>>>>>> stats.kstest(rvs,'powerlaw',(0.5,)) >>>> (0.00081823148457027539, 0.51478478398950211) >>>> >>> >>> I found a short reference in Johnson, Kotz, Balakrishnan vol. 
1 where >>> it is refered to as the "power-function" distribution. >>> roughly: if X is pareto (which kind) distributed, then Y=X**(-1) is >>> distributed according to the power-function distribution. JKB have an >>> extra parameter in there and is a bit more general then the scipy >>> version, or maybe it is just the scale parameter included in the >>> density function. >>> >>> It is also in NIST data plot, but I didn't find the html reference >>> page, but only the pdf >>> >>> http://docs.google.com/gview?a=v&q=cache%3AEgQ6bRkeJl8J%3Awww.itl.nist.gov%2Fdiv898%2Fsoftware%2Fdataplot%2Frefman2%2Fauxillar%2Fpowpdf.pdf+power-function+distribution&hl=en&gl=ca&pli=1 >>> >>> the pdf-files for powpdf and powcdf ?are here >>> http://www.itl.nist.gov/div898/software/dataplot/refman2/auxillar/homepage.htm >>> >>> >>> I can look some more a bit later tonight. >>> >>> Josef >>> >> >> >> for the relationship to pareto, below are some kstests and graphs >> a reminder that numpy.random.pareto uses a non-standard 0 bound, instead of 1 >> ks tests don't show good numbers every once in a while, since they are random >> >> I checked the definitions in JKB (page 607) and my previous >> interpretation was correct. >> if X has a pareto distribution with lower bound at 1 and shape >> parameter a>0, then 1/X has a density function >> p(y) = a*y**(a-1), ?(0> weak inequality in JKB instead of strict as in scipy.stats.powerlaw docstring >> (the actual scipy.stats.powerlaw docstring ?has a typo, a**x**(a-1), >> which I will correct) >> >> Josef >> >> >> import numpy as np >> from scipy import stats >> import matplotlib.pyplot as plt >> >> >> rvs = np.random.power(5, 1000000) >> rvsp = np.random.pareto(5, 1000000) >> rvsps = stats.pareto.rvs(5, size=100) >> >> print "stats.kstest(1./rvsps,'powerlaw',(5,))" >> print stats.kstest(1./rvsps,'powerlaw',(5,)) >> >> print "stats.kstest(1./(1+rvsp),'powerlaw',(5,))" >> print stats.kstest(1./(1+rvsp),'powerlaw',(5,)) >> >> print "stats.kstest(rvs,'powerlaw',(5,))" >> print stats.kstest(rvs,'powerlaw',(5,)) >> >> print "stats.ks_2samp(rvs,1./(rvsp+1))" >> print stats.ks_2samp(rvs,1./(rvsp+1)) >> print "stats.ks_2samp(rvs,1./rvsps)" >> print stats.ks_2samp(rvs,1./rvsps) >> print "stats.ks_2samp(1+rvsp, rvsps)" >> print stats.ks_2samp(1+rvsp, rvsps) > > >Improvements to graphs, compare with theoretical pdf > >Josef > >xx = np.linspace(0,1,100) >powpdf = stats.powerlaw.pdf(xx,5) > >plt.figure() >plt.hist(rvs, bins=50, normed=True) >plt.plot(xx,powpdf,'r-') >plt.title('np.random.power(5)') >plt.figure() >plt.hist(1./(1.+rvsp), bins=50, normed=True) >plt.plot(xx,powpdf,'r-') >plt.title('inverse of 1 + np.random.pareto(5)') >plt.figure() >plt.hist(1./(1.+rvsp), bins=50, normed=True) >plt.plot(xx,powpdf,'r-') >plt.title('inverse of stats.pareto(5)') > > > >> >> plt.figure() >> plt.hist(rvs, bins=50) >> plt.title('np.random.power(5)') >> plt.figure() >> plt.hist(1./(1.+rvsp), bins=50) >> plt.title('inverse of 1 + np.random.pareto(5)') >> plt.figure() >> plt.hist(1./(1.+rvsp), bins=50) >> plt.title('inverse of stats.pareto(5)') >> #plt.show() >> -- ----------------------------------------------------------------------- | Alan K. Jackson | To see a World in a Grain of Sand | | alan at ajackson.org | And a Heaven in a Wild Flower, | | www.ajackson.org | Hold Infinity in the palm of your hand | | Houston, Texas | And Eternity in an hour. 
- Blake | ----------------------------------------------------------------------- From josef.pktd at gmail.com Fri Aug 7 22:38:04 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 7 Aug 2009 22:38:04 -0400 Subject: [Numpy-discussion] numpy.random.pareto, m equal zero Message-ID: <1cd32cbb0908071938s5ec7e663uddf1a98d2c8d8f40@mail.gmail.com> Does it make any (statistical) sense to have numpy.random.pareto produce random numbers that start at zero? Can we change it to start at 1 which is the usual default? Notation from http://docs.scipy.org/numpy/docs/numpy.random.mtrand.RandomState.pareto/ The probability density for the Pareto distribution is .. math:: p(x) = \\frac{am^a}{x^{a+1}} where :math:`a` is the shape and :math:`m` the location constraints from Johnson, Kotz, Balakrishnan vol1 page 574 m>0, a>0, x>=m 1) as m goes to zero, the pdf goes to zero for every point, (mean, variance go to zero, essentially masspoint at zero) 2) quote from http://www.itl.nist.gov/div898/software/dataplot/refman2/auxillar/parpdf.htm (their `a` is our `m`) " Note that although the a (=m JP) parameter is typically called a location parameter (and it is in the sense that it defines the lower bound), it is not a location parameter in the technical sense that the following relation does not hold: f(x;gamma,a) = f((x-a);gamma,0) For this reason, Dataplot treats a (=m JP) as a shape parameter. In Dataplot, the a (=m JP) shape parameter is optional with a default value of 1. " my conclusion: --------------------- What numpy.random.pareto actually produces, are random numbers from a pareto distribution with lower bound m=1, but location parameter loc=-1, that shifts the distribution to the left. To actually get useful random numbers (that are correct in the usual usage http://en.wikipedia.org/wiki/Pareto_distribution), we need to add 1 to them. stats.distributions doesn't use mtrand.pareto (why?), so I never needed to check this before. rvs_pareto = 1 + numpy.random.pareto(a, size) for correction in some calculation, see the thread on the power distribution. Do we have to live with loc=-1, or can we change it, or am I misinterpreting something (which wouldn't be the first time either)? Josef From charlesr.harris at gmail.com Fri Aug 7 23:23:40 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 7 Aug 2009 21:23:40 -0600 Subject: [Numpy-discussion] Merging datetime, Yes or No Message-ID: I ask again, Datetime is getting really stale and hasn't been touched recently. Do the datetime folks want it merged or not, because it's getting to be a bit of work. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From josef.pktd at gmail.com Fri Aug 7 23:55:45 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 7 Aug 2009 23:55:45 -0400 Subject: [Numpy-discussion] Power distribution In-Reply-To: <20090807211738.6a7ce10c@ajackson.org> References: <20090807124525.65dd3fcf@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com> <20090807154906.223fe780@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF256@CHMAILMBX01.novachem.com> <1cd32cbb0908071442va8eb05r785b9bde6819ab89@mail.gmail.com> <1cd32cbb0908071513l29fa7fbgd35cb7ba428642da@mail.gmail.com> <1cd32cbb0908071557l605d98aboc3dbf8ec47241077@mail.gmail.com> <1cd32cbb0908071754w5406ea83v89d78ecce1a81d7f@mail.gmail.com> <1cd32cbb0908071810p40ba9fa1hd08c935372bf65c3@mail.gmail.com> <20090807211738.6a7ce10c@ajackson.org> Message-ID: <1cd32cbb0908072055t5f45be8ax7aa272eaed7defc5@mail.gmail.com> On Fri, Aug 7, 2009 at 10:17 PM, wrote: > Thanks! That helps a lot. Thanks for improving the docs. > >>On Fri, Aug 7, 2009 at 8:54 PM, wrote: >>> On Fri, Aug 7, 2009 at 6:57 PM, wrote: >>>> On Fri, Aug 7, 2009 at 6:13 PM, wrote: >>>>> On Fri, Aug 7, 2009 at 5:42 PM, wrote: >>>>>> On Fri, Aug 7, 2009 at 5:25 PM, Andrew Hawryluk wrote: >>>>>>> Hmm ... good point. >>>>>>> It appears to give a probability distribution proportional to x**(a-1), >>>>>>> but I see no good reason why the domain should be limited to [0,1]. >>>>>>> >>>>>>> def test(a): >>>>>>> ? ?nums = >>>>>>> plt.hist(np.random.power(a,100000),bins=100,ec='none',fc='#dddddd') >>>>>>> ? ?x = np.linspace(0,1,200) >>>>>>> ? ?plt.plot(x,nums[0][-1]*x**(a-1)) >>>>>>> >>>>>>> Andrew >>>>>>> >>>>>>> >>>>>>> >>>>>>>> -----Original Message----- >>>>>>>> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>>>>>>> bounces at scipy.org] On Behalf Of alan at ajackson.org >>>>>>>> Sent: 7 Aug 2009 2:49 PM >>>>>>>> To: Discussion of Numerical Python >>>>>>>> Subject: Re: [Numpy-discussion] Power distribution >>>>>>>> >>>>>>>> I don't think that is it, since the one in numpy has a range >>>>>>> restricted >>>>>>>> to the interval 0-1. >>>>>>>> >>>>>>>> Try out hist(np.random.power(5, 1000000), bins=100) >>>>>>>> >>>>>>>> >You might get better results for 'power-law distribution' >>>>>>>> >http://en.wikipedia.org/wiki/Power_law >>>>>>>> > >>>>>>>> >Andrew >>>>>>>> > >>>>>>>> >> -----Original Message----- >>>>>>>> >> From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion- >>>>>>>> >> bounces at scipy.org] On Behalf Of alan at ajackson.org >>>>>>>> >> Sent: 7 Aug 2009 11:45 AM >>>>>>>> >> To: Discussion of Numerical Python >>>>>>>> >> Subject: [Numpy-discussion] Power distribution >>>>>>>> >> >>>>>>>> >> Documenting my way through the statistics modules in numpy, I ran >>>>>>>> >> into the Power Distribution. >>>>>>>> >> >>>>>>>> >> Anyone know what that is? I Googled for it, and found a lot of >>>>>>> stuff >>>>>>>> >on >>>>>>>> >> electricity, but no reference for a statistical distribution of >>>>>>> that >>>>>>>> >> name. Does it have a common alias? >>>>>>>> >> >>>>>>>> >> -- >>>>>> >>>>>> >>>>>> same is in Travis' notes on the distribution and scipy.stats.distributions >>>>>> domain in [0,1], but I don't know anything about it either >>>>>> >>>>>> ## Power-function distribution >>>>>> ## ? Special case of beta dist. with d =1.0 >>>>>> >>>>>> class powerlaw_gen(rv_continuous): >>>>>> ? ?def _pdf(self, x, a): >>>>>> ? ? ? ?return a*x**(a-1.0) >>>>>> ? ?def _cdf(self, x, a): >>>>>> ? ? ? ?return x**(a*1.0) >>>>>> ? 
?def _ppf(self, q, a): >>>>>> ? ? ? ?return pow(q, 1.0/a) >>>>>> ? ?def _stats(self, a): >>>>>> ? ? ? ?return a/(a+1.0), a*(a+2.0)/(a+1.0)**2, \ >>>>>> ? ? ? ? ? ? ? 2*(1.0-a)*sqrt((a+2.0)/(a*(a+3.0))), \ >>>>>> ? ? ? ? ? ? ? 6*polyval([1,-1,-6,2],a)/(a*(a+3.0)*(a+4)) >>>>>> ? ?def _entropy(self, a): >>>>>> ? ? ? ?return 1 - 1.0/a - log(a) >>>>>> powerlaw = powerlaw_gen(a=0.0, b=1.0, name="powerlaw", >>>>>> ? ? ? ? ? ? ? ? ? ? ? ?longname="A power-function", >>>>>> ? ? ? ? ? ? ? ? ? ? ? ?shapes="a", extradoc=""" >>>>>> >>>>>> Power-function distribution >>>>>> >>>>>> powerlaw.pdf(x,a) = a*x**(a-1) >>>>>> for 0 <= x <= 1, a > 0. >>>>>> """ >>>>>> ? ? ? ? ? ? ? ? ? ? ? ?) >>>>>> >>>>> >>>>> >>>>> it looks like it's the same distribution, even though it doesn't use >>>>> the random numbers from the numpy function >>>>> >>>>> high p-values with Kolmogorov-Smirnov, see below >>>>> >>>>> I assume it is a truncated version of *a* powerlaw distribution, so >>>>> that a can be large, which would be impossible in the open domain >>>>> case. But a quick search, I only found powerlaw applications that >>>>> refer to the tail behavior. >>>>> >>>>> Josef >>>>> >>>>>>>> rvs = np.random.power(5, 100000) >>>>>>>> stats.kstest(rvs,'powerlaw',(5,)) >>>>> (0.0021079715221341555, 0.76587118275752697) >>>>>>>> rvs = np.random.power(5, 1000000) >>>>>>>> stats.kstest(rvs,'powerlaw',(5,)) >>>>> (0.00063983013407076239, 0.80757958281509501) >>>>>>>> rvs = np.random.power(0.5, 1000000) >>>>>>>> stats.kstest(rvs,'powerlaw',(0.5,)) >>>>> (0.00081823148457027539, 0.51478478398950211) >>>>> >>>> >>>> I found a short reference in Johnson, Kotz, Balakrishnan vol. 1 where >>>> it is refered to as the "power-function" distribution. >>>> roughly: if X is pareto (which kind) distributed, then Y=X**(-1) is >>>> distributed according to the power-function distribution. JKB have an >>>> extra parameter in there and is a bit more general then the scipy >>>> version, or maybe it is just the scale parameter included in the >>>> density function. >>>> >>>> It is also in NIST data plot, but I didn't find the html reference >>>> page, but only the pdf >>>> >>>> http://docs.google.com/gview?a=v&q=cache%3AEgQ6bRkeJl8J%3Awww.itl.nist.gov%2Fdiv898%2Fsoftware%2Fdataplot%2Frefman2%2Fauxillar%2Fpowpdf.pdf+power-function+distribution&hl=en&gl=ca&pli=1 >>>> >>>> the pdf-files for powpdf and powcdf ?are here >>>> http://www.itl.nist.gov/div898/software/dataplot/refman2/auxillar/homepage.htm >>>> >>>> >>>> I can look some more a bit later tonight. >>>> >>>> Josef >>>> >>> >>> >>> for the relationship to pareto, below are some kstests and graphs >>> a reminder that numpy.random.pareto uses a non-standard 0 bound, instead of 1 >>> ks tests don't show good numbers every once in a while, since they are random >>> >>> I checked the definitions in JKB (page 607) and my previous >>> interpretation was correct. 
>>> if X has a pareto distribution with lower bound at 1 and shape
>>> parameter a>0, then 1/X has a density function
>>> p(y) = a*y**(a-1),  (0 < y <= 1)
>>> weak inequality in JKB instead of strict as in scipy.stats.powerlaw docstring
>>> (the actual scipy.stats.powerlaw docstring has a typo, a**x**(a-1),
>>> which I will correct)
>>>
>>> Josef
>>>
>>>
>>> import numpy as np
>>> from scipy import stats
>>> import matplotlib.pyplot as plt
>>>
>>>
>>> rvs = np.random.power(5, 1000000)
>>> rvsp = np.random.pareto(5, 1000000)
>>> rvsps = stats.pareto.rvs(5, size=100)
>>>
>>> print "stats.kstest(1./rvsps,'powerlaw',(5,))"
>>> print stats.kstest(1./rvsps,'powerlaw',(5,))
>>>
>>> print "stats.kstest(1./(1+rvsp),'powerlaw',(5,))"
>>> print stats.kstest(1./(1+rvsp),'powerlaw',(5,))
>>>
>>> print "stats.kstest(rvs,'powerlaw',(5,))"
>>> print stats.kstest(rvs,'powerlaw',(5,))
>>>
>>> print "stats.ks_2samp(rvs,1./(rvsp+1))"
>>> print stats.ks_2samp(rvs,1./(rvsp+1))
>>> print "stats.ks_2samp(rvs,1./rvsps)"
>>> print stats.ks_2samp(rvs,1./rvsps)
>>> print "stats.ks_2samp(1+rvsp, rvsps)"
>>> print stats.ks_2samp(1+rvsp, rvsps)
>>
>>
>>Improvements to graphs, compare with theoretical pdf
>>
>>Josef
>>
>>xx = np.linspace(0,1,100)
>>powpdf = stats.powerlaw.pdf(xx,5)
>>
>>plt.figure()
>>plt.hist(rvs, bins=50, normed=True)
>>plt.plot(xx,powpdf,'r-')
>>plt.title('np.random.power(5)')
>>plt.figure()
>>plt.hist(1./(1.+rvsp), bins=50, normed=True)
>>plt.plot(xx,powpdf,'r-')
>>plt.title('inverse of 1 + np.random.pareto(5)')
>>plt.figure()
>>plt.hist(1./(1.+rvsp), bins=50, normed=True)
>>plt.plot(xx,powpdf,'r-')
>>plt.title('inverse of stats.pareto(5)')

Just a small correction of a copy and paste error, to have the correct
example in the thread:

The last graph is a duplicate and should instead use the scipy.stats
random numbers, rvsps, without the +1 correction, i.e.

plt.figure()
plt.hist(1./rvsps, bins=50, normed=True)
plt.plot(xx,powpdf,'r-')
plt.title('inverse of stats.pareto(5)')

Josef

>
> --
> -----------------------------------------------------------------------
> | Alan K. Jackson            | To see a World in a Grain of Sand      |
> | alan at ajackson.org          | And a Heaven in a Wild Flower,         |
> | www.ajackson.org           | Hold Infinity in the palm of your hand |
> | Houston, Texas             | And Eternity in an hour. - Blake       |
> -----------------------------------------------------------------------
>

From pfeldman at verizon.net Sat Aug 8 00:53:16 2009
From: pfeldman at verizon.net (Dr. Phillip M. Feldman)
Date: Fri, 7 Aug 2009 21:53:16 -0700 (PDT)
Subject: [Numpy-discussion] How to preserve number of array dimensions when taking a slice?
Message-ID: <24875133.post@talk.nabble.com>


I'd like to be able to make a slice of a 3-dimensional array, doing something
like the following:

Y= X[A, B, C]

where A, B, and C are lists of indices. This works, but has an unexpected
side-effect. When A, B, or C is a length-1 list, Y has fewer dimensions than
X. Is there a way to do the slice such that the number of dimensions is
preserved, i.e., I'd like Y to be a 3-dimensional array, even if one or more
dimensions is unity. Is there a way to do this?
--
View this message in context: http://www.nabble.com/How-to-preserve-number-of-array-dimensions-when-taking-a-slice--tp24875133p24875133.html
Sent from the Numpy-discussion mailing list archive at Nabble.com.
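A minimal sketch of the behavior being asked about (the array X and the
length-1 index lists below are invented for illustration, and np.ix_ is
only one possible workaround; the reply that follows confirms that index
lists trigger fancy indexing rather than slicing):

import numpy as np

X = np.arange(24).reshape(2, 3, 4)

# Three index lists are "fancy" indexing: they broadcast against each
# other and yield a 1-d result, so length-1 lists collapse dimensions.
Y = X[[0], [1], [2]]
# Y.shape == (1,) and Y[0] == X[0, 1, 2]

# Length-1 basic slices keep every dimension:
Z = X[0:1, 1:2, 2:3]
# Z.shape == (1, 1, 1)

# np.ix_ builds an open mesh from the index lists, so the result keeps
# one axis per list:
W = X[np.ix_([0], [1], [2])]
# W.shape == (1, 1, 1)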
From dwf at cs.toronto.edu Sat Aug 8 04:46:08 2009 From: dwf at cs.toronto.edu (David Warde-Farley) Date: Sat, 8 Aug 2009 04:46:08 -0400 Subject: [Numpy-discussion] How to preserve number of array dimensions when taking a slice? In-Reply-To: <24875133.post@talk.nabble.com> References: <24875133.post@talk.nabble.com> Message-ID: On 8-Aug-09, at 12:53 AM, Dr. Phillip M. Feldman wrote: > > I'd like to be able to make a slice of a 3-dimensional array, doing > something > like the following: > > Y= X[A, B, C] > > where A, B, and C are lists of indices. This works, but has an > unexpected > side-effect. When A, B, or C is a length-1 list, Y has fewer > dimensions than > X. Is there a way to do the slice such that the number of dimensions > is > preserved, i.e., I'd like Y to be a 3-dimensional array, even if one > or more > dimensions is unity. Is there a way to do this? Err, X[A, B, C] with A, B and C lists should always return a 1D array, I think. Lists of indices count as 'fancy indexing', not slicing. If using slices, you can specify slices that are only 1 long as in X[5:6, :, :] and retain the dimensionality. From peterjeremy at optushome.com.au Sat Aug 8 05:38:22 2009 From: peterjeremy at optushome.com.au (Peter Jeremy) Date: Sat, 8 Aug 2009 19:38:22 +1000 Subject: [Numpy-discussion] ATLAS, NumPy and Threading Message-ID: <20090808093822.GA88083@server.vk2pj.dyndns.org> [Apologies if anyone sees this twice - the first copy appears to have disappeared into a black hole] Should ATLAS be built with or without threading support for use with NumPy? The NumPy documentation just says that ATLAS will be used if found but gives no indication of how ATLAS should be built. I have found that system_info.py explicitly selects threaded versions of libatlas and liblapack on FreeBSD but have been unable to find any rationale behind this in the available SVN logs. -- Peter Jeremy -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 196 bytes Desc: not available URL: From david at ar.media.kyoto-u.ac.jp Sat Aug 8 05:28:50 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Sat, 08 Aug 2009 18:28:50 +0900 Subject: [Numpy-discussion] ATLAS, NumPy and Threading In-Reply-To: <20090808093822.GA88083@server.vk2pj.dyndns.org> References: <20090808093822.GA88083@server.vk2pj.dyndns.org> Message-ID: <4A7D4552.8040002@ar.media.kyoto-u.ac.jp> Peter Jeremy wrote: > [Apologies if anyone sees this twice - the first copy appears to have > disappeared into a black hole] > > Should ATLAS be built with or without threading support for use with > NumPy? The NumPy documentation just says that ATLAS will be used if > found but gives no indication of how ATLAS should be built. > Both threaded and non threaded should be usable, at least on unix and mac os x. I don't know about the situation on windows, though. 
The FreeBSD notice in system_info may be obsolete, cheers, David From emmanuelle.gouillart at normalesup.org Sat Aug 8 06:27:36 2009 From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart) Date: Sat, 8 Aug 2009 12:27:36 +0200 Subject: [Numpy-discussion] Power distribution In-Reply-To: <1cd32cbb0908072055t5f45be8ax7aa272eaed7defc5@mail.gmail.com> References: <48C01AE7354EC240A26F19CEB995E943033AF254@CHMAILMBX01.novachem.com> <20090807154906.223fe780@ajackson.org> <48C01AE7354EC240A26F19CEB995E943033AF256@CHMAILMBX01.novachem.com> <1cd32cbb0908071442va8eb05r785b9bde6819ab89@mail.gmail.com> <1cd32cbb0908071513l29fa7fbgd35cb7ba428642da@mail.gmail.com> <1cd32cbb0908071557l605d98aboc3dbf8ec47241077@mail.gmail.com> <1cd32cbb0908071754w5406ea83v89d78ecce1a81d7f@mail.gmail.com> <1cd32cbb0908071810p40ba9fa1hd08c935372bf65c3@mail.gmail.com> <20090807211738.6a7ce10c@ajackson.org> <1cd32cbb0908072055t5f45be8ax7aa272eaed7defc5@mail.gmail.com> Message-ID: <20090808102736.GB21264@phare.normalesup.org> On Fri, Aug 07, 2009 at 11:55:45PM -0400, josef.pktd at gmail.com wrote: > On Fri, Aug 7, 2009 at 10:17 PM, wrote: > > Thanks! That helps a lot. > Thanks for improving the docs. Many thanks for taking the time of finding out what this distribution really is, and improving the docs. I was also puzzled by this distribution, so I started making some tests and writing a docstring stub, but it is much better now! Emmanuelle From lukshuntim at gmail.com Sat Aug 8 08:38:06 2009 From: lukshuntim at gmail.com (lukshuntim at gmail.com) Date: Sat, 08 Aug 2009 20:38:06 +0800 Subject: [Numpy-discussion] Test failures r7300 Message-ID: <4A7D71AE.2050704@gmail.com> Hi, I got 16 test failures after building r7300 from svn on debian/sid/i386. Seems all related to complex linear algebra modules. Here's the error messages: Running unit tests for numpy NumPy version 1.4.0.dev7300 NumPy is installed in /var/opt/py/lib/python2.5/site-packages/numpy Python version 2.5.4 (r254:67916, Feb 17 2009, 20:16:45) [GCC 4.3.3] nose version 0.11.1 ... 
FAIL: test_cdouble (test_linalg.TestCond2) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 44, in test_cdouble self.do(a, b) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 114, in do old_assert_almost_equal(s[0]/s[-1], linalg.cond(a,2), decimal=5) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 421, in assert_almost_equal raise AssertionError(msg) AssertionError: Arrays are not almost equal ACTUAL: 9.4348091510413177 DESIRED: 22.757141876814547 ====================================================================== FAIL: test_csingle (test_linalg.TestCond2) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 39, in test_csingle self.do(a, b) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 114, in do old_assert_almost_equal(s[0]/s[-1], linalg.cond(a,2), decimal=5) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 421, in assert_almost_equal raise AssertionError(msg) AssertionError: Arrays are not almost equal ACTUAL: 9.4348097 DESIRED: 22.757143 ====================================================================== FAIL: test_cdouble (test_linalg.TestDet) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 44, in test_cdouble self.do(a, b) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 135, in do assert_almost_equal(d, multiply.reduce(ev)) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: (8.881784197e-16-4j) DESIRED: (5.28-11.04j) ====================================================================== FAIL: test_csingle (test_linalg.TestDet) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 39, in test_csingle self.do(a, b) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 135, in do assert_almost_equal(d, multiply.reduce(ev)) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: (8.881784197e-16-4j) DESIRED: (5.28-11.04j) ====================================================================== FAIL: test_cdouble (test_linalg.TestEig) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 44, in test_cdouble self.do(a, b) File 
"/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 94, in do assert_almost_equal(dot(a, evectors), multiply(evectors, evalues)) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: [[ 2.72530404+2.67511327j 1.91551601+1.28537403j] [ 5.95809316+4.79684551j 3.39598770+1.39546789j]] DESIRED: [[ 2.01388405+1.03693361j -1.39855180+1.88751398j] [ 1.78601662+0.01838201j -0.10378837-3.53101635j]] ====================================================================== FAIL: test_csingle (test_linalg.TestEig) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 39, in test_csingle self.do(a, b) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 94, in do assert_almost_equal(dot(a, evectors), multiply(evectors, evalues)) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: [[ 2.72530413+2.6751132j 1.91551602+1.28537405j] [ 5.95809317+4.79684544j 3.39598775+1.39546788j]] DESIRED: [[ 2.01388407+1.03693354j -1.39855182+1.887514j ] [ 1.78601658+0.01838197j -0.10378837-3.53101635j]] ====================================================================== FAIL: test_cdouble (test_linalg.TestEigh) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 221, in test_cdouble self.do(a) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 259, in do assert_almost_equal(ev, evalues) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: [-2.60555128 4.60555128] DESIRED: [-1.71080202-1.00413682j 3.01849433+1.46567528j] ====================================================================== FAIL: test_csingle (test_linalg.TestEigh) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 217, in test_csingle self.do(a) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 259, in do assert_almost_equal(ev, evalues) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: 
[-2.60555124 4.60555124] DESIRED: [-1.71080208-1.0041368j 3.01849437+1.46567523j] ====================================================================== FAIL: test_cdouble (test_linalg.TestEigvalsh) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 221, in test_cdouble self.do(a) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 249, in do assert_almost_equal(ev, evalues) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: [-2.60555128+0.j 4.60555128+0.j] DESIRED: [-1.71080202-1.00413682j 3.01849433+1.46567528j] ====================================================================== FAIL: test_csingle (test_linalg.TestEigvalsh) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 217, in test_csingle self.do(a) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 249, in do assert_almost_equal(ev, evalues) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: [-2.60555124+0.j 4.60555124+0.j] DESIRED: [-1.71080208-1.0041368j 3.01849437+1.46567523j] ====================================================================== FAIL: test_cdouble (test_linalg.TestLstsq) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 44, in test_cdouble self.do(a, b) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 141, in do assert_almost_equal(b, dot(a, x)) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: [ 2.+1.j 1.+2.j] DESIRED: [ 0.95920929+0.98311952j 1.23494444+0.67346351j] ====================================================================== FAIL: test_csingle (test_linalg.TestLstsq) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 39, in test_csingle self.do(a, b) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 141, in do assert_almost_equal(b, dot(a, x)) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File 
"/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: [ 2.+1.j 1.+2.j] DESIRED: [ 0.95920926+0.98311943j 1.23494434+0.67346334j] ====================================================================== FAIL: test_cdouble (test_linalg.TestPinv) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 44, in test_cdouble self.do(a, b) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 124, in do assert_almost_equal(dot(a, a_ginv), identity(asarray(a).shape[0])) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: [[ 0.29169056-0.07799046j 0.17767375-0.01332484j] [ 0.04125021-0.38255608j 0.73402869+0.62377356j]] DESIRED: [[ 1. 0.] [ 0. 1.]] ====================================================================== FAIL: test_csingle (test_linalg.TestPinv) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 39, in test_csingle self.do(a, b) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 124, in do assert_almost_equal(dot(a, a_ginv), identity(asarray(a).shape[0])) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: [[ 0.29169053-0.07799049j 0.17767370-0.0133248j ] [ 0.04125014-0.38255614j 0.73402858+0.62377363j]] DESIRED: [[ 1. 0.] [ 0. 
1.]] ====================================================================== FAIL: test_cdouble (test_linalg.TestSVD) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 44, in test_cdouble self.do(a, b) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 100, in do assert_almost_equal(a, dot(multiply(u, s), vt)) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: [[ 1.+2.j 2.+3.j] [ 3.+4.j 4.+5.j]] DESIRED: [[ 1.00000000+2.j 2.36670415+2.98574489j] [ 3.00000000+4.j 2.80882652+6.25521741j]] ====================================================================== FAIL: test_csingle (test_linalg.TestSVD) ---------------------------------------------------------------------- Traceback (most recent call last): File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 39, in test_csingle self.do(a, b) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 100, in do assert_almost_equal(a, dot(multiply(u, s), vt)) File "/var/opt/py/lib/python2.5/site-packages/numpy/linalg/tests/test_linalg.py", line 23, in assert_almost_equal old_assert_almost_equal(a, b, decimal=decimal, **kw) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 400, in assert_almost_equal "DESIRED: %s\n" % (str(actual), str(desired))) AssertionError: Items are not equal: ACTUAL: [[ 1.+2.j 2.+3.j] [ 3.+4.j 4.+5.j]] DESIRED: [[ 0.99999994+2.j 2.36670423+2.98574495j] [ 3.00000000+4.j 2.80882668+6.25521755j]] ---------------------------------------------------------------------- Ran 2186 tests in 10.594s FAILED (KNOWNFAIL=1, SKIP=11, failures=16) Regards, ST -- From cournape at gmail.com Sat Aug 8 08:59:08 2009 From: cournape at gmail.com (David Cournapeau) Date: Sat, 8 Aug 2009 21:59:08 +0900 Subject: [Numpy-discussion] Test failures r7300 In-Reply-To: <4A7D71AE.2050704@gmail.com> References: <4A7D71AE.2050704@gmail.com> Message-ID: <5b8d13220908080559j4cf98394qc4ee89d21c16300e@mail.gmail.com> On Sat, Aug 8, 2009 at 9:38 PM, wrote: > Hi, > > I got 16 test failures after building r7300 from svn on debian/sid/i386. > Seems all related to complex linear algebra modules. Are you using atlas ? (numpy.show_config() output) If so, did you compile it by yourself ? Did you compile everything with gfortran (do you have g77 installed). Problems related to complex are almost always caused by fortran compilers mismatch, cheers, David From lukshuntim at gmail.com Sat Aug 8 09:33:02 2009 From: lukshuntim at gmail.com (lukshuntim at gmail.com) Date: Sat, 08 Aug 2009 21:33:02 +0800 Subject: [Numpy-discussion] Test failures r7300 In-Reply-To: <5b8d13220908080559j4cf98394qc4ee89d21c16300e@mail.gmail.com> References: <4A7D71AE.2050704@gmail.com> <5b8d13220908080559j4cf98394qc4ee89d21c16300e@mail.gmail.com> Message-ID: <4A7D7E8E.8010902@gmail.com> David Cournapeau wrote: > On Sat, Aug 8, 2009 at 9:38 PM, wrote: >> Hi, >> >> I got 16 test failures after building r7300 from svn on debian/sid/i386. >> Seems all related to complex linear algebra modules. > > Are you using atlas ? 
(numpy.show_config() output) Yes, it's libatlas-sse2 3.6.0-24 debian/sid package. In [3]: numpy.show_config() atlas_threads_info: NOT AVAILABLE blas_opt_info: libraries = ['f77blas', 'cblas', 'atlas'] library_dirs = ['/usr/lib/sse2'] define_macros = [('ATLAS_INFO', '"\\"3.6.0\\""')] language = c atlas_blas_threads_info: NOT AVAILABLE lapack_opt_info: libraries = ['lapack', 'f77blas', 'cblas', 'atlas'] library_dirs = ['/usr/lib/sse2/atlas', '/usr/lib/sse2'] define_macros = [('ATLAS_INFO', '"\\"3.6.0\\""')] language = f77 atlas_info: libraries = ['lapack', 'f77blas', 'cblas', 'atlas'] library_dirs = ['/usr/lib/sse2/atlas', '/usr/lib/sse2'] language = f77 lapack_mkl_info: NOT AVAILABLE blas_mkl_info: NOT AVAILABLE atlas_blas_info: libraries = ['f77blas', 'cblas', 'atlas'] library_dirs = ['/usr/lib/sse2'] language = c mkl_info: NOT AVAILABLE I've set these in my site.cfg [DEFAULT] library_dirs = /usr/lib/sse2 [blas_opt] libraries = f77blas, cblas, atlas [lapack_opt] libraries = lapack_atlas, f77blas, cblas, atlas but it seems not to pick up the liblapack_atlas.so from the debian atlas package. > > If so, did you compile it by yourself ? Did you compile everything > with gfortran (do you have g77 installed). Yes, and I don't have g77 anymore. > > Problems related to complex are almost always caused by fortran > compilers mismatch, Does the numpy.show_config() show that the debian atlas libaries are compiled with f77? Running ldd /usr/lib/sse2/libatlas.so shows linux-gate.so.1 => (0xb7f97000) libgfortran.so.3 => /usr/lib/libgfortran.so.3 (0xb7914000) libm.so.6 => /lib/i686/cmov/libm.so.6 (0xb78ee000) libgcc_s.so.1 => /lib/libgcc_s.so.1 (0xb78c2000) libc.so.6 => /lib/i686/cmov/libc.so.6 (0xb7763000) /lib/ld-linux.so.2 (0xb7f98000) > > cheers, > > David Thanks very much for the help, ST -- From cournape at gmail.com Sat Aug 8 09:45:56 2009 From: cournape at gmail.com (David Cournapeau) Date: Sat, 8 Aug 2009 22:45:56 +0900 Subject: [Numpy-discussion] Test failures r7300 In-Reply-To: <4A7D7E8E.8010902@gmail.com> References: <4A7D71AE.2050704@gmail.com> <5b8d13220908080559j4cf98394qc4ee89d21c16300e@mail.gmail.com> <4A7D7E8E.8010902@gmail.com> Message-ID: <5b8d13220908080645o6c22a081xa36b5ff8d97abefa@mail.gmail.com> On Sat, Aug 8, 2009 at 10:33 PM, wrote: > David Cournapeau wrote: >> On Sat, Aug 8, 2009 at 9:38 PM, wrote: >>> Hi, >>> >>> I got 16 test failures after building r7300 from svn on debian/sid/i386. >>> Seems all related to complex linear algebra modules. >> >> Are you using atlas ? (numpy.show_config() output) > > Yes, it's libatlas-sse2 3.6.0-24 debian/sid package. I wonder if debian atlas package has the same problem as on recent Ubuntu. > [DEFAULT] > library_dirs = /usr/lib/sse2 > [blas_opt] > libraries = f77blas, cblas, atlas > [lapack_opt] > libraries = lapack_atlas, f77blas, cblas, atlas > > but it seems not to pick up the liblapack_atlas.so from the debian atlas > package. there is no need to do this I think: it should work out of the box without any site.cfg (the point is that it would make it easier to change which atlas is loaded at runtime for further debugging of the issue). > > Does the numpy.show_config() show that the debian atlas libaries are > compiled with f77? No, the f77 refers to the fortran 77 dialect, not the compiler. What I would try is first install libatlas-base (or whatever it is called on sid), i.e. the non sse version, and compare test output with both sse2/nosse (e.g. 
using LD_LIBRARY_PATH to point to /usr/lib so that the nosse is
loaded, you can check using ldd which one is loaded by ld).

David

From pgmdevlist at gmail.com Sat Aug 8 12:33:11 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Sat, 8 Aug 2009 12:33:11 -0400
Subject: [Numpy-discussion] Merging datetime, Yes or No
In-Reply-To: 
References: 
Message-ID: <747A6551-3A4E-4985-9382-EA32A05AA0D9@gmail.com>

On Aug 7, 2009, at 11:23 PM, Charles R Harris wrote:

> I ask again,
>
> Datetime is getting really stale and hasn't been touched recently.
> Do the datetime folks want it merged or not, because it's getting to
> be a bit of work.

Chuck,
Please check directly w/ Travis O. (and Robert ?), the only
contributor(s) so far to this branch. Marty Fuhry, our GSoC student
working on the same topic, is now trying to integrate his routines
into the sources, and it'd be best if we had some up-to-date sources...
P.

From charlesr.harris at gmail.com Sat Aug 8 13:12:28 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 8 Aug 2009 11:12:28 -0600
Subject: [Numpy-discussion] Merging datetime, Yes or No
In-Reply-To: <747A6551-3A4E-4985-9382-EA32A05AA0D9@gmail.com>
References: <747A6551-3A4E-4985-9382-EA32A05AA0D9@gmail.com>
Message-ID: 

On Sat, Aug 8, 2009 at 10:33 AM, Pierre GM wrote:

>
> On Aug 7, 2009, at 11:23 PM, Charles R Harris wrote:
>
> > I ask again,
> >
> > Datetime is getting really stale and hasn't been touched recently.
> > Do the datetime folks want it merged or not, because it's getting to
> > be a bit of work.
>
> Chuck,
> Please check directly w/ Travis O. (and Robert ?), the only
> contributor(s) so far to this branch. Marty Fuhry, our GSoC student
> working on the same topic, is now trying to integrate his routines
> into the sources, and it'd be best if we had some up-to-date sources...
> P.
>

I've been waiting for some sort of nod from that direction. I actually
see two parts here: a straightforward part involving the datetime and
timedelta types, and a more complicated bit involving all the units
and such. I think the latter needs to be looked at along with Darren's
work for adding units to decide if it is the best approach. And I
wonder a bit if a subclass using Darren's stuff wouldn't be the way to
go seeing as how the two new types are basically npy_int64. Anyway, I
think I'll just stop worrying about it and go ahead with updates in
the trunk. Merging datetime might be a bit of work but I'll just leave
that to those involved ;)

Chuck

From oliphant at enthought.com Sat Aug 8 14:23:14 2009
From: oliphant at enthought.com (Travis Oliphant)
Date: Sat, 8 Aug 2009 12:23:14 -0600
Subject: [Numpy-discussion] Merging datetime, Yes or No
In-Reply-To: 
References: 
Message-ID: <2B8A03BE-E169-449D-8906-4062430B1CBA@enthought.com>

You are welcome to merge it but I fear it is not stable enough. I'd
like to spend more time with it first.

-Travis

--
(mobile phone of)
Travis Oliphant
Enthought, Inc.
1-512-536-1057
http://www.enthought.com

On Aug 7, 2009, at 9:23 PM, Charles R Harris wrote:

> I ask again,
>
> Datetime is getting really stale and hasn't been touched recently.
> Do the datetime folks want it merged or not, because it's getting to
> be a bit of work.
> > Chuck > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From kuiper at jpl.nasa.gov Sat Aug 8 22:33:22 2009 From: kuiper at jpl.nasa.gov (Tom Kuiper) Date: Sat, 8 Aug 2009 19:33:22 -0700 Subject: [Numpy-discussion] memmap, write through and flush Message-ID: <4A7E3572.9000909@jpl.nasa.gov> There is something curious here. The second flush() fails. Can anyone explain this? Tom ------------------- code snippet ------------------------ ... # create a memmap with dtype and shape that matches the data fp = np.memmap(filename, dtype='float32', mode='w+', shape=(3,4)) print "Initial memory mapped array (mode 'w+'):\n",fp # write data to memmap array fp[:] = data[:] fp.flush() # append a row to the array fp = np.append(fp, [[12,13,14,15]], 0) print "Filled memory mapped array:\n",fp fp.flush() ... ----------------- output ----------------------- Filled memory mapped array: [[ 0. 1. 2. 3.] [ 4. 5. 6. 7.] [ 8. 9. 10. 11.] [ 12. 13. 14. 15.]] --------------------------------------------------------------------------- AttributeError Traceback (most recent call last) ..../memmap_test.py in () 21 fp = np.append(fp, [[12,13,14,15]], 0) 22 print "Filled memory mapped array:\n",fp ---> 23 fp.flush() 24 25 # deletion flushes memory changes to disk before removing the object AttributeError: 'numpy.ndarray' object has no attribute 'flush' WARNING: Failure executing file: From tjhnson at gmail.com Sat Aug 8 23:46:49 2009 From: tjhnson at gmail.com (T J) Date: Sat, 8 Aug 2009 20:46:49 -0700 Subject: [Numpy-discussion] Specifying Index Programmatically Message-ID: I have an array, and I need to index it like so: z[...,x,:] How can I write code which will index z, as above, when x is not known ahead of time. For that matter, the particular dimension I am querying is not known either. In case this is still confusing, I am looking for the NumPy way to do: z[("...",5,":")] z[(":", 3, ":", 5, "...")] z[(1, "...", 5)] Basically, I want to be able to pre-construct what should appear inside the []. The numbers are no problem, but I'm having trouble with the ellipsis and colon. From nmb at wartburg.edu Sat Aug 8 23:54:35 2009 From: nmb at wartburg.edu (Neil Martinsen-Burrell) Date: Sat, 08 Aug 2009 22:54:35 -0500 Subject: [Numpy-discussion] Specifying Index Programmatically In-Reply-To: References: Message-ID: <4A7E487B.70906@wartburg.edu> On 2009-08-08 22:46 , T J wrote: > I have an array, and I need to index it like so: > > z[...,x,:] > > How can I write code which will index z, as above, when x is not known > ahead of time. For that matter, the particular dimension I am querying > is not known either. In case this is still confusing, I am looking > for the NumPy way to do: > > z[("...",5,":")] > > z[(":", 3, ":", 5, "...")] > > z[(1, "...", 5)] > > Basically, I want to be able to pre-construct what should appear > inside the []. The numbers are no problem, but I'm having trouble > with the ellipsis and colon. The ellipsis is a built-in python constant called Ellipsis. The colon is a slice object, again a python built-in, called with None as an argument. So, z[...,2,:] == z[Ellipsis,2,slice(None)]. -Neil From tjhnson at gmail.com Sat Aug 8 23:59:13 2009 From: tjhnson at gmail.com (T J) Date: Sat, 8 Aug 2009 20:59:13 -0700 Subject: [Numpy-discussion] reduce function of vectorize doesn't respect dtype? 
In-Reply-To: 
References: 
Message-ID: 

On Fri, Aug 7, 2009 at 11:54 AM, T J wrote:
> The reduce function of ufunc of a vectorized function doesn't seem to
> respect the dtype.
>
>>>> def a(x,y): return x+y
>>>> b = vectorize(a)
>>>> c = array([1,2])
>>>> b(c, c)  # use once to populate b.ufunc
>>>> d = b.ufunc.reduce(c)
>>>> c.dtype, type(d)
> dtype('int32'), <type 'int'>
>
>>>> c = array([[1,2,3],[4,5,6]])
>>>> b.ufunc.reduce(c)
> array([5, 7, 9], dtype=object)
>

So is this a bug? Or am I doing something wrong? In the second example....

>>> d = b.ufunc.reduce(c)
>>> type(d[0])
<type 'int'>
>>> d.dtype
dtype('object')

From tjhnson at gmail.com Sun Aug 9 00:02:43 2009
From: tjhnson at gmail.com (T J)
Date: Sat, 8 Aug 2009 21:02:43 -0700
Subject: [Numpy-discussion] Specifying Index Programmatically
In-Reply-To: <4A7E487B.70906@wartburg.edu>
References: <4A7E487B.70906@wartburg.edu>
Message-ID: 

On Sat, Aug 8, 2009 at 8:54 PM, Neil Martinsen-Burrell wrote:
>
> The ellipsis is a built-in python constant called Ellipsis. The colon
> is a slice object, again a python built-in, called with None as an
> argument. So, z[...,2,:] == z[Ellipsis,2,slice(None)].
>

Very helpful! Thank you. I didn't run into this information in any of
the indexing tutorials I ran through. If I get some time, I'll try to
add it.

From tjhnson at gmail.com Sun Aug 9 00:36:07 2009
From: tjhnson at gmail.com (T J)
Date: Sat, 8 Aug 2009 21:36:07 -0700
Subject: [Numpy-discussion] Indexing with a list...
Message-ID: 

>>> z = array([1,2,3,4])
>>> z[[1]]
array([2])
>>> z[(1,)]
2

I'm just curious: What is the motivation for this differing behavior?
Is it a necessary consequence of, for example, the following:

>>> z[z<3]
array([1, 2])

From dwf at cs.toronto.edu Sun Aug 9 01:09:25 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Sun, 9 Aug 2009 01:09:25 -0400
Subject: [Numpy-discussion] Indexing with a list...
In-Reply-To: 
References: 
Message-ID: <5163945D-E393-438B-95DC-2EE43A8C338D@cs.toronto.edu>

On 9-Aug-09, at 12:36 AM, T J wrote:

>>>> z = array([1,2,3,4])
>>>> z[[1]]
> array([2])
>>>> z[(1,)]
> 2
>
> I'm just curious: What is the motivation for this differing behavior?

When you address an element in a 2D array with, say, a[2,3], you are
actually indexing it with a tuple object (2,3). The 'comma' operator
in Python creates a tuple, irrespective of whether you use parens or
not. e.g.

In [192]: z = {}
In [193]: z[2,3] = 5
In [194]: z
Out[194]: {(2, 3): 5}

In the special case of scalar indices they're treated as if they are
length-1 tuples. The behaviour you're seeing is the same as z[1].

David

From tjhnson at gmail.com Sun Aug 9 02:38:01 2009
From: tjhnson at gmail.com (T J)
Date: Sat, 8 Aug 2009 23:38:01 -0700
Subject: [Numpy-discussion] Indexing with a list...
In-Reply-To: <5163945D-E393-438B-95DC-2EE43A8C338D@cs.toronto.edu>
References: <5163945D-E393-438B-95DC-2EE43A8C338D@cs.toronto.edu>
Message-ID: 

On Sat, Aug 8, 2009 at 10:09 PM, David Warde-Farley wrote:
> On 9-Aug-09, at 12:36 AM, T J wrote:
>
>>>>> z = array([1,2,3,4])
>>>>> z[[1]]
>> array([2])
>>>>> z[(1,)]
>> 2
>>
> In the special case of scalar indices they're treated as if they are
> length-1 tuples. The behaviour you're seeing is the same as z[1].
>

Sure, but that wasn't my question.

I was asking about the difference between indexing with a 1-tuple (or
scalar) and with a 1-list. Naively, I guess I didn't expect there to
be a difference. Though, I can see its uses (through the z[z<3]
example).
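To make the z[z<3] connection concrete (reusing the z from above):

>>> z[z<3]
array([1, 2])
>>> z[[0, 1]]   # a list of the positions where z<3 is True: same result
array([1, 2])

whereas z[(0, 1)] raises an IndexError here, since a tuple supplies
one index per axis and z only has one axis.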
The dictionary example is nice in that it really highlights exactly
*how* different arrays are from python dictionaries (aside from the
obvious): since lists are unhashable, you can't index with them at
all. Yet you can index numpy arrays with lists AND the behavior is
different from if you indexed with a tuple!

>>> z = array([1,2,3])
>>> i = [2]
>>> type(z[i])
<type 'numpy.ndarray'>
>>> type(z[tuple(i)])
<type 'numpy.int32'>

From dwf at cs.toronto.edu Sun Aug 9 03:53:43 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Sun, 9 Aug 2009 03:53:43 -0400
Subject: [Numpy-discussion] Indexing with a list...
In-Reply-To: 
References: <5163945D-E393-438B-95DC-2EE43A8C338D@cs.toronto.edu>
Message-ID: 

On 9-Aug-09, at 2:38 AM, T J wrote:

> Sure, but that wasn't my question.
>
> I was asking about the difference between indexing with a 1-tuple (or
> scalar) and with a 1-list. Naively, I guess I didn't expect there to
> be a difference. Though, I can see its uses (through the z[z<3]
> example).

Ah. I didn't see the relevance of z[z<3], but now I do. z < 3 produces
a boolean *array*, and you're right that arrays and lists are treated
the same. Single element lists and single element tuples are treated
differently because tuples and lists are treated differently in
general; if the behaviour of list indices changed when they were
length 1, you'd have all kinds of corner cases to check for and handle
in cases where you don't know a priori the length of your index list.

Since you can also have a tuple containing lists/arrays, that will
pull out the elements on each axis in a shape-preserving way. And you
can mix and match lists/arrays and slices, i.e. A[:,[4,1,5],[6,9,7]].

David

From fperez.net at gmail.com Sun Aug 9 05:12:25 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Sun, 9 Aug 2009 02:12:25 -0700
Subject: [Numpy-discussion] Sanity checklist for those attending the
	SciPy'09 tutorials
Message-ID: 

Hi all,

[ sorry for spamming the list, but even though I sent this to all the
email addresses I have on file for tutorial attendees, I know I am
missing a few, so I hope they see this message. ]

In order to make your experience at the scipy tutorials as smooth as
possible, we strongly recommend that you take a little time to install
the necessary tools in advance.

For both introductory and advanced tutorials:

http://conference.scipy.org/intro_tutorial
http://conference.scipy.org/advanced_tutorials

you will find instructions on what to install and where to download it
from. In addition (this is also mentioned on those pages), we
encourage you to run, according to your tutorial of choice, a little
checklist script:

https://cirl.berkeley.edu/fperez/tmp/intro_tut_checklist.py
https://cirl.berkeley.edu/fperez/tmp/adv_tut_checklist.py

This will try to spot any problems early, and we'll do our best to
help you with them before you arrive at the conference.

Best regards,

Dave Peterson and Fernando Perez.

ps - for those of you who may find fixes for the checklist scripts,
the sources are hosted on github:

http://github.com/fperez/scipytut/

From ezindy at gmail.com Sun Aug 9 06:17:38 2009
From: ezindy at gmail.com (Egor Zindy)
Date: Sun, 9 Aug 2009 11:17:38 +0100
Subject: [Numpy-discussion] SWIG, numpy.i and errno: comments?
Message-ID: 

Hello list,

this is my attempt at generating python exceptions in SWIG/C using
the errno mechanism:
http://www.scipy.org/Cookbook/SWIG_NumPy_examples#head-10f49a0f5ea6b313127d2ec5ffa1eaf1c133cb22

Used together with numpy.i, this has been useful for notifying (in a
pythonic way) memory allocation errors or array index problems.
A change in the errno global variable is detected in the %exception part of the SWIG interface file, and Python exceptions are generated after $action depending on the errno error code value. %exception { errno = 0; $action if (errno != 0) { switch(errno) { case EPERM: PyErr_Format(PyExc_IndexError, "Index out of range"); break; case ENOMEM: PyErr_Format(PyExc_MemoryError, "Failed malloc()"); break; default: PyErr_Format(PyExc_Exception, "Unknown exception"); } return NULL; } } If there's a better way of doing this, I'll update the cookbook recipe. Regards, Egor From josef.pktd at gmail.com Sun Aug 9 07:34:59 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sun, 9 Aug 2009 07:34:59 -0400 Subject: [Numpy-discussion] [SciPy-User] Sanity checklist for those attending the SciPy'09 tutorials In-Reply-To: References: Message-ID: <1cd32cbb0908090434m2fe57da1xbadc7e1657b19694@mail.gmail.com> On Sun, Aug 9, 2009 at 5:12 AM, Fernando Perez wrote: > Hi all, > > [ sorry for spamming the list, but even though I sent this to all the > email addresses I have on file for tutorial attendees, I know I am > missing a few, so I hope they see this message. ] > > In order to make your experience at the scipy tutorials as smooth as > possible, we strongly recommend that you take a little time to install > the necessary tools in advance. > > For both introductory and advanced ?tutorials: > > http://conference.scipy.org/intro_tutorial > http://conference.scipy.org/advanced_tutorials > > you will find instructions on what to install and where to download it > from. ?In addition (this is also mentioned on those pages), we > encourage you to run according to your tutorial of choice, a little > checklist script: > > https://cirl.berkeley.edu/fperez/tmp/intro_tut_checklist.py > https://cirl.berkeley.edu/fperez/tmp/adv_tut_checklist.py > > This will try to ?spot any problems early, and we'll do our best ?to > help you with them before you arrive to the conference. > > Best regards, > > Dave Peterson and Fernando Perez. > > ps - for those of you who may find fixes for the checklist scripts, > the sources are hosted on github: > > http://github.com/fperez/scipytut/ > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > is "Availability: recent flavors of Unix. 
" required (python 2.5 help for os.uname) Josef ================== System information ================== os.name : nt os.uname : Traceback (most recent call last): File "C:\Josef\work-oth\adv_tut_checklist.py", line 317, in main() File "C:\Josef\work-oth\adv_tut_checklist.py", line 312, in main sys_info() File "C:\Josef\work-oth\adv_tut_checklist.py", line 43, in sys_info print 'os.uname :',os.uname() AttributeError: 'module' object has no attribute 'uname' From lukshuntim at gmail.com Sun Aug 9 08:53:55 2009 From: lukshuntim at gmail.com (lukshuntim at gmail.com) Date: Sun, 09 Aug 2009 20:53:55 +0800 Subject: [Numpy-discussion] Test failures r7300 In-Reply-To: <5b8d13220908080645o6c22a081xa36b5ff8d97abefa@mail.gmail.com> References: <4A7D71AE.2050704@gmail.com> <5b8d13220908080559j4cf98394qc4ee89d21c16300e@mail.gmail.com> <4A7D7E8E.8010902@gmail.com> <5b8d13220908080645o6c22a081xa36b5ff8d97abefa@mail.gmail.com> Message-ID: <4A7EC6E3.5050706@gmail.com> David Cournapeau wrote: > On Sat, Aug 8, 2009 at 10:33 PM, wrote: >> David Cournapeau wrote: >>> On Sat, Aug 8, 2009 at 9:38 PM, wrote: >>>> Hi, >>>> >>>> I got 16 test failures after building r7300 from svn on debian/sid/i386. >>>> Seems all related to complex linear algebra modules. >>> Are you using atlas ? (numpy.show_config() output) >> Yes, it's libatlas-sse2 3.6.0-24 debian/sid package. > > I wonder if debian atlas package has the same problem as on recent Ubuntu. [snipped] > What I would try is first install libatlas-base (or whatever it is > called on sid), i.e. the non sse version, and compare test output with > both sse2/nosse (e.g. using LD_LIBRARY_PATH to point to /usr/lib so > that the nosse is loaded, you can check using ldd which one is loaded > by ld). Just to clarify, you mean doing a "ldd lapack_lite.so" to check which blas and lapack is used at runtime. Right? I also removed site.cfg when building. With no atlas, and with both libatlas-base and libatlas-sse, the complex linear algebra errors went away and I got only 1 error: FAIL: Test bug in reduceat with structured arrays copied for speed. ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib/pymodules/python2.5/nose/case.py", line 183, in runTest self.test(*self.arg) File "/var/opt/py/lib/python2.5/site-packages/numpy/core/tests/test_umath.py", line 818, in test_reduceat assert_array_almost_equal(h1, h2) File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 726, in assert_array_almost_equal header='Arrays are not almost equal') File "/var/opt/py/lib/python2.5/site-packages/numpy/testing/utils.py", line 571, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not almost equal (mismatch 100.0%) x: array([ -5.17543592e+11, -5.17543592e+11, -5.17543592e+11, -5.17543592e+11], dtype=float32) y: array([ 700., 800., 1000., 7500.], dtype=float32) So it appears that it's the sse2 variant that is causing the problem. Regards, ST -- From wfspotz at sandia.gov Sun Aug 9 09:30:59 2009 From: wfspotz at sandia.gov (Bill Spotz) Date: Sun, 9 Aug 2009 09:30:59 -0400 Subject: [Numpy-discussion] SWIG, numpy.i and errno: comments? In-Reply-To: References: Message-ID: Egor, This looks about right. However, it is customary to invoke the SWIG macro "SWIG_fail;" instead of "break;". (This translates into a "goto" to the failure label, and is better in case there is any other cleanup code to execute.) 
On Aug 9, 2009, at 6:17 AM, Egor Zindy wrote: > Hello list, > > this is my attempt at generating python exceptions in SWIG/C using the > errno mechanism: > http://www.scipy.org/Cookbook/SWIG_NumPy_examples#head-10f49a0f5ea6b313127d2ec5ffa1eaf1c133cb22 > > Used together with numpy.i, this has been useful for notifying (in a > pythonic way) memory allocation errors or array index problems. > > A change in the errno global variable is detected in the %exception > part of the SWIG interface file, and Python exceptions are generated > after $action depending on the errno error code value. > > %exception > { > errno = 0; > $action > > if (errno != 0) > { > switch(errno) > { > case EPERM: > PyErr_Format(PyExc_IndexError, "Index out of range"); > break; > case ENOMEM: > PyErr_Format(PyExc_MemoryError, "Failed malloc()"); > break; > default: > PyErr_Format(PyExc_Exception, "Unknown exception"); > } > return NULL; > } > } > > If there's a better way of doing this, I'll update the cookbook > recipe. > > Regards, > Egor ** Bill Spotz ** ** Sandia National Laboratories Voice: (505)845-0170 ** ** P.O. Box 5800 Fax: (505)284-0154 ** ** Albuquerque, NM 87185-0370 Email: wfspotz at sandia.gov ** From aisaac at american.edu Sun Aug 9 09:41:56 2009 From: aisaac at american.edu (Alan G Isaac) Date: Sun, 09 Aug 2009 09:41:56 -0400 Subject: [Numpy-discussion] Indexing with a list... In-Reply-To: References: <5163945D-E393-438B-95DC-2EE43A8C338D@cs.toronto.edu> Message-ID: <4A7ED224.7040308@american.edu> Fancy indexing is discussed in detail in the Guide to NumPy. http://www.tramy.us/guidetoscipy.html Alan Isaac From ezindy at gmail.com Sun Aug 9 12:50:52 2009 From: ezindy at gmail.com (Egor Zindy) Date: Sun, 9 Aug 2009 17:50:52 +0100 Subject: [Numpy-discussion] SWIG, numpy.i and errno: comments? In-Reply-To: References: Message-ID: Bill, thank you for your comment. Would this do instead? (replacing the return NULL with SWIG_fail): %exception { errno = 0; $action if (errno != 0) { switch(errno) { case EPERM: PyErr_Format(PyExc_IndexError, "Index out of range"); break; case ENOMEM: PyErr_Format(PyExc_MemoryError, "failed malloc()"); break; default: PyErr_Format(PyExc_Exception, "Unknown exception"); } SWIG_fail; } } Cheers, Egor On Sun, Aug 9, 2009 at 2:30 PM, Bill Spotz wrote: > Egor, > > This looks about right. ?However, it is customary to invoke the SWIG macro > "SWIG_fail;" instead of "break;". ?(This translates into a "goto" to the > failure label, and is better in case there is any other cleanup code to > execute.) > > On Aug 9, 2009, at 6:17 AM, Egor Zindy wrote: > >> Hello list, >> >> this is my attempt at generating python exceptions in SWIG/C using the >> errno mechanism: >> >> http://www.scipy.org/Cookbook/SWIG_NumPy_examples#head-10f49a0f5ea6b313127d2ec5ffa1eaf1c133cb22 >> >> Used together with numpy.i, this has been useful for notifying (in a >> pythonic way) memory allocation errors or array index problems. >> >> A change in the errno global variable is detected in the %exception >> part of the SWIG interface file, and Python exceptions are generated >> after $action depending on the errno error code value. >> >> %exception >> { >> ? errno = 0; >> ? $action >> >> ? if (errno != 0) >> ? { >> ? ? ? switch(errno) >> ? ? ? { >> ? ? ? ? ? case EPERM: >> ? ? ? ? ? ? ? PyErr_Format(PyExc_IndexError, "Index out of range"); >> ? ? ? ? ? ? ? break; >> ? ? ? ? ? case ENOMEM: >> ? ? ? ? ? ? ? PyErr_Format(PyExc_MemoryError, "Failed malloc()"); >> ? ? ? ? ? ? ? break; >> ? ? ? ? ? default: >> ? ? ? ? ? ? ? 
PyErr_Format(PyExc_Exception, "Unknown exception");
>>       }
>>       return NULL;
>>   }
>> }
>>
>> If there's a better way of doing this, I'll update the cookbook
>> recipe.
>>
>> Regards,
>> Egor
>
> ** Bill Spotz                                              **
> ** Sandia National Laboratories  Voice: (505)845-0170      **
> ** P.O. Box 5800                 Fax:   (505)284-0154      **
> ** Albuquerque, NM 87185-0370    Email: wfspotz at sandia.gov **
>
>

From fperez.net at gmail.com Sun Aug 9 13:46:49 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Sun, 9 Aug 2009 10:46:49 -0700
Subject: [Numpy-discussion] [SciPy-User] Sanity checklist for those
	attending the SciPy'09 tutorials
In-Reply-To: <1cd32cbb0908090434m2fe57da1xbadc7e1657b19694@mail.gmail.com>
References: <1cd32cbb0908090434m2fe57da1xbadc7e1657b19694@mail.gmail.com>
Message-ID: 

On Sun, Aug 9, 2009 at 4:34 AM, wrote:
> is "Availability: recent flavors of Unix. " required (python 2.5 help
> for os.uname)
>

Thanks for the catch, sorry about that. My unix-isms showing through...

Updated.

f

From giuseppe.aprea at gmail.com Mon Aug 10 06:20:40 2009
From: giuseppe.aprea at gmail.com (Giuseppe Aprea)
Date: Mon, 10 Aug 2009 11:20:40 +0100
Subject: [Numpy-discussion] problem during installation
Message-ID: 

Sorry if I posted this twice. I wonder if anyone can suggest how to
fix this error, which I get during the numpy build:

.........
creating build/temp.linux-i686-2.6/numpy/linalg
compile options: '-DNO_ATLAS_INFO=1 -Inumpy/core/include
-Ibuild/src.linux-i686-2.6/numpy/core/include/numpy -Inumpy/core/src
-Inumpy/core/include -I/home/gaprea/usr/local/include/python2.6 -c'
gcc: numpy/linalg/lapack_litemodule.c
gcc: numpy/linalg/python_xerbla.c
/usr/bin/gfortran -Wall -L/home/gaprea/usr/local/lib
build/temp.linux-i686-2.6/numpy/linalg/lapack_litemodule.o
build/temp.linux-i686-2.6/numpy/linalg/python_xerbla.o
-L/home/gaprea/usr/local/lib -L/usr/lib -L/home/gaprea/usr/local/lib
-Lbuild/temp.linux-i686-2.6 -llapack -lblas -lpython2.6 -lgfortran -o
build/lib.linux-i686-2.6/numpy/linalg/lapack_lite.so
/usr/lib/gcc/i486-linux-gnu/4.2.4/libgfortranbegin.a(fmain.o): In
function `main':
(.text+0x23): undefined reference to `MAIN__'
collect2: ld returned 1 exit status
/usr/lib/gcc/i486-linux-gnu/4.2.4/libgfortranbegin.a(fmain.o): In
function `main':
(.text+0x23): undefined reference to `MAIN__'
collect2: ld returned 1 exit status
error: Command "/usr/bin/gfortran -Wall -L/home/gaprea/usr/local/lib
build/temp.linux-i686-2.6/numpy/linalg/lapack_litemodule.o
build/temp.linux-i686-2.6/numpy/linalg/python_xerbla.o
-L/home/gaprea/usr/local/lib -L/usr/lib -L/home/gaprea/usr/local/lib
-Lbuild/temp.linux-i686-2.6 -llapack -lblas -lpython2.6 -lgfortran -o
build/lib.linux-i686-2.6/numpy/linalg/lapack_lite.so" failed with exit
status 1

It seems that the installation program is using gfortran to link
while it should have used gcc. I am using Kubuntu 8.04 and I have the
following version of gfortran and gcc:

$ gfortran -v
Using built-in specs.
Target: i486-linux-gnu Configured with: ../src/configure -v --enable-languages=c,c++,fortran,objc,obj-c++,treelang --prefix=/usr --enable-shared --with-system-zlib --libexecdir=/usr/lib --without-included-gettext --enable-threads=posix --enable-nls --with-gxx-include-dir=/usr/include/c++/4.2 --program-suffix=-4.2 --enable-clocale=gnu --enable-libstdcxx-debug --enable-objc-gc --enable-mpfr --enable-targets=all --enable-checking=release --build=i486-linux-gnu --host=i486-linux-gnu --target=i486-linux-gnu Thread model: posix gcc version 4.2.4 (Ubuntu 4.2.4-1ubuntu4) $ gcc -v Using built-in specs. Target: i486-linux-gnu Configured with: ../src/configure -v --enable-languages=c,c++,fortran,objc,obj-c++,treelang --prefix=/usr --enable-shared --with-system-zlib --libexecdir=/usr/lib --without-included-gettext --enable-threads=posix --enable-nls --with-gxx-include-dir=/usr/include/c++/4.2 --program-suffix=-4.2 --enable-clocale=gnu --enable-libstdcxx-debug --enable-objc-gc --enable-mpfr --enable-targets=all --enable-checking=release --build=i486-linux-gnu --host=i486-linux-gnu --target=i486-linux-gnu Thread model: posix gcc version 4.2.4 (Ubuntu 4.2.4-1ubuntu4) does anybody have a suggestion? thanks giuseppe From wfspotz at sandia.gov Mon Aug 10 11:14:10 2009 From: wfspotz at sandia.gov (Bill Spotz) Date: Mon, 10 Aug 2009 11:14:10 -0400 Subject: [Numpy-discussion] SWIG, numpy.i and errno: comments? In-Reply-To: References: Message-ID: Sure. On Aug 9, 2009, at 12:50 PM, Egor Zindy wrote: > Bill, > > thank you for your comment. Would this do instead? (replacing the > return NULL with SWIG_fail): > > %exception > { > errno = 0; > $action > > if (errno != 0) > { > switch(errno) > { > case EPERM: > PyErr_Format(PyExc_IndexError, "Index out of range"); > break; > case ENOMEM: > PyErr_Format(PyExc_MemoryError, "failed malloc()"); > break; > default: > PyErr_Format(PyExc_Exception, "Unknown exception"); > } > SWIG_fail; > } > } > > Cheers, > Egor > > On Sun, Aug 9, 2009 at 2:30 PM, Bill Spotz wrote: >> Egor, >> >> This looks about right. However, it is customary to invoke the >> SWIG macro >> "SWIG_fail;" instead of "break;". (This translates into a "goto" >> to the >> failure label, and is better in case there is any other cleanup >> code to >> execute.) >> >> On Aug 9, 2009, at 6:17 AM, Egor Zindy wrote: >> >>> Hello list, >>> >>> this is my attempt at generating python exceptions in SWIG/C using >>> the >>> errno mechanism: >>> >>> http://www.scipy.org/Cookbook/SWIG_NumPy_examples#head-10f49a0f5ea6b313127d2ec5ffa1eaf1c133cb22 >>> >>> Used together with numpy.i, this has been useful for notifying (in a >>> pythonic way) memory allocation errors or array index problems. >>> >>> A change in the errno global variable is detected in the %exception >>> part of the SWIG interface file, and Python exceptions are generated >>> after $action depending on the errno error code value. >>> >>> %exception >>> { >>> errno = 0; >>> $action >>> >>> if (errno != 0) >>> { >>> switch(errno) >>> { >>> case EPERM: >>> PyErr_Format(PyExc_IndexError, "Index out of range"); >>> break; >>> case ENOMEM: >>> PyErr_Format(PyExc_MemoryError, "Failed malloc()"); >>> break; >>> default: >>> PyErr_Format(PyExc_Exception, "Unknown exception"); >>> } >>> return NULL; >>> } >>> } >>> >>> If there's a better way of doing this, I'll update the cookbook >>> recipe. >>> >>> Regards, >>> Egor >> >> ** Bill Spotz ** >> ** Sandia National Laboratories Voice: (505)845-0170 ** >> ** P.O. 
Box 5800                 Fax:   (505)284-0154      **
>> ** Albuquerque, NM 87185-0370    Email: wfspotz at sandia.gov **
>>
>>
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

** Bill Spotz                                              **
** Sandia National Laboratories  Voice: (505)845-0170      **
** P.O. Box 5800                 Fax:   (505)284-0154      **
** Albuquerque, NM 87185-0370    Email: wfspotz at sandia.gov **

From rjel at ceh.ac.uk Mon Aug 10 11:08:27 2009
From: rjel at ceh.ac.uk (Rich E)
Date: Mon, 10 Aug 2009 15:08:27 +0000 (UTC)
Subject: [Numpy-discussion] Question about bounds checking
Message-ID: 

Dear all,
I am having a few issues with indexing in numpy and wondered if you
could help me out.
If I define an array
a = zeros(( 4))
a
array([ 0.,  0.,  0.,  0.])

Then I try and reference a point beyond the bounds of the array

a[4]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
IndexError: index out of bounds

but if I use the slicing format to reference the point I get

a[0:4]
array([ 0.,  0.,  0.,  0.])
a[0:10]
array([ 0.,  0.,  0.,  0.])

it returns a[ 0 : 3 ], with no error raised. If I then ask for the
shape of the array, I get
a.shape
(4,)

but if I use

a[0:3].shape
(3,)

which is one less than I would have expected.

a[0:4].shape
(4,)

This is numpy 1.2.1 on python 2.5
Thanks in advance for your help,
Rich

From sccolbert at gmail.com Mon Aug 10 11:44:23 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Mon, 10 Aug 2009 11:44:23 -0400
Subject: [Numpy-discussion] Question about bounds checking
In-Reply-To: 
References: 
Message-ID: <7f014ea60908100844i3cb94040vcfb575964f175f64@mail.gmail.com>

when you use slice notation, [0:4] returns everything up to but not
including index 4. That is, a[4] is actually the 5th element of the
array (which doesn't exist) because arrays are zero-based in python.

http://docs.scipy.org/doc/numpy-1.3.x/user/basics.indexing.html

On Mon, Aug 10, 2009 at 11:08 AM, Rich E wrote:
> Dear all,
> I am having a few issues with indexing in numpy and wondered if you
> could help me out.
> If I define an array
> a = zeros(( 4))
> a
> array([ 0.,  0.,  0.,  0.])
>
> Then I try and reference a point beyond the bounds of the array
>
> a[4]
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
> IndexError: index out of bounds
>
> but if I use the slicing format to reference the point I get
>
> a[0:4]
> array([ 0.,  0.,  0.,  0.])
> a[0:10]
> array([ 0.,  0.,  0.,  0.])
>
> it returns a[ 0 : 3 ], with no error raised. If I then ask for the
> shape of the array, I get
> a.shape
> (4,)
>
> but if I use
>
> a[0:3].shape
> (3,)
>
> which is one less than I would have expected.
>
> a[0:4].shape
> (4,)
>
> This is numpy 1.2.1 on python 2.5
> Thanks in advance for your help,
> Rich
>
>
>
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From kwgoodman at gmail.com Mon Aug 10 11:55:22 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Mon, 10 Aug 2009 08:55:22 -0700
Subject: [Numpy-discussion] add axis to results of reduction (mean,
	min, ...)
In-Reply-To: <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> Message-ID: On Thu, Aug 6, 2009 at 9:07 AM, Robert Kern wrote: > On Thu, Aug 6, 2009 at 11:03, Keith Goodman wrote: >> On Thu, Aug 6, 2009 at 8:55 AM, wrote: >>> What's the best way of getting back the correct shape to be able to >>> broadcast, mean, min,.. to the original array, that works for >>> arbitrary dimension and axis? >>> >>> I thought I have seen some helper functions, but I don't find them anymore? >>> >>> Josef >>> >>>>>> a >>> array([[1, 2, 3, 3, 0], >>> ? ? ? [2, 2, 3, 2, 1]]) >>>>>> a-a.max(0) >>> array([[-1, ?0, ?0, ?0, -1], >>> ? ? ? [ 0, ?0, ?0, -1, ?0]]) >>>>>> a-a.max(1) >>> Traceback (most recent call last): >>> ?File "", line 1, in >>> ? ?a-a.max(1) >>> ValueError: shape mismatch: objects cannot be broadcast to a single shape >>>>>> a-a.max(1)[:,None] >>> array([[-2, -1, ?0, ?0, -3], >>> ? ? ? [-1, -1, ?0, -1, -2]]) >> >> Would this do it? >> >>>> pylab.demean?? >> Type: ? ? ? ? ? function >> Base Class: ? ? >> String Form: ? ? >> Namespace: ? ? ?Interactive >> File: ? ? ? ? ? /usr/lib/python2.6/dist-packages/matplotlib/mlab.py >> Definition: ? ? pylab.demean(x, axis=0) >> Source: >> def demean(x, axis=0): >> ? ?"Return x minus its mean along the specified axis" >> ? ?x = np.asarray(x) >> ? ?if axis: >> ? ? ? ?ind = [slice(None)] * axis >> ? ? ? ?ind.append(np.newaxis) >> ? ? ? ?return x - x.mean(axis)[ind] >> ? ?return x - x.mean(axis) > > Ouch! That doesn't handle axis=-1. > > if axis != 0: > ? ?ind = [slice(None)] * x.ndim > ? ?ind[axis] = np.newaxis Ouch! That doesn't handle axis=None. if axis: ind = [slice(None)] * x.ndim ind[axis] = np.newaxis From josef.pktd at gmail.com Mon Aug 10 12:10:05 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 10 Aug 2009 12:10:05 -0400 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) In-Reply-To: References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> Message-ID: <1cd32cbb0908100910k281da2bci8bd2716c04de123e@mail.gmail.com> On Mon, Aug 10, 2009 at 11:55 AM, Keith Goodman wrote: > On Thu, Aug 6, 2009 at 9:07 AM, Robert Kern wrote: >> On Thu, Aug 6, 2009 at 11:03, Keith Goodman wrote: >>> On Thu, Aug 6, 2009 at 8:55 AM, wrote: >>>> What's the best way of getting back the correct shape to be able to >>>> broadcast, mean, min,.. to the original array, that works for >>>> arbitrary dimension and axis? >>>> >>>> I thought I have seen some helper functions, but I don't find them anymore? >>>> >>>> Josef >>>> >>>>>>> a >>>> array([[1, 2, 3, 3, 0], >>>> ? ? ? [2, 2, 3, 2, 1]]) >>>>>>> a-a.max(0) >>>> array([[-1, ?0, ?0, ?0, -1], >>>> ? ? ? [ 0, ?0, ?0, -1, ?0]]) >>>>>>> a-a.max(1) >>>> Traceback (most recent call last): >>>> ?File "", line 1, in >>>> ? ?a-a.max(1) >>>> ValueError: shape mismatch: objects cannot be broadcast to a single shape >>>>>>> a-a.max(1)[:,None] >>>> array([[-2, -1, ?0, ?0, -3], >>>> ? ? ? [-1, -1, ?0, -1, -2]]) >>> >>> Would this do it? >>> >>>>> pylab.demean?? >>> Type: ? ? ? ? ? function >>> Base Class: ? ? >>> String Form: ? ? >>> Namespace: ? ? ?Interactive >>> File: ? ? ? ? ? /usr/lib/python2.6/dist-packages/matplotlib/mlab.py >>> Definition: ? ? pylab.demean(x, axis=0) >>> Source: >>> def demean(x, axis=0): >>> ? ?"Return x minus its mean along the specified axis" >>> ? 
?x = np.asarray(x) >>> ? ?if axis: >>> ? ? ? ?ind = [slice(None)] * axis >>> ? ? ? ?ind.append(np.newaxis) >>> ? ? ? ?return x - x.mean(axis)[ind] >>> ? ?return x - x.mean(axis) >> >> Ouch! That doesn't handle axis=-1. >> >> if axis != 0: >> ? ?ind = [slice(None)] * x.ndim >> ? ?ind[axis] = np.newaxis > > Ouch! That doesn't handle axis=None. > > if axis: > ? ?ind = [slice(None)] * x.ndim > ? ?ind[axis] = np.newaxis that's why I used if axis != 0 and not axis is None: and included a testcase for None. (although my version looks a bit verbose but explicit) Josef > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From kwgoodman at gmail.com Mon Aug 10 12:21:25 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Mon, 10 Aug 2009 09:21:25 -0700 Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...) In-Reply-To: <1cd32cbb0908100910k281da2bci8bd2716c04de123e@mail.gmail.com> References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com> <3d375d730908060907g10ba8374i73c9747abcc10392@mail.gmail.com> <1cd32cbb0908100910k281da2bci8bd2716c04de123e@mail.gmail.com> Message-ID: On Mon, Aug 10, 2009 at 9:10 AM, wrote: > On Mon, Aug 10, 2009 at 11:55 AM, Keith Goodman wrote: >> On Thu, Aug 6, 2009 at 9:07 AM, Robert Kern wrote: >>> On Thu, Aug 6, 2009 at 11:03, Keith Goodman wrote: >>>> On Thu, Aug 6, 2009 at 8:55 AM, wrote: >>>>> What's the best way of getting back the correct shape to be able to >>>>> broadcast, mean, min,.. to the original array, that works for >>>>> arbitrary dimension and axis? >>>>> >>>>> I thought I have seen some helper functions, but I don't find them anymore? >>>>> >>>>> Josef >>>>> >>>>>>>> a >>>>> array([[1, 2, 3, 3, 0], >>>>> ? ? ? [2, 2, 3, 2, 1]]) >>>>>>>> a-a.max(0) >>>>> array([[-1, ?0, ?0, ?0, -1], >>>>> ? ? ? [ 0, ?0, ?0, -1, ?0]]) >>>>>>>> a-a.max(1) >>>>> Traceback (most recent call last): >>>>> ?File "", line 1, in >>>>> ? ?a-a.max(1) >>>>> ValueError: shape mismatch: objects cannot be broadcast to a single shape >>>>>>>> a-a.max(1)[:,None] >>>>> array([[-2, -1, ?0, ?0, -3], >>>>> ? ? ? [-1, -1, ?0, -1, -2]]) >>>> >>>> Would this do it? >>>> >>>>>> pylab.demean?? >>>> Type: ? ? ? ? ? function >>>> Base Class: ? ? >>>> String Form: ? ? >>>> Namespace: ? ? ?Interactive >>>> File: ? ? ? ? ? /usr/lib/python2.6/dist-packages/matplotlib/mlab.py >>>> Definition: ? ? pylab.demean(x, axis=0) >>>> Source: >>>> def demean(x, axis=0): >>>> ? ?"Return x minus its mean along the specified axis" >>>> ? ?x = np.asarray(x) >>>> ? ?if axis: >>>> ? ? ? ?ind = [slice(None)] * axis >>>> ? ? ? ?ind.append(np.newaxis) >>>> ? ? ? ?return x - x.mean(axis)[ind] >>>> ? ?return x - x.mean(axis) >>> >>> Ouch! That doesn't handle axis=-1. >>> >>> if axis != 0: >>> ? ?ind = [slice(None)] * x.ndim >>> ? ?ind[axis] = np.newaxis >> >> Ouch! That doesn't handle axis=None. >> >> if axis: >> ? ?ind = [slice(None)] * x.ndim >> ? ?ind[axis] = np.newaxis > > that's why I used > > ?if axis != 0 and not axis is None: > > and included a testcase for None. (although my version looks a bit > verbose but explicit) I'm getting better. I'm only 3 days behind this time. Yeah, I caught it on a unit test too. 
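For the record, np.expand_dims already does the index bookkeeping
(negative axes included), so a version that also survives axis=None
could look like this -- a lightly-tested sketch:

import numpy as np

def demean(x, axis=0):
    "Return x minus its mean along the specified axis."
    x = np.asarray(x)
    if axis is None:
        return x - x.mean()  # scalar mean broadcasts against any shape
    # re-insert the reduced axis so the mean broadcasts against x
    return x - np.expand_dims(x.mean(axis), axis)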
From david.huard at gmail.com Mon Aug 10 15:08:01 2009 From: david.huard at gmail.com (David Huard) Date: Mon, 10 Aug 2009 15:08:01 -0400 Subject: [Numpy-discussion] vectorize problem with f2py and gfortran 4.3 Message-ID: <91cf711d0908101208s25601d88uf554b2df1ce99adf@mail.gmail.com> Hi all, A user on the pymc user list has reported a problem with f2py wrapped fortran functions compiled with gfortran 4.3, which is the standard Ubuntu Jaunty fortran compiler. I noticed the same bug in some of my own routines. The problem, as far as I can understand, is that vectorize tries to find the number of arguments by calling the function with no arguments and parsing the error message. With numpy 1.3, python 2.6 and gfortran 4.3, the error message is not what numpy expects, and does not contain the expected number of arguments. So I am wondering if there is a reliable way to introspect compiled extensions to provide the number of arguments needed by vectorize ? Thanks, David -------------- next part -------------- An HTML attachment was scrubbed... URL: From liukis at usc.edu Mon Aug 10 15:19:24 2009 From: liukis at usc.edu (Maria Liukis) Date: Mon, 10 Aug 2009 12:19:24 -0700 Subject: [Numpy-discussion] Indexing empty array with empty boolean array causes "IndexError: invalid index exception" Message-ID: <7F7802B9-BD2F-4350-99B6-708D140089C8@usc.edu> Hello everybody, I'm using following versions of Scipy and Numpy packages: >>> scipy.__version__ '0.7.1' >>> np.__version__ '1.3.0' My code uses boolean array to filter 2-dimensional array which sometimes happens to be an empty array. It seems like I have to take special care when dimension I'm filtering is zero, otherwise I'm getting an "IndexError: invalid index" exception: >>> import numpy as np >>> a = np.zeros((2,10)) >>> a array([[ 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.], [ 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.]]) >>> filter_array = np.zeros(2,) >>> filter_array array([False, False], dtype=bool) >>> a[filter_array,:] array([], shape=(0, 10), dtype=float64) >>>>>> Now if filtered dimension is zero: >>> a = np.ones((0,10)) >>> a array([], shape=(0, 10), dtype=float64) >>> filter_array = np.zeros((0,), dtype=bool) >>> filter_array array([], dtype=bool) >>> filter_array.shape (0,) >>> a.shape (0, 10) >>> a[filter_array,:] Traceback (most recent call last): File "", line 1, in IndexError: invalid index >>> Would somebody know if it's an expected behavior, a package bug or am I doing something wrong? Thanks in advance, Masha -------------------- liukis at usc.edu -------------- next part -------------- An HTML attachment was scrubbed... URL: From liukis at usc.edu Mon Aug 10 15:23:41 2009 From: liukis at usc.edu (Maria Liukis) Date: Mon, 10 Aug 2009 12:23:41 -0700 Subject: [Numpy-discussion] Indexing empty array with empty boolean array causes "IndexError: invalid index exception" In-Reply-To: <7F7802B9-BD2F-4350-99B6-708D140089C8@usc.edu> References: <7F7802B9-BD2F-4350-99B6-708D140089C8@usc.edu> Message-ID: <27D03FF9-3531-4BA9-A5A9-A9839C1A1CE0@usc.edu> A correction for the typo below. Thanks, Masha -------------------- liukis at usc.edu On Aug 10, 2009, at 12:19 PM, Maria Liukis wrote: > Hello everybody, > > I'm using following versions of Scipy and Numpy packages: > >>> scipy.__version__ > '0.7.1' > >>> np.__version__ > '1.3.0' > > My code uses boolean array to filter 2-dimensional array which > sometimes happens to be an empty array. 
It seems like I have to > take special care when dimension I'm filtering is zero, otherwise > I'm getting an "IndexError: invalid index" exception: > > >>> import numpy as np > >>> a = np.zeros((2,10)) Sorry, copied wrong line for an example which creates an array: >>> a = np.ones((2,10)) > >>> a > array([[ 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.], > [ 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.]]) > >>> filter_array = np.zeros(2,) > >>> filter_array > array([False, False], dtype=bool) > >>> a[filter_array,:] > array([], shape=(0, 10), dtype=float64) > >>>>>> > > > Now if filtered dimension is zero: > >>> a = np.ones((0,10)) > >>> a > array([], shape=(0, 10), dtype=float64) > >>> filter_array = np.zeros((0,), dtype=bool) > >>> filter_array > array([], dtype=bool) > >>> filter_array.shape > (0,) > >>> a.shape > (0, 10) > >>> a[filter_array,:] > Traceback (most recent call last): > File "", line 1, in > IndexError: invalid index > >>> > > Would somebody know if it's an expected behavior, a package bug or > am I doing something wrong? > > > Thanks in advance, > Masha > -------------------- > liukis at usc.edu > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From brennan.williams at visualreservoir.com Mon Aug 10 20:52:54 2009 From: brennan.williams at visualreservoir.com (Brennan Williams) Date: Tue, 11 Aug 2009 12:52:54 +1200 Subject: [Numpy-discussion] speeding up getting a subset of a data array Message-ID: <4A80C0E6.2090107@visualreservoir.com> Hi No doubt asked many times before so apologies.... I'm pulling a subset array out of a data array where I have a list of the indices I want (could be an array rather than a list actually - I have it in both). Potentially the number of points and the number of times I do this can get very large so any saving in time is good. So, paraphrasing what I've currently got.... say I have... subsetpointerlist=[0,1,2,5,8,15,25...] subsetsize=len(subsetpointerlist) subsetarray=zeros(subsetsize,dtype=float) for index,pos in enumerate(subsetpointerlist): subsetarray[index]=dataarray[pos] How do I speed this up in numpy, i.e. by removing the for loop? Do I set up some sort of a subsetpointerarray as a mask and then somehow apply that to dataarray to get the values into subsetarray? Thanks Brennan From josef.pktd at gmail.com Mon Aug 10 20:58:53 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 10 Aug 2009 20:58:53 -0400 Subject: [Numpy-discussion] speeding up getting a subset of a data array In-Reply-To: <4A80C0E6.2090107@visualreservoir.com> References: <4A80C0E6.2090107@visualreservoir.com> Message-ID: <1cd32cbb0908101758k21ab9a7bs7523c5fa625a8f98@mail.gmail.com> On Mon, Aug 10, 2009 at 8:52 PM, Brennan Williams wrote: > Hi > > No doubt asked many times before so apologies.... > > I'm pulling a subset array out of a data array where I have a list of > the indices I want (could be an array rather than a list actually - I > have it in both). > > Potentially the number of points and the number of times I do this can > get very large so any saving in time is good. > > So, paraphrasing what I've currently got.... say I have... > > subsetpointerlist=[0,1,2,5,8,15,25...] 
> subsetsize=len(subsetpointerlist) > subsetarray=zeros(subsetsize,dtype=float) > for index,pos in enumerate(subsetpointerlist): > ?subsetarray[index]=dataarray[pos] > > How do I speed this up in numpy, i.e. by removing the for loop? > > Do I set up some sort of a subsetpointerarray as a mask and then somehow > apply that to dataarray to get the values into subsetarray? > > Thanks > > Brennan > > looks to me like subsetarray = dataarray[subsetpointerlist] or with type conversion subsetarray = dataarray[subsetpointerlist].astype(float) Josef > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From brennan.williams at visualreservoir.com Mon Aug 10 21:03:03 2009 From: brennan.williams at visualreservoir.com (Brennan Williams) Date: Tue, 11 Aug 2009 13:03:03 +1200 Subject: [Numpy-discussion] speeding up getting a subset of a data array In-Reply-To: <4A80C0E6.2090107@visualreservoir.com> References: <4A80C0E6.2090107@visualreservoir.com> Message-ID: <4A80C347.5080909@visualreservoir.com> Brennan Williams wrote: > Hi > > No doubt asked many times before so apologies.... > > I'm pulling a subset array out of a data array where I have a list of > the indices I want (could be an array rather than a list actually - I > have it in both). > > Potentially the number of points and the number of times I do this can > get very large so any saving in time is good. > > So, paraphrasing what I've currently got.... say I have... > > subsetpointerlist=[0,1,2,5,8,15,25...] > subsetsize=len(subsetpointerlist) > subsetarray=zeros(subsetsize,dtype=float) > for index,pos in enumerate(subsetpointerlist): > subsetarray[index]=dataarray[pos] > > How do I speed this up in numpy, i.e. by removing the for loop? > > It's not as simple as... subsetarray=dataarray[subsetpointerarray] is it? > Do I set up some sort of a subsetpointerarray as a mask and then somehow > apply that to dataarray to get the values into subsetarray? > > Thanks > > Brennan > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > From brennan.williams at visualreservoir.com Mon Aug 10 21:10:29 2009 From: brennan.williams at visualreservoir.com (Brennan Williams) Date: Tue, 11 Aug 2009 13:10:29 +1200 Subject: [Numpy-discussion] speeding up getting a subset of a data array In-Reply-To: <1cd32cbb0908101758k21ab9a7bs7523c5fa625a8f98@mail.gmail.com> References: <4A80C0E6.2090107@visualreservoir.com> <1cd32cbb0908101758k21ab9a7bs7523c5fa625a8f98@mail.gmail.com> Message-ID: <4A80C505.7050208@visualreservoir.com> josef.pktd at gmail.com wrote: > On Mon, Aug 10, 2009 at 8:52 PM, Brennan > Williams wrote: > >> Hi >> >> No doubt asked many times before so apologies.... >> >> I'm pulling a subset array out of a data array where I have a list of >> the indices I want (could be an array rather than a list actually - I >> have it in both). >> >> Potentially the number of points and the number of times I do this can >> get very large so any saving in time is good. >> >> So, paraphrasing what I've currently got.... say I have... >> >> subsetpointerlist=[0,1,2,5,8,15,25...] >> subsetsize=len(subsetpointerlist) >> subsetarray=zeros(subsetsize,dtype=float) >> for index,pos in enumerate(subsetpointerlist): >> subsetarray[index]=dataarray[pos] >> >> How do I speed this up in numpy, i.e. by removing the for loop? 
>> >> Do I set up some sort of a subsetpointerarray as a mask and then somehow >> apply that to dataarray to get the values into subsetarray? >> >> Thanks >> >> Brennan >> >> >> > > looks to me like > > subsetarray = dataarray[subsetpointerlist] > > or with type conversion > > subsetarray = dataarray[subsetpointerlist].astype(float) > > Josef > Thanks, with a little bit of googling/rtfm I'm getting there. Think I overdid my thinking on mask based on something else that Robert Kern helped me out with. > >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > From fiolj at yahoo.com Mon Aug 10 23:29:14 2009 From: fiolj at yahoo.com (Juan Fiol) Date: Mon, 10 Aug 2009 20:29:14 -0700 (PDT) Subject: [Numpy-discussion] saving incrementally numpy arrays Message-ID: <668426.19955.qm@web52512.mail.re2.yahoo.com> Hi, I am creating numpy arrays in chunks and I want to save the chunks while my program creates them. I tried to use numpy.save but it failed (because it is not intended to append data). I'd like to know what is, in your opinion, the best way to go. I will put a few thousands every time but building up a file of several Gbytes. I do not want to put into memory all previous data each time. Also I cannot wait until the program finishes, I must save partial results periodically. Thanks, any help will be appreciated Juan From dwf at cs.toronto.edu Tue Aug 11 00:16:50 2009 From: dwf at cs.toronto.edu (David Warde-Farley) Date: Tue, 11 Aug 2009 00:16:50 -0400 Subject: [Numpy-discussion] saving incrementally numpy arrays In-Reply-To: <668426.19955.qm@web52512.mail.re2.yahoo.com> References: <668426.19955.qm@web52512.mail.re2.yahoo.com> Message-ID: <16374B35-670B-4C77-95B1-DCDF304FC2D6@cs.toronto.edu> On 10-Aug-09, at 11:29 PM, Juan Fiol wrote: > Hi, I am creating numpy arrays in chunks and I want to save the > chunks while my program creates them. I tried to use numpy.save but > it failed (because it is not intended to append data). I'd like to > know what is, in your opinion, the best way to go. I will put a few > thousands every time but building up a file of several Gbytes. I do > not want to put into memory all previous data each time PyTables sounds like a good way to go. If you need to append to arrays themselves it can do that too, but it can certainly append new arrays to a file. David From slaunger at gmail.com Tue Aug 11 02:57:40 2009 From: slaunger at gmail.com (Kim Hansen) Date: Tue, 11 Aug 2009 08:57:40 +0200 Subject: [Numpy-discussion] saving incrementally numpy arrays In-Reply-To: <668426.19955.qm@web52512.mail.re2.yahoo.com> References: <668426.19955.qm@web52512.mail.re2.yahoo.com> Message-ID: I have had some resembling challenges in my work, and here appending the nympy arrays to HDF5 files using PyTables has been the solution for me - that used in combination with lzo compression/decompression has lead to very high read/write performance in my application with low memory consumption. You may also want to have a look at the h5py package. Kim 2009/8/11 Juan Fiol > Hi, I am creating numpy arrays in chunks and I want to save the chunks > while my program creates them. I tried to use numpy.save but it failed > (because it is not intended to append data). 
I'd like to know what is, in > your opinion, the best way to go. I will put a few thousands every time but > building up a file of several Gbytes. I do not want to put into memory > all previous data each time. Also I cannot wait until the program finishes, > I must save partial results periodically. Thanks, any help will be > appreciated > Juan > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Tue Aug 11 14:05:21 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 11 Aug 2009 13:05:21 -0500 Subject: [Numpy-discussion] saving incrementally numpy arrays In-Reply-To: <668426.19955.qm@web52512.mail.re2.yahoo.com> References: <668426.19955.qm@web52512.mail.re2.yahoo.com> Message-ID: <3d375d730908111105m4c0985f7k9a4a7fc2f6accca9@mail.gmail.com> On Mon, Aug 10, 2009 at 22:29, Juan Fiol wrote: > Hi, I am creating numpy arrays in chunks and I want to save the chunks while my program creates them. I tried to use numpy.save but it failed (because it is not intended to append data). I'd like to know what is, in your opinion, the best way to go. I will put a few thousands every time but building up a file of several Gbytes. I do not want to put into memory > all previous data each time. Also I cannot wait until the program finishes, I must save partial results periodically. Thanks, any help will be appreciated As others mentioned, PyTables is an excellent, complete solution. If you still want to write your own, then you can pass an open file object to numpy.save() in order to append. Just open it with the mode 'a+b' and seek to the end. f = open('myfile.npy', 'a+b') f.seek(0, 2) numpy.save(f, chunk) f.close() -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From kwgoodman at gmail.com Tue Aug 11 14:46:28 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Tue, 11 Aug 2009 11:46:28 -0700 Subject: [Numpy-discussion] saving incrementally numpy arrays In-Reply-To: <3d375d730908111105m4c0985f7k9a4a7fc2f6accca9@mail.gmail.com> References: <668426.19955.qm@web52512.mail.re2.yahoo.com> <3d375d730908111105m4c0985f7k9a4a7fc2f6accca9@mail.gmail.com> Message-ID: On Tue, Aug 11, 2009 at 11:05 AM, Robert Kern wrote: > On Mon, Aug 10, 2009 at 22:29, Juan Fiol wrote: >> Hi, I am creating numpy arrays in chunks and I want to save the chunks while my program creates them. I tried to use numpy.save but it failed (because it is not intended to append data). I'd like to know what is, in your opinion, the best way to go. I will put a few thousands every time but building up a file of several Gbytes. I do not want to put into memory >> all previous data each time. Also I cannot wait until the program finishes, I must save partial results periodically. Thanks, any help will be appreciated > > As others mentioned, PyTables is an excellent, complete solution. If > you still want to write your own, then you can pass an open file > object to numpy.save() in order to append. Just open it with the mode > 'a+b' and seek to the end. > > ?f = open('myfile.npy', 'a+b') > ?f.seek(0, 2) > ?numpy.save(f, chunk) > ?f.close() That looks nice. What am I doing wrong? 
>> x = np.array([1,2,3]) >> y = np.array([4,5,6]) >> >> f = open('myfile.npy', 'a+b') >> np.save(f, x) >> f.seek(0, 2) >> np.save(f, y) >> f.close() >> >> xy = np.load('myfile.npy') >> xy array([1, 2, 3]) I was expecting something like array([1, 2, 3, 4, 5, 6]). From fiolj at yahoo.com Tue Aug 11 15:28:24 2009 From: fiolj at yahoo.com (Juan Fiol) Date: Tue, 11 Aug 2009 12:28:24 -0700 (PDT) Subject: [Numpy-discussion] saving incrementally numpy arrays In-Reply-To: Message-ID: <391080.28922.qm@web52501.mail.re2.yahoo.com> Hi, thanks for all the answers. I am checking how to use pytables now, though I probably prefer to do it without further dependencies. I tried opening the file as 'append' and then pickle the array (because looking to the numpy.save it looked like what they did), but to retrieve the data then I have to load multiple times and concatenate (numpy.c_[]). I did not tried Robert suggestion yet, but it will probably happen the same and that is what Keith is seeing (though I may be wrong too). If I do not find a suitable solution with only numpy I'll learn how to use pytables. Thanks and Best regards, Juan --- On Tue, 8/11/09, Keith Goodman wrote: > From: Keith Goodman > Subject: Re: [Numpy-discussion] saving incrementally numpy arrays > To: "Discussion of Numerical Python" > Date: Tuesday, August 11, 2009, 7:46 PM > On Tue, Aug 11, 2009 at 11:05 AM, > Robert Kern > wrote: > > On Mon, Aug 10, 2009 at 22:29, Juan Fiol > wrote: > >> Hi, I am creating numpy arrays in chunks and I > want to save the chunks while my program creates them. I > tried to use numpy.save but it failed (because it is not > intended to append data). I'd like to know what is, in your > opinion, the best way to go. I will put a few thousands > every time but building up a file of several Gbytes. I do > not want to put into memory > >> all previous data each time. Also I cannot wait > until the program finishes, I must save partial results > periodically. Thanks, any help will be appreciated > > > > As others mentioned, PyTables is an excellent, > complete solution. If > > you still want to write your own, then you can pass an > open file > > object to numpy.save() in order to append. Just open > it with the mode > > 'a+b' and seek to the end. > > > > ?f = open('myfile.npy', 'a+b') > > ?f.seek(0, 2) > > ?numpy.save(f, chunk) > > ?f.close() > > That looks nice. What am I doing wrong? > > >> x = np.array([1,2,3]) > >> y = np.array([4,5,6]) > >> > >> f = open('myfile.npy', 'a+b') > >> np.save(f, x) > >> f.seek(0, 2) > >> np.save(f, y) > >> f.close() > >> > >> xy = np.load('myfile.npy') > >> xy > ???array([1, 2, 3]) > > I was expecting something like array([1, 2, 3, 4, 5, 6]). > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From fiolj at yahoo.com Tue Aug 11 15:40:35 2009 From: fiolj at yahoo.com (Juan Fiol) Date: Tue, 11 Aug 2009 12:40:35 -0700 (PDT) Subject: [Numpy-discussion] saving incrementally numpy arrays In-Reply-To: <391080.28922.qm@web52501.mail.re2.yahoo.com> Message-ID: <124312.33233.qm@web52510.mail.re2.yahoo.com> Hi, again, I can confirm that you have to load multiple times. Also I do not see differences if using or not the f.seek line The following snippet gives the expected result. The problem is that that way I have to load as many times as I wrote. Besides that, it works. 
Thanks,
Juan

#-----------------------------------------
import numpy as np

x = np.array([[1,2,3],[4,5,6]])
y = np.array([[7,8,9],[10,11,12]])

f = open('myfile1.npy', 'a+b')
np.save(f, x)
# f.seek(0, 2)
np.save(f, y)
f.close()

fi = open('myfile1.npy','rb')
x1 = np.load(fi)
y1 = np.load(fi)
fi.close()
#-----------------------------------------

--- On Tue, 8/11/09, Juan Fiol wrote:

> From: Juan Fiol
> Subject: Re: [Numpy-discussion] saving incrementally numpy arrays
> To: "Discussion of Numerical Python"
> Date: Tuesday, August 11, 2009, 8:28 PM
> Hi, thanks for all the answers. I am
> checking how to use pytables now, though I probably prefer
> to do it without further dependencies. I tried opening the
> file as 'append' and then pickle the array (because looking
> to the numpy.save it looked like what they did), but to
> retrieve the data then I have to load multiple times and
> concatenate (numpy.c_[]). I did not tried Robert suggestion
> yet, but it will probably happen the same and that is what
> Keith is seeing (though I may be wrong too).
> If I do not find a suitable solution with only numpy I'll
> learn how to use pytables. Thanks and Best regards,
> Juan
>
> --- On Tue, 8/11/09, Keith Goodman wrote:
>
> > From: Keith Goodman
> > Subject: Re: [Numpy-discussion] saving incrementally numpy arrays
> > To: "Discussion of Numerical Python"
> > Date: Tuesday, August 11, 2009, 7:46 PM
> > On Tue, Aug 11, 2009 at 11:05 AM,
> > Robert Kern
> > wrote:
> > > On Mon, Aug 10, 2009 at 22:29, Juan Fiol
> > wrote:
> > >> Hi, I am creating numpy arrays in chunks and
> I
> > want to save the chunks while my program creates them. I
> > tried to use numpy.save but it failed (because it is
> not
> > intended to append data). I'd like to know what is, in your
> > opinion, the best way to go. I will put a few
> thousands
> > every time but building up a file of several Gbytes. I
> do
> > not want to put into memory
> > >> all previous data each time. Also I cannot
> wait
> > until the program finishes, I must save partial
> results
> > periodically. Thanks, any help will be appreciated
> > >
> > > As others mentioned, PyTables is an excellent,
> > complete solution. If
> > > you still want to write your own, then you can
> pass an
> > open file
> > > object to numpy.save() in order to append. Just
> open
> > it with the mode
> > > 'a+b' and seek to the end.
> > >
> > >  f = open('myfile.npy', 'a+b')
> > >  f.seek(0, 2)
> > >  numpy.save(f, chunk)
> > >  f.close()
> >
> > That looks nice. What am I doing wrong?
> >
> > >> x = np.array([1,2,3])
> > >> y = np.array([4,5,6])
> > >>
> > >> f = open('myfile.npy', 'a+b')
> > >> np.save(f, x)
> > >> f.seek(0, 2)
> > >> np.save(f, y)
> > >> f.close()
> > >>
> > >> xy = np.load('myfile.npy')
> > >> xy
> >    array([1, 2, 3])
> >
> > I was expecting something like array([1, 2, 3, 4, 5,
> 6]).
> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion at scipy.org
> > http://mail.scipy.org/mailman/listinfo/numpy-discussion
> >
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
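[Editorial note: the read-back loop Juan describes can be wrapped so the
number of saved chunks need not be known in advance. A sketch, assuming
arrays were np.save()d back-to-back into one file; the file name is
illustrative:]

    import os
    import numpy as np

    def load_chunks(filename):
        """Load every array that was np.save()d back-to-back into one file."""
        size = os.path.getsize(filename)
        chunks = []
        fi = open(filename, 'rb')
        try:
            # each np.load(file_object) consumes exactly one saved record,
            # so keep reading until the file position reaches the end
            while fi.tell() < size:
                chunks.append(np.load(fi))
        finally:
            fi.close()
        return chunks

    # rebuild the full array from the partial saves, e.g.:
    # full = np.concatenate(load_chunks('myfile1.npy'))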
From lciti at essex.ac.uk  Tue Aug 11 16:26:44 2009
From: lciti at essex.ac.uk (Citi, Luca)
Date: Tue, 11 Aug 2009 21:26:44 +0100
Subject: [Numpy-discussion] saving incrementally numpy arrays
References: <124312.33233.qm@web52510.mail.re2.yahoo.com>
Message-ID: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E63@sernt14.essex.ac.uk>

You can do something a bit tricky but possibly working.
I made the assumption of a C-ordered 1d vector.

import numpy as np
import numpy.lib.format as fmt

# example of chunks
chunks = [np.arange(l) for l in range(5,10)]

# at the beginning
fp = open('myfile.npy', 'wb')
d = dict(
    descr=fmt.dtype_to_descr(chunks[0].dtype),
    fortran_order=False,
    shape=(2**30,),  # some big shape you think you'll never reach
)
fp.write(fmt.magic(1,0))
fmt.write_array_header_1_0(fp, d)
h_len = fp.tell()
l = 0

# ... for each chunk ...
for chunk in chunks:
    l += len(chunk)
    fp.write(chunk.tostring('C'))

# finally
fp.seek(0,0)
fp.write(fmt.magic(1,0))
d['shape'] = (l,)
fmt.write_array_header_1_0(fp, d)
fp.write(' ' * (h_len - fp.tell() - 1))
fp.close()

From danny.handoko at asml.com  Wed Aug 12 03:12:09 2009
From: danny.handoko at asml.com (Danny Handoko)
Date: Wed, 12 Aug 2009 09:12:09 +0200
Subject: [Numpy-discussion] Faulty behavior of numpy.histogram?
Message-ID: 

Dear all,

We try to use numpy.histogram with combination of matplotlib. We are using
numpy 1.3.0, but a somewhat older matplotlib version of 0.91.2.
Matplotlib's axes.hist() function calls the numpy.histogram, passing
through the 'normed' parameter. However, this version of matplotlib uses
'0' as the default value of 'normed' (I see it fixed in higher version).
What I found strange is that if the 'normed' parameter of numpy.histogram is
set with other object than 'True' or 'False', the output becomes None, but
no exceptions are raised. As a result, the matplotlib code that does
something like this:

>>> n, bins = numpy.histogram([1,2,3], 10, range = None, normed = 0)
Traceback (most recent call last):
  File "", line 1, in
TypeError: 'NoneType' object is not iterable

results in the above exception.

Secondly, this matplotlib version also expects both outputs to be of the
same length, which is no longer true with the new histogram semantics. This
can be easily reverted using the parameter 'new = False' in numpy.histogram,
but this parameter is not available for the caller of axes.hist() function
in matplotlib. Is there any way to tell numpy to use the old semantics?

Upgrading to the newer matplotlib is a rather longer term solution, and we
hope to be able to find some workaround/short-term solution

Thank you,

-- 
Danny Handoko
System Architecture and Generics
Room 7G2.003 -- ph: x2968
email: danny.handoko at asml.com

-- The information contained in this communication and any attachments is
confidential and may be privileged, and is for the sole use of the intended
recipient(s). Any unauthorized review, use, disclosure or distribution is
prohibited. Unless explicitly stated otherwise in the body of this
communication or the attachment thereto (if any), the information is
provided on an AS-IS basis without any express or implied warranties or
liabilities. To the extent you are relying on this information, you are
doing so at your own risk.
If you are not the intended recipient, please notify the sender immediately by replying to this message and destroy all copies of this message and any attachments. ASML is neither liable for the proper and complete transmission of the information contained in this communication, nor for any delay in its receipt. -------------- next part -------------- An HTML attachment was scrubbed... URL: From lars.bittrich at googlemail.com Wed Aug 12 04:31:57 2009 From: lars.bittrich at googlemail.com (Lars Bittrich) Date: Wed, 12 Aug 2009 10:31:57 +0200 Subject: [Numpy-discussion] identity Message-ID: <200908121031.57678.lars.bittrich@googlemail.com> Hi, a colleague made me aware of a speed issue with numpy.identity. Since he was using numpy.diag(numpy.ones(N)) before, he expected identity to be at least as fast as diag. But that is not the case. We found that there was a discussion on the list (July, 20th; "My identity" by Keith Goodman). The presented solution was much faster. Someone wondered if the change was already made in the svn. But I got something different: In [1]:import numpy In [2]:numpy.__version__ Out[2]:'1.4.0.dev7301' In [3]:numpy.identity?? [...] def identity(n, dtype=None): """ [...] """ a = array([1]+n*[0],dtype=dtype) b = empty((n,n),dtype=dtype) # Note that this assignment depends on the convention that since the a # array is shorter than the flattened b array, then the a array will # be repeated until it is the appropriate size. Given a's construction, # this nicely sets the diagonal to all ones. b.flat = a return b instead of (mail by Keith Goodman): def myidentity(n, dtype=None): a = zeros((n,n), dtype=dtype) a.flat[::n+1] = 1 return a Did I look at the wrong place or is there a reason to keep the slow version of identity? Lars From jdh2358 at gmail.com Wed Aug 12 07:28:08 2009 From: jdh2358 at gmail.com (John Hunter) Date: Wed, 12 Aug 2009 06:28:08 -0500 Subject: [Numpy-discussion] adaptive sampling of an interval or plane Message-ID: <88e473830908120428u382060c8he45ef631f1bc63c0@mail.gmail.com> We would like to add function plotting to mpl, but to do this right we need to be able to adaptively sample a function evaluated over an interval so that some tolerance condition is satisfied, perhaps with both a relative and absolute error tolerance condition. I am a bit out of my area of competency here, eg I do not know exactly how the tolerance condition should be specified, but I suspect some of you here may be experts on this. Does anyone have some code compatible with the BSD license, preferably based on numpy but we would consider an extension code or scipy solution, for doing this? The functionality we have in mind is provided in matlab with fplot http://www.mathworks.com/access/helpdesk/help/techdoc/index.html?/access/helpdesk/help/techdoc/ref/fplot.html We would like 1D and 2D versions of this ideally. If anyone has some suggestions, let me know. Thanks, JDH From david.huard at gmail.com Wed Aug 12 10:12:59 2009 From: david.huard at gmail.com (David Huard) Date: Wed, 12 Aug 2009 10:12:59 -0400 Subject: [Numpy-discussion] Faulty behavior of numpy.histogram? In-Reply-To: References: Message-ID: <91cf711d0908120712uc19f09eo225d59541f9ff075@mail.gmail.com> On Wed, Aug 12, 2009 at 3:12 AM, Danny Handoko wrote: > Dear all, > > We try to use numpy.histogram with combination of matplotlib. We are using > numpy 1.3.0, but a somewhat older matplotlib version of 0.91.2. > Matplotlib's axes.hist() function calls the numpy.histogram, passing > through the 'normed' parameter. 
However, this version of matplotlib uses > '0' as the default value of 'normed' (I see it fixed in higher version). > What I found strange is that if the 'normed' parameter of numpy.histogram is > set with other object than 'True' or 'False', the output becomes None, but > no exceptions are raised. As a result, the matplotlib code that does > something like this: > > >>> n, bins = numpy.histogram([1,2,3], 10, range = None, normed = 0) > Traceback (most recent call last): > File "", line 1, in > TypeError: 'NoneType' object is not iterable > results in the above exception. > This is now fixed. Thanks. > > Secondly, this matplotlib version also expects both outputs to be of the > same length, which is no longer true with the new histogram semantics. This > can be easily reverted using the parameter 'new = False' in numpy.histogram, > but this parameter is not available for the caller of axes.hist() function > in matplotlib. Is there any way to tell numpy to use the old semantics? > > Could you go in the numpy source code and change the default value for new ? David > Upgrading to the newer matplotlib is a rather longer term solution, and we > hope to be able to find some workaround/short-term solution > > Thank you, > > > -- > > Danny Handoko > > System Architecture and Generics > > Room 7G2.003 -- ph: x2968 > > email: danny.handoko at asml.com > > > -- The information contained in this communication and any attachments is > confidential and may be privileged, and is for the sole use of the intended > recipient(s). Any unauthorized review, use, disclosure or distribution is > prohibited. Unless explicitly stated otherwise in the body of this > communication or the attachment thereto (if any), the information is > provided on an AS-IS basis without any express or implied warranties or > liabilities. To the extent you are relying on this information, you are > doing so at your own risk. If you are not the intended recipient, please > notify the sender immediately by replying to this message and destroy all > copies of this message and any attachments. ASML is neither liable for the > proper and complete transmission of the information contained in this > communication, nor for any delay in its receipt. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From kwgoodman at gmail.com Wed Aug 12 10:24:36 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Wed, 12 Aug 2009 07:24:36 -0700 Subject: [Numpy-discussion] identity In-Reply-To: <200908121031.57678.lars.bittrich@googlemail.com> References: <200908121031.57678.lars.bittrich@googlemail.com> Message-ID: On Wed, Aug 12, 2009 at 1:31 AM, Lars Bittrich wrote: > Hi, > > a colleague made me aware of a speed issue with numpy.identity. Since he was > using numpy.diag(numpy.ones(N)) before, he expected identity to be at least as > fast as diag. But that is not the case. > > We found that there was a discussion on the list (July, 20th; "My identity" by > Keith Goodman). The presented solution was much faster. Someone wondered if > the change was already made in the svn. > But I got something different: > > In [1]:import numpy > > In [2]:numpy.__version__ > Out[2]:'1.4.0.dev7301' > > In [3]:numpy.identity?? > [...] > def identity(n, dtype=None): > ? ?""" > ? ?[...] > ? ?""" > ? ?a = array([1]+n*[0],dtype=dtype) > ? 
?b = empty((n,n),dtype=dtype) > > ? ?# Note that this assignment depends on the convention that since the a > ? ?# array is shorter than the flattened b array, then the a array will > ? ?# be repeated until it is the appropriate size. Given a's construction, > ? ?# this nicely sets the diagonal to all ones. > ? ?b.flat = a > ? ?return b > > instead of (mail by Keith Goodman): > > def myidentity(n, dtype=None): > ? ?a = zeros((n,n), dtype=dtype) > ? ?a.flat[::n+1] = 1 > ? ?return a > > > Did I look at the wrong place or is there a reason to keep the slow version of > identity? Things tend to get lost on the mailing list. The next step would be to file a ticket on the numpy trac. (I've never done that) That would increase the chance of someone important taking a look at it. From kwgoodman at gmail.com Wed Aug 12 10:54:25 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Wed, 12 Aug 2009 07:54:25 -0700 Subject: [Numpy-discussion] identity In-Reply-To: References: <200908121031.57678.lars.bittrich@googlemail.com> Message-ID: On Wed, Aug 12, 2009 at 7:24 AM, Keith Goodman wrote: > On Wed, Aug 12, 2009 at 1:31 AM, Lars > Bittrich wrote: >> Hi, >> >> a colleague made me aware of a speed issue with numpy.identity. Since he was >> using numpy.diag(numpy.ones(N)) before, he expected identity to be at least as >> fast as diag. But that is not the case. >> >> We found that there was a discussion on the list (July, 20th; "My identity" by >> Keith Goodman). The presented solution was much faster. Someone wondered if >> the change was already made in the svn. >> But I got something different: >> >> In [1]:import numpy >> >> In [2]:numpy.__version__ >> Out[2]:'1.4.0.dev7301' >> >> In [3]:numpy.identity?? >> [...] >> def identity(n, dtype=None): >> ? ?""" >> ? ?[...] >> ? ?""" >> ? ?a = array([1]+n*[0],dtype=dtype) >> ? ?b = empty((n,n),dtype=dtype) >> >> ? ?# Note that this assignment depends on the convention that since the a >> ? ?# array is shorter than the flattened b array, then the a array will >> ? ?# be repeated until it is the appropriate size. Given a's construction, >> ? ?# this nicely sets the diagonal to all ones. >> ? ?b.flat = a >> ? ?return b >> >> instead of (mail by Keith Goodman): >> >> def myidentity(n, dtype=None): >> ? ?a = zeros((n,n), dtype=dtype) >> ? ?a.flat[::n+1] = 1 >> ? ?return a >> >> >> Did I look at the wrong place or is there a reason to keep the slow version of >> identity? > > Things tend to get lost on the mailing list. The next step would be to > file a ticket on the numpy trac. (I've never done that) That would > increase the chance of someone important taking a look at it. Here's the ticket: http://projects.scipy.org/numpy/ticket/1193 BTW, a fast eye function is already in svn. But identity, having fewer options, is a tiny bit faster. From ralph at dont-mind.de Wed Aug 12 11:22:53 2009 From: ralph at dont-mind.de (Ralph Heinkel) Date: Wed, 12 Aug 2009 17:22:53 +0200 Subject: [Numpy-discussion] Howto create a record array from arrays without copying their data Message-ID: <4A82DE4D.7090002@dont-mind.de> Hi, I'm creating (actually calculating) a set of very large 1-d arrays (vectors), which I would like to assemble into a record array so I can access the data row-wise. Unfortunately it seems that all data of my original 1-d arrays are getting copied in memory during that process. Is there a way to get around that? 
Basically what I do is: arr1 = numpy.array([1, 4, 5], dtype=int) arr2 = numpy.array([5.5, 6.6, 9.9], dtype=float) recarray = numpy.core.rec.fromarrays([arr1, arr2], names=['col1', 'col2']) When I now make a change in recarray like recarray.col1[0] = 5000 I cannot see the change in arr1. (So that's why I assume that the data is copied). Also in the numy book it tells that would be copied. Thanks, Ralph From rmay31 at gmail.com Wed Aug 12 11:28:55 2009 From: rmay31 at gmail.com (Ryan May) Date: Wed, 12 Aug 2009 10:28:55 -0500 Subject: [Numpy-discussion] Howto create a record array from arrays without copying their data In-Reply-To: <4A82DE4D.7090002@dont-mind.de> References: <4A82DE4D.7090002@dont-mind.de> Message-ID: On Wed, Aug 12, 2009 at 10:22 AM, Ralph Heinkel wrote: > Hi, > > I'm creating (actually calculating) a set of very large 1-d arrays > (vectors), which I would like to assemble into a record array so I can > access the data row-wise. Unfortunately it seems that all data of my > original 1-d arrays are getting copied in memory during that process. > Is there a way to get around that? I don't think so, because fundamentally numpy assumes array elements are packed together in memory. If you know C, record arrays are pretty much arrays of structures. You could try just using a python dictionary to hold the arrays, depending on you motives behind using a record array. Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma Sent from Norman, Oklahoma, United States -------------- next part -------------- An HTML attachment was scrubbed... URL: From scott.sinclair.za at gmail.com Wed Aug 12 11:29:26 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Wed, 12 Aug 2009 17:29:26 +0200 Subject: [Numpy-discussion] identity In-Reply-To: References: <200908121031.57678.lars.bittrich@googlemail.com> Message-ID: <6a17e9ee0908120829o7c9f8d56x2d9699a34fe01a20@mail.gmail.com> >2009/8/12 Keith Goodman : > On Wed, Aug 12, 2009 at 7:24 AM, Keith Goodman wrote: >> On Wed, Aug 12, 2009 at 1:31 AM, Lars >> Bittrich wrote: >>> >>> a colleague made me aware of a speed issue with numpy.identity. Since he was >>> using numpy.diag(numpy.ones(N)) before, he expected identity to be at least as >>> fast as diag. But that is not the case. >>> >>> We found that there was a discussion on the list (July, 20th; "My identity" by >>> Keith Goodman). The presented solution was much faster. Someone wondered if >>> the change was already made in the svn. >> >> Things tend to get lost on the mailing list. The next step would be to >> file a ticket on the numpy trac. (I've never done that) That would >> increase the chance of someone important taking a look at it. > > Here's the ticket: > > http://projects.scipy.org/numpy/ticket/1193 > A patch against recent SVN trunk is attached to the ticket. Please review... Cheers, Scott From robert.kern at gmail.com Wed Aug 12 11:45:56 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 12 Aug 2009 10:45:56 -0500 Subject: [Numpy-discussion] reduce function of vectorize doesn't respect dtype? In-Reply-To: References: Message-ID: <3d375d730908120845n25017935p43d02c9cb69408a6@mail.gmail.com> On Fri, Aug 7, 2009 at 13:54, T J wrote: > The reduce function of ufunc of a vectorized function doesn't seem to > respect the dtype. 
>
>>>> def a(x,y): return x+y
>>>> b = vectorize(a)
>>>> c = array([1,2])
>>>> b(c, c)  # use once to populate b.ufunc
>>>> d = b.ufunc.reduce(c)
>>>> c.dtype, type(d)
> dtype('int32'),
>
>>>> c = array([[1,2,3],[4,5,6]])
>>>> b.ufunc.reduce(c)
> array([5, 7, 9], dtype=object)
>
> My goal is to use the output of vectorize() as if it is actually a ufunc.

vectorize()d ufuncs are always object->object ufuncs because Python
functions take objects and return objects, not C ints or other C data
types.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From charlesr.harris at gmail.com  Wed Aug 12 11:53:59 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 12 Aug 2009 09:53:59 -0600
Subject: [Numpy-discussion] identity
In-Reply-To: <6a17e9ee0908120829o7c9f8d56x2d9699a34fe01a20@mail.gmail.com>
References: <200908121031.57678.lars.bittrich@googlemail.com>
	<6a17e9ee0908120829o7c9f8d56x2d9699a34fe01a20@mail.gmail.com>
Message-ID: 

On Wed, Aug 12, 2009 at 9:29 AM, Scott Sinclair wrote:

> >2009/8/12 Keith Goodman :
> > On Wed, Aug 12, 2009 at 7:24 AM, Keith Goodman wrote:
> >> On Wed, Aug 12, 2009 at 1:31 AM, Lars
> >> Bittrich wrote:
> >>>
> >>> a colleague made me aware of a speed issue with numpy.identity. Since
> he was
> >>> using numpy.diag(numpy.ones(N)) before, he expected identity to be at
> least as
> >>> fast as diag. But that is not the case.
> >>>
> >>> We found that there was a discussion on the list (July, 20th; "My
> identity" by
> >>> Keith Goodman). The presented solution was much faster. Someone
> wondered if
> >>> the change was already made in the svn.
> >>
> >> Things tend to get lost on the mailing list. The next step would be to
> >> file a ticket on the numpy trac. (I've never done that) That would
> >> increase the chance of someone important taking a look at it.
> >
> > Here's the ticket:
> >
> > http://projects.scipy.org/numpy/ticket/1193
> >
>
> A patch against recent SVN trunk is attached to the ticket. Please
> review...
>

Already done. Thanks.

Chuck

From josef.pktd at gmail.com  Wed Aug 12 12:00:01 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 12 Aug 2009 12:00:01 -0400
Subject: [Numpy-discussion] Howto create a record array from arrays without
	copying their data
In-Reply-To: 
References: <4A82DE4D.7090002@dont-mind.de>
Message-ID: <1cd32cbb0908120900m17f9266evcaa8d8ef039c449d@mail.gmail.com>

On Wed, Aug 12, 2009 at 11:28 AM, Ryan May wrote:
> On Wed, Aug 12, 2009 at 10:22 AM, Ralph Heinkel wrote:
>>
>> Hi,
>>
>> I'm creating (actually calculating) a set of very large 1-d arrays
>> (vectors), which I would like to assemble into a record array so I can
>> access the data row-wise.  Unfortunately it seems that all data of my
>> original 1-d arrays are getting copied in memory during that process.
>> Is there a way to get around that?
>
> I don't think so, because fundamentally numpy assumes array elements are
> packed together in memory.  If you know C, record arrays are pretty much
> arrays of structures.  You could try just using a python dictionary to hold
> the arrays, depending on you motives behind using a record array.
>
> Ryan
>

Can you preallocate the record array and fill it up as you calculate
the values? You can use the reference to the recarray columns.
I never tried with recarrays.

Josef
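[Editorial note: Josef's preallocate-and-fill idea, written out as a short
sketch. The column names and sizes are made up; whether all copies are
avoided depends on computing the values directly into the views:]

    import numpy as np

    n = 3  # number of rows, known before the calculation starts
    rec = np.empty(n, dtype=[('col1', int), ('col2', float)])

    col1 = rec['col1']  # field access returns views into rec, not copies
    col2 = rec['col2']

    col1[:] = [1, 4, 5]        # fill in place instead of copying afterwards
    col2[:] = [5.5, 6.6, 9.9]

    rec['col1'][0] = 5000      # the change is visible through col1 as well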
From robert.kern at gmail.com  Wed Aug 12 12:02:45 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 12 Aug 2009 11:02:45 -0500
Subject: [Numpy-discussion] Question about bounds checking
In-Reply-To: 
References: 
Message-ID: <3d375d730908120902i3d70c33frf677108edeeedfa6@mail.gmail.com>

On Mon, Aug 10, 2009 at 10:08, Rich E wrote:
> Dear all,
> I am having a few issues with indexing in numpy and wondered if you could help
> me out.
> If I define an array
> a = zeros(( 4))
> a
> array([ 0.,  0.,  0.,  0.])
>
> Then I try and reference a point beyond the bounds of the array
>
> a[4]
> Traceback (most recent call last):
>   File "", line 1, in
> IndexError: index out of bounds
>
> but if I use the slicing format to reference the point I get
>
> a[0:4]
> array([ 0.,  0.,  0.,  0.])
> a[0:10]
> array([ 0.,  0.,  0.,  0.])

We do not raise an IndexError in the latter case because we follow
Python's behavior for lists and tuples.

In [1]: a = range(4)

In [2]: a[0:10]
Out[2]: [0, 1, 2, 3]

In [3]: a[9]
---------------------------------------------------------------------------
IndexError                                Traceback (most recent call last)

/Users/rkern/ in ()

IndexError: list index out of range

This is particularly useful in cases where you are iterating over
chunks of the array. You do not have to handle the last chunk, which
may be smaller than the others, as a special case.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From robert.kern at gmail.com  Wed Aug 12 12:04:58 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 12 Aug 2009 11:04:58 -0500
Subject: [Numpy-discussion] memmap, write through and flush
In-Reply-To: <4A7E3572.9000909@jpl.nasa.gov>
References: <4A7E3572.9000909@jpl.nasa.gov>
Message-ID: <3d375d730908120904t4c8cfdffq3ae569442821a20a@mail.gmail.com>

On Sat, Aug 8, 2009 at 21:33, Tom Kuiper wrote:
> There is something curious here.  The second flush() fails.  Can anyone
> explain this?

numpy.append() does not append values in-place. It is just a
convenience wrapper for numpy.concatenate().

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From josef.pktd at gmail.com  Wed Aug 12 12:07:37 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 12 Aug 2009 12:07:37 -0400
Subject: [Numpy-discussion] is this a chararray decode bug
Message-ID: <1cd32cbb0908120907x6ffaebecv873926af3b097196@mail.gmail.com>

(copied from the lengthy unicode thread in scipy-dev, so it doesn't get lost)

this looks like a bug ? or is it a known limitation that chararrays
cannot be 0-d

>>> b0= np.array(u'\xe9','
>>> print b0.encode('cp1252')
Traceback (most recent call last):
  File "", line 1, in
    print b0.encode('cp1252')
  File "C:\Programs\Python25\Lib\site-packages\numpy\core\defchararray.py",
line 217, in encode
    return self._generalmethod('encode', broadcast(self, encoding, errors))
  File "C:\Programs\Python25\Lib\site-packages\numpy\core\defchararray.py",
line 162, in _generalmethod
    newarr[:] = res
ValueError: cannot slice a 0-d array

Josef

From robert.kern at gmail.com  Wed Aug 12 12:13:01 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 12 Aug 2009 11:13:01 -0500
Subject: [Numpy-discussion] How to preserve number of array dimensions
	when taking a slice?
In-Reply-To: <24875133.post@talk.nabble.com> References: <24875133.post@talk.nabble.com> Message-ID: <3d375d730908120913h4476734ftd12f2bd622d61406@mail.gmail.com> On Fri, Aug 7, 2009 at 23:53, Dr. Phillip M. Feldman wrote: > > I'd like to be able to make a slice of a 3-dimensional array, doing something > like the following: > > Y= X[A, B, C] > > where A, B, and C are lists of indices. This works, but has an unexpected > side-effect. When A, B, or C is a length-1 list, Y has fewer dimensions than > X. Is there a way to do the slice such that the number of dimensions is > preserved, i.e., I'd like Y to be a 3-dimensional array, even if one or more > dimensions is unity. ?Is there a way to do this? http://docs.scipy.org/doc/numpy/reference/arrays.indexing.html -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Wed Aug 12 12:15:43 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 12 Aug 2009 11:15:43 -0500 Subject: [Numpy-discussion] memmap capability In-Reply-To: <4A7C6CDA.9050400@jpl.nasa.gov> References: <4A7C6CDA.9050400@jpl.nasa.gov> Message-ID: <3d375d730908120915u2fd709a4w51679baf251dbf0f@mail.gmail.com> On Fri, Aug 7, 2009 at 13:05, Tom Kuiper wrote: > If this appears twice, forgive me.? I sent it previously (7:13 am PDT) via a > browser interface to JPL's Office Outlook.? I have doubts about this > system.? This time, from Iceweasel through our SMTP server. > > There are two things I'd like to do using memmap.? I suspect that they are > impossible but maybe I'm missing some subtlety. > 1) I would like to append rows to a memmap array and have the modified array > changed on disk also. > 2) I would like to have the memory view of the array on disk change, i.e., > modify the offset for an opened array. > The only way I can think of involves opening and closing arrays repeatedly. That's the only way to do it. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Wed Aug 12 12:17:39 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 12 Aug 2009 11:17:39 -0500 Subject: [Numpy-discussion] Strange Error with NumPy Addendum In-Reply-To: References: <3d375d730908061629s721052d6tc8246e4f81231d2c@mail.gmail.com> <3FDEDA59-7E92-44EF-9506-4159A0AFDF3E@cs.toronto.edu> Message-ID: <3d375d730908120917h2f4b2d7au165abdc766aa709a@mail.gmail.com> On Fri, Aug 7, 2009 at 07:15, Nanime Puloski wrote: > But if it were an unsigned int64, it should be able to hold 2**64 or at > least 2**64-1. > Am I correct? There is no numpy.sin() implementation for uint64s, just the floating point types. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From robert.kern at gmail.com Wed Aug 12 12:25:20 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 12 Aug 2009 11:25:20 -0500 Subject: [Numpy-discussion] Indexing empty array with empty boolean array causes "IndexError: invalid index exception" In-Reply-To: <7F7802B9-BD2F-4350-99B6-708D140089C8@usc.edu> References: <7F7802B9-BD2F-4350-99B6-708D140089C8@usc.edu> Message-ID: <3d375d730908120925s67c4f23cp643b96f5adc1696c@mail.gmail.com> On Mon, Aug 10, 2009 at 14:19, Maria Liukis wrote: > Hello everybody, > I'm using following versions of Scipy and Numpy packages: >>>> scipy.__version__ > '0.7.1' >>>> np.__version__ > '1.3.0' > My code uses boolean array to filter 2-dimensional array which sometimes > happens to be an empty array. It seems like I have to take special care when > dimension I'm filtering is zero, otherwise I'm getting an "IndexError: > invalid index" exception: >>>> import numpy as np >>>> a ?= np.zeros((2,10)) >>>> a > array([[ 1., ?1., ?1., ?1., ?1., ?1., ?1., ?1., ?1., ?1.], > ?? ? ? [ 1., ?1., ?1., ?1., ?1., ?1., ?1., ?1., ?1., ?1.]]) If that were actually your output from zeros(), that would definitely be a bug. :-) >>>>?filter_array = np.zeros(2,) >>>> filter_array > array([False, False], dtype=bool) >>>> a[filter_array,:] > array([], shape=(0, 10), dtype=float64) >>>>>>> > > Now if filtered dimension is zero: >>>> a ?= np.ones((0,10)) >>>> a > array([], shape=(0, 10), dtype=float64) >>>> filter_array = np.zeros((0,), dtype=bool) >>>> filter_array > array([], dtype=bool) >>>> filter_array.shape > (0,) >>>> a.shape > (0, 10) >>>> a[filter_array,:] > Traceback (most recent call last): > ??File "", line 1, in > IndexError: invalid index >>>> > Would somebody know if it's an expected behavior, a package bug or am I > doing something wrong? I would call it a bug. It's a corner case that we should probably handle gracefully rather than raising an exception. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From naromero at gmail.com Wed Aug 12 12:48:56 2009 From: naromero at gmail.com (Nichols A. Romero) Date: Wed, 12 Aug 2009 11:48:56 -0500 Subject: [Numpy-discussion] NumPy C-API - copying from Fortran to C order Message-ID: <6ac064b60908120948j3d7fde67rd273b734d18229db@mail.gmail.com> Hi, I am working on a *very* simple Python interface to ScaLAPACK using the NumPy C-API. I am not using f2py at all. Simple question: How can I copy a C-order NumPy array into a Fortran-order NumPy array within the C-API? (This is trivial in Python, it is simply A = A.copy("Fortran")) I would like to do this with the minimal amount of memory use. The matrices are 2-d and rectangular, but not square. It looks like I use PyArray_CopyInto or PyArray_MoveInto, but I would need to create another array using PyArray_EMPTY(2, dims, NPY_DOUBLE, NPY_F_CONTIGUOUS) Thanks in advance for your assistance. -- Nichols A. Romero, Ph.D. Argonne Leadership Computing Facility Argonne, IL 60490 (630) 252-3441 (O) (630) 470-0462 (C) -------------- next part -------------- An HTML attachment was scrubbed... 
URL: 

From kwgoodman at gmail.com Wed Aug 12 13:02:43 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Wed, 12 Aug 2009 10:02:43 -0700
Subject: [Numpy-discussion] identity
In-Reply-To: 
References: <200908121031.57678.lars.bittrich@googlemail.com>
	<6a17e9ee0908120829o7c9f8d56x2d9699a34fe01a20@mail.gmail.com>
Message-ID: 

On Wed, Aug 12, 2009 at 8:53 AM, Charles R Harris wrote:
>
> On Wed, Aug 12, 2009 at 9:29 AM, Scott Sinclair wrote:
>>
>> 2009/8/12 Keith Goodman :
>> > On Wed, Aug 12, 2009 at 7:24 AM, Keith Goodman wrote:
>> >> On Wed, Aug 12, 2009 at 1:31 AM, Lars Bittrich wrote:
>> >>>
>> >>> a colleague made me aware of a speed issue with numpy.identity.
>> >>> Since he was using numpy.diag(numpy.ones(N)) before, he expected
>> >>> identity to be at least as fast as diag. But that is not the case.
>> >>>
>> >>> We found that there was a discussion on the list (July, 20th; "My
>> >>> identity" by Keith Goodman). The presented solution was much faster.
>> >>> Someone wondered if the change was already made in the svn.
>> >>
>> >> Things tend to get lost on the mailing list. The next step would be
>> >> to file a ticket on the numpy trac. (I've never done that) That
>> >> would increase the chance of someone important taking a look at it.
>> >
>> > Here's the ticket:
>> >
>> > http://projects.scipy.org/numpy/ticket/1193
>>
>> A patch against recent SVN trunk is attached to the ticket. Please
>> review...
>
> Already done. Thanks.

Hey, thanks. Now I know how to do the first two steps:

(1) Extend the work of others (in this case Luca Citi and Robert Kern)
(2) File a ticket
(3) ???
(4) Profit

From sccolbert at gmail.com Wed Aug 12 13:36:39 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Wed, 12 Aug 2009 13:36:39 -0400
Subject: [Numpy-discussion] identity
In-Reply-To: 
References: <200908121031.57678.lars.bittrich@googlemail.com>
	<6a17e9ee0908120829o7c9f8d56x2d9699a34fe01a20@mail.gmail.com>
Message-ID: <7f014ea60908121036l68e33805u9991b2b181f168fc@mail.gmail.com>

Someone posts on offtopic.com

> (1) Extend the work of others (in this case Luca Citi and Robert Kern)
> (2) File a ticket
> (3) ???
> (4) Profit
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From fperez.net at gmail.com Wed Aug 12 14:51:40 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Wed, 12 Aug 2009 11:51:40 -0700
Subject: [Numpy-discussion] adaptive sampling of an interval or plane
In-Reply-To: <88e473830908120428u382060c8he45ef631f1bc63c0@mail.gmail.com>
References: <88e473830908120428u382060c8he45ef631f1bc63c0@mail.gmail.com>
Message-ID: 

On Wed, Aug 12, 2009 at 4:28 AM, John Hunter wrote:
> We would like to add function plotting to mpl, but to do this right we
> need to be able to adaptively sample a function evaluated over an
> interval so that some tolerance condition is satisfied, perhaps with
> both a relative and absolute error tolerance condition. I am a bit
> out of my area of competency here, eg I do not know exactly how the
> tolerance condition should be specified, but I suspect some of you
> here may be experts on this. Does anyone have some code compatible
> with the BSD license, preferably based on numpy but we would consider
> an extension code or scipy solution, for doing this?
>
> The functionality we have in mind is provided in matlab with fplot
>
>   http://www.mathworks.com/access/helpdesk/help/techdoc/index.html?/access/helpdesk/help/techdoc/ref/fplot.html
>
> We would like 1D and 2D versions of this ideally. If anyone has some
> suggestions, let me know.

In a past life I wrote code to do this in d=1..6, with lots of other
bells and whistles. I'm no longer actively involved with the project,
but a colleague has recently updated it with an eye towards releasing
it, and right now I'm the bottleneck (time) to review the changes.

I just checked out the updated version of the code and it looks a fair
bit simpler than my original machinery, which had accreted lots of
Fortran dependencies for historical but otherwise uninteresting
reasons.

How about we have a look at this next week at the conference? I'll
ping my colleague in the meantime to check on this...

Cheers,

f

From ellisonbg.net at gmail.com Wed Aug 12 15:20:17 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Wed, 12 Aug 2009 12:20:17 -0700
Subject: [Numpy-discussion] adaptive sampling of an interval or plane
In-Reply-To: <88e473830908120428u382060c8he45ef631f1bc63c0@mail.gmail.com>
References: <88e473830908120428u382060c8he45ef631f1bc63c0@mail.gmail.com>
Message-ID: <6ce0ac130908121220t36649058p94950169c15b95aa@mail.gmail.com>

We should also talk to Ondrej about this at SciPy. Both sympy (through
mpmath) and mpmath have matplotlib based function plotting. I don't
think it is adaptive, but I know mpmath can handle singularities.

Also, Ondrej is doing his graduate work with a group that does adaptive
finite elements, so he would also be familiar with such algorithms.

I am sure that sympy and mpmath (Sage as well) would be some of the
main users of function plotting and would love to see this happen.

Cheers,

Brian

On Wed, Aug 12, 2009 at 4:28 AM, John Hunter wrote:
> We would like to add function plotting to mpl, but to do this right we
> need to be able to adaptively sample a function evaluated over an
> interval so that some tolerance condition is satisfied, perhaps with
> both a relative and absolute error tolerance condition. I am a bit
> out of my area of competency here, eg I do not know exactly how the
> tolerance condition should be specified, but I suspect some of you
> here may be experts on this. Does anyone have some code compatible
> with the BSD license, preferably based on numpy but we would consider
> an extension code or scipy solution, for doing this?
>
> The functionality we have in mind is provided in matlab with fplot
>
> http://www.mathworks.com/access/helpdesk/help/techdoc/index.html?/access/helpdesk/help/techdoc/ref/fplot.html
>
> We would like 1D and 2D versions of this ideally. If anyone has some
> suggestions, let me know.
>
> Thanks,
> JDH
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From washakie at gmail.com Wed Aug 12 17:01:52 2009
From: washakie at gmail.com (John [H2O])
Date: Wed, 12 Aug 2009 14:01:52 -0700 (PDT)
Subject: [Numpy-discussion] Masking an array with another array
In-Reply-To: 
References: <49d6b3500904221421x2c32b8a8q136e48d0ce153dd6@mail.gmail.com>
	<20090423051615.GB25215@phare.normalesup.org>
	<49d6b3500904222224u1451e693v98b812fcbad856dc@mail.gmail.com>
	<1cd32cbb0904230608n6a172bcs8716a119bb98a9c2@mail.gmail.com>
Message-ID: <24943419.post@talk.nabble.com>

I suspect I am trying to do something similar... I would like to create
a mask where I have data. In essence, I need to return True where x,y is
equal to lon,lat....

I suppose a setmember solution may somehow be more elegant, but this is
what I've worked up for now... suggestions?

import numpy as np

def genDataMask(x, y, xbounds=(-180,180), ybounds=(-90,90), res=(0.5,0.5)):
    """ generate a data mask
    no data = False
    data = True
    """
    xy = np.column_stack((x, y))
    newx = np.arange(xbounds[0], xbounds[1], res[0])
    newy = np.arange(ybounds[0], ybounds[1], res[1])
    # create datamask, False everywhere until we see data
    dm = np.zeros((len(newx), len(newy)), dtype=bool)
    for _xy in xy:
        dm[np.where(_xy[0] == newx), np.where(_xy[1] == newy)] = True
    return dm

-- 
View this message in context: http://www.nabble.com/Masking-an-array-with-another-array-tp23185887p24943419.html
Sent from the Numpy-discussion mailing list archive at Nabble.com.

From fperez.net at gmail.com Wed Aug 12 18:27:22 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Wed, 12 Aug 2009 15:27:22 -0700
Subject: [Numpy-discussion] SciPy tutorials and talks will be recorded and posted online!
Message-ID: 

Hi all,

as you may recall, there have been recently a number of requests for
videotaping the conference.

I am very happy to announce that we will indeed have full video
coverage this year of both tutorial tracks as well as the main talks
(minus any specific talk where a speaker may object to being taped,
since we'll respect such objections if they are made).

Jeff Teeters and Kilian Koepsell from UC Berkeley, who have in the
past made recordings like these of one of my workshops and a recent
talk by Gael:

http://www.archive.org/search.php?query=Fernando+Perez+scientific+python
http://www.archive.org/details/ucb_py4science_2009_07_14_Gael_Varoquaux

are going to perform the work.

I'd like to sincerely thank:

- Jeff and Kilian for offering to do this work and providing some of
the recording equipment.

- The Redwood Center for Theoretical Neuroscience, which provided
other equipment.

- Enthought, who are funding this on very short notice!!! (Especially
Dave Peterson and Eric Jones, who tolerated my last minute nags very
graciously). Without Enthought's last-minute support, this would
simply not be happening.

I really appreciate that everyone involved worked on short notice to
make this possible, I hope the entire community will benefit from
these resources being available.

Best regards,

f

From fiolj at yahoo.com Wed Aug 12 19:11:26 2009
From: fiolj at yahoo.com (Juan Fiol)
Date: Wed, 12 Aug 2009 16:11:26 -0700 (PDT)
Subject: [Numpy-discussion] saving incrementally numpy arrays
In-Reply-To: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E63@sernt14.essex.ac.uk>
Message-ID: <638936.64677.qm@web52506.mail.re2.yahoo.com>

Hi, I finally decided on the pytables approach because it will be easier
later to work with the data. Now, I know this is not the right place, but
maybe I can get some quick pointers. I've calculated a numpy array of
about 20 columns and a few thousand rows at each time.
I'd like to append all the rows without iterating over the numpy array.
Does someone know what would be the "right" approach? I am looking for
something simple; I do not need to keep the piece of table after I put
it into the h5file. Thanks in advance and regards,

Juan

--- On Tue, 8/11/09, Citi, Luca wrote:

> From: Citi, Luca
> Subject: Re: [Numpy-discussion] saving incrementally numpy arrays
> To: "Discussion of Numerical Python"
> Date: Tuesday, August 11, 2009, 9:26 PM
> You can do something a bit tricky but possibly working.
> I made the assumption of a C-ordered 1d vector.
>
> import numpy as np
> import numpy.lib.format as fmt
>
> # example of chunks
> chunks = [np.arange(l) for l in range(5,10)]
>
> # at the beginning
> fp = open('myfile.npy', 'wb')
> d = dict(
>         descr=fmt.dtype_to_descr(chunks[0].dtype),
>         fortran_order=False,
>         shape=(2**30,),  # some big shape you think you'll never reach
>     )
> fp.write(fmt.magic(1,0))
> fmt.write_array_header_1_0(fp, d)
> h_len = fp.tell()
> l = 0
> # ... for each chunk ...
> for chunk in chunks:
>     l += len(chunk)
>     fp.write(chunk.tostring('C'))
> # finally
> fp.seek(0,0)
> fp.write(fmt.magic(1,0))
> d['shape'] = (l,)
> fmt.write_array_header_1_0(fp, d)
> fp.write(' ' * (h_len - fp.tell() - 1))
> fp.close()
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From dwf at cs.toronto.edu Wed Aug 12 19:32:17 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Wed, 12 Aug 2009 19:32:17 -0400
Subject: [Numpy-discussion] saving incrementally numpy arrays
In-Reply-To: <638936.64677.qm@web52506.mail.re2.yahoo.com>
References: <638936.64677.qm@web52506.mail.re2.yahoo.com>
Message-ID: <2A982BC2-76D9-4697-ADC5-18B0705A31A2@cs.toronto.edu>

On 12-Aug-09, at 7:11 PM, Juan Fiol wrote:
> Hi, I finally decided on the pytables approach because it will be
> easier later to work with the data. Now, I know this is not the right
> place, but maybe I can get some quick pointers. I've calculated a
> numpy array of about 20 columns and a few thousand rows at each time.
> I'd like to append all the rows without iterating over the numpy
> array. Does someone know what would be the "right" approach? I am
> looking for something simple; I do not need to keep the piece of
> table after I put it into the h5file. Thanks in advance and regards,
> Juan

You'll probably want the EArray. createEArray() on a new h5file, then
append to it.

http://www.pytables.org/docs/manual/ch04.html#EArrayMethodsDescr

If your chunks are always the same size it might be best to try and do
your work in-place and not allocate a new NumPy array each time. In
theory 'del' ing the object when you're done with it should work but
the garbage collector may not act quickly enough for your liking/the
allocation step may start slowing you down.

What do I mean? Well, you could clear the array when you're done with
it using foo[:] = 0 (or nan, or whatever) and when you're "building it
up" use the inplace augmented assignment operators as much as possible
(+=, /=, -=, *=, %=, etc.).

David

From fiolj at yahoo.com Wed Aug 12 19:48:30 2009
From: fiolj at yahoo.com (Juan Fiol)
Date: Wed, 12 Aug 2009 16:48:30 -0700 (PDT)
Subject: [Numpy-discussion] saving incrementally numpy arrays
Message-ID: <215845.55275.qm@web52503.mail.re2.yahoo.com>

Thanks David, I'll look into it now.
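Just so I am sure I understood, here is roughly what I will try first
(an untested sketch on my side; I am assuming PyTables 2.x, float64
data and 20 columns, and the file/node names are only placeholders):

import numpy as np
import tables

h5file = tables.openFile('results.h5', mode='w')
# extendable along the first axis, fixed at 20 columns per row
earr = h5file.createEArray(h5file.root, 'data',
                           tables.Float64Atom(), shape=(0, 20))

chunk = np.ones((5000, 20))   # stand-in for one batch from the fortran routine
earr.append(chunk)            # appends all rows at once, no Python-level loop
h5file.flush()
h5file.close()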
Regarding the allocation/deallocation times I think that is not an issue
for me. The chunks are generated by a fortran routine that takes several
minutes to run (I am collecting a few thousand points before saving to
disk). They are approximately the same size but not exactly. I want them
to be stored for later retrieval and analysis in a convenient way.
Thanks, regards

-----------------------------------------

You'll probably want the EArray. createEArray() on a new h5file, then
append to it.

http://www.pytables.org/docs/manual/ch04.html#EArrayMethodsDescr

If your chunks are always the same size it might be best to try and do
your work in-place and not allocate a new NumPy array each time. In
theory 'del' ing the object when you're done with it should work but
the garbage collector may not act quickly enough for your liking/the
allocation step may start slowing you down.

What do I mean? Well, you could clear the array when you're done with
it using foo[:] = 0 (or nan, or whatever) and when you're "building it
up" use the inplace augmented assignment operators as much as possible
(+=, /=, -=, *=, %=, etc.).

David

From scott.sinclair.za at gmail.com Thu Aug 13 01:36:00 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Thu, 13 Aug 2009 07:36:00 +0200
Subject: [Numpy-discussion] memmap, write through and flush
In-Reply-To: <3d375d730908120904t4c8cfdffq3ae569442821a20a@mail.gmail.com>
References: <4A7E3572.9000909@jpl.nasa.gov>
	<3d375d730908120904t4c8cfdffq3ae569442821a20a@mail.gmail.com>
Message-ID: <6a17e9ee0908122236i5f34f10dn20fd074661d8cd62@mail.gmail.com>

> 2009/8/12 Robert Kern :
> On Sat, Aug 8, 2009 at 21:33, Tom Kuiper wrote:
>> There is something curious here. The second flush() fails. Can anyone
>> explain this?
>
> numpy.append() does not append values in-place. It is just a
> convenience wrapper for numpy.concatenate().

Meaning that a copy of the data is returned in an ndarray, so when you do

fp = np.append(fp, [[12,13,14,15]], 0)

the name fp is no longer bound to a memmap, hence

AttributeError: 'numpy.ndarray' object has no attribute 'flush'

Cheers,
Scott

From sccolbert at gmail.com Thu Aug 13 02:14:30 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Thu, 13 Aug 2009 02:14:30 -0400
Subject: [Numpy-discussion] SciPy tutorials and talks will be recorded and posted online!
In-Reply-To: 
References: 
Message-ID: <7f014ea60908122314ia56a363wbbb01e39b1934c77@mail.gmail.com>

Thanks to everyone supporting this. I wish I could attend this year, and
I will be making it a point to attend next year. I am very grateful to
be able to catch the talks at this year's conference.

Thanks!

Chris

On Wed, Aug 12, 2009 at 6:27 PM, Fernando Perez wrote:
> Hi all,
>
> as you may recall, there have been recently a number of requests for
> videotaping the conference.
>
> I am very happy to announce that we will indeed have full video
> coverage this year of both tutorial tracks as well as the main talks
> (minus any specific talk where a speaker may object to being taped,
> since we'll respect such objections if they are made).
>
> Jeff Teeters and Kilian Koepsell from UC Berkeley, who have in the
> past made recordings like these of one of my workshops and a recent
> talk by Gael:
>
> http://www.archive.org/search.php?query=Fernando+Perez+scientific+python
> http://www.archive.org/details/ucb_py4science_2009_07_14_Gael_Varoquaux
>
> are going to perform the work.
>
> I'd like to sincerely thank:
>
> - Jeff and Kilian for offering to do this work and providing some of
> the recording equipment.
>
> - The Redwood Center for Theoretical Neuroscience, which provided
> other equipment.
>
> - Enthought, who are funding this on very short notice!!! (Especially
> Dave Peterson and Eric Jones, who tolerated my last minute nags very
> graciously). Without Enthought's last-minute support, this would
> simply not be happening.
>
> I really appreciate that everyone involved worked on short notice to
> make this possible, I hope the entire community will benefit from
> these resources being available.
>
> Best regards,
>
> f
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From jdh2358 at gmail.com Thu Aug 13 12:34:37 2009
From: jdh2358 at gmail.com (John Hunter)
Date: Thu, 13 Aug 2009 11:34:37 -0500
Subject: [Numpy-discussion] adaptive sampling of an interval or plane
In-Reply-To: <88e473830908120428u382060c8he45ef631f1bc63c0@mail.gmail.com>
References: <88e473830908120428u382060c8he45ef631f1bc63c0@mail.gmail.com>
Message-ID: <88e473830908130934o2c571467ub85ab9cb31a348d3@mail.gmail.com>

On Wed, Aug 12, 2009 at 6:28 AM, John Hunter wrote:
> We would like to add function plotting to mpl, but to do this right we
> need to be able to adaptively sample a function evaluated over an
> interval so that some tolerance condition is satisfied, perhaps with
> both a relative and absolute error tolerance condition. I am a bit
> out of my area of competency here, eg I do not know exactly how the
> tolerance condition should be specified, but I suspect some of you
> here may be experts on this. Does anyone have some code compatible
> with the BSD license, preferably based on numpy but we would consider
> an extension code or scipy solution, for doing this?
>
> The functionality we have in mind is provided in matlab with fplot
>
>   http://www.mathworks.com/access/helpdesk/help/techdoc/index.html?/access/helpdesk/help/techdoc/ref/fplot.html
>
> We would like 1D and 2D versions of this ideally. If anyone has some
> suggestions, let me know.

Denis Bzowy has replied to me off list with some adaptive spline
approximation code he is working on. He has documentation and code for
the 1D case, is preparing for the 2D case, and is seeking feedback.
He's having trouble posting to the list, and asked me to forward this,
so please make sure his email, included in this post, is in any replies

http://drop.io/denis_adaspline1

From dario.soto at gmail.com Thu Aug 13 15:59:48 2009
From: dario.soto at gmail.com (Dokuro)
Date: Fri, 14 Aug 2009 15:29:48 +1930
Subject: [Numpy-discussion] documentation translation
Message-ID: 

Hello, we have a python group in Maracaibo, Venezuela and have started a
translation project for as many python documentations as we can find,
from English to Spanish, making the use of python easier to non-English
speakers. So, on behalf of our group, I would like to ask if it won't be
a problem if we make these translations and keep them in our wiki

http://proyectociencia.org/Wiki/index.php/Grupo_de_Usuarios_de_Python

Dokuro.
From dwf at cs.toronto.edu Thu Aug 13 17:20:25 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Thu, 13 Aug 2009 17:20:25 -0400
Subject: [Numpy-discussion] SciPy2009 BoF Wiki Page
Message-ID: <0636F499-8CBC-4053-ACF9-7BA40E5D58D4@cs.toronto.edu>

I needed a short break from some heavy writing, so on Fernando's
suggestion I took to the task of aggregating together mailing list
traffic about the BoFs next week. So far, 4 have been proposed, and
I've written down under "attendees" the names of anyone who has
expressed interest (except in Perry's case, where I've only heard it
via proxy). The page is at

	http://scipy.org/SciPy2009/BoF

I've created sections below that are hyperlink targets for the topic of
each session; if someone more knowledgeable of that domain can fill in
those sections, please do.

Edit away, and see you next week! (And if someone can forward this to
the Matplotlib list, I'm not currently subscribed)

David

From gael.varoquaux at normalesup.org Thu Aug 13 17:41:08 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Thu, 13 Aug 2009 23:41:08 +0200
Subject: [Numpy-discussion] [IPython-user] SciPy2009 BoF Wiki Page
In-Reply-To: <0636F499-8CBC-4053-ACF9-7BA40E5D58D4@cs.toronto.edu>
References: <0636F499-8CBC-4053-ACF9-7BA40E5D58D4@cs.toronto.edu>
Message-ID: <20090813214108.GA9220@phare.normalesup.org>

On Thu, Aug 13, 2009 at 05:20:25PM -0400, David Warde-Farley wrote:
> I needed a short break from some heavy writing, so on Fernando's
> suggestion I took to the task of aggregating together mailing list
> traffic about the BoFs next week. So far, 4 have been proposed, and
> I've written down under "attendees" the names of anyone who has
> expressed interest (except in Perry's case, where I've only heard it
> via proxy). The page is at

> 	http://scipy.org/SciPy2009/BoF

Thank you, very useful. I have linked it from the BoF page on the
conference website.
Gaël

From gokhansever at gmail.com Thu Aug 13 18:36:09 2009
From: gokhansever at gmail.com (Gökhan Sever)
Date: Thu, 13 Aug 2009 17:36:09 -0500
Subject: [Numpy-discussion] build_clib error during Enable 3_2_1 installation
Message-ID: <49d6b3500908131536m6e5c4646qa0d1d03a1ea6c8c8@mail.gmail.com>

For some unknown reason, ets develop can't pass the following compilation
point:

g++: enthought/kiva/agg/src/kiva_rect.cpp
ar: adding 8 object files to build/temp.linux-i686-2.6/libkiva_src.a
running build_ext
build_clib already run, it is too late to ensure in-place build of build_clib
Traceback (most recent call last):
  File "setup.py", line 327, in
    **config
  File "/home/gsever/Desktop/python-repo/numpy/numpy/distutils/core.py", line 186, in setup
    return old_setup(**new_attr)
  File "/usr/lib/python2.6/distutils/core.py", line 152, in setup
    dist.run_commands()
  File "/usr/lib/python2.6/distutils/dist.py", line 975, in run_commands
    self.run_command(cmd)
  File "/usr/lib/python2.6/distutils/dist.py", line 995, in run_command
    cmd_obj.run()
  File "/home/gsever/Desktop/python-repo/numpy/numpy/distutils/command/build_ext.py", line 74, in run
    self.library_dirs.append(build_clib.build_clib)
UnboundLocalError: local variable 'build_clib' referenced before assignment
Traceback (most recent call last):
  File "/usr/bin/ets", line 8, in
    load_entry_point('ETSProjectTools==0.6.0.dev-r24434', 'console_scripts', 'ets')()
  File "/usr/lib/python2.6/site-packages/ETSProjectTools-0.5.1-py2.6.egg/enthought/ets/ets.py", line 152, in main
    args.func(args, cfg)
  File "/usr/lib/python2.6/site-packages/ETSProjectTools-0.5.1-py2.6.egg/enthought/ets/develop.py", line 76, in main
    checkouts.perform(command, dry_run=args.dry_run)
  File "/usr/lib/python2.6/site-packages/ETSProjectTools-0.5.1-py2.6.egg/enthought/ets/tools/checkouts.py", line 126, in perform
    '%s' % project)
RuntimeError: Unable to complete command for project:
/home/gsever/Desktop/python-repo/ETS_3.3.1/Enable_3.2.1

Any suggestions?

##################################################################
[gsever at ccn Desktop]$ python -c 'from numpy.f2py.diagnose import run; run()'
##################################################################
------
os.name='posix'
------
sys.platform='linux2'
------
sys.version:
2.6 (r26:66714, Jun 8 2009, 16:07:26)
[GCC 4.4.0 20090506 (Red Hat 4.4.0-4)]
------
sys.prefix:
/usr
------
sys.path=':/usr/lib/python2.6/site-packages/foolscap-0.4.2-py2.6.egg:/usr/lib/python2.6/site-packages/Twisted-8.2.0-py2.6-linux-i686.egg:/home/gsever/Desktop/python-repo/ipython:/home/gsever/Desktop/python-repo/numpy:/home/gsever/Desktop/python-repo/matplotlib/lib:/usr/lib/python2.6/site-packages/Sphinx-0.6.2-py2.6.egg:/usr/lib/python2.6/site-packages/docutils-0.5-py2.6.egg:/usr/lib/python2.6/site-packages/Jinja2-2.1.1-py2.6-linux-i686.egg:/usr/lib/python2.6/site-packages/Pygments-1.0-py2.6.egg:/usr/lib/python2.6/site-packages/xlwt-0.7.2-py2.6.egg:/usr/lib/python2.6/site-packages/spyder-1.0.0beta1-py2.6.egg:/usr/lib/python2.6/site-packages/PyOpenGL-3.0.0c1-py2.6.egg:/home/gsever/Desktop/python-repo/ETS_3.3.1/EnthoughtBase_3.0.4:/home/gsever/Desktop/python-repo/ETS_3.3.1/TraitsBackendWX_3.2.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/ETSProjectTools_0.6.0:/home/gsever/Desktop/python-repo/ETS_3.3.1/Chaco_3.2.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/ETS_3.3.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/TraitsGUI_3.1.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/Traits_3.2.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/BlockCanvas_3.1.1:/usr/lib/python26.zip:/usr/lib/python2.6:/usr/lib/python2.6/plat-linux2:/usr/lib/python2.6/lib-tk:/usr/lib/python2.6/lib-old:/usr/lib/python2.6/lib-dynload:/usr/lib/python2.6/site-packages:/usr/lib/python2.6/site-packages/Numeric:/usr/lib/python2.6/site-packages/PIL:/usr/lib/python2.6/site-packages/gst-0.10:/usr/lib/python2.6/site-packages/gtk-2.0:/usr/lib/python2.6/site-packages:/usr/lib/python2.6/site-packages/wx-2.8-gtk2-unicode'
------
Failed to import numarray: No module named numarray
Found Numeric version '24.2' in /usr/lib/python2.6/site-packages/Numeric/Numeric.pyc
Found new numpy version '1.4.0.dev' in /home/gsever/Desktop/python-repo/numpy/numpy/__init__.pyc
Found f2py2e version '2' in /home/gsever/Desktop/python-repo/numpy/numpy/f2py/f2py2e.pyc
Found numpy.distutils version '0.4.0' in '/home/gsever/Desktop/python-repo/numpy/numpy/distutils/__init__.pyc'
------
Importing numpy.distutils.fcompiler ... ok
------
Checking availability of supported Fortran compilers:
GnuFCompiler instance properties:
  archiver        = ['/usr/bin/g77', '-cr']
  compile_switch  = '-c'
  compiler_f77    = ['/usr/bin/g77', '-g', '-Wall', '-fno-second-underscore', '-fPIC', '-O3', '-funroll-loops']
  compiler_f90    = None
  compiler_fix    = None
  libraries       = ['g2c']
  library_dirs    = []
  linker_exe      = ['/usr/bin/g77', '-g', '-Wall', '-g', '-Wall']
  linker_so       = ['/usr/bin/g77', '-g', '-Wall', '-g', '-Wall', '-shared']
  object_switch   = '-o '
  ranlib          = ['/usr/bin/g77']
  version         = LooseVersion ('3.4.6')
  version_cmd     = ['/usr/bin/g77', '--version']
Gnu95FCompiler instance properties:
  archiver        = ['/usr/bin/gfortran', '-cr']
  compile_switch  = '-c'
  compiler_f77    = ['/usr/bin/gfortran', '-Wall', '-ffixed-form', '-fno-second-underscore', '-fPIC', '-O3', '-funroll-loops']
  compiler_f90    = ['/usr/bin/gfortran', '-Wall', '-fno-second-underscore', '-fPIC', '-O3', '-funroll-loops']
  compiler_fix    = ['/usr/bin/gfortran', '-Wall', '-ffixed-form', '-fno-second-underscore', '-Wall', '-fno-second-underscore', '-fPIC', '-O3', '-funroll-loops']
  libraries       = ['gfortran']
  library_dirs    = []
  linker_exe      = ['/usr/bin/gfortran', '-Wall', '-Wall']
  linker_so       = ['/usr/bin/gfortran', '-Wall', '-Wall', '-shared']
  object_switch   = '-o '
  ranlib          = ['/usr/bin/gfortran']
  version         = LooseVersion ('4.4.0')
  version_cmd     = ['/usr/bin/gfortran', '--version']
Fortran compilers found:
  --fcompiler=gnu    GNU Fortran 77 compiler (3.4.6)
  --fcompiler=gnu95  GNU Fortran 95 compiler (4.4.0)
Compilers available for this platform, but not found:
  --fcompiler=absoft   Absoft Corp Fortran Compiler
  --fcompiler=compaq   Compaq Fortran Compiler
  --fcompiler=g95      G95 Fortran Compiler
  --fcompiler=intel    Intel Fortran Compiler for 32-bit apps
  --fcompiler=intele   Intel Fortran Compiler for Itanium apps
  --fcompiler=intelem  Intel Fortran Compiler for EM64T-based apps
  --fcompiler=lahey    Lahey/Fujitsu Fortran 95 Compiler
  --fcompiler=nag      NAGWare Fortran 95 Compiler
  --fcompiler=pg       Portland Group Fortran Compiler
  --fcompiler=vast     Pacific-Sierra Research Fortran 90 Compiler
Compilers not available on this platform:
  --fcompiler=hpux     HP Fortran 90 Compiler
  --fcompiler=ibm      IBM XL Fortran Compiler
  --fcompiler=intelev  Intel Visual Fortran Compiler for Itanium apps
  --fcompiler=intelv   Intel Visual Fortran Compiler for 32-bit apps
  --fcompiler=mips     MIPSpro Fortran Compiler
  --fcompiler=none     Fake Fortran compiler
  --fcompiler=sun      Sun or Forte Fortran 95 Compiler
For compiler details, run 'config_fc --verbose' setup command.
------
Importing numpy.distutils.cpuinfo ... ok
------
CPU information: CPUInfoBase__get_nbits getNCPUs has_mmx has_sse
has_sse2 has_sse3 has_ssse3 is_32bit is_Intel is_i686
------

-- 
Gökhan

From robert.kern at gmail.com Thu Aug 13 18:36:43 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 13 Aug 2009 17:36:43 -0500
Subject: [Numpy-discussion] documentation translation
In-Reply-To: 
References: 
Message-ID: <3d375d730908131536t2fa3f8ebjb00f5b3363f60b64@mail.gmail.com>

On Thu, Aug 13, 2009 at 14:59, Dokuro wrote:
> Hello, we have a python group in Maracaibo, Venezuela and have started a
> translation project for as many python documentations as we can find,
> from English to Spanish, making the use of python easier to non-English
> speakers. So, on behalf of our group, I would like to ask if it won't be
> a problem if we make these translations and keep them in our wiki
> http://proyectociencia.org/Wiki/index.php/Grupo_de_Usuarios_de_Python

Please do. Thank you!
-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From fperez.net at gmail.com Thu Aug 13 18:59:21 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Thu, 13 Aug 2009 15:59:21 -0700
Subject: [Numpy-discussion] [IPython-user] SciPy2009 BoF Wiki Page
In-Reply-To: <0636F499-8CBC-4053-ACF9-7BA40E5D58D4@cs.toronto.edu>
References: <0636F499-8CBC-4053-ACF9-7BA40E5D58D4@cs.toronto.edu>
Message-ID: 

On Thu, Aug 13, 2009 at 2:20 PM, David Warde-Farley wrote:
> I needed a short break from some heavy writing, so on Fernando's
> suggestion I took to the task of aggregating together mailing list
> traffic about the BoFs next week. So far, 4 have been proposed, and
> I've written down under "attendees" the names of anyone who has
> expressed interest (except in Perry's case, where I've only heard it
> via proxy). The page is at
>
>        http://scipy.org/SciPy2009/BoF
>
> I've created sections below that are hyperlink targets for the topic
> of each session; if someone more knowledgeable of that domain can fill
> in those sections, please do.

Fantastic! Many thanks.

> Edit away, and see you next week! (And if someone can forward this to
> the Matplotlib list, I'm not currently subscribed)

I'll send that now.

Cheers,

f

From jeremy.mayes at gmail.com Fri Aug 14 07:39:21 2009
From: jeremy.mayes at gmail.com (Jeremy Mayes)
Date: Fri, 14 Aug 2009 06:39:21 -0500
Subject: [Numpy-discussion] ndarray subclass causes memory leak
In-Reply-To: 
References: 
Message-ID: <890c2bf00908140439mc0ad63en5277882db506efea@mail.gmail.com>

This simple example causes a massive leak in v1.3.0 which didn't exist
in v1.0.1. What am I doing wrong? If I replace

arr = [Array((2,2)), Array((2,2))]

with

arr = [numpy.ndarray((2,2,)), numpy.ndarray((2,2))]

then I don't have the leak.

import numpy
import gc

class Array(numpy.ndarray):
    def __new__(subtype, shape, dtype=float, buffer=None, offset=0,
                strides=None, order=None, info=None):
        return numpy.ndarray.__new__(subtype, shape, dtype, buffer,
                                     offset, strides, order)

    def __array_finalize__(self, obj):
        print 'called array_finalize'

if __name__=='__main__':
    arr = [Array((2,2)), Array((2,2))]

    nbytesAllocated = 0
    for i in xrange(1000000000):
        a = numpy.array(arr)
        nbytesAllocated += a.nbytes
        if i%1000 == 0:
            print 'allocated %s' % nbytesAllocated
            gc.collect()

-- 
--jlm
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From markbak at gmail.com Fri Aug 14 08:27:23 2009
From: markbak at gmail.com (Mark Bakker)
Date: Fri, 14 Aug 2009 14:27:23 +0200
Subject: [Numpy-discussion] finding range of values below threshold in sorted array
Message-ID: <6946b9500908140527i4c9096adtd87b73b6b1c11171@mail.gmail.com>

Hello List,

I am trying to find a quick way to do the following:

I have a *sorted* array of real numbers, say array A, sorted in
ascending order (but it is easy to store it descending if that would
help). I want to find all numbers below a certain value, say b.

Sure, I can do

A < b

and I will get back an array with a bunch of Trues and then a bunch of
Falses, but all I need is the highest index for which A[i] < b, since A
is sorted.

Does anybody know a quick way to do this? I need to do it a lot, so the
quicker the better.

Thanks,

Mark
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From emmanuelle.gouillart at normalesup.org Fri Aug 14 08:49:04 2009
From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart)
Date: Fri, 14 Aug 2009 14:49:04 +0200
Subject: [Numpy-discussion] finding range of values below threshold in sorted array
In-Reply-To: <6946b9500908140527i4c9096adtd87b73b6b1c11171@mail.gmail.com>
References: <6946b9500908140527i4c9096adtd87b73b6b1c11171@mail.gmail.com>
Message-ID: <20090814124904.GF25831@phare.normalesup.org>

Hi,

ind = np.searchsorted(A, b)
values = A[:ind]

Cheers,
Emmanuelle

On Fri, Aug 14, 2009 at 02:27:23PM +0200, Mark Bakker wrote:
> Hello List,
> I am trying to find a quick way to do the following:
> I have a *sorted* array of real numbers, say array A, sorted in
> ascending order (but it is easy to store it descending if that would
> help). I want to find all numbers below a certain value, say b.
> Sure, I can do
> A < b
> and I will get back an array with a bunch of Trues and then a bunch of
> Falses, but all I need is the highest index for which A[i] < b, since A
> is sorted.
> Does anybody know a quick way to do this? I need to do it a lot, so the
> quicker the better.
> Thanks,
> Mark
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From dagss at student.matnat.uio.no Fri Aug 14 09:04:09 2009
From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn)
Date: Fri, 14 Aug 2009 15:04:09 +0200
Subject: [Numpy-discussion] Cython BOF
Message-ID: <4A8560C9.40302@student.matnat.uio.no>

There's been some discussion earlier about how starting to write bigger
parts of the NumPy/SciPy codebase in Cython could potentially lower the
barrier of entry.

Some topics:
 * Move towards PEP3118 as the primary scientific "data container"
rather than ndarray?
 * Cython templates?
 * Native SIMD in Cython -- good or bad? (I don't know myself, to be
honest.)
 * Future direction in general

I'll add a Cython BOF to the wiki if somebody expresses interest.

-- 
Dag Sverre

From dsdale24 at gmail.com Fri Aug 14 09:22:25 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Fri, 14 Aug 2009 09:22:25 -0400
Subject: [Numpy-discussion] Cython BOF
In-Reply-To: <4A8560C9.40302@student.matnat.uio.no>
References: <4A8560C9.40302@student.matnat.uio.no>
Message-ID: 

On Fri, Aug 14, 2009 at 9:04 AM, Dag Sverre Seljebotn wrote:
> There's been some discussion earlier about how starting to write bigger
> parts of the NumPy/SciPy codebase in Cython could potentially lower the
> barrier of entry.
>
> Some topics:
>  * Move towards PEP3118 as the primary scientific "data container"
> rather than ndarray?
>  * Cython templates?
>  * Native SIMD in Cython -- good or bad? (I don't know myself, to be
> honest.)
>  * Future direction in general
>
> I'll add a Cython BOF to the wiki if somebody expresses interest.

I'm relatively new to contributing to numpy, so I probably would not be
able to provide a lot of input, but I would be interested in attending.

Darren

From kwmsmith at gmail.com Fri Aug 14 10:13:43 2009
From: kwmsmith at gmail.com (Kurt Smith)
Date: Fri, 14 Aug 2009 09:13:43 -0500
Subject: [Numpy-discussion] Cython BOF
In-Reply-To: <4A8560C9.40302@student.matnat.uio.no>
References: <4A8560C9.40302@student.matnat.uio.no>
Message-ID: 

On Fri, Aug 14, 2009 at 8:04 AM, Dag Sverre Seljebotn wrote:
> There's been some discussion earlier about how starting to write bigger
> parts of the NumPy/SciPy codebase in Cython could potentially lower the
> barrier of entry.
>
> Some topics:
>  * Move towards PEP3118 as the primary scientific "data container"
> rather than ndarray?
>  * Cython templates?
>  * Native SIMD in Cython -- good or bad? (I don't know myself, to be
> honest.)
>  * Future direction in general
>
> I'll add a Cython BOF to the wiki if somebody expresses interest.

+1. Definitely interested.

>
> --
> Dag Sverre
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From jeremy.mayes at gmail.com Fri Aug 14 12:00:07 2009
From: jeremy.mayes at gmail.com (Jeremy Mayes)
Date: Fri, 14 Aug 2009 11:00:07 -0500
Subject: [Numpy-discussion] ndarray subclass causes memory leak
In-Reply-To: <890c2bf00908140434s64a98fbet5aa87cd0e6a818d3@mail.gmail.com>
References: <890c2bf00908140434s64a98fbet5aa87cd0e6a818d3@mail.gmail.com>
Message-ID: <890c2bf00908140900t5c73091fma462dedd4997d7df@mail.gmail.com>

I've narrowed this down to a change between 1.0.4 and 1.1.0. Valgrind
(of v1.3.0) shows the following result. The change was in
setArrayFromSequence, where PyArray_EnsureArray gets invoked in v1.1.0
where it did not in v1.0.4.

==10132== 4,474,768 (3,197,200 direct, 1,277,568 indirect) bytes in 39,965 blocks are definitely lost in loss record 36 of 36
==10132==    at 0x4905B65: malloc (vg_replace_malloc.c:149)
==10132==    by 0x7FFFDAA: array_alloc (arrayobject.c:7387)
==10132==    by 0x8003465: PyArray_NewFromDescr (arrayobject.c:5900)
==10132==    by 0x802434D: PyArray_EnsureArray (multiarraymodule.c:226)
==10132==    by 0x8025766: setArrayFromSequence (arrayobject.c:7938)
==10132==    by 0x80256B8: setArrayFromSequence (arrayobject.c:7957)
==10132==    by 0x800F5FD: PyArray_FromAny (arrayobject.c:7984)
==10132==    by 0x802CC21: PyArray_CheckFromAny (arrayobject.c:9530)
==10132==    by 0x8037F64: _array_fromobject (multiarraymodule.c:6329)
==10132==    by 0x4AC9501: PyEval_EvalFrameEx (ceval.c:3612)
==10132==    by 0x4ACA894: PyEval_EvalCodeEx (ceval.c:2875)
==10132==    by 0x4ACAA11: PyEval_EvalCode (ceval.c:514)
==10132==    by 0x4AEC98B: PyRun_FileExFlags (pythonrun.c:1273)
==10132==    by 0x4AED612: PyRun_SimpleFileExFlags (pythonrun.c:879)
==10132==    by 0x4AF88C7: Py_Main (main.c:532)
==10132==    by 0x38C821C3FA: (below main) (in /lib64/tls/libc-2.3.4.so)
==10132==

On Fri, Aug 14, 2009 at 6:34 AM, Jeremy Mayes wrote:
> This simple example causes a massive leak in v1.3.0 which didn't exist
> in v1.0.1. What am I doing wrong? If I replace
>
> arr = [Array((2,2)), Array((2,2))]
>
> with
>
> arr = [numpy.ndarray((2,2,)), numpy.ndarray((2,2))]
>
> then I don't have the leak.
>
> import numpy
> import gc
>
> class Array(numpy.ndarray):
>     def __new__(subtype, shape, dtype=float, buffer=None, offset=0,
>                 strides=None, order=None, info=None):
>         return numpy.ndarray.__new__(subtype, shape, dtype, buffer,
>                                      offset, strides, order)
>
>     def __array_finalize__(self, obj):
>         print 'called array_finalize'
>
> if __name__=='__main__':
>     arr = [Array((2,2)), Array((2,2))]
>
>     nbytesAllocated = 0
>     for i in xrange(1000000000):
>         a = numpy.array(arr)
>         nbytesAllocated += a.nbytes
>         if i%1000 == 0:
>             print 'allocated %s' % nbytesAllocated
>             gc.collect()
>
> --
> --jlm

-- 
--jlm
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From jdh2358 at gmail.com Fri Aug 14 14:05:47 2009
From: jdh2358 at gmail.com (John Hunter)
Date: Fri, 14 Aug 2009 13:05:47 -0500
Subject: [Numpy-discussion] masked index surprise
Message-ID: <88e473830908141105i230bfc4cof1f4dc77541bf806@mail.gmail.com>

I just tracked down a subtle bug in my code, which is equivalent to

In [64]: x, y = np.random.rand(2, n)

In [65]: z = np.zeros_like(x)

In [66]: mask = x>0.5

In [67]: z[mask] = x/y

I meant to write

  z[mask] = x[mask]/y[mask]

so I can fix my code, but why is line 67 allowed

  In [68]: z[mask].shape
  Out[68]: (54,)

  In [69]: (x/y).shape
  Out[69]: (100,)

it seems like broadcasting would fail

In [70]: np.__version__
Out[70]: '1.4.0.dev7153'

In [71]:

From robert.kern at gmail.com Fri Aug 14 14:52:30 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 14 Aug 2009 13:52:30 -0500
Subject: [Numpy-discussion] masked index surprise
In-Reply-To: <88e473830908141105i230bfc4cof1f4dc77541bf806@mail.gmail.com>
References: <88e473830908141105i230bfc4cof1f4dc77541bf806@mail.gmail.com>
Message-ID: <3d375d730908141152g1d304b76u9e5f0a1b8070f8d2@mail.gmail.com>

On Fri, Aug 14, 2009 at 13:05, John Hunter wrote:
> I just tracked down a subtle bug in my code, which is equivalent to
>
> In [64]: x, y = np.random.rand(2, n)
>
> In [65]: z = np.zeros_like(x)
>
> In [66]: mask = x>0.5
>
> In [67]: z[mask] = x/y
>
> I meant to write
>
>  z[mask] = x[mask]/y[mask]
>
> so I can fix my code, but why is line 67 allowed
>
>  In [68]: z[mask].shape
>  Out[68]: (54,)
>
>  In [69]: (x/y).shape
>  Out[69]: (100,)
>
> it seems like broadcasting would fail

Broadcasting doesn't take place with boolean masks. Instead, the
values repeat if there are too few and extra values are ignored.
Boolean indexing derives from Numeric's putmask() implementation,
which had these semantics, rather than other forms of indexing.

You may consider this a wart or a bad design decision (and I would
probably agree), but it is not a bug.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From kwgoodman at gmail.com Fri Aug 14 15:20:33 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Fri, 14 Aug 2009 12:20:33 -0700
Subject: [Numpy-discussion] masked index surprise
In-Reply-To: <3d375d730908141152g1d304b76u9e5f0a1b8070f8d2@mail.gmail.com>
References: <88e473830908141105i230bfc4cof1f4dc77541bf806@mail.gmail.com>
	<3d375d730908141152g1d304b76u9e5f0a1b8070f8d2@mail.gmail.com>
Message-ID: 

On Fri, Aug 14, 2009 at 11:52 AM, Robert Kern wrote:
> On Fri, Aug 14, 2009 at 13:05, John Hunter wrote:
>> I just tracked down a subtle bug in my code, which is equivalent to
>>
>> In [64]: x, y = np.random.rand(2, n)
>>
>> In [65]: z = np.zeros_like(x)
>>
>> In [66]: mask = x>0.5
>>
>> In [67]: z[mask] = x/y
>>
>> I meant to write
>>
>>  z[mask] = x[mask]/y[mask]
>>
>> so I can fix my code, but why is line 67 allowed
>>
>>  In [68]: z[mask].shape
>>  Out[68]: (54,)
>>
>>  In [69]: (x/y).shape
>>  Out[69]: (100,)
>>
>> it seems like broadcasting would fail
>
> Broadcasting doesn't take place with boolean masks. Instead, the
> values repeat if there are too few and extra values are ignored.
> Boolean indexing derives from Numeric's putmask() implementation,
> which had these semantics, rather than other forms of indexing.
>
> You may consider this a wart or a bad design decision (and I would
> probably agree), but it is not a bug.
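If I follow, then an assignment like the one below silently drops the
extra value instead of raising (untested on my side; I am only going by
the putmask semantics you describe):

import numpy as np

z = np.zeros(4)
mask = np.array([True, False, True, False])
# two True positions but three values: the trailing 3.0 is ignored
z[mask] = np.array([1.0, 2.0, 3.0])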
Are the last two, x[[1]] and x[np.array([1])], broadcasting?

>> x = np.array([1,2,3])
>> x[1] = np.array([4,5,6])
ValueError: setting an array element with a sequence.
>> x[(1,)] = np.array([4,5,6])
ValueError: array dimensions are not compatible for copy
>> x[[1]] = np.array([4,5,6])
>> x
   array([1, 4, 3])
>> x[np.array([1])] = np.array([4,5,6])
>> x
   array([1, 4, 3])

From robert.kern at gmail.com Fri Aug 14 15:24:15 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 14 Aug 2009 14:24:15 -0500
Subject: [Numpy-discussion] masked index surprise
In-Reply-To: 
References: <88e473830908141105i230bfc4cof1f4dc77541bf806@mail.gmail.com>
	<3d375d730908141152g1d304b76u9e5f0a1b8070f8d2@mail.gmail.com>
Message-ID: <3d375d730908141224t786b91e8q85cab49a824837c@mail.gmail.com>

On Fri, Aug 14, 2009 at 14:20, Keith Goodman wrote:
> On Fri, Aug 14, 2009 at 11:52 AM, Robert Kern wrote:
>> On Fri, Aug 14, 2009 at 13:05, John Hunter wrote:
>>> I just tracked down a subtle bug in my code, which is equivalent to
>>>
>>> In [64]: x, y = np.random.rand(2, n)
>>>
>>> In [65]: z = np.zeros_like(x)
>>>
>>> In [66]: mask = x>0.5
>>>
>>> In [67]: z[mask] = x/y
>>>
>>> I meant to write
>>>
>>>  z[mask] = x[mask]/y[mask]
>>>
>>> so I can fix my code, but why is line 67 allowed
>>>
>>>  In [68]: z[mask].shape
>>>  Out[68]: (54,)
>>>
>>>  In [69]: (x/y).shape
>>>  Out[69]: (100,)
>>>
>>> it seems like broadcasting would fail
>>
>> Broadcasting doesn't take place with boolean masks. Instead, the
>> values repeat if there are too few and extra values are ignored.
>> Boolean indexing derives from Numeric's putmask() implementation,
>> which had these semantics, rather than other forms of indexing.
>>
>> You may consider this a wart or a bad design decision (and I would
>> probably agree), but it is not a bug.
>
> Are the last two, x[[1]] and x[np.array([1])], broadcasting?
>
>>> x = np.array([1,2,3])
>>> x[1] = np.array([4,5,6])
> ValueError: setting an array element with a sequence.
>>> x[(1,)] = np.array([4,5,6])
> ValueError: array dimensions are not compatible for copy
>>> x[[1]] = np.array([4,5,6])
>>> x
>    array([1, 4, 3])
>>> x[np.array([1])] = np.array([4,5,6])
>>> x
>    array([1, 4, 3])

I guess I'm just makin' stuff up again. kern_is_right() == False. All
forms repeat, not broadcast, since they derive from put() and
putmask() which both have the repeating/ignoring semantics.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From kwgoodman at gmail.com Fri Aug 14 15:45:44 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Fri, 14 Aug 2009 12:45:44 -0700
Subject: [Numpy-discussion] masked index surprise
In-Reply-To: <3d375d730908141224t786b91e8q85cab49a824837c@mail.gmail.com>
References: <88e473830908141105i230bfc4cof1f4dc77541bf806@mail.gmail.com>
	<3d375d730908141152g1d304b76u9e5f0a1b8070f8d2@mail.gmail.com>
	<3d375d730908141224t786b91e8q85cab49a824837c@mail.gmail.com>
Message-ID: 

On Fri, Aug 14, 2009 at 12:24 PM, Robert Kern wrote:
> On Fri, Aug 14, 2009 at 14:20, Keith Goodman wrote:
>> On Fri, Aug 14, 2009 at 11:52 AM, Robert Kern wrote:
>>> On Fri, Aug 14, 2009 at 13:05, John Hunter wrote:
>>>> I just tracked down a subtle bug in my code, which is equivalent to
>>>>
>>>> In [64]: x, y = np.random.rand(2, n)
>>>>
>>>> In [65]: z = np.zeros_like(x)
>>>>
>>>> In [66]: mask = x>0.5
>>>>
>>>> In [67]: z[mask] = x/y
>>>>
>>>> I meant to write
>>>>
>>>>  z[mask] = x[mask]/y[mask]
>>>>
>>>> so I can fix my code, but why is line 67 allowed
>>>>
>>>>  In [68]: z[mask].shape
>>>>  Out[68]: (54,)
>>>>
>>>>  In [69]: (x/y).shape
>>>>  Out[69]: (100,)
>>>>
>>>> it seems like broadcasting would fail
>>>
>>> Broadcasting doesn't take place with boolean masks. Instead, the
>>> values repeat if there are too few and extra values are ignored.
>>> Boolean indexing derives from Numeric's putmask() implementation,
>>> which had these semantics, rather than other forms of indexing.
>>>
>>> You may consider this a wart or a bad design decision (and I would
>>> probably agree), but it is not a bug.
>>
>> Are the last two, x[[1]] and x[np.array([1])], broadcasting?
>>
>>>> x = np.array([1,2,3])
>>>> x[1] = np.array([4,5,6])
>> ValueError: setting an array element with a sequence.
>>>> x[(1,)] = np.array([4,5,6])
>> ValueError: array dimensions are not compatible for copy
>>>> x[[1]] = np.array([4,5,6])
>>>> x
>>    array([1, 4, 3])
>>>> x[np.array([1])] = np.array([4,5,6])
>>>> x
>>    array([1, 4, 3])
>
> I guess I'm just makin' stuff up again. kern_is_right() == False. All
> forms repeat, not broadcast, since they derive from put() and
> putmask() which both have the repeating/ignoring semantics.

The ignoring scares me. If the dimensions aren't compatible I'd much
rather get a ValueError. Does anyone have a use case for ignoring?
(Besides ignoring my email.)

From dwf at cs.toronto.edu Fri Aug 14 17:09:31 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Fri, 14 Aug 2009 17:09:31 -0400
Subject: [Numpy-discussion] Cython BOF
In-Reply-To: <4A8560C9.40302@student.matnat.uio.no>
References: <4A8560C9.40302@student.matnat.uio.no>
Message-ID: <0B00D87F-1950-4106-BDBE-30B60B7305A8@cs.toronto.edu>

+1. The topics (especially native SIMD) sound fantastic so far.

David

On 14-Aug-09, at 9:04 AM, Dag Sverre Seljebotn wrote:
> There's been some discussion earlier about how starting to write
> bigger parts of the NumPy/SciPy codebase in Cython could potentially
> lower the barrier of entry.
>
> Some topics:
> * Move towards PEP3118 as the primary scientific "data container"
> rather than ndarray?
> * Cython templates?
> * Native SIMD in Cython -- good or bad? (I don't know myself, to be
> honest.)
> * Future direction in general
>
> I'll add a Cython BOF to the wiki if somebody expresses interest.
>
> --
> Dag Sverre
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From gokhansever at gmail.com Fri Aug 14 17:11:57 2009
From: gokhansever at gmail.com (Gökhan Sever)
Date: Fri, 14 Aug 2009 16:11:57 -0500
Subject: [Numpy-discussion] build_clib error during Enable 3_2_1 installation
In-Reply-To: <49d6b3500908131536m6e5c4646qa0d1d03a1ea6c8c8@mail.gmail.com>
References: <49d6b3500908131536m6e5c4646qa0d1d03a1ea6c8c8@mail.gmail.com>
Message-ID: <49d6b3500908141411h65e0f59fue358524f5d9e5ed7@mail.gmail.com>

Hello,

I fixed the scipy installation issue. I usually check out the whole ETS
trunk (ets co ETS) and do an "ets develop". After fulfilling the
requirements it was always building the whole code-stack successfully
(at least always in Fedora 10). In this case it fails at the Enable
compilation step. I don't know whether this error:

"build_clib already run, it is too late to ensure in-place build of
build_clib"

is due to my system files or a conflict with the installed Python tools.

One more thing to note; there is a build_clib.py file under
/usr/lib/python2.6/distutils/command. Might this be due to a conflict
between numpy's distutils and Python's distutils?

On Thu, Aug 13, 2009 at 5:36 PM, Gökhan Sever wrote:
> For some unknown reason, ets develop can't pass the following
> compilation point:
>
> g++: enthought/kiva/agg/src/kiva_rect.cpp
> ar: adding 8 object files to build/temp.linux-i686-2.6/libkiva_src.a
> running build_ext
> build_clib already run, it is too late to ensure in-place build of
> build_clib
> Traceback (most recent call last):
>   File "setup.py", line 327, in
>     **config
>   File "/home/gsever/Desktop/python-repo/numpy/numpy/distutils/core.py", line 186, in setup
>     return old_setup(**new_attr)
>   File "/usr/lib/python2.6/distutils/core.py", line 152, in setup
>     dist.run_commands()
>   File "/usr/lib/python2.6/distutils/dist.py", line 975, in run_commands
>     self.run_command(cmd)
>   File "/usr/lib/python2.6/distutils/dist.py", line 995, in run_command
>     cmd_obj.run()
>   File "/home/gsever/Desktop/python-repo/numpy/numpy/distutils/command/build_ext.py", line 74, in run
>     self.library_dirs.append(build_clib.build_clib)
> UnboundLocalError: local variable 'build_clib' referenced before assignment
> Traceback (most recent call last):
>   File "/usr/bin/ets", line 8, in
>     load_entry_point('ETSProjectTools==0.6.0.dev-r24434', 'console_scripts', 'ets')()
>   File "/usr/lib/python2.6/site-packages/ETSProjectTools-0.5.1-py2.6.egg/enthought/ets/ets.py", line 152, in main
>     args.func(args, cfg)
>   File "/usr/lib/python2.6/site-packages/ETSProjectTools-0.5.1-py2.6.egg/enthought/ets/develop.py", line 76, in main
>     checkouts.perform(command, dry_run=args.dry_run)
>   File "/usr/lib/python2.6/site-packages/ETSProjectTools-0.5.1-py2.6.egg/enthought/ets/tools/checkouts.py", line 126, in perform
>     '%s' % project)
> RuntimeError: Unable to complete command for project:
> /home/gsever/Desktop/python-repo/ETS_3.3.1/Enable_3.2.1
>
> Any suggestions?
>
> ##################################################################
> [gsever at ccn Desktop]$ python -c 'from numpy.f2py.diagnose import run; run()'
> ##################################################################
> ------
> os.name='posix'
> ------
> sys.platform='linux2'
> ------
> sys.version:
> 2.6 (r26:66714, Jun 8 2009, 16:07:26)
> [GCC 4.4.0 20090506 (Red Hat 4.4.0-4)]
> ------
> sys.prefix:
> /usr
> ------
> sys.path=':/usr/lib/python2.6/site-packages/foolscap-0.4.2-py2.6.egg:/usr/lib/python2.6/site-packages/Twisted-8.2.0-py2.6-linux-i686.egg:/home/gsever/Desktop/python-repo/ipython:/home/gsever/Desktop/python-repo/numpy:/home/gsever/Desktop/python-repo/matplotlib/lib:/usr/lib/python2.6/site-packages/Sphinx-0.6.2-py2.6.egg:/usr/lib/python2.6/site-packages/docutils-0.5-py2.6.egg:/usr/lib/python2.6/site-packages/Jinja2-2.1.1-py2.6-linux-i686.egg:/usr/lib/python2.6/site-packages/Pygments-1.0-py2.6.egg:/usr/lib/python2.6/site-packages/xlwt-0.7.2-py2.6.egg:/usr/lib/python2.6/site-packages/spyder-1.0.0beta1-py2.6.egg:/usr/lib/python2.6/site-packages/PyOpenGL-3.0.0c1-py2.6.egg:/home/gsever/Desktop/python-repo/ETS_3.3.1/EnthoughtBase_3.0.4:/home/gsever/Desktop/python-repo/ETS_3.3.1/TraitsBackendWX_3.2.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/ETSProjectTools_0.6.0:/home/gsever/Desktop/python-repo/ETS_3.3.1/Chaco_3.2.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/ETS_3.3.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/TraitsGUI_3.1.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/Traits_3.2.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/BlockCanvas_3.1.1:/usr/lib/python26.zip:/usr/lib/python2.6:/usr/lib/python2.6/plat-linux2:/usr/lib/python2.6/lib-tk:/usr/lib/python2.6/lib-old:/usr/lib/python2.6/lib-dynload:/usr/lib/python2.6/site-packages:/usr/lib/python2.6/site-packages/Numeric:/usr/lib/python2.6/site-packages/PIL:/usr/lib/python2.6/site-packages/gst-0.10:/usr/lib/python2.6/site-packages/gtk-2.0:/usr/lib/python2.6/site-packages:/usr/lib/python2.6/site-packages/wx-2.8-gtk2-unicode'
> ------
> Failed to import numarray: No module named numarray
> Found Numeric version '24.2' in /usr/lib/python2.6/site-packages/Numeric/Numeric.pyc
> Found new numpy version '1.4.0.dev' in /home/gsever/Desktop/python-repo/numpy/numpy/__init__.pyc
> Found f2py2e version '2' in /home/gsever/Desktop/python-repo/numpy/numpy/f2py/f2py2e.pyc
> Found numpy.distutils version '0.4.0' in '/home/gsever/Desktop/python-repo/numpy/numpy/distutils/__init__.pyc'
> ------
> Importing numpy.distutils.fcompiler ...
ok > ------ > Checking availability of supported Fortran compilers: > GnuFCompiler instance properties: > archiver = ['/usr/bin/g77', '-cr'] > compile_switch = '-c' > compiler_f77 = ['/usr/bin/g77', '-g', '-Wall', '-fno-second- > underscore', '-fPIC', '-O3', '-funroll-loops'] > compiler_f90 = None > compiler_fix = None > libraries = ['g2c'] > library_dirs = [] > linker_exe = ['/usr/bin/g77', '-g', '-Wall', '-g', '-Wall'] > linker_so = ['/usr/bin/g77', '-g', '-Wall', '-g', '-Wall', '- > shared'] > object_switch = '-o ' > ranlib = ['/usr/bin/g77'] > version = LooseVersion ('3.4.6') > version_cmd = ['/usr/bin/g77', '--version'] > Gnu95FCompiler instance properties: > archiver = ['/usr/bin/gfortran', '-cr'] > compile_switch = '-c' > compiler_f77 = ['/usr/bin/gfortran', '-Wall', '-ffixed-form', '-fno- > second-underscore', '-fPIC', '-O3', '-funroll-loops'] > compiler_f90 = ['/usr/bin/gfortran', '-Wall', '-fno-second-underscore', > '-fPIC', '-O3', '-funroll-loops'] > compiler_fix = ['/usr/bin/gfortran', '-Wall', '-ffixed-form', '-fno- > second-underscore', '-Wall', '-fno-second-underscore', > '- > fPIC', '-O3', '-funroll-loops'] > libraries = ['gfortran'] > library_dirs = [] > linker_exe = ['/usr/bin/gfortran', '-Wall', '-Wall'] > linker_so = ['/usr/bin/gfortran', '-Wall', '-Wall', '-shared'] > object_switch = '-o ' > ranlib = ['/usr/bin/gfortran'] > version = LooseVersion ('4.4.0') > version_cmd = ['/usr/bin/gfortran', '--version'] > Fortran compilers found: > --fcompiler=gnu GNU Fortran 77 compiler (3.4.6) > --fcompiler=gnu95 GNU Fortran 95 compiler (4.4.0) > Compilers available for this platform, but not found: > --fcompiler=absoft Absoft Corp Fortran Compiler > --fcompiler=compaq Compaq Fortran Compiler > --fcompiler=g95 G95 Fortran Compiler > --fcompiler=intel Intel Fortran Compiler for 32-bit apps > --fcompiler=intele Intel Fortran Compiler for Itanium apps > --fcompiler=intelem Intel Fortran Compiler for EM64T-based apps > --fcompiler=lahey Lahey/Fujitsu Fortran 95 Compiler > --fcompiler=nag NAGWare Fortran 95 Compiler > --fcompiler=pg Portland Group Fortran Compiler > --fcompiler=vast Pacific-Sierra Research Fortran 90 Compiler > Compilers not available on this platform: > --fcompiler=hpux HP Fortran 90 Compiler > --fcompiler=ibm IBM XL Fortran Compiler > --fcompiler=intelev Intel Visual Fortran Compiler for Itanium apps > --fcompiler=intelv Intel Visual Fortran Compiler for 32-bit apps > --fcompiler=mips MIPSpro Fortran Compiler > --fcompiler=none Fake Fortran compiler > --fcompiler=sun Sun or Forte Fortran 95 Compiler > For compiler details, run 'config_fc --verbose' setup command. > ------ > Importing numpy.distutils.cpuinfo ... ok > ------ > CPU information: CPUInfoBase__get_nbits getNCPUs has_mmx has_sse > has_sse2 has_sse3 has_ssse3 is_32bit is_Intel is_i686 ------ > > > > -- > G?khan > -- G?khan -------------- next part -------------- An HTML attachment was scrubbed... URL: From d_l_goldsmith at yahoo.com Fri Aug 14 17:16:15 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Fri, 14 Aug 2009 14:16:15 -0700 (PDT) Subject: [Numpy-discussion] passing "import numpy as np" as python command arg in 'doze Message-ID: <740668.69161.qm@web52112.mail.re2.yahoo.com> Hi! Please remind: running python in the Windows Terminal (DOS command prompt), how does one pass the command "import numpy as np"? I tried 'python "import numpy as np"', to no avail. 
DG From robert.kern at gmail.com Fri Aug 14 17:20:08 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 14 Aug 2009 16:20:08 -0500 Subject: [Numpy-discussion] passing "import numpy as np" as python command arg in 'doze In-Reply-To: <740668.69161.qm@web52112.mail.re2.yahoo.com> References: <740668.69161.qm@web52112.mail.re2.yahoo.com> Message-ID: <3d375d730908141420n6f9b9861x322e894712dc86aa@mail.gmail.com> On Fri, Aug 14, 2009 at 16:16, David Goldsmith wrote: > Hi! ?Please remind: running python in the Windows Terminal (DOS command prompt), how does one pass the command "import numpy as np"? ?I tried 'python "import numpy as np"', to no avail. $ python -h usage: /Library/Frameworks/Python.framework/Versions/2.5/Resources/PythonApp.app/Contents/MacOS/PythonApp [option] ... [-c cmd | -m mod | file | -] [arg] ... Options and arguments (and corresponding environment variables): -c cmd : program passed in as string (terminates option list) .... -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From jsseabold at gmail.com Fri Aug 14 17:20:35 2009 From: jsseabold at gmail.com (Skipper Seabold) Date: Fri, 14 Aug 2009 17:20:35 -0400 Subject: [Numpy-discussion] passing "import numpy as np" as python command arg in 'doze In-Reply-To: <740668.69161.qm@web52112.mail.re2.yahoo.com> References: <740668.69161.qm@web52112.mail.re2.yahoo.com> Message-ID: On Fri, Aug 14, 2009 at 5:16 PM, David Goldsmith wrote: > Hi! ?Please remind: running python in the Windows Terminal (DOS command prompt), how does one pass the command "import numpy as np"? ?I tried 'python "import numpy as np"', to no avail. > Is this what you want? In a linux terminal python -c "import numpy as np; print np.ones(5)" http://docs.python.org/using/cmdline.html Skipper From d_l_goldsmith at yahoo.com Fri Aug 14 18:09:31 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Fri, 14 Aug 2009 15:09:31 -0700 (PDT) Subject: [Numpy-discussion] passing "import numpy as np" as python command arg in 'doze In-Reply-To: <3d375d730908141420n6f9b9861x322e894712dc86aa@mail.gmail.com> Message-ID: <244613.29168.qm@web52102.mail.re2.yahoo.com> Thanks! DG --- On Fri, 8/14/09, Robert Kern wrote: > From: Robert Kern > Subject: Re: [Numpy-discussion] passing "import numpy as np" as python command arg in 'doze > To: "Discussion of Numerical Python" > Date: Friday, August 14, 2009, 2:20 PM > On Fri, Aug 14, 2009 at 16:16, David > Goldsmith > wrote: > > Hi! ?Please remind: running python in the Windows > Terminal (DOS command prompt), how does one pass the command > "import numpy as np"? ?I tried 'python "import numpy as > np"', to no avail. > > $ python -h > usage: > /Library/Frameworks/Python.framework/Versions/2.5/Resources/PythonApp.app/Contents/MacOS/PythonApp > [option] ... [-c cmd | -m mod | file | -] [arg] ... > Options and arguments (and corresponding environment > variables): > -c cmd : program passed in as string (terminates option > list) > > .... > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, > a harmless > enigma that is made terrible by our own mad attempt to > interpret it as > though it had an underlying truth." > ? 
-- Umberto Eco
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com

From d_l_goldsmith at yahoo.com Fri Aug 14 18:18:55 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Fri, 14 Aug 2009 15:18:55 -0700 (PDT)
Subject: [Numpy-discussion] passing "import numpy as np" as python command arg in 'doze
In-Reply-To: 
Message-ID: <546323.15295.qm@web52112.mail.re2.yahoo.com>

Thanks, Skipper & Robert; perhaps I'm misunderstanding what should happen, but this doesn't appear to work in Windoze:

Begin Terminal output:

C:\Python26>python -c "import numpy as np"

C:\Python26>

End Terminal output.

In other words, no error is returned, but python doesn't "stay running".

DG

--- On Fri, 8/14/09, Skipper Seabold wrote:

> From: Skipper Seabold
> Subject: Re: [Numpy-discussion] passing "import numpy as np" as python command arg in 'doze
> To: "Discussion of Numerical Python"
> Date: Friday, August 14, 2009, 2:20 PM
> On Fri, Aug 14, 2009 at 5:16 PM,
> David Goldsmith
> wrote:
> > Hi! Please remind: running python in the Windows
> Terminal (DOS command prompt), how does one pass the command
> "import numpy as np"? I tried 'python "import numpy as
> np"', to no avail.
> >
> 
> Is this what you want?
> 
> In a linux terminal
> 
> python -c "import numpy as np; print np.ones(5)"
> 
> http://docs.python.org/using/cmdline.html
> 
> Skipper
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From jsseabold at gmail.com Fri Aug 14 18:21:26 2009
From: jsseabold at gmail.com (Skipper Seabold)
Date: Fri, 14 Aug 2009 18:21:26 -0400
Subject: [Numpy-discussion] passing "import numpy as np" as python command arg in 'doze
In-Reply-To: <546323.15295.qm@web52112.mail.re2.yahoo.com>
References: <546323.15295.qm@web52112.mail.re2.yahoo.com>
Message-ID: 

On Fri, Aug 14, 2009 at 6:18 PM, David Goldsmith wrote:
> Thanks, Skipper & Robert; perhaps I'm misunderstanding what should happen, but this doesn't appear to work in Windoze:
>
> Begin Terminal output:
>
> C:\Python26>python -c "import numpy as np"
>
> C:\Python26>
>
> End Terminal output.
>

Because it exits right after it imports.

python -c "import numpy as np; print np.ones(5)"

Should print an array of ones and then return you to the prompt.

Skipper

From robert.kern at gmail.com Fri Aug 14 18:23:52 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 14 Aug 2009 17:23:52 -0500
Subject: [Numpy-discussion] passing "import numpy as np" as python command arg in 'doze
In-Reply-To: <546323.15295.qm@web52112.mail.re2.yahoo.com>
References: <546323.15295.qm@web52112.mail.re2.yahoo.com>
Message-ID: <3d375d730908141523m4fb61632m96ff54b1431c6cfd@mail.gmail.com>

On Fri, Aug 14, 2009 at 17:18, David Goldsmith wrote:
> Thanks, Skipper & Robert; perhaps I'm misunderstanding what should happen, but this doesn't appear to work in Windoze:
>
> Begin Terminal output:
>
> C:\Python26>python -c "import numpy as np"
>
> C:\Python26>
>
> End Terminal output.
>
> In other words, no error is returned, but python doesn't "stay running".

It's not supposed to, just like "python script.py" doesn't.
Instead, use

  python -i -c "import numpy as np"

http://docs.python.org/using/cmdline.html

If you just want to execute some things before entering the
interpreter every time, use PYTHONSTARTUP instead. Or IPython.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco

From d_l_goldsmith at yahoo.com Fri Aug 14 18:27:25 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Fri, 14 Aug 2009 15:27:25 -0700 (PDT)
Subject: [Numpy-discussion] passing "import numpy as np" as python command arg in 'doze
In-Reply-To: <3d375d730908141523m4fb61632m96ff54b1431c6cfd@mail.gmail.com>
Message-ID: <80361.38596.qm@web52102.mail.re2.yahoo.com>

Excellent, thanks!

DG

--- On Fri, 8/14/09, Robert Kern wrote:

> From: Robert Kern
> Subject: Re: [Numpy-discussion] passing "import numpy as np" as python command arg in 'doze
> To: "Discussion of Numerical Python"
> Date: Friday, August 14, 2009, 3:23 PM
> On Fri, Aug 14, 2009 at 17:18, David
> Goldsmith
> wrote:
> > Thanks, Skipper & Robert; perhaps I'm
> misunderstanding what should happen, but this doesn't appear
> to work in Windoze:
> >
> > Begin Terminal output:
> >
> > C:\Python26>python -c "import numpy as np"
> >
> > C:\Python26>
> >
> > End Terminal output.
> >
> > In other words, no error is returned, but python
> doesn't "stay running".
> 
> It's not supposed to, just like "python script.py" doesn't.
> Instead, use
> 
>   python -i -c "import numpy as np"
> 
> http://docs.python.org/using/cmdline.html
> 
> If you just want to execute some things before entering
> the
> interpreter every time, use PYTHONSTARTUP instead. Or
> IPython.
> 
> -- 
> Robert Kern
> 
> "I have come to believe that the whole world is an enigma,
> a harmless
> enigma that is made terrible by our own mad attempt to
> interpret it as
> though it had an underlying truth."
> -- Umberto Eco
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com

From gokhansever at gmail.com Fri Aug 14 20:55:50 2009
From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_Sever?=)
Date: Fri, 14 Aug 2009 19:55:50 -0500
Subject: [Numpy-discussion] build_clib error during Enable 3_2_1 installation
In-Reply-To: <49d6b3500908141411h65e0f59fue358524f5d9e5ed7@mail.gmail.com>
References: <49d6b3500908131536m6e5c4646qa0d1d03a1ea6c8c8@mail.gmail.com>
	<49d6b3500908141411h65e0f59fue358524f5d9e5ed7@mail.gmail.com>
Message-ID: <49d6b3500908141755l635ac0c2s53737b392283a828@mail.gmail.com>

Fixed this using the suggestion from Robert Kern in the Nabble thread
"Is this a bug in numpy.distutils?":

On Tue, Aug 4, 2009 at 15:09, Matthew Brett wrote:
> File "/home/mb312/usr/local/lib/python2.5/site-packages/numpy/distutils/command/build_ext.py",
> line 74, in run
> self.library_dirs.append(build_clib.build_clib)
> UnboundLocalError: local variable 'build_clib' referenced before assignment
>
> because of the check for inplace builds above that, leaving build_clib
> undefined. I'm afraid I wasn't quite sure what the right thing to do
> was.

Probably just

build_clib = self.distribution.get_command_obj('build_clib')

after the log.warn().

This worked indeed.
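For anyone replaying that UnboundLocalError outside of numpy.distutils, here is a self-contained Python sketch of the failure mode and of the one-line fix quoted above. FakeDist and run are hypothetical stand-ins, not numpy.distutils internals; only the control-flow pattern is taken from the traceback.

import logging

log = logging.getLogger("build_ext")

class FakeDist(object):
    # Hypothetical stand-in for distutils' Distribution; the real class
    # also provides get_command_obj(), which returns a command object.
    def get_command_obj(self, name):
        class Cmd(object):
            build_clib = "build/temp.linux-i686-2.6"
        return Cmd()

def run(inplace, dist):
    library_dirs = []
    if not inplace:
        # normal path: the name 'build_clib' gets bound here
        build_clib = dist.get_command_obj("build_clib")
    else:
        log.warning("build_clib already run, it is too late to "
                    "ensure in-place build of build_clib")
        # The fix: without the next line, 'build_clib' stays unbound on
        # this branch and the append below raises UnboundLocalError,
        # which is exactly the crash in the traceback above.
        build_clib = dist.get_command_obj("build_clib")
    library_dirs.append(build_clib.build_clib)
    return library_dirs

print(run(inplace=True, dist=FakeDist()))

With the extra assignment in the else branch, the in-place path returns normally instead of raising.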
Thanks :) On Fri, Aug 14, 2009 at 4:11 PM, G?khan Sever wrote: > Hello, > > I fix the scipy installation issue. I usually checkout the whole ETS trunk > (ets co ETS) and do a "ets develop". After fullfilling the requirements it > was always successfully building the whole code-stack (well at least always > in Fedora 10). In this case it fails on Enable compilation step. > > Don't know this error: "build_clib already run, it is too late to ensure > in-place build of build_clib" is due to my system files or a conflict with > the installed Python tools. > > One more thing to note; there is build_clib.py file under > /usr/lib/python2.6/distutils/command. Might these be due to a conflict > between numpy's directive and Python's distutil? > > > On Thu, Aug 13, 2009 at 5:36 PM, G?khan Sever wrote: > >> For some unknown reason, ets develop can't pass the following compilation >> point: >> >> >> g++: enthought/kiva/agg/src/kiva_rect.cpp >> ar: adding 8 object files to build/temp.linux-i686-2.6/libkiva_src.a >> running build_ext >> build_clib already run, it is too late to ensure in-place build of >> build_clib >> Traceback (most recent call last): >> File "setup.py", line 327, in >> **config >> File "/home/gsever/Desktop/python-repo/numpy/numpy/distutils/core.py", >> line 186, in setup >> return old_setup(**new_attr) >> File "/usr/lib/python2.6/distutils/core.py", line 152, in setup >> dist.run_commands() >> File "/usr/lib/python2.6/distutils/dist.py", line 975, in run_commands >> self.run_command(cmd) >> File "/usr/lib/python2.6/distutils/dist.py", line 995, in run_command >> cmd_obj.run() >> File >> "/home/gsever/Desktop/python-repo/numpy/numpy/distutils/command/build_ext.py", >> line 74, in run >> self.library_dirs.append(build_clib.build_clib) >> UnboundLocalError: local variable 'build_clib' referenced before >> assignment >> Traceback (most recent call last): >> File "/usr/bin/ets", line 8, in >> load_entry_point('ETSProjectTools==0.6.0.dev-r24434', >> 'console_scripts', 'ets')() >> File >> "/usr/lib/python2.6/site-packages/ETSProjectTools-0.5.1-py2.6.egg/enthought/ets/ets.py", >> line 152, in main >> args.func(args, cfg) >> File >> "/usr/lib/python2.6/site-packages/ETSProjectTools-0.5.1-py2.6.egg/enthought/ets/develop.py", >> line 76, in main >> checkouts.perform(command, dry_run=args.dry_run) >> File >> "/usr/lib/python2.6/site-packages/ETSProjectTools-0.5.1-py2.6.egg/enthought/ets/tools/checkouts.py", >> line 126, in perform >> '%s' % project) >> RuntimeError: Unable to complete command for project: >> /home/gsever/Desktop/python-repo/ETS_3.3.1/Enable_3.2.1 >> >> >> Any suggestions? 
>> >> >> >> ################################################################## >> [gsever at ccn Desktop]$ python -c 'from numpy.f2py.diagnose import run; >> run()' >> ################################################################## >> ------ >> os.name='posix' >> ------ >> sys.platform='linux2' >> ------ >> sys.version: >> 2.6 (r26:66714, Jun 8 2009, 16:07:26) >> [GCC 4.4.0 20090506 (Red Hat 4.4.0-4)] >> ------ >> sys.prefix: >> /usr >> ------ >> >> sys.path=':/usr/lib/python2.6/site-packages/foolscap-0.4.2-py2.6.egg:/usr/lib/python2.6/site-packages/Twisted-8.2.0-py2.6-linux-i686.egg:/home/gsever/Desktop/python-repo/ipython:/home/gsever/Desktop/python-repo/numpy:/home/gsever/Desktop/python-repo/matplotlib/lib:/usr/lib/python2.6/site-packages/Sphinx-0.6.2-py2.6.egg:/usr/lib/python2.6/site-packages/docutils-0.5-py2.6.egg:/usr/lib/python2.6/site-packages/Jinja2-2.1.1-py2.6-linux-i686.egg:/usr/lib/python2.6/site-packages/Pygments-1.0-py2.6.egg:/usr/lib/python2.6/site-packages/xlwt-0.7.2-py2.6.egg:/usr/lib/python2.6/site-packages/spyder-1.0.0beta1-py2.6.egg:/usr/lib/python2.6/site-packages/PyOpenGL-3.0.0c1-py2.6.egg:/home/gsever/Desktop/python-repo/ETS_3.3.1/EnthoughtBase_3.0.4:/home/gsever/Desktop/python-repo/ETS_3.3.1/TraitsBackendWX_3.2.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/ETSProjectTools_0.6.0:/home/gsever/Desktop/python-repo/ETS_3.3.1/Chaco_3.2.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/ETS_3.3.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/TraitsGUI_3.1.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/Traits_3.2.1:/home/gsever/Desktop/python-repo/ETS_3.3.1/BlockCanvas_3.1.1:/usr/lib/python26.zip:/usr/lib/python2.6:/usr/lib/python2.6/plat-linux2:/usr/lib/python2.6/lib-tk:/usr/lib/python2.6/lib-old:/usr/lib/python2.6/lib-dynload:/usr/lib/python2.6/site-packages:/usr/lib/python2.6/site-packages/Numeric:/usr/lib/python2.6/site-packages/PIL:/usr/lib/python2.6/site-packages/gst-0.10:/usr/lib/python2.6/site-packages/gtk-2.0:/usr/lib/python2.6/site-packages:/usr/lib/python2.6/site-packages/wx-2.8-gtk2-unicode' >> ------ >> Failed to import numarray: No module named numarray >> Found Numeric version '24.2' in >> /usr/lib/python2.6/site-packages/Numeric/Numeric.pyc >> Found new numpy version '1.4.0.dev' in >> /home/gsever/Desktop/python-repo/numpy/numpy/__init__.pyc >> Found f2py2e version '2' in >> /home/gsever/Desktop/python-repo/numpy/numpy/f2py/f2py2e.pyc >> Found numpy.distutils version '0.4.0' in >> '/home/gsever/Desktop/python-repo/numpy/numpy/distutils/__init__.pyc' >> ------ >> Importing numpy.distutils.fcompiler ... 
ok >> ------ >> Checking availability of supported Fortran compilers: >> GnuFCompiler instance properties: >> archiver = ['/usr/bin/g77', '-cr'] >> compile_switch = '-c' >> compiler_f77 = ['/usr/bin/g77', '-g', '-Wall', '-fno-second- >> underscore', '-fPIC', '-O3', '-funroll-loops'] >> compiler_f90 = None >> compiler_fix = None >> libraries = ['g2c'] >> library_dirs = [] >> linker_exe = ['/usr/bin/g77', '-g', '-Wall', '-g', '-Wall'] >> linker_so = ['/usr/bin/g77', '-g', '-Wall', '-g', '-Wall', '- >> shared'] >> object_switch = '-o ' >> ranlib = ['/usr/bin/g77'] >> version = LooseVersion ('3.4.6') >> version_cmd = ['/usr/bin/g77', '--version'] >> Gnu95FCompiler instance properties: >> archiver = ['/usr/bin/gfortran', '-cr'] >> compile_switch = '-c' >> compiler_f77 = ['/usr/bin/gfortran', '-Wall', '-ffixed-form', '-fno- >> second-underscore', '-fPIC', '-O3', '-funroll-loops'] >> compiler_f90 = ['/usr/bin/gfortran', '-Wall', >> '-fno-second-underscore', >> '-fPIC', '-O3', '-funroll-loops'] >> compiler_fix = ['/usr/bin/gfortran', '-Wall', '-ffixed-form', '-fno- >> second-underscore', '-Wall', '-fno-second-underscore', >> '- >> fPIC', '-O3', '-funroll-loops'] >> libraries = ['gfortran'] >> library_dirs = [] >> linker_exe = ['/usr/bin/gfortran', '-Wall', '-Wall'] >> linker_so = ['/usr/bin/gfortran', '-Wall', '-Wall', '-shared'] >> object_switch = '-o ' >> ranlib = ['/usr/bin/gfortran'] >> version = LooseVersion ('4.4.0') >> version_cmd = ['/usr/bin/gfortran', '--version'] >> Fortran compilers found: >> --fcompiler=gnu GNU Fortran 77 compiler (3.4.6) >> --fcompiler=gnu95 GNU Fortran 95 compiler (4.4.0) >> Compilers available for this platform, but not found: >> --fcompiler=absoft Absoft Corp Fortran Compiler >> --fcompiler=compaq Compaq Fortran Compiler >> --fcompiler=g95 G95 Fortran Compiler >> --fcompiler=intel Intel Fortran Compiler for 32-bit apps >> --fcompiler=intele Intel Fortran Compiler for Itanium apps >> --fcompiler=intelem Intel Fortran Compiler for EM64T-based apps >> --fcompiler=lahey Lahey/Fujitsu Fortran 95 Compiler >> --fcompiler=nag NAGWare Fortran 95 Compiler >> --fcompiler=pg Portland Group Fortran Compiler >> --fcompiler=vast Pacific-Sierra Research Fortran 90 Compiler >> Compilers not available on this platform: >> --fcompiler=hpux HP Fortran 90 Compiler >> --fcompiler=ibm IBM XL Fortran Compiler >> --fcompiler=intelev Intel Visual Fortran Compiler for Itanium apps >> --fcompiler=intelv Intel Visual Fortran Compiler for 32-bit apps >> --fcompiler=mips MIPSpro Fortran Compiler >> --fcompiler=none Fake Fortran compiler >> --fcompiler=sun Sun or Forte Fortran 95 Compiler >> For compiler details, run 'config_fc --verbose' setup command. >> ------ >> Importing numpy.distutils.cpuinfo ... ok >> ------ >> CPU information: CPUInfoBase__get_nbits getNCPUs has_mmx has_sse >> has_sse2 has_sse3 has_ssse3 is_32bit is_Intel is_i686 ------ >> >> >> >> -- >> G?khan >> > > > > -- > G?khan > -- G?khan -------------- next part -------------- An HTML attachment was scrubbed... URL: From bbagger at gmail.com Sat Aug 15 10:14:21 2009 From: bbagger at gmail.com (Bent) Date: Sat, 15 Aug 2009 16:14:21 +0200 Subject: [Numpy-discussion] Can't import numpy - problem with lapack? Message-ID: <2e19719f0908150714safaef97k1512dc3d3a0da305@mail.gmail.com> Hi list I want to use Numpy with Gnuradio but I cannot make it work. 
The problem, which turns out to have nothing to do with Gnuradio, is an
undefined symbol as witnessed by these messages:

bent at yosie:~> python
Python 2.6 (r26:66714, Feb 3 2009, 20:52:03)
[GCC 4.3.2 [gcc-4_3-branch revision 141291]] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/usr/lib/python2.6/site-packages/numpy/__init__.py", line 138,
in <module>
import linalg
File "/usr/lib/python2.6/site-packages/numpy/linalg/__init__.py",
line 47, in <module>
from linalg import *
File "/usr/lib/python2.6/site-packages/numpy/linalg/linalg.py", line
29, in <module>
from numpy.linalg import lapack_lite
ImportError: /usr/lib/python2.6/site-packages/numpy/linalg/lapack_lite.so:
undefined symbol: zgesdd_
>>> quit()
bent at yosie:~>

My distribution is openSUSE 11.1 and what really bugs me is that the
exact same setup works as expected on another PC also running openSUSE
11.1. As far as I can tell, the installed packages are exactly the
same, version numbers, build dates, etc, all are the same... The only
difference between the two PCs is that the one (the one that fails) is
a new install and the other is an upgrade from an older release.

I have googled high and low but not found anything that could bring me
further. I hope that this (collectively) all-knowledgeable list can
give me some pointers.

Kind regards,
Bent

From charlesr.harris at gmail.com Sat Aug 15 10:42:31 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 15 Aug 2009 08:42:31 -0600
Subject: [Numpy-discussion] Can't import numpy - problem with lapack?
In-Reply-To: <2e19719f0908150714safaef97k1512dc3d3a0da305@mail.gmail.com>
References: <2e19719f0908150714safaef97k1512dc3d3a0da305@mail.gmail.com>
Message-ID: 

On Sat, Aug 15, 2009 at 8:14 AM, Bent wrote:

> Hi list
>
> I want to use Numpy with Gnuradio but I cannot make it work. The
> problem, which turns out to have nothing to do with Gnuradio, is an
> undefined symbol as witnessed by these messages:
>
> bent at yosie:~> python
> Python 2.6 (r26:66714, Feb 3 2009, 20:52:03)
> [GCC 4.3.2 [gcc-4_3-branch revision 141291]] on linux2
> Type "help", "copyright", "credits" or "license" for more information.
> >>> import numpy
> Traceback (most recent call last):
> File "<stdin>", line 1, in <module>
> File "/usr/lib/python2.6/site-packages/numpy/__init__.py", line 138,
> in <module>
> import linalg
> File "/usr/lib/python2.6/site-packages/numpy/linalg/__init__.py",
> line 47, in <module>
> from linalg import *
> File "/usr/lib/python2.6/site-packages/numpy/linalg/linalg.py", line
> 29, in <module>
> from numpy.linalg import lapack_lite
> ImportError: /usr/lib/python2.6/site-packages/numpy/linalg/lapack_lite.so:
> undefined symbol: zgesdd_
> >>> quit()
> bent at yosie:~>
>
> My distribution is openSUSE 11.1 and what really bugs me is that the
> exact same setup works as expected on another PC also running openSUSE
> 11.1. As far as I can tell, the installed packages are exactly the
> same, version numbers, build dates, etc, all are the same... The only
> difference between the two PCs is that the one (the one that fails) is
> a new install and the other is an upgrade from an older release.
>
> I have googled high and low but not found anything that could bring me
> further. I hope that this (collectively) all-knowledgeable list can
> give me some pointers.
>

That's probably due to the ATLAS library you have installed, SuSE has a
history of problems in that area. Have you checked the ATLAS versions also?
Is the hardware the same?

Chuck
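An aside for readers debugging the same kind of failure: before swapping packages around, it can help to confirm which shared libraries lapack_lite.so actually resolves against, and whether the missing routine (zgesdd_, LAPACK's complex divide-and-conquer SVD driver) is exported by them. Two standard one-liners, with the extension path taken from the traceback above; the liblapack path is only an example for your own system:

$ ldd /usr/lib/python2.6/site-packages/numpy/linalg/lapack_lite.so
$ nm -D /usr/lib/liblapack.so.3 | grep -i zgesdd

If the LAPACK library that ldd reports does not export zgesdd_, numpy was built against a more complete LAPACK than the one present at run time, consistent with the ATLAS diagnosis above.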
From bbagger at gmail.com Sat Aug 15 14:00:51 2009
From: bbagger at gmail.com (Bent)
Date: Sat, 15 Aug 2009 20:00:51 +0200
Subject: [Numpy-discussion] Can't import numpy - problem with lapack?
Message-ID: <2e19719f0908151100i72fb4f56w1a6b2abe9b89837@mail.gmail.com>

Charles R Harris wrote:
>
> That's probably due to the ATLAS library you have installed, SuSE has a history of problems in that area.
> Have you checked the ATLAS versions also? Is the hardware the same?
>

You certainly got something there. It turned out that the PC on which
import numpy works does not have anything named 'atlas' installed
(apart from a kernel module named atlas_btns) whereas my workstation had
a package named libatlas3-sse installed. When I removed this package,
everything fell into place and import numpy now works.

Thanks for the help
Bent

From stefan at sun.ac.za Sat Aug 15 20:46:10 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Sat, 15 Aug 2009 17:46:10 -0700
Subject: [Numpy-discussion] Cython BOF
In-Reply-To: <4A8560C9.40302@student.matnat.uio.no>
References: <4A8560C9.40302@student.matnat.uio.no>
Message-ID: <9457e7c80908151746s623273b2jbeb3ef8523dba768@mail.gmail.com>

2009/8/14 Dag Sverre Seljebotn :
> There's been some discussion earlier about how starting to write bigger
> parts of the NumPy/SciPy codebase in Cython could potentially lower the
> barrier of entry.

Also, it could address the 2.x to 3.0 C-API transition problems. I'm
not sure how we'd tackle it otherwise.

Stéfan

From sccolbert at gmail.com Sun Aug 16 20:01:56 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Sun, 16 Aug 2009 20:01:56 -0400
Subject: [Numpy-discussion] is there a better way to do this array repeat?
Message-ID: <7f014ea60908161701r711c0c79s6f97525fc29a4536@mail.gmail.com>

I don't think np.repeat will do what I want because the order needs to
be preserved.

I have a 1x3 array that I want to repeat n times and form an nx3 array
where each row is a copy of the original array.

So far I have this:

>>> import numpy as np
>>> a = np.arange(3)
>>> b = np.asarray([a]*10)
>>> b
array([[0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2]])
>>> b.shape
(10, 3)

the issue is that my n may be very large and I'd rather not make that
list if I don't have to.

Cheers,

Chris

From robert.kern at gmail.com Sun Aug 16 20:07:23 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sun, 16 Aug 2009 19:07:23 -0500
Subject: [Numpy-discussion] is there a better way to do this array repeat?
In-Reply-To: <7f014ea60908161701r711c0c79s6f97525fc29a4536@mail.gmail.com>
References: <7f014ea60908161701r711c0c79s6f97525fc29a4536@mail.gmail.com>
Message-ID: <3d375d730908161707n663ea8b7sa100b1eaff8dc261@mail.gmail.com>

On Sun, Aug 16, 2009 at 19:01, Chris Colbert wrote:
> I don't think np.repeat will do what I want because the order needs to
> be preserved.

"Order"?

> I have a 1x3 array that I want to repeat n times and form an nx3 array
> where each row is a copy of the original array.
>
> So far I have this:
>
>>>> import numpy as np
>>>> a = np.arange(3)
>>>> b = np.asarray([a]*10)
>>>> b
> array([[0, 1, 2],
>        [0, 1, 2],
>        [0, 1, 2],
>        [0, 1, 2],
>        [0, 1, 2],
>        [0, 1, 2],
>        [0, 1, 2],
>        [0, 1, 2],
>        [0, 1, 2],
>       
[0, 1, 2]]) >>>> b.shape > (10, 3) In [5]: a = arange(3) In [6]: repeat(a.reshape([1, -1]), 10, axis=0) Out[6]: array([[0, 1, 2], [0, 1, 2], [0, 1, 2], [0, 1, 2], [0, 1, 2], [0, 1, 2], [0, 1, 2], [0, 1, 2], [0, 1, 2], [0, 1, 2]]) -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From stefan at sun.ac.za Sun Aug 16 20:08:33 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sun, 16 Aug 2009 17:08:33 -0700 Subject: [Numpy-discussion] is there a better way to do this array repeat? In-Reply-To: <7f014ea60908161701r711c0c79s6f97525fc29a4536@mail.gmail.com> References: <7f014ea60908161701r711c0c79s6f97525fc29a4536@mail.gmail.com> Message-ID: <9457e7c80908161708l53f0c0e3q7a2c37daa22d8a79@mail.gmail.com> 2009/8/16 Chris Colbert : > I have a 1x3 array that I want to repeat n times and form an nx3 array > where each row is a copy of the original array. a = np.arange(3)[None, :] np.repeat(a, 10, axis=0) Regards St?fan From sccolbert at gmail.com Sun Aug 16 20:30:58 2009 From: sccolbert at gmail.com (Chris Colbert) Date: Sun, 16 Aug 2009 20:30:58 -0400 Subject: [Numpy-discussion] is there a better way to do this array repeat? In-Reply-To: <9457e7c80908161708l53f0c0e3q7a2c37daa22d8a79@mail.gmail.com> References: <7f014ea60908161701r711c0c79s6f97525fc29a4536@mail.gmail.com> <9457e7c80908161708l53f0c0e3q7a2c37daa22d8a79@mail.gmail.com> Message-ID: <7f014ea60908161730r5bbe206ew8ac05dbc1619a2e8@mail.gmail.com> great, thanks! by order I meant repeat the array in order rather than repeat each element. On 8/16/09, St?fan van der Walt wrote: > 2009/8/16 Chris Colbert : >> I have a 1x3 array that I want to repeat n times and form an nx3 array >> where each row is a copy of the original array. > > a = np.arange(3)[None, :] > np.repeat(a, 10, axis=0) > > Regards > St?fan > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From kxroberto at googlemail.com Mon Aug 17 02:42:02 2009 From: kxroberto at googlemail.com (Robert) Date: Mon, 17 Aug 2009 08:42:02 +0200 Subject: [Numpy-discussion] memory address of array data? Message-ID: Is there a function to get the memory address (int) of (contigious) ndarray data on Python level - like array.array.buffer_info() ? I'd need it to pass it to a camera function. From stefan at sun.ac.za Mon Aug 17 02:50:44 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sun, 16 Aug 2009 23:50:44 -0700 Subject: [Numpy-discussion] memory address of array data? In-Reply-To: References: Message-ID: <9457e7c80908162350u2bed6cd6m6a2b5b53f278b9db@mail.gmail.com> 2009/8/16 Robert : > Is there a function to get the memory address (int) of > (contigious) ndarray data on Python level - like > array.array.buffer_info() ? > I'd need it to pass it to a camera function. Have a look at the array interface: x.__array_interface__['data'] Regards St?fan From lciti at essex.ac.uk Mon Aug 17 05:24:48 2009 From: lciti at essex.ac.uk (Citi, Luca) Date: Mon, 17 Aug 2009 10:24:48 +0100 Subject: [Numpy-discussion] is there a better way to do this arrayrepeat? 
References: <7f014ea60908161701r711c0c79s6f97525fc29a4536@mail.gmail.com><9457e7c80908161708l53f0c0e3q7a2c37daa22d8a79@mail.gmail.com> <7f014ea60908161730r5bbe206ew8ac05dbc1619a2e8@mail.gmail.com>
Message-ID: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E6E@sernt14.essex.ac.uk>

As you stress on "repeat the array ... rather than repeat each element",
you may want to consider tile as well:

>>> np.tile(a, [10,1])
array([[0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2]])

From gael.varoquaux at normalesup.org Mon Aug 17 10:44:04 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Mon, 17 Aug 2009 16:44:04 +0200
Subject: [Numpy-discussion] memory address of array data?
In-Reply-To: <9457e7c80908162350u2bed6cd6m6a2b5b53f278b9db@mail.gmail.com>
References: <9457e7c80908162350u2bed6cd6m6a2b5b53f278b9db@mail.gmail.com>
Message-ID: <20090817144404.GE30571@phare.normalesup.org>

On Sun, Aug 16, 2009 at 11:50:44PM -0700, Stéfan van der Walt wrote:
> 2009/8/16 Robert :
> > Is there a function to get the memory address (int) of
> > (contigious) ndarray data on Python level - like
> > array.array.buffer_info() ?
> > I'd need it to pass it to a camera function.

> Have a look at the array interface:

> x.__array_interface__['data']

Also, it might be interesting to have a look at the ctypes support:
http://www.scipy.org/Cookbook/Ctypes

Gaël

From mforbes at physics.ubc.ca Mon Aug 17 10:53:54 2009
From: mforbes at physics.ubc.ca (Michael McNeil Forbes)
Date: Mon, 17 Aug 2009 08:53:54 -0600
Subject: [Numpy-discussion] Specifying Index Programmatically
In-Reply-To: 
References: <4A7E487B.70906@wartburg.edu>
Message-ID: <125661EB-BB17-4546-B02A-CB4DF783219A@physics.ubc.ca>

There is also numpy.s_:

inds = np.s_[...,2,:]
z[inds]

(Though there are some problems with negative indices: see for example
http://www.mail-archive.com/numpy-discussion at scipy.org/msg18245.html)

On 8 Aug 2009, at 10:02 PM, T J wrote:
> On Sat, Aug 8, 2009 at 8:54 PM, Neil Martinsen-Burrell
> wrote:
>>
>> The ellipsis is a built-in python constant called Ellipsis. The
>> colon
>> is a slice object, again a python built-in, called with None as an
>> argument. So, z[...,2,:] == z[Ellipsis,2,slice(None)].

From mforbes at physics.ubc.ca Mon Aug 17 11:12:00 2009
From: mforbes at physics.ubc.ca (Michael McNeil Forbes)
Date: Mon, 17 Aug 2009 09:12:00 -0600
Subject: [Numpy-discussion] IndexExpression bug?
In-Reply-To: <3d375d730906051512r584f253dx864c51ceed6f40cf@mail.gmail.com>
References: <3d375d730906051512r584f253dx864c51ceed6f40cf@mail.gmail.com>
Message-ID: <600DA126-1163-489B-9335-95DE6557EA56@physics.ubc.ca>

Submitted as ticket 1196
http://projects.scipy.org/numpy/ticket/1196

On 5 Jun 2009, at 4:12 PM, Robert Kern wrote:
> On Fri, Jun 5, 2009 at 16:14, Michael McNeil Forbes
> wrote:
>> >>> np.array([0,1,2,3])[1:-1]
>> array([1, 2])
>>
>> but
>>
>> >>> np.array([0,1,2,3])[np.s_[1:-1]]
>> array([1, 2, 3])
>> >>> np.array([0,1,2,3])[np.index_exp[1:-1]]
>> array([1, 2, 3])
...
> I think that getting rid of __getslice__ and __len__ should work
> better. I don't really understand what the logic was behind including
> them in the first place, though. I might be missing something.
...

From sccolbert at gmail.com Mon Aug 17 12:20:14 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Mon, 17 Aug 2009 12:20:14 -0400
Subject: [Numpy-discussion] is there a better way to do this arrayrepeat?
In-Reply-To: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E6E@sernt14.essex.ac.uk> References: <7f014ea60908161701r711c0c79s6f97525fc29a4536@mail.gmail.com> <9457e7c80908161708l53f0c0e3q7a2c37daa22d8a79@mail.gmail.com> <7f014ea60908161730r5bbe206ew8ac05dbc1619a2e8@mail.gmail.com> <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E6E@sernt14.essex.ac.uk> Message-ID: <7f014ea60908170920k18bd56bep9484e1d73500f677@mail.gmail.com> That's exactly it. Thanks! On Mon, Aug 17, 2009 at 5:24 AM, Citi, Luca wrote: > As you stress on "repeat the array ... rather than repeat each element", > you may want to consider tile as well: > >>>> np.tile(a, [10,1]) > array([[0, 1, 2], > ? ? ? [0, 1, 2], > ? ? ? [0, 1, 2], > ? ? ? [0, 1, 2], > ? ? ? [0, 1, 2], > ? ? ? [0, 1, 2], > ? ? ? [0, 1, 2], > ? ? ? [0, 1, 2], > ? ? ? [0, 1, 2], > ? ? ? [0, 1, 2]]) > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From jonathan.taylor at utoronto.ca Mon Aug 17 13:42:32 2009 From: jonathan.taylor at utoronto.ca (Jonathan Taylor) Date: Mon, 17 Aug 2009 13:42:32 -0400 Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq. Message-ID: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com> Hi, I am getting a strange crash in numpy.linalg.lstsq. I have put the code that causes the crash along with two data files on my website at: http://www.cs.toronto.edu/~jtaylor/crash/ I would be interested to know if this bug can be duplicated and/or if anyone has any suggestions as to why: import numpy as np A = np.load('A.npy') b = np.load('b.npy') rc = np.linalg.lstsq(A,b) produces: *** glibc detected *** /usr/bin/python: free(): invalid next size (normal): 0x091793c0 *** ======= Backtrace: ========= /lib/tls/i686/cmov/libc.so.6[0xb7dc7a85] /lib/tls/i686/cmov/libc.so.6(cfree+0x90)[0xb7dcb4f0] /u/jtaylor/lib/python2.5/site-packages/numpy/core/multiarray.so[0xb795403e] /usr/bin/python[0x811247a] /usr/bin/python(PyEval_EvalCodeEx+0x323)[0x80cae33] /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] /usr/bin/python(PyEval_EvalCode+0x57)[0x80cb347] /usr/bin/python(PyRun_FileExFlags+0xf8)[0x80ea818] /usr/bin/python[0x80c1f5a] /usr/bin/python(PyObject_Call+0x27)[0x805cb97] /usr/bin/python(PyEval_EvalFrameEx+0x4064)[0x80c7e04] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] /usr/bin/python[0x8113696] /usr/bin/python(PyObject_Call+0x27)[0x805cb97] /usr/bin/python(PyEval_EvalFrameEx+0x4064)[0x80c7e04] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] /usr/bin/python[0x8113696] /usr/bin/python(PyObject_Call+0x27)[0x805cb97] /usr/bin/python[0x8062bfb] /usr/bin/python(PyObject_Call+0x27)[0x805cb97] /usr/bin/python(PyEval_EvalFrameEx+0x3d07)[0x80c7aa7] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] /usr/bin/python(PyEval_EvalFrameEx+0x5945)[0x80c96e5] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] /usr/bin/python(PyEval_EvalFrameEx+0x6d09)[0x80caaa9] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] /usr/bin/python(PyEval_EvalFrameEx+0x5945)[0x80c96e5] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] 
/usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] /usr/bin/python(PyEval_EvalCode+0x57)[0x80cb347] /usr/bin/python(PyRun_FileExFlags+0xf8)[0x80ea818] /usr/bin/python(PyRun_SimpleFileExFlags+0x199)[0x80eaab9] /usr/bin/python(Py_Main+0xa35)[0x8059335] /usr/bin/python(main+0x22)[0x80587f2] /lib/tls/i686/cmov/libc.so.6(__libc_start_main+0xe0)[0xb7d72450] /usr/bin/python[0x8058761] ======= Memory map: ======== 08048000-08140000 r-xp 00000000 08:06 83501 /usr/bin/python2.5 08140000-08165000 rw-p 000f7000 08:06 83501 /usr/bin/python2.5 08165000-0919a000 rw-p 08165000 00:00 0 [heap] b5200000-b5221000 rw-p b5200000 00:00 0 b5221000-b5300000 ---p b5221000 00:00 0 b53fc000-b5499000 r-xp 00000000 00:1a 552170 /h/44/jtaylor/lib/python2.5/site-packages/Cython/Compiler/Parsing.so b5499000-b54a2000 rw-p 0009d000 00:1a 552170 /h/44/jtaylor/lib/python2.5/site-packages/Cython/Compiler/Parsing.so b54a2000-b5624000 rw-p b54a2000 00:00 0 b5624000-b568f000 r-xp 00000000 00:1a 553542 /h/44/jtaylor/build/matplotlib/lib/matplotlib/backends/_backend_agg.so b568f000-b5691000 rw-p 0006a000 00:1a 553542 /h/44/jtaylor/build/matplotlib/lib/matplotlib/backends/_backend_agg.so b5691000-b56f6000 r-xp 00000000 08:06 90831 /usr/lib/python2.5/lib-dynload/unicodedata.so b56f6000-b5705000 rw-p 00065000 08:06 90831 /usr/lib/python2.5/lib-dynload/unicodedata.so b5705000-b5725000 r-xp 00000000 00:1a 553545 /h/44/jtaylor/build/matplotlib/lib/matplotlib/backends/_tkagg.so b5725000-b5726000 rw-p 00020000 00:1a 553545 /h/44/jtaylor/build/matplotlib/lib/matplotlib/backends/_tkagg.so b5726000-b5727000 ---p b5726000 00:00 0 b5727000-b5f27000 rwxp b5727000 00:00 0 b5f27000-b5f3e000 r-xp 00000000 08:06 85532 /usr/lib/libxcb.so.1.0.0 b5f3e000-b5f3f000 rw-p 00016000 08:06 85532 /usr/lib/libxcb.so.1.0.0 b5f3f000-b5f53000 r-xp 00000000 08:06 1187870 /lib/tls/i686/cmov/ libnsl-2.7.so b5f53000-b5f55000 rw-p 00013000 08:06 1187870 /lib/tls/i686/cmov/ libnsl-2.7.so b5f55000-b5f57000 rw-p b5f55000 00:00 0 b5f57000-b603b000 r-xp 00000000 08:06 85536 /usr/lib/libX11.so.6.2.0 b603b000-b603e000 rw-p 000e4000 08:06 85536 /usr/lib/libX11.so.6.2.0 b603e000-b60e7000 r-xp 00000000 08:06 85098 /usr/lib/libtcl8.4.so.0 b60e7000-b60f1000 rw-p 000a8000 08:06 85098 /usr/lib/libtcl8.4.so.0 b60f1000-b60f2000 rw-p b60f1000 00:00 0 b60f2000-b61c4000 r-xp 00000000 08:06 85102 /usr/lib/libtk8.4.so.0 b61c4000-b61cf000 rw-p 000d2000 08:06 85102 /usr/lib/libtk8.4.so.0 b61cf000-b61d0000 rw-p b61cf000 00:00 0 b61d0000-b62a8000 r-xp 00000000 08:06 85103 /usr/lib/libBLT.2.4.so.8.4 b62a8000-b62b9000 rw-p 000d8000 08:06 85103 /usr/lib/libBLT.2.4.so.8.4 b62b9000-b62ba000 rw-p b62b9000 00:00 0 b62ba000-b62dc000 r-xp 00000000 08:06 180469 /usr/lib/libpng12.so.0.15.0 b62dc000-b62dd000 rw-p 00022000 08:06 180469 /usr/lib/libpng12.so.0.15.0 b62f5000-b62f6000 rw-p b62f5000 00:00 0 b62f6000-b631d000 r-xp 00000000 00:1a 553544 /h/44/jtaylor/build/matplotlib/lib/matplotlib/_png.so b631d000-b631e000 rw-p 00027000 00:1a 553544 /h/44/jtaylor/build/matplotlib/lib/matplotlib/_png.so b631e000-b6367000 r-xp 00000000 00:1a 553543 /h/44/jtaylor/build/matplotlib/lib/matplotlib/_image.so b6367000-b6369000 rw-p 00049000 00:1a 553543 /h/44/jtaylor/build/matplotlib/lib/matplotlib/_image.so b6369000-b63d3000 r-xp 00000000 08:06 83795 
/usr/lib/libfreetype.so.6.3.16 b63d3000-b63d6000 rw-p 0006a000 08:06 83795 /usr/lib/libfreetype.so.6.3.16 b63d6000-b6424000 r-xp 00000000 00:1a 553535 /h/44/jtaylor/build/matplotlib/lib/matplotlib/ft2font.so b6424000-b6427000 rw-p 0004e000 00:1a 553535 /h/44/jtaylor/build/matplotlib/lib/matplotlib/ft2font.so b6427000-b650f000 r-xp 00000000 08:06 88506 /usr/lib/libstdc++.so.6.0.9 b650f000-b6512000 r--p 000e8000 08:06 88506 /usr/lib/libstdc++.so.6.0.9 b6512000-b6514000 rw-p 000eb000 08:06 88506 /usr/lib/libstdc++.so.6.0.9 b6514000-b651a000 rw-p b6514000 00:00 0 b651e000-b6528000 r-xp 00000000 08:06 313979 /usr/lib/python2.5/lib-dynload/_tkinter.so b6528000-b6529000 rw-p 0000a000 08:06 313979 /usr/lib/python2.5/lib-dynload/_tkinter.so b6529000-b652d000 r-xp 00000000 08:06 90832 /usr/lib/python2.5/lib-dynload/zlib.so b652d000-b652e000 rw-p 00004000 08:06 90832 /usr/lib/python2.5/lib-dynload/zlib.so b652e000-b6532000 r-xp 00000000 00:1a 553538 /h/44/jtaylor/build/matplotlib/lib/matplotlib/_cntr.so b6532000-b6533000 rw-p 00004000 00:1a 553538 /h/44/jtaylor/build/matplotlib/lib/matplotlib/_cntr.so b6533000-b6577000 r-xp 00000000 00:1a 553541 /h/44/jtaylor/build/matplotlib/lib/matplotlib/_path.so b6577000-b6578000 rw-p 00044000 00:1a 553541 /h/44/jtaylor/build/matplotlib/lib/matplotlib/_path.so b6578000-b6587000 r-xp 00000000 08:06 92419 /usr/lib/python2.5/lib-dynload/datetime.so b6587000-b658a000 rw-p 0000e000 08:06 92419 /usr/lib/python2.5/lib-dynload/datetime.so b658a000-b65b9000 r-xp 00000000 00:1a 532925 /h/44/jtaylor/lib/python2.5/site-packages/numpy/random/mtrand.so b65b9000-b65cb000 rw-p 0002e000 00:1a 532925 /h/44/jtaylor/lib/python2.5/site-packages/numpy/random/mtrand.so b65cb000-b6923000 r-xp 00000000 08:06 517267 /usr/lib/atlas/libblas.so.3.0 b6923000-b6927000 rw-p 00358000 08:06 517267 /usr/lib/atlas/libblas.so.3.0 b6927000-b6e6f000 r-xp 00000000 08:06 517268 /usr/lib/atlas/liblapack.so.3.0 b6e6f000-b6e72000 rw-p 00548000 08:06 517268 /usr/lib/atlas/liblapack.so.3.0 b6e72000-b6f76000 rw-p b6e72000 00:00 0 b6f76000-b6f7a000 r-xp 00000000 08:06 85530 /usr/lib/libXdmcp.so.6.0.0 b6f7a000-b6f7b000 rw-p 00003000 08:06 85530 /usr/lib/libXdmcp.so.6.0.0 b6f7b000-b6f7f000 r-xp 00000000 08:06 92437 /usr/lib/python2.5/lib-dynload/_csv.so b6f7f000-b6f81000 rw-p 00004000 08:06 92437 /usr/lib/python2.5/lib-dynload/_csv.so b6f81000-b6f84000 r-xp 00000000 08:06 92427 /usr/lib/python2.5/lib-dynload/_locale.so b6f84000-b6f85000 rw-p 00003000 08:06 92427 /usr/lib/python2.5/lib-dynload/_locale.so b6f85000-b6f8e000 r-xp 00000000 00:1a 533113 /h/44/jtaylor/lib/python2.5/site-packages/numpy/fft/fftpack_lite.so b6f8e000-b6f8f000 rw-p 00008000 00:1a 533113 /h/44/jtaylor/lib/python2.5/site-packages/numpy/fft/fftpack_lite.so b6f8f000-b6fad000 r-xp 00000000 00:1a 533028 /h/44/jtaylor/lib/python2.5/site-packages/numpy/core/scalarmath.so b6fad000-b6fae000 rw-p 0001e000 00:1a 533028 /h/44/jtaylor/lib/python2.Aborted -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Mon Aug 17 13:55:08 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 17 Aug 2009 13:55:08 -0400 Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq. 
In-Reply-To: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com> References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com> Message-ID: <1cd32cbb0908171055i1023138dx7861f32a9fe88b33@mail.gmail.com> On Mon, Aug 17, 2009 at 1:42 PM, Jonathan Taylor wrote: > Hi, > > I am getting a strange crash in numpy.linalg.lstsq.? I have put the code > that causes the crash along with two data files on my website at: > > http://www.cs.toronto.edu/~jtaylor/crash/ > > I would be interested to know if this bug can be duplicated and/or if anyone > has any suggestions as to why: > > import numpy as np > A = np.load('A.npy') > b = np.load('b.npy') > rc = np.linalg.lstsq(A,b) > > produces: > > *** glibc detected *** /usr/bin/python: free(): invalid next size (normal): > 0x091793c0 *** > ======= Backtrace: ========= > /lib/tls/i686/cmov/libc.so.6[0xb7dc7a85] > /lib/tls/i686/cmov/libc.so.6(cfree+0x90)[0xb7dcb4f0] > /u/jtaylor/lib/python2.5/site-packages/numpy/core/multiarray.so[0xb795403e] > /usr/bin/python[0x811247a] > /usr/bin/python(PyEval_EvalCodeEx+0x323)[0x80cae33] > /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python(PyEval_EvalCode+0x57)[0x80cb347] > /usr/bin/python(PyRun_FileExFlags+0xf8)[0x80ea818] > /usr/bin/python[0x80c1f5a] > /usr/bin/python(PyObject_Call+0x27)[0x805cb97] > /usr/bin/python(PyEval_EvalFrameEx+0x4064)[0x80c7e04] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python[0x8113696] > /usr/bin/python(PyObject_Call+0x27)[0x805cb97] > /usr/bin/python(PyEval_EvalFrameEx+0x4064)[0x80c7e04] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python[0x8113696] > /usr/bin/python(PyObject_Call+0x27)[0x805cb97] > /usr/bin/python[0x8062bfb] > /usr/bin/python(PyObject_Call+0x27)[0x805cb97] > /usr/bin/python(PyEval_EvalFrameEx+0x3d07)[0x80c7aa7] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] > /usr/bin/python(PyEval_EvalFrameEx+0x5945)[0x80c96e5] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python(PyEval_EvalFrameEx+0x6d09)[0x80caaa9] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] > /usr/bin/python(PyEval_EvalFrameEx+0x5945)[0x80c96e5] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python(PyEval_EvalFrameEx+0x565e)[0x80c93fe] > /usr/bin/python(PyEval_EvalCodeEx+0x6e7)[0x80cb1f7] > /usr/bin/python(PyEval_EvalCode+0x57)[0x80cb347] > /usr/bin/python(PyRun_FileExFlags+0xf8)[0x80ea818] > /usr/bin/python(PyRun_SimpleFileExFlags+0x199)[0x80eaab9] > /usr/bin/python(Py_Main+0xa35)[0x8059335] > /usr/bin/python(main+0x22)[0x80587f2] > /lib/tls/i686/cmov/libc.so.6(__libc_start_main+0xe0)[0xb7d72450] > /usr/bin/python[0x8058761] > ======= Memory map: ======== > 08048000-08140000 r-xp 00000000 08:06 83501????? /usr/bin/python2.5 > 08140000-08165000 rw-p 000f7000 08:06 83501????? /usr/bin/python2.5 > 08165000-0919a000 rw-p 08165000 00:00 0????????? 
[heap] > b5200000-b5221000 rw-p b5200000 00:00 0 > b5221000-b5300000 ---p b5221000 00:00 0 > b53fc000-b5499000 r-xp 00000000 00:1a 552170 > /h/44/jtaylor/lib/python2.5/site-packages/Cython/Compiler/Parsing.so > b5499000-b54a2000 rw-p 0009d000 00:1a 552170 > /h/44/jtaylor/lib/python2.5/site-packages/Cython/Compiler/Parsing.so > b54a2000-b5624000 rw-p b54a2000 00:00 0 > b5624000-b568f000 r-xp 00000000 00:1a 553542 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/backends/_backend_agg.so > b568f000-b5691000 rw-p 0006a000 00:1a 553542 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/backends/_backend_agg.so > b5691000-b56f6000 r-xp 00000000 08:06 90831 > /usr/lib/python2.5/lib-dynload/unicodedata.so > b56f6000-b5705000 rw-p 00065000 08:06 90831 > /usr/lib/python2.5/lib-dynload/unicodedata.so > b5705000-b5725000 r-xp 00000000 00:1a 553545 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/backends/_tkagg.so > b5725000-b5726000 rw-p 00020000 00:1a 553545 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/backends/_tkagg.so > b5726000-b5727000 ---p b5726000 00:00 0 > b5727000-b5f27000 rwxp b5727000 00:00 0 > b5f27000-b5f3e000 r-xp 00000000 08:06 85532????? /usr/lib/libxcb.so.1.0.0 > b5f3e000-b5f3f000 rw-p 00016000 08:06 85532????? /usr/lib/libxcb.so.1.0.0 > b5f3f000-b5f53000 r-xp 00000000 08:06 1187870 > /lib/tls/i686/cmov/libnsl-2.7.so > b5f53000-b5f55000 rw-p 00013000 08:06 1187870 > /lib/tls/i686/cmov/libnsl-2.7.so > b5f55000-b5f57000 rw-p b5f55000 00:00 0 > b5f57000-b603b000 r-xp 00000000 08:06 85536????? /usr/lib/libX11.so.6.2.0 > b603b000-b603e000 rw-p 000e4000 08:06 85536????? /usr/lib/libX11.so.6.2.0 > b603e000-b60e7000 r-xp 00000000 08:06 85098????? /usr/lib/libtcl8.4.so.0 > b60e7000-b60f1000 rw-p 000a8000 08:06 85098????? /usr/lib/libtcl8.4.so.0 > b60f1000-b60f2000 rw-p b60f1000 00:00 0 > b60f2000-b61c4000 r-xp 00000000 08:06 85102????? /usr/lib/libtk8.4.so.0 > b61c4000-b61cf000 rw-p 000d2000 08:06 85102????? /usr/lib/libtk8.4.so.0 > b61cf000-b61d0000 rw-p b61cf000 00:00 0 > b61d0000-b62a8000 r-xp 00000000 08:06 85103????? /usr/lib/libBLT.2.4.so.8.4 > b62a8000-b62b9000 rw-p 000d8000 08:06 85103????? /usr/lib/libBLT.2.4.so.8.4 > b62b9000-b62ba000 rw-p b62b9000 00:00 0 > b62ba000-b62dc000 r-xp 00000000 08:06 180469???? /usr/lib/libpng12.so.0.15.0 > b62dc000-b62dd000 rw-p 00022000 08:06 180469???? /usr/lib/libpng12.so.0.15.0 > b62f5000-b62f6000 rw-p b62f5000 00:00 0 > b62f6000-b631d000 r-xp 00000000 00:1a 553544 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/_png.so > b631d000-b631e000 rw-p 00027000 00:1a 553544 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/_png.so > b631e000-b6367000 r-xp 00000000 00:1a 553543 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/_image.so > b6367000-b6369000 rw-p 00049000 00:1a 553543 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/_image.so > b6369000-b63d3000 r-xp 00000000 08:06 83795 > /usr/lib/libfreetype.so.6.3.16 > b63d3000-b63d6000 rw-p 0006a000 08:06 83795 > /usr/lib/libfreetype.so.6.3.16 > b63d6000-b6424000 r-xp 00000000 00:1a 553535 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/ft2font.so > b6424000-b6427000 rw-p 0004e000 00:1a 553535 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/ft2font.so > b6427000-b650f000 r-xp 00000000 08:06 88506????? /usr/lib/libstdc++.so.6.0.9 > b650f000-b6512000 r--p 000e8000 08:06 88506????? /usr/lib/libstdc++.so.6.0.9 > b6512000-b6514000 rw-p 000eb000 08:06 88506????? 
/usr/lib/libstdc++.so.6.0.9 > b6514000-b651a000 rw-p b6514000 00:00 0 > b651e000-b6528000 r-xp 00000000 08:06 313979 > /usr/lib/python2.5/lib-dynload/_tkinter.so > b6528000-b6529000 rw-p 0000a000 08:06 313979 > /usr/lib/python2.5/lib-dynload/_tkinter.so > b6529000-b652d000 r-xp 00000000 08:06 90832 > /usr/lib/python2.5/lib-dynload/zlib.so > b652d000-b652e000 rw-p 00004000 08:06 90832 > /usr/lib/python2.5/lib-dynload/zlib.so > b652e000-b6532000 r-xp 00000000 00:1a 553538 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/_cntr.so > b6532000-b6533000 rw-p 00004000 00:1a 553538 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/_cntr.so > b6533000-b6577000 r-xp 00000000 00:1a 553541 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/_path.so > b6577000-b6578000 rw-p 00044000 00:1a 553541 > /h/44/jtaylor/build/matplotlib/lib/matplotlib/_path.so > b6578000-b6587000 r-xp 00000000 08:06 92419 > /usr/lib/python2.5/lib-dynload/datetime.so > b6587000-b658a000 rw-p 0000e000 08:06 92419 > /usr/lib/python2.5/lib-dynload/datetime.so > b658a000-b65b9000 r-xp 00000000 00:1a 532925 > /h/44/jtaylor/lib/python2.5/site-packages/numpy/random/mtrand.so > b65b9000-b65cb000 rw-p 0002e000 00:1a 532925 > /h/44/jtaylor/lib/python2.5/site-packages/numpy/random/mtrand.so > b65cb000-b6923000 r-xp 00000000 08:06 517267 > /usr/lib/atlas/libblas.so.3.0 > b6923000-b6927000 rw-p 00358000 08:06 517267 > /usr/lib/atlas/libblas.so.3.0 > b6927000-b6e6f000 r-xp 00000000 08:06 517268 > /usr/lib/atlas/liblapack.so.3.0 > b6e6f000-b6e72000 rw-p 00548000 08:06 517268 > /usr/lib/atlas/liblapack.so.3.0 > b6e72000-b6f76000 rw-p b6e72000 00:00 0 > b6f76000-b6f7a000 r-xp 00000000 08:06 85530????? /usr/lib/libXdmcp.so.6.0.0 > b6f7a000-b6f7b000 rw-p 00003000 08:06 85530????? /usr/lib/libXdmcp.so.6.0.0 > b6f7b000-b6f7f000 r-xp 00000000 08:06 92437 > /usr/lib/python2.5/lib-dynload/_csv.so > b6f7f000-b6f81000 rw-p 00004000 08:06 92437 > /usr/lib/python2.5/lib-dynload/_csv.so > b6f81000-b6f84000 r-xp 00000000 08:06 92427 > /usr/lib/python2.5/lib-dynload/_locale.so > b6f84000-b6f85000 rw-p 00003000 08:06 92427 > /usr/lib/python2.5/lib-dynload/_locale.so > b6f85000-b6f8e000 r-xp 00000000 00:1a 533113 > /h/44/jtaylor/lib/python2.5/site-packages/numpy/fft/fftpack_lite.so > b6f8e000-b6f8f000 rw-p 00008000 00:1a 533113 > /h/44/jtaylor/lib/python2.5/site-packages/numpy/fft/fftpack_lite.so > b6f8f000-b6fad000 r-xp 00000000 00:1a 533028 > /h/44/jtaylor/lib/python2.5/site-packages/numpy/core/scalarmath.so > b6fad000-b6fae000 rw-p 0001e000 00:1a 533028 > /h/44/jtaylor/lib/python2.Aborted > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > no problem here, with official Windows numpy Josef >>> np.version.version '1.3.0' >python -i why_crash.py >>> print rc (array([ -5.23462841, -4.85584394, -2.99233015, -7.54676368, -10.15455332, 7.074554 , 8.4043877 , 2.79661679, 3.41336578, 5.29202285, 2.70716181, 15.53449435, 9.34557621, 17.32209602, 18.16994838, -50.98017437, -50.96547959, -17.51283078, 7.68637678, 13.53704022, 20.66278929, -23.64368007, -4.70942583, 18.2568222 , 2.45709374, 12.97156815, 15.47026211, -44.93348725, 7.7558192 , -39.13996758, 1.20214959, 23.61872159, -20.21198664, -7.87137325, -4.20255668, -45.24948722, 12.49507108, 24.22157348, 23.46404032, 18.62294373, -26.31401828, 24.35842929, -37.5578372 , 18.24079679, 28.90693972, -40.40246853, 23.85976491, 11.70965078, 17.38628028, 6.14989021, 0.19683346, 11.57781284, -6.70961655, 
-21.98525308, -11.30257635, 31.16804751, 5.08794164, 0.26279222, -27.78390652, -26.3151511 , 14.89172102, 29.02572416, -10.84227516, 3.20577699, -34.73738042, 24.90588989, 37.92166034, -30.30146211, 37.28852751, -16.03146259, -30.87415056, -33.02832669, -21.63514384, 11.15711455, 10.43855884, -7.08345237, 31.50460928, -28.64336727, -12.32269443, -24.59112645, 41.71351395, -29.85091349, -4.07409268, 0.82708638, 14.67839587, 41.58165228, -29.44030397, 31.13279856, -28.46626932, 31.21863319, -30.50159697, -6.26718832, -26.41654876, -2.42547434, 44.00738912, -10.94028372, -0.65862359, -25.08227995, -26.04263867, 13.25529043, -7.41115206, 36.11891076, 47.22737694, 23.39250661, -16.59126536, 37.75596345, 12.59698144, 9.15952276, -22.0567611 , -27.79573887, -30.57535286, 28.71831817, -21.38243352, 19.30944773, 49.81583705, -19.59172648]), array([ 1063.81 458595]), 116, array([ 10.77032961, 3.02162267, 3.02054405, 3.0010756 , 2.96191492, 2.94807426, 2.94230063, 2.93906657, 2.92832506, 2.91399677, 2.88159001, 2.86294336, 2.85790349, 2.84497487, 2.82744239, 2.81275744, 2.78836986, 2.77119523, 2.76422221, 2.75861982, 2.75015801, 2.72908307, 2.68445243, 2.67800314, 2.666536 , 2.65671856, 2.64826304, 2.63879427, 2.6296631 , 2.60120053, 2.59118748, 2.58256916, 2.57264941, 2.56585886, 2.53898947, 2.53365513, 2.52103196, 2.49959127, 2.47968021, 2.46456052, 2.46068247, 2.44924031, 2.43199483, 2.41963211, 2.41515001, 2.40937849, 2.39016287, 2.3762653 , 2.35560428, 2.34357138, 2.3260469 , 2.30884773, 2.29027418, 2.27944481, 2.27465575, 2.25660949, 2.21410648, 2.20263598, 2.1791073 , 2.15789688, 2.14225592, 2.13043072, 2.09846149, 2.07491627, 2.06112946, 2.04336228, 2.02056257, 1.99107297, 1.98856298, 1.97039638, 1.9575191 , 1.93587212, 1.91997992, 1.85665009, 1.84338407, 1.79610228, 1.79328928, 1.78429932, 1.74123465, 1.7241243 , 1.7010803 , 1.64746663, 1.62765943, 1.62303706, 1.61800823, 1.60531761, 1.52425119, 1.50620662, 1.485018 , 1.45765932, 1.40861388, 1.39268607, 1.3483904 , 1.32025766, 1.31350522, 1.28517948, 1.25950863, 1.23770526, 1.18665953, 1.15504454, 1.14088912, 1.11336858, 1.01682096, 0.9791356 , 0.93161774, 0.90834728, 0.8611552 , 0.82261935, 0.79141265, 0.64055544, 0.60890393, 0.58578707, 0.4948037 , 0.38776132, 0.35580931, 0.20854201])) >>> From matthew.brett at gmail.com Mon Aug 17 14:11:46 2009 From: matthew.brett at gmail.com (Matthew Brett) Date: Mon, 17 Aug 2009 11:11:46 -0700 Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq. In-Reply-To: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com> References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com> Message-ID: <1e2af89e0908171111h608491f8t35aae86f239c5b57@mail.gmail.com> Hi Jonathan, > http://www.cs.toronto.edu/~jtaylor/crash/ > > I would be interested to know if this bug can be duplicated and/or if anyone > has any suggestions as to why: > > import numpy as np > A = np.load('A.npy') > b = np.load('b.npy') > rc = np.linalg.lstsq(A,b) > > produces: > > *** glibc detected *** /usr/bin/python: free(): invalid next size (normal): > 0x091793c0 *** I just tried it on 4 ubuntu machines, and one Fedora 11 machine, in various states of numpy-ness (including recent SVN) with no crash. What versions of stuff do you have over there? See you, Matthew From charlesr.harris at gmail.com Mon Aug 17 15:12:06 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 17 Aug 2009 13:12:06 -0600 Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq. 
In-Reply-To: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com> References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com> Message-ID: On Mon, Aug 17, 2009 at 11:42 AM, Jonathan Taylor < jonathan.taylor at utoronto.ca> wrote: > Hi, > > I am getting a strange crash in numpy.linalg.lstsq. I have put the code > that causes the crash along with two data files on my website at: > > http://www.cs.toronto.edu/~jtaylor/crash/ > > I would be interested to know if this bug can be duplicated and/or if > anyone has any suggestions as to why: > Usually these problems are due to ATLAS. If you are using ATLAS, what is your OS/distribution? What hardware are you running on? Did you build ATLAS yourself? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From jonathan.taylor at utoronto.ca Mon Aug 17 15:43:19 2009 From: jonathan.taylor at utoronto.ca (Jonathan Taylor) Date: Mon, 17 Aug 2009 15:43:19 -0400 Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq. In-Reply-To: References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com> Message-ID: <463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com> Hi, I am using a computer that is administered. It is an intel Ubuntu box and came with an ATLAS compiled. I thus compiled my own numpy1.3.0 against that ATLAS. I was thinking about recompiling ATLAS myself. This machine only has g77 and not gfortran on it. Will that still work? Thanks, Jonathan. On Mon, Aug 17, 2009 at 3:12 PM, Charles R Harris wrote: > > > On Mon, Aug 17, 2009 at 11:42 AM, Jonathan Taylor wrote: >> >> Hi, >> >> I am getting a strange crash in numpy.linalg.lstsq.? I have put the code that causes the crash along with two data files on my website at: >> >> http://www.cs.toronto.edu/~jtaylor/crash/ >> >> I would be interested to know if this bug can be duplicated and/or if anyone has any suggestions as to why: > > Usually these problems are due to ATLAS. If you are using ATLAS, what is your OS/distribution? What hardware are you running on? Did you build ATLAS yourself? > > Chuck > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From reckoner at gmail.com Mon Aug 17 16:28:26 2009 From: reckoner at gmail.com (Reckoner) Date: Mon, 17 Aug 2009 13:28:26 -0700 Subject: [Numpy-discussion] ImportError: No module named multiarray Message-ID: Hi, I created a pickled file on my Windows PC, uploaded to a Linux machine and then received the following error: Python 2.5.4 (r254:67916, Feb 5 2009, 19:52:35) [GCC 4.1.2 20071124 (Red Hat 4.1.2-42)] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> import cPickle >>> cPickle.load(open('tst.pkl')) Traceback (most recent call last): File "", line 1, in ImportError: No module named multiarray Obviously, the pickled file loads fine on the Windows PC. 
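A hedged aside on the multiarray ImportError above: that message means the pickle stream contains a GLOBAL reference to a top-level module named multiarray, which Numeric-era arrays used; nothing in the report confirms that is what tst.pkl contains, so check first. A sketch of the diagnosis and of one workaround that applies only if the bare module name really is what the pickle wants:

# Inspect which modules the pickle asks for (look for GLOBAL opcodes):
import pickletools
pickletools.dis(open('tst.pkl', 'rb').read())

# Workaround sketch -- assumes the pickle references a top-level
# 'multiarray' module and that numpy's module is a compatible stand-in:
import sys
import numpy.core.multiarray
sys.modules['multiarray'] = numpy.core.multiarray

import cPickle
obj = cPickle.load(open('tst.pkl', 'rb'))
print type(obj)

If pickletools shows the pickle referencing numpy.core.multiarray instead, the aliasing step is unnecessary and the problem lies elsewhere.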
the following is the result of numpy.test() >>> numpy.test() Running unit tests for numpy NumPy version 1.2.1 NumPy is installed in /nfs/02/reckoner/Starburst/lib/python2.5/site-packages/numpy Python version 2.5.4 (r254:67916, Feb 5 2009, 19:52:35) [GCC 4.1.2 20071124 (Red Hat 4.1.2-42)] nose version 0.10.3 ..........................................................................................................................................................................................................................................................................................................................................................................................................................................F................K................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................ ====================================================================== FAIL: test_umath.TestComplexFunctions.test_against_cmath ---------------------------------------------------------------------- Traceback (most recent call last): File "/nfs/02/reckoner/Starburst/lib/python2.5/site-packages/nose-0.10.3-py2.5.egg/nose/case.py", line 182, in runTest self.test(*self.arg) File "/nfs/02/reckoner/Starburst/lib/python2.5/site-packages/numpy/core/tests/test_umath.py", line 268, in test_against_cmath assert abs(a - b) < atol, "%s %s: %s; cmath: %s"%(fname,p,a,b) AssertionError: arcsinh -2j: (-1.31695789692-1.57079632679j); cmath: (1.31695789692-1.57079632679j) ---------------------------------------------------------------------- Ran 1740 tests in 10.493s FAILED (KNOWNFAIL=1, failures=1) Any help appreciated. From charlesr.harris at gmail.com Mon Aug 17 16:38:21 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 17 Aug 2009 14:38:21 -0600 Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq. In-Reply-To: <463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com> References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com> <463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com> Message-ID: On Mon, Aug 17, 2009 at 1:43 PM, Jonathan Taylor < jonathan.taylor at utoronto.ca> wrote: > Hi, > > I am using a computer that is administered. It is an intel Ubuntu box > and came with an ATLAS compiled. 
From charlesr.harris at gmail.com Mon Aug 17 16:38:21 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Mon, 17 Aug 2009 14:38:21 -0600
Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq.
In-Reply-To: <463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com>
References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com>
	<463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com>
Message-ID: 

On Mon, Aug 17, 2009 at 1:43 PM, Jonathan Taylor
<jonathan.taylor at utoronto.ca> wrote:

> Hi,
>
> I am using a computer that is administered. It is an Intel Ubuntu box
> and came with an ATLAS compiled. I thus compiled my own numpy 1.3.0
> against that ATLAS.
>
> I was thinking about recompiling ATLAS myself. This machine only has
> g77 and not gfortran on it. Will that still work?

As long as everything is consistent it should. Ubuntu has had some issues
with ATLAS and I suspect that is what you are seeing. David Cournapeau
could tell you more.

Chuck

From jonathan.taylor at utoronto.ca Mon Aug 17 16:50:00 2009
From: jonathan.taylor at utoronto.ca (Jonathan Taylor)
Date: Mon, 17 Aug 2009 16:50:00 -0400
Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq.
In-Reply-To: 
References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com>
	<463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com>
Message-ID: <463e11f90908171350k619b7b00md3867bf3faa3ad24@mail.gmail.com>

I compiled lapack and atlas from scratch using g77, but now
numpy.test() hangs when I try to use any numpy functionality. I think
I saw someone else write about this. Is this a common problem?

Thanks,
Jonathan.

On Mon, Aug 17, 2009 at 4:38 PM, Charles R Harris wrote:
> [...]
> As long as everything is consistent it should. Ubuntu has had some issues
> with ATLAS and I suspect that is what you are seeing. David Cournapeau
> could tell you more.
>
> Chuck

From kwgoodman at gmail.com Mon Aug 17 17:03:54 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Mon, 17 Aug 2009 14:03:54 -0700
Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq.
In-Reply-To: <463e11f90908171350k619b7b00md3867bf3faa3ad24@mail.gmail.com>
References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com>
	<463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com>
	<463e11f90908171350k619b7b00md3867bf3faa3ad24@mail.gmail.com>
Message-ID: 

On Mon, Aug 17, 2009 at 1:50 PM, Jonathan Taylor wrote:
> I compiled lapack and atlas from scratch using g77, but now
> numpy.test() hangs when I try to use any numpy functionality. I think
> I saw someone else write about this. Is this a common problem?

Yes, it seems common. I know of 4 recent ATLAS builds (including mine
but not yours) that have failed on 32-bit systems. The recent
successes I have seen (including mine) have been on 64-bit systems.
But maybe 32/64 bit has nothing to do with it. I am sure there are
many 32-bit systems running a self-compiled ATLAS just fine.

Oh, you crash on any numpy functionality? I only crashed on ATLAS-type
problems.

From jonathan.taylor at utoronto.ca Mon Aug 17 17:13:40 2009
From: jonathan.taylor at utoronto.ca (Jonathan Taylor)
Date: Mon, 17 Aug 2009 17:13:40 -0400
Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq.
In-Reply-To: 
References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com>
	<463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com>
	<463e11f90908171350k619b7b00md3867bf3faa3ad24@mail.gmail.com>
Message-ID: <463e11f90908171413h427b7aecq5370c9cd6089523@mail.gmail.com>

Yes... ATLAS-type problems like matrix multiplication.

Is there some alternative to get a working numpy going? How might I
go about compiling numpy without ATLAS? I really need to get at least
something working temporarily.

Thanks,
Jon.

On Mon, Aug 17, 2009 at 5:03 PM, Keith Goodman wrote:
> [...]
> Oh, you crash on any numpy functionality? I only crashed on ATLAS-type
> problems.

From kwgoodman at gmail.com Mon Aug 17 17:21:26 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Mon, 17 Aug 2009 14:21:26 -0700
Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq.
In-Reply-To: <463e11f90908171413h427b7aecq5370c9cd6089523@mail.gmail.com>
References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com>
	<463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com>
	<463e11f90908171350k619b7b00md3867bf3faa3ad24@mail.gmail.com>
	<463e11f90908171413h427b7aecq5370c9cd6089523@mail.gmail.com>
Message-ID: 

On Mon, Aug 17, 2009 at 2:13 PM, Jonathan Taylor wrote:
> Is there some alternative to get a working numpy going? How might I
> go about compiling numpy without ATLAS? I really need to get at least
> something working temporarily.

Just build numpy again but skip the ATLAS steps.

From jonathan.taylor at utoronto.ca Mon Aug 17 17:27:22 2009
From: jonathan.taylor at utoronto.ca (Jonathan Taylor)
Date: Mon, 17 Aug 2009 17:27:22 -0400
Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq.
In-Reply-To: 
References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com>
	<463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com>
	<463e11f90908171350k619b7b00md3867bf3faa3ad24@mail.gmail.com>
	<463e11f90908171413h427b7aecq5370c9cd6089523@mail.gmail.com>
Message-ID: <463e11f90908171427g1257ed41yd72a9c6f813c42f2@mail.gmail.com>

It seems to automatically detect it, though. Specifically,
lapack_lite.so always seems to reference libatlas.

On Mon, Aug 17, 2009 at 5:21 PM, Keith Goodman wrote:
> [...]
> Just build numpy again but skip the ATLAS steps.
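For what it's worth, a quick way to see what a given numpy build actually
linked against is to run ldd on the lapack_lite extension (a sketch; the
install path below is illustrative):

cd /usr/lib/python2.5/site-packages/numpy/linalg
ldd lapack_lite.so | grep -i atlas    # no output means no ATLAS linkage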
From kwgoodman at gmail.com Mon Aug 17 17:34:51 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Mon, 17 Aug 2009 14:34:51 -0700
Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq.
In-Reply-To: <463e11f90908171427g1257ed41yd72a9c6f813c42f2@mail.gmail.com>
References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com>
	<463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com>
	<463e11f90908171350k619b7b00md3867bf3faa3ad24@mail.gmail.com>
	<463e11f90908171413h427b7aecq5370c9cd6089523@mail.gmail.com>
	<463e11f90908171427g1257ed41yd72a9c6f813c42f2@mail.gmail.com>
Message-ID: 

On Mon, Aug 17, 2009 at 2:27 PM, Jonathan Taylor wrote:
> It seems to automatically detect it, though. Specifically,
> lapack_lite.so always seems to reference libatlas.
>
> [...]
>> Just build numpy again but skip the ATLAS steps.

Yes, sorry. The only way I've tried doing it is uninstalling the
Ubuntu ATLAS binary. I don't know how to ignore it if it is installed.

From jonathan.taylor at utoronto.ca Mon Aug 17 18:25:37 2009
From: jonathan.taylor at utoronto.ca (Jonathan Taylor)
Date: Mon, 17 Aug 2009 18:25:37 -0400
Subject: [Numpy-discussion] How to compile numpy without ATLAS support?
Message-ID: <463e11f90908171525r9a2da5v20fa4a675a4491c8@mail.gmail.com>

I am wondering how I might be able to compile numpy without ATLAS on an
Ubuntu machine that has an ATLAS deb installed. It seems that the
numpy build routine automatically detects it.

Thanks for any help,
Jonathan.

From liukis at usc.edu Mon Aug 17 23:13:43 2009
From: liukis at usc.edu (Maria Liukis)
Date: Mon, 17 Aug 2009 20:13:43 -0700
Subject: [Numpy-discussion] Indexing empty array with empty boolean array
	causes "IndexError: invalid index exception"
In-Reply-To: <3d375d730908120925s67c4f23cp643b96f5adc1696c@mail.gmail.com>
References: <7F7802B9-BD2F-4350-99B6-708D140089C8@usc.edu>
	<3d375d730908120925s67c4f23cp643b96f5adc1696c@mail.gmail.com>
Message-ID: 

On Aug 12, 2009, at 9:25 AM, Robert Kern wrote:

> On Mon, Aug 10, 2009 at 14:19, Maria Liukis wrote:
>> Hello everybody,
>> I'm using the following versions of the Scipy and Numpy packages:
>>>>> scipy.__version__
>> '0.7.1'
>>>>> np.__version__
>> '1.3.0'
>> My code uses a boolean array to filter a 2-dimensional array which
>> sometimes happens to be an empty array. It seems like I have to take
>> special care when the dimension I'm filtering is zero, otherwise I'm
>> getting an "IndexError: invalid index" exception:
>>>>> import numpy as np
>>>>> a = np.zeros((2,10))
>>>>> a
>> array([[ 1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.],
>>        [ 1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.]])
>
> If that were actually your output from zeros(), that would definitely
> be a bug. :-)

Sorry, I realized my copy-and-paste mistake a minute after I posted
the message. Obviously, it was too late :)

>>>>> filter_array = np.zeros(2,)
>>>>> filter_array
>> array([False, False], dtype=bool)
>>>>> a[filter_array,:]
>> array([], shape=(0, 10), dtype=float64)
>>
>> Now if the filtered dimension is zero:
>>>>> a = np.ones((0,10))
>>>>> a
>> array([], shape=(0, 10), dtype=float64)
>>>>> filter_array = np.zeros((0,), dtype=bool)
>>>>> filter_array
>> array([], dtype=bool)
>>>>> filter_array.shape
>> (0,)
>>>>> a.shape
>> (0, 10)
>>>>> a[filter_array,:]
>> Traceback (most recent call last):
>>   File "<stdin>", line 1, in <module>
>> IndexError: invalid index
>>
>> Would somebody know if it's expected behavior, a package bug, or am I
>> doing something wrong?
>
> I would call it a bug. It's a corner case that we should probably
> handle gracefully rather than raising an exception.

Thanks, Robert!
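Until that corner case is handled gracefully, one possible workaround is to
route the selection through integer indices, which behave consistently for
empty arrays. A sketch (not tested against every numpy version, so treat it
as such):

import numpy as np

a = np.ones((0, 10))
mask = np.zeros((0,), dtype=bool)
rows = np.flatnonzero(mask)   # empty integer index array
a[rows, :]                    # -> shape (0, 10), no IndexError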
From charlesr.harris at gmail.com Tue Aug 18 00:28:38 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Mon, 17 Aug 2009 22:28:38 -0600
Subject: [Numpy-discussion] How to compile numpy without ATLAS support?
In-Reply-To: <463e11f90908171525r9a2da5v20fa4a675a4491c8@mail.gmail.com>
References: <463e11f90908171525r9a2da5v20fa4a675a4491c8@mail.gmail.com>
Message-ID: 

On Mon, Aug 17, 2009 at 4:25 PM, Jonathan Taylor
<jonathan.taylor at utoronto.ca> wrote:

> I am wondering how I might be able to compile numpy without ATLAS on an
> Ubuntu machine that has an ATLAS deb installed. It seems that the
> numpy build routine automatically detects it.

Try

BLAS=None LAPACK=None ATLAS=None python setup.py ....

I haven't tried it myself but it is rumored to work ;)

Chuck
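Spelled out, a minimal from-scratch rebuild along those lines might look
like this (the source directory and install prefix are illustrative):

cd numpy-1.3.0
rm -rf build                  # make sure no ATLAS-flavored objects are reused
BLAS=None LAPACK=None ATLAS=None python setup.py build
python setup.py install --prefix=$HOME/local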
From liukis at usc.edu Tue Aug 18 00:30:46 2009
From: liukis at usc.edu (Maria Liukis)
Date: Mon, 17 Aug 2009 21:30:46 -0700
Subject: [Numpy-discussion] unique rows of array
Message-ID: 

Hello everybody,

While re-implementing some Matlab code in Python, I've run into the
problem of finding a NumPy function analogous to Matlab's
"unique(array, 'rows')" to get the unique rows of an array. Searching the
web, I found a similar discussion from a couple of years ago with an
example:

############## A SNIPPET FROM THE DISCUSSION
[Numpy-discussion] Finding unique rows in an array [Was: Finding a row
match within a numpy array]

A Tuesday 21 August 2007, Mark.Miller escrigué:
> A slightly related question on this topic...
>
> Is there a good loopless way to identify all of the unique rows in an
> array? Something like numpy.unique() is ideal, but capable of
> extracting unique subarrays along an axis.

You can always do a view of the rows as strings and then use unique().
Here is an example:

In [1]: import numpy
In [2]: a=numpy.arange(12).reshape(4,3)
In [3]: a[2]=(3,4,5)
In [4]: a
Out[4]:
array([[ 0,  1,  2],
       [ 3,  4,  5],
       [ 3,  4,  5],
       [ 9, 10, 11]])

now, create the view and select the unique rows:

In [5]: b=numpy.unique(a.view('S%d'%a.itemsize*a.shape[0])).view('i4')

and finally restore the shape:

In [6]: b.reshape((len(b)/a.shape[1], a.shape[1]))
Out[6]:
array([[ 0,  1,  2],
       [ 3,  4,  5],
       [ 9, 10, 11]])

If you want to find unique columns instead of rows, do a transpose first
on the initial array.
################ END OF DISCUSSION

The provided example works only because the array elements are row-sorted.
Changing the tested array (in my case, it's 'c'):

>>> c
array([[ 0,  1,  2],
       [ 3,  4,  5],
       [ 3,  4,  5],
       [ 9, 10, 11]])
>>> c[0] = (11, 10, 0)
>>> c
array([[11, 10,  0],
       [ 3,  4,  5],
       [ 3,  4,  5],
       [ 9, 10, 11]])
>>> b = np.unique(c.view('S%s' %c.itemsize*c.shape[0]))
>>> b
array(['', '\x03', '\x04', '\x05', '\t', '\n', '\x0b'],
      dtype='|S4')
>>> b.view('i4')
array([ 0,  3,  4,  5,  9, 10, 11])
>>> b.reshape((len(b)/c.shape[1], c.shape[1])).view('i4')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: total size of new array must be unchanged
>>>

since len(b) = 7.

The suggested approach would work if the whole row were converted to a
single string, I guess. But from what I could gather, numpy.array.view()
only changes the display element-wise. Before I start re-inventing the
wheel, I was just wondering whether one could find unique rows in an array
using existing numpy functionality.

Many thanks in advance!
Masha
--------------------
liukis at usc.edu
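For reference, the string-view trick from that old snippet only does what
was intended if each whole row is viewed as a single string, which means
the string width has to be itemsize times the number of columns, and the %
formatting needs parentheses. A sketch (it assumes a C-contiguous array,
and np.unique returns the rows in byte-sort order, not the original order,
so it is not a robust general solution):

import numpy as np

c = np.array([[11, 10,  0],
              [ 3,  4,  5],
              [ 3,  4,  5],
              [ 9, 10, 11]])
b = np.unique(c.view('S%d' % (c.itemsize * c.shape[1])))
b.view(c.dtype).reshape(-1, c.shape[1])   # one row per unique string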
From josef.pktd at gmail.com Tue Aug 18 00:44:37 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 18 Aug 2009 00:44:37 -0400
Subject: [Numpy-discussion] unique rows of array
In-Reply-To: 
References: 
Message-ID: <1cd32cbb0908172144u7b0c3846rb602bfb8c15ea552@mail.gmail.com>

On Tue, Aug 18, 2009 at 12:30 AM, Maria Liukis wrote:
> Hello everybody,
> While re-implementing some Matlab code in Python, I've run into the
> problem of finding a NumPy function analogous to Matlab's
> "unique(array, 'rows')" to get the unique rows of an array.
> [...]
> Before I start re-inventing the wheel, I was just wondering whether one
> could find unique rows in an array using existing numpy functionality.

One way is to convert to a structured array:

>>> c = np.array([[ 0,  1,  2],
                  [ 3,  4,  5],
                  [ 3,  4,  5],
                  [ 9, 10, 11]])

>>> np.unique1d(c.view([('',c.dtype)]*c.shape[1])).view(c.dtype).reshape(-1,c.shape[1])
array([[ 0,  1,  2],
       [ 3,  4,  5],
       [ 9, 10, 11]])

For an explanation, I asked a similar question last December about
"sortrows". (I never remember when I need the last reshape and when not.)

Josef

From charlesr.harris at gmail.com Tue Aug 18 00:51:44 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Mon, 17 Aug 2009 22:51:44 -0600
Subject: [Numpy-discussion] unique rows of array
In-Reply-To: 
References: 
Message-ID: 

On Mon, Aug 17, 2009 at 10:30 PM, Maria Liukis wrote:

> While re-implementing some Matlab code in Python, I've run into the
> problem of finding a NumPy function analogous to Matlab's
> "unique(array, 'rows')" to get the unique rows of an array.
> [...]

Just to be clear, do you mean finding all rows that only occur once in
the array?

Chuck

From liukis at usc.edu Tue Aug 18 00:59:40 2009
From: liukis at usc.edu (Maria Liukis)
Date: Mon, 17 Aug 2009 21:59:40 -0700
Subject: [Numpy-discussion] unique rows of array
In-Reply-To: <1cd32cbb0908172144u7b0c3846rb602bfb8c15ea552@mail.gmail.com>
References: <1cd32cbb0908172144u7b0c3846rb602bfb8c15ea552@mail.gmail.com>
Message-ID: 

Josef,

Thanks, I'll try that and will search for your question from last
December :)

Masha
--------------------
liukis at usc.edu

On Aug 17, 2009, at 9:44 PM, josef.pktd at gmail.com wrote:
> [...]
> One way is to convert to a structured array:
> [...]
> Josef

From liukis at usc.edu Tue Aug 18 00:59:42 2009
From: liukis at usc.edu (Maria Liukis)
Date: Mon, 17 Aug 2009 21:59:42 -0700
Subject: [Numpy-discussion] unique rows of array
In-Reply-To: 
References: 
Message-ID: <93B8B1E1-B25E-47E3-A8B7-4E8D781CDF4C@usc.edu>

On Aug 17, 2009, at 9:51 PM, Charles R Harris wrote:

> Just to be clear, do you mean finding all rows that only occur once in
> the array?

Yes.

From josef.pktd at gmail.com Tue Aug 18 01:03:35 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 18 Aug 2009 01:03:35 -0400
Subject: [Numpy-discussion] unique rows of array
In-Reply-To: <93B8B1E1-B25E-47E3-A8B7-4E8D781CDF4C@usc.edu>
References: <93B8B1E1-B25E-47E3-A8B7-4E8D781CDF4C@usc.edu>
Message-ID: <1cd32cbb0908172203h4af1f9f9m83af252debe2b700@mail.gmail.com>

On Tue, Aug 18, 2009 at 12:59 AM, Maria Liukis wrote:
> On Aug 17, 2009, at 9:51 PM, Charles R Harris wrote:
>> Just to be clear, do you mean finding all rows that only occur once in
>> the array?
>
> Yes.

I interpreted your question as removing duplicates: it keeps rows that
occur more than once. That's what my example is intended to do.

Josef

From josef.pktd at gmail.com Tue Aug 18 01:25:24 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 18 Aug 2009 01:25:24 -0400
Subject: [Numpy-discussion] unique rows of array
In-Reply-To: <1cd32cbb0908172203h4af1f9f9m83af252debe2b700@mail.gmail.com>
References: <93B8B1E1-B25E-47E3-A8B7-4E8D781CDF4C@usc.edu>
	<1cd32cbb0908172203h4af1f9f9m83af252debe2b700@mail.gmail.com>
Message-ID: <1cd32cbb0908172225y1183f17et9c33cae6925d3826@mail.gmail.com>

Just a reminder about views on views: I don't think the recommendation to
take the transpose to get unique columns works. We had the discussion some
time ago that views work on the original array data and not on the view,
and in this case the transpose creates a view. Example below.

Also, unique does a sort and doesn't preserve order.

Josef

>>> c=np.array([[ 10, 1, 2],
                [ 3, 4, 5],
                [ 3, 4, 5],
                [ 9, 10, 11]])
>>> cc = c.copy()  # backup
>>> c = cc.T
>>> cc
array([[10,  1,  2],
       [ 3,  4,  5],
       [ 3,  4,  5],
       [ 9, 10, 11]])
>>> np.unique1d(c.view([('',c.dtype)]*c.shape[1])).view(c.dtype).reshape(-1,c.shape[1])
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
    np.unique1d(c.view([('',c.dtype)]*c.shape[1])).view(c.dtype).reshape(-1,c.shape[1])
ValueError: new type not compatible with array.
>>> c = cc.T.copy()
>>> c
array([[10,  3,  3,  9],
       [ 1,  4,  4, 10],
       [ 2,  5,  5, 11]])
>>> np.unique1d(c.view([('',c.dtype)]*c.shape[1])).view(c.dtype).reshape(-1,c.shape[1])
array([[ 1,  4,  4, 10],
       [ 2,  5,  5, 11],
       [10,  3,  3,  9]])
>>> c = np.ascontiguousarray(cc.T)
>>> np.unique1d(c.view([('',c.dtype)]*c.shape[1])).view(c.dtype).reshape(-1,c.shape[1])
array([[ 1,  4,  4, 10],
       [ 2,  5,  5, 11],
       [10,  3,  3,  9]])

From liukis at usc.edu Tue Aug 18 01:44:44 2009
From: liukis at usc.edu (Maria Liukis)
Date: Mon, 17 Aug 2009 22:44:44 -0700
Subject: [Numpy-discussion] unique rows of array
In-Reply-To: <1cd32cbb0908172203h4af1f9f9m83af252debe2b700@mail.gmail.com>
References: <93B8B1E1-B25E-47E3-A8B7-4E8D781CDF4C@usc.edu>
	<1cd32cbb0908172203h4af1f9f9m83af252debe2b700@mail.gmail.com>
Message-ID: 

On Aug 17, 2009, at 10:03 PM, josef.pktd at gmail.com wrote:
> [...]
>>> Just to be clear, do you mean finding all rows that only occur once
>>> in the array?

Sorry, I think it shows that I should stop working past 10pm :)

>> Yes.
>
> I interpreted your question as removing duplicates: it keeps rows that
> occur more than once.

Yes, I meant keeping only unique (without duplicates) rows.

From liukis at usc.edu Tue Aug 18 02:01:22 2009
From: liukis at usc.edu (Maria Liukis)
Date: Mon, 17 Aug 2009 23:01:22 -0700
Subject: [Numpy-discussion] unique rows of array
In-Reply-To: <1cd32cbb0908172203h4af1f9f9m83af252debe2b700@mail.gmail.com>
References: <93B8B1E1-B25E-47E3-A8B7-4E8D781CDF4C@usc.edu>
	<1cd32cbb0908172203h4af1f9f9m83af252debe2b700@mail.gmail.com>
Message-ID: 

Josef,

Many thanks for the example! It should become an official NumPy recipe :)

Thanks again,
Masha
--------------------
liukis at usc.edu

From schut at sarvision.nl Tue Aug 18 04:22:03 2009
From: schut at sarvision.nl (Vincent Schut)
Date: Tue, 18 Aug 2009 10:22:03 +0200
Subject: [Numpy-discussion] add axis to results of reduction (mean, min, ...)
In-Reply-To: 
References: <1cd32cbb0908060855v3fec4524rfed715d60c741a0b@mail.gmail.com>
Message-ID: 

Keith Goodman wrote:
> On Thu, Aug 6, 2009 at 9:58 AM, Charles R Harris wrote:
>>
>> On Thu, Aug 6, 2009 at 9:55 AM, wrote:
>>> What's the best way of getting back the correct shape to be able to
>>> broadcast mean, min, ... to the original array, that works for
>>> arbitrary dimension and axis?
>>>
>>> I thought I have seen some helper functions, but I don't find them
>>> anymore?
>> Adding a keyword to retain the number of dimensions has been mooted. It
>> shouldn't be too difficult to implement and would allow things like:
>>
>>>>> scaled = a/a.max(1, reduce=0)
>>
>> I could do that for 1.4 if folks are interested.
>
> I'd use that. It's better than what I usually do:
>
> scaled = a / a.max(1).reshape(-1,1)

To chime in after returning from holidays: I'd use that keyword a great
deal. It would be more than welcome to me. I currently have loads of code
numpy.newaxis-ing the results of min/max/mean operations...

From eadrogue at gmx.net Tue Aug 18 07:01:50 2009
From: eadrogue at gmx.net (Ernest Adrogué)
Date: Tue, 18 Aug 2009 13:01:50 +0200
Subject: [Numpy-discussion] indexing problem
Message-ID: <20090818110150.GA13641@doriath.local>

Hi,

Suppose I have a 3-dimensional array, where one dimension is time. I'm
not particularly interested in selecting specific moments in time, so
most of the time I won't be indexing this dimension.

Intuitively, one would make time the third dimension, but if you do that
you have to specify the time in every index, which is annoying, because
then all indices must start with an empty slice, e.g.,

a[:,1]
a[:,:,3]
a[:,0,1]

etc. On the other hand, the arrays resulting from indexing have all
elements sorted by time, which is a good thing.

Then, if I change it and make time the first dimension, it's handy
because I can omit time in indices, BUT then the sub-arrays produced by
indexing are not sorted by time!

Is it possible to change the way numpy traverses the array, so that it
moves "less" on the first dimension (instead of the default, which is to
move less on the last dimension), so that I get arrays sorted by time
when time is not on the last dimension?

Thanks.

Ernest

From josef.pktd at gmail.com Tue Aug 18 10:00:27 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 18 Aug 2009 10:00:27 -0400
Subject: [Numpy-discussion] unique rows of array
In-Reply-To: 
References: <93B8B1E1-B25E-47E3-A8B7-4E8D781CDF4C@usc.edu>
	<1cd32cbb0908172203h4af1f9f9m83af252debe2b700@mail.gmail.com>
Message-ID: <1cd32cbb0908180700l141e4c47x12b40ea6da232785@mail.gmail.com>

On Tue, Aug 18, 2009 at 2:01 AM, Maria Liukis wrote:
> Josef,
> Many thanks for the example! It should become an official NumPy recipe :)

Actually, there is also an implementation of unique rows in
scipy.stats._support. It uses loops (and array concatenation in the
loop), but it preserves the order of the rows in the array.

In general, I don't recommend using scipy.stats._support, since many or
most functions are not tested and only some are used in scipy.stats.
These functions are waiting for a rewrite or removal. When I thought
about a rewrite last year, I didn't know much about structured arrays
and views.

Josef

>>> cc
array([[10,  1,  2],
       [ 3,  4,  5],
       [ 3,  4,  5],
       [ 9, 10, 11]])
>>> scipy.stats._support.unique(cc)
array([[10,  1,  2],
       [ 3,  4,  5],
       [ 9, 10, 11]])

unique columns using transpose:

>>> cct = cc.T.copy()
>>> cct
array([[10,  3,  3,  9],
       [ 1,  4,  4, 10],
       [ 2,  5,  5, 11]])
>>> scipy.stats._support.unique(cct.T).T
array([[10,  3,  9],
       [ 1,  4, 10],
       [ 2,  5, 11]])

Josef
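A loop-free variant that also preserves the original row order is possible
by combining the structured view above with unique1d's return_index (a
sketch; it assumes the indices come back as the second element, as they do
in numpy.unique(..., return_index=True) in later versions — check your
numpy, since unique1d's return conventions shifted around this era):

import numpy as np

c = np.array([[10,  1,  2],
              [ 3,  4,  5],
              [ 3,  4,  5],
              [ 9, 10, 11]])
cv = c.view([('', c.dtype)] * c.shape[1]).ravel()
uniq, idx = np.unique1d(cv, return_index=True)  # idx: one index per unique row
c[np.sort(idx)]   # unique rows, in their order of appearance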
From robert.kern at gmail.com Tue Aug 18 10:33:24 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 18 Aug 2009 07:33:24 -0700
Subject: [Numpy-discussion] indexing problem
In-Reply-To: <20090818110150.GA13641@doriath.local>
References: <20090818110150.GA13641@doriath.local>
Message-ID: <3d375d730908180733wdeb3a71sdc19bfce15b79720@mail.gmail.com>

2009/8/18 Ernest Adrogué:
> Hi,
>
> Suppose I have a 3-dimensional array, where one dimension is time. I'm
> not particularly interested in selecting specific moments in time, so
> most of the time I won't be indexing this dimension.
> [...]
> Then, if I change it and make time the first dimension, it's handy
> because I can omit time in indices, BUT then the sub-arrays produced by
> indexing are not sorted by time!

I do not know what you mean by "not sorted by time". You can keep the
sub-arrays sorted however you like regardless of the index used for time.
Can you show us an example of the problem you are seeing?

> Is it possible to change the way numpy traverses the array, so that it
> moves "less" on the first dimension (instead of the default, which is
> to move less on the last dimension), so that I get arrays sorted by
> time when time is not on the last dimension?

No, but I suspect that the problem you are seeing can be fixed in
another way.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From eadrogue at gmx.net Tue Aug 18 12:22:28 2009
From: eadrogue at gmx.net (Ernest Adrogué)
Date: Tue, 18 Aug 2009 18:22:28 +0200
Subject: [Numpy-discussion] indexing problem
In-Reply-To: <3d375d730908180733wdeb3a71sdc19bfce15b79720@mail.gmail.com>
References: <20090818110150.GA13641@doriath.local>
	<3d375d730908180733wdeb3a71sdc19bfce15b79720@mail.gmail.com>
Message-ID: <20090818162228.GA13911@doriath.local>

18/08/09 @ 07:33 (-0700), thus spake Robert Kern:
> I do not know what you mean by "not sorted by time". You can keep the
> sub-arrays sorted however you like regardless of the index used for
> time. Can you show us an example of the problem you are seeing?

Sorry for not explaining myself clearly enough.

I'm using masked arrays, and I call the compressed() method on the
sub-arrays resulting from indexing, which gives a 1-d array. It is this
1-d array that isn't sorted the way I'd like.

I'll try to explain this with an example. Here is a 3-d array that
represents a 2-d array at different moments in time (for illustration
purposes all elements in the 2-d array increase by one at each time
point):

In [35]: x=np.zeros((3,2,2))
In [36]: x[0]=1
In [37]: x[1]=2
In [38]: x[2]=3
In [39]: x
Out[39]:
array([[[ 1.,  1.],
        [ 1.,  1.]],

       [[ 2.,  2.],
        [ 2.,  2.]],

       [[ 3.,  3.],
        [ 3.,  3.]]])

Then if I take the elements [:,0] and flatten the resulting array, we
can see that the resulting array has its elements sorted by time:

In [40]: x[:,0].flatten()
Out[40]: array([ 1.,  1.,  2.,  2.,  3.,  3.])

But then I thought that it would be nice to arrange the data
differently, so that the dimension that represents time can be omitted
in the index. Therefore, I re-arrange the data in this way:

In [41]: x=np.zeros((2,2,3))
In [42]: x[:,:,0]=1
In [43]: x[:,:,1]=2
In [44]: x[:,:,2]=3
In [46]: x
Out[46]:
array([[[ 1.,  2.,  3.],
        [ 1.,  2.,  3.]],

       [[ 1.,  2.,  3.],
        [ 1.,  2.,  3.]]])

But then, the flattened arrays I get are no longer in time-ascending
order:

In [45]: x[0].flatten()
Out[45]: array([ 1.,  2.,  3.,  1.,  2.,  3.])

It's a bit difficult to explain, but I hope it's more clear now!

Ernest

From robert.kern at gmail.com Tue Aug 18 12:27:21 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 18 Aug 2009 09:27:21 -0700
Subject: [Numpy-discussion] indexing problem
In-Reply-To: <20090818162228.GA13911@doriath.local>
References: <20090818110150.GA13641@doriath.local>
	<3d375d730908180733wdeb3a71sdc19bfce15b79720@mail.gmail.com>
	<20090818162228.GA13911@doriath.local>
Message-ID: <3d375d730908180927v6db90fb7id85a4987b31652f3@mail.gmail.com>

2009/8/18 Ernest Adrogué:
> [...]
> But then, the flattened arrays I get are no longer in time-ascending
> order:
>
> In [45]: x[0].flatten()
> Out[45]: array([ 1.,  2.,  3.,  1.,  2.,  3.])
>
> It's a bit difficult to explain, but I hope it's more clear now!

x[0].T.flatten()

--
Robert Kern
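The same idea generalizes past two dimensions: move the time axis to the
front before flattening (or before compressed(), for masked arrays), so
that time becomes the slowest-varying axis. A sketch using rollaxis, which
only creates a view:

import numpy as np

z = np.zeros((2, 2, 3))        # two spatial axes, time last
for t in range(3):
    z[..., t] = t + 1

np.rollaxis(z, 2).flatten()    # time axis moved to the front
# -> array([ 1., 1., 1., 1., 2., 2., 2., 2., 3., 3., 3., 3.])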
From noagbodjivictor at gmail.com Tue Aug 18 13:11:14 2009
From: noagbodjivictor at gmail.com (Victor Noagbodji)
Date: Tue, 18 Aug 2009 13:11:14 -0400
Subject: [Numpy-discussion] how can i optimize this function further more?
Message-ID: 

hello,

i'm fairly new to numpy. i need help with a snow effect done with pygame.
the entire code is below. the performance drops in the snowfall function.
the original c code and a demo can be found here:
http://sol.gfxile.net/gp/ch04.html

as you can see, the original c code went the pixel-by-pixel way, but i
couldn't do that with pygame. i'm asking here because i think there might
be a way numpy could help. the idea of the snowfall function is to move
each white pixel one line down (while avoiding the green ground, thus the
test array1-array2 == white).

thanks a lot in advance for your help.

ps. sorry for the ugly code. this is the result of several optimizations,
plus help from the pygame mailing list.
import numpy
import cProfile

from math import sin, cos
from random import randint
from sys import exit, stdout

import pygame
import pygame.event as event
import pygame.display as display
from pygame.locals import *
from pygame import Surface
from pygame.time import Clock, get_ticks
from pygame.surfarray import pixels2d, blit_array, make_surface

WIDTH = 640
HEIGHT = 480
RESOLUTION = (WIDTH, HEIGHT)

def init(screen_array, green=int(0x007f00)):
    lowest_level = 0
    for i in xrange(WIDTH):
        sins = (sin((i + 3247) * 0.02) * 0.3 +
                sin((i + 2347) * 0.04) * 0.1 +
                sin((i + 4378) * 0.01) * 0.6)
        p = int(sins * 100 + 380)
        lowest_level = max(p, lowest_level)
        for j in range(p, HEIGHT):
            screen_array[i, j] = green
    return lowest_level

def newsnow(screen_array, white=int(0xffffff), density=1):
    new_indices = numpy.array([randint(1, WIDTH-2) for i in xrange(density)])
    screen_array[new_indices, 0] = white

def snowfall(screen_array, white=int(0xffffff), fallrng=xrange(HEIGHT-2, -1, -1)):
    screen_array = numpy.transpose(screen_array)
    snow_in_next_layer = numpy.zeros(WIDTH, dtype=int)
    for j in fallrng:
        array1 = screen_array[j]
        array2 = screen_array[j+1]
        indices_where_snow_moved_down = numpy.where(array1-array2 == white)[0]
        snow_in_next_layer.fill(0)
        snow_in_next_layer[indices_where_snow_moved_down] = white
        screen_array[j  ] = array1 - snow_in_next_layer
        screen_array[j+1] = array2 + snow_in_next_layer
    screen_array = numpy.transpose(screen_array)

def main():
    pygame.init()
    screen_surf = display.set_mode(RESOLUTION)
    screen_rect = screen_surf.get_rect()
    screen_array = pixels2d(screen_surf)
    lowest_level = init(screen_array)
    display.update(screen_rect)
    fallrng = xrange(lowest_level-2, -1, -1)
    white = int(0xffffff)
    black = 0
    c = Clock()
    while True:
        newsnow(screen_array, density=8)
        snowfall(screen_array)
        display.update(screen_rect)
        for e in event.get():
            type = e.type
            if type == QUIT:
                exit()
            elif type == KEYUP and e.key == K_ESCAPE:
                return
        c.tick()
        stdout.write('fps: ~%s\r' % round(c.get_fps()))
        stdout.flush()

if __name__ == '__main__':
    cProfile.runctx('main()', globals(), {'main': main}, 'last_gp3_stats')

--
paul victor noagbodji
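One direction worth trying, as a sketch: replace the per-row Python loop in
snowfall with a single whole-array step. This assumes a (HEIGHT, WIDTH)
integer view of the screen (for example screen_array.T, the same transposed
view snowfall already works on) in which 0 is empty sky; the green ground
then blocks flakes automatically, because its pixel value is neither 0 nor
white. Vertically stacked flakes behave slightly differently than in the
bottom-up loop (the upper flake waits one extra frame), and it is untested
against this exact setup:

import numpy

WHITE = 0xffffff

def snowfall_vec(screen):
    # screen: (HEIGHT, WIDTH) int array; every flake falls one row per call
    above = screen[:-1]                       # rows 0 .. H-2 (views)
    below = screen[1:]                        # rows 1 .. H-1
    falls = (above == WHITE) & (below == 0)   # flake with an empty cell below
    above[falls] = 0                          # the two updates can never touch
    below[falls] = WHITE                      # the same cell, so this is safe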
From jonathan.taylor at utoronto.ca Tue Aug 18 15:14:55 2009
From: jonathan.taylor at utoronto.ca (Jonathan Taylor)
Date: Tue, 18 Aug 2009 15:14:55 -0400
Subject: [Numpy-discussion] Strange crash in numpy.linalg.lstsq.
In-Reply-To: 
References: <463e11f90908171042u1457b5d6l6c70a46dbbd1418e@mail.gmail.com>
	<463e11f90908171243u2da6d368xcb390b6734e75fc6@mail.gmail.com>
	<463e11f90908171350k619b7b00md3867bf3faa3ad24@mail.gmail.com>
	<463e11f90908171413h427b7aecq5370c9cd6089523@mail.gmail.com>
	<463e11f90908171427g1257ed41yd72a9c6f813c42f2@mail.gmail.com>
Message-ID: <463e11f90908181214jc91216co5da23ba79e8276e5@mail.gmail.com>

Right... So I was able to get everything working finally. I am not 100%
sure how or why it works, though, so I am going to outline what I did
here for reference.

I first tried just using LAPACK 3.1.1 (since it seemed set up for g77
instead of gfortran, which I do not have). I compiled this to yield the
associated lapack and blas libraries, using the Fortran compiler settings
detailed on the scipy install web page. This actually gave me a numpy
with the same problems (hanging on numpy.test()). Thus I realized the
problem was LAPACK and not ATLAS.

Eventually I got numpy with LAPACK to work when I used the default
settings in the example config file of LAPACK instead of the suggested
settings on the numpy web site. I then compiled ATLAS and this worked as
well. It still seems a little bit weird that these settings can break the
software, though.

Thanks for the suggestions and I hope this helps someone.

Jonathan.

On Mon, Aug 17, 2009 at 5:34 PM, Keith Goodman wrote:
> [...]
> Yes, sorry. The only way I've tried doing it is uninstalling the
> Ubuntu ATLAS binary. I don't know how to ignore it if it is installed.

From markbak at gmail.com Wed Aug 19 08:25:50 2009
From: markbak at gmail.com (Mark Bakker)
Date: Wed, 19 Aug 2009 14:25:50 +0200
Subject: [Numpy-discussion] why does b[:-0] not work, and is there an
	elegant solution?
Message-ID: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>

Hello list,

I compute the index of the last term in an array that I need and call
the index n.

I can then call the array b as

b[:-n]

If I need all terms in the array, the logical syntax would be:

b[:-0]

but that doesn't work. Any reason why that has not been implemented? Any
elegant workaround?

Thanks, Mark

From sebastian.walter at gmail.com Wed Aug 19 08:48:32 2009
From: sebastian.walter at gmail.com (Sebastian Walter)
Date: Wed, 19 Aug 2009 14:48:32 +0200
Subject: [Numpy-discussion] why does b[:-0] not work, and is there an
	elegant solution?
In-Reply-To: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>
References: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>
Message-ID: 

I'm sure there is a better solution...:

In [1]: x = numpy.array([i for i in range(10)])
In [2]: foo = lambda n: -n if n!=0 else None
In [3]: x[:foo(1)]
Out[3]: array([0, 1, 2, 3, 4, 5, 6, 7, 8])
In [4]: x[:foo(0)]
Out[4]: array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

From lciti at essex.ac.uk Wed Aug 19 08:47:04 2009
From: lciti at essex.ac.uk (Citi, Luca)
Date: Wed, 19 Aug 2009 13:47:04 +0100
Subject: [Numpy-discussion] why does b[:-0] not work, and is there an
	elegant solution?
References: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>
Message-ID: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E77@sernt14.essex.ac.uk>

The problem is that n is an integer, and integers do not have different
representations for 0 and -0 (while floats do). Therefore it is
impossible to disambiguate the following two scenarios when n == 0:

>> b[:n]   # take the first n
>> b[:-n]  # take all but the last n

One possible solution (you decide whether it is elegant :-D):

>> b[:len(b)-n]

From nmb at wartburg.edu Wed Aug 19 08:50:46 2009
From: nmb at wartburg.edu (Neil Martinsen-Burrell)
Date: Wed, 19 Aug 2009 07:50:46 -0500
Subject: [Numpy-discussion] why does b[:-0] not work, and is there an
	elegant solution?
In-Reply-To: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>
References: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>
Message-ID: 

On Aug 19, 2009, at 7:25 AM, Mark Bakker wrote:
> [...]
> If I need all terms in the array, the logical syntax would be:
>
> b[:-0]
>
> but that doesn't work. Any reason why that has not been implemented?
> Any elegant workaround?

Because there is no negative zero as an integer:

>>> -0 == 0
True

So when Python parses your request, it sees "-0" and replaces that with
the integer 0. And as you found out, b[:0] gives you an empty slice.
A negative index -n is just syntactic sugar for N-n, where N is the
length of the list, and that works for n=0 as well:

>>> b = [1,2,3,4,5]
>>> b[:0]
[]
>>> b[:len(b)-0]
[1, 2, 3, 4, 5]

-Neil

From lciti at essex.ac.uk Wed Aug 19 08:54:28 2009
From: lciti at essex.ac.uk (Citi, Luca)
Date: Wed, 19 Aug 2009 13:54:28 +0100
Subject: [Numpy-discussion] why does b[:-0] not work, and is there an
	elegant solution?
References: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>
Message-ID: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E78@sernt14.essex.ac.uk>

Another solution (elegant?? readable??):

>> x[slice(-n or None)]  # with n == 0, 1, ...

From sebastian.walter at gmail.com Wed Aug 19 09:03:38 2009
From: sebastian.walter at gmail.com (Sebastian Walter)
Date: Wed, 19 Aug 2009 15:03:38 +0200
Subject: [Numpy-discussion] why does b[:-0] not work, and is there an
	elegant solution?
In-Reply-To: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E78@sernt14.essex.ac.uk>
References: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>
	<3DA3B328CBC48B4EBB88484B8A5EA19106AF9E78@sernt14.essex.ac.uk>
Message-ID: 

In [45]: x[: -0 or None]
Out[45]: array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

In [46]: x[: -1 or None]
Out[46]: array([0, 1, 2, 3, 4, 5, 6, 7, 8])

works fine without slice()

On Wed, Aug 19, 2009 at 2:54 PM, Citi, Luca wrote:
> Another solution (elegant?? readable??):
>>> x[slice(-n or None)]  # with n == 0, 1, ...

From markbak at gmail.com Wed Aug 19 09:28:06 2009
From: markbak at gmail.com (Mark Bakker)
Date: Wed, 19 Aug 2009 15:28:06 +0200
Subject: [Numpy-discussion] why does b[:-0] not work, and is there an
	elegant solution?
Message-ID: <6946b9500908190628k1220eac2gc1f0c1638dcf9105@mail.gmail.com>

The winner so far:

x[: -n or None]

works fine when n = 0; relatively elegant, even pretty slick I think. And
I expect it to be quick.

Thanks for all the replies,

Mark
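Wrapped up as a tiny helper, with the n == 0 edge case spelled out in a
comment (a small sketch):

def drop_last(b, n):
    # -0 is just 0, which is falsy, so `or` falls through to None,
    # and b[:None] is the whole sequence
    return b[:-n or None]

drop_last([1, 2, 3, 4, 5], 2)   # [1, 2, 3]
drop_last([1, 2, 3, 4, 5], 0)   # [1, 2, 3, 4, 5]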
From Nicolas.Rougier at loria.fr Wed Aug 19 16:45:49 2009
From: Nicolas.Rougier at loria.fr (Nicolas Rougier)
Date: Wed, 19 Aug 2009 22:45:49 +0200
Subject: [Numpy-discussion] Recipe: extract a sub-array using given shape,
	centered on given position
Message-ID: 

Hi,

I've coded a function that allows one to extract a contiguous sub-array
from another array, using a given shape and centered on a given position.
I did not find an equivalent within numpy, so I hope I did not miss it.

The only interest of the function is to guarantee that the resulting
sub-array will have the required shape. If some values are out of bounds,
the result array is padded with a fill value.

Hope it can be useful to someone.

Nicolas

Code:
-----

import numpy

def extract(Z, shape, position, fill=numpy.NaN):
    """ Extract a sub-array from Z using given shape and centered on
        position. If some part of the sub-array is out of Z bounds, the
        result is padded with fill value.

        **Parameters**
            `Z` : array_like
                Input array.
            `shape` : tuple
                Shape of the output array
            `position` : tuple
                Position within Z
            `fill` : scalar
                Fill value

        **Returns**
            `out` : array_like
                Z slice with given shape and center

        **Examples**

        >>> Z = numpy.arange(0,16).reshape((4,4))
        >>> extract(Z, shape=(3,3), position=(0,0))
        [[ NaN  NaN  NaN]
         [ NaN   0.   1.]
         [ NaN   4.   5.]]

        Schema:

            +-----------+
            | 0   0   0 |  =  extract (Z, shape=(3,3), position=(0,0))
            |   +---------------+
            | 0 | 0   1 | 2   3 |  =  Z
            |   |       |       |
            | 0 | 4   5 | 6   7 |
            +---|-------+       |
                | 8   9  10  11 |
                |               |
                | 12 13  14  15 |
                +---------------+

        >>> Z = numpy.arange(0,16).reshape((4,4))
        >>> extract(Z, shape=(3,3), position=(3,3))
        [[ 10.  11.  NaN]
         [ 14.  15.  NaN]
         [ NaN  NaN  NaN]]

        Schema:

            +---------------+
            | 0   1   2   3 |  =  Z
            |               |
            | 4   5   6   7 |
            |       +-----------+
            | 8   9 |10  11 | 0 |  =  extract (Z, shape=(3,3), position=(3,3))
            |       |       |   |
            | 12 13 |14  15 | 0 |
            +-------|-------+   |
                    | 0   0   0 |
                    +-----------+
    """
    # assert(len(position) == len(Z.shape))
    # if len(shape) < len(Z.shape):
    #     shape = shape + Z.shape[len(Z.shape)-len(shape):]

    R = numpy.ones(shape, dtype=Z.dtype)*fill
    P = numpy.array(list(position)).astype(int)
    Rs = numpy.array(list(R.shape)).astype(int)
    Zs = numpy.array(list(Z.shape)).astype(int)

    R_start = numpy.zeros((len(shape),)).astype(int)
    R_stop = numpy.array(list(shape)).astype(int)
    Z_start = (P-Rs//2)
    Z_stop = (P+Rs//2)+Rs%2

    R_start = (R_start - numpy.minimum(Z_start,0)).tolist()
    Z_start = (numpy.maximum(Z_start,0)).tolist()
    R_stop = (R_stop - numpy.maximum(Z_stop-Zs,0)).tolist()
    Z_stop = (numpy.minimum(Z_stop,Zs)).tolist()

    r = [slice(start,stop) for start,stop in zip(R_start,R_stop)]
    z = [slice(start,stop) for start,stop in zip(Z_start,Z_stop)]

    R[r] = Z[z]
    return R

Z = numpy.arange(0,16).reshape((4,4))
print Z
print
print extract(Z, shape=(3,3), position=(0,0))
print
print extract(Z, shape=(3,3), position=(3,3))

From oliphant at enthought.com Wed Aug 19 19:40:25 2009
From: oliphant at enthought.com (Travis Oliphant)
Date: Wed, 19 Aug 2009 18:40:25 -0500
Subject: [Numpy-discussion] why does b[:-0] not work, and is there an
	elegant solution?
In-Reply-To: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>
References: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>
Message-ID: 

Use N = len(b) and then b[:N-n]

-Travis

--
(mobile phone of)
Travis Oliphant
Enthought, Inc.
1-512-536-1057
http://www.enthought.com

On Aug 19, 2009, at 7:25 AM, Mark Bakker wrote:
> Hello list,
>
> I compute the index of the last term in an array that I need and call
> the index n.
>
> I can then call the array b as
>
> b[:-n]
>
> If I need all terms in the array, the logical syntax would be:
>
> b[:-0]
>
> but that doesn't work. Any reason why that has not been implemented?
> Any elegant workaround?
>
> Thanks, Mark

From cournape at gmail.com Wed Aug 19 21:22:48 2009
From: cournape at gmail.com (David Cournapeau)
Date: Wed, 19 Aug 2009 18:22:48 -0700
Subject: [Numpy-discussion] why does b[:-0] not work, and is there an
	elegant solution?
In-Reply-To: 
References: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>
Message-ID: <5b8d13220908191822q288bb9aew71fa8cb713bceb77@mail.gmail.com>

On Wed, Aug 19, 2009 at 5:50 AM, Neil Martinsen-Burrell wrote:
> [...]
> Because there is no negative zero as an integer:
>
>  >>> -0 == 0
> True

Not that it matters for the discussion, but -0.0 == 0.0:

x = np.array(np.PZERO)
y = np.array(np.NZERO)
y == x          # True
1 / x == 1 / y  # False: inf and negative inf

The only way to differentiate the number by itself is signbit,

cheers,

David
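In code, that distinction looks roughly like this (a short sketch; the
divisions may emit a divide-by-zero warning):

import numpy as np

x = np.array(np.PZERO)          # +0.0
y = np.array(np.NZERO)          # -0.0
x == y                          # True: IEEE 754 zeros compare equal
1.0 / x, 1.0 / y                # (inf, -inf): the sign surfaces here
np.signbit(x), np.signbit(y)    # (False, True): signbit tells them apart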
>
> I can then call the array b as
>
> b[:-n]
>
> If I need all terms in the array, the logical syntax would be:
>
> b[:-0]
>
> but that doesn't work. Any reason why that has not been implemented?
> Any elegant workaround?
>
> Thanks, Mark
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From cournape at gmail.com  Wed Aug 19 21:22:48 2009
From: cournape at gmail.com (David Cournapeau)
Date: Wed, 19 Aug 2009 18:22:48 -0700
Subject: [Numpy-discussion] why does b[:-0] not work, and is there an elegant solution?
In-Reply-To:
References: <6946b9500908190525p55333accy5fd1d9e29780815e@mail.gmail.com>
Message-ID: <5b8d13220908191822q288bb9aew71fa8cb713bceb77@mail.gmail.com>

On Wed, Aug 19, 2009 at 5:50 AM, Neil Martinsen-Burrell wrote:
> On Aug 19, 2009, at 7:25 AM, Mark Bakker wrote:
>> I compute the index of the last term in an array that I need and
>> call the index n.
>>
>> I can then call the array b as
>>
>> b[:-n]
>>
>> If I need all terms in the array, the logical syntax would be:
>>
>> b[:-0]
>>
>> but that doesn't work. Any reason why that has not been implemented?
>> Any elegant workaround?
>
> Because there is no negative zero as an integer:
>
> >>> -0 == 0
> True

Not that it matters for the discussion, but -0.0 == 0.0:

x = np.array(np.PZERO)
y = np.array(np.NZERO)
y == x # True
1 / x == 1 / y # False: inf and negative inf

The only way to differentiate the number by itself is signbit.

cheers,

David

From erik.tollerud at gmail.com  Thu Aug 20 03:37:07 2009
From: erik.tollerud at gmail.com (Erik Tollerud)
Date: Thu, 20 Aug 2009 00:37:07 -0700
Subject: [Numpy-discussion] Fwd: GPU Numpy
In-Reply-To: <4A7B62FC.6090504@molden.no>
References: <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com>
	<4A7B43CE.7050509@molden.no>
	<7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com>
	<4A7B5B06.2080909@molden.no> <4A7B62FC.6090504@molden.no>
Message-ID:

I realize this topic is a bit old, but I couldn't help but add
something I forgot to mention earlier...

>> I mean, once the computations are moved elsewhere numpy is basically a
>> convenient way to address memory.
>
> That is how I mostly use NumPy, though. Computations I often do in
> Fortran 95 or C.
>
> NumPy arrays on the GPU memory is an easy task. But then I would have to
> write the computation in OpenCL's dialect of C99? But I'd rather program
> everything in Python if I could. Details like GPU and OpenCL should be
> hidden away. Nice looking Python with NumPy is much easier to read and
> write. That is why I'd like to see a code generator (i.e. JIT compiler)
> for NumPy.

This is true to some extent, but also probably difficult to do given
the fact that parallelizable algorithms are generally more difficult
to formulate in straightforward ways. In the intermediate-term, I
think there is value in having numpy implement some sort of interface
to OpenCL or cuda - I can easily see an explosion of different
bindings (it's already starting), and having a "canonical" way encoded
in numpy or scipy is probably the best way to mitigate the inevitable
compatibility problems... I'm partial to the way pycuda can do it
(basically, just export numpy arrays to the GPU and let you write the
code from there), but the main point is to just get some basic
compatibility in pretty quickly, as I think this GPGPU is here to
stay...
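To make the pycuda route concrete, a rough sketch (this assumes a
CUDA-capable machine with the pycuda package installed, and mirrors
pycuda's own introductory example; none of it is part of numpy itself):

import numpy as np
import pycuda.autoinit            # sets up a CUDA context on import
import pycuda.gpuarray as gpuarray

a = np.random.randn(4, 4).astype(np.float32)
a_gpu = gpuarray.to_gpu(a)        # copy the NumPy array to GPU memory
a_doubled = (2 * a_gpu).get()     # elementwise work on the device,
                                  # then copy back as a NumPy array

np.allclose(a_doubled, 2 * a)     # True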
From faltet at pytables.org  Thu Aug 20 07:07:23 2009
From: faltet at pytables.org (Francesc Alted)
Date: Thu, 20 Aug 2009 13:07:23 +0200
Subject: [Numpy-discussion] Accelerating NumPy computations [Was: GPU Numpy]
In-Reply-To:
References: <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com>
	<4A7B43CE.7050509@molden.no>
	<7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com>
	<4A7B5B06.2080909@molden.no> <4A7B62FC.6090504@molden.no>
Message-ID: <1250766444.5546.40.camel@inspiron>

On Thu, 20 Aug 2009 at 00:37 -0700, Erik Tollerud wrote:
> > NumPy arrays on the GPU memory is an easy task. But then I would have to
> > write the computation in OpenCL's dialect of C99? But I'd rather program
> > everything in Python if I could. Details like GPU and OpenCL should be
> > hidden away. Nice looking Python with NumPy is much easier to read and
> > write. That is why I'd like to see a code generator (i.e. JIT compiler)
> > for NumPy.
>
> This is true to some extent, but also probably difficult to do given
> the fact that parallelizable algorithms are generally more difficult
> to formulate in straightforward ways. In the intermediate-term, I
> think there is value in having numpy implement some sort of interface
> to OpenCL or cuda - I can easily see an explosion of different
> bindings (it's already starting), and having a "canonical" way encoded
> in numpy or scipy is probably the best way to mitigate the inevitable
> compatibility problems... I'm partial to the way pycuda can do it
> (basically, just export numpy arrays to the GPU and let you write the
> code from there), but the main point is to just get some basic
> compatibility in pretty quickly, as I think this GPGPU is here to
> stay...

Maybe. However I think that we should not forget the fact that, as
Sturla pointed out, the main bottleneck for *many* problems nowadays is
memory access, not CPU speed. GPUs may have faster memory, but only a
few % better than mainstream memory.

I'd like to hear from anyone here who has achieved any kind of speed-up
in their calculations by using GPUs instead of CPUs. By looking at
these scenarios we may get an idea of where GPUs can be useful, and
whether driving an effort to support them in NumPy would be worth it.

I personally think that, in general, exposing GPU capabilities directly
to NumPy would provide little service for most NumPy users. I would
rather leave this task to specialized libraries (like PyCUDA, or
special versions of ATLAS, for example) that can be used from NumPy.

Until then, I think that a more direct approach (and one that would
deliver results earlier) for speeding up NumPy is to be aware of the
hierarchical nature of the different memory levels in current CPUs and
make NumPy play nicely with them. In that sense, I think that applying
the blocking technique (see [1] for a brief explanation) to take
advantage of both spatial and temporal locality is the way to go. For
example, most of the speed-up that Numexpr achieves comes from the fact
that it uses blocking during the evaluation of complex expressions.
This works because the temporaries are kept small and can fit in
current CPU caches. Implementing similar algorithms in NumPy should not
be that difficult, especially now that the Numexpr implementation
already exists as a model.
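A small sketch of that Numexpr point (assuming the numexpr package is
installed; evaluate() compiles the expression and processes the
operands in cache-sized blocks, so no array-sized temporaries are
created):

import numpy as np
import numexpr as ne

a = np.random.rand(1000000)
b = np.random.rand(1000000)

# plain NumPy materializes full-size temporaries for 2*a and 3*b
# before the final addition
c1 = 2*a + 3*b

# numexpr evaluates the whole expression block by block, keeping the
# working set inside the CPU cache
c2 = ne.evaluate("2*a + 3*b")

np.allclose(c1, c2)     # True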
And another thing that may further help to fight memory slowness (or
CPU/GPU quickness, as you prefer ;-) in the near future is compression.
Compression already helped bring data faster from disk to CPU over the
last 10 years, and now it is almost time for the same to happen with
memory, not only disk. In [1] I demonstrated that compression can
*already* help in getting data from memory to the CPU. Agreed, right
now this is only true for highly compressible data (which is an
important corner case anyway), but in the near future we will see how
the compression technique will be able to accelerate computations for a
wide variety of datasets, even if they are not very compressible. So,
in my humble opinion, making it possible for NumPy to deal with
compressed buffers in addition to uncompressed ones could be very
interesting in the near future (or even now, in specific situations).

[1] http://www.pytables.org/docs/StarvingCPUs.pdf

Francesc

From ralf.gommers at googlemail.com  Thu Aug 20 12:18:07 2009
From: ralf.gommers at googlemail.com (Ralf Gommers)
Date: Thu, 20 Aug 2009 12:18:07 -0400
Subject: [Numpy-discussion] what to do with chararray
Message-ID:

[this discussion moved here from the SciPy list]

On Thu, Aug 20, 2009 at 9:50 AM, Christopher Hanley wrote:
>
> Hi,
>
> I'd like to respectfully request that we move any discussion of what
> to do with the numpy.char module to the numpy list.
>
> I'm a little concerned about some of the assumptions that are being
> made about the number of users of the module.

My assumption about there being few users of chararray is based on the
absence of questions about it on the list, the lack of docs and the
apparent lack of a good use-case. Plus "who uses this?" has been asked
twice on the list, and in both cases you (Chris) were the only one
replying.

> I would also like to
> better understand the reasons for wanting to dump it. Let me be
> clear. I'm not opposed to change. However breaking other people's
> code just for the sake of change seems like a poor reason and a mean
> thing to do to our customers.

That would be a very poor reason, and I don't think that's the case.
The reason this question about the future of chararray has now been
asked twice is that people are trying to document it, and having
trouble. If it stays, it has to be documented (and the bugs found in
the process fixed), which costs time as well. Its mere presence will
also lead people to try to use it; if it then turns out to be useless
to them, it has wasted their time. This was the case for me.

It would be great if we could find a clear use-case and a reason why
chararray is in NumPy besides backwards compatibility. Otherwise it
should at least be documented as not for new development, and possibly
deprecated.

Finally, deprecation does not mean that the module disappears tomorrow.
It can stick around for years if needed (there are functions in fft,
for example, that have been deprecated for three years), while giving a
clear message to new users not to bother with it.

Best regards,
Ralf

>
> Thank you for your time and help,
> Chris
>
>
> --
> Christopher Hanley
> Senior Systems Software Engineer
> Space Telescope Science Institute
> 3700 San Martin Drive
> Baltimore MD, 21218
> (410) 338-4338
>
> On Aug 20, 2009, at 1:35 AM, Robert Kern wrote:
>
> > On Wed, Aug 19, 2009 at 20:03, David
> > Goldsmith wrote:
> >> I'm going to take it a step further: "breakage" is always the
> >> deterrent to change, and yet "change we must" (i.e., "adapt or
> >> die").
It's certainly not without precedent - even within Numpy, I > >> believe - for things (though perhaps not whole namespaces) to be > >> deemed "to-be-deprecated," have a warning to this effect > >> established in one x.[even #].0 release, and then be removed by the > >> x.[even # + 2 or + 4].0 release. How has deprecation in Numpy > >> worked in the past - by dictum, vote, or consensus? > > > > Consensus or dictum without major objection. Voting is pointless > > except to inform one of those. > > > > -- > > Robert Kern > > > > "I have come to believe that the whole world is an enigma, a harmless > > enigma that is made terrible by our own mad attempt to interpret it as > > though it had an underlying truth." > > -- Umberto Eco > > _______________________________________________ > > Scipy-dev mailing list > > Scipy-dev at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-dev > > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Thu Aug 20 12:23:27 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 20 Aug 2009 09:23:27 -0700 Subject: [Numpy-discussion] what to do with chararray In-Reply-To: References: Message-ID: <3d375d730908200923t27c04c3fn124430dd7440580b@mail.gmail.com> On Thu, Aug 20, 2009 at 09:18, Ralf Gommers wrote: > [this discussion moved here from the SciPy list] > > On Thu, Aug 20, 2009 at 9:50 AM, Christopher Hanley > wrote: >> >> Hi, >> >> I'd like to respectfully request that we move any discussion of what >> to do with the numpy.char module to the numpy list. >> >> I'm a little concerned about some of the assumptions that are being >> made about the number of users of the module. > > My assumption about there being few users of chararray is based on the > absence of questions about it being asked on the list, the lack of docs and > the apparent lack of a good use-case. Plus "who uses this?" has been asked > twice on the list, and in both cases you (Chris) were the only one replying. In particular, Chris, do you know anyone who uses chararray? Do you think you could convince them to write a few docstrings or contribute a couple of examples? Does anyone know anyone who uses chararray? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From d_l_goldsmith at yahoo.com Thu Aug 20 12:27:45 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Thu, 20 Aug 2009 09:27:45 -0700 (PDT) Subject: [Numpy-discussion] Deprecate chararray [was [SciPy-dev] Plea for help] Message-ID: <857977.74958.qm@web52106.mail.re2.yahoo.com> --- On Thu, 8/20/09, Christopher Hanley wrote: > I'd like to respectfully request that we move any > discussion of what? > to do with the numpy.char module to the numpy list. NP, done. > I'm a little concerned about some of the assumptions that > are being? > made about the number of users of the module.? I would > also like to? > better understand the reasons for wanting to dump it.? 
I think Ralf did a pretty good job of synopsizing the reasons for deprecation, but since we're moving the thread, I'll reprint them here: 0) "it gets very little use" (an assumption you presumably dispute); 1) "is pretty much undocumented" (less true than a week ago, but still true for several of the attributes, with another handful or so falling into the category of "poorly documented"); 2) "probably more buggy than most other parts of NumPy" ("probably" being a euphemism, IMO); 3) "there is not a really good use-case for it" (a conjecture, but one that has yet to be challenged by counter-example); 4) it's not the first time its presence in NumPy has been questioned ("as Stefan pointed out when asking this same question last year") 5) NumPy already has a (perhaps superior) alternative ("object arrays would do nicely if one needs this functionality"); to which I'll add: 6) it is, on its face, "counter to the spirit" of NumPy. So far, IIRC, the only reason in favor of its continued inclusion is inertia. > Let me be? > clear.? I'm not opposed to change.? However > breaking other people's? > code just for the sake of change seems like a poor reason So, I don't think we're proposing this "just for the sake of change" > and a mean? > thing to do to our customers. Apologies, but it is not proposed maliciously. The only other things I would add by way of "review" from the scipy-dev thread: a compromise proposal (made by Ralf): "Put clearly in the docs that this module exists for backwards compatibility reasons, and is not recommended for new development" and a clarification of deprecation process (provided by Robert): "[asked by the present author] How has deprecation in Numpy worked in the past - by dictum, vote, or consensus? [Robert's answer] Consensus or dictum without major objection. Voting is pointless except to inform one of those." Thanks for your time and consideration. David Goldsmith > Thank you for your time and help, > Chris > > > -- > Christopher Hanley > Senior Systems Software Engineer > Space Telescope Science Institute > 3700 San Martin Drive > Baltimore MD, 21218 > (410) 338-4338 > > On Aug 20, 2009, at 1:35 AM, Robert Kern wrote: > > > On Wed, Aug 19, 2009 at 20:03, David? > > Goldsmith > wrote: > >> I'm going to take it a step further: "breakage" is > always the? > >> deterrent to change, and yet "change we must" > (i.e., "adapt or? > >> die").? It's certainly not without precedent > - even within Numpy, I? > >> believe - for things (though perhaps not whole > namespaces) to be? > >> deemed "to-be-deprecated," have a warning to this > effect? > >> established in one x.[even #].0 release, and then > be removed by the? > >> x.[even # + 2 or + 4].0 release.? How has > deprecation in Numpy? > >> worked in the past - by dictum, vote, or > consensus? > > > > Consensus or dictum without major objection. Voting is > pointless > > except to inform one of those. > > > > -- > > Robert Kern > > > > "I have come to believe that the whole world is an > enigma, a harmless > > enigma that is made terrible by our own mad attempt to > interpret it as > > though it had an underlying truth." > >? 
-- Umberto Eco > > _______________________________________________ > > Scipy-dev mailing list > > Scipy-dev at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-dev > > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From chanley at stsci.edu Thu Aug 20 13:32:08 2009 From: chanley at stsci.edu (Christopher Hanley) Date: Thu, 20 Aug 2009 13:32:08 -0400 Subject: [Numpy-discussion] Deprecate chararray [was [SciPy-dev] Plea for help] In-Reply-To: <857977.74958.qm@web52106.mail.re2.yahoo.com> References: <857977.74958.qm@web52106.mail.re2.yahoo.com> Message-ID: <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> Here is what I know about the chararray usage at STScI since first looking into it this morning. It is used in PyFITS and within the COS instrument calibration code. I have not heard back from the other projects yet given most of our developers are away at this time. It appears that the COS code can be changed easily. I am waiting to hear back from PyFITs. Also, I do not know how many people use this particular feature. However I would point out that many people who use numpy are not also on the mailing lists. Most of the STScI do not follow the numpy list. I serve as our point of contact to the numpy community. I'm trying to gather a list of projects that use this feature and specific use cases for you. As I do not use this module myself I cannot counter your arguments at this time. If we decide to deprecate this module would we reverse this decision if we then find out that the assumptions that went into the decision were in error? Another concern is that we told people coming from numarray to use this module. It is my opinion that at this point in the numpy release cycle that an API change needs a very strong justification. Anecdotes about the number of users, a "change or die" philosophy, and an un- articulated notion of "the spirit of numpy" do not in my consideration meet that high bar. If you would like us to provide additional documentation and tests that would be possible. I'll do it myself if that is the only think keeping the module from remaining in numpy. This also raises the question of what else is going to go? Will recarray be removed? What about the numarray c-api compatibility layer? Like I said earlier, I'm not opposed to change. I am just of the opinion that this isn't a simple, cut and dry decision. For those at SciPy 2009 feel free to come yell at me and beat me with sticks. I'm the fat guy in jeans and a blue shirt sitting towards the back middle on the left. Cheers, Chris -- Christopher Hanley Senior Systems Software Engineer Space Telescope Science Institute 3700 San Martin Drive Baltimore MD, 21218 (410) 338-4338 On Aug 20, 2009, at 12:27 PM, David Goldsmith wrote: > --- On Thu, 8/20/09, Christopher Hanley wrote: > >> I'd like to respectfully request that we move any >> discussion of what >> to do with the numpy.char module to the numpy list. > > NP, done. > >> I'm a little concerned about some of the assumptions that >> are being >> made about the number of users of the module. I would >> also like to >> better understand the reasons for wanting to dump it. 
> > I think Ralf did a pretty good job of synopsizing the reasons for > deprecation, but since we're moving the thread, I'll reprint them > here: > > 0) "it gets very little use" (an assumption you presumably dispute); > > 1) "is pretty much undocumented" (less true than a week ago, but > still true for several of the attributes, with another handful or so > falling into the category of "poorly documented"); > > 2) "probably more buggy than most other parts of NumPy" ("probably" > being a euphemism, IMO); > > 3) "there is not a really good use-case for it" (a conjecture, but > one that has yet to be challenged by counter-example); > > 4) it's not the first time its presence in NumPy has been questioned > ("as Stefan pointed out when asking this same question last year") > > 5) NumPy already has a (perhaps superior) alternative ("object > arrays would do nicely if one needs this functionality"); > > to which I'll add: > > 6) it is, on its face, "counter to the spirit" of NumPy. > > So far, IIRC, the only reason in favor of its continued inclusion is > inertia. > >> Let me be >> clear. I'm not opposed to change. However >> breaking other people's >> code just for the sake of change seems like a poor reason > > So, I don't think we're proposing this "just for the sake of change" > >> and a mean >> thing to do to our customers. > > Apologies, but it is not proposed maliciously. > > The only other things I would add by way of "review" from the scipy- > dev thread: > > a compromise proposal (made by Ralf): > > "Put clearly in the docs that this module exists for backwards > compatibility reasons, and is not recommended for new development" > > and a clarification of deprecation process (provided by Robert): > > "[asked by the present author] How has deprecation in Numpy worked > in the past - by dictum, vote, or consensus? > > [Robert's answer] Consensus or dictum without major objection. > Voting is pointless except to inform one of those." > > Thanks for your time and consideration. > > David Goldsmith > >> Thank you for your time and help, >> Chris >> >> >> -- >> Christopher Hanley >> Senior Systems Software Engineer >> Space Telescope Science Institute >> 3700 San Martin Drive >> Baltimore MD, 21218 >> (410) 338-4338 >> >> On Aug 20, 2009, at 1:35 AM, Robert Kern wrote: >> >>> On Wed, Aug 19, 2009 at 20:03, David >>> Goldsmith >> wrote: >>>> I'm going to take it a step further: "breakage" is >> always the >>>> deterrent to change, and yet "change we must" >> (i.e., "adapt or >>>> die"). It's certainly not without precedent >> - even within Numpy, I >>>> believe - for things (though perhaps not whole >> namespaces) to be >>>> deemed "to-be-deprecated," have a warning to this >> effect >>>> established in one x.[even #].0 release, and then >> be removed by the >>>> x.[even # + 2 or + 4].0 release. How has >> deprecation in Numpy >>>> worked in the past - by dictum, vote, or >> consensus? >>> >>> Consensus or dictum without major objection. Voting is >> pointless >>> except to inform one of those. >>> >>> -- >>> Robert Kern >>> >>> "I have come to believe that the whole world is an >> enigma, a harmless >>> enigma that is made terrible by our own mad attempt to >> interpret it as >>> though it had an underlying truth." 
>>> -- Umberto Eco >>> _______________________________________________ >>> Scipy-dev mailing list >>> Scipy-dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> _______________________________________________ >> Scipy-dev mailing list >> Scipy-dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From cournape at gmail.com Thu Aug 20 15:04:10 2009 From: cournape at gmail.com (David Cournapeau) Date: Thu, 20 Aug 2009 12:04:10 -0700 Subject: [Numpy-discussion] [SciPy-dev] Deprecate chararray [was Plea for help] In-Reply-To: <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> References: <857977.74958.qm@web52106.mail.re2.yahoo.com> <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> Message-ID: <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> On Thu, Aug 20, 2009 at 10:32 AM, Christopher Hanley wrote: > > Another concern is that we told people coming from numarray to use > this module. ?It is my opinion that at this point in the numpy release > cycle that an API change needs a very strong justification. ?Anecdotes > about the number of users, a "change or die" philosophy, and an un- > articulated notion of ?"the spirit of numpy" ?do not in my > consideration meet that high bar. I agree those are not strong reasons without more backing. What worries me the most, in both numpy and scipy, is code that nobody knows about, without any test or documentation. When it breaks, we can't fix it. That's unsustainable in the long term, because it takes a lot of time that people could spend somewhere else more useful. Especially when you have C code which does not work on some platforms, with new version of python (python 3k port, for example). I much prefer removing code to having code that barely works and cannot be maintained. Old code that people are ready to maintain, I have nothing against. cheers, David From sccolbert at gmail.com Thu Aug 20 16:52:08 2009 From: sccolbert at gmail.com (Chris Colbert) Date: Thu, 20 Aug 2009 16:52:08 -0400 Subject: [Numpy-discussion] nosetests and permissions Message-ID: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com> when I build numpy from source via: python setup.py build sudo python setup.py install the nosetests fail because of permissions: In [5]: np.test() Running unit tests for numpy NumPy version 1.3.0 NumPy is installed in /usr/local/lib/python2.6/dist-packages/numpy Python version 2.6.2 (release26-maint, Apr 19 2009, 01:58:18) [GCC 4.3.3] nose version 0.10.4 ---------------------------------------------------------------------- Ran 0 tests in 0.007s OK Out[5]: The problem I'm running into is I can't do a blanket chmod 664 *.py on the numpy directory because that breaks things. And since I don't which files are nosetests, it's very difficult to change by hand. Is there a workaround for this, or would it more appropriate for the numpy build script to set the permissions of the test file accordingly? 
Cheers, Chris From robert.kern at gmail.com Thu Aug 20 16:58:48 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 20 Aug 2009 13:58:48 -0700 Subject: [Numpy-discussion] nosetests and permissions In-Reply-To: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com> References: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com> Message-ID: <3d375d730908201358p3e57b8f8y9ba0cfd9946ca9a4@mail.gmail.com> On Thu, Aug 20, 2009 at 13:52, Chris Colbert wrote: > when I build numpy from source via: > > python setup.py build > sudo python setup.py install > > > the nosetests fail because of permissions: What permissions do your files have? If they're not readable for whatever reason, you would be SOL no matter what. The only fixable issue I am aware of is that nosetests does not like to collect tests in executable files (nose 0.11 has an option to permit that). However, I don't know why such a standard installation would do either of those things. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From kwgoodman at gmail.com Thu Aug 20 17:03:38 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Thu, 20 Aug 2009 14:03:38 -0700 Subject: [Numpy-discussion] nosetests and permissions In-Reply-To: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com> References: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com> Message-ID: On Thu, Aug 20, 2009 at 1:52 PM, Chris Colbert wrote: > when I build numpy from source via: > > python setup.py build > sudo python setup.py install > > > the nosetests fail because of permissions: > > In [5]: np.test() > Running unit tests for numpy > NumPy version 1.3.0 > NumPy is installed in /usr/local/lib/python2.6/dist-packages/numpy > Python version 2.6.2 (release26-maint, Apr 19 2009, 01:58:18) [GCC 4.3.3] > nose version 0.10.4 > > ---------------------------------------------------------------------- > Ran 0 tests in 0.007s > > OK > Out[5]: > > > The problem I'm running into is I can't do a blanket chmod 664 *.py on > the numpy directory because that breaks things. And since I don't > which files are nosetests, it's very difficult to change by hand. > > Is there a workaround for this, or would it more appropriate for the > numpy build script to set the permissions of the test file > accordingly? Works for me. But my numpy is in the site-packages directory. Did you move it to dist-packages? >> np.test() Running unit tests for numpy NumPy version 1.3.0 NumPy is installed in /usr/local/lib/python2.6/site-packages/numpy Python version 2.6.2 (release26-maint, Apr 19 2009, 01:58:18) [GCC 4.3.3] nose version 0.11.1 [snip] ---------------------------------------------------------------------- Ran 2030 tests in 5.033s OK (KNOWNFAIL=1, SKIP=11) From sccolbert at gmail.com Thu Aug 20 17:06:28 2009 From: sccolbert at gmail.com (Chris Colbert) Date: Thu, 20 Aug 2009 17:06:28 -0400 Subject: [Numpy-discussion] nosetests and permissions In-Reply-To: References: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com> Message-ID: <7f014ea60908201406p30003988l383cf0a5c00433ef@mail.gmail.com> the issue is that the files are executable. I have no idea why they are set that way either. This is numpy 1.3.0 built from source. the default install location for setup.py install is the local dist-packages. So that's where it is. 
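One blunt workaround, given Robert's point that nose (before 0.11)
skips executable files: clear the execute bits on the installed .py
files only. A rough sketch, assuming the dist-packages path above (run
it with the same privileges used to install):

import os
import stat

pkgdir = "/usr/local/lib/python2.6/dist-packages/numpy"  # adjust to your install
for root, dirs, files in os.walk(pkgdir):
    for name in files:
        if not name.endswith(".py"):
            continue
        path = os.path.join(root, name)
        mode = os.stat(path).st_mode
        # plain modules never need the execute bit, and nose (< 0.11)
        # refuses to collect tests from executable files
        os.chmod(path, mode & ~(stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH))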
On Thu, Aug 20, 2009 at 5:03 PM, Keith Goodman wrote: > On Thu, Aug 20, 2009 at 1:52 PM, Chris Colbert wrote: >> when I build numpy from source via: >> >> python setup.py build >> sudo python setup.py install >> >> >> the nosetests fail because of permissions: >> >> In [5]: np.test() >> Running unit tests for numpy >> NumPy version 1.3.0 >> NumPy is installed in /usr/local/lib/python2.6/dist-packages/numpy >> Python version 2.6.2 (release26-maint, Apr 19 2009, 01:58:18) [GCC 4.3.3] >> nose version 0.10.4 >> >> ---------------------------------------------------------------------- >> Ran 0 tests in 0.007s >> >> OK >> Out[5]: >> >> >> The problem I'm running into is I can't do a blanket chmod 664 *.py on >> the numpy directory because that breaks things. And since I don't >> which files are nosetests, it's very difficult to change by hand. >> >> Is there a workaround for this, or would it more appropriate for the >> numpy build script to set the permissions of the test file >> accordingly? > > Works for me. But my numpy is in the site-packages directory. Did you > move it to dist-packages? > >>> np.test() > Running unit tests for numpy > NumPy version 1.3.0 > NumPy is installed in /usr/local/lib/python2.6/site-packages/numpy > Python version 2.6.2 (release26-maint, Apr 19 2009, 01:58:18) [GCC 4.3.3] > nose version 0.11.1 > [snip] > ---------------------------------------------------------------------- > Ran 2030 tests in 5.033s > > OK (KNOWNFAIL=1, SKIP=11) > ? > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From sccolbert at gmail.com Thu Aug 20 17:06:51 2009 From: sccolbert at gmail.com (Chris Colbert) Date: Thu, 20 Aug 2009 17:06:51 -0400 Subject: [Numpy-discussion] nosetests and permissions In-Reply-To: <7f014ea60908201406p30003988l383cf0a5c00433ef@mail.gmail.com> References: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com> <7f014ea60908201406p30003988l383cf0a5c00433ef@mail.gmail.com> Message-ID: <7f014ea60908201406q3a6ba727t78e8c6fce6cf4d20@mail.gmail.com> this happens with scipy too... On Thu, Aug 20, 2009 at 5:06 PM, Chris Colbert wrote: > the issue is that the files are executable. I have no idea why they > are set that way either. This is numpy 1.3.0 built from source. > > the default install location for setup.py install is the local > dist-packages. So that's where it is. > > > > On Thu, Aug 20, 2009 at 5:03 PM, Keith Goodman wrote: >> On Thu, Aug 20, 2009 at 1:52 PM, Chris Colbert wrote: >>> when I build numpy from source via: >>> >>> python setup.py build >>> sudo python setup.py install >>> >>> >>> the nosetests fail because of permissions: >>> >>> In [5]: np.test() >>> Running unit tests for numpy >>> NumPy version 1.3.0 >>> NumPy is installed in /usr/local/lib/python2.6/dist-packages/numpy >>> Python version 2.6.2 (release26-maint, Apr 19 2009, 01:58:18) [GCC 4.3.3] >>> nose version 0.10.4 >>> >>> ---------------------------------------------------------------------- >>> Ran 0 tests in 0.007s >>> >>> OK >>> Out[5]: >>> >>> >>> The problem I'm running into is I can't do a blanket chmod 664 *.py on >>> the numpy directory because that breaks things. And since I don't >>> which files are nosetests, it's very difficult to change by hand. >>> >>> Is there a workaround for this, or would it more appropriate for the >>> numpy build script to set the permissions of the test file >>> accordingly? >> >> Works for me. 
But my numpy is in the site-packages directory. Did you >> move it to dist-packages? >> >>>> np.test() >> Running unit tests for numpy >> NumPy version 1.3.0 >> NumPy is installed in /usr/local/lib/python2.6/site-packages/numpy >> Python version 2.6.2 (release26-maint, Apr 19 2009, 01:58:18) [GCC 4.3.3] >> nose version 0.11.1 >> [snip] >> ---------------------------------------------------------------------- >> Ran 2030 tests in 5.033s >> >> OK (KNOWNFAIL=1, SKIP=11) >> ? >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > From robert.kern at gmail.com Thu Aug 20 17:09:04 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 20 Aug 2009 14:09:04 -0700 Subject: [Numpy-discussion] nosetests and permissions In-Reply-To: <7f014ea60908201406p30003988l383cf0a5c00433ef@mail.gmail.com> References: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com> <7f014ea60908201406p30003988l383cf0a5c00433ef@mail.gmail.com> Message-ID: <3d375d730908201409x73d6596r7dd1c6a6ff55dc68@mail.gmail.com> On Thu, Aug 20, 2009 at 14:06, Chris Colbert wrote: > the issue is that the files are executable. I have no idea why they > are set that way either. This is numpy 1.3.0 built from source. Are you sure that those are exactly the commands that you executed? You didn't invoke setuptools in any way? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From sccolbert at gmail.com Thu Aug 20 17:13:33 2009 From: sccolbert at gmail.com (Chris Colbert) Date: Thu, 20 Aug 2009 17:13:33 -0400 Subject: [Numpy-discussion] nosetests and permissions In-Reply-To: <3d375d730908201409x73d6596r7dd1c6a6ff55dc68@mail.gmail.com> References: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com> <7f014ea60908201406p30003988l383cf0a5c00433ef@mail.gmail.com> <3d375d730908201409x73d6596r7dd1c6a6ff55dc68@mail.gmail.com> Message-ID: <7f014ea60908201413o2a3811a3ode83bfaf08cf8876@mail.gmail.com> nope. I build Atlas, and modified site.cfg to find those libs in /usr/local/lib/atlas/ then i did: python setup.py build sudo python setup.py install that's it. On Thu, Aug 20, 2009 at 5:09 PM, Robert Kern wrote: > On Thu, Aug 20, 2009 at 14:06, Chris Colbert wrote: >> the issue is that the files are executable. I have no idea why they >> are set that way either. This is numpy 1.3.0 built from source. > > Are you sure that those are exactly the commands that you executed? > You didn't invoke setuptools in any way? > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." 
> ?-- Umberto Eco > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From robert.kern at gmail.com Thu Aug 20 17:15:06 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 20 Aug 2009 14:15:06 -0700 Subject: [Numpy-discussion] nosetests and permissions In-Reply-To: <7f014ea60908201413o2a3811a3ode83bfaf08cf8876@mail.gmail.com> References: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com> <7f014ea60908201406p30003988l383cf0a5c00433ef@mail.gmail.com> <3d375d730908201409x73d6596r7dd1c6a6ff55dc68@mail.gmail.com> <7f014ea60908201413o2a3811a3ode83bfaf08cf8876@mail.gmail.com> Message-ID: <3d375d730908201415o140389d0rbdcd5bb69fa11fdd@mail.gmail.com> On Thu, Aug 20, 2009 at 14:13, Chris Colbert wrote: > nope. > > I build Atlas, and modified site.cfg to find those libs in /usr/local/lib/atlas/ > > then i did: > > python setup.py build > sudo python setup.py install > > that's it. Huh. I don't know. Are the source files executable? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From cournape at gmail.com Thu Aug 20 18:33:26 2009 From: cournape at gmail.com (David Cournapeau) Date: Thu, 20 Aug 2009 15:33:26 -0700 Subject: [Numpy-discussion] nosetests and permissions In-Reply-To: <7f014ea60908201406p30003988l383cf0a5c00433ef@mail.gmail.com> References: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com> <7f014ea60908201406p30003988l383cf0a5c00433ef@mail.gmail.com> Message-ID: <5b8d13220908201533l1db7bfaem9f61874cb76bc73e@mail.gmail.com> On Thu, Aug 20, 2009 at 2:06 PM, Chris Colbert wrote: > the issue is that the files are executable. I have no idea why they > are set that way either. This is numpy 1.3.0 built from source. Which sources are you using ? The tarball on sourceforge, from svn, etc... ? cheers, David From chanley at stsci.edu Thu Aug 20 19:43:27 2009 From: chanley at stsci.edu (Christopher Hanley) Date: Thu, 20 Aug 2009 19:43:27 -0400 Subject: [Numpy-discussion] Removing scipy.stsci was [Re: [SciPy-dev] Deprecate chararray [was Plea for help]] In-Reply-To: <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> References: <857977.74958.qm@web52106.mail.re2.yahoo.com> <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> Message-ID: I agree with David's comments. In that theme I have removed scipy.stsci from scipy. Users get it directly from us at STScI via STSCI_PYTHON. It doesn't have any documentation in the doc system. It isn't by default in the scipy namespace. And as a recent bug report indicates they can't import it anyway. That should clean some code up. If someday a generic image processing library is added to scipy we can consider incorporating our modules back into scipy. Until that time I would rather remove the redundancy. It also help scipy's maintainability and frees me from having to worry about a fork in the code developing. Cheers, Chris -- Christopher Hanley Senior Systems Software Engineer Space Telescope Science Institute 3700 San Martin Drive Baltimore MD, 21218 (410) 338-4338 On Aug 20, 2009, at 3:04 PM, David Cournapeau wrote: > On Thu, Aug 20, 2009 at 10:32 AM, Christopher > Hanley wrote: > > I agree those are not strong reasons without more backing. 
What > worries me the most, in both numpy and scipy, is code that nobody > knows about, without any test or documentation. When it breaks, we > can't fix it. That's unsustainable in the long term, because it takes > a lot of time that people could spend somewhere else more useful. > Especially when you have C code which does not work on some platforms, > with new version of python (python 3k port, for example). > > I much prefer removing code to having code that barely works and > cannot be maintained. Old code that people are ready to maintain, I > have nothing against. > > cheers, > > David > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From stefan at sun.ac.za Thu Aug 20 19:48:21 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Thu, 20 Aug 2009 16:48:21 -0700 Subject: [Numpy-discussion] Removing scipy.stsci was [Re: [SciPy-dev] Deprecate chararray [was Plea for help]] In-Reply-To: References: <857977.74958.qm@web52106.mail.re2.yahoo.com> <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> Message-ID: <9457e7c80908201648r474297fo1d6ad2061c240fa6@mail.gmail.com> Hi Chris 2009/8/20 Christopher Hanley : > That should clean some code up. ?If someday a generic image processing > library is added to scipy we can consider incorporating our modules > back into scipy. ?Until that time I would rather remove the > redundancy. ?It also help scipy's maintainability and frees me from > having to worry about a fork in the code developing. We'll be spriting on an Image Processing Scikit this weekend. If you have any functions you'd like to include, let me know. Regards St?fan From chanley at stsci.edu Thu Aug 20 19:51:45 2009 From: chanley at stsci.edu (Christopher Hanley) Date: Thu, 20 Aug 2009 19:51:45 -0400 Subject: [Numpy-discussion] Removing scipy.stsci was [Re: [SciPy-dev] Deprecate chararray [was Plea for help]] In-Reply-To: <9457e7c80908201648r474297fo1d6ad2061c240fa6@mail.gmail.com> References: <857977.74958.qm@web52106.mail.re2.yahoo.com> <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> <9457e7c80908201648r474297fo1d6ad2061c240fa6@mail.gmail.com> Message-ID: <88B0EA8C-5637-4050-AC4F-023C6D41BB3A@stsci.edu> On Aug 20, 2009, at 7:48 PM, St?fan van der Walt wrote: Hi Stefan, > We'll be spriting on an Image Processing Scikit this weekend. If you > have any functions you'd like to include, let me know. > > Regards > St?fan Will the Image Processing Scikit be dedicated to working with a single image or stacks of images? Cheers, Chris From chanley at stsci.edu Thu Aug 20 19:57:43 2009 From: chanley at stsci.edu (Christopher Hanley) Date: Thu, 20 Aug 2009 19:57:43 -0400 Subject: [Numpy-discussion] Removing scipy.stsci was [Re: [SciPy-dev] Deprecate chararray [was Plea for help]] In-Reply-To: <88B0EA8C-5637-4050-AC4F-023C6D41BB3A@stsci.edu> References: <857977.74958.qm@web52106.mail.re2.yahoo.com> <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> <9457e7c80908201648r474297fo1d6ad2061c240fa6@mail.gmail.com> <88B0EA8C-5637-4050-AC4F-023C6D41BB3A@stsci.edu> Message-ID: Hi Stefan, Never mind. I just found the Sprint website and read the description. I'm sorry I hadn't found this sooner. I would have made plans to stay and help. My apologizes. 
Sorry, Chris -- Christopher Hanley Senior Systems Software Engineer Space Telescope Science Institute 3700 San Martin Drive Baltimore MD, 21218 (410) 338-4338 On Aug 20, 2009, at 7:51 PM, Christopher Hanley wrote: > On Aug 20, 2009, at 7:48 PM, St?fan van der Walt wrote: > > Hi Stefan, > >> We'll be spriting on an Image Processing Scikit this weekend. If you >> have any functions you'd like to include, let me know. >> >> Regards >> St?fan > > Will the Image Processing Scikit be dedicated to working with a > single image or stacks of images? > > Cheers, > Chris > > From jturner at gemini.edu Thu Aug 20 20:04:22 2009 From: jturner at gemini.edu (James Turner) Date: Thu, 20 Aug 2009 20:04:22 -0400 Subject: [Numpy-discussion] Removing scipy.stsci was [Re: [SciPy-dev] Deprecate chararray [was Plea for help]] In-Reply-To: References: <857977.74958.qm@web52106.mail.re2.yahoo.com> <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> <9457e7c80908201648r474297fo1d6ad2061c240fa6@mail.gmail.com> <88B0EA8C-5637-4050-AC4F-023C6D41BB3A@stsci.edu> Message-ID: <4A8DE486.9040806@gemini.edu> Hi Chris & Stefan, I will be around for most of the weekend (as I believe will Perry). I'm not sure I'll be able to contribute a lot to coding, but if there's any stuff you want to co-ordinate between STScI and Stefan's scikit, let me know if I can help. That's probably about the most useful thing I could do. Cheers, James. > Hi Stefan, > > Never mind. I just found the Sprint website and read the > description. I'm sorry I hadn't found this sooner. I would have made > plans to stay and help. My apologizes. > > Sorry, > Chris From oliphant at enthought.com Thu Aug 20 20:06:02 2009 From: oliphant at enthought.com (Travis Oliphant) Date: Thu, 20 Aug 2009 19:06:02 -0500 Subject: [Numpy-discussion] Deprecate chararray [was Plea for help] In-Reply-To: <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> References: <857977.74958.qm@web52106.mail.re2.yahoo.com> <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> Message-ID: <3FE3D509-1893-4A23-84CA-0177FEDC2CD8@enthought.com> On Aug 20, 2009, at 2:04 PM, David Cournapeau wrote: > On Thu, Aug 20, 2009 at 10:32 AM, Christopher > Hanley wrote: > >> >> Another concern is that we told people coming from numarray to use >> this module. It is my opinion that at this point in the numpy >> release >> cycle that an API change needs a very strong justification. >> Anecdotes >> about the number of users, a "change or die" philosophy, and an un- >> articulated notion of "the spirit of numpy" do not in my >> consideration meet that high bar. > > I agree those are not strong reasons without more backing. What > worries me the most, in both numpy and scipy, is code that nobody > knows about, without any test or documentation. When it breaks, we > can't fix it. That's unsustainable in the long term, because it takes > a lot of time that people could spend somewhere else more useful. > Especially when you have C code which does not work on some platforms, > with new version of python (python 3k port, for example). The claim that "chararray" is not understood is not true. I know about the code. It's not that difficult of a piece of code. It's utility can be questioned, but it was and may still be an important part of Numarray compatibilty. I was asked a question that I didn't have time to answer properly (it would have taken more than 5 minutes). 
That lack of answer does not constitute "nobody knows about" the code. It's a great idea to add to the docstring that the code was created for compatibility with numarray. But, I'm not convinced by any of the given reasons to remove the code from NumPy. -Travis -- Travis Oliphant Enthought Inc. 1-512-536-1057 http://www.enthought.com oliphant at enthought.com From cournape at gmail.com Thu Aug 20 20:14:25 2009 From: cournape at gmail.com (David Cournapeau) Date: Thu, 20 Aug 2009 17:14:25 -0700 Subject: [Numpy-discussion] Deprecate chararray [was Plea for help] In-Reply-To: <3FE3D509-1893-4A23-84CA-0177FEDC2CD8@enthought.com> References: <857977.74958.qm@web52106.mail.re2.yahoo.com> <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> <3FE3D509-1893-4A23-84CA-0177FEDC2CD8@enthought.com> Message-ID: <5b8d13220908201714k2750b975qeeaaa959df1c7e17@mail.gmail.com> On Thu, Aug 20, 2009 at 5:06 PM, Travis Oliphant wrote: > > On Aug 20, 2009, at 2:04 PM, David Cournapeau wrote: > >> On Thu, Aug 20, 2009 at 10:32 AM, Christopher >> Hanley wrote: >> >>> >>> Another concern is that we told people coming from numarray to use >>> this module. ?It is my opinion that at this point in the numpy >>> release >>> cycle that an API change needs a very strong justification. >>> Anecdotes >>> about the number of users, a "change or die" philosophy, and an un- >>> articulated notion of ?"the spirit of numpy" ?do not in my >>> consideration meet that high bar. >> >> I agree those are not strong reasons without more backing. ?What >> worries me the most, in both numpy and scipy, is code that nobody >> knows about, without any test or documentation. When it breaks, we >> can't fix it. That's unsustainable in the long term, because it takes >> a lot of time that people could spend somewhere else more useful. >> Especially when you have C code which does not work on some platforms, >> with new version of python (python 3k port, for example). > > > The claim that "chararray" is not understood is not true. ? I know > about the code. ?It's not that difficult of a piece of code. ? It's > utility can be questioned, but it was and may still be an important > part of Numarray compatibilty. I did not want to imply that chararray is unknown, sorry for the confusion. I just wanted to say that old code is not an argument to remove something. Whether someone is willing to maintain it is a much better argument IMO. cheers, David From stefan at sun.ac.za Thu Aug 20 20:14:25 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Thu, 20 Aug 2009 17:14:25 -0700 Subject: [Numpy-discussion] Removing scipy.stsci was [Re: [SciPy-dev] Deprecate chararray [was Plea for help]] In-Reply-To: <88B0EA8C-5637-4050-AC4F-023C6D41BB3A@stsci.edu> References: <857977.74958.qm@web52106.mail.re2.yahoo.com> <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> <9457e7c80908201648r474297fo1d6ad2061c240fa6@mail.gmail.com> <88B0EA8C-5637-4050-AC4F-023C6D41BB3A@stsci.edu> Message-ID: <9457e7c80908201714u3d09395o4982688d0499235a@mail.gmail.com> 2009/8/20 Christopher Hanley : > Will the Image Processing Scikit be dedicated to working with a single > image or stacks of images? Thanks for the reminder -- I have to add ImageCollection to the set of features. 
Fernando started working on something similar in 2006, and I've implemented a cached reader here: http://mentat.za.net/supreme/doc/supreme.misc.io.ImageCollection-class.html To get back to your comment: what kind of operations would you like to see supported on multiple images? Regards St?fan From pfeldman at verizon.net Thu Aug 20 20:46:43 2009 From: pfeldman at verizon.net (Dr. Phillip M. Feldman) Date: Thu, 20 Aug 2009 17:46:43 -0700 (PDT) Subject: [Numpy-discussion] itemsize() doesn't work Message-ID: <25072522.post@talk.nabble.com> I've been reading the online NumPy tutorial at the following URL: http://numpy.scipy.org/numpydoc/numpy-10.html When I try the following example, I get an error message: In [1]: a=arange(10) In [2]: a.itemsize() --------------------------------------------------------------------------- TypeError Traceback (most recent call last) C:\Python\ in () TypeError: 'int' object is not callable -- View this message in context: http://www.nabble.com/itemsize%28%29-doesn%27t-work-tp25072522p25072522.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From stefan at sun.ac.za Thu Aug 20 20:58:14 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Thu, 20 Aug 2009 17:58:14 -0700 Subject: [Numpy-discussion] itemsize() doesn't work In-Reply-To: <25072522.post@talk.nabble.com> References: <25072522.post@talk.nabble.com> Message-ID: <9457e7c80908201758v500c1797s555236d1c2859125@mail.gmail.com> 2009/8/20 Dr. Phillip M. Feldman : > > I've been reading the online NumPy tutorial at the following URL: > > http://numpy.scipy.org/numpydoc/numpy-10.html > > When I try the following example, I get an error message: > > In [1]: a=arange(10) > In [2]: a.itemsize() This is a mistake, and should be "a.itemsize". The latest docs are always on docs.scipy.org, so if this same mistake occurs there, please fix it. Thanks! St?fan From pfeldman at verizon.net Thu Aug 20 21:00:46 2009 From: pfeldman at verizon.net (Dr. Phillip M. Feldman) Date: Thu, 20 Aug 2009 18:00:46 -0700 (PDT) Subject: [Numpy-discussion] how to find array indices at which a condition is satisfied? Message-ID: <25072656.post@talk.nabble.com> I have a 1-D array and would like to generate a list of indices for which a given condition is satisfied. What is the cleanest way to do this? -- View this message in context: http://www.nabble.com/how-to-find-array-indices-at-which-a-condition-is-satisfied--tp25072656p25072656.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From josef.pktd at gmail.com Thu Aug 20 21:05:35 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 20 Aug 2009 21:05:35 -0400 Subject: [Numpy-discussion] itemsize() doesn't work In-Reply-To: <9457e7c80908201758v500c1797s555236d1c2859125@mail.gmail.com> References: <25072522.post@talk.nabble.com> <9457e7c80908201758v500c1797s555236d1c2859125@mail.gmail.com> Message-ID: <1cd32cbb0908201805y5b13cce6p6da9350559e8e0c3@mail.gmail.com> 2009/8/20 St?fan van der Walt : > 2009/8/20 Dr. Phillip M. Feldman : >> >> I've been reading the online NumPy tutorial at the following URL: >> >> http://numpy.scipy.org/numpydoc/numpy-10.html >> >> When I try the following example, I get an error message: >> >> In [1]: a=arange(10) >> In [2]: a.itemsize() > > This is a mistake, and should be "a.itemsize". ?The latest docs are > always on docs.scipy.org, so if this same mistake occurs there, please > fix it. > > Thanks! 
> St?fan > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > itemsize is listed as an attribute in the numpy 1.2 docs. Josef From robert.kern at gmail.com Thu Aug 20 21:05:48 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 20 Aug 2009 18:05:48 -0700 Subject: [Numpy-discussion] how to find array indices at which a condition is satisfied? In-Reply-To: <25072656.post@talk.nabble.com> References: <25072656.post@talk.nabble.com> Message-ID: <3d375d730908201805j4ceeea28rb9824a2645a23a98@mail.gmail.com> On Thu, Aug 20, 2009 at 18:00, Dr. Phillip M. Feldman wrote: > > I have a 1-D array and would like to generate a list of indices for which a > given condition is satisfied. ?What is the cleanest way to do this? numpy.nonzero(condition)[0] -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From stefan at sun.ac.za Thu Aug 20 21:06:30 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Thu, 20 Aug 2009 18:06:30 -0700 Subject: [Numpy-discussion] how to find array indices at which a condition is satisfied? In-Reply-To: <25072656.post@talk.nabble.com> References: <25072656.post@talk.nabble.com> Message-ID: <9457e7c80908201806s37c86fa8pef48a6cd0f5c38d6@mail.gmail.com> 2009/8/20 Dr. Phillip M. Feldman : > > I have a 1-D array and would like to generate a list of indices for which a > given condition is satisfied. ?What is the cleanest way to do this? np.where(x > 0) St?fan From eadrogue at gmx.net Thu Aug 20 21:14:02 2009 From: eadrogue at gmx.net (Ernest =?iso-8859-1?Q?Adrogu=E9?=) Date: Fri, 21 Aug 2009 03:14:02 +0200 Subject: [Numpy-discussion] how to find array indices at which a condition is satisfied? In-Reply-To: <25072656.post@talk.nabble.com> References: <25072656.post@talk.nabble.com> Message-ID: <20090821011402.GA4312@doriath.local> 20/08/09 @ 18:00 (-0700), thus spake Dr. Phillip M. Feldman: > I have a 1-D array and would like to generate a list of indices for which a > given condition is satisfied. What is the cleanest way to do this? you can do something like this: numpy.arange(len(x))[x > 5] it'll give you the indices of x where x is > 5. Ernest From d_l_goldsmith at yahoo.com Thu Aug 20 23:18:31 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Thu, 20 Aug 2009 20:18:31 -0700 (PDT) Subject: [Numpy-discussion] itemsize() doesn't work In-Reply-To: <25072522.post@talk.nabble.com> Message-ID: <238384.43034.qm@web52112.mail.re2.yahoo.com> Thanks for the bug report! DG --- On Thu, 8/20/09, Dr. Phillip M. Feldman wrote: > From: Dr. Phillip M. Feldman > Subject: [Numpy-discussion] itemsize() doesn't work > To: numpy-discussion at scipy.org > Date: Thursday, August 20, 2009, 5:46 PM > > I've been reading the online NumPy tutorial at the > following URL: > > http://numpy.scipy.org/numpydoc/numpy-10.html > > When I try the following example, I get an error message: > > In [1]: a=arange(10) > In [2]: a.itemsize() > --------------------------------------------------------------------------- > TypeError? ? ? ? ? ? ? > ? ? ? ? ? ? ? ? 
From d_l_goldsmith at yahoo.com Thu Aug 20 23:18:31 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Thu, 20 Aug 2009 20:18:31 -0700 (PDT)
Subject: [Numpy-discussion] itemsize() doesn't work
In-Reply-To: <25072522.post@talk.nabble.com>
Message-ID: <238384.43034.qm@web52112.mail.re2.yahoo.com>

Thanks for the bug report!

DG

--- On Thu, 8/20/09, Dr. Phillip M. Feldman wrote:

> From: Dr. Phillip M. Feldman
> Subject: [Numpy-discussion] itemsize() doesn't work
> To: numpy-discussion at scipy.org
> Date: Thursday, August 20, 2009, 5:46 PM
>
> I've been reading the online NumPy tutorial at the following URL:
>
> http://numpy.scipy.org/numpydoc/numpy-10.html
>
> When I try the following example, I get an error message:
>
> In [1]: a=arange(10)
> In [2]: a.itemsize()
> ---------------------------------------------------------------------------
> TypeError                                 Traceback (most recent call last)
> C:\Python\ in ()
> TypeError: 'int' object is not callable
> --
> View this message in context: http://www.nabble.com/itemsize%28%29-doesn%27t-work-tp25072522p25072522.html
> Sent from the Numpy-discussion mailing list archive at Nabble.com.
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com

From sccolbert at gmail.com Thu Aug 20 23:22:34 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Thu, 20 Aug 2009 23:22:34 -0400
Subject: [Numpy-discussion] nosetests and permissions
In-Reply-To: <5b8d13220908201533l1db7bfaem9f61874cb76bc73e@mail.gmail.com>
References: <7f014ea60908201352v395a09e8qff6d6f1abd6c743c@mail.gmail.com>
	<7f014ea60908201406p30003988l383cf0a5c00433ef@mail.gmail.com>
	<5b8d13220908201533l1db7bfaem9f61874cb76bc73e@mail.gmail.com>
Message-ID: <7f014ea60908202022i58c88421na6ad313c1d1fae6d@mail.gmail.com>

tarball from sourceforge.

On Thu, Aug 20, 2009 at 6:33 PM, David Cournapeau wrote:
> On Thu, Aug 20, 2009 at 2:06 PM, Chris Colbert wrote:
>> the issue is that the files are executable. I have no idea why they are set that way either. This is numpy 1.3.0 built from source.
>
> Which sources are you using ? The tarball on sourceforge, from svn, etc... ?
>
> cheers,
>
> David
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From d_l_goldsmith at yahoo.com Fri Aug 21 00:47:09 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Fri, 21 Aug 2009 04:47:09 +0000 (GMT)
Subject: [Numpy-discussion] itemsize() doesn't work
In-Reply-To: <9457e7c80908201758v500c1797s555236d1c2859125@mail.gmail.com>
Message-ID: <688030.64633.qm@web52108.mail.re2.yahoo.com>

Hi, Stefan. Is this editable through the Wiki? I went to the Docstrings page and searched for "numpydoc" and "tutorial" and got no hits.

DG

--- On Thu, 8/20/09, Stéfan van der Walt wrote:

> From: Stéfan van der Walt
> Subject: Re: [Numpy-discussion] itemsize() doesn't work
> To: "Discussion of Numerical Python"
> Date: Thursday, August 20, 2009, 5:58 PM
> 2009/8/20 Dr. Phillip M. Feldman :
> >
> > I've been reading the online NumPy tutorial at the following URL:
> >
> > http://numpy.scipy.org/numpydoc/numpy-10.html
> >
> > When I try the following example, I get an error message:
> >
> > In [1]: a=arange(10)
> > In [2]: a.itemsize()
>
> This is a mistake, and should be "a.itemsize". The latest docs are always on docs.scipy.org, so if this same mistake occurs there, please fix it.
>
> Thanks!
> Stéfan
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com
From ralf.gommers at googlemail.com Fri Aug 21 01:00:31 2009
From: ralf.gommers at googlemail.com (Ralf Gommers)
Date: Fri, 21 Aug 2009 01:00:31 -0400
Subject: [Numpy-discussion] [SciPy-dev] Deprecate chararray [was Plea for help]
In-Reply-To: <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu>
References: <857977.74958.qm@web52106.mail.re2.yahoo.com>
	<812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu>
Message-ID:

On Thu, Aug 20, 2009 at 1:32 PM, Christopher Hanley wrote:

> Also, I do not know how many people use this particular feature. However I would point out that many people who use numpy are not also on the mailing lists. Most of the STScI do not follow the numpy list. I serve as our point of contact to the numpy community. I'm trying to gather a list of projects that use this feature and specific use cases for you.

Great. Even one good use case might change my opinion of chararray.

> As I do not use this module myself I cannot counter your arguments at this time. If we decide to deprecate this module would we reverse this decision if we then find out that the assumptions that went into the decision were in error?

That would make sense.

> Another concern is that we told people coming from numarray to use this module. It is my opinion that at this point in the numpy release cycle that an API change needs a very strong justification. Anecdotes about the number of users, a "change or die" philosophy, and an un-articulated notion of "the spirit of numpy" do not in my consideration meet that high bar.

That is not very fair. I gave you four reasons for assuming there are not many users, other arguments you leave out here are the state of the code, lack of docs, tests and (most importantly) a use case.

> If you would like us to provide additional documentation and tests that would be possible. I'll do it myself if that is the only thing keeping the module from remaining in numpy.

Thanks a lot, that would definitely help.

How about for now we document the module as being there for numarray compatibility and not recommended for new development? Then if you turn up a good use case we add it to the docs, and if you don't we revisit the deprecation issue?

Cheers,
Ralf

From schut at sarvision.nl Fri Aug 21 03:58:34 2009
From: schut at sarvision.nl (Vincent Schut)
Date: Fri, 21 Aug 2009 09:58:34 +0200
Subject: [Numpy-discussion] Removing scipy.stsci was [Re: [SciPy-dev] Deprecate chararray [was Plea for help]]
In-Reply-To: References: <857977.74958.qm@web52106.mail.re2.yahoo.com>
	<812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu>
	<5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com>
	<9457e7c80908201648r474297fo1d6ad2061c240fa6@mail.gmail.com>
	<88B0EA8C-5637-4050-AC4F-023C6D41BB3A@stsci.edu>
Message-ID:

Christopher Hanley wrote:
> Hi Stefan,
>
> Never mind. I just found the Sprint website and read the description. I'm sorry I hadn't found this sooner. I would have made plans to stay and help. My apologies.
>

Hi list,

I just saw this too and would like to misuse this thread to suggest another enhancement related to image processing you might consider: make ndimage maskedarray aware. Unfortunately we don't have any money to spend, otherwise I'd love to support this financially too.
But it's a thing I'm running into (ndimage not working with missing data, that is) regularly, and for which it often is pretty hard to work out a workaround. E.g. any of the resampling (ndimage.zoom) or kernel filtering routines choke on arrays with NaNs, and don't recognize masked arrays. Alas, virtually all of the image data I process (satellite imagery) contains missing/bad data...

I know it probably will be a pretty involved task, as ndimage comes from numarray and seems to be largely implemented in C. But I really wanted to raise the issue now the image processing subject turns up once again, and hope some folks with more/better programming skills than me might like the idea...

Oh and I know of course ndimage is scipy, and this list is numpy. But as the image processing subject emerged here, well...

Cheers,
Vincent Schut.

> Sorry,
> Chris
>

From pav+sp at iki.fi Fri Aug 21 04:30:58 2009
From: pav+sp at iki.fi (Pauli Virtanen)
Date: Fri, 21 Aug 2009 08:30:58 +0000 (UTC)
Subject: [Numpy-discussion] itemsize() doesn't work
References: <9457e7c80908201758v500c1797s555236d1c2859125@mail.gmail.com>
	<688030.64633.qm@web52108.mail.re2.yahoo.com>
Message-ID:

Fri, 21 Aug 2009 04:47:09 +0000, David Goldsmith wrote:
[clip]
> > > http://numpy.scipy.org/numpydoc/numpy-10.html
[clip]
> Is this editable through the Wiki? I went to the Docstrings page and searched for "numpydoc" and "tutorial" and got no hits.

This is the old Numeric module documentation. It probably doesn't describe all points of Numpy accurately. Of course, the URL is misleading...

--
Pauli Virtanen

From nouiz at nouiz.org Fri Aug 21 10:01:26 2009
From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=)
Date: Fri, 21 Aug 2009 10:01:26 -0400
Subject: [Numpy-discussion] Accelerating NumPy computations [Was: GPU Numpy]
In-Reply-To: <1250766444.5546.40.camel@inspiron>
References: <4A7B43CE.7050509@molden.no>
	<7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com>
	<4A7B5B06.2080909@molden.no> <4A7B62FC.6090504@molden.no>
	<1250766444.5546.40.camel@inspiron>
Message-ID: <2d1d7fe70908210701k49fb02a1l8818779f1864bce0@mail.gmail.com>

Hi,

On Thu, Aug 20, 2009 at 7:07 AM, Francesc Alted wrote:
> On Thursday 20 August 2009 at 00:37 -0700, Erik Tollerud wrote:
> > > NumPy arrays on the GPU memory is an easy task. But then I would have to write the computation in OpenCL's dialect of C99? But I'd rather program everything in Python if I could. Details like GPU and OpenCL should be hidden away. Nice looking Python with NumPy is much easier to read and write. That is why I'd like to see a code generator (i.e. JIT compiler) for NumPy.
> >
> > This is true to some extent, but also probably difficult to do given the fact that parallelizable algorithms are generally more difficult to formulate in straightforward ways. In the intermediate-term, I think there is value in having numpy implement some sort of interface to OpenCL or cuda - I can easily see an explosion of different bindings (it's already starting), and having a "canonical" way encoded in numpy or scipy is probably the best way to mitigate the inevitable compatibility problems... I'm partial to the way pycuda can do it (basically, just export numpy arrays to the GPU and let you write the code from there), but the main point is to just get some basic compatibility in pretty quickly, as I think this GPGPU is here to stay...
>
> Maybe.
> However I think that we should not forget the fact that, as Sturla pointed out, the main bottleneck for *many* problems nowadays is memory access, not CPU speed. GPUs may have faster memory, but only a few % better than main stream memory. I'd like to hear from anyone here having achieved any kind of speed-up in their calculations by using GPUs instead of CPUs. By looking at these scenarios we may get an idea of where GPUs can be useful, and if driving an effort to give support for them in NumPy would be worth the effort.

I have around 10x speed up in convolution. I compare against my own version on the cpu that is 20-30x faster than the version in scipy... I should backport some of my optimisations (not all possible, as I removed some cases), but I didn't get the time.

The GPU is the most useful when the bottleneck is the cpu, not the memory, and the problem must be highly parallel. In that case speed-ups of 100-200x have been reported. But take those numbers with a grain of salt: many of them don't talk much about the cpu implementation. In that case, they probably compare a highly optimized version on the GPU against a non-optimised version on the CPU. I have seen a case where they don't tell which version of blas they used on the cpu for matrix multiplication. So this can be that they just forgot to tell that they used an optimized one, or that they didn't use one. In the last case, the speed-up doesn't have any meaning...

> I personally think that, in general, exposing GPU capabilities directly to NumPy would provide little service for most NumPy users. I rather see letting this task to specialized libraries (like PyCUDA, or special versions of ATLAS, for example) that can be used from NumPy.

Specialized libraries can be a good start, as currently there is too much uncertainty in the language (opencl vs nvidia api driver (pycuda, but not cublas, cufft, ...) vs c-cuda (cublas, cufft)). One thing that could help all those specialized libraries (I make one with James B., cuda_ndarray) is to have a standardized version of NDarray for the gpu. But I'm not sure it is a good time to do it now.

> Until then, I think that a more direct approach (and one that would deliver results earlier) for speeding-up NumPy is to be aware of the hierarchical nature of the different memory levels in current CPU's and make NumPy play nicely with it. In that sense, I think that applying the blocking technique (see [1] for a brief explanation) for taking advantage of both spatial and temporal localities is the way to go. For example, most of the speed-up that Numexpr achieves comes from the fact that it uses blocking during the evaluation of complex expressions. This is so because the temporaries are kept small and can fit in current CPU caches. Implementing similar algorithms in NumPy should not be that difficult, especially now that the Numexpr implementation already exists as a model.

> And another thing that may further help to fight memory slowness (or CPU/GPU quickness, as you prefer ;-) in the near future is compression. Compression already helped bringing data faster from disk to CPU in the last 10 years, and now it is almost time that this can happen with the memory too, not only disk. In [1] I demonstrated that compression can *already* help transmitting data in memory to CPU. Agreed, right now this is only true for highly compressible data (which is an important corner case anyway), but in the short future we will see how the compression technique would be able to accelerate computations for a high variety of datasets, even if they are not very compressible.

> So, in my humble opinion, implementing the possibility that NumPy can deal with compressed buffers in addition to uncompressed ones could be very interesting in the short future (or even now, in specific situations).

> [1] http://www.pytables.org/docs/StarvingCPUs.pdf

Very interesting. Optimized numpy on the cpu is a good thing, as not all algorithms are well suited for the gpu, and when we make new algorithms, doing it on the cpu is MUCH easier today.

Frederic Bastien
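To make the blocking point concrete, here is a toy comparison using the numexpr package that Francesc mentions (array names and sizes are made up; ne.evaluate processes the expression in cache-sized chunks, so the intermediates of "2*a + 3*b" never materialize as full-size temporaries):

import numpy as np
import numexpr as ne

a = np.random.rand(10**7)
b = np.random.rand(10**7)

c = ne.evaluate("2*a + 3*b")   # blocked evaluation, small temporaries
c_np = 2*a + 3*b               # plain numpy: two full-size temporaries
assert np.allclose(c, c_np)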
From matthieu.brucher at gmail.com Fri Aug 21 10:44:04 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Fri, 21 Aug 2009 16:44:04 +0200
Subject: [Numpy-discussion] Accelerating NumPy computations [Was: GPU Numpy]
In-Reply-To: <2d1d7fe70908210701k49fb02a1l8818779f1864bce0@mail.gmail.com>
References: <4A7B43CE.7050509@molden.no>
	<7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com>
	<4A7B5B06.2080909@molden.no> <4A7B62FC.6090504@molden.no>
	<1250766444.5546.40.camel@inspiron>
	<2d1d7fe70908210701k49fb02a1l8818779f1864bce0@mail.gmail.com>
Message-ID:

>> I personally think that, in general, exposing GPU capabilities directly to NumPy would provide little service for most NumPy users. I rather see letting this task to specialized libraries (like PyCUDA, or special versions of ATLAS, for example) that can be used from NumPy.
>
> Specialized libraries can be a good start, as currently there is too much uncertainty in the language (opencl vs nvidia api driver (pycuda, but not cublas, cufft, ...) vs c-cuda (cublas, cufft))

Indeed. In the future, if OpenCL is the way to go, it may even be helpful to have Numpy using OpenCL directly, as AMD provides an SDK for OpenCL, and with Larrabee approaching, Intel will surely provide one of its own.

Matthieu
--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From mike.ressler at alum.mit.edu Fri Aug 21 11:47:24 2009
From: mike.ressler at alum.mit.edu (Mike Ressler)
Date: Fri, 21 Aug 2009 08:47:24 -0700
Subject: [Numpy-discussion] A better median function?
Message-ID: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com>

I presented this during a lightning talk at the scipy conference yesterday, so again, at the risk of painting myself as a flaming idiot:

---------------------
Wanted: A Better/Faster median() Function

numpy implementation uses simple sorting algorithm:
Sort all the data using the .sort() method
Return middle value (or mean of two middle values)

One doesn't have to sort all data - need only the middle value

Nicolas Devillard discusses several algorithms at
http://ndevilla.free.fr/median/median/index.html

Implemented Devillard's version of the Numerical Recipes select() function using ctypes: 2 to 20 times faster on the large (> 10^6 points) arrays I tested
--- Caveat: I don't have all the bells and whistles of the built-in median function (multiple dimensions, non-contiguous, etc.)

Any of the numpy developers interested in pursuing this further?
-----------------------

I got a fairly loud "yes" from the back of the room which a few of us guessed was Robert Kern. I take that as generic interest at least in checking this out.

The background on this is that I am doing some glitch finding algorithms where I call median frequently. I think my ultimate problem is not in median(), but how I loop through the data, but that is a different discussion. What I noticed as I was investigating was what I noted in the slide above. Returning the middle of a sorted vector is not a bad thing to do (admit it, we've all done it at some point), but it does too much work. Things that are lower or higher than the median don't need to be in a perfectly sorted order if all we are after is the median value.

I did some googling and came up with the web page noted above. I used his modified NumRec select() function as an excuse to learn ctypes, and my initial weak attempts were successful. The speed ups depend highly on the length of the data and the randomness - things that are correlated or partially sorted already go quickly. My caveat is that my select-based median is too simple; it must have 1-d contiguous data of a predefined type. It also moves the data in place, affecting the original variable. I have no idea how this will blow up if implemented in a general purpose way.

Anyway, I'm not enough of a C-coder to have any hope of improving this to the point where it can be included in numpy itself. However, if someone is willing to take up the torch, I will volunteer to assist with discussion, prototyping a few routines, and testing (I have lots of real-world data). One could argue that the current median implementation is good enough (and it probably is for 99% of all usage), but I view this as a chance to add an industrial strength routine to the numpy base.

Thanks for listening.

Mike

--
mike.ressler at alum.mit.edu

From sturla at molden.no Fri Aug 21 20:48:24 2009
From: sturla at molden.no (Sturla Molden)
Date: Fri, 21 Aug 2009 17:48:24 -0700
Subject: [Numpy-discussion] PRNGs and multi-threading
Message-ID: <4A8F4058.80203@molden.no>

I am not sure if this is the right place to discuss this issue. However, a problem I keep running into in Monte Carlo simulations is generating pseudorandom numbers with multiple threads. PRNGs such as the Mersenne Twister keep an internal state, which prevents the PRNG from being re-entrant and thread-safe.

Possible solutions:

1. Use multiple instances of PRNG states (numpy.random.RandomState), one for each thread. This should give no contention, but is this mathematically acceptable? I don't know. At least I have not seen any proof that it is.

2. Protect the PRNG internally with a spinlock. In Windows lingo, that is:

#include <windows.h>
static volatile long spinlock = 0;
#define ACQUIRE_SPINLOCK while(InterlockedExchangeAcquire(&spinlock, 1));
#define RELEASE_SPINLOCK InterlockedExchangeAcquire(&spinlock, 0);

Problem: possible contention between threads, idle work if threads spin a lot.

3. Use a conventional mutex object to protect the PRNG (e.g. threading.Lock in Python or CRITICAL_SECTION in Windows). Problem: contention, context shifting, and mutexes tend to be slow. Possibly the worst solution.

4. Put the PRNG in a dedicated thread, fill up rather big arrays with pseudo-random numbers, and write them to a queue. Problem: Context shifting unless a CPU is dedicated to this task. Unless producing random numbers constitutes a major portion of the simulation, this should not lead to much contention.
import threading
import Queue
import numpy
from numpy.random import rand

class PRNG(threading.Thread):
    ''' A thread that generates arrays with random numbers
        and dumps them to a queue. '''

    def __init__(self, nthreads, shape):
        # initialize the Thread base class before anything else
        threading.Thread.__init__(self)
        self.shape = shape
        self.count = numpy.prod(shape)
        self.queue = Queue.Queue(4 * nthreads) # magic number
        self.stop_evt = threading.Event()

    def generate(self, block=True, timeout=None):
        return self.queue.get(block, timeout)

    def join(self, timeout=None):
        self.stop_evt.set()
        super(PRNG, self).join(timeout)

    def run(self):
        # use the self.count/self.shape attributes set in __init__
        tmp = rand(self.count).reshape(self.shape)
        while 1:
            try:
                self.queue.put(tmp, block=True, timeout=2.0)
            except Queue.Full:
                if self.stop_evt.isSet():
                    break
            else:
                tmp = rand(self.count).reshape(self.shape)

Do you have any view on this? Is there any way of creating multiple independent random states that will work correctly? I know of SPRNG (Scalable PRNG), but it is made to work with MPI (which I don't use).

Regards,
Sturla Molden

From robert.kern at gmail.com Fri Aug 21 12:12:06 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 21 Aug 2009 09:12:06 -0700
Subject: [Numpy-discussion] PRNGs and multi-threading
In-Reply-To: <4A8F4058.80203@molden.no>
References: <4A8F4058.80203@molden.no>
Message-ID: <3d375d730908210912v4abed520vfc515c41858b2af6@mail.gmail.com>

On Fri, Aug 21, 2009 at 17:48, Sturla Molden wrote:
>
> I am not sure if this is the right place to discuss this issue. However, a problem I keep running into in Monte Carlo simulations is generating pseudorandom numbers with multiple threads. PRNGs such as the Mersenne Twister keep an internal state, which prevents the PRNG from being re-entrant and thread-safe.

C extension function calls are always atomic unless they release the GIL. numpy.random does not. You have de facto locks thanks to the GIL.

> Possible solutions:
>
> 1. Use multiple instances of PRNG states (numpy.random.RandomState), one for each thread. This should give no contention, but is this mathematically acceptable? I don't know. At least I have not seen any proof that it is.

As long as you use different seeds, I believe this is fine. The state size of MT is so enormous that almost any reasonable use will not find overlaps.

Although you don't really have re-entrancy issues, you will usually want one PRNG per thread for determinism.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco
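A minimal sketch of the one-PRNG-per-thread approach described above (the worker function, seeds and array size are made up for illustration; distinct seeds keep each run deterministic regardless of how the OS schedules the threads):

import threading
import numpy

def worker(seed, out, i):
    rng = numpy.random.RandomState(seed)   # private state, nothing shared
    out[i] = rng.rand(1000).sum()

out = [None] * 4
threads = [threading.Thread(target=worker, args=(1234 + i, out, i))
           for i in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print out   # the same four numbers on every run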
From d_l_goldsmith at yahoo.com Fri Aug 21 12:50:00 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Fri, 21 Aug 2009 09:50:00 -0700 (PDT)
Subject: Re: [Numpy-discussion] A better median function?
In-Reply-To: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com>
Message-ID: <963316.2405.qm@web52109.mail.re2.yahoo.com>

Not to make you regret your post ;-) but, you having readily furnished your email address, I'm taking the liberty of forwarding you my resume - I'm the guy who introduced himself yesterday by asking if you knew Don Hall - in case you have need of an experienced CCD data reduction programmer who knows Python, numpy, and matplotlib, as well as IDL, matlab, C/C++, and, from the "distant past", FORTRAN (not to mention advanced math and a little advanced physics, to boot). Caveat: I'm not presently in a position to relocate. :-( Thanks for your time and consideration,

David Goldsmith

DAVID GOLDSMITH
2036 Lakemoor Dr. SW
Olympia, WA 98512
360-753-2318
dgoldsmith_89 at alumni.brown.edu

Career Interests: Support of research possessing a strong component of one or more of the following: mathematics, statistics, programming, modeling, physical sciences, engineering, etc.

Desired salary rate: $75,000/yr.

Skills

Computer

Operating Systems: Windows, Macintosh, Unix
Programming/Technical: Python, C/C++, SWIG, numpy, matplotlib, wxmpl, wxWidgets, SPE, Visual Studio .NET 2003, Trac, TortoiseSVN, RapidSVN, WinCVS, LAPACK, Matlab, Scientific Workplace, IDL, FORTRAN, Splus, Django (learning in progress).
Office: MS Word, Excel, PowerPoint, Outlook, Publisher, etc.; Page Maker; etc.
Communications: Firefox, Thunderbird, VPN, MS Explorer, Netscape, NCSA Telnet, Fetch, WS FTP, telnet, ftp, lynx, pine, etc.

Other

Advanced mathematics, statistics, physics, fluid dynamics, engineering, etc.; technical documentation.

Programming Employment

Technical Editor (Research Manager); June, 2009 to present; Planetary Sciences Group, Dept. of Physics, University of Central Florida, Orlando, FL (but working out of Olympia, WA). Write and review a broad range of docstrings for NumPy, the standard Python module for numerical computing, and manage the 2009 NumPy Documentation Summer Marathon, including volunteer recruitment and coordination, project promotion, grant writing for perpetuation of the project, etc.

Programming Mathematical Modeler (Functional Analyst II); June, 2004 through February, 2008; Emergency Response Division, National Oceanic and Atmospheric Administration, Seattle, WA (under contract with General Dynamics Information Technology, Fairfax, VA). Develop 3D enhancements to existing 2D estuarine circulation codes and data visualization and analysis tools in Python and C++, using SWIG, numpy, C/LAPACK, ATLAS, matplotlib, wxmpl, SPE, Visual Studio/Visual C++, wxWidgets, RapidSVN, TortoiseSVN, WinCVS, etc. as development tools; confer regularly with other physical scientists, mathematicians, and programmers about these tools and other issues/projects related to hazardous material emergency response.

Programming Statistician (Research Associate V); May, 1999 to September, 2001; Institute for Astronomy, University of Hawai`i, Hilo. Developed IDL-based software for analysis of data obtained in development of solid-state sensor technology for the Next Generation Space Telescope, and other related computer activities.

Programming Research Assistant; September to December, 1997; Physics Dept., Univ. of Montana, Missoula. Assisted in the development of a FORTRAN computational model for optimization of toroidal plasma confinement.

Programming Research Assistant; June to August, 1997; Physics Dept., Univ. of Montana, Missoula. Assisted in FORTRAN computer modeling of passive scalar transport in the stratosphere.

Programming Research Assistant; June to August, 1997; Mathematical Sciences Dept., Univ. of Montana, Missoula. Developed, in MATLAB, a cellular-automata-based simulation of flow around windmill turbine blades.

Programming Consultant; April, 1995; Earth Justice Legal Defense Fund, Honolulu, Hawai`i. Developed Excel spreadsheet to determine sewage discharge violations from municipal wastewater facility records.

Programming Research Assistant; June to August, 1985 and 1986; Plasma Physics Branch, Naval Research Laboratory, Washington, DC. Assisted in FORTRAN computer modeling of plasma switching devices.

Publications (abridged)
2000, w/ D. Hall (1st author) et al., "Characterization of lambda_c ~ 5 micron Hg:Cd:Te Arrays for Low-Background Astronomy", Optical and IR Telescope Instrumentation and Detectors, Proceedings of SPIE, Vol. 4008, Part 2.

2000, w/ D. Hall (1st author) et al., "Molecular Beam Epitaxial Mercury Cadmium Telluride: A Quiet, Warm FPA For NGST", Astr. Soc. Pacific Conf. Ser., Vol. 207.

1997, w/ A. Ware (1st author) et al., "Stability of Small Aspect Ratio Toroidal Hybrid Devices", American Physical Society, Plasma Physics Section, Semi-annual meeting.

Education (abridged)

Master of Arts, Mathematical Sciences, University of Montana, Missoula, awarded May, 1998. GPA: 4.0.

Master of Science, Aquacultural Engineering, University of Hawai`i, Manoa, awarded August, 1993. GPA: 3.72.

Bachelor of Arts, Mathematics, Brown University, Providence, Rhode Island, awarded May, 1989. GPA: Unreported (Brown does not routinely calculate GPA's; unofficially: 3.83).

References

Prof. Joseph Harrington, Ph.D., Department of Physics, University of Central Florida, 321-696-9914, jh at physics.ucf.edu
Debbie Payton, Branch Chief and Oceanographer, Emergency Response Division, NOAA, 206-526-6320, debbie.payton at noaa.gov
Glen Watabayashi, Operations Manager and Oceanographer, ERD, NOAA, 206-526-6324, glen.watabayashi at noaa.gov
Chris Barker, Ph.D., Oceanographer, ERD, NOAA, 206-526-6959, chris.barker at noaa.gov
Don Hall, Ph.D., Institute for Astronomy, University of Hawai`i, 808-932-2360, hall at ifa.hawaii.edu

--- On Fri, 8/21/09, Mike Ressler wrote:

> From: Mike Ressler
> Subject: [Numpy-discussion] A better median function?
> To: "Discussion of Numerical Python"
> Date: Friday, August 21, 2009, 8:47 AM
>
> I presented this during a lightning talk at the scipy conference yesterday, so again, at the risk of painting myself as a flaming idiot:
>
> ---------------------
> Wanted: A Better/Faster median() Function
>
> numpy implementation uses simple sorting algorithm:
> Sort all the data using the .sort() method
> Return middle value (or mean of two middle values)
>
> One doesn't have to sort all data - need only the middle value
>
> Nicolas Devillard discusses several algorithms at
> http://ndevilla.free.fr/median/median/index.html
>
> Implemented Devillard's version of the Numerical Recipes select() function using ctypes: 2 to 20 times faster on the large (> 10^6 points) arrays I tested
> --- Caveat: I don't have all the bells and whistles of the built-in median function (multiple dimensions, non-contiguous, etc.)
>
> Any of the numpy developers interested in pursuing this further?
> -----------------------
>
> I got a fairly loud "yes" from the back of the room which a few of us guessed was Robert Kern. I take that as generic interest at least in checking this out.
>
> The background on this is that I am doing some glitch finding algorithms where I call median frequently. I think my ultimate problem is not in median(), but how I loop through the data, but that is a different discussion. What I noticed as I was investigating was what I noted in the slide above. Returning the middle of a sorted vector is not a bad thing to do (admit it, we've all done it at some point), but it does too much work. Things that are lower or higher than the median don't need to be in a perfectly sorted order if all we are after is the median value.
>
> I did some googling and came up with the web page noted above.
> I used his modified NumRec select() function as an excuse to learn ctypes, and my initial weak attempts were successful. The speed ups depend highly on the length of the data and the randomness - things that are correlated or partially sorted already go quickly. My caveat is that my select-based median is too simple; it must have 1-d contiguous data of a predefined type. It also moves the data in place, affecting the original variable. I have no idea how this will blow up if implemented in a general purpose way.
>
> Anyway, I'm not enough of a C-coder to have any hope of improving this to the point where it can be included in numpy itself. However, if someone is willing to take up the torch, I will volunteer to assist with discussion, prototyping a few routines, and testing (I have lots of real-world data). One could argue that the current median implementation is good enough (and it probably is for 99% of all usage), but I view this as a chance to add an industrial strength routine to the numpy base.
>
> Thanks for listening.
>
> Mike
>
> --
> mike.ressler at alum.mit.edu
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com

From d_l_goldsmith at yahoo.com Fri Aug 21 12:55:48 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Fri, 21 Aug 2009 09:55:48 -0700 (PDT)
Subject: Re: [Numpy-discussion] A better median function?
In-Reply-To: <963316.2405.qm@web52109.mail.re2.yahoo.com>
Message-ID: <676615.2812.qm@web52102.mail.re2.yahoo.com>

Ouch, didn't check my to address first, sorry!!!

DG

--- On Fri, 8/21/09, David Goldsmith wrote:

> From: David Goldsmith
> Subject: Re: [Numpy-discussion] A better median function?
> To: "Discussion of Numerical Python"
> Date: Friday, August 21, 2009, 9:50 AM
> Not to make you regret your post ;-) but, you having readily furnished your email address, I'm taking the liberty of forwarding you my resume [...] Thanks for your time and consideration,
>
> David Goldsmith

From sturla at molden.no Fri Aug 21 22:09:08 2009
From: sturla at molden.no (Sturla Molden)
Date: Fri, 21 Aug 2009 19:09:08 -0700
Subject: [Numpy-discussion] PRNGs and multi-threading
In-Reply-To: <3d375d730908210912v4abed520vfc515c41858b2af6@mail.gmail.com>
References: <4A8F4058.80203@molden.no>
	<3d375d730908210912v4abed520vfc515c41858b2af6@mail.gmail.com>
Message-ID: <4A8F5344.8010205@molden.no>

Robert Kern skrev:
> Although you don't really have re-entrancy issues, you will usually want one PRNG per thread for determinism.

I see... numpy.random.rand does not have re-entrancy issues because of the GIL, but I get indeterminism from the OS scheduling the threads. RandomState might not release the GIL either, but preserves determinism in presence of multiple threads. Thanks. :-)

Regards,
Sturla Molden

From stefan at sun.ac.za Fri Aug 21 13:33:03 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Fri, 21 Aug 2009 10:33:03 -0700
Subject: [Numpy-discussion] Removing scipy.stsci was [Re: [SciPy-dev] Deprecate chararray [was Plea for help]]
In-Reply-To: References: <857977.74958.qm@web52106.mail.re2.yahoo.com>
	<812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu>
	<5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com>
	<9457e7c80908201648r474297fo1d6ad2061c240fa6@mail.gmail.com>
	<88B0EA8C-5637-4050-AC4F-023C6D41BB3A@stsci.edu>
Message-ID: <9457e7c80908211033y41e1e919le24181e277589b25@mail.gmail.com>

Hi Vincent

2009/8/21 Vincent Schut :
> I know it probably will be a pretty involved task, as ndimage comes from numarray and seems to be largely implemented in C. But I really wanted to raise the issue now the image processing subject turns up once again, and hope some folks with more/better programming skills than me might like the idea...

What would you like the behaviour to be? For example, how should ndimage.zoom handle these missing values?

Regards
Stéfan
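In the meantime, one stop-gap for this kind of data is sketched below: fill the missing pixels, zoom the image and a validity mask separately, and renormalise. This is only an illustration of one possible answer to the question, not ndimage behaviour (the array, zoom factor and threshold are made up; only scipy.ndimage.zoom itself is assumed):

import numpy as np
from scipy import ndimage

img = np.random.rand(64, 64)
img[10:20, 30:40] = np.nan             # fake missing data

valid = np.isfinite(img)
filled = np.where(valid, img, 0.0)

num = ndimage.zoom(filled, 2.0, order=1)            # zoomed data
den = ndimage.zoom(valid.astype(float), 2.0, order=1)  # zoomed weights

# renormalise by the interpolated weights; flag mostly-missing output as NaN
out = np.where(den > 0.5, num / np.maximum(den, 1e-12), np.nan)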
From chad.netzer at gmail.com Fri Aug 21 14:10:51 2009
From: chad.netzer at gmail.com (Chad Netzer)
Date: Fri, 21 Aug 2009 11:10:51 -0700
Subject: [Numpy-discussion] A better median function?
In-Reply-To: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com>
References: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com>
Message-ID:

On Fri, Aug 21, 2009 at 8:47 AM, Mike Ressler wrote:
> I presented this during a lightning talk at the scipy conference yesterday, so again, at the risk of painting myself as a flaming idiot:
>
> ---------------------
> Wanted: A Better/Faster median() Function
>
> numpy implementation uses simple sorting algorithm:
> Sort all the data using the .sort() method
> Return middle value (or mean of two middle values)

Michael and I also discussed this briefly this morning at the SciPy conference. I'll summarize a bit:

scipy.signals has a medianfilter() implemented in C which uses the Hoare selection algorithm:

http://en.wikipedia.org/wiki/Selection_algorithm#Partition-based_general_selection_algorithm

This *may* suit Michael's needs. More generally, C++ std lib has both partial_sort() and nth_element(), both of which would be "nice" functionality to have natively (ie. C speeds) in numpy, imo:

http://www.cplusplus.com/reference/algorithm/partial_sort/
http://www.cplusplus.com/reference/algorithm/nth_element/

Since both can be made from a modified subset of quicksort, it should be straightforward to craft these methods from the existing numpy quicksort code. With these primitives, it is trivial to improve the existing numpy median() implementations. I'll probably attempt to tackle it myself, unless anyone else has done it (or has a better idea).

Certainly, the ability to quickly find the minimal or maximal n elements of a sequence, without having to perform a full sort, would be of use to many numpy users. Has this problem already been solved in numpy?

-Chad

From matthew.brett at gmail.com Fri Aug 21 14:33:43 2009
From: matthew.brett at gmail.com (Matthew Brett)
Date: Fri, 21 Aug 2009 11:33:43 -0700
Subject: [Numpy-discussion] A better median function?
In-Reply-To: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com>
References: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com>
Message-ID: <1e2af89e0908211133n661e0accsd48021bea6158121@mail.gmail.com>

Hi,

> Nicolas Devillard discusses several algorithms at
> http://ndevilla.free.fr/median/median/index.html

Thanks for this. A loud 'yes' from the back of the internet too. I contacted Nicolas Devillard a year or so ago to ask him if we could include his code in Scipy, and he said 'yes'. I can forward this if that's useful.

Nicolas investigated algorithms that find the lower (or upper) median value. The lower median is the median iff there are an odd number of entries in our list, or the lower of the central values in the sort, when there are an even number of values in the list. So, we need the upper _and_ lower median when there are an even number of entries. I guess the necessity of doing those two related searches may change the relative strengths of the algorithms, but I'm sure this is a well-known problem.

Best,

Matthew
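For readers who want to experiment, here is a pure-Python sketch of the select-based median under discussion. This is not Devillard's code or the Numerical Recipes routine, just an illustrative quickselect; for even-length input the upper median comes from a second select, in the spirit of the min-of-the-upper-partition trick discussed later in this thread:

def quickselect(seq, k):
    # return the k-th smallest element (0-based), average O(n)
    a = list(seq)
    while True:
        pivot = a[len(a) // 2]
        lows = [x for x in a if x < pivot]
        pivs = [x for x in a if x == pivot]
        if k < len(lows):
            a = lows
        elif k < len(lows) + len(pivs):
            return pivot
        else:
            k -= len(lows) + len(pivs)
            a = [x for x in a if x > pivot]

def median_select(data):
    n = len(data)
    lower = quickselect(data, (n - 1) // 2)
    if n % 2:
        return lower    # odd length: the lower median is the median
    upper = quickselect(data, n // 2)
    return 0.5 * (lower + upper)

print median_select([3, 1, 4, 1, 5, 9, 2, 6])   # prints 3.5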
From sturla at molden.no Fri Aug 21 23:37:34 2009
From: sturla at molden.no (Sturla Molden)
Date: Fri, 21 Aug 2009 20:37:34 -0700
Subject: [Numpy-discussion] PRNGs and multi-threading
In-Reply-To: <3d375d730908210912v4abed520vfc515c41858b2af6@mail.gmail.com>
References: <4A8F4058.80203@molden.no>
	<3d375d730908210912v4abed520vfc515c41858b2af6@mail.gmail.com>
Message-ID: <4A8F67FE.7060803@molden.no>

Robert Kern skrev:
> As long as you use different seeds, I believe this is fine. The state size of MT is so enormous that almost any reasonable use will not find overlaps.

It seems there is a special version of the Mersenne Twister for this. The code is LGPL (annoying for SciPy but ok for me).

http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/DC/dc.html

Sturla Molden

From sturla at molden.no Fri Aug 21 23:50:32 2009
From: sturla at molden.no (Sturla Molden)
Date: Fri, 21 Aug 2009 20:50:32 -0700
Subject: [Numpy-discussion] PRNGs and multi-threading
In-Reply-To: <4A8F67FE.7060803@molden.no>
References: <4A8F4058.80203@molden.no>
	<3d375d730908210912v4abed520vfc515c41858b2af6@mail.gmail.com>
	<4A8F67FE.7060803@molden.no>
Message-ID: <4A8F6B08.7040802@molden.no>

Sturla Molden skrev:
> It seems there is a special version of the Mersenne Twister for this. The code is LGPL (annoying for SciPy but ok for me).

http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/DC/dgene.pdf

Basically it encodes the thread-ids in the characteristic polynomial of the MT, producing multiple small-period, independent MTs. That solves it then. Too bad this is LGPL. It would be a very useful enhancement to RandomState.

Sturla Molden

From matthew.brett at gmail.com Fri Aug 21 14:51:50 2009
From: matthew.brett at gmail.com (Matthew Brett)
Date: Fri, 21 Aug 2009 11:51:50 -0700
Subject: [Numpy-discussion] Accelerating NumPy computations [Was: GPU Numpy]
In-Reply-To: References: <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com>
	<4A7B5B06.2080909@molden.no> <4A7B62FC.6090504@molden.no>
	<1250766444.5546.40.camel@inspiron>
	<2d1d7fe70908210701k49fb02a1l8818779f1864bce0@mail.gmail.com>
Message-ID: <1e2af89e0908211151s49b20421wbea4d5b15f3f2455@mail.gmail.com>

Hi,

> Indeed. In the future, if OpenCL is the way to go, it may even be helpful to have Numpy using OpenCL directly, as AMD provides an SDK for OpenCL, and with Larrabee approaching, Intel will surely provide one of its own.

I was just in a lecture by one of the Intel people about OpenCL:

http://parlab.eecs.berkeley.edu/bootcampagenda
http://parlab.eecs.berkeley.edu/sites/all/parlab/files/OpenCL_Mattson.pdf

He offered no schedule for an Intel OpenCL implementation, but said that they were committed to it.

The lectures in general were effective in pointing out what a time-consuming effort it can be moving algorithms into the parallel world - including GPUs. The lecture just passed cited the example of a CUDA-based BLAS implementation on the GPU that was slower than the CPU version. Making BLAS go faster required a lot of work to find optimal strategies for blocking, transfer between CPU / GPU shared memory / GPU registers, vector sizes and so on - this on a specific NVIDIA architecture.

I can imagine Numpy being useful for scripting in this C-and-assembler-centric world, making it easier to write automated testers, or even generate C code.

Is anyone out there working on this kind of stuff? I ask only because there seems to be considerable interest here on the Berkeley campus.

Best,

Matthew
From michael at directaid.ca Fri Aug 21 14:34:58 2009
From: michael at directaid.ca (Michael Cooper)
Date: Fri, 21 Aug 2009 12:34:58 -0600
Subject: [Numpy-discussion] Problems distributing NumPy
Message-ID:

Hi all-

I am writing a C++ application with embedded Python scripting. Some of the scripts use NumPy, so I have been working out the best way to distribute NumPy with my software. At the moment, I've got a private folder which I add to the Python path. In this folder, I include all the files which would usually get installed to the "site-packages" folder. To get these files, I've simply unzipped the distutils installer (for NumPy, I am using the "no SSE" version at the moment), and copied the contents of the resulting "PLATLIB" folder to my private folder.

For all the other libraries I am using, this method seems to work fine. However, with NumPy, if I do things this way, when I call "import_array()", it jumps back out of the calling function, skipping the rest of that function, and continues from there. If I install NumPy in the usual way, this does not happen. It seems like the NumPy initialization is failing when I install into the private folder, but not if I use the normal installer.

Many of the users of my software aren't particularly Python savvy, so having them install everything manually is not an option. I would like to avoid having my own installer call external installers, since that's confusing for some users. Finally, if possible, I would like to avoid changing the user's "Python26" folder, since I have no way of knowing what else might be relying on its contents. Lots of searching on how to install third-party libraries has led me to the "PLATLIB" method, so I'm at a bit of a loss as to what else to try.

Does anyone here know what might be going wrong? I am using Python 2.6, Boost 1.38, and NumPy 1.3.0 on a Windows XP system. The embedding program is written in C++, and compiled using Visual Studio 2005.

Thanks,
Michael

From robert.kern at gmail.com Fri Aug 21 15:00:28 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 21 Aug 2009 12:00:28 -0700
Subject: [Numpy-discussion] PRNGs and multi-threading
In-Reply-To: <4A8F6B08.7040802@molden.no>
References: <4A8F4058.80203@molden.no>
	<3d375d730908210912v4abed520vfc515c41858b2af6@mail.gmail.com>
	<4A8F67FE.7060803@molden.no> <4A8F6B08.7040802@molden.no>
Message-ID: <3d375d730908211200p144412e5ve28b18ca39c62545@mail.gmail.com>

On Fri, Aug 21, 2009 at 20:50, Sturla Molden wrote:
> Sturla Molden skrev:
>> It seems there is a special version of the Mersenne Twister for this. The code is LGPL (annoying for SciPy but ok for me).
>
> http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/DC/dgene.pdf
>
> Basically it encodes the thread-ids in the characteristic polynomial of the MT, producing multiple small-period, independent MTs. That solves it then. Too bad this is LGPL. It would be a very useful enhancement to RandomState.

I agree. It might be possible to re-implement it from the original papers, but it's a chunk of work.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco
From mike.ressler at alum.mit.edu Fri Aug 21 15:08:50 2009
From: mike.ressler at alum.mit.edu (Mike Ressler)
Date: Fri, 21 Aug 2009 12:08:50 -0700
Subject: [Numpy-discussion] A better median function?
In-Reply-To: <1e2af89e0908211133n661e0accsd48021bea6158121@mail.gmail.com>
References: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com>
	<1e2af89e0908211133n661e0accsd48021bea6158121@mail.gmail.com>
Message-ID: <268febdf0908211208o4ff73ff1q69146b542c75321d@mail.gmail.com>

Hi,

On Fri, Aug 21, 2009 at 11:33 AM, Matthew Brett wrote:
> Nicolas investigated algorithms that find the lower (or upper) median value. The lower median is the median iff there are an odd number of entries in our list, or the lower of the central values in the sort, when there are an even number of values in the list. So, we need the upper _and_ lower median when there are an even number of entries. I guess the necessity of doing those two related searches may change the relative strengths of the algorithms, but I'm sure this is a well-known problem.

My trivial solution to this was that since the data is now in two "partitions", all one needs to do is quickly scan through the upper partition to find the minimum value. Since you already have the lower median from the select run, this minimum is by definition the upper median. This can be averaged with the lower median and you are done.

Brain dead, perhaps, but it worked for my test. I did not (yet) investigate whether this minimum can be located in some more expedient fashion (i.e. did the select put the minimum in some consistent place where we don't have to scan through half the input array?).

Mike

--
mike.ressler at alum.mit.edu

From matthew.brett at gmail.com Fri Aug 21 15:16:48 2009
From: matthew.brett at gmail.com (Matthew Brett)
Date: Fri, 21 Aug 2009 12:16:48 -0700
Subject: [Numpy-discussion] A better median function?
In-Reply-To: <268febdf0908211208o4ff73ff1q69146b542c75321d@mail.gmail.com>
References: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com>
	<1e2af89e0908211133n661e0accsd48021bea6158121@mail.gmail.com>
	<268febdf0908211208o4ff73ff1q69146b542c75321d@mail.gmail.com>
Message-ID: <1e2af89e0908211216u73200022n6fee38d87bc8e3b1@mail.gmail.com>

> On Fri, Aug 21, 2009 at 11:33 AM, Matthew Brett wrote:
>> Nicolas investigated algorithms that find the lower (or upper) median value. [...]
>
> My trivial solution to this was that since the data is now in two "partitions", all one needs to do is quickly scan through the upper partition to find the minimum value.
...
> Brain dead, perhaps, but it worked for my test.

Nice... Your brain, when dead, is working better than mine,

Matthew
Mike

--
mike.ressler at alum.mit.edu

From matthew.brett at gmail.com  Fri Aug 21 15:16:48 2009
From: matthew.brett at gmail.com (Matthew Brett)
Date: Fri, 21 Aug 2009 12:16:48 -0700
Subject: [Numpy-discussion] A better median function?
In-Reply-To: <268febdf0908211208o4ff73ff1q69146b542c75321d@mail.gmail.com>
References: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com> <1e2af89e0908211133n661e0accsd48021bea6158121@mail.gmail.com> <268febdf0908211208o4ff73ff1q69146b542c75321d@mail.gmail.com>
Message-ID: <1e2af89e0908211216u73200022n6fee38d87bc8e3b1@mail.gmail.com>

> On Fri, Aug 21, 2009 at 11:33 AM, Matthew Brett wrote:
>> Nicolas investigated algorithms that find the lower (or upper) median
>> value. The lower median is the median iff there are an odd number of
>> entries in our list, or the lower of the central values in the sort,
>> when there are an even number of values in the list. So, we need the
>> upper _and_ lower median when there are an even number of entries. I
>> guess the necessity of doing those two related searches may change the
>> relative strengths of the algorithms, but I'm sure this is a
>> well-known problem.
>
> My trivial solution to this was that since the data is now in two
> "partitions", all one needs to do is quickly scan through the upper
> partition to find the minimum value.
...
> Brain dead, perhaps, but it worked for my test.

Nice... Your brain, when dead, is working better than mine,

Matthew

From saintmlx at apstat.com  Fri Aug 21 15:20:41 2009
From: saintmlx at apstat.com (Xavier Saint-Mleux)
Date: Fri, 21 Aug 2009 15:20:41 -0400
Subject: [Numpy-discussion] PRNGs and multi-threading
In-Reply-To: <3d375d730908211200p144412e5ve28b18ca39c62545@mail.gmail.com>
References: <4A8F4058.80203@molden.no> <3d375d730908210912v4abed520vfc515c41858b2af6@mail.gmail.com> <4A8F67FE.7060803@molden.no> <4A8F6B08.7040802@molden.no> <3d375d730908211200p144412e5ve28b18ca39c62545@mail.gmail.com>
Message-ID: <4A8EF389.2080209@apstat.com>

Robert Kern wrote:
> On Fri, Aug 21, 2009 at 20:50, Sturla Molden wrote:
>
>> Sturla Molden skrev:
>>
>>> It seems there is a special version of the Mersenne Twister for this.
>>> The code is LGPL (annoying for SciPy but ok for me).
>>>
>> http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/DC/dgene.pdf
>>
>> Basically it encodes the thread-ids in the characteristic polynomial of
>> the MT, producing multiple small-period, independent MTs. That solves it
>> then. Too bad this is LGPL. It would be a very useful enhancement to
>> RandomState.
>>
>
> I agree. It might be possible to re-implement it from the original
> papers, but it's a chunk of work.

I use the following PRNG class, derived from RandomState, which allows a PRNG to create multiple different sub-PRNGs in a deterministic way:

http://bazaar.launchpad.net/~piaget-dev/piaget/dev/annotate/head%3A/piaget/math/prng.py

It is written in Python and has an Apache license (BSD-like). There is no mathematical proof that different PRNGs will have states "far enough" from each other, but it works well in practice (I've had bad surprises using Python's random.jumpahead, which is not a real jumpahead).

Of course, the mathematically correct way would be to use a correct jumpahead function, but all the implementations that I know of are GPL. A recent article about this is:

www.iro.umontreal.ca/~lecuyer/myftp/papers/jumpmt.pdf

Xavier Saint-Mleux

From bergstrj at iro.umontreal.ca  Fri Aug 21 15:46:01 2009
From: bergstrj at iro.umontreal.ca (James Bergstra)
Date: Fri, 21 Aug 2009 15:46:01 -0400
Subject: [Numpy-discussion] Accelerating NumPy computations [Was: GPU Numpy]
In-Reply-To: <1e2af89e0908211151s49b20421wbea4d5b15f3f2455@mail.gmail.com>
References: <4A7B5B06.2080909@molden.no> <4A7B62FC.6090504@molden.no> <1250766444.5546.40.camel@inspiron> <2d1d7fe70908210701k49fb02a1l8818779f1864bce0@mail.gmail.com> <1e2af89e0908211151s49b20421wbea4d5b15f3f2455@mail.gmail.com>
Message-ID: <7f1eaee30908211246t54fa676fyefa8d32f7bdf37bd@mail.gmail.com>

On Fri, Aug 21, 2009 at 2:51 PM, Matthew Brett wrote:
> I can imagine Numpy being useful for scripting in this
> C-and-assembler-centric world, making it easier to write automated
> testers, or even generate C code.
>
> Is anyone out there working on this kind of stuff? I ask only because
> there seems to be considerable interest here on the Berkeley campus.
>
> Best,
>
> Matthew

Frederic Bastien and I are working on this sort of thing. We use a project called theano to build symbolic expression graphs. Theano optimizes those graphs like an optimizing compiler, and then it generates C code for those graphs. We haven't put a lot of effort into optimizing the C implementations of most expressions (except for non-separable convolution), but we call fast blas and fftw functions, and our naive implementations are typically faster than equivalent numpy expressions just because they are in C.
(Although congrats to those working at optimizing numpy... it has gotten a lot faster over the last few years!)

We are now writing another backend that generates cuda runtime C++. It is just like you say: even for simple tasks like adding two vectors together or summing the elements of a matrix, there are several possible kernels that can be optimal in different circumstances. The penalty of choosing a sub-optimal kernel can be pretty high. So what ends up happening is that even for simple ufunc-type expressions, we have
- a version for when the arguments are small and everything is c-contiguous
- a general version that is typically orders of magnitude slower than the optimal choice
- versions for when arguments are small and 1D, 2D, 3D, 4D, 5D
- versions for when various of the arguments are broadcasted in different ways
- versions for when there is at least one large contiguous dimension

And the list goes on. We are still in the process of understanding the architecture and the most effective strategies for optimization. I think our design is a good one though from the users' perspective because it supports a completely opaque front-end.. you just program the symbolic graph in python using normal expressions, compile it as a function, and call it. The detail of whether it is evaluated on the CPU or the GPU (or both) is hidden.

If anyone is interested in what we're doing please feel free to send me an email. Links to these projects are

http://www.pylearn.org/theano
http://code.google.com/p/theano-cuda-ndarray/
http://code.google.com/p/cuda-ndarray/

James

--
http://www-etud.iro.umontreal.ca/~bergstrj

From michael at directaid.ca  Fri Aug 21 16:15:26 2009
From: michael at directaid.ca (Michael Cooper)
Date: Fri, 21 Aug 2009 14:15:26 -0600
Subject: [Numpy-discussion] Problems distributing NumPy
In-Reply-To:
References:
Message-ID: <74BD9F19152A44F3923369D830D620FA@polarsun>

Hi again-

I've been working on this problem off and on over the last couple of years, and of course did not find the solution until I finally broke down and posted to the list. It looks like the problem is very simple: Although I was adding my private folder to the Python path, I was not doing so until after NumPy was initialized. Hence, the library could not be found. It is, in fact, as simple as I had hoped it would be to distribute NumPy in a private folder.
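For anyone who hits the same thing, the fix boils down to making sure the path insertion happens before anything can import NumPy. In Python terms (the folder name here is just an example):

import sys
sys.path.insert(0, r"C:\MyApp\private-packages")  # must happen first
import numpy  # now resolves against the private folder

In the embedded case, the equivalent (e.g. running those two path lines via PyRun_SimpleString) just has to happen right after Py_Initialize(), before any script gets a chance to import NumPy.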
Thanks,
Michael

 _____

From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Michael Cooper
Sent: August 21, 2009 12:35 PM
To: numpy-discussion at scipy.org
Subject: [Numpy-discussion] Problems distributing NumPy

Hi all-

I am writing a C++ application with embedded Python scripting. Some of the scripts use NumPy, so I have been working out the best way to distribute NumPy with my software.

At the moment, I've got a private folder which I add to the Python path. In this folder, I include all the files which would usually get installed to the "site-packages" folder. To get these files, I've simply unzipped the distutils installer (for NumPy, I am using the "no SSE" version at the moment), and copied the contents of the resulting "PLATLIB" folder to my private folder.

For all the other libraries I am using, this method seems to work fine. However, with NumPy, if I do things this way, when I call "import_array()", it jumps back out of the calling function, skipping the rest of that function, and continues from there. If I install NumPy in the usual way, this does not happen. It seems like the NumPy initialization is failing when I install into the private folder, but not if I use the normal installer.

Many of the users of my software aren't particularly Python savvy, so having them install everything manually is not an option. I would like to avoid having my own installer call external installers, since that's confusing for some users. Finally, if possible, I would like to avoid changing the user's "Python26" folder, since I have no way of knowing what else might be relying on its contents.

Lots of searching on how to install third-party libraries has led me to the "PLATLIB" method, so I'm at a bit of a loss as to what else to try. Does anyone here know what might be going wrong? I am using Python 2.6, Boost 1.38, and NumPy 1.3.0 on a Windows XP system. The embedding program is written in C++, and compiled using Visual Studio 2005.

Thanks,
Michael
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From sturla at molden.no  Sat Aug 22 02:08:28 2009
From: sturla at molden.no (Sturla Molden)
Date: Fri, 21 Aug 2009 23:08:28 -0700
Subject: [Numpy-discussion] PRNGs and multi-threading
In-Reply-To: <4A8EF389.2080209@apstat.com>
References: <4A8F4058.80203@molden.no> <3d375d730908210912v4abed520vfc515c41858b2af6@mail.gmail.com> <4A8F67FE.7060803@molden.no> <4A8F6B08.7040802@molden.no> <3d375d730908211200p144412e5ve28b18ca39c62545@mail.gmail.com> <4A8EF389.2080209@apstat.com>
Message-ID: <4A8F8B5C.3090700@molden.no>

Xavier Saint-Mleux skrev:
> Of course, the mathematically correct way would be to use a correct
> jumpahead function, but all the implementations that I know of are GPL.
> A recent article about this is:
>
> www.iro.umontreal.ca/~lecuyer/myftp/papers/jumpmt.pdf

I know of no efficient "jumpahead" function for MT. Several seconds for 1000 jumps ahead is not impressive -- just generating the deviates is faster!

With DCMT it is easy to create "independent" MTs with smaller periods. Independence here means that the "characteristic polynomials are relatively prime to each other". A "small" period of e.g. 2**521 - 1 means that if we produce 1 billion deviates per minute, it would still take the MT about 10**143 years to cycle. Chances are we will not be around to see that happen.

It also seems that nvidia has endorsed this method:

http://developer.download.nvidia.com/compute/cuda/sdk/website/projects/MersenneTwister/doc/MersenneTwister.pdf

S.M.

From pivanov314 at gmail.com  Fri Aug 21 18:06:50 2009
From: pivanov314 at gmail.com (Paul Ivanov)
Date: Fri, 21 Aug 2009 15:06:50 -0700
Subject: [Numpy-discussion] Accelerating NumPy computations [Was: GPU Numpy]
In-Reply-To: <1e2af89e0908211151s49b20421wbea4d5b15f3f2455@mail.gmail.com>
References: <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> <4A7B5B06.2080909@molden.no> <4A7B62FC.6090504@molden.no> <1250766444.5546.40.camel@inspiron> <2d1d7fe70908210701k49fb02a1l8818779f1864bce0@mail.gmail.com> <1e2af89e0908211151s49b20421wbea4d5b15f3f2455@mail.gmail.com>
Message-ID: <20090821220650.GD8191@ykcyc>

Matthew Brett, on 2009-08-21 11:51, wrote:
> Hi,
>
> > Indeed. In the future, if OpenCL is the way to go, it may even be
> > helpful to have Numpy using OpenCL directly, as AMD provides an SDK
> > for OpenCL, and with Larrabee approaching, Intel will surely provide
> > one of its own.
>
> I was just in a lecture by one of the Intel people about OpenCL:
>
> http://parlab.eecs.berkeley.edu/bootcampagenda
> http://parlab.eecs.berkeley.edu/sites/all/parlab/files/OpenCL_Mattson.pdf
>
> He offered no schedule for an Intel OpenCL implementation, but said
> that they were committed to it.
>
> The lectures in general were effective in pointing out what a
> time-consuming effort it can be moving algorithms into the
> parallel world - including GPUs. The lecture just passed cited the
> example of a CUDA-based BLAS implementation on the GPU that was slower
> than the CPU version. Making BLAS go faster required a lot of work
> to find optimal strategies for blocking, transfer between CPU / GPU
> shared memory / GPU registers, vector sizes and so on - this on a
> specific NVIDIA architecture.
>
> I can imagine Numpy being useful for scripting in this
> C-and-assembler-centric world, making it easier to write automated
> testers, or even generate C code.
>
> Is anyone out there working on this kind of stuff? I ask only because
> there seems to be considerable interest here on the Berkeley campus.

This is exactly the sort of thing you can do with PyCUDA, which makes it so awesome! In particular, see the metaprogramming portion of the docs:

The metaprogramming section of the slides and source code from Nicolas Pinto and Andreas Klöckner's *excellent* SciPy2009 Tutorials is even more thorough:

cheers,
Paul Ivanov

From pinto at mit.edu  Fri Aug 21 20:19:13 2009
From: pinto at mit.edu (Nicolas Pinto)
Date: Fri, 21 Aug 2009 17:19:13 -0700
Subject: [Numpy-discussion] GPU Numpy
In-Reply-To:
References:
Message-ID: <954ae5aa0908211719q5f592775ia2e79000e09b7bad@mail.gmail.com>

Hello

> from gpunumpy import *
> x=zeros(100,dtype='gpufloat') # Creates an array of 100 elements on the GPU
> y=ones(100,dtype='gpufloat')
> z=exp(2*x+y) # z is on the GPU, all operations on GPU with no transfer
> z_cpu=array(z,dtype='float') # z is copied to the CPU
> i=(z>2.3).nonzero()[0] # operation on GPU, returns a CPU integer array

PyCuda already supports this through the gpuarray interface. As soon as Nvidia allows us to combine Driver and Runtime APIs, we'll be able to integrate libraries like CUBLAS, CUFFT, and any other runtime-dependent library. We could probably get access to CUBLAS/CUFFT source code as Nvidia released the 1.1 version in the past:

http://sites.google.com/site/cudaiap2009/materials-1/extras/online-resources#TOC-CUBLAS-and-CUFFT-1.1-Source-Code

but it would be easier to just use the libraries (and 1.1 is outdated now).

For those of you who are interested, we forked python-cuda recently and started to add some numpy "sugar". The goal of python-cuda is to *complement* PyCuda by providing an equivalent to the CUDA Runtime API (understand: not Pythonic) using automatically-generated ctypes bindings. With it you can use CUBLAS, CUFFT and the emulation mode (so you don't need a GPU to develop):

http://github.com/npinto/python-cuda/tree/master

HTH

Best,

--
Nicolas Pinto
Ph.D. Candidate, Brain & Computer Sciences
Massachusetts Institute of Technology, USA
http://web.mit.edu/pinto
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From stefan at sun.ac.za  Fri Aug 21 20:23:28 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Fri, 21 Aug 2009 17:23:28 -0700
Subject: [Numpy-discussion] GPU Numpy
In-Reply-To: <954ae5aa0908211719q5f592775ia2e79000e09b7bad@mail.gmail.com>
References: <954ae5aa0908211719q5f592775ia2e79000e09b7bad@mail.gmail.com>
Message-ID: <9457e7c80908211723w12da7fbctc0a62c809d46eaca@mail.gmail.com>

2009/8/21 Nicolas Pinto :
> For those of you who are interested, we forked python-cuda recently and
> started to add some numpy "sugar". The goal of python-cuda is to
> *complement* PyCuda by providing an equivalent to the CUDA Runtime API
> (understand: not Pythonic) using automatically-generated ctypes bindings.
> With it you can use CUBLAS, CUFFT and the emulation mode (so you don't need
> a GPU to develop):
> http://github.com/npinto/python-cuda/tree/master

Since you forked the project, it may be worth giving it a new name. PyCuda vs. python-cuda is bound to confuse people horribly!

Cheers
Stéfan

From pinto at mit.edu  Fri Aug 21 20:30:12 2009
From: pinto at mit.edu (Nicolas Pinto)
Date: Fri, 21 Aug 2009 17:30:12 -0700
Subject: [Numpy-discussion] GPU Numpy
In-Reply-To: <9457e7c80908211723w12da7fbctc0a62c809d46eaca@mail.gmail.com>
References: <954ae5aa0908211719q5f592775ia2e79000e09b7bad@mail.gmail.com> <9457e7c80908211723w12da7fbctc0a62c809d46eaca@mail.gmail.com>
Message-ID: <954ae5aa0908211730n7fec8daexaeb42f0ef06b30b4@mail.gmail.com>

Agreed! What would be the best name? Our package will provide non-pythonic bindings to cuda (e.g. import cuda; cuda.cudaMemcpy( ... ) ) and some numpy sugar (e.g. from cuda import sugar; sugar.fft.fftconvolve(ndarray_a, ndarray_b, 'same')). How about cuda-ctypes or ctypes-cuda? Any suggestion?

At the same time we may wait for Nvidia to unlock this Driver/Runtime issue, so we don't need this anymore.

Best,

N

2009/8/21 Stéfan van der Walt

> 2009/8/21 Nicolas Pinto :
> > For those of you who are interested, we forked python-cuda recently and
> > started to add some numpy "sugar". The goal of python-cuda is to
> > *complement* PyCuda by providing an equivalent to the CUDA Runtime API
> > (understand: not Pythonic) using automatically-generated ctypes bindings.
> > With it you can use CUBLAS, CUFFT and the emulation mode (so you don't
> need
> > a GPU to develop):
> > http://github.com/npinto/python-cuda/tree/master
>
> Since you forked the project, it may be worth giving it a new name.
> PyCuda vs. python-cuda is bound to confuse people horribly!
>
> Cheers
> Stéfan
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

--
Nicolas Pinto
Ph.D. Candidate, Brain & Computer Sciences
Massachusetts Institute of Technology, USA
http://web.mit.edu/pinto
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From sturla at molden.no  Sat Aug 22 05:50:28 2009
From: sturla at molden.no (Sturla Molden)
Date: Sat, 22 Aug 2009 02:50:28 -0700
Subject: [Numpy-discussion] Fwd: GPU Numpy
In-Reply-To:
References: <7f1eaee30908061041l2cd76f64r96e7f5c7c16a2483@mail.gmail.com> <4A7B43CE.7050509@molden.no> <7f1eaee30908061429v5d04ab77v18b37a0a177548cd@mail.gmail.com> <4A7B5B06.2080909@molden.no> <4A7B62FC.6090504@molden.no>
Message-ID: <4A8FBF64.50300@molden.no>

Erik Tollerud skrev:
>> NumPy arrays on the GPU memory is an easy task. But then I would have to
>> write the computation in OpenCL's dialect of C99?
> This is true to some extent, but also probably difficult to do given
> the fact that parallelizable algorithms are generally more difficult
> to formulate in straightforward ways.

Then you have misunderstood me completely. Creating an ndarray that has a buffer in graphics memory is not too difficult, given that graphics memory can be memory mapped. This has nothing to do with parallelizable algorithms or not. It is just memory management. We could make an ndarray subclass that quickly puts its content in a buffer accessible to the GPU. That is not difficult. But then comes the question of what you do with it.

I think many here misunderstand the issue: Teraflops peak performance of modern GPUs is impressive. But NumPy cannot easily benefit from that. In fact, there is little or nothing to gain from optimising in that end. In order for a GPU to help, computation must be the time-limiting factor. It is not. There is no more to say about using GPUs in NumPy right now.

Take a look at the timings here: http://www.scipy.org/PerformancePython It shows that computing with NumPy is more than ten times slower than using plain C. This is despite NumPy being written in C. The NumPy code does not incur 10 times more floating point operations than the C code. The floating point unit does not run in turtle mode when using NumPy. NumPy's relative slowness compared to C has nothing to do with floating point computation. It is due to inferior memory use (temporary buffers, multiple buffer traversals) and memory access being slow. Moving computation to the GPU can only make this worse.

Improved memory usage - e.g. through lazy evaluation and JIT compilation of expressions - can give up to a tenfold increase in performance. That is where we must start optimising to get a faster NumPy. Incidentally, this will also make it easier to leverage on modern GPUs.
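numexpr already demonstrates the point. As a rough sketch of the kind of comparison I mean (exact sizes and speedups depend on the machine):

import numpy as np
import numexpr as ne

a, b, c = (np.random.rand(2**22) for _ in range(3))

d1 = a*b + 2.0*c                  # NumPy: temporaries, several passes over memory
d2 = ne.evaluate("a*b + 2.0*c")   # one pass over memory, no large temporaries

The arithmetic is identical in both lines; the speed difference on large arrays is all memory traffic.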
Sturla Molden

From d_l_goldsmith at yahoo.com  Sat Aug 22 02:44:04 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Sat, 22 Aug 2009 06:44:04 +0000 (GMT)
Subject: [Numpy-discussion] Anyone coming to the sprints have a >= 1GB memory stick or blank CD?
Message-ID: <773064.54595.qm@web52108.mail.re2.yahoo.com>

If so, and I could use it to try to install Kubuntu tomorrow, I'd really appreciate it if you'd bring it w/ you. Thanks!

DG

From eadrogue at gmx.net  Sat Aug 22 07:58:44 2009
From: eadrogue at gmx.net (Ernest =?iso-8859-1?Q?Adrogu=E9?=)
Date: Sat, 22 Aug 2009 13:58:44 +0200
Subject: [Numpy-discussion] masked arrays of structured arrays
Message-ID: <20090822115844.GA6422@doriath.local>

Hi there,

Here is a structured array with 3 fields each of which has 3 fields in turn:

In [3]: desc = [('a',int), ('b',int), ('c',int)]
In [4]: desc = [('x',desc), ('y',desc), ('z',desc)]

With a regular ndarray it works just fine:

In [11]: x = np.zeros(2, dtype=desc)
In [12]: x['x']['b'] = 2
In [13]: x['x']['b']
Out[13]: array([2, 2])

However if I try the same with a masked array, it fails:

In [14]: x = np.ma.masked_all(2, dtype=desc)
In [15]: x['x']['b'] = 2
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)

/home/ernest/ in ()

/usr/lib/python2.5/site-packages/numpy/ma/core.pyc in __setitem__(self, indx, value)
   1574         if self._mask is nomask:
   1575             self._mask = make_mask_none(self.shape, self.dtype)
-> 1576         ndarray.__setitem__(self._mask, indx, getmask(value))
   1577         return
   1578     #........................................

ValueError: field named b not found.

Any idea of what the problem is?

Ernest

From joschu at caltech.edu  Sat Aug 22 11:55:03 2009
From: joschu at caltech.edu (John Schulman)
Date: Sat, 22 Aug 2009 08:55:03 -0700
Subject: [Numpy-discussion] Scipy09 Sprints
Message-ID: <185761440908220855u42bc5834kff74d776e69bb4f1@mail.gmail.com>

For the numpy/scipyers at Caltech now: I'm wondering what is happening with the scipy09 sprints. According to the website, the sprints start at 8AM in Powell Booth, but that building is locked and there's no sign of life.

Best,
John

From denis-bz-py at t-online.de  Sat Aug 22 12:03:43 2009
From: denis-bz-py at t-online.de (denis bzowy)
Date: Sat, 22 Aug 2009 16:03:43 +0000 (UTC)
Subject: [Numpy-discussion] adaptive interpolation on a regular 2d grid
Message-ID:

Folks, here's a simple adaptive interpolator; drop me a line to chat about it

adalin2( func, near, nx=300, ny=150, xstep=32, ystep=16, xrange=(0,1), yrange=(0,1), dtype=np.float, norm=abs )

Purpose: interpolate a function on a regular 2d grid: take func() where it changes rapidly, bilinear interpolate where it's smooth.

Keywords: adaptive interpolation, recursive splitting, bilinear, Python, numpy

Example:
    x,y,z = adalin2( ... )
    fig = pylab.figure()
    ax = mpl_toolkits.mplot3d.Axes3D( fig )
    X, Y = np.meshgrid( x, y )
    ax.plot_wireframe( X, Y, z, rstride=5, cstride=5 )

Out: x,y,z = adalin2( ... )
    x = linspace( xrange[0], xrange[1], nx' )  # nx' = nx + a bit, see below
    y = linspace( yrange[0], yrange[1], ny' )
    z[ny'][nx']  some func(), some interpolated values

In:
    func( x, y ): a scalar or vector function
    nx=300, ny=150: the output array z[][] is this size, plus a bit. For example, with nx=300, ny=150, xstep=32, ystep=16, z will be 161 x 321 (up to z[160][320]) so that 32 x 16 tiles fit exactly in z.
    xstep=32, ystep=16: the size of the initial coarse grid, z[::ystep, ::xstep] = func( x, y ). These must be powers of 2 (so that the recursive splitting works).
    near = .02 * fmax is either a number for absolute error, or a callable function, near( x, y, f(x,y), av4 ) -> True if near enough.
    norm: if func() is vector-valued, supply e.g. norm=np.linalg.norm
    more=1: return [x,y,z, percent func eval, ...]
How it works:
    Initially, sample func() at a coarse xstep x ystep grid: increase nx, ny to nx', ny' if need be
        z = array(( ny', nx' ))
        z[::ystep, ::xstep] = func( x, y )
    If near=infinity, just bilinear-interpolate all the other points,
    else for each xstep x ystep rectangle
        if average func( 4 corners ) is near func( midpoint )
            fill it with bilinear-interpolated values
        else
            split the rectangle on its longer dimension, recurse.

Dependencies: numpy

Notes: One can interpolate (blend, tween) just about anything: colors in a color space, or musical sounds, or curves ...

Song: Sweet Adeline

From nicolas.pinto at gmail.com  Sat Aug 22 12:12:30 2009
From: nicolas.pinto at gmail.com (Nicolas Pinto)
Date: Sat, 22 Aug 2009 09:12:30 -0700
Subject: [Numpy-discussion] Scipy09 Sprints
In-Reply-To: <185761440908220855u42bc5834kff74d776e69bb4f1@mail.gmail.com>
References: <185761440908220855u42bc5834kff74d776e69bb4f1@mail.gmail.com>
Message-ID: <954ae5aa0908220912l49635023m926179a208ac057a@mail.gmail.com>

Gael is leaving now!

On Saturday, August 22, 2009, John Schulman wrote:
> For the numpy/scipyers at Caltech now:
> I'm wondering what is happening with the scipy09 sprints. According to
> the website, the sprints start at 8AM in Powell Booth, but that
> building is locked and there's no sign of life.
> Best,
> John
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

--
Nicolas Pinto
Ph.D. Candidate, Brain & Computer Sciences
Massachusetts Institute of Technology, USA
http://web.mit.edu/pinto

From jsseabold at gmail.com  Sat Aug 22 15:34:25 2009
From: jsseabold at gmail.com (Skipper Seabold)
Date: Sat, 22 Aug 2009 15:34:25 -0400
Subject: [Numpy-discussion] Bug in NoseTester?
Message-ID:

I'm trying to define a function, so that I don't have to pass the extra_argv=["--exe"] manually to run the tests for my scikits package. To do so, I believe I need to define a function that calls

Tester(package=string_of_fullpath).test(extra_argv=["--exe"])

In nosetester.NoseTester, the docstring says that package can be a string, but if it's a string then package_path gets referenced before assignment, I believe (line 128). There needs to be another explicit test to see if it's a string and then assign?

Skipper

From stefan at sun.ac.za  Sat Aug 22 16:19:51 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Sat, 22 Aug 2009 13:19:51 -0700
Subject: [Numpy-discussion] Sprinting at SciPy2009 today and tomorrow
Message-ID: <9457e7c80908221319peab26e8we224d5a7a6d3bebf@mail.gmail.com>

Hey everyone,

The SciPy2009 sprints are underway, and you are welcome to take part! Topics include NumPy, SciPy, Mayavi, Traits, IPython, Documentation and the image processing toolbox.

Join us on irc in channel #scipy, server irc.freenode.net. The timezone here is GMT-7, and we'll be around both days from 10:00am till late.

See you there,
Stéfan

From chad.netzer at gmail.com  Sat Aug 22 16:28:23 2009
From: chad.netzer at gmail.com (Chad Netzer)
Date: Sat, 22 Aug 2009 13:28:23 -0700
Subject: [Numpy-discussion] A better median function?
In-Reply-To: <268febdf0908211208o4ff73ff1q69146b542c75321d@mail.gmail.com>
References: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com> <1e2af89e0908211133n661e0accsd48021bea6158121@mail.gmail.com> <268febdf0908211208o4ff73ff1q69146b542c75321d@mail.gmail.com>
Message-ID:

The good news is that it was trivial to adapt numpy/core/src/_sortmodule.c.src:quicksort() to do a quickselect(). When I'm back home I'll follow up with discussion on how (if at all) to expose this to numpy.median() or numpy in general.

-Chad

From martyfuhry at gmail.com  Sat Aug 22 21:15:17 2009
From: martyfuhry at gmail.com (Marty Fuhry)
Date: Sat, 22 Aug 2009 21:15:17 -0400
Subject: [Numpy-discussion] ufunc void *extra
Message-ID:

The "Beyond the Basics" manual (http://docs.scipy.org/doc/numpy/user/c-info.beyond-basics.html) indicates that the generic ufunc loop (in this example loop1d) has a void* extra (or void* data) argument that can be used to pass extra data to the ufunc, but it doesn't indicate how to use this argument.

Can anyone tell me how to use this extra argument? I'm trying to get a ufunc to perform a different operation by passing it a second parameter, but I'm not sure how to do this.

>>> some_ufunc(narray, "A")

will perform operation A on object narray

>>> some_ufunc(narray, "B")

will perform operation B on object narray

Is this even a valid operation?

-Marty Fuhry

From robert.kern at gmail.com  Sat Aug 22 21:26:28 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sat, 22 Aug 2009 18:26:28 -0700
Subject: [Numpy-discussion] ufunc void *extra
In-Reply-To:
References:
Message-ID: <3d375d730908221826i737d2a3aqafc062b07a6f020@mail.gmail.com>

On Sat, Aug 22, 2009 at 18:15, Marty Fuhry wrote:
> The "Beyond the Basics" manual
> (http://docs.scipy.org/doc/numpy/user/c-info.beyond-basics.html)
> indicates that the generic ufunc loop (in this example loop1d) has a
> void* extra (or void* data) argument that can be used to pass extra
> data to the ufunc, but it doesn't indicate how to use this argument.
>
> Can anyone tell me how to use this extra argument? I'm trying to get a
> ufunc to perform a different operation by passing it a second
> parameter, but I'm not sure how to do this.
>
>>>> some_ufunc(narray, "A")
> will perform operation A on object narray
>>>> some_ufunc(narray, "B")
> will perform operation B on object narray
>
> Is this even a valid operation?

No. This is not configurable at call-time and certainly not from Python. It is used only from the C level. In particular, it is used for things like the ufuncs in scipy.special to configure the (otherwise generic) ufuncs with the function pointer that does the actual computation.
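Schematically, the pattern looks like this (a sketch of the idea, not the actual scipy.special source):

static void loop1d_d_d(char **args, npy_intp *dimensions,
                       npy_intp *steps, void *data)
{
    /* 'data' is the extra argument: here, a pointer to the scalar kernel */
    double (*f)(double) = (double (*)(double))data;
    npy_intp i, n = dimensions[0];
    char *in = args[0], *out = args[1];

    for (i = 0; i < n; i++) {
        *(double *)out = f(*(double *)in);
        in += steps[0];
        out += steps[1];
    }
}

The void *data[] array handed to PyUFunc_FromFuncAndData() then holds one such kernel pointer per registered loop, which is how a single generic loop can serve many different functions.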
--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
 -- Umberto Eco

From nicolas.pinto at gmail.com  Sun Aug 23 03:27:28 2009
From: nicolas.pinto at gmail.com (Nicolas Pinto)
Date: Sun, 23 Aug 2009 00:27:28 -0700
Subject: [Numpy-discussion] problem with numpy.distutils and Cython
Message-ID: <954ae5aa0908230027h956568h585e7854bc05b8a7@mail.gmail.com>

Hello,

I'm trying to use numpy.distutils and Cython in a setup.py but I'm running into some problems.

The following code raises an "AttributeError: fcompiler" when I run "python setup.py install" (it runs smoothly with "python setup.py build_ext --inplace"):

from numpy.distutils.core import setup, Extension
from Cython.Distutils import build_ext
ext_modules = [Extension("test", ["test.pyx"])]
setup(cmdclass = {'build_ext': build_ext}, ext_modules = ext_modules)

Whereas the following works in both cases:

from distutils.core import setup, Extension
from Cython.Distutils import build_ext
ext_modules = [Extension("test", ["test.pyx"])]
setup(cmdclass = {'build_ext': build_ext}, ext_modules = ext_modules)

Am I missing something?

Thanks for your help.

Best,

--
Nicolas Pinto
Ph.D. Candidate, Brain & Computer Sciences
Massachusetts Institute of Technology, USA
http://web.mit.edu/pinto
-------------- next part --------------
An HTML attachment was scrubbed...
>> >> The following code raises a "AttributeError: fcompiler" when I run "python >> setup.py install" (it runs smoothly with "python setup.py build_ext >> --inplace"): >> >> from numpy.distutils.core import setup, Extension >> from Cython.Distutils import build_ext >> ext_modules = [Extension("test", ["test.pyx"])] >> setup(cmdclass = {'build_ext': build_ext}, ext_modules = ext_modules) >> >> Whereas the following works in both cases: >> >> from distutils.core import setup, Extension >> from Cython.Distutils import build_ext >> ext_modules = [Extension("test", ["test.pyx"])] >> setup(cmdclass = {'build_ext': build_ext}, ext_modules = ext_modules) >> >> Am I missing something? > > numpy.distutils needs its own build_ext, which you are overriding with > Cython's. You need one build_ext that does both things. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > ?-- Umberto Eco > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Lisandro Dalc?n --------------- Centro Internacional de M?todos Computacionales en Ingenier?a (CIMEC) Instituto de Desarrollo Tecnol?gico para la Industria Qu?mica (INTEC) Consejo Nacional de Investigaciones Cient?ficas y T?cnicas (CONICET) PTLC - G?emes 3450, (3000) Santa Fe, Argentina Tel/Fax: +54-(0)342-451.1594 From schut at sarvision.nl Mon Aug 24 09:09:23 2009 From: schut at sarvision.nl (Vincent Schut) Date: Mon, 24 Aug 2009 15:09:23 +0200 Subject: [Numpy-discussion] Removing scipy.stsci was [Re: [SciPy-dev] Deprecate chararray [was Plea for help]] In-Reply-To: <9457e7c80908211033y41e1e919le24181e277589b25@mail.gmail.com> References: <857977.74958.qm@web52106.mail.re2.yahoo.com> <812BBECE-D1E8-4699-980A-BB8FB9657CB9@stsci.edu> <5b8d13220908201204s3c74cad1pabccdce47d3a13a1@mail.gmail.com> <9457e7c80908201648r474297fo1d6ad2061c240fa6@mail.gmail.com> <88B0EA8C-5637-4050-AC4F-023C6D41BB3A@stsci.edu> <9457e7c80908211033y41e1e919le24181e277589b25@mail.gmail.com> Message-ID: St?fan van der Walt wrote: > Hi Vincent > > 2009/8/21 Vincent Schut : >> I know it probably will be a pretty involved task, as ndimage comes from >> numarray and seems to be largely implemented in C. But I really wanted >> to raise the issue now the image processing subject turns up once again, >> and hope some folks with more/better programming skills than me might >> like the idea... > > What would you like the behaviour to be? For example, how should > ndimage.zoom handle these missing values? Good question :-) I see 2 possibilities, both of them can be usefull in their own situations. Note that I am really not into splines mathematically, so my suggestions and terminology might not apply at all... 1. for any output cell that depends on a missing value in the input, return a missing/masked/NaN value, but (and I think this differs from the current implementation), for any output cell which could be calculated, return a proper output. Currently any array that contains one or more NaNs will give an output array full of NaNs (except for order=0, which shows this exact behaviour already). But maybe that's inherent to splines interpolation? This would at least allow input arrays with missing values (or masked arrays) to be used; this behaviour could be extended to many of the ndimage functions, like the kernel based stuff. 
FFT based convolutions could be another story altogether...

2. In case of zoom&co: only use the non-missing values to calculate the splines, thus effectively inter/extrapolating missing/masked values in the process. This probably raises a lot of new questions about the implementation. It would however be highly useful for me... I don't know if an irregular grid based splines interpolation implementation exists?

What I currently do in a case like this (zooming an array with missing values) is first fill the missing values by using ndimage.generic_filter with a kernel function that averages the non-missing values in the moving window. This works as long as there are not too many missing values next to each other, however it is very slow...

I think that, if an effort like this is to be made, a thorough discussion on the possible behaviours of ndimage functions with missing values should take place on one of the numpy/scipy related mailing lists. I'm sure I'm not the only one with ideas and/or use cases for this, and I'm certainly not someone with a lot of theoretical knowledge in this area.

Regards,
Vincent.

> Regards
> Stéfan

From amcmorl at gmail.com  Mon Aug 24 15:37:24 2009
From: amcmorl at gmail.com (Angus McMorland)
Date: Mon, 24 Aug 2009 15:37:24 -0400
Subject: [Numpy-discussion] Pointer to array data for passing to swig-wrapped C++
Message-ID:

Hi all,

Our lab has an in-house messaging protocol, written in C++, for interfacing the different components of our experimental setups, and allowing programs written in several different languages to talk to each other. I'm currently trying to write a Python interface to this protocol, mainly so I can show people here how good a Traits-based GUI would be for controlling things. I'm not familiar with interfacing C++ and Python code, so this is a little hit and miss, and any suggestions on better approaches would be welcome. I have wrapped the C++ code in swig, and can now call the simple routines from within Python.
-- AJC McMorland Post-doctoral research fellow Neurobiology, University of Pittsburgh From robert.kern at gmail.com Mon Aug 24 15:47:47 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 24 Aug 2009 12:47:47 -0700 Subject: [Numpy-discussion] Pointer to array data for passing to swig-wrapped C++ In-Reply-To: References: Message-ID: <3d375d730908241247v16fabafcl3cfbf4d8c78d7639@mail.gmail.com> On Mon, Aug 24, 2009 at 12:37, Angus McMorland wrote: > Hi all, > > Our lab has an in-house messaging protocol, written in C++, for > interfacing the different components of our experimental setups, and > allowing programs written in several different languages to talk to > each other. I'm currently trying to write a Python interface to this > protocol, mainly so I can show people here how good a Traits-based GUI > would be for controlling things. I'm not familiar with interfacing C++ > and Python code, so this is a little hit and miss, and any suggestions > on better approaches would be welcome. I have wrapped the C++ code in > swig, and can now call the simple routines from within Python. > > The trouble I'm having is constructing new message data, and passing a > reference to that data. Each message consists of a ID code telling us > what type of message it is, a buffer of data, of which some subset is > actually useful, and the number of bytes of the buffer that have been > filled with useful data. I can get message data out of a message > constructed and sent by some other implementation of the protocol by > reading it into a numpy array with a dtype matching the structure of > the data being sent, and calling, for example: > > np.fromstring(msg_data[0:num_bytes], dtype=[('a', int),('b', float)]) > > I can't work out how to do the reverse operation: to populate the C++ > message object with data constructed in Python. The message object has > a SetData function,which is exposed by swig, that requires a 'void *' > pointer to the data (as well as the number of bytes being sent), and I > thought I might be able to do something like: > > msg.SetData(array.data, array.nbytes) > > where array is another ndarray of the desired dtype, but that returns the error: > > TypeError: in method 'CMessage_SetData', argument 2 of type 'void *' > > and using array.tostring() in place of array.data above gives exactly > the same result. Is there a convenient syntax to pass a void * pointer > to the array's data to the swig-wrapped SetData routine, or do I have > to write some extra wrapping code to make this happen, and what might > that code look like roughly? You will need an extra typemap on SetData() to make it accept a char* on the Python side (which can be supplied as a string or preferably a buffer object) and cast it to a void*. Someone with more recent SWIG experience or the SWIG docs will have to tell you how to do that, though. It should be straightforward, though. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
 -- Umberto Eco

From numpy-discussion at maubp.freeserve.co.uk  Mon Aug 24 16:08:33 2009
From: numpy-discussion at maubp.freeserve.co.uk (Peter)
Date: Mon, 24 Aug 2009 21:08:33 +0100
Subject: [Numpy-discussion] Pointer to array data for passing to swig-wrapped C++
In-Reply-To:
References:
Message-ID: <320fb6e00908241308i5d124daew96e9d72f65c5fbcc@mail.gmail.com>

On Mon, Aug 24, 2009 at 8:37 PM, Angus McMorland wrote:
> [...] I can get message data out of a message
> constructed and sent by some other implementation of the protocol by
> reading it into a numpy array with a dtype matching the structure of
> the data being sent, and calling, for example ...

Have you considered using the Python struct module? If your "buffer of data" is a mixture of fields, this might be a better match than using numpy. See http://docs.python.org/library/struct.html

Peter

From chad.netzer at gmail.com  Tue Aug 25 02:24:22 2009
From: chad.netzer at gmail.com (Chad Netzer)
Date: Mon, 24 Aug 2009 23:24:22 -0700
Subject: [Numpy-discussion] A better median function?
In-Reply-To: <268febdf0908211208o4ff73ff1q69146b542c75321d@mail.gmail.com>
References: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com> <1e2af89e0908211133n661e0accsd48021bea6158121@mail.gmail.com> <268febdf0908211208o4ff73ff1q69146b542c75321d@mail.gmail.com>
Message-ID:

I've made some progress on this, building the tools for a faster median(). I was able to fairly easily make both nth_element() and partial_sort() types of functions by modifying numpy's quicksort; however, I wasn't that happy with their API from a python/numpy point of view.

My current plan of attack is to deliver a partition() function that basically returns an array such that elements less than the pivot(s) come first, then the pivot(s), then the elements greater than the pivot(s). The nifty trick being that the pivot can be a range of indices, so it can be used to easily implement C++'s nth_element(), partial_sort() and more. This should allow for faster medians, as well as satisfy those wanting the two values of an even-length array's median elements, rather than the average between them.
It will probably work something like this:

$ python
>>> import numpy as np
>>> a=np.array(range(9,-1,-1))
>>> a
array([9, 8, 7, 6, 5, 4, 3, 2, 1, 0])
>>> np.partition(a, 3, 7)   # partition around the indices 3-7
>>> a
array([1, 0, 2, 4, 6, 5, 3, 8, 9, 7])   # The partition is not necessarily sorted
>>> a[3:7].sort()   # But can be sub-sorted after the fact
>>> a
array([1, 0, 2, 3, 4, 5, 6, 8, 9, 7])

This partition operation can usually be expected be to faster than a full sort, depending on the length of the pivot range.

An nth_element() operation then becomes:

>>> np.partition(a, k)   # a single arg means pivot around that index
>>> a[k]

partial_sort() is as above:

>>> np.partition(a, start, end)   # using Python range notation
>>> a[start:end].sort()

odd length median is:

>>> n = a.size//2
>>> np.partition(a, n)
>>> a[n]

and the pair of elements for an even length median would be:

>>> start = a.size//2 - 1
>>> end = a.size//2 + 1
>>> np.partition(a, start, end)
>>> a[start:end].sort()
>>> l, r = a[start:end]

I'll work towards getting some code to look at up later in the week. In the meantime, anyone interested, who has feedback on the above proposal, should please respond with comments, or suggestions.

Note - I already am not sure of this proposed API using [start, end) type ranges to define the pivot range. What then would be the result of:

>>> np.partition(a, start, start)   # an "empty" range?

Probably just a no-op...

Also, it might just make sense to go with partial_sort() directly (i.e. does the partitioning and also sorts the pivot range), since that avoids a separate python call to sort() just to compute the even-median. Maybe I'll just guarantee for partitioning "short" ranges, that the pivot will be sorted after partitioning. It avoids an additional Python sort call, and is trivially fast for insertion sort to do...
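To pin down the semantics, here is a throwaway pure-Python mock-up of the proposed call (it cheats by fully sorting a copy, so it demonstrates the intended behaviour, not the speed, and the name/signature are of course still up for discussion):

import numpy as np

def partition(a, start, end=None):
    # Mock of the proposed in-place partition; pivot range is [start, end).
    if end is None:
        end = start + 1
    s = np.sort(a)
    a[:start] = s[:start]        # elements <= the pivots (real version: in some order)
    a[start:end] = s[start:end]  # the pivot range itself
    a[end:] = s[end:]            # elements >= the pivots (real version: in some order)

a = np.array(range(9, -1, -1))
partition(a, 3, 7)
print a[3:7]     # [3 4 5 6], exactly the sorted middle slice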
So what happens in the multi-files build is that the function are tagged as hidden instead of static, with hidden being __attribute__((hidden)) for gcc, nothing for MS compiler (on windows, you have to tag the exported functions, nothing is exported by default), and will break on other platforms. David From charlesr.harris at gmail.com Tue Aug 25 13:51:04 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 25 Aug 2009 11:51:04 -0600 Subject: [Numpy-discussion] c++ comments in parse_datetime.c Message-ID: Hi Travis, The new parse_datetime.c file contains a lot of c++ style comments that should be fixed. Also, the new test for mirr is failing on all the buildbots. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsseabold at gmail.com Tue Aug 25 14:03:51 2009 From: jsseabold at gmail.com (Skipper Seabold) Date: Tue, 25 Aug 2009 14:03:51 -0400 Subject: [Numpy-discussion] c++ comments in parse_datetime.c In-Reply-To: References: Message-ID: On Tue, Aug 25, 2009 at 1:59 PM, Skipper Seabold wrote: > On Tue, Aug 25, 2009 at 1:51 PM, Charles R > Harris wrote: >> Hi Travis, >> >> The new parse_datetime.c file contains a lot of c++ style comments that >> should be fixed. Also, the new test for mirr is failing on all the >> buildbots. >> >> Chuck >> > > Hi, > > For mirr it looks like the lines in the patch > > > pos = values * (values>0) > neg = values * (values<0) > > were copied to the trunk as > > pos = values > 0 > neg = values < 0 > > This is probably my fault for not submitting the patch as a diff. > Oops nevermind I didn't see that npv is now provided pos*values and neg*values. Skipper From giuseppe.aprea at gmail.com Tue Aug 25 14:07:36 2009 From: giuseppe.aprea at gmail.com (Giuseppe Aprea) Date: Tue, 25 Aug 2009 20:07:36 +0200 Subject: [Numpy-discussion] filters for rows or columns Message-ID: Hi list, I wonder if there is any smarter way to apply a filter to a 2 dimensional array than a for loop: a=array(.......) idxList=[] for i in range(0,a.shape[1]): if (some condition on a[:,i]): idxList.append(i) thanks in advance. g From giuseppe.aprea at gmail.com Tue Aug 25 14:07:36 2009 From: giuseppe.aprea at gmail.com (Giuseppe Aprea) Date: Tue, 25 Aug 2009 20:07:36 +0200 Subject: [Numpy-discussion] filters for rows or columns Message-ID: Hi list, I wonder if there is any smarter way to apply a filter to a 2 dimensional array than a for loop: a=array(.......) idxList=[] for i in range(0,a.shape[1]): if (some condition on a[:,i]): idxList.append(i) thanks in advance. g From robert.kern at gmail.com Tue Aug 25 14:13:59 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 25 Aug 2009 11:13:59 -0700 Subject: [Numpy-discussion] filters for rows or columns In-Reply-To: References: Message-ID: <3d375d730908251113n7a021577gac8f254ee010b92f@mail.gmail.com> On Tue, Aug 25, 2009 at 11:07, Giuseppe Aprea wrote: > Hi list, > > > I wonder if there is any smarter way to apply a filter to a 2 dimensional array > than a for loop: > > a=array(.......) > idxList=[] > for i in range(0,a.shape[1]): > ? ? ? if (some condition on a[:,i]): > ? ? ? ? ? ? idxList.append(i) Define a "some condition on a[:,i]" that is of interest to you, and I will show you how to do it. Roughly, you should define a function that takes 'a' and operates on it in bulk in order to get a boolean array of shape (a.shape[0],) evaluating the condition for each column. 
Then use numpy.where() on that boolean array to get indices if you actually need indices; frequently, you can just use the boolean array where you wanted the indices. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From jsseabold at gmail.com Tue Aug 25 13:59:03 2009 From: jsseabold at gmail.com (Skipper Seabold) Date: Tue, 25 Aug 2009 13:59:03 -0400 Subject: [Numpy-discussion] c++ comments in parse_datetime.c In-Reply-To: References: Message-ID: On Tue, Aug 25, 2009 at 1:51 PM, Charles R Harris wrote: > Hi Travis, > > The new parse_datetime.c file contains a lot of c++ style comments that > should be fixed. Also, the new test for mirr is failing on all the > buildbots. > > Chuck > Hi, For mirr it looks like the lines in the patch pos = values * (values>0) neg = values * (values<0) were copied to the trunk as pos = values > 0 neg = values < 0 This is probably my fault for not submitting the patch as a diff. Skipper From charlesr.harris at gmail.com Tue Aug 25 14:38:27 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 25 Aug 2009 12:38:27 -0600 Subject: [Numpy-discussion] c++ comments in parse_datetime.c In-Reply-To: References: Message-ID: On Tue, Aug 25, 2009 at 11:51 AM, Charles R Harris < charlesr.harris at gmail.com> wrote: > Hi Travis, > > The new parse_datetime.c file contains a lot of c++ style comments that > should be fixed. Also, the new test for mirr is failing on all the > buildbots. > Also, 1) There are two macros with gotos, which is a no-no. You could use inline functions instead or just write out the code where needed. 2) Multiline comments like so: /* * blah, blah */ 3) Think twice about using trailing comments. 4) The constant defines should be collected in one spot and documented. 5) Use blank lines between all function definitions. 6) Use {} around all if statement blocks 7) Format else like so: } else if (bug) { blah; } 8) The file contains hard tabs and trailing whitespace. 9) A number of lines are longer than 80 characters. Yours Truly, The Pedant ;) -------------- next part -------------- An HTML attachment was scrubbed... URL: From pgmdevlist at gmail.com Tue Aug 25 15:05:47 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Tue, 25 Aug 2009 15:05:47 -0400 Subject: [Numpy-discussion] c++ comments in parse_datetime.c In-Reply-To: References: Message-ID: <2DF15AC0-1F31-40E9-A55D-B51D75F63093@gmail.com> On Aug 25, 2009, at 1:59 PM, Skipper Seabold wrote: > On Tue, Aug 25, 2009 at 1:51 PM, Charles R > Harris wrote: >> Hi Travis, >> >> The new parse_datetime.c file contains a lot of c++ style comments >> that >> should be fixed. Also, the new test for mirr is failing on all the >> buildbots. Comments sent to Marty who wrote the parse_datetime.c as part of his GSoC: Marty, I guess you have a bit of cleaning up to do. (As a snarky side note, Marty posted on the list a few weeks ago asking just for this kind of comments... But all is well and better late than never.) 
From charlesr.harris at gmail.com Tue Aug 25 15:21:15 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 25 Aug 2009 13:21:15 -0600 Subject: [Numpy-discussion] c++ comments in parse_datetime.c In-Reply-To: <2DF15AC0-1F31-40E9-A55D-B51D75F63093@gmail.com> References: <2DF15AC0-1F31-40E9-A55D-B51D75F63093@gmail.com> Message-ID: On Tue, Aug 25, 2009 at 1:05 PM, Pierre GM wrote: > > On Aug 25, 2009, at 1:59 PM, Skipper Seabold wrote: > > > On Tue, Aug 25, 2009 at 1:51 PM, Charles R > > Harris wrote: > >> Hi Travis, > >> > >> The new parse_datetime.c file contains a lot of c++ style comments > >> that > >> should be fixed. Also, the new test for mirr is failing on all the > >> buildbots. > > Comments sent to Marty who wrote the parse_datetime.c as part of his > GSoC: Marty, I guess you have a bit of cleaning up to do. > (As a snarky side note, Marty posted on the list a few weeks ago > asking just for this kind of comments... But all is well and better > late than never.) My bad, then, I missed it. So let me add 1) Because the default compilation is to include all the files in a master file, the local defines should be undef'ed at the end to avoid namespace pollution. 2) Never do this: if (bug) return -1; or this if (bug) {blah; blah;} do it this way if (bug) { return -1; } The last is more for Travis in the most recent commit ;) Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla at molden.no Tue Aug 25 17:43:27 2009 From: sturla at molden.no (Sturla Molden) Date: Tue, 25 Aug 2009 23:43:27 +0200 Subject: [Numpy-discussion] A better median function? In-Reply-To: References: <268febdf0908210847k227315a6u564b50ffb8edf750@mail.gmail.com> <1e2af89e0908211133n661e0accsd48021bea6158121@mail.gmail.com> <268febdf0908211208o4ff73ff1q69146b542c75321d@mail.gmail.com> Message-ID: <4A945AFF.5060405@molden.no> Chad Netzer skrev: > My current plan of attack is to deliver a partition() function that > basically returns an array such that elements less than the pivot(s) > come first, then the pivot(s), then the elements greater than the > pivot(s). I'm actually trying to write a fast median replacement myself. I was thinking in the same lines, except I don't store those two arrays. I just keep track of counts in them. For the even case, I also keep track the elements closest to the pivot (smaller and bigger). It's incredibly simple actually. So lets see who gets there first :-) Sturla Molden From giuseppe.aprea at gmail.com Tue Aug 25 19:18:11 2009 From: giuseppe.aprea at gmail.com (Giuseppe Aprea) Date: Wed, 26 Aug 2009 01:18:11 +0200 Subject: [Numpy-discussion] filters for rows or columns In-Reply-To: <3d375d730908251113n7a021577gac8f254ee010b92f@mail.gmail.com> References: <3d375d730908251113n7a021577gac8f254ee010b92f@mail.gmail.com> Message-ID: On Tue, Aug 25, 2009 at 8:13 PM, Robert Kern wrote: > On Tue, Aug 25, 2009 at 11:07, Giuseppe Aprea wrote: >> Hi list, >> >> >> I wonder if there is any smarter way to apply a filter to a 2 dimensional array >> than a for loop: >> >> a=array(.......) >> idxList=[] >> for i in range(0,a.shape[1]): >> ? ? ? if (some condition on a[:,i]): >> ? ? ? ? ? ? idxList.append(i) > > Define a "some condition on a[:,i]" that is of interest to you, and I > will show you how to do it. Roughly, you should define a function that > takes 'a' and operates on it in bulk in order to get a boolean array > of shape (a.shape[0],) evaluating the condition for each column. 
> use numpy.where() on that boolean array to get indices if you actually
> need indices; frequently, you can just use the boolean array where you
> wanted the indices.
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>  -- Umberto Eco
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

Hi, I would like to do something like this

a=array([[1,2,3,4],[5,6,7,8],[4,5,6,0]])
idxList=[]
for i in range(0,a.shape[1]):
    if len(nonzero(a[:,i])[0])==1:   #want to extract column indices
of those columns which only have one non vanishing element
        idxList.append(i)

I already used where on a 1D array but I don't know if there is some
function or some kind of syntax which allows you to evaluate a condition
for each column(row).

regards

g

From robert.kern at gmail.com  Tue Aug 25 19:26:10 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 25 Aug 2009 16:26:10 -0700
Subject: [Numpy-discussion] filters for rows or columns
In-Reply-To: 
References: <3d375d730908251113n7a021577gac8f254ee010b92f@mail.gmail.com>
Message-ID: <3d375d730908251626v4d88a51fpd6e53ecb041da100@mail.gmail.com>

On Tue, Aug 25, 2009 at 16:18, Giuseppe Aprea wrote:
> Hi, I would like to do something like this
>
> a=array([[1,2,3,4],[5,6,7,8],[4,5,6,0]])
> idxList=[]
> for i in range(0,a.shape[1]):
>     if len(nonzero(a[:,i])[0])==1:   #want to extract column indices
> of those columns which only have one non vanishing element
>         idxList.append(i)
>
> I already used where on a 1D array but I don't know if there is some
> function or some kind of syntax which allows you to evaluate a condition
> for each column(row).

column_mask = ((a != 0).sum(axis=1) == 1)
idxArray = np.nonzero(column_mask)[0]  # if you must

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco
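For Giuseppe's per-column case the sum runs over the rows, i.e. axis=0
(a rough check of the idiom from a quick session):

>>> import numpy as np
>>> a = np.array([[1,2,3,4],[5,6,7,8],[4,5,6,0]])
>>> (a != 0).sum(axis=0)        # non-vanishing entries in each column
array([3, 3, 3, 2])
>>> idx = np.nonzero((a != 0).sum(axis=0) == 1)[0]
>>> len(idx)                    # no column of a has exactly one nonzero
0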
From charlesr.harris at gmail.com  Tue Aug 25 22:44:41 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 25 Aug 2009 20:44:41 -0600
Subject: [Numpy-discussion] Quick question on NumPy builds vs. Cython
In-Reply-To: <5b8d13220908251030v5110efccqbfadf616e19b6ff4@mail.gmail.com>
References: <4A941D31.6050103@student.matnat.uio.no>
	<5b8d13220908251030v5110efccqbfadf616e19b6ff4@mail.gmail.com>
Message-ID: 

On Tue, Aug 25, 2009 at 11:30 AM, David Cournapeau wrote:

> Hi Dag,
>
> On Tue, Aug 25, 2009 at 12:19 PM, Dag Sverre
> Seljebotn wrote:
> > [Let me know if this should go to numpy-discuss instead.]
>
> I guess this can be discussed on the ML as well (I CC to the list).
>
> > I see that there are currently two modes, and that it is possible to build
> > NumPy using a master .c-file #include-ing the rest. (Which is much more
> > difficult to support using Cython, though not impossible.)
> >
> > Is there any plans for the one-file build to go away, or is supporting this
> > a requirement?
>
> This is a requirement, as supporting this depends on non standard
> compilers extensions (that's why it is not the default - but it works
> well, I am always using this mode when working on numpy since the
> build/test/debug cycle is so much shorter with numscons and this).
>
> The basic problem is as follows:
>  - On Unix at least, a function is exported in a shared library by default.
>  - The usual way to avoid polluting the namespace is to put static in
> front of it
>  - You can't reuse a static function in another compilation unit
> (there is no "friend static").
>
> So what happens in the multi-files build is that the function are
> tagged as hidden instead of static, with hidden being
> __attribute__((hidden)) for gcc, nothing for MS compiler (on windows,
> you have to tag the exported functions, nothing is exported by
> default), and will break on other platforms.
>

Does the build actually break or is it the case that a lot of extraneous
names become visible, increasing the module size and exposing functions
that we don't want anyone to access?

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From charlesr.harris at gmail.com  Tue Aug 25 23:38:48 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 25 Aug 2009 21:38:48 -0600
Subject: [Numpy-discussion] mirr test correctly fails for given input.
Message-ID: 

So is it a bug in the test or a bug in the implementation? The problem is
that the slice values[1:], when values = [-120000,39000,30000,21000,37000,46000],
contains no negative number and a nan is returned. This looks like a bug in
the test. The documentation also probably needs fixing.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From josef.pktd at gmail.com  Wed Aug 26 01:45:53 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 26 Aug 2009 01:45:53 -0400
Subject: [Numpy-discussion] mirr test correctly fails for given input.
In-Reply-To: 
References: 
Message-ID: <1cd32cbb0908252245r273b4ff5w59706c22b1b42edd@mail.gmail.com>

On Tue, Aug 25, 2009 at 11:38 PM, Charles R Harris wrote:
> So is it a bug in the test or a bug in the implementation? The problem is
> that the slice values[1:], when
> values = [-120000,39000,30000,21000,37000,46000], contains no negative
> number and a nan is returned. This looks like a bug in the test. The
> documentation also probably needs fixing.
>
> Chuck

There is a bug in the code, the nan is incorrectly raised. After
correcting the nan (checking on the original, instead of shortened,
values), I got one failing test, which I corrected with the matching
number from Openoffice.

(The main reason the function is more complicated than necessary is
that np.npv doesn't allow the inclusion of the investment in the
initial period.)

This needs reviewing, since it's late here.

Josef


import numpy as np
from numpy.testing import assert_almost_equal, assert_

from numpy import npv

def mirr(values, finance_rate, reinvest_rate):
    """
    Modified internal rate of return.

    Parameters
    ----------
    values : array_like
        Cash flows (must contain at least one positive and one negative value)
        or nan is returned.
    finance_rate : scalar
        Interest rate paid on the cash flows
    reinvest_rate : scalar
        Interest rate received on the cash flows upon reinvestment

    Returns
    -------
    out : float
        Modified internal rate of return

    """

    values = np.asarray(values, dtype=np.double)
    initial = values[0]
    values1 = values[1:]
    n = values1.size
    pos = values1 > 0
    neg = values1 < 0
    if not (np.sum(values[values>0]) > 0 and np.sum(values[values<0]) < 0):
        return np.nan
    numer = np.abs(npv(reinvest_rate, values1*pos))
    denom = np.abs(npv(finance_rate, values1*neg))
    if initial > 0:
        return ((initial + numer) / denom)**(1.0/n)*(1 + reinvest_rate) - 1
    else:
        return ((numer / (-initial + denom)))**(1.0/n)*(1 + reinvest_rate) - 1


#tests from testsuite and Skipper plus isnan test

v1 = [-4500,-800,800,800,600,600,800,800,700,3000]
print mirr(v1,0.08,0.055)
assert_almost_equal(mirr(v1,0.08,0.055),
                    0.0666, 4)

#incorrect test ? corrected
v2 = [-120000,39000,30000,21000,37000,46000]
print mirr(v2,0.10,0.12)
assert_almost_equal(mirr(v2,0.10,0.12), 0.126094, 6)  # corrected from OO

v2 = [39000,30000,21000,37000,46000]
assert_(np.isnan(mirr(v2,0.10,0.12)))

v3 = [100,200,-50,300,-200]
print mirr(v3,0.05,0.06)
assert_almost_equal(mirr(v3,0.05,0.06), 0.3428, 4)

#--------------
print mirr([100, 200, -50, 300, -200], .05, .06)
assert_almost_equal(mirr((100, 200,-50, 300,-200), .05, .06),
                    0.342823387842, 4)

V2 = [-4500,-800,800,800,600,600,800,800,700,3000]
print mirr(V2, 0.08, 0.055)
assert_almost_equal(mirr(V2, 0.08, 0.055), 0.06659718, 4)

From cournape at gmail.com  Wed Aug 26 02:14:21 2009
From: cournape at gmail.com (David Cournapeau)
Date: Wed, 26 Aug 2009 01:14:21 -0500
Subject: [Numpy-discussion] Quick question on NumPy builds vs. Cython
In-Reply-To: 
References: <4A941D31.6050103@student.matnat.uio.no>
	<5b8d13220908251030v5110efccqbfadf616e19b6ff4@mail.gmail.com>
Message-ID: <5b8d13220908252314q5df44a8aj7e66193297bcc956@mail.gmail.com>

On Tue, Aug 25, 2009 at 9:44 PM, Charles R Harris wrote:
> On Tue, Aug 25, 2009 at 11:30 AM, David Cournapeau wrote:
>> This is a requirement, as supporting this depends on non standard
>> compilers extensions (that's why it is not the default - but it works
>> well, I am always using this mode when working on numpy since the
>> build/test/debug cycle is so much shorter with numscons and this).
>> [...]
>> So what happens in the multi-files build is that the function are
>> tagged as hidden instead of static, with hidden being
>> __attribute__((hidden)) for gcc, nothing for MS compiler (on windows,
>> you have to tag the exported functions, nothing is exported by
>> default), and will break on other platforms.
>
> Does the build actually break or is it the case that a lot of extraneous
> names become visible, increasing the module size and exposing functions
> that we don't want anyone to access?

I think the latter. The number of exported symbols is pretty high, though.

cheers,

David

From giuseppe.aprea at gmail.com  Wed Aug 26 03:21:13 2009
From: giuseppe.aprea at gmail.com (Giuseppe Aprea)
Date: Wed, 26 Aug 2009 09:21:13 +0200
Subject: [Numpy-discussion] filters for rows or columns
In-Reply-To: <3d375d730908251626v4d88a51fpd6e53ecb041da100@mail.gmail.com>
References: <3d375d730908251113n7a021577gac8f254ee010b92f@mail.gmail.com>
	<3d375d730908251626v4d88a51fpd6e53ecb041da100@mail.gmail.com>
Message-ID: 

On Wed, Aug 26, 2009 at 1:26 AM, Robert Kern wrote:
> On Tue, Aug 25, 2009 at 16:18, Giuseppe Aprea wrote:
>> Hi, I would like to do something like this
>> [...]
>
> column_mask = ((a != 0).sum(axis=1) == 1)
> idxArray = np.nonzero(column_mask)[0]  # if you must
>
> --
> Robert Kern

That's interesting. Thanks a lot! In my case that becomes:

column_mask = ((a != 0).sum(axis=0) == 1)
idxArray = np.nonzero(column_mask)[0]

cheers

g

From jsseabold at gmail.com  Wed Aug 26 10:08:47 2009
From: jsseabold at gmail.com (Skipper Seabold)
Date: Wed, 26 Aug 2009 10:08:47 -0400
Subject: [Numpy-discussion] mirr test correctly fails for given input.
In-Reply-To: <1cd32cbb0908252245r273b4ff5w59706c22b1b42edd@mail.gmail.com>
References: <1cd32cbb0908252245r273b4ff5w59706c22b1b42edd@mail.gmail.com>
Message-ID: 

On Wed, Aug 26, 2009 at 1:45 AM, wrote:
> There is a bug in the code, the nan is incorrectly raised. After
> correcting the nan (checking on the original, instead of shortened,
> values), I got one failing test, which I corrected with the matching
> number from Openoffice.
> [... full corrected mirr() and tests quoted ...]
>
> #incorrect test ? corrected
> v2 = [-120000,39000,30000,21000,37000,46000]
> print mirr(v2,0.10,0.12)
> assert_almost_equal(mirr(v2,0.10,0.12), 0.126094, 6)  # corrected from OO

Yes, the value in the tests that this v2 tests against is wrong.  It
was the value returned by the old mirr but not excel or oocalc.  This
is the correct one.  I noted it in my patch, but it was hard to catch
since I didn't supply a diff.  Now, I know...

Skipper

From josef.pktd at gmail.com  Wed Aug 26 11:25:35 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 26 Aug 2009 11:25:35 -0400
Subject: [Numpy-discussion] mirr test correctly fails for given input.
In-Reply-To: 
References: <1cd32cbb0908252245r273b4ff5w59706c22b1b42edd@mail.gmail.com>
Message-ID: <1cd32cbb0908260825o6a93efc7ne2f061e90a4464ca@mail.gmail.com>

On Wed, Aug 26, 2009 at 10:08 AM, Skipper Seabold wrote:
> Yes, the value in the tests that this v2 tests against is wrong.  It
> was the value returned by the old mirr but not excel or oocalc.  This
> is the correct one.
> [...]

Here is a shortened version, that uses Skipper's corrections, but
avoids splitting the values array, by working around npv not starting
with the initial investment. It passes the same tests as the corrected
version.

Josef

def mirr(values, finance_rate, reinvest_rate):
    values = np.asarray(values, dtype=np.double)
    n = values.size
    pos = values > 0
    neg = values < 0
    if not (pos.any() and neg.any()):
        return np.nan

    numer = np.abs(npv(reinvest_rate, values*pos)) * (1 + reinvest_rate)
    denom = np.abs(npv(finance_rate, values*neg)) * (1 + finance_rate)
    return (numer / denom)**(1.0/(n-1)) * (1 + reinvest_rate) - 1
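As a cross-check (a rough sketch, not from the test suite): writing the
textbook MIRR definition out by hand — future value of the inflows at
the reinvestment rate over the present value of the outflows at the
finance rate — reproduces the OpenOffice number for the v2 case:

import numpy as np

values = np.array([-120000, 39000, 30000, 21000, 37000, 46000], dtype=float)
rr, fr = 0.12, 0.10
n = values.size
t = np.arange(n)
pos = values * (values > 0)
neg = values * (values < 0)
fv_pos = np.sum(pos * (1 + rr)**(n - 1 - t))   # compound inflows forward
pv_neg = np.sum(neg / (1 + fr)**t)             # discount outflows back
print (fv_pos / -pv_neg)**(1.0/(n - 1)) - 1    # ~0.126094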
From josef.pktd at gmail.com  Wed Aug 26 11:44:41 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 26 Aug 2009 11:44:41 -0400
Subject: [Numpy-discussion] mirr test correctly fails for given input.
In-Reply-To: <1cd32cbb0908260825o6a93efc7ne2f061e90a4464ca@mail.gmail.com>
References: <1cd32cbb0908252245r273b4ff5w59706c22b1b42edd@mail.gmail.com>
	<1cd32cbb0908260825o6a93efc7ne2f061e90a4464ca@mail.gmail.com>
Message-ID: <1cd32cbb0908260844i54ea57cbn6e941bed4c167e47@mail.gmail.com>

On Wed, Aug 26, 2009 at 11:25 AM, wrote:
> Here is a shortened version, that uses Skipper's corrections, but
> avoids splitting the values array, by working around npv not starting
> with the initial investment. It passes the same tests as the corrected
> version.
> [... shortened mirr() quoted ...]

a comment on the function

From a theoretical perspective returning nan wouldn't be necessary.
The rate of return would be well defined:

-1: you only pay and get nothing back (you lose 100%)
inf: you only receive and have nothing to pay
0/0 = nan: you don't pay and you get nothing back

for practical purposes, the nan might signal better that the user
might have made a mistake

Josef

From charlesr.harris at gmail.com  Wed Aug 26 16:10:26 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 26 Aug 2009 14:10:26 -0600
Subject: [Numpy-discussion] mirr test correctly fails for given input.
In-Reply-To: <1cd32cbb0908260844i54ea57cbn6e941bed4c167e47@mail.gmail.com>
References: <1cd32cbb0908252245r273b4ff5w59706c22b1b42edd@mail.gmail.com>
	<1cd32cbb0908260825o6a93efc7ne2f061e90a4464ca@mail.gmail.com>
	<1cd32cbb0908260844i54ea57cbn6e941bed4c167e47@mail.gmail.com>
Message-ID: 

Fixes applied in r7324. Thanks guys.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From mark.wendell at gmail.com  Wed Aug 26 21:48:46 2009
From: mark.wendell at gmail.com (Mark Wendell)
Date: Wed, 26 Aug 2009 19:48:46 -0600
Subject: [Numpy-discussion] array is not writable
Message-ID: 

Hi all - I'm playing with editing image data converted from PIL objects,
and running into a situation where numpy tells me that an 'array is not
writable'. Not sure I understand what that means, or how to get around it.
Here's a sample interactive session:

>>> import Image
>>> import numpy as np
>>> im = Image.open("rgb.0001.jpg")
>>> a = np.asarray(im)
>>> a.shape
(512, 512, 3)
>>> a.dtype
dtype('uint8')
>>> a[0,0,0]
254
>>> a[0,0,0] = 10
Traceback (most recent call last):
  File "", line 1, in
RuntimeError: array is not writeable

Any help appreciated. Thanks,
Mark

PIL 1.1.6
numpy 1.2.1
Ubuntu 9.04

--
--
Mark Wendell
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From mark.wendell at gmail.com  Wed Aug 26 23:49:11 2009
From: mark.wendell at gmail.com (Mark Wendell)
Date: Wed, 26 Aug 2009 21:49:11 -0600
Subject: [Numpy-discussion] array is not writable
In-Reply-To: 
References: 
Message-ID: 

Figured this much out: if I do an np.copy of the original array to a
new array, then I can edit individual 'color' values with impunity. So
I guess the original array from the pil object still shares memory
with that image object somehow, making it unwritable?

thanks
Mark

On Wed, Aug 26, 2009 at 7:48 PM, Mark Wendell wrote:
> Hi all - I'm playing with editing image data converted from PIL objects,
> and running into a situation where numpy tells me that an 'array is not
> writable'.
> [...]

--
--
Mark Wendell

From jsseabold at gmail.com  Thu Aug 27 00:04:00 2009
From: jsseabold at gmail.com (Skipper Seabold)
Date: Thu, 27 Aug 2009 00:04:00 -0400
Subject: [Numpy-discussion] array is not writable
In-Reply-To: 
References: 
Message-ID: 

On Wed, Aug 26, 2009 at 11:49 PM, Mark Wendell wrote:
> Figured this much out: if I do an np.copy of the original array to a
> new array, then I can edit individual 'color' values with impunity. So
> I guess the original array from the pil object still shares memory
> with that image object somehow, making it unwritable?
> [...]

Hi Mark,

I don't really know the specifics of why your array isn't writeable
(someone will have a better answer and will correct me if I'm wrong),
but you could try to check if it's writeable by doing:

>>> a.flags # or a.flags.writeable

And check the value of writeable.  It might be as simple as setting
a.flags.writeable = True, but then again there might be a good reason
why it's unwriteable.  I don't really know.
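Something like this is what I'd expect for the PIL case (a rough sketch
from a similar session, not your exact data):

>>> a = np.asarray(im)
>>> a.flags.writeable
False
>>> b = a.copy()            # same as np.copy(a); owns its own memory
>>> b.flags.writeable
True
>>> b[0,0,0] = 10           # fine on the copy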
Skipper

From robert.kern at gmail.com  Thu Aug 27 00:06:25 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 26 Aug 2009 21:06:25 -0700
Subject: [Numpy-discussion] array is not writable
In-Reply-To: 
References: 
Message-ID: <3d375d730908262106k6e1a6b8j86df74fafc63a2cf@mail.gmail.com>

On Wed, Aug 26, 2009 at 20:49, Mark Wendell wrote:
> Figured this much out: if I do an np.copy of the original array to a
> new array, then I can edit individual 'color' values with impunity. So
> I guess the original array from the pil object still shares memory
> with that image object somehow, making it unwritable?

Most likely.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco

From dwf at cs.toronto.edu  Thu Aug 27 00:34:04 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Thu, 27 Aug 2009 00:34:04 -0400
Subject: [Numpy-discussion] array is not writable
In-Reply-To: 
References: 
Message-ID: 

On 26-Aug-09, at 11:49 PM, Mark Wendell wrote:

> Figured this much out: if I do an np.copy of the original array to a
> new array, then I can edit individual 'color' values with impunity.
> [...]

I'm going to guess that it's because PIL is still responsible for that
memory, not NumPy. I don't really know how this stuff works but
asarray() would just give you a view onto that chunk of memory; since
NumPy didn't allocate it, it probably doesn't want to modify it. Not
sure if you can get away with not making a copy in this situation.

David

From charlesr.harris at gmail.com  Thu Aug 27 00:43:13 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 26 Aug 2009 22:43:13 -0600
Subject: [Numpy-discussion] array is not writable
In-Reply-To: 
References: 
Message-ID: 

On Wed, Aug 26, 2009 at 10:34 PM, David Warde-Farley wrote:

> I'm going to guess that it's because PIL is still responsible for that
> memory, not NumPy.
> [...]
> Not sure if you can get away with not making a copy in this situation.
>

I expect you can. It's the unreadable arrays that are a problem...

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From timmichelsen at gmx-topmail.de  Thu Aug 27 08:07:12 2009
From: timmichelsen at gmx-topmail.de (Tim Michelsen)
Date: Thu, 27 Aug 2009 12:07:12 +0000 (UTC)
Subject: [Numpy-discussion] histogram: sum up values in each bin
Message-ID: 

Hello,
I need some advice on histograms.
If I interpret the documentation [1, 2] for numpy.histogram correctly, the
result of the function is a count of the occurrences sorted into each bin.

(n, bins) = numpy.histogram(v, bins=50, normed=1)

But how can I apply another function on these values stacked in each bin?
Like summing them up or building averages?
Thanks,
Timmie

[1] http://docs.scipy.org/doc/numpy/reference/generated/numpy.histogram.html
[2]
http://www.scipy.org/Tentative_NumPy_Tutorial#head-aa75ec76530ff51a2e98071adb7224a4b793519e

From baker.alexander at gmail.com  Thu Aug 27 08:23:45 2009
From: baker.alexander at gmail.com (alexander baker)
Date: Thu, 27 Aug 2009 13:23:45 +0100
Subject: [Numpy-discussion] histogram: sum up values in each bin
In-Reply-To: 
References: 
Message-ID: <270620220908270523h268998fcmd617f9557049b9ab@mail.gmail.com>

Here is an example, this does something extra at the end but shows how
the bins can be used.

Regards

Alex Baker.

from scipy.stats import norm
r = norm.rvs(size=10000)

import numpy as np
p, bins = np.histogram(r, bins=50, normed=True)  # 'width' was undefined
                                                 # here; a bin count seems
                                                 # intended
db = bins[1]-bins[0]
cdf = np.cumsum(p*db)

from pylab import figure, show
fig = figure()
ax = fig.add_subplot(111)
ax.bar(bins[:-1], cdf, width=0.8*db)
show()

o = []
rates = []
for r in np.arange(0, max(bins), db):
    G = max(np.cumsum([bin for bin in bins if bin > r]))
    L = min(np.cumsum([bin for bin in bins if bin < r]))
    o.append(abs(G/L))
    rates.append(r)

Mobile: 07788 872118
Blog: www.alexfb.com

--
All science is either physics or stamp collecting.

2009/8/27 Tim Michelsen
> Hello,
> I need some advice on histograms.
> [...]
> But how can I apply another function on these values stacked in each bin?
> Like summing them up or building averages?

From josef.pktd at gmail.com  Thu Aug 27 09:19:15 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Thu, 27 Aug 2009 09:19:15 -0400
Subject: [Numpy-discussion] histogram: sum up values in each bin
In-Reply-To: <270620220908270523h268998fcmd617f9557049b9ab@mail.gmail.com>
References: 
Message-ID: <1cd32cbb0908270619t509c4b16je8427104eace3f82@mail.gmail.com>

On Thu, Aug 27, 2009 at 8:23 AM, alexander baker wrote:
> Here is an example, this does something extra at the end but shows how
> the bins can be used.
> [... example and Tim's original question quoted ...]
Tim, do you mean that you want to apply other functions, e.g. mean or
variance, to the original values but calculated per bin?

If I read the answer of Alex correctly, then it only works with the
bin count.

To calculate e.g. the variance of all values per bin, I think, the
easiest would be to create a label array, with values arange(nbins-1)
for the corresponding original data, and then use np.bincount.

I don't know straight away what the easiest or fastest way is to
create the label array from the histogram bin boundaries.

Josef

From schut at sarvision.nl  Thu Aug 27 09:23:15 2009
From: schut at sarvision.nl (Vincent Schut)
Date: Thu, 27 Aug 2009 15:23:15 +0200
Subject: [Numpy-discussion] histogram: sum up values in each bin
In-Reply-To: 
References: 
Message-ID: 

Tim Michelsen wrote:
> Hello,
> I need some advice on histograms.
> [...]
> But how can I apply another function on these values stacked in each bin?
> Like summing them up or building averages?

Hi Tim,

If you just want to sum and/or average (= sum / count), you can shortcut
this using the weights parameter of numpy.histogram, e.g. something like:

data = numpy.random.random((100,))
countsPerBin, binEdges = numpy.histogram(data)   # histogram returns
sumsPerBin, _ = numpy.histogram(data, weights=data)  # (hist, edges)
averagePerBin = sumsPerBin / countsPerBin

Regards,
Vincent.

From josef.pktd at gmail.com  Thu Aug 27 09:42:36 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Thu, 27 Aug 2009 09:42:36 -0400
Subject: [Numpy-discussion] histogram: sum up values in each bin
In-Reply-To: 
References: 
Message-ID: <1cd32cbb0908270642l1c71e409k13c23afe973b5eb8@mail.gmail.com>

On Thu, Aug 27, 2009 at 9:23 AM, Vincent Schut wrote:
> If you just want to sum and/or average (= sum / count), you can shortcut
> this using the weights parameter of numpy.histogram, e.g. something like:
>
> data = numpy.random.random((100,))
> countsPerBin, binEdges = numpy.histogram(data)
> sumsPerBin, _ = numpy.histogram(data, weights=data)
> averagePerBin = sumsPerBin / countsPerBin
>
> Regards,
> Vincent.

Thanks for the pointer, I didn't realize that histogram also has a
weights argument.

It would be interesting to know what the overhead is for repeated
calls to histogram for larger arrays, if anyone knows.
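For the label array route I mentioned, something like this might work
(a rough sketch, untested; np.digitize is 1-based against the edges,
and empty bins would need a guard against zero division):

import numpy as np

data = np.random.random(1000)
counts, edges = np.histogram(data, bins=10)
# bin index of each original value; clip puts the right edge into the
# last bin
labels = np.clip(np.digitize(data, edges) - 1, 0, len(edges) - 2)
sums = np.bincount(labels, weights=data)
means = sums / np.bincount(labels)   # per-bin averages in a single pass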
Josef

> >> Thanks,
> >> Timmie
> [...]

From timmichelsen at gmx-topmail.de  Thu Aug 27 12:49:23 2009
From: timmichelsen at gmx-topmail.de (Tim Michelsen)
Date: Thu, 27 Aug 2009 16:49:23 +0000 (UTC)
Subject: [Numpy-discussion] histogram: sum up values in each bin
References: <270620220908270523h268998fcmd617f9557049b9ab@mail.gmail.com>
	<1cd32cbb0908270619t509c4b16je8427104eace3f82@mail.gmail.com>
Message-ID: 

> Tim, do you mean that you want to apply other functions, e.g. mean or
> variance, to the original values but calculated per bin?
Sorry that I forgot to add this. Shame.

I would like to apply these mathematical functions on the original values
stacked in the respective bins.

For instance:

The sample data measures the weight of an animal.

1) A histogram gives a count of how many values are in each bin.

I would like to calculate the average weight of all animals
sorted into bin1, bin2 etc.

This is also useful where you have a time component.

In spreadsheets I would use a '=' to reference the original data and then
either sum it up or count it per class.

I hope this is somehow understandable.

Thanks,
Timmie

From jackchungchiehyu at googlemail.com  Thu Aug 27 13:00:25 2009
From: jackchungchiehyu at googlemail.com (Jack Yu)
Date: Thu, 27 Aug 2009 18:00:25 +0100
Subject: [Numpy-discussion] linalg svd illegal instruction
Message-ID: 

Hi all,

I am having trouble using the function numpy.linalg.svd().  It works fine
on my personal computer.  However, when I use it on a cluster at
university, it returns 'Illegal Instruction' when the input matrix is
complex.  Is this function meant to work on a complex array?  If so, what
could be the cause of the illegal instruction message?

The version of numpy is 1.2.1.  I have tried calling it via both
numpy.linalg.svd(), and pylab.svd().

Thanks in advance for any help,
Jack Yu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From robert.kern at gmail.com  Thu Aug 27 13:03:37 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 27 Aug 2009 10:03:37 -0700
Subject: [Numpy-discussion] linalg svd illegal instruction
In-Reply-To: 
References: 
Message-ID: <3d375d730908271003u5fa79f2ei6fa568dabac3dce8@mail.gmail.com>

On Thu, Aug 27, 2009 at 10:00, Jack Yu wrote:
> I am having trouble using the function numpy.linalg.svd().  It works fine
> on my personal computer.  However, when I use it on a cluster at
> university, it returns 'Illegal Instruction' when the input matrix is
> complex.  Is this function meant to work on a complex array?

Yes.

> If so, what could be the cause
> of the illegal instruction message?

The numpy installed on your cluster was linked against a build of
ATLAS that was not configured correctly for the CPU it is running on.

> The version of numpy is 1.2.1.  I have tried calling it via both
> numpy.linalg.svd(), and pylab.svd().

The latter just calls the former.
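A rough way to confirm which BLAS/LAPACK/ATLAS libraries a given numpy
build picked up (the exact output depends on the build):

>>> import numpy
>>> numpy.show_config()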
--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco

From josef.pktd at gmail.com  Thu Aug 27 13:27:29 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Thu, 27 Aug 2009 13:27:29 -0400
Subject: [Numpy-discussion] histogram: sum up values in each bin
In-Reply-To: 
References: <270620220908270523h268998fcmd617f9557049b9ab@mail.gmail.com>
	<1cd32cbb0908270619t509c4b16je8427104eace3f82@mail.gmail.com>
Message-ID: <1cd32cbb0908271027l75916349jf978ae21ab9c43d4@mail.gmail.com>

On Thu, Aug 27, 2009 at 12:49 PM, Tim Michelsen wrote:
> I would like to apply these mathematical functions on the original values
> stacked in the respective bins.
> [...]
> I hope this is somehow understandable.

Yes, it is a quite common use case for descriptive statistics, and I'm
starting to collect different ways of doing it.

In your case, Vincent's way is the easiest. If you need to be faster,
or you want to apply the same classification also to other variables,
e.g. size of the animal, then creating a label array would be a more
flexible solution.

There was a similar thread recently on the scipy-user list for sorted
arrays: "How to average different pieces or an array?"

Josef

From charlesr.harris at gmail.com  Thu Aug 27 14:24:49 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 27 Aug 2009 12:24:49 -0600
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
Message-ID: 

I'm thinking double. There is a potential loss of precision for 64 bit
ints but nothing else seems reasonable for a default. Thoughts?

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From nadavh at visionsense.com  Thu Aug 27 14:32:10 2009
From: nadavh at visionsense.com (Nadav Horesh)
Date: Thu, 27 Aug 2009 21:32:10 +0300
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
References: 
Message-ID: <710F2847B0018641891D9A21602763605AD12E@ex3.envision.co.il>

Double is the natural choice, there is a possibility of long double
(float96 on x86 or float128 on amd64) where there is no precision loss.
Is this option portable?

  Nadav

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Charles R Harris
Sent: Thu 27-Aug-09 21:24
To: numpy-discussion
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?

I'm thinking double. There is a potential loss of precision for 64 bit
ints but nothing else seems reasonable for a default. Thoughts?

Chuck
-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3216 bytes
Desc: not available
URL: 
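(For concreteness, the precision loss in question starts at 2**53, the
first integer a C double cannot represent exactly — a quick sketch:)

>>> import numpy as np
>>> x = 2**53 + 1
>>> int(np.float64(x)) == x
False
>>> int(np.float64(2**53)) == 2**53
True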
From charlesr.harris at gmail.com  Thu Aug 27 14:50:14 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 27 Aug 2009 12:50:14 -0600
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: <710F2847B0018641891D9A21602763605AD12E@ex3.envision.co.il>
References: <710F2847B0018641891D9A21602763605AD12E@ex3.envision.co.il>
Message-ID: 

2009/8/27 Nadav Horesh

> Double is the natural choice, there is a possibility of long double
> (float96 on x86 or float128 on amd64) where there is no precision loss.
> Is this option portable?

Not really. The long double type can be a bit weird and varies from
architecture to architecture.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From charlesr.harris at gmail.com  Thu Aug 27 14:54:28 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 27 Aug 2009 12:54:28 -0600
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: 
References: <710F2847B0018641891D9A21602763605AD12E@ex3.envision.co.il>
Message-ID: 

On Thu, Aug 27, 2009 at 12:50 PM, Charles R Harris wrote:

> 2009/8/27 Nadav Horesh
> [...]
>
> Not really. The long double type can be a bit weird and varies from
> architecture to architecture.
>

The real problem is deciding what to do with integer precisions that fit
in float32. At present we have

In [2]: x = ones(1, dtype=int16)

In [3]: true_divide(x,x)
Out[3]: array([ 1.], dtype=float32)

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From aisaac at american.edu  Thu Aug 27 15:12:01 2009
From: aisaac at american.edu (Alan G Isaac)
Date: Thu, 27 Aug 2009 15:12:01 -0400
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: 
References: <710F2847B0018641891D9A21602763605AD12E@ex3.envision.co.il>
Message-ID: <4A96DA81.4050405@american.edu>

Charles R Harris wrote:
> The real problem is deciding what to do with integer precisions that fit
> in float32. At present we have
>
> In [2]: x = ones(1, dtype=int16)
>
> In [3]: true_divide(x,x)
> Out[3]: array([ 1.], dtype=float32)

A user perspective: ambiguous cases should always be resolved to the
default (float64). Users that know what they are doing can always
request another dtype. (Well, at least in principle; currently ufuncs do
not allow a dtype argument, I guess. Is there a reason not to make the
`out` argument a keyword argument and then also alternatively allow a
dtype specification?)

Alan Isaac

From nadavh at visionsense.com  Thu Aug 27 15:13:34 2009
From: nadavh at visionsense.com (Nadav Horesh)
Date: Thu, 27 Aug 2009 22:13:34 +0300
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
References: <710F2847B0018641891D9A21602763605AD12E@ex3.envision.co.il>
Message-ID: <710F2847B0018641891D9A21602763605AD130@ex3.envision.co.il>

How about making this arch dependent translation:

short int -> float
int -> double
long int -> long double

or adding a flag that would switch between the above translation and the
option that would produce only doubles.

For some computing projects I made I would prefer the first option: there
I used huge arrays, and could not afford having extra precision on
account of memory consumption. Currently I do a lot of (8 and 16 bits)
image processing, memory size is not a problem, and it feels nice not to
worry about precision (think about the bad habits of Matlab users).

  Nadav

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Charles R Harris
Sent: Thu 27-Aug-09 21:54
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] What type should / return in python 3k
	when applied to two integer types?

[...]

From charlesr.harris at gmail.com  Thu Aug 27 15:25:29 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 27 Aug 2009 13:25:29 -0600
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: <710F2847B0018641891D9A21602763605AD130@ex3.envision.co.il>
References: <710F2847B0018641891D9A21602763605AD12E@ex3.envision.co.il>
	<710F2847B0018641891D9A21602763605AD130@ex3.envision.co.il>
Message-ID: 

2009/8/27 Nadav Horesh

> How about making this arch dependent translation:
>
> short int -> float
> int -> double
> long int -> long double
> [...]

I really want to avoid long double. My hope is that some day it will
always be quad precision on ieee machines and then it will be a more
universal type. But that day is still in the (far?) future.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From robert.kern at gmail.com  Thu Aug 27 15:27:52 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 27 Aug 2009 12:27:52 -0700
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: 
References: 
Message-ID: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>

On Thu, Aug 27, 2009 at 11:24, Charles R Harris wrote:
> I'm thinking double. There is a potential loss of precision for 64 bit
> ints but nothing else seems reasonable for a default. Thoughts?

Python int / Python int => Python float

no matter how many decimal places the two ints have. I also say double.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco
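(The py3k semantics being matched, for reference — true division always
yields a float, floor division stays integral, and Python ints are
unbounded:)

>>> 1 / 2
0.5
>>> 1 // 2
0
>>> type(2**64 // 3)
<class 'int'>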
From charlesr.harris at gmail.com  Thu Aug 27 15:43:33 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 27 Aug 2009 13:43:33 -0600
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
Message-ID: 

On Thu, Aug 27, 2009 at 1:27 PM, Robert Kern wrote:
> On Thu, Aug 27, 2009 at 11:24, Charles R Harris wrote:
> > I'm thinking double. There is a potential loss of precision for 64 bit
> > ints but nothing else seems reasonable for a default. Thoughts?
>
> Python int / Python int => Python float
>
> no matter how many decimal places the two ints have. I also say double.

What about //?

In [1]: x = ones(1, dtype=uint64)

In [2]: y = ones(1, dtype=int64)

In [3]: floor_divide(x,y).dtype
Out[3]: dtype('float64')

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From nadavh at visionsense.com  Thu Aug 27 15:43:31 2009
From: nadavh at visionsense.com (Nadav Horesh)
Date: Thu, 27 Aug 2009 22:43:31 +0300
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
References: <710F2847B0018641891D9A21602763605AD12E@ex3.envision.co.il>
	<710F2847B0018641891D9A21602763605AD130@ex3.envision.co.il>
Message-ID: <710F2847B0018641891D9A21602763605AD131@ex3.envision.co.il>

I really do not mind avoiding long doubles; in practice I used them only
once or twice. But I assume that short int -> float would be useful for
many of the numpy users. It also may align nicely with (u)int8->float16
on GPUs.

  Nadav

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Charles R Harris
Sent: Thu 27-Aug-09 22:25
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] What type should / return in python 3k
	when applied to two integer types?

[...]
-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 4106 bytes
Desc: not available
URL: 

From robert.kern at gmail.com  Thu Aug 27 15:46:33 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 27 Aug 2009 12:46:33 -0700
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: 
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
Message-ID: <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>

On Thu, Aug 27, 2009 at 12:43, Charles R Harris wrote:
> On Thu, Aug 27, 2009 at 1:27 PM, Robert Kern wrote:
>> On Thu, Aug 27, 2009 at 11:24, Charles R Harris wrote:
>> > I'm thinking double. There is a potential loss of precision for 64 bit
>> > ints but nothing else seems reasonable for a default. Thoughts?
>>
>> Python int / Python int => Python float
>>
>> no matter how many decimal places the two ints have. I also say double.
> > What about //? > > In [1]: x = ones(1, dtype=uint64) > > In [2]: y = ones(1, dtype=int64) > > In [3]: floor_divide(x,y).dtype > Out[3]: dtype('float64') Ewww. It should be an appropriate integer type. Probably whatever x*y is. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From charlesr.harris at gmail.com Thu Aug 27 15:57:40 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 27 Aug 2009 13:57:40 -0600 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? In-Reply-To: <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> Message-ID: On Thu, Aug 27, 2009 at 1:46 PM, Robert Kern wrote: > On Thu, Aug 27, 2009 at 12:43, Charles R > Harris wrote: > > > > > > On Thu, Aug 27, 2009 at 1:27 PM, Robert Kern > wrote: > >> > >> On Thu, Aug 27, 2009 at 11:24, Charles R > >> Harris wrote: > >> > I'm thinking double. There is a potential loss of precision for 64 bit > >> > ints > >> > but nothing else seems reasonable for a default. Thoughts? > >> > >> Python int / Python int => Python float > >> > >> no matter how many decimal places the two ints have. I also say double. > > > > What about //? > > > > In [1]: x = ones(1, dtype=uint64) > > > > In [2]: y = ones(1, dtype=int64) > > > > In [3]: floor_divide(x,y).dtype > > Out[3]: dtype('float64') > > Ewww. It should be an appropriate integer type. Probably whatever x*y is. > In [5]: (x*y).dtype Out[5]: dtype('float64') ?? The problem is that numpy doesn't have an integer type that can support all the possible return values. Python doesn't have that problem, so mapping Python behaviour to numpy is a square peg round hole sort of thing. Float64 doesn't do the job either. The only two really "correct" options would seem to be raising an error for this combination of types or returning object arrays containing long ints. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Aug 27 16:35:57 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 27 Aug 2009 16:35:57 -0400 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? In-Reply-To: References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> Message-ID: <1cd32cbb0908271335w17df1158h1fb60d38966ec427@mail.gmail.com> On Thu, Aug 27, 2009 at 3:57 PM, Charles R Harris wrote: > > > On Thu, Aug 27, 2009 at 1:46 PM, Robert Kern wrote: >> >> On Thu, Aug 27, 2009 at 12:43, Charles R >> Harris wrote: >> > >> > >> > On Thu, Aug 27, 2009 at 1:27 PM, Robert Kern >> > wrote: >> >> >> >> On Thu, Aug 27, 2009 at 11:24, Charles R >> >> Harris wrote: >> >> > I'm thinking double. There is a potential loss of precision for 64 >> >> > bit >> >> > ints >> >> > but nothing else seems reasonable for a default. Thoughts? >> >> >> >> Python int / Python int => Python float >> >> >> >> no matter how many decimal places the two ints have. I also say double. >> > >> > What about //? >> > >> > In [1]: x = ones(1, dtype=uint64) >> > >> > In [2]: y = ones(1, dtype=int64) >> > >> > In [3]: floor_divide(x,y).dtype >> > Out[3]: dtype('float64') >> >> Ewww. 
It should be an appropriate integer type. Probably whatever x*y is. > > In [5]: (x*y).dtype > Out[5]: dtype('float64') > ??? > > The problem is that numpy doesn't have an integer type that can support all > the possible return values. Python doesn't have that problem, so mapping > Python behaviour to numpy is a square peg round hole sort of thing. Float64 > doesn't do the job either. The only two really "correct" options would seem > to be raising an error for this combination of types or returning object > arrays containing long ints. > > Chuck > I'm always a bit surprised about integers in numpy and try to avoid calculations with them. So I would be in favor of x/y is correct floating point answer. Josef >>> x = np.ones(1, dtype=np.uint64); y = np.ones(1, dtype=np.int64) >>> np.true_divide((0*x),0) array([ 0.]) >>> np.true_divide((0*x),0).dtype dtype('float64') >>> np.true_divide((0*x),0.) array([ NaN]) >>> np.true_divide((x),0) array([ 0.]) >>> np.true_divide((x),0.) array([ Inf]) floor doesn't return an integer >>> np.floor(x).dtype dtype('float64') From charlesr.harris at gmail.com Thu Aug 27 16:51:51 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 27 Aug 2009 14:51:51 -0600 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? In-Reply-To: <1cd32cbb0908271335w17df1158h1fb60d38966ec427@mail.gmail.com> References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> <1cd32cbb0908271335w17df1158h1fb60d38966ec427@mail.gmail.com> Message-ID: On Thu, Aug 27, 2009 at 2:35 PM, wrote: > > I'm always a bit surprised about integers in numpy and try to avoid > calculations with them. So I would be in favor of x/y is correct > floating point answer. > > Josef > > >>> x = np.ones(1, dtype=np.uint64); y = np.ones(1, dtype=np.int64) > >>> np.true_divide((0*x),0) > array([ 0.]) > >>> np.true_divide((0*x),0).dtype > dtype('float64') > > >>> np.true_divide((0*x),0.) > array([ NaN]) > >>> np.true_divide((x),0) > array([ 0.]) > >>> np.true_divide((x),0.) > array([ Inf]) > > floor doesn't return an integer > floor_divide is different, it is supposed to correspond to the new python // operator. In [1]: x = ones(1, dtype=int16) In [2]: floor_divide(x,x).dtype Out[2]: dtype('int16') Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From terhorst at gmail.com Thu Aug 27 17:09:40 2009 From: terhorst at gmail.com (Jonathan T) Date: Thu, 27 Aug 2009 21:09:40 +0000 (UTC) Subject: [Numpy-discussion] Efficiently defining a multidimensional array Message-ID: Hi, I want to define a 3-D array as the sum of two 2-D arrays as follows: C[x,y,z] := A[x,y] + B[x,z] My linear algebra is a bit rusty; is there a good way to do this that does not require me to loop over x,y,z? Thanks! Jonathan From Chris.Barker at noaa.gov Thu Aug 27 17:22:39 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 27 Aug 2009 14:22:39 -0700 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? In-Reply-To: <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> Message-ID: <4A96F91F.20206@noaa.gov> Robert Kern wrote: > On Thu, Aug 27, 2009 at 12:43, Charles R > Harris wrote: >> In [3]: floor_divide(x,y).dtype >> Out[3]: dtype('float64') > > Ewww. 
It should be an appropriate integer type. Probably whatever x*y is. +1 if you are working with integers, you should get integers, because that's probably what you want. -- they can overflow, etc. anyway, so buyer beware! In [7]: x.dtype Out[7]: dtype('int64') In [8]: y.dtype Out[8]: dtype('uint64') In [9]: (x * y).dtype Out[9]: dtype('float64') hmmm -- I thought we had removed this kind of silent upcasting (particularly int-> float), but I guess when you mix two types, numpy has to choose something! In any case, x/y should probably return the same type as x*y. By the way -- is there something about py3k that changes all this? Or is this just an opportunity to perhaps make some backward-incompatible changes to numpy? -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From Chris.Barker at noaa.gov Thu Aug 27 17:27:02 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 27 Aug 2009 14:27:02 -0700 Subject: [Numpy-discussion] Efficiently defining a multidimensional array In-Reply-To: References: Message-ID: <4A96FA26.3080606@noaa.gov> Jonathan T wrote: > I want to define a 3-D array as the sum of two 2-D arrays as follows: > > C[x,y,z] := A[x,y] + B[x,z] Is this what you mean? In [14]: A = np.arange(6).reshape((2,3,1)) In [15]: B = np.arange(12).reshape((1,3,4)) In [18]: A Out[18]: array([[[0], [1], [2]], [[3], [4], [5]]]) In [19]: B Out[19]: array([[[ 0, 1, 2, 3], [ 4, 5, 6, 7], [ 8, 9, 10, 11]]]) In [20]: A+B Out[20]: array([[[ 0, 1, 2, 3], [ 5, 6, 7, 8], [10, 11, 12, 13]], [[ 3, 4, 5, 6], [ 8, 9, 10, 11], [13, 14, 15, 16]]]) -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From lciti at essex.ac.uk Thu Aug 27 17:24:15 2009 From: lciti at essex.ac.uk (Citi, Luca) Date: Thu, 27 Aug 2009 22:24:15 +0100 Subject: [Numpy-discussion] Efficiently defining a multidimensional array References: Message-ID: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E87@sernt14.essex.ac.uk> One solution I can think of still requires one loop (instead of three): import numpy as np a = np.arange(12).reshape(3,4) b = np.arange(15).reshape(3,5) z = np.empty(a.shape + (b.shape[-1],)) for i in range(len(z)): z[i] = np.add.outer(a[i], b[i]) From robert.kern at gmail.com Thu Aug 27 17:26:54 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 27 Aug 2009 14:26:54 -0700 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? In-Reply-To: <4A96F91F.20206@noaa.gov> References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> <4A96F91F.20206@noaa.gov> Message-ID: <3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com> On Thu, Aug 27, 2009 at 14:22, Christopher Barker wrote: > By the way -- is there something about py3k that changes all this? Or is > this just an opportunity to perhaps make some backward-incompatible > changes to numpy? Python 3 makes the promised change of int/int => float. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From lciti at essex.ac.uk Thu Aug 27 17:32:31 2009 From: lciti at essex.ac.uk (Citi, Luca) Date: Thu, 27 Aug 2009 22:32:31 +0100 Subject: [Numpy-discussion] Efficiently defining a multidimensional array References: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E87@sernt14.essex.ac.uk> Message-ID: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E88@sernt14.essex.ac.uk> Or a[:,:,None] + b[:,None,:] From charlesr.harris at gmail.com Thu Aug 27 17:40:42 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 27 Aug 2009 15:40:42 -0600 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? In-Reply-To: <3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com> References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> <4A96F91F.20206@noaa.gov> <3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com> Message-ID: On Thu, Aug 27, 2009 at 3:26 PM, Robert Kern wrote: > On Thu, Aug 27, 2009 at 14:22, Christopher Barker > wrote: > > > By the way -- is there something about py3k that changes all this? Or is > > this just an opportunity to perhaps make some backward-incompatible > > changes to numpy? > > Python 3 makes the promised change of int/int => float. > > -- > Robert Kern > I also intend to make it work with from future import division As a start. Because we only support python >= 2.4, the new division is available and that could help us with porting. I've also considered making that import the default for numpy internally, so we can fix things up, but that may be a bit radical. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From dwf at cs.toronto.edu Thu Aug 27 17:41:25 2009 From: dwf at cs.toronto.edu (David Warde-Farley) Date: Thu, 27 Aug 2009 17:41:25 -0400 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? In-Reply-To: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> Message-ID: <4B587A17-0A87-4CDE-B4F2-89474FDE2467@cs.toronto.edu> On 27-Aug-09, at 3:27 PM, Robert Kern wrote: > no matter how many decimal places the two ints have. Er... I must be missing something here. ;) David From fons at kokkinizita.net Thu Aug 27 17:49:56 2009 From: fons at kokkinizita.net (Fons Adriaensen) Date: Thu, 27 Aug 2009 23:49:56 +0200 Subject: [Numpy-discussion] future directions Message-ID: <20090827214955.GG2963@zita2.kokkinizita.net> Some weeks ago there was a post on this list requesting feedback on possible future directions for numpy. As I was quite busy at that time I'll reply to it now. My POV is that of a novice user, who at the same time wants quite badly to use the numpy framework for his numerical work which in this case is related to (some rather advanced) multichannell audio processing. >From that POV, I'd suggest the following: 1. Adopt an object based on Python-3's buffer protocol as the basic array type. It's immensely more powerful than ndarray, while at the same time it's close enough to ndarray to allow a gradual adoption. 2. Adopting that format will make it even more important to clearly define in which cases data gets copied and when not. This should be based on some simple rules that can be evaluated by a code author without requiring a lookup in the reference docs each time. 3. Finally remove all the redundancy and legacy stuff from the world of numerical Python. 
It is *very* confusing to a new user.

4. Ensure that each package deals with one problem area only.
For example a package that (by its name) suggests it provides
plotting facilities should provide only plotting facilities,
and not spectra, averages of all sorts, etc.

5. Ensure some consistency in style. Some numerical Python
packages use two-character function names, some have
veryLongCamelCased names.

Just my two Eurocents of course.

Ciao,

--
FA

Io lo dico sempre: l'Italia è troppo stretta e lunga.

From eads at soe.ucsc.edu  Thu Aug 27 17:52:34 2009
From: eads at soe.ucsc.edu (Damian Eads)
Date: Thu, 27 Aug 2009 14:52:34 -0700
Subject: [Numpy-discussion] Efficiently defining a multidimensional array
In-Reply-To: 
References: 
Message-ID: <91b4b1ab0908271452m8de5aa5m6087447530f1cb31@mail.gmail.com>

Hi Jonathan,

This isn't quite your typical linear algebra. NumPy has a nice feature
called array broadcasting, which enables you to perform element-wise
operations on arrays of different shapes. The number of dimensions of the
arrays must be the same, in your case, all the arrays must have three
dimensions. The newaxis keyword is useful for creating a dimension of
size one.

import numpy as np

m, n, k = 4, 3, 5   # example sizes; the original post left these undefined

A=np.random.rand(m,n)
B=np.random.rand(n,k)

# Line up the axes of size>1 by creating a new axis for each array.
C=A[:,:,np.newaxis] + B[np.newaxis,:,:]

# This is equivalent to the much slower triple for-loop
TC=np.zeros((m,n,k))
for x in xrange(0,m):
    for y in xrange(0,n):
        for z in xrange(0,k):
            TC[x,y,z]=A[x,y]+B[y,z]

# This should be true.
print (TC==C).all()

I hope this helps.

Damian

On Thu, Aug 27, 2009 at 3:09 PM, Jonathan T wrote:
> Hi,
>
> I want to define a 3-D array as the sum of two 2-D arrays as follows:
>
>   C[x,y,z] := A[x,y] + B[x,z]
>
> My linear algebra is a bit rusty; is there a good way to do this that does not
> require me to loop over x,y,z? Thanks!
>
> Jonathan
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

--
-----------------------------------------------------
Damian Eads                      Ph.D. Candidate
University of California         Computer Science
1156 High Street                 Machine Learning Lab, E2-489
Santa Cruz, CA 95064             http://www.soe.ucsc.edu/~eads

From robert.kern at gmail.com  Thu Aug 27 17:55:51 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 27 Aug 2009 14:55:51 -0700
Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types?
In-Reply-To: <4B587A17-0A87-4CDE-B4F2-89474FDE2467@cs.toronto.edu>
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
	<4B587A17-0A87-4CDE-B4F2-89474FDE2467@cs.toronto.edu>
Message-ID: <3d375d730908271455m4dda8186u8d203b79741c48ac@mail.gmail.com>

On Thu, Aug 27, 2009 at 14:41, David Warde-Farley wrote:
> On 27-Aug-09, at 3:27 PM, Robert Kern wrote:
>
>> no matter how many decimal places the two ints have.
>
> Er... I must be missing something here. ;)

I meant decimal digits.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From charlesr.harris at gmail.com  Thu Aug 27 17:56:54 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 27 Aug 2009 15:56:54 -0600
Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types?
In-Reply-To: <4A96F91F.20206@noaa.gov>
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
	<3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>
	<4A96F91F.20206@noaa.gov>
Message-ID: 

On Thu, Aug 27, 2009 at 3:22 PM, Christopher Barker wrote:

> Robert Kern wrote:
> > On Thu, Aug 27, 2009 at 12:43, Charles R Harris wrote:
> >> In [3]: floor_divide(x,y).dtype
> >> Out[3]: dtype('float64')
> >
> > Ewww. It should be an appropriate integer type. Probably whatever x*y is.
>
> +1 if you are working with integers, you should get integers, because
> that's probably what you want. -- they can overflow, etc. anyway, so
> buyer beware!
>
> In [7]: x.dtype
> Out[7]: dtype('int64')
>
> In [8]: y.dtype
> Out[8]: dtype('uint64')
>
> In [9]: (x * y).dtype
> Out[9]: dtype('float64')
>
> hmmm -- I thought we had removed this kind of silent upcasting
> (particularly int-> float), but I guess when you mix two types, numpy
> has to choose something!
>
> In any case, x/y should probably return the same type as x*y.
>

Another possibility is to cast the signed type to unsigned of the same
precision. But then uint64(1)//int64(-1) == 0, which may be too much of a
surprise. Note that int64(x)//uint64(y) always fits in int64, so the order
of the types could be significant also. However, two possibilities for the
return type are likely a complexity too far. The cast from signed to
unsigned type of the same precision is what I would have chosen for numpy
in the first place. Then again, I tend to be a bit "odd" about some things
;)

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From gael.varoquaux at normalesup.org  Thu Aug 27 18:03:34 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Fri, 28 Aug 2009 00:03:34 +0200
Subject: [Numpy-discussion] Performance testing in unit tests
Message-ID: <20090827220334.GB21256@phare.normalesup.org>

Hi list,

This is slightly off topic, so please pardon me.

I want to do performance testing. To be precise, I have a simple case: I
want to check that 2 operations perform with a similar speed (so I am
abstracted from the machine's performance).

What would be the recommended way of timing the operation in a unit test?
I use nose, if this is of any use. I am more than happy to be pointed to
an example.

Cheers,

Gaël

From Chris.Barker at noaa.gov  Thu Aug 27 18:13:30 2009
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Thu, 27 Aug 2009 15:13:30 -0700
Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types?
In-Reply-To: 
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
	<3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>
	<4A96F91F.20206@noaa.gov>
	<3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com>
Message-ID: <4A97050A.6040505@noaa.gov>

Charles R Harris wrote:
> I also intend to make it work with
>
> from future import division

doesn't already?

In [3]: from __future__ import division

In [5]: 3 / 4
Out[5]: 0.75

In [6]: import numpy as np

In [7]: np.array(3) / np.array(4)
Out[7]: 0.75

In [8]: np.array(3) // np.array(4)
Out[8]: 0

> I've also considered making that import the default for numpy

I'd like that, but it is a bit radical --

-Chris

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From robert.kern at gmail.com  Thu Aug 27 18:21:34 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 27 Aug 2009 15:21:34 -0700
Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types?
In-Reply-To: <4A97050A.6040505@noaa.gov>
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
	<3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>
	<4A96F91F.20206@noaa.gov>
	<3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com>
	<4A97050A.6040505@noaa.gov>
Message-ID: <3d375d730908271521v25cdaa18j1a7dc4a3745c5e2c@mail.gmail.com>

On Thu, Aug 27, 2009 at 15:13, Christopher Barker wrote:
> Charles R Harris wrote:
>> I also intend to make it work with
>>
>> from future import division
>
> doesn't already?
>
> In [3]: from __future__ import division
>
> In [5]: 3 / 4
> Out[5]: 0.75
>
> In [6]: import numpy as np
>
> In [7]: np.array(3) / np.array(4)
> Out[7]: 0.75
>
> In [8]: np.array(3) // np.array(4)
> Out[8]: 0

Yes, the support for that feature is already there.

>> I've also considered making that import the default for numpy
>
> I'd like that, but it is a bit radical --

I don't think so. The policy just affects modules inside numpy, not
users of numpy.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From robert.kern at gmail.com  Thu Aug 27 18:33:30 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 27 Aug 2009 15:33:30 -0700
Subject: [Numpy-discussion] Performance testing in unit tests
In-Reply-To: <20090827220334.GB21256@phare.normalesup.org>
References: <20090827220334.GB21256@phare.normalesup.org>
Message-ID: <3d375d730908271533s4166a5f4jef5ed95e7097d1c1@mail.gmail.com>

On Thu, Aug 27, 2009 at 15:03, Gael Varoquaux wrote:
> Hi list,
>
> This is slightly off topic, so please pardon me.
>
> I want to do performance testing. To be precise, I have a simple case: I
> want to check that 2 operations perform with a similar speed (so I am
> abstracted from the machine's performance).
>
> What would be the recommended way of timing the operation in a unit test?
> I use nose, if this is of any use. I am more than happy to be pointed to
> an example.

From my experience, doing performance tests inside of your normal test
suite is entirely unreliable. Performance testing requires rigorous
control over external factors that you cannot do inside of your test
suite. Your tests will fail when run in the entire test suite and pass
when run by themselves, or vice versa.

It can also be hard to run two similar tests serially thanks to any
number of caches that might be in effect, but this is often
manageable.

If you can manage that, then you can probably use nose or some other
framework to conveniently run individually named tests in a reasonably
controlled manner. There is not much unit test-specific to do, though.
You time your two code paths and compare them inside of a
test_function() just like you would do if you are writing an
independent benchmark script.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
-- Umberto Eco From charlesr.harris at gmail.com Thu Aug 27 19:00:35 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 27 Aug 2009 17:00:35 -0600 Subject: [Numpy-discussion] Efficiently defining a multidimensional array In-Reply-To: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E88@sernt14.essex.ac.uk> References: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E87@sernt14.essex.ac.uk> <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E88@sernt14.essex.ac.uk> Message-ID: On Thu, Aug 27, 2009 at 3:32 PM, Citi, Luca wrote: > Or > a[:,:,None] + b[:,None,:] I think that is the way to go. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From terhorst at gmail.com Thu Aug 27 19:50:33 2009 From: terhorst at gmail.com (Jonathan T) Date: Thu, 27 Aug 2009 23:50:33 +0000 (UTC) Subject: [Numpy-discussion] Efficiently defining a multidimensional array References: <91b4b1ab0908271452m8de5aa5m6087447530f1cb31@mail.gmail.com> Message-ID: Perfect, that is exactly what I was looking for. Thanks to all who responded. There is one more problem which currently has me stumped. Same idea but slightly different effect: V[p,x,r] := C[p, E[p,x,r], r] This multidimensional array stuff is confusing but the time savings seem to be worth it (my arrays have several million entries.) If anyone has an idea I'd love to hear it. Thanks again! Jonathan From nmb at wartburg.edu Thu Aug 27 20:53:42 2009 From: nmb at wartburg.edu (Neil Martinsen-Burrell) Date: Thu, 27 Aug 2009 19:53:42 -0500 Subject: [Numpy-discussion] Efficiently defining a multidimensional array In-Reply-To: References: Message-ID: <4A972A96.6050006@wartburg.edu> On 2009-08-27 16:09 , Jonathan T wrote: > Hi, > > I want to define a 3-D array as the sum of two 2-D arrays as follows: > > C[x,y,z] := A[x,y] + B[x,z] > > My linear algebra is a bit rusty; is there a good way to do this that does not > require me to loop over x,y,z? Thanks! Numpy's broadcasting is ideal for this. Using None as an index in a slice adds a new axis with length 1 to an array. Then, the operation of addition broadcasts the arrays by repeating them across the singleton dimensions. So the above operation is C[x,y,z] = A[x,y,z] + B[x,y,z] where A is constant across its third dimension and B is constant across its second dimenision. For a concrete example: In [3]: A = np.arange(10).reshape((5,2)) In [4]: B = np.arange(15).reshape((5,3)) In [5]: C = A[:,:,None] + B[:,None,:] In [6]: C Out[6]: array([[[ 0, 1, 2], [ 1, 2, 3]], [[ 5, 6, 7], [ 6, 7, 8]], [[10, 11, 12], [11, 12, 13]], [[15, 16, 17], [16, 17, 18]], [[20, 21, 22], [21, 22, 23]]]) In [7]: C.shape Out[7]: (5, 2, 3) In [8]: Cprime = np.empty((5,2,3)) In [9]: for x in range(5): for y in range(2): for z in range(3): Cprime[x,y,z] = A[x,y] + B[x,z] ....: In [13]: (C == Cprime).all() Out[13]: True -Neil From d_l_goldsmith at yahoo.com Thu Aug 27 20:56:00 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Thu, 27 Aug 2009 17:56:00 -0700 (PDT) Subject: [Numpy-discussion] future directions In-Reply-To: <20090827214955.GG2963@zita2.kokkinizita.net> Message-ID: <712515.6455.qm@web52108.mail.re2.yahoo.com> --- On Thu, 8/27/09, Fons Adriaensen wrote: > 2. Adopting that format will make it even more important > to > clearly define in which cases data gets copied and when > not. > This should be based on some simple rules that can be > evaluated > by a code author without requiring a lookup in the > reference > docs each time. 
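A quick aside on Jonathan's follow-up question above, V[p,x,r] :=
C[p, E[p,x,r], r], which goes unanswered in this stretch of the digest: a
hedged sketch using broadcast index arrays (all sizes and names below are
invented for illustration, not taken from the thread):

import numpy as np

P, X, R, Y = 4, 3, 2, 5                 # made-up sizes
C = np.random.rand(P, Y, R)             # C[p, y, r]
E = np.random.randint(0, Y, (P, X, R))  # E[p,x,r] indexes C's middle axis

pi = np.arange(P)[:, None, None]        # shape (P, 1, 1)
ri = np.arange(R)[None, None, :]        # shape (1, 1, R)
V = C[pi, E, ri]                        # fancy indexing broadcasts to (P, X, R)

# check against the explicit triple loop
Vloop = np.empty((P, X, R))
for p in range(P):
    for x in range(X):
        for r in range(R):
            Vloop[p, x, r] = C[p, E[p, x, r], r]
print (V == Vloop).all()                # True

The three index arrays broadcast against each other, so each output element
V[p,x,r] picks C[p, E[p,x,r], r] without any Python-level loop.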
I think this is a _good_ idea (I don't know how easy/difficult it would be
to implement, though; perhaps the most difficult part would be the human
side, i.e., settling on a policy to implement.)

> 3. Finally remove all the redundancy and legacy stuff from
> the world of numerical Python. It is *very* confusing to a new
> user.

I like this also (but I also know that actually trying to achieve it would
ruffle a lot of feathers).

> 4. Ensure that each package deals with one problem area
> only. For example a package that (by its name) suggests it
> provides plotting facilities should provide only plotting
> facilities, and not spectra, averages of all sorts, etc.

I thought #3 was "Finally." ;-) Seriously though, can you be more specific
as to where you see this problem presently in NumPy? For example, NumPy
doesn't presently have a plotting package... (MatPlotLib is not a NumPy
package - it is an independent package that uses NumPy, but it is not part
of NumPy - picking nits, perhaps, but it's a nit I'm sure many would beg
to pick.)

> 5. Ensure some consistency in style. Some numerical Python
> packages use two-character function names, some have
> veryLongCamelCased names.

Again, where exactly do you see this problem presently in NumPy?

DG

> Just my two Eurocents of course.
>
> Ciao,
>
> --
> FA
>
> Io lo dico sempre: l'Italia è troppo stretta e lunga.
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From nmb at wartburg.edu  Thu Aug 27 21:28:20 2009
From: nmb at wartburg.edu (Neil Martinsen-Burrell)
Date: Thu, 27 Aug 2009 20:28:20 -0500
Subject: [Numpy-discussion] future directions
In-Reply-To: <712515.6455.qm@web52108.mail.re2.yahoo.com>
References: <712515.6455.qm@web52108.mail.re2.yahoo.com>
Message-ID: <4A9732B4.3030200@wartburg.edu>

On 2009-08-27 19:56 , David Goldsmith wrote:
> --- On Thu, 8/27/09, Fons Adriaensen wrote:
[...]
>> 3. Finally remove all the redundancy and legacy stuff from the world of
>> numerical Python. It is *very* confusing to a new user.
>
> I like this also (but I also know that actually trying to achieve it
> would ruffle a lot of feathers).

I think that feather ruffling is *not* the problem with this change. The
persistence of the idea that removing Numpy's legacy features will only be
an annoyance is inimical to the popularity of the whole Numpy project.
Numpy enjoys some of its ongoing popularity among active scientists
because of its stability and the ease of transition forward from Numeric.
Once scientists have working codes it is more than an annoyance to have
to change those codes. In some cases, it may be the motivation for people
to use other software packages.

I think that as we go forward it is important to balance not confusing
new users (a problem that can be addressed with better documentation and
pointing people to modern ways of doing things) with not alienating
existing users (who are in some cases influential in recruiting those new
users in the first place). For software developers, compatibility-breaking
changes seem like they call for just a few small tweaks to the code. For
scientists who work with software, those same changes may call for never
choosing Numpy again in the future. I think that this is a balance that
we should be aware of when introducing changes.
It makes sense that we will all see this balance differently, but I think
that we need to acknowledge that this is the essential tension in removing
cruft incompatibly.

-Neil

From charlesr.harris at gmail.com  Thu Aug 27 22:11:53 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 27 Aug 2009 20:11:53 -0600
Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types?
In-Reply-To: <3d375d730908271521v25cdaa18j1a7dc4a3745c5e2c@mail.gmail.com>
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
	<3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>
	<4A96F91F.20206@noaa.gov>
	<3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com>
	<4A97050A.6040505@noaa.gov>
	<3d375d730908271521v25cdaa18j1a7dc4a3745c5e2c@mail.gmail.com>
Message-ID: 

On Thu, Aug 27, 2009 at 4:21 PM, Robert Kern wrote:

> On Thu, Aug 27, 2009 at 15:13, Christopher Barker wrote:
> > Charles R Harris wrote:
> >> I also intend to make it work with
> >>
> >> from future import division
> >
> > doesn't already?
> >
> > In [3]: from __future__ import division
> >
> > In [5]: 3 / 4
> > Out[5]: 0.75
> >
> > In [6]: import numpy as np
> >
> > In [7]: np.array(3) / np.array(4)
> > Out[7]: 0.75
> >
> > In [8]: np.array(3) // np.array(4)
> > Out[8]: 0
>
> Yes, the support for that feature is already there.
>
> >> I've also considered making that import the default for numpy
> >
> > I'd like that, but it is a bit radical --
>
> I don't think so. The policy just affects modules inside numpy, not
> users of numpy.
>

If we go to returning doubles we will have a backward compatibility
problem because the current true_divide returns float32 for short ints. I
see three options here.

1) Leave true_divide as is.
2) Leave true_divide as is, introduce slash_divide that always returns
doubles.
3) Change true_divide to always return doubles.

None of these options involve much more than a short edit of the
generate_umath.py module.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From josef.pktd at gmail.com  Thu Aug 27 23:37:02 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Thu, 27 Aug 2009 23:37:02 -0400
Subject: [Numpy-discussion] histogram: sum up values in each bin
In-Reply-To: <1cd32cbb0908271027l75916349jf978ae21ab9c43d4@mail.gmail.com>
References: <270620220908270523h268998fcmd617f9557049b9ab@mail.gmail.com>
	<1cd32cbb0908270619t509c4b16je8427104eace3f82@mail.gmail.com>
	<1cd32cbb0908271027l75916349jf978ae21ab9c43d4@mail.gmail.com>
Message-ID: <1cd32cbb0908272037p418344b2nc91c596be2aff642@mail.gmail.com>

On Thu, Aug 27, 2009 at 1:27 PM, wrote:
> On Thu, Aug 27, 2009 at 12:49 PM, Tim Michelsen wrote:
>>> Tim, do you mean, that you want to apply other functions, e.g. mean or
>>> variance, to the original values but calculated per bin?
>> Sorry that I forgot to add this. Shame.
>>
>> I would like to apply these mathematical functions on the original values
>> stacked in the respective bins.
>>
>> For instance:
>>
>> The sample data measures the weight of an animal.
>>
>> 1) histogram gives a count of how many values are in each bin.
>>
>> I would like to calculate the average weight of all animals
>> sorted in bin1, bin2 etc.
>>
>> This is also useful where you have a time component.
>>
>> In Spreadsheets I would use a '=' to reference the original data and then
>> either sum it up or count it per class.
>>
>> I hope this is somehow understandable.
> Yes, it is a quite common use case for descriptive statistics, and I'm
> starting to collect different ways of doing it.
>
> In your case, Vincent's way is the easiest.
>
> If you need to be faster, or you want to apply the same classification
> also to other variables, e.g. the size of the animal, then creating a
> label array would be a more flexible solution.
>
> There was a similar thread recently on the scipy-user list for sorted
> arrays: "How to average different pieces of an array?"
>
> Josef
>
>> Thanks,
>> Timmie

Here is a version where bincount and histogram produce the same results
for mean and variance per bin if no bins are empty. If a bin is empty then
either some nans or some small arbitrary numbers are returned.

Josef

# incompletely tested if a bin has zero elements, nans or missing in variance

import numpy as np

x = np.random.normal(size=100) #+ 1e5 # + 1e8 to compare precision
c, b = np.histogram(x)

sortind = np.argsort(x)
reverse_sortind = np.argsort(sortind)
xsorted = x[sortind]
bind = np.searchsorted(xsorted,b,'right')

#construct label index
ind2 = np.zeros(x.shape, int)
ind2[bind[1:-1]] = 1 # assumes boundary indices are included in y
ind = ind2.cumsum()
labels = ind[reverse_sortind] # reverse sorting

print '\nmean'
means = np.bincount(ind,xsorted)*1.0/np.bincount(ind)
print means

count = np.bincount(labels)
means = np.bincount(labels,x)*1.0/count
print means

#compare mean with histogram
countsPerBin = np.histogram(x)[0]
sumsPerBin = np.histogram(x, weights=x)[0]
averagePerBin = sumsPerBin / countsPerBin
print averagePerBin

print '\nvariance'
meanarr = means[labels]
var = np.bincount(labels,(x-meanarr)**2)/count
print var

# with histogram
squaresums_perbin = np.histogram(x, weights=x**2)[0]
var_perbin = squaresums_perbin*1.0 / countsPerBin - averagePerBin**2
print var_perbin

print np.array(var) - np.array(var_perbin)

From gael.varoquaux at normalesup.org  Fri Aug 28 01:44:30 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Fri, 28 Aug 2009 07:44:30 +0200
Subject: [Numpy-discussion] Performance testing in unit tests
In-Reply-To: <3d375d730908271533s4166a5f4jef5ed95e7097d1c1@mail.gmail.com>
References: <20090827220334.GB21256@phare.normalesup.org>
	<3d375d730908271533s4166a5f4jef5ed95e7097d1c1@mail.gmail.com>
Message-ID: <20090828054430.GC16134@phare.normalesup.org>

On Thu, Aug 27, 2009 at 03:33:30PM -0700, Robert Kern wrote:
> From my experience, doing performance tests inside of your normal test
> suite is entirely unreliable. Performance testing requires rigorous
> control over external factors that you cannot do inside of your test
> suite. Your tests will fail when run in the entire test suite and pass
> when run by themselves, or vice versa.

> It can also be hard to run two similar tests serially thanks to any
> number of caches that might be in effect, but this is often
> manageable.

That is why it can be useful to repeat the measure several times, isn't
it?

> If you can manage that, then you can probably use nose or some other
> framework to conveniently run individually named tests in a reasonably
> controlled manner. There is not much unit test-specific to do, though.
> You time your two code paths and compare them inside of a
> test_function() just like you would do if you are writing an
> independent benchmark script.
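A minimal sketch of the kind of controlled comparison Robert describes,
using timeit's Timer.repeat and a best-of-N minimum (the two operations
compared here are stand-ins, not taken from this thread, and the tolerance
is arbitrary):

import timeit

def best_of(stmt, setup, repeat=5, number=10):
    # the minimum over several runs is least sensitive to background load
    return min(timeit.Timer(stmt, setup).repeat(repeat, number))

def test_similar_speed():
    setup = "import numpy as np; a = np.random.random(1000000)"
    t_sum = best_of("a.sum()", setup)
    t_red = best_of("np.add.reduce(a)", setup)
    # nose collects test_* functions; the 50% tolerance is a judgment call
    assert abs(t_sum - t_red) / t_red < 0.5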
OK, the following seems to give quite reproducible results:

import time
import hashlib
import nose.tools
import numpy as np

# timing procedure:
a = np.random.random(1000000)

time_hash = list()
for _ in range(3):
    t1 = time.time()
    hash(a)
    time_hash.append(time.time() - t1)
time_hash = min(time_hash)

time_hashlib = list()
for _ in range(3):
    t1 = time.time()
    hashlib.md5(a).hexdigest()
    time_hashlib.append(time.time() - t1)
time_hashlib = min(time_hashlib)

relative_diff = abs(time_hashlib - time_hash)/time_hashlib
nose.tools.assert_true(relative_diff < 0.05)

Thanks,

Gaël

From robert.kern at gmail.com  Fri Aug 28 01:48:55 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 28 Aug 2009 00:48:55 -0500
Subject: [Numpy-discussion] Performance testing in unit tests
In-Reply-To: <20090828054430.GC16134@phare.normalesup.org>
References: <20090827220334.GB21256@phare.normalesup.org>
	<3d375d730908271533s4166a5f4jef5ed95e7097d1c1@mail.gmail.com>
	<20090828054430.GC16134@phare.normalesup.org>
Message-ID: <3d375d730908272248g2b67c997xa5028da5896b3b97@mail.gmail.com>

On Fri, Aug 28, 2009 at 00:44, Gael Varoquaux wrote:
> On Thu, Aug 27, 2009 at 03:33:30PM -0700, Robert Kern wrote:
>> From my experience, doing performance tests inside of your normal test
>> suite is entirely unreliable. Performance testing requires rigorous
>> control over external factors that you cannot do inside of your test
>> suite. Your tests will fail when run in the entire test suite and pass
>> when run by themselves, or vice versa.
>
>> It can also be hard to run two similar tests serially thanks to any
>> number of caches that might be in effect, but this is often
>> manageable.
>
> That is why it can be useful to repeat the measure several times, isn't
> it?

If warm-cache performance is what you want to measure, yes.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From johan.gronqvist at gmail.com  Fri Aug 28 02:28:01 2009
From: johan.gronqvist at gmail.com (Johan Grönqvist)
Date: Fri, 28 Aug 2009 08:28:01 +0200
Subject: [Numpy-discussion] future directions
In-Reply-To: <4A9732B4.3030200@wartburg.edu>
References: <712515.6455.qm@web52108.mail.re2.yahoo.com>
	<4A9732B4.3030200@wartburg.edu>
Message-ID: 

Neil Martinsen-Burrell wrote:
>
> The persistence of the idea that removing Numpy's legacy features will
> only be an annoyance is inimical to the popularity of the whole Numpy
> project. [...] Once scientists have working codes it is more than an
> annoyance to have to change those codes. In some cases, it may be the
> motivation for people to use other software packages.
> [...]
> For software developers,
> compatibility-breaking changes seem like they call for just a few small
> tweaks to the code. For scientists who work with software, those same
> changes may call for never choosing Numpy again in the future.
>

I very much agree, and similar (a bit worse, actually) behaviour in
another product is an important reason why I am trying to switch to numpy
(and I enjoy talking badly about that other product when appropriate).

If the proposed changes seem important, I would appreciate having a
namespace called numpy.legacy or numpy.deprecated or numpy.1dotX, that
retains all the old functions. That would only be a small annoyance (to
me) if importing the right thing could be handled in code when moving
between machines having different versions of numpy.
(something like

    from numpy import version
    if version > x.y:
        import numpy.legacy
    else:
        import numpy
)

All IMHO, my 2 cents etc.

Thanks

/ johan

From d_l_goldsmith at yahoo.com  Fri Aug 28 04:30:07 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Fri, 28 Aug 2009 01:30:07 -0700 (PDT)
Subject: [Numpy-discussion] future directions
In-Reply-To: 
Message-ID: <646356.58574.qm@web52106.mail.re2.yahoo.com>

--- On Thu, 8/27/09, Johan Grönqvist wrote:

> If the proposed changes seem important, I would appreciate having a
> namespace called numpy.legacy or numpy.deprecated or numpy.1dotX, that
> retains all the old functions. That would only be a small annoyance (to
> me) if importing the right thing could be handled in code when moving
> between machines having different versions of numpy.
>
> (something like
>     from numpy import version
>     if version > x.y:
>         import numpy.legacy
>     else:
>         import numpy
> )
>
> All IMHO, my 2 cents etc.

No need to be H about it ;-) it sounds like a pretty good compromise IYAM
(but I won't be surprised by a deluge of explanations as to why it isn't).

DG

> Thanks
>
> / johan
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From aisaac at american.edu  Fri Aug 28 09:24:50 2009
From: aisaac at american.edu (Alan G Isaac)
Date: Fri, 28 Aug 2009 09:24:50 -0400
Subject: [Numpy-discussion] future directions
In-Reply-To: 
References: <712515.6455.qm@web52108.mail.re2.yahoo.com>
	<4A9732B4.3030200@wartburg.edu>
Message-ID: <4A97DAA2.9040006@american.edu>

> Neil Martinsen-Burrell wrote:
>> The persistence of the idea that removing Numpy's legacy features will
>> only be an annoyance is inimical to the popularity of the whole Numpy
>> project. [...] Once scientists have working codes it is more than an
>> annoyance to have to change those codes. In some cases, it may be the
>> motivation for people to use other software packages.

On 8/28/2009 2:28 AM Johan Grönqvist apparently wrote:
> I very much agree

I'm just a user but I've read the NumPy and SciPy lists for years.
Although this idea keeps resurfacing, I do not have the sense that there
is anything close to developer unanimity that this is a good idea. Note
that over the years proposals to *add* functionality to NumPy keep
resurfacing too, with similar healthy inertia.

I will speculate that the outcome will eventually be that something like
ndarray will become part of the standard library, satisfying users who
just want that, and that NumPy will continue to provide its current (and
therefore legacy) functionality. That is just speculation by a user, but
the new buffer interface does seem to lay the groundwork.

Alan Isaac

From ndbecker2 at gmail.com  Fri Aug 28 09:46:39 2009
From: ndbecker2 at gmail.com (Neal Becker)
Date: Fri, 28 Aug 2009 09:46:39 -0400
Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types?
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
	<3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>
	<4A96F91F.20206@noaa.gov>
	<3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com>
Message-ID: 

Robert Kern wrote:

> On Thu, Aug 27, 2009 at 14:22, Christopher Barker wrote:
>
>> By the way -- is there something about py3k that changes all this? Or is
>> this just an opportunity to perhaps make some backward-incompatible
>> changes to numpy?
> > Python 3 makes the promised change of int/int => float. > Does that mean that we want numpy to do the same? I'm not so sure. Sounds like opening a can of worms (numpy has more types to worry about than just int and float. If we start playing strange games we may regret it.) From david.huard at gmail.com Fri Aug 28 09:50:58 2009 From: david.huard at gmail.com (David Huard) Date: Fri, 28 Aug 2009 09:50:58 -0400 Subject: [Numpy-discussion] Fortran reader for npy files Message-ID: <91cf711d0908280650j35388cb9tba360adf2763085e@mail.gmail.com> Hi, Has someone written a fortran reader for the "npy" binary files numpy.save creates ? Thanks, David -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Fri Aug 28 09:55:19 2009 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 28 Aug 2009 13:55:19 +0000 (UTC) Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> <4A96F91F.20206@noaa.gov> <3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com> Message-ID: Fri, 28 Aug 2009 09:46:39 -0400, Neal Becker kirjoitti: > Robert Kern wrote: > >> On Thu, Aug 27, 2009 at 14:22, Christopher >> Barker wrote: >> >>> By the way -- is there something about py3k that changes all this? Or >>> is this just an opportunity to perhaps make some backward-incompatible >>> changes to numpy? >> >> Python 3 makes the promised change of int/int => float. > > Does that mean that we want numpy to do the same? I'm not so sure. > Sounds like opening a can of worms (numpy has more types to worry about > than just int and float. If we start playing strange games we may > regret it.) I believe we want to. This is not really a strange trick: it's just that in Python 3, the operator / is true_division, and // is floor_division. I believe any worms released by this are mostly small and tasty... The main issue is probably just choosing an appropriate float return type, and personally I believe this should be same as numpy's default float. -- Pauli Virtanen From josef.pktd at gmail.com Fri Aug 28 10:08:28 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 28 Aug 2009 10:08:28 -0400 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? In-Reply-To: References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> <4A96F91F.20206@noaa.gov> <3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com> Message-ID: <1cd32cbb0908280708l3501f6ffnc29ead80111e75ca@mail.gmail.com> On Fri, Aug 28, 2009 at 9:55 AM, Pauli Virtanen wrote: > Fri, 28 Aug 2009 09:46:39 -0400, Neal Becker kirjoitti: > >> Robert Kern wrote: >> >>> On Thu, Aug 27, 2009 at 14:22, Christopher >>> Barker wrote: >>> >>>> By the way -- is there something about py3k that changes all this? Or >>>> is this just an opportunity to perhaps make some backward-incompatible >>>> changes to numpy? >>> >>> Python 3 makes the promised change of int/int => float. >> >> Does that mean that we want numpy to do the same? ?I'm not so sure. >> Sounds like opening a can of worms (numpy has more types to worry about >> than just int and float. ?If we start playing strange games we may >> regret it.) > > I believe we want to. 
This is not really a strange trick: it's just that > in Python 3, the operator / is true_division, and // is floor_division. > I believe any worms released by this are mostly small and tasty... > > The main issue is probably just choosing an appropriate float return > type, and personally I believe this should be same as numpy's default > float. and getting the infs and nans as in true float division not as in np.true_divide Josef > > -- > Pauli Virtanen > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From charlesr.harris at gmail.com Fri Aug 28 10:16:09 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 28 Aug 2009 08:16:09 -0600 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? In-Reply-To: <1cd32cbb0908280708l3501f6ffnc29ead80111e75ca@mail.gmail.com> References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> <4A96F91F.20206@noaa.gov> <3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com> <1cd32cbb0908280708l3501f6ffnc29ead80111e75ca@mail.gmail.com> Message-ID: On Fri, Aug 28, 2009 at 8:08 AM, wrote: > On Fri, Aug 28, 2009 at 9:55 AM, Pauli Virtanen wrote: > > Fri, 28 Aug 2009 09:46:39 -0400, Neal Becker kirjoitti: > > > >> Robert Kern wrote: > >> > >>> On Thu, Aug 27, 2009 at 14:22, Christopher > >>> Barker wrote: > >>> > >>>> By the way -- is there something about py3k that changes all this? Or > >>>> is this just an opportunity to perhaps make some backward-incompatible > >>>> changes to numpy? > >>> > >>> Python 3 makes the promised change of int/int => float. > >> > >> Does that mean that we want numpy to do the same? I'm not so sure. > >> Sounds like opening a can of worms (numpy has more types to worry about > >> than just int and float. If we start playing strange games we may > >> regret it.) > > > > I believe we want to. This is not really a strange trick: it's just that > > in Python 3, the operator / is true_division, and // is floor_division. > > I believe any worms released by this are mostly small and tasty... > > > > The main issue is probably just choosing an appropriate float return > > type, and personally I believe this should be same as numpy's default > > float. > > and getting the infs and nans as in true float division not as in > np.true_divide > Umm, good point. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Fri Aug 28 10:46:45 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 28 Aug 2009 10:46:45 -0400 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> <4A96F91F.20206@noaa.gov> <3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com> <1cd32cbb0908280708l3501f6ffnc29ead80111e75ca@mail.gmail.com> Message-ID: Charles R Harris wrote: > On Fri, Aug 28, 2009 at 8:08 AM, wrote: > >> On Fri, Aug 28, 2009 at 9:55 AM, Pauli Virtanen wrote: >> > Fri, 28 Aug 2009 09:46:39 -0400, Neal Becker kirjoitti: >> > >> >> Robert Kern wrote: >> >> >> >>> On Thu, Aug 27, 2009 at 14:22, Christopher >> >>> Barker wrote: >> >>> >> >>>> By the way -- is there something about py3k that changes all this? 
>> >>>> Or is this just an opportunity to perhaps make some >> >>>> backward-incompatible changes to numpy? >> >>> >> >>> Python 3 makes the promised change of int/int => float. >> >> >> >> Does that mean that we want numpy to do the same? I'm not so sure. >> >> Sounds like opening a can of worms (numpy has more types to worry >> >> about >> >> than just int and float. If we start playing strange games we may >> >> regret it.) >> > >> > I believe we want to. This is not really a strange trick: it's just >> > that in Python 3, the operator / is true_division, and // is >> > floor_division. I believe any worms released by this are mostly small >> > and tasty... >> > >> > The main issue is probably just choosing an appropriate float return >> > type, and personally I believe this should be same as numpy's default >> > float. >> >> and getting the infs and nans as in true float division not as in >> np.true_divide >> > > Umm, good point. > > Chuck explicit is better than implicit. IMO, if I want int/int-> float, I should ask for it explicitly, by casting the ints to float first (in numpy, that would be using astype). From aisaac at american.edu Fri Aug 28 10:53:58 2009 From: aisaac at american.edu (Alan G Isaac) Date: Fri, 28 Aug 2009 10:53:58 -0400 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? In-Reply-To: References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com> <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> <4A96F91F.20206@noaa.gov> <3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com> <1cd32cbb0908280708l3501f6ffnc29ead80111e75ca@mail.gmail.com> Message-ID: <4A97EF86.7080201@american.edu> On 8/28/2009 10:46 AM Neal Becker apparently wrote: > explicit is better than implicit. IMO, if I want int/int-> float, I should > ask for it explicitly, by casting the ints to float first (in numpy, that > would be using astype). Aren't you begging the question? Nobody is suggesting int//int -> float. The question is: what is the meaning of `/`. Adopting a Python 3 compatible meaning is forward looking. Once we agree on the meaning, it *is* explicit, as is int//int. Alan Isaac From josef.pktd at gmail.com Fri Aug 28 10:54:25 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 28 Aug 2009 10:54:25 -0400 Subject: [Numpy-discussion] What type should / return in python 3k when applied to two integer types? In-Reply-To: References: <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com> <4A96F91F.20206@noaa.gov> <3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com> <1cd32cbb0908280708l3501f6ffnc29ead80111e75ca@mail.gmail.com> Message-ID: <1cd32cbb0908280754w4a003f3ck13c52132a7fa861b@mail.gmail.com> On Fri, Aug 28, 2009 at 10:46 AM, Neal Becker wrote: > Charles R Harris wrote: > >> On Fri, Aug 28, 2009 at 8:08 AM, wrote: >> >>> On Fri, Aug 28, 2009 at 9:55 AM, Pauli Virtanen wrote: >>> > Fri, 28 Aug 2009 09:46:39 -0400, Neal Becker kirjoitti: >>> > >>> >> Robert Kern wrote: >>> >> >>> >>> On Thu, Aug 27, 2009 at 14:22, Christopher >>> >>> Barker wrote: >>> >>> >>> >>>> By the way -- is there something about py3k that changes all this? >>> >>>> Or is this just an opportunity to perhaps make some >>> >>>> backward-incompatible changes to numpy? >>> >>> >>> >>> Python 3 makes the promised change of int/int => float. >>> >> >>> >> Does that mean that we want numpy to do the same? ?I'm not so sure. 
>>> >> Sounds like opening a can of worms (numpy has more types to worry >>> >> about >>> >> than just int and float. ?If we start playing strange games we may >>> >> regret it.) >>> > >>> > I believe we want to. This is not really a strange trick: it's just >>> > that in Python 3, the operator / is true_division, and // is >>> > floor_division. I believe any worms released by this are mostly small >>> > and tasty... >>> > >>> > The main issue is probably just choosing an appropriate float return >>> > type, and personally I believe this should be same as numpy's default >>> > float. >>> >>> and getting the infs and nans as in true float division not as in >>> np.true_divide >>> >> >> Umm, good point. >> >> Chuck > > explicit is better than implicit. ?IMO, if I want int/int-> float, I should > ask for it explicitly, by casting the ints to float first (in numpy, that > would be using astype). if "/" has a completely different meaning in numpy than in python, it will be a lot of work keeping track of whether you are working with numpy ints or python ints, a/b = ? Josef > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From oliphant at enthought.com Fri Aug 28 11:06:27 2009 From: oliphant at enthought.com (Travis Oliphant) Date: Fri, 28 Aug 2009 10:06:27 -0500 Subject: [Numpy-discussion] Merging datetime branch In-Reply-To: References: <2DF15AC0-1F31-40E9-A55D-B51D75F63093@gmail.com> Message-ID: <0911B4A7-5BC1-4031-9FB9-B83A309350B6@enthought.com> On Aug 25, 2009, at 2:21 PM, Charles R Harris wrote: > > > On Tue, Aug 25, 2009 at 1:05 PM, Pierre GM > wrote: > > On Aug 25, 2009, at 1:59 PM, Skipper Seabold wrote: > > > On Tue, Aug 25, 2009 at 1:51 PM, Charles R > > Harris wrote: > >> Hi Travis, > >> > >> The new parse_datetime.c file contains a lot of c++ style comments > >> that > >> should be fixed. Also, the new test for mirr is failing on all the > >> buildbots. > > Comments sent to Marty who wrote the parse_datetime.c as part of his > GSoC: Marty, I guess you have a bit of cleaning up to do. > (As a snarky side note, Marty posted on the list a few weeks ago > asking just for this kind of comments... But all is well and better > late than never.) > > My bad, then, I missed it. So let me add > > 1) Because the default compilation is to include all the files in a > master file, the local defines should be undef'ed at the end to > avoid namespace pollution. > > 2) Never do this: > > if (bug) return -1; > > or this > > if (bug) {blah; blah;} > > do it this way > > if (bug) { > return -1; > } > > The last is more for Travis in the most recent commit ;) > Thanks for the reminders and the review. I've been busy on the datetime branch (trying to merge Marty's code which is where all the C++ comments come from). I've changed a lot of the stylistic differences in Marty's code (not sure if I've got them all). I doubt I will have time to be pedantic, but will welcome any such changes from others. While there are a couple of features that need to be added (coercion between two date-time datatypes is one big one), and a whole lot of tests that need to be added for the datetime support. I think it's ready to merge back to the mainline trunk so it can be a part of the development toward 1.4 Let me know if anyone has any big changes to trunk that are going to occur today. Thanks, -Travis -------------- next part -------------- An HTML attachment was scrubbed... 
From lists at informa.tiker.net  Fri Aug 28 11:03:44 2009
From: lists at informa.tiker.net (Andreas Klöckner)
Date: Fri, 28 Aug 2009 11:03:44 -0400
Subject: [Numpy-discussion] [ANN] PyOpenCL 0.90 - a Python interface for
	OpenCL
Message-ID: <200908281103.45128.lists@informa.tiker.net>

What is it?
-----------

PyOpenCL makes the industry-standard OpenCL compute abstraction available
from Python. PyOpenCL has been tested to work with AMD's and Nvidia's OpenCL
implementations and allows complete access to all features of the standard,
from a nice, Pythonic interface.

Where can I get it?
-------------------

Homepage: http://mathema.tician.de/software/pyopencl
Download: http://pypi.python.org/pypi/pyopencl
Documentation: http://documen.tician.de/pyopencl
Wiki: http://wiki.tiker.net/PyOpenCL

Main Features
-------------

* Object cleanup tied to lifetime of objects. This idiom, often called RAII
in C++, makes it much easier to write correct, leak- and crash-free code.

* Completeness. PyOpenCL puts the full power of OpenCL's API at your
disposal, if you wish. Every obscure get_info() query and all CL calls are
accessible.

* Automatic Error Checking. All errors are automatically translated into
Python exceptions.

* Speed. PyOpenCL's base layer is written in C++, so all the niceties above
are virtually free.

* Helpful, complete documentation.

If that sounds similar to PyOpenCL's sister project PyCUDA [1], that is not
entirely a coincidence. :)

License
-------

PyOpenCL is open-source under the MIT/X11 license and free for commercial,
academic, and private use.

Andreas

[1] http://mathema.tician.de/software/pycuda
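To give a flavor of the API, a minimal "double every element" sketch is
below. It is written against later PyOpenCL documentation, so treat the
exact names (create_some_context, enqueue_copy) as assumptions -- the
0.90 release may spell some of them differently:

import numpy as np
import pyopencl as cl

ctx = cl.create_some_context()        # pick an OpenCL device
queue = cl.CommandQueue(ctx)

a = np.arange(256, dtype=np.float32)
mf = cl.mem_flags
a_buf = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=a)
out_buf = cl.Buffer(ctx, mf.WRITE_ONLY, a.nbytes)

prg = cl.Program(ctx, """
    __kernel void twice(__global const float *a, __global float *out)
    {
        int i = get_global_id(0);
        out[i] = 2.0f * a[i];
    }
""").build()

prg.twice(queue, a.shape, None, a_buf, out_buf)

result = np.empty_like(a)
cl.enqueue_copy(queue, result, out_buf)  # older releases: enqueue_read_buffer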
From jh at physics.ucf.edu  Fri Aug 28 11:26:14 2009
From: jh at physics.ucf.edu (Joe Harrington)
Date: Fri, 28 Aug 2009 11:26:14 -0400
Subject: [Numpy-discussion] future directions
In-Reply-To:  (numpy-discussion-request@scipy.org)
References: 
Message-ID: 

> ...numpy clean-up...
> ...cruft...
> ...API breakage...
> ...etc....

At the risk of starting a flame war, the cleanest way out of the
legacy API trap is some level of fork, with the old code maintained
for some years while new uses (new users and new code by old users)
get done in the new package, which is named differently.

The reader will note that "fork" is a four-letter word, particularly
in this community.  However, there are two natural forklets coming
up.

The first is Python 3.0, which will necessitate some API changes.  A
numpy 2.0 that used Python 3.0 could have significant API breaks and
might include a cleanup.  Yet, there would still be resistance to
that level of API breakage, and I can't say I'd be on the breaking
side of that debate.  In any event, a Python 2.x branch would need to
be maintained unless the Python 3.x branch was completely backward
compatible, which I am not sure it will be.

I won't claim to be Yoda, but there is another.  There seems to be
consensus that something like ndarray should go into the Python
language.  There is also an increasing amount of talk about putting
numpy itself into the core "eventually", and increasing agreement
from the mainstream Python community, too.  The resistance from that
side seems to be 1) we (the Python community) don't have the
numerical expertise to maintain it, 2) it's not clean enough, and
sometimes 3) it's too big.

So, it may be worthwhile making a cleaned-up successor to numpy, with
a different name, that is intended for inclusion in Python.  That
would answer objection 2.  I think that we have demonstrated the
strength of this community sufficiently to say that we'll maintain
that part of Python as if it were our own, because it would be, and
that such participation would be sufficient.  The size problem, if
there is one, gets helped a lot by removal of all the cruft.
Anything useful from the original package that didn't go into Python
would go into the scipy package.  We'd maintain the old numpy for a
few years at least.

To do any of these right, I think we need to establish a more formal
mechanism for proposing, discussing, and deciding API issues, and
then stick to it like glue.  Failing that, we should not even attempt
a cleanup, as it will just be one more dirty fork.  Following the full
PEP procedure would be appropriate for developing a Python language
candidate package.

--jh--

From charlesr.harris at gmail.com  Fri Aug 28 11:45:22 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 28 Aug 2009 09:45:22 -0600
Subject: [Numpy-discussion] Merging datetime branch
In-Reply-To: <0911B4A7-5BC1-4031-9FB9-B83A309350B6@enthought.com>
References: <2DF15AC0-1F31-40E9-A55D-B51D75F63093@gmail.com>
	<0911B4A7-5BC1-4031-9FB9-B83A309350B6@enthought.com>
Message-ID: 

On Fri, Aug 28, 2009 at 9:06 AM, Travis Oliphant wrote:

> On Aug 25, 2009, at 2:21 PM, Charles R Harris wrote:
>
> On Tue, Aug 25, 2009 at 1:05 PM, Pierre GM wrote:
>
>> On Aug 25, 2009, at 1:59 PM, Skipper Seabold wrote:
>>
>> > On Tue, Aug 25, 2009 at 1:51 PM, Charles R
>> > Harris wrote:
>> >> Hi Travis,
>> >>
>> >> The new parse_datetime.c file contains a lot of c++ style comments
>> >> that
>> >> should be fixed. Also, the new test for mirr is failing on all the
>> >> buildbots.
>>
>> Comments sent to Marty who wrote the parse_datetime.c as part of his
>> GSoC: Marty, I guess you have a bit of cleaning up to do.
>> (As a snarky side note, Marty posted on the list a few weeks ago
>> asking for exactly this kind of comment... But all is well and better
>> late than never.)
>
> My bad, then, I missed it. So let me add
>
> 1) Because the default compilation is to include all the files in a master
> file, the local defines should be undef'ed at the end to avoid namespace
> pollution.
>
> 2) Never do this:
>
> if (bug) return -1;
>
> or this
>
> if (bug) {blah; blah;}
>
> do it this way
>
> if (bug) {
>     return -1;
> }
>
> The last is more for Travis in the most recent commit ;)
>
> Thanks for the reminders and the review.
>
> I've been busy on the datetime branch (trying to merge Marty's code, which
> is where all the C++ comments come from).  I've changed a lot of the
> stylistic differences in Marty's code (not sure if I've got them all). I
> doubt I will have time to be pedantic, but will welcome any such changes
> from others.
>
> While there are a couple of features that still need to be added (coercion
> between two date-time datatypes is one big one) and a whole lot of tests
> that need to be written for the datetime support, I think it's ready to
> merge back to the mainline trunk so it can be a part of the development
> toward 1.4.
>
> Let me know if anyone has any big changes to trunk that are going to occur
> today.
>

I don't plan on any, but there have been changes since early June... Please
be careful to have the ifdef NPY_PY3K bits in the type object
initializations.
Thanks

Chuck

From lciti at essex.ac.uk  Fri Aug 28 11:47:52 2009
From: lciti at essex.ac.uk (Citi, Luca)
Date: Fri, 28 Aug 2009 16:47:52 +0100
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
	<3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>
	<4A96F91F.20206@noaa.gov>
	<3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com>
Message-ID: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E8C@sernt14.essex.ac.uk>

> The main issue is probably just choosing an appropriate float return
> type, and personally I believe this should be same as numpy's default
> float.
I completely agree.

Maybe we could let the user decide whether to use a different type.
It is already somehow possible through the "out" argument.
>>> np.true_divide(a, b, np.empty(a.shape, dtype=np.float32))
but clearly a bit clumsy.

Alan Isaac suggested:
"""
Is there a reason not to make the `out` argument a keyword argument
and then also alternatively allow a dtype specification?
"""

As one needs to specify EITHER the output array OR the type, is it
possible to use a type as the "out" argument?  Something like:
>>> np.true_divide(a, b, np.float32)
that, instead of raising "return arrays must be of ArrayType", could,
if "out" is a valid type, create the corresponding array, use it and
return it.
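Luca's idea is easy to prototype in pure Python.  A minimal sketch (the
name true_divide_as and its dispatch rule are illustrative only, not an
existing numpy API):

import numpy as np

def true_divide_as(a, b, out_or_dtype=None):
    # Accept either an output array (numpy's usual `out`) or a type.
    if isinstance(out_or_dtype, (type, np.dtype)):
        shape = np.broadcast(np.asarray(a), np.asarray(b)).shape
        return np.true_divide(a, b, np.empty(shape, dtype=out_or_dtype))
    if out_or_dtype is None:
        return np.true_divide(a, b)
    return np.true_divide(a, b, out_or_dtype)

# e.g. true_divide_as(np.arange(4), 2, np.float32).dtype -> float32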
From Chris.Barker at noaa.gov  Fri Aug 28 12:15:53 2009
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Fri, 28 Aug 2009 09:15:53 -0700
Subject: [Numpy-discussion] future directions
In-Reply-To: 
References: 
Message-ID: <4A9802B9.3080705@noaa.gov>

Joe Harrington wrote:
> However, there are two natural forklets coming up.
>
> The first is Python 3.0, which will necessitate some API changes.

Absolutely! This seems like a no-brainer. I don't think we are talking
about really major changes to the numpy API anyway, generally clean-up,
and there is no way anyone is going to get their Py2 code working on Py3
without tweaking it anyway, so this is the time to do it.

Like it or not, Python2 and Python3 are both going to be around for a
while, so numpy2 and numpy3 will also, but the broader Python community
made a choice to make a transition, so we should take the opportunity to
do so as well. We'll have plenty to argue about even if we do decide
that backward compatibility is not a goal!

> There seems to be consensus that something like ndarray should go into
> the Python language.

I know I think so, but I think one of the big issues is how much of
numpy goes in. The basic nd data container with slicing and dicing at
least, but what about ufuncs? and ???

> The resistance
> from that side seems to be 1) we (the Python community) don't have the
> numerical expertise to maintain it, 2) it's not clean enough, and
> sometimes 3) it's too big.

and 4) the pace of change in numpy is too great. That last one may be
soluble, as I think that the nd-array object itself is a lot more stable
than the rest of numpy.

> So, it may be worthwhile making a cleaned-up successor to numpy, with
> a different name, that is intended for inclusion in Python.

I think the cleaned-up successor is a great idea with or without this
intention -- but I do like the idea -- it might help guide what really
belongs in numpy, and what in add-on packages.

> Following the full
> PEP procedure

or a parallel NPEP system.

If nothing else, it would be nice to have better documentation for
decisions than the mailing list archive and occasional wiki pages.

long live numpy3k!

-Chris

-- 
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From denis-bz-py at t-online.de  Fri Aug 28 12:14:36 2009
From: denis-bz-py at t-online.de (denis bzowy)
Date: Fri, 28 Aug 2009 16:14:36 +0000 (UTC)
Subject: [Numpy-discussion] a[j, k], clipping j k to the edge if they're
	1 off ?
Message-ID: 

Folks,
  I want to index a[j,k], clipping j or k to the edge if they're 1 off --

def aget( a, j, k ):
    """ -> a[j,k] or a[edge] """
    # try:
    #     return a[j,k]  -- nope, -1
    # except IndexError:
    m, n = a.shape
    return a[ min(max(j, 0), m-1), min(max(k, 0), n-1) ]

This works but is both ugly and 5x slower than plain a[j][k].
Is there a better way ?
(Sorry if this is a duplicate, must come up often.)

cheers
  -- denis

From robert.kern at gmail.com  Fri Aug 28 12:18:41 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 28 Aug 2009 09:18:41 -0700
Subject: [Numpy-discussion] future directions
In-Reply-To: <4A9802B9.3080705@noaa.gov>
References: <4A9802B9.3080705@noaa.gov>
Message-ID: <3d375d730908280918i4ea79b8foee043ed1bc8d0ec@mail.gmail.com>

On Fri, Aug 28, 2009 at 09:15, Christopher Barker wrote:
> Joe Harrington wrote:
>> However, there are two natural forklets coming up.
>>
>> The first is Python 3.0, which will necessitate some API changes.
>
> Absolutely! This seems like a no-brainer. I don't think we are talking
> about really major changes to the numpy API anyway, generally clean-up,
> and there is no way anyone is going to get their Py2 code working on Py3
> without tweaking it anyway, so this is the time to do it.

No, it is the *worst* time to do it. We have been asked by the Python
developer team *not* to use the Python 3 transition to break all kinds
of other backwards compatibility. If we and other libraries do that,
people will simply not transition to Python 3. Or they will transition
to another language that they think is more stable.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco

From robert.kern at gmail.com  Fri Aug 28 12:28:18 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 28 Aug 2009 09:28:18 -0700
Subject: [Numpy-discussion] a[j, k], clipping j k to the edge if they're
	1 off ?
In-Reply-To: 
References: 
Message-ID: <3d375d730908280928y13160d1bo2042b2f1e33c7552@mail.gmail.com>

On Fri, Aug 28, 2009 at 09:14, denis bzowy wrote:
> Folks,
>
>  I want to index a[j,k], clipping j or k to the edge if they're 1 off --
>
> def aget( a, j, k ):
>     """ -> a[j,k] or a[edge] """
>     # try:
>     #     return a[j,k]  -- nope, -1
>     # except IndexError:
>     m, n = a.shape
>     return a[ min(max(j, 0), m-1), min(max(k, 0), n-1) ]
>
> This works but is both ugly and 5x slower than plain a[j][k].
> Is there a better way ?

Nope.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco
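For scalar j, k the min/max clamp is about as direct as it gets, but if
the indices come as arrays the same idea vectorizes with np.clip -- a
small sketch:

import numpy as np

def aget_clipped(a, j, k):
    # Clamp (possibly array-valued) indices to the valid range,
    # then fancy-index; works for scalars too.
    m, n = a.shape
    return a[np.clip(j, 0, m - 1), np.clip(k, 0, n - 1)]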
From charlesr.harris at gmail.com  Fri Aug 28 12:31:44 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 28 Aug 2009 10:31:44 -0600
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: <3DA3B328CBC48B4EBB88484B8A5EA19106AF9E8C@sernt14.essex.ac.uk>
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
	<3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>
	<4A96F91F.20206@noaa.gov>
	<3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com>
	<3DA3B328CBC48B4EBB88484B8A5EA19106AF9E8C@sernt14.essex.ac.uk>
Message-ID: 

On Fri, Aug 28, 2009 at 9:47 AM, Citi, Luca wrote:

> > The main issue is probably just choosing an appropriate float return
> > type, and personally I believe this should be same as numpy's default
> > float.
> I completely agree.
>
> Maybe we could let the user decide whether to use a different type.
> It is already somehow possible through the "out" argument.
> >>> np.true_divide(a, b, np.empty(a.shape, dtype=np.float32))
> but clearly a bit clumsy.
>

The numpy true_divide function can be changed at runtime through the
python interface; one isn't stuck with the defaults for any of the
python parsed numeric methods. However, changing the defaults might
lead to code portability problems. I think it was a bad idea to have
that facility...

Chuck

From oliphant at enthought.com  Fri Aug 28 12:36:28 2009
From: oliphant at enthought.com (Travis Oliphant)
Date: Fri, 28 Aug 2009 11:36:28 -0500
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: 
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
	<3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>
	<4A96F91F.20206@noaa.gov>
	<3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com>
	<3DA3B328CBC48B4EBB88484B8A5EA19106AF9E8C@sernt14.essex.ac.uk>
Message-ID: <4E47E895-D7CB-4BFA-919F-5584D0C6A4A2@enthought.com>

On Aug 28, 2009, at 11:31 AM, Charles R Harris wrote:

> On Fri, Aug 28, 2009 at 9:47 AM, Citi, Luca wrote:
>
> > The main issue is probably just choosing an appropriate float return
> > type, and personally I believe this should be same as numpy's
> default
> > float.
> I completely agree.
>
> Maybe we could let the user decide whether to use a different type.
> It is already somehow possible through the "out" argument.
> >>> np.true_divide(a, b, np.empty(a.shape, dtype=np.float32))
> but clearly a bit clumsy.
>
> The numpy true_divide function can be changed at runtime through the
> python interface; one isn't stuck with the defaults for any of the
> python parsed numeric methods. However, changing the defaults might
> lead to code portability problems. I think it was a bad idea to have
> that facility...

I see you have not been converted to the power of the with statement to
create local environments... --- probably it's a good thing you haven't.

-Travis
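The with-statement idiom Travis alludes to presumably means context
managers along the lines of np.errstate, which scope numeric-error
handling to a block (np.errstate is real numpy API; the snippet itself
is just a sketch):

import numpy as np

with np.errstate(divide='ignore'):
    # inside the block, float division by zero quietly yields inf
    r = np.array([1.0]) / np.array([0.0])
# on exit, the previous error-handling settings are restored
print r          # -> [ inf]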
From charlesr.harris at gmail.com  Fri Aug 28 12:40:51 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 28 Aug 2009 10:40:51 -0600
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: <4E47E895-D7CB-4BFA-919F-5584D0C6A4A2@enthought.com>
References: <3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>
	<4A96F91F.20206@noaa.gov>
	<3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com>
	<3DA3B328CBC48B4EBB88484B8A5EA19106AF9E8C@sernt14.essex.ac.uk>
	<4E47E895-D7CB-4BFA-919F-5584D0C6A4A2@enthought.com>
Message-ID: 

On Fri, Aug 28, 2009 at 10:36 AM, Travis Oliphant wrote:

> On Aug 28, 2009, at 11:31 AM, Charles R Harris wrote:
>
> On Fri, Aug 28, 2009 at 9:47 AM, Citi, Luca wrote:
>
>> > The main issue is probably just choosing an appropriate float return
>> > type, and personally I believe this should be same as numpy's default
>> > float.
>> I completely agree.
>>
>> Maybe we could let the user decide whether to use a different type.
>> It is already somehow possible through the "out" argument.
>> >>> np.true_divide(a, b, np.empty(a.shape, dtype=np.float32))
>> but clearly a bit clumsy.
>>
>
> The numpy true_divide function can be changed at runtime through the python
> interface; one isn't stuck with the defaults for any of the python parsed
> numeric methods. However, changing the defaults might lead to code
> portability problems. I think it was a bad idea to have that facility...
>
> I see you have not been converted to the power of the with statement to
> create local environments... --- probably it's a good thing you haven't.
>

One of the problems with extensible languages is that, after extending,
they become different languages. It's the software equivalent of the tower
of Babel.

Chuck

From charlesr.harris at gmail.com  Fri Aug 28 12:46:17 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 28 Aug 2009 10:46:17 -0600
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: <1cd32cbb0908280708l3501f6ffnc29ead80111e75ca@mail.gmail.com>
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
	<3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>
	<4A96F91F.20206@noaa.gov>
	<3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com>
	<1cd32cbb0908280708l3501f6ffnc29ead80111e75ca@mail.gmail.com>
Message-ID: 

On Fri, Aug 28, 2009 at 8:08 AM, wrote:

> On Fri, Aug 28, 2009 at 9:55 AM, Pauli Virtanen wrote:
> > Fri, 28 Aug 2009 09:46:39 -0400, Neal Becker kirjoitti:
> >
> >> Robert Kern wrote:
> >>
> >>> On Thu, Aug 27, 2009 at 14:22, Christopher
> >>> Barker wrote:
> >>>
> >>>> By the way -- is there something about py3k that changes all this? Or
> >>>> is this just an opportunity to perhaps make some backward-incompatible
> >>>> changes to numpy?
> >>>
> >>> Python 3 makes the promised change of int/int => float.
> >>
> >> Does that mean that we want numpy to do the same? I'm not so sure.
> >> Sounds like opening a can of worms (numpy has more types to worry about
> >> than just int and float. If we start playing strange games we may
> >> regret it.)
> >
> > I believe we want to. This is not really a strange trick: it's just that
> > in Python 3, the operator / is true_division, and // is floor_division.
> > I believe any worms released by this are mostly small and tasty...
> >
> > The main issue is probably just choosing an appropriate float return
> > type, and personally I believe this should be same as numpy's default
> > float.
>
> and getting the infs and nans as in true float division not as in
> np.true_divide
>

Note that currently true_divide returns zeros in these cases and attempts --
unsuccessfully -- to raise a zero division error; that is what python does.
So if we make this change there will be a divergence from python behaviour.
However, arrays are different from scalars and I think we should make this
change.

Chuck

From oliphant at enthought.com  Fri Aug 28 12:47:12 2009
From: oliphant at enthought.com (Travis Oliphant)
Date: Fri, 28 Aug 2009 11:47:12 -0500
Subject: [Numpy-discussion] Merge of date-time branch completed
Message-ID: <461AEC55-EB92-4EE5-9673-F33BB33FBEEF@enthought.com>

Hello folks,

In keeping with the complaint that the pace of NumPy development is
too fast, I've finished the merge of the datetime branch to the
core.  The trunk builds and all the (previous) tests pass for me.

There are several tasks remaining to be done (the current status is
definitely still alpha):

* write many unit tests for the desired behavior (especially for the
many different kinds of dates supported)
* finish coercion between datetimes and timedeltas with different
frequencies
* improve the ufuncs that support datetime and timedelta so that they
look at the frequency information.
* improve the way datetime arrays print
* probably several other things that I haven't listed

Because of the last point, I will spend my next effort on the work of
updating the proposal to more clearly define some of the expected
behaviors and write documentation about the expected behavior of the
new features.

Help, reviews, criticisms, suggestions, fixes, and patches are most
welcome.

Best regards,

-Travis

From dsdale24 at gmail.com  Fri Aug 28 12:53:24 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Fri, 28 Aug 2009 12:53:24 -0400
Subject: [Numpy-discussion] Merge of date-time branch completed
In-Reply-To: <461AEC55-EB92-4EE5-9673-F33BB33FBEEF@enthought.com>
References: <461AEC55-EB92-4EE5-9673-F33BB33FBEEF@enthought.com>
Message-ID: 

On Fri, Aug 28, 2009 at 12:47 PM, Travis Oliphant wrote:
>
> Hello folks,
>
> In keeping with the complaint that the pace of NumPy development is too
> fast, I've finished the merge of the datetime branch to the core. The
> trunk builds and all the (previous) tests pass for me.
>
> There are several tasks remaining to be done (the current status is
> definitely still alpha):
> * write many unit tests for the desired behavior (especially for the many
> different kinds of dates supported)
> * finish coercion between datetimes and timedeltas with different
> frequencies
> * improve the ufuncs that support datetime and timedelta so that they look
> at the frequency information.

I haven't been following development on datetime. Can you use
__array_prepare__ and __array_wrap__ to do this? __array_prepare__ was
committed to the trunk during the scipy sprints.

Darren
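For context, __array_wrap__ is the subclass hook Darren mentions: ufuncs
call it on the relevant input so that the output comes back wrapped as
the subclass.  A minimal sketch (the hook is real numpy API; the class
here is purely illustrative):

import numpy as np

class Wrapped(np.ndarray):
    def __array_wrap__(self, out_arr, context=None):
        # ufuncs hand us their raw ndarray result (plus an optional
        # (ufunc, args, output-index) context); re-view it as our type
        return out_arr.view(type(self))

w = np.arange(3).view(Wrapped)
print type(np.sin(w))     # -> <class '__main__.Wrapped'>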
From josef.pktd at gmail.com  Fri Aug 28 12:58:18 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Fri, 28 Aug 2009 12:58:18 -0400
Subject: [Numpy-discussion] What type should / return in python 3k when
	applied to two integer types?
In-Reply-To: 
References: <3d375d730908271227k3cea13b8ledd88a6ff8f21172@mail.gmail.com>
	<3d375d730908271246k509f1404s8c107e7e8cbb74b9@mail.gmail.com>
	<4A96F91F.20206@noaa.gov>
	<3d375d730908271426m7068c25fj10078a1b5546d8a9@mail.gmail.com>
	<1cd32cbb0908280708l3501f6ffnc29ead80111e75ca@mail.gmail.com>
Message-ID: <1cd32cbb0908280958h20bccd0cy9a250a54cd5d9bd2@mail.gmail.com>

On Fri, Aug 28, 2009 at 12:46 PM, Charles R Harris wrote:
>
> On Fri, Aug 28, 2009 at 8:08 AM, wrote:
>>
>> On Fri, Aug 28, 2009 at 9:55 AM, Pauli Virtanen wrote:
>> > Fri, 28 Aug 2009 09:46:39 -0400, Neal Becker kirjoitti:
>> >
>> >> Robert Kern wrote:
>> >>
>> >>> On Thu, Aug 27, 2009 at 14:22, Christopher
>> >>> Barker wrote:
>> >>>
>> >>>> By the way -- is there something about py3k that changes all this? Or
>> >>>> is this just an opportunity to perhaps make some
>> >>>> backward-incompatible
>> >>>> changes to numpy?
>> >>>
>> >>> Python 3 makes the promised change of int/int => float.
>> >>
>> >> Does that mean that we want numpy to do the same? I'm not so sure.
>> >> Sounds like opening a can of worms (numpy has more types to worry about
>> >> than just int and float. If we start playing strange games we may
>> >> regret it.)
>> >
>> > I believe we want to. This is not really a strange trick: it's just that
>> > in Python 3, the operator / is true_division, and // is floor_division.
>> > I believe any worms released by this are mostly small and tasty...
>> >
>> > The main issue is probably just choosing an appropriate float return
>> > type, and personally I believe this should be same as numpy's default
>> > float.
>>
>> and getting the infs and nans as in true float division not as in
>> np.true_divide
>
> Note that currently true_divide returns zeros in these cases and attempts --
> unsuccessfully -- to raise a zero division error; that is what python does.
> So if we make this change there will be a divergence from python behaviour.
> However, arrays are different from scalars and I think we should make this
> change.

The difference is already there in the floating point operations.
Since python doesn't know about inf and nans, I was switching
functions to use the numpy version for floating point operations to
have robust results instead of exceptions (in stats.distributions)

>>> 0.**(-1)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
    0.**(-1)
ZeroDivisionError: 0.0 cannot be raised to a negative power

>>> np.power(0., -1)
inf
>>> np.array(0.)**(-1)
inf

and I would expect that a numpy "/" follows the numpy floating point
definitions (and not the missing inf and nan behavior of python)

Josef

From charlesr.harris at gmail.com  Fri Aug 28 13:00:18 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 28 Aug 2009 11:00:18 -0600
Subject: [Numpy-discussion] Merge of date-time branch completed
In-Reply-To: 
References: <461AEC55-EB92-4EE5-9673-F33BB33FBEEF@enthought.com>
Message-ID: 

On Fri, Aug 28, 2009 at 10:53 AM, Darren Dale wrote:

> On Fri, Aug 28, 2009 at 12:47 PM, Travis Oliphant
> wrote:
> >
> > Hello folks,
> >
> > In keeping with the complaint that the pace of NumPy development is too
> > fast, I've finished the merge of the datetime branch to the core. The
> > trunk builds and all the (previous) tests pass for me.
> > There are several tasks remaining to be done (the current status is
> > definitely still alpha):
> > * write many unit tests for the desired behavior (especially for the
> many
> > different kinds of dates supported)
> > * finish coercion between datetimes and timedeltas with different
> > frequencies
> > * improve the ufuncs that support datetime and timedelta so that they
> look
> > at the frequency information.
>
> I haven't been following development on datetime. Can you use
> __array_prepare__ and __array_wrap__ to do this? __array_prepare__ was
> committed to the trunk during the scipy sprints.
>

There looks to be some overlap there. The datetime types are just int64
under the covers and I've wondered if derived types would have sufficed
with the addition of __array_prepare__. That said, I'm not familiar with
the datetime functionality and there are likely other considerations.

Chuck

From jh at physics.ucf.edu  Fri Aug 28 13:13:38 2009
From: jh at physics.ucf.edu (Joe Harrington)
Date: Fri, 28 Aug 2009 13:13:38 -0400
Subject: [Numpy-discussion] future directions
In-Reply-To:  (numpy-discussion-request@scipy.org)
References: 
Message-ID: 

Christopher Barker wrote:
>> Following the full
>> PEP procedure
> or a parallel NPEP system.

Actually, I originally intended just to mean "follow the procedure"
not "do it in their system".  But, in thinking about it, if it's
compatible with their system to develop a whole subpackage in their
procedure space, we should.  Ultimately the decision to include
something in Python is one they will make, and such decisions are
largely social ones.  The more they have seen, been included in, and
had a chance to comment on our stuff, the more bought in they are and
the less chance there is of objection to it.

As for the rapid pace of change, it would be feasible to agree on a
core set of functionality that should go in the main language, to
look at that with our decade of experience, and decide on a design
and implementation that would be stable relative to what we have now.
Mostly the code would just transfer; what people complain about is
fluff, underlying package structure, API inconsistency, and turds.  A
few things might have API changes (like np.median() not long ago).
There would be some significant issues to decide to do or not to do
(like generalized ufuncs not long ago), but not many.

I am thinking small here, though others will differ.  No financials,
nothing now deprecated, no fromnumeric namespace, nothing that is
likely to evolve fast.  Simple, clean, organized.  The rest goes into
scipy.

--jh--

From lasagnadavide at gmail.com  Fri Aug 28 13:13:56 2009
From: lasagnadavide at gmail.com (davide lasagna)
Date: Fri, 28 Aug 2009 19:13:56 +0200
Subject: [Numpy-discussion] Iterate over an array
Message-ID: 

Hi all,

I've got a 2d array and I want to iterate over its columns in a
"pythonic way".  This is what I have in mind: please consider this
snippet:

#################################################
import numpy as np

array = np.random.standard_normal( (10,10) )

for column in array.some_column_method():
    column = do_something()
#################################################

The trivial way to do the for loop is:

#################################################
for i in range( array.shape[1] ):
    array[:, i] = do_something()
#################################################

Is there any way to do what I have in mind?  Can I obtain
"pythonically" a list of column arrays?

Any help is appreciated.
Cheers..

Davide Lasagna
Dip. Ingegneria Aerospaziale
Politecnico di Torino
Italia

From robert.kern at gmail.com  Fri Aug 28 13:17:55 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 28 Aug 2009 10:17:55 -0700
Subject: [Numpy-discussion] Iterate over an array
In-Reply-To: 
References: 
Message-ID: <3d375d730908281017r11da8eb3rede89c66da4aec8b@mail.gmail.com>

On Fri, Aug 28, 2009 at 10:13, davide lasagna wrote:
> Hi all,
>
> I've got a 2d array and I want to iterate over its columns in a "pythonic
> way".  This is what I have in mind: please consider this snippet:
>
> #################################################
> import numpy as np
>
> array = np.random.standard_normal( (10,10) )
>
> for column in array.some_column_method():
>     column = do_something()
> #################################################
>
> The trivial way to do the for loop is:
>
> #################################################
> for i in range( array.shape[1] ):
>     array[:, i] = do_something()
> #################################################
>
> Is there any way to do what I have in mind?  Can I obtain "pythonically"
> a list of column arrays?

for column in array.transpose():
    column[:] = do_something()

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco
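It is worth spelling out why the answer assigns through column[:]:
iterating over the transpose yields views into the original array, so
slice assignment writes back into it, while a plain rebinding of the
loop variable would not.  A small check (illustrative session):

>>> import numpy as np
>>> a = np.zeros((2, 3))
>>> for i, col in enumerate(a.transpose()):
...     col[:] = i          # in-place: modifies a through the view
...
>>> a
array([[ 0.,  1.,  2.],
       [ 0.,  1.,  2.]])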
From charlesr.harris at gmail.com  Fri Aug 28 13:39:39 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 28 Aug 2009 11:39:39 -0600
Subject: [Numpy-discussion] Merge of date-time branch completed
In-Reply-To: <461AEC55-EB92-4EE5-9673-F33BB33FBEEF@enthought.com>
References: <461AEC55-EB92-4EE5-9673-F33BB33FBEEF@enthought.com>
Message-ID: 

On Fri, Aug 28, 2009 at 10:47 AM, Travis Oliphant wrote:

>
> Hello folks,
>
> In keeping with the complaint that the pace of NumPy development is too
> fast, I've finished the merge of the datetime branch to the core. The
> trunk builds and all the (previous) tests pass for me.
>
> There are several tasks remaining to be done (the current status is
> definitely still alpha):
>
> * write many unit tests for the desired behavior (especially for the many
> different kinds of dates supported)
> * finish coercion between datetimes and timedeltas with different
> frequencies
> * improve the ufuncs that support datetime and timedelta so that they look
> at the frequency information.
> * improve the way datetime arrays print
> * probably several other things that I haven't listed
>
> Because of the last point, I will spend my next effort on the work of
> updating the proposal to more clearly define some of the expected behaviors
> and write documentation about the expected behavior of the new features.
>
> Help, reviews, criticisms, suggestions, fixes, and patches are most
> welcome.
>

Umm, replacing the previous code 'M' by '.' in generate_umath is a bit
obscure. Isn't there a better choice than '.' ?

Please make the multiline comments conform to the standard. I spend a lot
of time fixing these up... And you broke some I already fixed. Could you
break up the long lines in the repeats while you are at it? I'm doing that
too, but every bit helps.

What does UFUNC_OBJ_NEEDS_API do?

Things like

    if (fromtype == PyArray_DATETIME || fromtype == PyArray_TIMEDELTA ||
        totype == PyArray_DATETIME || totype == PyArray_TIMEDELTA) {

are more readable if the trailing || is moved to the head of the line:

    if (fromtype == PyArray_DATETIME
        || fromtype == PyArray_TIMEDELTA
        || totype == PyArray_DATETIME
        || totype == PyArray_TIMEDELTA) {

Hmm, "can also have an additional key called "metadata" which can be any
dictionary", is this new functionality? What does it do?

There are a lot of changes like this: !(loop->obj & UFUNC_OBJ_ISOBJECT).
What is the meaning of UFUNC_OBJ_ISOBJECT and why is this test necessary?

Chuck

From dagss at student.matnat.uio.no  Fri Aug 28 13:52:22 2009
From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn)
Date: Fri, 28 Aug 2009 19:52:22 +0200
Subject: [Numpy-discussion] future directions
In-Reply-To: <20090827214955.GG2963@zita2.kokkinizita.net>
References: <20090827214955.GG2963@zita2.kokkinizita.net>
Message-ID: <4A981956.2040607@student.matnat.uio.no>

Fons Adriaensen wrote:
> Some weeks ago there was a post on this list requesting feedback
> on possible future directions for numpy. As I was quite busy at that
> time I'll reply to it now.
>
> My POV is that of a novice user, who at the same time wants quite
> badly to use the numpy framework for his numerical work, which in
> this case is related to (some rather advanced) multichannel audio
> processing.
>
I'm reluctantly joining the discussion... (reluctant because, as
interesting as these discussions may be, (relatively) simple things
that everyone agrees about, like Python 3 compatibility and PEP 3118
support, are still some ways off. Agreeing on things doesn't make it
happen.)

> From that POV, I'd suggest the following:
>
> 1. Adopt an object based on Python-3's buffer protocol as the
> basic array type. It's immensely more powerful than ndarray,
> while at the same time it's close enough to ndarray to allow
> a gradual adoption.
>
It's not immensely more powerful. It allows pointers, that's right, but
that's primarily for exporting data from data providers... For things
like "pointers to images" (which PEP 3118 could be used for), Python
lists usually work better anyway because they can be appended.

I think the whole idea of the protocol is that you can start passing
around data in *various* containers. Adopting a new array type as the
"basic array type" basically defeats this purpose. My way of thinking
of it is: the focus shifts over to the NumPy library providing ufuncs,
not the array container. I think we'll in some years be doing

np.sin(x, out=y)

without x or y being ndarrays at all.

One conclusion: All of this might call for a new library which tries to
focus more and support a wider set of memory layouts. But, well, it's
just a matter of going ahead and doing that! -- but I don't think NumPy
can be turned into it, nor do the NumPy developers likely have time to
spare for that. If you wait a year, such a library might be a 100-liner
in Cython :-) Actually, right now I think the best way of getting such
a library implemented is to help out on Cython's array features, then
export Cython's arrays to Python-space in a library.

Secondly, one BIG gotcha people should be aware of here is that PEP
3118 supports "fancy indexing as views". I.e. with an object based on
PEP 3118's memory model you could potentially do

b = a[a == 2]
b[0] = 3

and have that change a!
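(For contrast, in numpy as it stands boolean fancy indexing returns a
copy, so the same two lines leave a untouched -- a quick check, with
output per a 2009-era numpy:)

>>> import numpy as np
>>> a = np.array([1, 2, 2])
>>> b = a[a == 2]      # a fresh copy under numpy's current semantics
>>> b[0] = 3
>>> a                  # the original is unchanged
array([1, 2, 2])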
I believe these semantics to be superior myself (because you can always
do "b = a[a==2].copy()" to get NumPy's behaviour). But it does raise
some interesting questions about consistency vs. subtle API breakage etc.

> 2. Adopting that format will make it even more important to
> clearly define in which cases data gets copied and when not.
> This should be based on some simple rules that can be evaluated
> by a code author without requiring a lookup in the reference
> docs each time.
>
I think NumPy's already doing quite well here, except for the case of
fancy indexing as mentioned above. Cleaning up various incarnations of
"reshape" etc. to be consistent here would be good too (my vote is for
never doing any automatic copying in methods like reshape, but I
actually haven't checked what the semantics ended up being in the end).

(BTW, I was recently observed saying I might chip in and implement PEP
3118 for NumPy around November. If anyone wants to beat me to it then
I'd be happy of course.)

Dag Sverre

From d_l_goldsmith at yahoo.com  Fri Aug 28 14:15:53 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Fri, 28 Aug 2009 11:15:53 -0700 (PDT)
Subject: [Numpy-discussion] future directions
In-Reply-To: <4A9802B9.3080705@noaa.gov>
Message-ID: <86557.24473.qm@web52106.mail.re2.yahoo.com>

--- On Fri, 8/28/09, Christopher Barker wrote:
> long live numpy3k!
>
> -Chris

Or at least until Py4K makes us "fork" again. ;)

DG

From robert.kern at gmail.com  Fri Aug 28 16:23:20 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 28 Aug 2009 13:23:20 -0700
Subject: [Numpy-discussion] Merge of date-time branch completed
In-Reply-To: 
References: <461AEC55-EB92-4EE5-9673-F33BB33FBEEF@enthought.com>
Message-ID: <3d375d730908281323jff942cbx8e1ef357584eda6e@mail.gmail.com>

On Fri, Aug 28, 2009 at 10:39, Charles R Harris wrote:
> What does UFUNC_OBJ_NEEDS_API do?

It specifies that the ufunc loops need access to the Python C API, so
the dispatcher should not release the GIL before running the loop.

> Hmm, "can also have an additional key called "metadata" which can be any
> dictionary", is this new functionality? What does it do?

It adds a dictionary to the dtype. This can potentially be used for a
couple of applications, but here it is used to hold the datetime
frequency information.

> There are a lot of changes like this: !(loop->obj & UFUNC_OBJ_ISOBJECT).
> What is the meaning of UFUNC_OBJ_ISOBJECT and why is this test necessary?

Previously, loop->obj was an int, but only took 0 or 1 to specify that
the ufunc loop was for object dtypes. This was used for two distinct
things: reference counting and keeping hold of the GIL. The datetime
loops require the latter but not the former. loop->obj is now a bitset
which can be 0, UFUNC_OBJ_ISOBJECT, UFUNC_OBJ_NEEDS_API, or
UFUNC_OBJ_ISOBJECT|UFUNC_OBJ_NEEDS_API. The tests were modified to
test for the most specific required flag(s) for the particular
operation it was going to do.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco

From oliphant at enthought.com  Fri Aug 28 16:43:34 2009
From: oliphant at enthought.com (Travis Oliphant)
Date: Fri, 28 Aug 2009 15:43:34 -0500
Subject: [Numpy-discussion] Merge of date-time branch completed
In-Reply-To: 
References: <461AEC55-EB92-4EE5-9673-F33BB33FBEEF@enthought.com>
Message-ID: 

On Aug 28, 2009, at 12:39 PM, Charles R Harris wrote:

> On Fri, Aug 28, 2009 at 10:47 AM, Travis Oliphant
> wrote:
>
> Hello folks,
>
> In keeping with the complaint that the pace of NumPy development is
> too fast, I've finished the merge of the datetime branch to the
> core. The trunk builds and all the (previous) tests pass for me.
>
> There are several tasks remaining to be done (the current status is
> definitely still alpha):
>
> * write many unit tests for the desired behavior (especially for
> the many different kinds of dates supported)
> * finish coercion between datetimes and timedeltas with different
> frequencies
> * improve the ufuncs that support datetime and timedelta so that
> they look at the frequency information.
> * improve the way datetime arrays print
> * probably several other things that I haven't listed
>
> Because of the last point, I will spend my next effort on the work
> of updating the proposal to more clearly define some of the expected
> behaviors and write documentation about the expected behavior of
> the new features.
>
> Help, reviews, criticisms, suggestions, fixes, and patches are
> most welcome.
>
> Umm, replacing the previous code 'M' by '.' in generate_umath is a
> bit obscure. Isn't there a better choice than '.' ?
>
> Please make the multiline comments conform to the standard. I spend
> a lot of time fixing these up... And you broke some I already fixed.

Sorry about that.  Can you remind me what the standard is?

Thanks,

-Travis

--
Travis Oliphant
Enthought Inc.
1-512-536-1057
http://www.enthought.com
oliphant at enthought.com

From charlesr.harris at gmail.com  Fri Aug 28 17:03:28 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 28 Aug 2009 15:03:28 -0600
Subject: [Numpy-discussion] Merge of date-time branch completed
In-Reply-To: 
References: <461AEC55-EB92-4EE5-9673-F33BB33FBEEF@enthought.com>
Message-ID: 

On Fri, Aug 28, 2009 at 2:43 PM, Travis Oliphant wrote:

> On Aug 28, 2009, at 12:39 PM, Charles R Harris wrote:
>
> On Fri, Aug 28, 2009 at 10:47 AM, Travis Oliphant wrote:
>
>> Hello folks,
>>
>> In keeping with the complaint that the pace of NumPy development is too
>> fast, I've finished the merge of the datetime branch to the core. The
>> trunk builds and all the (previous) tests pass for me.
>>
>> There are several tasks remaining to be done (the current status is
>> definitely still alpha):
>>
>> * write many unit tests for the desired behavior (especially for the many
>> different kinds of dates supported)
>> * finish coercion between datetimes and timedeltas with different
>> frequencies
>> * improve the ufuncs that support datetime and timedelta so that they look
>> at the frequency information.
>> * improve the way datetime arrays print
>> * probably several other things that I haven't listed
>>
>> Because of the last point, I will spend my next effort on the work of
>> updating the proposal to more clearly define some of the expected behaviors
>> and write documentation about the expected behavior of the new features.
>>
>> Help, reviews, criticisms, suggestions, fixes, and patches are most
>> welcome.
>>
> Umm, replacing the previous code 'M' by '.' in generate_umath is a
> bit obscure. Isn't there a better choice than '.' ?
>
> Please make the multiline comments conform to the standard. I spend
> a lot of time fixing these up... And you broke some I already fixed.
>
> Sorry about that.  Can you remind me what the standard is?
>

/*
 * blah, blah
 * blah, blah
 */

It makes the extent of the comment more blatant, especially if it is a
long comment, and separates it from the code. No more looking for that
elusive */. For code reading/maintenance blatant is good.

How about 'P' instead of '.' ? I'll guess that 'M' originally stood for
method and that's gone, but 'P' follows 'O', which isn't any sort of
argument but at least 'P' is easier to see on the page ;)

Chuck

From oliphant at enthought.com  Fri Aug 28 17:09:49 2009
From: oliphant at enthought.com (Travis Oliphant)
Date: Fri, 28 Aug 2009 16:09:49 -0500
Subject: [Numpy-discussion] Merge of date-time branch completed
In-Reply-To: 
References: <461AEC55-EB92-4EE5-9673-F33BB33FBEEF@enthought.com>
Message-ID: 

On Aug 28, 2009, at 4:03 PM, Charles R Harris wrote:

> On Fri, Aug 28, 2009 at 2:43 PM, Travis Oliphant
> wrote:
>
> On Aug 28, 2009, at 12:39 PM, Charles R Harris wrote:
>
>> Umm, replacing the previous code 'M' by '.' in generate_umath is a
>> bit obscure. Isn't there a better choice than '.' ?
>>
>> Please make the multiline comments conform to the standard. I spend
>> a lot of time fixing these up... And you broke some I already fixed.
>
> Sorry about that.  Can you remind me what the standard is?
>
> /*
>  * blah, blah
>  * blah, blah
>  */
>
> It makes the extent of the comment more blatant, especially if it is
> a long comment, and separates it from the code. No more looking for
> that elusive */. For code reading/maintenance blatant is good.
>
> How about 'P' instead of '.' ? I'll guess that 'M' originally stood
> for method and that's gone, but 'P' follows 'O', which isn't any
> sort of argument but at least 'P' is easier to see on the page ;)
>

I like it --- was just trying to think of a better one.  Thought of
'o', but it looks basically the same.

--
Travis Oliphant
Enthought Inc.
1-512-536-1057
http://www.enthought.com
oliphant at enthought.com
From cournape at gmail.com  Sat Aug 29 01:41:45 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 29 Aug 2009 00:41:45 -0500
Subject: [Numpy-discussion] future directions
In-Reply-To: <3d375d730908280918i4ea79b8foee043ed1bc8d0ec@mail.gmail.com>
References: <4A9802B9.3080705@noaa.gov>
	<3d375d730908280918i4ea79b8foee043ed1bc8d0ec@mail.gmail.com>
Message-ID: <5b8d13220908282241n11fbdcfcn688c4ec23bd936d3@mail.gmail.com>

On Fri, Aug 28, 2009 at 11:18 AM, Robert Kern wrote:
> On Fri, Aug 28, 2009 at 09:15, Christopher Barker wrote:
>> Joe Harrington wrote:
>>> However, there are two natural forklets coming up.
>>>
>>> The first is Python 3.0, which will necessitate some API changes.
>>
>> Absolutely! This seems like a no-brainer. I don't think we are talking
>> about really major changes to the numpy API anyway, generally clean-up,
>> and there is no way anyone is going to get their Py2 code working on Py3
>> without tweaking it anyway, so this is the time to do it.
>
> No, it is the *worst* time to do it. We have been asked by the Python
> developer team *not* to use the Python 3 transition to break all kinds
> of other backwards compatibility.

AFAIK, the main argument is that this would allow for an easier
transition, since someone could use 2to3 to make the transition from
numpy for python 2 to numpy for python 3. But is it even possible for
a large package like numpy ? I don't see how the C api for example
could be backward compatible, since the API with PyString and PyInt
would have to be changed.

I guess time will tell, once other packages with a lot of C are
converted. I am curious to see whether this advice from the python dev
community will be followed at all.

cheers,

David

From robert.kern at gmail.com  Sat Aug 29 04:07:22 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sat, 29 Aug 2009 03:07:22 -0500
Subject: [Numpy-discussion] future directions
In-Reply-To: <5b8d13220908282241n11fbdcfcn688c4ec23bd936d3@mail.gmail.com>
References: <4A9802B9.3080705@noaa.gov>
	<3d375d730908280918i4ea79b8foee043ed1bc8d0ec@mail.gmail.com>
	<5b8d13220908282241n11fbdcfcn688c4ec23bd936d3@mail.gmail.com>
Message-ID: <3d375d730908290107h4e4b33aev22175515b810ac8c@mail.gmail.com>

On Sat, Aug 29, 2009 at 00:41, David Cournapeau wrote:
> On Fri, Aug 28, 2009 at 11:18 AM, Robert Kern wrote:
>> On Fri, Aug 28, 2009 at 09:15, Christopher Barker wrote:
>>> Joe Harrington wrote:
>>>> However, there are two natural forklets coming up.
>>>>
>>>> The first is Python 3.0, which will necessitate some API changes.
>>>
>>> Absolutely! This seems like a no-brainer. I don't think we are talking
>>> about really major changes to the numpy API anyway, generally clean-up,
>>> and there is no way anyone is going to get their Py2 code working on Py3
>>> without tweaking it anyway, so this is the time to do it.
>>
>> No, it is the *worst* time to do it. We have been asked by the Python
>> developer team *not* to use the Python 3 transition to break all kinds
>> of other backwards compatibility.
>
> AFAIK, the main argument is that this would allow for an easier
> transition, since someone could use 2to3 to make the transition from
> numpy for python 2 to numpy for python 3. But is it even possible for
> a large package like numpy ? I don't see how the C api for example
> could be backward compatible, since the API with PyString and PyInt
> would have to be changed.

I'm not talking about that kind of breakage. You can break
compatibility for whatever is *necessary* in order to make the
transition.
What Chris is suggesting we do and what Guido is requesting we not do
is to take the opportunity to break compatibility for stuff entirely
unrelated to the transition just because people will have to port stuff
around that time anyways.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco

From pgmdevlist at gmail.com  Sun Aug 30 13:19:43 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Sun, 30 Aug 2009 13:19:43 -0400
Subject: [Numpy-discussion] masked arrays of structured arrays
In-Reply-To: <20090822115844.GA6422@doriath.local>
References: <20090822115844.GA6422@doriath.local>
Message-ID: <00FAC407-B21B-46BC-8ADA-E4702DBE54A6@gmail.com>

Oops, overlooked this one ...

On Aug 22, 2009, at 7:58 AM, Ernest Adrogué wrote:

> Hi there,
>
> Here is a structured array with 3 fields each of which has 3 fields
> in turn:
>
> However if I try the same with a masked array, it fails:
>
> In [14]: x = np.ma.masked_all(2, dtype=desc)
>
> In [15]: x['x']['b'] = 2
> ---------------------------------------------------------------------------
> ValueError                       Traceback (most recent call last)
>
> /home/ernest/<ipython console> in <module>()
>
> /usr/lib/python2.5/site-packages/numpy/ma/core.pyc in
> __setitem__(self, indx, value)
>    1574         if self._mask is nomask:
>    1575             self._mask = make_mask_none(self.shape, self.dtype)
> -> 1576         ndarray.__setitem__(self._mask, indx, getmask(value))
>    1577         return
>    1578         #........................................
>
> ValueError: field named b not found.
>
> Any idea of what the problem is?

I can't reproduce that with a recent SVN version (r7348). What version
of numpy are you using ?

From dsdale24 at gmail.com  Sun Aug 30 20:28:08 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Sun, 30 Aug 2009 20:28:08 -0400
Subject: [Numpy-discussion] segfaults when passing ndarray subclass to
	ufunc with out=None
In-Reply-To: <9457e7c80908301553u28fc1ca6ged1f113cc418881c@mail.gmail.com>
References: <9457e7c80908301553u28fc1ca6ged1f113cc418881c@mail.gmail.com>
Message-ID: 

Hi Stefan,

I think Chuck applied the patch after I filed a ticket at the trac
website: http://projects.scipy.org/numpy/ticket/1022 . I just tried
running the script I posted with the most recent checkout and numpy
raised an error instead of segfaulting, so I think this issue is clear.
Thank you for following up.

Darren

2009/8/30 Stéfan van der Walt :
> Hi, Darren
>
> Is this problem still present? If so, we should fix it before 1.4 is
> released.
>
> Regards
> Stéfan
>
>
> ---------- Forwarded message ----------
> From: Darren Dale
> Date: 2009/3/8
> Subject: Re: [Numpy-discussion] segfaults when passing ndarray
> subclass to ufunc with out=None
> To: Discussion of Numerical Python
>
>
> On Sun, Feb 8, 2009 at 12:49 PM, Darren Dale wrote:
>>
>> I am seeing some really strange behavior when I try to pass an ndarray
>> subclass and out=None to numpy's ufuncs. This example will reproduce the
>> problem with svn numpy; the first print statement yields 1 as expected,
>> the second yields "" and the third yields a segmentation fault:
>>
>> import numpy as np
>>
>> class MyArray(np.ndarray):
>>
>>     __array_priority__ = 20
>>
>>     def __new__(cls):
>>         return np.asarray(1).view(cls).copy()
>>
>>     def __repr__(self):
>>         return 'my_array'
>>
>>     __str__ = __repr__
>>
>>     def __mul__(self, other):
>>         return super(MyArray, self).__mul__(other)
>>
>>     def __rmul__(self, other):
>>         return super(MyArray, self).__rmul__(other)
>>
>> mine = MyArray()
>> print np.multiply(1, 1, None)
>> x = np.multiply(mine, mine, None)
>> print type(x)
>> print x

> I think I might have found a fix for this. The following patch allows
> my script to run without a segfault:
>
> $ svn diff
> Index: umath_ufunc_object.inc
> ===================================================================
> --- umath_ufunc_object.inc      (revision 6566)
> +++ umath_ufunc_object.inc      (working copy)
> @@ -3212,13 +3212,10 @@
>          output_wrap[i] = wrap;
>          if (j < nargs) {
>              obj = PyTuple_GET_ITEM(args, j);
> -            if (obj == Py_None) {
> -                continue;
> -            }
>              if (PyArray_CheckExact(obj)) {
>                  output_wrap[i] = Py_None;
>              }
> -            else {
> +            else if (obj != Py_None) {
>                  PyObject *owrap = PyObject_GetAttrString(obj,"__array_wrap__");
>                  incref = 0;
>                  if (!(owrap) || !(PyCallable_Check(owrap))) {
>
> That call to continue skipped this bit of code in the loop, which is
> apparently important:
>
>         if (incref) {
>             Py_XINCREF(output_wrap[i]);
>         }
>
> I've tested the trunk on 64 bit linux, with and without this patch
> applied, and I get the same result in both cases: 1 known failure, 11
> skips. Is there any chance someone could consider applying this patch
> before 1.3 ships?
>
> Darren

--
"In our description of nature, the purpose is not to disclose the real
essence of the phenomena but only to track down, so far as it is
possible, relations between the manifold aspects of our experience"
 - Niels Bohr

"It is a bad habit of physicists to take their most successful
abstractions to be real properties of our world."
 - N. David Mermin

"Once we have granted that any physical theory is essentially only a
model for the world of experience, we must renounce all hope of finding
anything like the correct theory ... simply because the totality of
experience is never accessible to us."
 - Hugh Everett III

From eadrogue at gmx.net  Mon Aug 31 14:33:17 2009
From: eadrogue at gmx.net (Ernest Adrogué)
Date: Mon, 31 Aug 2009 20:33:17 +0200
Subject: [Numpy-discussion] masked arrays of structured arrays
In-Reply-To: <00FAC407-B21B-46BC-8ADA-E4702DBE54A6@gmail.com>
References: <20090822115844.GA6422@doriath.local>
	<00FAC407-B21B-46BC-8ADA-E4702DBE54A6@gmail.com>
Message-ID: <20090831183317.GA7148@doriath.local>

30/08/09 @ 13:19 (-0400), thus spake Pierre GM:
> I can't reproduce that with a recent SVN version (r7348). What version
> of numpy are you using ?

Version 1.2.1

-- 
Ernest
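For anyone trying to reproduce Ernest's report: the dtype `desc` itself
was elided upthread, but a nested layout of the shape he describes
might look like the following (field names other than 'x' and 'b' are
pure guesses; per the thread, the assignment fails on numpy 1.2.1 and
works on later versions):

import numpy as np

inner = [('a', float), ('b', float), ('c', float)]
desc = [('x', inner), ('y', inner), ('z', inner)]

x = np.ma.masked_all(2, dtype=desc)
x['x']['b'] = 2    # ValueError on 1.2.1, per this thread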
From pgmdevlist at gmail.com  Mon Aug 31 14:37:35 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Mon, 31 Aug 2009 14:37:35 -0400
Subject: [Numpy-discussion] masked arrays of structured arrays
In-Reply-To: <20090831183317.GA7148@doriath.local>
References: <20090822115844.GA6422@doriath.local>
	<00FAC407-B21B-46BC-8ADA-E4702DBE54A6@gmail.com>
	<20090831183317.GA7148@doriath.local>
Message-ID: <48116BC6-2100-47E6-9B9F-AD3E4465BF43@gmail.com>

On Aug 31, 2009, at 2:33 PM, Ernest Adrogué wrote:
> 30/08/09 @ 13:19 (-0400), thus spake Pierre GM:
>> I can't reproduce that with a recent SVN version (r7348). What
>> version
>> of numpy are you using ?
>
> Version 1.2.1

That must be it. Can you try w/ 1.3 ?

From robert.kern at gmail.com  Mon Aug 31 18:52:48 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Mon, 31 Aug 2009 17:52:48 -0500
Subject: [Numpy-discussion] adaptive interpolation on a regular 2d grid
In-Reply-To: 
References: 
Message-ID: <3d375d730908311552r399ed62ah4f482b14b4e89b88@mail.gmail.com>

On Sat, Aug 22, 2009 at 11:03, denis bzowy wrote:
> Folks,
>
>  here's a simple adaptive interpolator;
> drop me a line to chat about it
>
>     adalin2( func, near, nx=300, ny=150, xstep=32, ystep=16,
>         xrange=(0,1), yrange=(0,1), dtype=np.float, norm=abs )
>
> Purpose:
>     interpolate a function on a regular 2d grid:
>     take func() where it changes rapidly, bilinear interpolate where
>     it's smooth.

Looks good! Where can we get the code? Can this be specialized for 1D
functions?

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco
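On the 1-D question: the refine-where-linear-interpolation-fails idea
is compact in one dimension.  A rough sketch of the approach (this is
not denis's adalin2, just an illustration):

import numpy as np

def adaptive_sample_1d(func, lo, hi, tol=1e-3, depth=12):
    """Sample func on [lo, hi], subdividing where a straight line errs."""
    mid = 0.5 * (lo + hi)
    flo, fmid, fhi = func(lo), func(mid), func(hi)
    # stop when the midpoint is well predicted by linear interpolation
    if depth <= 0 or abs(fmid - 0.5 * (flo + fhi)) < tol:
        return [(lo, flo), (mid, fmid), (hi, fhi)]
    left = adaptive_sample_1d(func, lo, mid, tol, depth - 1)
    right = adaptive_sample_1d(func, mid, hi, tol, depth - 1)
    return left + right[1:]        # drop the duplicated midpoint

# e.g. pts = np.array(adaptive_sample_1d(np.sin, 0.0, 6.0))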