From elias.pampalk at gmail.com  Wed Jul 1 12:18:51 2009
From: elias.pampalk at gmail.com (Elias Pampalk)
Date: Wed, 1 Jul 2009 17:18:51 +0100
Subject: [SciPy-user] scipy.stats Documentation?
In-Reply-To: <43958ee60906301624l7fd7f445sebd0d69b4ac32e61@mail.gmail.com>
References: <43958ee60906301516s637b180fm35b96d5b61a1b549@mail.gmail.com> <4A4A92A6.9070503@apstat.com> <43958ee60906301624l7fd7f445sebd0d69b4ac32e61@mail.gmail.com>
Message-ID: <4a4b8c6f.1a135e0a.7a2d.0b82@mx.google.com>

If I don't want to be searching through the source code - where should I be looking for up-to-date documentation?

Searching via Google for the documentation on the scipy.stats Mann Whitney U test I found:

http://www.scipy.org/doc/api_docs/SciPy.stats.stats.html#mannwhitneyu
http://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.mstats.mannwhitneyu.html

Both seem outdated compared to what I found here:

http://svn.scipy.org/svn/scipy/trunk/scipy/stats/stats.py
http://projects.scipy.org/scipy/browser/tags/0.7.1/scipy/stats/stats.py

Thank you,
Elias

From cournape at gmail.com  Wed Jul 1 12:33:22 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 2 Jul 2009 01:33:22 +0900
Subject: [SciPy-user] scipy.stats Documentation?
In-Reply-To: <4a4b8c6f.1a135e0a.7a2d.0b82@mx.google.com>
References: <43958ee60906301516s637b180fm35b96d5b61a1b549@mail.gmail.com> <4A4A92A6.9070503@apstat.com> <43958ee60906301624l7fd7f445sebd0d69b4ac32e61@mail.gmail.com> <4a4b8c6f.1a135e0a.7a2d.0b82@mx.google.com>
Message-ID: <5b8d13220907010933j7463ec93tb0bb92f02fbf9308@mail.gmail.com>

On Thu, Jul 2, 2009 at 1:18 AM, Elias Pampalk wrote:
> If I don't want to be searching through the source code - where should I be
> looking for up-to-date documentation?

The most up-to-date documentation is almost always the docstring. If you use an advanced interpreter such as ipython, you get the docstring without looking at the sources:

import numpy as np
help(np.mean)

There is a current effort to bring the official documentation on par with the docstring, but I don't think it has been done completely for scipy.stats yet.

David
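As an aside, here is a minimal sketch of the docstring-based workflow David describes above (help() is plain Python, np.info is numpy's own docstring printer, and the ?-suffix forms are IPython syntax):

    import numpy as np

    help(np.mean)     # pydoc rendering of the docstring
    np.info(np.mean)  # numpy's docstring printer

    # In IPython:
    #   np.mean?      # show the docstring
    #   np.mean??     # docstring plus source, when available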
From Scott.Daniels at Acm.Org  Wed Jul 1 13:21:22 2009
From: Scott.Daniels at Acm.Org (Scott David Daniels)
Date: Wed, 01 Jul 2009 10:21:22 -0700
Subject: [SciPy-user] assign to diagonal values?
In-Reply-To: <3d375d730905231433p5003abfak9dca94a975825a39@mail.gmail.com>
References: <47A128A5.7010406@sci.utah.edu> <4A18647D.50207@american.edu> <3d375d730905231405t474f1e63n67d68d55f317e20a@mail.gmail.com> <4A186A21.5040102@american.edu> <3d375d730905231433p5003abfak9dca94a975825a39@mail.gmail.com>
Message-ID: 

Robert Kern wrote:
> On Sat, May 23, 2009 at 16:26, Alan G Isaac wrote:
>>> On Sat, May 23, 2009 at 16:02, Alan G Isaac wrote:
>>>> On 1/31/2008 1:37 AM Anne Archibald apparently wrote:
>>>>> m[range(n),range(n)]=new_diagonal
>>>> Will that work with range objects (in Python 3)?
>> On 5/23/2009 5:05 PM Robert Kern apparently wrote:
>>> No. The automatic conversion to arrays does not consume iterators (nor
>>> will it when we port to Python 3).
>> Sure, but range objects are not iterators.
>> They are "almost" sequences.
>
> The answer is still no. Perhaps someone will write special support for
> that type when we do the Python 3 port, but there's nothing in numpy
> that would make it work automatically. For example, xrange() does not
> work as an index with the current numpy.

Well, ranges are more capable than you think in Python 3:
    v = range(25)
    print (v[3], v[0], v[22], v)
prints:
    3 0 22 range(0, 25)

-Scott

From pav at iki.fi  Wed Jul 1 13:57:09 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Wed, 1 Jul 2009 17:57:09 +0000 (UTC)
Subject: [SciPy-user] scipy.stats Documentation?
References: <43958ee60906301516s637b180fm35b96d5b61a1b549@mail.gmail.com> <4A4A92A6.9070503@apstat.com> <43958ee60906301624l7fd7f445sebd0d69b4ac32e61@mail.gmail.com> <4a4b8c6f.1a135e0a.7a2d.0b82@mx.google.com>
Message-ID: 

On 2009-07-01, Elias Pampalk wrote:
> If I don't want to be searching through the source code - where should I be
> looking for up-to-date documentation?
>
> Searching via Google for the documentation on scipy.stats Mann Whitney U
> test I found:
>
> http://www.scipy.org/doc/api_docs/SciPy.stats.stats.html#mannwhitneyu
> http://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.mstats.mannwhitneyu.html

The second one is up-to-date but it's for scipy.stats.mstats.mannwhitneyu, not scipy.stats.mannwhitneyu. Apparently, the main stats.mannwhitneyu function was not included in the documentation. Fixed, should appear tomorrow.

-- 
Pauli Virtanen

From robert.kern at gmail.com  Wed Jul 1 14:38:49 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 1 Jul 2009 13:38:49 -0500
Subject: [SciPy-user] assign to diagonal values?
In-Reply-To: 
References: <47A128A5.7010406@sci.utah.edu> <4A18647D.50207@american.edu> <3d375d730905231405t474f1e63n67d68d55f317e20a@mail.gmail.com> <4A186A21.5040102@american.edu> <3d375d730905231433p5003abfak9dca94a975825a39@mail.gmail.com>
Message-ID: <3d375d730907011138g204ffb57kfc7cd12b9a8b45ce@mail.gmail.com>

On Wed, Jul 1, 2009 at 12:21, Scott David Daniels wrote:
> Robert Kern wrote:
>> On Sat, May 23, 2009 at 16:26, Alan G Isaac wrote:
>>>> On Sat, May 23, 2009 at 16:02, Alan G Isaac wrote:
>>>>> On 1/31/2008 1:37 AM Anne Archibald apparently wrote:
>>>>>> m[range(n),range(n)]=new_diagonal
>>>>> Will that work with range objects (in Python 3)?
>>> On 5/23/2009 5:05 PM Robert Kern apparently wrote:
>>>> No. The automatic conversion to arrays does not consume iterators (nor
>>>> will it when we port to Python 3).
>>> Sure, but range objects are not iterators.
>>> They are "almost" sequences.
>>
>> The answer is still no. Perhaps someone will write special support for
>> that type when we do the Python 3 port, but there's nothing in numpy
>> that would make it work automatically. For example, xrange() does not
>> work as an index with the current numpy.
>
> Well, ranges are more capable than you think in Python 3:
>     v = range(25)
>     print (v[3], v[0], v[22], v)
> prints:
>     3 0 22 range(0, 25)

No, they are exactly as capable as I think, i.e. as capable as xrange() is in Python 2:

In [10]: v = xrange(25)

In [11]: print v[3], v[0], v[22], v
3 0 22 xrange(25)

Quite simply, numpy does not support arbitrary sequences and sequence-like objects as indices. If the eventual numpy port to Python 3 supports range() objects as indices, it will be because someone will have written special code for it.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco
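A minimal sketch of the workaround implied above -- numpy fancy indexing wants real integer arrays, so a range/xrange is converted with np.arange (or np.asarray) before being used as an index:

    import numpy as np

    n = 4
    m = np.zeros((n, n))
    idx = np.arange(n)              # an ndarray; range()/xrange() won't work here
    m[idx, idx] = [10, 20, 30, 40]  # assigns the diagonal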
From fperez.net at gmail.com  Wed Jul 1 15:38:12 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Wed, 1 Jul 2009 12:38:12 -0700
Subject: [SciPy-user] assign to diagonal values?
In-Reply-To: <2bc7a5a50801302128g79da7ea6jc95ef9abb00b3785@mail.gmail.com>
References: <47A128A5.7010406@sci.utah.edu> <2bc7a5a50801302128g79da7ea6jc95ef9abb00b3785@mail.gmail.com>
Message-ID: 

On Wed, Jan 30, 2008 at 10:28 PM, Anand Patil wrote:
> Yeah, that's awesome. I have so many for-loops littering my code for
> setting diagonals. Here's an nd-version:
>
> def setdiag(a, d):
>     assert(all([s == len(d) for s in a.shape]))
>     stride = 1+sum(cumprod(a.shape[:-1]))
>     a.flat[::stride] = d
>

If someone feels like reviewing this ticket:

http://projects.scipy.org/numpy/attachment/ticket/1132/numpy-index-funcs.diff

it has the above and a few more utilities, with docs and tests.

Cheers,

f

From lorenzo.isella at gmail.com  Thu Jul 2 09:25:42 2009
From: lorenzo.isella at gmail.com (Lorenzo Isella)
Date: Thu, 02 Jul 2009 15:25:42 +0200
Subject: [SciPy-user] Creating a Matrix from a Sum
Message-ID: <4A4CB556.6090203@gmail.com>

Dear All,
I need some help to efficiently write an array manipulation. I suspect this could be a one-liner.
Assume that you have a histogram of observations that you store in a vector x.
Let us say that its i-th entry, x_i, corresponds to the number of observations in the i-th channel, for i=1,2...N. (or 0,1...N-1, please let me know if there is any potential 0/1 pitfall in the following).
Now, for any two channels i and j, I want to calculate the probability of having an observation in any other channel k, where k>=max(i,j).
That is to say

P(i,j)=sum_{k=max(i,j)}^N x_k /C,

where C=sum_{i=1}^N x_i is just a normalization factor.

Does anyone know a good way of writing this in a code to get the P(i,j) matrix?
Many thanks

Lorenzo

From Dharhas.Pothina at twdb.state.tx.us  Thu Jul 2 11:58:43 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Thu, 02 Jul 2009 10:58:43 -0500
Subject: [SciPy-user] Add a column to masked array.
Message-ID: <4A4C92E3.63BA.009B.0@twdb.state.tx.us>

Hi,

I have a masked array 'data' where data.shape = (1805,4) and another masked array 'a' where a.shape = (1805,)

I would like to generate a new masked array with 'a' as the last column, i.e. the final shape should be (1805,5).

I've tried np.hstack, which gave the error 'arrays must have same number of dimensions', and ma.hstack just combines both masked arrays into a list.

How would I go about doing what I need?

thanks

- dharhas

From pgmdevlist at gmail.com  Thu Jul 2 12:57:34 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Thu, 2 Jul 2009 12:57:34 -0400
Subject: [SciPy-user] Add a column to masked array.
In-Reply-To: <4A4C92E3.63BA.009B.0@twdb.state.tx.us>
References: <4A4C92E3.63BA.009B.0@twdb.state.tx.us>
Message-ID: <6E2CE761-723F-4CB4-8452-256115F18D27@gmail.com>

On Jul 2, 2009, at 11:58 AM, Dharhas Pothina wrote:
> Hi,
>
> I have a masked array 'data' where data.shape = (1805,4) and another
> masked array 'a' where a.shape = (1805,)
>
> I would like to generate a new masked array with 'a' as the last
> column, i.e. the final shape should be (1805,5).

Try ma.column_stack:

>>> x = ma.masked_all((10,3))
>>> y = np.random.rand(10)
>>> ma.column_stack((x,y)).shape
(10, 4)

> I've tried np.hstack which gave an error 'arrays must have same
> number of dimensions'

The corresponding standard numpy functions don't know how to process the mask and tend to fail miserably.

> and ma.hstack just combines both masked arrays into a list.

I've never been able to remember what vstack, hstack and stack do respectively, so I use trial/error and the online help...
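A self-contained sketch of the ma.column_stack recipe above, with masked values in both inputs to show that the masks carry through (the small shapes here are stand-ins for the (1805,4) and (1805,) arrays from the question):

    import numpy as np
    import numpy.ma as ma

    data = ma.masked_less(np.arange(8.).reshape(4, 2), 2)  # (4, 2), two values masked
    a = ma.masked_greater(np.arange(4.), 2)                # (4,), one value masked
    combined = ma.column_stack((data, a))

    print combined.shape  # (4, 3)
    print combined.mask   # masks from both inputs are preserved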
From elias.pampalk at gmail.com  Thu Jul 2 13:43:22 2009
From: elias.pampalk at gmail.com (Elias Pampalk)
Date: Thu, 2 Jul 2009 18:43:22 +0100
Subject: [SciPy-user] scipy.stats.stats mannwhitneyu vs ranksums?
In-Reply-To: <5b8d13220907010933j7463ec93tb0bb92f02fbf9308@mail.gmail.com>
References: <43958ee60906301516s637b180fm35b96d5b61a1b549@mail.gmail.com> <4A4A92A6.9070503@apstat.com> <43958ee60906301624l7fd7f445sebd0d69b4ac32e61@mail.gmail.com> <4a4b8c6f.1a135e0a.7a2d.0b82@mx.google.com> <5b8d13220907010933j7463ec93tb0bb92f02fbf9308@mail.gmail.com>
Message-ID: <4a4cf1bf.05a4100a.7834.ffffcf2e@mx.google.com>

Why does stats.py contain both mannwhitneyu and ranksums?

See also:

http://en.wikipedia.org/wiki/Mann-Whitney-Wilcoxon_test
"Mann-Whitney U test (also called the Mann-Whitney-Wilcoxon (MWW), Wilcoxon rank-sum test, or Wilcoxon-Mann-Whitney test)"

http://www.mathworks.com/access/helpdesk/help/toolbox/stats/index.html?/access/helpdesk/help/toolbox/stats/ranksum.html
"ranksum [...] The test is equivalent to a Mann-Whitney U-test."

Thanks,
Elias

From Dharhas.Pothina at twdb.state.tx.us  Thu Jul 2 13:52:17 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Thu, 02 Jul 2009 12:52:17 -0500
Subject: [SciPy-user] Add a column to masked array.
In-Reply-To: <6E2CE761-723F-4CB4-8452-256115F18D27@gmail.com>
References: <4A4C92E3.63BA.009B.0@twdb.state.tx.us> <6E2CE761-723F-4CB4-8452-256115F18D27@gmail.com>
Message-ID: <4A4CAD81.63BA.009B.0@twdb.state.tx.us>

thank you. ma.column_stack works great.

- d

>>> Pierre GM 7/2/2009 11:57 AM >>>

On Jul 2, 2009, at 11:58 AM, Dharhas Pothina wrote:
> Hi,
>
> I have a masked array 'data' where data.shape = (1805,4) and another
> masked array 'a' where a.shape = (1805,)
>
> I would like to generate a new masked array with 'a' as the last
> column, i.e. the final shape should be (1805,5).

Try ma.column_stack:

>>> x = ma.masked_all((10,3))
>>> y = np.random.rand(10)
>>> ma.column_stack((x,y)).shape
(10, 4)

> I've tried np.hstack which gave an error 'arrays must have same
> number of dimensions'

The corresponding standard numpy functions don't know how to process the mask and tend to fail miserably.

> and ma.hstack just combines both masked arrays into a list.

I've never been able to remember what vstack, hstack and stack do respectively, so I use trial/error and the online help...

_______________________________________________
SciPy-user mailing list
SciPy-user at scipy.org
http://mail.scipy.org/mailman/listinfo/scipy-user

From dwf at cs.toronto.edu  Thu Jul 2 19:54:33 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Thu, 2 Jul 2009 19:54:33 -0400
Subject: [SciPy-user] scipy.stats.stats mannwhitneyu vs ranksums?
In-Reply-To: <4a4cf1bf.05a4100a.7834.ffffcf2e@mx.google.com>
References: <43958ee60906301516s637b180fm35b96d5b61a1b549@mail.gmail.com> <4A4A92A6.9070503@apstat.com> <43958ee60906301624l7fd7f445sebd0d69b4ac32e61@mail.gmail.com> <4a4b8c6f.1a135e0a.7a2d.0b82@mx.google.com> <4a4cf1bf.05a4100a.7834.ffffcf2e@mx.google.com>
Message-ID: <4F46BA01-652B-4AEA-84CE-277D54F26693@cs.toronto.edu>

There's also wilcoxon, which is related but subtly different. There's an open ticket for clarifying these docs:

http://projects.scipy.org/scipy/ticket/901

The discussion there might illuminate things.

David

On 2-Jul-09, at 1:43 PM, Elias Pampalk wrote:

> Why does stats.py contain both mannwhitneyu and ranksums?
>
> See also:
> http://en.wikipedia.org/wiki/Mann-Whitney-Wilcoxon_test
>
> "Mann-Whitney U test (also called the Mann-Whitney-Wilcoxon (MWW),
> Wilcoxon
> rank-sum test, or Wilcoxon-Mann-Whitney test)"
>
> http://www.mathworks.com/access/helpdesk/help/toolbox/stats/index.html?/access/helpdesk/help/toolbox/stats/ranksum.html
>
> "ranksum [...] The test is equivalent to a Mann-Whitney U-test."
>
> Thanks,
> Elias
>
> PS. @David, @Pauli: Thank you for answering my question wrt
> documentation
> yesterday!
>
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
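A minimal sketch of calling the two tests side by side. Note that, in this scipy version, mannwhitneyu returns a one-sided p-value, so it is doubled for a two-sided comparison -- the same convention Elias uses in his comparison later in this thread:

    import numpy as np
    from scipy.stats import mannwhitneyu, ranksums

    x = np.arange(20)
    y = np.arange(40)

    u, p_one_sided = mannwhitneyu(x, y)
    z, p_two_sided = ranksums(x, y)
    print 2 * p_one_sided, p_two_sided  # similar, but not identical when ties are present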
From oanjao at yahoo.com  Thu Jul 2 22:54:09 2009
From: oanjao at yahoo.com (Craig Finch)
Date: Thu, 2 Jul 2009 19:54:09 -0700 (PDT)
Subject: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension
Message-ID: <524620.24680.qm@web30402.mail.mud.yahoo.com>

I have two computers that have almost identical Python installations, but one of them has problems with inline Weave code. Here is a simple test script that I am using:

#!/usr/bin/env python
from scipy import weave
input_val = 10

code=r"""return_val = 10*input_val;"""

print weave.inline(code, ['input_val'], headers=[], compiler='gcc')

It runs successfully on one computer with the following configuration:
Python 2.5.2 (r252:60911, Apr 17 2009, 18:42:17)
[GCC 4.1.1 (Gentoo 4.1.1-r3)] on linux2
Scipy 0.7.0
Numpy 1.3.0

On the other computer, it fails with the following message:

Traceback (most recent call last):
  File "test.py", line 12, in <module>
    print weave.inline(code, ['input_val'], headers=[], compiler='gcc')
  File "/home/cfinch/lib/python2.5/site-packages/scipy/weave/inline_tools.py", line 335, in inline
    **kw)
  File "/home/cfinch/lib/python2.5/site-packages/scipy/weave/inline_tools.py", line 468, in compile_function
    exec 'import ' + module_name
  File "<string>", line 1, in <module>
ImportError: /home/cfinch/.python25_compiled/sc_71b2502f9a0b0ca9f89b0cdc7ad3819e0.so: undefined symbol: _ZNSt8ios_base4InitD1Ev

I used nm to check, and that symbol is indeed present in the compiled .so file.

The configuration of this computer is:
Python 2.5.4 (r254:67916, Apr 22 2009, 15:52:10)
[GCC 4.1.1 20070105 (Red Hat 4.1.1-52)] on linux2
Scipy 0.7.0
Numpy 1.3.0

One complication is that the default compiler on this system is icc, and numpy distutils doesn't offer a way to force it to use gcc for everything. I have tried everything I can think of--deleting the files in .python25_compiled, rebuilding Scipy to ensure it was built with gcc, even hacking Numpy's distutils to force it to compile the extension with gcc.

I also ran the unit tests that come with weave. Running weave.test() produces no errors, but running weave.test("full") produces lots of errors:

Ran 390 tests in 145.246s
FAILED (KNOWNFAIL=6, errors=196)

There are no errors on the other system. Can anyone suggest a way to fix this?

   Craig
From david at ar.media.kyoto-u.ac.jp  Fri Jul 3 03:19:36 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Fri, 03 Jul 2009 16:19:36 +0900
Subject: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension
In-Reply-To: <524620.24680.qm@web30402.mail.mud.yahoo.com>
References: <524620.24680.qm@web30402.mail.mud.yahoo.com>
Message-ID: <4A4DB108.8050407@ar.media.kyoto-u.ac.jp>

Craig Finch wrote:
> I have two computers that have almost identical Python installations,
> but one of them has problems with inline Weave code. Here is a simple
> test script that I am using:
>
> #!/usr/bin/env python
> from scipy import weave
> input_val = 10
>
> code=r"""return_val = 10*input_val;"""
>
> print weave.inline(code, ['input_val'], headers=[], compiler='gcc')
>
> It runs successfully on one computer with the following configuration:
> Python 2.5.2 (r252:60911, Apr 17 2009, 18:42:17)
> [GCC 4.1.1 (Gentoo 4.1.1-r3)] on linux2
> Scipy 0.7.0
> Numpy 1.3.0
>
> On the other computer, it fails with the following message:
> Traceback (most recent call last):
>   File "test.py", line 12, in <module>
>     print weave.inline(code, ['input_val'], headers=[], compiler='gcc')
>   File "/home/cfinch/lib/python2.5/site-packages/scipy/weave/inline_tools.py", line 335, in inline
>     **kw)
>   File "/home/cfinch/lib/python2.5/site-packages/scipy/weave/inline_tools.py", line 468, in compile_function
>     exec 'import ' + module_name
>   File "<string>", line 1, in <module>
> ImportError: /home/cfinch/.python25_compiled/sc_71b2502f9a0b0ca9f89b0cdc7ad3819e0.so: undefined symbol: _ZNSt8ios_base4InitD1Ev
>
> I used nm to check, and that symbol is indeed present in the compiled .so file.
>
> The configuration of this computer is:
> Python 2.5.4 (r254:67916, Apr 22 2009, 15:52:10)
> [GCC 4.1.1 20070105 (Red Hat 4.1.1-52)] on linux2
> Scipy 0.7.0
> Numpy 1.3.0
> One
> complication is that the default compiler on this system is icc, and
> numpy distutils doesn't offer a way to force it to use gcc for
> everything.
>

Mixing C++ compilers is difficult. Icc goes to great lengths to be compatible with g++ on Linux, but it is highly version dependent. In particular, the C++ library often causes problems, which is the case here it seems. So you're right that you should use gcc to compile the extension. One thing you could do is to have a script called icc which calls gcc, I think this works if the script comes first in your path. That's a horrible hack, though.

cheers,

David

From elias.pampalk at gmail.com  Fri Jul 3 12:06:58 2009
From: elias.pampalk at gmail.com (Elias Pampalk)
Date: Fri, 3 Jul 2009 17:06:58 +0100
Subject: [SciPy-user] scipy.stats.stats mannwhitneyu vs ranksums?
In-Reply-To: <4F46BA01-652B-4AEA-84CE-277D54F26693@cs.toronto.edu>
References: <43958ee60906301516s637b180fm35b96d5b61a1b549@mail.gmail.com> <4A4A92A6.9070503@apstat.com> <43958ee60906301624l7fd7f445sebd0d69b4ac32e61@mail.gmail.com> <4a4b8c6f.1a135e0a.7a2d.0b82@mx.google.com> <5b8d13220907010933j7463ec93tb0bb92f02fbf9308@mail.gmail.com> <4a4cf1bf.05a4100a.7834.ffffcf2e@mx.google.com> <4F46BA01-652B-4AEA-84CE-277D54F26693@cs.toronto.edu>
Message-ID: <4a4e2ca7.0c92100a.2f71.0392@mx.google.com>

Thanks David!

I did a quick comparison between Matlab/stats (R14SP3), R (2.8.1), and Python/SciPy (0.7). Maybe this is somehow useful for others too. (I'm intentionally violating the continuous distribution assumptions.)
Samples:
A1 <-> B: not paired with ties
A2 <-> B: not paired without ties
A1 <-> C: paired with zeros
A2 <-> C: paired without zeros

- Matlab
A1 = 0:19
A2 = A1 + (1:20)./100
B = 0:39
C = [0:14,16:20]

- R
A1 <- 0:19
A2 <- A1 + 1:20/100
B <- 0:39
C <- c(0:14,16:20)

- SciPy
A1 = numpy.arange(20)
A2 = A1 + numpy.arange(1,21)/100.0
B = numpy.arange(40)
C = numpy.array(range(15) + range(16,21))

2 Samples, Not Paired
=====================
(from scipy.stats import stats)

Kruskal-Wallis Test
-------------------
Same p-values for all.

Samples contain ties:
- Matlab: kruskalwallis([A1,B],[A1*0,B*0+1]) = 0.00170615101265
- R: kruskal.test(list(A1,B)) = 0.00170615101265
- R: wilcox.test(A1,B, correct=FALSE) = 0.00170615101265 (+warning: ties)
- SciPy: stats.kruskal(A1,B) = 0.00170615101265
(R: kruskal = wilcox without correction for continuity)

Samples without ties:
- Matlab: kruskalwallis([A2,B], [A2*0,B*0+1]) = 0.00288777919292
- R: kruskal.test(list(A2,B)) = 0.00288777919292
- SciPy: stats.kruskal(A2,B) = 0.00288777919292

Wilcoxon Rank Sum (aka Mann Whitney U) Test
-------------------------------------------
Matlab and R identical (but different defaults wrt exact/approximate), SciPy computes approximate results and does not correct for continuity (changed in version 7.1 for stats.mannwhitneyu?).

Samples contain ties:
- Matlab: ranksum(A1,B) = 0.00175235702866
- R: wilcox.test(A1,B) = 0.00175235702866 (+warning: ties)
- R: wilcox.test(A1,B,correct=FALSE) = 0.001706151012654 (+warning: ties)
- SciPy: stats.mannwhitneyu(A1,B)[1]*2 = 0.0017086895586986284
- SciPy: stats.ranksums(A1,B) = 0.0017112312247389294

Samples without ties:
- Matlab: ranksum(A2,B) = 0.00296255173431
- R: wilcox.test(A2,B, exact=FALSE) = 0.00296255173431
- Matlab: ranksum(A2,B,'method','exact') = 0.00246078580826
- R: wilcox.test(A2,B) = 0.00246078580826
- R: wilcox.test(A2,B, exact=FALSE, correct=FALSE) = 0.00288777919292
- SciPy: stats.mannwhitneyu(A2,B)[1]*2 = 0.00288777919292
- SciPy: stats.ranksums(A2,B) = 0.00288777919292
(SciPy: mannwhitneyu = ranksums = kruskal if no ties)

2 Samples, Paired, Wilcoxon Sign Rank Test
==========================================
(from scipy.stats import wilcoxon)

Matlab and SciPy do not correct for continuity and R does. Matlab and R have different defaults for exact/approximate. Matlab computes exact results also if ties/zeros exist.

With zeros:
- Matlab: signrank(A1,C,'method','approximate') = 0.02534731867747
- R: wilcox.test(A1 - C, correct=FALSE) = 0.02534731867747 (+warnings: ties + zeros)
- Matlab: signrank(A1,C) = 0.06250000000000
- R: wilcox.test(A1 - C) = 0.0368884257070 (+warnings: ties + zeros)
- SciPy: wilcoxon(A1,C) = nan (+error: sample size too small)

Without zeros:
- Matlab: signrank(A2,C,'method','exact') = 0.59581947326660
- R: wilcox.test(A2 - C) = 0.59581947326660
- Matlab: signrank(A2,C) = 0.57548622813650
- R: wilcox.test(A2 - C, exact=FALSE, correct=FALSE) = 0.57548622813650
- SciPy: wilcoxon(A2,C) = 0.57548622813650
- R: wilcox.test(A2 - C, exact=FALSE) = 0.5882844808893

Elias

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From tpk at kraussfamily.org  Fri Jul 3 13:35:51 2009
From: tpk at kraussfamily.org (Tom K.)
Date: Fri, 3 Jul 2009 10:35:51 -0700 (PDT)
Subject: [SciPy-user] Creating a Matrix from a Sum
In-Reply-To: <4A4CB556.6090203@gmail.com>
References: <4A4CB556.6090203@gmail.com>
Message-ID: <24326539.post@talk.nabble.com>

Lorenzo Isella wrote:
>
> I need some help to efficiently write an array manipulation. I suspect
> this could be a one-liner.
> Assume that you have a histogram of observations that you store in a
> vector x.
> Let us say that its i-th entry, x_i, corresponds to the number of
> observations in the i-th channel, for i=1,2...N. (or 0,1...N-1, please
> let me know if there is any potential 0/1 pitfall in the following).
> Now, for any two channels i and j, I want to calculate the probability
> of having an observation in any other channel k, where k>=max(i,j).
> That is to say
>
> P(i,j)=sum_{k=max(i,j)}^N x_k /C,
> where C=sum_{i=1}^N x_i is just a normalization factor.
>
> Does anyone know a good way of writing this in a code to get the P(i,j)
> matrix?
>

I'm not sure about a 1 liner. Here are two ways that work:

1) Accumulate the array starting with the last element.

P = np.zeros((len(x), len(x)))
i = len(x)
for xi in x[::-1]:
    P[:i, :i] += xi
    i -= 1

2) Index into the reversed cumulative sum with a max_ij array.

i = np.arange(len(x))[:,np.newaxis]
j = np.arange(len(x))
max_ij = np.maximum(i,j)
P = x[::-1].cumsum()[::-1][max_ij]

-- 
View this message in context: http://www.nabble.com/Creating-a-Matrix-from-a-Sum-tp24306698p24326539.html
Sent from the Scipy-User mailing list archive at Nabble.com.
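A quick sanity check that the two methods above agree (hypothetical data; both build the same unnormalized P):

    import numpy as np

    x = np.random.rand(50)

    # Method 1: accumulate starting from the last element
    P1 = np.zeros((len(x), len(x)))
    i = len(x)
    for xi in x[::-1]:
        P1[:i, :i] += xi
        i -= 1

    # Method 2: reversed cumulative sum indexed by max(i, j)
    i = np.arange(len(x))[:, np.newaxis]
    j = np.arange(len(x))
    P2 = x[::-1].cumsum()[::-1][np.maximum(i, j)]

    print np.allclose(P1, P2)  # True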
From oanjao at yahoo.com  Fri Jul 3 14:38:06 2009
From: oanjao at yahoo.com (Craig Finch)
Date: Fri, 3 Jul 2009 11:38:06 -0700 (PDT)
Subject: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension
In-Reply-To: <4A4DB108.8050407@ar.media.kyoto-u.ac.jp>
References: <524620.24680.qm@web30402.mail.mud.yahoo.com> <4A4DB108.8050407@ar.media.kyoto-u.ac.jp>
Message-ID: <829316.46979.qm@web30408.mail.mud.yahoo.com>

Thank you for the suggestion. I have some new information, but unfortunately no solution. I re-built Numpy and Scipy with gcc and gfortran, and did some hacking to ensure that my Weave extension was also being built with gcc. I get the same error when I try to import the Weave extension.

I do have some new information that might help--I found that I have other library problems in my Scipy build, rather than a Weave-specific issue. Here is another manifestation of the problem:

>>> from scipy.interpolate import UnivariateSpline
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/cfinch/lib/python2.5/site-packages/scipy/interpolate/__init__.py", line 13, in <module>
    from rbf import Rbf
  File "/home/cfinch/lib/python2.5/site-packages/scipy/interpolate/rbf.py", line 47, in <module>
    from scipy import linalg
  File "/home/cfinch/lib/python2.5/site-packages/scipy/linalg/__init__.py", line 13, in <module>
    from iterative import *
  File "/home/cfinch/lib/python2.5/site-packages/scipy/linalg/iterative.py", line 5, in <module>
    from scipy.sparse.linalg import isolve
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/__init__.py", line 6, in <module>
    from csr import *
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/csr.py", line 12, in <module>
    from sparsetools import csr_tocsc, csr_tobsr, csr_count_blocks, \
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/__init__.py", line 4, in <module>
    from csr import *
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/csr.py", line 7, in <module>
    import _csr
ImportError: /home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/_csr.so: undefined symbol: __gxx_personality_v0

As far as I know, everything Python-related on this machine has now been built with GNU tools, but something is still broken. One more thing--Python, Numpy, and Scipy are all locally installed in my /home/cfinch directory because the system Python is frozen at 2.4.

Any more ideas?

   Craig

----- Original Message ----
From: David Cournapeau
To: Craig Finch ; SciPy Users List
Sent: Friday, July 3, 2009 3:19:36 AM
Subject: Re: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension

Craig Finch wrote:
> I have two computers that have almost identical Python installations,
> but one of them has problems with inline Weave code. Here is a simple
> test script that I am using:
>
> #!/usr/bin/env python
> from scipy import weave
> input_val = 10
>
> code=r"""return_val = 10*input_val;"""
>
> print weave.inline(code, ['input_val'], headers=[], compiler='gcc')
>
> It runs successfully on one computer with the following configuration:
> Python 2.5.2 (r252:60911, Apr 17 2009, 18:42:17)
> [GCC 4.1.1 (Gentoo 4.1.1-r3)] on linux2
> Scipy 0.7.0
> Numpy 1.3.0
>
> On the other computer, it fails with the following message:
> Traceback (most recent call last):
>   File "test.py", line 12, in <module>
>     print weave.inline(code, ['input_val'], headers=[], compiler='gcc')
>   File "/home/cfinch/lib/python2.5/site-packages/scipy/weave/inline_tools.py", line 335, in inline
>     **kw)
>   File "/home/cfinch/lib/python2.5/site-packages/scipy/weave/inline_tools.py", line 468, in compile_function
>     exec 'import ' + module_name
>   File "<string>", line 1, in <module>
> ImportError: /home/cfinch/.python25_compiled/sc_71b2502f9a0b0ca9f89b0cdc7ad3819e0.so: undefined symbol: _ZNSt8ios_base4InitD1Ev
>
> I used nm to check, and that symbol is indeed present in the compiled .so file.
>
> The configuration of this computer is:
> Python 2.5.4 (r254:67916, Apr 22 2009, 15:52:10)
> [GCC 4.1.1 20070105 (Red Hat 4.1.1-52)] on linux2
> Scipy 0.7.0
> Numpy 1.3.0
> One
> complication is that the default compiler on this system is icc, and
> numpy distutils doesn't offer a way to force it to use gcc for
> everything.
>

Mixing C++ compilers is difficult. Icc goes to great lengths to be compatible with g++ on Linux, but it is highly version dependent. In particular, the C++ library often causes problems, which is the case here it seems. So you're right that you should use gcc to compile the extension. One thing you could do is to have a script called icc which calls gcc, I think this works if the script comes first in your path. That's a horrible hack, though.

cheers,

David
From dwf at cs.toronto.edu  Fri Jul 3 14:45:23 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Fri, 3 Jul 2009 14:45:23 -0400
Subject: [SciPy-user] Creating a Matrix from a Sum
In-Reply-To: <4A4CB556.6090203@gmail.com>
References: <4A4CB556.6090203@gmail.com>
Message-ID: 

On 2-Jul-09, at 9:25 AM, Lorenzo Isella wrote:

> Dear All,
> I need some help to efficiently write an array manipulation. I suspect
> this could be a one-liner.
> Assume that you have a histogram of observations that you store in a
> vector x.
> Let us say that its i-th entry, x_i, corresponds to the number of
> observations in the i-th channel, for i=1,2...N. (or 0,1...N-1, please
> let me know if there is any potential 0/1 pitfall in the following).
> Now, for any two channels i and j, I want to calculate the probability
> of having an observation in any other channel k, where k>=max(i,j).
> That is to say
>
> P(i,j)=sum_{k=max(i,j)}^N x_k /C,
> where C=sum_{i=1}^N x_i is just a normalization factor.

Hm, there might be a simpler/more efficient way but this is what I came up with as far as 1-liners:

P = np.max(np.concatenate(np.broadcast_arrays(x[np.newaxis, :, np.newaxis], x[:, np.newaxis, np.newaxis]),axis=2),axis=2); P /= x.sum()

From dwf at cs.toronto.edu  Fri Jul 3 15:01:44 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Fri, 3 Jul 2009 15:01:44 -0400
Subject: [SciPy-user] Creating a Matrix from a Sum
In-Reply-To: 
References: <4A4CB556.6090203@gmail.com> 
Message-ID: <7F843EB2-C294-4F1C-A1B6-19C1CF7BEDFA@cs.toronto.edu>

On 3-Jul-09, at 2:45 PM, David Warde-Farley wrote:

> On 2-Jul-09, at 9:25 AM, Lorenzo Isella wrote:
>
>> Dear All,
>> I need some help to efficiently write an array manipulation. I
>> suspect
>> this could be a one-liner.
>> Assume that you have a histogram of observations that you store in a
>> vector x.
>> Let us say that its i-th entry, x_i, corresponds to the number of
>> observations in the i-th channel, for i=1,2...N. (or 0,1...N-1,
>> please
>> let me know if there is any potential 0/1 pitfall in the following).
>> Now, for any two channels i and j, I want to calculate the
>> probability
>> of having an observation in any other channel k, where k>=max(i,j).
>> That is to say
>>
>> P(i,j)=sum_{k=max(i,j)}^N x_k /C,
>> where C=sum_{i=1}^N x_i is just a normalization factor.
>
> Hm, there might be a simpler/more efficient way but this is what I
> came up with as far as 1-liners:
>
> P = np.max(np.concatenate(np.broadcast_arrays(x[np.newaxis, :,
> np.newaxis], x[:, np.newaxis, np.newaxis]),axis=2),axis=2); P /=
> x.sum()

Oops, nevermind, that's wrong. Here's how I would do it.

You want the sum from k to the end (N) so I'd extract a vector like this (the cumulative sum of the reversed array, reversed again):

sums = np.cumsum(x[::-1])[::-1]

Then you want the max of i and j at each position in the matrix. For this, though there may be a more efficient way, you can use mgrid to get i and j indices at each position in the matrix and then use concatenate and max on them to concatenate along a third axis and then do a max along that same axis to recover a 2D array:

idx = np.concatenate([M[:,:,np.newaxis] for M in mgrid[0:len(x), 0:len(x)]],axis=2).max(axis=2)

Finally, use fancy indexing and divide by your normalizer:

P = sums[idx] / x.sum()

Or, as a oneliner, much less readable:

P = cumsum(x[::-1])[::-1][np.concatenate([M[:,:,np.newaxis] for M in mgrid[0:len(x), 0:len(x)]],axis=2).max(axis=2)] / x.sum()

HTH,
David
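Tracing the pieces of this construction on a tiny hypothetical histogram makes the indexing easier to follow:

    import numpy as np

    x = np.array([4., 3., 2., 1.])
    sums = np.cumsum(x[::-1])[::-1]  # array([ 10., 6., 3., 1.]) -- tail sums
    idx = np.concatenate([M[:, :, np.newaxis]
                          for M in np.mgrid[0:len(x), 0:len(x)]],
                         axis=2).max(axis=2)
    print idx                  # max(i, j) at every position
    print sums[idx] / x.sum()  # the normalized P matrix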
From tpk at kraussfamily.org  Fri Jul 3 21:18:40 2009
From: tpk at kraussfamily.org (Tom K.)
Date: Fri, 3 Jul 2009 18:18:40 -0700 (PDT)
Subject: [SciPy-user] Creating a Matrix from a Sum
In-Reply-To: <7F843EB2-C294-4F1C-A1B6-19C1CF7BEDFA@cs.toronto.edu>
References: <4A4CB556.6090203@gmail.com> <7F843EB2-C294-4F1C-A1B6-19C1CF7BEDFA@cs.toronto.edu>
Message-ID: <24330442.post@talk.nabble.com>

David Warde-Farley-2 wrote:
>
>
> On 3-Jul-09, at 2:45 PM, David Warde-Farley wrote:
>
>> On 2-Jul-09, at 9:25 AM, Lorenzo Isella wrote:
>
> Then you want the max of i and j at each position in the matrix. For
> this, though there may be a more efficient way, you can use mgrid to
> get i and j indices at each position in the matrix and then use
> concatenate and max on them to concatenate along a third axis and then
> do a max along that same axis to recover a 2D array:
>
> idx = np.concatenate([M[:,:,np.newaxis] for M in mgrid[0:len(x),
> 0:len(x)]],axis=2).max(axis=2)
>

David this is interesting however I think the use of mgrid followed by concatenation into a 3D array here is overkill. The idx array is the same as the max_ij array that I suggested - a one liner for it is

max_ij = np.maximum(np.arange(len(x))[:,np.newaxis], np.arange(len(x)))

This problem involves O(N) sums and O(N^2) assignments. The vectorized versions that David and I posted (my version 2) are doing just that much work - however with overhead of several intermediate arrays. A more straight-forward approach in python is just a double for-loop, which may be the most readable and perfectly sufficient unless your N is huge and/or this function becomes the most significant item in a profile of your application.

sums = np.cumsum(x[::-1])[::-1]
P = np.zeros((len(x), len(x)))
for i in range(len(x)):
    for j in range(len(x)):
        P[i, j] = sums[np.max((i, j))]

#Here it is with list comprehensions:
P = np.array([[sums[np.max((i, j))] for j in range(len(x))] for i in range(len(x))])

By the way I do NOT recommend using the first method I posted in my earlier post - I think it is doing O(N^3) adds and O(N^3) assigns!
-- 
View this message in context: http://www.nabble.com/Creating-a-Matrix-from-a-Sum-tp24306698p24330442.html
Sent from the Scipy-User mailing list archive at Nabble.com.
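As a side note, the max_ij matrix also has a compact ufunc spelling via np.maximum.outer -- a sketch, equivalent to the broadcasting one-liner above:

    import numpy as np

    n = 5
    max_ij = np.maximum.outer(np.arange(n), np.arange(n))
    # same result as: np.maximum(np.arange(n)[:, np.newaxis], np.arange(n))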
From cournape at gmail.com  Fri Jul 3 22:21:28 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 4 Jul 2009 11:21:28 +0900
Subject: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension
In-Reply-To: <829316.46979.qm@web30408.mail.mud.yahoo.com>
References: <524620.24680.qm@web30402.mail.mud.yahoo.com> <4A4DB108.8050407@ar.media.kyoto-u.ac.jp> <829316.46979.qm@web30408.mail.mud.yahoo.com>
Message-ID: <5b8d13220907031921v24626b17hc353705f766435d6@mail.gmail.com>

On Sat, Jul 4, 2009 at 3:38 AM, Craig Finch wrote:
> ImportError: /home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/_csr.so: undefined symbol: __gxx_personality_v0
>
> As far as I know, everything Python-related on this machine has now been built with GNU tools, but something is still broken. One more thing--Python, Numpy, and Scipy are all locally installed in my /home/cfinch directory because the system Python is frozen at 2.4.

This is still a C++ problem, I have sometimes noticed this problem on some machines. Normally this is caused by a C++ object file being linked with gcc (instead of g++). Can you post the build log of scipy ?

David

From dwf at cs.toronto.edu  Sat Jul 4 06:56:27 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Sat, 4 Jul 2009 06:56:27 -0400
Subject: [SciPy-user] Creating a Matrix from a Sum
In-Reply-To: <24330442.post@talk.nabble.com>
References: <4A4CB556.6090203@gmail.com> <7F843EB2-C294-4F1C-A1B6-19C1CF7BEDFA@cs.toronto.edu> <24330442.post@talk.nabble.com>
Message-ID: <9483AA21-7F65-4D53-ABD2-3FD31339C9D7@cs.toronto.edu>

On 3-Jul-09, at 9:18 PM, Tom K. wrote:

> A more
> straight-forward approach in python is just a double for-loop, which
> may be
> the most readable and perfectly sufficient unless your N is huge and/
> or this
> function becomes the most significant item in a profile of your
> application.
>
> sums = np.cumsum(x[::-1])[::-1]
> P = np.zeros((len(x), len(x)))
> for i in range(len(x)):
>     for j in range(len(x)):
>         P[i, j] = sums[np.max((i, j))]
>
> #Here it is with list comprehensions:
> P = np.array([[sums[np.max((i, j))] for j in range(len(x))] for i in
> range(len(x))])

Before I begin, a quick tip: replace range() with xrange() to do the same with less memory (range() causes a list allocation, xrange() provides a constant-size generator).

This is an instance where the intuition gleaned from asymptotic notation fails to really address one critical piece of the puzzle: both the implementation above and the implementation I posted are doing roughly the same amount of work (mine is doing a little more in the form of copies and allocations), but one is doing it in C whereas the other is doing it in interpreted Python.

It also depends on your definition of a "huge" problem. Speed wise, Python-vs-C starts to matter very, very quickly, i.e. on the order of hundreds of elements, and by 1000 things get really horrible:

In [144]: def python_impl(x):
    P = np.zeros((len(x), len(x)))
    sums = np.cumsum(x[::-1])[::-1]
    for i in range(len(x)):
        for j in range(len(x)):
            P[i, j] = sums[np.max((i,j))]
   .....:

In [150]: def numpy_impl(x):
    sums = np.cumsum(x[::-1])[::-1]
    idx = np.concatenate([M[:,:,np.newaxis] for M in np.mgrid[0:len(x), 0:len(x)]],axis=2).max(axis=2)
    P = sums[idx] / x.sum()
   .....:

In [154]: x = randn(1000)

In [155]: timeit numpy_impl(x)
1 loops, best of 3: 200 ms per loop

In [156]: timeit python_impl(x)
1 loops, best of 3: 37.7 s per loop

In [157]: x = randn(5000)

In [158]: timeit numpy_impl(x)
1 loops, best of 3: 5.41 s per loop

I did start running python_impl with len(x) = 5000 but my back of the envelope calculation is that it will be taking at least 1 hour per try, so timeit will be done in about 3 hours. :(

The overhead of the Python nested loops may be a showstopper even for moderately sized problems. In a lot of these situations I find I run out of patience far before I run out of memory to throw around. :)

David

From stefan at sun.ac.za  Sat Jul 4 08:23:01 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Sat, 4 Jul 2009 14:23:01 +0200
Subject: [SciPy-user] assign to diagonal values?
In-Reply-To: 
References: <47A128A5.7010406@sci.utah.edu> <2bc7a5a50801302128g79da7ea6jc95ef9abb00b3785@mail.gmail.com> 
Message-ID: <9457e7c80907040523g734aa3c0nab686af1476a731d@mail.gmail.com>

2009/7/1 Fernando Perez :
> If someone feels like reviewing this ticket:
>
> http://projects.scipy.org/numpy/attachment/ticket/1132/numpy-index-funcs.diff
>
> it has the above and a few more utilities, with docs and tests.

Thanks! Reviewed and applied in r7101 through r7103.

Cheers
Stéfan

From tpk at kraussfamily.org  Sat Jul 4 09:31:18 2009
From: tpk at kraussfamily.org (Tom K.)
Date: Sat, 4 Jul 2009 06:31:18 -0700 (PDT)
Subject: [SciPy-user] Access to IPython's timeit result
Message-ID: <24334619.post@talk.nabble.com>

Hi,
Looking at timeit in IPython, you get a nice output e.g.

In [14]: timeit x=range(100000)
100 loops, best of 3: 5.75 ms per loop

but what if I want to get access to that 5.75 ms number so I can write a program to capture these over a list of "N" values and make a plot of time versus "N"?

- Tom K.
-- 
View this message in context: http://www.nabble.com/Access-to-IPython%27s-timeit-result-tp24334619p24334619.html
Sent from the Scipy-User mailing list archive at Nabble.com.
From zunzun at zunzun.com  Sat Jul 4 12:22:23 2009
From: zunzun at zunzun.com (James Phillips)
Date: Sat, 4 Jul 2009 11:22:23 -0500
Subject: [SciPy-user] Additional Kern-ification of zunzun.com
Message-ID: <268756d30907040922k76cfdde1gc73cfc7cdbab4fcd@mail.gmail.com>

ODR is now one of the available fitting targets at http://zunzun.com, and again my thanks to Robert Kern.

James Phillips
http://zunzun.com

On Fri, Jun 03, 2005 at 02:40:09AM -0700, Robert Kern wrote:
>
> I always use the full-blown nonlinear implementation,
> which is not yet integrated into scipy.
>
> http://starship.python.net/crew/kernr/source/odr-0.6.1.tar.gz

Say, that is very interesting work. May I add it to my web site as a fitting method?

I'm also curious as to when it might make it into the SciPy distribution.

James Phillips
http://zunzun.com

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From jkington at wisc.edu  Sat Jul 4 14:44:34 2009
From: jkington at wisc.edu (Joe Kington)
Date: Sat, 4 Jul 2009 13:44:34 -0500
Subject: [SciPy-user] Access to IPython's timeit result
In-Reply-To: 
References: <24334619.post@talk.nabble.com> 
Message-ID: 

Hi Tom,

Have you looked at the timeit module?

http://docs.python.org/library/timeit.html

At least in 2.6, timeit.timeit() is builtin. In older versions, you just need to build a timeit.Timer class instance and call its timeit() function, which returns an average over a number of executions (default=1000000).

If you really want the "best of 3" like ipython gives you, you can use the repeat function of the timeit.Timer class to return a list of the execution times for each run.

Hope that helps,
-Joe

On Sat, Jul 4, 2009 at 12:02 PM, Joe Kington wrote:
> Hi Tom,
>
> Have you looked at the timeit module?
>
> http://docs.python.org/library/timeit.html
>
> At least in 2.6, timeit.timeit() is builtin. In older versions, you just
> need to build a timeit.Timer class instance and call its timeit() function,
> which returns an average over a number of executions (default=1000000).
>
> If you really want the "best of 3" like ipython gives you, you can use the
> repeat function of the timeit.Timer class to return a list of the execution
> times for each run.
>
> Hope that helps,
> -Joe
>
> On Sat, Jul 4, 2009 at 8:31 AM, Tom K. wrote:
>
>>
>> Hi,
>> Looking at timeit in IPython, you get a nice output e.g.
>>
>> In [14]: timeit x=range(100000)
>> 100 loops, best of 3: 5.75 ms per loop
>>
>> but what if I want to get access to that 5.75 ms number so I can write a
>> program to capture these over a list of "N" values and make a plot of time
>> versus "N"?
>>
>> - Tom K.
>> --
>> View this message in context:
>> http://www.nabble.com/Access-to-IPython%27s-timeit-result-tp24334619p24334619.html
>> Sent from the Scipy-User mailing list archive at Nabble.com.
>>
>> _______________________________________________
>> SciPy-user mailing list
>> SciPy-user at scipy.org
>> http://mail.scipy.org/mailman/listinfo/scipy-user
>>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
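A minimal sketch of the approach Joe describes, collecting a best-of-3 per-loop time for each problem size so the numbers can be plotted against N afterwards (the statement string here is illustrative):

    import timeit

    results = []
    for n in [10**3, 10**4, 10**5]:
        t = timeit.Timer('x = range(%d)' % n)
        best = min(t.repeat(repeat=3, number=100)) / 100.0  # seconds per loop
        results.append((n, best))
        print n, best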
From dwf at cs.toronto.edu  Sat Jul 4 16:25:03 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Sat, 4 Jul 2009 16:25:03 -0400
Subject: [SciPy-user] Creating a Matrix from a Sum
In-Reply-To: <9483AA21-7F65-4D53-ABD2-3FD31339C9D7@cs.toronto.edu>
References: <4A4CB556.6090203@gmail.com> <7F843EB2-C294-4F1C-A1B6-19C1CF7BEDFA@cs.toronto.edu> <24330442.post@talk.nabble.com> <9483AA21-7F65-4D53-ABD2-3FD31339C9D7@cs.toronto.edu>
Message-ID: 

> In [157]: x = randn(5000)
>
> In [158]: timeit numpy_impl(x)
> 1 loops, best of 3: 5.41 s per loop
>
> I did start running python_impl with len(x) = 5000 but my back of the
> envelope calculation is that it will be taking at least 1 hour per
> try, so timeit will be done in about 3 hours. :(

So, things didn't scale quite as badly as I thought they would (I was running on very little sleep, heh)

In [157]: x = randn(5000)

In [158]: timeit numpy_impl(x)
1 loops, best of 3: 5.41 s per loop

In [159]: timeit python_impl(x)
1 loops, best of 3: 1.01e+12 ns per loop

Why IPython gave me an answer in nanoseconds I'm not sure... But that's about 17 minutes.

In either case, then, you're looking at roughly a 200x speedup of the numpy version over the python loops, which is quite significant. If you can spare the ~2N^2 intermediate memory usage, it's definitely worth it.

If you can't, but you still can't tolerate the speed hit, you can run the Python implementation Tom gave through Cython and add type specifiers for i, j, P and x, see http://docs.cython.org/docs/numpy_tutorial.html - that should achieve similar speedups without the memory waste.

David

From oanjao at yahoo.com  Sun Jul 5 14:30:18 2009
From: oanjao at yahoo.com (Craig Finch)
Date: Sun, 5 Jul 2009 11:30:18 -0700 (PDT)
Subject: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension
In-Reply-To: <5b8d13220907031921v24626b17hc353705f766435d6@mail.gmail.com>
References: <524620.24680.qm@web30402.mail.mud.yahoo.com> <4A4DB108.8050407@ar.media.kyoto-u.ac.jp> <829316.46979.qm@web30408.mail.mud.yahoo.com> <5b8d13220907031921v24626b17hc353705f766435d6@mail.gmail.com>
Message-ID: <783243.63027.qm@web30408.mail.mud.yahoo.com>

By "build log" I assume you mean the output to stderr and stdout when building Scipy. It's a huge amount of output, and I'm not sure exactly what to look for. Do you want me to post a particular section? Below is the output from sections where g++ is called. It seems to be linked with g++ everywhere.

One more potential problem area: I'm linking with the Intel Math Kernel Library (MKL) which was almost certainly built with Intel compilers. Could that be causing this problem?
Craig ----------------------- building 'scipy.interpolate._interpolate' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC compile options: '-Iscipy/interpolate/src -I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/interpolate/src/_interpolate.cpp g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/interpolate/src/_interpolate.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/interpolate/_interpolate.so ----------------------- building 'scipy.sparse.sparsetools._csr' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC creating build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/sparse/sparsetools/csr_wrap.cxx g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/csr_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_csr.so building 'scipy.sparse.sparsetools._csc' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/sparse/sparsetools/csc_wrap.cxx g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/csc_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_csc.so building 'scipy.sparse.sparsetools._coo' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/sparse/sparsetools/coo_wrap.cxx g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/coo_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_coo.so building 'scipy.sparse.sparsetools._bsr' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/sparse/sparsetools/bsr_wrap.cxx g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/bsr_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_bsr.so building 'scipy.sparse.sparsetools._dia' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/sparse/sparsetools/dia_wrap.cxx g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/dia_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_dia.so ----------------------- ----- Original Message ---- From: David Cournapeau To: Craig Finch ; SciPy Users List Sent: Friday, July 3, 2009 10:21:28 PM Subject: Re: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension On Sat, Jul 4, 2009 at 3:38 AM, Craig Finch wrote: > ImportError: /home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/_csr.so: undefined symbol: __gxx_personality_v0 > > As far as I know, everything Python-related on this machine has now been 
built with GNU tools, but something is still broken. One more thing--Python, Numpy, and Scipy are all locally installed in my /home/cfinch directory because the system Python is frozen at 2.4.

This is still a C++ problem, I have sometimes noticed this problem on some machines. Normally this is caused by a C++ object file being linked with gcc (instead of g++). Can you post the build log of scipy ?

David

From d_l_goldsmith at yahoo.com  Sun Jul 5 23:19:45 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Sun, 5 Jul 2009 20:19:45 -0700 (PDT)
Subject: [SciPy-user] The baffling behavior that just won't die
Message-ID: <753940.28516.qm@web52101.mail.re2.yahoo.com>

Hi, folks. I'm having a problem using numpy.lookfor() that is very reminiscent of this thread:

http://mail.scipy.org/pipermail/scipy-user/2008-June/017294.html

The simplest reproduction of the problem is illustrated thusly:

c:\Python25\Lib\site-packages\numpy>dir random.py
 Volume in drive C is SQ004816V03
 Volume Serial Number is 8E67-90B7

 Directory of c:\Python25\Lib\site-packages\numpy

File Not Found

c:\Python25\Lib\site-packages\numpy>python
Python 2.5.4 (r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy as np
>>> 'lookfor' in dir(np)
True
>>> help(np.lookfor)
cannot import name Random

And that's all she wrote. I read the thread cited above and it led me to examine the goings-on using python -v; as warned, there was a waterfall of imports, but at the end of it all was:

. : 
# C:\Python25\lib\repr.pyc matches C:\Python25\lib\repr.py
import repr # precompiled from C:\Python25\lib\repr.pyc
import collections # builtin
# C:\Python25\lib\tempfile.pyc matches C:\Python25\lib\tempfile.py
import tempfile # precompiled from C:\Python25\lib\tempfile.pyc
import random # directory random
# random\__init__.pyc matches random\__init__.py
import random # precompiled from random\__init__.pyc
# random\info.pyc matches random\info.py
import random.info # precompiled from random\info.pyc
import random.mtrand # dynamically loaded from random\mtrand.pyd
cannot import name Random

Please help. Thanks!

DG
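One likely culprit, consistent with the `import random # directory random` line in the -v output above: the interpreter was started inside site-packages\numpy, so the bare `import random` that the -v trace shows tempfile performing (right before it fails with `cannot import name Random`) picks up the local numpy `random` package from the current directory instead of the standard-library module, and that package has no Random class. A quick check, run from the same directory:

    import random
    print random.__file__  # stdlib gives ...\lib\random.py(c); a shadowed import points into numpy\random

If that is what is happening, starting the interpreter from any other directory should make help(np.lookfor) behave.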
From lorenzo.isella at gmail.com  Mon Jul 6 04:18:07 2009
From: lorenzo.isella at gmail.com (Lorenzo Isella)
Date: Mon, 06 Jul 2009 10:18:07 +0200
Subject: [SciPy-user] Finding Gaps in an Array
Message-ID: <4A51B33F.7060306@gmail.com>

Dear All,
Suppose you have an integer array whose entries are like the one below (no entry is ever repeated and the array is sorted in increasing order)

1240070020
1240070040
1240070060
1240070080
1240070100
1240070180
1240070200
1240070620
1240070640
1240070880
1240070900
1240070940
1240070980
1240071000
1240071020
1240071040
1240071060
1240071080
1240071100
1240071120
1240071140
1240071160
1240071180
1240071200
1240071220
1240071240
1240071260
1240071280
1240071300

Let us assume that the entries represent contact times (in seconds) between 2 objects (one does not need to know anything else for this post). The value of the initial time does not matter (i.e. you could subtract it from all the elements in the array to have a list starting from zero). Now, you define the contact duration as the time spanned by consecutive entries evenly spaced by 20 seconds.
E.g. in the array above, the first 5 elements define a contact of duration 1240070100-1240070020=80. Then there is an 80-sec gap (1240070180-1240070100) followed by a contact of duration 20 (1240070200-1240070180).
How can you automatically calculate these intervals?
Any suggestion is welcome.
Cheers

Lorenzo

From david_baddeley at yahoo.com.au  Mon Jul 6 08:01:14 2009
From: david_baddeley at yahoo.com.au (David Baddeley)
Date: Mon, 6 Jul 2009 05:01:14 -0700 (PDT)
Subject: [SciPy-user] Finding Gaps in an Array
In-Reply-To: <4A51B33F.7060306@gmail.com>
References: <4A51B33F.7060306@gmail.com>
Message-ID: <672254.30337.qm@web33004.mail.mud.yahoo.com>

Hi Lorenzo,

numpy.diff(your_array)

will give you the times between entries,

numpy.where(numpy.diff(your_array) > 20)

will give the indices where the gap was longer than 20

cheers,
David

----- Original Message ----
From: Lorenzo Isella
To: scipy-user at scipy.org
Sent: Monday, 6 July, 2009 8:18:07 PM
Subject: [SciPy-user] Finding Gaps in an Array

Dear All,
Suppose you have an integer array whose entries are like the one below (no entry is ever repeated and the array is sorted in increasing order)

1240070020
1240070040
1240070060
1240070080
1240070100
1240070180
1240070200
1240070620
1240070640
1240070880
1240070900
1240070940
1240070980
1240071000
1240071020
1240071040
1240071060
1240071080
1240071100
1240071120
1240071140
1240071160
1240071180
1240071200
1240071220
1240071240
1240071260
1240071280
1240071300

Let us assume that the entries represent contact times (in seconds) between 2 objects (one does not need to know anything else for this post). The value of the initial time does not matter (i.e. you could subtract it from all the elements in the array to have a list starting from zero). Now, you define the contact duration as the time spanned by consecutive entries evenly spaced by 20 seconds.
E.g. in the array above, the first 5 elements define a contact of duration 1240070100-1240070020=80. Then there is an 80-sec gap (1240070180-1240070100) followed by a contact of duration 20 (1240070200-1240070180).
How can you automatically calculate these intervals?
Any suggestion is welcome.
Cheers

Lorenzo

_______________________________________________
SciPy-user mailing list
SciPy-user at scipy.org
http://mail.scipy.org/mailman/listinfo/scipy-user

From adrian.prw at gmail.com  Mon Jul 6 10:18:31 2009
From: adrian.prw at gmail.com (Adrian Price-Whelan)
Date: Mon, 6 Jul 2009 10:18:31 -0400
Subject: [SciPy-user] Misunderstanding scipy.interpolate.interp2d?
In-Reply-To: 
References: 
Message-ID: 

Hey all --

Perhaps I am misunderstanding the documentation for the function scipy.interpolate.interp2d, or maybe I just don't understand the math of how it works. If it is a true 2-D interpolating function, why would it take 2 1-dimensional arrays as input? For instance, say I had image data in a 2-D array that I wanted to shift by some sub-pixel amount, like 0.3 pixels, to some arbitrary direction. How would I take an array ( something like [[1,2,3],[4,5,6],[7,8,9]] ) and break this into 2 1-D arrays x and y to use this function to interpolate new values based on the sub-pixel shift?

Thanks!!
-Adrian
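A hedged sketch of how the two 1-D inputs are meant to be used -- they are the x and y coordinates of the grid on which the 2-D data sits, and the fitted interpolant can then be evaluated at shifted coordinates (illustrative only; see josef's reply below):

    import numpy as np
    from scipy.interpolate import interp2d

    z = np.array([[1., 2., 3.],
                  [4., 5., 6.],
                  [7., 8., 9.]])
    x = np.arange(3.)   # column (x) coordinates of the grid
    y = np.arange(3.)   # row (y) coordinates of the grid

    f = interp2d(x, y, z, kind='linear')
    xi = x[:-1] + 0.3   # shifted sample points that stay inside the grid
    shifted = f(xi, y)  # interpolated values at the 0.3-pixel x shift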
From lorenzo.isella at gmail.com  Mon Jul 6 13:22:07 2009
From: lorenzo.isella at gmail.com (Lorenzo Isella)
Date: Mon, 06 Jul 2009 19:22:07 +0200
Subject: [SciPy-user] Finding Gaps in an Array
In-Reply-To: 
References: 
Message-ID: <4A5232BF.5080802@gmail.com>

> Date: Mon, 06 Jul 2009 10:18:07 +0200
> From: Lorenzo Isella
> Subject: [SciPy-user] Finding Gaps in an Array
> To: scipy-user at scipy.org
> Message-ID: <4A51B33F.7060306 at gmail.com>
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> Dear All,
> Suppose you have an integer array whose entries are like the one below
> (no entry is ever repeated and the array is sorted in increasing order)
>
> 1240070020
> 1240070040
> 1240070060
> 1240070080
> 1240070100
> 1240070180
> 1240070200
> 1240070620
> 1240070640
> 1240070880
> 1240070900
> 1240070940
> 1240070980
> 1240071000
> 1240071020
> 1240071040
> 1240071060
> 1240071080
> 1240071100
> 1240071120
> 1240071140
> 1240071160
> 1240071180
> 1240071200
> 1240071220
> 1240071240
> 1240071260
> 1240071280
> 1240071300
>
> Let us assume that the entries represent contact times (in seconds)
> between 2 objects (one does not need to know anything else for this
> post). The value of the initial time does not matter (i.e. you could
> subtract it from all the elements in the array to have a list starting
> from zero). Now, you define the contact duration as the time spanned by
> consecutive entries evenly spaced by 20 seconds.
> E.g. in the array above, the first 5 elements define a contact of
> duration 1240070100-1240070020=80. Then there is an 80-sec gap
> (1240070180-1240070100) followed by a contact of duration 20
> (1240070200-1240070180).
> How can you automatically calculate these intervals?
> Any suggestion is welcome.
> Cheers
>
> Lorenzo
>
>
>
> Date: Mon, 6 Jul 2009 05:01:14 -0700 (PDT)
> From: David Baddeley
> Subject: Re: [SciPy-user] Finding Gaps in an Array
> To: SciPy Users List
> Message-ID: <672254.30337.qm at web33004.mail.mud.yahoo.com>
> Content-Type: text/plain; charset=utf-8
>
>
> Hi Lorenzo,
>
> numpy.diff(your_array)
>
> will give you the times between entries,
>
> numpy.where(numpy.diff(your_array) > 20)
>
> will give the indices where the gap was longer than 20
>
> cheers,
> David
>
>

Thanks David,
But I also need something more.
Let us make it real easy. Consider the array

import numpy as n
f=n.array([0,1,2,4,9,22,23,24,32,33,59,60,76])

I want to calculate the length of the subarrays consisting of numbers evenly spaced by 1 (like in the example, but now the gap is 1).
In the case of f, these subarrays are:

[0,1,2], [22,23,24], [32,33] and [59,60].

Hence, I would like to get back an array giving me the counts [3,3,2,2].

Does anyone know how to do this efficiently?
Cheers

Lorenzo

From pgmdevlist at gmail.com  Mon Jul 6 13:45:33 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Mon, 6 Jul 2009 13:45:33 -0400
Subject: [SciPy-user] Finding Gaps in an Array
In-Reply-To: <4A5232BF.5080802@gmail.com>
References: <4A5232BF.5080802@gmail.com>
Message-ID: <687BC23D-6850-44AC-B008-1C404498E1E5@gmail.com>

On Jul 6, 2009, at 1:22 PM, Lorenzo Isella wrote:
>
> Thanks David,
> But I also need something more.
> Let us make it real easy. Consider the array
>
> import numpy as n
> f=n.array([0,1,2,4,9,22,23,24,32,33,59,60,76])
>
> I want to calculate the length of the subarrays consisting of numbers
> evenly spaced by 1 (like in the example, but now the gap is 1).
> In the case of f, these subarrays are:
> > Hence, I would like to get back an array giving me the counts > [3,3,2,2]. > > Does anyone know how to do this efficiently? > Cheers > > Lorenzo

Lorenzo, I have implemented an object that should help you in scikits.hydroclimpy: http://hydroclimpy.sourceforge.net/ More specifically: http://hydroclimpy.sourceforge.net/core.objects.html#cluster-object Roughly, given an array and an increment parameter, Cluster computes a sequence of slices that differ by less than increment (or more than increment, depending on yet another parameter). Here's the relevant portion of the code. Note: in an upcoming version, this code will be implemented in Cython, and renamed to Clumps (as the name Cluster is misleading).

##########
class Cluster(object):
    """
    Groups consecutive data from an array according to a clustering condition.

    A cluster is defined as a group of consecutive values differing by at most
    the increment value.

    Missing values are **not** handled: the input sequence must therefore be
    free of missing values.

    Parameters
    ----------
    darray : ndarray
        Input data array to clusterize.
    increment : {float}, optional
        Increment between two consecutive values to group.
        By default, use a value of 1.
    operator : {function}, optional
        Comparison operator for the definition of clusters.
        By default, use :func:`numpy.less_equal`.

    Attributes
    ----------
    inishape
        Shape of the argument array (stored for resizing).
    inisize
        Size of the argument array.
    uniques : sequence
        List of unique cluster values, as they appear in chronological order.
    slices : sequence
        List of the slices corresponding to each cluster of data.
    starts : ndarray
        Array of the indices at which the clusters start.
    clustered : list
        List of clustered data.

    Examples
    --------
    >>> A = [0, 0, 1, 2, 2, 2, 3, 4, 3, 4, 4, 4]
    >>> klust = cluster(A, 0)
    >>> [list(_) for _ in klust.clustered]
    [[0, 0], [1], [2, 2, 2], [3], [4], [3], [4, 4, 4]]
    >>> klust.uniques
    array([0, 1, 2, 3, 4, 3, 4])

    >>> x = [ 1.8, 1.3, 2.4, 1.2, 2.5, 3.9, 1. , 3.8, 4.2, 3.3,
    ...       1.2, 0.2, 0.9, 2.7, 2.4, 2.8, 2.7, 4.7, 4.2, 0.4]
    >>> Cluster(x, 1).starts
    array([ 0,  2,  3,  4,  5,  6,  7, 10, 11, 13, 17, 19])
    >>> Cluster(x, 1.5).starts
    array([ 0,  6,  7, 10, 13, 17, 19])
    >>> Cluster(x, 2.5).starts
    array([ 0,  6,  7, 19])
    >>> Cluster(x, 2.5, greater).starts
    array([ 0,  1,  2,  3,  4,  5,  8,  9, 10,
    ...    11, 12, 13, 14, 15, 16, 17, 18])
    >>> y = [ 0, -1, 0, 0, 0, 1, 1, -1, -1, -1, 1, 1, 0, 0, 0, 0, 1, 1, 0, 0]
    >>> Cluster(y, 1).starts
    array([ 0,  1,  2,  5,  7, 10, 12, 16, 18])
    """

    def __init__(self, darray, increment=1, operator=np.less_equal):
        """
        Initializes instance.

        Parameters
        ----------
        darray : ndarray
            Input data array to clusterize.
        increment : {float}, optional
            Increment between two consecutive values to group.
            By default, use a value of 1.
        operator : {function}, optional
            Comparison operator for the definition of clusters.
            By default, use :func:`numpy.less_equal`
        """
        if hasattr(darray, 'mask') and darray.mask.any():
            raise ma.MAError("Masked arrays should be filled prior clustering.")
        else:
            darray = np.asanyarray(darray)
        n = darray.size
        self.inishape = darray.shape
        self.inisize = darray.size
        darray = darray.ravel()
        clustercond = 1 - operator(np.absolute(np.diff(darray)), increment)
        sid = np.concatenate(([0, ],
                              np.arange(1, n).compress(clustercond),
                              [n, ]))
        slobj = np.asarray([slice(i, d)
                            for (i, d) in np.broadcast(sid[:-1], sid[1:])])
        #
        self.uniques = darray[sid[:-1]]
        self.clustered = [darray[k] for k in slobj]
        self.sizes = np.asarray(np.diff(sid))
        self.slices = slobj
        self.starts = sid[:-1]

    def markonsize(self, operator, sizethresh):
        """
        Creates a **mask** for the clusters that do not meet a size
        requirement. Thus, outputs ``False`` if the size requirement is met,
        ``True`` otherwise.

        Parameters
        ----------
        operator : function
            Comparison operator
        sizethresh : float
            Requirement for the sizes of the clusters
        """
        resmask = np.empty(self.inisize, dtype=bool)
        resmask[:] = True
        # for k in self.slices.compress(operator(self.sizes,sizethresh)):
        for k in self.slices[operator(self.sizes, sizethresh)]:
            resmask[k] = False
        return resmask.reshape(self.inishape)

    def mark_greaterthan(self, sizemin):
        """
        Shortcut for :meth:`markonsize(greater_equal,sizemin)`.
        Thus, the command outputs ``False`` for clusters larger than
        ``sizemin``, and ``True`` for clusters smaller than ``sizemin``.

        Parameters
        ----------
        sizemin : int
            Minimum size of the clusters.

        See Also
        --------
        :meth:`markonsize`
            Creates a **mask** for the clusters that do not meet a size
            requirement.
        """
        return self.markonsize(np.greater_equal, sizemin)

    def grouped_slices(self):
        """
        Returns a dictionary with the unique values of ``self`` as keys, and a
        list of slices for the corresponding values.

        See Also
        --------
        Cluster.grouped_limits
            that does the same thing
        """
        #
        uniques = self.uniques.view(np.ndarray)
        output = dict([(k, []) for k in np.unique1d(uniques)])
        for (k, v) in zip(self.uniques, self.slices):
            output[k].append(v)
        return output

    def grouped_limits(self):
        """
        Returns a dictionary with the unique values of ``self`` as keys, and a
        list of tuples (starting index, ending index) for the corresponding
        values.

        See Also
        --------
        Cluster.grouped_slices
        """
        output = dict([(k, []) for k in np.unique1d(self.uniques)])
        for (k, v) in zip(self.uniques, self.slices):
            output[k].append((v.start, v.stop))
        for k in output:
            output[k] = np.array(output[k])
        return output
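For the simple evenly-spaced case above, a few lines of plain numpy would also do -- a minimal sketch (split at the gaps, then keep the runs of at least two consecutive values):

import numpy as np

f = np.array([0, 1, 2, 4, 9, 22, 23, 24, 32, 33, 59, 60, 76])
# split wherever consecutive entries are not exactly 1 apart
runs = np.split(f, np.where(np.diff(f) != 1)[0] + 1)
# lengths of the runs containing at least two consecutive values
counts = [len(r) for r in runs if len(r) >= 2]   # -> [3, 3, 2, 2]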
From jturner at gemini.edu Mon Jul 6 15:22:56 2009 From: jturner at gemini.edu (James Turner) Date: Mon, 06 Jul 2009 15:22:56 -0400 Subject: [SciPy-user] Scipy.org down? In-Reply-To: <4A4A74DB.50002@stsci.edu> References: <4A4A74DB.50002@stsci.edu> Message-ID: <4A524F10.5000206@gemini.edu>

I can't load it or ping it. Same thing from a remote ping web site. Thanks! James.

From josef.pktd at gmail.com Mon Jul 6 17:01:51 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 6 Jul 2009 17:01:51 -0400 Subject: [SciPy-user] Misunderstanding scipy.interpolate.interp2d? In-Reply-To: References: Message-ID: <1cd32cbb0907061401s588e756eid9df01ded1db7021@mail.gmail.com>

On Mon, Jul 6, 2009 at 10:18 AM, Adrian Price-Whelan wrote: > Hey all -- > > Perhaps I am misunderstanding the documentation for the function > scipy.interpolate.interp2d, or maybe I just don't understand the math > of how it works. If it is a true 2-D interpolating function, why would > it take 2 1-dimensional arrays as input? For instance, say I had image > data in a 2-D array that I wanted to shift by some sub-pixel amount, > like 0.3 pixels, to some arbitrary direction. How would I take an > array ( something like [[1,2,3],[4,5,6],[7,8,9]] ) and break this into > 2 1-D arrays x and y to use this function to interpolate new values > based on the sub-pixel shift? > > Thanks!! > -Adrian

x and y are the coordinates, z are the values. If I understand your example correctly then something like this should work:

import numpy as np
from scipy import interpolate

z = np.array([[1,2,3],[4,5,6],[7,8,9]])
#t,x = np.mgrid[0:3,0:3]   # for full grid values
x = np.arange(3)
y = np.arange(3)
ip = interpolate.interp2d(x,y,z)
xn = np.linspace(0,2,7)
yn = np.linspace(0,2,7)
print ip(xn,yn)   # interpolated value with call
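Continuing the snippet, the sub-pixel shift from the original question is then just an evaluation at shifted coordinates -- a sketch (the shifted grid is deliberately kept inside the original [0, 2] data range):

# continues the snippet above: x, y and ip are already defined
shift = 0.3
xs = x[:-1] + shift     # [0.3, 1.3], still inside the data range
ys = y[:-1] + shift
z_shifted = ip(xs, ys)  # 2x2 array of values on the shifted grid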
Josef

From oanjao at yahoo.com Mon Jul 6 19:16:42 2009 From: oanjao at yahoo.com (Craig Finch) Date: Mon, 6 Jul 2009 16:16:42 -0700 (PDT) Subject: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension Message-ID: <823916.80562.qm@web30402.mail.mud.yahoo.com>

I've been dealing with issues like this, on this one system, for a year now. At one point, I had a working combination of python, numpy, and scipy. I tried so many things that I can't really remember how I got to that point. I think I used gcc to build scipy 0.6 because I couldn't get it to work with icc, and I linked it with the Intel MKL successfully. However, when I upgraded to scipy-0.7.0 I am again facing this problem. The latest thing I tried was to build both Numpy and Scipy with the Intel C and Fortran compilers (icc and ifort). Numpy builds fine and passes all its unit tests. Scipy builds without errors or warnings, but I still have problems when trying to import any Scipy module that depends on "sparse." Build command:

python setup.py config --compiler=intel --fcompiler=intelem install --prefix=/home/cfinch 2>&1 > scipy_build_log.txt

Results:

>>> from scipy import sparse
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/__init__.py", line 6, in <module>
    from csr import *
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/csr.py", line 12, in <module>
    from sparsetools import csr_tocsc, csr_tobsr, csr_count_blocks, \
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/__init__.py", line 4, in <module>
    from csr import *
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/csr.py", line 7, in <module>
    import _csr
ImportError: /home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/_csr.so: undefined symbol: _ZNSt8ios_base4InitD1Ev

When I use nm to look at the symbols in _csr.so, I see that these are undefined, along with several dozen other Python-related symbols:

...
00000000000d2810 W _ZNSt6vectorIxSaIxEE6resizeEmx
00000000000d2fdc W _ZNSt6vectorIySaIyEE6resizeEmy
                 U _ZNSt8ios_base4InitC1Ev
                 U _ZNSt8ios_base4InitD1Ev

I have no idea what this symbol is, or where it's supposed to come from. Has anyone seen an error like this before? I am going to see what kind of support I can get from Intel. Craig

----- Original Message ---- From: Craig Finch To: David Cournapeau ; SciPy Users List Sent: Sunday, July 5, 2009 2:30:18 PM Subject: Re: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension

By "build log" I assume you mean the output to stderr and stdout when building Scipy. It's a huge amount of output, and I'm not sure exactly what to look for. Do you want me to post a particular section? Below is the output from sections where g++ is called. It seems to be linked with g++ everywhere. One more potential problem area: I'm linking with the Intel Math Kernel Library (MKL) which was almost certainly built with Intel compilers. Could that be causing this problem? Craig

-----------------------
building 'scipy.interpolate._interpolate' extension
compiling C++ sources
C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC
compile options: '-Iscipy/interpolate/src -I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c'
g++: scipy/interpolate/src/_interpolate.cpp
g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/interpolate/src/_interpolate.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/interpolate/_interpolate.so
-----------------------
building 'scipy.sparse.sparsetools._csr' extension
compiling C++ sources
C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC
creating build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools
compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c'
g++: scipy/sparse/sparsetools/csr_wrap.cxx
g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/csr_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_csr.so
building 'scipy.sparse.sparsetools._csc' extension
compiling C++ sources
C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC
compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c'
g++: scipy/sparse/sparsetools/csc_wrap.cxx
g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/csc_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_csc.so
building 'scipy.sparse.sparsetools._coo' extension
compiling C++ sources
C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC
compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c'
g++: scipy/sparse/sparsetools/coo_wrap.cxx
g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/coo_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_coo.so
building 'scipy.sparse.sparsetools._bsr' extension
compiling C++ sources
C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC
compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c'
g++: scipy/sparse/sparsetools/bsr_wrap.cxx
g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/bsr_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_bsr.so
building 'scipy.sparse.sparsetools._dia' extension
compiling C++ sources
C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC
compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c'
g++: scipy/sparse/sparsetools/dia_wrap.cxx
g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/dia_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_dia.so
-----------------------

----- Original Message ---- From: David Cournapeau To: Craig Finch ; SciPy Users List Sent: Friday, July 3, 2009 10:21:28 PM Subject: Re: [SciPy-user] Scipy weave errors: undefined symbols when importing
compiled extension

On Sat, Jul 4, 2009 at 3:38 AM, Craig Finch wrote: > ImportError: /home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/_csr.so: undefined symbol: __gxx_personality_v0 > > As far as I know, everything Python-related on this machine has now been built with GNU tools, but something is still broken. One more thing--Python, Numpy, and Scipy are all locally installed in my /home/cfinch directory because the system Python is frozen at 2.4. This is still a C++ problem, I have sometimes noticed this problem on some machines. Normally this is caused by a C++ object file being linked with gcc (instead of g++). Can you post the build log of scipy? David

From eadrogue at gmx.net Tue Jul 7 05:36:41 2009 From: eadrogue at gmx.net (Ernest Adrogué) Date: Tue, 7 Jul 2009 11:36:41 +0200 Subject: [SciPy-user] Scipy.org down? In-Reply-To: <4A524F10.5000206@gemini.edu> References: <4A4A74DB.50002@stsci.edu> <4A524F10.5000206@gemini.edu> Message-ID: <20090707093641.GA31531@doriath.local>

6/07/09 @ 15:22 (-0400), thus spake James Turner: > I can't load it or ping it. Same thing from a remote ping web site. It's working now. Ernest

From joshua.stults at gmail.com Tue Jul 7 06:28:45 2009 From: joshua.stults at gmail.com (Joshua Stults) Date: Tue, 7 Jul 2009 06:28:45 -0400 Subject: [SciPy-user] Bootstrap? Message-ID:

Hello, I was wondering if scipy had something similar to Octave/Matlab's empirical_rnd(). Here's the blurb from Octave's help describing the function:

-- Function File: empirical_rnd (N, DATA)
-- Function File: empirical_rnd (DATA, R, C)
-- Function File: empirical_rnd (DATA, SZ)
    Generate a bootstrap sample of size N from the empirical
    distribution obtained from the univariate sample DATA.

    If R and C are given create a matrix with R rows and C columns. Or
    if SZ is a vector, create a matrix of size SZ.

So basically you pass it an array of data, and it returns bootstrap samples (resampling from the array with replacement). I did a quick search on 'scipy bootstrap', 'scipy distributions' and 'scipy empirical_rnd', but didn't turn up anything promising. Any help / pointers greatly appreciated. Thanks. -- Joshua Stults Website: http://j-stults.blogspot.com

From josef.pktd at gmail.com Tue Jul 7 07:25:49 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 7 Jul 2009 07:25:49 -0400 Subject: [SciPy-user] Bootstrap? In-Reply-To: References: Message-ID: <1cd32cbb0907070425j4572c688r3f49b7e6ea2154bb@mail.gmail.com>

On Tue, Jul 7, 2009 at 6:28 AM, Joshua Stults wrote: > Hello, > > I was wondering if scipy had something similar to Octave/Matlab's > empirical_rnd(). Here's the blurb from Octave's help describing the > function: > > -- Function File: empirical_rnd (N, DATA) > -- Function File: empirical_rnd (DATA, R, C) > -- Function File: empirical_rnd (DATA, SZ) > Generate a bootstrap sample of size N from the empirical > distribution obtained from the univariate sample DATA. > > If R and C are given create a matrix with R rows and C columns. Or > if SZ is a vector, create a matrix of size SZ. > > So basically you pass it an array of data, and it returns bootstrap > samples (resampling from the array with replacement). > > I did a quick search on 'scipy bootstrap', 'scipy distributions' and > 'scipy empirical_rnd', but didn't turn up anything promising. Any > help / pointers greatly appreciated. Thanks.
> > -- > Joshua Stults > Website: http://j-stults.blogspot.com > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user >

I was looking for bootstrap in python a while ago and didn't find much except for one blog post. Drawing from the data array can be done with random integers as indices (d is the data array):

sample_size = len(d)
# Choose #sample_size members of d at random, with replacement
choices = numpy.random.random_integers(0, sample_size-1, sample_size)
sample = d[choices]
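To bootstrap a statistic, the same trick just goes in a loop -- a rough sketch (the function and the default of 1000 resamples are made up for illustration):

import numpy as np

def bootstrap(data, statistic, nboot=1000):
    """Evaluate `statistic` on `nboot` resamples (with replacement) of `data`."""
    data = np.asarray(data)
    n = len(data)
    reps = np.empty(nboot)
    for i in range(nboot):
        idx = np.random.randint(0, n, n)   # n indices drawn with replacement
        reps[i] = statistic(data[idx])
    return reps

# e.g. the bootstrap spread of the mean: bootstrap(d, np.mean).std()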
Josef

From eadrogue at gmx.net Tue Jul 7 08:07:52 2009 From: eadrogue at gmx.net (Ernest Adrogué) Date: Tue, 7 Jul 2009 14:07:52 +0200 Subject: [SciPy-user] Bootstrap? In-Reply-To: References: Message-ID: <20090707120752.GA31860@doriath.local>

7/07/09 @ 06:28 (-0400), thus spake Joshua Stults: > So basically you pass it an array of data, and it returns bootstrap > samples (resampling from the array with replacement).

You can use the random module from Python:

In [40]: d = (1,2,3,4)
In [41]: [random.choice(d) for i in range(len(d))]
Out[41]: [4, 4, 2, 1]
In [42]: [random.choice(d) for i in range(len(d))]
Out[42]: [4, 2, 4, 2]
In [43]: [random.choice(d) for i in range(len(d))]
Out[43]: [2, 1, 2, 3]

Bye. Ernest

From V.vanBeveren at rijnhuizen.nl Tue Jul 7 10:03:07 2009 From: V.vanBeveren at rijnhuizen.nl (Vincent van Beveren) Date: Tue, 7 Jul 2009 16:03:07 +0200 Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;) Message-ID: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl>

Hello everyone, I'm an engineer at Rijnhuizen, which is a research institute for plasma physics. We use Python to drive one of our main research projects; however, in the scientific circles in this institute Python (and SciPy) are largely unknown. Time to change this, I think :). However, since I am an engineer and not a scientist, my viewpoint on Python and SciPy is more of an engineering perspective: Open Source, Free, a modern language, functional programming, etc... I'm not entirely sure these are compelling arguments for a scientist to start working with Python (or at least not by themselves). So I was wondering how I might promote Python in the scientific community here at Rijnhuizen. I have a few questions: - In what aspects does SciPy excel, compared to say IDL or Matlab? - In what ways does it allow a scientist to be more effective? - How usable is SciPy for Plasma physics, molecular dynamics and nanolayer Surface and Interface Physics (the 3 main areas at Rijnhuizen)? - How stable is it compared to other packages (bugs, computation)? Any comments and insights are welcome! Regards, Vincent PS. Posted in scipy-dev by accident. ___ Ing. V. van Beveren Software Engineer, FOM Rijnhuizen E: V.vanBeveren at rijnhuizen.nl

From devicerandom at gmail.com Tue Jul 7 10:33:49 2009 From: devicerandom at gmail.com (ms) Date: Tue, 07 Jul 2009 15:33:49 +0100 Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;) In-Reply-To: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> Message-ID: <4A535CCD.9060503@gmail.com>

Vincent van Beveren wrote: > - In what aspects does SciPy excel, compared to say IDL or Matlab? I would say that its main strength is being a library of a full-featured, well-supported, cross-platform programming language. This means you can build arbitrarily complex applications using Python and Scipy, while it would be very painful with Matlab. > - In what ways does it allow a scientist to be more effective? See above. > - How usable is SciPy for Plasma physics, molecular dynamics and nanolayer Surface and Interface Physics (the 3 main areas at Rijnhuizen)? I don't think one could write molecular dynamics engines in python, it's anyway too slow (I hope someone can tell me I'm wrong!) But it can be very useful for all kinds of data analysis and not-enormously-heavy computations (that's what I use it for). m.

From gokhansever at gmail.com Tue Jul 7 10:58:51 2009 From: gokhansever at gmail.com (Gökhan SEVER) Date: Tue, 7 Jul 2009 09:58:51 -0500 Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;) In-Reply-To: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> Message-ID: <49d6b3500907070758t7d9907b8ob9b893ded99b4ff6@mail.gmail.com>

On Tue, Jul 7, 2009 at 9:03 AM, Vincent van Beveren < V.vanBeveren at rijnhuizen.nl> wrote: > Hello everyone, > > I'm an engineer at Rijnhuizen, which is a research institute for plasma > physics. We use Python to drive one of our main research projects; however, > in the scientific circles in this institute Python (and SciPy) are largely > unknown. Time to change this, I think :). However, since I am an engineer > and not a scientist, my viewpoint on Python and SciPy is more of an > engineering perspective: Open Source, Free, a modern language, > functional programming, etc... I'm not entirely sure these are compelling > arguments for a scientist to start working with Python (or at least not by > themselves). So I was wondering how I might promote Python in the scientific > community here at Rijnhuizen. I have a few questions: > > - In what aspects does SciPy excel, compared to say IDL or Matlab? > > - In what ways does it allow a scientist to be more effective? > > - How usable is SciPy for Plasma physics, molecular dynamics and > nanolayer Surface and Interface Physics (the 3 main areas at Rijnhuizen)? > > - How stable is it compared to other packages (bugs, computation)? > > Any comments and insights are welcome! > > Regards, > > Vincent > > PS. Posted in scipy-dev by accident. > ___ > Ing. V. van Beveren > Software Engineer, FOM Rijnhuizen > E: V.vanBeveren at rijnhuizen.nl > > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user >

Vincent, Here are a couple of sources that have inspired me to choose Python for my scientific data analysis and visualization tasks:

http://conference.scipy.org/proceedings/SciPy2008/
https://cirl.berkeley.edu/fperez/
A Demonstration of the 'IPython' Interactive Shell
Scientific Computing with Python
Matplotlib by John D. Hunter (watched on April 12, 2009)
''Sage: Unifying Mathematical Software''', video part 1, video part 2

-- Gökhan

From hoytak at cs.ubc.ca Tue Jul 7 11:29:30 2009 From: hoytak at cs.ubc.ca (Hoyt Koepke) Date: Tue, 7 Jul 2009 08:29:30 -0700 Subject: [SciPy-user] Why is SciPy better than Matlab or IDL?
;) In-Reply-To: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> Message-ID: <4db580fd0907070829mc6ce8fet1e4f211b92fc9b38@mail.gmail.com>

Hi Vincent, > - In what aspects does SciPy excel, compared to say IDL or Matlab? > > - In what ways does it allow a scientist to be more effective? A while ago, I wrote up a quick article on a few reasons that I think python is great for research stuff. It's a bit unpolished, but it gives 10 points that I think are really advantageous. It's at http://www.stat.washington.edu/~hoytak/blog/whypython.html. Let me know if anyone has feedback or comments on it; it's a work in progress at this point. --Hoyt ++++++++++++++++++++++++++++++++++++++++++++++++ + Hoyt Koepke + University of Washington Department of Statistics + http://www.stat.washington.edu/~hoytak/ + hoytak at gmail.com ++++++++++++++++++++++++++++++++++++++++++

From oanjao at yahoo.com Tue Jul 7 11:42:02 2009 From: oanjao at yahoo.com (Craig Finch) Date: Tue, 7 Jul 2009 08:42:02 -0700 (PDT) Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ; ) (Vincent van Beveren) In-Reply-To: References: Message-ID: <767369.32981.qm@web30401.mail.mud.yahoo.com>

As a PhD candidate working in computational nanoscience, I have found Python to be extremely helpful. I had extensive experience using Matlab before starting my research. I will tell you about the incident that prompted me to switch to Python. I wrote a GUI for a data analysis program using Matlab. It worked well for about six months, and then I decided to make some changes in the GUI. I went back to look at my code and I had no idea what was going on. I spent a few hours trying to figure out the mess and decided it would just be faster to do it in Python, even though I was much less familiar with that language. Matlab is fundamentally a procedural language--it's fine for writing simple one-page programs, but it quickly becomes unmanageable when writing larger, more complex programs. I find that my Python code is much easier to read and organize. There is a Python interface (through Numpy and Scipy) to all of the classic numerical routines found on Netlib.org. The iPython interactive shell, combined with Matplotlib, provides interactive computing and high-quality plotting. I have not run into any situation in my research where I regretted using Python because some feature was not available, nor have I hit any critical bugs. I am using Python to write a Brownian dynamics simulator for interface chemistry. While the interpreted parts of a Python program run more slowly than C or Fortran, most of the time-consuming numerical operations are performed by compiled C or Fortran libraries, so the speed difference is not as much as you might think. Further, there are numerous ways to accelerate Python programs by integrating compiled code. Check out scipy.weave.inline, which lets you write C code right in the middle of a Python program to accelerate the slow parts. Tools like boost.python and f2py make it easy for you to write C and Fortran libraries for the slowest portions of your code. My development cycle is simple: quickly write everything in Python, profile the code to find what runs slowly, and write those parts in C or Fortran (if necessary). Overall, I'd recommend Python to another scientist because it's the fastest way to go from ideas to computational results.
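To give a flavor of weave, here is the sort of thing I mean -- a minimal sketch from memory (untested; the sum is just a stand-in for a genuinely slow inner loop):

import numpy as np
from scipy import weave

arr = np.arange(10.0)
n = int(arr.size)
code = """
double total = 0.0;
for (int i = 0; i < n; ++i) {
    total += arr[i];   /* arr is exposed to the C code as a data pointer */
}
return_val = total;
"""
total = weave.inline(code, ['arr', 'n'])  # compiled on first call, cached afterwards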
Craig

From dwf at cs.toronto.edu Tue Jul 7 12:14:03 2009 From: dwf at cs.toronto.edu (David Warde-Farley) Date: Tue, 7 Jul 2009 12:14:03 -0400 Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;) In-Reply-To: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> Message-ID: <652A0D7E-168B-4355-ABDE-F40F7165664B@cs.toronto.edu>

On 7-Jul-09, at 10:03 AM, Vincent van Beveren wrote: > - In what aspects does SciPy excel, compared to say IDL or > Matlab? Vectorized operations are (almost) as fast. Depending on what BLAS you use, linear algebra can be just as fast. String processing in Python is so incredibly better than Matlab that the thought of doing string operations in Matlab almost makes me cry. Python has bindings to nearly everything -- a dozen different GUI toolkits to low-level hardware interfaces. Likewise there are a handful of different ways to interface your code to code written in other languages -- ctypes, swig, boost::python, cython for C/C++, PyObjC for Objective C, f2py for Fortran, scikits.mlabwrap for Matlab, rpy and rpy2 for R... > - In what ways does it allow a scientist to be more effective? As has been echoed here, writing large programs will be easier. Python is a real programming language and was designed that way from the ground up. Matlab was designed as an interactive environment first and programming language features were grafted on as an afterthought. > - How stable is it compared to other packages (bugs, computation)? I won't lie in that I occasionally run into bugs in SciPy, but one of three things usually happens: - I look at the code myself -- all of which is available to view and modify, not the case with Matlab -- and file a bug report and/or patch - I look at the code and realize it's not a bug but me misunderstanding something, at which point I might edit the documentation at http://docs.scipy.org/scipy/ to make it clearer if necessary.
- If the code is incomprehensible to me, I ask on the mailing lists and (either already or shortly thereafter) it's fixed by someone else in Subversion.

From bsouthey at gmail.com Tue Jul 7 12:39:48 2009 From: bsouthey at gmail.com (Bruce Southey) Date: Tue, 07 Jul 2009 11:39:48 -0500 Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;) In-Reply-To: <652A0D7E-168B-4355-ABDE-F40F7165664B@cs.toronto.edu> References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> <652A0D7E-168B-4355-ABDE-F40F7165664B@cs.toronto.edu> Message-ID: <4A537A54.2010601@gmail.com>

On 07/07/2009 11:14 AM, David Warde-Farley wrote: > On 7-Jul-09, at 10:03 AM, Vincent van Beveren wrote: > >> - In what aspects does SciPy excel, compared to say IDL or >> Matlab? >> > > Vectorized operations are (almost) as fast. Depending on what BLAS you > use, linear algebra can be just as fast. String processing in Python > is so incredibly better than Matlab that the thought of doing string > operations in Matlab almost makes me cry. Python has bindings to > nearly everything -- a dozen different GUI toolkits to low-level > hardware interfaces. Likewise there are a handful of different ways > to interface your code to code written in other languages -- ctypes, > swig, boost::python, cython for C/C++, PyObjC for Objective C, f2py > for Fortran, scikits.mlabwrap for Matlab, rpy and rpy2 for R... > I would add something that is not always obvious and is often overlooked: Python really integrates very different things into a single unified yet simple interface. As an example, you can very easily 'bolt on' web stuff to an existing piece of Python code to create an almost instant web application. (I have mixed linear algebra with html processing including parsing cgi and creating html tags and images to provide a couple of web apps.) So suddenly instead of saying I can solve this problem (and perhaps here is the code), you can just say 'go to this web site and do it'. Plus, I can easily fix bugs and add new code without the user doing anything. This is great publicity with grant agencies and the like because you can actually show what you have done and also provide some outreach to the community.
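The basic pattern is only a few lines -- a bare-bones sketch (WSGI just to keep it self-contained; the same idea works with CGI, TurboGears, Django, etc., and nothing ever touches a temporary file):

import io
import matplotlib
matplotlib.use('Agg')               # headless backend, fine on a server
import matplotlib.pyplot as plt

def plot_app(environ, start_response):
    fig = plt.figure()
    ax = fig.add_subplot(111)
    ax.plot([0, 1, 2], [3, 1, 4])
    buf = io.BytesIO()
    fig.savefig(buf, format='png')  # render the PNG straight into memory
    plt.close(fig)
    start_response('200 OK', [('Content-Type', 'image/png')])
    return [buf.getvalue()]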
As a shameless plug, Frontiers in Neuroinformatics had a special issue on Python: http://www.frontiersin.org/neuroinformatics/ http://www.frontiersin.org/neuroinformatics/specialtopics/8/ Bruce

From fperez.net at gmail.com Tue Jul 7 12:53:29 2009 From: fperez.net at gmail.com (Fernando Perez) Date: Tue, 7 Jul 2009 09:53:29 -0700 Subject: [SciPy-user] assign to diagonal values? In-Reply-To: <9457e7c80907040523g734aa3c0nab686af1476a731d@mail.gmail.com> References: <47A128A5.7010406@sci.utah.edu> <2bc7a5a50801302128g79da7ea6jc95ef9abb00b3785@mail.gmail.com> <9457e7c80907040523g734aa3c0nab686af1476a731d@mail.gmail.com> Message-ID:

Hey Stefan, 2009/7/4 Stéfan van der Walt : > 2009/7/1 Fernando Perez : >> If someone feels like reviewing this ticket: >> >> http://projects.scipy.org/numpy/attachment/ticket/1132/numpy-index-funcs.diff >> >> it has the above and a few more utilities, with docs and tests. > > Thanks! Reviewed and applied in r7101 through r7103. thanks for working on that ticket! BTW, looking at numpy/lib/index_tricks.py line 740: step = cumprod((1,)+a.shape[:-1]).sum() might be better written as suggested by Anand: step = 1+(cumprod(a.shape[:-1])).sum() which replaces a tuple concatenation by a simple numerical addition. Order-epsilon nit though. Thanks for the extra tests too! Best, f

From robince at gmail.com Tue Jul 7 12:55:48 2009 From: robince at gmail.com (Robin) Date: Tue, 7 Jul 2009 17:55:48 +0100 Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;) In-Reply-To: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> Message-ID:

On Tue, Jul 7, 2009 at 3:03 PM, Vincent van Beveren wrote: > - In what aspects does SciPy excel, compared to say IDL or Matlab? > > - In what ways does it allow a scientist to be more effective? > > - How usable is SciPy for Plasma physics, molecular dynamics and > nanolayer Surface and Interface Physics (the 3 main areas at Rijnhuizen)? > > - How stable is it compared to other packages (bugs, computation)?

I think it's probably been covered by others but some of the key things for me are: - how easy it is to extend performance critical portions in C or FORTRAN (cython, ctypes, f2py, weave etc.). http://www.scipy.org/PerformancePython may be slightly out of date but I think it's a great illustration. - how easy it is (relatively speaking) to employ parallel processing: http://www.scipy.org/ParallelProgramming . Using the multiprocessing module (available as a backport for 2.5) it is literally 2 lines to have a for loop parallelised over 8 cores (or 16 if I had them). In MATLAB anything more than 4 cores gets into really silly money (not usually included in site license). This is probably the major factor driving interest in Python from colleagues in my lab. Also of course there are parallel python, ipython etc. and python bindings to MPI, cuda, opencl etc. which I think would usually allow much quicker progress than working in C.
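Those 2 lines are roughly the following -- a sketch with a made-up work function:

from multiprocessing import Pool

def work(x):                # stands in for the body of the original loop
    return x ** 2

if __name__ == '__main__':
    pool = Pool()           # one worker per core by default
    results = pool.map(work, range(100))   # the parallelised for loop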
- just how nice it is to use; broadcasting... even just how nice it is to handle default arguments in functions and update code with new parameters in a backwards compatible way was a bit of a revelation for me after Matlab. - slices as views - in my work I usually have one long data set which I slice and view in different ways. In Matlab I think slices make copies (at least they used to - possibly it's changed now, but another thing is that you can never really know what Matlab's going to do, whereas in Python you really can control how memory is laid out and manipulated at a low level), but in Python all my different views point to the same data. cheers Robin

From dwf at cs.toronto.edu Tue Jul 7 13:22:35 2009 From: dwf at cs.toronto.edu (David Warde-Farley) Date: Tue, 7 Jul 2009 13:22:35 -0400 Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;) In-Reply-To: <4A537A54.2010601@gmail.com> References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> <652A0D7E-168B-4355-ABDE-F40F7165664B@cs.toronto.edu> <4A537A54.2010601@gmail.com> Message-ID: <1BB30B2C-4CE8-44C7-82FF-62E9EFF0FB7C@cs.toronto.edu>

On 7-Jul-09, at 12:39 PM, Bruce Southey wrote: > As an example, you can very easily 'bolt on' web stuff to an > existing piece of Python code to create an almost instant web > application. (I have mixed linear algebra with html processing > including parsing cgi and creating html tags and images to provide a > couple of web apps.) So suddenly instead of saying I can solve > this problem (and perhaps here is the code), you can just say 'go to > this web site and do it'. Plus, I can easily fix bugs and add new > code without the user doing anything. Indeed, I've done this too. Getting matplotlib to render PNGs for TurboGears/Django/your-favourite-web-framework is dead simple, too, so you can provide really quite fancy visualization in your web-based interface without having to do anything extra. David

From cimrman3 at ntc.zcu.cz Tue Jul 7 13:42:57 2009 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Tue, 07 Jul 2009 19:42:57 +0200 Subject: [SciPy-user] circumference in raster image Message-ID: <4A538921.9060800@ntc.zcu.cz>

Hi! Hoping that there are some raster image experts around: I need to compute an approximation of the circumference of an object (with smooth boundary) represented as a filled area in a 2d array. Can scipy.ndimage deal with that? thanks, r.

From zachary.pincus at yale.edu Tue Jul 7 14:34:04 2009 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Tue, 7 Jul 2009 14:34:04 -0400 Subject: [SciPy-user] circumference in raster image In-Reply-To: <4A538921.9060800@ntc.zcu.cz> References: <4A538921.9060800@ntc.zcu.cz> Message-ID:

Hi Robert, Basically, assuming the object is in a binarized array, you could use ndimage to do one iteration of erosion, giving you the same object but one pixel smaller. Then xor the eroded and original binary images to give an image where the single-pixel border around the object is 1 and the rest is zero; from here you can just sum the pixels to give a (very rough) perimeter value.
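In code that's roughly the following (untested sketch, with a toy disk standing in for your object):

import numpy as np
from scipy import ndimage

# toy binary object: a filled disk of radius 30
y, x = np.ogrid[-32:33, -32:33]
mask = x**2 + y**2 <= 30**2

eroded = ndimage.binary_erosion(mask)
border = mask ^ eroded     # the single-pixel boundary of the object
perimeter = border.sum()   # crude pixel count; compare 2*pi*30 ~ 188.5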
(Note of course that this doesn't account for the spacing between pixels being different on the diagonal than horizontal or vertical... for that you'd need some chain code things, which I think ndimage doesn't provide.) Personally, in situations like these -- especially when the original image is not binary and I'd need to threshold to get a binary image -- I usually run a marching-squares algorithm over the data to extract interpolated iso-intensity contours for a particular threshold; these contours are polygons with which it is easy to calculate fairly accurate perimeter, area, etc. values. I can send a C extension that does this very quickly, if desired. Zach

On Jul 7, 2009, at 1:42 PM, Robert Cimrman wrote: > Hi! > > Hoping that there are some raster image experts around: I need to > compute an approximation of the circumference of an object (with smooth > boundary) represented as a filled area in a 2d array. Can > scipy.ndimage > deal with that? > > thanks, > r. > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user

From seb.haase at gmail.com Tue Jul 7 16:22:47 2009 From: seb.haase at gmail.com (Sebastian Haase) Date: Tue, 7 Jul 2009 22:22:47 +0200 Subject: [SciPy-user] circumference in raster image In-Reply-To: References: <4A538921.9060800@ntc.zcu.cz> Message-ID:

On Tue, Jul 7, 2009 at 8:34 PM, Zachary Pincus wrote: > Hi Robert, > > Basically, assuming the object is in a binarized array, you could use > ndimage to do one iteration of erosion, giving you the same object but > one pixel smaller. Then xor the eroded and original binary images to > give an image where the single-pixel border around the object is 1 and > the rest is zero; from here you can just sum the pixels to give a > (very rough) perimeter value. (Note of course that this doesn't > account for the spacing between pixels being different on the diagonal > than horizontal or vertical... for that you'd need some chain code > things, which I think ndimage doesn't provide.) > > Personally, in situations like these -- especially when the original > image is not binary and I'd need to threshold to get a binary image -- > I usually run a marching-squares algorithm over the data to extract > interpolated iso-intensity contours for a particular threshold; these > contours are polygons with which it is easy to calculate fairly > accurate perimeter, area, etc. values. I can send a C extension that > does this very quickly, if desired. > > Zach > Hi Zach, I would also be interested in that code - if you don't mind sending me a copy .... Thanks, Sebastian

From zachary.pincus at yale.edu Tue Jul 7 16:56:32 2009 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Tue, 7 Jul 2009 16:56:32 -0400 Subject: [SciPy-user] circumference in raster image In-Reply-To: References: <4A538921.9060800@ntc.zcu.cz> Message-ID:

Hi all, Attached is find_contours, an extension module that finds all contours in a 2D array at a specified iso-value. The contour position is linearly interpolated between pixels. Let me know if you have any questions, etc. (Looking over the code, I'm surprised how well I documented and commented it... lucky break for anyone who wants to use or modify it, I guess.) It's GPL because it comes from a larger GPL'd project of mine, but just ask me and I'll send it under a different license.
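(For the perimeter part: once a contour comes back as an (N, 2) array of vertices, it is just the summed segment lengths -- e.g. a small helper along these lines, assuming the contour is closed:)

import numpy as np

def contour_perimeter(verts):
    """Perimeter of a closed contour given as an (N, 2) vertex array."""
    closed = np.vstack((verts, verts[:1]))  # repeat the first vertex
    d = np.diff(closed, axis=0)
    return np.hypot(d[:, 0], d[:, 1]).sum()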
Name: find_contours.zip Type: application/zip Size: 6411 bytes Desc: not available URL: From gruben at bigpond.net.au Tue Jul 7 19:03:54 2009 From: gruben at bigpond.net.au (Gary Ruben) Date: Wed, 08 Jul 2009 09:03:54 +1000 Subject: [SciPy-user] circumference in raster image In-Reply-To: <4A538921.9060800@ntc.zcu.cz> References: <4A538921.9060800@ntc.zcu.cz> Message-ID: <4A53D45A.1050908@bigpond.net.au> I recently sat in on a lecture on how to do this. In summary, there is no best way to do it - it's an open research problem. Search for "Best Line Unbiased Estimators" to find out more. If you just walk around the discretised 8-connected circumference and try to compute pi from it, it will converge to 8(sqrt(2)-1), which is close to pi and comes from the sum of sides of an octagon enclosing the circle (this makes sense because all the traversals can be ordered to lie on this octagon), so you need to be smarter, unless that's an acceptable uncertainty for your problem. Zach's suggestion to use find_contours is probably a good one, depending on how the contouring is done - certainly sounds like a promising approach, Gary R. Robert Cimrman wrote: > Hi! > > Hoping that there are some raster image experts around: I need to > compute an approximation of circumference of an object (with smooth > boundary) represented as a filled area in a 2d array. Can scipy.ndimage > deal with that? > > thanks, > r. From robert.kern at gmail.com Tue Jul 7 21:20:47 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 7 Jul 2009 20:20:47 -0500 Subject: [SciPy-user] Additional Kern-ification of zunzun.com In-Reply-To: <268756d30907040922k76cfdde1gc73cfc7cdbab4fcd@mail.gmail.com> References: <268756d30907040922k76cfdde1gc73cfc7cdbab4fcd@mail.gmail.com> Message-ID: <3d375d730907071820w16ba7aaeq518275d966569c4a@mail.gmail.com> On Sat, Jul 4, 2009 at 11:22, James Phillips wrote: > ODR is now one of the available fitting targets at http://zunzun.com, and > again my thanks to Robert Kern. Look on my works, ye mighty, and despair! Good work James! Thanks for the heads up. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From jh at physics.ucf.edu Tue Jul 7 21:36:29 2009 From: jh at physics.ucf.edu (Joe Harrington) Date: Tue, 07 Jul 2009 21:36:29 -0400 Subject: [SciPy-user] Bootstrap? In-Reply-To: (scipy-user-request@scipy.org) References: Message-ID: On Tue, Jul 7, 2009 at 6:28 AM, Joshua Stults wrote: > I was wondering if scipy had something similar to Octave/Matlab's > empricial_rnd(). ?Here's the blurb from Octave's help describing the > function: > > ?-- Function File: ?empirical_rnd (N, DATA) > ?-- Function File: ?empirical_rnd (DATA, R, C) > ?-- Function File: ?empirical_rnd (DATA, SZ) > ? ? Generate a bootstrap sample of size N from the empirical > ? ? distribution obtained from the univariate sample DATA. > > ? ? If R and C are given create a matrix with R rows and C columns. Or > ? ? if SZ is a vector, create a matrix of size SZ. > > So basically you pass it an array of data, and it returns bootstrap > samples (resampling from the array with replacement). > Be very careful and be certain you can derive the statistical justification for what you are doing when you use bootstrap. 
There are numerous cases in which bootstrapping will not give you the right answer, such as when fitting a function that has a parameter that is set in just a small subset of the data, because in some samples the subset may be omitted completely or in large part, admitting wildly wrong parameter values. While you didn't specify exactly what you are trying to do, for many problems Markov-Chain Monte Carlo is both better and faster, and is often easier to code. Plus, there is Python for it (pymc, I think). --jh-- From joshua.stults at gmail.com Tue Jul 7 22:07:15 2009 From: joshua.stults at gmail.com (Joshua Stults) Date: Tue, 7 Jul 2009 22:07:15 -0400 Subject: [SciPy-user] Bootstrap? In-Reply-To: References: Message-ID: Joe, Thanks for the tip. On Tue, Jul 7, 2009 at 9:36 PM, Joe Harrington wrote: > On Tue, Jul 7, 2009 at 6:28 AM, Joshua Stults wrote: > >> I was wondering if scipy had something similar to Octave/Matlab's >> empricial_rnd(). ?Here's the blurb from Octave's help describing the >> function: >> >> ?-- Function File: ?empirical_rnd (N, DATA) >> ?-- Function File: ?empirical_rnd (DATA, R, C) >> ?-- Function File: ?empirical_rnd (DATA, SZ) >> ? ? Generate a bootstrap sample of size N from the empirical >> ? ? distribution obtained from the univariate sample DATA. >> >> ? ? If R and C are given create a matrix with R rows and C columns. Or >> ? ? if SZ is a vector, create a matrix of size SZ. >> >> So basically you pass it an array of data, and it returns bootstrap >> samples (resampling from the array with replacement). >> > > Be very careful and be certain you can derive the statistical > justification for what you are doing when you use bootstrap. ?There > are numerous cases in which bootstrapping will not give you the right > answer, such as when fitting a function that has a parameter that is > set in just a small subset of the data, because in some samples the > subset may be omitted completely or in large part, admitting wildly > wrong parameter values. I was doing a toy problem with 0-1 data (1=success, 0=failure), estimating a reliability. So my statistic was just: sum(bootstrap_sample) / n. Does your criticism apply to bootstrapping the residuals too? I'd appreciate if you could point me towards any accessible (I'm not a statistician) references. > While you didn't specify exactly what you are > trying to do, for many problems Markov-Chain Monte Carlo is both > better and faster, and is often easier to code. ?Plus, there is Python > for it (pymc, I think). Could you give an example where it's easier to code an MCMC method? Doing a bootstrap is one or two lines of code in most high level languages (eg Matlab/Octave), and turns out Python too using the random indexing method that Josef and Ernest posted (of course you have to put it in an interpreted loop, which is not very scalable). > > --jh-- > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > Thanks again, I've been consistently impressed by the quality of responses on this list. -- Joshua Stults Website: http://j-stults.blogspot.com From jongisli at gmail.com Wed Jul 8 06:09:31 2009 From: jongisli at gmail.com (=?ISO-8859-1?Q?J=F3n_Egilsson?=) Date: Wed, 8 Jul 2009 10:09:31 +0000 Subject: [SciPy-user] Problems with the interpolation package Message-ID: <44cd5a960907080309t481ba009l36fa53d2dee3806e@mail.gmail.com> Hi there. I've now tried various functions in the scipy.interpolate package without getting far. 
*What I want to do:* I want to interpolate using spline interpolation, cubic spline interpolation to be more precise. 1. I want to be able to choose which points to interpolate (to choose between which points on the x-axis the cubic polynomials are). Then I'll have some polynomials on the form ax^3 + bx^2 + cx + d 2. Then I want to be able to extract the coefficients a,b,c,d from each polynomial because that's the only thing I want to keep. I don't want to keep an interpolation object of some sort. *Input:* Two lists, one representing the x-axis: [0..n] and another one with the function values: [y0, y1, y2, ... , yn] and the desired points I wish to interpolate between. *Output (preferably):* The coefficients (a,b,c,d) for every polynomial beetween the points I chose to interpolate between. Any help at all will be greatly appreciated. Cheers. J?n G?sli Egilsson. -------------- next part -------------- An HTML attachment was scrubbed... URL: From timmichelsen at gmx-topmail.de Wed Jul 8 07:48:22 2009 From: timmichelsen at gmx-topmail.de (Tim Michelsen) Date: Wed, 8 Jul 2009 11:48:22 +0000 (UTC) Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;) References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> <652A0D7E-168B-4355-ABDE-F40F7165664B@cs.toronto.edu> <4A537A54.2010601@gmail.com> <1BB30B2C-4CE8-44C7-82FF-62E9EFF0FB7C@cs.toronto.edu> Message-ID: Hello. David Warde-Farley cs.toronto.edu> writes: > On 7-Jul-09, at 12:39 PM, Bruce Southey wrote: > > As an example, you can very easily 'bolt on' web stuff to an > > existing peice of Python code to create an almost instant web > > application. [...] > Indeed, I've done this too. Getting matplotlib to render PNGs for > TurboGears/Django/your-favourite-web-framework is dead simple, too so > you can provide really quite fancy visualization in your web-based > interface without having to do anything extra. @Bruce, David Do you have a particular site that is one-line and available to the gereral public? I am intending something similar. Although it is not specially topic of this list, I am particulary interested in how you generate the matplotlib graphics on the fly. To you store them in a temporary directory or do create the files somehow on the fly? Would be nice to see your implementation. Thanks, Timmie From timmichelsen at gmx-topmail.de Wed Jul 8 07:52:39 2009 From: timmichelsen at gmx-topmail.de (Tim Michelsen) Date: Wed, 8 Jul 2009 11:52:39 +0000 (UTC) Subject: [SciPy-user] =?utf-8?q?Why_is_SciPy_better_than_Matlab_or_IDL=3F_?= =?utf-8?q?=3B=09=29_=28Vincent_van_Beveren=29?= References: <767369.32981.qm@web30401.mail.mud.yahoo.com> Message-ID: > do it in Python, even though I was much less familiar with that language. Matlab is fundamentally a > procedural language--it's fine for writing simple one-page programs, but it quickly becomes > unmanageable when writing larger, more complex programs > . I find that my Python code is much easier to read and organize. I have the same perception when I see R-project scripts. I think R is good for full-fledged and proven statistics. The better the Python interface to R is the better... From cimrman3 at ntc.zcu.cz Wed Jul 8 08:38:51 2009 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Wed, 08 Jul 2009 14:38:51 +0200 Subject: [SciPy-user] circumference in raster image In-Reply-To: References: <4A538921.9060800@ntc.zcu.cz> Message-ID: <4A54935B.4060209@ntc.zcu.cz> Hi Zach! 
Zachary Pincus wrote: > Hi all, > > Attached is find_contours, an extension module that finds all contours > in a 2D array at a specified iso-value. The contour position is linearly > interpolated between pixels. Let me know if you have any questions, etc. > (Looking over the code, I'm surprised how well I documented and > commented it... lucky break for anyone who wants to use or modify it, I > guess.) It's GPL because it comes from a larger GPL'd project of mine, > but just ask me and I'll send it under a different license. Thank you very much, I will try your code. This is the way I thought I might try (interpolating by a smooth function/polynomial), so it's cool I do not have to dive into it :-) I would like to use it within my BSD code [1], so it would be awesome if you relicensed the code to BSD. Thanks again, r. [1] http://github.com/rc/gensei/tree/master > On Jul 7, 2009, at 4:22 PM, Sebastian Haase wrote: > >> On Tue, Jul 7, 2009 at 8:34 PM, Zachary >> Pincus wrote: >>> Hi Robert, >>> >>> Basically, assuming the object is in a binarized array, you could use >>> ndimage to do one iteration of erosion, giving you the same object but >>> one pixel smaller. Then xor the eroded and original binary images to >>> give an image where the single-pixel border around the object is 1 and >>> the rest is zero; from here you can just sum the pixels to give a >>> (very rough) perimeter value. (Note of course that this doesn't >>> account for the spacing between pixels being different on the diagonal >>> than horizontal or vertical... for that you'd need some chain code >>> things, which I think ndimage doesn't provide.) >>> >>> Personally, in situations like these -- especially when the original >>> image is not binary and I'd need to threshold to get a binary image -- >>> I usually run a marching-squares algorithm over the data to extract >>> interpolated iso-intensity contours for a particular threshold; these >>> contours are polygons with which it is easy to calculate fairly >>> accurate perimeter, area, etc. values. I can send a C extension that >>> does this very quickly, if desired. >>> >>> Zach >>> >> Hi Zach, >> I would also be interested in that code - if you don't mind sending me >> a copy .... >> >> Thanks, >> Sebastian >> > > > > ------------------------------------------------------------------------ > > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From cimrman3 at ntc.zcu.cz Wed Jul 8 08:43:14 2009 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Wed, 08 Jul 2009 14:43:14 +0200 Subject: [SciPy-user] circumference in raster image In-Reply-To: References: <4A538921.9060800@ntc.zcu.cz> Message-ID: <4A549462.5060502@ntc.zcu.cz> Hi Zach, Zachary Pincus wrote: > Hi Robert, > > Basically, assuming the object is in a binarized array, you could use > ndimage to do one iteration of erosion, giving you the same object but > one pixel smaller. Then xor the eroded and original binary images to > give an image where the single-pixel border around the object is 1 and > the rest is zero; from here you can just sum the pixels to give a > (very rough) perimeter value. (Note of course that this doesn't > account for the spacing between pixels being different on the diagonal > than horizontal or vertical... for that you'd need some chain code > things, which I think ndimage doesn't provide.) 
Just for the record how I did it (before knowing about your code): I have used np.gradient to obtain the edges (yes, the image is binary) and counted the circumference like this:

# pixel_sizes = (width, height) of a pixel in some length units
grad0, grad1 = np.gradient(mask)
val0 = len(grad0[np.where(grad0)]) * pixel_sizes[0]
val1 = len(grad1[np.where(grad1)]) * pixel_sizes[1]
circumference = 0.5 * (val0 + val1)

A poor man's approach, but worked too.

cheers,
r.

From cimrman3 at ntc.zcu.cz  Wed Jul  8 08:45:23 2009
From: cimrman3 at ntc.zcu.cz (Robert Cimrman)
Date: Wed, 08 Jul 2009 14:45:23 +0200
Subject: [SciPy-user] circumference in raster image
In-Reply-To: <4A53D45A.1050908@bigpond.net.au>
References: <4A538921.9060800@ntc.zcu.cz> <4A53D45A.1050908@bigpond.net.au>
Message-ID: <4A5494E3.1010209@ntc.zcu.cz>

Thank you Gary, it's good to know the right keyword - my google searches were not successful but now I get lots of results :-)

cheers,
r.

Gary Ruben wrote:
> I recently sat in on a lecture on how to do this. In summary, there is
> no best way to do it - it's an open research problem.
> Search for "Best Linear Unbiased Estimators" to find out more.
> If you just walk around the discretised 8-connected circumference and
> try to compute pi from it, it will converge to 8(sqrt(2)-1), which is
> close to pi and comes from the sum of sides of an octagon enclosing the
> circle (this makes sense because all the traversals can be ordered to
> lie on this octagon), so you need to be smarter, unless that's an
> acceptable uncertainty for your problem.
>
> Zach's suggestion to use find_contours is probably a good one, depending
> on how the contouring is done - certainly sounds like a promising approach,
>
> Gary R.
>
> Robert Cimrman wrote:
>> Hi!
>>
>> Hoping that there are some raster image experts around: I need to
>> compute an approximation of circumference of an object (with smooth
>> boundary) represented as a filled area in a 2d array. Can scipy.ndimage
>> deal with that?
>>
>> thanks,
>> r.
>
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user

From cimrman3 at ntc.zcu.cz  Wed Jul  8 11:35:51 2009
From: cimrman3 at ntc.zcu.cz (Robert Cimrman)
Date: Wed, 08 Jul 2009 17:35:51 +0200
Subject: [SciPy-user] circumference in raster image
In-Reply-To: <4A54935B.4060209@ntc.zcu.cz>
References: <4A538921.9060800@ntc.zcu.cz> <4A54935B.4060209@ntc.zcu.cz>
Message-ID: <4A54BCD7.6020506@ntc.zcu.cz>

Robert Cimrman wrote:
> Hi Zach!
>
> Zachary Pincus wrote:
>> Hi all,
>>
>> Attached is find_contours, an extension module that finds all contours
>> in a 2D array at a specified iso-value. The contour position is linearly
>> interpolated between pixels. Let me know if you have any questions, etc.
>> (Looking over the code, I'm surprised how well I documented and
>> commented it... lucky break for anyone who wants to use or modify it, I
>> guess.) It's GPL because it comes from a larger GPL'd project of mine,
>> but just ask me and I'll send it under a different license.
>
> Thank you very much, I will try your code. This is the way I thought I
> might try (interpolating by a smooth function/polynomial), so it's cool
> I do not have to dive into it :-)

ok, I have tried find_contours() - it seems very suitable for my purposes!

r.
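Once a marching-squares routine such as find_contours returns a contour as an (N, 2) array of interpolated (row, col) points, the perimeter is just the summed segment lengths. A minimal sketch assuming that layout and a closed contour (an illustration only; the exact interface of Zach's module may differ):

import numpy as np

def contour_perimeter(contour, pixel_sizes=(1.0, 1.0)):
    """Perimeter of a closed contour given as an (N, 2) array of
    (row, col) points; assumes the first point is repeated at the end.
    pixel_sizes scales rows and columns to physical units."""
    scaled = contour * np.asarray(pixel_sizes)
    diffs = np.diff(scaled, axis=0)               # segment vectors
    return np.sqrt((diffs ** 2).sum(axis=1)).sum()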
From zachary.pincus at yale.edu  Wed Jul  8 11:47:32 2009
From: zachary.pincus at yale.edu (Zachary Pincus)
Date: Wed, 8 Jul 2009 11:47:32 -0400
Subject: [SciPy-user] circumference in raster image
In-Reply-To: <4A54BCD7.6020506@ntc.zcu.cz>
References: <4A538921.9060800@ntc.zcu.cz> <4A54935B.4060209@ntc.zcu.cz> <4A54BCD7.6020506@ntc.zcu.cz>
Message-ID: <11FF520A-D2FA-4198-97ED-650AEAE570FF@yale.edu>

>>> Attached is find_contours, an extension module that finds all contours
>>> in a 2D array at a specified iso-value. The contour position is linearly
>>> interpolated between pixels. Let me know if you have any questions, etc.
>>> (Looking over the code, I'm surprised how well I documented and
>>> commented it... lucky break for anyone who wants to use or modify it, I
>>> guess.) It's GPL because it comes from a larger GPL'd project of mine,
>>> but just ask me and I'll send it under a different license.
>>
>> Thank you very much, I will try your code. This is the way I thought I
>> might try (interpolating by a smooth function/polynomial), so it's cool
>> I do not have to dive into it :-)
>
> ok, I have tried find_contours() - it seems very suitable for my
> purposes!

Note of course that if the image starts out as binary, the exact same problems that Gary described will apply: the contour segments will all point in one of eight directions...

If the image originates as non-binary on the other hand, and instead of thresholding at value X, you find the iso-contours at value X, you might be able to get better values.

Probably worth doing some ground-truth testing with mockup images. I'd be interested in the results.

Zach

From cimrman3 at ntc.zcu.cz  Wed Jul  8 11:53:40 2009
From: cimrman3 at ntc.zcu.cz (Robert Cimrman)
Date: Wed, 08 Jul 2009 17:53:40 +0200
Subject: [SciPy-user] circumference in raster image
In-Reply-To: <11FF520A-D2FA-4198-97ED-650AEAE570FF@yale.edu>
References: <4A538921.9060800@ntc.zcu.cz> <4A54935B.4060209@ntc.zcu.cz> <4A54BCD7.6020506@ntc.zcu.cz> <11FF520A-D2FA-4198-97ED-650AEAE570FF@yale.edu>
Message-ID: <4A54C104.6080109@ntc.zcu.cz>

Zachary Pincus wrote:
>>>> Attached is find_contours, an extension module that finds all contours
>>>> in a 2D array at a specified iso-value. The contour position is linearly
>>>> interpolated between pixels. Let me know if you have any questions, etc.
>>>> (Looking over the code, I'm surprised how well I documented and
>>>> commented it... lucky break for anyone who wants to use or modify it, I
>>>> guess.) It's GPL because it comes from a larger GPL'd project of mine,
>>>> but just ask me and I'll send it under a different license.
>>>
>>> Thank you very much, I will try your code. This is the way I thought I
>>> might try (interpolating by a smooth function/polynomial), so it's cool
>>> I do not have to dive into it :-)
>>
>> ok, I have tried find_contours() - it seems very suitable for my
>> purposes!
>
> Note of course that if the image starts out as binary, the exact same
> problems that Gary described will apply: the contour segments will all
> point in one of eight directions...

It does start out as binary, so it's the best I can get, IMHO. Anyway, it is going to be better than my gradient-based approach. I will post here the results soon (soon in rc's time = next week).

> If the image originates as non-binary on the other hand, and instead
> of thresholding at value X, you find the iso-contours at value X, you
> might be able to get better values.

Maybe some decent blurring by ndimage filters would help, but I guess that the accuracy is going to be ok without that.

> Probably worth doing some ground-truth testing with mockup images. I'd
> be interested in the results.

Basically, all my images are mockup (and binary), I can, however, increase their resolution at will, as they correspond to slices of objects with an analytical description. I will do some accuracy tests.

r.
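The blurring idea is easy to prototype; a minimal sketch with an invented disc mask, an arbitrary sigma, and iso-level 0.5 (the disc makes the answer checkable against 2*pi*r):

import numpy as np
from scipy import ndimage

# hypothetical binary mask of a disc of radius 80
y, x = np.mgrid[:200, :200]
mask = ((x - 100) ** 2 + (y - 100) ** 2 < 80 ** 2).astype(float)

# blur so the 0.5 iso-contour is interpolated between pixels
smooth = ndimage.gaussian_filter(mask, sigma=2.0)
# feed `smooth` (instead of `mask`) to a contour finder at iso-value 0.5;
# the true circumference here is 2*pi*80 ~ 502.7, so accuracy can be checked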
From 3ukip0s02 at sneakemail.com  Wed Jul  8 11:36:47 2009
From: 3ukip0s02 at sneakemail.com (-)
Date: Wed, 8 Jul 2009 15:36:47 +0000 (UTC)
Subject: [SciPy-user] FFT-based convolution and correlation
Message-ID:

Is there a FFT-based 2D correlation function built into scipy? There are functions like:

scipy.signal.correlate2d - "the direct method implemented by convolveND will be slow for large data"
scipy.ndimage.correlate - "The array is correlated with the given kernel using exact calculation (i.e. not FFT)."
scipy.fftpack.convolve.convolve, which I don't really understand

Numarray had a correlate2d() function, with a 'fft=True' switch (http://structure.usc.edu/numarray/node61.html), but I guess numarray was folded into numpy, and I can't find if this function was included.

From Dharhas.Pothina at twdb.state.tx.us  Wed Jul  8 14:28:05 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Wed, 08 Jul 2009 13:28:05 -0500
Subject: [SciPy-user] Sort by first n columns in an array
Message-ID: <4A549EE5.63BA.009B.0@twdb.state.tx.us>

Hi,

I am reading in an array with np.genfromtxt and I need to sort it by the first 6 columns (ie year, month etc). I do not know the number of columns before reading the array in.

From the documentation for sorting it looks like I can use the 'order' keyword to do what I need if I have a structured array with names for each column. I haven't been able to work out how to convert the array I have to an ordered array though. Also as far as I can tell I can't set the dtype with field names in np.genfromtxt without previously knowing the number of columns in the data file.

I don't want to use the timeseries toolkit for this since it is on a machine that hasn't got that package installed.

I know I must be missing something simple. Any help is appreciated.

- dharhas

From oanjao at yahoo.com  Wed Jul  8 14:51:36 2009
From: oanjao at yahoo.com (Craig Finch)
Date: Wed, 8 Jul 2009 11:51:36 -0700 (PDT)
Subject: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension
Message-ID: <245284.53920.qm@web30405.mail.mud.yahoo.com>

Progress! I asked on the Intel forums, and they pointed out that I can unmangle the symbol name using "c++filt _ZNSt8ios_base4InitC1Ev". This corresponds to "std::ios_base::Init::Init()" which is in the C++ standard library. It appears that the code was not being linked with the C++ standard library, and I was able to prevent this error by adding "-lstdc++" to the linker command. At this point, I have working copies of Python, Numpy, and Scipy built entirely with Intel tools.

However, I had to "hard code" this flag into the linker command in the file numpy/distutils/intelccompiler.py. It just didn't work when I added it to the environment variable LDFLAGS (and yes, I double-checked the environment variable to make sure it was set correctly). Because of this, I'd say I found a hackish workaround more than a solution. Any suggestions about the "right" way to do this?

Second problem: Scipy now crashes when I try to run the unit tests. Below is the partial output when I attempt to run the unit tests.
I'm also going to ask about this on the Intel MKL forums, but I'd appreciate any suggestions from you about how to go about diagnosing this problem. Craig Python 2.5.4 (r254:67916, Jul 6 2009, 19:27:14) [GCC Intel(R) C++ gcc 4.1 mode] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> import scipy; scipy.test() Running unit tests for scipy NumPy version 1.3.0 NumPy is installed in /home/cfinch/lib/python2.5/site-packages/numpy SciPy version 0.7.0 SciPy is installed in /home/cfinch/lib/python2.5/site-packages/scipy Python version 2.5.4 (r254:67916, Jul 6 2009, 19:27:14) [GCC Intel(R) C++ gcc 4.1 mode] nose version 0.11.1 ... ...........................*** glibc detected *** python: free(): invalid next s ize (fast): 0x0000000010781a50 *** ======= Backtrace: ========= /lib64/libc.so.6[0x315326e8a0] /lib64/libc.so.6(cfree+0x8c)[0x3153271fbc] /home/cfinch/lib/python2.5/site-packages/numpy/core/multiarray.so[0x2aaaaebc6827 ] /home/cfinch/lib/python2.5/site-packages/scipy/linalg/flapack.so[0x2aaab702b927] /home/cfinch/lib/python2.5/site-packages/scipy/linalg/flapack.so[0x2aaab703937b] python(PyObject_Call+0x15)[0x417c65] python[0x4c9c07] python(PyEval_EvalFrameEx+0xe1e)[0x4c28a0] python[0x4ca369] python(PyEval_EvalFrameEx+0xe1e)[0x4c28a0] python(PyEval_EvalCodeEx+0x5a2)[0x4cb7ee] python[0x532ef5] python(PyObject_Call+0x15)[0x417c65] python(PyEval_EvalFrameEx+0x8ba)[0x4c233c] python[0x4caa0f] python(PyEval_EvalFrameEx+0xe1e)[0x4c28a0] python(PyEval_EvalCodeEx+0x5a2)[0x4cb7ee] python[0x532ef5] python(PyObject_Call+0x15)[0x417c65] python(PyEval_EvalFrameEx+0x8ba)[0x4c233c] python(PyEval_EvalCodeEx+0x5a2)[0x4cb7ee] python[0x532ef5] python(PyObject_Call+0x15)[0x417c65] python[0x421bcd] python(PyObject_Call+0x15)[0x417c65] python[0x48c28f] python(PyObject_Call+0x15)[0x417c65] python[0x4c9c07] python(PyEval_EvalFrameEx+0xe1e)[0x4c28a0] python[0x4caa0f] python(PyEval_EvalFrameEx+0xe1e)[0x4c28a0] python(PyEval_EvalCodeEx+0x5a2)[0x4cb7ee] python[0x532ef5] python(PyObject_Call+0x15)[0x417c65] python(PyEval_EvalFrameEx+0x8ba)[0x4c233c] python(PyEval_EvalCodeEx+0x5a2)[0x4cb7ee] python[0x532ef5] python(PyObject_Call+0x15)[0x417c65] python[0x421bcd] python(PyObject_Call+0x15)[0x417c65] python[0x48c28f] python(PyObject_Call+0x15)[0x417c65] python[0x4c9c07] python(PyEval_EvalFrameEx+0xe1e)[0x4c28a0] python(PyEval_EvalCodeEx+0x5a2)[0x4cb7ee] python[0x532ef5] python(PyObject_Call+0x15)[0x417c65] python(PyEval_EvalFrameEx+0x8ba)[0x4c233c] python(PyEval_EvalCodeEx+0x5a2)[0x4cb7ee] python[0x532ef5] python(PyObject_Call+0x15)[0x417c65] python[0x421bcd] python(PyObject_Call+0x15)[0x417c65] python[0x48c28f] python(PyObject_Call+0x15)[0x417c65] python[0x4c9c07] python(PyEval_EvalFrameEx+0xe1e)[0x4c28a0] python(PyEval_EvalCodeEx+0x5a2)[0x4cb7ee] python[0x532ef5] python(PyObject_Call+0x15)[0x417c65] python(PyEval_EvalFrameEx+0x8ba)[0x4c233c] python(PyEval_EvalCodeEx+0x5a2)[0x4cb7ee] python[0x532ef5] ======= Memory map: ======== 00400000-005e2000 r-xp 00000000 00:12 97412 /gpfs/fs0/work/cfinch/bin/python 007e2000-0081a000 rwxp 001e2000 00:12 97412 /gpfs/fs0/work/cfinch/bin/python 0081a000-00822000 rwxp 0081a000 00:00 0 0ede6000-10c04000 rwxp 0ede6000 00:00 0 40000000-40001000 ---p 40000000 00:00 0 40001000-40011000 rwxp 40001000 00:00 0 40011000-40012000 ---p 40011000 00:00 0 40012000-4040a000 rwxp 40012000 00:00 0 4040a000-4040b000 ---p 4040a000 00:00 0 4040b000-40803000 rwxp 4040b000 00:00 0 40803000-40804000 ---p 40803000 00:00 0 40804000-40bfc000 rwxp 40804000 00:00 0 40bfc000-40bfd000 
---p 40bfc000 00:00 0 40bfd000-40ff6000 rwxp 40bfd000 00:00 0 40ff6000-40ff7000 ---p 40ff6000 00:00 0 40ff7000-413f0000 rwxp 40ff7000 00:00 0 413f0000-413f1000 ---p 413f0000 00:00 0 413f1000-417ea000 rwxp 413f1000 00:00 0 417ea000-417eb000 ---p 417ea000 00:00 0 417eb000-41be4000 rwxp 417eb000 00:00 0 3152200000-315221a000 r-xp 00000000 08:03 131368 /lib64/ld-2.5.so 3152419000-315241a000 r-xp 00019000 08:03 131368 /lib64/ld-2.5.so 315241a000-315241b000 rwxp 0001a000 08:03 131368 /lib64/ld-2.5.so 3152600000-3152614000 r-xp 00000000 08:03 24085282 /usr/lib64/libz.so.1.2 .3 3152614000-3152813000 ---p 00014000 08:03 24085282 /usr/lib64/libz.so.1.2 .3 3152813000-3152814000 rwxp 00013000 08:03 24085282 /usr/lib64/libz.so.1.2 .3 3152a00000-3152a02000 r-xp 00000000 08:03 131373 /lib64/libdl-2.5.so 3152a02000-3152c02000 ---p 00002000 08:03 131373 /lib64/libdl-2.5.so 3152c02000-3152c03000 r-xp 00002000 08:03 131373 /lib64/libdl-2.5.so 3152c03000-3152c04000 rwxp 00003000 08:03 131373 /lib64/libdl-2.5.so 3152e00000-3152e35000 r-xp 00000000 08:03 24084785 /usr/lib64/libreadline .so.5.1 3152e35000-3153034000 ---p 00035000 08:03 24084785 /usr/lib64/libreadline .so.5.1 3153034000-315303c000 rwxp 00034000 08:03 24084785 /usr/lib64/libreadline .so.5.1 315303c000-315303d000 rwxp 315303c000 00:00 0 3153200000-3153344000 r-xp 00000000 08:03 131369 /lib64/libc-2.5.so 3153344000-3153544000 ---p 00144000 08:03 131369 /lib64/libc-2.5.so 3153544000-3153548000 r-xp 00144000 08:03 131369 /lib64/libc-2.5.so 3153548000-3153549000 rwxp 00148000 08:03 131369 /lib64/libc-2.5.so 3153549000-315354e000 rwxp 3153549000 00:00 0 3153600000-3153682000 r-xp 00000000 08:03 131374 /lib64/libm-2.5.so 3153682000-3153881000 ---p 00082000 08:03 131374 /lib64/libm-2.5.so 3153881000-3153882000 r-xp 00081000 08:03 131374 /lib64/libm-2.5.so 3153882000-3153883000 rwxp 00082000 08:03 131374 /lib64/libm-2.5.so 3153a00000-3153a15000 r-xp 00000000 08:03 131372 /lib64/libpthread-2.5.so 3153a15000-3153c14000 ---p 00015000 08:03 131372 /lib64/libpthread-2.5.so 3153c14000-3153c15000 r-xp 00014000 08:03 131372 /lib64/libpthread-2.5.so 3153c15000-3153c16000 rwxp 00015000 08:03 131372 /lib64/libpthread-2.5.so 3153c16000-3153c1a000 rwxp 3153c16000 00:00 0 3156e00000-3156e0d000 r-xp 00000000 08:03 131375 /lib64/libgcc_s-4.1.1-20070105.so.1 3156e0d000-315700c000 ---p 0000d000 08:03 131375 /lib64/libgcc_s-4.1.1-20070105.so.1 315700c000-315700d000 rwxp 0000c000 08:03 131375 /lib64/libgcc_s-4.1.1-20070105.so.1 3157200000-31572e6000 r-xp 00000000 08:03 24085283 /usr/lib64/libstdc++.so.6.0.8 31572e6000-31574e5000 ---p 000e6000 08:03 24085283 /usr/lib64/libstdc++.so.6.0.8 31574e5000-31574eb000 r-xp 000e5000 08:03 24085283 /usr/lib64/libstdc++.so.6.0.8 31574eb000-31574ee000 rwxp 000eb000 08:03 24085283 /usr/lib64/libstdc++.so.6.0.8 31574ee000-3157500000 rwxp 31574ee000 00:00 0 315c000000-315c125000 r-xp 00000000 08:03 131380 /lib64/libcrypto.so.0.9.8b 315c125000-315c324000 ---p 00125000 08:03 131380 /lib64/libcrypto.so.0.9.8b 315c324000-315c343000 rwxp 00124000 08:03 131380 /lib64/libcrypto.so.0.9.8b 315c343000-315c347000 rwxp 315c343000 00:00 0 315c400000-315c411000 r-xp 00000000 08:03 131378 /lib64/libresolv-2.5.so 315c411000-315c611000 ---p 00011000 08:03 131378 /lib64/libresolv-2.5.so 315c611000-315c612000 r-xp 00011000 08:03 131378 /lib64/libresolv-2.5.so 315c612000-315c613000 rwxp 00012000 08:03 131378 /lib64/libresolv-2.5.so 315c613000-315c615000 rwxp 315c613000 00:00 0 315d000000-315d002000 r-xp 00000000 08:03 Aborted ----- Original Message ---- From: 
Craig Finch
To: David Cournapeau; SciPy Users List
Sent: Monday, July 6, 2009 7:16:42 PM
Subject: Re: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension

I've been dealing with issues like this, on this one system, for a year now. At one point, I had a working combination of python, numpy, and scipy. I tried so many things that I can't really remember how I got to that point. I think I used gcc to build scipy 0.6 because I couldn't get it to work with icc, and I linked it with the Intel MKL successfully. However, when I upgraded to scipy-0.7.0 I am again facing this problem.

The latest thing I tried was to build both Numpy and Scipy with the Intel C and Fortran compilers (icc and ifort). Numpy builds fine and passes all its unit tests. Scipy builds without errors or warnings, but I still have problems when trying to import any Scipy module that depends on "sparse."

Build command:
python setup.py config --compiler=intel --fcompiler=intelem install --prefix=/home/cfinch 2>&1 > scipy_build_log.txt

Results:
>>> from scipy import sparse
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/__init__.py", line 6, in <module>
    from csr import *
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/csr.py", line 12, in <module>
    from sparsetools import csr_tocsc, csr_tobsr, csr_count_blocks, \
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/__init__.py", line 4, in <module>
    from csr import *
  File "/home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/csr.py", line 7, in <module>
    import _csr
ImportError: /home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/_csr.so: undefined symbol: _ZNSt8ios_base4InitD1Ev

When I use nm to look at the symbols in _csr.so, I see that these are undefined, along with several dozen other Python-related symbols:

...
00000000000d2810 W _ZNSt6vectorIxSaIxEE6resizeEmx
00000000000d2fdc W _ZNSt6vectorIySaIyEE6resizeEmy
                 U _ZNSt8ios_base4InitC1Ev
                 U _ZNSt8ios_base4InitD1Ev

I have no idea what this symbol is, or where it's supposed to come from. Has anyone seen an error like this before? I am going to see what kind of support I can get from Intel.

Craig

----- Original Message ----
From: Craig Finch
To: David Cournapeau; SciPy Users List
Sent: Sunday, July 5, 2009 2:30:18 PM
Subject: Re: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension

By "build log" I assume you mean the output to stderr and stdout when building Scipy. It's a huge amount of output, and I'm not sure exactly what to look for. Do you want me to post a particular section? Below is the output from sections where g++ is called. It seems to be linked with g++ everywhere.

One more potential problem area: I'm linking with the Intel Math Kernel Library (MKL) which was almost certainly built with Intel compilers. Could that be causing this problem?
Craig ----------------------- building 'scipy.interpolate._interpolate' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC compile options: '-Iscipy/interpolate/src -I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/interpolate/src/_interpolate.cpp g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/interpolate/src/_interpolate.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/interpolate/_interpolate.so ----------------------- building 'scipy.sparse.sparsetools._csr' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC creating build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/sparse/sparsetools/csr_wrap.cxx g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/csr_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_csr.so building 'scipy.sparse.sparsetools._csc' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/sparse/sparsetools/csc_wrap.cxx g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/csc_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_csc.so building 'scipy.sparse.sparsetools._coo' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/sparse/sparsetools/coo_wrap.cxx g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/coo_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_coo.so building 'scipy.sparse.sparsetools._bsr' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/sparse/sparsetools/bsr_wrap.cxx g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/bsr_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_bsr.so building 'scipy.sparse.sparsetools._dia' extension compiling C++ sources C compiler: g++ -fno-strict-aliasing -DNDEBUG -g -fwrapv -O3 -Wall -fPIC compile options: '-I/home/cfinch/lib/python2.5/site-packages/numpy/core/include -I/home/cfinch/include/python2.5 -c' g++: scipy/sparse/sparsetools/dia_wrap.cxx g++ -pthread -shared build/temp.linux-x86_64-2.5/scipy/sparse/sparsetools/dia_wrap.o -Lbuild/temp.linux-x86_64-2.5 -o build/lib.linux-x86_64-2.5/scipy/sparse/sparsetools/_dia.so ----------------------- ----- Original Message ---- From: David Cournapeau To: Craig Finch ; SciPy Users List Sent: Friday, July 3, 2009 10:21:28 PM Subject: Re: [SciPy-user] Scipy weave errors: undefined symbols when importing compiled extension On Sat, Jul 4, 2009 at 3:38 AM, Craig Finch wrote: > ImportError: /home/cfinch/lib/python2.5/site-packages/scipy/sparse/sparsetools/_csr.so: undefined symbol: __gxx_personality_v0 > > As far as I know, everything Python-related on this machine has now been 
> built with GNU tools, but something is still broken. One more thing--Python,
> Numpy, and Scipy are all locally installed in my /home/cfinch directory
> because the system Python is frozen at 2.4.

This is still a C++ problem, I have sometimes noticed this problem on some machines. Normally this is caused by a C++ object file being linked with gcc (instead of g++). Can you post the build log of scipy?

David

From dwf at cs.toronto.edu  Wed Jul  8 15:35:36 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Wed, 8 Jul 2009 15:35:36 -0400
Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;)
In-Reply-To:
References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> <652A0D7E-168B-4355-ABDE-F40F7165664B@cs.toronto.edu> <4A537A54.2010601@gmail.com> <1BB30B2C-4CE8-44C7-82FF-62E9EFF0FB7C@cs.toronto.edu>
Message-ID:

On 8-Jul-09, at 7:48 AM, Tim Michelsen wrote:

> Would be nice to see your implementation.

The way I did it was to do the plotting onto a FigureCanvasAgg and then printing it to a fake file using cStringIO.

from matplotlib.backends.backend_agg import FigureCanvasAgg
import matplotlib.pyplot as plt
import cStringIO

def plot(...):
    fig = plt.figure(...) # figure out size and such here
    canvas = FigureCanvasAgg(fig)
    fig.set_canvas(canvas) # I forget whether this is needed
    # Do some plotting commands here
    ...
    ...
    canvas.draw()
    imgdata = cStringIO.StringIO()
    canvas.print_png(imgdata)
    return imgdata.getvalue()

There's something very similar in the Matplotlib cookbook or the TurboGears cookbook, but I forget which.

David

From stefan at sun.ac.za  Wed Jul  8 16:22:33 2009
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Wed, 8 Jul 2009 22:22:33 +0200
Subject: [SciPy-user] FFT-based convolution and correlation
In-Reply-To:
References:
Message-ID: <9457e7c80907081322q7d3ec762m568796948087e9d6@mail.gmail.com>

2009/7/8 - <3ukip0s02 at sneakemail.com>:
> Is there a FFT-based 2D correlation function built into scipy? There are
> functions like:

Yes, try scipy.signal.fftconvolve.

Stéfan
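A note on getting correlation (rather than convolution) out of fftconvolve — an illustration added here, not part of Stéfan's reply: correlation is convolution with a flipped and, for complex data, conjugated kernel, so for 2D arrays:

import numpy as np
from scipy import signal

a = np.random.rand(256, 256)   # made-up image
b = np.random.rand(32, 32)     # made-up kernel

# FFT-based 2D cross-correlation: convolve with the reversed,
# conjugated kernel
corr = signal.fftconvolve(a, b[::-1, ::-1].conj(), mode='same')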
From trottier+pylist at gmail.com  Wed Jul  8 20:31:42 2009
From: trottier+pylist at gmail.com (Leo Trottier)
Date: Wed, 8 Jul 2009 17:31:42 -0700
Subject: [SciPy-user] Automatically making a dtype
In-Reply-To:
References:
Message-ID:

Hi,

So, perhaps I'm the only one, but I find using numpy dtypes can be a bit more troublesome than I typically expect from Python libraries. So I've gone ahead and written a little function that, given an "exemplar" (e.g., a row from your data set) will create a dtype based on it.

Anyone think that something like this should make it into the numpy/scipy distribution? Also, anyone want to improve the function so it can handle tuples, sub-arrays, etc?

Anyway, here it is:

def makeDType(exemplar):
    '''Return a dtype object based on the given list or dict *exemplar*

    This is a convenience function -- if you want to do anything sophisticated
    it's best to compose the dtype "by hand".

    If given a list, this will return a dtype with fields ordered in the same
    sequence as in exemplar.

    If given a dict, the field ordering will be alphabetical, based on the
    names of the fields.

    NB: any str example you give it should be the longest you can imagine,
    as the function will return a field based on that length.

    Usage:
    >>> makeDType(['a string', 4, 3.0, 3j, True, None, eval, u'asdf'])
    dtype([('f0', '|S8'), ('f1', '<i8'), ...])
    >>> makeDType(dict(a='0123',b=3.,c=4,d=True,e=3j,f=eval,g=None,h=u'asdf'))
    dtype([('a', '|S4'), ('b', '<f8'), ...])
    '''
    [...]

From: pgmdevlist at gmail.com (Pierre GM)
Subject: [SciPy-user] Automatically making a dtype
In-Reply-To:
References:
Message-ID: <6BB8CFC8-17E2-42CD-B6AA-6036AE017CA7@gmail.com>

On Jul 8, 2009, at 8:31 PM, Leo Trottier wrote:

> Hi,
>
> So, perhaps I'm the only one, but I find using numpy dtypes can be a bit
> more troublesome than I typically expect from Python libraries. So
> I've gone ahead and written a little function that, given an "exemplar" (e.g., a
> row from your data set) will create a dtype based on it.

Check np.genfromtxt, the portion of the code dealing w/ dtype is None (where the dtype is guessed from the data...)

> Anyone think that something like this should make it into the
> numpy/scipy distribution? Also, anyone want to improve the function
> so it can handle tuples, sub-arrays, etc?

Well, what are you up to these days ;) ?

From dwf at cs.toronto.edu  Wed Jul  8 22:37:13 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Wed, 8 Jul 2009 22:37:13 -0400
Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;)
In-Reply-To:
References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> <652A0D7E-168B-4355-ABDE-F40F7165664B@cs.toronto.edu> <4A537A54.2010601@gmail.com> <1BB30B2C-4CE8-44C7-82FF-62E9EFF0FB7C@cs.toronto.edu>
Message-ID:

On 8-Jul-09, at 7:48 AM, Tim Michelsen wrote:

> @Bruce, David
> Do you have a particular site that is online and
> available to the general public?
> I am intending something similar.

http://www.zunzun.com uses matplotlib extensively.

My own frankenstein creation has been superseded by a J2EE version designed by a whole team of programmers, unfortunately. :)

David

From neilcrighton at gmail.com  Wed Jul  8 19:14:36 2009
From: neilcrighton at gmail.com (Neil Crighton)
Date: Wed, 8 Jul 2009 23:14:36 +0000 (UTC)
Subject: [SciPy-user] Sort by first n columns in an array
References: <4A549EE5.63BA.009B.0@twdb.state.tx.us>
Message-ID:

Dharhas Pothina <Dharhas.Pothina at twdb.state.tx.us> writes:

> I am reading in an array with np.genfromtxt and I need to sort it by the first 6 columns (ie year, month etc). I
> do not know the number of columns before reading the array in.
>
> From the documentation for sorting it looks like I can use the 'order' keyword to do what I need if I have a
> structured array with names for each column. I haven't been able to work out how to convert the array I have
> to an ordered array though. Also as far as I can tell I can't set the dtype with field names in np.genfromtxt
> without previously knowing the number of columns in the data file.
>
> I don't want to use the timeseries toolkit for this since it is on a machine that hasn't got that package installed.
>
> I know I must be missing something simple. Any help is appreciated.
>
> - dharhas

Something like this should work:

data = np.genfromtxt(filename, dtype=None, usecols=(0,1,2),
                     names='year,month,day')
data.sort(order=['year', 'month', 'day'])

Neil

From Dharhas.Pothina at twdb.state.tx.us  Thu Jul  9 11:21:06 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Thu, 09 Jul 2009 10:21:06 -0500
Subject: [SciPy-user] Sort by first n columns in an array
In-Reply-To:
References: <4A549EE5.63BA.009B.0@twdb.state.tx.us>
Message-ID: <4A55C491.63BA.009B.0@twdb.state.tx.us>

>>> Neil Crighton <neilcrighton at gmail.com> 7/8/2009 6:14 PM >>>
> Something like this should work:
>
> data = np.genfromtxt(filename, dtype=None, usecols=(0,1,2),
>                      names='year,month,day')
> data.sort(order=['year', 'month', 'day'])

That works if I know the number of columns prior to reading the file. In this case I'm reading a file with an unknown number of columns.
I know that the first six columns are the date fields.

thanks

- dharhas

From hnry2k at hotmail.com  Thu Jul  9 12:51:53 2009
From: hnry2k at hotmail.com (Jorge E. Sanchez Sanchez)
Date: Thu, 9 Jul 2009 11:51:53 -0500
Subject: [SciPy-user] Is there any easy way to calculate derivatives from ndimage.spline_filter coeffs?
Message-ID:

Hi all,

I have interpolated a surface function with x, y, fvals using:

coeffs = ndimage.spline_filter(fvals)
newf = ndimage.map_coordinates(coeffs, coords, prefilter=False)

and I also need the x and y partial derivatives of fvals at the new interpolation points in coords. I suppose that there should be some command similar to bisplev (as in the next code lines) which allows one to calculate them, and I have been looking for it unsuccessfully, so I would appreciate it very much if somebody could help me with this.

#########################################################################
tck = interpolate.bisplrep(xn,yn,fvalsn)
newff = interpolate.bisplev(newx[:,0],newy[0,:],tck)
# THE PARTIAL DERIVATIVE WITH RESPECT TO x
newfdx = interpolate.bisplev(newx[:,0],newy[0,:],tck,dx=1,dy=0)
# THE PARTIAL DERIVATIVE WITH RESPECT TO y
newfdy = interpolate.bisplev(newx[:,0],newy[0,:],tck,dx=0,dy=1)
#########################################################################

I thank you in advance for your kind help

jorge
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
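One way to get those partials without a bisplev-style dx/dy switch is to difference the interpolant numerically. A minimal sketch with invented data and an arbitrary step eps — this approximates the spline's derivatives rather than evaluating them exactly:

import numpy as np
from scipy import ndimage

fvals = np.random.rand(50, 50)                   # hypothetical surface samples
coords = np.array([[10.3, 5.5], [20.7, 8.25]])   # shape (2, npts): row, col coords
coeffs = ndimage.spline_filter(fvals)

def interp(c):
    return ndimage.map_coordinates(coeffs, c, prefilter=False)

eps = 1e-5
off_x = np.array([[eps], [0.0]])
off_y = np.array([[0.0], [eps]])
# centered differences approximate the partial derivatives at coords
newfdx = (interp(coords + off_x) - interp(coords - off_x)) / (2 * eps)
newfdy = (interp(coords + off_y) - interp(coords - off_y)) / (2 * eps)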
From kpdere at verizon.net  Thu Jul  9 18:51:45 2009
From: kpdere at verizon.net (Ken Dere)
Date: Thu, 09 Jul 2009 18:51:45 -0400
Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;)
References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl>
Message-ID:

Vincent van Beveren wrote:

> Hello everyone,
>
> I'm an engineer at Rijnhuizen, which is a research institute for plasma
> physics. We use Python to drive one of our main research projects,
> however, in the scientific circles in this institute Python (and SciPy)
> are largely unknown. Time to change this, I think :). However, since I am
> an engineer and not a scientist, my viewpoint on Python and SciPy is
> more of an engineering perspective, like its Open Source, Free, modern
> language, functional programming, etc... I'm not entirely sure these are
> compelling arguments for a scientist to start working with Python (or
> at least not by themselves). So I was wondering what to say if I was to
> promote Python in the scientific community here at Rijnhuizen. So I have
> a few questions:
>
> - In what aspects does SciPy excel, compared to, say, IDL or matlab?
> - In what ways does it allow a scientist to be more effective?
> - How usable is SciPy for Plasma physics, molecular dynamics and
>   nanolayer Surface and Interface Physics (the 3 main areas at Rijnhuizen)?
> - How stable is it compared to other packages (bugs, computation)?
>
> Any comments and insights are welcome!
>
> Regards,
> Vincent
>
> PS. Posted in scipy-dev by accident.
> ___
> Ing. V. van Beveren
> Software Engineer, FOM Rijnhuizen
> E: V.vanBeveren at rijnhuizen.nl

The real question is "why is Python better than Matlab or IDL?"

I don't have an opinion on Matlab since I never used it. However, I used IDL professionally for about 20 years or so (just for reference, I am a solar physicist). A few years ago, having gotten some proficiency in Python, I pretty much dropped my usage of IDL and now use Python. This means python, ipython, numpy, scipy, matplotlib and whatever other modules I need.

One basic difference is that I can use it at home, on the road and at work without the need of an expensive license. Also, I enjoy using it.

I have seen some object-oriented implementations in IDL and they are pathetic. It may just be the way it was done but I don't know. The use of objects in Python is very straightforward.

Ken Dere

--
K. Dere

From erik.tollerud at gmail.com  Thu Jul  9 21:44:09 2009
From: erik.tollerud at gmail.com (Erik Tollerud)
Date: Thu, 9 Jul 2009 18:44:09 -0700
Subject: [SciPy-user] Problems with the interpolation package
In-Reply-To: <44cd5a960907080309t481ba009l36fa53d2dee3806e@mail.gmail.com>
References: <44cd5a960907080309t481ba009l36fa53d2dee3806e@mail.gmail.com>
Message-ID:

I'm not sure I fully understand your question, but I think scipy.interpolate.InterpolatedUnivariateSpline is what you want - fit your spline with that, then if you want to get the properties of the spline at some subset of points, just use those values.

Specifically, suppose you have your data as x and y vectors. Do

s = scipy.interpolate.InterpolatedUnivariateSpline(x,y)

This will subdivide the curve into some number of knots (n-k+1, to be exact), so get the knots with

ks = s.get_knots()

now, if you want the polynomial between, say, the 1st and 2nd knot, do

v = (ks[0]+ks[1])/2
derivs = s.derivatives(v)

and derivs will be an array of derivatives at that point, and that gives you the coefficients you want (if I am interpreting your question correctly).

On Wed, Jul 8, 2009 at 3:09 AM, Jón Egilsson wrote:
> Hi there.
>
> I've now tried various functions in the scipy.interpolate package without
> getting far.
>
> What I want to do:
>
> I want to interpolate using spline interpolation, cubic spline interpolation
> to be more precise.
>
> 1. I want to be able to choose which points to interpolate (to choose
> between which points on the x-axis the cubic polynomials are). Then I'll
> have some polynomials of the form ax^3 + bx^2 + cx + d
>
> 2. Then I want to be able to extract the coefficients a,b,c,d from each
> polynomial because that's the only thing I want to keep. I don't want to
> keep an interpolation object of some sort.
>
> Input:
>
> Two lists, one representing the x-axis: [0..n] and another one with the
> function values: [y0, y1, y2, ... , yn] and the desired points I wish to
> interpolate between.
>
> Output (preferably):
>
> The coefficients (a,b,c,d) for every polynomial between the points I chose
> to interpolate between.
>
> Any help at all will be greatly appreciated. Cheers.
>
> Jón Gísli Egilsson.
>
>
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>
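A short sketch of how those derivatives turn into the a, b, c, d coefficients the original poster asked for — this extends Erik's recipe; the sample data is invented. The derivatives at v are Taylor coefficients about v, which poly1d composition expands into a global cubic:

import numpy as np
from scipy.interpolate import InterpolatedUnivariateSpline

x = np.arange(10.0)
y = np.sin(x)                              # made-up sample data
s = InterpolatedUnivariateSpline(x, y, k=3)

ks = s.get_knots()
v = 0.5 * (ks[0] + ks[1])                  # midpoint of the first knot interval
d0, d1, d2, d3 = s.derivatives(v)

# Taylor polynomial about v: d3/6*(x-v)**3 + d2/2*(x-v)**2 + d1*(x-v) + d0
taylor = np.poly1d([d3 / 6.0, d2 / 2.0, d1, d0])
cubic = taylor(np.poly1d([1.0, -v]))       # compose with (x - v)
# coefficients of x**3, x**2, x, 1 (assumes the cubic term is nonzero,
# so that cubic.coeffs has length 4)
a, b, c, d = cubic.coeffs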
I have summarised your comments into a list, if you have any corrections, or additions, I would like to hear: - Completely Free: No ridiculous licenses or restrictions - Open source: o If there is a bug you can fix it yourself, or ask the community o You can always go back to the source - Stable: o Originally written by commercial company, before released opensource, o The base of EPD (Enthought Python Distribution) - Large user community: o well supported o actively developed - Easy to build arbitrary complex applications (IDL and Matlab make this more difficult) - As a language it: o has many modules out of the box o is very readable, and modular o easy parallel programming o well featured: ? has support for many data structures (Lists, Tuples, Sets, Dicts) ? Object Orientated (OO in IDL is a pain, Matlab: what objects?) ? has compact array manipulation notation, functional programming and support for complex numbers - Cross-platform (So is IDL? I don't know about Matlab) - Integration with many different things into a unified interface o has many additional modules which can be installed ? HDF5, Numpy, SciPy, Matplotlib (2D plotting), VTK (3D graphics) o Easy to interface with other languages (Fortran, C, R) - Complete applications for numerical computation build on SciPy freely available: o SAGE o Python XY Thanks so far, you all have been really helpful. Regards, Vincent ________________________________ From: scipy-user-bounces at scipy.org [mailto:scipy-user-bounces at scipy.org] On Behalf Of Vincent van Beveren Sent: dinsdag 7 juli 2009 16:03 To: 'scipy-user at scipy.org' Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;) Hello everyone, I'm an engineer at Rijnhuizen, which is a research institute for plasma physics. We use Python to drive one of our main research projects, however, in the scientific circles in this institute Python (and SciPy) are largely unknown. Time to change this, I think :). However, since I am an engineer and not a scientist I my viewpoint on Python and SciPy are more of an engineering perspective, like its Open Source, Free, modern language, functional programming, etc... I'm not entirely sure these are compelling arguments for a scientist to start working with Python (or atleast not on it self). So I was wondering, if I was to promote Python in the scientific community here at Rijnhuizen. So I have a few questions: - In what aspects does SciPy excel, compared to say IDL or matlab? - In what ways allows it a scientist to be more effective? - How usable is SciPy for Plasma physics, molucular dynamics and nanolayer Surface and Interface Physics (the 3 main areas at Rijnhuizen)? - How stable is it compared to other packages. (bugs, computation)? Any comments and insights are welcome! Regards, Vincent PS. Posted in scipy-dev by accident. ___ Ing. V. van Beveren Software Engineer, FOM Rijnhuizen E: V.vanBeveren at rijnhuizen.nl -------------- next part -------------- An HTML attachment was scrubbed... URL: From david at ar.media.kyoto-u.ac.jp Fri Jul 10 03:11:46 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Fri, 10 Jul 2009 16:11:46 +0900 Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;) In-Reply-To: <2926F4BC94217A43A2D21792DE8818931C5B245B2A@ex1.rijnh.nl> References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> <2926F4BC94217A43A2D21792DE8818931C5B245B2A@ex1.rijnh.nl> Message-ID: <4A56E9B2.1060505@ar.media.kyoto-u.ac.jp> Vincent van Beveren wrote: > > Thanks everyone for your comments. 
From david at ar.media.kyoto-u.ac.jp  Fri Jul 10 03:11:46 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Fri, 10 Jul 2009 16:11:46 +0900
Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;)
In-Reply-To: <2926F4BC94217A43A2D21792DE8818931C5B245B2A@ex1.rijnh.nl>
References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> <2926F4BC94217A43A2D21792DE8818931C5B245B2A@ex1.rijnh.nl>
Message-ID: <4A56E9B2.1060505@ar.media.kyoto-u.ac.jp>

Vincent van Beveren wrote:
>
> Thanks everyone for your comments. Because of the many responses I
> can't reply to everyone personally, but thank you for your input. I
> have summarised your comments into a list; if you have any
> corrections or additions, I would like to hear them:
>
> - Completely Free: No ridiculous licenses or restrictions
> - Open source:
>   o If there is a bug you can fix it yourself, or ask the community
>   o You can always go back to the source
> - Stable:
>   o Originally written by a commercial company, before being released as open source
>

It depends on which parts of the stack you are talking about. AFAIK, the original design for numpy was done ~15 years ago by Jim Hugunin, and scipy was born as an open source project from the start. The "official" history can be found here for scipy:

http://www.scipy.org/History_of_SciPy

David

From neilcrighton at gmail.com  Fri Jul 10 07:26:20 2009
From: neilcrighton at gmail.com (Neil Crighton)
Date: Fri, 10 Jul 2009 11:26:20 +0000 (UTC)
Subject: [SciPy-user] Sort by first n columns in an array
References: <4A549EE5.63BA.009B.0@twdb.state.tx.us> <4A55C491.63BA.009B.0@twdb.state.tx.us>
Message-ID:

Dharhas Pothina <Dharhas.Pothina at twdb.state.tx.us> writes:

> >>> Neil Crighton <neilcrighton at gmail.com> 7/8/2009 6:14 PM >>>
> > Something like this should work:
> >
> > data = np.genfromtxt(filename, dtype=None, usecols=(0,1,2),
> >                      names='year,month,day')
> > data.sort(order=['year', 'month', 'day'])
>
> That works if I know the number of columns prior to reading the file. In this case I'm reading a file with an
> unknown number of columns. I know that the first six columns are the date fields.
>

If you know the first six columns are the date fields, can't you use

data = np.genfromtxt(filename, dtype=None, usecols=(0,1,2,3,4,5),
                     names='year,month,day,hour,min,sec')

? That doesn't make any assumption about the number of columns (as long as there's at least six of them).
Neil

_______________________________________________
SciPy-user mailing list
SciPy-user at scipy.org
http://mail.scipy.org/mailman/listinfo/scipy-user

From Dharhas.Pothina at twdb.state.tx.us  Fri Jul 10 08:26:11 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Fri, 10 Jul 2009 07:26:11 -0500
Subject: [SciPy-user] Sort by first n columns in an array
In-Reply-To:
References: <4A549EE5.63BA.009B.0@twdb.state.tx.us> <4A55C491.63BA.009B.0@twdb.state.tx.us>
Message-ID: <4A56ED12.63BA.009B.0@twdb.state.tx.us>

Yes, but I don't need just the first six columns sorted, I need the *entire* dataset (ie all columns) sorted by date.

- dharhas

>>> Neil Crighton <neilcrighton at gmail.com> 7/10/2009 6:26 AM >>>
Dharhas Pothina <Dharhas.Pothina at twdb.state.tx.us> writes:

> >>> Neil Crighton <neilcrighton at gmail.com> 7/8/2009 6:14 PM >>>
> > Something like this should work:
> >
> > data = np.genfromtxt(filename, dtype=None, usecols=(0,1,2),
> >                      names='year,month,day')
> > data.sort(order=['year', 'month', 'day'])
>
> That works if I know the number of columns prior to reading the file. In this case I'm reading a file with an
> unknown number of columns. I know that the first six columns are the date fields.
>

If you know the first six columns are the date fields, can't you use

data = np.genfromtxt(filename, dtype=None, usecols=(0,1,2,3,4,5),
                     names='year,month,day,hour,min,sec')

? That doesn't make any assumption about the number of columns (as long as there's at least six of them).

Neil

_______________________________________________
SciPy-user mailing list
SciPy-user at scipy.org
http://mail.scipy.org/mailman/listinfo/scipy-user

From neilcrighton at gmail.com  Fri Jul 10 09:45:58 2009
From: neilcrighton at gmail.com (Neil Crighton)
Date: Fri, 10 Jul 2009 13:45:58 +0000 (UTC)
Subject: [SciPy-user] Sort by first n columns in an array
References: <4A549EE5.63BA.009B.0@twdb.state.tx.us> <4A55C491.63BA.009B.0@twdb.state.tx.us> <4A56ED12.63BA.009B.0@twdb.state.tx.us>
Message-ID:

Dharhas Pothina <Dharhas.Pothina at twdb.state.tx.us> writes:

> Yes, but I don't need just the first six columns sorted, I need the *entire* dataset (ie all columns) sorted
> by date.

I see. You could use argsort instead of sort to get the indices that sort the array.

data = np.genfromtxt(filename, dtype=None, usecols=(0,1,2,3,4,5),
                     names='year,month,day,hour,min,sec')
ind = data.argsort(order='year month day hour min sec'.split())
alldata = np.genfromtxt(filename, dtype=None)
sorted_alldata = alldata[ind]

But it's probably better to let genfromtxt choose its own field names and then sort the whole thing using the first 6 fields.

alldata = np.genfromtxt(filename, dtype=None)
alldata.sort(order='f0 f1 f2 f3 f4 f5'.split())

Neil

From lou_boog2000 at yahoo.com  Fri Jul 10 09:47:03 2009
From: lou_boog2000 at yahoo.com (Lou Pecora)
Date: Fri, 10 Jul 2009 06:47:03 -0700 (PDT)
Subject: [SciPy-user] How good, stable, and usable is the Enthought Distribution?
Message-ID: <729229.11467.qm@web34402.mail.mud.yahoo.com>

I am thinking of buying an Enthought Package through my laboratory. I am familiar with some of the modules available in the package since I've used Python for 5 years in computational physics. But I'm wondering what others think of the Enthought package. I would be very interested in hearing from those with some experience with the package and the company.

Thanks for any information.

-- Lou Pecora, my views are my own.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From bblais at bryant.edu  Fri Jul 10 16:23:36 2009
From: bblais at bryant.edu (Brian Blais)
Date: Fri, 10 Jul 2009 16:23:36 -0400
Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;)
In-Reply-To: <2926F4BC94217A43A2D21792DE8818931C5B245B2A@ex1.rijnh.nl>
References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> <2926F4BC94217A43A2D21792DE8818931C5B245B2A@ex1.rijnh.nl>
Message-ID:

On Jul 10, 2009, at 3:20, Vincent van Beveren wrote:

> Thanks everyone for your comments. Because of the many responses I
> can't reply to everyone personally, but thank you for your input. I
> have summarised your comments into a list; if you have any
> corrections or additions, I would like to hear them:
>
> - Completely Free: No ridiculous licenses or restrictions
> - Open source:
>   o If there is a bug you can fix it yourself, or ask the community
>   o You can always go back to the source
> - Stable:
>   o Originally written by a commercial company, before being released as open source
>   o The base of EPD (Enthought Python Distribution)
> - Large user community:
>   o well supported
>   o actively developed
> - Easy to build arbitrarily complex applications (IDL and
>   Matlab make this more difficult)
> - As a language it:
>   o has many modules out of the box
>   o is very readable, and modular
>   o easy parallel programming
>   o well featured:
>     * has support for many data structures (Lists, Tuples, Sets, Dicts)
>     * Object Orientated (OO in IDL is a pain, Matlab: what objects?)
actually, Matlab has had objects for some time, but it feels very tacked-on.

>     * has compact array manipulation notation, functional
>       programming and support for complex numbers

well, so do both IDL and Matlab.

> - Cross-platform (So is IDL? I don't know about Matlab)

Matlab is cross-platform.

> - Integration with many different things into a unified interface
>   o has many additional modules which can be installed
>     * HDF5, Numpy, SciPy, Matplotlib (2D plotting), VTK (3D graphics)

much of the functionality of this is already in Matlab (can't speak for IDL). That's actually one of the pains of Python, that you have to install a whole lot of 3rd-party packages, whereas with Matlab it is all there. The Enthought Python Distribution, and Python XY go a long way to solve that, although 3d plotting seems easier in Matlab.

>   o Easy to interface with other languages (Fortran, C, R)
> - Complete applications for numerical computation built on
>   SciPy freely available:
>   o SAGE
>   o Python XY

I converted from Matlab to Python a few years ago. Much of it had to do with price and convenience (one example of Matlab craziness is the license manager, which always went berserk on me and kicked me out of my legitimate copy of Matlab). However, once I switched, there was another advantage that hadn't occurred to me. I could replace all of my shell scripts, perl scripts, etc... with python too. That way, I got a lot more fluid with the language with non-scientific apps. I could make a web-based gradebook, and easily manipulate directories of image files. I could parse html files (BeautifulSoup), and many other things. It opened up many more productive applications of the language than I had anticipated.

For scientific work, cython is an unbeatable tool for optimizing slow pieces of your code. It's like Matlab cmex files, but infinitely more convenient and powerful. I haven't run Matlab for over a year now, and I haven't really missed it.

Off my website, I have a presentation for Matlab users wanting to switch to python. It might be useful for you.

bb

--
Brian Blais
bblais at bryant.edu
http://web.bryant.edu/~bblais
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From xavier.gnata at gmail.com  Fri Jul 10 18:51:47 2009
From: xavier.gnata at gmail.com (Xavier Gnata)
Date: Sat, 11 Jul 2009 00:51:47 +0200
Subject: [SciPy-user] How good, stable, and usable is the Enthought Distribution?
In-Reply-To: <729229.11467.qm@web34402.mail.mud.yahoo.com>
References: <729229.11467.qm@web34402.mail.mud.yahoo.com>
Message-ID: <4A57C603.30103@gmail.com>

Have a look at pythonxy.
http://www.pythonxy.com/foreword.php

Xavier

> I am thinking of buying an Enthought Package through my laboratory. I
> am familiar with some of the modules available in the package since
> I've used Python for 5 years in computational physics. But I'm
> wondering what others think of the Enthought package. I would be very
> interested in hearing from those with some experience with the package
> and the company.
>
> Thanks for any information.
>
> -- Lou Pecora, my views are my own.
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>

From gael.varoquaux at normalesup.org  Fri Jul 10 18:54:42 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Sat, 11 Jul 2009 00:54:42 +0200
Subject: [SciPy-user] How good, stable, and usable is the Enthought Distribution?
In-Reply-To: <729229.11467.qm@web34402.mail.mud.yahoo.com>
References: <729229.11467.qm@web34402.mail.mud.yahoo.com>
Message-ID: <20090710225442.GF31151@phare.normalesup.org>

On Fri, Jul 10, 2009 at 06:47:03AM -0700, Lou Pecora wrote:
> I am thinking of buying an Enthought Package through my laboratory. I am
> familiar with some of the modules available in the package since I've used
> Python for 5 years in computational physics. But I'm wondering what
> others think of the Enthought package. I would be very interested in
> hearing from those with some experience with the package and the company.

> Thanks for any information.

I think you should simply try it out. You are allowed by the license to do this for one month.

I will not comment on the rest, as I am biased (positively), having worked at Enthought.

Gaël

From millman at berkeley.edu  Sat Jul 11 04:32:21 2009
From: millman at berkeley.edu (Jarrod Millman)
Date: Sat, 11 Jul 2009 01:32:21 -0700
Subject: [SciPy-user] ANN: SciPy 2009 early registration extended to July 17th
Message-ID:

The early registration deadline for SciPy 2009 has been extended for one week to July 17, 2009. Please register ( http://conference.scipy.org/to_register ) by this date to take advantage of the reduced early registration rate.

About the conference
--------------------

SciPy 2009, the 8th Python in Science conference, will be held from August 18-23, 2009 at Caltech in Pasadena, CA, USA. The conference starts with two days of tutorials on the scientific Python tools. There will be two tracks, one introducing the basic tools to beginners, and one for more advanced tools.

The tutorials will be followed by two days of talks. Both days of talks will begin with a keynote address. The first day's keynote will be given by Peter Norvig, the Director of Research at Google, while the second keynote will be delivered by Jon Guyer, a Materials Scientist in the Thermodynamics and Kinetics Group at NIST. The program committee will select the remaining talks from submissions to our call for papers. All selected talks will be included in our conference proceedings edited by the program committee. After the talks each day we will provide several rooms for impromptu birds of a feather discussions. Finally, the last two days of the conference will be used for a number of coding sprints on the major software projects in our community.

For the 8th consecutive year, the conference will bring together the developers and users of the open source software stack for scientific computing with Python. Attendees have the opportunity to review the available tools and how they apply to specific problems. By providing a forum for developers to share their Python expertise with the wider commercial, academic, and research communities, this conference fosters collaboration and facilitates the sharing of software components, techniques, and a vision for high level language use in scientific computing.

For further information, please visit the conference homepage: http://conference.scipy.org.
Important Dates
---------------

* Friday, July 3: Abstracts Due
* Wednesday, July 15: Announce accepted talks, post schedule
* Friday, July 17: Early Registration ends
* Tuesday-Wednesday, August 18-19: Tutorials
* Thursday-Friday, August 20-21: Conference
* Saturday-Sunday, August 22-23: Sprints
* Friday, September 4: Papers for proceedings due

Executive Committee
-------------------

* Jarrod Millman, UC Berkeley, USA (Conference Chair)
* Gaël Varoquaux, INRIA Saclay, France (Program Co-Chair)
* Stéfan van der Walt, University of Stellenbosch, South Africa (Program Co-Chair)
* Fernando Pérez, UC Berkeley, USA (Tutorial Chair)

From jdh2358 at gmail.com  Sat Jul 11 07:41:56 2009
From: jdh2358 at gmail.com (John Hunter)
Date: Sat, 11 Jul 2009 06:41:56 -0500
Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;)
In-Reply-To:
References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl> <652A0D7E-168B-4355-ABDE-F40F7165664B@cs.toronto.edu> <4A537A54.2010601@gmail.com> <1BB30B2C-4CE8-44C7-82FF-62E9EFF0FB7C@cs.toronto.edu>
Message-ID: <88e473830907110441i5adcfefbr736dec9e20629b0b@mail.gmail.com>

On Wed, Jul 8, 2009 at 2:35 PM, David Warde-Farley <dwf at cs.toronto.edu> wrote:
> On 8-Jul-09, at 7:48 AM, Tim Michelsen wrote:
>
>> Would be nice to see your implementation.
>
> The way I did it was to do the plotting onto a FigureCanvasAgg and
> then printing it to a fake file using cStringIO.
>
> from matplotlib.backends.backend_agg import FigureCanvasAgg
> import matplotlib.pyplot as plt
> import cStringIO
From lou_boog2000 at yahoo.com Sat Jul 11 08:24:09 2009
From: lou_boog2000 at yahoo.com (Lou Pecora)
Date: Sat, 11 Jul 2009 05:24:09 -0700 (PDT)
Subject: [SciPy-user] How good, stable, and usable is the Enthought Distribution?
Message-ID: <785491.66970.qm@web34405.mail.mud.yahoo.com>

Good idea. I'll do that.

One question. If I install it, does it stay separate from my present
python installation? I don't want to mess with that.

Thanks.

-- Lou P.

--- On Fri, 7/10/09, Gael Varoquaux wrote:

From: Gael Varoquaux
Subject: Re: [SciPy-user] How good, stable, and usable is the Enthought Distribution?
To: "SciPy Users List"
Date: Friday, July 10, 2009, 6:54 PM

On Fri, Jul 10, 2009 at 06:47:03AM -0700, Lou Pecora wrote:
> I am thinking of buying an Enthought Package through my laboratory. I am
[cut]

I think you should simply try it out. You are allowed by the license to
do this for one month.

From cohen at lpta.in2p3.fr Sat Jul 11 09:13:20 2009
From: cohen at lpta.in2p3.fr (Johann Cohen-Tanugi)
Date: Sat, 11 Jul 2009 15:13:20 +0200
Subject: [SciPy-user] How good, stable, and usable is the Enthought Distribution?
In-Reply-To: <785491.66970.qm@web34405.mail.mud.yahoo.com>
References: <785491.66970.qm@web34405.mail.mud.yahoo.com>
Message-ID: <4A588FF0.9050601@lpta.in2p3.fr>

Yes, it does, if you wish so.

Johann

Lou Pecora wrote:
> Good idea. I'll do that.
>
> One question. If I install it, does it stay separate from my present
> python installation? I don't want to mess with that.
>
> Thanks.
>
> -- Lou P.
>
> --- On Fri, 7/10/09, Gael Varoquaux wrote:
>
> From: Gael Varoquaux
> Subject: Re: [SciPy-user] How good, stable, and usable is the
> Enthought Distribution?
> To: "SciPy Users List"
> Date: Friday, July 10, 2009, 6:54 PM
>
> On Fri, Jul 10, 2009 at 06:47:03AM -0700, Lou Pecora wrote:
> > I am thinking of buying an Enthought Package through my
> laboratory. I am
> [cut]
>
> I think you should simply try it out. You are allowed by the license to
> do this for one month.

From xyz.account at arcor.de Sat Jul 11 09:55:53 2009
From: xyz.account at arcor.de (Martin)
Date: Sat, 11 Jul 2009 13:55:53 +0000 (UTC)
Subject: [SciPy-user] How to get sqrt(-1) = 1j per default?
Message-ID:

In an "ipython -pylab -p scipy" console I get

In [1]: sqrt(-1)
Out[1]: nan

but I would prefer 1j as the result. This is achieved by

In [2]: from scipy import *

After that I get

In [3]: sqrt(-1)
Out[3]: 1j

I found out that the ipython option -p scipy executes the startup script
Python25\Lib\site-packages\IPython\Extensions\ipy_profile_scipy.py
which contains

ip = IPython.ipapi.get()

try:
    ip.ex("import math,cmath")
    ip.ex("import numpy")
    ip.ex("import scipy")

    ip.ex("import numpy as np")
    ip.ex("import scipy as sp")

    ip.ex("from numpy import *")
    ip.ex("from scipy import *")
    print('... import * done')

Here the command "from scipy import *" is the last executed command, as
in my interactive session above. Why is this startup script not
sufficient to have sqrt(-1) = 1j right from the console start? What
should I do additionally to achieve this?
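A quick way to pin down which implementation a bare sqrt is bound to at
any point is an identity check against the two candidates; a small
diagnostic sketch (an illustration added here, not from the original
mail -- which check comes out True is exactly the import-order question
being asked):

import numpy as np
import numpy.lib.scimath as sm

print sqrt is np.sqrt   # True -> the real-valued (NaN-returning) sqrt is bound
print sqrt is sm.sqrt   # True -> the complex-valued (1j-returning) sqrt is bound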
From carlos.grohmann at gmail.com Sat Jul 11 17:01:31 2009
From: carlos.grohmann at gmail.com (Carlos "Guâno" Grohmann)
Date: Sat, 11 Jul 2009 18:01:31 -0300
Subject: [SciPy-user] finding values in a list?
Message-ID:

Sorry if this is too simple or even OT, but say I have a list like

[1,2,3,4,5,6,7,8,9]

and I want to get the values just below and above a given value.

So, if I enter 6.2, I get 6 and 7.

but how?

TIA

Carlos

-- 
Carlos Henrique Grohmann - Geologist D.Sc.
a.k.a. Guano - Linux User #89721
ResearcherID: A-9030-2008

http://digitalelevation.blogspot.com

http://www.igc.usp.br/pessoais/guano
_________________
Can't stop the signal.

From alan.mcintyre at gmail.com Sat Jul 11 17:21:02 2009
From: alan.mcintyre at gmail.com (Alan McIntyre)
Date: Sat, 11 Jul 2009 14:21:02 -0700
Subject: [SciPy-user] finding values in a list?
In-Reply-To:
References:
Message-ID: <1d36917a0907111421h1fd79d51l292d3a288332ebd@mail.gmail.com>

2009/7/11 Carlos "Guâno" Grohmann :
> Sorry if this is too simple or even OT, but say I have a list like
>
> [1,2,3,4,5,6,7,8,9]
>
> and I want to get the values just below and above a given value.
>
> So, if I enter 6.2, I get 6 and 7.
>
> but how?

There's probably more efficient ways to do it, but this works:

def get_nearest(a, v):
    '''Get the values in 'a' that bracket 'v'.  Assumes a is a list
    already sorted in ascending order.'''
    A = np.array(a)
    return A[A < v][-1], A[A > v][0]

get_nearest([1,2,3,4,5,6,7,8,9], 6.2)
[6, 7]

From alan.mcintyre at gmail.com Sat Jul 11 17:22:02 2009
From: alan.mcintyre at gmail.com (Alan McIntyre)
Date: Sat, 11 Jul 2009 14:22:02 -0700
Subject: [SciPy-user] finding values in a list?
In-Reply-To: <1d36917a0907111421h1fd79d51l292d3a288332ebd@mail.gmail.com>
References: <1d36917a0907111421h1fd79d51l292d3a288332ebd@mail.gmail.com>
Message-ID: <1d36917a0907111422v495633f3vbd38d814442c2eba@mail.gmail.com>

On Sat, Jul 11, 2009 at 2:21 PM, Alan McIntyre wrote:
> There's probably more efficient ways to do it, but this works:
>
> def get_nearest(a, v):
>     '''Get the values in 'a' that bracket 'v'.  Assumes a is a list
>     already sorted in ascending order.'''
>     A = np.array(a)
>     return A[A < v][-1], A[A > v][0]
>
> get_nearest([1,2,3,4,5,6,7,8,9], 6.2)
> [6, 7]

Sorry, I forgot to include an "import numpy as np" in there. :)

From josef.pktd at gmail.com Sat Jul 11 17:22:44 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Sat, 11 Jul 2009 17:22:44 -0400
Subject: [SciPy-user] finding values in a list?
In-Reply-To:
References:
Message-ID: <1cd32cbb0907111422m42d6a053x965523d7e17579d3@mail.gmail.com>

2009/7/11 Carlos "Guâno" Grohmann :
> Sorry if this is too simple or even OT, but say I have a list like
>
> [1,2,3,4,5,6,7,8,9]
>
> and I want to get the values just below and above a given value.
>
> So, if I enter 6.2, I get 6 and 7.
>
> but how?

maybe like this

>>> a = (np.array([1,2,3,4,5,6,7,8,9])-6.1)
>>> a[a>0].min() + 6.2
7.0
>>> a[a<0].max() + 6.2
6.0

Josef

> TIA
>
> Carlos
>
> --
> Carlos Henrique Grohmann - Geologist D.Sc.
> a.k.a. Guano - Linux User #89721
> ResearcherID: A-9030-2008
>
> http://digitalelevation.blogspot.com
>
> http://www.igc.usp.br/pessoais/guano
> _________________
> Can't stop the signal.
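A standard-library alternative for the same bracketing on a sorted list
is the bisect module; a minimal sketch (an aside added here, not from
the thread -- like the answers above, it leaves open what should happen
when the value is exactly in the list):

import bisect

def bracket(sorted_vals, v):
    # index of the first element >= v in the ascending list
    i = bisect.bisect_left(sorted_vals, v)
    return sorted_vals[i - 1], sorted_vals[i]

print bracket([1,2,3,4,5,6,7,8,9], 6.2)   # prints (6, 7)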
From josef.pktd at gmail.com Sat Jul 11 17:25:27 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Sat, 11 Jul 2009 17:25:27 -0400
Subject: [SciPy-user] finding values in a list?
In-Reply-To: <1cd32cbb0907111422m42d6a053x965523d7e17579d3@mail.gmail.com>
References: <1cd32cbb0907111422m42d6a053x965523d7e17579d3@mail.gmail.com>
Message-ID: <1cd32cbb0907111425v3eb581e3scfcd25473e5309e5@mail.gmail.com>

On Sat, Jul 11, 2009 at 5:22 PM, wrote:
> 2009/7/11 Carlos "Guâno" Grohmann :
>> Sorry if this is too simple or even OT, but say I have a list like
>>
>> [1,2,3,4,5,6,7,8,9]
>>
>> and I want to get the values just below and above a given value.
>>
>> So, if I enter 6.2, I get 6 and 7.
>>
>> but how?
>
> maybe like this
>>>> a = (np.array([1,2,3,4,5,6,7,8,9])-6.1)
>>>> a[a>0].min() + 6.2
> 7.0
>>>> a[a<0].max() + 6.2
> 6.0

except for the cut and paste error

>>> d = 6.2
>>> a = (np.array([1,7,8,9,2,3,4,5,6]) - d)
>>> a[a>0].min() + d
7.0
>>> a[a<0].max() + d
6.0

> Josef
>
>> TIA
>>
>> Carlos
>>
>> --
>> Carlos Henrique Grohmann - Geologist D.Sc.
>> a.k.a. Guano - Linux User #89721
>> ResearcherID: A-9030-2008
>>
>> http://digitalelevation.blogspot.com
>>
>> http://www.igc.usp.br/pessoais/guano
>> _________________
>> Can't stop the signal.

From jsseabold at gmail.com Sat Jul 11 17:29:29 2009
From: jsseabold at gmail.com (Skipper Seabold)
Date: Sat, 11 Jul 2009 17:29:29 -0400
Subject: [SciPy-user] finding values in a list?
In-Reply-To: <1cd32cbb0907111422m42d6a053x965523d7e17579d3@mail.gmail.com>
References: <1cd32cbb0907111422m42d6a053x965523d7e17579d3@mail.gmail.com>
Message-ID:

On Sat, Jul 11, 2009 at 5:22 PM, wrote:
> 2009/7/11 Carlos "Guâno" Grohmann :
>> Sorry if this is too simple or even OT, but say I have a list like
>>
>> [1,2,3,4,5,6,7,8,9]
>>
>> and I want to get the values just below and above a given value.
>>
>> So, if I enter 6.2, I get 6 and 7.
>>
>> but how?
>
> maybe like this
>>>> a = (np.array([1,2,3,4,5,6,7,8,9])-6.1)
>>>> a[a>0].min() + 6.2
> 7.0
>>>> a[a<0].max() + 6.2
> 6.0
>
> Josef

Yet another solution (though I too don't know if it's optimal). This
will work if you know that your values are always going to be within 1
as in your example.

>>> import numpy as np
>>> L = [1,2,3,4,5,6,7,8]
>>> A = np.array(L)
>>> index = np.where(np.abs((A-6.2))<1)
>>> A[index]
array([6, 7])

Skipper

for sorted lists, this might be fastest

>>> ind = np.searchsorted([1,2,3,4,5,6,7,8,9], 6.2)
>>> ind
6
>>> [1,2,3,4,5,6,7,8,9][ind-1]
6
>>> [1,2,3,4,5,6,7,8,9][ind]
7

It is unclear what should happen if the value is an element of the list.

Josef

From xavier.gnata at gmail.com Sat Jul 11 17:56:00 2009
From: xavier.gnata at gmail.com (Xavier Gnata)
Date: Sat, 11 Jul 2009 23:56:00 +0200
Subject: [SciPy-user] How to get sqrt(-1) = 1j per default?
In-Reply-To:
References:
Message-ID: <4A590A70.1070400@gmail.com>

Ouch!

numpy.sqrt(-1)
nan

scipy.sqrt(-1)
1j

IMHO, they should behave the same way.
Python provides us with math and cmath. It is a bit strange from a
mathematical point of view, but it is perfectly valid from a computer
science point of view.

Why numpy.sqrt(-1) != scipy.sqrt(-1)?
I know it would be hard to change that now... but still.

Xavier

> In an "ipython -pylab -p scipy" console I get
> In [1]: sqrt(-1)
> Out[1]: nan
> but I would prefer 1j as the result. This is achieved by
> In [2]: from scipy import *
> After that I get
> In [3]: sqrt(-1)
> Out[3]: 1j
>
> I found out that the ipython option -p scipy executes the startup script
> Python25\Lib\site-packages\IPython\Extensions\ipy_profile_scipy.py
> which contains
>
> ip = IPython.ipapi.get()
>
> try:
>     ip.ex("import math,cmath")
>     ip.ex("import numpy")
>     ip.ex("import scipy")
>
>     ip.ex("import numpy as np")
>     ip.ex("import scipy as sp")
>
>     ip.ex("from numpy import *")
>     ip.ex("from scipy import *")
>     print('... import * done')
>
> Here the command "from scipy import *" is the last executed command, as
> in my interactive session above. Why is this startup script not
> sufficient to have sqrt(-1) = 1j right from the console start? What
> should I do additionally to achieve this?

From robert.kern at gmail.com Sun Jul 12 02:01:25 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sun, 12 Jul 2009 01:01:25 -0500
Subject: [SciPy-user] How to get sqrt(-1) = 1j per default?
In-Reply-To: <4A590A70.1070400@gmail.com>
References: <4A590A70.1070400@gmail.com>
Message-ID: <3d375d730907112301x4c6eb1eeife76ae9b62465cd5@mail.gmail.com>

On Sat, Jul 11, 2009 at 16:56, Xavier Gnata wrote:
> Ouch!
>
> numpy.sqrt(-1)
> nan
>
> scipy.sqrt(-1)
> 1j
>
> IMHO, they should behave the same way.
> Python provides us with math and cmath. It is a bit strange from a
> mathematical point of view, but it is perfectly valid from a computer
> science point of view.
>
> Why numpy.sqrt(-1) != scipy.sqrt(-1)?
> I know it would be hard to change that now... but still.

As with most such things, the answer is "history". Old Numeric had only
the NaN behavior. scipy added functions with the 1j behavior.
When numpy was formed, it kept the NaN behavior for the functions in the
main numpy namespace and moved the 1j implementations from scipy into
numpy.lib.scimath. scipy kept the 1j behaviors for the functions it
exposes.

Personally, I suggest simply forgetting that the scipy namespace aliases
the numpy functions and only use the subpackages in scipy. Use the
functions from numpy or numpy.lib.scimath directly, as needed.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From xyz.account at arcor.de Sun Jul 12 12:22:37 2009
From: xyz.account at arcor.de (Martin Kirsch)
Date: Sun, 12 Jul 2009 16:22:37 +0000 (UTC)
Subject: [SciPy-user] How to get sqrt(-1) = 1j per default?
References: <4A590A70.1070400@gmail.com>
	<3d375d730907112301x4c6eb1eeife76ae9b62465cd5@mail.gmail.com>
Message-ID:

Robert Kern <robert.kern at gmail.com> writes:

> ... forgetting that the scipy namespace
> aliases the numpy functions and only use the subpackages in scipy. Use
> the functions from numpy or numpy.lib.scimath directly, as needed.

Ok, using only the scipy subpackages helps to decouple the
interdependencies between numpy and scipy (many pitfalls for newcomers).

But why it makes a difference doing "from scipy import *"
(1) per ipy_profile_scipy.py (ipython -pylab -p scipy) => sqrt(-1)= NaN
(2) per console input => sqrt(-1)= 1j
?

Originally I guessed that in both cases sqrt comes from scipy because of
the preceding "from scipy import *" in both cases, but case (1)
references numpy.sqrt despite "from scipy import *" in
ipy_profile_scipy.py and I don't know the reason.

From pav+sp at iki.fi Sun Jul 12 13:12:25 2009
From: pav+sp at iki.fi (Pauli Virtanen)
Date: Sun, 12 Jul 2009 17:12:25 +0000 (UTC)
Subject: [SciPy-user] How to get sqrt(-1) = 1j per default?
References: <4A590A70.1070400@gmail.com>
	<3d375d730907112301x4c6eb1eeife76ae9b62465cd5@mail.gmail.com>
Message-ID:

On 2009-07-12, Martin Kirsch wrote:
[clip...]
>> But why it makes a difference doing "from scipy import *"
>> (1) per ipy_profile_scipy.py (ipython -pylab -p scipy) => sqrt(-1)= NaN
>> (2) per console input => sqrt(-1)= 1j

Note that "from scipy import *" is precisely the thing that I am
suggesting that you should never do.

>> Originally I guessed that in both cases sqrt comes from scipy because of
>> the preceding "from scipy import *" in both cases, but case (1)
>> references numpy.sqrt despite "from scipy import *" in
>> ipy_profile_scipy.py and I don't know the reason.
>
> Perhaps the -pylab switch makes IPython run "from numpy import *",
> or something similar, *after* ipy_profile_scipy.py? I'd suppose that
> if you do "from scipy import *" in the IPython console, you'd get
> Scipy's version of sqrt.

Correct.

> Also, does the order of the -pylab and -p switches on the IPython
> command line make a difference?

I don't think so.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From humufr at yahoo.fr Sun Jul 12 15:42:11 2009
From: humufr at yahoo.fr (Nicolas Gruel)
Date: Sun, 12 Jul 2009 12:42:11 -0700 (PDT)
Subject: [SciPy-user] How to get sqrt(-1) = 1j per default?
In-Reply-To: <3d375d730907121110h2f9f7382se20e5b4c6672fcc8@mail.gmail.com>
Message-ID: <936733.65958.qm@web111411.mail.gq1.yahoo.com>

Why keep the numpy behavior (NaN)? If it's only for historical reasons
and it's mathematically better to have 1j, this behaviour should be
corrected. It was a limitation of Numeric, but numpy has been written to
extend and to remove the limitation.

H.

--- On Sun, 12.7.09, Robert Kern wrote:

> From: Robert Kern
> Subject: Re: [SciPy-user] How to get sqrt(-1) = 1j per default?
> To: "SciPy Users List"
> Date: Sunday, 12 July 2009, 14:10
> On Sun, Jul 12, 2009 at 12:12, Pauli
> Virtanen wrote:
> > On 2009-07-12, Martin Kirsch
> wrote:
> > [clip...]
> >> But why it makes a difference doing "from scipy
> import *"
> >> (1) per ipy_profile_scipy.py (ipython -pylab -p
> scipy) => sqrt(-1)= NaN
> >> (2) per console input => sqrt(-1)= 1j
>
> Note that "from scipy import *" is precisely the thing that
> I am
> suggesting that you should never do.
>
> >> Originally I guessed that in both cases sqrt comes
> from scipy because of the
> >> preceding "from scipy import *" in both cases, but
> case (1) references
> >> numpy.sqrt despite "from scipy import *" in
> ipy_profile_scipy.py and I don't
> >> know the reason.
> >
> > Perhaps the -pylab switch makes IPython run "from
> numpy import *",
> > or something similar, *after* ipy_profile_scipy.py?
> I'd suppose that
> > if you do "from scipy import *" in the IPython
> console, you'd get
> > Scipy's version of sqrt.
>
> Correct.
>
> > Also, does the order of the -pylab and -p switches on
> the IPython
> > command line make a difference?
>
> I don't think so.

From robert.kern at gmail.com Sun Jul 12 15:56:55 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sun, 12 Jul 2009 14:56:55 -0500
Subject: [SciPy-user] How to get sqrt(-1) = 1j per default?
In-Reply-To: <936733.65958.qm@web111411.mail.gq1.yahoo.com>
References: <3d375d730907121110h2f9f7382se20e5b4c6672fcc8@mail.gmail.com>
	<936733.65958.qm@web111411.mail.gq1.yahoo.com>
Message-ID: <3d375d730907121256i7e913512se9731f577552301b@mail.gmail.com>

On Sun, Jul 12, 2009 at 14:42, Nicolas Gruel wrote:
>
> Why keep the numpy behavior (NaN)? If it's only for historical reasons
> and it's mathematically better to have 1j, this behaviour should be
> corrected. It was a limitation of Numeric, but numpy has been written
> to extend and to remove the limitation.

Both behaviors are useful in different situations. It was not a
limitation of Numeric but rather a specific design decision. Quite
often, passing a negative number to sqrt() is an error that needs to be
detected. That's why the standard library has math.sqrt(-1), which
raises an exception, while cmath.sqrt(-1) returns 1j.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From dwf at cs.toronto.edu Sun Jul 12 17:03:32 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Sun, 12 Jul 2009 17:03:32 -0400
Subject: [SciPy-user] Why is SciPy better than Matlab or IDL? ;)
In-Reply-To: <88e473830907110441i5adcfefbr736dec9e20629b0b@mail.gmail.com>
References: <2926F4BC94217A43A2D21792DE8818931C5B245B29@ex1.rijnh.nl>
	<652A0D7E-168B-4355-ABDE-F40F7165664B@cs.toronto.edu>
	<4A537A54.2010601@gmail.com>
	<1BB30B2C-4CE8-44C7-82FF-62E9EFF0FB7C@cs.toronto.edu>
	<88e473830907110441i5adcfefbr736dec9e20629b0b@mail.gmail.com>
Message-ID: <4F392897-95CA-4ED8-B10D-951F8EAE90D4@cs.toronto.edu>

On 11-Jul-09, at 7:41 AM, John Hunter wrote:

> The code you posted is a hybrid of using pyplot and managing your own
> canvas creation, which can work under certain configurations but is
> brittle and not supported. This is mainly because pyplot creates and
> manages its own canvas instances.

Oops. I suppose it was an instance where I fiddled until it worked and
never gave it a second look. Thanks for the clarification John.

David

From xavier.gnata at gmail.com Sun Jul 12 17:32:11 2009
From: xavier.gnata at gmail.com (Xavier Gnata)
Date: Sun, 12 Jul 2009 23:32:11 +0200
Subject: [SciPy-user] How to get sqrt(-1) = 1j per default?
In-Reply-To: <3d375d730907121256i7e913512se9731f577552301b@mail.gmail.com>
References: <3d375d730907121110h2f9f7382se20e5b4c6672fcc8@mail.gmail.com>
	<936733.65958.qm@web111411.mail.gq1.yahoo.com>
	<3d375d730907121256i7e913512se9731f577552301b@mail.gmail.com>
Message-ID: <4A5A565B.1070806@gmail.com>

Robert Kern wrote:
> On Sun, Jul 12, 2009 at 14:42, Nicolas Gruel wrote:
>
>> Why keep the numpy behavior (NaN)? If it's only for historical reasons
>> and it's mathematically better to have 1j, this behaviour should be
>> corrected. It was a limitation of Numeric, but numpy has been written
>> to extend and to remove the limitation.
>
> Both behaviors are useful in different situations. It was not a
> limitation of Numeric but rather a specific design decision. Quite
> often, passing a negative number to sqrt() is an error that needs to be
> detected. That's why the standard library has math.sqrt(-1), which
> raises an exception, while cmath.sqrt(-1) returns 1j.

"It's mathematically better to have 1j" is wrong if it is written in the
doc that sqrt is a function from R to R.

Maybe what we would need here is a nice one page doc for the newcomers.
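The contrast such a page would have to spell out fits in a few lines; a
minimal side-by-side sketch (an illustration added here, not part of the
original mail, using the numpy.lib.scimath module Robert pointed to):

>>> import numpy as np
>>> import numpy.lib.scimath as sm
>>> np.sqrt(-1)   # real-valued sqrt: an out-of-domain input yields nan
nan
>>> sm.sqrt(-1)   # complex-valued sqrt: the input is promoted to complex
1j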
The page listing the basic functions in numpy/IDL/matlab is very useful
when you start with numpy/scipy. A clearly written warning saying that
"from scipy import *" is evil could also help a lot (if it contains
examples like "sqrt" showing *why* it is evil). OK, "import *" is not
good practice in python, and not only in numpy/scipy, but many users
discover python at the same time as they discover numpy/scipy.

"If a behaviour is clearly documented with a nice red warning, it is not
quite a bug ;)"

Xavier

From perfreem at gmail.com Sun Jul 12 17:51:04 2009
From: perfreem at gmail.com (per freem)
Date: Sun, 12 Jul 2009 17:51:04 -0400
Subject: [SciPy-user] nonzero and the matlab equivalent of find
Message-ID:

hi all,

i am trying to find an equivalent of matlab's find in scipy. the
documentation says to use nonzero but i am not sure how to interpret the
results.

suppose i have an array:

a = [[0, 1], [1, 1], [1, 0]]

and i want to return the indices (0, 1, or 2 in this case) of the
elements that are [1, 1]. the natural notation seems to be:

nonzero(a == array([1, 1]))

but this returns: (array([0, 1, 1, 2]), array([1, 0, 1, 0])) -- not sure
what this output means.

can someone please explain how to find the elements that are [1,1], like
in matlab's "find"? thank you.

From robert.kern at gmail.com Sun Jul 12 18:03:47 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sun, 12 Jul 2009 17:03:47 -0500
Subject: [SciPy-user] nonzero and the matlab equivalent of find
In-Reply-To:
References:
Message-ID: <3d375d730907121503p30b4a011gfd02bec7a58a1dac@mail.gmail.com>

On Sun, Jul 12, 2009 at 16:51, per freem wrote:
> hi all,
>
> i am trying to find an equivalent of matlab's find in scipy. the
> documentation says to use nonzero but i am not sure how to interpret the
> results.
>
> suppose i have an array:
>
> a = [[0, 1], [1, 1], [1, 0]]
>
> and i want to return the indices (0, 1, or 2 in this case) of the elements
> that are [1, 1]. the natural notation seems to be:
>
> nonzero(a == array([1, 1]))
>
> but this returns: (array([0, 1, 1, 2]), array([1, 0, 1, 0])) -- not sure
> what this output means.

Well, let's take a look at the intermediate value of a == array([1,1]):

In [24]: a = array([[0,1], [1,1], [1,0]])

In [25]: a == array([1, 1])
Out[25]:
array([[False,  True],
       [ True,  True],
       [ True, False]], dtype=bool)

The result of nonzero(), when given a multidimensional array, is to
return a tuple with the indices for each dimension such that:

In [26]: a[nonzero(a == array([1,1]))]
Out[26]: array([1, 1, 1, 1])

> can someone please explain how to find the elements that are [1,1], like in
> matlab's "find"? thank you.

There isn't really a function that does this out of the box, but here is
how to do it:

In [30]: nonzero((a[:,0] == 1) | (a[:,1] == 1))[0]
Out[30]: array([0, 1, 2])

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From robert.kern at gmail.com Sun Jul 12 18:05:59 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sun, 12 Jul 2009 17:05:59 -0500
Subject: [SciPy-user] nonzero and the matlab equivalent of find
In-Reply-To: <3d375d730907121503p30b4a011gfd02bec7a58a1dac@mail.gmail.com>
References: <3d375d730907121503p30b4a011gfd02bec7a58a1dac@mail.gmail.com>
Message-ID: <3d375d730907121505u2156a23em1b5fe45bcb47c9b1@mail.gmail.com>

On Sun, Jul 12, 2009 at 17:03, Robert Kern wrote:
> There isn't really a function that does this out of the box, but here is
> how to do it:
>
> In [30]: nonzero((a[:,0] == 1) | (a[:,1] == 1))[0]
> Out[30]: array([0, 1, 2])

Or more generally:

In [34]: nonzero(logical_or.reduce(a == array([1,1]), axis=1))[0]
Out[34]: array([0, 1, 2])

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From robertrobert93 at yahoo.com Mon Jul 13 02:56:34 2009
From: robertrobert93 at yahoo.com (Robert Robert)
Date: Sun, 12 Jul 2009 23:56:34 -0700 (PDT)
Subject: [SciPy-user] retrieve indexes of 2D array element
Message-ID: <941629.97046.qm@web59305.mail.re1.yahoo.com>

Hi all,
I have a 2D scipy array of integer. I can not figure out how to retrieve
the indexes i, j of a specific element in the array. I can loop through
the array in python and search for it, but I can not believe that scipy
does not have a highly optimized algorithm.
Thanks,
robert

From robertrobert93 at yahoo.com Mon Jul 13 02:58:07 2009
From: robertrobert93 at yahoo.com (Robert Robert)
Date: Sun, 12 Jul 2009 23:58:07 -0700 (PDT)
Subject: [SciPy-user] retrieve indexes of element in 2D scipy array
Message-ID: <131065.71917.qm@web59314.mail.re1.yahoo.com>

Hi all,
I have a 2D scipy array of integer. I can not figure out how to retrieve
the indexes i, j of a specific element in the array. I can loop through
the array in python and search for it, but I can not believe that scipy
does not have a highly optimized algorithm.
Thanks,
robert

From dwf at cs.toronto.edu Mon Jul 13 03:11:05 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Mon, 13 Jul 2009 03:11:05 -0400
Subject: [SciPy-user] retrieve indexes of 2D array element
In-Reply-To: <941629.97046.qm@web59305.mail.re1.yahoo.com>
References: <941629.97046.qm@web59305.mail.re1.yahoo.com>
Message-ID: <695927F2-F03D-4F13-A990-703EA47FE5C0@cs.toronto.edu>

On 13-Jul-09, at 2:56 AM, Robert Robert wrote:

> Hi all,
> I have a 2D scipy array of integer. I can not figure out how to
> retrieve the indexes i, j of a specific element in the array. I can
> loop through the array in python and search for it, but I can not
> believe that scipy does not have a highly optimized algorithm.
> Thanks,
> robert

By a specific element, do you mean a certain value? If so use a
boolean condition and where().

http://docs.scipy.org/doc/numpy/reference/generated/numpy.where.html

David

From robertrobert93 at yahoo.com Mon Jul 13 04:03:05 2009
From: robertrobert93 at yahoo.com (Robert Robert)
Date: Mon, 13 Jul 2009 01:03:05 -0700 (PDT)
Subject: [SciPy-user] retrieve indexes of 2D array element
Message-ID: <218899.69139.qm@web59308.mail.re1.yahoo.com>

Hi,
I've seen this function where, but it requires three inputs and I have
no clue how to use it. Is it possible to give me an example.
E.g. I have an array as

a = [ [1,2,3],
      [4,5,6],
      [7,8,9] ]

and I would like to retrieve the indexes of value 9, which are 2, 2.
How do I do that with scipy.where?
Thanks,
robert

--- On Mon, 7/13/09, David Warde-Farley wrote:

From: David Warde-Farley
Subject: Re: [SciPy-user] retrieve indexes of 2D array element
To: "SciPy Users List"
Date: Monday, July 13, 2009, 7:11 AM

On 13-Jul-09, at 2:56 AM, Robert Robert wrote:

> Hi all,
> I have a 2D scipy array of integer. I can not figure out how to
> retrieve the indexes i, j of a specific element in the array. I can
> loop through the array in python and search for it, but I can not
> believe that scipy does not have a highly optimized algorithm.
> Thanks,
> robert

By a specific element, do you mean a certain value? If so use a
boolean condition and where().

http://docs.scipy.org/doc/numpy/reference/generated/numpy.where.html

David

From d_l_goldsmith at yahoo.com Mon Jul 13 04:31:17 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Mon, 13 Jul 2009 01:31:17 -0700 (PDT)
Subject: [SciPy-user] sqrt(-1)
Message-ID: <368305.16353.qm@web52103.mail.re2.yahoo.com>

Ah, memories... ;-)

http://mail.scipy.org/pipermail/numpy-discussion/2006-October/011318.html

DG

--- On Mon, 7/13/09, scipy-user-request at scipy.org wrote:

> Today's Topics:
>
>    1. Re: How to get sqrt(-1) = 1j per default? (Pauli Virtanen)
>    2. Re: How to get sqrt(-1) = 1j per default? (Robert Kern)
>    3. Re: How to get sqrt(-1) = 1j per default? (Nicolas Gruel)
>    4. Re: How to get sqrt(-1) = 1j per default? (Robert Kern)
>    6. Re: How to get sqrt(-1) = 1j per default? (Xavier Gnata)

From dwf at cs.toronto.edu Mon Jul 13 07:26:35 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Mon, 13 Jul 2009 07:26:35 -0400
Subject: [SciPy-user] retrieve indexes of 2D array element
In-Reply-To: <218899.69139.qm@web59308.mail.re1.yahoo.com>
References: <218899.69139.qm@web59308.mail.re1.yahoo.com>
Message-ID: <68583489-BD7F-495D-AAE4-F85E886B8FEC@cs.toronto.edu>

It doesn't require 3 inputs. Two of them are optional. See the examples,
or this:

In [20]: A = array([[1,4,9],[3,5,6],[9,9,3]])

In [21]: rows, cols = where(A == 9)

In [22]: rows
Out[22]: array([0, 2, 2])

In [23]: cols
Out[23]: array([2, 0, 1])

A == 9 will produce a boolean array equal to True where the element is 9
and False otherwise. Use this as the first argument to where.

David

On 13-Jul-09, at 4:03 AM, Robert Robert wrote:

> Hi,
> I've seen this function where, but it requires three inputs and I
> have no clue how to use it. Is it possible to give me an example.
> E.g. I have an array as
>
> a = [ [1,2,3],
>       [4,5,6],
>       [7,8,9] ]
>
> and I would like to retrieve the indexes of value 9, which are 2, 2.
> How do I do that with scipy.where?
> Thanks,
> robert
>
> --- On Mon, 7/13/09, David Warde-Farley wrote:
>
> From: David Warde-Farley
> Subject: Re: [SciPy-user] retrieve indexes of 2D array element
> To: "SciPy Users List"
> Date: Monday, July 13, 2009, 7:11 AM
>
> On 13-Jul-09, at 2:56 AM, Robert Robert wrote:
>
> > Hi all,
> > I have a 2D scipy array of integer. I can not figure out how to
> > retrieve the indexes i, j of a specific element in the array.
> > I can loop through the array in python and search for it, but I can not
> > believe that scipy does not have a highly optimized algorithm.
>
> By a specific element, do you mean a certain value? If so use a
> boolean condition and where().
>
> http://docs.scipy.org/doc/numpy/reference/generated/numpy.where.html
>
> David

From stefan at sun.ac.za Mon Jul 13 07:30:49 2009
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Mon, 13 Jul 2009 13:30:49 +0200
Subject: [SciPy-user] retrieve indexes of element in 2D scipy array
In-Reply-To: <131065.71917.qm@web59314.mail.re1.yahoo.com>
References: <131065.71917.qm@web59314.mail.re1.yahoo.com>
Message-ID: <9457e7c80907130430v4ea54ea4pa765f9712fca6679@mail.gmail.com>

2009/7/13 Robert Robert :
> I have a 2D scipy array of integer. I can not figure out how to retrieve the
> indexes i, j of a specific element in the array. I can loop through the
> array in python and search for it, but I can not believe that scipy does not
> have a highly optimized algorithm.

Try

np.where(x == 3)

Stéfan

From welby555 at yahoo.com Mon Jul 13 08:14:13 2009
From: welby555 at yahoo.com (Afi Welbeck)
Date: Mon, 13 Jul 2009 05:14:13 -0700 (PDT)
Subject: [SciPy-user] plot vertical lines
Message-ID: <72226.27338.qm@web51612.mail.re2.yahoo.com>

Hi,

I'm a newbie. I'm stuck trying to link the following
points with vertical lines in the xy plane
(1,1) (1,4) and (3,2) (3,6)

Could anyone please help me with the code?
Thanks a great deal.
Harriet.

From amcmorl at gmail.com Mon Jul 13 08:31:17 2009
From: amcmorl at gmail.com (Angus McMorland)
Date: Mon, 13 Jul 2009 08:31:17 -0400
Subject: [SciPy-user] plot vertical lines
In-Reply-To: <72226.27338.qm@web51612.mail.re2.yahoo.com>
References: <72226.27338.qm@web51612.mail.re2.yahoo.com>
Message-ID:

2009/7/13 Afi Welbeck :
> Hi,
>
> I'm a newbie. I'm stuck trying to link the following
> points with vertical lines in the xy plane
> (1,1) (1,4) and (3,2) (3,6)
>
> Could anyone please help me with the code?

Here's a verbose way to do it, so you can see what's going on. You'll
need matplotlib installed for this, and it's the generally recommended
2-d graphics package to accompany scipy.

import matplotlib.pyplot as plt

x0 = [1,1]
y0 = [1,4]
x1 = [3,3]
y1 = [2,6]

fig = plt.figure()
ax = fig.add_subplot(111)
ax.plot(x0, y0, x1, y1)
ax.set_xlim([0, 4])
ax.set_ylim([0, 7])
plt.show()

Hoping that helps,

Angus.
-- 
AJC McMorland
Post-doctoral research fellow
Neurobiology, University of Pittsburgh

From silva at lma.cnrs-mrs.fr Mon Jul 13 08:47:40 2009
From: silva at lma.cnrs-mrs.fr (Fabrice Silva)
Date: Mon, 13 Jul 2009 14:47:40 +0200
Subject: [SciPy-user] plot vertical lines
In-Reply-To:
References: <72226.27338.qm@web51612.mail.re2.yahoo.com>
Message-ID: <1247489260.4358.7.camel@localhost.localdomain>

On Monday 13 July 2009 at 08:31 -0400, Angus McMorland wrote:
> 2009/7/13 Afi Welbeck :
> > Hi,
> >
> > I'm a newbie. I'm stuck trying to link the following
> > points with vertical lines in the xy plane
> > (1,1) (1,4) and (3,2) (3,6)
> >
> > Could anyone please help me with the code?
I was suggesting:

plt.axvline(1, ymin=1, ymax=4, trans=plt.gca().transData)
plt.axvline(3, ymin=2, ymax=6, trans=plt.gca().transData)

when I got the following message:

TypeError: type object got multiple values for keyword argument 'transform'

In fact, although the doc says axvline accepts the transform keyword
argument, there is no special handling of this kwarg in the axvline code
(in 0.98.5.3). There is a problem, since the axvline code (and axhline
and ax*span too, I suppose) adds a transform kwarg before adding the line
to the axes...
-- 
Fabrice Silva
Laboratory of Mechanics and Acoustics - CNRS
31 chemin Joseph Aiguier, 13402 Marseille, France.

From welby555 at yahoo.com Mon Jul 13 09:32:56 2009
From: welby555 at yahoo.com (Afi Welbeck)
Date: Mon, 13 Jul 2009 06:32:56 -0700 (PDT)
Subject: [SciPy-user] plot vertical lines
In-Reply-To:
References: <72226.27338.qm@web51612.mail.re2.yahoo.com>
Message-ID: <135317.46656.qm@web51608.mail.re2.yahoo.com>

Hi,

Thanks, it worked.
But I observed that for the values I gave you,
you altered them a bit before it plotted it just
the way I expected. Is there some formula
for this sort of plot?

________________________________
From: Angus McMorland
To: SciPy Users List
Sent: Monday, July 13, 2009 2:31:17 PM
Subject: Re: [SciPy-user] plot vertical lines

2009/7/13 Afi Welbeck :
> Hi,
>
> I'm a newbie. I'm stuck trying to link the following
> points with vertical lines in the xy plane
> (1,1) (1,4) and (3,2) (3,6)
>
> Could anyone please help me with the code?
>> >>>>> import numpy as np >>>>> L = [1,2,3,4,5,6,7,8] >>>>> A = np.array(L) >>>>> index = np.where(np.abs((A-6.2))<1) >>>>> A[index] >> array([6, 7]) >> >> Skipper >> _______________________________________________ >> SciPy-user mailing list >> SciPy-user at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > > for sorted lists, this might be fastest > >>>> ind = np.searchsorted([1,2,3,4,5,6,7,8,9], 6.2) >>>> ind > 6 >>>> [1,2,3,4,5,6,7,8,9][ind-1] > 6 >>>> [1,2,3,4,5,6,7,8,9][ind] > > It is unclear what should happen if the values is an element of the list. > > Josef > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -- Carlos Henrique Grohmann - Geologist D.Sc. a.k.a. Guano - Linux User #89721 ResearcherID: A-9030-2008 http://digitalelevation.blogspot.com http://www.igc.usp.br/pessoais/guano _________________ Can?t stop the signal. From Dharhas.Pothina at twdb.state.tx.us Mon Jul 13 10:26:35 2009 From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina) Date: Mon, 13 Jul 2009 09:26:35 -0500 Subject: [SciPy-user] Sort by first n columns in an array In-Reply-To: References: <4A549EE5.63BA.009B.0@twdb.state.tx.us> <4A55C491.63BA.009B.0@twdb.state.tx.us> <4A56ED12.63BA.009B.0@twdb.state.tx.us> Message-ID: <4A5AFDCB.63BA.009B.0@twdb.state.tx.us> > But it's probably better to let genfromtxt choose its own field names and then > sort the whole thing using the first 6 fields. > > alldata = np.genfromtxt(filename, dtype=None) > alldata.sort(order='f0 f1 f2 f3 f4 f5'.split()) Great, this is exactly what I needed. I had thought of the first approach but didn't want to have to read the file in twice. thanks, - dharhas From amcmorl at gmail.com Mon Jul 13 10:49:00 2009 From: amcmorl at gmail.com (Angus McMorland) Date: Mon, 13 Jul 2009 10:49:00 -0400 Subject: [SciPy-user] plot vertical lines In-Reply-To: <135317.46656.qm@web51608.mail.re2.yahoo.com> References: <72226.27338.qm@web51612.mail.re2.yahoo.com> <135317.46656.qm@web51608.mail.re2.yahoo.com> Message-ID: 2009/7/13 Afi Welbeck : > Hi, > Thanks it worked. > But I observed that for the values I gave you, > you altered them a bit before it plotted it just > they way I expected. Is there some formula > for this sort of plot? I take your question as asking what the convention is for the co-ordinate listing in the plot command (?). As I tried to indicate with the variable names, the x-values and y-values are listed separately, and multiple lines can be listed sequentially, as in line0_x, line0_y, line1_x, line1_y... so the first variable is the x values of the first line, the second is the corresponding y values of the first line and then the same for the second line. If you're using ipython (highly recommended), then you could find this sort of information by typing: In [1]: ax.plot? or In [1]: plt.plot? which will bring up the docstring for the associated function. Angus. -- AJC McMorland Post-doctoral research fellow Neurobiology, University of Pittsburgh From mailanhilli at googlemail.com Mon Jul 13 11:38:49 2009 From: mailanhilli at googlemail.com (Matthias Hillenbrand) Date: Mon, 13 Jul 2009 17:38:49 +0200 Subject: [SciPy-user] 2D/3D visualization Message-ID: <67b3a51f0907130838i57cd782dk70b7950252c56812@mail.gmail.com> Hello, In the new future I want to build a small GUI for my optical raytracing program written in python. For this purpose I intend to use wxpython, perhaps in combination with ETS. 
I especially want to be able to create 2D figures like this one: http://www.optima-research.com/Software/Optical/Zemax/images/2DLAYOUT.gif and 3D figures like those: http://www.zemax.com/kb/content_images/presentation/Default%20Shaded%20Model%20of%20Double%20Gauss.gif http://www.zemax.com/kb/content_images/presentation/revised%20shaded%20model%20of%20Double%20Gauss.jpg Do you have any advice which software packages for 2D/3D visualization could be most appropriate for my purpose? In the near future I also want to work with NURBS, so it will be even better if the package supports rendering of NURBS. As this is my first programming project, I would prefer a high level package, where I only have to define the surfaces and don't have to care about how they are rendered. It would also be nice if I could zoom, pan, and rotate the figures. I have already looked at Matplotlib, Chaco, Mayavi, and PyOpenGL but don't know about further, perhaps more appropriate, alternatives. Thank you very much for your help! Matthias -------------- next part -------------- An HTML attachment was scrubbed... URL: From gokhansever at gmail.com Mon Jul 13 11:49:12 2009 From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_SEVER?=) Date: Mon, 13 Jul 2009 10:49:12 -0500 Subject: [SciPy-user] 2D/3D visualization In-Reply-To: <67b3a51f0907130838i57cd782dk70b7950252c56812@mail.gmail.com> References: <67b3a51f0907130838i57cd782dk70b7950252c56812@mail.gmail.com> Message-ID: <49d6b3500907130849y59a32319o202202dce0a3fa98@mail.gmail.com> On Mon, Jul 13, 2009 at 10:38 AM, Matthias Hillenbrand < mailanhilli at googlemail.com> wrote: > Hello, > > As this is my first programming project, I would prefer a high level > package, where I only have to define the surfaces and don't have to care > about how they are rendered. It would also be nice if I could zoom, pan, and > rotate the figures. > Although I don't know exactly about how Mayavi handles surfaces, its VTK engine handles rendering via OpenGL, and has all the other functionalities you are seeking. I am not a super user in Mayavi, but from my more than a couple months of usage experiences Mayavi would seem to be a great fit for your design needs. -- G?khan -------------- next part -------------- An HTML attachment was scrubbed... URL: From welby555 at yahoo.com Mon Jul 13 12:41:28 2009 From: welby555 at yahoo.com (Afi Welbeck) Date: Mon, 13 Jul 2009 09:41:28 -0700 (PDT) Subject: [SciPy-user] plot vertical lines In-Reply-To: References: <72226.27338.qm@web51612.mail.re2.yahoo.com> <135317.46656.qm@web51608.mail.re2.yahoo.com> Message-ID: <459568.89486.qm@web51605.mail.re2.yahoo.com> Hi, Thanks it worked. I realised my error. If I may ask, is there a way to put the pairs of points that make a vertical line into a list and then plot them so you have a number of vertical lines without having to list them separately? Thanks. H. A. Welbeck. ________________________________ From: Angus McMorland To: SciPy Users List Sent: Monday, July 13, 2009 4:49:00 PM Subject: Re: [SciPy-user] plot vertical lines 2009/7/13 Afi Welbeck : > Hi, > Thanks it worked. > But I observed that for the values I gave you, > you altered them a bit before it plotted it just > they way I expected. Is there some formula > for this sort of plot? I take your question as asking what the convention is for the co-ordinate listing in the plot command (?). 
As I tried to indicate with the variable names, the x-values and
y-values are listed separately, and multiple lines can be listed
sequentially, as in line0_x, line0_y, line1_x, line1_y... so the first
variable is the x values of the first line, the second is the
corresponding y values of the first line, and then the same for the
second line.

If you're using ipython (highly recommended), then you could find this
sort of information by typing:

In [1]: ax.plot?

or

In [1]: plt.plot?

which will bring up the docstring for the associated function.

Angus.
-- 
AJC McMorland
Post-doctoral research fellow
Neurobiology, University of Pittsburgh

From mailanhilli at googlemail.com Mon Jul 13 11:38:49 2009
From: mailanhilli at googlemail.com (Matthias Hillenbrand)
Date: Mon, 13 Jul 2009 17:38:49 +0200
Subject: [SciPy-user] 2D/3D visualization
Message-ID: <67b3a51f0907130838i57cd782dk70b7950252c56812@mail.gmail.com>

Hello,

In the near future I want to build a small GUI for my optical raytracing
program written in python. For this purpose I intend to use wxpython,
perhaps in combination with ETS. I especially want to be able to create
2D figures like this one:

http://www.optima-research.com/Software/Optical/Zemax/images/2DLAYOUT.gif

and 3D figures like those:

http://www.zemax.com/kb/content_images/presentation/Default%20Shaded%20Model%20of%20Double%20Gauss.gif
http://www.zemax.com/kb/content_images/presentation/revised%20shaded%20model%20of%20Double%20Gauss.jpg

Do you have any advice which software packages for 2D/3D visualization
could be most appropriate for my purpose? In the near future I also want
to work with NURBS, so it will be even better if the package supports
rendering of NURBS.

As this is my first programming project, I would prefer a high level
package, where I only have to define the surfaces and don't have to care
about how they are rendered. It would also be nice if I could zoom, pan,
and rotate the figures.

I have already looked at Matplotlib, Chaco, Mayavi, and PyOpenGL but
don't know about further, perhaps more appropriate, alternatives.

Thank you very much for your help!

Matthias

From gokhansever at gmail.com Mon Jul 13 11:49:12 2009
From: gokhansever at gmail.com (Gökhan SEVER)
Date: Mon, 13 Jul 2009 10:49:12 -0500
Subject: [SciPy-user] 2D/3D visualization
In-Reply-To: <67b3a51f0907130838i57cd782dk70b7950252c56812@mail.gmail.com>
References: <67b3a51f0907130838i57cd782dk70b7950252c56812@mail.gmail.com>
Message-ID: <49d6b3500907130849y59a32319o202202dce0a3fa98@mail.gmail.com>

On Mon, Jul 13, 2009 at 10:38 AM, Matthias Hillenbrand <
mailanhilli at googlemail.com> wrote:

> Hello,
>
> As this is my first programming project, I would prefer a high level
> package, where I only have to define the surfaces and don't have to care
> about how they are rendered. It would also be nice if I could zoom, pan, and
> rotate the figures.
>

Although I don't know exactly how Mayavi handles surfaces, its VTK
engine handles rendering via OpenGL, and it has all the other
functionalities you are seeking. I am not a super user of Mayavi, but
from my more than a couple of months of usage experience, Mayavi would
seem to be a great fit for your design needs.

-- 
Gökhan

From welby555 at yahoo.com Mon Jul 13 12:41:28 2009
From: welby555 at yahoo.com (Afi Welbeck)
Date: Mon, 13 Jul 2009 09:41:28 -0700 (PDT)
Subject: [SciPy-user] plot vertical lines
In-Reply-To:
References: <72226.27338.qm@web51612.mail.re2.yahoo.com>
	<135317.46656.qm@web51608.mail.re2.yahoo.com>
Message-ID: <459568.89486.qm@web51605.mail.re2.yahoo.com>

Hi,

Thanks, it worked. I realised my error.
If I may ask, is there a way to put the
pairs of points that make a vertical line
into a list and then plot them so you have
a number of vertical lines without having
to list them separately?
Thanks.

H. A. Welbeck.

________________________________
From: Angus McMorland
To: SciPy Users List
Sent: Monday, July 13, 2009 4:49:00 PM
Subject: Re: [SciPy-user] plot vertical lines

2009/7/13 Afi Welbeck :
> Hi,
> Thanks, it worked.
> But I observed that for the values I gave you,
> you altered them a bit before it plotted it just
> the way I expected. Is there some formula
> for this sort of plot?

I take your question as asking what the convention is for the
co-ordinate listing in the plot command (?).

As I tried to indicate with the variable names, the x-values and
y-values are listed separately, and multiple lines can be listed
sequentially, as in line0_x, line0_y, line1_x, line1_y... so the first
variable is the x values of the first line, the second is the
corresponding y values of the first line, and then the same for the
second line.

If you're using ipython (highly recommended), then you could find this
sort of information by typing:

In [1]: ax.plot?

or

In [1]: plt.plot?

which will bring up the docstring for the associated function.

Angus.

From amcmorl at gmail.com Mon Jul 13 13:06:33 2009
From: amcmorl at gmail.com (Angus McMorland)
Date: Mon, 13 Jul 2009 13:06:33 -0400
Subject: [SciPy-user] plot vertical lines
In-Reply-To: <459568.89486.qm@web51605.mail.re2.yahoo.com>
References: <72226.27338.qm@web51612.mail.re2.yahoo.com>
	<135317.46656.qm@web51608.mail.re2.yahoo.com>
	<459568.89486.qm@web51605.mail.re2.yahoo.com>
Message-ID:

2009/7/13 Afi Welbeck :
> Hi,
> Thanks, it worked. I realised my error.
> If I may ask, is there a way to put the
> pairs of points that make a vertical line
> into a list and then plot them so you have
> a number of vertical lines without having
> to list them separately?
> Thanks.

You can construct a list of lists (or a 2d numpy array) that will do
this. In array terminology (which is easier to describe) the elements of
the first dimension are the points in each line, and the second
dimension gives multiple lines to plot, like so:

import matplotlib.pyplot as plt

xs = [[1,3],[1,3]]
ys = [[1,2],[4,6]]

fig = plt.figure()
ax = fig.add_subplot(111)
ax.plot(xs, ys)
ax.set_xlim([0, 4])
ax.set_ylim([0, 7])
plt.show()

The corresponding arrays would be created by substituting in:

import numpy as np
xs = np.array([[1,3],[1,3]])
ys = np.array([[1,2],[4,6]])

These are really matplotlib questions, so it would be best to move any
further questions on to the matplotlib list
(matplotlib-users at lists.sourceforge.net), which will make sure they
reach the most relevant audience.

Angus.
-- 
AJC McMorland
Post-doctoral research fellow
Neurobiology, University of Pittsburgh

From s.mientki at ru.nl Mon Jul 13 14:41:42 2009
From: s.mientki at ru.nl (Stef Mientki)
Date: Mon, 13 Jul 2009 20:41:42 +0200
Subject: [SciPy-user] 2D/3D visualization
In-Reply-To: <67b3a51f0907130838i57cd782dk70b7950252c56812@mail.gmail.com>
References: <67b3a51f0907130838i57cd782dk70b7950252c56812@mail.gmail.com>
Message-ID: <4A5B7FE6.4090000@ru.nl>

Take a look at VPython; here are some examples made with VPython:
http://www-ee.eng.hawaii.edu/~zqyun/caevp.html

cheers,
Stef

Matthias Hillenbrand wrote:
> Hello,
>
> In the near future I want to build a small GUI for my optical
> raytracing program written in python. For this purpose I intend to use
> wxpython, perhaps in combination with ETS.
> I especially want to be
> able to create 2D figures like this one:
>
> http://www.optima-research.com/Software/Optical/Zemax/images/2DLAYOUT.gif
>
> and 3D figures like those:
>
> http://www.zemax.com/kb/content_images/presentation/Default%20Shaded%20Model%20of%20Double%20Gauss.gif
> http://www.zemax.com/kb/content_images/presentation/revised%20shaded%20model%20of%20Double%20Gauss.jpg
>
> Do you have any advice which software packages for 2D/3D visualization
> could be most appropriate for my purpose? In the near future I also
> want to work with NURBS, so it will be even better if the package
> supports rendering of NURBS.
>
> As this is my first programming project, I would prefer a high level
> package, where I only have to define the surfaces and don't have to
> care about how they are rendered. It would also be nice if I could
> zoom, pan, and rotate the figures.
>
> I have already looked at Matplotlib, Chaco, Mayavi, and PyOpenGL but
> don't know about further, perhaps more appropriate, alternatives.
>
> Thank you very much for your help!
>
> Matthias

From rajanikanth at gmail.com Mon Jul 13 21:44:27 2009
From: rajanikanth at gmail.com (rajanikanth at gmail.com)
Date: Tue, 14 Jul 2009 01:44:27 +0000
Subject: [SciPy-user] python 2.6.2 mac os x (import signal causes a Bus Error)
Message-ID: <0016e649c9263fd92c046ea0943d@google.com>

Hi

Could somebody tell me how to prevent this:

>>> from scipy import signal
Bus error

Thanks,
Raj

From cournape at gmail.com Mon Jul 13 22:11:50 2009
From: cournape at gmail.com (David Cournapeau)
Date: Tue, 14 Jul 2009 11:11:50 +0900
Subject: [SciPy-user] python 2.6.2 mac os x (import signal causes a Bus Error)
In-Reply-To: <0016e649c9263fd92c046ea0943d@google.com>
References: <0016e649c9263fd92c046ea0943d@google.com>
Message-ID: <5b8d13220907131911h743b445bn2529439e00db5c20@mail.gmail.com>

On Tue, Jul 14, 2009 at 10:44 AM, wrote:
> Hi
>
> Could somebody tell me how to prevent this:
>
>>>> from scipy import signal
> Bus error

Which version of numpy and scipy are you using?

David

From rajanikanth at gmail.com Mon Jul 13 22:54:08 2009
From: rajanikanth at gmail.com (Rajanikanth Jammalamadaka)
Date: Mon, 13 Jul 2009 19:54:08 -0700
Subject: [SciPy-user] python 2.6.2 mac os x (import signal causes a Bus Error)
In-Reply-To: <5b8d13220907131911h743b445bn2529439e00db5c20@mail.gmail.com>
References: <0016e649c9263fd92c046ea0943d@google.com>
	<5b8d13220907131911h743b445bn2529439e00db5c20@mail.gmail.com>
Message-ID: <84bdef3c0907131954r8658f28x7a3dbd00dfe72614@mail.gmail.com>

On Mon, Jul 13, 2009 at 7:11 PM, David Cournapeau wrote:
> On Tue, Jul 14, 2009 at 10:44 AM, wrote:
>> Hi
>>
>> Could somebody tell me how to prevent this:
>>
>>>>> from scipy import signal
>> Bus error
>
> Which version of numpy and scipy are you using?

Hi David:

I am using
numpy version: 1.2.1
scipy version: 0.7.1

I tried it on another system (also Mac OS X) which had numpy version 1.3
and it seems to work fine on that one.

So, is it a version problem?
Thanks,

Raj

> David

-- 
Rajanikanth

From david at ar.media.kyoto-u.ac.jp Mon Jul 13 22:38:54 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Tue, 14 Jul 2009 11:38:54 +0900
Subject: [SciPy-user] python 2.6.2 mac os x (import signal causes a Bus Error)
In-Reply-To: <84bdef3c0907131954r8658f28x7a3dbd00dfe72614@mail.gmail.com>
References: <0016e649c9263fd92c046ea0943d@google.com>
	<5b8d13220907131911h743b445bn2529439e00db5c20@mail.gmail.com>
	<84bdef3c0907131954r8658f28x7a3dbd00dfe72614@mail.gmail.com>
Message-ID: <4A5BEFBE.7040006@ar.media.kyoto-u.ac.jp>

Rajanikanth Jammalamadaka wrote:
> I tried it on another system (also Mac OS X) which had numpy version
> 1.3 and it seems to work fine on that one.
>
> So, is it a version problem?
>

Yes - if you use the official scipy binary, you need to use the
corresponding official numpy binary (which is 1.3.0 for python 2.6). As
an alternative, you can build scipy by yourself against numpy - in this
case, you can use numpy 1.2.1.

cheers,

David

From rajanikanth at gmail.com Mon Jul 13 23:01:27 2009
From: rajanikanth at gmail.com (Rajanikanth Jammalamadaka)
Date: Mon, 13 Jul 2009 20:01:27 -0700
Subject: [SciPy-user] python 2.6.2 mac os x (import signal causes a Bus Error)
In-Reply-To: <4A5BEFBE.7040006@ar.media.kyoto-u.ac.jp>
References: <0016e649c9263fd92c046ea0943d@google.com>
	<5b8d13220907131911h743b445bn2529439e00db5c20@mail.gmail.com>
	<84bdef3c0907131954r8658f28x7a3dbd00dfe72614@mail.gmail.com>
	<4A5BEFBE.7040006@ar.media.kyoto-u.ac.jp>
Message-ID: <84bdef3c0907132001t18210683q9aea3a70b861d48f@mail.gmail.com>

On Mon, Jul 13, 2009 at 7:38 PM, David Cournapeau wrote:
> Rajanikanth Jammalamadaka wrote:
>> I tried it on another system (also Mac OS X) which had numpy version
>> 1.3 and it seems to work fine on that one.
>>
>> So, is it a version problem?
>>
>
> Yes - if you use the official scipy binary, you need to use the
> corresponding official numpy binary (which is 1.3.0 for python 2.6). As
> an alternative, you can build scipy by yourself against numpy - in this
> case, you can use numpy 1.2.1.

Thanks. I upgraded to numpy-1.3. It works fine now.

Raj

> cheers,
>
> David

-- 
Rajanikanth

From adrian.prw at gmail.com Tue Jul 14 10:56:33 2009
From: adrian.prw at gmail.com (Adrian Price-Whelan)
Date: Tue, 14 Jul 2009 10:56:33 -0400
Subject: [SciPy-user] Quick way to delete all 'values' from array
Message-ID: <31F448E2-6034-46D5-8231-4CDEE2ED836F@gmail.com>

Hey --

I'm just looking for the quickest way to remove all X from an array
[a,b,c,d,X,e,X,f,g,X] or it could be multidimensional, I suppose, but
that's the idea. I understand delete() will remove a value at a specific
index, but I was unsuccessful in combining this function with 'where' to
get what I want. Any suggestions?
Thanks,
-Adrian

From gokhansever at gmail.com  Tue Jul 14 11:07:17 2009
From: gokhansever at gmail.com (Gökhan SEVER)
Date: Tue, 14 Jul 2009 10:07:17 -0500
Subject: [SciPy-user] Quick way to delete all 'values' from array
In-Reply-To: <31F448E2-6034-46D5-8231-4CDEE2ED836F@gmail.com>
References: <31F448E2-6034-46D5-8231-4CDEE2ED836F@gmail.com>
Message-ID: <49d6b3500907140807m5fdef2d8h2e2941cbf1934bc0@mail.gmail.com>

On Tue, Jul 14, 2009 at 9:56 AM, Adrian Price-Whelan wrote:
>
> Hey --
>
> I'm just looking for the quickest way to remove all X from an array
> [a,b,c,d,X,e,X,f,g,X] or it could be multidimensional, I suppose, but
> that's the idea. I understand delete() will remove a value at a
> specific index, but I was unsuccessful in combining this function with
> 'where' to get what I want. Any suggestions?
>
> Thanks,
> -Adrian
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user

This is my quickest solution with a list comprehension:

In [2]: a = array([1,2,3,4,6,5,6,7,8,6])

In [3]: a = array([a[i] for i in range(len(a)) if a[i] != 6])

In [4]: a
Out[4]: array([1, 2, 3, 4, 5, 7, 8])

--
Gökhan

From scott.sinclair.za at gmail.com  Tue Jul 14 11:14:48 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Tue, 14 Jul 2009 17:14:48 +0200
Subject: [SciPy-user] Quick way to delete all 'values' from array
In-Reply-To: <31F448E2-6034-46D5-8231-4CDEE2ED836F@gmail.com>
References: <31F448E2-6034-46D5-8231-4CDEE2ED836F@gmail.com>
Message-ID: <6a17e9ee0907140814t1c81e208ydd8dd01a4923e758@mail.gmail.com>

> 2009/7/14 Adrian Price-Whelan :
> I'm just looking for the quickest way to remove all X from an array
> [a,b,c,d,X,e,X,f,g,X] or it could be multidimensional, I suppose, but
> that's the idea. I understand delete() will remove a value at a
> specific index, but I was unsuccessful in combining this function with
> 'where' to get what I want. Any suggestions?

If by 'quickest' you mean 'easiest', here's how to do it using fancy
indexing:

>>> import numpy as np
>>> a = np.array([0, -1, 2, 3, -1, 4])
>>> a
array([ 0, -1,  2,  3, -1,  4])
>>> a = a[a != -1]
>>> a
array([0, 2, 3, 4])

This works because a != -1 returns a boolean array that can be used as
indices into the original array.

>>> a = np.array([0, -1, 2, 3, -1, 4])
>>> a != -1
array([ True, False,  True,  True, False,  True], dtype=bool)

See also: http://docs.scipy.org/doc/numpy/user/basics.indexing.html

Cheers,
Scott

From Dharhas.Pothina at twdb.state.tx.us  Tue Jul 14 13:23:39 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Tue, 14 Jul 2009 12:23:39 -0500
Subject: [SciPy-user] scikits.timeseries : Calculate previous 30 day and 30-60 day cumulative flows.
Message-ID: <4A5C78CB.63BA.009B.0@twdb.state.tx.us>

Hi,

I have a daily timeseries called 'inflow_ts' and for each date in the
series I need to calculate the cumulative inflow for the previous 30
days and the previous 30-60 days. From the documentation I think
scikits.timeseries.lib.moving_funcs.mov_sum should do what I need for
the first case using

inflow_30 = mov_sum(inflow_ts,30)

but this throws an 'AttributeError : logical_not'

Also, how would I calculate the 30-60 day flow? I guess I could
calculate the previous 60 day flow and subtract the 30 day flow. Is
there a better way?

thanks.
- dharhas

From pgmdevlist at gmail.com  Tue Jul 14 13:48:06 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Tue, 14 Jul 2009 13:48:06 -0400
Subject: [SciPy-user] scikits.timeseries : Calculate previous 30 day and 30-60 day cumulative flows.
In-Reply-To: <4A5C78CB.63BA.009B.0@twdb.state.tx.us>
References: <4A5C78CB.63BA.009B.0@twdb.state.tx.us>
Message-ID: <0BD05B3E-E2C0-49C3-8AC3-3FA7BBEB2932@gmail.com>

On Jul 14, 2009, at 1:23 PM, Dharhas Pothina wrote:
> Hi,
>
> I have a daily timeseries called 'inflow_ts' and for each date in
> the series I need to calculate the cumulative inflow for the
> previous 30 days and the previous 30-60 days. From the documentation
> I think scikits.timeseries.lib.moving_funcs.mov_sum should do what I
> need for the first case using
>
> inflow_30 = mov_sum(inflow_ts,30)
>
> but this throws an 'AttributeError : logical_not'

Now that's a strange one. Send me more info (inflows_ts, version, ...)

> Also, how would I calculate the 30-60 day flow? I guess I could
> calculate the previous 60 day flow and subtract the 30 day flow. Is
> there a better way?

Not that I see right now.
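For what it's worth, a minimal sketch of that subtraction approach
(untested, and assuming the series and spans from the message above):

from scikits.timeseries.lib.moving_funcs import mov_sum

inflow_30 = mov_sum(inflow_ts, 30)    # cumulative flow over the previous 30 days
inflow_60 = mov_sum(inflow_ts, 60)    # cumulative flow over the previous 60 days
inflow_30_60 = inflow_60 - inflow_30  # flow over the previous 30-60 day window

The first (span - 1) entries of a moving sum are masked, so the 30-60 day
series is only defined once a full 60 days of data are available.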
From asreeve at maine.edu  Tue Jul 14 14:08:22 2009
From: asreeve at maine.edu (A.Reeve)
Date: Tue, 14 Jul 2009 14:08:22 -0400 (EDT)
Subject: [SciPy-user] passing vector to integrate.ode
Message-ID: 

I'm attempting to use the scipy integrators to solve a suite of similar
ODE's within a finite volume model (one system of equations in each grid
cell of the model). Currently, I'm looping over each cell in the model
and applying ode from scipy integrate. Is it possible to speed this up
by passing vectors of my parameters to the ode method?

Here's what I'm currently doing:

---
from scipy.integrate import ode
import numpy as N

def MDiff1(t, C, k, porI, porM):
    # function for conc exchange between mobile and immobile domains
    # C[0] is mobile and C[1] is immobile concentration
    dCI = (k/porI)*(C[0]-C[1])
    dCM = (k/porM)*(C[1]-C[0])
    return N.array([dCM, dCI])

porI = N.array([.5, .5, .5])
porM = N.array([.2, .2, .2])
dt = 5000.
k = 1.e-6
CMob = N.array([10., 10., 10.])
CIm = N.array([10.1, 5., 1.])

r = ode(MDiff1)
r.set_integrator('vode', method='adams')
for i in range(1):  # loop over time
    for j in range(3):  # loop over cells
        r.set_initial_value(N.array([CMob[j], CIm[j]]))
        r.set_f_params(k, porI[j], porM[j])
        r.integrate(r.t+dt)
        print (i+1)*r.t, r.y  # , r.y[0]*porM+r.y[1]*porI
        CMob[j], CIm[j] = r.y
---

In my simulation, CMob, CIm, porM, and porI are vectors with lengths in
the thousands. Are there better (faster) ways of doing this?

Andrew Reeve
Dept. of Earth Sciences
University of Maine
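A sketch of one way to cut the per-cell Python loop (untested, and
assuming every cell shares the same k, as in the snippet above): stack
all cells into a single system and let the solver integrate them
together, one time step for the whole grid at once:

from scipy.integrate import ode
import numpy as N

def MDiffAll(t, C, k, porI, porM):
    # first half of C holds mobile concentrations, second half immobile ones
    n = C.size // 2
    CM, CI = C[:n], C[n:]
    dCI = (k/porI)*(CM - CI)
    dCM = (k/porM)*(CI - CM)
    return N.concatenate([dCM, dCI])

porI = N.array([.5, .5, .5])
porM = N.array([.2, .2, .2])
CMob = N.array([10., 10., 10.])
CIm = N.array([10.1, 5., 1.])
k = 1.e-6
dt = 5000.

r = ode(MDiffAll)
r.set_integrator('vode', method='adams')
r.set_initial_value(N.concatenate([CMob, CIm]))
r.set_f_params(k, porI, porM)
r.integrate(r.t + dt)
CMob, CIm = r.y[:CMob.size], r.y[CMob.size:]

The right-hand side is then evaluated with array arithmetic over all
cells, instead of running thousands of separate integrations.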
From Dharhas.Pothina at twdb.state.tx.us  Tue Jul 14 14:18:35 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Tue, 14 Jul 2009 13:18:35 -0500
Subject: [SciPy-user] scikits.timeseries : Calculate previous 30 day and 30-60 day cumulative flows.
Message-ID: <4A5C85AB0200009B0001FF4F@GWWEB.twdb.state.tx.us>

>>> Pierre GM 07/14/09 12:48 PM >>>
> Now that's a strange one. Send me more info (inflows_ts, version, ...)

I was able to recreate the error with a shorter dataset. I'm using
np.__version__ = '1.3.0' and ts.__version__ = '0.91.1'

In [24]: inflows_ts
Out[24]:
timeseries([(798062.75047999993,) (634009.66576,) (753657.40423999995,)
 (783260.9683999999,) (653745.37520000001,) (572335.57375999994,)
 (582203.42848,) (567401.64639999997,) (542732.00959999999,)
 (616740.91999999993,)],
   dtype = [('Inflow', '<f8')])

In [25]: mov_sum(inflows_ts, 2)
---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)

/usr/lib64/python2.5/site-packages/scikits.timeseries-0.91.1-py2.5-linux-x86_64.egg/scikits/timeseries/lib/moving_funcs.pyc in mov_sum(data, span, dtype)
    169     """
    170
--> 171     return _mov_sum(data, span, dtype=dtype)
    172 #...............................................................................
    173 def mov_median(data, span, dtype=None):

/usr/lib64/python2.5/site-packages/scikits.timeseries-0.91.1-py2.5-linux-x86_64.egg/scikits/timeseries/lib/moving_funcs.pyc in _mov_sum(data, span, dtype, type_num_double)
    154     if dtype is not None:
    155         kwargs['dtype'] = dtype
--> 156     return _moving_func(data, MA_mov_sum, kwargs)
    157 #...............................................................................
    158 def mov_sum(data, span, dtype=None):

/usr/lib64/python2.5/site-packages/scikits.timeseries-0.91.1-py2.5-linux-x86_64.egg/scikits/timeseries/lib/moving_funcs.pyc in _moving_func(data, cfunc, kwargs)
    119 def _moving_func(data, cfunc, kwargs):
    120
--> 121     data = ma.fix_invalid(data)
    122     data = ma.array(data.filled(0), mask=data._mask)
    123

/usr/lib/python2.5/site-packages/numpy-1.3.0-py2.5-linux-x86_64.egg/numpy/ma/core.pyc in fix_invalid(a, mask, copy, fill_value)
    514     a = masked_array(a, copy=copy, mask=mask, subok=True)
    515     #invalid = (numpy.isnan(a._data) | numpy.isinf(a._data))
--> 516     invalid = np.logical_not(np.isfinite(a._data))
    517     if not invalid.any():
    518         return a

AttributeError: logical_not

From pgmdevlist at gmail.com  Tue Jul 14 14:31:36 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Tue, 14 Jul 2009 14:31:36 -0400
Subject: [SciPy-user] scikits.timeseries : Calculate previous 30 day and 30-60 day cumulative flows.
In-Reply-To: <4A5C85AB0200009B0001FF4F@GWWEB.twdb.state.tx.us>
References: <4A5C85AB0200009B0001FF4F@GWWEB.twdb.state.tx.us>
Message-ID: 

On Jul 14, 2009, at 2:18 PM, Dharhas Pothina wrote:
>>>> Pierre GM 07/14/09 12:48 PM >>>
>> Now that's a strange one. Send me more info (inflows_ts, version, ...)
> I was able to recreate the error with a shorter dataset. I'm using
> np.__version__ = '1.3.0' and ts.__version__ = '0.91.1'

OK, I've been able to reproduce the bug. I'm gonna investigate further
why we get an AttributeError, but here's a workaround already:
Your input series is a structured array, and a lot of numerical
operations choke on it. You can either transform it into a
non-structured array with .view(dtype=float), or call mov_sum on the
'inflow' field (as in mov_sum(inflows_ts['inflow'],2))

From Dharhas.Pothina at twdb.state.tx.us  Tue Jul 14 14:34:53 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Tue, 14 Jul 2009 13:34:53 -0500
Subject: [SciPy-user] scikits.timeseries : Calculate previous 30 day and 30-60 day cumulative flows.
In-Reply-To: 
References: <4A5C85AB0200009B0001FF4F@GWWEB.twdb.state.tx.us>
Message-ID: <4A5C897D.63BA.009B.0@twdb.state.tx.us>

>>> Pierre GM 07/14/09 12:48 PM >>>
>> Now that's a strange one. Send me more info (inflows_ts, version, ...)

> OK, I've been able to reproduce the bug. I'm gonna investigate further
> why we get an AttributeError, but here's a workaround already:
> Your input series is a structured array, and a lot of numerical
> operations choke on it. You can either transform it into a
> non-structured array with .view(dtype=float), or call mov_sum on the
> 'inflow' field (as in mov_sum(inflows_ts['inflow'],2))

Ok good to know. I'll use the workaround.

thanks,

- dharhas

From rowen at uw.edu  Tue Jul 14 14:33:10 2009
From: rowen at uw.edu (Russell E. Owen)
Date: Tue, 14 Jul 2009 11:33:10 -0700
Subject: [SciPy-user] python 2.6.2 mac os x (import signal causes a Bus Error)
References: <0016e649c9263fd92c046ea0943d@google.com>
	<5b8d13220907131911h743b445bn2529439e00db5c20@mail.gmail.com>
	<84bdef3c0907131954r8658f28x7a3dbd00dfe72614@mail.gmail.com>
	<4A5BEFBE.7040006@ar.media.kyoto-u.ac.jp>
Message-ID: 

In article <4A5BEFBE.7040006 at ar.media.kyoto-u.ac.jp>,
 David Cournapeau wrote:

> Rajanikanth Jammalamadaka wrote:
> > I tried it on another system (also Mac OS X) which had numpy version
> > 1.3 and it seems to work fine on that one.
> >
> > So, is it a version problem?
> >
>
> Yes - if you use the official scipy binary, you need to use the
> corresponding official numpy binary (which is 1.3.0 for python 2.6). As
> an alternative, you can build scipy by yourself against numpy - in this
> case, you can use numpy 1.2.1.

Except that numpy 1.3.0 is the minimum version that is compatible with
Python 2.6.

-- Russell

From pgmdevlist at gmail.com  Tue Jul 14 14:41:46 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Tue, 14 Jul 2009 14:41:46 -0400
Subject: [SciPy-user] scikits.timeseries : Calculate previous 30 day and 30-60 day cumulative flows.
In-Reply-To: <4A5C897D.63BA.009B.0@twdb.state.tx.us>
References: <4A5C85AB0200009B0001FF4F@GWWEB.twdb.state.tx.us>
	<4A5C897D.63BA.009B.0@twdb.state.tx.us>
Message-ID: <2982EC6F-A82F-4378-8E78-05171E903EF7@gmail.com>

On Jul 14, 2009, at 2:34 PM, Dharhas Pothina wrote:
>
> Ok good to know. I'll use the workaround.

I checked a bit more. ma.fix_invalid calls np.isfinite on the data
part (a structured ndarray). Because the input is a structured array,
np.isfinite returns a NotImplemented (a specific object of type
NotImplementedType). np.logical_not chokes on that and raises the
AttributeError exception. I'm a bit surprised, I would have expected
np.isfinite to raise a NotImplemented exception instead of returning
something. I'll ask on the numpy mailing list.
cheers
P.

From d_l_goldsmith at yahoo.com  Tue Jul 14 16:19:21 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 14 Jul 2009 13:19:21 -0700 (PDT)
Subject: [SciPy-user] Quick way to delete all 'values' > from array
Message-ID: <202517.47117.qm@web52110.mail.re2.yahoo.com>

> From: Scott Sinclair
> Subject: Re: [SciPy-user] Quick way to delete all 'values' > from array
> If by 'quickest' you mean 'easiest', here's how to do it using fancy
> indexing:
>
> >>> import numpy as np
> >>> a = np.array([0, -1, 2, 3, -1, 4])
> >>> a
> array([ 0, -1,  2,  3, -1,  4])
> >>> a = a[a != -1]
> >>> a
> array([0, 2, 3, 4])

what if he wants to eliminate more than one value at a time; is that
possible? I tried a bunch of ways, the closest I got (methinks) to
success was:

>>> a[[index for index in range(len(a)) a[index] not in (0,2)]]

That gave an invalid syntax error at the third a; using a colon
following the len(a)) gave an invalid syntax error at the colon; and
using a comma after len(a)) gave a NameError: name 'index' not defined.
Am I just forgetting how to do conditional list comprehension, and/or
is this approach doomed to failure anyway? If the latter, is there an
alternative way to do this?
Curious,
DG

> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user

From pgmdevlist at gmail.com  Tue Jul 14 16:49:50 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Tue, 14 Jul 2009 16:49:50 -0400
Subject: [SciPy-user] Quick way to delete all 'values' > from array
In-Reply-To: <202517.47117.qm@web52110.mail.re2.yahoo.com>
References: <202517.47117.qm@web52110.mail.re2.yahoo.com>
Message-ID: <47445786-ED66-403E-BEF9-7A80DB5ABD53@gmail.com>

On Jul 14, 2009, at 4:19 PM, David Goldsmith wrote:
>
> what if he wants to eliminate more than one value at a time; is that
> possible?

David,
Put it this way: an ndarray occupies a fixed space in memory (set when
you create it), and you cannot add nor delete entries. What you can do
is to create a *new* array that contains only the values that satisfy
a given condition. It's what we're doing with the syntax
>>> a[a!=1]
Now, if you have several entries that you want to discard, you can try
to create a boolean array that satisfies all the conditions
simultaneously, like
>>> cond = np.logical_and.reduce([a!=_ for _ in (1,2)])
and use that to select the proper values
>>> a[cond]
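For instance, a quick sketch with made-up values (assuming numpy is
imported as np):

>>> import numpy as np
>>> a = np.array([0, 1, 2, 3, 0, 2, 4])
>>> cond = np.logical_and.reduce([a != v for v in (0, 2)])
>>> a[cond]
array([1, 3, 4])

And for what it's worth, the list comprehension in your attempt was only
missing the 'if' keyword:
>>> a[[i for i in range(len(a)) if a[i] not in (0, 2)]]
array([1, 3, 4])
although the boolean mask avoids the Python-level loop.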
> I tried a bunch of ways, the closest I got (methinks) to success was:
>
>>>> a[[index for index in range(len(a)) a[index] not in (0,2)]]
>
> That gave an invalid syntax error at the third a; using a colon
> following the len(a)) gave an invalid syntax error at the colon; and
> using a comma after len(a)) gave a NameError: name 'index' not
> defined. Am I just forgetting how to do conditional list
> comprehension, and/or is this approach doomed to failure anyway? If
> the latter, is there an alternative way to do this?
>
> Curious,
> DG
>
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user

From sccolbert at gmail.com  Tue Jul 14 20:00:14 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Tue, 14 Jul 2009 20:00:14 -0400
Subject: [SciPy-user] non-linear multi-variate optimization
Message-ID: <7f014ea60907141700w2f73f8e1yca076104054122ea@mail.gmail.com>

The routines for non-linear optimization in scipy.optimize take an
argument for a function that computes the gradient.

What should be the format of the return value of this function? I am
assuming that it's the gradient of the function with respect to the
independent variables in row vector format.

for example say we have:

f(x,y,z; a1, a2, a3) where a1, a2, and a3 are the independent variables.

Should the gradient of N x,y,z points then be of the form:

df/da = [[df(X0)/da1, df(X0)/da2, df(X0)/da3],
         [df(X1)/da1, df(X1)/da2, df(X1)/da3],
         ....
         [df(Xn)/da1, df(Xn)/da2, df(Xn)/da3]]

where Xn is the set of (xn, yn, zn)?

cheers!

Chris

From cournape at gmail.com  Tue Jul 14 20:43:01 2009
From: cournape at gmail.com (David Cournapeau)
Date: Wed, 15 Jul 2009 09:43:01 +0900
Subject: [SciPy-user] python 2.6.2 mac os x (import signal causes a Bus Error)
In-Reply-To: 
References: <0016e649c9263fd92c046ea0943d@google.com>
	<5b8d13220907131911h743b445bn2529439e00db5c20@mail.gmail.com>
	<84bdef3c0907131954r8658f28x7a3dbd00dfe72614@mail.gmail.com>
	<4A5BEFBE.7040006@ar.media.kyoto-u.ac.jp>
Message-ID: <5b8d13220907141743u2d2915dbt3e85ce541bbdfecf@mail.gmail.com>

On Wed, Jul 15, 2009 at 3:33 AM, Russell E. Owen wrote:
> In article <4A5BEFBE.7040006 at ar.media.kyoto-u.ac.jp>,
>  David Cournapeau wrote:
>
>> Rajanikanth Jammalamadaka wrote:
>> > I tried it on another system (also Mac OS X) which had numpy version
>> > 1.3 and it seems to work fine on that one.
>> >
>> > So, is it a version problem?
>> >
>>
>> Yes - if you use the official scipy binary, you need to use the
>> corresponding official numpy binary (which is 1.3.0 for python 2.6). As
>> an alternative, you can build scipy by yourself against numpy - in this
>> case, you can use numpy 1.2.1.
>
> Except that numpy 1.3.0 is the minimum version that is compatible with
> Python 2.6.

You can get by without it on mac os x if you don't care about a few
bugs. Certainly, the OP had numpy 1.2.1 installed for 2.6, after all
(numpy would never have loaded and passed tests under 2.6 if it was not
a version built against 2.6).

cheers,

David

From sebastian.walter at gmail.com  Wed Jul 15 03:32:55 2009
From: sebastian.walter at gmail.com (Sebastian Walter)
Date: Wed, 15 Jul 2009 09:32:55 +0200
Subject: [SciPy-user] non-linear multi-variate optimization
In-Reply-To: <7f014ea60907141700w2f73f8e1yca076104054122ea@mail.gmail.com>
References: <7f014ea60907141700w2f73f8e1yca076104054122ea@mail.gmail.com>
Message-ID: 

The gradient g is only defined for functions f: R^N --> R and is
simply an array with shape (N,).

What you sketched in your post is the Jacobian J of a function
f: R^N --> R^M. Typically, the Jacobian J is defined to have the shape
(M,N), but there are exceptions.
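For concreteness, a minimal sketch of the shapes fmin_bfgs expects (the
quadratic objective here is made up purely for illustration):

from scipy.optimize import fmin_bfgs
import numpy as np

def f(a):
    # objective: maps R^N to a single scalar
    return np.sum((a - 1.0)**2)

def fprime(a):
    # gradient: same shape as a, i.e. (N,)
    return 2.0*(a - 1.0)

xopt = fmin_bfgs(f, np.zeros(3), fprime=fprime)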
From scott.sinclair.za at gmail.com Wed Jul 15 08:43:26 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Wed, 15 Jul 2009 14:43:26 +0200 Subject: [SciPy-user] comparing two lists/arrays In-Reply-To: <4A5DCC42.4080003@ntc.zcu.cz> References: <4A5DCC42.4080003@ntc.zcu.cz> Message-ID: <6a17e9ee0907150543o4fac8471off2f6fb1214791af@mail.gmail.com> > 2009/7/15 Robert Cimrman : > nicky van foreest wrote: >> Given two vectors x and y, the (perhaps) common mathematical >> definition of x < y is that x_i < y_i for all i. ?Thus, the >> mathematical comparison x > booleans for each x_i < y_i. ?I implemented this behavior as >> prod(less(X,Y)) (I use less to be able to deal with lists X and Y >> also). Is there perhaps a more straighforward/elegant/readible way to >> achieve the same behavior? > > assuming x, y are numpy arrays: (x < y).all() You could do the following to handle the case where they aren't: >>> import numpy as np >>> x = range(10) >>> y = range(1, 11) >>> np.all(x < y) True Cheers, Scott From scott.sinclair.za at gmail.com Wed Jul 15 08:48:18 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Wed, 15 Jul 2009 14:48:18 +0200 Subject: [SciPy-user] comparing two lists/arrays In-Reply-To: <6a17e9ee0907150543o4fac8471off2f6fb1214791af@mail.gmail.com> References: <4A5DCC42.4080003@ntc.zcu.cz> <6a17e9ee0907150543o4fac8471off2f6fb1214791af@mail.gmail.com> Message-ID: <6a17e9ee0907150548n18c9eb95h17ad20c9e2c5e31d@mail.gmail.com> >2009/7/15 Scott Sinclair : >> 2009/7/15 Robert Cimrman : >> nicky van foreest wrote: >>> Given two vectors x and y, the (perhaps) common mathematical >>> definition of x < y is that x_i < y_i for all i. ?Thus, the >>> mathematical comparison x >> booleans for each x_i < y_i. ?I implemented this behavior as >>> prod(less(X,Y)) (I use less to be able to deal with lists X and Y >>> also). Is there perhaps a more straighforward/elegant/readible way to >>> achieve the same behavior? >> >> assuming x, y are numpy arrays: (x < y).all() > > You could do the following to handle the case where they aren't: > >>>> import numpy as np >>>> x = range(10) >>>> y = range(1, 11) >>>> np.all(x < y) > True Scratch that >>> x < y True S From cimrman3 at ntc.zcu.cz Wed Jul 15 08:53:57 2009 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Wed, 15 Jul 2009 14:53:57 +0200 Subject: [SciPy-user] comparing two lists/arrays In-Reply-To: <6a17e9ee0907150548n18c9eb95h17ad20c9e2c5e31d@mail.gmail.com> References: <4A5DCC42.4080003@ntc.zcu.cz> <6a17e9ee0907150543o4fac8471off2f6fb1214791af@mail.gmail.com> <6a17e9ee0907150548n18c9eb95h17ad20c9e2c5e31d@mail.gmail.com> Message-ID: <4A5DD165.2000707@ntc.zcu.cz> Scott Sinclair wrote: >> 2009/7/15 Scott Sinclair : >>> 2009/7/15 Robert Cimrman : >>> nicky van foreest wrote: >>>> Given two vectors x and y, the (perhaps) common mathematical >>>> definition of x < y is that x_i < y_i for all i. Thus, the >>>> mathematical comparison x >>> booleans for each x_i < y_i. I implemented this behavior as >>>> prod(less(X,Y)) (I use less to be able to deal with lists X and Y >>>> also). Is there perhaps a more straighforward/elegant/readible way to >>>> achieve the same behavior? >>> assuming x, y are numpy arrays: (x < y).all() >> You could do the following to handle the case where they aren't: >> >>>>> import numpy as np >>>>> x = range(10) >>>>> y = range(1, 11) >>>>> np.all(x < y) >> True > > Scratch that > >>>> x < y > True beware! 
y[2] = -1

In [21]: y
Out[21]: [1, 2, -1, 4, 5, 6, 7, 8, 9, 10]

In [22]: x < y
Out[22]: True

From michael at ini.phys.ethz.ch  Wed Jul 15 09:00:25 2009
From: michael at ini.phys.ethz.ch (Michael Graber)
Date: Wed, 15 Jul 2009 15:00:25 +0200
Subject: [SciPy-user] reading (labview) binary data
Message-ID: 

hi all,

i'm looking for the fastest way to read (labview) binary files.

i figured out a method which is basically the same as described on:

http://www.shocksolution.com/2008/06/25/reading-labview-binary-files-with-python/

that is:

binaryFile = open("Measurement_4.bin", mode='rb')
(data.offset,) = struct.unpack('>d', binaryFile.read(8))

but this is terribly slow. to read 12 MB takes about 80 seconds.
matlab is orders of magnitude faster ..

is there a better, i.e. faster way to read in (labview) binary files?

thanks,
michael

From emanuele at relativita.com  Wed Jul 15 09:47:17 2009
From: emanuele at relativita.com (Emanuele Olivetti)
Date: Wed, 15 Jul 2009 15:47:17 +0200
Subject: [SciPy-user] reading (labview) binary data
In-Reply-To: 
References: 
Message-ID: <4A5DDDE5.1050504@relativita.com>

Michael Graber wrote:
> hi all,
>
> i'm looking for the fastest way to read (labview) binary files.
>
> i figured out a method which is basically the same as described on:
>
> http://www.shocksolution.com/2008/06/25/reading-labview-binary-files-with-python/
>
> that is:
>
> binaryFile = open("Measurement_4.bin", mode='rb')
> (data.offset,) = struct.unpack('>d', binaryFile.read(8))
>
> but this is terribly slow. to read 12 MB takes about 80 seconds.
> matlab is orders of magnitude faster ..
>
> is there a better, i.e. faster way to read in (labview) binary files?
>
>
Try numpy.fromfile(). Example:

import numpy as np
f = open('Measurement_4.bin', 'rb')
data = np.fromfile(f, dtype='>d')  # reads the whole file
f.close()
print data.shape

'data' should now be a vector of big-endian doubles from the whole
file. If you want to parse just a subset of all doubles within the file
use the optional argument 'count='. See np.fromfile() docstring.

Cheers,

Emanuele

From david.huard at gmail.com  Wed Jul 15 09:56:08 2009
From: david.huard at gmail.com (David Huard)
Date: Wed, 15 Jul 2009 09:56:08 -0400
Subject: [SciPy-user] reading (labview) binary data
In-Reply-To: 
References: 
Message-ID: <91cf711d0907150656v417e2aafyf9d28588241b535b@mail.gmail.com>

Michael,

I have no experience with Labview files, but for uniform data
structures, I found numpy.fromfile and numpy.memmap to be more
practical than struct.unpack.
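For instance, a rough sketch with memmap (assuming, as above, a flat
file of big-endian doubles and no header):

import numpy as np

data = np.memmap('Measurement_4.bin', dtype='>d', mode='r')
# slices are read lazily from disk, so you only pay for the parts you touch
chunk = np.asarray(data[:1000])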
HTH,

David

On Wed, Jul 15, 2009 at 9:00 AM, Michael Graber wrote:
>
> hi all,
>
> i'm looking for the fastest way to read (labview) binary files.
>
> i figured out a method which is basically the same as described on:
>
> http://www.shocksolution.com/2008/06/25/reading-labview-binary-files-with-python/
>
> that is:
>
> binaryFile = open("Measurement_4.bin", mode='rb')
> (data.offset,) = struct.unpack('>d', binaryFile.read(8))
>
> but this is terribly slow. to read 12 MB takes about 80 seconds.
> matlab is orders of magnitude faster ..
>
> is there a better, i.e. faster way to read in (labview) binary files?
>
> thanks,
> michael
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user

From sccolbert at gmail.com  Wed Jul 15 10:13:27 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Wed, 15 Jul 2009 10:13:27 -0400
Subject: [SciPy-user] comparing two lists/arrays
In-Reply-To: <4A5DD165.2000707@ntc.zcu.cz>
References: <4A5DCC42.4080003@ntc.zcu.cz>
	<6a17e9ee0907150543o4fac8471off2f6fb1214791af@mail.gmail.com>
	<6a17e9ee0907150548n18c9eb95h17ad20c9e2c5e31d@mail.gmail.com>
	<4A5DD165.2000707@ntc.zcu.cz>
Message-ID: <7f014ea60907150713i21259ac0q3a96dcb4bef9ecc5@mail.gmail.com>

this should work for any case:

>>> y = np.random.rand(5)
>>> x = np.random.rand(5)

>>> y
array([ 0.21991179,  0.82874802,  0.65327351,  0.02277029,  0.14618527])

>>> x
array([ 0.46541554,  0.86719123,  0.50618409,  0.13140126,  0.24533278])

>>> (y - x) > 0
array([False, False,  True, False, False], dtype=bool)

On Wed, Jul 15, 2009 at 8:53 AM, Robert Cimrman wrote:
> Scott Sinclair wrote:
>>> 2009/7/15 Scott Sinclair :
>>>> 2009/7/15 Robert Cimrman :
>>>> nicky van foreest wrote:
>>>>> Given two vectors x and y, the (perhaps) common mathematical
>>>>> definition of x < y is that x_i < y_i for all i. Thus, the
>>>>> mathematical comparison x < y yields a single boolean, while the
>>>>> elementwise comparison yields a vector of booleans for each
>>>>> x_i < y_i. I implemented this behavior as
>>>>> prod(less(X,Y)) (I use less to be able to deal with lists X and Y
>>>>> also). Is there perhaps a more straightforward/elegant/readable way to
>>>>> achieve the same behavior?
>>>> assuming x, y are numpy arrays: (x < y).all()
>>> You could do the following to handle the case where they aren't:
>>>
>>>>>> import numpy as np
>>>>>> x = range(10)
>>>>>> y = range(1, 11)
>>>>>> np.all(x < y)
>>> True
>>
>> Scratch that
>>
>>>>> x < y
>> True
>
> beware!
>
> y[2] = -1
>
> In [21]: y
> Out[21]: [1, 2, -1, 4, 5, 6, 7, 8, 9, 10]
>
> In [22]: x < y
> Out[22]: True
>
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>

From vanforeest at gmail.com  Wed Jul 15 13:59:51 2009
From: vanforeest at gmail.com (nicky van foreest)
Date: Wed, 15 Jul 2009 19:59:51 +0200
Subject: [SciPy-user] comparing two lists/arrays
In-Reply-To: <7f014ea60907150713i21259ac0q3a96dcb4bef9ecc5@mail.gmail.com>
References: <4A5DCC42.4080003@ntc.zcu.cz>
	<6a17e9ee0907150543o4fac8471off2f6fb1214791af@mail.gmail.com>
	<6a17e9ee0907150548n18c9eb95h17ad20c9e2c5e31d@mail.gmail.com>
	<4A5DD165.2000707@ntc.zcu.cz>
	<7f014ea60907150713i21259ac0q3a96dcb4bef9ecc5@mail.gmail.com>
Message-ID: 

Hi,

Thanks for your advice.

Nicky

2009/7/15 Chris Colbert :
> this should work for any case:
>
>>>> y = np.random.rand(5)
>>>> x = np.random.rand(5)
>
>>>> y
> array([ 0.21991179,  0.82874802,  0.65327351,  0.02277029,  0.14618527])
>
>>>> x
> array([ 0.46541554,  0.86719123,  0.50618409,  0.13140126,  0.24533278])
>
>>>> (y - x) > 0
> array([False, False,  True, False, False], dtype=bool)
>
>
> On Wed, Jul 15, 2009 at 8:53 AM, Robert Cimrman wrote:
>> Scott Sinclair wrote:
>>>> 2009/7/15 Scott Sinclair :
>>>>> 2009/7/15 Robert Cimrman :
>>>>> nicky van foreest wrote:
>>>>>> Given two vectors x and y, the (perhaps) common mathematical
>>>>>> definition of x < y is that x_i < y_i for all i. Thus, the
>>>>>> mathematical comparison x < y yields a single boolean, while the
>>>>>> elementwise comparison yields a vector of booleans for each
>>>>>> x_i < y_i. I implemented this behavior as
>>>>>> prod(less(X,Y)) (I use less to be able to deal with lists X and Y
>>>>>> also). Is there perhaps a more straightforward/elegant/readable way to
>>>>>> achieve the same behavior?
>>>>> assuming x, y are numpy arrays: (x < y).all()
>>>> You could do the following to handle the case where they aren't:
>>>>
>>>>>>> import numpy as np
>>>>>>> x = range(10)
>>>>>>> y = range(1, 11)
>>>>>>> np.all(x < y)
>>>> True
>>>
>>> Scratch that
>>>
>>>>>> x < y
>>> True
>>
>> beware!
>>
>> y[2] = -1
>>
>> In [21]: y
>> Out[21]: [1, 2, -1, 4, 5, 6, 7, 8, 9, 10]
>>
>> In [22]: x < y
>> Out[22]: True
>>
>> _______________________________________________
>> SciPy-user mailing list
>> SciPy-user at scipy.org
>> http://mail.scipy.org/mailman/listinfo/scipy-user
>>
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>

From gael.varoquaux at normalesup.org  Wed Jul 15 22:13:11 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Thu, 16 Jul 2009 04:13:11 +0200
Subject: [SciPy-user] [ANN] Announcing the SciPy conference schedule
Message-ID: <20090716021311.GA21642@phare.normalesup.org>

The SciPy conference committee is pleased to announce the schedule of the
conference:

http://conference.scipy.org/schedule

This year's program is very rich. In order to limit the number of
interesting talks that we had to turn down, we decided to reduce the
length of talks. Although this results in many short talks, we hope that
it will foster discussions, and give new ideas. Many subjects are
covered, both varied technical subjects across the scientific computing
spectrum, and a lot of different research areas.

I would personally like to thank the members of the program committee,
who spent time reviewing the proposed abstracts and giving the chairs
feedback.

Fernando Perez and the tutorial presenters are hard at work finishing
planning all the details of the two-day tutorial session that will
precede the conference. An introductory tutorial track and an advanced
tutorial track, both covering various aspects of scientific computing in
Python, presented by experts in the field, should help many people
getting up to speed on the amazing technology driving this community.
The SciPy 2009 program committee

* Co-Chair Gaël Varoquaux, Applied Mathematics and Neuroscience,
  Neurospin, CEA - INRIA Saclay (France)
* Co-Chair Stéfan van der Walt, Applied Mathematics, University of
  Stellenbosch (South Africa)
* Michael Aivazis, Center for Advanced Computing Research, California
  Institute of Technology (USA)
* Brian Granger, Physics Department, California Polytechnic State
  University, San Luis Obispo (USA)
* Aric Hagberg, Theoretical Division, Los Alamos National Laboratory (USA)
* Konrad Hinsen, Centre de Biophysique Moléculaire, CNRS Orléans (France)
* Randall LeVeque, Mathematics, University of Washington, Seattle (USA)
* Travis Oliphant, Enthought (USA)
* Prabhu Ramachandran, Department of Aerospace Engineering, IIT Bombay (India)
* Raphael Ritz, International Neuroinformatics Coordinating Facility (Sweden)
* William Stein, Mathematics, University of Washington, Seattle (USA)

Conference Chair: Jarrod Millman, Neuroscience Institute, UC Berkeley (USA)

From Dharhas.Pothina at twdb.state.tx.us  Thu Jul 16 08:53:25 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Thu, 16 Jul 2009 07:53:25 -0500
Subject: [SciPy-user] scikits.timeseries: unable to stack timeseries with different number of columns
Message-ID: <4A5EDC75.63BA.009B.0@twdb.state.tx.us>

Hi,

I'm trying to loop through a set of timeseries objects and add each
timeseries to a single timeseries using ts.stack(). I'm getting the
following error:

TimeseriesCompatibility Error: Incompatible Sizes! ( 1: (432,) <> 2: (432,4) )

From what I can see, I can only stack timeseries with the exact same
dimensions, i.e. if I have a timeseries a_ts with dimensions (432,) and
b_ts with dimensions (432,4) I can use:

ts.stack(a_ts,a_ts) or ts.stack(b_ts,b_ts)

but I cannot do

ts.stack(a_ts,b_ts)

Is this correct, and if it is, is there a workaround?

thanks

- dharhas

From Dharhas.Pothina at twdb.state.tx.us  Thu Jul 16 09:08:07 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Thu, 16 Jul 2009 08:08:07 -0500
Subject: [SciPy-user] scikits.timeseries : Problem using tsfromtxt with converter
Message-ID: <4A5EDFE70200009B0002007A@GWWEB.twdb.state.tx.us>

Hi,

I have data in the following format:

Date, 28.898 -95.334
09/01/2008 00:00, 0mm
09/01/2008 01:00, 0mm
09/01/2008 02:00, 0mm
...

If I read it using

data = ts.tsfromtxt(file,skiprows=1,datecols=(0),delimiter=',',freq = 'H')

it reads in correctly, but the data is read in as strings with the 'mm',
i.e.

timeseries([[ 0mm]
 [ 0mm]
 ...,
 [ 0mm]],
   dates = [01-Sep-2008 00:00 ... 01-Jun-2009 23:00],
   freq = H)

I want to remove the 'mm' and read the data as floats. I tried using a
converter:

conv = {1: lambda s: s.split('m')[0]}
data = ts.tsfromtxt(file,skiprows=1,datecols=(0),delimiter=',',freq = 'H',converters=conv)

but then I get an error which seems to have something to do with the
date converter that was working in the earlier command:

/usr/lib64/python2.5/site-packages/scikits.timeseries-0.91.1-py2.5-linux-x86_64.egg/scikits/timeseries/extras.pyc in tsfromtxt(fname, dtype, freq, comments, delimiter, skiprows, converters, dateconverter, missing, missing_values, usecols, datecols, names, excludelist, deletechars, case_sensitive, unpack, loose, asrecarray)
    433         if len(dateinfo) == 1:
    434             dateinfo = np.array(dateinfo[0], copy=False, ndmin=1)
--> 435             dates = date_array([dateconv(args) for args in dateinfo],
    436                                freq=freq, autosort=False)
    437     else:

UnboundLocalError: local variable 'dateconv' referenced before assignment

What am I doing wrong?
thanks

- dharhas

From eadrogue at gmx.net  Thu Jul 16 10:01:54 2009
From: eadrogue at gmx.net (Ernest Adrogué)
Date: Thu, 16 Jul 2009 16:01:54 +0200
Subject: [SciPy-user] counting pairs of values
Message-ID: <20090716140154.GA25712@doriath.local>

Hi,

I've got a series of observations that consist of pairs of values
(x, y) and I'm interested in counting the number of occurrences of
arbitrary pairs of values.

For example, if "a" is a 2-dimensional array such as

array([[1, 2],
       [3, 4],
       [5, 5],
       [1, 2]])

I want to find out how many (1, 2), or how many (3, 3) there are.

Currently I'm doing this:

numpy.logical_and(a[:,0] == 1, a[:,1] == 2).sum()

which seems to work well enough, but I wonder if there's a more
elegant/correct way of doing it.

Ernest

From sccolbert at gmail.com  Thu Jul 16 10:31:24 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Thu, 16 Jul 2009 10:31:24 -0400
Subject: [SciPy-user] counting pairs of values
In-Reply-To: <20090716140154.GA25712@doriath.local>
References: <20090716140154.GA25712@doriath.local>
Message-ID: <7f014ea60907160731g7d7a4954j1546fde2c7acac42@mail.gmail.com>

I don't know if it's any more elegant than yours though:

>>> a
array([[1, 2],
       [3, 4],
       [5, 5],
       [1, 2]])
>>> len(np.where(a==[1, 2])[0])/2
2
>>> len(np.where(a==[3, 4])[0])/2
1
>>> len(np.where(a==[5, 5])[0])/2
1

2009/7/16 Ernest Adrogué :
> Hi,
>
> I've got a series of observations that consist of pairs of
> values (x, y) and I'm interested in counting the number of
> occurrences of arbitrary pairs of values.
>
> For example, if "a" is a 2-dimensional array such as
>
> array([[1, 2],
>        [3, 4],
>        [5, 5],
>        [1, 2]])
>
> I want to find out how many (1, 2), or how many (3, 3) there
> are.
>
> Currently I'm doing this:
>
> numpy.logical_and(a[:,0] == 1, a[:,1] == 2).sum()
>
> which seems to work well enough, but I wonder if there's a
> more elegant/correct way of doing it.
>
>
> Ernest
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>

From zachary.pincus at yale.edu  Thu Jul 16 10:48:55 2009
From: zachary.pincus at yale.edu (Zachary Pincus)
Date: Thu, 16 Jul 2009 10:48:55 -0400
Subject: [SciPy-user] counting pairs of values
In-Reply-To: <20090716140154.GA25712@doriath.local>
References: <20090716140154.GA25712@doriath.local>
Message-ID: 

Check out the discussion from yesterday on the numpy list "Neighbour-
frequency matrix" for a generalization of this problem. You could also
use numpy.histogram2d for this.
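For instance, a sketch using your example array (the bin edges are chosen
so each integer value 0..5 gets its own bin):

>>> import numpy as np
>>> a = np.array([[1, 2], [3, 4], [5, 5], [1, 2]])
>>> counts, xedges, yedges = np.histogram2d(a[:, 0], a[:, 1], bins=np.arange(7) - 0.5)
>>> counts[1, 2], counts[3, 4]
(2.0, 1.0)

counts[i, j] then holds the number of (i, j) pairs, for every pair at once.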
Zach

On Jul 16, 2009, at 10:01 AM, Ernest Adrogué wrote:

> Hi,
>
> I've got a series of observations that consist of pairs of
> values (x, y) and I'm interested in counting the number of
> occurrences of arbitrary pairs of values.
>
> For example, if "a" is a 2-dimensional array such as
>
> array([[1, 2],
>        [3, 4],
>        [5, 5],
>        [1, 2]])
>
> I want to find out how many (1, 2), or how many (3, 3) there
> are.
>
> Currently I'm doing this:
>
> numpy.logical_and(a[:,0] == 1, a[:,1] == 2).sum()
>
> which seems to work well enough, but I wonder if there's a
> more elegant/correct way of doing it.
>
> Ernest
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user

From pgmdevlist at gmail.com  Thu Jul 16 17:21:22 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Thu, 16 Jul 2009 17:21:22 -0400
Subject: [SciPy-user] scikits.timeseries: unable to stack timeseries with different number of columns
In-Reply-To: <4A5EDC75.63BA.009B.0@twdb.state.tx.us>
References: <4A5EDC75.63BA.009B.0@twdb.state.tx.us>
Message-ID: <3FE87F21-23D5-43E7-A04E-0983DA249567@gmail.com>

On Jul 16, 2009, at 8:53 AM, Dharhas Pothina wrote:
> Hi,
>
> I'm trying to loop through a set of timeseries objects and add each
> timeseries to a single timeseries using ts.stack(). I'm getting the
> following error:
[...]

That's a bug all right. stack was initially intended for 1D series,
not 2D ones. I'll check what I can do.
Meanwhile, you could:
* make sure that your series are compatible: same frequency, same
starting and ending dates, same intervals between the dates
* stack the series using ma.column_stack : that gives you a masked array
* view the output as a TimeSeries, and set its dates to the dates of
the input.

From pgmdevlist at gmail.com  Thu Jul 16 17:27:31 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Thu, 16 Jul 2009 17:27:31 -0400
Subject: [SciPy-user] scikits.timeseries : Problem using tsfromtxt with converter
In-Reply-To: <4A5EDFE70200009B0002007A@GWWEB.twdb.state.tx.us>
References: <4A5EDFE70200009B0002007A@GWWEB.twdb.state.tx.us>
Message-ID: 

On Jul 16, 2009, at 9:08 AM, Dharhas Pothina wrote:
> Hi,
>
> If I read it using
>
> data = ts.tsfromtxt(file,skiprows=1,datecols=(0),delimiter=',',freq
> = 'H')
>
> ...
> I want to remove the 'mm' and read the data as floats. I tried using
> a converter:
>
> conv = {1: lambda s: s.split('m')[0]}
> data = ts.tsfromtxt(file,skiprows=1,datecols=(0),delimiter=',',freq
> = 'H',converters=conv)
>
> but then I get an error which seems to have something to do with the
> date converter that was working in the earlier command:
>
> /usr/lib64/python2.5/site-packages/scikits.timeseries-0.91.1-py2.5-
> linux-x86_64.egg/scikits/timeseries/extras.pyc in tsfromtxt(fname,
> dtype, freq, comments, delimiter, skiprows, converters,
> dateconverter, missing, missing_values, usecols, datecols, names,
> excludelist, deletechars, case_sensitive, unpack, loose, asrecarray)
>     433         if len(dateinfo) == 1:
>     434             dateinfo = np.array(dateinfo[0], copy=False, ndmin=1)
> --> 435             dates = date_array([dateconv(args) for args in
> dateinfo],
>     436                                freq=freq, autosort=False)
>     437     else:
>
> UnboundLocalError: local variable 'dateconv' referenced before
> assignment
>
> What am I doing wrong?

Nothing. It's a bug, sorry about that. I'll try to correct that later
today or tmw at worst.

Thanks for reporting,
Cheers
P.
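In the meantime, a possible workaround (a sketch only, untested against
the file above - it parses the values by hand and builds the series
afterwards; 'fname' stands for your file):

import numpy as np
import scikits.timeseries as ts

lines = open(fname).readlines()[1:]  # skip the header row
values = np.array([float(l.split(',')[1].replace('mm', '')) for l in lines])
# assuming the records are contiguous and hourly, as in the sample
series = ts.time_series(values, start_date=ts.Date('H', '2008-09-01 00:00'))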
From william.ratcliff at gmail.com  Fri Jul 17 03:38:43 2009
From: william.ratcliff at gmail.com (william ratcliff)
Date: Fri, 17 Jul 2009 03:38:43 -0400
Subject: [SciPy-user] [Numpy-discussion] [ANN] Announcing the SciPy conference schedule
In-Reply-To: <20090716021311.GA21642@phare.normalesup.org>
References: <20090716021311.GA21642@phare.normalesup.org>
Message-ID: <827183970907170038jef07198r867b086a284e8b4a@mail.gmail.com>

A humble suggestion--for the March meeting of the American Physical
Society, there is a roommate finder for splitting hotel rooms. This
could be useful in keeping expenses down for some. There should be a way
to do it without liability....

Cheers,
William

On Wed, Jul 15, 2009 at 10:13 PM, Gael Varoquaux <
gael.varoquaux at normalesup.org> wrote:

> The SciPy conference committee is pleased to announce the schedule of the
> conference:
>
> http://conference.scipy.org/schedule
>
> This year's program is very rich. In order to limit the number of
> interesting talks that we had to turn down, we decided to reduce the
> length of talks. Although this results in many short talks, we hope that
> it will foster discussions, and give new ideas. Many subjects are
> covered, both varied technical subjects across the scientific computing
> spectrum, and a lot of different research areas.
>
> I would personally like to thank the members of the program committee,
> who spent time reviewing the proposed abstracts and giving the chairs
> feedback.
>
> Fernando Perez and the tutorial presenters are hard at work finishing
> planning all the details of the two-day tutorial session that will
> precede the conference. An introductory tutorial track and an advanced
> tutorial track, both covering various aspects of scientific computing in
> Python, presented by experts in the field, should help many people
> getting up to speed on the amazing technology driving this community.
>
> The SciPy 2009 program committee
>
> * Co-Chair Gaël Varoquaux, Applied Mathematics and Neuroscience,
>   Neurospin, CEA - INRIA Saclay (France)
> * Co-Chair Stéfan van der Walt, Applied Mathematics, University of
>   Stellenbosch (South Africa)
> * Michael Aivazis, Center for Advanced Computing Research, California
>   Institute of Technology (USA)
> * Brian Granger, Physics Department, California Polytechnic State
>   University, San Luis Obispo (USA)
> * Aric Hagberg, Theoretical Division, Los Alamos National Laboratory (USA)
> * Konrad Hinsen, Centre de Biophysique Moléculaire, CNRS Orléans (France)
> * Randall LeVeque, Mathematics, University of Washington, Seattle (USA)
> * Travis Oliphant, Enthought (USA)
> * Prabhu Ramachandran, Department of Aerospace Engineering, IIT
>   Bombay (India)
> * Raphael Ritz, International Neuroinformatics Coordinating Facility
>   (Sweden)
> * William Stein, Mathematics, University of Washington, Seattle (USA)
>
> Conference Chair: Jarrod Millman, Neuroscience Institute, UC Berkeley
> (USA)
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From amenity at enthought.com  Fri Jul 17 10:52:58 2009
From: amenity at enthought.com (Amenity Applewhite)
Date: Fri, 17 Jul 2009 09:52:58 -0500
Subject: [SciPy-user] Today's Scientific Computing Webinar
Message-ID: <529C3B8C-3BDE-426A-A4F3-8FF66B76E9D6@enthought.com>

Registration for today's Scientific Computing with Python webinar will
be open until the webinar begins at 1:00pm CDT (6pm UTC). Join us!
https://www1.gotomeeting.com/register/158182457

July 17th: Step-by-step Chaco

On Friday, July 17th, we'll be taking a look at Chaco, a component of
our open-source Enthought Tool Suite. It seems that this powerful tool
impressed participants at last month's webinar, as numerous attendees
responded with "Cool!! But how can I do that?" So Peter Wang,
Enthought's Director of Technical Architecture, has offered to pick up
where he left off last month and give a step-by-step introduction to
Chaco.
This will include a closer examination of the code and more in-depth
guidance on how to use it for visualizing your data with 2D plotting.

This event will take place on Friday, July 17th at 1:00pm CDT (6pm UTC)
and will last 60 to 90 minutes, depending on the questions asked. If you
would like to participate, please register at
https://www1.gotomeeting.com/register/158182457 .

Can't get enough scientific computing with Python? This Scientific
Computing with Python series focuses on the free and open-source
Enthought Tool Suite and should not be confused with our EPD webinar
series. The latter is meant to provide customized support for our EPD
Basic or above subscribers, and we initially intended for it to be
closed to the public. Many non-subscribers have expressed interest in
the series, however, so we have decided to allow non-subscribers to add
their name to a waiting list for each EPD webinar. While EPD subscribers
will be guaranteed entry and attendance will be capped at 35, we'll draw
names from the waiting lists to fill available seats. These webinars are
more intimate, with participatory demonstrations and VOIP question and
answer available to attendees, and we think it's a great opportunity for
us to support the broader SciPy community. For details, see
http://www.enthought.com/training/webinars.php .

Thanks, and see you Friday!
The Enthought Team

--
Amenity Applewhite
Enthought, Inc.
Scientific Computing Solutions
www.enthought.com

From robert.kern at gmail.com  Fri Jul 17 11:21:34 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 17 Jul 2009 10:21:34 -0500
Subject: [SciPy-user] [Numpy-discussion] [ANN] Announcing the SciPy conference schedule
In-Reply-To: <827183970907170038jef07198r867b086a284e8b4a@mail.gmail.com>
References: <20090716021311.GA21642@phare.normalesup.org>
	<827183970907170038jef07198r867b086a284e8b4a@mail.gmail.com>
Message-ID: <3d375d730907170821l32139208ob3410bd9c9520c01@mail.gmail.com>

On Fri, Jul 17, 2009 at 02:38, william ratcliff wrote:
> A humble suggestion--for the March meeting of the American Physical
> Society, there is a roommate finder for splitting hotel rooms. This
> could be useful in keeping expenses down for some. There should be a
> way to do it without liability....

A wiki page would probably be the best thing given the short time
frame. I recommend either the Saga or the Vagabond hotels for keeping
costs down and staying close to campus.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco

From Dharhas.Pothina at twdb.state.tx.us  Fri Jul 17 12:18:24 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Fri, 17 Jul 2009 11:18:24 -0500
Subject: [SciPy-user] scikits.timeseries: unable to stack timeseries with different number of columns
In-Reply-To: <3FE87F21-23D5-43E7-A04E-0983DA249567@gmail.com>
References: <4A5EDC75.63BA.009B.0@twdb.state.tx.us>
	<3FE87F21-23D5-43E7-A04E-0983DA249567@gmail.com>
Message-ID: <4A605E00.63BA.009B.0@twdb.state.tx.us>

>>> Pierre GM 7/16/2009 4:21 PM >>>

> Meanwhile, you could:
> * make sure that your series are compatible: same frequency, same
> starting and ending dates, same intervals between the dates
> * stack the series using ma.column_stack : that gives you a masked array
> * view the output as a TimeSeries, and set its dates to the dates of
> the input.
not sure if I completely understood this.

so if I have a_ts and b_ts that are timeseries objects I do

c = ma.column_stack((a_ts,b_ts))

then how do I "view the output as a TimeSeries, and set its dates to
the dates of the input"

- dharhas

From Dharhas.Pothina at twdb.state.tx.us  Fri Jul 17 12:19:17 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Fri, 17 Jul 2009 11:19:17 -0500
Subject: [SciPy-user] scikits.timeseries : Problem using tsfromtxt with converter
In-Reply-To: 
References: <4A5EDFE70200009B0002007A@GWWEB.twdb.state.tx.us>
Message-ID: <4A605E35.63BA.009B.0@twdb.state.tx.us>

> Nothing. It's a bug, sorry about that. I'll try to correct that later
> today or tmw at worst.
>
> Thanks for reporting,

glad to be of help.

- dharhas

From pgmdevlist at gmail.com  Fri Jul 17 12:25:23 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Fri, 17 Jul 2009 12:25:23 -0400
Subject: [SciPy-user] scikits.timeseries: unable to stack timeseries with different number of columns
In-Reply-To: <4A605E00.63BA.009B.0@twdb.state.tx.us>
References: <4A5EDC75.63BA.009B.0@twdb.state.tx.us>
	<3FE87F21-23D5-43E7-A04E-0983DA249567@gmail.com>
	<4A605E00.63BA.009B.0@twdb.state.tx.us>
Message-ID: <57965819-8A49-4048-82F5-406A3647CB54@gmail.com>

On Jul 17, 2009, at 12:18 PM, Dharhas Pothina wrote:
>
>>>> Pierre GM 7/16/2009 4:21 PM >>>
>
>> Meanwhile, you could:
>> * make sure that your series are compatible: same frequency, same
>> starting and ending dates, same intervals between the dates
>> * stack the series using ma.column_stack : that gives you a masked
>> array
>> * view the output as a TimeSeries, and set its dates to the dates of
>> the input.
>
> not sure if I completely understood this.
>
> so if I have a_ts and b_ts that are timeseries objects I do
>
> c = ma.column_stack((a_ts,b_ts))
>
> then how do I "view the output as a TimeSeries, and set its dates to
> the dates of the input"

Provided that a_ts and b_ts are compatible (see previous post):

>>> c_ts = ma.column_stack((a_ts, b_ts))
>>> c_ts = c_ts.view(ts.TimeSeries)
>>> c_ts.dates = a_ts.dates

From millman at berkeley.edu  Fri Jul 17 16:02:51 2009
From: millman at berkeley.edu (Jarrod Millman)
Date: Fri, 17 Jul 2009 13:02:51 -0700
Subject: [SciPy-user] ANN: SciPy 2009 early registration extended to July 22nd
Message-ID: 

The early registration deadline for SciPy 2009 has been extended until
Wednesday, July 22, 2009. Please register
( http://conference.scipy.org/to_register ) by this date to take
advantage of the reduced early registration rate. Since we just
announced the conference schedule, I was asked to provide extra time for
people to register. Fortunately, we were able to get a few extra days
from our vendors. But we will have to place orders next Thursday, so
this is the last time we will be able to extend the deadline for
registration.

The conference schedule is available here:
http://conference.scipy.org/schedule

About the conference
--------------------

SciPy 2009, the 8th Python in Science conference, will be held from
August 18-23, 2009 at Caltech in Pasadena, CA, USA. The conference
starts with two days of tutorials on the scientific Python tools. There
will be two tracks, one introducing the basic tools to beginners, and
one covering more advanced tools. The tutorials will be followed by two
days of talks. Both days of talks will begin with a keynote address.
The first day's keynote will be given by Peter Norvig, the Director of
Research at Google, while the second keynote will be delivered by Jon
Guyer, a Materials Scientist in the Thermodynamics and Kinetics Group at
NIST. The program committee will select the remaining talks from
submissions to our call for papers. All selected talks will be included
in our conference proceedings edited by the program committee. After the
talks each day we will provide several rooms for impromptu birds of a
feather discussions. Finally, the last two days of the conference will
be used for a number of coding sprints on the major software projects in
our community.

For the 8th consecutive year, the conference will bring together the
developers and users of the open source software stack for scientific
computing with Python. Attendees have the opportunity to review the
available tools and how they apply to specific problems. By providing a
forum for developers to share their Python expertise with the wider
commercial, academic, and research communities, this conference fosters
collaboration and facilitates the sharing of software components,
techniques, and a vision for high level language use in scientific
computing.

For further information, please visit the conference homepage:
http://conference.scipy.org.

Important Dates
---------------

* Friday, July 3: Abstracts Due
* Wednesday, July 15: Announce accepted talks, post schedule
* Wednesday, July 22: Early Registration ends
* Tuesday-Wednesday, August 18-19: Tutorials
* Thursday-Friday, August 20-21: Conference
* Saturday-Sunday, August 22-23: Sprints
* Friday, September 4: Papers for proceedings due

Executive Committee
-------------------

* Jarrod Millman, UC Berkeley, USA (Conference Chair)
* Gaël Varoquaux, INRIA Saclay, France (Program Co-Chair)
* Stéfan van der Walt, University of Stellenbosch, South Africa (Program Co-Chair)
* Fernando Pérez, UC Berkeley, USA (Tutorial Chair)

From eadrogue at gmx.net  Sat Jul 18 08:35:44 2009
From: eadrogue at gmx.net (Ernest Adrogué)
Date: Sat, 18 Jul 2009 14:35:44 +0200
Subject: [SciPy-user] optimization using fmin_bfgs with gradient information
Message-ID: <20090718123544.GA3122@doriath.local>

Hi,

I'm using optimize.fmin_bfgs to find the minimum of a function.
It works, but I'd like to speed it up by supplying the gradient
of the function to fmin_bfgs.

Without supplying the gradient I get these results:

Warning: Desired error not necessarily achieveddue to precision loss
         Current function value: 638.939214
         Iterations: 73
         Function evaluations: 4387
         Gradient evaluations: 102

The output of my gradient function evaluated at xopt:

[ -1.26071352e-06   2.22057130e-06   9.10389060e-06  -3.47809758e-06
   5.26179023e-06  -5.90267183e-06  -3.19019368e-06  -7.39985613e-06
  -2.84634204e-06   3.84543574e-07   1.33341847e-06  -1.59029471e-06
   5.13325055e-06  -3.53840419e-06   2.23408274e-06   1.05588332e-05
   1.04574907e-05  -2.46512209e-06  -2.54991167e-07   1.24356893e-06
  -9.28475141e-06  -2.76441219e-07  -2.81902992e-06   7.59715257e-08
  -4.61241275e-07  -1.57030283e-06   4.43909204e-06   6.66069772e-08
  -1.64478684e-06   4.03578664e-06   6.81269187e-07   9.74726616e-06
  -5.92372950e-06   7.85634341e-06  -1.48669281e-06   2.67525449e-07
   3.50545615e-07   1.44128199e-06   2.71466860e-06   4.23270815e-06
  -2.20851113e-05]

The output of check_grad at xopt: 0.000106523006716

So, apparently it looks like my gradient function is correct, doesn't it?
However, when I pass the gradient to fmin_bfgs I get this:

Warning: Desired error not necessarily achieveddue to precision loss
         Current function value: 653.494345
         Iterations: 5
         Function evaluations: 45
         Gradient evaluations: 44

Notice that the minimum is higher now.

My gradient evaluated at xopt is far from zero this time:

[  5.99210031   7.78372931   2.17685535   8.62438169   6.78737246
   4.59089064   6.28766488   3.74376886   5.21582577   1.20448784
   0.26857912   5.17257475   5.32668068   8.14539521   3.21022361
   5.87014267   5.14406772   6.26400519   4.35807008   5.20230664
   1.32962472   6.05407954   2.85062903   5.29204265  10.4366293
   1.78770855  -2.22449411   5.20648252   4.05410094   6.64206808
   2.19202177   5.33385709   5.30404265   3.73158178   4.44347609
   4.38591199   3.12390498   7.01723668   3.93901794   6.31246349
   3.61374379]

And check_grad at xopt says: 34.3464575331

I can't figure out what's going on. From the output of check_grad, it
seems that my gradient function calculates the gradient correctly at one
point and wrongly at another point. Is that correct?

Ernest

From eadrogue at gmx.net  Sat Jul 18 08:36:50 2009
From: eadrogue at gmx.net (Ernest Adrogué)
Date: Sat, 18 Jul 2009 14:36:50 +0200
Subject: [SciPy-user] counting pairs of values
In-Reply-To: 
References: <20090716140154.GA25712@doriath.local>
Message-ID: <20090718123650.GB3122@doriath.local>

16/07/09 @ 10:48 (-0400), thus spake Zachary Pincus:
> Check out the discussion from yesterday on the numpy list "Neighbour-
> frequency matrix" for a generalization of this problem. You could also
> use numpy.histogram2d for this.

Thanks for your ideas, Zachary & Chris.

Ernest

From timmichelsen at gmx-topmail.de  Sat Jul 18 12:03:08 2009
From: timmichelsen at gmx-topmail.de (Tim Michelsen)
Date: Sat, 18 Jul 2009 18:03:08 +0200
Subject: [SciPy-user] Today's Scientific Computing Webinar
In-Reply-To: <529C3B8C-3BDE-426A-A4F3-8FF66B76E9D6@enthought.com>
References: <529C3B8C-3BDE-426A-A4F3-8FF66B76E9D6@enthought.com>
Message-ID: 

Thanks a lot for providing this! A real big thing.

I have issues with the WMV videos on a Linux (Ubuntu) system:

I can see this video correctly:
http://download.enthought.com/media/2009-07Scientific_Computing_with_Python_Webinar.wmv

The earlier ones do not show a video. I can only hear the audio track:
http://download.enthought.com/media/2009-06-19Scientific_Computing_with_Python_Webinar.wmv
http://download.enthought.com/media/2009-05-22_Scientific_Computing_with_Python_Webinar.wmv

I use VLC.

Thanks and regards,
Timmie

From sebastian.walter at gmail.com  Sat Jul 18 12:36:54 2009
From: sebastian.walter at gmail.com (Sebastian Walter)
Date: Sat, 18 Jul 2009 18:36:54 +0200
Subject: [SciPy-user] optimization using fmin_bfgs with gradient information
In-Reply-To: <20090718123544.GA3122@doriath.local>
References: <20090718123544.GA3122@doriath.local>
Message-ID: 

I don't find it so hard to believe that you got your gradient function
wrong. Could you post the code of your objective function?
Maybe you've just got the wrong sign. Near the optimum this would be OK,
but trying to do a descent step away from the optimizer is bound to fail.
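One quick test (a sketch; f and fprime stand for your objective and
gradient functions, whatever they are actually called): evaluate
check_grad at a few random points instead of only at xopt -

import numpy as np
from scipy.optimize import check_grad

for _ in range(5):
    x = xopt + 0.5*np.random.randn(len(xopt))
    print check_grad(f, fprime, x)

A gradient that is consistent only at the minimum will show large errors
away from it.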
Function evaluations: 4387 > ? ? ? ? Gradient evaluations: 102 > > The output of my gradient function evaluated at xopt: > > [ -1.26071352e-06 ? 2.22057130e-06 ? 9.10389060e-06 ?-3.47809758e-06 > ? 5.26179023e-06 ?-5.90267183e-06 ?-3.19019368e-06 ?-7.39985613e-06 > ?-2.84634204e-06 ? 3.84543574e-07 ? 1.33341847e-06 ?-1.59029471e-06 > ? 5.13325055e-06 ?-3.53840419e-06 ? 2.23408274e-06 ? 1.05588332e-05 > ? 1.04574907e-05 ?-2.46512209e-06 ?-2.54991167e-07 ? 1.24356893e-06 > ?-9.28475141e-06 ?-2.76441219e-07 ?-2.81902992e-06 ? 7.59715257e-08 > ?-4.61241275e-07 ?-1.57030283e-06 ? 4.43909204e-06 ? 6.66069772e-08 > ?-1.64478684e-06 ? 4.03578664e-06 ? 6.81269187e-07 ? 9.74726616e-06 > ?-5.92372950e-06 ? 7.85634341e-06 ?-1.48669281e-06 ? 2.67525449e-07 > ? 3.50545615e-07 ? 1.44128199e-06 ? 2.71466860e-06 ? 4.23270815e-06 > ?-2.20851113e-05] > > The output of check_grad at xopt: 0.000106523006716 > > So, apparently it looks like my gradient function is correct, > doesn't it? ?However, when I pass the gradient to fmin_bfgs I get > this: > > Warning: Desired error not necessarily achieveddue to precision loss > ? ? ? ? Current function value: 653.494345 > ? ? ? ? Iterations: 5 > ? ? ? ? Function evaluations: 45 > ? ? ? ? Gradient evaluations: 44 > > Notice that the minimum is higher now. > > My gradient evaluated at xopt is far from zero this time: > > [ ?5.99210031 ? 7.78372931 ? 2.17685535 ? 8.62438169 ? 6.78737246 > ? 4.59089064 ? 6.28766488 ? 3.74376886 ? 5.21582577 ? 1.20448784 > ? 0.26857912 ? 5.17257475 ? 5.32668068 ? 8.14539521 ? 3.21022361 > ? 5.87014267 ? 5.14406772 ? 6.26400519 ? 4.35807008 ? 5.20230664 > ? 1.32962472 ? 6.05407954 ? 2.85062903 ? 5.29204265 ?10.4366293 > ? 1.78770855 ?-2.22449411 ? 5.20648252 ? 4.05410094 ? 6.64206808 > ? 2.19202177 ? 5.33385709 ? 5.30404265 ? 3.73158178 ? 4.44347609 > ? 4.38591199 ? 3.12390498 ? 7.01723668 ? 3.93901794 ? 6.31246349 > ? 3.61374379] > > And check_grad at xopt says: 34.3464575331 > > I can't figure out what's going on. From the output of > check_grad, it seems that my gradient function calculates the > gradient rightly at one point and wrongly at another point. > Is that correct? > > Ernest > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From sccolbert at gmail.com Sat Jul 18 12:43:02 2009 From: sccolbert at gmail.com (Chris Colbert) Date: Sat, 18 Jul 2009 12:43:02 -0400 Subject: [SciPy-user] non-linear multi-variate optimization In-Reply-To: References: <7f014ea60907141700w2f73f8e1yca076104054122ea@mail.gmail.com> Message-ID: <7f014ea60907180943h18b27318k4a5c5feb9bf8e4e4@mail.gmail.com> I'm not quite understanding what you're saying. According to this: http://en.wikipedia.org/wiki/Gradient the Gradient and Jacobian are one in the same. what would be the form of the gradient in my case? Thanks for the help! Chris On Wed, Jul 15, 2009 at 3:32 AM, Sebastian Walter wrote: > The gradient g is only defined for functions f: R^N --> R and is > simply an array with shape (N,) > > what you sketched in your post is the Jacobian J of a function f: R^N --> R^M > Typically, the Jacobian J is defined to have the shape (M,N), but > there are exceptions. > > hope that helps a little > Sebastian > > > > On Wed, Jul 15, 2009 at 2:00 AM, Chris Colbert wrote: >> The routines for non-linear optimization in scipy.optimize take an >> argument for a function that computes the gradient. >> >> What should be the format of return value of this function? 
I am >> assuming that its the gradient of the functions with repect to the >> independent variables in row vector format. >> >> for example say we have: >> >> f(x,y,z; a1, a2, a3) where a1, a2, and a3 are the independent variables. >> >> Should the gradient of N x,y,z points then be of the form: >> >> df/da ?= ? [[df(X0)/da1, df(X0)/da2, df(X0)/da3], >> ? ? ? ? ? ? ? [df(X1)/da1, df(X1)/da2, df(X1)/da3], >> ? ? ? ? ? ? ? .... >> ? ? ? ? ? ? ? [df(Xn)/da1, df(Xn)/da2, df(Xn)/da3]] >> >> where Xn is the set of (xn, yn, zn) ? >> >> cheers! >> >> Chris >> _______________________________________________ >> SciPy-user mailing list >> SciPy-user at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From sebastian.walter at gmail.com Sat Jul 18 12:53:26 2009 From: sebastian.walter at gmail.com (Sebastian Walter) Date: Sat, 18 Jul 2009 18:53:26 +0200 Subject: [SciPy-user] non-linear multi-variate optimization In-Reply-To: <7f014ea60907180943h18b27318k4a5c5feb9bf8e4e4@mail.gmail.com> References: <7f014ea60907141700w2f73f8e1yca076104054122ea@mail.gmail.com> <7f014ea60907180943h18b27318k4a5c5feb9bf8e4e4@mail.gmail.com> Message-ID: What I've been trying to say is: The function you want to optimize must be of the form f: R^N -> R x -> y = f(x) i.e. mapping to the real numbers. then the derivative f'(x) is in R^N i.e. numpy.shape(f'(x)) == (N,) If you could be more precise on which optimization function you are trying to use and what your objective function looks like it would be easier to help. Sebastian On Sat, Jul 18, 2009 at 6:43 PM, Chris Colbert wrote: > I'm not quite understanding what you're saying. According to this: > http://en.wikipedia.org/wiki/Gradient > the Gradient and Jacobian are one in the same. > > what would be the form of the gradient in my case? > > Thanks for the help! > > Chris > > On Wed, Jul 15, 2009 at 3:32 AM, Sebastian > Walter wrote: >> The gradient g is only defined for functions f: R^N --> R and is >> simply an array with shape (N,) >> >> what you sketched in your post is the Jacobian J of a function f: R^N --> R^M >> Typically, the Jacobian J is defined to have the shape (M,N), but >> there are exceptions. >> >> hope that helps a little >> Sebastian >> >> >> >> On Wed, Jul 15, 2009 at 2:00 AM, Chris Colbert wrote: >>> The routines for non-linear optimization in scipy.optimize take an >>> argument for a function that computes the gradient. >>> >>> What should be the format of return value of this function? I am >>> assuming that its the gradient of the functions with repect to the >>> independent variables in row vector format. >>> >>> for example say we have: >>> >>> f(x,y,z; a1, a2, a3) where a1, a2, and a3 are the independent variables. >>> >>> Should the gradient of N x,y,z points then be of the form: >>> >>> df/da ?= ? [[df(X0)/da1, df(X0)/da2, df(X0)/da3], >>> ? ? ? ? ? ? ? [df(X1)/da1, df(X1)/da2, df(X1)/da3], >>> ? ? ? ? ? ? ? .... >>> ? ? ? ? ? ? ? [df(Xn)/da1, df(Xn)/da2, df(Xn)/da3]] >>> >>> where Xn is the set of (xn, yn, zn) ? >>> >>> cheers! 
>>> >>> Chris >>> _______________________________________________ >>> SciPy-user mailing list >>> SciPy-user at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-user >>> >> _______________________________________________ >> SciPy-user mailing list >> SciPy-user at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From sccolbert at gmail.com Sat Jul 18 13:08:00 2009 From: sccolbert at gmail.com (Chris Colbert) Date: Sat, 18 Jul 2009 13:08:00 -0400 Subject: [SciPy-user] non-linear multi-variate optimization In-Reply-To: References: <7f014ea60907141700w2f73f8e1yca076104054122ea@mail.gmail.com> <7f014ea60907180943h18b27318k4a5c5feb9bf8e4e4@mail.gmail.com> Message-ID: <7f014ea60907181008v244ac55epaa423a463a625dec@mail.gmail.com> I'm using fmin_l_bfgs_b here's the code of my objective function, its the equation of a superquadric with 11 free variables (a1, a2, a3, e1, e2, px, py, pz, phi, theta, tsai) the variables (xw, yw, zw) are length N vectors representing the world coordinates to which I'm fitting the superquadric. (i know i spelled psi wrong, i need to change it :) ) def superQuadricFit((a1, a2, a3, e1, e2, phi, theta, tsai, px, py, pz), *args): a1 = float(a1) a2 = float(a2) a3 = float(a3) e1 = float(e1) e2 = float(e2) phi = float(phi) theta = float(theta) tsai = float(tsai) px = float(px) py = float(py) pz = float(pz) xw = args[0] yw = args[1] zw = args[2] cphi = math.cos(phi) ctheta = math.cos(theta) ctsai = math.cos(tsai) sphi = math.sin(phi) stheta = math.sin(theta) stsai = math.sin(tsai) nx = cphi * ctheta * ctsai - sphi * stsai ny = sphi * ctheta * ctsai + cphi * stsai nz = -stheta * ctsai ox = -cphi * ctheta * stsai - sphi * ctsai oy = -sphi * ctheta * stsai + cphi * ctsai oz = stheta * stsai ax = cphi * stheta ay = sphi * stheta az = ctheta f1 = ((nx * xw + ny * yw + nz * zw - px * nx - py * ny - pz * nz) / a1) f2 = ((ox * xw + oy * yw + oz * zw - px * ox - py * oy - pz * oz) / a2) f3 = ((ax * xw + ay * yw + az * zw - px * ax - py * ay - pz * az) / a3) F = ((f1**2)**(1/e2) + (f2**2)**(1/e2))**(e2/e1) + (f3**2)**(1/e1) err = (math.sqrt(a1 * a2 * a3) * (F**(e1) - 1))**2 sumerr = err.sum() print err return sumerr So I would think the gradient should express the steepness of the function wrt the 11 variables, and that steepness will be different at every point (xw, yw, zw)_i . I cant see how to give any useful gradient information in a single N-length vector. Chris On Sat, Jul 18, 2009 at 12:53 PM, Sebastian Walter wrote: > What I've been trying to say is: > > The function you want to optimize must be of the form > > f: R^N -> R > ? ? ?x ?-> y = f(x) > > i.e. mapping to the real numbers. > > then the derivative f'(x) is in R^N > i.e. numpy.shape(f'(x)) == (N,) > > > If you could be more precise on which optimization function you are > trying to use and what your objective function looks like it would be > easier to help. > > > Sebastian > > > > > > > > > On Sat, Jul 18, 2009 at 6:43 PM, Chris Colbert wrote: >> I'm not quite understanding what you're saying. According to this: >> http://en.wikipedia.org/wiki/Gradient >> the Gradient and Jacobian are one in the same. >> >> what would be the form of the gradient in my case? >> >> Thanks for the help! 
>> >> Chris >> >> On Wed, Jul 15, 2009 at 3:32 AM, Sebastian >> Walter wrote: >>> The gradient g is only defined for functions f: R^N --> R and is >>> simply an array with shape (N,) >>> >>> what you sketched in your post is the Jacobian J of a function f: R^N --> R^M >>> Typically, the Jacobian J is defined to have the shape (M,N), but >>> there are exceptions. >>> >>> hope that helps a little >>> Sebastian >>> >>> >>> >>> On Wed, Jul 15, 2009 at 2:00 AM, Chris Colbert wrote: >>>> The routines for non-linear optimization in scipy.optimize take an >>>> argument for a function that computes the gradient. >>>> >>>> What should be the format of return value of this function? I am >>>> assuming that its the gradient of the functions with repect to the >>>> independent variables in row vector format. >>>> >>>> for example say we have: >>>> >>>> f(x,y,z; a1, a2, a3) where a1, a2, and a3 are the independent variables. >>>> >>>> Should the gradient of N x,y,z points then be of the form: >>>> >>>> df/da ?= ? [[df(X0)/da1, df(X0)/da2, df(X0)/da3], >>>> ? ? ? ? ? ? ? [df(X1)/da1, df(X1)/da2, df(X1)/da3], >>>> ? ? ? ? ? ? ? .... >>>> ? ? ? ? ? ? ? [df(Xn)/da1, df(Xn)/da2, df(Xn)/da3]] >>>> >>>> where Xn is the set of (xn, yn, zn) ? >>>> >>>> cheers! >>>> >>>> Chris >>>> _______________________________________________ >>>> SciPy-user mailing list >>>> SciPy-user at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/scipy-user >>>> >>> _______________________________________________ >>> SciPy-user mailing list >>> SciPy-user at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-user >>> >> _______________________________________________ >> SciPy-user mailing list >> SciPy-user at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From gokhansever at gmail.com Sat Jul 18 13:35:45 2009 From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_SEVER?=) Date: Sat, 18 Jul 2009 12:35:45 -0500 Subject: [SciPy-user] Today's Scientific Computing Webinar In-Reply-To: References: <529C3B8C-3BDE-426A-A4F3-8FF66B76E9D6@enthought.com> Message-ID: <49d6b3500907181035i34b97943i10a3e221ba9c8329@mail.gmail.com> On Sat, Jul 18, 2009 at 11:03 AM, Tim Michelsen wrote: > Thanks a lot for providing this! A real big thing. > > I have issues with the WMV videos on a Linux (Ubuntu) system: > > I can see the video correctly: > > http://download.enthought.com/media/2009-07Scientific_Computing_with_Python_Webinar.wmv > > The earlier ones do not show a video. I can only hear the audio track: > > http://download.enthought.com/media/2009-06-19Scientific_Computing_with_Python_Webinar.wmv > > http://download.enthought.com/media/2009-05-22_Scientific_Computing_with_Python_Webinar.wmv > > I use VLC. > > Thanks and regards, > Timmie > > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > For some reasons some wmv's don't work on my Fedora 10 either. Neither with VLC nor with mplayer. Seems to me an encoding issue. One technique might be to convert wmv's to avi's using mencoder to fix this viewing problem. However, the very last recorded wmv works pretty nicely with mplayer. 
This said, there are cases where video was working in VLC but not audio, and the reverse situation with mplayer :)

In the case where I can't find any solution, I just switch to Windows and use KMPlayer. It can play almost every codec without any problem.

--
Gökhan

From robert.kern at gmail.com  Sat Jul 18 14:02:40 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sat, 18 Jul 2009 13:02:40 -0500
Subject: [SciPy-user] non-linear multi-variate optimization
In-Reply-To: <7f014ea60907181008v244ac55epaa423a463a625dec@mail.gmail.com>
References: <7f014ea60907141700w2f73f8e1yca076104054122ea@mail.gmail.com> <7f014ea60907180943h18b27318k4a5c5feb9bf8e4e4@mail.gmail.com> <7f014ea60907181008v244ac55epaa423a463a625dec@mail.gmail.com>
Message-ID: <3d375d730907181102g5779c122v700703dbf4a8cf8d@mail.gmail.com>

On Sat, Jul 18, 2009 at 12:08, Chris Colbert wrote:
> I'm using fmin_l_bfgs_b
>
> here's the code of my objective function; it's the equation of a
> superquadric with 11 free variables (a1, a2, a3, e1, e2, px, py, pz,
> phi, theta, tsai). the variables (xw, yw, zw) are length N vectors
> representing the world coordinates to which I'm fitting the
> superquadric.  (i know i spelled psi wrong, i need to change it :) )
>
> def superQuadricFit((a1, a2, a3, e1, e2, phi, theta, tsai, px, py, pz),
>                     *args):
>
>     a1 = float(a1)
>     a2 = float(a2)
>     a3 = float(a3)
>     e1 = float(e1)
>     e2 = float(e2)
>     phi = float(phi)
>     theta = float(theta)
>     tsai = float(tsai)
>     px = float(px)
>     py = float(py)
>     pz = float(pz)
>
>     xw = args[0]
>     yw = args[1]
>     zw = args[2]
>
>     cphi = math.cos(phi)
>     ctheta = math.cos(theta)
>     ctsai = math.cos(tsai)
>     sphi = math.sin(phi)
>     stheta = math.sin(theta)
>     stsai = math.sin(tsai)
>
>     nx = cphi * ctheta * ctsai - sphi * stsai
>     ny = sphi * ctheta * ctsai + cphi * stsai
>     nz = -stheta * ctsai
>     ox = -cphi * ctheta * stsai - sphi * ctsai
>     oy = -sphi * ctheta * stsai + cphi * ctsai
>     oz = stheta * stsai
>     ax = cphi * stheta
>     ay = sphi * stheta
>     az = ctheta
>
>     f1 = ((nx * xw + ny * yw + nz * zw - px * nx - py * ny - pz * nz) / a1)
>     f2 = ((ox * xw + oy * yw + oz * zw - px * ox - py * oy - pz * oz) / a2)
>     f3 = ((ax * xw + ay * yw + az * zw - px * ax - py * ay - pz * az) / a3)
>
>     F = ((f1**2)**(1/e2) + (f2**2)**(1/e2))**(e2/e1) + (f3**2)**(1/e1)
>
>     err = (math.sqrt(a1 * a2 * a3) * (F**(e1) - 1))**2
>
>     sumerr = err.sum()
>
>     print err
>
>     return sumerr
>
> So I would think the gradient should express the steepness of the
> function wrt the 11 variables, and that steepness will be different at
> every point (xw, yw, zw)_i. I can't see how to give any useful
> gradient information in a single N-length vector.

The gradient that fmin_l_bfgs_b needs is not of F, but of sumerr.
sumerr is a single scalar that is a function of the 11 free variables
and thus has an 11-vector as its gradient.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco
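To make the shape requirement concrete, here is a minimal sketch of the calling convention (illustrative only: the toy quadratic objective, the 11-element parameter vector, and the use of approx_fprime as a cross-check are assumptions for the example, not code from this thread):

import numpy as np
from scipy.optimize import fmin_l_bfgs_b, approx_fprime

target = np.arange(11.0)  # toy data; stands in for the fitted quantities

def objective(p):
    # scalar objective: one number summarizing the fit over all data points
    return float(np.sum((p - target) ** 2))

def gradient(p):
    # gradient of the *scalar* objective w.r.t. the 11 free parameters,
    # NOT one row per data point: its shape is (11,)
    return 2.0 * (p - target)

x0 = np.zeros(11)
# cross-check the analytic gradient against a finite-difference estimate
err = np.abs(gradient(x0) - approx_fprime(x0, objective, 1e-8)).max()
print(err)
xopt, fopt, info = fmin_l_bfgs_b(objective, x0, fprime=gradient)

If no fprime is supplied, the same call would instead pass approx_grad=True so that fmin_l_bfgs_b estimates the gradient internally.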
From sccolbert at gmail.com  Sat Jul 18 14:10:49 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Sat, 18 Jul 2009 14:10:49 -0400
Subject: [SciPy-user] non-linear multi-variate optimization
In-Reply-To: <3d375d730907181102g5779c122v700703dbf4a8cf8d@mail.gmail.com>
References: <7f014ea60907141700w2f73f8e1yca076104054122ea@mail.gmail.com> <7f014ea60907180943h18b27318k4a5c5feb9bf8e4e4@mail.gmail.com> <7f014ea60907181008v244ac55epaa423a463a625dec@mail.gmail.com> <3d375d730907181102g5779c122v700703dbf4a8cf8d@mail.gmail.com>
Message-ID: <7f014ea60907181110s4e987012jed23fc69613c96c7@mail.gmail.com>

AH!, ok I see now, makes perfect sense. Thanks guys for the help!

On Sat, Jul 18, 2009 at 2:02 PM, Robert Kern wrote:
> [...]
>
> The gradient that fmin_l_bfgs_b needs is not of F, but of sumerr.
> sumerr is a single scalar that is a function of the 11 free variables
> and thus has an 11-vector as its gradient.
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>   -- Umberto Eco

From jkington at wisc.edu  Sat Jul 18 14:11:04 2009
From: jkington at wisc.edu (Joe Kington)
Date: Sat, 18 Jul 2009 13:11:04 -0500
Subject: [SciPy-user] Today's Scientific Computing Webinar
In-Reply-To: <49d6b3500907181035i34b97943i10a3e221ba9c8329@mail.gmail.com>
References: <529C3B8C-3BDE-426A-A4F3-8FF66B76E9D6@enthought.com> <49d6b3500907181035i34b97943i10a3e221ba9c8329@mail.gmail.com>
Message-ID:

For what it's worth, they work for me on openSUSE with mplayer. I've got ffmpeg installed, which is apparently what it's using...

> File: 2009-06-19Scientific_Computing_with_Python_Webinar.wmv
>
> Opening video decoder: [ffmpeg] FFmpeg's libavcodec codec family
> [wmv3 @ 0x88c6470]Extra data: 8 bits left, value: 0
> Selected video codec: [ffwmv3] vfm: ffmpeg (FFmpeg WMV3/WMV9)

Hope that helps in some way...
-Joe

On Sat, Jul 18, 2009 at 12:35 PM, Gökhan SEVER wrote:
>
> On Sat, Jul 18, 2009 at 11:03 AM, Tim Michelsen <timmichelsen at gmx-topmail.de> wrote:
>
>> Thanks a lot for providing this! A real big thing.
>>
>> I have issues with the WMV videos on a Linux (Ubuntu) system:
>>
>> I can see the video correctly:
>> http://download.enthought.com/media/2009-07Scientific_Computing_with_Python_Webinar.wmv
>>
>> The earlier ones do not show a video. I can only hear the audio track:
>> http://download.enthought.com/media/2009-06-19Scientific_Computing_with_Python_Webinar.wmv
>> http://download.enthought.com/media/2009-05-22_Scientific_Computing_with_Python_Webinar.wmv
>>
>> I use VLC.
>>
>> Thanks and regards,
>> Timmie
>
> For some reasons some wmv's don't work on my Fedora 10 either. Neither with
> VLC nor with mplayer. Seems to me an encoding issue. One technique might be
> to convert wmv's to avi's using mencoder to fix this viewing problem.
>
> However, the very last recorded wmv works pretty nicely with mplayer. This
> said, there are cases where video was working in VLC but not audio, and the
> reverse situation with mplayer :)
>
> In the case where I can't find any solution, I just switch to Windows and
> use KMPlayer. It can play almost every codec without any problem.
>
> --
> Gökhan

From oliphant at enthought.com  Sat Jul 18 14:17:14 2009
From: oliphant at enthought.com (Travis Oliphant)
Date: Sat, 18 Jul 2009 13:17:14 -0500
Subject: [SciPy-user] Today's Scientific Computing Webinar
In-Reply-To:
References: <529C3B8C-3BDE-426A-A4F3-8FF66B76E9D6@enthought.com>
Message-ID: <81861419-8B2F-4029-9697-3CB591C8940E@enthought.com>

On Jul 18, 2009, at 11:03 AM, Tim Michelsen wrote:
> Thanks a lot for providing this! A real big thing.
>
> I have issues with the WMV videos on a Linux (Ubuntu) system:
>
> I can see the video correctly:
> http://download.enthought.com/media/2009-07Scientific_Computing_with_Python_Webinar.wmv
>
> The earlier ones do not show a video.
> I can only hear the audio track:
> http://download.enthought.com/media/2009-06-19Scientific_Computing_with_Python_Webinar.wmv
> http://download.enthought.com/media/2009-05-22_Scientific_Computing_with_Python_Webinar.wmv
>
> I use VLC.

Thank you for the feedback. We definitely have some work to do to improve the archived video delivery.

Best regards,

-Travis

--
Travis Oliphant
Enthought Inc.
1-512-536-1057
http://www.enthought.com
oliphant at enthought.com

From Omer.Khalid at cern.ch  Sat Jul 18 16:15:53 2009
From: Omer.Khalid at cern.ch (Omer Khalid)
Date: Sat, 18 Jul 2009 22:15:53 +0200
Subject: [SciPy-user] Unable to build SciPy on OS X 10.5.7
Message-ID: <77e5896b0907181315k4823b83ch8bf020979d23cbff@mail.gmail.com>

Hi,

I am trying to install SciPy following these instructions: http://www.scipy.org/Download

I already have installed Gfortran and installed numpy. But when I try to build scipy, I constantly get build errors on OS X Leopard 10.5.7:

dyld: lazy symbol binding failed: Symbol not found: _iconv_open
  Referenced from: /usr/lib/libaprutil-1.0.dylib
  Expected in: /opt/local/lib/libiconv.2.dylib

dyld: Symbol not found: _iconv_open
  Referenced from: /usr/lib/libaprutil-1.0.dylib
  Expected in: /opt/local/lib/libiconv.2.dylib

Traceback (most recent call last):
  File "setup.py", line 82, in <module>
    FULLVERSION += svn_version()
  File "setup.py", line 74, in svn_version
    raise ValueError("Error while parsing svn version ?")
ValueError: Error while parsing svn version ?

Thanks for your help!
Omer

**************************************
CERN, Engineering Department,
CH-1211 Geneva 23, Switzerland

Email: Omer.Khalid at cern.ch
**************************************

From pgmdevlist at gmail.com  Sat Jul 18 21:31:45 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Sat, 18 Jul 2009 21:31:45 -0400
Subject: [SciPy-user] scikits.timeseries : Problem using tsfromtxt with converter
In-Reply-To: <4A605E35.63BA.009B.0@twdb.state.tx.us>
References: <4A5EDFE70200009B0002007A@GWWEB.twdb.state.tx.us> <4A605E35.63BA.009B.0@twdb.state.tx.us>
Message-ID: <3332DCA3-FA01-4FD2-82B0-67B02EAB3840@gmail.com>

On Jul 17, 2009, at 12:19 PM, Dharhas Pothina wrote:
>
>> Nothing. It's a bug, sorry about that. I'll try to correct that later
>> today or tmw at worst.
>>
>> Thanks for reporting,
>
> glad to be of help.

Should be fixed in r2199. Would you mind giving it a try? And sorry for the delay.

From dwf at cs.toronto.edu  Sat Jul 18 21:56:13 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Sat, 18 Jul 2009 21:56:13 -0400
Subject: [SciPy-user] Unable to build SciPy on OS X 10.5.7
In-Reply-To: <77e5896b0907181315k4823b83ch8bf020979d23cbff@mail.gmail.com>
References: <77e5896b0907181315k4823b83ch8bf020979d23cbff@mail.gmail.com>
Message-ID: <6AA9E7C1-F1B6-4A67-B550-FDB15A1799E7@cs.toronto.edu>

The problem is that you have a MacPorts install of libiconv which is different from the system iconv in /usr/lib, and evidently the linker is preferring the MacPorts one over the system one. You could try 'unset DYLD_LIBRARY_PATH' before you build to make it ignore MacPorts' version.

David

On 18-Jul-09, at 4:15 PM, Omer Khalid wrote:
> Hi,
>
> I am trying to install SciPy following these instructions: http://www.scipy.org/Download
>
> I already have installed Gfortran and installed numpy.
> But when I try to build scipy, I constantly get build errors
> on OS X Leopard 10.5.7:
>
> dyld: lazy symbol binding failed: Symbol not found: _iconv_open
>   Referenced from: /usr/lib/libaprutil-1.0.dylib
>   Expected in: /opt/local/lib/libiconv.2.dylib
>
> dyld: Symbol not found: _iconv_open
>   Referenced from: /usr/lib/libaprutil-1.0.dylib
>   Expected in: /opt/local/lib/libiconv.2.dylib
>
> Traceback (most recent call last):
>   File "setup.py", line 82, in <module>
>     FULLVERSION += svn_version()
>   File "setup.py", line 74, in svn_version
>     raise ValueError("Error while parsing svn version ?")
> ValueError: Error while parsing svn version ?
>
> Thanks for your help!
> Omer

From eadrogue at gmx.net  Sun Jul 19 07:05:43 2009
From: eadrogue at gmx.net (Ernest =?iso-8859-1?Q?Adrogu=E9?=)
Date: Sun, 19 Jul 2009 13:05:43 +0200
Subject: [SciPy-user] optimization using fmin_bfgs with gradient information
In-Reply-To:
References: <20090718123544.GA3122@doriath.local>
Message-ID: <20090719110543.GA7138@doriath.local>

Hi,
18/07/09 @ 18:36 (+0200), thus spake Sebastian Walter:
> I don't find it so hard to believe that you got your gradient function wrong.
> Could you post the code of your objective function?

Here it goes:

def fobj(self, x):

    # x = (alpha1...alphan, beta0..betan, gamma, rho)

    n = self.n

    # use absolute values for alphas and betas
    y = [abs(i) for i in x[:-2]]

    # alpha0 = n - sum(alpha1...alphan)
    y.insert(0, abs(n-sum(y[:n-1])))

    alpha = dict(zip(self.names, y[:n]))
    beta = dict(zip(self.names, y[n:]))

    gamma = abs(x[-2])
    rho = x[-1]

    pseudo_likelihood = 0

    for obs in self.observations:

        mu1 = alpha[obs.ht] * beta[obs.at] * gamma
        mu2 = alpha[obs.at] * beta[obs.ht]
        tau = self.tau(mu1, mu2, rho, obs.hg, obs.ag)

        # avoid log(0)
        mu1 = mu1 > 0 and mu1 or 1e-10
        mu2 = mu2 > 0 and mu2 or 1e-10
        tau = tau > 0 and tau or 1e-10

        pseudo_likelihood += math.log(tau)
        pseudo_likelihood += obs.hg * math.log(mu1) - mu1
        pseudo_likelihood += obs.ag * math.log(mu2) - mu2

    return -pseudo_likelihood

> Maybe you've just got the wrong sign. Near the optimum this would be OK,
> but trying to take a descent step away from the optimum would fail.

Yes, it must be the gradient that is wrong. It occurs to me
that it could be related to the fact that I'm changing the value
of tau() in the objective function when tau is < 0, and I don't think
tau_prime() used in the gradient reflects this.

Thanks.

Ernest

From eadrogue at gmx.net  Sun Jul 19 07:20:09 2009
From: eadrogue at gmx.net (Ernest =?iso-8859-1?Q?Adrogu=E9?=)
Date: Sun, 19 Jul 2009 13:20:09 +0200
Subject: [SciPy-user] Today's Scientific Computing Webinar
In-Reply-To:
References: <529C3B8C-3BDE-426A-A4F3-8FF66B76E9D6@enthought.com>
Message-ID: <20090719112009.GB7138@doriath.local>

18/07/09 @ 18:03 (+0200), thus spake Tim Michelsen:
> I have issues with the WMV videos on a Linux (Ubuntu) system:
>
> I can see the video correctly:
> http://download.enthought.com/media/2009-07Scientific_Computing_with_Python_Webinar.wmv
>
> The earlier ones do not show a video.
> I can only hear the audio track:
> http://download.enthought.com/media/2009-06-19Scientific_Computing_with_Python_Webinar.wmv
> http://download.enthought.com/media/2009-05-22_Scientific_Computing_with_Python_Webinar.wmv

I can play the first two, but the last one shows no video. mplayer says:

Requested video codec family [gotomeeting] (vfm=dmo) not available.
Enable it at compilation.

and vlc:

[00000456] main decoder error: no suitable decoder module for fourcc
`G2M3'. VLC probably does not support this sound or video format.

This is on a Debian system.

Ernest

From yosefmel at post.tau.ac.il  Sun Jul 19 08:08:55 2009
From: yosefmel at post.tau.ac.il (Yosef Meller)
Date: Sun, 19 Jul 2009 15:08:55 +0300
Subject: [SciPy-user] Assignment of oversize values.
Message-ID: <200907191508.55617.yosefmel@post.tau.ac.il>

Hello all,

I found today that I can assign to an array some larger array, resulting in only the first values being assigned:

In [1]: from numpy import *

In [2]: t = r_[1, 2, 3]

In [3]: tt = array([[1, 1, 1], [2, 2, 2]])

In [4]: tt
Out[4]:
array([[1, 1, 1],
       [2, 2, 2]])

In [5]: t
Out[5]: array([1, 2, 3])

In [9]: t[t > 0] = tt

In [10]: t
Out[10]: array([1, 1, 1])

I expected this to raise an exception about a shape error or something, but it did not.

The NumPy documentation [1] doesn't mention this case, so I just wanted to ask if that's the intended behaviour.

Thanks,
Yosef.

From sebastian.walter at gmail.com  Sun Jul 19 11:41:44 2009
From: sebastian.walter at gmail.com (Sebastian Walter)
Date: Sun, 19 Jul 2009 17:41:44 +0200
Subject: [SciPy-user] optimization using fmin_bfgs with gradient information
In-Reply-To: <20090719110543.GA7138@doriath.local>
References: <20090718123544.GA3122@doriath.local> <20090719110543.GA7138@doriath.local>
Message-ID:

Hello,
I would like to add your objective function as part of the unit test
for one of the automatic differentiation tools I've been developing:
PYADOLC (download at http://github.com/b45ch1/pyadolc).

Unfortunately, the description of the objective function is
incomplete: there are some references to self.n and so on in the code.
Would it be possible for you to post the full description? That would
be great :)

Sebastian

2009/7/19 Ernest Adrogué:
> Hi,
> 18/07/09 @ 18:36 (+0200), thus spake Sebastian Walter:
>> I don't find it so hard to believe that you got your gradient function wrong.
>> Could you post the code of your objective function?
>
> Here it goes:
>
> def fobj(self, x):
>
>     # x = (alpha1...alphan, beta0..betan, gamma, rho)
>
>     n = self.n
>
>     # use absolute values for alphas and betas
>     y = [abs(i) for i in x[:-2]]
>
>     # alpha0 = n - sum(alpha1...alphan)
>     y.insert(0, abs(n-sum(y[:n-1])))
>
>     alpha = dict(zip(self.names, y[:n]))
>     beta = dict(zip(self.names, y[n:]))
>
>     gamma = abs(x[-2])
>     rho = x[-1]
>
>     pseudo_likelihood = 0
>
>     for obs in self.observations:
>
>         mu1 = alpha[obs.ht] * beta[obs.at] * gamma
>         mu2 = alpha[obs.at] * beta[obs.ht]
>         tau = self.tau(mu1, mu2, rho, obs.hg, obs.ag)
>
>         # avoid log(0)
>         mu1 = mu1 > 0 and mu1 or 1e-10
>         mu2 = mu2 > 0 and mu2 or 1e-10
>         tau = tau > 0 and tau or 1e-10
>
>         pseudo_likelihood += math.log(tau)
>         pseudo_likelihood += obs.hg * math.log(mu1) - mu1
>         pseudo_likelihood += obs.ag * math.log(mu2) - mu2
>
>     return -pseudo_likelihood
>
>> Maybe you've just got the wrong sign. Near the optimum this would be OK,
>> but trying to take a descent step away from the optimum would fail.
>
> Yes, it must be the gradient that is wrong. It occurs to me
> that it could be related to the fact that I'm changing the value
> of tau() in the objective function when tau is < 0, and I don't think
> tau_prime() used in the gradient reflects this.
>
> Thanks.
>
> Ernest

From eadrogue at gmx.net  Sun Jul 19 14:39:23 2009
From: eadrogue at gmx.net (Ernest =?iso-8859-1?Q?Adrogu=E9?=)
Date: Sun, 19 Jul 2009 20:39:23 +0200
Subject: [SciPy-user] optimization using fmin_bfgs with gradient information
In-Reply-To:
References: <20090718123544.GA3122@doriath.local> <20090719110543.GA7138@doriath.local>
Message-ID: <20090719183923.GA21105@doriath.local>

19/07/09 @ 17:41 (+0200), thus spake Sebastian Walter:
> Hello,
> I would like to add your objective function as part of the unit test
> for one of the automatic differentiation tools I've been developing:
> PYADOLC (download at http://github.com/b45ch1/pyadolc).
>
> Unfortunately, the description of the objective function is
> incomplete: there are some references to self.n and so on in the code.
> Would it be possible for you to post the full description? That would
> be great :)

Yes, of course. You can use this version that works independently. The original one feeds on data from a dataset, whereas this one uses randomly-generated data. Example usage:

In [6]: m = Model(random_obs())

In [7]: x0 = m.guess()

In [8]: x0
Out[8]:
[1.3414634146341464,
 1.4634146341463414,
 1.6341463414634145,
 1.5609756097560976,
 1.4878048780487805,
 1.8048780487804879,
 ........
 1.3902439024390243,
 1.6097560975609757,
 1.5,
 0]

In [9]: m.fobj(x0)
Out[9]: 1281.027571347247

------------------------------
import random
import math

class Observation(object):
    def __init__(self, data):
        self.ht = data[0]
        self.at = data[1]
        self.hg = data[2]
        self.ag = data[3]

def random_obs():
    names = 'abcdefghijklmnopqrstuvwxyz'
    random_data = [(i,j,random.randint(0,5),random.randint(0,5))
                   for i in names for j in names if i != j]
    return [Observation(i) for i in random_data]

class Model(object):
    def __init__(self, observations):
        self.observations = tuple(observations)
        self.names = [i.ht for i in observations]
        self.names.extend([i.at for i in observations])
        self.names = tuple(set(self.names))
        self.n = len(self.names)
    def guess(self):
        a, b = [], []
        total_hg = sum([i.hg for i in self.observations])
        for j in self.names:
            sh_j = sum([i.hg for i in self.observations if i.ht == j])
            ca_j = sum([i.hg for i in self.observations if i.at == j])
            a.append(sh_j/math.sqrt(total_hg))
            b.append(ca_j/math.sqrt(total_hg))
        return a + b + [1.5, 0]
    def tau(self, mu1, mu2, rho, x, y):
        if x == 0 and y == 0:
            t = 1 - mu1 * mu2 * rho
        elif x == 0 and y == 1:
            t = 1 + mu1 * rho
        elif x == 1 and y == 0:
            t = 1 + mu2 * rho
        elif x == 1 and y == 1:
            t = 1 - rho
        else:
            t = 1
        return t
    def fobj(self, x):
        n = self.n
        y = [abs(i) for i in x[:-2]]
        y.insert(0, abs(n - sum(y[:n-1])))
        a = dict(zip(self.names, y[:n]))
        b = dict(zip(self.names, y[n:]))
        g = abs(x[-2])
        r = x[-1]
        pseudo_loglikelihood = 0
        for m in self.observations:
            x = m.hg
            y = m.ag
            mu1 = a[m.ht] * b[m.at] * g
            mu2 = a[m.at] * b[m.ht]
            tau = self.tau(mu1, mu2, r, m.hg, m.ag)
            mu1 = mu1 > 0 and mu1 or 1e-10
            mu2 = mu2 > 0 and mu2 or 1e-10
            tau = tau > 0 and tau or 1e-10
            pseudo_loglikelihood += math.log(tau)
            pseudo_loglikelihood += m.hg * math.log(mu1) - mu1
            pseudo_loglikelihood += m.ag * math.log(mu2) - mu2
        return -pseudo_loglikelihood
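A minimal way to sanity-check a hand-written gradient for a model like this (a sketch under assumptions: `grad` below is a hypothetical analytic gradient function, not part of the code above; check_grad and approx_fprime come from scipy.optimize):

import numpy as np
from scipy.optimize import approx_fprime, check_grad

m = Model(random_obs())        # Model and random_obs as defined above
x0 = np.asarray(m.guess())

# finite-difference estimate of the gradient of the posted objective
fd = approx_fprime(x0, m.fobj, 1e-6)
print(fd[:5])

# with a hypothetical analytic gradient grad(x), test at several points,
# not only at the optimum, since a gradient can look right at one point
# and still be wrong elsewhere:
# for _ in range(5):
#     x = x0 + 0.1 * np.random.randn(x0.size)
#     print(check_grad(m.fobj, grad, x))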
From dwf at cs.toronto.edu  Mon Jul 20 00:30:18 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Mon, 20 Jul 2009 00:30:18 -0400
Subject: [SciPy-user] Assignment of oversize values.
In-Reply-To: <200907191508.55617.yosefmel@post.tau.ac.il>
References: <200907191508.55617.yosefmel@post.tau.ac.il>
Message-ID: <38E67D7F-B411-43CD-9296-42D18E17614C@cs.toronto.edu>

On 19-Jul-09, at 8:08 AM, Yosef Meller wrote:
> I expected this to raise an exception about a shape error or
> something, but it did not.
>
> The NumPy documentation [1] doesn't mention this case, so I just
> wanted to ask if that's the intended behaviour.

I'm pretty sure it is intended behaviour; this is perfectly consistent
with the way sequence modification works even with normal Python lists.

In [13]: X = range(10)

In [14]: Y = range(10, 20)

In [15]: X[:5] = Y

In [16]: X
Out[16]: [10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 5, 6, 7, 8, 9]

Similarly, if you try to zip(a, b) on two things of different lengths,
you get back a sequence of tuples that is length min(len(a), len(b)) -
the extras are discarded.

Regards,
David

From yosefmel at post.tau.ac.il  Mon Jul 20 02:28:21 2009
From: yosefmel at post.tau.ac.il (Yosef Meller)
Date: Mon, 20 Jul 2009 09:28:21 +0300
Subject: [SciPy-user] Assignment of oversize values.
In-Reply-To: <38E67D7F-B411-43CD-9296-42D18E17614C@cs.toronto.edu>
References: <200907191508.55617.yosefmel@post.tau.ac.il> <38E67D7F-B411-43CD-9296-42D18E17614C@cs.toronto.edu>
Message-ID: <200907200928.21401.yosefmel@post.tau.ac.il>

On Monday 20 July 2009 07:30:18 David Warde-Farley wrote:
> On 19-Jul-09, at 8:08 AM, Yosef Meller wrote:
>> I expected this to raise an exception about a shape error or
>> something, but it did not.
>>
>> The NumPy documentation [1] doesn't mention this case, so I just
>> wanted to ask if that's the intended behaviour.
>
> I'm pretty sure it is intended behaviour; this is perfectly consistent
> with the way sequence modification works even with normal Python lists.

Good to know. Thanks.

From sebastian.walter at gmail.com  Mon Jul 20 04:43:32 2009
From: sebastian.walter at gmail.com (Sebastian Walter)
Date: Mon, 20 Jul 2009 10:43:32 +0200
Subject: [SciPy-user] optimization using fmin_bfgs with gradient information
In-Reply-To: <20090719183923.GA21105@doriath.local>
References: <20090718123544.GA3122@doriath.local> <20090719110543.GA7138@doriath.local> <20090719183923.GA21105@doriath.local>
Message-ID:

Hi,
thanks for the function :). It is now part of pyadolc's unit test.
http://github.com/b45ch1/pyadolc/blob/3cfff80c062d43e812379f4606eda0ebaaf4e82c/tests/complicated_tests.py, Line 246

I added two functions: one provides the gradient by finite differences
and the other by automatic differentiation. The finite-differences
gradient has very poor accuracy; only the first three digits are correct.

Sebastian

2009/7/19 Ernest Adrogué:
> 19/07/09 @ 17:41 (+0200), thus spake Sebastian Walter:
>> Hello,
>> I would like to add your objective function as part of the unit test
>> for one of the automatic differentiation tools I've been developing:
>> PYADOLC (download at http://github.com/b45ch1/pyadolc).
>>
>> Unfortunately, the description of the objective function is
>> incomplete: there are some references to self.n and so on in the code.
>> Would it be possible for you to post the full description? That would
>> be great :)
>
> Yes, of course.
> You can use this version that works independently.
The original > one feeds on data from a dataset, where as this one uses randomly-generated > data. Example usage: > > In [6]: m = Model(random_obs()) > > In [7]: x0 = m.guess() > > In [8]: x0 > Out[8]: > [1.3414634146341464, > 1.4634146341463414, > 1.6341463414634145, > 1.5609756097560976, > 1.4878048780487805, > 1.8048780487804879, > ........ > 1.3902439024390243, > 1.6097560975609757, > 1.5, > 0] > > In [9]: m.fobj(x0) > Out[9]: 1281.027571347247 > > > > ------------------------------ > import random > import math > > class Observation(object): > def __init__(self, data): > self.ht = data[0] > self.at = data[1] > self.hg = data[2] > self.ag = data[3] > > def random_obs(): > names = 'abcdefghijklmnopqrstuvwxyz' > random_data = [(i,j,random.randint(0,5),random.randint(0,5)) > for i in names for j in names if i != j] > return [Observation(i) for i in random_data] > > class Model(object): > def __init__(self, observations): > self.observations = tuple(observations) > self.names = [i.ht for i in observations] > self.names.extend([i.at for i in observations]) > self.names = tuple(set(self.names)) > self.n = len(self.names) > def guess(self): > a, b = [], [] > total_hg = sum([i.hg for i in self.observations]) > for j in self.names: > sh_j = sum([i.hg for i in self.observations if i.ht == j]) > ca_j = sum([i.hg for i in self.observations if i.at == j]) > a.append(sh_j/math.sqrt(total_hg)) > b.append(ca_j/math.sqrt(total_hg)) > return a + b + [1.5, 0] > def tau(self, mu1, mu2, rho, x, y): > if x == 0 and y == 0: > t = 1 - mu1 * mu2 * rho > elif x == 0 and y == 1: > t = 1 + mu1 * rho > elif x == 1 and y == 0: > t = 1 + mu2 * rho > elif x == 1 and y == 1: > t = 1 - rho > else: > t = 1 > return t > def fobj(self, x): > n = self.n > y = [abs(i) for i in x[:-2]] > y.insert(0, abs(n - sum(y[:n-1]))) > a = dict(zip(self.names, y[:n])) > b = dict(zip(self.names, y[n:])) > g = abs(x[-2]) > r = x[-1] > pseudo_loglikelihood = 0 > for m in self.observations: > x = m.hg > y = m.ag > mu1 = a[m.ht] * b[m.at] * g > mu2 = a[m.at] * b[m.ht] > tau = self.tau(mu1, mu2, r, m.hg, m.ag) > mu1 = mu1 > 0 and mu1 or 1e-10 > mu2 = mu2 > 0 and mu2 or 1e-10 > tau = tau > 0 and tau or 1e-10 > pseudo_loglikelihood += math.log(tau) > pseudo_loglikelihood += m.hg * math.log(mu1) - mu1 > pseudo_loglikelihood += m.ag * math.log(mu2) - mu2 > return -pseudo_loglikelihood > > > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From Dharhas.Pothina at twdb.state.tx.us Mon Jul 20 09:20:27 2009 From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina) Date: Mon, 20 Jul 2009 08:20:27 -0500 Subject: [SciPy-user] scikits.timeseries: unable to stack timeserieswithdifferent number of columns In-Reply-To: <57965819-8A49-4048-82F5-406A3647CB54@gmail.com> References: <4A5EDC75.63BA.009B.0@twdb.state.tx.us> <3FE87F21-23D5-43E7-A04E-0983DA249567@gmail.com> <4A605E00.63BA.009B.0@twdb.state.tx.us> <57965819-8A49-4048-82F5-406A3647CB54@gmail.com> Message-ID: <4A6428CB.63BA.009B.0@twdb.state.tx.us> thanks. I'll try that. 
- dharhas

>>> Pierre GM 7/17/2009 11:25 AM >>>

On Jul 17, 2009, at 12:18 PM, Dharhas Pothina wrote:
>
>>>> Pierre GM 7/16/2009 4:21 PM >>>
>
>> Meanwhile, you could:
>> * make sure that your series are compatible: same frequency, same
>>   starting and ending dates, same intervals between the dates
>> * stack the series using ma.column_stack: that gives you a masked
>>   array
>> * view the output as a TimeSeries, and set its dates to the dates of
>>   the input.
>
> not sure if I completely understood this.
>
> so if I have a_ts and b_ts that are timeseries objects I do
>
> c = ma.stack_column((a_ts,b_ts))
>
> then how do I "view the output as a TimeSeries, and set its dates to
> the dates of the input"

Provided that a_ts and b_ts are compatible (see previous post):

>>> c_ts = ma.column_stack((a_ts, b_ts))
>>> c_ts = c_ts.view(ts.TimeSeries)
>>> c_ts.dates = a_ts.dates

From Omer.Khalid at cern.ch  Mon Jul 20 11:52:16 2009
From: Omer.Khalid at cern.ch (Omer Khalid)
Date: Mon, 20 Jul 2009 17:52:16 +0200
Subject: [SciPy-user] CDF/PDF Stats with SciPy
Message-ID: <77e5896b0907200852o71083dbh92d1cbb055416e6a@mail.gmail.com>

Hi Everybody,

I am new to Python and new to the SciPy libraries. I wanted to take some cues from the experts here on the list before diving into the SciPy world.

I was wondering if someone could provide a rough guide about how to run two stats functions: the Cumulative Distribution Function (CDF) and the Probability Density Function (PDF).

My use case is the following: I have a sampleSpaceList [] which has 1000 floating point values.
When a new floating point value is generated in my program, I would like to run both the CDF and the PDF on the sampleList for it, and get the probability of a value less than or equal to it (CDF) and the probability density (PDF).

Many thanks in advance!

Omer

From ivo.maljevic at gmail.com  Mon Jul 20 12:07:10 2009
From: ivo.maljevic at gmail.com (Ivo Maljevic)
Date: Mon, 20 Jul 2009 12:07:10 -0400
Subject: [SciPy-user] CDF/PDF Stats with SciPy
In-Reply-To: <77e5896b0907200852o71083dbh92d1cbb055416e6a@mail.gmail.com>
References: <77e5896b0907200852o71083dbh92d1cbb055416e6a@mail.gmail.com>
Message-ID: <826c64da0907200907m2775caa5jfe53ed515a456e5@mail.gmail.com>

Hi Omer,

For a histogram you can use the histogram function from numpy/scipy:

from scipy import *

x = ....  # some vector
h, bins = histogram(x, 50, normed=True, new=True)  # find the histogram, number of bins = 50

or you can use pylab's version (good for plotting):

import matplotlib.pyplot as plt

count, bins, ignored = plt.hist(x, 50, normed=True)

For the CDF you can use the cumsum function (standard approach), but for a smaller number of data points I prefer to use all the points, which is a neat trick:

from scipy import *

def my_cdf(x):
    bins = sort(x)
    cdf = linspace(0, 1, len(bins))
    return [bins, cdf]

2009/7/20 Omer Khalid
> Hi Everybody,
>
> I am new to Python and new to the SciPy libraries. I wanted to take some
> cues from the experts here on the list before diving into the SciPy world.
>
> I was wondering if someone could provide a rough guide about how to run
> two stats functions: the Cumulative Distribution Function (CDF) and the
> Probability Density Function (PDF).
>
> My use case is the following: I have a sampleSpaceList [] which has 1000
> floating point values.
> URL: > http://mail.scipy.org/pipermail/scipy-user/attachments/20090720/bb4aebe5/attachment-0001.html > > ------------------------------ > > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > > End of SciPy-user Digest, Vol 71, Issue 32 > ****************************************** > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Mon Jul 20 16:16:54 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 20 Jul 2009 15:16:54 -0500 Subject: [SciPy-user] CDF/PDF Stats with SciPy In-Reply-To: <77e5896b0907201311w28107eaaje4e4dcf88401e6b5@mail.gmail.com> References: <77e5896b0907201311w28107eaaje4e4dcf88401e6b5@mail.gmail.com> Message-ID: <3d375d730907201316x1732fbe7y7c66d697389c4625@mail.gmail.com> On Mon, Jul 20, 2009 at 15:11, Omer Khalid wrote: > Hi Ivo, > > Thanks for your reply. But I am getting a little confused here now. It seems > there are multiple ways to get the CDF for a distribution. You mean linspace > function returns a CDF for a normal distribution. No, not at all. linspace() returns uniformly increasing numbers between the given endpoints. Ivo showed you how to find the empirical CDF of the dataset. It is not related to any particular family of probability distributions. > As far as I understood from other sources is that scipy.stats.norm.cdf > (mean, std) will return the CDF for the normal distribution or for > non-normal distribution given one replace *norm* with the distributions > name. Yes. > And what about scipy.stats..fit function? It will find the maximum likelihood parameters for fitting the given distribution to your dataset. These parameters can be then used the the object to return the fitted CDF, PDF, etc. The .fit() method is not very flexible, though. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From ivo.maljevic at gmail.com Mon Jul 20 16:22:05 2009 From: ivo.maljevic at gmail.com (Ivo Maljevic) Date: Mon, 20 Jul 2009 16:22:05 -0400 Subject: [SciPy-user] CDF/PDF Stats with SciPy In-Reply-To: <77e5896b0907201311w28107eaaje4e4dcf88401e6b5@mail.gmail.com> References: <77e5896b0907201311w28107eaaje4e4dcf88401e6b5@mail.gmail.com> Message-ID: <826c64da0907201322s290bef52x4ec362c4e510a8ca@mail.gmail.com> Hi Omer, The function you are mentioning, scipy.stats.norm.cdf (mean, std), will return you the CDF for the normally distributed r.v. simply by using the well known formula, but I typically don't know the distribution of the data I'm processing, and I am interested at finding the discrete r.v. distribution(s) from the actual data set. That's where histogram can be used to represent the PDF and cumsum(histogram) or my "special" function for CDF come into place. I never used scipy.stats..fit function, so I cannot help you with that. 
The bottom line is, are you interested in: a) determening the distribution from the actual data without bothering to know the exact formula and drawing conclusions (that is find moments, probabilities,etc) from it (that is what I normally do) b) try to determine what distribution your data fits the best (i.e., is it normal, ricean, rayleigh, nakagammi, etc) c) you just want to play with scipy for the purpose of learning and plot various distributions using scipy.stats..cdf Ivo 2009/7/20 Omer Khalid > Hi Ivo, > > Thanks for your reply. But I am getting a little confused here now. It > seems there are multiple ways to get the CDF for a distribution. You mean > linspace function returns a CDF for a normal distribution. > > As far as I understood from other sources is that scipy.stats.norm.cdf > (mean, std) will return the CDF for the normal distribution or for > non-normal distribution given one replace *norm* with the distributions > name. > > And what about scipy.stats..fit function? > Thanks, > Omer > >> >> >> Message: 7 >> Date: Mon, 20 Jul 2009 12:07:10 -0400 >> From: Ivo Maljevic >> Subject: Re: [SciPy-user] CDF/PDF Stats with SciPy >> To: SciPy Users List >> Message-ID: >> <826c64da0907200907m2775caa5jfe53ed515a456e5 at mail.gmail.com> >> Content-Type: text/plain; charset="iso-8859-1" >> >> >> Hi Omer, >> >> For histogram you can either use histogram function from numpy/scipy: >> >> from scipy import * >> >> x = .... # some vector >> h,bins = histogram(x,50,normed=True,new=True) # find the histogram, number >> of bins = 50 >> >> or you can use pylab's version (good for plotting): >> >> import matplotlib.pyplot as plt >> >> count, bins, ignored = plt.hist(x, 50, normed=True) >> >> For CDF you can use cumsum function (standard approach), but for smaller >> number of data points I >> prefer to use all the points, which is a neat trick: >> >> >> from scipy import * >> >> def my_cdf(x): >> >> bins = sort(x) >> cdf = linspace(0,1,len(bins)) >> return [bins, cdf] >> >> >> 2009/7/20 Omer Khalid >> >> > Hi Everybody, >> > >> > I am new to Python and new to SciPy libraries. I wanted to take some >> ques >> > from the experts here on the list before dive into SciPy world. >> > >> > I was wondering if some one could provide a rough guide about how to run >> > two stats functions: Cumulative Distribution Function (CDF) and >> Probability >> > Distribution Function (PDF). >> > >> > My use case is the following: I have a sampleSpaceList [] which have >> 1000 >> > floating point values. When a new floating point value is generated in >> my >> > program, I would like to run both CDF and PDF on the sampleList for it >> and >> > get the probabilty of value less or equal for CDF and probablity >> > distribution for PDF. >> > >> > Many thanks in advance! >> > >> > Omer >> > >> > >> > _______________________________________________ >> > SciPy-user mailing list >> > SciPy-user at scipy.org >> > http://mail.scipy.org/mailman/listinfo/scipy-user >> > >> > >> -------------- next part -------------- >> An HTML attachment was scrubbed... 
>> URL: >> http://mail.scipy.org/pipermail/scipy-user/attachments/20090720/bb4aebe5/attachment-0001.html >> >> ------------------------------ >> >> _______________________________________________ >> SciPy-user mailing list >> SciPy-user at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> >> >> End of SciPy-user Digest, Vol 71, Issue 32 >> ****************************************** >> > > > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ivo.maljevic at gmail.com Mon Jul 20 16:25:14 2009 From: ivo.maljevic at gmail.com (Ivo Maljevic) Date: Mon, 20 Jul 2009 16:25:14 -0400 Subject: [SciPy-user] CDF/PDF Stats with SciPy In-Reply-To: <3d375d730907201316x1732fbe7y7c66d697389c4625@mail.gmail.com> References: <77e5896b0907201311w28107eaaje4e4dcf88401e6b5@mail.gmail.com> <3d375d730907201316x1732fbe7y7c66d697389c4625@mail.gmail.com> Message-ID: <826c64da0907201325h4def7868n29fb128de66b10c7@mail.gmail.com> Thanks Robert, I wasn't paying attention to that part of Omer's question. 2009/7/20 Robert Kern > On Mon, Jul 20, 2009 at 15:11, Omer Khalid wrote: > > Hi Ivo, > > > > Thanks for your reply. But I am getting a little confused here now. It > seems > > there are multiple ways to get the CDF for a distribution. You mean > linspace > > function returns a CDF for a normal distribution. > > No, not at all. linspace() returns uniformly increasing numbers > between the given endpoints. Ivo showed you how to find the empirical > CDF of the dataset. It is not related to any particular family of > probability distributions. > > > As far as I understood from other sources is that scipy.stats.norm.cdf > > (mean, std) will return the CDF for the normal distribution or for > > non-normal distribution given one replace *norm* with the distributions > > name. > > Yes. > > > And what about scipy.stats..fit function? > > It will find the maximum likelihood parameters for fitting the given > distribution to your dataset. These parameters can be then used the > the object to return the fitted CDF, PDF, etc. The .fit() > method is not very flexible, though. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... 
From karl.young at ucsf.edu  Mon Jul 20 16:18:33 2009
From: karl.young at ucsf.edu (Karl Young)
Date: Mon, 20 Jul 2009 13:18:33 -0700
Subject: [SciPy-user] build error for scipy-0.7.1 on CentOS 5
In-Reply-To: <4A588FF0.9050601@lpta.in2p3.fr>
References: <785491.66970.qm@web34405.mail.mud.yahoo.com>
	<4A588FF0.9050601@lpta.in2p3.fr>
Message-ID: <4A64D119.2010105@ucsf.edu>

I get the following error when trying to build scipy-0.7.1 on 64 bit
CentOS 5 (Linux 2.6.18-128.2.1.el5 #1 SMP x86_64 GNU/Linux):

----------------------------------------------------------------------------------------------

gcc -pthread -shared
build/temp.linux-x86_64-2.5/build/src.linux-x86_64-2.5/scipy/sparse/linalg/dsolve/umfpack/_umfpack_wrap.o
-L/usr/local/lib -Lbuild/temp.linux-x86_64-2.5 -lumfpack -lamd
-lptf77blas -lptcblas -latlas -o
build/lib.linux-x86_64-2.5/scipy/sparse/linalg/dsolve/umfpack/__umfpack.so
/usr/bin/ld: /usr/local/lib/libumfpack.a(umfpack_di_col_to_triplet.o):
relocation R_X86_64_32 against `a local symbol' can not be used when
making a shared object; recompile with -fPIC
/usr/local/lib/libumfpack.a: could not read symbols: Bad value
collect2: ld returned 1 exit status

----------------------------------------------------------------------------------------------

I built UMFPACK (version 5.4.0) as per David's instructions on
http://www.scipy.org/Installing_SciPy/Linux
Specifically I checked that:

CC = gcc
CFLAGS = -O3 -fexceptions -m64 -fPIC
F77 = gfortran
F77FLAGS = -O -m64 -fPIC

were set in UFconfig/UFconfig.mk.

Has anybody seen this problem and know what to do about it?

Thanks,

-- KY

From Omer.Khalid at cern.ch  Mon Jul 20 16:36:12 2009
From: Omer.Khalid at cern.ch (Omer Khalid)
Date: Mon, 20 Jul 2009 22:36:12 +0200
Subject: [SciPy-user] CDF/PDF Stats with SciPy
In-Reply-To: <826c64da0907201322s290bef52x4ec362c4e510a8ca@mail.gmail.com>
References: <77e5896b0907201311w28107eaaje4e4dcf88401e6b5@mail.gmail.com>
	<826c64da0907201322s290bef52x4ec362c4e510a8ca@mail.gmail.com>
Message-ID: <77e5896b0907201336ld9aebd5i2a1e9424cb1c31ac@mail.gmail.com>

Hi Ivo,

> The bottom line is, are you interested in:
>
> a) determining the distribution from the actual data without bothering to
> know the exact formula and drawing conclusions (that is, find moments,
> probabilities, etc.) from it (that is what I normally do)

Yes, I am interested in this.

> b) try to determine what distribution your data fits the best (i.e., is it
> normal, Ricean, Rayleigh, Nakagami, etc.)

This is partially true.

I think I should have explained more of my research question. My program is
generating a real number variate X for every success. I keep on storing X
for each success cycle of my program, and once the sample list reaches size
1000, I would like to use that sample space to determine the probability
for every next X, and again store them until the next sample space reaches
1000.

I am not really concerned with the distribution type of my sample space, so
I thought (maybe out of ignorance) that I first must determine the
distribution type using the fit function and then get the mean/std. Once I
have the mean/std, I get the CDF probability for every next X and store it
in my sample list, replacing the previous ones.

Basically, I want to get a probability for every X in my program cycle till
the next sample space reaches 1000, and keep on doing it. This way I am
assuming my algorithm will learn to improve.

But I could not figure out the proper python code yet for this....
Thanks,
Omer
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From Omer.Khalid at cern.ch  Mon Jul 20 16:39:06 2009
From: Omer.Khalid at cern.ch (Omer Khalid)
Date: Mon, 20 Jul 2009 22:39:06 +0200
Subject: [SciPy-user] CDF/PDF Stats with SciPy
In-Reply-To: <3d375d730907201316x1732fbe7y7c66d697389c4625@mail.gmail.com>
References: <77e5896b0907201311w28107eaaje4e4dcf88401e6b5@mail.gmail.com>
	<3d375d730907201316x1732fbe7y7c66d697389c4625@mail.gmail.com>
Message-ID: <77e5896b0907201339g367af6e5lc086b4d79fc3c053@mail.gmail.com>

Hi Robert,

Thanks for your reply.

>
> >
> > Thanks for your reply. But I am getting a little confused here now. It
> > seems there are multiple ways to get the CDF for a distribution. You
> > mean the linspace function returns a CDF for a normal distribution.
>
> No, not at all. linspace() returns uniformly increasing numbers
> between the given endpoints. Ivo showed you how to find the empirical
> CDF of the dataset. It is not related to any particular family of
> probability distributions.

Sorry for my ignorance, but what is the empirical CDF, and how is that
different from a CDF, let's say, for a Normal or Poisson distribution,
other than that it's calculated differently?

>
> > As far as I understood from other sources is that scipy.stats.norm.cdf
> > (mean, std) will return the CDF for the normal distribution, or for a
> > non-normal distribution given one replaces *norm* with the
> > distribution's name.
>
> Yes.
>

OK, so that I understood correctly :-)

>
> > And what about the scipy.stats.<dist>.fit function?
>
> It will find the maximum likelihood parameters for fitting the given
> distribution to your dataset. These parameters can then be used with
> the <dist> object to return the fitted CDF, PDF, etc. The .fit()
> method is not very flexible, though.

Sorry, what do you mean that .fit() is not very flexible? And can you
explain with some python code, as then it would be easier for me to
understand.

Thanks,
Omer
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From robert.kern at gmail.com  Mon Jul 20 16:45:36 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Mon, 20 Jul 2009 15:45:36 -0500
Subject: [SciPy-user] CDF/PDF Stats with SciPy
In-Reply-To: <77e5896b0907201339g367af6e5lc086b4d79fc3c053@mail.gmail.com>
References: <77e5896b0907201311w28107eaaje4e4dcf88401e6b5@mail.gmail.com>
	<3d375d730907201316x1732fbe7y7c66d697389c4625@mail.gmail.com>
	<77e5896b0907201339g367af6e5lc086b4d79fc3c053@mail.gmail.com>
Message-ID: <3d375d730907201345w6410e89bg7a6fc740a2ea2a35@mail.gmail.com>

On Mon, Jul 20, 2009 at 15:39, Omer Khalid wrote:
> Hi Robert,
>
> Thanks for your reply.
>
>> >
>> > Thanks for your reply. But I am getting a little confused here now. It
>> > seems there are multiple ways to get the CDF for a distribution. You
>> > mean the linspace function returns a CDF for a normal distribution.
>>
>> No, not at all. linspace() returns uniformly increasing numbers
>> between the given endpoints. Ivo showed you how to find the empirical
>> CDF of the dataset. It is not related to any particular family of
>> probability distributions.
>
> Sorry for my ignorance, but what is the empirical CDF, and how is that
> different from a CDF, let's say, for a Normal or Poisson distribution,
> other than that it's calculated differently?

It is an estimate of the CDF given just the data, without trying to
fit the data to a particular model.
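To make the distinction concrete, a small sketch (assuming numpy and
scipy.stats, with a synthetic sample standing in for the real data):

    import numpy as np
    from scipy import stats

    x = np.random.normal(0.0, 1.0, 200)   # synthetic sample

    # Empirical CDF: sort the data and pair each point with its quantile.
    bins = np.sort(x)
    ecdf = np.linspace(0.0, 1.0, len(bins))

    # Empirical P(X <= x_new): fraction of the sample at or below x_new.
    x_new = 0.5
    p = bins.searchsorted(x_new, side='right') / float(len(bins))

    # A model CDF, by contrast, first commits to a family (here normal)
    # and then evaluates the fitted curve at the same points.
    loc, scale = stats.norm.fit(x)
    model_cdf = stats.norm.cdf(bins, loc=loc, scale=scale)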
>> > As far as I understood from other sources is that scipy.stats.norm.cdf
>> > (mean, std) will return the CDF for the normal distribution, or for a
>> > non-normal distribution given one replaces *norm* with the
>> > distribution's name.
>>
>> Yes.
>
> OK, so that I understood correctly :-)
>
>>
>> > And what about the scipy.stats.<dist>.fit function?
>>
>> It will find the maximum likelihood parameters for fitting the given
>> distribution to your dataset. These parameters can then be used with
>> the <dist> object to return the fitted CDF, PDF, etc. The .fit()
>> method is not very flexible, though.
>
> Sorry, what do you mean that .fit() is not very flexible?

It fits all of the parameters for the distribution object. It does not
let you hold certain parameters fixed. This can be a problem because
some of the distribution objects have nonstandard parameters. For
example, all of the distribution objects have a "location" parameter,
but a number of distributions like gamma don't normally have such a
parameter. If you wanted to fit a gamma distribution, you would usually
want to hold loc=0 fixed.

> And can you explain with some python code, as then it would be easier
> for me to understand.

I do not have time right now.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco

From ivo.maljevic at gmail.com  Mon Jul 20 16:51:21 2009
From: ivo.maljevic at gmail.com (Ivo Maljevic)
Date: Mon, 20 Jul 2009 16:51:21 -0400
Subject: [SciPy-user] CDF/PDF Stats with SciPy
In-Reply-To: <77e5896b0907201336ld9aebd5i2a1e9424cb1c31ac@mail.gmail.com>
References: <77e5896b0907201311w28107eaaje4e4dcf88401e6b5@mail.gmail.com>
	<826c64da0907201322s290bef52x4ec362c4e510a8ca@mail.gmail.com>
	<77e5896b0907201336ld9aebd5i2a1e9424cb1c31ac@mail.gmail.com>
Message-ID: <826c64da0907201351w774883b5xc465a99bb2e15093@mail.gmail.com>

I am not sure I quite understand what you are doing (the first criterion
is the success of an experiment, and the second criterion is based on
statistics of the first test?), but regardless of what you are doing, you
can apply the my_cdf() function I gave you to get the discrete CDF (or
you can get the cumulative CDF as 1-CDF). To elaborate a little more on
why I prefer this CDF approach: quite often, r.v.'s have long tails, and
they tend to disappear when you do numerical integration (cumsum is the
most basic approach) on the estimated pdf (which is the histogram). When
you use all the available data points (instead of just 50 or so), you get
much better results.

Once you find the CDF, you should be able to get your probabilities
directly by reading off the plot values or by finding which Y-axis value
(which is the probability) matches whatever bin you are interested in (on
the X-axis).

I don't know if this helps. If not, and if you have some real data, maybe
I can write you some more code.

Ivo

2009/7/20 Omer Khalid

> Hi Ivo,
>
>
>> The bottom line is, are you interested in:
>>
>> a) determining the distribution from the actual data without bothering to
>> know the exact formula and drawing conclusions (that is, find moments,
>> probabilities, etc.) from it (that is what I normally do)
>
>
> Yes, I am interested in this.
>
>
>
>> b) try to determine what distribution your data fits the best (i.e., is
>> it normal, Ricean, Rayleigh, Nakagami, etc.)
>
>
> This is partially true.
>
> I think I should have explained more of my research question.
My program is > generating a real number variate X for every success. I keep on storing X > for each success cycle of my program and once the sample list is size 1000; > then I would like to use that sample space to determine the probability for > every next X and again store it until the sample space reaches 1000. > > I am not really concerned with the distribution type of my sample space, so > i thought (may be out of ignorance) that I first must determine the > distribution type using the fit function and then get the mean/std. Once I > have mean/std, then i get CDF probability for every next X and store it my > sample list replace the previous once. > > Basically, I want to get a probability for every X in my program cycle till > the next sample space reaches 1000, and keeps on doing it. This way I am > assuming my algorithm will learn to improve. > > But I could not figure out the proper python code yet for this.... > > Thanks, > Omer > > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From karl.young at ucsf.edu Mon Jul 20 16:55:43 2009 From: karl.young at ucsf.edu (Karl Young) Date: Mon, 20 Jul 2009 13:55:43 -0700 Subject: [SciPy-user] build error for scipy-0.7.1 on CentOS 5 - oops sorry In-Reply-To: <4A64D119.2010105@ucsf.edu> References: <785491.66970.qm@web34405.mail.mud.yahoo.com> <4A588FF0.9050601@lpta.in2p3.fr> <4A64D119.2010105@ucsf.edu> Message-ID: <4A64D9CF.70204@ucsf.edu> Sorry for the noise - I found the problem re. this question; it's that I didn't notice that UFconfig/UFconfig.mk had multiple places for specifying CFLAGS and there was one below the one I modified which was uncommented and overrode the one I set containing -fPIC. > I get the following error when trying to build scipy-0.7.1 on 64 bit > CentOS 5 (Linux 2.6.18-128.2.1.el5 #1 SMP x86_64 GNU/Linux): > > ---------------------------------------------------------------------------------------------- > > gcc -pthread -shared > build/temp.linux-x86_64-2.5/build/src.linux-x86_64-2.5/scipy/sparse/linalg/dsolve/umfpack/_umfpack_wrap.o > -L/usr/local/lib -Lbuild/temp.linux-x86_64-2.5 -lumfpack -lamd > -lptf77blas -lptcblas -latlas -o > build/lib.linux-x86_64-2.5/scipy/sparse/linalg/dsolve/umfpack/__umfpack.so > /usr/bin/ld: /usr/local/lib/libumfpack.a(umfpack_di_col_to_triplet.o): > relocation R_X86_64_32 against `a local symbol' can not be used when > making a shared object; recompile with -fPIC > /usr/local/lib/libumfpack.a: could not read symbols: Bad value > collect2: ld returned 1 exit status > > ---------------------------------------------------------------------------------------------- > > I built UMFPACK (version 5.4.0) as per David's instructions on > http://www.scipy.org/Installing_SciPy/Linux > Specifically I checked that: > > CC = gcc > CFLAGS = -O3 -fexceptions -m64 -fPIC > F77 = gfortran > F77FLAGS = -O -m64 -fPIC > > were set in UFconfig/UFconfig.mk. > > Has anybody seen this problem and know what to do about it ? 
Thanks, > > -- KY > _______________________________________________ > SciPy-user mailing list > SciPy-user at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > From ivo.maljevic at gmail.com Mon Jul 20 19:12:01 2009 From: ivo.maljevic at gmail.com (Ivo Maljevic) Date: Mon, 20 Jul 2009 19:12:01 -0400 Subject: [SciPy-user] CDF/PDF Stats with SciPy In-Reply-To: <826c64da0907201351w774883b5xc465a99bb2e15093@mail.gmail.com> References: <77e5896b0907201311w28107eaaje4e4dcf88401e6b5@mail.gmail.com> <826c64da0907201322s290bef52x4ec362c4e510a8ca@mail.gmail.com> <77e5896b0907201336ld9aebd5i2a1e9424cb1c31ac@mail.gmail.com> <826c64da0907201351w774883b5xc465a99bb2e15093@mail.gmail.com> Message-ID: <826c64da0907201612t2533459crd35d65987ea0ce6c@mail.gmail.com> I meant complementary CDF (CCDF=1-CDF). 2009/7/20 Ivo Maljevic > I am not sure I quite understand what you are doing (the first criterion is > the success of an experiment, and the second criterion is based on > statistics of the first test?), but regardless of what you are doing, you > can apply my_cdf() function I gave you to get the discrete CDF (or you can > get the cumulative CDF as 1-CDF). To elaborate a little more on why I prefer > this CDF approach. Quite often, r.v.'s have long tails, and they tend to > disapper when you do numerical integration (cumsum is the most basic > approach) on the estimated pdf (which is the histogram). When you use all > the available data points (instead of just 50 or so), you get much better > results. > > Once you find the CDF, you should be able the get your probabilities > directly by reading off the plot values or by finding which Y-axis value > (which is the probability) matches whatever bin you are interested in (on > X-axis). > > I don't know if this helps. If not, and if you have some real data, maybe I > can write you some more code. > > Ivo > > 2009/7/20 Omer Khalid > >> Hi Ivo, >> >> >> >>> The bottom line is, are you interested in: >>> >>> a) determining the distribution from the actual data without bothering to >>> know the exact formula and drawing conclusions (that is find moments, >>> probabilities,etc) from it (that is what I normally do) >> >> >> Yes, I am interested in this. >> >> >> >>> b) try to determine what distribution your data fits the best (i.e., is >>> it normal, ricean, rayleigh, nakagammi, etc) >> >> >> This is partially true.. >> >> I think I should have explained more of my research question. My program >> is generating a real number variate X for every success. I keep on storing X >> for each success cycle of my program and once the sample list is size 1000; >> then I would like to use that sample space to determine the probability for >> every next X and again store it until the sample space reaches 1000. >> >> I am not really concerned with the distribution type of my sample space, >> so i thought (may be out of ignorance) that I first must determine the >> distribution type using the fit function and then get the mean/std. Once I >> have mean/std, then i get CDF probability for every next X and store it my >> sample list replace the previous once. >> >> Basically, I want to get a probability for every X in my program cycle >> till the next sample space reaches 1000, and keeps on doing it. This way I >> am assuming my algorithm will learn to improve. >> >> But I could not figure out the proper python code yet for this.... 
>> >> Thanks, >> Omer >> >> _______________________________________________ >> SciPy-user mailing list >> SciPy-user at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From brennan.williams at visualreservoir.com Mon Jul 20 23:26:13 2009 From: brennan.williams at visualreservoir.com (Brennan Williams) Date: Tue, 21 Jul 2009 15:26:13 +1200 Subject: [SciPy-user] piecewise continuous distribution Message-ID: <4A653555.102@visualreservoir.com> I have a user-specified piecewise continuous distribution defined by pairs of (value, cumulative probability). For example... Value , Cumulative Probability --------------------------- 1.0, 0.0 1.5, 0.2 2.3, 0.4 3.5, 0.6 5.2, 0.8 8.5, 1.0 So no values below 1.0, none above 8.5. Between adjacent specified values the generated values will be uniformly distributed, i.e. the cumulative probability graph is made up of straight lines between adjacent points/values. The generated values are created by a piece of TCL code. What I need to do is assign a pdf probability value to each generated value. How would I do this? What is the best way to do it in scipy.stats? BTW I'm new to the stats capabilities of scipy. My app also supports normal, log normal and triangular distributions and I think I've worked out how to use scipy.stats.norm, .lognorm and ...triang Thanks Brennan From robert.kern at gmail.com Mon Jul 20 23:45:32 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 20 Jul 2009 22:45:32 -0500 Subject: [SciPy-user] piecewise continuous distribution In-Reply-To: <4A653555.102@visualreservoir.com> References: <4A653555.102@visualreservoir.com> Message-ID: <3d375d730907202045h19631459t5026dee458a1f514@mail.gmail.com> On Mon, Jul 20, 2009 at 22:26, Brennan Williams wrote: > I have a user-specified ?piecewise continuous distribution defined by > pairs of (value, cumulative probability). > > For example... > > Value , ? ?Cumulative Probability > --------------------------- > 1.0, 0.0 > 1.5, 0.2 > 2.3, 0.4 > 3.5, 0.6 > 5.2, 0.8 > 8.5, 1.0 > > So no values below 1.0, none above 8.5. > Between adjacent specified values the generated values will be uniformly > distributed, > i.e. the cumulative probability graph is made up of straight lines > between adjacent points/values. > > The generated values are created by a piece of TCL code. > > What I need to do is assign a pdf probability value to each generated value. > > How would I do this? In [25]: x0 = np.array([1.0, 1.5, 2.3, 3.5, 5.2, 8.5]) In [26]: cdf = np.linspace(0, 1, len(x0)) In [27]: def pdf(x): y = np.zeros_like(x) inside = (x >= x0[0]) & (x <= x0[-1]) x = x[inside] i = x0.searchsorted(x).clip(0, len(x0)-2) block_pdf = np.diff(cdf) / np.diff(x0) y[inside] = block_pdf[i] return y ....: In [35]: pdf(np.linspace(0.0, 10.0)) Out[35]: array([ 0. , 0. , 0. , 0. , 0. , 0.25 , 0.25 , 0.25 , 0.16666667, 0.16666667, 0.16666667, 0.16666667, 0.11764706, 0.11764706, 0.11764706, 0.11764706, 0.11764706, 0.11764706, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0.06060606, 0. , 0. , 0. , 0. , 0. , 0. , 0. , 0. ]) > What is the best way to do it in scipy.stats? Neither here nor there, really. 
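One caveat about the sketch above: searchsorted() returns the insertion
index, so points strictly inside an interval pick up the density of the
*next* interval (visible in the output, where values just above 1.0 get
0.25 rather than 0.2/0.5 = 0.4). A variant that indexes the enclosing
interval, under the same breakpoints:

    import numpy as np

    x0 = np.array([1.0, 1.5, 2.3, 3.5, 5.2, 8.5])
    cdf = np.linspace(0.0, 1.0, len(x0))
    block_pdf = np.diff(cdf) / np.diff(x0)   # density on each interval

    def pdf(x):
        x = np.asarray(x, dtype=float)
        y = np.zeros_like(x)
        inside = (x >= x0[0]) & (x <= x0[-1])
        # side='right' minus one maps each point to its enclosing interval;
        # the clip keeps the right endpoint x0[-1] in the last interval.
        i = (x0.searchsorted(x[inside], side='right') - 1).clip(0, len(x0) - 2)
        y[inside] = block_pdf[i]
        return y

    # pdf(np.array([1.2])) -> array([ 0.4])  (probability 0.2 over width 0.5)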
-- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From cimrman3 at ntc.zcu.cz Tue Jul 21 09:40:56 2009 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Tue, 21 Jul 2009 15:40:56 +0200 Subject: [SciPy-user] ANN: SfePy 2009.3 Message-ID: <4A65C568.1070801@ntc.zcu.cz> I am pleased to announce release 2009.3 of SfePy. Description ----------- SfePy (simple finite elements in Python) is a software, distributed under the BSD license, for solving systems of coupled partial differential equations by the finite element method. The code is based on NumPy and SciPy packages. Mailing lists, issue tracking, git repository: http://sfepy.org Home page: http://sfepy.kme.zcu.cz Highlights of this release -------------------------- Finally, SfePy has a basic support for Windows installation via numpy distutils: - still very experimental! - the tests will not finish if umfpack is not installed, as the default direct solver in scipy cannot handle some problems (see recent sfepy-devel mailing list discussions). Major improvements ------------------ - new scripts: - cylindergen.py: cylindrical mesh generator - updated scripts: - postproc.py: - quite usable now for fast first glance at the results - plots point, cell data of all kinds (scalar, vector, tensor) - Viewer is much more configurable - probe.py: - can probe selected quantities only - isfepy: - Viewer is much more configurable - new tests and terms Applications ------------ - phononic materials: - plotting improved - caching of eigen-problem solution and Christoffel acoustic tensor - schroedinger.py: - choose and call DFT solver via solver interface People who contributed to this release: Vladimir Lukes. For more information on this release, see http://sfepy.googlecode.com/svn/web/releases/2009.3_RELEASE_NOTES.txt Best regards, Robert Cimrman From millman at berkeley.edu Wed Jul 22 12:01:17 2009 From: millman at berkeley.edu (Jarrod Millman) Date: Wed, 22 Jul 2009 09:01:17 -0700 Subject: [SciPy-user] ANN: SciPy 2009 early registration ends today Message-ID: Today is the last day to register for SciPy 2009 at the early bird rates. Please register (http://conference.scipy.org/to_register ) by the end of the day to take advantage of the reduced early registration rate. The conference schedule is available here: http://conference.scipy.org/schedule The special group rate for the Marriot Hotel is no longer available. However, there are a number of closer and less expensive choices still available: http://admissions.caltech.edu/visiting/accommodations I've been staying at the Vagabond Inn for the last several years: http://www.vagabondinn-pasadena-hotel.com/ It is within easy walking distance of the conference and has just been completely renovated. Rooms at the Vagabond start at $79/night. About the conference -------------------- SciPy 2009, the 8th Python in Science conference, will be held from August 18-23, 2009 at Caltech in Pasadena, CA, USA. The conference starts with two days of tutorials to the scientific Python tools. There will be two tracks, one for introduction of the basic tools to beginners, and one for more advanced tools. The tutorials will be followed by two days of talks. Both days of talks will begin with a keynote address. 
The first day's keynote will be given by Peter Norvig, the Director of
Research at Google, while the second keynote will be delivered by Jon
Guyer, a Materials Scientist in the Thermodynamics and Kinetics Group at
NIST. The program committee will select the remaining talks from
submissions to our call for papers. All selected talks will be included
in our conference proceedings edited by the program committee. After the
talks each day we will provide several rooms for impromptu birds of a
feather discussions. Finally, the last two days of the conference will
be used for a number of coding sprints on the major software projects in
our community.

For the 8th consecutive year, the conference will bring together the
developers and users of the open source software stack for scientific
computing with Python. Attendees have the opportunity to review the
available tools and how they apply to specific problems. By providing a
forum for developers to share their Python expertise with the wider
commercial, academic, and research communities, this conference fosters
collaboration and facilitates the sharing of software components,
techniques, and a vision for high level language use in scientific
computing.

For further information, please visit the conference homepage:
http://conference.scipy.org.

Important Dates
---------------
* Friday, July 3: Abstracts Due
* Wednesday, July 15: Announce accepted talks, post schedule
* Wednesday, July 22: Early Registration ends
* Tuesday-Wednesday, August 18-19: Tutorials
* Thursday-Friday, August 20-21: Conference
* Saturday-Sunday, August 22-23: Sprints
* Friday, September 4: Papers for proceedings due

Executive Committee
-------------------
* Jarrod Millman, UC Berkeley, USA (Conference Chair)
* Gaël Varoquaux, INRIA Saclay, France (Program Co-Chair)
* Stéfan van der Walt, University of Stellenbosch, South Africa
(Program Co-Chair)
* Fernando Pérez, UC Berkeley, USA (Tutorial Chair)

From eadrogue at gmx.net  Wed Jul 22 12:33:32 2009
From: eadrogue at gmx.net (Ernest =?iso-8859-1?Q?Adrogu=E9?=)
Date: Wed, 22 Jul 2009 18:33:32 +0200
Subject: [SciPy-user] optimization using fmin_bfgs with gradient information
In-Reply-To: 
References: <20090718123544.GA3122@doriath.local>
	<20090719110543.GA7138@doriath.local>
	<20090719183923.GA21105@doriath.local>
Message-ID: <20090722163332.GA3515@doriath.local>

20/07/09 @ 10:43 (+0200), thus spake Sebastian Walter:
> thanks for the function :).
>  It is now part of pyadolc's unit test.
> http://github.com/b45ch1/pyadolc/blob/3cfff80c062d43e812379f4606eda0ebaaf4e82c/tests/complicated_tests.py
>  , Line 246
>
> I added two functions: one is providing the gradient by finite
> differences and the other by using automatic differentiation.
> The finite differences gradient has very poor accuracy, only the first
> three digits are correct.

So what does your automatic differentiation do when it reaches
a point where the function is not differentiable?

Scipy's optimization.approx_fprime() returns an arbitrarily
large value. For example, let's suppose that f(x) is

In [164]: def f(x):
    if x[0] > 2:
        return x[0]**2
    return 0

then the derivative of f(2) doesn't exist, mathematically
speaking. f'(x<2) is 0, and f'(x>2) is 2*x, if I understand
correctly.
This is the output of approx_fprime for function f:

In [162]: [opt.approx_fprime((i,),f,eps) for i in numpy.linspace(1.95,2.05,9)]
Out[162]:
[array([ 0.]),
 array([ 0.]),
 array([ 0.]),
 array([ 0.]),
 array([  2.68435460e+08]),
 array([ 4.02500004]),
 array([ 4.05000001]),
 array([ 4.07500005]),
 array([ 4.09999996])]

As you can see, it returns a large number for f'(2).
My question is, for the purposes of optimising f(x), what
should my gradient function return at x=2, so that the
optimisation algorithm works well? I would have said it should
return 0, but seeing what approx_fprime does, I'm not sure any more.

Ernest

From perfreem at gmail.com  Wed Jul 22 23:02:31 2009
From: perfreem at gmail.com (per freem)
Date: Wed, 22 Jul 2009 23:02:31 -0400
Subject: [SciPy-user] a pdf for multivariate normal distribution?
Message-ID: 

Hi all,

I'm trying to find the function for the pdf of a multivariate normal
distribution. I know that multivariate_normal can be used to sample from
the multivariate normal distribution, but I just want to get the pdf for
a given vector of means and a covariance matrix. Is there a function to
do this? Thank you.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From sebastian.walter at gmail.com  Thu Jul 23 04:42:21 2009
From: sebastian.walter at gmail.com (Sebastian Walter)
Date: Thu, 23 Jul 2009 10:42:21 +0200
Subject: [SciPy-user] optimization using fmin_bfgs with gradient information
In-Reply-To: <20090722163332.GA3515@doriath.local>
References: <20090718123544.GA3122@doriath.local>
	<20090719110543.GA7138@doriath.local>
	<20090719183923.GA21105@doriath.local>
	<20090722163332.GA3515@doriath.local>
Message-ID: 

1) approx_fprime probably uses finite differences to approximate the
gradient, so at x = 2 it computes
(f(2+epsilon) - f(2))/epsilon = (2+epsilon)**2/epsilon
which can be very large

2) When using AD to evaluate derivatives, only one path through the
control flow graph (which is defined by the if statement in your
function) is taken. I.e. if x<2, AD will not know that for x>=2 it would
have taken another path through the control flow graph of your
algorithm. I.e. it would compute f'(x<2) = 0 and f'(x>=2) = 2*x without
realizing that the function is nondifferentiable at that point.

3) If you have such an objective function as you have shown above, then
your optimization problem is not well-formed because it does not have a
real minimizer x_*. I'd assume that your objective function looks more
like f(x) = abs(x), which is non-differentiable at x=0. This kind of
objective function destroys the convergence properties of algorithms
that assume continuously differentiable objective functions, i.e. the
convergence to the minimizer can be very slow. For such problems,
special purpose algorithms, e.g. the so-called "bundle methods", can be
used which converge superlinearly, as far as I know.

Sebastian

2009/7/22 Ernest Adrogué:
> 20/07/09 @ 10:43 (+0200), thus spake Sebastian Walter:
>> thanks for the function :).
>>  It is now part of pyadolc's unit test.
>> http://github.com/b45ch1/pyadolc/blob/3cfff80c062d43e812379f4606eda0ebaaf4e82c/tests/complicated_tests.py
>>  , Line 246
>>
>> I added two functions: one is providing the gradient by finite
>> differences and the other by using automatic differentiation.
>> The finite differences gradient has very poor accuracy, only the first
>> three digits are correct.
>
> So what does your automatic differentiation do when it reaches
> a point where the function is not differentiable?
>
> Scipy's optimization.approx_fprime() returns an arbitrarily
> large value. For example, let's suppose that f(x) is
>
> In [164]: def f(x):
>     if x[0] > 2:
>         return x[0]**2
>     return 0
>
> then the derivative of f(2) doesn't exist, mathematically
> speaking. f'(x<2) is 0, and f'(x>2) is 2*x, if I understand
> correctly. This is the output of approx_fprime for function f:
>
> In [162]: [opt.approx_fprime((i,),f,eps) for i in numpy.linspace(1.95,2.05,9)]
> Out[162]:
> [array([ 0.]),
>  array([ 0.]),
>  array([ 0.]),
>  array([ 0.]),
>  array([  2.68435460e+08]),
>  array([ 4.02500004]),
>  array([ 4.05000001]),
>  array([ 4.07500005]),
>  array([ 4.09999996])]
>
> As you can see, it returns a large number for f'(2).
> My question is, for the purposes of optimising f(x), what
> should my gradient function return at x=2, so that the
> optimisation algorithm works well? I would have said it should
> return 0, but seeing what approx_fprime does, I'm not sure any more.
>
> Ernest
>
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>

From emmanuelle.gouillart at normalesup.org  Thu Jul 23 10:17:37 2009
From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart)
Date: Thu, 23 Jul 2009 16:17:37 +0200
Subject: [SciPy-user] Numpy/Scipy and the Python African Tour
Message-ID: <20090723141737.GF10321@phare.normalesup.org>

Dear users of Numpy and Scipy,

here is an informal report on the last event of the Python African
Tour, which took place in Dakar (Senegal) on July 6-10th. It might
interest only a fraction of the lists, so I apologize for the spamming.

What is the Python African Tour?
----------------------------------

It is a recent initiative to promote the use of Python in African
countries, that we owe to Kamon Ayeva (http://kayeva.wordpress.com/).
Born a Togolese, Kamon now lives in Paris (France) where he works as a
web developer and a trainer in web technologies. It occurred to him that
but a few African developers (Hi, Stefan! :D) participate in
Python-related development or conferences. IT-related technologies and
web development can contribute to economic growth in developing
countries, and, because of the clarity and flexibility of the language,
Python-trained developers may have a competitive advantage to develop
successful businesses.

The Python African Tour sends a few volunteers to places in Africa where
a small core of Python users or developers already exists. These deliver
a training over a few days, about Python and some of its applications.
The goal is to create a small but dynamical community of users that will
keep on growing after the volunteers have left.

A first event was held in Morocco last December. Whereas the emphasis
had been put on an introduction to the Python language and to the Django
web framework, I proposed to give this time an additional course about
the use of Python in scientific computing, which was nicknamed
"Scientific Python". Numpy/Scipy is an opportunity to present the
language as attractive for not only geeks, but also academic staff --
the people who may precisely teach the language in the universities and
engineering departments.

Organization of the Dakar event
----------------------------------

This year, three European trainers flew to Dakar for the event: Kamon
Ayeva, Daniel Nouri and myself. For all courses we worked in pairs with
three local organizers and Python experts: Thomas Noël, Ousmane Wilane
and Sergiu Mihai.
The course was organized as follows:
* 1.5 days for a general introduction to Python
* 1 day of specialization, either on the Django web framework (Kamon and
Daniel) or on Python for scientific computing (Emmanuelle)
* 1 day of barcamp, or informal talks showing how we would use Python in
our daily work.
* 1.5 days of "sprint", I should rather say practical work.

The hosting organizations were the AUF (Agence universitaire de la
Francophonie), the Dakar Linux User Group (DakarLUG) and the ESP (École
Supérieure Polytechnique, a master-level engineering school). The local
team did a great job on announcing the event, rounding up potential
attendees (and sponsors!), and selecting people. The course was free of
charge, but there were only a limited number of positions. Finally,
about 50 people showed up at the course (20 having chosen the Scientific
Python course). Facilities at the AUF were excellent: three rooms with a
total of 70 PCs running the latest Ubuntu.

The course on "Scientific Python"
----------------------------------

Course material
...............

Whereas we reused the slides of the Moroccan event on the introduction
to Python, I wrote the slides for the 3-4 hour course about Scientific
Python. The slides were written in French, English being an issue for
some of the students. They can be found on
http://www.dakarlug.org/pat/scientifique/html/ for French-reading people
(my apologies to the others! I'm considering translating the slides to
English, but I will do it more rapidly if I'm given the incentive to do
so, so tell me if you may be interested in an English version!).

Given the duration of the course and the fact that the trainees had
already been introduced to Python, I chose to concentrate mostly on
Numpy, and on Scipy to a lesser extent. However, I tried to explain as
soon as possible how to use numpy and scipy in a "real life" scientific
workflow: plotting data with matplotlib or mayavi, opening data files,
finding documentation, etc.

How it went
............

Together with Thomas Noël, we followed the same group of 20 people
during the training week. The group was composed of both students
(master and PhD), and a few senior academic staff, professors and
researchers. I'm glad to say that the course was a real success, insofar
as everybody was convinced he or she could use Python with some benefit
for his or her research. Most people were already using either Matlab
(with some license issues...), R or Java for scientific computing. They
were attracted by the following features of Python:

* Python is free software, so there are no license problems
* with its wide range of scientific modules, almost *everything* can be
done with Python

Also, people were encouraged when they saw at the end of the week that
they could work on their own data using their usual algorithms in
Python, without too many difficulties. Speaking about difficulties, one
may say that there was some disparity in the level of experience inside
the group, some being very well at ease with array programming while
others were obviously discovering programming. It was therefore
difficult to meet everybody's needs, but well, I hope everybody learned
something!

Some pictures of the event are on http://dakarlug.org/pat/galerie/.

What's next: outlook on the PAT
--------------------------------

The Dakar event was a very encouraging one, and some other countries
(Togo or Zambia) are already considered for the next stop of the PAT.
Of course, before moving on to the next event, we first would like to
see what happens in Dakar after the PAT has left. Some pedagogical staff
decided there would be a course on Python next academic year, and this
is a very positive decision: the one-week course was definitely too
short for the students. Now the big question is who is going to teach
the course, as all the professors were beginners in Python. A new
mailing-list has also been created to gather the trainees together after
the event.

If you are interested in the PAT and you want to know more about past
and future events, you may write to the dedicated mailing-list
python-african-tour-discussion at lists.openplans.org, or take a look at
the website http://www.coactivate.org/projects/python-african-tour.

Thanks a lot for your patience if you have read so far!

Cheers,
Emmanuelle

From perfreem at gmail.com  Thu Jul 23 15:44:43 2009
From: perfreem at gmail.com (per freem)
Date: Thu, 23 Jul 2009 15:44:43 -0400
Subject: [SciPy-user] pdf for multivariate normal-logistic function?
Message-ID: 

Hello all,

does anyone have a python function for evaluating the pdf of the
multivariate logistic-normal distribution? This would be greatly
appreciated. Thank you.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From atmyers at berkeley.edu  Fri Jul 24 00:10:41 2009
From: atmyers at berkeley.edu (atmyers at berkeley.edu)
Date: Thu, 23 Jul 2009 21:10:41 -0700 (PDT)
Subject: [SciPy-user] 64-Bit Scipy on OS X
Message-ID: <15903.169.229.69.169.1248408641.squirrel@calmail.berkeley.edu>

Hello All,

I have successfully built numpy 1.3.0 with a 64-bit installation of
Python 2.6. I am trying to do the same with scipy-0.7.1, but I seem to be
having the same problem as Dan Yamins here - the "x86_64" flag gets
passed to the c compiler just fine, but NOT to gfortran. Consequently,
the modules that don't use fortran work just fine, but the ones that do
fail to import.

The solution that worked for Dan:

LDFLAGS="-arch x86_64 -undefined dynamic_lookup -bundle" FFLAGS="-arch
x86_64 -O2 -Wall -fPIC" python setup.py install

does not work for me. The output of python setup.py install still has
things like

gfortran:f77:
build/src.macosx-10.5-universal-2.6/scipy/stats/mvn-f2pywrappers.f
/usr/local/bin/gfortran -Wall -Wall -undefined dynamic_lookup -bundle
build/temp.macosx-10.5-universal-2.6/build/src.macosx-10.5-universal-2.6/scipy/stats/mvnmodule.o
build/temp.macosx-10.5-universal-2.6/build/src.macosx-10.5-universal-2.6/fortranobject.o
build/temp.macosx-10.5-universal-2.6/scipy/stats/mvndst.o
build/temp.macosx-10.5-universal-2.6/build/src.macosx-10.5-universal-2.6/scipy/stats/mvn-f2pywrappers.o
-L/usr/local/lib/gcc/i386-apple-darwin9.0.0/4.3.0
-Lbuild/temp.macosx-10.5-universal-2.6 -lgfortran -o
build/lib.macosx-10.5-universal-2.6/scipy/stats/mvn.so

so the flag still isn't getting passed. I don't really have any
experience with this kind of thing, so I'd appreciate any help you all
could offer.

Thanks,
Andrew Myers

From tritemio at gmail.com  Fri Jul 24 05:26:29 2009
From: tritemio at gmail.com (Antonino Ingargiola)
Date: Fri, 24 Jul 2009 11:26:29 +0200
Subject: [SciPy-user] Building 64bit scipy with ATLAS/LAPACK/UMFPACK in
	Fedora core 8
Message-ID: <5486cca80907240226y56a55508rb7ed64c1782578b4@mail.gmail.com>

Hi to the list,

I have Fedora Core 8 64-bit on an 8-core server. I've installed the
latest numpy/scipy from sources, but using Sfepy[1] I found that the
tests run very slowly.
Here is the procedure I used to install numpy and scipy. First I
installed:

lapack.x86_64
lapack-devel.x86_64
atlas.x86_64
atlas-devel.x86_64
suitesparse.x86_64
suitesparse-devel.x86_64

then I downloaded numpy 1.3.0 and edited site.cfg, adding the following
lines:

[umfpack]
library_dirs=/usr/lib64
include_dirs = /usr/include/suitesparse

and built it with

python setup.py build
python setup.py install --prefix=/my/dir

After that I built scipy in exactly the same way (including the site.cfg
editing).

For umfpack I installed the scikits using "python setup.py install
--prefix=/my/dir"

How can I check if this scipy is correctly using the fast
atlas/lapack/umfpack libraries?

Are there any pitfalls on Fedora Core 8 regarding the numerical
libraries? I've read that some libraries are broken...

Any help is appreciated.

~ Antonio

[1] http://code.google.com/p/sfepy/

From david at ar.media.kyoto-u.ac.jp  Fri Jul 24 05:15:28 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Fri, 24 Jul 2009 18:15:28 +0900
Subject: [SciPy-user] Building 64bit scipy with ATLAS/LAPACK/UMFPACK in
	Fedora core 8
In-Reply-To: <5486cca80907240226y56a55508rb7ed64c1782578b4@mail.gmail.com>
References: <5486cca80907240226y56a55508rb7ed64c1782578b4@mail.gmail.com>
Message-ID: <4A697BB0.9070706@ar.media.kyoto-u.ac.jp>

Antonino Ingargiola wrote:
> Hi to the list,
>
> I have Fedora Core 8 64-bit on an 8-core server. I've installed the
> latest numpy/scipy from sources, but using Sfepy[1] I found that the
> tests run very slowly.
>
> Here is the procedure I used to install numpy and scipy. First I
> installed:
>
> lapack.x86_64
> lapack-devel.x86_64
> atlas.x86_64
> atlas-devel.x86_64
> suitesparse.x86_64
> suitesparse-devel.x86_64
>
> then I downloaded numpy 1.3.0 and edited site.cfg, adding the following
> lines:
>
> [umfpack]
> library_dirs=/usr/lib64
> include_dirs = /usr/include/suitesparse
>
> and built it with
>
> python setup.py build
> python setup.py install --prefix=/my/dir
>
> After that I built scipy in exactly the same way (including the site.cfg
> editing).
>
> For umfpack I installed the scikits using "python setup.py install
> --prefix=/my/dir"
>
> How can I check if this scipy is correctly using the fast
> atlas/lapack/umfpack libraries?
>

You can check as follows:
    >>> import scipy
    >>> scipy.show_config()
    >>> import scipy.linalg.atlas_version

To check the speed, you can just use numpy.dot. Basic blas vs. optimized
ATLAS can be one order of magnitude different for large matrices (say
1000x1000, for example).

> Are there any pitfalls on Fedora Core 8 regarding the numerical
> libraries? I've read that some libraries are broken...
>

Generally, problems are with the packaged versions - ATLAS in particular
is very hard to package correctly. If you build them by yourself
correctly, there should be no problem. Just make sure you use the last
released ATLAS (3.8.3 I believe at this time), and avoid LAPACK 3.2.0
(i.e. use 3.1.1).
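A quick way to run that check (a sketch; the exact timing is
machine-dependent, but with an optimized ATLAS a 1000x1000 product
should finish in well under a second on hardware of this era):

    import time
    import numpy as np

    a = np.random.rand(1000, 1000)
    b = np.random.rand(1000, 1000)

    t0 = time.time()
    c = np.dot(a, b)          # routed to the BLAS dgemm underneath
    print "numpy.dot(1000x1000) took %.2f s" % (time.time() - t0)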
cheers,

David

From tritemio at gmail.com  Fri Jul 24 06:09:45 2009
From: tritemio at gmail.com (Antonino Ingargiola)
Date: Fri, 24 Jul 2009 12:09:45 +0200
Subject: [SciPy-user] Building 64bit scipy with ATLAS/LAPACK/UMFPACK in
	Fedora core 8
In-Reply-To: <4A697BB0.9070706@ar.media.kyoto-u.ac.jp>
References: <5486cca80907240226y56a55508rb7ed64c1782578b4@mail.gmail.com>
	<4A697BB0.9070706@ar.media.kyoto-u.ac.jp>
Message-ID: <5486cca80907240309m3cf3833em1694f6fbcd226774@mail.gmail.com>

Hi David,

2009/7/24 David Cournapeau:
[cut]
>> How can I check if this scipy is correctly using the fast
>> atlas/lapack/umfpack libraries?
>>
>
> You can check as follows:
>     >>> import scipy
>     >>> scipy.show_config()
>     >>> import scipy.linalg.atlas_version
>
> To check the speed, you can just use numpy.dot. Basic blas vs. optimized
> ATLAS can be one order of magnitude different for large matrices (say
> 1000x1000, for example).

I get this:

import numpy, scipy
a=numpy.random.rand(1000,1000)
b=numpy.random.rand(1000,1000)
time c = numpy.dot(a,b)

CPU times: user 0.55 s, sys: 0.01 s, total: 0.56 s
Wall time: 0.56 s

so it seems it is using the fast LAPACK? (On an Ubuntu machine on which
sfepy is very fast I get the same in 0.6 s.)

Furthermore, scipy.show_config() gives:

amd_info:
    libraries = ['amd']
    library_dirs = ['/usr/lib64']

umfpack_info:
    libraries = ['umfpack', 'amd']
    library_dirs = ['/usr/lib64']
    define_macros = [('SCIPY_UMFPACK_H', None)]
    swig_opts = ['-I/usr/include/suitesparse']
    include_dirs = ['/usr/include/suitesparse']

atlas_threads_info:
  NOT AVAILABLE

blas_opt_info:
    libraries = ['f77blas', 'cblas', 'atlas']
    library_dirs = ['/usr/lib64/atlas']
    define_macros = [('ATLAS_INFO', '"\\"3.6.0\\""')]
    language = c
    include_dirs = ['/usr/include/atlas']

atlas_blas_threads_info:
  NOT AVAILABLE

lapack_opt_info:
    libraries = ['lapack', 'f77blas', 'cblas', 'atlas']
    library_dirs = ['/usr/lib64/atlas']
    define_macros = [('ATLAS_INFO', '"\\"3.6.0\\""')]
    language = f77
    include_dirs = ['/usr/include/atlas']

atlas_info:
    libraries = ['lapack', 'f77blas', 'cblas', 'atlas']
    library_dirs = ['/usr/lib64/atlas']
    language = f77
    include_dirs = ['/usr/include/atlas']

lapack_mkl_info:
  NOT AVAILABLE

blas_mkl_info:
  NOT AVAILABLE

atlas_blas_info:
    libraries = ['f77blas', 'cblas', 'atlas']
    library_dirs = ['/usr/lib64/atlas']
    language = c
    include_dirs = ['/usr/include/atlas']

mkl_info:
  NOT AVAILABLE

Is it OK?

>> Are there any pitfalls on Fedora Core 8 regarding the numerical
>> libraries? I've read that some libraries are broken...
>>
>
> Generally, problems are with the packaged versions - ATLAS in particular
> is very hard to package correctly. If you build them by yourself
> correctly, there should be no problem. Just make sure you use the last
> released ATLAS (3.8.3 I believe at this time), and avoid LAPACK 3.2.0
> (i.e. use 3.1.1).

Given the above results, the packaged versions of ATLAS/LAPACK should
work fine. How about testing if UMFPACK is used... maybe I should test a
dot product between sparse matrices, but I don't know how to do it.
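For reference, UMFPACK is a sparse *solver*, so a sparse dot product
would not exercise it; solving a linear system will. A sketch (assuming
scipy.sparse; with the umfpack scikit installed, spsolve should route
through UMFPACK, otherwise it falls back on SuperLU):

    import numpy as np
    from scipy import sparse
    from scipy.sparse.linalg import spsolve

    n = 5000
    # Sparse tridiagonal test matrix, converted to CSR for the solver.
    A = sparse.spdiags([np.ones(n), -2.0 * np.ones(n), np.ones(n)],
                       [-1, 0, 1], n, n).tocsr()
    b = np.ones(n)

    x = spsolve(A, b)
    print "max residual:", abs(A * x - b).max()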
> cheers,
>
> David

Thanks,
Antonio

From stefan at sun.ac.za  Fri Jul 24 09:32:55 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Fri, 24 Jul 2009 15:32:55 +0200
Subject: [SciPy-user] Numpy/Scipy and the Python African Tour
In-Reply-To: <20090723141737.GF10321@phare.normalesup.org>
References: <20090723141737.GF10321@phare.normalesup.org>
Message-ID: <9457e7c80907240632t3ba83b3avc4d82ef88628a20f@mail.gmail.com>

2009/7/23 Emmanuelle Gouillart:
> Whereas we reused the slides of the Moroccan event on the introduction
> to Python, I wrote the slides for the 3-4 hour course about Scientific
> Python. The slides were written in French, English being an issue for
> some of the students. They can be found on
> http://www.dakarlug.org/pat/scientifique/html/ for French-reading people
> (my apologies to the others! I'm considering translating the slides to
> English, but I will do it more rapidly if I'm given the incentive to do
> so, so tell me if you may be interested in an English version!).

That's a fantastic set of notes! I wonder if a person can't find a
volunteer to do the translation, saving you some time.

> Thanks a lot for your patience if you have read so far!

Thank you for an interesting and inspiring post.

Regards
Stéfan

From ce at vejnar.eu  Fri Jul 24 11:47:32 2009
From: ce at vejnar.eu (Charles Vejnar)
Date: Fri, 24 Jul 2009 17:47:32 +0200
Subject: [SciPy-user] Selection of dominant points
Message-ID: <200907241747.33691.ce@vejnar.eu>

Hi,

I have a curve with millions of points which is non-standard (i.e. it's
not a Gaussian or ...).

I would like to select about 500 points which best describe the curve
(something like selecting dominant points).

Do you know any existing implementation?

Thank you.

Charles

From dyamins at gmail.com  Fri Jul 24 16:57:17 2009
From: dyamins at gmail.com (Dan Yamins)
Date: Fri, 24 Jul 2009 16:57:17 -0400
Subject: [SciPy-User] [SciPy-user] 64-Bit Scipy on OS X
In-Reply-To: <15903.169.229.69.169.1248408641.squirrel@calmail.berkeley.edu>
References: <15903.169.229.69.169.1248408641.squirrel@calmail.berkeley.edu>
Message-ID: <15e4667e0907241357u5e74b18cn4f02d3a4c2e7b333@mail.gmail.com>

> The solution that worked for Dan:
>
> LDFLAGS="-arch x86_64 -undefined dynamic_lookup -bundle" FFLAGS="-arch
> x86_64 -O2 -Wall -fPIC" python setup.py install
>
> does not work for me. The output of python setup.py install still has
> things like
>
> gfortran:f77:
> build/src.macosx-10.5-universal-2.6/scipy/stats/mvn-f2pywrappers.f
> /usr/local/bin/gfortran -Wall -Wall -undefined dynamic_lookup -bundle
>
> build/temp.macosx-10.5-universal-2.6/build/src.macosx-10.5-universal-2.6/scipy/stats/mvnmodule.o
>
> build/temp.macosx-10.5-universal-2.6/build/src.macosx-10.5-universal-2.6/fortranobject.o
> build/temp.macosx-10.5-universal-2.6/scipy/stats/mvndst.o
>
> build/temp.macosx-10.5-universal-2.6/build/src.macosx-10.5-universal-2.6/scipy/stats/mvn-f2pywrappers.o
> -L/usr/local/lib/gcc/i386-apple-darwin9.0.0/4.3.0
> -Lbuild/temp.macosx-10.5-universal-2.6 -lgfortran -o
> build/lib.macosx-10.5-universal-2.6/scipy/stats/mvn.so
>

I'm not an expert at doing this sort of thing either -- I think David
Cournapeau and Robert Kern are really the experts there -- but I would
like to ask if you have any more specific error message output during the
compile? Are there any lines with a message like "not of proper
architecture" or something similar? If you actually are able, could you
shunt all the output to a file, and attach it?
Maybe I could try to diagnose the problem. (Quite a few other things came up that had to be solved _before_ the passing of the flags at the command prompt worked as a solution.) Dan -------------- next part -------------- An HTML attachment was scrubbed... URL: From atmyers at berkeley.edu Fri Jul 24 18:04:12 2009 From: atmyers at berkeley.edu (atmyers at berkeley.edu) Date: Fri, 24 Jul 2009 15:04:12 -0700 (PDT) Subject: [SciPy-User] [SciPy-user] 64-Bit Scipy on OS X In-Reply-To: <15e4667e0907241357u5e74b18cn4f02d3a4c2e7b333@mail.gmail.com> References: <15903.169.229.69.169.1248408641.squirrel@calmail.berkeley.edu> <15e4667e0907241357u5e74b18cn4f02d3a4c2e7b333@mail.gmail.com> Message-ID: <60169.136.152.134.151.1248473052.squirrel@calmail.berkeley.edu> Hi Dan, Thanks for responding. I actually just solved the problem on my own: as David Cournapeau tried to tell me in your thread, sudo affects environment variables, so I wasn't actually passing the flag like I thought I was. After that I needed to upgrade my gfortran to a universal capable version, and the install went fine. Thanks for the help, and sorry for spamming the list. ~Andrew >> The solution that worked for Dan: >> >> LDFLAGS="-arch x86_64 -undefined dynamic_lookup -bundle" FFLAGS="-arch >> x86_64 -O2 -Wall -fPIC" python setup.py install >> >> does not work for me. The output of python setup.py install still has >> things like >> >> gfortran:f77: >> build/src.macosx-10.5-universal-2.6/scipy/stats/mvn-f2pywrappers.f >> /usr/local/bin/gfortran -Wall -Wall -undefined dynamic_lookup -bundle >> >> build/temp.macosx-10.5-universal-2.6/build/src.macosx-10.5-universal-2.6/scipy/stats/mvnmodule.o >> >> build/temp.macosx-10.5-universal-2.6/build/src.macosx-10.5-universal-2.6/fortranobject.o >> build/temp.macosx-10.5-universal-2.6/scipy/stats/mvndst.o >> >> build/temp.macosx-10.5-universal-2.6/build/src.macosx-10.5-universal-2.6/scipy/stats/mvn-f2pywrappers.o >> -L/usr/local/lib/gcc/i386-apple-darwin9.0.0/4.3.0 >> -Lbuild/temp.macosx-10.5-universal-2.6 -lgfortran -o >> build/lib.macosx-10.5-universal-2.6/scipy/stats/mvn.so >> > > I'm not an expert at doing this sort of thing either -- I think David > Cornapeau and Robert Kern are really the experts there -- but I would like > to ask if you have any more specific error message output during the > compile? Are there any lines with a message like "not of proper > architecture" or something similar? If you actually are able, could you > shunt all the output to a file, and attach it? Maybe I could try to > diagnose the problem. (Quite a few other things came up that had to be > solved _before_ the passing of the flags at the command prompt worked as a > solution.) > > Dan > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From dwf at cs.toronto.edu Sat Jul 25 00:45:09 2009 From: dwf at cs.toronto.edu (David Warde-Farley) Date: Sat, 25 Jul 2009 00:45:09 -0400 Subject: [SciPy-User] [SciPy-user] Selection of dominant points In-Reply-To: <200907241747.33691.ce@vejnar.eu> References: <200907241747.33691.ce@vejnar.eu> Message-ID: On 24-Jul-09, at 11:47 AM, Charles Vejnar wrote: > Hi, > > I have a curve with millions of points which is non-standard (i.e. > it's not a > gaussian or ...). > > I would like to select about 500 points which best describe the curve > (something like selecting dominant points). It would be helpful to know more about the problem. 
Like:
- are the points evenly spaced along the independent axis?
- are they assumed to be samples from a continuous signal?
- does the signal have any known structure at all? If you could
describe where these data come from it might help nail down what
methods would be appropriate.

Regards,

David
_______________________________________________
SciPy-User mailing list
SciPy-User at scipy.org
http://mail.scipy.org/mailman/listinfo/scipy-user

From jkington at wisc.edu  Sat Jul 25 03:02:33 2009
From: jkington at wisc.edu (Joe Kington)
Date: Sat, 25 Jul 2009 02:02:33 -0500
Subject: [SciPy-User] [SciPy-user] Selection of dominant points
In-Reply-To: 
References: <200907241747.33691.ce@vejnar.eu>
Message-ID: 

Well, I thought shapely would have some sort of line-generalization
capabilities, but apparently it doesn't.

You may already be aware of this, but algorithm-wise, you're looking for
the Douglas-Peucker algorithm, or something similar.

If you don't want to mess with implementing it yourself, you might want
to look at GRASS. GRASS has a generalize function that should do what you
need, and does have at least some python bindings. I've never actually
used the python bindings for GRASS, so I can't help you too much there...

Hope that helps, at any rate!
-Joe

On Fri, Jul 24, 2009 at 11:45 PM, David Warde-Farley wrote:

> On 24-Jul-09, at 11:47 AM, Charles Vejnar wrote:
>
> > Hi,
> >
> > I have a curve with millions of points which is non-standard (i.e.
> > it's not a
> > Gaussian or ...).
> >
> > I would like to select about 500 points which best describe the curve
> > (something like selecting dominant points).
>
> It would be helpful to know more about the problem. Like:
> - are the points evenly spaced along the independent axis?
> - are they assumed to be samples from a continuous signal?
> - does the signal have any known structure at all? If you could
> describe where these data come from it might help nail down what
> methods would be appropriate.
>
> Regards,
>
> David
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From stef.mientki at gmail.com  Sat Jul 25 17:13:23 2009
From: stef.mientki at gmail.com (Stef Mientki)
Date: Sat, 25 Jul 2009 23:13:23 +0200
Subject: [SciPy-User] encryption of numpy array (wav file) ?
Message-ID: <4A6B7573.5000501@gmail.com>

hello,

I need to encrypt a wav file (conversation between doctor and his
patient). Now it's easy to convert a wav file (or maybe even the sound
stream itself) into a numpy array. Everything runs on a slow machine
(netbook), so I hope numpy can encrypt on the fly. I don't need very
secure encryption, just good enough to protect the patient information
from accidental listening by others.

Any idea of a relatively simple (and fast) encryption method with numpy
arrays?

thanks,
Stef Mientki

From robert.kern at gmail.com  Sat Jul 25 17:17:17 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sat, 25 Jul 2009 16:17:17 -0500
Subject: [SciPy-User] encryption of numpy array (wav file) ?
In-Reply-To: <4A6B7573.5000501@gmail.com>
References: <4A6B7573.5000501@gmail.com>
Message-ID: <3d375d730907251417ia47c953h8a99a75cd4be6061@mail.gmail.com>

On Sat, Jul 25, 2009 at 16:13, Stef Mientki wrote:
> hello,
>
> I need to encrypt a wav file (conversation between doctor and his
> patient). Now it's easy to convert a wav file (or maybe even the sound
> stream itself) into a numpy array. Everything runs on a slow machine
> (netbook), so I hope numpy can encrypt on the fly.
From stef.mientki at gmail.com Sat Jul 25 17:13:23 2009 From: stef.mientki at gmail.com (Stef Mientki) Date: Sat, 25 Jul 2009 23:13:23 +0200 Subject: [SciPy-User] encryption of numpy array (wav file) ? Message-ID: <4A6B7573.5000501@gmail.com> hello, I need to encrypt a wav file (conversation between a doctor and his patient). Now it's easy to convert a wav file (or maybe even the sound stream itself) into a numpy array. Everything runs on a slow machine (netbook), so I hope numpy can encrypt on the fly. I don't need very secure encryption, just good enough to protect the patient information from accidental listening by others. Any idea of a relatively simple (and fast) encryption method with numpy arrays? thanks, Stef Mientki From robert.kern at gmail.com Sat Jul 25 17:17:17 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 25 Jul 2009 16:17:17 -0500 Subject: [SciPy-User] encryption of numpy array (wav file) ? In-Reply-To: <4A6B7573.5000501@gmail.com> References: <4A6B7573.5000501@gmail.com> Message-ID: <3d375d730907251417ia47c953h8a99a75cd4be6061@mail.gmail.com> On Sat, Jul 25, 2009 at 16:13, Stef Mientki wrote: > hello, > > I need to encrypt a wav file (conversation between a doctor and his patient). > Now it's easy to convert a wav file (or maybe even the sound stream > itself) into a numpy array. > Everything runs on a slow machine (netbook), > so I hope numpy can encrypt on the fly. > I don't need very secure encryption, just good enough to protect the > patient information from accidental listening by others. > > Any idea of a relatively simple (and fast) encryption method with numpy > arrays? Not on numpy arrays, no. Just use the normal crypto tools on strings. Preferably, use a complete tool like GPG (for files) or SSL (for network connections) rather than just using a library like PyCrypto. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From david_baddeley at yahoo.com.au Sun Jul 26 20:16:26 2009 From: david_baddeley at yahoo.com.au (David Baddeley) Date: Sun, 26 Jul 2009 17:16:26 -0700 (PDT) Subject: [SciPy-User] encryption of numpy array (wav file) ? In-Reply-To: <3d375d730907251417ia47c953h8a99a75cd4be6061@mail.gmail.com> References: <4A6B7573.5000501@gmail.com> <3d375d730907251417ia47c953h8a99a75cd4be6061@mail.gmail.com> Message-ID: <759002.48107.qm@web33008.mail.mud.yahoo.com> If you're only looking to protect the data from accidental listening you could always just bitwise xor it with a (random) key: encrypted = data ^ key decrypted = encrypted ^ key if you had a reasonable key (which you kept safe), and potentially cast your data to some type with loads of precision (e.g. int64 or float128) before doing the xor's it might not be too insecure either. David ----- Original Message ---- From: Robert Kern To: SciPy Users List Sent: Sunday, 26 July, 2009 9:17:17 AM Subject: Re: [SciPy-User] encryption of numpy array (wav file) ? On Sat, Jul 25, 2009 at 16:13, Stef Mientki wrote: > hello, > > I need to encrypt a wav file (conversation between a doctor and his patient). > Now it's easy to convert a wav file (or maybe even the sound stream > itself) into a numpy array. > Everything runs on a slow machine (netbook), > so I hope numpy can encrypt on the fly. > I don't need very secure encryption, just good enough to protect the > patient information from accidental listening by others. > > Any idea of a relatively simple (and fast) encryption method with numpy > arrays? Not on numpy arrays, no. Just use the normal crypto tools on strings. Preferably, use a complete tool like GPG (for files) or SSL (for network connections) rather than just using a library like PyCrypto. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco _______________________________________________ SciPy-User mailing list SciPy-User at scipy.org http://mail.scipy.org/mailman/listinfo/scipy-user From robert.kern at gmail.com Sun Jul 26 21:06:19 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 26 Jul 2009 20:06:19 -0500 Subject: [SciPy-User] encryption of numpy array (wav file) ?
In-Reply-To: <759002.48107.qm@web33008.mail.mud.yahoo.com> References: <4A6B7573.5000501@gmail.com> <3d375d730907251417ia47c953h8a99a75cd4be6061@mail.gmail.com> <759002.48107.qm@web33008.mail.mud.yahoo.com> Message-ID: <3d375d730907261806g60578809q9be777808beb1b5@mail.gmail.com> On Sun, Jul 26, 2009 at 19:16, David Baddeley wrote: > > If you're only looking to protect the data from accidental listening you could always just bitwise xor it with a (random) key: > > encrypted = data ^ key > > decrypted = encrypted ^ key > > if you had a reasonable key (which you kept safe), and potentially cast your data to some type with loads of precision (e.g. int64 or float128) before doing the xor's it might not be too insecure either. Upcasting the data will make lots of predictable zeros and thus expose much of your key. If you have any concerns about privacy, it is much, much easier to use real cryptography written by security professionals than to devise some scheme yourself. This is even more important when you are protecting someone else's confidential medical information. If it's important enough to do at all, it's important enough to do it right. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From massimiliano.cannata at gmail.com Mon Jul 27 07:09:56 2009 From: massimiliano.cannata at gmail.com (massimiliano cannata) Date: Mon, 27 Jul 2009 13:09:56 +0200 Subject: [SciPy-User] timeseries manipulate Message-ID: Hi folks, I'm quite new to python, numpy and timeseries. I have the following problem, and maybe you can help me: - I have files with irregular time series data (at different seconds) - I want a regular 10-minute time series Looking around I saw that I can get a 10min frequency by masking a minutely timeseries, but how to get there? Should I manipulate the arrays on my own (loop over the dates array and the values array and extrapolate new dates10min and vals10min arrays) or is there a cheaper and faster approach I should follow? Up to now I figured out I could: 1. read the txt file 2. create a time series at "second" frequency 3. convert to "minute" freq with convert('MINUTE',func=ma.sum) and then..... I should somehow make a sum of all the values within every minute and derive a minute freq timeseries with values only every 10min... I hope you can understand what I mean.. :-) Thanks, Maxi -------------- next part -------------- An HTML attachment was scrubbed... URL: From Dharhas.Pothina at twdb.state.tx.us Mon Jul 27 11:40:27 2009 From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina) Date: Mon, 27 Jul 2009 10:40:27 -0500 Subject: [SciPy-User] timeseries manipulate In-Reply-To: References: Message-ID: <4A6D841B.63BA.009B.0@twdb.state.tx.us> Hi, From what I understand there is no direct way to do this in the timeseries package but if you search the list archives for 'pytseries custom frequencies' there is a post describing a workaround to do what you are trying to do. - dharhas >>> massimiliano cannata 7/27/2009 6:09 AM >>> Hi folks, I'm quite new to python, numpy and timeseries. I have the following problem, and maybe you can help me: - I have files with irregular time series data (at different seconds) - I want a regular 10-minute time series Looking around I saw that I can get a 10min frequency by masking a minutely timeseries, but how to get there? Should I manipulate the arrays on my own (loop over the dates array and the values array and extrapolate new dates10min and vals10min arrays) or is there a cheaper and faster approach I should follow? Up to now I figured out I could: 1. read the txt file 2. create a time series at "second" frequency 3. convert to "minute" freq with convert('MINUTE',func=ma.sum) and then..... I should somehow make a sum of all the values within every minute and derive a minute freq timeseries with values only every 10min... I hope you can understand what I mean.. :-) Thanks, Maxi
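Independently of the timeseries package, the 10-minute aggregation itself only needs a few lines of plain numpy. A minimal sketch, assuming the timestamps are plain seconds since some epoch (the function name and the aggregation choice are illustrative, not Maxi's actual file format):

import numpy as np

def regularize_10min(times, values):
    """Aggregate an irregular series onto a regular 10-minute grid.

    times  : 1-d array of timestamps in seconds (any epoch)
    values : 1-d array of readings taken at those times
    Returns (bin_start_times, sums, counts); empty bins have count 0.
    """
    times = np.asarray(times)
    values = np.asarray(values)
    step = 600                                  # 10 minutes in seconds
    t0 = times.min() - times.min() % step       # align grid to a 10-min mark
    idx = ((times - t0) // step).astype(int)    # bin index of every sample
    sums = np.bincount(idx, weights=values)     # per-bin sum
    counts = np.bincount(idx)                   # per-bin sample count
    grid = t0 + step * np.arange(len(sums))
    return grid, sums, counts

# example: samples at random seconds over two hours
t = np.sort(np.random.uniform(0, 7200, 500))
v = np.random.rand(500)
grid, sums, counts = regularize_10min(t, v)
means = sums / np.where(counts == 0, 1, counts)  # guard the empty bins

Swapping the sum for a mean, max, etc. is just a matter of what you do with sums and counts afterwards.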
From daniel.farrell at imperial.ac.uk Mon Jul 27 03:33:37 2009 From: daniel.farrell at imperial.ac.uk (Daniel J Farrell) Date: Mon, 27 Jul 2009 08:33:37 +0100 Subject: [SciPy-User] Scipy and statistics: probability density function Message-ID: Dear list, I am looking for some of the functionality provided by the GNU Scientific Library histogram module (http://www.gnu.org/software/gsl/manual/html_node/Histograms.html). In particular, I need to be able to create a probability density function from my histogram of data. This will allow inverse look-ups to be performed, i.e. for a random number (0-->1) find the associated probability (http://www.gnu.org/software/gsl/manual/html_node/The-histogram-probability-distribution-struct.html). This allows samples to be returned from the distribution, weighted by the probability of the distribution -- which is a common task! Is it easier just to call GSL using ctypes? I just wanted to check that this functionality doesn't already exist before I stray too far, or implement it myself. Best regards Dan From robert.kern at gmail.com Mon Jul 27 11:59:36 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 27 Jul 2009 10:59:36 -0500 Subject: [SciPy-User] Scipy and statistics: probability density function In-Reply-To: References: Message-ID: <3d375d730907270859w42327684gb726549eea094147@mail.gmail.com> On Mon, Jul 27, 2009 at 02:33, Daniel J Farrell wrote: > Dear list, > > I am looking for some of the functionality provided by the GNU > Scientific Library histogram module (http://www.gnu.org/software/gsl/manual/html_node/Histograms.html). > > In particular, I need to be able to create a probability density > function from my histogram of data. This will allow inverse look-ups > to be performed, i.e. for a random number (0-->1) find the associated > probability (http://www.gnu.org/software/gsl/manual/html_node/The-histogram-probability-distribution-struct.html). > This allows samples to be returned from the distribution, weighted by > the probability of the distribution -- which is a common task! It looks like you want a CDF (or rather its inverse, the PPF) rather than a PDF. Anyways, this is straightforward. Compute the histogram using normed=False. Find the cumulative sum and divide by the sum to get the (smoothed) empirical CDF. Prepend a 0.0 to this, and then this will align with the edges array that is also returned. Then you can use linear interpolation to do lookups. If you use the edges array as "X" and the empirical CDF as "Y", then this is a CDF. If you use the empirical CDF array as "X" and the edges as "Y", then this is a PPF.
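The sampling half of Daniel's question -- returning values weighted by the distribution -- then follows by pushing uniform random numbers through that PPF, i.e. inverse-transform sampling. A minimal sketch (the function name and test data are illustrative, not from the thread):

import numpy as np

def sample_from_hist(data, nsamples, bins=10):
    """Draw nsamples values distributed like `data`, by inverting the
    empirical CDF built from a histogram (inverse-transform sampling)."""
    hist, edges = np.histogram(data, bins=bins)
    ecdf = np.hstack([0.0, hist.cumsum() / float(hist.sum())])
    u = np.random.uniform(size=nsamples)   # uniform variates on [0, 1)
    # PPF lookup: interpolate with the ECDF as "X" and the edges as "Y";
    # empty bins make the ECDF flat, which np.interp tolerates
    return np.interp(u, ecdf, edges)

data = np.random.normal(5.0, 2.0, size=10000)
samples = sample_from_hist(data, 1000)

The interactive session that follows builds the same ECDF/PPF pair step by step on concrete numbers.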
In [12]: x = np.random.uniform(0, 10, size=1000)

In [13]: hist, edges = np.histogram(x)

In [15]: hist
Out[15]: array([100, 101, 104, 108, 80, 111, 96, 88, 108, 104])

In [16]: edges
Out[16]: array([ 3.53879571e-04, 1.00026423e+00, 2.00017458e+00, 3.00008493e+00, 3.99999528e+00, 4.99990563e+00, 5.99981598e+00, 6.99972633e+00, 7.99963668e+00, 8.99954704e+00, 9.99945739e+00])

In [18]: ecdf = np.hstack([0.0, hist.cumsum() / float(hist.sum())])

In [19]: ecdf
Out[19]: array([ 0. , 0.1 , 0.201, 0.305, 0.413, 0.493, 0.604, 0.7 , 0.788, 0.896, 1. ])

In [20]: np.interp(np.linspace(0, 10), edges, ecdf)
Out[20]: array([ 0. , 0.0203746 , 0.04078459, 0.06119459, 0.08160458, 0.10203472, 0.12264881, 0.14326291, 0.163877 , 0.18449109, 0.20522712, 0.22645351, 0.24767991, 0.2689063 , 0.29013269, 0.31160366, 0.33364646, 0.35568925, 0.37773204, 0.39977483, 0.41953158, 0.43585957, 0.45218756, 0.46851556, 0.48484355, 0.50433802, 0.52699311, 0.54964821, 0.5723033 , 0.59495839, 0.61577382, 0.63536742, 0.65496101, 0.6745546 , 0.6941482 , 0.71259664, 0.73055743, 0.74851823, 0.76647902, 0.78443982, 0.80567348, 0.82771627, 0.84975906, 0.87180185, 0.89384465, 0.91515087, 0.93637726, 0.95760365, 0.97883004, 1. ])

In [21]: np.interp(np.linspace(0, 1), ecdf, edges)
Out[21]: array([ 3.53879571e-04, 2.04417216e-01, 4.08480553e-01, 6.12543890e-01, 8.16607227e-01, 1.02046852e+00, 1.22251143e+00, 1.42455434e+00, 1.62659724e+00, 1.82864015e+00, 2.02980301e+00, 2.22601775e+00, 2.42223250e+00, 2.61844725e+00, 2.81466200e+00, 3.01047705e+00, 3.19942458e+00, 3.38837211e+00, 3.57731965e+00, 3.76626718e+00, 3.95521472e+00, 4.19462069e+00, 4.44969986e+00, 4.70477903e+00, 4.95985820e+00, 5.15488346e+00, 5.33872431e+00, 5.52256515e+00, 5.70640600e+00, 5.89024684e+00, 6.08569264e+00, 6.29825861e+00, 6.51082459e+00, 6.72339057e+00, 6.93595654e+00, 7.16204944e+00, 7.39393960e+00, 7.62582975e+00, 7.85771991e+00, 8.07294833e+00, 8.26189586e+00, 8.45084340e+00, 8.63979093e+00, 8.82873846e+00, 9.01838365e+00, 9.21459840e+00, 9.41081315e+00, 9.60702789e+00, 9.80324264e+00, 9.99945739e+00])

-- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From ce at vejnar.eu Mon Jul 27 12:20:29 2009 From: ce at vejnar.eu (Charles Vejnar) Date: Mon, 27 Jul 2009 18:20:29 +0200 Subject: [SciPy-User] [SciPy-user] Selection of dominant points In-Reply-To: References: <200907241747.33691.ce@vejnar.eu> Message-ID: <200907271820.30118.ce@vejnar.eu> Hi, Thank you. I wasn't looking in the right direction. After intensive googling, I found Python implementations of the Ramer-Douglas-Peucker algorithm: http://mappinghacks.com/2008/05/05/douglas-peucker-line-simplification-in-python/ It was derived from this one: http://mappinghacks.com/code/PolyLineReduction/ I also found one here: http://cvs.linuxcnc.org/cvs/emc2/lib/python/rs274/author.py?rev=1.1 This algorithm is working perfectly on my data. Thanks again. Charles On Saturday 25 July 2009, Joe Kington wrote: > Well, I thought shapely would have some sort of line-generalization > capabilities, but apparently it doesn't. > > You may already be aware of this, but algorithm-wise, you're looking for > the Douglas-Peucker algorithm, or something similar. > > If you don't want to mess with implementing it yourself, you might want to > look at grass. Grass has a generalize function > that should do what you need, and does have at least some python > bindings.
> I've never actually used the python bindings for grass, > so I can't help you too much there... > > Hope that helps, at any rate! > -Joe > > On Fri, Jul 24, 2009 at 11:45 PM, David Warde-Farley wrote: > > On 24-Jul-09, at 11:47 AM, Charles Vejnar wrote: > > > Hi, > > > > > > I have a curve with millions of points which is non-standard (i.e. > > > it's not a > > > gaussian or ...). > > > > > > I would like to select about 500 points which best describe the curve > > > (something like selecting dominant points). > > > > It would be helpful to know more about the problem. Like: > > - are the points evenly spaced along the independent axis? > > - are they assumed to be samples from a continuous signal? > > - does the signal have any known structure at all? If you could > > describe where these data come from it might help nail down what > > methods would be appropriate. > > > > Regards, > > > > David > > _______________________________________________ > > SciPy-User mailing list > > SciPy-User at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-user > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From permafacture at gmail.com Mon Jul 27 12:34:37 2009 From: permafacture at gmail.com (Elliot Hallmark) Date: Mon, 27 Jul 2009 11:34:37 -0500 Subject: [SciPy-User] optical ray tracing Message-ID: howdy, I'm looking to do some raytracing for a nonimaging design and there are no open-source packages that address the geometry needed. Scipy seems like a good package to become familiar with anyway (I miss Mathematica now that I'm no longer a university student), but I was wondering if anyone has already done some optical ray tracing work in scipy that I could build off of. Wouldn't be too hard to do from scratch, but my focus is on the design and the fewer excursions into building new tools the better. thanks, elliot From daniel.farrell at imperial.ac.uk Mon Jul 27 12:40:50 2009 From: daniel.farrell at imperial.ac.uk (Daniel J Farrell) Date: Mon, 27 Jul 2009 17:40:50 +0100 Subject: [SciPy-User] optical ray tracing In-Reply-To: References: Message-ID: <33B90ACC-16FB-4E22-B0B0-EE84D6218C4F@imperial.ac.uk> Dear Elliot, I am doing some optical ray tracing and would be happy to share my code with you. I started yesterday, believe it or not! What is your application? I'm working on photovoltaics. So far I have written the ray tracing code for plane and axis-aligned box intersection. Today I have added material properties such as refractive index and absorption and emission coefficients, but this bit is far from complete. Does that sound like something you want to work with me on? It would be great if we have a similar common goal. If so I could dump something on Google code. Cheers, Dan On 27 Jul 2009, at 17:34, Elliot Hallmark wrote: > howdy, > > I'm looking to do some raytracing for a nonimaging design and there > are no open-source packages that address the geometry needed. Scipy > seems like a good package to become familiar with anyway (I miss > Mathematica now that I'm no longer a university student), but I was > wondering if anyone has already done some optical ray tracing work in > scipy that I could build off of. > > Wouldn't be too hard to do from scratch, but my focus is on the design > and the fewer excursions into building new tools the better. > > thanks, > > elliot > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user
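For a flavour of the axis-aligned box intersection Daniel describes, here is a minimal sketch of the standard "slab" test -- an illustrative reconstruction, not Daniel's actual code:

import numpy as np

def ray_box_intersect(origin, direction, box_min, box_max):
    """Slab test: return (hit, t_near, t_far) for a ray against an
    axis-aligned box. `direction` need not be normalized."""
    origin = np.asarray(origin, dtype=float)
    direction = np.asarray(direction, dtype=float)
    # parametric distances to both slab planes on each axis; the errstate
    # guard lets rays parallel to an axis divide to +/-inf harmlessly
    with np.errstate(divide='ignore', invalid='ignore'):
        t1 = (np.asarray(box_min, dtype=float) - origin) / direction
        t2 = (np.asarray(box_max, dtype=float) - origin) / direction
    t_near = np.minimum(t1, t2).max()   # last entry across all axes
    t_far = np.maximum(t1, t2).min()    # first exit across all axes
    hit = (t_near <= t_far) and (t_far >= 0.0)
    return hit, t_near, t_far

print(ray_box_intersect([0, 0, -5], [0, 0, 1], [-1, -1, -1], [1, 1, 1]))
# (True, 4.0, 6.0)

A ray origin lying exactly on a slab plane of an axis it is parallel to produces NaNs here; a production version would special-case that.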
From stefan at sun.ac.za Mon Jul 27 12:41:18 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Mon, 27 Jul 2009 18:41:18 +0200 Subject: [SciPy-User] optical ray tracing In-Reply-To: References: Message-ID: <9457e7c80907270941o8eacc38i57f7519febb5b14b@mail.gmail.com> 2009/7/27 Elliot Hallmark : > howdy, > > I'm looking to do some raytracing for a nonimaging design and there > are no open-source packages that address the geometry needed. Scipy > seems like a good package to become familiar with anyway (I miss > Mathematica now that I'm no longer a university student), but I was > wondering if anyone has already done some optical ray tracing work in > scipy that I could build off of. The only work I'm aware of is Matthieu's real-time raytracer with Python bindings: http://matt.eifelle.com/2009/05/19/interactive-raytracer/ Regards Stéfan From permafacture at gmail.com Mon Jul 27 13:31:00 2009 From: permafacture at gmail.com (Elliot Hallmark) Date: Mon, 27 Jul 2009 12:31:00 -0500 Subject: [SciPy-User] optical ray tracing In-Reply-To: <9457e7c80907270941o8eacc38i57f7519febb5b14b@mail.gmail.com> References: <9457e7c80907270941o8eacc38i57f7519febb5b14b@mail.gmail.com> Message-ID: Daniel, I'm working on a nonimaging concentrator for solar thermal. This involves parabolas translated off the optical axis and tilted with their axes crossing the optical axis. Then, nonimaging optics uses the edge-ray principle, so I would have two rays divergent by some angle originating at the very edges of the entry aperture. Pretty simple; tilting the parabolas is the hardest part, followed by determining when a ray hits a reflector. I don't need refraction but it's good to have it available. I'm still looking into some other options but I'll let you know if I go the scipy route. --Stefan-- there are lots of ray tracing programs like that out there, for rendering images rather than designing optics. Not useful for me. But thanks. -elliot From matthieu.brucher at gmail.com Mon Jul 27 13:43:44 2009 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Mon, 27 Jul 2009 19:43:44 +0200 Subject: [SciPy-User] optical ray tracing In-Reply-To: References: <9457e7c80907270941o8eacc38i57f7519febb5b14b@mail.gmail.com> Message-ID: > --Stefan-- > > there are lots of ray tracing programs like that out there, for rendering > images rather than designing optics. Not useful for me. But thanks. Indeed, my raytracer is not adequate for this (besides, it's nowhere near completion :() Matthieu -- Information System Engineer, Ph.D. Website: http://matthieu-brucher.developpez.com/ Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 LinkedIn: http://www.linkedin.com/in/matthieubrucher From josephsmidt at gmail.com Tue Jul 28 21:34:43 2009 From: josephsmidt at gmail.com (Joseph Smidt) Date: Tue, 28 Jul 2009 18:34:43 -0700 Subject: [SciPy-User] Easy Way To Find Confidence Intervals For 2D Data? Message-ID: <142682e10907281834w6bd3f7ebm7df2ddc0b28ecc1b@mail.gmail.com> I have a function on a 2d grid that looks like a skewed mound. I would like to calculate everything in the 68% (one sigma or standard deviation) confidence interval, and 95% (two sigma) as seen in this plot: http://lambda.gsfc.nasa.gov/product/map/current/pub_papers/threeyear/parameters/images/Med/ds_f07_PPT_M.png.
(The first line is 68% and the second line is 95% confidence intervals.) I would like to set everything in the 68% confidence interval to 1, 95% to 2 and everything else to 0. Thanks. Joseph Smidt -- ------------------------------------------------------------------------ Joseph Smidt Physics and Astronomy 4129 Frederick Reines Hall Irvine, CA 92697-4575 Office: 949-824-3269 From robert.kern at gmail.com Tue Jul 28 22:33:17 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 28 Jul 2009 21:33:17 -0500 Subject: [SciPy-User] Easy Way To Find Confidence Intervals For 2D Data? In-Reply-To: <142682e10907281834w6bd3f7ebm7df2ddc0b28ecc1b@mail.gmail.com> References: <142682e10907281834w6bd3f7ebm7df2ddc0b28ecc1b@mail.gmail.com> Message-ID: <3d375d730907281933l40aa77d8p72d9172c55cedfe@mail.gmail.com> On Tue, Jul 28, 2009 at 20:34, Joseph Smidt wrote: > I have a function on a 2d grid that looks like a skewed mound. I would > like to calculate everything in the 68% (one sigma or standard > deviation) confidence interval, and 95% (two sigma) as seen in > this plot: http://lambda.gsfc.nasa.gov/product/map/current/pub_papers/threeyear/parameters/images/Med/ds_f07_PPT_M.png. > (The first line is 68% and the second line is 95% confidence intervals.) > > I would like to set everything in the 68% confidence interval to > 1, 95% to 2 and everything else to 0. Thanks. It depends somewhat on what that function is. Is it a proper PDF? Let's say it is. I'll assume that the data has been normalized such that the sum of all the entries is 1. If not you'll have to keep track of that normalization constant. Let's call this array normed_pdf. What you are going to do is make a function that takes a value x, computes the sum normed_pdf[normed_pdf > x].sum(), and then subtracts the target value, say 0.68. Then use a root finder from scipy.optimize to find out where this function equals 0. Repeat for 0.95. Now just do a contour plot using those two values as the contour lines; e.g. in matplotlib, use the contour(Z, V) form. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco
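A minimal sketch of that recipe, with scipy.optimize.brentq standing in for "a root finder from scipy.optimize" and a made-up skewed Gaussian grid standing in for the real data (all names here are illustrative):

import numpy as np
from scipy.optimize import brentq

# stand-in "skewed mound"; any non-negative 2-d grid works the same way
y, x = np.mgrid[-3:3:200j, -3:3:200j]
pdf = np.exp(-(x ** 2 + 2 * (y - 0.5 * x) ** 2))
normed_pdf = pdf / pdf.sum()           # entries now sum to 1

def excess_mass(level, target):
    """Probability mass enclosed above `level`, minus the target coverage."""
    return normed_pdf[normed_pdf > level].sum() - target

# bracket between 0 and the peak; brentq finds the contour level whose
# enclosed mass equals the requested coverage
levels = [brentq(excess_mass, 0.0, normed_pdf.max(), args=(t,))
          for t in (0.95, 0.68)]       # increasing levels, as contour() wants

# e.g. with matplotlib:
#   import matplotlib.pyplot as plt
#   plt.contour(normed_pdf, levels); plt.show()

# Joseph's 0/1/2 labelling then falls out of two comparisons:
region = np.zeros(normed_pdf.shape, dtype=int)
region[normed_pdf > levels[0]] = 2     # inside the 95% contour
region[normed_pdf > levels[1]] = 1     # inside the 68% core

Strictly speaking excess_mass is a step function of level, but with tens of thousands of grid cells the steps are tiny and brentq converges without trouble.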
From josephsmidt at gmail.com Wed Jul 29 01:07:09 2009 From: josephsmidt at gmail.com (Joseph Smidt) Date: Tue, 28 Jul 2009 22:07:09 -0700 Subject: [SciPy-User] Easy Way To Find Confidence Intervals For 2D Data? In-Reply-To: <3d375d730907281933l40aa77d8p72d9172c55cedfe@mail.gmail.com> References: <142682e10907281834w6bd3f7ebm7df2ddc0b28ecc1b@mail.gmail.com> <3d375d730907281933l40aa77d8p72d9172c55cedfe@mail.gmail.com> Message-ID: <142682e10907282207n28ae8e36w4729637046e9f327@mail.gmail.com> Thanks, that's what I needed! On Tue, Jul 28, 2009 at 7:33 PM, Robert Kern wrote: > On Tue, Jul 28, 2009 at 20:34, Joseph Smidt wrote: >> I have a function on a 2d grid that looks like a skewed mound. I would >> like to calculate everything in the 68% (one sigma or standard >> deviation) confidence interval, and 95% (two sigma) as seen in >> this plot: http://lambda.gsfc.nasa.gov/product/map/current/pub_papers/threeyear/parameters/images/Med/ds_f07_PPT_M.png. >> (The first line is 68% and the second line is 95% confidence intervals.) >> >> I would like to set everything in the 68% confidence interval to >> 1, 95% to 2 and everything else to 0. Thanks. > > It depends somewhat on what that function is. Is it a proper PDF? > Let's say it is. > > I'll assume that the data has been normalized such that the sum of all > the entries is 1. If not you'll have to keep track of that > normalization constant. Let's call this array normed_pdf. What you are > going to do is make a function that takes a value x, computes the sum > normed_pdf[normed_pdf > x].sum(), and then subtracts the target value, > say 0.68. Then use a root finder from scipy.optimize to find out where > this function equals 0. Repeat for 0.95. Now just do a contour plot > using those two values as the contour lines; e.g. in matplotlib, use > the contour(Z, V) form. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -- ------------------------------------------------------------------------ Joseph Smidt Physics and Astronomy 4129 Frederick Reines Hall Irvine, CA 92697-4575 Office: 949-824-3269 From emmanuelle.gouillart at normalesup.org Wed Jul 29 04:42:54 2009 From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart) Date: Wed, 29 Jul 2009 10:42:54 +0200 Subject: [SciPy-User] [Numpy-discussion] [SciPy-user] Numpy/Scipy and the Python African Tour In-Reply-To: References: <20090723141737.GF10321@phare.normalesup.org> Message-ID: <20090729084254.GB22115@phare.normalesup.org> Dear Fernando and Stéfan, thank you for the very encouraging answers! On Tue, Jul 28, 2009 at 10:39:44AM -0700, Fernando Perez wrote: > I'm glad to hear that you've set up a mailing list for your students > to continue growing, but I'd also encourage them to subscribe to the > 'normal' numpy/scipy lists. Even if they initially feel a bit shy > about posting, simply 'lurking' on the lists for a while and reading > the flow of information can be very useful both to learn technical > ideas and to get familiar with the discussion patterns of a community. > This will ease them into naturally engaging the upstream projects > when they feel comfortable, which I think in the long term is both > useful and necessary. I totally agree: I've already prompted the scientists on the list to subscribe also to scipy-user and numpy-discussion. I also forwarded your mail to the Dakar list, saying that if the Great Creator of IPython is saying the same thing, they really have no choice but to do it :D! The local list is only meant to be a temporary solution to help the students to gain more confidence; of course, we all know of temporary fixes that have stayed unchanged for years :D, so we should be wary of inertia. The good thing is that, as you say, it is possible to start posting on the local list and get answers to short-term problems, and, at the same time, lurk on the normal lists and learn tons of things. > Mmh, I just had a thought. A number of us have by now taught > reasonably organized workshops of this kind in different places and > institutions. I wonder if it wouldn't be a good idea to have a page on > the scipy site collecting this information, with a short paragraph > about the event and a link to any additional external materials that > may exist. This could encourage others to create similar things in > their own communities, whether with the help of one of the previous > presenters or fully independently. Does that sound useful? This sounds like a great idea to me.
This page could be linked from the main documentation page. If the page is created, I'd be glad to contribute my slides (including sources, of course), and write a short blurb about the Dakar event. Regarding Stéfan's suggestion to find a volunteer for translating the slides from French into English, I will ask the Dakar students as well as the students in my lab. If nobody volunteers I'll end up doing it myself, because slides only in French are of limited use... Of course, if you know enthusiastic people with a lot of free time, who know French and English, you're very welcome to probe their interest in this task! Cheers, Emmanuelle From harald.schilly at gmail.com Wed Jul 29 05:12:39 2009 From: harald.schilly at gmail.com (Harald Schilly) Date: Wed, 29 Jul 2009 11:12:39 +0200 Subject: [SciPy-User] [Numpy-discussion] [SciPy-user] Numpy/Scipy and the Python African Tour In-Reply-To: <20090729084254.GB22115@phare.normalesup.org> References: <20090723141737.GF10321@phare.normalesup.org> <20090729084254.GB22115@phare.normalesup.org> Message-ID: <20548feb0907290212j263c0eccm4a7030f8b9e9e48a@mail.gmail.com> On Wed, Jul 29, 2009 at 10:42, Emmanuelle Gouillart wrote: > Regarding Stéfan's suggestion to find > a volunteer for translating the slides from French into English, I will > ask the Dakar students as well as the students in my lab. If nobody > volunteers I'll end up doing it myself, because slides only in French are > of limited use... There is a new tool by Google that might help you, Translator Toolkit, where you can see the text document side by side with the aided translation, and collaborate and share your work. Right now English is the only source language, but I guess this will change soon. (http://translate.google.com/support/toolkit/bin/answer.py?hl=en&answer=152100) It's here: http://translate.google.com/toolkit Harald From gokhansever at gmail.com Wed Jul 29 10:30:20 2009 From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_SEVER?=) Date: Wed, 29 Jul 2009 09:30:20 -0500 Subject: [SciPy-User] Curse of recursive directory walking Message-ID: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com> I have a simple Python script written to walk through a given root directory, spot and read the right data files, and plot some of the desired variables within them for first-stage data inspection. The script, dccn_plot_all.py, is given below -- dccn_plot being the main action script:

#!/usr/bin/env python

# dccn_plot_all

import os
from subprocess import call

init = os.getcwd()

for root, dirs, files in os.walk('.'):
    for file in files:
        if file.split('.')[1] == 'sau' and not file.endswith('.nc'):
            print file
            os.chdir(root)
            print os.getcwd()
            call(['dccn_plot', file])
            os.chdir(init)

Here is the fun part: While I was testing the script, I mistakenly called this dccn_plot_all script itself within the os.walk suite. This resulted in an almost complete loss of my Bash shell access on my Fedora 11 box and made the window manager unreachable as well. To recover, I had to restart the system. The lesson learnt: be extremely cautious when working near recursive functions. -- Gökhan -------------- next part -------------- An HTML attachment was scrubbed... URL:
From josef.pktd at gmail.com Wed Jul 29 10:48:47 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 29 Jul 2009 10:48:47 -0400 Subject: [SciPy-User] Curse of recursive directory walking In-Reply-To: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com> References: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com> Message-ID: <1cd32cbb0907290748yfe30fcctbf69252b96039d03@mail.gmail.com> On Wed, Jul 29, 2009 at 10:30 AM, Gökhan SEVER wrote: > I have a simple Python script written to walk through a given root > directory, spot and read the right data files, and plot some of the > desired variables within them for first-stage data inspection. > > The script, dccn_plot_all.py, is given below -- dccn_plot being the main > action script:
>
> #!/usr/bin/env python
>
> # dccn_plot_all
>
> import os
> from subprocess import call
>
> init = os.getcwd()
>
> for root, dirs, files in os.walk('.'):
>     for file in files:
>         if file.split('.')[1] == 'sau' and not file.endswith('.nc'):
>             print file
>             os.chdir(root)
>             print os.getcwd()
>             call(['dccn_plot', file])
>             os.chdir(init)
>
> Here is the fun part: While I was testing the script, I mistakenly called > this dccn_plot_all script itself within the os.walk suite. This resulted > in an almost complete loss of my Bash shell access on my Fedora 11 box > and made the window manager unreachable as well. To recover, I had to > restart the system. > > The lesson learnt: be extremely cautious when working near recursive > functions. To me it looks more like it is a problem with relative file paths and changing directories and not with recursion. I never had problems with os.walk or any home-made recursive directory walker, but I always used absolute paths. from the python 2.5 docs Note: If you pass a relative pathname, don't change the current working directory between resumptions of walk(). walk() never changes the current directory, and assumes that its caller doesn't either. Josef > > -- > Gökhan > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > >
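A sketch of the script reworked along the lines Josef suggests -- absolute paths built with os.path.join and no os.chdir() at all, handing the working directory to the subprocess instead (dccn_plot is the poster's own command, so this is illustrative rather than tested):

#!/usr/bin/env python
# dccn_plot_all, chdir-free: the relative-path caveat in the os.walk
# docs never applies because we walk an absolute path and never chdir.
import os
from subprocess import call

top = os.path.abspath('.')
for root, dirs, files in os.walk(top):
    for name in files:
        base, ext = os.path.splitext(name)
        if ext == '.sau':                       # '.sau.nc' splits to '.nc', so no extra check needed
            path = os.path.join(root, name)     # absolute path to the data file
            print(path)
            call(['dccn_plot', path], cwd=root) # subprocess gets its cwd explicitly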
However when I run the same script on a network mounted drive (using the same directory structure) it walks through them in an alphabetically sorted order. Have you encountered a similar behaviour before? Thanks. -- G?khan -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Wed Jul 29 19:25:50 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 29 Jul 2009 19:25:50 -0400 Subject: [SciPy-User] Curse of recursive directory walking In-Reply-To: <49d6b3500907291414k7b934179yf35a0f85b19c8e17@mail.gmail.com> References: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com> <1cd32cbb0907290748yfe30fcctbf69252b96039d03@mail.gmail.com> <49d6b3500907291414k7b934179yf35a0f85b19c8e17@mail.gmail.com> Message-ID: <1cd32cbb0907291625m5d876beemf721ce98b97461c1@mail.gmail.com> On Wed, Jul 29, 2009 at 5:14 PM, G?khan SEVER wrote: > > > On Wed, Jul 29, 2009 at 9:48 AM, wrote: >> >> To me it looks more like it is a problem with relative file paths and >> changing directories and not with recursion. >> I never had problems with os.walk or any home made recursive directory >> walker, but I always used absolute paths. >> >> from the python 2.5 docs >> >> Note: If you pass a relative pathname, don't change the current >> working directory between resumptions of walk(). walk() never changes >> the current directory, and assumes that its caller doesn't either. >> >> Josef > > > I have one more question on os.walk() > > For some reason while this function working on my local folders I walks > through them in a randomly manner. However when I run the same script on a > network mounted drive (using the same directory structure) it walks through > them in an alphabetically sorted order. > > Have you encountered a similar behaviour before? I only use windows, and it looks like I get windows fileordering, alphabetically, independent of capitalization, and not capitalized names sorted first. So I assume that the ordering depends on the operating system. to get deterministic ordering, inplace sorting should be possible, but I never tried python help: When topdown is true, the caller can modify the dirnames list in-place help for os.listdir says about the list of directory entries: "The list is in arbitrary order" Josef > > Thanks. > > -- > G?khan > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > From gokhansever at gmail.com Wed Jul 29 23:19:10 2009 From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_SEVER?=) Date: Wed, 29 Jul 2009 22:19:10 -0500 Subject: [SciPy-User] Curse of recursive directory walking In-Reply-To: <1cd32cbb0907291625m5d876beemf721ce98b97461c1@mail.gmail.com> References: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com> <1cd32cbb0907290748yfe30fcctbf69252b96039d03@mail.gmail.com> <49d6b3500907291414k7b934179yf35a0f85b19c8e17@mail.gmail.com> <1cd32cbb0907291625m5d876beemf721ce98b97461c1@mail.gmail.com> Message-ID: <49d6b3500907292019g7e7b91ddsc447717ced787c63@mail.gmail.com> On Wed, Jul 29, 2009 at 6:25 PM, wrote: > On Wed, Jul 29, 2009 at 5:14 PM, G?khan SEVER > wrote: > > > > > > On Wed, Jul 29, 2009 at 9:48 AM, wrote: > >> > >> To me it looks more like it is a problem with relative file paths and > >> changing directories and not with recursion. > >> I never had problems with os.walk or any home made recursive directory > >> walker, but I always used absolute paths. 
> >> > >> from the python 2.5 docs > >> > >> Note: If you pass a relative pathname, don't change the current > >> working directory between resumptions of walk(). walk() never changes > >> the current directory, and assumes that its caller doesn't either. > >> > >> Josef > > > > > > I have one more question on os.walk() > > > > For some reason while this function working on my local folders I walks > > through them in a randomly manner. However when I run the same script on > a > > network mounted drive (using the same directory structure) it walks > through > > them in an alphabetically sorted order. > > > > Have you encountered a similar behaviour before? > > I only use windows, and it looks like I get windows fileordering, > alphabetically, independent of capitalization, and not capitalized > names sorted first. So I assume that the ordering depends on the > operating system. > > to get deterministic ordering, inplace sorting should be possible, but > I never tried > python help: When topdown is true, the caller can modify the dirnames > list in-place > > help for os.listdir says about the list of directory entries: "The > list is in arbitrary order" > > Josef > > This is Fedora 11. I still couldn't figure out why os.walk() preserves the fileordering properly on samba mounted folders but not on my same local copies. The weird thing is it always walks the given directory tree in the same arbitrary order whenever I called the script. Inplace listing may not be a good option, 1st I don't know how to do that, and second when a directory under a given path is reached and if the desired file exists under that directory it will execute my plotting script. It works but a bizarre way :) Maybe I should ask on a python mailing list. -- G?khan -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Thu Jul 30 00:42:35 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 29 Jul 2009 23:42:35 -0500 Subject: [SciPy-User] Curse of recursive directory walking In-Reply-To: <49d6b3500907292019g7e7b91ddsc447717ced787c63@mail.gmail.com> References: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com> <1cd32cbb0907290748yfe30fcctbf69252b96039d03@mail.gmail.com> <49d6b3500907291414k7b934179yf35a0f85b19c8e17@mail.gmail.com> <1cd32cbb0907291625m5d876beemf721ce98b97461c1@mail.gmail.com> <49d6b3500907292019g7e7b91ddsc447717ced787c63@mail.gmail.com> Message-ID: <3d375d730907292142l604379c4y8a286e30a899b353@mail.gmail.com> On Wed, Jul 29, 2009 at 22:19, G?khan SEVER wrote: > I still couldn't figure out why os.walk() preserves the fileordering > properly on samba mounted folders but not on my same local copies. The weird > thing is it always walks the given directory tree in the same arbitrary > order whenever I called the script. The implementation of the filesystem determines what order you get them back in. That's why you got a consistent arbitrary order in your local copies. It is also why you get a consistent arbitrary order in your Samba folders; Samba's arbitrary ordering just happens to be what you consider to be properly sorted. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From bala.biophysics at gmail.com Thu Jul 30 01:32:50 2009 From: bala.biophysics at gmail.com (Bala subramanian) Date: Thu, 30 Jul 2009 07:32:50 +0200 Subject: [SciPy-User] fitting data Message-ID: <288df32a0907292232xef8efc0k6273b3076c5d77d2@mail.gmail.com> Friends, I am new and just joined the scipy community. I want to fit (least sq. fit) my data to the following equation. I would greatly appreciate it if someone could help me with a sample code on how fitting is done with scipy. The function is: f = a * fb + (1 - a) * fo, where f, fb, fo are known. Regards, Bala -------------- next part -------------- An HTML attachment was scrubbed... URL: From sccolbert at gmail.com Thu Jul 30 01:46:47 2009 From: sccolbert at gmail.com (Chris Colbert) Date: Thu, 30 Jul 2009 01:46:47 -0400 Subject: [SciPy-User] fitting data In-Reply-To: <288df32a0907292232xef8efc0k6273b3076c5d77d2@mail.gmail.com> References: <288df32a0907292232xef8efc0k6273b3076c5d77d2@mail.gmail.com> Message-ID: <7f014ea60907292246j6218b32cu9be604c792ebafbc@mail.gmail.com> Maybe I don't understand what you are asking, but if (a) is your only unknown, just do: a = (f - f0) / (fb - f0) On Thu, Jul 30, 2009 at 1:32 AM, Bala subramanian wrote: > Friends, > > I am new and just joined the scipy community. I want to fit (least sq. fit) > my data to the following equation. I would greatly appreciate it if someone > could help me with a sample code on how fitting is done with scipy. > > The function is: f = a * fb + (1 - a) * fo, where f, fb, fo are known. > > Regards, > Bala > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > >
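When f, fb and fo are arrays of measurements rather than single numbers, Chris's algebra turns into a one-parameter linear least-squares problem, since f - fo = a * (fb - fo). A minimal sketch with np.linalg.lstsq (all array names and the synthetic data are illustrative):

import numpy as np

def fit_a(f, fb, fo):
    """Least-squares estimate of a in  f = a * fb + (1 - a) * fo."""
    d = np.asarray(fb) - np.asarray(fo)
    r = np.asarray(f) - np.asarray(fo)
    # one-column design matrix; lstsq returns the best-fit a
    a, residues, rank, sv = np.linalg.lstsq(d[:, np.newaxis], r)
    return a[0]

# synthetic check: recover a known a = 0.3 from noisy data
fb = np.linspace(1.0, 2.0, 50)
fo = np.linspace(0.2, 0.4, 50)
f = 0.3 * fb + 0.7 * fo + np.random.normal(0, 0.01, 50)
print(fit_a(f, fb, fo))   # ~0.3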
>> >> Regards, >> Bala >> >> >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> >> >> > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From wierob83 at googlemail.com Thu Jul 30 03:01:59 2009 From: wierob83 at googlemail.com (wierob) Date: Thu, 30 Jul 2009 09:01:59 +0200 Subject: [SciPy-User] fitting data In-Reply-To: <288df32a0907292232xef8efc0k6273b3076c5d77d2@mail.gmail.com> References: <288df32a0907292232xef8efc0k6273b3076c5d77d2@mail.gmail.com> Message-ID: <4A714567.1040900@googlemail.com> maybe this helps: http://www.scipy.org/Cookbook/OLS Bala subramanian schrieb: > Friends, > > I am new and just joined the scipy community. I want to fit (least sq. > fit) my data to the following equation. I would greatly appreciate if > someone could help me with a sample code on how fitting is done with > scipy. > > The function is : f = a * fb + (1-a) fo, where f, fb, fo are known. > > Regards, > Bala > > > ------------------------------------------------------------------------ > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From emmanuelle.gouillart at normalesup.org Thu Jul 30 03:08:27 2009 From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart) Date: Thu, 30 Jul 2009 09:08:27 +0200 Subject: [SciPy-User] fitting data In-Reply-To: <4A71346F.3070407@lpta.in2p3.fr> References: <288df32a0907292232xef8efc0k6273b3076c5d77d2@mail.gmail.com> <7f014ea60907292246j6218b32cu9be604c792ebafbc@mail.gmail.com> <4A71346F.3070407@lpta.in2p3.fr> Message-ID: <20090730070827.GA18424@phare.normalesup.org> > from scipy import optimize > def func(x,a): return a * fb(x) + (1-a) fo(x) > fitp,cov=optimize.curve_fit(func,x,p) There is no optimize.curve_fit im my version scipy: >>> from scipy import optimize >>> help optimize.curve_fit --> help(optimize.curve_fit) ------------------------------------------------------------ Traceback (most recent call last): File "", line 1, in AttributeError: 'module' object has no attribute 'curve_fit' >>> import scipy >>> scipy.__version__ '0.7.0' Am I missing something? Bala, one of the classical solutions is to use scipy.optimize.leastsq; there is a whole page on the scipy Cookbook dedicated to fitting problems http://www.scipy.org/Cookbook/FittingData. The page provides many examples of how to use leastsq for fitting data. Cheers, Emmanuelle From scott.sinclair.za at gmail.com Thu Jul 30 03:21:02 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Thu, 30 Jul 2009 09:21:02 +0200 Subject: [SciPy-User] fitting data In-Reply-To: <20090730070827.GA18424@phare.normalesup.org> References: <288df32a0907292232xef8efc0k6273b3076c5d77d2@mail.gmail.com> <7f014ea60907292246j6218b32cu9be604c792ebafbc@mail.gmail.com> <4A71346F.3070407@lpta.in2p3.fr> <20090730070827.GA18424@phare.normalesup.org> Message-ID: <6a17e9ee0907300021p165bb09eu6d23cba43fee1328@mail.gmail.com> > 2009/7/30 Emmanuelle Gouillart : > ? ? ? 
?There is no optimize.curve_fit im my version scipy: >>>> from scipy import optimize >>>> help optimize.curve_fit > --> help(optimize.curve_fit) > ------------------------------------------------------------ > Traceback (most recent call last): > ?File "", line 1, in > AttributeError: 'module' object has no attribute 'curve_fit' >>>> import scipy >>>> scipy.__version__ > '0.7.0' > > ? ? ? ?Am I missing something? No, it's only in SVN. It was added a few days after the 0.7 release. http://projects.scipy.org/scipy/changeset/5543/trunk/scipy/optimize/minpack.py Cheers, Scott From bala.biophysics at gmail.com Thu Jul 30 03:22:50 2009 From: bala.biophysics at gmail.com (Bala subramanian) Date: Thu, 30 Jul 2009 09:22:50 +0200 Subject: [SciPy-User] fitting data In-Reply-To: <4A71346F.3070407@lpta.in2p3.fr> References: <288df32a0907292232xef8efc0k6273b3076c5d77d2@mail.gmail.com> <7f014ea60907292246j6218b32cu9be604c792ebafbc@mail.gmail.com> <4A71346F.3070407@lpta.in2p3.fr> Message-ID: <288df32a0907300022v5f5e424k7d78ed4e8e826064@mail.gmail.com> Sorry for not giving clear info. in my previous mail. Actually, i have two equations 1) f = a * fb + (1-a) fo, where f, fb, fo are known and 2) a = 1/2ao[ (ao + x + bo) - { (ao + x + bo)2 - 4 ao bo }0.5 ] >From eq 1, i should get 'a' and then i use this in eq 2 to find bo. On Thu, Jul 30, 2009 at 7:49 AM, Johann Cohen-Tanugi wrote: > I thinkk he means that f0 and fb are two known function of x, and he is > interested in the weight parameter a. > You can try : > from scipy import optimize > def func(x,a): return a * fb(x) + (1-a) fo(x) > fitp,cov=optimize.curve_fit(func,x,p) > > that is one of many options.... > Johann > > > > Chris Colbert wrote: > > maybe i don't understand what you are asking, but if (a) is your only > > unknown, just do: > > > > a = (f - f0) / (fb - f0) > > > > > > > > On Thu, Jul 30, 2009 at 1:32 AM, Bala > > subramanian wrote: > > > >> Friends, > >> > >> I am new and just joined the scipy community. I want to fit (least sq. > fit) > >> my data to the following equation. I would greatly appreciate if someone > >> could help me with a sample code on how fitting is done with scipy. > >> > >> The function is : f = a * fb + (1-a) fo, where f, fb, fo are known. > >> > >> Regards, > >> Bala > >> > >> > >> > >> _______________________________________________ > >> SciPy-User mailing list > >> SciPy-User at scipy.org > >> http://mail.scipy.org/mailman/listinfo/scipy-user > >> > >> > >> > > _______________________________________________ > > SciPy-User mailing list > > SciPy-User at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-user > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cohen at lpta.in2p3.fr Thu Jul 30 03:20:03 2009 From: cohen at lpta.in2p3.fr (Johann Cohen-Tanugi) Date: Thu, 30 Jul 2009 09:20:03 +0200 Subject: [SciPy-User] fitting data In-Reply-To: <6a17e9ee0907300021p165bb09eu6d23cba43fee1328@mail.gmail.com> References: <288df32a0907292232xef8efc0k6273b3076c5d77d2@mail.gmail.com> <7f014ea60907292246j6218b32cu9be604c792ebafbc@mail.gmail.com> <4A71346F.3070407@lpta.in2p3.fr> <20090730070827.GA18424@phare.normalesup.org> <6a17e9ee0907300021p165bb09eu6d23cba43fee1328@mail.gmail.com> Message-ID: <4A7149A3.4090907@lpta.in2p3.fr> oops sorry, I did not pay attention. 
Yes Emmanuelle is right, leastsqr is the way to go, only very slightly more involved for a beginner, the cookbook page is great to learn to use it. Johann Scott Sinclair wrote: >> 2009/7/30 Emmanuelle Gouillart : >> There is no optimize.curve_fit im my version scipy: >> >>>>> from scipy import optimize >>>>> help optimize.curve_fit >>>>> >> --> help(optimize.curve_fit) >> ------------------------------------------------------------ >> Traceback (most recent call last): >> File "", line 1, in >> AttributeError: 'module' object has no attribute 'curve_fit' >> >>>>> import scipy >>>>> scipy.__version__ >>>>> >> '0.7.0' >> >> Am I missing something? >> > > No, it's only in SVN. It was added a few days after the 0.7 release. > > http://projects.scipy.org/scipy/changeset/5543/trunk/scipy/optimize/minpack.py > > Cheers, > Scott > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From cohen at lpta.in2p3.fr Thu Jul 30 03:34:36 2009 From: cohen at lpta.in2p3.fr (Johann Cohen-Tanugi) Date: Thu, 30 Jul 2009 09:34:36 +0200 Subject: [SciPy-User] fitting data In-Reply-To: <288df32a0907300022v5f5e424k7d78ed4e8e826064@mail.gmail.com> References: <288df32a0907292232xef8efc0k6273b3076c5d77d2@mail.gmail.com> <7f014ea60907292246j6218b32cu9be604c792ebafbc@mail.gmail.com> <4A71346F.3070407@lpta.in2p3.fr> <288df32a0907300022v5f5e424k7d78ed4e8e826064@mail.gmail.com> Message-ID: <4A714D0C.4060401@lpta.in2p3.fr> Bala, I think that you are mistyping your problem : the second equation looks a lot like one of the 2 solutions of a second order polynom a0*X^2 -2*(a0+x+b0)*X+b0 Is that right? In any case, replace a by (f-f0)/(fb-f0) in eq.2 and then you have an implicit function g(x,b0) Johann Bala subramanian wrote: > Sorry for not giving clear info. in my previous mail. > > Actually, i have two equations 1) f = a * fb + (1-a) fo, where f, fb, > fo are known > > and 2) a = 1/2ao[ (ao + x + bo) - { (ao + x + bo)2 - 4 ao bo }0.5 ] > > From eq 1, i should get 'a' and then i use this in eq 2 to find bo. > > On Thu, Jul 30, 2009 at 7:49 AM, Johann Cohen-Tanugi > > wrote: > > I thinkk he means that f0 and fb are two known function of x, and > he is > interested in the weight parameter a. > You can try : > from scipy import optimize > def func(x,a): return a * fb(x) + (1-a) fo(x) > fitp,cov=optimize.curve_fit(func,x,p) > > that is one of many options.... > Johann > > > > Chris Colbert wrote: > > maybe i don't understand what you are asking, but if (a) is your > only > > unknown, just do: > > > > a = (f - f0) / (fb - f0) > > > > > > > > On Thu, Jul 30, 2009 at 1:32 AM, Bala > > subramanian > wrote: > > > >> Friends, > >> > >> I am new and just joined the scipy community. I want to fit > (least sq. fit) > >> my data to the following equation. I would greatly appreciate > if someone > >> could help me with a sample code on how fitting is done with scipy. > >> > >> The function is : f = a * fb + (1-a) fo, where f, fb, fo are known. 
> >> > >> Regards, > >> Bala > >> > >> > >> > >> _______________________________________________ > >> SciPy-User mailing list > >> SciPy-User at scipy.org > >> http://mail.scipy.org/mailman/listinfo/scipy-user > >> > >> > >> > > _______________________________________________ > > SciPy-User mailing list > > SciPy-User at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-user > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > From cohen at lpta.in2p3.fr Thu Jul 30 04:08:21 2009 From: cohen at lpta.in2p3.fr (Johann Cohen-Tanugi) Date: Thu, 30 Jul 2009 10:08:21 +0200 Subject: [SciPy-User] Easy Way To Find Confidence Intervals For 2D Data? In-Reply-To: <3d375d730907281933l40aa77d8p72d9172c55cedfe@mail.gmail.com> References: <142682e10907281834w6bd3f7ebm7df2ddc0b28ecc1b@mail.gmail.com> <3d375d730907281933l40aa77d8p72d9172c55cedfe@mail.gmail.com> Message-ID: <4A7154F5.6070803@lpta.in2p3.fr> sounds like a perfect call for a scipy cookbook page! JCT Robert Kern wrote: > On Tue, Jul 28, 2009 at 20:34, Joseph Smidt wrote: > >> I have a function on a 2d grid that looks like a skewed mound. I would >> like to calculate everything in the 68% (one sigma or standard >> deviation) confidence interval, and 95% (two sigma) as seen done in >> this plot: http://lambda.gsfc.nasa.gov/product/map/current/pub_papers/threeyear/parameters/images/Med/ds_f07_PPT_M.png. >> (The first line is 68% and second line is 95% confidence intervals.) >> >> I would like to the set everything in the 68% confidence interval to >> 1, 95% to 2 and everything else to 0. Thanks. >> > > It depends somewhat on what that function is. Is it a proper PDF? > Let's say it is. > > I'll assume that the data has been normalized such that the sum of all > the entries is 1. If not you'll have to keep track of that > normalization constant. Let's call this array normed_pdf. What you are > going to do is make a function that takes a value x, computes the sum > normed_pdf[normed_pdf > x].sum(), and then subtracts the target value, > say 0.68. Then use a root finder from scipy.optimize to find out where > this function equals 0. Repeat for 0.95. Now just do a contour plot > using those two values as the contour lines; e.g. in matplotlib, use > the contour(Z, V) form. > > From harald.schilly at gmail.com Thu Jul 30 05:38:52 2009 From: harald.schilly at gmail.com (Harald Schilly) Date: Thu, 30 Jul 2009 11:38:52 +0200 Subject: [SciPy-User] Curse of recursive directory walking In-Reply-To: <49d6b3500907291414k7b934179yf35a0f85b19c8e17@mail.gmail.com> References: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com> <1cd32cbb0907290748yfe30fcctbf69252b96039d03@mail.gmail.com> <49d6b3500907291414k7b934179yf35a0f85b19c8e17@mail.gmail.com> Message-ID: <20548feb0907300238x35ed7fbbmc61d79ca15fc1322@mail.gmail.com> On Wed, Jul 29, 2009 at 23:14, G?khan SEVER wrote: > For some reason while this function working on my local folders I walks > through them in a randomly manner. This simply depends on the file system and the OS. Maybe your random order is just the sorting given by the inode index number! 
H From josef.pktd at gmail.com Thu Jul 30 09:04:00 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 30 Jul 2009 09:04:00 -0400 Subject: [SciPy-User] Curse of recursive directory walking In-Reply-To: <49d6b3500907292019g7e7b91ddsc447717ced787c63@mail.gmail.com> References: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com> <1cd32cbb0907290748yfe30fcctbf69252b96039d03@mail.gmail.com> <49d6b3500907291414k7b934179yf35a0f85b19c8e17@mail.gmail.com> <1cd32cbb0907291625m5d876beemf721ce98b97461c1@mail.gmail.com> <49d6b3500907292019g7e7b91ddsc447717ced787c63@mail.gmail.com> Message-ID: <1cd32cbb0907300604p571eecf7r2b26f2b147f3e9d3@mail.gmail.com> On Wed, Jul 29, 2009 at 11:19 PM, Gökhan SEVER wrote: > > > On Wed, Jul 29, 2009 at 6:25 PM, wrote: >> >> On Wed, Jul 29, 2009 at 5:14 PM, Gökhan SEVER >> wrote: >> > >> > I have one more question on os.walk(). >> > >> > For some reason, when this function works on my local folders, it walks >> > through them in a random order. However, when I run the same script on a >> > network-mounted drive (using the same directory structure), it walks >> > through them in alphabetically sorted order. >> > >> > Have you encountered similar behaviour before? >> >> I only use Windows, and it looks like I get Windows file ordering: >> alphabetical, independent of capitalization, and not with capitalized >> names sorted first. So I assume that the ordering depends on the >> operating system. >> >> To get deterministic ordering, in-place sorting should be possible, but >> I never tried it. >> python help: When topdown is true, the caller can modify the dirnames >> list in-place >> >> help for os.listdir says about the list of directory entries: "The >> list is in arbitrary order" >> >> Josef > > This is Fedora 11. > > I still couldn't figure out why os.walk() preserves the file ordering > properly on Samba-mounted folders but not on my local copies of the same > directories. The weird thing is that it always walks the given directory > tree in the same arbitrary order whenever I call the script. > > In-place sorting may not be a good option: first, I don't know how to do > that, and second, when a directory under the given path is reached and the > desired file exists under that directory, my plotting script is executed. > It works, but in a bizarre way :) > > Maybe I should ask on a python mailing list. > > -- > Gökhan > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > something like this might work.

>>> for r, d, f in os.walk(r'C:\path\of\directory'):
        d.sort()
        d.reverse()
        print r

this walks the tree in reverse python-alphabetical order (capital letters first) Josef
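The same trick extends to the filenames, which os.walk also returns in arbitrary order; sorting both lists gives a fully deterministic traversal. A minimal sketch (the path is a placeholder):

import os

for root, dirs, files in os.walk('/path/of/directory'):
    dirs.sort()                    # in-place: controls which subdirs are
                                   # visited next, and in what order
    for name in sorted(files):     # the filenames need their own sort
        print(os.path.join(root, name))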
>>> for r,d,f in os.walk(r'C:\path\of\directory'):
	d.sort()
	d.reverse()
	print r

this gives me walking the tree in reverse, python-alphabetical order
(capital letters first)

Josef

From jsseabold at gmail.com  Thu Jul 30 11:24:45 2009
From: jsseabold at gmail.com (Skipper Seabold)
Date: Thu, 30 Jul 2009 11:24:45 -0400
Subject: [SciPy-User] Curse of recursive directory walking
In-Reply-To: <3d375d730907292142l604379c4y8a286e30a899b353@mail.gmail.com>
References: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com>
	<1cd32cbb0907290748yfe30fcctbf69252b96039d03@mail.gmail.com>
	<49d6b3500907291414k7b934179yf35a0f85b19c8e17@mail.gmail.com>
	<1cd32cbb0907291625m5d876beemf721ce98b97461c1@mail.gmail.com>
	<49d6b3500907292019g7e7b91ddsc447717ced787c63@mail.gmail.com>
	<3d375d730907292142l604379c4y8a286e30a899b353@mail.gmail.com>
Message-ID: 

On Thu, Jul 30, 2009 at 12:42 AM, Robert Kern wrote:
> On Wed, Jul 29, 2009 at 22:19, G?khan SEVER wrote:
>
>> I still couldn't figure out why os.walk() preserves the file ordering
>> properly on samba mounted folders but not on my same local copies. The weird
>> thing is it always walks the given directory tree in the same arbitrary
>> order whenever I call the script.
>
> The implementation of the filesystem determines what order you get
> them back in. That's why you got a consistent arbitrary order in your
> local copies. It is also why you get a consistent arbitrary order in
> your Samba folders; Samba's arbitrary ordering just happens to be what
> you consider to be properly sorted.
>

Also note that Samba is based on the SMB protocol used by the Windows
network file system.  I use Samba to share my Linux folders over my
network to Windows boxes, so it makes sense that it's the same as the
Windows ordering.

Cheers,
Skipper

From gokhansever at gmail.com  Thu Jul 30 15:33:22 2009
From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_Sever?=)
Date: Thu, 30 Jul 2009 14:33:22 -0500
Subject: [SciPy-User] Curse of recursive directory walking
In-Reply-To: <20548feb0907300238x35ed7fbbmc61d79ca15fc1322@mail.gmail.com>
References: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com>
	<1cd32cbb0907290748yfe30fcctbf69252b96039d03@mail.gmail.com>
	<49d6b3500907291414k7b934179yf35a0f85b19c8e17@mail.gmail.com>
	<20548feb0907300238x35ed7fbbmc61d79ca15fc1322@mail.gmail.com>
Message-ID: <49d6b3500907301233h65820026ka4f406facdf0a2e3@mail.gmail.com>

On Thu, Jul 30, 2009 at 4:38 AM, Harald Schilly wrote:

> On Wed, Jul 29, 2009 at 23:14, G?khan SEVER wrote:
> > For some reason, while this function is working on my local folders, it
> > walks through them in a random manner.
>
> This simply depends on the file system and the OS. Maybe your random
> order is just the sorting given by the inode index number!
>
> H

I checked the directory with "ls -li". os.walk() doesn't go by inodes:
the order of the inodes and the order in which the files are walked are
different.

--
G?khan
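A reusable variant of Josef's in-place d.sort() trick; the wrapper name,
the example path, and the case-insensitive key are illustrative
assumptions. Because os.walk() (with the default topdown=True) reads the
dirnames list again after each yield, sorting it in place steers the
traversal itself:

import os

def sorted_walk(top):
    # deterministic os.walk: sort dirs in place so the walk itself
    # follows the sorted order; files are sorted for convenience
    for root, dirs, files in os.walk(top):
        dirs.sort(key=str.lower)    # case-insensitive, Windows-like
        files.sort(key=str.lower)
        yield root, dirs, files

for root, dirs, files in sorted_walk('/path/of/directory'):
    print root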
From gokhansever at gmail.com  Thu Jul 30 15:39:04 2009
From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_Sever?=)
Date: Thu, 30 Jul 2009 14:39:04 -0500
Subject: [SciPy-User] Curse of recursive directory walking
In-Reply-To: <1cd32cbb0907300604p571eecf7r2b26f2b147f3e9d3@mail.gmail.com>
References: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com>
	<1cd32cbb0907290748yfe30fcctbf69252b96039d03@mail.gmail.com>
	<49d6b3500907291414k7b934179yf35a0f85b19c8e17@mail.gmail.com>
	<1cd32cbb0907291625m5d876beemf721ce98b97461c1@mail.gmail.com>
	<49d6b3500907292019g7e7b91ddsc447717ced787c63@mail.gmail.com>
	<1cd32cbb0907300604p571eecf7r2b26f2b147f3e9d3@mail.gmail.com>
Message-ID: <49d6b3500907301239o3dedc586h4474ea6abbe6f2c1@mail.gmail.com>

On Thu, Jul 30, 2009 at 8:04 AM, wrote:
>
> something like this might work.
>
> >>> for r,d,f in os.walk(r'C:\path\of\directory'):
>        d.sort()
>        d.reverse()
>        print r
>
> this gives me walking the tree in reverse, python-alphabetical order
> (capital letters first)
>
> Josef
>

d.sort() did the trick. Thanks for your help. Now it behaves the way I
wanted it to.

--
G?khan

From gokhansever at gmail.com  Thu Jul 30 15:41:45 2009
From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_Sever?=)
Date: Thu, 30 Jul 2009 14:41:45 -0500
Subject: [SciPy-User] Curse of recursive directory walking
In-Reply-To: 
References: <49d6b3500907290730o94161dl3a77aa6f39980714@mail.gmail.com>
	<1cd32cbb0907290748yfe30fcctbf69252b96039d03@mail.gmail.com>
	<49d6b3500907291414k7b934179yf35a0f85b19c8e17@mail.gmail.com>
	<1cd32cbb0907291625m5d876beemf721ce98b97461c1@mail.gmail.com>
	<49d6b3500907292019g7e7b91ddsc447717ced787c63@mail.gmail.com>
	<3d375d730907292142l604379c4y8a286e30a899b353@mail.gmail.com>
Message-ID: <49d6b3500907301241t6f0c35a0t3807e7c32418898a@mail.gmail.com>

On Thu, Jul 30, 2009 at 10:24 AM, Skipper Seabold wrote:
> Also note that Samba is based on the SMB protocol used by the Windows
> network file system. I use Samba to share my Linux folders over my
> network to Windows boxes, so it makes sense that it's the same as the
> Windows ordering.
>
> Cheers,
> Skipper
>

We use Samba in the same way. There are a few big common drives that are
mounted on either Linux or Windows machines. Thanks for the tip.

--
G?khan

From amenity at enthought.com  Thu Jul 30 19:48:18 2009
From: amenity at enthought.com (Amenity Applewhite)
Date: Thu, 30 Jul 2009 18:48:18 -0500
Subject: [SciPy-User] EPD Webinar August 7: Speed up Python (& wait list
	for non-subscribers!)
Message-ID: 

Friday, August 7th 1pm CDT
How do I...speed up Python? Exploration of Weave and Cython

We wanted to let you know that next week we'll host another installment
of our popular EPD webinar series. Although only EPD Basic or above
subscribers are guaranteed seats at EPD webinars, we invite
non-subscribers to add their names to the waiting list for each event.
If there are available seats, you will be notified by next Thursday and
given access to the webinar. Links to the waiting lists and upcoming
topics are posted here:

http://enthought.com/training/webinars.php

These events feature detailed demonstrations of powerful Python
techniques that Enthought developers use to enhance our applications or
development process. Participants are often invited to participate in
the demonstration, and are welcome to join the interactive VOIP
discussion later in the session.
This is a great opportunity to learn new methods and interact with our
expert developers. If you have topics you'd like to see addressed during
the webinar, feel free to let us know at media at enthought.com.

How do I...speed up Python: Exploration of Weave and Cython

Python excels as a language for rapid development and interactive
technical computing. Sometimes, however, the full speed of the machine is
needed. Fortunately, Python is also an excellent "glue" language that
allows simple integration with compiled languages. In this webinar, we
will present a few of the methods useful for speeding up sections of
code. In particular we will focus on weave and Cython.

Weave is a sub-package of SciPy (scipy.weave) and allows seamless
integration with C/C++. Sections of code that need to be sped up can be
re-coded in C and seamlessly integrated into the remaining Python program
using weave. Cython is a newer Python-like language which is compiled to
an extension module that can be dynamically loaded into Python just like
any other Python module. Cython also supports writing new built-in Python
types and tries to compile loops to C-speeds when variable types are
indicated. Both weave and Cython integrate well with NumPy, which makes
them ideal whenever you need to iterate and calculate with large amounts
of data and can't find a built-in (vectorized) approach.

Once again, to add your name to the wait-list:
http://enthought.com/training/webinars.php

We hope to see you there!

Thanks,
Enthought Media

Quick Links...
Enthought.com
Enthought Python Distribution (EPD)
Enthought Webinars
@Facebook
@Twitter

--
Amenity Applewhite
Enthought, Inc.
Scientific Computing Solutions
www.enthought.com

From permafacture at gmail.com  Fri Jul 31 00:03:40 2009
From: permafacture at gmail.com (Permafacture)
Date: Thu, 30 Jul 2009 21:03:40 -0700 (PDT)
Subject: [SciPy-User] Solve an arbitrary function
Message-ID: 

I'm new to this package, and am used to Mathematica.

I've been reading through the tutorials and numpy lists of functions,
yet I cannot find the answer to this, though it would be trivial in
Mathematica. Can someone show me an example of how to find the
intersection of a line and a polynomial?

for instance: y1=x1^2, y2=x2+1

Solving a system of linear equations is pretty straightforward, but
it isn't clear to me how to approach this in scipy. The most general
approach to this would probably be the most useful to me.

thanks.

From robert.kern at gmail.com  Fri Jul 31 00:17:33 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 30 Jul 2009 23:17:33 -0500
Subject: [SciPy-User] Solve an arbitrary function
In-Reply-To: 
References: 
Message-ID: <3d375d730907302117n78a97481w80f9742233ffdba9@mail.gmail.com>

On Thu, Jul 30, 2009 at 23:03, Permafacture wrote:
> I'm new to this package, and am used to Mathematica.
>
> I've been reading through the tutorials and numpy lists of functions,
> yet I cannot find the answer to this, though it would be trivial in
> Mathematica. ?Can someone show me an example of how to find the
> intersection of a line and a polynomial?
>
> for instance: y1=x1^2, y2=x2+1
>
> Solving a system of linear equations is pretty straightforward, but
> it isn't clear to me how to approach this in scipy. ?The most general
> approach to this would probably be the most useful to me.
http://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.fsolve.html#scipy.optimize.fsolve


In [1]: from scipy.optimize import fsolve

In [2]: import numpy as np

In [3]: def f(xy):
   ...:     x, y = xy
   ...:     z = np.array([y - x**2, y - x - 1.0])
   ...:     return z
   ...:

In [4]: fsolve(f, [1.0, 2.0])
Out[4]: array([ 1.61803399,  2.61803399])

In [5]: fsolve(f, [-0.5, 1.5])
Out[5]: array([-0.61803399,  0.38196601])


--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco

From vanforeest at gmail.com  Fri Jul 31 03:19:21 2009
From: vanforeest at gmail.com (nicky van foreest)
Date: Fri, 31 Jul 2009 09:19:21 +0200
Subject: [SciPy-User] Solve an arbitrary function
In-Reply-To: <3d375d730907302117n78a97481w80f9742233ffdba9@mail.gmail.com>
References: 
	<3d375d730907302117n78a97481w80f9742233ffdba9@mail.gmail.com>
Message-ID: 

Hi,

If you like a graphical approach you might give PyX a try, and have
postscript solve your problem:

http://pyx.sourceforge.net/examples/path/intersect.html

2009/7/31 Robert Kern :
> On Thu, Jul 30, 2009 at 23:03, Permafacture wrote:
>> I'm new to this package, and am used to Mathematica.
>>
>> I've been reading through the tutorials and numpy lists of functions,
>> yet I cannot find the answer to this, though it would be trivial in
>> Mathematica. ?Can someone show me an example of how to find the
>> intersection of a line and a polynomial?
>>
>> Solving a system of linear equations is pretty straightforward, but
>> it isn't clear to me how to approach this in scipy. ?The most general
>> approach to this would probably be the most useful to me.

What would be the best place to document examples such as Robert's
below? I am afraid such examples get buried in the mailing list while
they are great examples of how to use fsolve, for instance.

Nicky

>
> http://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.fsolve.html#scipy.optimize.fsolve
>
>
> In [1]: from scipy.optimize import fsolve
>
> In [2]: import numpy as np
>
> In [3]: def f(xy):
> ? ...: ? ? x, y = xy
> ? ...: ? ? z = np.array([y - x**2, y - x - 1.0])
> ? ...: ? ? return z
> ? ...:
>
> In [4]: fsolve(f, [1.0, 2.0])
> Out[4]: array([ 1.61803399, ?2.61803399])
>
> In [5]: fsolve(f, [-0.5, 1.5])
> Out[5]: array([-0.61803399, ?0.38196601])

From bala.biophysics at gmail.com  Fri Jul 31 04:00:45 2009
From: bala.biophysics at gmail.com (Bala subramanian)
Date: Fri, 31 Jul 2009 13:30:45 +0530
Subject: [SciPy-User] book on scipy
Message-ID: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>

Friends,

Someone please suggest me some good books on scipy/numpy for a beginner.

Thanks,
Bala
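An aside on the intersection question above: since both curves are
polynomials, the same two solutions also drop out of numpy.roots, because
y = x**2 and y = x + 1 intersect where x**2 - x - 1 = 0. A quick check
(this is an illustrative addition, not part of the original thread):

>>> import numpy as np
>>> np.roots([1.0, -1.0, -1.0])      # coefficients of x**2 - x - 1
array([ 1.61803399, -0.61803399])

which matches the x components of the two fsolve solutions; fsolve
remains the general tool once the system is not polynomial.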
From elcorto at gmx.net  Fri Jul 31 04:02:23 2009
From: elcorto at gmx.net (Steve Schmerler)
Date: Fri, 31 Jul 2009 10:02:23 +0200
Subject: [SciPy-User] Solve an arbitrary function
In-Reply-To: 
References: <3d375d730907302117n78a97481w80f9742233ffdba9@mail.gmail.com>
	
Message-ID: <20090731080223.GA25790@ramrod.starsheriffs.de>

On Jul 31 09:19 +0200, nicky van foreest wrote:

> What would be the best place to document examples such as Robert's
> below? I am afraid such examples get buried in the mailing list while
> they are great examples of how to use fsolve, for instance.

http://scipy.org/Cookbook would be the place, I guess.

best,
steve

From gael.varoquaux at normalesup.org  Fri Jul 31 04:03:32 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Fri, 31 Jul 2009 10:03:32 +0200
Subject: [SciPy-User] Solve an arbitrary function
In-Reply-To: 
References: <3d375d730907302117n78a97481w80f9742233ffdba9@mail.gmail.com>
	
Message-ID: <20090731080332.GA9207@phare.normalesup.org>

On Fri, Jul 31, 2009 at 09:19:21AM +0200, nicky van foreest wrote:
> What would be the best place to document examples such as Robert's
> below? I am afraid such examples get buried in the mailing list while
> they are great examples of how to use fsolve, for instance.

That's a tough question. You can of course edit the Scipy Cookbook
(www.scipy.org/Cookbook), but the problem with this approach is that it
grows messy and may be hard to find.

The best solution is to manage to make the scipy documentation
(http://docs.scipy.org/doc/scipy/reference/) good enough so that people
find it, read it, and quickly find an answer to their problems. That
requires careful organisation of the answers and examples, and linking
between them. Indeed, everybody has a different question and adding lots
of examples without any organization in the docs will not solve the
problem.

If you want to pitch in, it is easy (at least technically), you can use
your login on the doc wiki.

Cheers,

Ga?l

From gael.varoquaux at normalesup.org  Fri Jul 31 04:05:22 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Fri, 31 Jul 2009 10:05:22 +0200
Subject: [SciPy-User] book on scipy
In-Reply-To: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
Message-ID: <20090731080522.GB9207@phare.normalesup.org>

On Fri, Jul 31, 2009 at 01:30:45PM +0530, Bala subramanian wrote:
> ? ?Someone please suggest me some good books on scipy/numpy?for a beginner.

Hans Petter Langtangen: Python Scripting for Computational Science: buy
the third edition, or download the 2nd:

http://www.tinydl.com/ebooks/15335-hans-petter-langtangen-python-scripting-for.html

The 3rd is much better than the 2nd, IMHO.

Ga?l

From sccolbert at gmail.com  Fri Jul 31 04:47:34 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Fri, 31 Jul 2009 04:47:34 -0400
Subject: [SciPy-User] book on scipy
In-Reply-To: <20090731080522.GB9207@phare.normalesup.org>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
	<20090731080522.GB9207@phare.normalesup.org>
Message-ID: <7f014ea60907310147k780dbc43xa6acd045ed6cae19@mail.gmail.com>

Gael,

That link is just full of internet trash, popups, and the like. I'm
glad I didn't click on it with a Windows box.

Do you have a copy of the book? I can host it on my server for a while
and put up a link for the list.

Cheers!
Chris

On Fri, Jul 31, 2009 at 4:05 AM, Gael Varoquaux wrote:
> On Fri, Jul 31, 2009 at 01:30:45PM +0530, Bala subramanian wrote:
>> ? ?Someone please suggest me some good books on scipy/numpy?for a beginner.
>
> Hans Petter Langtangen: Python Scripting for Computational Science: buy
> the third edition, or download the 2nd:
>
> http://www.tinydl.com/ebooks/15335-hans-petter-langtangen-python-scripting-for.html
>
> The 3rd is much better than the 2nd, IMHO.
>
> Ga?l

From gael.varoquaux at normalesup.org  Fri Jul 31 04:49:52 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Fri, 31 Jul 2009 10:49:52 +0200
Subject: [SciPy-User] book on scipy
In-Reply-To: <7f014ea60907310147k780dbc43xa6acd045ed6cae19@mail.gmail.com>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
	<20090731080522.GB9207@phare.normalesup.org>
	<7f014ea60907310147k780dbc43xa6acd045ed6cae19@mail.gmail.com>
Message-ID: <20090731084952.GC9207@phare.normalesup.org>

On Fri, Jul 31, 2009 at 04:47:34AM -0400, Chris Colbert wrote:
> Gael,

> That link is just full of internet trash, popups, and the like. I'm
> glad I didn't click on it with a Windows box.

Yes, it was the first hit on google, but I should have checked it.

The good link is:

http://vefur.simula.no/~hpl/scripting/index.html

Ga?l

From cohen at lpta.in2p3.fr  Fri Jul 31 07:23:11 2009
From: cohen at lpta.in2p3.fr (Johann Cohen-Tanugi)
Date: Fri, 31 Jul 2009 13:23:11 +0200
Subject: [SciPy-User] book on scipy
In-Reply-To: <20090731084952.GC9207@phare.normalesup.org>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
	<20090731080522.GB9207@phare.normalesup.org>
	<7f014ea60907310147k780dbc43xa6acd045ed6cae19@mail.gmail.com>
	<20090731084952.GC9207@phare.normalesup.org>
Message-ID: <4A72D41F.9050703@lpta.in2p3.fr>

and this link seems to have the 3rd edition, actually, even its second
printing! thanks Gael,
Johann

Gael Varoquaux wrote:
> On Fri, Jul 31, 2009 at 04:47:34AM -0400, Chris Colbert wrote:
>> That link is just full of internet trash, popups, and the like. I'm
>> glad I didn't click on it with a Windows box.
>
> Yes, it was the first hit on google, but I should have checked it.
>
> The good link is:
>
> http://vefur.simula.no/~hpl/scripting/index.html
>
> Ga?l

From cohen at lpta.in2p3.fr  Fri Jul 31 07:24:44 2009
From: cohen at lpta.in2p3.fr (Johann Cohen-Tanugi)
Date: Fri, 31 Jul 2009 13:24:44 +0200
Subject: [SciPy-User] book on scipy
In-Reply-To: <20090731084952.GC9207@phare.normalesup.org>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
	<20090731080522.GB9207@phare.normalesup.org>
	<7f014ea60907310147k780dbc43xa6acd045ed6cae19@mail.gmail.com>
	<20090731084952.GC9207@phare.normalesup.org>
Message-ID: <4A72D47C.1060002@lpta.in2p3.fr>

well... actually no! The download is blocked by the need to have a
printed edition at hand to check first words on (I presume) random
pages.....
Johann

Gael Varoquaux wrote:
> On Fri, Jul 31, 2009 at 04:47:34AM -0400, Chris Colbert wrote:
>> Gael,
>
>> That link is just full of internet trash, popups, and the like. I'm
>> glad I didn't click on it with a Windows box.
>
> Yes, it was the first hit on google, but I should have checked it.
>
> The good link is:
>
> http://vefur.simula.no/~hpl/scripting/index.html
>
> Ga?l

From aisaac at american.edu  Fri Jul 31 09:22:14 2009
From: aisaac at american.edu (Alan G Isaac)
Date: Fri, 31 Jul 2009 09:22:14 -0400
Subject: [SciPy-User] book on scipy
In-Reply-To: <4A72D47C.1060002@lpta.in2p3.fr>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
	<20090731080522.GB9207@phare.normalesup.org>
	<7f014ea60907310147k780dbc43xa6acd045ed6cae19@mail.gmail.com>
	<20090731084952.GC9207@phare.normalesup.org>
	<4A72D47C.1060002@lpta.in2p3.fr>
Message-ID: <4A72F006.5040206@american.edu>

On 7/31/2009 7:24 AM Johann Cohen-Tanugi apparently wrote:
> The download is blocked by the need to have a
> printed edition at hand to check first words on (I presume) random
> pages.....

It is not a free book.  It is available for $56 on Amazon and imo worth
buying:
http://www.amazon.com/Python-Scripting-Computational-Science-Engineering/dp/3540739157/ref=sr_1_1?ie=UTF8&qid=1249046158&sr=8-1

But don't overlook the Cookbook:
http://www.scipy.org/Cookbook

Alan Isaac

From emmanuelle.gouillart at normalesup.org  Fri Jul 31 09:37:33 2009
From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart)
Date: Fri, 31 Jul 2009 15:37:33 +0200
Subject: [SciPy-User] book on scipy
In-Reply-To: <4A72F006.5040206@american.edu>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
	<20090731080522.GB9207@phare.normalesup.org>
	<7f014ea60907310147k780dbc43xa6acd045ed6cae19@mail.gmail.com>
	<20090731084952.GC9207@phare.normalesup.org>
	<4A72D47C.1060002@lpta.in2p3.fr>
	<4A72F006.5040206@american.edu>
Message-ID: <20090731133733.GA23447@phare.normalesup.org>

> But don't overlook the Cookbook:
> http://www.scipy.org/Cookbook

And if you want to read through a comprehensive and step-by-step
document, have a look at the scipy tutorial
http://docs.scipy.org/doc/scipy/reference/tutorial/ and the numpy
tutorial http://www.scipy.org/Tentative_NumPy_Tutorial

Emmanuelle

From gokhansever at gmail.com  Fri Jul 31 09:39:25 2009
From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_Sever?=)
Date: Fri, 31 Jul 2009 08:39:25 -0500
Subject: [SciPy-User] book on scipy
In-Reply-To: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
Message-ID: <49d6b3500907310639o6fcab573xd28c836f978d99d6@mail.gmail.com>

On Fri, Jul 31, 2009 at 3:00 AM, Bala subramanian wrote:

> Friends,
>
> Someone please suggest me some good books on scipy/numpy for a beginner.
>
> Thanks,
> Bala
>

Although not a book resource, I highly recommend you to look at these two
links:

https://cirl.berkeley.edu/fperez/py4science/starter_kit.html
Scientific Computing with Python

The next two are extras :) Probably you will need them once you step into
the PyScience world.

A Demonstration of the 'IPython' Interactive Shell
Matplotlib by John D. Hunter

I have learnt a great amount of information just by watching these
lectures, and have been highly motivated to integrate my scientific
research with Python and its scientific extensions.
Good luck,

--
G?khan

From ldl08 at gmx.net  Fri Jul 31 10:50:05 2009
From: ldl08 at gmx.net (David)
Date: Fri, 31 Jul 2009 22:50:05 +0800
Subject: [SciPy-User] book on scipy
In-Reply-To: <49d6b3500907310639o6fcab573xd28c836f978d99d6@mail.gmail.com>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
	<49d6b3500907310639o6fcab573xd28c836f978d99d6@mail.gmail.com>
Message-ID: <4A73049D.4070909@gmx.net>

Hello,

G?khan Sever wrote:
> Matplotlib by John D. Hunter
>
Is there actually a video to be found there?  I, for my part, can't find
it for the life of me. I can merely download the slides.
Has the video maybe been taken down?

David

From jdh2358 at gmail.com  Fri Jul 31 10:59:41 2009
From: jdh2358 at gmail.com (John Hunter)
Date: Fri, 31 Jul 2009 09:59:41 -0500
Subject: [SciPy-User] book on scipy
In-Reply-To: <4A73049D.4070909@gmx.net>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
	<49d6b3500907310639o6fcab573xd28c836f978d99d6@mail.gmail.com>
	<4A73049D.4070909@gmx.net>
Message-ID: <88e473830907310759r56f5e9bp31b39635d8072d01@mail.gmail.com>

On Fri, Jul 31, 2009 at 9:50 AM, David wrote:
> Hello,
>
> G?khan Sever wrote:
>> Matplotlib by John D. Hunter
>>
> Is there actually a video to be found there? ?I, for my part, can't find
> it for the life of me. I can merely download the slides.
> Has the video maybe been taken down?

It looks like it has been taken down, as have other videos from the
same conference.  I emailed support at video lectures and asked them if
they could repost the videos.

JDH

From josef.pktd at gmail.com  Fri Jul 31 12:01:02 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Fri, 31 Jul 2009 12:01:02 -0400
Subject: [SciPy-User] expectation function for stats.distributions
Message-ID: <1cd32cbb0907310901oe3e7ca4r74586845222acdbc@mail.gmail.com>

I just needed to verify an expectation while fixing the stats.models code.

A while ago I wrote a function that can be attached as a method to the
distributions classes. It's pretty simple, but comes in handy when an
expectation or conditional expectation has to be checked.

Is there an interest in adding this as a new method to the distributions?

explanations are here
http://jpktd.blogspot.com/2009/04/having-fun-with-expectations.html

copy of file below

Josef

'''
copy from webpage, where is the original?

Author jpktd

'''


import numpy as np
from scipy import stats, integrate

def expectedfunc(self, fn=None, args=(), lb=None, ub=None, conditional=False):
    '''calculate expected value of a function with respect to the distribution

    only for standard version of distribution,
    location and scale not tested

    Parameters
    ----------
        all parameters are keyword parameters
        fn : function (default: identity mapping)
           Function for which integral is calculated. Takes only one argument.
        args : tuple
           argument (parameters) of the distribution
        lb, ub : numbers
           lower and upper bound for integration, default is set to the support
           of the distribution
        conditional : boolean (False)
           If true then the integral is corrected by the conditional probability
           of the integration interval. The return value is the expectation
           of the function, conditional on being in the given interval.
    Returns
    -------
        expected value : float
    '''
    if fn is None:
        def fun(x, *args):
            return x*self.pdf(x, *args)
    else:
        def fun(x, *args):
            return fn(x)*self.pdf(x, *args)
    if lb is None:
        lb = self.a
    if ub is None:
        ub = self.b
    if conditional:
        invfac = self.sf(lb,*args) - self.sf(ub,*args)
    else:
        invfac = 1.0
    return integrate.quad(fun, lb, ub,
                          args=args)[0]/invfac


print expectedfunc(stats.norm, lambda(x): (x)**4)
print expectedfunc(stats.norm, lambda(x): min((x)**2,1.5**2))

#Jonathan's version
from scipy.stats import norm as Gaussian
c = 1.5 # our "cutoff" point
c = 0.5 # try another value
tmp = 2 * Gaussian.cdf(c) - 1
gamma = tmp + c**2 * (1 - tmp) - 2 * c * Gaussian.pdf(c)
print gamma

print expectedfunc(stats.norm, lambda(x): min((x)**2,c**2))

From rex at nosyntax.net  Fri Jul 31 11:46:13 2009
From: rex at nosyntax.net (rex)
Date: Fri, 31 Jul 2009 08:46:13 -0700
Subject: [SciPy-User] book on scipy
In-Reply-To: <20090731133733.GA23447@phare.normalesup.org>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
	<20090731080522.GB9207@phare.normalesup.org>
	<7f014ea60907310147k780dbc43xa6acd045ed6cae19@mail.gmail.com>
	<20090731084952.GC9207@phare.normalesup.org>
	<4A72D47C.1060002@lpta.in2p3.fr>
	<4A72F006.5040206@american.edu>
	<20090731133733.GA23447@phare.normalesup.org>
Message-ID: <20090731154612.GN10466@nosyntax.net>

Emmanuelle Gouillart [2009-07-31 06:37]:
>> But don't overlook the Cookbook:
>> http://www.scipy.org/Cookbook
>
> And if you want to read through a comprehensive and step-by-step
> document, have a look at the scipy tutorial
> http://docs.scipy.org/doc/scipy/reference/tutorial/ and the numpy
> tutorial http://www.scipy.org/Tentative_NumPy_Tutorial

There's also:

http://docs.scipy.org/doc/numpy/reference/

Speaking of which, this example below does not work correctly with
matplotlib-0.98.1-1+lenny4 (the most recent in the Debian repository).

http://docs.scipy.org/doc/numpy/reference/generated/numpy.random.lognormal.html?highlight=histogram

import numpy as np
mu, sigma = 0.05, 0.25 # mean and standard deviation
s = np.random.lognormal(mu, sigma, 100000)

import matplotlib.pyplot as plt
count, bins, ignored = plt.hist(s, 100, normed=True, align='center')
x = np.linspace(min(bins), max(bins), 10000)
pdf = (np.exp(-(np.log(x) - mu)**2 / (2 * sigma**2))/(x * sigma * np.sqrt(2 * np.pi)))
plt.plot(x, pdf, linewidth=2, color='r')
plt.axis('tight')
plt.show()

It's visually apparent that the histogram is offset to the left from
where it should be.

>>help(plt.hist) reveals that:

*align*: ['left' | 'mid' | 'right' ]

With the 'mid' keyword, the plot is correctly centered on the pdf.

'left, center, right' is certainly easier to remember than
'left', 'mid', 'right'. In any case, shouldn't it throw an error rather
than silently defaulting to 'left' when the 'align' keyword isn't one of
the magic 3?

-rex

From jgomezdans at gmail.com  Fri Jul 31 12:59:51 2009
From: jgomezdans at gmail.com (Jose Gomez-Dans)
Date: Fri, 31 Jul 2009 17:59:51 +0100
Subject: [SciPy-User] Filling a function with values
Message-ID: <91d218430907310959q5590b5d7kd29a97689a27a5e@mail.gmail.com>

Hi,

This is a really quick and stupid question. Let's say I have a function
and I want to calculate its value for a set of points. What's the best
way of quickly doing this avoiding loops?

def MyFunction(p1, p2, p3, t, y):
    return *some_value*

t = numpy.array ( ..... )
y = numpy.array ( ....)
a1,a2,a3 = numpy.mgrid [ 0:100, 0:1:100j, 0:100 ]
evaluated_function = MyFunction (a1, a2, a3, t, y)

evaluated_function.shape would be (100,100,100), and
evaluated_function [m, n, k] should be equal to
MyFunction ( a1[m], a2[n], a3[k], t, y )

I think mgrid (ogrid) are the right tools, but I don't really understand
how I can get it to do what I want. I have read
<http://docs.scipy.org/doc/scipy/reference/tutorial/basic.html#id5>, but
that doesn't clear things up. I guess that what I want to do is just
mentioned at the end of this Section of the tutorial, but can anyone
maybe give an example?

many thanks!
J

From jh at physics.ucf.edu  Fri Jul 31 13:06:37 2009
From: jh at physics.ucf.edu (Joe Harrington)
Date: Fri, 31 Jul 2009 13:06:37 -0400
Subject: [SciPy-User] SciPy Foundation
Message-ID: 

About sixteen months ago, I launched the SciPy Documentation Project and
its Marathon.  Dozens pitched in and now numpy docs are rapidly
approaching a professional level.  The "pink wave" ("Needs Review"
status) is at 56% today!  There is consensus among doc writers that much
of the rest can be labeled in the "unimportant" category, so we're close
to starting the review push (hold your fire, there is a web site mod to
be done first).

We're also nearing the end of the summer, and it's time to look ahead.
The path for docs is clear, but the path for SciPy is not.  I think our
weakest area right now is organization of the project.  There is no
consensus-based plan for improvement of the whole toward a stated goal,
no centralized coordination of work, and no funded work focused on many
of our weaknesses, notwithstanding my doc effort and what Enthought does
for code.

I define success as popular adoption in preference to commercial
packages.  I believe in vote-with-your-feet: this goal will not be
reached until all aspects of the package and its presentation to the
world exceed those of our commercial competition.  Scipy is now a grass
roots effort, but that takes it only so far.  Other projects, such as
OpenOffice and Sage, don't follow this model and do produce quality
products that compete with commercial offerings, at least on open-source
platforms.  Before we can even hope for that, we have to do the
following:

- Docs
  - Rest of numpy reference pages reviewed and proofed or marked
    unimportant
  - Scipy reference pages
  - User manual for the whole toolstack
  - Multiple commercial books

- Packaging
  - Personal Package Archive or equivalent for every release of every OS
    for the full toolstack
    (There are tools that do this but we don't use them.  NSF requires
    Metronome - http://nmi.cs.wisc.edu/ - for funding most development
    grants, so right now we're not even on NSF's radar.)
  - Track record of having the whole toolstack installation "just work"
    in a few command lines or clicks for *everyone*
  - Regular, scheduled releases of numpy and scipy
  - Coordinated releases of numpy, scipy, and stable scikits into PPA
    system

- Public communication
  - A real marketing plan
  - Executing on that plan
  - Web site geared toward multiple audiences, run by experts at that
    kind of communication
  - More webinars, conference booths, training, aimed at all levels
  - Demos, testimonials, topical forums, all showcased

- Code
  - A full design review for numpy 2.0
    - No more inconsistencies like median(), lacking "out", degrees
      option for angle functions?
    - Trimming of financial functions, maybe others, from numpy?
    - Package structure review (eliminate "fromnumeric"?)
  - Goal that this be the last breakage for numpy API (the real 1.0)
  - Scipy
    - Is it maintainable? should it be broken up?
    - Clear code addition path (or decide never to add more)
    - Docs (see above)

- Add-on packages
  - Both existence of and good indexing/integration/support for
    field-specific packages
  - Clearer development path for new packages
  - Central hosting system for packages (svn, mailing lists, web, build
    integration, etc.)
  - Simultaneous releases of stable packages along with numpy/scipy

I posted a basic improvement plan some years back.  The core ideas have
not changed; it is linked from the bottom of
http://scipy.org/Developer_Zone.  I chose our major weakness to begin
with and started the doc project, using some money I could justify
spending simply for the utility of docs for my own research.  I funded
the work of two doc coordinators, one each this summer and last.  Looking
at http://docs.scipy.org/numpy/stats/, you can see that when a doc
coordinator was being paid (summers), work got done.  When not, then not.
Without publicly announcing what these guys made, I'll be the first to
admit that it wasn't a lot.  Yet, those small sums bought a huge
contribution to numpy through the work of several dozen volunteers and
the major contributions of a few.  My conclusion is that active and
constant coordination is central to motivating volunteer work, and that
without a salary we cannot depend on coordination remaining active.

On the other hand, I have heard Enthought's leaders bemoan the high cost
of devoting employee time to this project, and the low returns available
from selling support to universities and non-profit research institutes.
Their leadership has moved us forward, particularly in the area of code,
but has not provided the momentum necessary to carry us forward on all
fronts.  It is time for the public and education sectors to kick in some
resources and organizational leadership.  We are, after all, benefitting
immensely.

Since the cost of employee time is not so high for us in the public and
education sectors, I propose to continue hiring people like Stefan and
David as UCF employees or contractors, and to expand to hiring others in
areas like packaging and marketing, provided that funding for those hires
can be found.  However, my grant situation is no longer as rich as it has
been the past two years, and the needs going forward are greater than in
the past if we're now to tackle all the points above.  So, I will not be
hiring another doc guru from my research grants next year.

I am confident that others are willing to pitch in financially, but few
will pitch in a full FTE, and we need several.  We can (and will) set up
a donations site, but donation sites tend to receive pizza money unless a
sugar daddy comes along.  Those benefitting most from the software,
notably education, non-profit research, and government institutions, are
*forbidden* from making donations by the terms of their grants.  NSF
doesn't give you money so you can give it away.  We need to provide
services they can buy on subcontract and a means for handling payments
from them.  Selling support does not solve the problem, as that requires
spending most of the income on servicing that particular client.  Rather,
we need to sell a chunk of documentation or the packaging of a particular
release, and then provide the product not just to that client but to
everyone.  We can also propose directly for federal and corporate grant
funds.
I have spoken with several NASA and NSF program managers and with
Google's Federal Accounts Representative, and the possibilities for
funding are good.  But, I am not going to do this alone.  We need a
strong proposal team to be credible.

So, I am seeking a group that is willing to work with me to put up the
infrastructure of a funded project, to write grant proposals, and to
coordinate a financial effort.  Members of this group must have a track
record of funded grants, business success, foundation support, etc.  We
might call it the SciPy Foundation.  It could be based at UCF, which has
a low overhead rate and has infrastructure (like an HR staff), or it
might be independent if we can find a good director willing to devote
significant time for relatively low pay compared to what they can likely
make elsewhere.  I would envision hiring permanent coordinators for docs,
packaging, and marketing communications.  Enthought appears to have code
covered by virtue of having hired Travis, Robert, etc.; how to integrate
that with this effort is an open question but not a difficult one, I
think, as code is our strongest asset at this point.

I invite discussion of this approach and the task list above on the
scipy-dev at scipy.org mailing list.  If you are seeing this post
elsewhere, please reply only on scipy-dev at scipy.org.  If you are
eligible to lead funding proposals and are interested in participating
in grant writing and management activities related to work in our weak
areas, please contact me directly.

Thanks,

--jh--
Prof. Joseph Harrington
Planetary Sciences Group
Department of Physics
MAP 414
4000 Central Florida Blvd.
University of Central Florida
Orlando, FL 32816-2385
jh at physics.ucf.edu
planets.ucf.edu

From wing1127aishi at gmail.com  Fri Jul 31 14:55:01 2009
From: wing1127aishi at gmail.com (Leon Sit)
Date: Fri, 31 Jul 2009 14:55:01 -0400
Subject: [SciPy-User] How do I add a data point into scikits.timeseries
Message-ID: <3af1998d0907311155w20c1fdb9o799664ae16134148@mail.gmail.com>

Hi everyone:

I am trying to append tuples of data, (datetime, data), to a
scikits.timeseries series.  I do not know the size in advance, so the
data must be appended.  Is there a way to do this efficiently?  This
will be a primary operation that is done on the order of a million
times.

Thanks,
Leon

From vanforeest at gmail.com  Fri Jul 31 15:06:18 2009
From: vanforeest at gmail.com (nicky van foreest)
Date: Fri, 31 Jul 2009 21:06:18 +0200
Subject: [SciPy-User] Solve an arbitrary function
In-Reply-To: <20090731080332.GA9207@phare.normalesup.org>
References: 
	<3d375d730907302117n78a97481w80f9742233ffdba9@mail.gmail.com>
	<20090731080332.GA9207@phare.normalesup.org>
Message-ID: 

2009/7/31 Gael Varoquaux :
> On Fri, Jul 31, 2009 at 09:19:21AM +0200, nicky van foreest wrote:
>> What would be the best place to document examples such as Robert's
>> below? I am afraid such examples get buried in the mailing list while
>> they are great examples of how to use fsolve, for instance.
>
> That's a tough question. You can of course edit the Scipy Cookbook
> (www.scipy.org/Cookbook), but the problem with this approach is that it
> grows messy and may be hard to find.

I completely agree. On the other hand, having to search through the
mailing list is messier still. Adding such examples to the documentation
of a function is perhaps also not the best way. Often people are
searching for a function that is unknown to them, so they will not find
the relevant documentation.
A cookbook would then be a natural place to start looking. I, for
instance, find the Python cookbook by Martelli very useful. Hence, I
decided to add this particular example to the cookbook.

Nicky

From vanforeest at gmail.com  Fri Jul 31 15:10:46 2009
From: vanforeest at gmail.com (nicky van foreest)
Date: Fri, 31 Jul 2009 21:10:46 +0200
Subject: [SciPy-User] expectation function for stats.distributions
In-Reply-To: <1cd32cbb0907310901oe3e7ca4r74586845222acdbc@mail.gmail.com>
References: <1cd32cbb0907310901oe3e7ca4r74586845222acdbc@mail.gmail.com>
Message-ID: 

Hi Josef,

I would like such a function. As a matter of fact, just today I needed
the expectation of min(X,k), where X is a Poisson RV and k a number. Of
course, this is not particularly difficult, but a function that does
the work for me is better yet. BTW: will integrate.quad also work for
discrete distributions?

Nicky

2009/7/31  ?:
> I just needed to verify an expectation while fixing the stats.models code.
>
> A while ago I wrote a function that can be attached as a method to the
> distributions classes. It's pretty simple, but comes in handy when an
> expectation or conditional expectation has to be checked.
>
> Is there an interest in adding this as a new method to the distributions?
>
> explanations are here
> http://jpktd.blogspot.com/2009/04/having-fun-with-expectations.html
From josef.pktd at gmail.com  Fri Jul 31 16:10:07 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Fri, 31 Jul 2009 16:10:07 -0400
Subject: [SciPy-User] expectation function for stats.distributions
In-Reply-To: 
References: <1cd32cbb0907310901oe3e7ca4r74586845222acdbc@mail.gmail.com>
	
Message-ID: <1cd32cbb0907311310g59c4f2a5uf9f328acba0bad07@mail.gmail.com>

On Fri, Jul 31, 2009 at 3:10 PM, nicky van foreest wrote:
> Hi Josef,
>
> I would like such a function. As a matter of fact, just today I needed
> the expectation of min(X,k), where X is a Poisson RV and k a number. Of
> course, this is not particularly difficult, but a function that does
> the work for me is better yet. BTW: will integrate.quad also work for
> discrete distributions?

Thanks for the interest.

No, integrate.quad only works for continuous random variables. But there
are the corresponding generic functions also for the discrete
distributions in stats.distributions. An expectation function could be
based on the generic calculation of the uncentered moments of the
discrete distributions. I fixed the functions last year, but haven't
looked at them in a while. I think the main point was to decide which and
how many points to use in the calculation; otherwise it is just a vector
inner product.

As to the name: I'm open to ideas.

I don't like "integral" so much because it is not obvious that it
calculates an expectation and not just the integral of the pdf, as in
stats.kde and my wrapper for the multivariate normal cdf. Additionally,
my function also calculates conditional expectations, which are not just
integrals. I like "expectation", but it is a noun and might refer to the
expectation of the random variable itself, i.e. the mean. That's why I
ended up with a variation on expectation_function.

Josef
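A rough sketch of the pmf-weighted inner product Josef describes for the
discrete case, applied to Nicky's E[min(X, k)] example; the function
name, the tail cutoff, and the Poisson parameter are all illustrative
assumptions:

import numpy as np
from scipy import stats

def expected_discrete(dist, fn, args=(), lb=None, ub=None):
    # E[fn(X)] as an inner product of fn on the (truncated) support
    # with the pmf; the 1e-10 tail cutoff is an arbitrary choice
    if lb is None:
        lb = dist.ppf(1e-10, *args)
    if ub is None:
        ub = dist.ppf(1 - 1e-10, *args)
    k = np.arange(lb, ub + 1)
    return np.dot(fn(k), dist.pmf(k, *args))

# E[min(X, 2)] for X ~ Poisson(3)
print expected_discrete(stats.poisson, lambda x: np.minimum(x, 2), args=(3,))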
It's pretty simple, but comes in handy when an >> expectation or conditional expectation has to be checked. >> >> Is there an interest in adding this as a new method to the distributions? >> >> explanations are here >> http://jpktd.blogspot.com/2009/04/having-fun-with-expectations.html >> >> copy of file below >> >> Josef >> >> ''' >> copy from webpage, where is the original? >> >> Author jpktd >> >> ''' >> >> >> import numpy as np >> from scipy import stats, integrate >> >> def expectedfunc(self, fn=None, args=(), lb=None, ub=None, conditional=False): >> ? ?'''calculate expected value of a function with respect to the distribution >> >> ? ?only for standard version of distribution, >> ? ?location and scale not tested >> >> ? ?Parameters >> ? ?---------- >> ? ? ? ?all parameters are keyword parameters >> ? ? ? ?fn : function (default: identity mapping) >> ? ? ? ? ? Function for which integral is calculated. Takes only one argument. >> ? ? ? ?args : tuple >> ? ? ? ? ? argument (parameters) of the distribution >> ? ? ? ?lb, ub : numbers >> ? ? ? ? ? lower and upper bound for integration, default is set to the support >> ? ? ? ? ? of the distribution >> ? ? ? ?conditional : boolean (False) >> ? ? ? ? ? If true then the integral is corrected by the conditional probability >> ? ? ? ? ? of the integration interval. The return value is the expectation >> ? ? ? ? ? of the function, conditional on being in the given interval. >> >> ? ?Returns >> ? ?------- >> ? ? ? ?expected value : float >> ? ?''' >> ? ?if fn is None: >> ? ? ? ?def fun(x, *args): >> ? ? ? ? ? ?return x*self.pdf(x, *args) >> ? ?else: >> ? ? ? ?def fun(x, *args): >> ? ? ? ? ? ?return fn(x)*self.pdf(x, *args) >> ? ?if lb is None: >> ? ? ? ?lb = self.a >> ? ?if ub is None: >> ? ? ? ?ub = self.b >> ? ?if conditional: >> ? ? ? ?invfac = self.sf(lb,*args) - self.sf(ub,*args) >> ? ?else: >> ? ? ? ?invfac = 1.0 >> ? ?return integrate.quad(fun, lb, ub, >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?args=args)[0]/invfac >> >> >> print expectedfunc(stats.norm, lambda(x): (x)**4) >> print expectedfunc(stats.norm, lambda(x): min((x)**2,1.5**2)) >> >> #Jonathans version >> from scipy.stats import norm as Gaussian >> c = 1.5 # our "cutoff" point >> c = 0.5 # try another value >> tmp = 2 * Gaussian.cdf(c) - 1 >> gamma = tmp + c**2 * (1 - tmp) - 2 * c * Gaussian.pdf(c) >> print gamma >> >> print expectedfunc(stats.norm, lambda(x): min((x)**2,c**2)) >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From pgmdevlist at gmail.com Fri Jul 31 16:49:33 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Fri, 31 Jul 2009 16:49:33 -0400 Subject: [SciPy-User] How do I add a data point into scikits.timeseries In-Reply-To: <3af1998d0907311155w20c1fdb9o799664ae16134148@mail.gmail.com> References: <3af1998d0907311155w20c1fdb9o799664ae16134148@mail.gmail.com> Message-ID: On Jul 31, 2009, at 2:55 PM, Leon Sit wrote: > Hi everyone: > > I am trying to append a tuple of data, (datetime, data), into a > scikits.timeserieses. I do not know the size in advance so it must > be appended. Is there a way to do it efficiently as this will be a > primary operation that in done in order of million times. Just like with standard arrays, appending to a TimeSeries is not very efficient. 
Instead, you may want to consider creating a list of tuples and
appending to it as you go. Then, create a TimeSeries out of it at the
very end.

>>> import datetime as dt
>>> import numpy as np
>>> import scikits.timeseries as ts
>>> test=[(dt.datetime(2009,01,d),f) for (d,f) in
zip(range(1,16),np.random.rand(15))]
>>> # Make a tmp array
>>> tmp=np.array(test,dtype=[('dates',object),('f',float)])
>>> ts.time_series(tmp['f'],dates=tmp['dates'])

Note that you'd have to define a frequency or you'd end up with an
'undefined' frequency...

From jh at physics.ucf.edu  Fri Jul 31 17:04:46 2009
From: jh at physics.ucf.edu (Joe Harrington)
Date: Fri, 31 Jul 2009 17:04:46 -0400
Subject: [SciPy-User] SciPy Foundation
In-Reply-To: <3d375d730907311227l52282e91mb244f00f4e6eab7a@mail.gmail.com>
	(message from Robert Kern on Fri, 31 Jul 2009 14:27:14 -0500)
References: 
	<3d375d730907311227l52282e91mb244f00f4e6eab7a@mail.gmail.com>
Message-ID: 

Robert wrote:
> On Fri, Jul 31, 2009 at 12:06, Joe Harrington wrote:
> > Enthought appears to have code covered by virtue of
> > having hired Travis, Robert, etc.;

> Eh, what? We work on numpy and scipy in our spare time, just like
> everyone else. There are rare occasions when a client wants to fund a
> particular feature, or we need to fix a bug in the course of our work,
> but that's a far cry from having "code covered".

Then please accept my profusest apologies!  Eric mentioned to me that
Enthought had paid significantly for scipy development and I thought
that meant a portion of developers' time.  Perhaps this was just in
the past.

--jh--

From robert.kern at gmail.com  Fri Jul 31 17:13:48 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 31 Jul 2009 16:13:48 -0500
Subject: [SciPy-User] SciPy Foundation
In-Reply-To: 
References: 
	<3d375d730907311227l52282e91mb244f00f4e6eab7a@mail.gmail.com>
Message-ID: <3d375d730907311413s3bdf7021n4a3056b3edc989b@mail.gmail.com>

On Fri, Jul 31, 2009 at 16:04, Joe Harrington wrote:
> Robert wrote:
>> On Fri, Jul 31, 2009 at 12:06, Joe Harrington wrote:
>> > Enthought appears to have code covered by virtue of
>> > having hired Travis, Robert, etc.;
>
>> Eh, what? We work on numpy and scipy in our spare time, just like
>> everyone else. There are rare occasions when a client wants to fund a
>> particular feature, or we need to fix a bug in the course of our work,
>> but that's a far cry from having "code covered".
>
> Then please accept my profusest apologies! ?Eric mentioned to me that
> Enthought had paid significantly for scipy development and I thought
> that meant a portion of developers' time. ?Perhaps this was just in
> the past.

Still do; it's just not part of our daily duties and is usually focused
on what we need, not general maintenance. Not to mention the
infrastructure support.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
 -- Umberto Eco

From stefan at sun.ac.za  Fri Jul 31 17:48:01 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Fri, 31 Jul 2009 23:48:01 +0200
Subject: [SciPy-User] book on scipy
In-Reply-To: <20090731154612.GN10466@nosyntax.net>
References: <288df32a0907310100p354d5e81hafd366618a219870@mail.gmail.com>
	<20090731080522.GB9207@phare.normalesup.org>
	<7f014ea60907310147k780dbc43xa6acd045ed6cae19@mail.gmail.com>
	<20090731084952.GC9207@phare.normalesup.org>
	<4A72D47C.1060002@lpta.in2p3.fr>
	<4A72F006.5040206@american.edu>
	<20090731133733.GA23447@phare.normalesup.org>
	<20090731154612.GN10466@nosyntax.net>
Message-ID: <9457e7c80907311448med24475leebdfccef15e8f9c@mail.gmail.com>

Hi Rex

2009/7/31 rex :
> Speaking of which, this example below does not work correctly with
> matplotlib-0.98.1-1+lenny4 (the most recent in the Debian repository).

I'm glad you spotted this problem. Would you mind creating an account on
docs.scipy.org and fixing it?

Thanks!
St?fan

From eadrogue at gmx.net  Mon Jul 27 15:07:27 2009
From: eadrogue at gmx.net (Ernest =?iso-8859-1?Q?Adrogu=E9?=)
Date: Mon, 27 Jul 2009 21:07:27 +0200
Subject: [SciPy-User] [SciPy-user] optimization using fmin_bfgs with
	gradient information
In-Reply-To: 
References: <20090718123544.GA3122@doriath.local>
	<20090719110543.GA7138@doriath.local>
	<20090719183923.GA21105@doriath.local>
	<20090722163332.GA3515@doriath.local>
Message-ID: <20090727190727.GA32763@doriath.local>

23/07/09 @ 10:42 (+0200), thus spake Sebastian Walter:
> 1)
> approx_fprime probably uses finite differences to approximate the
> gradient, so at x = 2 it computes
> (f(2+epsilon) - f(2))/epsilon = (2+epsilon)**2/epsilon,
> which can be very large
>
> 2) When using AD to evaluate derivatives, only one path through the
> control flow graph (which is defined by the if statement in your
> function) is taken.
> I.e. if x<2, AD will not know that for x>=2 it would have taken
> another way through the control flow graph of your algorithm.
>
> I.e. it would compute
> f'(x<2) = 0 and f'(x>=2) = x**2 without realizing that the function
> is nondifferentiable at that point.
>
> 3)
> If you have such an objective function as you have shown above, then
> your optimization problem is not well-formed because it does not have
> a real minimizer x_*.
> I'd assume that your objective function looks more like
> f(x) = abs(x)
> which is non-differentiable at x=0.
> This kind of objective function destroys the convergence properties of
> algorithms that assume continuously differentiable objective
> functions, i.e. the convergence to the minimizer can be very slow.
> For such problems, special purpose algorithms, e.g. the so-called
> "bundle methods" can be used, which converge superlinearly, as far as
> I know.

Nice explanation. I have made some tests with a simplified model, and
have found fmin_slsqp to be the fastest method by far. It takes around
one and a half minutes to estimate a model with 128 parameters.

  p      time
  16      0.36s
  32      4.56s
  64     32.74s
  128   3.66min

Second comes fsolve, but its performance decreases rapidly as the number
of parameters increases. So I will probably stick to slsqp, unless I
find a better solution.

These "bundle methods" sound interesting. I have found an interesting
paper with one such algorithm described in detail, which might come in
handy if I finally decide to try it out.
Ernest

From Dharhas.Pothina at twdb.state.tx.us  Mon Jul 27 16:24:19 2009
From: Dharhas.Pothina at twdb.state.tx.us (Dharhas Pothina)
Date: Mon, 27 Jul 2009 15:24:19 -0500
Subject: [SciPy-User] Ubuntu vs Fedora for scientific work?
Message-ID: <4A6DC6A2.63BA.009B.0@twdb.state.tx.us>

Hi All,

This is slightly off topic, but I felt that this list's membership
would have good input on this question. I am presently running Fedora
8 on my main workstation and it is getting a bit long in the tooth.
Back when I set this machine up, RHEL was too much of a pain to use
because scientific packages were always extremely outdated and
difficult to install. Ubuntu was nice to use at home, but installing
scientific packages like the Intel Fortran compiler etc. was
complicated (not impossible, just more work than I wanted). So I went
with Fedora, which has worked pretty well so far.

Recently, I've noticed that a lot of scientific packages now have
Ubuntu repositories and even the Intel compiler has an Ubuntu option.
So I'm trying to decide whether to go with Fedora 11 or Ubuntu Jaunty.
I'm not trying to start a flame war, but I'm interested in what
people's experience has been.

thanks,

- dharhas

From josef.pktd at gmail.com  Mon Jul 27 16:45:37 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Mon, 27 Jul 2009 16:45:37 -0400
Subject: [SciPy-User] Scipy and statistics: probability density function
In-Reply-To: <3d375d730907270859w42327684gb726549eea094147@mail.gmail.com>
References: <3d375d730907270859w42327684gb726549eea094147@mail.gmail.com>
Message-ID: <1cd32cbb0907271345v7ce27a5frfd0513ce3a62c20a@mail.gmail.com>

On Mon, Jul 27, 2009 at 11:59 AM, Robert Kern wrote:
> On Mon, Jul 27, 2009 at 02:33, Daniel J
> Farrell wrote:
>> Dear list,
>>
>> I am looking for some of the functionality provided by the GNU
>> Scientific Library histogram module (http://www.gnu.org/software/gsl/manual/html_node/Histograms.html
>> ).
>>
>> In particular, I need to be able to create a probability density
>> function from my histogram of data. This will allow inverse look-ups
>> to be performed, i.e. for a random number (0-->1) find the associated
>> probability (http://www.gnu.org/software/gsl/manual/html_node/The-histogram-probability-distribution-struct.html
>> ). This allows samples to be returned weighted by the probability of
>> the distribution -- which is a common task!
>
> It looks like you want a CDF (or rather its inverse, the PPF) rather
> than a PDF. Anyways, this is straightforward. Compute the histogram
> using normed=False. Find the cumulative sum and divide by the sum to
> get the (smoothed) empirical CDF. Prepend a 0.0 to this, and then this
> will align with the edges array that is also returned. Then you can
> use linear interpolation to do lookups. If you use the edges array as
> "X" and the empirical CDF as "Y", then this is a CDF. If you use the
> empirical CDF array as "X" and the edges as "Y", then this is a PPF.
>
>
> In [12]: x = np.random.uniform(0, 10, size=1000)
>
> In [13]: hist, edges = np.histogram(x)
>
> In [15]: hist
> Out[15]: array([100, 101, 104, 108,  80, 111,  96,  88, 108, 104])
>
> In [16]: edges
> Out[16]:
> array([  3.53879571e-04,   1.00026423e+00,   2.00017458e+00,
>          3.00008493e+00,   3.99999528e+00,   4.99990563e+00,
>          5.99981598e+00,   6.99972633e+00,   7.99963668e+00,
>          8.99954704e+00,   9.99945739e+00])
>
> In [18]: ecdf = np.hstack([0.0, hist.cumsum() / float(hist.sum())])
>
> In [19]: ecdf
> Out[19]:
> array([ 0.   ,  0.1  ,  0.201,  0.305,  0.413,  0.493,  0.604,  0.7  ,
>         0.788,  0.896,  1.   ])
>
> In [20]: np.interp(np.linspace(0, 10), edges, ecdf)
> Out[20]:
> array([ 0.        ,  0.0203746 ,  0.04078459,  0.06119459,  0.08160458,
>         0.10203472,  0.12264881,  0.14326291,  0.163877  ,  0.18449109,
>         0.20522712,  0.22645351,  0.24767991,  0.2689063 ,  0.29013269,
>         0.31160366,  0.33364646,  0.35568925,  0.37773204,  0.39977483,
>         0.41953158,  0.43585957,  0.45218756,  0.46851556,  0.48484355,
>         0.50433802,  0.52699311,  0.54964821,  0.5723033 ,  0.59495839,
>         0.61577382,  0.63536742,  0.65496101,  0.6745546 ,  0.6941482 ,
>         0.71259664,  0.73055743,  0.74851823,  0.76647902,  0.78443982,
>         0.80567348,  0.82771627,  0.84975906,  0.87180185,  0.89384465,
>         0.91515087,  0.93637726,  0.95760365,  0.97883004,  1.        ])
>
> In [21]: np.interp(np.linspace(0, 1), ecdf, edges)
> Out[21]:
> array([  3.53879571e-04,   2.04417216e-01,   4.08480553e-01,
>          6.12543890e-01,   8.16607227e-01,   1.02046852e+00,
>          1.22251143e+00,   1.42455434e+00,   1.62659724e+00,
>          1.82864015e+00,   2.02980301e+00,   2.22601775e+00,
>          2.42223250e+00,   2.61844725e+00,   2.81466200e+00,
>          3.01047705e+00,   3.19942458e+00,   3.38837211e+00,
>          3.57731965e+00,   3.76626718e+00,   3.95521472e+00,
>          4.19462069e+00,   4.44969986e+00,   4.70477903e+00,
>          4.95985820e+00,   5.15488346e+00,   5.33872431e+00,
>          5.52256515e+00,   5.70640600e+00,   5.89024684e+00,
>          6.08569264e+00,   6.29825861e+00,   6.51082459e+00,
>          6.72339057e+00,   6.93595654e+00,   7.16204944e+00,
>          7.39393960e+00,   7.62582975e+00,   7.85771991e+00,
>          8.07294833e+00,   8.26189586e+00,   8.45084340e+00,
>          8.63979093e+00,   8.82873846e+00,   9.01838365e+00,
>          9.21459840e+00,   9.41081315e+00,   9.60702789e+00,
>          9.80324264e+00,   9.99945739e+00])
>

This, together with the answer last week about a piecewise-constant
pdf, would make most of the elements for a distribution class, similar
to the generic discrete distribution, only for the continuous case.

It could take as constructor argument either a data array (and use
histogram as in this example, with a theoretically optimal bin size)
or a set of points and pdf values as in last week's example.

Does quad work correctly for integration of a piecewise function?
If yes, then we could just use the generic methods by subclassing
rv_continuous.

Is it worth collecting these results?
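To make this concrete, a rough and untested sketch of the kind of
subclass I mean (all names made up, just to fix ideas):

import numpy as np
from scipy import stats

class histogram_gen(stats.rv_continuous):
    # continuous distribution with a piecewise-constant pdf
    # estimated from data with np.histogram
    def __init__(self, data, bins=10):
        self._hist, self._edges = np.histogram(data, bins=bins,
                                               normed=True)
        stats.rv_continuous.__init__(self, a=self._edges[0],
                                     b=self._edges[-1],
                                     name='histogram')

    def _pdf(self, x):
        # the density is constant within each bin: find the bin of x
        idx = np.clip(np.searchsorted(self._edges, x, side='right') - 1,
                      0, len(self._hist) - 1)
        return self._hist[idx]

x = np.random.uniform(0, 10, size=1000)
dist = histogram_gen(x)
# the generic machinery integrates _pdf with quad for cdf/ppf,
# which is exactly the piecewise integration question above
print dist.cdf(5.0), dist.ppf(0.5)

Only _pdf is distribution-specific here; cdf, ppf and rvs would come
from the generic methods, if quad handles the jumps at the bin edges.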
Josef

> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>  -- Umberto Eco
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>

From s.mientki at ru.nl  Mon Jul 27 18:29:07 2009
From: s.mientki at ru.nl (Stef Mientki)
Date: Tue, 28 Jul 2009 00:29:07 +0200
Subject: [SciPy-User] optical ray tracing
In-Reply-To: 
References: 
Message-ID: <4A6E2A33.6030202@ru.nl>

Elliot Hallmark wrote:
> howdy,
>
> I'm looking to do some raytracing for a nonimaging design and there
> are no opensource packages that address the geometry needed. Scipy
> seems like a good package to become familiar with anyway (I miss
> mathematica now that i'm no longer a university student), but i was
> wondering if anyone has already done some optical ray tracing work in
> scipy that i could build off of.
>
> Wouldn't be too hard to do from scratch, but my focus is on the design
> and the fewer excursions into building new tools the better.
>
This might be interesting:
http://www-ee.eng.hawaii.edu/~zqyun/caevp.html
I'd love to integrate this into PyLab_Works, so the optical system can
be manipulated both through interacting with the graphical window and
by code. I tried to contact the guy, but he doesn't respond.

cheers,
Stef

> thanks,
>
> elliot
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>
>
>

From yosefmel at post.tau.ac.il  Tue Jul 28 02:42:04 2009
From: yosefmel at post.tau.ac.il (Yosef Meller)
Date: Tue, 28 Jul 2009 09:42:04 +0300
Subject: [SciPy-User] optical ray tracing
In-Reply-To: 
References: 
Message-ID: <200907280942.05360.yosefmel@post.tau.ac.il>

On Monday 27 July 2009 19:34:37 Elliot Hallmark wrote:
> I'm looking to do some raytracing for a nonimaging design and there
> are no opensource packages that address the geometry needed. Scipy
> seems like a good package to become familiar with anyway (I miss
> mathematica now that i'm no longer a university student), but i was
> wondering if anyone has already done some optical ray tracing work in
> scipy that i could build off of.
>
> Wouldn't be too hard to do from scratch, but my focus is on the design
> and the fewer excursions into building new tools the better.

I have an intern writing an optical ray-tracer right now. In about a
month we expect to have the basic framework in releasable shape, but
if you're impatient you can have a look at her git tree:

http://github.com/jasminef/tracer/tree/optic_tests

Currently some of the tests fail, but we have the following:
* Construction of assemblies of objects
* surfaces: flat and spherical (parabolic on the way)
* specular reflection and refraction

We are focusing on modular design and programmability, not speed. It
should, then, not be too hard to add a new surface geometry.

Again, this is still not in releasable shape, but maybe it'll help.

From fperez.net at gmail.com  Tue Jul 28 13:39:44 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Tue, 28 Jul 2009 10:39:44 -0700
Subject: [SciPy-User] [SciPy-user] Numpy/Scipy and the Python African Tour
In-Reply-To: <20090723141737.GF10321@phare.normalesup.org>
References: <20090723141737.GF10321@phare.normalesup.org>
Message-ID: 

Hi Emmanuelle,

On Thu, Jul 23, 2009 at 7:17 AM, Emmanuelle Gouillart wrote:
> Dear users of Numpy and Scipy,
>
> here is an informal report on the last event of the Python African Tour,
> which took place in Dakar (Senegal) on July 6-10th. It might interest

Many thanks for this great post; I am very happy to see this kind of
effort sprouting organically in different places. I had the chance to
do something similar with Stefan, his advisor and colleagues a few
years ago in Stellenbosch, South Africa, and it was a phenomenal
experience. The audience was extremely engaged and the facilities
first-rate, and I enjoyed it enormously.

I'm glad to hear that you've set up a mailing list for your students
to continue growing, but I'd also encourage them to subscribe to the
'normal' numpy/scipy lists.
Even if they initially feel a bit shy about posting, simply 'lurking' on the lists for a while and reading the flow of information can be very useful both to learn technical ideas and to get familiar with the discussion patterns of a community. This will ease them into naturally engaging the upstream projects when they feel comfortable, which I think in the long term is both useful and necessary. > Speaking about difficulties, one may say that there was some disparity in > the level of experience inside the group, some being very well at ease > with array programming while others were obviously discovering > programming. It was therefore difficult to meet everybody's needs, but > well, I hope everybody learned something! Audience heterogeneity: that's always the hardest part in these things, and we face it everywhere. It's the main source of concern even at the main Scipy conference intro tutorial, since the audience tends to be all over the place in terms of background and experience. Mmh, I just had a thought. A number of us have by now taught reasonably organized workshops of this kind in different places and institutions. I wonder if it wouldn't be a good idea to have a page on the scipy site collecting this information, with a short paragraph about the event and a link to any additional external materials that may exist. This could encourage others to create similar things in their own communities, whether with the help of one of the previous presenters or fully independently. Does that sound useful? In any case, thanks again for the great post! Best, f