From alan.mcintyre at gmail.com Wed Oct 1 00:37:38 2008 From: alan.mcintyre at gmail.com (Alan McIntyre) Date: Tue, 30 Sep 2008 21:37:38 -0700 Subject: [SciPy-dev] Next scipy release 0.7 In-Reply-To: References: Message-ID: <1d36917a0809302137u78a9644fu9da38e3a79157a40@mail.gmail.com> On Mon, Sep 29, 2008 at 9:55 AM, Jarrod Millman wrote: >> Just curious. Is there a new deadline for scipy 0.7 ? > > If anyone has time and interest in helping, it would make a huge > difference in getting this release out. If someone could hunt down > and fix the test failures and clean up all the deprecation > warnings, we would be much closer to getting this release out the > door. I'll try to fix some of the deprecation warnings and/or look at failing tests this week or over the weekend. Alan

From millman at berkeley.edu Wed Oct 1 01:11:46 2008 From: millman at berkeley.edu (Jarrod Millman) Date: Tue, 30 Sep 2008 22:11:46 -0700 Subject: [SciPy-dev] Next scipy release 0.7 In-Reply-To: <1d36917a0809302137u78a9644fu9da38e3a79157a40@mail.gmail.com> References: <1d36917a0809302137u78a9644fu9da38e3a79157a40@mail.gmail.com> Message-ID: On Tue, Sep 30, 2008 at 9:37 PM, Alan McIntyre wrote: > On Mon, Sep 29, 2008 at 9:55 AM, Jarrod Millman wrote: >>> Just curious. Is there a new deadline for scipy 0.7 ? >> >> If anyone has time and interest in helping, it would make a huge >> difference in getting this release out. If someone could hunt down >> and fix the test failures and clean up all the deprecation >> warnings, we would be much closer to getting this release out the >> door. > > I'll try to fix some of the deprecation warnings and/or look at > failing tests this week or over the weekend. Thanks! The sooner we can get some of these issues resolved the sooner we can get a release out. -- Jarrod Millman Computational Infrastructure for Research Labs 10 Giannini Hall, UC Berkeley phone: 510.643.4014 http://cirl.berkeley.edu/

From prabhu at aero.iitb.ac.in Wed Oct 1 02:09:08 2008 From: prabhu at aero.iitb.ac.in (Prabhu Ramachandran) Date: Wed, 01 Oct 2008 11:39:08 +0530 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: References: <48E1A6A2.4050406@noaa.gov> <320fb6e00809300129j1b45ee21k6756b4c2b1945d84@mail.gmail.com> Message-ID: <48E31404.7000009@aero.iitb.ac.in> Anne Archibald wrote: > 2008/9/30 Peter : > I think the profusion of different implementations is an argument for > including this in scipy. I think it is also an argument for providing > a standard interface with (at least potentially) several different > implementations. At the moment, that proposed interface looks like: [...] > In the first two cases, missing neighbors are represented with an > infinite distance and an invalid index. In the last case, distances > and indices are both either lists (if there's only one query point) or > object arrays of lists (if there are many query points). If only one > neighbor is requested, the array does not have a dimension of length 1 > in the which-neighbor position. If (potentially) many neighbors are > returned, they are sorted by distance, nearest first. > > What do you think of this interface? This is nice. How about a keyword argument "sort=True" that lets people choose if they want the results sorted by distance or not? I have seen cases where unsorted results are useful. cheers, prabhu
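To make the option under discussion concrete, here is a minimal hedged sketch of what a query method with such a sort keyword could look like; the name, signature, and brute-force scan are all hypothetical stand-ins, not the actual scipy.spatial API:

import heapq

def query(points, x, k=1, sort=True):
    # Squared Euclidean distance; a real implementation would traverse a
    # kd-tree instead of scanning every point.
    def sqdist(p):
        return sum((pi - xi) ** 2 for pi, xi in zip(p, x))
    heap = []  # max-heap of (-distance, index), capped at k entries
    for i, p in enumerate(points):
        heapq.heappush(heap, (-sqdist(p), i))
        if len(heap) > k:
            heapq.heappop(heap)  # drop the farthest candidate so far
    results = [(-negd, i) for negd, i in heap]
    if sort:
        results.sort()  # nearest first; skipping this is the saving Prabhu wants
    return results

print(query([(0, 0), (1, 1), (3, 2)], (0.9, 1.2), k=2))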
From peridot.faceted at gmail.com Wed Oct 1 05:46:45 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Wed, 1 Oct 2008 05:46:45 -0400 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: <48E31404.7000009@aero.iitb.ac.in> References: <48E1A6A2.4050406@noaa.gov> <320fb6e00809300129j1b45ee21k6756b4c2b1945d84@mail.gmail.com> <48E31404.7000009@aero.iitb.ac.in> Message-ID: 2008/10/1 Prabhu Ramachandran : > Anne Archibald wrote: >> 2008/9/30 Peter : >> I think the profusion of different implementations is an argument for >> including this in scipy. I think it is also an argument for providing >> a standard interface with (at least potentially) several different >> implementations. At the moment, that proposed interface looks like: > [...] >> In the first two cases, missing neighbors are represented with an >> infinite distance and an invalid index. In the last case, distances >> and indices are both either lists (if there's only one query point) or >> object arrays of lists (if there are many query points). If only one >> neighbor is requested, the array does not have a dimension of length 1 >> in the which-neighbor position. If (potentially) many neighbors are >> returned, they are sorted by distance, nearest first. >> >> What do you think of this interface? > > This is nice. How about a keyword argument "sort=True" that lets people > choose if they want the results sorted by distance or not? I have seen > cases where unsorted results are useful. That is a possibility. My current code produces results that are in a heap, so sorting can in principle be an O(n) activity. Since the number of results returned is rarely going to be large, I'm not sure the gain in efficiency is worth the loss in determinism. And additional API complexity does have a cost - in an ideal world it should at the least double the number of unit tests. I think perhaps this should wait on a C implementation. Anne

From peridot.faceted at gmail.com Wed Oct 1 05:54:29 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Wed, 1 Oct 2008 05:54:29 -0400 Subject: [SciPy-dev] Proposal: scipy.spatial In-Reply-To: <11F5C399-FE46-4321-AC9E-8B0699D06A84@nist.gov> References: <11F5C399-FE46-4321-AC9E-8B0699D06A84@nist.gov> Message-ID: 2008/9/30 Jonathan Guyer : > > On Sep 30, 2008, at 8:45 PM, Anne Archibald wrote: > >> (Also I have no experience with SVN and >> merging, but reports suggest it's a horrible experience.) > > It is not a horrible experience. It involves a little bookkeeping, but > it's entirely manageable. Our collected instructions for handling > branches of FiPy (derived largely from the SVN book at red-bean) are at http://matforge.org/fipy/browser/trunk/documentation/ADMINISTRATA.txt > . We branch all the time. Thanks for the pointer! I've now split it off as a separate branch, "spatial". With luck it won't be much work to reintegrate it once 0.7 is out. Anne

From david at ar.media.kyoto-u.ac.jp Wed Oct 1 05:42:53 2008 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Wed, 01 Oct 2008 18:42:53 +0900 Subject: [SciPy-dev] Proposal: scipy.spatial In-Reply-To: References: <11F5C399-FE46-4321-AC9E-8B0699D06A84@nist.gov> Message-ID: <48E3461D.6090808@ar.media.kyoto-u.ac.jp> Anne Archibald wrote: > > Thanks for the pointer! I've now split it off as a separate branch, > "spatial". With luck it won't be much work to reintegrate it once 0.7 > is out.
This is also documented on the numpy wiki: http://projects.scipy.org/scipy/numpy/wiki/MakingBranches As Robert said, the problem of svn with merging is unlikely to happen in your case (problems arise mostly when you use multiple branches and merge between them; as long as you have one branch which keeps following the trunk, svn's poor merging capabilities can be worked around with svnmerge). cheers, David

From Per.Brodtkorb at ffi.no Wed Oct 1 07:14:34 2008 From: Per.Brodtkorb at ffi.no (Per.Brodtkorb at ffi.no) Date: Wed, 1 Oct 2008 13:14:34 +0200 Subject: [SciPy-dev] Fwd: tests and partial patch for scipy.stats.distributions In-Reply-To: <1cd32cbb0809290551g7c00a5ddh44369352ece506d2@mail.gmail.com> References: <1cd32cbb0809282100x75eccfb9la109e92032f798ac@mail.gmail.com> <1cd32cbb0809290551g7c00a5ddh44369352ece506d2@mail.gmail.com> Message-ID: <1ED225FF18AA8B48AC192F7E1D032C6E0C5C1D@hbu-posten.ffi.no> I have looked into the bugs in the patch of the discrete distributions you sent. The error in the _ppf method of dlaplace can be corrected by replacing it with:

class dlaplace_gen(rv_discrete):
    .....
    def _ppf(self, q, a):
        const = 1.0/(1+exp(-a))
        cons2 = 1+exp(a)
        ind = q < const
        return ceil(where(ind, log(q*cons2)/a-1, -log((1-q)*cons2)/a))
    ......

The tests written for the generic moment fail for those distributions that set the limits (self.a and/or self.b) for the support of the distribution in the self._argcheck method. I think the generic_moment method is meant to be private to the self._munp method, since it does only limited input checks. Perhaps it would be better to rename generic_moment to _generic_moment. Per A.
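A corrected discrete _ppf like the one above can be sanity-checked against the distribution's cdf; here is a small hedged sketch of that check (the shape parameter a=0.8 and the quantile grid are arbitrary choices, not part of Per's patch):

import numpy as np
from scipy import stats

a = 0.8  # arbitrary shape parameter
q = np.linspace(0.01, 0.99, 9)
k = stats.dlaplace.ppf(q, a)
# For a discrete distribution, ppf(q) should be the smallest integer k
# with cdf(k) >= q, so these two conditions pin the fix down:
assert np.all(stats.dlaplace.cdf(k, a) >= q)
assert np.all(stats.dlaplace.cdf(k - 1, a) < q)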
-----Original Message----- From: scipy-dev-bounces at scipy.org [mailto:scipy-dev-bounces at scipy.org] On behalf of josef.pktd at gmail.com Sent: 29 September 2008 14:51 To: scipy-dev at scipy.org Subject: [SciPy-dev] Fwd: tests and partial patch for scipy.stats.distributions removed patch because of file size, mail bounced ---------- Forwarded message ---------- From: josef.pktd at gmail.com Date: Mon, 29 Sep 2008 00:00:25 -0400 Subject: tests and partial patch for scipy.stats.distributions To: scipy-dev at scipy.org To help in bug hunting and patching of the discrete distributions in scipy.stats, I wrote several test scripts that could be incorporated in scipy tests. The attachment contains the corrected scipy.stats.distributions.py file and 2 test files. The tests check all discrete distributions at predefined (not randomly chosen) values. Some distributions have some wrong results for some other range of parameters. To try out the patch, it is possible just to back up the current scipy.stats.distributions.py and copy the patched version in its place. Right now, I still left many comments (and some commented-out failed attempts at fixing some persistent bugs). In direct comparison with the current trunk file this might help in understanding what the underlying bug is. Since there are a large number of bug fixes in this patch, if any of them get incorporated in scipy, they can be selectively copied. Many bugs only show up once other bugs are fixed. In terms of importance and sequence, the bug fix categories are roughly:
* nin correction for vectorize with *args: without this, some methods raise exceptions, e.g. _ppf (I think after many shaky attempts, I found the correct solution)
* _ppf did not handle infinite support properly and requires a .tolist in the call to ``place`` (no idea why, but this works)
* _drv2_moment(self, n, *args) was completely broken; it didn't even have a return, and it is the base for generic moment calculations

individual fixes:
* recipinvgauss_gen: rvs added, didn't work before
* def moment: for the 2nd moment it returned the mean (both discrete and continuous distributions)
* randint: -1 missing in _ppf, definitions differ from those in the description pdf
* dlaplace: _ppf gives wrong results in tests, replaced with the generic calculation

(Note: the first vectorize nin correction in continuous distributions is still at the wrong spot. I was only looking at discrete distributions in the last few days, since I figured out how vectorize nin works.) If you want a cleaner patch or more information, I can provide it next week. Because I never used numpy/scipy heavily, it took me a long time to figure out what is going on, but it should be more obvious to an expert. But overall my impression is that using features of scipy.stats.distributions that are not commonly used is pretty risky; until recently they didn't have any stringent tests, and they still don't have reasonably large test coverage. For serious work, I would want to make sure that the results are actually correct. There are still several things that are not covered, e.g. handling of loc and scale, other methods, correctness over the full range of parameters. I haven't checked the continuous distributions since the initial fuzz tests, but the test coverage for all methods looks pretty low. Josef

Test of Basic Properties
------------------------
The first test just tests for basic properties and their consistency, e.g. cdf, pmf, ppf, stats and moments, and the internal methods, e.g. _ppf. The tests check quite a bit for internal consistency (private methods) since I used the tests for debugging and trying out fixes.

>python C:\Programs\Python25\Scripts\nosetests-script.py -s test_discrete_basic.py
...
with current trunk, I get for this test:
Ran 112 tests in 0.140s
FAILED (errors=41, failures=15)
after the bugfixes, I get:
Ran 112 tests in 8.672s
FAILED (failures=5)
some failures in binom, dlaplace, zipf are still remaining

Chisquare Test
--------------
The second test is a chisquare test for the random variables to be close to the theoretical distribution as defined by .cdf.
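As a hedged illustration of what such a chisquare check can look like (the distribution, sample size, and binning below are arbitrary choices, not josef's actual test code):

import numpy as np
from scipy import stats

n = 2000
rvs = stats.poisson.rvs(3.0, size=n)
ks = np.arange(0, 11)
observed = np.array([np.sum(rvs == k) for k in ks])
expected = n * stats.poisson.pmf(ks, 3.0)
# lump the upper tail into one bin so expected counts stay reasonable
observed = np.append(observed, n - observed.sum())
expected = np.append(expected, n - expected.sum())
chi2, pval = stats.chisquare(observed, expected)
assert pval > 0.01  # a correct sampler will still occasionally fail a threshold like this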
with current trunk I get for this test:

>python C:\Programs\Python25\Scripts\nosetests-script.py -s test_ndiscrete.py
Ran 12 tests in 0.453s
FAILED (errors=5)

after patching scipy.stats.distributions, the only test failure left is in logser, where I think the numpy.random random numbers are wrong:

>python C:\Programs\Python25\Scripts\nosetests-script.py -s test_ndiscrete.py
bernoulli. binom. boltzmann. dlaplace. geom. hypergeom. logserF nbinom. planck. poisson. randint. zipf.
======================================================================
FAIL: test_ndiscrete.test_discrete_rvscdf('logser', (0.59999999999999998,))
----------------------------------------------------------------------
Traceback (most recent call last):
  File "c:\programs\python25\lib\site-packages\nose-0.10.3-py2.5.egg\nose\case.py", line 182, in runTest
    self.test(*self.arg)
  File "C:\Josef\work-oth\sort\pypi\test_ndiscrete.py", line 76, in checkchisquare_discrete
    assert (pval > alpha), 'chisquare - test for %s at arg = %s' % (distname, str(arg))
AssertionError: chisquare - test for logser at arg = (0.59999999999999998,)
----------------------------------------------------------------------
Ran 12 tests in 19.781s
FAILED (failures=1)

scipy.stats tests: no failures
-----------------------------------------
running the current trunk test for scipy.stats on the pre- and post-patch distributions.py gives no errors:

>>> from scipy import stats
>>> stats.test()
Running unit tests for scipy.stats
NumPy version 1.2.0rc2
NumPy is installed in C:\Programs\Python25\lib\site-packages\numpy
SciPy version 0.7.0.dev
SciPy is installed in C:\Josef\_progs\virtualpy25\envscipy\lib\site-packages\scipy
Python version 2.5.2 (r252:60911, Feb 21 2008, 13:11:45) [MSC v.1310 32 bit (Intel)]
nose version 0.10.3
................................................................................
C:\Josef\_progs\virtualpy25\envscipy\lib\site-packages\scipy\stats\morestats.py:618: UserWarning: Ties preclude use of exact statistic.
  warnings.warn("Ties preclude use of exact statistic.")
............................................................
C:\Programs\Python25\lib\site-packages\numpy\lib\function_base.py:343: Warning:
The semantics of histogram has been modified in the current release to fix long-standing issues with outliers handling. The main changes concern
1. the definition of the bin edges, now including the rightmost edge, and
2. the handling of upper outliers, now ignored rather than tallied in the rightmost bin.
The previous behaviour is still accessible using `new=False`, but is scheduled to be deprecated in the next release (1.3). *This warning will not be printed in the 1.3 release.* Use `new=True` to bypass this warning. Please read the docstring for more information.
  """, Warning)
.............................................................................................
----------------------------------------------------------------------
Ran 233 tests in 2.266s
OK

From guyer at nist.gov Wed Oct 1 09:54:12 2008 From: guyer at nist.gov (Jonathan Guyer) Date: Wed, 1 Oct 2008 09:54:12 -0400 Subject: [SciPy-dev] Proposal: scipy.spatial In-Reply-To: <48E3461D.6090808@ar.media.kyoto-u.ac.jp> References: <11F5C399-FE46-4321-AC9E-8B0699D06A84@nist.gov> <48E3461D.6090808@ar.media.kyoto-u.ac.jp> Message-ID: <68B2C042-C4FD-426F-8E1D-5E207465B377@nist.gov> On Oct 1, 2008, at 5:42 AM, David Cournapeau wrote: > As Robert said, the problem of svn with merging is unlikely to happen in > your case (problems arise mostly when you use multiple branches and > merge between them; as long as you have one branch which keeps following > the trunk, We keep things simple. We release from the trunk and strive to do all development on branches from the trunk. I don't believe we've ever branched a branch and I don't think we'd ever want to. The keys to keeping things manageable for us are (1) disciplined commenting and (2) merging *from* the trunk to the branches on a pathologically regular basis.
If you don't do the latter, then when you do try to merge your branch to the trunk, you can have an insurmountable amount of work facing you. > svn's poor merging capabilities can be worked around with > svnmerge). Thanks for the pointer to svnmerge. Although I'd read about it early on, I'd forgotten about it. I definitely see its value in keeping the history intact.

From david at ar.media.kyoto-u.ac.jp Wed Oct 1 10:15:46 2008 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Wed, 01 Oct 2008 23:15:46 +0900 Subject: [SciPy-dev] Proposal: scipy.spatial In-Reply-To: <68B2C042-C4FD-426F-8E1D-5E207465B377@nist.gov> References: <11F5C399-FE46-4321-AC9E-8B0699D06A84@nist.gov> <48E3461D.6090808@ar.media.kyoto-u.ac.jp> <68B2C042-C4FD-426F-8E1D-5E207465B377@nist.gov> Message-ID: <48E38612.1010002@ar.media.kyoto-u.ac.jp> Jonathan Guyer wrote: > > We keep things simple. We release from the trunk and strive to do all > development on branches from the trunk. I don't believe we've ever > branched a branch and I don't think we'd ever want to. My remark was general: I am one of the most vocal (too vocal, I guess) on the ML about switching from svn to something which handles merging correctly, and I wanted to clarify why it is not really a problem in Anne's case, and how people use svnmerge on numpy as documented (since it is quite easy to screw up the first time you do it without any documentation). > The keys to > keeping things manageable for us are (1) disciplined commenting and > (2) merging *from* the trunk to the branches on a pathologically > regular basis. If you don't do the latter, then when you do try to > merge your branch to the trunk, you can have an insurmountable amount > of work facing you. Whatever works best for you, of course. cheers, David

From thouis at broad.mit.edu Wed Oct 1 11:52:21 2008 From: thouis at broad.mit.edu (Thouis (Ray) Jones) Date: Wed, 1 Oct 2008 11:52:21 -0400 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles Message-ID: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> I just submitted a patch (http://scipy.org/scipy/scipy/ticket/743) extending the functionality of io.matlab for v5 files, adding writers for structs, cell arrays, & objects, and readers+writers for function handles and 64-bit ints. The patch modifies the numpy types of strings, structs, and objects loaded from matlab:
- matlab strings are loaded as numpy arrays of unicode strings, rather than numpy arrays of objects.
- structs are loaded as numpy arrays with dtype [(field_name, object), ...]
- matlab objects are wrapped in an mio5.MatlabObject instance, which contains a 1x1 struct & the object classname.
- (function handles are also wrapped in a special object in mio5.py)
These changes were to make the scipy<->matlab mapping more explicit. The changes to the string type probably don't break any existing code. The changes to the struct & matlab objects probably do. It's possible to make both of these backward compatible, if that's a priority. We're looking for feedback on that (or any other) issue. Ray Jones
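A hedged, hand-built illustration of the string change described above (these arrays are constructed directly; they are not output of the actual reader code):

import numpy as np

old_style = np.array([u'abc', u'de'], dtype=object)  # looks just like a cell array
new_style = np.array([u'abc', u'de'])                # dtype '<U3': unambiguously a char array
print(old_style.dtype, new_style.dtype)

The point of the change is visible in the dtypes: with dtype=object the writer cannot tell strings apart from a cell array, while a unicode dtype makes the matlab type deducible.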
From prabhu at aero.iitb.ac.in Wed Oct 1 12:53:01 2008 From: prabhu at aero.iitb.ac.in (Prabhu Ramachandran) Date: Wed, 01 Oct 2008 22:23:01 +0530 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: References: <48E1A6A2.4050406@noaa.gov> <320fb6e00809300129j1b45ee21k6756b4c2b1945d84@mail.gmail.com> <48E31404.7000009@aero.iitb.ac.in> Message-ID: <48E3AAED.6080601@aero.iitb.ac.in> Anne Archibald wrote: >> This is nice. How about a keyword argument "sort=True" that lets people >> choose if they want the results sorted by distance or not? I have seen >> cases where unsorted results are useful. > > That is a possibility. My current code produces results that are in a > heap, so sorting can in principle be an O(n) activity. Since the > number of results returned is rarely going to be large, I'm not sure I thought this was a discussion on a good API for such an algorithm (sorry if I misunderstood!). Given that context, if this is a general purpose algorithm I'd rather not assume that the number of results is typically going to be small. > the gain in efficiency is worth the loss in determinism. And If there is no need for sorting and there are a large number of nearest neighbor queries (in my case there are O(N) nearest neighbor queries), this quickly adds up to an unnecessary O(N) inefficiency that I'd like to avoid. > additional API complexity does have a cost - in an ideal world it > should at the least double the number of unit tests. I think perhaps > this should wait on a C implementation. I don't see why the addition of a sort keyword will double your unit tests. Another option would be to return unsorted results and have a separate sort method if it makes testing easier. Thanks for listening. cheers, prabhu

From wnbell at gmail.com Wed Oct 1 13:56:22 2008 From: wnbell at gmail.com (Nathan Bell) Date: Wed, 1 Oct 2008 13:56:22 -0400 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: <48E3AAED.6080601@aero.iitb.ac.in> References: <48E1A6A2.4050406@noaa.gov> <320fb6e00809300129j1b45ee21k6756b4c2b1945d84@mail.gmail.com> <48E31404.7000009@aero.iitb.ac.in> <48E3AAED.6080601@aero.iitb.ac.in> Message-ID: On Wed, Oct 1, 2008 at 12:53 PM, Prabhu Ramachandran wrote: > > I thought this was a discussion on a good API for such an algorithm > (sorry if I misunderstood!). Given that context, if this is a general > purpose algorithm I'd rather not assume that the number of results is > typically going to be small. However, you *can* assume log(k) of a k-nearest neighbor query is going to be small. > > If there is no need for sorting and there are a large number of nearest > neighbor queries (in my case there are O(N) nearest neighbor queries), > this quickly adds up to an unnecessary O(N) inefficiency that I'd like > to avoid. > You might not sort a k-nearest neighbor query, but as Anne has said, you end up needing a sorted structure (like a priority queue) to perform the query itself anyway. The only case where not sorting makes sense is an "in sphere" query where there's no need to prioritize the results. In this case, I'd still argue that the overhead of sorting isn't likely to matter. > > I don't see why the addition of a sort keyword will double your unit > tests. Another option would be to return unsorted results and have a > separate sort method if it makes testing easier. > It would be more profitable to offload the queries to native code. That is a much larger speedup than the O(log k) sorting factor. Interpreter overhead is going to dominate any pure-python hierarchy traversal. -- Nathan Bell wnbell at gmail.com http://graphics.cs.uiuc.edu/~wnbell/
From zachary.pincus at yale.edu Wed Oct 1 14:52:32 2008 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Wed, 1 Oct 2008 14:52:32 -0400 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> Message-ID: Hi Ray, Those changes look really useful! Stéfan van der Walt (on this email list) and Matthew Brett (not on the list?) seem to be coordinating the matlab IO (at least, they were the ones who shepherded my recent patch to the same). I'm not 100% sure what the backwards-compatibility guarantees for scipy are right now, but the matlab 5 IO is pretty new, so hopefully useful changes like these won't be a problem. (Can anyone say for sure?) One comment -- perhaps for objects, would a 0-d "array scalar" (i.e. shape=()) struct be better than a 1x1 struct? I'm not really sure either way... Anyhow, thanks for the patch. This should be very helpful. Best, Zach On Oct 1, 2008, at 11:52 AM, Thouis (Ray) Jones wrote: > I just submitted a patch (http://scipy.org/scipy/scipy/ticket/743) > extending the functionality of io.matlab for v5 files, adding writers > for structs, cell arrays, & objects, and readers+writers for function > handles and 64-bit ints. > > The patch modifies the numpy types of strings, structs, and objects > loaded from matlab: > matlab strings are loaded as numpy arrays of unicode strings, rather > than numpy arrays of objects. > structs are loaded as numpy arrays with dtype [(field_name, > object), ...] > matlab objects are wrapped in an mio5.MatlabObject instance, which > contains a 1x1 struct & the object classname. > (function handles are also wrapped in a special object in mio5.py) > > These changes were to make the scipy<->matlab mapping more explicit. > The changes to the string type probably don't break any existing code. > The changes to the struct & matlab objects probably do. It's > possible to make both of these backward compatible, if that's a > priority. We're looking for feedback on that (or any other) issue. > > Ray Jones > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://projects.scipy.org/mailman/listinfo/scipy-dev

From millman at berkeley.edu Wed Oct 1 15:27:16 2008 From: millman at berkeley.edu (Jarrod Millman) Date: Wed, 1 Oct 2008 12:27:16 -0700 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> Message-ID: On Wed, Oct 1, 2008 at 11:52 AM, Zachary Pincus wrote: > I'm not 100% sure what the backwards-compatibility guarantees for > scipy are right now, but the matlab 5 IO is pretty new, so hopefully > useful changes like these won't be a problem. (Can anyone say for > sure?) In general, backward compatibility for SciPy isn't as big a constraint as it is for NumPy. NumPy is an extremely mature codebase that is widely used by many projects, so we have to be *extremely* careful about ABI/API breaks. SciPy on the other hand is still considered 'beta' software, so we should be focusing on getting it to the point that we consider it more 'production' code.
Once we get to that point we will need to start being much more careful about ABI/API breaks. Of course, no one is going to want to use SciPy if we arbitrarily break their code with every release of SciPy. So we just need to strike a reasonable balance of fixing and improving SciPy, while not chasing off users. In this particular case, the changes to the mat file io look very reasonable. I will let Stéfan van der Walt or Matthew Brett make the final call, but I would like to see this patch (or a revised version of it) get in. Thanks, -- Jarrod Millman Computational Infrastructure for Research Labs 10 Giannini Hall, UC Berkeley phone: 510.643.4014 http://cirl.berkeley.edu/

From stefan at sun.ac.za Wed Oct 1 16:20:31 2008 From: stefan at sun.ac.za (Stéfan van der Walt) Date: Wed, 1 Oct 2008 22:20:31 +0200 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> Message-ID: <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> Hi Ray 2008/10/1 Thouis (Ray) Jones : > These changes were to make the scipy<->matlab mapping more explicit. > The changes to the string type probably don't break any existing code. > The changes to the struct & matlab objects probably do. It's > possible to make both of these backward compatible, if that's a > priority. We're looking for feedback on that (or any other) issue. Thank you for improving the SciPy IO capabilities! Before we go ahead, would you kindly do the following: - Make sure that the existing test suite passes with your patch applied (I can't do that from here right now) - Write tests to verify the working of the new functionality You'll see that we have a very thorough round-tripping test set for the IO code already, and it would be good to keep this up to date with your changes. Requiring tests for contributions is certainly not intended to slow the adoption of patches; we simply need to see what the code does, and must ensure that it does so correctly. Ideally, we'd also like documentation for all the classes in the IO module, but it would hardly be fair to ask you to write them :) Thanks again, Stéfan

From thouis at broad.mit.edu Wed Oct 1 20:11:45 2008 From: thouis at broad.mit.edu (Thouis (Ray) Jones) Date: Wed, 1 Oct 2008 20:11:45 -0400 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> Message-ID: <6c17e6f50810011711n34da69b8w58ecd42938cec547@mail.gmail.com> On Wed, Oct 1, 2008 at 4:20 PM, Stéfan van der Walt wrote: > Thank you for improving the SciPy IO capabilities! Before we go > ahead, would you kindly do the following: > > - Make sure that the existing test suite passes with your patch > applied (I can't do that from here right now) > - Write tests to verify the working of the new functionality Is it acceptable for us to modify the tests to handle the new form of some matlab objects? Structures, objects, and function handles are definitely different. String arrays have a different dtype, as well. Also, I noticed there is a test for reading matlab objects. I think this should be removed, as the structure that matlab uses to save objects is not well documented (besides being a matlab 1x1 struct array with an extra element for the classname). We'll try to get this done soon. Best, Ray Jones
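The round-tripping Stéfan refers to is straightforward to picture; a minimal hedged sketch of such a test follows (the variable and file names are arbitrary, not taken from test_mio.py):

import numpy as np
from numpy.testing import assert_array_equal
from scipy.io import savemat, loadmat

# write a dict of arrays, read it back, and require exact agreement
original = {'a': np.array([[1.0, 2.0], [3.0, 4.0]])}
savemat('roundtrip_check.mat', original)
reloaded = loadmat('roundtrip_check.mat')
assert_array_equal(reloaded['a'], original['a'])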
From cournape at gmail.com Wed Oct 1 20:37:58 2008 From: cournape at gmail.com (David Cournapeau) Date: Thu, 2 Oct 2008 09:37:58 +0900 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> Message-ID: <5b8d13220810011737x4b38a449ybe495dc7b961060a@mail.gmail.com> On Thu, Oct 2, 2008 at 5:20 AM, Stéfan van der Walt wrote: > Hi Ray > > 2008/10/1 Thouis (Ray) Jones : >> These changes were to make the scipy<->matlab mapping more explicit. >> The changes to the string type probably don't break any existing code. >> The changes to the struct & matlab objects probably do. It's >> possible to make both of these backward compatible, if that's a >> priority. We're looking for feedback on that (or any other) issue. > > Thank you for improving the SciPy IO capabilities! Before we go > ahead, would you kindly do the following: Would it be possible for this to wait until after 0.7? More exactly, I would like to keep the current behavior in 0.7, with a deprecation warning, and maybe make the new code available through some mechanism so people can adapt (it would replace the old behavior in 0.8). Even if scipy is not considered stable, I think we should allow at least one version for deprecation. cheers, David

From stefan at sun.ac.za Thu Oct 2 03:41:27 2008 From: stefan at sun.ac.za (Stéfan van der Walt) Date: Thu, 2 Oct 2008 09:41:27 +0200 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: <5b8d13220810011737x4b38a449ybe495dc7b961060a@mail.gmail.com> References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> <5b8d13220810011737x4b38a449ybe495dc7b961060a@mail.gmail.com> Message-ID: <9457e7c80810020041x485c1cb1w4581009630d53a6@mail.gmail.com> 2008/10/2 David Cournapeau : > Would it be possible for this to wait until after 0.7? More exactly, I > would like to keep the current behavior in 0.7, with a deprecation > warning, and maybe make the new code available through some mechanism > so people can adapt (it would replace the old behavior in 0.8). Even if > scipy is not considered stable, I think we should allow at least one > version for deprecation. We should certainly look at applying the non-API-changing parts, though. I'm not sure what the best way is to represent these structures on the Python side. Thouis, you've thought about this a lot: could you tell us the pros and cons of switching to the new representation? I don't mind the tests changing, as long as the round-trip completes successfully and all the data is still read.
Cheers Stéfan

From josef.pktd at gmail.com Thu Oct 2 09:29:48 2008 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 2 Oct 2008 09:29:48 -0400 Subject: [SciPy-dev] Fwd: tests and partial patch for scipy.stats.distributions In-Reply-To: <1ED225FF18AA8B48AC192F7E1D032C6E0C5C1D@hbu-posten.ffi.no> References: <1cd32cbb0809282100x75eccfb9la109e92032f798ac@mail.gmail.com> <1cd32cbb0809290551g7c00a5ddh44369352ece506d2@mail.gmail.com> <1ED225FF18AA8B48AC192F7E1D032C6E0C5C1D@hbu-posten.ffi.no> Message-ID: <1cd32cbb0810020629w56cf8e89t58954256e87bb29e@mail.gmail.com> I finally filed a ticket on this on scipy trac: http://scipy.org/scipy/scipy/ticket/745 On Wed, Oct 1, 2008 at 7:14 AM, wrote: > I have looked into the bugs in the patch of the discrete distributions you sent. The error in the _ppf method of dlaplace can be corrected by replacing it with:
>
> class dlaplace_gen(rv_discrete):
>     .....
>     def _ppf(self, q, a):
>         const = 1.0/(1+exp(-a))
>         cons2 = 1+exp(a)
>         ind = q < const
>         return ceil(where(ind, log(q*cons2)/a-1, -log((1-q)*cons2)/a))
>     ......
>
Thanks, I didn't see the misplaced bracket; I changed my corrections to this. Do you have any idea or reference about the theoretical second moment of dlaplace? > The tests written for the generic moment fail for those distributions that set the limits (self.a and/or self.b) for the support of the distribution in the self._argcheck method. I think the generic_moment method is meant to be private to the self._munp method, since it does only limited input checks. > Perhaps it would be better to rename generic_moment to _generic_moment. I agree; I didn't check this systematically, but I guess 'vec_generic_moment' and 'vecentropy' are also private methods. > > Per A. I did not realize that some of my tests on private methods might fail for some distributions by design. To make it a bit clearer, I separated the tests into tests for public methods and tests for private methods. I still like the tests of private methods, even if some are known failures, since they help to narrow down what the source of an exception or test failure in the public methods is. Josef

From ljosa at broad.mit.edu Thu Oct 2 10:14:14 2008 From: ljosa at broad.mit.edu (Vebjorn Ljosa) Date: Thu, 02 Oct 2008 10:14:14 -0400 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: <9457e7c80810020041x485c1cb1w4581009630d53a6@mail.gmail.com> References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> <5b8d13220810011737x4b38a449ybe495dc7b961060a@mail.gmail.com> <9457e7c80810020041x485c1cb1w4581009630d53a6@mail.gmail.com> Message-ID: Stéfan van der Walt writes: > > We should certainly look at applying the non-API-changing parts, > though. I'm not sure what the best way is to represent these > structures on the Python side. > > Thouis, you've thought about this a lot: could you tell us the pros > and cons of switching to the new representation? The reason Ray and I changed some of the representations is that we wanted the mapping from Matlab to Python to be symmetric: anything read from a MAT-file should be represented in a way that allows the writer code to write it back in its original form. This requires that the original Matlab type be deducible from the Python representation. * Struct arrays: Matlab struct arrays were previously represented as numpy arrays of dtype=object filled with instances of mat_struct.
The problem is that Matlab cell arrays were also represented as numpy arrays of dtype=object. The writer code could in most cases have identified structs by looking at the contents (instances of mat_struct), but there was no way to distinguish a 0x0 cell array from a 0x0 struct array. We therefore opted to represent struct arrays as numpy record arrays. In order not to break existing code, we could introduce a keyword argument to loadmat that selects the old or new representation, similar to numpy.histogram's "new" argument. In 0.7, leaving the argument out would default to False (old behavior), but give a deprecation warning. Later versions can first change the default to True and then remove the old behavior entirely. The best name I can think of for this keyword argument is "struct_as_record". * Char arrays/strings: Same story. At the lowest level, the code represented char arrays as numpy arrays of dtype='U1', which is fine. A very useful "processor function" (in miobase) turns them into arrays of strings, however. This processor function created an array of dtype=object. We changed this to 'U...' so the array could be distinguished from a cell array. I think this is unlikely to break any code, do you agree? * Objects: This change in representation was purely for our convenience, and we should be able to fix our patch to keep the old representation. Vebjorn
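A hand-built, hedged illustration of the record-array mapping and of the 0x0 ambiguity it resolves (the field names are made up; this is not loadmat output):

import numpy as np

rec = np.empty((1, 1), dtype=[('name', object), ('data', object)])
rec[0, 0]['name'] = np.array([u'trial1'])
rec[0, 0]['data'] = np.array([[0.5, 1.5]])

# A 0x0 struct array keeps its field dtype, so it remains distinguishable
# from a 0x0 cell array, which is a plain dtype=object array:
empty_struct = np.empty((0, 0), dtype=rec.dtype)
empty_cell = np.empty((0, 0), dtype=object)
print(empty_struct.dtype.names)  # ('name', 'data')
print(empty_cell.dtype.names)    # None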
From matthew.brett at gmail.com Thu Oct 2 11:34:05 2008 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 2 Oct 2008 08:34:05 -0700 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> <5b8d13220810011737x4b38a449ybe495dc7b961060a@mail.gmail.com> <9457e7c80810020041x485c1cb1w4581009630d53a6@mail.gmail.com> Message-ID: <1e2af89e0810020834t64e104b2oaf5abad1b07d7333@mail.gmail.com> Hi, > The reason Ray and I changed some of the representations is that we > wanted the mapping from Matlab to Python to be symmetric: anything read > from a MAT-file should be represented in a way that allows the writer > code to write it back in its original form. This requires that the > original Matlab type be deducible from the Python representation. Thank you very much for doing this work - it was certainly a major deficiency in the scipy matlab io that it could not do this roundtripping for structs etc. For the API change, the API for the io has already changed in the trunk, due to an earlier (well-motivated) change by Nathan Bell. I would love to see a version of this patch in 0.7 if at all possible. Best, Matthew

From rmay31 at gmail.com Thu Oct 2 13:08:10 2008 From: rmay31 at gmail.com (Ryan May) Date: Thu, 02 Oct 2008 12:08:10 -0500 Subject: [SciPy-dev] Add density of water to constants? Message-ID: <48E4FFFA.2030007@gmail.com> Hi, I was just cleaning up my own hand-rolled constants file and noticed that the only one not already in scipy's own constants file was the density of liquid water. I'm not sure how much we want to accumulate more of these physical constants in there, but here's a patch nonetheless. Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: constants_water_density.diff
URL:

From pav at iki.fi Thu Oct 2 14:17:43 2008 From: pav at iki.fi (Pauli Virtanen) Date: Thu, 2 Oct 2008 18:17:43 +0000 (UTC) Subject: [SciPy-dev] Add density of water to constants? References: <48E4FFFA.2030007@gmail.com> Message-ID: Thu, 02 Oct 2008 12:08:10 -0500, Ryan May wrote: > I was just cleaning up my own hand-rolled constants file and noticed > that the only one not already in scipy's own constants file was the > density of liquid water. I'm not sure how much we want to accumulate > more of these physical constants in there, but here's a patch > nonetheless. I'm not sure if the density of liquid water is actually a good constant. First, it varies between 999.972 and 992.2 kg/m^3 already at temperatures between 0 °C and 40 °C. Second, the kilogram is not defined in terms of the density of water, but in terms of a metal prototype weight that was manufactured based on a single measurement of the density of water. The maximum density of water turns out to be, funnily enough, not 1000 kg/m^3 but slightly less. [1,2] .. [1] "Water". Wikipedia, http://en.wikipedia.org/wiki/H2O .. [2] "Kilogram". Wikipedia, http://en.wikipedia.org/wiki/Kilogram -- Pauli Virtanen

From rmay31 at gmail.com Thu Oct 2 14:30:46 2008 From: rmay31 at gmail.com (Ryan May) Date: Thu, 02 Oct 2008 13:30:46 -0500 Subject: [SciPy-dev] Add density of water to constants? In-Reply-To: References: <48E4FFFA.2030007@gmail.com> Message-ID: <48E51356.5000806@gmail.com> Pauli Virtanen wrote: > Thu, 02 Oct 2008 12:08:10 -0500, Ryan May wrote: >> I was just cleaning up my own hand-rolled constants file and noticed >> that the only one not already in scipy's own constants file was the >> density of liquid water. I'm not sure how much we want to accumulate >> more of these physical constants in there, but here's a patch >> nonetheless. > > I'm not sure if the density of liquid water is actually a good constant. > > First, it varies between 999.972 and 992.2 kg/m^3 already at temperatures > between 0 °C and 40 °C. > > Second, the kilogram is not defined in terms of the density of water, but > in terms of a metal prototype weight that was manufactured based on a > single measurement of the density of water. The maximum density of water > turns out to be, funnily enough, not 1000 kg/m^3 but slightly less. [1,2] > > .. [1] "Water". Wikipedia, http://en.wikipedia.org/wiki/H2O > .. [2] "Kilogram". Wikipedia, http://en.wikipedia.org/wiki/Kilogram Wow. Point taken. I knew about the kilogram "definition", but never thought about its impact on defining density. Granted, in meteorology, 999.972 and 992.2 are more or less the same. :) Maybe I'll just have to maintain a few meteorologically relevant constants in a file myself then.
Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma

From cournape at gmail.com Fri Oct 3 01:27:19 2008 From: cournape at gmail.com (David Cournapeau) Date: Fri, 3 Oct 2008 14:27:19 +0900 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> <5b8d13220810011737x4b38a449ybe495dc7b961060a@mail.gmail.com> <9457e7c80810020041x485c1cb1w4581009630d53a6@mail.gmail.com> Message-ID: <5b8d13220810022227k2d70bea3y2d91f5783ce78b8@mail.gmail.com> On Thu, Oct 2, 2008 at 11:14 PM, Vebjorn Ljosa wrote: > > The reason Ray and I changed some of the representations is that we > wanted the mapping from Matlab to Python to be symmetric: anything read > from a MAT-file should be represented in a way that allows the writer > code to write it back in its original form. This requires that the > original Matlab type be deducible from the Python representation. FWIW, I agree completely that the current io for matlab would be greatly improved by those changes; there is no arguing that those changes should come in scipy. The problem is that some people have code which depends on the current shortcoming (well, *I* have), and it is better to have at least one full version which says that it will change (with a deprecation warning). The code could be in something like io.newmatlab, which would be changed to io.matlab in 0.8: this way, if people want to use it now, they can. cheers, David

From cournape at gmail.com Fri Oct 3 01:29:13 2008 From: cournape at gmail.com (David Cournapeau) Date: Fri, 3 Oct 2008 14:29:13 +0900 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: <1e2af89e0810020834t64e104b2oaf5abad1b07d7333@mail.gmail.com> References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> <5b8d13220810011737x4b38a449ybe495dc7b961060a@mail.gmail.com> <9457e7c80810020041x485c1cb1w4581009630d53a6@mail.gmail.com> <1e2af89e0810020834t64e104b2oaf5abad1b07d7333@mail.gmail.com> Message-ID: <5b8d13220810022229s5523d3bex93355ed687c8d6ea@mail.gmail.com> On Fri, Oct 3, 2008 at 12:34 AM, Matthew Brett wrote: > For the API change, the API for the io has already changed in the > trunk, due to an earlier (well-motivated) change by Nathan Bell. I > would love to see a version of this patch in 0.7 if at all possible. Yes, I know, it broke my packages :) I don't think it is good practice. We allow for scipy API changes, but it would still be better if we were saying that something is about to change. It really does not cost much, I think.
cheers, David

From matthew.brett at gmail.com Fri Oct 3 01:54:45 2008 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 2 Oct 2008 22:54:45 -0700 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: <5b8d13220810022229s5523d3bex93355ed687c8d6ea@mail.gmail.com> References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> <5b8d13220810011737x4b38a449ybe495dc7b961060a@mail.gmail.com> <9457e7c80810020041x485c1cb1w4581009630d53a6@mail.gmail.com> <1e2af89e0810020834t64e104b2oaf5abad1b07d7333@mail.gmail.com> <5b8d13220810022229s5523d3bex93355ed687c8d6ea@mail.gmail.com> Message-ID: <1e2af89e0810022254k3ff22324k39d656d6d0b0aa38@mail.gmail.com> Hi, >> For the API change, the API for the io has already changed in the >> trunk, due to an earlier (well-motivated) change by Nathan Bell. I >> would love to see a version of this patch in 0.7 if at all possible. > > Yes, I know, it broke my packages :) I don't think it is good > practice. We allow for scipy API changes, but it would still be better > if we were saying that something is about to change. It really does > not cost much, I think. All your points are well taken, and I voted rather weakly against the earlier change, but, given that the API has already changed compared to the previous release, and that Scipy releases are infrequent, my weak vote would still be for going the whole way here, but I'm happy to be outvoted. See you, Matthew

From thouis at broad.mit.edu Fri Oct 3 10:49:51 2008 From: thouis at broad.mit.edu (Thouis (Ray) Jones) Date: Fri, 3 Oct 2008 10:49:51 -0400 Subject: [SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: <1e2af89e0810022254k3ff22324k39d656d6d0b0aa38@mail.gmail.com> References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> <5b8d13220810011737x4b38a449ybe495dc7b961060a@mail.gmail.com> <9457e7c80810020041x485c1cb1w4581009630d53a6@mail.gmail.com> <1e2af89e0810020834t64e104b2oaf5abad1b07d7333@mail.gmail.com> <5b8d13220810022229s5523d3bex93355ed687c8d6ea@mail.gmail.com> <1e2af89e0810022254k3ff22324k39d656d6d0b0aa38@mail.gmail.com> Message-ID: <6c17e6f50810030749p1966266bs23701e77f8c05b3a@mail.gmail.com> I've updated the patch to provide backwards compatibility for matlab structures, controlled by a keyword argument to loadmat (struct_as_record), which defaults to False (old style), and gives a deprecation warning if False. All tests in test_mio.py pass. I updated some of the tests for the new object structure. I tested both options for loading structures with some of my own data, but it would be useful if David could check his code against it (if that's not too difficult), as we don't really have a relevant codebase prior to these changes. Ray Jones (note that I goofed and didn't supersede the old patch in the Trac, but it's obvious which is the right one.)
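A hedged sketch of the keyword behavior Ray describes (heavily simplified; loadmat_like and its body are illustrative stand-ins for the actual mio code):

import warnings

def loadmat_like(file_name, struct_as_record=False):
    if not struct_as_record:
        warnings.warn("loading matlab structs as mat_struct objects is "
                      "deprecated; pass struct_as_record=True to get the "
                      "record-array representation", DeprecationWarning)
    # ... dispatch to the old or new struct reader here ...

The design mirrors numpy.histogram's "new" argument, as Vebjorn suggested: the old behavior stays the default for one release, warns, and can later be flipped and removed.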
From josef.pktd at gmail.com Fri Oct 3 10:54:53 2008 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 3 Oct 2008 10:54:53 -0400 Subject: [SciPy-dev] Next scipy release 0.7 In-Reply-To: References: <1d36917a0809302137u78a9644fu9da38e3a79157a40@mail.gmail.com> Message-ID: <1cd32cbb0810030754x149929e9sae69626dbca8c50a@mail.gmail.com> Hi, I think that my fixes to scipy.stats.distributions in http://scipy.org/scipy/scipy/ticket/745 are now good enough that they can go into 0.7, after some review by numpy experts (which I am not). As far as I can see, none of my fixes changes anything in the existing functionality, but they fix several methods that throw exceptions in the current trunk, and they correct incorrect calculations in some places. I provided basic tests. The test results now look pretty good, compared to before. This is still not a complete test suite for scipy.stats.distributions, but, in my opinion, a big enhancement to the current tests. Josef

From charlesr.harris at gmail.com Sat Oct 4 04:40:53 2008 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 4 Oct 2008 02:40:53 -0600 Subject: [SciPy-dev] Question about Kaiser Implementation in firwin In-Reply-To: <44B9C5B0.1050201@ieee.org> References: <44B9C5B0.1050201@ieee.org> Message-ID: On Sat, Jul 15, 2006 at 10:50 PM, Travis Oliphant wrote: > Buehler, Eric (AGRE) wrote: > > Hello, > > > > However, the last line of the firwin function is causing me some > > heartburn. > > filter_design.py
> >
> > 1538     win = get_window(window,N,fftbins=1)
> > 1539     alpha = N//2
> > 1540     m = numpy.arange(0,N)
> > 1541     h = win*special.sinc(cutoff*(m-alpha))
> > 1542     return h / sum(h)
> >
> > Line 1542 of filter_design.py, "return h / sum(h)", normalizes the > > function where it doesn't seem necessary, at least in the kaiser window > > case. Without the normalization, the kaiser window already returns a > > value of 1 at the zero frequency point. This normalization scales all > > of the data, making the window difficult to use in the frequency domain. > > > > Can someone point me to the rationale for this line? Looking at the > > code, this seems to be a pretty recent change (within the last year/year > > and a half). > > > > > I'm not aware of if and when the change was made. Was there a time > when firwin did not have this normalization? > > The normalization is done so that the resulting filter has a 0 dB gain at > DC (which is the center of the pass-band). > > In other words, fft(h)[0] is approximately equal to 1. This is usually > what is desired as firwin returns "time-domain" filter coefficients. > The return value of the function is not designed for use in the > "frequency"-domain. I'm not even sure what you mean by that in this > context. > > The intended usage of the result of firwin is in a convolution: > > convolve(h, <signal>) > I almost always use area one. For looking at spectra and such it is best to be able to integrate over a band in the frequency domain without the results changing much. In particular, it is best to preserve the shape of continua. This is also the best way to treat power spectra. Chuck
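A hedged check of the normalization being discussed (the tap count and cutoff are arbitrary choices):

import numpy as np
from scipy.signal import firwin

h = firwin(21, 0.3)              # lowpass taps, normalized so sum(h) == 1
print(np.sum(h))                 # ~1.0: Chuck's "area one"
print(np.abs(np.fft.fft(h))[0])  # DC gain ~1.0, i.e. Travis's 0 dB at band center

For a lowpass filter the two views coincide, since the DC bin of the FFT is exactly the sum of the taps.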
From prabhu at aero.iitb.ac.in Sat Oct 4 08:37:37 2008 From: prabhu at aero.iitb.ac.in (Prabhu Ramachandran) Date: Sat, 04 Oct 2008 18:07:37 +0530 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: References: <48E1A6A2.4050406@noaa.gov> <320fb6e00809300129j1b45ee21k6756b4c2b1945d84@mail.gmail.com> <48E31404.7000009@aero.iitb.ac.in> <48E3AAED.6080601@aero.iitb.ac.in> Message-ID: <48E76391.5050508@aero.iitb.ac.in> Hi, Nathan Bell wrote: > On Wed, Oct 1, 2008 at 12:53 PM, Prabhu Ramachandran > wrote: >> I thought this was a discussion on a good API for such an algorithm >> (sorry if I misunderstood!). Given that context, if this is a general >> purpose algorithm I'd rather not assume that the number of results is >> typically going to be small. > > However, you *can* assume log(k) of a k-nearest neighbor query is > going to be small. True. >> If there is no need for sorting and there are a large number of nearest >> neighbor queries (in my case there are O(N) nearest neighbor queries), >> this quickly adds up to an unnecessary O(N) inefficiency that I'd like >> to avoid. > > You might not sort a k-nearest neighbor query, but as Anne > has said, you end up needing a sorted structure (like a priority > queue) to perform the query itself anyway. In the context of a kd-tree, yes, but not in other contexts like the "in sphere" query you mention below. > The only case where not sorting makes sense is an "in sphere" query > where there's no need to prioritize the results. In this case, I'd > still argue that the overhead of sorting isn't likely to matter. Well, in the case of the kd-tree I understand that this is not likely to matter. I was under the mistaken impression that this was to be a common API. I re-read the thread and understand now that this is a first cut for just the KDTree. Personally, I have a need for finding the exact nearest neighbors within an epsilon ball (in 1, 2, and 3D) of *all* the nodes. Typically epsilon is known (or at least bounded) a priori. In this context I typically build a binary/quad/oct tree and limit the size of the leaves to around epsilon. I avoid tree traversals by iterating over the leaves in order to get to the points they contain -- note that the target point is always contained in the tree which is why I can do this. Anyway, in this context sorting the results ends up in O(N k log(k)) operations if you have k neighbors on the average. This is expensive. In any case, I am not sure there is a general need for this kind of data structure. I need it a whole lot. If there is a general need I'll be glad to try and extract any of my code and contribute it to scipy.spatial. cheers, prabhu
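A hedged 2-D sketch of the fixed-radius scheme Prabhu describes: bucket the points into grid cells of size roughly eps, then scan each point's 3x3 cell neighborhood (this is a simplified stand-in for his tree-based code, which is not shown on the list):

import numpy as np
from collections import defaultdict

def neighbors_within(points, eps):
    grid = defaultdict(list)
    cells = [tuple(c) for c in np.floor(points / eps).astype(int)]
    for i, c in enumerate(cells):
        grid[c].append(i)
    result = []
    for i, (cx, cy) in enumerate(cells):
        near = []
        for dx in (-1, 0, 1):
            for dy in (-1, 0, 1):
                for j in grid.get((cx + dx, cy + dy), []):
                    if j != i and np.sum((points[i] - points[j]) ** 2) <= eps ** 2:
                        near.append(j)
        result.append(near)  # left unsorted, as discussed: no per-query sort cost
    return result

pts = np.random.rand(100, 2)
print(len(neighbors_within(pts, 0.1)[50]))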
From peridot.faceted at gmail.com Sat Oct 4 08:57:23 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Sat, 4 Oct 2008 08:57:23 -0400 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: <48E76391.5050508@aero.iitb.ac.in> References: <48E1A6A2.4050406@noaa.gov> <320fb6e00809300129j1b45ee21k6756b4c2b1945d84@mail.gmail.com> <48E31404.7000009@aero.iitb.ac.in> <48E3AAED.6080601@aero.iitb.ac.in> <48E76391.5050508@aero.iitb.ac.in> Message-ID: 2008/10/4 Prabhu Ramachandran : > Hi, > > Nathan Bell wrote: >> On Wed, Oct 1, 2008 at 12:53 PM, Prabhu Ramachandran >> wrote: >>> I thought this was a discussion on a good API for such an algorithm >>> (sorry if I misunderstood!). Given that context, if this is a general >>> purpose algorithm I'd rather not assume that the number of results is >>> typically going to be small. >> >> However, you *can* assume log(k) of a k-nearest neighbor query is >> going to be small. > > True. > >>> If there is no need for sorting and there are a large number of nearest >>> neighbor queries (in my case there are O(N) nearest neighbor queries), >>> this quickly adds up to an unnecessary O(N) inefficiency that I'd like >>> to avoid. >> >> You might not sort a k-nearest neighbor query, but as Anne >> has said, you end up needing a sorted structure (like a priority >> queue) to perform the query itself anyway. > > In the context of a kd-tree, yes, but not in other contexts like the "in > sphere" query you mention below. > >> The only case where not sorting makes sense is an "in sphere" query >> where there's no need to prioritize the results. In this case, I'd >> still argue that the overhead of sorting isn't likely to matter. > > Well, in the case of the kd-tree I understand that this is not likely to > matter. I was under the mistaken impression that this was to be a > common API. I re-read the thread and understand now that this is a > first cut for just the KDTree. > > Personally, I have a need for finding the exact nearest neighbors within > an epsilon ball (in 1, 2, and 3D) of *all* the nodes. Typically epsilon > is known (or at least bounded) a priori. In this context I typically > build a binary/quad/oct tree and limit the size of the leaves to around > epsilon. I avoid tree traversals by iterating over the leaves in order > to get to the points they contain -- note that the target point is > always contained in the tree which is why I can do this. Anyway, in > this context sorting the results ends up in O(N k log(k)) operations if > you have k neighbors on the average. This is expensive. > > In any case, I am not sure there is a general need for this kind of data > structure. I need it a whole lot. If there is a general need I'll be > glad to try and extract any of my code and contribute it to scipy.spatial. There does seem to be a general need for a two-tree query that extracts (or just counts) all items of one tree within a ball centered at each item of another tree. You're right, too, that "in a ball" queries often don't naturally produce sorted results, or even known distances. But I think this has to be a different query. In fact, even for single-point queries it may make sense to have a separate query function for in-a-ball queries. Anne

From prabhu at aero.iitb.ac.in Sat Oct 4 12:19:44 2008 From: prabhu at aero.iitb.ac.in (Prabhu Ramachandran) Date: Sat, 04 Oct 2008 21:49:44 +0530 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: References: <48E1A6A2.4050406@noaa.gov> <320fb6e00809300129j1b45ee21k6756b4c2b1945d84@mail.gmail.com> <48E31404.7000009@aero.iitb.ac.in> <48E3AAED.6080601@aero.iitb.ac.in> <48E76391.5050508@aero.iitb.ac.in> Message-ID: <48E797A0.3070604@aero.iitb.ac.in> Anne Archibald wrote: > 2008/10/4 Prabhu Ramachandran : >> In any case, I am not sure there is a general need for this kind of data >> structure. I need it a whole lot. If there is a general need I'll be >> glad to try and extract any of my code and contribute it to scipy.spatial. > > There does seem to be a general need for a two-tree query that > extracts (or just counts) all items of one tree within a ball centered > at each item of another tree.
You're right, too, that "in a ball" > queries often don't naturally produce sorted results, or even known > distances. But I think this has to be a different query. In fact, even > for single-point queries it may make sense to have a separate query > function for in-a-ball queries. True and another reason for a different approach would be performance. According to Lee and Wong (1977, Acta Informatica) the worst case performance for partial region searches on k-d trees is O(k N^{1-1/k}). So if the dimensionality is small and the region size is known we could do a lot better with other specialized algorithms for these cases. cheers, prabhu From peridot.faceted at gmail.com Sat Oct 4 18:07:15 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Sat, 4 Oct 2008 18:07:15 -0400 Subject: [SciPy-dev] Using nose Message-ID: Hi, I have a particular pattern of tests I'd like to implement, and I don't know a good way to make nose carry it out. The pattern is this: I have a class with a setUp function that creates various variables (an array of test data, etc.). I also have a handful of test_whatever methods. This works fine. But now I'd like to apply exactly the same test methods to a different set of data. My first attempt was to simply create a derived class which overrode the setUp method, but unfortunately neither unittest nor nose will run tests defined in the parent class. Any suggestions? Thanks, Anne From alan.mcintyre at gmail.com Sat Oct 4 19:22:42 2008 From: alan.mcintyre at gmail.com (Alan McIntyre) Date: Sat, 4 Oct 2008 16:22:42 -0700 Subject: [SciPy-dev] Using nose In-Reply-To: References: Message-ID: <1d36917a0810041622r15db1d76h5669377960b30885@mail.gmail.com> On Sat, Oct 4, 2008 at 3:07 PM, Anne Archibald wrote: > I have a particular pattern of tests I'd like to implement, and I > don't know a good way to make nose carry it out. The pattern is this: > I have a class with a setUp function that creates various variables > (an array of test data, etc.). I also have a handful of test_whatever > methods. This works fine. But now I'd like to apply exactly the same > test methods to a different set of data. My first attempt was to > simply create a derived class which overrode the setUp method, but > unfortunately neither unittest nor nose will run tests defined in the > parent class. Anne, If I understand correctly, I think you might be able to use a variation of this technique: http://projects.scipy.org/scipy/numpy/wiki/TestingGuidelines#creating-many-similar-tests Please let me know if that doesn't work (or if the doc isn't clear on something). Cheers, Alan From matthew.brett at gmail.com Sat Oct 4 19:32:34 2008 From: matthew.brett at gmail.com (Matthew Brett) Date: Sat, 4 Oct 2008 16:32:34 -0700 Subject: [SciPy-dev] Using nose In-Reply-To: <1d36917a0810041622r15db1d76h5669377960b30885@mail.gmail.com> References: <1d36917a0810041622r15db1d76h5669377960b30885@mail.gmail.com> Message-ID: <1e2af89e0810041632j609ee5dcv1c1fda5b69811712@mail.gmail.com> Hi Anne, > If I understand correctly, I think you might be able to use a > variation of this technique: I am confident I don't understand correctly, but how about:

    data1 = [1]
    data2 = [2]
    datasets = [data1, data2]

    def func1(data):
        pass

    def func2(data):
        pass

    funcs = [func1, func2]

    # Nose test generator
    def test_datas():
        for func in funcs:
            for dataset in datasets:
                yield func, dataset

?
Matthew From peridot.faceted at gmail.com Sat Oct 4 20:42:59 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Sat, 4 Oct 2008 20:42:59 -0400 Subject: [SciPy-dev] Using nose In-Reply-To: <1d36917a0810041622r15db1d76h5669377960b30885@mail.gmail.com> References: <1d36917a0810041622r15db1d76h5669377960b30885@mail.gmail.com> Message-ID: 2008/10/4 Alan McIntyre : > On Sat, Oct 4, 2008 at 3:07 PM, Anne Archibald > wrote: >> I have a particular pattern of tests I'd like to implement, and I >> don't know a good way to make nose carry it out. The pattern is this: >> I have a class with a setUp function that creates various variables >> (an array of test data, etc.). I also have a handful of test_whatever >> methods. This works fine. But now I'd like to apply exactly the same >> test methods to a different set of data. My first attempt was to >> simply create a derived class which overrode the setUp method, but >> unfortunately neither unittest nor nose will run tests defined in the >> parent class. > > Anne, > > If I understand correctly, I think you might be able to use a > variation of this technique: > > http://projects.scipy.org/scipy/numpy/wiki/TestingGuidelines#creating-many-similar-tests > > Please let me know if that doesn't work (or if the doc isn't clear on > something). Aha! Thanks. This is pretty much what I tried to do, but there was a bug in my code and so I assumed that nose behaved the same as unittest in this. Anne From matthew.brett at gmail.com Sat Oct 4 21:04:49 2008 From: matthew.brett at gmail.com (Matthew Brett) Date: Sat, 4 Oct 2008 18:04:49 -0700 Subject: [SciPy-dev] SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: <6c17e6f50810030749p1966266bs23701e77f8c05b3a@mail.gmail.com> References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> <5b8d13220810011737x4b38a449ybe495dc7b961060a@mail.gmail.com> <9457e7c80810020041x485c1cb1w4581009630d53a6@mail.gmail.com> <1e2af89e0810020834t64e104b2oaf5abad1b07d7333@mail.gmail.com> <5b8d13220810022229s5523d3bex93355ed687c8d6ea@mail.gmail.com> <1e2af89e0810022254k3ff22324k39d656d6d0b0aa38@mail.gmail.com> <6c17e6f50810030749p1966266bs23701e77f8c05b3a@mail.gmail.com> Message-ID: <1e2af89e0810041804j411009o8024fd3b73bf4389@mail.gmail.com> Hi, On Fri, Oct 3, 2008 at 7:49 AM, Thouis (Ray) Jones wrote: > I've updated the patch to provide backwards compatibility for matlab > structures, controlled by a keyword argument to loadmat > (struct_as_record), which defaults to False (old style), and gives a > deprecation warning if False. I've applied this patch, with thanks, all tests pass for me too. David - any problems? 
Matthew From cournape at gmail.com Sun Oct 5 01:57:43 2008 From: cournape at gmail.com (David Cournapeau) Date: Sun, 5 Oct 2008 14:57:43 +0900 Subject: [SciPy-dev] SciPy-dev] [patch] read/write v5 .mat files with structs, cell arrays, objects, or function handles In-Reply-To: <1e2af89e0810041804j411009o8024fd3b73bf4389@mail.gmail.com> References: <6c17e6f50810010852r24d2064qdffd92b5b9796dd9@mail.gmail.com> <9457e7c80810011320q41b8cf3fw8ec95308bc0a6e89@mail.gmail.com> <5b8d13220810011737x4b38a449ybe495dc7b961060a@mail.gmail.com> <9457e7c80810020041x485c1cb1w4581009630d53a6@mail.gmail.com> <1e2af89e0810020834t64e104b2oaf5abad1b07d7333@mail.gmail.com> <5b8d13220810022229s5523d3bex93355ed687c8d6ea@mail.gmail.com> <1e2af89e0810022254k3ff22324k39d656d6d0b0aa38@mail.gmail.com> <6c17e6f50810030749p1966266bs23701e77f8c05b3a@mail.gmail.com> <1e2af89e0810041804j411009o8024fd3b73bf4389@mail.gmail.com> Message-ID: <5b8d13220810042257g12f57884td5abd4b18c1bd432@mail.gmail.com> On Sun, Oct 5, 2008 at 10:04 AM, Matthew Brett wrote: > > I've applied this patch, with thanks, all tests pass for me too. > David - any problems? Nope, works for me, David From peridot.faceted at gmail.com Sun Oct 5 02:08:51 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Sun, 5 Oct 2008 02:08:51 -0400 Subject: [SciPy-dev] Using nose In-Reply-To: <1d36917a0810041622r15db1d76h5669377960b30885@mail.gmail.com> References: <1d36917a0810041622r15db1d76h5669377960b30885@mail.gmail.com> Message-ID: 2008/10/4 Alan McIntyre : > http://projects.scipy.org/scipy/numpy/wiki/TestingGuidelines#creating-many-similar-tests > > Please let me know if that doesn't work (or if the doc isn't clear on > something). While we're on the subject of testing, what's the best way to run a (or some) tests and get dropped into the debugger when something goes wrong? With my own code and unittest there was a magic invocation (I think it was MyTestClass().debug()) that I could run from inside ipython with %pdb, so that when an exception was triggered I wound up in the debugger. What's the best way to arrange this when I'm dealing with debugging a module inside scipy? Thanks, Anne From robert.kern at gmail.com Sun Oct 5 02:12:58 2008 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 5 Oct 2008 01:12:58 -0500 Subject: [SciPy-dev] Using nose In-Reply-To: References: <1d36917a0810041622r15db1d76h5669377960b30885@mail.gmail.com> Message-ID: <3d375d730810042312w44a35986n3a7e59c3ad6123@mail.gmail.com> On Sun, Oct 5, 2008 at 01:08, Anne Archibald wrote: > While we're on the subject of testing, what's the best way to run a > (or some) tests and get dropped into the debugger when something goes > wrong? With my own code and unittest there was a magic invocation (I > think it was MyTestClass().debug()) that I could run from inside > ipython with %pdb, so that when an exception was triggered I wound up > in the debugger. What's the best way to arrange this when I'm dealing > with debugging a module inside scipy? nosetests --pdb --pdb-failures -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco From wnbell at gmail.com Sun Oct 5 02:13:29 2008 From: wnbell at gmail.com (Nathan Bell) Date: Sun, 5 Oct 2008 02:13:29 -0400 Subject: [SciPy-dev] Using nose In-Reply-To: References: <1d36917a0810041622r15db1d76h5669377960b30885@mail.gmail.com> Message-ID: On Sun, Oct 5, 2008 at 2:08 AM, Anne Archibald wrote: > > While we're on the subject of testing, what's the best way to run a > (or some) tests and get dropped into the debugger when something goes > wrong? With my own code and unittest there was a magic invocation (I > think it was MyTestClass().debug()) that I could run from inside > ipython with %pdb, so that when an exception was triggered I wound up > in the debugger. What's the best way to arrange this when I'm dealing > with debugging a module inside scipy? > Is this what you're looking for? $nosetests --pdb --pdb-failures scipy.something -- Nathan Bell wnbell at gmail.com http://graphics.cs.uiuc.edu/~wnbell/ From peridot.faceted at gmail.com Sun Oct 5 02:16:19 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Sun, 5 Oct 2008 02:16:19 -0400 Subject: [SciPy-dev] Using nose In-Reply-To: References: <1d36917a0810041622r15db1d76h5669377960b30885@mail.gmail.com> Message-ID: 2008/10/5 Nathan Bell : > On Sun, Oct 5, 2008 at 2:08 AM, Anne Archibald > wrote: >> >> While we're on the subject of testing, what's the best way to run a >> (or some) tests and get dropped into the debugger when something goes >> wrong? With my own code and unittest there was a magic invocation (I >> think it was MyTestClass().debug()) that I could run from inside >> ipython with %pdb, so that when an exception was triggered I wound up >> in the debugger. What's the best way to arrange this when I'm dealing >> with debugging a module inside scipy? >> > > Is this what you're looking for? > > $nosetests --pdb --pdb-failures scipy.something Perfect, thank you! Anne From peridot.faceted at gmail.com Sun Oct 5 04:56:37 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Sun, 5 Oct 2008 04:56:37 -0400 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: <48E797A0.3070604@aero.iitb.ac.in> References: <320fb6e00809300129j1b45ee21k6756b4c2b1945d84@mail.gmail.com> <48E31404.7000009@aero.iitb.ac.in> <48E3AAED.6080601@aero.iitb.ac.in> <48E76391.5050508@aero.iitb.ac.in> <48E797A0.3070604@aero.iitb.ac.in> Message-ID: 2008/10/4 Prabhu Ramachandran : > Anne Archibald wrote: >> 2008/10/4 Prabhu Ramachandran : >>> In any case, I am not sure there is a general need for this kind of data >>> structure. I need it a whole lot. If there is a general need I'll be >>> glad to try and extract any of my code and contribute it to scipy.spatial. >> >> There does seem to be a general need for a two-tree query that >> extracts (or just counts) all items of one tree within a ball centered >> at each item of another tree. You're right, too, that "in a ball" >> queries often don't naturally produce sorted results, or even known >> distances. But I think this has to be a different query. In fact, even >> for single-point queries it may make sense to have a separate query >> function for in-a-ball queries. > > True and another reason for a different approach would be performance. > According to Lee and Wong (1977, Acta Informatica) the worst case > performance for partial region searches on k-d trees is O(k N^{1-1/k}). > So if the dimensionality is small and the region size is known we could > do a lot better with other specialized algorithms for these cases.
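(For concreteness, the kind of specialized structure Prabhu describes for fixed-epsilon queries can be sketched as a uniform cell list: bin every point into cells of edge epsilon, so that all neighbors within epsilon of a query point lie in the 3**k cells around its own. The code below is only an illustrative sketch of that idea, not code from the spatial branch, and the function names are invented for the example:)

    from collections import defaultdict
    import numpy as np

    def build_cells(points, eps):
        # Bin each point index into a uniform grid with cell edge eps.
        cells = defaultdict(list)
        for i, p in enumerate(points):
            cells[tuple(np.floor(p / eps).astype(int))].append(i)
        return cells

    def neighbors_within_eps(points, cells, eps, i):
        # Exact neighbors of points[i] within eps: only the 3**k cells
        # around the query point's own cell are scanned, with no tree
        # traversal and no sorting of the results.
        p = points[i]
        base = np.floor(p / eps).astype(int)
        found = []
        for offset in np.ndindex(*([3] * len(p))):
            key = tuple(base + np.array(offset) - 1)
            for j in cells.get(key, []):
                if j != i and np.sum((points[j] - p) ** 2) <= eps ** 2:
                    found.append(j)
        return found

    # All-pairs epsilon-ball queries over every node, as in Prabhu's use case:
    pts = np.random.rand(1000, 2)
    cells = build_cells(pts, 0.05)
    neighbors = [neighbors_within_eps(pts, cells, 0.05, i) for i in range(len(pts))]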
I have added several in-a-ball queries to the kd-tree implementation in the spatial branch. Please let me know what you think. The API is a bit ad-hoc, since I can see a number of different ways one might want to use the code. In particular, some query functions do not currently return the actual distance, since one of the shortcuts is to include whole branches of the tree if their bounding box lies inside the ball of interest. I suspect that for many algorithms of interest it will be necessary to write a custom tree-walking algorithm. I'm not sure whether it makes sense to try to provide functions to support this. Anne From nwagner at iam.uni-stuttgart.de Sun Oct 5 05:51:39 2008 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Sun, 05 Oct 2008 11:51:39 +0200 Subject: [SciPy-dev] recent scipy.io Message-ID: ====================================================================== ERROR: Failure: ImportError (No module named byteordercodes) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/loader.py", line 364, in loadTestsFromName addr.filename, addr.module) File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/importer.py", line 39, in importFromPath return self.importFromDir(dir_path, fqname) File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/importer.py", line 84, in importFromDir mod = load_module(part_fqname, fh, filename, desc) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_byteordercodes.py", line 9, in import scipy.io.byteordercodes as sibc ImportError: No module named byteordercodes ====================================================================== FAIL: test_mio.test_load('double', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testdouble_6.1_SOL2.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testdouble_7.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testdouble_6.5.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testdouble_4.2c_SOL2.mat'], {'testdouble': array([[ 0. ], ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 57, in _check_level assert_array_almost_equal(actual, expected, err_msg=label, decimal=5) File "/usr/local/lib64/python2.5/site-packages/numpy/testing/utils.py", line 310, in assert_array_almost_equal header='Arrays are not almost equal') File "/usr/local/lib64/python2.5/site-packages/numpy/testing/utils.py", line 260, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not almost equal test double; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testdouble_6.1_SOL2.mat, variable testdouble (shapes (1, 9), (9, 1) mismatch) x: array([[ 0. , 0.78539816, 1.57079633, 2.35619449, 3.14159265, 3.92699082, 4.71238898, 5.49778714, 6.28318531]]) y: array([[ 0. ], [ 0.78539816], [ 1.57079633],... 
====================================================================== FAIL: test_mio.test_load('string', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststring_4.2c_SOL2.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststring_6.5.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststring_7.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststring_6.1_SOL2.mat'], {'teststring': u'"Do nine men interpret?" "Nine men," I nod.'}) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 60, in _check_level "Types %s and %s do not match at %s" % (typex, typac, label) AssertionError: Types and do not match at test string; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststring_4.2c_SOL2.mat, variable teststring ====================================================================== FAIL: test_mio.test_load('complex', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcomplex_4.2c_SOL2.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcomplex_6.1_SOL2.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcomplex_6.5.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcomplex_7.1_GLNX86.mat'], {'testcomplex': array([[ 1.000e+00 +0.000e+00j], ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 57, in _check_level assert_array_almost_equal(actual, expected, err_msg=label, decimal=5) File "/usr/local/lib64/python2.5/site-packages/numpy/testing/utils.py", line 310, in assert_array_almost_equal header='Arrays are not almost equal') File "/usr/local/lib64/python2.5/site-packages/numpy/testing/utils.py", line 260, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not almost equal test complex; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcomplex_4.2c_SOL2.mat, variable testcomplex (shapes (1, 9), (9, 1) mismatch) x: array([[ 1.00000000e+00 +0.00000000e+00j, 7.07106781e-01 +7.07106781e-01j, 6.12323400e-17 +1.00000000e+00j,... y: array([[ 1.00000000e+00 +0.00000000e+00j], [ 7.07106781e-01 +7.07106781e-01j], [ 6.12323400e-17 +1.00000000e+00j],... 
====================================================================== FAIL: test_mio.test_load('multi', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testmulti_4.2c_SOL2.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testmulti_7.1_GLNX86.mat'], {'a': array([[ 1., 2., 3., 4., 5.], ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 57, in _check_level assert_array_almost_equal(actual, expected, err_msg=label, decimal=5) File "/usr/local/lib64/python2.5/site-packages/numpy/testing/utils.py", line 310, in assert_array_almost_equal header='Arrays are not almost equal') File "/usr/local/lib64/python2.5/site-packages/numpy/testing/utils.py", line 260, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not almost equal test multi; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testmulti_4.2c_SOL2.mat, variable theta (shapes (1, 9), (9, 1) mismatch) x: array([[ 0. , 0.78539816, 1.57079633, 2.35619449, 3.14159265, 3.92699082, 4.71238898, 5.49778714, 6.28318531]]) y: array([[ 0. ], [ 0.78539816], [ 1.57079633],... ====================================================================== FAIL: test_mio.test_load('onechar', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testonechar_4.2c_SOL2.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testonechar_7.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testonechar_6.1_SOL2.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testonechar_6.5.1_GLNX86.mat'], {'testonechar': u'r'}) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 60, in _check_level "Types %s and %s do not match at %s" % (typex, typac, label) AssertionError: Types and do not match at test onechar; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testonechar_4.2c_SOL2.mat, variable testonechar ====================================================================== FAIL: test_mio.test_load('cell', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcell_6.1_SOL2.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcell_7.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcell_6.5.1_GLNX86.mat'], {'testcell': array([This cell contains this string and 3 arrays of increasing length, 1, [1 2], [1 2 3]], dtype=object)}) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File 
"/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 27, in _check_level assert len(expected) == len(actual), "Different list lengths at %s" % label AssertionError: Different list lengths at test cell; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcell_6.1_SOL2.mat, variable testcell ====================================================================== FAIL: test_mio.test_load('emptycell', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testemptycell_7.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testemptycell_6.5.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testemptycell_5.3_SOL2.mat'], {'testemptycell': array([1, 2, [], [], 3], dtype=object)}) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 27, in _check_level assert len(expected) == len(actual), "Different list lengths at %s" % label AssertionError: Different list lengths at test emptycell; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testemptycell_7.1_GLNX86.mat, variable testemptycell ====================================================================== FAIL: test_mio.test_load('stringarray', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststringarray_6.1_SOL2.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststringarray_7.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststringarray_6.5.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststringarray_4.2c_SOL2.mat'], {'teststringarray': array([one , two , three], dtype=object)}) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 30, in _check_level _check_level(level_label, ev, actual[i]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 60, in _check_level "Types %s and %s do not match at %s" % (typex, typac, label) AssertionError: Types and do not match at test stringarray; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststringarray_6.1_SOL2.mat, variable teststringarray, [0], ====================================================================== FAIL: test_mio.test_load('struct', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststruct_6.5.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststruct_7.1_GLNX86.mat', 
'/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststruct_6.1_SOL2.mat'], {'teststruct': array([ (u'Rats live on no evil star.', [1.4142135623730951, 2.7182818284590451, 3.1415926535897931], [(1.4142135623730951+1.4142135623730951j), (2.7182818284590451+2.7182818284590451j), (3.1415926535897931+3.1415926535897931j)])], ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 30, in _check_level _check_level(level_label, ev, actual[i]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 60, in _check_level "Types %s and %s do not match at %s" % (typex, typac, label) AssertionError: Types and do not match at test struct; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststruct_6.5.1_GLNX86.mat, variable teststruct, [0], ====================================================================== FAIL: test_mio.test_load('cellnest', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcellnest_6.5.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcellnest_7.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcellnest_6.1_SOL2.mat'], {'testcellnest': array([1, [2 3 [4 5]]], dtype=object)}) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 27, in _check_level assert len(expected) == len(actual), "Different list lengths at %s" % label AssertionError: Different list lengths at test cellnest; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testcellnest_6.5.1_GLNX86.mat, variable testcellnest ====================================================================== FAIL: test_mio.test_load('structnest', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststructnest_7.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststructnest_6.5.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststructnest_6.1_SOL2.mat'], {'teststructnest': array([[(1, [[(u'number 3',)]])]], ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 30, in _check_level _check_level(level_label, ev, actual[i]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 30, in _check_level 
_check_level(level_label, ev, actual[i]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 60, in _check_level "Types %s and %s do not match at %s" % (typex, typac, label) AssertionError: Types and do not match at test structnest; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststructnest_7.1_GLNX86.mat, variable teststructnest, [0], , [0], ====================================================================== FAIL: test_mio.test_load('structarr', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststructarr_6.1_SOL2.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststructarr_6.5.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststructarr_7.1_GLNX86.mat'], {'teststructarr': array([[(1, 2)], ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 27, in _check_level assert len(expected) == len(actual), "Different list lengths at %s" % label AssertionError: Different list lengths at test structarr; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/teststructarr_6.1_SOL2.mat, variable teststructarr ====================================================================== FAIL: test_mio.test_load('object', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testobject_7.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testobject_6.5.1_GLNX86.mat', '/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testobject_6.1_SOL2.mat'], {'testobject': }) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 41, in _check_level assert k in ac_fields, "Missing property at %s" % label AssertionError: Missing property at test object; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testobject_7.1_GLNX86.mat, variable testobject ====================================================================== FAIL: test_mio.test_load('unicode', ['/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testunicode_7.1_GLNX86.mat'], {'testunicode': u'Japanese: \n\u3059\u3079\u3066\u306e\u4eba\u9593\u306f\u3001\u751f\u307e\u308c\u306a\u304c\u3089\u306b\u3057\u3066\u81ea\u7531\u3067\u3042\u308a\u3001\n\u304b\u3064\u3001\u5c0a\u53b3\u3068\u6a29\u5229\u3068 \u306b\u3064\u3044\u3066\u5e73\u7b49\u3067\u3042\u308b\u3002\n\u4eba\u9593\u306f\u3001\u7406\u6027\u3068\u826f\u5fc3\u3068\u3092\u6388\u3051\u3089\u308c\u3066\u304a\u308a\u3001\n\u4e92\u3044\u306b\u540c\u80de\u306e\u7cbe\u795e\u3092\u3082\u3063\u3066\u884c\u52d5\u3057\u306a\u3051\u308c\u3070\u306a\u3089\u306a\u3044\u3002'}) 
---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 60, in _check_level "Types %s and %s do not match at %s" % (typex, typac, label) AssertionError: Types and do not match at test unicode; file /usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/data/testunicode_7.1_GLNX86.mat, variable testunicode ====================================================================== FAIL: test_mio.test_round_trip('string_round_trip', {'teststring': u'"Do nine men interpret?" "Nine men," I nod.'}, '4') ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 77, in _rt_check_case _check_case(name, [mat_stream], expected, struct_as_record=True) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 60, in _check_level "Types %s and %s do not match at %s" % (typex, typac, label) AssertionError: Types and do not match at test string_round_trip; file , variable teststring ====================================================================== FAIL: test_mio.test_round_trip('onechar_round_trip', {'testonechar': u'r'}, '4') ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/nose-0.10.1-py2.5.egg/nose/case.py", line 203, in runTest self.test(*self.arg) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 77, in _rt_check_case _check_case(name, [mat_stream], expected, struct_as_record=True) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 70, in _check_case _check_level(k_label, expected, matdict[k]) File "/usr/local/lib64/python2.5/site-packages/scipy/io/matlab/tests/test_mio.py", line 60, in _check_level "Types %s and %s do not match at %s" % (typex, typac, label) AssertionError: Types and do not match at test onechar_round_trip; file , variable testonechar ====================================================================== FAIL: test_imresize (test_pilutil.TestPILUtil) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/lib64/python2.5/site-packages/numpy/testing/decorators.py", line 82, in skipper return f(*args, **kwargs) File "/usr/local/lib64/python2.5/site-packages/scipy/misc/tests/test_pilutil.py", line 24, in test_imresize assert_equal(im1.shape,(11,22)) File "/usr/local/lib64/python2.5/site-packages/numpy/testing/utils.py", line 174, in assert_equal assert_equal(len(actual),len(desired),err_msg,verbose) File "/usr/local/lib64/python2.5/site-packages/numpy/testing/utils.py", line 183, in assert_equal raise AssertionError(msg) AssertionError: Items are not equal: ACTUAL: 0 DESIRED: 2 
---------------------------------------------------------------------- Ran 2342 tests in 36.958s FAILED (failures=17, errors=1) From matthew.brett at gmail.com Sun Oct 5 14:01:40 2008 From: matthew.brett at gmail.com (Matthew Brett) Date: Sun, 5 Oct 2008 11:01:40 -0700 Subject: [SciPy-dev] recent scipy.io In-Reply-To: References: Message-ID: <1e2af89e0810051101g6c61b0d2i358cd9e13d8db16f@mail.gmail.com> Hi, > ====================================================================== > ERROR: Failure: ImportError (No module named > byteordercodes) > ---------------------------------------------------------------------- etc - my fault - refactoring leak, I got too used to bzr. Thanks for the report, Matthew From alan.mcintyre at gmail.com Mon Oct 6 04:54:03 2008 From: alan.mcintyre at gmail.com (Alan McIntyre) Date: Mon, 6 Oct 2008 01:54:03 -0700 Subject: [SciPy-dev] PyArray_FromDims and friends In-Reply-To: <2CC40A34-8316-4F21-B73E-E3A7E1148AB7@yale.edu> References: <48A43348.2030006@netvision.net.il> <5C95B98C-5922-4D34-B3D3-B23C49E29300@yale.edu> <48A45D30.8040508@llnl.gov> <3FA4ADBE-DD0B-4143-9D32-AA2971263FE2@yale.edu> <2CC40A34-8316-4F21-B73E-E3A7E1148AB7@yale.edu> Message-ID: <1d36917a0810060154p4c55853cu9067fc22c6b352d1@mail.gmail.com> On Fri, Aug 15, 2008 at 10:26 AM, Zachary Pincus wrote: >> No, no, you can't just cast the pointer to a different type, it >> needs to point to npy_intp instead of int to start with, i.e., n has >> to be an array of npy_intp. > > Thanks for looking that over. This is of course what happens when I > try to deal with C too late at night. > > For cases where the dimension was 1, I had originally had:
>
>     [whatever] = (PyArrayObject *)PyArray_SimpleNew(1,&(npy_intp)n,PyArray_[whatever]);
>
> But I got compiler warnings to the tune of "The argument to & is not > strictly an lvalue. This will be a hard error in the future." Which > makes sense, (npy_intp)n isn't a proper lvalue... So I changed it to:
>
>     [whatever] = (PyArrayObject *)PyArray_SimpleNew(1, (npy_intp*)&n,PyArray_[whatever]);
>
> which, as you point out, is pretty daft. > > Is there a correct, portable, and warning-free way to do this in one > line? I'm guessing not. Anyhow, I'll fix the patch with a multi-line > solution:
>
>     npy_intp intp_n;
>     ...
>     intp_n = (npy_intp) n;
>     [whatever] = (PyArrayObject *)PyArray_SimpleNew(1,&intp_n,PyArray_[whatever]);
>
> Sorry for the noise, and thanks Chuck for the catch. I added a patch to #723 (http://scipy.org/scipy/scipy/ticket/723) that (I think) switches to use PyArray_SimpleNew instead of PyArray_FromDims. If nobody sees any problems with it I'll look it over again tomorrow or Tuesday and apply it. I'll try to get to switching out PyArray_FromDimsAndDataAndDescr later this week if nobody else gets around to it.
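(The portability issue being fixed above is easy to demonstrate from Python: the C int that PyArray_FromDims took and the npy_intp that PyArray_SimpleNew expects can differ in size, which is why the dimension has to be copied into an npy_intp variable rather than the pointer being cast. A small illustrative check, not part of the patch in ticket #723:)

    import numpy as np

    # np.intc mirrors the C 'int' of the old PyArray_FromDims signature;
    # np.intp mirrors npy_intp, which PyArray_SimpleNew expects for dims.
    print("C int:    %d bytes" % np.dtype(np.intc).itemsize)
    print("npy_intp: %d bytes" % np.dtype(np.intp).itemsize)
    # On a 64-bit build these print 4 and 8, so an int* reinterpreted as
    # npy_intp* would read 4 bytes of garbage beyond the original value.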
From Per.Brodtkorb at ffi.no Mon Oct 6 10:47:28 2008 From: Per.Brodtkorb at ffi.no (Per.Brodtkorb at ffi.no) Date: Mon, 6 Oct 2008 16:47:28 +0200 Subject: [SciPy-dev] Fwd: tests and partial patch for scipy.stats.distributions In-Reply-To: <1cd32cbb0810020629w56cf8e89t58954256e87bb29e@mail.gmail.com> References: <1cd32cbb0809282100x75eccfb9la109e92032f798ac@mail.gmail.com><1cd32cbb0809290551g7c00a5ddh44369352ece506d2@mail.gmail.com><1ED225FF18AA8B48AC192F7E1D032C6E0C5C1D@hbu-posten.ffi.no> <1cd32cbb0810020629w56cf8e89t58954256e87bb29e@mail.gmail.com> Message-ID: <1ED225FF18AA8B48AC192F7E1D032C6E0C5C26@hbu-posten.ffi.no> I have some suggestions for further improvement of distributions.py, which are listed in the following: When I import your scipy.stats.distributions into ipython, the doc-strings for the distributions are not updated correctly. To fix it I would suggest to replace this part in the .__init__ method of the rv_discrete and rv_continuous classes:

    if self.__doc__ is None:
        self.__doc__ = rv_discrete.__doc__
    else:
        self.__doc__ = self.__doc__.replace("A Generic",longname)

with

    if self.__doc__ is None:
        self.__doc__ = rv_discrete.__doc__
    if self.__doc__ is not None:
        self.__doc__ = self.__doc__.replace("A Generic",longname)

Replace ....

    goodargs = argsreduce(cond, *((q,)+args+(loc,)))
    loc, goodargs = goodargs[-1], goodargs[:-1]
    if [1 for i in goodargs if any(i)]:
        place(output,cond,(self._ppf(*goodargs) + loc).tolist())

with

    if any(cond):
        goodargs = argsreduce(cond, *((q,)+args+(loc,)))
        loc, goodargs = goodargs[-1], goodargs[:-1]
        place(output,cond,self._ppf(*goodargs) + loc)

(I also think this fix could be applied to cdf, ppf......) To fix the stats method of the rv_frozen class, replace it with:

    def stats(self,moments='mv'):
        kwds = self.kwds
        kwds.update({'moments':moments})
        return self.dist.stats(*self.args,**kwds)

In order to improve accuracy in calculation for small x values, I would also suggest to replace all calls to log(1-x) and 1-exp(-x) with log1p(-x) and -expm1(-x), respectively. The _isf method of the rv_discrete class is never called. In the isf method of the rv_discrete class replace place(output,cond,self._ppf(*goodargs) + loc) with place(output,cond,self._isf(*goodargs) + loc) Per A. -----Original Message----- From: scipy-dev-bounces at scipy.org [mailto:scipy-dev-bounces at scipy.org] On behalf of josef.pktd at gmail.com Sent: 2 October 2008 15:30 To: SciPy Developers List Subject: Re: [SciPy-dev] Fwd: tests and partial patch for scipy.stats.distributions I finally filed a ticket on this on scipy trac: http://scipy.org/scipy/scipy/ticket/745 On Wed, Oct 1, 2008 at 7:14 AM, wrote: > I have looked into the bugs in the patch of the discrete distributions you sent. The error in the _ppf method of dlaplace can be corrected by replacing it with:
>
>     class dlaplace_gen(rv_discrete):
>         .....
>         def _ppf(self, q, a):
>             const = 1.0/(1+exp(-a))
>             cons2 = 1+exp(a)
>             ind = q < const
>             return ceil(where(ind, log(q*cons2)/a-1, -log((1-q)*cons2)/a))
>         ......
>
Thanks, I didn't see the misplaced bracket, I changed my corrections to this. Do you have any idea or reference about the theoretical second moment of dlaplace? > The tests written for the generic moment fail for those distributions that set the limits (self.a and/or self.b) for the support of the distribution in the self._argcheck method. I think the generic_moment method is meant to be private to the self._munp method, since it does only limited input checks.
> Perhaps better to rename generic_moment to _generic_moment I agree, I didn't check this systematically, but I guess 'vec_generic_moment', 'vecentropy' are also private methods. > > Per A. I did not realize that some of my tests on private methods might fail for some distributions by design. To make it a bit clearer, I separated the tests into tests for public methods, and tests for private methods. I still like the tests of private methods, even if some are known failures, since they help to narrow down what the source of an exception or test failure in the public methods is. Josef _______________________________________________ Scipy-dev mailing list Scipy-dev at scipy.org http://projects.scipy.org/mailman/listinfo/scipy-dev From prabhu at aero.iitb.ac.in Mon Oct 6 16:05:30 2008 From: prabhu at aero.iitb.ac.in (Prabhu Ramachandran) Date: Tue, 07 Oct 2008 01:35:30 +0530 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: References: <320fb6e00809300129j1b45ee21k6756b4c2b1945d84@mail.gmail.com> <48E31404.7000009@aero.iitb.ac.in> <48E3AAED.6080601@aero.iitb.ac.in> <48E76391.5050508@aero.iitb.ac.in> <48E797A0.3070604@aero.iitb.ac.in> Message-ID: <48EA6F8A.2020104@aero.iitb.ac.in> Anne Archibald wrote: > 2008/10/4 Prabhu Ramachandran : > I have added several in-a-ball queries to the kd-tree implementation > in the spatial branch. Please let me know what you think. The API is a > bit ad-hoc, since I can see a number of different ways one might want > to use the code. In particular, some query functions do not currently > return the actual distance, since one of the shortcuts is to include > whole branches of the tree if their bounding box lies inside the ball I have only taken a cursory look and it looks very good to me. I like the general API and love the fact that you have tests for everything. I'm too swamped to try out the branch at the moment though. Thanks very much for doing this! > of interest. I suspect that for many algorithms of interest it will be > necessary to write a custom tree-walking algorithm. I'm not sure > whether it makes sense to try to provide functions to support this. I am also not sure about this. Once spatial is available in scipy proper perhaps we'll get more feedback on other things people are looking for. regards, prabhu From wnbell at gmail.com Mon Oct 6 16:17:50 2008 From: wnbell at gmail.com (Nathan Bell) Date: Mon, 6 Oct 2008 16:17:50 -0400 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: References: <48E31404.7000009@aero.iitb.ac.in> <48E3AAED.6080601@aero.iitb.ac.in> <48E76391.5050508@aero.iitb.ac.in> <48E797A0.3070604@aero.iitb.ac.in> Message-ID: On Sun, Oct 5, 2008 at 4:56 AM, Anne Archibald wrote: > > to use the code. In particular, some query functions do not currently > return the actual distance, since one of the shortcuts is to include > whole branches of the tree if their bounding box lies inside the ball
-- Nathan Bell wnbell at gmail.com http://graphics.cs.uiuc.edu/~wnbell/ From charlesr.harris at gmail.com Mon Oct 6 17:03:56 2008 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 6 Oct 2008 15:03:56 -0600 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: References: <48E31404.7000009@aero.iitb.ac.in> <48E3AAED.6080601@aero.iitb.ac.in> <48E76391.5050508@aero.iitb.ac.in> <48E797A0.3070604@aero.iitb.ac.in> Message-ID: On Sun, Oct 5, 2008 at 2:56 AM, Anne Archibald wrote: > 2008/10/4 Prabhu Ramachandran : > > Anne Archibald wrote: > >> 2008/10/4 Prabhu Ramachandran : > >>> In any case, I am not sure there is a general need for this kind of > data > >>> structure. I need it a whole lot. If there is a general need I'll be > >>> glad to try and extract any of my code and contribute it to > scipy.spatial. > >> > >> There does seem to be a general need for a two-tree query that > >> extracts (or just counts) all items of one tree within a ball centered > >> at each item of another tree. You're right, too, that "in a ball" > >> queries often don't naturally produce sorted results, or even known > >> distances. But I think this has to be a different query. In fact, even > >> for single-point queries it may make sense to have a separate query > >> function for in-a-ball queries. > > > > True and another reason for a different approach would be performance. > > According to Lee and Wong (1977, Acta Informatica) the worst case > > performance for partial region searches on k-d trees is O(k N^{1-1/k}). > > So if the dimensionality is small and the region size is known we could > > do a lot better with other specialized algorithms for these cases. > > I have added several in-a-ball queries to the kd-tree implementation > in the spatial branch. Please let me know what you think. The API is a > bit ad-hoc, since I can see a number of different ways one might want > to use the code. In particular, some query functions do not currently > return the actual distance, since one of the shortcuts is to include > whole branches of the tree if their bounding box lies inside the ball > of interest. I suspect that for many algorithms of interest it will be > necessary to write a custom tree-walking algorithm. I'm not sure > whether it makes sense to try to provide functions to support this. > I'm wondering if this might be a good place to add an equivalence relation also. An equivalence relation is a good way to do clustering based on distances, i.e., a.join(i,j) where i and j are objects creates a cluster that is the union of those containing i and j respectively. All the clustered elements can then be recovered using something like a.elements(i). Chuck From turian at gmail.com Wed Oct 8 17:35:08 2008 From: turian at gmail.com (Joseph Turian) Date: Wed, 8 Oct 2008 17:35:08 -0400 Subject: [SciPy-dev] Bug in scipy.weave.catalog.default_dir() Message-ID: <4dacb2560810081435v3a161811wa79a1b97fbd41edd@mail.gmail.com> I am using scipy.weave in an environment in which my home directory is not accessible. However, scipy.weave attempts to write to $HOME/.python25_compiled, even though I set PYTHONCOMPILED. Looking a little deeper, I see that scipy.weave.catalog.default_dir() is implemented incorrectly. The documentation says: "If for some reason it isn't possible to build a default directory in the user's home, /tmp/_pythonXX_compiled is used."
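(A sketch of the documented fallback Joseph is asking for might look like the following. This is illustrative only, not the actual scipy.weave.catalog code, and it ignores the PYTHONCOMPILED override for brevity:)

    import os
    import sys
    import tempfile

    def default_dir():
        # Prefer a per-user cache under $HOME, but fall back to /tmp when
        # the home directory is unset or not writable, as the docs promise.
        name = '.python%d%d_compiled' % sys.version_info[:2]
        home = os.environ.get('HOME')
        if home:
            path = os.path.join(home, name)
            try:
                if not os.path.exists(path):
                    os.makedirs(path)
                probe = tempfile.TemporaryFile(dir=path)  # writability check
                probe.close()
                return path
            except (OSError, IOError):
                pass  # home exists but is not writable; use /tmp instead
        path = os.path.join(tempfile.gettempdir(),
                            '_python%d%d_compiled' % sys.version_info[:2])
        if not os.path.exists(path):
            os.makedirs(path)
        return path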
What actually happens is that if $HOME is defined, $HOME/.python25_compiled is used and no exception is caught if writing fails. I would prefer that, as per the documentation, if $HOME/.python25_compiled is not writeable then the exception is caught and /tmp/_pythonXX_compiled is used instead. Could someone please implement this? It's a simple fix. Thanks, Joseph -- Academic: http://www-etud.iro.umontreal.ca/~turian/ Business: http://www.metaoptimize.com/ From peridot.faceted at gmail.com Thu Oct 9 00:38:34 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Thu, 9 Oct 2008 00:38:34 -0400 Subject: [SciPy-dev] cython in scipy? Message-ID: Hi, I've just been trying to clean up scipy.stats.vonmises.cdf: it's currently badly broken (tickets 747, 748, and 749). I can fix the outright bugs, but it also has distressingly large roundoff error under some circumstances. I'd like to simply implement the standard algorithm (Hill 1977) for evaluating it. Unfortunately, it involves evaluating a series with a number of terms that depends on the parameter, so vectorizing a python version (which is already written) is going to be a nuisance. On the other hand, it would be very easy to translate this into cython and get C performance out of it. So: is it okay to include cython code in scipy? What is the best way to go about this? Include the cython source in setup.py somehow? Check the generated C into svn and run cython by hand when I remember to? Thanks, Anne From cournape at gmail.com Thu Oct 9 00:43:08 2008 From: cournape at gmail.com (David Cournapeau) Date: Thu, 9 Oct 2008 13:43:08 +0900 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: Message-ID: <5b8d13220810082143x5484eb09nad978d4008ad7228@mail.gmail.com> Hi Anne, On Thu, Oct 9, 2008 at 1:38 PM, Anne Archibald wrote: >On the other hand, it would be very easy to > translate this into cython and get C performance out of it. So: is it > okay to include cython code in scipy? If there is a python version already (to serve as a reference), I don't think it would be a big problem to include cython source. > Check the > generated C into svn and run cython by hand when I remember to? Yes, that's already how it is done for numpy.random.mtrand. Cython is not yet packaged by so many distributions, so depending on it for building scipy is a bit premature IMHO. cheers, David From millman at berkeley.edu Thu Oct 9 00:51:37 2008 From: millman at berkeley.edu (Jarrod Millman) Date: Wed, 8 Oct 2008 21:51:37 -0700 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: Message-ID: On Wed, Oct 8, 2008 at 9:38 PM, Anne Archibald wrote: > I've just been trying to clean up scipy.stats.vonmises.cdf: it's > currently badly broken (tickets 747, 748, and 749). I can fix the > outright bugs, but it also has distressingly large roundoff error > under some circumstances. I'd like to simply implement the standard > algorithm (Hill 1977) for evaluating it. Unfortunately, it involves > evaluating a series with a number of terms that depends on the > parameter, so vectorizing a python version (which is already written) > is going to be a nuisance. On the other hand, it would be very easy to > translate this into cython and get C performance out of it. So: is it > okay to include cython code in scipy? What is the best way to go about > this? Include the cython source in setup.py somehow? Check the > generated C into svn and run cython by hand when I remember to? Writing Cython code is fine with me. I think you should check in the generated C, though. 
Thanks, -- Jarrod Millman Computational Infrastructure for Research Labs 10 Giannini Hall, UC Berkeley phone: 510.643.4014 http://cirl.berkeley.edu/ From peridot.faceted at gmail.com Thu Oct 9 01:30:37 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Thu, 9 Oct 2008 01:30:37 -0400 Subject: [SciPy-dev] tickets 747, 748, 749 can be closed Message-ID: Tickets 747, 748, 749 can be closed, but unfortunately I (peridot) don't seem to have permission to close them, in spite of having been the one who submitted them. I did put a "closes tickets ..." in the svn commit message, but I don't think there's an automatic ticket-closing gizmo. Anne From robert.kern at gmail.com Thu Oct 9 01:38:38 2008 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 9 Oct 2008 00:38:38 -0500 Subject: [SciPy-dev] tickets 747, 748, 749 can be closed In-Reply-To: References: Message-ID: <3d375d730810082238w483032a6wf302b67356366429@mail.gmail.com> On Thu, Oct 9, 2008 at 00:30, Anne Archibald wrote: > Tickets 747, 748, 749 can be closed, but unfortunately I (peridot) > don't seem to have permission to close them, in spite of having been > the one who submitted them. You now have such permissions. > I did put a "closes tickets ..." in the > svn commit message, but I don't think there's an automatic > ticket-closing gizmo. Correct. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From fperez.net at gmail.com Thu Oct 9 02:09:22 2008 From: fperez.net at gmail.com (Fernando Perez) Date: Wed, 8 Oct 2008 23:09:22 -0700 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: Message-ID: On Wed, Oct 8, 2008 at 9:38 PM, Anne Archibald wrote: > Hi, > > I've just been trying to clean up scipy.stats.vonmises.cdf: it's > currently badly broken (tickets 747, 748, and 749). I can fix the > outright bugs, but it also has distressingly large roundoff error > under some circumstances. I'd like to simply implement the standard > algorithm (Hill 1977) for evaluating it. Unfortunately, it involves > evaluating a series with a number of terms that depends on the > parameter, so vectorizing a python version (which is already written) > is going to be a nuisance. On the other hand, it would be very easy to > translate this into cython and get C performance out of it. So: is it > okay to include cython code in scipy? What is the best way to go about > this? Include the cython source in setup.py somehow? Check the > generated C into svn and run cython by hand when I remember to? This is the kind of very simple code I've used in the past (back in the pyrex days, same thing for cython):

    # Make this usable by people who don't have pyrex installed (I've committed
    # the generated C sources to SVN).
    try:
        from Pyrex.Distutils import build_ext
        has_pyrex = True
        cmdclass = {'build_ext': build_ext}
    except ImportError:
        has_pyrex = False
        cmdclass = {}

    # Set up key extension
    key_sources = ['mwadap/key.pyx']
    if not has_pyrex:
        key_sources = [s.replace('.pyx','.c') for s in key_sources]
    keyx = Extension('mwadap.key',key_sources,
                     include_dirs = [numpy_include_dir])

### Instead of triggering the use of the included C on the absence of Cython, you could just trigger on the absence of the C sources to fall back on the .pyx files (or even timestamp differences if you want).
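(The "timestamp differences" trigger mentioned above could be as small as the helper below; pyx_needs_rebuild is a hypothetical name, not part of the snippet:)

    import os

    def pyx_needs_rebuild(pyx_path, c_path):
        # Regenerate the C source when it is missing or older than the
        # .pyx, mirroring the .py/.pyc freshness check.
        if not os.path.exists(c_path):
            return True
        return os.path.getmtime(pyx_path) > os.path.getmtime(c_path)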
This way, you can just nuke the C files with a little utility script whenever you want and Cython will regenerate them for you at build time. Since they're checked into SVN, next time you commit, the updated version will go back up. The choice of whether to trigger the rebuild simply on timestamp differences (a la .py/.pyc) or by manual removal of the C is up to you, but the resulting workflow is pretty straightforward in any case. Cheers, f From peridot.faceted at gmail.com Thu Oct 9 03:40:04 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Thu, 9 Oct 2008 03:40:04 -0400 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: Message-ID: Okay, the cython code for the von Mises CDF is committed. It's actually a mongrel, since I needed scipy.stats.norm.cdf and scipy.special.i0 for one branch, and the only way I knew of to get them was through python, so I left that branch vectorized. Otherwise I think the cython version would have been shorter than the python version. The new CDF code also makes a minor change in the API: for values outside the domain -pi..pi, instead of returning cdf( (x+pi)%(2*pi)-pi ) it returns an increasing function. In particular, this means that the probability of getting a value between x and y is cdf(y)-cdf(x) whenever x and y differ by less than 2*pi. (Previously if x=3 and y=4 it gave you the probability minus 1.) This is valuable since the "location" parameter shifts the domain: in particular, the CDF is zero at "location" minus pi. 2008/10/9 Fernando Perez : > This is the kind of very simple code I've used in the past (back in > the pyrex days, same thing for cython): > > # Make this usable by people who don't have pyrex installed (I've committed > # the generated C sources to SVN). > try: > from Pyrex.Distutils import build_ext > has_pyrex = True > cmdclass = {'build_ext': build_ext} > except ImportError: > has_pyrex = False > cmdclass = {} > > # Set up key extension > key_sources = ['mwadap/key.pyx'] > if not has_pyrex: > key_sources = [s.replace('.pyx','.c') for s in key_sources] > keyx = Extension('mwadap.key',key_sources, > include_dirs = [numpy_include_dir]) > > ### I didn't get this working, I'm afraid; I wasn't willing to delve into the guts of numpy's distutils to figure out what was wrong, but it complained that I'd given it a ".pyx" file but I didn't have Pyrex installed. Anne From robince at gmail.com Thu Oct 9 03:58:15 2008 From: robince at gmail.com (Robin) Date: Thu, 9 Oct 2008 08:58:15 +0100 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: Message-ID: Hi On Thu, Oct 9, 2008 at 8:40 AM, Anne Archibald wrote: > I didn't get this working, I'm afraid; I wasn't willing to delve into > the guts of numpy's distutils to figure out what was wrong, but it > complained that I'd given it a ".pyx" file but I didn't have Pyrex > installed. There is a working cython example here: http://projects.scipy.org/scipy/numpy/browser/trunk/doc/cython/setup.py It might be as simple as changing Pyrex to Cython in the import. Cheers Robin From fperez.net at gmail.com Thu Oct 9 04:01:26 2008 From: fperez.net at gmail.com (Fernando Perez) Date: Thu, 9 Oct 2008 01:01:26 -0700 Subject: [SciPy-dev] cython in scipy?
In-Reply-To: References: Message-ID: On Thu, Oct 9, 2008 at 12:58 AM, Robin wrote: > Hi > > On Thu, Oct 9, 2008 at 8:40 AM, Anne Archibald > wrote: >> I didn't get this working, I'm afraid; I wasn't willing to delve into >> the guts of numpy's distutils to figure out what was wrong, but it >> complained that I'd given it a ".pyx" file but I didn't have Pyrex >> installed. > > There is a working cython example here: > http://projects.scipy.org/scipy/numpy/browser/trunk/doc/cython/setup.py > > It might be as simple as changing Pyrex to Cython in the import. Yes, sorry about that. I should have pointed to that example instead, since it's been updated to use Cython and would have avoided confusing Anne. Given how I wrote that code, no valid excuse to hide behind... Thanks for pointing it out and rescuing the idea :) Cheers, f From robert.kern at gmail.com Thu Oct 9 04:02:39 2008 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 9 Oct 2008 03:02:39 -0500 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: Message-ID: <3d375d730810090102x67b51428tdd7d662bdd3ca277@mail.gmail.com> On Thu, Oct 9, 2008 at 02:58, Robin wrote: > Hi > > On Thu, Oct 9, 2008 at 8:40 AM, Anne Archibald > wrote: >> I didn't get this working, I'm afraid; I wasn't willing to delve into >> the guts of numpy's distutils to figure out what was wrong, but it >> complained that I'd given it a ".pyx" file but I didn't have Pyrex >> installed. > > There is a working cython example here: > http://projects.scipy.org/scipy/numpy/browser/trunk/doc/cython/setup.py > > It might be as simple as changing Pyrex to Cython in the import. Not really. This isn't a standalone package where we can do things like that. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robince at gmail.com Thu Oct 9 04:06:02 2008 From: robince at gmail.com (Robin) Date: Thu, 9 Oct 2008 09:06:02 +0100 Subject: [SciPy-dev] cython in scipy? In-Reply-To: <3d375d730810090102x67b51428tdd7d662bdd3ca277@mail.gmail.com> References: <3d375d730810090102x67b51428tdd7d662bdd3ca277@mail.gmail.com> Message-ID: On Thu, Oct 9, 2008 at 9:02 AM, Robert Kern wrote: > On Thu, Oct 9, 2008 at 02:58, Robin wrote: >> It might be as simple as changing Pyrex to Cython in the import. > > Not really. This isn't a standalone package where we can do things like that. Sorry - I meant in Anne's non-working example that Fernando originally gave, the fix to get it working might be as simple as changing "from Pyrex.Distutils" to "from Cython.Distutils"... Not sure what you think I meant... :) Cheers Robin From robert.kern at gmail.com Thu Oct 9 04:12:27 2008 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 9 Oct 2008 03:12:27 -0500 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: <3d375d730810090102x67b51428tdd7d662bdd3ca277@mail.gmail.com> Message-ID: <3d375d730810090112y7e722687jcc3ab7885eaf715d@mail.gmail.com> On Thu, Oct 9, 2008 at 03:06, Robin wrote: > On Thu, Oct 9, 2008 at 9:02 AM, Robert Kern wrote: >> On Thu, Oct 9, 2008 at 02:58, Robin wrote: >>> It might be as simple as changing Pyrex to Cython in the import. >> >> Not really. This isn't a standalone package where we can do things like that. > > Sorry - I meant in Anne's non-working example that Fernando originally > gave, the fix to get it working might be as simple as changing "from > Pyrex.Distutils" to "from Cython.Distutils"... 
> > Not sure what you think I meant... :) I thought you were talking about Anne's actual problem, which is using Cython inside scipy.stats. The errors referring to Pyrex come from inside numpy.distutils, which already has code to handle .pyx files as Pyrex, not Cython files. Even if you fix the example, the actual problem isn't solvable like that. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From peridot.faceted at gmail.com Thu Oct 9 04:26:31 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Thu, 9 Oct 2008 04:26:31 -0400 Subject: [SciPy-dev] cython in scipy? In-Reply-To: <3d375d730810090112y7e722687jcc3ab7885eaf715d@mail.gmail.com> References: <3d375d730810090102x67b51428tdd7d662bdd3ca277@mail.gmail.com> <3d375d730810090112y7e722687jcc3ab7885eaf715d@mail.gmail.com> Message-ID: 2008/10/9 Robert Kern : > On Thu, Oct 9, 2008 at 03:06, Robin wrote: >> On Thu, Oct 9, 2008 at 9:02 AM, Robert Kern wrote: >>> On Thu, Oct 9, 2008 at 02:58, Robin wrote: >>>> It might be as simple as changing Pyrex to Cython in the import. >>> >>> Not really. This isn't a standalone package where we can do things like that. >> >> Sorry - I meant in Anne's non-working example that Fernando originally >> gave, the fix to get it working might be as simple as changing "from >> Pyrex.Distutils" to "from Cython.Distutils"... >> >> Not sure what you think I meant... :) > > I thought you were talking about Anne's actual problem, which is using > Cython inside scipy.stats. The errors referring to Pyrex come from > inside numpy.distutils, which already has code to handle .pyx files as > Pyrex, not Cython files. Even if you fix the example, the actual > problem isn't solvable like that. This was the conclusion I drew (having tried substituting Cython in the import), and why I gave up. What would be involved in fixing numpy.distutils to resolve this problem? I realize that it will be awkward because cython and pyrex use the same extension but are not compatible. Anne From robert.kern at gmail.com Thu Oct 9 04:36:55 2008 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 9 Oct 2008 03:36:55 -0500 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: <3d375d730810090102x67b51428tdd7d662bdd3ca277@mail.gmail.com> <3d375d730810090112y7e722687jcc3ab7885eaf715d@mail.gmail.com> Message-ID: <3d375d730810090136rcf7b98dsd05c6e36e4d4b6c3@mail.gmail.com> On Thu, Oct 9, 2008 at 03:26, Anne Archibald wrote: > 2008/10/9 Robert Kern : >> On Thu, Oct 9, 2008 at 03:06, Robin wrote: >>> On Thu, Oct 9, 2008 at 9:02 AM, Robert Kern wrote: >>>> On Thu, Oct 9, 2008 at 02:58, Robin wrote: >>>>> It might be as simple as changing Pyrex to Cython in the import. >>>> >>>> Not really. This isn't a standalone package where we can do things like that. >>> >>> Sorry - I meant in Anne's non-working example that Fernando originally >>> gave, the fix to get it working might be as simple as changing "from >>> Pyrex.Distutils" to "from Cython.Distutils"... >>> >>> Not sure what you think I meant... :) >> >> I thought you were talking about Anne's actual problem, which is using >> Cython inside scipy.stats. The errors referring to Pyrex come from >> inside numpy.distutils, which already has code to handle .pyx files as >> Pyrex, not Cython files. Even if you fix the example, the actual >> problem isn't solvable like that. 
> > This was the conclusion I drew (having tried substituting Cython in > the import), and why I gave up. What would be involved in fixing > numpy.distutils to resolve this problem? I realize that it will be > awkward because cython and pyrex use the same extension but are not > compatible. We could extend the API somehow. Configuration.add_cython_extension() or something like that. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From david at ar.media.kyoto-u.ac.jp Fri Oct 10 02:35:01 2008 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Fri, 10 Oct 2008 15:35:01 +0900 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: <3d375d730810090102x67b51428tdd7d662bdd3ca277@mail.gmail.com> <3d375d730810090112y7e722687jcc3ab7885eaf715d@mail.gmail.com> Message-ID: <48EEF795.3010901@ar.media.kyoto-u.ac.jp> Anne Archibald wrote: > > This was the conclusion I drew (having tried substituting Cython in > the import), and why I gave up. What would be involved in fixing > numpy.distutils to resolve this problem? I realize that it will be > awkward because cython and pyrex use the same extension but are not > compatible. Should be doable if you don't want to mix cython and pyrex in the same setup.py. FWIW, numscons has cython support, so this won't be a problem either. cheers, David From charlesr.harris at gmail.com Sat Oct 11 13:49:36 2008 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 11 Oct 2008 11:49:36 -0600 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: Message-ID: On Wed, Oct 8, 2008 at 10:51 PM, Jarrod Millman wrote: > On Wed, Oct 8, 2008 at 9:38 PM, Anne Archibald > wrote: > > I've just been trying to clean up scipy.stats.vonmises.cdf: it's > > currently badly broken (tickets 747, 748, and 749). I can fix the > > outright bugs, but it also has distressingly large roundoff error > > under some circumstances. I'd like to simply implement the standard > > algorithm (Hill 1977) for evaluating it. Unfortunately, it involves > > evaluating a series with a number of terms that depends on the > > parameter, so vectorizing a python version (which is already written) > > is going to be a nuisance. On the other hand, it would be very easy to > > translate this into cython and get C performance out of it. So: is it > > okay to include cython code in scipy? What is the best way to go about > > this? Include the cython source in setup.py somehow? Check the > > generated C into svn and run cython by hand when I remember to? > > Writing Cython code is fine with me. I think you should check in the > generated C, though. > It would probably be better to check in both so that the .pyx file could be used for fixes or a template for other functions. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From millman at berkeley.edu Sat Oct 11 14:36:06 2008 From: millman at berkeley.edu (Jarrod Millman) Date: Sat, 11 Oct 2008 11:36:06 -0700 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: Message-ID: On Sat, Oct 11, 2008 at 10:49 AM, Charles R Harris wrote: >> Writing Cython code is fine with me. I think you should check in the >> generated C, though. > > It would probably be better to check in both so that the .pyx file could be > used for fixes or a template for other functions. +1, I meant check in the generated C in addition to the cython code. 
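To make the build recipe concrete, here is a sketch of the fallback pattern shown earlier in the thread, adapted from Pyrex to Cython for a standalone package. The mypkg/kdtree names are placeholders, and, as Robert points out above, scipy itself cannot use this directly because numpy.distutils does its own .pyx handling:

    from distutils.core import setup
    from distutils.extension import Extension

    try:
        # Prefer Cython when it is installed...
        from Cython.Distutils import build_ext
        cmdclass = {'build_ext': build_ext}
        sources = ['mypkg/kdtree.pyx']
    except ImportError:
        # ...otherwise fall back to the generated C checked into svn.
        cmdclass = {}
        sources = ['mypkg/kdtree.c']

    setup(name='mypkg',
          cmdclass=cmdclass,
          ext_modules=[Extension('mypkg.kdtree', sources)])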
From peridot.faceted at gmail.com Sat Oct 11 19:37:37 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Sat, 11 Oct 2008 19:37:37 -0400 Subject: [SciPy-dev] cython in scipy? In-Reply-To: References: Message-ID: 2008/10/11 Jarrod Millman : > On Sat, Oct 11, 2008 at 10:49 AM, Charles R Harris > wrote: >>> Writing Cython code is fine with me. I think you should check in the >>> generated C, though. >> >> It would probably be better to check in both so that the .pyx file could be >> used for fixes or a template for other functions. > > +1, I meant check in the generated C in addition to the cython code. In the event, I checked in the C, the pyx, and the python it was based on. I may end up moving the code around, since the von Mises CDF is actually some sort of incomplete Bessel function and since if my code were in scipy.special it might be easier to get at compiled versions of i0 and erf (and therefore provide an entirely compiled function without the gruesome vectorization). Anne From peridot.faceted at gmail.com Sun Oct 12 09:43:13 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Sun, 12 Oct 2008 09:43:13 -0400 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: References: <48E31404.7000009@aero.iitb.ac.in> <48E3AAED.6080601@aero.iitb.ac.in> <48E76391.5050508@aero.iitb.ac.in> <48E797A0.3070604@aero.iitb.ac.in> Message-ID: 2008/10/6 Nathan Bell : > IMO speed for the common query types is more important than support > for sophisticated traversals. There is now a compiled kd-tree implementation in scipy.spatial. It is written in cython and based on the python implementation. It supports only optionally-bounded, optionally-approximate, k-nearest neighbor queries but runs without any per-point python code. It includes all the algorithmic optimizations described by the ANN authors (sliding midpoint subdivision, multiple-entry leaves, updating minimum-distance calculation, priority search, and short-circuit distance calculations). I think it's pretty good. The major feature it is missing, from what people have asked for, is an all-neighbors query. It's more or less impossible to traverse these trees from within python. I am considering a tree-walking API to make it easy to write algorithms that work on kd-trees efficiently, but it may make more sense to just let people use the python implementation for this. Anne From wnbell at gmail.com Sun Oct 12 14:44:42 2008 From: wnbell at gmail.com (Nathan Bell) Date: Sun, 12 Oct 2008 14:44:42 -0400 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: References: <48E3AAED.6080601@aero.iitb.ac.in> <48E76391.5050508@aero.iitb.ac.in> <48E797A0.3070604@aero.iitb.ac.in> Message-ID: On Sun, Oct 12, 2008 at 9:43 AM, Anne Archibald wrote: > > There is now a compiled kd-tree implementation in scipy.spatial. It is > written in cython and based on the python implementation. It supports > only optionally-bounded, optionally-approximate, k-nearest neighbor > queries but runs without any per-point python code. It includes all > the algorithmic optimizations described by the ANN authors (sliding > midpoint subdivision, multiple-entry leaves, updating minimum-distance > calculation, priority search, and short-circuit distance > calculations). I think it's pretty good. The major feature it is > missing, from what people have asked for, is an all-neighbors query. Very nice! Do you know offhand how much faster the cython implementation is for a typical case? 
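As a usage sketch of the query interface described earlier in the thread (the keyword names below follow the python implementation; they could differ slightly in the compiled version):

    import numpy as np
    from scipy.spatial import KDTree   # cKDTree is the compiled variant

    data = np.random.randn(10000, 3)   # 10000 points in 3 dimensions
    tree = KDTree(data)

    # 5 approximate (eps=0.1) nearest neighbours within distance 2.0 of a
    # query point; missing neighbours are reported with an infinite
    # distance and an invalid (out-of-range) index, as described above.
    d, i = tree.query(np.zeros(3), k=5, eps=0.1, distance_upper_bound=2.0)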
-- Nathan Bell wnbell at gmail.com http://graphics.cs.uiuc.edu/~wnbell/ From peridot.faceted at gmail.com Sun Oct 12 18:41:47 2008 From: peridot.faceted at gmail.com (Anne Archibald) Date: Sun, 12 Oct 2008 18:41:47 -0400 Subject: [SciPy-dev] [Numpy-discussion] Proposal: scipy.spatial In-Reply-To: References: <48E3AAED.6080601@aero.iitb.ac.in> <48E76391.5050508@aero.iitb.ac.in> <48E797A0.3070604@aero.iitb.ac.in> Message-ID: 2008/10/12 Nathan Bell : > On Sun, Oct 12, 2008 at 9:43 AM, Anne Archibald > wrote: >> >> There is now a compiled kd-tree implementation in scipy.spatial. It is >> written in cython and based on the python implementation. It supports >> only optionally-bounded, optionally-approximate, k-nearest neighbor >> queries but runs without any per-point python code. It includes all >> the algorithmic optimizations described by the ANN authors (sliding >> midpoint subdivision, multiple-entry leaves, updating minimum-distance >> calculation, priority search, and short-circuit distance >> calculations). I think it's pretty good. The major feature it is >> missing, from what people have asked for, is an all-neighbors query. > > Very nice! Do you know offhand how much faster the cython > implementation is for a typical case? I'm not sure what's "typical" - dimensionality certainly affects the performance of kd-trees - but it seems to be a speedup by a factor of 20-50: dimension 3, 10000 points KDTree constructed: 0.208447 cKDTree constructed: 0.00362301 KDTree 1000 lookups: 0.953508 cKDTree 1000 lookups: 0.0197051 flat cKDTree 1000 lookups: 1.46467 dimension 8, 10000 points KDTree constructed: 0.394341 cKDTree constructed: 0.00567818 KDTree 1000 lookups: 17.0404 cKDTree 1000 lookups: 0.374597 flat cKDTree 1000 lookups: 2.96095 dimension 16, 10000 points KDTree constructed: 0.29892 cKDTree constructed: 0.00891495 KDTree 100 lookups: 10.3783 cKDTree 100 lookups: 0.472985 flat cKDTree 100 lookups: 0.667156 These numbers are in seconds; the distribution is a mixture of normal distributions some moderate distance apart; and the "flat cKDTree" serves as a somewhat-optimized compiled brute-force search. Anne -------------- next part -------------- A non-text attachment was scrubbed... Name: kdtree_speed.py Type: text/x-python Size: 828 bytes Desc: not available URL: From dave.hirschfeld at gmail.com Tue Oct 14 13:52:02 2008 From: dave.hirschfeld at gmail.com (Dave) Date: Tue, 14 Oct 2008 17:52:02 +0000 (UTC) Subject: [SciPy-dev] Timeseries Unusual Behaviour Message-ID: Firstly, thanks for a great package; I find it very useful for my timeseries analysis at work. I have however just stumbled upon some unusual behaviour... It appears that the `date_array` function sorts the dates when constructing a DateArray instance. This bit me when I tried to compare the timestamps against another `values` array, so perhaps this behaviour should be mentioned in the docstring if it is indeed desirable. The `time_series` function magically appears to be able to handle the sorted `DateArray` when first called, but when called again with the same arguments it gets the order wrong. Is this reproducible? Is it a bug? Or am I just doing something stupid?! Thanks, Dave Python 2.5.2 (r252:60911, Feb 21 2008, 13:11:45) [MSC v.1310 32 bit (Intel)] IPython 0.9.1 -- An enhanced Interactive Python.
In [2]: import datetime In [3]: import scikits.timeseries as ts In [4]: ts.__version__ Out[4]: '0.67.dev' In [5]: numpy.__version__ Out[5]: '1.3.0.dev5859' In [6]: dates = (datetime.datetime(2011, 5, 1, 0, 0), ...: datetime.datetime(2011, 8, 1, 0, 0), ...: datetime.datetime(2011, 6, 1, 0, 0), ...: datetime.datetime(2011, 9, 1, 0, 0), ...: datetime.datetime(2011, 7, 1, 0, 0)) In [7]: values = rand(len(dates)) In [8]: zip(dates,values) Out[8]: [(datetime.datetime(2011, 5, 1, 0, 0), 0.68325143668413102), (datetime.datetime(2011, 8, 1, 0, 0), 0.5341201248082369), (datetime.datetime(2011, 6, 1, 0, 0), 0.419866941013046), (datetime.datetime(2011, 9, 1, 0, 0), 0.73489099683479775), (datetime.datetime(2011, 7, 1, 0, 0), 0.7040093508897648)] In [9]: date_array = ts.date_array(dlist=dates) In [10]: # INCORRECT ! In [11]: zip(date_array,values) Out[11]: [(<U : 01-May-2011>, 0.68325143668413102), (<U : 01-Jun-2011>, 0.5341201248082369), (<U : 01-Jul-2011>, 0.419866941013046), (<U : 01-Aug-2011>, 0.73489099683479775), (<U : 01-Sep-2011>, 0.7040093508897648)] In [12]: # CORRECT In [13]: timeseries = ts.time_series(values,dates=date_array) In [14]: zip(timeseries.dates,timeseries.data) Out[14]: [(<U : 01-May-2011>, 0.68325143668413102), (<U : 01-Jun-2011>, 0.419866941013046), (<U : 01-Jul-2011>, 0.7040093508897648), (<U : 01-Aug-2011>, 0.5341201248082369), (<U : 01-Sep-2011>, 0.73489099683479775)] In [15]: # INCORRECT !?! In [16]: timeseries = ts.time_series(values,dates=date_array) In [17]: zip(timeseries.dates,timeseries.data) Out[17]: [(<U : 01-May-2011>, 0.68325143668413102), (<U : 01-Jun-2011>, 0.5341201248082369), (<U : 01-Jul-2011>, 0.419866941013046), (<U : 01-Aug-2011>, 0.73489099683479775), (<U : 01-Sep-2011>, 0.7040093508897648)] In [18] From dpeterson at enthought.com Fri Oct 17 19:01:41 2008 From: dpeterson at enthought.com (Dave Peterson) Date: Fri, 17 Oct 2008 18:01:41 -0500 Subject: [SciPy-dev] ANNOUNCE: EPD with Py2.5 version 4.0.30002 RC2 available for testing Message-ID: <48F91955.70400@enthought.com> Hello, We've recently posted the RC2 build of EPD (the Enthought Python Distribution) with Python 2.5 version 4.0.30002 to the EPD website. You may download the RC from here: http://www.enthought.com/products/epdbeta.php You can check out the release notes here: https://svn.enthought.com/epd/wiki/Python2.5.2/4.0.300/RC2 Please help us test it out and provide feedback on the EPD Trac instance: https://svn.enthought.com/epd or via e-mail to epd-support at enthought.com. If everything goes well, we are planning a final release for this coming Tuesday, October 21st. About EPD --------- The Enthought Python Distribution (EPD) is a "kitchen-sink-included" distribution of the Python™ Programming Language, including over 60 additional tools and libraries. The EPD bundle includes NumPy, SciPy, IPython, 2D and 3D visualization, database adapters, and a lot of other tools right out of the box. http://www.enthought.com/products/epd.php It is currently available as a single-click installer for Windows XP (x86), Mac OS X (a universal binary for OS X 10.4 and above), and RedHat 3 and 4 (x86 and amd64). EPD is free for academic use. An annual subscription and installation support are available for individual commercial use. An enterprise subscription with support for particular deployment environments is also available for commercial purchase. -- Dave From philbinj at gmail.com Sat Oct 18 11:56:52 2008 From: philbinj at gmail.com (James Philbin) Date: Sat, 18 Oct 2008 16:56:52 +0100 Subject: [SciPy-dev] scipy.sparse.dok_matrix changes Message-ID: <2b1c8c4f0810180856q2ef0d97cqf5defe1cd1aae173@mail.gmail.com> Hi, I've recently been using dok_matrix a fair bit and have made a few improvements, mainly: i. Improved the docstring. ii.
Fixed the constructor (this fixes #755). iii. Reduced the codepath for the common case of setting an individual element (~2x speedup). There's more work to do on dok_matrix, but this patch might be useful for some people: --- Index: scipy/sparse/tests/test_base.py =================================================================== --- scipy/sparse/tests/test_base.py (revision 4801) +++ scipy/sparse/tests/test_base.py (working copy) @@ -1162,8 +1162,27 @@ except: caught += 1 assert_equal(caught,5) + + def test_ctor(self): + caught = 0 + # Empty ctor + try: + A = dok_matrix() + except TypeError, e: + caught+=1 + assert_equal(caught, 1) + + # Dense ctor + b = matrix([[1,0,0,0],[0,0,1,0],[0,2,0,3]],'d') + A = dok_matrix(b) + assert_equal(A.todense(), b) + # Sparse ctor + c = csr_matrix(b) + assert_equal(A.todense(), c.todense()) + + class TestLIL( _TestCommon, _TestHorizSlicing, _TestVertSlicing, _TestBothSlicing, _TestGetSet, _TestSolve, _TestArithmetic, _TestInplaceArithmetic, Index: scipy/sparse/dok.py =================================================================== --- scipy/sparse/dok.py (revision 4801) +++ scipy/sparse/dok.py (working copy) @@ -14,45 +14,65 @@ class dok_matrix(spmatrix, dict): """Dictionary Of Keys based matrix. This is an efficient - structure for constructing sparse matrices + structure for constructing sparse matrices incrementally. + + This can be instantiated in several ways: + dok_matrix(D) + with a dense matrix, D + + dok_matrix(S) + with a sparse matrix, S + + dok_matrix((M,N), [dtype]) + create the matrix with initial shape (M,N) + dtype is optional, defaulting to dtype='d' + + Notes + ----- + Allows for efficient O(1) access of individual elements. + Duplicates are not allowed. + Can be efficiently converted to a coo_matrix once constructed. + + Examples + -------- + >>> from scipy.sparse import * + >>> from scipy import * + >>> S = dok_matrix((5,5), dtype=float32) + >>> for i in range(5): + ... for j in range(5): + ... S[i,j] = i+j # Update element + """ - def __init__(self, A=None, shape=None, dtype=None, copy=False): - """ Create a new dictionary-of-keys sparse matrix. An optional - argument A is accepted, which initializes the dok_matrix with it. - This can be a tuple of dimensions (M, N) or a (dense) array - to copy.
- """ - #TODO deprecate argument A in favor of arg1 style - + + def __init__(self, arg1, shape=None, dtype=None, copy=False): dict.__init__(self) spmatrix.__init__(self) - self.dtype = getdtype(dtype, A, default=float) - if A is not None: - if isinstance(A, tuple): - # Interpret as dimensions - if not isshape(A): - raise TypeError, "dimensions must be a 2-tuple of positive"\ - " integers" - self.shape = A - elif isspmatrix(A): - if isspmatrix_dok(A) and copy: - A = A.copy() - else: - A = A.todok() - self.update( A ) - self.shape = A.shape - self.dtype = A.dtype + + self.dtype = getdtype(dtype, default=float) + if isinstance(arg1, tuple) and isshape(arg1): # (M,N) + M, N = arg1 + self.shape = (M, N) + elif isspmatrix(arg1): # Sparse ctor + if isspmatrix_dok(arg1) and copy: + arg1 = arg1.copy() else: - #must be dense, convert to COO first, then to DOK - try: - A = asarray(A) - except: - raise ValueError, "unrecognized form for" \ - " %s_matrix constructor" % self.format - from coo import coo_matrix - self.update( coo_matrix(A).todok() ) - self.shape = A.shape - self.dtype = A.dtype + arg1 = arg1.todok() + self.update(arg1) + self.shape = arg1.shape + self.dtype = arg1.dtype + else: # Dense ctor + try: + arg1 = asarray(arg1) + except: + raise TypeError('invalid input format') + + if len(arg1.shape)!=2: + raise TypeError('expected rank <=2 dense array or matrix') + + from coo import coo_matrix + self.update( coo_matrix(arg1).todok() ) + self.shape = arg1.shape + self.dtype = arg1.dtype def getnnz(self): return dict.__len__(self) @@ -90,7 +110,7 @@ # Bounds checking if isintlike(i): - if i < 0: + if i < 0: i += self.shape[0] if i < 0 or i >= self.shape[0]: raise IndexError, "index out of bounds" @@ -177,13 +197,10 @@ def __setitem__(self, key, value): try: - assert len(key) == 2 - except (AssertionError, TypeError): - raise TypeError, "index must be a pair of integers, slices, or" \ - " sequences" - i, j = key + i, j = key + except (ValueError, TypeError): + raise TypeError, "index must be a pair of integers or slices" - # First deal with the case where both i and j are integers if isintlike(i) and isintlike(j): if i < 0: @@ -193,20 +210,14 @@ if i < 0 or i >= self.shape[0] or j < 0 or j >= self.shape[1]: raise IndexError, "index out of bounds" - if isintlike(value) and value == 0: - if key in self.keys(): # get rid of it something already there - del self[key] + + if isscalar(value): + if value==0: + del self[(i,j)] + else: + dict.__setitem__(self, (i,j), self.dtype.type(value)) else: - # Ensure value is a single element, not a sequence - if isinstance(value, float) or isintlike(value) or \ - isinstance(value, complex): - dict.__setitem__(self, (i,j), self.dtype.type(value)) - newrows = max(self.shape[0], int(key[0])+1) - newcols = max(self.shape[1], int(key[1])+1) - self.shape = (newrows, newcols) - else: - raise TypeError, "cannot set matrix element to non-scalar" - return # done + raise TypeError, "cannot set matrix element to a non-scalar" else: # Either i or j is a slice, sequence, or invalid. If i is a slice # or sequence, unfold it first and call __setitem__ recursively. --- Thanks, James From mxhf at gmx.net Sun Oct 19 05:37:29 2008 From: mxhf at gmx.net (Maximilian Fabricius) Date: Sun, 19 Oct 2008 11:37:29 +0200 Subject: [SciPy-dev] parameter control in scipy.optimize.leastsq Message-ID: <88b277e60810190237k15f27343xeb27793ea13d856b@mail.gmail.com> Hello, I have been using scipy for about one and a half years now and have been extremely pleased. 
In this time I have made extensive use of scipy.optimize.leastsq. While I am generally fairly happy with its performance, I think it does lack one important feature which I am used to from similar routines in other languages. (Please excuse me if I have just missed that feature so far.) I would like to be able to control which parameters actually are fitted. For example I would like to be able to do leastsq( residuals, p0, args=(y), fit=([True,True,False,True]) ) where the parameter "fit" would tell leastsq to leave the parameter p0[2] untouched while fitting the others. Right now I have to modify my source code to change the parameters which are actually fit. I'd be happy to participate and start getting my hands dirty if someone would point out where to start. Regards, Maximilian From robince at gmail.com Sun Oct 19 06:06:32 2008 From: robince at gmail.com (Robin) Date: Sun, 19 Oct 2008 11:06:32 +0100 Subject: [SciPy-dev] parameter control in scipy.optimize.leastsq In-Reply-To: <88b277e60810190237k15f27343xeb27793ea13d856b@mail.gmail.com> References: <88b277e60810190237k15f27343xeb27793ea13d856b@mail.gmail.com> Message-ID: On Sun, Oct 19, 2008 at 10:37 AM, Maximilian Fabricius wrote: > Hello, > > I have been using scipy for about one and a half years now and have > been extremely pleased. > > In this time I have made extensive use of scipy.optimize.leastsq. > While I am generally fairly happy > with its performance, I think it does lack one important feature > which I am used to from similar > routines in other languages. (Please excuse me if I have just missed that > feature so far.) > > I would like to be able to control which parameters actually are > fitted. For example I would like to be able to >> do >> leastsq( residuals, p0, args=(y), fit=([True,True,False,True]) ) >> >> where the parameter "fit" would tell leastsq to leave the parameter >> p0[2] untouched while fitting the others. > > I think you can do this by 'wrapping' your existing function: > > Something like: > > fit_function = lambda x, fixed: residuals(x[0],x[1],fixed[0],x[2]) > > Then you can call leastsq with the fixed argument specified > leastsq( fit_function, p0, args=(y,fixed)) > > You can either have the fixed parameters passed as an argument to the > lambda, or have them defined where the lambda is run (but I find it a > bit confusing because the scoping gets funny). I'm sure you could do > something to get the [True, False] type notation you want - but since > you will have to specify the fixed values as well. > > Robin > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://projects.scipy.org/mailman/listinfo/scipy-dev > Robin, thank you for the quick reply. Indeed, but this would still involve swapping parameters around between fixed and non-fixed and changing the lambda function manually rather than just switching the fitting on and off with a single variable. But no doubt, a wrapper can be done. Thanks! Maximilian From robince at gmail.com Sun Oct 19 06:52:24 2008 From: robince at gmail.com (Robin) Date: Sun, 19 Oct 2008 11:52:24 +0100 Subject: [SciPy-dev] parameter control in scipy.optimize.leastsq In-Reply-To: <88b277e60810190328l3731c71ds91b8030ca9547ba1@mail.gmail.com> References: <88b277e60810190237k15f27343xeb27793ea13d856b@mail.gmail.com> <88b277e60810190328l3731c71ds91b8030ca9547ba1@mail.gmail.com> Message-ID: On Sun, Oct 19, 2008 at 11:28 AM, Maximilian Fabricius wrote: > Robin, > > thank you for the quick reply. > Indeed, but this would still involve swapping parameters around between > fixed and non-fixed and > changing the lambda function manually rather than just switching the > fitting on and off > with a single variable. > > But no doubt, a wrapper can be done.
> I couldn't think of a way to squeeze it into a single lambda > statement, but something like this would do: > > def fit_function(x, fit, y): > p0 = zeros(fit.size) > p0[fit] = x > p0[~fit] = y > > where fit is a boolean array, x is the ones to be optimised and y are the > fixed values. > You have the extra overhead of the p0 array allocation for every > iteration of the function - but perhaps that is worth it for the > convenience. > > Robin > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://projects.scipy.org/mailman/listinfo/scipy-dev > Robin, great, nice idea! As this may be of use to others, my solution now is: def fit_function(p_fit, fit, p_fixed, ... ): # assemble p_fit and p_fixed into a single array # while restoring the correct order of the original p0 array. p = zeros(fit.size) p[ where(fit == True) ] = p_fit p[ where(fit == False) ] = p_fixed # calls the original residuals routine return residuals(p, ... ) def main(): # best guesses p0 = zeros(10) p0[0] = vrot_d p0[1] = vsig_d .... # choose which parameters to fit fit = zeros(len(p0)) fit[0:7] = True fit[8] = False fit[9] = False # store parameters which should be fit in p_fit # all the others in p_fixed p_fit = p0[ where(fit == True) ] p_fixed = p0[ where(fit == False) ] plsq = leastsq(fit_function, p_fit, args=(fit, p_fixed, ...) ) Thanks! Cheers, Maximilian From robince at gmail.com Sun Oct 19 11:40:17 2008 From: robince at gmail.com (Robin) Date: Sun, 19 Oct 2008 16:40:17 +0100 Subject: [SciPy-dev] parameter control in scipy.optimize.leastsq In-Reply-To: <88b277e60810190828w43418c02i8969c31b597f4ef8@mail.gmail.com> References: <88b277e60810190237k15f27343xeb27793ea13d856b@mail.gmail.com> <88b277e60810190328l3731c71ds91b8030ca9547ba1@mail.gmail.com> <88b277e60810190828w43418c02i8969c31b597f4ef8@mail.gmail.com> Message-ID: On Sun, Oct 19, 2008 at 4:28 PM, Maximilian Fabricius wrote: > Robin, > > great, nice idea! As this may be of use to others, my solution now is: > > > def fit_function(p_fit, fit, p_fixed, ... ): > # assemble p_fit and p_fixed into a single array > # while restoring the correct order of the original p0 array. > p = zeros(fit.size) > p[ where(fit == True) ] = p_fit > p[ where(fit == False) ] = p_fixed > # calls the original residuals routine > return residuals(p, ... ) > > def main(): > # best guesses > p0 = zeros(10) > p0[0] = vrot_d > p0[1] = vsig_d > .... > > # choose which parameters to fit > fit = zeros(len(p0)) > fit[0:7] = True > fit[8] = False > fit[9] = False > > # store parameters which should be fit in p_fit > # all the others in p_fixed > p_fit = p0[ where(fit == True) ] > p_fixed = p0[ where(fit == False) ] > plsq = leastsq(fit_function, p_fit, args=(fit, p_fixed, ...) ) > > Thanks! Just want to point out that if fit is a boolean array (i.e. fit = array([True,False,False,True]) etc.) then you can index with it directly p0[fit] instead of p0[where(fit==True)] and p0[~fit] instead of p0[where(fit==False)] which is likely to be a bit quicker. Doesn't matter outside, but in the fit_function you probably want to save any time you can since it will be run many times.
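Putting the pieces of this thread together, here is a self-contained sketch of the whole pattern; the exponential model and all the numbers are made up purely for illustration:

    import numpy as np
    from scipy.optimize import leastsq

    def model(p, x):
        return p[0] * np.exp(-p[1] * x) + p[2]

    x = np.linspace(0, 4, 50)
    y = model([2.5, 1.3, 0.5], x) + 0.05 * np.random.randn(50)

    p0 = np.array([2.0, 1.0, 0.5])         # initial guesses
    fit = np.array([True, True, False])    # hold p0[2] fixed

    def residuals(p_fit, fit, p_fixed, x, y):
        p = np.empty(fit.size)
        p[fit] = p_fit                     # parameters being optimised
        p[~fit] = p_fixed                  # parameters held fixed
        return y - model(p, x)

    p_best, ier = leastsq(residuals, p0[fit], args=(fit, p0[~fit], x, y))

Toggling which parameters are fitted then only requires changing the boolean fit array, which is the convenience Maximilian was asking for.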
Robin From benny.malengier at gmail.com Sun Oct 19 13:44:04 2008 From: benny.malengier at gmail.com (Benny Malengier) Date: Sun, 19 Oct 2008 19:44:04 +0200 Subject: [SciPy-dev] parameter control in scipy.optimize.leastsq In-Reply-To: <88b277e60810190828w43418c02i8969c31b597f4ef8@mail.gmail.com> References: <88b277e60810190237k15f27343xeb27793ea13d856b@mail.gmail.com> <88b277e60810190328l3731c71ds91b8030ca9547ba1@mail.gmail.com> <88b277e60810190828w43418c02i8969c31b597f4ef8@mail.gmail.com> Message-ID: To avoid array creation overhead, I use objects encapsulating everything, so def fit_function(self, x, ... ): # x is a subset of all the parameters; self.tofit is a bool array marking the args to fit self.p[self.tofit] = x self.p[~self.tofit] = self.all_arg[~self.tofit] # calls the original residuals routine return residuals(self.p, ... ) Then in your loop, you can change the self.tofit array to change what you fit. Obviously, self.all_arg defaults to something in the __init__() method and self.p is created once there, and things can be changed. If things are slow, the line self.p[~self.tofit] = self.all_arg[~self.tofit] can be removed and put into its own method, only to be called if self.tofit changes. Benny 2008/10/19 Maximilian Fabricius > On Sun, Oct 19, 2008 at 12:52 PM, Robin wrote: > > On Sun, Oct 19, 2008 at 11:28 AM, Maximilian Fabricius > wrote: > >> Robin, > >> > >> thank you for the quick reply. > >> Indeed, but this would still involve swapping parameters around between > >> fixed and non-fixed and > >> changing the lambda function manually rather than just switching the > >> fitting on and off > >> with a single variable. > >> > >> But no doubt, a wrapper can be done. > > > > I couldn't think of a way to squeeze it into a single lambda > > statement, but something like this would do: > > > > def fit_function(x, fit, y): > > p0 = zeros(fit.size) > > p0[fit] = x > > p0[~fit] = y > > > > where fit is a boolean array, x is the ones to be optimised and y are the > > fixed values. > > You have the extra overhead of the p0 array allocation for every > > iteration of the function - but perhaps that is worth it for the > > convenience. > > > > Robin > > _______________________________________________ > > Scipy-dev mailing list > > Scipy-dev at scipy.org > > http://projects.scipy.org/mailman/listinfo/scipy-dev > > > > > Robin, > > great, nice idea! As this may be of use to others, my solution now is: > > > def fit_function(p_fit, fit, p_fixed, ... ): > # assemble p_fit and p_fixed into a single array > # while restoring the correct order of the original p0 array. > p = zeros(fit.size) > p[ where(fit == True) ] = p_fit > p[ where(fit == False) ] = p_fixed > # calls the original residuals routine > return residuals(p, ... ) > > def main(): > # best guesses > p0 = zeros(10) > p0[0] = vrot_d > p0[1] = vsig_d > .... > > # choose which parameters to fit > fit = zeros(len(p0)) > fit[0:7] = True > fit[8] = False > fit[9] = False > > # store parameters which should be fit in p_fit > # all the others in p_fixed > p_fit = p0[ where(fit == True) ] > p_fixed = p0[ where(fit == False) ] > plsq = leastsq(fit_function, p_fit, args=(fit, p_fixed, ...) ) > > Thanks! > > Cheers, > > Maximilian > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://projects.scipy.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed...
URL: From wnbell at gmail.com Sun Oct 19 23:18:45 2008 From: wnbell at gmail.com (Nathan Bell) Date: Sun, 19 Oct 2008 23:18:45 -0400 Subject: [SciPy-dev] scipy.sparse.dok_matrix changes In-Reply-To: <2b1c8c4f0810180856q2ef0d97cqf5defe1cd1aae173@mail.gmail.com> References: <2b1c8c4f0810180856q2ef0d97cqf5defe1cd1aae173@mail.gmail.com> Message-ID: On Sat, Oct 18, 2008 at 11:56 AM, James Philbin wrote: > > I've recently been using dok_matrix a fair bit and have made a few > improvements, mainly: > i. Improved the docstring. > ii. Fixed the constructor (this fixes #755). > iii. Reduced the codepath for the common case of setting an individual > element (~2x speedup). > Thanks James. I've merged your patch in r4802: http://projects.scipy.org/scipy/scipy/changeset/4802 The dok_matrix class is a neglected part of scipy.sparse, so your contribution is most appreciated. -- Nathan Bell wnbell at gmail.com http://graphics.cs.uiuc.edu/~wnbell/ From pgmdevlist at gmail.com Mon Oct 20 01:56:04 2008 From: pgmdevlist at gmail.com (Pierre GM) Date: Mon, 20 Oct 2008 01:56:04 -0400 Subject: [SciPy-dev] Timeseries Unusual Behaviour In-Reply-To: References: Message-ID: <200810200156.04924.pgmdevlist@gmail.com> David, Sorry for the delayed answer. * When you create a DateArray from a series of dates using `date_array` (the recommended way), the dates will automatically be sorted in chronological order, but the original order is stored temporarily in a private attribute. * When you create a time series using `time_series`, the same thing happens: 1. a DateArray is created from the list of dates with `date_array` 2. it is sorted chronologically and the initial order is stored temporarily 3. the values are then sorted chronologically, 4. the attribute storing the initial order of the dates is reset to None. The reason behind the automatic sorting is that in a huge majority of cases, one works with chronological series. Now, let's go through your example: >>> dates = (datetime.datetime(2011, 5, 1, 0, 0), ... datetime.datetime(2011, 8, 1, 0, 0), ... datetime.datetime(2011, 6, 1, 0, 0), ... datetime.datetime(2011, 9, 1, 0, 0), ... datetime.datetime(2011, 7, 1, 0, 0)) >>> values = [1, 4, 2, 5, 3] >>> d1 = ts.date_array(dates) d1 is sorted chronologically by default. >>> zip(d1, values) [(<U : 01-May-2011>, 1), (<U : 01-Jun-2011>, 4), (<U : 01-Jul-2011>, 2), (<U : 01-Aug-2011>, 5), (<U : 01-Sep-2011>, 3)] Here, we're mixing d1 (sorted) and values (unsorted). >>> series = ts.time_series(values,dates=d1) timeseries([1 2 3 4 5], dates = [01-May-2011 01-Jun-2011 01-Jul-2011 01-Aug-2011 01-Sep-2011], freq = U) d1 is sorted chronologically, values is sorted according to the initial order of d1, the private flag of d1 is reset to show it has been sorted. >>> series = ts.time_series(values,dates=d1) timeseries([1 4 2 5 3], dates = [01-May-2011 01-Jun-2011 01-Jul-2011 01-Aug-2011 01-Sep-2011], freq = U) d1 had already been sorted and its flag reset, so the values are *not* sorted. I agree that the last line is so confusing that it should be considered a bug. I'll try to find some workaround. Meanwhile, a couple of pieces of advice: * As time_series calls date_array anyway, you don't gain a lot by creating a DateArray from your dates in the first place. On the contrary, it can lead to the surprises we just discussed: just use the dates instead.
* Get into the habit of always specifying a frequency; it will save you some headaches in the long run when converting between frequencies. From mxhf at gmx.net Mon Oct 20 03:09:40 2008 From: mxhf at gmx.net (Maximilian Fabricius) Date: Mon, 20 Oct 2008 09:09:40 +0200 Subject: [SciPy-dev] parameter control in scipy.optimize.leastsq In-Reply-To: References: <88b277e60810190237k15f27343xeb27793ea13d856b@mail.gmail.com> <88b277e60810190328l3731c71ds91b8030ca9547ba1@mail.gmail.com> <88b277e60810190828w43418c02i8969c31b597f4ef8@mail.gmail.com> Message-ID: <88b277e60810200009w4d3efb25t7ca4f13fdeb5b18e@mail.gmail.com> On Sun, Oct 19, 2008 at 5:40 PM, Robin wrote: > On Sun, Oct 19, 2008 at 4:28 PM, Maximilian Fabricius wrote: >> Robin, >> >> great, nice idea! As this may be of use to others, my solution now is: >> >> >> def fit_function(p_fit, fit, p_fixed, ... ): >> # assemble p_fit and p_fixed into a single array >> # while restoring the correct order of the original p0 array. >> p = zeros(fit.size) >> p[ where(fit == True) ] = p_fit >> p[ where(fit == False) ] = p_fixed >> # calls the original residuals routine >> return residuals(p, ... ) >> >> def main(): >> # best guesses >> p0 = zeros(10) >> p0[0] = vrot_d >> p0[1] = vsig_d >> .... >> >> # choose which parameters to fit >> fit = zeros(len(p0)) >> fit[0:7] = True >> fit[8] = False >> fit[9] = False >> >> # store parameters which should be fit in p_fit >> # all the others in p_fixed >> p_fit = p0[ where(fit == True) ] >> p_fixed = p0[ where(fit == False) ] >> plsq = leastsq(fit_function, p_fit, args=(fit, p_fixed, ...) ) >> >> Thanks! > > Just want to point out that if fit is a boolean array (i.e. fit = > array([True,False,False,True]) etc.) then you can index with it > directly p0[fit] instead of p0[where(fit==True)] and p0[~fit] instead > of p0[where(fit==False)] which is likely to be a bit quicker. Doesn't > matter outside, but in the fit_function you probably want to save any > time you can since it will be run many times. > > Robin Robin, thank you. Indeed, I should know that. Python sometimes is overwhelmingly convenient. Maximilian From simon.caschili at gmail.com Mon Oct 20 10:44:38 2008 From: simon.caschili at gmail.com (Simone Caschili) Date: Mon, 20 Oct 2008 16:44:38 +0200 Subject: [SciPy-dev] Genetic Algorithm documentation Message-ID: <256b49550810200744n40cfd26eq8631228f0c1b8a64@mail.gmail.com> Hello, I am thinking of using scipy to implement a genetic algorithm project, but I have not found any documentation about it. Could you suggest where I can find it? Thank you very much. Simone From philbinj at gmail.com Mon Oct 20 11:07:01 2008 From: philbinj at gmail.com (James Philbin) Date: Mon, 20 Oct 2008 16:07:01 +0100 Subject: [SciPy-dev] scipy.sparse.dok_matrix changes In-Reply-To: References: <2b1c8c4f0810180856q2ef0d97cqf5defe1cd1aae173@mail.gmail.com> Message-ID: <2b1c8c4f0810200807s73bc5e3cm1a122a1879401c2c@mail.gmail.com> Hi, > Thanks James. I've merged your patch in r4802: > http://projects.scipy.org/scipy/scipy/changeset/4802 Good stuff. I'd like to make some more changes/cleanups, but there are a few things I'm not sure about. Firstly, I'd like to remove the functions 'conjtranspose', 'take' + 'split'. All of them seem non-standard and can be expressed in other ways. If functions are removed, should they be deprecated first? I'm not sure what the policy is on a pre 1.0 release. Secondly, using __getitem__ with slices currently requires an O(nnz) crawl over all the values. Should we support this usage?
My preference would be to only allow elementwise access, though for __setitem__ using slices is fine. James From wnbell at gmail.com Mon Oct 20 12:58:48 2008 From: wnbell at gmail.com (Nathan Bell) Date: Mon, 20 Oct 2008 12:58:48 -0400 Subject: [SciPy-dev] scipy.sparse.dok_matrix changes In-Reply-To: <2b1c8c4f0810200807s73bc5e3cm1a122a1879401c2c@mail.gmail.com> References: <2b1c8c4f0810180856q2ef0d97cqf5defe1cd1aae173@mail.gmail.com> <2b1c8c4f0810200807s73bc5e3cm1a122a1879401c2c@mail.gmail.com> Message-ID: On Mon, Oct 20, 2008 at 11:07 AM, James Philbin wrote: > Good stuff. I'd like to make some more changes/cleanups, but there are a > few things I'm not sure about. > > Firstly, I'd like to remove the functions 'conjtranspose', 'take' + > 'split'. All of them seem non-standard and can be expressed in other > ways. If functions are removed, should they be deprecated first? I'm > not sure what the policy is on a pre 1.0 release. There's no fixed policy, but I have tended to deprecate for one minor release before removing functionality. For instance: http://projects.scipy.org/scipy/scipy/browser/trunk/scipy/sparse/base.py#L579 > Secondly, using __getitem__ with slices currently requires an O(nnz) > crawl over all the values. Should we support this usage? My preference > would be to only allow elementwise access, though for __setitem__ > using slices is fine. Again, no fixed policy. For costly operations on CSR/CSC matrices, we've used the SparseEfficiencyWarning: http://projects.scipy.org/scipy/scipy/browser/trunk/scipy/sparse/compressed.py#L621 Ideally all get/set operations would be symmetric. However, we haven't maintained this property elsewhere in scipy.sparse, so users will not expect it to be so. Since you're the de facto dok_matrix maintainer, I'd say it's ultimately up to you :) -- Nathan Bell wnbell at gmail.com http://graphics.cs.uiuc.edu/~wnbell/ From nwagner at iam.uni-stuttgart.de Mon Oct 20 12:59:44 2008 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Mon, 20 Oct 2008 18:59:44 +0200 Subject: [SciPy-dev] Genetic Algorithm documentation In-Reply-To: <256b49550810200744n40cfd26eq8631228f0c1b8a64@mail.gmail.com> References: <256b49550810200744n40cfd26eq8631228f0c1b8a64@mail.gmail.com> Message-ID: On Mon, 20 Oct 2008 16:44:38 +0200 "Simone Caschili" wrote: > Hello, > > I am thinking of using scipy to implement a genetic > algorithm project, > but I have not found any documentation about it. > > Could you suggest where I can find it? > http://scipy.org/scipy/scikits/browser/trunk/learn/scikits/learn/machine/ga Nils From simon.caschili at gmail.com Mon Oct 20 13:14:20 2008 From: simon.caschili at gmail.com (Simone Caschili) Date: Mon, 20 Oct 2008 19:14:20 +0200 Subject: [SciPy-dev] Genetic Algorithm documentation Message-ID: <256b49550810201014j707d498ct36d54bdbc67f3be8@mail.gmail.com> Thanks Nils for your answer. I had already found that documentation, but I would like to get a tutorial or official documentation about the scipy Genetic Algorithm implementation. Could you recommend any official documentation/tutorial? Simone > > On Mon, 20 Oct 2008 16:44:38 +0200 > "Simone Caschili" wrote: >> Hello, >> >> I am thinking of using scipy to implement a genetic >> algorithm project, >> but I have not found any documentation about it. >> >> Could you suggest where I can find it?
>> > http://scipy.org/scipy/scikits/browser/trunk/learn/scikits/learn/machine/ga > > Nils From travis at enthought.com Tue Oct 21 17:53:51 2008 From: travis at enthought.com (Travis Vaught) Date: Tue, 21 Oct 2008 16:53:51 -0500 Subject: [SciPy-dev] ANN: Enthought Python Distribution - New Release Message-ID: <02DFC911-B86B-4F92-A83F-1FB7AADAB98B@enthought.com> Greetings, Enthought, Inc. is very pleased to announce the newest release of the Enthought Python Distribution (EPD) Py2.5 v4.0.30002: http://www.enthought.com/epd This release contains updates to many of EPD's packages, including NumPy, IPython, matplotlib, VTK, etc. This is also the first release to include a 3.x version of the Enthought Tool Suite (http://code.enthought.com/ ). The release notes for this release, including the list of included packages, may be found here: https://svn.enthought.com/epd/wiki/Python2.5.2/4.0.300/GA Many thanks to the EPD team for putting this release together, and to the community of folks who have provided all of the valuable tools bundled here. Best Regards, Travis --------- About EPD --------- The Enthought Python Distribution (EPD) is a "kitchen-sink-included" distribution of the Python™ Programming Language, including over 80 additional tools and libraries. The EPD bundle includes NumPy, SciPy, IPython, 2D and 3D visualization, database adapters, and a lot of other tools right out of the box. http://www.enthought.com/products/epd.php It is currently available as an easy, single-click installer for Windows XP (x86), Mac OS X (a universal binary for Intel 10.4 and above) and RedHat EL3 (x86 and amd64). EPD is free for 30-day trial use and for use in degree-granting academic institutions. An annual Subscription and installation support are available for commercial use (http://www.enthought.com/products/epddownload.php ) including an Enterprise Subscription with support for particular deployment environments (http://www.enthought.com/products/enterprise.php ). From david at ar.media.kyoto-u.ac.jp Wed Oct 22 02:28:52 2008 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Wed, 22 Oct 2008 15:28:52 +0900 Subject: [SciPy-dev] Should scipy 0.7 be compatible with the 1.1 series at all? Message-ID: <48FEC824.5060204@ar.media.kyoto-u.ac.jp> Hi, When building scipy against numpy 1.2.0, there are quite a few deprecation warnings related to recent changes in the C API. Should we update the code for 1.2.0, or shall we keep backward compatibility with the 1.1.x series? cheers, David From alan.mcintyre at gmail.com Wed Oct 22 03:01:31 2008 From: alan.mcintyre at gmail.com (Alan McIntyre) Date: Wed, 22 Oct 2008 00:01:31 -0700 Subject: [SciPy-dev] Should scipy 0.7 be compatible with the 1.1 series at all? In-Reply-To: <48FEC824.5060204@ar.media.kyoto-u.ac.jp> References: <48FEC824.5060204@ar.media.kyoto-u.ac.jp> Message-ID: <1d36917a0810220001g720535e3t9a79ce2668562f24@mail.gmail.com> On Tue, Oct 21, 2008 at 11:28 PM, David Cournapeau wrote: > When building scipy against numpy 1.2.0, there are quite a few > deprecation warnings related to recent changes in the C API. Should we > update the code for 1.2.0, or shall we keep backward compatibility with > the 1.1.x series? There's a patch at http://scipy.org/scipy/scipy/ticket/723 to fix some of the PyArray_* warnings. I don't have the time or knowledge of the Fortran side of Sci/NumPy to figure out how to avoid using the temporary variables that Charles doesn't like, though.
From robert.kern at gmail.com  Wed Oct 22 09:12:42 2008
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 22 Oct 2008 08:12:42 -0500
Subject: [SciPy-dev] Should scipy 0.7 be compatible with the 1.1 series at all?
In-Reply-To: <48FEC824.5060204@ar.media.kyoto-u.ac.jp>
References: <48FEC824.5060204@ar.media.kyoto-u.ac.jp>
Message-ID: <3d375d730810220612x52953cf4q2575c5cf496d0f8a@mail.gmail.com>

On Wed, Oct 22, 2008 at 01:28, David Cournapeau wrote:
> Hi,
>
> When building scipy against numpy 1.2.0, there are quite a few
> deprecation warnings related to recent changes in the C API. Should we
> update the code for 1.2.0, or shall we keep backward compatibility with
> the 1.1.x series?

There is no backwards compatibility with 1.1.x to keep. scipy 0.7-svn
has required numpy 1.2-svn for some time.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco

From david at ar.media.kyoto-u.ac.jp  Wed Oct 22 09:20:44 2008
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Wed, 22 Oct 2008 22:20:44 +0900
Subject: [SciPy-dev] Should scipy 0.7 be compatible with the 1.1 series at all?
In-Reply-To: <3d375d730810220612x52953cf4q2575c5cf496d0f8a@mail.gmail.com>
References: <48FEC824.5060204@ar.media.kyoto-u.ac.jp>
	<3d375d730810220612x52953cf4q2575c5cf496d0f8a@mail.gmail.com>
Message-ID: <48FF28AC.6040808@ar.media.kyoto-u.ac.jp>

Robert Kern wrote:
>
> There is no backwards compatibility with 1.1.x to keep. scipy 0.7-svn
> has required numpy 1.2-svn for some time.

Great, thanks for the information. Would it be OK to add a guard against
using numpy < 1.2 when importing scipy in the trunk?

David

From robert.kern at gmail.com  Wed Oct 22 09:37:32 2008
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 22 Oct 2008 08:37:32 -0500
Subject: [SciPy-dev] Should scipy 0.7 be compatible with the 1.1 series at all?
In-Reply-To: <48FF28AC.6040808@ar.media.kyoto-u.ac.jp>
References: <48FEC824.5060204@ar.media.kyoto-u.ac.jp>
	<3d375d730810220612x52953cf4q2575c5cf496d0f8a@mail.gmail.com>
	<48FF28AC.6040808@ar.media.kyoto-u.ac.jp>
Message-ID: <3d375d730810220637j3428750ala594520b0eb30717@mail.gmail.com>

On Wed, Oct 22, 2008 at 08:20, David Cournapeau wrote:
> Robert Kern wrote:
>>
>> There is no backwards compatibility with 1.1.x to keep. scipy 0.7-svn
>> has required numpy 1.2-svn for some time.
>
> Great, thanks for the information. Would it be OK to add a guard against
> using numpy < 1.2 when importing scipy in the trunk?

You can issue a warning, but I would prefer that we not absolutely
prevent it.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco
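An import-time check of the kind discussed here might look like the
following (a sketch only, warning rather than refusing to import, per
Robert's preference; the message text is an assumption):

    import warnings
    import numpy

    # Naive major/minor comparison; adequate for version strings like
    # '1.2.0' or '1.2.0.dev5587', which is all that matters here.
    if tuple(int(p) for p in numpy.__version__.split('.')[:2]) < (1, 2):
        warnings.warn("scipy 0.7 requires numpy >= 1.2.0 (found %s)"
                      % numpy.__version__)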
From pgmdevlist at gmail.com  Wed Oct 22 14:52:20 2008
From: pgmdevlist at gmail.com (Pierre GM)
Date: Wed, 22 Oct 2008 14:52:20 -0400
Subject: [SciPy-dev] Timeseries Unusual Behaviour
In-Reply-To:
References:
Message-ID: <200810221452.20605.pgmdevlist@gmail.com>

Dave,
The unusual behavior you documented (trying to create 2 series with the
same values and same DateArray as dates giving different answers
depending on the order) should now be fixed on SVN.
Thanks again for reporting.

From charlesr.harris at gmail.com  Thu Oct 23 01:12:55 2008
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 22 Oct 2008 23:12:55 -0600
Subject: [SciPy-dev] Should scipy 0.7 be compatible with the 1.1 series at all?
In-Reply-To: <1d36917a0810220001g720535e3t9a79ce2668562f24@mail.gmail.com>
References: <48FEC824.5060204@ar.media.kyoto-u.ac.jp>
	<1d36917a0810220001g720535e3t9a79ce2668562f24@mail.gmail.com>
Message-ID:

On Wed, Oct 22, 2008 at 1:01 AM, Alan McIntyre wrote:
> On Tue, Oct 21, 2008 at 11:28 PM, David Cournapeau wrote:
> > When building scipy against numpy 1.2.0, there are quite a few
> > deprecation warnings related to recent changes in the C API. [...]
>
> There's a patch at http://scipy.org/scipy/scipy/ticket/723 to fix some
> of the PyArray_* warnings. I don't have the time or knowledge of the
> Fortran side of Sci/NumPy to figure out how to avoid using the
> temporary variables that Charles doesn't like, though. ;)

You could put a fixme comment on them. Mostly I don't want the
workarounds to hide the need for later cleanups.

Chuck

From alan.mcintyre at gmail.com  Thu Oct 23 02:05:03 2008
From: alan.mcintyre at gmail.com (Alan McIntyre)
Date: Wed, 22 Oct 2008 23:05:03 -0700
Subject: [SciPy-dev] Should scipy 0.7 be compatible with the 1.1 series at all?
In-Reply-To:
References: <48FEC824.5060204@ar.media.kyoto-u.ac.jp>
	<1d36917a0810220001g720535e3t9a79ce2668562f24@mail.gmail.com>
Message-ID: <1d36917a0810222305h6d4c0c18w928cfd4caf5f4408@mail.gmail.com>

On Wed, Oct 22, 2008 at 10:12 PM, Charles R Harris wrote:
>> There's a patch at http://scipy.org/scipy/scipy/ticket/723 to fix some
>> of the PyArray_* warnings. [...]
>
> You could put a fixme comment on them. Mostly I don't want the
> workarounds to hide the need for later cleanups.

Sure thing; is there a standard fixme comment for SciPy/NumPy? I will
probably get around to doing this over the weekend.

From dave.hirschfeld at gmail.com  Thu Oct 23 09:12:39 2008
From: dave.hirschfeld at gmail.com (Dave)
Date: Thu, 23 Oct 2008 13:12:39 +0000 (UTC)
Subject: [SciPy-dev] Timeseries Unusual Behaviour
References: <200810221452.20605.pgmdevlist@gmail.com>
Message-ID:

Pierre GM writes:
>
> Dave,
> The unusual behavior you documented (trying to create 2 series with the
> same values and same DateArray as dates giving different answers
> depending on the order) should now be fixed on SVN.
> Thanks again for reporting.

Thanks for the explanation and the fix. I've updated from svn and
noticed the autosort parameter, which does make its behaviour more
explicit :)

Cheers,
Dave

From opossumnano at gmail.com  Thu Oct 23 12:04:02 2008
From: opossumnano at gmail.com (Tiziano Zito)
Date: Thu, 23 Oct 2008 18:04:02 +0200
Subject: [SciPy-dev] integration of symeig in SciPy
Message-ID: <20081023160402.GA3443@localhost>

Dear SciPy devs,

there has been some discussion about integrating symeig in SciPy.
symeig is a Python wrapper for the LAPACK functions to solve the
standard and generalized eigenvalue problems for symmetric (hermitian)
positive definite matrices.
It is superior to the scipy "linalg.eigh" wrapper because it wraps
*all* relevant LAPACK routines, making it possible, for example, to
return just a subset of all eigenvectors/eigenvalues, which is a killer
feature if you are dealing with huge non-sparse matrices. Some of these
LAPACK routines are available in scipy.lib.lapack.flapack and can be
accessed through scipy.lib.lapack.get_lapack_funcs. Some of them are
still missing in scipy, while symeig offers a unified interface to all
relevant LAPACK functions. symeig has been downloaded more than 1300
times since its first public appearance on sourceforge (under the
mdp-toolkit package) in 2004. It features a complete unit test and a
telling doc-string.

I am one of the authors of symeig and have been contacted by some scipy
devs to discuss the integration of symeig in scipy. I am more than
willing to help doing this. The most difficult part (the LGPL license)
has been addressed already: I've re-issued symeig under a BSD license.
Next step would be to adapt symeig code and tests to the scipy
framework. Some questions to you:

- where would you like to have symeig? (I would suggest scipy.linalg)

- should we change the signature to match or resemble that of other
  eigenproblem-related functions in scipy?

- is there a responsible scipy dev for the linalg package I can bother
  with any questions I may have in the process?

- do I get svn commit rights or should I work on a local copy and
  send a patch?

let me know,

tiziano

From robert.kern at gmail.com  Thu Oct 23 12:33:57 2008
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 23 Oct 2008 11:33:57 -0500
Subject: [SciPy-dev] integration of symeig in SciPy
In-Reply-To: <20081023160402.GA3443@localhost>
References: <20081023160402.GA3443@localhost>
Message-ID: <3d375d730810230933w511046dfrba829f2434824211@mail.gmail.com>

On Thu, Oct 23, 2008 at 11:04, Tiziano Zito wrote:
> Dear SciPy devs,
>
> there has been some discussion about integrating symeig in SciPy.
> symeig is a Python wrapper for the LAPACK functions to solve the
> standard and generalized eigenvalue problems for symmetric (hermitian)
> positive definite matrices. [...] Next step would be to adapt symeig
> code and tests to the scipy framework. Some questions to you:
>
> - where would you like to have symeig? (I would suggest scipy.linalg)

Yup.

> - should we change the signature to match or resemble that of other
>   eigenproblem-related functions in scipy?

Probably.
> - is there a responsible scipy dev for the linalg package I can bother
>   with any questions I may have in the process?

You can bother the list.

> - do I get svn commit rights or should I work on a local copy and
>   send a patch?

We'll get you commit rights.

Thanks! I'm looking forward to seeing this in scipy.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco

From stefan at sun.ac.za  Fri Oct 24 03:54:59 2008
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Fri, 24 Oct 2008 09:54:59 +0200
Subject: [SciPy-dev] [SciPy-user] SciPy Sprint Weekend: 1 and 2 November
In-Reply-To: <4C4FE880-8830-4153-83CF-7565E5EA4646@cs.toronto.edu>
References: <9457e7c80810231422m6cab2e75wbcb343472449312@mail.gmail.com>
	<4C4FE880-8830-4153-83CF-7565E5EA4646@cs.toronto.edu>
Message-ID: <9457e7c80810240054o3f00cc5fp7528a65af10cc2c7@mail.gmail.com>

2008/10/24 David Warde-Farley :
> I think you mean 2008? ;)

Yes, of course!

Event: SciPy Sprint
Date: 1, 2 November 2008
Find us on: irc.freenode.net in channel #scipy

I hope that all of you can join us. Nathan has put a lot of work into
sparse matrices, David has been working on build systems, other David
improved special functions, we have a brand new spatial module by Anne,
other other David is documenting ndimage, Damian implemented
hierarchical clustering and Tiziano is contributing generalised
eigenproblems.

This is going to be a very useful release -- we must just get it out
the door!

Cheers
Stéfan

From cimrman3 at ntc.zcu.cz  Fri Oct 24 04:31:19 2008
From: cimrman3 at ntc.zcu.cz (Robert Cimrman)
Date: Fri, 24 Oct 2008 10:31:19 +0200
Subject: [SciPy-dev] integration of symeig in SciPy
In-Reply-To: <20081023160402.GA3443@localhost>
References: <20081023160402.GA3443@localhost>
Message-ID: <490187D7.5060805@ntc.zcu.cz>

Tiziano Zito wrote:
> there has been some discussion about integrating symeig in SciPy.
> symeig is a Python wrapper for the LAPACK functions to solve the
> standard and generalized eigenvalue problems for symmetric (hermitian)
> positive definite matrices. [...]

This is really good news. I had to use the missing functions directly
via scipy.lib.lapack.get_lapack_funcs, which was not very convenient.
For example the lobpcg sparse eigensolver requires the symmetric
routines.
Thank you!
r.

From argriffi at ncsu.edu  Fri Oct 24 08:04:46 2008
From: argriffi at ncsu.edu (alex griffing)
Date: Fri, 24 Oct 2008 08:04:46 -0400
Subject: [SciPy-dev] integration of symeig in SciPy
In-Reply-To: <20081023160402.GA3443@localhost>
References: <20081023160402.GA3443@localhost>
Message-ID: <4901B9DE.2050809@ncsu.edu>

Tiziano Zito wrote:
> there has been some discussion about integrating symeig in SciPy.
> symeig is a Python wrapper for the LAPACK functions to solve the
> standard and generalized eigenvalue problems for symmetric (hermitian)
> positive definite matrices. It is superior to the scipy "linalg.eigh"
> wrapper because it wraps *all* relevant LAPACK routines, making it
> possible, for example, to return just a subset of all
> eigenvectors/eigenvalues, which is a killer feature if you are dealing
> with huge non-sparse matrices. [...]

This sounds like it would be very useful for scipy, although I am just
a user, not a dev. Could I use symeig to extract only the eigenvector
corresponding to the largest eigenvalue, given a non-sparse symmetric
real matrix whose eigenvalues are all positive except for exactly one
that is zero? The documentation for symeig says that it works for
"symmetric (hermitian) definite positive matrices", but I guess mine
would be a "symmetric (hermitian) semi-definite positive matrix". If
symeig wouldn't work, is there another way I could do this without
using eigh to get the full eigendecomposition?

Thanks,
Alex

From nwagner at iam.uni-stuttgart.de  Fri Oct 24 09:52:28 2008
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Fri, 24 Oct 2008 15:52:28 +0200
Subject: [SciPy-dev] integration of symeig in SciPy
In-Reply-To: <4901B9DE.2050809@ncsu.edu>
References: <20081023160402.GA3443@localhost> <4901B9DE.2050809@ncsu.edu>
Message-ID:

On Fri, 24 Oct 2008 08:04:46 -0400 alex griffing wrote:
> This sounds like it would be very useful for scipy, although I am just
> a user, not a dev. Could I use symeig to extract only the eigenvector
> corresponding to the largest eigenvalue, given a non-sparse symmetric
> real matrix whose eigenvalues are all positive except for exactly one
> that is zero? [...]

from symeig import symeig
from numpy import diag, ones

n = 10
# M: diagonal mass-type matrix, K: tridiagonal (2, -1) stiffness-type matrix
M = 2.0*diag(ones(n))
K = 2*diag(ones(n)) - diag(ones(n-1), 1) - diag(ones(n-1), -1)
print M
print K

# select only the n-th (largest) eigenvalue; symeig's range is 1-based
lo = n
hi = n
w, Z = symeig(K, M, range=(lo, hi))
print
print 'Largest eigenvalue', w
print
print 'Corresponding eigenvector'
print
print Z

Nils
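If symeig's subset selection were folded into scipy.linalg.eigh with a
matching signature (one of Tiziano's open questions), Nils's example
might one day be spelled like this. Purely hypothetical at this point:
the eigvals keyword shown here does not exist in scipy today, and its
zero-based, inclusive index convention is an assumption about what a
unified interface could look like:

    from numpy import diag, ones
    from scipy.linalg import eigh   # hypothetical extended eigh

    n = 10
    M = 2.0*diag(ones(n))
    K = 2*diag(ones(n)) - diag(ones(n-1), 1) - diag(ones(n-1), -1)

    # request only the largest generalized eigenpair
    w, Z = eigh(K, M, eigvals=(n-1, n-1))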
From nwagner at iam.uni-stuttgart.de  Fri Oct 24 11:29:29 2008
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Fri, 24 Oct 2008 17:29:29 +0200
Subject: [SciPy-dev] integration of symeig in SciPy
In-Reply-To: <4901B9DE.2050809@ncsu.edu>
References: <20081023160402.GA3443@localhost> <4901B9DE.2050809@ncsu.edu>
Message-ID:

On Fri, 24 Oct 2008 08:04:46 -0400 alex griffing wrote:
> Could I use symeig to extract only the eigenvector corresponding to
> the largest eigenvalue, given a non-sparse symmetric real matrix whose
> eigenvalues are all positive except for exactly one that is zero? [...]

Here is an example where K is positive semidefinite.

from symeig import symeig
from numpy import diag, ones, r_

n = 10
M = 2.0*diag(ones(n))
#
# Symmetric positive-semidefinite K (free-free chain: exactly one zero
# eigenvalue, with the constant vector as rigid body mode)
#
K = diag(r_[1.0, 2*ones(n-2), 1]) - diag(ones(n-1), 1) - diag(ones(n-1), -1)
K[0,0] = 1.0
K[-1,-1] = 1.0
print M
print K

# smallest eigenvalue (1-based index 1): the zero mode
lo = 1
hi = 1
w, Z = symeig(K, M, range=(lo, hi))
print
print 'Smallest eigenvalue', w
print
print 'Rigid body mode'
print
print Z

# largest eigenvalue (1-based index n)
lo = n
hi = n
w, Z = symeig(K, M, range=(lo, hi))
print
print 'Largest eigenvalue', w
print
print 'Corresponding eigenvector'
print
print Z

Nils

From millman at berkeley.edu  Sun Oct 26 05:15:55 2008
From: millman at berkeley.edu (Jarrod Millman)
Date: Sun, 26 Oct 2008 02:15:55 -0700
Subject: [SciPy-dev] Unifying lapack/blas in lib and linalg
Message-ID:

Hello,

Before releasing 0.7, I would like to see atlas/blas wrappers in just
one place. Currently the wrappers are in both lib and linalg. The code
duplication has existed for way too long. Given that Tiziano Zito has
offered to contribute symeig, I think that this might be a good time to
clean up our existing code. Is anyone interested in working on this?
Maybe a few people will be willing to step forward and focus on this
during the SciPy 0.7 sprint that we will be holding Nov. 1-2. Please
let me know ASAP if you are interested or have suggestions.

Thoughts?

Thanks,

-- 
Jarrod Millman
Computational Infrastructure for Research Labs
10 Giannini Hall, UC Berkeley
phone: 510.643.4014
http://cirl.berkeley.edu/

From pav at iki.fi  Sun Oct 26 10:13:37 2008
From: pav at iki.fi (Pauli Virtanen)
Date: Sun, 26 Oct 2008 16:13:37 +0200
Subject: [SciPy-dev] docs.scipy.org -- new site for the documentation marathon
Message-ID: <1225030418.8147.86.camel@idol>

Hi all,

Gaël and I finished moving the Scipy documentation editor web app to a
new site (at Enthought, thanks!). The Numpy documentation marathon can
now be found at

    http://docs.scipy.org/numpy/

instead of the old location. Everything should work as previously, or
even better, but if something is broken, please notify me.

***

The domain name was, however, very tempting for hosting also the
Sphinx-generated documentation, so we put those there too. You can
browse to http://docs.scipy.org/ to see what they currently look like.
Suggestions on how to improve this are welcome!

Currently, the sources for the Sphinx stuff can be found here:

    https://code.launchpad.net/~pauli-virtanen/scipy/numpy-refguide
    http://www.iki.fi/pav/tmp/scipy-refguide.tar.gz
    http://www.iki.fi/pav/tmp/docsscipy-frontpage.tar.gz

(The latter two are tarballs because of problems with launchpad.net and
apparently Bzr 1.6.1.) But I think I will move all of them to Scipy's
SVN -- I believe the advantages of using the same revision control
system as numpy and scipy themselves will outweigh those of using Bzr.

Eventually, we can also make all this documentation editable on the web
via the Documentation marathon app. (Actually, we already could, but
creating new files is not possible using the app yet, and there are
some performance issues remaining in rendering long documentation
files.)

***

Now, the role of docs.scipy.org warrants discussion, because on the one
hand, the domain "docs.scipy.org" looks very official, and on the other
hand, "scipy.org/Documentation" claims to be the place for official
documentation. What do you think: should we use the current front page
of docs.scipy.org, shifting the focus and entry point of documentation
to the Sphinx-generated pages, or still keep the focus on the Moin wiki
at scipy.org?

I'd like to see a move towards a full dedicated documentation site -- I
believe using Sphinx is the way to go in the future, and documentation
generated by it does have a reassuring and clear visual appearance.
Also, making the Sphinx documentation more prominent could help people
to focus documentation efforts, and write more of it :)

Cheers,

-- 
Pauli Virtanen

From millman at berkeley.edu  Sun Oct 26 13:25:07 2008
From: millman at berkeley.edu (Jarrod Millman)
Date: Sun, 26 Oct 2008 10:25:07 -0700
Subject: [SciPy-dev] [Numpy-discussion] docs.scipy.org -- new site for the documentation marathon
In-Reply-To: <1225030418.8147.86.camel@idol>
References: <1225030418.8147.86.camel@idol>
Message-ID:

Thanks so much for doing this. It looks great.

On Sun, Oct 26, 2008 at 7:13 AM, Pauli Virtanen wrote:
> Now, the role of docs.scipy.org warrants discussion, because on the one
> hand, the domain "docs.scipy.org" looks very official, and on the other
> hand, "scipy.org/Documentation" claims to be the place for official
> documentation.
> What do you think: should we use the current front page of
> docs.scipy.org, shifting the focus and entry point of documentation to
> the Sphinx-generated pages, or still keep the focus on the Moin wiki
> at scipy.org?

docs.scipy.org should be the official documentation. The Moin wiki
documentation served its purpose, but now there is something much
better. I propose that the 'Documentation' sidebar on
http://www.scipy.org/ point to http://docs.scipy.org. Eventually, I
would like to see all the content from
http://www.scipy.org/Documentation move into the Sphinx-based system
(and then we can just delete the Moin page). Until that happens we can
just leave the link on http://docs.scipy.org to http://www.scipy.org/.

Also I have cross-posted my response to the numpy list, but let's have
this discussion on the scipy developer's list going forward.

Thanks,

-- 
Jarrod Millman
Computational Infrastructure for Research Labs
10 Giannini Hall, UC Berkeley
phone: 510.643.4014
http://cirl.berkeley.edu/

From millman at berkeley.edu  Sun Oct 26 13:37:00 2008
From: millman at berkeley.edu (Jarrod Millman)
Date: Sun, 26 Oct 2008 10:37:00 -0700
Subject: [SciPy-dev] [Numpy-discussion] docs.scipy.org -- new site for the documentation marathon
In-Reply-To:
References: <1225030418.8147.86.camel@idol>
Message-ID:

On Sun, Oct 26, 2008 at 9:41 AM, Nathan Bell wrote:
> As a concrete example of your question, I've been working on some
> scipy.sparse documentation [1], and it seems like it will be fairly
> lengthy when completed. Would it be appropriate to merge it into the
> Sphinx documentation for scipy.sparse [2], or should the Sphinx docs
> be more concise?
>
> [1] http://www.scipy.org/SciPyPackages/Sparse
> [2] http://docs.scipy.org/doc/scipy/reference/sparse.html

Thanks Nathan for working on the sparse documentation! I would like to
see all your documentation in the Sphinx docs:
http://docs.scipy.org/doc/scipy/reference/sparse.html

All the documentation should be in the Sphinx system. One of the main
advantages of using the Sphinx system is that it is much easier to make
living documentation if it is more closely integrated with the code.
First, when changes are made to the code it is much easier to just use
a tool like grep or grin to find all the places where you need to
update the documentation. Since we can use matplotlib to generate
figures when the documentation is created, we can better ensure that
the figures in our documentation can be generated by the code in our
documentation. We can also use our testing infrastructure to run any
code in the documentation. This has the advantages of providing us with
more tests and helps keep code in our documentation runnable.

I would like to see http://www.scipy.org/SciPyPackages/Sparse removed
and all the content merged into
http://docs.scipy.org/doc/scipy/reference/sparse.html

I have cross-posted my response to the numpy list, but let's have this
discussion on the scipy developer's list going forward.

Thanks,

-- 
Jarrod Millman
Computational Infrastructure for Research Labs
10 Giannini Hall, UC Berkeley
phone: 510.643.4014
http://cirl.berkeley.edu/
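The figure-generation point can be made concrete: with matplotlib's
Sphinx plot directive enabled, a docstring or rst page embeds the code
that draws its own figure, so the plot cannot drift out of sync with
the code. A sketch -- the directive comes from matplotlib's sphinxext
plot_directive, and the surrounding function is hypothetical:

    def window_demo():
        """
        Hypothetical docstring fragment. At documentation build time,
        Sphinx executes the code below and inserts the resulting figure.

        .. plot::

           import numpy as np
           import matplotlib.pyplot as plt
           plt.plot(np.hanning(51))
           plt.title("Hanning window")
        """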
From pav at iki.fi  Sun Oct 26 13:41:07 2008
From: pav at iki.fi (Pauli Virtanen)
Date: Sun, 26 Oct 2008 17:41:07 +0000 (UTC)
Subject: [SciPy-dev] docs.scipy.org -- new site for the documentation marathon
References: <1225030418.8147.86.camel@idol>
Message-ID:

Sun, 26 Oct 2008 12:41:28 -0400, Nathan Bell wrote:
> On Sun, Oct 26, 2008 at 10:13 AM, Pauli Virtanen wrote:
>> Now, the role of docs.scipy.org warrants discussion [...] Also,
>> making the Sphinx documentation more prominent could help people to
>> focus documentation efforts, and write more of it :)
>
> Great work, I definitely like the look of the Sphinx documentation.
>
> As a concrete example of your question, I've been working on some
> scipy.sparse documentation [1], and it seems like it will be fairly
> lengthy when completed. Would it be appropriate to merge it into the
> Sphinx documentation for scipy.sparse [2], or should the Sphinx docs
> be more concise?

I think we will in the end need both brief API reference documentation
(docstrings etc., explanations of common semantics such as dtype
specifications) and then more detailed usage documentation.
The current Sphinx docs are brief because they almost exclusively
contain only the former, and for Scipy, mostly only the docstrings are
there.

To me, it would make sense to keep all the documentation as closely
integrated as possible -- then you can use Sphinx's cross-referencing,
indexing, glossary, math, etc. facilities, and the documentation will
be easy to find, searchable, and have consistent appearance. So, I'd
like to see your work merged to the Sphinx docs.

If I was writing the scipy.sparse documentation, I would probably do
something like the following: one page, say scipy.sparse-reference.rst,
for a terse listing of the API of the package (something like what's
currently there), and then dedicate as many pages as necessary for more
narrative usage documentation (what you are writing). Or, I'd put
everything on one page, as done eg. in Python's optparse documentation
[1].

.. [1] http://docs.python.org/library/optparse.html

-- 
Pauli Virtanen

From millman at berkeley.edu  Sun Oct 26 13:44:36 2008
From: millman at berkeley.edu (Jarrod Millman)
Date: Sun, 26 Oct 2008 10:44:36 -0700
Subject: [SciPy-dev] docs.scipy.org -- new site for the documentation marathon
In-Reply-To:
References: <1225030418.8147.86.camel@idol>
Message-ID:

On Sun, Oct 26, 2008 at 10:41 AM, Pauli Virtanen wrote:
> I think we will in the end need both brief API reference documentation
> (docstrings etc., explanations of common semantics such as dtype
> specifications) and then more detailed usage documentation.
>
> To me, it would make sense to keep all the documentation as closely
> integrated as possible -- then you can use Sphinx's cross-referencing,
> indexing, glossary, math, etc. facilities, and the documentation will
> be easy to find, searchable, and have consistent appearance.

+1

If my opinion isn't obvious already ;)

-- 
Jarrod Millman
Computational Infrastructure for Research Labs
10 Giannini Hall, UC Berkeley
phone: 510.643.4014
http://cirl.berkeley.edu/

From pav at iki.fi  Sun Oct 26 14:22:21 2008
From: pav at iki.fi (Pauli Virtanen)
Date: Sun, 26 Oct 2008 18:22:21 +0000 (UTC)
Subject: [SciPy-dev] docs.scipy.org -- new site for the documentation marathon
References: <1225030418.8147.86.camel@idol>
Message-ID:

Hi,

Sun, 26 Oct 2008 16:13:37 +0200, Pauli Virtanen wrote:
[clip]
> Currently, the sources for the Sphinx stuff can be found here:
>
>     https://code.launchpad.net/~pauli-virtanen/scipy/numpy-refguide
>     http://www.iki.fi/pav/tmp/scipy-refguide.tar.gz
>     http://www.iki.fi/pav/tmp/docsscipy-frontpage.tar.gz
>
> (The latter two are tarballs because of problems with launchpad.net
> and apparently Bzr 1.6.1.)
>
> But I think I will move all of them to Scipy's SVN -- I believe the
> advantages of using the same revision control system as numpy and
> scipy themselves will outweigh those of using Bzr.

Sources for the docs live from now on at

    http://svn.scipy.org/svn/numpy/numpy-docs/trunk
    http://svn.scipy.org/svn/scipy/scipy-docs/trunk

-- 
Pauli Virtanen

From stefan at sun.ac.za  Mon Oct 27 04:44:42 2008
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Mon, 27 Oct 2008 10:44:42 +0200
Subject: [SciPy-dev] Tasks for SciPy Sprint, 1 and 2 November
Message-ID: <9457e7c80810270144w6cb45746hb8e291854b8d2f50@mail.gmail.com>

Hi all,

The wiki page for the sprints,

    http://scipy.org/scipy/scipy/wiki/SciPySprintOhSeven

now lists the main topics for the sprint. Please feel free to add your
own.

Regards
Stéfan

From josef.pktd at gmail.com  Mon Oct 27 09:18:06 2008
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Mon, 27 Oct 2008 09:18:06 -0400
Subject: [SciPy-dev] Tasks for SciPy Sprint, 1 and 2 November
In-Reply-To: <9457e7c80810270144w6cb45746hb8e291854b8d2f50@mail.gmail.com>
References: <9457e7c80810270144w6cb45746hb8e291854b8d2f50@mail.gmail.com>
Message-ID: <1cd32cbb0810270618l25fc1249s6d544b63a0c11c9b@mail.gmail.com>

Is there any interest at all in fixing scipy.stats.distribution?

Josef

From stefan at sun.ac.za  Mon Oct 27 09:46:02 2008
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Mon, 27 Oct 2008 15:46:02 +0200
Subject: [SciPy-dev] Tasks for SciPy Sprint, 1 and 2 November
In-Reply-To: <1cd32cbb0810270618l25fc1249s6d544b63a0c11c9b@mail.gmail.com>
References: <9457e7c80810270144w6cb45746hb8e291854b8d2f50@mail.gmail.com>
	<1cd32cbb0810270618l25fc1249s6d544b63a0c11c9b@mail.gmail.com>
Message-ID: <9457e7c80810270646k78796c2ei4bdd0d8b374744ae@mail.gmail.com>

2008/10/27 :
> Is there any interest at all in fixing scipy.stats.distribution?

Certainly! I saw you filed tickets, so those will be taken care of.
Cheers
Stéfan

From jh at physics.ucf.edu  Mon Oct 27 15:16:14 2008
From: jh at physics.ucf.edu (jh at physics.ucf.edu)
Date: Mon, 27 Oct 2008 15:16:14 -0400
Subject: [SciPy-dev] docs.scipy.org -- new site for the documentation marathon
In-Reply-To: (scipy-dev-request@scipy.org)
References:
Message-ID:

I like the Sphinx site!

But, I like the scipy.org/Documentation URL, since docs are part of the
package and not a separate project (like, say, ipython.scipy.org). We
should have a consistent layout of URLs, so that things that are part
of numpy/scipy are under http://scipy.org/ and things that are not
might still be hosted on a scipy.org domain but are not directly under
http://scipy.org/. Is it technically possible/feasible for a Moin site
to (appear to) include another site underneath it? If not, I'd choose
function over form, grudgingly.

For the top page layout, I propose (rationale below):

(No left sidebar)
------------------------------------------------------------
DOCUMENTATION EDITOR
  Write, review, or proof the docs!

MATURE DOCUMENTS

  Guide to Numpy
    PDF book by Travis Oliphant

DOCUMENT-IN-PROGRESS SNAPSHOTS

  Numpy Reference Guide (as of yyyy-mm-dd)
    PDF  zipped HTML
    refguide glossary shortcut

  Numpy User Guide (as of yyyy-mm-dd)
    PDF  zipped HTML

  Scipy Reference Guide (as of yyyy-mm-dd)
    PDF  zipped HTML

SEE ALSO

  SciPy.org
    all things NumPy/SciPy (bug reports, downloads, conferences, etc.)

  SciPy.org/Documentation
    more documentation, not (yet) in the right format for this site

  SciPy.org/Cookbook
    live, user-contributed examples and recipes for common tasks
------------------------------------------------------------

Rationale: We have had a real problem getting a critical mass on doc
writing. We rallied in the summer, but for the past two months almost
no contributions have been made (click on stats to see). Originally,
the only purpose of the site was editing the docs, and someone who
wanted to read what was current did that through our editing
infrastructure. This made it very encouraging to edit. Now, when they
click on "improved docstrings" on scipy.org/Documentation, they instead
go to a passive presentation of the current content, no editor. They
have to go to the last link on the page to get into the editor, and the
text there doesn't scream "Editor!". Until we have *at least* version 1
of each of our docs, we should keep the edit link first. I'm almost
ready to say the docs in progress should *only* be accessible through
the editor (which does present HTML, zipped HTML, and PDF) until we
have version 1 done.

We should also remove irrelevant links and text. The Developer Zone and
its instructions for reporting bugs have their own section of the main
web site and are not docs, so they don't belong here. The content of
the left sidebar is redundant and wastes space. The download links
should go under the live-doc links, as above, since this format is
easier to read than that on the download page and there really isn't
any other content on this page. Scipy.org is either a see-also or
nonexistent, depending on whether we can get this page to appear under
it. The glossary shortcut is also accommodated above.
--jh--

From stefan at sun.ac.za  Mon Oct 27 15:45:24 2008
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Mon, 27 Oct 2008 21:45:24 +0200
Subject: [SciPy-dev] docs.scipy.org -- new site for the documentation marathon
In-Reply-To:
References:
Message-ID: <9457e7c80810271245yf7309e5pc779c3e4a8c48d94@mail.gmail.com>

2008/10/27 :
> But, I like the scipy.org/Documentation URL, since docs are part of

There is one big advantage to docs.scipy.org -- it is short and easy to
remember. In other words, from the user's point of view: "Where would I
expect the docs to be?" "Probably at doc(s).scipy.org or
scipy.org/doc(s)".

Stéfan

From robert.kern at gmail.com  Mon Oct 27 15:47:59 2008
From: robert.kern at gmail.com (Robert Kern)
Date: Mon, 27 Oct 2008 14:47:59 -0500
Subject: [SciPy-dev] docs.scipy.org -- new site for the documentation marathon
In-Reply-To: <9457e7c80810271245yf7309e5pc779c3e4a8c48d94@mail.gmail.com>
References: <9457e7c80810271245yf7309e5pc779c3e4a8c48d94@mail.gmail.com>
Message-ID: <3d375d730810271247j46a6ad5bh8eee79886808086b@mail.gmail.com>

On Mon, Oct 27, 2008 at 14:45, Stéfan van der Walt wrote:
> 2008/10/27 :
>> But, I like the scipy.org/Documentation URL, since docs are part of
>
> There is one big advantage to docs.scipy.org -- it is short and easy
> to remember. In other words, from the user's point of view: "Where
> would I expect the docs to be?" "Probably at doc(s).scipy.org or
> scipy.org/doc(s)".

It's also easy to Google with site:docs.scipy.org

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco

From pav at iki.fi  Mon Oct 27 19:31:20 2008
From: pav at iki.fi (Pauli Virtanen)
Date: Mon, 27 Oct 2008 23:31:20 +0000 (UTC)
Subject: [SciPy-dev] docs.scipy.org -- new site for the documentation marathon
References:
Message-ID:

Hi,

Mon, 27 Oct 2008 15:16:14 -0400, jh wrote:
> I like the Sphinx site!
>
> But, I like the scipy.org/Documentation URL, since docs are part of
> the package and not a separate project (like, say, ipython.scipy.org).
> We should have a consistent layout of URLs, so that things that are
> part of numpy/scipy are under http://scipy.org/ and things that are
> not might still be hosted on a scipy.org domain but are not directly
> under http://scipy.org/. Is it technically possible/feasible for a
> Moin site to (appear to) include another site underneath it? If not,
> I'd choose function over form, grudgingly.

I don't see a problem having a separate domain docs.XXX -- I think it's
relatively clear to the user what the site contains, and it emphasizes
that this is the "official" documentation. Also, several projects, eg.
Python, have a similar layout, so I don't think we are in bad company
with this.

But technically, it would be possible to have scipy.org display
documents residing on docs.scipy.org (eg. using mod_rewrite [P] in
apache -- involving Moin in the process is wasteful).

> For the top page layout, I propose (rationale below):
>
> (No left sidebar)

The branding on the docs site should probably be made a bit more
similar to that on scipy.org, and this probably involves deciding what
to do with the sidebar. There should also be an obvious way back to
scipy.org. The muse is silent, so I don't know how the page should look
now.

> ------------------------------------------------------------
> DOCUMENTATION EDITOR
>   Write, review, or proof the docs!
>
> MATURE DOCUMENTS
>
>   Guide to Numpy
>     PDF book by Travis Oliphant
>
> DOCUMENT-IN-PROGRESS SNAPSHOTS
>
>   Numpy Reference Guide (as of yyyy-mm-dd)
>     PDF  zipped HTML
>     refguide glossary shortcut
>
>   Numpy User Guide (as of yyyy-mm-dd)
>     PDF  zipped HTML
>
>   Scipy Reference Guide (as of yyyy-mm-dd)
>     PDF  zipped HTML
[clip]

Looks good to me. How about combining the "DOCUMENTATION EDITOR" and
"SNAPSHOTS" sections?

[clip]
> Rationale: We have had a real problem getting a critical mass on doc
> writing. We rallied in the summer, but for the past two months almost
> no contributions have been made (click on stats to see). Originally,
> the only purpose of the site was editing the docs, and someone who
> wanted to read what was current did that through our editing
> infrastructure. This made it very encouraging to edit. Now, when they
> click on "improved docstrings" on scipy.org/Documentation, they
> instead go to a passive presentation of the current content, no
> editor. They have to go to the last link on the page to get into the
> editor, and the text there doesn't scream "Editor!". Until we have
> *at least* version 1 of each of our docs, we should keep the edit
> link first.

It's possible (and easy) to add an edit link on each editable page in
the static documentation. I can add these. The changes won't, however,
be immediately visible in the static documentation, so maybe clicking
the link the first time should show a popup (or something else easy to
implement in static javascript) that explains this.

> I'm almost ready to say the docs in progress should *only* be
> accessible through the editor (which does present HTML, zipped HTML,
> and PDF) until we have version 1 done.

The problem is that presently not all pages are editable via the
documentation wiki, only the docstrings, so for motivation (including
mine), I think it's important to have a snapshot of the "final"
versions available somewhere. Maybe not linked to with big letters, but
still available. Issue [1] is a blocker for this, and one needs to
think also about [2].

.. [1] http://code.google.com/p/pydocweb/issues/detail?id=26
.. [2] http://code.google.com/p/pydocweb/issues/detail?id=27

> We should also remove irrelevant links and text.
[clip: downloads to main page, remove unnecessary links]

Sure, these seem like good improvements.

-- 
Pauli Virtanen

From gael.varoquaux at normalesup.org  Tue Oct 28 04:18:08 2008
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Tue, 28 Oct 2008 09:18:08 +0100
Subject: [SciPy-dev] Of use and abuse of sphinx's math directive
Message-ID: <20081028081807.GB11820@phare.normalesup.org>

Howdy,

Sphinx is great, we all love Sphinx. But I do wonder if sometimes we do
not use the math directive a bit too much, eg the "n" and "n-1" on:

    http://scipy.org/scipy/scipy/browser/trunk/scipy/cluster/hierarchy.py#L1090

(this is just an example that I picked randomly).

When the formula written is simple enough that its readability is not
too much affected by having it in text in the Sphinxified document, I
think it is a pity to render the plain text docstrings hard to read.

What do people think?

Gaël
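As an illustration of the trade-off (a hypothetical docstring fragment,
not taken from scipy): small expressions stay readable as plain text in
a terminal, while the math directive is better reserved for formulas
that genuinely need typesetting:

    def sample_variance(x):
        r"""
        Return the unbiased sample variance of x.

        The normalization factor is 1/(n - 1) -- simple enough to leave
        as plain text. A longer formula is worth the directive:

        .. math:: s^2 = \frac{1}{n - 1} \sum_{i=1}^{n} (x_i - \bar{x})^2
        """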
From david at ar.media.kyoto-u.ac.jp  Tue Oct 28 04:49:21 2008
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Tue, 28 Oct 2008 17:49:21 +0900
Subject: [SciPy-dev] Renaming fftpack to fft, removing backends
Message-ID: <4906D211.6070505@ar.media.kyoto-u.ac.jp>

Hi,

There are some things I would like to do for scipy.fftpack during the
sprint, and I would like to know if anybody would be strongly against
any one of those:

renaming scipy.fftpack -> scipy.fft
===================================

Rationale: fftpack is the name of an implementation (fftpack being the
name of the Fortran library we use when no other fft library is
available), and does not fit the naming convention of any other scipy
top-level subpackage.

How: For 0.7, we could rename it while keeping scipy.fftpack as an
empty placeholder (like scipy.linsolve), and deprecating scipy.fftpack.

Removing all backends but fftpack
=================================

Rationale: we have 5 backends (fftpack, djbfft, fftw2 and 3, mkl), and
none of them is complete. I would much prefer to focus on one (fftpack)
and make it complete (for example, supporting the float32 type) than to
have 5 half-implemented ones. Also, more backends mean more reliability
problems for installation, distribution, support, etc.

People may argue that fftw3 or MKL is faster: it is true in theory, but
in practice, not so much. In particular, with the current
implementation, both fftw2 and fftw3 are as fast or even slower than
fftpack (because the wrappers are relatively naive, and do not use fftw
effectively to get maximum speed). MKL is faster, but a lot of
functionality is missing from this backend.

How: putting all the code not related to fftpack into a scikit or
something, so people who really need the speed can use this. Since you
need to compile scipy by yourself anyway for MKL, I don't see this as a
huge problem.

If I don't get an answer within the sprint, I will proceed with the
above plan.

cheers,

David
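If the rename went ahead, the empty placeholder David mentions could be
as small as this (a sketch of the idea only -- the rename is merely
proposed here, so the module path and warning text are assumptions):

    # scipy/fftpack/__init__.py -- deprecated alias for scipy.fft
    import warnings

    warnings.warn("scipy.fftpack has been renamed to scipy.fft; "
                  "the old name will be removed in a future release",
                  DeprecationWarning)

    # Re-export everything from the new location so old imports keep
    # working for one deprecation cycle.
    from scipy.fft import *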
From stefan at sun.ac.za  Tue Oct 28 05:33:37 2008
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Tue, 28 Oct 2008 11:33:37 +0200
Subject: [SciPy-dev] Of use and abuse of sphinx's math directive
In-Reply-To: <20081028081807.GB11820@phare.normalesup.org>
References: <20081028081807.GB11820@phare.normalesup.org>
Message-ID: <9457e7c80810280233j618c95fcn997505098be63d23@mail.gmail.com>

2008/10/28 Gael Varoquaux :
> Sphinx is great, we all love Sphinx. But I do wonder if sometimes we
> do not use the math directive a bit too much, eg the "n" and "n-1" on:
> http://scipy.org/scipy/scipy/browser/trunk/scipy/cluster/hierarchy.py#L1090
> (this is just an example that I picked randomly).
>
> When the formula written is simple enough that its readability is not
> too much affected by having it in text in the Sphinxified document, I
> think it is a pity to render the plain text docstrings hard to read.

Certainly -- the idea is to use the :math: tag sparingly.

Cheers
Stéfan

From millman at berkeley.edu  Tue Oct 28 07:31:22 2008
From: millman at berkeley.edu (Jarrod Millman)
Date: Tue, 28 Oct 2008 04:31:22 -0700
Subject: [SciPy-dev] Renaming fftpack to fft, removing backends
In-Reply-To: <4906D211.6070505@ar.media.kyoto-u.ac.jp>
References: <4906D211.6070505@ar.media.kyoto-u.ac.jp>
Message-ID:

On Tue, Oct 28, 2008 at 1:49 AM, David Cournapeau wrote:
> renaming scipy.fftpack -> scipy.fft
> ===================================
>
> Rationale: fftpack is the name of an implementation [...]
>
> How: For 0.7, we could rename it while keeping scipy.fftpack as an
> empty placeholder (like scipy.linsolve), and deprecating
> scipy.fftpack.

+1

> Removing all backends but fftpack
> =================================
>
> Rationale: we have 5 backends (fftpack, djbfft, fftw2 and 3, mkl), and
> none of them is complete. [...]
>
> How: putting all the code not related to fftpack into a scikit or
> something, so people who really need the speed can use this. [...]

+1

-- 
Jarrod Millman
Computational Infrastructure for Research Labs
10 Giannini Hall, UC Berkeley
phone: 510.643.4014
http://cirl.berkeley.edu/

From matthieu.brucher at gmail.com  Tue Oct 28 09:52:20 2008
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Tue, 28 Oct 2008 14:52:20 +0100
Subject: [SciPy-dev] Renaming fftpack to fft, removing backends
In-Reply-To: <4906D211.6070505@ar.media.kyoto-u.ac.jp>
References: <4906D211.6070505@ar.media.kyoto-u.ac.jp>
Message-ID:

> People may argue that fftw3 or MKL is faster: it is true in theory,
> but in practice, not so much. [...] MKL is faster, but a lot of
> functionality is missing from this backend.

Besides, the latest MKL versions may crash with Python at the moment
(and the Intel engineers do not seem to care about the issue).

Matthieu
-- 
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From oliphant at enthought.com  Tue Oct 28 10:28:19 2008
From: oliphant at enthought.com (Travis E. Oliphant)
Date: Tue, 28 Oct 2008 09:28:19 -0500
Subject: [SciPy-dev] Renaming fftpack to fft, removing backends
In-Reply-To: <4906D211.6070505@ar.media.kyoto-u.ac.jp>
References: <4906D211.6070505@ar.media.kyoto-u.ac.jp>
Message-ID: <49072183.8000200@enthought.com>

David Cournapeau wrote:
> renaming scipy.fftpack -> scipy.fft
> =================================== [...]

+1

> Removing all backends but fftpack
> ================================= [...]

+1

From ellisonbg.net at gmail.com  Tue Oct 28 11:21:21 2008
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Tue, 28 Oct 2008 08:21:21 -0700
Subject: [SciPy-dev] Renaming fftpack to fft, removing backends
In-Reply-To: <4906D211.6070505@ar.media.kyoto-u.ac.jp>
References: <4906D211.6070505@ar.media.kyoto-u.ac.jp>
Message-ID: <6ce0ac130810280821r5c01ec34kda20ced46f6849cd@mail.gmail.com>

+1

On Tue, Oct 28, 2008 at 1:49 AM, David Cournapeau wrote:
> There are some things I would like to do for scipy.fftpack during
> the sprint, and I would like to know if anybody would be strongly
> against any one of those:
>
> renaming scipy.fftpack -> scipy.fft [...]
>
> Removing all backends but fftpack [...]
From charlesr.harris at gmail.com  Tue Oct 28 13:04:15 2008
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 28 Oct 2008 11:04:15 -0600
Subject: [SciPy-dev] Renaming fftpack to fft, removing backends
In-Reply-To: <4906D211.6070505@ar.media.kyoto-u.ac.jp>
References: <4906D211.6070505@ar.media.kyoto-u.ac.jp>
Message-ID:

On Tue, Oct 28, 2008 at 2:49 AM, David Cournapeau wrote:
> People may argue that fftw3 or MKL is faster: it is true in theory,
> but in practice, not so much. [...]

IIRC, there are also differences in the way the results of real
transforms are stored, so these backends aren't all that compatible.

> How: putting all the code not related to fftpack into a scikit or
> something, so people who really need the speed can use this. Since you
> need to compile scipy by yourself anyway for MKL, I don't see this as
> a huge problem.

+1. The fftpack is a pretty good (if old) implementation whose main
failing is not taking maximum advantage of cache. It does need some
supplementation, a Bluestein implementation for instance, but that is
easier to do with a single target.

Chuck
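For readers who haven't met it: Bluestein's algorithm rewrites a
length-n DFT as a convolution, which can then be evaluated with
power-of-two FFTs, so arbitrary (including prime) lengths run in
O(n log n). A minimal numpy sketch of the idea, illustrative only and
not proposed scipy code:

    import numpy as np

    def bluestein_dft(x):
        """DFT of arbitrary length via Bluestein's chirp-z trick."""
        x = np.asarray(x, complex)
        n = len(x)
        k = np.arange(n)
        w = np.exp(-1j * np.pi * k**2 / n)      # "chirp" factors
        m = 1                                    # power of two >= 2n - 1
        while m < 2 * n - 1:
            m *= 2
        a = np.zeros(m, complex)
        a[:n] = x * w
        b = np.zeros(m, complex)
        b[:n] = w.conj()
        b[m - n + 1:] = w[1:][::-1].conj()       # wrap-around, negative lags
        # circular convolution via power-of-two FFTs
        return w * np.fft.ifft(np.fft.fft(a) * np.fft.fft(b))[:n]

    # agrees with the direct transform, e.g. for a prime length:
    x = np.random.rand(17)
    assert np.allclose(bluestein_dft(x), np.fft.fft(x))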
URL: From jh at physics.ucf.edu Tue Oct 28 13:21:43 2008 From: jh at physics.ucf.edu (jh at physics.ucf.edu) Date: Tue, 28 Oct 2008 13:21:43 -0400 Subject: [SciPy-dev] docs.scipy.org -- new site for the documentation marathon In-Reply-To: (scipy-dev-request@scipy.org) References: Message-ID: >> 2008/10/27 : >>> But, I like the scipy.org/Documentation URL, since docs are part of > >> There is one big advantage to docs.scipy.org -- it is short and easy >> to remember. In other words, from the user's point of view: "Where >> would I expect the docs to be?" "Probably at doc(s).scipy.org or >> scipy.org/doc(s)". How about, go to scipy.org and click on the giant doc image at the top or the Documentation item on the left, just as for every other major topic? > It's also easy to Google with site:docs.scipy.org site:scipy.org/Documentation works just as well. If you don't like the length of the word "Documentation", I'm all for shortening it to docs. I actually don't have a strong preference between docs.scipy.org and scipy.org/docs PROVIDED that it's consistent with the other URLs on the site. Consistency makes it easiest for new users to understand the site as a whole, gives an impression of professionalism, and tends to push out the date when you have to scrap everything and start over. Right now, each major project has an xxx.scipy.org domain, with all its stuff (install, docs, download, etc.) below that. As the flagship products, the numpy and scipy projects are on the eponymous portal domain. If we promote some of the portal's top-level links to project status, it prevents doing the same for, say, ipython's site sections of the same name, and also confuses users, who can't figure out why it's docs.scipy.org but scipy.org/Cookbook. It also sends the message that docs are a different project from numpy, which I think is a poor message to send. On the other hand, we could think of an arrangement where all projects can use the doc-wiki infrastructure, share the same cookbook, have a common set of bug-reporting instructions and download pages, etc. Then we can make cookbook.scipy.org, download.scipy.org, bugs.scipy.org, install.scipy.org (that sounds nice! maybe we should say "document.scipy.org" as a subliminal suggestion...ok, maybe not), and so on, and see stuff for numpy, scipy, ipython, etc. in each area. I just don't understand special-casing docs in the scipy.org site layout. If this reorg makes sense for docs, let's do it for the others. If not, then not. > Looks good to me. How about combining the "DOCUMENTATION EDITOR" and > "SNAPSHOTS" sections? I want the editor to be the first link. That was the purpose of the wiki and we have a real problem getting enough people to help (thanks to all who have, and keep it up!). I think that mature docs should be ahead of immature ones. So that's why to split them. Once the numpy and scipy reference manuals and the user guide are complete, we can move the editor link down to the top of the snapshots. My $0.02. > The problem is that presently not all pages are editable via the > documentation wiki, only the docstrings, Ok, good point. Let's keep the download links. I don't really want to separate people from the info they seek, I just want to get them writing. Hey everyone! Write! Write! We're over 60% of the way to a good first draft! Write! 
;-) --jh-- From robert.kern at gmail.com Tue Oct 28 13:31:02 2008 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 28 Oct 2008 12:31:02 -0500 Subject: [SciPy-dev] Renaming fftpack to fft, removing backends In-Reply-To: <4906D211.6070505@ar.media.kyoto-u.ac.jp> References: <4906D211.6070505@ar.media.kyoto-u.ac.jp> Message-ID: <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> On Tue, Oct 28, 2008 at 03:49, David Cournapeau wrote: > Hi, > > There are some things I would like to do for scipy.fftpack during > the sprint, and I would like to know if anybody would be strongly > against any one of those: > > renaming scipy.fftpack -> scipy.fft > =================================== > > Rationale: fftpack is the name of an implementation (fftpack being the > name of the Fortran library we use when no other fft library is > available), and does not fit the convention of any other scipy top > subpackage. > > How: For 0.7, we could rename it while keeping scipy.fftpack as an empty > placeholder (like scipy.linsolve), and deprecating scipy.fftpack. This needs to be handled carefully. scipy.fft() already exists. It also breaks long-standing code (after the deprecation) for no reason that I find compelling. The name has been there for many years. The only complaint about it that I have ever seen on the lists was that people thought that only FFTPACK was supported. Given the following proposal, that reason goes away. Furthermore, I sometimes have trouble remembering whether the 'fft' symbol I have imported was numpy.fft or numpy.fft.fft. This change compounds that confusion. Overall, I don't think that mere renaming is ever a good reason to break code. It just feels gratuitous. Yes, there might be some benefit to having clearer names than what we have lived with so far (although in this case, I think this is dubious for the reasons given above), but there is always an enormous downside to breaking code. The deprecation policy does nothing to reduce that downside. It was never intended to reduce that downside. Right now, I'm almost sorry for having introduced it because it appears to be giving the impression that it is okay to make these kinds of changes. -1 > Removing all backends but fftpack > ================================= > > Rationale: we have 5 backends (fftpack, djbfft, fftw2 and 3, mkl), and > none of them is complete. I would much prefer focus on one (fftpack), > make it complete (for example supporting float32 type), than having 5 > half implemented ones. Although, more backends mean more reliability > problems for installation, distribution, support, etc... +1 -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From opossumnano at gmail.com Tue Oct 28 13:30:57 2008 From: opossumnano at gmail.com (Tiziano Zito) Date: Tue, 28 Oct 2008 18:30:57 +0100 Subject: [SciPy-dev] eig, eigh, and symeig in scipy Message-ID: <20081028173055.GA15367@localhost> dear list, I have started to work on the integration of symeig in scipy. I am unfortunately unavailable for the sprint on november 1st and 2nd, but I hope to make up for lost time during next week. This mail is mainly directed to robert, who was assigned the task of helping me with this issues, but other devs may have something to say nonetheless. This is a looong message, be aware! 
I've tried to understand the organization of the symmetric
eigenproblem routines in numpy and scipy, and came up with this:

- numpy.linalg.eigh:
  - only standard problems (Ax = lx)
  - wraps (or uses its own lapack_lite version) EVD routines

- scipy.linalg.eigh:
  - only standard problems (Ax = lx)
  - wraps EV routines
  - signature differs from numpy.linalg.eigh

- symeig:
  - for standard problems (Ax = lx) wraps EVR routines
  - for generalized problems (Ax = lBx) wraps GV, GVX, and GVD routines
  - the routine is chosen depending on some flags and on the type of
    problem

Those who don't know the corresponding LAPACK routines can find a
description here:
http://www.netlib.org/lapack/lug/node30.html
and here:
http://www.netlib.org/lapack/lug/node34.html

Basically, for the standard problems:
- the EV routines are the oldest ones (less optimized and less precise)
- the EVD are faster than the EV but require more memory
- the EVX offer the possibility to extract just a subset of the
  eigenvectors/eigenvalues
- the EVR are the fastest and use less memory than all the others

for the generalized problems:
- the GV routines are the oldest ones
- the GVX offer the possibility to extract just a subset of the
  eigenvectors/eigenvalues
- the GVD are faster than GV but require more memory

This is the status quo. Now:

- Why are numpy and scipy wrapping different routines? I think numpy
  should keep on wrapping the EVD routines (unless someone manages to
  add the EVR routines to lapack_lite.c), and I see no reason for scipy
  to wrap the old EV routines instead of the EVD or EVR ones.

- In my opinion, scipy.linalg.eigh should take over the functionality
  now offered by symeig. Note that this would be consistent with what
  is already done with linalg.eig: the scipy version adds generalized
  problem functionality to the standard problem functionality already
  available in numpy (as a plus, eigh would also gain the ability to
  request just a subset of the eigenvalues/eigenvectors).

- While developing symeig we tested the EVR routines: they are really
  better, so I see no reason not to use them by default. On top of
  that, they fail when a matrix is nearly singular, while EV and EVD
  tend to give wrong results without failing. When an EVR routine
  fails, the user can be advised that linalg.svd may be better for
  their case.

- The pyf description files are (if I am not mistaken) automatically
  generated. Should I try to let the generator create pyf files for
  our routines, or do you want to have the little (?) inconsistency of
  having some hand-made pyf files in the distribution?

- In the symeig test suite we use two very useful routines:
  random_rot and symrand. Their docstrings:

  random_rot:
  """Return a random rotation matrix, drawn from the Haar distribution
  (the only uniform distribution on SO(n)).
  The algorithm is described in the paper
  Stewart, G.W., "The efficient generation of random orthogonal
  matrices with an application to condition estimators", SIAM Journal
  on Numerical Analysis, 17(3), pp. 403-409, 1980.
  For more information see
  http://en.wikipedia.org/wiki/Orthogonal_matrix#Randomization"""

  symrand:
  """Return a random symmetric (Hermitian) matrix.

  If 'dim_or_eigv' is an integer N, return a NxN matrix, with
  eigenvalues uniformly distributed on (0,1].

  If 'dim_or_eigv' is a 1-D real array 'a', return a matrix whose
  eigenvalues are sort(a)."""

Do you think these routines may be a useful addition to scipy, or
should I keep them hidden in the tests?

let me know what you think!
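For readers who don't have symeig handy, here is a rough sketch of what
the two helpers can look like in the real case (an illustration using
plain numpy/scipy.linalg, not the symeig code itself; the
QR-with-sign-fix construction samples the same Haar distribution as
Stewart's Householder-based algorithm):

import numpy as np
from scipy import linalg

def random_rot(dim):
    # Sketch: QR-decompose a Gaussian matrix; absorbing the signs of
    # R's diagonal into Q makes Q uniform (Haar) on O(n).
    a = np.random.standard_normal((dim, dim))
    q, r = linalg.qr(a)
    q = q * np.sign(np.diag(r))    # scale column j by sign(r[j, j])
    if linalg.det(q) < 0:
        q[:, 0] = -q[:, 0]         # flip one column: O(n) -> SO(n)
    return q

def symrand(dim_or_eigv):
    # Sketch: conjugate a diagonal matrix of the requested eigenvalues
    # by a Haar-random rotation.
    if isinstance(dim_or_eigv, int):
        d = 1.0 - np.random.random(dim_or_eigv)   # uniform on (0, 1]
    else:
        d = np.sort(dim_or_eigv)
    v = random_rot(len(d))
    return np.dot(v * d, v.T)      # v * d scales column j by d[j]

A quick sanity check: np.linalg.eigvalsh(symrand(np.array([1., 2., 3.])))
should come back as [1, 2, 3] up to rounding.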
ciao, tiziano From robert.kern at gmail.com Tue Oct 28 13:32:41 2008 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 28 Oct 2008 12:32:41 -0500 Subject: [SciPy-dev] docs.scipy.org -- new site for the documentation marathon In-Reply-To: References: Message-ID: <3d375d730810281032j62713ccfx2f209823ba8678e5@mail.gmail.com> On Tue, Oct 28, 2008 at 12:21, wrote: [I wrote:] >> It's also easy to Google with site:docs.scipy.org > > site:scipy.org/Documentation works just as well. If you don't like > the length of the word "Documentation", I'm all for shortening it to > docs. No, it doesn't. Almost none of the actual documentation is a child under scipy.org/Documentation/. And please keep attributions. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From wnbell at gmail.com Tue Oct 28 13:40:05 2008 From: wnbell at gmail.com (Nathan Bell) Date: Tue, 28 Oct 2008 13:40:05 -0400 Subject: [SciPy-dev] eig, eigh, and symeig in scipy In-Reply-To: <20081028173055.GA15367@localhost> References: <20081028173055.GA15367@localhost> Message-ID: On Tue, Oct 28, 2008 at 1:30 PM, Tiziano Zito wrote: > > - In the symeig test suite we use two very useful routines: > random_rot and symrand. Their docstrings: > > random_rot: > """Return a random rotation matrix, drawn from the Haar distribution > (the only uniform distribution on SO(n)). > The algorithm is described in the paper > Stewart, G.W., "The efficient generation of random orthogonal > matrices with an application to condition estimators", SIAM Journal > on Numerical Analysis, 17(3), pp. 403-409, 1980. > For more information see > http://en.wikipedia.org/wiki/Orthogonal_matrix#Randomization""" > > symrand: > """Return a random symmetric (Hermitian) matrix. > > If 'dim_or_eigv' is an integer N, return a NxN matrix, with eigenvalues > uniformly distributed on (0,1]. > > If 'dim_or_eigv' is 1-D real array 'a', return a matrix whose > eigenvalues are sort(a).""" > > Do you think these routines may be a useful addition to scipy, or > should I keep them hidden in the tests? > I'd like to see these and many others in a hypothetical scipy.gallery module. That's unlikely to happen before 0.7, so you'll probably want to hide them in the unittest files for now. FWIW we have a small set of sparse matrix generators that we use pyamg here: http://code.google.com/p/pyamg/source/browse/#svn/trunk/pyamg/gallery -- Nathan Bell wnbell at gmail.com http://graphics.cs.uiuc.edu/~wnbell/ From robert.kern at gmail.com Tue Oct 28 13:43:34 2008 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 28 Oct 2008 12:43:34 -0500 Subject: [SciPy-dev] eig, eigh, and symeig in scipy In-Reply-To: <20081028173055.GA15367@localhost> References: <20081028173055.GA15367@localhost> Message-ID: <3d375d730810281043q4b3ce5cft5f0c2b4337054850@mail.gmail.com> On Tue, Oct 28, 2008 at 12:30, Tiziano Zito wrote: > dear list, > > I have started to work on the integration of symeig in scipy. I am > unfortunately unavailable for the sprint on november 1st and 2nd, > but I hope to make up for lost time during next week. This mail is > mainly directed to robert, who was assigned the task of helping me > with this issues, but other devs may have something to say > nonetheless. This is a looong message, be aware! 
> > I've tried to understand the organization of the symmetric > eigenproblem routines in numpy and scipy, and came up with this: > > - numpy.linalg.eigh: > - only standard problems ( Ax = lx) > - wraps (or uses its own lapack_lite version) EVD routines > > - scipy.linalg.eigh: > - only standard problems (Ax = lx) > - wraps EV routines > - signature differs from numpy.linalg.eigh > > - symeig: > - for standard problems (Ax = lx) wraps EVR routines > - for generalized problems (Ax = lBx) wraps GV, GVX, and GVD routines > - the routine is chosen depending on some flags and depending on > the type of problem > > Those, who don't know the corresponding LAPACK routines, can find a > description here: > http://www.netlib.org/lapack/lug/node30.html > and here: > http://www.netlib.org/lapack/lug/node34.html > > Basically, for the standard problems > - the EV routines are the oldest one (less optimized and less > precise) > - the EVD are faster than the EV but require more memory > - the EVX offer the possibility to extract just a subset of the > eigenvectors/eigenvalues > - the EVR are the fastest and use less memory than all others > > for the generalized problems: > - the GV routines are the oldest one > - the GVX offer the possibility to extract just a subset of the > eigenvectors/eigenvalues > - the GVD are faster then GV but require more memory > > This is the status quo, now: > - why are numpy and scipy wrapping different routines? The usual reasons. History. Limited available time. Possibly limited understanding of the capabilities of LAPACK. Possibly limited capabilities of the common version of LAPACK at the time the wrappers were done. > I think numpy should keep on wrapping the EVD routines (unless > someone manages to add the EVR routines to lapack_lite.c) and I > see no reason for scipy to wrap the old EV routines instead of the > EVD or EVR ones. Agreed. > - In my opinion, scipy.linalg.eigh should take over the > functionality now offered by symeig. Note that this would be > consistent with what is already done with linalg.eig: the scipy > version adds generalized problem functionality to the standard > problem functionality already available in numpy (as a plus eigh > would also grow the possibility to request just > a subset of the eigenvs). Agreed. > - While developing symeig with tested the EVR routines: they are > really better, so I see no reason not to use them by default. On > top of that, they fail when a matrix is nearly singular, while EV > and EVD tend to give wrong results without failing. When an EVR > routine fails the user can be advised that linalg.svd may be > better for his case. When were they introduced to LAPACK? We may need to fall back on the EVD routines if the EVR routines are not present. > - The pyf description files are (if I am not mistaken) automatically > generated. Should I try to let the generator create pyf files for > our routines, or do you want to have the little (?) inconsistency > of having some hand-made pyf in the distribution? They are f2py-generated, then edited. You should use f2py to create descriptions the new subroutines, then add them to the appropriate .pyf files, then edit them by hand to provide the missing information. > - In the symeig test suite we use two very useful routines: > random_rot and symrand. Their docstrings: > > random_rot: > """Return a random rotation matrix, drawn from the Haar distribution > (the only uniform distribution on SO(n)). 
> The algorithm is described in the paper > Stewart, G.W., "The efficient generation of random orthogonal > matrices with an application to condition estimators", SIAM Journal > on Numerical Analysis, 17(3), pp. 403-409, 1980. > For more information see > http://en.wikipedia.org/wiki/Orthogonal_matrix#Randomization""" > > symrand: > """Return a random symmetric (Hermitian) matrix. > > If 'dim_or_eigv' is an integer N, return a NxN matrix, with eigenvalues > uniformly distributed on (0,1]. > > If 'dim_or_eigv' is 1-D real array 'a', return a matrix whose > eigenvalues are sort(a).""" > > Do you think these routines may be a useful addition to scipy, or > should I keep them hidden in the tests? Add them, please. I have needed at least the first one on a few occasions. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From wnbell at gmail.com Tue Oct 28 13:57:20 2008 From: wnbell at gmail.com (Nathan Bell) Date: Tue, 28 Oct 2008 13:57:20 -0400 Subject: [SciPy-dev] eig, eigh, and symeig in scipy In-Reply-To: <3d375d730810281043q4b3ce5cft5f0c2b4337054850@mail.gmail.com> References: <20081028173055.GA15367@localhost> <3d375d730810281043q4b3ce5cft5f0c2b4337054850@mail.gmail.com> Message-ID: On Tue, Oct 28, 2008 at 1:43 PM, Robert Kern wrote: >> Do you think these routines may be a useful addition to scipy, or >> should I keep them hidden in the tests? > > Add them, please. I have needed at least the first one on a few occasions. > Where should they live? Should we create scipy.gallery or something equivalent? -- Nathan Bell wnbell at gmail.com http://graphics.cs.uiuc.edu/~wnbell/ From robert.kern at gmail.com Tue Oct 28 14:03:46 2008 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 28 Oct 2008 13:03:46 -0500 Subject: [SciPy-dev] eig, eigh, and symeig in scipy In-Reply-To: References: <20081028173055.GA15367@localhost> <3d375d730810281043q4b3ce5cft5f0c2b4337054850@mail.gmail.com> Message-ID: <3d375d730810281103u15f5feceuc05b5b37b066ee04@mail.gmail.com> On Tue, Oct 28, 2008 at 12:57, Nathan Bell wrote: > On Tue, Oct 28, 2008 at 1:43 PM, Robert Kern wrote: >>> Do you think these routines may be a useful addition to scipy, or >>> should I keep them hidden in the tests? >> >> Add them, please. I have needed at least the first one on a few occasions. > > Where should they live? Should we create scipy.gallery or something equivalent? Not sure. My uses for them were statistical in nature, not just for generating arbitrary-within-constraints test cases. I guess I'd prefer them to live "primarily" in linalg over gallery. A gallery package would be useful, though. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From wnbell at gmail.com Tue Oct 28 14:19:45 2008 From: wnbell at gmail.com (Nathan Bell) Date: Tue, 28 Oct 2008 14:19:45 -0400 Subject: [SciPy-dev] eig, eigh, and symeig in scipy In-Reply-To: <3d375d730810281103u15f5feceuc05b5b37b066ee04@mail.gmail.com> References: <20081028173055.GA15367@localhost> <3d375d730810281043q4b3ce5cft5f0c2b4337054850@mail.gmail.com> <3d375d730810281103u15f5feceuc05b5b37b066ee04@mail.gmail.com> Message-ID: On Tue, Oct 28, 2008 at 2:03 PM, Robert Kern wrote: > > Not sure. 
My uses for them were statistical in nature, not just for > generating arbitrary-within-constraints test cases. I guess I'd prefer > them to live "primarily" in linalg over gallery. A gallery package > would be useful, though. > Fair enough. We should return to this question after 0.7 is released. -- Nathan Bell wnbell at gmail.com http://graphics.cs.uiuc.edu/~wnbell/ From nwagner at iam.uni-stuttgart.de Tue Oct 28 15:12:53 2008 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Tue, 28 Oct 2008 20:12:53 +0100 Subject: [SciPy-dev] eig, eigh, and symeig in scipy In-Reply-To: References: <20081028173055.GA15367@localhost> <3d375d730810281043q4b3ce5cft5f0c2b4337054850@mail.gmail.com> <3d375d730810281103u15f5feceuc05b5b37b066ee04@mail.gmail.com> Message-ID: On Tue, 28 Oct 2008 14:19:45 -0400 "Nathan Bell" wrote: > On Tue, Oct 28, 2008 at 2:03 PM, Robert Kern > wrote: >> >> Not sure. My uses for them were statistical in nature, >>not just for >> generating arbitrary-within-constraints test cases. I >>guess I'd prefer >> them to live "primarily" in linalg over gallery. A >>gallery package >> would be useful, though. >> > >Fair enough. We should return to this question after 0.7 >is released. > I look forward to seeing the improvements in the trunk. The following tickets are inherently connected http://projects.scipy.org/scipy/scipy/ticket/632 http://projects.scipy.org/scipy/scipy/ticket/630 http://projects.scipy.org/scipy/scipy/ticket/456 Cheers, Nils From dominique.orban at gmail.com Tue Oct 28 16:46:53 2008 From: dominique.orban at gmail.com (Dominique Orban) Date: Tue, 28 Oct 2008 16:46:53 -0400 Subject: [SciPy-dev] eig, eigh, and symeig in scipy In-Reply-To: References: <20081028173055.GA15367@localhost> <3d375d730810281043q4b3ce5cft5f0c2b4337054850@mail.gmail.com> <3d375d730810281103u15f5feceuc05b5b37b066ee04@mail.gmail.com> Message-ID: <8793ae6e0810281346w590a5c5dqd619ea205f9baad5@mail.gmail.com> On Tue, Oct 28, 2008 at 3:12 PM, Nils Wagner wrote: > On Tue, 28 Oct 2008 14:19:45 -0400 > "Nathan Bell" wrote: >> On Tue, Oct 28, 2008 at 2:03 PM, Robert Kern >> wrote: >>> >>> Not sure. My uses for them were statistical in nature, >>>not just for >>> generating arbitrary-within-constraints test cases. I >>>guess I'd prefer >>> them to live "primarily" in linalg over gallery. A >>>gallery package >>> would be useful, though. >>> >> >>Fair enough. We should return to this question after 0.7 >>is released. >> > I look forward to seeing the improvements in the trunk. > > The following tickets are inherently connected > > http://projects.scipy.org/scipy/scipy/ticket/632 > http://projects.scipy.org/scipy/scipy/ticket/630 > http://projects.scipy.org/scipy/scipy/ticket/456 > > Cheers, > Nils > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://projects.scipy.org/mailman/listinfo/scipy-dev > Sorry if I'm catching this conversation a bit late and sorry if this has been mentioned before. I'm wondering whether dense linear algebra users would not be better served if all LAPACK and BLAS-related interfaces were in Numpy instead of Scipy. After all, it is Numpy that defines the array and dense matrix types. Why not make sure that Numpy provides a complete and consistent interface to all BLAS 1, 2, 3 and LAPACK functions? I mention this for several reasons: 1. (as mentioned earlier), there are inconsistencies between the Numpy and Scipy interfaces to LAPACK. It seems to me that a Numpy user is also a good candidate to be a dense linear algebra user. 
2. In my experience, Scipy is substantially more complex to install than Numpy. As a user, if I can have access to all my dense linear algebra in Numpy, it can save me time and trouble, especially if I don't need the more advanced packages that Scipy offers. 3. As a wild guess, this might also avoid work duplication. I see that other packages define new dense matrix types and BLAS and LAPACK interfaces (such as cvxopt). Might it be because they don't want to require from their users that they install Scipy? Also, is there a profound reason to have a trimmed down version of LAPACK only in Numpy? Thanks for any comments that might help me understand all this. -- Dominique From robert.kern at gmail.com Tue Oct 28 17:16:26 2008 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 28 Oct 2008 16:16:26 -0500 Subject: [SciPy-dev] eig, eigh, and symeig in scipy In-Reply-To: <8793ae6e0810281346w590a5c5dqd619ea205f9baad5@mail.gmail.com> References: <20081028173055.GA15367@localhost> <3d375d730810281043q4b3ce5cft5f0c2b4337054850@mail.gmail.com> <3d375d730810281103u15f5feceuc05b5b37b066ee04@mail.gmail.com> <8793ae6e0810281346w590a5c5dqd619ea205f9baad5@mail.gmail.com> Message-ID: <3d375d730810281416x65d57fuda4a987c77ea56fd@mail.gmail.com> On Tue, Oct 28, 2008 at 15:46, Dominique Orban wrote: > 2. In my experience, Scipy is substantially more complex to install > than Numpy. As a user, if I can have access to all my dense linear > algebra in Numpy, it can save me time and trouble, especially if I > don't need the more advanced packages that Scipy offers. Needing a complete BLAS/LAPACK is one of the reasons scipy is more difficult to install. Moving it all to numpy doesn't solve this problem. > 3. As a wild guess, this might also avoid work duplication. I see that > other packages define new dense matrix types and BLAS and LAPACK > interfaces (such as cvxopt). Might it be because they don't want to > require from their users that they install Scipy? cvxopt doesn't even use *numpy*. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From dominique.orban at gmail.com Tue Oct 28 21:53:38 2008 From: dominique.orban at gmail.com (Dominique Orban) Date: Tue, 28 Oct 2008 21:53:38 -0400 Subject: [SciPy-dev] eig, eigh, and symeig in scipy In-Reply-To: <3d375d730810281416x65d57fuda4a987c77ea56fd@mail.gmail.com> References: <20081028173055.GA15367@localhost> <3d375d730810281043q4b3ce5cft5f0c2b4337054850@mail.gmail.com> <3d375d730810281103u15f5feceuc05b5b37b066ee04@mail.gmail.com> <8793ae6e0810281346w590a5c5dqd619ea205f9baad5@mail.gmail.com> <3d375d730810281416x65d57fuda4a987c77ea56fd@mail.gmail.com> Message-ID: <8793ae6e0810281853h1a8c0e15qd24d851c48e07164@mail.gmail.com> On Tue, Oct 28, 2008 at 5:16 PM, Robert Kern wrote: > On Tue, Oct 28, 2008 at 15:46, Dominique Orban > wrote: > >> 2. In my experience, Scipy is substantially more complex to install >> than Numpy. As a user, if I can have access to all my dense linear >> algebra in Numpy, it can save me time and trouble, especially if I >> don't need the more advanced packages that Scipy offers. > > Needing a complete BLAS/LAPACK is one of the reasons scipy is more > difficult to install. Moving it all to numpy doesn't solve this > problem. >> 3. As a wild guess, this might also avoid work duplication. 
I see that >> other packages define new dense matrix types and BLAS and LAPACK >> interfaces (such as cvxopt). Might it be because they don't want to >> require from their users that they install Scipy? > > cvxopt doesn't even use *numpy*. I understand that. cvxopt redefines everything and then ensures compatibility with numpy arrays. I'm not entirely sure how complete their interface to BLAS/LAPACK is, but it is a breeze to install. Dominique From josef.pktd at gmail.com Wed Oct 29 00:49:03 2008 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 29 Oct 2008 00:49:03 -0400 Subject: [SciPy-dev] Recipe: distribution of non-linear transformation of continuous variables Message-ID: <1cd32cbb0810282149y2893d90asca5dac38057c417b@mail.gmail.com> I wanted to check out the subclassing of scipy.stats.distributions, and wrote a class that provides the distribution of a non-linear monotonic transformation of a continuous random variable. The transformation can be increasing or decreasing. Code and examples should be easily readable. At the bottom is (most of) the printout when I run this script. It works pretty well for "well-behaved" transformations. I hope Google does not screw up the formatting too much, and I hope somebody finds this useful. Josef ----------------------------- begin file nonlinear_transform_gen.py ---------------------------------- ''' A class for the distribution of a non-linear monotonic transformation of a continuous random variable simplest usage: example: create log-gamma distribution, i.e. y = log(x), where x is gamma distributed (also available in scipy.stats) loggammaexpg = Transf_gen(stats.gamma, np.log, np.exp) example: what is the distribution of the discount factor y=1/(1+x) where interest rate x is normally distributed with N(mux,stdx**2)')? (just to come up with a story that implies a nice transformation) invnormalg = Transf_gen(stats.norm, inversew, inversew_inv, decr=True, a=-np.inf) This class does not work well for distributions with difficult shapes, e.g. 1/x where x is standard normal, because of the singularity and jump at zero. Note: I'm working from my version of scipy.stats.distribution. 
But this script runs under scipy 0.6.0 (checked with numpy: 1.2.0rc2 and python 2.4) ''' from scipy import integrate # for scipy 0.6.0 from scipy import stats, info from scipy.stats import distributions import numpy as np class Transf_gen(distributions.rv_continuous): #a class for non-linear monotonic transformation of a continuous random variable def __init__(self, kls, func, funcinv, *args, **kwargs): #print args #print kwargs self.kls = kls self.func = func self.funcinv = funcinv #explicit for self.__dict__.update(kwargs) if 'numargs' in kwargs: self.numargs = kwargs['numargs'] else: self.numargs = 1 if 'name' in kwargs: name = kwargs['name'] else: name = 'transfdist' if 'longname' in kwargs: longname = kwargs['longname'] else: longname = 'Non-linear transformed distribution' if 'extradoc' in kwargs: extradoc = kwargs['extradoc'] else: extradoc = None if 'a' in kwargs: a = kwargs['a'] else: a = 0 if 'decr' in kwargs: #defines whether it is a decreasing (True) # or increasing (False) monotone transformation self.decr = kwargs['decr'] else: self.decr = False super(Transf_gen,self).__init__(a=a, name = name, longname = longname, extradoc = extradoc) def _cdf(self,x,*args, **kwargs): #print args if not self.decr: return self.kls._cdf(self.funcinv(x),*args, **kwargs) else: return 1-self.kls.cdf(self.funcinv(x),*args, **kwargs) def _ppf(self, q, *args, **kwargs): if not self.decr: return self.func(self.kls._ppf(q,*args, **kwargs)) else: return self.func(self.kls._ppf(1-q,*args, **kwargs)) def inverse(x): return np.divide(1.0,x) mux, stdx = 0.05, 0.1 def inversew(x): return 1.0/(1+mux+x*stdx) def inversew_inv(x): return (1.0/x - 1.0 - mux)/stdx #.np.divide(1.0,x)-10 def identit(x): return x print '\nResults for invnormal with regularization y=1/(1+x), x is N(0.05, 0.1*0.1)' print '-----------------------------------------------------------------------' invnormalg = Transf_gen(stats.norm, inversew, inversew_inv, decr=True, a=-np.inf, name = 'discf', longname = 'normal-based discount factor', extradoc = '\ndistribution of discount factor y=1/(1+x)) with x N(0.05, 0.1**2)') l,s = 0,1 print print invnormalg.__doc__ print print 'cdf for [0.95,1.0,1.1]:', invnormalg.cdf([0.95,1.0,1.1],loc=l, scale=s) print 'pdf for [0.95,1.0,1.1]:', invnormalg.pdf([0.95,1.0,1.1],loc=l, scale=s) print 'rvs:', invnormalg.rvs(loc=l, scale=s,size=5) print 'stats: ', invnormalg.stats(loc=l, scale=s) print 'stats kurtosis, skew: ', invnormalg.stats(moments='ks') print 'median:', invnormalg.ppf(0.5) rvs = invnormalg.rvs(loc=l, scale=s,size=10000) print 'sample stats: ', rvs.mean(), rvs.var() rvs = inversew(stats.norm.rvs(loc=l, scale=s,size=10000)) print 'std norm sample stats: ', rvs.mean(), rvs.var() rvs = inversew(stats.norm.rvs(size=100000))*s + l print 'transf. 
norm sample stats: ', rvs.mean(), rvs.var() print print 'Results for lognormal' print '---------------------' lognormalg = Transf_gen(stats.norm, np.exp, np.log, a=0, name = 'lnnorm', longname = 'Exp transformed normal', extradoc = '\ndistribution of y = exp(x), with x standard normal') print 'cdf for [2.0,2.5,3.0,3.5]: ',lognormalg.cdf([2.0,2.5,3.0,3.5]) print 'scipy cdf for [2.0,2.5,3.0,3.5]:', stats.lognorm.cdf([2.0,2.5,3.0,3.5],1) print 'pdf for [2.0,2.5,3.0,3.5]: ', lognormalg.pdf([2.0,2.5,3.0,3.5]) print 'scipy pdf for [2.0,2.5,3.0,3.5]:', stats.lognorm.pdf([2.0,2.5,3.0,3.5],1) print 'stats: ', lognormalg.stats() print 'scipy stats:', stats.lognorm.stats(1) print 'rvs:', lognormalg.rvs(size=5) print info(lognormalg) print '\nResults for idnormal' print '--------------------' idnormalg = Transf_gen(stats.norm, identit, identit, a=-np.inf, name = 'normal') print idnormalg.cdf(1,loc=100,scale=10) print stats.norm.cdf(1,loc=100,scale=10) print idnormalg.stats(loc=100,scale=10) print stats.norm.stats(loc=100,scale=10) print idnormalg.rvs(loc=100,scale=10,size=5) rvs = idnormalg.rvs(loc=100,scale=10,size=10000) print rvs.mean(), rvs.var() print '\nResults for expgamma' print '--------------------' loggammaexpg = Transf_gen(stats.gamma, np.log, np.exp) print loggammaexpg._cdf(1,10) print stats.loggamma.cdf(1,10) print loggammaexpg._cdf(2,15) print stats.loggamma.cdf(2,15) print 'cdf for [2.0,2.5,3.0,3.5]: ', loggammaexpg._cdf([2.0,2.5,3.0,3.5],10) print 'scipy cdf for [2.0,2.5,3.0,3.5]:', stats.loggamma.cdf([2.0,2.5,3.0,3.5],10) #print 'pdf for [2.0,2.5,3.0,3.5]: ', loggammaexpg.pdf([2.0,2.5,3.0,3.5],10) # not in scipy 0.6.0 print loggammaexpg._pdf(2.0,10) # ok in scipy 0.6.0 print 'scipy pdf for [2.0,2.5,3.0,3.5]:', stats.loggamma.pdf([2.0,2.5,3.0,3.5],10) print '\n\n the rest is not so good\n\n' print '\nResults for invnormal' print '--------------------' invnormalg2 = Transf_gen(stats.norm, inverse, inverse, decr=True, a=-np.inf, longname = 'inverse normal') print invnormalg2.cdf(1,loc=10) print stats.invnorm.cdf(1,1,loc=10) print invnormalg2.stats(loc=10) print stats.invnorm.stats(1,loc=10) print invnormalg2.rvs(loc=10,size=5) print '\nResults for invexpon' print '--------------------' invexpong = Transf_gen(stats.expon, inverse, inverse, a=0.0, name = 'Log transformed normal general') print invexpong.cdf(1) #print stats.invnorm.cdf(1,1) print invexpong.stats() #print stats.invnorm.stats(1) print invexpong.rvs(size=5) print '\nResults for inv gamma' print '---------------------' invgammag = Transf_gen(stats.gamma, inverse, inverse, a=0.0, name = 'Log transformed normal general') print invgammag._cdf(1,5) # .cdf #not in scipy 0.6.0 print stats.invgamma.cdf(1,1) #print invgammag.stats(1) #not in scipy 0.6.0 print stats.invgamma.stats(1) #print invgammag.rvs(1,size=5) #not in scipy 0.6.0 print invgammag._rvs(1) print '\nResults for loglaplace' print '----------------------' #no idea what the relationship is loglaplaceg = Transf_gen(stats.laplace, np.log, np.exp) print loglaplaceg._cdf(2) print stats.loglaplace.cdf(2,10) loglaplaceexpg = Transf_gen(stats.laplace, np.exp, np.log) print loglaplaceexpg._cdf(2) #this are the results, that I get: results = \ ''' Results for invnormal with regularization y=1/(1+x), x is N(0, 0.1*0.1) ----------------------------------------------------------------------- normal-based discount factor continuous random variable. Continuous random variables are defined from a standard form chosen for simplicity of representation. 
The standard form may require some shape parameters to complete its specification. The distributions also take optional location and scale parameters using loc= and scale= keywords (defaults: loc=0, scale=1) These shape, scale, and location parameters can be passed to any of the methods of the RV object such as the following: discf.rvs(loc=0,scale=1) - random variates discf.pdf(x,loc=0,scale=1) - probability density function discf.cdf(x,loc=0,scale=1) - cumulative density function discf.sf(x,loc=0,scale=1) - survival function (1-cdf --- sometimes more accurate) discf.ppf(q,loc=0,scale=1) - percent point function (inverse of cdf --- percentiles) discf.isf(q,loc=0,scale=1) - inverse survival function (inverse of sf) discf.stats(loc=0,scale=1,moments='mv') - mean('m',axis=0), variance('v'), skew('s'), and/or kurtosis('k') discf.entropy(loc=0,scale=1) - (differential) entropy of the RV. Alternatively, the object may be called (as a function) to fix the shape, location, and scale parameters returning a "frozen" continuous RV object: myrv = discf(loc=0,scale=1) - frozen RV object with the same methods but holding the given shape, location, and scale fixed distribution of discount factor y=1/(1+x)) with x N(0,0.1**2) cdf for [0.95,1.0,1.1]: [ 0.48950273 0.69146246 0.92059586] pdf for [0.95,1.0,1.1]: [ 4.41888273 3.52065327 1.22171744] rvs: [ 0.83915166 1.05916973 0.92895054 0.97653308 0.88997135] stats: (array(0.9612657841613822), array(0.0088754849141799985)) stats kurtosis, skew: (array(0.61482268025485831), array(0.78380821248995103)) median: 0.952380952381 sample stats: 0.962079869848 0.00884562090518 std norm sample stats: 0.961089776974 0.00874620747261 transf. norm sample stats: 0.961224369048 0.00883332533467 Results for lognormal --------------------- cdf for [2.0,2.5,3.0,3.5]: [ 0.7558914 0.82024279 0.86403139 0.89485401] scipy cdf for [2.0,2.5,3.0,3.5]: [ 0.7558914 0.82024279 0.86403139 0.89485401] pdf for [2.0,2.5,3.0,3.5]: [ 0.15687402 0.10487107 0.07272826 0.05200533] scipy pdf for [2.0,2.5,3.0,3.5]: [ 0.15687402 0.10487107 0.07272826 0.05200533] stats: (array(1.6487212707089971), array(4.6707742108633177)) scipy stats: (array(1.6487212707001282), array(4.670774270471604)) rvs: [ 0.45054343 0.17302502 0.15131355 0.74181241 0.30077315] Exp transformed normal continuous random variable. Continuous random variables are defined from a standard form chosen for simplicity of representation. The standard form may require some shape parameters to complete its specification. The distributions also take optional location and scale parameters using loc= and scale= keywords (defaults: loc=0, scale=1) These shape, scale, and location parameters can be passed to any of the methods of the RV object such as the following: lnnorm.rvs(loc=0,scale=1) - random variates lnnorm.pdf(x,loc=0,scale=1) - probability density function lnnorm.cdf(x,loc=0,scale=1) - cumulative density function lnnorm.sf(x,loc=0,scale=1) - survival function (1-cdf --- sometimes more accurate) lnnorm.ppf(q,loc=0,scale=1) - percent point function (inverse of cdf --- percentiles) lnnorm.isf(q,loc=0,scale=1) - inverse survival function (inverse of sf) lnnorm.stats(loc=0,scale=1,moments='mv') - mean('m',axis=0), variance('v'), skew('s'), and/or kurtosis('k') lnnorm.entropy(loc=0,scale=1) - (differential) entropy of the RV. 
Alternatively, the object may be called (as a function) to fix the shape, location, and scale parameters returning a "frozen" continuous RV object: myrv = lnnorm(loc=0,scale=1) - frozen RV object with the same methods but holding the given shape, location, and scale fixed distribution of y = exp(x), with x standard normal None Results for idnormal -------------------- 2.08137521949e-023 2.08137521949e-023 (array(100.0), array(99.999999999476046)) (array(100.0), array(100.0)) [ 108.95135206 105.97192526 96.56950463 103.55316046 95.69133148] 99.9529496989 102.460238876 Results for expgamma -------------------- 0.00052773949978 0.00052773949978 0.00906529986325 0.00906529986325 cdf for [2.0,2.5,3.0,3.5]: [ 0.21103986 0.77318778 0.99524757 0.99999926] scipy cdf for [2.0,2.5,3.0,3.5]: [ 0.21103986 0.77318778 0.99524757 0.99999926] pdf for [2.0,2.5,3.0,3.5]: [ 8.26228773e-01 1.01580211e+00 5.57228823e-02 1.81420198e-05] scipy pdf for [2.0,2.5,3.0,3.5]: [ 8.26228773e-01 1.01580211e+00 5.57228823e-02 1.81420259e-05] the rest is not so good ''' From david at ar.media.kyoto-u.ac.jp Wed Oct 29 02:55:43 2008 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Wed, 29 Oct 2008 15:55:43 +0900 Subject: [SciPy-dev] Renaming fftpack to fft, removing backends In-Reply-To: <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> References: <4906D211.6070505@ar.media.kyoto-u.ac.jp> <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> Message-ID: <490808EF.1030305@ar.media.kyoto-u.ac.jp> Robert Kern wrote: > > This needs to be handled carefully. scipy.fft() already exists. This scipy.fft is numpy one, right ? I thought those numpy import were deprecated. If not, then it becomes much less clear to me to do the change as suggested, indeed. > It > also breaks long-standing code (after the deprecation) for no reason > that I find compelling. The name has been there for many years. The > only complaint about it that I have ever seen on the lists was that > people thought that only FFTPACK was supported. This is only anecdotal, but I thought fftpack was confusing when I started using scipy. I did not even know what fftpack was. > Given the following > proposal, that reason goes away. Furthermore, I sometimes have trouble > remembering whether the 'fft' symbol I have imported was numpy.fft or > numpy.fft.fft. This change compounds that confusion. Hm, I have the same problem, and that's why my goal for the renaming was to be the same as numpy: numpy.fft and scipy.fft both are modules, and numpy.fft.fft and scipy.fft.fft are fft functions. So this intended, although I can see why this can be seen as making it worse instead of better, specially with scipy.fft already existing (which I did not know existed, or at least forgot about). > > Overall, I don't think that mere renaming is ever a good reason to > break code. It just feels gratuitous. Yes, there might be some benefit > to having clearer names than what we have lived with so far (although > in this case, I think this is dubious for the reasons given above), > but there is always an enormous downside to breaking code. The > deprecation policy does nothing to reduce that downside. It was never > intended to reduce that downside. Right now, I'm almost sorry for > having introduced it because it appears to be giving the impression > that it is okay to make these kinds of changes. 
I don't agree that the deprecation policy does nothing to alleviate the
breaking-code problem: if something breaks, it is much better to know it
through a deprecation warning than not to know it and get silent
breakage. I wish there had been a deprecation warning when the axis
default changed in numpy two years ago, for example.

I understand the point that introducing policies somewhat makes it
easier to break the API, though. I don't want to start this discussion
here, because that's a bit off-topic, but I think we need to discuss
those policies (for scipy - I agree we should be really conservative
for numpy) to reach an agreement at some point.

cheers,

David

From david at ar.media.kyoto-u.ac.jp Wed Oct 29 02:59:47 2008 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Wed, 29 Oct 2008 15:59:47 +0900 Subject: [SciPy-dev] Renaming fftpack to fft, removing backends In-Reply-To: References: <4906D211.6070505@ar.media.kyoto-u.ac.jp> Message-ID: <490809E3.2090302@ar.media.kyoto-u.ac.jp>

Charles R Harris wrote:
> +1. The fftpack is a pretty good (if old) implementation whose main
> failing is not taking maximum advantage of cache. It does need some
> supplementation, a Bluestein implementation for instance, but that is
> easier to do with a single target.

Bluestein would be useful even for numpy. The cache thing is handled
better by fftw2 and fftw3, but the advantage is largely outweighed by
the fact that we need to copy the input into a temporary buffer to
guarantee SIMD alignment. In a lot of signal processing tasks, float32
FFT is enough, and that would give a pretty good speedup, too.

cheers,

David

From stefan at sun.ac.za Wed Oct 29 03:43:17 2008 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 29 Oct 2008 09:43:17 +0200 Subject: [SciPy-dev] Recipe: distribution of non-linear transformation of continuous variables In-Reply-To: <1cd32cbb0810282149y2893d90asca5dac38057c417b@mail.gmail.com> References: <1cd32cbb0810282149y2893d90asca5dac38057c417b@mail.gmail.com> Message-ID: <9457e7c80810290043hbbead4bm4389457b2d0cfd9c@mail.gmail.com>

Hi Josef

2008/10/29 :
> I hope Google does not screw up the formatting too much, and I hope
> somebody finds this useful.

Would you please attach the code to a message, instead of including it
inline? That makes it a lot easier to save it to file.

Thanks, this looks very interesting!

Stéfan

From josef.pktd at gmail.com Wed Oct 29 08:42:59 2008 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 29 Oct 2008 08:42:59 -0400 Subject: [SciPy-dev] Recipe: distribution of non-linear transformation of continuous variables In-Reply-To: <9457e7c80810290043hbbead4bm4389457b2d0cfd9c@mail.gmail.com> References: <1cd32cbb0810282149y2893d90asca5dac38057c417b@mail.gmail.com> <9457e7c80810290043hbbead4bm4389457b2d0cfd9c@mail.gmail.com> Message-ID: <1cd32cbb0810290542y46652eb0xa96293d3d3f2b81@mail.gmail.com>

Here is the script file as an attachment.

Josef

On Wed, Oct 29, 2008 at 3:43 AM, Stéfan van der Walt wrote:
> Hi Josef
>
> 2008/10/29 :
>> I hope Google does not screw up the formatting too much, and I hope
>> somebody finds this useful.
>
> Would you please attach the code to a message, instead of including it
> inline? That makes it a lot easier to save it to file.
>
> Thanks, this looks very interesting!
> > St?fan > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://projects.scipy.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: nonlinear_transform_gen.py URL: From vanforeest at gmail.com Wed Oct 29 09:53:51 2008 From: vanforeest at gmail.com (nicky van foreest) Date: Wed, 29 Oct 2008 14:53:51 +0100 Subject: [SciPy-dev] Of use and abuse of sphinx's math directive In-Reply-To: <9457e7c80810280233j618c95fcn997505098be63d23@mail.gmail.com> References: <20081028081807.GB11820@phare.normalesup.org> <9457e7c80810280233j618c95fcn997505098be63d23@mail.gmail.com> Message-ID: Hi, Suppose for later uses you want to export the documentation to some format (e.g. latex, mediawiki (?) ) in which you like to make the difference explicit between n and $n$. This would entail a lot of searching. As an anecdote, Knuth used to write 1 rather than $1$. When he developped the Euler font with Zapf, it turned out that 1 had a different appearance than $1$. Knuth is Knuth, so he changed in all 1, 2, \ldots in one of his books that used the Euler font to $1$, $2$, etc. bye Nicky 2008/10/28 St?fan van der Walt : > 2008/10/28 Gael Varoquaux : >> Sphinx is great, will all love Sphinx. But I do wonder if sometimes we do >> not use the math directive a bit too much, eg the "n" and "n-1" on: >> http://scipy.org/scipy/scipy/browser/trunk/scipy/cluster/hierarchy.py#L1090 >> (this is just an example that I picked randomly). >> >> When the formula written is simple-enought that its readibility is not >> too much affected by having it in text in the Sphinxified document, I >> think it is a pitty to render the plain text docstrings hard-to-read. > > Certainly -- the idea is to use the :math: tag sparingly. > > Cheers > St?fan > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://projects.scipy.org/mailman/listinfo/scipy-dev > From cournape at gmail.com Wed Oct 29 10:10:48 2008 From: cournape at gmail.com (David Cournapeau) Date: Wed, 29 Oct 2008 23:10:48 +0900 Subject: [SciPy-dev] Of use and abuse of sphinx's math directive In-Reply-To: References: <20081028081807.GB11820@phare.normalesup.org> <9457e7c80810280233j618c95fcn997505098be63d23@mail.gmail.com> Message-ID: <5b8d13220810290710j69a68df7qedb598b97b9e0ee3@mail.gmail.com> On Wed, Oct 29, 2008 at 10:53 PM, nicky van foreest wrote: > Hi, > > Suppose for later uses you want to export the documentation to some > format (e.g. latex, mediawiki (?) ) in which you like to make the > difference explicit between n and $n$. This would entail a lot of > searching. As an anecdote, Knuth used to write 1 rather than $1$. When > he developped the Euler font with Zapf, it turned out that 1 had a > different appearance than $1$. Knuth is Knuth, so he changed in all 1, > 2, \ldots in one of his books that used the Euler font to $1$, $2$, > etc. I am not sure the comparison is accurate: docstrings are not meant to be edition-ready. They are meant to be read from the interpreter mainly; for help meant to be put in a general documentation, I think separate doc is better, and there you can go crazy about latex and co. David From millman at berkeley.edu Wed Oct 29 10:52:22 2008 From: millman at berkeley.edu (Jarrod Millman) Date: Wed, 29 Oct 2008 07:52:22 -0700 Subject: [SciPy-dev] ANN: NumPy 1.2.1 Message-ID: I'm pleased to announce the release of NumPy 1.2.1. 
NumPy is the fundamental package needed for scientific computing with Python. It contains: * a powerful N-dimensional array object * sophisticated (broadcasting) functions * basic linear algebra functions * basic Fourier transforms * sophisticated random number capabilities * tools for integrating Fortran code. Besides it's obvious scientific uses, NumPy can also be used as an efficient multi-dimensional container of generic data. Arbitrary data-types can be defined. This allows NumPy to seamlessly and speedily integrate with a wide-variety of databases. This bugfix release comes almost one month after the 1.2.0 release. Please note that NumPy 1.2.1 requires Python 2.4 or greater. For information, please see the release notes: https://sourceforge.net/project/shownotes.php?release_id=636728&group_id=1369 You can download the release from here: https://sourceforge.net/project/showfiles.php?group_id=1369 Thank you to everybody who contributed to this release. Enjoy, -- Jarrod Millman Computational Infrastructure for Research Labs 10 Giannini Hall, UC Berkeley phone: 510.643.4014 http://cirl.berkeley.edu/ From david.huard at gmail.com Wed Oct 29 12:41:08 2008 From: david.huard at gmail.com (David Huard) Date: Wed, 29 Oct 2008 12:41:08 -0400 Subject: [SciPy-dev] Timeseries Unusual Behaviour In-Reply-To: References: <200810221452.20605.pgmdevlist@gmail.com> Message-ID: <91cf711d0810290941i5d15ca75wfb3b21de584cbdad@mail.gmail.com> Pierre, A similar problem occurs with fromrecords. Dates are ordered but not the data that is passed. Regards, David On Thu, Oct 23, 2008 at 9:12 AM, Dave wrote: > Pierre GM gmail.com> writes: > >> >> Dave, >> The unusal behavior you documented (trying to create 2 series with the same >> values and same DateArray as dates giving different answers depending on the >> order) should now be fixed on SVN. >> Thanks again for reporting. >> > > Thanks for the explanation and the fix. I've updated from svn and noticed the > autosort parameter which does make its behaviour more explicit :) > > Cheers, > Dave > > > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://projects.scipy.org/mailman/listinfo/scipy-dev > From pgmdevlist at gmail.com Wed Oct 29 12:47:31 2008 From: pgmdevlist at gmail.com (Pierre GM) Date: Wed, 29 Oct 2008 12:47:31 -0400 Subject: [SciPy-dev] Timeseries Unusual Behaviour In-Reply-To: <91cf711d0810290941i5d15ca75wfb3b21de584cbdad@mail.gmail.com> References: <91cf711d0810290941i5d15ca75wfb3b21de584cbdad@mail.gmail.com> Message-ID: <200810291247.32068.pgmdevlist@gmail.com> David, Thanks for reporting. Would you mind sending me a self-contained example ? That would really make things easier for debugging/testing. Cheers. P. On Wednesday 29 October 2008 12:41:08 David Huard wrote: > Pierre, > > A similar problem occurs with fromrecords. Dates are ordered but not > the data that is passed. > > > Regards, > > David From jh at physics.ucf.edu Wed Oct 29 12:58:38 2008 From: jh at physics.ucf.edu (jh at physics.ucf.edu) Date: Wed, 29 Oct 2008 12:58:38 -0400 Subject: [SciPy-dev] Of use and abuse of sphinx's math directive In-Reply-To: (scipy-dev-request@scipy.org) References: Message-ID: 2008/10/28 St?fan van der Walt : > 2008/10/28 Gael Varoquaux : >> Sphinx is great, will all love Sphinx. 
But I do wonder if sometimes we do >> not use the math directive a bit too much, eg the "n" and "n-1" on: >> http://scipy.org/scipy/scipy/browser/trunk/scipy/cluster/hierarchy.py#L1090 >> (this is just an example that I picked randomly). >> >> When the formula written is simple-enought that its readibility is not >> too much affected by having it in text in the Sphinxified document, I >> think it is a pitty to render the plain text docstrings hard-to-read. > > Certainly -- the idea is to use the :math: tag sparingly. This is a classic case of too many constraints. Not only is it right to do one thing in plain text and another in formatted print, but I think the preferred solution will change from leave it alone (today) to flag it (future). The use of the plain text doc interface is diminishing. It will always have some uses, such as on a text-only terminal or while editing a function in an ap like emacs (i.e., looking at the function's own docstring while editing the code). But, people are going to get used to doc interfaces more useful than help(), and those will soon be able (optionally) to tell your browser to pull up a given page rather than feeding you the help in the terminal. So my suggestion is that we keep the text clean for now, but we flag such pages in a comment so that they're easy to find when we do want to go through the pages and switch to tagging everything. How about a checkbox for "This page contains unflagged, inline math" on the doc wiki. That could generate a comment in the source code but not in the docstring. It would then be easy to identify the pages to fix if/when we decide to do so. Finding the locations in the page to do that isn't hard and we're not talking the majority of pages. --jh-- From jh at physics.ucf.edu Wed Oct 29 13:23:51 2008 From: jh at physics.ucf.edu (jh at physics.ucf.edu) Date: Wed, 29 Oct 2008 13:23:51 -0400 Subject: [SciPy-dev] Of use and abuse of sphinx's math directive In-Reply-To: (scipy-dev-request@scipy.org) References: Message-ID: "David Cournapeau" : > I am not sure the comparison is accurate: docstrings are not meant to > be edition-ready. They are meant to be read from the interpreter > mainly; for help meant to be put in a general documentation, I think > separate doc is better, and there you can go crazy about latex and > co. At the start of the doc marathon, we considered having an ueberdoc that produced the text docstrings and the PDF and HTML manuals, but decided that developers would be most likely to update docs if they were the docstrings and not some separate set of files. We knew this would result in some formatting imperfections, but decided to favor content over form. We have one doc source, the docstrings. Even the general-interest docs (slicing, glossary, etc.) are docstrings. E unum pluribus, I guess. --jh-- From stefan at sun.ac.za Wed Oct 29 13:35:00 2008 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 29 Oct 2008 19:35:00 +0200 Subject: [SciPy-dev] Of use and abuse of sphinx's math directive In-Reply-To: References: Message-ID: <9457e7c80810291035v7126217cq6b11ffba5f875ca4@mail.gmail.com> 2008/10/29 : > E unum pluribus, I guess. With its corollary "falsus in uno, falsus in omnibus"! :) St?fan From ndbecker2 at gmail.com Thu Oct 30 19:46:27 2008 From: ndbecker2 at gmail.com (Neal Becker) Date: Thu, 30 Oct 2008 19:46:27 -0400 Subject: [SciPy-dev] scipy.org is broken Message-ID: Attempting to visit scipy.org gives me errors from moinmoin. 
Interestingly, I had just finished fixing my own moinmoin site that was
IIRC giving the same errors.

From robert.kern at gmail.com Thu Oct 30 20:04:07 2008 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 30 Oct 2008 19:04:07 -0500 Subject: [SciPy-dev] scipy.org is broken In-Reply-To: References: Message-ID: <3d375d730810301704i1af61490j50ed78a09406675d@mail.gmail.com>

On Thu, Oct 30, 2008 at 18:46, Neal Becker wrote:
> Attempting to visit scipy.org gives me errors from moinmoin.
> Interestingly, I had just finished fixing my own moinmoin site that
> was IIRC giving the same errors.

Thanks. Working on it. No ETA, yet.

-- Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco

From robert.kern at gmail.com Thu Oct 30 20:31:39 2008 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 30 Oct 2008 19:31:39 -0500 Subject: [SciPy-dev] scipy.org is broken In-Reply-To: References: Message-ID: <3d375d730810301731u757ff41bt251f5f0e76d4e2ba@mail.gmail.com>

On Thu, Oct 30, 2008 at 18:46, Neal Becker wrote:
> Attempting to visit scipy.org gives me errors from moinmoin.
> Interestingly, I had just finished fixing my own moinmoin site that
> was IIRC giving the same errors.

Fixed. Thank you for spotting this.

-- Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco

From david.huard at gmail.com Fri Oct 31 09:53:50 2008 From: david.huard at gmail.com (David Huard) Date: Fri, 31 Oct 2008 09:53:50 -0400 Subject: [SciPy-dev] Timeseries Unusual Behaviour In-Reply-To: <200810291247.32068.pgmdevlist@gmail.com> References: <91cf711d0810290941i5d15ca75wfb3b21de584cbdad@mail.gmail.com> <200810291247.32068.pgmdevlist@gmail.com> Message-ID: <91cf711d0810310653s62924955w1f101640f02d1c06@mail.gmail.com>

On Wed, Oct 29, 2008 at 12:47 PM, Pierre GM wrote:
> David,
> Thanks for reporting. Would you mind sending me a self-contained example ?
> That would really make things easier for debugging/testing.
> Cheers.
> P.

Pierre, here you are:

"""Test for recordarrays ordering at instantiation."""

import numpy as np
import scikits.timeseries as TS
from numpy.testing import *

# Arrays a and b are ordered in descending order, and the dates are in
# the inverse chronological order.
# Once the dates are ordered in chronological order, we expect the
# arrays a and b to be in ascending order.
a = np.array([4,3,2,1])
b = np.array([14,13,12,11])

date_str = ['2000-01-01 04:00', '2000-01-01 03:00',
            '2000-01-01 02:00', '2000-01-01 01:00']
dates = [TS.Date('S', date) for date in date_str]

trec = TS.fromrecords(zip(a,b), dates, names=['a', 'b'])

assert_array_equal(np.diff(trec.dates), 3600)  # dates are in chronological order
assert_array_equal(np.diff(trec['a']), 1)      # This fails
assert_array_equal(np.diff(trec['b']), 1)      # This fails.

> On Wednesday 29 October 2008 12:41:08 David Huard wrote:
> > Pierre,
> >
> > A similar problem occurs with fromrecords. Dates are ordered but not
> > the data that is passed.
> >
> > Regards,
> >
> > David
> _______________________________________________
> Scipy-dev mailing list
> Scipy-dev at scipy.org
> http://projects.scipy.org/mailman/listinfo/scipy-dev

-------------- next part -------------- An HTML attachment was scrubbed...
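Until fromrecords also reorders the data when it sorts the dates, a
defensive workaround is to sort everything by date up front. A sketch
against the test above (it assumes scikits.timeseries Date objects
expose their integer value as .value; sorted_fromrecords is a
hypothetical helper, not library code):

import numpy as np

def sorted_fromrecords(columns, dates, **kwargs):
    # Hypothetical helper: put the data columns and the dates into
    # chronological order together, so fromrecords never has to reorder.
    order = np.argsort([d.value for d in dates])
    cols = [np.asarray(c)[order] for c in columns]
    sorted_dates = [dates[i] for i in order]
    return TS.fromrecords(zip(*cols), sorted_dates, **kwargs)

trec = sorted_fromrecords([a, b], dates, names=['a', 'b'])
# now np.diff(trec['a']) == 1 holds along with the date ordering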
From stefan at sun.ac.za Fri Oct 31 12:23:00 2008 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Fri, 31 Oct 2008 18:23:00 +0200 Subject: [SciPy-dev] Renaming fftpack to fft, removing backends In-Reply-To: <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> References: <4906D211.6070505@ar.media.kyoto-u.ac.jp> <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> Message-ID: <9457e7c80810310923m60aa9008pfef3f6a0ed70c45@mail.gmail.com> 2008/10/28 Robert Kern : >> How: For 0.7, we could rename it while keeping scipy.fftpack as an empty >> placeholder (like scipy.linsolve), and deprecating scipy.fftpack. > > This needs to be handled carefully. scipy.fft() already exists. It scipy.fft() just contributes to SciPy root pollution. Unlike with NumPy, we still have the opportunity to organise SciPy well, and I don't think fft() belongs at the top level. > also breaks long-standing code (after the deprecation) for no reason > that I find compelling. The name has been there for many years. The Improving the package structure is a good reason, in my opinion. > proposal, that reason goes away. Furthermore, I sometimes have trouble > remembering whether the 'fft' symbol I have imported was numpy.fft or > numpy.fft.fft. This change compounds that confusion. Maybe, but I still find it better than having numpy.almostanything -> scipy.almostanything but numpy.fft -> package while scipy.fft -> function It seems more natural to have module.fft -> package module.fft.fft -> function > Overall, I don't think that mere renaming is ever a good reason to > break code. It just feels gratuitous. Yes, there might be some benefit > to having clearer names than what we have lived with so far (although > in this case, I think this is dubious for the reasons given above), > but there is always an enormous downside to breaking code. The > deprecation policy does nothing to reduce that downside. It was never > intended to reduce that downside. Right now, I'm almost sorry for > having introduced it because it appears to be giving the impression > that it is okay to make these kinds of changes. We're not at 1.0 yet, so we need to allow for some leeway in adjusting the package to have the best possible layout. The sooner we sort these things out, the sooner we'll have an API worth freezing. Regards Stéfan
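The "empty placeholder" approach quoted above can be made concrete with a short sketch. This is hypothetical code, not the actual scipy source, and the warning text is invented; scipy.linsolve's real shim may differ in detail:

    # scipy/fftpack/__init__.py -- deprecated placeholder after the rename
    import warnings

    warnings.warn("scipy.fftpack is deprecated; use scipy.fft instead",
                  DeprecationWarning)

    # Re-export everything from the renamed package so that existing
    # imports keep working for a deprecation cycle.
    from scipy.fft import *

Old code such as "from scipy.fftpack import fft" would keep running but emit a DeprecationWarning, giving users at least one release cycle to migrate.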
From robert.kern at gmail.com Fri Oct 31 15:25:10 2008 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 31 Oct 2008 14:25:10 -0500 Subject: [SciPy-dev] Renaming fftpack to fft, removing backends In-Reply-To: <9457e7c80810310923m60aa9008pfef3f6a0ed70c45@mail.gmail.com> References: <4906D211.6070505@ar.media.kyoto-u.ac.jp> <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> <9457e7c80810310923m60aa9008pfef3f6a0ed70c45@mail.gmail.com> Message-ID: <3d375d730810311225w79ec9deey67d7e980eff798e6@mail.gmail.com> On Fri, Oct 31, 2008 at 11:23, Stéfan van der Walt wrote: > 2008/10/28 Robert Kern : >>> How: For 0.7, we could rename it while keeping scipy.fftpack as an empty >>> placeholder (like scipy.linsolve), and deprecating scipy.fftpack. >> >> This needs to be handled carefully. scipy.fft() already exists. It > > scipy.fft() just contributes to SciPy root pollution. Unlike with > NumPy, we still have the opportunity to organise SciPy well, and I > don't think fft() belongs at the top level. I agree that it doesn't belong there. Personally, I would like to remove everything from the top-level. However, I don't particularly agree that we have no constraints on reorganizing scipy. With numpy we made a clear statement that the API was going to change up until 1.0, at which point we made some promises of stability. Consequently, we felt comfortable changing the API before 1.0 without concern for breaking people's code. With scipy, that's not the case. 1.0 is not and has never been the arbitrary dividing line between instability and stability for this project. scipy is installed, people are using it, and we have an obligation not to break their code for minor tidying-up. >> also breaks long-standing code (after the deprecation) for no reason >> that I find compelling. The name has been there for many years. The > > Improving the package structure is a good reason, in my opinion. There's a difference between "structure" and "naming". >> proposal, that reason goes away. Furthermore, I sometimes have trouble >> remembering whether the 'fft' symbol I have imported was numpy.fft or >> numpy.fft.fft. This change compounds that confusion. > > Maybe, but I still find it better than having > > numpy.almostanything -> scipy.almostanything > > but > > numpy.fft -> package while scipy.fft -> function > > It seems more natural to have > > module.fft -> package > > module.fft.fft -> function > >> Overall, I don't think that mere renaming is ever a good reason to >> break code. It just feels gratuitous. Yes, there might be some benefit >> to having clearer names than what we have lived with so far (although >> in this case, I think this is dubious for the reasons given above), >> but there is always an enormous downside to breaking code. The >> deprecation policy does nothing to reduce that downside. It was never >> intended to reduce that downside. Right now, I'm almost sorry for >> having introduced it because it appears to be giving the impression >> that it is okay to make these kinds of changes. > > We're not at 1.0 yet, so we need to allow for some leeway in adjusting > the package to have the best possible layout. The sooner we sort > these things out, the sooner we'll have an API worth freezing. That has never been scipy's policy. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco
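The naming ambiguity Robert describes is easy to demonstrate at the interpreter with numpy's existing layout; this snippet is purely illustrative:

    >>> import numpy.fft            # here fft names a subpackage
    >>> from numpy.fft import fft   # now fft names a function shadowing the subpackage
    >>> numpy.fft.fft is fft
    True

After the second import, a reader has to scan the import block to know whether a bare "fft" in the code refers to the subpackage or to the function.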
From millman at berkeley.edu Fri Oct 31 15:50:30 2008 From: millman at berkeley.edu (Jarrod Millman) Date: Fri, 31 Oct 2008 12:50:30 -0700 Subject: [SciPy-dev] Renaming fftpack to fft, removing backends In-Reply-To: <3d375d730810311225w79ec9deey67d7e980eff798e6@mail.gmail.com> References: <4906D211.6070505@ar.media.kyoto-u.ac.jp> <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> <9457e7c80810310923m60aa9008pfef3f6a0ed70c45@mail.gmail.com> <3d375d730810311225w79ec9deey67d7e980eff798e6@mail.gmail.com> Message-ID: On Fri, Oct 31, 2008 at 12:25 PM, Robert Kern wrote: >> We're not at 1.0 yet, so we need to allow for some leeway in adjusting >> the package to have the best possible layout. The sooner we sort >> these things out, the sooner we'll have an API worth freezing. > > That has never been scipy's policy. It may not have been the 'official' policy, but I think that it is what a fair number of people including (it seems) Stefan, David, and myself believed was the policy. I think a large number of people view the 1.0 release of any project as a dividing line (arbitrary or not). Furthermore, the development status of scipy has been advertised as "Beta" for as long as I can remember: http://pypi.python.org/pypi/scipy Most importantly, scipy has the feel of an unfinished project. Documentation, naming, calling conventions, and many other aspects of it highlight its nature as a work in progress. We need to come to some agreement, I think, as to how to proceed with the API changes that we need to make and what the official policy for moving toward a 1.0 release should be. Maybe we need to change the release numbers to include some indication that the software is still rapidly evolving. Personally, I am fine with using the pre-1.0 numbering to signify this; but I am open to other ideas. At the very least, we need to decide whether we believe that we need to make significant changes to the API and how to best convey that this is the case to our users. Clearly, none of us want to ignore the needs of our users. But we need to balance the needs of current users to have relatively stable dependencies with the needs of future users to have highly polished, cleanly designed, and easy-to-use packages for scientific computing in Python. Without thinking too hard, I can quickly think of a number of changes that I would like to see prior to what I would consider a "1.0" release. For example, these top-level packages are good candidates, I believe, for renaming, removing, or reintroducing: - scipy.fftpack - scipy.linsolv - scipy.stsci - scipy.misc - scipy.lib - scipy.ga There are also a number of other API cleanups that I would like to see before a 1.0 release. Thoughts? Ideas? -- Jarrod Millman Computational Infrastructure for Research Labs 10 Giannini Hall, UC Berkeley phone: 510.643.4014 http://cirl.berkeley.edu/ From wnbell at gmail.com Fri Oct 31 16:14:06 2008 From: wnbell at gmail.com (Nathan Bell) Date: Fri, 31 Oct 2008 16:14:06 -0400 Subject: [SciPy-dev] Renaming fftpack to fft, removing backends In-Reply-To: References: <4906D211.6070505@ar.media.kyoto-u.ac.jp> <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> <9457e7c80810310923m60aa9008pfef3f6a0ed70c45@mail.gmail.com> <3d375d730810311225w79ec9deey67d7e980eff798e6@mail.gmail.com> Message-ID: On Fri, Oct 31, 2008 at 3:50 PM, Jarrod Millman wrote: > release. For example, these top-level packages are good candidates, I > believe, for renaming, removing, or reintroducing: > - scipy.linsolv[e] > FWIW scipy.linsolve is going to disappear after 0.7. That functionality now lives in scipy.sparse.linalg. -- Nathan Bell wnbell at gmail.com http://graphics.cs.uiuc.edu/~wnbell/ From robert.kern at gmail.com Fri Oct 31 16:40:06 2008 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 31 Oct 2008 15:40:06 -0500 Subject: [SciPy-dev] Renaming fftpack to fft, removing backends In-Reply-To: References: <4906D211.6070505@ar.media.kyoto-u.ac.jp> <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> <9457e7c80810310923m60aa9008pfef3f6a0ed70c45@mail.gmail.com> <3d375d730810311225w79ec9deey67d7e980eff798e6@mail.gmail.com> Message-ID: <3d375d730810311340s592a729aj1eb587e72d3b4f7@mail.gmail.com> On Fri, Oct 31, 2008 at 14:50, Jarrod Millman wrote: > On Fri, Oct 31, 2008 at 12:25 PM, Robert Kern wrote: >>> We're not at 1.0 yet, so we need to allow for some leeway in adjusting >>> the package to have the best possible layout. The sooner we sort >>> these things out, the sooner we'll have an API worth freezing. >> >> That has never been scipy's policy.
>
> It may not have been the 'official' policy, but I think that it is > what a fair number of people including (it seems) Stefan, David, and > myself believed was the policy. I think a large number of people view > the 1.0 release of any project as a dividing line (arbitrary or not). > Furthermore, the development status of scipy has been advertised as > "Beta" for as long as I can remember: > http://pypi.python.org/pypi/scipy > > Most importantly, scipy has the feel of an unfinished project. > Documentation, naming, calling conventions, and many other aspects of > it highlight its nature as a work in progress. We need to come to > some agreement, I think, as to how to proceed with the API changes that > we need to make and what the official policy for moving toward a 1.0 > release should be. Maybe we need to change the release numbers to > include some indication that the software is still rapidly evolving. > Personally, I am fine with using the pre-1.0 numbering to signify > this; but I am open to other ideas. If you want to do that, fine. But make it public and official soon. And when we get to 1.0, we need to keep our promises. > At the very least, we need to > decide whether we believe that we need to make significant changes to > the API and how to best convey that this is the case to our users. > Clearly, none of us want to ignore the needs of our users. But we > need to balance the needs of current users to have relatively stable > dependencies with the needs of future users to have highly polished, > cleanly designed, and easy-to-use packages for scientific computing in > Python. > > Without thinking too hard, I can quickly think of a number of changes > that I would like to see prior to what I would consider a "1.0" > release. For example, these top-level packages are good candidates, I > believe, for renaming, removing, or reintroducing: > - scipy.fftpack > - scipy.linsolv > - scipy.stsci > - scipy.misc > - scipy.lib > - scipy.ga > > There are also a number of other API cleanups that I would like to see > before a 1.0 release. > > Thoughts? Ideas? I am less opposed to substantive restructurings. They usually provide a concrete, technical benefit to counter the concrete, technical costs of having to change code. Simple renamings don't. Their benefits are typically hypothetical or anecdotal. In the case of fftpack->fft, I have given reasons why the new name would have problems, too (when possible, never have a package/module named the same thing as a function or class it provides), so I think any benefit from renaming ultimately comes out as a wash. That just leaves us with the substantial costs. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From stefan at sun.ac.za Fri Oct 31 16:42:28 2008 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Fri, 31 Oct 2008 22:42:28 +0200 Subject: [SciPy-dev] Renaming fftpack to fft, removing backends In-Reply-To: References: <4906D211.6070505@ar.media.kyoto-u.ac.jp> <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> <9457e7c80810310923m60aa9008pfef3f6a0ed70c45@mail.gmail.com> <3d375d730810311225w79ec9deey67d7e980eff798e6@mail.gmail.com> Message-ID: <9457e7c80810311342i5d4d59f5g6314d736f4d2ed18@mail.gmail.com> 2008/10/31 Jarrod Millman : > There are also a number of other API cleanups that I would like to see > before a 1.0 release. > > Thoughts? Ideas?
I agree with the sentiments of your post. As for the SciPy API, I would prefer if we took a different route than we did with NumPy, and made an explicit effort to review each part of the API before freezing it, rather than simply waiting for development to "stabilise". I was very pleased with the recent discussions around scipy.spatial -- that was a good example of how an API should be determined! Regards Stéfan From damian.eads.lists at gmail.com Fri Oct 31 17:23:24 2008 From: damian.eads.lists at gmail.com (Damian Eads) Date: Fri, 31 Oct 2008 14:23:24 -0700 Subject: [SciPy-dev] Reminder: SciPy Sprint is TOMORROW (Saturday) and Sunday Message-ID: <91b4b1ab0810311423t1cc32c68x8501076730c15fa4@mail.gmail.com> Hi there, As upcoming events can slip people's minds, this is a reminder that the sprint for SciPy is November 1-2, 2008, which starts tomorrow, Saturday. There will be sprint teams working in Stellenbosch, South Africa (led by Stefan van der Walt); Austin, TX; and Berkeley, CA (led by Jarrod Millman). Let's get our hands dirty... Damian ----------------------------------------------------- Damian Eads Ph.D. Student Jack Baskin School of Engineering, UCSC E2-489 1156 High Street Machine Learning Lab Santa Cruz, CA 95064 http://www.soe.ucsc.edu/~eads From oliphant at enthought.com Fri Oct 31 18:32:16 2008 From: oliphant at enthought.com (Travis E. Oliphant) Date: Fri, 31 Oct 2008 17:32:16 -0500 Subject: [SciPy-dev] Reminder: SciPy Sprint is TOMORROW (Saturday) and Sunday In-Reply-To: <91b4b1ab0810311423t1cc32c68x8501076730c15fa4@mail.gmail.com> References: <91b4b1ab0810311423t1cc32c68x8501076730c15fa4@mail.gmail.com> Message-ID: <490B8770.6040202@enthought.com> Damian Eads wrote: > Hi there, > > As upcoming events can slip people's minds, this is a reminder that > the sprint for SciPy is November 1-2, 2008, which starts tomorrow, > Saturday. There will be sprint teams working in Stellenbosch, South > Africa (led by Stefan van der Walt); Austin, TX; and Berkeley, CA (led > by Jarrod Millman). > > Let's get our hands dirty... > This sounds great. I'm excited to see what will come out of the effort. Many thanks to those that participate. -Travis From cournape at gmail.com Fri Oct 31 23:15:35 2008 From: cournape at gmail.com (David Cournapeau) Date: Sat, 1 Nov 2008 12:23:35 +0900 Subject: [SciPy-dev] Renaming fftpack to fft, removing backends In-Reply-To: <3d375d730810311225w79ec9deey67d7e980eff798e6@mail.gmail.com> References: <4906D211.6070505@ar.media.kyoto-u.ac.jp> <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> <9457e7c80810310923m60aa9008pfef3f6a0ed70c45@mail.gmail.com> <3d375d730810311225w79ec9deey67d7e980eff798e6@mail.gmail.com> Message-ID: <5b8d13220810312015g1366e5c0m821ad3ed893f6275@mail.gmail.com> On Sat, Nov 1, 2008 at 4:25 AM, Robert Kern wrote: > On Fri, Oct 31, 2008 at 11:23, Stéfan van der Walt wrote: >> 2008/10/28 Robert Kern : >>>> How: For 0.7, we could rename it while keeping scipy.fftpack as an empty >>>> placeholder (like scipy.linsolve), and deprecating scipy.fftpack. >>> >>> This needs to be handled carefully. scipy.fft() already exists. It >> >> scipy.fft() just contributes to SciPy root pollution. Unlike with >> NumPy, we still have the opportunity to organise SciPy well, and I >> don't think fft() belongs at the top level. > > I agree that it doesn't belong there. Personally, I would like to > remove everything from the top-level. However, I don't particularly > agree that we have no constraints on reorganizing scipy.
With numpy we > made a clear statement that the API was going to change up until 1.0, > at which point we made some promises of stability. Consequently, we > felt comfortable changing the API before 1.0 without concern for > breaking people's code. With scipy, that's not the case. 1.0 is not > and has never been the arbitrary dividing line between instability and > stability for this project. scipy is installed, people are using it, > and we have an obligation not to break their code for minor > tidying-up. If we have this policy, it only makes sense to follow it strictly. API compatibility is mostly a binary thing: either we allow breakage, or we don't. To break "a bit" is the same as breaking a lot: it means people can't rely on it as a stable dependency. That's the part of your argument that I don't follow, I guess. Again, I don't mind so much which decision is made, but it would be nice that: - the decision is made clearly, so that people are aware of it. At least Stefan and I were not aware of it. How are we supposed to know it? - if we go toward API stability, the decision should be enforced. I don't see how middle ground has any value: it just means people will break it silently (as it has happened constantly in the past, at least since I have been following it) instead of discussing it. cheers, David From robert.kern at gmail.com Fri Oct 31 23:50:55 2008 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 31 Oct 2008 22:50:55 -0500 Subject: [SciPy-dev] Renaming fftpack to fft, removing backends In-Reply-To: <5b8d13220810312015g1366e5c0m821ad3ed893f6275@mail.gmail.com> References: <4906D211.6070505@ar.media.kyoto-u.ac.jp> <3d375d730810281031s77008de4q1e956b28dd93e117@mail.gmail.com> <9457e7c80810310923m60aa9008pfef3f6a0ed70c45@mail.gmail.com> <3d375d730810311225w79ec9deey67d7e980eff798e6@mail.gmail.com> <5b8d13220810312015g1366e5c0m821ad3ed893f6275@mail.gmail.com> Message-ID: <3d375d730810312050k65b1661et1078983efda255af@mail.gmail.com> On Fri, Oct 31, 2008 at 22:15, David Cournapeau wrote: > On Sat, Nov 1, 2008 at 4:25 AM, Robert Kern wrote: >> On Fri, Oct 31, 2008 at 11:23, Stéfan van der Walt wrote: >>> 2008/10/28 Robert Kern : >>>>> How: For 0.7, we could rename it while keeping scipy.fftpack as an empty >>>>> placeholder (like scipy.linsolve), and deprecating scipy.fftpack. >>>> >>>> This needs to be handled carefully. scipy.fft() already exists. It >>> >>> scipy.fft() just contributes to SciPy root pollution. Unlike with >>> NumPy, we still have the opportunity to organise SciPy well, and I >>> don't think fft() belongs at the top level. >> >> I agree that it doesn't belong there. Personally, I would like to >> remove everything from the top-level. However, I don't particularly >> agree that we have no constraints on reorganizing scipy. With numpy we >> made a clear statement that the API was going to change up until 1.0, >> at which point we made some promises of stability. Consequently, we >> felt comfortable changing the API before 1.0 without concern for >> breaking people's code. With scipy, that's not the case. 1.0 is not >> and has never been the arbitrary dividing line between instability and >> stability for this project. scipy is installed, people are using it, >> and we have an obligation not to break their code for minor >> tidying-up. > > If we have this policy, it only makes sense to follow it strictly. API > compatibility is mostly a binary thing: either we allow breakage, or > we don't.
To break "a bit" is the same as breaking a lot: it means > people can't rely on it as a stable dependency. That's the part of > your argument that I don't follow, I guess. It's not true that it's entirely binary. Some changes are more severe than others, and some changes have more benefits than others. Not all parts of a library are widely used. Specifications of behavior are sometimes vague or nonexistent, so changes to them might be acceptable. APIs are sometimes buggy (not just their implementation) and need to be fixed. Absolute, strict compatibility is impossible to achieve while still improving the software. But sensible compromises can be made. For example, Python itself is a pretty good project for maintaining backwards compatibility. But behavior changes do happen, and deep-diving codebases like Twisted do break on just about every new 2.x release. Because the changes are usually beneficial to a wide audience while the breakage is limited to small parts of libraries that depend on mostly unspecified behavior, these changes are usually considered acceptable. To avoid this level of breakage would mean stopping developing Python altogether (this is not my opinion, but Martin van Löwis's, and he knows more about it than I). > Again, I don't mind so much which decision is made, but it would be nice that: > - the decision is made clearly, so that people are aware of it. At > least Stefan and I were not aware of it. How are we supposed to know > it? > - if we go toward API stability, the decision should be enforced. I > don't see how middle ground has any value: it just means people will > break it silently (as it has happened constantly in the past, at least > since I have been following it) instead of discussing it. I don't see how it follows that taking my "middle ground" stance means that people won't discuss possibly-code-breaking changes they are making. It seems to me that it would encourage more discussion because the tradeoffs need to be justified and agreed upon. Whereas taking the "any code-breaking changes are acceptable before 1.0" stance seems like it might let these changes get lost in the shuffle because no justification needs to happen. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco