From lorenzo.isella at gmail.com Mon Oct 1 05:34:29 2012 From: lorenzo.isella at gmail.com (Lorenzo Isella) Date: Mon, 1 Oct 2012 11:34:29 +0200 Subject: [SciPy-User] Projected Area Message-ID: Dear All, I hope this is not too off-topic. I need to know if there is already some ready-to-use SciPy algorithm (or at least if this is easy to implement or not). Consider a dimer, i.e. 2 spheres with a single contact point. This dimer can have any orientation in the 3D and I have the (x,y,z) coordinates of the centre of the 2 spheres. For a given orientation, I want to project the dimer on, let's say, the xy plane and evaluate the area of the surface of its projection. I spoke about a dimer since it is easy to start discussing a simple case, but in general I will deal with objects consisting of several non-overlapping spheres such that any sphere has at least a contact point with another sphere. Any suggestion is appreciated. Cheers Lorenzo From robert.kern at gmail.com Mon Oct 1 11:03:30 2012 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 1 Oct 2012 16:03:30 +0100 Subject: [SciPy-User] Projected Area In-Reply-To: References: Message-ID: On Mon, Oct 1, 2012 at 10:34 AM, Lorenzo Isella wrote: > Dear All, > I hope this is not too off-topic. > I need to know if there is already some ready-to-use SciPy algorithm > (or at least if this is easy to implement or not). > Consider a dimer, i.e. 2 spheres with a single contact point. This > dimer can have any orientation in the 3D and I have the (x,y,z) > coordinates of the centre of the 2 spheres. > For a given orientation, I want to project the dimer on, let's say, > the xy plane and evaluate the area of the surface of its projection. > I spoke about a dimer since it is easy to start discussing a simple > case, but in general I will deal with objects consisting of several > non-overlapping spheres such that any sphere has at least a contact > point with another sphere. There is nothing implemented in scipy for this. For the case of spheres projected (orthographically?) onto a plane, the shadows are probably-overlapping circles (the contact point is irrelevant). It looks like there is an analytical solution to the area of the intersection for circles: http://mathworld.wolfram.com/Circle-CircleIntersection.html You can probably just add up the areas of each circle, then subtract out one copy of each area of intersection to get the area of the union. -- Robert Kern From ralf.gommers at gmail.com Mon Oct 1 13:42:12 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 1 Oct 2012 19:42:12 +0200 Subject: [SciPy-User] Request help on fsolve In-Reply-To: <1349044865.39688.YahooMailNeo@web31813.mail.mud.yahoo.com> References: <1349044865.39688.YahooMailNeo@web31813.mail.mud.yahoo.com> Message-ID: On Mon, Oct 1, 2012 at 12:41 AM, The Helmbolds wrote: > Please help me out here. I?m trying to rewrite the docstring for the > `fsolve.py` routine > located on my machine in: C:/users/owner/scipy/scipy/optimize/minpack.py > > The specific issue I?m having difficulty with is understanding the outputs > described in fsolve?s docstring as: > 'fjac': the orthogonal matrix, q, produced by the QR factorization of > the final approximate Jacobian matrix, stored column wise > 'r': upper triangular matrix produced by QR factorization of same matrix > > These are described in SciPy?s minpack/hybrd.f file as: > ?fjac? is an output n by n array which contains the orthogonal matrix q > produced by the qr factorization of the final approximate jacobian. > ?r? 
is an output array of length lr which contains the upper triangular > matrix produced by the qr factorization of the final approximate jacobian, > stored rowwise. > > For ease in writing, in what follows let?s use the symbols ?Jend? for the > final approximate Jacobian matrix, and use ?Q? and ?R? for its QR > decomposition matrices. Now consider the problem of finding the solution to > the following three nonlinear equations in three unknowns (u, v, w), which > we will refer to as ?E?: > 2 * a * u + b * v + d - w * v = 0 > b * u + 2 * c * v + e - w * u = 0 > u * v - f = 0 > where (a, b, c, d, e, f ) = (2, 3, 7, 8, 9, 2). For inputs to fsolve, we > identify (u, v, w) = (x[0], x[1], x[2]). > > Now fsolve gives the solution array: > [uend vend wend] = [ 1.79838825 1.11210691 16.66195357]. > With these values, the above three equations E are satisfied to an > accuracy of about 9 significant figures. > > The Jacobian matrix for the three LHS functions in E is: > J = np.matrix([[2*a, b-w, -v], [b-w, 2*c, -u], [v, u, 0.]]) > Note that it?s symmetric, and if we compute its value using the above > fsolve?s ?end? solution values we get: > Jend = [[ 4. 19.66195357 1.11210691], > [ 19.66195357 14. 1.79838825], > [ 1.11210691 1.79838825 0. ]] > Using SciPy?s linalg package, this Jend has the QR decomposition: > Qend = [[-0.28013447 -0.91516674 -0.28981807] > [ 0.95679602 -0.24168763 -0.16164302] > [ 0.07788487 -0.32257856 0.94333293]] > Rend = [[-14.278857 17.08226116 -1.40915124] > [ -0. 9.69946027 1.45241144] > [ -0. 0. 0.61300558]] > and Qend * Rend = Jend to within about 15 significant figures. > However, fsolve gives the QR decomposition: > qretm = [[-0.64093238 0.75748326 0.1241966 ] > [-0.62403598 -0.60841098 0.4903215 ] > [-0.44697291 -0.23675978 -0.8626471 ]] > rret = [ -7.77806716 30.02199802 -0.819055 -10.74878184 > 2.00090268 1.02706198] > and converting rret to a NumPy matrix gives: > rretm = [[ -7.77806716 30.02199802 -0.819055 ] > [ 0. -10.74878184 2.00090268] > [ 0. 0. 1.02706198]] > Now qret and rretm bear no obvious relation to Qend and Rend. Although > qretm is orthogonal to about 16 significant figures, we find the product: > qretm * rretm = [[ 4.98521509 -27.38409295 2.16816676] > [ 4.85379376 -12.19513008 -0.2026608 ] > [ 3.47658529 -10.87414051 -0.99362993]] > which bears no obvious relationship to Jend. > > The hybrdj.f routine in minpack refers to a permutation matrix, p, such > that we should have in our notation: > p*Jend = qretm*rretm, > but fsolve apparently does not return the matrix p, and I don?t see any > permutation of Jend that would equal qretm*rretm. > > If we reinterpret rret as meaning the matrix: > rretaltm = [[ -7.77806716 30.02199802 -10.74878184] > [ 0. -0.819055 2.00090268] > [ 0. 0. 1.02706198]] > then we get the product: > qretm * rretaltm = [[ 4.98521509 -19.86249109 8.53245022] > [ 4.85379376 -18.2364849 5.99384603] > [ 3.47658529 -13.22510045 3.44468895]] > which again bears no obvious relationship to Jend. Using the transpose of > qretm in the above product is no help. > > So please help me out here. What are the fjac and r values that fsolve > returns? > How are they related to the above Qend, Rend, and Jend? > How is the user supposed to use them? > I'm not sure. To play with your example it would be very helpful if you could provide it as a Python script. Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
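A minimal script reconstructing the example above, since a runnable version was requested (a sketch: the starting guess is an arbitrary assumption, and fsolve may converge to a different root from other starting points):

#### begin code snippet ####
import numpy as np
from scipy.optimize import fsolve

a, b, c, d, e, f = 2.0, 3.0, 7.0, 8.0, 9.0, 2.0

def equations(x):
    u, v, w = x
    return [2*a*u + b*v + d - w*v,
            b*u + 2*c*v + e - w*u,
            u*v - f]

# full_output=True returns the info dict that carries 'fjac' and 'r'
x, info, ier, mesg = fsolve(equations, [1.0, 1.0, 10.0], full_output=True)
print(x)              # the run described above reported ~ [1.798, 1.112, 16.662]
print(info['fjac'])   # orthogonal factor as reported by MINPACK
print(info['r'])      # upper triangle, packed as a flat length-6 array
print(np.allclose(equations(x), 0.0))
#### end code snippet ####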
URL: From lorenzo.isella at gmail.com Mon Oct 1 13:54:16 2012 From: lorenzo.isella at gmail.com (Lorenzo Isella) Date: Mon, 01 Oct 2012 19:54:16 +0200 Subject: [SciPy-User] Projected Area In-Reply-To: References: Message-ID: Hello, And thanks for your reply. Unfortunately, the situation is not this easy. The dimer example was somehow misleading. It is not so straightforward to calculate the area of multiple overlapping circles (in particular when the intersection of 4-5 circles is not empty). I think I will have to resort to some Monte Carlo integration. Cheers Lorenzo On Mon, 01 Oct 2012 16:59:21 +0200, wrote: > On Mon, Oct 1, 2012 at 10:34 AM, Lorenzo Isella > wrote: >> Dear All, >> I hope this is not too off-topic. >> I need to know if there is already some ready-to-use SciPy algorithm >> (or at least if this is easy to implement or not). >> Consider a dimer, i.e. 2 spheres with a single contact point. This >> dimer can have any orientation in the 3D and I have the (x,y,z) >> coordinates of the centre of the 2 spheres. >> For a given orientation, I want to project the dimer on, let's say, >> the xy plane and evaluate the area of the surface of its projection. >> I spoke about a dimer since it is easy to start discussing a simple >> case, but in general I will deal with objects consisting of several >> non-overlapping spheres such that any sphere has at least a contact >> point with another sphere. > There is nothing implemented in scipy for this. For the case of > spheres projected (orthographically?) onto a plane, the shadows are > probably-overlapping circles (the contact point is irrelevant). It > looks like there is an analytical solution to the area of the > intersection for circles: > http://mathworld.wolfram.com/Circle-CircleIntersection.html > You can probably just add up the areas of each circle, then subtract > out one copy of each area of intersection to get the area of the > union. From davidmenhur at gmail.com Mon Oct 1 14:13:18 2012 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Mon, 1 Oct 2012 20:13:18 +0200 Subject: [SciPy-User] Projected Area In-Reply-To: References: Message-ID: On Mon, Oct 1, 2012 at 7:54 PM, Lorenzo Isella wrote: > I think I will have to resort to some Monte Carlo integration. Not everything is lost. You could make a boolean 3D grid as big as your memory allows to, with True in the spheres and False in empty space. Rotate it is just matrix multiplication, and project over one axis with .any. The shape of the sphere doesn't have oscillations, so a regular grid is a good approach to the integration. From cgohlke at uci.edu Mon Oct 1 14:19:57 2012 From: cgohlke at uci.edu (Christoph Gohlke) Date: Mon, 01 Oct 2012 11:19:57 -0700 Subject: [SciPy-User] Projected Area In-Reply-To: References: Message-ID: <5069DECD.1070705@uci.edu> On 10/1/2012 10:54 AM, Lorenzo Isella wrote: > > Hello, > And thanks for your reply. > Unfortunately, the situation is not this easy. The dimer example was > somehow misleading. > It is not so straightforward to calculate the area of multiple overlapping > circles (in particular when the intersection of 4-5 circles is not empty). > I think I will have to resort to some Monte Carlo integration. > Cheers > > Lorenzo Try Shapely , a geospatial library, to analyze planar geometric objects after projecting your 3D objects. Christoph > > > On Mon, 01 Oct 2012 16:59:21 +0200, wrote: > >> On Mon, Oct 1, 2012 at 10:34 AM, Lorenzo Isella >> wrote: >>> Dear All, >>> I hope this is not too off-topic. 
>>> I need to know if there is already some ready-to-use SciPy algorithm >>> (or at least if this is easy to implement or not). >>> Consider a dimer, i.e. 2 spheres with a single contact point. This >>> dimer can have any orientation in the 3D and I have the (x,y,z) >>> coordinates of the centre of the 2 spheres. >>> For a given orientation, I want to project the dimer on, let's say, >>> the xy plane and evaluate the area of the surface of its projection. >>> I spoke about a dimer since it is easy to start discussing a simple >>> case, but in general I will deal with objects consisting of several >>> non-overlapping spheres such that any sphere has at least a contact >>> point with another sphere. >> There is nothing implemented in scipy for this. For the case of >> spheres projected (orthographically?) onto a plane, the shadows are >> probably-overlapping circles (the contact point is irrelevant). It >> looks like there is an analytical solution to the area of the >> intersection for circles: >> http://mathworld.wolfram.com/Circle-CircleIntersection.html >> You can probably just add up the areas of each circle, then subtract >> out one copy of each area of intersection to get the area of the >> union. > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > > From jkhilmer at chemistry.montana.edu Mon Oct 1 14:39:48 2012 From: jkhilmer at chemistry.montana.edu (jkhilmer at chemistry.montana.edu) Date: Mon, 1 Oct 2012 12:39:48 -0600 Subject: [SciPy-User] Projected Area In-Reply-To: References: Message-ID: Lorenzo, Were the previous suggestions not viable due to speed or precision? http://thread.gmane.org/gmane.comp.python.scientific.user/30450/focus=30464 Jonathan On Mon, Oct 1, 2012 at 11:54 AM, Lorenzo Isella wrote: > > Hello, > And thanks for your reply. > Unfortunately, the situation is not this easy. The dimer example was > somehow misleading. > It is not so straightforward to calculate the area of multiple overlapping > circles (in particular when the intersection of 4-5 circles is not empty). > I think I will have to resort to some Monte Carlo integration. > Cheers > > Lorenzo From ralf.gommers at gmail.com Mon Oct 1 15:58:57 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 1 Oct 2012 21:58:57 +0200 Subject: [SciPy-User] cobyla In-Reply-To: <1349039373.72572.YahooMailNeo@web31805.mail.mud.yahoo.com> References: <1349039373.72572.YahooMailNeo@web31805.mail.mud.yahoo.com> Message-ID: On Sun, Sep 30, 2012 at 11:09 PM, The Helmbolds wrote: > On my system (Windows 7, Python 2.7.x and IDLE, latest SciPy), I observe > the following behavior with fmin_cobyla and minimize's COBYLA method. > > Case 1: When run either in the IDLE interactive shell or within an > enclosing Python program: > 1.1. The fmin_cobyla function never returns the Results dictionary, > and never displays it to Python's stdout. This is true regardless of the > function call's disp setting. > Correct. The fmin_cobyla docstring clearly says what it returns. Result objects are only returned by the new interfaces in the 0.11.0 release (minimize, minimize_scalar, root). 1.2. The 'minimize' function always returns the Results dictionary but > never displays it to Python's stdout. Again, this is true regardless of the > function call's disp setting. > `disp` doesn't print the Results objects. 
For me it works as advertized (in IPython), it prints something like: Normal return from subroutine COBYLA NFVALS = 37 F = 8.000000E-01 MAXCV = 0.000000E+00 X = 1.400113E+00 1.700056E+00 Ralf > Case 2: When run interactively in Window's Command Prompt box: > 2.1 The fmin_cobyla function never returns the Result dictionary, > regardless of the function call's disp setting. Setting disp to True or > False either displays the Results dictionary in the command box or not > (respectively). I don't think the Results dictionary gets to the command > box via stdout. > 2.2 The 'minimize' function always returns the Result dictionary, > regardless of the function call's disp setting. Setting disp to True or > False either displays the Results dictionary in the command box or not > (respectively). I don't think the Results dictionary gets to the command > box via stdout. > > My thanks to all who helped clarify this situation. > > Bob H > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From klonuo at gmail.com Mon Oct 1 17:08:09 2012 From: klonuo at gmail.com (klo uo) Date: Mon, 1 Oct 2012 23:08:09 +0200 Subject: [SciPy-User] Scaling clip-art image Message-ID: Hi, while looking for a way to produce quality resize of line drawings I was suggested this algorithm: https://secure.wikimedia.org/wikipedia/en/wiki/Hqx It produces by far best results on fixed resize ratios: 2x, 3x, 4x Color edges are crisp, lines almost sharp as wanted, and I was wondering does scipy has similar function, or different approach that may give good results on clip-art images resize? -------------- next part -------------- An HTML attachment was scrubbed... URL: From will at thearete.co.uk Tue Oct 2 04:49:37 2012 From: will at thearete.co.uk (Will Furnass) Date: Tue, 2 Oct 2012 08:49:37 +0000 (UTC) Subject: [SciPy-User] [SciPy-user] Pylab - standard packages References: <76b6b0e2f78755096dd3545e87ced475.squirrel@srv2.s4y.tournesol-consulting.eu> <01D91AC9-ACAA-4D5F-BB9C-B3BC179D39E0@continuum.io> <34482439.post@talk.nabble.com> Message-ID: A point that I don't think has been mentioned so far (correct me if I'm wrong) is whether devising a Scipy standard with recommended/minimum package versions will hinder (or expedite) the transition to Python 3.x. If one package e.g. matplotlib is still Python 2.x only then that would keep the standard 2.7 but may add momentum the development of a 3.x version of that package. More generally, is there any interest in a 3.x Scipy standard, either now or in the next couple of years? Even if there were sufficient 3.x packages to permit both a 2.x and 3.x version of the standard I hope others would agree that this is not a great idea. From robert.kern at gmail.com Tue Oct 2 05:09:35 2012 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 2 Oct 2012 10:09:35 +0100 Subject: [SciPy-User] Scaling clip-art image In-Reply-To: References: Message-ID: On Mon, Oct 1, 2012 at 10:08 PM, klo uo wrote: > Hi, > > while looking for a way to produce quality resize of line drawings I was > suggested this algorithm: https://secure.wikimedia.org/wikipedia/en/wiki/Hqx > > It produces by far best results on fixed resize ratios: 2x, 3x, 4x > Color edges are crisp, lines almost sharp as wanted, and I was wondering > does scipy has similar function, or different approach that may give good > results on clip-art images resize? 
We do not have any such specialized filters. -- Robert Kern From jwevandijk at xs4all.nl Tue Oct 2 06:35:05 2012 From: jwevandijk at xs4all.nl (Janwillem) Date: Tue, 02 Oct 2012 12:35:05 +0200 Subject: [SciPy-User] interpolate interp1d and rbf Message-ID: <506AC359.1040300@xs4all.nl> I am using interpolate.interp1d and interpolate.rbf and could simplify my scripts greatly if I could get the original axes values from the interpolating function like f = interpolate.interp1d(x, y) and than somewhere else original_x = f.get_original_x() Is there such a thing as "get_original_x" Or if not possible is there at least a way to find the range of the axes to prevent extrapolation errors. I had a look at f.nodes but with no success My scipy is 0.9.0 Many thanks, Janwillem From trive at astro.su.se Tue Oct 2 06:57:12 2012 From: trive at astro.su.se (=?ISO-8859-1?Q?Th=F8ger_Rivera-Thorsen?=) Date: Tue, 02 Oct 2012 12:57:12 +0200 Subject: [SciPy-User] Fitting Gaussian in spectra In-Reply-To: References: Message-ID: <506AC888.4050702@astro.su.se> Hi Joe; Depending on the physical character of your data, I believe the spectral fitting tool Sherpa could be of help to you. http://cxc.cfa.harvard.edu/contrib/sherpa/ An introductory tutorial is given here: http://python4astronomers.github.com/fitting/spectrum.html - the software package is general enough that it is also useful for non-astronomers. It continas some very nice tools to include or ignore certain data regions in your fit. There is a bug in the Sherpa standalone package, the solution of which I descibe here: http://lusepuster.posterous.com/installing-sherpa-fitting-software-on-ubuntu (I believe it should work on any *nix like system with the proper libraries and build tools installed). The bug only affects the sherpa.astro.ui sub-package, If for some reason you have no suces with the bug fix, you can still use all the functionality in sherpa.ui and follow the tutorials etc.; all you lose is some specialized high-level astronomical convenience functions and models. If your continuum isn't particularly well-behaved, and if you are only interested in the continuum in order to eliminate it, I think I'd start with localizing the peaks, then selecting a small region around each of them and model the background with your model of choice - in this case, a constant or a polynomium or power law shouod often work fine - and add the gaussian for the peak to the model, perform the fit and go on to next line. A first-guess to the peak position can often be made with some prior knowledge of the wavelength of the transition you're investigating. Cheers; Emil On 09/30/2012 08:21 PM, Matt Newville wrote: > Hi Joe, > > On Fri, Sep 28, 2012 at 1:45 PM, Joe Philip Ninan wrote: >> Hi, >> I have a spectra with multiple gaussian emission lines over a noisy >> continuum. >> My primary objective is to find areas under all the gaussian peaks. >> For that, the following is the algorithm i have in mind. >> 1) fit the continuum and subtract it. >> 2) find the peaks >> 3) do least square fit of gaussian at the peaks to find the area under each >> gaussian peaks. >> I am basically stuck at the first step itself. Simple 2nd or 3rd order >> polynomial fit is not working because the contribution from peaks are >> significant. Any tool exist to fit continuum ignoring the peaks? >> For finding peaks, i tried find_peaks_cwt in signal module of scipy. But it >> seems to be quite sensitive of the width of peak and was picking up >> non-existing peaks also. 
>> The wavelet used was default mexican hat. Is there any better wavelet i >> should try? >> >> Or is there any other module in python/scipy which i should give a try? >> Thanking you. >> -cheers >> joe > I would echo much of the earlier advice. Fitting in stages (first > background, then peaks) can be a bit dangerous, but is sometimes > justifiable. > > I think there really isn't a good domain-independent way to model a > continuum background, and it can be very useful to have some physical > or spectral model for what the form of the continuum should be. > > That being said, there are a few things you might consider trying, > especially since you know that you have positive peaks on a relatively > smooth (if noisy) background. First, in the fit objective function, > you might consider weighting positive elements of the residuals > logarithmically and negative elements by some large scale or even > exponentially. That will help to ignore the peaks, and keep the > modeled background on the very low end of the spectra. > > Second, use your knowledge of the peak widths to set the polynomial or > spline, or whatever function you're using to model the background. If > you know your peaks have some range of widths, you could even consider > using a Fourier filtering method to reduce the low-frequency continuum > and the high-frequency noise while leaving the frequencies of interest > (mostly) in tact. With such an approach, you might fit the background > such that it only tried to match the low-frequency components of the > spectra. > > Finally, sometimes, a least-squares fit isn't needed. For example, > for x-ray fluorescence spectra there is a simple but pretty effective > method by Kajfosz and Kwiatek in Nucl Instrum Meth B22, p78 (1987) > "Non-polynomial approximation of background in x-ray spectra". For an > implementation of this, see > https://github.com/xraypy/tdl/blob/master/modules/xrf/xrf_bgr.py > > This might not be exactly what you're looking for, but it might help > get you started. > > --Matt > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From takowl at gmail.com Tue Oct 2 07:09:12 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Tue, 2 Oct 2012 12:09:12 +0100 Subject: [SciPy-User] [SciPy-user] Pylab - standard packages In-Reply-To: References: <76b6b0e2f78755096dd3545e87ced475.squirrel@srv2.s4y.tournesol-consulting.eu> <01D91AC9-ACAA-4D5F-BB9C-B3BC179D39E0@continuum.io> <34482439.post@talk.nabble.com> Message-ID: Hi Will, On 2 October 2012 09:49, Will Furnass wrote: > A point that I don't think has been mentioned so far (correct me if I'm > wrong) is whether devising a Scipy standard with recommended/minimum > package versions will hinder (or expedite) the transition to Python 3.x. > If one package e.g. matplotlib is still Python 2.x only then that would > keep the standard 2.7 but may add momentum the development of a 3.x > version of that package. More generally, is there any interest in a 3.x > Scipy standard, either now or in the next couple of years? At present, I've put in the standard that 2.x >= 2.6 or 3.x >= 3.2 is valid. Of the current selection of packages, there are three that aren't yet released on Python 3: - matplotlib: coming very soon - SymPy: I think the work is done, but it has yet to be released. Hopefully coming soon (https://github.com/sympy/sympy/pull/1507 ) - Pytables: Still a work in progress, but it *is* being worked on. 
For now, I think we should steer newcomers towards Python 2, but I don't want the standard to preclude making Python 3 distributions once the necessary packages are there. Pyzo, Almar's distribution, is based on Python 3. Thanks, Thomas From lorenzo.isella at gmail.com Tue Oct 2 09:37:15 2012 From: lorenzo.isella at gmail.com (Lorenzo Isella) Date: Tue, 02 Oct 2012 15:37:15 +0200 Subject: [SciPy-User] Projected Area In-Reply-To: References: Message-ID: On Mon, 01 Oct 2012 21:54:29 +0200, wrote: > Date: Mon, 1 Oct 2012 12:39:48 -0600 > From: "jkhilmer at chemistry.montana.edu" > > Subject: Re: [SciPy-User] Projected Area > To: SciPy Users List > Message-ID: > > Content-Type: text/plain; charset=ISO-8859-1 > Lorenzo, > Were the previous suggestions not viable due to speed or precision? > http://thread.gmane.org/gmane.comp.python.scientific.user/30450/focus=30464 > Jonathan Hello, Thanks for the link. That was another question I asked some time ago. Here the situation is simpler and I think I do not want to over-engineer the solution: so far it looks like I can get some decent results with an old-fashoned Monte Carlo integration. Cheers Lorenzo From takowl at gmail.com Tue Oct 2 10:57:59 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Tue, 2 Oct 2012 15:57:59 +0100 Subject: [SciPy-User] [SciPy-user] Pylab - standard packages In-Reply-To: References: <76b6b0e2f78755096dd3545e87ced475.squirrel@srv2.s4y.tournesol-consulting.eu> <01D91AC9-ACAA-4D5F-BB9C-B3BC179D39E0@continuum.io> <34482439.post@talk.nabble.com> Message-ID: So that everyone's aware, there's more discussion of what packages should be in the standard taking place on the NumFOCUS list. If you want to follow it, start reading from about here: https://groups.google.com/d/msg/numfocus/aQKHmlS4m0Y/h8iOeyoruTEJ Thanks, Thomas From helmrp at yahoo.com Tue Oct 2 13:47:49 2012 From: helmrp at yahoo.com (The Helmbolds) Date: Tue, 2 Oct 2012 10:47:49 -0700 (PDT) Subject: [SciPy-User] Autoblock? Message-ID: <1349200069.60097.YahooMailNeo@web31816.mail.mud.yahoo.com> When attempting to install SciPy 0.11.0, Webroot blocks installation with message: Win32.Autoblock.1 detected ? Autoblock appears to be a virus. ? Now what? Bob and Paula H From cournape at gmail.com Tue Oct 2 14:11:30 2012 From: cournape at gmail.com (David Cournapeau) Date: Tue, 2 Oct 2012 19:11:30 +0100 Subject: [SciPy-User] Autoblock? In-Reply-To: <1349200069.60097.YahooMailNeo@web31816.mail.mud.yahoo.com> References: <1349200069.60097.YahooMailNeo@web31816.mail.mud.yahoo.com> Message-ID: On Tue, Oct 2, 2012 at 6:47 PM, The Helmbolds wrote: > When attempting to install SciPy 0.11.0, Webroot blocks installation with message: > Win32.Autoblock.1 detected > > Autoblock appears to be a virus. > > Now what? Where did you download scipy from ? If you got it from the official download page, your anti-virus may not be up to date. There is no virus in the official scipy installers, David From kevin.gullikson at gmail.com Mon Oct 1 11:21:17 2012 From: kevin.gullikson at gmail.com (Kevin Gullikson) Date: Mon, 1 Oct 2012 10:21:17 -0500 Subject: [SciPy-User] Projected Area In-Reply-To: References: Message-ID: For the more general case, I would wager it has something to do with vector projection, which you can use to find the length of a "shadow" cast by a line. http://en.wikipedia.org/wiki/Vector_projection Your case would be a 3d generalization of it, but I'm sure that has been done somewhere... 
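Pulling the thread's suggestions together -- apply the desired rotation to the centres, drop the z coordinate, then estimate the area of the union of the shadow circles -- a numpy-only Monte Carlo sketch along the lines Lorenzo mentions could look like the following (Shapely, as Christoph suggests, would give the same union area without sampling noise). The function name, defaults and the dimer test values are illustrative:

#### begin code snippet ####
import numpy as np

def projected_area_mc(centers, radii, n_samples=200000, seed=0):
    """Monte Carlo estimate of the shadow area of a sphere cluster
    projected orthographically onto the xy plane.

    centers : (N, 3) array of sphere centres, already rotated as desired
    radii   : (N,) array of sphere radii
    """
    rng = np.random.RandomState(seed)
    cx, cy = centers[:, 0], centers[:, 1]      # orthographic projection: drop z
    xmin, xmax = (cx - radii).min(), (cx + radii).max()
    ymin, ymax = (cy - radii).min(), (cy + radii).max()
    x = rng.uniform(xmin, xmax, n_samples)
    y = rng.uniform(ymin, ymax, n_samples)
    # a sample point lies in the shadow if it is inside any projected circle
    d2 = (x[:, None] - cx)**2 + (y[:, None] - cy)**2
    inside = (d2 <= radii**2).any(axis=1)
    return (xmax - xmin) * (ymax - ymin) * inside.mean()

# dimer of unit spheres touching at one point: shadow area -> 2*pi ~ 6.283
centers = np.array([[0.0, 0.0, 0.0], [2.0, 0.0, 0.0]])
radii = np.array([1.0, 1.0])
print(projected_area_mc(centers, radii))
#### end code snippet ####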
Kevin Gullikson On Mon, Oct 1, 2012 at 10:03 AM, Robert Kern wrote: > On Mon, Oct 1, 2012 at 10:34 AM, Lorenzo Isella > wrote: > > Dear All, > > I hope this is not too off-topic. > > I need to know if there is already some ready-to-use SciPy algorithm > > (or at least if this is easy to implement or not). > > Consider a dimer, i.e. 2 spheres with a single contact point. This > > dimer can have any orientation in the 3D and I have the (x,y,z) > > coordinates of the centre of the 2 spheres. > > For a given orientation, I want to project the dimer on, let's say, > > the xy plane and evaluate the area of the surface of its projection. > > I spoke about a dimer since it is easy to start discussing a simple > > case, but in general I will deal with objects consisting of several > > non-overlapping spheres such that any sphere has at least a contact > > point with another sphere. > > There is nothing implemented in scipy for this. For the case of > spheres projected (orthographically?) onto a plane, the shadows are > probably-overlapping circles (the contact point is irrelevant). It > looks like there is an analytical solution to the area of the > intersection for circles: > > http://mathworld.wolfram.com/Circle-CircleIntersection.html > > You can probably just add up the areas of each circle, then subtract > out one copy of each area of intersection to get the area of the > union. > > -- > Robert Kern > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From thoger.emil at gmail.com Mon Oct 1 18:37:24 2012 From: thoger.emil at gmail.com (=?ISO-8859-1?Q?Th=F8ger_Rivera-Thorsen?=) Date: Tue, 02 Oct 2012 00:37:24 +0200 Subject: [SciPy-User] Fitting Gaussian in spectra In-Reply-To: References: Message-ID: <506A1B24.1040102@gmail.com> Hi Joe; I don't know what exactly you are working on, but it seems like you could benefit from the astronomical spectrum fitting package Sherpa, which is importable as a python module. You can read more about it here: http://cxc.cfa.harvard.edu/contrib/sherpa/ Python is developed but the Chandra x-ray center but is not astronomy-specific. An introduction to the interactive interface can be found at: http://python4astronomers.github.com/fitting/spectrum.html There is a bug in the current installer which concerns the sherpa.astro.ui module; if you're on a *nix-like system I have written a little how-to on fixing this bug here: http://lusepuster.posterous.com/installing-sherpa-fitting-software-on-ubuntu (The title says Ubuntu but I actually don't think there's anything Ubuntu-specific in it). But even if you have no luck fixing the bug, you can still use the normal sherpa.ui module which is all that is required to follow the above tutorial, all you'll lose is some pretty astronomy-specific convenience functions. As for the actual strategy: if you're only interested in the continuum in order to eliminate it, I think I'd recommend localizing the peaks first, then select a region around them (sherpa has a tool for that), choose a model for the continuum (there are several to choose from, but for local simple models a constant or a power law would often be fine), and then add a simple gaussian to the model and perform the fit as described in the Python4Astronomers link above. 
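For readers not using Sherpa, the same per-line strategy can be sketched with plain scipy.optimize.curve_fit -- a constant local continuum plus a single Gaussian, fitted over a small window around each peak. The window half-width and the starting guesses below are illustrative assumptions:

#### begin code snippet ####
import numpy as np
from scipy.optimize import curve_fit

def line_model(x, cont, amp, center, sigma):
    # constant local continuum plus one Gaussian emission line
    return cont + amp * np.exp(-0.5 * ((x - center) / sigma)**2)

def fit_one_line(wave, flux, center0, halfwidth):
    """Fit a single peak in a window of +/- halfwidth around a rough
    position center0 (e.g. the known wavelength of the transition)."""
    sel = np.abs(wave - center0) < halfwidth
    w, fl = wave[sel], flux[sel]
    p0 = [np.median(fl), fl.max() - np.median(fl), center0, 0.2 * halfwidth]
    popt, pcov = curve_fit(line_model, w, fl, p0=p0)
    cont, amp, center, sigma = popt
    area = amp * abs(sigma) * np.sqrt(2.0 * np.pi)   # area under the Gaussian
    return popt, area
#### end code snippet ####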
If your spectra are particularly well-behaved, you may have luck building a model that describes both continuum and all your peaks of interest with a combination of e.g. a (multiple) power-law or a blackbody spectrum plus some gaussians, but often the reward is not really worth the hassle. Cheers; Emil On 09/30/2012 08:21 PM, Matt Newville wrote: > Hi Joe, > > On Fri, Sep 28, 2012 at 1:45 PM, Joe Philip Ninan wrote: >> Hi, >> I have a spectra with multiple gaussian emission lines over a noisy >> continuum. >> My primary objective is to find areas under all the gaussian peaks. >> For that, the following is the algorithm i have in mind. >> 1) fit the continuum and subtract it. >> 2) find the peaks >> 3) do least square fit of gaussian at the peaks to find the area under each >> gaussian peaks. >> I am basically stuck at the first step itself. Simple 2nd or 3rd order >> polynomial fit is not working because the contribution from peaks are >> significant. Any tool exist to fit continuum ignoring the peaks? >> For finding peaks, i tried find_peaks_cwt in signal module of scipy. But it >> seems to be quite sensitive of the width of peak and was picking up >> non-existing peaks also. >> The wavelet used was default mexican hat. Is there any better wavelet i >> should try? >> >> Or is there any other module in python/scipy which i should give a try? >> Thanking you. >> -cheers >> joe > I would echo much of the earlier advice. Fitting in stages (first > background, then peaks) can be a bit dangerous, but is sometimes > justifiable. > > I think there really isn't a good domain-independent way to model a > continuum background, and it can be very useful to have some physical > or spectral model for what the form of the continuum should be. > > That being said, there are a few things you might consider trying, > especially since you know that you have positive peaks on a relatively > smooth (if noisy) background. First, in the fit objective function, > you might consider weighting positive elements of the residuals > logarithmically and negative elements by some large scale or even > exponentially. That will help to ignore the peaks, and keep the > modeled background on the very low end of the spectra. > > Second, use your knowledge of the peak widths to set the polynomial or > spline, or whatever function you're using to model the background. If > you know your peaks have some range of widths, you could even consider > using a Fourier filtering method to reduce the low-frequency continuum > and the high-frequency noise while leaving the frequencies of interest > (mostly) in tact. With such an approach, you might fit the background > such that it only tried to match the low-frequency components of the > spectra. > > Finally, sometimes, a least-squares fit isn't needed. For example, > for x-ray fluorescence spectra there is a simple but pretty effective > method by Kajfosz and Kwiatek in Nucl Instrum Meth B22, p78 (1987) > "Non-polynomial approximation of background in x-ray spectra". For an > implementation of this, see > https://github.com/xraypy/tdl/blob/master/modules/xrf/xrf_bgr.py > > This might not be exactly what you're looking for, but it might help > get you started. 
> > --Matt > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From harshadsurdi at gmail.com Wed Oct 3 09:27:53 2012 From: harshadsurdi at gmail.com (Harshad Surdi) Date: Wed, 3 Oct 2012 18:57:53 +0530 Subject: [SciPy-User] Eclipse IDE for Java Developers with PyDev - updating scipy Message-ID: Hi, I am using Eclipse IDE for Java Developers with PyDev on Ubuntu 12.04 and I am quite new to Ubuntu and Eclipse. Can you guide me as to hos to update scipy version in PyDev in Eclipse? -- Best Regards, Harshad Surdi -------------- next part -------------- An HTML attachment was scrubbed... URL: From takowl at gmail.com Wed Oct 3 12:06:10 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Wed, 3 Oct 2012 17:06:10 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) Message-ID: Following on from recent discussion here and on the numfocus list, I'm trying to work out the set of packages that should make up a standardised 'scipy stack'. We've determined that Python, numpy, scipy, matplotlib and IPython are to be included. Then there's a list that have got a 'maybe': pandas, statsmodels, sympy, scikits-learn, scikits-image, PyTables, h5py, NetworkX, nose, basemap & netCDF4. My aim is to have a general set of packages that you can do useful work with, and will stand up to the competition (particularly Matlab & R), but without gaining too many subject-specific packages. But I don't know what's generally useful and what's subject specific. Vote at: http://www.doodle.com/ma6rnpnbfc6wivu9 It's set up so you can vote for or against a package, or abstain if you're not sure - I've abstained on most of them myself. Thanks, Thomas From josef.pktd at gmail.com Wed Oct 3 12:52:59 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 3 Oct 2012 12:52:59 -0400 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On Wed, Oct 3, 2012 at 12:06 PM, Thomas Kluyver wrote: > Following on from recent discussion here and on the numfocus list, I'm > trying to work out the set of packages that should make up a > standardised 'scipy stack'. We've determined that Python, numpy, > scipy, matplotlib and IPython are to be included. Then there's a list > that have got a 'maybe': pandas, statsmodels, sympy, scikits-learn, > scikits-image, PyTables, h5py, NetworkX, nose, basemap & netCDF4. > > My aim is to have a general set of packages that you can do useful > work with, and will stand up to the competition (particularly Matlab & > R), but without gaining too many subject-specific packages. But I > don't know what's generally useful and what's subject specific. > > Vote at: http://www.doodle.com/ma6rnpnbfc6wivu9 > > It's set up so you can vote for or against a package, or abstain if > you're not sure - I've abstained on most of them myself. Why is the default no, instead of abstain (Yes)? I had to go back to fix where I didn't vote. Josef > > Thanks, > Thomas > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From josh.k.lawrence at gmail.com Wed Oct 3 12:54:09 2012 From: josh.k.lawrence at gmail.com (Josh Lawrence) Date: Wed, 3 Oct 2012 11:54:09 -0500 Subject: [SciPy-User] NumPy Binomial BTPE method Problem Message-ID: Hello all, I am implementing a binomial random variable in MATLAB. 
The default method in the statistics toolbox is extremely slow for large population/trial size. I am needing to do trials for n as large as 2**28. I found in NumPy some code that implements a binomial random draw in numpy/random/mtrand/distributions.c. I was trying to convert the code to MATLAB and the BTPE method seems to have an error in lines 337-341 of distributions.c. The if ... else if ... else statement I think is incorrect. I think it should be an if ... else ... statement followed by the contents of the original else which starts on line 337. The if ... else if ... else block is as follows: #### begin code snippet #### if (m < y) { for (i=m; i<=y; i++) { F *= (a/i - s); } } else if (m > y) { for (i=y; i<=m; i++) { F /= (a/i - s); } } else { if (v > F) goto Step10; goto Step60; } #### end code snippet #### >From what I can tell, the variable F is only used in the comparison within the else{} statment (i.e. the if(v > F) statement) and nowhere else within the scope of the function. I also found a fortran implementation here: http://wstein.org/home/wstein/www/home/mhansen/spkgs_in_progress/octave-3.2.4/src/libcruft/ranlib/ignbin.f and it appears this is from where the code was originally adapted as the variable names are the same. My parsing of fortran GOTOs is a bit rusty, but I think the contents of the else block in above snippet should be not be conditional. I don't understand the underlying algorithm very well and don't have access the the BTPE paper, so I can't comment on the validity of the fortran code. There just seems to be an error in logic in the above code. So please have someone who understands it look at it. It appears Robert Kern wrote the function a decent portion of the file at some point in the past. I hope this helps. Cheers, -- Josh Lawrence P.S. I apologize if my email is inconvenient, but I could not figure out how to tell gmail to set the reply-to field to be scipy-user at scipy.org. From takowl at gmail.com Wed Oct 3 13:07:22 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Wed, 3 Oct 2012 18:07:22 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On 3 October 2012 17:52, wrote: > Why is the default no, instead of abstain (Yes)? Because this isn't exactly the use case Doodle is designed for. Sorry about that, and thanks for checking your answer. Anyone else who did the same, please take a moment to edit your response. Early results suggest pandas, sympy, h5py and nose are the most popular. Thanks, Thomas From josh.k.lawrence at gmail.com Wed Oct 3 14:42:41 2012 From: josh.k.lawrence at gmail.com (Josh Lawrence) Date: Wed, 3 Oct 2012 13:42:41 -0500 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: Hey all, I received access to the paper and it seems it was originally based purely on the paper written by Kachitvichyanukul in 1988. I still think there's a whoopsies with the if ... else if ... else, block though. On Wed, Oct 3, 2012 at 11:54 AM, Josh Lawrence wrote: > Hello all, > > I am implementing a binomial random variable in MATLAB. The default > method in the statistics toolbox is extremely slow for large > population/trial size. I am needing to do trials for n as large as > 2**28. I found in NumPy some code that implements a binomial random > draw in numpy/random/mtrand/distributions.c. I was trying to convert > the code to MATLAB and the BTPE method seems to have an error in lines > 337-341 of distributions.c. The if ... else if ... 
else statement I > think is incorrect. I think it should be an if ... else ... statement > followed by the contents of the original else which starts on line > 337. > > The if ... else if ... else block is as follows: > > #### begin code snippet #### > if (m < y) > { > for (i=m; i<=y; i++) > { > F *= (a/i - s); > } > } > else if (m > y) > { > for (i=y; i<=m; i++) > { > F /= (a/i - s); > } > } > else > { > if (v > F) goto Step10; > goto Step60; > } > #### end code snippet #### > > From what I can tell, the variable F is only used in the comparison > within the else{} statment (i.e. the if(v > F) statement) and nowhere > else within the scope of the function. > > I also found a fortran implementation here: > http://wstein.org/home/wstein/www/home/mhansen/spkgs_in_progress/octave-3.2.4/src/libcruft/ranlib/ignbin.f > and it appears this is from where the code was originally adapted as > the variable names are the same. > > My parsing of fortran GOTOs is a bit rusty, but I think the contents > of the else block in above snippet should be not be conditional. > > I don't understand the underlying algorithm very well and don't have > access the the BTPE paper, so I can't comment on the validity of the > fortran code. There just seems to be an error in logic in the above > code. So please have someone who understands it look at it. It appears > Robert Kern wrote the function a decent portion of the file at some > point in the past. > > I hope this helps. > > Cheers, > > -- > Josh Lawrence > > > P.S. I apologize if my email is inconvenient, but I could not figure > out how to tell gmail to set the reply-to field to be > scipy-user at scipy.org. -- Josh Lawrence From josef.pktd at gmail.com Wed Oct 3 15:07:54 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 3 Oct 2012 15:07:54 -0400 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: On Wed, Oct 3, 2012 at 2:42 PM, Josh Lawrence wrote: > Hey all, > > I received access to the paper and it seems it was originally based > purely on the paper written by Kachitvichyanukul in 1988. I still > think there's a whoopsies with the if ... else if ... else, block > though. the c code "else" looks strange to me, however, checking a few cases with large p*n for a large sample (1 million draws), I don't see any difference of the frequency count to the theoretical distribution from scipy.binom. (but with all the goto's I'm not sure if I really trigger that path.) Josef > > On Wed, Oct 3, 2012 at 11:54 AM, Josh Lawrence > wrote: >> Hello all, >> >> I am implementing a binomial random variable in MATLAB. The default >> method in the statistics toolbox is extremely slow for large >> population/trial size. I am needing to do trials for n as large as >> 2**28. I found in NumPy some code that implements a binomial random >> draw in numpy/random/mtrand/distributions.c. I was trying to convert >> the code to MATLAB and the BTPE method seems to have an error in lines >> 337-341 of distributions.c. The if ... else if ... else statement I >> think is incorrect. I think it should be an if ... else ... statement >> followed by the contents of the original else which starts on line >> 337. >> >> The if ... else if ... 
else block is as follows: >> >> #### begin code snippet #### >> if (m < y) >> { >> for (i=m; i<=y; i++) >> { >> F *= (a/i - s); >> } >> } >> else if (m > y) >> { >> for (i=y; i<=m; i++) >> { >> F /= (a/i - s); >> } >> } >> else >> { >> if (v > F) goto Step10; >> goto Step60; >> } >> #### end code snippet #### >> >> From what I can tell, the variable F is only used in the comparison >> within the else{} statment (i.e. the if(v > F) statement) and nowhere >> else within the scope of the function. >> >> I also found a fortran implementation here: >> http://wstein.org/home/wstein/www/home/mhansen/spkgs_in_progress/octave-3.2.4/src/libcruft/ranlib/ignbin.f >> and it appears this is from where the code was originally adapted as >> the variable names are the same. >> >> My parsing of fortran GOTOs is a bit rusty, but I think the contents >> of the else block in above snippet should be not be conditional. >> >> I don't understand the underlying algorithm very well and don't have >> access the the BTPE paper, so I can't comment on the validity of the >> fortran code. There just seems to be an error in logic in the above >> code. So please have someone who understands it look at it. It appears >> Robert Kern wrote the function a decent portion of the file at some >> point in the past. >> >> I hope this helps. >> >> Cheers, >> >> -- >> Josh Lawrence >> >> >> P.S. I apologize if my email is inconvenient, but I could not figure >> out how to tell gmail to set the reply-to field to be >> scipy-user at scipy.org. > > > > -- > Josh Lawrence > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From josef.pktd at gmail.com Wed Oct 3 15:59:05 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 3 Oct 2012 15:59:05 -0400 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: On Wed, Oct 3, 2012 at 3:07 PM, wrote: > On Wed, Oct 3, 2012 at 2:42 PM, Josh Lawrence wrote: >> Hey all, >> >> I received access to the paper and it seems it was originally based >> purely on the paper written by Kachitvichyanukul in 1988. I still >> think there's a whoopsies with the if ... else if ... else, block >> though. > > the c code "else" looks strange to me, > however, checking a few cases with large p*n for a large sample (1 > million draws), I don't see any difference of the frequency count to > the theoretical distribution from scipy.binom. I'm pretty sure you are right. (If my reading as non c programmer is correct) The else block means that Step 50 is never used, instead it uses Step 52, which uses a different approximation that is intended for the tails. If Step 52 is relatively close to the result of Step 50, then it will not be very visible in the final results. >From my reading of the code there should be a small distortion around the mean. Josef > > (but with all the goto's I'm not sure if I really trigger that path.) > > Josef > >> >> On Wed, Oct 3, 2012 at 11:54 AM, Josh Lawrence >> wrote: >>> Hello all, >>> >>> I am implementing a binomial random variable in MATLAB. The default >>> method in the statistics toolbox is extremely slow for large >>> population/trial size. I am needing to do trials for n as large as >>> 2**28. I found in NumPy some code that implements a binomial random >>> draw in numpy/random/mtrand/distributions.c. I was trying to convert >>> the code to MATLAB and the BTPE method seems to have an error in lines >>> 337-341 of distributions.c. 
The if ... else if ... else statement I >>> think is incorrect. I think it should be an if ... else ... statement >>> followed by the contents of the original else which starts on line >>> 337. >>> >>> The if ... else if ... else block is as follows: >>> >>> #### begin code snippet #### >>> if (m < y) >>> { >>> for (i=m; i<=y; i++) >>> { >>> F *= (a/i - s); >>> } >>> } >>> else if (m > y) >>> { >>> for (i=y; i<=m; i++) >>> { >>> F /= (a/i - s); >>> } >>> } >>> else >>> { >>> if (v > F) goto Step10; >>> goto Step60; >>> } >>> #### end code snippet #### >>> >>> From what I can tell, the variable F is only used in the comparison >>> within the else{} statment (i.e. the if(v > F) statement) and nowhere >>> else within the scope of the function. >>> >>> I also found a fortran implementation here: >>> http://wstein.org/home/wstein/www/home/mhansen/spkgs_in_progress/octave-3.2.4/src/libcruft/ranlib/ignbin.f >>> and it appears this is from where the code was originally adapted as >>> the variable names are the same. >>> >>> My parsing of fortran GOTOs is a bit rusty, but I think the contents >>> of the else block in above snippet should be not be conditional. >>> >>> I don't understand the underlying algorithm very well and don't have >>> access the the BTPE paper, so I can't comment on the validity of the >>> fortran code. There just seems to be an error in logic in the above >>> code. So please have someone who understands it look at it. It appears >>> Robert Kern wrote the function a decent portion of the file at some >>> point in the past. >>> >>> I hope this helps. >>> >>> Cheers, >>> >>> -- >>> Josh Lawrence >>> >>> >>> P.S. I apologize if my email is inconvenient, but I could not figure >>> out how to tell gmail to set the reply-to field to be >>> scipy-user at scipy.org. >> >> >> >> -- >> Josh Lawrence >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user From josh.k.lawrence at gmail.com Wed Oct 3 16:05:55 2012 From: josh.k.lawrence at gmail.com (Josh Lawrence) Date: Wed, 3 Oct 2012 15:05:55 -0500 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: Also, the for loops should be i=m+1 and i=y+1 for the left and right tails, respectively. Again, I do'nt think this tangibly changes things, but the algorithm shows that you set i=m (or i=y), and the first step of the loop in both cases is i=i+1. Here's a link to the paper if you have access to ACM. http://dl.acm.org/citation.cfm?id=42381 . So I think it's just the two changes. I have implemented those and get very similar results from doing a histogram. On Wed, Oct 3, 2012 at 2:59 PM, wrote: > On Wed, Oct 3, 2012 at 3:07 PM, wrote: >> On Wed, Oct 3, 2012 at 2:42 PM, Josh Lawrence wrote: >>> Hey all, >>> >>> I received access to the paper and it seems it was originally based >>> purely on the paper written by Kachitvichyanukul in 1988. I still >>> think there's a whoopsies with the if ... else if ... else, block >>> though. >> >> the c code "else" looks strange to me, >> however, checking a few cases with large p*n for a large sample (1 >> million draws), I don't see any difference of the frequency count to >> the theoretical distribution from scipy.binom. > > > I'm pretty sure you are right. > (If my reading as non c programmer is correct) > The else block means that Step 50 is never used, instead it uses Step > 52, which uses a different approximation that is intended for the > tails. 
> If Step 52 is relatively close to the result of Step 50, then it will > not be very visible in the final results. > >From my reading of the code there should be a small distortion around the mean. > > Josef > >> >> (but with all the goto's I'm not sure if I really trigger that path.) >> >> Josef >> >>> >>> On Wed, Oct 3, 2012 at 11:54 AM, Josh Lawrence >>> wrote: >>>> Hello all, >>>> >>>> I am implementing a binomial random variable in MATLAB. The default >>>> method in the statistics toolbox is extremely slow for large >>>> population/trial size. I am needing to do trials for n as large as >>>> 2**28. I found in NumPy some code that implements a binomial random >>>> draw in numpy/random/mtrand/distributions.c. I was trying to convert >>>> the code to MATLAB and the BTPE method seems to have an error in lines >>>> 337-341 of distributions.c. The if ... else if ... else statement I >>>> think is incorrect. I think it should be an if ... else ... statement >>>> followed by the contents of the original else which starts on line >>>> 337. >>>> >>>> The if ... else if ... else block is as follows: >>>> >>>> #### begin code snippet #### >>>> if (m < y) >>>> { >>>> for (i=m; i<=y; i++) >>>> { >>>> F *= (a/i - s); >>>> } >>>> } >>>> else if (m > y) >>>> { >>>> for (i=y; i<=m; i++) >>>> { >>>> F /= (a/i - s); >>>> } >>>> } >>>> else >>>> { >>>> if (v > F) goto Step10; >>>> goto Step60; >>>> } >>>> #### end code snippet #### >>>> >>>> From what I can tell, the variable F is only used in the comparison >>>> within the else{} statment (i.e. the if(v > F) statement) and nowhere >>>> else within the scope of the function. >>>> >>>> I also found a fortran implementation here: >>>> http://wstein.org/home/wstein/www/home/mhansen/spkgs_in_progress/octave-3.2.4/src/libcruft/ranlib/ignbin.f >>>> and it appears this is from where the code was originally adapted as >>>> the variable names are the same. >>>> >>>> My parsing of fortran GOTOs is a bit rusty, but I think the contents >>>> of the else block in above snippet should be not be conditional. >>>> >>>> I don't understand the underlying algorithm very well and don't have >>>> access the the BTPE paper, so I can't comment on the validity of the >>>> fortran code. There just seems to be an error in logic in the above >>>> code. So please have someone who understands it look at it. It appears >>>> Robert Kern wrote the function a decent portion of the file at some >>>> point in the past. >>>> >>>> I hope this helps. >>>> >>>> Cheers, >>>> >>>> -- >>>> Josh Lawrence >>>> >>>> >>>> P.S. I apologize if my email is inconvenient, but I could not figure >>>> out how to tell gmail to set the reply-to field to be >>>> scipy-user at scipy.org. >>> >>> >>> >>> -- >>> Josh Lawrence >>> _______________________________________________ >>> SciPy-User mailing list >>> SciPy-User at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-user > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user -- Josh Lawrence From josh.k.lawrence at gmail.com Wed Oct 3 16:07:05 2012 From: josh.k.lawrence at gmail.com (Josh Lawrence) Date: Wed, 3 Oct 2012 15:07:05 -0500 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: Sorry that's lines 325 and 332 for the for loops. On Wed, Oct 3, 2012 at 3:05 PM, Josh Lawrence wrote: > Also, the for loops should be i=m+1 and i=y+1 for the left and right > tails, respectively. 
Again, I do'nt think this tangibly changes > things, but the algorithm shows that you set i=m (or i=y), and the > first step of the loop in both cases is i=i+1. Here's a link to the > paper if you have access to ACM. > > http://dl.acm.org/citation.cfm?id=42381 . > > So I think it's just the two changes. I have implemented those and get > very similar results from doing a histogram. > > On Wed, Oct 3, 2012 at 2:59 PM, wrote: >> On Wed, Oct 3, 2012 at 3:07 PM, wrote: >>> On Wed, Oct 3, 2012 at 2:42 PM, Josh Lawrence wrote: >>>> Hey all, >>>> >>>> I received access to the paper and it seems it was originally based >>>> purely on the paper written by Kachitvichyanukul in 1988. I still >>>> think there's a whoopsies with the if ... else if ... else, block >>>> though. >>> >>> the c code "else" looks strange to me, >>> however, checking a few cases with large p*n for a large sample (1 >>> million draws), I don't see any difference of the frequency count to >>> the theoretical distribution from scipy.binom. >> >> >> I'm pretty sure you are right. >> (If my reading as non c programmer is correct) >> The else block means that Step 50 is never used, instead it uses Step >> 52, which uses a different approximation that is intended for the >> tails. >> If Step 52 is relatively close to the result of Step 50, then it will >> not be very visible in the final results. >> >From my reading of the code there should be a small distortion around the mean. >> >> Josef >> >>> >>> (but with all the goto's I'm not sure if I really trigger that path.) >>> >>> Josef >>> >>>> >>>> On Wed, Oct 3, 2012 at 11:54 AM, Josh Lawrence >>>> wrote: >>>>> Hello all, >>>>> >>>>> I am implementing a binomial random variable in MATLAB. The default >>>>> method in the statistics toolbox is extremely slow for large >>>>> population/trial size. I am needing to do trials for n as large as >>>>> 2**28. I found in NumPy some code that implements a binomial random >>>>> draw in numpy/random/mtrand/distributions.c. I was trying to convert >>>>> the code to MATLAB and the BTPE method seems to have an error in lines >>>>> 337-341 of distributions.c. The if ... else if ... else statement I >>>>> think is incorrect. I think it should be an if ... else ... statement >>>>> followed by the contents of the original else which starts on line >>>>> 337. >>>>> >>>>> The if ... else if ... else block is as follows: >>>>> >>>>> #### begin code snippet #### >>>>> if (m < y) >>>>> { >>>>> for (i=m; i<=y; i++) >>>>> { >>>>> F *= (a/i - s); >>>>> } >>>>> } >>>>> else if (m > y) >>>>> { >>>>> for (i=y; i<=m; i++) >>>>> { >>>>> F /= (a/i - s); >>>>> } >>>>> } >>>>> else >>>>> { >>>>> if (v > F) goto Step10; >>>>> goto Step60; >>>>> } >>>>> #### end code snippet #### >>>>> >>>>> From what I can tell, the variable F is only used in the comparison >>>>> within the else{} statment (i.e. the if(v > F) statement) and nowhere >>>>> else within the scope of the function. >>>>> >>>>> I also found a fortran implementation here: >>>>> http://wstein.org/home/wstein/www/home/mhansen/spkgs_in_progress/octave-3.2.4/src/libcruft/ranlib/ignbin.f >>>>> and it appears this is from where the code was originally adapted as >>>>> the variable names are the same. >>>>> >>>>> My parsing of fortran GOTOs is a bit rusty, but I think the contents >>>>> of the else block in above snippet should be not be conditional. 
>>>>> >>>>> I don't understand the underlying algorithm very well and don't have >>>>> access the the BTPE paper, so I can't comment on the validity of the >>>>> fortran code. There just seems to be an error in logic in the above >>>>> code. So please have someone who understands it look at it. It appears >>>>> Robert Kern wrote the function a decent portion of the file at some >>>>> point in the past. >>>>> >>>>> I hope this helps. >>>>> >>>>> Cheers, >>>>> >>>>> -- >>>>> Josh Lawrence >>>>> >>>>> >>>>> P.S. I apologize if my email is inconvenient, but I could not figure >>>>> out how to tell gmail to set the reply-to field to be >>>>> scipy-user at scipy.org. >>>> >>>> >>>> >>>> -- >>>> Josh Lawrence >>>> _______________________________________________ >>>> SciPy-User mailing list >>>> SciPy-User at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/scipy-user >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > > > > -- > Josh Lawrence -- Josh Lawrence From trive at astro.su.se Wed Oct 3 16:41:21 2012 From: trive at astro.su.se (=?ISO-8859-1?Q?Th=F8ger_Rivera-Thorsen?=) Date: Wed, 03 Oct 2012 22:41:21 +0200 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: <506CA2F1.6030407@astro.su.se> Just a thought, although late in the process; Is there in the default stack any toolkit to help create simple interactive GUIs, like e.g. Traits(ui)? Nothing overly complicated, but simple dialogues etc. would be great for creating simple apps for e.g. teaching. I know IDL has it and it is used quite frequently (yes, I'm an astronomer). Cheers Emil On 10/03/2012 06:52 PM, josef.pktd at gmail.com wrote: > On Wed, Oct 3, 2012 at 12:06 PM, Thomas Kluyver wrote: >> Following on from recent discussion here and on the numfocus list, I'm >> trying to work out the set of packages that should make up a >> standardised 'scipy stack'. We've determined that Python, numpy, >> scipy, matplotlib and IPython are to be included. Then there's a list >> that have got a 'maybe': pandas, statsmodels, sympy, scikits-learn, >> scikits-image, PyTables, h5py, NetworkX, nose, basemap & netCDF4. >> >> My aim is to have a general set of packages that you can do useful >> work with, and will stand up to the competition (particularly Matlab & >> R), but without gaining too many subject-specific packages. But I >> don't know what's generally useful and what's subject specific. >> >> Vote at: http://www.doodle.com/ma6rnpnbfc6wivu9 >> >> It's set up so you can vote for or against a package, or abstain if >> you're not sure - I've abstained on most of them myself. > Why is the default no, instead of abstain (Yes)? > > I had to go back to fix where I didn't vote. 
> > Josef > > >> Thanks, >> Thomas >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From takowl at gmail.com Wed Oct 3 16:54:44 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Wed, 3 Oct 2012 21:54:44 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: <506CA2F1.6030407@astro.su.se> References: <506CA2F1.6030407@astro.su.se> Message-ID: On 3 October 2012 21:41, Th?ger Rivera-Thorsen wrote: > Is there in the default stack any toolkit to help create simple > interactive GUIs, like e.g. Traits(ui)? Nothing overly complicated, but > simple dialogues etc. would be great for creating simple apps for e.g. > teaching. I know IDL has it and it is used quite frequently (yes, I'm an > astronomer). Tkinter is included as part of the Python standard library, so you can build simple GUIs. For quickly presenting dialogs, you could easily install easygui (http://easygui.sourceforge.net/ ), which builds on Tkinter, but I don't think it should be part of the standard. I don't know how either compare to TraitsUI, which I haven't used. Thomas From cgohlke at uci.edu Wed Oct 3 17:06:59 2012 From: cgohlke at uci.edu (Christoph Gohlke) Date: Wed, 03 Oct 2012 14:06:59 -0700 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: <506CA8F3.2000500@uci.edu> On 10/3/2012 9:06 AM, Thomas Kluyver wrote: > Following on from recent discussion here and on the numfocus list, I'm > trying to work out the set of packages that should make up a > standardised 'scipy stack'. We've determined that Python, numpy, > scipy, matplotlib and IPython are to be included. Then there's a list > that have got a 'maybe': pandas, statsmodels, sympy, scikits-learn, > scikits-image, PyTables, h5py, NetworkX, nose, basemap & netCDF4. > > My aim is to have a general set of packages that you can do useful > work with, and will stand up to the competition (particularly Matlab & > R), but without gaining too many subject-specific packages. But I > don't know what's generally useful and what's subject specific. > > Vote at: http://www.doodle.com/ma6rnpnbfc6wivu9 > > It's set up so you can vote for or against a package, or abstain if > you're not sure - I've abstained on most of them myself. > > Thanks, > Thomas Hi, it was mentioned before: none of the suggested packages can read or write image files on their own, except for matplotlib's built-in PNG support. Matplotlib, Scipy and skimage depend on other, optional packages or binaries for image I/O: PIL, FreeImage, GDAL, PyQt. Christoph From robert.kern at gmail.com Wed Oct 3 17:28:20 2012 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 3 Oct 2012 22:28:20 +0100 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: On Wed, Oct 3, 2012 at 5:54 PM, Josh Lawrence wrote: > Hello all, > > I am implementing a binomial random variable in MATLAB. The default > method in the statistics toolbox is extremely slow for large > population/trial size. I am needing to do trials for n as large as > 2**28. I found in NumPy some code that implements a binomial random > draw in numpy/random/mtrand/distributions.c. I was trying to convert > the code to MATLAB and the BTPE method seems to have an error in lines > 337-341 of distributions.c. 
The if ... else if ... else statement I > think is incorrect. I think it should be an if ... else ... statement > followed by the contents of the original else which starts on line > 337. Yes, you are correct, on this point as well as the m+1 and y+1. Thank you for debugging my code! -- Robert Kern From josh.k.lawrence at gmail.com Wed Oct 3 17:42:53 2012 From: josh.k.lawrence at gmail.com (Josh Lawrence) Date: Wed, 3 Oct 2012 16:42:53 -0500 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: Hah, my pleasure. I'm surprised I found them, as your code seems to always work so well. On Wed, Oct 3, 2012 at 4:28 PM, Robert Kern wrote: > On Wed, Oct 3, 2012 at 5:54 PM, Josh Lawrence wrote: >> Hello all, >> >> I am implementing a binomial random variable in MATLAB. The default >> method in the statistics toolbox is extremely slow for large >> population/trial size. I am needing to do trials for n as large as >> 2**28. I found in NumPy some code that implements a binomial random >> draw in numpy/random/mtrand/distributions.c. I was trying to convert >> the code to MATLAB and the BTPE method seems to have an error in lines >> 337-341 of distributions.c. The if ... else if ... else statement I >> think is incorrect. I think it should be an if ... else ... statement >> followed by the contents of the original else which starts on line >> 337. > > Yes, you are correct, on this point as well as the m+1 and y+1. Thank > you for debugging my code! > > -- > Robert Kern > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user -- Josh Lawrence From robert.kern at gmail.com Wed Oct 3 17:45:03 2012 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 3 Oct 2012 22:45:03 +0100 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: On Wed, Oct 3, 2012 at 10:42 PM, Josh Lawrence wrote: > Hah, my pleasure. I'm surprised I found them, as your code seems to > always work so well. I was a bored grad student, desperately not trying to do real work and mistranslated some goto logic. The paper is clearer than the RANLIB code I was referencing, but I must have missed that. -- Robert Kern From josh.k.lawrence at gmail.com Wed Oct 3 18:00:45 2012 From: josh.k.lawrence at gmail.com (Josh Lawrence) Date: Wed, 3 Oct 2012 17:00:45 -0500 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: Yes, I found the paper quite clear. I did a while loop with if blocks (basically a switch statement) instead of goto statements since I was in MATLAB and it makes a lot more sense the way I wrote it. On Wed, Oct 3, 2012 at 4:45 PM, Robert Kern wrote: > On Wed, Oct 3, 2012 at 10:42 PM, Josh Lawrence > wrote: >> Hah, my pleasure. I'm surprised I found them, as your code seems to >> always work so well. > > I was a bored grad student, desperately not trying to do real work and > mistranslated some goto logic. The paper is clearer than the RANLIB > code I was referencing, but I must have missed that. 
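For readers following the fix: below is a condensed Python sketch of the corrected control flow described in this thread (the tail loops starting at m+1 and y+1, and the acceptance test applied unconditionally after them). It is only an illustration of the change, not the actual distributions.c source; the names F, v, a, s, m and y follow the C snippet quoted earlier in the thread.

#### begin code sketch ####
def squeeze_test(F, v, a, s, m, y):
    # Corrected structure: accumulate F over the appropriate tail ...
    if m < y:
        for i in range(m + 1, y + 1):   # i runs from m+1 up to y inclusive
            F *= (a / i - s)
    elif m > y:
        for i in range(y + 1, m + 1):   # i runs from y+1 up to m inclusive
            F /= (a / i - s)
    # ... then test acceptance unconditionally: v > F means reject the
    # candidate (back to Step 10), otherwise accept it (Step 60).
    return v <= F
#### end code sketch ####

In the C code the equivalent change is simply to drop the final else branch and perform the if (v > F) test after the two loops.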
> > -- > Robert Kern > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user -- Josh Lawrence From takowl at gmail.com Wed Oct 3 18:09:11 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Wed, 3 Oct 2012 23:09:11 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: <506CA8F3.2000500@uci.edu> References: <506CA8F3.2000500@uci.edu> Message-ID: On 3 October 2012 22:06, Christoph Gohlke wrote: > it was mentioned before: none of the suggested packages can read or > write image files on their own, except for matplotlib's built-in PNG > support. Matplotlib, Scipy and skimage depend on other, optional > packages or binaries for image I/O: PIL, FreeImage, GDAL, PyQt. If we include scikits-image (which looks unlikely based on the current poll results), we had agreed to specify FreeImage, or possibly one of FreeImage and PIL. Matplotlib will need at least one backend installed, and the documentation says "Most backends support png, pdf, ps, eps and svg." That seems adequate. For saving images, there's less need to require a range of formats than if loading them is a key feature. Thomas From cgohlke at uci.edu Wed Oct 3 18:27:27 2012 From: cgohlke at uci.edu (Christoph Gohlke) Date: Wed, 03 Oct 2012 15:27:27 -0700 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: <506CA8F3.2000500@uci.edu> Message-ID: <506CBBCF.8080608@uci.edu> On 10/3/2012 3:09 PM, Thomas Kluyver wrote: > On 3 October 2012 22:06, Christoph Gohlke wrote: >> it was mentioned before: none of the suggested packages can read or >> write image files on their own, except for matplotlib's built-in PNG >> support. Matplotlib, Scipy and skimage depend on other, optional >> packages or binaries for image I/O: PIL, FreeImage, GDAL, PyQt. > > If we include scikits-image (which looks unlikely based on the current > poll results), we had agreed to specify FreeImage, or possibly one of > FreeImage and PIL. > > Matplotlib will need at least one backend installed, and the > documentation says "Most backends support png, pdf, ps, eps and svg." > That seems adequate. For saving images, there's less need to require a > range of formats than if loading them is a key feature. > > Thomas I thought PIL was out of question because it's abandonware. Did anyone check if the triple-licensing option of FreeImage (GPLv2, GPLv3, or FIPL) is compatible with the Scipy stack? Also, FreeImage is not a Python package. Pdf, ps, eps and svg are vector graphics formats, not adequate for image IO. Christoph From robert.kern at gmail.com Wed Oct 3 18:34:54 2012 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 3 Oct 2012 23:34:54 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: <506CBBCF.8080608@uci.edu> References: <506CA8F3.2000500@uci.edu> <506CBBCF.8080608@uci.edu> Message-ID: On Wed, Oct 3, 2012 at 11:27 PM, Christoph Gohlke wrote: > On 10/3/2012 3:09 PM, Thomas Kluyver wrote: >> On 3 October 2012 22:06, Christoph Gohlke wrote: >>> it was mentioned before: none of the suggested packages can read or >>> write image files on their own, except for matplotlib's built-in PNG >>> support. Matplotlib, Scipy and skimage depend on other, optional >>> packages or binaries for image I/O: PIL, FreeImage, GDAL, PyQt. >> >> If we include scikits-image (which looks unlikely based on the current >> poll results), we had agreed to specify FreeImage, or possibly one of >> FreeImage and PIL. 
>> >> Matplotlib will need at least one backend installed, and the >> documentation says "Most backends support png, pdf, ps, eps and svg." >> That seems adequate. For saving images, there's less need to require a >> range of formats than if loading them is a key feature. >> >> Thomas > > I thought PIL was out of question because it's abandonware. Pillow is a maintained, drop-in fork: http://pypi.python.org/pypi/Pillow/ -- Robert Kern From takowl at gmail.com Wed Oct 3 18:42:35 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Wed, 3 Oct 2012 23:42:35 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: <506CBBCF.8080608@uci.edu> References: <506CA8F3.2000500@uci.edu> <506CBBCF.8080608@uci.edu> Message-ID: On 3 October 2012 23:27, Christoph Gohlke wrote: > Did anyone check if the triple-licensing option of FreeImage (GPLv2, > GPLv3, or FIPL) is compatible with the Scipy stack? Also, FreeImage is > not a Python package. IANAL, but I think the FIPL is acceptable. It looks roughly equivalent to LGPL. http://freeimage.sourceforge.net/freeimage-license.txt > Pdf, ps, eps and svg are vector graphics formats, not adequate for image IO. For saving plots, vector formats + png seems adequate to me. PNG is lossless, so it can be converted to other raster formats if there's a specific need. And the standard is a minimum: distributions are free to support other image formats beyond these. For loading images, I agree that these options would not be adequate - at least JPEG support is important. But if scikits-image is not included, loading image files is not a key concern, so I don't think we need to specify it. Thanks, Thomas From cgohlke at uci.edu Wed Oct 3 19:11:50 2012 From: cgohlke at uci.edu (Christoph Gohlke) Date: Wed, 03 Oct 2012 16:11:50 -0700 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: <506CA8F3.2000500@uci.edu> <506CBBCF.8080608@uci.edu> Message-ID: <506CC636.7080402@uci.edu> On 10/3/2012 3:34 PM, Robert Kern wrote: > On Wed, Oct 3, 2012 at 11:27 PM, Christoph Gohlke wrote: >> On 10/3/2012 3:09 PM, Thomas Kluyver wrote: >>> On 3 October 2012 22:06, Christoph Gohlke wrote: >>>> it was mentioned before: none of the suggested packages can read or >>>> write image files on their own, except for matplotlib's built-in PNG >>>> support. Matplotlib, Scipy and skimage depend on other, optional >>>> packages or binaries for image I/O: PIL, FreeImage, GDAL, PyQt. >>> >>> If we include scikits-image (which looks unlikely based on the current >>> poll results), we had agreed to specify FreeImage, or possibly one of >>> FreeImage and PIL. >>> >>> Matplotlib will need at least one backend installed, and the >>> documentation says "Most backends support png, pdf, ps, eps and svg." >>> That seems adequate. For saving images, there's less need to require a >>> range of formats than if loading them is a key feature. >>> >>> Thomas >> >> I thought PIL was out of question because it's abandonware. > > Pillow is a maintained, drop-in fork: > > http://pypi.python.org/pypi/Pillow/ > Seriously, only few of PIL's bugs have been fixed in Pillow (it's a fork to "foster packaging improvements"), there's no support for Python 3, no new features are planned, and the test suite was removed. 
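As an aside on the built-in PNG support mentioned earlier in this thread: matplotlib can read and write PNG files directly, without PIL, Pillow or FreeImage. A minimal sketch (the file name is invented for illustration):

#### begin code sketch ####
import numpy as np
import matplotlib.pyplot as plt

img = np.random.rand(64, 64, 3)        # an RGB image as floats in [0, 1]
plt.imsave('example.png', img)         # PNG writing is built into matplotlib
loaded = plt.imread('example.png')     # PNGs are read back as float arrays
print(loaded.shape)
#### end code sketch ####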
Christoph From josef.pktd at gmail.com Wed Oct 3 21:00:10 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 3 Oct 2012 21:00:10 -0400 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On Wed, Oct 3, 2012 at 12:06 PM, Thomas Kluyver wrote: > Following on from recent discussion here and on the numfocus list, I'm > trying to work out the set of packages that should make up a > standardised 'scipy stack'. We've determined that Python, numpy, > scipy, matplotlib and IPython are to be included. Then there's a list > that have got a 'maybe': pandas, statsmodels, sympy, scikits-learn, > scikits-image, PyTables, h5py, NetworkX, nose, basemap & netCDF4. > > My aim is to have a general set of packages that you can do useful > work with, and will stand up to the competition (particularly Matlab & > R), but without gaining too many subject-specific packages. But I > don't know what's generally useful and what's subject specific. > > Vote at: http://www.doodle.com/ma6rnpnbfc6wivu9 > > It's set up so you can vote for or against a package, or abstain if > you're not sure - I've abstained on most of them myself. Why I'm in favor of a "Big Scipy": Using Travis's popularity criterion: google has for "from scipy import stats" "About 104,000 results" scipy.stats is a bit of an outlier among the scipy subpackages in that it is more application oriented. I uses many tools from other scipy.subpackages. scipy.stats is in turn used by many application packages, if they don't want to bother coding a version of the statistics themselves. If you are in a field with a strong python background, then there are field specific packages available, cars, sherpa in the recent spectra discussion, nipy/pymvpa, pysal, ... If you are not in one of those python fields (or want to try something non-standard), then you have to use a general purpose library, or code it yourself. scikit-learn, statsmodels and scikit-image try to be the general purpose extension of scipy (the package), and there is a lot of useful and reusable code. for example, clustering with sklearn http://spikesort.org/docs/intro.html#installation a linear regression, or a polyfit if you have outliers use statsmodels that's not field specific. (I'm not using scikits-image, but I assume there are similar features, given the mailing list) (I would also like to use a scikits-signal, but it's still is vapor-ware.) As a user I don't care (much) about a new meta-package, python-xy and Gohlke have (almost) all I need an easy_install away, and a lot more than is under discussion here. Where I do see a potentially big advantage as a maintainer of statsmodels is in code sharing and being able to rely on more consistent package versions by users. Currently we are reluctant to add any additional dependencies to statsmodels not only because it requires more work by users, but also because it requires work for us to keep track of changes across versions of the different packages. We currently maintain compatibility modules for python between 2.5 and 3.2, and for numpy >= 1.4, scipy >= 0.7 and pandas > 0.7.1. Increasing the number of dependencies increases the number of version combinations that need to be tested. That's also a good reason for me not to split up scipy, keeping track of the versions of 8 (linalg, optimize, signal, sparse, stats, fftpack, integrate, interpolate, special and maybe some others) packages sounds like a lot of fun. (I wouldn't mind splitting off scipy.stats.) 
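To make the idea of a compatibility module concrete: such a shim is usually just a version-gated import. A minimal sketch follows; the cutoff version and the einsum example are invented for illustration and are not taken from statsmodels.

#### begin code sketch ####
from distutils.version import LooseVersion
import numpy as np

if LooseVersion(np.__version__) >= LooseVersion('1.6.0'):
    from numpy import einsum            # present on newer numpy
else:
    def einsum(*args, **kwargs):        # placeholder on older numpy
        raise NotImplementedError("this code path needs numpy >= 1.6")
#### end code sketch ####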
I would prefer to go the other way, and have a "scipy-big", where I can use any functions from any of the packages without having to worry too much about whether they are available on a users machine or about version compatibilities across packages. As a statsmodels developer I would be glad about the additional advertising and the hopefully faster development of or convergence to a standard through the scipy-stack discussed here, but, at least in the "data-analysis" area, I think we are well on our way to get to the "big-scipy" and fill in the major gaps compared to other languages or data analysis packages. Josef > > Thanks, > Thomas > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From takowl at gmail.com Thu Oct 4 05:38:19 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Thu, 4 Oct 2012 10:38:19 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On 4 October 2012 02:00, wrote: > Where I do see a potentially big advantage as a maintainer of > statsmodels is in code sharing and being able to rely on more > consistent package versions by users. That's a good point: one of my other aims is that packages can more comfortably rely on things in the specification - similar to relying on the Python standard library. For example, I recall statsmodels was looking at adding formula support: I imagine there are tools in Sympy that you could use in this. It looks likely that Sympy will be part of the specification, so maybe there's less need to provide fallback functionality for when it's not installed. Thomas From robert.kern at gmail.com Thu Oct 4 05:43:56 2012 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 4 Oct 2012 10:43:56 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On Thu, Oct 4, 2012 at 10:38 AM, Thomas Kluyver wrote: > On 4 October 2012 02:00, wrote: >> Where I do see a potentially big advantage as a maintainer of >> statsmodels is in code sharing and being able to rely on more >> consistent package versions by users. > > That's a good point: one of my other aims is that packages can more > comfortably rely on things in the specification - similar to relying > on the Python standard library. For example, I recall statsmodels was > looking at adding formula support: I imagine there are tools in Sympy > that you could use in this. It looks likely that Sympy will be part of > the specification, so maybe there's less need to provide fallback > functionality for when it's not installed. Those formulae have very different semantics. Sympy would probably not have saved much, if any, code. http://patsy.readthedocs.org/en/latest/formulas.html -- Robert Kern From takowl at gmail.com Thu Oct 4 06:05:20 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Thu, 4 Oct 2012 11:05:20 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On 4 October 2012 10:43, Robert Kern wrote: > Those formulae have very different semantics. Sympy would probably not > have saved much, if any, code. OK, I guess that was a poor example. But the larger point is being able to depend on a larger set of packages, rather than reimplementing bits of those packages to make those dependencies optional. 
Thomas From indiajoe at gmail.com Thu Oct 4 07:20:56 2012 From: indiajoe at gmail.com (Joe Philip Ninan) Date: Thu, 4 Oct 2012 16:50:56 +0530 Subject: [SciPy-User] Fitting Gaussian in spectra In-Reply-To: References: Message-ID: Hi Matt, Christian, Jerome, Kevin and David. Thanks a lot for all the suggestions. First i apologize for my delay in replying. something was wrong with my subscription, and i was only able to read the emails in archive page. I tried to model the continuum as an exponential function and did least square fit. ( at first i was trying out 2nd degree polynomials) With the iterative masking method suggested. it seems to be doing a good job. I haven't tried on all data set yet. Since the width,position nor amplitude of peaks were not same in all data, peak finding was not easy. But the code by sixtenbe in github https://gist.github.com/1178136 helped me find the peaks _almost_ reliably. Thanking you again for all the help. -cheers joe On 29 September 2012 00:15, Joe Philip Ninan wrote: > Hi, > I have a spectra with multiple gaussian emission lines over a noisy > continuum. > My primary objective is to find areas under all the gaussian peaks. > For that, the following is the algorithm i have in mind. > 1) fit the continuum and subtract it. > 2) find the peaks > 3) do least square fit of gaussian at the peaks to find the area under > each gaussian peaks. > I am basically stuck at the first step itself. Simple 2nd or 3rd order > polynomial fit is not working because the contribution from peaks are > significant. Any tool exist to fit continuum ignoring the peaks? > For finding peaks, i tried find_peaks_cwt in signal module of scipy. But > it seems to be quite sensitive of the width of peak and was picking up > non-existing peaks also. > The wavelet used was default mexican hat. Is there any better wavelet i > should try? > > Or is there any other module in python/scipy which i should give a try? > Thanking you. > -cheers > joe > -- > /--------------------------------------------------------------- > "GNU/Linux: because a PC is a terrible thing to waste" - GNU Generation > > > -- /--------------------------------------------------------------- "GNU/Linux: because a PC is a terrible thing to waste" - GNU Generation ************************************************ Joe Philip Ninan http://sites.google.com/site/jpninan/ Research Scholar /________________\ DAA, | Vadakeparambil | TIFR, | Pullad P.O. | Mumbai-05, India. | Kerala, India | Ph: +917738438212 | PIN:689548 | ------------------------------\_______________/-------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: From alec.kalinin at gmail.com Thu Oct 4 07:25:58 2012 From: alec.kalinin at gmail.com (Alexander Kalinin) Date: Thu, 4 Oct 2012 15:25:58 +0400 Subject: [SciPy-User] Dot product of two arrays of vectors Message-ID: Hello, SciPy, Could you, please, explain me, what is the most standard way in NumPy to calculate a dot product of two arrays of vectors, like in MatLab? 
For example, consider two numpy arrays of vectors: a = np.array([[1, 2, 3], [4, 5, 6]]) b = np.array([[3, 2, 1], [6, 5, 4]]) For the cross product we have convenient function numpy.cross: >>> np.cross(a, b) array([[ -4, 8, -4], [-10, 20, -10]]) But the numpy.dot product for the arrays of vectors do the matrix multiplication: >>> np.dot(a, b) Traceback (most recent call last): File "", line 1, in ValueError: objects are not aligned Yes, I can emulate the dot product code like: np.sum(a * b, axis = 1).reshape(-1, 1) but may be there is exist more standard way to do the dot product? Sincerely, Alexander -------------- next part -------------- An HTML attachment was scrubbed... URL: From cimrman3 at ntc.zcu.cz Thu Oct 4 07:43:47 2012 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Thu, 04 Oct 2012 13:43:47 +0200 Subject: [SciPy-User] Dot product of two arrays of vectors In-Reply-To: References: Message-ID: <506D7673.90204@ntc.zcu.cz> On 10/04/2012 01:25 PM, Alexander Kalinin wrote: > Hello, SciPy, > > Could you, please, explain me, what is the most standard way in NumPy to > calculate a dot product of two arrays of vectors, like in MatLab? For > example, consider two numpy arrays of vectors: > > a = np.array([[1, 2, 3], [4, 5, 6]]) > b = np.array([[3, 2, 1], [6, 5, 4]]) > > For the cross product we have convenient function numpy.cross: >>>> np.cross(a, b) > array([[ -4, 8, -4], > [-10, 20, -10]]) > > But the numpy.dot product for the arrays of vectors do the matrix > multiplication: >>>> np.dot(a, b) > Traceback (most recent call last): > File "", line 1, in > ValueError: objects are not aligned > > Yes, I can emulate the dot product code like: > > np.sum(a * b, axis = 1).reshape(-1, 1) > but may be there is exist more standard way to do the dot product? You could try using: from numpy.core.umath_tests import matrix_multiply if your numpy is recent enough. Cheers, r. From njs at pobox.com Thu Oct 4 08:07:51 2012 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 4 Oct 2012 13:07:51 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On Thu, Oct 4, 2012 at 10:38 AM, Thomas Kluyver wrote: > On 4 October 2012 02:00, wrote: >> Where I do see a potentially big advantage as a maintainer of >> statsmodels is in code sharing and being able to rely on more >> consistent package versions by users. > > That's a good point: one of my other aims is that packages can more > comfortably rely on things in the specification - similar to relying > on the Python standard library. This suggests another possible way of coming up with the base package list... if a package is already included in all of Python(x,y), EPD, Anaconda, Debian, Redhat, then practically speaking it sticking it in the first version of the spec won't cause any problems for anybody, because everyone's already distributing it. But it will document that everyone is distributing it, which is useful for tutorials, making decisions about dependencies, etc. (Python: batteries included!) -n From takowl at gmail.com Thu Oct 4 08:19:58 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Thu, 4 Oct 2012 13:19:58 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On 4 October 2012 13:07, Nathaniel Smith wrote: > This suggests another possible way of coming up with the base package > list... 
if a package is already included in all of > Python(x,y), EPD, Anaconda, Debian, Redhat, distros I'm missing> The the question becomes one of which distros are relevant. If we count EPD Free, for example, only nose (of the packages in the poll) is common to all the distributions at present. For Linux distributions, it's trickier: I have a wealth of packages available from the Ubuntu repositories, but they're mostly not installed by default - I'm not sure if even numpy is in a default installation. The intention is to make a metapackage called something like scipy-stack, which will pull in all the relevant packages. But for now, there's no set of packages you can assume will be installed together. Thomas From cournape at gmail.com Thu Oct 4 08:38:25 2012 From: cournape at gmail.com (David Cournapeau) Date: Thu, 4 Oct 2012 13:38:25 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On Thu, Oct 4, 2012 at 1:19 PM, Thomas Kluyver wrote: > On 4 October 2012 13:07, Nathaniel Smith wrote: >> This suggests another possible way of coming up with the base package >> list... if a package is already included in all of >> Python(x,y), EPD, Anaconda, Debian, Redhat, > distros I'm missing> > > The the question becomes one of which distros are relevant. If we > count EPD Free, for example, only nose (of the packages in the poll) > is common to all the distributions at present. I think Nathaniel meant included in the official repos, not in the single cdrom distribution (otherwise, you would indeed get an near-empty set because of Ubuntu) David From amcmorl at gmail.com Thu Oct 4 08:53:08 2012 From: amcmorl at gmail.com (Angus McMorland) Date: Thu, 4 Oct 2012 08:53:08 -0400 Subject: [SciPy-User] Dot product of two arrays of vectors In-Reply-To: References: Message-ID: On 4 October 2012 07:25, Alexander Kalinin wrote: > Hello, SciPy, > > Could you, please, explain me, what is the most standard way in NumPy to > calculate a dot product of two arrays of vectors, like in MatLab? For > example, consider two numpy arrays of vectors: > > a = np.array([[1, 2, 3], [4, 5, 6]]) > b = np.array([[3, 2, 1], [6, 5, 4]]) > > For the cross product we have convenient function numpy.cross: >>>> np.cross(a, b) > array([[ -4, 8, -4], > [-10, 20, -10]]) > > But the numpy.dot product for the arrays of vectors do the matrix > multiplication: >>>> np.dot(a, b) > Traceback (most recent call last): > File "", line 1, in > ValueError: objects are not aligned > > Yes, I can emulate the dot product code like: > > np.sum(a * b, axis = 1).reshape(-1, 1) > > but may be there is exist more standard way to do the dot product? >From the docstring of dot: "For N dimensions it is a sum product over the last axis of `a` and the second-to-last of `b`:: dot(a, b)[i,j,k,m] = sum(a[i,j,:] * b[k,:,m])" meaning that you want to do np.dot(a, b.T). This gives you the dot product of all combinations of vectors (not just row-wise) between a and b: array([[10, 28], [28, 73]]). You can extract just the row-wise dot products using diag: In: np.diag(np.dot(a, b.T)) Out: array([10, 73]) which is still faster than the summing and reshaping solution. In: %timeit np.sum(a * b, axis = 1).reshape(-1, 1) 100000 loops, best of 3: 5.24 us per loop In: %timeit np.diag(np.dot(a, b.T)) 100000 loops, best of 3: 4.21 us per loop I hope that helps. Angus -- AJC McMorland Post-doctoral research fellow Neurobiology, University of Pittsburgh From johnl at cs.wisc.edu Thu Oct 4 08:57:13 2012 From: johnl at cs.wisc.edu (J. 
David Lee) Date: Thu, 04 Oct 2012 07:57:13 -0500 Subject: [SciPy-User] Fitting Gaussian in spectra In-Reply-To: <20120930105405.9fb85e88.Jerome.Kieffer@esrf.fr> References: <20120930105405.9fb85e88.Jerome.Kieffer@esrf.fr> Message-ID: <506D87A9.80205@cs.wisc.edu> Hi, I know I'm a bit late to the discussion, but I have some experience fitting emission lines. Here's what I've found to work: *) Fit the lines and background together *) Use the simplest reasonable model for the background: constant, linear, etc. --> You could measure the background and construct a model using linear interpolation *) Put the characteristics of your detector in your model: -> Line-width (fwhm) vs energy -> Detector efficiency vs energy *) If you know the possible lines you'll be looking for, put those in your model as well If you don't know what lines to expect, but know the shape of the peaks you're looking for, you might look at using MPOC-MLE, which is described reasonably well in the paper "Pileup Correction Algorithms for Very-High-Count-Rate Gamma-Ray Spectrometry With NaI(Tl) Detectors" by M. Bolic. I've implemented a modified version of the algorithm for counting x-rays from detectors in pulse-mode, and it's the most robust algorithm I've been able to find for that purpose. I hope this helps. David On 09/30/2012 03:54 AM, Jerome Kieffer wrote: > On Sat, 29 Sep 2012 00:15:21 +0530 > Joe Philip Ninan wrote: > >> 1) fit the continuum and subtract it. >> Or is there any other module in python/scipy which i should give a try? >> Thanking you. > Iteratively apply a Savitsky-Golay filter with a large width(>10) and a low order (2). > at the begining you will only smear out the noise then start removing peaks. > > SG filter are really fast to apply. > From takowl at gmail.com Thu Oct 4 09:03:14 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Thu, 4 Oct 2012 14:03:14 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On 4 October 2012 13:38, David Cournapeau wrote: > I think Nathaniel meant included in the official repos, not in the > single cdrom distribution (otherwise, you would indeed get an > near-empty set because of Ubuntu) But if the criterion is 'available from repositories for all relevant distributions', then there's a very large set of packages we could specify. Thomas From pavel.lurye at gmail.com Thu Oct 4 09:05:37 2012 From: pavel.lurye at gmail.com (Pavel Lurye) Date: Thu, 4 Oct 2012 17:05:37 +0400 Subject: [SciPy-User] csr_matrix rows remove Message-ID: Hi, I'm using scipy csr_matrix and I'm trying to figure out what is the simple and fast way to remove a row from such matrix? For example, I have a tuple of rows, that should be deleted. The only way I see, is to generate a tuple of matrix parts and vstack it. Please, help me out with this. Thanks in advance, Pavel. From alec.kalinin at gmail.com Thu Oct 4 09:16:04 2012 From: alec.kalinin at gmail.com (Alexander Kalinin) Date: Thu, 4 Oct 2012 17:16:04 +0400 Subject: [SciPy-User] Dot product of two arrays of vectors In-Reply-To: References: Message-ID: Angus, Thank you for the interesting solution! 
But for large arrays np.diag(np.dot(a, b.T)) is more slower the sum: import time import numpy as np M = 10 N = 3000 a = np.random.rand(N, 3) b = np.random.rand(N, 3) t0 = time.time() for i in range(M): np.sum(a * b, axis = 1).reshape(-1, 1) t1 = time.time() print "{:.3f} s.".format(t1 - t0) t0 = time.time() for i in range(M): np.diag(np.dot(a, b.T)) t1 = time.time() print "{:.3f} s.".format(t1 - t0) Output is: 0.001 s. 0.915 s. Sincerely, Alexander On Thu, Oct 4, 2012 at 4:53 PM, Angus McMorland wrote: > On 4 October 2012 07:25, Alexander Kalinin wrote: > > Hello, SciPy, > > > > Could you, please, explain me, what is the most standard way in NumPy to > > calculate a dot product of two arrays of vectors, like in MatLab? For > > example, consider two numpy arrays of vectors: > > > > a = np.array([[1, 2, 3], [4, 5, 6]]) > > b = np.array([[3, 2, 1], [6, 5, 4]]) > > > > For the cross product we have convenient function numpy.cross: > >>>> np.cross(a, b) > > array([[ -4, 8, -4], > > [-10, 20, -10]]) > > > > But the numpy.dot product for the arrays of vectors do the matrix > > multiplication: > >>>> np.dot(a, b) > > Traceback (most recent call last): > > File "", line 1, in > > ValueError: objects are not aligned > > > > Yes, I can emulate the dot product code like: > > > > np.sum(a * b, axis = 1).reshape(-1, 1) > > > > but may be there is exist more standard way to do the dot product? > > >From the docstring of dot: > > "For N dimensions it is a sum product over the last axis of `a` and > the second-to-last of `b`:: > > dot(a, b)[i,j,k,m] = sum(a[i,j,:] * b[k,:,m])" > > meaning that you want to do > > np.dot(a, b.T). > > This gives you the dot product of all combinations of vectors (not > just row-wise) between a and b: > > array([[10, 28], > [28, 73]]). > > You can extract just the row-wise dot products using diag: > > In: np.diag(np.dot(a, b.T)) > Out: array([10, 73]) > > which is still faster than the summing and reshaping solution. > > In: %timeit np.sum(a * b, axis = 1).reshape(-1, 1) > 100000 loops, best of 3: 5.24 us per loop > > In: %timeit np.diag(np.dot(a, b.T)) > 100000 loops, best of 3: 4.21 us per loop > > I hope that helps. > > Angus > -- > AJC McMorland > Post-doctoral research fellow > Neurobiology, University of Pittsburgh > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Thu Oct 4 09:18:23 2012 From: cournape at gmail.com (David Cournapeau) Date: Thu, 4 Oct 2012 14:18:23 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On Thu, Oct 4, 2012 at 2:03 PM, Thomas Kluyver wrote: > On 4 October 2012 13:38, David Cournapeau wrote: >> I think Nathaniel meant included in the official repos, not in the >> single cdrom distribution (otherwise, you would indeed get an >> near-empty set because of Ubuntu) > > But if the criterion is 'available from repositories for all relevant > distributions', then there's a very large set of packages we could > specify. I thought the idea was closer to take the intersection of all the distros (rh, ubuntu, epd free, anaconda, etc...) as a working basis. 
David From alec.kalinin at gmail.com Thu Oct 4 09:26:13 2012 From: alec.kalinin at gmail.com (Alexander Kalinin) Date: Thu, 4 Oct 2012 17:26:13 +0400 Subject: [SciPy-User] Dot product of two arrays of vectors In-Reply-To: <506D7673.90204@ntc.zcu.cz> References: <506D7673.90204@ntc.zcu.cz> Message-ID: Could you, please, explain me more about matrix_multiply? I tried the following: >>> import numpy.core.umath_tests as ut >>> ut.matrix_multiply.signature '(m,n),(n,p)->(m,p)' >>> So, I see the the matrix_multiply is the usual matrix product. Sincerely, Alexander On Thu, Oct 4, 2012 at 3:43 PM, Robert Cimrman wrote: > On 10/04/2012 01:25 PM, Alexander Kalinin wrote: > > Hello, SciPy, > > > > Could you, please, explain me, what is the most standard way in NumPy to > > calculate a dot product of two arrays of vectors, like in MatLab? For > > example, consider two numpy arrays of vectors: > > > > a = np.array([[1, 2, 3], [4, 5, 6]]) > > b = np.array([[3, 2, 1], [6, 5, 4]]) > > > > For the cross product we have convenient function numpy.cross: > >>>> np.cross(a, b) > > array([[ -4, 8, -4], > > [-10, 20, -10]]) > > > > But the numpy.dot product for the arrays of vectors do the matrix > > multiplication: > >>>> np.dot(a, b) > > Traceback (most recent call last): > > File "", line 1, in > > ValueError: objects are not aligned > > > > Yes, I can emulate the dot product code like: > > > > np.sum(a * b, axis = 1).reshape(-1, 1) > > but may be there is exist more standard way to do the dot product? > > You could try using: > > from numpy.core.umath_tests import matrix_multiply > > if your numpy is recent enough. > > Cheers, > r. > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From opossumnano at gmail.com Thu Oct 4 09:26:13 2012 From: opossumnano at gmail.com (Tiziano Zito) Date: Thu, 4 Oct 2012 15:26:13 +0200 (CEST) Subject: [SciPy-User] =?utf-8?q?=5BANN=5D_MDP-3=2E3_released!?= Message-ID: <20121004132613.A5E3512E00D5@comms.bccn-berlin.de> We are glad to announce release 3.3 of the Modular toolkit for Data Processing (MDP). This a bug-fix release, all current users are invited to upgrade. MDP is a Python library of widely used data processing algorithms that can be combined according to a pipeline analogy to build more complex data processing software. The base of available algorithms includes signal processing methods (Principal Component Analysis, Independent Component Analysis, Slow Feature Analysis), manifold learning methods ([Hessian] Locally Linear Embedding), several classifiers, probabilistic methods (Factor Analysis, RBM), data pre-processing methods, and many others. What's new in version 3.3? -------------------------- - support sklearn versions up to 0.12 - cleanly support reload - fail gracefully if pp server does not start - several bug-fixes and improvements Resources --------- Download: http://sourceforge.net/projects/mdp-toolkit/files Homepage: http://mdp-toolkit.sourceforge.net Mailing list: http://lists.sourceforge.net/mailman/listinfo/mdp-toolkit-users Acknowledgments --------------- We thank the contributors to this release: Philip DeBoer, Yaroslav Halchenko. 
The MDP developers, Pietro Berkes Zbigniew J?drzejewski-Szmek Rike-Benjamin Schuppner Niko Wilbert Tiziano Zito From takowl at gmail.com Thu Oct 4 09:27:15 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Thu, 4 Oct 2012 14:27:15 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: OK, based on the responses so far from the poll, here's a new draft of the standard. It's rather smaller than the previous draft (when we were using the name Pylab), but not completely minimalist. I'm fairly happy with the general shape of it. https://gist.github.com/3833499 The biggest remaining question (as I see it) is the hdf5 libraries. Both have got a somewhat mixed response on the poll, although h5py has a bit more support than PyTables. This did come up before, but let's hear more voices on the question. Should we specify neither, one, or both? Thanks, From gnurser at gmail.com Thu Oct 4 09:29:31 2012 From: gnurser at gmail.com (George Nurser) Date: Thu, 4 Oct 2012 14:29:31 +0100 Subject: [SciPy-User] Dot product of two arrays of vectors In-Reply-To: References: <506D7673.90204@ntc.zcu.cz> Message-ID: Tensordot may be what you're after. It gives a lot of flexibility. cheers, George. On 4 October 2012 14:26, Alexander Kalinin wrote: > Could you, please, explain me more about matrix_multiply? I tried the > following: > >>>> import numpy.core.umath_tests as ut >>>> ut.matrix_multiply.signature > '(m,n),(n,p)->(m,p)' >>>> > > So, I see the the matrix_multiply is the usual matrix product. > > Sincerely, > Alexander > > > On Thu, Oct 4, 2012 at 3:43 PM, Robert Cimrman wrote: >> >> On 10/04/2012 01:25 PM, Alexander Kalinin wrote: >> > Hello, SciPy, >> > >> > Could you, please, explain me, what is the most standard way in NumPy to >> > calculate a dot product of two arrays of vectors, like in MatLab? For >> > example, consider two numpy arrays of vectors: >> > >> > a = np.array([[1, 2, 3], [4, 5, 6]]) >> > b = np.array([[3, 2, 1], [6, 5, 4]]) >> > >> > For the cross product we have convenient function numpy.cross: >> >>>> np.cross(a, b) >> > array([[ -4, 8, -4], >> > [-10, 20, -10]]) >> > >> > But the numpy.dot product for the arrays of vectors do the matrix >> > multiplication: >> >>>> np.dot(a, b) >> > Traceback (most recent call last): >> > File "", line 1, in >> > ValueError: objects are not aligned >> > >> > Yes, I can emulate the dot product code like: >> > >> > np.sum(a * b, axis = 1).reshape(-1, 1) >> > but may be there is exist more standard way to do the dot product? >> >> You could try using: >> >> from numpy.core.umath_tests import matrix_multiply >> >> if your numpy is recent enough. >> >> Cheers, >> r. >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From cimrman3 at ntc.zcu.cz Thu Oct 4 09:33:39 2012 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Thu, 04 Oct 2012 15:33:39 +0200 Subject: [SciPy-User] Dot product of two arrays of vectors In-Reply-To: References: <506D7673.90204@ntc.zcu.cz> Message-ID: <506D9033.2030904@ntc.zcu.cz> On 10/04/2012 03:26 PM, Alexander Kalinin wrote: > Could you, please, explain me more about matrix_multiply? 
I tried the > following: > >>>> import numpy.core.umath_tests as ut >>>> ut.matrix_multiply.signature > '(m,n),(n,p)->(m,p)' >>>> > > So, I see the the matrix_multiply is the usual matrix product. Yes, but the important part is the "on last two dimensions" part of the docstring: In [5]: a = np.ones((5, 2)) In [6]: b = 2 * a In [7]: a Out[7]: array([[ 1., 1.], [ 1., 1.], [ 1., 1.], [ 1., 1.], [ 1., 1.]]) In [8]: b Out[8]: array([[ 2., 2.], [ 2., 2.], [ 2., 2.], [ 2., 2.], [ 2., 2.]]) In [17]: matrix_multiply(a[:, None, :], b[:, :, None]).squeeze() Out[17]: array([ 4., 4., 4., 4., 4.]) r. > Sincerely, > Alexander > > On Thu, Oct 4, 2012 at 3:43 PM, Robert Cimrman wrote: > >> On 10/04/2012 01:25 PM, Alexander Kalinin wrote: >>> Hello, SciPy, >>> >>> Could you, please, explain me, what is the most standard way in NumPy to >>> calculate a dot product of two arrays of vectors, like in MatLab? For >>> example, consider two numpy arrays of vectors: >>> >>> a = np.array([[1, 2, 3], [4, 5, 6]]) >>> b = np.array([[3, 2, 1], [6, 5, 4]]) >>> >>> For the cross product we have convenient function numpy.cross: >>>>>> np.cross(a, b) >>> array([[ -4, 8, -4], >>> [-10, 20, -10]]) >>> >>> But the numpy.dot product for the arrays of vectors do the matrix >>> multiplication: >>>>>> np.dot(a, b) >>> Traceback (most recent call last): >>> File "", line 1, in >>> ValueError: objects are not aligned >>> >>> Yes, I can emulate the dot product code like: >>> >>> np.sum(a * b, axis = 1).reshape(-1, 1) >>> but may be there is exist more standard way to do the dot product? >> >> You could try using: >> >> from numpy.core.umath_tests import matrix_multiply >> >> if your numpy is recent enough. >> >> Cheers, >> r. >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From cimrman3 at ntc.zcu.cz Thu Oct 4 09:36:17 2012 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Thu, 04 Oct 2012 15:36:17 +0200 Subject: [SciPy-User] Dot product of two arrays of vectors In-Reply-To: References: <506D7673.90204@ntc.zcu.cz> Message-ID: <506D90D1.5050104@ntc.zcu.cz> Or the ultimate weapon: np.einsum(). But I suspect matrix_multiply() to be faster. r. On 10/04/2012 03:29 PM, George Nurser wrote: > Tensordot may be what you're after. It gives a lot of flexibility. > cheers, George. > > On 4 October 2012 14:26, Alexander Kalinin wrote: >> Could you, please, explain me more about matrix_multiply? I tried the >> following: >> >>>>> import numpy.core.umath_tests as ut >>>>> ut.matrix_multiply.signature >> '(m,n),(n,p)->(m,p)' >>>>> >> >> So, I see the the matrix_multiply is the usual matrix product. >> >> Sincerely, >> Alexander >> >> >> On Thu, Oct 4, 2012 at 3:43 PM, Robert Cimrman wrote: >>> >>> On 10/04/2012 01:25 PM, Alexander Kalinin wrote: >>>> Hello, SciPy, >>>> >>>> Could you, please, explain me, what is the most standard way in NumPy to >>>> calculate a dot product of two arrays of vectors, like in MatLab? 
For >>>> example, consider two numpy arrays of vectors: >>>> >>>> a = np.array([[1, 2, 3], [4, 5, 6]]) >>>> b = np.array([[3, 2, 1], [6, 5, 4]]) >>>> >>>> For the cross product we have convenient function numpy.cross: >>>>>>> np.cross(a, b) >>>> array([[ -4, 8, -4], >>>> [-10, 20, -10]]) >>>> >>>> But the numpy.dot product for the arrays of vectors do the matrix >>>> multiplication: >>>>>>> np.dot(a, b) >>>> Traceback (most recent call last): >>>> File "", line 1, in >>>> ValueError: objects are not aligned >>>> >>>> Yes, I can emulate the dot product code like: >>>> >>>> np.sum(a * b, axis = 1).reshape(-1, 1) >>>> but may be there is exist more standard way to do the dot product? >>> >>> You could try using: >>> >>> from numpy.core.umath_tests import matrix_multiply >>> >>> if your numpy is recent enough. >>> >>> Cheers, >>> r. >>> >>> _______________________________________________ >>> SciPy-User mailing list >>> SciPy-User at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-user >> >> >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From takowl at gmail.com Thu Oct 4 09:40:43 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Thu, 4 Oct 2012 14:40:43 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On 4 October 2012 14:27, Thomas Kluyver wrote: > https://gist.github.com/3833499 For reference, a few notes on how it matches up to existing distributions. - Anaconda, EPD full & WinPython already meet that list - EPD Free does not currently include pandas or sympy. - Python(x,y) has older versions of pandas & IPython, but a new release is coming soon. - Ubuntu has older versions of the scipy library, pandas & IPython. The new release later this month will have the requisite versions of all three. Thanks, Thomas From josef.pktd at gmail.com Thu Oct 4 09:50:20 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 4 Oct 2012 09:50:20 -0400 Subject: [SciPy-User] Fitting Gaussian in spectra In-Reply-To: References: Message-ID: On Thu, Oct 4, 2012 at 7:20 AM, Joe Philip Ninan wrote: > Hi Matt, Christian, Jerome, Kevin and David. > Thanks a lot for all the suggestions. > First i apologize for my delay in replying. something was wrong with my > subscription, and i was only able to read the emails in archive page. > I tried to model the continuum as an exponential function and did least > square fit. ( at first i was trying out 2nd degree polynomials) > With the iterative masking method suggested. it seems to be doing a good > job. I haven't tried on all data set yet. > Since the width,position nor amplitude of peaks were not same in all data, > peak finding was not easy. > But the code by sixtenbe in github https://gist.github.com/1178136 helped me > find the peaks _almost_ reliably. > Thanking you again for all the help. Is there some sample data of spectra that has this pattern available somewhere? Kevins iterative dropping/masking method is very similar to least trimmed squares. My impression is that identifying peaks and fitting the continuum/baseline is very similar to outlier detection with robust estimation. statsmodels has robust M-estimation, essentially replacing least squares by a robust loss function. 
I have in preparation for statsmodels, least trimmed squares (which starts with a small subsample and adds observations until only outliers are left), maximum trimmed likelihood (which also works for other models like Poisson) and MM-estimators (which start with least trimmed squares but then switches to M-estimation to get higher efficiency in the normal case.) Caveat: so far only for models that are linear in parameters. With some sample data we could try if any of our robust estimators would help in this case. Josef > -cheers > joe > > > On 29 September 2012 00:15, Joe Philip Ninan wrote: >> >> Hi, >> I have a spectra with multiple gaussian emission lines over a noisy >> continuum. >> My primary objective is to find areas under all the gaussian peaks. >> For that, the following is the algorithm i have in mind. >> 1) fit the continuum and subtract it. >> 2) find the peaks >> 3) do least square fit of gaussian at the peaks to find the area under >> each gaussian peaks. >> I am basically stuck at the first step itself. Simple 2nd or 3rd order >> polynomial fit is not working because the contribution from peaks are >> significant. Any tool exist to fit continuum ignoring the peaks? >> For finding peaks, i tried find_peaks_cwt in signal module of scipy. But >> it seems to be quite sensitive of the width of peak and was picking up >> non-existing peaks also. >> The wavelet used was default mexican hat. Is there any better wavelet i >> should try? >> >> Or is there any other module in python/scipy which i should give a try? >> Thanking you. >> -cheers >> joe >> -- >> /--------------------------------------------------------------- >> "GNU/Linux: because a PC is a terrible thing to waste" - GNU Generation >> >> > > > > -- > /--------------------------------------------------------------- > "GNU/Linux: because a PC is a terrible thing to waste" - GNU Generation > > ************************************************ > Joe Philip Ninan http://sites.google.com/site/jpninan/ > Research Scholar /________________\ > DAA, | Vadakeparambil | > TIFR, | Pullad P.O. | > Mumbai-05, India. | Kerala, India | > Ph: +917738438212 | PIN:689548 | > ------------------------------\_______________/-------------- > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From e.antero.tammi at gmail.com Thu Oct 4 10:27:04 2012 From: e.antero.tammi at gmail.com (eat) Date: Thu, 4 Oct 2012 17:27:04 +0300 Subject: [SciPy-User] Dot product of two arrays of vectors In-Reply-To: <506D90D1.5050104@ntc.zcu.cz> References: <506D7673.90204@ntc.zcu.cz> <506D90D1.5050104@ntc.zcu.cz> Message-ID: Hi, On Thu, Oct 4, 2012 at 4:36 PM, Robert Cimrman wrote: > Or the ultimate weapon: np.einsum(). But I suspect matrix_multiply() to be > faster. > FWIW, indeed it's at least faster than sum() based, like: In []: from numpy.core.umath_tests import matrix_multiply as mm In []: f0= lambda a, b: mm(a[:, None, :], b[:, :, None]).squeeze() In []: f1= lambda a, b: np.sum(a* b, axis= 1).reshape(-1, 1).squeeze() In []: n= 1000 In []: a, b= rand(n, 3), rand(n, 3) In []: allclose(f0(a, b), f1(a, b)) Out[]: True In []: %timeit f0(a, b) 10000 loops, best of 3: 47.2 us per loop In []: %timeit f1(a, b) 10000 loops, best of 3: 58 us per loop In []: n= 5000 In []: a, b= rand(n, 3), rand(n, 3) In []: %timeit f0(a, b) 10000 loops, best of 3: 178 us per loop In []: %timeit f1(a, b) 1000 loops, best of 3: 225 us per loop My 2 cents, -eat > > r. 
> > On 10/04/2012 03:29 PM, George Nurser wrote: > > Tensordot may be what you're after. It gives a lot of flexibility. > > cheers, George. > > > > On 4 October 2012 14:26, Alexander Kalinin > wrote: > >> Could you, please, explain me more about matrix_multiply? I tried the > >> following: > >> > >>>>> import numpy.core.umath_tests as ut > >>>>> ut.matrix_multiply.signature > >> '(m,n),(n,p)->(m,p)' > >>>>> > >> > >> So, I see the the matrix_multiply is the usual matrix product. > >> > >> Sincerely, > >> Alexander > >> > >> > >> On Thu, Oct 4, 2012 at 3:43 PM, Robert Cimrman > wrote: > >>> > >>> On 10/04/2012 01:25 PM, Alexander Kalinin wrote: > >>>> Hello, SciPy, > >>>> > >>>> Could you, please, explain me, what is the most standard way in NumPy > to > >>>> calculate a dot product of two arrays of vectors, like in MatLab? For > >>>> example, consider two numpy arrays of vectors: > >>>> > >>>> a = np.array([[1, 2, 3], [4, 5, 6]]) > >>>> b = np.array([[3, 2, 1], [6, 5, 4]]) > >>>> > >>>> For the cross product we have convenient function numpy.cross: > >>>>>>> np.cross(a, b) > >>>> array([[ -4, 8, -4], > >>>> [-10, 20, -10]]) > >>>> > >>>> But the numpy.dot product for the arrays of vectors do the matrix > >>>> multiplication: > >>>>>>> np.dot(a, b) > >>>> Traceback (most recent call last): > >>>> File "", line 1, in > >>>> ValueError: objects are not aligned > >>>> > >>>> Yes, I can emulate the dot product code like: > >>>> > >>>> np.sum(a * b, axis = 1).reshape(-1, 1) > >>>> but may be there is exist more standard way to do the dot product? > >>> > >>> You could try using: > >>> > >>> from numpy.core.umath_tests import matrix_multiply > >>> > >>> if your numpy is recent enough. > >>> > >>> Cheers, > >>> r. > >>> > >>> _______________________________________________ > >>> SciPy-User mailing list > >>> SciPy-User at scipy.org > >>> http://mail.scipy.org/mailman/listinfo/scipy-user > >> > >> > >> > >> _______________________________________________ > >> SciPy-User mailing list > >> SciPy-User at scipy.org > >> http://mail.scipy.org/mailman/listinfo/scipy-user > >> > > _______________________________________________ > > SciPy-User mailing list > > SciPy-User at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-user > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From andrew.collette at gmail.com Thu Oct 4 11:20:20 2012 From: andrew.collette at gmail.com (Andrew Collette) Date: Thu, 4 Oct 2012 09:20:20 -0600 Subject: [SciPy-User] ANN: HDF5 for Python (h5py) 2.1.0-final Message-ID: Announcing HDF5 for Python (h5py) 2.1.0 ======================================= We are proud to announce the availability of HDF5 for Python (h5py) 2.1.0! This release has been a long time coming. Thanks to everyone who contributed code and filed bug reports! What's new in h5py 2.1 ----------------------- * The HDF5 Dimension Scales API is now available, along with high-level integration with Dataset objects. Thanks to D. Dale for implementing this. * Unicode scalar strings can now be stored in attributes. * Dataset objects now expose a .size property giving the total number of elements. * Many performance improvements and bug fixes About the project ----------------------- HDF5 for Python (h5py) is a general-purpose Python interface to the Hierarchical Data Format library, version 5. 
HDF5 is a mature scientific software library originally developed at NCSA, designed for the fast, flexible storage of enormous amounts of data. >From a Python programmer's perspective, HDF5 provides a robust way to store data, organized by name in a tree-like fashion. You can create datasets (arrays on disk) hundreds of gigabytes in size, and perform random-access I/O on desired sections. Datasets are organized in a filesystem-like hierarchy using containers called "groups", and accessed using the traditional POSIX /path/to/resource syntax. Downloads, FAQ and bug tracker are available at Google Code: * Google code site: http://h5py.googlecode.com Documentation is available at Alfven.org: * http://h5py.alfven.org From takowl at gmail.com Thu Oct 4 14:39:53 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Thu, 4 Oct 2012 19:39:53 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On 4 October 2012 14:27, Thomas Kluyver wrote: > The biggest remaining question (as I see it) is the hdf5 libraries. > Both have got a somewhat mixed response on the poll, although h5py has > a bit more support than PyTables. This did come up before, but let's > hear more voices on the question. Should we specify neither, one, or > both? Discussion on the numfocus list has come to the conclusion that we should either specify both h5py and PyTables, or neither. Please register your opinion on this new poll: http://www.misterpoll.com/polls/568484 To be clear, I'm using all these polls to gauge what a larger number of people think. It's like Wikipedia's "!voting" model - the option with the most votes doesn't automatically win, but it's used to form a consensus. Thanks, Thomas From e.antero.tammi at gmail.com Thu Oct 4 15:14:42 2012 From: e.antero.tammi at gmail.com (eat) Date: Thu, 4 Oct 2012 22:14:42 +0300 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: Hi, On Thu, Oct 4, 2012 at 9:39 PM, Thomas Kluyver wrote: > On 4 October 2012 14:27, Thomas Kluyver wrote: > > The biggest remaining question (as I see it) is the hdf5 libraries. > > Both have got a somewhat mixed response on the poll, although h5py has > > a bit more support than PyTables. This did come up before, but let's > > hear more voices on the question. Should we specify neither, one, or > > both? > > Discussion on the numfocus list has come to the conclusion that we > should either specify both h5py and PyTables, or neither. Please > register your opinion on this new poll: > http://www.misterpoll.com/polls/568484 Why do you need to use a polling service that has this potentially malicious requirement that "You must disable safe mode to view this content." Regards, -eat > > To be clear, I'm using all these polls to gauge what a larger number > of people think. It's like Wikipedia's "!voting" model - the option > with the most votes doesn't automatically win, but it's used to form a > consensus. > > Thanks, > Thomas > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Dharhas.Pothina at twdb.texas.gov Thu Oct 4 17:39:22 2012 From: Dharhas.Pothina at twdb.texas.gov (Dharhas Pothina) Date: Thu, 04 Oct 2012 16:39:22 -0500 Subject: [SciPy-User] Scipy stack: standard packages (poll) Message-ID: <506DBBBA0200009B0004CA36@GWWEB.twdb.state.tx.us> Hi, I just voted on the poll, but i think the issue of whether package will be in epd free or not is kinda orthogonal to this discussion. I realize that having all the 'standard' packages in epd free would be an awesome thing but isn't that a business decision enthought needs to make. Epd is not the only way to get packages installed, but it is a very convenient one and if enthought wants to make epd free with a more limited subset of packages and have the full licensed epd be the standard compliant version, I don't see anything really wrong with that. After all they are providing a value added service by doing the cross platform packaging. Dharhas >>> Thomas Kluyver 10/04/12 13:41 PM >>> On 4 October 2012 14:27, Thomas Kluyver wrote: > The biggest remaining question (as I see it) is the hdf5 libraries. > Both have got a somewhat mixed response on the poll, although h5py has > a bit more support than PyTables. This did come up before, but let's > hear more voices on the question. Should we specify neither, one, or > both? Discussion on the numfocus list has come to the conclusion that we should either specify both h5py and PyTables, or neither. Please register your opinion on this new poll: http://www.misterpoll.com/polls/568484 To be clear, I'm using all these polls to gauge what a larger number of people think. It's like Wikipedia's "!voting" model - the option with the most votes doesn't automatically win, but it's used to form a consensus. Thanks, Thomas _______________________________________________ SciPy-User mailing list SciPy-User at scipy.org http://mail.scipy.org/mailman/listinfo/scipy-user From travis at continuum.io Thu Oct 4 18:00:22 2012 From: travis at continuum.io (Travis Oliphant) Date: Thu, 4 Oct 2012 17:00:22 -0500 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: <506DBBBA0200009B0004CA36@GWWEB.twdb.state.tx.us> References: <506DBBBA0200009B0004CA36@GWWEB.twdb.state.tx.us> Message-ID: On Oct 4, 2012, at 4:39 PM, Dharhas Pothina wrote: > Hi, > > I just voted on the poll, but i think the issue of whether package will be in epd free or not is kinda orthogonal to this discussion. I realize that having all the 'standard' packages in epd free would be an awesome thing but isn't that a business decision enthought needs to make. Epd is not the only way to get packages installed, but it is a very convenient one and if enthought wants to make epd free with a more limited subset of packages and have the full licensed epd be the standard compliant version, I don't see anything really wrong with that. After all they are providing a value added service by doing the cross platform packaging. I agree that the poll should not discuss what Enthought is doing with EPD free. That's really quite a different question. Anaconda CE from Continuum is another way you can get all the packages we are discussing in a cross platform way for free. -Travis > > Dharhas > >>>> Thomas Kluyver 10/04/12 13:41 PM >>> > On 4 October 2012 14:27, Thomas Kluyver wrote: >> The biggest remaining question (as I see it) is the hdf5 libraries. >> Both have got a somewhat mixed response on the poll, although h5py has >> a bit more support than PyTables. 
This did come up before, but let's >> hear more voices on the question. Should we specify neither, one, or >> both? > > Discussion on the numfocus list has come to the conclusion that we > should either specify both h5py and PyTables, or neither. Please > register your opinion on this new poll: > http://www.misterpoll.com/polls/568484 > > To be clear, I'm using all these polls to gauge what a larger number > of people think. It's like Wikipedia's "!voting" model - the option > with the most votes doesn't automatically win, but it's used to form a > consensus. > > Thanks, > Thomas > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From david_baddeley at yahoo.com.au Thu Oct 4 19:23:38 2012 From: david_baddeley at yahoo.com.au (David Baddeley) Date: Thu, 4 Oct 2012 16:23:38 -0700 (PDT) Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: <1349393018.20151.YahooMailNeo@web113415.mail.gq1.yahoo.com> I'd personally make it the lowest common denominator of Python(xy), Anaconda, EPD-full (rather than free - I think free has too little to be useful and we're not mandating that any particular scipy-stack implementation ought to be free - but that's probably somewhat contentious), Sage etc. ?As far as linux distros go I'd consider the Debian/Ubuntu repositories (as being the most comprehensive linux distros) but would stop short of requiring packages to be available on RH or other linuxes. If you mandate pip / easy-install you could probably have the metapackage easy-install anything that wasn't available?in the distro. Any linux worth it's salt should be build- capable. To ease installation of my python-microscopy package I wrote an Ubuntu based install script for a 'scipy-stack' like environment?which does this. It can be seen at?http://code.google.com/p/python-microscopy/source/browse/PYME/install_dependencies.py Even though it's been dismissed as 'too hard' I still think there is a strong case for specifying Scipy-stack to include a compiler - EPD manages to do this well on both x32 and x64 using mingw so it's definitely technically possible. More importantly, retro-fitting mingw & msys to an existing distro can be quite painful (the last time I tried I needed to edit the source of distutils to make it invoke mingw by default before it would work in complex build situations or with easy-install etc). In my opinion the omission of a compiler stops the distribution from being easily extendible if the user wants to experiment with other packages (or mandates that someone maintain a scipy-multiverse with compiled versions of all the possible packages and writes a suitable search and install interface). ?I'd love to be able to specify the 'Scipy stack' as a broader alternative to EPD for people installing my packages under windows, but without build capability it's not going to happen. cheers, David ________________________________ From: Thomas Kluyver To: SciPy Users List Sent: Friday, 5 October 2012 1:19 AM Subject: Re: [SciPy-User] Scipy stack: standard packages (poll) On 4 October 2012 13:07, Nathaniel Smith wrote: > This suggests another possible way of coming up with the base package > list... if a package is already included in all of >? 
Python(x,y), EPD, Anaconda, Debian, Redhat, distros I'm missing> The the question becomes one of which distros are relevant. If we count EPD Free, for example, only nose (of the packages in the poll) is common to all the distributions at present. For Linux distributions, it's trickier: I have a wealth of packages available from the Ubuntu repositories, but they're mostly not installed by default - I'm not sure if even numpy is in a default installation. The intention is to make a metapackage called something like scipy-stack, which will pull in all the relevant packages. But for now, there's no set of packages you can assume will be installed together. Thomas _______________________________________________ SciPy-User mailing list SciPy-User at scipy.org http://mail.scipy.org/mailman/listinfo/scipy-user -------------- next part -------------- An HTML attachment was scrubbed... URL: From takowl at gmail.com Thu Oct 4 20:06:15 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Fri, 5 Oct 2012 01:06:15 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: <1349393018.20151.YahooMailNeo@web113415.mail.gq1.yahoo.com> References: <1349393018.20151.YahooMailNeo@web113415.mail.gq1.yahoo.com> Message-ID: On 5 October 2012 00:23, David Baddeley wrote: > Even though it's been dismissed as 'too hard' I still think there is a > strong case for specifying Scipy-stack to include a compiler - EPD manages > to do this well on both x32 and x64 using mingw so it's definitely > technically possible. It's certainly technically possible, but it's nonetheless a major requirement for anyone trying to make a compliant distribution. And you can do a lot of useful stuff without needing a compiler. As for extending the environment, many packages are already available in various compiled forms (.exe installers, debian packages, pypm, etc.). We may well try to standardise a larger environment with a compiler later, but I don't want to get into that at the moment. For now, let's focus on a stack that can be used without needing a compiler. I appreciate that doesn't resolve your build case, but we can't solve every problem at once. Thanks, Thomas From kmichael.aye at gmail.com Wed Oct 3 23:26:44 2012 From: kmichael.aye at gmail.com (Michael Aye) Date: Wed, 3 Oct 2012 20:26:44 -0700 Subject: [SciPy-User] scipy spiking in Google trends? Message-ID: Hi! I noticed that scipy is spiking for a couple of weeks in Google trends. Anybody would know where this comes from? See the curve here: http://www.google.com/trends/explore#q=scipy&cmpt=q Beautiful long term increase this curve shows! ;) Best, Michael From martin.fally at univie.ac.at Thu Oct 4 09:14:28 2012 From: martin.fally at univie.ac.at (Martin Fally) Date: Thu, 4 Oct 2012 13:14:28 +0000 (UTC) Subject: [SciPy-User] Bessel function of complex order References: <4FEC779E.2020009@hasenkopf2000.net> Message-ID: Andreas Pritschet hasenkopf2000.net> writes: > > Hi, > I have noticed in the docs and some "bug reports" that Bessel functions > in SciPy support only real order. But for my work I require a modified > Bessel function of second kind of complex(!) order for complex values. > > Is in SciPy a chance of calculating something like > scipy.special.kv(1j*k,1j), whereby k is an array?? > > Thanks and best regards > Andi hi, I would also need Bessel functions of the first kind of complex order. I found a paper on an algorithm in the NIST DLMF how to calculate it, however, awsome (http://dlmf.nist.gov/bib/K#bib2695). 
Looking forward to a tough programming genious to implement it into SciPy, Martin From pierre.raybaut at gmail.com Fri Oct 5 04:44:30 2012 From: pierre.raybaut at gmail.com (Pierre Raybaut) Date: Fri, 5 Oct 2012 10:44:30 +0200 Subject: [SciPy-User] ANN: WinPython v2.7.3.1 Message-ID: Hi all, WinPython v2.7.3.1 has been released and is available for 32-bit and 64-bit Windows platforms: http://code.google.com/p/winpython/ WinPython is a free open-source portable distribution of Python for Windows, designed for scientists. It is a full-featured (see http://code.google.com/p/winpython/wiki/PackageIndex) Python-based scientific environment: * Designed for scientists (thanks to the integrated libraries NumPy, SciPy, Matplotlib, guiqwt, etc.: * Regular *scientific users*: interactive data processing and visualization using Python with Spyder * *Advanced scientific users and software developers*: Python applications development with Spyder, version control with Mercurial and other development tools (like gettext) * *Portable*: preconfigured, it should run out of the box on any machine under Windows (without any installation requirements) and the folder containing WinPython can be moved to any location (local, network or removable drive) * *Flexible*: one can install (or should I write "use" as it's portable) as many WinPython versions as necessary (like isolated and self-consistent environments), even if those versions are running different versions of Python (2.7, 3.x in the near future) or different architectures (32bit or 64bit) on the same machine * *Customizable*: using the integrated package manager (wppm, as WinPython Package Manager), it's possible to install, uninstall or upgrade Python packages (see http://code.google.com/p/winpython/wiki/WPPM for more details on supported package formats). *WinPython is not an attempt to replace Python(x,y)*, this is just something different (see http://code.google.com/p/winpython/wiki/Roadmap): more flexible, easier to maintain, movable and less invasive for the OS, but certainly less user-friendly, with less packages/contents and without any integration to Windows explorer [*]. [*] Actually there is an optional integration into Windows explorer, providing the same features as the official Python installer regarding file associations and context menu entry (this option may be activated through the WinPython Control Panel). Enjoy! From eric.moore2 at nih.gov Fri Oct 5 08:56:56 2012 From: eric.moore2 at nih.gov (Moore, Eric (NIH/NIDDK) [F]) Date: Fri, 5 Oct 2012 08:56:56 -0400 Subject: [SciPy-User] Bessel function of complex order In-Reply-To: References: <4FEC779E.2020009@hasenkopf2000.net> Message-ID: > -----Original Message----- > From: Martin Fally [mailto:martin.fally at univie.ac.at] > Sent: Thursday, October 04, 2012 9:14 AM > To: scipy-user at scipy.org > Subject: Re: [SciPy-User] Bessel function of complex order > > Andreas Pritschet hasenkopf2000.net> writes: > > > > > Hi, > > I have noticed in the docs and some "bug reports" that Bessel > functions > > in SciPy support only real order. But for my work I require a > modified > > Bessel function of second kind of complex(!) order for complex > values. > > > > Is in SciPy a chance of calculating something like > > scipy.special.kv(1j*k,1j), whereby k is an array?? > > > > Thanks and best regards > > Andi > > hi, > I would also need Bessel functions of the first kind of complex order. 
> I found a > paper on an algorithm in the NIST DLMF how to calculate it, however, > awsome > (http://dlmf.nist.gov/bib/K#bib2695). > > Looking forward to a tough programming genious to implement it into > SciPy, > Martin > That algorithm (#877), and many others are available for download at: http://www.cs.kent.ac.uk/people/staff/trh/CALGO/ I don't think that the license of the files there would allow it to be directly included in SciPy, but depending on your needs that implementation might save you some work. Eric From sextonhadoop at gmail.com Thu Oct 4 19:52:32 2012 From: sextonhadoop at gmail.com (Ed Sexton) Date: Thu, 4 Oct 2012 16:52:32 -0700 Subject: [SciPy-User] scipy installation error: fblas.so: undefined symbol: s_stop Message-ID: Dear Scipy Users- I am trying to compile from source scipy (with lapack and blas) and numpy - but when executing scipy.test() I receive "undefined symbol: s_stop" errors on fblas.so: Could someone please advise if I have an incompatibility with a compiler or software version? I am stuck using "Red Hat Enterprise Linux Server release 6.3". Once I have this working on one system, my next task is to roll this out to 600 more servers. Your help would be GREATLY appreciated with helping me overcome this error. *ERROR*: Failure: ImportError (/usr/lib64/python2.6/site-packages/scipy/lib/blas/fblas.so: undefined symbol: s_stop) *SOFTWARE VERSIONS*: numpy-1.6.2 scipy-0.11.0 blas lapack-3.4.2 scikit-learn-0.12 *SYSTEM ENVIRONMENT*: # python -c 'from numpy.f2py.diagnose import run; run()' ------ os.name='posix' ------ sys.platform='linux2' ------ sys.version: 2.6.6 (r266:84292, May 1 2012, 13:52:17) [GCC 4.4.6 20110731 (Red Hat 4.4.6-3)] ------ sys.prefix: /usr ------ sys.path=':/usr/lib64/python26.zip:/usr/lib64/python2.6:/usr/lib64/python2.6/plat-linux2:/usr/lib64/python2.6/lib-tk:/usr/lib64/python2.6/lib-old:/usr/lib64/python2.6/lib-dynload:/usr/lib64/python2.6/site-packages:/usr/lib64/python2.6/site-packages/gtk-2.0:/usr/lib/python2.6/site-packages' ------ Found new numpy version '1.6.2' in /usr/lib64/python2.6/site-packages/numpy/__init__.pyc Found f2py2e version '2' in /usr/lib64/python2.6/site-packages/numpy/f2py/f2py2e.pyc Found numpy.distutils version '0.4.0' in '/usr/lib64/python2.6/site-packages/numpy/distutils/__init__.pyc' ------ Importing numpy.distutils.fcompiler ... 
ok ------ Checking availability of supported Fortran compilers: GnuFCompiler instance properties: archiver = ['/usr/bin/g77', '-cr'] compile_switch = '-c' compiler_f77 = ['/usr/bin/g77', '-g', '-Wall', '-fno-second- underscore', '-fPIC', '-O3', '-funroll-loops'] compiler_f90 = None compiler_fix = None libraries = ['g2c'] library_dirs = [] linker_exe = ['/usr/bin/g77', '-g', '-Wall', '-g', '-Wall'] linker_so = ['/usr/bin/g77', '-g', '-Wall', '-g', '-Wall', '- shared'] object_switch = '-o ' ranlib = ['/usr/bin/g77'] version = LooseVersion ('3.4.6') version_cmd = ['/usr/bin/g77', '--version'] Gnu95FCompiler instance properties: archiver = ['/usr/bin/gfortran', '-cr'] compile_switch = '-c' compiler_f77 = ['/usr/bin/gfortran', '-Wall', '-ffixed-form', '-fno- second-underscore', '-fPIC', '-O3', '-funroll-loops'] compiler_f90 = ['/usr/bin/gfortran', '-Wall', '-fno-second-underscore', '-fPIC', '-O3', '-funroll-loops'] compiler_fix = ['/usr/bin/gfortran', '-Wall', '-ffixed-form', '-fno- second-underscore', '-Wall', '-fno-second-underscore', '- fPIC', '-O3', '-funroll-loops'] libraries = ['gfortran'] library_dirs = [] linker_exe = ['/usr/bin/gfortran', '-Wall', '-Wall'] linker_so = ['/usr/bin/gfortran', '-Wall', '-Wall', '-shared'] object_switch = '-o ' ranlib = ['/usr/bin/gfortran'] version = LooseVersion ('4.4.6') version_cmd = ['/usr/bin/gfortran', '--version'] Fortran compilers found: --fcompiler=gnu GNU Fortran 77 compiler (3.4.6) --fcompiler=gnu95 GNU Fortran 95 compiler (4.4.6) Compilers available for this platform, but not found: --fcompiler=absoft Absoft Corp Fortran Compiler --fcompiler=compaq Compaq Fortran Compiler --fcompiler=g95 G95 Fortran Compiler --fcompiler=intel Intel Fortran Compiler for 32-bit apps --fcompiler=intele Intel Fortran Compiler for Itanium apps --fcompiler=intelem Intel Fortran Compiler for 64-bit apps --fcompiler=lahey Lahey/Fujitsu Fortran 95 Compiler --fcompiler=nag NAGWare Fortran 95 Compiler --fcompiler=pathf95 PathScale Fortran Compiler --fcompiler=pg Portland Group Fortran Compiler --fcompiler=vast Pacific-Sierra Research Fortran 90 Compiler Compilers not available on this platform: --fcompiler=hpux HP Fortran 90 Compiler --fcompiler=ibm IBM XL Fortran Compiler --fcompiler=intelev Intel Visual Fortran Compiler for Itanium apps --fcompiler=intelv Intel Visual Fortran Compiler for 32-bit apps --fcompiler=intelvem Intel Visual Fortran Compiler for 64-bit apps --fcompiler=mips MIPSpro Fortran Compiler --fcompiler=none Fake Fortran compiler --fcompiler=sun Sun or Forte Fortran 95 Compiler For compiler details, run 'config_fc --verbose' setup command. ------ Importing numpy.distutils.cpuinfo ... ok ------ CPU information: CPUInfoBase__get_nbits getNCPUs has_mmx has_sse has_sse2 has_sse3 has_ssse3 is_64bit is_Intel is_XEON is_Xeon is_i686 ------ Sincerely, Ed Sexton / PayPal *ERROR*: python Python 2.6.6 (r266:84292, May 1 2012, 13:52:17) [GCC 4.4.6 20110731 (Red Hat 4.4.6-3)] on linux2 Type "help", "copyright", "credits" or "license" for more information. 
>>> import scipy >>> scipy.test() Running unit tests for scipy NumPy version 1.6.2 NumPy is installed in /usr/lib64/python2.6/site-packages/numpy SciPy version 0.11.0 SciPy is installed in /usr/lib64/python2.6/site-packages/scipy Python version 2.6.6 (r266:84292, May 1 2012, 13:52:17) [GCC 4.4.6 20110731 (Red Hat 4.4.6-3)] nose version 1.2.1 ..............................................................................................................................................................................................................................K........................................................................................................K......................E....................................................................................................................................................................................................................................................................................................................................................................................................................................................ESSSSSS...FFFSSSSSS...FFFSSSS....EEE...........SSSSS.K..........S......................................................................................................................................................................................................................................................................................................................................................................................................................................EE........................................................................................EE....................................................................................................................................................................................................................................................................................................................................................................................................K.K.............................................................................................................................................................................................................................................................................................................................................................................................K........K..............SSSSSSS............................E........................................................................................................................................ 
====================================================================== ERROR: Failure: ImportError (/usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop) ---------------------------------------------------------------------- Traceback (most recent call last): File "nose/loader.py", line 390, in loadTestsFromName addr.filename, addr.module) File "nose/importer.py", line 39, in importFromPath return self.importFromDir(dir_path, fqname) File "nose/importer.py", line 86, in importFromDir mod = load_module(part_fqname, fh, filename, desc) File "/usr/lib64/python2.6/site-packages/scipy/interpolate/__init__.py", line 154, in from rbf import Rbf File "/usr/lib64/python2.6/site-packages/scipy/interpolate/rbf.py", line 49, in from scipy import linalg File "/usr/lib64/python2.6/site-packages/scipy/linalg/__init__.py", line 132, in from misc import * File "/usr/lib64/python2.6/site-packages/scipy/linalg/misc.py", line 3, in import fblas ImportError: /usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop ====================================================================== ERROR: Failure: ImportError (/usr/lib64/python2.6/site-packages/scipy/lib/blas/fblas.so: undefined symbol: s_stop) ---------------------------------------------------------------------- Traceback (most recent call last): File "nose/loader.py", line 390, in loadTestsFromName addr.filename, addr.module) File "nose/importer.py", line 39, in importFromPath return self.importFromDir(dir_path, fqname) File "nose/importer.py", line 86, in importFromDir mod = load_module(part_fqname, fh, filename, desc) File "/usr/lib64/python2.6/site-packages/scipy/lib/blas/__init__.py", line 86, in import fblas ImportError: /usr/lib64/python2.6/site-packages/scipy/lib/blas/fblas.so: undefined symbol: s_stop ====================================================================== ERROR: Failure: ImportError (/usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop) ---------------------------------------------------------------------- Traceback (most recent call last): File "nose/loader.py", line 390, in loadTestsFromName addr.filename, addr.module) File "nose/importer.py", line 39, in importFromPath return self.importFromDir(dir_path, fqname) File "nose/importer.py", line 86, in importFromDir mod = load_module(part_fqname, fh, filename, desc) File "/usr/lib64/python2.6/site-packages/scipy/linalg/__init__.py", line 132, in from misc import * File "/usr/lib64/python2.6/site-packages/scipy/linalg/misc.py", line 3, in import fblas ImportError: /usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop ====================================================================== ERROR: test_common.test_pade_trivial ---------------------------------------------------------------------- Traceback (most recent call last): File "nose/case.py", line 197, in runTest self.test(*self.arg) File "/usr/lib64/python2.6/site-packages/scipy/misc/tests/test_common.py", line 9, in test_pade_trivial nump, denomp = pade([1.0], 0) File "/usr/lib64/python2.6/site-packages/scipy/misc/common.py", line 371, in pade from scipy import linalg File "/usr/lib64/python2.6/site-packages/scipy/linalg/__init__.py", line 132, in from misc import * File "/usr/lib64/python2.6/site-packages/scipy/linalg/misc.py", line 3, in import fblas ImportError: /usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop 
====================================================================== ERROR: test_common.test_pade_4term_exp ---------------------------------------------------------------------- Traceback (most recent call last): File "nose/case.py", line 197, in runTest self.test(*self.arg) File "/usr/lib64/python2.6/site-packages/scipy/misc/tests/test_common.py", line 18, in test_pade_4term_exp nump, denomp = pade(an, 0) File "/usr/lib64/python2.6/site-packages/scipy/misc/common.py", line 371, in pade from scipy import linalg File "/usr/lib64/python2.6/site-packages/scipy/linalg/__init__.py", line 132, in from misc import * File "/usr/lib64/python2.6/site-packages/scipy/linalg/misc.py", line 3, in import fblas ImportError: /usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop ====================================================================== ERROR: Failure: ImportError (/usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop) ---------------------------------------------------------------------- Traceback (most recent call last): File "nose/loader.py", line 390, in loadTestsFromName addr.filename, addr.module) File "nose/importer.py", line 39, in importFromPath return self.importFromDir(dir_path, fqname) File "nose/importer.py", line 86, in importFromDir mod = load_module(part_fqname, fh, filename, desc) File "/usr/lib64/python2.6/site-packages/scipy/optimize/__init__.py", line 146, in from _root import * File "/usr/lib64/python2.6/site-packages/scipy/optimize/_root.py", line 17, in import nonlin File "/usr/lib64/python2.6/site-packages/scipy/optimize/nonlin.py", line 116, in from scipy.linalg import norm, solve, inv, qr, svd, lstsq, LinAlgError File "/usr/lib64/python2.6/site-packages/scipy/linalg/__init__.py", line 132, in from misc import * File "/usr/lib64/python2.6/site-packages/scipy/linalg/misc.py", line 3, in import fblas ImportError: /usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop ====================================================================== ERROR: Failure: ImportError (/usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop) ---------------------------------------------------------------------- Traceback (most recent call last): File "nose/loader.py", line 390, in loadTestsFromName addr.filename, addr.module) File "nose/importer.py", line 39, in importFromPath return self.importFromDir(dir_path, fqname) File "nose/importer.py", line 86, in importFromDir mod = load_module(part_fqname, fh, filename, desc) File "/usr/lib64/python2.6/site-packages/scipy/signal/__init__.py", line 218, in from cont2discrete import * File "/usr/lib64/python2.6/site-packages/scipy/signal/cont2discrete.py", line 9, in from scipy import linalg File "/usr/lib64/python2.6/site-packages/scipy/linalg/__init__.py", line 132, in from misc import * File "/usr/lib64/python2.6/site-packages/scipy/linalg/misc.py", line 3, in import fblas ImportError: /usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop ====================================================================== ERROR: Failure: ImportError (/usr/lib64/python2.6/site-packages/scipy/sparse/linalg/isolve/_iterative.so: undefined symbol: s_stop) ---------------------------------------------------------------------- Traceback (most recent call last): File "nose/loader.py", line 390, in loadTestsFromName addr.filename, addr.module) File "nose/importer.py", line 39, in importFromPath return 
self.importFromDir(dir_path, fqname) File "nose/importer.py", line 86, in importFromDir mod = load_module(part_fqname, fh, filename, desc) File "/usr/lib64/python2.6/site-packages/scipy/sparse/linalg/__init__.py", line 90, in from isolve import * File "/usr/lib64/python2.6/site-packages/scipy/sparse/linalg/isolve/__init__.py", line 4, in from iterative import * File "/usr/lib64/python2.6/site-packages/scipy/sparse/linalg/isolve/iterative.py", line 5, in import _iterative ImportError: /usr/lib64/python2.6/site-packages/scipy/sparse/linalg/isolve/_iterative.so: undefined symbol: s_stop ====================================================================== ERROR: Failure: ImportError (/usr/lib64/python2.6/site-packages/scipy/sparse/linalg/isolve/_iterative.so: undefined symbol: s_stop) ---------------------------------------------------------------------- Traceback (most recent call last): File "nose/loader.py", line 390, in loadTestsFromName addr.filename, addr.module) File "nose/importer.py", line 39, in importFromPath return self.importFromDir(dir_path, fqname) File "nose/importer.py", line 86, in importFromDir mod = load_module(part_fqname, fh, filename, desc) File "/usr/lib64/python2.6/site-packages/scipy/sparse/tests/test_base.py", line 34, in from scipy.sparse.linalg import splu File "/usr/lib64/python2.6/site-packages/scipy/sparse/linalg/__init__.py", line 90, in from isolve import * File "/usr/lib64/python2.6/site-packages/scipy/sparse/linalg/isolve/__init__.py", line 4, in from iterative import * File "/usr/lib64/python2.6/site-packages/scipy/sparse/linalg/isolve/iterative.py", line 5, in import _iterative ImportError: /usr/lib64/python2.6/site-packages/scipy/sparse/linalg/isolve/_iterative.so: undefined symbol: s_stop ====================================================================== ERROR: Failure: ImportError (/usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop) ---------------------------------------------------------------------- Traceback (most recent call last): File "nose/loader.py", line 390, in loadTestsFromName addr.filename, addr.module) File "nose/importer.py", line 39, in importFromPath return self.importFromDir(dir_path, fqname) File "nose/importer.py", line 86, in importFromDir mod = load_module(part_fqname, fh, filename, desc) File "/usr/lib64/python2.6/site-packages/scipy/stats/__init__.py", line 321, in from stats import * File "/usr/lib64/python2.6/site-packages/scipy/stats/stats.py", line 194, in import scipy.linalg as linalg File "/usr/lib64/python2.6/site-packages/scipy/linalg/__init__.py", line 132, in from misc import * File "/usr/lib64/python2.6/site-packages/scipy/linalg/misc.py", line 3, in import fblas ImportError: /usr/lib64/python2.6/site-packages/scipy/linalg/fblas.so: undefined symbol: s_stop ====================================================================== FAIL: test_ssyev (test_esv.TestEsv) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib64/python2.6/site-packages/numpy/testing/decorators.py", line 146, in skipper_func return f(*args, **kwargs) File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_esv.py", line 84, in test_ssyev self._test_base('ssyev', 'F') File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_esv.py", line 26, in _test_base assert_array_almost_equal(w, SYEV_REF, decimal=PREC[tp]) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 800, in assert_array_almost_equal 
header=('Arrays are not almost equal to %d decimals' % decimal)) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 636, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not almost equal to 5 decimals (mismatch 100.0%) x: array([-1.1349628 , 2.38857079, 7.74639225], dtype=float32) y: array([-0.66992434, 0.48769389, 9.18223045]) ====================================================================== FAIL: test_ssyevr (test_esv.TestEsv) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib64/python2.6/site-packages/numpy/testing/decorators.py", line 146, in skipper_func return f(*args, **kwargs) File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_esv.py", line 92, in test_ssyevr self._test_base('ssyevr', 'F') File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_esv.py", line 26, in _test_base assert_array_almost_equal(w, SYEV_REF, decimal=PREC[tp]) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 800, in assert_array_almost_equal header=('Arrays are not almost equal to %d decimals' % decimal)) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 636, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not almost equal to 5 decimals (mismatch 100.0%) x: array([-1.13496113, 2.38857222, 7.74639177], dtype=float32) y: array([-0.66992434, 0.48769389, 9.18223045]) ====================================================================== FAIL: test_ssyevr_ranges (test_esv.TestEsv) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib64/python2.6/site-packages/numpy/testing/decorators.py", line 146, in skipper_func return f(*args, **kwargs) File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_esv.py", line 100, in test_ssyevr_ranges self._test_syevr_ranges('ssyevr', 'F') File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_esv.py", line 76, in _test_syevr_ranges self._test_base_irange(func, irange, lang) File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_esv.py", line 47, in _test_base_irange assert_array_almost_equal(w, SYEV_REF[rslice], decimal=PREC[tp]) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 800, in assert_array_almost_equal header=('Arrays are not almost equal to %d decimals' % decimal)) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 636, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not almost equal to 5 decimals (mismatch 100.0%) x: array([-1.13496113, 2.38857222, 7.74639177], dtype=float32) y: array([-0.66992434, 0.48769389, 9.18223045]) ====================================================================== FAIL: test_ssygv_1 (test_gesv.TestSygv) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib64/python2.6/site-packages/numpy/testing/decorators.py", line 146, in skipper_func return f(*args, **kwargs) File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_gesv.py", line 41, in test_ssygv_1 self._test_base('ssygv', 'F', 1) File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_gesv.py", line 29, in _test_base decimal=PREC[tp]) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 800, in assert_array_almost_equal header=('Arrays are not almost equal to %d decimals' % 
decimal)) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 636, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not almost equal to 5 decimals (mismatch 100.0%) x: array([ 0.98113459, 0.95912313, 1.87169743], dtype=float32) y: array([ -0.00000000e+00, 2.52944849e+17, -5.80304557e+19], dtype=float32) ====================================================================== FAIL: test_ssygv_2 (test_gesv.TestSygv) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib64/python2.6/site-packages/numpy/testing/decorators.py", line 146, in skipper_func return f(*args, **kwargs) File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_gesv.py", line 45, in test_ssygv_2 self._test_base('ssygv', 'F', 2) File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_gesv.py", line 32, in _test_base decimal=PREC[tp]) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 800, in assert_array_almost_equal header=('Arrays are not almost equal to %d decimals' % decimal)) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 636, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not almost equal to 5 decimals (mismatch 100.0%) x: array([ 0.58952832, 2.39465809, 5.379776 ], dtype=float32) y: array([-0.60370338, 1.07345402, -0.36409575], dtype=float32) ====================================================================== FAIL: test_ssygv_3 (test_gesv.TestSygv) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib64/python2.6/site-packages/numpy/testing/decorators.py", line 146, in skipper_func return f(*args, **kwargs) File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_gesv.py", line 49, in test_ssygv_3 self._test_base('ssygv', 'F', 3) File "/usr/lib64/python2.6/site-packages/scipy/lib/lapack/tests/test_gesv.py", line 35, in _test_base w[i]*v[:,i], decimal=PREC[tp] - 1) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 800, in assert_array_almost_equal header=('Arrays are not almost equal to %d decimals' % decimal)) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 636, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not almost equal to 4 decimals (mismatch 100.0%) x: array([ 9.9355526 , 8.03703594, 29.26363754], dtype=float32) y: array([ -7.47458363, 10.02687263, -5.13589382], dtype=float32) ---------------------------------------------------------------------- Ran 2334 tests in 27.363s FAILED (KNOWNFAIL=7, SKIP=29, errors=10, failures=6) -------------- next part -------------- An HTML attachment was scrubbed... URL: From gary.ruben at gmail.com Fri Oct 5 11:11:31 2012 From: gary.ruben at gmail.com (gary ruben) Date: Sat, 6 Oct 2012 01:11:31 +1000 Subject: [SciPy-User] Bessel function of complex order In-Reply-To: References: <4FEC779E.2020009@hasenkopf2000.net> Message-ID: You could try the one in mpmath: https://mpmath.googlecode.com/svn/trunk/doc/build/functions/bessel.html#bessely For arrays, you could use it via vectorize. 
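Something along these lines might do it (an untested sketch; the orders in k are made up, the call mirrors the kv(1j*k, 1j) from the original question):

import numpy as np
import mpmath

# wrap mpmath's modified Bessel K so it maps over an array of complex orders
kv_complex = np.vectorize(lambda nu, z: complex(mpmath.besselk(complex(nu), complex(z))),
                          otypes=[complex])

k = np.linspace(0.1, 2.0, 5)
vals = kv_complex(1j * k, 1j)   # K_{ik}(i) for each entry of k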
It accepts complex numbers for the order and value fields although I don't know whether it gives correct answers, Gary R On 5 October 2012 22:56, Moore, Eric (NIH/NIDDK) [F] wrote: >> -----Original Message----- >> From: Martin Fally [mailto:martin.fally at univie.ac.at] >> Sent: Thursday, October 04, 2012 9:14 AM >> To: scipy-user at scipy.org >> Subject: Re: [SciPy-User] Bessel function of complex order >> >> Andreas Pritschet hasenkopf2000.net> writes: >> >> > >> > Hi, >> > I have noticed in the docs and some "bug reports" that Bessel >> functions >> > in SciPy support only real order. But for my work I require a >> modified >> > Bessel function of second kind of complex(!) order for complex >> values. >> > >> > Is in SciPy a chance of calculating something like >> > scipy.special.kv(1j*k,1j), whereby k is an array?? >> > >> > Thanks and best regards >> > Andi >> >> hi, >> I would also need Bessel functions of the first kind of complex order. >> I found a >> paper on an algorithm in the NIST DLMF how to calculate it, however, >> awsome >> (http://dlmf.nist.gov/bib/K#bib2695). >> >> Looking forward to a tough programming genious to implement it into >> SciPy, >> Martin >> > > That algorithm (#877), and many others are available for download at: http://www.cs.kent.ac.uk/people/staff/trh/CALGO/ > > I don't think that the license of the files there would allow it to be directly included in SciPy, but depending on your needs that implementation might save you some work. > > Eric > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From cournape at gmail.com Fri Oct 5 11:17:04 2012 From: cournape at gmail.com (David Cournapeau) Date: Fri, 5 Oct 2012 16:17:04 +0100 Subject: [SciPy-User] scipy installation error: fblas.so: undefined symbol: s_stop In-Reply-To: References: Message-ID: On Fri, Oct 5, 2012 at 12:52 AM, Ed Sexton wrote: > Dear Scipy Users- > > I am trying to compile from source scipy (with lapack and blas) and numpy - > but when executing scipy.test() I receive "undefined symbol: s_stop" errors > on fblas.so: This is most likely because you mixed up g77 and gfortran. You need to compile everything (numpy, scipy, blas, etc...) with the same fortran compiler. David From helmrp at yahoo.com Fri Oct 5 11:47:45 2012 From: helmrp at yahoo.com (The Helmbolds) Date: Fri, 5 Oct 2012 08:47:45 -0700 (PDT) Subject: [SciPy-User] cobyla return status flag In-Reply-To: References: Message-ID: <1349452065.88524.YahooMailNeo@web31808.mail.mud.yahoo.com> I notice that when using 'minimize' with method = 'COBYLA' on my system, the Result object's status flag reads "1.0", although the documentation describes this as an 'int' type. Line 238 in the cobyla.py routine reads: status=info[0] Perhaps info is getting a float from the wrapped Fortran routine. Maybe a simple and unobtrusive fix would be to change that line to: status=int(info[0]) #?? Bob H
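For reference, a minimal sketch that reproduces the observation (the objective and constraint here are made up):

from scipy.optimize import minimize

res = minimize(lambda x: (x[0] - 1.0) ** 2, [0.0], method='COBYLA',
               constraints={'type': 'ineq', 'fun': lambda x: x[0] + 10.0})
print res.status, type(res.status)   # status comes back as 1.0 (a float) rather than an int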
From ralf.gommers at gmail.com Fri Oct 5 16:36:53 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 5 Oct 2012 22:36:53 +0200 Subject: [SciPy-User] cobyla return status flag In-Reply-To: <1349452065.88524.YahooMailNeo@web31808.mail.mud.yahoo.com> References: <1349452065.88524.YahooMailNeo@web31808.mail.mud.yahoo.com> Message-ID: On Fri, Oct 5, 2012 at 5:47 PM, The Helmbolds wrote: > I notice that when using 'minimize' with method = 'COBYLA' on my system, > the Result object's status flag reads "1.0", although the documentation > describes this as an 'int' type. > > Line 238 in the cobyla.py routine reads: status=info[0] > > Perhaps info is getting a float from the wrapped Fortran routine. Maybe a > simple and unobtrusive fix would be to change that line to: > status=int(info[0]) #?? > Sure, that would work. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Fri Oct 5 18:17:35 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 6 Oct 2012 00:17:35 +0200 Subject: [SciPy-User] Eclipse IDE for Java Developers with PyDev - updating scipy In-Reply-To: References: Message-ID: On Wed, Oct 3, 2012 at 3:27 PM, Harshad Surdi wrote: > Hi, > I am using Eclipse IDE for Java Developers with PyDev on Ubuntu 12.04 and > I am quite new to Ubuntu and Eclipse. Can you guide me as to hos to update > scipy version in PyDev in Eclipse? > What version of scipy (or other Python package) isn't related to the IDE you use. Ubuntu 12.04 ships scipy 0.9.0, if you want a newer version you have to install it from source.I would advise to only do that if you know what you're doing and/or really need the newer version. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From denis at laxalde.org Sat Oct 6 02:35:13 2012 From: denis at laxalde.org (Denis Laxalde) Date: Sat, 06 Oct 2012 08:35:13 +0200 Subject: [SciPy-User] cobyla return status flag In-Reply-To: References: <1349452065.88524.YahooMailNeo@web31808.mail.mud.yahoo.com> Message-ID: <506FD121.5020406@laxalde.org> Ralf Gommers a ?crit : >> I notice that when using 'minimize' with method = 'COBYLA' on my system, >> > the Result object's status flag reads "1.0", although the documentation >> > describes this as an 'int' type. >> > >> > Line 238 in the cobyla.py routine reads: status=info[0] >> > >> > Perhaps info is getting a float from the wrapped Fortran routine. Maybe a >> > simple and unobtrusive fix would be to change that line to: >> > status=int(info[0]) #?? >> > > Sure, that would work. Fixed. -- Denis Laxalde From sjm.guzman at gmail.com Fri Oct 5 15:03:28 2012 From: sjm.guzman at gmail.com (Jose Guzman) Date: Fri, 05 Oct 2012 21:03:28 +0200 Subject: [SciPy-User] Fitting to a combination of gaussian functions Message-ID: <506F2F00.6040808@gmail.com> Dear colleagues, I wanted to fit some data to a function that contains the combination of 2 gaussian functions of different widths (the same height and position of the peak). For that I created the following function: def gaussian_func(x, a, b, c1, c2): """ a is the height of curve peak b is the position of the center of the peak c1 is the width for negative values of x c2 is the width for positive values of x """ if x>0: val = a*exp( -( (x-b)**2/(2*c2**2) ) ) else: val = a*exp( -( (x-b)**2/(2*c1**2) ) ) return(val) But when I try to fit the data with scipy.optimize.curve_fit i get the following error: "The truth value of an array with more than one element is ambiguous. 
Use a.any() or a.all()" For example: xdata = np.array([21, 36, 53, 67,60,66, 30,36, 19]) ydata = np.array([-100. -50. -20. -10. 0. 10. 20. 50. 100.]) curve_fit(gaussian_func, xdata, ydata) I guess this is because the function is vectorized. Is there any way to avoid this behaviour or any other way to fit these data ? Thanks in advance Jose -- Jose Guzman http://www.ist.ac.at/~jguzman/ From josef.pktd at gmail.com Sat Oct 6 10:30:48 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 6 Oct 2012 10:30:48 -0400 Subject: [SciPy-User] Fitting to a combination of gaussian functions In-Reply-To: <506F2F00.6040808@gmail.com> References: <506F2F00.6040808@gmail.com> Message-ID: On Fri, Oct 5, 2012 at 3:03 PM, Jose Guzman wrote: > Dear colleagues, > > I wanted to fit some data to a function that contains the combination of > 2 gaussian functions of different widths (the same height and position > of the peak). For that I created the following function: > > > def gaussian_func(x, a, b, c1, c2): > """ > a is the height of curve peak > b is the position of the center of the peak > c1 is the width for negative values of x > c2 is the width for positive values of x > """ > if x>0: this doesn't work if x is an array, you need to assign mask = (x>0) val[mask] = ... val[~mask] = ... Josef > val = a*exp( -( (x-b)**2/(2*c2**2) ) ) > else: > val = a*exp( -( (x-b)**2/(2*c1**2) ) ) > return(val) > > But when I try to fit the data with scipy.optimize.curve_fit i get the > following error: > > "The truth value of an array with more than one element is ambiguous. > Use a.any() or a.all()" > > > For example: > > xdata = np.array([21, 36, 53, 67,60,66, 30,36, 19]) > ydata = np.array([-100. -50. -20. -10. 0. 10. 20. 50. 100.]) > > curve_fit(gaussian_func, xdata, ydata) > > I guess this is because the function is vectorized. Is there any way to > avoid this behaviour or any other way to fit these data ? > > Thanks in advance > > Jose > > -- > Jose Guzman > http://www.ist.ac.at/~jguzman/ > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From d.warde.farley at gmail.com Sun Oct 7 01:04:26 2012 From: d.warde.farley at gmail.com (David Warde-Farley) Date: Sun, 7 Oct 2012 01:04:26 -0400 Subject: [SciPy-User] csr_matrix rows remove In-Reply-To: References: Message-ID: On Thu, Oct 4, 2012 at 9:05 AM, Pavel Lurye wrote: > Hi, > I'm using scipy csr_matrix and I'm trying to figure out what is the > simple and fast way to remove a row from such matrix? > For example, I have a tuple of rows, that should be deleted. The only > way I see, is to generate a tuple of matrix parts and vstack it. > Please, help me out with this. Unfortunately, CSR/CSC do not admit terribly efficient row deletion. What would be required to do it semi-efficiently would be to determine how many non-zero elements live in those rows (call this number k), allocate 3 vectors (new_data, new_indices, new_indptr), mirroring the .data, .indices and .indptr attributes of the sparse matrix object, each of length nnz - k (where nnz is the number of non-zero elements in the original matrix). First, copy the contents of mycsrmatrix.data into new_data, omitting the ones in the deleted rows. Then things become tricky: you need to adjust the values of indices and indptr to account for the now missing rows. This would require reading up on the CSR format, and would be relatively complicated but not impossible. 
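A rough, untested sketch of that bookkeeping (with `rows` standing for your tuple of row indices to drop) could look like:

import numpy as np
from scipy.sparse import csr_matrix

def delete_csr_rows(A, rows):
    # keep[i] is True for the rows of the CSR matrix A that survive
    keep = np.ones(A.shape[0], dtype=bool)
    keep[np.asarray(rows)] = False
    row_lens = np.diff(A.indptr)                  # stored entries per row
    new_indptr = np.concatenate(([0], np.cumsum(row_lens[keep])))
    mask = np.repeat(keep, row_lens)              # selects the data/indices entries to copy
    return csr_matrix((A.data[mask], A.indices[mask], new_indptr),
                      shape=(int(keep.sum()), A.shape[1]))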
A simpler (but less efficient) implementation could convert to COO format first, fiddle with the row/col/data vectors to get the right subsets of elements, then adjust the row indices to account for the decreases caused by rows that are no longer there, and then create another COO matrix with the (data, ij) constructor form; then convert back to CSR with .tocsr(). From njs at pobox.com Sun Oct 7 06:59:48 2012 From: njs at pobox.com (Nathaniel Smith) Date: Sun, 7 Oct 2012 11:59:48 +0100 Subject: [SciPy-User] csr_matrix rows remove In-Reply-To: References: Message-ID: On Sun, Oct 7, 2012 at 6:04 AM, David Warde-Farley wrote: > On Thu, Oct 4, 2012 at 9:05 AM, Pavel Lurye wrote: >> Hi, >> I'm using scipy csr_matrix and I'm trying to figure out what is the >> simple and fast way to remove a row from such matrix? >> For example, I have a tuple of rows, that should be deleted. The only >> way I see, is to generate a tuple of matrix parts and vstack it. >> Please, help me out with this. > > Unfortunately, CSR/CSC do not admit terribly efficient row deletion. > What would be required to do it semi-efficiently would be to determine > how many non-zero elements live in those rows (call this number k), > allocate 3 vectors (new_data, new_indices, new_indptr), mirroring the > .data, .indices and .indptr attributes of the sparse matrix object, > each of length nnz - k (where nnz is the number of non-zero elements > in the original matrix). First, copy the contents of mycsrmatrix.data > into new_data, omitting the ones in the deleted rows. Then things > become tricky: you need to adjust the values of indices and indptr to > account for the now missing rows. This would require reading up on the > CSR format, and would be relatively complicated but not impossible. Row deletion from CSR is about as efficient as from a dense matrix... you have to copy the data, of course, but that's the only real cost. I think it works to do something like (untested and only handling one row, to illustrate the idea): def delete_a_csr_row(row_i, data, indices, indptr): k = indptr[row_i + 1] - indptr[row_i] new_data = np.empty(len(data) - k, dtype=data.dtype) new_indices = np.empty(len(indices) - k, dtype=indices.dtype) new_indptr = np.empty(len(indptr) - 1, dtype=indptr.dtype) new_data[:indptr[row_i]] = data[:indptr[row_i]] new_data[indptr[row_i]:] = data[indptr[row_i + 1]:] new_indices[:indptr[row_i]] = indices[:indptr[row_i]] new_indices[indptr[row_i]:] = indices[indptr[row_i + 1]:] new_indptr[:row_i] = indptr[:row_i] new_indptr[row_i:] = indptr[row_i + 1:] new_indptr[row_i:] -= k return csr_matrix((new_data, new_indices, new_indptr)) I guess whether this counts as simple depends on your tolerance for sparse matrix formats :-). But it's much simpler than trying to do the same in, say, CSC format... and probably similar to COO. -n From fperez.net at gmail.com Sun Oct 7 19:40:54 2012 From: fperez.net at gmail.com (Fernando Perez) Date: Sun, 7 Oct 2012 16:40:54 -0700 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On Sun, Oct 7, 2012 at 3:50 PM, Thomas Kluyver wrote: > If there are any points about this or about the rest of the standard > that you think we haven't already discussed, then please raise them > now. As it stands, the draft standard we've worked out includes numpy, > scipy, matplotlib, ipython, pandas, sympy and nose (plus a few > dependencies). 
I think that's quite a good starting point, so this is > kind of a last call for comments before we declare the standard done. +1 for moving on with this fairly conservative but solid base. Once we sort out the kinks with this more targeted core, we can revisit this with an eye towards a more expanded definition of the spec. Kudos to you for hitting a good balance of discussion and action! f From takowl at gmail.com Sun Oct 7 18:50:08 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Sun, 7 Oct 2012 23:50:08 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: The latest poll shows slightly more support for not specifying the HDF5 libraries in the standard at the moment (15 for specifying both, 19 for specifying neither). This is also the option I think is best. If there are any points about this or about the rest of the standard that you think we haven't already discussed, then please raise them now. As it stands, the draft standard we've worked out includes numpy, scipy, matplotlib, ipython, pandas, sympy and nose (plus a few dependencies). I think that's quite a good starting point, so this is kind of a last call for comments before we declare the standard done. Thanks, Thomas From wesmckinn at gmail.com Sun Oct 7 21:14:04 2012 From: wesmckinn at gmail.com (Wes McKinney) Date: Sun, 7 Oct 2012 21:14:04 -0400 Subject: [SciPy-User] ANN: pandas 0.9.0 released Message-ID: hi all, I'm pleased to announce the 0.9.0 release of pandas. This is a major release with several feature improvements, a very large number of bug- and corner case-fixes, and minor, but necessary API changes. Many issues that were preventing pandas 0.7.x users from upgrading to 0.8.x (due to numpy.datetime64 problems) have been fixed. I recommend that all users upgrade to it as soon as feasible. Thanks to all who contributed to this release, especially Chang She, Wouter Overmeire, and y-p. As always source archives and Windows installers can be found on PyPI. What's new: http://pandas.pydata.org/pandas-docs/stable/whatsnew.html $ git log v0.8.1..v0.9.0 --pretty=format:%aN | sort | uniq -c | sort -rn 178 Wes McKinney 77 Chang She 22 y-p 17 Wouter Overmeire 7 Skipper Seabold 5 tshauck 5 Spencer Lyon 5 Martin Blais 4 Paul Ivanov 4 Lars Buitinck 4 Dan Miller 2 John-Colvin 2 Christopher Whelan 1 Yaroslav Halchenko 1 Taavi Burns 1 ?ystein S. Haaland 1 MinRK 1 Mark O'Leary 1 lenolib 1 Joshua Leahy 1 Johnny 1 Doug Coleman 1 Dieter Vandenbussche 1 Daniel Shapiro Happy data hacking! - Wes What is it ========== pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with relational, time series, or any other kind of labeled data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Links ===== Release Notes: http://github.com/pydata/pandas/blob/master/RELEASE.rst Documentation: http://pandas.pydata.org Installers: http://pypi.python.org/pypi/pandas Code Repository: http://github.com/pydata/pandas Mailing List: http://groups.google.com/group/pydata From fccoelho at gmail.com Mon Oct 8 05:38:32 2012 From: fccoelho at gmail.com (Flavio Coelho) Date: Mon, 8 Oct 2012 06:38:32 -0300 Subject: [SciPy-User] ANN: pandas 0.9.0 released In-Reply-To: References: Message-ID: FYI. Crysttian, vale a pena atualizar e verificar se h? 
novas funcionalidades ?teis para n?s sudo pip install -U pandas abcs, ---------- Forwarded message ---------- From: Wes McKinney Date: Sun, Oct 7, 2012 at 10:14 PM Subject: [SciPy-User] ANN: pandas 0.9.0 released To: pystatsmodels at googlegroups.com, SciPy Users List hi all, I'm pleased to announce the 0.9.0 release of pandas. This is a major release with several feature improvements, a very large number of bug- and corner case-fixes, and minor, but necessary API changes. Many issues that were preventing pandas 0.7.x users from upgrading to 0.8.x (due to numpy.datetime64 problems) have been fixed. I recommend that all users upgrade to it as soon as feasible. Thanks to all who contributed to this release, especially Chang She, Wouter Overmeire, and y-p. As always source archives and Windows installers can be found on PyPI. What's new: http://pandas.pydata.org/pandas-docs/stable/whatsnew.html $ git log v0.8.1..v0.9.0 --pretty=format:%aN | sort | uniq -c | sort -rn 178 Wes McKinney 77 Chang She 22 y-p 17 Wouter Overmeire 7 Skipper Seabold 5 tshauck 5 Spencer Lyon 5 Martin Blais 4 Paul Ivanov 4 Lars Buitinck 4 Dan Miller 2 John-Colvin 2 Christopher Whelan 1 Yaroslav Halchenko 1 Taavi Burns 1 ?ystein S. Haaland 1 MinRK 1 Mark O'Leary 1 lenolib 1 Joshua Leahy 1 Johnny 1 Doug Coleman 1 Dieter Vandenbussche 1 Daniel Shapiro Happy data hacking! - Wes What is it ========== pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with relational, time series, or any other kind of labeled data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Links ===== Release Notes: http://github.com/pydata/pandas/blob/master/RELEASE.rst Documentation: http://pandas.pydata.org Installers: http://pypi.python.org/pypi/pandas Code Repository: http://github.com/pydata/pandas Mailing List: http://groups.google.com/group/pydata _______________________________________________ SciPy-User mailing list SciPy-User at scipy.org http://mail.scipy.org/mailman/listinfo/scipy-user -- Fl?vio Code?o Coelho ================ +55(21) 3799-5567 Professor Escola de Matem?tica Aplicada Funda??o Get?lio Vargas Praia de Botafogo, 190 sala 312 Rio de Janeiro - RJ 22250-900 Brasil -------------- next part -------------- An HTML attachment was scrubbed... URL: From lists at hilboll.de Mon Oct 8 05:53:31 2012 From: lists at hilboll.de (Andreas Hilboll) Date: Mon, 8 Oct 2012 11:53:31 +0200 Subject: [SciPy-User] ANN: pandas 0.9.0 released In-Reply-To: References: Message-ID: <5b7553762f7ee5f6e1cf0da835b33693.squirrel@srv2.s4y.tournesol-consulting.eu> > FYI. > > Crysttian, vale a pena atualizar e verificar se h? novas funcionalidades > ?teis para n?s > > sudo pip install -U pandas I would not do that. I recommend using the --no-deps option as well; otherwise pip will update all of pandas' dependencies, including numpy. Cheers, Andreas. > ---------- Forwarded message ---------- > From: Wes McKinney > Date: Sun, Oct 7, 2012 at 10:14 PM > Subject: [SciPy-User] ANN: pandas 0.9.0 released > To: pystatsmodels at googlegroups.com, SciPy Users List > > > > hi all, > > I'm pleased to announce the 0.9.0 release of pandas. This is a > major release with several feature improvements, a very large > number of bug- and corner case-fixes, and minor, but necessary > API changes. Many issues that were preventing pandas 0.7.x users > from upgrading to 0.8.x (due to numpy.datetime64 problems) have > been fixed. 
I recommend that all users upgrade to it as soon as > feasible. > > Thanks to all who contributed to this release, especially Chang > She, Wouter Overmeire, and y-p. As always source archives and > Windows installers can be found on PyPI. > > What's new: http://pandas.pydata.org/pandas-docs/stable/whatsnew.html > > $ git log v0.8.1..v0.9.0 --pretty=format:%aN | sort | uniq -c | sort -rn > 178 Wes McKinney > 77 Chang She > 22 y-p > 17 Wouter Overmeire > 7 Skipper Seabold > 5 tshauck > 5 Spencer Lyon > 5 Martin Blais > 4 Paul Ivanov > 4 Lars Buitinck > 4 Dan Miller > 2 John-Colvin > 2 Christopher Whelan > 1 Yaroslav Halchenko > 1 Taavi Burns > 1 ?ystein S. Haaland > 1 MinRK > 1 Mark O'Leary > 1 lenolib > 1 Joshua Leahy > 1 Johnny > 1 Doug Coleman > 1 Dieter Vandenbussche > 1 Daniel Shapiro > > Happy data hacking! > > - Wes > > What is it > ========== > pandas is a Python package providing fast, flexible, and > expressive data structures designed to make working with > relational, time series, or any other kind of labeled data both > easy and intuitive. It aims to be the fundamental high-level > building block for doing practical, real world data analysis in > Python. > > Links > ===== > Release Notes: http://github.com/pydata/pandas/blob/master/RELEASE.rst > Documentation: http://pandas.pydata.org > Installers: http://pypi.python.org/pypi/pandas > Code Repository: http://github.com/pydata/pandas > Mailing List: http://groups.google.com/group/pydata > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > > From takowl at gmail.com Mon Oct 8 06:00:05 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Mon, 8 Oct 2012 03:00:05 -0700 (PDT) Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: <2b727825-6519-4aa6-9b64-d0b3bd6a6b86@googlegroups.com> Hi Anthony, On Monday, 8 October 2012 02:02:09 UTC+1, Anthony Scopatz wrote: > > I know that pandas can use HDF5 as a persistence backend. How optional is > this? > If this is completely optional than I would say that we should move ahead > with > what you recommend w/ pandas sans hdf5. If this is not optional than I > would > either suggest dropping pandas or including PyTables and h5py as well. > As Robert says, it's completely optional. If PyTables is installed, pandas can store objects in HDF5, but if not, the rest of pandas still works perfectly. That also allows pandas to support Python 3, while PyTables doesn't yet. Thanks all, Thomas -------------- next part -------------- An HTML attachment was scrubbed... URL: From lists at hilboll.de Mon Oct 8 08:50:08 2012 From: lists at hilboll.de (Andreas Hilboll) Date: Mon, 8 Oct 2012 14:50:08 +0200 Subject: [SciPy-User] [pystatsmodels] Re: ANN: pandas 0.9.0 released In-Reply-To: References: Message-ID: <16545241eed6cc6f5d10b374c98ac716.squirrel@srv2.s4y.tournesol-consulting.eu> > On fedora 17: > > pip install --up --user pandas > ... > Installing collected packages: python-dateutil, pytz, six > Found existing installation: python-dateutil 1.5 > Uninstalling python-dateutil: > Exception: > ... > > So pip wants to remove the system python-dateutil 1.5. > > What's the solution here? Can pandas 0.9 just use the installed dateutil > 1.5? IIRC, dateutil >= 2.0 is for Python3, while Python 2.x requires dateutil 1.5. Cheers, Andreas. 
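A minimal sketch of the upgrade path suggested above, assuming pip is already set up and that numpy and dateutil come from the system or another installer; --no-deps keeps pip from rebuilding or replacing those existing dependencies, and anything genuinely missing can then be installed as a separate step:

    # upgrade pandas itself, leaving numpy, dateutil, etc. untouched
    pip install --upgrade --no-deps pandas
    # then pull in only the requirements that are actually missing
    pip install pytz python-dateutil
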
From wesmckinn at gmail.com Mon Oct 8 09:07:15 2012 From: wesmckinn at gmail.com (Wes McKinney) Date: Mon, 8 Oct 2012 09:07:15 -0400 Subject: [SciPy-User] [pystatsmodels] Re: ANN: pandas 0.9.0 released In-Reply-To: <16545241eed6cc6f5d10b374c98ac716.squirrel@srv2.s4y.tournesol-consulting.eu> References: <16545241eed6cc6f5d10b374c98ac716.squirrel@srv2.s4y.tournesol-consulting.eu> Message-ID: On Mon, Oct 8, 2012 at 8:50 AM, Andreas Hilboll wrote: >> On fedora 17: >> >> pip install --up --user pandas >> ... >> Installing collected packages: python-dateutil, pytz, six >> Found existing installation: python-dateutil 1.5 >> Uninstalling python-dateutil: >> Exception: >> ... >> >> So pip wants to remove the system python-dateutil 1.5. >> >> What's the solution here? Can pandas 0.9 just use the installed dateutil >> 1.5? > > IIRC, dateutil >= 2.0 is for Python3, while Python 2.x requires dateutil 1.5. > > Cheers, Andreas. > dateutil 2.1 supports >= 2.6 and 3.x using six. Here are the arguments being passed to pip: setuptools_kwargs = { 'install_requires': ['python-dateutil', 'pytz', 'numpy >= 1.6'], 'zip_safe' : False, } Maybe passing --no-deps to pip is the way to go. Packaging misery - Wes From wesmckinn at gmail.com Mon Oct 8 09:17:11 2012 From: wesmckinn at gmail.com (Wes McKinney) Date: Mon, 8 Oct 2012 09:17:11 -0400 Subject: [SciPy-User] [pystatsmodels] Re: Re: ANN: pandas 0.9.0 released In-Reply-To: References: <16545241eed6cc6f5d10b374c98ac716.squirrel@srv2.s4y.tournesol-consulting.eu> Message-ID: On Mon, Oct 8, 2012 at 9:15 AM, Neal Becker wrote: > Wes McKinney wrote: > >> On Mon, Oct 8, 2012 at 8:50 AM, Andreas Hilboll >> wrote: >>>> On fedora 17: >>>> >>>> pip install --up --user pandas >>>> ... >>>> Installing collected packages: python-dateutil, pytz, six >>>> Found existing installation: python-dateutil 1.5 >>>> Uninstalling python-dateutil: >>>> Exception: >>>> ... >>>> >>>> So pip wants to remove the system python-dateutil 1.5. >>>> >>>> What's the solution here? Can pandas 0.9 just use the installed dateutil >>>> 1.5? >>> >>> IIRC, dateutil >= 2.0 is for Python3, while Python 2.x requires dateutil 1.5. >>> >>> Cheers, Andreas. >>> >> >> dateutil 2.1 supports >= 2.6 and 3.x using six. Here are the arguments >> being passed to pip: >> >> setuptools_kwargs = { >> 'install_requires': ['python-dateutil', >> 'pytz', >> 'numpy >= 1.6'], >> 'zip_safe' : False, >> } >> >> Maybe passing --no-deps to pip is the way to go. Packaging misery >> >> - Wes > > So to confirm, pandas will work OK with dateutil 1.5? > Yes From takowl at gmail.com Mon Oct 8 10:21:56 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Mon, 8 Oct 2012 15:21:56 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On 8 October 2012 00:40, Fernando Perez wrote: > +1 for moving on with this fairly conservative but solid base. Once > we sort out the kinks with this more targeted core, we can revisit > this with an eye towards a more expanded definition of the spec. > > Kudos to you for hitting a good balance of discussion and action! Thanks, Fernando. I think the next step is to start reworking the 'new' scipy.org website (http://scipy.github.com/ ) to focus on the stack, rather than scipy-the-package. I've just kicked that off with a pull request replacing the 'download' page with an 'install' page: https://github.com/scipy/scipy.org-new/pull/3 I'd like to invite anyone with an interest in this (and I know there are a lot of you) to get involved with the website. 
A few of the things we'll need: - Update the front page to promote the Scipy stack we've agreed on: adding Pandas & Sympy, rearranging the current distinction of 'core projects' vs. 'related projects'. - A page describing the Scipy stack specification. - A new separate page (or pages) about scipy-the-package. - Some general design work wouldn't go amiss - do we need the breadcrumb bar? Can we improve top level navigation? Best wishes, Thomas From scopatz at gmail.com Sun Oct 7 21:01:48 2012 From: scopatz at gmail.com (Anthony Scopatz) Date: Sun, 7 Oct 2012 20:01:48 -0500 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: Hello Thomas, I know that pandas can use HDF5 as a persistence backend. How optional is this? If this is completely optional than I would say that we should move ahead with what you recommend w/ pandas sans hdf5. If this is not optional than I would either suggest dropping pandas or including PyTables and h5py as well. Be Well Anthony On Sun, Oct 7, 2012 at 6:40 PM, Fernando Perez wrote: > On Sun, Oct 7, 2012 at 3:50 PM, Thomas Kluyver wrote: > > If there are any points about this or about the rest of the standard > > that you think we haven't already discussed, then please raise them > > now. As it stands, the draft standard we've worked out includes numpy, > > scipy, matplotlib, ipython, pandas, sympy and nose (plus a few > > dependencies). I think that's quite a good starting point, so this is > > kind of a last call for comments before we declare the standard done. > > +1 for moving on with this fairly conservative but solid base. Once > we sort out the kinks with this more targeted core, we can revisit > this with an eye towards a more expanded definition of the spec. > > Kudos to you for hitting a good balance of discussion and action! > > f > > -- > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From elofgren at email.unc.edu Mon Oct 8 02:33:38 2012 From: elofgren at email.unc.edu (Lofgren, Eric) Date: Mon, 8 Oct 2012 06:33:38 +0000 Subject: [SciPy-User] SciPy ODEINT Problem Message-ID: I've been working on a set of ordinary differential equations for an epidemic model. Normally odeint works swimmingly for this kind of thing, though I'll admit this one is a touch more complex than most of the ones I've thrown at it. I'm getting the following error message when I try to run the code below: lsoda-- at t (=r1) and step size h (=r2), the corrector convergence failed repeatedly or with abs(h) = hmin In above, R1 = 0.7016749763132E+04 R2 = 0.1954514552051E-06 Repeated convergence failures (perhaps bad Jacobian or tolerances). Having used the atol and rtol options to relax the tolerances by quite a bit, the error then converts to: RuntimeWarning: overflow encountered in double_scalars This feels to me like a coding error at that point, rather than some issue with the solver itself, but I've not managed to find anything in particular. Any ideas? 
Thanks, Eric # Imports import numpy as np from pylab import * import scipy.integrate as spi # Initial Population States - Model is in Individuals Us0 = 2.0 H0 = 1.0 Up0 = 4.0 Cp0 = 4.0 Ua0 = 4.0 Ca0 = 4.0 D0 = 0.0 PopIn = (Us0, H0, Up0, Cp0, Ua0, Ca0, D0) # Model parameters # Time is currently in MINUTES N = Us0 + H0 + Up0 + Cp0 + Ua0 + Ca0 + D0 M = Up0 + Cp0 + Ua0 + Ca0 + D0 n_contacts = (3.0)*(1/20.0) p_contacts = nurse_contacts/N rho_p = p_contacts rho_d = p_contacts rho_a = p_contacts sigma_p = 0.05 sigma_d = 0.25 sigma_a = 0.05 omega = 14400 alpha = 0.25 psi_p = 0.10 psi_a = 0.10 mu_p = 0.0 mu_a = 0.0 theta_p = 1.0/10080.0 theta_a = 1.0/10080.0 nu_cp = 0.07 nu_ca = 0.07 nu_d = 0.0 nu_up = 0.43 nu_ua = 0.43 kappa = 0.10 tau = 3.50 iota = 1/20.0 * 0.60 * 1.00 zeta = 0.2785 * (1.0/17280.0) # Pr(death) and time until death gamma = (1.0-0.2785) * (1.0/12902.4) # Pr(discharge) and time until discharge theta = theta_p + theta_a # ODE Fit and Graphing Parameters t_end = 144000 t_start = 1 t_step = 0.1 t_interval = np.arange(t_start, t_end, t_step) # The actual model running part def eq_system(PopIn,t): #Creating an array of equations Eqs= np.zeros((7)) Eqs[0] = ((iota*PopIn[1]) - (rho_p*sigma_p*PopIn[3]*(PopIn[0]/N)) - (rho_d*sigma_d*PopIn[6]*(PopIn[0]/N)) - (rho_a*sigma_a*PopIn[5]*(PopIn[0]/N))) Eqs[1] = ((rho_p*sigma_p*PopIn[3]*(PopIn[0]/N)) + (rho_d*sigma_d*PopIn[6]*(PopIn[0]/N)) + (rho_a*sigma_a*PopIn[5]*(PopIn[0]/N)) - (iota*PopIn[1])) Eqs[2] = (((1/omega)*PopIn[4]) - alpha*PopIn[2] - (rho_p*psi_p*PopIn[2]*(PopIn[1]/N)) - (mu_p*sigma_p*PopIn[2]*(PopIn[3]/N)) - (mu_a*sigma_a*PopIn[2]*(PopIn[5]/N)) - theta_p*PopIn[2] + nu_up*((theta*M)+(zeta*PopIn[6])+(gamma*PopIn[6]))) Eqs[3] = (alpha*PopIn[2] - ((1/omega)*PopIn[4]) - (rho_a*psi_a*PopIn[4]*(PopIn[1]/N)) - (mu_p*sigma_p*PopIn[4]*(PopIn[3]/N)) - (mu_a*sigma_a*PopIn[4]*(PopIn[5]/N)) - theta_a*PopIn[4] + nu_ua*((theta*M)+(zeta*PopIn[6])+(gamma*PopIn[6]))) Eqs[4] = (((1/omega)*PopIn[5]) + (rho_p*psi_p*PopIn[2]*(PopIn[1]/N)) + (mu_p*sigma_p*PopIn[4]*(PopIn[3]/N)) + (mu_a*sigma_a*PopIn[2]*(PopIn[5]/N)) - alpha*PopIn[3] - kappa*PopIn[3] - theta_p*PopIn[3] + nu_cp*((theta*M)+(zeta*PopIn[6])+(gamma*PopIn[6]))) Eqs[5] = (alpha*PopIn[3] + (rho_a*psi_a*PopIn[4]*(PopIn[1]/N)) + (mu_p*sigma_p*PopIn[4]*(PopIn[3]/N)) + (mu_a*sigma_a*PopIn[4]*(PopIn[5]/N)) - ((1/omega)*PopIn[5]) - kappa*tau*PopIn[5] - theta_a*PopIn[5] + nu_ca*((theta*M)+(zeta*PopIn[6])+(gamma*PopIn[6]))) Eqs[6] = (kappa*PopIn[3] + kappa*tau*PopIn[5] - gamma*PopIn[6] - zeta*PopIn[6] + nu_d*((theta*M)+(zeta*PopIn[6])+(gamma*PopIn[6]))) return Eqs # Model Solver model = spi.odeint(eq_system, PopIn, t_interval) From rkern at enthought.com Mon Oct 8 05:15:12 2012 From: rkern at enthought.com (Robert Kern) Date: Mon, 8 Oct 2012 10:15:12 +0100 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On Mon, Oct 8, 2012 at 2:01 AM, Anthony Scopatz wrote: > Hello Thomas, > > I know that pandas can use HDF5 as a persistence backend. How optional is > this? Completely optional. -- Robert Kern Enthought From m_dzjaparidze at hotmail.com Mon Oct 8 11:18:31 2012 From: m_dzjaparidze at hotmail.com (Michael Dzjaparidze) Date: Mon, 8 Oct 2012 17:18:31 +0200 Subject: [SciPy-User] Problems using scipy.sparse.linalg.eigs Message-ID: I'm having trouble using scipy.sparse.linalg.eigs in that it fails to find any eigenvalues hence raising a "DNAUPD did not find any eigenvalues to sufficient accuracy." error. Even after experimenting with different tol parameter settings. 
If instead I use scipy.linalg.eig after first calling .todense() on my sparse matrix I do get all the eigenvalues I expect to get back. I suppose I could just do this, but that seems a bit inelegant after all the trouble of working with sparse matrices. I realize this issue is a bit hard to answer if I don't provide a concrete example so quickly, but I just was wondering if anybody has experienced a similar problem perhaps? The eigenvalues which I expect to get back are N/2 complex conjugate pairs, where NxN is the size of my original matrix. Any help or advice is greatly appreciated. -------------- next part -------------- An HTML attachment was scrubbed... URL: From fperez.net at gmail.com Mon Oct 8 14:25:22 2012 From: fperez.net at gmail.com (Fernando Perez) Date: Mon, 8 Oct 2012 11:25:22 -0700 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: Hey Thomas, On Mon, Oct 8, 2012 at 7:21 AM, Thomas Kluyver wrote: > > Thanks, Fernando. I think the next step is to start reworking the > 'new' scipy.org website (http://scipy.github.com/ ) to focus on the > stack, rather than scipy-the-package. I've just kicked that off with a > pull request replacing the 'download' page with an 'install' page: Kyle Mandli had already been pushing hard on this front, it would be great if the two efforts could play off each other, as he'd spent a good amount of time already on this idea... f From ralf.gommers at gmail.com Mon Oct 8 14:30:37 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 8 Oct 2012 20:30:37 +0200 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On Mon, Oct 8, 2012 at 8:25 PM, Fernando Perez wrote: > Hey Thomas, > > On Mon, Oct 8, 2012 at 7:21 AM, Thomas Kluyver wrote: > > > > Thanks, Fernando. I think the next step is to start reworking the > > 'new' scipy.org website (http://scipy.github.com/ ) to focus on the > > stack, rather than scipy-the-package. I've just kicked that off with a > > pull request replacing the 'download' page with an 'install' page: > > Kyle Mandli had already been pushing hard on this front, it would be > great if the two efforts could play off each other, as he'd spent a > good amount of time already on this idea... Is his work somewhere public? He hasn't made any PRs yet. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From fperez.net at gmail.com Mon Oct 8 14:31:58 2012 From: fperez.net at gmail.com (Fernando Perez) Date: Mon, 8 Oct 2012 11:31:58 -0700 Subject: [SciPy-User] Scipy stack: standard packages (poll) In-Reply-To: References: Message-ID: On Mon, Oct 8, 2012 at 11:30 AM, Ralf Gommers wrote: > Is his work somewhere public? He hasn't made any PRs yet. Dunno, we had long discussions at SciPy'12 about planning and then there were a few threads on the lists after that. But I got swamped and tuned out, we'll have to wait for him to pitch in with info. From ralf.gommers at gmail.com Mon Oct 8 14:54:54 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 8 Oct 2012 20:54:54 +0200 Subject: [SciPy-User] SciPy ODEINT Problem In-Reply-To: References: Message-ID: On Mon, Oct 8, 2012 at 8:33 AM, Lofgren, Eric wrote: > I've been working on a set of ordinary differential equations for an > epidemic model. Normally odeint works swimmingly for this kind of thing, > though I'll admit this one is a touch more complex than most of the ones > I've thrown at it. 
I'm getting the following error message when I try to > run the code below: > > lsoda-- at t (=r1) and step size h (=r2), the > corrector convergence failed repeatedly > or with abs(h) = hmin > In above, R1 = 0.7016749763132E+04 R2 = 0.1954514552051E-06 > Repeated convergence failures (perhaps bad Jacobian or tolerances). > > Having used the atol and rtol options to relax the tolerances by quite a > bit, the error then converts to: > > RuntimeWarning: overflow encountered in double_scalars > > This feels to me like a coding error at that point, rather than some issue > with the solver itself, but I've not managed to find anything in > particular. Any ideas? > Have you looked at the returned solution? If you plot it: import matplotlib.pyplot as plt plt.plot(model) plt.ylim([-10, 20]) plt.show() you'll see that a few of the curves are stable and a few other ones run off to infinity quickly. Reducing the time step doesn't change that. So your model may have an error in it. Ralf > Thanks, > > Eric > > # Imports > import numpy as np > from pylab import * > import scipy.integrate as spi > > # Initial Population States - Model is in Individuals > Us0 = 2.0 > H0 = 1.0 > Up0 = 4.0 > Cp0 = 4.0 > Ua0 = 4.0 > Ca0 = 4.0 > D0 = 0.0 > PopIn = (Us0, H0, Up0, Cp0, Ua0, Ca0, D0) > > # Model parameters > # Time is currently in MINUTES > N = Us0 + H0 + Up0 + Cp0 + Ua0 + Ca0 + D0 > M = Up0 + Cp0 + Ua0 + Ca0 + D0 > n_contacts = (3.0)*(1/20.0) > p_contacts = nurse_contacts/N > rho_p = p_contacts > rho_d = p_contacts > rho_a = p_contacts > sigma_p = 0.05 > sigma_d = 0.25 > sigma_a = 0.05 > omega = 14400 > alpha = 0.25 > psi_p = 0.10 > psi_a = 0.10 > mu_p = 0.0 > mu_a = 0.0 > theta_p = 1.0/10080.0 > theta_a = 1.0/10080.0 > nu_cp = 0.07 > nu_ca = 0.07 > nu_d = 0.0 > nu_up = 0.43 > nu_ua = 0.43 > kappa = 0.10 > tau = 3.50 > iota = 1/20.0 * 0.60 * 1.00 > > zeta = 0.2785 * (1.0/17280.0) # Pr(death) and time until death > gamma = (1.0-0.2785) * (1.0/12902.4) # Pr(discharge) and time until > discharge > theta = theta_p + theta_a > > > # ODE Fit and Graphing Parameters > t_end = 144000 > t_start = 1 > t_step = 0.1 > t_interval = np.arange(t_start, t_end, t_step) > > # The actual model running part > > def eq_system(PopIn,t): > #Creating an array of equations > Eqs= np.zeros((7)) > Eqs[0] = ((iota*PopIn[1]) - (rho_p*sigma_p*PopIn[3]*(PopIn[0]/N)) - > (rho_d*sigma_d*PopIn[6]*(PopIn[0]/N)) - > (rho_a*sigma_a*PopIn[5]*(PopIn[0]/N))) > Eqs[1] = ((rho_p*sigma_p*PopIn[3]*(PopIn[0]/N)) + > (rho_d*sigma_d*PopIn[6]*(PopIn[0]/N)) > + (rho_a*sigma_a*PopIn[5]*(PopIn[0]/N)) - (iota*PopIn[1])) > Eqs[2] = (((1/omega)*PopIn[4]) - alpha*PopIn[2] - > (rho_p*psi_p*PopIn[2]*(PopIn[1]/N)) > - (mu_p*sigma_p*PopIn[2]*(PopIn[3]/N)) - > (mu_a*sigma_a*PopIn[2]*(PopIn[5]/N)) > - theta_p*PopIn[2] + > nu_up*((theta*M)+(zeta*PopIn[6])+(gamma*PopIn[6]))) > Eqs[3] = (alpha*PopIn[2] - ((1/omega)*PopIn[4]) - > (rho_a*psi_a*PopIn[4]*(PopIn[1]/N)) > - (mu_p*sigma_p*PopIn[4]*(PopIn[3]/N)) - > (mu_a*sigma_a*PopIn[4]*(PopIn[5]/N)) > - theta_a*PopIn[4] + > nu_ua*((theta*M)+(zeta*PopIn[6])+(gamma*PopIn[6]))) > Eqs[4] = (((1/omega)*PopIn[5]) + (rho_p*psi_p*PopIn[2]*(PopIn[1]/N)) > + (mu_p*sigma_p*PopIn[4]*(PopIn[3]/N)) + > (mu_a*sigma_a*PopIn[2]*(PopIn[5]/N)) > - alpha*PopIn[3] - kappa*PopIn[3] - theta_p*PopIn[3] > + nu_cp*((theta*M)+(zeta*PopIn[6])+(gamma*PopIn[6]))) > Eqs[5] = (alpha*PopIn[3] + (rho_a*psi_a*PopIn[4]*(PopIn[1]/N)) > + (mu_p*sigma_p*PopIn[4]*(PopIn[3]/N)) + > (mu_a*sigma_a*PopIn[4]*(PopIn[5]/N)) - ((1/omega)*PopIn[5]) > - kappa*tau*PopIn[5] - 
theta_a*PopIn[5] + > nu_ca*((theta*M)+(zeta*PopIn[6])+(gamma*PopIn[6]))) > Eqs[6] = (kappa*PopIn[3] + kappa*tau*PopIn[5] - gamma*PopIn[6] > - zeta*PopIn[6] + > nu_d*((theta*M)+(zeta*PopIn[6])+(gamma*PopIn[6]))) > return Eqs > > # Model Solver > model = spi.odeint(eq_system, PopIn, t_interval) > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From kyle.mandli at gmail.com Mon Oct 8 16:15:18 2012 From: kyle.mandli at gmail.com (Kyle Mandli) Date: Mon, 8 Oct 2012 15:15:18 -0500 Subject: [SciPy-User] Scipy stack: standard packages (poll) Message-ID: We were having a discussion on this topic on Scipy-Dev where Pauli Virtanen mentioned that there had been effort to implement a plan similar to what iPython and matplotlib does with their documentation and website. The conversation has dropped off from there but I think there is enough there already to get started. The best information on our status now was from Pauli a month ago: http://mail.scipy.org/pipermail/scipy-dev/2012-August/017916.html Kyle > Hey Thomas, > >On Mon, Oct 8, 2012 at 7:21 AM, Thomas Kluyver wrote: >> >> Thanks, Fernando. I think the next step is to start reworking the >> 'new' scipy.org website (http://scipy.github.com/ ) to focus on the >> stack, rather than scipy-the-package. I've just kicked that off with a >> pull request replacing the 'download' page with an 'install' page: > >Kyle Mandli had already been pushing hard on this front, it would be >great if the two efforts could play off each other, as he'd spent a >good amount of time already on this idea... > >f From juanlu001 at gmail.com Mon Oct 8 17:14:12 2012 From: juanlu001 at gmail.com (=?ISO-8859-1?Q?Juan_Luis_Cano_Rodr=EDguez?=) Date: Mon, 8 Oct 2012 23:14:12 +0200 Subject: [SciPy-User] numpy.piecewise doesn't work with lists, only ndarrays Message-ID: I have noticed this behaviour of numpy.piecewise: In [1]: import numpy as np In [2]: q = [1, 2, 3, 4, 5, 6] In [3]: np.piecewise(q, [q < 3, 3 <= q], [-1, 1]) Out[3]: array([ 1, -1, 0, 0, 0, 0]) In [4]: q = np.array(q) In [5]: np.piecewise(q, [q < 3, 3 <= q], [-1, 1]) Out[5]: array([-1, -1, 1, 1, 1, 1]) Maybe the function should work the same both with lists and arrays? Should I file a bug? -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Mon Oct 8 17:25:59 2012 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 08 Oct 2012 23:25:59 +0200 Subject: [SciPy-User] numpy.piecewise doesn't work with lists, only ndarrays In-Reply-To: References: Message-ID: <1349731559.7603.2.camel@sebastian-laptop> Hey, On Mon, 2012-10-08 at 23:14 +0200, Juan Luis Cano Rodr?guez wrote: > I have noticed this behaviour of numpy.piecewise: > > In [1]: import numpy as np > > In [2]: q = [1, 2, 3, 4, 5, 6] > > In [3]: np.piecewise(q, [q < 3, 3 <= q], [-1, 1]) Note that [q < 3, 3 <= q] evaluates as [False, True] due to how comparison with lists works in python. So you cannot expect a useful result. Regards, Sebastian > Out[3]: array([ 1, -1, 0, 0, 0, 0]) > > In [4]: q = np.array(q) > > In [5]: np.piecewise(q, [q < 3, 3 <= q], [-1, 1]) > Out[5]: array([-1, -1, 1, 1, 1, 1]) > > Maybe the function should work the same both with lists and arrays? > Should I file a bug? 
> _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From warren.weckesser at enthought.com Mon Oct 8 17:26:24 2012 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Mon, 8 Oct 2012 17:26:24 -0400 Subject: [SciPy-User] numpy.piecewise doesn't work with lists, only ndarrays In-Reply-To: References: Message-ID: On Mon, Oct 8, 2012 at 5:14 PM, Juan Luis Cano Rodr?guez < juanlu001 at gmail.com> wrote: > I have noticed this behaviour of numpy.piecewise: > > In [1]: import numpy as np > > In [2]: q = [1, 2, 3, 4, 5, 6] > > In [3]: np.piecewise(q, [q < 3, 3 <= q], [-1, 1]) > Out[3]: array([ 1, -1, 0, 0, 0, 0]) > > The `condlist` argument must be a list of bool arrays. In your case, because `q` is python list, `q < 3` is not an array. It is simply `False`: In [3]: q = [1, 2, 3, 4, 5, 6] In [4]: q < 3 Out[4]: False It will work if you pass in lists of boolean values. E.g.: In [6]: piecewise(q, [[x < 3 for x in q], [x >= 3 for x in q]], [-1, 1]) Out[6]: array([-1, -1, 1, 1, 1, 1]) Warren In [4]: q = np.array(q) > > In [5]: np.piecewise(q, [q < 3, 3 <= q], [-1, 1]) > Out[5]: array([-1, -1, 1, 1, 1, 1]) > > Maybe the function should work the same both with lists and arrays? Should > I file a bug? > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From e.antero.tammi at gmail.com Mon Oct 8 17:35:39 2012 From: e.antero.tammi at gmail.com (eat) Date: Tue, 9 Oct 2012 00:35:39 +0300 Subject: [SciPy-User] numpy.piecewise doesn't work with lists, only ndarrays In-Reply-To: References: Message-ID: Hi, On Tue, Oct 9, 2012 at 12:14 AM, Juan Luis Cano Rodr?guez < juanlu001 at gmail.com> wrote: > I have noticed this behaviour of numpy.piecewise: > > In [1]: import numpy as np > > In [2]: q = [1, 2, 3, 4, 5, 6] > > In [3]: np.piecewise(q, [q < 3, 3 <= q], [-1, 1]) > Out[3]: array([ 1, -1, 0, 0, 0, 0]) > FWIF, when q is list this doesn't make sense: In []: [q< 3, 3<= 3] Out[]: [False, True] but with array it makes sense: In []: q= array(q) In []: [q< 3, 3<= 3] Out[]: [array([ True, True, False, False, False, False], dtype=bool), True] IMO, np.piecewise() should just work with arrays as documented. My 2 cents, -eat > > In [4]: q = np.array(q) > > In [5]: np.piecewise(q, [q < 3, 3 <= q], [-1, 1]) > Out[5]: array([-1, -1, 1, 1, 1, 1]) > > Maybe the function should work the same both with lists and arrays? Should > I file a bug? > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From e.antero.tammi at gmail.com Mon Oct 8 17:45:44 2012 From: e.antero.tammi at gmail.com (eat) Date: Tue, 9 Oct 2012 00:45:44 +0300 Subject: [SciPy-User] numpy.piecewise doesn't work with lists, only ndarrays In-Reply-To: References: Message-ID: On Tue, Oct 9, 2012 at 12:35 AM, eat wrote: > Hi, > > On Tue, Oct 9, 2012 at 12:14 AM, Juan Luis Cano Rodr?guez < > juanlu001 at gmail.com> wrote: > >> I have noticed this behaviour of numpy.piecewise: >> >> In [1]: import numpy as np >> >> In [2]: q = [1, 2, 3, 4, 5, 6] >> >> In [3]: np.piecewise(q, [q < 3, 3 <= q], [-1, 1]) >> Out[3]: array([ 1, -1, 0, 0, 0, 0]) >> > FWIF, when q is list this doesn't make sense: > In []: [q< 3, 3<= 3] > Out[]: [False, True] > but with array it makes sense: > In []: q= array(q) > In []: [q< 3, 3<= 3] > Out[]: [array([ True, True, False, False, False, False], dtype=bool), > True] > Heh, obviously my intention was (: In []: [q< 3, 3<= q] Out[]: [array([ True, True, False, False, False, False], dtype=bool), array([False, False, True, True, True, True], dtype=bool)] > > IMO, np.piecewise() should just work with arrays as documented. > > > My 2 cents, > -eat > > >> >> In [4]: q = np.array(q) >> >> In [5]: np.piecewise(q, [q < 3, 3 <= q], [-1, 1]) >> Out[5]: array([-1, -1, 1, 1, 1, 1]) >> >> Maybe the function should work the same both with lists and arrays? >> Should I file a bug? >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From takowl at gmail.com Mon Oct 8 19:30:20 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Tue, 9 Oct 2012 00:30:20 +0100 Subject: [SciPy-User] new scipy website In-Reply-To: References: Message-ID: Hi Joris, Thanks for all the thoughts. I've CCed the scipy-user list in this response, so other people can pitch in as well. On 8 October 2012 22:49, Joris Van den Bossche wrote: > Dear Thomas, > > As you said on the numfocus list, there are a lot out there with interest in > getting involved in the scipy website, and maybe I am someone like that. > > I am a PhD Student at Ghent University, and an enthousiast scientific python > user, also following the lists, but never really getting involved (apart > from sporadic bug reporting). And I also don't have that much of a time, but > maybe every little bit helps. > > But at the moment, that is my feeling at least with the new scipy website, > it is not very clear what has to be done, some kind of 'roadmap' for the > website, where interested people can get involved. > There were some discussions on the mailing lists, like the one in end of > July (http://thread.gmane.org/gmane.comp.python.scientific.devel/16796) , > but never really concrete. There was also no activity the last 5 months at > the scipy.org-new repository. I don't want to leap in and present a whole roadmap straight away - as we saw with the packages question, people have a wide range of opinions on how best to achieve the same basic aim, and I'm sure opinions on the website will be similarly diverse. I've largely used up my disagreement quota for a couple of weeks. ;-) I'm assuming that we want to work roughly within the framework of the existing 'new' site, which is a set of static pages built by Sphinx (as is ipython.org). Although many of us are familiar with Sphinx, there is a case for moving to a tool more suited to websites than documentation. 
If someone wanted to do some research and present the case for such a change, I think we'd seriously consider it. > At the numfocus list, you mentioned some first to do points: > >> - Update the front page to promote the Scipy stack we've agreed on: >> adding Pandas & Sympy, rearranging the current distinction of 'core >> projects' vs. 'related projects'. >> - A page describing the Scipy stack specification. >> - A new separate page (or pages) about scipy-the-package. >> - Some general design work wouldn't go amiss - do we need the >> breadcrumb bar? Can we improve top level navigation? One more thing people can get involved with straight away. On the new installation page I've done, there's a section for installing from Linux distro packages. I've filled it in for Ubuntu & Debian. Could users of other major distributions provide a short entry for each, with the command to install all the packages, the first distro version which meets the specification, and any other relevant info? Here's the section for Ubuntu & Debian: https://github.com/scipy/scipy.org-new/pull/3/files#L3R29 > Had you already an idea on how you wanted to proceed? > I was thinking that maybe some kind of discussion document, for discussing > an outline of the design and structure of the site we want in the future > could be helpfull. To engage others in the discussion. For later, to see > what still has to be done. Also, to really think about the structure (which > pages we want, ...) we envision in the future. Not necessarily directly > achievable from the start, but to have a goal to work to, and to avoid that > each time somebody wants to add something adds a page and you end op with a > mess like the scipy.org today (at least, I find it a not very clear and > structured site). I was thinking of a google docs, but maybe something > similar is possible on github. I think such dynamic document is better > suited for this that a discussion on a mailing list. That does sound like a good idea. Github has wiki pages, and of course the current scipy.org site is a wiki. I'm also happy to use a Google Doc, or one of the successors to Google Wave, like Rizzoma. Since you had the idea, can you pick a platform, and sketch out some sort of overview, so that we can start discussing and filling in details? > Apart from the points you raised above, I was also thinking about the > following: > - What do we want with a lot of material on the scipy website right now, eg > cookbook, topical software? Bring it over (but then they need an update I > think)? Do we want something like modern version of the cookbook > (cross-project examples, maybe integrated with notebooks), or should it be > replaced by Scipy central? For now, much of the content will probably stay on the existing server, perhaps at a subdomain, because updating it would take person-hours from more important jobs. Longer term, I hope we can get it moved to new homes as appropriate. Scipy-central unfortunately also seems somewhat dead. We got in touch with the developer recently, and he essentially offered the Django codebase to anyone who has the time to work on it. There's also work going on with nbviewer.ipython.org, which could be the base of a new example-code sharing site. > - Include some specific documentation/tutorial at the scipy stack level? > (maybe based on https://github.com/scipy-lectures/scipy-lecture-notes) Yes, longer term, I hope that we'll develop much more connected documentation around the scipy stack, including tutorials and howtos. 
Best wishes, Thomas > Sorry for the long mail. It were just some ideas, see what you can do with > it. > > Regards, > Joris From andy.terrel at gmail.com Mon Oct 8 22:55:12 2012 From: andy.terrel at gmail.com (Andy Ray Terrel) Date: Mon, 8 Oct 2012 21:55:12 -0500 Subject: [SciPy-User] Thoughts on SciPy Conference Message-ID: Hello all, I wrote up some thoughts on the SciPy conference. I would appreciate any feedback as I'm chatting with some Enthought employees about next year. http://andy.terrel.us/blog/2012/10/08/thoughts-on-the-scipy-conference/ -- Andy From juanlu001 at gmail.com Tue Oct 9 06:01:36 2012 From: juanlu001 at gmail.com (=?ISO-8859-1?Q?Juan_Luis_Cano_Rodr=EDguez?=) Date: Tue, 9 Oct 2012 12:01:36 +0200 Subject: [SciPy-User] numpy.piecewise doesn't work with lists, only ndarrays In-Reply-To: References: Message-ID: Thank you all for your insightful responses, definitely no changes should be made. I am learning and didn't expect this behaviour when comparing lists to numbers. Cheers, Juan Luis Cano -------------- next part -------------- An HTML attachment was scrubbed... URL: From helmrp at yahoo.com Tue Oct 9 10:02:47 2012 From: helmrp at yahoo.com (The Helmbolds) Date: Tue, 9 Oct 2012 07:02:47 -0700 (PDT) Subject: [SciPy-User] Just curious Message-ID: <1349791367.31098.YahooMailNeo@web31810.mail.mud.yahoo.com> Just curious as to why 'anneal' is one of the allowable 'minimize' method options, while 'brute' is not. Bob?H From nkoelling at gmail.com Tue Oct 9 12:13:00 2012 From: nkoelling at gmail.com (=?ISO-8859-1?Q?Nils_K=F6lling?=) Date: Tue, 9 Oct 2012 17:13:00 +0100 Subject: [SciPy-User] stats.ranksums vs. stats.mannwhitneyu Message-ID: I am trying to perform a Mann-Whitney U (AKA rank sum) test using Scipy. My data consists of around 30 samples in total with ties, so I get anything between 1:29 .. 15:15 .. 29:1 samples per group. As far as I can see there are two options: scipy.stats.ranksums: Does not handle ties, equivalent to R's wilcox.test with exact=False and correct=False scipy.stats.mannwhitneyu: Handles ties, equivalent to R's wilcox.test with exact=False and correct=use_continuity So at first glance the MWU function would seem to be the better choice, except the docs explicitly state that it should not be used with less than 20 samples per group. So what is the best function to use in this case? What kind of biases will I get when I use the mannwhitneyu function with less than 20 samples? And what sort of problems do ties cause with ranksums? Cheers Nils From claas.koehler at dlr.de Tue Oct 9 12:28:42 2012 From: claas.koehler at dlr.de (=?ISO-8859-1?Q?=22Claas_H=2E_K=F6hler=22?=) Date: Tue, 9 Oct 2012 18:28:42 +0200 Subject: [SciPy-User] error function with complex argument Message-ID: <507450BA.9050400@dlr.de> Hi list! I have a question regarding the error function scipy.special.erf: Is it intended, that the erf of an imaginary argument yields a non-vanishing real-part? I get e.g. erf(1j)= 1.6504257587975431j erf(5j)= (1+8298273879.8992386j) The first result is what I would expect in accordance with Wolfram alpha. The second result, however, has a real part of unity. As far as I know, the real part of erf should always vanish for purely imaginary numbers. Any support would be appreciated. Regards Claas From josef.pktd at gmail.com Tue Oct 9 13:05:38 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 9 Oct 2012 13:05:38 -0400 Subject: [SciPy-User] stats.ranksums vs. 
stats.mannwhitneyu In-Reply-To: References: Message-ID: On Tue, Oct 9, 2012 at 12:13 PM, Nils K?lling wrote: > I am trying to perform a Mann-Whitney U (AKA rank sum) test using > Scipy. My data consists of around 30 samples in total with ties, so I > get anything between 1:29 .. 15:15 .. 29:1 samples per group. > > As far as I can see there are two options: > > scipy.stats.ranksums: Does not handle ties, equivalent to R's > wilcox.test with exact=False and correct=False > scipy.stats.mannwhitneyu: Handles ties, equivalent to R's wilcox.test > with exact=False and correct=use_continuity > > So at first glance the MWU function would seem to be the better > choice, except the docs explicitly state that it should not be used > with less than 20 samples per group. > > So what is the best function to use in this case? What kind of biases > will I get when I use the mannwhitneyu function with less than 20 > samples? And what sort of problems do ties cause with ranksums? If you have samples with 1:29 one observation in one sample and 29 observation in the other sample or similar, I would definitely go for permutation tests. For the very asymmetric sample sizes you could even do exact instead of random permutations. (I don't remember how to calculate how many cases we have.) Then your p-values will be more accurate, but the power of the test will be (very) low. -------- I wrote initially a general answer when I misread that you have 30 observations per sample: mannwhitneyu is the best scipy has. None of the tests similar to mannwhitneyu has a small sample distribution, IIRC. Some discussion and comparison with other packages is in http://projects.scipy.org/scipy/ticket/901 I don't have much experience with how good or bad the normal approximation is for mannwhitneyu. My guess would be that if you don't have a large number of ties, then it should be ok. As alternative, and to see whether it makes a difference in your case, you could also use p-values based on permutation tests along the lines of https://gist.github.com/1270325 (my "view": If the pvalue with mannwhitneyu is not close to your acceptance level 0.05 or similar, then I wouldn't bother. If the p-value is close, then I would feel safer with a permutation test.) ------------- Josef > > Cheers > > Nils > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From pav at iki.fi Tue Oct 9 13:12:08 2012 From: pav at iki.fi (Pauli Virtanen) Date: Tue, 09 Oct 2012 20:12:08 +0300 Subject: [SciPy-User] error function with complex argument In-Reply-To: <507450BA.9050400@dlr.de> References: <507450BA.9050400@dlr.de> Message-ID: 09.10.2012 19:28, "Claas H. K?hler" kirjoitti: > I have a question regarding the error function scipy.special.erf: > > Is it intended, that the erf of an imaginary argument yields a non-vanishing real-part? > > I get e.g. > erf(1j)= 1.6504257587975431j > erf(5j)= (1+8298273879.8992386j) > > The first result is what I would expect in accordance with Wolfram alpha. The second result, however, > has a real part of unity. As far as I know, the real part of erf should always vanish for purely > imaginary numbers. > > Any support would be appreciated. The reason here is that the ye olde complex erf Fortran implementation that Scipy has uses the asymptotic expansion (Abramowitz & Stegun 7.1.23) to compute large-argument values. The asymptotic series is for erfc, and one always gets Re erf = 1 along the imaginary axis. 
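A quick way to see the effect numerically is to compare scipy.special.erf against an arbitrary-precision reference on the imaginary axis. The snippet below is only a sketch for inspecting the behaviour, and it assumes mpmath is installed purely as that reference; it is not a proposed fix:

    from scipy import special
    import mpmath  # assumed available; used only as a high-precision reference

    for z in (1j, 5j, 10j):
        approx = special.erf(z)
        exact = complex(mpmath.erf(z))
        # the exact real part is 0 on the imaginary axis, while the scipy value
        # picks up a spurious +1 once the erfc asymptotic series takes over
        print(z, approx, exact)
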
Of course, this is somewhat naive. While it does produce reasonable relative accuracy as a complex number, the accuracy of the real and imaginary parts separately is not necessarily OK near the imaginary axis. The issue with Scipy here is twofold -- first, there are no better existing special function libraries we could use, or at least I'm not aware of them. Second, writing these from scratch takes time and expertise and nobody has so far volunteered to do any work in this direction. -- Pauli Virtanen From sunghwanchoi91 at gmail.com Tue Oct 9 22:06:21 2012 From: sunghwanchoi91 at gmail.com (SungHwan Choi) Date: Wed, 10 Oct 2012 11:06:21 +0900 Subject: [SciPy-User] linalg.iterative problem! Message-ID: Hi I have a trouble with iterative linear equation solver in sparse linalg >>> A=np.random.rand(100,100) >>> b=np.random.rand(100,1) >>> x1,info=bicg(A,b) >>> print info 0 >>> x,info=bicg(A,b,x0=x1) >>> print info 0 Wheather setting guessing value or not, solutions should be the same if both calculations were converged. But x is ridiculous values >>> x1 array([ -1.23619915e+00, 2.97586147e+00, -6.14648472e-01, -1.60393960e+00, 1.27603461e+00, -6.71559072e-01, -4.57949314e-04, -2.77060765e-01, -4.17339328e-01, 4.58723928e-01, 1.25520390e+00, -6.43505834e-01, -2.50714182e+00, 1.64801812e+00, 3.22769225e-01, -3.47764545e+00, -2.64155124e+00, 8.02161189e-01, -2.04397410e-02, 1.78176386e+00, -2.30534938e+00, 2.03747784e-01, 4.25741370e-01, 8.92017739e-02, 7.92235549e-01, 2.05296800e+00, -5.07849138e-01, 2.39548767e+00, -6.75288110e-01, 5.40248358e-01, -1.22652305e+00, 1.24128988e+00, 3.55832137e-01, -4.94905114e-01, 1.89255642e+00, 1.13032169e+00, -1.13126641e+00, -1.24107851e+00, -3.50610928e-01, 1.51242380e+00, 5.93313109e-02, -1.65542281e+00, -1.31457525e+00, -2.07950912e+00, 2.03842426e+00, -1.25129931e+00, -1.18204676e+00, -2.84095828e-01, 1.50420723e+00, -1.86947284e+00, 3.82634122e-01, 1.59583715e+00, 2.38088734e+00, -1.94456801e+00, -3.91679300e+00, -5.82275859e-01, 6.37373111e-01, 1.50117747e+00, 3.16166509e-01, -4.80709301e-01, 2.44748482e-01, 8.46311114e-01, -3.50561001e-01, 1.17040825e+00, 8.48462084e-01, 2.26995940e+00, -4.02400162e-01, 8.15964837e-02, -1.17082091e-01, 5.76520318e-01, 2.68571769e+00, -8.24618021e-01, 1.70237224e+00, -9.51878209e-01, -1.79056788e+00, 1.28023233e+00, -3.06323112e+00, 1.36928031e+00, -5.32667426e-01, -8.76808999e-01, 5.04791986e+00, 1.02573111e+00, 2.91480759e-01, -1.65205205e+00, 2.01570733e+00, -8.58303160e-01, 1.00844953e+00, -1.42026281e+00, -1.35743978e+00, -6.98618293e-01, -9.64603408e-01, 4.94354222e-01, -1.56639931e+00, -1.00424343e+00, 1.34539380e+00, 8.34746938e-01, -1.42944790e-02, 4.11728888e-02, 8.48928870e-01, -3.81714583e-01]) >>> x array([ 5.54091698e-06, 1.01097308e-06, -1.90554231e-07, 4.72137778e-07, -1.59217304e-06, -2.63521296e-06, 1.82539010e-06, -3.29784996e-06, 2.11241995e-06, 3.34528259e-06, -1.48936133e-06, 4.64833156e-06, 2.72517397e-06, -8.68280493e-07, -1.48461475e-06, 1.31078987e-06, 1.96827837e-06, 2.43522800e-06, -1.81519616e-09, 7.36595257e-07, -1.68678301e-06, -2.36489475e-06, -9.48767026e-08, -4.19287423e-07, -1.94382913e-06, -2.85541661e-06, -2.22431928e-06, 5.69426787e-07, -3.20549054e-06, -4.28991209e-06, -2.66204912e-06, -5.41291369e-07, 1.40179165e-07, -7.73036341e-08, -2.62207353e-06, 3.04217252e-07, 7.58099103e-06, 1.64647208e-07, -2.07367685e-06, 1.07293388e-06, -2.64252934e-06, 8.43832882e-07, -2.09558797e-06, -2.38424059e-06, -2.01101471e-06, 1.14992748e-06, 1.75975671e-06, 3.47029359e-06, -1.73474476e-06, 
1.63282775e-06, 2.14847352e-06, -1.06630511e-07, -3.71185399e-07, -6.19298483e-07, -4.22283992e-07, 2.87057463e-06, 2.50493018e-07, 2.38959629e-07, 1.09429464e-06, 2.78931839e-06, 1.04950522e-06, 1.92574749e-06, 2.16166697e-06, -1.49381992e-07, 2.57534472e-06, -1.80238481e-06, -4.48006258e-07, -4.82004956e-06, -2.90858804e-06, 2.36872252e-06, -5.82462798e-06, 2.28721650e-06, 4.98778955e-06, 7.42277728e-07, 4.79308235e-06, 3.32154978e-06, 2.01826593e-06, 1.70133451e-06, 1.04876888e-06, -1.66519455e-06, -2.29374493e-06, 2.85916887e-06, -2.92097942e-06, 4.34734275e-07, 2.09635331e-06, -1.21218109e-06, -2.15483189e-06, -2.62789759e-06, 5.97557810e-06, -8.16033223e-07, 5.59003423e-07, -1.27845573e-06, -2.81987257e-06, -2.99137673e-06, -1.68537057e-06, -1.23698610e-06, -1.26839543e-06, -6.37207587e-08, 3.99191552e-07, -1.97820929e-06]) When I set x0, I always get some very small value as solution but I don't know why it give us original solution Please, help me if you have a piece of knowhow to this phenomena Sincerely Sunghwan -------------- next part -------------- An HTML attachment was scrubbed... URL: From claas.koehler at dlr.de Wed Oct 10 04:39:10 2012 From: claas.koehler at dlr.de (=?ISO-8859-1?Q?=22Claas_H=2E_K=F6hler=22?=) Date: Wed, 10 Oct 2012 10:39:10 +0200 Subject: [SciPy-User] error function with complex argument In-Reply-To: References: <507450BA.9050400@dlr.de> Message-ID: <5075342E.2080604@dlr.de> On 09/10/12 19:12, Pauli Virtanen wrote: > 09.10.2012 19:28, "Claas H. K?hler" kirjoitti: >> I have a question regarding the error function scipy.special.erf: >> >> Is it intended, that the erf of an imaginary argument yields a non-vanishing real-part? >> >> I get e.g. >> erf(1j)= 1.6504257587975431j >> erf(5j)= (1+8298273879.8992386j) >> >> The first result is what I would expect in accordance with Wolfram alpha. The second result, however, >> has a real part of unity. As far as I know, the real part of erf should always vanish for purely >> imaginary numbers. >> >> Any support would be appreciated. > > The reason here is that the ye olde complex erf Fortran implementation > that Scipy has uses the asymptotic expansion (Abramowitz & Stegun > 7.1.23) to compute large-argument values. The asymptotic series is for > erfc, and one always gets Re erf = 1 along the imaginary axis. > > Of course, this is somewhat naive. While it does produce reasonable > relative accuracy as a complex number, the accuracy of the real and > imaginary parts separately is not necessarily OK near the imaginary axis. > > The issue with Scipy here is twofold -- first, there are no better > existing special function libraries we could use, or at least I'm not > aware of them. Second, writing these from scratch takes time and > expertise and nobody has so far volunteered to do any work in this > direction. > Thanks for the quick response! The bottom line is that erf is actually not (correctly) implemented for complex arguments, if I understand you correctly. I suspect there are good reasons to provide a function which is known to yield incorrect results, so that throwing a type error is not an option? (This is what erfc does on my machine) However, adding a warning when called with complex arguments could be helpful to prevent naiive use as in my case. Adding this important piece of information to the docs would not harm either, from my point of view. In any case, thanks for the quick support. 
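In the meantime, a user-side guard along the lines of the warning suggested above could look like the rough sketch below; the wrapper name is made up and this is not part of scipy itself:

    import warnings
    import numpy as np
    from scipy import special

    def erf_checked(z):
        # hypothetical thin wrapper: warn when erf is evaluated at complex
        # arguments, where the real and imaginary parts may individually
        # lose accuracy even though the complex value is relatively accurate
        z = np.asarray(z)
        if np.iscomplexobj(z):
            warnings.warn("erf called with a complex argument; real and "
                          "imaginary parts may be individually inaccurate")
        return special.erf(z)

    print(erf_checked(5j))
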
Regards Claas From josef.pktd at gmail.com Wed Oct 10 11:18:18 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 10 Oct 2012 11:18:18 -0400 Subject: [SciPy-User] stats.ranksums vs. stats.mannwhitneyu In-Reply-To: References: Message-ID: On Wed, Oct 10, 2012 at 8:59 AM, Nils K?lling wrote: > Thank you for your reply, Josef! Is there any reason you are > calculating the test manually in your code instead of using > scipy.stats.kruskal? I also got a trial version for mannwhitneyu https://gist.github.com/3866149 The main reason to use function specific permutation is that some of the calculations stay the same for each permutation, especially rankdata can be slow. generic permutation is more flexible but I expect it to be slower. > > I have written my own version for permutation-based p-values using > stats.mannwhitneyu now and ran a few trials. Here is what I get for: > > a=8*[0] > b=n*[1] > > n = 1 - normal = 0.0133283287808 / permuted = 0.109775608976 > n = 2 - normal = 0.00491580235039 / permuted = 0.0232390704372 > n = 3 - normal = 0.00244136177941 / permuted = 0.00559977600896 > n = 4 - normal = 0.00131365315366 / permuted = 0.00185992560298 > n = 5 - normal = 0.000731481991814 / permuted = 0.000719971201152 > n = 6 - normal = 0.000414875963454 / permuted = 0.000539978400864 > n = 7 - normal = 0.000237996579543 / permuted = 0.00019999200032 > n = 8 - normal = 0.000137586057166 / permuted = 0.000159993600256 > n = 9 - normal = 7.99851933706e-05 / permuted = 7.9996800128e-05 > > So if we assume that the permuted p-value is the "true" value, it > seems like one could get away with just using the normal, > non-permutation based version for n >= 5, since the permuted value > does not differ much from the normal one anymore. What do you think? I tried mainly the n1=5, n2=25 case, and I also see only small differences between normal distribution pvalues and permutation pvalues. The difference for kruskal was also small. One possibility is that, if the data comes from a "very non-normal" distribution, then the difference might be larger, but I haven't tried yet. If someone really wants to use hard thresholds like alpha=0.05, then small differences might give different results, for example in my generated example: two sided pvalue from normal approximation, and permutations 27.0 0.0514504675812 0.0454 (but I don't think it should make much difference in our conclusions if we have 0.051 or 0.045.) Cheers, Josef > > Cheers > > Nils From softwareday at tacc.utexas.edu Wed Oct 10 11:33:15 2012 From: softwareday at tacc.utexas.edu (Scientific Software Days) Date: Wed, 10 Oct 2012 10:33:15 -0500 Subject: [SciPy-User] CFP: Scientific Software Days, Dec 17, Austin, TX Message-ID: CALL FOR PARTICIPATION: 6th ANNUAL SCIENTIFIC SOFTWARE DAYS Austin TX December 17 2012 Conference Details and Talk Submission at http://scisoftdays.org/meetings/2012/ Please email questions to softwareday at tacc.utexas.edu Hosted by the Texas Advanced Computing Center and the Jackson School of Geosciences, University of Texas at Austin. Scientists use software for their research. Some of them also develop computational software as part of their research. Scientific Software Day is an ongoing meeting of users and producers of scientific software, with presentations by scientific software tool makers and the users of their tools. The objective is to build cross-disciplinary community and skills in the diverse set of users and developers of scientific software, both academic and industrial. 
Most groups that use supercomputing cope with their scientific software environment in isolation, not always relying on prepackaged ?canned? solutions. Many successful lines of research and development are achieved, but many times less than optimal paths are taken, simply because computing is done by people stretched between computational skills and skills in the relevant science and engineering specialties. Available tools and methods are not always known to the people who need them, and time pressure makes it hard to make the best use of the tools available. Support staff at supercomputing centers is stretched and is best at responding to specific issues rather than offering broad support. We seek to build a community to address these needs. The Scientific Software Day at UT Austin is intended to nucleate that community. If you are involved in any end use or development of scientific software, you can benefit from and contribute to this goal. This is, therefore, a somewhat unusual call for presentations. Ideal presentations for Scientific Software Days are of two types: 1) presentations of generic tools that can be used in scientific software development and deployment 2) presentations of specific work, focusing on experience in developing scientific software, workflows, and tool chains. We are especially seeking presentations of the second type. We would appreciate a brief introduction to your work intended for a general scientific audience, and then a focus on your workflow or any particular aspect of it that presented particular challenges or required original solutions. The target audience will be a broad selection of the scientific and engineering communities with a particular interest in supercomputing. Let?s get to know each other and learn from one another. Andy R. Terrel, Ph.D. Scientific Software Days Organizer Texas Advanced Computing Center University of Texas at Austin aterrel at tacc.utexas.edu From nkoelling at gmail.com Wed Oct 10 08:59:30 2012 From: nkoelling at gmail.com (=?ISO-8859-1?Q?Nils_K=F6lling?=) Date: Wed, 10 Oct 2012 13:59:30 +0100 Subject: [SciPy-User] stats.ranksums vs. stats.mannwhitneyu In-Reply-To: References: Message-ID: Thank you for your reply, Josef! Is there any reason you are calculating the test manually in your code instead of using scipy.stats.kruskal? I have written my own version for permutation-based p-values using stats.mannwhitneyu now and ran a few trials. Here is what I get for: a=8*[0] b=n*[1] n = 1 - normal = 0.0133283287808 / permuted = 0.109775608976 n = 2 - normal = 0.00491580235039 / permuted = 0.0232390704372 n = 3 - normal = 0.00244136177941 / permuted = 0.00559977600896 n = 4 - normal = 0.00131365315366 / permuted = 0.00185992560298 n = 5 - normal = 0.000731481991814 / permuted = 0.000719971201152 n = 6 - normal = 0.000414875963454 / permuted = 0.000539978400864 n = 7 - normal = 0.000237996579543 / permuted = 0.00019999200032 n = 8 - normal = 0.000137586057166 / permuted = 0.000159993600256 n = 9 - normal = 7.99851933706e-05 / permuted = 7.9996800128e-05 So if we assume that the permuted p-value is the "true" value, it seems like one could get away with just using the normal, non-permutation based version for n >= 5, since the permuted value does not differ much from the normal one anymore. What do you think? 
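For reference, a rough sketch of the kind of random-permutation p-value being compared with the normal approximation here; the function below is illustrative (it permutes the pooled sample and uses the smaller-U convention of stats.mannwhitneyu), not the exact code behind the numbers above:

    import numpy as np
    from scipy import stats

    def mwu_permutation_pvalue(a, b, n_perm=10000, seed=0):
        # estimate the Mann-Whitney p-value by randomly permuting the pooled sample
        a = np.asarray(a, dtype=float)
        b = np.asarray(b, dtype=float)
        u_obs, _ = stats.mannwhitneyu(a, b)   # smaller U; smaller means more extreme
        pooled = np.concatenate([a, b])
        rng = np.random.RandomState(seed)
        hits = 0
        for _ in range(n_perm):
            rng.shuffle(pooled)
            u_perm, _ = stats.mannwhitneyu(pooled[:len(a)], pooled[len(a):])
            if u_perm <= u_obs:
                hits += 1
        # add-one smoothing keeps the estimated p-value away from exactly zero
        return (hits + 1.0) / (n_perm + 1.0)

    print(mwu_permutation_pvalue(8 * [0], 5 * [1]))
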
Cheers Nils From pav at iki.fi Wed Oct 10 17:11:04 2012 From: pav at iki.fi (Pauli Virtanen) Date: Wed, 10 Oct 2012 21:11:04 +0000 (UTC) Subject: [SciPy-User] error function with complex argument References: <507450BA.9050400@dlr.de> <5075342E.2080604@dlr.de> Message-ID: Claas H. K?hler dlr.de> writes: [clip] > The bottom line is that erf is actually not (correctly) implemented > for complex arguments, if I understand you correctly. It is implemented correctly, in the sense that abs(z - z_exact) / abs(z_exact) remains small. However, this does not mean that the real and imaginary parts separately are accurate. Of course, an implementation where also this was true would be desirable. -- Pauli Virtanen From josef.pktd at gmail.com Wed Oct 10 18:58:35 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 10 Oct 2012 18:58:35 -0400 Subject: [SciPy-User] special: beta or gamma Message-ID: scipy.stats has a bit of a mixed usage in most cases we use gammaln to get higher precision, but maybe betaln would be better >>> factorial(30) / factorial(5) / factorial(25) 142506.0 >>> 1./special.beta(6, 26) / 31 142505.99999999997 >>> special.gamma(31) / special.gamma(6) / special.gamma(26) 142506.0 >>> np.log(factorial(30) / factorial(5) / factorial(25)) 11.867139383067599 >>> -special.betaln(6, 26) - np.log( 31) 11.867139383067599 >>> special.gammaln(31) - special.gammaln(6) - special.gammaln(26) 11.867139383067581 or maybe there is no difference >>> n1, n2 = 5000, 4000 >>> -special.betaln(n1+1, n2+1) - np.log(n1 + n2 + 1) 6177.8820911143594 >>> special.gammaln(n1+n2+1) - special.gammaln(n1+1) - special.gammaln(n2+1) 6177.8820911143594 >>> n1, n2 = 50000, 4000 >>> special.gammaln(n1+n2+1) - special.gammaln(n1+1) - special.gammaln(n2+1) 14253.783294794481 >>> -special.betaln(n1+1, n2+1) - np.log(n1 + n2 + 1) 14253.78329479451 some ``special`` notes while finding out how many permutations we have. Josef From sturla at molden.no Thu Oct 11 05:20:43 2012 From: sturla at molden.no (Sturla Molden) Date: Thu, 11 Oct 2012 11:20:43 +0200 Subject: [SciPy-User] stats.ranksums vs. stats.mannwhitneyu In-Reply-To: References: Message-ID: <50768F6B.5030804@molden.no> On 10.10.2012 17:18, josef.pktd at gmail.com wrote: > > (but I don't think it should make much difference in our conclusions > if we have 0.051 or 0.045.) I think it should make all the difference in the world. The Neuman-Pearson error rates comes from the fixed a priori decision rule. That is why group sizes and stopping rules need to be fixed in advance when doing classical statistics. We should NOT be tempted to "add more data" in case of p=0.051. A sharp null hypothesis is known to be false in advance, so you can always reject it by adding more data. Once you start using the p-value as a "subjective measure of evidence" (which by the way violates the likelihood principle), you should do Bayesian analysis instead. Sturla From josh.k.lawrence at gmail.com Thu Oct 11 09:06:20 2012 From: josh.k.lawrence at gmail.com (Josh Lawrence) Date: Thu, 11 Oct 2012 08:06:20 -0500 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: As a follow-up, I should submit a pull request, correct? Has this conversation been copied/moved/seen on the appropriate numpy mailing list (I realized after the 2nd or 3rd post I probably should have posted it to numpy-dev). On Wed, Oct 3, 2012 at 5:00 PM, Josh Lawrence wrote: > Yes, I found the paper quite clear. 
I did a while loop with if blocks > (basically a switch statement) instead of goto statements since I was > in MATLAB and it makes a lot more sense the way I wrote it. > > On Wed, Oct 3, 2012 at 4:45 PM, Robert Kern wrote: >> On Wed, Oct 3, 2012 at 10:42 PM, Josh Lawrence >> wrote: >>> Hah, my pleasure. I'm surprised I found them, as your code seems to >>> always work so well. >> >> I was a bored grad student, desperately not trying to do real work and >> mistranslated some goto logic. The paper is clearer than the RANLIB >> code I was referencing, but I must have missed that. >> >> -- >> Robert Kern >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > > > > -- > Josh Lawrence -- Josh Lawrence From josef.pktd at gmail.com Thu Oct 11 09:13:30 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 11 Oct 2012 09:13:30 -0400 Subject: [SciPy-User] stats.ranksums vs. stats.mannwhitneyu In-Reply-To: <50768F6B.5030804@molden.no> References: <50768F6B.5030804@molden.no> Message-ID: On Thu, Oct 11, 2012 at 5:20 AM, Sturla Molden wrote: > On 10.10.2012 17:18, josef.pktd at gmail.com wrote: >> >> (but I don't think it should make much difference in our conclusions >> if we have 0.051 or 0.045.) > > I think it should make all the difference in the world. The > Neuman-Pearson error rates comes from the fixed a priori decision rule. > That is why group sizes and stopping rules need to be fixed in advance > when doing classical statistics. We should NOT be tempted to "add more > data" in case of p=0.051. A sharp null hypothesis is known to be false > in advance, so you can always reject it by adding more data. Once you > start using the p-value as a "subjective measure of evidence" (which by > the way violates the likelihood principle), you should do Bayesian > analysis instead. Depends on your purpose, I wouldn't bet my money on the difference, or it wouldn't change much the odds for a bet. Maybe it's necessary to get *more* data in both cases. There is still a lot of uncertainty about the p-values because the assumptions of these tests might not be satisfied. For example http://onlinelibrary.wiley.com/doi/10.1002/sim.3561/abstract Even permutation tests rely on additional assumptions on the distributions of the two samples, and I doubt they are exactly satisfied. If we want to get a few more decimals in the small sample case (less than 20 observations), then we could add the tables that are available for these cases. Josef classical statistics and bayesian decision theory > > Sturla > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From robert.kern at gmail.com Thu Oct 11 09:29:59 2012 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 11 Oct 2012 14:29:59 +0100 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: On Thu, Oct 11, 2012 at 2:06 PM, Josh Lawrence wrote: > As a follow-up, I should submit a pull request, correct? Has this > conversation been copied/moved/seen on the appropriate numpy mailing > list (I realized after the 2nd or 3rd post I probably should have > posted it to numpy-dev). If you have a fix already, yes, please do submit the PR. I can make time to review it. Thanks! 
-- Robert Kern From josh.k.lawrence at gmail.com Thu Oct 11 09:52:48 2012 From: josh.k.lawrence at gmail.com (Josh Lawrence) Date: Thu, 11 Oct 2012 08:52:48 -0500 Subject: [SciPy-User] NumPy Binomial BTPE method Problem In-Reply-To: References: Message-ID: I don't have one ready yet. Hopefully by the end of the weekend I'll get one, though. On Thu, Oct 11, 2012 at 8:29 AM, Robert Kern wrote: > On Thu, Oct 11, 2012 at 2:06 PM, Josh Lawrence > wrote: >> As a follow-up, I should submit a pull request, correct? Has this >> conversation been copied/moved/seen on the appropriate numpy mailing >> list (I realized after the 2nd or 3rd post I probably should have >> posted it to numpy-dev). > > If you have a fix already, yes, please do submit the PR. I can make > time to review it. > > Thanks! > > -- > Robert Kern > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user -- Josh Lawrence From josef.pktd at gmail.com Thu Oct 11 10:57:23 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 11 Oct 2012 10:57:23 -0400 Subject: [SciPy-User] "small data" statistics Message-ID: Most statistical tests and statistical inference in scipy.stats and statsmodels relies on large number assumptions. Everyone is talking about "Big data", but is anyone still interested in doing small sample statistics in python. I'd like to know whether it's worth spending any time on general purpose small sample statistics. for example: http://facultyweb.berry.edu/vbissonnette/statshw/doc/perm_2bs.html ``` Example homework problem: Twenty participants were given a list of 20 words to process. The 20 participants were randomly assigned to one of two treatment conditions. Half were instructed to count the number of vowels in each word (shallow processing). Half were instructed to judge whether the object described by each word would be useful if one were stranded on a desert island (deep processing). After a brief distractor task, all subjects were given a surprise free recall task. The number of words correctly recalled was recorded for each subject. Here are the data: Shallow Processing: 13 12 11 9 11 13 14 14 14 15 Deep Processing: 12 15 14 14 13 12 15 14 16 17 ``` Josef From gael.varoquaux at normalesup.org Thu Oct 11 11:49:50 2012 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Thu, 11 Oct 2012 17:49:50 +0200 Subject: [SciPy-User] "small data" statistics In-Reply-To: References: Message-ID: <20121011154950.GG14004@phare.normalesup.org> On Thu, Oct 11, 2012 at 10:57:23AM -0400, josef.pktd at gmail.com wrote: > Everyone is talking about "Big data", but is anyone still interested > in doing small sample statistics in python. I am! > I'd like to know whether it's worth spending any time on general > purpose small sample statistics. It is. Big data is a buzz, but few people have big data. In addition, what they don't realize is that it is often a small sample problem in terms of statistics, as the number of sample is often not much bigger than the number of features. Thanks for all your work on scipy.stats! Gael From takowl at gmail.com Thu Oct 11 11:54:47 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Thu, 11 Oct 2012 16:54:47 +0100 Subject: [SciPy-User] "small data" statistics In-Reply-To: References: Message-ID: On 11 October 2012 15:57, wrote: > Everyone is talking about "Big data", but is anyone still interested > in doing small sample statistics in python. 
> > I'd like to know whether it's worth spending any time on general > purpose small sample statistics. I'm certainly interested in that sort of thing - a lot of biology still revolves around simple, 'small data' stats. Thanks, Thomas From srey at asu.edu Thu Oct 11 11:59:24 2012 From: srey at asu.edu (Serge Rey) Date: Thu, 11 Oct 2012 08:59:24 -0700 Subject: [SciPy-User] "small data" statistics In-Reply-To: References: Message-ID: On Thu, Oct 11, 2012 at 7:57 AM, wrote: > Most statistical tests and statistical inference in scipy.stats and > statsmodels relies on large number assumptions. > > Everyone is talking about "Big data", but is anyone still interested > in doing small sample statistics in python. +1 -- Sergio (Serge) Rey Professor, School of Geographical Sciences and Urban Planning GeoDa Center for Geospatial Analysis and Computation Arizona State University http://geoplan.asu.edu/rey Editor, International Regional Science Review http://irx.sagepub.com From deshpande.jaidev at gmail.com Thu Oct 11 14:57:46 2012 From: deshpande.jaidev at gmail.com (Jaidev Deshpande) Date: Fri, 12 Oct 2012 00:27:46 +0530 Subject: [SciPy-User] Thresholding in sparse matrices Message-ID: Hi, When constructing a sparse matrix, (for instance, using scipy.sparse.coo_matrix) does the function take into account any tolerance? In other words, does an element have to be exactly zero to be casted as a zero in the sparse matrix? Can a tolerance value be specified below which every element would be casted as zero? Suppose I have a matrix x: >>> x = np.array([1e-3, 1e-5, 1e-10]) >>> coo_matrix(x) <1x3 sparse matrix of type '' with 3 stored elements in COOrdinate format> Now, I want x[1] and x[2] to be zeros, because they are less than 0.001 (say). This thresholding can be done on the array x itself, but I wonder if there is a way to do this through the sparse constructors. Thanks From emanuele at relativita.com Fri Oct 12 04:36:12 2012 From: emanuele at relativita.com (Emanuele Olivetti) Date: Fri, 12 Oct 2012 10:36:12 +0200 Subject: [SciPy-User] "small data" statistics In-Reply-To: References: Message-ID: <5077D67C.8020108@relativita.com> On 10/11/2012 04:57 PM, josef.pktd at gmail.com wrote: > Most statistical tests and statistical inference in scipy.stats and > statsmodels relies on large number assumptions. > > Everyone is talking about "Big data", but is anyone still interested > in doing small sample statistics in python. > > I'd like to know whether it's worth spending any time on general > purpose small sample statistics. > > for example: > > http://facultyweb.berry.edu/vbissonnette/statshw/doc/perm_2bs.html > > ``` > Example homework problem: > [...] > Shallow Processing: 13 12 11 9 11 13 14 14 14 15 > Deep Processing: 12 15 14 14 13 12 15 14 16 17 > ``` I am very interested in inference from small samples, but I have some concerns about both the example and the proposed approach based on the permutation test. IMHO the question in the example at that URL, i.e. "Did the instructions given to the participants significantly affect their level of recall?" is not directly addressed by the permutation test. The permutation test is related the question "how (un)likely is the collected dataset under the assumption that the instructions did not affect the level of recall?". 
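To make that second quantity concrete, here is a rough sketch of what the permutation test on that page computes for the recall data quoted above, taking the difference of the two group means as the statistic. The helper name, the exhaustive enumeration of all relabellings and the two-sided counting are only for illustration; the page itself may resample or count slightly differently:

import numpy as np
from itertools import combinations

shallow = [13, 12, 11, 9, 11, 13, 14, 14, 14, 15]
deep    = [12, 15, 14, 14, 13, 12, 15, 14, 16, 17]

pooled = np.array(shallow + deep, dtype=float)
n = len(shallow)                   # both groups have 10 observations
total = pooled.sum()

def mean_diff(sum_group1):
    # difference of group means (group2 - group1) for equal group sizes
    return (total - 2.0 * sum_group1) / n

t_obs = mean_diff(float(sum(shallow)))    # observed statistic T(data)

# null distribution: all C(20, 10) = 184756 ways of relabelling the scores
t_null = np.array([mean_diff(pooled[list(idx)].sum())
                   for idx in combinations(range(2 * n), n)])

# fraction of relabellings at least as extreme as the observed difference
p_perm = np.mean(np.abs(t_null) >= np.abs(t_obs))
print t_obs, p_perm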
In other words the initial question is about quantifying how likely is the hypothesis "the instructions do not affect the level of recall" (let's call it H_0) given the collected dataset, with respect to how likely is the hypothesis "the instructions affect the level of recall" (let's call it H_1) given the data. In a bit more formal notation the initial question is about estimating p(H_0|data) and p(H_1|data), while the permutation test provides a different quantity, which is related (see [0]) to p(data|H_0). Clearly p(data|H_0) is different from p(H_0|data). Literature on this point is for example http://dx.doi.org/10.1016/j.socec.2004.09.033 On a different side, I am also interested in understanding which are the assumptions under which the permutation test is expected to work. I am not an expert in that field but, as far as I know, the permutation test - and all resampling approaches in general - requires that the sample is "representative" of the underlying distribution of the problem. In my opinion this requirement is difficult to assess in practice and it is even more troubling for the specific case of "small data" - of interest for this thread. Any comment on these points is warmly welcome. Best, Emanuele [0] A minor detail: I said "related" because the outcome of the permutation test, and of classical tests for hypothesis testing in general, is not precisely p(data|H_0). First of all those tests rely on a statistic of the dataset and not on the dataset itself. In the example at the URL the statistic (called "criterion" there) is the difference between the means of the two groups. Second and more important, the test provides an estimate of the probability of observing such a value for the statistic... "or a more extreme one". So if we call the statistic over the data as T(data), then the classical tests provide p(t>T(data)|H_0), and not p(data|H_0). Anyway even p(t>T(data)|H_0) is clearly different from the initial question, i.e. p(H_0|data). From helmrp at yahoo.com Fri Oct 12 06:48:25 2012 From: helmrp at yahoo.com (The Helmbolds) Date: Fri, 12 Oct 2012 03:48:25 -0700 (PDT) Subject: [SciPy-User] SciPy-User Digest, Vol 110, Issue 21 In-Reply-To: References: Message-ID: <1350038905.57123.YahooMailNeo@web31808.mail.mud.yahoo.com> > On 09/10/12 19:12, Pauli Virtanen wrote: >>??09.10.2012 19:28, "Claas H. K?hler" kirjoitti: >>>??I have a question regarding the error function scipy.special.erf: >>> >>>??Is it intended, that the erf of an imaginary argument yields a > non-vanishing real-part? >>> >>>??I get e.g. >>>??erf(1j)= 1.6504257587975431j >>>??erf(5j)= (1+8298273879.8992386j) >>> >>>??The first result is what I would expect in accordance with Wolfram > alpha. The second result, however, >>>??has a real part of unity. As far as I know, the real part of erf should > always vanish for purely >>>??imaginary numbers. >>> >>>??Any support would be appreciated. >> >>??The reason here is that the ye olde complex erf Fortran implementation >>??that Scipy has uses the asymptotic expansion (Abramowitz & Stegun >>??7.1.23) to compute large-argument values. The asymptotic series is for >>??erfc, and one always gets Re erf = 1 along the imaginary axis. >> >>??Of course, this is somewhat naive. While it does produce reasonable >>??relative accuracy as a complex number, the accuracy of the real and >>??imaginary parts separately is not necessarily OK near the imaginary axis. 
>> >>??The issue with Scipy here is twofold -- first, there are no better >>??existing special function libraries we could use, or at least I'm not >>??aware of them. Second, writing these from scratch takes time and >>??expertise and nobody has so far volunteered to do any work in this >>??direction. >> > Thanks for the quick response! > > The bottom line is that erf is actually not (correctly) implemented for complex > arguments, if I > understand you correctly. > > I suspect there are good reasons to provide a function which is known to yield > incorrect results, so > that throwing a type error is not an option? (This is what erfc does on my > machine) > > However, adding a warning when called with complex arguments could be helpful to > prevent naiive use > as in my case. Adding this important piece of information to the docs would not > harm either, from my > point of view. > > In any case, thanks for the quick support. > > Regards > Claas On my system, I get the correct answers if I'm careful about the call to erf. If I call erf with a single real value, I get the ordinary (not the complex) error function value. If I call erf with a NumPy array or a Python sequence, I get the complex error function returned. I do not think SciPy's erf is supposed to be called with a complex number.? For example: >>> special.erf(1j) 1.6504257587975431j??????????????????# Wrong answer! >>> special.erf((0,1)) array([ 0.??????? ,? 0.84270079])????????# Right answer. Two?more examples: >>> for y in range(-10, 11): ?temp = special.erf((0,y)) ?print y, temp???????????????????????????????? # Calling with a sequence, returns a NumPy array -10 [ 0.?????????-1.] -9 [ 0.???????????-1.] -8 [ 0.???????????-1.] -7 [ 0.???????????-1.] -6 [ 0.???????????-1.] -5 [ 0.???????????-1.] -4 [ 0.???????? -0.99999998] -3 [ 0.???????? -0.99997791] -2 [ 0.???????? -0.99532227] -1 [ 0.???????? -0.84270079] 0 [ 0.???????????0.] 1 [ 0.????????? 0.84270079] 2 [ 0.????????? 0.99532227] 3 [ 0.????????? 0.99997791] 4 [ 0.????????? 0.99999998] 5 [ 0.??????????1.] 6 [ 0.??????????1.] 7 [ 0.??????????1.] 8 [ 0.??????????1.] 9 [ 0.??????????1.] 10 [ 0.?????????1.] OTOH-------------------------------------------------------------------------------------------- >>> for y in range(-10, 11): ?temp = special.erf(y) ?print y, temp????????????????????????????# Calling with a (scalar)? real value returns a (scalar) real value. -10????-1.0 -9 ????-1.0 -8 ????-1.0 -7 ????-1.0 -6 ????-1.0 -5 ????-0.999999999998 -4 ????-0.999999984583 -3 ????-0.999977909503 -2 ????-0.995322265019 -1 ????-0.84270079295 0 ???? 0.0 1 ????0.84270079295 2 ????0.995322265019 3 ????0.999977909503 4 ????0.999999984583 5 ????0.999999999998 6 ????1.0 7 ????1.0 8 ????1.0 9 ????1.0 10???1.0 Bob and Paula H? ?????? From sturla at molden.no Fri Oct 12 07:30:10 2012 From: sturla at molden.no (Sturla Molden) Date: Fri, 12 Oct 2012 13:30:10 +0200 Subject: [SciPy-User] "small data" statistics In-Reply-To: <5077FB07.8020708@molden.no> References: <5077D67C.8020108@relativita.com> <5077FB07.8020708@molden.no> Message-ID: <5077FF42.6060908@molden.no> On 12.10.2012 13:12, Sturla Molden wrote: > * The Bayesian approach is not scale invariable. A monotonic transform > like y = f(x) can yield a different conclusion if we analyze y instead > of x. And this, by the way, is what really pissed off Ronald A. Fisher, the father of the "p-value". He constructed the p-value as a heuristic for assessing H0 specifically to avoid this issue. Ronald A. 
Fisher never accepted the significance testing (type-1 and type-2 error rates) of Pearson and Neuman, as experiments are seldom repeated. In fact the p-value has nothing to do with significance testing. To correct the other issues of the p-value Fisher later constructed a different kind of analysis he called "fiuducial inference". It is not commonly used today. It depends on looking at hypothesis testing as signal processing: measurement = signal + noise The noise is considered random and and the signal is the truth about H0. Fisher argued we can interfere the truth about H0 from subtracting the random noise from the collected data. The method has none of the absurdities of Bayesian and classical statistics, but for some reason it never got popular among practitioners. Sturla From sturla at molden.no Fri Oct 12 07:12:07 2012 From: sturla at molden.no (Sturla Molden) Date: Fri, 12 Oct 2012 13:12:07 +0200 Subject: [SciPy-User] "small data" statistics In-Reply-To: <5077D67C.8020108@relativita.com> References: <5077D67C.8020108@relativita.com> Message-ID: <5077FB07.8020708@molden.no> On 12.10.2012 10:36, Emanuele Olivetti wrote: > In other words the initial question is about quantifying how likely is the > hypothesis "the instructions do not affect the level of recall" > (let's call it H_0) given the collected dataset, with respect to how likely is the > hypothesis "the instructions affect the level of recall" (let's call it H_1) > given the data. In a bit more formal notation the initial question is about > estimating p(H_0|data) and p(H_1|data), while the permutation test provides > a different quantity, which is related (see [0]) to p(data|H_0). Clearly > p(data|H_0) is different from p(H_0|data). Here you must use Bayes formula :) p(H_0|data) is proportional to p(data|H_0) * p(H_0 a priori) The scale factor is just a constant, so you can generate samples from p(H_0|data) simply by using a Markov chain (e.g. Gibbs sampler) to sample from p(data|H_0) * p(H_0 a priori). And that is what we call "Bayesian statistics" :-) The "classical statistics" (sometimes called "frequentist") is very different and deals with long-run error rates you would get if the experiment and data collection are repeated. In this framework is is meaningless to speak about p(H_0|data) or p(H_0 a priori), because H_0 is not considered a random variable. Probabilities can only be assigned to random variables. The main difference from the Bayesian approach is thus that a Bayesian consider the collected data fixed and H_0 random, whereas a frequentist consider the data random and H_0 fixed. To a Bayesian the data are what you got and "the universal truth about H0" in unkown. Randomness is the uncertainty about this truth. Probability is a measurement of the precision or knowledge about H0. Doing the transform p * log2(p) yields the Shannon information in bits. To a frequentist, the data are random (i.e. collecting a new set will yield a different sample) and "the universal truth about H0" is fixed but unknown. Randomness is the process that gives you a different data set each time you draw a sample. It is not the uncertainty about H0. Choosing side it is more a matter of religion than science. Both approaches have major flaws: * The Bayesian approach is not scale invariable. A monotonic transform like y = f(x) can yield a different conclusion if we analyze y instead of x. For example your null hypothesis can be true if you used a linear scale and false if you have used a log-scale. 
Also, the conclusion is dependent on your prior opinion, which can be subjective. * The frequentist approach makes it possible to collect too much data. If you just collect enough data, any correlation or two-sided test will be significant. Obviously collecting more data should always give you better information, not invariably lead to a fixed conclusion. Why do statistics if you know the conclusion in advance? Sturla From darrelaubrey at hotmail.com Thu Oct 11 15:31:03 2012 From: darrelaubrey at hotmail.com (Overtim3) Date: Thu, 11 Oct 2012 12:31:03 -0700 (PDT) Subject: [SciPy-User] [SciPy-user] Problems Importing Message-ID: <34543713.post@talk.nabble.com> I have already installed scipy and cant seem to get anything to import from there. Im running windows 7 64 bit. Any ideas? -- View this message in context: http://old.nabble.com/Problems-Importing-tp34543713p34543713.html Sent from the Scipy-User mailing list archive at Nabble.com. From darrelaubrey at hotmail.com Thu Oct 11 16:47:44 2012 From: darrelaubrey at hotmail.com (Overtim3) Date: Thu, 11 Oct 2012 13:47:44 -0700 (PDT) Subject: [SciPy-User] [SciPy-user] Help with (Cumulative density function) CDF Message-ID: <34544205.post@talk.nabble.com> So I'm having a hard time using this function cdf(x, a, b, loc=0, scale=1) Cumulative density function. I don't fully understand where the a, b come into play I'm trying to plot this and make mine look similar to this http://en.wikipedia.org/w/index.php?title=File:Binomial_distribution_cdf.svg&page=1 but cant seem to quite get it. Any help would be appreciated. -- View this message in context: http://old.nabble.com/Help-with-%28Cumulative-density-function%29-CDF-tp34544205p34544205.html Sent from the Scipy-User mailing list archive at Nabble.com. From indranil.sinharoy at gmail.com Thu Oct 11 18:10:41 2012 From: indranil.sinharoy at gmail.com (Indranil Sinharoy) Date: Thu, 11 Oct 2012 15:10:41 -0700 (PDT) Subject: [SciPy-User] ANN: WinPython v2.7.3.0 In-Reply-To: References: Message-ID: WinPython looks very promising to me. I seriously think it has great potential. I just tried both versions (64 bit and 32-bit) on my Windows 64-bit machine and I have a couple of questions that I will ask here anyways -- 1. I really loved the concept of the package manager. However, when I tried to uninstall PyWin32 217 (I have briefly explained why I wanted to uninstall pywin32 217 below), I got the following error message: Unable to uninstall pywin32 217. Error message: [Error 5] Access is denied: 'C:\\EXECUTABLES&PROGRAMS\\WinPython\\WinPython-32bit-2.7.3.1\\python-2.7.3\\Lib\\site-packages\\win32\\win32gui.pyd' (I have also placed a screen-shot of the message here) Also, following the above error message, I was unable to lunch the WinPython control panel ever again (not sure what's happening there) 2. How do I change the font size of the text in IPython (running within WinPython)? Also, the usual Ctrl+= to zoom in didn't work. -- Indranil. Reason I wanted to uninstall PyWin32 217 is that "import dde" doesn't work since build 214 for the 32-bit version of pywin32 (and it never worked for the 64-bit version of pywin32). [http://sourceforge.net/mailarchive/forum.php?thread_name=From_noreply%40sourceforge.net_Wed_Oct_19_18%3A10%3A35_2011&forum_name=pywin32-bugs]. I really require the dde module for some projects that I am working on. So, I would like to uninstall PyWin32 217 and install build 213. 
On Monday, September 24, 2012 2:22:39 PM UTC-5, Pierre Raybaut wrote: > > Hi all, > > I'm pleased to introduce my new contribution to the Python community: > WinPython. > > WinPython v2.7.3.0 has been released and is available for 32-bit and > 64-bit Windows platforms: > http://code.google.com/p/winpython/ > > WinPython is a free open-source portable distribution of Python for > Windows, designed for scientists. > > It is a full-featured (see > http://code.google.com/p/winpython/wiki/PackageIndex) Python-based > scientific environment: > * Designed for scientists (thanks to the integrated libraries NumPy, > SciPy, Matplotlib, guiqwt, etc.: > * Regular *scientific users*: interactive data processing and > visualization using Python with Spyder > * *Advanced scientific users and software developers*: Python > applications development with Spyder, version control with Mercurial > and other development tools (like gettext) > * *Portable*: preconfigured, it should run out of the box on any > machine under Windows (without any installation requirements) and the > folder containing WinPython can be moved to any location (local, > network or removable drive) > * *Flexible*: one can install (or should I write "use" as it's > portable) as many WinPython versions as necessary (like isolated and > self-consistent environments), even if those versions are running > different versions of Python (2.7, 3.x in the near future) or > different architectures (32bit or 64bit) on the same machine > * *Customizable*: using the integrated package manager (wppm, as > WinPython Package Manager), it's possible to install, uninstall or > upgrade Python packages (see > http://code.google.com/p/winpython/wiki/WPPM for more details on > supported package formats). > > *WinPython is not an attempt to replace Python(x,y)*, this is just > something different (see > http://code.google.com/p/winpython/wiki/Roadmap): more flexible, > easier to maintain, movable and less invasive for the OS, but > certainly less user-friendly, with less packages/contents and without > any integration to Windows explorer [*]. > > [*] Actually there is an optional integration into Windows explorer, > providing the same features as the official Python installer regarding > file associations and context menu entry (this option may be activated > through the WinPython Control Panel). > > Enjoy! > -Pierre > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Oct 12 07:22:16 2012 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 12 Oct 2012 12:22:16 +0100 Subject: [SciPy-User] "small data" statistics In-Reply-To: <5077D67C.8020108@relativita.com> References: <5077D67C.8020108@relativita.com> Message-ID: On 12 Oct 2012 09:37, "Emanuele Olivetti" wrote: > > On 10/11/2012 04:57 PM, josef.pktd at gmail.com wrote: > > Most statistical tests and statistical inference in scipy.stats and > > statsmodels relies on large number assumptions. > > > > Everyone is talking about "Big data", but is anyone still interested > > in doing small sample statistics in python. > > > > I'd like to know whether it's worth spending any time on general > > purpose small sample statistics. > > > > for example: > > > > http://facultyweb.berry.edu/vbissonnette/statshw/doc/perm_2bs.html > > > > ``` > > Example homework problem: > > [...] 
> > Shallow Processing: 13 12 11 9 11 13 14 14 14 15 > > Deep Processing: 12 15 14 14 13 12 15 14 16 17 > > ``` > > I am very interested in inference from small samples, but I have > some concerns about both the example and the proposed approach > based on the permutation test. > > IMHO the question in the example at that URL, i.e. "Did the instructions > given to the participants significantly affect their level of recall?" is > not directly addressed by the permutation test. In this sentence, the word "significantly" is a term of art used to refer exactly to the quantity p(t>T(data)|H_0). So, yes, the permutation test addresses the original question; you just have to be familiar with the field's particular jargon to understand what they're saying. :-) > The permutation test is > related the question "how (un)likely is the collected dataset under the > assumption that the instructions did not affect the level of recall?". > > In other words the initial question is about quantifying how likely is the > hypothesis "the instructions do not affect the level of recall" > (let's call it H_0) given the collected dataset, with respect to how likely is the > hypothesis "the instructions affect the level of recall" (let's call it H_1) > given the data. In a bit more formal notation the initial question is about > estimating p(H_0|data) and p(H_1|data), while the permutation test provides > a different quantity, which is related (see [0]) to p(data|H_0). Clearly > p(data|H_0) is different from p(H_0|data). > Literature on this point is for example http://dx.doi.org/10.1016/j.socec.2004.09.033 > > On a different side, I am also interested in understanding which are the assumptions > under which the permutation test is expected to work. I am not an expert in that > field but, as far as I know, the permutation test - and all resampling approaches > in general - requires that the sample is "representative" of the underlying > distribution of the problem. In my opinion this requirement is difficult to assess > in practice and it is even more troubling for the specific case of "small data" - of > interest for this thread. All tests require some kind of representativeness, and this isn't really a problem. The data are by definition representative (in the technical sense) of the distribution they were drawn from. (The trouble comes when you want to decide whether that distribution matches anything you care about, but looking at the data won't tell you that.) A well designed test is one that is correct on average across samples. The alternative to a permutation test here is to make very strong assumptions about the underlying distributions (e.g. with a t test), and these assumptions are often justified only for large samples. And, resampling tests are computationally expensive, but this is no problem for small samples. So that's why non parametrics are often better in this setting. -n > Any comment on these points is warmly welcome. > > Best, > > Emanuele > > [0] A minor detail: I said "related" because the outcome of the permutation test, > and of classical tests for hypothesis testing in general, is not precisely p(data|H_0). > First of all those tests rely on a statistic of the dataset and not on the dataset itself. > In the example at the URL the statistic (called "criterion" there) is the difference > between the means of the two groups. Second and more important, > the test provides an estimate of the probability of observing such a value > for the statistic... "or a more extreme one". 
So if we call the statistic over the > data as T(data), then the classical tests provide p(t>T(data)|H_0), and not > p(data|H_0). Anyway even p(t>T(data)|H_0) is clearly different from the initial > question, i.e. p(H_0|data). > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user -------------- next part -------------- An HTML attachment was scrubbed... URL: From emanuele at relativita.com Fri Oct 12 10:21:39 2012 From: emanuele at relativita.com (Emanuele Olivetti) Date: Fri, 12 Oct 2012 16:21:39 +0200 Subject: [SciPy-User] "small data" statistics In-Reply-To: <5077FB07.8020708@molden.no> References: <5077D67C.8020108@relativita.com> <5077FB07.8020708@molden.no> Message-ID: <50782773.4090105@relativita.com> Hi Sturla, Thanks for the brief review of the frequentist and Bayesian differences (I'll try to send a few comments in a future post). The aim of my previous message was definitely more pragmatic and it boiled down to two questions that stick with Josef's call: 1) In this thread people expressed interest in making hypothesis testing from small samples, so is permutation test addressing the question of the accompanying motivating example? In my opinion it is not and I hope I provided brief but compelling motivation to support this point of view. 2) What are the assumptions under which the permutation test is valid/acceptable (independently from the accompanying motivating example)? I have looked around on this topic but I had just found generic desiderata for all resampling approaches, i.e. that the sample should be "representative" of the underlying distribution - whatever this means in practical terms. What's your take on these two questions? I guess it would be nice to clarify/discuss the motivating questions and the assumptions in this thread before planning any coding. Best, Emanuele On 10/12/2012 01:12 PM, Sturla Molden wrote: > [...] > > The "classical statistics" (sometimes called "frequentist") is very > different and deals with long-run error rates you would get if the > experiment and data collection are repeated. In this framework is is > meaningless to speak about p(H_0|data) or p(H_0 a priori), because H_0 > is not considered a random variable. Probabilities can only be assigned > to random variables. > > > [...] > > To a Bayesian the data are what you got and "the universal truth about > H0" in unkown. Randomness is the uncertainty about this truth. > Probability is a measurement of the precision or knowledge about H0. > Doing the transform p * log2(p) yields the Shannon information in bits. > > [...] > Choosing side it is more a matter of religion than science. > > > From claas.koehler at dlr.de Fri Oct 12 12:39:03 2012 From: claas.koehler at dlr.de (=?ISO-8859-1?Q?=22Claas_H=2E_K=F6hler=22?=) Date: Fri, 12 Oct 2012 18:39:03 +0200 Subject: [SciPy-User] SciPy-User Digest, Vol 110, Issue 21 In-Reply-To: <1350038905.57123.YahooMailNeo@web31808.mail.mud.yahoo.com> References: <1350038905.57123.YahooMailNeo@web31808.mail.mud.yahoo.com> Message-ID: <507847A7.1080008@dlr.de> On 12/10/12 12:48, The Helmbolds wrote: >> On 09/10/12 19:12, Pauli Virtanen wrote: >>> 09.10.2012 19:28, "Claas H. K?hler" kirjoitti: >>>> I have a question regarding the error function scipy.special.erf: >>>> >>>> Is it intended, that the erf of an imaginary argument yields a >> non-vanishing real-part? >>>> >>>> I get e.g. 
>>>> erf(1j)= 1.6504257587975431j >>>> erf(5j)= (1+8298273879.8992386j) >>>> >>>> The first result is what I would expect in accordance with Wolfram >> alpha. The second result, however, >>>> has a real part of unity. As far as I know, the real part of erf should >> always vanish for purely >>>> imaginary numbers. >>>> >>>> Any support would be appreciated. >>> >>> The reason here is that the ye olde complex erf Fortran implementation >>> that Scipy has uses the asymptotic expansion (Abramowitz & Stegun >>> 7.1.23) to compute large-argument values. The asymptotic series is for >>> erfc, and one always gets Re erf = 1 along the imaginary axis. >>> >>> Of course, this is somewhat naive. While it does produce reasonable >>> relative accuracy as a complex number, the accuracy of the real and >>> imaginary parts separately is not necessarily OK near the imaginary axis. >>> >>> The issue with Scipy here is twofold -- first, there are no better >>> existing special function libraries we could use, or at least I'm not >>> aware of them. Second, writing these from scratch takes time and >>> expertise and nobody has so far volunteered to do any work in this >>> direction. >>> >> Thanks for the quick response! >> >> The bottom line is that erf is actually not (correctly) implemented for complex >> arguments, if I >> understand you correctly. >> >> I suspect there are good reasons to provide a function which is known to yield >> incorrect results, so >> that throwing a type error is not an option? (This is what erfc does on my >> machine) >> >> However, adding a warning when called with complex arguments could be helpful to >> prevent naiive use >> as in my case. Adding this important piece of information to the docs would not >> harm either, from my >> point of view. >> >> In any case, thanks for the quick support. >> >> Regards >> Claas > > On my system, I get the correct answers if I'm careful about the call to erf. > If I call erf with a single real value, I get the ordinary (not the complex) error function value. > If I call erf with a NumPy array or a Python sequence, I get the complex er ror function returned. > I do not think SciPy's erf is supposed to be called with a complex number. According to the docs it is. Otherwise I would expect to see a domain error, similar to erfc. Regards Claas > > For example: >>>> special.erf(1j) > 1.6504257587975431j # Wrong answer! >>>> special.erf((0,1)) > array([ 0. , 0.84270079]) # Right answer. > > Two more examples: >>>> for y in range(-10, 11): > temp = special.erf((0,y)) > print y, temp # Calling with a sequence, returns a NumPy array > > -10 [ 0. -1.] > -9 [ 0. -1.] > -8 [ 0. -1.] > -7 [ 0. -1.] > -6 [ 0. -1.] > -5 [ 0. -1.] > -4 [ 0. -0.99999998] > -3 [ 0. -0.99997791] > -2 [ 0. -0.99532227] > -1 [ 0. -0.84270079] > 0 [ 0. 0.] > 1 [ 0. 0.84270079] > 2 [ 0. 0.99532227] > 3 [ 0. 0.99997791] > 4 [ 0. 0.99999998] > 5 [ 0. 1.] > 6 [ 0. 1.] > 7 [ 0. 1.] > 8 [ 0. 1.] > 9 [ 0. 1.] > 10 [ 0. 1.] > > > OTOH-------------------------------------------------------------------------------------------- >>>> for y in range(-10, 11): > temp = special.erf(y) > print y, temp # Calling with a (scalar) real value returns a (scalar) real value. 
> > -10 -1.0 > -9 -1.0 > -8 -1.0 > -7 -1.0 > -6 -1.0 > -5 -0.999999999998 > -4 -0.999999984583 > -3 -0.999977909503 > -2 -0.995322265019 > -1 -0.84270079295 > 0 0.0 > 1 0.84270079295 > 2 0.995322265019 > 3 0.999977909503 > 4 0.999999984583 > 5 0.999999999998 > 6 1.0 > 7 1.0 > 8 1.0 > 9 1.0 > 10 1.0 > > Bob and Paula H > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -- Deutsches Zentrum f?r Luft- und Raumfahrt e.V. (DLR) Institut f?r Methodik der Fernerkundung | Experimentelle Verfahren | M?nchner Str | 82234 We?ling Claas H. K?hler Telefon 08153 28-1274 | Telefax 08153 28-1337 | claas.koehler at dlr.de www.DLR.de/EOC From josef.pktd at gmail.com Fri Oct 12 14:14:01 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 12 Oct 2012 14:14:01 -0400 Subject: [SciPy-User] "small data" statistics In-Reply-To: <50782773.4090105@relativita.com> References: <5077D67C.8020108@relativita.com> <5077FB07.8020708@molden.no> <50782773.4090105@relativita.com> Message-ID: On Fri, Oct 12, 2012 at 10:21 AM, Emanuele Olivetti wrote: > Hi Sturla, > > Thanks for the brief review of the frequentist and Bayesian differences > (I'll try to send a few comments in a future post). > > The aim of my previous message was definitely more pragmatic > and it boiled down to two questions that stick with Josef's call: My aim is even more practical: If everyone else has it, and it's useful, then let's do it in Python. as for mannwhineyu this would mean tables for very small samples exact permutation for the next higher, and random permutation for medium sample sizes. (and advertise empirical likelihood in statsmodels) and for other cases (somewhere in the future) bias correction and higher order expansions of the distribution of the test statistics or estimates. http://www.alglib.net/hypothesistesting/mannwhitneyu.php (Limitation: There are too many things for "let's make it available in python".) > > 1) In this thread people expressed interest in making hypothesis testing > from small samples, so is permutation test addressing the question of > the accompanying motivating example? In my opinion it is not and I hope I > provided brief but compelling motivation to support this point of view. I got two questions "wrong" in the survey. And had to struggle with several of these http://en.wikipedia.org/wiki/P-value#Misunderstandings (especially because I was implicitly adding "if the Null is true" to some of the statements.) I find the "at least one wrong answer" graph misleading compared to the break down by question. Under the assumptions of the tests and the permutation distribution, I think the permutation tests answer the question whether there are statistically significant differences (in means, medians, distributions) across samples. But it's in the classical statistical test tradition. http://en.wikipedia.org/wiki/Uniformly_most_powerful_test consistency of test, ... > > 2) What are the assumptions under which the permutation test is > valid/acceptable (independently from the accompanying motivating example)? > I have looked around on this topic but I had just found generic desiderata for > all resampling approaches, i.e. that the sample should be "representative" > of the underlying distribution - whatever this means in practical terms. 
I collected a few papers, but haven't read them yet or only partially https://github.com/statsmodels/statsmodels/wiki/Permutation-Tests One problem is that all tests rely on assumptions and with small samples there is not enough information to tests the underlying assumptions or to switch to something that requires even weaker assumptions and still have power. For example my small Monte Carlo with mannwhitneyu: Difference between permutation pvalues and large sample normal distribution p-values is not large. I saw one recommendation that 7 observations for each sample is enough. One reference says the extreme tail probabilities are inaccurate. With only a few observations, the power of the test is very low and only detects large differences. If the distributions of the observations are symmetric and the sample size is the same, then both permutation and normal pvalues are correctly sized (close to 0.05 under the null) even if the underlying distributions are different (t(2) versus normal). If the sample sizes are unequal then differences in the distributions, causes a bias in the test, under- or over-rejecting. >From the references it sounds like that if the distributions are skewed, then the tests are also incorrectly sized. The main problem I have in terms of interpretation is that we are in many cases not really estimating a mean or median shift, but more likely stochastic dominance. Under one condition the distribution has "higher" values then under the other condition, where "higher" could mean mean-shift or just some higher quantiles (more weight on larger values). Thanks for the comments. Josef > > What's your take on these two questions? > I guess it would be nice to clarify/discuss the motivating questions and the > assumptions in this thread before planning any coding. > > Best, > > Emanuele > > > On 10/12/2012 01:12 PM, Sturla Molden wrote: >> [...] >> >> The "classical statistics" (sometimes called "frequentist") is very >> different and deals with long-run error rates you would get if the >> experiment and data collection are repeated. In this framework is is >> meaningless to speak about p(H_0|data) or p(H_0 a priori), because H_0 >> is not considered a random variable. Probabilities can only be assigned >> to random variables. >> >> >> [...] >> >> To a Bayesian the data are what you got and "the universal truth about >> H0" in unkown. Randomness is the uncertainty about this truth. >> Probability is a measurement of the precision or knowledge about H0. >> Doing the transform p * log2(p) yields the Shannon information in bits. >> >> [...] >> Choosing side it is more a matter of religion than science. >> >> >> > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From sturla at molden.no Fri Oct 12 11:01:23 2012 From: sturla at molden.no (Sturla Molden) Date: Fri, 12 Oct 2012 17:01:23 +0200 Subject: [SciPy-User] "small data" statistics In-Reply-To: <50782773.4090105@relativita.com> References: <5077D67C.8020108@relativita.com> <5077FB07.8020708@molden.no> <50782773.4090105@relativita.com> Message-ID: <507830C3.3090804@molden.no> On 12.10.2012 16:21, Emanuele Olivetti wrote: > 1) In this thread people expressed interest in making hypothesis testing > from small samples, so is permutation test addressing the question of > the accompanying motivating example? In my opinion it is not and I hope I > provided brief but compelling motivation to support this point of view. 
For the problem Josef described, I'd analyze that as a two-sample goodness-of-fit test against a common bin(20,p) distribution. > 2) What are the assumptions under which the permutation test is > valid/acceptable (independently from the accompanying motivating example)? > I have looked around on this topic but I had just found generic desiderata for > all resampling approaches, i.e. that the sample should be "representative" > of the underlying distribution - whatever this means in practical terms. Ronald A. Fisher considered the permutation test to be the "exact procedure" the t-test should approximate. It has, in fact, all the assumptions of the t-test. Surprisingly many think the t-test assume normally distributed data. It does not. If you have this idea too, forget it please. The t-test only asserts that the large-sample "sampling distribution of the mean" (i.e. the mean you calculate, not the data point themselves) is a normal distribution. This is due to the central limit theorem. If you collect enough data, the distribution of the sample mean will converge towards a normal distribution. That is a mathematical necessity, and can be proven to always be the case. But with small data samples, the sampling distribution of the mean can deviate from a normal distribution. That is when we need to use the permutation test instead. I.e.: The t-test is an approximation to the permutation test for "large enough" data samples. What we mean by "large enough" is another story. We can e.g. estimate the sampling distribution of the mean using Efron's bootstrap, and run a goodness-of-fit test. What most practitioners do, though, is to check if their data is approximately normally distributed. That usually signifies a lack of understanding for the t-test. They think the data must be normal. The data do not. But if the data are normally distributed we can be sure the sample mean is normal as well. So under what circumstances are the assumptions for the permutation test not satisfied? One notable example is the Behrens-Fisher problem! That is, you want to compare the expectancy value of two distributions with different variance. The permutation test does not help to solve this problem any more than the t-test does. This is clearly a situation where distributions matter, showing that the permutation test is not a "distribution free" test. Sturla From josef.pktd at gmail.com Fri Oct 12 14:29:30 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 12 Oct 2012 14:29:30 -0400 Subject: [SciPy-User] [SciPy-user] Help with (Cumulative density function) CDF In-Reply-To: <34544205.post@talk.nabble.com> References: <34544205.post@talk.nabble.com> Message-ID: On Thu, Oct 11, 2012 at 4:47 PM, Overtim3 wrote: > > So I'm having a hard time using this function > > cdf(x, a, b, loc=0, scale=1) Cumulative density function. > > I don't fully understand where the a, b come into play > > I'm trying to plot this and make mine look similar to this > http://en.wikipedia.org/w/index.php?title=File:Binomial_distribution_cdf.svg&page=1 > > but cant seem to quite get it. Any help would be appreciated. 
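A rough sketch of one way to get a plot in the style of that figure is below. The three (n, p) pairs are the ones shown in the linked figure; the step/label styling is only illustrative, and it assumes matplotlib is available:

import numpy as np
import matplotlib.pyplot as plt
from scipy import stats

k = np.arange(0, 41)
for n, p in [(20, 0.5), (20, 0.7), (40, 0.5)]:
    # binom.cdf takes the evaluation points first, then the shape
    # parameters n and p
    plt.step(k, stats.binom.cdf(k, n, p), where='post',
             label='n=%d, p=%.1f' % (n, p))
plt.xlabel('k')
plt.ylabel('P(X <= k)')
plt.legend(loc='lower right')
plt.show()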
see print stats.binom.__doc__ x = np.linspace(0, 40, 51) stats.binom.cdf(x, 40, 0.5) should be similar to the graph or 3 in 1: >>> cdf = stats.binom.cdf(x[:,None], [20, 20, 40], [0.5, 0.7, 0.5]) >>> cdf.shape (51, 3) Josef > > > -- > View this message in context: http://old.nabble.com/Help-with-%28Cumulative-density-function%29-CDF-tp34544205p34544205.html > Sent from the Scipy-User mailing list archive at Nabble.com. > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From josef.pktd at gmail.com Fri Oct 12 14:31:51 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 12 Oct 2012 14:31:51 -0400 Subject: [SciPy-User] [SciPy-user] Problems Importing In-Reply-To: <34543713.post@talk.nabble.com> References: <34543713.post@talk.nabble.com> Message-ID: On Thu, Oct 11, 2012 at 3:31 PM, Overtim3 wrote: > > I have already installed scipy and cant seem to get anything to import from > there. Im running windows 7 64 bit. Any ideas? more details! do you have the right python? what are you trying to import? ... Josef > -- > View this message in context: http://old.nabble.com/Problems-Importing-tp34543713p34543713.html > Sent from the Scipy-User mailing list archive at Nabble.com. > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From pwang at streamitive.com Fri Oct 12 12:08:56 2012 From: pwang at streamitive.com (Peter Wang) Date: Fri, 12 Oct 2012 11:08:56 -0500 Subject: [SciPy-User] Reminder: Last day of early registration for PyData NYC conference! Message-ID: Hi everyone, Just a friendly reminder that today is the final day of early registration for the PyData NYC conference later this month! We have a fantastic lineup of talks and workshops on a variety of topics related to Python for data analysis, including topics that are hard to find at other conferences (e.g. practical perspectives on Python and Hadoop, using Python and R, etc.). http://nyc2012.pydata.org/ Use the discount code "numpy" for a 20% discount off of registration! We are also looking for sponsors. We are proud to feature D. E. Shaw, JP Morgan, and Appnexus as gold sponsors. If your company or organization would like some visibility in front of a few hundred Python data hackers, please visit our sponsor information page: http://nyc2012.pydata.org/sponsors/becoming/ Thanks, Peter From eric.moore2 at nih.gov Fri Oct 12 16:11:10 2012 From: eric.moore2 at nih.gov (Moore, Eric (NIH/NIDDK) [F]) Date: Fri, 12 Oct 2012 16:11:10 -0400 Subject: [SciPy-User] SciPy-User Digest, Vol 110, Issue 21 In-Reply-To: <507847A7.1080008@dlr.de> References: <1350038905.57123.YahooMailNeo@web31808.mail.mud.yahoo.com> <507847A7.1080008@dlr.de> Message-ID: > -----Original Message----- > From: "Claas H. K?hler" [mailto:claas.koehler at dlr.de] > Sent: Friday, October 12, 2012 12:39 PM > To: scipy-user at scipy.org > Subject: Re: [SciPy-User] SciPy-User Digest, Vol 110, Issue 21 > > > > On 12/10/12 12:48, The Helmbolds wrote: > >> On 09/10/12 19:12, Pauli Virtanen wrote: > >>> 09.10.2012 19:28, "Claas H. K?hler" kirjoitti: > >>>> I have a question regarding the error function > scipy.special.erf: > >>>> > >>>> Is it intended, that the erf of an imaginary argument yields a > >> non-vanishing real-part? > >>>> > >>>> I get e.g. 
> >>>> erf(1j)= 1.6504257587975431j > >>>> erf(5j)= (1+8298273879.8992386j) > >>>> > >>>> The first result is what I would expect in accordance with > Wolfram > >> alpha. The second result, however, > >>>> has a real part of unity. As far as I know, the real part of erf > should > >> always vanish for purely > >>>> imaginary numbers. > >>>> > >>>> Any support would be appreciated. > >>> > >>> The reason here is that the ye olde complex erf Fortran > implementation > >>> that Scipy has uses the asymptotic expansion (Abramowitz & Stegun > >>> 7.1.23) to compute large-argument values. The asymptotic series > is for > >>> erfc, and one always gets Re erf = 1 along the imaginary axis. > >>> > >>> Of course, this is somewhat naive. While it does produce > reasonable > >>> relative accuracy as a complex number, the accuracy of the real > and > >>> imaginary parts separately is not necessarily OK near the > imaginary axis. > >>> > >>> The issue with Scipy here is twofold -- first, there are no > better > >>> existing special function libraries we could use, or at least I'm > not > >>> aware of them. Second, writing these from scratch takes time and > >>> expertise and nobody has so far volunteered to do any work in > this > >>> direction. > >>> > >> Thanks for the quick response! > >> > >> The bottom line is that erf is actually not (correctly) implemented > for complex > >> arguments, if I > >> understand you correctly. > >> > >> I suspect there are good reasons to provide a function which is > known to yield > >> incorrect results, so > >> that throwing a type error is not an option? (This is what erfc does > on my > >> machine) > >> > >> However, adding a warning when called with complex arguments could > be helpful to > >> prevent naiive use > >> as in my case. Adding this important piece of information to the > docs would not > >> harm either, from my > >> point of view. > >> > >> In any case, thanks for the quick support. > >> > >> Regards > >> Claas > > > > On my system, I get the correct answers if I'm careful about the call > to erf. > > If I call erf with a single real value, I get the ordinary (not the > complex) error function value. > > If I call erf with a NumPy array or a Python sequence, I get the > complex er > ror function returned. > > I do not think SciPy's erf is supposed to be called with a complex > number. > According to the docs it is. Otherwise I would expect to see a domain > error, similar to erfc. > > Regards > Claas > > > > > > For example: > >>>> special.erf(1j) > > 1.6504257587975431j # Wrong answer! This is the right answer for erf(1j). And the behavior you detail below is exactly how ufuncs work. You get a scalar back if you provide one, and get an array back if you provide a sequence. Eric. > >>>> special.erf((0,1)) > > array([ 0. , 0.84270079]) # Right answer. > > > > Two more examples: > >>>> for y in range(-10, 11): > > temp = special.erf((0,y)) > > print y, temp # Calling with a > sequence, returns a NumPy array > > > > -10 [ 0. -1.] > > -9 [ 0. -1.] > > -8 [ 0. -1.] > > -7 [ 0. -1.] > > -6 [ 0. -1.] > > -5 [ 0. -1.] > > -4 [ 0. -0.99999998] > > -3 [ 0. -0.99997791] > > -2 [ 0. -0.99532227] > > -1 [ 0. -0.84270079] > > 0 [ 0. 0.] > > 1 [ 0. 0.84270079] > > 2 [ 0. 0.99532227] > > 3 [ 0. 0.99997791] > > 4 [ 0. 0.99999998] > > 5 [ 0. 1.] > > 6 [ 0. 1.] > > 7 [ 0. 1.] > > 8 [ 0. 1.] > > 9 [ 0. 1.] > > 10 [ 0. 1.] 
> > > > > > OTOH----------------------------------------------------------------- > --------------------------- > >>>> for y in range(-10, 11): > > temp = special.erf(y) > > print y, temp # Calling with a (scalar) > real value returns a (scalar) real value. > > > > -10 -1.0 > > -9 -1.0 > > -8 -1.0 > > -7 -1.0 > > -6 -1.0 > > -5 -0.999999999998 > > -4 -0.999999984583 > > -3 -0.999977909503 > > -2 -0.995322265019 > > -1 -0.84270079295 > > 0 0.0 > > 1 0.84270079295 > > 2 0.995322265019 > > 3 0.999977909503 > > 4 0.999999984583 > > 5 0.999999999998 > > 6 1.0 > > 7 1.0 > > 8 1.0 > > 9 1.0 > > 10 1.0 > > > > Bob and Paula H > > _______________________________________________ > > SciPy-User mailing list > > SciPy-User at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-user > > > > -- > Deutsches Zentrum f?r Luft- und Raumfahrt e.V. (DLR) > Institut f?r Methodik der Fernerkundung | Experimentelle Verfahren | > M?nchner Str | 82234 We?ling > > Claas H. K?hler > Telefon 08153 28-1274 | Telefax 08153 28-1337 | claas.koehler at dlr.de > > www.DLR.de/EOC > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From emanuele at relativita.com Fri Oct 12 11:27:14 2012 From: emanuele at relativita.com (Emanuele Olivetti) Date: Fri, 12 Oct 2012 17:27:14 +0200 Subject: [SciPy-User] "small data" statistics In-Reply-To: References: <5077D67C.8020108@relativita.com> Message-ID: <507836D2.4050800@relativita.com> On 10/12/2012 01:22 PM, Nathaniel Smith wrote: > > On 12 Oct 2012 09:37, "Emanuele Olivetti" > wrote: > > > IMHO the question in the example at that URL, i.e. "Did the instructions > > given to the participants significantly affect their level of recall?" is > > not directly addressed by the permutation test. > > In this sentence, the word "significantly" is a term of art used to refer exactly to the > quantity p(t>T(data)|H_0). So, yes, the permutation test addresses the original > question; you just have to be familiar with the field's particular jargon to understand > what they're saying. :-) > Thanks Nathaniel for pointing that out. I guess I'll hardly be much familiar with such a jargon ;-). Nevertheless while reading the example I believed that the aim of the thought experiment was to decide among two competing theories/hypothesis, given the results of the experiment. But I share your point that the term "significant" turns it into a different question. > All tests require some kind of representativeness, and this isn't really a problem. The > data are by definition representative (in the technical sense) of the distribution they > were drawn from. (The trouble comes when you want to decide whether that distribution > matches anything you care about, but looking at the data won't tell you that.) A well > designed test is one that is correct on average across samples. > Indeed my wording was imprecise so thanks once more for correcting it. Moreover you put it really well: "The trouble comes when you want to decide whether that distribution matches anything you care about, but looking at the data won't tell you that". Could you tell more about evaluating the correctness of a test across different samples? It sounds interesting. > The alternative to a permutation test here is to make very strong assumptions about the > underlying distributions (e.g. with a t test), and these assumptions are often justified > only for large samples. 
And, resampling tests are computationally expensive, but this > is no problem for small samples. So that's why non parametrics are often better in this > setting. > > I agree with you that strong assumptions about the underlying distributions, e.g. parametric modeling, may raise big practical concerns. The only pro is that at least you know the assumptions explicitly. Best, Emanuele -------------- next part -------------- An HTML attachment was scrubbed... URL: From helmrp at yahoo.com Fri Oct 12 20:11:50 2012 From: helmrp at yahoo.com (The Helmbolds) Date: Fri, 12 Oct 2012 17:11:50 -0700 (PDT) Subject: [SciPy-User] SciPy erf function In-Reply-To: References: Message-ID: <1350087110.74357.YahooMailNeo@web31803.mail.mud.yahoo.com> > On 12/10/12 12:48, The Helmbolds wrote: >>> On 09/10/12 19:12, Pauli Virtanen wrote: >>>> ? 09.10.2012 19:28, "Claas H. K?hler" kirjoitti: >>>>> ? I have a question regarding the error function > scipy.special.erf: >>>>> >>>>> ? Is it intended, that the erf of an imaginary argument yields > a >>> non-vanishing real-part? >>>>> >>>>> ? I get e.g. >>>>> ? erf(1j)= 1.6504257587975431j >>>>> ? erf(5j)= (1+8298273879.8992386j) >> On my system, I get the correct answers if I'm careful about the call > to erf. >> If I call erf with a single real value, I get the ordinary (not the > complex) error function value. >> If I call erf with a NumPy array or a Python sequence, I get the complex er > ror function returned. >> I do not think SciPy's erf is supposed to be called with a complex > number. > According to the docs it is. Otherwise I would expect to see a domain error, > similar to erfc. > > Regards > Claas OK. Maybe we can agree on the following: ????1. The documentation is wrong. ????2. Supplying bogus argument should trigger a warning and a refusal to use it. IMHO, these are both bugs. And AFAIK they have nothing to do with the formulas in either the "Handbook of Mathematical Functions" or its descendant, the "Digital Library of Mathematical Functions". From eric.bruning at gmail.com Fri Oct 12 20:16:41 2012 From: eric.bruning at gmail.com (Eric Bruning) Date: Fri, 12 Oct 2012 19:16:41 -0500 Subject: [SciPy-User] Volume of convex hulls from Delaunay triangulation Message-ID: Are there any known edge or degenerate cases with the simplex volume calculation in scipy.spatial's test_qhull.py (1)? Applying this method to my dataset, some of the volumes are negative and some are positive, which might just be the 3D analogue of area with a different surface normal. The goal is to get the total volume of the convex hull, which I assume I can just do by summing the absolute values of the individual simplex volumes. (1) https://github.com/scipy/scipy/blob/master/scipy/spatial/tests/test_qhull.py#L106 Thanks, Eric From njs at pobox.com Sat Oct 13 05:43:54 2012 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 13 Oct 2012 10:43:54 +0100 Subject: [SciPy-User] "small data" statistics In-Reply-To: <507836D2.4050800@relativita.com> References: <5077D67C.8020108@relativita.com> <507836D2.4050800@relativita.com> Message-ID: On Fri, Oct 12, 2012 at 4:27 PM, Emanuele Olivetti wrote: > On 10/12/2012 01:22 PM, Nathaniel Smith wrote: > > On 12 Oct 2012 09:37, "Emanuele Olivetti" wrote: > >> IMHO the question in the example at that URL, i.e. "Did the instructions >> given to the participants significantly affect their level of recall?" is >> not directly addressed by the permutation test. 
> > In this sentence, the word "significantly" is a term of art used to refer > exactly to the quantity p(t>T(data)|H_0). So, yes, the permutation test > addresses the original question; you just have to be familiar with the > field's particular jargon to understand what they're saying. :-) > > > Thanks Nathaniel for pointing that out. I guess I'll hardly be much familiar > with > such a jargon ;-). Nevertheless while reading the example I believed > that the aim of the thought experiment was to decide among two competing > theories/hypothesis, given the results of the experiment. Well, it is, at some level. But in practice psychologists are not simple Bayesian updaters, and in the context of their field's practices, the way you make these decisions involves Neyman-Pearson significance tests as one component. Of course one can debate whether that is a good thing or not (I actually tend to fall on the side that says it *is* a good thing), but that's getting pretty far afield of Josef's question :-). > But I share your point that the term "significant" turns it into a different > question. > > > All tests require some kind of representativeness, and this isn't really a > problem. The data are by definition representative (in the technical sense) > of the distribution they were drawn from. (The trouble comes when you want > to decide whether that distribution matches anything you care about, but > looking at the data won't tell you that.) A well designed test is one that > is correct on average across samples. > > > Indeed my wording was imprecise so thanks once more for correcting > it. Moreover you put it really well: "The trouble comes when you want to > > decide whether that distribution matches anything you care about, but > looking at the data won't tell you that". > Could you tell more about evaluating the correctness of a test across > different samples? It sounds interesting. Well, it's a relatively simple point, actually. The definition of a good frequentist significance test is a function f(data) which returns a p-value, and this p-value satisfies two rules: 1) When 'data' is sampled from the null hypothesis distribution, then f(data) is uniformly distributed between 0 and 1. 2) When 'data' is sampled from an alternative distribution of interest, then f(data) will have a distribution that is peaked near 0. So the point is just that you can't tell whether a given function f(data) is well-behaved or not by looking at a single value for 'data', since the requirements for being well-behaved talk only about the distribution of f(data) given a distribution for 'data'. -n From josef.pktd at gmail.com Sat Oct 13 09:07:47 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 13 Oct 2012 09:07:47 -0400 Subject: [SciPy-User] "small data" statistics In-Reply-To: References: Message-ID: On Thu, Oct 11, 2012 at 10:57 AM, wrote: > Most statistical tests and statistical inference in scipy.stats and > statsmodels relies on large number assumptions. > > Everyone is talking about "Big data", but is anyone still interested > in doing small sample statistics in python. > > I'd like to know whether it's worth spending any time on general > purpose small sample statistics. > > for example: > > http://facultyweb.berry.edu/vbissonnette/statshw/doc/perm_2bs.html > > ``` > Example homework problem: > Twenty participants were given a list of 20 words to process. The 20 > participants were randomly assigned to one of two treatment > conditions. 
Half were instructed to count the number of vowels in each > word (shallow processing). Half were instructed to judge whether the > object described by each word would be useful if one were stranded on > a desert island (deep processing). After a brief distractor task, all > subjects were given a surprise free recall task. The number of words > correctly recalled was recorded for each subject. Here are the data: > > Shallow Processing: 13 12 11 9 11 13 14 14 14 15 > Deep Processing: 12 15 14 14 13 12 15 14 16 17 > ``` example: R package coin http://cran.r-project.org/web/packages/coin/vignettes/coin.pdf found again while digging for an error in p-values in stats.wilcoxon in the presence of ties https://github.com/scipy/scipy/pull/338 and enhancements for it. Josef > Josef From helmrp at yahoo.com Sat Oct 13 11:59:39 2012 From: helmrp at yahoo.com (The Helmbolds) Date: Sat, 13 Oct 2012 08:59:39 -0700 (PDT) Subject: [SciPy-User] Complex error function In-Reply-To: References: Message-ID: <1350143979.17246.YahooMailNeo@web31816.mail.mud.yahoo.com> To all who contributed to the recent discussions, ????Thanks for setting me straight! ? This happens often enough I need the acronymn: ????URRIAWA !! ? (U R Right !?? I Am Wrong -- Again !!) Bob H From pav at iki.fi Sun Oct 14 09:58:52 2012 From: pav at iki.fi (Pauli Virtanen) Date: Sun, 14 Oct 2012 13:58:52 +0000 (UTC) Subject: [SciPy-User] Volume of convex hulls from Delaunay triangulation References: Message-ID: Eric Bruning gmail.com> writes: > Are there any known edge or degenerate cases with the simplex volume > calculation in scipy.spatial's test_qhull.py (1)? Applying this method > to my dataset, some of the volumes are negative and some are positive, > which might just be the 3D analogue of area with a different surface > normal. The formula needs to be divided by ndim! to get the volume, cf., http://en.wikipedia.org/wiki/Simplex#Geometric_properties The volume is indeed oriented, and abs() gives the actual volume. I don't see any significant numerical caveats in the intended volume calculation using this approach. -- Pauli Virtanen From samuelandjw at gmail.com Sun Oct 14 12:03:57 2012 From: samuelandjw at gmail.com (Degang Wu) Date: Mon, 15 Oct 2012 00:03:57 +0800 Subject: [SciPy-User] empirical CDF Message-ID: Hi, Is Scipy able to calculate empirical CDF (calculating a CDF from a sequence of random samples)? I have searched the documentation for quite a while, but have found nothing useful. From kevin.gullikson.signup at gmail.com Sun Oct 14 12:12:13 2012 From: kevin.gullikson.signup at gmail.com (Kevin Gullikson) Date: Sun, 14 Oct 2012 11:12:13 -0500 Subject: [SciPy-User] empirical CDF In-Reply-To: References: Message-ID: Well, it can make a histogram (numpy.histogram), and then you could just sum the bins to make a cdf. On Sun, Oct 14, 2012 at 11:03 AM, Degang Wu wrote: > Hi, > > Is Scipy able to calculate empirical CDF (calculating a CDF from a > sequence of random samples)? I have searched the documentation for quite a > while, but have found nothing useful. > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From josef.pktd at gmail.com Sun Oct 14 12:25:46 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sun, 14 Oct 2012 12:25:46 -0400 Subject: [SciPy-User] empirical CDF In-Reply-To: References: Message-ID: On Sun, Oct 14, 2012 at 12:12 PM, Kevin Gullikson wrote: > Well, it can make a histogram (numpy.histogram), and then you could just sum > the bins to make a cdf. > > > On Sun, Oct 14, 2012 at 11:03 AM, Degang Wu wrote: >> >> Hi, >> >> Is Scipy able to calculate empirical CDF (calculating a CDF from a >> sequence of random samples)? I have searched the documentation for quite a >> while, but have found nothing useful. depends on what you want to do with it in the simplest case it's just sorting and (np.arange(len(data)) + 1) / (len(data) + 1) or similar scipy.stats.mstats has plotting positions statsmodels has a class for it https://github.com/statsmodels/statsmodels/blob/master/statsmodels/distributions/empirical_distribution.py#L108 which is not in the docs. but for example qqplot, Probability plots code it directly http://statsmodels.sourceforge.net/devel/_modules/statsmodels/graphics/gofplots.html#ProbPlot Josef >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From jsseabold at gmail.com Sun Oct 14 12:29:03 2012 From: jsseabold at gmail.com (Skipper Seabold) Date: Sun, 14 Oct 2012 12:29:03 -0400 Subject: [SciPy-User] empirical CDF In-Reply-To: References: Message-ID: On Sun, Oct 14, 2012 at 12:03 PM, Degang Wu wrote: > Hi, > > Is Scipy able to calculate empirical CDF (calculating a CDF from a sequence of random samples)? I have searched the documentation for quite a while, but have found nothing useful. We have an empirical distribution class in statsmodels. http://statsmodels.sourceforge.net/ The sm.nonparametric.KDE class also has the ability to return a CDF for a fitted density estimator. If you're feeling ambitious and want to make a pull request the ECDF needs a little clean-up. The ECDF class could use a plot method that incorporates the private _conf_set, and there is finished code to use interpolation instead of the step function but it's not available in the API yet. import urllib from statsmodels.distributions import ECDF from statsmodels.distributions.empirical_distribution import _conf_set import matplotlib.pyplot as plt print ECDF.__doc__ nerve_data = urllib.urlopen('http://www.statsci.org/data/general/nerve.txt') nerve_data = np.loadtxt(nerve_data) x = nerve_data / 50. # was in 1/50 seconds cdf = ECDF(x) x.sort() F = cdf(x) fig, ax = plt.subplots() ax.step(x, F) lower, upper = _conf_set(F) ax.step(x, lower, 'r') ax.step(x, upper, 'r') ax.set_xlim(0, 1.5) ax.set_ylim(0, 1.05) ax.vlines(x, 0, .05) plt.show() Skipper From millman at berkeley.edu Mon Oct 15 00:53:40 2012 From: millman at berkeley.edu (Jarrod Millman) Date: Sun, 14 Oct 2012 21:53:40 -0700 Subject: [SciPy-User] [ANN] CFP: SciPy India 2012 -- Dec 27-29 -- IIT Bombay Message-ID: Hello, The CFP for SciPy India 2012, to be held in IIT Bombay from December 27-29 is open. Please spread the word! Scipy.in is a conference providing opportunities to spread the use of the Python programming language in the Scientific Computing community in India. 
It provides a unique opportunity to interact with the "Who's who" of the Python for Scientific Computing fraternity and learn, understand, participate, and contribute to Scientific Computing using Python. Attendees of the conference and participants of the sprints planned will be able to access and review the tools available. They will also be able to learn domain-specific applications and how the tools apply to a plethora of application problems. One of the goals of the conference is to combine education, engineering, and science with computing through the medium of Python. This conference also aims to spread the use of Python for Scientific Computing in various fields and among different communities. Call for Papers ================ We look forward to your submissions on the use of Python for Scientific Computing and Education. This includes pedagogy, exploration, modeling and analysis from both applied and developmental perspectives. We welcome contributions from academia as well as industry. Submission of Papers ===================== If you wish to present your paper using this platform, please submit an abstract of 300 to 700 words describing the topic, including its relevance to scientific computing. Based on the number and quality of the submissions, the conference organizers will allot 10 - 30 minutes for each accepted talk. In addition to these talks, there will be an open session of lightning talks, during which any attendee who wishes to talk on a pertinent topic is invited to do a presentation not exceeding five minutes in duration. If you wish to present a talk at the conference, please follow the guidelines below. Submission Guidelines ====================== - Submit your proposals at scipy at fossee.in - Submissions whose main purpose is to promote a commercial product or service will be refused. - All accepted proposals must be presented at the SciPy conference by at least one author. Important Dates ================ - Call for proposals start: 27th September 2012, Thursday - Call for proposals end: 1st November 2012, Thursday - List of accepted proposals will be published: 19th November 2012, Monday - Submission of first presentation: 10th December 2012, Monday - Submission of final presentation(with final changes): 20th December 2012, Thursday From josef.pktd at gmail.com Mon Oct 15 22:29:23 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 15 Oct 2012 22:29:23 -0400 Subject: [SciPy-User] distributions: negative scale Message-ID: https://github.com/scipy/scipy/pull/336 I'm trying to figure out what's the difference between gamma and Pearson Type III distribution. In my quick reading there is no difference in the general form. However, it looks like we should allow for a negative scale. Why doesn't scipy.stats.distributions allow for a negative scale? scaling is just a linear transformation, and it's the same for the pdf whether the scale factor is positive or negative. oops, for the cdf the integration limits are reversed, .... ok, that requires work, so we better postpone this indefinitely. 
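A minimal numerical illustration of the reversed-limits point (plain sampling with the existing gamma methods; nothing here is proposed scipy.stats API, and the numbers are made up): for Y = loc + scale*X with scale < 0, P(Y <= y) equals the survival function of X at (y - loc)/scale.

```python
import numpy as np
from scipy import stats

np.random.seed(0)
loc, scale, a = 0.0, -2.0, 3.0            # negative scale mirrors the distribution
x = stats.gamma.rvs(a, size=200000)       # positive-scale gamma sample
y = loc + scale * x                       # "negative scale" variate

point = -3.5
empirical = (y <= point).mean()
analytic = stats.gamma.sf((point - loc) / scale, a)
print(empirical, analytic)                # agree up to sampling noise
```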
Josef From dineshbvadhia at hotmail.com Tue Oct 16 04:17:07 2012 From: dineshbvadhia at hotmail.com (Dinesh B Vadhia) Date: Tue, 16 Oct 2012 01:17:07 -0700 Subject: [SciPy-User] Memory required for sparse calculation Message-ID: Nathan Bell answered this question a long time ago but I cannot find the archive reference to it: in the Scipy matrix calculation y <- Ax, what is the memory required for the calculation to take place in addition to A (ignore memory requirements for y and x)? -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Tue Oct 16 12:25:33 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 16 Oct 2012 12:25:33 -0400 Subject: [SciPy-User] a TOST Message-ID: https://gist.github.com/3900314 label: statistical tests and options that are missing in scipy and statsmodels. Josef From ralf.gommers at gmail.com Tue Oct 16 12:31:59 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 16 Oct 2012 18:31:59 +0200 Subject: [SciPy-User] a TOST In-Reply-To: References: Message-ID: On Tue, Oct 16, 2012 at 6:25 PM, wrote: > https://gist.github.com/3900314 > > label: statistical tests and options that are missing in scipy and > statsmodels. > Is this a "how many incomprehensibly named t-test functions can we create" exercise? (only half kidding) Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Tue Oct 16 12:46:49 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 16 Oct 2012 12:46:49 -0400 Subject: [SciPy-User] a TOST In-Reply-To: References: Message-ID: On Tue, Oct 16, 2012 at 12:31 PM, Ralf Gommers wrote: > > > On Tue, Oct 16, 2012 at 6:25 PM, wrote: >> >> https://gist.github.com/3900314 >> >> label: statistical tests and options that are missing in scipy and >> statsmodels. > > > Is this a "how many incomprehensibly named t-test functions can we create" > exercise? > > (only half kidding) TOST is a long established name in statistics exercise: look at how much SAS TTEST can do http://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_ttest_a0000000128.htm and ours are ... (mainly a contribution to: statistically significant difference versus is there an "important" difference) Josef > > Ralf > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From patrickmarshwx at gmail.com Tue Oct 16 14:02:45 2012 From: patrickmarshwx at gmail.com (Patrick Marsh) Date: Tue, 16 Oct 2012 13:02:45 -0500 Subject: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) Message-ID: Greetings, I know that people on this list are way smarter than I, so hopefully someone can help me out here. I have a gridded dataset of 1s and 0s with which I'm needing to apply a rotated, anisotropic Gaussian filter to achieve a kernel density estimate. Currently I have some Cython code that I wrote to do this. The code for this can be found here: https://gist.github.com/3900591. In essence, I search through the grid for a value of 1. When a 1 is found, a weighting function is applied to all the surrounding grid points. Their are rotation matrices at work throughout the code to handle the fact the axes of the anisotropic Gaussian kernel can be off-cartesian axes. The code works, and is reasonably efficient for small domains or large grid spacing. 
However, as grid spacing decreases, the performance takes a substantial hit. Recently I started playing around with Scipy's ndimage. The Gaussian filter routines are exactly what I've been looking for -- as long as my sigma values lie along the cartesian axes. I attempted to rectify this situation by first rotating the underyling data grid so that it lined up with the cartesian grid. (In other words, if the anisotropic Gaussian rotated angle was 45 degrees, I rotated the underlying data by 45 degrees so they aligned.) I then applied the Gaussian filters and finally rotated the data grid back to the original position. This works perfectly?sometimes. The problem with this approach arrises from the rotating of the data grid. Since I can't really afford to lose the sharpness of my underlying data (they are actual observations and need to remain 1s and 0s), I chose to use a spline of order 0. This works some of the time, but, unfortunatley, this does not always conserve the sum of the total number of points. For example: import numpy as np from scipy import ndimage dist = 21 midpoint = np.floor(dist/2) hist = np.zeros((dist, dist), dtype=float) hist[midpoint, midpoint] = 1 hist2 = ndimage.rotate(hist.copy(), 15, order=0, reshape=True, prefilter=False) print(hist.sum(), hist2.sum()) >> 1.0, 0.0 results in hist2 being all 0s, whereas hist has a sum of 1. So, here are my questions: 1. Is there a way to do an image rotation such that grid points aren't lost when using a spline of order 0? 2. Is there an alternative way to achieve what I'm trying to do? 3. Is there a way to speed up the Cython code linked above? Thanks for letting me pick your brain? Patrick --- Patrick Marsh Ph.D. Candidate / Liaison to the HWT School of Meteorology / University of Oklahoma Cooperative Institute for Mesoscale Meteorological Studies National Severe Storms Laboratory http://www.patricktmarsh.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From gokhansever at gmail.com Tue Oct 16 14:07:37 2012 From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_Sever?=) Date: Tue, 16 Oct 2012 12:07:37 -0600 Subject: [SciPy-User] Return sigmas from curve_fit Message-ID: Hello, Is there a way to return standard deviations of the best fit parameters from curve_fit like in IDL's curvefit function? PS: http://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.curve_fit.html http://www.exelisvis.com/docs/CURVEFIT.html -- G?khan -------------- next part -------------- An HTML attachment was scrubbed... URL: From travis at continuum.io Tue Oct 16 14:11:20 2012 From: travis at continuum.io (Travis Oliphant) Date: Tue, 16 Oct 2012 13:11:20 -0500 Subject: [SciPy-User] Return sigmas from curve_fit In-Reply-To: References: Message-ID: <750484B4-1034-4C35-8E58-1F9F3F5CF5BE@continuum.io> If I understand your question, then taking the square root of the diagonal elements of the second return from curve_fit. np.sqrt(pcov.diagonal()) Should give you the standard deviations... -Travis On Oct 16, 2012, at 1:07 PM, G?khan Sever wrote: > Hello, > > Is there a way to return standard deviations of the best fit parameters from curve_fit like in IDL's curvefit function? 
> > > PS: > http://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.curve_fit.html > http://www.exelisvis.com/docs/CURVEFIT.html > > -- > G?khan > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Tue Oct 16 14:24:57 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 16 Oct 2012 20:24:57 +0200 Subject: [SciPy-User] a TOST In-Reply-To: References: Message-ID: On Tue, Oct 16, 2012 at 6:46 PM, wrote: > On Tue, Oct 16, 2012 at 12:31 PM, Ralf Gommers > wrote: > > > > > > On Tue, Oct 16, 2012 at 6:25 PM, wrote: > >> > >> https://gist.github.com/3900314 > >> > >> label: statistical tests and options that are missing in scipy and > >> statsmodels. > > > > > > Is this a "how many incomprehensibly named t-test functions can we > create" > > exercise? > > > > (only half kidding) > > TOST is a long established name in statistics > > exercise: look at how much SAS TTEST can do > > http://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_ttest_a0000000128.htm > and ours are ... > It can do more than what's in scipy.stats, I know. There's one function TTEST in SAS though, with some keywords to control behavior. I'm all for adding more useful functionality, but against more small functions with names that will be meaningless for the vast majority of users. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From gokhansever at gmail.com Tue Oct 16 15:32:47 2012 From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_Sever?=) Date: Tue, 16 Oct 2012 13:32:47 -0600 Subject: [SciPy-User] Return sigmas from curve_fit In-Reply-To: <750484B4-1034-4C35-8E58-1F9F3F5CF5BE@continuum.io> References: <750484B4-1034-4C35-8E58-1F9F3F5CF5BE@continuum.io> Message-ID: Thanks Travis. That doesn't indeed, I missed the part that part of the curve_fit return was variance of the estimate(s). I am comparing IDL's curvefit and Scipy's curve_fit, and got slightly different results for the same data using the same fit function. I guess IDL's result is slightly wrong when the default tol value is used (The default value is 1.0 x 10-3.) comparing to the SciPy's default ftol of 1.49012e-08. On Tue, Oct 16, 2012 at 12:11 PM, Travis Oliphant wrote: > If I understand your question, then taking the square root of the diagonal > elements of the second return from curve_fit. > > np.sqrt(pcov.diagonal()) > > Should give you the standard deviations... > > -Travis > > On Oct 16, 2012, at 1:07 PM, G?khan Sever wrote: > > Hello, > > Is there a way to return standard deviations of the best fit parameters > from curve_fit like in IDL's curvefit function? > > > PS: > > http://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.curve_fit.html > http://www.exelisvis.com/docs/CURVEFIT.html > > -- > G?khan > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > -- G?khan -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From josef.pktd at gmail.com Tue Oct 16 15:38:43 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 16 Oct 2012 15:38:43 -0400 Subject: [SciPy-User] a TOST In-Reply-To: References: Message-ID: On Tue, Oct 16, 2012 at 2:24 PM, Ralf Gommers wrote: > > > On Tue, Oct 16, 2012 at 6:46 PM, wrote: >> >> On Tue, Oct 16, 2012 at 12:31 PM, Ralf Gommers >> wrote: >> > >> > >> > On Tue, Oct 16, 2012 at 6:25 PM, wrote: >> >> >> >> https://gist.github.com/3900314 >> >> >> >> label: statistical tests and options that are missing in scipy and >> >> statsmodels. >> > >> > >> > Is this a "how many incomprehensibly named t-test functions can we >> > create" >> > exercise? >> > >> > (only half kidding) >> >> TOST is a long established name in statistics >> >> exercise: look at how much SAS TTEST can do >> >> http://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_ttest_a0000000128.htm >> and ours are ... > > > It can do more than what's in scipy.stats, I know. There's one function > TTEST in SAS though, with some keywords to control behavior. I'm all for > adding more useful functionality, but against more small functions with > names that will be meaningless for the vast majority of users. SAS is using big Procedures for most things. Unless a user looks for something known, most of these names are completely uninformative bartlett, levene, mood, spearmanr, mannwhitneyu, ... (I never heard of most of them, before working my way through it.) example http://rgm2.lab.nig.ac.jp/RGM2/functions.php?show=all&query=package:lmtest I tried to put a descriptive prefix on some of the tests http://statsmodels.sourceforge.net/devel/stats.html#residual-diagnostics-and-specification-tests I haven't figured out a good pattern yet for combining tests in the direction of the SAS style. the name for TOST should be ``equivalence_test`` or something similar. The main helpful information are pages about when to use which tests. my specific page http://statsmodels.sourceforge.net/devel/diagnostic.html#diagnostics for general 1 sample, 2 sample and k sample tests overview tables are available on the internet. Josef > > Ralf > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From josef.pktd at gmail.com Tue Oct 16 15:45:01 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 16 Oct 2012 15:45:01 -0400 Subject: [SciPy-User] Return sigmas from curve_fit In-Reply-To: References: <750484B4-1034-4C35-8E58-1F9F3F5CF5BE@continuum.io> Message-ID: On Tue, Oct 16, 2012 at 3:32 PM, G?khan Sever wrote: > Thanks Travis. > > That doesn't indeed, I missed the part that part of the curve_fit return was > variance of the estimate(s). > > I am comparing IDL's curvefit and Scipy's curve_fit, and got slightly > different results for the same data using the same fit function. I guess > IDL's result is slightly wrong when the default tol value is used (The > default value is 1.0 x 10-3.) comparing to the SciPy's default ftol of > 1.49012e-08. For the standard errors of the parameter estimates it is also possible that the numerical derivatives in curve_fit/leastsq don't have very high precision. I think we have seen cases like that, but don't remember any details. Josef > > > On Tue, Oct 16, 2012 at 12:11 PM, Travis Oliphant > wrote: >> >> If I understand your question, then taking the square root of the diagonal >> elements of the second return from curve_fit. 
>> >> np.sqrt(pcov.diagonal()) >> >> Should give you the standard deviations... >> >> -Travis >> >> On Oct 16, 2012, at 1:07 PM, G?khan Sever wrote: >> >> Hello, >> >> Is there a way to return standard deviations of the best fit parameters >> from curve_fit like in IDL's curvefit function? >> >> >> PS: >> >> http://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.curve_fit.html >> http://www.exelisvis.com/docs/CURVEFIT.html >> >> -- >> G?khan >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> >> >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > > > > -- > G?khan > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From cweisiger at msg.ucsf.edu Tue Oct 16 16:04:24 2012 From: cweisiger at msg.ucsf.edu (Chris Weisiger) Date: Tue, 16 Oct 2012 13:04:24 -0700 Subject: [SciPy-User] numpy.histogram is slow Message-ID: My use case is displaying camera image data to the user as it is streamed to us; this includes a histogram showing the distribution of intensities in the image. Thus I have a 512x512 array of pixel data (unsigned 16-bit ints) that I need to generate a histogram for. Unfortunately, numpy.histogram takes a significant amount of time -- about 15ms per call. That's over 60% of the cost of showing an image to the user, which means that I can't quite display data as quickly as it comes in. So I'm looking for some faster option. My searches turned up numpy.bincount, which is nice and zippy, but unfortunately omits bins where the total count is 0. This makes sense considering that otherwise it would always generate a length-N array where N is the maximum value in the input, but it doesn't work for my purposes. Are there any better options? -Chris From zachary.pincus at yale.edu Tue Oct 16 16:12:02 2012 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Tue, 16 Oct 2012 16:12:02 -0400 Subject: [SciPy-User] numpy.histogram is slow In-Reply-To: References: Message-ID: On Oct 16, 2012, at 4:04 PM, Chris Weisiger wrote: > My use case is displaying camera image data to the user as it is > streamed to us; this includes a histogram showing the distribution of > intensities in the image. Thus I have a 512x512 array of pixel data > (unsigned 16-bit ints) that I need to generate a histogram for. > Unfortunately, numpy.histogram takes a significant amount of time -- > about 15ms per call. That's over 60% of the cost of showing an image > to the user, which means that I can't quite display data as quickly as > it comes in. So I'm looking for some faster option. > > My searches turned up numpy.bincount, which is nice and zippy, but > unfortunately omits bins where the total count is 0. This makes sense > considering that otherwise it would always generate a length-N array > where N is the maximum value in the input, but it doesn't work for my > purposes. Are there any better options? Uh, no? Bincount doesn't omit bins below the maximum value in the input, even if the count is zero: In [205]: numpy.bincount([5,5,10]) Out[205]: array([0, 0, 0, 0, 0, 2, 0, 0, 0, 0, 1]) Perhaps you mean that bincount omits bins above the maximum value in the input, but below the maximum *possible* value of the input? That's what the minlength parameter was added for in numpy 1.6. 
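For the 512x512 uint16 frames described at the start of this thread, a minimal sketch of the minlength approach (the frame below is a random stand-in, not real camera data):

```python
import numpy as np

frame = np.random.randint(0, 4096, size=(512, 512)).astype(np.uint16)

# one bin per possible 16-bit intensity; unused intensities stay at zero
counts = np.bincount(frame.ravel(), minlength=65536)

assert counts.shape == (65536,)
assert counts.sum() == frame.size
```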
So if you don't have this version, either upgrade, or manually zero-pad the bincount output: bins = numpy.bincount([5,5,10]) padded = numpy.zeros(32, dtype=numpy.uint8) padded[:len(bins)] = bins That should be pretty quick. Zach From cweisiger at msg.ucsf.edu Tue Oct 16 16:35:20 2012 From: cweisiger at msg.ucsf.edu (Chris Weisiger) Date: Tue, 16 Oct 2012 13:35:20 -0700 Subject: [SciPy-User] numpy.histogram is slow In-Reply-To: References: Message-ID: On Tue, Oct 16, 2012 at 1:12 PM, Zachary Pincus wrote: > > Uh, no? Bincount doesn't omit bins below the maximum value in the input, even if the count is zero: > In [205]: numpy.bincount([5,5,10]) > Out[205]: array([0, 0, 0, 0, 0, 2, 0, 0, 0, 0, 1]) Apologies, my test code was misleading me. You're right, and bincount ought to be able to do what I want it to do. Sorry for wasting everyone's time, and thanks for the correction. -Chris From davidmenhur at gmail.com Tue Oct 16 16:39:51 2012 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Tue, 16 Oct 2012 22:39:51 +0200 Subject: [SciPy-User] Return sigmas from curve_fit In-Reply-To: References: <750484B4-1034-4C35-8E58-1F9F3F5CF5BE@continuum.io> Message-ID: On Tue, Oct 16, 2012 at 9:32 PM, G?khan Sever wrote: > I am comparing IDL's curvefit and Scipy's curve_fit, and got slightly > different results for the same data using the same fit function. Curve fitting is a delicated matter. It must be noted that the values of the covariance matrix assume that the errors are distributed normally, but this is not always true. In that case, if you want precise values of the errors, you should shot higher: either add some random noise to your data following the adequate distribution and run it several times, or else switching to other algorithms. MINUIT works quite well for this, and it can even return asymmetric error estimates. The first is slower, but I think is the one that can best represent the true shape of the errors if the source is pure noise (uncorrelated). From josef.pktd at gmail.com Tue Oct 16 17:24:30 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 16 Oct 2012 17:24:30 -0400 Subject: [SciPy-User] Return sigmas from curve_fit In-Reply-To: References: <750484B4-1034-4C35-8E58-1F9F3F5CF5BE@continuum.io> Message-ID: On Tue, Oct 16, 2012 at 4:39 PM, Da?id wrote: > On Tue, Oct 16, 2012 at 9:32 PM, G?khan Sever wrote: >> I am comparing IDL's curvefit and Scipy's curve_fit, and got slightly >> different results for the same data using the same fit function. > > Curve fitting is a delicated matter. > > It must be noted that the values of the covariance matrix assume that > the errors are distributed normally, but this is not always true. Only if you have small samples and then you still only have a local approximation because of the nonlinearity and derivatives. In larger samples the law of large numbers implies that the estimates are normal distributed with the given covariance matrix under pretty general conditions. (least squares is semi-parametric and doesn't assume a specific distribution in large samples) In > that case, if you want precise values of the errors, you should shot > higher: either add some random noise to your data following the > adequate distribution and run it several times, sounds like bootstrap standard errors. Josef or else switching to > other algorithms. MINUIT works quite well for this, and it can even > return asymmetric error estimates. 
The first is slower, but I think is > the one that can best represent the true shape of the errors if the > source is pure noise (uncorrelated). > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From kevin.gullikson at gmail.com Mon Oct 15 23:19:06 2012 From: kevin.gullikson at gmail.com (Kevin Gullikson) Date: Mon, 15 Oct 2012 22:19:06 -0500 Subject: [SciPy-User] SmoothBivariateSpline giving wrong result Message-ID: Hi all, I am trying to do a 2d interpolation of unstructured data. To try to get a feel for how the functions work, I tried a simple test case and it is not working. See below: In [24]: x = numpy.arange(10.) In [25]: y = numpy.arange(10.) In [26]: z = x**2 + y**2 In [27]: fcn = SmoothBivariateSpline(x,y,z, s=0, kx=1, ky=1) In [28]: fcn(1,1) Out[28]: array([[ 0.]]) In [29]: fcn(1,5) Out[29]: array([[ 0.]]) What am I doing wrong? It seems like z should be a 2d array or something, but the documentation explicitly says it is a 1d array. Much thanks, and I look forward to feeling stupid! Kevin Gullikson -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Tue Oct 16 19:32:43 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 16 Oct 2012 19:32:43 -0400 Subject: [SciPy-User] SmoothBivariateSpline giving wrong result In-Reply-To: References: Message-ID: On Mon, Oct 15, 2012 at 11:19 PM, Kevin Gullikson wrote: > Hi all, > > I am trying to do a 2d interpolation of unstructured data. To try to get a > feel for how the functions work, I tried a simple test case and it is not > working. See below: > > In [24]: x = numpy.arange(10.) > > In [25]: y = numpy.arange(10.) > > In [26]: z = x**2 + y**2 > > In [27]: fcn = SmoothBivariateSpline(x,y,z, s=0, kx=1, ky=1) > > In [28]: fcn(1,1) > Out[28]: array([[ 0.]]) > > In [29]: fcn(1,5) > Out[29]: array([[ 0.]]) > > > What am I doing wrong? It seems like z should be a 2d array or something, > but the documentation explicitly says it is a 1d array. 1d is fine but you need points that cover an area in R^2 not just a line, diagonal from 0 to 10 >>> x,y = np.meshgrid(np.arange(10.), np.arange(10.)) >>> x.shape (10, 10) >>> x = x.flatten() >>> y = y.flatten() >>> z = x**2 + y**2 >>> z.shape (100,) >>> from scipy.interpolate import SmoothBivariateSpline >>> fcn = SmoothBivariateSpline(x,y,z, s=0, kx=1, ky=1) >>> fcn(1, 1) array([[ 2.41605245]]) >>> fcn(1, 5) array([[ 26.03716212]]) Josef > > Much thanks, and I look forward to feeling stupid! > Kevin Gullikson > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From josef.pktd at gmail.com Tue Oct 16 19:39:42 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 16 Oct 2012 19:39:42 -0400 Subject: [SciPy-User] SmoothBivariateSpline giving wrong result In-Reply-To: References: Message-ID: On Tue, Oct 16, 2012 at 7:32 PM, wrote: > On Mon, Oct 15, 2012 at 11:19 PM, Kevin Gullikson > wrote: >> Hi all, >> >> I am trying to do a 2d interpolation of unstructured data. To try to get a >> feel for how the functions work, I tried a simple test case and it is not >> working. See below: >> >> In [24]: x = numpy.arange(10.) >> >> In [25]: y = numpy.arange(10.) 
>> >> In [26]: z = x**2 + y**2 >> >> In [27]: fcn = SmoothBivariateSpline(x,y,z, s=0, kx=1, ky=1) >> >> In [28]: fcn(1,1) >> Out[28]: array([[ 0.]]) >> >> In [29]: fcn(1,5) >> Out[29]: array([[ 0.]]) >> >> >> What am I doing wrong? It seems like z should be a 2d array or something, >> but the documentation explicitly says it is a 1d array. > > 1d is fine but you need points that cover an area in R^2 not just a > line, diagonal from 0 to 10 > >>>> x,y = np.meshgrid(np.arange(10.), np.arange(10.)) >>>> x.shape > (10, 10) > >>>> x = x.flatten() >>>> y = y.flatten() >>>> z = x**2 + y**2 >>>> z.shape > (100,) >>>> from scipy.interpolate import SmoothBivariateSpline >>>> fcn = SmoothBivariateSpline(x,y,z, s=0, kx=1, ky=1) >>>> fcn(1, 1) > array([[ 2.41605245]]) >>>> fcn(1, 5) > array([[ 26.03716212]]) I don't remember how kx, ky are defined this actually fits through the points: >>> fcn = SmoothBivariateSpline(x,y,z, s=0, kx=2, ky=2) >>> fcn(1, 1) array([[ 2.]]) >>> fcn(1, 5) array([[ 26.]]) >>> fcn(2,2) array([[ 8.]]) Josef > > Josef > >> >> Much thanks, and I look forward to feeling stupid! >> Kevin Gullikson >> >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> From newville at cars.uchicago.edu Tue Oct 16 21:55:14 2012 From: newville at cars.uchicago.edu (Matt Newville) Date: Tue, 16 Oct 2012 20:55:14 -0500 Subject: [SciPy-User] Return sigmas from curve_fit In-Reply-To: References: <750484B4-1034-4C35-8E58-1F9F3F5CF5BE@continuum.io> Message-ID: Hi G?khan, On Tue, Oct 16, 2012 at 2:32 PM, G?khan Sever wrote: > Thanks Travis. > > That doesn't indeed, I missed the part that part of the curve_fit return was > variance of the estimate(s). > > I am comparing IDL's curvefit and Scipy's curve_fit, and got slightly > different results for the same data using the same fit function. I guess > IDL's result is slightly wrong when the default tol value is used (The > default value is 1.0 x 10-3.) comparing to the SciPy's default ftol of > 1.49012e-08. IDL's curvefit procedure is an implementation of the Levenberg-Marquardt algorithm in IDL, but not using or calling MINPACK-1 of Garbow, Hillstrom, and More. I believe curvefit.pro is based on the implementation from Bevington's book, though it may have evolved over the years from that. Scipy's leastsq() (and so curve_fit()) calls MINPACK-1, which is generally more stable and robust. Thus, it is perfectly reasonable for there to be small differences in results even thought the two naively claim to be using the same algorithm. Certainly having tolerances as high as 1.e-3 can also effect the results. The IDL mpfit.pro (http://cow.physics.wisc.edu/~craigm/idl/fitting.html) procedure is a bit closer to scipy.optimize.leastsq(), being a translation of MINPACK to IDL, and might be a better comparison. The mpfit.pro adds a few bells and whistles (parameter bounds) which MINPACK-1 does not have, but ignoring this gives (in my experience) very similar results to MINPACK-1. In general, scipy.optimize.leastsq() will be faster, and also has the good fortune of not being IDL. Da?id warned that the estimated uncertainties from the covariance matrix (which is automatically returned from leastsq() and curve_fit()) assumes that the errors are normally distributed, and that this assumption is questionable. This is equally true for all the implementations in question, so I doubt it would explain any differences you see. 
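(A crude but concrete way to check this on a toy problem is a residual bootstrap; the sketch below uses made-up synthetic data and a made-up model, and is not part of curve_fit, mpfit, or lmfit.)

```python
import numpy as np
from scipy.optimize import curve_fit

def model(x, a, b):
    return a * np.exp(b * x)

np.random.seed(0)
x = np.linspace(0.0, 1.0, 50)
y = model(x, 2.0, -1.5) + 0.05 * np.random.randn(x.size)

popt, pcov = curve_fit(model, x, y, p0=(1.0, -1.0))
sigma_cov = np.sqrt(np.diag(pcov))              # the usual covariance-based sigmas

resid = y - model(x, *popt)
draws = []
for _ in range(500):
    idx = np.random.randint(0, resid.size, resid.size)
    p_b, _ = curve_fit(model, x, model(x, *popt) + resid[idx], p0=popt)
    draws.append(p_b)
sigma_boot = np.asarray(draws).std(axis=0)      # bootstrap sigmas

print(sigma_cov, sigma_boot)                    # broadly comparable for this toy case
```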
I would also implore all to recognize that even if the estimated uncertainties from scipy.optimize.leastsq() are imperfect, they are far better than having none at all. It is very easy for the armchair analyst to claim that errors are not normally distributed (especially when the problem at hand hasn't even been identified!), and a bit harder for the practicing analyst to show that the errors are significantly non-normal in practice. Even when this claim is borne out to be true, it does not necessarily imply that the simple estimate of uncertainties is significantly wrong. Rather, it implies that 1 statistic (stddev) is not the whole story. You may find lmfit-py (http://newville.github.com/lmfit-py/) useful. This is built on top of scipy.optimize.leastsq(), and add the ability to apply bounds, fix parameters, and place algebraic constraints between parameters (IMHO in a manner easier and more robust than IDL's mpfit.pro, and more python-ic). It also provides functions to walk through the parameter space to more explicitly determine confidence intervals, and to test whether errors are non-normally distributed (see http://newville.github.com/lmfit-py/confidence.html). The example there shows a case with clearly non-normal distribution of uncertainties, and a very skewed parameter space. The automatically estimated uncertainties are off by 20% or less of the more carefully (and slowly) found values. I would say that's a pretty good endorsement of the automatic estimate from the covariance matrix, but it's good to be able to check this out yourself even if only to show that the the automatic estimate is close enough and a more careful analysis doesn't change your conclusions. Hope that helps, --Matt Newville From gokhansever at gmail.com Tue Oct 16 22:36:42 2012 From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_Sever?=) Date: Tue, 16 Oct 2012 20:36:42 -0600 Subject: [SciPy-User] Masked array output from scipy.io.netcdf_file Message-ID: Hello, Is there any interest out to read data as masked array if a netcdf file variable contains appropriate filled_value attribute or if asked explicitly using scipy.io.netcdf_file? Right now, I use netcdf4-python's Dataset constructor to read bunch of variables, and I was planning to replace it with netcdf_file module. However, I see that netcdf_file cannot automatically construct masked arrays from the file. I only read from netcdf files, not writing, so this would eliminate an external dependency to run my script. Thanks. -- G?khan -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Tue Oct 16 23:08:43 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 16 Oct 2012 23:08:43 -0400 Subject: [SciPy-User] a TOST In-Reply-To: References: Message-ID: On Tue, Oct 16, 2012 at 3:38 PM, wrote: > On Tue, Oct 16, 2012 at 2:24 PM, Ralf Gommers wrote: >> >> >> On Tue, Oct 16, 2012 at 6:46 PM, wrote: >>> >>> On Tue, Oct 16, 2012 at 12:31 PM, Ralf Gommers >>> wrote: >>> > >>> > >>> > On Tue, Oct 16, 2012 at 6:25 PM, wrote: >>> >> >>> >> https://gist.github.com/3900314 >>> >> >>> >> label: statistical tests and options that are missing in scipy and >>> >> statsmodels. >>> > >>> > >>> > Is this a "how many incomprehensibly named t-test functions can we >>> > create" >>> > exercise? 
>>> > >>> > (only half kidding) >>> >>> TOST is a long established name in statistics >>> >>> exercise: look at how much SAS TTEST can do >>> >>> http://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_ttest_a0000000128.htm >>> and ours are ... to continue in R: > t.test(var1 ~ fact, data=clinic, mu=-0.6, alternative="greater") Welch Two Sample t-test data: var1 by fact t = 3.0205, df = 26.748, p-value = 0.002749 alternative hypothesis: true difference in means is greater than -0.6 95 percent confidence interval: -0.2666863 Inf sample estimates: mean in group 1 mean in group 2 3.498000 3.333333 scipy.stats.ttest_ind is missing a ``mu`` argument, difference in mean to test Without it, an equivalence test for independent samples cannot be simply written on top of it. volunteers? Josef another TOST --------------- tost {equivalence} R Documentation Computes a TOST for equivalence from paired or unpaired data Description This function computes the test and key test quantities for the two one-sided test for equivalence, as documented in Schuirmann (1981) and Westlake (1981). This function computes the test for a sample of paired differences or two samples, assumed to be from a normally-distributed population. Usage tost(x, y = NULL, alpha = 0.05, epsilon, ...) ---------------- >> >> >> It can do more than what's in scipy.stats, I know. There's one function >> TTEST in SAS though, with some keywords to control behavior. I'm all for >> adding more useful functionality, but against more small functions with >> names that will be meaningless for the vast majority of users. > > SAS is using big Procedures for most things. > > Unless a user looks for something known, most of these names are > completely uninformative > bartlett, levene, mood, spearmanr, mannwhitneyu, ... > (I never heard of most of them, before working my way through it.) > > example > http://rgm2.lab.nig.ac.jp/RGM2/functions.php?show=all&query=package:lmtest > > I tried to put a descriptive prefix on some of the tests > http://statsmodels.sourceforge.net/devel/stats.html#residual-diagnostics-and-specification-tests > > I haven't figured out a good pattern yet for combining tests in the > direction of the SAS style. > > the name for TOST should be ``equivalence_test`` or something similar. > > The main helpful information are pages about when to use which tests. > my specific page > http://statsmodels.sourceforge.net/devel/diagnostic.html#diagnostics > > for general 1 sample, 2 sample and k sample tests overview tables are > available on the internet. > > Josef > > >> >> Ralf >> >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> From dineshbvadhia at hotmail.com Wed Oct 17 06:15:10 2012 From: dineshbvadhia at hotmail.com (Dinesh B Vadhia) Date: Wed, 17 Oct 2012 03:15:10 -0700 Subject: [SciPy-User] Fw: Memory required for sparse calculation Message-ID: Sorry, ignore question. Best ... From: Dinesh B Vadhia Sent: Tuesday, October 16, 2012 1:17 AM To: scipy-user at scipy.org Subject: Memory required for sparse calculation Nathan Bell answered this question a long time ago but I cannot find the archive reference to it: in the Scipy matrix calculation y <- Ax, what is the memory required for the calculation to take place in addition to A (ignore memory requirements for y and x)? -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From gokhansever at gmail.com Wed Oct 17 12:12:36 2012 From: gokhansever at gmail.com (=?UTF-8?Q?G=C3=B6khan_Sever?=) Date: Wed, 17 Oct 2012 10:12:36 -0600 Subject: [SciPy-User] Return sigmas from curve_fit In-Reply-To: References: <750484B4-1034-4C35-8E58-1F9F3F5CF5BE@continuum.io> Message-ID: On Tue, Oct 16, 2012 at 7:55 PM, Matt Newville wrote: > Hi G?khan, > > On Tue, Oct 16, 2012 at 2:32 PM, G?khan Sever > wrote: > > Thanks Travis. > > > > That doesn't indeed, I missed the part that part of the curve_fit return > was > > variance of the estimate(s). > > > > I am comparing IDL's curvefit and Scipy's curve_fit, and got slightly > > different results for the same data using the same fit function. I guess > > IDL's result is slightly wrong when the default tol value is used (The > > default value is 1.0 x 10-3.) comparing to the SciPy's default ftol of > > 1.49012e-08. > > IDL's curvefit procedure is an implementation of the > Levenberg-Marquardt algorithm in IDL, but not using or calling > MINPACK-1 of Garbow, Hillstrom, and More. I believe curvefit.pro is > based on the implementation from Bevington's book, though it may have > evolved over the years from that. Scipy's leastsq() (and so > curve_fit()) calls MINPACK-1, which is generally more stable and > robust. Thus, it is perfectly reasonable for there to be small > differences in results even thought the two naively claim to be using > the same algorithm. Certainly having tolerances as high as 1.e-3 can > also effect the results. > > The IDL mpfit.pro > (http://cow.physics.wisc.edu/~craigm/idl/fitting.html) procedure is a > bit closer to scipy.optimize.leastsq(), being a translation of MINPACK > to IDL, and might be a better comparison. The mpfit.pro adds a few > bells and whistles (parameter bounds) which MINPACK-1 does not have, > but ignoring this gives (in my experience) very similar results to > MINPACK-1. In general, scipy.optimize.leastsq() will be faster, and > also has the good fortune of not being IDL. > > Da?id warned that the estimated uncertainties from the covariance > matrix (which is automatically returned from leastsq() and > curve_fit()) assumes that the errors are normally distributed, and > that this assumption is questionable. This is equally true for all > the implementations in question, so I doubt it would explain any > differences you see. I would also implore all to recognize that even > if the estimated uncertainties from scipy.optimize.leastsq() are > imperfect, they are far better than having none at all. It is very > easy for the armchair analyst to claim that errors are not normally > distributed (especially when the problem at hand hasn't even been > identified!), and a bit harder for the practicing analyst to show that > the errors are significantly non-normal in practice. Even when this > claim is borne out to be true, it does not necessarily imply that the > simple estimate of uncertainties is significantly wrong. Rather, it > implies that 1 statistic (stddev) is not the whole story. > > You may find lmfit-py (http://newville.github.com/lmfit-py/) useful. > This is built on top of scipy.optimize.leastsq(), and add the ability > to apply bounds, fix parameters, and place algebraic constraints > between parameters (IMHO in a manner easier and more robust than IDL's > mpfit.pro, and more python-ic). 
It also provides functions to walk > through the parameter space to more explicitly determine confidence > intervals, and to test whether errors are non-normally distributed > (see http://newville.github.com/lmfit-py/confidence.html). The > example there shows a case with clearly non-normal distribution of > uncertainties, and a very skewed parameter space. The automatically > estimated uncertainties are off by 20% or less of the more carefully > (and slowly) found values. I would say that's a pretty good > endorsement of the automatic estimate from the covariance matrix, but > it's good to be able to check this out yourself even if only to show > that the the automatic estimate is close enough and a more careful > analysis doesn't change your conclusions. > > Hope that helps, > > --Matt Newville > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > Thanks for the detailed input Matt. Linked you can find the script that I use to estimate fit parameters by applying a function in the form of N=Cs^k Perhaps the function I use is not the best for to construct the fit, or lack of data in lower supersaturation results with the curve kinking at lower values. Other than this point, the best and sigma estimates that I get from curve_fit is useful to me. I am not worrying too much about the error distribution at this point, since most of the data points lie within +- 1 sigmas. If there is any interest I can provide a similar script written in IDL to see the differences. Thanks again. http://atmos.uwyo.edu/~gsever/data/test/curvefit_test.py http://atmos.uwyo.edu/~gsever/data/test/curvefit_test.png -- G?khan -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Wed Oct 17 22:36:10 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 17 Oct 2012 22:36:10 -0400 Subject: [SciPy-User] planet scipy dead or alive? Message-ID: Planet scipy hasn't been updating for a month. Josef From ognen at enthought.com Wed Oct 17 23:41:53 2012 From: ognen at enthought.com (Ognen Duzlevski) Date: Wed, 17 Oct 2012 22:41:53 -0500 Subject: [SciPy-User] planet scipy dead or alive? In-Reply-To: References: Message-ID: On Wed, Oct 17, 2012 at 9:36 PM, wrote: > Planet scipy hasn't been updating for a month. For some reason the script randomly dies every month or so on the cache files it uses. I cleaned up the cache directory and re-ran it. Ognen From josef.pktd at gmail.com Wed Oct 17 23:55:54 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 17 Oct 2012 23:55:54 -0400 Subject: [SciPy-User] planet scipy dead or alive? In-Reply-To: References: Message-ID: On Wed, Oct 17, 2012 at 11:41 PM, Ognen Duzlevski wrote: > On Wed, Oct 17, 2012 at 9:36 PM, wrote: >> Planet scipy hasn't been updating for a month. > > For some reason the script randomly dies every month or so on the > cache files it uses. I cleaned up the cache directory and re-ran it. 
> Ognen Thank you, Josef > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From Jerome.Kieffer at esrf.fr Thu Oct 18 03:26:35 2012 From: Jerome.Kieffer at esrf.fr (Jerome Kieffer) Date: Thu, 18 Oct 2012 09:26:35 +0200 Subject: [SciPy-User] numpy.histogram is slow In-Reply-To: References: Message-ID: <20121018092635.03d9ea1e.Jerome.Kieffer@esrf.fr> On Tue, 16 Oct 2012 13:04:24 -0700 Chris Weisiger wrote: > My use case is displaying camera image data to the user as it is > streamed to us; this includes a histogram showing the distribution of > intensities in the image. Thus I have a 512x512 array of pixel data > (unsigned 16-bit ints) that I need to generate a histogram for. > Unfortunately, numpy.histogram takes a significant amount of time -- > about 15ms per call. That's over 60% of the cost of showing an image > to the user, which means that I can't quite display data as quickly as > it comes in. So I'm looking for some faster option. I implemented a 1D and 2D histogram, weighted and unweighted using cython (>=0.17) in parallel. It is much faster than the one provided by numpy: 4ms vs 25ms in your case on my computer https://github.com/kif/pyFAI/blob/master/src/histogram.pyx HTH -- J?r?me Kieffer On-Line Data analysis / Software Group ISDD / ESRF tel +33 476 882 445 From aronne.merrelli at gmail.com Thu Oct 18 08:38:47 2012 From: aronne.merrelli at gmail.com (Aronne Merrelli) Date: Thu, 18 Oct 2012 07:38:47 -0500 Subject: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) In-Reply-To: References: Message-ID: On Tue, Oct 16, 2012 at 1:02 PM, Patrick Marsh wrote: > Greetings, > > > I know that people on this list are way smarter than I, so hopefully someone > can help me out here. > > I have a gridded dataset of 1s and 0s with which I'm needing to apply a > rotated, anisotropic Gaussian filter to achieve a kernel density estimate. > Currently I have some Cython code that I wrote to do this. The code for this > can be found here: https://gist.github.com/3900591. In essence, I search > through the grid for a value of 1. When a 1 is found, a weighting function > is applied to all the surrounding grid points. Their are rotation matrices > at work throughout the code to handle the fact the axes of the anisotropic > Gaussian kernel can be off-cartesian axes. The code works, and is reasonably > efficient for small domains or large grid spacing. However, as grid spacing > decreases, the performance takes a substantial hit. > Patrick, I pulled down the gist code to run the cython annotate on it, and found that there were some declarations missing. I added these at the top of the file: ctypedef np.float64_t DTYPE64_t DTYPE64 = np.float from libc.math cimport exp, sin, cos And it compiles and looks OK to me; there isn't anything obvious that would make it slow. However, depending on how you defined exp, sin, cos, in the file you are actually running, if you are linking those back to the numpy versions instead of the C versions, this code would be pretty slow. Otherwise, just after skimming the cython code, it looks like the gaussian kernel (partweight in the cython code) is fixed, so this is really just a convolution. If you compute partweight once, in python, then you can just use the convolution function in scipy. This should be as fast as any simple cython code, I'd think, and it is a lot simpler. If you try that, is it enough? 
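In code, that suggestion is roughly the following sketch, with ``partweight`` standing in for the precomputed kernel; the grid size, event locations and kernel width below are made up:

import numpy as np
from scipy import ndimage

# sparse 0/1 "event" grid
data = np.zeros((600, 800))
data[np.random.randint(0, 600, 50), np.random.randint(0, 800, 50)] = 1.0

# precompute the weighting function once (isotropic placeholder here)
y, x = np.mgrid[-25:26, -25:26]
partweight = np.exp(-(x ** 2 + y ** 2) / (2.0 * 10.0 ** 2))
partweight /= partweight.sum()

# the whole KDE loop then collapses to one convolution call
kde = ndimage.convolve(data, partweight, mode='constant', cval=0.0)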
If not, can you be more specific as to what cases you have where the performance is bad? Specifically: what size is the data array? what size is the kernel? what number of points are non zero in the data array? HTH, Aronne From zachary.pincus at yale.edu Thu Oct 18 11:18:15 2012 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Thu, 18 Oct 2012 11:18:15 -0400 Subject: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) In-Reply-To: References: Message-ID: > 1. Is there a way to do an image rotation such that grid points aren't lost when using a spline of order 0? When resampling binary data, I usually do transforms with spline order 1 or 2, and then threshold the data back to binary after the transform. This anecdotally "looks better" but I have no idea about any theoretical backing for it, and I would doubt that the count of elements would necessarily be the same before and after. From cweisiger at msg.ucsf.edu Thu Oct 18 16:06:18 2012 From: cweisiger at msg.ucsf.edu (Chris Weisiger) Date: Thu, 18 Oct 2012 13:06:18 -0700 Subject: [SciPy-User] numpy.histogram is slow In-Reply-To: <20121018092635.03d9ea1e.Jerome.Kieffer@esrf.fr> References: <20121018092635.03d9ea1e.Jerome.Kieffer@esrf.fr> Message-ID: On Thu, Oct 18, 2012 at 12:26 AM, Jerome Kieffer wrote: > > I implemented a 1D and 2D histogram, weighted and unweighted using cython (>=0.17) in parallel. > It is much faster than the one provided by numpy: > 4ms vs 25ms in your case on my computer > https://github.com/kif/pyFAI/blob/master/src/histogram.pyx Interesting. Is there any particular reason why this code could not be integrated into Numpy itself? A factor-of-6 improvement in speed on multi-processor machines is significant. > > HTH > -- > J?r?me Kieffer -Chris From david_baddeley at yahoo.com.au Thu Oct 18 16:38:34 2012 From: david_baddeley at yahoo.com.au (David Baddeley) Date: Thu, 18 Oct 2012 13:38:34 -0700 (PDT) Subject: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) In-Reply-To: References: Message-ID: <1350592714.26818.YahooMailNeo@web113410.mail.gq1.yahoo.com> github seems to be down & I couldn't look at your code. Following on from what Aronne has said, however, if you can get the desired results by rotating your data and applying ndimage.gaussian_filter, is there anything stopping you just generating an anisotropic rotated kernel and then using ndimage.convolve2d? cheers, David ________________________________ From: Aronne Merrelli To: SciPy Users List Sent: Friday, 19 October 2012 1:38 AM Subject: Re: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) On Tue, Oct 16, 2012 at 1:02 PM, Patrick Marsh wrote: > Greetings, > > > I know that people on this list are way smarter than I, so hopefully someone > can help me out here. > > I have a gridded dataset of 1s and 0s with which I'm needing to apply a > rotated, anisotropic Gaussian filter to achieve a kernel density estimate. > Currently I have some Cython code that I wrote to do this. The code for this > can be found here: https://gist.github.com/3900591. In essence, I search > through the grid for a value of 1. When a 1 is found, a weighting function > is applied to all the surrounding grid points. Their are rotation matrices > at work throughout the code to handle the fact the axes of the anisotropic > Gaussian kernel can be off-cartesian axes. The code works, and is reasonably > efficient for small domains or large grid spacing. 
However, as grid spacing > decreases, the performance takes a substantial hit. > Patrick, I pulled down the gist code to run the cython annotate on it, and found that there were some declarations missing. I added these at the top of the file: ctypedef np.float64_t DTYPE64_t DTYPE64 = np.float from libc.math cimport exp, sin, cos And it compiles and looks OK to me; there isn't anything obvious that would make it slow. However, depending on how you defined exp, sin, cos, in the file you are actually running, if you are linking those back to the numpy versions instead of the C versions, this code would be pretty slow. Otherwise, just after skimming the cython code, it looks like the gaussian kernel (partweight in the cython code) is fixed, so this is really just a convolution. If you compute partweight once, in python, then you can just use the convolution function in scipy. This should be as fast as any simple cython code, I'd think, and it is a lot simpler. If you try that, is it enough? If not, can you be more specific as to what cases you have where the performance is bad? Specifically: what size is the data array? what size is the kernel? what number of points are non zero in the data array? HTH, Aronne _______________________________________________ SciPy-User mailing list SciPy-User at scipy.org http://mail.scipy.org/mailman/listinfo/scipy-user -------------- next part -------------- An HTML attachment was scrubbed... URL: From cpp6f at virginia.edu Fri Oct 19 00:30:59 2012 From: cpp6f at virginia.edu (Craig Plaisance) Date: Fri, 19 Oct 2012 00:30:59 -0400 Subject: [SciPy-User] installation error: undefined symbol: omp_in_parallel Message-ID: <5080D783.9010605@virginia.edu> Hi, I'm having a problem getting scipy installed with linking to mkl 11.1. Here is the output and the mkl section from site.cfg. I've played quite a bit with the mkl_libs variable in site.cfg and googled it to death with no success. Any help is much appreciated! Thanks Craig [root at atlas scipy-0.11.0]# python setup.py config --compiler=intelem --fcompiler=intelem Traceback (most recent call last): File "setup.py", line 208, in setup_package() File "setup.py", line 145, in setup_package from numpy.distutils.core import setup File "/usr/local/lib/python2.7/site-packages/numpy/__init__.py", line 137, in import add_newdocs File "/usr/local/lib/python2.7/site-packages/numpy/add_newdocs.py", line 9, in from numpy.lib import add_newdoc File "/usr/local/lib/python2.7/site-packages/numpy/lib/__init__.py", line 13, in from polynomial import * File "/usr/local/lib/python2.7/site-packages/numpy/lib/polynomial.py", line 17, in from numpy.linalg import eigvals, lstsq File "/usr/local/lib/python2.7/site-packages/numpy/linalg/__init__.py", line 48, in from linalg import * File "/usr/local/lib/python2.7/site-packages/numpy/linalg/linalg.py", line 23, in from numpy.linalg import lapack_lite ImportError: /share/apps/intel/Compiler/11.1/046/mkl/lib/em64t/libmkl_intel_thread.so: undefined symbol: omp_in_parallel *Here is the mkl section of site.cfg* [mkl] library_dirs = /share/apps/intel/Compiler/11.1/046/mkl/lib/em64t include_dirs = /share/apps/intel/Compiler/11.1/046/mkl/include:/share/apps/intel/Compiler/11.1/046/include lapack_libs = mkl_lapack mkl_libs = mkl_def,mkl_intel_lp64,mkl_intel_thread,mkl_core,guide,iomp5 -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From patrick.marsh at noaa.gov Thu Oct 18 12:20:09 2012 From: patrick.marsh at noaa.gov (Patrick Marsh) Date: Thu, 18 Oct 2012 11:20:09 -0500 Subject: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) In-Reply-To: References: Message-ID: > > Patrick, > > I pulled down the gist code to run the cython annotate on it, and > found that there were some declarations missing. I added these at the > top of the file: > > ctypedef np.float64_t DTYPE64_t > DTYPE64 = np.float > from libc.math cimport exp, sin, cos > > And it compiles and looks OK to me; there isn't anything obvious that > would make it slow. However, depending on how you defined exp, sin, > cos, in the file you are actually running, if you are linking those > back to the numpy versions instead of the C versions, this code would > be pretty slow. > Hi, Aronne, Thanks for the great response. I really appreciate You did catch a couple of declarations I missed when posting the gist. (I created the gist from a module I have, and forgot to copy all of the header stuff.) I've fixed that now, but essentially I declare exp, cos, sin, and fabs as: cdef extern from 'math.h': float exp(float x) float cos(float x) float sin(float x) float fabs(float x) Is the way you define the math functions better/faster? My (limited) thinking is that your method and my method achieve the same thing, but my understanding of Cython is simply from hacking around with it and could easily be in err. > Otherwise, just after skimming the cython code, it looks like the > gaussian kernel (partweight in the cython code) is fixed, so this is > really just a convolution. If you compute partweight once, in python, > then you can just use the convolution function in scipy. This should > be as fast as any simple cython code, I'd think, and it is a lot > simpler. > When I first wrote this code (a couple of years ago) I didn't know about convolutions. However, as I'm learning about them, I see that what I'm doing really is a convolution. My attempts to use the convolution function in scipy is slower than my Cython code. Maybe I'm doing something wrong? The line below is how I'm calling the convolution: smooth_hist = ndimage.filters.convolve(data, weights, mode='constant', cval=0.0, origin=0) If you try that, is it enough? If not, can you be more specific as to > what cases you have where the performance is bad? Specifically: what > size is the data array? what size is the kernel? what number of points > are non zero in the data array? > Currently I'm using this on a grid that's approxiately 800x600 with a kernel of about half that (Gaussian function with sigma of ~40km). This grid is essentially the eastern half of the United States at a grid spacing of 4km. On this grid, the Cython code is plenty fast. However, as I move toward dealing with finer meshes, my grid goes from 800x600 to closer to 5000x3000. Again, when dealing with binary, discrete data, the Cython routine is fairly quick. However, as the data become closer to continuous fields (say a temperature field instead of tornado tracks), the Cython code's performance decreases fairly quickly. When compared to using Gaussian filters (which I understand to be convolutions), the Cython code is substantially slower. The Cython code is still significantly slower than my workflow of "rotate, use gaussian filter, rotate back". The problem with the rotate workflow is that I'm uncomfortable with losing data in the rotations. 
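Roughly, that workflow looks like the following sketch; the sigma values and angle are placeholders, and order=1 keeps the interpolation cheap:

import numpy as np
from scipy import ndimage

def rotate_filter_rotate(data, sigma_major, sigma_minor, angle_deg):
    # rotate so the kernel axes line up with the grid, smooth, rotate back
    rot = ndimage.rotate(data, angle_deg, reshape=False, order=1, mode='constant')
    sm = ndimage.gaussian_filter(rot, sigma=(sigma_major, sigma_minor))
    return ndimage.rotate(sm, -angle_deg, reshape=False, order=1, mode='constant')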
(To see one application of what I'm doing, here's a paper in Weather and Forecasting describing what I'm doing: http://www.patricktmarsh.com/research/pubs/refereed/marshetal2012_precip.pdf ) I'm currently proceeding with the Cython code as it's sufficient for what I'm doing right now. However, I was thinking of down the road. I wasn't sure where Scipy's Gaussian filters were getting such a bigger speed up than my Cython code. Thanks again Patrick -------------- next part -------------- An HTML attachment was scrubbed... URL: From cpp6f at virginia.edu Fri Oct 19 00:07:32 2012 From: cpp6f at virginia.edu (Craig Plaisance) Date: Fri, 19 Oct 2012 00:07:32 -0400 Subject: [SciPy-User] installation error: undefined symbol: omp_in_parallel Message-ID: <5080D204.7050106@virginia.edu> Hi, I'm having a problem getting scipy installed with linking to mkl 11.1. Here is the output and the mkl section from site.cfg. I've played quite a bit with the mkl_libs variable in site.cfg and googled it to death with no success. Any help is much appreciated! Thanks Craig [root at atlas scipy-0.11.0]# python setup.py config --compiler=intelem --fcompiler=intelem Traceback (most recent call last): File "setup.py", line 208, in setup_package() File "setup.py", line 145, in setup_package from numpy.distutils.core import setup File "/usr/local/lib/python2.7/site-packages/numpy/__init__.py", line 137, in import add_newdocs File "/usr/local/lib/python2.7/site-packages/numpy/add_newdocs.py", line 9, in from numpy.lib import add_newdoc File "/usr/local/lib/python2.7/site-packages/numpy/lib/__init__.py", line 13, in from polynomial import * File "/usr/local/lib/python2.7/site-packages/numpy/lib/polynomial.py", line 17, in from numpy.linalg import eigvals, lstsq File "/usr/local/lib/python2.7/site-packages/numpy/linalg/__init__.py", line 48, in from linalg import * File "/usr/local/lib/python2.7/site-packages/numpy/linalg/linalg.py", line 23, in from numpy.linalg import lapack_lite ImportError: /share/apps/intel/Compiler/11.1/046/mkl/lib/em64t/libmkl_intel_thread.so: undefined symbol: omp_in_parallel *Here is the mkl section of site.cfg* [mkl] library_dirs = /share/apps/intel/Compiler/11.1/046/mkl/lib/em64t include_dirs = /share/apps/intel/Compiler/11.1/046/mkl/include:/share/apps/intel/Compiler/11.1/046/include lapack_libs = mkl_lapack mkl_libs = mkl_def,mkl_intel_lp64,mkl_intel_thread,mkl_core,guide,iomp5 -------------- next part -------------- An HTML attachment was scrubbed... URL: From davidmenhur at gmail.com Fri Oct 19 10:34:39 2012 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Fri, 19 Oct 2012 16:34:39 +0200 Subject: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) In-Reply-To: References: Message-ID: If you are going to apply different filters to the same image, it may be faster to switch to the Fourier transform. In this case, the result is the IFT of the FT of your data multiplied by the FT of your kernel. Doing all the FT may be expensive, but it can be useful if you are reusing the data, and Scipy is linked to very optimized FFT libraries. On Thu, Oct 18, 2012 at 6:20 PM, Patrick Marsh wrote: >> Patrick, >> >> I pulled down the gist code to run the cython annotate on it, and >> found that there were some declarations missing. I added these at the >> top of the file: >> >> ctypedef np.float64_t DTYPE64_t >> DTYPE64 = np.float >> from libc.math cimport exp, sin, cos >> >> And it compiles and looks OK to me; there isn't anything obvious that >> would make it slow. 
However, depending on how you defined exp, sin, >> cos, in the file you are actually running, if you are linking those >> back to the numpy versions instead of the C versions, this code would >> be pretty slow. > > > Hi, Aronne, > > Thanks for the great response. I really appreciate You did catch a couple of > declarations I missed when posting the gist. (I created the gist from a > module I have, and forgot to copy all of the header stuff.) I've fixed that > now, but essentially I declare exp, cos, sin, and fabs as: > > > > cdef extern from 'math.h': > > float exp(float x) > > float cos(float x) > > float sin(float x) > > float fabs(float x) > > > Is the way you define the math functions better/faster? My (limited) > thinking is that your method and my method achieve the same thing, but my > understanding of Cython is simply from hacking around with it and could > easily be in err. > > > >> >> Otherwise, just after skimming the cython code, it looks like the >> gaussian kernel (partweight in the cython code) is fixed, so this is >> really just a convolution. If you compute partweight once, in python, >> then you can just use the convolution function in scipy. This should >> be as fast as any simple cython code, I'd think, and it is a lot >> simpler. > > > > When I first wrote this code (a couple of years ago) I didn't know about > convolutions. However, as I'm learning about them, I see that what I'm doing > really is a convolution. My attempts to use the convolution function in > scipy is slower than my Cython code. Maybe I'm doing something wrong? The > line below is how I'm calling the convolution: > > smooth_hist = ndimage.filters.convolve(data, weights, mode='constant', > cval=0.0, origin=0) > > > >> If you try that, is it enough? If not, can you be more specific as to >> what cases you have where the performance is bad? Specifically: what >> size is the data array? what size is the kernel? what number of points >> are non zero in the data array? > > > > Currently I'm using this on a grid that's approxiately 800x600 with a kernel > of about half that (Gaussian function with sigma of ~40km). This grid is > essentially the eastern half of the United States at a grid spacing of 4km. > On this grid, the Cython code is plenty fast. However, as I move toward > dealing with finer meshes, my grid goes from 800x600 to closer to 5000x3000. > Again, when dealing with binary, discrete data, the Cython routine is fairly > quick. However, as the data become closer to continuous fields (say a > temperature field instead of tornado tracks), the Cython code's performance > decreases fairly quickly. When compared to using Gaussian filters (which I > understand to be convolutions), the Cython code is substantially slower. The > Cython code is still significantly slower than my workflow of "rotate, use > gaussian filter, rotate back". The problem with the rotate workflow is that > I'm uncomfortable with losing data in the rotations. > > (To see one application of what I'm doing, here's a paper in Weather and > Forecasting describing what I'm doing: > http://www.patricktmarsh.com/research/pubs/refereed/marshetal2012_precip.pdf) > > I'm currently proceeding with the Cython code as it's sufficient for what > I'm doing right now. However, I was thinking of down the road. I wasn't sure > where Scipy's Gaussian filters were getting such a bigger speed up than my > Cython code. 
> > Thanks again > Patrick > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From andy.terrel at gmail.com Fri Oct 19 11:07:40 2012 From: andy.terrel at gmail.com (Andy Ray Terrel) Date: Fri, 19 Oct 2012 10:07:40 -0500 Subject: [SciPy-User] installation error: undefined symbol: omp_in_parallel In-Reply-To: <5080D783.9010605@virginia.edu> References: <5080D783.9010605@virginia.edu> Message-ID: Hey Craig, You might try using the mkl without OpenMP threading. My guess from just looking at the error is that the compiler isn't getting passed the right omp flags. Looking at: http://software.intel.com/sites/products/mkl/ The link line should be: -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -lpthread -lm -- Andy On Thu, Oct 18, 2012 at 11:30 PM, Craig Plaisance wrote: > Hi, I'm having a problem getting scipy installed with linking to mkl 11.1. > Here is the output and the mkl section from site.cfg. I've played quite a > bit with the mkl_libs variable in site.cfg and googled it to death with no > success. Any help is much appreciated! Thanks > > Craig > > > [root at atlas scipy-0.11.0]# python setup.py config --compiler=intelem > --fcompiler=intelem > Traceback (most recent call last): > File "setup.py", line 208, in > setup_package() > File "setup.py", line 145, in setup_package > from numpy.distutils.core import setup > File "/usr/local/lib/python2.7/site-packages/numpy/__init__.py", line 137, > in > import add_newdocs > File "/usr/local/lib/python2.7/site-packages/numpy/add_newdocs.py", line > 9, in > from numpy.lib import add_newdoc > File "/usr/local/lib/python2.7/site-packages/numpy/lib/__init__.py", line > 13, in > from polynomial import * > File "/usr/local/lib/python2.7/site-packages/numpy/lib/polynomial.py", > line 17, in > from numpy.linalg import eigvals, lstsq > File "/usr/local/lib/python2.7/site-packages/numpy/linalg/__init__.py", > line 48, in > from linalg import * > File "/usr/local/lib/python2.7/site-packages/numpy/linalg/linalg.py", line > 23, in > from numpy.linalg import lapack_lite > ImportError: > /share/apps/intel/Compiler/11.1/046/mkl/lib/em64t/libmkl_intel_thread.so: > undefined symbol: omp_in_parallel > > > Here is the mkl section of site.cfg > > [mkl] > library_dirs = /share/apps/intel/Compiler/11.1/046/mkl/lib/em64t > include_dirs = > /share/apps/intel/Compiler/11.1/046/mkl/include:/share/apps/intel/Compiler/11.1/046/include > lapack_libs = mkl_lapack > mkl_libs = mkl_def,mkl_intel_lp64,mkl_intel_thread,mkl_core,guide,iomp5 > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From aronne.merrelli at gmail.com Fri Oct 19 11:25:45 2012 From: aronne.merrelli at gmail.com (Aronne Merrelli) Date: Fri, 19 Oct 2012 10:25:45 -0500 Subject: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) In-Reply-To: References: Message-ID: On Thu, Oct 18, 2012 at 11:20 AM, Patrick Marsh wrote: >> Patrick, >> >> I pulled down the gist code to run the cython annotate on it, and >> found that there were some declarations missing. I added these at the >> top of the file: >> >> ctypedef np.float64_t DTYPE64_t >> DTYPE64 = np.float >> from libc.math cimport exp, sin, cos >> >> And it compiles and looks OK to me; there isn't anything obvious that >> would make it slow. 
However, depending on how you defined exp, sin, >> cos, in the file you are actually running, if you are linking those >> back to the numpy versions instead of the C versions, this code would >> be pretty slow. > > > Hi, Aronne, > > Thanks for the great response. I really appreciate You did catch a couple of > declarations I missed when posting the gist. (I created the gist from a > module I have, and forgot to copy all of the header stuff.) I've fixed that > now, but essentially I declare exp, cos, sin, and fabs as: > > > > cdef extern from 'math.h': > > float exp(float x) > > float cos(float x) > > float sin(float x) > > float fabs(float x) > > > Is the way you define the math functions better/faster? My (limited) > thinking is that your method and my method achieve the same thing, but my > understanding of Cython is simply from hacking around with it and could > easily be in err. I'm not a Cython expert, but as I understand it both definitions should produce functionally equivalent C code. So it shouldn't affect the speed at all. > > >> >> Otherwise, just after skimming the cython code, it looks like the >> gaussian kernel (partweight in the cython code) is fixed, so this is >> really just a convolution. If you compute partweight once, in python, >> then you can just use the convolution function in scipy. This should >> be as fast as any simple cython code, I'd think, and it is a lot >> simpler. > > > > When I first wrote this code (a couple of years ago) I didn't know about > convolutions. However, as I'm learning about them, I see that what I'm doing > really is a convolution. My attempts to use the convolution function in > scipy is slower than my Cython code. Maybe I'm doing something wrong? The > line below is how I'm calling the convolution: > > smooth_hist = ndimage.filters.convolve(data, weights, mode='constant', > cval=0.0, origin=0) Hmm, I think what is happening here is that your custom cython code is making an optimization that you do not loop over the weights when data has a zero array element. This relates to my other question about how many values are nonzero in the data array. If it is very sparse, then your cython code will probably be as good as you can do with non-parallel code (I don't have any experience with parallelizing things so I can't help there). > > >> If you try that, is it enough? If not, can you be more specific as to >> what cases you have where the performance is bad? Specifically: what >> size is the data array? what size is the kernel? what number of points >> are non zero in the data array? > > > > Currently I'm using this on a grid that's approxiately 800x600 with a kernel > of about half that (Gaussian function with sigma of ~40km). This grid is > essentially the eastern half of the United States at a grid spacing of 4km. > On this grid, the Cython code is plenty fast. However, as I move toward > dealing with finer meshes, my grid goes from 800x600 to closer to 5000x3000. > Again, when dealing with binary, discrete data, the Cython routine is fairly > quick. However, as the data become closer to continuous fields (say a > temperature field instead of tornado tracks), the Cython code's performance > decreases fairly quickly. When compared to using Gaussian filters (which I > understand to be convolutions), the Cython code is substantially slower. The > Cython code is still significantly slower than my workflow of "rotate, use > gaussian filter, rotate back". 
The problem with the rotate workflow is that > I'm uncomfortable with losing data in the rotations. > > (To see one application of what I'm doing, here's a paper in Weather and > Forecasting describing what I'm doing: > http://www.patricktmarsh.com/research/pubs/refereed/marshetal2012_precip.pdf) > > I'm currently proceeding with the Cython code as it's sufficient for what > I'm doing right now. However, I was thinking of down the road. I wasn't sure > where Scipy's Gaussian filters were getting such a bigger speed up than my > Cython code. Can you perhaps post some numbers? I'm curious how they compare. I just skimmed through the SciPy code and it looks like there are some other optimizations that the gaussian filter makes. Since it does not implement the rotated gaussian, it can split it up into two successive 1-d correlations (because the 2-D gaussian is factorable, I think), which I think saves a lot of looping. There is another optimization in there that cuts the loop in half if the weights are symmetric, etc. Anyway, given the size of your weights and kernels, the FFT approach might be much, much faster (see David's comment). I think this would be a lot simpler than rotate-filter-rotate. I think I have a matlab script somewhere that does the FFT variant, if it would help. IIRC there are a few padding/shifting issues that crop up to get it to closely match what comes out of convolve. Aronne From travis at continuum.io Fri Oct 19 12:15:57 2012 From: travis at continuum.io (Travis Oliphant) Date: Fri, 19 Oct 2012 11:15:57 -0500 Subject: [SciPy-User] Announcing Anaconda 1.1 Message-ID: It would be great to get feedback from folks on this list about how useful the free version of Anaconda (Anaconda CE) is. You can download it directly from this page: http://www.continuum.io/downloads.html * Anaconda 1.1 Announcement Continuum Analytics, Inc. is pleased to announce the release of Anaconda Pro 1.1, which extends Anaconda?s programming capabilities to the desktop. Anaconda Pro now includes an IDE (Spyder ) and plotting capabilities (Matplotlib ), as well as optimized versions of Numba Pro and IOPro . With these enhancements, AnacondaPro is a complete solution for server-side computation or client-side development. It is equally well-suited for supercomputers or for training in a classroom. Available for Windows, Mac OS X, and Linux, Anaconda is the premiere Python distribution for scientific computing, engineering simulation, and business intelligence & data management. It includes the most popular numerical and scientific libraries used by scientists, engineers, and data analysts, with a single integrated and flexible installer. Continuum Analytics offers Enterprise-level support for Anaconda, covering both its open source libraries as well as the included commercial libraries from Continuum. For more information, to download a trial version of Anaconda Pro, or download the completely free Anaconda CE, click here . * * * *Best regards,* * * *-Travis* * * * * * * -------------- next part -------------- An HTML attachment was scrubbed... URL: From cpp6f at virginia.edu Fri Oct 19 13:03:21 2012 From: cpp6f at virginia.edu (Craig Plaisance) Date: Fri, 19 Oct 2012 13:03:21 -0400 Subject: [SciPy-User] installation error: undefined symbol: omp_in_parallel In-Reply-To: References: <5080D783.9010605@virginia.edu> Message-ID: <508187D9.40602@virginia.edu> Thanks for the reply Andy. I tried your suggestion and it still doesn't work. 
The problem is actually in the numpy installation itself and has nothing to do with scipy - "import numpy" gives the same error. When I run ldd on lapack_lite.so it still requires libmkl_intel_thread.so even though I compiled with lmkl_sequential. I'm used to compiling with make and don't really understand this python based compiling too well On 10/19/2012 11:07 AM, Andy Ray Terrel wrote: > Hey Craig, > > You might try using the mkl without OpenMP threading. My guess from > just looking at the error is that the compiler isn't getting passed > the right omp flags. Looking at: > > http://software.intel.com/sites/products/mkl/ > > The link line should be: > > -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -lpthread -lm > > -- Andy > > On Thu, Oct 18, 2012 at 11:30 PM, Craig Plaisance wrote: >> Hi, I'm having a problem getting scipy installed with linking to mkl 11.1. >> Here is the output and the mkl section from site.cfg. I've played quite a >> bit with the mkl_libs variable in site.cfg and googled it to death with no >> success. Any help is much appreciated! Thanks >> >> Craig >> >> >> [root at atlas scipy-0.11.0]# python setup.py config --compiler=intelem >> --fcompiler=intelem >> Traceback (most recent call last): >> File "setup.py", line 208, in >> setup_package() >> File "setup.py", line 145, in setup_package >> from numpy.distutils.core import setup >> File "/usr/local/lib/python2.7/site-packages/numpy/__init__.py", line 137, >> in >> import add_newdocs >> File "/usr/local/lib/python2.7/site-packages/numpy/add_newdocs.py", line >> 9, in >> from numpy.lib import add_newdoc >> File "/usr/local/lib/python2.7/site-packages/numpy/lib/__init__.py", line >> 13, in >> from polynomial import * >> File "/usr/local/lib/python2.7/site-packages/numpy/lib/polynomial.py", >> line 17, in >> from numpy.linalg import eigvals, lstsq >> File "/usr/local/lib/python2.7/site-packages/numpy/linalg/__init__.py", >> line 48, in >> from linalg import * >> File "/usr/local/lib/python2.7/site-packages/numpy/linalg/linalg.py", line >> 23, in >> from numpy.linalg import lapack_lite >> ImportError: >> /share/apps/intel/Compiler/11.1/046/mkl/lib/em64t/libmkl_intel_thread.so: >> undefined symbol: omp_in_parallel >> >> >> Here is the mkl section of site.cfg >> >> [mkl] >> library_dirs = /share/apps/intel/Compiler/11.1/046/mkl/lib/em64t >> include_dirs = >> /share/apps/intel/Compiler/11.1/046/mkl/include:/share/apps/intel/Compiler/11.1/046/include >> lapack_libs = mkl_lapack >> mkl_libs = mkl_def,mkl_intel_lp64,mkl_intel_thread,mkl_core,guide,iomp5 >> >> >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From patrickmarshwx at gmail.com Fri Oct 19 13:18:15 2012 From: patrickmarshwx at gmail.com (Patrick Marsh) Date: Fri, 19 Oct 2012 12:18:15 -0500 Subject: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) In-Reply-To: References: Message-ID: On Fri, Oct 19, 2012 at 9:34 AM, Da?id wrote: > If you are going to apply different filters to the same image, it may > be faster to switch to the Fourier transform. In this case, the > result is the IFT of the FT of your data multiplied by the FT of your > kernel. 
> > Doing all the FT may be expensive, but it can be useful if you are > reusing the data, and Scipy is linked to very optimized FFT libraries. Thanks to all who have taken time to respond to my initial email. I'm learning a lot here. With that said, I'm intrigued with the idea of using FFTs. I knew it was possible but had never actually looked into how to do it. As a simple experiment, I generated a simple kernel using my Cython code and took the FFT of this 2D array. I then attempted to apply it using the IFFT(FFT(kernel) * FFT(hist)) method described about. You can see the result here: http://nbviewer.ipython.org/3919393/. Obviously, I'm doing something wrong here, but I'm not sure what. Why is the result separated into the four corners and not the center of the grid? Any help in figuring this out?or pointers to references...would be appreciated. Also, I'm assuming that the kernel and the image need to be the same dimensions or the multiplication won't work? Thanks again to all for helping wrap my head around this. Patrick -------------- next part -------------- An HTML attachment was scrubbed... URL: From eric.moore2 at nih.gov Fri Oct 19 14:19:48 2012 From: eric.moore2 at nih.gov (Moore, Eric (NIH/NIDDK) [F]) Date: Fri, 19 Oct 2012 14:19:48 -0400 Subject: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) In-Reply-To: References: Message-ID: From: Patrick Marsh [mailto:patrickmarshwx at gmail.com] Sent: Friday, October 19, 2012 1:18 PM To: SciPy Users List Subject: Re: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) On Fri, Oct 19, 2012 at 9:34 AM, Da?id > wrote: If you are going to apply different filters to the same image, it may be faster to switch to the Fourier transform. In this case, the result is the IFT of the FT of your data multiplied by the FT of your kernel. Doing all the FT may be expensive, but it can be useful if you are reusing the data, and Scipy is linked to very optimized FFT libraries. Thanks to all who have taken time to respond to my initial email. I'm learning a lot here. With that said, I'm intrigued with the idea of using FFTs. I knew it was possible but had never actually looked into how to do it. As a simple experiment, I generated a simple kernel using my Cython code and took the FFT of this 2D array. I then attempted to apply it using the IFFT(FFT(kernel) * FFT(hist)) method described about. You can see the result here: http://nbviewer.ipython.org/3919393/. Obviously, I'm doing something wrong here, but I'm not sure what. Why is the result separated into the four corners and not the center of the grid? Any help in figuring this out?or pointers to references...would be appreciated. Also, I'm assuming that the kernel and the image need to be the same dimensions or the multiplication won't work? Thanks again to all for helping wrap my head around this. Patrick Have you tried the fftconvolve function in scipy.signal? I?m not sure why your plot appears to have been shifted, are you sure you never called fftshift? Also there are some details about the size used for the ffts. All of this rotation business seems unnecessary even for your current technique. You can?t use the rotation angle directly in the calculation of your Gaussian? And then just add add pointwise? This ought to be faster than the doing the full convolution for very sparse, but densely meshed histograms if your kernel is small. 
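A sketch of a kernel built that way, with the rotation angle folded directly into the 2-D Gaussian (the a, b, c coefficients follow the parameterization in the Wikipedia article linked below), and, as one way of applying it, scipy.signal.fftconvolve, which takes care of the padding and centring details that a hand-rolled FFT(kernel)*FFT(data) leaves to the user. The sigma values, angle and half-width are made up:

import numpy as np
from scipy import signal

def rotated_gaussian_kernel(sigma_x, sigma_y, theta, half_width):
    # 2-D Gaussian with its principal axes rotated by theta (radians)
    y, x = np.mgrid[-half_width:half_width + 1, -half_width:half_width + 1]
    a = np.cos(theta) ** 2 / (2 * sigma_x ** 2) + np.sin(theta) ** 2 / (2 * sigma_y ** 2)
    b = -np.sin(2 * theta) / (4 * sigma_x ** 2) + np.sin(2 * theta) / (4 * sigma_y ** 2)
    c = np.sin(theta) ** 2 / (2 * sigma_x ** 2) + np.cos(theta) ** 2 / (2 * sigma_y ** 2)
    kern = np.exp(-(a * x ** 2 + 2 * b * x * y + c * y ** 2))
    return kern / kern.sum()

kernel = rotated_gaussian_kernel(10.0, 4.0, np.deg2rad(30.0), 40)
data = (np.random.rand(600, 800) > 0.999).astype(float)   # sparse 0/1 events
smooth = signal.fftconvolve(data, kernel, mode='same')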
If you?re not sure what I mean, wikipedia?s article on the Gaussian has a nice explanation: http://en.wikipedia.org/wiki/Gaussian_function#Two-dimensional_Gaussian_function Eric -------------- next part -------------- An HTML attachment was scrubbed... URL: From aronne.merrelli at gmail.com Fri Oct 19 14:58:17 2012 From: aronne.merrelli at gmail.com (Aronne Merrelli) Date: Fri, 19 Oct 2012 13:58:17 -0500 Subject: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) In-Reply-To: References: Message-ID: On Fri, Oct 19, 2012 at 1:19 PM, Moore, Eric (NIH/NIDDK) [F] wrote: > > Have you tried the fftconvolve function in scipy.signal? I?m not sure why > your plot appears to have been shifted, are you sure you never called > fftshift? Also there are some details about the size used for the ffts. > > Nice - thanks for pointing this out, Eric, I didn't realize there was already an implementation. Patrick, try comparing these two with your data: data_conv = scipy.ndimage.convolve(data, kernel, mode='constant') data_fftconv = scipy.signal.fftconvolve(data, kernel, mode='same') They look to be equal within floating point error on my machine, with a flat kernel and gaussian noise for data. The fftconvolve becomes enormously much faster than the simple convolve as the kernel becomes a sizeable fraction of the size of the data array. You'll need to pick different modes perhaps depending on what you want for your specific case. (BTW, Your IP notebook is correct, the shifting is just one of the things you need to deal with when transforming to frequency space and back, but since there is already an implementation in scipy.signal you don't need to worry about it) Aronne From davidmenhur at gmail.com Fri Oct 19 15:29:35 2012 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Fri, 19 Oct 2012 21:29:35 +0200 Subject: [SciPy-User] Rotated, Anisotropic Gaussian Filtering (Kernel Density Estimation) In-Reply-To: References: Message-ID: On Fri, Oct 19, 2012 at 7:18 PM, Patrick Marsh wrote: > Obviously, I'm doing something wrong here, but I'm not sure what. Why is the > result separated into the four corners and not the center of the grid? As it was said before, you are not doing anything wrong, that is the expected behaviour. According to the standard specification the first element of the array is the mean value. The details are here: http://docs.scipy.org/doc/numpy/reference/routines.fft.html#module-numpy.fft Also, you could take advantage of the fact that both the kernel and the data are real, therefore the negative frequencies are trivial, and you only need the positive ones. np.fft has rfft functions for these cases, the signal package may have something useful too. Doing some research, it appears that FFTW is not included in Numpy, so you may want to take a look at pyFFTW. I don't know the implementation details of signal.fftconvolve, but you could clone it replacing their fft with this faster library: http://hgomersall.wordpress.com/2012/02/01/the-joys-of-cython-numpy-and-a-nice-fftw-api/ From cpp6f at virginia.edu Fri Oct 19 16:19:17 2012 From: cpp6f at virginia.edu (Craig Plaisance) Date: Fri, 19 Oct 2012 16:19:17 -0400 Subject: [SciPy-User] installation error: undefined symbol: omp_in_parallel In-Reply-To: References: <5080D783.9010605@virginia.edu> Message-ID: <5081B5C5.8070606@virginia.edu> Got it to work. Apparently I needed to start from a clean src directory. 
And add -iomp5 to the mkl_libraries On 10/19/2012 11:07 AM, Andy Ray Terrel wrote: > Hey Craig, > > You might try using the mkl without OpenMP threading. My guess from > just looking at the error is that the compiler isn't getting passed > the right omp flags. Looking at: > > http://software.intel.com/sites/products/mkl/ > > The link line should be: > > -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -lpthread -lm > > -- Andy > > On Thu, Oct 18, 2012 at 11:30 PM, Craig Plaisance wrote: >> Hi, I'm having a problem getting scipy installed with linking to mkl 11.1. >> Here is the output and the mkl section from site.cfg. I've played quite a >> bit with the mkl_libs variable in site.cfg and googled it to death with no >> success. Any help is much appreciated! Thanks >> >> Craig >> >> >> [root at atlas scipy-0.11.0]# python setup.py config --compiler=intelem >> --fcompiler=intelem >> Traceback (most recent call last): >> File "setup.py", line 208, in >> setup_package() >> File "setup.py", line 145, in setup_package >> from numpy.distutils.core import setup >> File "/usr/local/lib/python2.7/site-packages/numpy/__init__.py", line 137, >> in >> import add_newdocs >> File "/usr/local/lib/python2.7/site-packages/numpy/add_newdocs.py", line >> 9, in >> from numpy.lib import add_newdoc >> File "/usr/local/lib/python2.7/site-packages/numpy/lib/__init__.py", line >> 13, in >> from polynomial import * >> File "/usr/local/lib/python2.7/site-packages/numpy/lib/polynomial.py", >> line 17, in >> from numpy.linalg import eigvals, lstsq >> File "/usr/local/lib/python2.7/site-packages/numpy/linalg/__init__.py", >> line 48, in >> from linalg import * >> File "/usr/local/lib/python2.7/site-packages/numpy/linalg/linalg.py", line >> 23, in >> from numpy.linalg import lapack_lite >> ImportError: >> /share/apps/intel/Compiler/11.1/046/mkl/lib/em64t/libmkl_intel_thread.so: >> undefined symbol: omp_in_parallel >> >> >> Here is the mkl section of site.cfg >> >> [mkl] >> library_dirs = /share/apps/intel/Compiler/11.1/046/mkl/lib/em64t >> include_dirs = >> /share/apps/intel/Compiler/11.1/046/mkl/include:/share/apps/intel/Compiler/11.1/046/include >> lapack_libs = mkl_lapack >> mkl_libs = mkl_def,mkl_intel_lp64,mkl_intel_thread,mkl_core,guide,iomp5 >> >> >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From wardefar at iro.umontreal.ca Fri Oct 19 16:43:19 2012 From: wardefar at iro.umontreal.ca (David Warde-Farley) Date: Fri, 19 Oct 2012 16:43:19 -0400 Subject: [SciPy-User] numpy.histogram is slow In-Reply-To: References: <20121018092635.03d9ea1e.Jerome.Kieffer@esrf.fr> Message-ID: On Thu, Oct 18, 2012 at 4:06 PM, Chris Weisiger wrote: > On Thu, Oct 18, 2012 at 12:26 AM, Jerome Kieffer wrote: >> >> I implemented a 1D and 2D histogram, weighted and unweighted using cython (>=0.17) in parallel. >> It is much faster than the one provided by numpy: >> 4ms vs 25ms in your case on my computer >> https://github.com/kif/pyFAI/blob/master/src/histogram.pyx > > Interesting. Is there any particular reason why this code could not be > integrated into Numpy itself? A factor-of-6 improvement in speed on > multi-processor machines is significant. I don't know if we have the build infrastructure to support OpenMP robustly across platforms in NumPy yet. 
That said, it is something I'd like to see eventually. David From pav at iki.fi Sat Oct 20 17:55:01 2012 From: pav at iki.fi (Pauli Virtanen) Date: Sat, 20 Oct 2012 21:55:01 +0000 (UTC) Subject: [SciPy-User] error function with complex argument References: <507450BA.9050400@dlr.de> Message-ID: Claas H. K?hler dlr.de> writes: > I have a question regarding the error function scipy.special.erf: [clip] A new better implementation with both Re/Im parts accurate (down to some ULPs, I think) is here: https://github.com/scipy/scipy/pull/340 -- Pauli Virtanen From helmrp at yahoo.com Sun Oct 21 16:08:15 2012 From: helmrp at yahoo.com (Robaula) Date: Sun, 21 Oct 2012 13:08:15 -0700 Subject: [SciPy-User] Error function with complex argument In-Reply-To: References: Message-ID: See also SciPy ufunc wofz, where erf(z) = 1 - exp( -z**2 ) * wofz( i*z ). This coud be used to compare values. Bob H On Oct 21, 2012, at 10:00 AM, scipy-user-request at scipy.org wrote: > Send SciPy-User mailing list submissions to > scipy-user at scipy.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://mail.scipy.org/mailman/listinfo/scipy-user > or, via email, send a message with subject or body 'help' to > scipy-user-request at scipy.org > > You can reach the person managing the list at > scipy-user-owner at scipy.org > > When replying, please edit your Subject line so it is more specific > than "Re: Contents of SciPy-User digest..." > > > Today's Topics: > > 1. Re: error function with complex argument (Pauli Virtanen) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Sat, 20 Oct 2012 21:55:01 +0000 (UTC) > From: Pauli Virtanen > Subject: Re: [SciPy-User] error function with complex argument > To: scipy-user at scipy.org > Message-ID: > Content-Type: text/plain; charset=utf-8 > > Claas H. K?hler dlr.de> writes: >> I have a question regarding the error function scipy.special.erf: > [clip] > > A new better implementation with both Re/Im parts accurate > (down to some ULPs, I think) is here: > > https://github.com/scipy/scipy/pull/340 > > -- > Pauli Virtanen > > > > > ------------------------------ > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > > End of SciPy-User Digest, Vol 110, Issue 40 > ******************************************* From sturla at molden.no Mon Oct 22 07:42:03 2012 From: sturla at molden.no (Sturla Molden) Date: Mon, 22 Oct 2012 13:42:03 +0200 Subject: [SciPy-User] numpy.histogram is slow In-Reply-To: References: <20121018092635.03d9ea1e.Jerome.Kieffer@esrf.fr> Message-ID: <5085310B.2000901@molden.no> On 19.10.2012 22:43, David Warde-Farley wrote: > I don't know if we have the build infrastructure to support OpenMP > robustly across platforms in NumPy yet. That said, it is something I'd > like to see eventually. But in the meantime, Cython code can use Python threads. They are pure OS threads too. Sturla From sturla at molden.no Mon Oct 22 07:59:23 2012 From: sturla at molden.no (Sturla Molden) Date: Mon, 22 Oct 2012 13:59:23 +0200 Subject: [SciPy-User] numpy.histogram is slow In-Reply-To: <20121018092635.03d9ea1e.Jerome.Kieffer@esrf.fr> References: <20121018092635.03d9ea1e.Jerome.Kieffer@esrf.fr> Message-ID: <5085351B.3090001@molden.no> On 18.10.2012 09:26, Jerome Kieffer wrote: > I implemented a 1D and 2D histogram, weighted and unweighted using cython (>=0.17) in parallel. 
> It is much faster than the one provided by numpy: > 4ms vs 25ms in your case on my computer > https://github.com/kif/pyFAI/blob/master/src/histogram.pyx Is there a reason why you set cdivision to True in a code that has no integer division? Also: Cython prange scales badly unless you do a lot of work on each iteration. That is, each iteration of a prange loop does a barrier synchronization through an OpenMP flush. Don't use it the way you do here. A Cython prange loop is not nearly as cheap as a C loop with "#pragma omp parallel for". If you really want to use OpenMP, let your Cython code call C code. NumPy does not have a build system for OpenMP. Python threads works fine too. It takes some more coding, but if you use closures in Cython it will not be nearly as difficult as the "Java threads" coding style. Sturla From Jerome.Kieffer at esrf.fr Tue Oct 23 01:30:12 2012 From: Jerome.Kieffer at esrf.fr (Jerome Kieffer) Date: Tue, 23 Oct 2012 07:30:12 +0200 Subject: [SciPy-User] numpy.histogram is slow In-Reply-To: <5085351B.3090001@molden.no> References: <20121018092635.03d9ea1e.Jerome.Kieffer@esrf.fr> <5085351B.3090001@molden.no> Message-ID: <20121023073012.8a9a4e65.Jerome.Kieffer@esrf.fr> On Mon, 22 Oct 2012 13:59:23 +0200 Sturla Molden wrote: > On 18.10.2012 09:26, Jerome Kieffer wrote: > > > I implemented a 1D and 2D histogram, weighted and unweighted using cython (>=0.17) in parallel. > > It is much faster than the one provided by numpy: > > 4ms vs 25ms in your case on my computer > > https://github.com/kif/pyFAI/blob/master/src/histogram.pyx > > Is there a reason why you set cdivision to True in a code that has no > integer division? No... I would say this is legacy code. Basically I am (was) interested in the (weighted histogram)/(unwgeighted histogram). This part has been removed from the code. I re-implemented histogram because I needed faster execution but the implementation in Cython is not optimal, as you mentionned (large storage because there are no atomic add in cython resulting in speed up that don't scale). I also moved away from histogram as I needed more precision. > Cython prange scales badly unless you do a lot of work on each > iteration. That is, each iteration of a prange loop does a barrier > synchronization through an OpenMP flush. Don't use it the way you do > here. A Cython prange loop is not nearly as cheap as a C loop with > "#pragma omp parallel for". If you really want to use OpenMP, let your > Cython code call C code. I totally agree ... this is why I changed the algorithm to be able to implement it in OpenCL (using pyopencl). OpenCL on the CPU is much faster than cython and almost as dynamic as python when using pyopencl. Cheers, -- J?r?me Kieffer Data analysis unit - ESRF From sturla at molden.no Tue Oct 23 07:29:56 2012 From: sturla at molden.no (Sturla Molden) Date: Tue, 23 Oct 2012 13:29:56 +0200 Subject: [SciPy-User] numpy.histogram is slow In-Reply-To: <20121023073012.8a9a4e65.Jerome.Kieffer@esrf.fr> References: <20121018092635.03d9ea1e.Jerome.Kieffer@esrf.fr> <5085351B.3090001@molden.no> <20121023073012.8a9a4e65.Jerome.Kieffer@esrf.fr> Message-ID: <50867FB4.3010104@molden.no> On 23.10.2012 07:30, Jerome Kieffer wrote: > I totally agree ... this is why I changed the algorithm to be able to > implement it in OpenCL (using pyopencl). OpenCL on the CPU is much > faster than cython and almost as dynamic as python when using pyopencl. Yes, OpenCL is very cool, as are GLSL for OpenGL graphics. 
:) As OpenCL and GLSL codes are plain text, compiled at runtime, we preferably need a language that are good at text processing for using them efficiently. And what that means, is that OpenCL and GLSL make it possible for Python to beat the performance of C at number crunching and 3D computer graphics :-) If I were to design a system like NumPy today, I would seriously consider just using Python and OpenCL -- and no C. Sturla From ndbecker2 at gmail.com Tue Oct 23 13:20:12 2012 From: ndbecker2 at gmail.com (Neal Becker) Date: Tue, 23 Oct 2012 13:20:12 -0400 Subject: [SciPy-User] RLS algorithm? Message-ID: Anyone have code for RLS (recursive least squares)? I have one version, but it seems to be rather unstable. From charlesr.harris at gmail.com Tue Oct 23 13:25:02 2012 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 23 Oct 2012 11:25:02 -0600 Subject: [SciPy-User] RLS algorithm? In-Reply-To: References: Message-ID: On Tue, Oct 23, 2012 at 11:20 AM, Neal Becker wrote: > Anyone have code for RLS (recursive least squares)? I have one version, > but it > seems to be rather unstable. > > A bit more information would be helpful. What are you trying to do and how have you implemented it. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Tue Oct 23 14:23:49 2012 From: ndbecker2 at gmail.com (Neal Becker) Date: Tue, 23 Oct 2012 14:23:49 -0400 Subject: [SciPy-User] RLS algorithm? References: Message-ID: Charles R Harris wrote: > On Tue, Oct 23, 2012 at 11:20 AM, Neal Becker wrote: > >> Anyone have code for RLS (recursive least squares)? I have one version, >> but it >> seems to be rather unstable. >> >> > A bit more information would be helpful. What are you trying to do and how > have you implemented it. > > Chuck Using Haykin 2002 "Adaptive Filter Theory", pp 443 (table 9.1), I came up with this: import numpy as np from itertools import izip class rls (object): def __init__ (self, w, p, _lambda): self.w = w self._lambda = _lambda self.p = p def call1 (self, u, d): pi_n = np.dot (u.conj(), self.p) kappa = self._lambda + np.dot (pi_n, u) k_n = pi_n.conj() / kappa z = np.dot (self.w.conj(), u) e = d - z self.w += k_n * e.conj() self.p = 1/self._lambda * (self.p - np.outer (k_n, pi_n)) def __call__ (self, u, d): if hasattr (d, '__len__'): for eu, ed in izip (u, d): self.call1 (eu, ed) else: self.call1 (u, d) From ndbecker2 at gmail.com Wed Oct 24 11:25:37 2012 From: ndbecker2 at gmail.com (Neal Becker) Date: Wed, 24 Oct 2012 11:25:37 -0400 Subject: [SciPy-User] scipy.signal bilinear xfrom with prewarping? Message-ID: As explained in: http://www.mathworks.com/help/signal/ref/bilinear.html Is there any way to use scipy.signal to include the 'prewarping' parameter? From josef.pktd at gmail.com Thu Oct 25 23:10:51 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 25 Oct 2012 23:10:51 -0400 Subject: [SciPy-User] tip (maybe): scaling and optimizers Message-ID: mainly an observation: After figuring out that fmin_slsqp is scale sensitive, I switched to normalizing, rescaling loglikelihood functions in statsmodels. Loglikelihood functions are our main functions for nonlinear optimization. Today I was working by accident on an older branch of statsmodels, and the results I got with fmin_bfgs were awful. After switching to statsmodels master, the results I get with fmin_bfgs are much better (very good: robust and accurate). 
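A toy illustration of the kind of rescaling this refers to, presumably nothing more elaborate than dividing the summed objective by the number of observations so that its magnitude (and the gradient-based convergence checks) stay comparable across sample sizes; the quadratic stand-in for a loglikelihood and the data are made up:

import numpy as np
from scipy import optimize

y = np.random.standard_normal(100000) + 3.0

def negloglike_sum(params):          # badly scaled: grows with len(y)
    return 0.5 * np.sum((y - params[0]) ** 2)

def negloglike_mean(params):         # same minimizer, O(1) magnitude
    return negloglike_sum(params) / y.size

print(optimize.fmin_bfgs(negloglike_sum, [0.0]))
print(optimize.fmin_bfgs(negloglike_mean, [0.0]))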
The impression I got from this and from a discussion with Ian Langmore (on an L1 penalized optimization pull request) is that many scipy optimizers might be scale sensitive in the default settings. Watch the scale of your objective function !? (qualifier: I don't remember if other changes are in statsmodels master and not in my old branch that make optimization more robust.) Josef ------ "anecdotal evidence ain't proof" http://editthis.info/logic/Informal_Fallacies#Anecdotal_Evidence http://sayings.jacomac.de/details.php?id=10 ( http://www.unilang.org/viewtopic.php?f=11&t=38585&start=0&st=0&sk=t&sd=a ) ---- From pawel.kw at gmail.com Fri Oct 26 03:20:48 2012 From: pawel.kw at gmail.com (=?ISO-8859-2?Q?Pawe=B3_Kwa=B6niewski?=) Date: Fri, 26 Oct 2012 09:20:48 +0200 Subject: [SciPy-User] tip (maybe): scaling and optimizers In-Reply-To: References: Message-ID: Hi Josef, I also noticed that fmin_slsqp is highly scale-sensitive, I also had that impression using leastsq. Can you tell me where I can find some more information on how to deal with this? Cheers, Pawel 2012/10/26 : > mainly an observation: > > After figuring out that fmin_slsqp is scale sensitive, I switched to > normalizing, rescaling loglikelihood functions in statsmodels. > Loglikelihood functions are our main functions for nonlinear optimization. > > Today I was working by accident on an older branch of statsmodels, and > the results I got with fmin_bfgs were awful. > After switching to statsmodels master, the results I get with > fmin_bfgs are much better (very good: robust and accurate). > > The impression I got from this and from a discussion with Ian Langmore > (on an L1 penalized optimization pull request) is that many scipy > optimizers might be scale sensitive in the default settings. > > > Watch the scale of your objective function !? > > (qualifier: I don't remember if other changes are in statsmodels > master and not in my old branch that make optimization more robust.) > > Josef > ------ > "anecdotal evidence ain't proof" > http://editthis.info/logic/Informal_Fallacies#Anecdotal_Evidence > http://sayings.jacomac.de/details.php?id=10 > ( http://www.unilang.org/viewtopic.php?f=11&t=38585&start=0&st=0&sk=t&sd=a ) > ---- > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From ndbecker2 at gmail.com Fri Oct 26 10:53:06 2012 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 26 Oct 2012 10:53:06 -0400 Subject: [SciPy-User] fmin_slsqp constraint problem Message-ID: I have a ineq constraint: ## constrain poles to be inside unit circle def c(coef, len_z, len_p, dz, dp): p = compose ((coef/opt.scale)[len_z:-1], dp) return np.abs(p) - 1 So this will return a 1D array where each value should satisfy the constraint. 
fmin_slsqp will not accept this directly: e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, len_z, len_p, dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, dp), eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, dp)], full_output=True) Traceback (most recent call last): File "./optimize_pll5.3.2.py", line 519, in run_line (sys.argv) File "./optimize_pll5.3.2.py", line 498, in run_line e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, len_z, len_p, dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, dp), eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, dp)], full_output=True) File "/usr/lib64/python2.7/site-packages/scipy/optimize/slsqp.py", line 334, in fmin_slsqp a_ieq[i] = ieqcons_prime[i](x) File "/usr/lib64/python2.7/site-packages/scipy/optimize/optimize.py", line 176, in function_wrapper return function(x, *args) File "/usr/lib64/python2.7/site-packages/scipy/optimize/optimize.py", line 398, in approx_fprime grad[k] = (f(*((xk+ei,)+args)) - f0)/epsilon ValueError: setting an array element with a sequence. Any ideas on this? From guziy.sasha at gmail.com Fri Oct 26 11:04:03 2012 From: guziy.sasha at gmail.com (Oleksandr Huziy) Date: Fri, 26 Oct 2012 11:04:03 -0400 Subject: [SciPy-User] fmin_slsqp constraint problem In-Reply-To: References: Message-ID: What is your obj_fnc, I know it is naive, bu still, is it possible that it returns a list? Cheers -- Oleksandr (Sasha) Huziy 2012/10/26 Neal Becker > I have a ineq constraint: > > ## constrain poles to be inside unit circle > def c(coef, len_z, len_p, dz, dp): > p = compose ((coef/opt.scale)[len_z:-1], dp) > return np.abs(p) - 1 > > So this will return a 1D array where each value should satisfy the > constraint. > fmin_slsqp will not accept this directly: > > e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, len_z, > len_p, > dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, dp), > eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, dp)], > full_output=True) > > Traceback (most recent call last): > File "./optimize_pll5.3.2.py", line 519, in > run_line (sys.argv) > File "./optimize_pll5.3.2.py", line 498, in run_line > e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, len_z, > len_p, > dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, dp), > eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, dp)], > full_output=True) > File "/usr/lib64/python2.7/site-packages/scipy/optimize/slsqp.py", line > 334, > in fmin_slsqp > a_ieq[i] = ieqcons_prime[i](x) > File "/usr/lib64/python2.7/site-packages/scipy/optimize/optimize.py", > line > 176, in function_wrapper > return function(x, *args) > File "/usr/lib64/python2.7/site-packages/scipy/optimize/optimize.py", > line > 398, in approx_fprime > grad[k] = (f(*((xk+ei,)+args)) - f0)/epsilon > ValueError: setting an array element with a sequence. > > Any ideas on this? > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Fri Oct 26 11:08:22 2012 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 26 Oct 2012 11:08:22 -0400 Subject: [SciPy-User] fmin_slsqp constraint problem References: Message-ID: The obj_fnc is much too complicated to include here, but does return a single value. 
I think the problem is ieqcons returns an array, while fmin_slsqp expects a single value. Oleksandr Huziy wrote: > What is your obj_fnc, I know it is naive, bu still, is it possible that it > returns a list? > > Cheers > -- > Oleksandr (Sasha) Huziy > > 2012/10/26 Neal Becker > >> I have a ineq constraint: >> >> ## constrain poles to be inside unit circle >> def c(coef, len_z, len_p, dz, dp): >> p = compose ((coef/opt.scale)[len_z:-1], dp) >> return np.abs(p) - 1 >> >> So this will return a 1D array where each value should satisfy the >> constraint. >> fmin_slsqp will not accept this directly: >> >> e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, len_z, >> len_p, >> dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, dp), >> eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, dp)], >> full_output=True) >> >> Traceback (most recent call last): >> File "./optimize_pll5.3.2.py", line 519, in >> run_line (sys.argv) >> File "./optimize_pll5.3.2.py", line 498, in run_line >> e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, len_z, >> len_p, >> dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, dp), >> eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, dp)], >> full_output=True) >> File "/usr/lib64/python2.7/site-packages/scipy/optimize/slsqp.py", line >> 334, >> in fmin_slsqp >> a_ieq[i] = ieqcons_prime[i](x) >> File "/usr/lib64/python2.7/site-packages/scipy/optimize/optimize.py", >> line >> 176, in function_wrapper >> return function(x, *args) >> File "/usr/lib64/python2.7/site-packages/scipy/optimize/optimize.py", >> line >> 398, in approx_fprime >> grad[k] = (f(*((xk+ei,)+args)) - f0)/epsilon >> ValueError: setting an array element with a sequence. >> >> Any ideas on this? >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> From eric.moore2 at nih.gov Fri Oct 26 11:22:22 2012 From: eric.moore2 at nih.gov (Moore, Eric (NIH/NIDDK) [F]) Date: Fri, 26 Oct 2012 11:22:22 -0400 Subject: [SciPy-User] fmin_slsqp constraint problem In-Reply-To: References: Message-ID: > -----Original Message----- > From: Neal Becker [mailto:ndbecker2 at gmail.com] > Sent: Friday, October 26, 2012 11:08 AM > To: scipy-user at scipy.org > Subject: Re: [SciPy-User] fmin_slsqp constraint problem > > The obj_fnc is much too complicated to include here, but does return a > single > value. I think the problem is ieqcons returns an array, while > fmin_slsqp > expects a single value. > > Oleksandr Huziy wrote: > > > What is your obj_fnc, I know it is naive, bu still, is it possible > that it > > returns a list? > > > > Cheers > > -- > > Oleksandr (Sasha) Huziy > > > > 2012/10/26 Neal Becker > > > >> I have a ineq constraint: > >> > >> ## constrain poles to be inside unit circle > >> def c(coef, len_z, len_p, dz, dp): > >> p = compose ((coef/opt.scale)[len_z:-1], dp) > >> return np.abs(p) - 1 > >> > >> So this will return a 1D array where each value should satisfy the > >> constraint. 
> >> fmin_slsqp will not accept this directly: > >> > >> e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, > len_z, > >> len_p, > >> dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, > dp), > >> eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, > dp)], > >> full_output=True) > >> > >> Traceback (most recent call last): > >> File "./optimize_pll5.3.2.py", line 519, in > >> run_line (sys.argv) > >> File "./optimize_pll5.3.2.py", line 498, in run_line > >> e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, > len_z, > >> len_p, > >> dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, > dp), > >> eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, > dp)], > >> full_output=True) > >> File "/usr/lib64/python2.7/site-packages/scipy/optimize/slsqp.py", > line > >> 334, > >> in fmin_slsqp > >> a_ieq[i] = ieqcons_prime[i](x) > >> File "/usr/lib64/python2.7/site- > packages/scipy/optimize/optimize.py", > >> line > >> 176, in function_wrapper > >> return function(x, *args) > >> File "/usr/lib64/python2.7/site- > packages/scipy/optimize/optimize.py", > >> line > >> 398, in approx_fprime > >> grad[k] = (f(*((xk+ei,)+args)) - f0)/epsilon > >> ValueError: setting an array element with a sequence. > >> > >> Any ideas on this? > >> It looks like the difference between ieqcons and f_ieqcons is returning an array or a scalar. I've not used fmin_slsqp, this is based solely on the documentation. Eric
From helmrp at yahoo.com Fri Oct 26 11:51:36 2012 From: helmrp at yahoo.com (The Helmbolds) Date: Fri, 26 Oct 2012 08:51:36 -0700 (PDT) Subject: [SciPy-User] Request help with fsolve outputs Message-ID: <1351266696.96258.YahooMailNeo@web31802.mail.mud.yahoo.com> Please help me out here. I'm trying to rewrite the docstring for the `fsolve.py` routine located on my machine in: C:/users/owner/scipy/scipy/optimize/minpack.py
The specific issue I'm having difficulty with is understanding the outputs described in fsolve's docstring as:
  'fjac': the orthogonal matrix, q, produced by the QR factorization of the final approximate Jacobian matrix, stored column wise.
  'r': upper triangular matrix produced by QR factorization of same matrix.
These are described in SciPy's minpack/hybrd.f file as:
  'fjac' is an output n by n array which contains the orthogonal matrix q produced by the qr factorization of the final approximate jacobian.
  'r' is an output array of length lr which contains the upper triangular matrix produced by the qr factorization of the final approximate jacobian, stored rowwise.
For ease in writing, in what follows let's use the symbols 'Jend' for the final approximate Jacobian matrix, and use 'Q' and 'R' for its QR decomposition matrices.
Now consider the problem of finding the solution to the following three nonlinear equations (which we will refer to as 'E'), in three unknowns (u, v, w):
    2 * a * u + b * v + d - w * v = 0
    b * u + 2 * c * v + e - w * u = 0
    -u * v + f = 0
where (a, b, c, d, e, f) = (2, 3, 7, 8, 9, 2). For inputs to fsolve, we identify (u, v, w) = (x[0], x[1], x[2]).
Now fsolve gives the solution array:
    [uend vend wend] = [ 1.79838825  1.11210691  16.66195357].
With these values, the above three equations E are satisfied to an accuracy of about 9 significant figures.
The Jacobian matrix for the three LHS functions in E is:
    J = np.matrix([[2*a, b-w, -v], [b-w, 2*c, -u], [-v, -u, 0.]])
Note that it's symmetrical, and if we compute its value using the above fsolve's 'end' solution values we get:
    Jend = [[  4.          19.66195357   1.11210691],
            [ 19.66195357  14.           1.79838825],
            [  1.11210691   1.79838825   0.        ]]
Using SciPy's linalg package, this Jend has the QR decomposition:
    Qend = [[-0.28013447 -0.91516674 -0.28981807]
            [ 0.95679602 -0.24168763 -0.16164302]
            [ 0.07788487 -0.32257856  0.94333293]]
    Rend = [[-14.278857    17.08226116  -1.40915124]
            [ -0.           9.69946027   1.45241144]
            [ -0.           0.           0.61300558]]
and Qend * Rend = Jend to within about 15 significant figures.
However, fsolve gives the QR decomposition:
    qretm = [[-0.64093238  0.75748326  0.1241966 ]
             [-0.62403598 -0.60841098  0.4903215 ]
             [-0.44697291 -0.23675978 -0.8626471 ]]
    rret = [ -7.77806716  30.02199802  -0.819055   -10.74878184   2.00090268   1.02706198]
and converting rret to an upper triangular NumPy matrix gives:
    rretm = [[ -7.77806716  30.02199802  -0.819055  ]
             [  0.         -10.74878184   2.00090268]
             [  0.           0.           1.02706198]]
Now qretm and rretm bear no obvious relation to Qend and Rend. Although qretm is orthogonal to about 16 significant figures, we find the product:
    qretm * rretm = [[  4.98521509 -27.38409295   2.16816676]
                     [  4.85379376 -12.19513008  -0.2026608 ]
                     [  3.47658529 -10.87414051  -0.99362993]]
which bears no obvious relationship to Jend.
The hybrd.f routine in minpack refers to a permutation matrix, p, such that we should have in our notation:
    p*Jend = qretm*rretm,
but fsolve apparently does not return the matrix p, and I don't see any permutation of Jend that would equal qretm*rretm. The hybrd.f routine does refer to some "scaling" that is going on, but my Fortran is about 40 years too stale for me to interpret it.
If we reinterpret rret as meaning the matrix:
    rretaltm = [[ -7.77806716  30.02199802 -10.74878184]
                [  0.          -0.819055     2.00090268]
                [  0.           0.           1.02706198]]
then we get the product:
    qretm * rretaltm = [[  4.98521509 -19.86249109   8.53245022]
                        [  4.85379376 -18.2364849    5.99384603]
                        [  3.47658529 -13.22510045   3.44468895]]
which again bears no obvious relationship to Jend. Using the transpose of qretm in the above product is no help.
So please help me out here. What are the fjac and r values that fsolve returns? How are they related to the above Qend, Rend, and Jend? How is the user supposed to use them?
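For reference, a minimal script of the setup above (the function name and the starting guess are my own choices; a different start may of course walk to a different root):

import numpy as np
from scipy.optimize import fsolve

a, b, c, d, e, f = 2.0, 3.0, 7.0, 8.0, 9.0, 2.0

def eqs(x):
    u, v, w = x
    return [2*a*u + b*v + d - w*v,
            b*u + 2*c*v + e - w*u,
            -u*v + f]

x, info, ier, msg = fsolve(eqs, [2.0, 1.0, 15.0], full_output=True)
print(x)             # expect roughly [ 1.79838825  1.11210691  16.66195357]
print(info['fjac'])  # the 'q' factor asked about above
print(info['r'])     # the packed upper triangular factor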
Bob H From guziy.sasha at gmail.com Fri Oct 26 11:51:47 2012 From: guziy.sasha at gmail.com (Oleksandr Huziy) Date: Fri, 26 Oct 2012 11:51:47 -0400 Subject: [SciPy-User] fmin_slsqp constraint problem In-Reply-To: References: Message-ID: You can modify your constraint ## constrain poles to be inside unit circle def c(coef, len_z, len_p, dz, dp): p = compose ((coef/opt.scale)[len_z:-1], dp) return np.min( np.abs(p) - 1) Cheers -- Oleksandr (Sasha) Huziy 2012/10/26 Moore, Eric (NIH/NIDDK) [F] > > -----Original Message----- > > From: Neal Becker [mailto:ndbecker2 at gmail.com] > > Sent: Friday, October 26, 2012 11:08 AM > > To: scipy-user at scipy.org > > Subject: Re: [SciPy-User] fmin_slsqp constraint problem > > > > The obj_fnc is much too complicated to include here, but does return a > > single > > value. I think the problem is ieqcons returns an array, while > > fmin_slsqp > > expects a single value. > > > > Oleksandr Huziy wrote: > > > > > What is your obj_fnc, I know it is naive, bu still, is it possible > > that it > > > returns a list? > > > > > > Cheers > > > -- > > > Oleksandr (Sasha) Huziy > > > > > > 2012/10/26 Neal Becker > > > > > >> I have a ineq constraint: > > >> > > >> ## constrain poles to be inside unit circle > > >> def c(coef, len_z, len_p, dz, dp): > > >> p = compose ((coef/opt.scale)[len_z:-1], dp) > > >> return np.abs(p) - 1 > > >> > > >> So this will return a 1D array where each value should satisfy the > > >> constraint. > > >> fmin_slsqp will not accept this directly: > > >> > > >> e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, > > len_z, > > >> len_p, > > >> dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, > > dp), > > >> eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, > > dp)], > > >> full_output=True) > > >> > > >> Traceback (most recent call last): > > >> File "./optimize_pll5.3.2.py", line 519, in > > >> run_line (sys.argv) > > >> File "./optimize_pll5.3.2.py", line 498, in run_line > > >> e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, > > len_z, > > >> len_p, > > >> dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, > > dp), > > >> eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, > > dp)], > > >> full_output=True) > > >> File "/usr/lib64/python2.7/site-packages/scipy/optimize/slsqp.py", > > line > > >> 334, > > >> in fmin_slsqp > > >> a_ieq[i] = ieqcons_prime[i](x) > > >> File "/usr/lib64/python2.7/site- > > packages/scipy/optimize/optimize.py", > > >> line > > >> 176, in function_wrapper > > >> return function(x, *args) > > >> File "/usr/lib64/python2.7/site- > > packages/scipy/optimize/optimize.py", > > >> line > > >> 398, in approx_fprime > > >> grad[k] = (f(*((xk+ei,)+args)) - f0)/epsilon > > >> ValueError: setting an array element with a sequence. > > >> > > >> Any ideas on this? > > >> > > It looks like the difference between ieqcons and f_ieqcons is returning an > array or an scalar. I've not used fmin_slsqp, this is based solely on the > documentation. > > Eric > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ndbecker2 at gmail.com Fri Oct 26 11:57:37 2012 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 26 Oct 2012 11:57:37 -0400 Subject: [SciPy-User] fmin_slsqp constraint problem References: Message-ID: Do you think applying np.min here will produce the same sort of convergence behaviour? Oleksandr Huziy wrote: > You can modify your constraint > > > ## constrain poles to be inside unit circle > def c(coef, len_z, len_p, dz, dp): > p = compose ((coef/opt.scale)[len_z:-1], dp) > return np.min( np.abs(p) - 1) > > Cheers > -- > Oleksandr (Sasha) Huziy > > 2012/10/26 Moore, Eric (NIH/NIDDK) [F] > >> > -----Original Message----- >> > From: Neal Becker [mailto:ndbecker2 at gmail.com] >> > Sent: Friday, October 26, 2012 11:08 AM >> > To: scipy-user at scipy.org >> > Subject: Re: [SciPy-User] fmin_slsqp constraint problem >> > >> > The obj_fnc is much too complicated to include here, but does return a >> > single >> > value. I think the problem is ieqcons returns an array, while >> > fmin_slsqp >> > expects a single value. >> > >> > Oleksandr Huziy wrote: >> > >> > > What is your obj_fnc, I know it is naive, bu still, is it possible >> > that it >> > > returns a list? >> > > >> > > Cheers >> > > -- >> > > Oleksandr (Sasha) Huziy >> > > >> > > 2012/10/26 Neal Becker >> > > >> > >> I have a ineq constraint: >> > >> >> > >> ## constrain poles to be inside unit circle >> > >> def c(coef, len_z, len_p, dz, dp): >> > >> p = compose ((coef/opt.scale)[len_z:-1], dp) >> > >> return np.abs(p) - 1 >> > >> >> > >> So this will return a 1D array where each value should satisfy the >> > >> constraint. >> > >> fmin_slsqp will not accept this directly: >> > >> >> > >> e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, >> > len_z, >> > >> len_p, >> > >> dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, >> > dp), >> > >> eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, >> > dp)], >> > >> full_output=True) >> > >> >> > >> Traceback (most recent call last): >> > >> File "./optimize_pll5.3.2.py", line 519, in >> > >> run_line (sys.argv) >> > >> File "./optimize_pll5.3.2.py", line 498, in run_line >> > >> e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, >> > len_z, >> > >> len_p, >> > >> dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), dz, >> > dp), >> > >> eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, dz, >> > dp)], >> > >> full_output=True) >> > >> File "/usr/lib64/python2.7/site-packages/scipy/optimize/slsqp.py", >> > line >> > >> 334, >> > >> in fmin_slsqp >> > >> a_ieq[i] = ieqcons_prime[i](x) >> > >> File "/usr/lib64/python2.7/site- >> > packages/scipy/optimize/optimize.py", >> > >> line >> > >> 176, in function_wrapper >> > >> return function(x, *args) >> > >> File "/usr/lib64/python2.7/site- >> > packages/scipy/optimize/optimize.py", >> > >> line >> > >> 398, in approx_fprime >> > >> grad[k] = (f(*((xk+ei,)+args)) - f0)/epsilon >> > >> ValueError: setting an array element with a sequence. >> > >> >> > >> Any ideas on this? >> > >> >> >> It looks like the difference between ieqcons and f_ieqcons is returning an >> array or an scalar. I've not used fmin_slsqp, this is based solely on the >> documentation. 
>> >> Eric >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> From guziy.sasha at gmail.com Fri Oct 26 12:18:33 2012 From: guziy.sasha at gmail.com (Oleksandr Huziy) Date: Fri, 26 Oct 2012 12:18:33 -0400 Subject: [SciPy-User] fmin_slsqp constraint problem In-Reply-To: References: Message-ID: I am not an expert in this function but the constraints are equivalent, so I would expect it. Actually it should be the same if the implementation is consistent, since we are not changing constraints, but expressing them in a different way. But you could test it. By comparing this method and by creating n constraints which act on each component of dp. smth like this ieqcons=[lambda coef, len_z, len_p, dz, dp: -c(coef, len_z, len_p, dz, dp, comp_num) for comp_num in range(len_p)] and modify c respectively ## constrain poles to be inside unit circle def c(coef, len_z, len_p, dz, dp, comp_num): p = compose ((coef/opt.scale)[len_z:-1], dp) return (np.abs(p) - 1)[comp_num] Cheers -- Oleksandr (Sasha) Huziy 2012/10/26 Neal Becker > Do you think applying np.min here will produce the same sort of convergence > behaviour? > > Oleksandr Huziy wrote: > > > You can modify your constraint > > > > > > ## constrain poles to be inside unit circle > > def c(coef, len_z, len_p, dz, dp): > > p = compose ((coef/opt.scale)[len_z:-1], dp) > > return np.min( np.abs(p) - 1) > > > > Cheers > > -- > > Oleksandr (Sasha) Huziy > > > > 2012/10/26 Moore, Eric (NIH/NIDDK) [F] > > > >> > -----Original Message----- > >> > From: Neal Becker [mailto:ndbecker2 at gmail.com] > >> > Sent: Friday, October 26, 2012 11:08 AM > >> > To: scipy-user at scipy.org > >> > Subject: Re: [SciPy-User] fmin_slsqp constraint problem > >> > > >> > The obj_fnc is much too complicated to include here, but does return a > >> > single > >> > value. I think the problem is ieqcons returns an array, while > >> > fmin_slsqp > >> > expects a single value. > >> > > >> > Oleksandr Huziy wrote: > >> > > >> > > What is your obj_fnc, I know it is naive, bu still, is it possible > >> > that it > >> > > returns a list? > >> > > > >> > > Cheers > >> > > -- > >> > > Oleksandr (Sasha) Huziy > >> > > > >> > > 2012/10/26 Neal Becker > >> > > > >> > >> I have a ineq constraint: > >> > >> > >> > >> ## constrain poles to be inside unit circle > >> > >> def c(coef, len_z, len_p, dz, dp): > >> > >> p = compose ((coef/opt.scale)[len_z:-1], dp) > >> > >> return np.abs(p) - 1 > >> > >> > >> > >> So this will return a 1D array where each value should satisfy the > >> > >> constraint. 
> >> > >> fmin_slsqp will not accept this directly: > >> > >> > >> > >> e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, > >> > len_z, > >> > >> len_p, > >> > >> dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), > dz, > >> > dp), > >> > >> eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, > dz, > >> > dp)], > >> > >> full_output=True) > >> > >> > >> > >> Traceback (most recent call last): > >> > >> File "./optimize_pll5.3.2.py", line 519, in > >> > >> run_line (sys.argv) > >> > >> File "./optimize_pll5.3.2.py", line 498, in run_line > >> > >> e = fmin_slsqp (obj_fnc, coef*opt.scale, ieqcons=[lambda coef, > >> > len_z, > >> > >> len_p, > >> > >> dz, dp: -c(coef, len_z, len_p, dz, dp)], args=(len(lz), len(lp), > dz, > >> > dp), > >> > >> eqcons=[lambda coef, len_z, len_p, dz, dp: h(coef, len_z, len_p, > dz, > >> > dp)], > >> > >> full_output=True) > >> > >> File > "/usr/lib64/python2.7/site-packages/scipy/optimize/slsqp.py", > >> > line > >> > >> 334, > >> > >> in fmin_slsqp > >> > >> a_ieq[i] = ieqcons_prime[i](x) > >> > >> File "/usr/lib64/python2.7/site- > >> > packages/scipy/optimize/optimize.py", > >> > >> line > >> > >> 176, in function_wrapper > >> > >> return function(x, *args) > >> > >> File "/usr/lib64/python2.7/site- > >> > packages/scipy/optimize/optimize.py", > >> > >> line > >> > >> 398, in approx_fprime > >> > >> grad[k] = (f(*((xk+ei,)+args)) - f0)/epsilon > >> > >> ValueError: setting an array element with a sequence. > >> > >> > >> > >> Any ideas on this? > >> > >> > >> > >> It looks like the difference between ieqcons and f_ieqcons is returning > an > >> array or an scalar. I've not used fmin_slsqp, this is based solely on > the > >> documentation. > >> > >> Eric > >> _______________________________________________ > >> SciPy-User mailing list > >> SciPy-User at scipy.org > >> http://mail.scipy.org/mailman/listinfo/scipy-user > >> > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Fri Oct 26 13:18:19 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 26 Oct 2012 13:18:19 -0400 Subject: [SciPy-User] tip (maybe): scaling and optimizers In-Reply-To: References: Message-ID: On Fri, Oct 26, 2012 at 3:20 AM, Pawe? Kwa?niewski wrote: > Hi Josef, > > I also noticed that fmin_slsqp is highly scale-sensitive, I also had > that impression using leastsq. Can you tell me where I can find some > more information on how to deal with this? For fmin_slsqp there is only the mailing list thread and my adjustments in a pull request for statsmodels. I don't have any other information. http://mail.scipy.org/pipermail/scipy-user/2012-September/033257.html What I did was to replace sum of log likelihood terms by the mean, that is divide objective function (and gradient and hessian) by number of terms. discussion at https://github.com/langmore/statsmodels/pull/5 Since then fmin_slsqp seems to work pretty well. I never ran into serious problems with leastsq, but there might be a problem with the numerical derivatives (finite difference) which in my impression are not always very good. Cheers, Josef > > Cheers, > > Pawel > > > 2012/10/26 : >> mainly an observation: >> >> After figuring out that fmin_slsqp is scale sensitive, I switched to >> normalizing, rescaling loglikelihood functions in statsmodels. 
>> Loglikelihood functions are our main functions for nonlinear optimization. >> >> Today I was working by accident on an older branch of statsmodels, and >> the results I got with fmin_bfgs were awful. >> After switching to statsmodels master, the results I get with >> fmin_bfgs are much better (very good: robust and accurate). >> >> The impression I got from this and from a discussion with Ian Langmore >> (on an L1 penalized optimization pull request) is that many scipy >> optimizers might be scale sensitive in the default settings. >> >> >> Watch the scale of your objective function !? >> >> (qualifier: I don't remember if other changes are in statsmodels >> master and not in my old branch that make optimization more robust.) >> >> Josef >> ------ >> "anecdotal evidence ain't proof" >> http://editthis.info/logic/Informal_Fallacies#Anecdotal_Evidence >> http://sayings.jacomac.de/details.php?id=10 >> ( http://www.unilang.org/viewtopic.php?f=11&t=38585&start=0&st=0&sk=t&sd=a ) >> ---- >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From vs at it.uu.se Fri Oct 26 13:57:00 2012 From: vs at it.uu.se (Virgil Stokes) Date: Fri, 26 Oct 2012 19:57:00 +0200 Subject: [SciPy-User] Warnings --- why do they occur and how can I stop them? Message-ID: <508ACEEC.40001@it.uu.se> I have the following installed: NumPy 1.6.1 SciPy 0.11.0 on a Windows Vista (32-bit) platform with Python 2.7 I get the following warnings: D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: RuntimeWarning: numpy.dtype size changed, may indicate binary incompatibility from mio_utils import squeeze_element, chars_to_strings D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: RuntimeWarning: numpy.ufunc size changed, may indicate binary incompatibility from mio_utils import squeeze_element, chars_to_strings D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: RuntimeWarning: numpy.dtype size changed, may indicate binary incompatibility from mio5_utils import VarReader5 D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: RuntimeWarning: numpy.ufunc size changed, may indicate binary incompatibility from mio5_utils import VarReader5 When the following statement is executed from scipy import io Why does this occur and what can be done to fix this problem? From tanmoylaskar at gmail.com Fri Oct 26 16:09:40 2012 From: tanmoylaskar at gmail.com (Tanmoy Laskar) Date: Fri, 26 Oct 2012 16:09:40 -0400 Subject: [SciPy-User] Ubuntu / Scipy / Lapack / BLAS issues In-Reply-To: References: Message-ID: Hi scipy-users! I'm on Ubuntu 12.10 (Quantal), and it appears that my recent upgrade to 12.10 has broken my scipy 0.10.0 installation. A traceback is below. The choking point appears to be lapack in some way. I originally installed scipy through my package manager (sudo apt-get install python-scipy). I would greatly appreciate any help. Please let me know if there is any additional info I could provide to help diagnose the problem. 
Thanks in advance, Tanmoy In [1]: import numpy In [2]: import scipy In [3]: import scipy.linalg --------------------------------------------------------------------------- ImportError Traceback (most recent call last) /home/tanmoy/Projects/Edo/Reverse_Shocks/GRB/120521C/ in () ----> 1 import scipy.linalg /usr/local/lib/python2.7/dist-packages/scipy/linalg/__init__.py in () 114 115 from misc import * --> 116 from basic import * 117 from decomp import * 118 from decomp_lu import * /usr/local/lib/python2.7/dist-packages/scipy/linalg/basic.py in () 10 11 from flinalg import get_flinalg_funcs ---> 12 from lapack import get_lapack_funcs 13 from misc import LinAlgError, _datacopied 14 from scipy.linalg import calc_lwork /usr/local/lib/python2.7/dist-packages/scipy/linalg/lapack.py in () 13 14 from scipy.linalg import flapack ---> 15 from scipy.linalg import clapack 16 _use_force_clapack = 1 17 if hasattr(clapack,'empty_module'): ImportError: /usr/local/lib/python2.7/dist-packages/scipy/linalg/clapack.so: undefined symbol: clapack_sgesv -------------- next part -------------- An HTML attachment was scrubbed... URL: From wardefar at iro.umontreal.ca Fri Oct 26 16:15:43 2012 From: wardefar at iro.umontreal.ca (David Warde-Farley) Date: Fri, 26 Oct 2012 16:15:43 -0400 Subject: [SciPy-User] Warnings --- why do they occur and how can I stop them? In-Reply-To: <508ACEEC.40001@it.uu.se> References: <508ACEEC.40001@it.uu.se> Message-ID: It sounds as if you've installed a binary-incompatible version of SciPy for the version of NumPy that you have. SciPy's version requirements are pretty loose but since SciPy if you're installing binaries, you need to be sure that the SciPy binary you get was compiled against the same version of NumPy that you get (or at least one with the same ABI version, to get technical). Deleting whatever you currently have and downloading one of the "superpack" installers from here http://sourceforge.net/projects/scipy/files/scipy/0.11.0/ should fix you up. On Fri, Oct 26, 2012 at 1:57 PM, Virgil Stokes wrote: > I have the following installed: > > NumPy 1.6.1 > SciPy 0.11.0 > > on a Windows Vista (32-bit) platform with Python 2.7 > > I get the following warnings: > > D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: RuntimeWarning: > numpy.dtype size changed, may indicate binary incompatibility > from mio_utils import squeeze_element, chars_to_strings > D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: RuntimeWarning: > numpy.ufunc size changed, may indicate binary incompatibility > from mio_utils import squeeze_element, chars_to_strings > D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: RuntimeWarning: > numpy.dtype size changed, may indicate binary incompatibility > from mio5_utils import VarReader5 > D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: RuntimeWarning: > numpy.ufunc size changed, may indicate binary incompatibility > from mio5_utils import VarReader5 > > When the following statement is executed > > from scipy import io > > Why does this occur and what can be done to fix this problem? 
> _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From eric.moore2 at nih.gov Fri Oct 26 16:53:47 2012 From: eric.moore2 at nih.gov (Moore, Eric (NIH/NIDDK) [F]) Date: Fri, 26 Oct 2012 16:53:47 -0400 Subject: [SciPy-User] Request help with fsolve outputs In-Reply-To: <1351266696.96258.YahooMailNeo@web31802.mail.mud.yahoo.com> References: <1351266696.96258.YahooMailNeo@web31802.mail.mud.yahoo.com> Message-ID: > -----Original Message----- > From: The Helmbolds [mailto:helmrp at yahoo.com] > Sent: Friday, October 26, 2012 11:52 AM > To: User SciPy > Subject: [SciPy-User] Request help with fsolve outputs > > Please help me out here. I?m trying to rewrite the docstring for the > `fsolve.py` routine > located on my machine in: > C:/users/owner/scipy/scipy/optimize/minpack.py > The specific issue I?m having difficulty with is understanding the > outputs described in fsolve?s docstring as: > > ?? 'fjac': the orthogonal matrix, q, produced by the QR factorization > of the final approximate Jacobian matrix, stored column wise. > ?? 'r': upper triangular matrix produced by QR factorization of same > matrix. > These are described in SciPy?s minpack/hybrd.f file as: > ?? ?fjac? is an output n by n array which contains the orthogonal > matrix q produced by the qr factorization of the final approximate > jacobian. > ?? ?r? is an output array of length lr which contains the upper > triangular matrix produced by the qr factorization of the final > approximate jacobian, stored rowwise. > > For ease in writing, in what follows let?s use the symbols ?Jend? for > the final approximate Jacobian matrix, and use ?Q? and ?R? for its QR > decomposition matrices. > > Now consider the problem of finding the solution to the following three > nonlinear > equations (which we will refer to as 'E'), in three unknowns (u, v, w): > ??? 2 * a * u + b * v + d - w * v = 0 > ?? ?b * u + 2 * c * v + e - w * u = 0 > ??? -u * v + f = 0 > where (a, b, c, d, e, f ) = (2, 3, 7, 8, 9, 2). For inputs to fsolve, > we identify > (u, v, w) = (x[0], x[1], x[2]). > > Now fsolve gives the solution array: > ??[uend vend wend] = [? 1.79838825?? 1.11210691? 16.66195357]. > With these values, the above three equations E are satisfied to an > accuracy of about 9 significant figures. > > The Jacobian matrix for the three LHS functions in E is: > ??J = np.matrix([[2*a, b-w, -v], [b-w, 2*c, -u], [-v, -u, 0.]]) > Note that it?s symmetrical, and if we compute its value using the above > fsolve?s ?end? solution values we get: > ?Jend = [[? 4.????????? 19.66195357?? 1.11210691], > ??????????? [ 19.66195357? 14.?????????? 1.79838825], > ??????????? [? 1.11210691?? 1.79838825?? 0.??????? ]] > Using SciPy?s linalg package, this Jend has the QR decomposition: > ?Qend =? [[-0.28013447 -0.91516674 -0.28981807] > ??????????? [ 0.95679602 -0.24168763 -0.16164302] > ??????????? [ 0.07788487 -0.32257856? 0.94333293]] > ?Rend =? [[-14.278857??? 17.08226116? -1.40915124] > ??????????? [ -0.?????????? 9.69946027?? 1.45241144] > ??????????? [ -0.?????????? 0.?????????? 0.61300558]] > and Qend * Rend = Jend to within about 15 significant figures. > > However, fsolve gives the QR decomposition: > ?qretm =? [[-0.64093238? 0.75748326? 0.1241966 ] > ??????????? [-0.62403598 -0.60841098? 0.4903215 ] > ??????????? [-0.44697291 -0.23675978 -0.8626471 ]] > ?? ?rret =? [ -7.77806716? 30.02199802? -0.819055?? -10.74878184 > 2.00090268? 
1.02706198] > and converting rret to an upper triangular NumPy matrix gives: > ?? ?rretm =? [[ -7.77806716? 30.02199802? -0.819055? ] > ??????????? [? 0.???????? -10.74878184?? 2.00090268] > ??????????? [? 0.?????????? 0.?????????? 1.02706198]] > Now qret and rretm bear no obvious relation to Qend and Rend. > Although qretm is orthogonal to about 16 significant figures, we find > the product: > ?qretm * rretm =? [[? 4.98521509 -27.38409295?? 2.16816676] > ??????????? [? 4.85379376 -12.19513008? -0.2026608 ] > ??????????? [? 3.47658529 -10.87414051? -0.99362993]] > which bears no obvious relationship to Jend. > > > The hybrd.f routine in minpack refers to a permutation matrix, p, such > that we should have in our notation: > ??p*Jend = qretm*rretm, > but fsolve apparently does not return the matrix p, and I don?t see any > permutation of Jend that would equal qretm*rretm. > > The hybrd.f routine does refer to some "scaling" that is going on, but > my Fortran is about 40 years too stale for me to interpret it. > > If we reinterpret rret as meaning the matrix: > ?rretaltm =? [[ -7.77806716? 30.02199802 -10.74878184] > ??????????? [? 0.????????? -0.819055???? 2.00090268] > ??????????? [? 0.?????????? 0.?????????? 1.02706198]] > then we get the product: > ?qretm * rretaltm =? [[? 4.98521509 -19.86249109?? 8.53245022] > ??????????? [? 4.85379376 -18.2364849??? 5.99384603] > ??????????? [? 3.47658529 -13.22510045?? 3.44468895]] > which again bears no obvious relationship to Jend. Using the transpose > of qretm in the above product is no help. > > So please help me out here. What are the fjac and r values that fsolve > returns? > How are they related to the above Qend, Rend, and Jend? > How is the user supposed to use them? > > > Bob H > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user I haven't spent any time playing with your example, but I have looked at a simpler example. It seems that the approximated jacobian can be quite far off and fsolve can still return correct zeros. How good the approximation ends up being depends on your choice of initial conditions. I've attached a trivial example that shows this. There is also the epsfcn parameter, but based on the minpack documentation, the default should be okay for both our examples since we have full machine precision available in our functions to minimize. It would make things much easier for other people if you could post a python file where you have defined your function and are calling fsolve. I believe Ralph asked if you could do this the last time you posted this question. I'm sorry that no one has come in with a simple answer, but this would still be a good step to make considering your question a little easier for other people. Good luck, Eric -------------- next part -------------- A non-text attachment was scrubbed... Name: fsolve_test.py Type: application/octet-stream Size: 583 bytes Desc: fsolve_test.py URL: From takowl at gmail.com Fri Oct 26 17:12:10 2012 From: takowl at gmail.com (Thomas Kluyver) Date: Fri, 26 Oct 2012 22:12:10 +0100 Subject: [SciPy-User] Ubuntu / Scipy / Lapack / BLAS issues In-Reply-To: References: Message-ID: On 26 October 2012 21:09, Tanmoy Laskar wrote: > I originally installed scipy through my package manager (sudo apt-get > install python-scipy). It looks like you've installed it another way at some point - the traceback shows files in /usr/local/lib, while apt installs into /usr/lib. 
I'd try deleting scipy from /usr/local, which should let it find the packaged version. Thomas From tanmoylaskar at gmail.com Fri Oct 26 19:47:48 2012 From: tanmoylaskar at gmail.com (Tanmoy Laskar) Date: Fri, 26 Oct 2012 19:47:48 -0400 Subject: [SciPy-User] Ubuntu / Scipy / Lapack / BLAS issues In-Reply-To: References: Message-ID: Worked like a charm. Thanks, Thomas! I had "sudo pip install"-ed scipy at some point, with the result that there was an old version in /usr/local/lib that was being used, which broke with the distro upgrade. Cheers, Tanmoy On Fri, Oct 26, 2012 at 5:12 PM, Thomas Kluyver wrote: > On 26 October 2012 21:09, Tanmoy Laskar wrote: > > I originally installed scipy through my package manager (sudo apt-get > > install python-scipy). > > It looks like you've installed it another way at some point - the > traceback shows files in /usr/local/lib, while apt installs into > /usr/lib. I'd try deleting scipy from /usr/local, which should let it > find the packaged version. > > Thomas -------------- next part -------------- An HTML attachment was scrubbed... URL: From Wolfgang.Mader at fdm.uni-freiburg.de Fri Oct 26 20:54:06 2012 From: Wolfgang.Mader at fdm.uni-freiburg.de (FDM) Date: Sat, 27 Oct 2012 02:54:06 +0200 Subject: [SciPy-User] Share memory between python an C++ Message-ID: <1351299246.28824.6.camel@Nokia-N900-51-1> Hello list, I have a couple of functions in the form of shared C++ libraries, and want to use them from within python. Some of them involve big chunks of data which could be represented easily using numpy data types. Therefore, I am searching for a way to call the C++ function, pass a reference or pointer as argument, pointing to memory I have allocated in python, such that I can use the result of the function w/o copying. It should be possible to hide technicalities from a python user. I would apprechiate any hint. Best, Wolfgang From johnl at cs.wisc.edu Fri Oct 26 21:09:00 2012 From: johnl at cs.wisc.edu (J. David Lee) Date: Fri, 26 Oct 2012 20:09:00 -0500 Subject: [SciPy-User] Share memory between python an C++ In-Reply-To: <1351299246.28824.6.camel@Nokia-N900-51-1> References: <1351299246.28824.6.camel@Nokia-N900-51-1> Message-ID: <508B342C.60803@cs.wisc.edu> Hi Wolfgang, I'm fairly sure that if you pass a numpy array as an object into a C module, you can access the data pointer directly. You should check that the C_CONTIGUOUS flag is set for the array and make sure the type is correct before you pass the data on, but as far as I know, that's all you have to do. You will probably want to look at the PyArray_DATA and PyArray_BYTES macros in the numpy API. David On 10/26/2012 07:54 PM, FDM wrote: > Hello list, > > I have a couple of functions in the form of shared C++ libraries, and want to use them from within python. Some of them involve big chunks of data which could be represented easily using numpy data types. Therefore, I am searching for a way to call the C++ function, pass a reference or pointer as argument, pointing to memory I have allocated in python, such that I can use the result of the function w/o copying. It should be possible to hide technicalities from a python user. I would apprechiate any hint. 
> > Best, Wolfgang > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From josef.pktd at gmail.com Fri Oct 26 21:40:10 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 26 Oct 2012 21:40:10 -0400 Subject: [SciPy-User] Orthogonal polynomials on the unit circle Message-ID: http://en.wikipedia.org/wiki/Orthogonal_polynomials_on_the_unit_circle with link to handbook application: goodness of fit for circular data http://onlinelibrary.wiley.com/doi/10.1111/j.1467-842X.2009.00558.x/abstract Are those available anywhere in python land? What's the difference between orthogonal polynomials on the unit circle and periodic polynomials like Fourier series? Josef circular statistics - what's that? It's like TDD, you go in circles From wardefar at iro.umontreal.ca Sat Oct 27 03:19:43 2012 From: wardefar at iro.umontreal.ca (David Warde-Farley) Date: Sat, 27 Oct 2012 03:19:43 -0400 Subject: [SciPy-User] Orthogonal polynomials on the unit circle In-Reply-To: References: Message-ID: On Fri, Oct 26, 2012 at 9:40 PM, wrote: > http://en.wikipedia.org/wiki/Orthogonal_polynomials_on_the_unit_circle > with link to handbook > > application: goodness of fit for circular data > http://onlinelibrary.wiley.com/doi/10.1111/j.1467-842X.2009.00558.x/abstract > > Are those available anywhere in python land? > > What's the difference between orthogonal polynomials on the unit > circle and periodic polynomials like Fourier series? > > Josef > circular statistics - what's that? > It's like TDD, you go in circles I have some code somewhere for Zernike polynomials if you're interested. I was using them for rotation-invariant feature extraction. From ralf.gommers at gmail.com Sat Oct 27 03:44:57 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 27 Oct 2012 09:44:57 +0200 Subject: [SciPy-User] Warnings --- why do they occur and how can I stop them? In-Reply-To: References: <508ACEEC.40001@it.uu.se> Message-ID: On Fri, Oct 26, 2012 at 10:15 PM, David Warde-Farley < wardefar at iro.umontreal.ca> wrote: > It sounds as if you've installed a binary-incompatible version of > SciPy for the version of NumPy that you have. > > SciPy's version requirements are pretty loose but since SciPy if > you're installing binaries, you need to be sure that the SciPy binary > you get was compiled against the same version of NumPy that you get > (or at least one with the same ABI version, to get technical). > > Deleting whatever you currently have and downloading one of the > "superpack" installers from here > http://sourceforge.net/projects/scipy/files/scipy/0.11.0/ should fix > you up. > That's not necessary. If >>> import scipy >>> scipy.test() runs without issues the install works fine. 
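If the messages themselves are the main nuisance, they can also be filtered out on the user side. A quick sketch (the message strings are just the prefixes of the warnings you quoted):

import warnings
# silence the two RuntimeWarnings triggered when scipy.io is imported;
# 'message' is a regex matched against the start of the warning text
warnings.filterwarnings("ignore", message="numpy.dtype size changed")
warnings.filterwarnings("ignore", message="numpy.ufunc size changed")
from scipy import io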
The reason for these warnings is Cython being too picky, they can be silenced like in: https://github.com/numpy/numpy/pull/432 Ralf On Fri, Oct 26, 2012 at 1:57 PM, Virgil Stokes wrote: > > I have the following installed: > > > > NumPy 1.6.1 > > SciPy 0.11.0 > > > > on a Windows Vista (32-bit) platform with Python 2.7 > > > > I get the following warnings: > > > > D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: RuntimeWarning: > > numpy.dtype size changed, may indicate binary incompatibility > > from mio_utils import squeeze_element, chars_to_strings > > D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: RuntimeWarning: > > numpy.ufunc size changed, may indicate binary incompatibility > > from mio_utils import squeeze_element, chars_to_strings > > D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: RuntimeWarning: > > numpy.dtype size changed, may indicate binary incompatibility > > from mio5_utils import VarReader5 > > D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: RuntimeWarning: > > numpy.ufunc size changed, may indicate binary incompatibility > > from mio5_utils import VarReader5 > > > > When the following statement is executed > > > > from scipy import io > > > > Why does this occur and what can be done to fix this problem? > > _______________________________________________ > > SciPy-User mailing list > > SciPy-User at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-user > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From gael.varoquaux at normalesup.org Sat Oct 27 04:29:52 2012 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Sat, 27 Oct 2012 10:29:52 +0200 Subject: [SciPy-User] Share memory between python an C++ In-Reply-To: <1351299246.28824.6.camel@Nokia-N900-51-1> References: <1351299246.28824.6.camel@Nokia-N900-51-1> Message-ID: <20121027082952.GC11637@phare.normalesup.org> Hi, On Sat, Oct 27, 2012 at 02:54:06AM +0200, FDM wrote: > I have a couple of functions in the form of shared C++ libraries, and > want to use them from within python. Some of them involve big chunks of > data which could be represented easily using numpy data types. > Therefore, I am searching for a way to call the C++ function, pass a > reference or pointer as argument, pointing to memory I have allocated > in python, such that I can use the result of the function w/o copying. You should rely on the numpy support in Cython, and use Cython to call the C++ function. See for instance the following Cython file that we use in the sickit-learn to call the Murmurhash library: https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/utils/murmurhash.pyx This file is somewhat lacking an example of passing an array as a pointer to C code. This can be done by passing the '.data' attribute of the array, that is converted by Cython to a pointer. The following file has examples of this: https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/svm/liblinear.pyx Be sure to read the Cython docs. They are excellent :) Also, I have written an small example showing how to do something similar in Cython, which is to use memory allocated in C++ without copy, and with clean garbage collection. 
This is much harder, I find, and I try to avoid it, but it comes in handy sometimes: http://gael-varoquaux.info/blog/?p=157 Hope this helps, Ga?l From charlesr.harris at gmail.com Sat Oct 27 10:35:40 2012 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 27 Oct 2012 08:35:40 -0600 Subject: [SciPy-User] Orthogonal polynomials on the unit circle In-Reply-To: References: Message-ID: On Fri, Oct 26, 2012 at 7:40 PM, wrote: > http://en.wikipedia.org/wiki/Orthogonal_polynomials_on_the_unit_circle > with link to handbook > > application: goodness of fit for circular data > > http://onlinelibrary.wiley.com/doi/10.1111/j.1467-842X.2009.00558.x/abstract > > Are those available anywhere in python land? > > Well, we have the trivial case: ?_n?(z)=z^n for the uniform measure. That reduces to the usual exp(2*pi*i*\theta) in angular coordinates when the weight is normalized. But I think you want more ;-) I don't know of any collection of such functions for python. What's the difference between orthogonal polynomials on the unit > circle and periodic polynomials like Fourier series? > It looks to be the weight. Also, the usual Fourier series include terms in 1/z which allows for real functions. I suspect there is some finagling that can be done to make things go back and forth, but I am unfamiliar with the topic. Hmm, Laurent polynomials on the unit circle might be more what you are looking for, see the reference at http://dlmf.nist.gov/18.33 . Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From guziy.sasha at gmail.com Sat Oct 27 10:58:51 2012 From: guziy.sasha at gmail.com (Oleksandr Huziy) Date: Sat, 27 Oct 2012 10:58:51 -0400 Subject: [SciPy-User] Orthogonal polynomials on the unit circle In-Reply-To: References: Message-ID: Hi, this is interesting. Do the Fourier series defined on complex plain exist? I mean yes there is exp(i*k*x), but x is usually real. But the circle on complex plain could be parameterized just using length from a start point. Thanks -- Oleksandr (Sasha) Huziy 2012/10/26 > http://en.wikipedia.org/wiki/Orthogonal_polynomials_on_the_unit_circle > with link to handbook > > application: goodness of fit for circular data > > http://onlinelibrary.wiley.com/doi/10.1111/j.1467-842X.2009.00558.x/abstract > > Are those available anywhere in python land? > > What's the difference between orthogonal polynomials on the unit > circle and periodic polynomials like Fourier series? > > Josef > circular statistics - what's that? > It's like TDD, you go in circles > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From vs at it.uu.se Sat Oct 27 11:16:02 2012 From: vs at it.uu.se (Virgil Stokes) Date: Sat, 27 Oct 2012 17:16:02 +0200 Subject: [SciPy-User] Warnings --- why do they occur and how can I stop them? In-Reply-To: References: <508ACEEC.40001@it.uu.se> Message-ID: <508BFAB2.9080404@it.uu.se> On 27-Oct-2012 09:44, Ralf Gommers wrote: > > > On Fri, Oct 26, 2012 at 10:15 PM, David Warde-Farley > > wrote: > > It sounds as if you've installed a binary-incompatible version of > SciPy for the version of NumPy that you have. 
> > SciPy's version requirements are pretty loose but since SciPy if > you're installing binaries, you need to be sure that the SciPy binary > you get was compiled against the same version of NumPy that you get > (or at least one with the same ABI version, to get technical). > > Deleting whatever you currently have and downloading one of the > "superpack" installers from here > http://sourceforge.net/projects/scipy/files/scipy/0.11.0/ should fix > you up. > > > That's not necessary. If > >>> import scipy > >>> scipy.test() > runs without issues the install works fine. > > The reason for these warnings is Cython being too picky, they can be silenced > like in: https://github.com/numpy/numpy/pull/432 > > Ralf > > On Fri, Oct 26, 2012 at 1:57 PM, Virgil Stokes > wrote: > > I have the following installed: > > > > NumPy 1.6.1 > > SciPy 0.11.0 > > > > on a Windows Vista (32-bit) platform with Python 2.7 > > > > I get the following warnings: > > > > D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: RuntimeWarning: > > numpy.dtype size changed, may indicate binary incompatibility > > from mio_utils import squeeze_element, chars_to_strings > > D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: RuntimeWarning: > > numpy.ufunc size changed, may indicate binary incompatibility > > from mio_utils import squeeze_element, chars_to_strings > > D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: RuntimeWarning: > > numpy.dtype size changed, may indicate binary incompatibility > > from mio5_utils import VarReader5 > > D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: RuntimeWarning: > > numpy.ufunc size changed, may indicate binary incompatibility > > from mio5_utils import VarReader5 > > > > When the following statement is executed > > > > from scipy import io > > > > Why does this occur and what can be done to fix this problem? > > _______________________________________________ > > SciPy-User mailing list > > SciPy-User at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-user > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user Ok Ralf, Your suggestion led me to find the source of the problem and to make some changes to my system configuration Here is a short summary: I have python 2.6, 2.7, 3.3 installed on C:\ and D:\ The problem that I experienced was with 2.7 on D:\ Unfortunately when installing SciPy from the binary scipy-0.11.0-win32-superpack-python2.7.exe During the installation it finds (from the system path) that I have python 2.7 installed on C:\ and this is indicated in the installation; however, it does not allow one to edit (change) this to D:\ IMHO this should be fixed --- why even show this information and set the cursor for editing but not allow one to actually edit anything! 
After a lot of manipulation of the system path with drive changes, I finally decided to work with my installation on C:\, and now taking your suggestion, >>import sys >>scipy.test() Running unit tests for scipy NumPy version 1.6.2 NumPy is installed in c:\Python27\lib\site-packages\numpy SciPy version 0.11.0 SciPy is installed in c:\Python27\lib\site-packages\scipy Python version 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] nose version 1.1.2 ..............................................................................................................................................................................................................................K........................................................................................................K..................................................................K..K....................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................SSSSSS......SSSSSS......SSSS...............................................................................S.........K...................................................................................................................................................................................................................................................................................K.........................................................................................................................................................................................................................K...........................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................SSSSSSSSSSS..................................................................................................................................................................................................................................................................................................................................................................................................
.......................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................K..................................................................K................................................................................................................................................................KK..............................................................................................................................................................................................................................................................................................................................................................................................................................c:\Python27\lib\site-packages\scipy\special\tests\test_basic.py:1606: RuntimeWarning: invalid value encountered in absolute assert_(np.abs(c2) >= 1e300, (v, z)) .........................K.K.............................................................................................................................................................................................................................................................................................................................................................................................K........K..............SSSSSSS............................................................................................................................................................................S.................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................. ---------------------------------------------------------------------- Ran 5488 tests in 54.906s OK (KNOWNFAIL=15, SKIP=36) which is not very elegant; but, I believe ok. Again, why can one not edit the installation drive for SciPy when installing from the binary? Thanks for your help. --V -------------- next part -------------- An HTML attachment was scrubbed... URL: From vs at it.uu.se Sat Oct 27 11:17:13 2012 From: vs at it.uu.se (Virgil Stokes) Date: Sat, 27 Oct 2012 17:17:13 +0200 Subject: [SciPy-User] Warnings --- why do they occur and how can I stop them? In-Reply-To: References: <508ACEEC.40001@it.uu.se> Message-ID: <508BFAF9.1020609@it.uu.se> On 26-Oct-2012 22:15, David Warde-Farley wrote: > It sounds as if you've installed a binary-incompatible version of > SciPy for the version of NumPy that you have. 
> > SciPy's version requirements are pretty loose but since SciPy if > you're installing binaries, you need to be sure that the SciPy binary > you get was compiled against the same version of NumPy that you get > (or at least one with the same ABI version, to get technical). > > Deleting whatever you currently have and downloading one of the > "superpack" installers from here > http://sourceforge.net/projects/scipy/files/scipy/0.11.0/ should fix > you up. > > On Fri, Oct 26, 2012 at 1:57 PM, Virgil Stokes wrote: >> I have the following installed: >> >> NumPy 1.6.1 >> SciPy 0.11.0 >> >> on a Windows Vista (32-bit) platform with Python 2.7 >> >> I get the following warnings: >> >> D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: RuntimeWarning: >> numpy.dtype size changed, may indicate binary incompatibility >> from mio_utils import squeeze_element, chars_to_strings >> D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: RuntimeWarning: >> numpy.ufunc size changed, may indicate binary incompatibility >> from mio_utils import squeeze_element, chars_to_strings >> D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: RuntimeWarning: >> numpy.dtype size changed, may indicate binary incompatibility >> from mio5_utils import VarReader5 >> D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: RuntimeWarning: >> numpy.ufunc size changed, may indicate binary incompatibility >> from mio5_utils import VarReader5 >> >> When the following statement is executed >> >> from scipy import io >> >> Why does this occur and what can be done to fix this problem? >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user Thanks for you help, I have now fixed the problem (see my response that was posted earlier). --V From josef.pktd at gmail.com Sat Oct 27 11:34:26 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 27 Oct 2012 11:34:26 -0400 Subject: [SciPy-User] Orthogonal polynomials on the unit circle In-Reply-To: References: Message-ID: On Sat, Oct 27, 2012 at 10:35 AM, Charles R Harris wrote: > > > On Fri, Oct 26, 2012 at 7:40 PM, wrote: >> >> http://en.wikipedia.org/wiki/Orthogonal_polynomials_on_the_unit_circle >> with link to handbook >> >> application: goodness of fit for circular data >> >> http://onlinelibrary.wiley.com/doi/10.1111/j.1467-842X.2009.00558.x/abstract >> >> Are those available anywhere in python land? >> > > Well, we have the trivial case: ?_n?(z)=z^n for the uniform measure. That > reduces to the usual exp(2*pi*i*\theta) in angular coordinates when the > weight is normalized. But I think you want more ;-) I don't know of any > collection of such functions for python. I need to see if I can use this. In general, I would like other weight functions (Von Mises distribution in the density estimation example (?), like hermite polynomials for the normal distribution). I don't know much about the math of circular statistics and functions, I just want to estimate distribution densities on a circle, and I discovered that periodic or circular polynomials would be useful for estimating seasonal/periodic effects. 
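A minimal sketch of that idea, assuming the uniform-weight basis exp(i*n*theta) discussed above written out with real cos/sin terms, a von Mises sample as stand-in data, an arbitrary series cutoff, and a helper name (fourier_density) of my own choosing; the estimate is periodic by construction, though a truncated series like this can dip below zero for small samples:

import numpy as np

def fourier_density(theta, grid, order=5):
    """Truncated Fourier-series density estimate for angles in [0, 2*pi)."""
    f = np.zeros_like(grid) + 1.0 / (2.0 * np.pi)     # uniform (n = 0) term
    for n in range(1, order + 1):
        a_n = np.mean(np.cos(n * theta))              # sample estimate of E[cos(n*theta)]
        b_n = np.mean(np.sin(n * theta))              # sample estimate of E[sin(n*theta)]
        f += (a_n * np.cos(n * grid) + b_n * np.sin(n * grid)) / np.pi
    return f

rng = np.random.RandomState(0)
theta = rng.vonmises(mu=1.0, kappa=2.0, size=2000) % (2.0 * np.pi)
grid = np.linspace(0.0, 2.0 * np.pi, 361)
f_hat = fourier_density(theta, grid)
print(np.trapz(f_hat, grid))    # close to 1, and f_hat[0] ~ f_hat[-1], so the ends match up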
(the clock as a circle) The ends don't match up with chebychev https://picasaweb.google.com/106983885143680349926/Joepy#5747376116689698434 > >> What's the difference between orthogonal polynomials on the unit >> circle and periodic polynomials like Fourier series? > > > It looks to be the weight. Also, the usual Fourier series include terms in > 1/z which allows for real functions. I suspect there is some finagling that > can be done to make things go back and forth, but I am unfamiliar with the > topic. Hmm, Laurent polynomials on the unit circle might be more what you > are looking for, see the reference at http://dlmf.nist.gov/18.33 . Might we worth looking into, but this "finagling" usually turns out to be very time consuming for me, where I don't have the background and no pre-made recipes. (Might be just finding the right coordinate system, or it might mean I would have to look into complex random variables.) Thank you, Josef > > Chuck > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From josef.pktd at gmail.com Sat Oct 27 11:38:57 2012 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 27 Oct 2012 11:38:57 -0400 Subject: [SciPy-User] Orthogonal polynomials on the unit circle In-Reply-To: References: Message-ID: On Sat, Oct 27, 2012 at 3:19 AM, David Warde-Farley wrote: > On Fri, Oct 26, 2012 at 9:40 PM, wrote: >> http://en.wikipedia.org/wiki/Orthogonal_polynomials_on_the_unit_circle >> with link to handbook >> >> application: goodness of fit for circular data >> http://onlinelibrary.wiley.com/doi/10.1111/j.1467-842X.2009.00558.x/abstract >> >> Are those available anywhere in python land? >> >> What's the difference between orthogonal polynomials on the unit >> circle and periodic polynomials like Fourier series? >> >> Josef >> circular statistics - what's that? >> It's like TDD, you go in circles > > I have some code somewhere for Zernike polynomials if you're > interested. I was using them for rotation-invariant feature > extraction. Thanks David. For now I'm looking at the circle, and from what I have seen Zernike polynomials are for disks or similar shapes. Josef > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From helmrp at yahoo.com Sat Oct 27 13:17:22 2012 From: helmrp at yahoo.com (The Helmbolds) Date: Sat, 27 Oct 2012 10:17:22 -0700 (PDT) Subject: [SciPy-User] SciPy-User Digest, Vol 110, Issue 46 In-Reply-To: References: Message-ID: <1351358242.25775.YahooMailNeo@web31812.mail.mud.yahoo.com> In response to: > Date: Fri, 26 Oct 2012 16:53:47 -0400 > From: "Moore, Eric (NIH/NIDDK) [F]" > Subject: Re: [SciPy-User] Request help with fsolve outputs >>??-----Original Message----- >>??From: The Helmbolds [mailto:helmrp at yahoo.com] >>??Sent: Friday, October 26, 2012 11:52 AM >>??To: User SciPy >>??Subject: [SciPy-User] Request help with fsolve outputs >> >> ? ----SNIP---- >> >>??So please help me out here. What are the fjac and r values that fsolve >>??returns (where fjac and r are suppose to be the QR factors for the final? >>Jacobian) ? >>??How are they related to the above Qend, Rend, and Jend (i.e., the apparently >> correct values from an independent computation)? >>??How is the user supposed to use them (i.e., the fjac and r values returned by? >> fsolve)? 
>>??Bob H >> > Eric Moore responded: > > I haven't spent any time playing with your example, but I have looked at a > simpler example.? It seems that the approximated jacobian can be quite far off > and fsolve can still return correct zeros.? How good the approximation ends up > being depends on your choice of initial conditions.? I've attached a trivial > example that shows this. > > There is also the epsfcn parameter, but based on the minpack documentation, the > default should be okay for both our examples since we have full machine > precision available in our functions to minimize. > > It would make things much easier for other people if you could post a python > file where you have defined your function and are calling fsolve.? I believe > Ralph asked if you could do this the last time you posted this question.? > I'm sorry that no one has come in with a simple answer, but this would still > be a good step to make considering your question a little easier for other > people. > > Good luck, > > Eric Eric, thanks for your comments. I gather from them that the descriptive write-up? might be improved if it?included a statement more or less as follows: If the user's application requires accurate values of the problem's Jacobian and its QR? decomposition values evaluated at?fsolve's 'root' value, they should not ?rely on the values? for those quantities returned by fsolve,?unless confirmed by an independent computation. Bob H From charlesr.harris at gmail.com Sat Oct 27 13:32:53 2012 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 27 Oct 2012 11:32:53 -0600 Subject: [SciPy-User] Orthogonal polynomials on the unit circle In-Reply-To: References: Message-ID: On Sat, Oct 27, 2012 at 9:34 AM, wrote: > On Sat, Oct 27, 2012 at 10:35 AM, Charles R Harris > wrote: > > > > > > On Fri, Oct 26, 2012 at 7:40 PM, wrote: > >> > >> http://en.wikipedia.org/wiki/Orthogonal_polynomials_on_the_unit_circle > >> with link to handbook > >> > >> application: goodness of fit for circular data > >> > >> > http://onlinelibrary.wiley.com/doi/10.1111/j.1467-842X.2009.00558.x/abstract > >> > >> Are those available anywhere in python land? > >> > > > > Well, we have the trivial case: ?_n?(z)=z^n for the uniform measure. That > > reduces to the usual exp(2*pi*i*\theta) in angular coordinates when the > > weight is normalized. But I think you want more ;-) I don't know of any > > collection of such functions for python. > > I need to see if I can use this. In general, I would like other weight > functions > (Von Mises distribution in the density estimation example (?), like > hermite polynomials for the normal distribution). > > I don't know much about the math of circular statistics and functions, > I just want to estimate distribution densities on a circle, and I > discovered that periodic or circular polynomials would be useful for > estimating seasonal/periodic effects. (the clock as a circle) > The ends don't match up with chebychev > > https://picasaweb.google.com/106983885143680349926/Joepy#5747376116689698434 > > > > >> What's the difference between orthogonal polynomials on the unit > >> circle and periodic polynomials like Fourier series? > > > > > > It looks to be the weight. Also, the usual Fourier series include terms > in > > 1/z which allows for real functions. I suspect there is some finagling > that > > can be done to make things go back and forth, but I am unfamiliar with > the > > topic. 
Hmm, Laurent polynomials on the unit circle might be more what you > > are looking for, see the reference at http://dlmf.nist.gov/18.33 . > > Might we worth looking into, but this "finagling" usually turns out to > be very time consuming for me, where I don't have the background and > no pre-made recipes. > > (Might be just finding the right coordinate system, or it might mean I > would have to look into complex random variables.) > > There seems to be quite a bit of literature out there, but not of the practical sort, i.e., use this for weights that. I thought this paper, Orthogonal Trigonometric Polynomials , was pretty good as an introduction to the area and it seems to cover the 'finagle', but I suspect it isn't what you need. I put it out there in case someone wants to pursue the subject. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sat Oct 27 14:33:59 2012 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 27 Oct 2012 12:33:59 -0600 Subject: [SciPy-User] Orthogonal polynomials on the unit circle In-Reply-To: References: Message-ID: On Sat, Oct 27, 2012 at 11:32 AM, Charles R Harris < charlesr.harris at gmail.com> wrote: > > > On Sat, Oct 27, 2012 at 9:34 AM, wrote: > >> On Sat, Oct 27, 2012 at 10:35 AM, Charles R Harris >> wrote: >> > >> > >> > On Fri, Oct 26, 2012 at 7:40 PM, wrote: >> >> >> >> http://en.wikipedia.org/wiki/Orthogonal_polynomials_on_the_unit_circle >> >> with link to handbook >> >> >> >> application: goodness of fit for circular data >> >> >> >> >> http://onlinelibrary.wiley.com/doi/10.1111/j.1467-842X.2009.00558.x/abstract >> >> >> >> Are those available anywhere in python land? >> >> >> > >> > Well, we have the trivial case: ?_n?(z)=z^n for the uniform measure. >> That >> > reduces to the usual exp(2*pi*i*\theta) in angular coordinates when the >> > weight is normalized. But I think you want more ;-) I don't know of any >> > collection of such functions for python. >> >> I need to see if I can use this. In general, I would like other weight >> functions >> (Von Mises distribution in the density estimation example (?), like >> hermite polynomials for the normal distribution). >> >> I don't know much about the math of circular statistics and functions, >> I just want to estimate distribution densities on a circle, and I >> discovered that periodic or circular polynomials would be useful for >> estimating seasonal/periodic effects. (the clock as a circle) >> The ends don't match up with chebychev >> >> https://picasaweb.google.com/106983885143680349926/Joepy#5747376116689698434 >> >> > >> >> What's the difference between orthogonal polynomials on the unit >> >> circle and periodic polynomials like Fourier series? >> > >> > >> > It looks to be the weight. Also, the usual Fourier series include terms >> in >> > 1/z which allows for real functions. I suspect there is some finagling >> that >> > can be done to make things go back and forth, but I am unfamiliar with >> the >> > topic. Hmm, Laurent polynomials on the unit circle might be more what >> you >> > are looking for, see the reference at http://dlmf.nist.gov/18.33 . >> >> Might we worth looking into, but this "finagling" usually turns out to >> be very time consuming for me, where I don't have the background and >> no pre-made recipes. >> >> (Might be just finding the right coordinate system, or it might mean I >> would have to look into complex random variables.) 
>> >> > There seems to be quite a bit of literature out there, but not of the > practical sort, i.e., use this for weights that. I thought this paper, Orthogonal > Trigonometric Polynomials , was pretty > good as an introduction to the area and it seems to cover the 'finagle', > but I suspect it isn't what you need. I put it out there in case someone > wants to pursue the subject. > > See also Szego's book, Orthogonal Polynomials, ch 11. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From wardefar at iro.umontreal.ca Sat Oct 27 14:39:43 2012 From: wardefar at iro.umontreal.ca (David Warde-Farley) Date: Sat, 27 Oct 2012 14:39:43 -0400 Subject: [SciPy-User] Orthogonal polynomials on the unit circle In-Reply-To: References: Message-ID: On Sat, Oct 27, 2012 at 11:38 AM, wrote: > On Sat, Oct 27, 2012 at 3:19 AM, David Warde-Farley > wrote: >> On Fri, Oct 26, 2012 at 9:40 PM, wrote: >>> http://en.wikipedia.org/wiki/Orthogonal_polynomials_on_the_unit_circle >>> with link to handbook >>> >>> application: goodness of fit for circular data >>> http://onlinelibrary.wiley.com/doi/10.1111/j.1467-842X.2009.00558.x/abstract >>> >>> Are those available anywhere in python land? >>> >>> What's the difference between orthogonal polynomials on the unit >>> circle and periodic polynomials like Fourier series? >>> >>> Josef >>> circular statistics - what's that? >>> It's like TDD, you go in circles >> >> I have some code somewhere for Zernike polynomials if you're >> interested. I was using them for rotation-invariant feature >> extraction. > > Thanks David. For now I'm looking at the circle, and from what I have > seen Zernike polynomials are for disks or similar shapes. Ah, yes. I misunderstood, you're right, Zernike polynomials are defined on x^2 + y^2 <= 1, rather than x^2 + y^2 == 1. From Wolfgang.Mader at fdm.uni-freiburg.de Sun Oct 28 10:01:24 2012 From: Wolfgang.Mader at fdm.uni-freiburg.de (Wolfgang Mader) Date: Sun, 28 Oct 2012 15:01:24 +0100 Subject: [SciPy-User] Share memory between python an C++ In-Reply-To: <1351299246.28824.6.camel@Nokia-N900-51-1> References: <1351299246.28824.6.camel@Nokia-N900-51-1> Message-ID: <7858178.O4y0epLVCv@discus> On Saturday 27 October 2012 02:54:06 FDM wrote: > Hello list, > > I have a couple of functions in the form of shared C++ libraries, and want > to use them from within python. Some of them involve big chunks of data > which could be represented easily using numpy data types. Therefore, I am > searching for a way to call the C++ function, pass a reference or pointer > as argument, pointing to memory I have allocated in python, such that I can > use the result of the function w/o copying. It should be possible to hide > technicalities from a python user. I would apprechiate any hint. > > Best, Wolfgang > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user Thank you for your hints! From helmrp at yahoo.com Sun Oct 28 20:15:24 2012 From: helmrp at yahoo.com (The Helmbolds) Date: Sun, 28 Oct 2012 17:15:24 -0700 (PDT) Subject: [SciPy-User] The QR decomposition values returned by fsolve Message-ID: <1351469724.91517.YahooMailNeo@web31803.mail.mud.yahoo.com> I think the following is the source of my confusion. SciPy's docstring for fsolve omits the following information found in the "User Guide for MINPACK" regarding HYBRD and HYBRDJ. The following is not a direct quote, but it's pretty close: ? 
    The initial value of the Jacobian is not updated
    until the rank-1 method fails to produce satisfactory progress.

I assume that the Jacobian gets updated intermittently, and only
when the rank-1 method is not producing satisfactory progress.
(So in fact it might never get updated!!)

Because fsolve's QR-related outputs (`fjac`, `r`, and `qtf`) are based on the
final value of fsolve's internal "approximate Jacobian", they may be quite wide
of the mark, unless fsolve "just happens" to return right after the Jacobian
has been updated.

Accordingly -- unless there is some objection -- in my revision of fsolve's docstring, I'll
add to the Notes section something like the following:

    **Cautionary Note**: According to [the MINPACK User Guide], the
    initial value of the program's "approximate Jacobian" is estimated
    (or calculated if `fprime` is supplied by the user), but is updated
    only when the rank-1 method is not producing satisfactory progress.
    Because the program's QR-related outputs (`fjac`, `r`, and `qtf`)
    are based on the program's internal "approximate Jacobian", they
    should not be used in subsequent analysis unless their validity is
    confirmed by independent computations.

Bob H

From josef.pktd at gmail.com Sun Oct 28 21:13:32 2012
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Sun, 28 Oct 2012 21:13:32 -0400
Subject: [SciPy-User] The QR decomposition values returned by fsolve
In-Reply-To: <1351469724.91517.YahooMailNeo@web31803.mail.mud.yahoo.com>
References: <1351469724.91517.YahooMailNeo@web31803.mail.mud.yahoo.com>
Message-ID:

On Sun, Oct 28, 2012 at 8:15 PM, The Helmbolds wrote:
> I think the following is the source of my confusion. SciPy's docstring for fsolve omits the following
> information found in the "User Guide for MINPACK" regarding HYBRD and HYBRDJ.
> The following is not a direct quote, but it's pretty close:
>
> The initial value of the Jacobian is not updated
> until the rank-1 method fails to produce satisfactory progress.
>
> I assume that the Jacobian gets updated intermittently, and only
> when the rank-1 method is not producing satisfactory progress.
> (So in fact it might never get updated!!)
>
> Because fsolve's QR-related outputs (`fjac`, `r`, and `qtf`) are based on the
> final value of fsolve's internal "approximate Jacobian", they may be quite wide
> of the mark, unless fsolve "just happens" to return right after the Jacobian
> has been updated.
>
> Accordingly -- unless there is some objection -- in my revision of fsolve's docstring, I'll
> add to the Notes section something like the following:
>
> **Cautionary Note**: According to [the MINPACK User Guide], the
> initial value of the program's "approximate Jacobian" is estimated
> (or calculated if `fprime` is supplied by the user), but is updated
> only when the rank-1 method is not producing satisfactory progress.
> Because the program's QR-related outputs (`fjac`, `r`, and `qtf`)
> are based on the program's internal "approximate Jacobian", they
> should not be used in subsequent analysis unless their validity is
> confirmed by independent computations.

Thanks, this is useful information.

What's not clear to me is what rank-1 method means, how often this will occur,
and whether mentioning rank-1 method is useful for users.
If I have to use my own Jacobian, then I don't care whether it's rank-1 or
rank-5 :), given that I'm not an expert in the details of the algorithm.

Do you know if the same is true for leastsq?
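For the "confirmed by independent computations" part, a minimal sketch, assuming a toy two-equation system, an arbitrary forward-difference step, and comparison against numpy's own QR; `fjac` and `r` describe fsolve's internal approximate Jacobian in factored (and permuted, packed) form, so they need not agree with the factors computed here, which is exactly the caution under discussion:

import numpy as np
from scipy.optimize import fsolve

def func(p):
    x, y = p
    return [x**2 + y**2 - 2.0, x - y]           # one root at (1, 1)

root, info, ier, msg = fsolve(func, [2.0, 0.5], full_output=True)

# Independent forward-difference Jacobian at the returned root.
eps = 1e-7
f0 = np.asarray(func(root))
J = np.empty((2, 2))
for j in range(2):
    step = np.zeros(2)
    step[j] = eps
    J[:, j] = (np.asarray(func(root + step)) - f0) / eps

Q, R = np.linalg.qr(J)                           # reference QR factors

print(ier, root)                                 # ier == 1 means convergence
print(Q)
print(info['fjac'])                              # fsolve's internal factor, for comparison
print(R)
print(info['r'])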
Josef > > Bob H > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From denis at laxalde.org Mon Oct 29 05:14:26 2012 From: denis at laxalde.org (Denis Laxalde) Date: Mon, 29 Oct 2012 10:14:26 +0100 Subject: [SciPy-User] The QR decomposition values returned by fsolve In-Reply-To: References: <1351469724.91517.YahooMailNeo@web31803.mail.mud.yahoo.com> Message-ID: <508E48F2.7010207@laxalde.org> josef.pktd at gmail.com wrote: > What's not clear to me is what rank-1 method means, how often this > will occur, and whether mentioning rank-1 method is useful for users. rank-1 method refers to "rank-1 method of Broyden". -- Denis Laxalde From sturla at molden.no Mon Oct 29 09:09:09 2012 From: sturla at molden.no (Sturla Molden) Date: Mon, 29 Oct 2012 14:09:09 +0100 Subject: [SciPy-User] Very simple IIR filters? Message-ID: <508E7FF5.30804@molden.no> I have noticed that scipy.signal lacks the simplest class of IIR filters (in fact the one I use most frequently). Specifically: - RC (single-pole) - Notch - Band-pass (inverted notch) - Anti-DC (zero at DC) These are of course very easy to construct and use with sp.signal.filter (and/or filtfilt). But I think it might be beneficial for some users to have them in scipy. Due to their size, they can be make a bit faster by running the loop in Cython instead of using sp.signal.filter (though I've never had use for this optimization). Would this be a useful contribution? Sturla From sturla at molden.no Mon Oct 29 09:18:11 2012 From: sturla at molden.no (Sturla Molden) Date: Mon, 29 Oct 2012 14:18:11 +0100 Subject: [SciPy-User] Single or double precision? Message-ID: <508E8213.3070305@molden.no> Sorry this might be a bit off-topic. But I'll ask on this list anyway: Given that a signal is sampled at 16 bits resolution (at an ADC), will it always be sufficient to store a filtered version at single precision? I.e. a float32 has a 23 bit mantissa, so the truncation error should be tiny compared to the digitization error from the 16 bits ADC. Or am I thinking wrongly about this? Usually I don't care and just use double precision everywhere. But I will save gigabytes of store space by using single precision here. Sturla From sjlukacs at gmail.com Sun Oct 28 21:02:09 2012 From: sjlukacs at gmail.com (stephen lukacs) Date: Sun, 28 Oct 2012 18:02:09 -0700 (PDT) Subject: [SciPy-User] leastsq error Message-ID: hello one and all, i am having a terrible time with optimize.leastsq, even fitting a line, so please help. here is my python code and error >>> import numpy >>> from scipy import optimize >>> def expr_conductance(x, a, b, c): ... return a*0 + b*x + c ... >>> def residual_conductance(p,x,y): ... a, b, c = p ... return y - expr_conductance(x, a, b, c) ... 
>>> x = [0.99771057137610752, 0.49976145827781415, 0.24821831884394957, 0.12480215949109599, 0.06315070141095365, 0.03065779901355976, 0.015669312142317458, 0.0078799613766362755, 0.0039027338918740067] >>> len(x) 9 >>> y = [2.9954211427522148, 1.9995229165556283, 1.496436637687899, 1.2496043189821919, 1.1263014028219074, 1.0613155980271196, 1.031338624284635, 1.0157599227532725, 1.007805467783748] >>> len(y) 9 >>> [a, b, c], conv = optimize.leastsq(residual_conductance, [0,10000.,5.], args = (x,y)) Traceback (most recent call last): File "", line 1, in File "/usr/lib/python2.6/site-packages/scipy/optimize/minpack.py", line 276, in leastsq m = _check_func('leastsq', 'func', func, x0, args, n)[0] File "/usr/lib/python2.6/site-packages/scipy/optimize/minpack.py", line 13, in _check_func res = atleast_1d(thefunc(*((x0[:numinputs],) + args))) File "", line 3, in residual_conductance ValueError: operands could not be broadcast together with shapes (9) (90000) >>> i have tried many permutations and looked all over for syntax, but i can not find why this error is there or how to deal with it. i have scipy 0.10.1 and numpy 1.6.1 under python 2.6.6 on a centos 6.3 system. ultimately i want to put make the fit function a*numpy.power(x,(3/2))+b*x+c, which is a bit nonlinear but at this point i can't even get a line to fit. thank you in advance and have a great day. lucas -------------- next part -------------- An HTML attachment was scrubbed... URL: From newville at cars.uchicago.edu Mon Oct 29 09:45:56 2012 From: newville at cars.uchicago.edu (Matt Newville) Date: Mon, 29 Oct 2012 08:45:56 -0500 Subject: [SciPy-User] leastsq error In-Reply-To: References: Message-ID: Hi Stephen, On Sun, Oct 28, 2012 at 8:02 PM, stephen lukacs wrote: > hello one and all, > > i am having a terrible time with optimize.leastsq, even fitting a line, so > please help. here is my python code and error > >>>> import numpy >>>> from scipy import optimize >>>> def expr_conductance(x, a, b, c): > ... return a*0 + b*x + c > ... >>>> def residual_conductance(p,x,y): > ... a, b, c = p > ... return y - expr_conductance(x, a, b, c) > ... >>>> x = [0.99771057137610752, 0.49976145827781415, 0.24821831884394957, >>>> 0.12480215949109599, 0.06315070141095365, 0.03065779901355976, >>>> 0.015669312142317458, 0.0078799613766362755, 0.0039027338918740067] >>>> len(x) > 9 >>>> y = [2.9954211427522148, 1.9995229165556283, 1.496436637687899, >>>> 1.2496043189821919, 1.1263014028219074, 1.0613155980271196, >>>> 1.031338624284635, 1.0157599227532725, 1.007805467783748] >>>> len(y) > 9 >>>> [a, b, c], conv = optimize.leastsq(residual_conductance, [0,10000.,5.], >>>> args = (x,y)) > Traceback (most recent call last): > File "", line 1, in > File "/usr/lib/python2.6/site-packages/scipy/optimize/minpack.py", line > 276, in leastsq > m = _check_func('leastsq', 'func', func, x0, args, n)[0] > File "/usr/lib/python2.6/site-packages/scipy/optimize/minpack.py", line > 13, in _check_func > res = atleast_1d(thefunc(*((x0[:numinputs],) + args))) > File "", line 3, in residual_conductance > ValueError: operands could not be broadcast together with shapes (9) (90000) >>>> > > i have tried many permutations and looked all over for syntax, but i can > not find why this error is there or how to deal with it. i have scipy > 0.10.1 and numpy 1.6.1 under python 2.6.6 on a centos 6.3 system. > ultimately i want to put make the fit function a*numpy.power(x,(3/2))+b*x+c, > which is a bit nonlinear but at this point i can't even get a line to fit. 
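(The shapes in the traceback, 9 and 90000, suggest the 9-element Python list is being repeated by the initial guess b = 10000 rather than scaled elementwise, which is the list-versus-array point made in the replies that follow.) A minimal sketch of a working setup, assuming synthetic stand-in data and using the exponent 1.5 rather than (3/2), which is integer division and equals 1 under Python 2:

import numpy as np
from scipy import optimize

def residual(p, x, y):
    a, b, c = p
    return y - (a * np.power(x, 1.5) + b * x + c)   # note 1.5, not (3/2)

# Synthetic stand-in data; the measured x and y above work the same way
# once converted with np.asarray(x) and np.asarray(y).
x = np.linspace(0.004, 1.0, 9)
y = 0.5 * np.power(x, 1.5) + 2.0 * x + 1.0
p_opt, ier = optimize.leastsq(residual, [1.0, 1.0, 1.0], args=(x, y))
print(p_opt, ier)                                   # ier in 1..4 indicates success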
> > thank you in advance and have a great day. lucas > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > Try making your x and y numpy arrays instead of python lists. --Matt From kevin.gullikson.signup at gmail.com Mon Oct 29 09:48:29 2012 From: kevin.gullikson.signup at gmail.com (Kevin Gullikson) Date: Mon, 29 Oct 2012 08:48:29 -0500 Subject: [SciPy-User] leastsq error In-Reply-To: References: Message-ID: Lucas, Your x and y arrays are normal python lists, but you are basically assuming they are numpy arrays in your functions. They do not work like that (see below) In [3]: a = [1,2] In [4]: 2*a Out[4]: [1, 2, 1, 2] In [5]: import numpy In [6]: 2*numpy.array(a) Out[6]: array([2, 4]) So if you just make your x and y into numpy arrays, I think it will work. On Sun, Oct 28, 2012 at 8:02 PM, stephen lukacs wrote: > hello one and all, > > i am having a terrible time with optimize.leastsq, even fitting a line, so > please help. here is my python code and error > > >>> import numpy > >>> from scipy import optimize > >>> def expr_conductance(x, a, b, c): > ... return a*0 + b*x + c > ... > >>> def residual_conductance(p,x,y): > ... a, b, c = p > ... return y - expr_conductance(x, a, b, c) > ... > >>> x = [0.99771057137610752, 0.49976145827781415, 0.24821831884394957, > 0.12480215949109599, 0.06315070141095365, 0.03065779901355976, > 0.015669312142317458, 0.0078799613766362755, 0.0039027338918740067] > >>> len(x) > 9 > >>> y = [2.9954211427522148, 1.9995229165556283, 1.496436637687899, > 1.2496043189821919, 1.1263014028219074, 1.0613155980271196, > 1.031338624284635, 1.0157599227532725, 1.007805467783748] > >>> len(y) > 9 > >>> [a, b, c], conv = optimize.leastsq(residual_conductance, > [0,10000.,5.], args = (x,y)) > Traceback (most recent call last): > File "", line 1, in > File "/usr/lib/python2.6/site-packages/scipy/optimize/minpack.py", line > 276, in leastsq > m = _check_func('leastsq', 'func', func, x0, args, n)[0] > File "/usr/lib/python2.6/site-packages/scipy/optimize/minpack.py", line > 13, in _check_func > res = atleast_1d(thefunc(*((x0[:numinputs],) + args))) > File "", line 3, in residual_conductance > ValueError: operands could not be broadcast together with shapes (9) > (90000) > >>> > > i have tried many permutations and looked all over for syntax, but i can > not find why this error is there or how to deal with it. i have scipy > 0.10.1 and numpy 1.6.1 under python 2.6.6 on a centos 6.3 system. > ultimately i want to put make the fit function > a*numpy.power(x,(3/2))+b*x+c, which is a bit nonlinear but at this point i > can't even get a line to fit. > > thank you in advance and have a great day. lucas > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From francesc at continuum.io Mon Oct 29 10:21:26 2012 From: francesc at continuum.io (Francesc Alted) Date: Mon, 29 Oct 2012 10:21:26 -0400 Subject: [SciPy-User] Single or double precision? In-Reply-To: <508E8213.3070305@molden.no> References: <508E8213.3070305@molden.no> Message-ID: <508E90E6.1050006@continuum.io> On 10/29/12 9:18 AM, Sturla Molden wrote: > Sorry this might be a bit off-topic. 
But I'll ask on this list anyway: > > Given that a signal is sampled at 16 bits resolution (at an ADC), will > it always be sufficient to store a filtered version at single precision? > I.e. a float32 has a 23 bit mantissa, so the truncation error should be > tiny compared to the digitization error from the 16 bits ADC. Or am I > thinking wrongly about this? Usually I don't care and just use double > precision everywhere. But I will save gigabytes of store space by using > single precision here. In []: a = np.arange(2**16, dtype=np.uint16) In []: np.all(a.astype(np.float32).astype(np.uint16) == a) Out[]: True So I think you will be safe here. -- Francesc Alted From cournape at gmail.com Mon Oct 29 10:26:55 2012 From: cournape at gmail.com (David Cournapeau) Date: Mon, 29 Oct 2012 15:26:55 +0100 Subject: [SciPy-User] Single or double precision? In-Reply-To: <508E8213.3070305@molden.no> References: <508E8213.3070305@molden.no> Message-ID: On Mon, Oct 29, 2012 at 2:18 PM, Sturla Molden wrote: > Sorry this might be a bit off-topic. But I'll ask on this list anyway: > > Given that a signal is sampled at 16 bits resolution (at an ADC), will > it always be sufficient to store a filtered version at single precision? > I.e. a float32 has a 23 bit mantissa, so the truncation error should be > tiny compared to the digitization error from the 16 bits ADC. Or am I > thinking wrongly about this? Usually I don't care and just use double > precision everywhere. But I will save gigabytes of store space by using > single precision here. Lots of sound card can sample at a 24 bits precision (16 bits is a bit limiting if you really care about audio quality). So at least for music processing, the consensus is that it is almost always good enough to have the full path to single precision, except for a few special cases. You need to be careful when doing non-linear, or time-variant processing: even a simple IIR whose parameters change in time (i.e. not LTI) can actually blow up since all the convergence properties rely on the time invariant property. In practice, people will up-sample to avoid those issues. David From travis at continuum.io Mon Oct 29 11:56:29 2012 From: travis at continuum.io (Travis Oliphant) Date: Mon, 29 Oct 2012 10:56:29 -0500 Subject: [SciPy-User] Very simple IIR filters? In-Reply-To: <508E7FF5.30804@molden.no> References: <508E7FF5.30804@molden.no> Message-ID: On Oct 29, 2012, at 8:09 AM, Sturla Molden wrote: > I have noticed that scipy.signal lacks the simplest class of IIR filters > (in fact the one I use most frequently). Specifically: > > - RC (single-pole) > - Notch > - Band-pass (inverted notch) > - Anti-DC (zero at DC) > > These are of course very easy to construct and use with sp.signal.filter > (and/or filtfilt). But I think it might be beneficial for some users to > have them in scipy. > > Due to their size, they can be make a bit faster by running the loop in > Cython instead of using sp.signal.filter (though I've never had use for > this optimization). > > Would this be a useful contribution? I think so. -Travis > > > Sturla > > > > > > > > > > > > > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From andrew.giessel at gmail.com Mon Oct 29 12:00:16 2012 From: andrew.giessel at gmail.com (andrew giessel) Date: Mon, 29 Oct 2012 12:00:16 -0400 Subject: [SciPy-User] Very simple IIR filters? 
In-Reply-To: References: <508E7FF5.30804@molden.no> Message-ID: <-1920862870024099181@unknownmsgid> This would be great- I've built some filters (for electrophysiology and imaging time series) but more directed standard filters would be very convenient. Filter design is complex and intimidating to dive into. Let me know if I can help ag On Oct 29, 2012, at 11:56, Travis Oliphant wrote: > > On Oct 29, 2012, at 8:09 AM, Sturla Molden wrote: > >> I have noticed that scipy.signal lacks the simplest class of IIR filters >> (in fact the one I use most frequently). Specifically: >> >> - RC (single-pole) >> - Notch >> - Band-pass (inverted notch) >> - Anti-DC (zero at DC) >> >> These are of course very easy to construct and use with sp.signal.filter >> (and/or filtfilt). But I think it might be beneficial for some users to >> have them in scipy. >> >> Due to their size, they can be make a bit faster by running the loop in >> Cython instead of using sp.signal.filter (though I've never had use for >> this optimization). >> >> Would this be a useful contribution? > > I think so. > > -Travis > >> >> >> Sturla >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From helmrp at yahoo.com Mon Oct 29 12:06:39 2012 From: helmrp at yahoo.com (The Helmbolds) Date: Mon, 29 Oct 2012 09:06:39 -0700 (PDT) Subject: [SciPy-User] fsolve return values In-Reply-To: References: Message-ID: <1351526799.23971.YahooMailNeo@web31810.mail.mud.yahoo.com> Thanks to all who participated. ? Yes, the "rank-1 method" is billed as "Broyden's rank-a method", altho that does not make it any clearer to me. And, as you pointed out, it won't be clear to many users, either. So I'll have to either say more or say less about it. ? As far as how frequently the Jacobian gets updated, I guess that depends on the details of the problem and the guessed starting-point. I do have a toy problem (only 3 simultaneous equations in 3 variables) where the number of iterations made is 12, but the number of Jacobian evaluations is 1. So apparently the initial Jacobian was never updated. That did not affect the validity of the solution, however, as it satisfied?the 3 equations to at least 8 significant figures. ? BTW, I'm sure you all are annoyed at the excess question-marks. Does anyone know how to get rid of them when using Yahoo mail on IE 9?? I have tried several things, but none seem to be successsful. Gimme some credit for trying, but little for succeeding. Bob?H -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Oct 29 15:43:24 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 29 Oct 2012 20:43:24 +0100 Subject: [SciPy-User] Very simple IIR filters? In-Reply-To: References: <508E7FF5.30804@molden.no> Message-ID: On Mon, Oct 29, 2012 at 4:56 PM, Travis Oliphant wrote: > > On Oct 29, 2012, at 8:09 AM, Sturla Molden wrote: > > > I have noticed that scipy.signal lacks the simplest class of IIR filters > > (in fact the one I use most frequently). Specifically: > > > > - RC (single-pole) > > - Notch > > - Band-pass (inverted notch) > > - Anti-DC (zero at DC) > > > > These are of course very easy to construct and use with sp.signal.filter > > (and/or filtfilt). 
But I think it might be beneficial for some users to > > have them in scipy. > > > > Due to their size, they can be make a bit faster by running the loop in > > Cython instead of using sp.signal.filter (though I've never had use for > > this optimization). > > > > Would this be a useful contribution? > > I think so. +1 for IIR filter code. How much faster is "a bit"? Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From wardefar at iro.umontreal.ca Mon Oct 29 16:01:29 2012 From: wardefar at iro.umontreal.ca (David Warde-Farley) Date: Mon, 29 Oct 2012 16:01:29 -0400 Subject: [SciPy-User] Share memory between python an C++ In-Reply-To: <20121027082952.GC11637@phare.normalesup.org> References: <1351299246.28824.6.camel@Nokia-N900-51-1> <20121027082952.GC11637@phare.normalesup.org> Message-ID: On Sat, Oct 27, 2012 at 4:29 AM, Gael Varoquaux wrote: > This file is somewhat lacking an example of passing an array as a pointer > to C code. This can be done by passing the '.data' attribute of the > array, that is converted by Cython to a pointer. The following file has > examples of this: > https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/svm/liblinear.pyx Just to add to what Gael said, IIRC the .data attribute on ndarrays has a somewhat uncertain future in Cython, as memory views obviate the need for it (is that right?). Anyway, something to keep in mind. You can always use the PyArray_DATA macro, I think. David From zfyuan at mail.ustc.edu.cn Mon Oct 29 23:18:55 2012 From: zfyuan at mail.ustc.edu.cn (Zhenfei Yuan) Date: Tue, 30 Oct 2012 11:18:55 +0800 Subject: [SciPy-User] numpy.distutils cross compile question In-Reply-To: <508E8213.3070305@molden.no> References: <508E8213.3070305@molden.no> Message-ID: <508F471F.8030703@mail.ustc.edu.cn> Dear all. I've written a small package in my field containing some pure python scripts and fortran extension libraries. At first I just use f2py for the compiling work before importing these extensions into python and they all worked well. This time, I write a "setup.py" file which import "setup" function in "numpy.distutils.core" and will build a scipy sub package on my ubuntu 64 machine, with gfortran specified by the command "python setup.py build fgnu95", which works well. My questions comes with the problem when I'd like to build a win 32/64 package. I think it concerns cross compiling, so I installed mingw compilers like "i686-w64-mingw32-gfortran" and "i686-w64-mingw-32-gcc" on my ubuntu 12.04. However I don't know how to write the setup.py file for cross compiling using numpy.distutils. I tried typing "python setup.py" and specify fortran compiler by typing "fi686-w64-mingw32-gfortran" for building, however it doesn't work. So I'm wondering whether I have to build this package on linux 32, 64 bit and win 32, 64 bit? Thanks a lot. 
-- Jeffrey From pav at iki.fi Tue Oct 30 04:11:09 2012 From: pav at iki.fi (Pauli Virtanen) Date: Tue, 30 Oct 2012 08:11:09 +0000 (UTC) Subject: [SciPy-User] Someone with OSX 10.7/10.8 please test Message-ID: Hi, If someone with OSX 10.7/10.8 can lend a hand and test this code change, help would be appreciated: https://github.com/scipy/scipy/pull/280 Namely, what needs to be done: git clone git://github.com/scipy/scipy.git git remote add pv git://github.com/pv/scipy-work.git git fetch pv git checkout pv/accelerate-ffc Now, rebuild Scipy from these sources, install to some temporary location, and check import scipy print scipy.__file__ # <- check you get the right one python -c 'import scipy; scipy.test("full", verbose=2)' > test.log 2>&1 Look for failures that look like this: http://projects.scipy.org/scipy/ticket/1618 There shouldn't be any of those. Then compare to the master branch: git checkout origin/master rm -rf build and rebuild, and re-test. Do the failures reappear? If you have github account, you can reply directly to the pull request. (Test logs can be uploaded to pastebin.) Thanks, -- Pauli Virtanen From deil.christoph at googlemail.com Tue Oct 30 05:40:38 2012 From: deil.christoph at googlemail.com (Christoph Deil) Date: Tue, 30 Oct 2012 10:40:38 +0100 Subject: [SciPy-User] Someone with OSX 10.7/10.8 please test In-Reply-To: References: Message-ID: On Oct 30, 2012, at 9:11 AM, Pauli Virtanen wrote: > If someone with OSX 10.7/10.8 can lend a hand and test this > code change, help would be appreciated: > > https://github.com/scipy/scipy/pull/280 Would it be possible to ask Numfocus to buy a Mac? I'd be happy to help set up and maintain the common compilers and Pythons (Apple, Macports, Homebrew, Fink). We could then use it for continuous integration of the scipy stack and give developers ssh access to test / debug issues. Christoph -------------- next part -------------- An HTML attachment was scrubbed... URL: From arserlom at gmail.com Tue Oct 30 06:08:27 2012 From: arserlom at gmail.com (Armando Serrano Lombillo) Date: Tue, 30 Oct 2012 11:08:27 +0100 Subject: [SciPy-User] Inconsistent conventions in scipy.interpolate Message-ID: I've recently been catched by the fact that scipy.interpolate.interp2d(x, y, z) expects z.shape=(len(y), len(x)) while scipy.interpolate.RectBivariateSpline(x, y, z) expects z.shape=(len(x), len(y)). I find this inconsistency quite annoying and error prone, is there a reason for it? Armando -------------- next part -------------- An HTML attachment was scrubbed... 
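A small sketch showing the two conventions side by side, assuming arbitrary grid sizes and random sample data:

import numpy as np
from scipy.interpolate import interp2d, RectBivariateSpline

x = np.linspace(0.0, 1.0, 5)            # len(x) == 5
y = np.linspace(0.0, 2.0, 7)            # len(y) == 7
z = np.random.rand(7, 5)                # interp2d wants z.shape == (len(y), len(x)) ...

f_a = interp2d(x, y, z)
f_b = RectBivariateSpline(x, y, z.T)    # ... while RectBivariateSpline wants (len(x), len(y))

print(f_a(0.5, 1.0))
print(f_b(0.5, 1.0))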
URL: From lists at onerussian.com Tue Oct 30 09:38:30 2012 From: lists at onerussian.com (Yaroslav Halchenko) Date: Tue, 30 Oct 2012 09:38:30 -0400 Subject: [SciPy-User] overflow in .sum() of dtype bool sparse matrix Message-ID: <20121030133830.GE5955@onerussian.com> I wonder if that is somehow considered a feature and manual casting is generally advised in such cases: calling .sum on a bool matrix can easily lead to overflows causing bogus results (works fine on ndarrays): % git describe --tags v0.4.3-6232-g43c7982 % PYTHONPATH=$PWD ../demo-scipy-sparse-negativesoverflow.py summing 128 booleans in leads to answer [128] summing 128 booleans in leads to answer [[-128]] % cat ../demo-scipy-sparse-negativesoverflow.py #!/usr/bin/python import numpy as np import scipy.sparse as sp test = np.random.rand(128, 1) test_m= sp.csc_matrix(test) for t in test, test_m: test_bool=t.astype('bool') sum = test_bool.sum(axis=0) print "summing %d booleans in %s leads to answer %s" \ % (t.shape[0], test_bool.__class__, sum) -- Yaroslav O. Halchenko Postdoctoral Fellow, Department of Psychological and Brain Sciences Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From m3atwad at gmail.com Mon Oct 29 19:45:27 2012 From: m3atwad at gmail.com (Rob) Date: Mon, 29 Oct 2012 16:45:27 -0700 (PDT) Subject: [SciPy-User] I'm new..numpy/scipy installation problems plz help! Message-ID: <9667bf39-56b2-4ca2-acf6-d4225a41a500@googlegroups.com> Hello, I'm trying to install matplotlib to do some basic plotting for a wxpython GUI. From what I've read I need to install scipy and numpy as well. I've already got python 2.7 32 bit up and running with wxpython and some other stuff so I want to add the plotting capablity to this. On a clean build I got numpy working by just downloading the prebuilt binaries for python 2.7 32 bit and installed it with the msi installer. This created a numpy folder in site packages and I was able to import it and start using it without any errors. I tried to do the same thing for sci py and no luck. I get an error in aptana studios/eclipse saying it can't find scipy. I've been trying to figure this out for a while now.... Are there any dependencies I need to install? Am I really required to get a compiler and compile all this for windows 7? It seems extremely difficult to get all this working and I'm out of stuff to google. What do I need to do in addition to running the scipy and numpy installers from source forge? I thought you could basically just extract them to site packages, import the modules and away you go but that hans't been the case for me so far. Platform Windows 7 32 bit python 2.7 Thanks, Rob -------------- next part -------------- An HTML attachment was scrubbed... URL: From m3atwad at gmail.com Mon Oct 29 22:24:32 2012 From: m3atwad at gmail.com (Rob) Date: Mon, 29 Oct 2012 19:24:32 -0700 (PDT) Subject: [SciPy-User] I'm new..numpy/scipy installation problems plz help! In-Reply-To: <9667bf39-56b2-4ca2-acf6-d4225a41a500@googlegroups.com> References: <9667bf39-56b2-4ca2-acf6-d4225a41a500@googlegroups.com> Message-ID: <17f3c309-94ce-4ec5-b9e8-ebf2d4bd8d84@googlegroups.com> Quick update...the scipy library and numpy library seem to import into the idle gui ok. Could this be an aptana studio problem? I've added the libraries to the external libraries section of the python path tab in my project properties. 
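Regarding the sparse boolean .sum() overflow shown above, a minimal workaround sketch, assuming the simplest fix is to cast to a wider integer type before summing:

import numpy as np
import scipy.sparse as sp

test = np.random.rand(128, 1)
test_bool = sp.csc_matrix(test).astype('bool')
counts = test_bool.astype(np.int64).sum(axis=0)    # cast first, then sum
print(counts)                                      # [[128]], no wrap-around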
On Monday, October 29, 2012 6:45:27 PM UTC-5, Rob wrote: > > Hello, > > I'm trying to install matplotlib to do some basic plotting for a wxpython > GUI. From what I've read I need to install scipy and numpy as well. I've > already got python 2.7 32 bit up and running with wxpython and some other > stuff so I want to add the plotting capablity to this. On a clean build I > got numpy working by just downloading the prebuilt binaries for python 2.7 > 32 bit and installed it with the msi installer. This created a numpy > folder in site packages and I was able to import it and start using it > without any errors. I tried to do the same thing for sci py and no luck. > I get an error in aptana studios/eclipse saying it can't find scipy. I've > been trying to figure this out for a while now.... Are there any > dependencies I need to install? Am I really required to get a compiler and > compile all this for windows 7? It seems extremely difficult to get all > this working and I'm out of stuff to google. What do I need to do in > addition to running the scipy and numpy installers from source forge? I > thought you could basically just extract them to site packages, import the > modules and away you go but that hans't been the case for me so far. > > Platform > Windows 7 > 32 bit python 2.7 > > Thanks, > Rob > -------------- next part -------------- An HTML attachment was scrubbed... URL: From kwmsmith at gmail.com Tue Oct 30 10:56:12 2012 From: kwmsmith at gmail.com (Kurt Smith) Date: Tue, 30 Oct 2012 09:56:12 -0500 Subject: [SciPy-User] Share memory between python an C++ In-Reply-To: References: <1351299246.28824.6.camel@Nokia-N900-51-1> <20121027082952.GC11637@phare.normalesup.org> Message-ID: On Mon, Oct 29, 2012 at 3:01 PM, David Warde-Farley wrote: > On Sat, Oct 27, 2012 at 4:29 AM, Gael Varoquaux > wrote: > >> This file is somewhat lacking an example of passing an array as a pointer >> to C code. This can be done by passing the '.data' attribute of the >> array, that is converted by Cython to a pointer. The following file has >> examples of this: >> https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/svm/liblinear.pyx > > Just to add to what Gael said, IIRC the .data attribute on ndarrays > has a somewhat uncertain future in Cython, as memory views obviate the > need for it (is that right?). Anyway, something to keep in mind. You > can always use the PyArray_DATA macro, I think. Or, you can always grab the address of the 0-th element of the array, which is more portable and does not depend on the NumPy C-API. So for a 2-dimensional numpy array, you would do: def func(np.ndarray[double, ndim=2] arr): other_c_func(&arr[0,0], arr.size) > > David > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From Jean-Paul.JADAUD at CEA.FR Tue Oct 30 12:06:02 2012 From: Jean-Paul.JADAUD at CEA.FR (Jean-Paul.JADAUD at CEA.FR) Date: Tue, 30 Oct 2012 17:06:02 +0100 Subject: [SciPy-User] I'm new..numpy/scipy installation problems plz help! In-Reply-To: <9667bf39-56b2-4ca2-acf6-d4225a41a500@googlegroups.com> References: <9667bf39-56b2-4ca2-acf6-d4225a41a500@googlegroups.com> Message-ID: <6BE3FB83A53E5E4D9A599CC05BC175651BDA9C@U-MSGDAM.dif.dam.intra.cea.fr> Rob, Have you tried bundled distributions such as www.pythonxy.com http://code.google.com/p/winpython/ http://www.enthought.com/products/epd.php ? 
These distributions include a wealth of precompiled packages with their dependencies and should avoid you the trouble you had Cheers JP Jadaud De : scipy-user-bounces at scipy.org [mailto:scipy-user-bounces at scipy.org] De la part de Rob Envoy? : mardi 30 octobre 2012 00:45 ? : scipy-user at googlegroups.com Objet : [SciPy-User] I'm new..numpy/scipy installation problems plz help! Hello, I'm trying to install matplotlib to do some basic plotting for a wxpython GUI. From what I've read I need to install scipy and numpy as well. I've already got python 2.7 32 bit up and running with wxpython and some other stuff so I want to add the plotting capablity to this. On a clean build I got numpy working by just downloading the prebuilt binaries for python 2.7 32 bit and installed it with the msi installer. This created a numpy folder in site packages and I was able to import it and start using it without any errors. I tried to do the same thing for sci py and no luck. I get an error in aptana studios/eclipse saying it can't find scipy. I've been trying to figure this out for a while now.... Are there any dependencies I need to install? Am I really required to get a compiler and compile all this for windows 7? It seems extremely difficult to get all this working and I'm out of stuff to google. What do I need to do in addition to running the scipy and numpy installers from source forge? I thought you could basically just extract them to site packages, import the modules and away you go but that hans't been the case for me so far. Platform Windows 7 32 bit python 2.7 Thanks, Rob -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla at molden.no Tue Oct 30 12:18:55 2012 From: sturla at molden.no (Sturla Molden) Date: Tue, 30 Oct 2012 17:18:55 +0100 Subject: [SciPy-User] Very simple IIR filters? In-Reply-To: References: <508E7FF5.30804@molden.no> Message-ID: <508FFDEF.9060307@molden.no> On 29.10.2012 20:43, Ralf Gommers wrote: > How much faster is "a bit"? They are so short that the extra overhead from scipy.signal.lfilter might double the run-time. On the other hand, they are so fast that it might not matter anyway. I.e. they will always be faster than other IIR filters we use with scipy.signal.lfilter. (And replicating the machinery of scipy.signal.lfilter takes a bit of work, i.e. filtering along axes, etc. So I am in favor of just computing the coefficients.) Sturla From ralf.gommers at gmail.com Tue Oct 30 13:24:44 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 30 Oct 2012 18:24:44 +0100 Subject: [SciPy-User] Very simple IIR filters? In-Reply-To: <508FFDEF.9060307@molden.no> References: <508E7FF5.30804@molden.no> <508FFDEF.9060307@molden.no> Message-ID: On Tue, Oct 30, 2012 at 5:18 PM, Sturla Molden wrote: > On 29.10.2012 20:43, Ralf Gommers wrote: > > > How much faster is "a bit"? > > They are so short that the extra overhead from scipy.signal.lfilter > might double the run-time. On the other hand, they are so fast that it > might not matter anyway. I.e. they will always be faster than other IIR > filters we use with scipy.signal.lfilter. > > (And replicating the machinery of scipy.signal.lfilter takes a bit of > work, i.e. filtering along axes, etc. So I am in favor of just computing > the coefficients.) > Sorry for being dense, but I'm still not completely clear about what you're planning to do now. I think I should read the above as "no Cython code". Which sounds good to me. 
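As an illustration of the coefficients-only approach, a minimal sketch of two of the filters listed at the start of the thread, a single-pole RC low-pass and an anti-DC (DC-blocking) filter, assuming the usual textbook recurrences, helper names of my own choosing, and an arbitrary pole radius of 0.995; both are applied with scipy.signal.lfilter:

import numpy as np
from scipy.signal import lfilter

def rc_lowpass(fc, fs):
    """(b, a) for y[n] = (1-w)*x[n] + w*y[n-1] with w = exp(-2*pi*fc/fs)."""
    w = np.exp(-2.0 * np.pi * fc / fs)
    return np.array([1.0 - w]), np.array([1.0, -w])

def dc_blocker(r=0.995):
    """(b, a) for y[n] = x[n] - x[n-1] + r*y[n-1], a zero at DC and a pole at r."""
    return np.array([1.0, -1.0]), np.array([1.0, -r])

fs = 1000.0
t = np.arange(0.0, 1.0, 1.0 / fs)
x = 2.0 + np.sin(2 * np.pi * 5 * t) + 0.3 * np.sin(2 * np.pi * 200 * t)

b, a = rc_lowpass(fc=20.0, fs=fs)
smoothed = lfilter(b, a, x)          # 200 Hz component strongly attenuated

b, a = dc_blocker()
no_dc = lfilter(b, a, x)             # the constant 2.0 offset dies away after a transient

print(smoothed.shape, round(no_dc[200:].mean(), 3))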
Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Tue Oct 30 14:23:57 2012 From: ndbecker2 at gmail.com (Neal Becker) Date: Tue, 30 Oct 2012 14:23:57 -0400 Subject: [SciPy-User] Very simple IIR filters? References: <508E7FF5.30804@molden.no> <-1920862870024099181@unknownmsgid> Message-ID: I do a lot of work in the DSP area, and could try to help. I have code that I use to implement IIR filters (not computing coeffs, that's a different subject), but it's using boost::python c++. You could use it as a guide, I suppose. OTOH, there's not much to an IIR filter. From pav at iki.fi Tue Oct 30 16:39:33 2012 From: pav at iki.fi (Pauli Virtanen) Date: Tue, 30 Oct 2012 22:39:33 +0200 Subject: [SciPy-User] Inconsistent conventions in scipy.interpolate In-Reply-To: References: Message-ID: 30.10.2012 12:08, Armando Serrano Lombillo wrote: > I've recently been caught by the fact that > scipy.interpolate.interp2d(x, y, z) expects z.shape=(len(y), len(x)) > while scipy.interpolate.RectBivariateSpline(x, y, z) expects > z.shape=(len(x), len(y)). I find this inconsistency quite annoying and > error prone, is there a reason for it? No reason I'm aware of. The "transposed" convention comes from how meshgrid works, and probably ultimately from image processing or so, whereas the other convention is more natural for Numpy. I would suggest avoiding interp2d --- use RectBivariateSpline if you want to fit splines to rectangular array data, Smooth/LSQBivariateSpline if you want to do spline fitting to scattered data, and griddata for scattered data interpolation. -- Pauli Virtanen
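To make the two conventions concrete, here is a short illustrative sketch (an editorial example, not from the original messages):

    import numpy as np
    from scipy.interpolate import interp2d, RectBivariateSpline

    x = np.linspace(0.0, 1.0, 5)     # 5 sample points along x
    y = np.linspace(0.0, 2.0, 7)     # 7 sample points along y
    z = np.add.outer(x**2, y)        # z[i, j] = x[i]**2 + y[j], shape (5, 7)

    # RectBivariateSpline expects z.shape == (len(x), len(y))
    rbs = RectBivariateSpline(x, y, z)

    # interp2d expects z.shape == (len(y), len(x)), i.e. the transpose
    i2d = interp2d(x, y, z.T)

    # both evaluate f(x=0.5, y=1.0), which should come out close to 0.5**2 + 1.0 = 1.25
    rbs(0.5, 1.0)
    i2d(0.5, 1.0)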
From ralf.gommers at gmail.com Wed Oct 31 04:29:15 2012 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 31 Oct 2012 09:29:15 +0100 Subject: [SciPy-User] Warnings --- why do they occur and how can I stop them? In-Reply-To: <508BFAB2.9080404@it.uu.se> References: <508ACEEC.40001@it.uu.se> <508BFAB2.9080404@it.uu.se> Message-ID: On Sat, Oct 27, 2012 at 5:16 PM, Virgil Stokes wrote: > On 27-Oct-2012 09:44, Ralf Gommers wrote: > > > > On Fri, Oct 26, 2012 at 10:15 PM, David Warde-Farley < wardefar at iro.umontreal.ca> wrote: >> It sounds as if you've installed a binary-incompatible version of >> SciPy for the version of NumPy that you have. >> >> SciPy's version requirements are pretty loose, but if you're installing >> binaries, you need to be sure that the SciPy binary you get was compiled >> against the same version of NumPy that you get (or at least one with the >> same ABI version, to get technical). >> >> Deleting whatever you currently have and downloading one of the >> "superpack" installers from here >> http://sourceforge.net/projects/scipy/files/scipy/0.11.0/ should fix >> you up. >> > > That's not necessary. If > >>> import scipy > >>> scipy.test() > runs without issues the install works fine. > > The reason for these warnings is Cython being too picky; they can be > silenced like in: https://github.com/numpy/numpy/pull/432 > > Ralf > > On Fri, Oct 26, 2012 at 1:57 PM, Virgil Stokes wrote: >> > I have the following installed: >> > >> > NumPy 1.6.1 >> > SciPy 0.11.0 >> > >> > on a Windows Vista (32-bit) platform with Python 2.7 >> > >> > I get the following warnings: >> > >> > D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: >> RuntimeWarning: >> > numpy.dtype size changed, may indicate binary incompatibility >> > from mio_utils import squeeze_element, chars_to_strings >> > D:\python27\lib\site-packages\scipy\io\matlab\mio4.py:15: >> RuntimeWarning: >> > numpy.ufunc size changed, may indicate binary incompatibility >> > from mio_utils import squeeze_element, chars_to_strings >> > D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: >> RuntimeWarning: >> > numpy.dtype size changed, may indicate binary incompatibility >> > from mio5_utils import VarReader5 >> > D:\python27\lib\site-packages\scipy\io\matlab\mio5.py:96: >> RuntimeWarning: >> > numpy.ufunc size changed, may indicate binary incompatibility >> > from mio5_utils import VarReader5 >> > >> > When the following statement is executed >> > >> > from scipy import io >> > >> > Why does this occur and what can be done to fix this problem? >> > _______________________________________________ >> > SciPy-User mailing list >> > SciPy-User at scipy.org >> > http://mail.scipy.org/mailman/listinfo/scipy-user >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > Ok Ralf, > Your suggestion led me to find the source of the problem and to make some > changes to my system configuration. Here is a short summary: > > I have python 2.6, 2.7, 3.3 installed on C:\ and D:\ > > The problem that I experienced was with 2.7 on D:\ > > Unfortunately when installing SciPy from the binary > > scipy-0.11.0-win32-superpack-python2.7.exe > > During the installation it finds (from the system path) that I have > python 2.7 installed on C:\ > and this is indicated in the installation; however, it does not allow one > to edit (change) this to D:\ > > IMHO this should be fixed --- why even show this information and set the > cursor for editing but not allow one to actually edit anything! > The installer byte-compiles the Python code during install, for which it uses the Python it picks up from the Windows registry. I don't know all that much about bdist_wininst, but I don't think that just making the install path editable is going to work. Where would you get your Python for byte-compiling then -- just scan all subdirs for a python.exe file? The installer that bdist_wininst creates simply isn't made for using non-default Pythons, it looks like. > After a lot of manipulation of the system path with drive changes, I > finally decided to work with my installation on C:\, and now taking your > suggestion, > A simple shortcut is to copy the site-packages/numpy/ dir from C:/ to D:/.
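For completeness, a user-level way to hide exactly these import-time RuntimeWarnings (an editorial sketch; this only hides the symptom and is unrelated to the Cython fix linked above) is the standard warnings machinery:

    import warnings

    # The filters match on the start of the warning message (as a regex),
    # so these two lines cover both the dtype and ufunc variants.
    warnings.filterwarnings("ignore", message="numpy.dtype size changed")
    warnings.filterwarnings("ignore", message="numpy.ufunc size changed")

    from scipy import io   # the RuntimeWarnings quoted above are no longer shown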
> >>import sys > >>scipy.test() > Running unit tests for scipy > NumPy version 1.6.2 > NumPy is installed in c:\Python27\lib\site-packages\numpy > SciPy version 0.11.0 > SciPy is installed in c:\Python27\lib\site-packages\scipy > Python version 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] > nose version 1.1.2 > ............................................................................ > (several thousand further progress dots, with occasional K and S markers for known failures and skips, omitted) > c:\Python27\lib\site-packages\scipy\special\tests\test_basic.py:1606: > RuntimeWarning: invalid value encountered in absolute > assert_(np.abs(c2) >= 1e300, (v, z)) > ............................................................................ > ---------------------------------------------------------------------- > Ran 5488 tests in 54.906s > > OK (KNOWNFAIL=15, SKIP=36) > > which is not very elegant, but I believe OK. > That's OK indeed. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla at molden.no Wed Oct 31 10:42:59 2012 From: sturla at molden.no (Sturla Molden) Date: Wed, 31 Oct 2012 15:42:59 +0100 Subject: [SciPy-User] Very simple IIR filters? In-Reply-To: References: <508E7FF5.30804@molden.no> <508FFDEF.9060307@molden.no> Message-ID: <509138F3.4060300@molden.no> On 30.10.2012 18:24, Ralf Gommers wrote: > > Sorry for being dense, but I'm still not completely clear about what > you're planning to do now. I think I should read the above as "no Cython > code". Which sounds good to me.
The code is very simple, though it will also need analog filter design, error checking, proper documentation, and tests. Sturla

    import numpy as np

    def RC(Wn, btype='low'):
        """ digital equivalent of an RC circuit """
        f = Wn/2.0
        x = np.exp(-2*np.pi*f)
        if btype == 'low':
            b, a = np.zeros(2), np.zeros(2)
            b[0] = 1.0 - x
            b[1] = 0.0
            a[0] = 1.0
            a[1] = -x
        elif btype == 'high':
            b, a = np.zeros(2), np.zeros(2)
            b[0] = (1.0 + x)/2.0
            b[1] = -(1.0 + x)/2.0
            a[0] = 1.0
            a[1] = -x
        else:
            raise ValueError("btype must be 'low' or 'high'")
        return b, a

    def notch(Wn, bandwidth):
        """ Notch filter to kill line-noise. """
        f = Wn/2.0
        R = 1.0 - 3.0*(bandwidth/2.0)
        K = ((1.0 - 2.0*R*np.cos(2*np.pi*f) + R**2)/(2.0 - 2.0*np.cos(2*np.pi*f)))
        b, a = np.zeros(3), np.zeros(3)
        a[0] = 1.0
        a[1] = -2.0*R*np.cos(2*np.pi*f)
        a[2] = R**2
        b[0] = K
        b[1] = -2*K*np.cos(2*np.pi*f)
        b[2] = K
        return b, a

    def narrowband(Wn, bandwidth):
        """ Narrow-band filter to isolate a single frequency. """
        f = Wn/2.0
        R = 1.0 - 3.0*(bandwidth/2.0)
        K = ((1.0 - 2.0*R*np.cos(2*np.pi*f) + R**2)/(2.0 - 2.0*np.cos(2*np.pi*f)))
        b, a = np.zeros(3), np.zeros(3)
        a[0] = 1.0
        a[1] = -2.0*R*np.cos(2*np.pi*f)
        a[2] = R**2
        b[0] = 1.0 - K
        b[1] = 2.0*(K - R)*np.cos(2*np.pi*f)
        b[2] = R**2 - K
        return b, a
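As a usage sketch (an editorial example, assuming the helpers above are used as-is, and that Wn and bandwidth are normalised to the Nyquist frequency, which is what the f = Wn/2.0 in the code suggests and matches the rest of scipy.signal), the returned coefficients plug straight into scipy.signal.lfilter:

    import numpy as np
    from scipy.signal import lfilter

    fs = 1000.0                                 # sampling rate in Hz
    t = np.arange(0, 1.0, 1.0/fs)
    x = np.sin(2*np.pi*5*t) + 0.5*np.sin(2*np.pi*50*t)   # 5 Hz signal plus 50 Hz "line noise"

    # remove the 50 Hz component with the notch filter
    b, a = notch(Wn=50.0/(fs/2.0), bandwidth=4.0/(fs/2.0))
    y = lfilter(b, a, x)

    # simple one-pole RC low-pass with a 20 Hz cutoff
    b, a = RC(Wn=20.0/(fs/2.0), btype='low')
    y_smooth = lfilter(b, a, x)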
From arserlom at gmail.com Wed Oct 31 10:59:13 2012 From: arserlom at gmail.com (Armando Serrano Lombillo) Date: Wed, 31 Oct 2012 15:59:13 +0100 Subject: [SciPy-User] Inconsistent conventions in scipy.interpolate In-Reply-To: References: Message-ID: On Tue, Oct 30, 2012 at 9:39 PM, Pauli Virtanen wrote: > 30.10.2012 12:08, Armando Serrano Lombillo wrote: > > I've recently been caught by the fact that > > scipy.interpolate.interp2d(x, y, z) expects z.shape=(len(y), len(x)) > > while scipy.interpolate.RectBivariateSpline(x, y, z) expects > > z.shape=(len(x), len(y)). I find this inconsistency quite annoying and > > error prone, is there a reason for it? > > No reason I'm aware of. The "transposed" convention comes from how > meshgrid works, and probably ultimately from image processing or so, > whereas the other convention is more natural for Numpy. > > I would suggest avoiding interp2d --- use RectBivariateSpline if > you want to fit splines to rectangular array data, > Smooth/LSQBivariateSpline if you want to do spline fitting to scattered > data, and griddata for scattered data interpolation. > > -- > Pauli Virtanen > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > If interp2d should be avoided, then shouldn't it be deprecated so that future unsuspecting users use the most appropriate functions? Armando. -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla.molden at medisin.uio.no Tue Oct 30 13:33:53 2012 From: sturla.molden at medisin.uio.no (Sturla Molden) Date: Tue, 30 Oct 2012 18:33:53 +0100 Subject: [SciPy-User] Very simple IIR filters? In-Reply-To: References: <508E7FF5.30804@molden.no> <508FFDEF.9060307@molden.no> Message-ID: <50900F81.3040508@medisin.uio.no> On 30.10.2012 18:24, Ralf Gommers wrote: > Sorry for being dense, but I'm still not completely clear about what > you're planning to do now. I think I should read the above as "no Cython > code". Which sounds good to me. "No Cython code" is what I meant, yes. Sturla From m3atwad at gmail.com Tue Oct 30 23:57:29 2012 From: m3atwad at gmail.com (Rob) Date: Tue, 30 Oct 2012 20:57:29 -0700 (PDT) Subject: [SciPy-User] I'm new..numpy/scipy installation problems plz help! In-Reply-To: <9667bf39-56b2-4ca2-acf6-d4225a41a500@googlegroups.com> References: <9667bf39-56b2-4ca2-acf6-d4225a41a500@googlegroups.com> Message-ID: <545779ad-33c0-40d2-ae8c-c7c931a0f0aa@googlegroups.com> Update in case anyone has this same problem. I think the issue was that I had not uninstalled everything from the 64-bit Python 2.7 I initially installed, and whatever files were left over were screwing it up. Long story short: if you want to use the numpy/scipy/matplotlib combo, it seems to me you just need to make sure you use 32-bit Python 2.7 and that all of your modules/libraries are also 32 bit. If you've installed a previous 64-bit version of Python, make sure it is all gone! Hope this helps someone. On Monday, October 29, 2012 6:45:27 PM UTC-5, Rob wrote: > > Hello, > > I'm trying to install matplotlib to do some basic plotting for a wxpython > GUI. From what I've read I need to install scipy and numpy as well. I've > already got python 2.7 32 bit up and running with wxpython and some other > stuff so I want to add the plotting capability to this. On a clean build I > got numpy working by just downloading the prebuilt binaries for python 2.7 > 32 bit and installed it with the msi installer. This created a numpy > folder in site packages and I was able to import it and start using it > without any errors. I tried to do the same thing for scipy and no luck. > I get an error in Aptana Studio/Eclipse saying it can't find scipy. I've > been trying to figure this out for a while now.... Are there any > dependencies I need to install? Am I really required to get a compiler and > compile all this for Windows 7? It seems extremely difficult to get all > this working and I'm out of stuff to google. What do I need to do in > addition to running the scipy and numpy installers from SourceForge? I > thought you could basically just extract them to site packages, import the > modules and away you go but that hasn't been the case for me so far. > > Platform > Windows 7 > 32 bit python 2.7 > > Thanks, > Rob > -------------- next part -------------- An HTML attachment was scrubbed... URL: