From oliphant at enthought.com Tue Jun 1 00:54:59 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Mon, 31 May 2010 23:54:59 -0500 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: Message-ID: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> On May 31, 2010, at 9:16 AM, josef.pktd at gmail.com wrote: > Since Travis seems to want to take back control of scipy.stats, I am > considering my role as inofficial maintainer as ended. Obviously I've offended you. That has never been my intent. I apologize if my enthusiasm for getting some changes that I wanted to see into SciPy stepped on an area you felt ownership of. I do not mind if people add changes to code that I've written and I assume that others feel the same. That has always been the development mode of SciPy. We clearly have different development styles. I think we can find a way to work together. I think the move to github will help. I did not understand that you felt such ownership of scipy.stats. I have certainly appreciated your input. I do like a more "free-wheeling" style to code development than one that is bogged down with "rules" and "procedures". This clearly is not your style. For me, it comes down to time to spend. I love working on SciPy and NumPy. I don't have a lot of time to do it. When I see quick changes I can make that add value I like to be able to do it. I think we both want the same thing while we may disagree about the best way to get there. In my mind, discussion doesn't end when a check-in is made --- it just begins. You should never interpret my checking something in as the final word. We clearly have a different view of "trunk" I certainly don't want my approach to open source development to offend others or chase them away. If I check in something you don't like, then tell me and let's talk about it. If you need to vent and call me names, a private email to me or others can go a long way. What do we need to do to keep you around? Is there specifically something you didn't like about my recent check-ins? In this case, the features added were not terribly extensive. The current unit tests helped ferret out major problems. Yes, I could write more tests and documentation, and you have been a model of writing tests and documentation. I have been particularly impressed by the amount of quality documentation you have written. While you seem to dismiss the episode as problematic, I actually think curve_fit was a good example of how something very positive can emerge quickly when people are open and willing to work together. While formal, strict test-driven development is easy to point to for salvation -- it does have its costs. I've always used informal test-driven development. Just because I don't *always* add formal unit tests for every piece of code written does not mean the code that is currently in SciPy is un-tested and useless. Such an approach leaves me open to criticism, which I acknowledge. But, I think there have been far too many dismissive comments about the state of the code. I would argue that the problem with scipy.stats does not lie mainly in distributions.py or the lack of test-driven-development --- but in the lack of certain easy to use features. Quality code comes out of people who care --- not out of procedure. I think you are someone who cares and your code reflects that. We would all benefit from your staying part of the main development. 
Sincere regards, -Travis From oliphant at enthought.com Tue Jun 1 01:02:42 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 1 Jun 2010 00:02:42 -0500 Subject: [SciPy-Dev] Development process (was: scipy.stats) In-Reply-To: References: Message-ID: On May 31, 2010, at 9:09 PM, Matthew Brett wrote: > Hi David, > >> I'm sure you mean (for it's the nature of this list, is it not, that we all >> have free rein to be as diplomatic, or not, as we wish) something along the >> lines of: if you tone it down a bit and make it less personal, i.e., be a >> bit more 'diplomatic,' then people are more likely to take you seriously. > > Sorry - I should have replied to the earlier thread after starting this one. > > I think that is indeed Charles' point, that the best thing to do, is > to identify the general problem, where the problem does not start with > 'if only X would not ...' but is more on the lines of 'there must be > a problem in our process because the following things happen fairly > often ... ' > > That's what I am trying to do with this thread. I think we have > structural problem in organization, where it is not clear what the > process for code maintenance is. I think many people believe that we > need such a process, but, given we do not have one, it is inevitable > that things like this (significant portions of untested code suddenly > appearing in trunk) are going to happen. > > What we need is a ) agreement that there is problem and b) an idea of > how to go forward. > > I think it's also obvious that that conversation has to happen in > public and on record so we can all have our say and agree. I'm sure > it's possible to do that. > > And - Travis (sorry - I am sure you are doing more enjoyable things > for Memorial day) - of course it's essential that you join in with and > / or lead that conversation. > How many people interested in this discussion will be at SciPy this year? It may be a good idea to have a discussion about this at the conference. We could phone conference others in as well so that every voice can be heard. I do think we need to address this issue. I did not realize I was offending people with my enthusiasm for having a chance to work on SciPy. I have always resisted too much "procedure" and "policy" so that it becomes difficult for people to contribute. I really think technology changes and DVCS can help with this process. -Travis From oliphant at enthought.com Tue Jun 1 01:20:41 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 1 Jun 2010 00:20:41 -0500 Subject: [SciPy-Dev] [SciPy-User] log pdf, cdf, etc In-Reply-To: References: <6D1C6011-7B0A-45DB-9B54-6CAE1FA38F71@enthought.com> Message-ID: <12883887-E601-467B-9C56-55BDA8169C19@enthought.com> On May 31, 2010, at 6:39 AM, Ralf Gommers wrote: > > > On Sun, May 30, 2010 at 5:38 AM, wrote: > On Sat, May 29, 2010 at 4:51 PM, Travis Oliphant wrote: > > > > Hey Josef, > > > > I've been playing with distributions.py today and added logpdf, logcdf, logsf methods (based on _logpdf, _logcdf, _logsf methods in each distribution). > > I would like to get the private _logpdf in a useful (vectorized or > broadcastable) version because for estimation and optimization, I want > to avoid the logpdf overhead. So, my testing will be on the underline > versions. > > > > > I also added your _fitstart suggestion. I would like to do something like your nnlf_fit method that allows you to fix some parameters and only solve for others, but I haven't thought through all the issues yet. 
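For concreteness, here is a small sketch of what the new log-methods being discussed buy numerically. It assumes the public names logpdf/logsf mentioned above (the names scipy.stats ships today) and is only an illustration, not the committed code:

    import numpy as np
    from scipy import stats

    x = 40.0
    # Naive route: this far out in the tail the pdf underflows to 0.0,
    # so taking the log afterwards gives -inf.
    naive = np.log(stats.norm.pdf(x))   # -inf

    # The dedicated method works in log space and stays finite, which is
    # what matters for maximum-likelihood estimation on extreme data.
    direct = stats.norm.logpdf(x)       # approximately -800.92
    print(naive, direct)

This is also why a vectorized private _logpdf matters for estimation: an optimizer that evaluates the log-likelihood many times should not pay the overhead of the public wrapper.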
> > I have written a semi-frozen fit function and posted to the mailing > list a long time ago, but since I'm not sure about the API and I'm > expanding to several new estimators, I kept this under > work-in-progress. > > Similar _fitstart might need extra options, for estimation when some > parameters are fixed, e.g. there are good moment estimators that work > when some of the parameters (e.g. loc or scale) are fixed. Also > _fitstart is currently used only by my fit_frozen. > > I was hoping to get this done this year, maybe together with the > enhancements that Per Brodtkorb proposed two years ago, e.g. Method of > Maximum Spacings. > > I also have a Generalized Method of Moments estimator based on > matching quantiles and moments in the works. > > So, I don't want yet to be pinned down with any API for the estimation > enhancements. > > These recent changes are a bit problematic for several reasons: > - there are many new methods for distributions without tests. These methods are simple to see and verify. Which methods specifically are you concerned about? > - there are no docs for many new private and public methods They are all fairly self-explanatory. But, docs can be added if needed. > - invalid syntax: http://projects.scipy.org/scipy/ticket/1186 This has been fixed (it was easier to fix the syntax than file the ticket...) Also to be clear this is only invalid for Python < 2.6 (the comment makes it sound like somehow the changes weren't tested at all). > - the old rv_continuous doc template was put back in I'm not sure what you mean. Which change did this? > > This, plus Josef saying that he doesn't want to fix the API for some methods yet, makes me want to take it out of the 0.8.x branch. Any objections to that, Travis or Josef? I would really like to see these changes go into 0.8.x. If Josef feels strongly about the API in the future, we can change it for the next release. I don't understand what the specific concerns are. -Travis > > Cheers, > Ralf > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user --- Travis Oliphant Enthought, Inc. oliphant at enthought.com 1-512-536-1057 http://www.enthought.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Tue Jun 1 01:26:54 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Mon, 31 May 2010 22:26:54 -0700 Subject: [SciPy-Dev] scipy.stats In-Reply-To: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: Hi, > I do like a more "free-wheeling" style to code development than one that is bogged down with "rules" and "procedures". Well - but that is because you don't do maintenance. Imagine a maintainer puts in a lot of effort to make the code well-documented and tested. Then, you have put in new code that has neither documentation nor tests. As a good maintainer, it's really painful for them that there's new code without documentation or tests. They can only feel abused in that situation, because it seems as if you are expecting them to clean up after you - without asking. I'm offering this only as an explanation for why this situation can get people pretty pissed.
See you, Matthew From oliphant at enthought.com Tue Jun 1 02:15:30 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 1 Jun 2010 01:15:30 -0500 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: <3430B5AD-E3C2-4CE7-B07F-D8210C2E53D5@enthought.com> On Jun 1, 2010, at 12:26 AM, Matthew Brett wrote: > Hi, > >> I do like a more "free-wheeling" style to code development than one that is bogged down with "rules" and "procedures". > > Well - but that is because you don't do maintenance. Imagine a > maintainer puts in a lot of effort to make the code well-documented > and tested. Then, you have put in new code that has neither > documentation nor tests. As a good maintainer, it's really painful > for them that there's new code without documentation or tests. They > can only feel abused in that situation, because it seems as if you are > expecting them to clean up after you - without asking. I don't think that is fair. I have been "maintaining" SciPy and NumPy code for over 10 years. I have done an immense amount of work in porting SciPy to NumPy and continuing to fix bugs that I am made aware of. I don't have as much time to commit to SciPy as I would like. -Travis From oliphant at enthought.com Tue Jun 1 02:38:35 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 1 Jun 2010 01:38:35 -0500 Subject: [SciPy-Dev] Recent changes to scipy stats Message-ID: <4D0A9D22-882F-4FCC-82D5-740D332CF7F9@enthought.com> My recent changes to trunk certainly started a controversy. I'm not exactly sure why. I do not mean to give the impression that people should "clean" up after me which has been implied by some. Please let me know if there is something specific that you would like me to do. I appreciate the specific concerns that Ralf raised as opposed to "generalizations" and metaphors that are open to interpretation. All of his concerns have been addressed, I think, except the addition of all tests that some would like to see. Some of the added methods are so simple that I do not think they require tests to verify their accuracy --- you can look at the code and understand it. In cases like this I get somewhat frustrated with a naive fixed rule like "no check-ins without tests". There can always be more tests, but tests cost and should be part of a general improvement strategy and not just trotted out as a weapon when there is disagreement about something else. Is there a disagreement about other changes that have been made? The only one I can think of that could be controversial is perhaps pulling in Josef's expect methods from his file when he did not want the "API" methods finalized. I'm fine with removing them if he wants to do that. Perhaps the interface I chose to fix certain parameters for the fit methods is also in question. I really don't know as I have received no specific communication about the concerns. I welcome any review or comment on what has been done. As I am not able to follow all threads on SciPy-User and SciPy-Dev, I did not know that Ralf was going to create the 0.8.x branch when he did. Perhaps I should have known, but I did not know.
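For readers trying to picture what "fixing certain parameters for the fit methods" means, a minimal sketch follows. It uses the keyword spelling that scipy.stats eventually settled on (floc/fscale); the exact interface under debate in this thread may have looked different:

    import numpy as np
    from scipy import stats

    np.random.seed(0)
    data = stats.gamma.rvs(2.5, loc=0.0, scale=1.3, size=1000)

    # Free fit: shape, loc and scale are all estimated from the data.
    shape, loc, scale = stats.gamma.fit(data)

    # Fit with the location pinned at 0: only shape and scale are optimized.
    shape0, loc0, scale0 = stats.gamma.fit(data, floc=0.0)
    print(shape0, loc0, scale0)   # loc0 comes back as exactly 0.0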
-Travis From d.l.goldsmith at gmail.com Tue Jun 1 04:07:12 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 01:07:12 -0700 Subject: [SciPy-Dev] Recent changes to scipy stats In-Reply-To: <4D0A9D22-882F-4FCC-82D5-740D332CF7F9@enthought.com> References: <4D0A9D22-882F-4FCC-82D5-740D332CF7F9@enthought.com> Message-ID: On Mon, May 31, 2010 at 11:38 PM, Travis Oliphant wrote: > > My recent changes to trunk certainly started a controversy. I'm not > exactly sure why. I do not mean to give the impression that people should > "clean" up after me which has been implied by some. Please let me know if > there is something specific that you would like me to do. > > I appreciate the speific concerns that Ralf raised as opposed to > "generalizations" and metaphors that are open to interpretation. All of > his concerns have been addressed, I think, except the addition of all tests > that some would like to see. > > Some of the added methods are so simple, that I do not think they require > tests to verify their accuracy --- you can look at the code and understand > it. In cases like this I get somewhat frustrated with a naive fixed rule > like "no check-ins" without "tests". > > There can always be more tests, but tests cost and should be part of a > general improvement strategy and not just trotted out as a weapon when there > is disagreement about something else. > > Is there a disagreement about other changes that have been made? The > only one I can think of that could be controversial is perhaps pulling in > Josef's expect methods from his file when he did not want the "API" methods > finalized. I'm fine with removing them if he wants to do that. > > Perhaps, the interface I chose to fix certain parameters for the fit > methods is also in question. I really don't know as I have received no > specific communication about the concerns. I welcome any review or comment > on what has been done. > > As I am not able to follow all threads on SciPy-User and SciPy-Dev, I did > not know that Ralf was going to create the 0.8.x branch when he did. > Perhaps I should have known, but I did not know. > > -Travis > IMO, the problem - in general, not just w/ any one person - is not the particulars of what's been done, but the attitude, when it's exhibited by an individual, any individual, that the rules may be disregarded when that individual, any individual, unilaterally and spontaneously decides those rules are inconvenient. The rules are there for very good reasons; paraphrasing a recent set of statements by Robert K.: We should follow the rules that we have agreed to because we should make good on our promises. Otherwise, we might as well not make those promises...don't look for excuses to break them...break them [only] when it would be Really Bad if [one] were to follow them. Generally...try to make good on [one's] promises and not renege on them just because [one] *think[s]* no one [else] will notice...[only] break rules/promises when they are in tension with other promises. This is not such a case. Words to commit by. (Thanks, Robert; my apologies if you would rather not have been quoted in this way/situation.) DG -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From oliphant at enthought.com Tue Jun 1 04:09:50 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 1 Jun 2010 03:09:50 -0500 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: Message-ID: <9A04B7F1-D738-46F6-8E38-ABE06C3CC0FF@enthought.com> On May 31, 2010, at 9:16 AM, josef.pktd at gmail.com wrote: > > This is more about the process then the content, distributions was > Travis's baby (although unfinished), and most of his changes are very > good, but I don't want to look for the 5-10% (?) typos anymore. I really am not sure what the difference between looking at timeline of changes and a formal "review" process really is? In either case you are "looking for someone's mistakes or problems". I do think your estimate of typos is a bit aggressive. Really? 5-10% typos. What is the denominator? -Travis From josef.pktd at gmail.com Tue Jun 1 04:12:06 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Jun 2010 04:12:06 -0400 Subject: [SciPy-Dev] scipy.stats In-Reply-To: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 12:54 AM, Travis Oliphant wrote: > > On May 31, 2010, at 9:16 AM, josef.pktd at gmail.com wrote: > >> Since Travis seems to want to take back control of scipy.stats, I am >> considering my role as inofficial maintainer as ended. > > Obviously I've offended you. ? That has never been my intent. ? I apologize if my enthusiasm for getting some changes that I wanted to see into SciPy stepped on an area you felt ownership of. ? ? I do not mind if people add changes to code that I've written and I assume that others feel the same. ? That has always been the development mode of SciPy. ? We clearly have different development styles. ? ?I think we can find a way to work together. ? I think the move to github will help. > > I did not understand that you felt such ownership of scipy.stats. ?I have certainly appreciated your input. > > I do like a more "free-wheeling" style to code development than one that is bogged down with "rules" and "procedures". ? ? This clearly is not your style. ? For me, it comes down to time to spend. ? I love working on SciPy and NumPy. ? ?I don't have a lot of time to do it. ? When I see quick changes I can make that add value I like to be able to do it. ? I think we both want the same thing while we may disagree about the best way to get there. > In my mind, discussion doesn't end when a check-in is made --- it just begins. ? You should never interpret my checking something in as the final word. ? We clearly have a different view of "trunk" > > I certainly don't want my approach to open source development to offend others or chase them away. ?If I check in something you don't like, then tell me and let's talk about it. ? ?If you need to vent and call me names, a private email to me or others can go a long way. > > What do we need to do to keep you around? ? Is there specifically something you didn't like about my recent check-ins? > > In this case, the features added were not terribly extensive. ? The current unit tests helped ferret out major problems. ?Yes, I could write more tests and documentation, and you have been a model of writing tests and documentation. ? I have been particularly impressed by the amount of quality documentation you have written. 
> > While you seem to dismiss the episode as problematic, I actually think curve_fit was a good example of how something very positive can emerge quickly when people are open and willing to work together. > > While formal, strict test-driven development is easy to point to for salvation -- it does have its costs. ? I've always used informal test-driven development. ? Just because I don't *always* add formal unit tests for every piece of code written does not mean the code that is currently in SciPy is un-tested and useless. ? Such an approach leaves me open to criticism, which I acknowledge. ?But, I think there have been far too many dismissive comments about the state of the code. > > I would argue that the problem with scipy.stats does not lie mainly in distributions.py or the lack of test-driven-development --- but in the lack of certain easy to use features. ? ?Quality code comes out of people who care --- not out of procedure. > > I think you are someone who cares and your code reflects that. ? ?We would all benefit from your staying part of the main development. (not answering inline to keep thoughts together) I think the main disagreements are about the quality control of the trunk and whether scipy development is a community effort or not. I think most of us write code in spurts as we find time and some idea bites us, and I have a written a lot of code. However, this is *not* trunk code, this is sandbox code. As Skipper described, in statsmodels almost all development occurs in the sandbox and in branches, and it is only included in the "official" core of statsmodels after it has been verified and tests have been added. sandbox code is everything from first draft version to almost finished code. And one of Skippers task in his gsoc is to clean out the sandbox. Once it is in trunk (core) any further refactoring follows very strict rules. *Every* new function or method needs test before going into trunk or right after. And I hope the test coverage of scipy goes towards that goal. This also applies to trivial functions, because they might be victims of some later refactoring. I have seen a lot of stranded non-functional code in scipy.stats, stats.models and in other parts of scipy. Review before or after commit I think (non-minor) changes, especially new functions, methods and classes need to be offered to the mailing list for comments, review before being committed. (Plus to make it feasible, we have an implied: "If nobody voices disagreement, then I will commit".) The git mirror has been working for a long time, and most development in scipy seems to follow this policy. curve_fit is a good example, Travis committed the changes, without mentioning it on the mailing list. I saw the commit, commented that the statistics of the new function is incorrect and we changed after several rounds until it was verified. I don't think it has any tests yet. Specific to stats: I want a reference for any function where the explanation cannot be found with a Wikipedia search with one of the terms in the docstring. One or a few weeks ago, scipy.stats gained a new function, my asking on the mailing list what it is supposed to be, didn't receive any reply. (besides the problem that the function had the same name as an existing function). Dumping new code into scipy trunk, without any review and tests, hoping that someone else looks for the problems is not an approach that I find acceptable. And personally, I refuse now being "dumped at". 
And I will *not* spend my time in the next three days writing missing tests and verifying code that has been committed to trunk this weekend. Asking me if I have commit rights, shows at least some disconnect from the development of scipy in the last three years, since I have been pretty (too) noisy about it on the mailing lists. Josef > > Sincere regards, > > -Travis > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From josef.pktd at gmail.com Tue Jun 1 04:22:08 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Jun 2010 04:22:08 -0400 Subject: [SciPy-Dev] scipy.stats In-Reply-To: <9A04B7F1-D738-46F6-8E38-ABE06C3CC0FF@enthought.com> References: <9A04B7F1-D738-46F6-8E38-ABE06C3CC0FF@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 4:09 AM, Travis Oliphant wrote: > > On May 31, 2010, at 9:16 AM, josef.pktd at gmail.com wrote: > >> >> This is more about the process then the content, distributions was >> Travis's baby (although unfinished), and most of his changes are very >> good, but I don't want to look for the 5-10% (?) typos anymore. > > I really am not sure what the difference between looking at timeline of changes and a formal "review" process really is? ?In either case you are "looking for someone's mistakes or problems". ? I do think your estimate of typos is a bit aggressive. ?Really? ?5-10% typos. ? ?What is the denominator? I just replied for most of this. My test run in the middle of the weekend (before I gave up), had about 4 or 5 test failures in the new _logpdf _logcdf methods. Third and forth moments (skew, kurtosis) might still return about 5% incorrect numbers, which I accept since it was written at a different time. Same with many generic methods in stats.distributions that I fixed two and a half years ago and which seems to never have worked from what I inferred from the history. denominator: functions/methods that return numbers 5-10% is just a guess, I never tried to measure it, maybe it's only 3%, but each one requires an afternoon to hunt down the reference and the correct formula. Josef > > -Travis > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From oliphant at enthought.com Tue Jun 1 04:25:31 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 1 Jun 2010 03:25:31 -0500 Subject: [SciPy-Dev] Recent changes to scipy stats In-Reply-To: References: <4D0A9D22-882F-4FCC-82D5-740D332CF7F9@enthought.com> Message-ID: > IMO, the problem - in general, not just w/ any one person - is not the particulars of what's been done, but the attitude, when it's exhibited by an individual, any individual, that the rules may be disregarded when that individual, any individual, unilaterally and spontaneously decides those rules are inconvenient. The rules are there for very good reasons; paraphrasing a recent set of statements by Robert K.: What is the rule that has been broken exactly? I'd really like to know what people are actually annoyed by and who exactly is annoyed? Perhaps my confidence with committing to trunk is what is fundamentally the issue. It's clear that some people prefer a different process and perhaps the move to a distributed version control will help things. I do feel a certain confidence with code that I have written and I like to get changes into trunk quickly. That has always been my style. 
I don't think I have changed in this regard. Perhaps it is seen as brazen or inconsiderate, but I don't see it that way. I actually think it very inconsiderate that I should be treated with such rudeness for contributing needed functionality. Sometimes rules become rules inappropriately. Why should one development process hold sway over another? Who is right? Well, clearly, it's just a matter of the people around and what they want to see. If the majority here want to see a different process, then that's where we will go. But, to really do it, we will need to move to a distributed version control process, I think --- or at least I will need to. I will try to work on that when I can find the motivation. -Travis From oliphant at enthought.com Tue Jun 1 04:32:56 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 1 Jun 2010 03:32:56 -0500 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <9A04B7F1-D738-46F6-8E38-ABE06C3CC0FF@enthought.com> Message-ID: <5C7630FC-D91E-4CD9-91D7-7A4CCEBEFD04@enthought.com> On Jun 1, 2010, at 3:22 AM, josef.pktd at gmail.com wrote: > On Tue, Jun 1, 2010 at 4:09 AM, Travis Oliphant wrote: >> >> On May 31, 2010, at 9:16 AM, josef.pktd at gmail.com wrote: >> >>> >>> This is more about the process then the content, distributions was >>> Travis's baby (although unfinished), and most of his changes are very >>> good, but I don't want to look for the 5-10% (?) typos anymore. >> >> I really am not sure what the difference between looking at timeline of changes and a formal "review" process really is? In either case you are "looking for someone's mistakes or problems". I do think your estimate of typos is a bit aggressive. Really? 5-10% typos. What is the denominator? > > I just replied for most of this. > > My test run in the middle of the weekend (before I gave up), had about > 4 or 5 test failures in the new _logpdf _logcdf methods. In this particular case, you can just look at the pdf method and compare it with the logpdf method. I only added ones that were obvious. Are you running a test different from >>> from scipy.stats import test >>> test() to get these errors? Are you saying the skew and kurtosis test functions return different numbers than expected? -Travis From cournape at gmail.com Tue Jun 1 04:35:38 2010 From: cournape at gmail.com (David Cournapeau) Date: Tue, 1 Jun 2010 17:35:38 +0900 Subject: [SciPy-Dev] scipy.stats In-Reply-To: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 1:54 PM, Travis Oliphant wrote: > > I do like a more "free-wheeling" style to code development than one that is bogged down with "rules" and "procedures". ? ? This clearly is not your style. ? For me, it comes down to time to spend. ? I love working on SciPy and NumPy. ? ?I don't have a lot of time to do it. ? When I see quick changes I can make that add value I like to be able to do it. ? I think we both want the same thing while we may disagree about the best way to get there. > In my mind, discussion doesn't end when a check-in is made --- it just begins. ? You should never interpret my checking something in as the final word. ? We clearly have a different view of "trunk" I think the main issue is that you only see tests as a nuisance because it gives you less time to do the actual work. Testing, documenting indeed has a cost - but by not doing it, you are transferring this cost to someone else. 
IOW, the cost of your changes are the same with or without tests - it just ends up being someone else doing the work you don't do, work that you recognize yourself as not being the most interesting one. I think we all understand how valuable your contribution has been (and still is !) to numpy/scipy. But whether you like it or not, now that scipy/numpy are matured packages used by a lot of people, some "overhead" and process is unavoidable. cheers, David From oliphant at enthought.com Tue Jun 1 04:43:54 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 1 Jun 2010 03:43:54 -0500 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: On Jun 1, 2010, at 3:12 AM, josef.pktd at gmail.com wrote: > On Tue, Jun 1, 2010 at 12:54 AM, Travis Oliphant wrote: >> >> On May 31, 2010, at 9:16 AM, josef.pktd at gmail.com wrote: >> >>> Since Travis seems to want to take back control of scipy.stats, I am >>> considering my role as inofficial maintainer as ended. >> >> Obviously I've offended you. That has never been my intent. I apologize if my enthusiasm for getting some changes that I wanted to see into SciPy stepped on an area you felt ownership of. I do not mind if people add changes to code that I've written and I assume that others feel the same. That has always been the development mode of SciPy. We clearly have different development styles. I think we can find a way to work together. I think the move to github will help. >> >> I did not understand that you felt such ownership of scipy.stats. I have certainly appreciated your input. >> >> I do like a more "free-wheeling" style to code development than one that is bogged down with "rules" and "procedures". This clearly is not your style. For me, it comes down to time to spend. I love working on SciPy and NumPy. I don't have a lot of time to do it. When I see quick changes I can make that add value I like to be able to do it. I think we both want the same thing while we may disagree about the best way to get there. >> In my mind, discussion doesn't end when a check-in is made --- it just begins. You should never interpret my checking something in as the final word. We clearly have a different view of "trunk" >> >> I certainly don't want my approach to open source development to offend others or chase them away. If I check in something you don't like, then tell me and let's talk about it. If you need to vent and call me names, a private email to me or others can go a long way. >> >> What do we need to do to keep you around? Is there specifically something you didn't like about my recent check-ins? >> >> In this case, the features added were not terribly extensive. The current unit tests helped ferret out major problems. Yes, I could write more tests and documentation, and you have been a model of writing tests and documentation. I have been particularly impressed by the amount of quality documentation you have written. >> >> While you seem to dismiss the episode as problematic, I actually think curve_fit was a good example of how something very positive can emerge quickly when people are open and willing to work together. >> >> While formal, strict test-driven development is easy to point to for salvation -- it does have its costs. I've always used informal test-driven development. Just because I don't *always* add formal unit tests for every piece of code written does not mean the code that is currently in SciPy is un-tested and useless. 
Such an approach leaves me open to criticism, which I acknowledge. But, I think there have been far too many dismissive comments about the state of the code. >> >> I would argue that the problem with scipy.stats does not lie mainly in distributions.py or the lack of test-driven-development --- but in the lack of certain easy to use features. Quality code comes out of people who care --- not out of procedure. >> >> I think you are someone who cares and your code reflects that. We would all benefit from your staying part of the main development. > > (not answering inline to keep thoughts together) > > I think the main disagreements are about the quality control of the > trunk and whether scipy development is a community effort or not. I certainly think scipy development is a community effort. I'm very sorry for making you feel "dumped" on. That has never been my intent. I was simply hoping to contribute a little where I could. > As Skipper described, in statsmodels almost all development occurs in > the sandbox and in branches, and it is only included in the "official" > core of statsmodels after it has been verified and tests have been > added. sandbox code is everything from first draft version to almost > finished code. > And one of Skippers task in his gsoc is to clean out the sandbox. > Once it is in trunk (core) any further refactoring follows very strict rules. This has not been SciPy's process. I can understand people may want it to become SciPy's process, but it has not been. There are dangers of this process --- there is a reason that the mantra of "release early and release often". It can also prevent progress when you are dealing with people's spare time because all of that process takes time and man-power and effort. There is some value in it, I'm just not sure the extent of that value in contrast to other uses of that time. For example. I would love to see statsmodels get more use. I think there is much code there that is usable. Yet, it remains outside of SciPy. If we agree to change the SciPy process will you agree to put statsmodels into SciPy? > > Specific to stats: I want a reference for any function where the > explanation cannot be found with a Wikipedia search with one of the > terms in the docstring. One or a few weeks ago, scipy.stats gained a > new function, my asking on the mailing list what it is supposed to be, > didn't receive any reply. (besides the problem that the function had > the same name as an existing function). I did not see your message. I changed the name of the function and didn't know you were concerned about the addition. It is a convenience function for bayes_mvs that returns the distribution objects from which the other numbers can be obtained instead of just the numbers. The paper is already referenced in bayes_mvs. > Dumping new code into scipy trunk, without any review and tests, > hoping that someone else looks for the problems is not an approach > that I find acceptable. That was never my "hope". I planned to and have fixed all problems that I saw later and that others have pointed out. You can never test for all possible failures. > > Asking me if I have commit rights, shows at least some disconnect from > the development of scipy in the last three years, since I have been > pretty (too) noisy about it on the mailing lists. I know you have been noisy on the lists --- that's why I spoke to you about _logpdf and friends. It also appears that you don't commit that often. This is your process. 
But, it made me wonder if permissions were an issue. I was pretty sure you had been given commit rights, but I could not remember. I'm sorry if that offended you. -Travis -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Tue Jun 1 04:54:20 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 01:54:20 -0700 Subject: [SciPy-Dev] Recent changes to scipy stats In-Reply-To: References: <4D0A9D22-882F-4FCC-82D5-740D332CF7F9@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 1:25 AM, Travis Oliphant wrote: > > > IMO, the problem - in general, not just w/ any one person - is not the > particulars of what's been done, but the attitude, when it's exhibited by an > individual, any individual, that the rules may be disregarded when that > individual, any individual, unilaterally and spontaneously decides those > rules are inconvenient. The rules are there for very good reasons; > paraphrasing a recent set of statements by Robert K.: > > What is the rule that has been broken exactly? Good point: I've been led to believe that there's a "rule" (aka "policy") against checking-in code that doesn't include passing unit tests and a Standard-compliant docstring, but I must concede that I can't say *where* (other than on the listserv) *any* rules are recorded, so if I am mistaken - if there is in fact no such rule/policy - I apologize, I guess we are all (or at least all those w/ commit privilege) free to commit as it suits our "style." > I'd really like to know what people are actually annoyed by and who exactly > is annoyed? > Great, I'd like to know who *isn't* annoyed. DG > > Perhaps my confidence with committing to trunk is what is fundamentally the > issue. It's clear that some people prefer a different process and perhaps > the move to a distributed version control will help things. > > I do feel a certain confidence with code that I have written and I like to > get changes into trunk quickly. That has always been my style. I don't > think I have changed in this regard. Perhaps it is seen as brazen or > inconsiderate, but I don't see it that way. I actually think it very > inconsiderate that I should be treated with such rudeness for contributing > needed functionality. > > Sometimes rules become rules inappropriately. Why should one development > process hold sway over another? Who is right? Well, clearly, it's just a > matter of the people around and what they want to see. If the majority > here want to see a different process, then that's where we will go. But, to > really do it, we will need to move to a distributed version control process, > I think --- or at least I will need to. I will try to work on that when I > can find the motivation. > > -Travis > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From josef.pktd at gmail.com Tue Jun 1 05:50:46 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Jun 2010 05:50:46 -0400 Subject: [SciPy-Dev] scipy.stats In-Reply-To: <5C7630FC-D91E-4CD9-91D7-7A4CCEBEFD04@enthought.com> References: <9A04B7F1-D738-46F6-8E38-ABE06C3CC0FF@enthought.com> <5C7630FC-D91E-4CD9-91D7-7A4CCEBEFD04@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 4:32 AM, Travis Oliphant wrote: > > On Jun 1, 2010, at 3:22 AM, josef.pktd at gmail.com wrote: > >> On Tue, Jun 1, 2010 at 4:09 AM, Travis Oliphant wrote: >>> >>> On May 31, 2010, at 9:16 AM, josef.pktd at gmail.com wrote: >>> >>>> >>>> This is more about the process then the content, distributions was >>>> Travis's baby (although unfinished), and most of his changes are very >>>> good, but I don't want to look for the 5-10% (?) typos anymore. >>> >>> I really am not sure what the difference between looking at timeline of changes and a formal "review" process really is? ?In either case you are "looking for someone's mistakes or problems". ? I do think your estimate of typos is a bit aggressive. ?Really? ?5-10% typos. ? ?What is the denominator? >> >> I just replied for most of this. >> >> My test run in the middle of the weekend (before I gave up), had about >> 4 or 5 test failures in the new _logpdf _logcdf methods. > > In this particular case, you can just look at the pdf method and compare it with the logpdf method. ?I only added ones that were obvious. ?Are you running a test different from > >>>> from scipy.stats import test >>>> test() no, I was running a variation on the new tests for logpdf logcdf, that I have attached to the ticket > > to get these errors? > > Are you saying the skew and kurtosis test functions return different numbers than expected? no, the methods in the distributions for distfn.stats(moments="sk") or distfn.moment(3) or 4 I think, the f distribution is the only one where I went through the formulas to find the typo. I think skew and kurtosistests are ok, although I would have to look it up to be sure. Josef > > -Travis > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From josef.pktd at gmail.com Tue Jun 1 06:48:12 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Jun 2010 06:48:12 -0400 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 4:43 AM, Travis Oliphant wrote: > > On Jun 1, 2010, at 3:12 AM, josef.pktd at gmail.com wrote: > > On Tue, Jun 1, 2010 at 12:54 AM, Travis Oliphant > wrote: > > On May 31, 2010, at 9:16 AM, josef.pktd at gmail.com wrote: > > Since Travis seems to want to take back control of scipy.stats, I am > > considering my role as inofficial maintainer as ended. > > Obviously I've offended you. ? That has never been my intent. ? I apologize > if my enthusiasm for getting some changes that I wanted to see into SciPy > stepped on an area you felt ownership of. ? ? I do not mind if people add > changes to code that I've written and I assume that others feel the same. > That has always been the development mode of SciPy. ? We clearly have > different development styles. ? ?I think we can find a way to work together. > ? I think the move to github will help. > > I did not understand that you felt such ownership of scipy.stats. ?I have > certainly appreciated your input. 
> > I do like a more "free-wheeling" style to code development than one that is > bogged down with "rules" and "procedures". ? ? This clearly is not your > style. ? For me, it comes down to time to spend. ? I love working on SciPy > and NumPy. ? ?I don't have a lot of time to do it. ? When I see quick > changes I can make that add value I like to be able to do it. ? I think we > both want the same thing while we may disagree about the best way to get > there. > > In my mind, discussion doesn't end when a check-in is made --- it just > begins. ? You should never interpret my checking something in as the final > word. ? We clearly have a different view of "trunk" > > I certainly don't want my approach to open source development to offend > others or chase them away. ?If I check in something you don't like, then > tell me and let's talk about it. ? ?If you need to vent and call me names, a > private email to me or others can go a long way. > > What do we need to do to keep you around? ? Is there specifically something > you didn't like about my recent check-ins? > > In this case, the features added were not terribly extensive. ? The current > unit tests helped ferret out major problems. ?Yes, I could write more tests > and documentation, and you have been a model of writing tests and > documentation. ? I have been particularly impressed by the amount of quality > documentation you have written. > > While you seem to dismiss the episode as problematic, I actually think > curve_fit was a good example of how something very positive can emerge > quickly when people are open and willing to work together. > > While formal, strict test-driven development is easy to point to for > salvation -- it does have its costs. ? I've always used informal test-driven > development. ? Just because I don't *always* add formal unit tests for every > piece of code written does not mean the code that is currently in SciPy is > un-tested and useless. ? Such an approach leaves me open to criticism, which > I acknowledge. ?But, I think there have been far too many dismissive > comments about the state of the code. > > I would argue that the problem with scipy.stats does not lie mainly in > distributions.py or the lack of test-driven-development --- but in the lack > of certain easy to use features. ? ?Quality code comes out of people who > care --- not out of procedure. > > I think you are someone who cares and your code reflects that. ? ?We would > all benefit from your staying part of the main development. > > (not answering inline to keep thoughts together) > > I think the main disagreements are about the quality control of the > trunk and whether scipy development is a community effort or not. > > I certainly think scipy development is a community effort. ? I'm very sorry > for making you feel "dumped" on. ? That has never been my intent. ?I was > simply hoping to contribute a little where I could. I only feel "dumped" on, because I want tested and verified stage. I could leave it to somebody else in five years to clean it up. And I don't want to add lot's of notes in docstrings, "use at your own risk, this function hasn't been verified" as we sometimes do in our (statsmodels) sandbox. > > As Skipper described, in statsmodels almost all development occurs in > the sandbox and in branches, and it is only included in the "official" > core of statsmodels after it has been verified and tests have been > added. sandbox code is everything from first draft version to almost > finished code. 
> And one of Skippers task in his gsoc is to clean out the sandbox. > Once it is in trunk (core) any further refactoring follows very strict > rules. > > This has not been SciPy's process. ? I can understand people may want it to > become SciPy's process, but it has not been. ?There are dangers of this > process --- there is a reason that the mantra of "release early and release > often". ?It can also prevent progress when you are dealing with people's > spare time because all of that process takes time and man-power and effort. > ? There is some value in it, I'm just not sure the extent of that value in > contrast to other uses of that time. I think that's another discussion I have seen already several times. I think it's time that scipy moves to a "verified" only stage, instead of "this is a young project, still work in progress and use at your own risk" > For example. ?I would love to see statsmodels get more use. ? I think there > is much code there that is usable. ?Yet, it remains outside of SciPy. > If we agree to change the SciPy process will you agree to put statsmodels > into SciPy? I hope that statsmodels becomes too big for scipy, but I still would like to see core models to go into scipy. To quote myself from the pystatsmodels mailing list. "The way it looks like, I don't think statsmodels (as a whole) will go back into scipy, the count of python lines of code of statsmodels is already almost 20% of the one in scipy according to ohloh. Large parts of the code are still in the sandbox but with another gsoc and continued development we will have too much statistics coverage for statsmodels to be absorbed by scipy." There are now at least 3 very active scikits, image, learn and statsmodels, and I think the model of developing and maturing code in a scikit starts to work pretty well. For me it's easier to develop and mature inside a pure python package, which is also more accessible for new contributors. One of my wishful target audience are contributors on Windows, which would become rather difficult as part of scipy and git. > > Specific to stats: I want a reference for any function where the > explanation cannot be found with a Wikipedia search with one of the > terms in the docstring. One or a few weeks ago, scipy.stats gained a > new function, my asking on the mailing list what it is supposed to be, > didn't receive any reply. (besides the problem that the function had > the same name as an existing function). > > I did not see your message. ? I changed the name of the function and didn't > know you were concerned about the addition. ? It is a convenience function > for bayes_mvs that returns the distribution objects from which the other > numbers can be obtained instead of just the numbers. ? ? The paper is > already referenced in bayes_mvs. This explanation would have made a good comment in the notes section of the docstring, and I wouldn't have to try to remember and look up whether this might be some posterior distribution for a diffuse prior with normal likelihood. > > Dumping new code into scipy trunk, without any review and tests, > hoping that someone else looks for the problems is not an approach > that I find acceptable. > > That was never my "hope". ?I planned to and have fixed all problems that I > saw later and that others have pointed out. ? You can never test for all > possible failures. For many cases, I haven't seen you committed to do any maintenance on it. At least, there are many functions that never got a test added later on. 
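For context, the convenience function discussed above (the one that returns distribution objects for bayes_mvs) appears in later scipy releases as stats.mvsdist, assuming that is the function meant here. A short sketch of how the two relate:

    import numpy as np
    from scipy import stats

    np.random.seed(0)
    data = np.random.normal(loc=5.0, scale=2.0, size=100)

    # bayes_mvs returns point estimates with confidence intervals directly ...
    mean_est, var_est, std_est = stats.bayes_mvs(data, alpha=0.9)

    # ... while mvsdist returns the posterior distribution objects themselves,
    # so any other quantity can be computed from them.
    mean_dist, var_dist, std_dist = stats.mvsdist(data)
    print(mean_dist.mean(), mean_dist.interval(0.9))   # matches mean_est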
You respond to bug reports, but that is after the fact, when someone already ran into it. What I think has to be required are basic tests. I'm not religious about testing for all possible failures. Edge cases, numerical precision problems, and problems with use cases that were not initially targeted can and still need to be handled after the code is in trunk. And as Skipper said, and I felt from the beginning about scipy, a package where you cannot rely (up to a high degree) on the correctness of the results is pretty unattractive for serious work. Nobody wants to retract a paper because there was a programming mistake somewhere. So, verification of the code for the main use case(s) is the minimum requirement that Skipper and I agreed upon last summer for any statistics/econometrics in Python development. > > > Asking me if I have commit rights, shows at least some disconnect from > the development of scipy in the last three years, since I have been > pretty (too) noisy about it on the mailing lists. > > I know you have been noisy on the lists --- that's why I spoke to you about > _logpdf and friends. It also appears that you don't commit that often. After I had several crashes late last year, and because I'm working now mostly on statsmodels, I haven't kept my scipy development setup up to date very often. I'm usually pretty fast in responding to open issues, and Stefan and Ralf made commits to scipy.stats that I reviewed and that were discussed on the mailing list or in a ticket. On the other hand, I'm not "pushing" my own code into scipy very fast, although I push it to the mailing list. Mainly because I'm reluctant to commit my own code when I don't think it's perfect yet, and when the response on the mailing list doesn't look like there is an urgent demand for it. I only see feedback when the code gets questions later on the mailing list or on stackoverflow. So this is maybe not the best approach. > This is your process. But, it made me wonder if permissions were an issue. > I was pretty sure you had been given commit rights, but I could not > remember. I'm sorry if that offended you. I might have overreacted initially, but I would have expected you to participate in the discussion or at least mention that you work on it, instead of announcing it at almost (?) the same time as making the commits. Josef > -Travis > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From vincent at vincentdavis.net Tue Jun 1 08:59:50 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Tue, 1 Jun 2010 06:59:50 -0600 Subject: [SciPy-Dev] Recent changes to scipy stats In-Reply-To: References: <4D0A9D22-882F-4FCC-82D5-740D332CF7F9@enthought.com> Message-ID: The diversity of perspective as to who has done what and, more importantly, whether it was right or ok to do seems to imply that there is a lack of clear roles/policies. If there were, it seems that we could hope there would not be this diversity in perspectives. Someone with more experience than me should make the rules and policies. Moving to github would be great IMO. 
Have a great day Vincent On Tue, Jun 1, 2010 at 2:54 AM, David Goldsmith wrote: > On Tue, Jun 1, 2010 at 1:25 AM, Travis Oliphant > wrote: >> >> > IMO, the problem - in general, not just w/ any one person - is not the >> > particulars of what's been done, but the attitude, when it's exhibited by an >> > individual, any individual, that the rules may be disregarded when that >> > individual, any individual, unilaterally and spontaneously decides those >> > rules are inconvenient. ?The rules are there for very good reasons; >> > paraphrasing a recent set of statements by Robert K.: >> >> What is the rule that has been broken exactly? > > Good point: I've been led to believe that there's a "rule" (aka "policy") > against checking-in code that doesn't include passing unit tests and a > Standard-compliant docstring, but I must concede that I can't say *where* > (other than on the listserv) *any* rules are recorded, so if I am mistaken - > if there is in fact no such rule/policy - I apologize, I guess we are all > (or at least all those w/ commit privilege) free to commit as it suits our > "style." > >> >> I'd really like to know what people are actually annoyed by and who >> exactly is annoyed? > > Great, I'd like to know who *isn't* annoyed. > > DG >> >> Perhaps my confidence with committing to trunk is what is fundamentally >> the issue. ?It's clear that some people prefer a different process and >> perhaps the move to a distributed version control will help things. >> >> I do feel a certain confidence with code that I have written and I like to >> get changes into trunk quickly. ? That has always been my style. ? ?I don't >> think I have changed in this regard. ? Perhaps it is seen as brazen or >> inconsiderate, but I don't see it that way. ? ?I actually think it very >> inconsiderate that I should be treated with such rudeness for contributing >> needed functionality. >> >> Sometimes rules become rules inappropriately. ?Why should one development >> process hold sway over another? ?Who is right? ?Well, clearly, it's just a >> matter of the people around and what they want to see. ? If the majority >> here want to see a different process, then that's where we will go. ?But, to >> really do it, we will need to move to a distributed version control process, >> I think --- or at least I will need to. ?I will try to work on that when I >> can find the motivation. >> >> -Travis >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > -- > Mathematician: noun, someone who disavows certainty when their uncertainty > set is non-empty, even if that set has measure zero. > > Hope: noun, that delusive spirit which escaped Pandora's jar and, with her > lies, prevents mankind from committing a general suicide. 
?(As interpreted > by Robert Graves) > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From ralf.gommers at googlemail.com Tue Jun 1 09:19:58 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Tue, 1 Jun 2010 21:19:58 +0800 Subject: [SciPy-Dev] [SciPy-User] log pdf, cdf, etc In-Reply-To: <12883887-E601-467B-9C56-55BDA8169C19@enthought.com> References: <6D1C6011-7B0A-45DB-9B54-6CAE1FA38F71@enthought.com> <12883887-E601-467B-9C56-55BDA8169C19@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 1:20 PM, Travis Oliphant wrote: > > On May 31, 2010, at 6:39 AM, Ralf Gommers wrote: > > These recent changes are a bit problematic for several reasons: > - there are many new methods for distributions without tests. > > These methods are simple to see and verify. Which methods specifically > are you concerned about? > They're not all simple, for example rv_continuous._reduce_func. Since it contains inner function definitions inside an "else" block there's also a good chance it's actually broken. And in principle I'm worried about all of them. The python 2.4/2.5 syntax error was caught early, but what if some code you regard as simple is broken in a less obvious way on 2.4/2.5? Maybe a user finds it in a release candidate, forcing us to build an extra one? Or just after the final release? > > - there are no docs for many new private and public methods > > > They are all fairly self explanatory. But, docs can be added if needed. > For you, and maybe for me too. But for undergraduate students, or Joe in accounting who inherited this random app that's essential for his job? It's simple, no public docs without docstrings. And preferably no private ones either. Thanks for fixing all public docs quickly though. You missed just one, gamma.fit. > > - invalid syntax: http://projects.scipy.org/scipy/ticket/1186 > > > This has been fixed (it was easier to fix the syntax then file the > ticket...) Also to be clear this is only invalid for Python < 2.6 (the > comment makes it sound like somehow the changes weren't tested at all). > > I didn't mean to imply that you were committing code that didn't even work for you. > - the old rv_continuous doc template was put back in > > > I'm not sure what you mean. Which change did this? > The first one of your recent commits, r6392. The docstrings for subclasses of rv_continuous and rv_discrete are not generated from this template anymore, which is why it was removed. Look at line 862 (# generate docstring for subclass instances) and below that to see how it works now. If you're wondering why that changed, the main reasons are (1) to make the docstrings conform to the standard, (2) to be able to put useful info in the base classes, like "this is how you subclass it: ..." instead of a template, and (3) to be able to customize individual distribution docstrings easily. > > This, plus Josef saying that he doesn't want to fix the API for some > methods yet, makes me want to take it out of the 0.8.x branch. Any > objections to that Travis or Josef? > > > I would really like to see these changes go in to 0.8.x. If Josef feels > strongly about the API in the future, we can change it for the next release. > I don't understand what the specific concerns are. > > No you can't. For API changes we do have a policy, they need deprecation first. Which means if we release it like this now, we're stuck with it till 0.10 / 1.0. 
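(To make "deprecation first" concrete: in practice it usually amounts to keeping the old entry point alive for one release cycle and emitting a warning, roughly as in the sketch below. The function names here are invented purely for illustration, they are not the actual methods under discussion.)

import warnings

def scaled_mean(data, scale=1.0):
    # new, preferred name (hypothetical example)
    return scale * sum(data) / len(data)

def mean_scaled(data, scale=1.0):
    # old name kept working for one release, with a warning, as the policy asks
    warnings.warn("mean_scaled is deprecated, use scaled_mean instead",
                  DeprecationWarning, stacklevel=2)
    return scaled_mean(data, scale=scale)

Only after that warning has been out in a release can the old name go away, which is exactly why an API we are not yet sure about should not ship now.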
In summary, I see quite a few reasons why this shouldn't go in and don't see a compelling reason to release it right now. The 0.9 release is (tentatively) planned for September, so you don't have to worry that your changes sit in trunk unreleased for 1.5 years. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at googlemail.com Tue Jun 1 09:23:25 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Tue, 1 Jun 2010 21:23:25 +0800 Subject: [SciPy-Dev] Scipy archive on PyPI In-Reply-To: <4C045DCD.1080503@silveregg.co.jp> References: <201005311655.53741.ce@vejnar.eu> <4C045DCD.1080503@silveregg.co.jp> Message-ID: On Tue, Jun 1, 2010 at 9:09 AM, David wrote: > On 06/01/2010 12:15 AM, Ralf Gommers wrote: > > > > > > On Mon, May 31, 2010 at 10:55 PM, Charles Vejnar > > wrote: > > > > Hi, > > > > I was trying to install Scipy with easy_install and it seems that > > downloading > > from Sourceforge is no longer possible (Sourceforge no longer gives > > a direct > > link to the .tar.gz file) which makes the install fail. > > > > Would it be possible to always upload the latest Scipy tarball to > PyPI ? > > > > It's possible, but because that encourages the use of easy_install/pip > > it would probably give more problems than that it helps. Just today > > there was a thread on numpy-discussion about pip failing and standard > > "python setup.py install" fixing the problem. easy_install is just as > > problematic as pip, if not more so. > > Unfortunately, people will always use those half broken tools. I think > we should at least put the tarballs - I also used to put a simple > executable (result of bdist_wininst) so that easy_install numpy works on > windows. > > OK, I'll do the same then. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From vincent at vincentdavis.net Tue Jun 1 09:32:32 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Tue, 1 Jun 2010 07:32:32 -0600 Subject: [SciPy-Dev] Development process (was: scipy.stats) In-Reply-To: References: Message-ID: On Mon, May 31, 2010 at 11:02 PM, Travis Oliphant wrote: > How many people interested in this discussion will be at SciPy this year? ?It may be a good idea to have a discussion about this at the conference. ? ?We could phone conference others in as well so that every voice can be heard. I think it should be done here on the list. This makes it easier for all to review and refer back to. Also makes it more open, NOT that you are trying to do it behind closed doors. > I do think we need to address this issue. ? I did not realize I was offending people with my enthusiasm for having a chance to work on SciPy. ? I have always resisted too much "procedure" and "policy" so that it becomes difficult for people to contribute. ? ?I really think technology changes and DVCS can help with this process. I am all for DVCS. (I posted this on another thread but it is more appropriate here or maybe I just want to repeat myself :) ) The diversity of perspective as to who has done what and more importantly if it was right or ok to do seems to imply that there is lack of clear roles/policies. If there was it seems that we could hope there would not be this diversity in perspectives. 
> > -Travis > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From charlesr.harris at gmail.com Tue Jun 1 09:50:30 2010 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 1 Jun 2010 07:50:30 -0600 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 2:43 AM, Travis Oliphant wrote: > > On Jun 1, 2010, at 3:12 AM, josef.pktd at gmail.com wrote: > > On Tue, Jun 1, 2010 at 12:54 AM, Travis Oliphant > wrote: > > > On May 31, 2010, at 9:16 AM, josef.pktd at gmail.com wrote: > > > Since Travis seems to want to take back control of scipy.stats, I am > > considering my role as inofficial maintainer as ended. > > > Obviously I've offended you. That has never been my intent. I apologize > if my enthusiasm for getting some changes that I wanted to see into SciPy > stepped on an area you felt ownership of. I do not mind if people add > changes to code that I've written and I assume that others feel the same. > That has always been the development mode of SciPy. We clearly have > different development styles. I think we can find a way to work together. > I think the move to github will help. > > > I did not understand that you felt such ownership of scipy.stats. I have > certainly appreciated your input. > > > I do like a more "free-wheeling" style to code development than one that is > bogged down with "rules" and "procedures". This clearly is not your > style. For me, it comes down to time to spend. I love working on SciPy > and NumPy. I don't have a lot of time to do it. When I see quick > changes I can make that add value I like to be able to do it. I think we > both want the same thing while we may disagree about the best way to get > there. > > In my mind, discussion doesn't end when a check-in is made --- it just > begins. You should never interpret my checking something in as the final > word. We clearly have a different view of "trunk" > > > I certainly don't want my approach to open source development to offend > others or chase them away. If I check in something you don't like, then > tell me and let's talk about it. If you need to vent and call me names, a > private email to me or others can go a long way. > > > What do we need to do to keep you around? Is there specifically something > you didn't like about my recent check-ins? > > > In this case, the features added were not terribly extensive. The current > unit tests helped ferret out major problems. Yes, I could write more tests > and documentation, and you have been a model of writing tests and > documentation. I have been particularly impressed by the amount of quality > documentation you have written. > > > While you seem to dismiss the episode as problematic, I actually think > curve_fit was a good example of how something very positive can emerge > quickly when people are open and willing to work together. > > > While formal, strict test-driven development is easy to point to for > salvation -- it does have its costs. I've always used informal test-driven > development. Just because I don't *always* add formal unit tests for every > piece of code written does not mean the code that is currently in SciPy is > un-tested and useless. Such an approach leaves me open to criticism, which > I acknowledge. But, I think there have been far too many dismissive > comments about the state of the code. 
> > > I would argue that the problem with scipy.stats does not lie mainly in > distributions.py or the lack of test-driven-development --- but in the lack > of certain easy to use features. Quality code comes out of people who > care --- not out of procedure. > > > I think you are someone who cares and your code reflects that. We would > all benefit from your staying part of the main development. > > > (not answering inline to keep thoughts together) > > I think the main disagreements are about the quality control of the > trunk and whether scipy development is a community effort or not. > > > I certainly think scipy development is a community effort. I'm very sorry > for making you feel "dumped" on. That has never been my intent. I was > simply hoping to contribute a little where I could. > > As Skipper described, in statsmodels almost all development occurs in > the sandbox and in branches, and it is only included in the "official" > core of statsmodels after it has been verified and tests have been > added. sandbox code is everything from first draft version to almost > finished code. > And one of Skippers task in his gsoc is to clean out the sandbox. > Once it is in trunk (core) any further refactoring follows very strict > rules. > > > This has not been SciPy's process. I can understand people may want it to > become SciPy's process, but it has not been. There are dangers of this > process --- there is a reason that the mantra of "release early and release > often". It can also prevent progress when you are dealing with people's > spare time because all of that process takes time and man-power and effort. > There is some value in it, I'm just not sure the extent of that value in > contrast to other uses of that time. > > Numpy/Scipy has changed from the days when there were just a few folks involved and the urgent need was to get some code, any code, out there. I'm sure many projects start that way because in beginning the idea is the important thing, the perfection of the implementation not so much. But as things progress and more people use the code, correctness becomes important. The numpy/numeric C code itself shows this process, with the early code quality being what I would classify as "undergraduate" C. That doesn't mean Numeric wasn't useful, obviously many people found it so or we wouldn't be here, but it does mean that the code wasn't easy to maintain or understand. Now the basic ideas have been worked out and the originators have moved on while at the same time the code has become more widely used, so the need becomes maintenance, correctness, distribution, and attracting the people to do those things. That requires a different sort of process. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan.czesla at hs.uni-hamburg.de Tue Jun 1 13:05:19 2010 From: stefan.czesla at hs.uni-hamburg.de (Stefan) Date: Tue, 1 Jun 2010 17:05:19 +0000 (UTC) Subject: [SciPy-Dev] =?utf-8?q?np=2Esavetxt=3A_apply_patch_in_enhancement_?= =?utf-8?q?ticket_1079_to=09add_headers=3F?= References: Message-ID: Skipper Seabold gmail.com> writes: > > Hi all, > > I am assuming that this is ok to request via the list... Could we > discuss or could someone apply the patch in enhancement ticket 1079? > > http://projects.scipy.org/numpy/ticket/1079 > > I needed this functionality recently, and this is a quick and easy fix > that may have been overlooked. 
> > There is also another enhancement request about this here: > http://projects.scipy.org/numpy/ticket/1236 > > The only thing that I can think of that might need to be added is a > test to see that the header length is the same as the number of > columns, but really that might just be up to the user to supply the > right headers. It might also be nice to have a header = True, that > uses the field names for a structured array, but I can live without > that. > > Cheers, > > Skipper > Hi, +1; we have the same problem quite frequently. Our current solution looks similar to what has been proposed in ticket 1079, and we wonder why a solution has not yet found its way into the official release of numpy. We can, however, image a slightly different implementation and would like to hear the community's opinion on it. If the header is given as a plane string (such as envisaged in ticket 1079), the user has to care for the correct formatting, in particular, the user has to supply the comment character(s) and the new line formatting. This might be against intuition, because many users will at first try to supply their header(s) without specifying those formatting characters. The result will be a file not readable with numpy.loadtxt, and the error might not be detected right away. As numpy.loadtxt has a default comment character ('#'), the same may be implemented for numpy.savetxt. In this case, numpy.savetxt would get two additional keywords (e.g. header, comment(character)), which bloats the interface, but potentially provides more safety. Cheers, Stefan & Christian From jsseabold at gmail.com Tue Jun 1 13:48:28 2010 From: jsseabold at gmail.com (Skipper Seabold) Date: Tue, 1 Jun 2010 13:48:28 -0400 Subject: [SciPy-Dev] np.savetxt: apply patch in enhancement ticket 1079 to add headers? In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 1:05 PM, Stefan wrote: > Skipper Seabold gmail.com> writes: > >> >> Hi all, >> >> I am assuming that this is ok to request via the list... ?Could we >> discuss or could someone apply the patch in enhancement ticket 1079? >> >> http://projects.scipy.org/numpy/ticket/1079 >> >> I needed this functionality recently, and this is a quick and easy fix >> that may have been overlooked. >> >> There is also another enhancement request about this here: >> http://projects.scipy.org/numpy/ticket/1236 >> >> The only thing that I can think of that might need to be added is a >> test to see that the header length is the same as the number of >> columns, but really that might just be up to the user to supply the >> right headers. ?It might also be nice to have a header = True, that >> uses the field names for a structured array, but I can live without >> that. >> >> Cheers, >> >> Skipper >> > > Hi, And here I was thinking no one was listening so long ago. > > +1; we have the same problem quite frequently. Our current solution looks > similar to what has been proposed in ticket 1079, and we wonder why a solution > has not yet found its way into the official release of numpy. > > We can, however, image a slightly different implementation and would like to > hear the community's opinion on it. > > If the header is given as a plane string (such as envisaged in ticket 1079), the > user has to care for the correct formatting, in particular, the user has to > supply the comment character(s) and the new line formatting. This might be > against intuition, because many users will at first try to supply their > header(s) without specifying those formatting characters. 
The result will be a > file not readable with numpy.loadtxt, and the error might not be detected right > away. I'm not sure I understand why I would want to specify a comment character for writing a csv file (unless of course I had some comments to add). Also note that since that patch was written, savetxt takes a user supplied newline keyword, so you can just append that to the header string. > > As numpy.loadtxt has a default comment character ('#'), the same may be > implemented for numpy.savetxt. In this case, numpy.savetxt would get two > additional keywords (e.g. header, comment(character)), which bloats the > interface, but potentially provides more safety. > FWIW, I ended up rolling my own using the most recent pre-Python 3 changes for savetxt that accepts a list of names instead of one string or if the provided array has the attribute dtype.names (non-nested rec or structured arrays) it uses those. Whatever is done I think the support for structured arrays is nice, and I think having this functionality is a no-brainer. I need it quite often. Skipper From matthew.brett at gmail.com Tue Jun 1 13:55:08 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 1 Jun 2010 10:55:08 -0700 Subject: [SciPy-Dev] Development process (was: scipy.stats) In-Reply-To: References: Message-ID: Hi, On Tue, Jun 1, 2010 at 6:32 AM, Vincent Davis wrote: > On Mon, May 31, 2010 at 11:02 PM, Travis Oliphant > wrote: > >> How many people interested in this discussion will be at SciPy this year? ?It may be a good idea to have a discussion about this at the conference. ? ?We could phone conference others in as well so that every voice can be heard. > > I think it should be done here on the list. This makes it easier for > all to review and refer back to. Also makes it more open, NOT that you > are trying to do it behind closed doors. I agree very much that discussion on the list is better. I think it helps solidify the idea of numpy and scipy being a community project, where all discussion is public and open. I know that can be a little tough sometimes, but that too has its benefits in clearing the air and making people feel that the discussion is open. See you, Matthew From matthew.brett at gmail.com Tue Jun 1 14:57:10 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 1 Jun 2010 11:57:10 -0700 Subject: [SciPy-Dev] scipy.stats In-Reply-To: <3430B5AD-E3C2-4CE7-B07F-D8210C2E53D5@enthought.com> References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> <3430B5AD-E3C2-4CE7-B07F-D8210C2E53D5@enthought.com> Message-ID: Hi, >> Well - but that is because you don't maintenance. ? Imagine a >> maintainer puts in a lot of effort to make the code well-documented >> and tested. ?Then, you have put in new code that has neither >> documentation nor tests. ? ?As a good maintainer, it's really painful >> for them that there's new code without documentation or tests. ? They >> can only feel abused in that situation, because it seems as if you are >> expecting them to clean up after you - without asking. > > I don't think that is fair. ?I have been "maintaining" SciPy and NumPy code for over 10 years. ? I have done an immense amount of work in porting SciPy to NumPy and continuing to fix bugs that I am made aware of. ?I don't have as much time to commit to SciPy as I would like. I wasn't really saying whether it was fair or not, I was only trying to explain why it might cause offense. 
When I say that you don't do maintenance, I mean that you are not currently the person who has to make sure that the code is readable and maintainable. That is hard and often thankless work. I presume that you agree that numpy and scipy code should have documentation and tests. I presume also that when you commit code without documentation or tests, that you do not usually intend to come back and do these later - say - before the next release. That means that someone else has to do it. It will take them a lot longer than it would take you because they don't know the code as well. I realize this is not your intent, but, it's tempting in this situation to feel that you think that your time is more valuable than the person who has to write the documentation and tests - and that's a painful feeling to have - hence - I believe - the level of bad feeling that arises... See you, Matthew From d.l.goldsmith at gmail.com Tue Jun 1 14:59:48 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 11:59:48 -0700 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 6:50 AM, Charles R Harris wrote: > Numpy/Scipy has changed from the days when there were just a few folks > involved and the urgent need was to get some code, any code, out there. I'm > sure many projects start that way because in beginning the idea is the > important thing, the perfection of the implementation not so much. But as > things progress and more people use the code, correctness becomes important. > The numpy/numeric C code itself shows this process, with the early code > quality being what I would classify as "undergraduate" C. That doesn't mean > Numeric wasn't useful, obviously many people found it so or we wouldn't be > here, but it does mean that the code wasn't easy to maintain or understand. > Now the basic ideas have been worked out and the originators have moved on > while at the same time the code has become more widely used, so the need > becomes maintenance, correctness, distribution, and attracting the people to > do those things. That requires a different sort of process. > And reliability, i.e., it is not enough to claim that the code is correct, people (scientists whose reputations are at stake) need to be able to *rely* on it being correct. Perhaps it hasn't been long enough, but I note two things at this point: 0) No one has disputed that we have (and have had for some amount of time, i.e., it didn't go into affect yesterday) a standing policy that new code submissions are supposed to have passing tests and a Standard-compliant docstring *before* being checked-in, and 1) No one has indicated a specific place where this (or any other standing policy) may be found for reference. So, I propose that we establish such a place - even if we don't presently populate it with *anything* - so that, if we wish to discuss, e.g., whether or not rules may be subject to individuals' "style," we can at least all know exactly what rules we're discussing. DG > > > > Chuck > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. 
(As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Tue Jun 1 15:24:22 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Jun 2010 15:24:22 -0400 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> <3430B5AD-E3C2-4CE7-B07F-D8210C2E53D5@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 2:57 PM, Matthew Brett wrote: > Hi, > >>> Well - but that is because you don't maintenance. ? Imagine a >>> maintainer puts in a lot of effort to make the code well-documented >>> and tested. ?Then, you have put in new code that has neither >>> documentation nor tests. ? ?As a good maintainer, it's really painful >>> for them that there's new code without documentation or tests. ? They >>> can only feel abused in that situation, because it seems as if you are >>> expecting them to clean up after you - without asking. >> >> I don't think that is fair. ?I have been "maintaining" SciPy and NumPy code for over 10 years. ? I have done an immense amount of work in porting SciPy to NumPy and continuing to fix bugs that I am made aware of. ?I don't have as much time to commit to SciPy as I would like. > > I wasn't really saying whether it was fair or not, I was only trying > to explain why it might cause offense. > > When I say that you don't do maintenance, I mean that you are not > currently the person who has to make sure that the code is readable > and maintainable. ? That is hard and often thankless work. > > I presume that you agree that numpy and scipy code should have > documentation and tests. ? ?I presume also that when you commit code > without documentation or tests, that you do not usually intend to come > back and do these later - say - before the next release. ? That means > that someone else has to do it. ?It will take them a lot longer than > it would take you because they don't know the code as well. > > I realize this is not your intent, but, it's tempting in this > situation to feel that you think that your time is more valuable than > the person who has to write the documentation and tests - and that's a > painful feeling to have - hence - I believe - the level of bad feeling > that arises... Just to emphasis my point I'm mainly concerned about quality control of trunk. Open source development is still a collaborative process, and the person to write the code and the final tests doesn't necessarily have to be the same person. For example, Skipper is doing a lot more than his "fair" share of writing formal tests in statsmodels, and I'm writing a good amount of test code for scipy. (Skipper and I usually provide sufficient documentation, developer comments, and references that we are in most cases able to understand the code.) But if the original coder doesn't have the time to bring the code up to testing and documentation standard, then the code should stay out of trunk until someone finds the time to get it through a review and quality control process. The problem is that, that someone might not be able to figure out how to fix possible problems (my example is Fisher's exact test). 
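To make "verification for the main use case" concrete, the minimum I have in mind is roughly the sketch below: one check against an independently known value and one consistency check, nothing exhaustive. (The normal distribution is used here only for illustration.)

import numpy as np
from numpy.testing import assert_almost_equal
from scipy import stats

def test_norm_basic():
    # value check against the hand-computed reference pdf(0) = 1/sqrt(2*pi)
    assert_almost_equal(stats.norm.pdf(0.0), 1.0 / np.sqrt(2 * np.pi), decimal=12)
    # consistency check: ppf should invert cdf over the range users actually hit
    x = np.linspace(-3, 3, 7)
    assert_almost_equal(stats.norm.ppf(stats.norm.cdf(x)), x, decimal=8)

Anything beyond that can still be added once the code is in trunk, but without at least this much nobody can rely on the results.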
Josef > > See you, > > Matthew > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From matthew.brett at gmail.com Tue Jun 1 15:24:41 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 1 Jun 2010 12:24:41 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) Message-ID: Hi, I thought I'd split this one off onto its own thread too because it's an important issue independent of scipy.stats > For me it's easier to develop and mature inside a pure python package, > which is also more accessible for new contributors. One of my wishful > target audience are contributors on Windows, which would become rather > difficult as part of scipy and git. I would imagine that everyone agrees that it's very important that we have developers like you (Josef) who are using windows as their main platform. It's by far the best way to make sure we are shaking out bugs on windows. So, given that there seems a strong mood to switch to git, we should make sure that this does not cause problems for windows developers. So: Josef - and others a) are there any problems that you know of using git from the windows shell? b) Do you think you would prefer to use mercurial as a client for the git repo : http://github.com/blog/439-hg-git-mercurial-plugin ? In that case we should set up documentation for that. c) Do you want to stick with bzr? That might be possible (https://launchpad.net/bzr-git, http://github.com/matthew-brett/git-bzr) but that will likely be considerably harder than a mercurial client. See you, Matthew From stefan at sun.ac.za Tue Jun 1 15:25:36 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Tue, 1 Jun 2010 12:25:36 -0700 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: On 1 June 2010 11:59, David Goldsmith wrote: > Perhaps it hasn't been long enough, but I note two things at this point: > > 0) No one has disputed that we have (and have had for some amount of time, > i.e., it didn't go into affect yesterday) a standing policy that new code > submissions are supposed to have passing tests and a Standard-compliant > docstring *before* being checked-in, and We have had many discussions around unit testing and code review, but the fact is that there is no such policy. Whether that should change or not is another question. > 1) No one has indicated a specific place where this (or any other standing > policy) may be found for reference. Developer guidelines may be found here: http://projects.scipy.org/scipy http://projects.scipy.org/numpy Ideally, all the guidelines should be checked in to the repo under numpy/docs (the documentation guidelines, for example, already are). Regards St?fan From d.l.goldsmith at gmail.com Tue Jun 1 15:31:54 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 12:31:54 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 12:24 PM, Matthew Brett wrote: > Hi, > > I thought I'd split this one off onto its own thread too because it's > an important issue independent of scipy.stats > > > For me it's easier to develop and mature inside a pure python package, > > which is also more accessible for new contributors. One of my wishful > > target audience are contributors on Windows, which would become rather > > difficult as part of scipy and git. 
> > I would imagine that everyone agrees that it's very important that we > have developers like you (Josef) who are using windows as their main > platform. It's by far the best way to make sure we are shaking out > bugs on windows. > > So, given that there seems a strong mood to switch to git, we should > make sure that this does not cause problems for windows developers. > > So: Josef - and others > > a) are there any problems that you know of using git from the windows > shell? > None in principle here (and from what I've garnered through the discussion, I am supportive of the move, as long as we don't deprecate the SVN trunk too quickly), but do we have anyone, even just one person, who is already reasonably facile in this regard who'd be willing to support others through the transition? DG > b) Do you think you would prefer to use mercurial as a client for the > git repo : http://github.com/blog/439-hg-git-mercurial-plugin ? In > that case we should set up documentation for that. > c) Do you want to stick with bzr? That might be possible > (https://launchpad.net/bzr-git, > http://github.com/matthew-brett/git-bzr) but that will likely be > considerably harder than a mercurial client. > > See you, > > Matthew > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Tue Jun 1 15:31:55 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Tue, 1 Jun 2010 12:31:55 -0700 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: 2010/6/1 St?fan van der Walt : >> 0) No one has disputed that we have (and have had for some amount of time, >> i.e., it didn't go into affect yesterday) a standing policy that new code >> submissions are supposed to have passing tests and a Standard-compliant >> docstring *before* being checked-in, and > > We have had many discussions around unit testing and code review, but > the fact is that there is no such policy. ?Whether that should change > or not is another question. Looks like I read your message too hastily. I meant to comment on a policy surrounding addition of tests and code review. By the way, you'll notice that we have *guidelines*, not policy. I think that this is an important indicator of the way that SciPy development takes place (we agree by consensus and help each other out, rather than enforcing restrictions). If our guidelines may be modified to benefit one another so that we may all enjoy working on SciPy, that would be a good thing. Regards St?fan From matthew.brett at gmail.com Tue Jun 1 15:44:00 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 1 Jun 2010 12:44:00 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: Hi, >> a) are there any problems that you know of using git from the windows >> shell? 
> > None in principle here (and from what I've garnered through the discussion, > I am supportive of the move, as long as we don't deprecate the SVN trunk too > quickly), but do we have anyone, even just one person, who is already > reasonably facile in this regard who'd be willing to support others through > the transition? I would not claim to be very experienced, but I have not had any problems using msysgit with either the windows shell or the (rather good) windows power shell. The bash shell does have problems but the windows shells have proved more useful. I'd certainly be willing to help as far as I can - but I think the next step is to find what problems people are having (or expect to have) and go from there. See you, Matthew From d.l.goldsmith at gmail.com Tue Jun 1 15:47:59 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 12:47:59 -0700 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: 2010/6/1 St?fan van der Walt > 2010/6/1 St?fan van der Walt : > >> 0) No one has disputed that we have (and have had for some amount of > time, > >> i.e., it didn't go into affect yesterday) a standing policy that new > code > >> submissions are supposed to have passing tests and a Standard-compliant > >> docstring *before* being checked-in, and > > > > We have had many discussions around unit testing and code review, but > > the fact is that there is no such policy. Whether that should change > > or not is another question. > > Looks like I read your message too hastily. I meant to comment on a > policy surrounding addition of tests and code review. > > By the way, you'll notice that we have *guidelines*, not policy. I > think that this is an important indicator of the way that SciPy > development takes place (we agree by consensus and help each other > out, rather than enforcing restrictions). > > If our guidelines may be modified to benefit one another so that we > may all enjoy working on SciPy, that would be a good thing. > > Regards > St?fan > Thanks, St?fan. So, we don't have a policy (or even a guideline that I could see) addressing minimum requirements code must meet before check-in - my apologies to all, and especially Travis. DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Tue Jun 1 15:53:27 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 12:53:27 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 12:44 PM, Matthew Brett wrote: > Hi, > >> a) are there any problems that you know of using git from the windows > >> shell? > > > > None in principle here (and from what I've garnered through the > discussion, > > I am supportive of the move, as long as we don't deprecate the SVN trunk > too > > quickly), but do we have anyone, even just one person, who is already > > reasonably facile in this regard who'd be willing to support others > through > > the transition? > > I would not claim to be very experienced, but I have not had any > problems using msysgit with either the windows shell or the (rather > good) windows power shell. The bash shell does have problems but > the windows shells have proved more useful. > > I'd certainly be willing to help as far as I can - but I think the > next step is to find what problems people are having (or expect to > have) and go from there. 
> Of course, but it's comforting to know there's someone in the community whom we might hope will know the answers to questions that arise. :-) FFR, what are your platform specifics (e.g., I'm running Win7 Home Prem. 64bit), in case it turns out to matter. DG > > See you, > > Matthew > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Tue Jun 1 16:31:28 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Jun 2010 16:31:28 -0400 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 3:44 PM, Matthew Brett wrote: > Hi, > >>> a) are there any problems that you know of using git from the windows >>> shell? >> >> None in principle here (and from what I've garnered through the discussion, >> I am supportive of the move, as long as we don't deprecate the SVN trunk too >> quickly), but do we have anyone, even just one person, who is already >> reasonably facile in this regard who'd be willing to support others through >> the transition? > > I would not claim to be very experienced, but I have not had any > problems using msysgit with either the windows shell or the (rather > good) windows power shell. ? ?The bash shell does have problems but > the windows shells have proved more useful. It depends a lot on the part that I am working on. I wouldn't want to switch statsmodels where I do my main development to git. For scipy.stats (or bugfixes in other parts of scipy) I will give git a try, or look at the mercurial interface, if git doesn't work out for me. My main problem with git was the treatment of the file system, and I find it much easier to work with separate branches as in bzr or mercurial. For scipy, I never had to maintain a longer lived branch where I needed to worry about synchronizing with a changing trunk. I prepare most changes in scipy on standalone files, because they have a much faster development and test cycle, and merging them back into the scipy source is usually easy. (caveat: large/invasive changes like Ralf's docstring improvements are a lot more difficult to handle this way, but he was finally able to commit them himself.) And since I never (except for two c code bugfixes in numpy random) worked on compiled code, I didn't need a full develop-compile-test cycle. So, any version control system is fine with me, and maybe I can get used to the advantages of git. As long as it is possible to stick with the basic workflow of git without anything fancy, similar what I have seen while skimming the nipy docs, I think it is not a problem on windows. The basic commands and for example eclipse, GUI plugins look similar enough. However, if/when parts of statsmodels go into scipy and I have to do maintenance of less isolated code, then I think the Mercurial interface might be my preferred choice. I haven't used Mercurial much yet, but I don't see any problems with it. 
So, the bottom line is, that documentation for the hg-git interface would be very useful for Windows users (or those that think git is a strange/unfamiliar concept.) Josef > > I'd certainly be willing to help as far as I can - but I think the > next step is to find what problems people are having (or expect to > have) and go from there. > > See you, > > Matthew > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From d.l.goldsmith at gmail.com Tue Jun 1 16:32:20 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 13:32:20 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? Message-ID: The docstring Standard seems to be careful to note which sections are considered optional, and the "Extended Summary" is *not* on that list. However, I'm encountering many SciPy docstrings in the Wiki lacking this section and yet marked as "Needs review": should I ignore this deficiency and add a ticket to clarify the Standard, or should such docstrings be moved back to "Being written"? DG -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Tue Jun 1 16:40:12 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Tue, 1 Jun 2010 13:40:12 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On 1 June 2010 13:32, David Goldsmith wrote: > The docstring Standard seems to be careful to note which sections are > considered optional, and the "Extended Summary" is *not* on that list. > However, I'm encountering many SciPy docstrings in the Wiki lacking this > section and yet marked as "Needs review": should I ignore this deficiency > and add a ticket to clarify the Standard, or should such docstrings be moved > back to "Being written"? Typically, there is no reason not to have an extended section. Can you give an example where it would seem unnecessary? Unless those functions mentioned above are exceptional, we should probably add blurbs for them. Regards St?fan From josef.pktd at gmail.com Tue Jun 1 16:45:57 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Jun 2010 16:45:57 -0400 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> Message-ID: 2010/6/1 St?fan van der Walt : > 2010/6/1 St?fan van der Walt : >>> 0) No one has disputed that we have (and have had for some amount of time, >>> i.e., it didn't go into affect yesterday) a standing policy that new code >>> submissions are supposed to have passing tests and a Standard-compliant >>> docstring *before* being checked-in, and >> >> We have had many discussions around unit testing and code review, but >> the fact is that there is no such policy. ?Whether that should change >> or not is another question. > > Looks like I read your message too hastily. ?I meant to comment on a > policy surrounding addition of tests and code review. > > By the way, you'll notice that we have *guidelines*, not policy. 
?I > think that this is an important indicator of the way that SciPy > development takes place (we agree by consensus and help each other > out, rather than enforcing restrictions). > > If our guidelines may be modified to benefit one another so that we > may all enjoy working on SciPy, that would be a good thing. I don't know or remember whether the guidelines have ever been decided upon, but my impression was that offering larger changes for review has become the established, de facto rule. Maybe it's time to spell out the conclusions explicitly, so we don't have to repeat the same discussion every one to one and a half years. http://mail.scipy.org/pipermail/scipy-dev/2009-February/011241.html That thread is too long to see whether there was any conclusion. Josef > > Regards > St?fan > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From stefan at sun.ac.za Tue Jun 1 16:45:47 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Tue, 1 Jun 2010 13:45:47 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: On 1 June 2010 13:31, wrote: > So, the bottom line is, that documentation for the hg-git interface > would be very useful for Windows users (or those that think git is a > strange/unfamiliar concept.) These interfaces are somewhat dangerous, in the sense that you may encounter rather untypical scenarios and strange bugs in those tools (for example, we even have to be careful with git-svn, and that tool is widely used). Do you think a clear, simple, numpy/scipy-oriented tutorial could sufficiently lower the barrier to adoption? I think the bzr work-flow you are used to is probably very similar to the one you'd follow with git. Regards St?fan From d.l.goldsmith at gmail.com Tue Jun 1 16:48:08 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 13:48:08 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: 2010/6/1 St?fan van der Walt > On 1 June 2010 13:32, David Goldsmith wrote: > > The docstring Standard seems to be careful to note which sections are > > considered optional, and the "Extended Summary" is *not* on that list. > > However, I'm encountering many SciPy docstrings in the Wiki lacking this > > section and yet marked as "Needs review": should I ignore this deficiency > > and add a ticket to clarify the Standard, or should such docstrings be > moved > > back to "Being written"? > > Typically, there is no reason not to have an extended section. Can > you give an example where it would seem unnecessary? No: my position would appear to be the same as yours, and my inclination would be to "revert" them to "Being written." I'm basically inviting people to tell me that that would be too strict. :-) So far, it's +1 that it wouldn't. DG > Unless those > functions mentioned above are exceptional, we should probably add > blurbs for them. > > Regards > St?fan > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. 
(As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Tue Jun 1 16:51:51 2010 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 1 Jun 2010 14:51:51 -0600 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: 2010/6/1 St?fan van der Walt > On 1 June 2010 13:31, wrote: > > So, the bottom line is, that documentation for the hg-git interface > > would be very useful for Windows users (or those that think git is a > > strange/unfamiliar concept.) > > These interfaces are somewhat dangerous, in the sense that you may > encounter rather untypical scenarios and strange bugs in those tools > (for example, we even have to be careful with git-svn, and that tool > is widely used). > > Do you think a clear, simple, numpy/scipy-oriented tutorial could > sufficiently lower the barrier to adoption? I think the bzr work-flow > you are used to is probably very similar to the one you'd follow with > git. > > I looked at the trial version of smartgit a while back and it seemed decent to me as a git interface. I didn't actually use it, though. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From vincent at vincentdavis.net Tue Jun 1 16:55:37 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Tue, 1 Jun 2010 14:55:37 -0600 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> <3430B5AD-E3C2-4CE7-B07F-D8210C2E53D5@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 1:24 PM, wrote: > Open source development is still a collaborative process, and the > person to write the code and the final tests doesn't necessarily have > to be the same person. For example, Skipper is doing a lot more than > his "fair" share of writing formal tests in statsmodels, and I'm > writing a good amount of test code for scipy. > (Skipper and I usually provide sufficient documentation, developer > comments, and references that we are in most cases able to understand > the code.) > > But if the original coder doesn't have the time to bring the code up > to testing and documentation standard, then the code should stay out > of trunk until someone finds the time to get it through a review and > quality control process. The problem is that, that someone might not > be able to figure out how to fix possible problems (my example is > Fisher's exact test). I think this is important and as someone that is fairly new to python and open source development I greatly value they input of others in reviewing and making suggestions to my contributions. In addition, I might not know how to finish all parts of a contribution and knowing that I can contribute pieces that are within my abilities and others can review and possibly finish encourage me to contribute. 
Vincent > > Josef > > > > > > > >> >> See you, >> >> Matthew >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From fperez.net at gmail.com Tue Jun 1 18:11:27 2010 From: fperez.net at gmail.com (Fernando Perez) Date: Tue, 1 Jun 2010 15:11:27 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: Hi Josef, On Tue, Jun 1, 2010 at 1:31 PM, wrote: > My main problem with git was the treatment of the file system, and I > find it much easier to work with separate branches as in bzr or > mercurial. One thing I've found very useful after transitioning to git for IPython is the git new-workdir command: http://kerneltrap.org/mailarchive/git/2008/5/21/1900044 http://nuclearsquid.com/writings/git-new-workdir.html It lets me keep a few branches around that I want 'permanent' on my filesystem, in a bzr shared-repo style, while using git for the lightweight feature-only branches. This is how it looks like right now on my system: - Main ipython git repo: uqbar[ipython]> cd ipython/ (Master)uqbar[ipython]> git branch -a 0.10 0.10.1 0.8 0.9 * Master master remotes/mainline/0.10 remotes/mainline/0.10.1 remotes/mainline/0.8 remotes/mainline/0.9 remotes/mainline/master remotes/min/0.10 remotes/min/0.10.1 remotes/min/0.8 remotes/min/0.9 remotes/min/master remotes/origin/master And a separate 'branches' repo, populated with new-workdir: (Master)uqbar[ipython]> cd ../branches uqbar[branches]> d /home/fperez/ipython/branches total 16 drwxr-xr-x 10 fperez 4096 2010-05-13 01:35 0.10/ drwxr-xr-x 10 fperez 4096 2010-05-13 15:48 0.10.1/ where I keep branches I may need to see persistently on disk. HTH. Cheers, f From matthew.brett at gmail.com Tue Jun 1 18:14:45 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 1 Jun 2010 15:14:45 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: Hi, > My main problem with git was the treatment of the file system, and I > find it much easier to work with separate branches as in bzr or > mercurial. Yes, it is true that the git lightweight branch model takes some time to get used to. My experience is that it's quick to get used to the git way, and once I did, it was a large relief to get rid of all those branch directories when we switched, but I understand that it is a leap. I am sure you know this, but you can replicate the heavyweight branches of hg and bzr with: # initial git clone of 'trunk' git clone git://github.com/nipy/nipy.git # make a heavyweight branch git clone nipy my-nipy-branch # push somewhere # First add repo for the branch via github interface, then cd my-nipy-branch git remote add origin git at github.com:matthew-brett/my-nipy-branch.git git push origin master I think you'd agree that it's not a windows / unix difference though. I'd agree it is a larger conceptual leap from svn to git than it is from svn to bzr or svn to mercurial. The git argument is that making that initial leap gives you a great deal of freedom and flexibility, but it can be intimidating at first. > As long as it is possible to stick with the basic workflow of git > without anything fancy, similar what I have seen while skimming the > nipy docs, I think it is not a problem on windows. 
I think that is true that most of us won't need to go further than the nipy basic workflow - but we haven't been using git long enough to know that very well. I would defer to the git masters out there - David, Pauli and others - ? > However, if/when parts of statsmodels go into scipy and I have to do > maintenance of less isolated code, then I think the Mercurial > interface might be my preferred choice. > > I haven't used Mercurial much yet, but I don't see any problems with it. > > So, the bottom line is, that documentation for the hg-git interface > would be very useful for Windows users (or those that think git is a > strange/unfamiliar concept.) So - two issues: 1) The conceptual issues involved in switching mind-set from svn or bzr to git. That may require some thought and documentation 2) There might be some technical issues using git on windows - but I think so far we don't have any reason to think so? 3) Some people may prefer mercurial for other reasons; it would be good to respect that if possible. So, it may well be worth making a hg-git doc for numpy when we do the transition - with the caveats that David raised. In the meantime, it would be very good to hear of any problems that do come up specifically using git on windows... See you, Matthew From matthew.brett at gmail.com Tue Jun 1 18:15:52 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 1 Jun 2010 15:15:52 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: > So, it may well be worth making a hg-git doc for numpy when we do the > transition - with the caveats that David raised. That Stefan raised ! Sorry man. I just fused the two of you because you're both so awesome.. ;) Matthew From d.l.goldsmith at gmail.com Tue Jun 1 18:37:09 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 15:37:09 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: 2010/6/1 St?fan van der Walt > On 1 June 2010 13:31, wrote: > > So, the bottom line is, that documentation for the hg-git interface > > would be very useful for Windows users (or those that think git is a > > strange/unfamiliar concept.) > > These interfaces are somewhat dangerous, in the sense that you may > encounter rather untypical scenarios and strange bugs in those tools > (for example, we even have to be careful with git-svn, and that tool > is widely used). > > Do you think a clear, simple, numpy/scipy-oriented tutorial could > sufficiently lower the barrier to adoption? Is that a rhetorical question? I don't think there's any doubt that such would very likely have the stated result. The question is, is that your way of offering to write it? :-) DG > I think the bzr work-flow > you are used to is probably very similar to the one you'd follow with > git. > > Regards > St?fan > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From stefan at sun.ac.za Tue Jun 1 19:03:40 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Tue, 1 Jun 2010 16:03:40 -0700 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> <3430B5AD-E3C2-4CE7-B07F-D8210C2E53D5@enthought.com> Message-ID: On 1 June 2010 13:55, Vincent Davis wrote: > I think this is important and as someone that is fairly new to python > and open source development I greatly value they input of others in > reviewing and making suggestions to my contributions. In addition, I > might not know how to finish all parts of a contribution and knowing > that I can contribute pieces that are within my abilities and others > can review and possibly finish encourage me to contribute. This is one scenario in which a DVCS really shines: you can mature your code, incorporating feedback as you go along, and have it included once its ready. Like you suggest, that actually lowers the barrier to entry. Asking contributors to repeatedly rework patches quickly turns into a mess. Regards St?fan From oliphant at enthought.com Tue Jun 1 19:25:35 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 1 Jun 2010 18:25:35 -0500 Subject: [SciPy-Dev] [SciPy-User] log pdf, cdf, etc In-Reply-To: References: <6D1C6011-7B0A-45DB-9B54-6CAE1FA38F71@enthought.com> <12883887-E601-467B-9C56-55BDA8169C19@enthought.com> Message-ID: On Jun 1, 2010, at 8:19 AM, Ralf Gommers wrote: > > > On Tue, Jun 1, 2010 at 1:20 PM, Travis Oliphant wrote: > > On May 31, 2010, at 6:39 AM, Ralf Gommers wrote: > > These recent changes are a bit problematic for several reasons: > - there are many new methods for distributions without tests. > > These methods are simple to see and verify. Which methods specifically are you concerned about? > > They're not all simple, for example rv_continuous._reduce_func. Since it contains inner function definitions inside an "else" block there's also a good chance it's actually broken. > > And in principle I'm worried about all of them. The python 2.4/2.5 syntax error was caught early, but what if some code you regard as simple is broken in a less obvious way on 2.4/2.5? Maybe a user finds it in a release candidate, forcing us to build an extra one? Or just after the final release? > >> - there are no docs for many new private and public methods > > They are all fairly self explanatory. But, docs can be added if needed. > > For you, and maybe for me too. But for undergraduate students, or Joe in accounting who inherited this random app that's essential for his job? It's simple, no public docs without docstrings. And preferably no private ones either. > > Thanks for fixing all public docs quickly though. You missed just one, gamma.fit. > >> - invalid syntax: http://projects.scipy.org/scipy/ticket/1186 > > This has been fixed (it was easier to fix the syntax then file the ticket...) Also to be clear this is only invalid for Python < 2.6 (the comment makes it sound like somehow the changes weren't tested at all). > > I didn't mean to imply that you were committing code that didn't even work for you. >> - the old rv_continuous doc template was put back in > > I'm not sure what you mean. Which change did this? > > The first one of your recent commits, r6392. The docstrings for subclasses of rv_continuous and rv_discrete are not generated from this template anymore, which is why it was removed. Look at line 862 (# generate docstring for subclass instances) and below that to see how it works now. 
> > If you're wondering why that changed, the main reasons are (1) to make the docstrings conform to the standard, (2) to be able to put useful info in the base classes, like "this is how you subclass it: ..." instead of a template, and (3) to be able to customize individual distribution docstrings easily. > > >> >> This, plus Josef saying that he doesn't want to fix the API for some methods yet, makes me want to take it out of the 0.8.x branch. Any objections to that Travis or Josef? > > I would really like to see these changes go in to 0.8.x. If Josef feels strongly about the API in the future, we can change it for the next release. I don't understand what the specific concerns are. > > No you can't. For API changes we do have a policy, they need deprecation first. Which means if we release it like this now, we're stuck with it till 0.10 / 1.0. > > > In summary, I see quite a few reasons why this shouldn't go in and don't see a compelling reason to release it right now. The 0.9 release is (tentatively) planned for September, so you don't have to worry that your changes sit in trunk unreleased for 1.5 years. As the one doing the work of release manager, you have a lot of latitude in making this decision, of course. The compelling reason to release it right now is to get the improved features which nobody has actually voiced specific concerns about. Specifically improvements to the fit method of distribution objects (the ability to fix specific parameters of the distribution and vary others in the fit) is a very nice-to-have feature. The API change problem you mention is actually an argument for putting it in now (because we *can* deprecate it in 0.9 and then have whatever unspecified correct API come out in 1.0). I have not heard that there is real disagreement about the API either. It feels like I've addressed the major reasons you feel it can't go in. The functionality is tested. There are docstrings. I just removed the rv_continuous doc template. I really don't know why that was added. I did not make a specific change to include it. It must have been a merge error. Suggestions about how to give gamma.fit and beta.fit the docstring of it's parent would be appreciated. I don't think a general rule of "no private methods without docstrings" is necessarily appropriate, and a bit of an example of going overboard with "rules" and "procedures." Private methods are not meant to be called outside of code and should not necessarily have to be documented with docstrings. Every docstring creates more code to maintain and keep consistent with the actual code. One of the great things about Python is that you can read the code itself so that it is much closer to self-documenting code (close to it but not there --- I like comments and docstrings too). Thanks for your efforts. -Travis -------------- next part -------------- An HTML attachment was scrubbed... URL: From oliphant at enthought.com Tue Jun 1 19:33:56 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 1 Jun 2010 18:33:56 -0500 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> <3430B5AD-E3C2-4CE7-B07F-D8210C2E53D5@enthought.com> Message-ID: <1331DFB9-4FCA-44F3-A1D0-C00714A60511@enthought.com> On Jun 1, 2010, at 1:57 PM, Matthew Brett wrote: > Hi, > >>> Well - but that is because you don't maintenance. Imagine a >>> maintainer puts in a lot of effort to make the code well-documented >>> and tested. 
Then, you have put in new code that has neither >>> documentation nor tests. As a good maintainer, it's really painful >>> for them that there's new code without documentation or tests. They >>> can only feel abused in that situation, because it seems as if you are >>> expecting them to clean up after you - without asking. >> >> I don't think that is fair. I have been "maintaining" SciPy and NumPy code for over 10 years. I have done an immense amount of work in porting SciPy to NumPy and continuing to fix bugs that I am made aware of. I don't have as much time to commit to SciPy as I would like. > > I wasn't really saying whether it was fair or not, I was only trying > to explain why it might cause offense. > > When I say that you don't do maintenance, I mean that you are not > currently the person who has to make sure that the code is readable > and maintainable. That is hard and often thankless work. > > I presume that you agree that numpy and scipy code should have > documentation and tests. I presume also that when you commit code > without documentation or tests, that you do not usually intend to come > back and do these later - say - before the next release. That means > that someone else has to do it. It will take them a lot longer than > it would take you because they don't know the code as well. > No, that is actually not what I imply but checking something in to the trunk. I plan to submit tests and docs before the next release when I commit code. I don't expect anyone else to do that for me. I always welcome help, but I don't expect it. I really think this is more about how people view commits to the trunk than anything else. I like to use SVN as a version control system. My commits to trunk are always more incremental. I like to get things committed in self-contained chunks. Adding the requirement to put in documentation and tests before committing stretches out that "incremental" work element to longer than I ever have time for in one sitting. Clearly, if I were using DVCS to a published branch that could be then merged to the trunk this problem would not have arisen. I see that I need to move to that style. People are reading far more into my committing to trunk than I ever meant to imply. -Travis From matthew.brett at gmail.com Tue Jun 1 19:48:25 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 1 Jun 2010 16:48:25 -0700 Subject: [SciPy-Dev] scipy.stats In-Reply-To: <1331DFB9-4FCA-44F3-A1D0-C00714A60511@enthought.com> References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> <3430B5AD-E3C2-4CE7-B07F-D8210C2E53D5@enthought.com> <1331DFB9-4FCA-44F3-A1D0-C00714A60511@enthought.com> Message-ID: Hi, >> I presume that you agree that numpy and scipy code should have >> documentation and tests. ? ?I presume also that when you commit code >> without documentation or tests, that you do not usually intend to come >> back and do these later - say - before the next release. ? That means >> that someone else has to do it. ?It will take them a lot longer than >> it would take you because they don't know the code as well. >> > > No, that is actually not what I imply but checking something in to the trunk. ? ?I plan to submit tests and docs before the next > release when I commit code. ? ?I don't expect anyone else to do that for me. ? I always welcome help, but I don't expect it. 
I am sure if people know that that is what you intend, and when DVCS allows that to happen, no-one will be upset, and we will all return to our usual mode of being very grateful for all the work that you've done and are doing. And - thanks for the clarification - sometimes things that seem obvious - aren't obvious - and it's good to say them out loud... See you, Matthew From david at silveregg.co.jp Tue Jun 1 20:57:18 2010 From: david at silveregg.co.jp (David) Date: Wed, 02 Jun 2010 09:57:18 +0900 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: <4C05AC6E.1030600@silveregg.co.jp> On 06/02/2010 05:31 AM, josef.pktd at gmail.com wrote: > On Tue, Jun 1, 2010 at 3:44 PM, Matthew Brett wrote: >> Hi, >> >>>> a) are there any problems that you know of using git from the windows >>>> shell? >>> >>> None in principle here (and from what I've garnered through the discussion, >>> I am supportive of the move, as long as we don't deprecate the SVN trunk too >>> quickly), but do we have anyone, even just one person, who is already >>> reasonably facile in this regard who'd be willing to support others through >>> the transition? >> >> I would not claim to be very experienced, but I have not had any >> problems using msysgit with either the windows shell or the (rather >> good) windows power shell. The bash shell does have problems but >> the windows shells have proved more useful. > > It depends a lot on the part that I am working on. I wouldn't want to > switch statsmodels where I do my main development to git. I don't think it is anyone's intention to force you to use git for your own packages :) > As long as it is possible to stick with the basic workflow of git > without anything fancy, similar what I have seen while skimming the > nipy docs, I think it is not a problem on windows. The basic commands > and for example eclipse, GUI plugins look similar enough. We started some time ago a document in that respect: instead of describing git's features, we have a workflow-oriented document: http://projects.scipy.org/numpy/wiki/GitWorkflow David From charlesr.harris at gmail.com Tue Jun 1 21:07:27 2010 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 1 Jun 2010 19:07:27 -0600 Subject: [SciPy-Dev] [SciPy-User] log pdf, cdf, etc In-Reply-To: References: <6D1C6011-7B0A-45DB-9B54-6CAE1FA38F71@enthought.com> <12883887-E601-467B-9C56-55BDA8169C19@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 5:25 PM, Travis Oliphant wrote: > > On Jun 1, 2010, at 8:19 AM, Ralf Gommers wrote: > > > > On Tue, Jun 1, 2010 at 1:20 PM, Travis Oliphant wrote: > >> >> On May 31, 2010, at 6:39 AM, Ralf Gommers wrote: >> > > >> These recent changes are a bit problematic for several reasons: >> - there are many new methods for distributions without tests. >> >> These methods are simple to see and verify. Which methods specifically >> are you concerned about? >> > > They're not all simple, for example rv_continuous._reduce_func. Since it > contains inner function definitions inside an "else" block there's also a > good chance it's actually broken. > > And in principle I'm worried about all of them. The python 2.4/2.5 syntax > error was caught early, but what if some code you regard as simple is broken > in a less obvious way on 2.4/2.5? Maybe a user finds it in a release > candidate, forcing us to build an extra one? Or just after the final > release? > >> >> - there are no docs for many new private and public methods >> >> >> They are all fairly self explanatory. 
But, docs can be added if needed. >> > > For you, and maybe for me too. But for undergraduate students, or Joe in > accounting who inherited this random app that's essential for his job? It's > simple, no public docs without docstrings. And preferably no private ones > either. > > Thanks for fixing all public docs quickly though. You missed just one, > gamma.fit. > >> >> - invalid syntax: http://projects.scipy.org/scipy/ticket/1186 >> >> >> This has been fixed (it was easier to fix the syntax then file the >> ticket...) Also to be clear this is only invalid for Python < 2.6 (the >> comment makes it sound like somehow the changes weren't tested at all). >> >> I didn't mean to imply that you were committing code that didn't even work > for you. > >> - the old rv_continuous doc template was put back in >> >> >> I'm not sure what you mean. Which change did this? >> > > The first one of your recent commits, r6392. The docstrings for subclasses > of rv_continuous and rv_discrete are not generated from this template > anymore, which is why it was removed. Look at line 862 (# generate docstring > for subclass instances) and below that to see how it works now. > > If you're wondering why that changed, the main reasons are (1) to make the > docstrings conform to the standard, (2) to be able to put useful info in the > base classes, like "this is how you subclass it: ..." instead of a template, > and (3) to be able to customize individual distribution docstrings easily. > > >> >> This, plus Josef saying that he doesn't want to fix the API for some >> methods yet, makes me want to take it out of the 0.8.x branch. Any >> objections to that Travis or Josef? >> >> >> I would really like to see these changes go in to 0.8.x. If Josef feels >> strongly about the API in the future, we can change it for the next release. >> I don't understand what the specific concerns are. >> >> No you can't. For API changes we do have a policy, they need deprecation > first. Which means if we release it like this now, we're stuck with it till > 0.10 / 1.0. > > > > In summary, I see quite a few reasons why this shouldn't go in and don't > see a compelling reason to release it right now. The 0.9 release is > (tentatively) planned for September, so you don't have to worry that your > changes sit in trunk unreleased for 1.5 years. > > > As the one doing the work of release manager, you have a lot of latitude in > making this decision, of course. The compelling reason to release it > right now is to get the improved features which nobody has actually voiced > specific concerns about. > > There have been expressed concerns as to both the design and validation. I think it should be removed and these changes put into a branch or up on github until they have been tested and documented. There is no rush, and really, there is no reason for folks to use code that hasn't been validated except for testing, and testing can be done using the branch. > Specifically improvements to the fit method of distribution objects (the > ability to fix specific parameters of the distribution and vary others in > the fit) is a very nice-to-have feature. The API change problem you > mention is actually an argument for putting it in now (because we *can* > deprecate it in 0.9 and then have whatever unspecified correct API come out > in 1.0). I have not heard that there is real disagreement about the API > either. > No, it is a argument for *not* putting it in now. 
There is no rush, and until the code has been looked over and thoroughly tested, there is no guarantee that either the API is suitable or that the implementation is correct. > > It feels like I've addressed the major reasons you feel it can't go in. > The functionality is tested. There are docstrings. I just removed the > rv_continuous doc template. I really don't know why that was added. I did > not make a specific change to include it. It must have been a merge error. > > We don't know what else might be wrong. Look at what happened with datetime and all the work that made for David. > Suggestions about how to give gamma.fit and beta.fit the docstring of it's > parent would be appreciated. > > I don't think a general rule of "no private methods without docstrings" is > necessarily appropriate, and a bit of an example of going overboard with > "rules" and "procedures." Private methods are not meant to be called > outside of code and should not necessarily have to be documented with > docstrings. Every docstring creates more code to maintain and keep > consistent with the actual code. > > One of the great things about Python is that you can read the code itself > so that it is much closer to self-documenting code > (close to it but not there --- I like comments and docstrings too). > > Python beyond the trivial is *not* self documenting, no code is self documenting. There is always a struggle to grasp the larger design and intent, as well of niggling questions of correctness. All python serves to do is remove a lot of verbiage by abstracting common objects like lists and hash tables. That helps, but it is far from all that is needed. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From david at silveregg.co.jp Tue Jun 1 21:22:56 2010 From: david at silveregg.co.jp (David) Date: Wed, 02 Jun 2010 10:22:56 +0900 Subject: [SciPy-Dev] [SciPy-User] log pdf, cdf, etc In-Reply-To: References: <6D1C6011-7B0A-45DB-9B54-6CAE1FA38F71@enthought.com> <12883887-E601-467B-9C56-55BDA8169C19@enthought.com> Message-ID: <4C05B270.8090608@silveregg.co.jp> On 06/01/2010 10:19 PM, Ralf Gommers wrote: > > No you can't. For API changes we do have a policy, they need deprecation > first. Which means if we release it like this now, we're stuck with it > till 0.10 / 1.0. I am not the release manager for 0.8.0, but I don't understand why we even discuss it *again*. *Every - single - time* this has happened in the past, it has caused numerous issues. It can be put in 0.8.1 later, the choice is not between now and one year and a half. David From josef.pktd at gmail.com Tue Jun 1 23:07:43 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Jun 2010 23:07:43 -0400 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 6:11 PM, Fernando Perez wrote: > Hi Josef, > > On Tue, Jun 1, 2010 at 1:31 PM, ? wrote: >> My main problem with git was the treatment of the file system, and I >> find it much easier to work with separate branches as in bzr or >> mercurial. > > One thing I've found very useful after transitioning to git for > IPython is the git new-workdir command: > > http://kerneltrap.org/mailarchive/git/2008/5/21/1900044 > http://nuclearsquid.com/writings/git-new-workdir.html > > It lets me keep a few branches around that I want 'permanent' on my > filesystem, in a bzr shared-repo style, while using git for the > lightweight feature-only branches. 
?This is how it looks like right > now on my system: thanks very useful information , git-new-workdir seems to be what I would like. Does it work on Windows? last year I didn't find any way to do this. I haven't updated git since then and I don't see any git-new-workdir in the git folders. (But maybe I deleted it when I switched from the full 1.x GB git install to the light version - without mingw and the kitchen sink) Josef > > - Main ipython git repo: > uqbar[ipython]> cd ipython/ > (Master)uqbar[ipython]> git branch -a > ?0.10 > ?0.10.1 > ?0.8 > ?0.9 > * Master > ?master > ?remotes/mainline/0.10 > ?remotes/mainline/0.10.1 > ?remotes/mainline/0.8 > ?remotes/mainline/0.9 > ?remotes/mainline/master > ?remotes/min/0.10 > ?remotes/min/0.10.1 > ?remotes/min/0.8 > ?remotes/min/0.9 > ?remotes/min/master > ?remotes/origin/master > > > And a separate 'branches' repo, populated with new-workdir: > > (Master)uqbar[ipython]> cd ../branches > uqbar[branches]> d > /home/fperez/ipython/branches > total 16 > drwxr-xr-x 10 fperez 4096 2010-05-13 01:35 0.10/ > drwxr-xr-x 10 fperez 4096 2010-05-13 15:48 0.10.1/ > > > where I keep branches I may need to see persistently on disk. > > HTH. > > Cheers, > > f > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From fperez.net at gmail.com Tue Jun 1 23:15:38 2010 From: fperez.net at gmail.com (Fernando Perez) Date: Tue, 1 Jun 2010 20:15:38 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: Hi Josef, On Tue, Jun 1, 2010 at 8:07 PM, wrote: > > thanks very useful information , git-new-workdir seems to be what I would like. > Does it work on Windows? > > last year I didn't find any way to do this. I haven't updated git > since then and I don't see any git-new-workdir in the git folders. > (But maybe I deleted it when I switched from the full 1.x GB git > install to the light version - without mingw and the kitchen sink) Unfortunately I don't know if it works on Windows; on my linux box it ships here: /usr/share/doc/git-core/contrib/workdir/git-new-workdir and I had to enable it by copying this script to somewhere in my PATH and making it executable. I have no idea if these contortions would work on Windows as well, though. Cheers, f From josef.pktd at gmail.com Tue Jun 1 23:38:21 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Jun 2010 23:38:21 -0400 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 6:14 PM, Matthew Brett wrote: > Hi, > >> My main problem with git was the treatment of the file system, and I >> find it much easier to work with separate branches as in bzr or >> mercurial. > > Yes, it is true that the git lightweight branch model takes some time > to get used to. ?My experience is that it's quick to get used to the > git way, and once I did, it was a large relief to get rid of all those > branch directories when we switched, but I understand that it is a > leap. 
> > I am sure you know this, but you can replicate the heavyweight > branches of hg and bzr with: > > # initial git clone of 'trunk' > git clone git://github.com/nipy/nipy.git > # make a heavyweight branch > git clone nipy my-nipy-branch > # push somewhere > # First add repo for the branch via github interface, then > cd my-nipy-branch > git remote add origin git at github.com:matthew-brett/my-nipy-branch.git > git push origin master However, I think this works only with a remote remote, github or similar When I looked at bzr vs hg vs git, I also thought about my private use, where I didn't find a way to compare across branches in separate directories. My work style in statsmodels is similar to the mailing list reference that Fernando gave. Mainly I have many uncommitted files in each branch, test scripts, examples scripts, quick checks whether a rewrite would work, or R and matlab files. None of it I want to commit to the repository, but have available when I work on it again. > > I think you'd agree that it's not a windows / unix difference though. > I'd agree it is a larger conceptual leap from svn to git than it is > from svn to bzr or svn to mercurial. ?The git argument is that making > that initial leap gives you a great deal of freedom and flexibility, > but it can be intimidating at first. A great deal of freedom gives any new user also a lot of opportunities to shoot in his own foot. And my impression from the mailing lists is that the rescue team is called more often than with bzr or hg. My recommendation to myself is not to use with git more than the 10 or so basic commands similar to svn or bzr. Then I don't think it will create any real problems. So the basic workflow description by the nipy and numpy/scipy git developers will be the most useful help for the transition. (just confirming what is obvious to you) > >> As long as it is possible to stick with the basic workflow of git >> without anything fancy, similar what I have seen while skimming the >> nipy docs, I think it is not a problem on windows. > > I think that is true that most of us won't need to go further than the > nipy basic workflow - but we haven't been using git long enough to > know that very well. ?I would defer to the git masters out there - > David, Pauli and others - ? > >> However, if/when parts of statsmodels go into scipy and I have to do >> maintenance of less isolated code, then I think the Mercurial >> interface might be my preferred choice. >> >> I haven't used Mercurial much yet, but I don't see any problems with it. >> >> So, the bottom line is, that documentation for the hg-git interface >> would be very useful for Windows users (or those that think git is a >> strange/unfamiliar concept.) > > So - two issues: > > 1) The conceptual issues involved in switching mind-set from svn or > bzr to git. ?That may require some thought and documentation > 2) There might be some technical issues using git on windows - but I > think so far we don't have any reason to think so? > 3) Some people may prefer mercurial for other reasons; it would be > good to respect that if possible. > > So, it may well be worth making a hg-git doc for numpy when we do the > transition - with the caveats that David raised. > > In the meantime, it would be very good to hear of any problems that do > come up specifically using git on windows... Right now I only use 3 or so git commands and I don't see any problems. 
Cheers, Josef > > See you, > > Matthew > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From matthew.brett at gmail.com Tue Jun 1 23:39:53 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 1 Jun 2010 20:39:53 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: Hi, On Tue, Jun 1, 2010 at 8:07 PM, wrote: > On Tue, Jun 1, 2010 at 6:11 PM, Fernando Perez wrote: >> Hi Josef, >> >> On Tue, Jun 1, 2010 at 1:31 PM, ? wrote: >>> My main problem with git was the treatment of the file system, and I >>> find it much easier to work with separate branches as in bzr or >>> mercurial. >> >> One thing I've found very useful after transitioning to git for >> IPython is the git new-workdir command: >> >> http://kerneltrap.org/mailarchive/git/2008/5/21/1900044 >> http://nuclearsquid.com/writings/git-new-workdir.html >> >> It lets me keep a few branches around that I want 'permanent' on my >> filesystem, in a bzr shared-repo style, while using git for the >> lightweight feature-only branches. ?This is how it looks like right >> now on my system: > > thanks very useful information , git-new-workdir seems to be what I would like. > Does it work on Windows? Sadly - probably not without a little hacking... http://code.google.com/p/msysgit/issues/detail?id=99 But, if you think you need it, the script is so short that it would only take a short time to port to python (it's in sh): http://git.kernel.org/?p=git/git.git;a=blob_plain;f=contrib/workdir/git-new-workdir;hb=HEAD I see there are symbolic links there, that will require a little fancy footwork on windows, as you know. If I have time I'll give it a go. See you, Matthew From matthew.brett at gmail.com Tue Jun 1 23:53:06 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 1 Jun 2010 20:53:06 -0700 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: Hi, >> # initial git clone of 'trunk' >> git clone git://github.com/nipy/nipy.git >> # make a heavyweight branch >> git clone nipy my-nipy-branch >> # push somewhere >> # First add repo for the branch via github interface, then >> cd my-nipy-branch >> git remote add origin git at github.com:matthew-brett/my-nipy-branch.git >> git push origin master > > However, I think this works only with a remote remote, github or similar > When I looked at bzr vs hg vs git, I also thought about my private > use, where I didn't find a way to compare across branches in separate > directories. Ah - with the paragraph below, I begin to see what you mean. You often have uncommitted changes, hence the need for several working trees. You can compare repositories, but it's a bit harder that with - say - bzr: http://stackoverflow.com/questions/687450/how-do-i-compare-two-git-repositories > My work style in statsmodels is similar to the mailing list reference > that Fernando gave. Mainly I have many uncommitted files in each > branch, test scripts, examples scripts, quick checks whether a rewrite > would work, or R and matlab files. None of it I want to commit to the > repository, but have available when I work on it again. Right - I see your point. Maybe the git solution to that workflow will be more obvious to others than it is to me. > A great deal of freedom gives any new user also a lot of opportunities > to shoot in his own foot. 
> And my impression from the mailing lists is that the rescue team is > called more often than with bzr or hg. > My recommendation to myself is not to use with git more than the 10 or > so basic commands similar to svn or bzr. Then I don't think it will > create any real problems. That's fair. It is easier to mess up with git - it has a steeper learning curve when you go past the basics. It is well worthwhile spending some time understanding the model underneath it - good links from Fernando's page : http://www.fperez.org/py4science/git.html ; I particularly liked http://tom.preston-werner.com/2009/05/19/the-git-parable.html . > So the basic workflow description by the nipy and numpy/scipy git > developers will be the most useful help for the transition. (just > confirming what is obvious to you) Worth saying - thanks for the thoughtful feedback, Matthew From josef.pktd at gmail.com Tue Jun 1 23:56:18 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Jun 2010 23:56:18 -0400 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 11:39 PM, Matthew Brett wrote: > Hi, > > On Tue, Jun 1, 2010 at 8:07 PM, ? wrote: >> On Tue, Jun 1, 2010 at 6:11 PM, Fernando Perez wrote: >>> Hi Josef, >>> >>> On Tue, Jun 1, 2010 at 1:31 PM, ? wrote: >>>> My main problem with git was the treatment of the file system, and I >>>> find it much easier to work with separate branches as in bzr or >>>> mercurial. >>> >>> One thing I've found very useful after transitioning to git for >>> IPython is the git new-workdir command: >>> >>> http://kerneltrap.org/mailarchive/git/2008/5/21/1900044 >>> http://nuclearsquid.com/writings/git-new-workdir.html >>> >>> It lets me keep a few branches around that I want 'permanent' on my >>> filesystem, in a bzr shared-repo style, while using git for the >>> lightweight feature-only branches. ?This is how it looks like right >>> now on my system: >> >> thanks very useful information , git-new-workdir seems to be what I would like. >> Does it work on Windows? > > Sadly - probably not without a little hacking... > > http://code.google.com/p/msysgit/issues/detail?id=99 > > But, if you think you need it, the script is so short that it would > only take a short time to port to python (it's in sh): > > http://git.kernel.org/?p=git/git.git;a=blob_plain;f=contrib/workdir/git-new-workdir;hb=HEAD > > I see there are symbolic links there, that will require a little fancy > footwork on windows, as you know. > > If I have time I'll give it a go. I don't think that's necessary (symlinks sound tricky) and scipy will be on a public repository, so your multiple (if I understand correctly) clone solution will work. Josef > > See you, > > Matthew > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From vincent at vincentdavis.net Wed Jun 2 00:02:20 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Tue, 1 Jun 2010 22:02:20 -0600 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 9:56 PM, wrote: > On Tue, Jun 1, 2010 at 11:39 PM, Matthew Brett wrote: >> Hi, >> >> On Tue, Jun 1, 2010 at 8:07 PM, ? wrote: >>> On Tue, Jun 1, 2010 at 6:11 PM, Fernando Perez wrote: >>>> Hi Josef, >>>> >>>> On Tue, Jun 1, 2010 at 1:31 PM, ? 
wrote: >>>>> My main problem with git was the treatment of the file system, and I >>>>> find it much easier to work with separate branches as in bzr or >>>>> mercurial. >>>> >>>> One thing I've found very useful after transitioning to git for >>>> IPython is the git new-workdir command: >>>> >>>> http://kerneltrap.org/mailarchive/git/2008/5/21/1900044 >>>> http://nuclearsquid.com/writings/git-new-workdir.html >>>> >>>> It lets me keep a few branches around that I want 'permanent' on my >>>> filesystem, in a bzr shared-repo style, while using git for the >>>> lightweight feature-only branches. ?This is how it looks like right >>>> now on my system: >>> >>> thanks very useful information , git-new-workdir seems to be what I would like. >>> Does it work on Windows? >> >> Sadly - probably not without a little hacking... >> >> http://code.google.com/p/msysgit/issues/detail?id=99 >> >> But, if you think you need it, the script is so short that it would >> only take a short time to port to python (it's in sh): >> >> http://git.kernel.org/?p=git/git.git;a=blob_plain;f=contrib/workdir/git-new-workdir;hb=HEAD >> >> I see there are symbolic links there, that will require a little fancy >> footwork on windows, as you know. >> >> If I have time I'll give it a go. > > I don't think that's necessary (symlinks sound tricky) and scipy will > be on a public repository, so your multiple (if I understand > correctly) clone solution will work. Could you not use hg to do what you want (work with local directories) and the use hg-git when you need to? I am kinda partial to hg and bzr. Vincent > > Josef > >> >> See you, >> >> Matthew >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From josef.pktd at gmail.com Wed Jun 2 00:14:14 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 2 Jun 2010 00:14:14 -0400 Subject: [SciPy-Dev] git on windows (was: scipy.stats) In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 12:02 AM, Vincent Davis wrote: > On Tue, Jun 1, 2010 at 9:56 PM, ? wrote: >> On Tue, Jun 1, 2010 at 11:39 PM, Matthew Brett wrote: >>> Hi, >>> >>> On Tue, Jun 1, 2010 at 8:07 PM, ? wrote: >>>> On Tue, Jun 1, 2010 at 6:11 PM, Fernando Perez wrote: >>>>> Hi Josef, >>>>> >>>>> On Tue, Jun 1, 2010 at 1:31 PM, ? wrote: >>>>>> My main problem with git was the treatment of the file system, and I >>>>>> find it much easier to work with separate branches as in bzr or >>>>>> mercurial. >>>>> >>>>> One thing I've found very useful after transitioning to git for >>>>> IPython is the git new-workdir command: >>>>> >>>>> http://kerneltrap.org/mailarchive/git/2008/5/21/1900044 >>>>> http://nuclearsquid.com/writings/git-new-workdir.html >>>>> >>>>> It lets me keep a few branches around that I want 'permanent' on my >>>>> filesystem, in a bzr shared-repo style, while using git for the >>>>> lightweight feature-only branches. ?This is how it looks like right >>>>> now on my system: >>>> >>>> thanks very useful information , git-new-workdir seems to be what I would like. >>>> Does it work on Windows? >>> >>> Sadly - probably not without a little hacking... 
>>> >>> http://code.google.com/p/msysgit/issues/detail?id=99 >>> >>> But, if you think you need it, the script is so short that it would >>> only take a short time to port to python (it's in sh): >>> >>> http://git.kernel.org/?p=git/git.git;a=blob_plain;f=contrib/workdir/git-new-workdir;hb=HEAD >>> >>> I see there are symbolic links there, that will require a little fancy >>> footwork on windows, as you know. >>> >>> If I have time I'll give it a go. >> >> I don't think that's necessary (symlinks sound tricky) and scipy will >> be on a public repository, so your multiple (if I understand >> correctly) clone solution will work. > > Could you not use hg to do what you want (work with local directories) > and the use hg-git when you need to? > > I am kinda partial to hg and bzr. I will give git a try, reviewing patches sounds easier with git. For other things, hg-git will be the likely outcome. Josef > > Vincent > >> >> Josef >> >>> >>> See you, >>> >>> Matthew >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From warren.weckesser at enthought.com Wed Jun 2 00:28:35 2010 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Tue, 01 Jun 2010 23:28:35 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table Message-ID: <4C05DDF3.9010206@enthought.com> I've been digging into some basic statistics recently, and developed the following function for applying the chi-square test to a contingency table. Does something like this already exist in scipy.stats? If not, any objects to adding it? (Tests are already written :) Warren ----- def chisquare_contingency(table): """Chi-square calculation for a contingency (R x C) table. This function computes the chi-square statistic and p-value of the data in the table. The expected frequencies are computed based on the relative frequencies in the table. Parameters ---------- table : array_like, 2D The contingency table, also known as the R x C table. Returns ------- chisquare statistic : float The chisquare test statistic p : float The p-value of the test. """ table = np.asarray(table) if table.ndim != 2: raise ValueError("table must be a 2D array.") # Create the table of expected frequencies. total = table.sum() row_sum = table.sum(axis=1).reshape(-1,1) col_sum = table.sum(axis=0) expected = row_sum * col_sum / float(total) # Since we are passing in 1D arrays of length table.size, the default # number of degrees of freedom is table.size-1. # For a contingency table, the actual number degrees of freedom is # (nr - 1)*(nc-1). We use the ddof argument # of the chisquare function to adjust the default. nr, nc = table.shape dof = (nr - 1) * (nc - 1) dof_adjust = (table.size - 1) - dof chi2, p = chisquare(np.ravel(table), np.ravel(expected), ddof=dof_adjust) return chi2, p ----- From d.l.goldsmith at gmail.com Wed Jun 2 01:09:17 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 22:09:17 -0700 Subject: [SciPy-Dev] Difference between scipy.stats.gengamma and scipy.stats.distributions.gengamma Message-ID: Is there a difference between these two? 
Same question for stats.lognorm and stats.distributions.lognorm? Thanks. DG -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Wed Jun 2 01:23:16 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 2 Jun 2010 01:23:16 -0400 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C05DDF3.9010206@enthought.com> References: <4C05DDF3.9010206@enthought.com> Message-ID: On Wed, Jun 2, 2010 at 12:28 AM, Warren Weckesser wrote: > I've been digging into some basic statistics recently, and developed the > following function for applying the chi-square test to a contingency > table. ?Does something like this already exist in scipy.stats? If not, > any objects to adding it? ?(Tests are already written :) There is no test like this yet in scipy.stats, and I think it is a good addition. My main question, which maybe Bruce can answer, is whether the function should allow more than 2 dimensions. The function would be easy to generalize but I don't know how common the test for example for independence in (RxCxD) is. (Options could still be added later without changing the API, in case there are any.) I would also look briefly at the R manual, to see what features their test has. (I'm not a real user of contingency tables) The docstring I think should mention that this is a test for independence, and that it is only appropriate if the expected count in each cell is at least 5. (off the top of my head) "Chi-square test for independence in a contingency (R x C) table" is (R x C) standard notation (letters)? dof_adjust, I would have to check. Can you open a ticket, mainly for the record, but to see if there are any useful generalization? But I think it can go in. A comment: The function matches the pattern of the current scipy.stats functions, but in statsmodels I would most likely also make the expected values available, so that users can directly compare data and expected values. Thanks, Josef > > Warren > > ----- > > def chisquare_contingency(table): > ? ?"""Chi-square calculation for a contingency (R x C) table. > > ? ?This function computes the chi-square statistic and p-value of the > ? ?data in the table. ?The expected frequencies are computed based on > ? ?the relative frequencies in the table. > > ? ?Parameters > ? ?---------- > ? ?table : array_like, 2D > ? ? ? ?The contingency table, also known as the R x C table. > > ? ?Returns > ? ?------- > ? ?chisquare statistic : float > ? ? ? ?The chisquare test statistic > ? ?p : float > ? ? ? ?The p-value of the test. > ? ?""" > ? ?table = np.asarray(table) > ? ?if table.ndim != 2: > ? ? ? ?raise ValueError("table must be a 2D array.") > > ? ?# Create the table of expected frequencies. > ? ?total = table.sum() > ? ?row_sum = table.sum(axis=1).reshape(-1,1) > ? ?col_sum = table.sum(axis=0) > ? ?expected = row_sum * col_sum / float(total) > > ? ?# Since we are passing in 1D arrays of length table.size, the default > ? ?# number of degrees of freedom is table.size-1. > ? ?# For a contingency table, the actual number degrees of freedom is > ? ?# (nr - 1)*(nc-1). ?We use the ddof argument > ? ?# of the chisquare function to adjust the default. > ? 
?nr, nc = table.shape > ? ?dof = (nr - 1) * (nc - 1) > ? ?dof_adjust = (table.size - 1) - dof > > ? ?chi2, p = chisquare(np.ravel(table), np.ravel(expected), > ddof=dof_adjust) > ? ?return chi2, p > > ----- > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From josef.pktd at gmail.com Wed Jun 2 01:26:03 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 2 Jun 2010 01:26:03 -0400 Subject: [SciPy-Dev] Difference between scipy.stats.gengamma and scipy.stats.distributions.gengamma In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 1:09 AM, David Goldsmith wrote: > Is there a difference between these two?? Same question for stats.lognorm > and stats.distributions.lognorm?? Thanks. No, they are the same instance of the distribution scipy.stats.__init__ has a from distributions import * or something like this Josef > > DG > > -- > Mathematician: noun, someone who disavows certainty when their uncertainty > set is non-empty, even if that set has measure zero. > > Hope: noun, that delusive spirit which escaped Pandora's jar and, with her > lies, prevents mankind from committing a general suicide. ?(As interpreted > by Robert Graves) > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From scott.sinclair.za at gmail.com Wed Jun 2 01:41:10 2010 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Wed, 2 Jun 2010 07:41:10 +0200 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On 1 June 2010 22:48, David Goldsmith wrote: > 2010/6/1 St?fan van der Walt >> >> On 1 June 2010 13:32, David Goldsmith wrote: >> > The docstring Standard seems to be careful to note which sections are >> > considered optional, and the "Extended Summary" is *not* on that list. >> > However, I'm encountering many SciPy docstrings in the Wiki lacking this >> > section and yet marked as "Needs review": should I ignore this >> > deficiency >> > and add a ticket to clarify the Standard, or should such docstrings be >> > moved >> > back to "Being written"? >> >> Typically, there is no reason not to have an extended section. ?Can >> you give an example where it would seem unnecessary? > > No: my position would appear to be the same as yours, and my inclination > would be to "revert" them to "Being written." Wouldn't it better to revert them to "Needs editing" instead? The "Being written" status implies that someone is actively working on the docstring... Cheers, Scott From d.l.goldsmith at gmail.com Wed Jun 2 02:34:04 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 23:34:04 -0700 Subject: [SciPy-Dev] Difference between scipy.stats.gengamma and scipy.stats.distributions.gengamma In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 10:26 PM, wrote: > On Wed, Jun 2, 2010 at 1:09 AM, David Goldsmith > wrote: > > Is there a difference between these two? Same question for stats.lognorm > > and stats.distributions.lognorm? Thanks. > > No, they are the same instance of the distribution > > scipy.stats.__init__ has a from distributions import * or something like > this > OK, thanks! > DG -------------- next part -------------- An HTML attachment was scrubbed... 
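A quick way to see Josef's point above for oneself: because scipy.stats simply star-imports scipy.stats.distributions, the two names are bound to the very same distribution instances. This is a minimal interactive sketch, assuming only that star import; gengamma and lognorm are just the two distributions asked about, and any other distribution name behaves the same way.

>>> from scipy import stats
>>> stats.gengamma is stats.distributions.gengamma
True
>>> stats.lognorm is stats.distributions.lognorm
True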
URL: From d.l.goldsmith at gmail.com Wed Jun 2 02:44:41 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 1 Jun 2010 23:44:41 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 10:41 PM, Scott Sinclair wrote: > On 1 June 2010 22:48, David Goldsmith wrote: > > 2010/6/1 St?fan van der Walt > >> > >> On 1 June 2010 13:32, David Goldsmith wrote: > >> > The docstring Standard seems to be careful to note which sections are > >> > considered optional, and the "Extended Summary" is *not* on that list. > >> > However, I'm encountering many SciPy docstrings in the Wiki lacking > this > >> > section and yet marked as "Needs review": should I ignore this > >> > deficiency > >> > and add a ticket to clarify the Standard, or should such docstrings be > >> > moved > >> > back to "Being written"? > >> > >> Typically, there is no reason not to have an extended section. Can > >> you give an example where it would seem unnecessary? > > > > No: my position would appear to be the same as yours, and my inclination > > would be to "revert" them to "Being written." > > Wouldn't it better to revert them to "Needs editing" instead? The > "Being written" status implies that someone is actively working on the > docstring... > > Cheers, > Scott > Correct; actually, what I'm doing for these, and other prematurely promoted docstrings, is checking the log: only if the most recent edit was substantial and within the last 6 mo. (indicating some amount of recent "ownership") am I pushing back to "Being written," otherwise, which, so far, is the dominant case by far, I am indeed pushing it back to "Needs editing." :-) DG > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From ce at vejnar.eu Wed Jun 2 05:52:49 2010 From: ce at vejnar.eu (Charles Vejnar) Date: Wed, 02 Jun 2010 11:52:49 +0200 Subject: [SciPy-Dev] Scipy archive on PyPI In-Reply-To: References: <201005311655.53741.ce@vejnar.eu> <4C045DCD.1080503@silveregg.co.jp> Message-ID: <201006021152.49749.ce@vejnar.eu> On Tuesday 01 June 2010, Ralf Gommers wrote: > On Tue, Jun 1, 2010 at 9:09 AM, David wrote: > > On 06/01/2010 12:15 AM, Ralf Gommers wrote: > > > On Mon, May 31, 2010 at 10:55 PM, Charles Vejnar > > > > > > wrote: > > > Hi, > > > > > > I was trying to install Scipy with easy_install and it seems that > > > downloading > > > from Sourceforge is no longer possible (Sourceforge no longer gives > > > a direct > > > link to the .tar.gz file) which makes the install fail. > > > > > > Would it be possible to always upload the latest Scipy tarball to > > > > PyPI ? > > > > > It's possible, but because that encourages the use of easy_install/pip > > > it would probably give more problems than that it helps. Just today > > > there was a thread on numpy-discussion about pip failing and standard > > > "python setup.py install" fixing the problem. easy_install is just as > > > problematic as pip, if not more so. > > > > Unfortunately, people will always use those half broken tools. 
I think > > we should at least put the tarballs - I also used to put a simple > > executable (result of bdist_wininst) so that easy_install numpy works on > > windows. > > > > OK, I'll do the same then. > > Ralf Thank you Charles From stefan.czesla at hs.uni-hamburg.de Wed Jun 2 07:21:06 2010 From: stefan.czesla at hs.uni-hamburg.de (Stefan) Date: Wed, 2 Jun 2010 11:21:06 +0000 (UTC) Subject: [SciPy-Dev] =?utf-8?q?np=2Esavetxt=3A_apply_patch_in_enhancement_?= =?utf-8?q?ticket_1079=09to_add_headers=3F?= References: Message-ID: > > If the header is given as a plane string > > (such as envisaged in ticket 1079), the > > user has to care for the correct formatting, in particular, > > the user has to > > supply the comment character(s) and the new line formatting. > > This might be > > against intuition, because many users will at first try to supply their > > header(s) without specifying those formatting characters. > > The result will be a > > file not readable with numpy.loadtxt, and the error might > > not be detected right > > away. > > I'm not sure I understand why I would want to specify a comment > character for writing a csv file (unless of course I had some comments > to add). We are possibly talking about different things. In our approach of using numpy.savetxt comments (preceeding the actual data) and a header are essentially the same, such as in the following example. Basically, we want to add some lines of additional information at the top of the file written with numpy.savetxt, and be able to recover the data with numpy.loadtxt (for which the 'header' would then be irrelevant, what may not be your intention, or is it?). #Now comes the data #column1 [kg] column2 [apple] 1 2 3 5 > > Also note that since that patch was written, savetxt takes a user > supplied newline keyword, so you can just append that to the header > string. > True, we were not aware of this, but this does not help much for the comment/header. > > > > As numpy.loadtxt has a default comment character ('#'), the same may be > > implemented for numpy.savetxt. In this case, numpy.savetxt would get two > > additional keywords (e.g. header, comment(character)), which bloats the > > interface, but potentially provides more safety. > > > > FWIW, I ended up rolling my own using the most recent pre-Python 3 > changes for savetxt that accepts a list of names instead of one string > or if the provided array has the attribute dtype.names (non-nested rec > or structured arrays) it uses those. Whatever is done I think the > support for structured arrays is nice, and I think having this > functionality is a no-brainer. I need it quite often. > Although, we have not been using record arrays too often, we see their advantages and agree that it should be possible to use them as you described it. We also thought about a solution, using the __str__ method for the 'header object'. In this vain, an arbitrary header class (including a plane string) providing an __str__ member may be handed to numpy.savetxt, which can use it to write the header. > Skipper > From ralf.gommers at googlemail.com Wed Jun 2 07:22:00 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Wed, 2 Jun 2010 19:22:00 +0800 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? 
In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 2:44 PM, David Goldsmith wrote: > On Tue, Jun 1, 2010 at 10:41 PM, Scott Sinclair gmail.com> wrote: > >> On 1 June 2010 22:48, David Goldsmith wrote: >> > 2010/6/1 St?fan van der Walt >> >> >> >> On 1 June 2010 13:32, David Goldsmith wrote: >> >> > The docstring Standard seems to be careful to note which sections are >> >> > considered optional, and the "Extended Summary" is *not* on that >> list. >> >> > However, I'm encountering many SciPy docstrings in the Wiki lacking >> this >> >> > section and yet marked as "Needs review": should I ignore this >> >> > deficiency >> >> > and add a ticket to clarify the Standard, or should such docstrings >> be >> >> > moved >> >> > back to "Being written"? >> >> >> >> Typically, there is no reason not to have an extended section. Can >> >> you give an example where it would seem unnecessary? >> > I think we shouldn't go overboard here. In the great majority of cases it's needed but sometimes there's just not much info to add besides what's in the summary and parameter description. Examples: http://docs.scipy.org/numpy/docs/numpy.core.umath.add/ http://docs.scipy.org/numpy/docs/numpy.lib.ufunclike.isneginf/ http://docs.scipy.org/numpy/docs/numpy.core.umath.logical_or/ These are all good docstrings and should not be reset to "needs editing" imho. And if you really have info to add, I suggest to just add it the moment you see it - will be a lot more productive in the end. Finally, there's a huge amount of low hanging fruit in the scipy docs. Why not just take a module and dig in? These details can wait for a while. Best regards, Ralf > >> > No: my position would appear to be the same as yours, and my inclination >> > would be to "revert" them to "Being written." >> >> Wouldn't it better to revert them to "Needs editing" instead? The >> "Being written" status implies that someone is actively working on the >> docstring... >> >> Cheers, >> Scott >> > > Correct; actually, what I'm doing for these, and other prematurely promoted > docstrings, is checking the log: only if the most recent edit was > substantial and within the last 6 mo. (indicating some amount of recent > "ownership") am I pushing back to "Being written," otherwise, which, so far, > is the dominant case by far, I am indeed pushing it back to "Needs editing." > :-) > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From nmb at wartburg.edu Wed Jun 2 08:24:25 2010 From: nmb at wartburg.edu (Neil Martinsen-Burrell) Date: Wed, 02 Jun 2010 07:24:25 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C05DDF3.9010206@enthought.com> References: <4C05DDF3.9010206@enthought.com> Message-ID: <4C064D79.4030106@wartburg.edu> On 2010-06-01 23:28 , Warren Weckesser wrote: > I've been digging into some basic statistics recently, and developed the > following function for applying the chi-square test to a contingency > table. Does something like this already exist in scipy.stats? If not, > any objects to adding it? (Tests are already written :) Something like this would be great in scipy.stats since I end up doing the exact same thing by hand whenever I grade introductory statistics exams. Thanks for writing this! I've got some code review comments that I'll include below. > def chisquare_contingency(table): I think that chiquare_twoway fits the common name for this test better, but as Joseph mentions, this neglects the possibility of expanding this to n-dimensions. 
> """Chi-square calculation for a contingency (R x C) table. The docstring should emphasize that this is a hypothesis test. See for example http://docs.scipy.org/scipy/docs/scipy.stats.stats.ttest_rel/. I'm not familiar with the R x C notation, but it does work to make clear which chi square test this is. > > This function computes the chi-square statistic and p-value of the > data in the table. The expected frequencies are computed based on > the relative frequencies in the table. I try to explain what the null and alternative hypotheses are for the tests in scipy.stats. > > Parameters > ---------- > table : array_like, 2D > The contingency table, also known as the R x C table. This could also say something like "The table contains the observed frequencies of each category." > > Returns > ------- > chisquare statistic : float > The chisquare test statistic > p : float > The p-value of the test. A function like this could really use an example, perhaps straight from one of the tests. > """ > table = np.asarray(table) > if table.ndim != 2: > raise ValueError("table must be a 2D array.") > > # Create the table of expected frequencies. > total = table.sum() > row_sum = table.sum(axis=1).reshape(-1,1) > col_sum = table.sum(axis=0) > expected = row_sum * col_sum / float(total) I think that np.outer(row_sum, col_sum) is clearer than reshaping one to be a column vector. > > # Since we are passing in 1D arrays of length table.size, the default > # number of degrees of freedom is table.size-1. > # For a contingency table, the actual number degrees of freedom is > # (nr - 1)*(nc-1). We use the ddof argument > # of the chisquare function to adjust the default. > nr, nc = table.shape > dof = (nr - 1) * (nc - 1) > dof_adjust = (table.size - 1) - dof > > chi2, p = chisquare(np.ravel(table), np.ravel(expected), > ddof=dof_adjust) > return chi2, p From josef.pktd at gmail.com Wed Jun 2 10:03:09 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 2 Jun 2010 10:03:09 -0400 Subject: [SciPy-Dev] old new (?) StatisticReview Message-ID: 4 years ago Robert Kern defined guide lines or check list for the review of the functions in scipy.stats http://projects.scipy.org/scipy/wiki/StatisticsReview This is a useful checklist for evaluating legacy functions,... in scipy.stats. And I think I (implicitly) followed this most of the time in my stats cleanup. But the criteria should not apply to only existing functions (i.e. that have entered trunk), but also to new code. The only point I want to strengthen is number "1. The function works. Sometimes, you just have to state the obvious." to "1. The function works and produces correct result." "works" sounds too much like "it doesn't raise an exception" "correct" is also a vague term, but it captures more the spirit. The checklist could be reviewed or rephrased, but I would like to have guide lines spelled out more explicitly, so we know what the rules of the game are. 
(even if they are guidelines) Josef From josef.pktd at gmail.com Wed Jun 2 10:37:56 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 2 Jun 2010 10:37:56 -0400 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C064D79.4030106@wartburg.edu> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> Message-ID: On Wed, Jun 2, 2010 at 8:24 AM, Neil Martinsen-Burrell wrote: > On 2010-06-01 23:28 , Warren Weckesser wrote: >> I've been digging into some basic statistics recently, and developed the >> following function for applying the chi-square test to a contingency >> table. ?Does something like this already exist in scipy.stats? If not, >> any objects to adding it? ?(Tests are already written :) > > Something like this would be great in scipy.stats since I end up doing > the exact same thing by hand whenever I grade introductory statistics > exams. ?Thanks for writing this! > > I've got some code review comments that I'll include below. > >> def chisquare_contingency(table): > > I think that chiquare_twoway fits the common name for this test better, > but as Joseph mentions, this neglects the possibility of expanding this > to n-dimensions. > >> ? ? ?"""Chi-square calculation for a contingency (R x C) table. > > The docstring should emphasize that this is a hypothesis test. ?See for > example http://docs.scipy.org/scipy/docs/scipy.stats.stats.ttest_rel/. > I'm not familiar with the R x C notation, but it does work to make clear > which chi square test this is. > >> >> ? ? ?This function computes the chi-square statistic and p-value of the >> ? ? ?data in the table. ?The expected frequencies are computed based on >> ? ? ?the relative frequencies in the table. > > I try to explain what the null and alternative hypotheses are for the > tests in scipy.stats. > >> >> ? ? ?Parameters >> ? ? ?---------- >> ? ? ?table : array_like, 2D >> ? ? ? ? ?The contingency table, also known as the R x C table. > > This could also say something like "The table contains the observed > frequencies of each category." > >> >> ? ? ?Returns >> ? ? ?------- >> ? ? ?chisquare statistic : float >> ? ? ? ? ?The chisquare test statistic >> ? ? ?p : float >> ? ? ? ? ?The p-value of the test. > > A function like this could really use an example, perhaps straight from > one of the tests. > >> ? ? ?""" >> ? ? ?table = np.asarray(table) >> ? ? ?if table.ndim != 2: >> ? ? ? ? ?raise ValueError("table must be a 2D array.") >> >> ? ? ?# Create the table of expected frequencies. >> ? ? ?total = table.sum() >> ? ? ?row_sum = table.sum(axis=1).reshape(-1,1) >> ? ? ?col_sum = table.sum(axis=0) >> ? ? ?expected = row_sum * col_sum / float(total) > > I think that np.outer(row_sum, col_sum) is clearer than reshaping one to > be a column vector. > >> >> ? ? ?# Since we are passing in 1D arrays of length table.size, the default >> ? ? ?# number of degrees of freedom is table.size-1. >> ? ? ?# For a contingency table, the actual number degrees of freedom is >> ? ? ?# (nr - 1)*(nc-1). ?We use the ddof argument >> ? ? ?# of the chisquare function to adjust the default. >> ? ? ?nr, nc = table.shape >> ? ? ?dof = (nr - 1) * (nc - 1) >> ? ? ?dof_adjust = (table.size - 1) - dof >> >> ? ? ?chi2, p = chisquare(np.ravel(table), np.ravel(expected), >> ddof=dof_adjust) >> ? ? ?return chi2, p Just a thought: I think it would be useful to have this kind of proposals on the scipy-user list (even though it is a dev issue), just to be able to get more feedback from potential users. 
And again, Thanks Neil, it's very nice to have the statistics in the docstrings instead of having to run to Wikipedia Josef > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From bsouthey at gmail.com Wed Jun 2 10:41:39 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Wed, 02 Jun 2010 09:41:39 -0500 Subject: [SciPy-Dev] np.savetxt: apply patch in enhancement ticket 1079 to add headers? In-Reply-To: References: Message-ID: <4C066DA3.8010609@gmail.com> On 06/02/2010 06:21 AM, Stefan wrote: > >>> If the header is given as a plane string >>> (such as envisaged in ticket 1079), the >>> user has to care for the correct formatting, in particular, >>> the user has to >>> supply the comment character(s) and the new line formatting. >>> This might be >>> against intuition, because many users will at first try to supply their >>> header(s) without specifying those formatting characters. >>> The result will be a >>> file not readable with numpy.loadtxt, and the error might >>> not be detected right >>> away. >>> >> I'm not sure I understand why I would want to specify a comment >> character for writing a csv file (unless of course I had some comments >> to add). >> > We are possibly talking about different things. In our approach of using > numpy.savetxt comments (preceeding the actual data) and a header > are essentially the same, such as in the following example. > Basically, we want to add some lines > of additional information at the top of the file written with > numpy.savetxt, and be able to recover the data with numpy.loadtxt > (for which the 'header' would > then be irrelevant, what may not be your intention, or is it?). > > #Now comes the data > #column1 [kg] column2 [apple] > 1 2 > 3 5 > > Not that I am complaining rather trying to understand what is expected to happen. Under the patch, it is very much user beware. The header argument can be anything or nothing. There is no check for the contents or if the delimiter used is the same as the rest of the output. Further with the newline option there is no guarantee that the lines in the header will have the same line endings throughout the file. So what should a user be allowed to use as a header? You could write a whole program there or an explanation of the following output - which is very appealing. You could force a list of strings so that you print out newline.join(header) - okay not quite because it should include the comment argument. Should savetxt be restricted to something that loadtxt can read? This is potentially problematic if you want a header line. Although it could return the number of header lines. [savetxt should also be updated to allow bz2 as loadtxt handles those now - not that I have used it] > >> Also note that since that patch was written, savetxt takes a user >> supplied newline keyword, so you can just append that to the header >> string. >> >> > True, we were not aware of this, but this does not help much for the > comment/header. > Entered as ~3 months ago: http://projects.scipy.org/numpy/changeset/8180 Should this be forced to check for valid options for new lines? Otherwise you from this 'np.savetxt('junk.text', [1,2,3,4,5], newline='what')' you get: 1.000000000000000000e+00what2.000000000000000000e+00what3.000000000000000000e+00what4.000000000000000000e+00what5.000000000000000000e+00what Which is not going to be read back by loadtxt. 
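[A minimal sketch of the kind of newline validation Bruce asks about above. This is hypothetical -- numpy.savetxt is not described here as doing any such check -- and the helper name is made up purely for illustration.]

    # Hypothetical guard against newline values that loadtxt could never round-trip.
    _VALID_NEWLINES = ('\n', '\r\n', '\r')

    def _check_newline(newline):
        if newline not in _VALID_NEWLINES:
            raise ValueError("newline must be one of %r, got %r"
                             % (_VALID_NEWLINES, newline))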
>>> As numpy.loadtxt has a default comment character ('#'), the same may be >>> implemented for numpy.savetxt. In this case, numpy.savetxt would get two >>> additional keywords (e.g. header, comment(character)), which bloats the >>> interface, but potentially provides more safety. >>> >>> >> FWIW, I ended up rolling my own using the most recent pre-Python 3 >> changes for savetxt that accepts a list of names instead of one string >> or if the provided array has the attribute dtype.names (non-nested rec >> or structured arrays) it uses those. Whatever is done I think the >> support for structured arrays is nice, and I think having this >> functionality is a no-brainer. I need it quite often. >> >> > Although, we have not been using record arrays too often, we see their > advantages and agree that it should be possible to use them as you described > it. > We also thought about a solution, using the __str__ method for the 'header > object'. In this vain, an arbitrary header class (including a plane string) > providing an __str__ member may be handed to numpy.savetxt, > which can use it to write the header. > > >> Skipper >> >> > It would nice if savetxt used the dtype of the input to get a header and format by default unless overwritten by the user. Bruce -------------- next part -------------- An HTML attachment was scrubbed... URL: From bsouthey at gmail.com Wed Jun 2 12:02:02 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Wed, 02 Jun 2010 11:02:02 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> Message-ID: <4C06807A.40301@gmail.com> On 06/02/2010 09:37 AM, josef.pktd at gmail.com wrote: > On Wed, Jun 2, 2010 at 8:24 AM, Neil Martinsen-Burrell wrote: > >> On 2010-06-01 23:28 , Warren Weckesser wrote: >> >>> I've been digging into some basic statistics recently, and developed the >>> following function for applying the chi-square test to a contingency >>> table. Does something like this already exist in scipy.stats? If not, >>> any objects to adding it? (Tests are already written :) >>> >> Something like this would be great in scipy.stats since I end up doing >> the exact same thing by hand whenever I grade introductory statistics >> exams. Thanks for writing this! >> You might find SAS helpful: http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/freq_toc.htm However, this code is the chi-squared test part as SAS will compute the actual cell numbers. Also an extension to scipy.stats.chisquare() so we can not have both functions. Really this should be combined with fisher.py in ticket 956: http://projects.scipy.org/scipy/ticket/956 >> I've got some code review comments that I'll include below. >> >> >>> def chisquare_contingency(table): >>> >> I think that chiquare_twoway fits the common name for this test better, >> but as Joseph mentions, this neglects the possibility of expanding this >> to n-dimensions. >> >> >>> """Chi-square calculation for a contingency (R x C) table. >>> >> The docstring should emphasize that this is a hypothesis test. See for >> example http://docs.scipy.org/scipy/docs/scipy.stats.stats.ttest_rel/. >> I'm not familiar with the R x C notation, but it does work to make clear >> which chi square test this is. >> >> >>> This function computes the chi-square statistic and p-value of the >>> data in the table. 
The expected frequencies are computed based on >>> the relative frequencies in the table. >>> >> I try to explain what the null and alternative hypotheses are for the >> tests in scipy.stats. >> It is also an asymptotic test so cell size should be mentioned. >> >>> Parameters >>> ---------- >>> table : array_like, 2D >>> The contingency table, also known as the R x C table. >>> >> This could also say something like "The table contains the observed >> frequencies of each category." >> >> >>> Returns >>> ------- >>> chisquare statistic : float >>> The chisquare test statistic >>> p : float >>> The p-value of the test. >>> >> A function like this could really use an example, perhaps straight from >> one of the tests. >> It needs at least to support both the 1-d and 2-d cases (preferably where R and C > 2) >>> """ >>> table = np.asarray(table) >>> if table.ndim != 2: >>> raise ValueError("table must be a 2D array.") >>> This should not be restricted to 2-d array's. At the very least it should handle 1-d and 2-d array_like inputs. There also should have correct handling of masked arrays because np.asarray ignores the mask - I do not recall what happens with Matrix class. Obviously one needs to address how masked values are handled such as replacing the values with zero. >>> # Create the table of expected frequencies. >>> total = table.sum() >>> total=table.sum(dtype=float) # dtype will not be needed if integer division is not used (ie Python3) >>> row_sum = table.sum(axis=1).reshape(-1,1) >>> col_sum = table.sum(axis=0) >>> expected = row_sum * col_sum / float(total) >>> expected = row_sum * col_sum /total >> I think that np.outer(row_sum, col_sum) is clearer than reshaping one to >> be a column vector. >> Make it one liner: expected = np.outer( table.sum(axis=1), table.sum(axis=0))/total >>> # Since we are passing in 1D arrays of length table.size, the default >>> # number of degrees of freedom is table.size-1. >>> # For a contingency table, the actual number degrees of freedom is >>> # (nr - 1)*(nc-1). We use the ddof argument >>> # of the chisquare function to adjust the default. >>> nr, nc = table.shape >>> dof = (nr - 1) * (nc - 1) >>> dof_adjust = (table.size - 1) - dof >>> >>> chi2, p = chisquare(np.ravel(table), np.ravel(expected), >>> ddof=dof_adjust) >>> Where is your chisquare function - this is meant to be a standard alone function? Why not do say: import special chi2_value=(((table-expected)**2)/expected).sum() chi2_prob=special.chdtrc(dof,chi2_value) >>> return chi2, p >>> > > Just a thought: > I think it would be useful to have this kind of proposals on the > scipy-user list (even though it is a dev issue), just to be able to > get more feedback from potential users. > > And again, > Thanks Neil, it's very nice to have the statistics in the docstrings > instead of having to run to Wikipedia > > Josef > > >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > Bruce -------------- next part -------------- An HTML attachment was scrubbed... 
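[For reference, a minimal sketch that folds the review suggestions from this thread -- np.outer for the expected counts, special.chdtrc for the p-value, and the (nr - 1) * (nc - 1) degrees of freedom -- into one function. This is only an illustration of the idea under discussion, not a committed implementation, and chisquare_twoway is just one of the candidate names mentioned.]

    import numpy as np
    from scipy import special

    def chisquare_twoway(table):
        # Test of independence for a two-way (R x C) table of observed counts.
        table = np.asarray(table)
        if table.ndim != 2:
            raise ValueError("table must be a 2D array of observed frequencies")
        total = table.sum(dtype=float)
        # Expected counts under independence: outer product of the margins / N.
        expected = np.outer(table.sum(axis=1), table.sum(axis=0)) / total
        nr, nc = table.shape
        dof = (nr - 1) * (nc - 1)
        chi2 = ((table - expected) ** 2 / expected).sum()
        p = special.chdtrc(dof, chi2)  # upper tail of the chi-square distribution
        return chi2, p

    # e.g. chisquare_twoway([[10, 20], [30, 25]]) returns the statistic and its p-value.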
URL: From nmb at wartburg.edu Wed Jun 2 12:26:04 2010 From: nmb at wartburg.edu (Neil Martinsen-Burrell) Date: Wed, 02 Jun 2010 11:26:04 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C06807A.40301@gmail.com> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> Message-ID: <4C06861C.1060401@wartburg.edu> On 2010-06-02 11:02 , Bruce Southey wrote: > On 06/02/2010 09:37 AM, josef.pktd at gmail.com wrote: >> On Wed, Jun 2, 2010 at 8:24 AM, Neil Martinsen-Burrell wrote: >> >>> On 2010-06-01 23:28 , Warren Weckesser wrote: >>> >>>> I've been digging into some basic statistics recently, and developed the >>>> following function for applying the chi-square test to a contingency >>>> table. Does something like this already exist in scipy.stats? If not, >>>> any objects to adding it? (Tests are already written :) >>>> >>> Something like this would be great in scipy.stats since I end up doing >>> the exact same thing by hand whenever I grade introductory statistics >>> exams. Thanks for writing this! >>> > You might find SAS helpful: > http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/freq_toc.htm I'm not sure what you mean by this. I have no problem performing the test, it's just inconvenient that it isn't already a part of scipy.stats > However, this code is the chi-squared test part as SAS will compute the > actual cell numbers. Also an extension to scipy.stats.chisquare() so we > can not have both functions. Again, I don't understand what you mean that we can't have both functions? I believe (from a statistics teacher's point of view) that the Chi-Squared goodness of fit test (which is stats.chisquare) is a different beast from the Chi-Square test for independence (which is stats.chisquare_contingency). The fact that the distribution of the test statistic is the same should not tempt us to put them into the same function. > Really this should be combined with fisher.py in ticket 956: > http://projects.scipy.org/scipy/ticket/956 Wow, apparently I have lots of disagreements today, but I don't think that this should be combined with Fisher's Exact test. (I would like to see that ticket mature to the point where it can be added to scipy.stats.) I like the functions in scipy.stats to correspond in a one-to-one manner with the statistical tests. I think that the docs should "See Also" the appropriate exact (and non-parametric) tests, but I think that one function/one test is a good rule. This is particularly true for people (like me) who would like to someday be able to use scipy.stats in a pedagogical context. -Neil From stefan.czesla at hs.uni-hamburg.de Wed Jun 2 13:14:04 2010 From: stefan.czesla at hs.uni-hamburg.de (Stefan) Date: Wed, 2 Jun 2010 17:14:04 +0000 (UTC) Subject: [SciPy-Dev] np.savetxt: apply patch in enhancement ticket 1079 to add headers? References: <4C066DA3.8010609@gmail.com> Message-ID: > Not that I am complaining rather trying to understand what is expected > to happen. > Under the patch, it is very much user beware.? The header argument can > be anything or nothing. There is no check for the contents or if the > delimiter used is the same as the rest of the output. Further with the > newline option there is no guarantee that the lines in the header will > have the same line endings throughout the file. > So what should a user be allowed to use as a header? 
> You could write a whole program there or an explanation of the > following output - which is very appealing. You could force a list of > strings so that you print out newline.join(header) - okay not quite > because it should include the comment argument. > Should savetxt be restricted to something that loadtxt can read? > This is potentially problematic if you want a header line. Although it > could return the number of header lines. > [savetxt should also be updated to allow bz2 as loadtxt handles those > now - not that I have used it] > > > > > Also note that since that patch was written, savetxt takes a user > supplied newline keyword, so you can just append that to the header > string. > > > > True, we were not aware of this, but this does not help much for the > comment/header. > > > > Entered as ~3 months ago:http://projects.scipy.org/numpy/changeset/8180 > Should this be forced to check for valid options for new lines? > Otherwise you from this? 'np.savetxt('junk.text', [1,2,3,4,5], > newline='what')' you get: > 1.000000000000000000e+00what2.000000000000000000e+00what 3.000000000000000000e+00what4.000000000000000000e+00 what5.000000000000000000e+00what > Which is not going to be read back by loadtxt. > > > > As numpy.loadtxt has a default comment character ('#'), the same may be > implemented for numpy.savetxt. In this case, numpy.savetxt would get two > additional keywords (e.g. header, comment(character)), which bloats the > interface, but potentially provides more safety. > > > > > FWIW, I ended up rolling my own using the most recent pre-Python 3 > changes for savetxt that accepts a list of names instead of one string > or if the provided array has the attribute dtype.names (non-nested rec > or structured arrays) it uses those. Whatever is done I think the > support for structured arrays is nice, and I think having this > functionality is a no-brainer. I need it quite often. > > > > Although, we have not been using record arrays too often, we see their > advantages and agree that it should be possible to use them as you described > it. > We also thought about a solution, using the __str__ method for the 'header > object'. In this vain, an arbitrary header class (including a plane string) > providing an __str__ member may be handed to numpy.savetxt, > which can use it to write the header. > So let us briefly summarize whats on the table. It appears to us that there are basically three open issues: (1) a csv like header for savetxt written files (first line contains column names) (2) comments (introduced by comment character e.g. '#') at the beginning of the file (preceding the data) (3) the role of the 'newline' option As was noted, the patch (ticket 1079) enables both to write a csv like header (1) and comment line(s) introduced by a comment character (e.g. '#'). Nonetheless, this solution is quite unsatisfactory in our opinion, because it may be error prone, as the user is in charge of the entire formatting. Despite this, we think that it should be up to the user what amount of information is to be put at the top of the file, but the format should be checked as far as possible. Using either a string or a list/tuple of strings, as proposed by Bruce, seems to be a reasonable possibility to implement the desired functionality. Maybe two individual keywords ('header' and 'comment') should exist to distinguish whether the the user requests case (1) or (2). As for loadtxt the default comment character should be '#', but it may be changed by the user. 
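[To make the proposal concrete, a rough sketch of the header/comment interface being described: a hypothetical wrapper, not a patch, with the function name invented for illustration. It writes each header line behind the comment character so that numpy.loadtxt skips it by default.]

    import numpy as np

    def savetxt_with_header(fname, X, header=None, comment='#', **kwargs):
        # Hypothetical wrapper: write comment-prefixed header lines, then the data.
        newline = kwargs.get('newline', '\n')
        with open(fname, 'w') as fh:
            if header is not None:
                if isinstance(header, str):
                    header = header.splitlines()
                for line in header:
                    fh.write('%s %s%s' % (comment, line, newline))
            np.savetxt(fh, X, **kwargs)

    # savetxt_with_header('data.txt', np.array([[1, 2], [3, 5]]),
    #                     header=['Now comes the data', 'column1 [kg] column2 [apple]'])
    # np.loadtxt('data.txt') then recovers the numbers and ignores the '#' lines.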
We think that savetxt should not be restricted to output, which can be read by loadtxt. Although it should be possible to add commments to the output file, so that it remains readable by loadtxt (without tweaking it e.g. with the skiprows keyword). We agree that the newline keyword may cause inconsistencies in the file (if ticket 1079 were applied), and possibly strange behavior such as when newline='what' is specified. Yet, this question does not only concern the header/comments. Stefan & Christian From bsouthey at gmail.com Wed Jun 2 14:10:12 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Wed, 02 Jun 2010 13:10:12 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C06861C.1060401@wartburg.edu> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> Message-ID: <4C069E84.4020308@gmail.com> On 06/02/2010 11:26 AM, Neil Martinsen-Burrell wrote: > On 2010-06-02 11:02 , Bruce Southey wrote: >> On 06/02/2010 09:37 AM, josef.pktd at gmail.com wrote: >>> On Wed, Jun 2, 2010 at 8:24 AM, Neil >>> Martinsen-Burrell wrote: >>> >>>> On 2010-06-01 23:28 , Warren Weckesser wrote: >>>> >>>>> I've been digging into some basic statistics recently, and >>>>> developed the >>>>> following function for applying the chi-square test to a contingency >>>>> table. Does something like this already exist in scipy.stats? If >>>>> not, >>>>> any objects to adding it? (Tests are already written :) >>>>> >>>> Something like this would be great in scipy.stats since I end up doing >>>> the exact same thing by hand whenever I grade introductory statistics >>>> exams. Thanks for writing this! >>>> >> You might find SAS helpful: >> http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/freq_toc.htm >> > > I'm not sure what you mean by this. I have no problem performing the > test, it's just inconvenient that it isn't already a part of scipy.stats Because this is the main SAS procedure that does contingency tables and tests. There is useful information as well. > >> However, this code is the chi-squared test part as SAS will compute the >> actual cell numbers. Also an extension to scipy.stats.chisquare() so we >> can not have both functions. > > Again, I don't understand what you mean that we can't have both > functions? I believe (from a statistics teacher's point of view) that > the Chi-Squared goodness of fit test (which is stats.chisquare) is a > different beast from the Chi-Square test for independence (which is > stats.chisquare_contingency). The fact that the distribution of the > test statistic is the same should not tempt us to put them into the > same function. Please read scipy.stats.chisquare() because scipy.stats.chisquare() is the 1-d case of yours. Quote from the docstring: " The chi square test tests the null hypothesis that the categorical data has the given frequencies." Also go the web site provided in the docstring. By default you get the expected frequencies but you can also put in your own using the f_exp variable. You could do the same in your code. > >> Really this should be combined with fisher.py in ticket 956: >> http://projects.scipy.org/scipy/ticket/956 > > Wow, apparently I have lots of disagreements today, but I don't think > that this should be combined with Fisher's Exact test. (I would like > to see that ticket mature to the point where it can be added to > scipy.stats.) 
I like the functions in scipy.stats to correspond in a > one-to-one manner with the statistical tests. I think that the docs > should "See Also" the appropriate exact (and non-parametric) tests, > but I think that one function/one test is a good rule. This is > particularly true for people (like me) who would like to someday be > able to use scipy.stats in a pedagogical context. > > -Neil I don't see any 'disagreements' rather just different ways to do things and identifying areas that need to be addressed for more general use. I accept your opinion as here only because these functions only accept the digested (ie summarized) data. Bruce From nmb at wartburg.edu Wed Jun 2 14:18:01 2010 From: nmb at wartburg.edu (Neil Martinsen-Burrell) Date: Wed, 02 Jun 2010 13:18:01 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C069E84.4020308@gmail.com> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> Message-ID: <4C06A059.6020901@wartburg.edu> On 2010-06-02 13:10 , Bruce Southey wrote: [...] >>> However, this code is the chi-squared test part as SAS will compute the >>> actual cell numbers. Also an extension to scipy.stats.chisquare() so we >>> can not have both functions. >> >> Again, I don't understand what you mean that we can't have both >> functions? I believe (from a statistics teacher's point of view) that >> the Chi-Squared goodness of fit test (which is stats.chisquare) is a >> different beast from the Chi-Square test for independence (which is >> stats.chisquare_contingency). The fact that the distribution of the >> test statistic is the same should not tempt us to put them into the >> same function. > Please read scipy.stats.chisquare() because scipy.stats.chisquare() is > the 1-d case of yours. > Quote from the docstring: > " The chi square test tests the null hypothesis that the categorical data > has the given frequencies." > Also go the web site provided in the docstring. > > By default you get the expected frequencies but you can also put in your > own using the f_exp variable. You could do the same in your code. In fact, Warren correctly used stats.chisquare with the expected frequencies calculated from the null hypothesis and the corrected degrees of freedom. chisquare_contingency is in some sense a convenience method for taking care of these pre-calculations before calling stats.chisquare. Can you explain more clearly to me why we should not include such a convenience function? >>> Really this should be combined with fisher.py in ticket 956: >>> http://projects.scipy.org/scipy/ticket/956 >> >> Wow, apparently I have lots of disagreements today, but I don't think >> that this should be combined with Fisher's Exact test. (I would like >> to see that ticket mature to the point where it can be added to >> scipy.stats.) I like the functions in scipy.stats to correspond in a >> one-to-one manner with the statistical tests. I think that the docs >> should "See Also" the appropriate exact (and non-parametric) tests, >> but I think that one function/one test is a good rule. This is >> particularly true for people (like me) who would like to someday be >> able to use scipy.stats in a pedagogical context. >> >> -Neil > I don't see any 'disagreements' rather just different ways to do things > and identifying areas that need to be addressed for more general use. Agreed. :) [...] 
-Neil From stefan at sun.ac.za Wed Jun 2 14:23:55 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Jun 2010 11:23:55 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On 2 June 2010 04:22, Ralf Gommers wrote: >>> >> Typically, there is no reason not to have an extended section. ?Can >>> >> you give an example where it would seem unnecessary? > > I think we shouldn't go overboard here. In the great majority of cases it's > needed but sometimes there's just not much info to add besides what's in the > summary and parameter description. Examples: > http://docs.scipy.org/numpy/docs/numpy.core.umath.add/ > http://docs.scipy.org/numpy/docs/numpy.lib.ufunclike.isneginf/ > http://docs.scipy.org/numpy/docs/numpy.core.umath.logical_or/ Thanks, Ralf. Those are the examples I was looking for, and I agree. Regards St?fan From d.l.goldsmith at gmail.com Wed Jun 2 14:35:14 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Wed, 2 Jun 2010 11:35:14 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On Tue, Jun 1, 2010 at 1:32 PM, David Goldsmith wrote: > The docstring Standard seems to be careful to note which sections are > considered optional, and the "Extended Summary" is *not* on that list. > However, > I'm encountering many SciPy docstrings I'm not talking about NumPy docstrings; I'm not looking at/touching NumPy docstrings; I'm only going after low-hangingl SciPy fruit. DG > ; cain the Wiki lacking this section and yet marked as "Needs review": > should I ignore this deficiency and add a ticket to clarify the Standard, or > should such docstrings be moved back to "Being written"? > > DG > -- > Mathematician: noun, someone who disavows certainty when their uncertainty > set is non-empty, even if that set has measure zero. > > Hope: noun, that delusive spirit which escaped Pandora's jar and, with her > lies, prevents mankind from committing a general suicide. (As interpreted > by Robert Graves) > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From bsouthey at gmail.com Wed Jun 2 14:39:17 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Wed, 02 Jun 2010 13:39:17 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C06A059.6020901@wartburg.edu> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> Message-ID: <4C06A555.20803@gmail.com> On 06/02/2010 01:18 PM, Neil Martinsen-Burrell wrote: > On 2010-06-02 13:10 , Bruce Southey wrote: > [...] > >>>> However, this code is the chi-squared test part as SAS will compute >>>> the >>>> actual cell numbers. Also an extension to scipy.stats.chisquare() >>>> so we >>>> can not have both functions. >>> >>> Again, I don't understand what you mean that we can't have both >>> functions? 
I believe (from a statistics teacher's point of view) that >>> the Chi-Squared goodness of fit test (which is stats.chisquare) is a >>> different beast from the Chi-Square test for independence (which is >>> stats.chisquare_contingency). The fact that the distribution of the >>> test statistic is the same should not tempt us to put them into the >>> same function. >> Please read scipy.stats.chisquare() because scipy.stats.chisquare() is >> the 1-d case of yours. >> Quote from the docstring: >> " The chi square test tests the null hypothesis that the categorical >> data >> has the given frequencies." >> Also go the web site provided in the docstring. >> >> By default you get the expected frequencies but you can also put in your >> own using the f_exp variable. You could do the same in your code. > > In fact, Warren correctly used stats.chisquare with the expected > frequencies calculated from the null hypothesis and the corrected > degrees of freedom. chisquare_contingency is in some sense a > convenience method for taking care of these pre-calculations before > calling stats.chisquare. Can you explain more clearly to me why we > should not include such a convenience function? I do not understand you here. Clearly you have not read scipy.stats.chisquare() to know what it is doing. You should also read the cited url including the second part: http://faculty.vassar.edu/lowry/ch8pt2.html I don't see any 'pre-calculations' in the code. You have to compute the 'expected value' for each cell because of the overall null hypothesis. Then you have to sum across all cells the value of (observed-expected)*(observed-expected)/expected to get the test statistic. That is trivial to do within the code and a waste of cpu time and memory to send it to another function to do that. Bruce > >>>> Really this should be combined with fisher.py in ticket 956: >>>> http://projects.scipy.org/scipy/ticket/956 >>> >>> Wow, apparently I have lots of disagreements today, but I don't think >>> that this should be combined with Fisher's Exact test. (I would like >>> to see that ticket mature to the point where it can be added to >>> scipy.stats.) I like the functions in scipy.stats to correspond in a >>> one-to-one manner with the statistical tests. I think that the docs >>> should "See Also" the appropriate exact (and non-parametric) tests, >>> but I think that one function/one test is a good rule. This is >>> particularly true for people (like me) who would like to someday be >>> able to use scipy.stats in a pedagogical context. >>> >>> -Neil >> I don't see any 'disagreements' rather just different ways to do things >> and identifying areas that need to be addressed for more general use. > > Agreed. :) > > [...] > > -Neil -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Wed Jun 2 14:39:54 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Jun 2010 11:39:54 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On 2 June 2010 11:35, David Goldsmith wrote: >> I'm encountering many SciPy docstrings > > I'm not talking about NumPy docstrings; I'm not looking at/touching NumPy > docstrings; I'm only going after low-hangingl SciPy fruit. I think Ralf's point was that we have more important things to do than nitpick around whether some functions should have extended sections or not. Let's get cracking on the many docstrings that are not even close to done. 
Regards Stéfan From josef.pktd at gmail.com Wed Jun 2 14:41:47 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 2 Jun 2010 14:41:47 -0400 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C06A059.6020901@wartburg.edu> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> Message-ID: On Wed, Jun 2, 2010 at 2:18 PM, Neil Martinsen-Burrell wrote: > On 2010-06-02 13:10 , Bruce Southey wrote: > [...] > >>>> However, this code is the chi-squared test part as SAS will compute the >>>> actual cell numbers. Also an extension to scipy.stats.chisquare() so we >>>> can not have both functions. >>> >>> Again, I don't understand what you mean that we can't have both >>> functions? I believe (from a statistics teacher's point of view) that >>> the Chi-Squared goodness of fit test (which is stats.chisquare) is a >>> different beast from the Chi-Square test for independence (which is >>> stats.chisquare_contingency). The fact that the distribution of the >>> test statistic is the same should not tempt us to put them into the >>> same function. >> Please read scipy.stats.chisquare() because scipy.stats.chisquare() is >> the 1-d case of yours. >> Quote from the docstring: >> " The chi square test tests the null hypothesis that the categorical data >> has the given frequencies." >> Also go the web site provided in the docstring. >> >> By default you get the expected frequencies but you can also put in your >> own using the f_exp variable. You could do the same in your code. > > In fact, Warren correctly used stats.chisquare with the expected > frequencies calculated from the null hypothesis and the corrected > degrees of freedom. chisquare_contingency is in some sense a > convenience method for taking care of these pre-calculations before > calling stats.chisquare. Can you explain more clearly to me why we > should not include such a convenience function? Just a clarification, before I find time to work my way through the other comments: stats.chisquare is a generic goodness-of-fit test for discrete or binned distributions. From its docstring: "If no expected frequencies are given, the total N is assumed to be equally distributed across all groups." The default is the uniform distribution. chisquare_twoway is a special case that additionally calculates the correct expected frequencies for the test of independence based on the margin totals. The resulting distribution is not uniform. I agree with Neil that this is a very useful convenience function. I have never heard of a one-way contingency table; my question was whether the function should also handle 3-way or 4-way tables, in addition to two-way. I thought about how the input should be specified for my initial response; the alternative would be to use the original data or a "long" format instead of a table. But I thought that, as a convenience function, using the table format will be the most common use. I have written functions in the past that calculate the contingency table, and it would be very useful to have more complete coverage of tools for working with contingency tables in scipy.stats (or temporarily in statsmodels, where we are also working on the anova type of analysis). So, I think the function is nice the way it is, and we don't have to put all contingency table analysis into this function.
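[A small sketch of the "long format" route Josef mentions: building the R x C table of counts from two label sequences before handing it to the test. The crosstab name is made up here; this is only an illustration, not existing scipy.stats functionality.]

    import numpy as np

    def crosstab(row_labels, col_labels):
        # Count how often each (row label, column label) pair occurs.
        rows, row_idx = np.unique(row_labels, return_inverse=True)
        cols, col_idx = np.unique(col_labels, return_inverse=True)
        counts = np.bincount(row_idx * cols.size + col_idx,
                             minlength=rows.size * cols.size)
        return rows, cols, counts.reshape(rows.size, cols.size)

    # rows, cols, table = crosstab(['a', 'a', 'b'], ['x', 'y', 'x'])
    # table is then the 2 x 2 contingency table to pass to the chi-square test.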
Josef > >>>> Really this should be combined with fisher.py in ticket 956: >>>> http://projects.scipy.org/scipy/ticket/956 >>> >>> Wow, apparently I have lots of disagreements today, but I don't think >>> that this should be combined with Fisher's Exact test. (I would like >>> to see that ticket mature to the point where it can be added to >>> scipy.stats.) I like the functions in scipy.stats to correspond in a >>> one-to-one manner with the statistical tests. I think that the docs >>> should "See Also" the appropriate exact (and non-parametric) tests, >>> but I think that one function/one test is a good rule. This is >>> particularly true for people (like me) who would like to someday be >>> able to use scipy.stats in a pedagogical context. >>> >>> -Neil >> I don't see any 'disagreements' rather just different ways to do things >> and identifying areas that need to be addressed for more general use. > > Agreed. :) > > [...] > > -Neil > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From d.l.goldsmith at gmail.com Wed Jun 2 14:47:05 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Wed, 2 Jun 2010 11:47:05 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: I'm working on the docstrings - is any one else? DG 2010/6/2 St?fan van der Walt > On 2 June 2010 11:35, David Goldsmith wrote: > >> I'm encountering many SciPy docstrings > > > > I'm not talking about NumPy docstrings; I'm not looking at/touching NumPy > > docstrings; I'm only going after low-hangingl SciPy fruit. > > I think Ralf's point was that we have more important things to do than > nitpick around whether some functions should have extended sections or > not. Let's get cracking on the many docstrings that are not even > close to done. > > Regards > St?fan > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Wed Jun 2 14:59:04 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Jun 2010 11:59:04 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On 2 June 2010 11:47, David Goldsmith wrote: > I'm working on the docstrings - is any one else? In the past, there used to be very targeted mini-sprints; are we following a similar process this time? If so, where should we focus our attention? If you post a list of 5 functions that need urgent attention, I'll put in some time to document at least one of them. 
Regards St?fan From stefan at sun.ac.za Wed Jun 2 14:55:24 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Jun 2010 11:55:24 -0700 Subject: [SciPy-Dev] Recent changes to scipy stats In-Reply-To: References: <4D0A9D22-882F-4FCC-82D5-740D332CF7F9@enthought.com> Message-ID: Dear Travis (and others) On 1 June 2010 01:25, Travis Oliphant wrote: > I actually think it very inconsiderate that I should be treated with such rudeness for contributing needed functionality. I was saddened to witness the tone of these conversations, and I wish certain rash personal comments by Charles and David G were rather not made; they certainly don't reflect the attitude of the community as a whole. While you and I have very different approaches to software engineering, I respect the fact that we both aim to achieve the same goal: create a better SciPy. In the past, this spirit of innovation helped to form a remarkably friendly, driven and effective community in which decisions were reached by civil argument and consensus, rather than hard-line rules and policies. Hopefully, we can all return our focus to steering this ship in the same direction. If some technological changes would help with that process, that's well worth investigating (Jarrod and I are nearly ready with a NEP for switching to Github). Kind regards St?fan From njs at pobox.com Wed Jun 2 15:06:29 2010 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 2 Jun 2010 12:06:29 -0700 Subject: [SciPy-Dev] scipy.stats In-Reply-To: <1331DFB9-4FCA-44F3-A1D0-C00714A60511@enthought.com> References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> <3430B5AD-E3C2-4CE7-B07F-D8210C2E53D5@enthought.com> <1331DFB9-4FCA-44F3-A1D0-C00714A60511@enthought.com> Message-ID: On Tue, Jun 1, 2010 at 4:33 PM, Travis Oliphant wrote: > > I really think this is more about how people view commits to the trunk than anything else. ? I like to use SVN as a version control system. ? My commits to trunk are always more incremental. ? I like to get things committed in self-contained chunks. ? Adding the requirement to put in documentation and tests before committing stretches out that "incremental" work element to longer than I ever have time for in one sitting. > > Clearly, if I were using DVCS to a published branch that could be then merged to the trunk this problem would not have arisen. ? I see that I need to move to that style. ? ?People are reading far more into my committing to trunk than I ever meant to imply. I remember when I first started hacking free software, this was the model that *every* project used, and when people started talking about "always releasable trunks" it seemed like the weirdest, most unlikely concept ever. (I guess that makes this a generational thing?) Having finally wrapped my head around it on a few other projects, though, I can't imagine ever going back. Those "rules" and "procedures" are about as jackbooted as a dayplanner or a todo list... they let us avoid all the stress of having to remember which pieces *have* to get added before a release can happen, accidentally crashing into other people's work, having big debates, etc.; we can just get on with hacking and the resulting code is even better. (Because *everyone*'s code is better for being reviewed and tested. Even mine!) The other thing that helped reconcile me to this style of development was figuring out how to make testing less of a chore. 
Personally, I can't deal with TDD -- I don't understand how people know what the API should look like (to write the test) until they've written the implementation! But a much simpler method works for me: I never would commit code without at least *running* it, so now I've trained myself to just type those "hey, does this thing I just wrote work at *all*?" lines into a test function instead of a REPL. And while I'm sure there are all sorts of wonderful virtues and maintenance benefits to having a test suite, the real reason I do this is discovering that while I'm actually hacking, it's way easier to hit the 're-run tests' button than it is to re-copy/paste that line of code into the REPL. Kind of embarrassing in retrospect... No idea how any of this applies to others, but maybe someone will find it useful. -- Nathaniel From matthew.brett at gmail.com Wed Jun 2 15:51:34 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 2 Jun 2010 12:51:34 -0700 Subject: [SciPy-Dev] Recent changes to scipy stats In-Reply-To: References: <4D0A9D22-882F-4FCC-82D5-740D332CF7F9@enthought.com> Message-ID: Hi, > On 1 June 2010 01:25, Travis Oliphant wrote: >> I actually think it very inconsiderate that I should be treated with such rudeness for contributing needed functionality. > > I was saddened to witness the tone of these conversations, and I wish > certain rash personal comments by Charles and David G were rather not > made; they certainly don't reflect the attitude of the community as a > whole. Well - hold on though. Of course we should call people out on being personally offensive - but if we're going to do that, we should do it at the time of the email - directly to that person - it's only fair. And - I think we have to be careful also to defend our ability to be direct and honest when then are problems that need to be addressed. Any community needs that in order to grow, I believe. See you, Matthew From bsouthey at gmail.com Wed Jun 2 16:03:07 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Wed, 02 Jun 2010 15:03:07 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> Message-ID: <4C06B8FB.8080806@gmail.com> On 06/02/2010 01:41 PM, josef.pktd at gmail.com wrote: > On Wed, Jun 2, 2010 at 2:18 PM, Neil Martinsen-Burrell wrote: > >> On 2010-06-02 13:10 , Bruce Southey wrote: >> [...] >> >> >>>>> However, this code is the chi-squared test part as SAS will compute the >>>>> actual cell numbers. Also an extension to scipy.stats.chisquare() so we >>>>> can not have both functions. >>>>> >>>> Again, I don't understand what you mean that we can't have both >>>> functions? I believe (from a statistics teacher's point of view) that >>>> the Chi-Squared goodness of fit test (which is stats.chisquare) is a >>>> different beast from the Chi-Square test for independence (which is >>>> stats.chisquare_contingency). The fact that the distribution of the >>>> test statistic is the same should not tempt us to put them into the >>>> same function. >>>> >>> Please read scipy.stats.chisquare() because scipy.stats.chisquare() is >>> the 1-d case of yours. >>> Quote from the docstring: >>> " The chi square test tests the null hypothesis that the categorical data >>> has the given frequencies." >>> Also go the web site provided in the docstring. 
>>> >>> By default you get the expected frequencies but you can also put in your >>> own using the f_exp variable. You could do the same in your code. >>> >> In fact, Warren correctly used stats.chisquare with the expected >> frequencies calculated from the null hypothesis and the corrected >> degrees of freedom. chisquare_contingency is in some sense a >> convenience method for taking care of these pre-calculations before >> calling stats.chisquare. Can you explain more clearly to me why we >> should not include such a convenience function? >> > Just a clarification, before I find time to work my way through the > other comments > > stats.chisquare is a generic test for goodness-of-fit for discreted or > binned distributions. > and from the docstring of it > "If no expected frequencies are given, the total > N is assumed to be equally distributed across all groups." > > default is uniform distribution > > Try: http://en.wikipedia.org/wiki/Pearson's_chi-square_test The use of the uniform distribution is rather misleading and technically wrong as it does not help address the expected number of outcomes in a cell: http://en.wikipedia.org/wiki/Discrete_uniform_distribution > chisquare_twoway is a special case that additional calculates the > correct expected frequencies for the test of independencs based on the > margin totals. The resulting distribution is not uniform. > Actually the null hypothesis is rather different between 1-way and 2-way tables so you can not say that chisquare_twoway is a special case of chisquare. I am not sure what you mean by the 'resulting distribution is not uniform'. The distribution of the cells values has nothing to do with the uniform distribution in either case because it is not used in the data nor in the formulation of the test. (And, yes, I have had to do the proof that the test statistic is Chi-squared - which is why there is the warning about small cells...). > I agree with Neil that this is a very useful convenience function. > My problem with the chisquare_twoway is that it should not call another function to finish two lines of code. It is just an excessive waste of resources. > I never heard of a one-way contingency table, my question was whether > the function should also handle 3-way or 4-way tables, additional to > two-way. > Correct to both of these as I just consider these as n-way tables. I think that contingency tables by definition only applies to the 2-d case. Pivot tables are essentially the same thing. I would have to lookup on how to get the expected number of outcomes but probably of the form Ni.. * N.j. *N..k/N... for the 3-way (the 2-way table is of the form Ni.*N.j/N..) for i=rows, j=columns, k=3rd axis and '.' means sum for that axis. > I thought about the question how the input should be specified for my > initial response, the alternative would be to use the original data or > a "long" format instead of a table. But I thought that as a > convenience function using the table format will be the most common > use. > I have written in the past functions that calculate the contingency > table, and would be very useful to have a more complete coverage of > tools to work with contingency tables in scipy.stats (or temporarily > in statsmodels, where we are working also on the anova type of > analysis) > It depends on what tasks are needed. Really there are two steps: 1) Cross-tabulation that summarized the data from whatever input (groupby would help here). 2) Statistical tests - series of functions that accept summarized data only. 
If you have separate functions then the burden is on the user to find and call all the desired functions. You can also provide a single helper function to do all that because you don't want to repeat unnecessary calls. > So, I think the way it is it is a nice function and we don't have to > put all contingency table analysis into this function. > > Josef > Bruce > >> >>>>> Really this should be combined with fisher.py in ticket 956: >>>>> http://projects.scipy.org/scipy/ticket/956 >>>>> >>>> Wow, apparently I have lots of disagreements today, but I don't think >>>> that this should be combined with Fisher's Exact test. (I would like >>>> to see that ticket mature to the point where it can be added to >>>> scipy.stats.) I like the functions in scipy.stats to correspond in a >>>> one-to-one manner with the statistical tests. I think that the docs >>>> should "See Also" the appropriate exact (and non-parametric) tests, >>>> but I think that one function/one test is a good rule. This is >>>> particularly true for people (like me) who would like to someday be >>>> able to use scipy.stats in a pedagogical context. >>>> >>>> -Neil >>>> >>> I don't see any 'disagreements' rather just different ways to do things >>> and identifying areas that need to be addressed for more general use. >>> >> Agreed. :) >> >> [...] >> >> -Neil >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From oliphant at enthought.com Wed Jun 2 16:21:32 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Wed, 2 Jun 2010 15:21:32 -0500 Subject: [SciPy-Dev] Difference between scipy.stats.gengamma and scipy.stats.distributions.gengamma In-Reply-To: References: Message-ID: <5FDEAF11-E3B4-4285-A8C7-8E1676201466@enthought.com> On Jun 2, 2010, at 12:26 AM, josef.pktd at gmail.com wrote: > On Wed, Jun 2, 2010 at 1:09 AM, David Goldsmith wrote: >> Is there a difference between these two? Same question for stats.lognorm >> and stats.distributions.lognorm? Thanks. > > No, they are the same instance of the distribution > > scipy.stats.__init__ has a from distributions import * or something like this In general, the original design concept in scipy name-spaces is that names should not be imported from their "leaf-node", but from somewhere higher up. The fact that the distribution objects are in scipy.stats.distributions should not be relied upon. This is the same philosophy in NumPy (i.e. you shouldn't import things from numpy.core or numpy.lib directly). -Travis > > Josef > >> >> DG >> >> -- >> Mathematician: noun, someone who disavows certainty when their uncertainty >> set is non-empty, even if that set has measure zero. >> >> Hope: noun, that delusive spirit which escaped Pandora's jar and, with her >> lies, prevents mankind from committing a general suicide. (As interpreted >> by Robert Graves) >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev --- Travis Oliphant Enthought, Inc. 
oliphant at enthought.com 1-512-536-1057 http://www.enthought.com From oliphant at enthought.com Wed Jun 2 16:23:44 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Wed, 2 Jun 2010 15:23:44 -0500 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> <3430B5AD-E3C2-4CE7-B07F-D8210C2E53D5@enthought.com> <1331DFB9-4FCA-44F3-A1D0-C00714A60511@enthought.com> Message-ID: On Jun 2, 2010, at 2:06 PM, Nathaniel Smith wrote: > On Tue, Jun 1, 2010 at 4:33 PM, Travis Oliphant wrote: >> >> I really think this is more about how people view commits to the trunk than anything else. I like to use SVN as a version control system. My commits to trunk are always more incremental. I like to get things committed in self-contained chunks. Adding the requirement to put in documentation and tests before committing stretches out that "incremental" work element to longer than I ever have time for in one sitting. >> >> Clearly, if I were using DVCS to a published branch that could be then merged to the trunk this problem would not have arisen. I see that I need to move to that style. People are reading far more into my committing to trunk than I ever meant to imply. > > I remember when I first started hacking free software, this was the > model that *every* project used, and when people started talking about > "always releasable trunks" it seemed like the weirdest, most unlikely > concept ever. (I guess that makes this a generational thing?) Having > finally wrapped my head around it on a few other projects, though, I > can't imagine ever going back. Those "rules" and "procedures" are > about as jackbooted as a dayplanner or a todo list... they let us > avoid all the stress of having to remember which pieces *have* to get > added before a release can happen, accidentally crashing into other > people's work, having big debates, etc.; we can just get on with > hacking and the resulting code is even better. (Because *everyone*'s > code is better for being reviewed and tested. Even mine!) > > The other thing that helped reconcile me to this style of development > was figuring out how to make testing less of a chore. Personally, I > can't deal with TDD -- I don't understand how people know what the API > should look like (to write the test) until they've written the > implementation! But a much simpler method works for me: I never would > commit code without at least *running* it, so now I've trained myself > to just type those "hey, does this thing I just wrote work at *all*?" > lines into a test function instead of a REPL. And while I'm sure there > are all sorts of wonderful virtues and maintenance benefits to having > a test suite, the real reason I do this is discovering that while I'm > actually hacking, it's way easier to hit the 're-run tests' button > than it is to re-copy/paste that line of code into the REPL. Kind of > embarrassing in retrospect... > > No idea how any of this applies to others, but maybe someone will find > it useful. I found it very useful. Thanks for sharing your experience. -Travis > > -- Nathaniel > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev --- Travis Oliphant Enthought, Inc. 
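As an illustration of moving a quick interactive check into a test function that nose can re-run -- everything here (mymodule, smooth) is hypothetical -- the lines one would otherwise paste into a REPL become:

    import numpy as np
    from numpy.testing import assert_almost_equal
    from mymodule import smooth   # hypothetical function under development

    def test_smooth_runs_at_all():
        # The "does this thing I just wrote work at *all*?" check.
        x = np.array([1.0, 2.0, 4.0, 2.0, 1.0])
        y = smooth(x, width=3)
        assert y.shape == x.shape
        assert_almost_equal(y[2], (2.0 + 4.0 + 2.0) / 3.0)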
oliphant at enthought.com 1-512-536-1057 http://www.enthought.com From ilanschnell at gmail.com Wed Jun 2 18:17:11 2010 From: ilanschnell at gmail.com (Ilan Schnell) Date: Wed, 2 Jun 2010 17:17:11 -0500 Subject: [SciPy-Dev] import error in scipy.stats on RH3 32-bit Message-ID: Hello group, I'm not exactly sure what has changed in scipy.stats, but building and importing all extensions with the first 0.8.x brach (revision 6446) worked fine on CentOS release 3.9 (32-bit). Now (revision 6476), I can still build everything, but when I try to import scipy.stats.vonmises_cython, I get the following unresolved symbol: ImportError: /home/tester/master/lib/python2.6/site-packages/scipy/linalg/clapack.so: undefined symbol: clapack_sgesv Strangely, I don't get this import error on any platform (64-bit/32-bit, Windows, MaxOSX, Redhat 5, Solaris). Does anyone know what could be going on here? - Ilan From ben.root at ou.edu Wed Jun 2 19:03:36 2010 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 2 Jun 2010 18:03:36 -0500 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: You may have my keyboard! Seriously, though, I just registered a username on the wiki (weathergod) and I would like to help out with documentation. Ben Root On Wed, Jun 2, 2010 at 1:47 PM, David Goldsmith wrote: > I'm working on the docstrings - is any one else? > > DG > > 2010/6/2 St?fan van der Walt > > On 2 June 2010 11:35, David Goldsmith wrote: >> >> I'm encountering many SciPy docstrings >> > >> > I'm not talking about NumPy docstrings; I'm not looking at/touching >> NumPy >> > docstrings; I'm only going after low-hangingl SciPy fruit. >> >> I think Ralf's point was that we have more important things to do than >> nitpick around whether some functions should have extended sections or >> not. Let's get cracking on the many docstrings that are not even >> close to done. >> >> Regards >> St?fan >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > > > > -- > Mathematician: noun, someone who disavows certainty when their uncertainty > set is non-empty, even if that set has measure zero. > > Hope: noun, that delusive spirit which escaped Pandora's jar and, with her > lies, prevents mankind from committing a general suicide. (As interpreted > by Robert Graves) > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Wed Jun 2 19:27:14 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Jun 2010 16:27:14 -0700 Subject: [SciPy-Dev] Recent changes to scipy stats In-Reply-To: References: <4D0A9D22-882F-4FCC-82D5-740D332CF7F9@enthought.com> Message-ID: On 2 June 2010 12:51, Matthew Brett wrote: >> I was saddened to witness the tone of these conversations, and I wish >> certain rash personal comments by Charles and David G were rather not >> made; they certainly don't reflect the attitude of the community as a >> whole. > > Well - hold on though. ?Of course we should call people out on being > personally offensive - but if we're going to do that, we should do it > at the time of the email - directly to that person - it's only fair. 
Unfortunately, those comments were made in public; if we express our disagreement in private only, the offended party would never even be aware of any disagreement in the community. As for calling people out at time of writing, such a time limit suggests that any distress caused is similarly limited, which it is not. > And - I think we have to be careful also to defend our ability to be > direct and honest when then are problems that need to be addressed. > Any community needs that in order to grow, I believe. Yes, absolutely: direct and honest discourse is great. Offensive statements are not. Regards St?fan From matthew.brett at gmail.com Wed Jun 2 19:34:34 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 2 Jun 2010 16:34:34 -0700 Subject: [SciPy-Dev] scipy.stats In-Reply-To: References: <18D26A1A-0164-4D80-8619-BAC28FC33D11@enthought.com> <3430B5AD-E3C2-4CE7-B07F-D8210C2E53D5@enthought.com> <1331DFB9-4FCA-44F3-A1D0-C00714A60511@enthought.com> Message-ID: Hi, > The other thing that helped reconcile me to this style of development > was figuring out how to make testing less of a chore. Personally, I > can't deal with TDD -- I don't understand how people know what the API > should look like (to write the test) until they've written the > implementation! I just thought I'd pitch in with this one, because there can be confusion between writing code with tests, and test-driven-development. My understanding is that there is good objective evidence that test-driven-development improves code quality, but it takes a lot of discipline until you are used to it. In my experience it's most important precisely for defining the API, because, in writing the tests, you start defining what the API will look like, and then I find that my API is pretty bad and I change it before I've written the code. But - if TDD is an ideal - it is of course a matter of personal practice. But - having tests for your code - developed before, after, or during your code - that's really important for having maintainable code - as I'm sure we all agree. And - yes - absolutely - if you are doing _any_ kind of testing when developing, please do check that in, even if that's all you've got - at least it's something, Thanks a lot, Matthew From stefan at sun.ac.za Wed Jun 2 20:02:40 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Jun 2010 17:02:40 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On 2 June 2010 16:03, Benjamin Root wrote: > You may have my keyboard! > > Seriously, though, I just registered a username on the wiki (weathergod) and > I would like to help out with documentation. Added, and welcome! Cheers St?fan From warren.weckesser at enthought.com Wed Jun 2 20:04:11 2010 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Wed, 02 Jun 2010 19:04:11 -0500 Subject: [SciPy-Dev] Mea culpa: deprecation and API changes In-Reply-To: <4C044B0F.4000103@enthought.com> References: <4C012BFE.4090103@enthought.com> <4C0149EB.8030608@enthought.com> <4C03F00F.3020806@enthought.com> <4C044B0F.4000103@enthought.com> Message-ID: <4C06F17B.7040209@enthought.com> Warren Weckesser wrote: > Opinion wanted: codata.find(sub) used to print a list of strings. A > while ago, in response to http://projects.scipy.org/scipy/ticket/996, I > changed it to return the list of strings. But this is an API change, > and should follow the deprecation policy. 
One way to do this is to > restore find() to its previous behavior, and deprecate the function. At > the same time, add a new function, find_string(sub), which returns the > list of strings. What do you think? > > Instead of creating a new function, I added a keyword argument whose default value (True) preserves the old behavior. When it is False, it returns the keys instead of printing them. In 0.9, the default behavior will be reversed. Warren > Warren > > > Warren Weckesser wrote: > >> David Cournapeau wrote: >> >> >>> On Sun, May 30, 2010 at 2:07 AM, Warren Weckesser >>> wrote: >>> >>> >>> >>>> David Cournapeau wrote: >>>> >>>> >>>> >>>>> On Sun, May 30, 2010 at 12:00 AM, Warren Weckesser >>>>> wrote: >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>>> What I would like to do is leave trunk as it is, and after 0.8 is >>>>>> branched, make the appropriate changes in the branch to follow the >>>>>> deprecation policy. Is that a reasonable approach? >>>>>> >>>>>> >>>>>> >>>>>> >>>>> May I ask why do you want to do that way ? >>>>> >>>>> >>>>> >>>> Because it doesn't look like I will have time to make the changes before >>>> Ralf branches 0.8 tomorrow. >>>> >>>> >>>> >>>> >>>>> Putting the deprecation in >>>>> the release branch means people tracking trunk will never see them. >>>>> >>>>> >>>>> >>>>> >>>> Good point. But in case I am misinterpreting what you mean by >>>> "tracking trunk" and "see": I assume this means it is important to have >>>> a record of the deprecation changes in the svn logs, and not that some >>>> who is *using* scipy from trunk also needs to be exposed to the >>>> deprecation warning for some minimum amount of time. >>>> >>>> >>>> >>> actually, I meant both. For example, I often use scipy from trunk, and >>> rarely from releases. I will never see the deprecation, which is not >>> good. >>> >>> Also, I think we should generally try to never put things in release >>> branches, but always backport from trunk (except for branch specific >>> changes). Having the 0.8 branch created tomorrow does not mean you >>> cannot put the changes into trunk, and backport them in 0.8 later - >>> deprecation which were already agreed on are the kind of things which >>> can happen after the branching without putting much burden on the >>> release process. >>> >>> >>> >>> >>>> If the changes are >>>> made to trunk, then they will be undone immediately after 0.8 is >>>> branched. >>>> >>>> >>>> >>> deprecated features do not be to be removed just after the trunk is >>> opened for the next release cycle (0.9 here). >>> >>> >>> >>> >>>> ever have a copy that includes the deprecation warnings. In other >>>> words, deprecations are linked to releases, not to "time in trunk". >>>> >>>> >>>> >>> Indeed - but I think that we should let the deprecation be in place >>> for as long as possible in the source code repository. >>> >>> >>> >>> >> OK. It might be a couple more days before I can make the reversions and >> deprecations, but I'll get them in before the beta release on June 6. 
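A rough sketch of the transition pattern being described: a keyword argument whose default keeps the old printing behavior, plus a DeprecationWarning until the default flips in 0.9. The table and the keyword name below are stand-ins for illustration, not the actual codata module:

    import warnings

    # Stand-in for the codata table of physical constants.
    _table = {'speed of light in vacuum': 299792458.0,
              'Planck constant': 6.62606896e-34}

    def find(sub, disp=True):
        keys = [k for k in _table if sub.lower() in k.lower()]
        if disp:
            # Old behavior: print the matching keys and return nothing.
            warnings.warn("the default behavior of find() will change in "
                          "0.9; pass disp=False to get the keys returned",
                          DeprecationWarning)
            for k in keys:
                print(k)
            return None
        # New behavior: return the keys.
        return keys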
>> >> Warren >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From vincent at vincentdavis.net Wed Jun 2 20:11:09 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Wed, 2 Jun 2010 18:11:09 -0600 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: I just setup an account. vincentdavis I assume I will find instruction but how does the review/commit of updates work (in brief) Thanks Vincent 2010/6/2 St?fan van der Walt : > On 2 June 2010 16:03, Benjamin Root wrote: >> You may have my keyboard! >> >> Seriously, though, I just registered a username on the wiki (weathergod) and >> I would like to help out with documentation. > > Added, and welcome! > > Cheers > St?fan > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From charlesr.harris at gmail.com Wed Jun 2 21:14:59 2010 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 2 Jun 2010 19:14:59 -0600 Subject: [SciPy-Dev] import error in scipy.stats on RH3 32-bit In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 4:17 PM, Ilan Schnell wrote: > Hello group, > > I'm not exactly sure what has changed in scipy.stats, but > building and importing all extensions with the first 0.8.x brach > (revision 6446) worked fine on CentOS release 3.9 (32-bit). > > Does revision 6446 still work? The vonmises distribution hasn't been touched in a long time. Now (revision 6476), I can still build everything, but when > I try to import scipy.stats.vonmises_cython, I get the following > unresolved symbol: > ImportError: > /home/tester/master/lib/python2.6/site-packages/scipy/linalg/clapack.so: > undefined symbol: clapack_sgesv > > Strangely, I don't get this import error on any platform (64-bit/32-bit, > Windows, MaxOSX, Redhat 5, Solaris). > > Does anyone know what could be going on here? > > Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From rmay31 at gmail.com Wed Jun 2 21:32:44 2010 From: rmay31 at gmail.com (Ryan May) Date: Wed, 2 Jun 2010 20:32:44 -0500 Subject: [SciPy-Dev] Mea culpa: deprecation and API changes In-Reply-To: <4C06F17B.7040209@enthought.com> References: <4C012BFE.4090103@enthought.com> <4C0149EB.8030608@enthought.com> <4C03F00F.3020806@enthought.com> <4C044B0F.4000103@enthought.com> <4C06F17B.7040209@enthought.com> Message-ID: On Wed, Jun 2, 2010 at 7:04 PM, Warren Weckesser wrote: > Warren Weckesser wrote: >> Opinion wanted: ?codata.find(sub) used to print a list of strings. ?A >> while ago, in response to http://projects.scipy.org/scipy/ticket/996, ?I >> changed it to return the list of strings. ?But this is an API change, >> and should follow the deprecation policy. ?One way to do this is to >> restore find() to its previous behavior, and deprecate the function. ?At >> the same time, add a new function, find_string(sub), which returns the >> list of strings. ?What do you think? >> >> > > Instead of creating a new function, I added a keyword argument whose > default value (True) preserves the old behavior. ?When it is False, it > returns the keys instead of printing them. ?In 0.9, the default behavior > will be reversed. 
Why not always return the list and just make only the print controlled by the kwarg? That way the return type of the function doesn't depend on a kwarg, which IIRC is considered bad style. You won't break existing code, which will just ignore the new return value. Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma From warren.weckesser at enthought.com Wed Jun 2 21:45:53 2010 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Wed, 02 Jun 2010 20:45:53 -0500 Subject: [SciPy-Dev] Mea culpa: deprecation and API changes In-Reply-To: References: <4C012BFE.4090103@enthought.com> <4C0149EB.8030608@enthought.com> <4C03F00F.3020806@enthought.com> <4C044B0F.4000103@enthought.com> <4C06F17B.7040209@enthought.com> Message-ID: <4C070951.8070209@enthought.com> Ryan May wrote: > On Wed, Jun 2, 2010 at 7:04 PM, Warren Weckesser > wrote: > >> Warren Weckesser wrote: >> >>> Opinion wanted: codata.find(sub) used to print a list of strings. A >>> while ago, in response to http://projects.scipy.org/scipy/ticket/996, I >>> changed it to return the list of strings. But this is an API change, >>> and should follow the deprecation policy. One way to do this is to >>> restore find() to its previous behavior, and deprecate the function. At >>> the same time, add a new function, find_string(sub), which returns the >>> list of strings. What do you think? >>> >>> >>> >> Instead of creating a new function, I added a keyword argument whose >> default value (True) preserves the old behavior. When it is False, it >> returns the keys instead of printing them. In 0.9, the default behavior >> will be reversed. >> > > Why not always return the list and just make only the print controlled > by the kwarg? That way the return type of the function doesn't depend > on a kwarg, which IIRC is considered bad style. You won't break > existing code, which will just ignore the new return value. > That seemed the most conservative approach, despite being bad style. It can all be cleaned up in 0.9 anyway. I'm currently working on "fixing" signal.waveforms.chirp to maintain compatibility for one release cycle. More judgment calls will be required, and I'm sure that not everyone would do it the same way. Anyone want to write the official "SciPy Developers Deprecation Guidelines (with recommended patterns of deprecation and a bunch of use-cases)"? Warren > Ryan > > From d.l.goldsmith at gmail.com Wed Jun 2 21:54:30 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Wed, 2 Jun 2010 18:54:30 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 5:11 PM, Vincent Davis wrote: > I just setup an account. vincentdavis > I assume I will find instruction but how does the review/commit of > updates work (in brief) > Ah, good question, with a somewhat complicated answer, I'm afraid. At a minimum, when a writer/editor feels that a docstring is "done," s/he "promotes" it to "Needs review" status. In addition, since we feel that, very generally speaking, a "Needs review" docstring is in a more advanced state than whatever is in the current distribution, s/he also marks the docstring as "OK to apply Yes." 
Then, eventually, two things happen: a release manager/worker comes along and merges "OK to apply Yes" docstrings into the source code, and a reviewer - different than the writer/editor(s) who worked on the docstring - comes along, reviews the docstring, and either promotes it to "Reviewed, needs proof" or demotes it to "Reviewed, needs work." Here is where it gets a little "complicated." The review effort, which _NumPy_ is largely ready for, is stalled pending implementation of enhancements to the Wiki to support a dual review system: in the past, parties have found reviewed and proofed, i.e., "finalized" docstrings which are either pretty unclear, or were pretty clear but had technical deficiencies. Consequently, we've been wanting to implement a system whereby each docstring must pass both a technical and a "presentation" review, but, as I said, the Wiki presently doesn't support this. Joe Harrington and myself have been trying to line up the labor to get this done, so far unsuccessfully. I think there may soon be an announcement concerning this... (The delay in the review process is at least in part why we've opted to go ahead and start incorporating "unfinalized" docstrings into the source.) Welcome aboard, and thanks! DG > > Thanks > Vincent > > 2010/6/2 St?fan van der Walt : > > On 2 June 2010 16:03, Benjamin Root wrote: > >> You may have my keyboard! > >> > >> Seriously, though, I just registered a username on the wiki (weathergod) > and > >> I would like to help out with documentation. > > > > Added, and welcome! > > > > Cheers > > St?fan > > _______________________________________________ > > SciPy-Dev mailing list > > SciPy-Dev at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at enthought.com Wed Jun 2 22:06:30 2010 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Wed, 02 Jun 2010 21:06:30 -0500 Subject: [SciPy-Dev] Mea culpa: deprecation and API changes In-Reply-To: <4C070951.8070209@enthought.com> References: <4C012BFE.4090103@enthought.com> <4C0149EB.8030608@enthought.com> <4C03F00F.3020806@enthought.com> <4C044B0F.4000103@enthought.com> <4C06F17B.7040209@enthought.com> <4C070951.8070209@enthought.com> Message-ID: <4C070E26.3060306@enthought.com> Warren Weckesser wrote: > Ryan May wrote: > >> On Wed, Jun 2, 2010 at 7:04 PM, Warren Weckesser >> wrote: >> >> >>> Warren Weckesser wrote: >>> >>> >>>> Opinion wanted: codata.find(sub) used to print a list of strings. A >>>> while ago, in response to http://projects.scipy.org/scipy/ticket/996, I >>>> changed it to return the list of strings. But this is an API change, >>>> and should follow the deprecation policy. One way to do this is to >>>> restore find() to its previous behavior, and deprecate the function. At >>>> the same time, add a new function, find_string(sub), which returns the >>>> list of strings. What do you think? 
>>>> >>>> >>>> >>>> >>> Instead of creating a new function, I added a keyword argument whose >>> default value (True) preserves the old behavior. When it is False, it >>> returns the keys instead of printing them. In 0.9, the default behavior >>> will be reversed. >>> >>> >> Why not always return the list and just make only the print controlled >> by the kwarg? That way the return type of the function doesn't depend >> on a kwarg, which IIRC is considered bad style. You won't break >> existing code, which will just ignore the new return value. >> >> > > That seemed the most conservative approach, despite being bad style. It > can all be cleaned up in 0.9 anyway. > > I'm currently working on "fixing" signal.waveforms.chirp to maintain > compatibility for one release cycle. More judgment calls will be > required, and I'm sure that not everyone would do it the same way. > > Anyone want to write the official "SciPy Developers Deprecation > Guidelines (with recommended patterns of deprecation and a bunch of > use-cases)"? > Hmmm... perhaps that should be "Developers' Deprecation Guidelines". Without the apostrophe, it could mean something else. :) > Warren > > >> Ryan >> >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From vincent at vincentdavis.net Wed Jun 2 22:09:39 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Wed, 2 Jun 2010 20:09:39 -0600 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 7:54 PM, David Goldsmith wrote: > On Wed, Jun 2, 2010 at 5:11 PM, Vincent Davis > wrote: >> >> I just setup an account. vincentdavis >> I assume I will find instruction but how does the review/commit of >> updates work (in brief) > > Ah, good question, with a somewhat complicated answer, I'm afraid.? At a > minimum, when a writer/editor feels that a docstring is "done," s/he > "promotes" it to "Needs review" status.? In addition, since we feel that, > very generally speaking, a "Needs review" docstring is in a more advanced > state than whatever is in the current distribution, s/he also marks the > docstring as "OK to apply Yes."? Then, eventually, two things happen: a > release manager/worker comes along and merges "OK to apply Yes" docstrings > into the source code, and a reviewer - different than the writer/editor(s) > who worked on the docstring - comes along, reviews the docstring, and either > promotes it to "Reviewed, needs proof" or demotes it to "Reviewed, needs > work." > > Here is where it gets a little "complicated."? The review effort, which > _NumPy_ is largely ready for, is stalled pending implementation of > enhancements to the Wiki to support a dual review system: in the past, > parties have found reviewed and proofed, i.e., "finalized" docstrings which > are either pretty unclear, or were pretty clear but had technical > deficiencies.? Consequently, we've been wanting to implement a system > whereby each docstring must pass both a technical and a "presentation" > review, but, as I said, the Wiki presently doesn't support this.? Joe > Harrington and myself have been trying to line up the labor to get this > done, so far unsuccessfully.? I think there may soon be an announcement > concerning this...? (The delay in the review process is at least in part why > we've opted to go ahead and start incorporating "unfinalized" docstrings > into the source.) 
As I am always interested in learning new things is there any help I can offer in getting the wiki review feature implemented? Thanks for the summary, this clears up a few of the question I had after looking over things. Vincent > > Welcome aboard, and thanks! > > DG >> >> Thanks >> Vincent >> >> 2010/6/2 St?fan van der Walt : >> > On 2 June 2010 16:03, Benjamin Root wrote: >> >> You may have my keyboard! >> >> >> >> Seriously, though, I just registered a username on the wiki >> >> (weathergod) and >> >> I would like to help out with documentation. >> > >> > Added, and welcome! >> > >> > Cheers >> > St?fan >> > _______________________________________________ >> > SciPy-Dev mailing list >> > SciPy-Dev at scipy.org >> > http://mail.scipy.org/mailman/listinfo/scipy-dev >> > >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > -- > Mathematician: noun, someone who disavows certainty when their uncertainty > set is non-empty, even if that set has measure zero. > > Hope: noun, that delusive spirit which escaped Pandora's jar and, with her > lies, prevents mankind from committing a general suicide. ?(As interpreted > by Robert Graves) > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From vincent at vincentdavis.net Wed Jun 2 22:22:49 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Wed, 2 Jun 2010 20:22:49 -0600 Subject: [SciPy-Dev] Warning of deprecation in doc's ? Message-ID: For example scipy.stats.stats.cov when you view source has "scipy.stats.cov is deprecated; please update your code to use numpy.cov." Should this be in the docs ? and is there an example of how this should be pointed out. This is something I actually implemented in a program then discovered that is was deprecated. I would have like that to be in the online docs. Thanks Vincent From vincent at vincentdavis.net Wed Jun 2 22:30:49 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Wed, 2 Jun 2010 20:30:49 -0600 Subject: [SciPy-Dev] Summer Marathon Skypecon tomorrow In-Reply-To: References: Message-ID: On Thu, May 27, 2010 at 11:47 AM, David Goldsmith wrote: > So far, no one has RSVP-ed (positive or negative).? Is this due to: > > A) Lack of time; > B) Bad scheduling (i.e., you have time, just not at the time we've chosen); > C) Lack of interest; > D) Lack of issues to discuss (i.e., you have interest, but not in a meeting > without a specific aganda); > E) Bad choice of conference media (i.e., can't/won't do Skype); > F) Just forgot to RSVP; > G) Some of the above; > H) Other/None of the above? > > If I haven't heard from anyone, in the positive, by midnight tonight, EDT, > this week's Skypecon is canceled. > I thought it "would" happen and hoped to be able to listen, (Unlikely I have anything to contribute) I was unsure I would be available so didn't rsvp. It might be nice to record the skype call and make it available as a sudo podcast. It might be nice for those that miss it and might be a nice weekly update on scipy, this I could possibly help with. 
Thanks Vincent > DG > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From luis94855510 at gmail.com Wed Jun 2 22:44:33 2010 From: luis94855510 at gmail.com (Luis Saavedra) Date: Wed, 2 Jun 2010 22:44:33 -0400 Subject: [SciPy-Dev] how to get "help docs" in other languages Message-ID: Hi all, that is my problem... how to get "Help" sections or "guide for documentation authors" in other languages for my project, in a automagical way :P, ?that is a request for feature? regards, Luis -------------- next part -------------- An HTML attachment was scrubbed... URL: From luis94855510 at gmail.com Wed Jun 2 22:57:54 2010 From: luis94855510 at gmail.com (Luis Saavedra) Date: Wed, 2 Jun 2010 22:57:54 -0400 Subject: [SciPy-Dev] how to get "help docs" in other languages In-Reply-To: References: Message-ID: ups,sorry for the noise, that list is for scipy not for sphinx O_o 2010/6/2 Luis Saavedra > Hi all, > > that is my problem... how to get "Help" sections or "guide for > documentation authors" in other languages for my project, in a automagical > way :P, ?that is a request for feature? > > regards, > Luis > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ilanschnell at gmail.com Wed Jun 2 23:03:53 2010 From: ilanschnell at gmail.com (Ilan Schnell) Date: Wed, 2 Jun 2010 22:03:53 -0500 Subject: [SciPy-Dev] import error in scipy.stats on RH3 32-bit In-Reply-To: References: Message-ID: Hello Chuck, yes 6446 works. Actually, as the error indicates, the unresolved symbol in is linalg/clapack.so, it just happened that during my testing the stats package was imported first, so I initially thought the error was there. However, something has changed between 6446 and 6476, as I wasn't seeing this error before. Looking at the revision log of the 0.8.x branch, but I cannot see any obvious. And I'm also puzzled why this only happens on one particular platform. To make sure the build environment hasn't changed, I rebuild 6446 on the same system, and it still works. - Ilan On Wed, Jun 2, 2010 at 8:14 PM, Charles R Harris wrote: > > > On Wed, Jun 2, 2010 at 4:17 PM, Ilan Schnell wrote: >> >> Hello group, >> >> I'm not exactly sure what has changed in scipy.stats, but >> building and importing all extensions with the first 0.8.x brach >> (revision 6446) worked fine on CentOS release 3.9 (32-bit). >> > > Does revision 6446 still work? The vonmises distribution hasn't been touched > in a long time. > > >> Now (revision 6476), I can still build everything, but when >> I try to import scipy.stats.vonmises_cython, I get the following >> unresolved symbol: >> ImportError: >> /home/tester/master/lib/python2.6/site-packages/scipy/linalg/clapack.so: >> undefined symbol: clapack_sgesv >> >> Strangely, I don't get this import error on any platform (64-bit/32-bit, >> Windows, MaxOSX, Redhat 5, Solaris). >> >> Does anyone know what could be going on here? >> > > Chuck > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From d.l.goldsmith at gmail.com Wed Jun 2 23:07:07 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Wed, 2 Jun 2010 20:07:07 -0700 Subject: [SciPy-Dev] Warning of deprecation in doc's ? 
In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 7:22 PM, Vincent Davis wrote: > For example scipy.stats.stats.cov when you view source has > "scipy.stats.cov is deprecated; please update your code to use > numpy.cov." Should this be in the docs ? and is there an example of > how this should be pointed out. > This is something I actually implemented in a program then discovered > that is was deprecated. I would have like that to be in the online > docs. > > Thanks > Vincent > I vaguely recollect this being discussed before, but I can't find anything about it in our docstring Standard, in our Q+A section, nor (easily) at the Python site (generally, when in doubt, we default to Python docstring standards); so, how 'bout it guys and gals: should deprecation be noted in docstrings and if so, where and how? DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Wed Jun 2 23:11:04 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Wed, 2 Jun 2010 20:11:04 -0700 Subject: [SciPy-Dev] Summer Marathon Skypecon tomorrow In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 7:30 PM, Vincent Davis wrote: > On Thu, May 27, 2010 at 11:47 AM, David Goldsmith > wrote: > > So far, no one has RSVP-ed (positive or negative). Is this due to: > > > > A) Lack of time; > > B) Bad scheduling (i.e., you have time, just not at the time we've > chosen); > > C) Lack of interest; > > D) Lack of issues to discuss (i.e., you have interest, but not in a > meeting > > without a specific aganda); > > E) Bad choice of conference media (i.e., can't/won't do Skype); > > F) Just forgot to RSVP; > > G) Some of the above; > > H) Other/None of the above? > > > > If I haven't heard from anyone, in the positive, by midnight tonight, > EDT, > > this week's Skypecon is canceled. > > > > I thought it "would" happen and hoped to be able to listen, (Unlikely > I have anything to contribute) I was unsure I would be available so > didn't rsvp. It might be nice to record the skype call and make it > available as a sudo podcast. It might be nice for those that miss it > and might be a nice weekly update on scipy, this I could possibly help > with. > > Thanks > Vincent > I'd certainly prefer that to taking notes and publishing minutes! ;-) (Though I'll probably do the latter anyway for people who don't want to listen to the whole thing.) Who besides Vincent would participate this week? DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Jun 2 23:19:10 2010 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 2 Jun 2010 21:19:10 -0600 Subject: [SciPy-Dev] import error in scipy.stats on RH3 32-bit In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 9:03 PM, Ilan Schnell wrote: > Hello Chuck, > yes 6446 works. Actually, as the error indicates, the unresolved > symbol in is linalg/clapack.so, it just happened that during my > testing the stats package was imported first, so I initially thought > the error was there. > However, something has changed between 6446 and 6476, as > I wasn't seeing this error before. Looking at the revision log of > the 0.8.x branch, but I cannot see any obvious. And I'm also > puzzled why this only happens on one particular platform. > To make sure the build environment hasn't changed, I rebuild 6446 > on the same system, and it still works. > > I hate to ask this of anyone, but... could you determine which revision caused the problem? 
Sadistical Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Wed Jun 2 23:28:40 2010 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 2 Jun 2010 22:28:40 -0500 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: Message-ID: As a power user of these tools, I often will encounter these warnings while bulding my code piece-wise, however, I can easily imagine a case where a regular user simply seeing a useful feature and spending time coding around it, only to discover that it will soon be deprecated. I would certainly be annoyed in such a case. A quick and easy way to list deprecations would be towards the end of the docstring, but the user might not scroll all the way down past the feature that they found. So, to raise visibility, such deprecation warnings should be towards the beginning of the docstring. Just a thought... is it feasible for the doc building system to scan through the function code and spot a deprecation warning and thereby be able to add a list of deprecation warnings to the docstring? Obviously, such warnings would have to follow some standard format, but it would be neat if such things could be automated. Just my 2 cents, Ben Root On Wed, Jun 2, 2010 at 10:07 PM, David Goldsmith wrote: > On Wed, Jun 2, 2010 at 7:22 PM, Vincent Davis wrote: > >> For example scipy.stats.stats.cov when you view source has >> "scipy.stats.cov is deprecated; please update your code to use >> numpy.cov." Should this be in the docs ? and is there an example of >> how this should be pointed out. >> This is something I actually implemented in a program then discovered >> that is was deprecated. I would have like that to be in the online >> docs. >> >> Thanks >> Vincent >> > > I vaguely recollect this being discussed before, but I can't find anything > about it in our docstring Standard, in our Q+A section, nor (easily) at the > Python site (generally, when in doubt, we default to Python docstring > standards); so, how 'bout it guys and gals: should deprecation be noted in > docstrings and if so, where and how? > > DG > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ilanschnell at gmail.com Wed Jun 2 23:29:51 2010 From: ilanschnell at gmail.com (Ilan Schnell) Date: Wed, 2 Jun 2010 22:29:51 -0500 Subject: [SciPy-Dev] import error in scipy.stats on RH3 32-bit In-Reply-To: References: Message-ID: Not yet. I'll look more into it tomorrow. :-) - Ilan On Wed, Jun 2, 2010 at 10:19 PM, Charles R Harris wrote: > > > On Wed, Jun 2, 2010 at 9:03 PM, Ilan Schnell wrote: >> >> Hello Chuck, >> yes 6446 works. ?Actually, as the error indicates, the unresolved >> symbol in is linalg/clapack.so, it just happened that during my >> testing the stats package was imported first, so I initially thought >> the error was there. >> However, something has changed between 6446 and 6476, as >> I wasn't seeing this error before. ?Looking at the revision log of >> the 0.8.x branch, but I cannot see any obvious. ?And I'm also >> puzzled why this only happens on one particular platform. >> To make sure the build environment hasn't changed, I rebuild 6446 >> on the same system, and it still works. >> > > I hate to ask this of anyone, but... could you determine which revision > caused the problem? 
> > Sadistical Chuck > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From d.l.goldsmith at gmail.com Thu Jun 3 00:05:26 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Wed, 2 Jun 2010 21:05:26 -0700 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 8:28 PM, Benjamin Root wrote: > As a power user of these tools, I often will encounter these warnings while > bulding my code piece-wise, however, I can easily imagine a case where a > regular user simply seeing a useful feature and spending time coding around > it, only to discover that it will soon be deprecated. I would certainly be > annoyed in such a case. > > A quick and easy way to list deprecations would be towards the end of the > docstring, but the user might not scroll all the way down past the feature > that they found. So, to raise visibility, such deprecation warnings should > be towards the beginning of the docstring. > > Just a thought... is it feasible for the doc building system to scan > through the function code and spot a deprecation warning and thereby be able > to add a list of deprecation warnings to the docstring? Obviously, such > warnings would have to follow some standard format, but it would be neat if > such things could be automated. > > Just my 2 cents, > Ben Root > pydocweb (our doc editing Wiki) does do something like that in that it automatically prepends the function signature to the docstring (at least I think it's pydocweb that's doing it), so I think it's possible in principle. code.google.com/p/pydocweb hosts a ticketing system (the "Issues" tab) - may I ask you to go there and file an "enhancement" ticket for this - the worst that can happen is that someone (probably Pauli V.) will mark it as "will not do" with some sort of explanation as to why. That said, pydocweb has a long backlog of open issues, and this is not the highest priority among them. Accordingly, we probably shouldn't wait for it to solve our problem, i.e., we should still decide on where and how to note this, and do it manually when we encounter the situation. So, so far we have one "vote" for "yes, near the beginning." :-) DG > > On Wed, Jun 2, 2010 at 10:07 PM, David Goldsmith wrote: > >> On Wed, Jun 2, 2010 at 7:22 PM, Vincent Davis wrote: >> >>> For example scipy.stats.stats.cov when you view source has >>> "scipy.stats.cov is deprecated; please update your code to use >>> numpy.cov." Should this be in the docs ? and is there an example of >>> how this should be pointed out. >>> This is something I actually implemented in a program then discovered >>> that is was deprecated. I would have like that to be in the online >>> docs. >>> >>> Thanks >>> Vincent >>> >> >> I vaguely recollect this being discussed before, but I can't find anything >> about it in our docstring Standard, in our Q+A section, nor (easily) at the >> Python site (generally, when in doubt, we default to Python docstring >> standards); so, how 'bout it guys and gals: should deprecation be noted in >> docstrings and if so, where and how? 
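One possible way such a note could look near the top of a docstring is Sphinx's deprecated directive paired with the runtime warning; the body below is an illustrative stand-in for a deprecated wrapper, not the actual scipy source:

    import warnings
    import numpy as np

    def cov(m, y=None, rowvar=1, bias=0):
        """Estimate a covariance matrix.

        .. deprecated:: 0.8
           scipy.stats.cov is deprecated; please update your code to use
           numpy.cov.
        """
        warnings.warn("scipy.stats.cov is deprecated; please update your "
                      "code to use numpy.cov.", DeprecationWarning)
        return np.cov(m, y, rowvar, bias)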
>> >> DG >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Thu Jun 3 00:17:51 2010 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 2 Jun 2010 23:17:51 -0500 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 11:05 PM, David Goldsmith wrote: > On Wed, Jun 2, 2010 at 8:28 PM, Benjamin Root wrote: > >> As a power user of these tools, I often will encounter these warnings >> while bulding my code piece-wise, however, I can easily imagine a case where >> a regular user simply seeing a useful feature and spending time coding >> around it, only to discover that it will soon be deprecated. I would >> certainly be annoyed in such a case. >> >> A quick and easy way to list deprecations would be towards the end of the >> docstring, but the user might not scroll all the way down past the feature >> that they found. So, to raise visibility, such deprecation warnings should >> be towards the beginning of the docstring. >> >> Just a thought... is it feasible for the doc building system to scan >> through the function code and spot a deprecation warning and thereby be able >> to add a list of deprecation warnings to the docstring? Obviously, such >> warnings would have to follow some standard format, but it would be neat if >> such things could be automated. >> >> Just my 2 cents, >> Ben Root >> > > pydocweb (our doc editing Wiki) does do something like that in that it > automatically prepends the function signature to the docstring (at least I > think it's pydocweb that's doing it), so I think it's possible in > principle. code.google.com/p/pydocweb hosts a ticketing system (the > "Issues" tab) - may I ask you to go there and file an "enhancement" ticket > for this - the worst that can happen is that someone (probably Pauli V.) > will mark it as "will not do" with some sort of explanation as to why. > > That said, pydocweb has a long backlog of open issues, and this is not the > highest priority among them. Accordingly, we probably shouldn't wait for it > to solve our problem, i.e., we should still decide on where and how to note > this, and do it manually when we encounter the situation. So, so far we > have one "vote" for "yes, near the beginning." :-) > > I will look into that tomorrow. And I certainly agree that we should not wait until pydocweb presents us a solution. We should certainly follow some sort of standard way to mark/tag/denote these deprecation warnings, that way 'grep' can still be a very valuable tool here. Ben Root > DG > > > >> >> On Wed, Jun 2, 2010 at 10:07 PM, David Goldsmith > > wrote: >> >>> On Wed, Jun 2, 2010 at 7:22 PM, Vincent Davis wrote: >>> >>>> For example scipy.stats.stats.cov when you view source has >>>> "scipy.stats.cov is deprecated; please update your code to use >>>> numpy.cov." Should this be in the docs ? 
and is there an example of >>>> how this should be pointed out. >>>> This is something I actually implemented in a program then discovered >>>> that is was deprecated. I would have like that to be in the online >>>> docs. >>>> >>>> Thanks >>>> Vincent >>>> >>> >>> I vaguely recollect this being discussed before, but I can't find >>> anything about it in our docstring Standard, in our Q+A section, nor >>> (easily) at the Python site (generally, when in doubt, we default to Python >>> docstring standards); so, how 'bout it guys and gals: should deprecation be >>> noted in docstrings and if so, where and how? >>> >>> DG >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> > > > -- > Mathematician: noun, someone who disavows certainty when their uncertainty > set is non-empty, even if that set has measure zero. > > Hope: noun, that delusive spirit which escaped Pandora's jar and, with her > lies, prevents mankind from committing a general suicide. (As interpreted > by Robert Graves) > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From vincent at vincentdavis.net Thu Jun 3 00:32:26 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Wed, 2 Jun 2010 22:32:26 -0600 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 10:05 PM, David Goldsmith wrote: > On Wed, Jun 2, 2010 at 8:28 PM, Benjamin Root wrote: >> >> As a power user of these tools, I often will encounter these warnings >> while bulding my code piece-wise, however, I can easily imagine a case where >> a regular user simply seeing a useful feature and spending time coding >> around it, only to discover that it will soon be deprecated.? I would >> certainly be annoyed in such a case. >> >> A quick and easy way to list deprecations would be towards the end of the >> docstring, but the user might not scroll all the way down past the feature >> that they found.? So, to raise visibility, such deprecation warnings should >> be towards the beginning of the docstring. ? So, so far we > have one "vote" for "yes, near the beginning." :-) > I vote near the beginning for the reasons Benjamin notes "the user might not scroll all the way down past the feature that they found" And including as much reference to the replacement as possible (a link to the doc?, function name......) Make it easy to find its replacement. Vincent > DG > > >> >> On Wed, Jun 2, 2010 at 10:07 PM, David Goldsmith >> wrote: >>> >>> On Wed, Jun 2, 2010 at 7:22 PM, Vincent Davis >>> wrote: >>>> >>>> For example scipy.stats.stats.cov when you view source has >>>> "scipy.stats.cov is deprecated; please update your code to use >>>> numpy.cov." Should this be in the docs ? and is there an example of >>>> how this should be pointed out. >>>> This is something I actually implemented in a program then discovered >>>> that is was deprecated. I would have like that to be in the online >>>> docs. 
>>>> >>>> Thanks >>>> Vincent >>> >>> I vaguely recollect this being discussed before, but I can't find >>> anything about it in our docstring Standard, in our Q+A section, nor >>> (easily) at the Python site (generally, when in doubt, we default to Python >>> docstring standards); so, how 'bout it guys and gals: should deprecation be >>> noted in docstrings and if so, where and how? >>> >>> DG >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > > > > -- > Mathematician: noun, someone who disavows certainty when their uncertainty > set is non-empty, even if that set has measure zero. > > Hope: noun, that delusive spirit which escaped Pandora's jar and, with her > lies, prevents mankind from committing a general suicide. ?(As interpreted > by Robert Graves) > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From vincent at vincentdavis.net Thu Jun 3 00:34:40 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Wed, 2 Jun 2010 22:34:40 -0600 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: Message-ID: On Wed, Jun 2, 2010 at 10:17 PM, Benjamin Root wrote: > I will look into that tomorrow.? And I certainly agree that we should not > wait until pydocweb presents us a solution.? We should certainly follow some > sort of standard way to mark/tag/denote these deprecation warnings, that way > 'grep' can still be a very valuable tool here. Also when it will be deprecated would be good to know/document. Thanks Vincent > > Ben Root > > >> >> DG >> >> >>> >>> On Wed, Jun 2, 2010 at 10:07 PM, David Goldsmith >>> wrote: >>>> >>>> On Wed, Jun 2, 2010 at 7:22 PM, Vincent Davis >>>> wrote: >>>>> >>>>> For example scipy.stats.stats.cov when you view source has >>>>> "scipy.stats.cov is deprecated; please update your code to use >>>>> numpy.cov." Should this be in the docs ? and is there an example of >>>>> how this should be pointed out. >>>>> This is something I actually implemented in a program then discovered >>>>> that is was deprecated. I would have like that to be in the online >>>>> docs. >>>>> >>>>> Thanks >>>>> Vincent >>>> >>>> I vaguely recollect this being discussed before, but I can't find >>>> anything about it in our docstring Standard, in our Q+A section, nor >>>> (easily) at the Python site (generally, when in doubt, we default to Python >>>> docstring standards); so, how 'bout it guys and gals: should deprecation be >>>> noted in docstrings and if so, where and how? >>>> >>>> DG >>>> >>>> >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>>> >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >> >> >> >> -- >> Mathematician: noun, someone who disavows certainty when their uncertainty >> set is non-empty, even if that set has measure zero. >> >> Hope: noun, that delusive spirit which escaped Pandora's jar and, with her >> lies, prevents mankind from committing a general suicide. 
?(As interpreted >> by Robert Graves) >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From nmb at wartburg.edu Thu Jun 3 00:47:50 2010 From: nmb at wartburg.edu (Neil Martinsen-Burrell) Date: Wed, 02 Jun 2010 23:47:50 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C06B8FB.8080806@gmail.com> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> Message-ID: <4C0733F6.7040608@wartburg.edu> On 2010-06-02 15:03 , Bruce Southey wrote: > On 06/02/2010 01:41 PM, josef.pktd at gmail.com wrote: >> On Wed, Jun 2, 2010 at 2:18 PM, Neil Martinsen-Burrell wrote: >> >>> On 2010-06-02 13:10 , Bruce Southey wrote: [...] >> I agree with Neil that this is a very useful convenience function. >> > My problem with the chisquare_twoway is that it should not call another > function to finish two lines of code. It is just an excessive waste of > resources. Do you mean that you would rather see the equivalent of chisq = (table - expected)**2 / expected return chisq, chisqprob(chisq, dof) at the bottom of chisquare_contingency than the current call to chisquare? I'm certainly okay with that. >> I never heard of a one-way contingency table, my question was whether >> the function should also handle 3-way or 4-way tables, additional to >> two-way. >> > Correct to both of these as I just consider these as n-way tables. I > think that contingency tables by definition only applies to the 2-d > case. Pivot tables are essentially the same thing. I would have to > lookup on how to get the expected number of outcomes but probably of the > form Ni.. * N.j. *N..k/N... for the 3-way (the 2-way table is of the > form Ni.*N.j/N..) for i=rows, j=columns, k=3rd axis and '.' means sum > for that axis. That is the correct (tensor) formula for higher dimensional tables. Pragmatically, since the number of cells climbs so rapidly with increasing dimension, there are more problems with small expected counts. If we thought people would be interested in using it, we could certainly define a chisquare_nway function as well. >> I thought about the question how the input should be specified for my >> initial response, the alternative would be to use the original data or >> a "long" format instead of a table. But I thought that as a >> convenience function using the table format will be the most common >> use. >> I have written in the past functions that calculate the contingency >> table, and would be very useful to have a more complete coverage of >> tools to work with contingency tables in scipy.stats (or temporarily >> in statsmodels, where we are working also on the anova type of >> analysis) >> > It depends on what tasks are needed. Really there are two steps: > 1) Cross-tabulation that summarized the data from whatever input > (groupby would help here). > 2) Statistical tests - series of functions that accept summarized data only. > > If you have separate functions then the burden is on the user to find > and call all the desired functions. You can also provide a single helper > function to do all that because you don't want to repeat unnecessary calls. 
The facilities for handling raw, frame-style data in scipy.stats are not too strong. A tabulation function that we could stick together with the chisquare* functions to make a single helper would certainly be convenient. -Neil From d.l.goldsmith at gmail.com Thu Jun 3 01:04:27 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Wed, 2 Jun 2010 22:04:27 -0700 Subject: [SciPy-Dev] {True, False} should be replaced w/ bool, correct? Message-ID: Just checking; see, e.g., scipy .io .matlab .mio.savemat appendmat parameter. (Or is it possible that the function really needs to see either the word True or the word False?) DG -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Jun 3 02:09:56 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 02:09:56 -0400 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C0733F6.7040608@wartburg.edu> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> <4C0733F6.7040608@wartburg.edu> Message-ID: On Thu, Jun 3, 2010 at 12:47 AM, Neil Martinsen-Burrell wrote: > On 2010-06-02 15:03 , Bruce Southey wrote: >> On 06/02/2010 01:41 PM, josef.pktd at gmail.com wrote: >>> On Wed, Jun 2, 2010 at 2:18 PM, Neil Martinsen-Burrell ?wrote: >>> >>>> On 2010-06-02 13:10 , Bruce Southey wrote: > > [...] > >>> I agree with Neil that this is a very useful convenience function. >>> >> My problem with the chisquare_twoway is that it should not call another >> function to finish two lines of code. It is just an excessive waste of >> resources. > > Do you mean that you would rather see the equivalent of > > chisq = (table - expected)**2 / expected > return chisq, chisqprob(chisq, dof) > > at the bottom of chisquare_contingency than the current call to > chisquare? ?I'm certainly okay with that. But don't forget to ravel or you get cell-wise chisquare :) For non-performance sensitive parts, as in this case I usually go by how easy the function is to understand and to test. for example I prefer distributions.chi2.sf(chisq, dof) to chisqprob(chisq, dof) (I haven't checked if it is correct because I immediately see that it is a one-sided pvalue. inlining in this case might be nicer because of dof (when inlining) versus ddof (when calling chisquare), I found the ddof confusing to read related: while I was skimming Bruce's reference http://faculty.vassar.edu/lowry/ch8pt2.html I saw that they recommend continuity correction for the 2by2 case. Do you know what the common position on continuity correction is in this case? (In something vaguely related to this, I read recently that some continuity correction make the test too conservative and are not recommended. But I don't remember for which test I read this.) If there is test specific continuity correction, then chisquare will have to be inlined. > >>> I never heard of a one-way contingency table, my question was whether >>> the function should also handle 3-way or 4-way tables, additional to >>> two-way. 
>>> >> Correct to both of these as I just consider these as n-way tables. I >> think that contingency tables by definition only applies to the 2-d >> case. Pivot tables are essentially the same thing. I would have to >> lookup on how to get the expected number of outcomes but probably of the >> form Ni.. * N.j. *N..k/N... for the 3-way (the 2-way table is of the >> form Ni.*N.j/N..) for i=rows, j=columns, k=3rd axis and '.' means sum >> for that axis. > > That is the correct (tensor) formula for higher dimensional tables. > Pragmatically, since the number of cells climbs so rapidly with > increasing dimension, there are more problems with small expected > counts. ?If we thought people would be interested in using it, we could > certainly define a chisquare_nway function as well. I'm not too happy about having a large number of small functions especially if they have code duplication and need to be separately maintained. When there is a demand for a convenient special case, then it could just call the more general function. For testing distribution, the common approach in the case when there are too few expected counts in some cells, is, to combine several cells together in one bin. I guess, there might be something like this also feasible for nway, i.e. coarsen the grid, or not? > >>> I thought about the question how the input should be specified for my >>> initial response, the alternative would be to use the original data or >>> a "long" format instead of a table. But I thought that as a >>> convenience function using the table format will be the most common >>> use. >>> I have written in the past functions that calculate the contingency >>> table, and would be very useful to have a more complete coverage of >>> tools to work with contingency tables in scipy.stats (or temporarily >>> in statsmodels, where we are working also on the anova type of >>> analysis) >>> >> It depends on what tasks are needed. Really there are two steps: >> 1) Cross-tabulation that summarized the data from whatever input >> (groupby would help here). >> 2) Statistical tests - series of functions that accept summarized data only. >> >> If you have separate functions then the burden is on the user to find >> and call all the desired functions. You can also provide a single helper >> function to do all that because you don't want to repeat unnecessary calls. > > The facilities for handling raw, frame-style data in scipy.stats are not > too strong. ?A tabulation function that we could stick together with the > chisquare* functions to make a single helper would certainly be convenient. Since broader coverage of contingency tables with all the data handling, bincount and table conversions would a much larger set of functions. I think our still evolving design for statistics (including test) in statsmodels is to move to a more object oriented design, to keep things together, and to take advantage of reusing previous calculations. In this case it could be a ContingencyTable class that could combine creating the countdata from raw data (with or without missing values), marginalization if it's 3-way or higher, attach several tests, create a nice string that can be printed, and so on. With lazy evaluation and reuse of previous calculations, we think this would be a better design than only having standalone functions. grouping functions together: While statisticians might have a good overview of all the different test, I found the "laundry list" of functions in scipy.stats for a long time pretty confusing. 
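To make the Ni.. * N.j. * N..k / N... formula quoted above concrete, a rough sketch for an arbitrary number of axes could look like this (expected_nway is only an illustrative name, nothing like it exists in scipy yet):

import numpy as np

def expected_nway(table):
    # expected cell counts under mutual independence for an n-way table:
    # the outer product of the one-way margins divided by N**(ndim - 1)
    table = np.asarray(table, dtype=float)
    N = table.sum()
    expected = None
    for axis in range(table.ndim):
        margin = table
        # sum out every axis except `axis`, highest axis first so that the
        # remaining axis numbers stay valid
        for other in reversed([a for a in range(table.ndim) if a != axis]):
            margin = margin.sum(axis=other)
        expected = margin if expected is None else np.multiply.outer(expected, margin)
    return expected / N ** (table.ndim - 1)

For a two-way table this reduces to np.outer(rowsums, colsums) / N, i.e. the Ni.*N.j/N.. case.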
Instead of having group of functions fisherexact, chisquare_twoway, chisquare_nway, and several other possible candidates for independence tests in contingency tables, we are starting to combine them together, e.g independence_tests, mean_tests, variance_tests and correlation_test We were discussing this in statsmodels in a different context, mainly diagnostic tests for regression, e.g. heteroscedasticity, autocorrelation tests or more recently post-hoc tests. In the current case, I also thought that combining with a fisherexact or other tests would potentially be useful, with a keyword argument that selects "chisquare", "exact", "..." Which is in this case not yet relevant because fisherexact, even when it works, is only for 2by2, and I don't think mixing them together is very useful. Josef > -Neil > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From stefan at sun.ac.za Thu Jun 3 02:29:30 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Jun 2010 23:29:30 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On 2 June 2010 17:11, Vincent Davis wrote: > I just setup an account. vincentdavis Thanks, Vincent. I gave you editing permission. Guidelines are accessible from the front page, let me know if you get stuck. Regards St?fan From stefan at sun.ac.za Thu Jun 3 02:33:39 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Jun 2010 23:33:39 -0700 Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? In-Reply-To: References: Message-ID: On 2 June 2010 19:09, Vincent Davis wrote: > As I am always interested in learning new things is there any help I > can offer in getting the wiki review feature implemented? Sure, have a look at: http://code.google.com/p/pydocweb/ There are many issues that require attention, and all help is appreciated. Regards St?fan From josef.pktd at gmail.com Thu Jun 3 02:39:01 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 02:39:01 -0400 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> <4C0733F6.7040608@wartburg.edu> Message-ID: On Thu, Jun 3, 2010 at 2:09 AM, wrote: > On Thu, Jun 3, 2010 at 12:47 AM, Neil Martinsen-Burrell > wrote: >> On 2010-06-02 15:03 , Bruce Southey wrote: >>> On 06/02/2010 01:41 PM, josef.pktd at gmail.com wrote: >>>> On Wed, Jun 2, 2010 at 2:18 PM, Neil Martinsen-Burrell ?wrote: >>>> >>>>> On 2010-06-02 13:10 , Bruce Southey wrote: >> >> [...] >> >>>> I agree with Neil that this is a very useful convenience function. >>>> >>> My problem with the chisquare_twoway is that it should not call another >>> function to finish two lines of code. It is just an excessive waste of >>> resources. >> >> Do you mean that you would rather see the equivalent of >> >> chisq = (table - expected)**2 / expected >> return chisq, chisqprob(chisq, dof) >> >> at the bottom of chisquare_contingency than the current call to >> chisquare? ?I'm certainly okay with that. 
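Spelled out, the two-liner quoted above amounts to something like this (only a sketch; the name chisquare_contingency and the exact return values are precisely what is being discussed in this thread):

import numpy as np
from scipy import stats

def chisquare_contingency(table):
    # R x C test of independence: expected counts from the row/column
    # margins, chi-square statistic summed over all cells
    table = np.asarray(table, dtype=float)
    N = table.sum()
    expected = np.outer(table.sum(axis=1), table.sum(axis=0)) / N
    chisq = ((table - expected) ** 2 / expected).sum()
    dof = (table.shape[0] - 1) * (table.shape[1] - 1)
    return chisq, stats.chi2.sf(chisq, dof)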
> > But don't forget to ravel or you get cell-wise chisquare :) > For non-performance sensitive parts, as in this case I usually go by > how easy the function is to understand and to test. > for example I prefer distributions.chi2.sf(chisq, dof) to > chisqprob(chisq, dof) (I haven't checked if it is correct because I > immediately see that it is a one-sided pvalue. > > inlining in this case might be nicer because of dof (when inlining) > versus ddof (when calling chisquare), I found the ddof confusing to > read > > related: while I was skimming Bruce's reference > http://faculty.vassar.edu/lowry/ch8pt2.html > I saw that they recommend continuity correction for the 2by2 case. > Do you know what the common position on continuity correction is in this case? > > (In something vaguely related to this, I read recently that some > continuity correction make the test too conservative and are not > recommended. But I don't remember for which test I read this.) It actually is for chisquare http://en.wikipedia.org/wiki/Yates%27_correction_for_continuity Josef > > If there is test specific continuity correction, then chisquare will > have to be inlined. > >> >>>> I never heard of a one-way contingency table, my question was whether >>>> the function should also handle 3-way or 4-way tables, additional to >>>> two-way. >>>> >>> Correct to both of these as I just consider these as n-way tables. I >>> think that contingency tables by definition only applies to the 2-d >>> case. Pivot tables are essentially the same thing. I would have to >>> lookup on how to get the expected number of outcomes but probably of the >>> form Ni.. * N.j. *N..k/N... for the 3-way (the 2-way table is of the >>> form Ni.*N.j/N..) for i=rows, j=columns, k=3rd axis and '.' means sum >>> for that axis. >> >> That is the correct (tensor) formula for higher dimensional tables. >> Pragmatically, since the number of cells climbs so rapidly with >> increasing dimension, there are more problems with small expected >> counts. ?If we thought people would be interested in using it, we could >> certainly define a chisquare_nway function as well. > > I'm not too happy about having a large number of small functions > especially if they have code duplication and need to be separately > maintained. > When there is a demand for a convenient special case, then it could > just call the more general function. > > For testing distribution, the common approach in the case when there > are too few expected counts in some cells, is, to combine several > cells together in one bin. > I guess, there might be something like this also feasible for nway, > i.e. coarsen the grid, or not? > >> >>>> I thought about the question how the input should be specified for my >>>> initial response, the alternative would be to use the original data or >>>> a "long" format instead of a table. But I thought that as a >>>> convenience function using the table format will be the most common >>>> use. >>>> I have written in the past functions that calculate the contingency >>>> table, and would be very useful to have a more complete coverage of >>>> tools to work with contingency tables in scipy.stats (or temporarily >>>> in statsmodels, where we are working also on the anova type of >>>> analysis) >>>> >>> It depends on what tasks are needed. Really there are two steps: >>> 1) Cross-tabulation that summarized the data from whatever input >>> (groupby would help here). >>> 2) Statistical tests - series of functions that accept summarized data only. 
>>> >>> If you have separate functions then the burden is on the user to find >>> and call all the desired functions. You can also provide a single helper >>> function to do all that because you don't want to repeat unnecessary calls. >> >> The facilities for handling raw, frame-style data in scipy.stats are not >> too strong. ?A tabulation function that we could stick together with the >> chisquare* functions to make a single helper would certainly be convenient. > > Since broader coverage of contingency tables with all the data > handling, bincount and table conversions would a much larger set of > functions. > > I think our still evolving design for statistics (including test) in > statsmodels is to move to a more object oriented design, to keep > things together, and to take advantage of reusing previous > calculations. > > In this case it could be a ContingencyTable class that could combine > creating the countdata from raw data (with or without missing values), > marginalization if it's 3-way or higher, attach several tests, create > a nice string that can be printed, and so on. With lazy evaluation and > reuse of previous calculations, we think this would be a better design > than only having standalone functions. > > grouping functions together: > While statisticians might have a good overview of all the different > test, I found the "laundry list" of functions in scipy.stats for a > long time pretty confusing. > Instead of having group of functions fisherexact, chisquare_twoway, > chisquare_nway, and several other possible candidates for independence > tests in contingency tables, we are starting to combine them together, > e.g independence_tests, mean_tests, variance_tests and > correlation_test > > We were discussing this in statsmodels in a different context, mainly > diagnostic tests for regression, e.g. heteroscedasticity, > autocorrelation tests or more recently post-hoc tests. > > In the current case, I also thought that combining with a fisherexact > or other tests would potentially be useful, with a keyword argument > that selects "chisquare", "exact", "..." > Which is in this case not yet relevant because fisherexact, even when > it works, is only for 2by2, and I don't think mixing them together is > very useful. > > Josef > > > >> -Neil >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > From josef.pktd at gmail.com Thu Jun 3 02:48:12 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 02:48:12 -0400 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C06B8FB.8080806@gmail.com> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> Message-ID: On Wed, Jun 2, 2010 at 4:03 PM, Bruce Southey wrote: > On 06/02/2010 01:41 PM, josef.pktd at gmail.com wrote: > > On Wed, Jun 2, 2010 at 2:18 PM, Neil Martinsen-Burrell > wrote: > > > On 2010-06-02 13:10 , Bruce Southey wrote: > [...] > > > > However, this code is the chi-squared test part as SAS will compute the > actual cell numbers. Also an extension to scipy.stats.chisquare() so we > can not have both functions. > > > Again, I don't understand what you mean that we can't have both > functions? 
I believe (from a statistics teacher's point of view) that > the Chi-Squared goodness of fit test (which is stats.chisquare) is a > different beast from the Chi-Square test for independence (which is > stats.chisquare_contingency). The fact that the distribution of the > test statistic is the same should not tempt us to put them into the > same function. > > > Please read scipy.stats.chisquare() because scipy.stats.chisquare() is > the 1-d case of yours. > Quote from the docstring: > " The chi square test tests the null hypothesis that the categorical data > has the given frequencies." > Also go the web site provided in the docstring. > > By default you get the expected frequencies but you can also put in your > own using the f_exp variable. You could do the same in your code. > > > In fact, Warren correctly used stats.chisquare with the expected > frequencies calculated from the null hypothesis and the corrected > degrees of freedom. ?chisquare_contingency is in some sense a > convenience method for taking care of these pre-calculations before > calling stats.chisquare. ?Can you explain more clearly to me why we > should not include such a convenience function? > > > Just a clarification, before I find time to work my way through the > other comments > > stats.chisquare is a generic test for goodness-of-fit for discreted or > binned distributions. > and from the docstring of it > "If no expected frequencies are given, the total > N is assumed to be equally distributed across all groups." > > default is uniform distribution > > > > Try: > http://en.wikipedia.org/wiki/Pearson's_chi-square_test > > The use of the uniform distribution is rather misleading and technically > wrong as it does not help address the expected number of outcomes in a cell: quote from the wikipedia page: "A simple example is the hypothesis that an ordinary six-sided dice is "fair", i.e., all six outcomes are equally likely to occur." I don't see anything misleading or technically wrong with the uniform distributions, or if they come from a Poisson, Hypergeometric, binned Normal or any of number of other distributions. > http://en.wikipedia.org/wiki/Discrete_uniform_distribution > > > chisquare_twoway is a special case that additional calculates the > correct expected frequencies for the test of independencs based on the > margin totals. The resulting distribution is not uniform. > > > Actually the null hypothesis is rather different between 1-way and 2-way > tables so you can not say that chisquare_twoway is a special case of > chisquare. What is the Null hypothesis in a one-way table? Josef > > I am not sure what you mean by the 'resulting distribution is not uniform'. > The distribution of the cells values has nothing to do with the uniform > distribution in either case because it is not used in the data nor in the > formulation of the test. (And, yes, I have had to do the proof that the test > statistic is Chi-squared - which is why there is the warning about small > cells...). > > I agree with Neil that this is a very useful convenience function. > > > My problem with the chisquare_twoway is that it should not call another > function to finish two lines of code. It is just an excessive waste of > resources. > > I never heard of a one-way contingency table, my question was whether > the function should also handle 3-way or 4-way tables, additional to > two-way. > > > Correct to both of these as I just consider these as n-way tables. I think > that contingency tables by definition only applies to the 2-d case. 
Pivot > tables are essentially the same thing. I would have to lookup on how to get > the expected number of outcomes but probably of the form Ni.. * N.j. > *N..k/N... for the 3-way (the 2-way table is of the form Ni.*N.j/N..) for > i=rows, j=columns, k=3rd axis and '.' means sum for that axis. > > I thought about the question how the input should be specified for my > initial response, the alternative would be to use the original data or > a "long" format instead of a table. But I thought that as a > convenience function using the table format will be the most common > use. > > I have written in the past functions that calculate the contingency > table, and would be very useful to have a more complete coverage of > tools to work with contingency tables in scipy.stats (or temporarily > in statsmodels, where we are working also on the anova type of > analysis) > > > It depends on what tasks are needed.? Really there are two steps: > 1) Cross-tabulation that summarized the data from whatever input (groupby > would help here). > 2) Statistical tests - series of functions that accept summarized data only. > > If you have separate functions then the burden is on the user to find and > call all the desired functions. You can also provide a single helper > function to do all that because you don't want to repeat unnecessary calls. > > So, I think the way it is it is a nice function and we don't have to > put all contingency table analysis into this function. > > Josef > > > Bruce > > > > > > Really this should be combined with fisher.py in ticket 956: > http://projects.scipy.org/scipy/ticket/956 > > > Wow, apparently I have lots of disagreements today, but I don't think > that this should be combined with Fisher's Exact test. (I would like > to see that ticket mature to the point where it can be added to > scipy.stats.) I like the functions in scipy.stats to correspond in a > one-to-one manner with the statistical tests. I think that the docs > should "See Also" the appropriate exact (and non-parametric) tests, > but I think that one function/one test is a good rule. This is > particularly true for people (like me) who would like to someday be > able to use scipy.stats in a pedagogical context. > > -Neil > > > I don't see any 'disagreements' rather just different ways to do things > and identifying areas that need to be addressed for more general use. > > > Agreed. :) > > [...] > > -Neil > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From ralf.gommers at googlemail.com Thu Jun 3 06:40:08 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Thu, 3 Jun 2010 18:40:08 +0800 Subject: [SciPy-Dev] {True, False} should be replaced w/ bool, correct? In-Reply-To: References: Message-ID: On Thu, Jun 3, 2010 at 1:04 PM, David Goldsmith wrote: > Just checking; see, e.g., scipy . > io .matlab > .mio .savemat > appendmat parameter. (Or is it possible that the function really needs to > see either the word True or the word False?) > Correct, {True, False} should always be changed to bool in the docs. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From warren.weckesser at enthought.com Thu Jun 3 08:50:53 2010 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Thu, 03 Jun 2010 07:50:53 -0500 Subject: [SciPy-Dev] Deprecate stats.glm? Message-ID: <4C07A52D.30503@enthought.com> stats.glm looks like it was started and then abandoned without being finished. It was last touched in November 2007. Should this function be deprecated so it can eventually be removed? Warren From warren.weckesser at enthought.com Thu Jun 3 09:27:29 2010 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Thu, 03 Jun 2010 08:27:29 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> Message-ID: <4C07ADC1.6040504@enthought.com> Just letting you know that I'm not ignoring all the great comments from josef, Neil and Bruce about my suggestion for chisquare_contingency. Unfortunately, I won't have time to think about all the deeper suggestions for another week or so. For now, I'll just say that I agree with josef's and Neil's suggestions for the docstring, and that Neil's summary of the function as simply a convenience function that calls stats.chisquare with appropriate arguments to perform a test of independence on a contingency table is exactly what I had in mind. Warren josef.pktd at gmail.com wrote: > On Wed, Jun 2, 2010 at 4:03 PM, Bruce Southey wrote: > >> On 06/02/2010 01:41 PM, josef.pktd at gmail.com wrote: >> >> On Wed, Jun 2, 2010 at 2:18 PM, Neil Martinsen-Burrell >> wrote: >> >> >> On 2010-06-02 13:10 , Bruce Southey wrote: >> [...] >> >> >> >> However, this code is the chi-squared test part as SAS will compute the >> actual cell numbers. Also an extension to scipy.stats.chisquare() so we >> can not have both functions. >> >> >> Again, I don't understand what you mean that we can't have both >> functions? I believe (from a statistics teacher's point of view) that >> the Chi-Squared goodness of fit test (which is stats.chisquare) is a >> different beast from the Chi-Square test for independence (which is >> stats.chisquare_contingency). The fact that the distribution of the >> test statistic is the same should not tempt us to put them into the >> same function. >> >> >> Please read scipy.stats.chisquare() because scipy.stats.chisquare() is >> the 1-d case of yours. >> Quote from the docstring: >> " The chi square test tests the null hypothesis that the categorical data >> has the given frequencies." >> Also go the web site provided in the docstring. >> >> By default you get the expected frequencies but you can also put in your >> own using the f_exp variable. You could do the same in your code. >> >> >> In fact, Warren correctly used stats.chisquare with the expected >> frequencies calculated from the null hypothesis and the corrected >> degrees of freedom. chisquare_contingency is in some sense a >> convenience method for taking care of these pre-calculations before >> calling stats.chisquare. Can you explain more clearly to me why we >> should not include such a convenience function? >> >> >> Just a clarification, before I find time to work my way through the >> other comments >> >> stats.chisquare is a generic test for goodness-of-fit for discreted or >> binned distributions. 
>> and from the docstring of it >> "If no expected frequencies are given, the total >> N is assumed to be equally distributed across all groups." >> >> default is uniform distribution >> >> >> >> Try: >> http://en.wikipedia.org/wiki/Pearson's_chi-square_test >> >> The use of the uniform distribution is rather misleading and technically >> wrong as it does not help address the expected number of outcomes in a cell: >> > > quote from the wikipedia page: > "A simple example is the hypothesis that an ordinary six-sided dice is > "fair", i.e., all six outcomes are equally likely to occur." > > I don't see anything misleading or technically wrong with the uniform > distributions, > or if they come from a Poisson, Hypergeometric, binned Normal or any > of number of other distributions. > > > >> http://en.wikipedia.org/wiki/Discrete_uniform_distribution >> >> >> chisquare_twoway is a special case that additional calculates the >> correct expected frequencies for the test of independencs based on the >> margin totals. The resulting distribution is not uniform. >> >> >> Actually the null hypothesis is rather different between 1-way and 2-way >> tables so you can not say that chisquare_twoway is a special case of >> chisquare. >> > > What is the Null hypothesis in a one-way table? > > Josef > > >> I am not sure what you mean by the 'resulting distribution is not uniform'. >> The distribution of the cells values has nothing to do with the uniform >> distribution in either case because it is not used in the data nor in the >> formulation of the test. (And, yes, I have had to do the proof that the test >> statistic is Chi-squared - which is why there is the warning about small >> cells...). >> >> I agree with Neil that this is a very useful convenience function. >> >> >> My problem with the chisquare_twoway is that it should not call another >> function to finish two lines of code. It is just an excessive waste of >> resources. >> >> I never heard of a one-way contingency table, my question was whether >> the function should also handle 3-way or 4-way tables, additional to >> two-way. >> >> >> Correct to both of these as I just consider these as n-way tables. I think >> that contingency tables by definition only applies to the 2-d case. Pivot >> tables are essentially the same thing. I would have to lookup on how to get >> the expected number of outcomes but probably of the form Ni.. * N.j. >> *N..k/N... for the 3-way (the 2-way table is of the form Ni.*N.j/N..) for >> i=rows, j=columns, k=3rd axis and '.' means sum for that axis. >> >> I thought about the question how the input should be specified for my >> initial response, the alternative would be to use the original data or >> a "long" format instead of a table. But I thought that as a >> convenience function using the table format will be the most common >> use. >> >> I have written in the past functions that calculate the contingency >> table, and would be very useful to have a more complete coverage of >> tools to work with contingency tables in scipy.stats (or temporarily >> in statsmodels, where we are working also on the anova type of >> analysis) >> >> >> It depends on what tasks are needed. Really there are two steps: >> 1) Cross-tabulation that summarized the data from whatever input (groupby >> would help here). >> 2) Statistical tests - series of functions that accept summarized data only. >> >> If you have separate functions then the burden is on the user to find and >> call all the desired functions. 
You can also provide a single helper >> function to do all that because you don't want to repeat unnecessary calls. >> >> So, I think the way it is it is a nice function and we don't have to >> put all contingency table analysis into this function. >> >> Josef >> >> >> Bruce >> >> >> >> >> >> Really this should be combined with fisher.py in ticket 956: >> http://projects.scipy.org/scipy/ticket/956 >> >> >> Wow, apparently I have lots of disagreements today, but I don't think >> that this should be combined with Fisher's Exact test. (I would like >> to see that ticket mature to the point where it can be added to >> scipy.stats.) I like the functions in scipy.stats to correspond in a >> one-to-one manner with the statistical tests. I think that the docs >> should "See Also" the appropriate exact (and non-parametric) tests, >> but I think that one function/one test is a good rule. This is >> particularly true for people (like me) who would like to someday be >> able to use scipy.stats in a pedagogical context. >> >> -Neil >> >> >> I don't see any 'disagreements' rather just different ways to do things >> and identifying areas that need to be addressed for more general use. >> >> >> Agreed. :) >> >> [...] >> >> -Neil >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From bsouthey at gmail.com Thu Jun 3 09:27:26 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Thu, 03 Jun 2010 08:27:26 -0500 Subject: [SciPy-Dev] np.savetxt: apply patch in enhancement ticket 1079 to add headers? In-Reply-To: References: <4C066DA3.8010609@gmail.com> Message-ID: <4C07ADBE.3050407@gmail.com> On 06/02/2010 12:14 PM, Stefan wrote: > >> Not that I am complaining rather trying to understand what is expected >> to happen. >> Under the patch, it is very much user beware. The header argument can >> be anything or nothing. There is no check for the contents or if the >> delimiter used is the same as the rest of the output. Further with the >> newline option there is no guarantee that the lines in the header will >> have the same line endings throughout the file. >> So what should a user be allowed to use as a header? >> You could write a whole program there or an explanation of the >> following output - which is very appealing. You could force a list of >> strings so that you print out newline.join(header) - okay not quite >> because it should include the comment argument. >> Should savetxt be restricted to something that loadtxt can read? >> This is potentially problematic if you want a header line. Although it >> could return the number of header lines. >> [savetxt should also be updated to allow bz2 as loadtxt handles those >> now - not that I have used it] >> >> >> >> >> Also note that since that patch was written, savetxt takes a user >> supplied newline keyword, so you can just append that to the header >> string. >> >> >> >> True, we were not aware of this, but this does not help much for the >> comment/header. 
>> >> >> >> Entered as ~3 months ago:http://projects.scipy.org/numpy/changeset/8180 >> Should this be forced to check for valid options for new lines? >> Otherwise you from this 'np.savetxt('junk.text', [1,2,3,4,5], >> newline='what')' you get: >> >> > 1.000000000000000000e+00what2.000000000000000000e+00what > 3.000000000000000000e+00what4.000000000000000000e+00 > what5.000000000000000000e+00what > >> Which is not going to be read back by loadtxt. >> >> >> >> As numpy.loadtxt has a default comment character ('#'), the same may be >> implemented for numpy.savetxt. In this case, numpy.savetxt would get two >> additional keywords (e.g. header, comment(character)), which bloats the >> interface, but potentially provides more safety. >> >> >> >> >> FWIW, I ended up rolling my own using the most recent pre-Python 3 >> changes for savetxt that accepts a list of names instead of one string >> or if the provided array has the attribute dtype.names (non-nested rec >> or structured arrays) it uses those. Whatever is done I think the >> support for structured arrays is nice, and I think having this >> functionality is a no-brainer. I need it quite often. >> >> >> >> Although, we have not been using record arrays too often, we see their >> advantages and agree that it should be possible to use them as you described >> it. >> We also thought about a solution, using the __str__ method for the 'header >> object'. In this vain, an arbitrary header class (including a plane string) >> providing an __str__ member may be handed to numpy.savetxt, >> which can use it to write the header. >> >> > > So let us briefly summarize whats on the table. It appears to us that > there are basically three open issues: > (1) a csv like header for savetxt written files (first line contains column > names) > (2) comments (introduced by comment character e.g. '#') at the beginning > of the file (preceding the data) > (3) the role of the 'newline' option > > As was noted, the patch (ticket 1079) enables both to write a csv like > header (1) and comment line(s) introduced by a comment character (e.g. '#'). > Nonetheless, this solution is quite unsatisfactory > in our opinion, because it may be error prone, > as the user is in charge of the entire formatting. Despite this, we think > that it should be up to the user what amount of information is to be put > at the top of the file, but the format should be checked as far as possible. > > Using either a string or a list/tuple of strings, as proposed by Bruce, > seems to be a reasonable possibility to implement the desired functionality. > Maybe two individual keywords ('header' and 'comment') should exist to > distinguish whether the the user requests case (1) or (2). As for loadtxt > the default comment character should be '#', but it may be changed by the > user. > > We think that savetxt should not be restricted to output, which can be read > by loadtxt. Although it should be possible to add commments to the output > file, so that it remains readable by loadtxt (without tweaking it > e.g. with the skiprows keyword). > > We agree that the newline keyword may cause inconsistencies in the file > (if ticket 1079 were applied), > and possibly strange behavior such as when newline='what' is specified. > Yet, this question does not only concern the header/comments. 
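As a rough illustration of the header/comment behaviour summarized above (this is only a sketch of the idea, not the actual ticket-1079 patch, and savetxt_with_header is a made-up name; the real change would live inside np.savetxt itself):

import numpy as np

def savetxt_with_header(fname, X, header=None, comment='# ', **kwargs):
    # write comment-prefixed header lines, then let np.savetxt write the
    # data; np.loadtxt can read the result back without skiprows
    X = np.asarray(X)
    if header is None and X.dtype.names:
        # column names from a structured/record array; writing the
        # structured array itself may still require an explicit fmt
        header = [' '.join(X.dtype.names)]
    elif isinstance(header, str):
        header = header.splitlines()
    fh = open(fname, 'w')
    try:
        for line in header or []:
            fh.write(comment + line + '\n')
        np.savetxt(fh, X, **kwargs)   # savetxt accepts an open file handle
    finally:
        fh.close()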
> > Stefan& Christian > > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > I am in agreement with what you suggest so post a patch. :-) Some of what I suggested was over thinking what can really be done and keep the function relatively simple and easy to use. My wish list would be that: 1) If the header is added that it allows names from structured/record arrays to be used and perhaps autogenerated (such as var1, var2, ..., varn). 2) That the dtype of the array_like input be used in the fmt when fmt is not provided. Bruce From josef.pktd at gmail.com Thu Jun 3 09:38:32 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 09:38:32 -0400 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: <4C07A52D.30503@enthought.com> References: <4C07A52D.30503@enthought.com> Message-ID: On Thu, Jun 3, 2010 at 8:50 AM, Warren Weckesser wrote: > stats.glm looks like it was started and then abandoned without being > finished. ?It was last touched in November 2007. ?Should this function > be deprecated so it can eventually be removed? My thoughts when I looked at it was roughly: leave it alone since it's working, but don't "advertise" it because we should get a better replacement. similar to linregress the more general version will be available when scipy.stats gets the full OLS model. >>> x = (np.arange(20)>9).astype(int) >>> y = x + np.random.randn(20) >>> stats.glm(y,x) (-1.7684287512254859, 0.093933208147769023) >>> stats.ttest_ind(y[:10], y[10:]) (-1.7684287512254859, 0.093933208147768926) In the current form it doesn't do much different than ttest_ind except for different argument structure. I think it could be made to work on string labels if _support.unique is replaced by np.unique (which we are doing in statsmodels) >>> x = (np.arange(20)>9).astype(str) >>> x array(['F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'T', 'T', 'T', 'T', 'T', 'T', 'T', 'T', 'T', 'T'], dtype='|S1') >>> stats.glm(y,x) Traceback (most recent call last): File "", line 1, in stats.glm(y,x) File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\stats.py", line 3315, in glm p = _support.unique(para) File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\_support.py", line 45, in unique if np.add.reduce(np.equal(uniques,item).flat) == 0: AttributeError: 'NotImplementedType' object has no attribute 'flat' Josef > > Warren > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From bsouthey at gmail.com Thu Jun 3 10:07:42 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Thu, 03 Jun 2010 09:07:42 -0500 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: Message-ID: <4C07B72E.2050504@gmail.com> On 06/02/2010 10:28 PM, Benjamin Root wrote: > As a power user of these tools, I often will encounter these warnings > while bulding my code piece-wise, however, I can easily imagine a case > where a regular user simply seeing a useful feature and spending time > coding around it, only to discover that it will soon be deprecated. I > would certainly be annoyed in such a case. 
> > A quick and easy way to list deprecations would be towards the end of > the docstring, but the user might not scroll all the way down past the > feature that they found. So, to raise visibility, such deprecation > warnings should be towards the beginning of the docstring. > > Just a thought... is it feasible for the doc building system to scan > through the function code and spot a deprecation warning and thereby > be able to add a list of deprecation warnings to the docstring? > Obviously, such warnings would have to follow some standard format, > but it would be neat if such things could be automated. > > Just my 2 cents, > Ben Root > > On Wed, Jun 2, 2010 at 10:07 PM, David Goldsmith > > wrote: > > On Wed, Jun 2, 2010 at 7:22 PM, Vincent Davis > > wrote: > > For example scipy.stats.stats.cov when you view source has > "scipy.stats.cov is deprecated; please update your code to use > numpy.cov." Should this be in the docs ? and is there an > example of > how this should be pointed out. > This is something I actually implemented in a program then > discovered > that is was deprecated. I would have like that to be in the online > docs. > > Thanks > Vincent > > > I vaguely recollect this being discussed before, but I can't find > anything about it in our docstring Standard, in our Q+A section, > nor (easily) at the Python site (generally, when in doubt, we > default to Python docstring standards); so, how 'bout it guys and > gals: should deprecation be noted in docstrings and if so, where > and how? > > DG > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > Users should first check that numpy does not have the functionality that a user needs. Duplicated functionality between numpy and scipy is or was a main reason for depreciation. There are or were cases where numpy is different than scipy but I think these are being corrected as when these are found. Many of the warnings predate the numpy and scipy documentation marathon efforts and some depreciations may still be in tickets so it is very doubtful that an automated system will detect either of these cases anyhow. In the doc marathon someone will have to find these cases and deal with them appropriately - noting, as the person who created the ticket, that some of the scipy.stats should be gone in the tentative scipy 0.9 release. In the future, someone will have to come up with a rule to force documentation change when a depreciation event occurs and then enforce it. In fact, for numpy (as scipy does not yet have the same policy) the desired documentation changes should be added to: http://projects.scipy.org/numpy/wiki/ApiDeprecation Bruce -------------- next part -------------- An HTML attachment was scrubbed... URL: From vincent at vincentdavis.net Thu Jun 3 10:10:25 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Thu, 3 Jun 2010 08:10:25 -0600 Subject: [SciPy-Dev] {True, False} should be replaced w/ bool, correct? In-Reply-To: References: Message-ID: On Thu, Jun 3, 2010 at 4:40 AM, Ralf Gommers wrote: > > > On Thu, Jun 3, 2010 at 1:04 PM, David Goldsmith > wrote: >> >> Just checking; see, e.g., scipy.io.matlab.mio.savemat appendmat >> parameter.? (Or is it possible that the function really needs to see either >> the word True or the word False?) 
> > Correct, {True, False} should always be changed to bool in the docs. I didn't see how the "defualt" should be noted on bool options. I think in most cases it should be clear but it might be nice is it was explicit. Vincent > > Cheers, > Ralf > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From vincent at vincentdavis.net Thu Jun 3 10:14:55 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Thu, 3 Jun 2010 08:14:55 -0600 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: <4C07B72E.2050504@gmail.com> References: <4C07B72E.2050504@gmail.com> Message-ID: On Thu, Jun 3, 2010 at 8:07 AM, Bruce Southey wrote: > On 06/02/2010 10:28 PM, Benjamin Root wrote: > > As a power user of these tools, I often will encounter these warnings while > bulding my code piece-wise, however, I can easily imagine a case where a > regular user simply seeing a useful feature and spending time coding around > it, only to discover that it will soon be deprecated.? I would certainly be > annoyed in such a case. > > A quick and easy way to list deprecations would be towards the end of the > docstring, but the user might not scroll all the way down past the feature > that they found.? So, to raise visibility, such deprecation warnings should > be towards the beginning of the docstring. > > Just a thought... is it feasible for the doc building system to scan through > the function code and spot a deprecation warning and thereby be able to add > a list of deprecation warnings to the docstring?? Obviously, such warnings > would have to follow some standard format, but it would be neat if such > things could be automated. > > Just my 2 cents, > Ben Root > > On Wed, Jun 2, 2010 at 10:07 PM, David Goldsmith > wrote: >> >> On Wed, Jun 2, 2010 at 7:22 PM, Vincent Davis >> wrote: >>> >>> For example scipy.stats.stats.cov when you view source has >>> "scipy.stats.cov is deprecated; please update your code to use >>> numpy.cov." Should this be in the docs ? and is there an example of >>> how this should be pointed out. >>> This is something I actually implemented in a program then discovered >>> that is was deprecated. I would have like that to be in the online >>> docs. >>> >>> Thanks >>> Vincent >> >> I vaguely recollect this being discussed before, but I can't find anything >> about it in our docstring Standard, in our Q+A section, nor (easily) at the >> Python site (generally, when in doubt, we default to Python docstring >> standards); so, how 'bout it guys and gals: should deprecation be noted in >> docstrings and if so, where and how? >> >> DG >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > Users should first check that numpy does not have the functionality that a > user needs. This is news to me, My point is that unless this is a very clearly and obviously presented in scipy it is an assumption only you know about :) Vincent Duplicated functionality between numpy and scipy is or was a > main reason for depreciation. There are or were cases where numpy is > different than scipy but I think these are being corrected as when these are > found. 
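(A small user-side aside while the documentation side gets sorted out: deprecations like the scipy.stats.cov case mentioned earlier can at least be made impossible to miss during development by turning the warnings into errors. This is plain standard-library usage, nothing scipy-specific:)

import warnings

# fail loudly on any DeprecationWarning while developing or running tests,
# so a deprecated function is noticed before code gets built around it
warnings.simplefilter('error', DeprecationWarning)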
> > Many of the warnings predate the numpy and scipy documentation marathon > efforts and some depreciations may still be in tickets so it is very > doubtful that an automated system will detect either of these cases anyhow. > In the doc marathon someone will have to find these cases and deal with them > appropriately - noting, as the person who created the ticket, that some of > the scipy.stats should be gone in the tentative scipy 0.9 release. > > In the future, someone will have to come up with a rule to force > documentation change when a depreciation event occurs and then enforce it. > In fact, for numpy (as scipy does not yet have the same policy) the desired > documentation changes should be added to: > http://projects.scipy.org/numpy/wiki/ApiDeprecation > > > Bruce > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From josef.pktd at gmail.com Thu Jun 3 10:15:32 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 10:15:32 -0400 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: <4C07B72E.2050504@gmail.com> References: <4C07B72E.2050504@gmail.com> Message-ID: On Thu, Jun 3, 2010 at 10:07 AM, Bruce Southey wrote: > On 06/02/2010 10:28 PM, Benjamin Root wrote: > > As a power user of these tools, I often will encounter these warnings while > bulding my code piece-wise, however, I can easily imagine a case where a > regular user simply seeing a useful feature and spending time coding around > it, only to discover that it will soon be deprecated.? I would certainly be > annoyed in such a case. > > A quick and easy way to list deprecations would be towards the end of the > docstring, but the user might not scroll all the way down past the feature > that they found.? So, to raise visibility, such deprecation warnings should > be towards the beginning of the docstring. > > Just a thought... is it feasible for the doc building system to scan through > the function code and spot a deprecation warning and thereby be able to add > a list of deprecation warnings to the docstring?? Obviously, such warnings > would have to follow some standard format, but it would be neat if such > things could be automated. > > Just my 2 cents, > Ben Root > > On Wed, Jun 2, 2010 at 10:07 PM, David Goldsmith > wrote: >> >> On Wed, Jun 2, 2010 at 7:22 PM, Vincent Davis >> wrote: >>> >>> For example scipy.stats.stats.cov when you view source has >>> "scipy.stats.cov is deprecated; please update your code to use >>> numpy.cov." Should this be in the docs ? and is there an example of >>> how this should be pointed out. >>> This is something I actually implemented in a program then discovered >>> that is was deprecated. I would have like that to be in the online >>> docs. >>> >>> Thanks >>> Vincent >> >> I vaguely recollect this being discussed before, but I can't find anything >> about it in our docstring Standard, in our Q+A section, nor (easily) at the >> Python site (generally, when in doubt, we default to Python docstring >> standards); so, how 'bout it guys and gals: should deprecation be noted in >> docstrings and if so, where and how? 
>> >> DG >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > Users should first check that numpy does not have the functionality that a > user needs. Duplicated functionality between numpy and scipy is or was a > main reason for depreciation. There are or were cases where numpy is > different than scipy but I think these are being corrected as when these are > found. > > Many of the warnings predate the numpy and scipy documentation marathon > efforts and some depreciations may still be in tickets so it is very > doubtful that an automated system will detect either of these cases anyhow. > In the doc marathon someone will have to find these cases and deal with them > appropriately - noting, as the person who created the ticket, that some of > the scipy.stats should be gone in the tentative scipy 0.9 release. > > In the future, someone will have to come up with a rule to force > documentation change when a depreciation event occurs and then enforce it. > In fact, for numpy (as scipy does not yet have the same policy) the desired > documentation changes should be added to: > http://projects.scipy.org/numpy/wiki/ApiDeprecation I have never seen any guidelines or rules to add Deprecation Warnings into the docstrings. It would be good to define a standard for the docstrings first. For scipy.stats, I just copied recently the deprecation warnings to the notes section, because the notes section does not have rules for it's content. Josef > > Bruce > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From warren.weckesser at enthought.com Thu Jun 3 10:18:02 2010 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Thu, 03 Jun 2010 09:18:02 -0500 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: References: <4C07A52D.30503@enthought.com> Message-ID: <4C07B99A.8080101@enthought.com> josef.pktd at gmail.com wrote: > On Thu, Jun 3, 2010 at 8:50 AM, Warren Weckesser > wrote: > >> stats.glm looks like it was started and then abandoned without being >> finished. It was last touched in November 2007. Should this function >> be deprecated so it can eventually be removed? >> > > My thoughts when I looked at it was roughly: > leave it alone since it's working, but don't "advertise" it because we > should get a better replacement. > How does one not advertise it? The docstring is wrong, incomplete, and not useful. It has no tests. Currently, it appears that it just duplicates ttest_ind. As far as I know, no one is working on it. Leaving it in wastes users' time reading about it. It erodes confidence in other functions in scipy: "Is foo() a good function, or has it been abandoned, like glm()?" To me, it is an ideal candidate for removal. Warren > similar to linregress the more general version will be available when > scipy.stats gets the full OLS model. > > >>>> x = (np.arange(20)>9).astype(int) >>>> y = x + np.random.randn(20) >>>> stats.glm(y,x) >>>> > (-1.7684287512254859, 0.093933208147769023) > >>>> stats.ttest_ind(y[:10], y[10:]) >>>> > (-1.7684287512254859, 0.093933208147768926) > > In the current form it doesn't do much different than ttest_ind except > for different argument structure. 
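Spelled out, the two-group case above is essentially the following; two_group_ttest is only meant to illustrate the equivalence, not a proposed replacement for glm, and np.unique handles string labels directly:

import numpy as np
from scipy import stats

def two_group_ttest(data, groups):
    # split the data by group label (integers or strings both work) and
    # delegate to ttest_ind, which is what stats.glm reduces to here
    data = np.asarray(data)
    groups = np.asarray(groups)
    levels = np.unique(groups)
    assert len(levels) == 2
    return stats.ttest_ind(data[groups == levels[0]], data[groups == levels[1]])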
> > I think it could be made to work on string labels if _support.unique > is replaced by np.unique (which we are doing in statsmodels) > > >>>> x = (np.arange(20)>9).astype(str) >>>> x >>>> > array(['F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'T', 'T', 'T', > 'T', 'T', 'T', 'T', 'T', 'T', 'T'], > dtype='|S1') > >>>> stats.glm(y,x) >>>> > Traceback (most recent call last): > File "", line 1, in > stats.glm(y,x) > File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\stats.py", > line 3315, in glm > p = _support.unique(para) > File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\_support.py", > line 45, in unique > if np.add.reduce(np.equal(uniques,item).flat) == 0: > AttributeError: 'NotImplementedType' object has no attribute 'flat' > > Josef > > >> Warren >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From ralf.gommers at googlemail.com Thu Jun 3 10:19:52 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Thu, 3 Jun 2010 22:19:52 +0800 Subject: [SciPy-Dev] [SciPy-User] log pdf, cdf, etc In-Reply-To: References: <6D1C6011-7B0A-45DB-9B54-6CAE1FA38F71@enthought.com> <12883887-E601-467B-9C56-55BDA8169C19@enthought.com> Message-ID: On Wed, Jun 2, 2010 at 7:25 AM, Travis Oliphant wrote: > > On Jun 1, 2010, at 8:19 AM, Ralf Gommers wrote: > > In summary, I see quite a few reasons why this shouldn't go in and don't > see a compelling reason to release it right now. The 0.9 release is > (tentatively) planned for September, so you don't have to worry that your > changes sit in trunk unreleased for 1.5 years. > > > As the one doing the work of release manager, you have a lot of latitude in > making this decision, of course. The compelling reason to release it > right now is to get the improved features which nobody has actually voiced > specific concerns about. > > Travis, I just removed the code from 0.8.x. It's still in trunk, and with the tests and docs you added for me that is fine. With a few months to shake out possible bugs and agree on the API it will be a very useful improvement for 0.9. > Suggestions about how to give gamma.fit and beta.fit the docstring of it's > parent would be appreciated. > How about (I didn't test this): self.fit.__doc__ = rv_continuous.fit.__doc__ Best regards, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at googlemail.com Thu Jun 3 10:26:00 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Thu, 3 Jun 2010 22:26:00 +0800 Subject: [SciPy-Dev] {True, False} should be replaced w/ bool, correct? In-Reply-To: References: Message-ID: On Thu, Jun 3, 2010 at 10:10 PM, Vincent Davis wrote: > On Thu, Jun 3, 2010 at 4:40 AM, Ralf Gommers > wrote: > > > > > > On Thu, Jun 3, 2010 at 1:04 PM, David Goldsmith > > > wrote: > >> > >> Just checking; see, e.g., scipy.io.matlab.mio.savemat appendmat > >> parameter. (Or is it possible that the function really needs to see > either > >> the word True or the word False?) > > > > Correct, {True, False} should always be changed to bool in the docs. > > I didn't see how the "defualt" should be noted on bool options. 
I > think in most cases it should be clear but it might be nice is it was > explicit. > > In the description of the parameter, for example: cap : bool, optional Whether to return this string in capital letters. Default is True. Noting defaults should be done not only for bool args, but for everything that has a default. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at googlemail.com Thu Jun 3 10:43:02 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Thu, 3 Jun 2010 22:43:02 +0800 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: <4C07B72E.2050504@gmail.com> Message-ID: On Thu, Jun 3, 2010 at 10:15 PM, wrote: > On Thu, Jun 3, 2010 at 10:07 AM, Bruce Southey wrote: > > On 06/02/2010 10:28 PM, Benjamin Root wrote: > > > > > Just a thought... is it feasible for the doc building system to scan > through > > the function code and spot a deprecation warning and thereby be able to > add > > a list of deprecation warnings to the docstring? Obviously, such > warnings > > would have to follow some standard format, but it would be neat if such > > things could be automated. > There's enough docstring manipulation going on already I think, this is not that much work so manual would be better. It should be put in at the moment the deprecation takes place. > > In the future, someone will have to come up with a rule to force > documentation change when a depreciation event occurs and then enforce it. > In fact, for numpy (as scipy does not yet have the same policy) the desired > documentation changes should be added to: > http://projects.scipy.org/numpy/wiki/ApiDeprecation I have never seen any guidelines or rules to add Deprecation Warnings > into the docstrings. It would be good to define a standard for the > docstrings first. It should be made as visible as possible in my opinion. A reST warning in between summary and extended summary would work. It should clearly state in which version it will be removed. Best to keep the text identical to the one passed to the deprecate decorator. A reason or alternative should be given as well. .. warning:: `myfunc` is deprecated and will be removed in SciPy 0.9. Look at `thatfunc` for equivalent functionality. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Jun 3 10:49:21 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 10:49:21 -0400 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: <4C07B99A.8080101@enthought.com> References: <4C07A52D.30503@enthought.com> <4C07B99A.8080101@enthought.com> Message-ID: On Thu, Jun 3, 2010 at 10:18 AM, Warren Weckesser wrote: > josef.pktd at gmail.com wrote: >> On Thu, Jun 3, 2010 at 8:50 AM, Warren Weckesser >> wrote: >> >>> stats.glm looks like it was started and then abandoned without being >>> finished. ?It was last touched in November 2007. ?Should this function >>> be deprecated so it can eventually be removed? >>> >> >> My thoughts when I looked at it was roughly: >> leave it alone since it's working, but don't "advertise" it because we >> should get a better replacement. >> > > How does one not advertise it? > > The docstring is wrong, incomplete, and not useful. That's it's not advertised > It has no tests. It has no tests (except for examples on my computer), but the results (for the basic case that I looked at) are correct. 
If we increase test coverage or start removing functions that don't have tests yet, I would work on box-cox, and several other functions in morestats.py . Mainly a question of priorities. > Currently, it appears that it just duplicates ttest_ind. ?As far as I > know, no one is working on it. > > Leaving it in wastes users' time reading about it. ?It erodes confidence > in other functions in scipy: ?"Is foo() a good function, or has it been > abandoned, like glm()?" > > To me, it is an ideal candidate for removal. If we apply strict criteria along those lines, we can reduce the size of scipy.stats.stats and scipy.stats.morestats, I guess, by at least a third. (Which I would do if I could start from scratch). A big fraction of functions in scipy.stats are in the category "no one is working on it". For glm specifically, I don't see any big cost of leaving it in, nor for deprecating it, and then I usually stick to the status-quo. But you can as well deprecate it, and point to ttest_ind. And for "bigger fish" like pdfmoments and pdf_approx, I never received a reply or opinion on the mailing list. statsmodels will have (or better, has in the sandbox) a generalization for glm, that works for any number of groups and includes both t_test and f_test. Josef > > Warren > >> similar to linregress the more general version will be available when >> scipy.stats gets the full OLS model. >> >> >>>>> x = (np.arange(20)>9).astype(int) >>>>> y = x + np.random.randn(20) >>>>> stats.glm(y,x) >>>>> >> (-1.7684287512254859, 0.093933208147769023) >> >>>>> stats.ttest_ind(y[:10], y[10:]) >>>>> >> (-1.7684287512254859, 0.093933208147768926) >> >> In the current form it doesn't do much different than ttest_ind except >> for different argument structure. >> >> I think it could be made to work on string labels if _support.unique >> is replaced by np.unique (which we are doing in statsmodels) >> >> >>>>> x = (np.arange(20)>9).astype(str) >>>>> x >>>>> >> array(['F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'T', 'T', 'T', >> ? ? ? ?'T', 'T', 'T', 'T', 'T', 'T', 'T'], >> ? ? ? dtype='|S1') >> >>>>> stats.glm(y,x) >>>>> >> Traceback (most recent call last): >> ? File "", line 1, in >> ? ? stats.glm(y,x) >> ? File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\stats.py", >> line 3315, in glm >> ? ? p = _support.unique(para) >> ? File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\_support.py", >> line 45, in unique >> ? ? if np.add.reduce(np.equal(uniques,item).flat) == 0: >> AttributeError: 'NotImplementedType' object has no attribute 'flat' >> >> Josef >> >> >>> Warren >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >>> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From ben.root at ou.edu Thu Jun 3 10:49:53 2010 From: ben.root at ou.edu (Benjamin Root) Date: Thu, 3 Jun 2010 09:49:53 -0500 Subject: [SciPy-Dev] Warning of deprecation in doc's ? 
In-Reply-To: <4C07B72E.2050504@gmail.com> References: <4C07B72E.2050504@gmail.com> Message-ID: On Thu, Jun 3, 2010 at 9:07 AM, Bruce Southey wrote: > On 06/02/2010 10:28 PM, Benjamin Root wrote: > > As a power user of these tools, I often will encounter these warnings while > bulding my code piece-wise, however, I can easily imagine a case where a > regular user simply seeing a useful feature and spending time coding around > it, only to discover that it will soon be deprecated. I would certainly be > annoyed in such a case. > > A quick and easy way to list deprecations would be towards the end of the > docstring, but the user might not scroll all the way down past the feature > that they found. So, to raise visibility, such deprecation warnings should > be towards the beginning of the docstring. > > Just a thought... is it feasible for the doc building system to scan > through the function code and spot a deprecation warning and thereby be able > to add a list of deprecation warnings to the docstring? Obviously, such > warnings would have to follow some standard format, but it would be neat if > such things could be automated. > > Just my 2 cents, > Ben Root > > On Wed, Jun 2, 2010 at 10:07 PM, David Goldsmith wrote: > >> On Wed, Jun 2, 2010 at 7:22 PM, Vincent Davis wrote: >> >>> For example scipy.stats.stats.cov when you view source has >>> "scipy.stats.cov is deprecated; please update your code to use >>> numpy.cov." Should this be in the docs ? and is there an example of >>> how this should be pointed out. >>> This is something I actually implemented in a program then discovered >>> that is was deprecated. I would have like that to be in the online >>> docs. >>> >>> Thanks >>> Vincent >>> >> >> I vaguely recollect this being discussed before, but I can't find anything >> about it in our docstring Standard, in our Q+A section, nor (easily) at the >> Python site (generally, when in doubt, we default to Python docstring >> standards); so, how 'bout it guys and gals: should deprecation be noted in >> docstrings and if so, where and how? >> >> DG >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing listSciPy-Dev at scipy.orghttp://mail.scipy.org/mailman/listinfo/scipy-dev > > Users should first check that numpy does not have the functionality that a > user needs. Duplicated functionality between numpy and scipy is or was a > main reason for depreciation. There are or were cases where numpy is > different than scipy but I think these are being corrected as when these are > found. > > I don't think that is a reasonable assumption to make for someone just learning how to use these packages. When I started using these packages myself about a year and a half ago, I remember not understanding the difference between scipy and numpy (and pylab... and matplotlib...) because they presented many of the same functions to me. At the time, I figured that I really was calling the same functions, just merely wrapped around the other, or something like that. It was quite confusing. A time evolution of my scripts would probably reveal some interesting insights into how my understanding of scipy/numpy changed. 
My point is that because there is so much shared functionality to the newbie user, that they will tend to treat scipy and numpy as synonymous, and the thought to check numpy's documentation will never even enter their minds. Therefore, one should be careful to note in deprecation warnings that a particular function is being deprecated because the functionality belongs in another package. That should raise awareness of the roles of the packages. Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Jun 3 10:52:24 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 10:52:24 -0400 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: <4C07B72E.2050504@gmail.com> Message-ID: On Thu, Jun 3, 2010 at 10:43 AM, Ralf Gommers wrote: > > > On Thu, Jun 3, 2010 at 10:15 PM, wrote: >> >> On Thu, Jun 3, 2010 at 10:07 AM, Bruce Southey wrote: >> > On 06/02/2010 10:28 PM, Benjamin Root wrote: >> >> > >> > Just a thought... is it feasible for the doc building system to scan >> > through >> > the function code and spot a deprecation warning and thereby be able to >> > add >> > a list of deprecation warnings to the docstring?? Obviously, such >> > warnings >> > would have to follow some standard format, but it would be neat if such >> > things could be automated. > > There's enough docstring manipulation going on already I think, this is not > that much work so manual would be better. It should be put in at the moment > the deprecation takes place. > >> >> In the future, someone will have to come up with a rule to force >> documentation change when a depreciation event occurs and then enforce it. >> In fact, for numpy (as scipy does not yet have the same policy) the >> desired >> documentation changes should be added to: >> http://projects.scipy.org/numpy/wiki/ApiDeprecation > >> I have never seen any guidelines or rules to add Deprecation Warnings >> into the docstrings. It would be good to define a standard for the >> docstrings first. > > It should be made as visible as possible in my opinion. A reST warning in > between summary and extended summary would work. It should clearly state in > which version it will be removed. Best to keep the text identical to the one > passed to the deprecate decorator. A reason or alternative should be given > as well. > > .. warning:: > ??? `myfunc` is deprecated and will be removed in SciPy 0.9. Look at > `thatfunc` for equivalent functionality. Sounds good to me, Does Sphinx and the webeditor accept warnings at that location? Josef > > Cheers, > Ralf > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From pav at iki.fi Thu Jun 3 10:52:48 2010 From: pav at iki.fi (Pauli Virtanen) Date: Thu, 3 Jun 2010 14:52:48 +0000 (UTC) Subject: [SciPy-Dev] Warning of deprecation in doc's ? References: <4C07B72E.2050504@gmail.com> Message-ID: Thu, 03 Jun 2010 22:43:02 +0800, Ralf Gommers wrote: [clip] > It should be made as visible as possible in my opinion. A reST warning > in between summary and extended summary would work. It should clearly > state in which version it will be removed. Best to keep the text > identical to the one passed to the deprecate decorator. A reason or > alternative should be given as well. > > .. warning:: > `myfunc` is deprecated and will be removed in SciPy 0.9. Look at > `thatfunc` for equivalent functionality. 
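For concreteness, a sketch of how the proposal quoted above might look on a function (the names `myfunc` and `thatfunc` are placeholders, as in the example): the docstring note carries the same text as the runtime DeprecationWarning.

    import warnings

    def myfunc(x):
        """
        Do something with `x`.

        .. warning::
            `myfunc` is deprecated and will be removed in SciPy 0.9.
            Look at `thatfunc` for equivalent functionality.
        """
        warnings.warn("myfunc is deprecated and will be removed in SciPy 0.9; "
                      "use thatfunc instead", DeprecationWarning)
        return x
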
Sphinx probably has a special format for deprecations. Best to use that, I believe. -- Pauli Virtanen From ralf.gommers at googlemail.com Thu Jun 3 11:01:17 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Thu, 3 Jun 2010 23:01:17 +0800 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: <4C07B72E.2050504@gmail.com> Message-ID: On Thu, Jun 3, 2010 at 10:52 PM, Pauli Virtanen wrote: > Thu, 03 Jun 2010 22:43:02 +0800, Ralf Gommers wrote: > [clip] > > It should be made as visible as possible in my opinion. A reST warning > > in between summary and extended summary would work. It should clearly > > state in which version it will be removed. Best to keep the text > > identical to the one passed to the deprecate decorator. A reason or > > alternative should be given as well. > > > > .. warning:: > > `myfunc` is deprecated and will be removed in SciPy 0.9. Look at > > `thatfunc` for equivalent functionality. > > Sphinx probably has a special format for deprecations. Best to use that, > I believe. > > Good point. In the Sphinx 0.6.6 docs I can't find it, but it seems there is indeed a ".. deprecated::" directive, https://bitbucket.org/birkenfeld/sphinx/issue/92/deprecated-options-not-working-in Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From bsouthey at gmail.com Thu Jun 3 11:05:45 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Thu, 03 Jun 2010 10:05:45 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> Message-ID: <4C07C4C9.6050104@gmail.com> On 06/03/2010 01:48 AM, josef.pktd at gmail.com wrote: > On Wed, Jun 2, 2010 at 4:03 PM, Bruce Southey wrote: > >> On 06/02/2010 01:41 PM, josef.pktd at gmail.com wrote: >> >> On Wed, Jun 2, 2010 at 2:18 PM, Neil Martinsen-Burrell >> wrote: >> >> >> On 2010-06-02 13:10 , Bruce Southey wrote: >> [...] >> >> >> >> However, this code is the chi-squared test part as SAS will compute the >> actual cell numbers. Also an extension to scipy.stats.chisquare() so we >> can not have both functions. >> >> >> Again, I don't understand what you mean that we can't have both >> functions? I believe (from a statistics teacher's point of view) that >> the Chi-Squared goodness of fit test (which is stats.chisquare) is a >> different beast from the Chi-Square test for independence (which is >> stats.chisquare_contingency). The fact that the distribution of the >> test statistic is the same should not tempt us to put them into the >> same function. >> >> >> Please read scipy.stats.chisquare() because scipy.stats.chisquare() is >> the 1-d case of yours. >> Quote from the docstring: >> " The chi square test tests the null hypothesis that the categorical data >> has the given frequencies." >> Also go the web site provided in the docstring. >> >> By default you get the expected frequencies but you can also put in your >> own using the f_exp variable. You could do the same in your code. >> >> >> In fact, Warren correctly used stats.chisquare with the expected >> frequencies calculated from the null hypothesis and the corrected >> degrees of freedom. chisquare_contingency is in some sense a >> convenience method for taking care of these pre-calculations before >> calling stats.chisquare. 
Can you explain more clearly to me why we >> should not include such a convenience function? >> >> >> Just a clarification, before I find time to work my way through the >> other comments >> >> stats.chisquare is a generic test for goodness-of-fit for discreted or >> binned distributions. >> and from the docstring of it >> "If no expected frequencies are given, the total >> N is assumed to be equally distributed across all groups." >> >> default is uniform distribution >> >> >> >> Try: >> http://en.wikipedia.org/wiki/Pearson's_chi-square_test >> >> The use of the uniform distribution is rather misleading and technically >> wrong as it does not help address the expected number of outcomes in a cell: >> > quote from the wikipedia page: > "A simple example is the hypothesis that an ordinary six-sided dice is > "fair", i.e., all six outcomes are equally likely to occur." > > I don't see anything misleading or technically wrong with the uniform > distributions, > or if they come from a Poisson, Hypergeometric, binned Normal or any > of number of other distributions. > Okay this must be only for the 1-way table as it does not apply to the 2-way or higher tables where the test is for independence between variables. There are valid technical reasons why it is misleading because saying that a random variable comes from some distribution has immutable meaning. Obviously if a random variable comes from the discrete uniform distribution then that random variable also must have a mean (N+1)/2, variance (N+1)*(N-1)/12 etc. There is nothing provided about the moments of the random variable provided under the null hypothesis so you can not say what distribution that a random variable is from. For example, the random variable could be from a beta-binomial distribution (as when alpha=beta=1 this is the discrete uniform) or binomial/multinomial with equal probabilities such that the statement 'all [the] outcomes are equally likely to occur' remains true. If you assume that your random variables are discrete uniform or any other distribution (except normal) then in general you can not assume that the Pearson's chi-squared test statistic has a specific distribution. However, in this case the Pearson's chi-squared test statistic is asymptotically chi-squared because of the normality assumption. So provided the central limit theorem is valid (not necessarily true for all distributions and for 'small' sample sizes) then this test will be asymptotically valid regardless of the assumption of the random variables in this case. >> http://en.wikipedia.org/wiki/Discrete_uniform_distribution >> >> >> chisquare_twoway is a special case that additional calculates the >> correct expected frequencies for the test of independencs based on the >> margin totals. The resulting distribution is not uniform. >> >> >> Actually the null hypothesis is rather different between 1-way and 2-way >> tables so you can not say that chisquare_twoway is a special case of >> chisquare. >> > What is the Null hypothesis in a one-way table? > > Josef > > SAS definition for 1-way table: "the null hypothesis specifies equal proportions of the total sample size for each class". This is not the same as saying a discrete uniform distribution as you are not directly testing that the cells have equal probability. But the ultimate outcome is probably not any different. Bruce >> I am not sure what you mean by the 'resulting distribution is not uniform'. 
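To make the one-way case concrete (counts made up, not from the thread): with no f_exp, stats.chisquare tests exactly the "equal proportions per class" null discussed here, and the same test can be written with the expected frequencies spelled out.

    import numpy as np
    from scipy import stats

    observed = np.array([18, 22, 16, 25, 20, 19])   # counts for the six faces
    chi2, p = stats.chisquare(observed)             # expected: equal frequencies
    # identical test with explicit expected frequencies
    expected = np.ones(6) * observed.sum() / 6.0
    chi2, p = stats.chisquare(observed, f_exp=expected)
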
>> The distribution of the cells values has nothing to do with the uniform >> distribution in either case because it is not used in the data nor in the >> formulation of the test. (And, yes, I have had to do the proof that the test >> statistic is Chi-squared - which is why there is the warning about small >> cells...). >> >> I agree with Neil that this is a very useful convenience function. >> >> >> My problem with the chisquare_twoway is that it should not call another >> function to finish two lines of code. It is just an excessive waste of >> resources. >> >> I never heard of a one-way contingency table, my question was whether >> the function should also handle 3-way or 4-way tables, additional to >> two-way. >> >> >> Correct to both of these as I just consider these as n-way tables. I think >> that contingency tables by definition only applies to the 2-d case. Pivot >> tables are essentially the same thing. I would have to lookup on how to get >> the expected number of outcomes but probably of the form Ni.. * N.j. >> *N..k/N... for the 3-way (the 2-way table is of the form Ni.*N.j/N..) for >> i=rows, j=columns, k=3rd axis and '.' means sum for that axis. >> >> I thought about the question how the input should be specified for my >> initial response, the alternative would be to use the original data or >> a "long" format instead of a table. But I thought that as a >> convenience function using the table format will be the most common >> use. >> >> I have written in the past functions that calculate the contingency >> table, and would be very useful to have a more complete coverage of >> tools to work with contingency tables in scipy.stats (or temporarily >> in statsmodels, where we are working also on the anova type of >> analysis) >> >> >> It depends on what tasks are needed. Really there are two steps: >> 1) Cross-tabulation that summarized the data from whatever input (groupby >> would help here). >> 2) Statistical tests - series of functions that accept summarized data only. >> >> If you have separate functions then the burden is on the user to find and >> call all the desired functions. You can also provide a single helper >> function to do all that because you don't want to repeat unnecessary calls. >> >> So, I think the way it is it is a nice function and we don't have to >> put all contingency table analysis into this function. >> >> Josef >> >> >> Bruce >> >> >> >> >> >> Really this should be combined with fisher.py in ticket 956: >> http://projects.scipy.org/scipy/ticket/956 >> >> >> Wow, apparently I have lots of disagreements today, but I don't think >> that this should be combined with Fisher's Exact test. (I would like >> to see that ticket mature to the point where it can be added to >> scipy.stats.) I like the functions in scipy.stats to correspond in a >> one-to-one manner with the statistical tests. I think that the docs >> should "See Also" the appropriate exact (and non-parametric) tests, >> but I think that one function/one test is a good rule. This is >> particularly true for people (like me) who would like to someday be >> able to use scipy.stats in a pedagogical context. >> >> -Neil >> >> >> I don't see any 'disagreements' rather just different ways to do things >> and identifying areas that need to be addressed for more general use. >> >> >> Agreed. :) >> >> [...] 
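A sketch of the margin-based expected counts described above (Ni.*N.j/N.. for the two-way case) and the resulting chi-square statistic with (R-1)*(C-1) degrees of freedom; the table values here are made up.

    import numpy as np
    from scipy import stats

    observed = np.array([[10., 20., 30.],
                         [ 6., 15.,  9.]])
    n = observed.sum()
    # outer product of row and column totals, divided by the grand total
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0)) / n

    chi2 = ((observed - expected) ** 2 / expected).sum()
    dof = (observed.shape[0] - 1) * (observed.shape[1] - 1)
    pvalue = stats.chi2.sf(chi2, dof)
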
>> >> -Neil >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsseabold at gmail.com Thu Jun 3 11:06:40 2010 From: jsseabold at gmail.com (Skipper Seabold) Date: Thu, 3 Jun 2010 11:06:40 -0400 Subject: [SciPy-Dev] np.savetxt: apply patch in enhancement ticket 1079 to add headers? In-Reply-To: References: <4C066DA3.8010609@gmail.com> Message-ID: On Wed, Jun 2, 2010 at 1:14 PM, Stefan wrote: > >> Not that I am complaining rather trying to understand what is expected >> to happen. >> Under the patch, it is very much user beware.? The header argument can >> be anything or nothing. There is no check for the contents or if the >> delimiter used is the same as the rest of the output. Further with the >> newline option there is no guarantee that the lines in the header will >> have the same line endings throughout the file. >> So what should a user be allowed to use as a header? >> You could write a whole program there or an explanation of the >> following output - which is very appealing. You could force a list of >> strings so that you print out newline.join(header) - okay not quite >> because it should include the comment argument. >> Should savetxt be restricted to something that loadtxt can read? >> This is potentially problematic if you want a header line. Although it >> could return the number of header lines. >> [savetxt should also be updated to allow bz2 as loadtxt handles those >> now - not that I have used it] >> >> >> >> >> Also note that since that patch was written, savetxt takes a user >> supplied newline keyword, so you can just append that to the header >> string. >> >> >> >> ? True, we were not aware of this, but this does not help much for the >> comment/header. >> >> >> >> Entered as ~3 months ago:http://projects.scipy.org/numpy/changeset/8180 >> Should this be forced to check for valid options for new lines? >> Otherwise you from this? 'np.savetxt('junk.text', [1,2,3,4,5], >> newline='what')' you get: >> > 1.000000000000000000e+00what2.000000000000000000e+00what > 3.000000000000000000e+00what4.000000000000000000e+00 > what5.000000000000000000e+00what >> Which is not going to be read back by loadtxt. >> >> >> >> As numpy.loadtxt has a default comment character ('#'), the same may be >> implemented for numpy.savetxt. In this case, numpy.savetxt would get two >> additional keywords (e.g. header, comment(character)), which bloats the >> interface, but potentially provides more safety. >> >> >> >> >> FWIW, I ended up rolling my own using the most recent pre-Python 3 >> changes for savetxt that accepts a list of names instead of one string >> or if the provided array has the attribute dtype.names (non-nested rec >> or structured arrays) it uses those. ?Whatever is done I think the >> support for structured arrays is nice, and I think having this >> functionality is a no-brainer. ?I need it quite often. >> >> >> >> ? 
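A rough sketch of the "roll your own" header approach mentioned above (file name and column names are made up): write a commented header line first, then hand the open file object to np.savetxt; np.loadtxt skips the '#' line by default, so the file stays readable without skiprows.

    import numpy as np

    data = np.column_stack((np.arange(5.0), np.arange(5.0) ** 2))
    names = ['x', 'x_squared']

    fh = open('example.txt', 'w')
    fh.write('# ' + ' '.join(names) + '\n')   # header written as a comment line
    np.savetxt(fh, data)
    fh.close()

    back = np.loadtxt('example.txt')          # the header line is ignored
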
Although, we have not been using record arrays too often, we see their >> advantages and agree that it should be possible to use them as you described >> it. >> We also thought about a solution, using the __str__ method for the 'header >> object'. In this vain, an arbitrary header class (including a plane string) >> providing an __str__ member may be handed to numpy.savetxt, >> which can use it to write the header. >> > > > So let us briefly summarize whats on the table. It appears to us that > there are basically three open issues: > (1) a csv like header for savetxt written files (first line contains column > ? ?names) > (2) comments (introduced by comment character e.g. '#') at the beginning > ? ?of the file (preceding the data) > (3) the role of the 'newline' option > > As was noted, the patch (ticket 1079) enables both to write a csv like > header (1) and comment line(s) introduced by a comment character (e.g. '#'). > Nonetheless, this solution is quite unsatisfactory > in our opinion, because it may be error prone, > as the user is in charge of the entire formatting. Despite this, we think > that it should be up to the user what amount of information is to be put > at the top of the file, but the format should be checked as far as possible. > > Using either a string or a list/tuple of strings, as proposed by Bruce, > seems to be a reasonable possibility to implement the desired functionality. > Maybe two individual keywords ('header' and 'comment') should exist to > distinguish whether the the user requests case (1) or (2). As for loadtxt > the default comment character should be '#', but it may be changed by the > user. > > We think that savetxt should not be restricted to output, which can be read > by loadtxt. Although it should be possible to add commments to the output > file, so that it remains readable by loadtxt (without tweaking it > e.g. with the skiprows keyword). > Thanks. This does clear up my confusion and I think having both a header and a comments keyword makes sense. For the form, as I said, I went with a list of strings, as I encounter this more often than one string, but in the end it's all the same to me. Glad this is getting some attention. > We agree that the newline keyword may cause inconsistencies in the file > (if ticket 1079 were applied), > and possibly strange behavior such as when newline='what' is specified. > Yet, this question does not only concern the header/comments. > > Stefan & Christian > > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From josef.pktd at gmail.com Thu Jun 3 11:03:34 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 11:03:34 -0400 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: References: <4C07A52D.30503@enthought.com> <4C07B99A.8080101@enthought.com> Message-ID: On Thu, Jun 3, 2010 at 10:49 AM, wrote: > On Thu, Jun 3, 2010 at 10:18 AM, Warren Weckesser > wrote: >> josef.pktd at gmail.com wrote: >>> On Thu, Jun 3, 2010 at 8:50 AM, Warren Weckesser >>> wrote: >>> >>>> stats.glm looks like it was started and then abandoned without being >>>> finished. ?It was last touched in November 2007. ?Should this function >>>> be deprecated so it can eventually be removed? >>>> >>> >>> My thoughts when I looked at it was roughly: >>> leave it alone since it's working, but don't "advertise" it because we >>> should get a better replacement. >>> >> >> How does one not advertise it? 
>> >> The docstring is wrong, incomplete, and not useful. > > That's it's not advertised > >> It has no tests. > > It has no tests (except for examples on my computer), but the results > (for the basic case that I looked at) are correct. > If we increase test coverage or start removing functions that don't > have tests yet, I would work on box-cox, and several other functions > in morestats.py . Mainly a question of priorities. > >> Currently, it appears that it just duplicates ttest_ind. ?As far as I >> know, no one is working on it. >> >> Leaving it in wastes users' time reading about it. ?It erodes confidence >> in other functions in scipy: ?"Is foo() a good function, or has it been >> abandoned, like glm()?" >> >> To me, it is an ideal candidate for removal. > > If we apply strict criteria along those lines, we can reduce the size > of scipy.stats.stats and scipy.stats.morestats, I guess, by at least a > third. (Which I would do if I could start from scratch). > A big fraction of functions in scipy.stats are in the category "no one > is working on it". > > For glm specifically, I don't see any big cost of leaving it in, nor > for deprecating it, and then I usually stick to the status-quo. But > you can as well deprecate it, and point to ttest_ind. > > And for "bigger fish" like pdfmoments and pdf_approx, I never received > a reply or opinion on the mailing list. > > statsmodels will have (or better, has in the sandbox) a generalization > for glm, that works for any number of groups and includes both t_test > and f_test. Actually, now that I have to think about glm again, I'm also in favor of deprecating it, since I can always point to the general version in statsmodels. Josef > > Josef > >> >> Warren >> >>> similar to linregress the more general version will be available when >>> scipy.stats gets the full OLS model. >>> >>> >>>>>> x = (np.arange(20)>9).astype(int) >>>>>> y = x + np.random.randn(20) >>>>>> stats.glm(y,x) >>>>>> >>> (-1.7684287512254859, 0.093933208147769023) >>> >>>>>> stats.ttest_ind(y[:10], y[10:]) >>>>>> >>> (-1.7684287512254859, 0.093933208147768926) >>> >>> In the current form it doesn't do much different than ttest_ind except >>> for different argument structure. >>> >>> I think it could be made to work on string labels if _support.unique >>> is replaced by np.unique (which we are doing in statsmodels) >>> >>> >>>>>> x = (np.arange(20)>9).astype(str) >>>>>> x >>>>>> >>> array(['F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'T', 'T', 'T', >>> ? ? ? ?'T', 'T', 'T', 'T', 'T', 'T', 'T'], >>> ? ? ? dtype='|S1') >>> >>>>>> stats.glm(y,x) >>>>>> >>> Traceback (most recent call last): >>> ? File "", line 1, in >>> ? ? stats.glm(y,x) >>> ? File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\stats.py", >>> line 3315, in glm >>> ? ? p = _support.unique(para) >>> ? File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\_support.py", >>> line 45, in unique >>> ? ? 
if np.add.reduce(np.equal(uniques,item).flat) == 0: >>> AttributeError: 'NotImplementedType' object has no attribute 'flat' >>> >>> Josef >>> >>> >>>> Warren >>>> >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>>> >>>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > From josef.pktd at gmail.com Thu Jun 3 11:22:42 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 11:22:42 -0400 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C07C4C9.6050104@gmail.com> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> <4C07C4C9.6050104@gmail.com> Message-ID: On Thu, Jun 3, 2010 at 11:05 AM, Bruce Southey wrote: > On 06/03/2010 01:48 AM, josef.pktd at gmail.com wrote: > > On Wed, Jun 2, 2010 at 4:03 PM, Bruce Southey wrote: > > > On 06/02/2010 01:41 PM, josef.pktd at gmail.com wrote: > > On Wed, Jun 2, 2010 at 2:18 PM, Neil Martinsen-Burrell > wrote: > > > On 2010-06-02 13:10 , Bruce Southey wrote: > [...] > > > > However, this code is the chi-squared test part as SAS will compute the > actual cell numbers. Also an extension to scipy.stats.chisquare() so we > can not have both functions. > > > Again, I don't understand what you mean that we can't have both > functions? I believe (from a statistics teacher's point of view) that > the Chi-Squared goodness of fit test (which is stats.chisquare) is a > different beast from the Chi-Square test for independence (which is > stats.chisquare_contingency). The fact that the distribution of the > test statistic is the same should not tempt us to put them into the > same function. > > > Please read scipy.stats.chisquare() because scipy.stats.chisquare() is > the 1-d case of yours. > Quote from the docstring: > " The chi square test tests the null hypothesis that the categorical data > has the given frequencies." > Also go the web site provided in the docstring. > > By default you get the expected frequencies but you can also put in your > own using the f_exp variable. You could do the same in your code. > > > In fact, Warren correctly used stats.chisquare with the expected > frequencies calculated from the null hypothesis and the corrected > degrees of freedom. ?chisquare_contingency is in some sense a > convenience method for taking care of these pre-calculations before > calling stats.chisquare. ?Can you explain more clearly to me why we > should not include such a convenience function? > > > Just a clarification, before I find time to work my way through the > other comments > > stats.chisquare is a generic test for goodness-of-fit for discreted or > binned distributions. > and from the docstring of it > "If no expected frequencies are given, the total > N is assumed to be equally distributed across all groups." 
> > default is uniform distribution > > > > Try: > http://en.wikipedia.org/wiki/Pearson's_chi-square_test > > The use of the uniform distribution is rather misleading and technically > wrong as it does not help address the expected number of outcomes in a cell: > > > quote from the wikipedia page: > "A simple example is the hypothesis that an ordinary six-sided dice is > "fair", i.e., all six outcomes are equally likely to occur." > > I don't see anything misleading or technically wrong with the uniform > distributions, > or if they come from a Poisson, Hypergeometric, binned Normal or any > of number of other distributions. > > > Okay this must be only for the 1-way table as it does not apply to the 2-way > or higher tables where the test is for independence between variables. I'm talking about a completely different strand of literature, e.g. a commercial program specialized on this http://www.mathwave.com/articles/goodness_of_fit.html#cs And never think of tables when I look at goodness-of-fit tests. I haven't seen yet a case where the asymptotic results for the chisquare test doesn't apply. > > There are valid technical reasons why it is misleading because saying that a > random variable comes from some distribution has immutable meaning. > Obviously if a random variable comes from the discrete uniform distribution > then that random variable also must have a mean (N+1)/2,? variance > (N+1)*(N-1)/12 etc. There is nothing provided about the moments of the > random variable provided under the null hypothesis so you can not say what > distribution that a random variable is from. For example, the random > variable could be from a beta-binomial distribution (as when alpha=beta=1 > this is the discrete uniform) or binomial/multinomial with equal > probabilities such that the statement 'all [the] outcomes are equally likely > to occur' remains true. > > If you assume that your random variables are discrete uniform or any other > distribution (except normal) then in general you can not assume that the > Pearson's chi-squared test statistic has a specific distribution. However, > in this case the Pearson's chi-squared test statistic is asymptotically > chi-squared because of the normality assumption. So provided the central > limit theorem is valid (not necessarily true for all distributions and for > 'small' sample sizes) then this test will be asymptotically valid regardless > of the assumption of the random variables in this case. > > http://en.wikipedia.org/wiki/Discrete_uniform_distribution > > > chisquare_twoway is a special case that additional calculates the > correct expected frequencies for the test of independencs based on the > margin totals. The resulting distribution is not uniform. > > > Actually the null hypothesis is rather different between 1-way and 2-way > tables so you can not say that chisquare_twoway is a special case of > chisquare. > > > What is the Null hypothesis in a one-way table? > > Josef > > > > SAS definition for 1-way table: "the null hypothesis specifies equal > proportions of the total sample size for each class". This is not the same > as saying a discrete uniform distribution as you are not directly testing > that the cells have equal probability. But the ultimate outcome is probably > not any different. Ok, I will have to look at this (when I have time), in my opinion this is inconsistent with the interpretation of a test for independence in a two-way or three-way table. 
Josef > > Bruce > > > I am not sure what you mean by the 'resulting distribution is not uniform'. > The distribution of the cells values has nothing to do with the uniform > distribution in either case because it is not used in the data nor in the > formulation of the test. (And, yes, I have had to do the proof that the test > statistic is Chi-squared - which is why there is the warning about small > cells...). > > I agree with Neil that this is a very useful convenience function. > > > My problem with the chisquare_twoway is that it should not call another > function to finish two lines of code. It is just an excessive waste of > resources. > > I never heard of a one-way contingency table, my question was whether > the function should also handle 3-way or 4-way tables, additional to > two-way. > > > Correct to both of these as I just consider these as n-way tables. I think > that contingency tables by definition only applies to the 2-d case. Pivot > tables are essentially the same thing. I would have to lookup on how to get > the expected number of outcomes but probably of the form Ni.. * N.j. > *N..k/N... for the 3-way (the 2-way table is of the form Ni.*N.j/N..) for > i=rows, j=columns, k=3rd axis and '.' means sum for that axis. > > I thought about the question how the input should be specified for my > initial response, the alternative would be to use the original data or > a "long" format instead of a table. But I thought that as a > convenience function using the table format will be the most common > use. > > I have written in the past functions that calculate the contingency > table, and would be very useful to have a more complete coverage of > tools to work with contingency tables in scipy.stats (or temporarily > in statsmodels, where we are working also on the anova type of > analysis) > > > It depends on what tasks are needed.? Really there are two steps: > 1) Cross-tabulation that summarized the data from whatever input (groupby > would help here). > 2) Statistical tests - series of functions that accept summarized data only. > > If you have separate functions then the burden is on the user to find and > call all the desired functions. You can also provide a single helper > function to do all that because you don't want to repeat unnecessary calls. > > So, I think the way it is it is a nice function and we don't have to > put all contingency table analysis into this function. > > Josef > > > Bruce > > > > > > Really this should be combined with fisher.py in ticket 956: > http://projects.scipy.org/scipy/ticket/956 > > > Wow, apparently I have lots of disagreements today, but I don't think > that this should be combined with Fisher's Exact test. (I would like > to see that ticket mature to the point where it can be added to > scipy.stats.) I like the functions in scipy.stats to correspond in a > one-to-one manner with the statistical tests. I think that the docs > should "See Also" the appropriate exact (and non-parametric) tests, > but I think that one function/one test is a good rule. This is > particularly true for people (like me) who would like to someday be > able to use scipy.stats in a pedagogical context. > > -Neil > > > I don't see any 'disagreements' rather just different ways to do things > and identifying areas that need to be addressed for more general use. > > > Agreed. :) > > [...] 
> > -Neil > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From njs at pobox.com Thu Jun 3 11:32:59 2010 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 3 Jun 2010 08:32:59 -0700 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: References: <4C07A52D.30503@enthought.com> Message-ID: On Thu, Jun 3, 2010 at 6:38 AM, wrote: > On Thu, Jun 3, 2010 at 8:50 AM, Warren Weckesser > wrote: >> stats.glm looks like it was started and then abandoned without being >> finished. ?It was last touched in November 2007. ?Should this function >> be deprecated so it can eventually be removed? > > My thoughts when I looked at it was roughly: > leave it alone since it's working, but don't "advertise" it because we > should get a better replacement. > similar to linregress the more general version will be available when > scipy.stats gets the full OLS model. Wait, what does 'glm' have to do with OLS (or t-tests) anyway? Surely if anything it *should* be a function that fits, you know, GLMs (generalized linear models)? I guess this is a vote for removing it, because GLMs are one of the fundamental stats models that people will look for, and having some weird, broken, other thing in the obvious place is just confusing and looks really bad. -- Nathaniel From warren.weckesser at enthought.com Thu Jun 3 11:51:42 2010 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Thu, 03 Jun 2010 10:51:42 -0500 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: References: <4C07A52D.30503@enthought.com> <4C07B99A.8080101@enthought.com> Message-ID: <4C07CF8E.4000906@enthought.com> josef.pktd at gmail.com wrote: > On Thu, Jun 3, 2010 at 10:49 AM, wrote: > >> On Thu, Jun 3, 2010 at 10:18 AM, Warren Weckesser >> wrote: >> >>> josef.pktd at gmail.com wrote: >>> >>>> On Thu, Jun 3, 2010 at 8:50 AM, Warren Weckesser >>>> wrote: >>>> >>>> >>>>> stats.glm looks like it was started and then abandoned without being >>>>> finished. It was last touched in November 2007. Should this function >>>>> be deprecated so it can eventually be removed? >>>>> >>>>> >>>> My thoughts when I looked at it was roughly: >>>> leave it alone since it's working, but don't "advertise" it because we >>>> should get a better replacement. >>>> >>>> >>> How does one not advertise it? >>> >>> The docstring is wrong, incomplete, and not useful. >>> >> That's it's not advertised >> >> >>> It has no tests. >>> >> It has no tests (except for examples on my computer), but the results >> (for the basic case that I looked at) are correct. >> If we increase test coverage or start removing functions that don't >> have tests yet, I would work on box-cox, and several other functions >> in morestats.py . Mainly a question of priorities. >> >> >>> Currently, it appears that it just duplicates ttest_ind. 
As far as I >>> know, no one is working on it. >>> >>> Leaving it in wastes users' time reading about it. It erodes confidence >>> in other functions in scipy: "Is foo() a good function, or has it been >>> abandoned, like glm()?" >>> >>> To me, it is an ideal candidate for removal. >>> >> If we apply strict criteria along those lines, we can reduce the size >> of scipy.stats.stats and scipy.stats.morestats, I guess, by at least a >> third. (Which I would do if I could start from scratch). >> A big fraction of functions in scipy.stats are in the category "no one >> is working on it". >> >> For glm specifically, I don't see any big cost of leaving it in, nor >> for deprecating it, and then I usually stick to the status-quo. But >> you can as well deprecate it, and point to ttest_ind. >> >> And for "bigger fish" like pdfmoments and pdf_approx, I never received >> a reply or opinion on the mailing list. >> >> statsmodels will have (or better, has in the sandbox) a generalization >> for glm, that works for any number of groups and includes both t_test >> and f_test. >> > > Actually, now that I have to think about glm again, I'm also in favor > of deprecating it, since I can always point to the general version in > statsmodels. > > Josef > > Heh... meanwhile I'm starting to think that my call for deprecation was premature, and maybe all it really needs is an updated, accurate docstring that explains what the current implementation does. :) Warren > > > >> Josef >> >> >>> Warren >>> >>> >>>> similar to linregress the more general version will be available when >>>> scipy.stats gets the full OLS model. >>>> >>>> >>>> >>>>>>> x = (np.arange(20)>9).astype(int) >>>>>>> y = x + np.random.randn(20) >>>>>>> stats.glm(y,x) >>>>>>> >>>>>>> >>>> (-1.7684287512254859, 0.093933208147769023) >>>> >>>> >>>>>>> stats.ttest_ind(y[:10], y[10:]) >>>>>>> >>>>>>> >>>> (-1.7684287512254859, 0.093933208147768926) >>>> >>>> In the current form it doesn't do much different than ttest_ind except >>>> for different argument structure. 
>>>> >>>> I think it could be made to work on string labels if _support.unique >>>> is replaced by np.unique (which we are doing in statsmodels) >>>> >>>> >>>> >>>>>>> x = (np.arange(20)>9).astype(str) >>>>>>> x >>>>>>> >>>>>>> >>>> array(['F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'T', 'T', 'T', >>>> 'T', 'T', 'T', 'T', 'T', 'T', 'T'], >>>> dtype='|S1') >>>> >>>> >>>>>>> stats.glm(y,x) >>>>>>> >>>>>>> >>>> Traceback (most recent call last): >>>> File "", line 1, in >>>> stats.glm(y,x) >>>> File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\stats.py", >>>> line 3315, in glm >>>> p = _support.unique(para) >>>> File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\_support.py", >>>> line 45, in unique >>>> if np.add.reduce(np.equal(uniques,item).flat) == 0: >>>> AttributeError: 'NotImplementedType' object has no attribute 'flat' >>>> >>>> Josef >>>> >>>> >>>> >>>>> Warren >>>>> >>>>> _______________________________________________ >>>>> SciPy-Dev mailing list >>>>> SciPy-Dev at scipy.org >>>>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>>>> >>>>> >>>>> >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>>> >>>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >>> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From josef.pktd at gmail.com Thu Jun 3 11:53:41 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 11:53:41 -0400 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: References: <4C07A52D.30503@enthought.com> Message-ID: On Thu, Jun 3, 2010 at 11:32 AM, Nathaniel Smith wrote: > On Thu, Jun 3, 2010 at 6:38 AM, ? wrote: >> On Thu, Jun 3, 2010 at 8:50 AM, Warren Weckesser >> wrote: >>> stats.glm looks like it was started and then abandoned without being >>> finished. ?It was last touched in November 2007. ?Should this function >>> be deprecated so it can eventually be removed? >> >> My thoughts when I looked at it was roughly: >> leave it alone since it's working, but don't "advertise" it because we >> should get a better replacement. >> similar to linregress the more general version will be available when >> scipy.stats gets the full OLS model. > > Wait, what does 'glm' have to do with OLS (or t-tests) anyway? Surely > if anything it *should* be a function that fits, you know, GLMs > (generalized linear models)? > > I guess this is a vote for removing it, because GLMs are one of the > fundamental stats models that people will look for, and having some > weird, broken, other thing in the obvious place is just confusing and > looks really bad. That was my initial impression a long time ago. GLM as in general linear model not generalized. (It's the worst conflicting acronym in stats). The function actually estimates a GLM, it construct a binary dummy variable from the label data to get the design matrix, estimates it with OLS, calculates the t-statistic and the corresponding p-value. But then it becomes like the ttest_ind because it only returns the t-statistic and the corresponding p-value. 
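A rough sketch of the np.unique-based dummy coding mentioned above, for string group labels (illustrative only, with made-up labels; not the actual stats.glm or statsmodels code):

    import numpy as np

    labels = np.array(['F'] * 10 + ['T'] * 10)
    # return_inverse maps each label to an integer group code (0, 1, ...)
    groups, code = np.unique(labels, return_inverse=True)
    dummy = (code == 1).astype(float)               # 0/1 indicator for the second group
    X = np.column_stack((np.ones(len(labels)), dummy))
    # X can then be handed to an ordinary least-squares fit as sketched earlier
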
I don't remember seeing any previous comments about it on the mailing list, but it would be a prime candidate for "finishing" it. (except finishing it requires a full module on it's own.) The discussion what glm (general linear model) has to do with ols fills already many pages on the pystatsmodels mailing list. (GLM in statsmodels is generalized linear model) Josef > > -- Nathaniel > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From josef.pktd at gmail.com Thu Jun 3 12:03:23 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 12:03:23 -0400 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: <4C07CF8E.4000906@enthought.com> References: <4C07A52D.30503@enthought.com> <4C07B99A.8080101@enthought.com> <4C07CF8E.4000906@enthought.com> Message-ID: On Thu, Jun 3, 2010 at 11:51 AM, Warren Weckesser wrote: > josef.pktd at gmail.com wrote: >> On Thu, Jun 3, 2010 at 10:49 AM, ? wrote: >> >>> On Thu, Jun 3, 2010 at 10:18 AM, Warren Weckesser >>> wrote: >>> >>>> josef.pktd at gmail.com wrote: >>>> >>>>> On Thu, Jun 3, 2010 at 8:50 AM, Warren Weckesser >>>>> wrote: >>>>> >>>>> >>>>>> stats.glm looks like it was started and then abandoned without being >>>>>> finished. ?It was last touched in November 2007. ?Should this function >>>>>> be deprecated so it can eventually be removed? >>>>>> >>>>>> >>>>> My thoughts when I looked at it was roughly: >>>>> leave it alone since it's working, but don't "advertise" it because we >>>>> should get a better replacement. >>>>> >>>>> >>>> How does one not advertise it? >>>> >>>> The docstring is wrong, incomplete, and not useful. >>>> >>> That's it's not advertised >>> >>> >>>> It has no tests. >>>> >>> It has no tests (except for examples on my computer), but the results >>> (for the basic case that I looked at) are correct. >>> If we increase test coverage or start removing functions that don't >>> have tests yet, I would work on box-cox, and several other functions >>> in morestats.py . Mainly a question of priorities. >>> >>> >>>> Currently, it appears that it just duplicates ttest_ind. ?As far as I >>>> know, no one is working on it. >>>> >>>> Leaving it in wastes users' time reading about it. ?It erodes confidence >>>> in other functions in scipy: ?"Is foo() a good function, or has it been >>>> abandoned, like glm()?" >>>> >>>> To me, it is an ideal candidate for removal. >>>> >>> If we apply strict criteria along those lines, we can reduce the size >>> of scipy.stats.stats and scipy.stats.morestats, I guess, by at least a >>> third. (Which I would do if I could start from scratch). >>> A big fraction of functions in scipy.stats are in the category "no one >>> is working on it". >>> >>> For glm specifically, I don't see any big cost of leaving it in, nor >>> for deprecating it, and then I usually stick to the status-quo. But >>> you can as well deprecate it, and point to ttest_ind. >>> >>> And for "bigger fish" like pdfmoments and pdf_approx, I never received >>> a reply or opinion on the mailing list. >>> >>> statsmodels will have (or better, has in the sandbox) a generalization >>> for glm, that works for any number of groups and includes both t_test >>> and f_test. >>> >> >> Actually, now that I have to think about glm again, I'm also in favor >> of deprecating it, since I can always point to the general version in >> statsmodels. >> >> Josef >> >> > > Heh... 
meanwhile I'm starting to think that my call for deprecation was > premature, and maybe all it really needs is an updated, accurate > docstring that explains what the current implementation does. ?:) You should stay firm to compensate for my reluctance to change things that are not (obviously or really) broken. :) As, I said I'm really pretty indifferent in this case. (But I wouldn't want to see wide spread use of it, because as Nathaniel said, the name is very misleading for the current result.) So, if you want to keep it mention clearly that it only does a ttest. Josef > > Warren > >> >> >> >>> Josef >>> >>> >>>> Warren >>>> >>>> >>>>> similar to linregress the more general version will be available when >>>>> scipy.stats gets the full OLS model. >>>>> >>>>> >>>>> >>>>>>>> x = (np.arange(20)>9).astype(int) >>>>>>>> y = x + np.random.randn(20) >>>>>>>> stats.glm(y,x) >>>>>>>> >>>>>>>> >>>>> (-1.7684287512254859, 0.093933208147769023) >>>>> >>>>> >>>>>>>> stats.ttest_ind(y[:10], y[10:]) >>>>>>>> >>>>>>>> >>>>> (-1.7684287512254859, 0.093933208147768926) >>>>> >>>>> In the current form it doesn't do much different than ttest_ind except >>>>> for different argument structure. >>>>> >>>>> I think it could be made to work on string labels if _support.unique >>>>> is replaced by np.unique (which we are doing in statsmodels) >>>>> >>>>> >>>>> >>>>>>>> x = (np.arange(20)>9).astype(str) >>>>>>>> x >>>>>>>> >>>>>>>> >>>>> array(['F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'F', 'T', 'T', 'T', >>>>> ? ? ? ?'T', 'T', 'T', 'T', 'T', 'T', 'T'], >>>>> ? ? ? dtype='|S1') >>>>> >>>>> >>>>>>>> stats.glm(y,x) >>>>>>>> >>>>>>>> >>>>> Traceback (most recent call last): >>>>> ? File "", line 1, in >>>>> ? ? stats.glm(y,x) >>>>> ? File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\stats.py", >>>>> line 3315, in glm >>>>> ? ? p = _support.unique(para) >>>>> ? File "C:\Josef\_progs\Subversion\scipy-trunk_after\trunk\dist\scipy-0.8.0.dev6416.win32\Programs\Python25\Lib\site-packages\scipy\stats\_support.py", >>>>> line 45, in unique >>>>> ? ? if np.add.reduce(np.equal(uniques,item).flat) == 0: >>>>> AttributeError: 'NotImplementedType' object has no attribute 'flat' >>>>> >>>>> Josef >>>>> >>>>> >>>>> >>>>>> Warren >>>>>> >>>>>> _______________________________________________ >>>>>> SciPy-Dev mailing list >>>>>> SciPy-Dev at scipy.org >>>>>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>>>>> >>>>>> >>>>>> >>>>> _______________________________________________ >>>>> SciPy-Dev mailing list >>>>> SciPy-Dev at scipy.org >>>>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>>>> >>>>> >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>>> >>>> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From njs at pobox.com Thu Jun 3 12:16:22 2010 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 3 Jun 2010 09:16:22 -0700 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: References: <4C07A52D.30503@enthought.com> Message-ID: On Thu, Jun 3, 2010 at 8:53 AM, wrote: > GLM as in general linear model not generalized. (It's the worst > conflicting acronym in stats). 
Sure, and lets not even talk about generalized least squares (unrelated to both!). But the general linear model is basically identical to a simple linear model, both in interface and implementation. There's no reason to have a separate function for it, one should just accept a matrix for the "y" variable in the OLS code. But *generalized* linear models are different in interface, implementation, and are almost as much of a stats workhorse as standard linear models. So every book I've ever seen uses the abbreviation "glm" to refer to the generalized version. (Also, this is what R calls the function ;-).) The implementation of dummy coding is kind of useful, but this is the wrong place and the wrong name... (Also, its least squares implementation calls inv -- the textbook example of bad numerics!) ...Okay, you know all that anyway, the question is what to do with it. If the problem were just that it needed a better implementation and some new features added, then maybe we would keep it and let it be improved incrementally. But the interface is just wrong, so we'll be removing it sooner or later, and it might as well be sooner, rather than prolong the agony. -- Nathaniel From josef.pktd at gmail.com Thu Jun 3 12:31:25 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 12:31:25 -0400 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: References: <4C07A52D.30503@enthought.com> Message-ID: On Thu, Jun 3, 2010 at 12:16 PM, Nathaniel Smith wrote: > On Thu, Jun 3, 2010 at 8:53 AM, ? wrote: >> GLM as in general linear model not generalized. (It's the worst >> conflicting acronym in stats). > > Sure, and lets not even talk about generalized least squares > (unrelated to both!). > > But the general linear model is basically identical to a simple linear > model, both in interface and implementation. There's no reason to have > a separate function for it, one should just accept a matrix for the > "y" variable in the OLS code. But *generalized* linear models are > different in interface, implementation, and are almost as much of a > stats workhorse as standard linear models. So every book I've ever > seen uses the abbreviation "glm" to refer to the generalized version. > (Also, this is what R calls the function ;-).) coming more from the econometrics side, I never heard of "generalized" until two years ago, and glm was always general linear model, (scikits.learn and many other packages use it in this definition) > > The implementation of dummy coding is kind of useful, but this is the > wrong place and the wrong name... > > (Also, its least squares implementation calls inv -- the textbook > example of bad numerics!) > > ...Okay, you know all that anyway, the question is what to do with it. > If the problem were just that it needed a better implementation and > some new features added, then maybe we would keep it and let it be > improved incrementally. But the interface is just wrong, so we'll be > removing it sooner or later, and it might as well be sooner, rather > than prolong the agony. Actually my version for stats.glm, as a test not as an estimation model uses least squares in the name, but has a similar interface http://bazaar.launchpad.net/~scipystats/statsmodels/trunk/annotate/head%3A/scikits/statsmodels/sandbox/regression/onewaygls.py class OneWayLS(object): '''Class to test equality of regression coefficients across groups This class performs tests whether the linear regression coefficients are the same across pre-specified groups. 
This can be used to test for structural breaks at given change points, or for ANOVA style analysis of differences in the effect of explanatory variables across groups. I don't see a way to provide a "better implementation and add some new features" without going full scale. That's why I agree now with deprecation, since after this thread it's not a hidden legacy/fossil anymore. Josef > > -- Nathaniel > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From vincent at vincentdavis.net Thu Jun 3 12:58:25 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Thu, 3 Jun 2010 10:58:25 -0600 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: <4C07B72E.2050504@gmail.com> Message-ID: On Thu, Jun 3, 2010 at 8:43 AM, Ralf Gommers wrote: > > > On Thu, Jun 3, 2010 at 10:15 PM, wrote: >> >> On Thu, Jun 3, 2010 at 10:07 AM, Bruce Southey wrote: >> > On 06/02/2010 10:28 PM, Benjamin Root wrote: >> >> > >> > Just a thought... is it feasible for the doc building system to scan >> > through >> > the function code and spot a deprecation warning and thereby be able to >> > add >> > a list of deprecation warnings to the docstring?? Obviously, such >> > warnings >> > would have to follow some standard format, but it would be neat if such >> > things could be automated. > > There's enough docstring manipulation going on already I think, this is not > that much work so manual would be better. It should be put in at the moment > the deprecation takes place. > >> >> In the future, someone will have to come up with a rule to force >> documentation change when a depreciation event occurs and then enforce it. >> In fact, for numpy (as scipy does not yet have the same policy) the >> desired >> documentation changes should be added to: >> http://projects.scipy.org/numpy/wiki/ApiDeprecation > >> I have never seen any guidelines or rules to add Deprecation Warnings >> into the docstrings. It would be good to define a standard for the >> docstrings first. > > It should be made as visible as possible in my opinion. A reST warning in > between summary and extended summary would work. It should clearly state in > which version it will be removed. Best to keep the text identical to the one > passed to the deprecate decorator. A reason or alternative should be given > as well. I would prefer to see it at the very top. If there is an easily available alternative why would I as a user not what to immediately view that alternative? If I am already using it then it is a good remider. Why put it after the summary? Vincent > .. warning:: > ??? `myfunc` is deprecated and will be removed in SciPy 0.9. Look at > `thatfunc` for equivalent functionality. > > Cheers, > Ralf > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From josef.pktd at gmail.com Thu Jun 3 12:59:01 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 12:59:01 -0400 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: References: <4C07A52D.30503@enthought.com> Message-ID: On Thu, Jun 3, 2010 at 12:31 PM, wrote: > On Thu, Jun 3, 2010 at 12:16 PM, Nathaniel Smith wrote: >> On Thu, Jun 3, 2010 at 8:53 AM, ? wrote: >>> GLM as in general linear model not generalized. (It's the worst >>> conflicting acronym in stats). 
>> >> Sure, and lets not even talk about generalized least squares >> (unrelated to both!). >> >> But the general linear model is basically identical to a simple linear >> model, both in interface and implementation. There's no reason to have >> a separate function for it, one should just accept a matrix for the >> "y" variable in the OLS code. But *generalized* linear models are >> different in interface, implementation, and are almost as much of a >> stats workhorse as standard linear models. So every book I've ever >> seen uses the abbreviation "glm" to refer to the generalized version. >> (Also, this is what R calls the function ;-).) > > coming more from the econometrics side, I never heard of "generalized" > until two years ago, and glm was always general linear model, > (scikits.learn and many other packages use it in this definition) > > >> >> The implementation of dummy coding is kind of useful, but this is the >> wrong place and the wrong name... >> >> (Also, its least squares implementation calls inv -- the textbook >> example of bad numerics!) >> >> ...Okay, you know all that anyway, the question is what to do with it. >> If the problem were just that it needed a better implementation and >> some new features added, then maybe we would keep it and let it be >> improved incrementally. But the interface is just wrong, so we'll be >> removing it sooner or later, and it might as well be sooner, rather >> than prolong the agony. > > Actually my version for stats.glm, as a test not as an estimation > model uses least squares in the name, but has a similar interface > > http://bazaar.launchpad.net/~scipystats/statsmodels/trunk/annotate/head%3A/scikits/statsmodels/sandbox/regression/onewaygls.py > > class OneWayLS(object): > '''Class to test equality of regression coefficients across groups > > This class performs tests whether the linear regression coefficients are > the same across pre-specified groups. This can be used to test for > structural breaks at given change points, or for ANOVA style analysis of > differences in the effect of explanatory variables across groups. Actually, I don't have ttest results, because I only look at the general case with two or more groups and only ftest is relevant in this case, so the simplest case of it is similar to stats.f_oneway not stats.glm http://bazaar.launchpad.net/~scipystats/statsmodels/trunk/annotate/head%3A/scikits/statsmodels/sandbox/examples/ex_onewaygls.py#L99 And thanks Warren and Nathaniel for voicing some strong opinions, it's very useful to break my indifference (economic utility definition). Josef > > I don't see a way to provide a "better implementation and add some new > features" without going full scale. > > That's why I agree now with deprecation, since after this thread it's > not a hidden legacy/fossil anymore. > > Josef > >> >> -- Nathaniel >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > From bsouthey at gmail.com Thu Jun 3 13:14:44 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Thu, 03 Jun 2010 12:14:44 -0500 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: References: <4C07A52D.30503@enthought.com> Message-ID: <4C07E304.8040503@gmail.com> On 06/03/2010 10:32 AM, Nathaniel Smith wrote: > On Thu, Jun 3, 2010 at 6:38 AM, wrote: > >> On Thu, Jun 3, 2010 at 8:50 AM, Warren Weckesser >> wrote: >> >>> stats.glm looks like it was started and then abandoned without being >>> finished. 
It was last touched in November 2007. Should this function >>> be deprecated so it can eventually be removed? >>> >> My thoughts when I looked at it was roughly: >> leave it alone since it's working, but don't "advertise" it because we >> should get a better replacement. >> similar to linregress the more general version will be available when >> scipy.stats gets the full OLS model. >> > Wait, what does 'glm' have to do with OLS (or t-tests) anyway? Surely > if anything it *should* be a function that fits, you know, GLMs > (generalized linear models)? > > I guess this is a vote for removing it, because GLMs are one of the > fundamental stats models that people will look for, and having some > weird, broken, other thing in the obvious place is just confusing and > looks really bad. > > -- Nathaniel > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > Perhaps people should actually read the code before jumping to incorrect conclusions. It is not similar to linregress unless you know how to 'trick' linreg. Granted that stats.glm is a crippled but it is well intended (like most things in scipy.stats). The docstring intended it to general linear models such as SAS's glm procedure and R's glm function (without generalized part). At present is just does 1-way anova with only two levels but could do more. >>> drug=[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2] >>> postrt=[6, 0, 2, 8, 11, 4, 13, 1, 8, 0, 0, 2, 3, 1, 18, 4, 14, 9, 1, 9, 13, 10, 18, 5, 23, 12, 5, 16, 1, 20] >>> t_val,t_probs=stats.glm(postrt,drug) >>> t_val -1.5463854661015379 >>> t_probs 0.13324062984741347 >>> idrug=[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0] #create dummies to trick linreg >>> print stats.linregress(idrug, postrt) (-3.9000000000000044, 9.2000000000000011, -0.280506586484015, 0.13324062984741378, 2.5220102526131258) >>> -3.9000000000000044/2.5220102526131258 #this is the t-value of stats.glm -1.5463854661015373 I have major concerns about depreciating code when there is no alternative proposed for such an important statistical function. As David has said elsewhere, this is just Python code and has little or no maintenance cost. The full solution is probably Jonathan Taylor's glm class but that uses the formula class and is for generalized linear models. However, I don't see that in scipy anywhere soon. So the options are: 1) Rewrite the internals to fix address the current limitation - not hard but would need an API change and more importantly better options exist. 2) OLS is a superior version to linregress but needs changes to get ANOVA etc added http://www.scipy.org/Cookbook/OLS 3) The best candidate that I know that can replace both stats.linregress and stats.glm is Skipper's try_ols_anova.py code from pystatsmodel (at least posted on the list). But I am not sure what the current state of that is. 4) Some other option? Bruce -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Thu Jun 3 13:35:37 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 3 Jun 2010 10:35:37 -0700 Subject: [SciPy-Dev] {True, False} should be replaced w/ bool, correct? 
In-Reply-To: References: Message-ID: On Thu, Jun 3, 2010 at 7:26 AM, Ralf Gommers wrote: > > > On Thu, Jun 3, 2010 at 10:10 PM, Vincent Davis wrote: > >> On Thu, Jun 3, 2010 at 4:40 AM, Ralf Gommers >> wrote: >> > >> > >> > On Thu, Jun 3, 2010 at 1:04 PM, David Goldsmith < >> d.l.goldsmith at gmail.com> >> > wrote: >> >> >> >> Just checking; see, e.g., scipy.io.matlab.mio.savemat appendmat >> >> parameter. (Or is it possible that the function really needs to see >> either >> >> the word True or the word False?) >> > >> > Correct, {True, False} should always be changed to bool in the docs. >> >> I didn't see how the "defualt" should be noted on bool options. I >> think in most cases it should be clear but it might be nice is it was >> explicit. >> >> In the description of the parameter, for example: > cap : bool, optional > Whether to return this string in capital letters. Default is True. > > Noting defaults should be done not only for bool args, but for everything > that has a default. > > Cheers, > Ralf > Thanks, Ralf. Let me just add that if the default isn't clear in the existing docs, it may be necessary, as the editor, to look at the source. Thanks again. DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From vincent at vincentdavis.net Thu Jun 3 13:48:06 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Thu, 3 Jun 2010 11:48:06 -0600 Subject: [SciPy-Dev] {True, False} should be replaced w/ bool, correct? In-Reply-To: References: Message-ID: n Thursday, June 3, 2010, David Goldsmith wrote: > On Thu, Jun 3, 2010 at 7:26 AM, Ralf Gommers wrote: > > > > On Thu, Jun 3, 2010 at 10:10 PM, Vincent Davis wrote: > > On Thu, Jun 3, 2010 at 4:40 AM, Ralf Gommers > wrote: >> >> >> On Thu, Jun 3, 2010 at 1:04 PM, David Goldsmith >> wrote: >>> >>> Just checking; see, e.g., scipy.io.matlab.mio.savemat appendmat >>> parameter.? (Or is it possible that the function really needs to see either >>> the word True or the word False?) >> >> Correct, {True, False} should always be changed to bool in the docs. > > I didn't see how the "defualt" should be noted on bool options. I > think in most cases it should be clear but it might be nice is it was > explicit. > > In the description of the parameter, for example: > cap : bool, optional > ? ? Whether to return this string in capital letters. Default is True. > > Noting defaults should be done not only for bool args, but for everything that has a default. Not sure what I was looking at but it did not state the default, that's way I ask and I didn't see anything in the guide but I might have missed it. vincent > > Cheers, > Ralf > > Thanks, Ralf.? Let me just add that if the default isn't clear in the existing docs, it may be necessary, as the editor, to look at the source.? Thanks again. > > DG > > From d.l.goldsmith at gmail.com Thu Jun 3 13:49:15 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 3 Jun 2010 10:49:15 -0700 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: <4C07B72E.2050504@gmail.com> Message-ID: OK, we're getting enough proposed content here that I think a formal modification of the docstring Standard is warranted; accordingly, I'm going to file a ticket. I'll post the link here and then if you want to be on the notification-of-ticket-changes list you can go there and add yourself. 
That way, this discussion of where this entry should live, what it should contain, how it should be formatted, etc., etc., will be in a more appropriate, easier to find place. Back shortly. DG On Thu, Jun 3, 2010 at 9:58 AM, Vincent Davis wrote: > On Thu, Jun 3, 2010 at 8:43 AM, Ralf Gommers > wrote: > > > > > > On Thu, Jun 3, 2010 at 10:15 PM, wrote: > >> > >> On Thu, Jun 3, 2010 at 10:07 AM, Bruce Southey > wrote: > >> > On 06/02/2010 10:28 PM, Benjamin Root wrote: > >> > >> > > >> > Just a thought... is it feasible for the doc building system to scan > >> > through > >> > the function code and spot a deprecation warning and thereby be able > to > >> > add > >> > a list of deprecation warnings to the docstring? Obviously, such > >> > warnings > >> > would have to follow some standard format, but it would be neat if > such > >> > things could be automated. > > > > There's enough docstring manipulation going on already I think, this is > not > > that much work so manual would be better. It should be put in at the > moment > > the deprecation takes place. > > > >> > >> In the future, someone will have to come up with a rule to force > >> documentation change when a depreciation event occurs and then enforce > it. > >> In fact, for numpy (as scipy does not yet have the same policy) the > >> desired > >> documentation changes should be added to: > >> http://projects.scipy.org/numpy/wiki/ApiDeprecation > > > >> I have never seen any guidelines or rules to add Deprecation Warnings > >> into the docstrings. It would be good to define a standard for the > >> docstrings first. > > > > It should be made as visible as possible in my opinion. A reST warning in > > between summary and extended summary would work. It should clearly state > in > > which version it will be removed. Best to keep the text identical to the > one > > passed to the deprecate decorator. A reason or alternative should be > given > > as well. > > I would prefer to see it at the very top. > If there is an easily available alternative why would I as a user not > what to immediately view that alternative? > If I am already using it then it is a good remider. Why put it after > the summary? > > Vincent > > > .. warning:: > > `myfunc` is deprecated and will be removed in SciPy 0.9. Look at > > `thatfunc` for equivalent functionality. > > > > Cheers, > > Ralf > > _______________________________________________ > > SciPy-Dev mailing list > > SciPy-Dev at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Jun 3 13:53:08 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 3 Jun 2010 13:53:08 -0400 Subject: [SciPy-Dev] Deprecate stats.glm? 
In-Reply-To: <4C07E304.8040503@gmail.com> References: <4C07A52D.30503@enthought.com> <4C07E304.8040503@gmail.com> Message-ID: On Thu, Jun 3, 2010 at 1:14 PM, Bruce Southey wrote: > On 06/03/2010 10:32 AM, Nathaniel Smith wrote: > > On Thu, Jun 3, 2010 at 6:38 AM, wrote: > > > On Thu, Jun 3, 2010 at 8:50 AM, Warren Weckesser > wrote: > > > stats.glm looks like it was started and then abandoned without being > finished. ?It was last touched in November 2007. ?Should this function > be deprecated so it can eventually be removed? > > > My thoughts when I looked at it was roughly: > leave it alone since it's working, but don't "advertise" it because we > should get a better replacement. > similar to linregress the more general version will be available when > scipy.stats gets the full OLS model. > > > Wait, what does 'glm' have to do with OLS (or t-tests) anyway? Surely > if anything it *should* be a function that fits, you know, GLMs > (generalized linear models)? > > I guess this is a vote for removing it, because GLMs are one of the > fundamental stats models that people will look for, and having some > weird, broken, other thing in the obvious place is just confusing and > looks really bad. > > -- Nathaniel > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > Perhaps people should actually read the code before jumping to incorrect > conclusions. It is not similar to linregress unless you know how to 'trick' > linreg. It's similar in the sense that it promises a lot, but is very limited or "crippled", and that the replacement is not just a quick rewrite. > > Granted that stats.glm is a crippled but it is well intended (like most > things in scipy.stats). The docstring intended it to general linear models > such as SAS's glm procedure and R's glm function (without generalized part). > At present is just does 1-way anova with only two levels but could do more. > >>>> drug=[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, >>>> 2, 2, 2, 2, 2, 2, 2, 2] >>>> postrt=[6, 0, 2, 8, 11, 4, 13, 1, 8, 0, 0, 2, 3, 1, 18, 4, 14, 9, 1, 9, >>>> 13, 10, 18, 5, 23, 12, 5, 16, 1, 20] >>>> t_val,t_probs=stats.glm(postrt,drug) >>>> t_val > -1.5463854661015379 >>>> t_probs > 0.13324062984741347 >>>> idrug=[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, >>>> 0, 0, 0, 0, 0, 0, 0, 0] #create dummies to trick linreg >>>> print stats.linregress(idrug, postrt) > (-3.9000000000000044, 9.2000000000000011, -0.280506586484015, > 0.13324062984741378, 2.5220102526131258) >>>> -3.9000000000000044/2.5220102526131258 #this is the t-value of stats.glm > -1.5463854661015373 > > > I have major concerns about depreciating code when there is no alternative > proposed for such an important statistical function. As David has said > elsewhere, this is just Python code and has little or no maintenance cost. > The full solution is probably Jonathan Taylor's glm class but that uses the > formula class and is for generalized linear models. However, I don't see > that in scipy anywhere soon. Currently the alternative is using ttest_ind, which produces the same result. The cost of glm is the confusion that it creates if there is such a big mismatch between name and result, which is exactly the response Nathaniel and I had. And Warren was proposing to deprecate it not to delete it right away. 
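For what it's worth, deprecating rather than deleting would just mean a warning shim, something along the lines of the sketch below (the argument names are guessed from the traceback earlier in the thread, and this is not scipy's actual deprecation machinery):

import warnings
from scipy import stats

def glm(data, para):
    """Two-sample t-test on dummy-coded groups (all the old function did).

    .. deprecated:: 0.8
        `glm` is deprecated and will be removed in SciPy 0.9;
        use `scipy.stats.ttest_ind` instead.
    """
    warnings.warn("stats.glm is deprecated; use stats.ttest_ind instead",
                  DeprecationWarning, stacklevel=2)
    groups = sorted(set(para))
    if len(groups) != 2:
        raise ValueError("glm only ever handled two groups")
    a = [d for d, g in zip(data, para) if g == groups[0]]
    b = [d for d, g in zip(data, para) if g == groups[1]]
    # sign conventions may differ from the old function
    return stats.ttest_ind(a, b)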
> > So the options are: > > 1) Rewrite the internals to fix address the current limitation - not hard > but would need an API change and more importantly better options exist. > 2) OLS is a superior version to linregress but needs changes to get ANOVA > etc added > http://www.scipy.org/Cookbook/OLS > 3) The best candidate that I know that can replace both stats.linregress and > stats.glm is Skipper's try_ols_anova.py code from pystatsmodel (at least > posted on the list).? But I am not sure what the current state of that is. > 4) Some other option? Yes, move the OLS model and associated code from statsmodels to scipy.stats (maybe we can discuss this after Skipper's gsoc), or use statsmodels as addition to scipy.stats. http://bazaar.launchpad.net/~scipystats/statsmodels/trunk/annotate/head%3A/scikits/statsmodels/sandbox/regression/try_ols_anova.py was just my initial experimental script, and I think we might still need a few versions (with Skipper's data and dummy handling and maybe Jonathan's formula framework) before we come to a final design. I don't think any duplication of effort to expand on stats.linregress or stats.glm is productive. Josef > > > Bruce > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From d.l.goldsmith at gmail.com Thu Jun 3 14:18:05 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 3 Jun 2010 11:18:05 -0700 Subject: [SciPy-Dev] Warning of deprecation in doc's ? In-Reply-To: References: <4C07B72E.2050504@gmail.com> Message-ID: http://projects.scipy.org/numpy/ticket/1501 Filed Description: "Presently, the docstring standard does not specify how to note that an object is to be deprecated; it has been proposed that this needs to be rectified. "Obviously, this should be an optional section in general, but required for objects once it is decided that they are to be deprecated. "Discussion on scipy-dev agreed that this section should be at or near the top, but at the top or between the One-line and Extended Summaries have both been proposed - we will try to reach a consensus [in the ticket comments]. "Proposed format is to utilize Sphinx' .. deprecated:: directive; someone please provide a concrete example of what this looks like (for example, does this directive support multi-line content, and if so, what does that look like). "Proposed content: summaries of deprecation schedule (in version number time, not real time) and justification for deprecation (e.g., being replaced, duplicates extant functionality elsewhere); existing alternatives to obtain the same functionality. (Feel strongly that it should contain something else? Add it below as a comment.) "IMO, we should try to decide on this and update the standard by June 15 at the latest. "Have I forgotten anything" DG On Thu, Jun 3, 2010 at 10:49 AM, David Goldsmith wrote: > OK, we're getting enough proposed content here that I think a formal > modification of the docstring Standard is warranted; accordingly, I'm going > to file a ticket. I'll post the link here and then if you want to be on the > notification-of-ticket-changes list you can go there and add yourself. That > way, this discussion of where this entry should live, what it should > contain, how it should be formatted, etc., etc., will be in a more > appropriate, easier to find place. Back shortly. 
> > DG > > > On Thu, Jun 3, 2010 at 9:58 AM, Vincent Davis wrote: > >> On Thu, Jun 3, 2010 at 8:43 AM, Ralf Gommers >> wrote: >> > >> > >> > On Thu, Jun 3, 2010 at 10:15 PM, wrote: >> >> >> >> On Thu, Jun 3, 2010 at 10:07 AM, Bruce Southey >> wrote: >> >> > On 06/02/2010 10:28 PM, Benjamin Root wrote: >> >> >> >> > >> >> > Just a thought... is it feasible for the doc building system to scan >> >> > through >> >> > the function code and spot a deprecation warning and thereby be able >> to >> >> > add >> >> > a list of deprecation warnings to the docstring? Obviously, such >> >> > warnings >> >> > would have to follow some standard format, but it would be neat if >> such >> >> > things could be automated. >> > >> > There's enough docstring manipulation going on already I think, this is >> not >> > that much work so manual would be better. It should be put in at the >> moment >> > the deprecation takes place. >> > >> >> >> >> In the future, someone will have to come up with a rule to force >> >> documentation change when a depreciation event occurs and then enforce >> it. >> >> In fact, for numpy (as scipy does not yet have the same policy) the >> >> desired >> >> documentation changes should be added to: >> >> http://projects.scipy.org/numpy/wiki/ApiDeprecation >> > >> >> I have never seen any guidelines or rules to add Deprecation Warnings >> >> into the docstrings. It would be good to define a standard for the >> >> docstrings first. >> > >> > It should be made as visible as possible in my opinion. A reST warning >> in >> > between summary and extended summary would work. It should clearly state >> in >> > which version it will be removed. Best to keep the text identical to the >> one >> > passed to the deprecate decorator. A reason or alternative should be >> given >> > as well. >> >> I would prefer to see it at the very top. >> If there is an easily available alternative why would I as a user not >> what to immediately view that alternative? >> If I am already using it then it is a good remider. Why put it after >> the summary? >> >> Vincent >> >> > .. warning:: >> > `myfunc` is deprecated and will be removed in SciPy 0.9. Look at >> > `thatfunc` for equivalent functionality. >> > >> > Cheers, >> > Ralf >> > _______________________________________________ >> > SciPy-Dev mailing list >> > SciPy-Dev at scipy.org >> > http://mail.scipy.org/mailman/listinfo/scipy-dev >> > >> > >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > > > > -- > Mathematician: noun, someone who disavows certainty when their uncertainty > set is non-empty, even if that set has measure zero. > > Hope: noun, that delusive spirit which escaped Pandora's jar and, with her > lies, prevents mankind from committing a general suicide. (As interpreted > by Robert Graves) > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From bsouthey at gmail.com Thu Jun 3 15:15:10 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Thu, 03 Jun 2010 14:15:10 -0500 Subject: [SciPy-Dev] Deprecate stats.glm? 
In-Reply-To: References: <4C07A52D.30503@enthought.com> <4C07E304.8040503@gmail.com> Message-ID: <4C07FF3E.2020505@gmail.com> On 06/03/2010 12:53 PM, josef.pktd at gmail.com wrote: > On Thu, Jun 3, 2010 at 1:14 PM, Bruce Southey wrote: > >> On 06/03/2010 10:32 AM, Nathaniel Smith wrote: >> >> On Thu, Jun 3, 2010 at 6:38 AM, wrote: >> >> >> On Thu, Jun 3, 2010 at 8:50 AM, Warren Weckesser >> wrote: >> >> >> stats.glm looks like it was started and then abandoned without being >> finished. It was last touched in November 2007. Should this function >> be deprecated so it can eventually be removed? >> >> >> My thoughts when I looked at it was roughly: >> leave it alone since it's working, but don't "advertise" it because we >> should get a better replacement. >> similar to linregress the more general version will be available when >> scipy.stats gets the full OLS model. >> >> >> Wait, what does 'glm' have to do with OLS (or t-tests) anyway? Surely >> if anything it *should* be a function that fits, you know, GLMs >> (generalized linear models)? >> >> I guess this is a vote for removing it, because GLMs are one of the >> fundamental stats models that people will look for, and having some >> weird, broken, other thing in the obvious place is just confusing and >> looks really bad. >> >> -- Nathaniel >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> Perhaps people should actually read the code before jumping to incorrect >> conclusions. It is not similar to linregress unless you know how to 'trick' >> linreg. >> > It's similar in the sense that it promises a lot, but is very limited > or "crippled", and that the replacement is not just a quick rewrite. > > >> Granted that stats.glm is a crippled but it is well intended (like most >> things in scipy.stats). The docstring intended it to general linear models >> such as SAS's glm procedure and R's glm function (without generalized part). >> At present is just does 1-way anova with only two levels but could do more. >> >> >>>>> drug=[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, >>>>> 2, 2, 2, 2, 2, 2, 2, 2] >>>>> postrt=[6, 0, 2, 8, 11, 4, 13, 1, 8, 0, 0, 2, 3, 1, 18, 4, 14, 9, 1, 9, >>>>> 13, 10, 18, 5, 23, 12, 5, 16, 1, 20] >>>>> t_val,t_probs=stats.glm(postrt,drug) >>>>> t_val >>>>> >> -1.5463854661015379 >> >>>>> t_probs >>>>> >> 0.13324062984741347 >> >>>>> idrug=[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, >>>>> 0, 0, 0, 0, 0, 0, 0, 0] #create dummies to trick linreg >>>>> print stats.linregress(idrug, postrt) >>>>> >> (-3.9000000000000044, 9.2000000000000011, -0.280506586484015, >> 0.13324062984741378, 2.5220102526131258) >> >>>>> -3.9000000000000044/2.5220102526131258 #this is the t-value of stats.glm >>>>> >> -1.5463854661015373 >> >> >> I have major concerns about depreciating code when there is no alternative >> proposed for such an important statistical function. As David has said >> elsewhere, this is just Python code and has little or no maintenance cost. >> The full solution is probably Jonathan Taylor's glm class but that uses the >> formula class and is for generalized linear models. However, I don't see >> that in scipy anywhere soon. >> > Currently the alternative is using ttest_ind, which produces the same result. > Not exactly since you have to reformat the input. Also you can do ttest_ind with linregress... 
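For example, on made-up data:

import numpy as np
from scipy import stats

x = (np.arange(20) > 9).astype(int)      # 0/1 dummy marking the two groups
y = x + np.random.randn(20)

slope, intercept, r, p, stderr = stats.linregress(x, y)
print(slope / stderr, p)                        # t-statistic of the slope and its p-value
print(stats.ttest_ind(y[x == 0], y[x == 1]))    # same p-value; t agrees up to sign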
> The cost of glm is the confusion that it creates if there is such a > big mismatch between name and result, which is exactly the response > Nathaniel and I had. > Generalized linear models is 'new' (so 1972) but general linear models is older (I think back to the 1950's when it was shown the relationship between ANOVA and regression). Yet both got back to the 1800's. But sure anyone is going to get confused if they come from the S/R world and don't check to see if the function at least has distribution and link arguments/options. > And Warren was proposing to deprecate it not to delete it right away. > > >> So the options are: >> >> 1) Rewrite the internals to fix address the current limitation - not hard >> but would need an API change and more importantly better options exist. >> 2) OLS is a superior version to linregress but needs changes to get ANOVA >> etc added >> http://www.scipy.org/Cookbook/OLS >> 3) The best candidate that I know that can replace both stats.linregress and >> stats.glm is Skipper's try_ols_anova.py code from pystatsmodel (at least >> posted on the list). But I am not sure what the current state of that is. >> 4) Some other option? >> > Yes, move the OLS model and associated code from statsmodels to > scipy.stats (maybe we can discuss this after Skipper's gsoc), or use > statsmodels as addition to scipy.stats. > > http://bazaar.launchpad.net/~scipystats/statsmodels/trunk/annotate/head%3A/scikits/statsmodels/sandbox/regression/try_ols_anova.py > was just my initial experimental script, Sorry - I just recalled his script but not the history. > and I think we might still > need a few versions (with Skipper's data and dummy handling and maybe > Jonathan's formula framework) before we come to a final design. > > I don't think any duplication of effort to expand on stats.linregress > or stats.glm is productive. > > Josef > > I totally agree as adding that at the same time justifies depreciation of both functions. Bruce >> >> Bruce >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From bsouthey at gmail.com Thu Jun 3 15:55:07 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Thu, 03 Jun 2010 14:55:07 -0500 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: References: <4C07A52D.30503@enthought.com> Message-ID: <4C08089B.8040601@gmail.com> On 06/03/2010 11:16 AM, Nathaniel Smith wrote: > On Thu, Jun 3, 2010 at 8:53 AM, wrote: > >> GLM as in general linear model not generalized. (It's the worst >> conflicting acronym in stats). >> > Sure, and lets not even talk about generalized least squares > (unrelated to both!). > > But the general linear model is basically identical to a simple linear > model, both in interface and implementation. Depends what you mean by 'simple'. Stealing from the SAS manual, these are some of the models fitted by the GLM procedure which I would not call simple: simple regression multiple regression analysis of variance (ANOVA), especially for unbalanced data analysis of covariance response surface models weighted regression polynomial regression partial correlation multivariate analysis of variance (MANOVA) repeated measures analysis of variance These include interactions... 
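(For reference, the one-way ANOVA entry in that list reduces to dummy coding plus least squares. A sketch on made-up data with three groups, checked against stats.f_oneway; illustration only, not library code:)

import numpy as np
from scipy import stats

# made-up data: three groups, one-way ANOVA written as a linear model on dummies
labels = np.repeat([0, 1, 2], 8)
y = labels + np.random.randn(24)

n = len(y)
levels, codes = np.unique(labels, return_inverse=True)
k = len(levels)

# full design: intercept plus k-1 dummies; restricted design: intercept only
dummies = (codes[:, None] == np.arange(1, k)).astype(float)
X_full = np.column_stack([np.ones(n), dummies])
X_restr = np.ones((n, 1))

def rss(X, y):
    # residual sum of squares of a least-squares fit
    beta = np.linalg.lstsq(X, y)[0]
    resid = y - np.dot(X, beta)
    return np.dot(resid, resid)

F = ((rss(X_restr, y) - rss(X_full, y)) / (k - 1)) / (rss(X_full, y) / (n - k))
p = stats.f.sf(F, k - 1, n - k)

print(F, p)
print(stats.f_oneway(*[y[codes == i] for i in range(k)]))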
> There's no reason to have > a separate function for it, one should just accept a matrix for the > "y" variable in the OLS code. But *generalized* linear models are > different in interface, implementation, and are almost as much of a > stats workhorse as standard linear models. So every book I've ever > seen uses the abbreviation "glm" to refer to the generalized version. > (Also, this is what R calls the function ;-).) > Yeah, it is interesting that you forget older statistical packages (SAS, SPSS, don't remember what Genstat did ) and the first GLIM (the first? generalized linear model package). > The implementation of dummy coding is kind of useful, but this is the > wrong place and the wrong name... > Why? That is exactly what is needed and what stats.glm does. > (Also, its least squares implementation calls inv -- the textbook > example of bad numerics!) > Actually it should call pinv() here but you going to have to prove that this is 'bad numerics'! Especially given how the numpy computes it and that design matrices tend to have poor numerics to start with (especially if you do anova and use condition number to assess numerics). [I strong dislike people complaining of the apparent bad numerics just because they see the word inverse.] > ...Okay, you know all that anyway, the question is what to do with it. > If the problem were just that it needed a better implementation and > some new features added, then maybe we would keep it and let it be > improved incrementally. But the interface is just wrong, so we'll be > removing it sooner or later, and it might as well be sooner, rather > than prolong the agony. > > -- Nathaniel > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > The simple reason is that there is no alternative for users to use yet such as pystatsmodels. Bruce From ljosa at broad.mit.edu Thu Jun 3 16:15:30 2010 From: ljosa at broad.mit.edu (Vebjorn Ljosa) Date: Thu, 3 Jun 2010 16:15:30 -0400 Subject: [SciPy-Dev] License for parts of CellProfiler changed to BSD to allow incorporation into SciPy Message-ID: We have changed the license of some parts of CellProfiler from GNU GPL to BSD. It has previously been proposed [1] that some of the image processing code in CellProfiler be merged into SciPy, and the license change makes this possible. The CellProfiler SVN repository is at https://svn.broadinstitute.org/CellProfiler/trunk/CellProfiler/. The file LICENSE [2] contains a list of BSD-licensed subdirectories as well as other license details. The rest of CellProfiler continues to be licensed under the GNU GPL. The BSD-licensed subdirectories are: * CellProfiler/cpmath [3]: image processing algorithms * CellProfiler/utilities [4]: contains a Java bridge, making it possible to call Java functions from Python * bioformats [5]: wrapper that uses the Java bridge to have Bioformats [6] read or write an image file Good luck with the upcoming scikits.image sprint. I don't think anyone from the CellProfiler team will be able to take part in the sprint this time, but don't hesitate to ask on the cellprofiler-dev at broadinstitute.org mailing list. 
Thanks, Vebjorn [1] http://stefanv.github.com/scikits.image/contribute.html [2] https://svn.broadinstitute.org/CellProfiler/trunk/CellProfiler/LICENSE [3] https://svn.broadinstitute.org/CellProfiler/trunk/CellProfiler/cellprofiler/cpmath/ [4] https://svn.broadinstitute.org/CellProfiler/trunk/CellProfiler/cellprofiler/utilities/ [5] https://svn.broadinstitute.org/CellProfiler/trunk/CellProfiler/bioformats/ [6] http://www.loci.wisc.edu/software/bio-formats -- Vebjorn Ljosa, PhD Computational Biologist Broad Institute of MIT and Harvard From stefan at sun.ac.za Thu Jun 3 16:24:04 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Thu, 3 Jun 2010 13:24:04 -0700 Subject: [SciPy-Dev] License for parts of CellProfiler changed to BSD to allow incorporation into SciPy In-Reply-To: References: Message-ID: Vebjorn, 2010/6/3 Vebjorn Ljosa : > We have changed the license of some parts of CellProfiler from GNU GPL > to BSD. ?It has previously been proposed [1] that some of the image > processing code in CellProfiler be merged into SciPy, and the license > change makes this possible. Thanks a lot for your effort, and for this highly anticipated outcome! At SciPy2010, the scikits.image team will make a concerted effort to include many of these algorithms into our code-base. Kind regards St?fan From njs at pobox.com Thu Jun 3 17:20:40 2010 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 3 Jun 2010 14:20:40 -0700 Subject: [SciPy-Dev] Deprecate stats.glm? In-Reply-To: <4C08089B.8040601@gmail.com> References: <4C07A52D.30503@enthought.com> <4C08089B.8040601@gmail.com> Message-ID: On Thu, Jun 3, 2010 at 12:55 PM, Bruce Southey wrote: > On 06/03/2010 11:16 AM, Nathaniel Smith wrote: >> But the general linear model is basically identical to a simple linear >> model, both in interface and implementation. > Depends what you mean by 'simple'. Stealing from the SAS manual, these > are some of the models fitted by the GLM procedure which I would not > call simple: > simple regression > multiple regression > analysis of variance (ANOVA), especially for unbalanced data > analysis of covariance > multivariate analysis of variance (MANOVA) > weighted regression > polynomial regression Well, I didn't mean to start an argument; certainly 'simple' is underdefined, and there's a lot of conceptual richness to the linear model framework. (And perhaps some obfuscation from the historical tendency to use different names for mathematically equivalent ideas when used in different contexts.) But at the implementation level, everything in the above list is (1) solved in about 2 lines of code, (2) they're the same 2 lines of code for all of them. Making a friendly interface is more complicated than that, of course, but that's just more reason to do it once for all of them, instead of piece-meal. > response surface models > partial correlation These I'm not sure about off-hand. > repeated measures analysis of variance And this is a very complicated area; I know of at least 3 totally different approaches (traditional repeated measures ANOVA with or without sphericity corrections, MANOVAs on contrasts, and multi-level mixed-effect modelling), and none is very simple. I assume SAS has picked one to implement (probably the first?). But these issues are totally orthogonal to whether a "linear model" is "general" (in fact, I don't know how to apply *any* of these techniques to *general* linear models, i.e., multivariate ones, and would very much appreciate references if you have them!). 
So I don't see how this argues for treating "general linear models" separately from "linear models". > These include interactions... >> ? There's no reason to have >> a separate function for it, one should just accept a matrix for the >> "y" variable in the OLS code. But *generalized* linear models are >> different in interface, implementation, and are almost as much of a >> stats workhorse as standard linear models. So every book I've ever >> seen uses the abbreviation "glm" to refer to the generalized version. >> (Also, this is what R calls the function ;-).) >> > Yeah, it is interesting that you forget older statistical packages (SAS, > SPSS, don't remember what Genstat did ) and the first GLIM (the first? > generalized linear model package). I didn't forget them; I've just never used them. Can I also mention that I'm finding your tone quite combative and off-putting? If I've offended you somehow then I apologize, and would appreciate hearing why. If those packages have useful ideas, then I'm interested in hearing them, but just hearing the list of names unfortunately doesn't give me much to go on. >> The implementation of dummy coding is kind of useful, but this is the >> wrong place and the wrong name... >> > Why? > That is exactly what is needed and what stats.glm does. I'm sorry, I don't know how to explain better. My statement is that dummy coding is (1) useful, (2) neither called "glm" in any context, nor in any way specific to the general linear model. Do you disagree with any of this...? >> (Also, its least squares implementation calls inv -- the textbook >> example of bad numerics!) >> > Actually it should call pinv() here but you going to have to prove that > this is 'bad numerics'! Especially given how the numpy computes it and > that design matrices tend to have poor numerics to start with > (especially if you do anova and use condition number to assess > numerics). [I strong dislike people complaining of the apparent bad > numerics just because they see the word inverse.] Not sure I follow here either. If design matrices have poor numerics to start with, then that's exactly the case where forming the inverse is *bad*! If not, then it doesn't make much difference either way, but since it's no more effort to write code that is both faster and more robust, doing otherwise is just irresponsible in a widely-used library. But in any case, this was a side point. >> ...Okay, you know all that anyway, the question is what to do with it. >> If the problem were just that it needed a better implementation and >> some new features added, then maybe we would keep it and let it be >> improved incrementally. But the interface is just wrong, so we'll be >> removing it sooner or later, and it might as well be sooner, rather >> than prolong the agony. > > The simple reason is that there is no alternative for users to use yet > such as pystatsmodels. Well, and this is a philosophical difference, I guess. Personally, as a user, if given the choice between a stats library that was missing many things, but everything there was well-engineered, reliable, documented, etc., versus one that technically had more code but half the things I started to use turned out to be broken, or do something similar-but-different from what I expected, or just weren't documented, then I would choose the first library, no question. And I'd be more likely to contribute to make it more complete, too. It's just easier to work in an area that's not cluttered with broken machinery. 
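(On the numerics aside: the point is only that the normal-equations-plus-inv route squares the condition number of the design, while lstsq solves the original problem through an SVD. A sketch with a deliberately ill-conditioned, made-up design; no specific numbers claimed, just run it and compare:)

import numpy as np

# deliberately ill-conditioned design: a high-order polynomial basis
n = 50
t = np.linspace(0.0, 1.0, n)
X = np.column_stack([t ** i for i in range(8)])
beta_true = np.ones(8)
y = np.dot(X, beta_true)

# textbook-but-fragile route: explicitly invert the normal equations
xtx = np.dot(X.T, X)                      # cond(X'X) is roughly cond(X)**2
beta_inv = np.dot(np.linalg.inv(xtx), np.dot(X.T, y))

# preferred route: an SVD-based solve of the original least-squares problem
beta_lstsq = np.linalg.lstsq(X, y)[0]

print(np.linalg.cond(X), np.linalg.cond(xtx))
print(np.abs(beta_inv - beta_true).max())      # error from the inv route
print(np.abs(beta_lstsq - beta_true).max())    # error from lstsq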
"Add missing stuff following existing style" is a much easier goal to work on than "pick your way through the rubble to find usable pieces and cobble stuff out of them". But that's just me; I can see your perspective too, and don't know the SciPy community's preference. -- Nathaniel From ilanschnell at gmail.com Thu Jun 3 19:28:05 2010 From: ilanschnell at gmail.com (Ilan Schnell) Date: Thu, 3 Jun 2010 18:28:05 -0500 Subject: [SciPy-Dev] import error in scipy.stats on RH3 32-bit In-Reply-To: References: Message-ID: I've just found the problem, and it had nothing to do with checkins that were being made to the 0.8.x branch. After spending may hours on this problem, I'm now very happy that I found the problem. It tured out that the machine I use to build scipy on RH3 32-bit had it's clock set in the past. So the new brach source tarball I made (which is being used on all the build machines), had timestamps which appeared to be in the future on that machine. I'm not sure why exactly timestamps in the future can cause problems when building scipy, but after setting the clock on the machine things work fine now. So it was only by accident this happend on RH3 32-bit, it might have been any other system as well. - Ilan On Wed, Jun 2, 2010 at 10:29 PM, Ilan Schnell wrote: > Not yet. ?I'll look more into it tomorrow. ?:-) > > - Ilan > > On Wed, Jun 2, 2010 at 10:19 PM, Charles R Harris > wrote: >> >> >> On Wed, Jun 2, 2010 at 9:03 PM, Ilan Schnell wrote: >>> >>> Hello Chuck, >>> yes 6446 works. ?Actually, as the error indicates, the unresolved >>> symbol in is linalg/clapack.so, it just happened that during my >>> testing the stats package was imported first, so I initially thought >>> the error was there. >>> However, something has changed between 6446 and 6476, as >>> I wasn't seeing this error before. ?Looking at the revision log of >>> the 0.8.x branch, but I cannot see any obvious. ?And I'm also >>> puzzled why this only happens on one particular platform. >>> To make sure the build environment hasn't changed, I rebuild 6446 >>> on the same system, and it still works. >>> >> >> I hate to ask this of anyone, but... could you determine which revision >> caused the problem? >> >> Sadistical Chuck >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> > From d.l.goldsmith at gmail.com Thu Jun 3 22:12:11 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 3 Jun 2010 19:12:11 -0700 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... Message-ID: ...everywhere it occurs? DG -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsseabold at gmail.com Thu Jun 3 23:52:10 2010 From: jsseabold at gmail.com (Skipper Seabold) Date: Thu, 3 Jun 2010 23:52:10 -0400 Subject: [SciPy-Dev] Building docs in scipy? Message-ID: Should numpy/doc/sphinxext be distributed with scipy/doc/ or is this user error? I couldn't get the scipy docs to build until I copied it over. Skipper -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From d.l.goldsmith at gmail.com Fri Jun 4 01:04:21 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 3 Jun 2010 22:04:21 -0700 Subject: [SciPy-Dev] Marathon Skypecon tomorrow? Message-ID: Email me your Skype ID if you want to participate tomorrow, noon EDT. If no one emails me, I'll post a cancellation notice around 11:50 am EDT. DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From klrkdekira at gmail.com Fri Jun 4 01:40:41 2010 From: klrkdekira at gmail.com (CL Chow) Date: Fri, 4 Jun 2010 13:40:41 +0800 Subject: [SciPy-Dev] Marathon Skypecon tomorrow? In-Reply-To: References: Message-ID: My Skype ID is klrk_c You can ignore mine if no one else emails you, because I'll only be there as audience. Regards, CL Chow "Please do not send me Microsoft Office/Apple iWork documents. Send OpenDocument instead! http://fsf.org/campaigns/opendocument/" On Fri, Jun 4, 2010 at 1:04 PM, David Goldsmith wrote: > Email me your Skype ID if you want to participate tomorrow, noon EDT. If > no one emails me, I'll post a cancellation notice around 11:50 am EDT. > > DG > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Fri Jun 4 02:01:18 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 3 Jun 2010 23:01:18 -0700 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... In-Reply-To: References: Message-ID: Hi, On Thu, Jun 3, 2010 at 7:12 PM, David Goldsmith wrote: > ...everywhere it occurs? I seem to remember I did put TMs all over the place in the matlab reader code. I did a brief scan of: http://en.wikipedia.org/wiki/Trademark http://en.wikipedia.org/wiki/Wikipedia:Manual_of_Style_%28trademarks%29 I see that the wikipedia style guide says (above) not to use TM etc. I guess the principle is that, when we use the term 'matlab' it should always be clear we are referring to the product made by Mathworks. I don't think adding TM will have much impact on that and it looks a bit goofy. My vote would be to remove all the TMs, and maybe add a couple of footnotes in sensible places with 'matlab is a trademark of Mathworks'. Best, Matthew From david.kirkby at onetel.net Fri Jun 4 03:03:33 2010 From: david.kirkby at onetel.net (Dr. David Kirkby) Date: Fri, 04 Jun 2010 08:03:33 +0100 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... In-Reply-To: References: Message-ID: <4C08A545.1040604@onetel.net> On 06/ 4/10 07:01 AM, Matthew Brett wrote: > Hi, > > On Thu, Jun 3, 2010 at 7:12 PM, David Goldsmith wrote: >> ...everywhere it occurs? > > I seem to remember I did put TMs all over the place in the matlab reader code. > IMHO it should be called MATLAB and not Matlab since that is what Mathworks call it. As for the TM, I tend to agree, it is pretty irrelevant, though I am not a lawyer. It might be better from a legal point to leave them there. Just my 2p Dave From matthew.brett at gmail.com Fri Jun 4 03:07:52 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Fri, 4 Jun 2010 00:07:52 -0700 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... In-Reply-To: <4C08A545.1040604@onetel.net> References: <4C08A545.1040604@onetel.net> Message-ID: Hi, > IMHO it should be called MATLAB and not Matlab since that is what Mathworks call > it. 
>From the wikipedia style guide, I don't think we're obliged to capitalize the way the Mathworks would like, and we can choose whatever reads better. > As for the TM, I tend to agree, it is pretty irrelevant, though I am not a > lawyer. It might be better from a legal point to leave them there. I am not a lawyer either, but it looks as though the key principle is fair use. Fair use means - in our case - that when we say 'matlab' - we mean the Matlab program written by the Mathworks. If that's obvious from the context, I don't think we need the TM, and if it isn't, I don't think the TM helps much (whose TM?). I might be wrong though, Cheers, Matthew From d.l.goldsmith at gmail.com Fri Jun 4 03:18:52 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Fri, 4 Jun 2010 00:18:52 -0700 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... In-Reply-To: References: <4C08A545.1040604@onetel.net> Message-ID: Thanks, guys. Not that your opinions aren't valuable, but it is a matter of legality that I'm concerned about, and we do have people on-list who seem to make it their business to worry about these things, so hopefully one of them will chime in as well. DG On Fri, Jun 4, 2010 at 12:07 AM, Matthew Brett wrote: > Hi, > > > IMHO it should be called MATLAB and not Matlab since that is what > Mathworks call > > it. > > >From the wikipedia style guide, I don't think we're obliged to > capitalize the way the Mathworks would like, and we can choose > whatever reads better. > > > As for the TM, I tend to agree, it is pretty irrelevant, though I am not > a > > lawyer. It might be better from a legal point to leave them there. > > I am not a lawyer either, but it looks as though the key principle is > fair use. Fair use means - in our case - that when we say 'matlab' - > we mean the Matlab program written by the Mathworks. If that's > obvious from the context, I don't think we need the TM, and if it > isn't, I don't think the TM helps much (whose TM?). I might be wrong > though, > > Cheers, > > Matthew > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Fri Jun 4 03:56:14 2010 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 4 Jun 2010 07:56:14 +0000 (UTC) Subject: [SciPy-Dev] Building docs in scipy? References: Message-ID: Thu, 03 Jun 2010 23:52:10 -0400, Skipper Seabold wrote: > Should numpy/doc/sphinxext be distributed with scipy/doc/ or is this > user error? I couldn't get the scipy docs to build until I copied it > over. It's pulled in by svn:externals. With git, ymmv. It probably should be included in the distribution tarballs, nevertheless. -- Pauli Virtanen From david.kirkby at onetel.net Fri Jun 4 04:41:42 2010 From: david.kirkby at onetel.net (Dr. David Kirkby) Date: Fri, 04 Jun 2010 09:41:42 +0100 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... In-Reply-To: References: <4C08A545.1040604@onetel.net> Message-ID: <4C08BC46.4030006@onetel.net> On 06/ 4/10 08:07 AM, Matthew Brett wrote: > Hi, > >> IMHO it should be called MATLAB and not Matlab since that is what Mathworks call >> it. 
> >> From the wikipedia style guide, I don't think we're obliged to > capitalize the way the Mathworks would like, and we can choose > whatever reads better. If someone came along and changed SciPy to scipy, would you feel it appropriate to change it back? I suspect "yes" is the answer. As such, why not respect Mathworks and write MATLAB the prefer to write it? BTW, I don't even use MATLAB, so I'm not a MATLAB employee or similar! Dave From matthew.brett at gmail.com Fri Jun 4 06:30:36 2010 From: matthew.brett at gmail.com (Matthew Brett) Date: Fri, 4 Jun 2010 11:30:36 +0100 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... In-Reply-To: References: <4C08A545.1040604@onetel.net> Message-ID: Hi, On Fri, Jun 4, 2010 at 8:18 AM, David Goldsmith wrote: > Thanks, guys.? Not that your opinions aren't valuable, but it is a matter of > legality that I'm concerned about, and we do have people on-list who seem to > make it their business to worry about these things, so hopefully one of them > will chime in as well. ;) - ah yes - it is an art that can take an age to learn, to distinguish signal from noise ! Matthew From pav at iki.fi Fri Jun 4 06:53:23 2010 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 4 Jun 2010 10:53:23 +0000 (UTC) Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... References: <4C08A545.1040604@onetel.net> <4C08BC46.4030006@onetel.net> Message-ID: Fri, 04 Jun 2010 09:41:42 +0100, Dr. David Kirkby wrote: > On 06/ 4/10 08:07 AM, Matthew Brett wrote: >> Hi, >> >>> IMHO it should be called MATLAB and not Matlab since that is what >>> Mathworks call it. >> >>> From the wikipedia style guide, I don't think we're obliged to >> capitalize the way the Mathworks would like, and we can choose whatever >> reads better. > > If someone came along and changed SciPy to scipy, would you feel it > appropriate to change it back? I suspect "yes" is the answer. As such, > why not respect Mathworks and write MATLAB the prefer to write it? If Mathworks decided to refer to SciPy as SCIPY or Scipy or scipy, I would hardly be inclined to correct them, much less raise a lawsuit. The point is that as long as with "Matlab" we are referring to the "MATLAB" produced by Mathworks, not a hypotethical product of our own called "Matlab" or "scipy.matlab", this cannot be a trademark infringement. IANAL, of course, but it seems clear that the exact spelling is hardly an issue of concern. IMHO English proper name capitalization trumps the "official" spelling, but this is not very important. -- Pauli Virtanen From josef.pktd at gmail.com Fri Jun 4 07:21:50 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 4 Jun 2010 07:21:50 -0400 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... In-Reply-To: References: <4C08A545.1040604@onetel.net> <4C08BC46.4030006@onetel.net> Message-ID: On Fri, Jun 4, 2010 at 6:53 AM, Pauli Virtanen wrote: > Fri, 04 Jun 2010 09:41:42 +0100, Dr. David Kirkby wrote: > >> On 06/ 4/10 08:07 AM, Matthew Brett wrote: >>> Hi, >>> >>>> IMHO it should be called MATLAB and not Matlab since that is what >>>> Mathworks call it. >>> >>>> From the wikipedia style guide, I don't think we're obliged to >>> capitalize the way the Mathworks would like, and we can choose whatever >>> reads better. >> >> If someone came along and changed SciPy to scipy, would you feel it >> appropriate to change it back? I suspect "yes" is the answer. As such, >> why not respect Mathworks and write MATLAB the prefer to write it? 
> > If Mathworks decided to refer to SciPy as SCIPY or Scipy or scipy, I > would hardly be inclined to correct them, much less raise a lawsuit. > > The point is that as long as with "Matlab" we are referring to the > "MATLAB" produced by Mathworks, not a hypotethical product of our own > called "Matlab" or "scipy.matlab", this cannot be a trademark > infringement. IANAL, of course, but it seems clear that the exact > spelling is hardly an issue of concern. > > IMHO English proper name capitalization trumps the "official" spelling, > but this is not very important. Additionally, the reference is often to matlab as a programming language, such as programs written in matlab, where the author is not Mathworks. We don't add a trademark sign to C# or Java, or Gauss or .. either. Josef > -- > Pauli Virtanen > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From ralf.gommers at googlemail.com Fri Jun 4 07:30:57 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Fri, 4 Jun 2010 19:30:57 +0800 Subject: [SciPy-Dev] Building docs in scipy? In-Reply-To: References: Message-ID: On Fri, Jun 4, 2010 at 3:56 PM, Pauli Virtanen wrote: > Thu, 03 Jun 2010 23:52:10 -0400, Skipper Seabold wrote: > > Should numpy/doc/sphinxext be distributed with scipy/doc/ or is this > > user error? I couldn't get the scipy docs to build until I copied it > > over. > > It's pulled in by svn:externals. With git, ymmv. It probably should be > included in the distribution tarballs, nevertheless. > > Matthew helpfully pointed out some options (thanks!) to do this in git: http://news.gmane.org/gmane.comp.python.scientific.devel However, none of the options he gave are automatic, so people will keep running into this. I think building docs should work out of the box, so I see 2 options: 1. we copy sphinxext in scipy. it's not like it changes often, so keeping things in sync manually is doable. 2. sphinxext becomes a numpy module that can be imported and used from scipy. Including it in the tarballs while it's not in the repo is not a good idea imho - tarballs should be exactly the same as the svn tag. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From aisaac at american.edu Fri Jun 4 08:26:31 2010 From: aisaac at american.edu (Alan G Isaac) Date: Fri, 04 Jun 2010 08:26:31 -0400 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... In-Reply-To: <4C08A545.1040604@onetel.net> References: <4C08A545.1040604@onetel.net> Message-ID: <4C08F0F7.3070204@american.edu> On 6/4/2010 3:03 AM, Dr. David Kirkby wrote: > As for the TM, I tend to agree, it is pretty irrelevant, though I am not a > lawyer. It might be better from a legal point to leave them there. I think the usual rule is that the first use should show the trademark. I cannot find that explicitly as law, but see e.g. http://www.filemaker.com/company/legal/trademark_guidelines.html The key role of the TM symbol is to avoid confusion about branding, so in the case of MATLAB, this actually should be more than adequate, since it is a clear case of nominative use: http://en.wikipedia.org/wiki/Nominative_use fwiw, Alan Isaac From pav at iki.fi Fri Jun 4 08:47:02 2010 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 4 Jun 2010 12:47:02 +0000 (UTC) Subject: [SciPy-Dev] Building docs in scipy? References: Message-ID: Fri, 04 Jun 2010 19:30:57 +0800, Ralf Gommers wrote: [clip] >> Matthew helpfully pointed out some options (thanks!) 
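In the meantime, the usual workaround for Skipper's original problem is to point the Sphinx conf.py at a local copy or checkout of numpy/doc/sphinxext before listing the extension. A rough sketch only -- the '../sphinxext' path is an assumption, adjust it to wherever that checkout actually lives:

    # sketch of the relevant part of a doc/source/conf.py
    import os
    import sys

    # assumed location of a copy or checkout of numpy/doc/sphinxext
    sys.path.insert(0, os.path.abspath('../sphinxext'))

    extensions = [
        'sphinx.ext.autodoc',
        'numpydoc',   # provided by numpy/doc/sphinxext
    ]
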
to do this in git: > http://news.gmane.org/gmane.comp.python.scientific.devel > > However, none of the options he gave are automatic, so people will keep > running into this. I think building docs should work out of the box, so > I see 2 options: > > 1. we copy sphinxext in scipy. it's not like it changes often, so > keeping things in sync manually is doable. I'd say this is bad practice, and we should not do this. Better to have a Makefile rule that checks it out from git before building the documents, if it comes to that. > 2. sphinxext becomes a numpy module that can be imported and used from > scipy. > > Including it in the tarballs while it's not in the repo is not a good > idea imho - tarballs should be exactly the same as the svn tag. Matplotlib moved their sphinx stuff into a submodule. I'm a bit leery doing that either, since the Sphinx stuff has not much to do with Numpy itself... -- Pauli Virtanen From vincent at vincentdavis.net Fri Jun 4 08:51:42 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Fri, 4 Jun 2010 06:51:42 -0600 Subject: [SciPy-Dev] Marathon Skypecon tomorrow? In-Reply-To: References: Message-ID: I will not be able to be there. Thanks Vincent On Thu, Jun 3, 2010 at 11:04 PM, David Goldsmith wrote: > Email me your Skype ID if you want to participate tomorrow, noon EDT.? If no > one emails me, I'll post a cancellation notice around 11:50 am EDT. > > DG > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From jsseabold at gmail.com Fri Jun 4 09:05:25 2010 From: jsseabold at gmail.com (Skipper Seabold) Date: Fri, 4 Jun 2010 09:05:25 -0400 Subject: [SciPy-Dev] Building docs in scipy? In-Reply-To: References: Message-ID: On Fri, Jun 4, 2010 at 7:30 AM, Ralf Gommers wrote: > > > On Fri, Jun 4, 2010 at 3:56 PM, Pauli Virtanen wrote: > >> Thu, 03 Jun 2010 23:52:10 -0400, Skipper Seabold wrote: >> > Should numpy/doc/sphinxext be distributed with scipy/doc/ or is this >> > user error? I couldn't get the scipy docs to build until I copied it >> > over. >> >> Ah, ok. I just got one of Pauli's branches (or whatever they are called in git!) off github to look at the optimization rewrite, so that explains that. I didn't think I had to to this before. > It's pulled in by svn:externals. With git, ymmv. It probably should be >> included in the distribution tarballs, nevertheless. >> >> Matthew helpfully pointed out some options (thanks!) to do this in git: > http://news.gmane.org/gmane.comp.python.scientific.devel > > However, none of the options he gave are automatic, so people will keep > running into this. I think building docs should work out of the box, so I > see 2 options: > 1. we copy sphinxext in scipy. it's not like it changes often, so keeping > things in sync manually is doable. > 2. sphinxext becomes a numpy module that can be imported and used from > scipy. > I would vote for 2 if possible because I also use this stuff and have just been copying it over by hand for now. Thanks, Skipper -------------- next part -------------- An HTML attachment was scrubbed... URL: From gael.varoquaux at normalesup.org Fri Jun 4 09:06:39 2010 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Fri, 4 Jun 2010 15:06:39 +0200 Subject: [SciPy-Dev] Building docs in scipy? In-Reply-To: References: Message-ID: <20100604130639.GD29814@phare.normalesup.org> On Fri, Jun 04, 2010 at 09:05:25AM -0400, Skipper Seabold wrote: > 2. 
sphinxext becomes a numpy module that can be imported and used from > scipy. > I would vote for 2 if possible because I also use this stuff and have just > been copying it over by hand for now. +1. Ga?l From jdh2358 at gmail.com Fri Jun 4 09:24:51 2010 From: jdh2358 at gmail.com (John Hunter) Date: Fri, 4 Jun 2010 08:24:51 -0500 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... In-Reply-To: References: <4C08A545.1040604@onetel.net> Message-ID: On Fri, Jun 4, 2010 at 5:30 AM, Matthew Brett wrote: > Hi, > > On Fri, Jun 4, 2010 at 8:18 AM, David Goldsmith wrote: >> Thanks, guys.? Not that your opinions aren't valuable, but it is a matter of >> legality that I'm concerned about, and we do have people on-list who seem to >> make it their business to worry about these things, so hopefully one of them >> will chime in as well. > > ;) - ah yes - it is an art that can take an age to learn, to > distinguish signal from noise ! Yes, young Jedi, we are not seeking your Wikipedia skills here. Please do not bother to research and answer the questions we pose. We are looking for an official IANAL/YMMV judgment. JDH From ralf.gommers at googlemail.com Fri Jun 4 10:11:42 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Fri, 4 Jun 2010 22:11:42 +0800 Subject: [SciPy-Dev] signal.ltisys test crashes on windows Message-ID: The offending test, (0, 3, 3) of: class TestSS2TF: def tst_matrix_shapes(self, p, q, r): ss2tf(np.zeros((p, p)), np.zeros((p, q)), np.zeros((r, p)), np.zeros((r, q)), 0) def test_basic(self): for p, q, r in [ (3, 3, 3), (0, 3, 3), (1, 1, 1)]: yield self.tst_matrix_shapes, p, q, r The 0 causes an empty array to be passed to ss2tf, which crashes the interpreter on Windows XP for both 2.5 and 2.6. Is the empty array really what was intended with this test? Thanks, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Fri Jun 4 10:20:01 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 4 Jun 2010 10:20:01 -0400 Subject: [SciPy-Dev] signal.ltisys test crashes on windows In-Reply-To: References: Message-ID: On Fri, Jun 4, 2010 at 10:11 AM, Ralf Gommers wrote: > The offending test, (0, 3, 3) of: > > class TestSS2TF: > ??? def tst_matrix_shapes(self, p, q, r): > ??????? ss2tf(np.zeros((p, p)), > ????????????? np.zeros((p, q)), > ????????????? np.zeros((r, p)), > ????????????? np.zeros((r, q)), 0) > > ??? def test_basic(self): > ??????? for p, q, r in [ > ??????????? (3, 3, 3), > ??????????? (0, 3, 3), > ??????????? (1, 1, 1)]: > ??????????? yield self.tst_matrix_shapes, p, q, r > > > The 0 causes an empty array to be passed to ss2tf, which crashes the > interpreter on Windows XP for both 2.5 and 2.6. Is the empty array really > what was intended with this test? replace > (0, 3, 3), by > (1, 3, 3), We had a recent threads about this, and the crash is avoided with numpy trunk (raises an exception instead) But I don't think the empty array is an appropriate test. Josef > > Thanks, > Ralf > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From guyer at nist.gov Fri Jun 4 10:21:05 2010 From: guyer at nist.gov (Jonathan Guyer) Date: Fri, 4 Jun 2010 10:21:05 -0400 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... In-Reply-To: References: Message-ID: On Jun 3, 2010, at 10:12 PM, David Goldsmith wrote: > ...everywhere it occurs? 
Our rules are obviously not your rules, but the NIST Editorial Review Board explicitly prohibits the use if (TM) and the like in NIST publications (we are discouraged from using trade names at all, unless necessary to specify the "experimental" apparatus (I'm pretty sure {s,S}ci{p,P}y's usage of "{m,M}{atlab,ATLAB}" would be considered acceptable, since {m,M}{atlab,ATLAB} compatibility is the point)). My understanding from when I served on the Board is that "(TM)" carries no legal weight at all (anybody can affix it to a name they "claim") and that although "(R)" does carry legal weight, it was not considered our responsibility to defend other people's trademarks. In fact, we are emphatically required to use "(R)" with NIST registered trademarks (e.g., "Standard Reference Material(R)") and banned from using it with anybody else's trademarks, registered or otherwise. None of this should be construed as any official NIST guidance as to what *you* should do, only my understanding of what *I* am supposed to do. From warren.weckesser at enthought.com Fri Jun 4 10:31:05 2010 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Fri, 04 Jun 2010 09:31:05 -0500 Subject: [SciPy-Dev] signal.ltisys test crashes on windows In-Reply-To: References: Message-ID: <4C090E29.1050506@enthought.com> josef.pktd at gmail.com wrote: > On Fri, Jun 4, 2010 at 10:11 AM, Ralf Gommers > wrote: > >> The offending test, (0, 3, 3) of: >> >> class TestSS2TF: >> def tst_matrix_shapes(self, p, q, r): >> ss2tf(np.zeros((p, p)), >> np.zeros((p, q)), >> np.zeros((r, p)), >> np.zeros((r, q)), 0) >> >> def test_basic(self): >> for p, q, r in [ >> (3, 3, 3), >> (0, 3, 3), >> (1, 1, 1)]: >> yield self.tst_matrix_shapes, p, q, r >> >> >> The 0 causes an empty array to be passed to ss2tf, which crashes the >> interpreter on Windows XP for both 2.5 and 2.6. Is the empty array really >> what was intended with this test? >> > > replace > > >> (0, 3, 3), >> > by > >> (1, 3, 3), >> > > Agreed. 0 is a degenerate case. Perhaps the original author of the test expected ss2tf to handle this case cleanly, but it currently doesn't. Warren > We had a recent threads about this, and the crash is avoided with > numpy trunk (raises an exception instead) > > But I don't think the empty array is an appropriate test. > > Josef > > > >> Thanks, >> Ralf >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From d.l.goldsmith at gmail.com Fri Jun 4 11:11:53 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Fri, 4 Jun 2010 08:11:53 -0700 Subject: [SciPy-Dev] Does Matlab need to be MATLAB(TM)... In-Reply-To: References: Message-ID: On Fri, Jun 4, 2010 at 7:21 AM, Jonathan Guyer wrote: > > On Jun 3, 2010, at 10:12 PM, David Goldsmith wrote: > > > ...everywhere it occurs? > > Our rules are obviously not your rules, but the NIST Editorial Review Board > explicitly prohibits the use if (TM) and the like in NIST publications (we > are discouraged from using trade names at all, unless necessary to specify > the "experimental" apparatus (I'm pretty sure {s,S}ci{p,P}y's usage of > "{m,M}{atlab,ATLAB}" would be considered acceptable, since > {m,M}{atlab,ATLAB} compatibility is the point)). 
> > My understanding from when I served on the Board is that "(TM)" carries no > legal weight at all (anybody can affix it to a name they "claim") and that > although "(R)" does carry legal weight, it was not considered our > responsibility to defend other people's trademarks. In fact, we are > emphatically required to use "(R)" with NIST registered trademarks (e.g., > "Standard Reference Material(R)") and banned from using it with anybody > else's trademarks, registered or otherwise. > > None of this should be construed as any official NIST guidance as to what > *you* should do, only my understanding of what *I* am supposed to do. > Understood, but you guys are the "National Institute of Standards and Technology," which is good enough for me - I'm going to cease to worry about it (and in fact take out the TM where I see it). :-) DG > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan.czesla at hs.uni-hamburg.de Fri Jun 4 11:49:37 2010 From: stefan.czesla at hs.uni-hamburg.de (Stefan) Date: Fri, 4 Jun 2010 15:49:37 +0000 (UTC) Subject: [SciPy-Dev] =?utf-8?q?np=2Esavetxt=3A_apply_patch_in_enhancement_?= =?utf-8?q?ticket_1079=09to_add_headers=3F?= References: <4C066DA3.8010609@gmail.com> Message-ID: Dear all, as a consequence of our discussion, we developed a patch (attached to ticket 1079), which implements some of the features discussed here. We concentrated on comments and the header. Please have a look at the patch. We are looking forward to hearing your opinion and suggestions, and whether you see any problems, which could prevent it from entering the official release. We agree with Bruce that the format string should be inferred from the data type of the array. Yet, we believe that this point should be addressed in a different patch focussing on that topic. Also we noted that there is no error checking, when an array of dimension larger 2 is handed to np.savetxt, which may be implemented easily. Stefan & Christian From d.l.goldsmith at gmail.com Fri Jun 4 11:56:57 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Fri, 4 Jun 2010 08:56:57 -0700 Subject: [SciPy-Dev] Canceling Skypecon again Message-ID: Due to lack of issues requiring live discussion, and illness of host. :-( DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsseabold at gmail.com Fri Jun 4 12:03:33 2010 From: jsseabold at gmail.com (Skipper Seabold) Date: Fri, 4 Jun 2010 12:03:33 -0400 Subject: [SciPy-Dev] np.savetxt: apply patch in enhancement ticket 1079 to add headers? In-Reply-To: References: <4C066DA3.8010609@gmail.com> Message-ID: On Fri, Jun 4, 2010 at 11:49 AM, Stefan wrote: > Dear all, > > as a consequence of our discussion, we developed a patch (attached to > ticket 1079), which implements some of the features discussed here. > We concentrated on comments and the header. Please have a look at the > patch. We are looking forward to hearing your opinion and suggestions, > and whether you see any problems, which could prevent it from entering the > official release. 
> Link: http://projects.scipy.org/numpy/ticket/1079 One comment. Maybe you can add in the notes that the comment keyword can be used to write a header and still preserve compatibility with loadtxt. This wasn't obvious to me at first, though maybe that's just me. Other than that I think it looks like a good first effort towards making this a better function and I appreciate the attention here. Skipper > We agree with Bruce that the format string should be inferred from the > data type of the array. Yet, we believe that this point should be > addressed in a different patch focussing on that topic. > > Also we noted that there is no error checking, when an array of dimension > larger 2 is handed to np.savetxt, which may be implemented easily. > > Stefan & Christian > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From warren.weckesser at enthought.com Fri Jun 4 12:29:53 2010 From: warren.weckesser at enthought.com (Warren Weckesser) Date: Fri, 04 Jun 2010 11:29:53 -0500 Subject: [SciPy-Dev] ODEINT/ODE solvers redesign--anyone for a sprint at SciPy 2010? Message-ID: <4C092A01.9040905@enthought.com> It's about time we tackled the issue of the ODE solvers in SciPy. Some notes about the issue are on the wiki: http://projects.scipy.org/scipy/wiki/OdeintRedesign This would be a great topic for a sprint at the SciPy conference. I just added it to the list of suggested sprint topics, so give it a vote if you are going to be there and are interested in working on this. Warren From bsouthey at gmail.com Fri Jun 4 13:08:12 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Fri, 04 Jun 2010 12:08:12 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C07ADC1.6040504@enthought.com> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> <4C07ADC1.6040504@enthought.com> Message-ID: <4C0932FC.2020108@gmail.com> On 06/03/2010 08:27 AM, Warren Weckesser wrote: > Just letting you know that I'm not ignoring all the great comments from > josef, Neil and Bruce about my suggestion for chisquare_contingency. > Unfortunately, I won't have time to think about all the deeper > suggestions for another week or so. For now, I'll just say that I > agree with josef's and Neil's suggestions for the docstring, and that > Neil's summary of the function as simply a convenience function that > calls stats.chisquare with appropriate arguments to perform a test of > independence on a contingency table is exactly what I had in mind. > > Warren > > > Hi, I looked at how SAS handles n-way tables. What it appears to do is break the original table down into a set of 2-way tables and does the analysis on each of these. So a 3 by 4 by 5 table is processed as three 2-way tables with the results of each 4 by 5 table presented. I do not know how Stata and R analysis analyze n-way tables. Consequently, I rewrote my suggested code (attached) to handle 3 and 4 way tables by using recursion. There should be some Python way to do that recursion for any number of dimensions. I also added the 1-way table (but that has a different hypothesis than the 2-way table) so users can send a 1-d table. The data used is from two SAS examples and I added a dimension to get a 4-way table. 
I included the SAS values but these are only to 4 decimal places for reference. http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect029.htm http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect030.htm What is missing: 1) Docstring and tests but those are dependent what is ultimately decided 2) Other test statistics but scipy.stats versions are not very friendly in that these do not accept a 2-d array 3) A way to do recursion 4) Ability to label the levels etc. 5) Correct handling of input types. Bruce -------------- next part -------------- A non-text attachment was scrubbed... Name: cont_table.py Type: text/x-python Size: 4300 bytes Desc: not available URL: From bsouthey at gmail.com Fri Jun 4 14:08:15 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Fri, 04 Jun 2010 13:08:15 -0500 Subject: [SciPy-Dev] np.savetxt: apply patch in enhancement ticket 1079 to add headers? In-Reply-To: References: <4C066DA3.8010609@gmail.com> Message-ID: <4C09410F.1010900@gmail.com> On 06/04/2010 11:03 AM, Skipper Seabold wrote: > On Fri, Jun 4, 2010 at 11:49 AM, Stefan wrote: > >> Dear all, >> >> as a consequence of our discussion, we developed a patch (attached to >> ticket 1079), which implements some of the features discussed here. >> We concentrated on comments and the header. Please have a look at the >> patch. We are looking forward to hearing your opinion and suggestions, >> and whether you see any problems, which could prevent it from entering the >> official release. >> >> > Link: http://projects.scipy.org/numpy/ticket/1079 > > One comment. Maybe you can add in the notes that the comment keyword > can be used to write a header and still preserve compatibility with > loadtxt. This wasn't obvious to me at first, though maybe that's just > me. > > Other than that I think it looks like a good first effort towards > making this a better function and I appreciate the attention here. > > Skipper > > >> We agree with Bruce that the format string should be inferred from the >> data type of the array. Yet, we believe that this point should be >> addressed in a different patch focussing on that topic. >> >> Also we noted that there is no error checking, when an array of dimension >> larger 2 is handed to np.savetxt, which may be implemented easily. >> >> Stefan& Christian >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > Hi, For the sake of similarity to loadtxt keywords (because loadtxt has them and changing those is harder than adding new ones to savetxt): 1) 'comment_character' should be 'comments' 2) instead of 'comment' perhaps use 'preamble' Thanks for doing the patch so quickly! 
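As a reference point while the patch is reviewed, the header-plus-loadtxt round trip Skipper mentions can already be approximated with the released functions, by writing the commented header by hand before calling savetxt. A minimal sketch (file name and column labels are made up):

    import numpy as np

    data = np.column_stack((np.arange(5), np.arange(5) ** 2))

    # Write a '#'-prefixed header line first, then let savetxt append the data.
    # Because '#' is loadtxt's default comment character, the header is
    # skipped on reading and the file round-trips cleanly.
    fh = open('example.txt', 'w')
    fh.write('# x  x_squared\n')
    np.savetxt(fh, data, fmt='%d')
    fh.close()

    back = np.loadtxt('example.txt')   # shape (5, 2), header line ignored
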
Bruce From josef.pktd at gmail.com Fri Jun 4 14:12:06 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 4 Jun 2010 14:12:06 -0400 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C0932FC.2020108@gmail.com> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> <4C07ADC1.6040504@enthought.com> <4C0932FC.2020108@gmail.com> Message-ID: On Fri, Jun 4, 2010 at 1:08 PM, Bruce Southey wrote: > On 06/03/2010 08:27 AM, Warren Weckesser wrote: >> >> Just letting you know that I'm not ignoring all the great comments from >> josef, Neil and Bruce about my suggestion for chisquare_contingency. >> Unfortunately, I won't have time to think about all the deeper >> suggestions for another week or so. ? For now, I'll just say that I >> agree with josef's and Neil's suggestions for the docstring, and that >> Neil's summary of the function as simply a convenience function that >> calls stats.chisquare with appropriate arguments to perform a test of >> independence on a contingency table is exactly what I had in mind. >> >> Warren >> >> >> > > Hi, > I looked at how SAS handles n-way tables. What it appears to do is break the > original table down into a set of 2-way tables and does the analysis on each > of these. So a 3 by 4 by 5 table is processed as three 2-way tables with the > results of each 4 by 5 table presented. I do not know how Stata and R > analysis analyze n-way tables. > > Consequently, I rewrote my suggested code (attached) to handle 3 and 4 way > tables by using recursion. There should be some Python way to do that > recursion for any number of dimensions. I also added the 1-way table (but > that has a different hypothesis than the 2-way table) so users can send a > 1-d table. (very briefly because I don't have much time today) I think, these are good extensions, but to handle all cases, the function is getting too large and would need several options. On your code and SAS, Z(correct me if my quick reading is wrong) You seem to be calculating conditional independence for the last two variables conditional on the values of the first variables. I think this could be generalized to all pairwise independence tests. Similar, I'm a bit surprised that SAS uses conditional and not marginal independence, I would have thought that the test for marginal independence (aggregate out all but 2 variables) would be the more common use case. Initially, I was thinking just about independence of all variables in a 3 or more way table, i.e. P(x,y,z)=P(x)*P(y)*P(z) My opinion is that these variations of tests would fit better in a class where all pairwise conditional, and marginal and joint hypotheses can be supplied as methods, or split it up into a group of functions. Thanks, Josef > > The data used is from two SAS examples and I added a dimension to get a > 4-way table. I included the SAS values but these are only to 4 decimal > places for reference. 
> > http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect029.htm > http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect030.htm > > What is missing: > 1) Docstring and tests but those are dependent what is ultimately decided > 2) Other test statistics but scipy.stats versions are not very friendly in > that these do not accept a 2-d array > 3) A way to do recursion > 4) Ability to label the levels etc. > 5) Correct handling of input types. > > Bruce > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From vincent at vincentdavis.net Fri Jun 4 20:49:27 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Fri, 4 Jun 2010 18:49:27 -0600 Subject: [SciPy-Dev] why this not report an error for a.dtype=float when a is mixed struc array Message-ID: Is there a reason not to have this return an error. >>> a1 = np.array([(1,3.3),(2,4.4)], dtype=[('a', int),('b', float)]) >>> a1 array([(1, 3.2999999999999998), (2, 4.4000000000000004)], dtype=[('a', '>> a1.dtype=float >>> a1 array([ 4.94065646e-324, 3.30000000e+000, 9.88131292e-324, 4.40000000e+000]) It seems that this could really cause problems if you did not notice what was going on. Vincent From ralf.gommers at googlemail.com Sat Jun 5 06:45:48 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Sat, 5 Jun 2010 18:45:48 +0800 Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 Message-ID: I'm pleased to announce the first beta release of SciPy 0.8.0. SciPy is a package of tools for science and engineering for Python. It includes modules for statistics, optimization, integration, linear algebra, Fourier transforms, signal and image processing, ODE solvers, and more. This beta release comes almost one and a half year after the 0.7.0 release and contains many new features, numerous bug-fixes, improved test coverage, and better documentation. Please note that SciPy 0.8.0b1 requires Python 2.4 or greater and NumPy 1.4.1 or greater. For information, please see the release notes: http://sourceforge.net/projects/scipy/files/scipy/0.8.0b1/NOTES.txt/view You can download the release from here: https://sourceforge.net/projects/scipy/ Python 2.5/2.6 binaries for Windows and OS X are available as well as source tarballs for other platforms. Thank you to everybody who contributed to this release. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Sat Jun 5 04:22:04 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sat, 5 Jun 2010 01:22:04 -0700 Subject: [SciPy-Dev] why this not report an error for a.dtype=float when a is mixed struc array In-Reply-To: References: Message-ID: On 4 June 2010 17:49, Vincent Davis wrote: > Is there a reason not to have this return an error. >>>> a1 = np.array([(1,3.3),(2,4.4)], dtype=[('a', int),('b', float)]) >>>> a1 > array([(1, 3.2999999999999998), (2, 4.4000000000000004)], > ? ? ?dtype=[('a', '>>> a1.dtype=float >>>> a1 > array([ ?4.94065646e-324, ? 3.30000000e+000, ? 9.88131292e-324, > ? ? ? ? 4.40000000e+000]) This is a feature! Sometimes, it is handy to view the raw memory in different ways. You are probably looking for the "astype" method. 
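For what it's worth, a minimal sketch of the difference, reusing the structured array from above (nothing here is new API, just the two operations side by side):

    import numpy as np

    a1 = np.array([(1, 3.3), (2, 4.4)], dtype=[('a', int), ('b', float)])

    # view: reinterpret the same bytes with another dtype, no copy is made.
    # The integer field's bytes are read back as float64, which is where
    # "garbage" values like 4.94e-324 come from.
    raw = a1.view(np.float64)

    # conversion: pull the fields out and cast them, which copies the data
    values = np.column_stack((a1['a'].astype(np.float64), a1['b']))
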
Regards St?fan From d.l.goldsmith at gmail.com Sat Jun 5 03:11:02 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Sat, 5 Jun 2010 00:11:02 -0700 Subject: [SciPy-Dev] A doc-related, check-in-related request Message-ID: Hi! If you add documented code to NumPy or SciPy, the Wiki will pull the docstring and will give it the status of "Needs editing," even if you have supplied a "Needs review"-quality docstring. Also, even if your docstring isn't "Needs review"-quality, you, as code writer, are presumably the best person to "own" the docstring, be it for the purpose of finishing it later or serving as a reference for someone else to do so. So, I make the following general request: a few days after you commit your code (give it a few days because the Wiki doesn't always pull right away), please visit your new committed objects in the Wiki and do one of two things: if you feel the docstring is "finished," please go ahead and promote it to "Needs review" status; if you feel the docstring is unfinished, please "claim" it by editing it (if you don't have time for substantive edits, you can just add a line break or something similarly trivial, just something so that the Wiki will record you as having made an edit), which in turn will automatically promote it to "Being written" (which alerts others to check the log to see if someone else is working on the docstring). This way, new docstrings don't make our progress look, in the Wiki, like regress. Thanks! DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From vincent at vincentdavis.net Sat Jun 5 11:09:01 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Sat, 5 Jun 2010 09:09:01 -0600 Subject: [SciPy-Dev] why this not report an error for a.dtype=float when a is mixed struc array In-Reply-To: References: Message-ID: 2010/6/5 St?fan van der Walt : > On 4 June 2010 17:49, Vincent Davis wrote: >> Is there a reason not to have this return an error. >>>>> a1 = np.array([(1,3.3),(2,4.4)], dtype=[('a', int),('b', float)]) >>>>> a1 >> array([(1, 3.2999999999999998), (2, 4.4000000000000004)], >> ? ? ?dtype=[('a', '>>>> a1.dtype=float >>>>> a1 >> array([ ?4.94065646e-324, ? 3.30000000e+000, ? 9.88131292e-324, >> ? ? ? ? 4.40000000e+000]) > > This is a feature! ?Sometimes, it is handy to view the raw memory in > different ways. Out of curiosity how would I use this? Thanks Vincent > > You are probably looking for the "astype" method. > > Regards > St?fan > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From pav at iki.fi Sat Jun 5 11:17:15 2010 From: pav at iki.fi (Pauli Virtanen) Date: Sat, 5 Jun 2010 15:17:15 +0000 (UTC) Subject: [SciPy-Dev] Clarification: is the Extended Summary section optional? References: Message-ID: Wed, 02 Jun 2010 20:09:39 -0600, Vincent Davis wrote: [clip] > As I am always interested in learning new things is there any help I can > offer in getting the wiki review feature implemented? Yes, definitely, any help here is appreciated! The list of issues to do has grown a bit long, as I haven't found sufficient time to tackle them :/ If you are not yet familiar with Django, the following will be helpful: http://docs.djangoproject.com/en/1.2/ Myself, I'd start by going hands-on through their excellent tutorial before diving into Pydocweb. 
The doc editor itself is not a very special as a Django app, and follows the usual Django conventions, so the tutorial should make several things more clear. (Unfortunately the app was a bit hastily cobbled together, and this shows at some points.) If you have specific questions, feel free to ask! -- Pauli Virtanen From jsseabold at gmail.com Sat Jun 5 11:20:05 2010 From: jsseabold at gmail.com (Skipper Seabold) Date: Sat, 5 Jun 2010 11:20:05 -0400 Subject: [SciPy-Dev] why this not report an error for a.dtype=float when a is mixed struc array In-Reply-To: References: Message-ID: On Sat, Jun 5, 2010 at 11:09 AM, Vincent Davis wrote: > 2010/6/5 St?fan van der Walt : >> On 4 June 2010 17:49, Vincent Davis wrote: >>> Is there a reason not to have this return an error. >>>>>> a1 = np.array([(1,3.3),(2,4.4)], dtype=[('a', int),('b', float)]) >>>>>> a1 >>> array([(1, 3.2999999999999998), (2, 4.4000000000000004)], >>> ? ? ?dtype=[('a', '>>>>> a1.dtype=float >>>>>> a1 >>> array([ ?4.94065646e-324, ? 3.30000000e+000, ? 9.88131292e-324, >>> ? ? ? ? 4.40000000e+000]) >> >> This is a feature! ?Sometimes, it is handy to view the raw memory in >> different ways. > You might find this thread helpful. Especially, Chris's reply. http://thread.gmane.org/gmane.comp.python.numeric.general/32664/ Skipper From vincent at vincentdavis.net Sat Jun 5 11:49:09 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Sat, 5 Jun 2010 09:49:09 -0600 Subject: [SciPy-Dev] why this not report an error for a.dtype=float when a is mixed struc array In-Reply-To: References: Message-ID: On Sat, Jun 5, 2010 at 9:20 AM, Skipper Seabold wrote: > On Sat, Jun 5, 2010 at 11:09 AM, Vincent Davis wrote: >> 2010/6/5 St?fan van der Walt : >>> On 4 June 2010 17:49, Vincent Davis wrote: >>>> Is there a reason not to have this return an error. >>>>>>> a1 = np.array([(1,3.3),(2,4.4)], dtype=[('a', int),('b', float)]) >>>>>>> a1 >>>> array([(1, 3.2999999999999998), (2, 4.4000000000000004)], >>>> ? ? ?dtype=[('a', '>>>>>> a1.dtype=float >>>>>>> a1 >>>> array([ ?4.94065646e-324, ? 3.30000000e+000, ? 9.88131292e-324, >>>> ? ? ? ? 4.40000000e+000]) >>> >>> This is a feature! ?Sometimes, it is handy to view the raw memory in >>> different ways. >> > > You might find this thread helpful. ?Especially, Chris's reply. > > http://thread.gmane.org/gmane.comp.python.numeric.general/32664/ Ok now I understand why, that is why the numbers are a mess but not why it is a feature :) I guess I am try to think of why I would use this. I might be completely wrong but if most users would expect a different behavior and don't notice what is actually happening then maybe there should be a warning and a different way to get the current results. Vincent > > Skipper > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From stefan at sun.ac.za Sat Jun 5 23:08:18 2010 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sat, 5 Jun 2010 20:08:18 -0700 Subject: [SciPy-Dev] why this not report an error for a.dtype=float when a is mixed struc array In-Reply-To: References: Message-ID: On 5 June 2010 08:49, Vincent Davis wrote: >> You might find this thread helpful. ?Especially, Chris's reply. >> >> http://thread.gmane.org/gmane.comp.python.numeric.general/32664/ > > Ok now I understand why, that is why the numbers are a mess but not > why it is a feature :) I guess I am try to think of why I would use > this. 
I might be completely wrong but if most users would expect a > different behavior and don't notice what is actually happening then > maybe there should be a warning and a different way to get the current > results. There are many uses for 'view', such as examining underlying bytes or changing the subclass of an array without copying. I'm not sure I follow your argument, though. 'view' and 'astype' do distinctly different things (well defined), and are both necessary for advanced array computation. An array is simply a wrapper around memory, and it should not be too magical. Regards St?fan From vincent at vincentdavis.net Sat Jun 5 23:22:09 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Sat, 5 Jun 2010 21:22:09 -0600 Subject: [SciPy-Dev] why this not report an error for a.dtype=float when a is mixed struc array In-Reply-To: References: Message-ID: 2010/6/5 St?fan van der Walt : > On 5 June 2010 08:49, Vincent Davis wrote: >>> You might find this thread helpful. ?Especially, Chris's reply. >>> >>> http://thread.gmane.org/gmane.comp.python.numeric.general/32664/ >> >> Ok now I understand why, that is why the numbers are a mess but not >> why it is a feature :) I guess I am try to think of why I would use >> this. I might be completely wrong but if most users would expect a >> different behavior and don't notice what is actually happening then >> maybe there should be a warning and a different way to get the current >> results. > > There are many uses for 'view', such as examining underlying bytes or > changing the subclass of an array without copying. > > I'm not sure I follow your argument, though. ?'view' and 'astype' do > distinctly different things (well defined), and are both necessary for > advanced array computation. ?An array is simply a wrapper around > memory, and it should not be too magical. i.e. I lack knowledge and experience, no thats not what you said but it is probably the correct assessment. Thanks Vincent > Regards > St?fan > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From ralf.gommers at googlemail.com Sun Jun 6 11:53:52 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Sun, 6 Jun 2010 23:53:52 +0800 Subject: [SciPy-Dev] SSE instruction in arpack file - f2py issue? Message-ID: When checking the 0.8.0b1 superpacks I found a single file with SSE instructions, sparse/linalg/eigen/arpack/_arpack.pyd. The only thing possible explanation I found is that f2py can add SSE instructions by default, as claimed here: http://thread.gmane.org/gmane.comp.python.f2py.user/712/focus=6882. Is this correct? Anyone have any other suggestions on where to look? Ticket and history: http://projects.scipy.org/scipy/ticket/1170 Thanks, Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From neilcrighton at gmail.com Mon Jun 7 07:34:04 2010 From: neilcrighton at gmail.com (Neil Crighton) Date: Mon, 7 Jun 2010 11:34:04 +0000 (UTC) Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 References: Message-ID: Ralf Gommers googlemail.com> writes: > I'm pleased to announce the first beta release of SciPy > 0.8.0.SciPy is a package of tools for science and engineering > for Python.It includes modules for statistics, optimization, > integration, linearalgebra, Fourier transforms, signal and > image processing, ODE solvers, and more.This beta release comes > almost one and a half year after the 0.7.0 release andcontains > many new features, numerous bug-fixes, improved testcoverage, > and better documentation. Please note that SciPy 0.8.0b1 > requires Python 2.4 or greater and NumPy 1.4.1 or greater. Thanks for getting the beta out! The release notes say Numpy 1.3 or greater is needed - is this correct? Above you say 1.4.1 is needed. I think "support for Python 3 in Scipy might not yet be included in Scipy 0.8" is too ambiguous. Just say 0.8 will not be compatible with Python 3, but we expect the next version (0.9?) to be compatible, if that's the case. Cheers, Neil From ralf.gommers at googlemail.com Mon Jun 7 07:55:24 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Mon, 7 Jun 2010 19:55:24 +0800 Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 In-Reply-To: References: Message-ID: On Mon, Jun 7, 2010 at 7:34 PM, Neil Crighton wrote: > Ralf Gommers googlemail.com> writes: > > > I'm pleased to announce the first beta release of SciPy > > 0.8.0.SciPy is a package of tools for science and engineering > > for Python.It includes modules for statistics, optimization, > > integration, linearalgebra, Fourier transforms, signal and > > image processing, ODE solvers, and more.This beta release comes > > almost one and a half year after the 0.7.0 release andcontains > > many new features, numerous bug-fixes, improved testcoverage, > > and better documentation. Please note that SciPy 0.8.0b1 > > requires Python 2.4 or greater and NumPy 1.4.1 or greater. > > Thanks for getting the beta out! > > The release notes say Numpy 1.3 or greater is needed - is this > correct? Above you say 1.4.1 is needed. No, 1.4.1 is needed. Notes are fixed now. > I think "support for > Python 3 in Scipy might not yet be included in Scipy 0.8" is too > ambiguous. Just say 0.8 will not be compatible with Python 3, but > we expect the next version (0.9?) to be compatible, if that's the > case. > > Reworded as: "Python 3 compatibility is planned and is currently technically feasible, since Numpy has been ported. However, since the Python 3 compatible Numpy 2.0 has not been released yet, support for Python 3 in Scipy is not yet included in Scipy 0.8. SciPy 0.9, planned for fall 2010, will very likely include experimental support for Python 3." Thanks for reporting, Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From josef.pktd at gmail.com Mon Jun 7 10:15:56 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 7 Jun 2010 10:15:56 -0400 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> <4C07ADC1.6040504@enthought.com> <4C0932FC.2020108@gmail.com> Message-ID: On Fri, Jun 4, 2010 at 2:12 PM, wrote: > On Fri, Jun 4, 2010 at 1:08 PM, Bruce Southey wrote: >> On 06/03/2010 08:27 AM, Warren Weckesser wrote: >>> >>> Just letting you know that I'm not ignoring all the great comments from >>> josef, Neil and Bruce about my suggestion for chisquare_contingency. >>> Unfortunately, I won't have time to think about all the deeper >>> suggestions for another week or so. ? For now, I'll just say that I >>> agree with josef's and Neil's suggestions for the docstring, and that >>> Neil's summary of the function as simply a convenience function that >>> calls stats.chisquare with appropriate arguments to perform a test of >>> independence on a contingency table is exactly what I had in mind. >>> >>> Warren >>> >>> >>> >> >> Hi, >> I looked at how SAS handles n-way tables. What it appears to do is break the >> original table down into a set of 2-way tables and does the analysis on each >> of these. So a 3 by 4 by 5 table is processed as three 2-way tables with the >> results of each 4 by 5 table presented. I do not know how Stata and R >> analysis analyze n-way tables. >> >> Consequently, I rewrote my suggested code (attached) to handle 3 and 4 way >> tables by using recursion. There should be some Python way to do that >> recursion for any number of dimensions. I also added the 1-way table (but >> that has a different hypothesis than the 2-way table) so users can send a >> 1-d table. > > (very briefly because I don't have much time today) > > I think, these are good extensions, but to handle all cases, the > function is getting too large and would need several options. > > On your code and SAS, Z(correct me if my quick reading is wrong) > You seem to be calculating conditional independence for the last two > variables conditional on the values of the first variables. I think > this could be generalized to all pairwise independence tests. > > Similar, I'm a bit surprised that SAS uses conditional and not > marginal independence, I would have thought that the test for marginal > independence (aggregate out all but 2 variables) would be the more > common use case. just some more questions and comments (until I have time to check this) looking at conditional independence looks similar to linear regression models, where the effect of other variables is taken out. However, looking at all chisquare tests (conditional on all possible other values) runs into the multiple test problem. Is the some kind of post-hoc or Bonferroni correction or is there a distribution for eg. the max of all chisquare test statistics. with an iterator (numpy mailinglist), my version for the conditional independence of the last two variables for all values of the earlier variables looks like for ind in allbut2ax_iterator(table3, axes=(-2,-1)): print chisquare_contingency(table3[ind]) Josef > > Initially, I was thinking just about independence of all variables in > a 3 or more way table, i.e. 
P(x,y,z)=P(x)*P(y)*P(z) > > My opinion is that these variations of tests would fit better in a > class where all pairwise conditional, and marginal and joint > hypotheses can be supplied as methods, or split it up into a group of > functions. > > Thanks, > > Josef > >> >> The data used is from two SAS examples and I added a dimension to get a >> 4-way table. I included the SAS values but these are only to 4 decimal >> places for reference. >> >> http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect029.htm >> http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect030.htm >> >> What is missing: >> 1) Docstring and tests but those are dependent what is ultimately decided >> 2) Other test statistics but scipy.stats versions are not very friendly in >> that these do not accept a 2-d array >> 3) A way to do recursion >> 4) Ability to label the levels etc. >> 5) Correct handling of input types. >> >> Bruce >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> > From charlesr.harris at gmail.com Mon Jun 7 10:20:01 2010 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 7 Jun 2010 08:20:01 -0600 Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 In-Reply-To: References: Message-ID: On Mon, Jun 7, 2010 at 5:55 AM, Ralf Gommers wrote: > > > On Mon, Jun 7, 2010 at 7:34 PM, Neil Crighton wrote: > >> Ralf Gommers googlemail.com> writes: >> >> > I'm pleased to announce the first beta release of SciPy >> > 0.8.0.SciPy is a package of tools for science and engineering >> > for Python.It includes modules for statistics, optimization, >> > integration, linearalgebra, Fourier transforms, signal and >> > image processing, ODE solvers, and more.This beta release comes >> > almost one and a half year after the 0.7.0 release andcontains >> > many new features, numerous bug-fixes, improved testcoverage, >> > and better documentation. Please note that SciPy 0.8.0b1 >> > requires Python 2.4 or greater and NumPy 1.4.1 or greater. >> >> Thanks for getting the beta out! >> >> The release notes say Numpy 1.3 or greater is needed - is this >> correct? Above you say 1.4.1 is needed. > > > No, 1.4.1 is needed. Notes are fixed now. > > >> I think "support for >> Python 3 in Scipy might not yet be included in Scipy 0.8" is too >> ambiguous. Just say 0.8 will not be compatible with Python 3, but >> we expect the next version (0.9?) to be compatible, if that's the >> case. >> >> Reworded as: > "Python 3 compatibility is planned and is currently technically > feasible, since Numpy has been ported. However, since the Python 3 > compatible Numpy 2.0 has not been released yet, support for Python 3 > in Scipy is not yet included in Scipy 0.8. SciPy 0.9, planned for fall > 2010, will very likely include experimental support for Python 3." > > Are we going to release a Numpy 1.5? Also, the beta release should be noted on the SciPy home page. Maybe adding such notes needs to be part of the how-to-release checklist since it tends to be forgotten. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From bsouthey at gmail.com Mon Jun 7 11:00:35 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Mon, 07 Jun 2010 10:00:35 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> <4C07ADC1.6040504@enthought.com> <4C0932FC.2020108@gmail.com> Message-ID: <4C0D0993.1080206@gmail.com> On 06/07/2010 09:15 AM, josef.pktd at gmail.com wrote: > On Fri, Jun 4, 2010 at 2:12 PM, wrote: > >> On Fri, Jun 4, 2010 at 1:08 PM, Bruce Southey wrote: >> >>> On 06/03/2010 08:27 AM, Warren Weckesser wrote: >>> >>>> Just letting you know that I'm not ignoring all the great comments from >>>> josef, Neil and Bruce about my suggestion for chisquare_contingency. >>>> Unfortunately, I won't have time to think about all the deeper >>>> suggestions for another week or so. For now, I'll just say that I >>>> agree with josef's and Neil's suggestions for the docstring, and that >>>> Neil's summary of the function as simply a convenience function that >>>> calls stats.chisquare with appropriate arguments to perform a test of >>>> independence on a contingency table is exactly what I had in mind. >>>> >>>> Warren >>>> >>>> >>>> >>>> >>> Hi, >>> I looked at how SAS handles n-way tables. What it appears to do is break the >>> original table down into a set of 2-way tables and does the analysis on each >>> of these. So a 3 by 4 by 5 table is processed as three 2-way tables with the >>> results of each 4 by 5 table presented. I do not know how Stata and R >>> analysis analyze n-way tables. >>> >>> Consequently, I rewrote my suggested code (attached) to handle 3 and 4 way >>> tables by using recursion. There should be some Python way to do that >>> recursion for any number of dimensions. I also added the 1-way table (but >>> that has a different hypothesis than the 2-way table) so users can send a >>> 1-d table. >>> >> (very briefly because I don't have much time today) >> >> I think, these are good extensions, but to handle all cases, the >> function is getting too large and would need several options. >> >> On your code and SAS, Z(correct me if my quick reading is wrong) >> You seem to be calculating conditional independence for the last two >> variables conditional on the values of the first variables. I think >> this could be generalized to all pairwise independence tests. >> >> Similar, I'm a bit surprised that SAS uses conditional and not >> marginal independence, I would have thought that the test for marginal >> independence (aggregate out all but 2 variables) would be the more >> common use case. >> You can argue SAS's formulation relates to how the table is constructed because the hypothesis associated with the table is dependent on how the user constructs it. For example, the 3-way table A by (B by C) is very different from the 3-way table C by (B by A) yet these involve the same underlying numbers. If a user did not specify an order then considering all possible hypotheses is an option. Really log-linear models are a better approach to analysis n-way tables because these allow you to examine all these different hypotheses. > just some more questions and comments (until I have time to check this) > > looking at conditional independence looks similar to linear regression > models, where the effect of other variables is taken out. 
However, > looking at all chisquare tests (conditional on all possible other > values) runs into the multiple test problem. Is the some kind of > post-hoc or Bonferroni correction or is there a distribution for eg. > the max of all chisquare test statistics. > Ignoring my views on this, first 'multiple test problems' do not change the probability calculation for most approaches to compute the 'raw' p-value as the vast majority of the approaches require the 'raw' p-value. Second, it is very easy to say 'correct for multiple tests' but that is pure ignorance when 'what' you are correcting is for is not stated. If you are correcting the 'family-wise error rate' then you need to correctly define 'family-wise' in this situation especially to address at least one other assumption being made. > with an iterator (numpy mailinglist), my version for the conditional > independence of the last two variables for all values of the earlier > variables looks like > > for ind in allbut2ax_iterator(table3, axes=(-2,-1)): > print chisquare_contingency(table3[ind]) > > Josef > > A link: http://article.gmane.org/gmane.comp.python.numeric.general/38352 I would have to see. Bruce >> Initially, I was thinking just about independence of all variables in >> a 3 or more way table, i.e. P(x,y,z)=P(x)*P(y)*P(z) >> >> My opinion is that these variations of tests would fit better in a >> class where all pairwise conditional, and marginal and joint >> hypotheses can be supplied as methods, or split it up into a group of >> functions. >> >> Thanks, >> >> Josef >> >> >>> The data used is from two SAS examples and I added a dimension to get a >>> 4-way table. I included the SAS values but these are only to 4 decimal >>> places for reference. >>> >>> http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect029.htm >>> http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect030.htm >>> >>> What is missing: >>> 1) Docstring and tests but those are dependent what is ultimately decided >>> 2) Other test statistics but scipy.stats versions are not very friendly in >>> that these do not accept a 2-d array >>> 3) A way to do recursion >>> 4) Ability to label the levels etc. >>> 5) Correct handling of input types. >>> >>> Bruce >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >>> >>> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... 
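The 2-way case that the proposed convenience function covers reduces to a few lines of numpy: form the expected counts from the row and column marginals, compute the Pearson statistic, and compare it with a chi-square distribution on (R-1)*(C-1) degrees of freedom. The sketch below only illustrates that idea; the helper name chisquare_contingency_2d and its exact signature are assumptions for illustration, not the patch under review.

import numpy as np
from scipy import stats

def chisquare_contingency_2d(table):
    # Pearson chi-square test of independence for an R x C table of counts.
    table = np.asarray(table, dtype=float)
    n = table.sum()
    # expected counts under independence: outer product of the marginals / n
    expected = np.outer(table.sum(axis=1), table.sum(axis=0)) / n
    chi2 = ((table - expected) ** 2 / expected).sum()
    dof = (table.shape[0] - 1) * (table.shape[1] - 1)
    return chi2, stats.chi2.sf(chi2, dof), dof, expected

obs = np.array([[10, 20, 30],
                [15, 15, 40]])
print(chisquare_contingency_2d(obs)[:3])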
URL: From josef.pktd at gmail.com Mon Jun 7 11:45:06 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 7 Jun 2010 11:45:06 -0400 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C0D0993.1080206@gmail.com> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> <4C07ADC1.6040504@enthought.com> <4C0932FC.2020108@gmail.com> <4C0D0993.1080206@gmail.com> Message-ID: On Mon, Jun 7, 2010 at 11:00 AM, Bruce Southey wrote: > On 06/07/2010 09:15 AM, josef.pktd at gmail.com wrote: > > On Fri, Jun 4, 2010 at 2:12 PM, wrote: > > > On Fri, Jun 4, 2010 at 1:08 PM, Bruce Southey wrote: > > > On 06/03/2010 08:27 AM, Warren Weckesser wrote: > > > Just letting you know that I'm not ignoring all the great comments from > josef, Neil and Bruce about my suggestion for chisquare_contingency. > Unfortunately, I won't have time to think about all the deeper > suggestions for another week or so. ? For now, I'll just say that I > agree with josef's and Neil's suggestions for the docstring, and that > Neil's summary of the function as simply a convenience function that > calls stats.chisquare with appropriate arguments to perform a test of > independence on a contingency table is exactly what I had in mind. > > Warren > > > > > > Hi, > I looked at how SAS handles n-way tables. What it appears to do is break the > original table down into a set of 2-way tables and does the analysis on each > of these. So a 3 by 4 by 5 table is processed as three 2-way tables with the > results of each 4 by 5 table presented. I do not know how Stata and R > analysis analyze n-way tables. > > Consequently, I rewrote my suggested code (attached) to handle 3 and 4 way > tables by using recursion. There should be some Python way to do that > recursion for any number of dimensions. I also added the 1-way table (but > that has a different hypothesis than the 2-way table) so users can send a > 1-d table. > > > (very briefly because I don't have much time today) > > I think, these are good extensions, but to handle all cases, the > function is getting too large and would need several options. > > On your code and SAS, Z(correct me if my quick reading is wrong) > You seem to be calculating conditional independence for the last two > variables conditional on the values of the first variables. I think > this could be generalized to all pairwise independence tests. > > Similar, I'm a bit surprised that SAS uses conditional and not > marginal independence, I would have thought that the test for marginal > independence (aggregate out all but 2 variables) would be the more > common use case. > > > You can argue SAS's formulation relates to how the table is constructed > because the hypothesis associated with the table is dependent on how the > user constructs it. For example, the 3-way table A by (B by C) is very > different from the 3-way table C by (B by A) yet these involve the same > underlying numbers. If a user did not specify an order then considering all > possible hypotheses is an option. I don't know the SAS notation, what I thought in analogy to regression analysis, is that if one variable is considered as endogenous, then only pairwise tests with this variable need to be included. 
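One plausible reading of "only pairwise tests with the endogenous variable", sketched below for an n-way count array: pair the designated axis with each other axis in turn, aggregate the remaining axes out, and run an ordinary 2-way test on each marginal table. The helper name pairwise_with and the use of scipy.stats.chi2 are assumptions for illustration, not part of any proposed patch.

import numpy as np
from scipy import stats

def pairwise_with(table, axis=0):
    # Test the variable on `axis` against each other variable, after
    # aggregating all remaining axes out (marginal 2-way tables).
    table = np.asarray(table, dtype=float)
    results = []
    for other in range(table.ndim):
        if other == axis:
            continue
        t2 = table
        # sum out every axis except `axis` and `other`, highest index first
        # so that the remaining axis numbers stay valid
        for ax in sorted(set(range(table.ndim)) - set([axis, other]), reverse=True):
            t2 = t2.sum(axis=ax)
        n = t2.sum()
        expected = np.outer(t2.sum(axis=1), t2.sum(axis=0)) / n
        chi2 = ((t2 - expected) ** 2 / expected).sum()
        dof = (t2.shape[0] - 1) * (t2.shape[1] - 1)
        results.append((other, chi2, stats.chi2.sf(chi2, dof)))
    return results

table3 = np.random.randint(1, 20, size=(3, 4, 5))
for other, chi2, p in pairwise_with(table3, axis=0):
    print(other, chi2, p)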
> > Really log-linear models are a better approach to analysis n-way tables > because these allow you to examine all these different hypotheses. > > just some more questions and comments (until I have time to check this) > > looking at conditional independence looks similar to linear regression > models, where the effect of other variables is taken out. However, > looking at all chisquare tests (conditional on all possible other > values) runs into the multiple test problem. Is the some kind of > post-hoc or Bonferroni correction or is there a distribution for eg. > the max of all chisquare test statistics. > > > Ignoring my views on this, first 'multiple test problems' do not change the > probability calculation for most approaches to compute the 'raw' p-value as > the vast majority of the approaches require the 'raw' p-value. > > Second, it is very easy to say 'correct for multiple tests' but that is pure > ignorance when 'what' you are correcting is for is not stated. If you are > correcting the 'family-wise error rate' then you need to correctly define > 'family-wise' in this situation especially to address at least one other > assumption being made. I know nothing about this in the context of contingency tables. We recently had the discussion about multiple tests in the context of post-hoc tests for anova, where I had to read up. In econometrics, there is an extensive literature on this, and some cases like structural change tests with unknown change points I know pretty well. The main point that I wanted to make is, that multiple change tests need more attention and at least a warning in the docstring which (raw) p-values are reported, since it is easy for unwary users to misinterpret the reported p-values. But hopefully this could be extended to provide the user with options to do an appropriate correction. Josef > > with an iterator (numpy mailinglist), my version for the conditional > independence of the last two variables for all values of the earlier > variables looks like > > for ind in allbut2ax_iterator(table3, axes=(-2,-1)): > print chisquare_contingency(table3[ind]) > > Josef > > > > A link: > http://article.gmane.org/gmane.comp.python.numeric.general/38352 > > I would have to see. > > Bruce > > Initially, I was thinking just about independence of all variables in > a 3 or more way table, i.e. P(x,y,z)=P(x)*P(y)*P(z) > > My opinion is that these variations of tests would fit better in a > class where all pairwise conditional, and marginal and joint > hypotheses can be supplied as methods, or split it up into a group of > functions. > > Thanks, > > Josef > > > > The data used is from two SAS examples and I added a dimension to get a > 4-way table. I included the SAS values but these are only to 4 decimal > places for reference. > > http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect029.htm > http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect030.htm > > What is missing: > 1) Docstring and tests but those are dependent what is ultimately decided > 2) Other test statistics but scipy.stats versions are not very friendly in > that these do not accept a 2-d array > 3) A way to do recursion > 4) Ability to label the levels etc. > 5) Correct handling of input types. 
> > Bruce > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From bsouthey at gmail.com Mon Jun 7 12:45:07 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Mon, 07 Jun 2010 11:45:07 -0500 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> <4C07ADC1.6040504@enthought.com> <4C0932FC.2020108@gmail.com> <4C0D0993.1080206@gmail.com> Message-ID: <4C0D2213.7020302@gmail.com> On 06/07/2010 10:45 AM, josef.pktd at gmail.com wrote: > On Mon, Jun 7, 2010 at 11:00 AM, Bruce Southey wrote: > >> On 06/07/2010 09:15 AM, josef.pktd at gmail.com wrote: >> >> On Fri, Jun 4, 2010 at 2:12 PM, wrote: >> >> >> On Fri, Jun 4, 2010 at 1:08 PM, Bruce Southey wrote: >> >> >> On 06/03/2010 08:27 AM, Warren Weckesser wrote: >> >> >> Just letting you know that I'm not ignoring all the great comments from >> josef, Neil and Bruce about my suggestion for chisquare_contingency. >> Unfortunately, I won't have time to think about all the deeper >> suggestions for another week or so. For now, I'll just say that I >> agree with josef's and Neil's suggestions for the docstring, and that >> Neil's summary of the function as simply a convenience function that >> calls stats.chisquare with appropriate arguments to perform a test of >> independence on a contingency table is exactly what I had in mind. >> >> Warren >> >> >> >> >> >> Hi, >> I looked at how SAS handles n-way tables. What it appears to do is break the >> original table down into a set of 2-way tables and does the analysis on each >> of these. So a 3 by 4 by 5 table is processed as three 2-way tables with the >> results of each 4 by 5 table presented. I do not know how Stata and R >> analysis analyze n-way tables. >> >> Consequently, I rewrote my suggested code (attached) to handle 3 and 4 way >> tables by using recursion. There should be some Python way to do that >> recursion for any number of dimensions. I also added the 1-way table (but >> that has a different hypothesis than the 2-way table) so users can send a >> 1-d table. >> >> >> (very briefly because I don't have much time today) >> >> I think, these are good extensions, but to handle all cases, the >> function is getting too large and would need several options. >> >> On your code and SAS, Z(correct me if my quick reading is wrong) >> You seem to be calculating conditional independence for the last two >> variables conditional on the values of the first variables. I think >> this could be generalized to all pairwise independence tests. >> >> Similar, I'm a bit surprised that SAS uses conditional and not >> marginal independence, I would have thought that the test for marginal >> independence (aggregate out all but 2 variables) would be the more >> common use case. 
>> >> >> You can argue SAS's formulation relates to how the table is constructed >> because the hypothesis associated with the table is dependent on how the >> user constructs it. For example, the 3-way table A by (B by C) is very >> different from the 3-way table C by (B by A) yet these involve the same >> underlying numbers. If a user did not specify an order then considering all >> possible hypotheses is an option. >> > I don't know the SAS notation, what I thought in analogy to regression > analysis, is that if one variable is considered as endogenous, then > only pairwise tests with this variable need to be included. > This is not the same as regression for multiple reasons. Here we are testing independence without any distribution assumption associated with the actual data. (Of course under the normality assumption then these are the same. ) > >> Really log-linear models are a better approach to analysis n-way tables >> because these allow you to examine all these different hypotheses. >> >> just some more questions and comments (until I have time to check this) >> >> looking at conditional independence looks similar to linear regression >> models, where the effect of other variables is taken out. However, >> looking at all chisquare tests (conditional on all possible other >> values) runs into the multiple test problem. Is the some kind of >> post-hoc or Bonferroni correction or is there a distribution for eg. >> the max of all chisquare test statistics. >> >> >> Ignoring my views on this, first 'multiple test problems' do not change the >> probability calculation for most approaches to compute the 'raw' p-value as >> the vast majority of the approaches require the 'raw' p-value. >> >> Second, it is very easy to say 'correct for multiple tests' but that is pure >> ignorance when 'what' you are correcting is for is not stated. If you are >> correcting the 'family-wise error rate' then you need to correctly define >> 'family-wise' in this situation especially to address at least one other >> assumption being made. >> > I know nothing about this in the context of contingency tables. In a 2-way table there is no need for any correction so it is pointless to say 'correct for multiple tests'. In a 3-way or higher table, as you indicated, is essentially a test of conditional independence as I implemented it. It is also pointless to say 'correct for multiple tests' because you are first assuming conditional independence between say A by B given C=1 and A by B for C=2. So what happens when C=1 is independent of when C=2 so these do belong to different 'families'. Second, there is nothing said about the relation of either A or B with C - which may be a more critical problem. > We > recently had the discussion about multiple tests in the context of > post-hoc tests for anova, where I had to read up. > I am perhaps too aware of multiple testing and unfortunately these types of discussions go on and on and on. A lot depends on which of many 'schools' of thought you subscribe to. It basically amounts to 'hand waving' with no solution because these schools are defined by different fundamental assumptions that can not be challenged. Ultimately none are correct because we never know the true situation - if we did we would not be doing it. > In econometrics, there is an extensive literature on this, and some > cases like structural change tests with unknown change points I know > pretty well. 
> > The main point that I wanted to make is, that multiple change tests > need more attention and at least a warning in the docstring which > (raw) p-values are reported, since it is easy for unwary users to > misinterpret the reported p-values. But hopefully this could be > extended to provide the user with options to do an appropriate > correction. > > Josef > This is pointless because you are misunderstanding what is meant by 'multiple test correction'. Placing those kinds of statements in the wrong places also reflects ignorance especially when the correct value maybe given and there is no 'appropriate' correction possible. Further no statement is ever going to protect users from misinterpreting p-values. Bruce > > >> with an iterator (numpy mailinglist), my version for the conditional >> independence of the last two variables for all values of the earlier >> variables looks like >> >> for ind in allbut2ax_iterator(table3, axes=(-2,-1)): >> print chisquare_contingency(table3[ind]) >> >> Josef >> >> >> >> A link: >> http://article.gmane.org/gmane.comp.python.numeric.general/38352 >> >> I would have to see. >> >> Bruce >> >> Initially, I was thinking just about independence of all variables in >> a 3 or more way table, i.e. P(x,y,z)=P(x)*P(y)*P(z) >> >> My opinion is that these variations of tests would fit better in a >> class where all pairwise conditional, and marginal and joint >> hypotheses can be supplied as methods, or split it up into a group of >> functions. >> >> Thanks, >> >> Josef >> >> >> >> The data used is from two SAS examples and I added a dimension to get a >> 4-way table. I included the SAS values but these are only to 4 decimal >> places for reference. >> >> http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect029.htm >> http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect030.htm >> >> What is missing: >> 1) Docstring and tests but those are dependent what is ultimately decided >> 2) Other test statistics but scipy.stats versions are not very friendly in >> that these do not accept a 2-d array >> 3) A way to do recursion >> 4) Ability to label the levels etc. >> 5) Correct handling of input types. 
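For the joint hypothesis P(x,y,z) = P(x)*P(y)*P(z) quoted above, the expected counts are the total count times the outer product of the one-dimensional marginal proportions, and the degrees of freedom are prod(shape) - 1 - sum(shape_i - 1), which reduces to (R-1)*(C-1) in the 2-way case. A minimal numpy sketch of that computation; the function name is illustrative only and not an existing or proposed scipy API.

import numpy as np
from scipy import stats

def chisquare_mutual_independence(table):
    # H0: all variables in the n-way count table are jointly independent,
    # i.e. P(x, y, z, ...) = P(x) * P(y) * P(z) * ...
    table = np.asarray(table, dtype=float)
    n = table.sum()
    expected = np.ones_like(table)
    for ax in range(table.ndim):
        # one-dimensional marginal proportions for this axis
        marg = table
        for other in sorted(set(range(table.ndim)) - set([ax]), reverse=True):
            marg = marg.sum(axis=other)
        shape = [1] * table.ndim
        shape[ax] = table.shape[ax]
        expected = expected * (marg / n).reshape(shape)
    expected *= n
    chi2 = ((table - expected) ** 2 / expected).sum()
    dof = np.prod(table.shape) - 1 - sum(s - 1 for s in table.shape)
    return chi2, stats.chi2.sf(chi2, dof), dof

table3 = np.random.randint(1, 20, size=(3, 4, 5))
print(chisquare_mutual_independence(table3))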
>> >> Bruce >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From ralf.gommers at googlemail.com Mon Jun 7 12:56:10 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Tue, 8 Jun 2010 00:56:10 +0800 Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 In-Reply-To: References: Message-ID: On Mon, Jun 7, 2010 at 10:20 PM, Charles R Harris wrote: > > > On Mon, Jun 7, 2010 at 5:55 AM, Ralf Gommers wrote: > >> >> >> On Mon, Jun 7, 2010 at 7:34 PM, Neil Crighton wrote: >> >>> Ralf Gommers googlemail.com> writes: >>> >>> > I'm pleased to announce the first beta release of SciPy >>> > 0.8.0.SciPy is a package of tools for science and engineering >>> > for Python.It includes modules for statistics, optimization, >>> > integration, linearalgebra, Fourier transforms, signal and >>> > image processing, ODE solvers, and more.This beta release comes >>> > almost one and a half year after the 0.7.0 release andcontains >>> > many new features, numerous bug-fixes, improved testcoverage, >>> > and better documentation. Please note that SciPy 0.8.0b1 >>> > requires Python 2.4 or greater and NumPy 1.4.1 or greater. >>> >>> Thanks for getting the beta out! >>> >>> The release notes say Numpy 1.3 or greater is needed - is this >>> correct? Above you say 1.4.1 is needed. >> >> >> No, 1.4.1 is needed. Notes are fixed now. >> >> >>> I think "support for >>> Python 3 in Scipy might not yet be included in Scipy 0.8" is too >>> ambiguous. Just say 0.8 will not be compatible with Python 3, but >>> we expect the next version (0.9?) to be compatible, if that's the >>> case. >>> >>> Reworded as: >> "Python 3 compatibility is planned and is currently technically >> feasible, since Numpy has been ported. However, since the Python 3 >> compatible Numpy 2.0 has not been released yet, support for Python 3 >> in Scipy is not yet included in Scipy 0.8. SciPy 0.9, planned for fall >> 2010, will very likely include experimental support for Python 3." >> >> > Are we going to release a Numpy 1.5? > Yes. Guess I should reread such a paragraph a few times before committing. The only reason I've not made a 1.5 branch yet is I will only have time for a numpy release cycle at or towards the end of this scipy release cycle. Saves some backporting. If you think it'd be useful to do it now please let me know. > Also, the beta release should be noted on the SciPy home page. > Done. For the previous releases I put only the final release there. Maybe good to announce beta/rc releases but then just update the announcement instead of adding new items each time. It's a small sidebar after all. > Maybe adding such notes needs to be part of the how-to-release checklist > since it tends to be forgotten. > The scipy.org announcement is in there. I'll add a "check the release notes for ..." item. Here is my checklist of things to do before 0.8.0rc1. 
I'm traveling (without computer) for the next week, so if anyone wants to tackle any of these items, that would be be very helpful. - remove stuff in scipy.io as explained in 0.7.0 release notes - fix paver dmg task to include docs - add sphinxext to tarballs? - check numscons works - SSE instruction ticket: http://projects.scipy.org/scipy/ticket/1170 - linalg.qr: http://projects.scipy.org/scipy/ticket/243 - windows crash: http://projects.scipy.org/scipy/ticket/1102 - invalid 2.6 syntax: http://projects.scipy.org/scipy/ticket/1193 Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Mon Jun 7 14:30:57 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 7 Jun 2010 14:30:57 -0400 Subject: [SciPy-Dev] chi-square test for a contingency (R x C) table In-Reply-To: <4C0D2213.7020302@gmail.com> References: <4C05DDF3.9010206@enthought.com> <4C064D79.4030106@wartburg.edu> <4C06807A.40301@gmail.com> <4C06861C.1060401@wartburg.edu> <4C069E84.4020308@gmail.com> <4C06A059.6020901@wartburg.edu> <4C06B8FB.8080806@gmail.com> <4C07ADC1.6040504@enthought.com> <4C0932FC.2020108@gmail.com> <4C0D0993.1080206@gmail.com> <4C0D2213.7020302@gmail.com> Message-ID: On Mon, Jun 7, 2010 at 12:45 PM, Bruce Southey wrote: > On 06/07/2010 10:45 AM, josef.pktd at gmail.com wrote: >> On Mon, Jun 7, 2010 at 11:00 AM, Bruce Southey ?wrote: >> >>> On 06/07/2010 09:15 AM, josef.pktd at gmail.com wrote: >>> >>> On Fri, Jun 4, 2010 at 2:12 PM, ?wrote: >>> >>> >>> On Fri, Jun 4, 2010 at 1:08 PM, Bruce Southey ?wrote: >>> >>> >>> On 06/03/2010 08:27 AM, Warren Weckesser wrote: >>> >>> >>> Just letting you know that I'm not ignoring all the great comments from >>> josef, Neil and Bruce about my suggestion for chisquare_contingency. >>> Unfortunately, I won't have time to think about all the deeper >>> suggestions for another week or so. ? For now, I'll just say that I >>> agree with josef's and Neil's suggestions for the docstring, and that >>> Neil's summary of the function as simply a convenience function that >>> calls stats.chisquare with appropriate arguments to perform a test of >>> independence on a contingency table is exactly what I had in mind. >>> >>> Warren >>> >>> >>> >>> >>> >>> Hi, >>> I looked at how SAS handles n-way tables. What it appears to do is break the >>> original table down into a set of 2-way tables and does the analysis on each >>> of these. So a 3 by 4 by 5 table is processed as three 2-way tables with the >>> results of each 4 by 5 table presented. I do not know how Stata and R >>> analysis analyze n-way tables. >>> >>> Consequently, I rewrote my suggested code (attached) to handle 3 and 4 way >>> tables by using recursion. There should be some Python way to do that >>> recursion for any number of dimensions. I also added the 1-way table (but >>> that has a different hypothesis than the 2-way table) so users can send a >>> 1-d table. >>> >>> >>> (very briefly because I don't have much time today) >>> >>> I think, these are good extensions, but to handle all cases, the >>> function is getting too large and would need several options. >>> >>> On your code and SAS, Z(correct me if my quick reading is wrong) >>> You seem to be calculating conditional independence for the last two >>> variables conditional on the values of the first variables. I think >>> this could be generalized to all pairwise independence tests. 
>>> >>> Similar, I'm a bit surprised that SAS uses conditional and not >>> marginal independence, I would have thought that the test for marginal >>> independence (aggregate out all but 2 variables) would be the more >>> common use case. >>> >>> >>> You can argue SAS's formulation relates to how the table is constructed >>> because the hypothesis associated with the table is dependent on how the >>> user constructs it. For example, the 3-way table A by (B by C) is very >>> different from the 3-way table C by (B by A) yet these involve the same >>> underlying numbers. If a user did not specify an order then considering all >>> possible hypotheses is an option. >>> >> I don't know the SAS notation, what I thought in analogy to regression >> analysis, is that if one variable is considered as endogenous, then >> only pairwise tests with this variable need to be included. >> > This is not the same as regression for multiple reasons. Here we are > testing independence without any distribution assumption associated with > the actual data. (Of course under the normality assumption then these > are the same. ) > >> >>> Really log-linear models are a better approach to analysis n-way tables >>> because these allow you to examine all these different hypotheses. >>> >>> just some more questions and comments (until I have time to check this) >>> >>> looking at conditional independence looks similar to linear regression >>> models, where the effect of other variables is taken out. However, >>> looking at all chisquare tests (conditional on all possible other >>> values) runs into the multiple test problem. Is the some kind of >>> post-hoc or Bonferroni correction or is there a distribution for eg. >>> the max of all chisquare test statistics. >>> >>> >>> Ignoring my views on this, first 'multiple test problems' do not change the >>> probability calculation for most approaches to compute the 'raw' p-value as >>> the vast majority of the approaches require the 'raw' p-value. >>> >>> Second, it is very easy to say 'correct for multiple tests' but that is pure >>> ignorance when 'what' you are correcting is for is not stated. If you are >>> correcting the 'family-wise error rate' then you need to correctly define >>> 'family-wise' in this situation especially to address at least one other >>> assumption being made. >>> >> I know nothing about this in the context of contingency tables. > In a 2-way table there is no need for any correction so it is pointless > to say 'correct for multiple tests'. In a 3-way or higher table, as you > indicated, is essentially a test of conditional independence as I > implemented it. It is also pointless to say 'correct for multiple tests' > because you are first assuming conditional independence between say A by > B given C=1 and A by B for C=2. So what happens when C=1 is independent > of when C=2 so these do belong to different 'families'. Second, there is > nothing said about the relation of either A ?or B with C - which may be > a more critical problem. > >> We >> recently had the discussion about multiple tests in the context of >> post-hoc tests for anova, where I had to read up. >> > I am perhaps too aware of multiple testing and unfortunately these types > of discussions go on and on and on. A lot depends on which of many > 'schools' of thought you subscribe to. It basically amounts to 'hand > waving' ?with no solution because these schools are defined by different > fundamental ?assumptions that can not be challenged. 
Ultimately none are > correct because we never know the true situation - if we did we would > not be doing it. I think it depends on the hypothesis and the general statistical theory is relatively clear, but maybe some people prefer a "test-mining" approach. >> In econometrics, there is an extensive literature on this, and some >> cases like structural change tests with unknown change points I know >> pretty well. >> >> The main point that I wanted to make is, that multiple change tests >> need more attention and at least a warning in the docstring which >> (raw) p-values are reported, since it is easy for unwary users to >> misinterpret the reported p-values. But hopefully this could be >> extended to provide the user with options to do an appropriate >> correction. >> >> Josef >> > This is pointless because you are misunderstanding what is meant by > 'multiple test correction'. ??? > Placing those kinds of statements in the > wrong places also reflects ignorance especially when the correct value > maybe given and there is no 'appropriate' correction possible. Further > no statement is ever going to protect users from misinterpreting p-values. Doing a quick search on the recent literature, it seems there is a lot going on in doing proper multiple test correction, additional to more traditional tests, that I haven't tried you to really understand or where I don't know how well they generalize, e.g. (generalized) Cochran-Mantel-Haenszel Chi-Squared Test, Cochran?s Q test. I only read the abstract of this: http://jnci.oxfordjournals.org/cgi/content/abstract/99/2/147 "Twenty-one (50%) of them contained at least one of the following three basic flaws: 1) in outcome-related gene finding, an unstated, unclear, or inadequate control for multiple testing; 2) ....." Josef > > Bruce > > >> >> >>> with an iterator (numpy mailinglist), my version for the conditional >>> independence of the last two variables for all values of the earlier >>> variables looks like >>> >>> for ind in allbut2ax_iterator(table3, axes=(-2,-1)): >>> ? ? ?print chisquare_contingency(table3[ind]) >>> >>> Josef >>> >>> >>> >>> A link: >>> http://article.gmane.org/gmane.comp.python.numeric.general/38352 >>> >>> I would have to see. >>> >>> Bruce >>> >>> Initially, I was thinking just about independence of all variables in >>> a 3 or more way table, i.e. P(x,y,z)=P(x)*P(y)*P(z) >>> >>> My opinion is that these variations of tests would fit better in a >>> class where all pairwise conditional, and marginal and joint >>> hypotheses can be supplied as methods, or split it up into a group of >>> functions. >>> >>> Thanks, >>> >>> Josef >>> >>> >>> >>> The data used is from two SAS examples and I added a dimension to get a >>> 4-way table. I included the SAS values but these are only to 4 decimal >>> places for reference. >>> >>> http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect029.htm >>> http://support.sas.com/documentation/cdl/en/procstat/63104/HTML/default/viewer.htm#/documentation/cdl/en/procstat/63104/HTML/default/procstat_freq_sect030.htm >>> >>> What is missing: >>> 1) Docstring and tests but those are dependent what is ultimately decided >>> 2) Other test statistics but scipy.stats versions are not very friendly in >>> that these do not accept a 2-d array >>> 3) A way to do recursion >>> 4) Ability to label the levels etc. >>> 5) Correct handling of input types. 
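Purely to illustrate the mechanics under debate, and not to take a side on whether a family-wise correction is appropriate for conditional tables: the simplest convention, Bonferroni, multiplies each raw p-value by the number of tests in the declared family and caps the result at 1. A sketch, assuming the raw p-values from the per-slice tests have already been collected:

import numpy as np

def bonferroni(pvalues):
    # family-wise adjustment: p_adj = min(1, m * p) for a family of m tests
    pvalues = np.asarray(pvalues, dtype=float)
    return np.minimum(len(pvalues) * pvalues, 1.0)

raw = [0.003, 0.04, 0.20, 0.51]   # e.g. one raw p-value per conditional slice
print(bonferroni(raw))            # 0.012, 0.16, 0.8, 1.0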
>>> >>> Bruce >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >>> >>> >>> >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >>> >>> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From david at silveregg.co.jp Mon Jun 7 21:03:51 2010 From: david at silveregg.co.jp (David) Date: Tue, 08 Jun 2010 10:03:51 +0900 Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 In-Reply-To: References: Message-ID: <4C0D96F7.1000406@silveregg.co.jp> On 06/08/2010 01:56 AM, Ralf Gommers wrote: > > > On Mon, Jun 7, 2010 at 10:20 PM, Charles R Harris > > wrote: > > > > On Mon, Jun 7, 2010 at 5:55 AM, Ralf Gommers > > > wrote: > > > > On Mon, Jun 7, 2010 at 7:34 PM, Neil Crighton > > wrote: > > Ralf Gommers googlemail.com > > writes: > > > I'm pleased to announce the first beta release of SciPy > > 0.8.0.SciPy is a package of tools for science and engineering > > for Python.It includes modules for statistics, optimization, > > integration, linearalgebra, Fourier transforms, signal and > > image processing, ODE solvers, and more.This beta release > comes > > almost one and a half year after the 0.7.0 release > andcontains > > many new features, numerous bug-fixes, improved testcoverage, > > and better documentation. Please note that SciPy 0.8.0b1 > > requires Python 2.4 or greater and NumPy 1.4.1 or greater. > > Thanks for getting the beta out! > > The release notes say Numpy 1.3 or greater is needed - is this > correct? Above you say 1.4.1 is needed. > > > No, 1.4.1 is needed. Notes are fixed now. > > I think "support for > Python 3 in Scipy might not yet be included in Scipy 0.8" is too > ambiguous. Just say 0.8 will not be compatible with Python > 3, but > we expect the next version (0.9?) to be compatible, if > that's the > case. > > Reworded as: > "Python 3 compatibility is planned and is currently technically > feasible, since Numpy has been ported. However, since the Python 3 > compatible Numpy 2.0 has not been released yet, support for Python 3 > in Scipy is not yet included in Scipy 0.8. SciPy 0.9, planned > for fall > 2010, will very likely include experimental support for Python 3." > > > Are we going to release a Numpy 1.5? > > > Yes. Guess I should reread such a paragraph a few times before committing. > > The only reason I've not made a 1.5 branch yet is I will only have time > for a numpy release cycle at or towards the end of this scipy release > cycle. Saves some backporting. If you think it'd be useful to do it now > please let me know. I don't think we should make the 1.5 branch now - there is a lot of things missing, and I would really like to put everything that is needed for python 3.x support in scipy in the 1.5 release. And AFAIK, we have not cleaned up the branch to make it ABI compatible with 1.4.x. 
I can't give a hard timeline, but I hope to have some time during euroscipy, cheers, David From ralf.gommers at googlemail.com Mon Jun 7 21:10:58 2010 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Tue, 8 Jun 2010 09:10:58 +0800 Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 In-Reply-To: <4C0D96F7.1000406@silveregg.co.jp> References: <4C0D96F7.1000406@silveregg.co.jp> Message-ID: On Tue, Jun 8, 2010 at 9:03 AM, David wrote: > On 06/08/2010 01:56 AM, Ralf Gommers wrote: > > > > > > On Mon, Jun 7, 2010 at 10:20 PM, Charles R Harris > > > wrote: > > > > > > > > On Mon, Jun 7, 2010 at 5:55 AM, Ralf Gommers > > > > > wrote: > > > > > > > > On Mon, Jun 7, 2010 at 7:34 PM, Neil Crighton > > > wrote: > > > > Ralf Gommers googlemail.com > > > writes: > > > > > I'm pleased to announce the first beta release of SciPy > > > 0.8.0.SciPy is a package of tools for science and > engineering > > > for Python.It includes modules for statistics, > optimization, > > > integration, linearalgebra, Fourier transforms, signal and > > > image processing, ODE solvers, and more.This beta release > > comes > > > almost one and a half year after the 0.7.0 release > > andcontains > > > many new features, numerous bug-fixes, improved > testcoverage, > > > and better documentation. Please note that SciPy 0.8.0b1 > > > requires Python 2.4 or greater and NumPy 1.4.1 or greater. > > > > Thanks for getting the beta out! > > > > The release notes say Numpy 1.3 or greater is needed - is > this > > correct? Above you say 1.4.1 is needed. > > > > > > No, 1.4.1 is needed. Notes are fixed now. > > > > I think "support for > > Python 3 in Scipy might not yet be included in Scipy 0.8" is > too > > ambiguous. Just say 0.8 will not be compatible with Python > > 3, but > > we expect the next version (0.9?) to be compatible, if > > that's the > > case. > > > > Reworded as: > > "Python 3 compatibility is planned and is currently technically > > feasible, since Numpy has been ported. However, since the Python > 3 > > compatible Numpy 2.0 has not been released yet, support for > Python 3 > > in Scipy is not yet included in Scipy 0.8. SciPy 0.9, planned > > for fall > > 2010, will very likely include experimental support for Python > 3." > > > > > > Are we going to release a Numpy 1.5? > > > > > > Yes. Guess I should reread such a paragraph a few times before > committing. > > > > The only reason I've not made a 1.5 branch yet is I will only have time > > for a numpy release cycle at or towards the end of this scipy release > > cycle. Saves some backporting. If you think it'd be useful to do it now > > please let me know. > > I don't think we should make the 1.5 branch now - there is a lot of > things missing, and I would really like to put everything that is needed > for python 3.x support in scipy in the 1.5 release. Threads are mixing a bit, but we're talking about numpy here. I thought numpy 3.x support was pretty much finished? > And AFAIK, we have > not cleaned up the branch to make it ABI compatible with 1.4.x. > > That should be done after making the branch right? If you remove datetime in trunk you're just going to have to put it back later. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From david at silveregg.co.jp Mon Jun 7 21:15:10 2010 From: david at silveregg.co.jp (David) Date: Tue, 08 Jun 2010 10:15:10 +0900 Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 In-Reply-To: References: <4C0D96F7.1000406@silveregg.co.jp> Message-ID: <4C0D999E.9040704@silveregg.co.jp> On 06/08/2010 10:10 AM, Ralf Gommers wrote: > > > On Tue, Jun 8, 2010 at 9:03 AM, David > wrote: > > On 06/08/2010 01:56 AM, Ralf Gommers wrote: > > > > > > On Mon, Jun 7, 2010 at 10:20 PM, Charles R Harris > > > >> wrote: > > > > > > > > On Mon, Jun 7, 2010 at 5:55 AM, Ralf Gommers > > > >> > > wrote: > > > > > > > > On Mon, Jun 7, 2010 at 7:34 PM, Neil Crighton > > > >> wrote: > > > > Ralf Gommers googlemail.com > > > > writes: > > > > > I'm pleased to announce the first beta release of SciPy > > > 0.8.0.SciPy is a package of tools for science and engineering > > > for Python.It includes modules for statistics, optimization, > > > integration, linearalgebra, Fourier transforms, signal and > > > image processing, ODE solvers, and more.This beta release > > comes > > > almost one and a half year after the 0.7.0 release > > andcontains > > > many new features, numerous bug-fixes, improved testcoverage, > > > and better documentation. Please note that SciPy 0.8.0b1 > > > requires Python 2.4 or greater and NumPy 1.4.1 or greater. > > > > Thanks for getting the beta out! > > > > The release notes say Numpy 1.3 or greater is needed > - is this > > correct? Above you say 1.4.1 is needed. > > > > > > No, 1.4.1 is needed. Notes are fixed now. > > > > I think "support for > > Python 3 in Scipy might not yet be included in Scipy > 0.8" is too > > ambiguous. Just say 0.8 will not be compatible with > Python > > 3, but > > we expect the next version (0.9?) to be compatible, if > > that's the > > case. > > > > Reworded as: > > "Python 3 compatibility is planned and is currently technically > > feasible, since Numpy has been ported. However, since the > Python 3 > > compatible Numpy 2.0 has not been released yet, support > for Python 3 > > in Scipy is not yet included in Scipy 0.8. SciPy 0.9, > planned > > for fall > > 2010, will very likely include experimental support for > Python 3." > > > > > > Are we going to release a Numpy 1.5? > > > > > > Yes. Guess I should reread such a paragraph a few times before > committing. > > > > The only reason I've not made a 1.5 branch yet is I will only > have time > > for a numpy release cycle at or towards the end of this scipy release > > cycle. Saves some backporting. If you think it'd be useful to do > it now > > please let me know. > > I don't think we should make the 1.5 branch now - there is a lot of > things missing, and I would really like to put everything that is needed > for python 3.x support in scipy in the 1.5 release. > > > Threads are mixing a bit, but we're talking about numpy here. I thought > numpy 3.x support was pretty much finished? Yes, but to make scipy compatible with 3.x, it is easier to add some stuff in numpy.distutils, etc... for scipy. > > And AFAIK, we have > not cleaned up the branch to make it ABI compatible with 1.4.x. > > That should be done after making the branch right? If you remove > datetime in trunk you're just going to have to put it back later. But we agreed to remove it, right ? What if we decide to have a 1.6, etc... ? 
cheers, David From charlesr.harris at gmail.com Mon Jun 7 21:20:22 2010 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 7 Jun 2010 19:20:22 -0600 Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 In-Reply-To: <4C0D999E.9040704@silveregg.co.jp> References: <4C0D96F7.1000406@silveregg.co.jp> <4C0D999E.9040704@silveregg.co.jp> Message-ID: On Mon, Jun 7, 2010 at 7:15 PM, David wrote: > On 06/08/2010 10:10 AM, Ralf Gommers wrote: > > > > > > On Tue, Jun 8, 2010 at 9:03 AM, David > > wrote: > > > > On 06/08/2010 01:56 AM, Ralf Gommers wrote: > > > > > > > > > On Mon, Jun 7, 2010 at 10:20 PM, Charles R Harris > > > > > > >> wrote: > > > > > > > > > > > > On Mon, Jun 7, 2010 at 5:55 AM, Ralf Gommers > > > > > > >> > > > wrote: > > > > > > > > > > > > On Mon, Jun 7, 2010 at 7:34 PM, Neil Crighton > > > > > >> > wrote: > > > > > > Ralf Gommers googlemail.com > > > > > > writes: > > > > > > > I'm pleased to announce the first beta release of SciPy > > > > 0.8.0.SciPy is a package of tools for science and engineering > > > > for Python.It includes modules for statistics, optimization, > > > > integration, linearalgebra, Fourier transforms, signal and > > > > image processing, ODE solvers, and more.This beta release > > > comes > > > > almost one and a half year after the 0.7.0 release > > > andcontains > > > > many new features, numerous bug-fixes, improved testcoverage, > > > > and better documentation. Please note that SciPy 0.8.0b1 > > > > requires Python 2.4 or greater and NumPy 1.4.1 or greater. > > > > > > Thanks for getting the beta out! > > > > > > The release notes say Numpy 1.3 or greater is needed > > - is this > > > correct? Above you say 1.4.1 is needed. > > > > > > > > > No, 1.4.1 is needed. Notes are fixed now. > > > > > > I think "support for > > > Python 3 in Scipy might not yet be included in Scipy > > 0.8" is too > > > ambiguous. Just say 0.8 will not be compatible with > > Python > > > 3, but > > > we expect the next version (0.9?) to be compatible, if > > > that's the > > > case. > > > > > > Reworded as: > > > "Python 3 compatibility is planned and is currently technically > > > feasible, since Numpy has been ported. However, since the > > Python 3 > > > compatible Numpy 2.0 has not been released yet, support > > for Python 3 > > > in Scipy is not yet included in Scipy 0.8. SciPy 0.9, > > planned > > > for fall > > > 2010, will very likely include experimental support for > > Python 3." > > > > > > > > > Are we going to release a Numpy 1.5? > > > > > > > > > Yes. Guess I should reread such a paragraph a few times before > > committing. > > > > > > The only reason I've not made a 1.5 branch yet is I will only > > have time > > > for a numpy release cycle at or towards the end of this scipy > release > > > cycle. Saves some backporting. If you think it'd be useful to do > > it now > > > please let me know. > > > > I don't think we should make the 1.5 branch now - there is a lot of > > things missing, and I would really like to put everything that is > needed > > for python 3.x support in scipy in the 1.5 release. > > > > > > Threads are mixing a bit, but we're talking about numpy here. I thought > > numpy 3.x support was pretty much finished? > > Yes, but to make scipy compatible with 3.x, it is easier to add some > stuff in numpy.distutils, etc... for scipy. > > > > > And AFAIK, we have > > not cleaned up the branch to make it ABI compatible with 1.4.x. > > > > That should be done after making the branch right? 
If you remove > > datetime in trunk you're just going to have to put it back later. > > But we agreed to remove it, right ? What if we decide to have a 1.6, > etc... ? > > I've been toying with the idea that the trunk should be branched, with one branch for the datetime and other API changes and another that is compatible with 1.4, 1.5, etc. When the changes are ready, they can then be merged back in. Of course, this will all be easier when the GIT transition is finished. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From nwagner at iam.uni-stuttgart.de Tue Jun 8 04:55:25 2010 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Tue, 08 Jun 2010 10:55:25 +0200 Subject: [SciPy-Dev] ERROR: Failure: SyntaxError (invalid syntax (test_distributions.py, line 391) Message-ID: Hi all, I am using >>> numpy.__version__ '2.0.0.dev8460' >>> import scipy >>> scipy.__version__ '0.9.0.dev6493' and I found some (new) errors ====================================================================== ERROR: test_continuous_basic.test_cont_basic(, (), 'wald') ---------------------------------------------------------------------- Traceback (most recent call last): File "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/case.py", line 183, in runTest self.test(*self.arg) File "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/tests/test_continuous_basic.py", line 291, in check_cdf_ppf npt.assert_almost_equal(distfn.cdf(distfn.ppf([0.001,0.5,0.999], *arg), *arg), File "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/distributions.py", line 1324, in ppf place(output,cond,self._ppf(*goodargs)*scale + loc) File "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/distributions.py", line 1028, in _ppf return self.vecfunc(q,*args) File "/data/home/nwagner/local/lib/python2.5/site-packages/numpy/lib/function_base.py", line 1794, in __call__ theout = self.thefunc(*newargs) File "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/distributions.py", line 974, in _ppf_single_call return optimize.brentq(self._ppf_to_solve, self.xa, self.xb, args=(q,)+args, xtol=self.xtol) File "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/optimize/zeros.py", line 262, in brentq r = _zeros._brentq(f,a,b,xtol,maxiter,args,full_output,disp) ValueError: f(a) and f(b) must have different signs ====================================================================== ERROR: Failure: SyntaxError (invalid syntax (test_distributions.py, line 391)) ---------------------------------------------------------------------- Traceback (most recent call last): File "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/loader.py", line 379, in loadTestsFromName addr.filename, addr.module) File "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/importer.py", line 39, in importFromPath return self.importFromDir(dir_path, fqname) File "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/importer.py", line 86, in importFromDir mod = load_module(part_fqname, fh, filename, desc) File "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/tests/test_distributions.py", line 391 res = distfunc.rvs(*args, size=200) ^ SyntaxError: invalid syntax ====================================================================== ERROR: test_mpmath.test_expi_complex ---------------------------------------------------------------------- Traceback (most 
recent call last): File "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/case.py", line 183, in runTest self.test(*self.arg) File "/data/home/nwagner/local/lib/python2.5/site-packages/numpy/testing/decorators.py", line 146, in skipper_func return f(*args, **kwargs) File "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/special/tests/test_mpmath.py", line 46, in test_expi_complex dataset = np.array(dataset, dtype=np.complex_) TypeError: a float is required Nils From bsouthey at gmail.com Tue Jun 8 11:40:59 2010 From: bsouthey at gmail.com (Bruce Southey) Date: Tue, 08 Jun 2010 10:40:59 -0500 Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 In-Reply-To: References: <4C0D96F7.1000406@silveregg.co.jp> Message-ID: <4C0E648B.10204@gmail.com> Hi, I got 2 errors and 1 failure when I installed the beta using Python 2.6 (Linux 64-bit) with numpy '2.0.0.dev8445' . Can we get a fix for ticket 1152 or at least mark it as known? http://projects.scipy.org/scipy/ticket/1152 The others are below. There are also a number of overflow warnings that should be checked and avoided. The same warnings also occur in test_continuous_basic for certain distributions. test_iv_cephes_vs_amos (test_basic.TestBessel) ... Warning: overflow encountered in iv Warning: overflow encountered in iv Warning: invalid value encountered in isinf Should these be tickets? Bruce ====================================================================== ERROR: Ticket #1124. ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib64/python2.6/site-packages/scipy/signal/tests/test_signaltools.py", line 287, in test_none signal.medfilt(None) File "/usr/lib64/python2.6/site-packages/scipy/signal/signaltools.py", line 317, in medfilt return sigtools._order_filterND(volume,domain,order) ValueError: order_filterND not available for this type ====================================================================== FAIL: test_random_real (test_basic.TestSingleIFFT) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib64/python2.6/site-packages/scipy/fftpack/tests/test_basic.py", line 205, in test_random_real assert_array_almost_equal (y1, x) File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 774, in assert_array_almost_equal header='Arrays are not almost equal') File "/usr/lib64/python2.6/site-packages/numpy/testing/utils.py", line 618, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not almost equal (mismatch 0.900900900901%) x: array([ 0.41364330 +5.90676663e-09j, 0.80715483 +2.64462052e-08j, 0.05271048 -3.67830459e-08j, 0.72591031 -9.31092980e-09j, 0.35162351 +1.40012923e-09j, 0.17632297 -1.25899486e-08j,... y: array([ 0.41364321, 0.80715483, 0.05271063, 0.72591019, 0.35162321, 0.17632306, 0.3850981 , 0.75712842, 0.68898875, 0.52632052, 0.69728118, 0.68721569, 0.69135427, 0.34033701, 0.65788335,... 
>> raise AssertionError('\nArrays are not almost equal\n\n(mismatch 0.900900900901%)\n x: array([ 0.41364330 +5.90676663e-09j, 0.80715483 +2.64462052e-08j,\n 0.05271048 -3.67830459e-08j, 0.72591031 -9.31092980e-09j,\n 0.35162351 +1.40012923e-09j, 0.17632297 -1.25899486e-08j,...\n y: array([ 0.41364321, 0.80715483, 0.05271063, 0.72591019, 0.35162321,\n 0.17632306, 0.3850981 , 0.75712842, 0.68898875, 0.52632052,\n 0.69728118, 0.68721569, 0.69135427, 0.34033701, 0.65788335,...') ---------------------------------------------------------------------- From stefan.czesla at hs.uni-hamburg.de Tue Jun 8 11:51:33 2010 From: stefan.czesla at hs.uni-hamburg.de (Stefan) Date: Tue, 8 Jun 2010 15:51:33 +0000 (UTC) Subject: [SciPy-Dev] np.savetxt: apply patch in enhancement ticket 1079 to add headers? References: <4C066DA3.8010609@gmail.com> <4C09410F.1010900@gmail.com> Message-ID: Hi all, dear Bruce and Skipper, we very much appreciate your feedback. In response to Skipper's annotation we added a paragraph in the notes section and also tried to indicate the purpose of the keywords more precisely in the parameter section. The keyword renaming suggested by Bruce lead to some internal discussions here. We also were not 100% satisfied with the 'comments-comment_character' solution proposed in the first patch, and we see the conflict with loadtxt. Yet, also the combination of 'Preamble-Comments' appears, somewhat, awkward, because both seem to indicate the same, at least in our opinion. We appreciate Bruce's suggestion to call the keyword Preamble, because it expresses its purpose much more clearly than 'Comments' did. For the same reason, we decided to stay with 'comment_character' instead of 'Comments'. For the sake of clarity, this solution sacrifices full compatibility with np.loadtxt, but it does not create a conflict either. An adapted patch is available via ticket 1079 at: http://projects.scipy.org/numpy/ticket/1079 Christian & Stefan From charlesr.harris at gmail.com Tue Jun 8 11:57:04 2010 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 8 Jun 2010 09:57:04 -0600 Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 In-Reply-To: <4C0E648B.10204@gmail.com> References: <4C0D96F7.1000406@silveregg.co.jp> <4C0E648B.10204@gmail.com> Message-ID: On Tue, Jun 8, 2010 at 9:40 AM, Bruce Southey wrote: > Hi, > I got 2 errors and 1 failure when I installed the beta using Python 2.6 > (Linux 64-bit) with numpy '2.0.0.dev8445' . > > Can we get a fix for ticket 1152 or at least mark it as known? > http://projects.scipy.org/scipy/ticket/1152 > > The others are below. > > There are also a number of overflow warnings that should be checked and > avoided. The same warnings also occur in test_continuous_basic for > certain distributions. > test_iv_cephes_vs_amos (test_basic.TestBessel) ... Warning: overflow > encountered in iv > Warning: overflow encountered in iv > Warning: invalid value encountered in isinf > > Numpy revision r8455 fixes the isinf warnings for most platforms. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ilanschnell at gmail.com Tue Jun 8 13:51:01 2010 From: ilanschnell at gmail.com (Ilan Schnell) Date: Tue, 8 Jun 2010 12:51:01 -0500 Subject: [SciPy-Dev] ANN: EPD 6.2 released Message-ID: Hello, I am pleased to announce that EPD (Enthought Python Distribution) version 6.2 has been released. This release includes an update to Python 2.6.5, SciPy 0.8.0beta1, as well updates to many other packages and bug fixes. 
You can find a complete list of updates in the change log: http://www.enthought.com/EPDChangelog.html To find more information about EPD, as well as download a 30 day free trial, visit this page: http://www.enthought.com/products/epd.php In order to be able to serve the Python community better, we made a small survey. Please consider taking a few minutes: http://www.surveygizmo.com/s/307237/epd-user-feedback About EPD --------- The Enthought Python Distribution (EPD) is a "kitchen-sink-included" distribution of the Python Programming Language, including over 80 additional tools and libraries. The EPD bundle includes NumPy, SciPy, IPython, 2D and 3D visualization, and many other tools. http://www.enthought.com/products/epdlibraries.php It is currently available as a single-click installer for Windows XP, Vista and 7, MacOS (10.5 and 10.6), RedHat 3, 4 and 5, as well as Solaris 10 (x86 and x86_64/amd64 on all platforms). The 32-bit EPD is free for academic use. An annual subscription including installation support is available for individual and commercial use. Additional support options, including customization, bug fixes and training classes are also available: http://www.enthought.com/products/support_level_table.php - Ilan From josef.pktd at gmail.com Tue Jun 8 15:38:59 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 8 Jun 2010 15:38:59 -0400 Subject: [SciPy-Dev] ERROR: Failure: SyntaxError (invalid syntax (test_distributions.py, line 391) In-Reply-To: References: Message-ID: On Tue, Jun 8, 2010 at 4:55 AM, Nils Wagner wrote: > Hi all, > > I am using > >>>> numpy.__version__ > '2.0.0.dev8460' >>>> import scipy >>>> scipy.__version__ > '0.9.0.dev6493' > > and I found some (new) errors > > ====================================================================== > ERROR: > test_continuous_basic.test_cont_basic( object at 0x4cb5c90>, (), 'wald') > ---------------------------------------------------------------------- > Traceback (most recent call last): > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/case.py", > line 183, in runTest > ? ? self.test(*self.arg) > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/tests/test_continuous_basic.py", > line 291, in check_cdf_ppf > ? ? npt.assert_almost_equal(distfn.cdf(distfn.ppf([0.001,0.5,0.999], > *arg), *arg), > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/distributions.py", > line 1324, in ppf > ? ? place(output,cond,self._ppf(*goodargs)*scale + loc) > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/distributions.py", > line 1028, in _ppf > ? ? return self.vecfunc(q,*args) > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/numpy/lib/function_base.py", > line 1794, in __call__ > ? ? theout = self.thefunc(*newargs) > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/distributions.py", > line 974, in _ppf_single_call > ? ? return optimize.brentq(self._ppf_to_solve, self.xa, > self.xb, args=(q,)+args, xtol=self.xtol) > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/optimize/zeros.py", > line 262, in brentq > ? ? 
r = > _zeros._brentq(f,a,b,xtol,maxiter,args,full_output,disp) > ValueError: f(a) and f(b) must have different signs looking at changeset 6472, it looks like there are two possible errors using logcdf instead of cdf and switching to the internal method (underline) which might not do correct bounds handling (but I'm not sure about the latter) 4289 return invnorm.cdf(x,1,0) 4291 return invnorm._logcdf(x, 1.0) reverting this line, I guess, fixes it > > ====================================================================== > ERROR: Failure: SyntaxError (invalid syntax > (test_distributions.py, line 391)) > ---------------------------------------------------------------------- > Traceback (most recent call last): > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/loader.py", > line 379, in loadTestsFromName > ? ? addr.filename, addr.module) > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/importer.py", > line 39, in importFromPath > ? ? return self.importFromDir(dir_path, fqname) > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/importer.py", > line 86, in importFromDir > ? ? mod = load_module(part_fqname, fh, filename, desc) > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/tests/test_distributions.py", > line 391 > ? ? res = distfunc.rvs(*args, size=200) > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?^ > SyntaxError: invalid syntax I think here the arguments need to be reversed res = distfunc.rvs(size=200, *args) Josef > > ====================================================================== > ERROR: test_mpmath.test_expi_complex > ---------------------------------------------------------------------- > Traceback (most recent call last): > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/case.py", > line 183, in runTest > ? ? self.test(*self.arg) > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/numpy/testing/decorators.py", > line 146, in skipper_func > ? ? return f(*args, **kwargs) > ? File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/special/tests/test_mpmath.py", > line 46, in test_expi_complex > ? ? 
dataset = np.array(dataset, dtype=np.complex_) > TypeError: a float is required > > Nils > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From oliphant at enthought.com Wed Jun 9 00:29:25 2010 From: oliphant at enthought.com (Travis Oliphant) Date: Tue, 8 Jun 2010 23:29:25 -0500 Subject: [SciPy-Dev] ERROR: Failure: SyntaxError (invalid syntax (test_distributions.py, line 391) In-Reply-To: References: Message-ID: <3520C593-6833-4071-8CC1-85D44C7A12FF@enthought.com> On Jun 8, 2010, at 3:55 AM, Nils Wagner wrote: > Hi all, > > I am using > >>>> numpy.__version__ > '2.0.0.dev8460' >>>> import scipy >>>> scipy.__version__ > '0.9.0.dev6493' > > and I found some (new) errors > > ====================================================================== > ERROR: > test_continuous_basic.test_cont_basic( object at 0x4cb5c90>, (), 'wald') > ---------------------------------------------------------------------- > Traceback (most recent call last): > File > "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/case.py", > line 183, in runTest > self.test(*self.arg) > File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/tests/test_continuous_basic.py", > line 291, in check_cdf_ppf > npt.assert_almost_equal(distfn.cdf(distfn.ppf([0.001,0.5,0.999], > *arg), *arg), > File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/distributions.py", > line 1324, in ppf > place(output,cond,self._ppf(*goodargs)*scale + loc) > File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/distributions.py", > line 1028, in _ppf > return self.vecfunc(q,*args) > File > "/data/home/nwagner/local/lib/python2.5/site-packages/numpy/lib/function_base.py", > line 1794, in __call__ > theout = self.thefunc(*newargs) > File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/distributions.py", > line 974, in _ppf_single_call > return optimize.brentq(self._ppf_to_solve, self.xa, > self.xb, args=(q,)+args, xtol=self.xtol) > File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/optimize/zeros.py", > line 262, in brentq > r = > _zeros._brentq(f,a,b,xtol,maxiter,args,full_output,disp) > ValueError: f(a) and f(b) must have different signs > > ====================================================================== > ERROR: Failure: SyntaxError (invalid syntax > (test_distributions.py, line 391)) > ---------------------------------------------------------------------- > Traceback (most recent call last): > File > "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/loader.py", > line 379, in loadTestsFromName > addr.filename, addr.module) > File > "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/importer.py", > line 39, in importFromPath > return self.importFromDir(dir_path, fqname) > File > "/data/home/nwagner/local/lib/python2.5/site-packages/nose-0.11.1-py2.5.egg/nose/importer.py", > line 86, in importFromDir > mod = load_module(part_fqname, fh, filename, desc) > File > "/data/home/nwagner/local/lib/python2.5/site-packages/scipy/stats/tests/test_distributions.py", > line 391 > res = distfunc.rvs(*args, size=200) > ^ > SyntaxError: invalid syntax The above two should be fixed in trunk. This last one is an old syntax issue with not being able to pass keyword arguments after *args without building the dictionary. I don't know what is causing the error below. 
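A minimal sketch of the call-syntax rule being discussed, for illustration only (the function rvs_demo below is a hypothetical stand-in, not scipy code; only the call forms matter):

    def rvs_demo(*shape_args, **kwds):
        # stand-in for a distribution's rvs(); just echoes what it received
        return shape_args, kwds.get('size')

    args = (2.5,)

    # SyntaxError on Python 2.5: a keyword argument may not follow *args in a call
    # res = rvs_demo(*args, size=200)

    # Both forms below are accepted by Python 2.5 and later:
    res = rvs_demo(size=200, *args)          # keyword first, then *args (the reordering Josef suggests)
    res = rvs_demo(*args, **{'size': 200})   # or build the keyword dictionary explicitly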
-Travis From nwagner at iam.uni-stuttgart.de Wed Jun 9 03:25:46 2010 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Wed, 09 Jun 2010 09:25:46 +0200 Subject: [SciPy-Dev] test_complex_dotc (test_blas.TestFBLAS1Simple) ... Message-ID: Hi all, I installed numpy and scipy via svn on CentOS release 5.2 I have used the prebuild blas and lapack libraries (see below). scipy.test('1','10') segfaults in test_complex_dotc (test_blas.TestFBLAS1Simple) ... Program received signal SIGSEGV, Segmentation fault. 0x00002aaab719a257 in cdotc_ () from /usr/lib64/libblas.so.3 (gdb) bt #0 0x00002aaab719a257 in cdotc_ () from /usr/lib64/libblas.so.3 #1 0x00002aaab913d0c1 in f2py_rout_fblas_cdotc (capi_self=, capi_args=, capi_keywds=, f2py_func=0x2aaab9140720 ) at build/src.linux-x86_64-2.4/build/src.linux-x86_64-2.4/scipy/linalg/fblasmodule.c:5310 Is this a known issue ? Any pointer would be appreciated. Cheers, Nils rpm -qi blas Name : blas Relocations: (not relocatable) Version : 3.0 Vendor: CentOS Release : 37.el5 Build Date: Sa 06 Jan 2007 17:21:23 CET Install Date: Di 08 Jun 2010 15:57:57 CEST Build Host: builder5.centos.org Group : Development/Libraries Source RPM: lapack-3.0-37.el5.src.rpm Size : 695196 License: Freely distributable Signature : DSA/SHA1, Mi 04 Apr 2007 02:22:00 CEST, Key ID a8a447dce8562897 URL : http://www.netlib.org/lapack/ Summary : Die BLAS (Basic Linear Algebra Subprograms)-Bibliothek. Description : BLAS (Basic Linear Algebra Subprograms) is a standard library which provides a number of basic algorithms for numerical algebra. Man pages for blas are available in the blas-man package. rpm -qi lapack Name : lapack Relocations: (not relocatable) Version : 3.0 Vendor: CentOS Release : 37.el5 Build Date: Sa 06 Jan 2007 17:21:23 CET Install Date: Di 08 Jun 2010 15:58:12 CEST Build Host: builder5.centos.org Group : Development/Libraries Source RPM: lapack-3.0-37.el5.src.rpm Size : 5910874 License: Freely distributable Signature : DSA/SHA1, Mi 04 Apr 2007 02:24:47 CEST, Key ID a8a447dce8562897 URL : http://www.netlib.org/lapack/ From nwagner at iam.uni-stuttgart.de Wed Jun 9 04:05:53 2010 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Wed, 09 Jun 2010 10:05:53 +0200 Subject: [SciPy-Dev] Building rpms Message-ID: Hi all, I tried to build rpms from numpy and scipy. It failed with tar -cf dist/numpy-2.0.0.dev8460.tar numpy-2.0.0.dev8460 gzip -f9 dist/numpy-2.0.0.dev8460.tar removing 'numpy-2.0.0.dev8460' (and everything under it) copying dist/numpy-2.0.0.dev8460.tar.gz -> build/bdist.linux-x86_64/rpm/SOURCES building RPMs rpm -ba --define _topdir /data/home/nwagner/svn/numpy/build/bdist.linux-x86_64/rpm --clean build/bdist.linux-x86_64/rpm/SPECS/numpy.spec -ba: unknown option error: command 'rpm' failed with exit status 1 Any idea ? Nils From cournape at gmail.com Wed Jun 9 05:55:32 2010 From: cournape at gmail.com (David Cournapeau) Date: Wed, 9 Jun 2010 18:55:32 +0900 Subject: [SciPy-Dev] Building rpms In-Reply-To: References: Message-ID: On Wed, Jun 9, 2010 at 5:05 PM, Nils Wagner wrote: > Hi all, > > I tried to build rpms from numpy and scipy. 
> > It failed with > > tar -cf dist/numpy-2.0.0.dev8460.tar numpy-2.0.0.dev8460 > gzip -f9 dist/numpy-2.0.0.dev8460.tar > removing 'numpy-2.0.0.dev8460' (and everything under it) > copying dist/numpy-2.0.0.dev8460.tar.gz -> > build/bdist.linux-x86_64/rpm/SOURCES > building RPMs > rpm -ba --define _topdir > /data/home/nwagner/svn/numpy/build/bdist.linux-x86_64/rpm > --clean build/bdist.linux-x86_64/rpm/SPECS/numpy.spec > -ba: unknown option -ba should be an option for rpmbuild, not rpm. I don't know why distutils calls rpm here, David From nwagner at iam.uni-stuttgart.de Wed Jun 9 07:32:28 2010 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Wed, 09 Jun 2010 13:32:28 +0200 Subject: [SciPy-Dev] Building rpms In-Reply-To: References: Message-ID: On Wed, 9 Jun 2010 18:55:32 +0900 David Cournapeau wrote: > On Wed, Jun 9, 2010 at 5:05 PM, Nils Wagner > wrote: >> Hi all, >> >> I tried to build rpms from numpy and scipy. >> >> It failed with >> >> tar -cf dist/numpy-2.0.0.dev8460.tar numpy-2.0.0.dev8460 >> gzip -f9 dist/numpy-2.0.0.dev8460.tar >> removing 'numpy-2.0.0.dev8460' (and everything under it) >> copying dist/numpy-2.0.0.dev8460.tar.gz -> >> build/bdist.linux-x86_64/rpm/SOURCES >> building RPMs >> rpm -ba --define _topdir >> /data/home/nwagner/svn/numpy/build/bdist.linux-x86_64/rpm >> --clean build/bdist.linux-x86_64/rpm/SPECS/numpy.spec >> -ba: unknown option > > -ba should be an option for rpmbuild, not rpm. I don't >know why > distutils calls rpm here, > > David Exactly. It looks like a bug in distutils. However, as soon as I have installed the rpm-build.rpm package on CentOS it works for me. Nils From m.boumans at gmx.net Sun Jun 6 01:45:28 2010 From: m.boumans at gmx.net (bowie_22) Date: Sun, 6 Jun 2010 05:45:28 +0000 (UTC) Subject: [SciPy-Dev] ANN: SciPy 0.8.0 beta 1 References: Message-ID: Ralf Gommers googlemail.com> writes: > > I'm pleased to announce the first beta release of SciPy 0.8.0.SciPy is a package of tools for science and engineering for Python.It includes modules for statistics, optimization, integration, linearalgebra, Fourier transforms, signal and image processing, ODE solvers, > and more.This beta release comes almost one and a half year after the 0.7.0 release andcontains many new features, numerous bug-fixes, improved testcoverage, and better documentation. ?Please note that SciPy 0.8.0b1 > requires Python 2.4 or greater and NumPy 1.4.1 or greater.For information, please see the release notes:http://sourceforge.net/projects/scipy/files/scipy/0.8.0b1/NOTES.txt/viewYou can download the release from here:https://sourceforge.net/projects/scipy/Python 2.5/2.6 binaries for Windows and OS X are available as well as source tarballs for other platforms. Thank you to everybody who contributed to this release.Cheers,Ralf > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > Hello everybody, I just have read the announcement for SciPy 0.8.0 and of course I have installed it immediatly. I am quite new in Scipy (coming from Matlab) and I thought a good starting point for a contribution would be to review and check the Scipy documentation. I added some hints in http://docs.scipy.org/numpy/Front%20Page/ and now I ask myself how the release of the documentation is conntected to the release of a new scipy version. Is it connected at all? Browsing throw the docs give at http://docs.scipy.org/doc shows a documentation for scipy 0.7. Does "...and better documentation..." 
mean an improvement in the docstrings (As I am still not sure which place is the best to look at)? As a scipy rookie I would appreciate same information about this topics (release of documentation and release of a new scipy package) Thank you! Regs Marcus From amcmorl at gmail.com Thu Jun 10 10:22:24 2010 From: amcmorl at gmail.com (Angus McMorland) Date: Thu, 10 Jun 2010 10:22:24 -0400 Subject: [SciPy-Dev] Docstrings permissions Message-ID: I've found a scipy docstring that needs slight adjustment. Please can someone give me edit permissions on the docstring site: I'll try to make this the impetus I need to get contributing in general. I've registered an account under the name amcmorl. Thanks all, Angus. -- AJC McMorland Post-doctoral research fellow Neurobiology, University of Pittsburgh From d.l.goldsmith at gmail.com Thu Jun 10 17:21:56 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 10 Jun 2010 14:21:56 -0700 Subject: [SciPy-Dev] Marathon skypecon tomorrow and an agenda item Message-ID: Agenda item: no one has "registered" themselves to work on any of the Milestones - was that a bad idea? If so, what are some other things we can do to kick-start this thing? DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From vincent at vincentdavis.net Thu Jun 10 19:26:15 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Thu, 10 Jun 2010 17:26:15 -0600 Subject: [SciPy-Dev] Marathon skypecon tomorrow and an agenda item In-Reply-To: References: Message-ID: On Thu, Jun 10, 2010 at 3:21 PM, David Goldsmith wrote: > Agenda item: no one has "registered" themselves to work on any of the > Milestones - was that a bad idea?? If so, what are some other things we can > do to kick-start this thing? Well guess I either missed the list or looked and didn't think I was capable of any of them. So could you point me to the list (again?). > > DG > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From d.l.goldsmith at gmail.com Thu Jun 10 19:31:01 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 10 Jun 2010 16:31:01 -0700 Subject: [SciPy-Dev] Marathon skypecon tomorrow and an agenda item In-Reply-To: References: Message-ID: On Thu, Jun 10, 2010 at 4:26 PM, Vincent Davis wrote: > On Thu, Jun 10, 2010 at 3:21 PM, David Goldsmith > wrote: > > Agenda item: no one has "registered" themselves to work on any of the > > Milestones - was that a bad idea? If so, what are some other things we > can > > do to kick-start this thing? > > Well guess I either missed the list or looked and didn't think I was > capable of any of them. So could you point me to the list (again?). > http://docs.scipy.org/scipy/Milestones/ DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From vincent at vincentdavis.net Thu Jun 10 19:32:06 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Thu, 10 Jun 2010 17:32:06 -0600 Subject: [SciPy-Dev] Data Types documentation page questions? Message-ID: Regarding this page http://docs.scipy.org/doc/numpy/user/basics.types.html I assume there is a "-" missing here int64 Integer (9223372036854775808 to 9223372036854775807) Also I would suggest that the intervals on these data type use standard mathematical notations for open and closed interval. 
"(", and "[" Vincent From d.l.goldsmith at gmail.com Thu Jun 10 19:52:14 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 10 Jun 2010 16:52:14 -0700 Subject: [SciPy-Dev] Data Types documentation page questions? In-Reply-To: References: Message-ID: On Thu, Jun 10, 2010 at 4:32 PM, Vincent Davis wrote: > Regarding this page > http://docs.scipy.org/doc/numpy/user/basics.types.html > > I assume there is a "-" missing here > int64 Integer (9223372036854775808 to 9223372036854775807) > Yes, the left number should be negative. > Also I would suggest that the intervals on these data type use > standard mathematical notations for open and closed interval. "(", and > "[" > Except that notation is used for real number intervals, not sets of integers (at least, I've never seen it used w/ sets of integers). And were you to make the change, it would simply be replacing every "(" with a "[" and every ")" with a "]" (because, the way the ranges are given, they all include their endpoints) - I think the risk of misunderstanding on the part of the reader is minimal and thus any benefit is hardly worth the effort. DG > > > Vincent > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Thu Jun 10 21:45:14 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 10 Jun 2010 18:45:14 -0700 Subject: [SciPy-Dev] How to document parameters *args and **kwds Message-ID: We've kind of discussed *args before (see http://docs.scipy.org/numpy/Questions+Answers/#variable-arguments), though we didn't note a "canonical answer." Can we: A) agree on such, and B) extend it to parameter **kwds? Thanks! DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Thu Jun 10 21:50:27 2010 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 10 Jun 2010 21:50:27 -0400 Subject: [SciPy-Dev] How to document parameters *args and **kwds In-Reply-To: References: Message-ID: On Thu, Jun 10, 2010 at 21:45, David Goldsmith wrote: > We've kind of discussed *args before (see > http://docs.scipy.org/numpy/Questions+Answers/#variable-arguments), though > we didn't note a "canonical answer."? Can we: > > A) agree on such, and > > B) extend it to parameter **kwds? In most cases, I would leave the type field blank unless if they happen to be homogeneous. They often aren't. *args : Arguments to pass to the callback. **kwds : Keyword arguments to pass to the callback. But sometimes they are. *indices : ints Possibly multiple indices. I don't think Sphinx has a problem with these constructs, but I could be wrong. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From d.l.goldsmith at gmail.com Thu Jun 10 22:05:53 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 10 Jun 2010 19:05:53 -0700 Subject: [SciPy-Dev] How to document parameters *args and **kwds In-Reply-To: References: Message-ID: On Thu, Jun 10, 2010 at 6:50 PM, Robert Kern wrote: > On Thu, Jun 10, 2010 at 21:45, David Goldsmith > wrote: > > We've kind of discussed *args before (see > > http://docs.scipy.org/numpy/Questions+Answers/#variable-arguments), > though > > we didn't note a "canonical answer." Can we: > > > > A) agree on such, and > > > > B) extend it to parameter **kwds? > > In most cases, I would leave the type field blank unless if they > happen to be homogeneous. They often aren't. > > *args : > Arguments to pass to the callback. > **kwds : > Keyword arguments to pass to the callback. > > But sometimes they are. > > *indices : ints > Possibly multiple indices. > > I don't think Sphinx has a problem with these constructs, but I could be > wrong. > What we were (only slightly) leaning to in the Q+A page discussion, in part because Ralf said there was already precedent for it in the docs, was: \*args : Arguments Explanation of number and type of arguments .... but escaping the '*' with an '\' doesn't appear to be working, but leaving it un-escaped gets misinterpreted, too (as an un-closed emphasis mark-up). So the **kwds analog would be \*\*kwds : Keyword arguments Explanation of number and type... but now that I look at that typed out, I predict that the command-line crowd will protest. :-) Regardless, right now, using * & ** "breaks" the Wiki, whereas \* & \*\* keeps the Wiki from complaining, but the \ aren't removed by it, and look ugly be it in a terminal or rendered. What to do, what to do... DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Thu Jun 10 22:36:58 2010 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 10 Jun 2010 22:36:58 -0400 Subject: [SciPy-Dev] How to document parameters *args and **kwds In-Reply-To: References: Message-ID: On Thu, Jun 10, 2010 at 22:05, David Goldsmith wrote: > On Thu, Jun 10, 2010 at 6:50 PM, Robert Kern wrote: >> >> On Thu, Jun 10, 2010 at 21:45, David Goldsmith >> wrote: >> > We've kind of discussed *args before (see >> > http://docs.scipy.org/numpy/Questions+Answers/#variable-arguments), >> > though >> > we didn't note a "canonical answer."? Can we: >> > >> > A) agree on such, and >> > >> > B) extend it to parameter **kwds? >> >> In most cases, I would leave the type field blank unless if they >> happen to be homogeneous. They often aren't. >> >> *args : >> ? ?Arguments to pass to the callback. >> **kwds : >> ? ?Keyword arguments to pass to the callback. >> >> But sometimes they are. >> >> *indices : ints >> ? ?Possibly multiple indices. >> >> I don't think Sphinx has a problem with these constructs, but I could be >> wrong. > > What we were (only slightly) leaning to in the Q+A page discussion, in part > because Ralf said there was already precedent for it in the docs, was: > > \*args : Arguments > > Explanation of number and type of arguments .... > > but escaping the '*' with an '\' doesn't appear to be working, but leaving > it un-escaped gets misinterpreted, too (as an un-closed emphasis mark-up). > So the **kwds analog would be > > \*\*kwds : Keyword arguments > ??? Explanation of number and type... > > but now that I look at that typed out, I predict that the command-line crowd > will protest. :-)? 
Regardless, right now, using * & ** "breaks" the Wiki, > whereas \* & \*\* keeps the Wiki from complaining, but the \ aren't removed > by it, and look ugly be it in a terminal or rendered.? What to do, what to > do... Fix the wiki software. The generated Sphinx docs are fine with the unescaped version. The wiki is a tool to help build the Sphinx docs, not the other way around. However, my typeless examples do not work directly. You need to omit the colon. Shame, because I like the look of the colon. Ah well. The following do work, and I prefer them to the "Arguments" and "Keyword arguments" placeholders. The *, ** and usually the names of those variables usually state clearly that they are arguments or keyword arguments. Stating it a third time just seems weird. I'd say, add real type information if it makes sense, otherwise omit it. But that's just me. *args Arguments to pass to the callback. **kwds Keyword arguments to pass to the callback. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From vincent at vincentdavis.net Thu Jun 10 23:50:02 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Thu, 10 Jun 2010 21:50:02 -0600 Subject: [SciPy-Dev] mono space text in document editor. Message-ID: It seems that the text in the document editor is not a mono space text. Can this be changed? Vincent From vincent at vincentdavis.net Fri Jun 11 00:11:57 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Thu, 10 Jun 2010 22:11:57 -0600 Subject: [SciPy-Dev] Updating constants Message-ID: The current constants in scipy are from 2002, the newest set available are from 2006. Should they be updated, What are the issues with updating with regard to notifying users ie documenting the update. Vincent From josef.pktd at gmail.com Fri Jun 11 00:38:18 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 11 Jun 2010 00:38:18 -0400 Subject: [SciPy-Dev] Updating constants In-Reply-To: References: Message-ID: On Fri, Jun 11, 2010 at 12:11 AM, Vincent Davis wrote: > The current constants in scipy are from 2002, the newest set available > are from 2006. > Should they be updated, What are the issues with updating with regard > to notifying users ie documenting the update. I thought these are constants. Did they change the value of Pi recently? just curious: What has changed? Josef > > Vincent > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From vincent at vincentdavis.net Fri Jun 11 00:54:13 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Thu, 10 Jun 2010 22:54:13 -0600 Subject: [SciPy-Dev] Updating constants In-Reply-To: References: Message-ID: On Thu, Jun 10, 2010 at 10:38 PM, wrote: > On Fri, Jun 11, 2010 at 12:11 AM, Vincent Davis > wrote: >> The current constants in scipy are from 2002, the newest set available >> are from 2006. >> Should they be updated, What are the issues with updating with regard >> to notifying users ie documenting the update. > > I thought these are constants. Did they change the value of Pi recently? They found more digits of Pi :) I was referring to "Fundamental Physical Constants" scipy.constants.codata I don't know, it's not easy to compare. I assume there was a reason they updated the list. 
There is an Uncertainty value on many of them so I assume the actual value don't change but our estimate does. http://physics.nist.gov/cuu/Constants/index.html release dates 1986, 1998, 2002, 2006 Vincent > > just curious: What has changed? > > Josef > >> >> Vincent >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From vincent at vincentdavis.net Fri Jun 11 01:09:22 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Thu, 10 Jun 2010 23:09:22 -0600 Subject: [SciPy-Dev] Updating constants In-Reply-To: References: Message-ID: On Thu, Jun 10, 2010 at 10:54 PM, Vincent Davis wrote: > On Thu, Jun 10, 2010 at 10:38 PM, ? wrote: >> On Fri, Jun 11, 2010 at 12:11 AM, Vincent Davis >> wrote: >>> The current constants in scipy are from 2002, the newest set available >>> are from 2006. >>> Should they be updated, What are the issues with updating with regard >>> to notifying users ie documenting the update. >> >> I thought these are constants. Did they change the value of Pi recently? > > They found more digits of Pi :) > I was referring to "Fundamental Physical Constants" scipy.constants.codata > I don't know, it's not easy to compare. I assume there was a reason > they updated the list. There is an Uncertainty value on many of them > so I assume the actual value don't change but our estimate does. > > http://physics.nist.gov/cuu/Constants/index.html > release dates 1986, 1998, 2002, 2006 > > Vincent > >> >> just curious: What has changed? >> I just relived I can update the constants and then compare them with the old. Should be quick, I will send out the diff in the morning. Vincent >> Josef >> >>> >>> Vincent >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > From josef.pktd at gmail.com Fri Jun 11 01:29:39 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 11 Jun 2010 01:29:39 -0400 Subject: [SciPy-Dev] Updating constants In-Reply-To: References: Message-ID: On Fri, Jun 11, 2010 at 1:09 AM, Vincent Davis wrote: > On Thu, Jun 10, 2010 at 10:54 PM, Vincent Davis > wrote: >> On Thu, Jun 10, 2010 at 10:38 PM, ? wrote: >>> On Fri, Jun 11, 2010 at 12:11 AM, Vincent Davis >>> wrote: >>>> The current constants in scipy are from 2002, the newest set available >>>> are from 2006. >>>> Should they be updated, What are the issues with updating with regard >>>> to notifying users ie documenting the update. >>> >>> I thought these are constants. Did they change the value of Pi recently? >> >> They found more digits of Pi :) >> I was referring to "Fundamental Physical Constants" scipy.constants.codata >> I don't know, it's not easy to compare. I assume there was a reason >> they updated the list. There is an Uncertainty value on many of them >> so I assume the actual value don't change but our estimate does. >> >> http://physics.nist.gov/cuu/Constants/index.html >> release dates 1986, 1998, 2002, 2006 >> >> Vincent >> >>> >>> just curious: What has changed? >>> > > I just relived I can update the constants and then compare them with > the old. 
Should be quick, I will send out the diff in the morning. quote from the REVIEWS OF MODERN PHYSICS paper: "Although just four years separate the 31 December closing dates of the 2002 and 2006 adjustments, there are a number of important new results to consider. Experimental advances include the 2003 Atomic Mass Evaluation from the Atomic Mass Data Center (AMDC),which provides new values for the relative atomic masses Ar(X) of a number of relevant atoms; a new value of ..." Josef ?Curiouser and curiouser!? > > Vincent > > >>> Josef >>> >>>> >>>> Vincent >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-dev >>> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From pav at iki.fi Fri Jun 11 06:25:08 2010 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 11 Jun 2010 10:25:08 +0000 (UTC) Subject: [SciPy-Dev] mono space text in document editor. References: Message-ID: Thu, 10 Jun 2010 21:50:02 -0600, Vincent Davis wrote: > It seems that the text in the document editor is not a mono space text. > Can this be changed? What text? The text in the edit window is monospace. The output should not be made monospaced; it is intended for Sphinx, and plain text should be variable spaced. -- Pauli Virtanen From tpk at kraussfamily.org Fri Jun 11 08:50:49 2010 From: tpk at kraussfamily.org (Tom K.) Date: Fri, 11 Jun 2010 12:50:49 +0000 (UTC) Subject: [SciPy-Dev] [SciPy] #902: need high, stop, pass options to signal.firwin Message-ID: Looks like this patch for firwin might have been overlooked. Posting here as requested. #902: need high, stop, pass options to signal.firwin ----------------------------------+---------------------------- Reporter: tpk@? | Owner: somebody Type: enhancement | Status: new Priority: normal | Milestone: 0.8.0 Component: scipy.signal | Version: 0.7.0 Keywords: | ----------------------------------+---------------------------- Comment(by charris): It probably got overlooked. Send a note to the list. -- Ticket URL: SciPy SciPy is open-source software for mathematics, science, and engineering. From d.l.goldsmith at gmail.com Fri Jun 11 11:19:34 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Fri, 11 Jun 2010 08:19:34 -0700 Subject: [SciPy-Dev] Marathon Skypecon in 45 minutes Message-ID: My Skype ID is d.l.goldsmith - message me and I'll add you. DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Fri Jun 11 11:45:03 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Fri, 11 Jun 2010 08:45:03 -0700 Subject: [SciPy-Dev] How to document parameters *args and **kwds In-Reply-To: References: Message-ID: On Thu, Jun 10, 2010 at 7:36 PM, Robert Kern wrote: > On Thu, Jun 10, 2010 at 22:05, David Goldsmith > wrote: > > On Thu, Jun 10, 2010 at 6:50 PM, Robert Kern > wrote: > >> > >> On Thu, Jun 10, 2010 at 21:45, David Goldsmith > > >> wrote: > >> > We've kind of discussed *args before (see > >> > http://docs.scipy.org/numpy/Questions+Answers/#variable-arguments), > >> > though > >> > we didn't note a "canonical answer." Can we: > >> > > >> > A) agree on such, and > >> > > >> > B) extend it to parameter **kwds? 
> >> > >> In most cases, I would leave the type field blank unless if they > >> happen to be homogeneous. They often aren't. > >> > >> *args : > >> Arguments to pass to the callback. > >> **kwds : > >> Keyword arguments to pass to the callback. > >> > >> But sometimes they are. > >> > >> *indices : ints > >> Possibly multiple indices. > >> > >> I don't think Sphinx has a problem with these constructs, but I could be > >> wrong. > > > > What we were (only slightly) leaning to in the Q+A page discussion, in > part > > because Ralf said there was already precedent for it in the docs, was: > > > > \*args : Arguments > > > > Explanation of number and type of arguments .... > > > > but escaping the '*' with an '\' doesn't appear to be working, but > leaving > > it un-escaped gets misinterpreted, too (as an un-closed emphasis > mark-up). > > So the **kwds analog would be > > > > \*\*kwds : Keyword arguments > > Explanation of number and type... > > > > but now that I look at that typed out, I predict that the command-line > crowd > > will protest. :-) Regardless, right now, using * & ** "breaks" the Wiki, > > whereas \* & \*\* keeps the Wiki from complaining, but the \ aren't > removed > > by it, and look ugly be it in a terminal or rendered. What to do, what > to > > do... > > Fix the wiki software. The generated Sphinx docs are fine with the > unescaped version. The wiki is a tool to help build the Sphinx docs, > not the other way around. > > However, my typeless examples do not work directly. You need to omit > the colon. Shame, because I like the look of the colon. Ah well. The > following do work, and I prefer them to the "Arguments" and "Keyword > arguments" placeholders. The *, ** and usually the names of those > variables usually state clearly that they are arguments or keyword > arguments. Stating it a third time just seems weird. I'd say, add real > type information if it makes sense, otherwise omit it. But that's just > me. > > *args > Arguments to pass to the callback. > **kwds > Keyword arguments to pass to the callback. > > -- > Robert Kern > I excerpted this over on the Q+A page; please go check to confirm that I haven't misrepresented you. And thanks for your input! DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From vincent at vincentdavis.net Fri Jun 11 12:07:59 2010 From: vincent at vincentdavis.net (Vincent Davis) Date: Fri, 11 Jun 2010 10:07:59 -0600 Subject: [SciPy-Dev] mono space text in document editor. In-Reply-To: References: Message-ID: On Fri, Jun 11, 2010 at 4:25 AM, Pauli Virtanen wrote: > Thu, 10 Jun 2010 21:50:02 -0600, Vincent Davis wrote: >> It seems that the text in the document editor is not a mono space text. >> Can this be changed? > > What text? On this page for example. http://docs.scipy.org/scipy/docs/scipy.constants.codata.precision/edit/ When you edit the text it is not monospace and there is no 75 char guide. This makes both staying within the 75 char limit and getting the rst syntax correct difficult. (for example putting "-" under a word to get bold font.) I could use an external editor and copy paste. See attached images for you don't know what I mean. Thanks Vincent > > The text in the edit window is monospace. The output should not be made > monospaced; it is intended for Sphinx, and plain text should be variable > spaced. 
> > -- > Pauli Virtanen > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -------------- next part -------------- A non-text attachment was scrubbed... Name: scipy doc edit.tiff Type: image/tiff Size: 96158 bytes Desc: not available URL: From d.l.goldsmith at gmail.com Fri Jun 11 12:30:23 2010 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Fri, 11 Jun 2010 09:30:23 -0700 Subject: [SciPy-Dev] mono space text in document editor. In-Reply-To: References: Message-ID: On Fri, Jun 11, 2010 at 9:07 AM, Vincent Davis wrote: > On Fri, Jun 11, 2010 at 4:25 AM, Pauli Virtanen wrote: > > Thu, 10 Jun 2010 21:50:02 -0600, Vincent Davis wrote: > >> It seems that the text in the document editor is not a mono space text. > >> Can this be changed? > > > > What text? > On this page for example. > http://docs.scipy.org/scipy/docs/scipy.constants.codata.precision/edit/ > When you edit the text it is not monospace and there is no 75 char guide. > This makes both staying within the 75 char limit Just err on the side of being too short (i.e., if in doubt, break the line). > and getting the rst > syntax correct difficult. (for example putting "-" under a word to get > bold font.) **** gives us bold font. ("-" under a word makes it a section heading; only Parameters, Returns, Other parameters, Raises, See also, Notes, References, and Examples, starting in the first column and on a line by themselves, should have "-" under them.) > I could use an external editor and copy paste. > Actually, nominally, that _is_ what we'd prefer writer/editors to do: (from http://docs.scipy.org/numpy/Front%20Page/#roles-reviewing) "**It is best to grab the whole existing page [from the Edit window, not the View window], or a template, edit it on your computer, return [and] check that nobody else has edited first, and then upload your document. Please do not edit incrementally, unless making trivial changes like fixing markup or reformatting." (Full disclosure: I "cheat," but since you brought it up, I figured I should emphasize that what you state is the preferred modus operendi.) DG See attached images for you don't know what I mean. > > Thanks > Vincent > > > > > > > The text in the edit window is monospace. The output should not be made > > monospaced; it is intended for Sphinx, and plain text should be variable > > spaced. > > > > -- > > Pauli Virtanen > > > > _______________________________________________ > > SciPy-Dev mailing list > > SciPy-Dev at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > -- Mathematician: noun, someone who disavows certainty when their uncertainty set is non-empty, even if that set has measure zero. Hope: noun, that delusive spirit which escaped Pandora's jar and, with her lies, prevents mankind from committing a general suicide. (As interpreted by Robert Graves) -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Fri Jun 11 12:41:17 2010 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 11 Jun 2010 12:41:17 -0400 Subject: [SciPy-Dev] mono space text in document editor. 
In-Reply-To: References: Message-ID: On Fri, Jun 11, 2010 at 12:30 PM, David Goldsmith wrote: > On Fri, Jun 11, 2010 at 9:07 AM, Vincent Davis > wrote: >> >> On Fri, Jun 11, 2010 at 4:25 AM, Pauli Virtanen wrote: >> > Thu, 10 Jun 2010 21:50:02 -0600, Vincent Davis wrote: >> >> It seems that the text in the document editor is not a mono space text. >> >> Can this be changed? >> > >> > What text? >> On this page for example. >> http://docs.scipy.org/scipy/docs/scipy.constants.codata.precision/edit/ >> When you edit the text it is not monospace and there is no 75 char guide. >> This makes both staying within the 75 char limit it's monospaced for me in firefox, underlines line up with header. maybe it's a browser setting. Josef > > Just err on the side of being too short (i.e., if in doubt, break the line). > >> >> and getting the rst >> syntax correct difficult. (for example putting "-" under a word to get >> bold font.) > > **** gives us bold font.? ("-" under a word makes it a section > heading; only Parameters, Returns, Other parameters, Raises, See also, > Notes, References, and Examples, starting in the first column and on a line > by themselves, should have "-" under them.) > >> >> I could use an external editor and copy paste. > > Actually, nominally, that _is_ what we'd prefer writer/editors to do: > > (from http://docs.scipy.org/numpy/Front%20Page/#roles-reviewing) "It is best > to grab the whole existing page [from the Edit window, not the View window], > or a template, edit it on your computer, return [and] check that nobody else > has edited first, and then upload your document. Please do not edit > incrementally, unless making trivial changes like fixing markup or > reformatting." > > (Full disclosure: I "cheat," but since you brought it up, I figured I should > emphasize that what you state is the preferred modus operendi.) > > DG > >> See attached images for you don't know what I mean. >> >> Thanks >> Vincent >> >> >> >> > >> > The text in the edit window is monospace. The output should not be made >> > monospaced; it is intended for Sphinx, and plain text should be variable >> > spaced. >> > >> > -- >> > Pauli Virtanen >> > >> > _______________________________________________ >> > SciPy-Dev mailing list >> > SciPy-Dev at scipy.org >> > http://mail.scipy.org/mailman/listinfo/scipy-dev >> > >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-dev >> > > > > -- > Mathematician: noun, someone who disavows certainty when their uncertainty > set is non-empty, even if that set has measure zero. > > Hope: noun, that delusive spirit which escaped Pandora's jar and, with her > lies, prevents mankind from committing a general suicide. ?(As interpreted > by Robert Graves) > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > From jsseabold at gmail.com Fri Jun 11 12:45:05 2010 From: jsseabold at gmail.com (Skipper Seabold) Date: Fri, 11 Jun 2010 12:45:05 -0400 Subject: [SciPy-Dev] Warnings raised (from fit in scipy.stats) Message-ID: Since the raising of warning behavior has been changed (I believe), I have been running into a lot of warnings in my code when say I do something like In [120]: from scipy import stats In [121]: y = [-45, -3, 1, 0, 1, 3] In [122]: v = stats.norm.pdf(y)/stats.norm.cdf(y) Warning: invalid value encountered in divide Sometimes, this is useful to know. 
Sometimes, though, it's very disturbing when it's encountered in some kind of iteration or optimization. I have been using numpy.clip to get around this in my own code, but when it's buried a bit deeper, it's not quite so simple. Take this example. In [123]: import numpy as np In [124]: np.random.seed(12345) In [125]: B = 6.0 In [126]: x = np.random.exponential(scale=B, size=5000) In [127]: from scipy.stats import expon In [128]: expon.fit(x) Out[128]: (0.21874043533906118, 5.7122829778172939) The fit is achieved by fmin (as far as I know, since disp=0 in the rv_continuous.fit...), but there are a number of warnings emitted. Is there any middle ground to be had in these type of situations via context management perhaps? Should I file a ticket? Skipper From pav at iki.fi Fri Jun 11 12:50:41 2010 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 11 Jun 2010 16:50:41 +0000 (UTC) Subject: [SciPy-Dev] mono space text in document editor. References: Message-ID: Fri, 11 Jun 2010 10:07:59 -0600, Vincent Davis wrote: [clip] > On this page for example. > http://docs.scipy.org/scipy/docs/scipy.constants.codata.precision/edit/ > When you edit the text it is not monospace and there is no 75 char > guide. This makes both staying within the 75 char limit and getting the > rst syntax correct difficult. (for example putting "-" under a word to > get bold font.) I could use an external editor and copy paste. See > attached images for you don't know what I mean. That's specific to the browser you are using, and possibly also user- specific customizations -- is it Safari on OSX? Anyway, there's nothing in the CSS forcing