[SciPy-User] [Numpy-discussion] what statistical module to use for python?

Wes McKinney wesmckinn at gmail.com
Thu Dec 1 10:06:46 EST 2011


On Wed, Nov 30, 2011 at 8:41 PM,  <josef.pktd at gmail.com> wrote:
> <forwarding from numpy mailing list>
>
> On Wed, Nov 30, 2011 at 1:16 PM, Chao YUE <chaoyuejoy at gmail.com> wrote:
>> Hi all,
>
> This is more a question for the scipy-user mailing list since that is
> for more general question.
>
> I would also like to know since I have a biased or selective view.
>
>>
>> I just want to broadly ask what statistical package are you guys using? I
>> mean routine statistical function like linear regression, GLM, ANOVA... etc.
>>
>> I know there is SciKits packages like statsmodels, but are there more
>> general and complete ones?
>
> (Not counting rpy2 since it's not available on Windows anymore.)
>
> I think there are more complete packages on specific topics, but
> nothing in python that is complete and general, that's where
> statsmodels tries to be.
>
> sklearn is machine learning oriented but covers also a large area of
> statistical methods.
>
> Besides scipy.stats, statsmodels and sklearn, I don't know any that
> target to be general and not field specific. (scipy and numpy have
> also features that make do-it-yourself easy.)
>
> But there are many more field or topic specific packages, ......
> (Bayesian, spatial, discrete choice (transport), and then by scientific field.)
>
> http://www.scipy.org/Topical_Software   doesn't include a statistics section
>
> An overview or survey of packages and statistical methods (in a very
> broad definition) would be useful.
>
> Thanks,
>
> Josef
>
>>
>> thanks to all,
>>
>> Chao
>> --
>> ***********************************************************************************
>> Chao YUE
>> Laboratoire des Sciences du Climat et de l'Environnement (LSCE-IPSL)
>> UMR 1572 CEA-CNRS-UVSQ
>> Batiment 712 - Pe 119
>> 91191 GIF Sur YVETTE Cedex
>> Tel: (33) 01 69 08 29 02; Fax:01.69.08.77.16
>> ************************************************************************************
>>
>>
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at scipy.org
>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>>
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user

I think that statsmodels is the right place for the kinds of models
and analysis you're referring ot. We would love more contributors to
make it more complete, e.g. I don't it has much in the way of ANOVA
yet (probably needs the formula framework to be set up first).



More information about the SciPy-User mailing list