[SciPy-Dev] Contingency Table Model

josef.pktd at gmail.com josef.pktd at gmail.com
Mon Aug 9 16:11:10 EDT 2010


On Mon, Aug 9, 2010 at 3:31 PM, Anthony Scopatz <scopatz at gmail.com> wrote:
> Hello All,
> I have just opened a ticket
> (http://projects.scipy.org/scipy/ticket/1258) that adds a general
> contingency table class to the the stats package.  This class includes
> methods to slice and collapse the table as well a calculate metrics such as
> chi-squared and entropy.
> This implementation came out of Warren Weckesser and me working on this over
> the SciPy 2010 statistics sprint.
> Please take a look!  Comments and suggestions are always welcome.

just a quick question that I don't understand from a brief look at the source

Isn't the core of "from_columns" doing the same quantization as
np.histogramdd? ( I haven't looked closely enough yet)

If x in from_columns is a tuple, then an array_like could also contain
strings, e.g. names/levels of a categorical variable. I'm not sure how
far this should go.

other ideas
methods or functions "from_flat" and "to_flat" would be useful.
chi2 could be renamed to chi2_indep, or take an optional expected
keyword, where the user could specify other distribution hypotheses.

Josef


> Be Well,
> Anthony
> _______________________________________________
> SciPy-Dev mailing list
> SciPy-Dev at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-dev
>
>



More information about the SciPy-Dev mailing list