[SciPy-User] Identifying variables driving divergence in Scipy
Ben Temperton
btemperton at gmail.com
Thu Dec 20 17:44:43 EST 2012
Hi there,
I have a matrix of 37 samples and ~1300 variables that are clearly
clustering into 3 distinct groups using:
import scipy.cluster.hierarchy as sch
import scipy.spatial.distance as dist
d1= dist.pdist(mtx2, 'braycurtis')
Y1 = sch.linkage(d1, method='average', metric='braycurtis')
Z1 = sch.dendrogram(Y1, labels=mtx2.index)
What I would like to do is find out which variables are driving the
divergence in this clustering, i.e. which ones are greater in group A than
group B and vice versa. In tools like Primer, this would be done with a
SIMPER test, and I am hoping someone can point me in the direction of a
Scipy equivalent as I am trying to keep all my data analysis in ipython
notebook.
Many thanks,
Ben
<https://lh5.googleusercontent.com/-YdYGFVthrYk/UNOUWRM4OsI/AAAAAAAAAkI/VBJvYtIbPJo/s1600/clust.png>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.scipy.org/pipermail/scipy-user/attachments/20121220/4fc0fbcd/attachment.html>
More information about the SciPy-User
mailing list