[scikit-learn] Construct the microclusters using a CF-Tree

Roman Yurchak rth.yurchak at gmail.com
Fri Jun 30 10:42:11 EDT 2017


Hello Sema,

On 30/06/17 17:14, Sema Atasever wrote:
> I want to cluster them using Birch clustering algorithm.
> Does this method have 'precomputed' option.

No it doesn't, see 
http://scikit-learn.org/stable/modules/generated/sklearn.cluster.Birch.html 
so you would need to provide it with the original features matrix (not 
the precomputed distance matrix). Since your dataset is fairly small, 
there is no reason in precomputing it anyway.

> I needed train an SVM on the centroids of the microclusters so
> *How can i get the centroids of the microclusters?*

By "microclusters" do you mean sub-clusters? If you are interested in 
the leaves subclusters see the Birch.subcluster_centers_ parameter.

Otherwise if you want all the centroids in the hierarchy of subclusters, 
you can browse the hierarchical tree via the  Birch.root_ attribute then 
look at _CFSubcluster.centroid_ for each subcluster.

Hope this helps,
-- 
Roman


More information about the scikit-learn mailing list