[scikit-learn] Silhouette example - performance issue

Raghav R V ragvrv at gmail.com
Fri Oct 14 09:38:55 EDT 2016


On Fri, Oct 14, 2016 at 3:27 PM, Anaël Bonneton <anael.bonneton at gmail.com>
wrote:

> Hi,
>
> In the silhouette example (http://scikit-learn.org/stable/auto_examples/cluster/plot_kmeans_silhouette_analysis.html#sphx-glr-auto-examples-cluster-plot-kmeans-silhouette-analysis-py),
> the silhouette values of each sample are computed twice: once with
> *silhouette_score* and once with *silhouette_samples*.
> The call to *silhouette_score* can easily be avoided by computing the
> average of the result of *silhouette_samples*.
>
> Do you think we should remove the call to *silhouette_score* to improve
> performance? Or is it better to keep both functions to show how to
> use them?
>
Hi,

When I wrote it, I intended it to demonstrate both functions.

I'm not sure we should worry about performance issues there.
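
For reference, here is a minimal sketch of the suggestion in the quoted
message: compute the per-sample values once with silhouette_samples and
average them, instead of also calling silhouette_score. The toy dataset and
the KMeans parameters are assumptions chosen for illustration, not copied
from the example's source.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_samples, silhouette_score

# Toy data and clustering parameters are illustrative assumptions.
X, _ = make_blobs(n_samples=500, centers=4, random_state=1)
labels = KMeans(n_clusters=4, n_init=10, random_state=10).fit_predict(X)

# Per-sample silhouette values, needed for the silhouette plot anyway.
sample_values = silhouette_samples(X, labels)

# Averaging the per-sample values avoids a second pass over the data.
avg_from_samples = sample_values.mean()

# The separate call, shown here only to compare the two results.
avg_from_score = silhouette_score(X, labels)

print(np.isclose(avg_from_samples, avg_from_score))
```

With the default sample_size=None, silhouette_score is the mean of
silhouette_samples over all samples, so the two averages should agree up to
floating-point error. If sample_size is set, silhouette_score works on a
random subsample and the values will generally differ.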


-- 
Raghav RV
https://github.com/raghavrv

