[scikit-learn] What's the principle of partial_fit?

lampahome pahome.chen at mirlab.org
Tue Jul 2 00:48:11 EDT 2019


I work with partial_fit of Birch because the dataset is too huge to load
into memory.

So I cluster data batch by batch. eg: I have 50000 samples and every batch
contain 1000 samples.

I found clustering result is better if I cluster data which contain part of
last batch better than cluster data which doesn't contain previous data.

So I want to know how partail_fit works.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20190702/2203a4b5/attachment.html>


More information about the scikit-learn mailing list