[SciPy-dev] Scikit for manifold learning techniques

Zachary Pincus zpincus at stanford.edu
Thu Dec 6 11:53:11 EST 2007


> I'd like to create a new scikit (I know I didn't put much effort in  
> the optimizers, but it will change when I will have more time) for  
> manifold learning. At first, I'd like to implement some usual  
> techniques like Isomap, LLE (some are in neuroimaging I heard) with  
> different levels of interaction. I do this in my PhD thesis, so it  
> is almost available like a scikit. It would be a twin-like of the  
> Dimensionality Reduction toolbox for MatLab but with a different  
> interaction : directly call the right global function (like isomap,  
> mds, nlm or gedodesicNLM ATM) or give directly to an optimizer the  
> cost function you want with a distance matrix (it will use my own  
> optimizers).
> Eigenmaps will be available shortly (I have a referee that want it,  
> so I will implement it), it will use scipy.sparse, and I hope I'll  
> be able to propose two interfaces as well.
>
> If everything goes smoothly, I'll propose my own dimensionality  
> reduction technique in the scikit as well.

Oh, this would be most fantastic. If desired, I can donate a PCA  
implementation, which would be a good "baseline" method to have in a  
dimensionality-reduction kit.

Zach

(PS. Yes, PCA is easy to implement, but it is also easy to get subtly  
wrong -- I've seen several such -- or to implement in a way that is a  
lot slower than it needs to be. I've spent a while making my  
implementation correct and as fast as possible for both n_data >>  
n_dims and vice-versa. If anyone wants, I'll send the code.)



More information about the SciPy-Dev mailing list