[scikit-learn] Replacing the Boston Housing Prices dataset

Andreas Mueller t3kcit at gmail.com
Thu Jul 6 14:09:10 EDT 2017



On 07/06/2017 12:41 PM, G Reina wrote:
>
> The fact that too many tutorials and examples rely on it is not a 
> reason to keep the dataset. New tutorials are written all the time. 
> And, as sklearn evolves some of the existing tutorials will need to be 
> updated anyway to keep up with the changes.
No, we try to avoid that as much as possible.
Old examples should work for as long as possible, and we actively avoid 
breaking API unnecessarily. It's one of the core principles of 
scikit-learn development.

And new tutorials can use any dataset they choose. We are working on 
including an openml fetcher, which allows using more datasets more easily.



More information about the scikit-learn mailing list