[scikit-learn] make_classification question

Wed Aug 12 11:12:37 EDT 2020

Hi Anna,

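You can set shuffle=False (it defaults to True in make_classification). The columns of X are then stacked in a fixed order: the n_informative informative features first, then the n_redundant redundant features, then the n_repeated repeated features, and finally the noise features. All useful features are therefore in X[:, :n_informative + n_redundant + n_repeated]. For example, if you set “n_features=1000” and “n_informative=20”, the informative features are the first 20 columns, X[:, :20]. Note that shuffle=False also leaves the samples unshuffled, so you may want to shuffle the rows yourself afterwards.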
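
A minimal sketch of this, using the numbers from your example. The values for n_samples, n_redundant, n_repeated and random_state are only illustrative assumptions, and the manual column shuffle at the end is an optional extra in case you still want randomly placed features while keeping track of where the informative ones end up:

```python
import numpy as np
from sklearn.datasets import make_classification

# Illustrative settings; only n_features and n_informative come from the question.
n_informative, n_redundant, n_repeated = 20, 2, 0

X, y = make_classification(
    n_samples=200,
    n_features=1000,
    n_informative=n_informative,
    n_redundant=n_redundant,
    n_repeated=n_repeated,
    shuffle=False,      # keep the documented column order
    random_state=0,
)

# With shuffle=False the informative features are the first columns.
informative_idx = np.arange(n_informative)
X_informative = X[:, informative_idx]

# Optional: shuffle the columns yourself and remember where the
# informative ones land. Column j of X_shuffled is column perm[j] of X.
rng = np.random.default_rng(0)
perm = rng.permutation(X.shape[1])
X_shuffled = X[:, perm]
informative_pos = np.flatnonzero(perm < n_informative)
```

Here informative_pos holds the column indices of the informative features in X_shuffled. The rows can be shuffled the same way, as long as the same row permutation is applied to both X and y.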

Best,
Sebastian

> On Aug 12, 2020, at 8:35 AM, Anna Jenul <anna.jenul at nmbu.no> wrote:
> 
> Hi!
> I am generating my own datasets with sklearn.datasets.make_classification. Unfortunately, I cannot figure out which of the generated features are the informative ones. In my example I generate “n_features=1000” and “n_informative=20”. Is there any way to find out which features are the informative ones after the dataset is generated?
> Thanks,
> Anna