question

PriyeshDave-5166 avatar image
0 Votes"
PriyeshDave-5166 asked YutongTie-MSFT commented

Random Forest

I am currently working on Career Guidance prediction using Machine Learning.

The dataset has 38 features. For feature selection I tried using mutual_info_classif for getting the mutual information of my features and got the list of important features.

The Second approach I followed is using SelectKBest with mutual_info_classif as my score_func. On this approach I got some other list of features.

Is it normal to get different results ?
Can anyone please help me out?

azure-machine-learningazure-data-science-vm
· 1
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

@PriyeshDave-5166

Hello,

Do you have a chance to check if below answer helps? Please let us know if you have more questions or kindly accept the answer if you feel this is helpful. Thank you.

0 Votes 0 ·

1 Answer

YutongTie-MSFT avatar image
0 Votes"
YutongTie-MSFT answered

Hello @PriyeshDave-5166

Thanks for reaching out to us here. Based on my understanding, if you keep X, Y the same, and you are using sklearn.feature_selection.SelectKBest(score_func= mutual_info_classif , *, k= n) the result should be the same.

Are you manually rank the feature with mutual_info_classif score? The SelectKBest function is basically return the K highest feature based on the X and Y.

If this is not your case, I would highly recommend you to check with Scikit-learn team by email : https://mail.python.org/mailman/listinfo/scikit-learn

Or forum: https://stackoverflow.com/questions/tagged/scikit-learn


Hope this will help. Please let us know if any further queries.



  • Please don't forget to click on 130616-image.png or upvote 130671-image.png button whenever the information provided helps you. Original posters help the community find answers faster by identifying the correct answer. Here is how

  • Want a reminder to come back and check responses? Here is how to subscribe to a notification

  • If you are interested in joining the VM program and help shape the future of Q&A: Here is how you can be part of Q&A Volunteer Moderators








5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.