
Feature Importance #282

Open
xnuohz opened this issue Dec 9, 2023 · 5 comments

@xnuohz
Contributor

xnuohz commented Dec 9, 2023

Feature

Support feature importance in tabular data scenarios.

  1. Understand which features are beneficial for prediction, which helps with developing new features.
  2. Feature selection: remove features that do not help prediction.

Ideas

  1. GBDTs already expose APIs for computing feature importance, so this is easy to add.
  2. NNs
    • Permutation. After shuffling a given feature, observe the change in the metric; the larger the change, the more important the feature. Simple.
    • SHAP. More complex.
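The permutation idea above can be sketched with the standard library alone. The data and the stand-in "model" below are made up purely for illustration, not pytorch-frame code:

```python
import random

random.seed(0)

# Toy dataset: feature 0 determines the label, feature 1 is noise.
X = [[random.random(), random.random()] for _ in range(200)]
y = [1 if row[0] > 0.5 else 0 for row in X]

def model(row):
    # Stand-in for a trained model: thresholds feature 0, ignores feature 1.
    return 1 if row[0] > 0.5 else 0

def accuracy(data, labels):
    return sum(model(r) == t for r, t in zip(data, labels)) / len(labels)

baseline = accuracy(X, y)

def permutation_importance(feature_idx):
    # Shuffle one column and measure how much the metric drops.
    col = [row[feature_idx] for row in X]
    random.shuffle(col)
    X_perm = [row[:feature_idx] + [v] + row[feature_idx + 1:]
              for row, v in zip(X, col)]
    return baseline - accuracy(X_perm, y)

importances = [permutation_importance(j) for j in range(2)]
```

Shuffling feature 0 destroys most of the signal, so its importance is large; shuffling feature 1 changes nothing, so its importance is zero.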
@yiweny
Contributor

yiweny commented Dec 9, 2023

Mutual Information Sort is already added here.
For feature sorting in NNs, I recommend you take a look at the ExcelFormer example.
If you are interested in adding any feature-related functionality, you can add it to the transform module.
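For intuition, what a mutual-information-based sort computes can be sketched in plain Python. The feature/label sequences below are toy data for illustration, not the pytorch-frame implementation:

```python
from collections import Counter
from math import log

def mutual_information(xs, ys):
    """I(X; Y) in nats for two discrete sequences of equal length."""
    n = len(xs)
    px, py = Counter(xs), Counter(ys)
    pxy = Counter(zip(xs, ys))
    return sum(
        (c / n) * log((c / n) / ((px[x] / n) * (py[y] / n)))
        for (x, y), c in pxy.items()
    )

# Binary labels, one perfectly predictive feature, one constant feature.
labels      = [0, 1, 0, 1, 0, 1, 0, 1]
informative = list(labels)       # copies the label exactly
constant    = [0] * len(labels)  # carries no information

scores = {"informative": mutual_information(informative, labels),
          "constant": mutual_information(constant, labels)}
# Sorting features by score descending gives the mutual-information order.
ranked = sorted(scores, key=scores.get, reverse=True)
```

The constant feature scores 0, while a feature identical to a uniform binary label scores log 2 nats (the label entropy), so it ranks first.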

@xnuohz
Contributor Author

xnuohz commented Dec 10, 2023

Thanks. As you mentioned, mutual information sorting and ExcelFormer improve performance through transformation capabilities. However, what I want to discuss is how much different features contribute to the final prediction. For example, user behavioral features are important in recommender systems, so their feature importance should be high. pytorch-frame is good to use: it lets me quickly obtain benchmark results on real-world datasets and decide whether NNs or GBDTs work better. I'm unsure whether functionality for evaluating feature importance is worth integrating into pytorch-frame as a module.

@zechengz
Member

zechengz commented Dec 11, 2023

I think you can give Captum (https://captum.ai/) a try.
cc @weihua916: can we also integrate this in PyT?

@xnuohz
Contributor Author

xnuohz commented Dec 11, 2023

Yes, Captum implements many interpretability methods; Feature Permutation and SHAP are among them.

@February24-Lee
Contributor

Is there any update or roadmap related to it? 👀


No branches or pull requests

6 participants