Are SHAP calculations in ExplainerDashboard aggregate? What are the ideal sizes for X_background and X when evaluating neural net models? #280
Unanswered
Menamonmon asked this question in Q&A
Replies: 1 comment
Hi @Menamonmon, SHAP values are computed per example, so they explain individual predictions, but they can also be aggregated to give global insights into your model. As for the size of X_background: it can be pretty small (tens of examples) and still give decent results, but this is probably a question better asked on the shap library's GitHub.
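To make the sizing advice concrete, here is a minimal sketch of picking a small background sample and a subsampled X before building the explainer. It uses only numpy/pandas with synthetic data; the ClassifierExplainer call at the end is shown as a comment because it assumes explainerdashboard is installed and a fitted model is available.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)

# Stand-in for a very large training set (millions of rows in the question)
# and a held-out test set; real column names would come from your data.
X_train = pd.DataFrame(rng.normal(size=(100_000, 4)), columns=list("abcd"))
X_test = pd.DataFrame(rng.normal(size=(20_000, 4)), columns=list("abcd"))

# Background: tens of examples is often enough for deep/kernel SHAP explainers.
X_background = X_train.sample(n=50, random_state=0)

# X passed to the explainer: a random subsample keeps SHAP computation and the
# dashboard responsive while still supporting per-row and aggregated views.
X_sample = X_test.sample(n=1_000, random_state=0)

print(X_background.shape, X_sample.shape)

# Hypothetical usage (assumes explainerdashboard and a fitted model):
# explainer = ClassifierExplainer(model, X_sample, y_sample,
#                                 X_background=X_background)
```

The key trade-off is that a larger X_background mostly buys a more stable reference distribution, while a larger X buys more examples to aggregate over; neither needs to be the full dataset.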
I have two questions here:

1. Can ExplainerDashboard be used to examine the influence of features on a single prediction, or does it only provide accurate explanations at an aggregate level for the model as a whole? If individual explanations are possible, how big should the X and X_background datasets be when passing them to ClassifierExplainer?

2. How big should X_background be to ensure accurate SHAP values for a large training dataset (millions of rows)? If SHAP calculations only give valuable insights at an aggregate level, similarly, how big should the X dataset be when creating the ClassifierExplainer?

I would really appreciate some examples to clarify this. So far, the examples I have seen use the training data as X_background and the testing data as X, but in my case I have a very large dataset and want to balance performance against accuracy, and I also want to know whether explanations for individual predictions are possible. Thank you in advance!
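The per-example vs. aggregate distinction the question raises can be sketched with plain numpy: each row of a SHAP matrix explains one prediction, and averaging absolute values over rows gives the global importance shown at the aggregate level. The numbers below are a toy illustration, not output from any real model.

```python
import numpy as np

# Toy per-example SHAP values: rows = examples, columns = features.
shap_values = np.array([
    [ 0.30, -0.10, 0.05],
    [-0.20,  0.40, 0.00],
    [ 0.10, -0.30, 0.15],
])

# One row explains a single prediction (feature contributions for that example).
single_prediction_explanation = shap_values[0]

# mean(|shap|) over rows is the usual aggregate (global) feature importance.
global_importance = np.abs(shap_values).mean(axis=0)
print(global_importance)  # roughly [0.2, 0.267, 0.067]
```

So the same computation serves both views: the dashboard's individual-prediction tab reads one row, and the feature-importance tab reads the column-wise aggregate.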