arXiv:2401.09756 [cs.LG]

Explaining Drift using Shapley Values

Narayanan U. Edakunni, Utkarsh Tekriwal, Anukriti Jain

Published 2024-01-18, Version 1

Machine learning models often deteriorate in performance when they are used to predict outcomes on data on which they were not trained. Such scenarios often arise in the real world when the distribution of the data changes, either gradually or abruptly, for example due to a major event like a pandemic. There have been many attempts in machine learning research to develop techniques that are resilient to such concept drift. However, there is no principled framework for identifying the drivers behind the drift in model performance. In this paper, we propose a novel framework, DBShap, that uses Shapley values to identify the main contributors to the drift and quantify their respective contributions. The proposed framework not only quantifies the importance of individual features in driving the drift but also includes the change in the underlying relation between the input and output as a possible driver. The explanation provided by DBShap can be used to understand the root cause of the drift and to make the model resilient to it.
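
To make the idea of attributing drift with Shapley values concrete, the following is a minimal sketch and not the DBShap algorithm itself, whose details the abstract does not give. It assumes three hypothetical drift drivers (two feature distributions, `x1` and `x2`, and the input-output relation, `concept`) and a made-up value function `toy_error` that returns the model error when a given subset of drivers is switched to its post-drift state. All names and numbers are illustrative assumptions.

```python
from itertools import combinations
from math import factorial


def shapley_values(players, value):
    """Exact Shapley values for a small set of players.

    `value` maps a frozenset of players to a number. Here it is read as the
    model error when exactly those drivers are in their post-drift state.
    """
    n = len(players)
    phi = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        for k in range(n):
            # Standard Shapley weight for coalitions of size k.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = frozenset(subset)
                # Marginal contribution of p when added to coalition s.
                total += weight * (value(s | {p}) - value(s))
        phi[p] = total
    return phi


# Hypothetical toy numbers, chosen only for illustration.
BASE_ERROR = 0.10                                    # error before any drift
SOLO_EFFECT = {"x1": 0.04, "x2": 0.00, "concept": 0.10}
INTERACTION = {frozenset({"x1", "concept"}): 0.06}   # extra error when both shift


def toy_error(shifted):
    """Assumed model error when the drivers in `shifted` have drifted."""
    err = BASE_ERROR + sum(SOLO_EFFECT[p] for p in shifted)
    for pair, extra in INTERACTION.items():
        if pair <= shifted:
            err += extra
    return err


players = ["x1", "x2", "concept"]
contrib = shapley_values(players, toy_error)
total_drift = toy_error(frozenset(players)) - toy_error(frozenset())
```

In this toy setting the contributions sum to the total change in error (the efficiency property of Shapley values), and the interaction term is split equally between `x1` and `concept`. Exact enumeration visits every subset of drivers, so it is only practical for a handful of them; a realistic setting would likely need sampling-based approximation.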

Categories: cs.LG, cs.AI

Keywords: shapley values, explaining drift, machine learning models, individual features, main contributors

Related articles:

arXiv:1906.01827 [cs.LG] (Published 2019-06-05)

Data Sketching for Faster Training of Machine Learning Models

Baharan Mirzasoleiman, Jeff Bilmes, Jure Leskovec

arXiv:1911.07749 [cs.LG] (Published 2019-11-15)

On the computation of counterfactual explanations -- A survey

André Artelt, Barbara Hammer

arXiv:2104.04148 [cs.LG] (Published 2021-04-09)

Individual Explanations in Machine Learning Models: A Case Study on Poverty Estimation

Alfredo Carrillo, Luis F. Cantú, Luis Tejerina, Alejandro Noriega