Sentence-Based Model Agnostic NLP Interpretability

2020-12-24 10:32:41

Yves Rychener, Xavier Renard, Djamé Seddah, Pascal Frossard, Marcin Detyniecki

arXiv_CL Bert

Abstract

Today, surrogate-based interpretability methods for black-box Natural Language Processing (NLP) models, like LIME or SHAP, use word-based sampling to build their explanations. In this paper we explore the use of sentences to tackle NLP interpretability. While this choice may seem straightforward, we show that, when using complex classifiers like BERT, the word-based approach raises issues not only of computational complexity but also of out-of-distribution sampling, eventually leading to unfounded explanations. By using sentences, the altered text remains in-distribution and the dimensionality of the problem is reduced, giving better fidelity to the black-box at comparable computational complexity.
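
As a rough illustration of the sentence-level idea, the sketch below perturbs a text by dropping whole sentences instead of single words and scores each sentence by how much its presence shifts the prediction. It is not the authors' implementation: `toy_black_box` is a hypothetical stand-in for a real classifier such as BERT, the regex splitter is deliberately naive, and the difference-of-means score is a simplified substitute for the weighted linear surrogate that LIME fits. Because the example text has only three sentences, all 2^3 = 8 keep/drop masks can be enumerated, whereas masking its 11 words individually would give 2^11 = 2048 combinations.

```python
import itertools
import re


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def toy_black_box(text):
    """Hypothetical stand-in for a real classifier (assumption, not BERT)."""
    lowered = text.lower()
    return lowered.count("great") - lowered.count("terrible")


def sentence_attributions(text, predict):
    """Score each sentence: mean prediction with it minus mean without it."""
    sentences = split_sentences(text)
    # Enumerate every keep/drop mask over sentences (feasible for small counts).
    masks = list(itertools.product([0, 1], repeat=len(sentences)))
    # Each perturbed text keeps whole sentences, so it stays readable.
    scores = [
        predict(" ".join(s for s, keep in zip(sentences, mask) if keep))
        for mask in masks
    ]
    attributions = []
    for i in range(len(sentences)):
        kept = [y for mask, y in zip(masks, scores) if mask[i]]
        dropped = [y for mask, y in zip(masks, scores) if not mask[i]]
        attributions.append(sum(kept) / len(kept) - sum(dropped) / len(dropped))
    return list(zip(sentences, attributions))


review = "The plot was great. The acting was terrible. I left early."
for sentence, weight in sentence_attributions(review, toy_black_box):
    print(f"{weight:+.2f}  {sentence}")
```

For longer documents, exhaustive enumeration would have to give way to random sampling of masks, and a real sentence tokenizer would be a safer choice than the regex used here.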

URL

https://arxiv.org/abs/2012.13189

PDF

https://arxiv.org/pdf/2012.13189.pdf
