Local Interpretations for Explainable Natural Language Processing: A Survey


연구 분야: Artificial Intelligence



학회: ACM Computing Surveys, Volume 56, Issue 9


초록

As the use of deep learning techniques has grown across various fields over the past decade, complaints about the opaqueness of the black-box models have increased, resulting in an increased focus on transparency in deep learning models. This work investigates various methods to improve the interpretability of deep neural networks for Natural Language Processing (NLP) tasks, including machine translation and sentiment analysis. We provide a comprehensive discussion on the definition of the term interpretability and its various aspects at the beginning of this work. The methods collected and summarised in this survey are only associated with local interpretation and are specifically divided into three categories: (1) interpreting the model’s predictions through related input features; (2) interpreting through natural language explanation; (3) probing the hidden states of models and word representations.


Author Profile
Siwen Luo

The University of Western Australia Perth Australia

Australia
Author Profile
Hamish Ivison

University of Washington Seattle United States

United States
Author Profile
Soyeon Caren Han

The University of Melbourne Melbourne Australia

Australia

📄 논문 정보

발행 연도 2024년
인용수 12
출판 국가 Australia, United States
사이트 ACM
좋아요 수 0

연관 논문 목록 (409건)