Skip to main navigation Skip to search Skip to main content

Uncovering Semantic Bias in Neural Network Models Using a Knowledge Graph

  • Andriy Nikolov
  • , Mathieu D'Aquin

Research output: Chapter in Book or Conference Publication/ProceedingConference Publicationpeer-review

Abstract

While neural networks models have shown impressive performance in many NLP tasks, lack of interpretability is often seen as a disadvantage. Individual relevance scores assigned by post-hoc explanation methods are not sufficient to show deeper systematic preferences and potential biases of the model that apply consistently across examples. In this paper we apply rule mining using knowledge graphs in combination with neural network explanation methods to uncover such systematic preferences of trained neural models and capture them in the form of conjunctive rules. We test our approach in the context of text classification tasks and show that such rules are able to explain a substantial part of the model behaviour as well as indicate potential causes of misclassifications when the model is applied outside of the initial training context.
Original languageEnglish (Ireland)
Title of host publicationProceedings of the 29th ACM International Conference on Information \ Knowledge Management
PublisherACM
Number of pages9
DOIs
Publication statusPublished - 1 Jan 2020

Fingerprint

Dive into the research topics of 'Uncovering Semantic Bias in Neural Network Models Using a Knowledge Graph'. Together they form a unique fingerprint.

Cite this