Behind the mask: Masquerading the reason for prediction

Tomohiro Koide, Masato Uchida

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Ensuring the interpretability of the machine-learning-based prediction models is important for gaining users’ trust. The current representative algorithms for ensuring interpretability, LIME and SHAP, explain the prediction of a given black-box machine learning model based on a common perturbation mechanism to input data. In this study, we propose a masquerade layer to nullify this perturbation and hide the reason for prediction. Our proposed masquerade layer can be attached to any prediction models. It can also be attached without altering the prediction model itself, making it possible to manipulate the explanation provided by the interpretability algorithm in a manner that hardly changes the prediction model behavior. The experimental results show that existing representative perturbation-based interpretability algorithms have a critical weaknesses in terms of their reliability.

Original languageEnglish
Title of host publicationProceedings - 2021 IEEE 45th Annual Computers, Software, and Applications Conference, COMPSAC 2021
EditorsW. K. Chan, Bill Claycomb, Hiroki Takakura, Ji-Jiang Yang, Yuuichi Teranishi, Dave Towey, Sergio Segura, Hossain Shahriar, Sorel Reisman, Sheikh Iqbal Ahamed
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages475-481
Number of pages7
ISBN (Electronic)9781665424639
DOIs
Publication statusPublished - 2021 Jul
Event45th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2021 - Virtual, Online, Spain
Duration: 2021 Jul 122021 Jul 16

Publication series

NameProceedings - 2021 IEEE 45th Annual Computers, Software, and Applications Conference, COMPSAC 2021

Conference

Conference45th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2021
Country/TerritorySpain
CityVirtual, Online
Period21/7/1221/7/16

Keywords

  • Adversarial attacks
  • Explainability
  • Interpretability
  • Machine learning

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Software

Fingerprint

Dive into the research topics of 'Behind the mask: Masquerading the reason for prediction'. Together they form a unique fingerprint.

Cite this