Interpretable Neural Networks with Frank-Wolfe: Sparse Relevance Maps and Relevance Orderings

This repository provides the official implementation of the paper Interpretable Neural Networks with Frank-Wolfe: Sparse Relevance Maps and Relevance Orderings by J. Macdonald, M. Besançon and S. Pokutta (2021).

TL;DR: We use a constrained optimization formulation of the Rate-Distortion Explanations (RDE) (Macdonald et al., 2019) method for relevance attribution and Frank-Wolfe algorithms for obtaining interpretable neural network predictions.

Content

This repository contains subfolders with code for two independent experimental scenarios.

mnist : Sparse relevance maps (relevance attribution) and relevance orderings for a relatively small LeNet-inspired neural network classifier on the MNIST dataset of greyscale images of handwritten digits.
stl10 : Sparse relevance maps (relevance attribution) for a larger VGG-16 based neural network classifier on the STL-10 dataset of color images.

Requirements & Setup

The package versions we used are specified in Project.toml, Manifest.toml, and setup.jl.
To reproduce our computational environment run:

julia setup.jl

To test the installation run:

test_installation.jl

This should print all the installed Julia and Python packages.

Usage

The script rde.jl can be used to obtain sparse relevance mappings.

The script rde_birkhoff.jl can be used to obtain relevance orderings with deterministic Frank-Wolfe algorithms.

The script rde_birkhoff_stochastic.jl can be used to obtain relevance orderings with stochastic Frank-Wolfe algorithms.

License

This repository is MIT licensed, as found in the LICENSE file.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Interpretable Neural Networks with Frank-Wolfe: Sparse Relevance Maps and Relevance Orderings

Content

Requirements & Setup

Usage

License

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
mnist		mnist
stl10		stl10
.gitignore		.gitignore
CITATION.bib		CITATION.bib
LICENSE		LICENSE
Manifest.toml		Manifest.toml
Project.toml		Project.toml
README.md		README.md
custom_oralces.jl		custom_oralces.jl
environment.yml		environment.yml
rde.jl		rde.jl
rde_birkhoff.jl		rde_birkhoff.jl
rde_birkhoff_stochastic.jl		rde_birkhoff_stochastic.jl
rde_mnist_stl10.png		rde_mnist_stl10.png
setup.jl		setup.jl
test_installation.jl		test_installation.jl

License

ZIB-IOL/fw-rde

Folders and files

Latest commit

History

Repository files navigation

Interpretable Neural Networks with Frank-Wolfe: Sparse Relevance Maps and Relevance Orderings

Content

Requirements & Setup

Usage

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages