Towards-Vision-Language-Mechanistic-Interpretability

This repository contains an implementation of Causal Mediation Analysis for the vision-language transformer BLIP. It focuses on the Visual Question Answering (VQA) task of colour identification, using images and questions from the COCO-QA dataset introduced in the paper "Exploring Models and Data for Image Question Answering".

BLIPforVQA (BLIP stands for Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation) is a sequential encoder-decoder model. It takes an image and a question as input: the image is sent to an Image Encoder and the question to an Image Grounded Question Encoder.

The Image Grounded Question Encoder also receives the image embeddings produced by the Image Encoder and uses them to generate Question Embeddings. These Question Embeddings are passed to the Answer Decoder together with a BOS (Beginning-Of-String) token ID, [Decode], which lets it decode the tensors into open-ended answers.

BLIP for VQA Architecture
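
The data flow described above can be sketched with a toy example. This is only an illustration under loud assumptions: the trained encoders are replaced by random projections, the dimensions are arbitrary, and the names `image_encoder`, `cross_attention`, and `question_encoder` are hypothetical stand-ins rather than BLIP's actual modules. It shows only the wiring, namely that the question encoder receives both the question tokens and the image embeddings.

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8  # toy hidden size (assumption; not BLIP's real width)

# Random projections stand in for trained weights (assumption).
W_img = rng.normal(size=(16, D))  # maps 16-dim toy patch features to D
W_txt = rng.normal(size=(10, D))  # toy embedding table for a 10-word vocabulary


def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def image_encoder(patches):
    """Toy image encoder: (num_patches, 16) -> (num_patches, D)."""
    return np.tanh(patches @ W_img)


def cross_attention(queries, keys_values):
    """Single-head scaled dot-product attention from text to image."""
    scores = queries @ keys_values.T / np.sqrt(D)
    return softmax(scores, axis=-1) @ keys_values


def question_encoder(token_ids, image_embeds):
    """Toy image-grounded question encoder.

    Text embeddings attend to the image embeddings, so the output
    depends on both the question and the image.
    """
    text = W_txt[token_ids]
    return text + cross_attention(text, image_embeds)


patches = rng.normal(size=(4, 16))  # 4 toy image patches
token_ids = np.array([1, 5, 7])     # 3 toy question tokens

image_embeds = image_encoder(patches)
question_embeds = question_encoder(token_ids, image_embeds)
print(image_embeds.shape, question_embeds.shape)
```

Because the question embeddings are computed from the image embeddings, any change to the image embeddings propagates into the question encoder's output. That dependency is what the causal tracing procedure probes.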

Our causal tracing method adds noise to the image embeddings and feeds a batch of two image embeddings into the Question Encoder: one uncorrupted and one corrupted. We then hook the outputs of different layers inside the encoder and patch the uncorrupted states into the corrupted run.

Visualization of Patching of States for Interpretability
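
The patching procedure can be illustrated with a small self-contained sketch. It is not the repository's code: the encoder is a toy residual stack with random weights, the image is reduced to a mean-pooled summary instead of real cross-attention, a scalar read-out replaces the answer decoder, and the noise scale is arbitrary. All names (`forward`, `patch_layer`, `effects`) are hypothetical. What it does share with the description above is the structure: a batch of `[clean, corrupted]` image embeddings, and an overwrite of the corrupted row's state with the clean row's state at one chosen layer.

```python
import numpy as np

rng = np.random.default_rng(0)
NUM_LAYERS, T, P, D = 4, 3, 5, 8  # layers, question tokens, image patches, width

# Random weights stand in for a trained encoder (assumption).
W_self = rng.normal(scale=0.5, size=(NUM_LAYERS, D, D))
W_cross = rng.normal(scale=0.5, size=(NUM_LAYERS, D, D))
w_out = rng.normal(size=D)  # toy read-out in place of the answer decoder


def forward(text, image_batch, patch_layer=None):
    """Run the toy encoder on a batch of [clean, corrupted] image embeddings.

    If patch_layer is given, the corrupted row's output at that layer is
    overwritten with the clean row's output, mimicking a forward hook.
    Returns one scalar score per batch row.
    """
    h = np.repeat(text[None], image_batch.shape[0], axis=0)  # (B, T, D)
    for layer in range(NUM_LAYERS):
        # Mean pooling is a simplification of cross-attention (assumption).
        img_summary = image_batch.mean(axis=1, keepdims=True)  # (B, 1, D)
        h = h + np.tanh(h @ W_self[layer] + img_summary @ W_cross[layer])
        if layer == patch_layer:
            h[1] = h[0]  # patch: clean state -> corrupted run
    return h.mean(axis=1) @ w_out  # (B,)


text = rng.normal(size=(T, D))
clean_image = rng.normal(size=(P, D))
noise = rng.normal(scale=3.0, size=(P, D))
image_batch = np.stack([clean_image, clean_image + noise])  # [clean, corrupted]

clean_score, corrupted_score = forward(text, image_batch)

# Effect of each layer: how much patching that layer shifts the
# corrupted run's score relative to the unpatched corrupted run.
effects = np.array([
    forward(text, image_batch, patch_layer=layer)[1] - corrupted_score
    for layer in range(NUM_LAYERS)
])
print(effects)
```

In this toy, patching the final layer restores the clean score exactly, while patching earlier layers restores it only partially because later layers still see the corrupted image. Collecting one such value per patched location is the kind of quantity a causal trace heatmap displays.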

The following are the results of causal tracing on a couple of samples alongside the corresponding images:
Example 1:

COCO-QA ID:000000220218 Image and Causal Trace Heatmap

Example 2:

COCO-QA ID:000000458864 Image and Causal Trace Heatmap


vedantpalit/Towards-Vision-Language-Mechanistic-Interpretability
