Skip to content

Code repository for the manuscript: Nucleotide dependency analysis of DNA language models reveals genomic functional elements

License

Notifications You must be signed in to change notification settings

gagneurlab/dependencies_DNALM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

dependencies_DNALM

Code repository for the manuscript: Nucleotide dependency analysis of DNA language models reveals genomic functional elements

Description

This repository contains code for the manuscript and general code to compute and visualize nucleotide dependencies using DNA language models. Please refer to the notebook compute_and_visualize_dep_maps.ipynb for a quick start , it includes examples and code to:

  • Visualize nucleotide dependency maps for a specific sequence and DNA Language Model
  • Compute variant influence scores for a specific sequence and DNA Language Model

Requirements

SpeciesLM and RiNALMo models require FlashAttention-2 to be installed (https://github.com/Dao-AILab/flash-attention).

Data

Data with intermediate files for the diffferent manuscript notebooks can be found at: https://doi.org/10.5281/zenodo.12982537

About

Code repository for the manuscript: Nucleotide dependency analysis of DNA language models reveals genomic functional elements

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published