Skip to content
Robert J. Gifford edited this page Oct 13, 2024 · 48 revisions

DIGS-for-EVEs

Welcome to DIGS-for-EVEs: Revealing the Viral Fossil Record


Endogenous viral elements (EVEs) are viral DNA fragments that have become permanently integrated into the genomes of host organisms as a result of ancient viral infections, and are inherited across generations. Many of these sequences are millions of years old, and these ancient viral remnants effectively serve as a viral “fossil record” preserving invaluable information about past viral lineages and their historical interactions with hosts. Studying EVEs sheds light on viral evolution, host-virus co-evolution, and the long-term influence of viral infections on the genetic diversity of eukaryotes.


The DIGS-for-EVEs repository offers a meticulously curated catalog of EVE sequences identified from publicly available genomic data, using the Database-Integrated Genome Screening (DIGS) Tool.


By providing organized, accessible data, DIGS-for-EVEs aims to support research into the evolutionary history of viruses and their lasting genetic impact on their hosts.


Project Goals

DIGS-for-EVEs aims to make the viral fossil record more accessible to researchers by:

  • Creating a Comprehensive EVE Catalog: Organizing EVE sequences into a systematic and searchable database, complete with detailed annotations and metadata.
  • Standardizing EVE Nomenclature: Applying a consistent naming convention to ensure that each EVE can be accurately identified and compared across different species and research contexts.
  • Facilitating Comparative Genomic Analysis: Providing tools and resources to support the study of orthologous and paralogous relationships among EVEs in different species.

Key Features

  • Systematic Identification: EVEs in this catalog were identified using Database-Integrated Genome Screening (DIGS), a high-throughput computational approach that can be used to systematically screen WGS data for viral sequences, capturing hits in a relational database.
  • Curated Annotations: Each EVE is annotated with information about its viral origin, host species, genomic location, and evolutionary relationships.
  • Phylogenetic Analysis: In-depth phylogenetic investigations were conducted to classify EVEs into appropriate virus taxonomic groups and to reveal their evolutionary history.
  • Ortholog Detection: Comparative genomic analysis was performed to identify orthologous EVE loci in different species, enhancing our understanding of their evolutionary significance.

How to Use DIGS-for-EVEs

Researchers and scientists can use the DIGS-for-EVEs catalog as a reference resource to:

  • Explore EVE Sequences: Search and browse the catalog to find specific EVEs and examine their annotations, genomic locations, and evolutionary relationships.
  • Analyze Viral Evolution: Utilize the catalog's phylogenetic data to study the origins and diversification of viral lineages in host genomes.
  • Identify Novel EVEs: Access tools and scripts that can aid in the discovery of new EVE sequences in previously unexamined genome data.

Utility Scripts and Tools

The DIGS-for-EVEs repository includes a collection of utility scripts designed to help researchers interact with the EVE catalog and perform data analysis tasks. These scripts offer functionalities such as:

  • Namespace Management: Easily view the used and available namespaces for EVEs to understand their naming conventions.
  • Data Querying: Efficiently extract specific information from the catalog based on user-defined criteria.
  • Phylogenetic Analysis Tools: Streamline the process of performing phylogenetic analysis on EVE sequences.

Why Study EVEs?

Studying EVEs is crucial for understanding the hidden history of viral infections and their impact on host genomes. By analyzing these viral fossils, researchers can uncover:

  • Evolutionary Patterns: Insights into how viruses have evolved and adapted to different host species over time.
  • Host-Virus Interactions: Evidence of the molecular arms race between viruses and their hosts, shaping immune responses and genetic defenses.
  • Genomic Innovations: The role of viral sequences in driving genomic innovation and the emergence of new biological functions in hosts.

Get Involved

DIGS-for-EVEs is an open-ended and collaborative project that continues to evolve as new genomic data and viral discoveries emerge. We welcome contributions from the research community to help expand and refine this resource. To get involved:

  • Contribute Data: Submit newly identified EVEs or suggest updates to the existing catalog.
  • Collaborate on Research: Reach out to discuss potential research collaborations using the DIGS-for-EVEs resource.
  • Report Issues: Help us improve by reporting any issues or inconsistencies you find in the catalog.

Stay Updated

As the field of genomics advances, DIGS-for-EVEs will be continuously updated to include the latest data and insights. Stay tuned for updates on newly identified EVEs, revised classifications, and expanded datasets.


Clone this wiki locally