-
Notifications
You must be signed in to change notification settings - Fork 0
Data Overview
Currently, separate EVE catalog files are maintained for vertebrates, invertebrates, plants, and fungi.
EVE catalog files include the following columns:
-
assigned_name
: The name of the sequence in the EVE set that the hit matched to. -
eve-type
: The first component of the EVE ID - the type of EVE (see glossary). -
eve-taxonomy
: The virus group from which the EVE derives. -
eve-numeric-id
: A positive integer that unambiguously identifies the locus across all species in which it occurs as an ortholog. -
eve-distribution
: Known distribution of the EVE across hosts. For example, if found in all species in the taxonomic order Carnivora, it would be 'Carnivora'. -
virus_genome_type
: Baltimore classification of the virus genome. -
virus_family
: The virus family from which the EVE derives. -
virus_genus
: The virus genus from which the EVE derives (if known). -
organism
: The host species in which this EVE copy was identified. -
tax_class
: The taxonomic class of the host species. -
superorder
: The taxonomic superorder of the host species. -
tax_order
: The taxonomic order of the host species. -
family
: The taxonomic family of the host species. -
genus
: The taxonomic genus of the host species. -
target_version
: Genome assembly version in which this copy was identified. -
scaffold
: Scaffold in which the EVE was identified. -
extract_start
: Extraction coordinates: start. -
extract_end
: Extraction coordinates: end. -
orientation
: The orientation of the EVE in the raw contig (forward/reverse). -
bitscore
: The bit score of the match of the hit. -
identity
: The identity of the hit. -
sequence_length
: The length of the hit.
In DIGS-for-EVEs, all EVE sequences identified via systematic BLAST search are 'classified' by comparison to a reference sequence library (also using BLAST). EVE reference or consensus sequences can be added to this library to help classify EVEs into sets of orthologous loci, and apply systematic nomeclature (see below). The last three fields (bitscore
, identity
, and sequence_length
) refer to BLAST matching between the EVE sequence being described, and the reference and/or consensus sequences to which this sequence most closely matched.
DIGS-for-EVEs by Robert J Gifford Lab.
For questions, issues, or feedback, please open an issue on the GitHub repository.