Skip to content

Latest commit

 

History

History
140 lines (89 loc) · 6.59 KB

README.md

File metadata and controls

140 lines (89 loc) · 6.59 KB

MirCure: A tool for quality control, filter, and curation of microRNAs of animals and plants

GitHub last commit pubstatus License: GPL v3

  • Guillem Ylla
  • Tianyuan Liu
  • Ana Conesa

About MirCure

MirCure is a computational application to assist on filtering and curating microRNA annotations obtained from databases or de novo miRNA annotation tools.

Given a list of miRNA candidates, MirCure evaluates a number of miRNA-specific features based on gene expression, biogenesis, and conservation data, and generates a score that can be used to discard poorly supported miRNA annotations. MirCure can also curate and adjust the annotation of the 5p and 3p arms based on user-provided small RNA-seq data. We evaluated MirCure on a set of manually curated animal and plant microRNAs and demonstrated great accuracy. Moreover, we show that MirCure can be used to revisit previous bona fide miRNAs annotations to improve microRNA databases.

We provide MirCure as a Shinny App, which can easily be executed as an R function or Docker container.

Video Tutorial

Introduction

  • The miRNAs are short (~22nts) non-coding RNAs that bind to target messenger RNA (mRNAs) to block their translation and trigger degradation. Due to their precise and powerful mechanism of action, miRNAs are involved in the regulation of a myriad of biological processes and are involved in numerous diseases.
  • Prediction of miRNA genes is a common step in genome annotation pipelines. Now, there are lots of false-positives in the prediction step.
  • Currently, the few sources of reliable miRNA annotations are those that went through an extensive manual curation process. This process involves running different algorithms and collecting different pieces of information that are then put together for human assessment. This is a tedious and time-consuming effort.
  • To assist researchers in the selection of bona fide miRNAs out of a list of putative miRNA annotations, we developed MirCure. MirCure is a Shiny tool for quality control and filtering of both existing miRNA annotations and de novo miRNA predictions. MirCure obtains a quality score based on the automatic evaluation of different sources of miRNA evidence that can be used to accept, reject, or revisit miR-NA calls. Moreover, MirCure generates a visual report for each miRNA candidate in which all the relevant miRNA biogenesis criteria are represented, thereby greatly facilitating the curation process. We validate MirCure on a set of extensively curated miRNA annotations both for animals and plants. To our knowledge, MirCure is the first tool to perform automatic quality control of miRNA annotations based on small RNA-seq and strict biogenesis criteria.

Basic Strategy

  • MirCure uses small RNA-seq coverage, structural information, and sequence conservation to calculate a quality score that can be use to select robustly supported miRNA annotations.

  • Example of some of the features that MirCure analyzes:

Figure 1

  • A real microRNA should have:
    1. Hairpin-like secondary structure of the precursor sequence.
    2. Two miRNA between 20-26 nts long with at least 16 complementary nts.
    3. The two-arm (mature/star) arm should have two nts overhang and both 3' extremes.
    4. RNA-seq evidence supporting the mature/star arms.
    5. ow proportion of reads within the loop region and precursor flanking regions.
    6. In plants, we expect higher seqeunce conservation.

Figure 2

How to Install and Run MirCure

There are two main options to run MirCure:

  1. Run MirCure Docker image (recommended)
  2. Install and run MirCure as R package.

Run MirCure Docker Image

Docker Cloud Build Status

  • You just need to install the Docker engine.
  • Run MirCure with the following command:
docker run --rm -p 3838:3838 \
-v LOCAL_Genome_DIR:/srv/shiny-server/data/genomes \
-v LOCAL_BAMFILE_DIR:/srv/shiny-server/data/bamfiles \
-v LOCAL_DIR_TO_SAVE_MIRCUREREPORTS:/srv/shiny-server/reports\
    conesalab/mircure
  • Where:

    • LOCAL_Genome_DIR is the path to directory where the genome fasta file is located.
    • LOCAL_BAMFILE_DIR is the path to directory where the bam file is located.
    • LOCAL_DIR_TO_SAVE_MIRCUREREPORTS path to directory where MirCure pdf reports will be saved.
  • Open the URL that the terminal will return (typically http://[::]:3838) in a web browser.

MirCure as R package

  • MirCure requires 2 external dependencies.

  • Install MirCure as R package.

library(devtools)
install_github("ConesaLab/MirCure")
  • Launch MirCure as R function
library("MirCure")
runMirCure()

Example Files

Download the example files to test MirCure. https://uflorida-my.sharepoint.com/:u:/g/personal/tianyuan_liu_ufl_edu/EecwMk1SxX9EluiTkOojjHQBIBSbPBe12hA9ZTMTnXf0Dg?e=Uk4Bet

Input Data for MirCure

  • MirCure Inputs

    1. Genome (fasta).
    2. microRNA annotations to evaluate, in 3 gff files. One with the precursor cordinates, and two with the 2 miRNA arms (mature and star, or 3' and 5').
    3. Small RNA-seq data mapped to the genome (bam)
  • Some Scripts to help you create a bam file in How to create bam files.RTF

Auxiliary scripts

Citation

Ylla, G., Liu, T. and Conesa, A. (2020). MirCure: a tool for quality control, filter and curation of microRNAs of animals and plants, Bioinformatics. Oxford University Press. doi: 10.1093/bioinformatics/btaa889.