This repository contains scripts used for the analysis performed in our manuscript
prime-seq, efficient and powerful bulk RNA-sequencing
Aleksandar Janjic, Lucas E. Wange, Johannes W. Bagnoli, Johanna Geuder, Phong Nguyen, Daniel Richter, Beate Vieth, Christoph Ziegenhain, Binje Vick, Ines Hellmann, Wolfgang Enard
For the full prime-seq protocol please visit protocols.io. For the full list of 384 barcoded oligo dT primers click here
prime-seq is a simple RNA-seq workflow that goes from lysate to sequencing library in no time. We benchmarked it’s performance against the MAQC-III study using power analysis and showed that it captures known biological differences in a differentiation experiment.
The data necessary to reproduce this analysis can be found at ArrayExpress:
Accession | Dataset |
---|---|
E-MTAB-10140 | Beads_Columns_tissue |
E-MTAB-10138 | Beads_Columns_PBMC |
E-MTAB-10142 | Beads_Columns_HEK |
E-MTAB-10141 | gDNA_priming |
E-MTAB-10139 | UHRR |
E-MTAB-10133 | iPSC |
E-MTAB-10175 | AML |
All RNA-seq data was adapter trimmed with cutadapt and preprocessed with zUMIs (Parekh et al., 2017).
Here we summarize the different experiment previous version of prime-seq have been used for in terms of number of samples, species and intron and exon mapped fractions. Next we show that introns can be used for gene expression quantification and are not derived from contaminating gDNA. R Notebooks for this analysis can be found here
We collected data from prime-seq experiments that were performed in the last years during it’s development and show that prime-seq works robustly on many different samples.
1.2 Intronic reads in prime-seq are not derived from gDNA and can be used for expression quantification
To benchmark prime-seq we compared it to a gold standard data set from
the MAQC consortium using
powsimR
. R Notebooks for this
analysis can be found here.
Method sensitivity
Method correlations
Method powsimR
To test the impact of different RNA isolation methods on gene expression we performed prime-seq on three types of input. RNA was isolated from HEK cells, human PBMCs and mouse striatal Tissue with either Columns or SPRI beads. R Notebooks for this analysis can be found here.
Lysis features
Lysis sensitivity
Lysis costs
Lysis DE
Lysis PCA
Low input sensitivity
Low input correlations
R Notebooks for this analysis can be found here
cross-contamination
correlation
cross-contamination
cycles
cross-contamination
simulation
We used prime-seq on many different types of samples already, here we show two examples. The first data set consists of 96 archival AML PDX samples that were sampled using biopsy punching. We show that the biological differences between the patients and AML types can be measured accurately using our method. In a second dataset we compared neuronal differentiation of five iPS cell lines that we generated previously (Geuder et al. 2021). R Notebooks for this analysis can be found here.
AML PDX PCA iPSC to NPC differentiation
Finally we showed the impact of per sample costs on power to detect differentially expressed genes. By enabling the study of many more biological replicates with a fixed budget compared to Illuminas TruSeq kit, prime-seq leverages the full power of bulk RNA-seq. R Notebooks for this analysis can be found here.
This schematic outlines the detailed molecular workflow from isolated RNA to sequencing library.
sessionInfo()
#> R version 4.1.0 (2021-05-18)
#> Platform: x86_64-pc-linux-gnu (64-bit)
#> Running under: Devuan GNU/Linux 3 (beowulf)
#>
#> Matrix products: default
#> BLAS: /usr/lib/x86_64-linux-gnu/openblas/libblas.so.3
#> LAPACK: /usr/lib/x86_64-linux-gnu/libopenblasp-r0.3.5.so
#>
#> locale:
#> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
#> [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
#> [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
#> [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
#> [9] LC_ADDRESS=C LC_TELEPHONE=C
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
#>
#> attached base packages:
#> [1] stats graphics grDevices utils datasets methods base
#>
#> loaded via a namespace (and not attached):
#> [1] compiler_4.1.0 magrittr_2.0.1 fastmap_1.1.0 tools_4.1.0
#> [5] htmltools_0.5.2 yaml_2.2.1 stringi_1.7.4 rmarkdown_2.11
#> [9] knitr_1.36 stringr_1.4.0 xfun_0.28 digest_0.6.28
#> [13] rlang_0.4.12 evaluate_0.14