Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
HelloImBart authored Jun 28, 2018
1 parent 16f5b23 commit a406f38
Showing 1 changed file with 12 additions and 7 deletions.
19 changes: 12 additions & 7 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,18 +1,23 @@
# influenza_codon_usage
Data and code for Smith et al., Avian Influenza Virus PB1 Gene in H3N2 Viruses Evolved in Humans To Reduce Interferon Inhibition by Skewing Codon Usage toward Interferon-Altered tRNA Pools, mBio 2018


The folder "sequences" contains the sequences for the PB1 influenza constructs generated, as well as files containing the number of each codon used in each influenza isolate we analyzed.
# Sequences
This folder contains the sequences for the PB1 influenza constructs generated, as well as files containing the number of each codon used in each influenza isolate we analyzed.
The GISAID database does not allow sharing of sequences downloaded, so the acknowledgement tables for the GISAID sequences used are made available instead.
To run our analysis yourself, simply download the desired datasets from GISAID, and first run the script "prep-codoncounts.sh" on the fasta files. Following that, run the "codoncounts.py" on the output from "prep-codoncounts.sh" to generate a file containing the number of each codon used in each isolate of your chosen sequences.

The folder "tRNA-seq results" contains the codon and anti-codon counts we determined for each tRNA sequencing library we prepared in .csv files.
# tRNA Sequencing
This folder contains the codon and anti-codon counts we determined for each tRNA sequencing library we prepared in .csv files. It also contains the aggregated codon counts based on isoaccepting tRNA availability.

The folder "rtAI-CAI" contains the analysis of the CAI and rtAI of influenza genes. "allserofigures.Rmd" contains the scripts for all serotypes we analyzed. "H3N2figures.Rmd" contains scripts for our more extensive analysis of the PB1 segment of H3N2.
# rtAI-CAI
This folder contains the analysis of the CAI and rtAI of influenza genes in R Markdown scripts. "allserofigures.Rmd" contains the scripts for all sequences we analyzed. "H3N2figures.Rmd" contains scripts for our more extensive analysis of the PB1 segment of H3N2.

The folder "reshuffling tests" contains the scripts required to carry out the reshuffling test described in manuscript. The reshuffling test requires positional sequence information, and so needs to have its' codon information presented in a different format. "tidyitup-individual-sites.py" will do this to a prepped .fasta file.
# reshuffling test
This folder contains the R Markdown scripts required to carry out the reshuffling test described in manuscript. The reshuffling test requires positional sequence information, and so needs to have its' codon information presented in a different format. "tidyitup-individual-sites.py" will do this to a prepped .fasta file.
"reshuffle.Rmd" contains the scripts for the reshuffling test.

The folder "growth curves" contains data in .csv format on the growth curves performed. "growthcurves-figure.Rmd" contains the R scripts used to analyze and visualize the growth curves.
# growth curves
This folder contains data in .csv format on the growth curves performed. "growthcurves-figure.Rmd" contains the R Markdown scripts used to analyze and visualize the growth curves.

The folder "RT-PCR" contains the raw RT-PCR data in .csv format. "RT-PCR figures.Rmd" contains the scripts used to analyze and visualize the RT-PCR data.
# RT-PCR
This folder contains the raw RT-PCR data in .csv format. "RT-PCR figures.Rmd" contains the R Markdown scripts used to analyze and visualize the RT-PCR data.

0 comments on commit a406f38

Please sign in to comment.