SARS-CoV-2 population dynamics in immunocompetent individuals in a closed transmission chain shows genomic diversity over the course of infection
Hannah Goldswain1, Rebekah Penrice-Randal1, I’ah Donovan-Banfield1, Craig W. Duffy1, Xiaofeng Dong1, Nadine Randle1, Yan Ryan1, Alex Rzeszutek2, Jack Pilgrim2, Emma Keyser3, Simon A. Weller3, Emma J. Hutley4, Catherine Hartley1, Tessa Prince1, Alistair C. Darby1, Niall Aye Maung5, Henry Nwume3, Julian A. Hiscox1,6, and Stevan R. Emmett3.
1Institute for Infection, Veterinary and Ecological Sciences, University of Liverpool, UK.
2Centre for Genomic Research, University of Liverpool, UK.
3Defence Science Technology Laboratory, Porton Down, UK.
4Centre for Defence Pathology, OCT Centre, Royal Centre for Defence Medicine, Birmingham, B15 2WB.
5British Army, Hunter House, St Omer Barracks, Aldershot, Hampshire, GU11 2BG
6ASTAR Infectious Diseases Laboratories (ASTAR ID Labs), Agency for Science, Technology and Research (A*STAR), Singapore. \
Sequencing of longitudinal nasopharyngeal swab samples from immunocompetent participants in an isolated geographical location
- 50 participant samples sequenced via NimaGen-Illumina sequencing (Coolen et al., 2021).
- EasySeq pipeline used to analyse data (https://github.com/JordyCoolen/easyseq_covid19).
- Sequences filtered at 85% coverage across the genome.
- Further downstream analysis using DiversiTools (https://josephhughes.github.io/DiversiTools/tutorial.html), Pangolin (https://github.com/cov-lineages/pangolin), Snipit (https://github.com/aineniamh/snipit) and Nextclade (https://clades.nextstrain.org/).
- Plotting of outputs using custom R scripts in this repository.
heatmap_updated_paper.R # developed by HG/IDB
letrs_paper.R # developed by HG
nonsyn_lineplot_paper.R # developed by HG/IDB
prop_diff_colours_paper.R # developed by HG/IDB
top_sndaa_paper.R # developed by HG/IDB
Contributors: XD- Xiaofeng Dong, RPR- Rebekah Penrice-Randal, IDB- I'ah Donovan Banfield, HG- Hannah Goldswain
The example data includes files used in the R scripts and generated by the Syn_NonSyn_parse_aa_V3.pl script:
- Participant01_S1.final_AA_parse.txt
- Participant01_S1.final_AA.txt
- Participant01_S1.final_entropy.txt
Raw fastq files have been deposited in the National Center for Biotechnology Information (NCBI) Short Read Archive (SRA) under the bioproject: PRJNA1012698. Accessible here: https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA1012698
Hannah Goldswain at [email protected], Julian Hiscox at [email protected].