The repository contains data and scripts to carry out all analysis in Shapiro et al., "Multilayer Networks of Plasmid Genetic Similarity Reveal Potential Pathways of Gene Transmission" (ISME J 2023, 10.1038/s41396-023-01373-5 https://rdcu.be/c8AK6)
All scripts tested in R version 4.1.1
This repository contains all files needed to run analysis on plasmid rumen networks. To begin, download and unzip the StartingFiles.zip
folder and run analyses sequentially starting with script 01_Initial_data_processing.
Scripts can be started independently using the Rdata saved in the Rdata_Intermediate_files.zip
folder. Rdata files are saved to a folder corresponding to the name of the script in which they were created, e.g. all Rdata outputs created in 03_Network_setup.R that are used downstream can be found in the folder Rdata_Intermediate_files/Outputs_Script03_Network_setup.zip.
Each script is sub-divided to sections. All the scripts are fully commented.
All scripts can be run on a local computer, besides the transmission model (scripts 15-18), which requires use of HPC and was run on the Ben-Gurion University server.
For convenience, we include a few result files that give direct information without needing to go through the whole analysis.
'plas.2k.name.node.id.csv
: A table with plasmid nam,es and idsplasmid.2k.metadat.csv
: A table describing which plasmid occurs in which cow.net.dat2k.ew.csv
: A table of edge lists (the network). In an extended edge list formatplas_mods.df.csv
: A table with the module assignment of each plasmid.ann.metadat.v1.csv
: A table with module assignments and ORF annotations.KOs_in_plasmids.txt
: A table with KEGG orthologies.