GitHub - stewart-lab/GSVApy: A Python wrapper around the GSVA algorithm

GSVApy: A Python wrapper for the GSVA algorithm

This is a Python script for running the Gene Set Variation Analysis (GSVA) algorithm. Specifically, it is a wrapper around the GSVA R package.

Specifically, given a genes-by-samples gene expression matrix, this algorithm outputs a gene_set-by-sample matrix with an enrichment score for each gene set within each sample.

Dependencies

This script requires the following Python packages as described in requirements.txt:

rpy2
optparse
pandas

This script requires the following R packages:

GSVA

I recommend installing these packages in a conda virtual environment. I have found the interaction between R and Python through rpy2 to be a bit fragile when using global installations.

Running on the command line

The run_gsva.py script can be run on the command as follow:

Usage: python run_gsva.py <input_expression_data> <input_GMT_gene_set_file>

Options:
  -h, --help            show this help message and exit
  -t, --transpose       Take transpose of input
  -d DISTRIBUTION, --distribution=DISTRIBUTION
                        Distribution to use in GSVA {'Poisson' or 'Gaussian'}
  -o OUT_FILE, --out_file=OUT_FILE
                        Output file

We provide an example gene expression dataset in example_data/example.tsv. We also provide gene sets corresponding to Gene Ontology (GO) terms in gene_sets/c5.bp.v7.1.symbols.gmt. To run GSVA on this example data using these gene sets, one would run the command:

python run_gsva.py -t -d Poisson -o example_GSVA_output.tsv example_data/example.tsv gene_sets/c5.bp.v7.1.symbols.gmt

Note, the -t parameter is required here because the input expression matrix is a samples-by-genes matrix rather than a genes-by-samples matrix and thus, we must take the transpose of the matrix before passing it to GSVA.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
GSVA_example_output		GSVA_example_output
Histories_Commands		Histories_Commands
example_data		example_data
gene_sets		gene_sets
outTrimmedGeneSets/2023_05_05_14_50_14		outTrimmedGeneSets/2023_05_05_14_50_14
out_mann_whitney/2023_05_05_15_32_18		out_mann_whitney/2023_05_05_15_32_18
README.md		README.md
exampleCommandLines.txt		exampleCommandLines.txt
requirements.txt		requirements.txt
runMannWhitneyCommandLine.txt		runMannWhitneyCommandLine.txt
runMannWhitney_Rms.py		runMannWhitney_Rms.py
run_gsva.py		run_gsva.py
trimGeneSets.py		trimGeneSets.py
trimGeneSetsCommandLine.txt		trimGeneSetsCommandLine.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GSVApy: A Python wrapper for the GSVA algorithm

Dependencies

Running on the command line

About

Releases

Packages

Languages

stewart-lab/GSVApy

Folders and files

Latest commit

History

Repository files navigation

GSVApy: A Python wrapper for the GSVA algorithm

Dependencies

Running on the command line

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages