SYNOPSIS

sn_detectContamination-kraken.pl

Attempts to guess a taxon for each sample and then lists at most one major contaminant for the sample.

Software requirements

Kraken1
Database formatted for Kraken1

Algorithm

species ID

Reads Kraken results from raw reads or from assemblies. Starting with species, if at least 25% of the reads correspond with one species taxon, then records the majority species for the sample. If 25% are not captured at species, then it moves onto genus, etc.

contamination ID

For the rank that is identified (species, genus, etc), if at least 1% of reads conflict with the species identified, lists the major conflicting taxon.

Outputs

Table with columns

sample
assumed taxon
best-fitting taxon
percent of reads that match the best-fitting taxon
rank (S is species, G is genus, F is family, ...)
major conflicting taxon
percent of reads supporting the major conflicting taxon

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sn_detectContamination-kraken.pl.md

sn_detectContamination-kraken.pl.md

SYNOPSIS

Software requirements

Algorithm

species ID

contamination ID

Outputs

Files

sn_detectContamination-kraken.pl.md

Latest commit

History

sn_detectContamination-kraken.pl.md

File metadata and controls

SYNOPSIS

Software requirements

Algorithm

species ID

contamination ID

Outputs