sn_detectContamination-kraken.pl
Attempts to guess a taxon for each sample, then lists at most one major contaminant for that sample.
- Kraken1
- Database formatted for Kraken1
Reads Kraken results from raw reads or from assemblies. Starting at the species rank, if at least 25% of the reads are assigned to a single species, that species is recorded as the majority taxon for the sample. If no species reaches 25%, the script moves on to genus, and so on up the ranks.
For the rank that is identified (species, genus, etc.), if at least 1% of reads conflict with the taxon identified at that rank, the major conflicting taxon is listed.
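The rank-walking logic described above can be sketched roughly as follows. This is an illustrative Python sketch, not the script's actual Perl code. The function name `guess_taxon`, the input layout (percent of reads per taxon, grouped by rank code), and the rank order beyond S, G, F are assumptions. The sketch also assumes that the 1% threshold applies to the single largest conflicting taxon; the text could equally mean the sum of all conflicting reads.

```python
# Rank codes from most to least specific. S, G and F come from the
# table description; the remaining codes and their order are assumed.
RANKS = ["S", "G", "F", "O", "C", "P", "K", "D"]


def guess_taxon(percent_by_rank, min_majority=25.0, min_conflict=1.0):
    """Pick the first rank with a taxon holding >= min_majority percent of reads.

    percent_by_rank is a hypothetical structure:
        {rank_code: {taxon_name: percent_of_reads}}
    Returns a dict describing the best taxon and, if present, the major
    conflicting taxon at the same rank, or None if no rank qualifies.
    """
    for rank in RANKS:
        taxa = percent_by_rank.get(rank, {})
        if not taxa:
            continue
        # Sort taxa at this rank by percent of reads, largest first.
        ranked = sorted(taxa.items(), key=lambda item: item[1], reverse=True)
        best_taxon, best_percent = ranked[0]
        if best_percent < min_majority:
            continue  # nothing dominant here, try the next broader rank

        # Assumption: the runner-up counts as a contaminant only if it
        # alone reaches min_conflict percent of reads.
        conflict_taxon, conflict_percent = "", 0.0
        if len(ranked) > 1 and ranked[1][1] >= min_conflict:
            conflict_taxon, conflict_percent = ranked[1]

        return {
            "rank": rank,
            "taxon": best_taxon,
            "percent": best_percent,
            "conflict_taxon": conflict_taxon,
            "conflict_percent": conflict_percent,
        }
    return None


# Made-up percentages: no species reaches 25%, so genus is used.
example = {
    "S": {"Escherichia coli": 20.0, "Salmonella enterica": 5.0},
    "G": {"Escherichia": 60.0, "Salmonella": 5.0},
}
print(guess_taxon(example))
```

In this made-up example the species rank is skipped because its top taxon holds only 20% of reads, so the genus rank supplies both the best-fitting taxon and the conflicting one.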
Table with columns
- sample
- assumed taxon
- best-fitting taxon
- percent of reads that match the best-fitting taxon
- rank (S is species, G is genus, F is family, ...)
- major conflicting taxon
- percent of reads supporting the major conflicting taxon
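As a rough illustration of the table layout, the sketch below writes one row per sample as tab-separated text. The column identifiers in `COLUMNS`, the function name `format_table`, and the choice of tab-separated output are hypothetical. They mirror the column list above but are not taken from the script itself.

```python
import csv
import io

# Hypothetical column identifiers, in the order listed above.
COLUMNS = [
    "sample",
    "assumed_taxon",
    "best_taxon",
    "best_percent",
    "rank",
    "conflict_taxon",
    "conflict_percent",
]


def format_table(rows):
    """Return the rows (a list of dicts keyed by COLUMNS) as tab-separated text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=COLUMNS, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    for row in rows:
        writer.writerow(row)
    return buffer.getvalue()


# Made-up row for a single sample.
print(format_table([
    {
        "sample": "sample1",
        "assumed_taxon": "Escherichia",
        "best_taxon": "Escherichia",
        "best_percent": 60.0,
        "rank": "G",
        "conflict_taxon": "Salmonella",
        "conflict_percent": 5.0,
    }
]))
```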