Skip to content

EvoArt/AssignTaxonomy.jl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AssignTaxonomy

Dev Build Status Build Status Coverage

AssignTaxonomy is an all Julia implementation of the RDP Naive Bayesian Classifier algorithm for assigning taxonomic classifications based on DNA sequences. Most users will only need to use the assign_taxonomy function on a pair of fasta files (one with target sequences, one with a reference database). However, additional functions are provided for reading in reference and target fasta files, for those who prefer to work with Julia data structures (e.g. vectors of DNA sequences, arrays of taxonomic classifications).

Results can be easily converted to DataFrame or saved to CSV

using AssignTaxonomy, CSV, DataFrames

my_results = assign_taxonomy(targets,refs)
df = DataFrame(my_results)
CSV.write("my_results.csv",my_results)

You can also store and reuse log_probabilities from the classifier. Basically, training the model of your reference data once and then re using it on new target data.

using AssignTaxonomy

my_results,my_lp = assign_taxonomy(targets,refs,keep_lp = true)
my_new_results = get_targets(some_other_target_fasta,refs,lp = my_lp)
all_my_results = AssignTaxonomy.classification_result(vcat(values(my_results),values(my_results)))

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages