Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parallelize correlation clustering #23

Open
sean-la opened this issue Feb 20, 2019 · 1 comment
Open

Parallelize correlation clustering #23

sean-la opened this issue Feb 20, 2019 · 1 comment
Labels
enhancement New feature or request

Comments

@sean-la
Copy link
Collaborator

sean-la commented Feb 20, 2019

To decrease runtime and take advantage of multicore processors, I suggest parallezing correlation clustering so that the clusterings of each genotype (e.g. MLST, CNV, SNP) all be performed at the same time.

@sean-la sean-la added the enhancement New feature or request label Feb 20, 2019
@sean-la
Copy link
Collaborator Author

sean-la commented Feb 20, 2019

This could also be extended to include the entire pipeline, i.e. parallelize the process of genotyping, distance matrix calculation, and correlation clustering over multiple CPUs. Only the consensus clustering phase cannot be parallelized.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant