HDBSCAN and KMeans API improvements for improving CPU interoperability #6181

Merged
rapids-bot merged 9 commits into rapidsai:branch-25.02 on Dec 24, 2024

Conversation

dantegd
Member

@dantegd dantegd commented Dec 13, 2024

No description provided.

copy-pr-bot bot commented Dec 13, 2024

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@github-actions github-actions bot added the Cython / Python (Cython or Python issue) label Dec 13, 2024
@dantegd dantegd changed the title from "KMeans API improvements for matching Scikit-learn API" to "Estimator API improvements for matching Scikit-learn API" Dec 15, 2024
@dantegd dantegd changed the title from "Estimator API improvements for matching Scikit-learn API" to "HDBSCAN and KMeans API improvements for improving CPU interoperability" Dec 17, 2024
@dantegd dantegd added the improvement (Improvement / enhancement to an existing function) and non-breaking (Non-breaking change) labels Dec 17, 2024
DOC comment explaining behavior
@dantegd dantegd marked this pull request as ready for review December 17, 2024 03:45
@dantegd dantegd requested a review from a team as a code owner December 17, 2024 03:45
@dantegd dantegd requested review from betatim and divyegala December 17, 2024 03:45
Comment on lines 572 to 573
self._gen_min_span_tree = \
    GlobalSettings().accelerator_active or value
Member

In general we should not set attributes to values that are different from the constructor argument.

I tried to work out why doing this increases compatibility, but couldn't: the default of gen_min_span_tree in HDBSCAN is also False. What was the thinking behind turning it on by default when the accelerator is enabled?
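
To illustrate the convention referred to above, here is a minimal sketch in plain Python. It is not cuML's code: `ACCELERATOR_ACTIVE`, `OverridingEstimator`, `FaithfulEstimator`, `build_mst_`, and the `get_params` helper are all hypothetical names, with `ACCELERATOR_ACTIVE` standing in for `GlobalSettings().accelerator_active` from the diff. It shows why an attribute that differs from its constructor argument is a problem: reading the parameters back no longer returns what the user passed.

```python
import inspect

# Hypothetical stand-in for GlobalSettings().accelerator_active in the diff.
ACCELERATOR_ACTIVE = True


class OverridingEstimator:
    """Discouraged: the stored attribute can differ from the argument."""

    def __init__(self, gen_min_span_tree=False):
        self.gen_min_span_tree = ACCELERATOR_ACTIVE or gen_min_span_tree


class FaithfulEstimator:
    """Convention: store the argument unchanged; derive values in fit()."""

    def __init__(self, gen_min_span_tree=False):
        self.gen_min_span_tree = gen_min_span_tree

    def fit(self, X):
        # Derived state gets a trailing underscore (hypothetical attribute),
        # so the user-facing parameter is never overwritten.
        self.build_mst_ = ACCELERATOR_ACTIVE or self.gen_min_span_tree
        return self


def get_params(est):
    """Read constructor parameters back by name (simplified, hypothetical helper)."""
    names = inspect.signature(type(est).__init__).parameters
    return {name: getattr(est, name) for name in names if name != "self"}


overriding = get_params(OverridingEstimator(gen_min_span_tree=False))
faithful = get_params(FaithfulEstimator(gen_min_span_tree=False))
```

With the first class the user's `False` is silently lost, so anything that rebuilds the estimator from its parameters (cloning, grid search) would see a value the user never passed. The second class round-trips the argument unchanged.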

Member Author

That was over-zealousness on my part: I wanted the MST to always be available while testing locally and forgot that the original hdbscan package has the same gen_min_span_tree parameter. I have removed this logic :)

Member Author

dantegd commented Dec 24, 2024

/merge

@rapids-bot rapids-bot bot merged commit 7731ba2 into rapidsai:branch-25.02 Dec 24, 2024
61 of 62 checks passed
Labels
Cython / Python Cython or Python issue improvement Improvement / enhancement to an existing function non-breaking Non-breaking change
3 participants