8000 HDBSCAN Ongoing Work · Issue #26801 · scikit-learn/scikit-learn · GitHub
[go: up one dir, main page]

Skip to content

HDBSCAN Ongoing Work #26801

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
3 of 13 tasks
Micky774 opened this issue Jul 7, 2023 · 3 comments
Open
3 of 13 tasks

HDBSCAN Ongoing Work #26801

Micky774 opened this issue Jul 7, 2023 · 3 comments
Labels
cython Moderate Anything that requires some knowledge of conventions and best practices module:cluster New Feature

Comments

@Micky774
Copy link
Contributor
Micky774 commented Jul 7, 2023

Introduction

This is a (hopefully) exhaustive list of ongoing/future work for HDBSCAN. These have all been discussed and are considered wanted, but some still require thorough investigation (especially heuristic evaluations).

Priority List

The higher priority items appear earlier in this list.

8000
@Micky774 Micky774 added New Feature Moderate Anything that requires some knowledge of conventions and best practices module:cluster cython labels Jul 7, 2023
@Micky774 Micky774 mentioned this issue Jul 7, 2023
13 tasks
@lorentzenchr
Copy link
Member

Has someone aligned with the devs from https://github.com/scikit-learn-contrib/hdbscan?
What is the long term path forward here? Will the contrib package be archived in the future? Will they help making the scikit-learn implementation roughly equivalent in features?

@lmcinnes friendly ping. Your opinion is highly appreciated.

@lmcinnes
Copy link
Contributor
lmcinnes commented Aug 3, 2023

The scikit-learn-contrib package is essentially in a pure maintenance mode right now; just trying to make sure it doesn't break going forward. I would hope that eventually it can just be archived. I think the main features that really needs to be in place before thinking about retiring the sklearn-contrib version is the Dual-tree Boruvka algorithm. I have offered to assist in that if needed (but @Micky774 seems to be more than capable).

@lorentzenchr
Copy link
Member

@lmcinnes Thanks. That’s good to hear. It would then be nice if the hdbscan package pointed to scikit-learn‘s hdbscan.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cython Moderate Anything that requires some knowledge of conventions and best practices module:cluster New Feature
Projects
None yet
Development

No branches or pull requests

3 participants
0