Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DensMAP #10

Open
lofcz opened this issue Nov 6, 2023 · 0 comments
Open

DensMAP #10

lofcz opened this issue Nov 6, 2023 · 0 comments

Comments

@lofcz
Copy link

lofcz commented Nov 6, 2023

The library is great (as usual with your work). I wonder whether there are plans to augment the implementation with DensMAP support to preserve local data density. This is critical if we are to use the reduced results for clusterization - with UMAP the same embedding (for example from ada-002) is placed a tad differently in the reduced dimensionality every time:

image
numberOfNeighbors = 15, K-Means++ clustering

This can be, to an extent mitigated with hyperparameter fine-tuning:
image
numberOfNeighbors = 80, DBSCAN clustering

DensMAP is an extension and if I'm not missing something, seems to be implemented only in two files:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant