I would like to know if there is any kind of research on the parameter selection (minpoints) for the HDBSCAN algorithm, in the context of having a training set with only normal samples and a test set that contains anomalies/outliers.

I want to apply the GLOSH algorithm to find the outliers but since i have a "clean" dataset available I suppose it is possible to exploit it.

I know that in the context of novelty detection exists lots of other algorithms (OC-SVM ecc...) but i would like to try solutions with HDBSCAN.

More Davide Fantini's questions See All
Similar questions and discussions