Is there any tool which calculates optimal value for minpts and eps for DBSCAN algorithm?
Currently i use sklearn library to apply DBSCAN algorithm
from sklearn.cluster import DBSCAN
I tried algorithm with several minpts and eps but without any calculation.
eps
and minpts
are both considered hyperparameters. There are no algorithms to determine the perfect values for these, given a dataset. Instead, they must be optimized largely based on the problem you are trying to solve.
Some ideas on how to optimize:
minpts
should be larger as the size of the dataset increases.
eps
is a value that deals with the radius of the clusters you are trying to find. To choose a value, we can perform a sort of elbowing technique (a similar technique that is often used to determine an optimal k
in K-Means clustering).
If there was a definite way to solve for optimal values, it would be largley documented. For now, all we can do is give our best calculated guess. Once again, the problem you are trying to solve may affect the way you choose your elbow point - it is important to understand that.