I am using carrot2's STC (Suffix Tree Clustering) algorithm for clustering a bunch of documents. By default, the max number of clusters the algorithm forms is 16. Is there a way to decide the number of clusters generated ?.
Below is the code for invoking STC clusters.
ProcessingResult byDomainClusters = controller.process(documents, null, STCClusteringAlgorithm.class);
List<Cluster> clustersByDomain = byDomainClusters.getClusters();
ConsoleFormatter.displayClusters(clustersByDomain);
To get more clusters, try increasing the values of the maxClusters and maxBaseClusters parameters. Here's how to pass parameter values in Java code.
However, the low number of clusters may also be caused by the characteristics of your input data (too few documents?). To verify this, try clustering your data with the Lingo algorithm.