carrot2

Can I use my own labels for carrot2


I am modifying the source code of carrot for a project. Acc. to the LINGO algorithm, it first generates the most probable labels and builds clusters which best suit the labels right? So, can I input my own set of labels to the carrot to see how it clusters the documents around them?


Solution

  • Unfortunately, you can't provide your own labels for clustering with Lingo.

    On the other hand, the label-to-document assignment algorithm is very simple in Lingo -- if the document contains the label's words, it will be assigned to the label. Therefore, you can achieve the same effect by, for example, indexing your documents in Lucene and then querying the index using the predefined labels you have.