nlpopennlp

OpenNLP lemmatization example


Does anyone know where I can find an example of how to use the SimpleLemmatizer class in the OpenNLP library, and where I can find a sample english dictionary? It appears to be missing from the documentation.


Solution

  • You can download Dictionary from here - en-lemmatizer.dict

    Example :

    import opennlp.tools.lemmatizer.SimpleLemmatizer;
    
    private static SimpleLemmatizer lemmatizer;
    
    private String lemmatize(String word, String postag) throws IOException {
        if (lemmatizer == null) {
            InputStream is = getClass().getResourceAsStream("/models/en-lemmatizer.dict");
            lemmatizer = new SimpleLemmatizer(is);
            is.close();
        }
        String lemma = lemmatizer.lemmatize(word, postag);
        return lemma;
    }
    

    Sample code taken from here - DocumentTaggerService