pythonnlpmatch-phrase

Extract keywords/phrases from a given short text using python and its libraries


From a user given input of job description, i need to extract the keywords or phrases, using python and its libraries. I am open for suggestions and guidance from the community of what libraries work best and if in case, its simple, please guide through.

Example of user input:

user_input = "i want a full stack developer. Specialization in python is a must".

Expected output:

keywords = ['full stack developer', 'python']


Solution

  • Well, a good keywords set is a good method. But, the key is how to build it. There are many way to do it.

    Firstly, the simplest one is searching open keywords set in the web. It's depend on your luck and your knowledge. Your keywords (likes "python, java, machine learing") are common tags in Stackoverflow, Recruitment websites. Don't break the law!

    The second one is IR(Information Extraction), it's more complex than the last one. There are many algorithms, likes "TextRank", "Entropy", "Apriori", "HMM", "Tf-IDF", "Conditional Random Fields", and so on.

    Good lucky.

    For matching keywords/phases, Trie Tree is more faster.