javamachine-learningocrhandwriting

Handwriting signature detection


I'm trying to find if a scanned pdf form contains a signature (like making sure a check is signed).

The problem domain:

I will be receiving document packages (multi page pdf's with multiple forms). I have already put together document package classifiers that will check the package for all documents and scale the images to a common size. After that I know where the signatures should be and can scan the area of the document specifically. What I'm looking for is the best approach to making sure there is a signature present. I've considered just checking for a base threshold of dark pixels but that seems so clumsy. The trouble with signatures is that they are not really writing, more of a personal mark.

The only thing I can come up with is a machine learning method to look for loopyness? But I'm not all the familiar with machine learning and don't even know where to start with something like that. Anyone with some suggestions for practical approaches would very appreciated.

I'm coding this in Java if that's helpful at all


Solution

  • What you asked was very broad so there isn't a lot of information that we can give you. However, I can point you to some helpful links: