I am currently evaluating your Cloud OCR SDK to see whether it is viable for automating AP invoice processing in our Salesforce application.
I am successfully using the processImage method to upload an image and receive the document back as XML, which gives me the coordinates and all of the text. What I would like to know is whether it is possible to build templates that 'recognise' certain fields. As you know, invoices come in all shapes and sizes, and I would like to cut down on the amount of human intervention. For example, two invoices from the same supplier have similar, but not identical, coordinates for their text fields, and where things end up on the page also depends on how many product lines are invoiced.
I looked at the processTextFields methods, but I have to supply coordinates and, as I mentioned, these aren't reliable enough to pinpoint a particular field. On top of that, it would mean creating a different template for each supplier.
So, my questions are these:
- Is it realistic to implement my own templating-type functionality? Would that generally be expected functionality?
- If so, is it possible to use some kind of tolerance for the coordinates, or do they have to be exact?
- Or is this too ambitious a goal given the methods currently available in the Cloud OCR SDK?
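
To make the tolerance idea concrete, here is a rough sketch of the kind of matching I have in mind: rather than relying on fixed coordinates, find a label word in the recognised text and take the nearest word to its right on roughly the same line. The `Word` structure, the `find_value_right_of` helper, and the tolerance values are all hypothetical. They are not part of the Cloud OCR SDK, and I am assuming the XML result has already been parsed into words with bounding boxes.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Word:
    # Hypothetical structure: one recognised word plus its bounding box,
    # assumed to have been parsed from the XML result beforehand.
    text: str
    left: int
    top: int
    right: int
    bottom: int


def find_value_right_of(words: List[Word], label: str,
                        y_tolerance: int = 10,
                        max_gap: int = 400) -> Optional[str]:
    """Return the nearest word to the right of `label` on roughly the same line.

    y_tolerance: allowed vertical offset between label and value (assumed units: pixels).
    max_gap: maximum horizontal distance from the label's right edge to the value.
    """
    anchors = [w for w in words if w.text.lower() == label.lower()]
    if not anchors:
        return None
    anchor = anchors[0]  # use the first occurrence of the label

    candidates = [
        w for w in words
        if w is not anchor
        and abs(w.top - anchor.top) <= y_tolerance
        and 0 <= w.left - anchor.right <= max_gap
    ]
    if not candidates:
        return None
    # The closest candidate horizontally is taken as the field value.
    return min(candidates, key=lambda w: w.left - anchor.right).text


# Two invoices from the same supplier: the total sits lower on the second
# page because more product lines were invoiced.
invoice_a = [Word("Total:", 100, 900, 180, 920), Word("1,250.00", 200, 902, 300, 922)]
invoice_b = [Word("Total:", 100, 1140, 180, 1160), Word("980.50", 205, 1143, 290, 1163)]

print(find_value_right_of(invoice_a, "Total:"))
print(find_value_right_of(invoice_b, "Total:"))
```

Because the lookup is relative to the label rather than to the page, the same rule would cover both invoices even though the absolute positions differ. Whether this approach is sensible, or whether the SDK offers something better, is exactly what I am asking.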