I'm trying to process a receipt and am getting very poor results on a particular image. I'm calling processImage with exportFormat set to txt, correctSkew set to false, and imageSource set to scanner. Below is the image I'm processing
and the results being returned are
As you can see, a lot of the item descriptions are missing, some of the amounts don't have values for cents, there are extra spaces in the return, among other issues. What can I do to get better results?
Please try the textExtraction profile for your scenario. This profile is suitable for extracting all text from the input image.
Note that the red oval hinders ABBYY Cloud OCR SDK to recognize accurately the text above and below the line: Age Confirmed - 12/12/1912. This is expected behavior of the program.
answered 23 Jun '15, 13:18
Oksana Serdyuk ♦♦
I tried adding profile=textExtraction and this particular receipt is getting better results. Here is what was returned:
However, other receipts are getting bad results with profile=textExtraction. For example, when I submit this image
I was getting these results (without profile=textExtraction)
but now I'm getting these results (with profile=textExtraction)
As you can see, I'm loosing the Subtotal, line item amounts (and those that are read are still incorrectly read), Total amount, etc. Is there one call to read a receipt that will work on all receipts?
answered 23 Jun '15, 19:43
As suggested in your email response (that I've attached below), I have already tried setting the profile=textExtraction with mixed results. You also state "try to find more optimal recognition settings for your kind of images", but that's what I'm asking your advise on. What would those settings be?
You also suggest using a better image quality, but I'm trying to process receipts that clients will be taking photos of with their mobile phones and then emailing to a server for processing. I believe your ABBYY FineReader 12 is a desktop application, which isn't an option since all processing is online. Is there a perimeter that can be passed to ABBYY Cloud OCR SDK making the SDK increase the image quality?
Is there any other suggestions you might have to make ABBYY Cloud OCR SDK work for me?
Thank you for your interest in our product.
We are writing to you regarding your question at ABBYY Cloud OCR SDK forum. To achieve better recognition results we could advise you to take care of the source images quality and try to find more optimal recognition settings for your kind of images. Below you can find our recommendations which you can use as a starting point.
At first, it is necessary to notice that your images have quite low resolution for recognition. Mind that the image resolution has a real impact on the OCR quality that can be achieved. We have changed resolution of your image to more optimal values using ABBYY FineReader 12: Image Editor -> the Resolution tool. Please review the OCR - Optimal Image Resolution article to know more about the recommended resolution values for OCR purposes.
Also as we have already written at our forum, it is usually recommended to use the textExtraction profile for your usage scenario. This profile is better to use for receipts processing as it provides better results both in recognition quality and in speed of processing. Morever it is suitable for extracting all text from the input image, including small text areas of low.
We have tested your images and managed to achieve quite good recognition results using our above recommendations. Please find our results in the attachment:
Folder Images consists of your original image and our images after FineReader 12 image preprocessing; Folder Results consists of two subfolders: textExtraction and documentConversion. They have our OCR results which we have got using the processImage method with corresponding profiles.
Hope the information is useful.
If you have any technical issues, please visit our Developer Forum to get fast help from ABBYY Cloud OCR SDK developers’ community. Follow us on Twitter to get the latest news.
Kind regards, Oksana Serdyuk Technical Support Engineer
The method for receipt capture is now offically released for the USA. For other countries it is still in beta. Please see more information here: http://ocrsdk.com/documentation/apireference/processReceipt/
answered 06 Sep '16, 17:41