I have several receipts that are recognized nearly perfectly, specifically ones printed on laser printers with ordinary fonts. However, one type of receipt (from an impact printer; it looks like dot-matrix output in a "System" font) is not recognized correctly. Is there any advice for getting useful output from it, even partial output, or is there a way I can talk to support?

Thanks!

You can see the image at: http://picpaste.com/pics/pfchangs-goIGHs9T.1427929919.jpg

asked 02 Apr '15, 02:38

mkommar

Please specify the product you are using. Which settings do you use for receipt processing?

(02 Apr '15, 10:53) Natalia Kara...

As far as we understand, you are using ABBYY Cloud OCR SDK. In this case, for receipt capture we recommend using the processImage method with the textExtraction profile. This profile is suitable for extracting all text from the input image, including small text areas of low quality.

Also, please pay attention to the quality of your input image. Its resolution is 96 dpi, which is quite low for recognition purposes. Image resolution has a real impact on the OCR quality that can be achieved. Moreover, the image is blurry, and its background adds a lot of noise, which lowers the recognition quality. It is very important that the image contains only the text you want to recognize. Please refer to the Best Practices section, where you can find more recommendations on how to scan and photograph documents to achieve the best recognition results.
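As a rough illustration of the resolution point: the dpi value stored in an image file is only a tag, and what matters for OCR is how many pixels actually cover the printed text. The sketch below estimates the effective resolution of a photographed receipt from its width in pixels and its physical width. The function names, the 80 mm paper width, and the 300 dpi threshold are assumptions chosen for the example, not values taken from this thread or from the SDK.

```python
MM_PER_INCH = 25.4


def effective_dpi(pixels_across: int, physical_width_mm: float) -> float:
    """Estimate pixels per inch for an object of known physical width.

    pixels_across:      how many image pixels the receipt spans (after cropping).
    physical_width_mm:  the real width of the receipt paper (an assumption
                        you have to supply, e.g. by measuring the paper).
    """
    if pixels_across <= 0 or physical_width_mm <= 0:
        raise ValueError("both arguments must be positive")
    return pixels_across / (physical_width_mm / MM_PER_INCH)


def is_probably_too_low(dpi: float, minimum_dpi: float = 300.0) -> bool:
    """Flag images below a threshold; 300 dpi is an assumed rule of thumb."""
    return dpi < minimum_dpi


# Hypothetical example: a receipt 80 mm wide that spans 800 pixels.
dpi = effective_dpi(800, 80.0)
print(round(dpi, 1), is_probably_too_low(dpi))
```

If the estimate comes out low, photographing the receipt closer up (so it fills more of the frame) raises the effective resolution more reliably than upscaling afterwards.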

We have tested your image and managed to achieve a more or less acceptable recognition result by following the recommendations above:

  • we changed the image resolution to a more suitable value (we used ABBYY FineReader 12 for image preprocessing);
  • we cropped the image so that it contains only the receipt, without the background;
  • we processed the image using the following recognition settings: ".../processImage?language=English&profile=textExtraction&imageSource=Auto&exportFormat=txt"
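The query string from the last step can be assembled programmatically, for example as in the sketch below. It only builds the request URL from the four parameters quoted above; it does not send the request or handle authentication. The service host is elided in the answer, so `base_url` must be supplied by the caller, and the host used in the example is a placeholder, not a real endpoint. The function name is likewise hypothetical.

```python
from urllib.parse import urlencode


def build_process_image_url(
    base_url: str,
    language: str = "English",
    profile: str = "textExtraction",
    image_source: str = "Auto",
    export_format: str = "txt",
) -> str:
    """Build a processImage URL with the settings quoted in the answer.

    base_url is the service root; it is not given in the thread, so the
    caller has to provide it.
    """
    params = {
        "language": language,
        "profile": profile,
        "imageSource": image_source,
        "exportFormat": export_format,
    }
    # urlencode keeps the dict's insertion order and escapes the values.
    return f"{base_url.rstrip('/')}/processImage?{urlencode(params)}"


# Placeholder host, used only to show the shape of the resulting URL.
print(build_process_image_url("https://ocr.example.invalid"))
```

Keeping the settings as named arguments makes it easy to try a different profile or export format on the same image and compare the results.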

We will send our results to you by e-mail.

answered 02 Apr '15, 14:59

Oksana Serdyuk ♦♦

Thanks! I will try the suggestions. I think there was some miscommunication about the image. If you saw a small image on a web page, click on it and you'll get the full-sized image on its own. It's a picture taken with a cellphone, about 5,312 px × 2,988 px.

That might make the request seem a little more sane!

That's the use case. I can detect the boundaries of the receipt and crop it accordingly, and perhaps even apply some perspective correction.

(10 Apr '15, 05:13) mkommar

Hi,

in BETA we have a method that extracts the data from receipts and returns it in an XML structure.

Cheers,

Rainer

answered 08 Jul '15, 16:09

rainerp

edited 14 Aug '15, 15:44

Oksana Serdyuk ♦♦

The receipt capture method is now officially released for the USA. For other countries it is still in beta. Please see more information here: http://ocrsdk.com/documentation/apireference/processReceipt/

and here

https://www.abbyy.com/receipt-capture-ocr/

answered 06 Sep '16, 17:42

rainerp

Asked: 02 Apr '15, 02:38

Seen: 2,098 times

Last updated: 06 Sep '16, 17:42

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal