We're now finding that Cloud OCR SDK returns very different results for images we have always used as test images.

For example, this image (http://i.imgur.com/6Pa3CTv.jpg), which we have used as part of our integration tests, used to return a good transcription; now it returns no content after being uploaded.

Other images are working fine, but this one in particular now appears to consistently return broken results.

asked 09 Jun '14, 17:29

Andrew M

converted to question 19 Jun '14, 16:09

Anastasia Ga... ♦♦


Most probably the issue is caused by the OCR engine update.

Overall accuracy across a large number of documents is higher with the new version; we know this because we tested the improvements on tens of thousands of real documents. However, the OCR result for a particular image can differ from the result produced by the previous version, because the OCR algorithm is not a simple one. It is a very sophisticated artificial intelligence algorithm, and even a small change to it influences the outcome.

For receipt capture we recommend using either the special API, which is currently in beta testing, or the textExtraction profile (if the text order is wrong, you can sort the words on your side using their coordinates from the XML). Unfortunately, the default (documentConversion) profile is not suitable for this task, even if it worked before: it is designed for document recognition.
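
As a rough illustration of sorting words by their coordinates on the client side, here is a minimal Python sketch. It assumes the word text and bounding boxes have already been read out of the XML result; the `Word` type, its field names, and the `tolerance` value are hypothetical and not part of the Cloud OCR SDK. Words whose vertical centers are close together are treated as one line, and each line is then ordered left to right.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    """Hypothetical container for one recognized word and its bounding box."""
    text: str
    left: int
    top: int
    right: int
    bottom: int

    @property
    def center_y(self) -> float:
        return (self.top + self.bottom) / 2

    @property
    def height(self) -> int:
        return self.bottom - self.top


def group_into_lines(words: List[Word], tolerance: float = 0.5) -> List[List[Word]]:
    """Group words into text lines, top to bottom, each line sorted left to right.

    A word joins the current line when its vertical center lies within
    `tolerance` * (its own height) of the center of the word that started
    that line. The tolerance is an assumed value to tune per image set.
    """
    lines: List[List[Word]] = []
    for word in sorted(words, key=lambda w: w.center_y):
        if lines and abs(word.center_y - lines[-1][0].center_y) <= tolerance * word.height:
            lines[-1].append(word)
        else:
            lines.append([word])
    for line in lines:
        line.sort(key=lambda w: w.left)
    return lines


def to_text(lines: List[List[Word]]) -> str:
    """Join the grouped words back into plain text, one line per row."""
    return "\n".join(" ".join(w.text for w in line) for line in lines)
```

For a receipt, `to_text(group_into_lines(words))` would give one row per printed line, with the item name before its price. Skewed or curved scans may need a smaller tolerance or a deskew step first.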


answered 21 Jun '14, 17:22

Anastasia Ga... ♦♦

It seems very strange, but I cannot reproduce what Andrew describes. I just submitted http://i.imgur.com/6Pa3CTv.jpg to http://cloud.ocrsdk.com/Demo with default settings. It took a while to recognize, but the result is good. All the text is there (except for the logo), and at first glance I can't spot any recognition mistakes.

(22 Jun '14, 22:20) Andrey Isaev ♦♦
Seen: 963 times

Last updated: 22 Jun '14, 22:20

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal