Is there anything I can do to improve the OCR quality of the Cloud SDK? The output of FineReader 10 is more consistent.

For one particular document (I can share if required), the first line OCR is different

Original: 2 . Afte r you have divided your paper into fractions, color each fraction a different color.

FineReader 10: 2. After you have divided your paper into fractions, color each fraction a different color.

Cloud SDK: 2 . A fte r you have divided your paper into fractions, color each fraction a different color.

asked 07 Nov '14, 19:25

Neil's gravatar image

Neil
113

edited 07 Nov '14, 19:26

Does not look like problems with character recognition at all, since in both cases text is the same, but spaces positioning is different. How do you get the text and what is your output format? Problems like that might appear doe to character positioning inside PDF, and Adobe may "invent" spaces sometime where they are not present.

(07 Nov '14, 22:14) Andrey Isaev ♦♦

Output for Cloud SDK is PDFSearchable and use the processImage Method. PDF is obtained using CamScanner for Android.

(10 Nov '14, 15:26) Neil

Please provide us with the document you process for investigation - send it to cloudocrsdk@abbyy.com.

(13 Nov '14, 08:33) olga_parmenova ♦♦
Be the first one to answer this question!
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×28

Asked: 07 Nov '14, 19:25

Seen: 1,859 times

Last updated: 13 Nov '14, 08:33

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal