Hello,

We extracting text from scanned documents with parameters: language-english, profile-textExtraction, imageSource-scanner, correctSkew-true, exportFormat-pdf, pdf:writeTags-yes.

The quality of image to be processed is good.

The result is basically very good, but on some parts of a numbers we have some OCR errors. For example when on page we have 41,917.94 the result is -41/9V7.94, for me it is very strange result.

alt text

I have sent just right now to the ocrsdk e-mail the files with errors for att. of Anastasia Galimova.

Can I have some feedback from ABBYY to resolve this problem?

Many thanks, Vitalie

asked 09 Sep '14, 23:07

Vitalie's gravatar image

Vitalie
451214


Please try to use profile=documentConversion.

link

answered 16 Sep '14, 18:29

Oksana%20Serdyuk's gravatar image

Oksana Serdyuk ♦♦
1.5k16

Strange, but Demo page shows correct results on numbers and just one mistake on text. Probably you shuld play arround with settings and chose most optimal ones.

alt text

link

answered 10 Sep '14, 10:00

Andrey%20Isaev's gravatar image

Andrey Isaev ♦♦
2835

Hello Adrey,

I tried demo page with the same settings (English, Text extraction, Scanner) it woks fine on first two strings, but also fails on the last two strings of numbers. Please see attached image. It is very-very strange for me. I think there is something to examine. How can I improve text extraction if even on the Demo page I have errors on ocr?

alt text

alt text

(10 Sep '14, 10:53) Vitalie

Thank you, Vitalie, that's different story. Now, when we can reproduce this, support will investigate it.

(10 Sep '14, 11:15) Andrey Isaev ♦♦

Thank you, Andrey, I will waiting for the result of investigation.

(10 Sep '14, 11:24) Vitalie
Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×102
×59
×6

Asked: 09 Sep '14, 23:07

Seen: 1,314 times

Last updated: 16 Sep '14, 18:29

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal