Is it possible to achieve the text quality and coverage (i.e., including smaller text or low quality areas of the image) of the textExtraction profile, while retaining the document structure information of the documentConversion profile? If not possible through the cloud API, then would it be possible via direct usage FineReader (or similar)?

asked 28 Jan '15, 19:59

jsack's gravatar image

jsack
112

edited 23 Mar '15, 12:42

Oksana%20Serdyuk's gravatar image

Oksana Serdyuk ♦♦
1.5k16


Sorry for the delay in response.

Recently ABBYY Cloud OCR SDK team has improved technology of TXT export used in our service. Now the text export format simulates original layout of a source document with the help of inserted spaces and empty lines. The new TXT export is available by default if your application uses the exportFormat=txt option.

The old text export format is available, too. It can be used by setting the exportFormat option to txtUnstructured. If this option value is selected, OCR results will be saved in the resulting text file in the same order as they are recognized, i.e. block by block.

ABBYY Cloud OCR SDK documentation will be updated accordingly.

link
This answer is marked "community wiki".

answered 23 Mar '15, 12:27

Oksana%20Serdyuk's gravatar image

Oksana Serdyuk ♦♦
1.5k16

Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×160
×11
×7
×2

Asked: 28 Jan '15, 19:59

Seen: 1,571 times

Last updated: 23 Mar '15, 12:42

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal