Hello

We need to optionally correct the orientation of the document (typically a simple business card supplied as a BITMAP which we add to the document) and then extract the text from one or more regions. Speed of execution is critical.

I can get it to run fast without document orientation correction.

First analyse each region then do a document recognise.

This works a treat and is fast once I turn off table detection etc. Okay, now for correcting the document orientation. The only way I have found is to do a PreprocessAnalyzeRecognize on the first page (we only have one page).

There are three problems with the above. Firstly it is a bit slow, ideally we need it to be at least twice as fast. Secondly it extracts all text from the document and not just text in our regions as confirmed when I export the document to an external XML file. Thirdly I cannot extract text from the desired regions. For example, when the card has two lines containing "2 Oak Lane" followed by "Fax: 0123456789", both aligned horizontally, and I specify a rectangle that selected the "2 Oak Lane", the ABBYY reader thinks the two lines are one line - "2 Oak Lane Fax: 0123456789" - so I cannot extract the "2 Oak Lane". I know that the line overlaps my region, but I do not know how many characters lie in my region.

I tried modifying the first block of code by first preprocessing the document, with the CorrectOrientation flag set to true, but that did not work.

Incidentally I tried adding code in this message but the firewall blocked me!

Help would be appreciated. Thanks in advance.

Leif

asked 21 Oct '16, 13:14

Leif's gravatar image

Leif
1315

Hi Leif,

Sorry for some silence from our side.

The situation should be tested, so please send the following information to your region Technical Support Team (TechSupport_eu@abbyy.com):

1) your serial number of the program,

2) the used code sample,

3) some image samples.

(24 Oct '16, 12:51) Oksana Serdyuk ♦♦
Be the first one to answer this question!
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×7
×5

Asked: 21 Oct '16, 13:14

Seen: 772 times

Last updated: 24 Oct '16, 12:51

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal