Dear all,

We have a complex problem without a good solution. We have now ABBYY Flexicapture version desktop.

We have many invoices in Georgian language. As ABBYYY don't support Georgian as recognition language. We don't have any possibility to extract text.

But, can we still extract numeric information ? We can define some regular expression on Character String to find out singleton elements, but how can we extract rows in tables ? Remark that some pages, Pre-Recognize can't detect separator lines between rows and columns (green lines in capture).

alt text

We think about using lower level ABBYY FineReader API to retrieve information about detected occurrences with information on positions (X,Y) pixels, then we can associate items on same row by considering approximation on X values.

Do you have any better idea ?

If we use ABBYYY FineReader API, which programming language and how can we process ? We prefer Java as mention in this page: https://abbyy.technology/en:kb:code-sample:java-wrapper

But, we don't find any jar files in current installation. Should we install more an other component ?

Thanks for your help.

This question is marked "community wiki".

asked 12 May '16, 10:29

thaichat04's gravatar image

thaichat04
135

edited 12 May '16, 10:30


From your usage description you can try either our offline solution (ABBYY FineReader Engine) or our online service (ABBYY Cloud OCR SDK). In both cases you can get the recognition result with the characters coordinates.

As for using with Java, the current version of FineReader Engine should include the com.abbyy.FREngine.jar file in the distribution package. And Cloud OCR SDK, as an online service, can be used in any environment supporting communication over the network.

Please contact your region sales to clarify more details about these products. They will help you to make a choice between them and provide with a trial license if you want.

link

answered 12 May '16, 15:03

Oksana%20Serdyuk's gravatar image

Oksana Serdyuk ♦♦
1.5k16

Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×17
×1

Asked: 12 May '16, 10:29

Seen: 608 times

Last updated: 12 May '16, 15:03

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal