I have images containing this kind of content:

long_text_description_1     number1a        number1b        number1c
long_text_description_2     number2a        number2b        number2c
long_text_description_3                                     number3c
long_text_description_4     number4a        number4b        number4c
...

and I would like to get the recognised output in the same order. Instead the API gives me a TXT file with column-based results, something like:

long_text_description_1
long_text_description_2
long_text_description_3
long_text_description_4
number1a
number2a
number4a
number1b
number2b
number4b
number1c
number2c
number3c
number4c

which make me loose all the correspondence/match between texts and numbers. I can't know any more which number belong to which text description

I just want to read line by line, what is the easiest way to achieve this?

My problem occurs when using TXT as output. When I asks for XLSX, if i open the resulting file, row and columns information are there but xlsx is not a format that a programmatic algorithm can use easily. CSV would be good but does not seem to be available.

So, what is the best way to get the plain text from my image, in a line by line fashion?

Thanks in advance.

asked 14 Oct '13, 16:25

MikaelW's gravatar image

MikaelW
11


You could perform export to XML and process words with its coordinates on your side. The region parameter of the processTextField method could be set to the coordinates corresponded to the each line.

link

answered 21 Oct '13, 10:47

SDK_support's gravatar image

SDK_support ♦♦
2763

also using processtextfield instead doesn't work since it seems to be limited to 200 characters per image. Unless that limit can be changed somewhere?

(14 Oct '13, 18:46) MikaelW

Hi any developments on this? XLSX provides excellent results but csv or json is essential at this point.

(12 Feb '14, 11:17) mikehal

We recommend either to use export to XML or to wait about a month for the beta-testing of receipts recognition feature.

More detailes are in a previous thread.

link

answered 12 Feb '14, 19:42

Anastasia%20Galimova's gravatar image

Anastasia Ga... ♦♦
790112

Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×160
×11
×7

Asked: 14 Oct '13, 16:25

Seen: 1,640 times

Last updated: 12 Feb '14, 19:46

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal