Hello, i want extract data from xml and save it to java object. I was tried by coordinates(left, top, right, bottom), but it's not reliably, because documnet scanned and not guarantees that documents scanned exactly like others(difference in coordinates large).

I marked text on the pdf document below, what i want get and save it to object.

I run TestApp from sample in sdk for java:

java TestApp recognize test.pdf result.xml --lang=russian

Sample document: test.pdf

https://www.dropbox.com/s/girz3it2ntt10fm/test.pdf?dl=0

After recognize: result.xml

https://www.dropbox.com/s/iofy6i4xjesrsyj/result.xml?dl=0

How i can achieve to do? Thanks!

asked 16 Jul '16, 14:53

Alisher's gravatar image

Alisher
112

edited 16 Jul '16, 15:04


Unfortunately, ABBYY hasn't got a ready code sample for such kind of conversion. But it seems the Java community has a solution for XML to Java Objects conversion. In the Internet you can find the following articles:

link

answered 18 Jul '16, 17:26

Oksana%20Serdyuk's gravatar image

Oksana Serdyuk ♦♦
1.4k16

а можно на русском?

(18 Jul '16, 17:41) Alisher

You can write us to CloudOCRSDK@abbyy.com in Russian.

(18 Jul '16, 17:47) Oksana Serdyuk ♦♦

I think that you dont fully understand me, my question is how i can get information(for example: phone number of employee or email and etc) from generated xml by ocr sdk. I dont need full parsed infromation, just piece of information.I wrote to CloudOCRSDK@abbyy.com, i will wait for a response.Thanks!

p.s: I can't use JAXB because xml not structured.

(18 Jul '16, 18:26) Alisher
Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×49
×46
×18

Asked: 16 Jul '16, 14:53

Seen: 488 times

Last updated: 18 Jul '16, 18:30

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal