Hi there.

I'm using the wrapper created by this guy: http://forum.ocrsdk.com/questions/1458/python-wrapper-to-abbyy-cloudocr and the reason why is because your python on github is for Python2.7 and I'm using Python3.5.

Anyhow. I just want to grab the text and barcodes and get it back in XML. I keep getting back (even though I change my profiles) a full XML with stuff I don't need or want.

Do I have to parse through all the blocks, par, line, charParams?

Isn't there just a XML format like: < text> The OCR read text < /text> < barcode> value of a barcode < /barcode>?

I thought by changing my profile to documentArchiving or textExtraction it would give me something like that.

I don't care about the structure of the document. I just want ALL the text it can find and potentially any barcodes.

Thanks, Marcus

asked 22 Nov '16, 13:29

MarGul's gravatar image

MarGul
134

edited 22 Nov '16, 13:57


As I have already answered by email, ABBYY Cloud OCR SDK supports only this XML export format, and at the moment there are no any plans to add a new variant of XML export. You can create your own file in the needed format using our XML output.

link

answered 24 Nov '16, 15:00

Oksana%20Serdyuk's gravatar image

Oksana Serdyuk ♦♦
1.5k16

Thanks for the answer Oksana. I have parsed your XML output to my own now and my biggest concern was that you will change the current XML format you have to something else. Because if you would (say remove the <par> tags) my parser will break.

(25 Nov '16, 10:38) MarGul
Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×9

Asked: 22 Nov '16, 13:29

Seen: 255 times

Last updated: 25 Nov '16, 10:38

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal