I'm trying to recognise semantic information about text regions e.g. headers, footers, page numbers, captions etc. I cannot find any information about how to achieve this in the documentation. Please can you provide a python code example?

I am using the supplied python example here: http://ocrsdk.com/documentation/code-samples/. In order to get this Python code to work, I had to change line 90 of AbbyyOnlineSdk.py from

return {"Authorization": "Basic %s" % base64.encodestring( ...

to

return {"Authorization": "Basic %s" % base64.b64encode( ...

I'm currently just executing this call:

python process.py infile.png outfile.xml -l English -xml

Edit


This issue is now fixed on GitHub: https://github.com/abbyysdk/ocrsdk.com/issues/27

asked 18 Jul '15, 17:53

kungfujam's gravatar image

kungfujam
134

edited 22 Jul '15, 19:50


This feature is not supported in ABBYY Cloud OCR SDK. But you can try to use another our SDK for your needs, namely ABBYY FineReader Engine. You can access the logical structure of a document formatting attributes, including headers, footers, page numbers, fonts, styles etc., via the IFRDocument::DocumentStructure and IFRPage::PageStructure properties. Please find more information in Help → Guided Tour → Advanced Techniques → Working with the Logical Structure of a Document.

For trial version of FineReader Engine, please fill the form at our site.

link

answered 28 Jul '15, 19:00

Oksana%20Serdyuk's gravatar image

Oksana Serdyuk ♦♦
1.5k16

Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×160
×11
×2

Asked: 18 Jul '15, 17:53

Seen: 2,460 times

Last updated: 28 Jul '15, 19:00

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal