How can I export a PDF to ALTO XML format (XCA_Basic, XCA_Extended)? I am using the submitImage/processDocument methods to perform recognition.

asked 07 Feb '16, 10:58

pprashant9490


ABBYY Cloud OCR SDK supports the following export formats that should be useful for you:

  • alto. To get the output in this format, set the exportFormat parameter of the processDocument method to alto.

  • xml and xmlForCorrectedImage (the same as xml, except that all coordinates written to the output XML file relate to the corrected (deskewed, rotated, etc.) image rather than the original). This format is described by the following XML schema.

Two XML results are available in this case: a basic variant and an extended one, which also contains the character recognition variants. Use the xml:writeRecognitionVariants parameter of the processDocument method to choose between them: false gives the basic XML, true gives the extended XML.
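As a rough illustration of how these parameters fit together, here is a minimal Python sketch that only builds the request URL for processDocument. The base URL, the helper name process_document_url, and the placeholder task ID are assumptions made for this example; the parameter names exportFormat and xml:writeRecognitionVariants are the ones mentioned above. Authentication and the actual HTTP call are left out on purpose.

```python
from urllib.parse import urlencode

# Assumed base URL for illustration; use the endpoint given for your application.
BASE_URL = "https://cloud.ocrsdk.com"


def process_document_url(task_id, export_format, write_variants=None):
    """Build a processDocument URL (hypothetical helper, not part of the SDK).

    task_id        -- ID of the task returned by an earlier submitImage call
    export_format  -- "alto", "xml" or "xmlForCorrectedImage"
    write_variants -- None to omit the parameter, False for basic XML,
                      True for extended XML with recognition variants
    """
    params = {"taskId": task_id, "exportFormat": export_format}
    if write_variants is not None:
        # Only meaningful for the xml / xmlForCorrectedImage formats.
        params["xml:writeRecognitionVariants"] = "true" if write_variants else "false"
    # safe=":" keeps the colon in the parameter name readable in the URL.
    return f"{BASE_URL}/processDocument?{urlencode(params, safe=':')}"


# ALTO output
alto_url = process_document_url("your-task-id", "alto")

# Extended XML (with recognition variants)
extended_url = process_document_url("your-task-id", "xml", write_variants=True)
```

The resulting URL would then be requested with your application credentials, and the task polled until the result is ready for download.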


answered 09 Feb '16, 12:41

Oksana Serdyuk ♦♦
