I have a PDF document where every page has the same template. I need to recognize a single field from each page.

Do I need to process the whole page or textfield recognition has the possibility to work with multipage?

asked 13 Jul '15, 12:34

danyolgiax's gravatar image


For your scenario you can use the processFields method. It allows to specify the coordinates of each field in an XML file for each page, for example:

<?xml version="1.0" encoding="utf-8"?>
<document xmlns="http://ocrsdk.com/schema/taskDescription-1.0.xsd" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://ocrsdk.com/schema/taskDescription-1.0.xsd http://ocrsdk.com/schema/taskDescription-1.0.xsd">
  <fieldTemplates />
  <page applyTo="0,1">
    <text id="Field1" left="395" top="105" right="1047" bottom="157">
  <page applyTo="2">
  <page applyTo="N">

answered 14 Jul '15, 14:55

Oksana%20Serdyuk's gravatar image

Oksana Serdyuk ♦♦

I considered it but I don't know how many pages the document has. Do I need to dinamically generate configuration XML file reading total page numbre from pdf document?

(14 Jul '15, 15:08) danyolgiax

Yes, you should do so because the "applyTo" attribute is mandatory for the "page" element name.

(14 Jul '15, 15:29) Oksana Serdyuk ♦♦
Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here



Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported



Asked: 13 Jul '15, 12:34

Seen: 855 times

Last updated: 14 Jul '15, 15:29

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal