PDF coordinates and pages

  • 4.7K Views
  • Last Post 29 June 2012
greatrat00 posted this 27 June 2012

Hi,

I'm working with the processFields method, and everything works fine with JPG (coordinate-wise). However, JPGs don't handle multiple pages - so I'd like to use PDF. However, I'm feeding the same coordinates I fed the JPG version, but it doesn't work - values come back empty in the output XML. Is there something special that I have to do to get PDF coordinates since PDF is sort of vectorish and it will depend on the dpi your engine is using?

What do you recommend?

Thanks, Adam

  • Liked by
  • danyolgiax
Order By: Standard | Newest | Votes
Andrey Isaev posted this 27 June 2012

Please provide sample of JPG and PDF files

Dmitry Me posted this 27 June 2012

Could you please provide the fields settings XML and the request parameters (language, etc) too?

greatrat00 posted this 27 June 2012

Field settings XML, parameters are in there:

http://wevito.com/ocr/ocrparams2.xml

Thanks, Adam

Dmitry Me posted this 28 June 2012

In cases when actual resolution is not defined it is internally set to 300 dpi. This is exactly why you have no data extracted - the JPEG file has resolution set to 150 dpi and so coordinates in respect to the JPEG file are completely irrelevant to the PDF file.

  • Liked by
  • danyolgiax
Andrey Isaev posted this 29 June 2012

Then solution would be fixing the problem with software that was used to create PDF files so it will set correct DPI settings from JPEG files.

Close