Pdf, strange spacing

  • Last Post 21 June 2013
  • Topic Is Solved
Stephen posted this 14 June 2013

Hey guys.

When i download .txt file, ocr is almost perfect.

When i download pdfSearchable, in a lot of cases, i get weird spacing between letters. Can this be somehow fixed?

Please note that when i use pdfTextAndImages, text is perfect(no photos, background ..etc)

Thanks !

  • Liked by
  • Vitalie
  • Katia Sirotina
Order By: Standard | Newest | Votes
Anastasia Galimova posted this 14 June 2013

Hi Stephen.

It's a known issue, it was already described here.

In FineReader Engine it is solved with a special option "Tagged PDF". This option is not yet implemented in Cloud OCR SDK. Probably it will be implemented soon - I'm consulting with the developers now and will inform you as soon as I have additional information.

  • Liked by
  • Stephen
  • Katia Sirotina
Vitalie posted this 21 June 2013

I`m also interested in this implementation. Thank you, Anastasia.

SDK_support posted this 25 July 2013

We would like to inform that the special option "Tagged PDF" has been already implemented in Cloud OCR SDK. The processImage and processImage methods have new bool parameter "pdf:writeTags".

By default this parameter is set to autodetect value (auto). The possible values are:

• write

• dontWrite

We hope it will be useful.

Vitalie posted this 14 August 2013

Unfortunately I steel have same problem even using write/dontWrite parameter. Can you please see this post?