Having looked at PDF-Text Extraction from Text Layer, I can see it's possible to get the underlying text of a PDF document from FineReader Engine 10. Is this possible via ABBYY Cloud OCR SDK at all?
PDF Text Extraction from Text Layer via OCR SDK
- 2.4K Views
- Last Post 28 August 2013
Unfortunately, this feature is not implemented in ABBYY Cloud OCR SDK.
If you want to extract the text layer, you can use a PDF lib like Poppler or PDFMiner.
All the best,
Sadly, I'm really interested in automatic layout detection, which is scarce.
What do you mean by "automatic layout detection". Could you give an example/more detail?
1309 questions, 4296 answers.