OCR Stops at a Blank Page

  • 626 Views
  • Last Post 25 May 2016
Nir posted this 18 May 2016

When matching PDFs with a blank page in the middle - in FlexiLayout Studio it works fine, but when using the FC engine it is running until the blank page and ignores everything below it.

proccesor.SetDeleteBlankPages(true); didn't help.

Any suggestions?

Order By: Standard | Newest | Votes
Ilya Sukhorukov posted this 19 May 2016

We would like to suggest you the following workaround to remove blank pages: to use IImageProcessingTools::DetectEmpty and delete empty pages before adding them to the processor object or just do not add such a page. Please be aware, that this method can be not useful if the page is not completely empty and there are some garbage on it. You can find the example of using the IImageProcessingTools in the standard FCE sample "Color Stamps Removal (C#)". Just for example:

IImageProcessingTools tools = engine.CreateImageProcessingTools();

IImageFile imageFile = tools.OpenImageFile(@"Path to your pdf");

for (int i = 0; i < imageFile.PagesCount;i++ )

{

IImage srcImage = imageFile.OpenImagePage(i);

bool isEmpty = tools.DetectEmpty(srcImage);

if (!isEmpty)

{

processor.AddImage(srcImage);

}

}

Please try this and in case you need any additional help please send to SDK_Support@abbyy.com your *.fcproj project (with all used templates), input PDF files and serial number for further investigations?

Nir posted this 25 May 2016

Hey, thanks for the answer. I've managed to find 2 much simpler solutions: one is to set a minimum number of pages in the project properties that is higher than where there's usually a blank page. The second is to add header and footer logic.

  • Liked by
  • angusto
Close