I have a scenario where I need to OCR a file that has more than 150 pages. To speed up the process, is there a way to split the pages of a single file and process them in multiple threads?

asked 15 Jun '16, 08:57

nayeemkhan

In your case, we suggest using parallel processing. It lets you recognize the pages of a document in parallel and so reduces the overall processing time. For a detailed description of the ways to implement parallel processing in FineReader Engine 11, see the Developer’s Help file, article Guided Tour→Advanced Techniques→Parallel Processing.

You could also take a look at the MultiProcessingRecognition demo tool for an example of parallel processing in FREngine. This demo tool can be found at:

  • %ALLUSERSPROFILE%\Application Data\ABBYY\SDK\10\FineReader Engine\Samples\DemoTools — for Windows XP, Windows Server 2003;
  • %ProgramData%\ABBYY\SDK\10\FineReader Engine\Samples\DemoTools — for Windows Vista, Windows Server 2008, Windows 7, Windows 8, Windows Server 2012.

answered 15 Jun '16, 15:22

Oksana Serdyuk ♦♦

In our scenario we are processing a single file with multiple pages (around 150). Is there any way to split the pages into batches of 20, run OCR on each batch, and then merge the results?

(16 Jun '16, 10:27) nayeemkhan

The algorithm you describe is performed automatically when multiprocessing is used. As the Help file recommends, use the FRDocument object for parallel processing of multi-page documents. It is the easiest multiprocessing approach to code, because you do not have to implement any additional interfaces. For usage details, see the 'Processing with FRDocument object' section of the article Guided Tour→Advanced Techniques→Parallel Processing.

(17 Jun '16, 13:33) Oksana Serdyuk ♦♦

Also note that to use multiprocessing, your license must allow at least 2 CPU cores (see the Productivity property → CPU cores).

(17 Jun '16, 13:33) Oksana Serdyuk ♦♦
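
For readers who want to see the split/process/merge pattern discussed in this thread, here is a minimal, generic sketch in Python. It does not use the FineReader Engine API and is not how FRDocument implements multiprocessing internally. `ocr_page` is a hypothetical placeholder for a real per-page recognition call, and `make_batches`, `ocr_batch`, and `ocr_document` are illustrative names. The batch size of 20 comes from the question, and the default of 2 workers mirrors the minimum core count mentioned above.

```python
from concurrent.futures import ThreadPoolExecutor


def make_batches(page_count, batch_size=20):
    """Split page indices 0..page_count-1 into consecutive batches."""
    return [
        list(range(start, min(start + batch_size, page_count)))
        for start in range(0, page_count, batch_size)
    ]


def ocr_page(page_index):
    """Hypothetical placeholder: a real OCR call for one page would go here."""
    return f"text of page {page_index + 1}"


def ocr_batch(batch):
    """Recognize every page in one batch, keeping the page order."""
    return [ocr_page(page_index) for page_index in batch]


def ocr_document(page_count, batch_size=20, workers=2):
    """Process batches in parallel and merge results in original page order."""
    batches = make_batches(page_count, batch_size)
    # Threads keep the sketch simple; a CPU-bound engine would more likely
    # need separate worker processes to get a real speedup.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map yields results in the order the batches were submitted,
        # so merging is a plain concatenation.
        results = pool.map(ocr_batch, batches)
        return [text for batch_result in results for text in batch_result]
```

Calling `ocr_document(150)` splits the pages into 8 batches (seven of 20 pages and one of 10) and returns one result per page in the original order. With FRDocument this bookkeeping should not be needed, since the answer above says the engine does it automatically.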
Asked: 15 Jun '16, 08:57

Seen: 518 times

Last updated: 17 Jun '16, 13:33

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal