We are using ABBYY FineReader Engine 11 in our Java application. We want to restrict OCR to specific languages only; for example, we want to process only English and German documents. Can we do that? If so, could you please give us an example?
How can we restrict document processing to specific languages, or support only specific languages for processing?
- Last Post 01 May 2015
You can do something like this to tell the engine that you are processing documents in English and German:
IDocumentProcessingParams docProcessingParams = engine.CreateDocumentProcessingParams();
IRecognizerParams recognizerParams = docProcessingParams.getPageProcessingParams().getRecognizerParams();
ILanguageDatabase languageDatabase = engine.CreateLanguageDatabase();
ITextLanguage textLanguage = languageDatabase.CreateCompoundTextLanguage("English,German");
recognizerParams.setTextLanguage(textLanguage);
...
document.Process(docProcessingParams);
You can also call the SetPredefinedTextLanguage method with a parameter that contains several language names separated by commas, for example "English,German". The program will automatically select the language of the document from the specified set. Note, however, that recognition quality is generally better when fewer languages are specified.
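If the list of languages comes from configuration or user input, it may help to check it against an allow-list before building the comma-separated string. The following is only a minimal sketch in plain Java: the LanguageSet class, its ALLOWED set, and the toParameter helper are hypothetical names and not part of the FineReader Engine API. The engine call is shown only as a comment, and it assumes that SetPredefinedTextLanguage is called on the recognizerParams object from the snippet above.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

public class LanguageSet {
    // Hypothetical allow-list: the only languages this application accepts.
    private static final Set<String> ALLOWED = Set.of("English", "German");

    // Builds the comma-separated value (e.g. "English,German"),
    // rejecting any name outside the allow-list and skipping duplicates.
    public static String toParameter(List<String> requested) {
        List<String> accepted = new ArrayList<>();
        for (String name : requested) {
            String trimmed = name.trim();
            if (!ALLOWED.contains(trimmed)) {
                throw new IllegalArgumentException("Unsupported language: " + trimmed);
            }
            if (!accepted.contains(trimmed)) {
                accepted.add(trimmed);
            }
        }
        if (accepted.isEmpty()) {
            throw new IllegalArgumentException("No languages requested");
        }
        return String.join(",", accepted);
    }

    public static void main(String[] args) {
        String parameter = toParameter(List.of("English", "German"));
        System.out.println(parameter); // prints "English,German"
        // Assumption: in the application the value would then be handed to the engine, e.g.
        // recognizerParams.SetPredefinedTextLanguage(parameter);
    }
}
```

Keeping the allow-list small also fits the advice above, since fewer languages generally give better recognition quality.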