Cannot distinguish between a dash and a minus sign

  • Last Post 12 September 2014
  • Topic Is Solved
manishyadav posted this 10 September 2014

I noticed an issue with the Android demo: it cannot distinguish between a dash and a minus sign.

Image I used:

applicationId = "snapchat"

String language = "English"; // Comma-separated list: Japanese,English or German,French,Spanish etc.
ProcessingSettings processingSettings = new ProcessingSettings();
processingSettings.setOutputFormat(ProcessingSettings.OutputFormat.txt);

Oksana Serdyuk posted this 12 September 2014

To investigate the issue, please send us the following information:

  1. the image you recognize,
  2. the recognition settings you use,
  3. your Application ID.

manishyadav posted this 12 September 2014

Please check my question again. I have updated it with all the details.

Oksana Serdyuk posted this 16 September 2014

The hyphen is recognized as the character "Hyphen-minus" (U+002D), because this is the character that represents a hyphen in the ABBYY predefined languages. As we understand it, you need the hyphen to be recognized as "En dash" (U+2013) or "Em dash" (U+2014). These symbols are not included in the alphabets of the predefined languages, so this is not possible in Cloud OCR SDK. However, you can try another ABBYY product, ABBYY FineReader Engine. In FREngine you can create a user-defined language and add the necessary characters to it.
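
To check which dash-like characters actually appear in recognized text, something like the following could be used. This is only a minimal sketch and not part of the Cloud OCR SDK: the class name `DashInspector` and its methods are hypothetical, and the sample string stands in for real OCR output. It relies only on standard Java calls (`String.codePoints`, `Character.getName`). The minus sign (U+2212) is included as an assumption, since the original question mentions it.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of any ABBYY SDK.
public class DashInspector {
    // Code points from this thread (U+002D, U+2013, U+2014),
    // plus the true minus sign U+2212 (an assumption, see above).
    private static final int[] DASH_LIKE = {0x002D, 0x2013, 0x2014, 0x2212};

    static boolean isDashLike(int codePoint) {
        for (int candidate : DASH_LIKE) {
            if (candidate == codePoint) {
                return true;
            }
        }
        return false;
    }

    // Counts each dash-like code point in the text,
    // keyed by "U+XXXX UNICODE NAME" in order of first appearance.
    static Map<String, Integer> countDashes(String text) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        text.codePoints()
            .filter(DashInspector::isDashLike)
            .forEach(cp -> {
                String key = String.format("U+%04X %s", cp, Character.getName(cp));
                counts.merge(key, 1, Integer::sum);
            });
        return counts;
    }

    public static void main(String[] args) {
        // Stand-in for text returned by the recognition service.
        String recognized = "5 - 3 = 2 and a well-known rule";
        System.out.println(countDashes(recognized));
    }
}
```

If the result only ever contains U+002D, that matches the behaviour described above: the predefined languages have no other dash characters in their alphabets.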