Chinese mixed-character recognition

  • Last Post 06 June 2013
Upen posted this 06 June 2013


We are working on a product that supports the following 17 languages:

English, French (France), German (Germany), Japanese, Swedish (Sweden), Danish, Finnish, Russian, Dutch (Nederlands), Italian (Italy), Chinese (Simplified), Spanish (Spain), Polish, Portuguese (Brazil), Czech, Hebrew, Turkish

I am planning to automate L10N testing of the application by running OCR on its controls at run time. My approach is to take snapshots of the controls and areas that are relevant for testing, use OCR to extract the text that is actually displayed, and match it against the expected text. A few more things also need to be validated:

  1. Truncation of String
  2. Overlapping
  3. Formatting Issues
  4. Unwanted English characters
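The text-based part of these checks can be sketched independently of any OCR engine. The snippet below is a minimal sketch in Python, assuming the OCR result for a control is already available as a plain string; `check_label` and its helpers are hypothetical names, not part of any SDK. It only covers exact match, a rough truncation heuristic, and unexpected Latin letters. Overlapping and formatting issues cannot be inferred from recognized text alone and would need image or layout information.

```python
import unicodedata


def normalize(text):
    """Fold full-width forms and ellipsis characters (NFKC), collapse whitespace."""
    return " ".join(unicodedata.normalize("NFKC", text).split())


def is_latin_letter(ch):
    """True for alphabetic characters whose Unicode name marks them as Latin."""
    return ch.isalpha() and "LATIN" in unicodedata.name(ch, "")


def check_label(expected, actual):
    """Compare OCR output with the expected localized string.

    Returns a list of issue descriptions; an empty list means no issue found.
    """
    expected, actual = normalize(expected), normalize(actual)
    issues = []
    if actual != expected:
        issues.append("mismatch")
        # Heuristic: the displayed text is a proper prefix of the expected
        # text, optionally followed by trailing dots (an ellipsis).
        core = actual.rstrip(". ")
        if core and len(core) < len(expected) and expected.startswith(core):
            issues.append("possible truncation")
    # Latin letters that appear on screen but not in the expected string.
    expected_latin = {c for c in expected if is_latin_letter(c)}
    stray = sorted({c for c in actual if is_latin_letter(c)} - expected_latin)
    if stray:
        issues.append("unexpected Latin letters: " + "".join(stray))
    return issues


if __name__ == "__main__":
    print(check_label("保存文件", "保存文件"))
    print(check_label("保存文件", "保存..."))
    print(check_label("打开", "打开 Open"))
```

Because OCR output is rarely perfect, a real harness would probably replace the exact comparison with a similarity threshold; that choice is left open here.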

Could you tell me whether this approach is fully (100%) achievable with ABBYY's OCR SDK?

I am currently evaluating OCR tools. While exploring your tool, I found some issues with recognizing Chinese characters that have English hot keys alongside them.

I have attached a snapshot of one such case. Could you let me know how I can read these mixed characters?
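To make the mixed case concrete on the test side: a label such as 文件(F) combines a Chinese caption with a Latin hot key in parentheses. The sketch below, in Python, splits such a label into its two parts so each can be compared on its own, which helps show whether the Chinese caption or the hot key was misrecognized. The function name `split_hotkey` and the assumed label layout (caption, then one letter or digit in parentheses, optionally followed by an ellipsis) are illustrative assumptions and say nothing about how the OCR engine itself should be configured.

```python
import re
import unicodedata

# Assumed layout: caption, optional space, "(X)" or "(&X)", optional ellipsis.
_HOTKEY = re.compile(r"^(?P<base>.*?)\s*\(&?(?P<key>[A-Za-z0-9])\)\s*(?:\.\.\.)?$")


def split_hotkey(label):
    """Return (caption, hot key) for a label; hot key is None if absent.

    NFKC normalization folds full-width parentheses and letters, which OCR
    engines may produce next to CJK text, into their ASCII equivalents.
    """
    text = unicodedata.normalize("NFKC", label).strip()
    match = _HOTKEY.match(text)
    if match is None:
        return text, None
    return match.group("base"), match.group("key").upper()


if __name__ == "__main__":
    print(split_hotkey("文件(F)"))
    print(split_hotkey("文件（Ｆ）"))
    print(split_hotkey("打开"))
```

Comparing the caption and the hot key separately keeps a single misread Latin letter from hiding whether the Chinese part was recognized correctly.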

Thanks!! Upendra Patel Pitney Bowes Software Mob +919999796447

Andrey Isaev posted this 06 June 2013

Can you share the problematic images? There is no attachment on the post.