I just signed up for the SDK and am testing it out with my product. It works pretty well so far, but I have question regarding the regular expression functionality for processTextField.

One of the fields I need to recognize is a name field. These names are actually the names of my users and therefore I know the entire domain of possible entries. I'd like to pass this domain to the SDK to give it a better chance of a positive match. I'm trying to use the regExp parameter of processTextField to accomplish this, but it seems to have no effect on the outcome. I have tried passing

/(name1)|(name2)/i

and

(name1)|(name2)

and even

(name)

none of these seem to have an effect on the outcome. Guidance or suggestions, appreciated!

asked 29 Mar '12, 16:44

dante's gravatar image

dante
376

edited 09 Jun '12, 11:38

Vasily%20Panferov's gravatar image

Vasily Panferov ♦♦
5422516


Specifying regular expression doesn't actually force recognition engine to always use it in results. There is a possibility to get output completely different to what is specified in regexp.

When recognizer has several hypotheses how to recognize given word, it checks all of them against regular expression. If given recognition variant conforms to regexp, it has higher probability of being selected as final recognition output. But if there is no variant that matches regular expression, the result cannot be conforming to it.

However, there are other options besides regular expression to improve recognition quality. For example, internally there are many specialized dictionaries for names in different languages.

You can send your image snippet to cloudocrsdkbeta@abbyy.com. We'll take a look at it and probably suggest some options or provide something in cloud API to get better results.

link

answered 30 Mar '12, 11:17

Vasily%20Panferov's gravatar image

Vasily Panferov ♦♦
5422516

Okay thanks - this makes sense. Just to check, is the first regexp i listed valid - i.e. /(name1)|name2)/i -? If so, I'll use that and then do other checks on my side if it does not come back with a known name.

I'm still just testing, so i don't have very good samples to share, but I'll keep your offer in mind and send once I have better data. Thanks for your help!

(30 Mar '12, 15:44) dante
Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×25
×4

Asked: 29 Mar '12, 16:44

Seen: 1,448 times

Last updated: 09 Jun '12, 11:38

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal