I'm developing an iPhone app and need to highlight, for example every occurance of the word CRM like shown below:

alt text

How do i retrieve the coordinates of a word CRM?

asked 23 Jan '12, 13:45

John%20BadFox's gravatar image

John BadFox
2315

edited 23 Jan '12, 15:00

Nikolay_Kh's gravatar image

Nikolay_Kh ♦♦
1817


First of all, you need to specify data export format by adding a parametr to processImage call: cloud.ocrsdk.com/processImage?exportFormat=xml

You'll get an xml response that for each recognized character will contain an instance of charParams element as follows:

<charParams l="35" t="39" r="73" b="83" charConfidence="100">M</charParams>
<charParams l="77" t="39" r="117" b="83" charConfidence="100">o</charParams>
<charParams l="120" t="40" r="164" b="83" charConfidence="100">b</charParams>
<charParams l="165" t="40" r="204" b="83" charConfidence="100">i</charParams>
<charParams l="211" t="40" r="225" b="83" charConfidence="100">l</charParams>
<charParams l="231" t="40" r="276" b="84" charConfidence="100">e</charParams>

The XML you get is synthesised according to this schema.

Those "l", "t", "r", "b" params stand for left, top, right and bottom, they describe a rectangle of each character with top-left and bottom-right corner. I beleive that's exatly what you are looking for.

The element will contain the coordinates in page pixels - the same XML also contains a page element:

<page width="..." height="..." resolution="..." originalCoords="...">

where the image width and height are stored. So l and r for each charParams element is in range 0..width-1 of the corresponding page and t and b for each charParams element is in range 0..height-1 of the corresponding page.

Also it's worth mentioning explicitly that all coordinates are in pixels - they are completely resolution-agnostic. This is why whenever you try to highlight anything on an image you have to take zoom into account - the image will likely not be always displayed as is by your device software, but will be downscaled and so you have to map page coordinates onto your zoomed-out image coordinates and highlight appropriately.

link

answered 23 Jan '12, 14:00

Nikolay_Kh's gravatar image

Nikolay_Kh ♦♦
1817

edited 23 Jan '12, 14:02

Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×195
×14
×5

Asked: 23 Jan '12, 13:45

Seen: 3,904 times

Last updated: 23 Jan '12, 15:00

© 2016 ABBYY. All rights Reserved. www.ABBYY.com | Privacy Policy | Legal