Just wondering what the precise difference between the serifProbability and charConfidence parameters in XML output are and what exctly each signify?
asked 01 Apr '13, 07:22
Hello G Moore,
Thank you for your question! Please see the detailed description below.
Stores the value of character confidence. It is in the range from 0 to 100, and -1 corresponds to the fact that confidence is undefined. It represents an estimate of recognition confidence of a character in percentage points. The greater its value, the greater the confidence. The characters extracted from the source PDF file without recognition have the character confidence equal to 100.
The value of this property specifies probability that a character is written with a Serif font. It is in the range from 0 to 100, and 255 corresponds to the fact that this probability is undefined.
This answer is marked "community wiki".
answered 02 Apr '13, 18:56
Anastasia Ga... ♦♦