I am doing a research project and need to deal with printed Classical Chinese rather than the modern Simplified/Traditional Chinese that ABBYY FineReader supports. I find FineReader particularly helpful, but I need to customize the OCR parameters extensively to improve the results.

I visited the Cloud SDK website today and found most of its links inaccessible, with the message "We're sorry...The page you have requested may have been removed or repositioned and cannot be found." As a result, I have very little idea of what the SDK can do. I have submitted a request to download the SDK trial, but I expect to wait a while before I get it. Can anybody tell me more about the SDK, for example whether it can be set to a specific font size/typeface, or whether I can add/delete special symbols? If not, what other kind of support can I get from the FineReader SDK?
I am posting an example below to give you a rough idea of what we need.
-Please note that some characters have straight or curved lines to their left, and FineReader often misrecognizes these in a variety of ways. Because the text is laid out vertically, these lines are essentially underlines. Is there a way to tell FineReader to treat them as underlines when the text is in a vertical layout?
-There are at least two fonts in the text, as well as different font sizes. FineReader often fails to pick up text set in a different font or size and produces incorrect results. Is there a way for me to set those parameters? Or can FineReader handle text with mixed font sizes and typefaces, especially when more than one language is involved?
-About punctuation: Chinese punctuation marks have their own Unicode code points, and they look different in print depending on the font. Since I cannot set the font in FineReader, this always causes recognition problems. I assume it could be solved by specifying a CJK font, but I am not sure, since I haven't tried the SDK yet.
-We have a very large number of books to process, so it is impossible for me to adjust the images by hand before OCR. Given the volume, a high-quality manual check after OCR is not realistic either. If you have any suggestions for improving the OCR results, please help.
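To make the punctuation point concrete: in vertical text, marks such as 、 。 「 」 can come back from OCR either as the ordinary CJK code points or as the separate "vertical presentation form" code points that Unicode defines for them. The sketch below is only an illustration of a post-processing step that folds the vertical forms back to the ordinary ones. It uses Python's standard `unicodedata` module and is not a FineReader or SDK feature; the function name `fold_vertical_punctuation` is made up for this example, and whether the OCR output actually contains these code points is an assumption.

```python
import unicodedata


def fold_vertical_punctuation(text: str) -> str:
    """Replace vertical presentation forms with their base punctuation.

    Only characters whose Unicode decomposition is tagged <vertical> are
    changed; fullwidth punctuation and all other text are left untouched.
    """
    out = []
    for ch in text:
        decomp = unicodedata.decomposition(ch)
        if decomp.startswith("<vertical>"):
            # decomp looks like "<vertical> 3001"; the remaining fields
            # are the hexadecimal code points of the base form.
            code_points = decomp.split()[1:]
            out.append("".join(chr(int(cp, 16)) for cp in code_points))
        else:
            out.append(ch)
    return "".join(out)


# Hypothetical OCR output using vertical corner brackets and full stop.
sample = "\ufe41子曰\ufe42\ufe12"
print(fold_vertical_punctuation(sample))
```

A blanket NFKC normalization would also fold these forms, but it would additionally turn fullwidth punctuation such as ， into ASCII, which is probably not wanted for Chinese text, so the sketch restricts itself to the `<vertical>` mappings.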
Thank you all very much!!!