I need to batch process a tranche of newspaper articles (as JPEGs) in Python.
I decided to try ocrsdk.com because the text output from finereaderonline.com was perfect: the newspaper's multi-column text came out as continuous running text, not laid out in columns.
However, when I use the SDK, the output text keeps the column layout. Why do the two differ, and how can I get the SDK to produce the same running text?
Attached are test6 (from the finereaderonline.com site) and temp6 (the OCRSDK output).
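
For context, here is a minimal sketch of the kind of batch step involved. It is not the exact code that produced temp6. It assumes the Cloud OCR SDK `processImage` endpoint accepts `language`, `exportFormat`, and `profile` query parameters; the endpoint URL, the parameter values, and the helper names (`find_jpegs`, `build_request_url`, `plan_batch`) are illustrative assumptions and should be checked against the SDK documentation. It only plans the requests and does not send anything.

```python
from pathlib import Path
from urllib.parse import urlencode

# Assumed endpoint; verify against the SDK documentation for your account.
PROCESS_IMAGE_URL = "http://cloud.ocrsdk.com/processImage"


def find_jpegs(folder):
    """Return the JPEG files directly inside `folder`, sorted by name."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in {".jpg", ".jpeg"}
    )


def build_request_url(language="English", export_format="txt", profile=None):
    """Build the request URL; the parameter names are assumptions."""
    params = {"language": language, "exportFormat": export_format}
    if profile:
        params["profile"] = profile
    return PROCESS_IMAGE_URL + "?" + urlencode(params)


def plan_batch(folder, **settings):
    """Pair each JPEG with its request URL and a .txt output path."""
    url = build_request_url(**settings)
    return [(path, url, path.with_suffix(".txt")) for path in find_jpegs(folder)]
```

For example, `plan_batch("articles", export_format="txtUnstructured")` lists one planned request per JPEG in a hypothetical `articles` folder. Whether a different `exportFormat` or `profile` value accounts for the layout difference is exactly what I am unsure about.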