I want to convert pdf and images to text file and extract data. All of my documents will contain both English and Thai language. I have tried various options to extract text.
Option 1 : --lang=English,Thai --profile=textExtraction
Option 2 : --lang=Thai --profile=textExtraction
Option 3 : --lang=English,Thai --profile=documentConversion
Option 4 : --lang=Thai --profile=documentConversion
There was a lot of mismatches between the input data and the output text. Option 4 gives the most accurate conversion. But the English text will be lost in this case. Is there any way were I can upload a single file and receive two output files. One for English and one for Thai. Otherwise I will have to upload the file twice.