PDF to XML

  • 70 Views
  • Last Post 12 February 2020
joshi.pankaj112@gmail.com posted this 03 January 2020

Hi Team,

I am trying to convert pdf file to xml.

getting output as xml but not in well format.

can you please help me.

Code Here :

ocr_engine = CloudOCR(application_id='XX', password='XX')

pdf = open('file.pdf', 'rb')

result = ocr_engine.process_and_download(file, exportFormat='xml', language='English')

for format, content in result.items():

    with open('final_xml_file13.xml', 'wb') as output_file:

        output_file.write(content.read())

 

Attached output XML in short

 

Order By: Standard | Newest | Votes
Bond James posted this 14 January 2020

Hi Team,

I am trying to convert pdf file to xml.

getting output as xml but not in well format.

can you please help me.

Code Here :

ocr_engine = CloudOCR(application_id='XX', password='XX')

pdf = open('file.pdf', 'rb')

result = ocr_engine.process_and_download(file, exportFormat='xml', language='English')

for format, content in result.items():

    with open('final_xml_file13.xml', 'wb') as output_file:

        output_file.write(content.read())

 

Attached output XML in short

 

It is very little available on the net about PDF to XML, and usually its too hard to find some good way to convert the PDF to Excel. Normally you can find any of the software that can convert any of the formats into the PDF but from PDF into XML, you will have to search a lot on the net. I 've searched and I tried most of the result I got in Google for pdf to XML, and among them I found one resource to be worth read and use.

 

JonsonSmith posted this 12 February 2020

 You can use online PDF to XML converter.

Close