Convert PDF Document to DOC Format

Converting PDF to MS Doc and DocX

Introduction

This API enables you to convert PDF document to DOC or DOCX format. Please check API Reference page for the description of API’s parameters. The mode parameter of the API allows you to control how a PDF document is converted into a word processing document and it can contain one of the following two values:

  • Textbox This is the Default mode. This mode is fast and good for maximally preserving the original look of the PDF file, but editability of the resulting document could be limited. Every visually grouped block of text in the original PDF file is converted into a textbook in the resulting document. This achieves a maximal resemblance of the output document to the original PDF file. The output document will look good but it will consist entirely of textboxes and it could make further editing of the document in Microsoft Word quite hard.
  • Flow Full recognition mode, the engine performs grouping and multi-level analysis to restore the original document author’s intent and produce a maximally editable document. The downside is that the output document might look different from the original PDF file.

We have provided a separate API for each of the following use cases:

  1. Converts PDF document (located on storage) to DOC format and returns resulting file in the response content
  2. Converts PDF document (located on storage) to DOC format and uploads resulting file to storage
  3. Converts PDF document (in request content) to DOC format and uploads resulting file to storage

cURL Example

Input Document: Sample.pdf

Output Document: result.doc    

SDK Example