PDF to Word

Aspose provides a high fidelity API to programmatically convert PDF documents to Word format and in the opposite directions with professional quality. The combined use of Adobe and Microsoft Office technologies has a lot to offer to the end-user.

These major document formats, including DOCX, DOC, RTF, ОDT and PDF, are capable of encapsulating almost any type of data including text, tables, raster and vector graphics, video, audio, and also support a wide range of formatting features.

Despite similarities, PDF and Word documents have considerable differences in the operational capabilities.

PDF documents support multilevel security options and are difficult to extract information. PDF format is the best choice when the document’s author allows others to read and print document copies only. Word document formats, in contrast, are great for collaborative development, but they aren’t always the best choice for distributing as they can be easily modified without author’s permission.

You may require to convert an immutable PDF to an editable DOCX or DOC document and it is also a fine solution if you need to extract text from a PDF file easily.

Aspose conversion engine transforms a PDF to Word-based document format, that can be naturally edited with any mainstream word processor. However, some complex formatting might appear slightly different from the original document. PDF to Word conversion supports detecting headers and footers. Sections of PDF pages containing page numbers, document names, etc. will be converted as Word’s headers and footers. These headers and footers do not affect the rest of the page layout and are not moved while the document is being edited.

Aspose PDF to Word conversion engine currently supports the following features:

  • Text and paragraphs
  • Text formatting (font, size, foreground/background, options like bold, italic, underline)
  • Bulleted and numbered lists (including nested lists)
  • Tables (bordered and non-bordered, without nested tables)
  • Images conversion
    • Semi-transparent images
    • Rotated images
    • Inline images (images that go “inside” the text, being a logical part of text paragraph)
  • Encrypted PDF files
  • Properly converts style and destination of hyperlinks, both to URLs and local files
  • Headers/footers detection for PDF conversion
  • Bookmarks
  • Basic vector graphics

REST API

The OpenAPI Specification defines a publicly accessible programming interface and lets you run PDF to Word conversions directly from a web browser.

You can use cURL command-line tool to access Aspose.Words web services and convert PDF documents to Word format easily. The following code demonstrates how to convert PDF to DOCX with cURL. Feel free to download and explore sample input demo.pdf and output demo.docx files designed to act as a demonstration and let you figure out the details quickly.

Cloud SDK Family

Using an SDK is the best way to speed up the development. An SDK takes care of low-level details and lets you focus on your project tasks.

Please check out the GitHub repository for a complete list of Aspose.Words SDKs.

The following code examples demonstrate how to convert PDF to Word programmatically using various Aspose.Words SDKs:

See also