Extract Text from PDF API Demonstration


This tool demonstrates DocumentAlchemy's ability to extract plain-text content from a PDF document.

Select the PDF from which you'd like to extract the unformatted text.

Then click the “Submit” button to create, view and download the extracted text.


This demonstration only exposes a few of the capabilities of the underlying API.

Using the full DocumentAlchemy API you can:

  • extract images or form data from PDF documents,
  • split, combine or watermark a PDF document,
  • convert a PDF document to Microsoft Word, Markdown, HTML, images and more,
  • convert Microsoft Office and other files to PDF,

and much more.

Visit the interactive API reference to learn more about DocumentAlchemy's support for converting PDF documents to text and other document processing methods.

Instantly add Microsoft Office support to your application, even on mobile, without the hassle and expense of maintaining a dedicated conversion box.

DocumentAlchemy is a RESTful web service API for converting, generating and processing “documents” in a variety of formats, including Microsoft Office, PDF, HTML, Markdown, OpenOffice, images (PNG/JPEG/GIF/WebP/TIFF) and data (XML/JSON/FDF).
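As a rough illustration of what calling a RESTful conversion service of this kind might look like, here is a minimal Python sketch that uploads a PDF and reads back plain text. The endpoint URL, the form field name "document", the Bearer authorization scheme and the helper names are all assumptions made for illustration, not DocumentAlchemy's actual API; the interactive API reference gives the real paths and authentication details.

```python
import os
import uuid
import urllib.request

# Placeholder values (assumptions): substitute the real endpoint and key
# documented in the API reference.
ENDPOINT = "https://api.example.com/pdf-to-text"
API_KEY = "YOUR_API_KEY"


def build_upload_request(pdf_bytes, filename, endpoint=ENDPOINT, api_key=API_KEY):
    """Build a multipart/form-data POST request carrying one PDF file."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        # "document" is a hypothetical form field name.
        f'Content-Disposition: form-data; name="document"; filename="{filename}"\r\n'
        "Content-Type: application/pdf\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    headers = {
        "Content-Type": f"multipart/form-data; boundary={boundary}",
        "Authorization": f"Bearer {api_key}",  # assumed auth scheme
    }
    return urllib.request.Request(
        endpoint, data=head + pdf_bytes + tail, headers=headers, method="POST"
    )


def extract_text(pdf_path):
    """Upload the PDF at pdf_path and return the response body as text."""
    with open(pdf_path, "rb") as f:
        request = build_upload_request(f.read(), os.path.basename(pdf_path))
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

Building the request separately from sending it keeps the upload logic easy to inspect without network access; a call such as `extract_text("report.pdf")` would then perform the actual round trip against whichever endpoint is configured.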

To learn more about the DocumentAlchemy API, visit the interactive API reference or review some of the sample code (at GitHub).

For full access, sign up for DocumentAlchemy now. All you need is an email address: there is no commitment and no credit card is required.

Copyright © 2018 DocumentAlchemy.