Text Extraction API

Extract text (as markdown) from pdf,docx,html,ppt,xsl and other files. Youtube transcript fetching is also supported.

What is the Text Extraction API?

Our Text Extraction API is a powerful tool designed to extract clean, structured markdown from multiple file formats, including:

  • ✔ PDFs – Preserve text structure and formatting while stripping unnecessary clutter.
  • ✔ Word Documents (DOC, DOCX) – Convert your documents into easy-to-use markdown without losing key formatting elements.
  • ✔ PowerPoint Presentations (PPT) – Extract text from slides and convert them into structured markdown.
  • ✔ HTML Files – Transform web pages into markdown-ready content effortlessly.
  • ✔ Excel Files – Transform Excel docs into markdown-ready content effortlessly.
  • ✔ Youtube videos – Fetch transcripts from youtube urls if they are available

With a simple API call, you can extract and format content programmatically, saving time and ensuring consistency across platforms.

Why Choose Our Text Extraction API?

  • ✅ Fast & Efficient – Process large files in seconds, reducing manual effort.
  • ✅ Supports Multiple File Formats – No need for multiple tools—handle all your document types with one API.
  • ✅ Markdown-Optimized Output – Clean and structured markdown makes content ready for CMS, documentation, or development use.
  • ✅ Developer-Friendly – Simple integration with clear documentation for easy implementation.
  • ✅ Ideal for Content Workflows – Perfect for bloggers, technical writers, developers, and businesses managing large-scale content.

Use Cases

  • 📌 Automated Content Conversion – Convert files into markdown for seamless integration into websites, blogs, and documentation platforms.
  • 📌 AI-Powered Content Processing – Use extracted markdown with AI tools for further analysis, summarization, or transformation.
  • 📌 eCommerce & SEO Optimization – Extract product descriptions from various formats and optimize them for online stores.
  • 📌 Archiving & Data Migration – Easily migrate legacy documents to modern markdown-based systems.

API End Points

The end point for connecting : If you subscribe directly from us use this endpoint.

	https://api.interfaces.one

API Documentation

POST  /text/extract
      parameters 
               input=<file>

The above should result in a response like the one below.

{
    "success": {
        "total": 2
    },
    "contents": {
        "text": "Family Life\n\nMerit Badge Workbook\n...",
        "format": "markdown"
    },
    "copyright": {
        "url": "api.interfaces.one",
        "year": "2025"
    }
}

Youtube transcription extraction

POST  /text/extract
      parameters 
               url=<youtubeurl>
               language="en"

Ratelimiting

Some of our API calls may be public(requires free API key) , while others requires paid subscription. To maintain our serice levels both public and private API endpoints are ratelimited. Please consult your specific plan that you subscribed to for the rate limit details.

Authentication

Currently we support API Key based authentication. Please set your Autorization Bearer request header with value of your API key. Alternatively you can also pass api_key= as a request parameter, though we strongly discourage this mode of passing the key, since it will allow others to see your key.

In curl this would mean sending the Authorization header like below.

            
 curl -i <url> -H "Authorization: Bearer <api_key>"
            
            

If you are using PHP.

            
$authorization = "Authorization: Bearer <api_key>";
$ch = curl_init('<url>'); // Initialise cURL
curl_setopt($ch, CURLOPT_HTTPHEADER, array('Content-Type: application/json' , $authorization ));
curl_setopt($ch, CURLOPT_CUSTOMREQUEST, "POST"); // OR GET
$result = curl_exec($ch);
curl_close($ch);
            
            

In javascript you can use headers key to add the authorization header

            
$.ajax({
   url: '<url>'
   type: 'GET',
   contentType: 'application/json'
   headers: {
      'Authorization': 'Bearer <api_key>'
   },
   success: function (result) {
       // CallBack(result);
   },
   error: function (error) {

   }
});
            
            

API Console

The following are the API calls you can make. You can try out / test the calls right from this page. Please note, javascript needs to be enabled to see the documentation below.

Pricing

Choose the Perfect Plan for Your Needs

We understand that every customer is unique, which is why we offer a variety of pricing plans tailored to fit different needs and budgets. Whether you're a small business owner, a growing startup, or a large enterprise, we have a plan that will provide you with the features and support you need to succeed. Explore our plans below and find the one that's right for you!

Interfaces One Text Extraction API Starter

$ 24 99 /month
  • 1 API Key
  • No setup, or hidden fees
  • 1500 API Calls/day
  • Fast support
Get started

Interfaces One Text Extraction API Premium

$ 49 99 /month
  • 1 API Key
  • No setup, or hidden fees
  • 5000 API Calls/day
  • Fast support
Get started

Interfaces One Text Extraction API Enterprise

$ 99 99 /month
  • 1 API Key
  • No setup, or hidden fees
  • 12500 API Calls/day
  • Fast support
Get started