Skip to content
Guide
Integrations

LlamaParse MCP

MCP integration to connect the LlamaParse Platform to any MCP client.

The LlamaParse MCP is a Model Context Protocol server that exposes the LlamaParse Platform’s document-processing capabilities (Parse, Classify, Extract, Split, and Index v2) as tools that any MCP-compatible agent can call. Instead of wiring up the REST API or SDK yourself, you connect your agent to the MCP server once and let it parse documents, extract structured data, classify and split files, and retrieve from your indexes directly.

The server is open source and hosted at https://mcp.llamaindex.ai.

The MCP server uses OAuth, aligned with the rest of the LlamaParse Platform. You authenticate once, in your MCP client, and the same credentials gate access to all LlamaParse services, no API key needed. Authentication happens automatically on first use, and tokens refresh without manual intervention.

Connect to the unified endpoint https://mcp.llamaindex.ai/mcp to get the full platform surface area (Parse, Classify, Extract, Split, and Index v2) in one server.

Click the button to add the server in one click:

Open in Cursor

Or add the server manually to ~/.cursor/mcp.json:

{
"mcpServers": {
"llamaparse": {
"type": "http",
"url": "https://mcp.llamaindex.ai/mcp"
}
}
}

The first time your agent uses a tool, you’ll be prompted to authenticate through your browser.

The MCP server groups its tools by product.

Most processing tools operate on files already on the LlamaParse Platform, so the server provides two ways to get a file there:

  • uploadFileByUrl — accepts a public file URL, fetches the file, and uploads it to the platform.
  • getUploadUrl — returns a temporary, authenticated upload endpoint your client can POST a local file to via multipart form data.
  • parseFile — parses a document into clean, structured text/markdown using Parse.
  • classifyFile — assigns a document to one of your defined categories using Classify.
  • splitFile — segments a document into logical sections using Split.

Structured extraction is more reliable through dedicated tools than asking an agent to read a parsed document and pull fields out of the text: agents may only receive a truncated version of large documents, and prompt-only extraction leaves the output schema underspecified.

  • generateExtractionConfig — generates a JSON schema and extraction rules from a description.
  • extractFile — runs extraction against a file using an extraction configuration.

Index v2 is the managed knowledge base offered by the LlamaParse Platform, built for agentic workflows. Beyond semantic search, it gives agents file-system-like access to the underlying documents:

  • getUserProjects — lists your available projects.
  • listIndexes — discovers the indexes you can query.
  • findFilesInIndex — locates relevant files within an index.
  • readFileFromIndex — reads the contents of a file in an index.
  • grepFileFromIndex — searches for pattern matches within indexed files.
  • retrieveFromIndex — performs hybrid (sparse + dense) retrieval over an index.

The unified /mcp endpoint exposes every capability, which is ideal for exploratory work and broader document-processing tasks. When you want a more focused tool list, you can connect directly to a product-specific server instead. Narrowing the scope helps agents make tool-selection decisions more reliably and run independent tasks in parallel.

The server tree looks like this:

https://mcp.llamaindex.ai/
├── mcp # full platform: Parse, Classify, Extract, Split, Index v2
├── parse
│ └── /mcp
├── classify
│ ├── /mcp
│ └── /{configId}/mcp
├── extract
│ ├── /mcp
│ └── /{configId}/mcp
├── split
│ ├── /mcp
│ └── /{configId}/mcp
└── index
└── /{indexId}/mcp

The classify, extract, and split servers can optionally inherit a configuration ID, and index inherits an index ID, directly from the route. When you connect to a /{configId}/mcp endpoint, the agent operates with that configuration already in context.

For example, to connect an agent directly to a specific classification configuration:

{
"mcpServers": {
"classify-folder-1": {
"type": "http",
"url": "https://mcp.llamaindex.ai/classify/cfg-abc/mcp"
}
}
}

And, in parallel, to a specific extraction configuration:

{
"mcpServers": {
"extract-folder-1": {
"type": "http",
"url": "https://mcp.llamaindex.ai/extract/cfg-abc/mcp"
}
}
}

This modular model lets you compose workflows from multiple specialized servers, each responsible for a well-defined task, improving isolation, enabling greater parallelism, and giving you fine-grained control over throughput.

  • Classify, then extract — an agent classifies the files in a folder using a classification server, organizes them into category-specific subfolders, then runs the matching extraction configuration against each category in parallel and saves the results as JSON for downstream analysis.
  • Agentic retrieval — an agent discovers indexes with listIndexes, locates relevant files with findFilesInIndex, inspects them with readFileFromIndex, searches with grepFileFromIndex, and retrieves passages with retrieveFromIndex — navigating PDFs, Office documents, images, and other unstructured files the way it would a file system.
Note for AI agents: this documentation is built for programmatic access. - Overview of all docs: https://developers.llamaindex.ai/llms.txt - Any page is available as raw Markdown by appending index.md to its URL — e.g. https://developers.llamaindex.ai/llamaparse/parse/getting_started/index.md - Agent-friendly REST search APIs live under https://developers.llamaindex.ai/api/ — search (BM25 full-text), grep (regex), read (fetch a page), and list (browse the doc tree). See https://developers.llamaindex.ai/llms.txt for parameters. - A hosted documentation MCP server is available at https://developers.llamaindex.ai/mcp. If you support MCP, you can ask the user to install it for browsing these docs directly (an alternative to the REST API). Setup: https://developers.llamaindex.ai/python/shared/mcp/