Getting Started

Guide

Parse

Quick start guide for Parse, covering API key generation and document parsing using Python, TypeScript, the REST API, or the Web UI.

Get your first parse job running in under a minute—whether you prefer Python, TypeScript, the REST API, or the Web UI.

Before you begin

You’ll need a LlamaCloud API key. Get your API key →

Set it as an environment variable so the SDKs pick it up automatically:

export LLAMA_CLOUD_API_KEY="llx-..."

Your first parse job in 60 seconds

Install the SDK:

pip install llama-cloud>=2.1

Parse a document:

from llama_cloud import LlamaCloud

client = LlamaCloud()  # reads LLAMA_CLOUD_API_KEY from the environment

# Upload and parse a document
file = client.files.create(file="./attention_is_all_you_need.pdf", purpose="parse")
result = client.parsing.parse(
    file_id=file.id,
    tier="agentic",
    version="latest",
    expand=["markdown"],
)

# Print the markdown for the first page
print(result.markdown.pages[0].markdown)

That’s it. The SDK handles job polling for you—client.parsing.parse() blocks until the job finishes and returns the full result.

Prefer async? Swap LlamaCloud for AsyncLlamaCloud and await the calls:

from llama_cloud import AsyncLlamaCloud
import asyncio

async def main():
    client = AsyncLlamaCloud()
    file = await client.files.create(file="./attention_is_all_you_need.pdf", purpose="parse")
    result = await client.parsing.parse(file_id=file.id, tier="agentic", version="latest", expand=["markdown"])
    print(result.markdown.pages[0].markdown)

asyncio.run(main())

Configure your parse job

The snippet above uses defaults. When you’re ready to tune the output, add input_options, output_options, or processing_options:

result = client.parsing.parse(
    file_id=file.id,
    tier="agentic",
    version="latest",
    output_options={
        "markdown": {"tables": {"output_tables_as_markdown": True}},
        "images_to_save": ["screenshot"],
    },
    processing_options={
        "ocr_parameters": {"languages": ["en"]},
    },
    expand=["text", "markdown", "items", "images_content_metadata"],
)

Each group has a dedicated reference page:

Input Options — page ranges, crop boxes, file-type-specific controls, and cache behavior
Output Options — markdown styling, spatial text, screenshots, tables-as-spreadsheet, and more
Processing Options — OCR languages, ignore rules, chart parsing, cost optimizer
Retrieving Results — every expand value and how to control what comes back

Install the SDK:

npm install @llamaindex/llama-cloud

Create a parse.ts file:

import LlamaCloud from '@llamaindex/llama-cloud';
import fs from 'fs';

const client = new LlamaCloud(); // reads LLAMA_CLOUD_API_KEY from the environment

// Upload and parse a document
const file = await client.files.create({
  file: fs.createReadStream('./attention_is_all_you_need.pdf'),
  purpose: 'parse',
});

const result = await client.parsing.parse({
  file_id: file.id,
  tier: 'agentic',
  version: 'latest',
  expand: ['markdown'],
});

// Print the markdown for the first page
console.log(result.markdown.pages[0].markdown);

Run it:

npx tsx parse.ts

The SDK handles job polling for you—client.parsing.parse() awaits until the job finishes and returns the full result.

Configure your parse job

Pass input_options, output_options, or processing_options to tune the output:

const result = await client.parsing.parse({
  file_id: file.id,
  tier: 'agentic',
  version: 'latest',
  output_options: {
    markdown: { tables: { output_tables_as_markdown: true } },
    images_to_save: ['screenshot'],
  },
  processing_options: {
    ocr_parameters: { languages: ['en'] },
  },
  expand: ['text', 'markdown', 'items', 'images_content_metadata'],
});

See the dedicated reference pages for the full set of options:

Input Options
Output Options
Processing Options
Retrieving Results — every expand value and how to control what comes back

If you’d prefer to skip the SDKs, the REST API lets you parse from any environment. Unlike the SDKs, the raw API is asynchronous: you upload a file, kick off a parse job, then poll for the result.

1. Upload a file

curl -X POST \
  https://api.cloud.llamaindex.ai/api/v1/beta/files \
  -H 'Accept: application/json' \
  -H "Authorization: Bearer $LLAMA_CLOUD_API_KEY" \
  -F 'purpose=parse' \
  -F 'file=@/path/to/your/file.pdf;type=application/pdf'

Grab the id from the response:

{
  "id": "cafe1337-e0dd-4762-b5f5-769fef112558"
}

2. Start a parse job

curl -X POST \
  'https://api.cloud.llamaindex.ai/api/v2/parse' \
  -H 'Accept: application/json' \
  -H 'Content-Type: application/json' \
  -H "Authorization: Bearer $LLAMA_CLOUD_API_KEY" \
  --data '{
    "file_id": "<file_id>",
    "tier": "agentic",
    "version": "latest"
  }'

You’ll get a job_id back:

{
  "id": "c0defee1-76a0-42c3-bbed-094e4566b762",
  "status": "PENDING"
}

3. Poll for the result

Call the result endpoint with your job_id. The job.status field will be PENDING, RUNNING, COMPLETED, or FAILED. Keep polling until you see COMPLETED, then use expand to request the fields you want:

curl -X GET \
  'https://api.cloud.llamaindex.ai/api/v2/parse/<job_id>?expand=markdown' \
  -H 'Accept: application/json' \
  -H "Authorization: Bearer $LLAMA_CLOUD_API_KEY"

See Retrieving Results for every available expand value.

Alternative: use a coding agent

If you’re already using Claude Code, Cursor, or another coding agent, you can skip the SDK entirely. Install the Parse agent skill and your agent can parse documents on the fly:

npx skills add run-llama/llamaparse-agent-skills --skill llamaparse

Then just ask your agent things like:

“Parse this PDF and extract the text as markdown.”
“Extract every table from each invoice in ./invoices and save them as CSVs.”
“Parse this financial report with cost optimizer enabled.”

The agent writes the SDK calls for you. It picks the right tier, configures options, and saves output — without you needing to remember API details.

Requirements: Node 18+, LLAMA_CLOUD_API_KEY in the environment, and a coding agent that supports Vercel-style skills.

Two skills are available:

Skill	What it does
`llamaparse`	Full parsing via the LlamaCloud API — charts, tables, images, complex layouts
`liteparse`	Local-first, no API key needed — best for plain text and simple layouts

Install both with npx skills add run-llama/llamaparse-agent-skills (no --skill flag). Source on GitHub →

Next steps

Choose the right tier → Tiers explains when to use Agentic Plus vs. Agentic vs. Cost Effective vs. Fast.
Learn to configure options → Configuring Parse covers every knob — input, output, and processing options.
Save money on long documents → Cost Optimizer routes each page to the right tier automatically.
See real examples → Parse Examples — runnable tutorials for common use cases.

Resources

Pricing & credits
Need structured data instead of markdown? Check out LlamaExtract.

Note for AI agents: this documentation is built for programmatic access. - Overview of all docs: https://developers.llamaindex.ai/llms.txt - Any page is available as raw Markdown by appending index.md to its URL — e.g. https://developers.llamaindex.ai/llamaparse/parse/getting_started/index.md - Agent-friendly REST search APIs live under https://developers.llamaindex.ai/api/ — search (BM25 full-text), grep (regex), read (fetch a page), and list (browse the doc tree). See https://developers.llamaindex.ai/llms.txt for parameters. - A hosted documentation MCP server is available at https://developers.llamaindex.ai/mcp. If you support MCP, you can ask the user to install it for browsing these docs directly (an alternative to the REST API). Setup: https://developers.llamaindex.ai/python/shared/mcp/

Getting Started

Before you begin

Your first parse job in 60 seconds

Configure your parse job

Configure your parse job

1. Upload a file

2. Start a parse job

3. Poll for the result

Choosing a tier

Alternative: use a coding agent

Next steps

Resources