LlamaParse Configuration
Configuration and scaling recommendations for LlamaParse OCR services and workers.
Overview
LlamaParse components:
- OCR Service: Text extraction from document images
- LlamaParse Workers: Document processing (fast, balanced, agentic modes)
OCR Service Configuration
The OCR service runs on GPU or CPU infrastructure.
Hardware Recommendations
CPU deployments: Use x86 architecture (50% better throughput than ARM).
Resource Requirements
Configuration | GPU | CPU |
---|---|---|
Minimum instances | 2 | 12 |
Pages per minute per pod | 100 | ~2 per worker |
Recommended workers per pod | 4 | Core count ÷ 2 |
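As a rough worked example of what those per-pod numbers imply (the 16-core pod size below is an illustrative assumption, not a recommendation):

```python
# Per-pod OCR throughput estimate derived from the table above.
cpu_cores_per_pod = 16                      # illustrative pod size
cpu_ocr_workers = cpu_cores_per_pod // 2    # recommended workers: core count ÷ 2
cpu_pages_per_min = cpu_ocr_workers * 2     # ~2 pages/min per CPU worker
gpu_pages_per_min = 100                     # per GPU pod

print(f"CPU pod ({cpu_cores_per_pod} cores): ~{cpu_pages_per_min} pages/min")
print(f"GPU pod: ~{gpu_pages_per_min} pages/min")
```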
Scaling Ratios
- CPU: 2 CPU OCR workers (2 cores each) per LlamaParse worker
- GPU: 1 GPU OCR worker per 8 LlamaParse workers
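A minimal sizing sketch that applies these ratios together with the minimum instance counts from the table above; the 8-worker input mirrors the first row of the Scaling Examples table below:

```python
import math

def ocr_sizing(llamaparse_workers: int) -> dict:
    """Apply the documented OCR-to-LlamaParse scaling ratios and minimums."""
    cpu = max(12, llamaparse_workers * 2)            # 2 CPU OCR workers per LlamaParse worker
    gpu = max(2, math.ceil(llamaparse_workers / 8))  # 1 GPU OCR worker per 8 LlamaParse workers
    return {"cpu_ocr_workers": cpu, "gpu_ocr_workers": gpu}

print(ocr_sizing(8))  # {'cpu_ocr_workers': 16, 'gpu_ocr_workers': 2}
```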
LlamaParse Worker Configuration
Workers process documents in three modes:
Performance by Mode
Mode | Pages per Minute | Use Case |
---|---|---|
Fast | ~10,000 | High-volume, basic text extraction |
Balanced | ~250 | Standard parsing with good accuracy |
Agentic | ~100 | Complex documents requiring AI analysis |
Resource Requirements
Compute:
- CPU: 2 vCPUs per worker
- Memory: 2-16 GB RAM per worker
Deployment:
- Multiple workers per Kubernetes node
- ~6 workers per node (production)
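A small sketch of the node math these figures imply; the 24-worker target and 8 GB per-worker memory are illustrative assumptions:

```python
import math

llamaparse_workers = 24     # illustrative target, not a recommendation
workers_per_node = 6        # ~6 workers per node in production
vcpus_per_worker = 2
mem_gb_per_worker = 8       # pick within the 2-16 GB range for your workload

nodes = math.ceil(llamaparse_workers / workers_per_node)
print(f"{nodes} nodes; per node: "
      f"{workers_per_node * vcpus_per_worker} vCPUs, "
      f"{workers_per_node * mem_gb_per_worker} GB RAM for workers")
```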
Scaling Examples
Target Throughput | LlamaParse Workers | CPU OCR Pods | GPU OCR Pods |
---|---|---|---|
1,000 pages/min | 8 | 16 | 2 |
10,000 pages/min | 64 | 128 | 12 |
GenAI Providers
LlamaParse uses GenAI providers for parsing:
- parse_page_with_llm: LLM parsing (supports gpt-4o-mini, haiku-3.5)
- parse_page_with_lvm: Vision model parsing (supports gemini, openai, claude sonnet)
- parse_page_with_agent: Agentic parsing (supports claude, gemini, openai)
Provider fallback: When multiple providers are configured, LlamaParse automatically falls back to another provider if one becomes unavailable.
Supported providers:
- Claude/Haiku: Anthropic (US), AWS Bedrock, Google VertexAI
- OpenAI: OpenAI (US), OpenAI EU (parse_page_with_llm only), AzureAI
- Gemini: Google Vertex AI, Google GenAI
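Conceptually, the fallback behavior works like the sketch below; this is illustrative only (the provider names and call_provider helper are placeholders, not LlamaParse's actual implementation):

```python
class ProviderUnavailable(Exception):
    """Raised when a configured GenAI provider cannot serve the request."""

def call_provider(name: str, page_image: bytes) -> str:
    # Placeholder: a real deployment would call the configured provider here.
    raise ProviderUnavailable(name)

def parse_with_fallback(page_image: bytes, providers: list[str]) -> str:
    """Try each configured provider in order, falling back on unavailability."""
    for name in providers:
        try:
            return call_provider(name, page_image)
        except ProviderUnavailable:
            continue  # provider down or over quota; try the next one
    raise RuntimeError("All configured providers are unavailable")
```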
Advanced Configuration
OCR Worker Tuning
OCR_WORKER=<value> # Recommended: pod_core_count ÷ 2
OCR Concurrency Control
OCR_CONCURRENCY=8 # Default
- Lower: Fewer OCR pods, slower processing
- Higher: More OCR pods, faster processing
Image Processing Limits
MAX_EXTRACTED_IMAGES_PER_PAGES=30 # Default
Job Queue Concurrency
PDF_JOB_QUEUE_CONCURRENCY=1 # Default (recommended)
Do not change PDF_JOB_QUEUE_CONCURRENCY without understanding the performance implications.
GenAI Throughput Tuning
Limit throughput per mode to match TPM/RPM quotas:
ACCURATE_MODE_LLM_CONCURRENCY=250   # parse_page_with_llm (default)
MULTIMODAL_MODEL_CONCURRENCY=50     # parse_page_with_lvm (default)
PREMIUM_MODE_MODEL_CONCURRENCY=25   # parse_page_with_agent (default)
Token usage per 1k pages:
Mode | Requests | Input Tokens | Output Tokens |
---|---|---|---|
parse_page_with_llm | 1,010 | 1.2M | 1.5M |
parse_page_with_agent | 2,000 | 4M | 2M |
parse_page_with_lvm | 1,200 | 3M | 1.5M |
Providers like AWS Bedrock have low default quotas. Verify quotas accommodate desired parsing volume.
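To sanity-check a provider's quotas, the per-1k-page figures above can be scaled to a target throughput and compared against the provider's TPM/RPM limits; the target rate and quota values in this sketch are illustrative assumptions:

```python
import math

# parse_page_with_agent row from the table above (per 1,000 pages).
requests_per_1k = 2_000
tokens_per_1k = 4_000_000 + 2_000_000   # input + output tokens

target_pages_per_min = 100              # illustrative target parsing rate
scale = target_pages_per_min / 1_000

needed_rpm = math.ceil(requests_per_1k * scale)
needed_tpm = math.ceil(tokens_per_1k * scale)

provider_rpm_quota = 500                # illustrative quotas; check your provider
provider_tpm_quota = 400_000

print(f"Need ~{needed_rpm} requests/min and ~{needed_tpm} tokens/min")
print(f"Fits quota: {needed_rpm <= provider_rpm_quota and needed_tpm <= provider_tpm_quota}")
```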
Monitoring and Optimization
Key Metrics
- OCR throughput: Pages/minute
- Worker utilization: CPU/memory usage
- Queue depth: Pending jobs
- Error rates: Failed operations
Optimization
- Node placement: Co-locate workloads with complementary resource usage patterns
- Horizontal scaling: Add workers before increasing per-worker resources
- OCR scaling: Scale OCR services independently
- Memory management: Use restart policies for long-running deployments