Skip to content

Self-Hosting & BYOC (Bring Your Own Cloud)

Deploy LlamaCloud in your own cloud infrastructure with full self-hosting and BYOC support. Run LlamaParse, LlamaExtract, and the complete LlamaCloud platform on Kubernetes using Helm charts — on AWS, Azure, or GCP.

LlamaCloud can be deployed entirely within your own infrastructure. Self-hosted deployments — also known as BYOC (Bring Your Own Cloud) — give you the full capabilities of the LlamaCloud platform while keeping your data, models, and infrastructure under your control.

A self-hosted LlamaCloud deployment includes the complete platform — the same features available on our managed cloud service:

  • LlamaParse — Document parsing with 130+ file format support, OCR, and agentic parsing modes
  • LlamaExtract — Structured data extraction from unstructured documents
  • LlamaCloud Indexes — Managed RAG pipelines with document ingestion, chunking, embedding, and retrieval
  • Full Web UI and API — The same interface and REST API as the managed service, running in your environment

LlamaCloud is packaged as a Helm chart and deployed on Kubernetes (EKS, AKS, GKE, or any conformant cluster). You configure the deployment through a values.yaml file that specifies your database connections, LLM credentials, authentication provider, and storage backend.

We provide deployment guides and tested configurations for all major cloud platforms:

Cloud ProviderKubernetesStorage
Amazon Web ServicesEKSS3
Microsoft AzureAKSAzure Blob Storage
Google Cloud PlatformGKEGCS

LlamaParse uses models from OpenAI, Anthropic, and Google, selected per use case for best quality. Self-hosted deployments support both direct API access and enterprise cloud-hosted models:

Model FamilyDirect APIEnterprise Cloud
OpenAIOpenAI APIAzure OpenAI
Anthropic ClaudeAnthropic APIAWS Bedrock
Google GeminiGoogle Gemini APIGoogle Vertex AI

All LLM calls go directly from your cluster to providers using your own API keys and contracts. For organizations that need to restrict to a single cloud provider’s models, this is configurable — contact us to discuss your requirements.

Self-hosting is designed for organizations with strict data governance requirements:

  • Data stays in your cloud — Documents, embeddings, and processed outputs never leave your infrastructure
  • Enterprise authentication — OIDC integration with your identity provider (Microsoft Entra ID, Okta, and others)
  • Network isolation — Deploy within your VPC with full control over ingress, egress, and network policies

Self-hosted LlamaCloud is available on Enterprise plans.

  1. Contact us to discuss your deployment and obtain a license key
  2. Follow the Quick Start guide to deploy
  3. Explore cloud-specific deployment guides for production-ready configurations