Centralized Provider Configuration
Self-Hosting Documentation Access
This section requires a password to access. Interested in self-hosting? Contact sales to learn more.
Centralized LLM provider configuration allows you to configure LLM and managed embedding providers with custom credentials and API gateways. This configuration method enables use cases like routing through API gateways (Portkey, LiteLLM) or using custom credentials.
Use Cases
Section titled “Use Cases”- Custom API Gateways: Route LLM requests through gateways like Portkey or LiteLLM
- Custom Endpoints: Use custom base URLs for proxies or regional endpoints
- Custom Headers: Add custom HTTP headers per provider instance (e.g., for gateway authentication)
- Multiple Credentials: Configure multiple provider instances with different API keys
- Managed Embeddings: Route managed Index embeddings through centrally configured OpenAI-compatible embedding providers
Configuration Structure
Section titled “Configuration Structure”Add provider configurations to your Helm values under config.llms.providerConfigs:
config: llms: providerConfigs: - id: "my-config-name" # User-defined identifier (can be anything) provider: "openai" # Provider type: "openai", "anthropic", "azure", "gemini", or "vertexai" model_id: "openai-gpt-4o" # LlamaCloud model identifier (fixed values, see below) provider_model_name: "gpt-4o" # Optional: Provider-specific model name override, usually only needed if using an LLM gateway that requires a specific model name enabled: true # Enable/disable this configuration credentials: # Provider-specific credentials api_key: "sk-..." base_url: "https://custom.api.endpoint" # Optional custom endpoint headers: # Custom HTTP headers (optional) X-Custom-Header: "value"Supported Providers
Section titled “Supported Providers”OpenAI (Fully Supported)
Section titled “OpenAI (Fully Supported)”- id: "openai-primary" provider: "openai" model_id: "openai-gpt-4o" credentials: api_key: "sk-..." # Required org_id: "org-..." # Optional base_url: "https://api.openai.com/v1" # Optional headers: # Optional X-Custom-Header: "value"Supported model_id values:
openai-gpt-4oopenai-gpt-4o-miniopenai-gpt-4-1openai-gpt-4-1-miniopenai-gpt-4-1-nanoopenai-gpt-5openai-gpt-5-miniopenai-gpt-5-nanoopenai-text-embedding-3-smallopenai-text-embedding-3-large
Anthropic (Fully Supported)
Section titled “Anthropic (Fully Supported)”- id: "anthropic-primary" provider: "anthropic" model_id: "anthropic-sonnet-4.5" credentials: api_key: "sk-ant-..." # Required base_url: "https://api.anthropic.com" # Optional headers: # Optional X-Custom-Header: "value"Supported model_id values:
anthropic-sonnet-4.6anthropic-sonnet-4.5anthropic-sonnet-4.0anthropic-sonnet-3.7anthropic-sonnet-3.5anthropic-sonnet-3.5-v2anthropic-haiku-4.5anthropic-haiku-3.5anthropic-opus-4.6anthropic-opus-4.5
Azure OpenAI (Fully Supported)
Section titled “Azure OpenAI (Fully Supported)”- id: "azure-sweden" provider: "azure" model_id: "openai-gpt-4o" credentials: api_key: "..." # Required endpoint: "https://your-resource.openai.azure.com" # Required deployment_id: "gpt-4o" # Optional api_version: "2024-08-06" # Optional headers: # Optional X-Custom-Header: "value"Supported model_id values:
openai-gpt-4oopenai-gpt-4o-miniopenai-gpt-4-1openai-gpt-4-1-miniopenai-gpt-4-1-nanoopenai-gpt-5openai-gpt-5-miniopenai-gpt-5-nanoopenai-text-embedding-3-smallopenai-text-embedding-3-large
For Azure-hosted embedding models, set model_id to the LlamaCloud model identifier and credentials.deployment_id to your Azure deployment name.
Google Gemini (Fully Supported)
Section titled “Google Gemini (Fully Supported)”- id: "gemini-primary" provider: "gemini" model_id: "gemini-2.5-flash" credentials: api_key: "AIza..." # Required base_url: "https://generativelanguage.googleapis.com" # Optional headers: # Optional X-Custom-Header: "value"Supported model_id values:
gemini-3.1-progemini-3.0-progemini-3.0-flashgemini-2.5-progemini-2.5-flashgemini-2.5-flash-lite
Google Vertex AI (Fully Supported)
Section titled “Google Vertex AI (Fully Supported)”Vertex AI serves both Gemini and Anthropic Claude models. Use service-account
authentication with project_id + location + credentials (JSON-serialised
service account key).
- id: "vertex-primary" provider: "vertexai" model_id: "gemini-2.5-flash" credentials: project_id: "your-gcp-project-id" # Required location: "us-central1" # Required credentials: |- # Required (service account key JSON) { "type": "service_account", "project_id": "your-gcp-project-id", "private_key_id": "...", "private_key": "-----BEGIN PRIVATE KEY-----\n...\n-----END PRIVATE KEY-----\n", "client_email": "...@your-gcp-project-id.iam.gserviceaccount.com", "client_id": "..." } base_url: "https://us-central1-aiplatform.googleapis.com" # Optional headers: # Optional X-Custom-Header: "value"Supported model_id values (Gemini on Vertex):
gemini-3.1-progemini-3.0-progemini-3.0-flashgemini-2.5-progemini-2.5-flashgemini-2.5-flash-litegemini-2.0-flashgemini-2.0-flash-lite
Supported model_id values (Anthropic on Vertex):
anthropic-sonnet-4.6anthropic-sonnet-4.5anthropic-sonnet-4.0anthropic-sonnet-3.7anthropic-haiku-4.5anthropic-opus-4.6anthropic-opus-4.5
Common Use Cases
Section titled “Common Use Cases”Custom API Gateway
Section titled “Custom API Gateway”Use a custom API gateway or proxy (e.g., Portkey, LiteLLM):
config: llms: providerConfigs: - id: "portkey-openai" provider: "openai" model_id: "openai-gpt-4o-mini" provider_model_name: "@openai/gpt-4o-mini" enabled: true credentials: api_key: "your-portkey-api-key" base_url: "https://api.portkey.ai/v1" headers: x-portkey-api-key: "your-portkey-api-key"
- id: "portkey-anthropic" provider: "anthropic" model_id: "anthropic-sonnet-4.5" provider_model_name: "@anthropic/claude-sonnet-4-5" enabled: true credentials: api_key: "your-portkey-api-key" base_url: "https://api.portkey.ai" headers: x-portkey-api-key: "your-portkey-api-key" x-portkey-strict-open-ai-compliance: "False"Verification
Section titled “Verification”After configuration, verify your setup:
-
Verify in Admin UI: Check the LlamaCloud admin interface for available models
-
Test parsing: Upload a document to LlamaParse to confirm the configured providers are working