Databricks
Integrate with Databricks LLMs APIs.
Pre-requisites
Section titled “Pre-requisites”-
Databricks personal access token to query and access Databricks model serving endpoints.
-
Databricks workspace in a supported region for Foundation Model APIs pay-per-token.
If you’re opening this Notebook on colab, you will probably need to install LlamaIndex 🦙.
% pip install llama-index-llms-databricks!pip install llama-indexfrom llama_index.llms.databricks import DatabricksNone of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and only tokenizers, configuration and file/data utilities can be used.export DATABRICKS_TOKEN=<your api key>export DATABRICKS_SERVING_ENDPOINT=<your api serving endpoint>Alternatively, you can pass your API key and serving endpoint to the LLM when you init it:
llm = Databricks( model="databricks-dbrx-instruct", api_key="your_api_key", api_base="https://[your-work-space].cloud.databricks.com/serving-endpoints/",)A list of available LLM models can be found here.
response = llm.complete("Explain the importance of open source LLMs")print(response)Call chat with a list of messages
Section titled “Call chat with a list of messages”from llama_index.core.llms import ChatMessage
messages = [ ChatMessage( role="system", content="You are a pirate with a colorful personality" ), ChatMessage(role="user", content="What is your name"),]resp = llm.chat(messages)print(resp)Streaming
Section titled “Streaming”Using stream_complete endpoint
response = llm.stream_complete("Explain the importance of open source LLMs")for r in response: print(r.delta, end="")Using stream_chat endpoint
from llama_index.core.llms import ChatMessage
messages = [ ChatMessage( role="system", content="You are a pirate with a colorful personality" ), ChatMessage(role="user", content="What is your name"),]resp = llm.stream_chat(messages)for r in resp: print(r.delta, end="")Note for AI agents: this documentation is built for programmatic access.
- Overview of all docs: https://developers.llamaindex.ai/llms.txt
- Any page is available as raw Markdown by appending index.md to its URL — e.g. https://developers.llamaindex.ai/llamaparse/parse/getting_started/index.md
- Agent-friendly REST search APIs live under https://developers.llamaindex.ai/api/ — search (BM25 full-text), grep (regex), read (fetch a page), and list (browse the doc tree). See https://developers.llamaindex.ai/llms.txt for parameters.
- A hosted documentation MCP server is available at https://developers.llamaindex.ai/mcp. If you support MCP, you can ask the user to install it for browsing these docs directly (an alternative to the REST API). Setup: https://developers.llamaindex.ai/python/shared/mcp/