DeepInfra
Check out available LLMs here.
Installation
Section titled “Installation”npm i llamaindex @llamaindex/deepinfraimport { DeepInfra } from "@llamaindex/deepinfra";import { Settings } from "llamaindex";
// Get the API key from `DEEPINFRA_API_TOKEN` environment variableimport { config } from "dotenv";config();Settings.llm = new DeepInfra();
// Set the API keyapiKey = "YOUR_API_KEY";Settings.llm = new DeepInfra({ apiKey });You can setup the apiKey on the environment variables, like:
export DEEPINFRA_API_TOKEN="<YOUR_API_KEY>"Load and index documents
Section titled “Load and index documents”For this example, we will use a single document. In a real-world scenario, you would have multiple documents to index.
import { Document, VectorStoreIndex } from "llamaindex";
const document = new Document({ text: essay, id_: "essay" });
const index = await VectorStoreIndex.fromDocuments([document]);const queryEngine = index.asQueryEngine();
const query = "What is the meaning of life?";
const results = await queryEngine.query({ query,});Full Example
Section titled “Full Example”import { DeepInfra } from "@llamaindex/deepinfra";import { Document, VectorStoreIndex, Settings } from "llamaindex";
// Use custom LLMconst model = "meta-llama/Meta-Llama-3-8B-Instruct";Settings.llm = new DeepInfra({ model, temperature: 0 });
async function main() { const document = new Document({ text: essay, id_: "essay" });
// Load and index documents const index = await VectorStoreIndex.fromDocuments([document]);
// get retriever const retriever = index.asRetriever();
// Create a query engine const queryEngine = index.asQueryEngine({ retriever, });
const query = "What is the meaning of life?";
// Query const response = await queryEngine.query({ query, });
// Log the response console.log(response.response);}Feedback
Section titled “Feedback”If you have any feedback, please reach out to us at feedback@deepinfra.com
API Reference
Section titled “API Reference”Note for AI agents: this documentation is built for programmatic access.
- Overview of all docs: https://developers.llamaindex.ai/llms.txt
- Any page is available as raw Markdown by appending index.md to its URL — e.g. https://developers.llamaindex.ai/llamaparse/parse/getting_started/index.md
- Agent-friendly REST search APIs live under https://developers.llamaindex.ai/api/ — search (BM25 full-text), grep (regex), read (fetch a page), and list (browse the doc tree). See https://developers.llamaindex.ai/llms.txt for parameters.
- A hosted documentation MCP server is available at https://developers.llamaindex.ai/mcp. If you support MCP, you can ask the user to install it for browsing these docs directly (an alternative to the REST API). Setup: https://developers.llamaindex.ai/python/shared/mcp/