Large Language Models

LlamaIndex Framework

Open Source Community

FAQ

FAQ

How to use a custom/local embedding model?
How to use a local hugging face embedding model?
How can I customize my prompt
Is it required to fine-tune my model?
I want to the LLM answer in Chinese/Italian/French but only answers in English, how to proceed?
Is LlamaIndex GPU accelerated?

1. How to define a custom LLM?

You can access Usage Custom to define a custom LLM.

2. How to use a different OpenAI model?

To use a different OpenAI model you can access Configure Model to set your own custom model.

3. How can I customize my prompt?

You can access Prompts to learn how to customize your prompts.

4. Is it required to fine-tune my model?

No. there’s isolated modules which might provide better results, but isn’t required, you can use llamaindex without needing to fine-tune the model.

5. I want to the LLM answer in Chinese/Italian/French but only answers in English, how to proceed?

To the LLM answer in another language more accurate you can update the prompts to enforce more the output language.

response = query_engine.query("Rest of your query... \nRespond in Italian")

Alternatively:

from llama_index.core import Settings
from llama_index.llms.openai import OpenAI

llm = OpenAI(system_prompt="Always respond in Italian.")

# set a global llm
Settings.llm = llm

query_engine = load_index_from_storage(
    storage_context,
).as_query_engine()

6. Is LlamaIndex GPU accelerated?

Yes, you can run a language model (LLM) on a GPU when running it locally. You can find an example of setting up LLMs with GPU support in the llama2 setup documentation.

Note for AI agents: this documentation is built for programmatic access. - Overview of all docs: https://developers.llamaindex.ai/llms.txt - Any page is available as raw Markdown by appending index.md to its URL — e.g. https://developers.llamaindex.ai/llamaparse/parse/getting_started/index.md - Agent-friendly REST search APIs live under https://developers.llamaindex.ai/api/ — search (BM25 full-text), grep (regex), read (fetch a page), and list (browse the doc tree). See https://developers.llamaindex.ai/llms.txt for parameters. - A hosted documentation MCP server is available at https://developers.llamaindex.ai/mcp. If you support MCP, you can ask the user to install it for browsing these docs directly (an alternative to the REST API). Setup: https://developers.llamaindex.ai/python/shared/mcp/