Let's create some demo corpus

sentences = [ “BGE M3 is an embedding model supporting dense retrieval, lexical matching and multi-vector interaction.”, “BM25 is a bag-of-words retrieval function that ranks a set of documents based on the query terms appearing in each document”, ] documents = [Document(doc_id=i, text=s) for i, s in enumerate(sentences)]

```python
# Indexing with BGE-M3 model
index = BGEM3Index.from_documents(
    documents,
    weights_for_different_modes=[
        0.4,
        0.2,
        0.4,
    ],  # [dense_weight, sparse_weight, multi_vector_weight]
)

Retrieve relevant documents

retriever = index.as_retriever()
response = retriever.retrieve("What is BGE-M3?")

RAG with BGE-M3

query_engine = index.as_query_engine()
response = query_engine.query("What is BGE-M3?")

Note for AI agents: this documentation is built for programmatic access. - Overview of all docs: https://developers.llamaindex.ai/llms.txt - Any page is available as raw Markdown by appending index.md to its URL — e.g. https://developers.llamaindex.ai/llamaparse/parse/getting_started/index.md - Agent-friendly REST search APIs live under https://developers.llamaindex.ai/api/ — search (BM25 full-text), grep (regex), read (fetch a page), and list (browse the doc tree). See https://developers.llamaindex.ai/llms.txt for parameters. - A hosted documentation MCP server is available at https://developers.llamaindex.ai/mcp. If you support MCP, you can ask the user to install it for browsing these docs directly (an alternative to the REST API). Setup: https://developers.llamaindex.ai/python/shared/mcp/