Data Connectors (LlamaHub)
Concept
Section titled “Concept”A data connector (aka Reader
) ingest data from different data sources and data formats into a simple Document
representation (text and simple metadata).
!!! tip Once you’ve ingested your data, you can build an Index on top, ask questions using a Query Engine, and have a conversation using a Chat Engine.
LlamaHub
Section titled “LlamaHub”Our data connectors are offered through LlamaHub 🦙. LlamaHub is an open-source repository containing data loaders that you can easily plug and play into any LlamaIndex application.
Usage Pattern
Section titled “Usage Pattern”Get started with:
from llama_index.core import download_loader
from llama_index.readers.google import GoogleDocsReader
loader = GoogleDocsReader()documents = loader.load_data(document_ids=[...])
See the full usage pattern guide for more details.
Modules
Section titled “Modules”Some sample data connectors:
- local file directory (
SimpleDirectoryReader
). Can support parsing a wide range of file types:.pdf
,.jpg
,.png
,.docx
, etc. - Notion (
NotionPageReader
) - Google Docs (
GoogleDocsReader
) - Slack (
SlackReader
) - Discord (
DiscordReader
) - Apify Actors (
ApifyActor
). Can crawl the web, scrape webpages, extract text content, download files including.pdf
,.jpg
,.png
,.docx
, etc.
See the modules guide for more details.