Skip to content

Getting Started

Install LiteParse and parse your first document in under a minute.

LiteParse is available for Node.js, Python, and as a standalone Rust binary.

All versions (except WASM) ship the same CLI and core library capabilities.

Terminal window
npm i -g @llamaindex/liteparse

Once installed, parse from the command line:

Terminal window
# Parse a PDF and print text to stdout
lit parse document.pdf
# Save output to a file
lit parse document.pdf -o output.txt
# Get structured JSON with bounding boxes
lit parse document.pdf --format json -o output.json
# Parse only specific pages
lit parse document.pdf --target-pages "1-5,10,15-20"

Parse an entire directory of documents at once:

Terminal window
lit batch-parse ./pdfs ./outputs

Generate page images from a PDF for LLM agents or visual workflows:

Terminal window
lit screenshot document.pdf -o ./screenshots