Getting Started
Install LiteParse and parse your first document in under a minute.
Installation
Install LiteParse globally via npm to use the lit command anywhere:

    npm i -g @llamaindex/liteparse

On macOS and Linux, LiteParse can also be installed via Homebrew:

    brew tap run-llama/liteparse
    brew install llamaindex-liteparse

Quick start
Once installed, you can start parsing from the command line:

    # Parse a PDF and print text to stdout
    lit parse document.pdf

    # Save output to a file
    lit parse document.pdf -o output.txt

    # Get structured JSON with bounding boxes
    lit parse document.pdf --format json -o output.json

    # Parse only specific pages
    lit parse document.pdf --target-pages "1-5,10,15-20"

Batch parsing
Parse an entire directory of documents at once:

    lit batch-parse ./pdfs ./outputs

Screenshots
Generate page images from a PDF for LLM agents or visual workflows:

    lit screenshot document.pdf -o ./screenshots

Next steps
- Library usage: Use LiteParse programmatically from TypeScript or Python.
- OCR configuration: Configure Tesseract, use an external OCR server, or bring your own.
- Multi-format support: Parse DOCX, XLSX, PPTX, images, and more.
- Agent skill: Add LiteParse as a skill for coding agents.
- CLI reference: Complete command and option reference.
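
Until you reach the library guide, the quick-start commands above can also be scripted by shelling out to the CLI. The following is a minimal sketch, assuming lit is installed and on your PATH. It uses only the flags shown in this guide (-o, --format, --target-pages). The helper names build_parse_command and run_parse are hypothetical and are not part of LiteParse; the actual programmatic API is covered under Library usage.

```python
import shutil
import subprocess
from typing import List, Optional


def build_parse_command(
    pdf_path: str,
    output_path: Optional[str] = None,
    output_format: Optional[str] = None,
    target_pages: Optional[str] = None,
) -> List[str]:
    """Build the argument list for `lit parse` (hypothetical helper).

    Only flags that appear in this guide are used.
    """
    cmd = ["lit", "parse", pdf_path]
    if output_format is not None:
        cmd += ["--format", output_format]
    if target_pages is not None:
        cmd += ["--target-pages", target_pages]
    if output_path is not None:
        cmd += ["-o", output_path]
    return cmd


def run_parse(pdf_path: str, **options: Optional[str]) -> str:
    """Run `lit parse` and return whatever it writes to stdout."""
    if shutil.which("lit") is None:
        raise RuntimeError("`lit` not found on PATH; install LiteParse first")
    result = subprocess.run(
        build_parse_command(pdf_path, **options),
        check=True,           # raise CalledProcessError on a non-zero exit
        capture_output=True,  # collect stdout/stderr instead of printing them
        text=True,            # decode output as str rather than bytes
    )
    return result.stdout


if __name__ == "__main__":
    # Mirrors: lit parse document.pdf --format json -o output.json
    print(build_parse_command("document.pdf",
                              output_path="output.json",
                              output_format="json"))
```

Keeping command construction separate from execution means the argument list can be inspected or logged without invoking the CLI. Passing a list to subprocess.run, rather than a shell string, avoids quoting problems with file names that contain spaces.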