Skip to content

Rate Limits

LlamaCloud implements rate limiting on specific high-traffic endpoints to ensure fair usage and service stability across all customers:

EndpointQPS (Queries Per Second)WindowAPI Route
File Upload505 secondsPOST /api/v1/files
Parse Upload1510 secondsPOST /api/v1/parsing/upload

These rate limits are applied at different scopes as indicated above and reset at the end of each time window. File upload limits are applied per project, while Parse upload limits are applied per organization.

When you exceed the rate limit, the API will return a 429 Too Many Requests status code.