Rate Limits
LlamaCloud implements rate limiting on specific high-traffic endpoints to ensure fair usage and service stability across all customers:
| Endpoint | QPS (Queries Per Second) | Window | API Route |
|---|---|---|---|
| File Upload | 50 | 5 seconds | POST /api/v1/files |
| Parse Upload | 15 | 10 seconds | POST /api/v1/parsing/upload |
These rate limits are applied at different scopes as indicated above and reset at the end of each time window. File upload limits are applied per project, while Parse upload limits are applied per organization.
Rate Limit Response
Section titled “Rate Limit Response”When you exceed the rate limit, the API will return a 429 Too Many Requests status code.