Extract Printed Page Numbers

The `extract_printed_page_number` option controls whether page numbers printed on the document pages are extracted and included in the parsed content.

When enabled, the parser identifies page numbers that are visually printed on the pages (such as “Page 1 of 10” or “- 5 -”) and makes them available in the parsed output.

Set to true to enable extraction of printed page numbers from document pages.

{
  "output_options": {
    "extract_printed_page_number": true
  }
}

Set to false to skip extraction of printed page numbers.

{
  "output_options": {
    "extract_printed_page_number": false
  }
}

Use cases:

  • Document Indexing: Preserve original page numbering for reference
  • Academic Papers: Maintain citation-friendly page numbers
  • Legal Documents: Preserve official page numbering
  • Report Processing: Keep original pagination for cross-references

Example request:

curl -X 'POST' \
  'https://api.cloud.llamaindex.ai/api/v2/parse' \
  -H 'Accept: application/json' \
  -H 'Content-Type: application/json' \
  -H "Authorization: Bearer $LLAMA_CLOUD_API_KEY" \
  --data '{
    "file_id": "<file_id>",
    "tier": "agentic",
    "version": "latest",
    "output_options": {
      "extract_printed_page_number": true
    }
  }'
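
For callers who prefer a script to the command line, the same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the endpoint, headers, and payload fields are copied from the curl example above, while the helper names `build_parse_request` and `submit` are hypothetical. The response is assumed to be JSON because the request sends `Accept: application/json`; its fields are not described here, so the sketch returns it unmodified.

```python
import json
import urllib.request

# Endpoint taken from the curl example above.
PARSE_URL = "https://api.cloud.llamaindex.ai/api/v2/parse"


def build_parse_request(
    file_id: str,
    api_key: str,
    extract_printed_page_number: bool = True,
) -> urllib.request.Request:
    """Build (but do not send) the POST request from the curl example.

    Hypothetical helper: only the URL, headers, and payload keys come
    from the documentation; the function itself is illustrative.
    """
    payload = {
        "file_id": file_id,
        "tier": "agentic",
        "version": "latest",
        "output_options": {
            # Toggle printed page number extraction on or off.
            "extract_printed_page_number": extract_printed_page_number,
        },
    }
    return urllib.request.Request(
        PARSE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Accept": "application/json",
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def submit(request: urllib.request.Request):
    """Send the request and return the decoded body.

    Assumes the service answers with JSON; the response schema is not
    covered by this page, so nothing is read out of it here.
    """
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

A typical call would read the key from the `LLAMA_CLOUD_API_KEY` environment variable, pass it to `build_parse_request` together with a real file ID, and hand the result to `submit`. Keeping request construction separate from sending makes it easy to inspect the payload before anything goes over the network.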