List Extract Jobs

client.Extract.List(ctx, query) (*PaginatedCursor[ExtractV2Job], error)

GET/api/v2/extract

List extraction jobs with optional filtering and pagination.

Filter by configuration_id, status, file_input, or creation date range. Results are returned newest-first. Use expand=configuration to include the full configuration used, and expand=extract_metadata for per-field metadata.

ParametersExpand Collapse

query ExtractListParams

ConfigurationID param.Field[string]Optional

Filter by configuration ID

CreatedAtOnOrAfter param.Field[Time]Optional

Include items created at or after this timestamp (inclusive)

formatdate-time

CreatedAtOnOrBefore param.Field[Time]Optional

Include items created at or before this timestamp (inclusive)

formatdate-time

DocumentInputType param.Field[string]Optional

Filter by document input type (file_id or parse_job_id)

DeprecatedDocumentInputValue param.Field[string]Optional

Deprecated: use file_input instead

Expand param.Field[[]string]Optional

Additional fields to include: configuration, extract_metadata

FileInput param.Field[string]Optional

Filter by file input value

JobIDs param.Field[[]string]Optional

Filter by specific job IDs

OrganizationID param.Field[string]Optional

PageSize param.Field[int64]Optional

Number of items per page

PageToken param.Field[string]Optional

Token for pagination

ProjectID param.Field[string]Optional

Status param.Field[ExtractListParamsStatus]Optional

Filter by status

const ExtractListParamsStatusPending ExtractListParamsStatus = "PENDING"

const ExtractListParamsStatusThrottled ExtractListParamsStatus = "THROTTLED"

const ExtractListParamsStatusRunning ExtractListParamsStatus = "RUNNING"

const ExtractListParamsStatusCompleted ExtractListParamsStatus = "COMPLETED"

const ExtractListParamsStatusFailed ExtractListParamsStatus = "FAILED"

const ExtractListParamsStatusCancelled ExtractListParamsStatus = "CANCELLED"

ReturnsExpand Collapse

type ExtractV2Job struct{…}

An extraction job.

ID string

Unique job identifier (job_id)

CreatedAt Time

Creation timestamp

formatdate-time

FileInput string

File ID or parse job ID that was extracted

ProjectID string

Project this job belongs to

Status string

Current job status.

PENDING — queued, not yet started
RUNNING — actively processing
COMPLETED — finished successfully
FAILED — terminated with an error
CANCELLED — cancelled by user

UpdatedAt Time

Last update timestamp

formatdate-time

Configuration ExtractConfigurationOptional

Extract configuration combining parse and extract settings.

DataSchema map[string, *ExtractConfigurationDataSchemaUnion]

JSON Schema defining the fields to extract. Validate with the /schema/validate endpoint first.

One of the following:

type ExtractConfigurationDataSchemaMap map[string, any]

type ExtractConfigurationDataSchemaArray []any

string

float64

bool

CiteSources boolOptional

Include citations in results

ConfidenceScores boolOptional

Include confidence scores in results

ExtractionTarget ExtractConfigurationExtractionTargetOptional

Granularity of extraction: per_doc returns one object per document, per_page returns one object per page, per_table_row returns one object per table row

One of the following:

const ExtractConfigurationExtractionTargetPerDoc ExtractConfigurationExtractionTarget = "per_doc"

const ExtractConfigurationExtractionTargetPerPage ExtractConfigurationExtractionTarget = "per_page"

const ExtractConfigurationExtractionTargetPerTableRow ExtractConfigurationExtractionTarget = "per_table_row"

MaxPages int64Optional

Maximum number of pages to process. Omit for no limit.

minimum1

ParseConfigID stringOptional

Saved parse configuration ID to control how the document is parsed before extraction

ParseTier stringOptional

Parse tier to use before extraction. Defaults to the extract tier if not specified.

SystemPrompt stringOptional

Custom system prompt to guide extraction behavior

TargetPages stringOptional

Comma-separated page numbers or ranges to process (1-based). Omit to process all pages.

Tier ExtractConfigurationTierOptional

Extract tier: cost_effective (5 credits/page), agentic (15 credits/page), or agentic_plus (50 credits/page)

One of the following:

const ExtractConfigurationTierCostEffective ExtractConfigurationTier = "cost_effective"

const ExtractConfigurationTierAgentic ExtractConfigurationTier = "agentic"

const ExtractConfigurationTierAgenticPlus ExtractConfigurationTier = "agentic_plus"

Version stringOptional

Use ‘latest’ for the latest release for the selected tier or a date string (YYYY-MM-DD format) to pin to the nearest release at or before that date. Job responses always report the concrete resolved version the job runs, fixed at job creation; saved configurations keep the value as provided.

ConfigurationID stringOptional

Saved extract configuration ID used for this job, if any

ErrorMessage stringOptional

Error details when status is FAILED

ExtractMetadata ExtractJobMetadataOptional

Extraction metadata.

FieldMetadata ExtractedFieldMetadataOptional

Metadata for extracted fields including document, page, and row level info.

DocumentMetadata map[string, *ExtractedFieldMetadataDocumentMetadataUnion]Optional

Per-field metadata keyed by field name from your schema. Scalar fields (e.g. vendor) map to a FieldMetadataEntry with citation and confidence. Array fields (e.g. items) map to a list where each element contains per-sub-field FieldMetadataEntry objects, indexed by array position. Nested objects contain sub-field entries recursively.

One of the following:

type ExtractedFieldMetadataDocumentMetadataMap map[string, any]

type ExtractedFieldMetadataDocumentMetadataArray []any

string

float64

bool

PageMetadata []map[string, *ExtractedFieldMetadataPageMetadataUnion]Optional

Per-page metadata when extraction_target is per_page

One of the following:

type ExtractedFieldMetadataPageMetadataMap map[string, any]

type ExtractedFieldMetadataPageMetadataArray []any

string

float64

bool

RowMetadata []map[string, *ExtractedFieldMetadataRowMetadataUnion]Optional

Per-row metadata when extraction_target is per_table_row

One of the following:

type ExtractedFieldMetadataRowMetadataMap map[string, any]

type ExtractedFieldMetadataRowMetadataArray []any

string

float64

bool

ParseJobID stringOptional

Reference to the ParseJob ID used for parsing

ParseTier stringOptional

Parse tier used for parsing the document

ExtractResult ExtractV2JobExtractResultUnionOptional

Extracted data conforming to the data_schema. Returns a single object for per_doc, or an array for per_page / per_table_row.

One of the following:

type ExtractV2JobExtractResultMap map[string, ExtractV2JobExtractResultMapItemUnion]

One of the following:

type ExtractV2JobExtractResultMapItemMap map[string, any]

type ExtractV2JobExtractResultMapItemArray []any

string

float64

bool

type ExtractV2JobExtractResultArray []map[string, *ExtractV2JobExtractResultArrayItemUnion]

One of the following:

type ExtractV2JobExtractResultArrayItemMap map[string, any]

type ExtractV2JobExtractResultArrayItemArray []any

string

float64

bool

Metadata ExtractV2JobMetadataOptional

Job-level metadata.

Usage ExtractJobUsageOptional

Extraction usage metrics.

NumPagesBilled int64Optional

Number of effective pages billed

NumPagesExtracted int64Optional

Number of pages extracted

List Extract Jobs

package main

import (
  "context"
  "fmt"

  "github.com/run-llama/llama-parse-go"
  "github.com/run-llama/llama-parse-go/option"
)

func main() {
  client := llamacloudprod.NewClient(
    option.WithAPIKey("My API Key"),
  )
  page, err := client.Extract.List(context.TODO(), llamacloudprod.ExtractListParams{

  })
  if err != nil {
    panic(err.Error())
  }
  fmt.Printf("%+v\n", page)
}

{
  "items": [
    {
      "id": "ext-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee",
      "created_at": "2019-12-27T18:11:19.117Z",
      "file_input": "dfl-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee",
      "project_id": "prj-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee",
      "status": "COMPLETED",
      "updated_at": "2019-12-27T18:11:19.117Z",
      "configuration": {
        "data_schema": {
          "foo": {
            "foo": "bar"
          }
        },
        "cite_sources": true,
        "confidence_scores": true,
        "extraction_target": "per_doc",
        "max_pages": 10,
        "parse_config_id": "cfg-11111111-2222-3333-4444-555555555555",
        "parse_tier": "fast",
        "system_prompt": "Extract all monetary values in USD. If a currency is not specified, assume USD.",
        "target_pages": "1,3,5-7",
        "tier": "cost_effective",
        "version": "latest"
      },
      "configuration_id": "cfg-11111111-2222-3333-4444-555555555555",
      "error_message": "error_message",
      "extract_metadata": {
        "field_metadata": {
          "document_metadata": {
            "items": [
              {
                "amount": {
                  "citation": [
                    {
                      "matching_text": "$10.00",
                      "page": 1
                    }
                  ],
                  "confidence": 1
                },
                "description": {
                  "citation": [
                    {
                      "matching_text": "$10/month",
                      "page": 1
                    }
                  ],
                  "confidence": 0.998
                }
              }
            ],
            "total": {
              "citation": "bar",
              "confidence": "bar"
            },
            "vendor": {
              "citation": "bar",
              "confidence": "bar",
              "extraction_confidence": "bar",
              "parsing_confidence": "bar"
            }
          },
          "page_metadata": [
            {
              "foo": {
                "foo": "bar"
              }
            }
          ],
          "row_metadata": [
            {
              "foo": {
                "foo": "bar"
              }
            }
          ]
        },
        "parse_job_id": "parse_job_id",
        "parse_tier": "parse_tier"
      },
      "extract_result": {
        "foo": {
          "foo": "bar"
        }
      },
      "metadata": {
        "usage": {
          "num_pages_billed": 0,
          "num_pages_extracted": 0
        }
      }
    }
  ],
  "next_page_token": "next_page_token",
  "total_size": 0
}

Returns Examples

{
  "items": [
    {
      "id": "ext-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee",
      "created_at": "2019-12-27T18:11:19.117Z",
      "file_input": "dfl-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee",
      "project_id": "prj-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee",
      "status": "COMPLETED",
      "updated_at": "2019-12-27T18:11:19.117Z",
      "configuration": {
        "data_schema": {
          "foo": {
            "foo": "bar"
          }
        },
        "cite_sources": true,
        "confidence_scores": true,
        "extraction_target": "per_doc",
        "max_pages": 10,
        "parse_config_id": "cfg-11111111-2222-3333-4444-555555555555",
        "parse_tier": "fast",
        "system_prompt": "Extract all monetary values in USD. If a currency is not specified, assume USD.",
        "target_pages": "1,3,5-7",
        "tier": "cost_effective",
        "version": "latest"
      },
      "configuration_id": "cfg-11111111-2222-3333-4444-555555555555",
      "error_message": "error_message",
      "extract_metadata": {
        "field_metadata": {
          "document_metadata": {
            "items": [
              {
                "amount": {
                  "citation": [
                    {
                      "matching_text": "$10.00",
                      "page": 1
                    }
                  ],
                  "confidence": 1
                },
                "description": {
                  "citation": [
                    {
                      "matching_text": "$10/month",
                      "page": 1
                    }
                  ],
                  "confidence": 0.998
                }
              }
            ],
            "total": {
              "citation": "bar",
              "confidence": "bar"
            },
            "vendor": {
              "citation": "bar",
              "confidence": "bar",
              "extraction_confidence": "bar",
              "parsing_confidence": "bar"
            }
          },
          "page_metadata": [
            {
              "foo": {
                "foo": "bar"
              }
            }
          ],
          "row_metadata": [
            {
              "foo": {
                "foo": "bar"
              }
            }
          ]
        },
        "parse_job_id": "parse_job_id",
        "parse_tier": "parse_tier"
      },
      "extract_result": {
        "foo": {
          "foo": "bar"
        }
      },
      "metadata": {
        "usage": {
          "num_pages_billed": 0,
          "num_pages_extracted": 0
        }
      }
    }
  ],
  "next_page_token": "next_page_token",
  "total_size": 0
}

Note for AI agents: this documentation is built for programmatic access. - Overview of all docs: https://developers.llamaindex.ai/llms.txt - Any page is available as raw Markdown by appending index.md to its URL — e.g. https://developers.llamaindex.ai/llamaparse/parse/getting_started/index.md - Agent-friendly REST search APIs live under https://developers.llamaindex.ai/api/ — search (BM25 full-text), grep (regex), read (fetch a page), and list (browse the doc tree). See https://developers.llamaindex.ai/llms.txt for parameters. - A hosted documentation MCP server is available at https://developers.llamaindex.ai/mcp. If you support MCP, you can ask the user to install it for browsing these docs directly (an alternative to the REST API). Setup: https://developers.llamaindex.ai/python/shared/mcp/