Extract
List Extract Jobs
Get Extract Job
Delete Extract Job
Validate Extraction Schema
Generate Extraction Schema
ModelsExpand Collapse
type ExtractConfiguration struct{…}
Extract configuration combining parse and extract settings.
ExtractVersion stringoptional
Extract algorithm version. Use ‘latest’ for the default pipeline or a date string (e.g. ‘2026-01-08’) to pin to a specific release.
ExtractionTarget ExtractConfigurationExtractionTargetoptional
Granularity of extraction: per_doc returns one object per document, per_page returns one object per page, per_table_row returns one object per table row
ParseConfigID stringoptional
Saved parse configuration ID to control how the document is parsed before extraction
ParseTier stringoptional
Parse tier to use before extraction. Defaults to the extract tier if not specified.
type ExtractJobMetadata struct{…}
Extraction metadata.
Metadata for extracted fields including document, page, and row level info.
DocumentMetadata map[string, ExtractedFieldMetadataDocumentMetadataUnion]optional
Per-field metadata keyed by field name from your schema. Scalar fields (e.g. vendor) map to a FieldMetadataEntry with citation and confidence. Array fields (e.g. items) map to a list where each element contains per-sub-field FieldMetadataEntry objects, indexed by array position. Nested objects contain sub-field entries recursively.
type ExtractV2Job struct{…}
An extraction job.
Status string
Current job status.
PENDING— queued, not yet startedRUNNING— actively processingCOMPLETED— finished successfullyFAILED— terminated with an errorCANCELLED— cancelled by user
Extract configuration combining parse and extract settings.
ExtractVersion stringoptional
Extract algorithm version. Use ‘latest’ for the default pipeline or a date string (e.g. ‘2026-01-08’) to pin to a specific release.
ExtractionTarget ExtractConfigurationExtractionTargetoptional
Granularity of extraction: per_doc returns one object per document, per_page returns one object per page, per_table_row returns one object per table row
ParseConfigID stringoptional
Saved parse configuration ID to control how the document is parsed before extraction
ParseTier stringoptional
Parse tier to use before extraction. Defaults to the extract tier if not specified.
Extraction metadata.
Metadata for extracted fields including document, page, and row level info.
DocumentMetadata map[string, ExtractedFieldMetadataDocumentMetadataUnion]optional
Per-field metadata keyed by field name from your schema. Scalar fields (e.g. vendor) map to a FieldMetadataEntry with citation and confidence. Array fields (e.g. items) map to a list where each element contains per-sub-field FieldMetadataEntry objects, indexed by array position. Nested objects contain sub-field entries recursively.
ExtractResult ExtractV2JobExtractResultUnionoptional
type ExtractV2JobCreate struct{…}
Request to create an extraction job. Provide configuration_id or inline configuration.
Extract configuration combining parse and extract settings.
ExtractVersion stringoptional
Extract algorithm version. Use ‘latest’ for the default pipeline or a date string (e.g. ‘2026-01-08’) to pin to a specific release.
ExtractionTarget ExtractConfigurationExtractionTargetoptional
Granularity of extraction: per_doc returns one object per document, per_page returns one object per page, per_table_row returns one object per table row
ParseConfigID stringoptional
Saved parse configuration ID to control how the document is parsed before extraction
ParseTier stringoptional
Parse tier to use before extraction. Defaults to the extract tier if not specified.
WebhookConfigurations []ExtractV2JobCreateWebhookConfigurationoptional
Outbound webhook endpoints to notify on job status changes
WebhookEvents []stringoptional
Events to subscribe to (e.g. ‘parse.success’, ‘extract.error’). If null, all events are delivered.
type ExtractV2JobQueryResponse struct{…}
Paginated list of extraction jobs.
Items []ExtractV2Job
The list of items.
Status string
Current job status.
PENDING— queued, not yet startedRUNNING— actively processingCOMPLETED— finished successfullyFAILED— terminated with an errorCANCELLED— cancelled by user
Extract configuration combining parse and extract settings.
ExtractVersion stringoptional
Extract algorithm version. Use ‘latest’ for the default pipeline or a date string (e.g. ‘2026-01-08’) to pin to a specific release.
ExtractionTarget ExtractConfigurationExtractionTargetoptional
Granularity of extraction: per_doc returns one object per document, per_page returns one object per page, per_table_row returns one object per table row
ParseConfigID stringoptional
Saved parse configuration ID to control how the document is parsed before extraction
ParseTier stringoptional
Parse tier to use before extraction. Defaults to the extract tier if not specified.
Extraction metadata.
Metadata for extracted fields including document, page, and row level info.
DocumentMetadata map[string, ExtractedFieldMetadataDocumentMetadataUnion]optional
Per-field metadata keyed by field name from your schema. Scalar fields (e.g. vendor) map to a FieldMetadataEntry with citation and confidence. Array fields (e.g. items) map to a list where each element contains per-sub-field FieldMetadataEntry objects, indexed by array position. Nested objects contain sub-field entries recursively.
ExtractResult ExtractV2JobExtractResultUnionoptional
type ExtractedFieldMetadata struct{…}
Metadata for extracted fields including document, page, and row level info.
DocumentMetadata map[string, ExtractedFieldMetadataDocumentMetadataUnion]optional
Per-field metadata keyed by field name from your schema. Scalar fields (e.g. vendor) map to a FieldMetadataEntry with citation and confidence. Array fields (e.g. items) map to a list where each element contains per-sub-field FieldMetadataEntry objects, indexed by array position. Nested objects contain sub-field entries recursively.