Options
Configure extraction options including output format, strict mode, async processing, webhook callbacks, raw text inclusion, page ranges, and language hints.
format (string): Output format for the extracted data.
strict (boolean): When true, fields not in the schema are omitted from the response. When false, additional discovered fields may be included.
async (boolean): When true, returns a 202 with a job ID instead of blocking. Poll the job endpoint for results.
webhook_url (string): URL to POST results to when extraction completes. Implies async behavior.
include_raw_text (boolean): Include the raw extracted text alongside structured data.
page_range (string): Pages to extract from. E.g. "1-5", "1,3,7-10". PDF only.
language_hint (string): ISO 639-1 language code hint. Improves extraction for non-English documents.
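As a rough illustration of how these options might be assembled client-side, the sketch below builds an options object in Python and does some light local checking before anything is sent. The helper name `build_options`, the `run_async` keyword (used because `async` is a reserved word in Python), the `"json"` format value, the example webhook URL, and the validation rules are all assumptions for illustration; the API's accepted format values and its own validation behavior are not specified here, and how the object is attached to a request is not shown.

```python
import json
import re
from typing import Any, Optional

# Shape check for specs such as "1-5" or "1,3,7-10":
# comma-separated page numbers or start-end ranges.
_PAGE_RANGE = r"\d+(-\d+)?(,\d+(-\d+)?)*"


def build_options(
    *,
    format: Optional[str] = None,
    strict: Optional[bool] = None,
    run_async: Optional[bool] = None,  # sent as "async"
    webhook_url: Optional[str] = None,
    include_raw_text: Optional[bool] = None,
    page_range: Optional[str] = None,
    language_hint: Optional[str] = None,
) -> dict[str, Any]:
    """Assemble an options object, leaving unset fields out entirely."""
    if page_range is not None and not re.fullmatch(_PAGE_RANGE, page_range):
        raise ValueError(f"malformed page_range: {page_range!r}")
    # ISO 639-1 codes are two lowercase letters; this checks shape only,
    # not whether the code is actually assigned.
    if language_hint is not None and not re.fullmatch(r"[a-z]{2}", language_hint):
        raise ValueError(f"language_hint is not a two-letter code: {language_hint!r}")

    candidates = {
        "format": format,
        "strict": strict,
        "async": run_async,
        "webhook_url": webhook_url,
        "include_raw_text": include_raw_text,
        "page_range": page_range,
        "language_hint": language_hint,
    }
    # Drop anything the caller did not set so server defaults apply.
    return {key: value for key, value in candidates.items() if value is not None}


if __name__ == "__main__":
    options = build_options(
        format="json",  # assumed value; accepted formats are not listed above
        strict=True,
        webhook_url="https://example.com/hooks/extraction",  # placeholder URL
        page_range="1,3,7-10",
        language_hint="de",
    )
    print(json.dumps(options, indent=2))
```

Because webhook_url is described as implying async behavior, the usage above does not set async explicitly. Omitting unset keys, rather than sending null values, is a design choice of this sketch and not something the API is known to require.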