Quick Start
Three modes, one API. Auto-detect extraction, schema-driven extraction, and document filtering — all with per-cell confidence and cost transparency.
Three modes. One API. Auto-detect what's in the document. Send your own schema and get exactly that shape. Or skip the document entirely and query data you already ingested. Plus per-cell provenance, confidence scores, and cost transparency on every call.
Prerequisites
Set your API key
Mode 1 — Auto-detect extract
Send a document with no schema. Talonic discovers every field automatically.
curl — auto-detect all fields
Returns every field the AI discovers — vendor, dates, amounts, line items, addresses — with per-field confidence scores. Use this when you don't know the document structure upfront.
Mode 2 — Schema-driven extract
Send a document AND the shape you want. Get exactly those fields back.
curl — extract with inline schema
The response contains exactly the four fields you asked for — nothing more. Save the schema with POST /v1/schemas for reuse across future extractions.
Mode 3 — Query ingested data
Don't send a document. Query data you already extracted — across all documents in your workspace.
curl — filter previously extracted documents
Returns all documents matching your filter — no re-extraction, no AI cost. Ingest once, query forever.
Cost on every call
Every synchronous extraction response includes cost headers so you can track spend per call:
Cost headers
Fields resolved from the registry (X-Talonic-Cells-Resolved-Registry) cost nothing. Only AI-resolved fields consume credits.
Example response
A synchronous extraction returns structured data with confidence scores:
Response (200 OK)