Quick Start

Three modes, one API. Auto-detect extraction, schema-driven extraction, and document filtering — all with per-cell confidence and cost transparency.

Three modes. One API. Auto-detect what's in the document. Send your own schema and get exactly that shape. Or skip the document entirely and query data you already ingested. Plus per-cell provenance, confidence scores, and cost transparency on every call.

Prerequisites

Set your API key

Mode 1 — Auto-detect extract

Send a document with no schema. Talonic discovers every field automatically.

curl — auto-detect all fields

Returns every field the AI discovers — vendor, dates, amounts, line items, addresses — with per-field confidence scores. Use this when you don't know the document structure upfront.

Mode 2 — Schema-driven extract

Send a document AND the shape you want. Get exactly those fields back.

curl — extract with inline schema

The response contains exactly the four fields you asked for — nothing more. Save the schema with POST /v1/schemas for reuse across future extractions.

Mode 3 — Query ingested data

Don't send a document. Query data you already extracted — across all documents in your workspace.

curl — filter previously extracted documents

Returns all documents matching your filter — no re-extraction, no AI cost. Ingest once, query forever.

Cost on every call

Every synchronous extraction response includes cost headers so you can track spend per call:

Cost headers

Fields resolved from the registry (X-Talonic-Cells-Resolved-Registry) cost nothing. Only AI-resolved fields consume credits.

Example response

A synchronous extraction returns structured data with confidence scores:

Response (200 OK)

Next steps