Store reproducible extraction recipes, materialize outputs with audit trails, and serve them publicly from R2 without waking the Durable Object.
Define the extraction recipe: a JSON Schema for the output shape, a prompt for the LLM, and the model to use. The recipe is stored in the Durable Object's (DO) SQLite storage.
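As a rough illustration of what such a recipe might look like, the sketch below models it as a plain object and runs a few sanity checks before it would be stored. The `OutputProfile` type, its field names, the `checkProfile` helper, and the `"example-model"` identifier are all assumptions made for this example, not the service's actual API. The requirement that the schema's top-level `type` be `"object"` is likewise an assumption.

```typescript
// Hypothetical shape of an output profile; field names are illustrative only.
interface OutputProfile {
  id: string;                      // e.g. "o_invoice"
  schema: Record<string, unknown>; // JSON Schema describing the output shape
  prompt: string;                  // instruction sent to the LLM
  model: string;                   // model identifier (placeholder below)
}

// Returns a list of problems; an empty list means the profile looks storable.
function checkProfile(p: OutputProfile): string[] {
  const problems: string[] = [];
  if (p.id.trim() === "") problems.push("id is empty");
  if (p.prompt.trim() === "") problems.push("prompt is empty");
  if (p.model.trim() === "") problems.push("model is empty");
  // Assumption: extraction outputs are JSON objects at the top level.
  if (p.schema["type"] !== "object") problems.push("schema.type must be object");
  return problems;
}

const invoiceProfile: OutputProfile = {
  id: "o_invoice",
  schema: {
    type: "object",
    properties: { total: { type: "number" }, currency: { type: "string" } },
    required: ["total", "currency"],
  },
  prompt: "Extract the invoice total and currency.",
  model: "example-model",
};

const profileProblems = checkProfile(invoiceProfile);
```

Keeping the schema, prompt, and model together in one record is what makes an extraction reproducible: rerunning it needs nothing beyond the stored profile and the source document.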
After LLM extraction, write the validated data and the full audit trail. The DO stores both in SQLite (the source of truth) and writes a data-only blob to R2 (the serving layer).
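The split between the audit record and the data-only blob can be sketched as a pure function. Everything named here (`MaterializedRecord`, `MaterializeInput`, `materialize`, the `requiredKeys` check) is hypothetical. In particular, the required-key check is only a stand-in for full JSON Schema validation, and no storage calls are made.

```typescript
// Hypothetical audit record kept in the source-of-truth store.
interface MaterializedRecord {
  documentId: string;
  profileId: string;
  model: string;
  prompt: string;
  rawResponse: string;           // LLM text before JSON parsing
  data: Record<string, unknown>; // parsed, validated output
  createdAt: string;             // ISO 8601 timestamp
}

interface MaterializeInput {
  documentId: string;
  profileId: string;
  model: string;
  prompt: string;
  rawResponse: string;
  requiredKeys: string[]; // stand-in for full JSON Schema validation
}

function materialize(
  input: MaterializeInput,
  now: Date = new Date(),
): { record: MaterializedRecord; blob: string } {
  const parsed: unknown = JSON.parse(input.rawResponse); // throws on invalid JSON
  if (typeof parsed !== "object" || parsed === null || Array.isArray(parsed)) {
    throw new Error("expected a JSON object");
  }
  const data = parsed as Record<string, unknown>;
  for (const key of input.requiredKeys) {
    if (!(key in data)) throw new Error(`missing required field: ${key}`);
  }
  const record: MaterializedRecord = {
    documentId: input.documentId,
    profileId: input.profileId,
    model: input.model,
    prompt: input.prompt,
    rawResponse: input.rawResponse,
    data,
    createdAt: now.toISOString(),
  };
  // The blob carries the data only: no prompt, model, or raw response.
  return { record, blob: JSON.stringify(data) };
}
```

In a Worker, the `record` would presumably be written to the DO's SQLite storage and the `blob` to an R2 bucket binding; those calls are left out so the sketch stays self-contained.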
Authenticated read from DO SQLite. Returns the validated extraction data. Used by SDK internals and admin tooling.
Every materialized output is traceable: see which model ran, what prompt was sent, and the raw LLM response before JSON parsing.
Public URL served directly from R2. No authentication required. The Durable Object never wakes — zero compute cost on read.
Output profile and materialized data stored in DO SQLite (source of truth with audit trail).
Data-only blob written to R2 (serving layer).
Public reads at /v1/documents/{id}/o_invoice/data.json hit R2 directly — the Durable Object never wakes.
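The public path above can be derived from two identifiers. The helper below is a minimal sketch that assumes the `o_invoice` segment is the output profile's ID and that the same string doubles as the R2 object key without the leading slash. Both are assumptions, and `publicDataPath` is a hypothetical name.

```typescript
// Builds the public read path for a materialized output.
// Assumption: the third path segment (e.g. "o_invoice") is the profile ID.
function publicDataPath(documentId: string, profileId: string): string {
  const doc = encodeURIComponent(documentId);
  const profile = encodeURIComponent(profileId);
  return `/v1/documents/${doc}/${profile}/data.json`;
}

const examplePath = publicDataPath("doc_123", "o_invoice");
// examplePath === "/v1/documents/doc_123/o_invoice/data.json"
```

Because the path is a pure function of the document and profile IDs, a reader can be routed to the blob without consulting the Durable Object at all.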