029-Review execution unified on (note, gate) pairs

Type: ../types/adr.md · Status: accepted

Status: accepted Date: 2026-06-12

Context

The unit of review work is one (note, gate) pair — the selector and acceptance state already reasoned on that key — but execution previously ran through two divergent paths that differed only in which axis they held constant when packing pairs into one LLM call. Each path carried its own prompt renderer, parser, footer-rewriter, and failure policy. The duplication produced concrete defects: one path re-rendered extracted reviews as synthetic single-note bundles and re-parsed them to reach the shared finalize tail; failure detection differed across paths; and parsing was all-or-nothing per call, so one missing block discarded every already-parsed review in the batch. The experimental grammar was free to change.

Decision

One pair grammar. Every output block is keyed by the full pair: === PAIR REVIEW START: {note_path} :: {gate_id} === … === PAIR REVIEW END: … === (protocol/format.py). The :: separator is rejected in note paths and gate ids at render time, as are reserved === … === sentinel lines inside embedded note or gate text.
One renderer. render_pairs_prompt (protocol/prompt.py) takes N note targets and M gate definitions, embeds each note text and each gate text exactly once, and requests one block per requested pair. Note contents are always embedded (Do not read them from disk), in both single-note and multi-note shapes; the multi-note shape adds the evaluate-independently rule. The renderer names the job-owned output file directly.
One parser. parse_pair_bundle (protocol/parser.py) raises on structural anomalies (nested, mismatched, unterminated, unexpected, duplicate, or empty blocks) because the rest of the stream is untrustworthy, and reports missing expected pairs in missing.
One finalization path. Shared finalization code owns parse, result-file writes, pair completion, acceptance, and job status. ADR 035 later removed the original partial-salvage policy: live finalization now treats missing pairs as whole-job failure and accepts no completed subset from a failed job. ADR 036 later made acceptance a current-state row rather than an event ledger.
Batching is a packing choice, not a protocol. Note-packed jobs and gate-packed jobs both use the same pair grammar, parser, and finalization path. The parent agent or harness owns fan-out; Commonplace owns deterministic job creation and finalization.
Per-job artifacts are pair-grammar slices. Each job's bundle-output.md holds the job's own pair-block stream; parsed result files are named from the job's packing strategy, and MANIFEST.json maps pairs to those files. Later storage work moved review bodies out of DB columns, so the DB now records artifact paths instead of storing raw review markdown.

ADR 031 later makes the same pair unit persistent in the SQLite schema by replacing the earlier gate-specific rows with review_pairs.

Consequences

Easier: - Protocol changes (new fields, sentinel rules, coverage checks) are made once; the parse→render→parse round trip and the twin parsers/rewriters are gone. - Pair coverage failures are represented uniformly; ADR 035 later made live finalization all-or-nothing instead of salvaging completed reviews from failed jobs. - Failure policy is uniform across packing shapes. - Mixed-axis packing (arbitrary pairs per call) is structurally supported by the renderer/parser boundary, available if a future caller needs it.

Harder / accepted costs: - Notes whose body contains a reserved === … === line can no longer be reviewed by either path (previously the single-note bundle path did not embed the note and tolerated this). Render fails loudly; an escaping scheme can be added if this bites. - The pair key repeats the shared axis in every sentinel — a small token cost paid for a parser that never infers a missing axis from prompt context. - Per-pair cost attribution from per-call telemetry remains approximate. Re-asking only missing pairs in a follow-up call is a candidate follow-up.

Relevant Notes:

review architecture — part-of: the subsystem this decision restructures
gate learning from accepted edits — see-also: adjacent undecided extension; per-gate accounting interacts with the packing axis
031-review state uses run-owned review pairs — see-also: the storage refinement that makes this protocol's pair unit the persistent row
035-review jobs finalize all-or-nothing with derived artifacts — supersedes-in-part: removes partial salvage from live finalization

Keys	Action
`?`	Open this help
`n`	Next page
`p`	Previous page
`s`	Search