Methodology enforcement is constraining

Type: note · Status: seedling · Tags: learning-theory

The ways we enforce methodology in the KB — instructions, skills, hooks, scripts — map directly onto the constraining spectrum. The enforcement layers parallel the codification verifiability gradient — where codification moves code from prompt tweaks through schemas to deterministic modules, methodology enforcement moves practices from written guidance through structured skills to automated scripts. Each layer trades flexibility for reliability by reducing two things: semantic underspecification (committing to one interpretation of what the practice means) and execution indeterminism (ensuring the practice fires consistently across runs). Moving from instructions to scripts progressively eliminates both.

Layer	Trigger	Response	Reliability	Example
Ad hoc prompt	indeterministic (caller writes one)	underspecified + indeterministic (LLM interprets)	lowest	"read these three docs through this lens" in a one-off instructions note
Instruction	indeterministic (LLM remembers)	underspecified + indeterministic (LLM interprets)	low	"check descriptions" in CLAUDE.md
Skill	deterministic (user invokes)	underspecified + indeterministic (LLM executes)	medium	`/validate` checks note quality
Hook (warn)	deterministic (event fires)	underspecified + indeterministic (LLM acts on output)	medium-high	validate-note.sh outputs WARN on missing description
Hook (block)	deterministic (event fires)	deterministic (rejected)	high	exit 1 prevents the operation
Script	deterministic (user/hook runs)	deterministic (code runs)	highest	sync_topic_links.py rewrites Topics footer

Ad hoc prompts are looser than persistent instructions — they're one-shot, not loaded into every session, and exist only for a single use. They sit below instructions on the gradient because they add a third source of unreliability: the prompt itself is ephemeral, so it can't even accumulate the weak consistency that comes from an instruction being present every time.

Instructions have the lowest persistent reliability because both phenomena compound: the LLM may not remember to apply the practice (indeterminism in triggering), and when it does, it interprets the instruction through underspecified semantics ("check descriptions" admits multiple valid readings of what counts as a good description). Skills eliminate the trigger problem — the user invokes them deterministically — but the response is still an LLM interpreting an underspecified spec. Blocking hooks and scripts eliminate both phenomena entirely.

The key insight: hooks are not cleanly "deterministic." A hook that outputs a warning is a deterministic trigger with an underspecified, indeterministic response — the LLM decides what to do with the warning. Only blocking hooks (exit non-zero) are fully deterministic. This means the three-tier model (instruction → skill → hook) that arscontexta uses oversimplifies — the real picture is a gradient, which is just constraining.

Maturation trajectory

This is progressive compilation applied to methodology — new best practices should start as underspecified natural-language guidance and constrain toward precise, deterministic enforcement as they prove out:

Instruction — write it in CLAUDE.md or WRITING.md. Cheap to revise, tests whether the practice is worth encoding. If the LLM follows it inconsistently, that's signal.
Skill — encode it as a structured prompt. Reliable when invoked, but requires explicit invocation. Good for judgment-requiring operations that shouldn't be automated.
Hook/script — automate the deterministic parts. Only after the practice has constrained enough that you know exactly what the check should do.

When to move down. The strongest signal for automation is when the agent consistently proposes the same correct next step — meaning both that the LLM has converged on a single interpretation of the underspecified spec, and that it executes it reliably across runs. If the LLM's response is predictable and always right, the prompt-to-action path is just overhead; a hook or script would do the same thing without the latency or token cost. This is the codification trigger: a pattern has emerged from repeated execution, and constraining it commits to that interpretation in precise code — resolving the semantic underspecification by design rather than by luck, and eliminating the indeterminism entirely.

Not everything should complete the trajectory. Operations requiring semantic judgment (like "is this connection genuine?") belong permanently at the skill level — their oracle strength is too low to support deterministic verification. Attempting to automate judgment produces confident systematic errors — the over-automation risk. ADR-001 is a clean example of the trajectory completing: an LLM-generated Topics footer was recognised as fully mechanical, and the operation moved to a deterministic script.

The trajectory requires active observation. The context engineering study found that 50% of AGENTS.md files were never changed after creation — write-once artifacts that never enter the maturation trajectory at all. The codification trigger above (observing that the agent consistently proposes the same correct step) only fires if someone is watching. Among the files that do evolve, additions (78 commits) and modifications (59) vastly outnumber removals (23) and section deletions (2) — pruning is a discipline, not an emergent behavior. Instructions accumulate unless someone actively removes them.

The maturation trajectory parallels document type maturation — just as documents start as untyped note and gain type information as they codify, practices start as written guidance and gain enforcement structure as they prove out. Both are gradual typing applied to different substrates: types accumulate verifiable structural properties; enforcement accumulates deterministic triggers and responses. The loading frequency hierarchy mirrors the same gradient from the information-delivery side — CLAUDE.md instructions, skill descriptions, skill bodies — but for loading specificity rather than enforcement reliability.

Current state

We have hooks in .claude/hooks/ but they aren't wired up ("hooks": {} in settings.json) and reference old paths. We have scripts that work (sync_topic_links.py, generate_notes_index.py). We have skills that work (validate, connect, ingest). We have instructions that work (CLAUDE.md, WRITING.md). The gradient exists — we just haven't needed to push anything further toward the deterministic end yet.

Open questions

When should a WRITING.md instruction become a validate check? Oracle strength may provide the answer: a practice is ready to move down the gradient when you can cheaply verify whether it was followed correctly. If verification requires semantic judgment, the practice stays at skill level; if it can be reduced to structural checks, it is a candidate for scripting.
Should hook warnings be treated differently from skill output? The LLM sees both as text, but the trigger mechanism differs.
Are there practices currently at skill level that should be scripts? (sync_topic_links.py was probably this — a skill-level operation that turned out to be fully deterministic.)

Relevant Notes:

codification: the missing middle — grounds: the verifiability gradient for code (prompt tweaks -> schemas -> evals -> deterministic modules) is the general pattern this note instantiates for methodology
constraining is learning — foundation: the constraining gradient for code; this note applies the same gradient to methodology
programming practices apply to prompting — synthesizes: the maturation trajectory is progressive compilation applied to methodology — flexible instructions frozen into rigid, efficient automation
001-generate-topic-links-from-frontmatter — exemplifies: a skill-level operation that completed the maturation trajectory into a deterministic script
document types should be verifiable — parallels: document type maturation (note -> traits -> promoted base type) follows the same gradual-typing pattern as methodology maturation (instruction -> skill -> hook -> script); both trade flexibility for reliability as verifiability increases
oracle strength spectrum — determines when a practice is ready to move down the enforcement gradient: cheap verification enables scripting; expensive verification keeps the practice at skill level
instruction specificity should match loading frequency — mirrors: the loading hierarchy (CLAUDE.md -> skill descriptions -> skill bodies) parallels the enforcement hierarchy, but for information specificity rather than practice reliability
error messages that teach are a constraining technique — extends: adds the inform axis orthogonal to the trigger/response gradient; the most effective enforcement artifacts simultaneously constrain and teach
spec mining as codification — generalizes: the maturation trajectory (instruction → script) is spec mining applied to methodology; both share the same codification trigger ("a pattern has emerged from repeated execution")
Agentic Note-Taking 23: Notes Without Reasons — exemplifies: the judgment/verification gradient explains why automated link generation (judgment operation) degrades quality while automated link validation (verification operation) preserves it
Context Engineering for AI Agents in OSS — validates: 169 annotated commits across 10 actively maintained AGENTS.md files show add-then-modify dominance (Add 78, Modify 59, Remove 23, Remove-section 2), confirming the maturation trajectory empirically
ABC: Agent Behavioral Contracts — formalizes: hard/soft constraint vocabulary and Drift Bounds Theorem (D*=α/γ) provide mathematical grounding for the enforcement gradient; maps warning hooks to soft constraints with recovery windows
Harness Engineering (Lopopolo, 2026) — exemplifies: three runtime pillars (instructions → structural tests → automated cleanup agents) map to the constraining gradient; "every mistake is a harness bug" is the maturation trajectory in practitioner language
enforcement without structured recovery is incomplete — extends: adds the recovery column (corrective → fallback → escalation) missing from the enforcement gradient; oracle strength determines which recovery strategies are viable at each layer

Keys	Action
`?`	Open this help
`n`	Next page
`p`	Previous page
`s`	Search