Linking theory
Type: note · Status: seedling · Tags: links
We have link practices — relationship types (ADR 009), title-as-claim, context phrases, position conventions — but no theory explaining why these practices work and when they break down. This note collects the grounding claims and builds toward one.
Established claims about linking
Seven notes across the KB argue what makes links work. They cluster into three groups.
Links as navigation decisions
Agents navigate by deciding what to read next models every link encounter as a decision: follow or skip? The decision is always probabilistic — the agent can't know what the target contains until loading it. What makes it tractable is context: information surrounding the pointer that hints at what the target contains. The more context a pointer carries, the cheaper the navigation decision.
When the title carries the argument, the pointer itself becomes the hint — every link text, every search result, every index entry does navigation work without additional context. Title-as-claim is the shortcut that works across all pointer types.
Instruction specificity should match loading frequency adds the progressive-disclosure constraint: the title layer is the cheapest disclosure layer. If titles are claims, agents decide what to load from the title alone. If titles are topics, agents must load notes to discover what they argue — the disclosure layer fails.
Links as argument structure
Title as claim enables traversal as reasoning — when titles are claims, following links reads as a reasoning chain. "since [structure enables navigation]" composes grammatically; "since [navigation notes]" doesn't. The note graph becomes scannable arguments. Link semantics ("since", "because", "but") encode relationship types — Toulmin warrants connecting grounds to claims.
Claim notes should use Toulmin-derived sections for structured argument — title-as-claim is the Toulmin claim; "since"/"because" link semantics encode warrants. Not all claim-titled notes need the full Toulmin scaffold, but the scaffold reveals what claim titles are doing implicitly.
Link strength is encoded in position and prose — an inline "since [X]" that uses a note as a premise carries more weight than a footer entry. Position and prose encode commitment level, creating a weighted graph orthogonal to relationship type.
Links as maintenance surface
Title as claim exposes commitments, enabling Popperian maintenance — claim titles expose what each note commits to, so reviewing the KB becomes scanning hypotheses: "do I still believe this?" Maintenance cost scales with doubtful claims, not total notes. Topic titles hide commitments behind labels that require opening every file.
Title as claim makes overlap between notes visible — two claim titles arguing the same thing are obviously redundant at the index level. Two topical titles covering the same territory are invisible until you read both. Overlap detection becomes a scanning task rather than a reading task, and the benefit compounds with KB size.
Candidate theory: links as decision-cost reducers
The claims above share a common mechanism: links are decision points, and link quality is the reduction of navigation uncertainty per unit of context consumed.
Every interaction with the knowledge graph involves a decision under bounded context: follow this link? Load this note? Open this search result? Each decision has a cost — context consumed to make it — and a value — probability of reaching task-relevant information. Link quality is the efficiency of that exchange: how much uncertainty about the target does the link context eliminate, relative to the context budget it consumes?
Each practice instantiates this principle:
-
Typed relationships (extends, contradicts, grounds) beat "related" because they carry more decision-relevant information per word. "Related" eliminates zero uncertainty about what following the link will do for the reader's task.
-
Claim titles beat topical titles because they compress the note's commitment into the pointer. A claim title is a one-line summary of what you'll get; a topical title says only what territory you'll enter.
-
Position encodes strength because inline links pay for their context with the surrounding argument — the prose advances the argument while hinting at the target, making the marginal context cost near zero. Footer links must pay for their context out-of-band.
-
"Related" is not a relationship — it carries zero bits of decision-relevant information, the linking equivalent of a null description.
What the theory predicts
If link quality is decision-cost reduction, the theory predicts:
-
Vocabulary size is a trade-off. Too few relationship types and you can't discriminate; too many and authoring cost exceeds the discrimination benefit. Our five types — extends, grounds, contradicts, enables, exemplifies — should be evaluated against common agent tasks.
-
Different tasks need different link types. An agent verifying a claim should prioritize "grounds" and "contradicts" links. An agent looking for examples should prioritize "exemplifies". The vocabulary is useful to the extent that agent tasks decompose along the same axes.
-
Untyped links work in small KBs but fail at scale. With few notes, every link target is roughly equally likely to be useful — decision cost is low regardless. As the note count grows, untyped links provide no discrimination.
-
Link density has diminishing returns. Each additional link adds decision points. If the marginal link carries less decision-relevant information than the context cost of processing it, it hurts navigation rather than helping it.
-
Maintenance cost scales with exposed commitments. Popperian maintenance works because claim titles externalize decision-relevant information. Any practice exposing commitments at low context cost — claim titles, typed relationships, context phrases — will reduce maintenance effort.
Open questions
-
How do relationship type and position interact? A "contradicts" link in a footer is arguably more important than an "exemplifies" link inline. Do type and position compose additively, or does one dominate?
-
Is the vocabulary the right one? Our five types were borrowed from arscontexta and adapted. Are there common agent navigation patterns that need a type we don't have? Are there types we never use in practice?
-
Can we measure decision cost? The theory is only testable if we can observe agents making navigation decisions and measure whether typed links lead to better choices. Agent trace analysis might provide this.
Relevant Notes:
- ADR 009: Link relationship semantics — decision: the specific vocabulary this theory aims to ground
- agents-navigate-by-deciding-what-to-read-next — grounds: the navigation-decision model
- title-as-claim-enables-traversal-as-reasoning — grounds: claim titles as argument structure
- title-as-claim-exposes-commitments-enabling-popperian-maintenance — grounds: claim titles as maintenance surface
- title-as-claim-makes-overlap-between-notes-visible — grounds: claim titles as redundancy detector
- link-strength-is-encoded-in-position-and-prose — extends: position as a second axis the theory needs to account for
- instruction-specificity-should-match-loading-frequency — enables: progressive disclosure depends on title-layer quality