Verification Plan: Skills vs Instructions

The file format is irrelevant — you can freely convert between the two. The only question is the loading mechanism: what does the skill pathway do that the instruction pathway doesn't, and does it matter?

Phase 1: Establish ground truth (local)

1.1 What does the Skill tool do?

Observe: when a skill is invoked, what actually happens? Is it Read(file) + "follow this", or does it inject variables, set up state, modify conversation mode?
Key question: Is there runtime behavior that Read(instruction_file) doesn't get?

1.2 Is `Skill("X")` different from `Read("path/to/X.md")`?

When the agent follows a CLAUDE.md routing rule ("when doing X, read Y.md"), it calls Read. Compare the result to what happens when the Skill tool fires.
Key question: Same content loaded — does the loading pathway change agent behavior?

1.3 Do the two discovery surfaces differ?

Skills appear in system-reminder messages as a list with descriptions and trigger conditions.
Instructions appear in CLAUDE.md routing table entries with conditions and paths.
Both are always-loaded. Both carry name + description + condition.
Key question: Does one surface produce more reliable activation than the other, or are they equivalent?

1.4 Token cost

Measure a skill description line vs a routing table entry. Both are ~1-2 lines.
Check for hidden costs: Skill tool schema tokens, descriptions repeated in multiple places.
Key question: Any real cost difference, or roughly equivalent as expected?

Phase 2: Platform documentation (web searches)

2.1 Claude Code skills — official guidance

Query: "claude code" skills ".claude/skills" documentation guide
Query: site:docs.anthropic.com claude code skills
Verify: Does Anthropic document any mechanical difference beyond discovery? Any guidance on when to use skills vs CLAUDE.md instructions?

2.2 The Skill tool internals

Query: "claude code" "Skill tool" invocation what it does
Query: "claude code" skill invoke vs read instruction
Verify: Is there documented behavior of the Skill tool beyond loading file content?

2.3 Codex equivalent

Query: codex AGENTS.md skills agents documentation
Verify: Does Codex make the same distinction? Different loading mechanism?

Phase 3: Practitioner evidence (web searches)

3.1 Skills vs instructions in practice

Query: "claude code" skill vs CLAUDE.md instruction when to use
Query: "CLAUDE.md" routing instructions best practice 2025 2026
Verify: Do practitioners report any behavioral difference (reliability, compliance) between the two pathways?

3.2 Activation reliability

Query: "claude code" skill "not triggered" OR "doesn't activate" OR "doesn't invoke"
Query: "CLAUDE.md" routing rule "agent ignores" OR "agent doesn't follow"
Verify: Is one activation mechanism more reliable than the other in practice?

3.3 Cross-platform

Query: cursor custom command vs rules file instruction routing
Query: agent system prompt instruction vs invokable skill comparison
Verify: Does the same loading-mechanism distinction appear across platforms? What do other platforms reveal?

Phase 4: Synthesis

Write findings.md:

List every verified mechanical difference in the loading mechanism
For each difference, evaluate whether it matters (does it change discoverability, composability, or trustworthiness of the procedure?)
Assess whether the current KB claim ("instructions are skills without automatic routing") holds
Answer the workshop question: when should you create a skill, and when is a CLAUDE.md routing entry enough?

Keys	Action
`?`	Open this help
`n`	Next page
`p`	Previous page
`s`	Search