Tool loop

Type: index · Status: current · Tags: computational-model, context-engineering, tool-loop

Many LLM applications share a common operational core: construct a task frame, give the model tools, and loop until it stops.

state = initial_task_frame()

while not done(state):
    turn = llm_call(state, tools=tools)
    if turn.type == "tool_request":
        result = execute_tool(turn.request)
        state = absorb(state, turn.request, result)
    else:
        state = absorb(state, turn.output)

Frameworks own this loop because the mechanics are repetitive protocol work — parsing tool requests, dispatching to handlers, serializing results, feeding them back, handling streaming and retries. Abstracting that away is good engineering, just as abstracting HTTP parsing is.

Many useful interventions can stay hidden inside this loop without changing its structure: logging, approvals, budget checks, checkpoints, deterministic transforms on tool results. A stateful singleton runtime behind the tool boundary can go further, holding recursion state and branch records. The recovery is genuine — but the question is not whether the loop can absorb bookkeeping. It is who gets to decide what the next step can do.

Forcing cases

Three cases where a single framework-owned loop becomes insufficient:

subtasks that need different tools force loop exposure in agent frameworks — decomposition creates children that need different capability surfaces; the parent must construct fresh calls with different tool sets, which a fixed loop cannot express
semantic sub-goals that exceed one context window become scheduling problems — some sub-goals require deterministic orchestration over smaller semantic judgments because the material doesn't fit in one bounded call
codified scheduling patterns can turn tools into hidden schedulers — as next-step policies stabilize into code, hiding them in tools collapses orchestration into covert runtime logic

Resolution

The first and third cases call for sub-agents — fresh tool loops with their own prompt, capability surface, and stop condition. The second calls for something more: symbolic composition of agents — code-controlled iteration, filtering, and aggregation over multiple agent invocations. Sub-agents are the atomic unit; symbolic orchestration is what the application does with them. "Exposing the loop" means the framework supports both: spawning child loops and composing them in application code.

Downstream consequences

session history should not be the default next context — sub-tasks should start with constructed prompts, not inherit the parent's full conversation
apparent success is an unreliable health signal in framework-owned tool loops — when the framework owns recovery, the parent cannot distinguish intended-path success from workaround success
traditional debugging intuitions break when tool loops can recover semantically — programmers expect broken infrastructure to fail loudly; semantic recovery violates that, creating false confidence
silent disambiguation is the semantic analogue of tool fallback — the same observability problem for ambiguous specs rather than broken tools
conversation vs prompt refinement in agent-to-agent coordination — once sub-agents exist, the parent must choose how results come back: trace preservation, compression, or context forking

RLM has the model write ephemeral orchestrators over sub-agents — boundary case: the model writes an ephemeral symbolic orchestrator that composes agents via recursive_llm() — the loop is exposed to the model rather than the programmer, but discarded after each run

Relevant Notes:

bounded-context orchestration model — foundation: each sub-agent is a bounded call with its own select(K) in the scheduling model
llm-mediated schedulers are a degraded variant of the clean model — consequence: workarounds for hidden loops push scheduling into the conversational medium
"agent" is a tool loop — convention: grounds the sub-agent mechanism by equating "agent" with "tool loop"
agent orchestration occupies a multi-dimensional design space — broader context: the tool loop is one dimension; scheduler placement, persistence, coordination form, and return artifacts vary independently

Other tagged notes

The chat-history model trades context efficiency for implementation simplicity — Chat history persists because appending messages preserves information and avoids interface design, but that convenience trades away selective loading under bounded context

Keys	Action
`?`	Open this help
`n`	Next page
`p`	Previous page
`s`	Search

Tool loop

Forcing cases

Resolution

Downstream consequences

Related approaches

Other tagged notes