Execution model

okso separates high-level planning from step-by-step execution so that tool calls stay predictable and reviewable.

Planner pass

The planner drafts a numbered outline that mentions the tools to use for each step.
An intent classification step determines if the request needs web context and filters the tool catalog for the planner.
A dedicated rephrasing step asks the configured search rephraser model (via SEARCH_REPHRASER_MODEL_SPEC, with autotuned defaults) for 1–3 focused web search strings constrained by a JSON schema. Each query is searched and the aggregated snippets are folded into the planner prompt. When llama.cpp is unavailable, okso skips the model call and uses the raw user query for search.
The outline is emitted as structured JSON for logging and optional downstream automation.
Approval prompts give you a chance to refine or abort the plan before any commands run.
A side-effect-free scorer evaluates each sampled outline before selection. Plans that stay within the PLANNER_MAX_PLAN_STEPS budget, end with final_answer, use registered tools, and delay side-effecting actions receive higher scores and win ties when multiple candidates share the same numeric total.
Each planner invocation currently generates a single candidate (the PLANNER_SAMPLE_COUNT knob is pinned to 1 in the planner); the scored JSONL history is written to PLANNER_DEBUG_LOG so you can audit how the winner was chosen.
The planner itself depends on llama.cpp for plan generation and scoring; if llama.cpp is unavailable the run stops before execution begins.

See Planner sampling for detailed scoring heuristics and debug log fields that help compare candidates.

Executor

After the plan is approved, the runtime executes the planner-selected tools in order:

Default behaviour: the executor loop runs each planned tool call deterministically. Planner output provides seed values for all arguments; empty string seeds indicate fields the executor must fill from observations, while non-empty seeds are used as-is. Fill mode is inferred from seed presence; missing required values result in validation errors that stop execution. During arg-fill, the executor summarizes web fetch observations and applies prompt context budgets before calling the model.
Fallback behaviour: if llama.cpp is unavailable, okso preserves planner-provided arguments and skips LLM-based arg enrichment.

Each executor decision and observation is streamed to the terminal.

Execution summaries keep tool observations machine-readable by emitting compact JSON for tool outputs (for example, a web search observation remains {"items":[...],"total_results":5} instead of a bulleted list). Use the format_tool_history_pretty() helper in src/lib/cli/output.sh if you need a human-formatted view.

Configuration hooks

APPROVE_ALL: toggles plan approval prompts.
INTENT_DISABLE_SEARCH: skips the pre-planner search stage regardless of intent.
PLANNER_SAMPLE_COUNT, PLANNER_TEMPERATURE, PLANNER_MAX_OUTPUT_TOKENS: control planner sampling settings.

See usage for CLI flags that control approvals and logging.