Skip to content

Reference

Installed skill wrapper

The installed skill includes scripts/agent-workbench.sh.

People normally ask their agent to use $agent-workbench. The agent follows the skill instructions and uses the wrapper internally.

Wrapper environment variables

Variable Description
AGENT_WORKBENCH_REPO Override the GitHub repository used for release downloads.
AGENT_WORKBENCH_VERSION Pin the release tag used by the wrapper.
AGENT_WORKBENCH_BIN Execute an already-built local CLI instead of downloading a release asset. Used for CI and release-candidate validation.
GITHUB_TOKEN Optional token for GitHub API and release download requests.
XDG_CACHE_HOME Override the cache root.

When the skill is inside an Agent Workbench source checkout, the wrapper uses the checkout's already-built target/debug/agent-workbench or target/release/agent-workbench before falling back to CLI_VERSION.

Command groups

Group Purpose
init, status, next Project setup and current work query.
work Work-unit lifecycle and activation stack operations.
resume-check, gate resume-ready Recorded and read-only resume evaluation.
rules, correction, authority Project and work-scope operating rules.
command Fixed/preferred command profiles, usage, promotion, and deviations.
record Structured work records and evidence links.
repository, git Repository snapshots, Git commits, file changes, and comparisons.
design, requirement, design-decision, gate-template Design package import and inspection.
trace, decompose, checklist, stale Design-to-task traceability, checklist completion, and explicit stale record disposition.
review, finding, closure, review-context Review planning, runs, findings, closures, and focused context.
evidence, coverage, gate Implementation evidence, coverage, validation gates, and readiness checks.
kpt Process review over corrections, command drift, findings, and outcomes.

Rule precedence

latest user instruction
  > active work-unit rule
  > project or repository rule
  > skill default
  > historical export or work record

When close-ready reports shadowed rule conflicts, agents should inspect rules applicable --scope current. Lines with shadowed_by=<id> identify the shadowed rule. If the override is intentional, record approval with acceptance add --target rule:<shadowed-rule-id> --type explicit_exception.

Review modes

Mode Use
fresh / new_unbiased_review New unbiased review and completion checks.
resume / finding_fix_verification Verify known finding closures.

review run add records a review result; it does not execute a review. Design-derived readiness gates only accept clean context-targeted review runs with trusted provenance. Use --provenance external_agent --external-agent-id <agent-id> --provenance-ref <review-output-ref> for a separate agent review, or --provenance human_review --provenance-ref <review-output-ref> for a human review. Self-recorded clean runs do not satisfy gate review evidence.

Readiness gates

Gate Purpose
design-ready Design can move to decomposition.
implementation-ready Design-derived tasks and gate selections are ready for implementation.
close-ready Active work has required evidence, coverage, closed checklists, reviews, command usage, repository state, and records.
resume-ready Suspended work can resume without stale or unresolved assumptions.

Design-derived implementation should use explicit activation for the work unit produced by decompose design: agent-workbench work activate --implementation --design-version <design-version-id> <work-unit-id>. agent-workbench work start --implementation --design-version ... is rejected so agents do not create unbound implementation work.

If a required review plan was created for the wrong scope or is intentionally not required, record a user, policy, or design authority event and run agent-workbench review plan waive <review-plan-id> --reason "<reason>" --authority <authority-event-id>. This records an approved exception that readiness gates understand without direct ledger edits.

For carried requirements whose key and content are unchanged across design versions, close-ready treats the current design's selected validation gate as covering the older equivalent requirement record for the same task. Agents should not repair that case by selecting gates against obsolete design records.

When close-ready is blocked, the failed item details are intended to be the agent-facing repair surface. validation_runs_recorded lists missing selected gate derivations as task_derivation:<id> and validation run blockers as validation_gate:<id>. review_plans_clean lists review_plan:<id>, stale target counts, and the exact context_ref required for context-targeted review runs. Agents should use those printed IDs with normal CLI commands, not inspect the ledger directly.

Agent-facing state

The ledger is storage, not the agent-facing API. Agents should use status, next, review-context, list commands, and readiness gates to decide what to do. If the CLI cannot answer a workflow question, that is a product gap to fix, not a reason for agents to inspect the ledger directly.

status and next can report a phase blocker. In that state the agent-facing next action is the printed blocker-resolution command, such as finding classification, closure, verification, or work unblock. Implementation should wait until the blocker is gone.