Flow Agents Skills Map
This map groups the current skills by the user journey they support. The Builder Kit workflow system is centered on concrete workflow skills, while Flow Agents coordinates Flow Kit installation, runtime adapters, and local control.
For practical operator instructions and copy/paste prompts, see https://github.com/kontourai/flow-agents/blob/main/docs/workflow-usage-guide.md. For the shared cross-distribution contracts behind the workflow artifacts and gates, see https://github.com/kontourai/flow-agents/blob/main/docs/workflow-shared-contracts.md.
builder-shape: product-level Builder Kit shape invocation that guidesidea-to-backlogwithout requiring the user to name the primitive, linkskits/builder/flows/shape.flow.json, and stops at the backlog gate unless issue sync is explicit.idea-to-backlog: discovery, idea separation, thinnest meaningful slice, shaping, prioritization, and executable GitHub issue creation.pull-work: dynamic backlog selection, grouping/dependency checks, WIP awareness, worktree decision, and execution handoff; in Builder Kit build, every selected item or justified group needs fresh pickup Probe evidence before planning.design-probe: generic one-question-at-a-time probing interview; Builder Kit uses this step before planning when the build flow needs shared understanding or a pickup decision.pickup-probe: Builder Kit specialization ofdesign-probefor selected work items; records scope, provider state, WIP/conflict scans, risks, decisions, unresolved questions, accepted gaps, and planning readiness.plan-work/execute-plan/deliver: Definition Of Done, execution orchestration, and local delivery closure.review-work: report-only critique for quality, security triggers, architecture fit, and standards findings.verify-work: behavior evidence mapped to acceptance criteria and Goal Fit.evidence-gate: trust assessment for completed work: acceptance evidence, integrity checks, CI confidence, and next step.release-readiness: operational decisioning for a published change: merge/release/deploy/hold, rollback, observability, final acceptance docs, and post-deploy planning.learning-review: post-merge/post-deploy learning, follow-up routing, docs promotion checks, and durable knowledge capture.
flowchart LR
Idea[Raw idea or goal]
Shape[Builder Kit shape / idea-to-backlog<br/>shape executable work]
Pickup[pull-work<br/>select ready work]
Probe[design-probe / pickup-probe<br/>pickup Probe before planning]
Build[plan-work + execute-plan<br/>build the slice]
Review[review-work<br/>critique code and risk]
Verify[verify-work<br/>prove behavior]
GoalFit[goal-fit<br/>check user outcome]
Trust[evidence-gate<br/>map criteria to evidence]
Publish[publish-change<br/>commit / push / PR / CI]
Release[release-readiness<br/>merge / release / deploy / docs decision]
Learn[learning-review<br/>route follow-ups]
Backlog[(GitHub issues)]
Idea --> Shape --> Backlog --> Pickup --> Probe --> Build --> Review --> Verify --> GoalFit --> Trust --> Publish --> Release --> Learn
Probe -->|not needed| Build
Pickup -->|too vague or stale| Shape
Review -->|findings| Build
GoalFit -->|incomplete| Build
Trust -->|FAIL / NOT_VERIFIED| Build
Learn -->|new work| Shape
Current Shape
The operating model now has first-class coverage from idea intake through trusted delivery:
- Upstream product work is exposed through
builder-shapeand owned byidea-to-backlog. - Backlog selection and execution handoff are owned by
pull-work. - Design probing is a generic skill named
design-probe; in the Builder Kit build flow the step is still nameddesign-probe, and thepickup-probespecialization records selected-work readiness before planning.decision_gaproute-backs return there for missing pickup/planning decisions. - Product-level Builder Kit build may guide
pull-work -> design-probe / pickup-probe -> plan-work; direct primitives still stop at their own gates and report the expected next step. - Broad continuation language does not carry across newly selected work after merge. Queue inspection is allowed, but planning the next item requires a fresh pickup Probe record.
- Critique is owned by
review-workand persisted incritique.json. - Verification is owned by
verify-workand persisted inevidence.json. - Trust evidence is assessed by
evidence-gate; it decides whether completed work has enough proof and integrity to publish or continue fixing. - Publishing verified changes is the bridge between evidence and release readiness: commit the verified diff, push the branch, open or update the PR, and collect PR/CI evidence.
- Merge/release/deploy decisioning is owned by
release-readinessafter the publish-change gate. - Retrospective learning and follow-up routing are owned by
learning-review. - Implementation still flows through
plan-work,execute-plan,review-work, andverify-work, withDefinition Of DoneandGoal Fit Gatepreventing task-complete-but-user-incomplete delivery. - Real browser/runtime checks remain delegated to
feedback-loopandbrowser-test.
The upstream guardrail is intentionally strict: multiple ideas are inventoried separately first, the thinnest meaningful slice is identified for each buildable idea, and bundled work must have an explicit dependency or shared-outcome justification. The pickup workflow repeats this check before planning so unrelated backlog items do not silently become one implementation stream.
The intentionally deferred primitives such as intake-idea, shape-work, test-map, and scope-and-integrity-check are nested workflow sections for now. They should become separate skills only if their behavior grows enough to need independent contracts, artifacts, or eval suites.
Phase Composition
This view shows how each phase is composed. The left rail is the durable phase sequence; each phase row names its primary owner, supporting skills, nested sections that may later become primitives, and the gate/artifact that lets the next phase begin.
Primary
builder-shape idea-to-backlog
Support
knowledge-search search-first explore crowdsource frontend-design github-cli knowledge-capture
Nested sections / future primitives
intake/dedupe, separate ideas, thinnest meaningful slice, opportunity review, explore options, shape-work, prioritize work, sync executable backlog
Gate & artifact
Idea, slice, shape, and backlog gates. Writes shaped briefs and GitHub issue links in .flow-agents/<slug>/.
Primary
pull-work
Support
github-cli
Nested sections / future primitives
board snapshot, WIP check, grouping/dependency check, pickup Probe decision, worktree decision, plan-work handoff
Gate & artifact
Pickup gate and pickup Probe handoff. Writes selected issues, blockers, bundle justification, provider state, accepted gaps, worktree policy, expected modified files, conflict risks, and handoff notes.
Primary
plan-work execute-plan review-work verify-work
Support
feedback-loop browser-test deliver fix-bug tdd-workflow
Nested sections / future primitives
Definition Of Done, execution plan, parallel waves, implementation session state, critique report, verification report, runtime/browser validation, Goal Fit Gate
Gate & artifact
Review, verification, and Goal Fit gates. Produces critique findings plus test, build, lint, browser, or runtime evidence tied to acceptance criteria and the user-facing outcome.
Primary
evidence-gate release-readiness
Support
github-cli eval-rebuild
Nested sections / future primitives
criteria-to-evidence map, CI confidence, scope-and-integrity-check, publish-change, rollback review, observability review, post-deploy plan, final acceptance docs, remediate-ci
Gate & artifact
Evidence, publish-change, release, and docs gates. Writes confidence, integrity, commit/branch/PR/CI links, release scope, risk, rollback, deploy-readiness decisions, and durable documentation links.
Primary
learning-review
Support
knowledge-capture observe idea-to-backlog eval-rebuild
Nested sections / future primitives
facts vs interpretation, follow-up routing, docs promotion review, knowledge updates, eval updates, skill/backlog improvements
Gate & artifact
Learning gate. Writes outcomes, gaps, docs promotion state, follow-ups, knowledge updates, and verdict.
| Phase | Primary workflow skill | Supporting skills | Nested sections / future primitive candidates |
|---|---|---|---|
| Idea discovery and shaping | builder-shape, idea-to-backlog |
knowledge-search, search-first, explore, crowdsource, frontend-design, github-cli, knowledge-capture |
intake/dedupe, separate ideas, thinnest meaningful slice, opportunity review, explore options, shape work, prioritize work, sync executable backlog |
| Backlog pickup | pull-work |
github-cli |
board snapshot, WIP check, grouping/dependency check, Probe decision, worktree decision, handoff |
| Execution planning and build | design-probe, pickup-probe, plan-work, execute-plan, review-work, verify-work |
feedback-loop, browser-test, deliver, fix-bug, tdd-workflow |
Probe notes, Builder Kit Probe record, Definition Of Done, execution plan, parallel waves, implementation session state, critique report, verification report, Goal Fit Gate |
| Evidence and release confidence | evidence-gate, release-readiness |
github-cli, eval-rebuild |
criteria-to-evidence map, CI confidence, scope/integrity check, publish-change, rollback review, observability review, final acceptance docs, post-deploy plan |
| Learning and improvement | learning-review |
knowledge-capture, observe, idea-to-backlog, eval-rebuild |
facts vs interpretation, docs promotion review, follow-up routing, knowledge updates, eval/skill/backlog improvements |
The highest-leverage future extractions are likely shape-work, test-map, scope-and-integrity-check, and remediate-ci. They are still nested because their behavior is present, but not yet large enough to need separate activation contracts.
Gates And Artifacts
Each workflow phase ends with an explicit gate and durable artifact:
builder-shapeinvokes Builder Kit shape, delegates toidea-to-backlog, linkskits/builder/flows/shape.flow.json, and writes the standard.flow-agents/<slug>/<slug>--idea-to-backlog.mdartifact.idea-to-backlogwrites.flow-agents/<slug>/<slug>--idea-to-backlog.mdand produces shaped briefs plus GitHub issue links.pull-workwrites.flow-agents/<slug>/<slug>--pull-work.mdwith selected issues, WIP notes, blockers, pickup Probe decisions or accepted gaps, worktree decision, expected modified files, conflict risks, and aplan-workhandoff.plan-workanddeliverwrite.flow-agents/<slug>/<slug>--*.mdartifacts withDefinition Of Done,Goal Fit Gate, andFinal Acceptancesections.review-workwrites reviewer artifacts when available and updatescritique.jsonwith quality, security, architecture, standards, and resolution state.verify-workwrites verification artifacts and updatesevidence.jsonwith behavior evidence mapped to acceptance criteria.evidence-gatewrites.flow-agents/<slug>/<slug>--evidence-gate.mdwith acceptance evidence, CI summary, integrity report, verdict, and next step.release-readinesswrites.flow-agents/<slug>/<slug>--release-readiness.mdwith release scope, evidence reference, risk review, operational plan, rollback plan, observability plan, post-deploy checks, and decision.learning-reviewwrites.flow-agents/<slug>/<slug>--learning-review.mdwith outcomes, evidence, decisions, gaps, follow-ups, knowledge updates, and verdict.
Core gates:
- Idea Gate: raw input is deduped, classified, and routed.
- Slice Gate: each candidate has one outcome, one thinnest meaningful slice, and explicit split/bundle/dependency reasoning.
- Shape Gate: scope, non-goals, risk, rollout notes, and acceptance criteria are stable enough.
- Backlog Gate: GitHub issues represent executable or near-executable work.
- Pickup Gate: selected work is ready, WIP is acceptable, and worktree policy is recorded.
- Review Gate: report-only reviewers have no open blocking findings, or findings are explicitly accepted/deferred/false positive.
- Verification Gate: implementation evidence exists from local, automated, browser, or runtime checks.
- Goal Fit Gate: the original user outcome is satisfied, gaps are explicit, and local/project/global scope is clear.
- Evidence Gate: acceptance criteria are mapped to falsifiable evidence and scope integrity is checked.
- Publish Change Gate: verified changes are committed, pushed, represented by a PR or explicit no-PR decision, and PR checks/CI are linked.
- Release Gate: CI, docs, rollout, rollback, observability, and owner concerns are addressed for the risk class.
- Docs Gate: accepted planning artifacts are archived and promoted into durable docs when useful.
- Learning Gate: failures and recurring patterns are routed to tests, evals, skills, backlog, or knowledge capture.
End-To-End Flow
flowchart LR
Idea[Idea / vague goal]
BacklogSkill[idea-to-backlog]
Issue[Executable GitHub issue]
Pull[pull-work]
Probe[design-probe]
Worktree[worktree decision]
Plan[plan-work]
Execute[execute-plan]
Review[review-work]
Verify[verify-work]
GoalFit[goal-fit]
Evidence[evidence-gate]
Publish[publish-change]
Release[release-readiness]
Docs[final acceptance docs]
Learning[learning-review]
Done[merged / shipped]
Idea --> BacklogSkill --> Issue --> Pull --> Probe --> Worktree --> Plan --> Execute --> Review --> Verify --> GoalFit --> Evidence
Evidence -->|PASS| Publish --> Release --> Done --> Docs --> Learning
Review -->|findings| Execute
GoalFit -->|incomplete| Plan
Evidence -->|FAIL / NOT_VERIFIED| Plan
Pull -->|too vague / stale| BacklogSkill
Learning -->|systemic change| Eval[eval-rebuild / backlog / skill update]
Eval Coverage
Workflow evals are layered to match this map:
- Static contract evals guard non-negotiable skill boundaries.
- Behavioral activation evals check that agents choose the right workflow and stop at gates.
- Artifact quality evals inspect durable session artifacts and GitHub issue drafts.
- Adversarial evals exercise premature coding, vague issues, missing CI, weakened tests, and prototype promotion risks.
- End-to-end evals cover
idea-to-backlog -> pull-work -> design-probe -> plan-work -> execute-plan -> review-work -> verify-work -> goal-fit -> evidence-gateselectively.
This keeps one conversation capable of carrying the full operating loop while making each phase produce an artifact that the next phase can verify.