Skip to content

Latest commit

 

History

History
187 lines (134 loc) · 26.2 KB

File metadata and controls

187 lines (134 loc) · 26.2 KB

Command Reference

This file is the index. Per-command pages — inputs, outputs, preconditions, behavior, safety, definition of done — live next to their skill at plugins/test-commander/skills/<skill>/commands/<command>.md. Per the Phase 1 design decision, the per-command page is the single source of truth that Claude reads at runtime and that users read for reference.

Phase 1 commands (shipped)

Command Skill Per-command page
/tc:init tc-core init.md
/tc:status tc-core status.md
/tc:journal tc-core journal.md
/tc:next tc-core next.md

For the methodology behind /tc:next's recommendations, see next-step-inference.md.

Phase 2 commands (shipped)

Command Skill Per-command page
/tc:review-requirements tc-requirements review-requirements.md
/tc:review-user-stories tc-requirements review-user-stories.md
/tc:review-acceptance-criteria tc-requirements review-acceptance-criteria.md
/tc:requirements-coverage tc-requirements requirements-coverage.md
/tc:requirements-to-tests tc-requirements requirements-to-tests.md

For the rubric methodology, see requirements-quality-review.md, user-story-readiness.md, and acceptance-criteria-quality.md. End-to-end walkthrough: user-guide/reviewing-requirements.md.

Phase 3 commands (shipped)

Command Skill Per-command page
/tc:learn-from-docs tc-knowledge learn-from-docs.md
/tc:learn-from-specs tc-knowledge learn-from-specs.md
/tc:learn-from-code tc-knowledge learn-from-code.md
/tc:learn-from-api tc-knowledge learn-from-api.md
/tc:learn-from-tests tc-knowledge learn-from-tests.md

All five commands write per-source models under <workspace>/product-knowledge/, contribute scoped ## From <source> sections to the cross-cutting artifacts (entities.md, user-journeys.md, business-rules.md, assumptions.md), and call the shared synthesize_system_model.py to regenerate system-model.md. Gap signals route to <workspace>/requirements/open-questions.md with a [<kind>] prefix.

For the methodology behind each helper, see project-knowledge.md (umbrella) plus learning-from-documents.md, learning-from-specs.md, learning-from-code.md, learning-from-api.md, and learning-from-tests.md. End-to-end walkthrough: user-guide/building-project-knowledge.md.

Phase 4 commands (shipped)

Command Skill Per-command page
/tc:create-charter tc-explore create-charter.md
/tc:explore tc-explore explore.md
/tc:session-summary tc-explore session-summary.md
/tc:test-ideas tc-explore test-ideas.md

/tc:create-charter scopes an exploration session against Phase-3 product-knowledge + Phase-2 open questions. /tc:explore replays a recorded Playwright MCP session against the charter, classifies events into Observations / Evidence / Anomalies / Charter Coverage, and runs the internal exploration-review sub-mode (suppressible with --no-review) that appends [exploration-review] gap signals to <workspace>/requirements/open-questions.md. /tc:session-summary synthesizes a per-session summary with aggregate counts and structured candidate scenarios, plus rebuilds <workspace>/sessions/index.md. /tc:test-ideas enriches the Phase-2 tc-test-idea/v1 seeds under <workspace>/test-ideas/REQ-*.md, preserving every Phase-2 frontmatter key byte-for-byte, flipping status: seedstatus: enriched, merging phase_4_sessions:, and appending a ## Phase 4 enrichment body section.

For the methodology behind each helper, see exploratory-testing.md (umbrella) plus charter-based-exploration.md, session-based-test-management.md, and test-idea-model.md. End-to-end walkthrough: user-guide/exploring-an-app.md.

Phase 5 commands (shipped)

Command Skill Per-command page
/tc:generate-bdd tc-bdd generate-bdd.md
/tc:review-bdd tc-bdd review-bdd.md
/tc:traceability-map tc-traceability traceability-map.md

/tc:generate-bdd turns Phase-4-enriched test-idea seeds into Gherkin .feature files — one scenario per CS-NNN-NNN candidate, each carrying @req:/@cs: linkage tags, an @area: namespace tag, and a type-mapped class tag — then writes a per-feature summary, rebuilds bdd/index.md, and auto-runs the review sub-mode (suppressible with --no-review). /tc:review-bdd runs the six-category universal rubric (ambiguous-step, missing-tag, untraceable, ui-coupled-step, missing-examples, conjunction-overload), writes a verdict into each summary, and routes failures to <workspace>/requirements/open-questions.md as deduplicated [bdd-review] gap signals; it shares one review_features() implementation with the generate-time auto-run. /tc:traceability-map is the authoritative regenerator of <workspace>/traceability/requirements-map.md (the shared 4-column format /tc:requirements-coverage also writes) and <workspace>/traceability/test-map.md (the scenario-level chain with pending downstream links).

For the methodology behind each helper, see bdd-generation.md (umbrella) plus bdd-quality-review.md and traceability.md. End-to-end walkthrough: user-guide/generating-bdd.md.

Phase 6 commands (shipped)

Command Skill Per-command page
/tc:build-framework tc-build-framework build-framework.md
/tc:automation-plan tc-automation-plan automation-plan.md
/tc:automate tc-automate automate.md
/tc:review-automation tc-automate review-automation.md
/tc:generate-test-data tc-test-data generate-test-data.md

/tc:build-framework scaffolds the project-root Playwright/TypeScript framework lazily (Decision D8) — the tests/{e2e,pages,components,fixtures,db,utils}/ tree plus playwright.config.ts and package.json, created only when absent. /tc:automation-plan scores every BDD scenario against the universal seven-factor suitability rubric (traceable, regression-value, risk-flagged, deterministic, right-sized, data-ready, persona-scoped) and writes automation-plan/<area>.md ranking each automate / consider / manual. /tc:automate renders page objects, per-area fixtures, and specs for automate-ranked scenarios — each test() carrying a // @req:/@cs: provenance comment and reaching data only through its fixture — writes traceability/automation-map.md, and auto-runs the review (suppressible with --no-review). /tc:review-automation runs the seven-category universal rubric (inline-test-data, hardcoded-wait, missing-provenance, weak-locator, untraceable-spec, assertion-free, unverified-write), writes automation-plan/review-summary.md, and routes failures to requirements/open-questions.md as deduplicated [automation-review] signals; it shares one review_automation() implementation with the auto-run. /tc:generate-test-data populates test-data/seed/<area>.json and test-data/scenarios/<area>.md so the generated fixtures reach data through a file, never inline (Decision D6).

The Phase 6 commands generate and structurally validate TypeScript — they never invoke tsc or npx playwright test (execution is Phase 7's /tc:run). For the methodology, see playwright-standards.md, automation-suitability.md, automation-generation.md, and test-data-strategy.md. End-to-end walkthrough: user-guide/automation.md.

Phase 7 commands (shipped)

Command Skill Per-command page
/tc:run tc-run run.md
/tc:analyze-results tc-run analyze-results.md
/tc:report tc-quality-report report.md
/tc:quality-gate tc-quality-report quality-gate.md

/tc:run executes the generated suite (or ingests a recorded Playwright JSON report with --report) for a run mode — all, smoke, regression, feature, failed-only, tagged — and writes a per-run record under runs/<RUN-ID>/ mapping each passed / failed / flaky result to its requirement, candidate, scenario, and spec; the real npx playwright test invocation is refused under pytest (the hermetic boundary). It auto-runs the tc-evidence indexer (--no-index to suppress), which routes screenshots (committed), videos, and traces (git-ignored via evidence/.gitignore, with a git-lfs opt-in) into evidence/ and writes evidence/evidence-index.md. /tc:analyze-results triages every non-passed result (product-defect / test-defect / environment / flaky), writes runs/<RUN-ID>/analysis.md, and routes deduplicated [test-analysis] signals. /tc:report aggregates the workspace into quality-report/current-quality-report.md with all fifteen sections (keeping [fact] / [interpretation] / [review] separated), snapshots a full copy to quality-report/history/<YYYY-MM-DD-HHmm>.md, and rebuilds the traceability maps so test-map.md's Test result and Quality report columns resolve from pending. /tc:quality-gate evaluates the latest run against tc-quality-report.gate.thresholds and returns PASS / WARN / FAIL (the CLI exits 1 on FAIL).

/tc:run and /tc:report take an injected clock (--now) for byte-stable artifacts. For the methodology, see test-execution.md, failure-triage.md, evidence-management.md, quality-reporting.md, and quality-gates.md. End-to-end walkthroughs: user-guide/running-tests.md and user-guide/quality-report.md.

Phase 8 commands (shipped)

Command Skill Per-command page
/tc:learn tc-learning learn.md
/tc:learn-from-failures tc-learning learn-from-failures.md
/tc:learn-from-exploration tc-learning learn-from-exploration.md
/tc:learn-from-feedback tc-learning learn-from-feedback.md
/tc:review-lessons tc-learning review-lessons.md
/tc:promote-lessons tc-learning promote-lessons.md

The four capture commands append tc-lesson/v1 candidates to learning/lessons-inbox.md through one shared engine (monotonic LESSON-NNN ids, (source, origin, summary) dedup, path:line provenance): /tc:learn from a freeform --note, /tc:learn-from-failures from the Phase-7 runs/<RUN-ID>/analysis.md triage, /tc:learn-from-exploration from exploration-notes/ anomalies and coverage gaps, and /tc:learn-from-feedback from resolved open questions and uploaded feedback. /tc:review-lessons classifies each candidate into accepted / rejected / needs-human-review (rubric: severity: high → needs-human-review; a summary already accepted → rejected; otherwise accepted) and clears the inbox. /tc:promote-lessons proposes by default and, only with --apply (the human-approval gate), moves accepted lessons into learning/promoted-guidance.md (status: promoted) and renders a core-promotion-proposal.md for any core: true lesson. The loop writes only under learning/ — it never rewrites Test Commander's shipped methodology or any third-party skill (Open Question Q6). For the methodology, see learning-loop.md, lesson-taxonomy.md, and improvement-governance.md. End-to-end walkthrough: user-guide/learning-loop.md.

Phase 9 commands (shipped)

Command Skill Per-command page
/tc:visualize tc-visualize visualize.md
/tc:diagram-flow tc-visualize diagram-flow.md
/tc:diagram-sequence tc-visualize diagram-sequence.md
/tc:diagram-state tc-visualize diagram-state.md
/tc:diagram-architecture tc-visualize diagram-architecture.md
/tc:diagram-risk tc-visualize diagram-risk.md
/tc:diagram-coverage tc-visualize diagram-coverage.md
/tc:diagram-traceability tc-visualize diagram-traceability.md
/tc:diagram-test-strategy tc-visualize diagram-test-strategy.md
/tc:generate-infographic tc-visualize generate-infographic.md
/tc:render-visuals tc-visualize render-visuals.md

Every visual is generated only from committed workspace artifacts and carries a > Sources: footer; a generator refuses (exit 2) a missing source and points at the producing command. /tc:visualize runs every diagram generator (skipping any whose source is absent); the eight /tc:diagram-* commands generate individual Mermaid diagrams under visuals/mermaid/; /tc:generate-infographic writes a brief + spec of measured facts under visuals/infographic/; and /tc:render-visuals renders the Mermaid sources to SVG/PNG via the Mermaid CLI (the only command that shells out — refused under pytest, graceful when the CLI is absent). End-to-end walkthrough: user-guide/visuals.md.

Phase 10 commands (shipped)

Command Skill Per-command page
/tc:web-init tc-web web-init.md
/tc:web-start tc-web web-start.md
/tc:web-sync tc-web web-sync.md
/tc:web-index-artifacts tc-web web-index-artifacts.md
/tc:web-export tc-web web-export.md

The web console is a team-facing, read-only and proposal-only viewer over the workspace (a Next.js frontend + a FastAPI backend, brought up by make run on docker compose). /tc:web-init provisions the console config; /tc:web-index-artifacts and /tc:web-sync (re)build the SQLite index (a rebuildable derivative — the workspace stays authoritative); /tc:web-start brings the stack up; /tc:web-export writes a shareable static bundle. The console also ships a read-only chat that answers from the index and surfaces command proposal cards — it never changes the workspace or runs a command (execution is gated behind the Phase 10.5 governance pipeline). End-to-end walkthrough: user-guide/web-console.md; architecture: web-console.md; API: runtime-api.md.

Phase 10.5 governance (shipped — no /tc:* commands)

tc-governance ships no /tc:* commands; it is the controlled-execution pipeline the console invokes through POST /api/execute (intent → command planner → permission policy → approval gate → bounded execution → output validation → audit log). Default deny: nothing above read-only runs without passing the policy engine and, where required, an approval gate, and the agent never sees the raw user prompt. Roles and approval policy are configured in <workspace>/policy/permissions.yaml and approvals.yaml. See controlled-agent-execution.md, security-and-permissions.md, runtime-approval-flow.md, agent-adapters.md, and the tester guide user-guide/governance.md.

Phase 11 runtime integrations (shipped — no /tc:* commands)

tc-mcp ships no /tc:* commands; it exposes Test Commander to other tools and agents two ways, both governed by the Phase-10.5 pipeline (no direct-execution backdoor):

  • the Runtime API (apps/api) — the Phase-10 read-only console API expanded with the /api/runtime/ namespace: GET /info (the seven permission levels), POST /plan (a read-only dry run), POST /execute (governed execution). See runtime-api.md.
  • the MCP server (apps/mcp) — a schema-first Model Context Protocol server (python -m tcmcp, stdio) with three tools: tc_status, tc_plan, and tc_run_command. See mcp-server.md.

The seven permission levels are enforced server-side on both front-ends; a destructive action is refused for an unauthorized role, and an approval with no approver is held. Integration walkthrough: user-guide/integrating.md.

Phase 12 commands (shipped)

Command Skill Per-command page
/tc:sandbox-init tc-sandbox sandbox-init.md
/tc:sandbox-launch tc-sandbox sandbox-launch.md
/tc:sandbox-status tc-sandbox sandbox-status.md
/tc:sandbox-sync tc-sandbox sandbox-sync.md
/tc:sandbox-stop tc-sandbox sandbox-stop.md
/tc:sandbox-export tc-sandbox sandbox-export.md

A sandbox is an on-demand, team-accessible Test Commander environment launched from GitHub Actions and governed exactly as the local runtime is — the Phase-10.5 pipeline runs inside it, so a sandbox cannot execute above its approved level. Targeting is safe-by-default (allow-listed hosts, blocked private ranges). The provider abstraction (sandbox/providers/) backs the commands; the MVP default is docker-compose-local. Architecture: sandboxed-environments.md; workflow: github-actions-sandbox.md; walkthrough: user-guide/sandbox.md.

Phase 13 commands (shipped)

Command Skill Per-command page
/tc:watch-changes tc-continuous-quality watch-changes.md
/tc:impact-analysis tc-continuous-quality impact-analysis.md
/tc:coverage-gap-analysis tc-continuous-quality coverage-gap-analysis.md
/tc:propose-tests tc-continuous-quality propose-tests.md
/tc:create-test-pr tc-continuous-quality create-test-pr.md
/tc:continuous-quality-check tc-continuous-quality continuous-quality-check.md

Continuous quality mode watches application changes, maps them to impacted features, finds coverage gaps, proposes tests, and opens clearly-labeled PRs when the configured autonomy mode allows it. It runs the watch → analyze → propose → PR loop through the same Phase-10.5 pipeline; the autonomy mode (0–4) is a ceiling on what auto-approves, and destructive/admin never auto-approve. Architecture: continuous-quality-agent.md; autonomy: autonomy-levels.md; walkthrough: user-guide/continuous-quality.md.

Test-plan commands (shipped)

Command Skill Per-command page
/tc:generate-test-plan tc-test-plan generate-test-plan.md
/tc:update-test-plan tc-test-plan update-test-plan.md

/tc:generate-test-plan seeds a human-owned test-plan/test-plan.md from the requirements inventory and writes a test-plan/coverage-map.md; /tc:update-test-plan regenerates the coverage map without touching the plan. For the division of labor between the deterministic helper and the Claude judgment layer, see test-planning.md.

All phases shipped

Phases 0–13 are complete. Test Commander ships the full workflow from workspace initialization through requirements, knowledge, exploration, BDD, automation, execution, learning, visualization, the web console, governance, the Runtime API + MCP server, sandboxes, and continuous quality.