A comprehensive research and comparison of spec-driven development (SDD) tools for AI-assisted coding, including analysis of git worktree support, architectural approaches, and practical recommendations.
This repository contains in-depth research comparing spec-driven development tools, agent orchestration, and the emerging execution layer for AI-assisted coding.
An interactive, Artificer-themed visualizer of this research: cameronsjo.github.io/spec-compare
- Compare workflows — step the same scenario (trivial change, greenfield, refactor, bug fix, parallel dev, cross-cutting) through all eight core tools in lockstep and watch OpenSpec's short path contrast with BMad's long one.
- Feature matrix — sortable, filterable capability table aggregated live from the data.
- Scoring heatmap — all thirteen tools across seven use-case dimensions, color-graded 1–5.
- Decision guide — an interactive flowchart that walks you to a recommended tool.
- Tool profiles — per-tool metadata, key features, and limitations.
Every value is extracted from the research docs below — no fabricated attributes.
The site lives in site/ — a React + Vite + TypeScript app on a vendored copy of the Artificer design system. Each tool is one JSON file under site/src/data/tools/, auto-discovered and AJV-validated against schema.json; the matrix and heatmap are derived by aggregating across those files (single source of truth).
cd site
npm install
npm run dev # local dev server
npm run validate # AJV-validate every tool JSON + cross-tool invariants
npm run build # validate → tsc → vite build (outputs site/dist/)
npm run preview # serve the production build locallyPushing to main builds and publishes site/dist/ to GitHub Pages via .github/workflows/deploy.yml.
- GitHub Spec-Kit - Open-source CLI toolkit for greenfield projects
- Spec Kitty - Community fork with built-in git worktree orchestration
- BMad Method - Enterprise framework with 21 specialized AI agents
- OpenSpec - Lightweight change-management for brownfield projects
- Kiro - AWS-backed agentic IDE, GA since Nov 2025 (paid tiers + CLI)
- Tessl - Spec-as-source platform; public Framework + Registry ($125M raised)
- Superpowers - MIT skills framework + methodology; brainstorm → plan → subagent TDD (~214K stars)
- Traycer - Commercial VS Code Plan → Execute → Verify layer over your agent (100K+ users)
- GSD - Meta-prompting SDD system with wave-based context management (63.8K stars)
- Ralph Loop - Stateless iterative execution pattern by Geoffrey Huntley
- Zencoder/Zenflow - Commercial SDD control plane; free Zenflow desktop app
- Kilo Code - Open-source agentic platform with Memory Bank ($8M seed, 1.5M users)
- Conductor - macOS parallel agent runner using git worktrees
- PromptX - AI agent context platform via MCP (gap entry)
- MUSUBI - Maximally-rigorous SDD framework, marginal (~57 stars, stalled)
Critical Gap: Most SDD tools excel when requirements are clear upfront but struggle with iterative changes like "change button from blue to green."
- OpenSpec - Purpose-built for modifications with delta format (ADDED, MODIFIED, REMOVED)
- Tessl - Spec-as-source enables edit-and-regenerate (Framework now public)
- Spec-Kit - Requires
/speckit.clarifyworkaround, not optimized for small changes - Kiro/BMad - "Sledgehammer to crack a nut" problem for trivial changes
See Iterative Development Analysis and Use Case Scoring for details.
Spec Kitty pioneered built-in git worktree support among SDD tools (Superpowers, Conductor, and Zencoder/Zenflow now automate worktrees too), enabling:
- Automatic worktree creation per feature
- Parallel feature isolation without branch switching
- Automated cleanup on merge
- Spec-First: Specs precede coding but are discarded (Spec-Kit, Kiro, BMad)
- Spec-Anchored: Specs persist and evolve (OpenSpec, Spec Kitty)
- Spec-as-Source: Only specs are edited, code auto-generates (Tessl)
The research is organized into focused, digestible documents:
- GitHub Spec-Kit - Open-source CLI toolkit
- Spec Kitty - Community fork with worktree support
- BMad Method - Enterprise framework with 21 agents
- OpenSpec - Lightweight change management
- Kiro - AWS-backed agentic IDE (GA)
- Tessl - Spec-as-source platform (public Framework + Registry)
- Superpowers - Auto-triggered skills + methodology, any harness
- Traycer - Spec-driven Plan → Execute → Verify layer (VS Code)
- GSD (Get Shit Done) - Context-engineered SDD for solo developers
- Ralph Loop - Stateless iterative execution pattern
- Comparison Matrices - Side-by-side feature comparisons (incl. AGENTS.md support)
- Use Case Scoring - 12 real-world scenarios graded + expanded 13-tool heatmap
- Iterative Development - Spec modification workflows
- Git Worktree Support - Detailed worktree analysis (updated with Beads, Conductor)
- Recommendations - Decision frameworks by use case
- Critical Analysis - Concerns, critiques, and future outlook
- Sources - All citations and references
- Orchestration Landscape - 30+ multi-agent tools surveyed, including Claude Code Agent Teams
- Beads, Agent Mail & Gas Town - Agent memory, messaging, and multi-agent villages
- Gaps: New Frameworks - Zencoder, Kilo Code, Conductor, PromptX, MUSUBI (GSD, Ralph Loop, Superpowers & Traycer promoted to full profiles)
- May 2026 Reassessment - Version/status re-verification + new-tool findings
- Beads + OpenSpec Cheatsheet - Practical setup and daily workflow
| Tool | License | Git Worktrees | Best For | Maturity |
|---|---|---|---|---|
| Spec-Kit | Open Source | No | Greenfield projects | Production (v0.8.18) |
| Spec Kitty | Open Source | Yes | Parallel development | Active Dev (v3.1.9) |
| BMad Method | Open Source | No | Enterprise workflows | Stable (v6.8.0) |
| OpenSpec | MIT | No | Brownfield changes | Production (v1.3.1) |
| Kiro | Proprietary | No | IDE experience | GA (v0.12.x) |
| Tessl | Proprietary | No | Spec-as-source | Active Dev (public) |
| Superpowers | MIT | Yes | Disciplined autonomous dev | Active Dev (v5.1.0) |
| Traycer | Proprietary | No | Plan-first orchestration | Active Dev |
Use Spec Kitty - The most complete built-in worktree management + parallel feature isolation among SDD tools (Superpowers also automates worktrees via its skill).
Use OpenSpec - Lightweight change management without excessive overhead.
Use BMad Method - Comprehensive workflows with 21 specialized agents.
Use Spec-Kit - Battle-tested, constitution-driven development.
Try Kiro or Tessl - Kiro is GA (free tier + paid); Tessl's Framework + Registry is now public.
The research includes analysis of:
- The Waterfall Question: Does SDD reintroduce waterfall bureaucracy?
- AI Adherence Issues: Agents frequently ignore specifications
- Scalability Concerns: Unclear when SDD adds value vs. overhead
- Historical Parallels: Similarities to failed Model-Driven Development (MDD)
- 25% of Y Combinator Winter 2025 cohort has 95% AI-generated codebases
- Industry leaders predict developers won't look at code by 2027
- Specifications becoming "the fundamental unit of programming"
See CONTRIBUTING.md for contribution guidelines.
See CHANGELOG.md for version history and changes.
This project is licensed under the MIT License - see the LICENSE file for details.
All research is compiled from publicly available sources including:
- Official tool documentation and repositories
- Industry blog posts and articles
- Comparative analyses from Martin Fowler, Medium, and others
- Critical perspectives from Marmelab, RedMonk, and Thoughtworks
Full source citations are available in docs/sources.md.
For questions, issues, or suggestions, please open an issue on GitHub.
Last Updated: 2026-05-31