v1.0.0 — The Disciplined Velocity Release
The initial MeowKit release. 13 capabilities inspired by deep analysis of BMAD-METHOD and ClaudeKit-Engineer. Theme: scale throughput while maintaining absolute discipline.
Navigation Help
Problem: After orientating in a project mid-session, it's unclear which phase to enter next. Developers waste time re-reading plans, checking git, and manually triangulating state.
Solution: /meow:help scans the project and tells you exactly what to do next.
/meow:help
# → "You're in Phase 1. Gate 1 is not yet approved.
# Next step: run /meow:cook tasks/plans/260329-auth.md"What it scans:
tasks/plans/— pending plans awaiting Gate 1tasks/reviews/— pending verdicts awaiting Gate 2tests/— failing tests blocking Phase 3git log— unshipped commits blocking Phase 5memory/lessons.md— pending retrospective items
The skill maps observed state to one of the 7 pipeline phases and prints a single, unambiguous next action. No hallucination about "what might be next" — only concrete evidence-based routing.
Hook-Based Enforcement
Problem: Behavioral rules only work when agents follow them. Rationalization slips through. An agent convinced the .env read is "necessary" can bypass Rule 4 silently.
Solution: Three shell hooks upgrade behavioral rules to enforcement — the action is blocked before it executes.
| Hook | Event | What it prevents |
|---|---|---|
privacy-block.sh | PreToolUse | Reads of .env, *.key, credential files |
gate-enforcement.sh | PreToolUse | Source code writes before Gate 1 approval |
project-context-loader.sh | SessionStart | Missing project-context.md at session start |
Without hooks: agent reasons "this .env read is safe" → reads it anyway
With hooks: hook fires → read blocked → agent must ask human firstprivacy-block.sh and gate-enforcement.sh are preventive — they intercept the tool call before execution. project-context-loader.sh is proactive — it injects docs/project-context.md into context before any task runs.
Rules define why. Hooks enforce what.
Planning Depth Per Mode
Problem: All workflow modes used the same research approach regardless of task scope. A documentation update ran the same planning overhead as a system design task.
Solution: Each mode now declares a Planning Depth that determines how many researchers run before the plan is written.
| Mode | Researchers | Approach |
|---|---|---|
strict | 2 parallel | Competing approaches — each researcher argues a different design |
architect | 2 parallel | Competing approaches — same as strict |
default | 1 | Single researcher, standard depth |
audit | 1 | Single researcher, security-focused |
fast | 0 | Skip research — go straight to plan |
cost-saver | 0 | Skip research — minimize token spend |
document | 0 | Skip research — docs tasks don't need it |
strict and architect modes force competing approaches specifically to surface trade-offs that single-researcher planning misses. The synthesis step resolves the competition into a single recommended path.
Structured Debugging
Problem: Agents guess-and-patch bugs instead of investigating root causes. Fix one symptom, introduce another.
Solution: meow:debug enforces a 5-step workflow: reproduce → isolate → root cause → fix → verify. The iron rule: no fix without root cause. Includes a references/common-root-causes.md covering the patterns behind ~80% of bugs (null paths, race conditions, API mismatches, wrong conditions).
Post-Implementation Simplification
Problem: Code that passes tests can still be unnecessarily complex. Dead code, deep nesting, premature abstractions accumulate.
Solution: meow:simplify runs after Phase 3 (Build GREEN) and before Phase 4 (Review). The iron rule: behavior must not change — every simplification must pass the same tests. Removes dead code, flattens nesting, extracts only when 3+ duplicates exist.
Team Configuration
Problem: Setting up parallel agents requires creating worktrees, ownership maps, and task queues manually.
Solution: meow:team-config automates the entire parallel setup: analyzes the plan for independent subtasks, generates an ownership map with glob patterns, creates git worktrees, and initializes the task queue. Validates zero overlap before any agent starts.
Scale-Adaptive Intelligence
Problem: Orchestrators guess task complexity. Guesses are wrong. A fintech payment task classified as STANDARD gets Sonnet instead of Opus — and misses a security edge case.
Solution: Domain-based routing at Phase 0 via meow:scale-routing. A CSV maps keyword signals to complexity levels. The CSV verdict overrides manual classification — no exceptions.
Task: "Add Stripe refund endpoint"
→ scale-routing detects: domain=fintech, level=high
→ Override: COMPLEX tier (Opus)
→ Manual classification: ignoredKey behaviors:
- High domains (fintech, healthcare, auth) force COMPLEX tier
- Low domains (docs, config, changelog) unlock one-shot bypass of Gate 1
- User-extensible — add rows to
domain-complexity.csvfor project-specific domains - No rationalization — agents cannot argue down a high-level verdict
Project Context System
Problem: Each agent infers project conventions independently. One agent uses snake_case, another uses camelCase. Context drift compounds over a session.
Solution: docs/project-context.md is the agent constitution. Every agent loads it at session start, before any task-specific context.
Session Start
1. Load docs/project-context.md ← Always first
2. Load memory/lessons.md
3. Load task context
4. Route to agentWhat goes in project-context.md:
- Tech stack and versions
- Naming conventions
- Anti-patterns to avoid
- Testing approach and coverage targets
- Deployment process
Generate or update it:
/meow:docs-init # new project — scan codebase, generate skeleton
/meow:docs-sync # after shipping a feature — diff-aware updateMulti-Layer Adversarial Review
Problem: Single-pass code review misses bugs. One reviewer looks for what they expect to see.
Solution: meow:review now uses a 4-step workflow with 3 parallel reviewers and a triage step.
Step 1: Blind Hunter — reviews with zero context (catches hidden bugs)
Step 2: Edge Case Hunter — stress-tests boundary conditions and error paths
Step 3: Criteria Auditor — verifies every acceptance criterion is actually met
Step 4: Triage — synthesizes findings, eliminates duplicates, assigns severityResults: catches 2-3x more bugs than single-pass review, particularly:
- Silent failures where code compiles but behaves wrong under load
- Missing error handling in exception paths
- Acceptance criteria that are technically "met" but miss the intent
This uses the step-file architecture — each step is JIT-loaded, keeping context lean.
Anti-Rationalization Hardening
Problem: Agents rationalize their way around rules. "This is just a simple task, we don't need security review." "The tests look fine, I won't write more."
Solution: Four explicit blocks with no self-override:
| Blocked behavior | Why |
|---|---|
| Downgrading complexity after scale-routing assigns it | Domain signals are more reliable than in-context reasoning |
| Minimizing test coverage ("looks fine") | Tests that don't cover real paths are worse than no tests |
| Skipping security rules for "simple" tasks | Security issues don't respect task complexity |
| Dismissing WARN verdicts without explicit human acknowledgment | WARNs exist for a reason; silence is not acknowledgment |
Party Mode
Problem: AI agents presented with an architectural question give one answer. It may be wrong. There's no structured way to stress-test the decision.
Solution: /meow:party spawns 2-4 deliberation agents with different analytical lenses. Each reaches an independent conclusion. A synthesis agent reconciles positions into a single recommendation.
/meow:party "PostgreSQL or MongoDB for the session store?"Agent A (performance lens): PostgreSQL — ACID transactions, predictable latency
Agent B (ops lens): MongoDB — horizontal scaling easier, ops simpler
Agent C (migration lens): PostgreSQL — existing data in PG, no migration needed
Synthesis: PostgreSQL. Migration cost and existing infra outweigh scaling argument.
MongoDB revisit if read load exceeds 50k req/s (not projected for 12 months).Party output feeds into an ADR. The decision is documented before any code is written.
When to use: Two reasonable engineers would disagree. Long-term consequences. Wrong choice costs > 2 days.
Party Mode skill → | Architecture workflow →
Step-File Architecture
Problem: Complex skills with 3+ distinct phases load everything into context at once. Token-expensive. Hard to audit. Cannot resume after context reset.
Solution: Skills decompose into JIT-loaded step files.
skills/meow:review/
├── SKILL.md # Entry point — metadata only
├── workflow.md # Step sequence
├── step-01-blind-review.md
├── step-02-edge-cases.md
├── step-03-criteria-audit.md
└── step-04-triage.mdOnly the active step is in context. State persists to session-state/{skill}-progress.json — interrupted workflows resume from the last completed step, not from scratch.
Rules:
- One step at a time — never pre-load the next step
- Never skip steps — even short steps run fast and preserve sequence integrity
- Monolithic skills (< 150 lines) don't need decomposition
Currently step-file enabled: meow:review (4 steps).
Parallel Execution
Problem: COMPLEX tasks with independent subtasks run sequentially. A backend API, its tests, and a frontend component have no dependency between them — but they run one at a time.
Solution: Up to 3 parallel agents, each isolated in a git worktree.
# Orchestrator decomposes COMPLEX task:
git worktree add .worktrees/api-agent -b feat/api-agent
git worktree add .worktrees/ui-agent -b feat/ui-agent
git worktree add .worktrees/test-agent -b feat/test-agent
# Agents work simultaneously, zero file overlap
# After all complete: merge branches, run full integration testsConstraints:
| Rule | Value |
|---|---|
| Max parallel agents | 3 |
| File overlap allowed | Never |
| Gates (1 and 2) | Always sequential, always human-approved |
| Integration test after merge | Required |
| Eligible task tier | COMPLEX only |
Party Mode and Gates are never parallelized — coordination overhead exceeds the benefit.
Summary
| Feature | Solves | When it activates |
|---|---|---|
| Navigation Help | "What do I do next?" | Explicit /meow:help |
| Hook Enforcement | Rule bypass via agent rationalization | Every session, automatic |
| Planning Depth | Over/under-researching by mode | Phase 1, per active mode |
| Scale-Adaptive Routing | Wrong model tier for domain | Phase 0, automatic |
| Project Context System | Agent convention drift | Session start, automatic |
| Adversarial Review | Single-pass review misses bugs | Phase 4 via meow:review |
| Anti-Rationalization | Agents bypass rules via reasoning | Always, built into rules |
| Party Mode | No stress-test for architecture decisions | Explicit /meow:party |
| Step-File Architecture | Token waste + no resumability | Complex skills (meow:review) |
| Parallel Execution | Sequential bottleneck on independent work | COMPLEX tasks, explicit |
