# `/kaizen` Command Implementation Plan

> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.

**Goal:** Build the `/kaizen` kaizen-loop command — a stdlib scanner that parses `docs/FRICTION.md` *Open signals* plus an interactive command that curates them (add/change/park/remove) into the decisions ledger.

**Architecture:** Mirrors `/review-repo` exactly: a deterministic stdlib Phase-0 scanner (`scripts/friction-scan.py`, unit-tested) feeds a markdown command (`.claude/commands/kaizen.md`) that drives the interactive curation. The same scanner powers a stage-2 nudge surfaced in `/review-repo`.

**Tech Stack:** Python 3 standard library only (matches `scripts/repo-scan.py`); pytest; markdown command docs.

**Spec:** `docs/superpowers/specs/2026-06-14-kaizen-command-design.md`

---

## File structure

- Create: `scripts/friction-scan.py` — stdlib parser of `FRICTION.md` *Open signals*; `--json` (default) and `--nudge` modes. One responsibility: turn the prose signal log into structured data + the nudge line.
- Create: `tests/test_friction_scan.py` — unit tests for the parser (string-based, deterministic via `--today`), matching `tests/test_repo_scan.py`.
- Create: `.claude/commands/kaizen.md` — the interactive curation process.
- Modify: `.claude/commands/review-repo.md` — add the stage-2 nudge line to its report.
- Modify: `STATUS.md` — add a `/kaizen` row.
- Modify: `docs/TODO.md` — mark item 11.1 in progress / built.

All scanner logic lives in functions that take strings/data (not files) so tests need no fixtures on disk; only `load_signals(path, today)` and `main()` touch the filesystem.

---

## Task 1: Scanner scaffold — section extraction + signal splitting

**Files:**
- Create: `scripts/friction-scan.py`
- Test: `tests/test_friction_scan.py`

- [ ] **Step 1: Write the failing test**

```python
# tests/test_friction_scan.py
import importlib.util
import os

_SPEC = importlib.util.spec_from_file_location(
    "friction_scan",
    os.path.join(os.path.dirname(__file__), "..", "scripts", "friction-scan.py"),
)
fs = importlib.util.module_from_spec(_SPEC)
_SPEC.loader.exec_module(fs)

SAMPLE = """# FRICTION.md

## Open signals

_(append new raw signals here)_

- `[gotcha]` **First thing** (2026-06-01): body line one.
  continuation line two.
- `[friction]` **Second thing** (2026-06-10): only one line.

---

## Kaizen reviews — decisions ledger

- `[gotcha]` **Should not be parsed** (2026-01-01): in the ledger.
"""


def test_extract_open_section_stops_at_next_heading():
    section = fs.extract_open_section(SAMPLE)
    assert "First thing" in section
    assert "Second thing" in section
    assert "Should not be parsed" not in section


def test_split_signals_finds_two_items_and_joins_continuations():
    signals = fs.split_signals(fs.extract_open_section(SAMPLE))
    assert len(signals) == 2
    assert "continuation line two" in signals[0]
    assert signals[1].startswith("`[friction]`")
```

- [ ] **Step 2: Run test to verify it fails**

Run: `.venv/bin/python -m pytest tests/test_friction_scan.py -v`
Expected: FAIL — `friction-scan.py` does not exist / `extract_open_section` undefined.

- [ ] **Step 3: Write minimal implementation**

```python
#!/usr/bin/env python3
"""Parse docs/FRICTION.md 'Open signals' into structured data for /kaizen.

Stdlib only. Modes:
  --json  (default): emit the open signals as JSON (Phase-0 input for /kaizen)
  --nudge          : print a one-line 'loop overdue?' summary

Authoritative design: docs/superpowers/specs/2026-06-14-kaizen-command-design.md
"""
import argparse
import datetime
import json
import os
import re

REPO_ROOT = os.path.dirname(os.path.dirname(os.path.abspath(__file__)))
FRICTION = os.path.join(REPO_ROOT, "docs", "FRICTION.md")


def extract_open_section(text):
    """Return the body between '## Open signals' and the next '## ' heading."""
    lines = text.splitlines()
    start = None
    for i, line in enumerate(lines):
        if line.strip().lower() == "## open signals":
            start = i + 1
            break
    if start is None:
        return ""
    end = len(lines)
    for j in range(start, len(lines)):
        if lines[j].startswith("## "):
            end = j
            break
    return "\n".join(lines[start:end])


def split_signals(section):
    """Split the Open-signals body into raw per-signal blocks.

    A signal starts with a top-level '- ' bullet; indented or blank lines are
    continuations. Returns a list of multi-line strings with the leading '- '
    stripped from the first line."""
    signals = []
    current = None
    for line in section.splitlines():
        if line.startswith("- "):
            if current is not None:
                signals.append("\n".join(current).strip())
            current = [line[2:]]
        elif current is not None:
            if line.strip() == "" or line.startswith("  "):
                current.append(line.strip())
            else:
                signals.append("\n".join(current).strip())
                current = None
    if current is not None:
        signals.append("\n".join(current).strip())
    return [s for s in signals if s]


if __name__ == "__main__":  # pragma: no cover (filled in Task 4)
    pass
```

- [ ] **Step 4: Run test to verify it passes**

Run: `.venv/bin/python -m pytest tests/test_friction_scan.py -v`
Expected: PASS (2 tests).

- [ ] **Step 5: Commit**

```bash
git add scripts/friction-scan.py tests/test_friction_scan.py
git commit -m "feat(kaizen): friction-scan section extraction + signal split"
```

---

## Task 2: Per-signal fields — tag, first_seen, age_days

**Files:**
- Modify: `scripts/friction-scan.py`
- Test: `tests/test_friction_scan.py`

- [ ] **Step 1: Write the failing test**

```python
import datetime

TODAY = datetime.date(2026, 6, 15)


def test_parse_signal_extracts_tag_and_date_and_age():
    raw = fs.split_signals(fs.extract_open_section(SAMPLE))[0]
    sig = fs.parse_signal(raw, TODAY)
    assert sig["tag"] == "gotcha"
    assert sig["first_seen"] == "2026-06-01"
    assert sig["age_days"] == 14
    assert "First thing" in sig["text"]


def test_parse_signal_handles_missing_date():
    sig = fs.parse_signal("`[unused]` **No date here** something", TODAY)
    assert sig["tag"] == "unused"
    assert sig["first_seen"] is None
    assert sig["age_days"] is None
```

- [ ] **Step 2: Run test to verify it fails**

Run: `.venv/bin/python -m pytest tests/test_friction_scan.py::test_parse_signal_extracts_tag_and_date_and_age -v`
Expected: FAIL — `parse_signal` undefined.

- [ ] **Step 3: Write minimal implementation**

Add near the top (after imports):

```python
TAG_RE = re.compile(r"`\[(friction|gotcha|recurring|unused)\]`")
DATE_RE = re.compile(r"(\d{4})-(\d{2})-(\d{2})")
```

Add the function (above the `__main__` block):

```python
def parse_signal(raw, today):
    """Turn one raw signal block into a structured dict."""
    tag_m = TAG_RE.search(raw)
    date_m = DATE_RE.search(raw)
    if date_m:
        first_seen = date_m.group(0)
        seen = datetime.date(int(date_m.group(1)), int(date_m.group(2)), int(date_m.group(3)))
        age_days = (today - seen).days
    else:
        first_seen = None
        age_days = None
    return {
        "tag": tag_m.group(1) if tag_m else None,
        "first_seen": first_seen,
        "age_days": age_days,
        "recurrence_count": 1,          # refined in Task 3
        "referenced_paths": [],         # filled in Task 3
        "still_exists": True,           # filled in Task 3
        "text": " ".join(raw.split()),
    }
```

- [ ] **Step 4: Run test to verify it passes**

Run: `.venv/bin/python -m pytest tests/test_friction_scan.py -v`
Expected: PASS (4 tests).

- [ ] **Step 5: Commit**

```bash
git add scripts/friction-scan.py tests/test_friction_scan.py
git commit -m "feat(kaizen): parse tag/first_seen/age per signal"
```

---

## Task 3: Recurrence count + referenced paths + still_exists

**Files:**
- Modify: `scripts/friction-scan.py`
- Test: `tests/test_friction_scan.py`

- [ ] **Step 1: Write the failing test**

```python
def test_recurrence_from_ordinal():
    assert fs.parse_recurrence("blah 5th occurrence (06-05/06/06) blah") == 5


def test_recurrence_from_datelist_when_no_ordinal():
    # three slash-separated date fragments → recurrence 3
    assert fs.parse_recurrence("recurred (06-05/06-09/06-10) again") == 3


def test_recurrence_defaults_to_one():
    assert fs.parse_recurrence("a one-off gotcha") == 1


def test_parse_paths_picks_repo_paths_only():
    paths = fs.parse_paths("see `scripts/repo-scan.py` and `latest` and `foo.yml`")
    assert "scripts/repo-scan.py" in paths
    assert "foo.yml" in paths
    assert "latest" not in paths


def test_still_exists_false_for_missing_path():
    sig = fs.parse_signal("`[unused]` **x** (2026-06-01): `scripts/nope-not-real.py`", TODAY)
    assert sig["still_exists"] is False


def test_still_exists_true_for_real_path():
    sig = fs.parse_signal("`[gotcha]` **x** (2026-06-01): `scripts/repo-scan.py`", TODAY)
    assert sig["still_exists"] is True
```

- [ ] **Step 2: Run test to verify it fails**

Run: `.venv/bin/python -m pytest tests/test_friction_scan.py -k "recurrence or paths or still_exists" -v`
Expected: FAIL — `parse_recurrence` / `parse_paths` undefined.

- [ ] **Step 3: Write minimal implementation**

Add the regexes near the others:

```python
ORDINAL_RE = re.compile(r"(\d+)(?:st|nd|rd|th)\s+(?:occurrence|reinforcement|time)", re.I)
DATELIST_RE = re.compile(r"\((\d{2}-\d{2}(?:/[\d/-]+)+)\)")
BACKTICK_RE = re.compile(r"`([^`]+)`")
PATH_EXTS = (".py", ".yml", ".yaml", ".md", ".sh", ".tf", ".j2", ".toml", ".cfg", ".hcl")
```

Add the helpers (above `parse_signal`):

```python
def parse_recurrence(text):
    """Best-effort recurrence count from explicit markers; default 1."""
    counts = [1]
    m = ORDINAL_RE.search(text)
    if m:
        counts.append(int(m.group(1)))
    dl = DATELIST_RE.search(text)
    if dl:
        counts.append(dl.group(1).count("/") + 1)
    return max(counts)


def parse_paths(text):
    """Backtick tokens that look like repo paths (contain '/' or a known ext)."""
    out, seen = [], set()
    for m in BACKTICK_RE.finditer(text):
        tok = m.group(1).strip()
        if ("/" in tok or tok.endswith(PATH_EXTS)) and tok not in seen:
            seen.add(tok)
            out.append(tok)
    return out
```

Then update `parse_signal` — replace the three placeholder fields:

```python
    paths = parse_paths(raw)
    still_exists = all(os.path.exists(os.path.join(REPO_ROOT, p)) for p in paths) if paths else True
    return {
        "tag": tag_m.group(1) if tag_m else None,
        "first_seen": first_seen,
        "age_days": age_days,
        "recurrence_count": parse_recurrence(raw),
        "referenced_paths": paths,
        "still_exists": still_exists,
        "text": " ".join(raw.split()),
    }
```

- [ ] **Step 4: Run test to verify it passes**

Run: `.venv/bin/python -m pytest tests/test_friction_scan.py -v`
Expected: PASS (10 tests).

- [ ] **Step 5: Commit**

```bash
git add scripts/friction-scan.py tests/test_friction_scan.py
git commit -m "feat(kaizen): recurrence count + referenced-path existence"
```

---

## Task 4: CLI — `load_signals`, `--json`, `--nudge`

**Files:**
- Modify: `scripts/friction-scan.py`
- Test: `tests/test_friction_scan.py`

- [ ] **Step 1: Write the failing test**

```python
def test_nudge_line_overdue_on_recurrence():
    sigs = [{"age_days": 2, "recurrence_count": 5}]
    line = fs.nudge_line(sigs)
    assert "OVERDUE" in line
    assert "max recurrence 5x" in line


def test_nudge_line_ok_when_quiet():
    sigs = [{"age_days": 3, "recurrence_count": 1}, {"age_days": 1, "recurrence_count": 1}]
    line = fs.nudge_line(sigs)
    assert "ok" in line
    assert "OVERDUE" not in line


def test_nudge_line_overdue_on_count():
    sigs = [{"age_days": 1, "recurrence_count": 1} for _ in range(8)]
    assert "OVERDUE" in fs.nudge_line(sigs)


def test_load_signals_reads_real_friction_file():
    path = os.path.join(os.path.dirname(__file__), "..", "docs", "FRICTION.md")
    sigs = fs.load_signals(path, TODAY)
    assert len(sigs) >= 1
    assert all(s["tag"] in {"friction", "gotcha", "recurring", "unused"} for s in sigs)
```

- [ ] **Step 2: Run test to verify it fails**

Run: `.venv/bin/python -m pytest tests/test_friction_scan.py -k "nudge or load_signals" -v`
Expected: FAIL — `nudge_line` / `load_signals` undefined.

- [ ] **Step 3: Write minimal implementation**

Add thresholds near the top (after `FRICTION = ...`):

```python
# Nudge thresholds (tunable; the /kaizen self-eval phase revisits these).
NUDGE_MIN_OPEN = 8
NUDGE_MAX_AGE_DAYS = 21
NUDGE_MIN_RECURRENCE = 3
```

Add the functions and replace the `__main__` block:

```python
def load_signals(path, today):
    with open(path, encoding="utf-8") as fh:
        text = fh.read()
    return [parse_signal(s, today) for s in split_signals(extract_open_section(text))]


def nudge_line(signals):
    n = len(signals)
    ages = [s["age_days"] for s in signals if s.get("age_days") is not None]
    oldest = max(ages) if ages else 0
    max_rec = max((s["recurrence_count"] for s in signals), default=0)
    overdue = n >= NUDGE_MIN_OPEN or oldest >= NUDGE_MAX_AGE_DAYS or max_rec >= NUDGE_MIN_RECURRENCE
    status = "OVERDUE — run /kaizen" if overdue else "ok"
    return f"kaizen: {n} open signals, oldest {oldest}d, max recurrence {max_rec}x — {status}"


def main():
    parser = argparse.ArgumentParser(description="Parse FRICTION.md Open signals for /kaizen.")
    parser.add_argument("--nudge", action="store_true", help="print a one-line overdue summary")
    parser.add_argument("--today", help="override today's date (YYYY-MM-DD) for testing")
    parser.add_argument("--file", default=FRICTION, help="path to FRICTION.md")
    args = parser.parse_args()

    if args.today:
        y, m, d = args.today.split("-")
        today = datetime.date(int(y), int(m), int(d))
    else:
        today = datetime.date.today()

    signals = load_signals(args.file, today)
    if args.nudge:
        print(nudge_line(signals))
    else:
        print(json.dumps(signals, indent=2))


if __name__ == "__main__":
    main()
```

- [ ] **Step 4: Run tests + smoke-test the CLI**

Run: `.venv/bin/python -m pytest tests/test_friction_scan.py -v`
Expected: PASS (14 tests).

Run: `python3 scripts/friction-scan.py --nudge`
Expected: one line like `kaizen: 13 open signals, oldest 14d, max recurrence 5x — OVERDUE — run /kaizen`.

Run: `python3 scripts/friction-scan.py | head -20`
Expected: a JSON array of signal objects.

- [ ] **Step 5: Commit**

```bash
git add scripts/friction-scan.py tests/test_friction_scan.py
git commit -m "feat(kaizen): friction-scan CLI (--json default, --nudge)"
```

---

## Task 5: The `/kaizen` command document

**Files:**
- Create: `.claude/commands/kaizen.md`

- [ ] **Step 1: Create the command file**

Write `.claude/commands/kaizen.md` with exactly this content:

````markdown
# Kaizen — curate the friction log into improvements

Consume the **Open signals** in `docs/FRICTION.md`: decide a verdict for each, migrate
durable knowledge into the right docs, and archive consumed signals into the decisions
ledger. **Curate-only** — do not hunt for new signals; capture stays manual. This is an
interactive, judgment-dense pass: propose, the operator decides, you apply on approval.

Design: `docs/superpowers/specs/2026-06-14-kaizen-command-design.md`.

## Phase 0 — scan
Run `python3 scripts/friction-scan.py > /tmp/kaizen.json`. It returns each Open signal as
`{tag, first_seen, age_days, recurrence_count, referenced_paths, still_exists, text}`.
Treat `still_exists: false` as a hint the signal may already be resolved.

## Phase 1 — triage
Order signals by `recurrence_count` desc, then `age_days` desc, then tag. **Group signals
that share a root cause** and curate them together. Present the agenda before editing
anything: total open, how many recurring (≥3), how many look already-resolved.

## Phase 2 — per-signal curation (interactive)
For each signal/group, present: a one-line restatement, the evidence (age, recurrence,
still-real), and a proposed **verdict**. Verdicts:

- **SYSTEMATIZE** — migrate the durable lesson into its right home (a runbook, an ADR,
  `CLAUDE.md`, a new `scripts/repo-scan.py` check, or a hook).
- **CHANGE** — adjust an existing tool/convention/config rather than document it.
- **PARK** — *out-of-phase but not obsolete*. Remove from the active tree, but write a
  ledger row recording **where it now lives (git SHA/branch/doc) and a resurrection
  trigger**. The default for "not touched lately but not wrong."
- **REMOVE** — *obsolete*: superseded, wrong, never worked, duplicated. Ledger row states
  why.
- **ALREADY-BUILT** — the systematization already exists / the fix landed; archive.
- **ACCEPTED** — conscious no-op (revisit-if-recurs); archive.
- **KEEP-OPEN** — still accruing, not ripe; leave it in *Open signals* (no ledger row).

Rules:
- **Knowledge is never removed** — SYSTEMATIZE/migrate it; only *active surface* (scripts,
  checks, conventions, plugins) is parked/removed.
- Every reductive verdict must classify *why unused*: **obsolete → REMOVE**,
  **out-of-phase → PARK**.
- The operator approves / modifies / rejects each verdict. On approval: do the mechanical
  edit (migrate text into the target doc; **move the signal from *Open signals* into the
  ledger table**; delete the parked/removed file) and show the diff.
- PARK and REMOVE both delete from the active tree — the difference is the ledger row.
  Git history + the ledger row are the park mechanism; never create a `parked/` directory.

## Phase 3 — close-out
- Add a new dated block under `## Kaizen reviews — decisions ledger` (newest first), same
  shape as the existing block: a table with columns **Signal (first seen) | Verdict |
  Resolution / where it lives now**.
- **Bias-to-remove discipline check:** if every verdict this pass was SYSTEMATIZE/CHANGE
  (only accreting), say so explicitly.
- **Self-eval (light):** is `/kaizen` being run often enough (oldest consumed age)? Should
  the nudge thresholds in `scripts/friction-scan.py` change? Note it.
- Run `make lint` if any code/docs changed; revert anything that breaks it.
- Commit per `CLAUDE.md` git conventions (one logical unit — straight to `main` if
  small/safe, a branch if sweeping; show the diff first for a branch).
- Print a one-line summary: `consumed X · parked Y · removed Z · kept-open W · migrated → <docs>`.

## Headless / cron (future)
Deferred until the notify + cron stack exists (`docs/TODO.md` 11.3). When run
non-interactively, **report only**: print the proposed verdicts and the nudge, do not edit
or commit.
````

- [ ] **Step 2: Verify it parses against the real log**

Run: `python3 scripts/friction-scan.py --today 2026-06-15 | python3 -c "import sys,json; print(len(json.load(sys.stdin)), 'signals')"`
Expected: prints a non-zero signal count with no traceback.

- [ ] **Step 3: Lint**

Run: `make lint`
Expected: passes (markdown isn't linted by yamllint/ansible-lint, but this confirms nothing else broke).

- [ ] **Step 4: Commit**

```bash
git add .claude/commands/kaizen.md
git commit -m "feat(kaizen): /kaizen command — interactive friction curation"
```

---

## Task 6: Stage-2 nudge in `/review-repo` + STATUS/TODO

**Files:**
- Modify: `.claude/commands/review-repo.md`
- Modify: `STATUS.md`
- Modify: `docs/TODO.md`

- [ ] **Step 1: Add the nudge to the review-repo command**

In `.claude/commands/review-repo.md`, find the "Phase 0 — deterministic pre-scan" section
(it runs `scripts/repo-scan.py`). Immediately after that paragraph, add:

```markdown
Also run `python3 scripts/friction-scan.py --nudge` and include its one-line output in the
report's summary — it flags when the kaizen loop (`/kaizen`) is overdue (recurring signals,
backlog size, or age). This is a reminder only; do not act on `FRICTION.md` from here.
```

- [ ] **Step 2: Add a STATUS row**

In `STATUS.md`, under "Real and working today", add a row:

```markdown
| `/kaizen` | Curate `docs/FRICTION.md` Open signals → decisions ledger (`scripts/friction-scan.py` Phase 0 + `.claude/commands/kaizen.md`). On-demand; `--nudge` surfaces in `/review-repo`. Headless/cron deferred (TODO 11.3). |
```

- [ ] **Step 3: Update TODO 11**

In `docs/TODO.md` item 11, mark sub-item 1 built:

Change `1. Build `/retro`: ...` to begin with `1. ~~Build `/retro``... ` — i.e. strike it
through and append: `DONE — built as `/kaizen` (scope narrowed to curate-only per the
2026-06-14 spec; `/retro` name dropped). `scripts/friction-scan.py` + `.claude/commands/kaizen.md`.`

- [ ] **Step 4: Lint**

Run: `make lint`
Expected: passes.

- [ ] **Step 5: Commit**

```bash
git add .claude/commands/review-repo.md STATUS.md docs/TODO.md
git commit -m "feat(kaizen): nudge in /review-repo; STATUS + TODO"
```

---

## Task 7: Dogfood — first real `/kaizen` run

This task is **not** automated; it is the first real use, done interactively with the operator.

- [ ] **Step 1:** Run `/kaizen` against the current Open signals (there are several,
  including the 3 added 2026-06-14 and the 5× execution-mode-menu signal).
- [ ] **Step 2:** Work the interactive curation (Phase 2) with the operator, applying
  verdicts on approval.
- [ ] **Step 3:** Confirm the close-out: ledger updated, `make lint` green, summary printed.
  This both processes the backlog and validates the command end-to-end.

---

## Self-review notes (author)

- **Spec coverage:** scope-A curate-only → Task 5 Phase 0–2; verdict model incl. PARK →
  Task 5 Phase 2 + ledger; single source FRICTION.md → Task 4 `load_signals`; interactive
  apply (B) → Task 5; ledger format → Task 5 Phase 3; scanner schema → Tasks 2–4; nudge +
  thresholds → Task 4 + Task 6; out-of-scope items → not built (correct); `/review-repo`
  relationship → Task 6 nudge. All covered.
- **No placeholders:** every code step shows complete code; the command doc is written in
  full.
- **Type consistency:** the signal dict keys (`tag, first_seen, age_days,
  recurrence_count, referenced_paths, still_exists, text`) are identical across Tasks 2–4
  and the command doc; `nudge_line` reads `age_days`/`recurrence_count` only.