Add /review-repo command with deterministic pre-scan and reviews store

New on-demand repo audit: scripts/repo-scan.py does the cheap deterministic checks (markers, broken refs, unencrypted vaults) and inventory; the command fans out judgement reviewers across four dimensions, applies only safe/obvious fixes, and writes a tracked report to docs/reviews/. Cron + email deferred.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-05-30 18:56:01 +02:00 · 2026-05-30 18:56:01 +02:00 · b33130eea9
commit b33130eea9
parent 5c087b413b
5 changed files with 262 additions and 0 deletions
--- a/.claude/commands/review-repo.md
+++ b/.claude/commands/review-repo.md
@@ -0,0 +1,89 @@
+# Review the repo for staleness, drift, and contradictions
+
+Audit the whole repo across four dimensions, apply only the safe/obvious fixes,
+report the rest, and write a tracked report to `docs/reviews/`.
+
+## Dimensions (equal weight)
+
+1. **Cruft / staleness** — code, comments, or notes left behind after the thing they
+   addressed was resolved; commented-out blocks; done-but-undeleted TODOs; orphans.
+2. **Design conformance** — does the repo obey the ADRs and CLAUDE.md conventions
+   (FQCN, tags, `rolename__var`, nested vault, Makefile-as-interface, etc.)?
+3. **Consistency & intent** — contradictions (doc↔doc and doc↔code), overlaps,
+   duplication, and whether intent is actually documented.
+4. **Docs-vs-reality drift** — do `STATUS.md` and the ADRs match what is actually
+   built?
+
+## Rubric (the source of truth to measure against)
+
+- `docs/decisions/` (ADRs) — the design decisions.
+- `CLAUDE.md` — the conventions.
+- `STATUS.md` — what is actually built vs only planned.
+
+## Process
+
+### Phase 0 — deterministic pre-scan
+Run `python3 scripts/repo-scan.py > /tmp/repo-scan.json`. It returns the **inventory**
+(roles, ADRs, runbooks, playbooks, scripts — your shard list) and **exact findings**
+(markers, broken refs, unencrypted vaults). Fold these into the report verbatim.
+
+### Phase 1 — fan-out judgement review
+Scale to repo size:
+- **Small** (≤ ~10 roles, like boma today): a few sub-agents, or one pass per area.
+- **Large** (many roles/docs — heading toward AnsibleBaobabV4's 100+ roles): fan out
+  **one reviewer per shard** (role / doc-cluster / ADR group) in parallel via the
+  Agent tool, or the Workflow orchestrator for the biggest sweeps.
+
+Give each reviewer: its slice **plus the rubric** and the four dimensions. Each
+returns structured findings — `{dimension, severity (high|medium|low),
+location (file:line), description, suggested_fix, auto_fixable (bool)}`.
+
+### Phase 2 — synthesis
+- Merge and dedupe all findings (deterministic + reviewer).
+- Run **one cross-cutting reviewer** over the full ADR set + `STATUS.md` + `CLAUDE.md`
+  to catch contradictions that span files (per-shard agents can't see these).
+- Diff against the previous run's `docs/reviews/<prev>-findings.json` and tag each
+  finding **new / recurring / resolved**.
+- Prioritise by severity; split into auto-fixable vs report-only.
+
+### Phase 3 — act
+- Apply **only** the safe/obvious fixes (allow-list below). Run `make lint` after;
+  revert any fix that breaks it.
+- Write the report and machine findings (see Output) and generate a follow-up prompt.
+- Commit per CLAUDE.md git conventions (small/safe → `main`; a larger batch → a
+  branch, diff shown first). Summarise to the user: counts, what was auto-fixed, the
+  top open findings, and the follow-up prompt.
+
+## Auto-fix allow-list — safe and obvious ONLY
+
+**Allowed:** typos in prose/comments; broken cross-reference paths (repoint to the
+correct existing path); dead commented-out blocks with no explanatory value;
+obviously-stale comments that contradict the code they annotate; trivial formatting;
+updating a doc reference to a renamed file.
+
+**Never:** anything that changes runtime behaviour, logic, variables, or task order;
+anything touching secrets, vault files, or credentials; anything where intent is
+ambiguous or the answer is a judgement call; deleting files; resolving a
+contradiction between two ADRs (that is a design decision — report it).
+
+**When unsure, REPORT — do not fix.**
+
+## Output (written to `docs/reviews/`)
+
+- `docs/reviews/<YYYY-MM-DD>-review.md` — human report.
+- `docs/reviews/<YYYY-MM-DD>-findings.json` — machine-readable (next run's diff + the
+  cron email payload).
+- `docs/reviews/latest.md` — overwrite with a copy of the newest report.
+
+Report contents: run metadata (date, reviewed commit SHA); summary counts by
+dimension/severity; auto-fixes applied (with file list); open findings (prioritised,
+grouped by dimension, each with location + suggested fix + new/recurring tag); and a
+generated, copy-pasteable **follow-up prompt**. See `docs/reviews/README.md`.
+
+## Headless / cron (future — `docs/todo.md` "Scheduled work")
+
+When run non-interactively (`claude -p`, e.g. a fortnightly cron): **report only** —
+do not apply fixes or push to `main`. Write the report artifact and email its summary
++ the follow-up prompt to the maintainer via the machine's email skill (other
+projects on this host already use it). The report file is the email payload. Validate
+the headless email path when wiring the cron job.
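
Reviewer note (not part of the commit): the Phase 2 step that diffs against the previous run's findings and tags each one new / recurring / resolved could be sketched as below. This is a minimal illustration under stated assumptions, not the command's actual logic: `finding_key` and `tag_findings` are hypothetical names, the `status` field is an assumed addition to the finding schema, and identity is assumed to be (dimension, file, description) with the line number dropped so that a finding that merely moved still counts as recurring.

```python
def finding_key(finding):
    """Hypothetical identity key: dimension, file (line number dropped), description."""
    path = finding["location"].rsplit(":", 1)[0]
    return (finding["dimension"], path, finding["description"])


def tag_findings(previous, current):
    """Tag current findings as new/recurring and append resolved ones from the previous run."""
    prev_by_key = {finding_key(f): f for f in previous}
    curr_keys = {finding_key(f) for f in current}

    tagged = []
    for f in current:
        status = "recurring" if finding_key(f) in prev_by_key else "new"
        tagged.append({**f, "status": status})  # "status" is an assumed field name

    # Anything seen last time but absent now is treated as resolved.
    for key, f in prev_by_key.items():
        if key not in curr_keys:
            tagged.append({**f, "status": "resolved"})
    return tagged


previous = [
    {"dimension": "cruft", "location": "roles/a/tasks/main.yml:12", "description": "stale TODO"},
    {"dimension": "drift", "location": "STATUS.md:5", "description": "role listed but not built"},
]
current = [
    {"dimension": "cruft", "location": "roles/a/tasks/main.yml:15", "description": "stale TODO"},
    {"dimension": "consistency", "location": "CLAUDE.md:3", "description": "conflicting naming rule"},
]
statuses = [f["status"] for f in tag_findings(previous, current)]
```

Keying on the description text is fragile if reviewers reword a finding between runs; a stable identifier emitted by the reviewer would be more robust, but the command file does not define one.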
--- a/CLAUDE.md
+++ b/CLAUDE.md
@@ -30,6 +30,7 @@ Full design rationale: `docs/decisions/`
 | Check mode (dry run)          | `make check PLAYBOOK=<name>`                     |
 | Deploy a playbook             | `make deploy PLAYBOOK=<name>`                    |
 | Scaffold a new role           | `make new-role NAME=<name>`                      |
+| Review repo for drift/cruft   | `/review-repo` (Claude command)                  |
 | Encrypt a vault file          | `make encrypt FILE=<path>`                       |
 | Decrypt a vault file          | `make decrypt FILE=<path>`                       |
 | Install Python deps           | `make setup`                                     |
--- a/STATUS.md
+++ b/STATUS.md
@@ -19,6 +19,7 @@ _Last reviewed: 2026-05-30._
 | `git` | Initialized, trunk-based on `main`, pushed to `origin` (`forgejo.nyumbani.baobab.band:7577`). |
 | Pre-commit hooks | Configured: lint, gitleaks, vault-encryption guard. Activate with `pre-commit install` after `make setup`. |
 | Vault password client | `scripts/vault-pass-client.sh` fetches the master password from Vaultwarden via `rbw` (wired as `vault_password_file`). Requires `rbw` installed + `rbw unlock`. |
+| `/review-repo` | Repo audit: `scripts/repo-scan.py` (Phase 0) + `.claude/commands/review-repo.md`, reports to `docs/reviews/`. On-demand only; cron + email deferred (`docs/todo.md`). |
 | Terraform HCL (`terraform/`) | Written (proxmox VM module + envs) — but never run; see below |

 ## Scaffolded but empty — NOT implemented
--- a/docs/reviews/README.md
+++ b/docs/reviews/README.md
@@ -0,0 +1,25 @@
+# docs/reviews/
+
+Tracked output of the `/review-repo` command (one set of files per run). This is an
+**audit trail** — committed, not hand-edited. The command writes these files; don't
+edit them yourself.
+
+## Files per run
+
+| File | Purpose |
+|---|---|
+| `<YYYY-MM-DD>-review.md` | Human-readable report |
+| `<YYYY-MM-DD>-findings.json` | Machine-readable findings — used to diff new/recurring/resolved on the next run, and as the cron email payload |
+| `latest.md` | A copy of the most recent report (stable path for quick reference / email) |
+
+## What a report contains
+
+- **Run metadata** — date and the commit SHA reviewed.
+- **Summary** — finding counts by dimension and severity.
+- **Auto-fixes applied** — what the run fixed (safe/obvious only), with a file list.
+- **Open findings** — prioritised, grouped by dimension; each with a location, a
+  suggested fix, and a `new` / `recurring` / `resolved` tag (vs the previous run).
+- **Follow-up prompt** — a copy-pasteable prompt to act on the open findings.
+
+The four review dimensions and the auto-fix safety rules live in
+`.claude/commands/review-repo.md`.
--- a/scripts/repo-scan.py
+++ b/scripts/repo-scan.py
@@ -0,0 +1,146 @@
+#!/usr/bin/env python3
+"""
+Phase-0 deterministic pre-scan for the /review-repo command.
+
+Python standard library only. Emits a JSON object to stdout:
+  - inventory: roles, adrs, runbooks, playbooks, scripts  (the shard list)
+  - findings:  exact, no-judgement issues (markers, broken refs, unencrypted vaults)
+
+The *judgement* review — contradictions, design-conformance, stale intent — is done
+by the /review-repo fan-out reviewers, NOT here. This script only catches the cheap,
+exact things so the reviewers can focus on what needs reasoning.
+
+Usage:  python3 scripts/repo-scan.py [repo_root]  > scan.json
+"""
+import json
+import os
+import re
+import sys
+
+ROOT = os.path.abspath(sys.argv[1] if len(sys.argv) > 1 else ".")
+
+PRUNE = {".git", ".venv", ".collections", ".ansible", ".worktrees",
+         ".pytest_cache", "node_modules", "__pycache__"}
+SKIP_PREFIX = os.path.join("docs", "reviews")  # don't scan our own reports
+SOURCE_EXTS = {".yml", ".yaml", ".j2", ".py", ".sh", ".md", ".tf", ".cfg", ".ini"}
+
+MARKER_RE = re.compile(r"(?<![(|])\b(TODO|FIXME|XXX|HACK)\b(?![|)])")
+ADR_REF_RE = re.compile(r"\bADR-(\d{3})\b")
+PATH_REF_RE = re.compile(r"(?:docs|scripts|roles|inventories|terraform|playbooks)/[\w./-]+")
+PLACEHOLDER = set("<>*${}")
+
+
+def walk_files():
+    for dirpath, dirnames, filenames in os.walk(ROOT):
+        dirnames[:] = [d for d in dirnames if d not in PRUNE]
+        for f in filenames:
+            yield os.path.join(dirpath, f)
+
+
+def rel(path):
+    return os.path.relpath(path, ROOT)
+
+
+def inventory():
+    def listdir(*parts, want_dirs=False, suffixes=None):
+        d = os.path.join(ROOT, *parts)
+        if not os.path.isdir(d):
+            return []
+        out = []
+        for e in sorted(os.listdir(d)):
+            full = os.path.join(d, e)
+            if want_dirs and not os.path.isdir(full):
+                continue
+            if suffixes and not e.endswith(suffixes):
+                continue
+            out.append(e)
+        return out
+
+    return {
+        "roles": listdir("roles", want_dirs=True),
+        "adrs": listdir("docs", "decisions", suffixes=(".md",)),
+        "runbooks": listdir("docs", "runbooks", suffixes=(".md",)),
+        "playbooks": listdir("playbooks", suffixes=(".yml", ".yaml")),
+        "scripts": listdir("scripts"),
+    }
+
+
+def adr_numbers():
+    dec = os.path.join(ROOT, "docs", "decisions")
+    nums = set()
+    if os.path.isdir(dec):
+        for f in os.listdir(dec):
+            m = re.match(r"(\d{3})-", f)
+            if m:
+                nums.add(m.group(1))
+    return nums
+
+
+def scan():
+    findings = []
+    adrs = adr_numbers()
+    for path in walk_files():
+        rpath = rel(path)
+        if rpath.startswith(SKIP_PREFIX):
+            continue
+        name = os.path.basename(path)
+
+        if name == "vault.yml":
+            try:
+                text = open(path, encoding="utf-8", errors="replace").read()
+            except OSError:
+                continue
+            if not text.startswith("$ANSIBLE_VAULT"):
+                real = [ln for ln in text.splitlines()
+                        if ln.strip() and not ln.lstrip().startswith("#") and ln.strip() != "---"]
+                if real:
+                    findings.append({"check": "vault-unencrypted", "severity": "high",
+                                     "path": rpath, "line": 1,
+                                     "detail": "vault.yml is not ansible-vault encrypted but has content"})
+            continue
+
+        if os.path.splitext(path)[1] not in SOURCE_EXTS:
+            continue
+        try:
+            lines = open(path, encoding="utf-8", errors="replace").readlines()
+        except OSError:
+            continue
+
+        for i, line in enumerate(lines, 1):
+            markers = sorted(set(m.group(1) for m in MARKER_RE.finditer(line)))
+            if markers:
+                findings.append({"check": "marker", "severity": "low", "path": rpath,
+                                 "line": i, "detail": f"{'/'.join(markers)}: {line.strip()[:120]}"})
+            for m in ADR_REF_RE.finditer(line):
+                if m.group(1) not in adrs:
+                    findings.append({"check": "broken-adr-ref", "severity": "medium", "path": rpath,
+                                     "line": i, "detail": f"references ADR-{m.group(1)} (no such file)"})
+            for m in PATH_REF_RE.finditer(line):
+                # Skip paths that are part of a URL (e.g. a backend API endpoint).
+                token_prefix = line[max(line.rfind(" ", 0, m.start()),
+                                        line.rfind("\t", 0, m.start())) + 1:m.start()]
+                if "://" in token_prefix:
+                    continue
+                ref = m.group(0).rstrip(".,);:`'\"")
+                if any(c in ref for c in PLACEHOLDER):
+                    continue
+                if not os.path.exists(os.path.join(ROOT, ref)):
+                    findings.append({"check": "broken-path-ref", "severity": "medium", "path": rpath,
+                                     "line": i, "detail": f"references '{ref}' which does not exist"})
+    return findings
+
+
+def main():
+    result = {"root": ROOT, "inventory": inventory(), "findings": scan()}
+    json.dump(result, sys.stdout, indent=2)
+    sys.stdout.write("\n")
+    counts = {}
+    for f in result["findings"]:
+        counts[f["check"]] = counts.get(f["check"], 0) + 1
+    summary = ", ".join(f"{k}={v}" for k, v in sorted(counts.items())) or "no deterministic findings"
+    print(f"repo-scan: {len(result['inventory']['roles'])} roles, "
+          f"{len(result['inventory']['adrs'])} ADRs; {summary}", file=sys.stderr)
+
+
+if __name__ == "__main__":
+    main()
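
Reviewer note (not part of the commit): the lookarounds in `MARKER_RE` appear intended to skip markers that sit inside a regex-style alternation such as `(TODO|FIXME)`, which would otherwise make the script flag its own pattern definition. That reading is an inference from the pattern, not something the commit states. The sketch below copies the pattern verbatim from `scripts/repo-scan.py`; the `markers` helper is a hypothetical wrapper added only for illustration.

```python
import re

# Pattern copied verbatim from scripts/repo-scan.py.
MARKER_RE = re.compile(r"(?<![(|])\b(TODO|FIXME|XXX|HACK)\b(?![|)])")


def markers(line):
    """Hypothetical helper: the sorted, de-duplicated markers found on one line."""
    return sorted({m.group(1) for m in MARKER_RE.finditer(line)})


plain = markers("# TODO: remove after migration")
in_pattern = markers('re.compile(r"(TODO|FIXME)")')
plural = markers("done-but-undeleted TODOs")
two = markers("FIXME and HACK here")
parenthesised = markers("cleanup pending (TODO)")
```

Here `plain` is `["TODO"]` and `two` is `["FIXME", "HACK"]`, while `in_pattern` and `plural` are empty: the alternation is excluded by the lookarounds and `TODOs` fails the trailing word boundary. The trade-off is that `parenthesised` is also empty, so a genuine marker written directly inside parentheses goes unreported.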