Add ADR-014 (sourcing technical knowledge)

Policy for how agents source authoritative knowledge and translate it to boma: - Facts (version-specific syntax/options) → consult version-matched official docs, cite, and stamp `verified: subject · tool ver · source · date`. Training memory is the draft, never the authority. - Judgments (best practices) → evidence to translate through boma's principles (extends ADR-013); never adopted because the docs/a blog say so. - Consultation is risk-based (security / unfamiliar-or-fast-moving / can't quote a flag with confidence), with a "from memory, unverified" transparency backstop. - Sources: context7 → WebFetch upstream → claude-code-guide → deep-research; match the PINNED version, not "latest". - Graceful degradation: the plugin accelerators may be absent on a fresh checkout, so the policy commits to the principle and falls back to WebFetch/WebSearch. CLAUDE.md gains a concise operating rule + pointer. TODO 10.4 decided. New TODO 10.7 tracks making the plugin/MCP toolchain reproducible from the repo (it is not synced by account nor carried by git) — surfaced by a portability check this run. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-06-04 20:07:18 +02:00 · 2026-06-04 20:07:18 +02:00 · 68a37d51f1
commit 68a37d51f1
parent 2f4218814a
3 changed files with 129 additions and 1 deletions
--- a/CLAUDE.md
+++ b/CLAUDE.md
@ -164,6 +164,24 @@ Single-contributor, trunk-based (no merge requests / approval gates):

 ---

+## Sourcing technical knowledge (ADR-014)
+
+- **Facts vs judgments.** Version-specific facts (syntax, options, defaults) have one
+  authoritative answer — consult **version-matched** official docs and cite. Best
+  practices are *evidence to translate through boma's principles* (ADR-013), never
+  authority ("because the docs / a blog say so" is banned).
+- **When to verify (risk-based):** required when security-relevant, the tool is
+  unfamiliar / fast-moving or newer than your training, or you'd assert a specific
+  flag/option/default you can't quote with confidence. Otherwise memory is fine — but
+  mark any from-memory version-specific claim **"from memory, unverified."**
+- **Sources:** `context7` for library docs · upstream docs via `WebFetch` · Claude
+  Code/SDK/API → `claude-code-guide` agent · broad questions → `deep-research`. These
+  are plugins and may be absent on a fresh checkout — fall back to `WebFetch`/`WebSearch`
+  (core tools). Match the **pinned** version, not "latest."
+- **Stamp verified facts** next to them: `verified: <subject> · <tool> <version> · <source> · <YYYY-MM-DD>`.
+
+---
+
 ## Further reading

 | Topic                  | File                                  |
@ -174,6 +192,7 @@ Single-contributor, trunk-based (no merge requests / approval gates):
 | Per-service security checklist | `docs/security/service-checklist.md` |
 | Per-service security record (template) | `docs/security/service-security-template.md` |
 | Heritage / V4 policy   | `docs/decisions/013-heritage-v4.md`   |
+| Sourcing tech knowledge | `docs/decisions/014-knowledge-sourcing.md` |
 | Toolchain choices      | `docs/decisions/003-toolchain.md`     |
 | Docker & Compose model | `docs/decisions/004-docker-model.md`  |
 | Bootstrapping hosts    | `docs/decisions/005-bootstrapping.md` |
--- a/docs/TODO.md
+++ b/docs/TODO.md
@ -76,9 +76,15 @@
    1. ~~Policy for how we collaborate with references to baobabAnsibleV4 without misusing it.~~ DECIDED — ADR-013.
    2. Policy for how we write key documents like ADRs.
    3. Further development on how we we collaborate on designing the foundation for the project - seperate from how we implement new containers etc.
-    4. How do we make sure agents always use the latest official documentation for the technologies etc. we use?
+    4. ~~How do we make sure agents always use the latest official documentation for the technologies etc. we use?~~ DECIDED — ADR-014 (facts → version-matched docs, cited + stamped; best practices → translated per ADR-013; risk-based triggers; graceful fallback to WebFetch).
    5. Always subagent driven?
    6. When AI deploys, i.e. runs playbooks etc., should we make a methodology so that it does not have to poll all the time or review all the output. Perhaps something about the MAKE method could provide only the relevant feedback?
+    7. **Reproducible agent toolchain** (surfaced by ADR-014) — plugins/MCP
+       (`context7`, `superpowers`, `deep-research`, `claude-code-guide`) live under
+       `~/.claude/` per machine; they are NOT synced by account nor carried by git, so
+       a fresh clone lacks them. Make them reproducible from the repo:
+       `extraKnownMarketplaces` + `enabledPlugins` in `.claude/settings.json` + a
+       bootstrap note/script. Tie into control-node/AI setup (items 5 and 7).

 11. **Kaizen loop** — set up ~2026-06-06 (one week from now).
    1. Build `/retro`: reads `docs/FRICTION.md` + recurring `/review-repo`
--- a/docs/decisions/014-knowledge-sourcing.md
+++ b/docs/decisions/014-knowledge-sourcing.md
@ -0,0 +1,103 @@
+# ADR-014 — Sourcing technical knowledge (docs and best practices)
+
+## Context
+
+Most work in boma is done by AI agents drawing on training memory, which is stale
+and hallucination-prone for version-specific facts. The repo **pins tool versions**
+(`requirements.txt`, `requirements.yml`, image tags), so the right reference is the
+docs for *our pinned version*, not "the latest." The capability to fetch
+authoritative knowledge already exists (`context7`, `WebFetch`/`WebSearch`, the
+`deep-research` skill, the `claude-code-guide` agent) but is ungoverned. This ADR
+sets the policy for sourcing knowledge and translating it to boma's aims. (Resolves
+TODO 10.4.)
+
+## Two kinds of knowledge, two rules
+
+- **Facts** — version-specific syntax, options, defaults, behaviour. There is one
+  authoritative answer: the version-matched official docs. Training memory is the
+  *draft*, never the authority.
+- **Judgments** — "best practice," how to structure or approach something. External
+  best practices are **evidence to translate through boma's principles**, never
+  adopted because the docs or a blog say so. This extends ADR-013's
+  translate-don't-transplant leash to all external sources: the acceptance test is
+  *"can this be justified from boma's principles?"* — the source is the prompt, not
+  the authority.
+
+## When consulting is required (risk-based)
+
+Routine, well-known syntax may rely on memory. Consulting a version-matched
+authoritative source is **required** when any of these hold:
+
+- it is **security-relevant**; or
+- the tool is **unfamiliar or fast-moving**, or the version is **newer than the
+  agent's training**; or
+- the agent is about to assert a **specific flag/option/default it cannot quote with
+  confidence**.
+
+**Transparency backstop:** because "I'm uncertain" is an unreliable self-assessment
+(agents are often confidently wrong), any version-specific claim given *from memory*
+is marked **"from memory, unverified"** so it is visible and checkable. By the time
+something is durable and version-specific, it has usually crossed a trigger and
+should be verified, not left unverified.
+
+## Source hierarchy
+
+1. **Version-matched official docs** — `context7` for libraries/frameworks; upstream
+   docs via `WebFetch` for tools. Match the pinned version.
+2. **Claude Code / SDK / API questions** → the `claude-code-guide` agent.
+3. **Broad or contested questions** → the `deep-research` skill (multi-source,
+   fact-checked).
+4. Reputable references below those. **Training memory is never the authority** for a
+   version-specific fact.
+
+Cite the source whenever one is consulted.
+
+**Graceful degradation:** `context7` and the agents/skills above are *plugins* and may
+be absent on a fresh checkout (see "Reproducibility" below). The always-available
+fallback is `WebFetch`/`WebSearch` (core tools). This policy commits to the
+**principle** — consult version-matched authoritative docs and cite — never to a
+specific plugin, so it holds on a bare install.
+
+## Capture (hybrid)
+
+- **Facts** that can go stale get a durable, greppable stamp next to the fact (a
+  comment in the role/template, or a line in an ADR):
+
+  ```
+  verified: <subject> · <tool> <version> · <source> · <YYYY-MM-DD>
+  ```
+
+  This makes currency trackable and gives re-verification a trigger.
+- **Judgments** follow ADR-013: translated to boma's terms, durable docs stay clean,
+  lineage stays transient (commit/conversation).
+
+## Re-verification
+
+A stamp binds a fact to a **pinned version**. When that version bumps (a
+`requirements` change or an image upgrade), the stamps for that tool mark exactly
+what to re-check. A future `/review-repo` or `/security-review` check could flag
+stamps whose recorded version no longer matches what is pinned (stale-stamp
+detection) — a noted enhancement, not built yet.
+
+## Reproducibility of the toolchain
+
+The accelerators this policy prefers (`context7`, `deep-research`, `superpowers`,
+`claude-code-guide`) are **plugins under `~/.claude/`** — local per machine, **not**
+synced by Claude account and **not** carried by the git repo (only `.claude/commands`,
+`.claude/hooks`, `.claude/settings.json` travel). A fresh clone therefore lacks the
+plugin toolchain until it is reinstalled. Making it reproducible from the repo
+(`extraKnownMarketplaces` + `enabledPlugins` in `.claude/settings.json`, plus a
+bootstrap step) is tracked in `docs/TODO.md` and tied to control-node/AI setup. Until
+then, the graceful-degradation fallback above keeps the policy working.
+
+## Decision
+
+- Distinguish **facts** (consult version-matched authoritative docs; cite; stamp) from
+  **judgments** (translate through boma's principles per ADR-013; never authority).
+- Require consultation **risk-based** (security / unfamiliar-or-fast-moving / can't
+  quote with confidence), with a from-memory transparency backstop.
+- Commit to the principle, not a tool — degrade to `WebFetch`/`WebSearch` when plugins
+  are absent.
+
+See also: ADR-013 (heritage / translate-don't-transplant), ADR-011 (version pinning),
+ADR-008 (testing/verification).