
sjat/boma

Author	SHA1	Message	Date
sjat	3b029352b6	Add per-service SECURITY.md convention; one role per service Revise ADR-004 to a service-role standard: every service is its own self-contained role with a required file set including SECURITY.md, uniform deploy mechanics, and a deferred shared-engine option (with revisit trigger) recorded in the ADR. Add the per-service security record: - docs/security/service-security-template.md — canonical SECURITY.md template (exposure, checklist status, service-specific hardening, residual risks) - roles/<service>/SECURITY.md is where each service records how it meets the bar; /security-review aggregates roles/*/SECURITY.md and cross-checks against config - service-checklist.md noted as the generic bar the record answers Wire-up: new-role runbook step writes SECURITY.md from the template; ADR-002 governance bullet points at it; CLAUDE.md role conventions require it and mandate one-role-per-service; STATUS records the convention as defined-not-yet-applied. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-04 16:09:33 +02:00
sjat	19dd89b875	Re-challenge accepted risks; adopt CIS hardening + IDS Walked the seeded accepted-risk register (R1-R4) and turned inherited gaps into deliberate decisions: - Supply chain (R1): tightened to required baseline hygiene (digest pinning, official/verified images); active scanning deferred — stays an accepted risk - CIS (R2): adopted as a positive decision — CIS Debian L1+L2 (base role) + CIS Docker (docker_host + service checklist); app layer via the checklist - SELinux/AppArmor (R3): AppArmor becomes a baseline control (CIS-enforced); register keeps a clean "no SELinux" accept - IDS (R4): adopt AIDE (baseline via CIS) + Suricata on OPNsense + active alerting Register shrinks from 4 inherited gaps to 2 deliberate accepts. ADR-002 gains a Hardening standard section; STATUS + TODO 15 track the (unbuilt) implementation, including the CIS L2 partition impact on VM provisioning (ADR-006). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-04 15:15:39 +02:00
sjat	f338bccd46	Expand ADR-002 into a security baseline + strategy Add a managerial security frame on top of the host baseline: explicit threat model (opportunistic external, lateral movement/blast radius, operator/agent error; supply chain accepted-lower-priority), security principles, and four governance mechanisms that ADR-002 establishes and links out to: - docs/security/service-checklist.md — per-service security bar (referenced from the new-role runbook) - docs/security/accepted-risks.md — living accepted-risk register (R1-R4) - planned /security-review skill (TODO 8.5) - agent guardrails in CLAUDE.md "what Claude must not do" STATUS.md records the frame as present (manual enforcement) and /security-review as planned-not-built. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-04 14:39:51 +02:00
sjat	c57910eda8	Log Forgejo no-PR-workflow friction in FRICTION.md Forgejo origin is trunk-based with no merge-request gate, so the finishing-a-development-branch "open a PR" option doesn't apply — merge locally then push. Also carries earlier uncommitted FRICTION.md edits (emphasis normalization + 2026-05-31 ADR-status entry). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-01 11:22:26 +02:00
sjat	e12326148c	Note latest.md report mirror in ADR-012 Final-review minor: the /capacity-review skill overwrites a latest.md pointer alongside the dated report; record that in the ADR. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-01 10:40:16 +02:00
sjat	4c535c908e	Record ADR-012 + STATUS/CLAUDE/scripts docs for capacity tooling Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-01 10:34:38 +02:00
sjat	1060a9c08a	Add /capacity-review skill Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-01 10:32:07 +02:00
sjat	05694f6ea4	Complete capacity-scan.py: usage stub, subprocess glue, main() Adds gather_usage() (stubbed, returns available:false), known_hostnames() with graceful degradation when terraform/ansible-inventory are absent, _run_json() helper, and main() that parses reference.md and emits JSON. Three new TDD tests (12 total, all passing). Script exits 0 with valid JSON even when no cluster is provisioned. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-01 10:30:45 +02:00
sjat	8ed00c9206	Add hostname parsers + find_drift() to capacity-scan.py Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-01 10:24:11 +02:00
sjat	b240fa8bfe	Add compute_rollup() to capacity-scan.py Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-01 10:21:22 +02:00
sjat	07ecbb2789	Add capacity-scan.py with parse_table() Implements the parse_table() function and pytest test harness for the capacity-scan script. Tests cover header matching and graceful empty return when the required header is absent. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-01 10:20:10 +02:00
sjat	3ea9109ba2	Add hardware reference doc skeleton + reviews dir Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-01 10:14:53 +02:00
sjat	6ff5d55810	Add implementation plan for hardware capacity tooling Task-by-task TDD plan: reference.md skeleton, stdlib-only capacity-scan.py (parse_table, compute_rollup, drift, usage stub, main), /capacity-review skill, and ADR-012 + STATUS/CLAUDE/scripts-README updates. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-01 10:04:59 +02:00
sjat	88210db09c	Add hardware reference & capacity-evaluation design spec Brainstormed design for docs/hardware/reference.md (physical compute + network gear + workload placement intent), a stdlib-only capacity-scan.py, and an on-demand /capacity-review skill that reports to docs/hardware/reviews/. Mirrors the repo-scan -> /review-repo -> docs/reviews triad. TODO additions: schedule /capacity-review later and decide its usage-stats source (Proxmox RRD vs the Prometheus/Loki/Grafana/Alloy stack) before building any hook (8.4); reevaluate the stdlib-only script policy (#14). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-01 09:59:16 +02:00
sjat	ed3eeb0199	Log the mid-session hook-activation gotcha in FRICTION.md Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 22:19:22 +02:00
sjat	80bf9afea9	Add PreToolUse guard hooks: generated-file + rbw vault pre-flight Two project hooks (deny-only, fail open): block Write/Edit of generated inventories/<env>/hosts.yml, and block git commit when the rbw vault agent is locked. Both pipe-tested across all paths. Activate with a Claude Code restart (the watcher only tracks settings.json present at session start). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 22:14:40 +02:00
sjat	11af84938d	Add kaizen friction log and schedule the kaizen-loop setup docs/FRICTION.md: a running log of friction/gotchas/recurring-fixes/unused tooling, seeded with this session's real signals — raw material for the periodic kaizen review. docs/TODO.md: schedule building /retro in ~1 week, and record the Claude-setup decision. (Also carries your earlier backlog edits.) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 22:05:40 +02:00
sjat	778b581729	Record the Vaultwarden item name for the Forgejo token in ADR-010 Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 21:35:24 +02:00
sjat	37cece9dbd	Add ADR-010 (Forgejo integration) and rbw-unlocked pre-flight convention ADR-010: API tokens as least-privilege managed secrets, declarative-first (no click-ops), automation boundary, planned trunk-based CI. CLAUDE.md/AGENTS.md: check 'rbw unlocked' before vault-dependent tasks (incl. commits) rather than failing partway. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 21:34:07 +02:00
sjat	905bc92b15	Use local Terraform state; drop unworkable Forgejo HTTP backend (R10b) Forgejo's /raw/ API is read-only so it cannot serve as a Terraform HTTP state backend. Switch both envs to local state on the control node (ADR-006); remove the dead TF_HTTP_* credential hints. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 21:34:05 +02:00
sjat	0513971f40	Fix Forgejo registry path to owner/image format (review R10a) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 21:34:02 +02:00
sjat	bf9ce95e1e	Fix make new-role: brace expansion fails under dash The mkdir used shell brace expansion {tasks,handlers,...}, which /bin/sh (dash) does not support, so new-role created one literally-named dir and then errored. make new-role had never worked on this host. Use explicit mkdir paths. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 19:35:11 +02:00
sjat	1642d1786a	Wire Terraform vlan_tag and fix scaffold placeholder (R9,R11) R9: pass vlan_tag (default 20 = srv VLAN, ADR-007) from both envs to the proxmox_vm module so VMs are tagged, not on untagged vmbr0. R11: make new-role now sed-substitutes ROLE_NAME_PLACEHOLDER so scaffolded molecule converge works out of the box. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 19:34:02 +02:00
sjat	93f2a847c7	Reconcile CI to trunk-based; mark base/docker_host not-built (R6-R8,R15-R16) R6/R7: ADR-003 & ADR-008 CI pipelines rewritten trunk-based (push to main -> test -> staging -> [manual gate] production); CLAUDE.md no longer forbids pushing to main. R8: STATUS/roles-README/site.yml now say base & docker_host are not built (not in git), so a clean clone errors. R15/R16: ADR-001 table flagged as intended design; dropped the unbuilt 'monitoring agent' from the baseline. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 19:32:37 +02:00
sjat	bb2179a288	Apply review fixes R12-R14: printf scaffold, phantom control/ dir, Galaxy wording Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 19:19:47 +02:00
sjat	45ab6ced01	Purge residual .vault_pass references (review R1-R5) Point ADR-005, the new-host runbook, CONTRIBUTING, and AGENTS at the rbw/Vaultwarden flow instead of a .vault_pass file. Also record the cron-section idea in docs/TODO.md. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 19:17:25 +02:00
sjat	703f1716e5	review-repo: harden scanner, apply safe fixes, record first review First /review-repo run on boma. Hardened repo-scan.py (no TODO.md/prose false positives). Applied 7 safe fixes (DNS staleness x2, STATUS factual correction, hosts.yml path generalisation, trunk-based wording x2, scripts/README). Recorded the run and 17 open findings in docs/reviews/2026-05-30-*. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 19:10:58 +02:00
sjat	de38d1c68b	Rename backlog to docs/TODO.md and fix references Match the uppercase convention of the other top-level docs; includes the new Scheduled-work and sanity-check items, and repoints references in STATUS.md and the /review-repo command. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 19:01:22 +02:00
sjat	b33130eea9	Add /review-repo command with deterministic pre-scan and reviews store New on-demand repo audit: scripts/repo-scan.py does the cheap deterministic checks (markers, broken refs, unencrypted vaults) and inventory; the command fans out judgement reviewers across four dimensions, applies only safe/obvious fixes, and writes a tracked report to docs/reviews/. Cron + email deferred. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 18:56:01 +02:00
sjat	5c087b413b	Tick off completed README items in backlog Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 18:30:02 +02:00
sjat	9dc976facc	Clarify README scope and Terraform role; explain the boma name Broaden the intro beyond Ansible (Terraform + Ansible), state the infrastructure-not-personal-devices scope, and explain the Swahili name. Also replace the stale .vault_pass quick-start step with 'rbw unlock'. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 18:25:50 +02:00
sjat	3988fec211	Track discussion backlog (docs/todo.md) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 18:23:19 +02:00
sjat	810e6d557b	Correct Forgejo host to forgejo.nyumbani.baobab.band Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 18:16:38 +02:00
sjat	4ee1b66e23	Source vault password from Vaultwarden via rbw; nest vault structure Master vault password is fetched from Vaultwarden via the rbw agent (scripts/vault-pass-client.sh, wired as vault_password_file) instead of a plaintext .vault_pass. Vault secrets use a nested vault.<service>.<key> map. Encrypted vault.yml files are excluded from lint. Includes the host rename in Makefile and STATUS.md. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 18:16:35 +02:00
sjat	2dfa8ca9d6	Harden lint setup and clean inventory placeholders - Pin pre-commit ansible-lint hook to ansible-core==2.17.* (was floating, crashed) - Add pre-commit to requirements.txt - Align .yamllint with ansible-lint (comments-indentation off, octal rules on) - Rewrite inventory placeholders to lint-clean empty-group form Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 14:56:16 +02:00
sjat	19d93d32dc	Add project orientation and contributor docs Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 14:10:01 +02:00
sjat	9a8181ef18	Add Terraform VM-provisioning skeleton Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 14:10:01 +02:00
sjat	fe4228fb38	Add architecture decision records and runbooks Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 14:10:01 +02:00
sjat	3f1d7eb128	Add core Ansible scaffold, tooling, and pre-commit guards Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 14:10:01 +02:00


289 commits