boma/terraform/environments/offsite
sjat 847d9885e2 revert: back out mesh-hardening 1/3 on askari after it broke the Docker host
Incident 2026-06-17: applying base's nftables default-deny (forward policy drop)
to askari — a Docker host — broke container forwarding/NAT on reboot, and the
wt0-only sshd ListenAddress left no break-glass (ip_nonlocal_bind did NOT beat
the boot race). Recovery: disable nftables + restart docker (restore the wiped
NAT masquerade) + force-recreate the coordinator (it FATAL-looped unable to
download its GeoLite2 DB with no egress) -> mesh re-formed.

Back out the enablement so a future deploy can't re-break askari:
- offsite_hosts: base__ssh_listen_mesh_only=false, base__firewall_apply=false
- remove host_vars/askari.yml (manage over the WAN again, not wt0)
- tf/offsite: re-open WAN :22 to ubongo only (break-glass; already applied)

askari now: sshd on all interfaces (Ansible-managed), nftables disabled, WAN :22
open -> stable + reboot-survivable. The base feature code (sshd ListenAddress
option, firewall public zone) stays; it's just not enabled on Docker hosts.
Mesh-hardening 1/3 to be re-spec'd before any retry.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-06-17 22:16:17 +02:00
..
.terraform.lock.hcl fix(tf): declare required_providers in modules; pin offsite lock 2026-06-14 16:14:05 +02:00
backend.tf feat(tf): offsite environment — askari (CAX11/hel1/debian-13) 2026-06-14 12:03:31 +02:00
main.tf revert: back out mesh-hardening 1/3 on askari after it broke the Docker host 2026-06-17 22:16:17 +02:00
outputs.tf feat(tf): offsite environment — askari (CAX11/hel1/debian-13) 2026-06-14 12:03:31 +02:00
providers.tf docs(review): 2026-06-14 repo audit — M4a doc drift + Traefik→Caddy lag 2026-06-14 18:37:54 +02:00
terraform.tfvars.example fix(tf): cloud-init heredoc column-0 + firewall uses ubongo's WAN IP 2026-06-14 12:19:45 +02:00
variables.tf feat(tf): offsite environment — askari (CAX11/hel1/debian-13) 2026-06-14 12:03:31 +02:00