01 / AI-native BMC autonomous vulnerability discovery & validation (latest)
BMC Red-Team Lab
Challenge
In the era of fully-automated AI lock-picks (Mythos and its successors), enterprises can no longer rely on once-a-year scans or one-shot external pentests. BMCs — the most sensitive, lowest-level server management surface — demand a 24/7, self-hosted, self-scheduled red-team loop. Generic agent shells ("OpenClaw + raw LLM") aimed at production BMCs are unguarded bombs on physical hardware.
Execution focus
- 232 specialized red-team skills routed across 7 deep assessment lanes — Unauth-DAST, Web UI deep parsing, Redfish privilege mapping, IPMI / OEM command audit, iKVM remote desktop, supply-chain SBOM, and the OpenBMC / libpldm collaboration track.
- ADR 0005 multi-model Council architecture: a Hunter (Claude Opus 4.7, Thinking) and a Skeptic (GPT-5.5, Extra Reasoning) debate every candidate, and a deterministic-code Governor makes the final Fail-Closed call — neutralizing single-model hallucination and prompt-injection contamination.
- Proof Ladder: every finding is promoted step by step — static candidate → deployed daemon reachability → controlled lab reproduction → exploitability confirmation. No "AI memo for executives" passes as a security conclusion; every high / critical finding ships with a replayable evidence pack.
- ADR 0002 purple closed-loop discipline: every offensive primitive auto-generates a paired Sigma detection rule that lands in the SIEM in parallel with remediation. Dangerous phases require explicit --allow-* approval + watchdog preflight / after-action + a global PANIC halt; confirmed vulnerabilities trigger an automatic coordinated-disclosure timer.
- Field result: the OpenBMC `libpldm decode_get_types_resp()` report progressed from a source-level OOB read to deployed pldmd reachability, controlled fake MCTP peer-path evidence, and a candidate fix shape — the OpenBMC security team responded and indicated they will address further issues in the area. The same completion-code-first pattern extends to FRU / BIOS / Platform / Firmware Update / IBM OEM decoder hardening candidates.