SRD-10 · Incident Investigation & Corrective Action System · Mar 27–29, 2026
Highlander
Built solo during a 59-hour hackathon by coordinating a multi-agent AI team — TPM, two SWEs, and QA — across a Flutter + Go stack. One developer. Four agents. 281 commits. Zero open issues.
Claude Opus + SonnetFlutter · GoADA / WCAGPlaywright · 542 testsRAG · AutoResearchMCP · JSON-RPC 2.0
AutoResearch was used to systematically improve the in-app AI assistant by auto-generating a structured wiki from the codebase on every commit. Rather than manually tuning prompts, the eval pipeline measures accuracy before and after each update — creating a continuous optimization loop. The same pattern Shopify used for a 53% improvement across 120 experiments.
10%
baseline accuracy
88%
with RAG wiki
+79pt
net lift
Eval results by domain (42 test cases)
Baseline
10%
With wiki RAG
88%
100%
Routes & RBAC
100%
Accounts
100%
Incidents
100%
OSHA compliance
100%
Tech architecture
+79pt
net accuracy lift
Issue breakdown
Enhancement
74 (72%)
Bug
26 (25%)
Documentation
3 (3%)
Difficulty distribution
9
Trivial
30
Routine
30
Complex
1
Critical
Agent team
TPMTechnical Program Manager
Orchestration, GitHub Issues, architecture decisions, model routing — Sonnet for routine, Opus for complex.