Mission Brief
Autonomy Dev Loop in Weeks
Make autonomy behaviors testable, promotable, and repeatable with a sim CI loop.
Problem
Autonomy teams often rely on fragile, manual evaluation. This mission builds a repeatable sim CI loop that produces promotable results backed by evidence.
Constraints
- ITAR constraints and approved toolchains
- Scenario packs versioned and reviewable
- Regression gates and signed reports
- Interfaces explicit for sensor/actuation pipelines
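One way to read the "signed reports" constraint is that a report's contents can be checked for tampering before a promotion decision relies on it. The sketch below is a minimal illustration, not the mission's actual signing scheme: it assumes a shared-secret HMAC-SHA256 tag computed over a canonical JSON encoding. The report fields, the key, and the function names are hypothetical; an approved toolchain may mandate a different mechanism, such as asymmetric signatures.

```python
import hashlib
import hmac
import json


def canonical_bytes(report: dict) -> bytes:
    """Serialize deterministically so identical content always yields identical bytes."""
    return json.dumps(report, sort_keys=True, separators=(",", ":")).encode("utf-8")


def sign_report(report: dict, key: bytes) -> str:
    """Return a hex HMAC-SHA256 tag over the canonical report bytes."""
    return hmac.new(key, canonical_bytes(report), hashlib.sha256).hexdigest()


def verify_report(report: dict, signature: str, key: bytes) -> bool:
    """Constant-time comparison of the expected tag against the supplied one."""
    return hmac.compare_digest(sign_report(report, key), signature)


# Hypothetical report fields and key, for illustration only.
report = {"scenario_pack": "example-pack", "pack_version": "1.2.0", "passed": 41, "failed": 1}
key = b"demo-key-not-for-production"
signature = sign_report(report, key)

# Changing any field invalidates the signature.
tampered = dict(report, failed=0)
```

Sorting keys before signing means field order in the report file does not affect verification, which keeps reports reviewable as plain diffs.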
What ships
- Simulation CI pipeline that runs scenarios nightly
- Scenario pack format + generator utilities
- Signed performance reports and regressions dashboard
- Promotion gates in CI/CD
- AI-First interface contracts for wrappers and data
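A promotion gate of the kind listed above can be sketched as a comparison between baseline and candidate scenario results. This is a minimal illustration under stated assumptions: `ScenarioResult`, `find_regressions`, `promotion_gate`, the "higher score is better" convention, and the 0.02 tolerance are all hypothetical and not taken from the mission's own tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScenarioResult:
    scenario_id: str
    score: float  # assumed convention: higher is better


def find_regressions(baseline, candidate, tolerance=0.02):
    """Return ids of baseline scenarios that are missing from the candidate run
    or whose candidate score dropped by more than `tolerance`."""
    base = {r.scenario_id: r.score for r in baseline}
    cand = {r.scenario_id: r.score for r in candidate}
    regressions = []
    for scenario_id, base_score in base.items():
        if scenario_id not in cand or cand[scenario_id] < base_score - tolerance:
            regressions.append(scenario_id)
    return sorted(regressions)


def promotion_gate(baseline, candidate, tolerance=0.02) -> bool:
    """Allow promotion only when no scenario regressed."""
    return not find_regressions(baseline, candidate, tolerance)


# Hypothetical scenario names and scores.
baseline = [ScenarioResult("merge", 0.90), ScenarioResult("crosswind", 0.80)]
candidate = [ScenarioResult("merge", 0.91), ScenarioResult("crosswind", 0.70)]

regressed = find_regressions(baseline, candidate)
allowed = promotion_gate(baseline, candidate)
```

Treating a missing scenario as a regression is a deliberate choice in this sketch: a candidate that silently skips a scenario should not pass the gate.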
AI-First interface map
Interfaces are explicit, dependencies are documented, and component swaps are practiced.
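An explicit interface that supports swapping can be illustrated with a structural type: any component that satisfies the contract can stand in for another. The sketch below assumes a hypothetical `RangeSensor` contract with a single method; the class names, the method name, and the 5.0 m stop distance are illustrative only.

```python
from typing import Protocol, runtime_checkable


@runtime_checkable
class RangeSensor(Protocol):
    """Hypothetical contract: anything that reports a range in meters."""

    def read_range_m(self) -> float: ...


class SimulatedLidar:
    """Stand-in sensor that always reports a fixed range."""

    def __init__(self, fixed_range_m: float) -> None:
        self._range_m = fixed_range_m

    def read_range_m(self) -> float:
        return self._range_m


class ReplaySensor:
    """Stand-in sensor that replays recorded samples in a loop."""

    def __init__(self, samples) -> None:
        self._samples = list(samples)
        self._index = 0

    def read_range_m(self) -> float:
        value = self._samples[self._index % len(self._samples)]
        self._index += 1
        return value


def should_brake(sensor: RangeSensor, stop_distance_m: float = 5.0) -> bool:
    """Behavior code depends only on the contract, not on a concrete sensor."""
    return sensor.read_range_m() < stop_distance_m


# Swapping implementations requires no change to should_brake.
lidar = SimulatedLidar(3.0)
replay = ReplaySensor([10.0, 2.0])
```

Because the behavior depends only on `RangeSensor`, a swap drill amounts to passing a different object into the same function and rerunning the same scenarios.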
Success metrics
- Time to run a full eval suite
- Regression detection rate
- Promotion throughput
- Scenario coverage growth
- Signed report completeness
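Several of the metrics above reduce to simple ratios once their definitions are pinned down. The brief does not define them, so the formulas below are assumptions made for illustration: detection rate as the share of deliberately seeded regressions that the suite flagged, coverage growth as relative change in scenario count, and completeness as the share of required report fields that are present and non-empty. All function and field names are hypothetical.

```python
def regression_detection_rate(seeded: set, detected: set) -> float:
    """Share of known (seeded) regressions that the suite flagged."""
    if not seeded:
        raise ValueError("at least one seeded regression is required")
    return len(seeded & detected) / len(seeded)


def coverage_growth(previous_count: int, current_count: int) -> float:
    """Relative change in the number of scenarios between two periods."""
    if previous_count <= 0:
        raise ValueError("previous_count must be positive")
    return (current_count - previous_count) / previous_count


def report_completeness(report: dict, required_fields) -> float:
    """Share of required fields that are present and not None."""
    required = list(required_fields)
    if not required:
        raise ValueError("at least one required field is needed")
    present = sum(1 for name in required if report.get(name) is not None)
    return present / len(required)


# Hypothetical inputs.
rate = regression_detection_rate({"a", "b", "c", "d"}, {"a", "b", "c", "x"})
growth = coverage_growth(40, 50)
completeness = report_completeness(
    {"signature": "abc", "pack_version": "1.0", "results": None},
    ["signature", "pack_version", "results", "commit"],
)
```

Writing the definitions down as code keeps the metrics comparable from one reporting period to the next, whichever formulas a team ultimately adopts.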
Reuse kit
Starter structures you can adapt inside your environment.
DoD mapping
- Model velocity (~30 days) enabled by eval harnesses
- AI-First mandatory
- Pace-setting demo cadence