Mission Brief

Autonomy Dev Loop in Weeks

Make autonomy behaviors testable, promotable, and repeatable with a sim CI loop.

Problem

Autonomy teams often rely on fragile manual evaluation. This mission builds a repeatable loop that produces promotable results with evidence.

Constraints

  • ITAR constraints and approved toolchains
  • Scenario packs versioned and reviewable
  • Regression gates and signed reports
  • Interfaces explicit for sensor/actuation pipelines

What ships

  • Simulation CI pipeline that runs scenarios nightly
  • Scenario pack format + generator utilities
  • Signed performance reports and regressions dashboard
  • Promotion gates in CI/CD
  • AI-First interface contracts for wrappers and data
AI-First interface map
Workflow / UI Tool Interface Model Wrapper Services / Data contract tests swap-ready Interfaces are explicit. Dependencies are documented. Swaps are practiced.

Success metrics

  • Time to run a full eval suite
  • Regression detection rate
  • Promotion throughput
  • Scenario coverage growth
  • Signed report completeness

Reuse kit

Starter structures you can adapt inside your environment.

DoD mapping

  • Model velocity (~30 days) enabled by eval harnesses
  • AI-First mandatory
  • Pace-setting demo cadence