Search intentQuery: AI agent evaluation framework for production
Turn agent-eval research into a shippable test harness.
Compare eval, tracing, and CI paths for agent workflows, then preserve the same production-eval intent into a Build Packet or proof draft.

