AI unit -- Codex

Similar to Devin, but it is also trained end-to-end with reinforcement learning.

company - OpenAI

Backlinks

blog post - OpenAI harness engineering 2026-02

We did the same for observability tooling. Logs, metrics, and traces are exposed to Codex via a local observability stack that’s ephemeral for any given worktree. Codex works on a fully isolated version of that app—including its logs and metrics, which get torn down once that task is complete. Agents can query logs with LogQL and metrics with PromQL. With this context available, prompts like “ensure service startup completes in under 800ms” or “no span in these four critical user journeys exceeds two seconds” become tractable.
...
The observability stack was built directly into AI unit -- Codex ^observability-stack-built-in-to-codex
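
To make the quoted idea concrete, here is a minimal sketch of how an agent might phrase a PromQL and a LogQL request against a local stack and check a latency budget. The endpoint paths are the standard Prometheus and Loki HTTP API routes; everything else is an assumption for illustration: the localhost ports, the metric name `service_startup_duration_seconds`, the `app="web"` label, and the helper names. It is not taken from the blog post and does not describe how Codex actually issues queries.

```python
from urllib.parse import urlencode

PROMETHEUS_URL = "http://localhost:9090"  # assumption: Prometheus on its default port
LOKI_URL = "http://localhost:3100"        # assumption: Loki on its default port


def promql_instant_url(query: str) -> str:
    """Build a URL for the Prometheus instant-query endpoint."""
    return f"{PROMETHEUS_URL}/api/v1/query?{urlencode({'query': query})}"


def logql_range_url(query: str, limit: int = 100) -> str:
    """Build a URL for the Loki range-query endpoint."""
    params = urlencode({"query": query, "limit": limit})
    return f"{LOKI_URL}/loki/api/v1/query_range?{params}"


def all_samples_within(response: dict, budget_seconds: float) -> bool:
    """Check that every sample in an instant-query vector result is under budget."""
    results = response["data"]["result"]
    # An empty result is no evidence either way, so treat it as a failure.
    if not results:
        return False
    return all(float(sample["value"][1]) < budget_seconds for sample in results)


if __name__ == "__main__":
    # Hypothetical metric and label names, used only for illustration.
    print(promql_instant_url("max(service_startup_duration_seconds)"))
    print(logql_range_url('{app="web"} |= "error"'))
```

A prompt such as "ensure service startup completes in under 800ms" then reduces to fetching the first URL and calling `all_samples_within(response, 0.8)` on the decoded JSON.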

codex sandbox

^ AI unit -- Codex sandbox

example - Codex asking to run outside the Codex sandbox

^ example - AI unit -- Codex asking to run outside the Codex sandbox

exec plan

A type of plan by company - OpenAI used with AI unit -- Codex

tweet - Claude Code stop hook that triggers a Codex code review

^ Claude Code stop hook that triggers an AI unit -- Codex code review

tweet - video of Symphony and Codex and Linear 2026-03

Symphony, AI unit -- Codex, and Linear, showing agent orchestration on a kanban board