codex-temporal

Temporal durable execution harness for the OpenAI Codex CLI agent.

Wraps the Codex agentic loop in a Temporal workflow so that multi-turn conversations, tool calls, and approval flows survive process failures and can be replayed deterministically.

Prerequisites

Rust (1.85+, edition 2024)
Temporal CLI (temporal server start-dev) or a running Temporal server
OpenAI API key (OPENAI_API_KEY env var)
Sibling repos checked out next to this one:

parent/
  codex/           # github.com/openai/codex   (branch: task/codex-temporal)
  sdk-core/        # github.com/mfateev/sdk-core (branch: fix/wait-condition-wakers)
  codex-temporal/  # this repo                  (branch: task/temporal-harness)

Quick start

1. Start a local Temporal server

temporal server start-dev

This starts a dev server on localhost:7233 with a web UI at http://localhost:8233.

2. Start the worker

The worker runs both the CodexWorkflow and the CodexActivities (model calls, shell execution) on the codex-temporal task queue.

export OPENAI_API_KEY="sk-..."

# Optional: override model (default: gpt-4o)
# export CODEX_MODEL="gpt-4o-mini"

# Optional: override Temporal address (default: http://localhost:7233)
# export TEMPORAL_ADDRESS="http://localhost:7233"

cargo run --bin codex-temporal-worker

3. Run a one-shot workflow

The client binary starts a single workflow and exits. Monitor progress in the Temporal UI.

cargo run --bin codex-temporal-client -- "What is 2+2? Reply with just the number."

For tool-use prompts:

cargo run --bin codex-temporal-client -- "Use shell to run 'echo hello world' and tell me the output."

4. Run the TUI

The TUI binary reuses the Codex ChatWidget but connects to a Temporal workflow instead of an in-process agent. The worker holds the API key and makes LLM calls — the TUI process does not need auth credentials or config files.

# Make sure the worker is running (step 2 above), then:
cargo run --bin codex-temporal-tui -- "Explain what Temporal is in two sentences."

You can also launch without an initial prompt and type interactively:

cargo run --bin codex-temporal-tui

The TUI renders the same chat interface as the standard Codex CLI. Events flow from the Temporal workflow back through the TUI in real time (session configured, agent messages, tool approval requests, turn completion).

5. Environment variables

Variable	Default	Description
`OPENAI_API_KEY`	(required)	OpenAI API key for model calls
`OPENAI_BASE_URL`	`https://api.openai.com/v1`	Base URL for the OpenAI API
`CODEX_MODEL`	`gpt-4o`	Model slug passed to the Responses API
`TEMPORAL_ADDRESS`	`http://localhost:7233`	Temporal server gRPC endpoint
`RUST_LOG`	`info`	Tracing filter (e.g. `codex_temporal=debug`)

Tests

Unit tests

cargo test --test unit_tests

18 tests covering entropy, clock, event sink, storage, types, signal payloads, and session/context constructors.

E2E tests

E2E tests spin up an ephemeral Temporal server (auto-downloads the CLI binary), create an in-process worker, and exercise the full TemporalAgentSession flow against the real OpenAI API.

OPENAI_API_KEY="sk-..." cargo test --test e2e_test -- --nocapture

Tests are skipped gracefully when OPENAI_API_KEY is not set.

Two test cases:

Test	What it does
`model_only_turn`	Submits a simple prompt, asserts `TurnStarted` and `TurnComplete` with a non-empty agent message
`tool_approval_flow`	Submits a prompt that triggers a shell tool call, waits for `ExecApprovalRequest`, sends approval, asserts `TurnComplete` after tool execution

TUI E2E tests

TUI E2E tests verify the full ChatWidget → wire_session → TemporalAgentSession → Temporal workflow round-trip. They require a running Temporal server.

# Start a dev server if one isn't running:
temporal server start-dev --port 7234

# Run the tests:
TEMPORAL_ADDRESS=http://localhost:7234 OPENAI_API_KEY="sk-..." \
  cargo test --test tui_e2e_test -- --nocapture

Two test cases:

Test	What it does
`tui_model_turn`	Wires a ChatWidget to a TemporalAgentSession, submits "Say hello in one word" via `initial_user_message`, asserts `TurnStarted`, agent messages, and `TurnComplete` all flow through the TUI event channel
`tui_tool_approval_request`	Submits a prompt requesting shell execution, asserts `ExecApprovalRequest` flows through the TUI pipeline from the Temporal workflow

Architecture

codex-temporal/src/
  lib.rs              Module declarations
  types.rs            Serializable I/O types and signal payloads
  entropy.rs          Deterministic RandomSource/Clock backed by workflow context
  sink.rs             BufferEventSink (in-memory EventSink with indexed polling)
  storage.rs          InMemoryStorage (in-memory StorageBackend)
  streamer.rs         ModelStreamer impl dispatching to model_call activity
  tools.rs            ToolCallHandler impl with approval gating + tool_exec activity
  activities.rs       Activity definitions (model_call, tool_exec)
  workflow.rs         CodexWorkflow (multi-turn interactive workflow)
  session.rs          TemporalAgentSession (AgentSession impl for client-side use)
  auth_stub.rs        NoopAuthProvider (stub AuthProvider for TUI — no credentials needed)
  models_stub.rs      FixedModelsProvider (stub ModelsProvider — single fixed model preset)
  bin/
    worker.rs         Temporal worker binary
    client.rs         CLI to start a one-shot workflow
    tui.rs            TUI binary (ChatWidget wired to TemporalAgentSession)

Protocol

The workflow exposes a signal/query interface for multi-turn interaction:

Signal	Purpose
`receive_user_turn`	Queue a new user message for processing
`receive_approval`	Approve or deny a pending tool call
`request_shutdown`	Gracefully terminate after the current turn

Query	Purpose
`get_events_since(index)`	Return new events since the given watermark (for client polling)

TemporalAgentSession implements the AgentSession trait by mapping submit(Op) to signals and next_event() to query polling with adaptive backoff.

Workflow execution flow

Client                    Temporal                     Worker
  |                         |                            |
  |-- start_workflow ------>|                            |
  |                         |-- WorkflowTask ----------->|
  |                         |                            |-- model_call activity
  |                         |<-- ScheduleActivity -------|
  |                         |-- ActivityTask ----------->|
  |                         |<-- ActivityResult ---------|  (model response)
  |                         |-- WorkflowTask ----------->|
  |                         |                            |-- emit ExecApprovalRequest
  |<-- query events --------|                            |   (if tool call)
  |-- signal approval ----->|                            |
  |                         |-- WorkflowTask ----------->|
  |                         |                            |-- tool_exec activity
  |                         |<-- ScheduleActivity -------|
  |                         |-- ActivityTask ----------->|
  |                         |<-- ActivityResult ---------|  (tool output)
  |                         |-- WorkflowTask ----------->|
  |                         |                            |-- model_call activity
  |                         |                            |   (model sees tool result)
  |                         |                            |-- emit TurnComplete
  |<-- query events --------|                            |

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 94 Commits
src		src
tests		tests
.gitignore		.gitignore
CONTINUE_AS_NEW_DESIGN.md		CONTINUE_AS_NEW_DESIGN.md
Cargo.toml		Cargo.toml
DESIGN.md		DESIGN.md
README.md		README.md
ROADMAP.md		ROADMAP.md
build.rs		build.rs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

codex-temporal

Prerequisites

Quick start

1. Start a local Temporal server

2. Start the worker

3. Run a one-shot workflow

4. Run the TUI

5. Environment variables

Tests

Unit tests

E2E tests

TUI E2E tests

Architecture

Protocol

Workflow execution flow

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

codex-temporal

Prerequisites

Quick start

1. Start a local Temporal server

2. Start the worker

3. Run a one-shot workflow

4. Run the TUI

5. Environment variables

Tests

Unit tests

E2E tests

TUI E2E tests

Architecture

Protocol

Workflow execution flow

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages