Add WebSocket transport for OpenAI Responses API streaming by dgageot · Pull Request #2186 · docker/docker-agent

dgageot · 2026-03-19T20:09:21Z

Introduce an optional WebSocket transport as an alternative to SSE for the OpenAI Responses API. Users can enable it via provider_opts:

models:
  gpt-ws:
    provider: openai
    model: gpt-4.1
    provider_opts:
      transport: websocket  # Use WebSocket instead of SSE

Key changes:

- Add responseEventStream interface to abstract SSE and WebSocket transports
- Refactor ResponseStreamAdapter to accept any responseEventStream
- Implement wsStream (WebSocket transport) and wsPool (connection pool with 55-min TTL, auto-reconnect, and lastResponseID tracking)
- Integrate WebSocket path in CreateResponseStream with automatic SSE fallback on connection failure
- No new dependencies (reuses existing gorilla/websocket)

The existing ResponseStreamAdapter.Recv() logic is fully reused since WebSocket events use the same JSON schema as SSE events.
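
A minimal sketch of that idea, not the PR's actual code: one decoding loop consumes events from any transport that satisfies a small interface. The interface name responseEventStream comes from the description above; the method set (Next, Current, Err, Close), the sliceStream transport, and the collectTypes helper are assumptions made for illustration only.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// responseEventStream is named in the PR; this method set is an assumption.
type responseEventStream interface {
	Next() bool      // advance to the next event; false at end or on error
	Current() []byte // raw JSON payload of the current event
	Err() error      // first error encountered, if any
	Close() error
}

// sliceStream is a hypothetical in-memory transport standing in for
// either SSE or WebSocket.
type sliceStream struct {
	events [][]byte
	idx    int
}

func (s *sliceStream) Next() bool {
	if s.idx >= len(s.events) {
		return false
	}
	s.idx++
	return true
}

func (s *sliceStream) Current() []byte { return s.events[s.idx-1] }
func (s *sliceStream) Err() error      { return nil }
func (s *sliceStream) Close() error    { return nil }

// event holds the only field this sketch decodes.
type event struct {
	Type string `json:"type"`
}

// collectTypes drains any transport with the same decoding logic, which is
// possible because both transports carry the same JSON schema.
func collectTypes(s responseEventStream) ([]string, error) {
	defer s.Close()
	var types []string
	for s.Next() {
		var ev event
		if err := json.Unmarshal(s.Current(), &ev); err != nil {
			return nil, err
		}
		types = append(types, ev.Type)
	}
	return types, s.Err()
}

func main() {
	s := &sliceStream{events: [][]byte{
		[]byte(`{"type":"response.created"}`),
		[]byte(`{"type":"response.completed"}`),
	}}
	types, err := collectTypes(s)
	fmt.Println(types, err)
}
```

Because the consumer depends only on the interface, adding a transport does not require touching the decoding path.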

Assisted-By: docker-agent

- Add Client.Close() to release pooled WebSocket connections
- Invalidate broken connections in pooledStream.Close() instead of returning dead sockets to the pool
- Preserve lastResponseID across reconnections (expired + broken) so server-side context caching survives connection resets
- Add wsMaxReconnectAttempts constant with bounded retry loop to prevent unbounded reconnection attempts
- Replace os.Getenv("OPENAI_API_KEY") with c.Env.Get() for consistent secret resolution via the environment provider
- Treat websocket.CloseNoStatusReceived as a normal close condition

Assisted-By: docker-agent

dgageot · 2026-03-20T08:44:38Z

/review

docker-agent

Assessment: 🟢 APPROVE

This PR introduces WebSocket transport for the OpenAI Responses API with solid implementation and comprehensive tests. The code is well-structured with proper connection pooling, error handling, and automatic SSE fallback.

One minor issue was found: the lastResponseID field is tracked but never used for server-side context caching as intended by its documentation.

docker-agent · 2026-03-20T08:49:07Z

pkg/model/provider/openai/ws_pool.go

+	// this pool. It can be passed as previous_response_id in subsequent
+	// requests to enable server-side context caching.
+	// It lives on the pool (not wsConnection) so it survives reconnections.
+	lastResponseID string


ISSUE: Incomplete Feature Implementation

The lastResponseID field is tracked but never used for server-side context caching.

Details:
The field is declared with documentation stating "It can be passed as previous_response_id in subsequent requests to enable server-side context caching" (lines 42-43), and it is correctly updated when terminal events are received (line 184 in pooledStream.Next()). However, no code reads lastResponseID and passes it as previous_response_id in the request params.

Impact:
The server-side context caching feature mentioned in the documentation is non-functional. Subsequent requests on the same pooled connection won't benefit from cached context.

Recommendation:
Either:

- Implement the feature by reading p.lastResponseID in wsPool.Stream() and injecting it into the params before sending the request
- Remove the field and its documentation if context caching is not needed yet

- Promote lastResponseID from wsConnection to wsPool so it naturally survives all connection transitions without manual threading
- Extract closeLocked(), dialLocked(), invalidateConn() helpers to eliminate duplicated connection lifecycle logic in Stream()
- Replace loop-of-one reconnect with a single dialLocked() call
- Extract sendResponseCreate() to deduplicate marshal+send between dialWebSocket() and sendOnExisting()
- Remove wsMaxReconnectAttempts constant (was always 1)
- Simplify wsConnection struct to just conn + createdAt

Net result: -18 lines, fewer code paths, same behavior.

Assisted-By: docker-agent

- Initialize wsPool eagerly in NewClient instead of lazily in createWebSocketStream to eliminate a potential data race when concurrent goroutines both see wsPool==nil
- Downgrade WebSocket→SSE fallback log from Error to Warn since this is an intentional graceful degradation, not an unexpected error
- Close HTTP response body defensively in dialWebSocket on handshake failure to prevent a potential resource leak

Assisted-By: docker-agent

The wsPool already tracked lastResponseID from completed responses but never forwarded it to subsequent requests. Now, wsPool.Stream() injects it as previous_response_id when the caller hasn't already set one, enabling server-side context caching across multi-turn exchanges.

Add tests covering automatic injection, caller override preservation, and survival across reconnections.

Assisted-By: docker-agent
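
The injection rule can be sketched as follows. This is an assumption-laden illustration rather than the code in wsPool.Stream(): the requestParams struct and the withPreviousResponse helper are hypothetical, and the real request type carries many more fields.

```go
package main

import "fmt"

// requestParams is a hypothetical, trimmed-down request shape.
type requestParams struct {
	Model              string
	PreviousResponseID string
}

// withPreviousResponse fills in the pool's last response ID only when the
// caller left the field empty, so an explicit caller value always wins.
func withPreviousResponse(p requestParams, lastResponseID string) requestParams {
	if p.PreviousResponseID == "" && lastResponseID != "" {
		p.PreviousResponseID = lastResponseID
	}
	return p
}

func main() {
	auto := withPreviousResponse(requestParams{Model: "gpt-4.1"}, "resp_123")
	kept := withPreviousResponse(
		requestParams{Model: "gpt-4.1", PreviousResponseID: "resp_caller"},
		"resp_123",
	)
	fmt.Println(auto.PreviousResponseID, kept.PreviousResponseID)
}
```

Passing the params by value keeps the caller's original struct untouched, which matters if the same params are reused for an SSE fallback.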

dgageot requested a review from a team as a code owner March 19, 2026 20:09

docker-agent bot reviewed Mar 20, 2026

dgageot added 5 commits March 20, 2026 12:34

dgageot force-pushed the ws branch from 317b902 to 556f27e Compare March 20, 2026 11:35

aheritier approved these changes Mar 20, 2026

dgageot merged commit 53a35dc into docker:main Mar 20, 2026
4 checks passed

dgageot merged 5 commits into docker:main from dgageot:ws