How OpenAI Cut Agent Latency With WebSockets and Smarter Caching
OpenAI’s Responses API now uses WebSockets and connection-scoped caching to slash overhead in agentic loops like Codex. Here’s what changed and why it matters.
OpenAI’s Responses API now uses WebSockets and connection-scoped caching to slash overhead in agentic loops like Codex. Here’s what changed and why it matters.