Inworld websocket improvements #979

cshape · 2026-01-16T09:08:37Z

Description

Update the WebSocket implementation to follow Inworld's connection/context setup

https://docs.inworld.ai/api-reference/ttsAPI/texttospeech/synthesize-speech-websocket

Pre-Review Checklist

Build passes: All builds (lint, typecheck, tests) pass locally
AI-generated code reviewed: Removed unnecessary comments and ensured code quality
Changes explained: All changes are properly documented and justified above
Scope appropriate: All changes relate to the PR title, or explanations provided for why they're included

Testing

Tested in examples/src/inworld_tts.ts

Summary by CodeRabbit

Bug Fixes
- Fixed punctuation formatting in text-to-speech output.
Performance Improvements
- Added shared connection pooling to improve TTS streaming efficiency, concurrency, and stability.
API Changes
- Tightened allowed TTS encoding values for stricter validation.
- Added accessors to expose the active connection URL and current TTS options.
- Exposed pooling-related exports for advanced connection management.

changeset-bot · 2026-01-16T09:08:41Z

🦋 Changeset detected

Latest commit: f8fda4a

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 18 packages

Name	Type
@livekit/agents-plugin-inworld	Patch
@livekit/agents	Patch
@livekit/agents-plugin-anam	Patch
@livekit/agents-plugin-baseten	Patch
@livekit/agents-plugin-bey	Patch
@livekit/agents-plugin-cartesia	Patch
@livekit/agents-plugin-deepgram	Patch
@livekit/agents-plugin-elevenlabs	Patch
@livekit/agents-plugin-google	Patch
@livekit/agents-plugin-hedra	Patch
@livekit/agents-plugin-livekit	Patch
@livekit/agents-plugin-neuphonic	Patch
@livekit/agents-plugin-openai	Patch
@livekit/agents-plugin-resemble	Patch
@livekit/agents-plugin-rime	Patch
@livekit/agents-plugin-silero	Patch
@livekit/agents-plugin-xai	Patch
@livekit/agents-plugins-test	Patch

coderabbitai · 2026-01-16T09:08:50Z

Note

Other AI code review bot(s) detected

CodeRabbit has detected other AI code review bot(s) in this pull request and will avoid duplicating their findings in the review comments. This may lead to a less comprehensive review.

📝 Walkthrough

Walkthrough

This PR adds a changeset and refactors the TTS implementation to use a shared WebSocket connection pool (InworldConnection/ConnectionPool), narrows the Encoding type to a fixed union, and exposes new TTS accessors and pooling exports while migrating per-context lifecycle into the pool.

Changes

Cohort / File(s)	Summary
Changeset entry `.changeset/chatty-rockets-start.md`	Adds a changeset bumping `@livekit/agents-plugin-inworld` (patch) and a punctuation-fix note.
TTS connection pooling & API surface `plugins/inworld/src/tts.ts`	Replaces per-instance WebSocket handling with a shared connection pool (InworldConnection, ConnectionPool). Adds pool lifecycle, context acquisition/release, waiter synchronization, idle/session timeouts, per-context callbacks, and global pool constants. Narrows `Encoding` to a fixed union, adds `wsURL`/`opts` getters on `TTS`, updates streaming to use pooled contexts, and exports pooling symbols.
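
To illustrate what narrowing `Encoding` to a fixed union buys, here is a minimal sketch. The member names are placeholders chosen for illustration and may not match the plugin's actual list; `ENCODINGS`, `isEncoding`, and `parseEncoding` are hypothetical helpers that do not appear in the PR.

```typescript
// Hypothetical encoding names for illustration only; the plugin's real union may differ.
const ENCODINGS = ['LINEAR16', 'MP3', 'OGG_OPUS'] as const;

// A fixed union derived from the tuple: 'LINEAR16' | 'MP3' | 'OGG_OPUS'.
type Encoding = (typeof ENCODINGS)[number];

// Runtime guard for untyped input (for example, values read from config or env).
function isEncoding(value: string): value is Encoding {
  return (ENCODINGS as readonly string[]).includes(value);
}

// Typed callers get a compile-time error for a typo such as 'mp3';
// untyped input is checked at runtime and replaced by the fallback.
function parseEncoding(value: string, fallback: Encoding = 'LINEAR16'): Encoding {
  return isEncoding(value) ? value : fallback;
}
```

Compared with a plain `string`, the union moves invalid values from a server-side rejection to a local type error, which is presumably the "stricter validation" the summary refers to.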

Sequence Diagram(s)

sequenceDiagram
    participant Client
    participant TTS
    participant SharedPool as "Shared Pool"
    participant InworldConn as "Inworld Connection"
    participant WebSocket

    Client->>TTS: synthesize(text)
    TTS->>SharedPool: acquireContext(wsURL, auth)
    alt context available
        SharedPool-->>TTS: contextId
    else create connection
        SharedPool->>InworldConn: create connection
        InworldConn->>WebSocket: connect
        WebSocket-->>InworldConn: connected
        InworldConn-->>SharedPool: ready
        SharedPool-->>TTS: contextId
    end

    TTS->>InworldConn: send_text(contextId, chunks)
    loop stream audio
        InworldConn->>WebSocket: send message
        WebSocket-->>InworldConn: audio frames
        InworldConn-->>TTS: emit frames
        TTS-->>Client: yield frame
    end

    TTS->>InworldConn: flush_context(contextId)
    TTS->>InworldConn: close_context(contextId)
    InworldConn->>SharedPool: release context
    SharedPool->>SharedPool: update refs
    alt no refs
        SharedPool->>InworldConn: close connection
        InworldConn->>WebSocket: disconnect
    end
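
The acquire/release flow in the diagram can be sketched with an in-memory stand-in. This is not the PR's implementation: `FakeConnection`, `ContextPool`, and the limit of two contexts per connection are assumptions made for illustration, and no real WebSocket is opened.

```typescript
// Illustrative only: names and limits are assumptions, not the plugin's API.
const MAX_CONTEXTS_PER_CONNECTION = 2; // hypothetical limit

class FakeConnection {
  readonly contexts = new Set<string>();
  closed = false;
  constructor(readonly id: number) {}

  // A connection can take another context while it is open and below the limit.
  get hasCapacity(): boolean {
    return !this.closed && this.contexts.size < MAX_CONTEXTS_PER_CONNECTION;
  }
}

class ContextPool {
  private connections: FakeConnection[] = [];
  private nextConnectionId = 0;
  private nextContextId = 0;

  // Reuse a connection with a free slot, otherwise create a new one.
  acquireContext(): { contextId: string; connection: FakeConnection } {
    let connection = this.connections.find((c) => c.hasCapacity);
    if (!connection) {
      connection = new FakeConnection(this.nextConnectionId++);
      this.connections.push(connection);
    }
    const contextId = `ctx-${this.nextContextId++}`;
    connection.contexts.add(contextId);
    return { contextId, connection };
  }

  // Release the slot; close the connection once no context references it.
  releaseContext(connection: FakeConnection, contextId: string): void {
    connection.contexts.delete(contextId);
    if (connection.contexts.size === 0) {
      connection.closed = true;
      this.connections = this.connections.filter((c) => c !== connection);
    }
  }

  get connectionCount(): number {
    return this.connections.length;
  }
}
```

The sketch closes a connection as soon as its last context is released, as the diagram suggests. The walkthrough also mentions idle and session timeouts, so the real pool may keep connections open for a while instead.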

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Poem

🐇 I hopped through sockets, found a shared abode,
Contexts gathered neatly on a pooled, cozy road.
Chunks sing in chorus, frames flutter and play,
Punctuation fixed — I nibble and sway.
— your rabbit, delighted with the new way ✨

🚥 Pre-merge checks | ❌ 3

❌ Failed checks (2 warnings, 1 inconclusive)

Check name	Status	Explanation	Resolution
Description check	⚠️ Warning	The description is incomplete relative to the template. It omits the 'Changes Made' section detailing specific modifications, lacks details on testing methodology, and does not fully address all checklist items despite marking them complete.	Add a detailed 'Changes Made' section listing the main modifications (e.g., connection pooling, Encoding type narrowing, new public APIs), provide specific testing details, and explain how testing was performed in inworld_tts.ts.
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.
Title check	❓ Inconclusive	The title 'Inworld websocket improvements' is vague and generic, using non-descriptive terms that don't clearly convey what specific changes were made beyond general 'improvements'.	Consider using a more specific title that describes the main change, such as 'Implement connection pooling for Inworld WebSocket management' or 'Refactor Inworld TTS to use shared connection pool architecture'.

📜 Recent review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between e85a663 and f8fda4a.

📒 Files selected for processing (1)

.changeset/chatty-rockets-start.md

🚧 Files skipped from review as they are similar to previous changes (1)

.changeset/chatty-rockets-start.md

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

plugins/inworld/src/tts.ts (1)

908-961: Ensure context cleanup on error paths in SynthesizeStream.

If sendLoop, flushContext, or waiter rejects, the context can remain open and capacity never returns to the pool. Close the context on failure.

🩹 Proposed fix

     try {
       // Acquire a context from the shared pool
       const acquired = await pool.acquireContext(handleMessage, config);
       contextId = acquired.contextId;
       connection = acquired.connection;
       waiter = acquired.waiter;
@@
       await waiter;
@@
       for (const frame of bstream.flush()) {
         this.queue.put({
           requestId: contextId,
           segmentId: contextId,
           frame,
           final: false,
         });
       }
     } catch (e) {
       this.#logger.error({ error: e, contextId }, 'Error in SynthesizeStream run');
+      if (connection && contextId) {
+        try {
+          await connection.closeContext(contextId);
+        } catch (closeErr) {
+          this.#logger.warn({ error: closeErr, contextId }, 'Failed to close context after error');
+        }
+      }
       throw e;
     }

🤖 Fix all issues with AI agents

In `@plugins/inworld/src/tts.ts`:
- Around line 220-255: In acquireContext, if this.#sendCreateContext(contextId,
config) throws, the created ContextInfo remains in this.#contexts and capacity
leaks. Wrap the send in try/catch; on error, remove the contextId from
this.#contexts, clear the waiter or call rejectWaiter(err) if set, then rethrow
the error. Leave this.#lastActivityAt as-is or update it appropriately. Apply
the same cleanup pattern to the other send-create-context call noted (lines
446-448) so any failed send removes the context entry and signals the waiter.
- Around line 471-479: The recurring setInterval in ConnectionPool's constructor
(the `#idleCleanupInterval` created for `#cleanupIdleConnections`) prevents process
exit in short-lived contexts; update the interval creation to call .unref() if
available (e.g., setInterval(...).unref?.()) so the timer is non-blocking, and
also ensure resource cleanup by wiring TTS.close() to call
ConnectionPool.close() and remove entries from the module-level sharedPools Map
so intervals are cleared when a TTS instance is closed; apply the same .unref()
fix to the other setInterval usages mentioned (around the other constructors at
the referenced locations).
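
The timer advice in the second item can be sketched as follows. `IdleSweeper` is a hypothetical stand-in for the pool's idle cleanup, not code from the PR; the only API relied on is Node.js's `unref()` on the object returned by `setInterval`.

```typescript
// Sketch only: IdleSweeper is a hypothetical stand-in for the pool's idle cleanup.
class IdleSweeper {
  private timer: ReturnType<typeof setInterval> | undefined;
  sweeps = 0;

  constructor(intervalMs: number) {
    const timer = setInterval(() => {
      this.sweeps += 1; // a real pool would close idle connections here
    }, intervalMs);
    // In Node.js, unref() lets the process exit while the timer is still scheduled.
    // Browsers return a number from setInterval, hence the optional call.
    (timer as unknown as { unref?: () => void }).unref?.();
    this.timer = timer;
  }

  get active(): boolean {
    return this.timer !== undefined;
  }

  // Explicit close still matters: unref() does not stop the callback from firing
  // in a long-lived process.
  close(): void {
    if (this.timer !== undefined) {
      clearInterval(this.timer);
      this.timer = undefined;
    }
  }
}
```

`unref()` only addresses process exit. Wiring `close()` into the owner's shutdown path, as the comment suggests for `TTS.close()`, is what actually stops the sweeps.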

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between df523ab and ef62402.

📒 Files selected for processing (3)

.changeset/chatty-rockets-start.md
README.md
plugins/inworld/src/tts.ts

🧰 Additional context used

🧬 Code graph analysis (1)

plugins/inworld/src/tts.ts (1)

agents/src/utils.ts (1)

shortuuid (554-556)

🔇 Additional comments (2)

README.md (1)

74-74: Inworld plugin entry looks good.

Clear addition to the supported plugins table.

.changeset/chatty-rockets-start.md (1)

1-5: Changeset entry is clear.

Version bump and summary look fine.

plugins/inworld/src/tts.ts

toubatbrian · 2026-01-16T19:37:34Z

@codex review this PR

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: ef62402c77

plugins/inworld/src/tts.ts

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

plugins/inworld/src/tts.ts (1)

862-1009: Ensure contexts are closed on error to avoid capacity leaks.
If any exception occurs after acquiring a context, the context can remain open and occupy pool capacity. Add a best‑effort cleanup path.

🩹 Suggested fix

     let contextId: string | undefined;
     let connection: InworldConnection | undefined;
     let waiter: Promise<void> | undefined;
+    let completed = false;

     try {
       // Acquire a context from the shared pool
       const acquired = await pool.acquireContext(handleMessage, config);
       contextId = acquired.contextId;
       connection = acquired.connection;
       waiter = acquired.waiter;
@@
       for (const frame of bstream.flush()) {
         this.queue.put({
           requestId: contextId,
           segmentId: contextId,
           frame,
           final: false,
         });
       }
+      completed = true;
     } catch (e) {
       this.#logger.error({ error: e, contextId }, 'Error in SynthesizeStream run');
       throw e;
+    } finally {
+      if (!completed && contextId && connection) {
+        try {
+          await connection.closeContext(contextId);
+        } catch (err) {
+          this.#logger.debug({ error: err, contextId }, 'Failed to close context after error');
+        }
+      }
     }

🤖 Fix all issues with AI agents

In `@plugins/inworld/src/tts.ts`:
- Around line 603-636: The shared-pool release bug occurs because the instance
property `#pool` (and its key) can drift when options (apiKey/wsURL) change;
updateOptions must either prevent changing those identity fields or rebind the
pool: detect when wsURL or authorization/apiKey changed, call releaseSharedPool
with the old wsURL/authorization key, then set `#pool` =
acquireSharedPool(newWsURL, newAuthorization) and update any stored key; use the
helper functions getSharedPoolKey, acquireSharedPool and releaseSharedPool and
update the class field that tracks the current key (alongside `#pool`) so close()
always releases the correct pool.
- Around line 369-379: When the WebSocket close handler clears contexts it
currently never notifies capacity waiters, so acquireContext can hang; update
the ws.on('close') handler (the close callback that sets this.#ws and
this.#connecting and iterates this.#contexts) to also signal the pool capacity
for each cleared context by resolving or rejecting any pending capacity waiters
(e.g., resolve pending promises or call the pool/semaphore release method used
by acquireContext). Ensure you notify those capacity waiters before or while
removing entries from this.#contexts so waiting acquireContext calls are
unblocked.
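
One way to keep capacity waiters from hanging when the socket closes is sketched below. `CapacityGate` and its method names are hypothetical and not taken from the PR. The sketch rejects queued callers on close; resolving them so they can retry on a fresh connection would be an equally valid choice.

```typescript
// Sketch under assumptions: CapacityGate and its method names are hypothetical.
type Waiter = { resolve: () => void; reject: (err: Error) => void };

class CapacityGate {
  private active = 0;
  private waiters: Waiter[] = [];

  constructor(private readonly limit: number) {}

  // Resolves once a slot is free; queues the caller otherwise.
  acquire(): Promise<void> {
    if (this.active < this.limit) {
      this.active += 1;
      return Promise.resolve();
    }
    return new Promise<void>((resolve, reject) => {
      this.waiters.push({ resolve, reject });
    });
  }

  // Frees one slot and hands it to the next queued caller, if any.
  release(): void {
    const next = this.waiters.shift();
    if (next) {
      next.resolve(); // the slot is transferred, so `active` stays the same
    } else if (this.active > 0) {
      this.active -= 1;
    }
  }

  // Meant to be called from the socket's close handler: drop every slot and
  // fail all queued callers so none of them waits forever.
  handleClose(reason: Error): void {
    this.active = 0;
    const pending = this.waiters;
    this.waiters = [];
    for (const waiter of pending) waiter.reject(reason);
  }

  get pendingCount(): number {
    return this.waiters.length;
  }

  get activeCount(): number {
    return this.active;
  }
}
```

The waiter list is swapped out before rejecting, so a caller that retries from its rejection handler joins a fresh queue rather than the one being drained.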

📜 Review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between ef62402 and d97100f.

📒 Files selected for processing (1)

plugins/inworld/src/tts.ts

🧰 Additional context used

📓 Path-based instructions (3)

**/*.{ts,tsx,js,jsx}

📄 CodeRabbit inference engine (.cursor/rules/agent-core.mdc)

Add SPDX-FileCopyrightText and SPDX-License-Identifier headers to all newly added files with '// SPDX-FileCopyrightText: 2025 LiveKit, Inc.' and '// SPDX-License-Identifier: Apache-2.0'

Files:

plugins/inworld/src/tts.ts

**/*.{ts,tsx}?(test|example|spec)

📄 CodeRabbit inference engine (.cursor/rules/agent-core.mdc)

When testing inference LLM, always use full model names from agents/src/inference/models.ts (e.g., 'openai/gpt-4o-mini' instead of 'gpt-4o-mini')

Files:

plugins/inworld/src/tts.ts

**/*.{ts,tsx}?(test|example)

📄 CodeRabbit inference engine (.cursor/rules/agent-core.mdc)

Initialize logger before using any LLM functionality with initializeLogger({ pretty: true }) from '@livekit/agents'

Files:

plugins/inworld/src/tts.ts

🧬 Code graph analysis (1)

plugins/inworld/src/tts.ts (1)

agents/src/utils.ts (1)

shortuuid (554-556)

🔇 Additional comments (7)

plugins/inworld/src/tts.ts (7)

27-27: Encoding union and default value look consistent.
The narrowed union and default assignment align cleanly.

Also applies to: 150-155

128-173: Pooling types and limits are well-scoped.
Clear type boundaries and centralized pool constants make lifecycle management easy to follow.

174-293: Context acquisition + lifecycle bookkeeping looks solid.
Nice to see creation failure cleanup and activity tracking consolidated here.

295-358: Connection retry/session refresh flow is clear.
Backoff and reconnection paths read well.

381-471: Message routing and context cleanup paths look good.
Status errors and context closures now release slots and notify capacity appropriately.

474-601: Pool orchestration and idle cleanup look good.
Acquire/wait/cleanup flow is straightforward and easy to reason about.

638-679: TTS pool wiring and accessors are clean.
Exposing pool, opts, and wsURL improves testability and inspection.

plugins/inworld/src/tts.ts

coderabbitai

Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

plugins/inworld/src/tts.ts (1)

926-979: Ensure contexts are released on early failures.

If any error occurs after acquireContext, the context can remain active and consume capacity, even though the stream fails. This can eventually exhaust the pool.

🩹 Suggested fix

-    try {
+    let completed = false;
+    try {
       // Acquire a context from the shared pool
       const acquired = await pool.acquireContext(handleMessage, config);
       contextId = acquired.contextId;
       connection = acquired.connection;
       waiter = acquired.waiter;
@@
       for (const frame of bstream.flush()) {
         this.queue.put({
           requestId: contextId,
           segmentId: contextId,
           frame,
           final: false,
         });
       }
+      completed = true;
     } catch (e) {
       this.#logger.error({ error: e, contextId }, 'Error in SynthesizeStream run');
       throw e;
+    } finally {
+      if (!completed && contextId && connection) {
+        try {
+          await connection.closeContext(contextId);
+        } catch (closeErr) {
+          this.#logger.warn(
+            { error: closeErr, contextId },
+            'Failed to close context after error',
+          );
+        }
+      }
     }

♻️ Duplicate comments (1)

plugins/inworld/src/tts.ts (1)

691-696: Rebind the pool when apiKey or wsURL changes.

updateOptions updates auth/URL but keeps the existing ConnectionPool, so new syntheses still use the old credentials/endpoint. This can silently route requests to the wrong backend and is hard to debug.

🩹 Suggested fix

   updateOptions(opts: Partial<TTSOptions>) {
-    this.#opts = { ...this.#opts, ...opts };
-    if (opts.apiKey) {
-      this.#authorization = `Basic ${opts.apiKey}`;
-    }
+    const prevWsURL = this.#opts.wsURL;
+    const prevAuth = this.#authorization;
+    this.#opts = { ...this.#opts, ...opts };
+    if (opts.apiKey) {
+      this.#authorization = `Basic ${opts.apiKey}`;
+    }
+    if (prevWsURL !== this.#opts.wsURL || prevAuth !== this.#authorization) {
+      this.#pool.close();
+      this.#pool = new ConnectionPool(this.#opts.wsURL, this.#authorization);
+    }
   }

📜 Review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d97100f and e85a663.

📒 Files selected for processing (1)

plugins/inworld/src/tts.ts

davidzhao

lg

.changeset/chatty-rockets-start.md

davidzhao · 2026-01-21T07:32:12Z

@cshape I think this comment is still valid.

cshape added 3 commits December 9, 2025 13:18

fix inworld punctuation bug and add inworld as plugin to readme

6ad4e51

added changeset

c4f2f53

fix: improved websocket implementation

ef62402

coderabbitai bot reviewed Jan 16, 2026

chatgpt-codex-connector bot reviewed Jan 16, 2026

connection pool fixes from reviewer feedback

4d803c5

coderabbitai bot reviewed Jan 20, 2026

fixed reviewer feedback and simplified to make pools per-instance instead of per api key

e85a663

cshape force-pushed the main branch from d97100f to e85a663 on January 20, 2026 22:37

coderabbitai bot reviewed Jan 20, 2026

Merge branch 'main' into main

fd86842

davidzhao approved these changes Jan 21, 2026

Update .changeset/chatty-rockets-start.md

f8fda4a
