Skip to content

test: log-buffer, RWLock concurrency, SSE chunk splitting — 13 new tests#527

Closed
anandgupta42 wants to merge 1 commit intomainfrom
test/hourly-20260327-2019
Closed

test: log-buffer, RWLock concurrency, SSE chunk splitting — 13 new tests#527
anandgupta42 wants to merge 1 commit intomainfrom
test/hourly-20260327-2019

Conversation

@anandgupta42
Copy link
Copy Markdown
Contributor

@anandgupta42 anandgupta42 commented Mar 27, 2026

What does this PR do?

Adds 13 new tests across 3 modules that had zero or minimal test coverage, targeting real user-facing risk areas identified during automated test discovery.

1. bufferLog / getRecentDbtLogs / clearDbtLogspackages/dbt-tools/src/log-buffer.ts (6 new tests)

This ring buffer captures dbt log output to prevent TUI corruption (referenced in #249). Zero tests existed. New coverage includes:

  • Insertion order preservation
  • Oldest-entry eviction when the 100-message cap is exceeded
  • Buffer clearing resets state completely
  • getRecentDbtLogs() returns a defensive copy (not a mutable reference to internal state)
  • Correct behavior at boundary (200 messages → exactly 100 retained, correct window)
  • Empty buffer returns empty array

2. Lock.read / Lock.writepackages/opencode/src/util/lock.ts (2 new tests, 3 total)

The RWLock powers session concurrency. Only 1 test existed (basic writer exclusivity). New coverage includes:

  • Concurrent readers: multiple readers acquire simultaneously without blocking
  • Writer starvation prevention: a queued writer blocks subsequent readers, ensuring writers are not starved by a steady stream of read requests (tests the process() priority logic)

3. parseSSEpackages/opencode/src/control-plane/sse.ts (5 new tests, 7 total)

The SSE parser powers real-time workspace sync via the control plane. Only 2 basic tests existed. New coverage includes:

  • Event data split across chunk boundaries (buffer accumulation correctness)
  • \n\n delimiter split across two chunks (the most dangerous boundary case)
  • Empty events (consecutive \n\n with no data lines) are correctly ignored
  • Abort signal stops processing mid-stream (uses pull()-based stream for reliable abort timing)
  • Bare \r line endings are normalized correctly (existing tests only covered \r\n)

Type of change

  • New feature (non-breaking change which adds functionality)

Issue for this PR

N/A — proactive test coverage via automated test discovery

How did you verify your code works?

bun test packages/dbt-tools/test/log-buffer.test.ts          # 6 pass
bun test packages/opencode/test/util/lock.test.ts            # 3 pass (1 existing + 2 new)
bun test packages/opencode/test/control-plane/sse.test.ts    # 7 pass (2 existing + 5 new)

All tests validated by a critic agent before commit (rejected 2 proposed tests as redundant/flawed, revised 2 others for correctness).

Checklist

  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes

https://claude.ai/code/session_01153R7Dh9BMKiarndEUraBk

Summary by CodeRabbit

  • Tests
    • Enhanced test coverage for log buffer operations, including overflow handling and buffer isolation.
    • Extended test coverage for SSE parsing to handle chunked payloads, split delimiters, and stream interruption scenarios.
    • Added test coverage for concurrent read locking and writer-priority behavior in lock mechanisms to prevent starvation.

Cover three untested risk areas: dbt ring buffer overflow (ties to #249 TUI
corruption fix), reader-writer lock starvation ordering, and SSE event parsing
across chunk boundaries and abort signals.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01153R7Dh9BMKiarndEUraBk
Copy link
Copy Markdown

@claude claude bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Claude Code Review

This repository is configured for manual code reviews. Comment @claude review to trigger a review and subscribe this PR to future pushes, or @claude review once for a one-time review.

Tip: disable this comment in your organization's Code Review settings.

@coderabbitai
Copy link
Copy Markdown

coderabbitai bot commented Mar 27, 2026

📝 Walkthrough

Walkthrough

This pull request adds comprehensive test suites across three modules: a new test suite for the DBT log buffer module testing core buffer behaviors and overflow handling; five new test cases for SSE parsing covering edge cases including split tokens, event delimiters across chunks, abort signals, and carriage-return line endings; and two new test cases for Lock concurrency testing concurrent reads and writer-priority behavior.

Changes

Cohort / File(s) Summary
DBT Log Buffer Tests
packages/dbt-tools/test/log-buffer.test.ts
New test suite verifying buffer operations: message ordering, 100-entry overflow capacity with oldest entry eviction, clearing functionality, immutable copy return values, and empty buffer handling.
SSE Parser Edge Cases
packages/opencode/test/control-plane/sse.test.ts
Five new test cases covering: split tokens across chunk boundaries, event delimiters split across chunks, empty event ignoring, abort signal handling mid-stream, and carriage-return line ending parsing.
Lock Concurrency
packages/opencode/test/util/lock.test.ts
Two new test cases verifying concurrent read lock acquisition without blocking and writer-priority behavior with reader starvation prevention.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Possibly related PRs

  • #411: Adds SSE test adjustments for parseSSE behavior alongside similar test coverage for edge cases

Suggested labels

contributor

Poem

🐰 Test cases hop and play,
Buffering logs the rabbit way,
Locks and streams align just right,
Coverage shines both day and night!
Thump thump—all green, hooray! ✨

🚥 Pre-merge checks | ✅ 3
✅ Passed checks (3 passed)
Check name Status Explanation
Title check ✅ Passed The title clearly and specifically summarizes the main change: adding 13 new tests across three modules with a concise, concrete focus on the tested areas.
Description check ✅ Passed The pull request description comprehensively covers all template sections: it details what changed and why (specific modules and test rationale), provides test verification output (how it was tested), and includes checklist confirmations.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
📝 Generate docstrings
  • Create stacked PR
  • Commit on current branch
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch test/hourly-20260327-2019

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link
Copy Markdown

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🧹 Nitpick comments (1)
packages/dbt-tools/test/log-buffer.test.ts (1)

43-50: Consider caching getRecentDbtLogs() result to avoid repeated calls.

The function is called three times for assertions. While not a bug, storing the result in a variable improves readability and consistency with the earlier test at lines 20-23.

♻️ Suggested refactor
   test("buffer stays at exactly 100 after repeated overflow", () => {
     for (let i = 0; i < 200; i++) {
       bufferLog(`msg-${i}`)
     }
-    expect(getRecentDbtLogs()).toHaveLength(100)
-    expect(getRecentDbtLogs()[0]).toBe("msg-100")
-    expect(getRecentDbtLogs()[99]).toBe("msg-199")
+    const logs = getRecentDbtLogs()
+    expect(logs).toHaveLength(100)
+    expect(logs[0]).toBe("msg-100")
+    expect(logs[99]).toBe("msg-199")
   })
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@packages/dbt-tools/test/log-buffer.test.ts` around lines 43 - 50, The test
repeatedly calls getRecentDbtLogs() three times; cache its result in a local
variable (e.g., const logs = getRecentDbtLogs()) after filling the buffer with
bufferLog(...) and use that variable for the three assertions (length and index
checks) to improve readability and avoid redundant calls to getRecentDbtLogs().
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@packages/dbt-tools/test/log-buffer.test.ts`:
- Around line 43-50: The test repeatedly calls getRecentDbtLogs() three times;
cache its result in a local variable (e.g., const logs = getRecentDbtLogs())
after filling the buffer with bufferLog(...) and use that variable for the three
assertions (length and index checks) to improve readability and avoid redundant
calls to getRecentDbtLogs().

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: fe4b9d11-57a1-4c0a-b0c3-8c6c52669d00

📥 Commits

Reviewing files that changed from the base of the PR and between 86a02c4 and fa6288b.

📒 Files selected for processing (3)
  • packages/dbt-tools/test/log-buffer.test.ts
  • packages/opencode/test/control-plane/sse.test.ts
  • packages/opencode/test/util/lock.test.ts

anandgupta42 added a commit that referenced this pull request Mar 28, 2026
… fixes

Consolidates PRs #515, #526, #527, #528, #530, #531, #532, #533, #534,
#535, #536, #537, #538, #539, #540, #541, #542, #543 into a single PR.

Changes:
- 30 files changed, ~3000 lines of new test coverage
- Deduplicated redundant tests:
  - `copilot-compat.test.ts`: removed duplicate `mapOpenAICompatibleFinishReason`
    tests (already covered in `copilot/finish-reason.test.ts`)
  - `lazy.test.ts`: removed duplicate error-retry and `reset()` tests
  - `transform.test.ts`: kept most comprehensive version (#535) over
    subset PRs (#539, #541)
- Bug fixes from PR #528:
  - `extractEquivalenceErrors`: `null` entries in `validation_errors`
    crashed with TypeError (`null.message` throws before `??` evaluates).
    Fixed with optional chaining: `e?.message`
  - `extractSemanticsErrors`: same fix applied
  - Updated test from `expect(...).toThrow(TypeError)` to verify the fix

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@anandgupta42
Copy link
Copy Markdown
Contributor Author

Consolidated into #545

anandgupta42 added a commit that referenced this pull request Mar 28, 2026
… fixes (#545)

* test: MCP auth — URL validation, token expiry, and client secret lifecycle

Cover security-critical McpAuth functions (getForUrl, isTokenExpired) and
McpOAuthProvider.clientInformation() expiry detection that had zero test coverage.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01CqcvvXp5hUVsNU441DFTwb

* test: copilot provider — finish reason mapping and tool preparation

Add 27 unit tests for three previously untested copilot SDK functions
that are critical to the GitHub Copilot provider integration path.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* test: log-buffer, RWLock concurrency, SSE chunk splitting — 13 new tests

Cover three untested risk areas: dbt ring buffer overflow (ties to #249 TUI
corruption fix), reader-writer lock starvation ordering, and SSE event parsing
across chunk boundaries and abort signals.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01153R7Dh9BMKiarndEUraBk

* test: SQL tool formatters — check, equivalence, semantics (38 tests)

Export and test pure formatting functions across three SQL analysis tools
that had zero test coverage. Discovered a real bug: null entries in
validation_errors crash extractEquivalenceErrors (TypeError on null.message).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01Lz8zxrbwHXfsC2FbHxXZh9

* test: stats display + MCP OAuth XSS prevention — 26 new tests

Add first-ever test coverage for the `altimate-code stats` CLI output formatting
and the MCP OAuth callback server's HTML escaping (XSS prevention boundary).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* test: util — proxy detection and lazy error recovery

Add tests for proxied() corporate proxy detection (6 tests) and
lazy() error recovery + reset behavior (2 tests) to cover untested
code paths that affect package installation and initialization.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01EDCRjjHdb1dWvxyAfrLuhw

* test: session compaction — observation mask and arg truncation

Cover createObservationMask() which generates the replacement text when old
tool outputs are pruned during session compaction. Tests verify format
correctness, UTF-8 byte counting, arg truncation with surrogate pair safety,
unserializable input handling, and fingerprint capping.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01SHDrUNHjUpTwPvcjQcJ4ug

* test: bus — publish/subscribe/once/unsubscribe mechanics

Zero dedicated tests existed for the core event Bus that powers session updates,
permission prompts, file watcher notifications, and SSE delivery. New coverage
includes subscriber delivery, unsubscribe correctness, wildcard subscriptions,
type isolation, and Bus.once auto-removal.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01GchE7rUZayV1ouLEseVndK

* test: lazy utility and credential-store — error retry, reset, sensitive field coverage

Cover untested behaviors in lazy() (error non-caching and reset) that power shell
detection, plus complete isSensitiveField unit coverage for BigQuery/SSL/SSH fields.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01WoqeutgfwXNcktweCKoLwd

* test: provider/transform — temperature, topP, topK, smallOptions, maxOutputTokens

Add 35 tests for five previously untested ProviderTransform functions that
control model-specific inference parameters for all users.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_014NGgCMNXEg4Nn3JCpzDg5w

* test: fingerprint + context — fill coverage gaps in core utilities

Add tests for Fingerprint.refresh() cache invalidation and dbt-packages tag
detection (both untested code paths), plus first-ever unit tests for the
Context utility (AsyncLocalStorage wrapper) used by every module.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01N8kgPYhXX7SrYnZKJLiTfC

* test: session todo — CRUD lifecycle with database persistence

Adds 6 tests for the Todo module (zero prior coverage). Covers insert/get round-trip,
position ordering, empty-array clear, replacement semantics, bus event emission, and
cross-session isolation. These guard the TUI todo panel against stale or phantom tasks.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* test: finops recommendations + dbt manifest edge cases — 12 new tests

Cover untested recommendation logic in warehouse-advisor and credit-analyzer
edge cases in dbt manifest parsing that affect real-world dbt projects.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01XhZy7vaqdasKH5hQ6H9ee3

* test: provider — sampling parameter functions (temperature, topP, topK)

Add 28 tests for ProviderTransform.temperature(), topP(), and topK() which
had zero direct test coverage. These pure functions control LLM sampling
behavior per model family and wrong values cause degraded output quality.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_011NoVCnMW9Kw6eh92ayU7GB

* test: session utilities — isDefaultTitle, fromRow/toRow, createObservationMask

Add 17 tests covering two untested modules in the session subsystem:
session identity helpers and compaction observation masks.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* test: provider — temperature, topP, topK model parameter defaults

Add 30 unit tests for ProviderTransform.temperature(), topP(), and topK()
which are pure functions that return model-specific sampling defaults.
These functions are the sole source of per-model parameter configuration
and were previously untested, risking silent regressions when adding or
modifying model ID patterns (e.g., kimi-k2 sub-variants, minimax-m2
dot/hyphen variants).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01WZthZmQczd51XXSjhiABNH

* test: agent — .env read protection and analyst write denial

Verify security-relevant agent permission defaults: builder agent asks before
reading .env files (preventing accidental secret exposure), and analyst agent
denies file modification tools (edit/write/todowrite/todoread).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01Wp9YaEvw6jAAL73VVdXFxA

* test: docker discovery + copilot provider compatibility

Add 20 new tests covering two previously untested modules:

1. Docker container discovery (containerToConfig) — verifies correct
   ConnectionConfig shape generation from discovered containers
2. Copilot provider finish-reason mapping and response metadata —
   ensures OpenAI-compatible finish reasons are correctly translated
   and response timestamps are properly converted

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

https://claude.ai/code/session_01J8xz7ijLjbzEe3mu7ajdWh

* test: consolidate 18 test PRs — 434 new tests, deduplicated, with bug fixes

Consolidates PRs #515, #526, #527, #528, #530, #531, #532, #533, #534,
#535, #536, #537, #538, #539, #540, #541, #542, #543 into a single PR.

Changes:
- 30 files changed, ~3000 lines of new test coverage
- Deduplicated redundant tests:
  - `copilot-compat.test.ts`: removed duplicate `mapOpenAICompatibleFinishReason`
    tests (already covered in `copilot/finish-reason.test.ts`)
  - `lazy.test.ts`: removed duplicate error-retry and `reset()` tests
  - `transform.test.ts`: kept most comprehensive version (#535) over
    subset PRs (#539, #541)
- Bug fixes from PR #528:
  - `extractEquivalenceErrors`: `null` entries in `validation_errors`
    crashed with TypeError (`null.message` throws before `??` evaluates).
    Fixed with optional chaining: `e?.message`
  - `extractSemanticsErrors`: same fix applied
  - Updated test from `expect(...).toThrow(TypeError)` to verify the fix

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* fix: resolve typecheck errors in test files

- `prepare-tools.test.ts`: use template literal type for provider tool `id`
- `compaction-mask.test.ts`: use `as unknown as` for branded type casts

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* fix: remove flaky `setTimeout` in todo bus event test

`Bus.publish` is synchronous — the event is delivered immediately,
no 50ms delay needed. Removes resource contention risk in parallel CI.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* fix: address CodeRabbit review feedback

- `formatCheck`: harden validation error formatting against null entries
  using optional chaining and filter (CodeRabbit + GPT consensus)
- `extractEquivalenceErrors`: propagate extracted errors into
  `formatEquivalence` output to prevent title/output inconsistency
- `todo.test.ts`: use `tmpdir({ git: true })` + `await using` for
  proper test isolation instead of shared project root

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: Claude <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants