Move getAdjustedTokenCountFromModel from CLI to core #9968
base: main
Conversation
- Moved getAdjustedTokenCountFromModel function from CLI tokenizer to core/llm/countTokens.ts
- Updated core countTokens() to use getAdjustedTokenCountFromModel for all models
- Added comprehensive tests for getAdjustedTokenCountFromModel in core
- Updated CLI tokenizer to import getAdjustedTokenCountFromModel from core
- Removed duplicate implementation and constants from CLI

This consolidates the token adjustment logic in one place and ensures consistent token counting across CLI and core. The llama tokenizer is now used for all models, with appropriate multipliers for Claude (1.23x), Gemini (1.18x), and Mistral family (1.26x) models.

Generated with [Continue](https://continue.dev)

Co-Authored-By: Continue <noreply@continue.dev>
Co-authored-by: dallin <dallin@continue.dev>
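For orientation, here is a minimal sketch of what the consolidated core helper could look like, assuming the multipliers and name matching described above. The constant names, argument order, and rounding step are illustrative guesses, not the repository's literal code:

```typescript
// Sketch only: constants and structure inferred from the PR description and diff.
const CLAUDE_TOKEN_MULTIPLIER = 1.23;
const GEMINI_TOKEN_MULTIPLIER = 1.18;
const MISTRAL_TOKEN_MULTIPLIER = 1.26;

export function getAdjustedTokenCountFromModel(
  tokenCount: number,
  modelName: string,
): number {
  const lowerModelName = modelName.toLowerCase();
  let multiplier = 1;

  if (lowerModelName.includes("claude")) {
    multiplier = CLAUDE_TOKEN_MULTIPLIER;
  } else if (lowerModelName.includes("gemini")) {
    multiplier = GEMINI_TOKEN_MULTIPLIER;
  } else if (
    lowerModelName.includes("mistral") ||
    lowerModelName.includes("mixtral") ||
    lowerModelName.includes("codestral")
  ) {
    multiplier = MISTRAL_TOKEN_MULTIPLIER;
  }

  // Round up so the adjusted estimate never undercounts against a context limit.
  return Math.ceil(tokenCount * multiplier);
}
```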
✅ Review Complete: Code Review Summary
2 issues found across 3 files
Prompt for AI agents (all issues)
Check if these issues are valid — if so, understand the root cause of each and fix them.
<file name="extensions/cli/src/util/tokenizer.ts">
<violation number="1" location="extensions/cli/src/util/tokenizer.ts:3">
P2: CLI now imports getAdjustedTokenCountFromModel from core/llm/countTokens.js, which pulls in heavy tokenizer dependencies at module load time, likely regressing CLI startup/memory for a simple multiplier helper.</violation>
</file>
<file name="core/llm/countTokens.ts">
<violation number="1" location="core/llm/countTokens.ts:166">
P2: countTokensAsync does not apply model-specific token multipliers, so async token counts can be lower than countTokens for Claude/Gemini/Mistral models, affecting limit checks like readFileLimit.</violation>
</file>
Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.
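To make the second issue above concrete, here is a hedged sketch of how the async path could route through the same adjustment so it stays consistent with `countTokens()`. The import path, argument order, and the `asyncEncode` tokenizer call are placeholders assumed for illustration; the real `countTokensAsync` signature may differ:

```typescript
import { getAdjustedTokenCountFromModel } from "./getAdjustedTokenCount";

// `asyncEncode` stands in for whatever async tokenizer core actually uses.
declare function asyncEncode(content: string): Promise<number>;

async function countTokensAsync(
  content: string,
  modelName: string,
): Promise<number> {
  const rawCount = await asyncEncode(content);
  // Without this adjustment, async counts for Claude/Gemini/Mistral-family
  // models come out lower than countTokens(), skewing checks like readFileLimit.
  return getAdjustedTokenCountFromModel(rawCount, modelName);
}
```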
1 issue found across 1 file (changes from recent commits).
Prompt for AI agents (all issues)
Check if these issues are valid — if so, understand the root cause of each and fix them.
<file name="core/llm/getAdjustedTokenCount.ts">
<violation number="1" location="core/llm/getAdjustedTokenCount.ts:31">
P2: Regression: Mistral-family models like “Ministral”/“Mathstral” no longer match the multiplier because the broad `includes("stral")` check was replaced with a strict allowlist.</violation>
</file>
Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.
} else if (lowerModelName.includes("gemini")) {
  multiplier = GEMINI_TOKEN_MULTIPLIER;
} else if (
  lowerModelName.includes("mistral") ||
P2: Regression: Mistral-family models like “Ministral”/“Mathstral” no longer match the multiplier because the broad includes("stral") check was replaced with a strict allowlist.
Prompt for AI agents
Check if this issue is valid — if so, understand the root cause and fix it. At core/llm/getAdjustedTokenCount.ts, line 31:
<comment>Regression: Mistral-family models like “Ministral”/“Mathstral” no longer match the multiplier because the broad `includes("stral")` check was replaced with a strict allowlist.</comment>
<file context>
@@ -27,8 +27,13 @@ export function getAdjustedTokenCountFromModel(
- } else if (lowerModelName.includes("stral")) {
- // devstral, mixtral, mistral, etc
+ } else if (
+ lowerModelName.includes("mistral") ||
+ lowerModelName.includes("mixtral") ||
+ lowerModelName.includes("codestral") ||
</file context>
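A small sketch of the difference the comment describes: the previous broad substring check matches any model name containing "stral", while the allowlist in this commit misses names like "Ministral" or "Mathstral". The model names below are only examples:

```typescript
// Previous behavior: any "stral" model gets the Mistral-family multiplier.
const broadMatch = (name: string) => name.toLowerCase().includes("stral");

// This commit: a strict allowlist of known family names.
const allowlistMatch = (name: string) => {
  const lower = name.toLowerCase();
  return (
    lower.includes("mistral") ||
    lower.includes("mixtral") ||
    lower.includes("codestral")
  );
};

console.log(broadMatch("Ministral-8B"), allowlistMatch("Ministral-8B")); // true false
console.log(broadMatch("Mathstral-7B"), allowlistMatch("Mathstral-7B")); // true false
```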
Summary
This PR moves the `getAdjustedTokenCountFromModel` function from the CLI tokenizer to `core/llm/countTokens.ts` and integrates it with core's token counting logic.
Changes
- Moved `getAdjustedTokenCountFromModel` from CLI to core
- Updated `countTokens()` to apply model-specific multipliers
Why?
This consolidates the token adjustment logic in one place so the CLI and core produce consistent token counts.
Model Multipliers
The function applies these multipliers based on empirical data:

| Model family | Multiplier |
| --- | --- |
| Claude | 1.23x |
| Gemini | 1.18x |
| Mistral family | 1.26x |
Testing
Added unit tests for `getAdjustedTokenCountFromModel` in core.
Breaking Changes
None. This is backward compatible: the function signature changed slightly (it now takes a model name string instead of a ModelConfig), but all call sites have been updated.
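For reference, a hypothetical before/after of a call site under that signature change. The argument order and the old ModelConfig-taking form are assumptions based on the prose above, not verified against the CLI code:

```typescript
// Declared here only to keep the snippet self-contained; see the sketch earlier in this PR.
declare function getAdjustedTokenCountFromModel(
  tokenCount: number,
  modelName: string,
): number;

const rawCount = 1000; // raw tokenizer count for some content

// Old CLI-local call shape (roughly): passed a ModelConfig object, e.g.
//   getAdjustedTokenCountFromModel(rawCount, modelConfig);
// New core call shape: pass the model name string directly.
const adjusted = getAdjustedTokenCountFromModel(rawCount, "claude-3-5-sonnet");
```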
Summary by cubic
Moves getAdjustedTokenCountFromModel to core and applies model-specific multipliers inside core token counting. This unifies token estimates across core and CLI.
Written for commit 7c32e1e. Summary will update on new commits.