Skip to content

Commit ae0f600

Browse files
committed
New doc: What makes Codebuff unique (generated from all our marketing material + tweaks)
1 parent 70d3787 commit ae0f600

File tree

1 file changed

+166
-0
lines changed

1 file changed

+166
-0
lines changed
Lines changed: 166 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,166 @@
1+
---
2+
title: 'What Makes Codebuff Unique'
3+
section: 'tips'
4+
tags: ['features', 'comparison', 'architecture']
5+
order: 1
6+
---
7+
8+
# What Makes Codebuff Unique
9+
10+
Codebuff is an open-source AI coding agent that coordinates specialized sub-agents instead of using one model for everything.
11+
12+
The result: better code quality and up to 3x faster performance than Claude Code, built on a deep agent framework continuously refined by our in-house evals
13+
14+
## 3x Faster Than Claude Code
15+
16+
Codebuff is dramatically faster—often completing tasks in 1/3 the time.
17+
18+
{/* TODO: Add speed comparison image/video */}
19+
20+
In real-world tests:
21+
- **Claude Code**: 19m 37s for a feature
22+
- **Codebuff**: 6m 45s for the same feature
23+
24+
That's 100+ seconds faster on average per prompt. We achieve this through parallel agents, prompt caching, and smarter file discovery.
25+
26+
See our detailed [comparison with Claude Code](/docs/advanced/claude-code-comparison).
27+
28+
## Tree-based File Discovery
29+
30+
Claude Code can spend 5+ minutes grep-ing and reading file excerpts one at a time.
31+
32+
{/* TODO: Add file picker screenshot */}
33+
34+
Codebuff's approach:
35+
1. **Parse your entire codebase**: We analyze all source files and extract function names, class names, and type names
36+
2. **Build a code tree**: This creates a compact tree of all directories, files, and symbols in your project
37+
3. **Grok 4.1 Fast scans the tree**: We feed this code tree to Grok 4.1 Fast, which identifies up to 12 relevant files in seconds
38+
4. **Gemini Flash summarizes**: Those 12 files are read and summarized by Gemini Flash
39+
5. **Main agent reads multiple files at once**: With the summaries, the main agent knows exactly what to read
40+
41+
This entire process takes just a few seconds and efficiently conveys a lot of information to the agent. No more watching your agent slowly explore your codebase.
42+
43+
## Parallel Multi-Strategy Editing
44+
45+
In MAX mode, Codebuff doesn't just try once—it tries three times in parallel with different strategies and picks the best result.
46+
47+
{/* TODO: Add multi-prompt editing diagram */}
48+
49+
How it works:
50+
1. The orchestrator spawns multiple editor agents, each with a different strategy
51+
2. All implementations run in parallel, reusing the prompt cache
52+
3. A selector agent chooses the best implementation
53+
4. The selector can incorporate good ideas from other attempts
54+
55+
This is remarkably efficient because all parallel agents share the cached conversation history—you only pay once for reading files.
56+
57+
## Automatic Code Review
58+
59+
Every prompt gets reviewed before Codebuff finishes.
60+
61+
{/* TODO: Add code review screenshot */}
62+
63+
- A reviewer agent spawns automatically
64+
- It runs in parallel with typechecks and tests
65+
- Catches bugs, dead code, and quality issues
66+
- Fixes are applied before you see the result
67+
68+
In MAX mode, multiple reviewers analyze your code from different angles—all reusing the prompt cache.
69+
70+
## Invisible Context Management
71+
72+
Other tools show you "% context used" and make you worry about it.
73+
74+
{/* TODO: Add context management diagram */}
75+
76+
Codebuff handles context automatically:
77+
- **Smart compaction**: After the prompt cache expires (5 min idle), we automatically summarize the conversation—much more efficient for long sessions
78+
- **Non-lossy summaries**: 10-20 roundtrips preserved with full details
79+
- **Deterministic strategy**: User messages, assistant messages, tool calls—all kept
80+
- **Immediate re-reading**: Codebuff quickly re-reads any relevant files it needs after compaction
81+
82+
You never think about context. It just works.
83+
84+
## Open Source Multi-Agent Framework
85+
86+
Our entire agent framework is [open source](/docs/advanced/sdk). The same code that powers Codebuff powers your custom agents.
87+
88+
{/* TODO: Add agent framework diagram */}
89+
90+
Key innovations:
91+
- **Agents as the composable unit**: Not individual LLM calls, but complete agents with tools and prompts
92+
- **Optional inherited context**: Subagents can optionally inherit conversation history (Claude Code's subagents always start with blank context)
93+
- **Arbitrary nesting**: Agents can spawn agents that spawn agents—unlimited depth (Claude Code only supports 1 level of subagents)
94+
- **Programmatic control**: Mix LLM calls with TypeScript code using generator functions
95+
- **Orchestrator pattern**: One agent with no tools except spawning other agents—perfect context management for free
96+
97+
```typescript
98+
// Simplified example of the orchestrator pattern
99+
const orchestrator = {
100+
tools: [spawnAgent],
101+
spawnableAgents: [filePicker, editor, reviewer, thinker, researcher]
102+
}
103+
```
104+
105+
Spawned agents contribute only their final output, keeping the orchestrator's context clean and focused.
106+
107+
## Research-Driven Agent Development
108+
109+
We built [BuffBench](https://github.com/CodebuffAI/codebuff/tree/main/evals)—our custom eval suite that tests agent configurations across 175+ real implementation tasks from open source repos.
110+
111+
{/* TODO: Add BuffBench results chart */}
112+
113+
BuffBench takes a fundamentally different approach from benchmarks like SWE Bench. Instead of passing predefined tests, our evals challenge coding agents to reimplement real git commits through multi-turn conversations. An AI judge scores implementations on completion, efficiency, code quality, and overall correctness—comparing against the ground truth commit.
114+
115+
- **Data-driven optimization**: We measure quality, speed, and cost across many agent combinations
116+
- **Ship what wins**: Only the highest-scoring, fastest, most cost-effective configurations go live
117+
- **Most complex agent system**: After testing countless subagent combinations, we ship the most robust multi-agent architecture of any major coding agent
118+
- **Continuous improvement**: We believe going deeper on agent research will unlock significant further advantages that no one else will find
119+
120+
Our research isn't theoretical—it's deployed in production, constantly refined by real-world testing.
121+
122+
## Ad Revenue Share
123+
124+
Codebuff optionally displays ads above the input box. Each impression earns you credits you can spend on more coding agent usage.
125+
126+
{/* TODO: Add ad display screenshot */}
127+
128+
- **Earn while you code**: Ad impressions convert directly to credits
129+
- **Completely optional**: Turn ads off at any time in settings
130+
- **Use credits for more prompts**: Earned credits work just like purchased credits
131+
132+
## Polished Terminal UI
133+
134+
Codebuff's CLI is built on [OpenTUI](https://github.com/anomalyco/opentui)—a React-based terminal framework.
135+
136+
{/* TODO: Add CLI screenshot */}
137+
138+
- No flicker, ever
139+
- Hover and click support
140+
- Sleek, polished experience
141+
142+
## Clickable Follow-up Suggestions
143+
144+
After every response, Codebuff suggests three follow-up prompts you can click to execute.
145+
146+
{/* TODO: Add follow-up suggestions screenshot */}
147+
148+
- Codebuff often has ideas you didn't think of
149+
- One click to continue building
150+
- A step toward Codebuff as a collaborative partner
151+
152+
## No Babysitting Required
153+
154+
When you ask Codebuff to do something, it just does it. No permission prompts. No "Are you sure?" dialogs.
155+
156+
{/* TODO: Add comparison screenshot */}
157+
158+
You can step away and come back to finished work.
159+
160+
## Try It Now
161+
162+
```bash
163+
npm install -g codebuff
164+
```
165+
166+
Then `cd` to your project and run `codebuff`. Experience the difference in seconds.

0 commit comments

Comments
 (0)