Add blog post: The Coding Harness Behind GitHub Copilot in VS Code by jukasper · Pull Request #9740 · microsoft/vscode-docs

jukasper · 2026-05-06T02:00:52Z

This PR adds a new blog post explaining the coding harness architecture behind GitHub Copilot's agent mode in VS Code.

The post covers:

What a coding harness is and why it matters
The model-harness interaction loop
How the harness provides tools and context to the model
Evaluation and benchmarking approach

ntrogh

@jukasper Great post - very well explained and good level of depth! Also like the additional of diagrams.

I left some feedback, but nothing major.

jukasper · 2026-05-08T00:48:23Z

Thanks again for the feedback! I pushed an update that works through the review comments, including the intro intent statement, agent loop terminology, why the loop matters, the engine/car metaphor, updated diagrams, and improved image descriptions/accessibility text. Could you please take another look when you have a chance?

ntrogh

@jukasper Left a few more small suggestions. Looks great already!

ntrogh · 2026-05-08T13:32:30Z

No need to update this file

Reverted — the blogs/2026/05/01/*.png LFS entry has been removed from .gitattributes.

isidorn · 2026-05-13T10:52:22Z

+
+![Simplified diagram of the VS Code agent loop: the user sends a chat message, the tool-calling loop builds a prompt, sends it to the model, executes requested tools, records results, checks loop-control conditions, and either continues or finalizes the chat result.](agent-loop.png)
+
+A **turn** is the user-visible chat exchange: you send one message, and the agent eventually produces a response. During that turn, the agent loop may perform many **rounds**. A round is one pass through the loop: build the prompt, call the model, receive text and/or tool calls, execute any tools, record the results, and decide whether to continue. The full execution of all those rounds is the loop’s **run**. A single user turn might trigger many rounds as the model searches files, reads code, edits files, runs tests, reads the output, and iterates on failures.


Give some interesting stats if you can. Like the p50, p90, p95 of rounds in a turn for Large/Small models.

isidorn · 2026-05-13T11:00:12Z

Also consider mentioning all the automatiation we have around our harness - for example I noticed that we have a flow where a dev can trigger a harness run for a specific model on a PR - to make sure the changes made are actually good for the harness. So I would mention more of things like that! How harness is the key part of the development lifecycle in our team.

cc @rwoll probably has good ideas of example we could mention

ntrogh

@jukasper A few final comments

ntrogh · 2026-05-14T08:15:52Z

+
+The tool-calling loop is bounded by loop-control checks. We enforce a tool-call limit, check for cancellation between rounds, and run stop hooks. Stop hooks are extension points that can inspect the agent state and either allow it to finish or push it to keep working. Within the loop, the prompt is rebuilt on every iteration. That means the model always sees the latest state of the workspace: if it edited a file three rounds ago, the current prompt reflects that edit. The harness also manages conversation summarization. When the accumulated history grows too large, it compresses earlier rounds into a summary so the model can keep working without hitting the context window ceiling.
+
+> **Note:**


Suggested change

> **Note:**

> [!NOTE]

Updated to > [!NOTE].

ntrogh · 2026-05-14T08:19:31Z

+The tool-calling loop is bounded by loop-control checks. We enforce a tool-call limit, check for cancellation between rounds, and run stop hooks. Stop hooks are extension points that can inspect the agent state and either allow it to finish or push it to keep working. Within the loop, the prompt is rebuilt on every iteration. That means the model always sees the latest state of the workspace: if it edited a file three rounds ago, the current prompt reflects that edit. The harness also manages conversation summarization. When the accumulated history grows too large, it compresses earlier rounds into a summary so the model can keep working without hitting the context window ceiling.
+
+> **Note:**
+> Want to see the harness in action? You can [explore the VS Code source code](https://github.com/microsoft/vscode), use the Tools UI in Chat to review the tools available for a request, and open the [Chat Debug View](https://code.visualstudio.com/docs/copilot/chat/chat-debug-view) to inspect the prompts, tool calls, and results.


The Agent Debug Logs window is more interesting to view the harness in action. It also has a flowchart visualization (maybe include a screenshot of this in the blog?). https://code.visualstudio.com/docs/copilot/chat/chat-debug-view#_agent-debug-log-panel

Updated the note to link directly to the Agent Debug Log panel and mention the flowchart visualization.

ntrogh · 2026-05-14T08:22:29Z

@copilot Can you review the blog post for typos and incorrect grammar?

…e note block

Copilot · 2026-05-14T08:30:12Z

@copilot Can you review the blog post for typos and incorrect grammar?

Reviewed the full post. Fixed two grammar issues: a missing comma after "Visual Studio Code" in the intro paragraph, and "how can we verify" → "how we can verify" (indirect question form). The rest of the post looks clean.

jukasper added 2 commits May 5, 2026 18:56

Add blog post: The Coding Harness Behind GitHub Copilot in VS Code

e9845ad

Update blog post and replace images

940a23a

ntrogh requested changes May 6, 2026

View reviewed changes

Address blog review feedback

ecb70a3

Update blog images and copy

d102bd0

ntrogh reviewed May 8, 2026

View reviewed changes

Comment thread blogs/2026/05/01/agent-harnesses-github-copilot-vscode.md Outdated

isidorn requested changes May 13, 2026

View reviewed changes

Update agent harness blog draft

d8c886b

ntrogh reviewed May 14, 2026

View reviewed changes

Copilot started work on behalf of ntrogh May 14, 2026 08:22 View session

Address PR review feedback: revert .gitattributes, fix grammar, updat…

394c992

…e note block

Copilot finished work on behalf of ntrogh May 14, 2026 08:30

Copilot AI requested a review from ntrogh May 14, 2026 08:30


		![Simplified diagram of the VS Code agent loop: the user sends a chat message, the tool-calling loop builds a prompt, sends it to the model, executes requested tools, records results, checks loop-control conditions, and either continues or finalizes the chat result.](agent-loop.png)

		A turn is the user-visible chat exchange: you send one message, and the agent eventually produces a response. During that turn, the agent loop may perform many rounds. A round is one pass through the loop: build the prompt, call the model, receive text and/or tool calls, execute any tools, record the results, and decide whether to continue. The full execution of all those rounds is the loop’s run. A single user turn might trigger many rounds as the model searches files, reads code, edits files, runs tests, reads the output, and iterates on failures.


		The tool-calling loop is bounded by loop-control checks. We enforce a tool-call limit, check for cancellation between rounds, and run stop hooks. Stop hooks are extension points that can inspect the agent state and either allow it to finish or push it to keep working. Within the loop, the prompt is rebuilt on every iteration. That means the model always sees the latest state of the workspace: if it edited a file three rounds ago, the current prompt reflects that edit. The harness also manages conversation summarization. When the accumulated history grows too large, it compresses earlier rounds into a summary so the model can keep working without hitting the context window ceiling.

		> Note:

Conversation

jukasper commented May 6, 2026

Uh oh!

ntrogh left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jukasper commented May 8, 2026

Uh oh!

ntrogh left a comment

Choose a reason for hiding this comment

Uh oh!

ntrogh May 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI May 14, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

isidorn May 13, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

isidorn commented May 13, 2026

Uh oh!

ntrogh left a comment

Choose a reason for hiding this comment

Uh oh!

ntrogh May 14, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI May 14, 2026

Choose a reason for hiding this comment

Uh oh!

ntrogh May 14, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI May 14, 2026

Choose a reason for hiding this comment

Uh oh!

ntrogh commented May 14, 2026

Uh oh!

Copilot AI commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants