Python: Add Azure AI Agent V2 computer use tool sample by TaoChenOSU · Pull Request #2210 · microsoft/agent-framework

TaoChenOSU · 2025-11-14T01:15:47Z

Motivation and Context

Add a sample to show how to create and use an Azure AI agent with a computer use tool.

Description

Add a sample to show how to create and use an Azure AI agent with a computer use tool.

Contribution Checklist

The code builds clean without any errors or warnings
The PR follows the Contribution Guidelines
All unit tests pass, and I have added new tests where possible
Is this a breaking change? If yes, add "[BREAKING]" prefix to the title of the PR.

markwallace-microsoft · 2025-11-14T01:17:40Z

Python Test Coverage Report •

File	Stmts	Miss	Cover	Missing
packages/core/agent_framework/openai
_responses_client.py	418	71	83%	144–145, 148–149, 155–156, 159, 166, 201, 231, 259–260, 287, 291, 308, 313, 355, 416, 421, 498, 503, 507–509, 530, 545–546, 550–552, 600, 620–621, 634–635, 651–652, 690, 692, 730, 732, 741–742, 755, 757, 830–836, 853–858, 877, 895, 905, 907, 925–926, 928–930, 941–942, 945, 947
TOTAL	14699	2121	85%

Python Unit Test Overview

Tests	Skipped	Failures	Errors	Time
2037	129 💤	0 ❌	0 🔥	38.969s ⏱️

Copilot

Pull Request Overview

This pull request adds a comprehensive Python sample demonstrating how to create and use an Azure AI agent with the Computer Use Preview Tool, enabling computer automation tasks through simulated interactions.

Key Changes

New sample file demonstrating computer use tool integration with Azure AI agents
Core framework enhancement to support raw message representations for specialized tool responses
Updated documentation including the new sample in the README

Reviewed Changes

Copilot reviewed 4 out of 7 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
`python/samples/getting_started/agents/azure_ai/azure_ai_with_computer_use.py`	New 290-line sample demonstrating computer use tool with state machine for simulated web search workflow, including screenshot handling and action processing
`python/samples/getting_started/agents/azure_ai/assets/cua_browser_search.png`	Binary asset file - initial screenshot showing browser search page
`python/samples/getting_started/agents/azure_ai/assets/cua_search_typed.png`	Binary asset file - screenshot showing typed search query
`python/samples/getting_started/agents/azure_ai/assets/cua_search_results.png`	Binary asset file - screenshot showing search results
`python/samples/getting_started/agents/azure_ai/README.md`	Added documentation entry for the new computer use sample
`python/packages/core/agent_framework/openai/_responses_client.py`	Enhanced message parser to support raw representations for messages without standard content
`python/packages/core/tests/openai/test_openai_responses_client.py`	Added test coverage for raw representation message handling

Copilot · 2025-11-14T01:30:20Z

python/samples/getting_started/agents/azure_ai/azure_ai_with_computer_use.py

+        if item.type == "message":
+            contents = item.content
+            for part in contents:
+                final_output += getattr(part, "text", None) or getattr(part, "refusal", None) or "" + "\n"


Incorrect operator precedence in string concatenation. The current expression getattr(part, "text", None) or getattr(part, "refusal", None) or "" + "\n" will evaluate as (... or "" + "\n") which adds "\n" to an empty string before the or operation, not to the final result.

This should be: (getattr(part, "text", None) or getattr(part, "refusal", None) or "") + "\n"

Add parentheses to ensure the newline is concatenated to the result of the or chain.

Suggested change

final_output += getattr(part, "text", None) or getattr(part, "refusal", None) or "" + "\n"

final_output += (getattr(part, "text", None) or getattr(part, "refusal", None) or "") + "\n"

Copilot · 2025-11-14T01:30:21Z

python/samples/getting_started/agents/azure_ai/azure_ai_with_computer_use.py

+            screenshots = load_screenshot_assets()
+            print("Successfully loaded screenshot assets")
+        except FileNotFoundError:
+            print("Failed to load required screenshot assets. Please ensure the asset files exist in ../assets/")


The error message references an incorrect path. The assets directory is located at ./assets/ relative to the script file (as shown in line 74), not at ../assets/.

The error message should be: "Failed to load required screenshot assets. Please ensure the asset files exist in ./assets/"

Suggested change

print("Failed to load required screenshot assets. Please ensure the asset files exist in ../assets/")

print("Failed to load required screenshot assets. Please ensure the asset files exist in ./assets/")

Copilot · 2025-11-14T01:30:21Z

python/samples/getting_started/agents/azure_ai/azure_ai_with_computer_use.py

+    filename_map = {
+        "browser_search": "cua_browser_search.png",
+        "search_typed": "cua_search_typed.png",
+        "search_results": "cua_search_results.png",
+    }
+
+    for key, path in screenshot_paths.items():
+        try:
+            image_base64 = image_to_base64(path)
+            screenshots[key] = {"filename": filename_map[key], "url": f"data:image/png;base64,{image_base64}"}


The filename_map dictionary is redundant as it duplicates information already present in the screenshot_paths dictionary. The filenames can be extracted directly from the paths using os.path.basename(path).

Consider refactoring to:

for key, path in screenshot_paths.items(): try: image_base64 = image_to_base64(path) screenshots[key] = {"filename": os.path.basename(path), "url": f"data:image/png;base64,{image_base64}"} except FileNotFoundError as e: print(f"Error: Missing required screenshot asset: {e}") raise

This eliminates the need to maintain two separate dictionaries with the same information.

Suggested change

filename_map = {

"browser_search": "cua_browser_search.png",

"search_typed": "cua_search_typed.png",

"search_results": "cua_search_results.png",

}

for key, path in screenshot_paths.items():

try:

image_base64 = image_to_base64(path)

screenshots[key] = {"filename": filename_map[key], "url": f"data:image/png;base64,{image_base64}"}

for key, path in screenshot_paths.items():

try:

image_base64 = image_to_base64(path)

screenshots[key] = {"filename": os.path.basename(path), "url": f"data:image/png;base64,{image_base64}"}

dmytrostruk · 2025-11-14T01:40:24Z

python/packages/core/agent_framework/openai/_responses_client.py

        if "content" in args or "tool_calls" in args:
            all_messages.append(args)
+        elif message.raw_representation:
+            all_messages.append(message.raw_representation)


I would avoid using raw_representation as input. As far as I know, currently we use this property as output only, unless I miss something. Instead of using raw_representation as input, we can:

Allow to pass dict as part of ChatMessage.contents, which will enable breaking-glass scenario for message content types as input.

Add new content type for computer use tool. I think this would be a preferred approach, since this tool type exists in both OpenAI Responses API and Azure AI.

that was my comment as well (didn't see this before), and we had a ADR PR discussing the potential of a set of computer use types and decided against it: #796 (comment)

eavanvalkenburg · 2025-11-14T07:52:24Z

python/packages/core/agent_framework/openai/_responses_client.py

                    args["content"].append(self._openai_content_parser(message.role, content, call_id_to_id))  # type: ignore
        if "content" in args or "tool_calls" in args:
            all_messages.append(args)
+        elif message.raw_representation:


I'm not sure this is a good idea, there is a reason we have not created abstractions for computer use, and it's because the variety and complexity of the code needed to handle the input and outputs of it across platforms is too complex for our purposes. Adding a raw_representation as a input goes against all that we do and I think if a dev needs this kind of special behavior then they are probably better off building directly against an SDK anyway since it is not abstracted, so it's not like they will be able to swap in and out between models and therefore the added value is low, and putting this method in, might break some other things, and putting this sample in implies we support this scenario, while we really don't...

TaoChenOSU · 2025-11-14T16:34:44Z

Based on offline discussions, we are not ready for this tool. Closing

Add Azure AI Agent V2 computer use tool sample

2605222

TaoChenOSU self-assigned this Nov 14, 2025

Copilot AI review requested due to automatic review settings November 14, 2025 01:15

TaoChenOSU added this to Agent Framework Nov 14, 2025

TaoChenOSU added the python label Nov 14, 2025

TaoChenOSU requested review from dmytrostruk and removed request for Copilot November 14, 2025 01:15

github-actions bot changed the title ~~Add Azure AI Agent V2 computer use tool sample~~ Python: Add Azure AI Agent V2 computer use tool sample Nov 14, 2025

Copilot started reviewing on behalf of TaoChenOSU November 14, 2025 01:16 View session

Copilot finished reviewing on behalf of TaoChenOSU November 14, 2025 01:18

Add readme

e58eb77

Copilot AI review requested due to automatic review settings November 14, 2025 01:26

markwallace-microsoft added the documentation Improvements or additions to documentation label Nov 14, 2025

Copilot started reviewing on behalf of TaoChenOSU November 14, 2025 01:26 View session

Copilot finished reviewing on behalf of TaoChenOSU November 14, 2025 01:29

Copilot AI reviewed Nov 14, 2025

View reviewed changes

Update images

c4611a2

dmytrostruk reviewed Nov 14, 2025

View reviewed changes

eavanvalkenburg reviewed Nov 14, 2025

View reviewed changes

TaoChenOSU closed this Nov 14, 2025

github-project-automation bot moved this to Done in Agent Framework Nov 14, 2025

TaoChenOSU mentioned this pull request Nov 19, 2025

Single Agent: Computer Use abstraction - content types and handler #2220

Open

crickman deleted the taochen/python-add-computer-use-tool-sample branch December 18, 2025 17:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python: Add Azure AI Agent V2 computer use tool sample#2210

Python: Add Azure AI Agent V2 computer use tool sample#2210
TaoChenOSU wants to merge 3 commits intomainfrom
taochen/python-add-computer-use-tool-sample

TaoChenOSU commented Nov 14, 2025 •

edited

Loading

Uh oh!

markwallace-microsoft commented Nov 14, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Nov 14, 2025

Uh oh!

Copilot AI Nov 14, 2025

Uh oh!

Copilot AI Nov 14, 2025

Uh oh!

dmytrostruk Nov 14, 2025

Uh oh!

eavanvalkenburg Nov 14, 2025 •

edited

Loading

Uh oh!

eavanvalkenburg Nov 14, 2025

Uh oh!

TaoChenOSU commented Nov 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

	final_output += getattr(part, "text", None) or getattr(part, "refusal", None) or "" + "\n"
	final_output += (getattr(part, "text", None) or getattr(part, "refusal", None) or "") + "\n"

	print("Failed to load required screenshot assets. Please ensure the asset files exist in ../assets/")
	print("Failed to load required screenshot assets. Please ensure the asset files exist in ./assets/")

Conversation

TaoChenOSU commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation and Context

Description

Contribution Checklist

Uh oh!

markwallace-microsoft commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Python Unit Test Overview

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Key Changes

Reviewed Changes

Uh oh!

Copilot AI Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

dmytrostruk Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

eavanvalkenburg Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

eavanvalkenburg Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

TaoChenOSU commented Nov 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

TaoChenOSU commented Nov 14, 2025 •

edited

Loading

markwallace-microsoft commented Nov 14, 2025 •

edited

Loading

eavanvalkenburg Nov 14, 2025 •

edited

Loading