[Eval SDK] Fix argument type in agent response reformatting for evaluators #44897

salma-elshafey · 2026-01-28T10:26:36Z

Description

Please add an informative description that covers that changes made by the pull request and link all relevant issues.

If an SDK is being regenerated based on a new API spec, a link to the pull request containing these API spec changes should be included above.

All SDK Contribution checklist:

The pull request does not introduce [breaking changes]
CHANGELOG is updated for new features, bug fixes or other significant changes.
I have read the contribution guidelines.

General Guidelines and Best Practices

Title of the pull request is clear and informative.
There are a small number of commits, each of which have an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

Pull request includes test coverage for the included changes.

Copilot

Pull request overview

This PR fixes a bug in the agent response reformatting functionality for evaluators. The issue was that tool call arguments were always being formatted with quotes around them, regardless of their actual type. This caused non-string arguments (numbers, booleans, None, etc.) to be incorrectly displayed as strings.

Changes:

Added a new _format_value helper function to properly format different argument types
Updated _get_agent_response to use the new formatting function for tool call arguments

Copilot · 2026-01-28T10:36:40Z

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_common/utils.py

+def _format_value(v):
+    if isinstance(v, str):
+        return f'"{v}"'
+    elif v is None:
+        return 'None'
+    else:
+        return str(v)


The _format_value function uses str() for complex types (dicts, lists), which produces Python-style representations (e.g., {'key': 'value'} instead of JSON-style {"key": "value"}). For tool call arguments that may contain nested structures, consider whether json.dumps() would be more appropriate to ensure consistent JSON-style formatting. This would produce output like ranking_options={"ranker": "default_2024_08_21", "score_threshold": 0.0} instead of ranking_options={'ranker': 'default_2024_08_21', 'score_threshold': 0.0}.

Copilot · 2026-01-28T10:36:40Z

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_common/utils.py

+def _format_value(v):
+    if isinstance(v, str):
+        return f'"{v}"'
+    elif v is None:
+        return 'None'
+    else:
+        return str(v)


This bug fix improves argument formatting in tool call messages, but test coverage should be expanded to verify the behavior with different argument types. Consider adding test cases to test__get_agent_response_with_tool_messages that verify formatting with:

Numeric arguments (integers and floats like score_threshold: 0.0)

Boolean arguments (e.g., enabled: true)

None/null values

Nested dictionaries (e.g., ranking_options: {"ranker": "default", "score_threshold": 0.0})

This would ensure the fix works correctly for all the data types that can appear in tool call arguments.

Copilot · 2026-01-28T10:36:40Z

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_common/utils.py

+def _format_value(v):
+    if isinstance(v, str):
+        return f'"{v}"'
+    elif v is None:
+        return 'None'
+    else:
+        return str(v)


The _format_value helper function lacks a docstring explaining its purpose and behavior. Consider adding a docstring that describes:

The purpose: Format tool call argument values for display in agent response text

Parameter type and meaning

Return value and format

Examples of how different types are formatted (strings, numbers, None, booleans, complex types)

This would improve code maintainability and help other developers understand the formatting rules.

Copilot · 2026-01-28T10:36:40Z

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_common/utils.py

+def _format_value(v):
+    if isinstance(v, str):
+        return f'"{v}"'
+    elif v is None:
+        return 'None'
+    else:
+        return str(v)


The _format_value function handles booleans using str(), which produces Python-style capitalized boolean strings ("True" or "False"). Consider whether lowercase booleans ("true" or "false") would be more appropriate for consistency with JSON conventions, especially since tool call arguments often originate from JSON data. If the current behavior is intentional for Python-style output, this is acceptable.

Fix argument type in agent response reformatting

7c75991

salma-elshafey requested a review from a team as a code owner January 28, 2026 10:26

Copilot AI review requested due to automatic review settings January 28, 2026 10:26

github-actions bot added the Evaluation Issues related to the client library for Azure AI Evaluation label Jan 28, 2026

Copilot started reviewing on behalf of salma-elshafey January 28, 2026 10:27 View session

Copilot AI reviewed Jan 28, 2026

View reviewed changes

Black

e7291a5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Eval SDK] Fix argument type in agent response reformatting for evaluators #44897

[Eval SDK] Fix argument type in agent response reformatting for evaluators #44897

Uh oh!

salma-elshafey commented Jan 28, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jan 28, 2026

Uh oh!

Copilot AI Jan 28, 2026

Uh oh!

Copilot AI Jan 28, 2026

Uh oh!

Copilot AI Jan 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Eval SDK] Fix argument type in agent response reformatting for evaluators #44897

Are you sure you want to change the base?

[Eval SDK] Fix argument type in agent response reformatting for evaluators #44897

Uh oh!

Conversation

salma-elshafey commented Jan 28, 2026

Description

All SDK Contribution checklist:

General Guidelines and Best Practices

Testing Guidelines

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants