Move duplicate sample.output_status to output_item.sample.error if agent/model response genarated failed #44914

YoYoJa · 2026-01-29T11:40:31Z

Description

Please add an informative description that covers that changes made by the pull request and link all relevant issues.

If an SDK is being regenerated based on a new API spec, a link to the pull request containing these API spec changes should be included above.

All SDK Contribution checklist:

The pull request does not introduce [breaking changes]
CHANGELOG is updated for new features, bug fixes or other significant changes.
I have read the contribution guidelines.

General Guidelines and Best Practices

Title of the pull request is clear and informative.
There are a small number of commits, each of which have an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

Pull request includes test coverage for the included changes.

Copilot

Pull request overview

This PR modifies the evaluation result conversion logic to move error status information from sample.output_status to the sample.error field in the AOAI output format when agent or model responses fail to generate.

Changes:

Modified _convert_single_row_to_aoai_format to check for error status in input data and populate the top-level sample field with error information
Updated test data files to include sample.output_status fields with both success and error scenarios
Changed behavior so that successful evaluations have sample: null instead of detailed sample information

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

File	Description
sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluate/_evaluate.py	Added logic to extract error status from input data and populate sample.error field; removed code that populated sample from criteria metrics
sdk/evaluation/azure-ai-evaluation/tests/unittests/data/evaluation_util_convert_old_output_test.jsonl	Updated test data to include sample.output_status fields for both success and error cases
sdk/evaluation/azure-ai-evaluation/tests/unittests/data/evaluation_util_convert_expected_output.json	Updated expected output to show sample: null for success cases and sample.error for error cases

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluate/_evaluate.py

Copilot · 2026-01-29T11:50:35Z

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluate/_evaluate.py

+    if (input_data is not None 
+        and "sample.output_status.status" in input_data 
+        and isinstance(input_data["sample.output_status.status"], str) 
+        and not _is_none_or_nan(input_data["sample.output_status.status"])
+        and input_data["sample.output_status.status"] != "success"):


The error checking logic has unnecessary parentheses wrapping the entire condition starting on line 2358. While syntactically correct, the outer parentheses spanning lines 2358-2362 make the code harder to read. Consider removing them or restructuring for better readability.

Suggested change

if (input_data is not None

and "sample.output_status.status" in input_data

and isinstance(input_data["sample.output_status.status"], str)

and not _is_none_or_nan(input_data["sample.output_status.status"])

and input_data["sample.output_status.status"] != "success"):

if input_data is not None \

and "sample.output_status.status" in input_data \

and isinstance(input_data["sample.output_status.status"], str) \

and not _is_none_or_nan(input_data["sample.output_status.status"]) \

and input_data["sample.output_status.status"] != "success":

Copilot · 2026-01-29T11:50:36Z

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluate/_evaluate.py

@@ -2354,14 +2354,35 @@ def _convert_single_row_to_aoai_format(
    run_output_results = []
    top_sample = {}


The initialization top_sample = {} on line 2355 is immediately overwritten by the logic on lines 2358-2372 (either setting it to {"error": error_info} or None). This initialization serves no purpose and could be removed. Consider initializing it as top_sample = None instead for clarity, or remove the initialization entirely since it's set in all branches.

Suggested change

top_sample = {}

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Move sample.output_status to output_item.sample.error

f293b58

YoYoJa requested a review from a team as a code owner January 29, 2026 11:40

Copilot AI review requested due to automatic review settings January 29, 2026 11:40

github-actions bot added the Evaluation Issues related to the client library for Azure AI Evaluation label Jan 29, 2026

Copilot started reviewing on behalf of YoYoJa January 29, 2026 11:41 View session

Copilot AI reviewed Jan 29, 2026

View reviewed changes

YoYoJa added 2 commits January 29, 2026 10:19

run black

5a038cf

update

6ae7621

YoYoJa changed the title ~~Move sample.output_status to output_item.sample.error if agent/model response genarated failed~~ Move duplicate sample.output_status to output_item.sample.error if agent/model response genarated failed Jan 29, 2026

YoYoJa and others added 2 commits January 29, 2026 12:09

Apply suggestions from code review

3bcaf8d

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

update

0678fbb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move duplicate sample.output_status to output_item.sample.error if agent/model response genarated failed #44914

Move duplicate sample.output_status to output_item.sample.error if agent/model response genarated failed #44914

Uh oh!

YoYoJa commented Jan 29, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI Jan 29, 2026

Uh oh!

Copilot AI Jan 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -2354,14 +2354,35 @@ def _convert_single_row_to_aoai_format(
		run_output_results = []
		top_sample = {}

Move duplicate sample.output_status to output_item.sample.error if agent/model response genarated failed #44914

Are you sure you want to change the base?

Move duplicate sample.output_status to output_item.sample.error if agent/model response genarated failed #44914

Uh oh!

Conversation

YoYoJa commented Jan 29, 2026

Description

All SDK Contribution checklist:

General Guidelines and Best Practices

Testing Guidelines

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI Jan 29, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 29, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants