Feat/eval state injection @W-21514105@ by Jaganpro · Pull Request #356 · salesforcecli/plugin-agent

Jaganpro · 2026-03-09T18:49:22Z

Summary

@W-21514105@
Addresses 3 confirmed gaps in sf agent test run-eval discovered through empirical testing against the Einstein Evaluation API:

Whitelist state + setupSessionContext + context_variables in evalNormalizer.ts — the normalizer's stripUnrecognizedFields pass currently removes these fields from agent.create_session, making it impossible to test post-auth business topics or inject session context without --no-normalize
Translate YAML contextVariables to context_variables in yamlSpecTranslator.ts — the TestCase type already supports contextVariables but translateTestCase() silently drops them
Preserve outputs[] in --json result in run-eval.ts — buildResultSummary() only returns evaluations, discarding the full outputs[] array with agent responses, topic routing, and planner state needed for CI debugging

Empirical Evidence

Testing with a Salesforce Agentforce agent that requires authentication before routing to business topics:

Scenario	With normalization (state stripped)	With `--no-normalize` (state preserved)
Topic routing	`account_validation` (auth gate)	`product_help` (correct business topic)
Quality score	2/5	5/5

Changes

File	Change
`src/evalNormalizer.ts`	Add `state`, `setupSessionContext`, `context_variables` to `VALID_AGENT_FIELDS` whitelist
`src/yamlSpecTranslator.ts`	Map `testCase.contextVariables` → `context_variables` on `create_session` step
`src/commands/agent/test/run-eval.ts`	Add `outputs[]` to `RunEvalResult` type and `buildResultSummary()`
`test/evalNormalizer.test.ts`	3 new tests for state/setupSessionContext/context_variables passthrough
`test/yamlSpecTranslator.test.ts`	3 new tests for contextVariables translation
`schemas/agent-test-run__eval.json`	Regenerated to reflect `outputs` field

Test plan

yarn build succeeds
yarn test:only — 214/214 passing (6 new tests)
test:json-schema passes with regenerated schema
Lint + prettier + commitlint hooks pass
Integration test: run product_help.json with normalization ON → confirm topic=product_help
Integration test: YAML spec with contextVariables → confirm context_variables in payload
Integration test: --json output includes outputs[] per test

…tVariables, preserve outputs - Whitelist `state`, `setupSessionContext`, and `context_variables` in evalNormalizer's VALID_AGENT_FIELDS for agent.create_session so the normalizer no longer strips fields needed for auth bypass and session context injection. - Translate YAML TestSpec `contextVariables` into `context_variables` on the agent.create_session step in yamlSpecTranslator, enabling YAML specs to inject context variables without raw JSON payloads. - Include `outputs[]` array in RunEvalResult's --json output so CI pipelines retain agent responses, topic routing, and planner state for debugging.

WillieRuemmele · 2026-03-09T20:03:10Z

FYI: adopted in #357

Jaganpro added 2 commits March 9, 2026 14:36

chore: regenerate JSON schemas after RunEvalResult type change

bb45251

WillieRuemmele changed the title ~~Feat/eval state injection~~ Feat/eval state injection @W-21514105@ Mar 9, 2026

WillieRuemmele closed this Mar 9, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/eval state injection @W-21514105@#356

Feat/eval state injection @W-21514105@#356
Jaganpro wants to merge 2 commits intosalesforcecli:mainfrom
Jaganpro:feat/eval-state-injection

Jaganpro commented Mar 9, 2026 •

edited by WillieRuemmele

Loading

Uh oh!

WillieRuemmele commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Jaganpro commented Mar 9, 2026 • edited by WillieRuemmele Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Empirical Evidence

Changes

Test plan

Uh oh!

WillieRuemmele commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Jaganpro commented Mar 9, 2026 •

edited by WillieRuemmele

Loading