Closed
450 commits
69cd315
chore(frontend): remove unused hooks (useForceRemount, useStateCallback)
ardaerzin Mar 22, 2026
4348604
chore(frontend): remove unused Playground assets (DeployButton, Deplo…
ardaerzin Mar 22, 2026
2ee2111
chore(frontend): remove unused SelectEvaluatorModal component
ardaerzin Mar 22, 2026
dd8eb25
chore(frontend): remove unused Logo and ExpandableCell components
ardaerzin Mar 22, 2026
21d3a36
refactor(frontend): use @agenta/ui VersionBadge and workflowMolecule.…
ardaerzin Mar 22, 2026
7016dbd
style(frontend): minor formatting cleanup in URL state and workflow s…
ardaerzin Mar 22, 2026
5b8447b
refactor(frontend): replace PlaygroundHeader styles.ts with bgColors …
ardaerzin Mar 22, 2026
1a4b741
feat(entity-ui): add variant display components (VariantNameCell, Var…
ardaerzin Mar 23, 2026
278b73e
feat(evaluations): add evaluatorTypes.ts for evaluator DTOs
ardaerzin Mar 23, 2026
02959b3
refactor: remove old VariantDrawer, VariantDetailsWithStatus, Variant…
ardaerzin Mar 23, 2026
ab32035
refactor: remove old EvaluatorDrawer components and useEvaluators/typ…
ardaerzin Mar 23, 2026
8943270
feat(state): add evaluatorDrawerStore.ts to state/evaluator/
ardaerzin Mar 23, 2026
b8b42b0
refactor: update import paths for evaluator types
ardaerzin Mar 23, 2026
6467c06
refactor: update imports to use @agenta/entity-ui/variant components
ardaerzin Mar 23, 2026
b2a3cee
refactor: update Playground component imports to use @agenta/entity-u…
ardaerzin Mar 23, 2026
aa00d75
refactor: update Deployments and Overview component imports to use @a…
ardaerzin Mar 23, 2026
3ebe665
refactor: clean up Types.ts - remove unused type definitions
ardaerzin Mar 23, 2026
71fadbe
test: update test fixtures with local type definitions
ardaerzin Mar 23, 2026
bcdd36c
refactor: remove unused VariantDrawerTitleMenu component
ardaerzin Mar 23, 2026
3908607
feat(ui): add useGroupedTreeData hook to @agenta/ui package
ardaerzin Mar 23, 2026
5b4b5aa
refactor(oss): migrate CopyButton imports to @agenta/ui
ardaerzin Mar 23, 2026
ba5a61b
refactor(evaluators): use useGroupedTreeData hook in EvaluatorsTable
ardaerzin Mar 23, 2026
bcacf6b
refactor(variants): use useGroupedTreeData hook in RegistryTable
ardaerzin Mar 23, 2026
460f393
refactor(evaluations): use useGroupedTreeData in NewEvaluation select…
ardaerzin Mar 23, 2026
530c5ee
refactor: remove unused components and hooks
ardaerzin Mar 23, 2026
3b89950
feat(agenta-ui): add drill-in module with core components
ardaerzin Mar 23, 2026
95366ce
refactor(agenta-entity-ui): remove deprecated DrillInView core files
ardaerzin Mar 23, 2026
b70cb94
refactor(agenta-entity-ui): update DrillInView index.ts to re-export …
ardaerzin Mar 23, 2026
96da626
refactor(agenta-entity-ui): update SchemaControls imports to use @age…
ardaerzin Mar 23, 2026
2134d47
refactor(agenta-entity-ui): update DrillInView components imports
ardaerzin Mar 23, 2026
97251c8
refactor(agenta-entity-ui): update modals and selection components im…
ardaerzin Mar 23, 2026
0e2e1ec
chore(packages): update package.json dependencies
ardaerzin Mar 23, 2026
62f3ae5
refactor(playground): use EnvironmentStatus from @agenta/entity-ui
ardaerzin Mar 23, 2026
1e23ab0
chore(web): update pnpm-lock.yaml
ardaerzin Mar 23, 2026
be58768
refactor(agenta-entities): add explicit type assertions and complete …
ardaerzin Mar 23, 2026
ea30b3a
feat(workflows): add ag-type schema endpoint and catalog filtering
ardaerzin Mar 23, 2026
7c02a31
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
ardaerzin Mar 23, 2026
0d33c96
Merge branch 'frontend-feat/annotation-queues-checkpoint-1' into fron…
ardaerzin Mar 23, 2026
866e1dd
refactor(playground): replace custom header styles with bgColors util…
ardaerzin Mar 23, 2026
12a6df4
Merge branch 'frontend-feat/annotation-queues-checkpoint-1' into fron…
ardaerzin Mar 23, 2026
924281e
Initial specs
jp-agenta Mar 23, 2026
7e61712
initial implementation
jp-agenta Mar 23, 2026
2ef48ef
feat(entities): add JSON Schema $ref resolution utilities
ardaerzin Mar 23, 2026
a8e20ab
feat(entities): add resolveOutputSchema and fix metric resolution
ardaerzin Mar 23, 2026
20b85ac
refactor(entities): change invocation URL to direct service endpoint
ardaerzin Mar 23, 2026
6abcad6
refactor(playground): update execution payload format for direct invoke
ardaerzin Mar 23, 2026
86c347a
fix(ui): minor UI fixes and code cleanup
ardaerzin Mar 23, 2026
b2b19a8
snippet to config,
jp-agenta Mar 24, 2026
70b9ab0
fix comments
jp-agenta Mar 24, 2026
6688705
remove fakeredis from pypoetry.toml
jp-agenta Mar 24, 2026
51929bc
Merge branch 'main' into feat/extend-runs-and-queues
jp-agenta Mar 24, 2026
09fe3c8
Merge branch 'frontend-feat/annotation-queues-checkpoint-1' into fron…
ardaerzin Mar 24, 2026
185f259
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
ardaerzin Mar 24, 2026
c671d1a
refactor(templates): filter out custom templates from template modal
ardaerzin Mar 24, 2026
262fc8d
Merge branch 'frontend-feat/evaluator-playground-integration' into fe…
jp-agenta Mar 24, 2026
6fffa27
fix ruff issues
jp-agenta Mar 24, 2026
746a78a
feat(entity-ui): add schema validator utility for JSON config validation
ardaerzin Mar 24, 2026
9ee3ae2
feat(entities): filter system fields (x-ag-*) from input ports extrac…
ardaerzin Mar 24, 2026
6b9d4ec
feat(entity-ui): support x-ag-type: messages in schema detection
ardaerzin Mar 24, 2026
eb1e10c
feat(entity-ui): add config validation UI in PlaygroundConfigSection
ardaerzin Mar 24, 2026
0e3c896
fix(frontend): minor fixes for registry columns, app type, and evalua…
ardaerzin Mar 24, 2026
0c28bd0
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
ardaerzin Mar 24, 2026
8a28b9f
fix inferred url
jp-agenta Mar 24, 2026
1d86cdf
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
ardaerzin Mar 24, 2026
05ac918
fixing catalog (ongoing)
jp-agenta Mar 24, 2026
85998d8
Update docs/design/evaluation-runtime-heartbeats/README.md
jp-agenta Mar 24, 2026
8280174
remove jit
jp-agenta Mar 24, 2026
d9ed2e5
fix comments
jp-agenta Mar 24, 2026
b4929cc
Merge branch 'main' into feat/extend-runs-and-queues
jp-agenta Mar 24, 2026
3bf07f3
refactor(shared): replace @scalar/openapi-parser with custom JSON poi…
ardaerzin Mar 24, 2026
22f5809
chore(oss): remove unused convertToCsv function from fileManipulations
ardaerzin Mar 24, 2026
caa7d27
chore(oss): remove unused VirtualizedSharedEditors component
ardaerzin Mar 24, 2026
d3a09f1
chore(deps): update package dependencies
ardaerzin Mar 24, 2026
3e6086b
fix hashing
jp-agenta Mar 24, 2026
68e9e45
fix inspect
jp-agenta Mar 24, 2026
cf256f2
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
ardaerzin Mar 24, 2026
2e6b5df
quick cleanup
jp-agenta Mar 24, 2026
28bec8a
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
ardaerzin Mar 24, 2026
bc50bdd
initial caching plans
jp-agenta Mar 24, 2026
0375544
Merge branch 'feat/extend-runs-and-queues' into feat/allow-trace-reuse
jp-agenta Mar 24, 2026
fcfe09a
fix ruff
jp-agenta Mar 24, 2026
5e6a07b
clean up logs
jp-agenta Mar 24, 2026
287d84a
Revert "chore(deps): update package dependencies"
ardaerzin Mar 24, 2026
2e01451
fix fake tokens
jp-agenta Mar 24, 2026
cfefd0b
fix non-leaks
jp-agenta Mar 24, 2026
440d9ca
cache and split audit and plans
jp-agenta Mar 24, 2026
0fa2c83
implement caching and splitting ?
jp-agenta Mar 24, 2026
6945d56
add multi input to batch evals
jp-agenta Mar 24, 2026
8ff95e5
quick loops docs
jp-agenta Mar 24, 2026
b60cc31
Merge branch 'feat/extend-runs-and-queues' into feat/allow-trace-reuse
jp-agenta Mar 24, 2026
59c3581
Merge pull request #4053 from Agenta-AI/feat/allow-trace-reuse
jp-agenta Mar 24, 2026
f267ba9
chore(deps): update turbo, dayjs, and jsonrepair versions
ardaerzin Mar 25, 2026
52bd059
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
ardaerzin Mar 25, 2026
8d38e41
move docs
jp-agenta Mar 25, 2026
5e88b21
fix findings
jp-agenta Mar 25, 2026
91c35a2
fixing new findings
jp-agenta Mar 25, 2026
bbf5a9e
fix findings
jp-agenta Mar 25, 2026
fb5172f
Merge branch 'feat/extend-runnables' into feat/extend-runs-and-queues
jp-agenta Mar 25, 2026
c329cf9
feat(workflow): improve execution mode detection and skip redundant s…
ardaerzin Mar 25, 2026
bffaad5
fix(playground): place chat messages at data.messages instead of data…
ardaerzin Mar 25, 2026
c8d1a16
feat(workflow): add is_application flag when creating app from template
ardaerzin Mar 25, 2026
547c9f9
refactor(references): use self-contained query for evaluator workflow…
ardaerzin Mar 25, 2026
766b192
feat(evaluations): add outputs support to step result upsert API
ardaerzin Mar 25, 2026
57b7a3f
fix(evaluations): include revision ID in evaluator row mapping for ne…
ardaerzin Mar 25, 2026
a9a0fa2
refactor(evaluations): pass projectId to evaluator reference resolvers
ardaerzin Mar 25, 2026
6d0497d
refactor(eval-run-details): improve evaluator reference resolution an…
ardaerzin Mar 25, 2026
4246b7d
refactor(custom-workflow): use inspect endpoint for app reachability …
ardaerzin Mar 25, 2026
1660b5d
Merge branch 'main' into feat/extend-runs-and-queues
jp-agenta Mar 25, 2026
31abe0d
fix docs
jp-agenta Mar 25, 2026
629d264
Merge pull request #4047 from Agenta-AI/feat/extend-runs-and-queues
junaway Mar 25, 2026
08d0c35
Merge branch 'main' into feat/extend-runnables
junaway Mar 25, 2026
bdcac56
Fixing comments
jp-agenta Mar 26, 2026
a71de36
CR ongoing (WIP)
jp-agenta Mar 26, 2026
736aaa8
Merge branch 'feat/extend-runnables' of github.com:Agenta-AI/agenta i…
jp-agenta Mar 26, 2026
b279fc6
updating CR.md
jp-agenta Mar 26, 2026
7cf51cb
refactor(workflow-drawer): remove top-level actions and navigation fr…
ardaerzin Mar 26, 2026
5014660
refactor(app-management): filter out builtin LLM templates from app c…
ardaerzin Mar 26, 2026
377efee
fix(playground): suppress workflow draft overlay during evaluation ex…
ardaerzin Mar 26, 2026
6fe4184
Merge branch 'main' into frontend-feat/evaluator-playground-integration
ardaerzin Mar 26, 2026
87aff36
fix default evaluators ?
jp-agenta Mar 26, 2026
91b7c9c
style(frontend): reformat ternary expression in DrawerHeader
ardaerzin Mar 26, 2026
1e6ac75
feat(frontend): enhance TraceSpanDrillInView with simplifyValue helpe…
ardaerzin Mar 26, 2026
2ce0287
refactor(frontend): simplify OverviewTabItem output rendering
ardaerzin Mar 26, 2026
5a801b8
fix(frontend): use maxDepth prop in recursive RenderedValueBlock calls
ardaerzin Mar 26, 2026
a4dee05
feat(frontend): improve TraceSpanDrillInView value rendering with inl…
ardaerzin Mar 26, 2026
1f397da
feat(frontend): refactor OverviewTabItem to render message groups sep…
ardaerzin Mar 26, 2026
4dd898f
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
ardaerzin Mar 26, 2026
440d430
fix taskiq events
jp-agenta Mar 26, 2026
8df33fe
fix(workflow): correct x-ag-type-ref assignment to use agType instead…
ardaerzin Mar 27, 2026
0ff6eab
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
ardaerzin Mar 27, 2026
c97ca39
feat(frontend): strip and preserve agenta_metadata in raw parameter e…
ardaerzin Mar 27, 2026
47112cd
feat(workflow): implement seedCreatedWorkflowCache for immediate app …
bekossy Mar 27, 2026
99807db
Merge branch 'frontend-feat/evaluator-playground-integration' of gith…
bekossy Mar 27, 2026
629d6d8
feat(app-management): enhance app workflow querying with total count …
bekossy Mar 27, 2026
7693d80
feat(playground): force re-mount of editors on draft discard using re…
ardaerzin Mar 27, 2026
ba6c58c
feat(playground): prevent raw editor focus loss by eagerly syncing on…
ardaerzin Mar 27, 2026
94cc398
feat(playground): improve chat mode detection and template variable e…
ardaerzin Mar 27, 2026
a2370e2
refactor(app-selector): migrate app update endpoints from PATCH /apps…
ardaerzin Mar 27, 2026
bd495f0
fix(workflows): preserve unset fields during partial artifact updates…
ardaerzin Mar 27, 2026
7eb8630
Implement `invalidateAppManagementWorkflowQueries` to refresh app man…
ashrafchowdury Mar 27, 2026
94aaab3
Merge branch 'frontend-feat/evaluator-playground-integration' of http…
ashrafchowdury Mar 27, 2026
80231e8
feat(evaluator-playground): integrate app selector and connection log…
ardaerzin Mar 27, 2026
16482b8
feat(evaluator-template): add horizontal scrolling and spacing to tem…
ardaerzin Mar 27, 2026
dc821f1
refactor(workflow-revision-adapter): extract getWorkflowDisplayName h…
ardaerzin Mar 27, 2026
f89958e
refactor(testset-selection): replace deprecated type prop with orient…
ardaerzin Mar 27, 2026
bed2499
feat(playground): hide discard draft button for ephemeral workflow re…
ardaerzin Mar 27, 2026
4b2e7e8
refactor(playground): split drawer playground into app and evaluator …
ardaerzin Mar 27, 2026
6cc9451
feat(evaluators): enhance delete modal with entity name/version displ…
ardaerzin Mar 27, 2026
ff31b82
Merge branch 'frontend-feat/evaluator-playground-integration' of http…
ardaerzin Mar 27, 2026
adb03e2
feat(search): implement server-side search filtering across deploymen…
ardaerzin Mar 28, 2026
3f24a65
refactor(ui): update Ant Design component props to v5 API conventions
ardaerzin Mar 28, 2026
f7150ad
feat(annotate-drawer): resolve evaluator schemas from simple API and …
ardaerzin Mar 28, 2026
f0e6ee7
refactor(annotation): replace evaluatorMolecule selectors with workfl…
ardaerzin Mar 28, 2026
9612550
add export configuration to columns and sync row selection with datas…
ashrafchowdury Mar 29, 2026
34d815c
fix
ashrafchowdury Mar 30, 2026
09199a5
Merge branch 'frontend-feat/evaluator-playground-integration' of http…
ardaerzin Mar 30, 2026
9c22929
refactor(evaluators): fix stale callback closure in drawer and auto-i…
ardaerzin Mar 30, 2026
aef08c8
refactor(evaluators): pass workflow ID and variant ID for human evalu…
ardaerzin Mar 30, 2026
1c59228
fix build
ashrafchowdury Mar 30, 2026
d12365a
centerd the modals
ashrafchowdury Mar 30, 2026
6b51460
Fix Dockerfile.dev files
jp-agenta Mar 30, 2026
adb5737
fix app-level refs
jp-agenta Mar 30, 2026
82f22de
annotations > simples traces
jp-agenta Mar 30, 2026
8f7e92f
Merge branch 'main' of https://github.com/Agenta-AI/agenta into front…
ashrafchowdury Mar 30, 2026
e6d2e62
refactor: update DrawerHeader icons to X, reduce icon sizes, and adju…
ashrafchowdury Mar 31, 2026
8fbdb87
enable group display in SelectLLMProviderBase within PlaygroundConfig…
ashrafchowdury Mar 31, 2026
843a781
hiding the deployment button from the evaluator drawer
ashrafchowdury Mar 31, 2026
4875960
fix(playground): prevent horizontal overflow and reset testset state …
ardaerzin Mar 31, 2026
61c56dc
Merge branch 'frontend-feat/evaluator-playground-integration' of http…
ardaerzin Mar 31, 2026
89c37d1
feat(playground): merge evaluator-expected columns into row variable …
ardaerzin Mar 31, 2026
f00030c
fix(playground): fallback to schema-based columns when testcase data …
ardaerzin Mar 31, 2026
2f2c199
refactor(playground): remove unnecessary line break in evaluatorExpec…
ardaerzin Mar 31, 2026
c394b2f
refactor: update UI layout for evaluator header and add compact mode …
ashrafchowdury Mar 31, 2026
4341023
Merge branch 'frontend-feat/evaluator-playground-integration' of http…
ardaerzin Mar 31, 2026
f2b4502
refactor(playground): show evaluator outputs from json_schema config …
ardaerzin Mar 31, 2026
ddceb81
refactor(playground): stabilize displayParameters memo with serialize…
ardaerzin Mar 31, 2026
c578229
merge main
jp-agenta Mar 31, 2026
363265b
improve compare view ui for run
ashrafchowdury Mar 31, 2026
88e4a89
refactor(workflow): invalidate queries via QueryClient to fix cache u…
ardaerzin Mar 31, 2026
40db7b4
refactor(workflow): close drawer and show success message after evalu…
ardaerzin Mar 31, 2026
fe51b1b
CR cleanup (WIP)
jp-agenta Mar 31, 2026
2aff33e
fix conflict
jp-agenta Mar 31, 2026
c6608f7
add focus drawer trigger to execution rows and implement comparison v…
ashrafchowdury Mar 31, 2026
14b3c11
Merge branch 'frontend-feat/evaluator-playground-integration' of http…
ashrafchowdury Mar 31, 2026
122d7f1
basic cleanup
jp-agenta Mar 31, 2026
4b8c9a4
Merge branch 'frontend-feat/evaluator-playground-integration' of gith…
jp-agenta Mar 31, 2026
53ccd7a
merge main
jp-agenta Mar 31, 2026
a1659e7
Merge branch 'frontend-feat/evaluator-playground-integration' into fe…
jp-agenta Mar 31, 2026
cffa215
Merge branch 'frontend-feat/evaluator-playground-integration' of http…
ardaerzin Mar 31, 2026
900e865
refactor(cache): simplify query invalidation and add refetchOnMount f…
ardaerzin Mar 31, 2026
02ece62
update CR/QA skills
jp-agenta Mar 31, 2026
dc08d8b
update CR findings
jp-agenta Mar 31, 2026
02899d9
move skills
jp-agenta Mar 31, 2026
eb571e9
fix skill ref
jp-agenta Mar 31, 2026
7adb91d
CR round 1
jp-agenta Mar 31, 2026
a77147e
update skills
jp-agenta Mar 31, 2026
6580657
feat(drill-in): add external controls API and refactor add property UI
ardaerzin Mar 31, 2026
c3fd772
update skills to clean up open/closed findings
jp-agenta Mar 31, 2026
9fac2cf
Fix some findings
jp-agenta Mar 31, 2026
6a68b08
fix skills and docs
jp-agenta Mar 31, 2026
77e3ce1
Fix F7
jp-agenta Mar 31, 2026
62cff22
Patch F15.
jp-agenta Mar 31, 2026
52df494
refactor(ui): improve drill-in and focus drawer styling and layout co…
ardaerzin Mar 31, 2026
e39015c
update skills
jp-agenta Mar 31, 2026
9e3f743
fixing findings
jp-agenta Mar 31, 2026
2348dc1
Cleaning up F14
jp-agenta Mar 31, 2026
1b4bef0
refactor(ui): remove text-sm class from InlineKeyValue component for …
ardaerzin Mar 31, 2026
cc2356c
refactor(editor): improve error handling in code block language switc…
ardaerzin Mar 31, 2026
12e3d78
Fix new findings
jp-agenta Mar 31, 2026
a05b3dd
Merge branch 'main' into feat/extend-runnables
junaway Apr 1, 2026
ba4eb30
Merge branch 'frontend-feat/evaluator-playground-integration' into fe…
jp-agenta Apr 1, 2026
be30bc3
data migrations
jp-agenta Apr 1, 2026
5d53340
clean up migrations docs
jp-agenta Apr 1, 2026
2c70b19
fix migrations
jp-agenta Apr 1, 2026
2d567f3
update py deps
jp-agenta Apr 1, 2026
95a8fd1
chore(deps): pin axios to exact version and downgrade jotai-immer
ardaerzin Apr 2, 2026
8cbec3c
chore(deps): upgrade jotai-immer to ^0.4.3 and update axios peer depe…
ardaerzin Apr 2, 2026
b4c91e4
refactor(schema): intersect numeric bounds when merging parent/child …
ardaerzin Apr 2, 2026
1dbd279
update URI parsing logic to correctly extract key and version for ser…
ashrafchowdury Apr 2, 2026
9e4be13
revert
ashrafchowdury Apr 2, 2026
bd49c3c
Merge branch 'frontend-feat/evaluator-playground-integration' into fe…
jp-agenta Apr 2, 2026
ca0ec22
introduce evaluator step references and update annotation API to use …
ashrafchowdury Apr 2, 2026
eaa9a7d
refactor: migrate annotation API to use simple/traces backend endpoin…
ashrafchowdury Apr 2, 2026
fc69ab5
fix flags and migrations (WIP)
jp-agenta Apr 2, 2026
0c3acb8
fix regression in migration
jp-agenta Apr 2, 2026
a8215b4
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
ardaerzin Apr 2, 2026
0704b93
refactor(workflow): strip kind segment from service URLs and skip ins…
ardaerzin Apr 2, 2026
a9b9d1f
remove debug console.log from useEvaluatorHeaderReference hook
ardaerzin Apr 2, 2026
2dca9d1
fix warnings and evaluators tabs filtering
jp-agenta Apr 3, 2026
957fc16
many FE fixes with workflows
jp-agenta Apr 3, 2026
c39527c
Merge branch 'frontend-feat/evaluator-playground-integration' into fe…
jp-agenta Apr 3, 2026
e9caa72
fix auto/human evaluators creation
jp-agenta Apr 3, 2026
7334172
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
jp-agenta Apr 3, 2026
31a1632
fix evaluations
jp-agenta Apr 3, 2026
7e981fa
add logs
jp-agenta Apr 3, 2026
d9a624a
Merge branch 'feat/extend-runnables' into frontend-feat/evaluator-pla…
jp-agenta Apr 3, 2026
f6da89c
Merge branch 'main' into frontend-feat/evaluator-playground-integration
ardaerzin Apr 3, 2026
1fa63f7
update dependencies: bump turbo, jotai-immer, lucide-react, dayjs, an…
ardaerzin Apr 3, 2026
b69234c
add selectOnRowClick option to InfiniteVirtualTable row selection for…
ardaerzin Apr 4, 2026
e75ca04
fix merge conflicts and migration
jp-agenta Apr 4, 2026
659058c
Merge branch 'frontend-feat/evaluator-playground-integration' of gith…
jp-agenta Apr 4, 2026
c16cf9c
update dependencies: bump jotai, react, antd, tanstack/react-query, e…
ardaerzin Apr 4, 2026
736f723
change row selection from radio to checkbox across testsets and evalu…
ardaerzin Apr 4, 2026
8ade0b9
Merge branch 'frontend-feat/evaluator-playground-integration' into fe…
jp-agenta Apr 6, 2026
8701400
fix migrations ?
jp-agenta Apr 6, 2026
644526d
fixing more migrations
jp-agenta Apr 6, 2026
46ae7b7
las migration fix ?
jp-agenta Apr 6, 2026
61dcb90
Merge branch 'feat/extend-runnables' into chore/update-py-deps-2026-0…
jp-agenta Apr 6, 2026
6862168
Merge pull request #4100 from Agenta-AI/chore/update-py-deps-2026-04-01
jp-agenta Apr 6, 2026
d50db1d
fix py deps
jp-agenta Apr 6, 2026
4 changes: 4 additions & 0 deletions .gitleaksignore
@@ -282,3 +282,7 @@ bb4b06cd13e5aca1c75886990f164b643360da2b:sdk/tests/legacy/debugging/simple-app/c
ce2aa0c2d9990d25d5771b65e0fad6db01518e14:sdk/tests/integration/conftest.py:agenta-api-key:23
a00f015276504fbf7a4820b26d17eb725c63635b:bench_traces.py:generic-api-key:19
a00f015276504fbf7a4820b26d17eb725c63635b:bench_bulk_insert.py:generic-api-key:304
70b9ab08f0f2c96f78ea436691ea161958ff9f18:docs/design/evaluation-runtime-heartbeats/README.md:generic-api-key:206
70b9ab08f0f2c96f78ea436691ea161958ff9f18:docs/design/evaluation-runtime-heartbeats/README.md:generic-api-key:234
70b9ab08f0f2c96f78ea436691ea161958ff9f18:docs/design/evaluation-runtime-heartbeats/plan.md:generic-api-key:46
70b9ab08f0f2c96f78ea436691ea161958ff9f18:docs/design/evaluation-runtime-heartbeats/plan.md:generic-api-key:58
27 changes: 22 additions & 5 deletions .pre-commit-config.yaml
@@ -1,11 +1,28 @@
 repos:
-  - repo: https://github.com/astral-sh/ruff-pre-commit
-    rev: v0.11.2
-    hooks:
-      - id: ruff-format
-      - id: ruff
   - repo: local
     hooks:
+      - id: ruff-format
+        name: ruff format
+        entry: python3 -m ruff format
+        language: system
+        types: [python]
+      - id: ruff-check
+        name: ruff check
+        entry: python3 -m ruff check
+        language: system
+        types: [python]
Comment on lines 2 to +13
Copilot AI Mar 19, 2026

Switching to language: system hooks makes pre-commit non-hermetic: contributors/CI must have compatible ruff/node/turbo versions installed globally, and hook behavior can drift over time. Prefer using the official ruff-pre-commit repo with a pinned rev (or language: python with additional_dependencies) and similarly pin node tooling via a managed runner (or a pre-commit mirror) to keep formatting/lint stable and reproducible.

Suggested change
-  - repo: local
-    hooks:
-      - id: ruff-format
-        name: ruff format
-        entry: ruff format
-        language: system
-        types: [python]
-      - id: ruff-check
-        name: ruff check
-        entry: ruff check
-        language: system
-        types: [python]
+  - repo: https://github.com/astral-sh/ruff-pre-commit
+    rev: v0.6.9
+    hooks:
+      - id: ruff-format
+      - id: ruff
+  - repo: local
+    hooks:

Comment on lines 1 to +13
Copilot AI Mar 31, 2026

Switching to language: system unpins tool versions, which makes pre-commit behavior depend on each developer/CI environment (and can cause inconsistent formatting/lint results). If consistent results are important, prefer using the upstream pre-commit repo with a pinned rev, or enforce a pinned toolchain version in CI and developer setup.
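
One way to act on the second option in the comment above (enforcing a pinned toolchain version in CI and developer setup) is a small guard that compares the installed ruff against an agreed version. The sketch below is illustrative only: the pinned value simply reuses the `v0.11.2` rev of the removed upstream hook, and the helper names are hypothetical, not part of this PR.

```python
import re
import subprocess
from typing import Optional

# Assumption: the version to enforce; 0.11.2 mirrors the rev of the removed upstream hook.
PINNED_RUFF = "0.11.2"


def parse_ruff_version(output: str) -> Optional[str]:
    """Return 'X.Y.Z' from `ruff --version` output such as 'ruff 0.11.2'."""
    match = re.search(r"\bruff\s+(\d+\.\d+\.\d+)", output)
    return match.group(1) if match else None


def installed_ruff_version() -> Optional[str]:
    """Query ruff through the same entry point the local hooks use (`python3 -m ruff`)."""
    try:
        result = subprocess.run(
            ["python3", "-m", "ruff", "--version"],
            capture_output=True,
            text=True,
            check=False,
        )
    except FileNotFoundError:
        # python3 itself is not on PATH.
        return None
    return parse_ruff_version(result.stdout)


def matches_pin(output: str, pinned: str = PINNED_RUFF) -> bool:
    """True when the reported version equals the pinned one exactly."""
    return parse_ruff_version(output) == pinned
```

A CI step or setup script could call `installed_ruff_version()` and fail when the result differs from `PINNED_RUFF`. This only detects drift; it does not install the pinned version.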

+      - id: prettier-format
+        name: prettier --write (web)
+        entry: bash -c 'cd web && pnpm exec prettier --write --cache --log-level warn "**/*.{js,jsx,cjs,mjs,ts,tsx,cts,mts,mdx}" --ignore-path ./ee/.gitignore --ignore-path ./oss/.gitignore'
+        language: system
Comment thread
junaway marked this conversation as resolved.
+        files: ^web/.*\.(js|jsx|cjs|mjs|ts|tsx|cts|mts|mdx)$
+        pass_filenames: false
+      - id: turbo-lint
+        name: turbo lint (web)
+        entry: bash -c 'cd web && if [ "$(sysctl -in hw.optional.arm64 2>/dev/null || echo 0)" = "1" ] && [ "$(node -p "process.arch")" = "x64" ]; then arch -arm64 pnpm exec turbo run lint; else pnpm exec turbo run lint; fi'
+        language: system
Comment thread
junaway marked this conversation as resolved.
+        files: ^web/.*\.(js|jsx|cjs|mjs|ts|tsx|cts|mts|mdx)$
+        pass_filenames: false
Comment on lines +14 to +25
Comment on lines 2 to +25
Comment thread
junaway marked this conversation as resolved.
       - id: gitleaks-pre-commit
         name: gitleaks git (staged only)
         entry: bash -c 'gitleaks --config .gitleaks.toml --exit-code 1 --verbose git --staged'
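
The one-line shell conditional in the `turbo-lint` hook's `entry` is dense, so here is its branch logic restated as a short Python sketch. It is an illustration under stated assumptions, not code from this PR: the function name is hypothetical, and the two arguments stand in for the outputs of `sysctl -in hw.optional.arm64` and `node -p "process.arch"`.

```python
from typing import List

BASE_COMMAND = ["pnpm", "exec", "turbo", "run", "lint"]


def lint_command(hw_optional_arm64: str, node_arch: str) -> List[str]:
    """Mirror the hook's branch.

    hw_optional_arm64: output of `sysctl -in hw.optional.arm64`; the hook
        substitutes "0" when the sysctl call fails.
    node_arch: output of `node -p "process.arch"`.
    """
    # ARM64 hardware with an x64 Node build: rerun the lint natively under arm64.
    if hw_optional_arm64.strip() == "1" and node_arch.strip() == "x64":
        return ["arch", "-arm64", *BASE_COMMAND]
    # Every other combination runs the plain command.
    return list(BASE_COMMAND)
```

Only the combination of ARM64-capable hardware and an x64 Node process takes the `arch -arm64` path; every other case runs `pnpm exec turbo run lint` directly.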
2 changes: 1 addition & 1 deletion AGENTS.md
@@ -2,7 +2,7 @@

## Dev Environment Tips
- If you make changes to the frontend, make sure to run `pnpm lint-fix` within the web folder
-- If you make changes to the API or SDK, make sure to run `ruff format` and `ruff check --fix` within the SDK or API folder
+- If you make changes to the API or SDK, make sure to run `ruff format` and `ruff check --fix` within the SDK or API folder (run from the repo root: `ruff format` then `ruff check`; fix all errors before committing)
- If you update Ant Design tokens, run `pnpm generate:tailwind-tokens` in the web folder and commit the generated file

## Environment Config Conventions
213 changes: 213 additions & 0 deletions agents/skills/README.md
@@ -0,0 +1,213 @@
# Findings Skills

This folder contains the canonical skill definitions for findings-driven review work in this repo.

The workflow is organized around five generic skills:

1. `scan-codebase`
2. `test-codebase`
3. `sync-findings`
4. `triage-findings`
5. `resolve-findings`

The split is by activity, not by a CR/QA lane name:

- `scan-codebase` is verification-oriented review from code and docs into findings
- `test-codebase` is validation-oriented execution from tests and runtime behavior into findings
- `sync-findings` keeps GitHub and the findings record aligned
- `triage-findings` is the discussion and planning layer
- `resolve-findings` is the execution layer back into code, tests, and docs

All skills accept optional `path=`.

- If `path` is provided, use that local design or findings folder.
- If `path` is omitted, infer it and state the inferred value before starting.

Preferred master document:

- `findings.md`

Shared references:

- `shared/references/findings.schema.md`
- `shared/references/findings.lifecycle.md`

## Diagram

```text
Git / PR / Review Threads
^
| sync-findings
v
Code + Docs -- scan-codebase --> Findings <-- test-codebase -- Tests / Runtime / Docs
|
v
triage-findings
|
v
resolve-findings
|
v
Code / Tests / Docs
```

## Invocation

For Codex:

- invoke a skill with `$skill-name`
- `agents/skills/` is the canonical source, but Codex auto-discovers from `~/.codex/skills`
- if a skill does not appear under `$`, refresh the symlink or install it into `~/.codex/skills` and reload Codex

For Claude:

- invoke a project skill with `/project:skill-name` when your Claude client exposes project skills that way
- otherwise use the Claude project skills picker after reloading the project
- `.claude/skills/` contains the Claude-facing wrappers for the canonical skills in `agents/skills/`

## Skills

### `scan-codebase`

Use for a fresh-context scan of code and docs that turns review observations into findings.

Parameters:

- `path=<local-folder>` optional
- `depth=deep` by default
- supported values: `shallow`, `deep`

Use when:

- you want a review pass anchored in current code and docs
- you want verification findings before planning
- you want to surface missing tests or coverage gaps as review findings without running tests yet

For Codex:

```text
$scan-codebase
```

For Claude:

```text
/project:scan-codebase
```

### `test-codebase`

Use to run or inspect the relevant validation paths and turn failures, regressions, or missing coverage into findings.

Parameters:

- `path=<local-folder>` optional
- `depth=deep` by default
- supported values: `shallow`, `deep`

Use when:

- you want validation findings from actual test execution or targeted repro
- you want to confirm whether missing or broken behavior is observable
- you want missing test coverage turned into findings

For Codex:

```text
$test-codebase
```

For Claude:

```text
/project:test-codebase
```

### `sync-findings`

Use to sync the findings record against local review artifacts and optionally a GitHub PR.

Parameters:

- `path=<local-folder>` optional
- `url=<github-pr-url>` for remote + local sync
- omitted `url` means local-only sync

Use when:

- you want findings updated from open PR comments or local notes
- you want clearly closed threads acknowledged and resolved
- you want the master findings file to match current GitHub state and current local code state

For Codex:

```text
$sync-findings
```

For Claude:

```text
/project:sync-findings
```

### `triage-findings`

Use to coordinate the next review or testing actions with the user and turn findings into a plan.

Parameters:

- `path=<local-folder>` optional
- `url=<github-pr-url>` optional when PR context matters

Use when:

- you need follow-up questions answered before acting
- you need to decide whether to run `scan-codebase`, `test-codebase`, or `sync-findings`
- you need severity, confidence, status, owner questions, and next action clarified

For Codex:

```text
$triage-findings
```

For Claude:

```text
/project:triage-findings
```

### `resolve-findings`

Use to implement the chosen fix path for findings and update the findings record afterward.

Parameters:

- `path=<local-folder>` optional
- default `priority=next-highest`
- explicit values: `P0`, `P1`, `P2`, `P3`, `all`

Default behavior:

- if unresolved findings exist at `P0`, resolve `P0` on this run
- if `P0` is exhausted, the next run resolves `P1`
- then `P2`, then `P3`
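
The bucket selection described above can be sketched in a few lines of Python. This is a minimal illustration, assuming the unresolved findings are available as a list of priority labels; how the findings record is actually parsed is not specified here, and the function name is hypothetical.

```python
from typing import Iterable, List

PRIORITY_ORDER = ("P0", "P1", "P2", "P3")


def select_buckets(priority: str, unresolved: Iterable[str]) -> List[str]:
    """Return the priority buckets to resolve on this run.

    priority: "next-highest" (the default), "all", or an explicit level.
    unresolved: priority labels of the findings that are still open.
    """
    present = set(unresolved)
    remaining = [level for level in PRIORITY_ORDER if level in present]
    if priority == "next-highest":
        # Only the highest bucket that still has open findings.
        return remaining[:1]
    if priority == "all":
        return remaining
    if priority in PRIORITY_ORDER:
        return [priority] if priority in present else []
    raise ValueError(f"unsupported priority: {priority!r}")
```

With open findings at `P1` and `P2`, `select_buckets("next-highest", ["P2", "P1"])` selects only `P1`; a later run, once `P1` is exhausted, would select `P2`.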

Use when:

- findings are implementation-ready or nearly so
- you want code, test, and docs changes applied
- you want targeted verification or validation rerun after the fix

For Codex:

```text
$resolve-findings
```

For Claude:

```text
/project:resolve-findings
```
72 changes: 72 additions & 0 deletions agents/skills/resolve-findings/SKILL.md
@@ -0,0 +1,72 @@
---
name: resolve-findings
description: Resolve findings by implementing the chosen fix path in code, tests, or docs. Accept optional `path` and a `priority` selector; by default resolve only the next highest remaining priority bucket, in order `P0`, `P1`, `P2`, `P3`. Also accept explicit levels or `all`. Default to `path=infer`. Confirm effective variables before starting.
---

# Resolve Findings

Read these shared references when needed:

- `../shared/references/findings.schema.md`
- `../shared/references/findings.lifecycle.md`

## Role

Resolve is the execution mode: it carries findings back into code, tests, and docs.

- It may change production code for verification findings.
- It may change tests or test harnesses for validation findings.
- It should update the active findings record after implementation and rerun targeted checks when feasible.

## Priority Input

Accept a `priority` parameter from the prompt:

- omitted priority: resolve the next highest remaining bucket only
- explicit level: `P0`, `P1`, `P2`, or `P3`
- `all`: resolve all remaining buckets

Default:

- `priority=next-highest`

## Path Input

Accept an optional `path` from the prompt.

- If `path` is provided, use it as the local design or findings folder.
- If `path` is omitted, infer it from the branch, subsystem, or matching docs and state the inferred value before starting.

Default:

- `path=infer`
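
To make the two inputs and their defaults concrete, here is a rough Python sketch of how `key=value` parameters in a prompt could be turned into the effective variables. It is an assumption-laden illustration: the skill does not define a parser, the function name is hypothetical, and values containing spaces are not handled.

```python
from typing import Dict

# Defaults as stated in the Priority Input and Path Input sections.
DEFAULTS = {"path": "infer", "priority": "next-highest"}


def effective_variables(prompt: str) -> Dict[str, str]:
    """Overlay `key=value` tokens found in the prompt on top of the defaults."""
    values = dict(DEFAULTS)
    for token in prompt.split():
        key, sep, value = token.partition("=")
        # Ignore tokens that are not known parameters or have an empty value.
        if sep and key in values and value:
            values[key] = value
    return values
```

When `path` stays at `infer`, the agent still has to work out the actual folder from the branch, subsystem, or matching docs and state it before starting; this sketch only covers reading the parameters.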

## Workflow

1. Determine the selected bucket.
Confirm the effective variables first:
- `path`
- `priority`
- target findings files when inferable

Use the requested `priority`, or the next highest unresolved bucket in the active findings record.

2. Load the active findings record.
Use `path/findings.md`.

3. Check readiness before coding.
If the intended resolution path, policy boundary, or data contract is still ambiguous, ask the next follow-up question before editing.

4. Implement the selected fixes.
Make the smallest coherent set of code, test, and doc changes needed for the selected findings bucket.

5. Re-run targeted checks.
Use the narrowest useful verification or validation pass that demonstrates the fix.

6. Update the findings record.
Move findings between open and closed sections, preserve notes and open questions ordering, and record what was fixed or what remains blocked.

## Rules

- Do not hide ambiguity behind `open` or `needs-user-decision` when the user already started answering. Ask the next concrete question.
- Do not silently widen scope from the selected priority bucket unless the fix is tightly coupled.
7 changes: 7 additions & 0 deletions agents/skills/resolve-findings/agents/openai.yaml
@@ -0,0 +1,7 @@
interface:
display_name: "Resolve Findings"
short_description: "Implement fixes from findings"
default_prompt: "Use $resolve-findings with priority=next-highest to implement the next unresolved findings bucket and update the findings record."

policy:
allow_implicit_invocation: true