Skip to content

Conversation

@d42me
Copy link
Contributor

@d42me d42me commented Dec 25, 2025

Closes ENG-2399


Note

Change overview

  • Rename Sample model field metadata to info in models.py (extra fields still allowed via extra="allow").
  • Update README examples to use info when pushing samples.
  • Adjust tests to validate info instead of metadata.

Written by Cursor Bugbot for commit dc42f60. This will update automatically on new commits. Configure here.

@d42me d42me requested a review from JannikSt December 25, 2025 21:46
@d42me d42me merged commit cf24861 into main Dec 30, 2025
11 checks passed
@d42me d42me deleted the fix/eval-sample-type-info branch December 30, 2025 04:05
JannikSt pushed a commit that referenced this pull request Jan 3, 2026
* Update eval sample field.

* Update docs.
JannikSt added a commit that referenced this pull request Jan 3, 2026
* Implement commands for hosted RL

* Hosted RL

* Allow for user to use just
 Usage: prime rl [OPTIONS] ENVIRONMENTS... | COMMAND [ARGS]...

 Manage RL training runs.

 By default, 'prime rl <environments>' runs 'prime rl run <environments>'.

╭─ Options ──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ --help  -h        Show this message and exit.                                                                                                                                                                      │
╰────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Commands ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ run      Create an RL training run with specified environments and model.                                                                                                                                          │
│ models   List available models for RL training.                                                                                                                                                                    │
│ runs     List your RL training runs.                                                                                                                                                                               │
│ stop     Stop an RL training run.                                                                                                                                                                                  │
│ delete   Delete an RL training run.                                                                                                                                                                                │
╰────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯ to start a run

* Support tomls on prime rl cmd

* Minor fix

* Cleanup references to RFT

* Minor improvements

* Fix ruff

* Match post rft run schema to new backend

* Refactor delete_run method to remove return value and simplify success handling in RLClient and related command.

* Fix/prime rl list (#267)

* quick fix for prime rl list when no name set

* remove truncation of id in prime rl list

* Add support for run_config

* feat: add eval_config support to RL API client (#271)

* feat: add eval_config support to RL API client

* Remove accidentally committed test files

* feat: add logs command for RL runs

* fix: move time import to top, add rl_config example

* feat: add --watch flag and improve log streaming

* fix: allow built-in envs like reverse-text, update example

* feat: add --eval-* options to rl run command

* fix: strip ANSI escape codes from logs output

* fix: increase poll interval to 5s, add rate limit handling

* fix: filter progress bars from logs output, remove redundant --watch flag

* fix: keep 100% progress bar completion lines in logs

* fix: address review comments - simplify log follow, warn on unused eval options

* fix: handle log rotation in follow mode when tail window is full

* fix: always use overlap detection for log follow to handle fast growth with rotation

* feat: add [eval] section support in TOML config files

* fix: improve progress bar filtering to remove empty lines

* fix: require owner/name format for environments, remove example config

* fix: use from_sources for eval config merging, require owner/name format

- Use BaseConfig.from_sources for eval config precedence instead of manual if-statements
- Require owner/name format for --eval-envs (same as training environments)
- Rename EvalConfig.eval_base_model to base_model for proper underscore mapping

* prime registry support (#215)

* custom image registry for sandboxes

* prime images

* --image typo

* linux/amd64

* updated to not build locally

* full image path

* rm emojis

* remove inline

* image status

* full image path

* add cleanup

* adjust scope output

* bug bot stuff

* validate_output_format

* bug bot comment

* update prime images list

* limit platform

* bump timeout

* add closed beta info

* Chore/bump version 0.5.8 (#270)

* bump version to 0.5.8

* bump versions

* Fix: Update eval sample field (#265)

* Update eval sample field.

* Update docs.

* Fix: Remove trailing comma from API token URL (#273)

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
Co-authored-by: sami <sami@primeintellect.ai>

---------

Co-authored-by: Johannes Hagemann <johannes@primeintellect.ai>
Co-authored-by: JannikSt <JannikSt@users.noreply.github.com>
Co-authored-by: Jannik Straube <info@jannik-straube.de>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants