# Containerized reinforcement learning with human feedback
This directory contains a project that containerizes PyTorch scripts for running your own reward modelling with reinforcement learning with human feedback (RLHF), pointed at any LLM you like.