Commit e90ee34

Update README.md
1 parent 361e4ff commit e90ee34

1 file changed: +4 -1 lines changed

9_rlhf/README.md

Lines changed: 4 additions & 1 deletion
@@ -1 +1,4 @@
-
+# Reinforcement learning with human feedback
+Hi there! This directory has a few examples for you. The first is a notebook containing all of the PyTorch code for the entire process, running end-to-end on a single instance. It is called `RLHF_locally.ipynb`, and you should be able to run it from start to finish largely without errors.
+
+As you can imagine, this introduces complexities around managing the software, distributing the models, and so forth. So we're working on another example that containerizes this and makes it easier for you to work with. That's in the subdirectory below, `wip`.
