lewtun
|
8000dd2384
|
[WIP] RL goes brrr (#533)
* Fix vLLM recipes
* Add vllm server to Slurm
* Add overlap across srun
* Fix NUM_NODES
* Refactor TP to script
* fix train script to work withnew GRPO
* lewis nits
* bump trl, transformers
---------
Co-authored-by: edbeeching <edbeeching@gmail.com>
|
2025-03-24 15:15:02 +01:00 |
|