Mirror of https://github.com/huggingface/open-r1.git, synced 2026-06-24 01:54:06 +00:00

Latest commit:

* adds binary code reward, refactors grpo with get_reward_funcs
* adds return type to the function
* add get_reward_funcs test
* remote type hint
* move script args to another file
* update test

| Name |
|---|
| .. |
| slow |
| __init__.py |
| test_rewards.py |