open-r1/tests
lewtun 9366aa2df3
Add dataset mixer (#647)
* Prototype

* Clean up

* Refactor

* Add tests

* Add doc and make scripts work

* Tune doc

* Up

* Tune

* Add column verification

* Fix types

* Fix YAML

* Fix types

* Fix doc

* f

* f
2025-05-20 11:40:42 +02:00
..
slow Fix style again :) (#636) 2025-05-08 16:29:01 +02:00
utils Add dataset mixer (#647) 2025-05-20 11:40:42 +02:00
__init__.py Refactoring reward functions. Adding step by step reasoning reward. Adding test coverage for reward functions (#144) 2025-02-06 20:10:05 +01:00
test_rewards.py soft_overlong_punishment from DAPO paper (#638) 2025-05-09 17:26:34 +02:00