open-r1/tests
2025-05-19 13:39:10 +00:00
..
slow Fix style again :) (#636) 2025-05-08 16:29:01 +02:00
utils Add column verification 2025-05-19 13:39:10 +00:00
__init__.py Refactoring reward functions. Adding step by step reasoning reward. Adding test coverage for reward functions (#144) 2025-02-06 20:10:05 +01:00
test_rewards.py soft_overlong_punishment from DAPO paper (#638) 2025-05-09 17:26:34 +02:00