open-r1/scripts
Edward Beeching ea5b7edf22
Add dataset filtering script (#637)
* add dataset filtering script

* remove subset selection

* save wip

* save wip

* update filter script

* refactor to run on chunks

* rename script

* cleanup

* update dapo filtering

* fixes

* dapo filt config

* update compute pass rate

* clean

* update readme and config

* add merging snippet
2025-05-16 10:26:49 +02:00
pass_rate_filtering          Add dataset filtering script (#637)               2025-05-16 10:26:49 +02:00
benchmark_e2b.py             Code Execution using Morph Cloud (#614)           2025-05-08 08:59:54 +02:00
decontaminate.py             Enable decontamination on dataset configs (#460)  2025-03-04 09:22:01 +01:00
e2b_router.py                Code Execution using Morph Cloud (#614)           2025-05-08 08:59:54 +02:00
generate_reasoning.py        Fix uuid in the data generator (#284)             2025-02-11 14:08:46 +01:00
get_tensor_parallel_size.py  [WIP] RL goes brrr (#533)                         2025-03-24 15:15:02 +01:00
morph_router.py              Code Execution using Morph Cloud (#614)           2025-05-08 08:59:54 +02:00
run_benchmarks.py            use ruff (#137)                                   2025-01-31 13:36:08 +01:00
upload_details.py            move details script and fix wandb logging (#314)  2025-02-13 11:13:00 +01:00