open-r1/scripts
Edward Beeching 1b3bf043dc
Adds a E2B router server that executes batches of scripts (#561)
* adds a dedicated e2b server to handle batches of requests

* fix reward tests

* update slow reward

* style

* updates e2b router to be more generic

* refactor

* refactoring

* licence, cleanup

* update tests

* style

* fix import when e2b not present

* style

* rename sandbox file

* rename to RoutedSandbox

* update readme

* nits

* nits2

* unlimited max time

* update logs path
2025-04-07 21:01:06 +02:00
..
benchmark_e2b.py Async code reward fixes (#546) 2025-03-28 14:08:15 +01:00
decontaminate.py Enable decontamination on dataset configs (#460) 2025-03-04 09:22:01 +01:00
e2b_router.py Adds a E2B router server that executes batches of scripts (#561) 2025-04-07 21:01:06 +02:00
generate_reasoning.py Fix uuid in the data generator (#284) 2025-02-11 14:08:46 +01:00
get_tensor_parallel_size.py [WIP] RL goes brrr (#533) 2025-03-24 15:15:02 +01:00
run_benchmarks.py use ruff (#137) 2025-01-31 13:36:08 +01:00
upload_details.py move details script and fix wandb logging (#314) 2025-02-13 11:13:00 +01:00