mirror of https://github.com/tinygrad/tinygrad.git (synced 2026-06-24 02:14:17 +00:00)
* Ring allreduce v3
* Configurable size, number of gpus and jit in benchmark
* ScheduleBarrier v0
* GB/s that make sense
* ScheduleBarrier v0.1
* Fallback on 2 GPUs
* ScheduleBarrier v0.2
* ScheduleBarrier v0.3
* ScheduleBarrier v0.3.1
* ScheduleBarrier v0.3.2
* Replace ScheduleBarrier with automatic optimization
* unused import
* fix comment
* typing
* better fallback
* python 3.8
* RING=2 and use ContextVar
* DEBUG >= 2 and change name
* linter
* type

---------

Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
Co-authored-by: chenyu <chenyu@fastmail.com>
Co-authored-by: nimlgen <138685161+nimlgen@users.noreply.github.com>

| Name |
|---|
| mlperf_resnet |
| external_benchmark_hip_compile.py |
| external_benchmark_load_stable_diffusion.py |
| external_benchmark_multitensor_allreduce.py |
| external_cl_half_max.py |
| external_hip_compiler_bug.py |
| external_jit_failure.py |
| external_llama_eval.py |
| external_model_benchmark.py |
| external_multi_gpu.py |
| external_osx_profiling.py |
| external_test_embedding.py |
| external_test_example.py |
| external_test_hip_compile.py |
| external_test_hsa_driver.py |
| external_test_image.py |
| external_test_jit_on_models.py |
| external_test_mnist_data_select.py |
| external_test_onnx_backend.py |
| external_test_opt.py |
| external_test_optim.py |
| external_test_speed_llama.py |
| external_test_uops_graphing.py |
| external_test_whisper_librispeech.py |
| external_test_yolo.py |
| external_test_yolov8.py |
| fuzz_linearizer.py |
| fuzz_shapetracker.py |
| fuzz_shapetracker_math.py |
| fuzz_symbolic.py |
| graph_batchnorm.py |
| speed_beam_v_hcopt.py |
| speed_compare_cuda_ptx.py |
| verify_kernel.py |