mlperf_bert
New BERT dataloader ( #5881 )
2024-08-02 15:12:23 -04:00
mlperf_resnet
PolynomialDecayWithWarmup + tests ( #3649 )
2024-03-07 18:53:36 -05:00
mlperf_unet3d
[MLPerf] UNet3D dataloader ( #4343 )
2024-04-28 22:34:18 -04:00
openpilot
default threefry ( #6116 )
2024-09-25 17:45:13 +08:00
process_replay
add pr alias [pr] ( #6834 )
2024-10-01 18:48:44 +08:00
external_benchmark_ast.py
refactor to list of kernels [run_process_replay] ( #6403 )
2024-09-08 17:19:45 +08:00
external_benchmark_hip_compile.py
hip compile speed ( #2606 )
2023-12-04 13:47:40 -08:00
external_benchmark_load_stable_diffusion.py
gate METAL_FAST_LOAD
2023-12-01 15:28:40 -08:00
external_benchmark_multitensor_allreduce.py
add RING_ALLREDUCE_THRESHOLD ( #5835 )
2024-07-31 16:13:09 +03:00
external_benchmark_openpilot.py
20 jitted steps in openpilot benchmark ( #6577 )
2024-09-18 02:15:16 -04:00
external_benchmark_resnet.py
ruff: unnecessary-comprehension ( #5174 )
2024-06-27 07:45:29 -04:00
external_benchmark_schedule.py
no global kernel stuff [run_process_replay] ( #6808 )
2024-09-30 13:52:33 +08:00
external_cl_half_max.py
use default dict for external_model_benchmark ( #2592 )
2023-12-03 15:25:43 -08:00
external_gpu_fail_osx.py
new openpilot compile ( #6573 )
2024-09-18 14:22:50 +08:00
external_hip_compiler_bug.py
CompiledASTRunner -> CompiledRunner ( #4148 )
2024-04-11 08:49:52 -07:00
external_jit_failure.py
fix jit realize issue ( #3258 )
2024-01-26 18:27:35 -08:00
external_llama_eval.py
ruff checks the max line length is 150 ( #2734 )
2023-12-12 17:34:47 -08:00
external_metal_compile_fail.py
metal compile fail
2024-07-11 19:27:05 -07:00
external_model_benchmark.py
hotfix: missing return in METAL dm benchmark ( #6749 )
2024-09-26 09:12:38 +08:00
external_multi_gpu.py
move disassemblers and openpilot ( #4592 )
2024-05-14 19:30:02 -07:00
external_osx_profiling.py
move dtypes to dtype.py ( #2964 )
2024-01-01 14:58:48 -08:00
external_test_amd.py
RUF018 assignment-in-assert [run_process_replay] ( #6172 )
2024-08-19 00:34:52 -04:00
external_test_datasets.py
clean up how preprocessed folder is defined ( #5813 )
2024-07-30 12:35:26 -04:00
external_test_embedding.py
make embedding and GPT-2 fast ( #1631 )
2023-08-22 15:14:38 -07:00
external_test_example.py
numpy device + pickle it ( #4120 )
2024-04-09 13:19:30 -07:00
external_test_hcq.py
RUF018 assignment-in-assert [run_process_replay] ( #6172 )
2024-08-19 00:34:52 -04:00
external_test_hip_compile.py
lowerer is kernel [run_process_replay] ( #5437 )
2024-07-12 18:50:55 -07:00
external_test_hsa_driver.py
Rename tinygrad/runtime/driver to support ( #5413 )
2024-07-12 11:06:42 -07:00
external_test_image.py
fixes ( #1893 )
2023-09-22 07:20:27 +08:00
external_test_jit_on_models.py
Pulled CLIP and UNet into Seperate Files ( #5253 )
2024-07-01 22:33:01 -04:00
external_test_llama3_ff.py
work to make GEMV fast ( #5824 )
2024-07-30 17:41:40 -07:00
external_test_lm_head.py
isolate the 134ms kernel in train_gpt2.py ( #4773 )
2024-05-29 17:26:24 -04:00
external_test_losses.py
[MLPerf][UNet3D] Add DICE loss + metrics ( #4204 )
2024-04-17 20:09:33 -04:00
external_test_mamba.py
external that test
2024-03-29 19:35:50 -07:00
external_test_metrics.py
Convert BinaryOps.DIV to UnaryOps.RECIP and BinaryOps.IDIV ( #4887 )
2024-06-14 02:43:46 -07:00
external_test_mnist_data_select.py
add quick external data select test
2024-03-02 05:38:32 -08:00
external_test_nv.py
RUF018 assignment-in-assert [run_process_replay] ( #6172 )
2024-08-19 00:34:52 -04:00
external_test_onnx_backend.py
Tensor.prod ( #6250 )
2024-08-23 10:06:32 -04:00
external_test_opt.py
test: put conv in one reduce ( #4441 )
2024-07-22 12:16:13 +03:00
external_test_optim.py
improve test_dropout_on_shard ( #4912 )
2024-06-11 11:36:02 -04:00
external_test_speed_llama.py
all realize 2 ( #4527 )
2024-05-10 22:43:09 -07:00
external_test_speed_theoretical.py
test flops (and allow wide ALU in UOps) [run_process_replay] ( #5749 )
2024-07-26 21:07:28 -07:00
external_test_uops_graphing.py
lowerer is kernel [run_process_replay] ( #5437 )
2024-07-12 18:50:55 -07:00
external_test_valid_remove.py
add an example that idx is const and valid cannot be removed ( #6625 )
2024-09-20 05:46:27 -04:00
external_test_whisper_librispeech.py
names shadowing builtins ( #5179 )
2024-06-27 08:15:01 -04:00
external_test_yolo.py
move to new cached fetch ( #2493 )
2023-11-28 17:36:55 -08:00
external_test_yolov8.py
ruff checks the max line length is 150 ( #2734 )
2023-12-12 17:34:47 -08:00
fuzz_graph.py
graph fuzzer ( #5082 )
2024-06-21 18:47:23 +03:00
fuzz_kfd.py
add _alloc_signal/_free_signal to hcq ( #5264 )
2024-07-02 23:35:39 +03:00
fuzz_linearizer.py
uop resolve [run_process_replay] ( #6826 )
2024-10-01 13:11:42 +08:00
fuzz_schedule.py
give EXT schedules metadata [pr] ( #6865 )
2024-10-03 20:14:18 +08:00
fuzz_shapetracker.py
minor improvements ( #2845 )
2023-12-18 22:09:08 -08:00
fuzz_shapetracker_math.py
tinytqdm.set_description and tinytrange ( #5101 )
2024-06-22 14:45:06 -04:00
fuzz_symbolic.py
switch symbolic from old to uops, final PR ( #6872 )
2024-10-04 16:42:27 +08:00
fuzz_uops.py
UOpGraph -> linearize_uop [run_process_replay] ( #6119 )
2024-08-16 19:48:39 -07:00
graph_batchnorm.py
with Tensor.train() ( #1935 )
2023-09-28 18:02:31 -07:00
speed_beam_v_hcopt.py
move graph/search to engine ( #4596 )
2024-05-14 23:12:59 -07:00
speed_compare_cuda_nv.py
move colorize_float to helpers.py ( #5490 )
2024-07-15 11:29:03 -07:00
speed_compare_cuda_ptx.py
move colorize_float to helpers.py ( #5490 )
2024-07-15 11:29:03 -07:00
verify_kernel.py
pretty print lazy op per default ( #5505 )
2024-07-18 09:34:08 -07:00