| .. |
|
mlperf_bert
|
Refactor to class style (#4804)
|
2024-06-04 14:08:31 -07:00 |
|
mlperf_resnet
|
PolynomialDecayWithWarmup + tests (#3649)
|
2024-03-07 18:53:36 -05:00 |
|
mlperf_unet3d
|
[MLPerf] UNet3D dataloader (#4343)
|
2024-04-28 22:34:18 -04:00 |
|
process_replay
|
limit pickled objects [run_process_replay] (#5154)
|
2024-06-26 13:51:32 +03:00 |
|
external_benchmark_hip_compile.py
|
hip compile speed (#2606)
|
2023-12-04 13:47:40 -08:00 |
|
external_benchmark_load_stable_diffusion.py
|
gate METAL_FAST_LOAD
|
2023-12-01 15:28:40 -08:00 |
|
external_benchmark_multitensor_allreduce.py
|
move image into tensor.py. delete features (#4603)
|
2024-05-15 10:50:25 -07:00 |
|
external_benchmark_resnet.py
|
ruff: unnecessary-comprehension (#5174)
|
2024-06-27 07:45:29 -04:00 |
|
external_benchmark_schedule.py
|
graph_dedup function [run_process_replay] (#4955)
|
2024-06-14 04:24:37 -07:00 |
|
external_cl_half_max.py
|
use default dict for external_model_benchmark (#2592)
|
2023-12-03 15:25:43 -08:00 |
|
external_hip_compiler_bug.py
|
CompiledASTRunner -> CompiledRunner (#4148)
|
2024-04-11 08:49:52 -07:00 |
|
external_jit_failure.py
|
fix jit realize issue (#3258)
|
2024-01-26 18:27:35 -08:00 |
|
external_llama_eval.py
|
ruff checks the max line length is 150 (#2734)
|
2023-12-12 17:34:47 -08:00 |
|
external_model_benchmark.py
|
ruff: unnecessary-comprehension (#5174)
|
2024-06-27 07:45:29 -04:00 |
|
external_multi_gpu.py
|
move disassemblers and openpilot (#4592)
|
2024-05-14 19:30:02 -07:00 |
|
external_osx_profiling.py
|
move dtypes to dtype.py (#2964)
|
2024-01-01 14:58:48 -08:00 |
|
external_test_amd.py
|
amd doorbell size is 64bits (#4448)
|
2024-05-06 16:59:59 +03:00 |
|
external_test_datasets.py
|
[MLPerf] UNet3D dataloader (#4343)
|
2024-04-28 22:34:18 -04:00 |
|
external_test_embedding.py
|
make embedding and GPT-2 fast (#1631)
|
2023-08-22 15:14:38 -07:00 |
|
external_test_example.py
|
numpy device + pickle it (#4120)
|
2024-04-09 13:19:30 -07:00 |
|
external_test_hcq.py
|
add _alloc_signal/_free_signal to hcq (#5264)
|
2024-07-02 23:35:39 +03:00 |
|
external_test_hip_compile.py
|
add renderer class (#4524)
|
2024-05-10 21:40:02 -07:00 |
|
external_test_hsa_driver.py
|
JitItem -> ExecItem (#4146)
|
2024-04-11 08:24:57 -07:00 |
|
external_test_image.py
|
fixes (#1893)
|
2023-09-22 07:20:27 +08:00 |
|
external_test_jit_on_models.py
|
Pulled CLIP and UNet into Seperate Files (#5253)
|
2024-07-01 22:33:01 -04:00 |
|
external_test_lm_head.py
|
isolate the 134ms kernel in train_gpt2.py (#4773)
|
2024-05-29 17:26:24 -04:00 |
|
external_test_losses.py
|
[MLPerf][UNet3D] Add DICE loss + metrics (#4204)
|
2024-04-17 20:09:33 -04:00 |
|
external_test_mamba.py
|
external that test
|
2024-03-29 19:35:50 -07:00 |
|
external_test_metrics.py
|
Convert BinaryOps.DIV to UnaryOps.RECIP and BinaryOps.IDIV (#4887)
|
2024-06-14 02:43:46 -07:00 |
|
external_test_mnist_data_select.py
|
add quick external data select test
|
2024-03-02 05:38:32 -08:00 |
|
external_test_nv.py
|
amd support kernel with bf16 (#4863)
|
2024-06-08 22:52:32 +03:00 |
|
external_test_onnx_backend.py
|
remove "no-nans-fp-math"="true" for LLVM (#5282)
|
2024-07-03 17:52:50 -04:00 |
|
external_test_opt.py
|
names shadowing builtins (#5179)
|
2024-06-27 08:15:01 -04:00 |
|
external_test_optim.py
|
improve test_dropout_on_shard (#4912)
|
2024-06-11 11:36:02 -04:00 |
|
external_test_speed_llama.py
|
all realize 2 (#4527)
|
2024-05-10 22:43:09 -07:00 |
|
external_test_uops_graphing.py
|
move graph/search to engine (#4596)
|
2024-05-14 23:12:59 -07:00 |
|
external_test_whisper_librispeech.py
|
names shadowing builtins (#5179)
|
2024-06-27 08:15:01 -04:00 |
|
external_test_yolo.py
|
move to new cached fetch (#2493)
|
2023-11-28 17:36:55 -08:00 |
|
external_test_yolov8.py
|
ruff checks the max line length is 150 (#2734)
|
2023-12-12 17:34:47 -08:00 |
|
fuzz_graph.py
|
graph fuzzer (#5082)
|
2024-06-21 18:47:23 +03:00 |
|
fuzz_kfd.py
|
add _alloc_signal/_free_signal to hcq (#5264)
|
2024-07-02 23:35:39 +03:00 |
|
fuzz_linearizer.py
|
import from uops, not linearizer (#5064)
|
2024-06-20 08:08:44 -07:00 |
|
fuzz_schedule.py
|
prescheduling refactor (#5300)
|
2024-07-06 12:04:03 +03:00 |
|
fuzz_shapetracker.py
|
minor improvements (#2845)
|
2023-12-18 22:09:08 -08:00 |
|
fuzz_shapetracker_math.py
|
tinytqdm.set_description and tinytrange (#5101)
|
2024-06-22 14:45:06 -04:00 |
|
fuzz_symbolic.py
|
Variable.num -> NumNode (#2354)
|
2023-11-18 15:45:52 -05:00 |
|
fuzz_uops.py
|
uops dfs cleanup (#5147)
|
2024-06-25 18:51:42 -07:00 |
|
graph_batchnorm.py
|
with Tensor.train() (#1935)
|
2023-09-28 18:02:31 -07:00 |
|
replay_schedule.py
|
_recurse_lb small cleanup (#4601)
|
2024-05-15 19:10:42 +03:00 |
|
speed_beam_v_hcopt.py
|
move graph/search to engine (#4596)
|
2024-05-14 23:12:59 -07:00 |
|
speed_compare_cuda_nv.py
|
remove numpy from dtype (#4969)
|
2024-06-14 15:38:45 -04:00 |
|
speed_compare_cuda_ptx.py
|
move graph/search to engine (#4596)
|
2024-05-14 23:12:59 -07:00 |
|
verify_kernel.py
|
test/external/verify_kernel: add support for single pickled kernel (#4836)
|
2024-06-04 18:59:21 -04:00 |