mlperf_bert
fix TRAIN_BEAM and Tensor.training for mlperf bert ( #4525 )
2024-05-11 00:18:36 -04:00
mlperf_resnet
PolynomialDecayWithWarmup + tests ( #3649 )
2024-03-07 18:53:36 -05:00
mlperf_unet3d
[MLPerf] UNet3D dataloader ( #4343 )
2024-04-28 22:34:18 -04:00
external_benchmark_hip_compile.py
hip compile speed ( #2606 )
2023-12-04 13:47:40 -08:00
external_benchmark_load_stable_diffusion.py
gate METAL_FAST_LOAD
2023-12-01 15:28:40 -08:00
external_benchmark_multitensor_allreduce.py
move image into tensor.py. delete features ( #4603 )
2024-05-15 10:50:25 -07:00
external_benchmark_resnet.py
add mem/s to external_benchmark_resnet ( #4309 )
2024-04-26 20:07:17 -04:00
external_benchmark_schedule.py
add cpu objdump to LLVM/CLANG ( #4537 )
2024-05-11 14:28:44 -07:00
external_cl_half_max.py
use default dict for external_model_benchmark ( #2592 )
2023-12-03 15:25:43 -08:00
external_hip_compiler_bug.py
CompiledASTRunner -> CompiledRunner ( #4148 )
2024-04-11 08:49:52 -07:00
external_jit_failure.py
fix jit realize issue ( #3258 )
2024-01-26 18:27:35 -08:00
external_llama_eval.py
ruff checks the max line length is 150 ( #2734 )
2023-12-12 17:34:47 -08:00
external_model_benchmark.py
Add driving monitoring model to benchmarks ( #4134 )
2024-04-10 14:27:03 -04:00
external_multi_gpu.py
move disassemblers and openpilot ( #4592 )
2024-05-14 19:30:02 -07:00
external_osx_profiling.py
move dtypes to dtype.py ( #2964 )
2024-01-01 14:58:48 -08:00
external_test_amd.py
amd doorbell size is 64bits ( #4448 )
2024-05-06 16:59:59 +03:00
external_test_datasets.py
[MLPerf] UNet3D dataloader ( #4343 )
2024-04-28 22:34:18 -04:00
external_test_embedding.py
make embedding and GPT-2 fast ( #1631 )
2023-08-22 15:14:38 -07:00
external_test_example.py
numpy device + pickle it ( #4120 )
2024-04-09 13:19:30 -07:00
external_test_hcq.py
hcq update queue in place ( #4626 )
2024-05-17 22:18:20 +03:00
external_test_hip_compile.py
add renderer class ( #4524 )
2024-05-10 21:40:02 -07:00
external_test_hsa_driver.py
JitItem -> ExecItem ( #4146 )
2024-04-11 08:24:57 -07:00
external_test_image.py
fixes ( #1893 )
2023-09-22 07:20:27 +08:00
external_test_jit_on_models.py
create engine folder and move code ( #3948 )
2024-03-26 20:38:03 -07:00
external_test_losses.py
[MLPerf][UNet3D] Add DICE loss + metrics ( #4204 )
2024-04-17 20:09:33 -04:00
external_test_mamba.py
external that test
2024-03-29 19:35:50 -07:00
external_test_metrics.py
[MLPerf][UNet3D] Add DICE loss + metrics ( #4204 )
2024-04-17 20:09:33 -04:00
external_test_mnist_data_select.py
add quick external data select test
2024-03-02 05:38:32 -08:00
external_test_nv.py
nv setup constbuf4 ( #4511 )
2024-05-16 10:42:35 +03:00
external_test_onnx_backend.py
update onnx to 1.16.0 ( #4127 )
2024-04-10 11:19:13 -04:00
external_test_opt.py
pick schedule tests from external_test_opt ( #4615 )
2024-05-16 15:43:41 +03:00
external_test_optim.py
optimizer shouldn't be run without training ( #4460 )
2024-05-06 15:34:12 -07:00
external_test_speed_llama.py
all realize 2 ( #4527 )
2024-05-10 22:43:09 -07:00
external_test_uops_graphing.py
move graph/search to engine ( #4596 )
2024-05-14 23:12:59 -07:00
external_test_whisper_librispeech.py
whisper: support audio >30s ( #2378 )
2023-11-21 14:37:51 -08:00
external_test_yolo.py
move to new cached fetch ( #2493 )
2023-11-28 17:36:55 -08:00
external_test_yolov8.py
ruff checks the max line length is 150 ( #2734 )
2023-12-12 17:34:47 -08:00
fuzz_kfd.py
renamed ( #4260 )
2024-04-23 09:00:28 +04:00
fuzz_linearizer.py
move graph/search to engine ( #4596 )
2024-05-14 23:12:59 -07:00
fuzz_schedule.py
refactor UOps.CONST ( #4639 )
2024-05-18 10:07:36 +03:00
fuzz_shapetracker.py
minor improvements ( #2845 )
2023-12-18 22:09:08 -08:00
fuzz_shapetracker_math.py
touchup canonicalize empty mask ( #3308 )
2024-02-03 21:05:10 -05:00
fuzz_symbolic.py
Variable.num -> NumNode ( #2354 )
2023-11-18 15:45:52 -05:00
fuzz_uops.py
uops fuzzing infra ( #4641 )
2024-05-18 20:19:57 +03:00
graph_batchnorm.py
with Tensor.train() ( #1935 )
2023-09-28 18:02:31 -07:00
replay_schedule.py
_recurse_lb small cleanup ( #4601 )
2024-05-15 19:10:42 +03:00
speed_beam_v_hcopt.py
move graph/search to engine ( #4596 )
2024-05-14 23:12:59 -07:00
speed_compare_cuda_nv.py
nv setup constbuf4 ( #4511 )
2024-05-16 10:42:35 +03:00
speed_compare_cuda_ptx.py
move graph/search to engine ( #4596 )
2024-05-14 23:12:59 -07:00
verify_kernel.py
move graph/search to engine ( #4596 )
2024-05-14 23:12:59 -07:00