| .. |
|
amd_seb
|
kernel4 and 5 in uops (#11411)
|
2025-07-28 19:35:48 -07:00 |
|
max_kernels
|
extra/gemm/max_matmul: start of custom kernels for GEMM (#6926)
|
2025-03-19 15:04:57 +08:00 |
|
.gitignore
|
fast amd gemm (#9318)
|
2025-03-03 12:01:14 +08:00 |
|
amd_matmul.py
|
kernel4 and 5 in uops (#11411)
|
2025-07-28 19:35:48 -07:00 |
|
amd_uop_matmul.py
|
check elements_per_thread in tensorcore [pr] (#11435)
|
2025-07-30 11:55:48 -07:00 |
|
amx.py
|
rename allocator methods to not conflict [pr] (#7788)
|
2024-11-20 00:10:29 +08:00 |
|
cuda_matmul.py
|
rename allocator methods to not conflict [pr] (#7788)
|
2024-11-20 00:10:29 +08:00 |
|
fuzz_matmul.py
|
acc_dtype -> dtype (#9402)
|
2025-03-10 16:05:30 -04:00 |
|
gemm.c
|
only 62 gflops (#2629)
|
2023-12-05 13:28:24 -08:00 |
|
gemm.py
|
only 62 gflops (#2629)
|
2023-12-05 13:28:24 -08:00 |
|
halide_gemm.py
|
add halide example (#10980)
|
2025-06-26 16:14:57 -07:00 |
|
hip_matmul.py
|
rename allocator methods to not conflict [pr] (#7788)
|
2024-11-20 00:10:29 +08:00 |
|
intel_xmx.py
|
Intel XMX Tensor Core Support (#5622)
|
2024-08-16 09:19:21 -07:00 |
|
jax_pmatmul.py
|
jax parallel matmul example
|
2023-11-28 13:48:11 -08:00 |
|
max_matmul.py
|
remove Kernel.get_program [pr] (#11203)
|
2025-07-12 20:50:29 -04:00 |
|
metal_conv.py
|
create engine folder and move code (#3948)
|
2024-03-26 20:38:03 -07:00 |
|
metal_matmul.py
|
compile fixes (#10442)
|
2025-06-06 18:38:37 -04:00 |
|
metal_matvec.py
|
compile fixes (#10442)
|
2025-06-06 18:38:37 -04:00 |
|
mlx_matmul.py
|
mlx benchmark, a lil slower than tg
|
2023-12-05 19:00:43 -08:00 |
|
real_pmatmul.py
|
pmatmul example + GB/s bugfix [run_process_replay] (#5974)
|
2024-08-07 22:32:11 -07:00 |
|
simple_conv.py
|
acc_dtype -> dtype (#9402)
|
2025-03-10 16:05:30 -04:00 |
|
simple_matmul.py
|
move kernel to opt (#10899)
|
2025-06-20 15:22:28 -07:00 |
|
simple_matvec.py
|
acc_dtype -> dtype (#9402)
|
2025-03-10 16:05:30 -04:00 |
|
tf_gemm.py
|
Add tensorflow GEMM benchmark script (#1000)
|
2023-06-18 10:57:45 -07:00 |
|
tinygrad_nv_matmul.py
|
remove Kernel.get_program [pr] (#11203)
|
2025-07-12 20:50:29 -04:00 |
|
torch_gemm.py
|
speed docs + upgrades [pr] (#8964)
|
2025-02-08 17:28:52 +08:00 |
|
triton_nv_matmul.py
|
BufferSpec and ProgramSpec [pr] (#7814)
|
2024-11-21 12:18:05 +08:00 |
|
tvm_gemm.py
|
DEPRECATED_linearize -> to_program [pr] (#11198)
|
2025-07-12 13:46:20 -04:00 |