tinygrad/extra/gemm
George Hotz 856759c79c
add halide example (#10980)
* add halide example

* upd halide gemm

* partial works

* touchups
2025-06-26 16:14:57 -07:00
max_kernels extra/gemm/max_matmul: start of custom kernels for GEMM (#6926) 2025-03-19 15:04:57 +08:00
.gitignore fast amd gemm (#9318) 2025-03-03 12:01:14 +08:00
amd_matmul.py move kernel to opt (#10899) 2025-06-20 15:22:28 -07:00
amx.py rename allocator methods to not conflict [pr] (#7788) 2024-11-20 00:10:29 +08:00
cuda_matmul.py rename allocator methods to not conflict [pr] (#7788) 2024-11-20 00:10:29 +08:00
fuzz_matmul.py acc_dtype -> dtype (#9402) 2025-03-10 16:05:30 -04:00
gemm.c only 62 gflops (#2629) 2023-12-05 13:28:24 -08:00
gemm.py only 62 gflops (#2629) 2023-12-05 13:28:24 -08:00
halide_gemm.py add halide example (#10980) 2025-06-26 16:14:57 -07:00
hip_matmul.py rename allocator methods to not conflict [pr] (#7788) 2024-11-20 00:10:29 +08:00
intel_xmx.py Intel XMX Tensor Core Support (#5622) 2024-08-16 09:19:21 -07:00
jax_pmatmul.py jax parallel matmul example 2023-11-28 13:48:11 -08:00
kernel8_batched_gmem.s fast amd gemm (#9318) 2025-03-03 12:01:14 +08:00
max_matmul.py move kernel to opt (#10899) 2025-06-20 15:22:28 -07:00
metal_conv.py create engine folder and move code (#3948) 2024-03-26 20:38:03 -07:00
metal_matmul.py compile fixes (#10442) 2025-06-06 18:38:37 -04:00
metal_matvec.py compile fixes (#10442) 2025-06-06 18:38:37 -04:00
mlx_matmul.py mlx benchmark, a lil slower than tg 2023-12-05 19:00:43 -08:00
real_pmatmul.py pmatmul example + GB/s bugfix [run_process_replay] (#5974) 2024-08-07 22:32:11 -07:00
simple_conv.py acc_dtype -> dtype (#9402) 2025-03-10 16:05:30 -04:00
simple_matmul.py move kernel to opt (#10899) 2025-06-20 15:22:28 -07:00
simple_matvec.py acc_dtype -> dtype (#9402) 2025-03-10 16:05:30 -04:00
tf_gemm.py Add tensorflow GEMM benchmark script (#1000) 2023-06-18 10:57:45 -07:00
tinygrad_nv_matmul.py move kernel to opt (#10899) 2025-06-20 15:22:28 -07:00
torch_gemm.py speed docs + upgrades [pr] (#8964) 2025-02-08 17:28:52 +08:00
triton_nv_matmul.py BufferSpec and ProgramSpec [pr] (#7814) 2024-11-21 12:18:05 +08:00
tvm_gemm.py move kernel to opt (#10899) 2025-06-20 15:22:28 -07:00