tinygrad/extra/gemm
wozeparrot fe9b19b12d
llama: more mp mem fixes (#16701)
* llama: more mp mem fixes

* clean: unused

* fix: batch
2026-06-22 10:54:35 -04:00
..
amd_seb kernel4 and 5 in uops (#11411) 2025-07-28 19:35:48 -07:00
max_kernels extra/gemm/max_matmul: start of custom kernels for GEMM (#6926) 2025-03-19 15:04:57 +08:00
.gitignore mi350x 1tflop bf16 gemm in extra (#13702) 2025-12-28 21:45:42 +09:00
amd_asm_matmul.py remove DEFINE_LOCAL and DEFINE_REG (gpt) (#16673) 2026-06-19 10:07:50 -07:00
amd_copy_matmul.py full_like to CreationMixin [PR] (#16702) 2026-06-22 09:33:23 -04:00
amd_flash_attention.py remove DEFINE_LOCAL and DEFINE_REG (gpt) (#16673) 2026-06-19 10:07:50 -07:00
amd_matmul.py remove execitem (#15932) 2026-04-25 19:33:04 +03:00
amd_uop_matmul.py remove DEVECTORIZE (#16290) 2026-05-20 13:25:49 -07:00
cdna_asm_gemm.py llama: more mp mem fixes (#16701) 2026-06-22 10:54:35 -04:00
cuda_matmul.py rename allocator methods to not conflict [pr] (#7788) 2024-11-20 00:10:29 +08:00
fuzz_matmul.py acc_dtype -> dtype (#9402) 2025-03-10 16:05:30 -04:00
gemm.c only 62 gflops (#2629) 2023-12-05 13:28:24 -08:00
gemm.py only 62 gflops (#2629) 2023-12-05 13:28:24 -08:00
halide_gemm.py add halide example (#10980) 2025-06-26 16:14:57 -07:00
hip_matmul.py rename allocator methods to not conflict [pr] (#7788) 2024-11-20 00:10:29 +08:00
max_matmul.py programinfo (#15942) 2026-04-27 23:12:03 +03:00
metal_conv.py create engine folder and move code (#3948) 2024-03-26 20:38:03 -07:00
metal_matmul.py compile fixes (#10442) 2025-06-06 18:38:37 -04:00
metal_matvec.py compile fixes (#10442) 2025-06-06 18:38:37 -04:00
metal_uop_matmul.py matmul example on metal showing off tensor core (#13033) 2025-10-31 19:40:36 +08:00
mi350x_uop_matmul.py remove DEVECTORIZE (#16290) 2026-05-20 13:25:49 -07:00
mi350x_uop_matmul_2.py remove DEVECTORIZE (#16290) 2026-05-20 13:25:49 -07:00
rdna4_asm_matmul.py remove DEFINE_LOCAL and DEFINE_REG (gpt) (#16673) 2026-06-19 10:07:50 -07:00
real_pmatmul.py pmatmul example + GB/s bugfix [run_process_replay] (#5974) 2024-08-07 22:32:11 -07:00
simple_conv.py acc_dtype -> dtype (#9402) 2025-03-10 16:05:30 -04:00
simple_matmul.py remove schedule from extra/docs/examples (#15929) 2026-04-25 14:09:12 +03:00
simple_matvec.py acc_dtype -> dtype (#9402) 2025-03-10 16:05:30 -04:00
tinygrad_nv_matmul.py remove schedule from extra/docs/examples (#15929) 2026-04-25 14:09:12 +03:00
torch_gemm.py work from benchmarking tinybox red v2 (#13264) 2025-11-13 16:38:40 -08:00
triton_nv_matmul.py remove CompiledRunner (#15970) 2026-04-29 22:45:48 +03:00
tvm_gemm.py remove schedule from extra/docs/examples (#15929) 2026-04-25 14:09:12 +03:00