tinygrad/extra/thunder/amd
2026-06-19 13:28:53 -07:00
..
include faster mxfp8 gemm (#16656) 2026-06-17 22:35:36 -07:00
fa.py llama: less E kernels (#16517) 2026-06-12 19:49:25 +09:00
fa_bwd_causal.cpp update hipkittens (#16544) 2026-06-08 18:53:25 -07:00
fa_bwd_post.cpp hipkittens fa backward (#14723) 2026-02-16 00:38:44 -08:00
fa_bwd_pre.cpp hipkittens fa backward (#14723) 2026-02-16 00:38:44 -08:00
fa_fwd_causal.cpp fix fa forward building with clang 22 (#15124) 2026-03-04 02:32:25 -08:00
gemm_bf16.cpp llama: fix bf16 gemm oob (#16603) 2026-06-12 19:43:05 -07:00
gemm_bf16_atb.cpp llama: a_bT and aT_b bf16 gemms (#16487) 2026-06-04 23:30:21 +09:00
gemm_fp8.cpp fp8 gemm inv_scale in epilogue (#16625) 2026-06-15 18:44:41 +09:00
gemm_mxfp8.cpp gemm: fix mxfp8 on more shapes (#16677) 2026-06-19 13:28:53 -07:00