tinygrad/extra/llama_kernels
2026-05-28 04:33:07 +09:00
cast_amax                       llama: don't allocate grad_xw13 in bf16 (#16359)     2026-05-28 04:33:07 +09:00
fp8_transpose                   llama speed 6 (#16071)                               2026-05-06 20:51:03 -07:00
fused_ce                        fused ce llama kernel in UOps (#16263)               2026-05-20 19:45:28 +09:00
fused_rmsnorm_mul_quantize_fp8  llama mp fixes (#16050)                              2026-05-05 15:35:32 -07:00
quantize_fp8_delayed            quantize_fp8 kernels in uops (#16288)                2026-05-22 20:54:06 +09:00
rmsnorm                         llama: move llama kernels to llama_kernels (#15952)  2026-04-27 22:48:53 -07:00
__init__.py                     llama mp fixes (#16050)                              2026-05-05 15:35:32 -07:00