tinygrad/extra/llama_kernels/cast_amax
qazal f998b9930a
fp8 gemm inv_scale in epilogue (#16625)
* fuse scale

* remove python inv_scale

* more inv_scale removal

* more cleanups

* cleaner

* diff polish

* work

* rename

* simpler

* simpler

* compute

* c

* Revert "c"

This reverts commit 8941fec7ca.

* Revert "compute"

This reverts commit 9db573a6d3.

* Revert "simpler"

This reverts commit 910ad33f87.

* Revert "simpler"

This reverts commit bf75d235a1.

* s_g

* update types

* less diff noise

* remove
2026-06-15 18:44:41 +09:00
__init__.py            fp8 gemm inv_scale in epilogue (#16625)           2026-06-15 18:44:41 +09:00
cast_amax_bwd_w13.cpp  llama: don't allocate grad_xw13 in bf16 (#16359)  2026-05-28 04:33:07 +09:00
cast_amax_fwd_w13.cpp  llama: speed 2 (#15960)                           2026-04-28 20:44:37 -07:00