Commit graph

6 commits

Author SHA1 Message Date
qazal
452c7d4230
llama: don't allocate grad_xw13 in bf16 (#16359) 2026-05-28 04:33:07 +09:00
qazal
eecd4706ff
fix mailbox comment, add types (#16360) 2026-05-25 22:24:00 +09:00
wozeparrot
e97f2c1114
llama: only gemm + fa custom kernel (#16180)
* llama: tie store to grad directly

* llama: set mp flags

* llama: non fused grad fp8 quantize path
2026-05-12 21:03:49 -07:00
wozeparrot
ab6218bc92
llama mp fixes (#16050) 2026-05-05 15:35:32 -07:00
wozeparrot
ef09071073
llama: speed 2 (#15960) 2026-04-28 20:44:37 -07:00
wozeparrot
5e861cd2c4
llama: move llama kernels to llama_kernels (#15952) 2026-04-27 22:48:53 -07:00