mirrors/tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-06-24 02:14:17 +00:00

Author	SHA1	Message	Date
chenyu	19eb72ff60	remove use of full with buffer=False and non-None device= (#16489)	2026-06-03 16:21:24 -04:00
qazal	29b47a0057	llama: update local amax implementation after ParamArgs change (#16446) * local amax failing test * update _local_abs_max_fxn	2026-05-30 16:55:43 +09:00
qazal	bbfe4f80ec	quantize_fp8 kernels in uops (#16288) * add tests * simple UOp kernel is n^2 * fast kernel matching c++, opts_to_apply=() * remove cpp * simple o(n) kernel, two passes * fuse the loops * works on DEV=CPU * multi regression test * fix multi, this can possibly be its own bugfix * test cleanups * minimal diff * match C in UOps * Revert "match C in UOps" This reverts commit `0bef740c30`. * edit test * match speed with C try 2 * needs_second_gpu * cleanup	2026-05-22 20:54:06 +09:00
Christopher Milan	172f9493e1	move is_dtype_supported to renderer (#16226)	2026-05-20 21:19:37 -04:00
qazal	1e0fffe256	fused ce llama kernel in UOps (#16263) * work * using uops * delete things * work * work * higher level uops * cleanups	2026-05-20 19:45:28 +09:00