Mirror of https://github.com/tinygrad/tinygrad.git (synced 2026-06-24 02:14:17 +00:00)
* ai slop flash attention (it works)
* speed up, 2 TFLOPS + 7 GB/s
* simpler
* simpler
* optimize
* faster
* warp shuffle
* sqtt: link dispatch to exec (#15396)
* sqtt packet linking infra python
* javascript
* ~doubly linked list
* ui works
* work
* exec can also highlight the pc, coloring work
* more work
* rm sqtt/model.py, doesn't need to be upstreamed
* viz: no context enters in cli, update llama profile (#15404)
* removed unused named arg in rules [pr] (#15414)
* viz: sqtt printer in viz/cli.py (#15411)
* work
* sqtt timeline in CLI
* format all printers nicely
* s/Showed/Printed
* ansistrip
* sys.exit
* keep colors in list
* work from amd_copy_matmul
* has_more always gets returned
* linter
* don't print colors
* more colors
* wow this is so deep
* work
* minor details
* selected
* improve progress bar
* remove it
* 22, global_load_vaddr is so long
* remove *0 hack in sign, gradient materializes zeros for unconnected nodes (#15416)

  Amp-Thread-ID: https://ampcode.com/threads/T-019d1612-6322-706b-a94d-a812400a55cb
  Co-authored-by: Amp <amp@ampcode.com>
* works
* cnt=20
* revert that
* uop slice tests
* simpler

---------

Co-authored-by: qazal <77887910+Qazalin@users.noreply.github.com>
Co-authored-by: chenyu <chenyu@fastmail.com>
Co-authored-by: gg <ggordbegli@gmail.com>
Co-authored-by: Amp <amp@ampcode.com>

| Name | | |
|---|---|---|
| amd_seb | ||
| max_kernels | ||
| .gitignore | ||
| amd_asm_matmul.py | ||
| amd_copy_matmul.py | ||
| amd_flash_attention.py | ||
| amd_matmul.py | ||
| amd_uop_matmul.py | ||
| amx.py | ||
| cdna_asm_gemm.py | ||
| cuda_matmul.py | ||
| fuzz_matmul.py | ||
| gemm.c | ||
| gemm.py | ||
| halide_gemm.py | ||
| hip_matmul.py | ||
| intel_xmx.py | ||
| max_matmul.py | ||
| metal_conv.py | ||
| metal_matmul.py | ||
| metal_matvec.py | ||
| metal_uop_matmul.py | ||
| mi350x_uop_matmul.py | ||
| mi350x_uop_matmul_2.py | ||
| real_pmatmul.py | ||
| simple_conv.py | ||
| simple_matmul.py | ||
| simple_matvec.py | ||
| tinygrad_nv_matmul.py | ||
| torch_gemm.py | ||
| triton_nv_matmul.py | ||
| tvm_gemm.py | ||