tinygrad/tinygrad/codegen
George Hotz 65a0a31475
AMD mi350x matmul from stream (#13040)
* works

* working mfma

* 120 TFLOPS

* regs

* 192 TFLOPS

* try pipelining

* something

* notes

* contract

* linter to 3.11

* that was a bug
2025-11-01 17:55:19 +08:00
late AMD mi350x matmul from stream (#13040) 2025-11-01 17:55:19 +08:00
opt AMD mi350x matmul from stream (#13040) 2025-11-01 17:55:19 +08:00
__init__.py prepare for custom kernel (#13029) 2025-10-31 14:47:37 +08:00
gpudims.py more uop programs (#13007) 2025-10-30 14:57:59 +08:00
simplify.py add loads at the end (#12988) 2025-10-30 10:42:19 +08:00