Commit graph

13,706 commits

Author SHA1 Message Date
George Hotz
1168ed9730
Merge branch 'master' into codegen2 2026-06-17 00:37:09 -07:00
George Hotz
be9b570cb2
late numbering of var params (#16640)
* do_number_param

* fix sort order in x86

* we don't want this
2026-06-17 00:36:08 -07:00
qazal
c7055d658f
viz: only store kernel info (#16641) 2026-06-17 16:21:57 +09:00
George Hotz
017edbbbb5 param -1 2026-06-16 21:52:07 -07:00
George Hotz
daa72812b0 add gpu dims 2026-06-16 21:37:59 -07:00
George Hotz
fd325d662c
Merge branch 'master' into codegen2 2026-06-16 21:29:09 -07:00
George Hotz
d631716858
remove const without STACK (#16639)
* remove const without STACK

* fix GEP rewrite

* fix null tests

* fix openpilot regression

* it's 10 in CI
2026-06-16 21:25:42 -07:00
wozeparrot
36f6d1b064
gemm: fix bf16 atb for mp sharding (#16637) 2026-06-16 15:58:47 -07:00
George Hotz
8d36539656 test tiny passes 2026-06-16 14:57:12 -07:00
George Hotz
db2c71536b almost passing 2026-06-16 13:29:38 -07:00
George Hotz
4112b34a32 closer 2026-06-16 13:23:30 -07:00
George Hotz
1ad72dff08 more passing 2026-06-16 12:54:54 -07:00
George Hotz
6f1eaa8d46 fixes 2026-06-16 12:38:17 -07:00
George Hotz
35d2882991 no vec 2026-06-16 10:47:19 -07:00
George Hotz
a31732d819
Merge branch 'master' into codegen2 2026-06-16 10:33:34 -07:00
qazal
1cb6b88d37
viz: show contents of vconst (#16636)
* failing test

* render vconst

* simpler test

* reorder
2026-06-17 02:31:03 +09:00
George Hotz
43d62c4211 hreduce 2026-06-16 09:36:47 -07:00
George Hotz
4d0429090c split reduce types 2026-06-16 09:27:09 -07:00
nimlgen
5644605d92
hcq2: pack bufs (#16635)
* hcq2: pack bufs

* x
2026-06-16 18:58:16 +03:00
George Hotz
2c7a1450e7 fix reduce 2026-06-16 08:40:00 -07:00
chenyu
d5d59a2be6
remove dead rangeify rules [PR] (#16634) 2026-06-16 10:03:08 -04:00
chenyu
f0998e9bba
Revert "invalid clone is anonymous buffer" (#16613) (#16633) 2026-06-16 08:27:48 -04:00
qazal
7d2b0b697d
simple failing test for invalid extra E kernel (#16632)
* simple failing test for invalid extra E kernel

* 6 kernels
2026-06-16 17:57:44 +09:00
wozeparrot
70cac72781
llama: realize weight init (#16623) 2026-06-15 23:00:19 -07:00
Christopher Milan
443f976305
fix buffer overrun in dcache_flush (#16630) 2026-06-15 23:26:32 -04:00
chenyu
aa2bef24a8
no_vectorized_alu in cstyle does nothing now [PR] (#16631) 2026-06-15 23:07:20 -04:00
George Hotz
6ffb55cc74
Merge branch 'master' into codegen2 2026-06-15 17:19:25 -07:00
chenyu
efd03d7153
invalid clone is anonymous buffer [PR] (#16613) 2026-06-15 20:14:26 -04:00
nimlgen
4a0488ae97
hcq2: optims (#16624)
* hcq2: optims

* x
2026-06-15 23:58:28 +03:00
George Hotz
1a280829ca
Merge branch 'master' into codegen2 2026-06-15 12:48:46 -07:00
George Hotz
41aa2fe119
test_gemm needs .clone() on eye (#16629) 2026-06-15 12:48:27 -07:00
George Hotz
3b426b1072 devec 2026-06-15 08:57:52 -07:00
qazal
10bdb9c9d0
viz: check node exists before anchoring zoom (#16627) 2026-06-15 21:03:24 +09:00
qazal
f998b9930a
fp8 gemm inv_scale in epilogue (#16625)
* fuse scale

* remove python inv_scale

* more inv_scale removal

* more cleanups

* cleaner

* diff polish

* work

* rename

* simpler

* simpler

* compute

* c

* Revert "c"

This reverts commit 8941fec7ca.

* Revert "compute"

This reverts commit 9db573a6d3.

* Revert "simpler"

This reverts commit 910ad33f87.

* Revert "simpler"

This reverts commit bf75d235a1.

* s_g

* update types

* less diff noise

* remove
2026-06-15 18:44:41 +09:00
George Hotz
ce2cdc3708
Merge branch 'master' into codegen2 2026-06-14 16:43:48 -07:00
nimlgen
4dc51aff6e
hcq2: jit (#16621)
* hcq2: jit

* x

* x

* minor
2026-06-15 06:35:35 +07:00
chenyu
2adedf5ccb
clean up fold_divmod_general [pr] (#16622)
genralized fold_binary_numerator in fold_divmod_congruence
2026-06-14 17:15:52 -04:00
George Hotz
333f062eee new expander 2026-06-14 13:54:13 -07:00
George Hotz
0d5bf3ca6d revert that 2026-06-14 13:28:28 -07:00
George Hotz
56bad940df disable that 2026-06-14 13:28:02 -07:00
George Hotz
f98deb9250 preprocess 2026-06-14 13:24:19 -07:00
George Hotz
bdfcb1cb98 test ops passes 2026-06-14 12:58:18 -07:00
George Hotz
a6fdb53a1e
Merge branch 'master' into codegen2 2026-06-14 10:09:00 -07:00
George Hotz
a6d7fb9d4d
only SHRINK for non scalar access (#16619) 2026-06-14 10:08:37 -07:00
George Hotz
b1fb39502d delete that test 2026-06-14 09:42:58 -07:00
George Hotz
49deb9714b test_tiny passes 2026-06-14 09:36:51 -07:00
George Hotz
afab220947
Merge branch 'master' into codegen2 2026-06-14 08:52:36 -07:00
chenyu
2e181f4259
simpler cancel_divmod [PR] (#16616) 2026-06-14 11:41:31 -04:00
George Hotz
a7523b2596 simpler 2026-06-13 10:40:52 -07:00
chenyu
5d5ead78da
inline unique_const in invalids [PR] (#16612) 2026-06-13 10:14:32 -04:00