Commit graph

5,738 commits

Author SHA1 Message Date
Christopher Milan
4d893f626a
move a bunch of test_schedule to null (#16578) 2026-06-11 20:26:34 -04:00
chenyu
5f1e2d3900
PADTO pads Invalids (#16562) 2026-06-11 16:54:26 -04:00
qazal
a83710396c
support mselect input to CALL, less kernels in allreduce (#16567)
* support mselect input to CALL, less kernels in allreduce

* resolve mstack
2026-06-11 18:10:47 +09:00
qazal
21f1101691
add allreduce kernel count test (#16566) 2026-06-11 15:54:12 +09:00
wozeparrot
c38d6a7e3a
mxfp8 part 2 (#16561) 2026-06-10 23:36:11 -07:00
George Hotz
7e6d617935
addrspace cleanups (#16565)
* addrspace cleanups

* bumps

* eh, relax a little
2026-06-10 15:57:18 -07:00
qazal
34481830f1
rangeify: fix cost function for AFTER(out, CALL) (#16559)
* simple failing test

* fix rangeify cost function

* new ops count
2026-06-10 17:30:50 +09:00
Christopher Milan
f9d88d3c3a
fix race in test_quantize_onnx (#16555) 2026-06-09 18:39:48 -04:00
wozeparrot
2bdc360606
gemm: mxfp8 hipkittens gemm (#16541)
* gemm: mxfp8 hipkittens gemm

* feat: update hipkittens

* feat: kernel signature

* clean: just kernel

* feat: from tinygrad

* feat: test

* fix: add back utils

* clean: no diff

* clean: no diff
2026-06-09 15:20:05 -07:00
chenyu
3f053a3370
move functional part of rand to RandMixin (#16551) 2026-06-09 09:40:48 -04:00
qazal
598cc13ad2
more readable null graph profile in VIZ (#16548)
* more readable null graph profile in VIZ

* change

* fix flaky test
2026-06-09 18:35:05 +09:00
qazal
d18ad49f20
fix flaky test_disktensor (#16549) 2026-06-09 18:23:22 +09:00
qazal
fa400f9790
less E kernels in all2all (#16546) 2026-06-09 13:51:57 +09:00
qazal
b8931440ae
add all2all schedule test (#16545) 2026-06-09 12:41:35 +09:00
Christopher Milan
4e2e2e9956
ocelot: use c.DLL (#16540) 2026-06-08 21:27:28 -04:00
chenyu
11fee53527
RandMixin [PR] (#16543) 2026-06-08 19:11:28 -04:00
chenyu
12764161c9
UOp.shard support axis=None [PR] (#16538)
match Tensor
2026-06-08 11:36:50 -04:00
chenyu
ebc5390c9a
advance indexing to mixin [PR] (#16532) 2026-06-08 09:24:49 -04:00
George Hotz
0336ba8eb1
buffer param arg + dsp fixups (#16528) 2026-06-07 12:07:00 -07:00
chenyu
90b556ca48
move gradient to mixin [PR] (#16526) 2026-06-07 00:05:02 -04:00
chenyu
4e7c6260b0
clean up test_tesnor_uop_mixin (#16525)
most of those don't have UNIQUE anymore
2026-06-06 23:25:44 -04:00
George Hotz
2a2f81dd3d
remove ANON from addrspace, refactor marg (#16523)
* remove ANON from addrspace, refactor marg

* as_shape

* as_shape is cached
2026-06-06 09:49:09 -07:00
Christopher Milan
8c0ba1da5c
cleanup more from test/backend (#16521) 2026-06-05 18:38:46 -04:00
Christopher Milan
9b0f75622c
many jit tests belong in unit (#16508) 2026-06-04 21:36:53 -04:00
chenyu
bb407d8b3c
fix transform_precompiled_call for MULTI (#16510)
based on my understanding for https://github.com/tinygrad/tinygrad/pull/16084
2026-06-04 20:09:58 -04:00
chenyu
4a8bf07a87
remove CONST(DEVICE) (#16506) 2026-06-04 11:29:46 -04:00
chenyu
5fad87252d
no device= into arange and eye (#16503) 2026-06-04 09:21:50 -04:00
wozeparrot
fd13080636
deviceless const skip axis check (#16496) 2026-06-03 19:13:20 -07:00
chenyu
2f83d01ccf
fix deviceless materialize device (#16493)
symbolic arange currently does not fuse, which creates a deviceless UOp post rangeify that needs a device to bufferize
2026-06-03 19:13:21 -04:00
chenyu
19eb72ff60
remove use of full with buffer=False and non-None device= (#16489) 2026-06-03 16:21:24 -04:00
chenyu
243446b44f
remove CONST(DEVICE) from const_like (#16488) 2026-06-03 14:04:51 -04:00
chenyu
8a4203638a
make full with buffer=False deviceless (#16483)
affects arange and eye
2026-06-03 12:35:59 -04:00
George Hotz
ffadd7a315
remove intel and amx support (#16482) 2026-06-02 18:53:05 -07:00
Christopher Milan
80eeb4dd21
mockgpu: use autogen.libc (#16479) 2026-06-02 19:59:36 -04:00
chenyu
a43b55d480
deviceless const folding schedule test (#16477) 2026-06-02 18:46:30 -04:00
chenyu
6b7d2b91df
update test_uop_graph (#16470)
use UOp methods instead of constructing UOp directly, some of it violated spec
2026-06-02 08:53:54 -04:00
George Hotz
20242fdf1d
update test + spec from shrink_in_render (#16467)
* update test + spec from shrink_in_render

* cast
2026-06-01 19:24:43 -07:00
Christopher Milan
2d0f132a3b
ci: cleanup more duplicate tests (#16462) 2026-06-01 18:56:29 -04:00
George Hotz
124d2f8227
anon addrspace from new renderer (#16461)
* anon addrspace from new renderer

* use max_numel in python renderer

* add sizes to ptrs in tests

* more

* correct fix
2026-06-01 14:42:02 -07:00
chenyu
7e7b481ba7
less CONST(DEVICE) (#16452)
* less CONST(DEVICE)

no DEVICE for single device in const_like, multi has other issues

* maybe

* that?
2026-06-01 15:55:12 -04:00
George Hotz
556defa0f7
minor updates from vec removal (#16456) 2026-05-31 09:48:51 -07:00
Javier De Jesus
989f713c1b
support negative pads in circular pad mode (#16448) 2026-05-31 09:28:45 -07:00
nimlgen
2c2cb339e0
fix word wrap (#16450) 2026-05-30 23:21:24 +03:00
qazal
29b47a0057
llama: update local amax implementation after ParamArgs change (#16446)
* local amax failing test

* update _local_abs_max_fxn
2026-05-30 16:55:43 +09:00
Christopher Milan
434cfa96a3
ci: no fetch in backend tests (#16438)
should make for less actions cache thrashing
2026-05-29 17:11:16 -04:00
chenyu
b7280705a7
limit CONST(UNIQUE) to invalids only (#16432) 2026-05-29 16:02:06 -04:00
qazal
81bc00c006
do not require clearing method_cache in viz tests (#16428)
* update

* update test_dedup
2026-05-29 18:12:34 +09:00
qazal
54cfb794b8
viz: addrspace little colored box (#16427)
* return addrspace

* layout

* render

* addrspace encodes color

* update colors

* in input_ast all are params are green

* update stroke
2026-05-29 17:25:07 +09:00
Christopher Milan
69aa2054f6
rename clangjit to clang (#16423) 2026-05-28 22:41:58 -04:00
George Hotz
1e7f1dcf49
add ParamArgs [pr] (#16421)
* add ParamArgs

* fix export

* cleanups

* fixes

* simpler
2026-05-28 19:17:17 -07:00