| Name | Last commit | Date |
| --- | --- | --- |
| apps | add a test for 1B llm (#11124) | 2025-07-07 18:47:25 -07:00 |
| codegen | axis types in KernelInfo (#11196) | 2025-07-12 09:36:20 -07:00 |
| engine | move cpu_profile and shared ProfileEvents from device.py to helpers [pr] (#11126) | 2025-07-08 12:14:03 +03:00 |
| frontend | OnnxRunner file as input (#10789) | 2025-07-12 14:27:46 -04:00 |
| kernelize | [pr] Unify CONTIGUOUS and GBARRIER (#11121) | 2025-07-08 10:27:23 -07:00 |
| nn | outerworld range test [pr] (#11059) | 2025-07-02 14:28:44 -07:00 |
| opt | update float4 condition in hcopt (#11211) | 2025-07-13 09:51:45 -04:00 |
| renderer | tc should be in opt (#11148) | 2025-07-09 14:12:21 -07:00 |
| runtime | hcq: reduce launch overhead (#11193) | 2025-07-13 19:25:50 +03:00 |
| shape | Fix division by zero and mask bug in add views (#11088) | 2025-07-07 10:05:47 -07:00 |
| uop | lil cleanups from uop branch [pr] (#11197) | 2025-07-12 09:46:28 -07:00 |
| viz | viz: early convert to cpu time (#11192) | 2025-07-12 17:19:41 +03:00 |
| __init__.py | move files into uop dir (#10399) | 2025-05-18 11:38:28 -07:00 |
| device.py | DMARef infra (#10753) | 2025-07-11 14:09:47 -07:00 |
| dtype.py | Add DType.min and DType.max properties (#10749) | 2025-06-10 08:31:34 -07:00 |
| gradient.py | more robust reduce_gradient (#10965) | 2025-06-24 14:09:33 -07:00 |
| helpers.py | hcq: reduce launch overhead (#11193) | 2025-07-13 19:25:50 +03:00 |
| py.typed | add a single py.typed (#6083) | 2024-08-14 17:31:46 -07:00 |
| tensor.py | clean: keccak cleanups + explicit shapes (#11202) | 2025-07-12 18:17:14 -07:00 |