tinygrad/tinygrad
Alisher Zhubanyshev 4ef6b46b34
hcq: reduce launch overhead (#11193)
* nv: improve mmio creation speed

* add memoryview test

* fix indents

* move mv bench to `test_helpers`, remove comparison
2025-07-13 19:25:50 +03:00
apps add a test for 1B llm (#11124) 2025-07-07 18:47:25 -07:00
codegen axis types in KernelInfo (#11196) 2025-07-12 09:36:20 -07:00
engine move cpu_profile and shared ProfileEvents from device.py to helpers [pr] (#11126) 2025-07-08 12:14:03 +03:00
frontend OnnxRunner file as input (#10789) 2025-07-12 14:27:46 -04:00
kernelize [pr] Unify CONTIGUOUS and GBARRIER (#11121) 2025-07-08 10:27:23 -07:00
nn outerworld range test [pr] (#11059) 2025-07-02 14:28:44 -07:00
opt update float4 condition in hcopt (#11211) 2025-07-13 09:51:45 -04:00
renderer tc should be in opt (#11148) 2025-07-09 14:12:21 -07:00
runtime hcq: reduce launch overhead (#11193) 2025-07-13 19:25:50 +03:00
shape Fix division by zero and mask bug in add views (#11088) 2025-07-07 10:05:47 -07:00
uop lil cleanups from uop branch [pr] (#11197) 2025-07-12 09:46:28 -07:00
viz viz: early convert to cpu time (#11192) 2025-07-12 17:19:41 +03:00
__init__.py move files into uop dir (#10399) 2025-05-18 11:38:28 -07:00
device.py DMARef infra (#10753) 2025-07-11 14:09:47 -07:00
dtype.py Add DType.min and DType.max properties (#10749) 2025-06-10 08:31:34 -07:00
gradient.py more robust reduce_gradient (#10965) 2025-06-24 14:09:33 -07:00
helpers.py hcq: reduce launch overhead (#11193) 2025-07-13 19:25:50 +03:00
py.typed add a single py.typed (#6083) 2024-08-14 17:31:46 -07:00
tensor.py clean: keccak cleanups + explicit shapes (#11202) 2025-07-12 18:17:14 -07:00