tinygrad/tinygrad
Francis Lam 6d5dec2fef
log optimized kernels and a script to compare with non-optimized ones (#3829)
* search: add BEAM_VERIFY option to validate search results

refactor fuzz_linearizer comparison to allow it to be used in for
BEAM_VERIFY in device.py

* search: fix to verify the beam_search result and not the fastest

* search: fix typing and clean up

* device: remove imports from test and add LOGKERN options

LOGKERN output can be used with test/external/verify_kernel.py
to validate correctness

* fix example in verify_kernel.py

* cleanup fixes

* fix to use f-strings
2024-03-20 19:22:08 -04:00
..
codegen use int32 instead of default_int in simplify_phi_loops (#3828) 2024-03-19 17:49:58 -04:00
features Revert "Ring allreduce in multitensor (#3000)" (#3840) 2024-03-20 11:41:49 -07:00
nn hotfix: this makes beautiful mnist work again, not okay 2024-03-18 18:22:44 -07:00
renderer fix broken loops in llvm (#3751) 2024-03-15 11:57:51 +03:00
runtime clean up hsa driver (#3818) 2024-03-20 00:17:41 +03:00
shape factor out partial in SumNode div int (#3841) 2024-03-20 16:34:33 -04:00
__init__.py move graph.py and jit.py into features (#3376) 2024-02-12 17:34:34 +01:00
device.py log optimized kernels and a script to compare with non-optimized ones (#3829) 2024-03-20 19:22:08 -04:00
dtype.py Rename .sz to .count on DType (#3413) 2024-02-15 15:03:49 +01:00
helpers.py threefry again (#3785) 2024-03-18 16:47:07 -04:00
lazy.py Revert "Ring allreduce in multitensor (#3000)" (#3840) 2024-03-20 11:41:49 -07:00
mlops.py reciprocal mlop (#3694) 2024-03-11 16:08:46 -07:00
ops.py simple LoadOps.ASSIGN (#3745) 2024-03-14 20:44:34 -07:00
realize.py conceptual small refactor (#3842) 2024-03-20 16:46:14 -04:00
tensor.py use dtypes.int32 as return dtype for functions that return indices (#3827) 2024-03-19 17:06:57 -04:00