tinygrad/extra
George Hotz e0201922e3
Q network for pruning BEAM / uops deduping / BEAM_ESTIMATE (#2142)
* stable diffusion < 324ms

* revert swap action

* fix tests due to more sum splitting

* REDUCEOP_SPLIT_THRESHOLD env var

* added from unaligned np test (#2134)

* align cpu buffer before copy into cl buffer (#2135)

* remove shelve from handcode_resnet50_opt.py (#2139)

* Add dictionary keys to reduce db size (#2131)

* work

* ignore beam cache

* dictionary keys are generic

* minor db cleanups

* fix baseline and extract dataset

* fix training

* log likelihood

* more lin to feats

* sts

* training policynet

* net sort of works

* dedup

* refactor, stupid new actions

* fix uops deduping

* BEAM_ESTIMATE

---------

Co-authored-by: chenyu <chenyu@fastmail.com>
Co-authored-by: imaolo <56898718+imaolo@users.noreply.github.com>
2023-10-27 10:53:06 -10:00
..
accel Make Triton work again (#1547) 2023-09-23 14:17:12 +08:00
assembly disable assembly (#1755) 2023-09-04 09:41:20 -07:00
datasets add world dataset (#2045) 2023-10-11 15:54:30 -07:00
disk Fast DiskTensor to other Tensor (#916) 2023-06-03 12:25:41 -07:00
dist hip multigpu training (#1878) 2023-10-24 17:35:53 -04:00
gemm openpilot compile2 (#1977) 2023-10-15 20:39:46 -07:00
intel Revert "update editorconfig, enforce via CI (#1343)" (#1380) 2023-07-31 10:35:50 -07:00
optimization Q network for pruning BEAM / uops deduping / BEAM_ESTIMATE (#2142) 2023-10-27 10:53:06 -10:00
ptx ptx example 2023-05-26 19:28:51 -07:00
rocm fast HIP gemm -> 100 TFLOPS (#1476) 2023-08-09 06:54:15 -07:00
triton Disable ocelot cache + fix matvec in triton (#2010) 2023-10-17 10:33:32 -07:00
archprobe.py no werror in archprobe 2023-05-03 19:34:17 +00:00
augment.py [ready] Replacing os with pathlib (#1708) 2023-08-30 10:41:08 -07:00
dump_cache.py Add dictionary keys to reduce db size (#2131) 2023-10-24 10:49:22 -04:00
export_model.py Fix output Float32Array size in webgpu export (#2096) 2023-10-17 15:28:19 -07:00
gradcheck.py Fix: Jacobian tests [WIP] (#1126) 2023-07-05 15:36:22 -07:00
helpers.py helpers.py: improved test coverage + exception handling (#1165) 2023-07-07 10:26:05 -07:00
hip_wrapper.py hip multigpu training (#1878) 2023-10-24 17:35:53 -04:00
introspection.py Devicebufferless (#708) 2023-03-18 14:40:23 -07:00
kernel_search.py no process() in Linearizer (#1966) 2023-10-04 07:18:42 -07:00
lib_test_ast.py fix ast test 2023-03-04 07:49:25 -08:00
lr_scheduler.py Add pylint trailing whitespace rule (#1314) 2023-07-21 13:37:55 -04:00
onnx.py Add ONNX to CI for other backends (#2069) 2023-10-17 09:33:54 -07:00
onnx_ops.py Add ONNX to CI for other backends (#2069) 2023-10-17 09:33:54 -07:00
thneed.py openpilot compile2 (#1977) 2023-10-15 20:39:46 -07:00
training.py with Tensor.train() (#1935) 2023-09-28 18:02:31 -07:00
utils.py fix: remove runtime imports (#1982) 2023-10-07 05:23:08 -07:00