tinygrad/extra
Yixiang Gao 6480a1a180
CIFAR 94.03% (#1340)
* add disk_tensor

* fix jit

* new baseline before whitening

* whitening through torch

* whiting done currently at 91.65%

* 91.99%

* clean up mixup and 92.3%

* clean up 92.30%

* 92.49% before searching for new hyper-parameters

* fix CI

* fix white space

* add whitening init in test

* refactor, update hyperpara, 92.72%

* converting whiting to tinygrad operation

* update CI kernels count for CIFAR

* add pad reflect

* add random crop 92.53%

* update hyperpara 93%

* 93.15% on docker container, need to refactor the assignment for hyper param

* print out weights and bias to be separated

* bias/non-bias params separated

* fix whitespace

* clean up

* refactor hyper-param with dict

* refactor lr schedular params

* fix whitespace

* fix cross entropy loss

* fix whitespace

* move opt hyp to hyp dict

* minor fixup

* adjust model, loss scaling

* 92.74% while using half of compute as before

* update hyp for cutmix

* random shuffle during batches

* clean up

* updating the model

* update ConvGroup

* disable gradients for batchnorm layer weights

* whitespace

* 93.92%

* clean up

* finally 94%git add .!

* rewrite whitening to remove dependency on torch

* whitespace

* remove dependency on torch, 93.91%

* back to 94.03%

* clean up

* update test_real_world
2023-08-08 15:13:24 -07:00
..
accel Revert "update editorconfig, enforce via CI (#1343)" (#1380) 2023-07-31 10:35:50 -07:00
assembly just cmplt (#1493) 2023-08-08 13:58:10 -07:00
datasets CIFAR 94.03% (#1340) 2023-08-08 15:13:24 -07:00
disk Fast DiskTensor to other Tensor (#916) 2023-06-03 12:25:41 -07:00
gemm syncthreads before wmma (#1389) 2023-07-31 17:05:49 -07:00
intel Revert "update editorconfig, enforce via CI (#1343)" (#1380) 2023-07-31 10:35:50 -07:00
ptx ptx example 2023-05-26 19:28:51 -07:00
rocm faster RDNA assembly backend (#990) 2023-06-16 12:06:38 -07:00
triton A Triton backend for tinygrad (#470) 2023-02-01 11:53:57 -08:00
archprobe.py no werror in archprobe 2023-05-03 19:34:17 +00:00
augment.py Fix naming conflict with huggingface datasets (#1161) 2023-07-07 10:43:44 -07:00
export_model.py simple exporting models (#1344) 2023-08-01 09:35:48 -07:00
gradcheck.py Fix: Jacobian tests [WIP] (#1126) 2023-07-05 15:36:22 -07:00
helpers.py helpers.py: improved test coverage + exception handling (#1165) 2023-07-07 10:26:05 -07:00
hip_wrapper.py hip matmul 2023-06-28 19:21:01 +00:00
introspection.py Devicebufferless (#708) 2023-03-18 14:40:23 -07:00
kernel_search.py move to shapetracker.py 2023-03-11 07:50:07 -08:00
lib_test_ast.py fix ast test 2023-03-04 07:49:25 -08:00
lr_scheduler.py Add pylint trailing whitespace rule (#1314) 2023-07-21 13:37:55 -04:00
onnx.py Add pylint trailing whitespace rule (#1314) 2023-07-21 13:37:55 -04:00
onnx_ops.py fix: is not use with a literal (#1487) 2023-08-08 07:35:30 -07:00
thneed.py cl fixes for multigpu (#1276) 2023-07-18 19:59:30 -07:00
training.py Add pylint trailing whitespace rule (#1314) 2023-07-21 13:37:55 -04:00
utils.py change fetch to allow for local file selection (#1309) 2023-07-23 15:00:16 -04:00