tinygrad/examples
Kunwar Raj Singh 8391648822
Over 90% on CIFAR with examples/hlb_cifar10.py (#1073)
* fix eval, lr decay, best eval

* 82.27

* 82.64

* 82.79, reproducible

* add lr sched, 85.26

* 87.42

* 87.94

* 87.42

* tta with flip

* training flip aug

* refactor

* using Tensor for LR is faster

* 89.5

* refactor, flip only train set

* 90.01

* 90.64

* eval jit

* refactor

* only JIT model

* fix eval JIT

* fix eval JIT

* 90.82

* STEPS=900 reaches 90.22

* TTA envvar

* TTA default 0

* fully jit training

* refactor optim

* fix sched

* add label smoothing

* param changes

* partial gelu

* OneCycle with pause

* gelu maybe works

* 90.12

* remove pause lr

* maybe fix lr schedulers

* scheduler test passing

* comments

* try mixup

* shuffle!

* add back the missing last eval

* fix shuffle bugs

* add mixup prob

* fix mixup prob

* 90.19

* correct mixup

* correct mixup

* correct mixup

* 90.24

* 90.33

* refactor, add type hints

* add gradient clipping

* maybe fix test

* full JIT

* back to relu for now

* pass mixup prob as param

* add typehints

* maybe CI works

* try erf gelu

* CI, types

* remove useless import

* refactor optim

* refactor optim

* try leakyrelu

* try celu

* gelu

* 90.67

* remove grad clip

* remove grad clip tests

* revert params

* add test for OneCycleLR

* 90.62

* fix eval timing

* fix eval timing again

* so where i calculate mixup_prob matters

---------

Co-authored-by: Kunwar Raj Singh <kunwar31@pop-os.localdomain>
2023-07-06 20:46:22 -07:00
mlperf imagenet eval in 1 min 28 sec 2023-06-28 04:23:26 +00:00
vgg7_helpers Renamed examples/yolo to examples/vgg7_helpers because that directory contains no yolo-related code and only helper code for vgg7. This was confusing to a new user when trying to understand the examples. (#1086) 2023-07-01 12:04:28 -07:00
__init__.py failing llama test 2023-03-11 16:28:10 -08:00
benchmark_train_efficientnet.py Refactor nn.optim (#1091) 2023-07-02 15:07:30 -07:00
compile_efficientnet.py Fix input path in examples/compile_efficientnet.py, examples/efficientnet.py. (#1034) 2023-06-23 16:34:33 -07:00
compile_tensorflow.py moved extras/jit.py -> tinygrad/jit.py (#599) 2023-02-25 08:32:33 -08:00
deep_deterministic_policy_gradient.py Refactor nn.optim (#1091) 2023-07-02 15:07:30 -07:00
efficientnet.py Fix input path in examples/compile_efficientnet.py, examples/efficientnet.py. (#1034) 2023-06-23 16:34:33 -07:00
hlb_cifar10.py Over 90% on CIFAR with examples/hlb_cifar10.py (#1073) 2023-07-06 20:46:22 -07:00
hlb_cifar10_torch.py nn init matches torch (#901) 2023-06-01 21:24:11 -07:00
llama.py no need to slice A and B twice in LLaMa complex multiplication (#1054) 2023-06-26 14:42:58 -07:00
mask_rcnn.py MaskRCNN Inference (#884) 2023-06-25 15:37:51 -07:00
mnist_gan.py Refactor nn.optim (#1091) 2023-07-02 15:07:30 -07:00
serious_mnist.py Refactor nn.optim (#1091) 2023-07-02 15:07:30 -07:00
simple_conv_bn.py examples: simple conv bn 2023-07-04 13:50:26 -07:00
stable_diffusion.py Adds tril & triu support (#936) 2023-06-09 22:13:20 -07:00
train_efficientnet.py Refactor nn.optim (#1091) 2023-07-02 15:07:30 -07:00
train_resnet.py Refactor nn.optim (#1091) 2023-07-02 15:07:30 -07:00
transformer.py fix imports for examples/transformer.py (#1136) 2023-07-05 08:15:13 -07:00
vgg7.py Renamed examples/yolo to examples/vgg7_helpers because that directory contains no yolo-related code and only helper code for vgg7. This was confusing to a new user when trying to understand the examples. (#1086) 2023-07-01 12:04:28 -07:00
vit.py Remove Tensor.data (#565) 2023-02-18 16:36:12 -08:00
whisper.py there is a better way to do that! (#950) 2023-06-06 15:23:30 -07:00
yolov3.py Permute examples (#731) 2023-03-29 05:07:06 +04:00
yolov8-onnx.py Add yolov8 implementation (#806) 2023-06-16 18:55:19 -07:00
yolov8.py "Fixed argument passing in example yolov8" (#1004) 2023-06-18 14:29:39 -07:00