tinygrad/examples/mlperf
David Hou 0afaf70d57
lars optimizer + tests (#3631)
* lars optimizer + tests

* fix skip list!

* use id to compare in skip list

* go back to using set

* Tensor(bool) * Tensor(bool) is and

* don't lint external/mlperf_resnet

* whitespace

* add external_test_optim to opencl tests

* give mlperf task a name

* mlperf under onnx

* remove track_gnorm

* contiguous instead of realize

* assert momentum and weight decay positive

---------

Co-authored-by: chenyu <chenyu@fastmail.com>
2024-03-06 18:11:01 -05:00
..
dataloader.py use hip events (#3157) 2024-01-17 10:39:57 -08:00
helpers.py Add MLPerf UNet3D model (#775) 2023-05-28 20:38:19 -07:00
metrics.py Add MLPerf UNet3D model (#775) 2023-05-28 20:38:19 -07:00
model_eval.py move graph.py and jit.py into features (#3376) 2024-02-12 17:34:34 +01:00
model_spec.py move globalcounters to ops (#2960) 2024-01-01 14:21:02 -08:00
model_train.py with Tensor.train() (#1935) 2023-09-28 18:02:31 -07:00
optimizers.py lars optimizer + tests (#3631) 2024-03-06 18:11:01 -05:00
README start on mlperf models 2023-05-10 16:30:49 -07:00

Each model should be a clean single file.
They are imported from the top level `models` directory

It should be capable of loading weights from the reference imp.

We will focus on these 5 models:

# Resnet50-v1.5 (classic) -- 8.2 GOPS/input
# Retinanet
# 3D UNET (upconvs)
# RNNT
# BERT-large (transformer)

They are used in both the training and inference benchmark:
https://mlcommons.org/en/training-normal-21/
https://mlcommons.org/en/inference-edge-30/
And we will submit to both.

NOTE: we are Edge since we don't have ECC RAM