mirrors/tinygrad: You like pytorch? You like micrograd? You love tinygrad! ❤️

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-06-24 02:14:17 +00:00

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 73.1%
C 14.2%
Cuda 8%
C++ 2.2%
Metal 1.7%
Other 0.6%

Find a file

Kunwar Raj Singh 8391648822 Over 90% on CIFAR with examples/hlb_cifar10.py (#1073 ) * fix eval, lr decay, best eval * 82.27 * 82.64 * 82.79, reproducable * add lr sched, 85.26 * 87.42 * 87.94 * 87.42 * tta with flip * training flip aug * refactor * using Tensor for LR is faster * 89.5 * refactor, flip only train set * 90.01 * 90.64 * eval jit * refactor * only JIT model * fix eval JIT * fix eval JIT * 90.82 * STEPS=900 reaches 90.22 * TTA envvar * TTA default 0 * fully jit training * refactor optim * fix sched * add label smoothing * param changes * patial gelu * OneCycle with pause * gelu maybe works * 90.12 * remove pause lr * maybe fix lr schedulers * scheduler test passing * comments * try mixup * shuffle! * add back the missing last eval * fix shuffle bugs * add mixup prob * fix mixup prob * 90.19 * correct mixup * correct mixup * correct mixup * 90.24 * 90.33 * refactor, add type hints * add gradient clipping * maybe fix test * full JIT * back to relu for now * pass mixup prob as param * add typehints * maybe CI works * try erf gelu * CI, types * remove useless import/ * refactor optim * refactor optim * try leakyrelu * try celu * gelu * 90.67 * remove grad clip * remove grad clip tests * revert params * add test for OneCycleLR * 90.62 * fix eval timing * fix eval timing again * so where i calculate mixup_prob matters --------- Co-authored-by: Kunwar Raj Singh <kunwar31@pop-os.localdomain>		2023-07-06 20:46:22 -07:00
.github/workflows	Quickstart (#1015 )	2023-06-29 13:26:58 -07:00
cache	add ff_dim to transformer	2021-11-29 12:40:52 -05:00
datasets	Make cross_process use cloudpickle (#1118 )	2023-07-04 00:47:34 -07:00
disassemblers/adreno	fix path linter issue	2023-04-18 19:17:41 -07:00
docs	Remove POW llop and add SQRT llop (#1104 )	2023-07-05 18:07:58 -07:00
examples	Over 90% on CIFAR with examples/hlb_cifar10.py (#1073 )	2023-07-06 20:46:22 -07:00
extra	Over 90% on CIFAR with examples/hlb_cifar10.py (#1073 )	2023-07-06 20:46:22 -07:00
models	Use generators instead of lists in `any`s and `all`s (#1111 )	2023-07-03 16:06:06 -07:00
openpilot	global -> group (#1007 )	2023-06-21 11:50:43 -07:00
test	Over 90% on CIFAR with examples/hlb_cifar10.py (#1073 )	2023-07-06 20:46:22 -07:00
tinygrad	Over 90% on CIFAR with examples/hlb_cifar10.py (#1073 )	2023-07-06 20:46:22 -07:00
weights	cleanup clip tokenizer	2022-09-12 09:20:12 -07:00
.editorconfig	Basic editorconfig support (#422 )	2022-11-08 10:34:25 -08:00
.gitignore	imagenet loader minor cleanups	2023-06-28 05:08:09 +00:00
.pre-commit-config.yaml	fix mypy	2023-05-13 21:25:36 -07:00
.pylintrc	Use generators instead of lists in `any`s and `all`s (#1111 )	2023-07-03 16:06:06 -07:00
.tokeignore	Add a quick start guide (#900 )	2023-06-04 08:51:20 -07:00
compile.sh	stop wasting time with the compiler. tinygrad needs to just jit	2023-03-12 12:08:46 -07:00
CONTRIBUTING.md	feat: reword contributing (#1131 )	2023-07-04 22:17:47 -07:00
LICENSE	Updated LICENSE year (#760 )	2023-05-01 15:35:23 -07:00
push_pypi.sh	push pypi	2020-10-27 08:13:15 -07:00
README.md	feat: fix shell alias on readme (#1022 )	2023-06-23 00:00:34 -07:00
rmso.sh	compile works (#688 )	2023-03-12 11:01:25 -07:00
run_multibackend.sh	dtypes nice and clean (#673 )	2023-03-10 16:56:07 -08:00
setup.py	Make cross_process use cloudpickle (#1118 )	2023-07-04 00:47:34 -07:00
strip_whitespace.sh	strip whitespace	2023-06-27 10:11:43 -07:00
sz.py	move line counter to python	2023-05-29 09:21:40 -07:00

README.md

tinygrad: For something between PyTorch and karpathy/micrograd. Maintained by tiny corp.

Homepage | Documentation | Examples | Showcase | Discord

This may not be the best deep learning framework, but it is a deep learning framework.

Due to its extreme simplicity, it aims to be the easiest framework to add new accelerators to, with support for both inference and training. If XLA is CISC, tinygrad is RISC.

tinygrad is still alpha software, but we raised some money to make it good. Someday, we will tape out chips.

Features

LLaMA and Stable Diffusion

tinygrad can run LLaMA and Stable Diffusion!

Laziness

Try a matmul. See how, despite the style, it is fused into one kernel with the power of laziness.

DEBUG=3 python3 -c "from tinygrad.tensor import Tensor;
N = 1024; a, b = Tensor.rand(N, N), Tensor.rand(N, N);
c = (a.reshape(N, 1, N) * b.permute(1,0).reshape(1, N, N)).sum(axis=2);
print((c.numpy() - (a.numpy() @ b.numpy())).mean())"

And we can change DEBUG to 4 to see the generated code.

Neural networks

As it turns out, 90% of what you need for neural networks are a decent autograd/tensor library. Throw in an optimizer, a data loader, and some compute, and you have all you need.

Neural network example (from test/models/test_mnist.py)

from tinygrad.tensor import Tensor
import tinygrad.nn.optim as optim

class TinyBobNet:
  def __init__(self):
    self.l1 = Tensor.uniform(784, 128)
    self.l2 = Tensor.uniform(128, 10)

  def forward(self, x):
    return x.dot(self.l1).relu().dot(self.l2).log_softmax()

model = TinyBobNet()
optim = optim.SGD([model.l1, model.l2], lr=0.001)

# ... complete data loader here

out = model.forward(x)
loss = out.mul(y).mean()
optim.zero_grad()
loss.backward()
optim.step()

Accelerators

tinygrad already supports numerous accelerators, including:

CPU
GPU (OpenCL)
C Code (Clang)
LLVM
METAL
CUDA
Triton
PyTorch

And it is easy to add more! Your accelerator of choice only needs to support a total of 26 (optionally 27) low level ops. More information can be found in the documentation for adding new accelerators.

Installation

The current recommended way to install tinygrad is from source.

From source

git clone https://github.com/geohot/tinygrad.git
cd tinygrad
python3 -m pip install -e .

Don't forget the . at the end!

Documentation

Documentation along with a quick start guide can be found in the docs/ directory.

Quick example comparing to PyTorch

from tinygrad.tensor import Tensor

x = Tensor.eye(3, requires_grad=True)
y = Tensor([[2.0,0,-2.0]], requires_grad=True)
z = y.matmul(x).sum()
z.backward()

print(x.grad.numpy())  # dz/dx
print(y.grad.numpy())  # dz/dy

The same thing but in PyTorch:

import torch

x = torch.eye(3, requires_grad=True)
y = torch.tensor([[2.0,0,-2.0]], requires_grad=True)
z = y.matmul(x).sum()
z.backward()

print(x.grad.numpy())  # dz/dx
print(y.grad.numpy())  # dz/dy

Contributing

There has been a lot of interest in tinygrad lately. Here are some basic guidelines for contributing:

Bug fixes are the best and always welcome! Like this one.
If you don't understand the code you are changing, don't change it!
All code golf PRs will be closed, but conceptual cleanups are great.
Features are welcome. Though if you are adding a feature, you need to include tests.
Improving test coverage is great, with reliable non-brittle tests.

Additional guidelines can be found in CONTRIBUTING.md.

Running tests

For more examples on how to run the full test suite please refer to the CI workflow.

Some examples:

python3 -m pip install -e '.[testing]'
python3 -m pytest
python3 -m pytest -v -k TestTrain
python3 ./test/models/test_train.py TestTrain.test_efficientnet