mirrors/tinygrad: You like pytorch? You like micrograd? You love tinygrad! ❤️

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-06-24 02:14:17 +00:00

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 73.1%
C 14.2%
Cuda 8%
C++ 2.2%
Metal 1.7%
Other 0.6%

Find a file

Joqsan ef129bcb85 Zero dim Tensor support (#777 ) * add and reorganize test_slice_* tests * refactor Tensor.__getitem__() * preliminary tests for 1) 0D tensors and 2) varargs for Tensor.zeros and Tensor.ones * always compare shapes of the numpy arrays obtained from tinygrad and torch tensors * add more tests for 0D support * remove test_tensor.test_slicing(). All slicing tests at test/test_ops.py * add zero-dim support * make test_end2end.py consistent with 0dim support * add test for tensor with zero in shape * don't simplify ones if shape is () * skip tests that need zero-size tensor support. - zero-size tensor support not related to 0dim tensors. * add tests for __getitem__() supporting strides >= 1 * refactor __getitem__: support for strides >= 1 * minor refactors and add comments to __getitem__ * add tests for slices with negative steps * add support for slices with negative strides		2023-06-01 11:32:02 -07:00
.github/workflows	make tests faster + add onnx (#815 )	2023-05-27 08:53:32 -07:00
accel	move to shapetracker.py	2023-03-11 07:50:07 -08:00
cache	add ff_dim to transformer	2021-11-29 12:40:52 -05:00
datasets	int8/uint8 support (#837 )	2023-05-28 23:15:06 -07:00
disassemblers/adreno	fix path linter issue	2023-04-18 19:17:41 -07:00
docs	add changeable DEBUG (#816 )	2023-05-27 13:28:25 -07:00
examples	feat: add train scaffolding (#859 )	2023-05-30 07:10:40 -07:00
extra	add output_padding to transposed conv (#875 )	2023-06-01 00:03:22 -07:00
models	default transformer dropout to 0 (#828 )	2023-05-29 08:06:16 -07:00
openpilot	jit: TODO, use abstractions	2023-05-05 22:51:30 -07:00
test	Zero dim Tensor support (#777 )	2023-06-01 11:32:02 -07:00
tinygrad	Zero dim Tensor support (#777 )	2023-06-01 11:32:02 -07:00
weights	cleanup clip tokenizer	2022-09-12 09:20:12 -07:00
.editorconfig	Basic editorconfig support (#422 )	2022-11-08 10:34:25 -08:00
.gitignore	Onnx ops And, Or, Xor, Not (#847 )	2023-05-29 11:09:20 -07:00
.pre-commit-config.yaml	fix mypy	2023-05-13 21:25:36 -07:00
.pylintrc	Devicebufferless (#708 )	2023-03-18 14:40:23 -07:00
compile.sh	stop wasting time with the compiler. tinygrad needs to just jit	2023-03-12 12:08:46 -07:00
LICENSE	Updated LICENSE year (#760 )	2023-05-01 15:35:23 -07:00
push_pypi.sh	push pypi	2020-10-27 08:13:15 -07:00
README.md	remove other install method	2023-05-28 08:36:21 -07:00
rmso.sh	compile works (#688 )	2023-03-12 11:01:25 -07:00
run_multibackend.sh	dtypes nice and clean (#673 )	2023-03-10 16:56:07 -08:00
setup.py	make tests faster + add onnx (#815 )	2023-05-27 08:53:32 -07:00
sz.py	move line counter to python	2023-05-29 09:21:40 -07:00

README.md

For something in between a pytorch and a karpathy/micrograd

This may not be the best deep learning framework, but it is a deep learning framework.

The sub 1000 line core of it is in tinygrad/

Due to its extreme simplicity, it aims to be the easiest framework to add new accelerators to, with support for both inference and training. Support the simple basic ops, and you get SOTA vision models/efficientnet.py and language models/transformer.py models.

We are working on support for the Apple Neural Engine and the Google TPU in the accel/ folder. Eventually, we will build custom hardware for tinygrad, and it will be blindingly fast. Now, it is slow.

This project is maintained by tiny corp.

Installation

git clone https://github.com/geohot/tinygrad.git
cd tinygrad
python3 -m pip install -e .

Contributing

There's a lot of interest in tinygrad lately. Here's some guidelines for contributing:

Bugfixes are the best and always welcome! Like this one.
If you don't understand the code you are changing, don't change it!
All code golf PRs will be closed, but conceptual cleanups are great.
Features are welcome. Though if you are adding a feature, you need to include tests.
Improving test coverage is great, with reliable non brittle tests.

Example

from tinygrad.tensor import Tensor

x = Tensor.eye(3, requires_grad=True)
y = Tensor([[2.0,0,-2.0]], requires_grad=True)
z = y.matmul(x).sum()
z.backward()

print(x.grad.numpy())  # dz/dx
print(y.grad.numpy())  # dz/dy

Same example in torch

import torch

x = torch.eye(3, requires_grad=True)
y = torch.tensor([[2.0,0,-2.0]], requires_grad=True)
z = y.matmul(x).sum()
z.backward()

print(x.grad)  # dz/dx
print(y.grad)  # dz/dy

Is tinygrad fast?

Try a matmul. See how, despite the style, it is fused into one kernel with the power of laziness.

DEBUG=3 OPTLOCAL=1 python3 -c "from tinygrad.tensor import Tensor;
N = 1024; a, b = Tensor.randn(N, N), Tensor.randn(N, N);
c = (a.reshape(N, 1, N) * b.permute(1,0).reshape(1, N, N)).sum(axis=2);
print((c.numpy() - (a.numpy() @ b.numpy())).mean())"

Change to DEBUG=4 to see the generated code.

Neural networks?

It turns out, a decent autograd tensor library is 90% of what you need for neural networks. Add an optimizer (SGD, Adam, AdamW implemented) from tinygrad.nn.optim, write some boilerplate minibatching code, and you have all you need.

Neural network example (from test/models/test_mnist.py)

from tinygrad.tensor import Tensor
import tinygrad.nn.optim as optim

class TinyBobNet:
  def __init__(self):
    self.l1 = Tensor.uniform(784, 128)
    self.l2 = Tensor.uniform(128, 10)

  def forward(self, x):
    return x.dot(self.l1).relu().dot(self.l2).log_softmax()

model = TinyBobNet()
optim = optim.SGD([model.l1, model.l2], lr=0.001)

# ... and complete like pytorch, with (x,y) data

out = model.forward(x)
loss = out.mul(y).mean()
optim.zero_grad()
loss.backward()
optim.step()

GPU and Accelerator Support

tinygrad supports GPUs through PyOpenCL.

from tinygrad.tensor import Tensor
(Tensor.ones(4,4).gpu() + Tensor.ones(4,4).gpu()).cpu()

hlops (in tensor.py)

hlops are syntactic sugar around mlops. They support most things torch does.

mlops

mlops are mid level ops. They understand derivatives. They are very simple.

Relu, Log, Exp, Sin                            # unary ops
Sum, Max                                       # reduce ops (with axis argument)
Maximum, Add, Sub, Mul, Pow, Div, Equal        # binary ops (no broadcasting, use expand)
Expand, Reshape, Permute, Pad, Shrink, Flip    # movement ops

You no longer need to write mlops for a new accelerator

Adding an accelerator (llops)

The autodiff stuff is all in mlops now so you can focus on the raw operations

Buffer                                                       # class of memory on this device
unary_op  (NOOP, EXP, LOG, CAST, SIN)                        # A -> A
reduce_op (SUM, MAX)                                         # A -> B (smaller size, B has 1 in shape)
binary_op (ADD, SUB, MUL, DIV, POW, CMPEQ, MAX)              # A + A -> A (all the same size)
movement_op (EXPAND, RESHAPE, PERMUTE, PAD, SHRINK, STRIDE)  # A -> B (different size)
fused_op [[optional]] (MULACC)                               # A * A -> B

ImageNet inference

Despite being tiny, tinygrad supports the full EfficientNet. Pass in a picture to discover what it is.

python3 examples/efficientnet.py https://media.istockphoto.com/photos/hen-picture-id831791190

Or, if you have a webcam and cv2 installed

python3 examples/efficientnet.py webcam

PROTIP: Set "DEBUG=2" environment variable if you want to see why it's slow.

tinygrad supports Stable Diffusion!

You might need to download the weight of Stable Diffusion and put it into weights/

Run python3 examples/stable_diffusion.py

"a horse sized cat eating a bagel"

tinygrad supports LLaMA

After putting the weights in weights/LLaMA, you can have a chat with Stacy. She lives inside tinygrad.

python3 examples/llama.py

tinygrad supports GANs

See examples/mnist_gan.py

tinygrad supports yolo

See examples/yolov3.py

Drawing Execution Graph

GRAPH=1 python3 test/models/test_mnist.py TestMNIST.test_sgd_onestep
# requires dot, outputs /tmp/net.svg

Running tests

For more examples on how to run the full test suite please refer to the CI workflow.

python3 -m pip install -e '.[testing]'
python3 -m pytest
python3 -m pytest -v -k TestTrain
python3 ./test/models/test_train.py TestTrain.test_efficientnet