mirrors/tinygrad: You like pytorch? You like micrograd? You love tinygrad! ❤️

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-06-24 02:14:17 +00:00

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 73.1%
C 14.2%
Cuda 8%
C++ 2.2%
Metal 1.7%
Other 0.6%

Find a file

George Hotz 07ece2105e actually move it		2020-12-12 15:26:58 -08:00
.github/workflows	Add pyopencl to dependency installs (#174 )	2020-12-10 09:24:08 -08:00
ane	ane	2020-12-12 15:23:21 -08:00
docs	adds beautiful and meaningful logo	2020-10-26 18:12:49 +01:00
examples	tinygrad.utils -> extra.utils	2020-12-12 15:26:07 -08:00
extra	actually move it	2020-12-12 15:26:58 -08:00
test	tinygrad.utils -> extra.utils	2020-12-12 15:26:07 -08:00
tinygrad	actually move it	2020-12-12 15:26:58 -08:00
.gitignore	Support for Apple Neural Engine (#130 )	2020-12-03 10:32:26 -08:00
LICENSE	readme	2020-10-18 11:27:37 -07:00
push_pypi.sh	push pypi	2020-10-27 08:13:15 -07:00
README.md	need zero grad now	2020-12-07 23:10:43 -08:00
requirements.txt	fix pyopencl (#125 )	2020-11-19 19:03:04 -08:00
setup.py	Extra install requirements. (#164 )	2020-12-09 02:22:47 -08:00

README.md

For something in between a pytorch and a karpathy/micrograd

This may not be the best deep learning framework, but it is a deep learning framework.

The Tensor class is a wrapper around a numpy array, except it does Tensor things.

tinygrad is also a city in Russia.

Installation

pip3 install git+https://github.com/geohot/tinygrad.git --upgrade

Example

from tinygrad.tensor import Tensor

x = Tensor.eye(3)
y = Tensor([[2.0,0,-2.0]])
z = y.matmul(x).sum()
z.backward()

print(x.grad)  # dz/dx
print(y.grad)  # dz/dy

Same example in torch

import torch

x = torch.eye(3, requires_grad=True)
y = torch.tensor([[2.0,0,-2.0]], requires_grad=True)
z = y.matmul(x).sum()
z.backward()

print(x.grad)  # dz/dx
print(y.grad)  # dz/dy

Neural networks?

It turns out, a decent autograd tensor library is 90% of what you need for neural networks. Add an optimizer (SGD, RMSprop, and Adam implemented) from tinygrad.optim, write some boilerplate minibatching code, and you have all you need.

Neural network example (from test/test_mnist.py)

from tinygrad.tensor import Tensor
import tinygrad.optim as optim

class TinyBobNet:
  def __init__(self):
    self.l1 = Tensor.uniform(784, 128)
    self.l2 = Tensor.uniform(128, 10)

  def forward(self, x):
    return x.dot(self.l1).relu().dot(self.l2).logsoftmax()

model = TinyBobNet()
optim = optim.SGD([model.l1, model.l2], lr=0.001)

# ... and complete like pytorch, with (x,y) data

out = model.forward(x)
loss = out.mul(y).mean()
optim.zero_grad()
loss.backward()
optim.step()

GPU Support?!

tinygrad supports GPUs through PyOpenCL.

from tinygrad.tensor import Tensor
(Tensor.ones(4,4).cuda() + Tensor.ones(4,4).cuda()).cpu()

ANE Support?!?!

So it doesn't work yet, but see the ane directory for code to use the Apple Neural Engine at a low level.

ImageNet inference

Despite being tiny, tinygrad supports the full EfficientNet. Pass in a picture to discover what it is.

ipython3 examples/efficientnet.py https://upload.wikimedia.org/wikipedia/commons/4/41/Chicken.jpg

Or, if you have a webcam and cv2 installed

ipython3 examples/efficientnet.py webcam

PROTIP: Set "GPU=1" environment variable if you want this to go faster.

PROPROTIP: Set "DEBUG=1" environment variable if you want to see why it's slow.

The promise of small

tinygrad will always be below 1000 lines. If it isn't, we will revert commits until tinygrad becomes smaller.

Running tests

python3 -m pytest

TODO

Train an EfficientNet on ImageNet
- Make broadcasting work on the backward pass (simple please)
- EfficientNet backward pass
- Tensors on GPU (a few more backward)
Add a language model. BERT?
Add a detection model. EfficientDet?
Reduce code
Increase speed
Add features