You like pytorch? You like micrograd? You love tinygrad! ❤️
  • Python 73.1%
  • C 14.2%
  • Cuda 8%
  • C++ 2.2%
  • Metal 1.7%
  • Other 0.6%
Find a file
Yosef Frost 613bcd945d
Added Test Coverage to Int32 and Make Sure Tests Succeed (#1174)
* Added test coverage for int32 in `test/test_dtype.py`

Tests for int32 include:
- testing that int32 can be converted into a numpy array
- testing that float and int64 can be cast into int32
- testing that int32 can be cast into float and int64
- testing addition, multiplication, and matrix multiplication with int32
- testing that addition, multiplication, and matrix multiplication with int32 and either float or int64 gets successfully cast into float and int64, respectively

Additional changes include testing that int8 casts into int32 and testing that float16 casts into int32

* Added type casting to the add, subtract, and divide binary operations

* Added automatic type casting when types differ to FusedOps.MULACC

I moved the match_types function back so that I could call it in einsum_mulacc where it would cast the types of the MULACC to be the same

* Added unit test for match_types and added type hints to the parameters

* Added tests for ops_cpu.match_types

* Changed ops_cpu.einsum logic to play nicely with PyTorch

Changed `tinygrad.runtime.ops_cpu.einsum_mulacc` logic to not perform type matching. Type matching was instead moved to the numpy_fxn_for_op dictionary in the ops_cpu file. Since ops_torch uses the same einsum_mulacc function, this should fix all the broken pytorch tests.

* empty commit to rerun ci

* reverting PR#1213 in attempt to fix broken test

* Removed all tests I added to see if they are causing CI issues

* Added back type matching tests

* removed type matching tests and added back int tests

* added back part of the type matching tests

* removed braking type matching tests

* empty commit for testing

* added test back but inside comment

* removed a test from the comment to see if it breaks CI

* removed another function

* more testing

* emptied test comment

* cleaned up comments

* Added optimize=True flag to einsum_mullac in cpu_ops.py

* Removed unnecessary imports from tests

* optimized match_types by removing unnecessary array copying
2023-07-12 10:29:15 -07:00
.github/workflows ci: don't install recommended packages for GPU (#1215) 2023-07-11 15:38:49 -07:00
cache add ff_dim to transformer 2021-11-29 12:40:52 -05:00
disassemblers/adreno fix path linter issue 2023-04-18 19:17:41 -07:00
docs Fix constant folding for Tensor([3]) (#1227) 2023-07-11 14:01:32 -07:00
examples Outdated repository URL (#1218) 2023-07-11 23:14:19 -07:00
extra good stuff from tensor cores branch (#1199) 2023-07-08 16:58:26 -07:00
models style: else-after-return (#1216) 2023-07-12 10:26:38 -07:00
openpilot global -> group (#1007) 2023-06-21 11:50:43 -07:00
test Added Test Coverage to Int32 and Make Sure Tests Succeed (#1174) 2023-07-12 10:29:15 -07:00
tinygrad Added Test Coverage to Int32 and Make Sure Tests Succeed (#1174) 2023-07-12 10:29:15 -07:00
weights cleanup clip tokenizer 2022-09-12 09:20:12 -07:00
.editorconfig Basic editorconfig support (#422) 2022-11-08 10:34:25 -08:00
.gitignore Fix naming conflict with huggingface datasets (#1161) 2023-07-07 10:43:44 -07:00
.pre-commit-config.yaml Refactor load/store before tensor cores (#1193) 2023-07-08 15:54:58 -07:00
.pylintrc style: else-after-return (#1216) 2023-07-12 10:26:38 -07:00
.tokeignore Add a quick start guide (#900) 2023-06-04 08:51:20 -07:00
compile.sh stop wasting time with the compiler. tinygrad needs to just jit 2023-03-12 12:08:46 -07:00
CONTRIBUTING.md feat: reword contributing (#1131) 2023-07-04 22:17:47 -07:00
LICENSE Updated LICENSE year (#760) 2023-05-01 15:35:23 -07:00
push_pypi.sh push pypi 2020-10-27 08:13:15 -07:00
README.md Outdated repository URL (#1218) 2023-07-11 23:14:19 -07:00
rmso.sh compile works (#688) 2023-03-12 11:01:25 -07:00
run_multibackend.sh dtypes nice and clean (#673) 2023-03-10 16:56:07 -08:00
setup.py Make cross_process use cloudpickle (#1118) 2023-07-04 00:47:34 -07:00
strip_whitespace.sh strip whitespace 2023-06-27 10:11:43 -07:00
sz.py move line counter to python 2023-05-29 09:21:40 -07:00

logo

tinygrad: For something between PyTorch and karpathy/micrograd. Maintained by tiny corp.

Homepage | Documentation | Examples | Showcase | Discord

GitHub Repo stars Unit Tests Discord Lines of code


This may not be the best deep learning framework, but it is a deep learning framework.

Due to its extreme simplicity, it aims to be the easiest framework to add new accelerators to, with support for both inference and training. If XLA is CISC, tinygrad is RISC.

tinygrad is still alpha software, but we raised some money to make it good. Someday, we will tape out chips.

Features

LLaMA and Stable Diffusion

tinygrad can run LLaMA and Stable Diffusion!

Laziness

Try a matmul. See how, despite the style, it is fused into one kernel with the power of laziness.

DEBUG=3 python3 -c "from tinygrad.tensor import Tensor;
N = 1024; a, b = Tensor.rand(N, N), Tensor.rand(N, N);
c = (a.reshape(N, 1, N) * b.permute(1,0).reshape(1, N, N)).sum(axis=2);
print((c.numpy() - (a.numpy() @ b.numpy())).mean())"

And we can change DEBUG to 4 to see the generated code.

Neural networks

As it turns out, 90% of what you need for neural networks are a decent autograd/tensor library. Throw in an optimizer, a data loader, and some compute, and you have all you need.

Neural network example (from test/models/test_mnist.py)

from tinygrad.tensor import Tensor
import tinygrad.nn.optim as optim

class TinyBobNet:
  def __init__(self):
    self.l1 = Tensor.uniform(784, 128)
    self.l2 = Tensor.uniform(128, 10)

  def forward(self, x):
    return x.dot(self.l1).relu().dot(self.l2).log_softmax()

model = TinyBobNet()
optim = optim.SGD([model.l1, model.l2], lr=0.001)

# ... complete data loader here

out = model.forward(x)
loss = out.mul(y).mean()
optim.zero_grad()
loss.backward()
optim.step()

Accelerators

tinygrad already supports numerous accelerators, including:

  • CPU
  • GPU (OpenCL)
  • C Code (Clang)
  • LLVM
  • METAL
  • CUDA
  • Triton
  • PyTorch

And it is easy to add more! Your accelerator of choice only needs to support a total of 26 (optionally 27) low level ops. More information can be found in the documentation for adding new accelerators.

Installation

The current recommended way to install tinygrad is from source.

From source

git clone https://github.com/tinygrad/tinygrad.git
cd tinygrad
python3 -m pip install -e .

Don't forget the . at the end!

Documentation

Documentation along with a quick start guide can be found in the docs/ directory.

Quick example comparing to PyTorch

from tinygrad.tensor import Tensor

x = Tensor.eye(3, requires_grad=True)
y = Tensor([[2.0,0,-2.0]], requires_grad=True)
z = y.matmul(x).sum()
z.backward()

print(x.grad.numpy())  # dz/dx
print(y.grad.numpy())  # dz/dy

The same thing but in PyTorch:

import torch

x = torch.eye(3, requires_grad=True)
y = torch.tensor([[2.0,0,-2.0]], requires_grad=True)
z = y.matmul(x).sum()
z.backward()

print(x.grad.numpy())  # dz/dx
print(y.grad.numpy())  # dz/dy

Contributing

There has been a lot of interest in tinygrad lately. Here are some basic guidelines for contributing:

  • Bug fixes are the best and always welcome! Like this one.
  • If you don't understand the code you are changing, don't change it!
  • All code golf PRs will be closed, but conceptual cleanups are great.
  • Features are welcome. Though if you are adding a feature, you need to include tests.
  • Improving test coverage is great, with reliable non-brittle tests.

Additional guidelines can be found in CONTRIBUTING.md.

Running tests

For more examples on how to run the full test suite please refer to the CI workflow.

Some examples:

python3 -m pip install -e '.[testing]'
python3 -m pytest
python3 -m pytest -v -k TestTrain
python3 ./test/models/test_train.py TestTrain.test_efficientnet