tinygrad/tinygrad
George Hotz cd97b036cc
A Triton backend for tinygrad (#470)
* triton can add

* print stuff from triton

* write out file

* ops triton working

* reduce ops

* sort of works

* Triton bugfixes & implementation of remaining ops (#490)

* padding

* support pow, max, relu, gt0

* allocate return buffer

* Fix reduce

* Add tests for power op

* Fix triton illegal memory accesses and memory leak (#512)

* Fix mypy issue

* Add triton to setup.py

* Replace torch with pycuda

* Use one cuda stream for data transfer and kernels

* Remove triton submodule

* Fix memory leak by using weakrefs for caching

* Fix memory access by adding valid as mask for load

* Fix invalid kernel launches by flattening the grid (#515)

---------

Co-authored-by: Martin Loretz <20306567+martinloretzzz@users.noreply.github.com>
2023-02-01 11:53:57 -08:00
llops        A Triton backend for tinygrad (#470)  2023-02-01 11:53:57 -08:00
nn           oops, broke BN                        2023-01-31 08:18:48 -08:00
runtime      Refactor getenv into helpers (#508)   2023-01-31 15:09:09 -08:00
shape        no generic exceptions                 2023-02-01 11:14:37 -08:00
__init__.py  tinygrad.nn (#367)                    2022-08-18 07:41:00 -07:00
ast.py       tests pass w/o float4                 2023-01-30 15:40:49 -08:00
graph.py     Refactor getenv into helpers (#508)   2023-01-31 15:09:09 -08:00
helpers.py   no generic exceptions                 2023-02-01 11:14:37 -08:00
lazy.py      Refactor getenv into helpers (#508)   2023-01-31 15:09:09 -08:00
mlops.py     Replace SIGN with GT0 (#511)          2023-02-01 11:01:39 -08:00
ops.py       no generic exceptions                 2023-02-01 11:14:37 -08:00
tensor.py    no generic exceptions                 2023-02-01 11:14:37 -08:00