tinygrad/accel
George Hotz 5e07d4669d
the speedy chonker is going to replace the old chonker (#432)
* bringing back reshape and permute

* done with E701

* 4x4 works in generic way

* max and sum not vectorizing...

* special case single float

* support comparing to MPS

* improve matmul speed, consider generic principles

* GlobalCounter

* fix op tracking

* faster

* comment that out for now

* err, it needs that

* fix minor issues

* fix global_mem
2022-11-11 18:34:24 -08:00
..
ane ane: procPath issue. don't waste more time with this, focus on core tinygrad 2022-08-16 10:36:13 +02:00
cherry only resnet18, it's too slow otherwise 2021-10-30 16:48:39 -07:00
cuda Accel/cuda (#319) 2022-05-14 21:25:30 -07:00
llvm the speedy chonker is going to replace the old chonker (#432) 2022-11-11 18:34:24 -08:00
metal a bit of work on metal 2021-12-30 13:53:08 -05:00
opencl s/contiguous_op/contiguous 2022-11-11 00:07:05 -08:00
rawcpu s/contiguous_op/contiguous 2022-11-11 00:07:05 -08:00
tpu header 2021-10-30 16:41:05 -07:00
MAPPING fix ane on new mac os x 2022-08-06 19:10:22 +00:00
README refactor efficientnet loading 2021-10-30 17:02:17 -07:00

This is where we scope out adding accelerators to tinygrad

ane -- Apple Neural Engine, in the M1 + newer iPhones
cherry -- Largely defunct custom hardware based on a RISC-V extension
tpu -- Google's TPU, available for rent in Google Cloud