tinygrad/extra/models
George Hotz a72b1b6d65
sharding for llama (#3151)
* shard llama

* sharding works

* simpler

* simpler

* consume option

* disable that test

* save a line

---------

Co-authored-by: George Hotz <george@tinygrad.org>
2024-01-16 19:28:00 -08:00
bert.py          move to new cached fetch (#2493)   2023-11-28 17:36:55 -08:00
convnext.py      move to new cached fetch (#2493)   2023-11-28 17:36:55 -08:00
efficientnet.py  add name support to fetch (#2407)  2023-11-23 14:16:17 -08:00
llama.py         sharding for llama (#3151)         2024-01-16 19:28:00 -08:00
mask_rcnn.py     move dtypes to dtype.py (#2964)    2024-01-01 14:58:48 -08:00
resnet.py        move to new cached fetch (#2493)   2023-11-28 17:36:55 -08:00
retinanet.py     move to new cached fetch (#2493)   2023-11-28 17:36:55 -08:00
rnnt.py          move to new cached fetch (#2493)   2023-11-28 17:36:55 -08:00
transformer.py   replace with tensor op (#3099)     2024-01-12 14:13:40 -05:00
unet3d.py        move to new cached fetch (#2493)   2023-11-28 17:36:55 -08:00
vit.py           move to new cached fetch (#2493)   2023-11-28 17:36:55 -08:00