tinygrad/extra/models
chenyu e468601226
update llama attention casting (#5096)
* update llama attention casting

updated scaled_dot_product_attention middle cast and removed hard-coded half in llama attention.

* fix that
2024-06-22 10:57:17 -04:00
bert.py          Residual in MLM loss + Change default steps (#4935)  2024-06-12 16:09:18 -04:00
convnext.py      move to new cached fetch (#2493)                     2023-11-28 17:36:55 -08:00
efficientnet.py  simple LoadOps.ASSIGN (#3745)                        2024-03-14 20:44:34 -07:00
llama.py         update llama attention casting (#5096)               2024-06-22 10:57:17 -04:00
mask_rcnn.py     remove numpy from dtype (#4969)                      2024-06-14 15:38:45 -04:00
resnet.py        update resnet.load_from_pretrained (#5040)           2024-06-18 16:29:22 -04:00
retinanet.py     move to new cached fetch (#2493)                     2023-11-28 17:36:55 -08:00
rnnt.py          change Tensor.stack to method (#4719)                2024-05-24 17:04:19 -04:00
transformer.py   replace with tensor op (#3099)                       2024-01-12 14:13:40 -05:00
unet3d.py        move to new cached fetch (#2493)                     2023-11-28 17:36:55 -08:00
vit.py           move to new cached fetch (#2493)                     2023-11-28 17:36:55 -08:00