tinygrad/extra/models
Francis Lata c3187087f7
QwQ-32B-Preview support (#7962)
* load weights with some debugging

* start running a prompt

* cleanup

* optionally permute layers and cleanup

* add validation for simple prompt

* small cleanup

* minor cleanup with formatting download links

* add a longer prompt

* add timing option

* some typings

* remove unused arg

* reset GlobalCounters

* minor cleanups
2024-12-04 21:46:37 -05:00
..
bert.py Tensor.erf (#7419) 2024-10-30 18:12:28 -04:00
clip.py clip device fix (#6924) 2024-10-07 00:47:32 +08:00
convnext.py move to new cached fetch (#2493) 2023-11-28 17:36:55 -08:00
efficientnet.py remove the magic methods for moving between devices [pr] (#6881) 2024-10-04 20:27:52 +08:00
inception.py Compute FID Score (#6802) 2024-10-01 19:47:58 -04:00
llama.py QwQ-32B-Preview support (#7962) 2024-12-04 21:46:37 -05:00
mask_rcnn.py remove numpy from dtype (#4969) 2024-06-14 15:38:45 -04:00
resnet.py combine pad2d with pad (#7677) 2024-11-14 17:56:02 +08:00
retinanet.py combine pad2d with pad (#7677) 2024-11-14 17:56:02 +08:00
rnnt.py change Tensor.stack to method (#4719) 2024-05-24 17:04:19 -04:00
t5.py Flux.1 (#6334) 2024-09-24 10:08:04 +08:00
transformer.py replace with tensor op (#3099) 2024-01-12 14:13:40 -05:00
unet.py These casts should only happen if these are supported (#7644) 2024-11-12 07:56:50 +08:00
unet3d.py move to new cached fetch (#2493) 2023-11-28 17:36:55 -08:00
vit.py move to new cached fetch (#2493) 2023-11-28 17:36:55 -08:00