tinygrad/examples
chenyu f0d7ad8aaa
fix gpt2 attention with start_pos = 0 (#3061)
* fix gpt2 attention with start_pos size 1

test cases taken from ll_transformer branch

* fix interpreted
2024-01-09 16:14:55 -05:00
..
conversation_data Whisper + LLAMA + VITS (#2332) 2023-12-02 15:03:46 -08:00
mlperf move dtypes to dtype.py (#2964) 2024-01-01 14:58:48 -08:00
rl more beautiful_cartpole with exposed hparams 2024-01-07 17:41:09 -08:00
sovits_helpers move dtypes to dtype.py (#2964) 2024-01-01 14:58:48 -08:00
vgg7_helpers move to new cached fetch (#2493) 2023-11-28 17:36:55 -08:00
webgl/yolov8 webgl backend in extra (#3041) 2024-01-08 09:29:13 -08:00
webgpu/stable_diffusion s/lazydata.realized/lazydata.base.realized/g (#2914) 2023-12-22 14:45:13 -05:00
__init__.py failing llama test 2023-03-11 16:28:10 -08:00
beautiful_cartpole.py more beautiful_cartpole with exposed hparams 2024-01-07 17:41:09 -08:00
beautiful_mnist.py remove obsolete TODO in beautiful_mnist (#2946) 2023-12-28 17:09:23 -05:00
benchmark_train_efficientnet.py move globalcounters to ops (#2960) 2024-01-01 14:21:02 -08:00
coder.py move gpt2/llama sampling inside the model call (#3013) 2024-01-04 17:01:50 -05:00
compile_efficientnet.py webgl backend in extra (#3041) 2024-01-08 09:29:13 -08:00
compile_tensorflow.py updating to work with new internal apis (#2755) 2023-12-13 21:54:47 -08:00
conversation.py move dtypes to dtype.py (#2964) 2024-01-01 14:58:48 -08:00
efficientnet.py torch and numpy don't share ops anymore (#2412) 2023-11-23 16:58:10 -08:00
f16_w_uint32.py move dtypes to dtype.py (#2964) 2024-01-01 14:58:48 -08:00
gpt2.py fix gpt2 attention with start_pos = 0 (#3061) 2024-01-09 16:14:55 -05:00
handcode_resnet50_opt.py vars_from_ast -> LazyOp.vars (#2965) 2024-01-01 18:12:38 -05:00
hlb_cifar10.py test works 2024-01-03 07:22:01 -08:00
index.html Enable Multi-Output Export (#2179) 2023-10-30 18:42:26 -07:00
llama.py St real size (#3046) 2024-01-08 14:44:53 -08:00
mask_rcnn.py move things, clean up extra (#2292) 2023-11-13 20:18:40 -08:00
mixtral.py Bitcast hip fix + fix mixtral (#3022) 2024-01-05 14:51:25 -08:00
mnist_gan.py move state to nn/state (#1619) 2023-08-22 07:36:24 -07:00
serious_mnist.py move state to nn/state (#1619) 2023-08-22 07:36:24 -07:00
simple_conv_bn.py with Tensor.train() (#1935) 2023-09-28 18:02:31 -07:00
so_vits_svc.py move dtypes to dtype.py (#2964) 2024-01-01 14:58:48 -08:00
stable_diffusion.py move dtypes to dtype.py (#2964) 2024-01-01 14:58:48 -08:00
stable_diffusion_seed0.png validate stable diffusion for seed 0 (#2773) 2023-12-15 00:07:09 -05:00
train_efficientnet.py move things, clean up extra (#2292) 2023-11-13 20:18:40 -08:00
train_resnet.py move things, clean up extra (#2292) 2023-11-13 20:18:40 -08:00
transformer.py move things, clean up extra (#2292) 2023-11-13 20:18:40 -08:00
vgg7.py waifu2x vgg7: testcase, auto-RGBA->RGB, function to grab pretrained models, training "fix" (#2117) 2023-10-19 22:07:15 -07:00
vit.py move to new cached fetch (#2493) 2023-11-28 17:36:55 -08:00
vits.py move dtypes to dtype.py (#2964) 2024-01-01 14:58:48 -08:00
whisper.py Update Whisper to use fetch helper (#2401) 2023-11-23 12:59:59 -08:00
yolov3.py Update yolov3.py (#2680) 2023-12-08 12:59:38 -08:00
yolov8-onnx.py [ready] Replacing os with pathlib (#1708) 2023-08-30 10:41:08 -07:00
yolov8.py move to new cached fetch (#2493) 2023-11-28 17:36:55 -08:00