mirror of
https://github.com/tinygrad/tinygrad.git
synced 2026-06-24 02:14:17 +00:00
* BERT language modeling head + trunc normal initializers * add train loop + helpers * shuffle in dataloaders + slight changes in main loop * beam change * Minor changes * random.shuffle * HParam update * Use deque for dataloader * wandb bert project name * half fixes * BENCHMARK + remove epoch * cast + print() --------- Co-authored-by: chenyu <chenyu@fastmail.com> |
||
|---|---|---|
| .. | ||
| bert.py | ||
| convnext.py | ||
| efficientnet.py | ||
| llama.py | ||
| mask_rcnn.py | ||
| resnet.py | ||
| retinanet.py | ||
| rnnt.py | ||
| transformer.py | ||
| unet3d.py | ||
| vit.py | ||