Mirror of https://github.com/tinygrad/tinygrad.git, synced 2026-06-24 02:14:17 +00:00
* newton schulz * add muon + move newton schulz to tensor * compact newton schulz * better tests * cleanup * add comments for muon * cleanup * add export with tests * match muon optim with test optim * cleanup * unsed import * correct comment * whitespace * move export * muon test fix * match reference impl + tests * remove export by moving muon device * add credit * cleanup * remove print * spacing * spacing * comma * cleanup * removal * fix tests + optim momentum * consistent is not/ not * more consistency * fix test * cleanup * fix the nones * remove comment * cast * comment * comment * muon teeny test * muon flag beautiful mnist * set steps * steps as hyperparam * match default test steps * name * large cleanup * dont care about steps * nesterov false default * match each other impl * steps * switch nest * swap defaults * update docstring * add no nesterov test * ban fuse_optim * prints * classical momentum * alternative condition * recon * pre + post wd * false default * detach * signature changes * context * swap order * big cleanup * 0 step instead * parity * remove fuse * remove fused * better paper * assert message * correct shape check + eps * multidim * add eps * cleanup * correct assert message * lint * better tests * naming * ns_steps,ns_params * update docstring * docstring * match sgd and muon together * sandwich * add back fused * parity

Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>

| Name | | |
|---|---|---|
| amdpci | ||
| assembly | ||
| backends | ||
| datasets | ||
| disassemblers/adreno | ||
| dsp | ||
| gemm | ||
| hcq | ||
| hcqfuzz | ||
| hip_gpu_driver | ||
| hiprtc | ||
| huggingface_onnx | ||
| junk | ||
| mmapeak | ||
| models | ||
| nv_gpu_driver | ||
| optimization | ||
| perfetto | ||
| qcom_gpu_driver | ||
| remu | ||
| resnet18 | ||
| sched | ||
| sqtt | ||
| torch_backend | ||
| torch_hook | ||
| usbgpu | ||
| webgpu | ||
| archprobe.py | ||
| augment.py | ||
| bench_log.py | ||
| disk_read_speed.py | ||
| dump_cache.py | ||
| export_model.py | ||
| f16_decompress.py | ||
| gradcheck.py | ||
| hip_events.py | ||
| hip_large_kernel.py | ||
| hook_cuda.py | ||
| introspection.py | ||
| lr_scheduler.py | ||
| mcts_search.py | ||
| multitensor.py | ||
| onnx.py | ||
| onnx_helpers.py | ||
| reduce_speed.py | ||
| replay_pkl.py | ||
| ring_copy.py | ||
| setup_mock_amd_osx.sh | ||
| setup_mock_nv_osx.sh | ||
| thneed.py | ||
| threefry.py | ||
| to_movement_ops.py | ||
| torch_muon.py | ||
| training.py | ||
| transfer_speed.py | ||