mirror of
https://github.com/tinygrad/tinygrad.git
synced 2026-06-24 02:14:17 +00:00
* add ability to ORT=1 * test_vs_ort * useless f * actually have benchmark take in modelproto for more flexibility in huggingface stuff * ok runs * good * oops fix benchmark_onnx __main__ * 224 as default * add ORT=1 option to huggingface_onnx * use Tensor to get_input * add abilty to do single onnx model testing * better names * merge properly... * copy in onnx_helpers * better * decent script * need to add debug tool first * new limit usage * why did narrowing_error come back.. * pretty decent * revert validate change * more ops bug fixes * revert unnecessary changes * fix InstanceNorm too * remove op from O4 * minimize diff * address old feedback * unsure of this, just revert * remove that assert * working attention * to_python_const Attention * cant init from np constant so just do this * final * fix bug in attention * attention clean ups * add hard TODOs and REPOPATH and TRUNCATE envvar * fix input_ids default value * final * fix scatter * cleaner _prepare_quantize * use new attention and tempfile for huggingface script * more stats * update * remove outdated code * big refactor to something usable by CI * booooooom * clean up * update to using yaml as env var input * add dry run * try * valid pad * use argparser and fix gather bug * ignore all yaml * tiny bit more polish * woah ignoring all yaml was not right * typo * decouple huggingface_onnx_run debug run with huggingface_onnx_download * bug fix for downloading single model * WOOOO ok much better * oops argparse 'required' is an invalid argument for positionals * oops argparse 'required' is an invalid argument for positionals * add assert * fix types --------- Co-authored-by: chenyu <chenyu@fastmail.com> |
||
|---|---|---|
| .. | ||
| accel | ||
| amdpci | ||
| assembly | ||
| backends | ||
| datasets | ||
| disassemblers/adreno | ||
| dsp | ||
| gemm | ||
| hip_gpu_driver | ||
| hiprtc | ||
| huggingface_onnx | ||
| junk | ||
| models | ||
| nv_gpu_driver | ||
| optimization | ||
| qcom_gpu_driver | ||
| resnet18 | ||
| sqtt | ||
| torch_backend | ||
| torch_hook | ||
| webgpu | ||
| archprobe.py | ||
| augment.py | ||
| disk_read_speed.py | ||
| dump_cache.py | ||
| export_model.py | ||
| f16_decompress.py | ||
| gradcheck.py | ||
| hip_events.py | ||
| hook_cuda.py | ||
| introspection.py | ||
| lr_scheduler.py | ||
| mcts_search.py | ||
| multitensor.py | ||
| onnx.py | ||
| onnx_helpers.py | ||
| reduce_speed.py | ||
| replay_pkl.py | ||
| ring_copy.py | ||
| setup_mock_amd_osx.sh | ||
| setup_mock_nv_osx.sh | ||
| thneed.py | ||
| threefry.py | ||
| to_movement_ops.py | ||
| training.py | ||
| transfer_speed.py | ||