tinygrad/extra
George Hotz 55d3a5def9
preallocate all realized buffers (#14823)
* preallocate all realized buffers

* contiguous

* work

* comment that out

* move to schedule

* better

* correct fix

* just buffer

* disk bufs

* fixes disk tensor stuff

* fix symbolic stuff

* fix multi

* 162 failures

* bugfixes

* don't check that anymore

* fix schedule tests

* mnist should be contiguious

* type and buffer

* fix tests

* shrink axis correction

* mypy fixes

* tests skips

* same 37 failures

* dedup

* no shrink in the graph

* 29 failures

* skips

* fix custom kernel

* fix training

* those optimizations aren't supported currently

* simpler

* more correct

* tests

* 14 failures

* works

* fix that test

* broken

* 11 failures

* only kernel counts left

* fixes

* all tests pass

* remove tensor_map

* op test

* 200 -> 230

* test fixes

* fixes

* revert test_tiny thing

* guard

* revert that

* test tiny passes

* no contigs there

* base realize back

* Revert "no contigs there"

This reverts commit c45bb9fcfd.

* revert that

* chop many assigns

* 12 failures

* fix tests

* tests

* apply after

* pre-commit

* remove old code

* delete that

* fix types

* remove extra contig

* fix dataloader

* torch fix

* disk fix

* update kernel fusion numbres

* runs on amd

* restore kernel count

* add that rule back

* that

* disable that

* wrong

* add the correct rule for that folding

* more tests

* guard c1.arg

* no newlines

* realize those

* split into a different file

* remove detach/contig back

* skip 2

* update that
2026-02-20 20:05:54 +08:00
..
amdpci am_smi: enable mem usage back (#14858) 2026-02-18 19:27:27 +03:00
datasets remove more stale stuff (#13765) 2025-12-19 17:14:56 -04:00
dsp dsp stuff / sniff ioctls from snpe (#9490) 2025-03-20 10:38:23 +08:00
fp8 remove CUSTOM_KERNEL / directly construct it (#14604) 2026-02-08 18:43:33 +08:00
gemm gemm multiple is only for cdna4 asm (#14814) 2026-02-17 14:00:02 +09:00
hcq hcq_smi: kill mac pids (#14398) 2026-01-28 15:00:28 +03:00
hcqfuzz feat: add repro command to summary (#10930) 2025-11-13 08:52:27 -08:00
hevc hevc: decoder as iterator (#14091) 2026-01-10 14:57:56 +03:00
hip_gpu_driver amd: alive wgps (#14149) 2026-01-23 00:08:45 +03:00
hiprtc use comgr to compile (#3248) 2024-01-26 18:27:49 -08:00
huggingface_onnx fix test_xlm_roberta_large (#14564) 2026-02-05 14:56:06 -05:00
mesa In-tree autogen: all C libraries (#13220) 2025-11-13 18:57:44 -08:00
mmapeak improve mmapeak, print names with sqtt (#14726) 2026-02-13 16:07:06 +08:00
models feat: llama wqkv (#14841) 2026-02-17 23:01:33 -08:00
nv_gpu_driver nv: pma for 5090 (#14420) 2026-01-29 20:06:01 +03:00
nv_pma stream pma decoder (#14746) 2026-02-14 17:40:18 +03:00
optimization move tests to test/backend (#14691) 2026-02-12 11:09:44 +08:00
perfetto diff devices for sdma (#14589) 2026-02-06 16:39:12 +03:00
qcom_gpu_driver working ioctls (#14272) 2026-01-21 20:29:04 +03:00
remu renderer/amd: move in tree (#14702) 2026-02-12 18:09:16 +08:00
sqtt viz/sqtt: decoder fixes pre rdna4/cdna4 work (#14900) 2026-02-20 12:10:15 +09:00
thunder fa: explicitly pass shapes (#14857) 2026-02-19 05:26:16 -08:00
tinyfs feat: tinyfs load test in benchmark (#14602) 2026-02-06 18:00:00 -08:00
torch_backend preallocate all realized buffers (#14823) 2026-02-20 20:05:54 +08:00
torch_hook rename lazydata to uop (#10698) 2025-06-08 08:42:22 -07:00
usbgpu usbgpu: use BOT interface for patch.py (#13644) 2026-02-02 11:54:46 +08:00
viz viz: get program binary from the UOp (#14787) 2026-02-16 15:46:58 +09:00
webgpu Autogen webgpu dawn, removing wgpu-py dependency (f16 support part 1) (#8646) 2025-02-07 15:16:59 +08:00
archprobe.py ops_gpu -> ops_cl (#12103) 2025-09-10 15:15:48 -04:00
bench_log.py hotfix: BenchEvent MLPERF_RUN is mlperf_run (#10526) 2025-05-26 20:19:37 -04:00
cl_android.sh source extra/cl_android.sh to fix opencl on android 2025-10-26 15:27:51 +08:00
export_model.py no core_id (#14265) 2026-01-23 21:30:12 +03:00
f16_decompress.py u32 to f16 in tinygrad (#8074) 2024-12-06 12:00:13 +01:00
gradcheck.py tests from grad uop path [pr] (#8313) 2024-12-18 09:25:05 -08:00
hip_large_kernel.py Buffer.as_buffer -> Buffer.as_memoryview [pr] (#14535) 2026-02-04 11:31:11 -05:00
hook_cuda.py cuda hooking (#9180) 2025-02-20 19:20:01 +08:00
introspection.py move files into uop dir (#10399) 2025-05-18 11:38:28 -07:00
lr_scheduler.py more beautiful cifar (#10551) 2025-05-28 20:48:20 -07:00
multitensor.py rename lazydata to uop (#10698) 2025-06-08 08:42:22 -07:00
nvJitLink.h In-tree autogen: all C libraries (#13220) 2025-11-13 18:57:44 -08:00
onnx_helpers.py onnx helper intermediate node output validation (#12740) 2025-10-16 11:17:47 -04:00
setup_mock_amd_osx.sh add rocm 6.4 support (#10491) 2025-05-23 16:20:54 -07:00
setup_mock_nv_osx.sh hotfix: setup_mock_nv_osx 2025-02-13 12:26:15 +08:00
setup_nvcc_osx.sh nvcc/nvdisasm macos shim (#14822) 2026-02-17 20:07:05 +09:00
test_mi350.sh move tests to test/backend (#14691) 2026-02-12 11:09:44 +08:00
thneed.py ops_gpu -> ops_cl (#12103) 2025-09-10 15:15:48 -04:00
training.py tinytqdm.set_description and tinytrange (#5101) 2024-06-22 14:45:06 -04:00
weekly_commits_table.py add chrism 2025-12-14 00:45:57 -05:00