amdpci
hive_reset respects lock ( #14618 )
2026-02-08 10:47:25 +03:00
assembly/amd
assembly/amd: fix saturation in python remu ( #14557 )
2026-02-05 18:35:57 +08:00
datasets
remove more stale stuff ( #13765 )
2025-12-19 17:14:56 -04:00
dsp
dsp stuff / sniff ioctls from snpe ( #9490 )
2025-03-20 10:38:23 +08:00
fp8
remove CUSTOM_KERNEL / directly construct it ( #14604 )
2026-02-08 18:43:33 +08:00
gemm
remove CUSTOM_KERNEL / directly construct it ( #14604 )
2026-02-08 18:43:33 +08:00
hcq
hcq_smi: kill mac pids ( #14398 )
2026-01-28 15:00:28 +03:00
hcqfuzz
feat: add repro command to summary ( #10930 )
2025-11-13 08:52:27 -08:00
hevc
hevc: decoder as iterator ( #14091 )
2026-01-10 14:57:56 +03:00
hip_gpu_driver
amd: alive wgps ( #14149 )
2026-01-23 00:08:45 +03:00
hiprtc
use comgr to compile ( #3248 )
2024-01-26 18:27:49 -08:00
huggingface_onnx
fix test_xlm_roberta_large ( #14564 )
2026-02-05 14:56:06 -05:00
mesa
In-tree autogen: all C libraries ( #13220 )
2025-11-13 18:57:44 -08:00
mmapeak
mfma loop in asm dsl ( #14349 )
2026-01-27 11:11:37 +09:00
models
remove CUSTOM_KERNEL / directly construct it ( #14604 )
2026-02-08 18:43:33 +08:00
nv_gpu_driver
nv: pma for 5090 ( #14420 )
2026-01-29 20:06:01 +03:00
nv_pma
nv: add prof props to dev ( #14437 )
2026-01-30 12:51:43 +03:00
optimization
move more tests to test/null, split some existing ones ( #14512 )
2026-02-03 20:20:20 +08:00
perfetto
diff devices for sdma ( #14589 )
2026-02-06 16:39:12 +03:00
qcom_gpu_driver
working ioctls ( #14272 )
2026-01-21 20:29:04 +03:00
remu
simplify mi350x gemm / viz asm tests ( #13984 )
2026-01-03 11:11:07 +09:00
sqtt
diff devices for sdma ( #14589 )
2026-02-06 16:39:12 +03:00
thunder
fa: block skipping for fa kv bwd ( #14569 )
2026-02-05 16:13:53 -08:00
tinyfs
feat: tinyfs load test in benchmark ( #14602 )
2026-02-06 18:00:00 -08:00
torch_backend
remove allow_shape_mismatch in Tensor.replace ( #14536 )
2026-02-04 12:38:18 -05:00
torch_hook
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
usbgpu
usbgpu: use BOT interface for patch.py ( #13644 )
2026-02-02 11:54:46 +08:00
viz
llama: add VIZ=-1 to dev_run ( #14583 )
2026-02-06 22:59:22 +09:00
webgpu
Autogen webgpu dawn, removing wgpu-py dependency (f16 support part 1) ( #8646 )
2025-02-07 15:16:59 +08:00
archprobe.py
ops_gpu -> ops_cl ( #12103 )
2025-09-10 15:15:48 -04:00
bench_log.py
hotfix: BenchEvent MLPERF_RUN is mlperf_run ( #10526 )
2025-05-26 20:19:37 -04:00
cl_android.sh
source extra/cl_android.sh to fix opencl on android
2025-10-26 15:27:51 +08:00
export_model.py
no core_id ( #14265 )
2026-01-23 21:30:12 +03:00
f16_decompress.py
u32 to f16 in tinygrad ( #8074 )
2024-12-06 12:00:13 +01:00
gradcheck.py
tests from grad uop path [pr] ( #8313 )
2024-12-18 09:25:05 -08:00
hip_large_kernel.py
Buffer.as_buffer -> Buffer.as_memoryview [pr] ( #14535 )
2026-02-04 11:31:11 -05:00
hook_cuda.py
cuda hooking ( #9180 )
2025-02-20 19:20:01 +08:00
introspection.py
move files into uop dir ( #10399 )
2025-05-18 11:38:28 -07:00
lr_scheduler.py
more beautiful cifar ( #10551 )
2025-05-28 20:48:20 -07:00
multitensor.py
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
nvJitLink.h
In-tree autogen: all C libraries ( #13220 )
2025-11-13 18:57:44 -08:00
onnx_helpers.py
onnx helper intermediate node output validation ( #12740 )
2025-10-16 11:17:47 -04:00
setup_mock_amd_osx.sh
add rocm 6.4 support ( #10491 )
2025-05-23 16:20:54 -07:00
setup_mock_nv_osx.sh
hotfix: setup_mock_nv_osx
2025-02-13 12:26:15 +08:00
test_mi350.sh
amd fp8 llvm ( #13186 )
2025-11-20 12:35:57 -05:00
thneed.py
ops_gpu -> ops_cl ( #12103 )
2025-09-10 15:15:48 -04:00
training.py
tinytqdm.set_description and tinytrange ( #5101 )
2024-06-22 14:45:06 -04:00
weekly_commits_table.py
add chrism
2025-12-14 00:45:57 -05:00