..
amdpci
proclogs with xccs ( #13626 )
2025-12-09 16:46:08 +03:00
assembly
rdna3 asm + zip_extract ( #13499 )
2025-12-02 22:56:01 -08:00
backends
var_vals uses str for var (#12011 )
2025-09-06 04:16:12 +02:00
datasets
very tiny generate_dataset ( #11013 )
2025-06-27 17:10:45 -04:00
disassemblers /adreno
qcom fix disasm ( #6703 )
2024-09-24 15:23:43 +08:00
dsp
dsp stuff / sniff ioctls from snpe ( #9490 )
2025-03-20 10:38:23 +08:00
gemm
use numpy in amd_uop_matmul for simpler tracing ( #13503 )
2025-11-30 08:04:38 -08:00
hcq
system: reset is a method of pcidevice ( #12936 )
2025-10-27 16:21:10 +08:00
hcqfuzz
feat: add repro command to summary ( #10930 )
2025-11-13 08:52:27 -08:00
hevc
jit: support encdec ( #13563 )
2025-12-04 11:58:34 +03:00
hip_gpu_driver
hip: fix ioctl ( #13548 )
2025-12-03 16:40:43 +03:00
hiprtc
use comgr to compile ( #3248 )
2024-01-26 18:27:49 -08:00
huggingface_onnx
move frontend dir to nn [pr] ( #12470 )
2025-10-07 10:42:22 +08:00
junk
coder.py can write and run code ( #2439 )
2023-11-25 12:27:54 -08:00
mesa
In-tree autogen: all C libraries ( #13220 )
2025-11-13 18:57:44 -08:00
mmapeak
hotfix: 32 workgroups for radeon 8050s
2025-11-30 08:20:17 -08:00
models
stub attention ( #13196 )
2025-11-10 13:48:38 -08:00
nv_gpu_driver
nv: minimal hevc ( #13502 )
2025-11-30 16:46:55 +03:00
optimization
ShapeTracker.real_strides -> is_expanded [pr] ( #12579 )
2025-10-09 22:52:45 -04:00
perfetto
upd perfetto ( #11528 )
2025-08-06 14:00:34 +03:00
qcom_gpu_driver
qcom: support cpu mappings ( #13565 )
2025-12-04 14:50:46 +03:00
remu
add new remu instructions from #13533 ( #13539 )
2025-12-03 06:29:20 +08:00
resnet18
remove Tensor.no_grad, it's meaningless now [pr] ( #10556 )
2025-05-28 22:20:02 -07:00
sched
move fuzz_schedule.py to extra [pr] ( #10444 )
2025-05-21 10:07:24 +03:00
sqtt
lds bank count tests from pmc counters ( #13667 )
2025-12-13 17:39:32 +08:00
thunder
fix: cast on transpose ( #13653 )
2025-12-11 21:03:49 -08:00
tinyfs
tinyfs tweaks ( #13444 )
2025-11-24 18:07:32 -08:00
torch_backend
torch backend: no aten.detach for torch 2.10 compat ( #13381 )
2025-11-20 09:12:15 -08:00
torch_hook
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
usbgpu
hotfix: no hexdump for usbgpu patch.py
2025-11-12 12:05:37 -08:00
webgpu
Autogen webgpu dawn, removing wgpu-py dependency (f16 support part 1) ( #8646 )
2025-02-07 15:16:59 +08:00
archprobe.py
ops_gpu -> ops_cl ( #12103 )
2025-09-10 15:15:48 -04:00
augment.py
[ready] Replacing os with pathlib ( #1708 )
2023-08-30 10:41:08 -07:00
bandwidth_test.py
work from benchmarking tinybox red v2 ( #13264 )
2025-11-13 16:38:40 -08:00
bench_log.py
hotfix: BenchEvent MLPERF_RUN is mlperf_run ( #10526 )
2025-05-26 20:19:37 -04:00
cl_android.sh
source extra/cl_android.sh to fix opencl on android
2025-10-26 15:27:51 +08:00
disk_read_speed.py
io_uring for copies from disk ( #5035 )
2024-06-21 11:36:51 +03:00
dump_cache.py
wow how did i think that was okay ( #2339 )
2023-11-16 21:21:11 -08:00
export_model.py
ops_gpu -> ops_cl ( #12103 )
2025-09-10 15:15:48 -04:00
f16_decompress.py
u32 to f16 in tinygrad ( #8074 )
2024-12-06 12:00:13 +01:00
gpuburn.py
work from benchmarking tinybox red v2 ( #13264 )
2025-11-13 16:38:40 -08:00
gradcheck.py
tests from grad uop path [pr] ( #8313 )
2024-12-18 09:25:05 -08:00
hip_events.py
move autogen to runtime/autogen ( #3254 )
2024-01-26 12:44:19 -08:00
hip_large_kernel.py
minimum change for rdna4 [pr] ( #9455 )
2025-03-16 13:39:24 +08:00
hook_cuda.py
cuda hooking ( #9180 )
2025-02-20 19:20:01 +08:00
introspection.py
move files into uop dir ( #10399 )
2025-05-18 11:38:28 -07:00
lr_scheduler.py
more beautiful cifar ( #10551 )
2025-05-28 20:48:20 -07:00
mcts_search.py
var_vals uses str for var (#12011 )
2025-09-06 04:16:12 +02:00
multitensor.py
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
nvJitLink.h
In-tree autogen: all C libraries ( #13220 )
2025-11-13 18:57:44 -08:00
onnx_helpers.py
onnx helper intermediate node output validation ( #12740 )
2025-10-16 11:17:47 -04:00
reduce_speed.py
VALIDATE_WITH_CPU [pr] ( #9488 )
2025-03-18 15:15:04 +08:00
replay_pkl.py
update Kernel API in tests + move optimize_local_size ( #11907 )
2025-08-28 15:12:47 -07:00
ring_copy.py
ring copy example ( #3185 )
2024-01-19 23:34:30 -05:00
self_tokenize.py
lil op cleanup ( #13424 )
2025-11-22 15:21:15 -08:00
setup_mock_amd_osx.sh
add rocm 6.4 support ( #10491 )
2025-05-23 16:20:54 -07:00
setup_mock_nv_osx.sh
hotfix: setup_mock_nv_osx
2025-02-13 12:26:15 +08:00
test_mi350.sh
amd fp8 llvm ( #13186 )
2025-11-20 12:35:57 -05:00
test_pyrender.py
test pyrender ( #12005 )
2025-09-04 11:48:40 -07:00
thneed.py
ops_gpu -> ops_cl ( #12103 )
2025-09-10 15:15:48 -04:00
threefry.py
feat: make buffer ( #6745 )
2024-09-25 18:31:03 +08:00
to_movement_ops.py
update torch 2.8 ( #12172 )
2025-09-14 15:19:03 -04:00
training.py
tinytqdm.set_description and tinytrange ( #5101 )
2024-06-22 14:45:06 -04:00
transfer_speed.py
hotfix: copy size is in bytes
2024-01-17 16:44:15 +00:00
weekly_commits_table.py
hotfix: update weekly commits table
2025-11-09 19:37:06 -08:00