amdpci
am_smi: kill process group ( #10750 )
2025-06-10 15:23:39 +03:00
assembly
move kernel to opt ( #10899 )
2025-06-20 15:22:28 -07:00
backends
move kernel to opt ( #10899 )
2025-06-20 15:22:28 -07:00
datasets
refactor LOAD(DEFINE_GLOBAL, VIEW) in kernels to LOAD(VIEW(DEFINE_GLOBAL)) ( #10541 )
2025-05-30 14:27:58 +03:00
disassemblers/adreno
qcom fix disasm ( #6703 )
2024-09-24 15:23:43 +08:00
dsp
dsp stuff / sniff ioctls from snpe ( #9490 )
2025-03-20 10:38:23 +08:00
gemm
add halide example ( #10980 )
2025-06-26 16:14:57 -07:00
hcqfuzz
Fix/hcqfuzz harnesss bug ( #10923 )
2025-06-23 11:22:30 +03:00
hip_gpu_driver
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
hiprtc
use comgr to compile ( #3248 )
2024-01-26 18:27:49 -08:00
huggingface_onnx
onnx parser ( #10435 )
2025-06-09 12:44:28 -04:00
junk
coder.py can write and run code ( #2439 )
2023-11-25 12:27:54 -08:00
mmapeak
mmapeak implementation for 7900 XTX ( #10417 )
2025-05-23 16:26:12 -07:00
models
remove (some) kernelize from llama and test schedule speed ( #10939 )
2025-06-23 15:07:31 -07:00
nv_gpu_driver
start nvpci ( #10521 )
2025-06-25 00:37:34 +03:00
nvpci
start nvpci ( #10521 )
2025-06-25 00:37:34 +03:00
optimization
kernel.py no longer permutes reduce axis [pr] ( #10968 )
2025-06-26 17:44:58 -07:00
perfetto
move perfetto to extra ( #10994 )
2025-06-27 01:53:54 +03:00
qcom_gpu_driver
qcom match texture/sampler descriptors to OpenCL ( #7622 )
2024-11-11 21:56:51 +03:00
remu
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
resnet18
remove Tensor.no_grad, it's meaningless now [pr] ( #10556 )
2025-05-28 22:20:02 -07:00
sched
move fuzz_schedule.py to extra [pr] ( #10444 )
2025-05-21 10:07:24 +03:00
sqtt
use tuple in isinstance for type checking ( #9583 )
2025-03-26 19:36:48 +08:00
torch_backend
improve test_nll_loss ( #10986 )
2025-06-26 02:46:55 -04:00
torch_hook
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
usbgpu
usbgpu: check hash in patcher ( #10266 )
2025-05-12 21:08:53 +03:00
webgpu
Autogen webgpu dawn, removing wgpu-py dependency (f16 support part 1) ( #8646 )
2025-02-07 15:16:59 +08:00
archprobe.py
move dtypes to dtype.py ( #2964 )
2024-01-01 14:58:48 -08:00
augment.py
[ready] Replacing os with pathlib ( #1708 )
2023-08-30 10:41:08 -07:00
bench_log.py
hotfix: BenchEvent MLPERF_RUN is mlperf_run ( #10526 )
2025-05-26 20:19:37 -04:00
disk_read_speed.py
io_uring for copies from disk ( #5035 )
2024-06-21 11:36:51 +03:00
dump_cache.py
wow how did i think that was okay ( #2339 )
2023-11-16 21:21:11 -08:00
export_model.py
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
f16_decompress.py
u32 to f16 in tinygrad ( #8074 )
2024-12-06 12:00:13 +01:00
gradcheck.py
tests from grad uop path [pr] ( #8313 )
2024-12-18 09:25:05 -08:00
hip_events.py
move autogen to runtime/autogen ( #3254 )
2024-01-26 12:44:19 -08:00
hip_large_kernel.py
minimum change for rdna4 [pr] ( #9455 )
2025-03-16 13:39:24 +08:00
hook_cuda.py
cuda hooking ( #9180 )
2025-02-20 19:20:01 +08:00
introspection.py
move files into uop dir ( #10399 )
2025-05-18 11:38:28 -07:00
lr_scheduler.py
more beautiful cifar ( #10551 )
2025-05-28 20:48:20 -07:00
mcts_search.py
move kernel to opt ( #10899 )
2025-06-20 15:22:28 -07:00
multitensor.py
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
onnx.py
ONNX real float16 ( #10694 )
2025-06-26 14:05:12 -04:00
onnx_helpers.py
onnx parser ( #10435 )
2025-06-09 12:44:28 -04:00
onnx_parser.py
ONNX improve dtype fallback ( #10800 )
2025-06-21 19:29:45 -04:00
reduce_speed.py
VALIDATE_WITH_CPU [pr] ( #9488 )
2025-03-18 15:15:04 +08:00
replay_pkl.py
move kernel to opt ( #10899 )
2025-06-20 15:22:28 -07:00
ring_copy.py
ring copy example ( #3185 )
2024-01-19 23:34:30 -05:00
setup_mock_amd_osx.sh
add rocm 6.4 support ( #10491 )
2025-05-23 16:20:54 -07:00
setup_mock_nv_osx.sh
hotfix: setup_mock_nv_osx
2025-02-13 12:26:15 +08:00
thneed.py
new style device ( #2530 )
2023-11-30 17:07:16 -08:00
threefry.py
feat: make buffer ( #6745 )
2024-09-25 18:31:03 +08:00
to_movement_ops.py
fix: handle buffer size calculation in to_movement_ops and add scalar assignment test in torch_backend ( #10464 )
2025-05-22 10:54:13 -07:00
training.py
tinytqdm.set_description and tinytrange ( #5101 )
2024-06-22 14:45:06 -04:00
transfer_speed.py
hotfix: copy size is in bytes
2024-01-17 16:44:15 +00:00