| .. |
|
| Name | Last commit | Date |
| --- | --- | --- |
| amdpci | am_smi: increase kill retries (#11099) | 2025-07-05 16:23:50 +03:00 |
| assembly | rename DEFINE_ACC -> DEFINE_REG (#11006) | 2025-06-27 11:09:25 -07:00 |
| backends | insert_before -> insert_at [pr] (#11257) | 2025-07-15 17:44:34 -04:00 |
| datasets | very tiny generate_dataset (#11013) | 2025-06-27 17:10:45 -04:00 |
| disassemblers/adreno | qcom fix disasm (#6703) | 2024-09-24 15:23:43 +08:00 |
| dsp | dsp stuff / sniff ioctls from snpe (#9490) | 2025-03-20 10:38:23 +08:00 |
| gemm | remove Kernel.get_program [pr] (#11203) | 2025-07-12 20:50:29 -04:00 |
| hcqfuzz | Fix/hcqfuzz harnesss bug (#10923) | 2025-06-23 11:22:30 +03:00 |
| hip_gpu_driver | hip_ioctl: platform.machine added (#11084) | 2025-07-04 17:20:24 +03:00 |
| hiprtc | use comgr to compile (#3248) | 2024-01-26 18:27:49 -08:00 |
| huggingface_onnx | OnnxRunner file as input (#10789) | 2025-07-12 14:27:46 -04:00 |
| junk | coder.py can write and run code (#2439) | 2023-11-25 12:27:54 -08:00 |
| mmapeak | mmapeak implementation for 7900 XTX (#10417) | 2025-05-23 16:26:12 -07:00 |
| models | remove (some) kernelize from llama and test schedule speed (#10939) | 2025-06-23 15:07:31 -07:00 |
| nv_gpu_driver | nv: parse falcon app descs (#11118) | 2025-07-07 18:14:14 +03:00 |
| nvpci | nv_smi: init basic insmod/rmmod/reset cmds (#11282) | 2025-07-19 15:43:03 +03:00 |
| optimization | remove Kernel.get_program [pr] (#11203) | 2025-07-12 20:50:29 -04:00 |
| perfetto | move cpu_profile and shared ProfileEvents from device.py to helpers [pr] (#11126) | 2025-07-08 12:14:03 +03:00 |
| qcom_gpu_driver | qcom match texture/sampler descriptors to OpenCL (#7622) | 2024-11-11 21:56:51 +03:00 |
| remu | clean up AMD_LLVM in tests (#11021) | 2025-06-28 22:45:47 -04:00 |
| resnet18 | remove Tensor.no_grad, it's meaningless now [pr] (#10556) | 2025-05-28 22:20:02 -07:00 |
| sched | move fuzz_schedule.py to extra [pr] (#10444) | 2025-05-21 10:07:24 +03:00 |
| sqtt | fix typo in sqtt/readme (#11281) | 2025-07-19 15:10:24 +03:00 |
| torch_backend | Add amin support to Tensor operations in Torch backend (#11290) | 2025-07-21 09:14:08 -04:00 |
| torch_hook | rename lazydata to uop (#10698) | 2025-06-08 08:42:22 -07:00 |
| usbgpu | usbgpu: check hash in patcher (#10266) | 2025-05-12 21:08:53 +03:00 |
| webgpu | Autogen webgpu dawn, removing wgpu-py dependency (f16 support part 1) (#8646) | 2025-02-07 15:16:59 +08:00 |
| archprobe.py | move dtypes to dtype.py (#2964) | 2024-01-01 14:58:48 -08:00 |
| augment.py | [ready] Replacing os with pathlib (#1708) | 2023-08-30 10:41:08 -07:00 |
| bench_log.py | hotfix: BenchEvent MLPERF_RUN is mlperf_run (#10526) | 2025-05-26 20:19:37 -04:00 |
| disk_read_speed.py | io_uring for copies from disk (#5035) | 2024-06-21 11:36:51 +03:00 |
| dump_cache.py | wow how did i think that was okay (#2339) | 2023-11-16 21:21:11 -08:00 |
| export_model.py | hcq: move cpu to hcq (#11262) | 2025-07-21 15:10:38 +03:00 |
| f16_decompress.py | u32 to f16 in tinygrad (#8074) | 2024-12-06 12:00:13 +01:00 |
| gradcheck.py | tests from grad uop path [pr] (#8313) | 2024-12-18 09:25:05 -08:00 |
| hip_events.py | move autogen to runtime/autogen (#3254) | 2024-01-26 12:44:19 -08:00 |
| hip_large_kernel.py | minimum change for rdna4 [pr] (#9455) | 2025-03-16 13:39:24 +08:00 |
| hook_cuda.py | cuda hooking (#9180) | 2025-02-20 19:20:01 +08:00 |
| introspection.py | move files into uop dir (#10399) | 2025-05-18 11:38:28 -07:00 |
| lr_scheduler.py | more beautiful cifar (#10551) | 2025-05-28 20:48:20 -07:00 |
| mcts_search.py | remove Kernel.get_program [pr] (#11203) | 2025-07-12 20:50:29 -04:00 |
| multitensor.py | rename lazydata to uop (#10698) | 2025-06-08 08:42:22 -07:00 |
| onnx.py | Bump onnx to 1.18.0 (#11266) | 2025-07-17 15:35:41 -04:00 |
| onnx_helpers.py | OnnxRunner file as input (#10789) | 2025-07-12 14:27:46 -04:00 |
| onnx_parser.py | mypy onnx_parser (#11141) | 2025-07-08 19:50:28 -04:00 |
| reduce_speed.py | VALIDATE_WITH_CPU [pr] (#9488) | 2025-03-18 15:15:04 +08:00 |
| replay_pkl.py | remove Kernel.get_program [pr] (#11203) | 2025-07-12 20:50:29 -04:00 |
| ring_copy.py | ring copy example (#3185) | 2024-01-19 23:34:30 -05:00 |
| setup_mock_amd_osx.sh | add rocm 6.4 support (#10491) | 2025-05-23 16:20:54 -07:00 |
| setup_mock_nv_osx.sh | hotfix: setup_mock_nv_osx | 2025-02-13 12:26:15 +08:00 |
| thneed.py | new style device (#2530) | 2023-11-30 17:07:16 -08:00 |
| threefry.py | feat: make buffer (#6745) | 2024-09-25 18:31:03 +08:00 |
| to_movement_ops.py | fix: handle buffer size calculation in to_movement_ops and add scalar assignment test in torch_backend (#10464) | 2025-05-22 10:54:13 -07:00 |
| training.py | tinytqdm.set_description and tinytrange (#5101) | 2024-06-22 14:45:06 -04:00 |
| transfer_speed.py | hotfix: copy size is in bytes | 2024-01-17 16:44:15 +00:00 |