..
accel
move things, clean up extra ( #2292 )
2023-11-13 20:18:40 -08:00
assembly
s/UOps/Ops ( #7500 )
2024-11-03 11:26:10 +08:00
backends
s/UOps/Ops ( #7500 )
2024-11-03 11:26:10 +08:00
datasets
set PAGE_SIZE=1 and generate new dataset ( #7559 )
2024-11-05 11:25:01 -05:00
disassemblers /adreno
qcom fix disasm ( #6703 )
2024-09-24 15:23:43 +08:00
dsp
add qcom dsp runtime ( #6112 )
2024-09-13 21:01:33 +03:00
gemm
remove graph [pr] ( #7085 )
2024-10-16 11:40:07 +08:00
hip_gpu_driver
feat: autogen from kernel register offset headers ( #6056 )
2024-08-12 14:08:35 -07:00
hiprtc
use comgr to compile ( #3248 )
2024-01-26 18:27:49 -08:00
junk
coder.py can write and run code ( #2439 )
2023-11-25 12:27:54 -08:00
mockgpu
Hook memoryview via class instead of a function ( #7627 )
2024-11-11 09:07:06 +08:00
models
combine pad2d with pad ( #7677 )
2024-11-14 17:56:02 +08:00
nv_gpu_driver
nv fix shared_memory_size ( #7239 )
2024-10-23 21:59:47 +03:00
optimization
Remove UnaryOps, BinaryOps, TernaryOps, MetaOps [pr] ( #7725 )
2024-11-16 20:56:56 +08:00
qcom_gpu_driver
qcom match texture/sampler descriptors to OpenCL ( #7622 )
2024-11-11 21:56:51 +03:00
resnet18
beat mlx at resnet 18 ( #6611 )
2024-09-20 11:28:01 +08:00
archprobe.py
move dtypes to dtype.py ( #2964 )
2024-01-01 14:58:48 -08:00
augment.py
[ready] Replacing os with pathlib ( #1708 )
2023-08-30 10:41:08 -07:00
debug_sd_speed.py
example script to show BasicTransformerBlock speed regression ( #7724 )
2024-11-15 15:48:25 -05:00
disk_read_speed.py
io_uring for copies from disk ( #5035 )
2024-06-21 11:36:51 +03:00
dump_cache.py
wow how did i think that was okay ( #2339 )
2023-11-16 21:21:11 -08:00
export_model.py
all realize 2 ( #4527 )
2024-05-10 22:43:09 -07:00
f16_w_uint32.py
fix various examples ( #4691 )
2024-05-22 20:43:21 -04:00
gradcheck.py
Fix: Jacobian tests [WIP] ( #1126 )
2023-07-05 15:36:22 -07:00
hip_events.py
move autogen to runtime/autogen ( #3254 )
2024-01-26 12:44:19 -08:00
introspection.py
remove graph [pr] ( #7085 )
2024-10-16 11:40:07 +08:00
lr_scheduler.py
use at least float32 for optim.lr ( #4297 )
2024-04-25 14:42:28 -04:00
mcts_search.py
safe softmax trick in MCTS ucb_explored_children ( #7515 )
2024-11-03 15:59:31 -05:00
multitensor.py
multitensor start ( #2676 )
2023-12-07 17:07:05 -08:00
onnx.py
remove copied is_dtype_supported from onnx [pr] ( #7646 )
2024-11-11 19:20:32 -05:00
onnx_ops.py
add Tensor.meshgrid ( #7714 )
2024-11-16 23:06:47 -05:00
ring_copy.py
ring copy example ( #3185 )
2024-01-19 23:34:30 -05:00
thneed.py
new style device ( #2530 )
2023-11-30 17:07:16 -08:00
threefry.py
feat: make buffer ( #6745 )
2024-09-25 18:31:03 +08:00
to_movement_ops.py
s/UOps/Ops ( #7500 )
2024-11-03 11:26:10 +08:00
training.py
tinytqdm.set_description and tinytrange ( #5101 )
2024-06-22 14:45:06 -04:00
transfer_speed.py
hotfix: copy size is in bytes
2024-01-17 16:44:15 +00:00