| .. |
|
accel
|
move things, clean up extra (#2292)
|
2023-11-13 20:18:40 -08:00 |
|
amdpci
|
am: add am_smi (#8739)
|
2025-01-24 20:16:19 +03:00 |
|
assembly
|
s/UOps/Ops (#7500)
|
2024-11-03 11:26:10 +08:00 |
|
backends
|
bring back the DSP runtime
|
2024-12-31 12:01:42 -05:00 |
|
datasets
|
improve isin checks (#8589)
|
2025-01-13 12:12:31 -05:00 |
|
disassemblers/adreno
|
qcom fix disasm (#6703)
|
2024-09-24 15:23:43 +08:00 |
|
dsp
|
mypy for mockgpu/cuda & dsp/run (#8575)
|
2025-01-12 18:25:39 +03:00 |
|
gemm
|
fix for int plus minor cleanup (#8650)
|
2025-01-17 22:30:39 -05:00 |
|
hip_gpu_driver
|
create_schedule([x.lazydata]) -> x.schedule() in tests (#8449)
|
2024-12-31 03:15:52 +08:00 |
|
hiprtc
|
use comgr to compile (#3248)
|
2024-01-26 18:27:49 -08:00 |
|
junk
|
coder.py can write and run code (#2439)
|
2023-11-25 12:27:54 -08:00 |
|
models
|
simpler bert acc [pr] (#8714)
|
2025-01-22 10:32:19 -05:00 |
|
nv_gpu_driver
|
nv fix shared_memory_size (#7239)
|
2024-10-23 21:59:47 +03:00 |
|
optimization
|
use CAPTURE_PROCESS_REPLAY=1 in CI [pr] (#8564)
|
2025-01-11 06:03:48 -05:00 |
|
qcom_gpu_driver
|
qcom match texture/sampler descriptors to OpenCL (#7622)
|
2024-11-11 21:56:51 +03:00 |
|
resnet18
|
beat mlx at resnet 18 (#6611)
|
2024-09-20 11:28:01 +08:00 |
|
archprobe.py
|
move dtypes to dtype.py (#2964)
|
2024-01-01 14:58:48 -08:00 |
|
augment.py
|
[ready] Replacing os with pathlib (#1708)
|
2023-08-30 10:41:08 -07:00 |
|
disk_read_speed.py
|
io_uring for copies from disk (#5035)
|
2024-06-21 11:36:51 +03:00 |
|
dump_cache.py
|
wow how did i think that was okay (#2339)
|
2023-11-16 21:21:11 -08:00 |
|
export_model.py
|
encapsulate the exported webgpu model (#8203)
|
2024-12-13 10:55:37 +01:00 |
|
f16_decompress.py
|
u32 to f16 in tinygrad (#8074)
|
2024-12-06 12:00:13 +01:00 |
|
gradcheck.py
|
tests from grad uop path [pr] (#8313)
|
2024-12-18 09:25:05 -08:00 |
|
hip_events.py
|
move autogen to runtime/autogen (#3254)
|
2024-01-26 12:44:19 -08:00 |
|
introspection.py
|
rename LazyBuffer -> UOp [pr] (#8169)
|
2024-12-11 16:15:52 -08:00 |
|
lr_scheduler.py
|
use at least float32 for optim.lr (#4297)
|
2024-04-25 14:42:28 -04:00 |
|
mcts_search.py
|
safe softmax trick in MCTS ucb_explored_children (#7515)
|
2024-11-03 15:59:31 -05:00 |
|
multitensor.py
|
multitensor start (#2676)
|
2023-12-07 17:07:05 -08:00 |
|
onnx.py
|
make onnx runner a class (#8647)
|
2025-01-20 10:11:05 -08:00 |
|
onnx_ops.py
|
reorder and categorize onnx_ops (#8731)
|
2025-01-23 13:18:54 -05:00 |
|
ring_copy.py
|
ring copy example (#3185)
|
2024-01-19 23:34:30 -05:00 |
|
setup_mock_amd_osx.sh
|
add script to install amd mockgpu on macOS (#8536)
|
2025-01-09 01:29:25 +03:00 |
|
thneed.py
|
new style device (#2530)
|
2023-11-30 17:07:16 -08:00 |
|
threefry.py
|
feat: make buffer (#6745)
|
2024-09-25 18:31:03 +08:00 |
|
to_movement_ops.py
|
s/UOps/Ops (#7500)
|
2024-11-03 11:26:10 +08:00 |
|
training.py
|
tinytqdm.set_description and tinytrange (#5101)
|
2024-06-22 14:45:06 -04:00 |
|
transfer_speed.py
|
hotfix: copy size is in bytes
|
2024-01-17 16:44:15 +00:00 |