tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-06-24 02:14:17 +00:00

qazal f9cfb64cd9 test asm_gemm in CI (#14551)	2026-02-05 13:32:22 +09:00
* test asm_gemm in CI
* default float16
* use a smaller shape for multi
* smaller size
* smaller for CI
* smaller for ci
* need half
amdpci	am: mi3xx perf (#14529)	2026-02-04 19:32:43 +03:00
assembly/amd	PYTHONREMU: VOP3P integer operations with constants don't cast to fp16 (#14546)	2026-02-04 20:10:59 -05:00
datasets	remove more stale stuff (#13765)	2025-12-19 17:14:56 -04:00
dsp	dsp stuff / sniff ioctls from snpe (#9490)	2025-03-20 10:38:23 +08:00
fp8	train bert with fp8 (#13874)	2026-01-09 09:21:59 -05:00
gemm	test asm_gemm in CI (#14551)	2026-02-05 13:32:22 +09:00
hcq	hcq_smi: kill mac pids (#14398)	2026-01-28 15:00:28 +03:00
hcqfuzz	feat: add repro command to summary (#10930)	2025-11-13 08:52:27 -08:00
hevc	hevc: decoder as iterator (#14091)	2026-01-10 14:57:56 +03:00
hip_gpu_driver	amd: alive wgps (#14149)	2026-01-23 00:08:45 +03:00
hiprtc	use comgr to compile (#3248)	2024-01-26 18:27:49 -08:00
huggingface_onnx	move frontend dir to nn [pr] (#12470)	2025-10-07 10:42:22 +08:00
mesa	In-tree autogen: all C libraries (#13220)	2025-11-13 18:57:44 -08:00
mmapeak	mfma loop in asm dsl (#14349)	2026-01-27 11:11:37 +09:00
models	feat: llama uses enable_gqa during training (#14545)	2026-02-04 16:22:31 -08:00
nv_gpu_driver	nv: pma for 5090 (#14420)	2026-01-29 20:06:01 +03:00
nv_pma	nv: add prof props to dev (#14437)	2026-01-30 12:51:43 +03:00
optimization	move more tests to test/null, split some existing ones (#14512)	2026-02-03 20:20:20 +08:00
perfetto	upd perfetto (#11528)	2025-08-06 14:00:34 +03:00
qcom_gpu_driver	working ioctls (#14272)	2026-01-21 20:29:04 +03:00
remu	simplify mi350x gemm / viz asm tests (#13984)	2026-01-03 11:11:07 +09:00
sqtt	sqtt: update examples after event field change (#14493)	2026-02-02 21:39:48 +09:00
thunder	fa: faster (#14453)	2026-02-02 21:34:17 -08:00
tinyfs	tinyfs tweaks (#13444)	2025-11-24 18:07:32 -08:00
torch_backend	remove allow_shape_mismatch in Tensor.replace (#14536)	2026-02-04 12:38:18 -05:00
torch_hook	rename lazydata to uop (#10698)	2025-06-08 08:42:22 -07:00
usbgpu	usbgpu: use BOT interface for `patch.py` (#13644)	2026-02-02 11:54:46 +08:00
viz	viz: profiler command line tool (#14515)	2026-02-03 19:51:25 +09:00
webgpu	Autogen webgpu dawn, removing wgpu-py dependency (f16 support part 1) (#8646)	2025-02-07 15:16:59 +08:00
archprobe.py	ops_gpu -> ops_cl (#12103)	2025-09-10 15:15:48 -04:00
bench_log.py	hotfix: BenchEvent MLPERF_RUN is mlperf_run (#10526)	2025-05-26 20:19:37 -04:00
cl_android.sh	source extra/cl_android.sh to fix opencl on android	2025-10-26 15:27:51 +08:00
export_model.py	no core_id (#14265)	2026-01-23 21:30:12 +03:00
f16_decompress.py	u32 to f16 in tinygrad (#8074)	2024-12-06 12:00:13 +01:00
gradcheck.py	tests from grad uop path [pr] (#8313)	2024-12-18 09:25:05 -08:00
hip_large_kernel.py	Buffer.as_buffer -> Buffer.as_memoryview [pr] (#14535)	2026-02-04 11:31:11 -05:00
hook_cuda.py	cuda hooking (#9180)	2025-02-20 19:20:01 +08:00
introspection.py	move files into uop dir (#10399)	2025-05-18 11:38:28 -07:00
lr_scheduler.py	more beautiful cifar (#10551)	2025-05-28 20:48:20 -07:00
multitensor.py	rename lazydata to uop (#10698)	2025-06-08 08:42:22 -07:00
nvJitLink.h	In-tree autogen: all C libraries (#13220)	2025-11-13 18:57:44 -08:00
onnx_helpers.py	onnx helper intermediate node output validation (#12740)	2025-10-16 11:17:47 -04:00
setup_mock_amd_osx.sh	add rocm 6.4 support (#10491)	2025-05-23 16:20:54 -07:00
setup_mock_nv_osx.sh	hotfix: setup_mock_nv_osx	2025-02-13 12:26:15 +08:00
test_mi350.sh	amd fp8 llvm (#13186)	2025-11-20 12:35:57 -05:00
thneed.py	ops_gpu -> ops_cl (#12103)	2025-09-10 15:15:48 -04:00
training.py	tinytqdm.set_description and tinytrange (#5101)	2024-06-22 14:45:06 -04:00
weekly_commits_table.py	add chrism	2025-12-14 00:45:57 -05:00