tinygrad/extra
Francis Lata c3187087f7
QwQ-32B-Preview support (#7962)
* load weights with some debugging

* start running a prompt

* cleanup

* optionally permute layers and cleanup

* add validation for simple prompt

* small cleanup

* minor cleanup with formatting download links

* add a longer prompt

* add timing option

* some typings

* remove unused arg

* reset GlobalCounters

* minor cleanups
2024-12-04 21:46:37 -05:00
accel move things, clean up extra (#2292) 2023-11-13 20:18:40 -08:00
assembly s/UOps/Ops (#7500) 2024-11-03 11:26:10 +08:00
backends Remove WebGL (#8012) 2024-12-03 16:02:53 +01:00
datasets set PAGE_SIZE=1 and generate new dataset (#7559) 2024-11-05 11:25:01 -05:00
disassemblers/adreno qcom fix disasm (#6703) 2024-09-24 15:23:43 +08:00
dsp add qcom dsp runtime (#6112) 2024-09-13 21:01:33 +03:00
gemm BufferSpec and ProgramSpec [pr] (#7814) 2024-11-21 12:18:05 +08:00
hip_gpu_driver amd pkt3 refactor (#7923) 2024-11-28 11:06:37 +03:00
hiprtc use comgr to compile (#3248) 2024-01-26 18:27:49 -08:00
junk coder.py can write and run code (#2439) 2023-11-25 12:27:54 -08:00
mockgpu Hook memoryview via class instead of a function (#7627) 2024-11-11 09:07:06 +08:00
models QwQ-32B-Preview support (#7962) 2024-12-04 21:46:37 -05:00
nv_gpu_driver nv fix shared_memory_size (#7239) 2024-10-23 21:59:47 +03:00
optimization Remove UnaryOps, BinaryOps, TernaryOps, MetaOps [pr] (#7725) 2024-11-16 20:56:56 +08:00
qcom_gpu_driver qcom match texture/sampler descriptors to OpenCL (#7622) 2024-11-11 21:56:51 +03:00
resnet18 beat mlx at resnet 18 (#6611) 2024-09-20 11:28:01 +08:00
archprobe.py move dtypes to dtype.py (#2964) 2024-01-01 14:58:48 -08:00
augment.py [ready] Replacing os with pathlib (#1708) 2023-08-30 10:41:08 -07:00
disk_read_speed.py io_uring for copies from disk (#5035) 2024-06-21 11:36:51 +03:00
dump_cache.py wow how did i think that was okay (#2339) 2023-11-16 21:21:11 -08:00
export_model.py Remove WebGL (#8012) 2024-12-03 16:02:53 +01:00
f16_w_uint32.py fix various examples (#4691) 2024-05-22 20:43:21 -04:00
gradcheck.py Fix: Jacobian tests [WIP] (#1126) 2023-07-05 15:36:22 -07:00
hip_events.py move autogen to runtime/autogen (#3254) 2024-01-26 12:44:19 -08:00
introspection.py remove graph [pr] (#7085) 2024-10-16 11:40:07 +08:00
lr_scheduler.py use at least float32 for optim.lr (#4297) 2024-04-25 14:42:28 -04:00
mcts_search.py safe softmax trick in MCTS ucb_explored_children (#7515) 2024-11-03 15:59:31 -05:00
multitensor.py multitensor start (#2676) 2023-12-07 17:07:05 -08:00
onnx.py simple onnx_ops cleanups (#8003) 2024-12-04 15:33:03 -05:00
onnx_ops.py simple onnx_ops cleanups (#8003) 2024-12-04 15:33:03 -05:00
ring_copy.py ring copy example (#3185) 2024-01-19 23:34:30 -05:00
thneed.py new style device (#2530) 2023-11-30 17:07:16 -08:00
threefry.py feat: make buffer (#6745) 2024-09-25 18:31:03 +08:00
to_movement_ops.py s/UOps/Ops (#7500) 2024-11-03 11:26:10 +08:00
training.py tinytqdm.set_description and tinytrange (#5101) 2024-06-22 14:45:06 -04:00
transfer_speed.py hotfix: copy size is in bytes 2024-01-17 16:44:15 +00:00