mirror of
https://github.com/tinygrad/tinygrad.git
synced 2026-06-24 02:14:17 +00:00
* Start from andredaprato:webgpu-clean
* Fix infs
* inf wgsl function is not needed
* Emulated ulong for threefry, more tests passing
* Randomness tests passing
* Update model export to support new changes in webgpu, efficientnet export works again
* Simplify shift emulation in wgsl
* Delete test file
* Fix bigger than u32 u32 literal
* Why was skip copies added here?
* Python3.12 for webgpu tests
* Fix model export syntax error
* Get test ops passing with some skips
* Fix lint
* Much simpler shift
* Run more tests
* Timestamp queries are not supported in CI, so skip search tests
* All fancy indexing passing
* r is ctx
* Run more dtype tests by using is_dtype_supported
* Cleanup ulong shift rendering
* UPat -> Pat, UOps -> Ops
* Pat -> UPat
* Refactor render_ushift if-else
* Pattern to avoid ulong mul
* Remove vals_dtype
* is_nan trick + rewrite, test_isnan passing
* Rewrite a * select(1, nan, gate) -> select(a, nan, gate)
* No arg, just op
* Support char, uchar, short, ushort
* Run test_index_mnis now that we have uint8
* Fix pyling
* Save 3 lines by using base Compiler
* No more long emulation
* Remove fixup_binops
* No more external_local_bufx wgsl specific cstyle modif, use base extra_pm
* Simpler, faster copyin/out
* Skip some new tests that use long
* Fix typo
* copyout touchup
* Save lines by using render_cast
* WebGL is not supported in core, delete it from is_dtype_supported
* More narrow test skips for some unary tests
* TernaryOps, UnaryOps -> Ops
* TinyGrad supports WebGPU
* StableDiffusion demo: f16tof32 gpu is a lib, update UI
* Packed load/store, no more scale_size, no core tinygrad changes
* Rename copyin, copyout
* Device -> dev
* Fix lint
* Pattern matcher rule for packed load/store
* Refactor
* Shorter packed load/store
* this should fix lint
* Fix mypy
* SD compile script working
* New SD webgpu UI
* New default prompt
* New SD weights
* Fix title when webgpu not available
* Run symbolic tests, simplify is_nan, use round_up
* Show step time on UI
* Bump minimum wgpu version to v0.19
* Fix latent

---------

Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>

| | | |
|---|---|---|
| conversation_data | ||
| llm.c | ||
| mlperf | ||
| openpilot | ||
| other_mnist | ||
| rl | ||
| sovits_helpers | ||
| tinychat | ||
| vgg7_helpers | ||
| webgl/yolov8 | ||
| webgpu/stable_diffusion | ||
| __init__.py | ||
| beautiful_cartpole.py | ||
| beautiful_cifar.py | ||
| beautiful_mnist.py | ||
| beautiful_mnist_multigpu.py | ||
| coder.py | ||
| compile_efficientnet.py | ||
| compile_tensorflow.py | ||
| conversation.py | ||
| efficientnet.py | ||
| flux1.py | ||
| flux1_seed0.png | ||
| gpt2.py | ||
| handcode_opt.py | ||
| hlb_cifar10.py | ||
| index.html | ||
| llama.py | ||
| llama3.py | ||
| mamba.py | ||
| mask_rcnn.py | ||
| mixtral.py | ||
| mnist_gan.py | ||
| openelm.py | ||
| sdv2.py | ||
| sdxl.py | ||
| sdxl_seed0.png | ||
| serious_mnist.py | ||
| simple_conv_bn.py | ||
| so_vits_svc.py | ||
| stable_diffusion.py | ||
| stable_diffusion_seed0.png | ||
| stunning_mnist.py | ||
| train_efficientnet.py | ||
| train_resnet.py | ||
| transformer.py | ||
| vgg7.py | ||
| vit.py | ||
| vits.py | ||
| whisper.py | ||
| yolov3.py | ||
| yolov8-onnx.py | ||
| yolov8.py | ||