Mirror of https://github.com/tinygrad/tinygrad.git, synced 2026-06-24 02:14:17 +00:00
* Start from andredaprato:webgpu-clean
* Fix infs
* inf wgsl function is not needed
* Emulated ulong for threefry, more tests passing
* Randomness tests passing
* Update model export to support new changes in webgpu, efficientnet export works again
* Simplify shift emulation in wgsl
* Delete test file
* Fix bigger than u32 u32 literal
* Why was skip copies added here?
* Python3.12 for webgpu tests
* Fix model export syntax error
* Get test ops passing with some skips
* Fix lint
* Much simpler shift
* Run more tests
* Timestamp queries are not supported in CI, so skip search tests
* All fancy indexing passing
* r is ctx
* Run more dtype tests by using is_dtype_supported
* Cleanup ulong shift rendering
* UPat -> Pat, UOps -> Ops
* Pat -> UPat
* Refactor render_ushift if-else
* Pattern to avoid ulong mul
* Remove vals_dtype
* is_nan trick + rewrite, test_isnan passing
* Rewrite `a * select(1, nan, gate) -> select(a, nan, gate)`
* No arg, just op
* Support char, uchar, short, ushort
* Run test_index_mnis now that we have uint8
* Fix pyling
* Save 3 lines by using base Compiler
* No more long emulation
* Remove fixup_binops
* No more external_local_bufx wgsl specific cstyle modif, use base extra_pm
* Simpler, faster copyin/out
* Skip some new tests that use long
* Fix typo
* copyout touchup
* Save lines by using render_cast
* WebGL is not supported in core, delete it from is_dtype_supported
* More narrow test skips for some unary tests
* TernaryOps, UnaryOps -> Ops
* TinyGrad supports WebGPU
* StableDiffusion demo: f16tof32 gpu is a lib, update UI
* Packed load/store, no more scale_size, no core tinygrad changes
* Rename copyin, copyout
* Device -> dev
* Fix lint
* Pattern matcher rule for packed load/store
* Refactor
* Shorter packed load/store
* this should fix lint
* Fix mypy
* SD compile script working
* New SD webgpu UI
* New default prompt
* New SD weights
* Fix title when webgpu not available
* Run symbolic tests, simplify is_nan, use round_up
* Show step time on UI
* Bump minimum wgpu version to v0.19
* Fix latent

Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>

| Name | | |
|---|---|---|
| external | ||
| imported | ||
| models | ||
| testextra | ||
| unit | ||
| web | ||
| __init__.py | ||
| Dockerfile | ||
| helpers.py | ||
| test_arange.py | ||
| test_assign.py | ||
| test_compile_failures.py | ||
| test_const_folding.py | ||
| test_conv.py | ||
| test_conv_shapetracker.py | ||
| test_copy_speed.py | ||
| test_device_speed.py | ||
| test_dtype.py | ||
| test_dtype_alu.py | ||
| test_fusion_op.py | ||
| test_fuzz_shape_ops.py | ||
| test_gc.py | ||
| test_graph.py | ||
| test_hcq.py | ||
| test_image_dtype.py | ||
| test_jit.py | ||
| test_kernel_cache.py | ||
| test_lazybuffer.py | ||
| test_linearizer.py | ||
| test_linearizer_dumb.py | ||
| test_linearizer_failures.py | ||
| test_linearizer_overflows.py | ||
| test_masked_st.py | ||
| test_metal.py | ||
| test_method_cache.py | ||
| test_multitensor.py | ||
| test_net_speed.py | ||
| test_nn.py | ||
| test_ocl.py | ||
| test_ops.py | ||
| test_optim.py | ||
| test_pickle.py | ||
| test_profiler.py | ||
| test_randomness.py | ||
| test_rearrange_einops.py | ||
| test_renderer_failures.py | ||
| test_sample.py | ||
| test_schedule.py | ||
| test_search.py | ||
| test_setitem.py | ||
| test_specific_conv.py | ||
| test_speed_v_torch.py | ||
| test_subbuffer.py | ||
| test_symbolic_jit.py | ||
| test_symbolic_ops.py | ||
| test_symbolic_shapetracker.py | ||
| test_tensor.py | ||
| test_tensor_data.py | ||
| test_tensor_variable.py | ||
| test_tiny.py | ||
| test_to_numpy.py | ||
| test_transcendental.py | ||
| test_uop_graph.py | ||
| test_uops.py | ||
| test_uops_stats.py | ||
| test_viz.py | ||
| test_winograd.py | ||
| test_zero_copy.py | ||