Mirror of https://github.com/tinygrad/tinygrad.git, synced 2026-06-24 02:14:17 +00:00
* kfd driver wip
* cleanups
* kfd almost ready to ring doorbell
* ding dong?
* issues with signals
* something
* works
* ops kfd
* add amd_signal_t
* works...sometimes
* program runs
* _gpu_alloc cleanup
* cleanups
* work
* header + enable profiling (#3959)
* header + enable profiling
* just cleaner
* measure
* only local time domain
* remove old comments
* fix with master
* elf parsing (#3965)
* elf parsing
* fix kernels with private
* not used
* clean up
* clean up 2
* add flags
* kfd sdma (#3970)
* working sdma
* remove driver, shorter
* all commands we might need
* svm
* kfd remove hardcoded values (#4007)
* remove hardcoded values
* match above line
* 7k lines + revert hsa
* update that from origin
* fix sdma reg gen
* not the updated SDMA
* compiler_opts
* don't require kfd_ioctl
* get ioctls from python
* get ioctls from python
* remove build_sdma_command
* merge into 64-bit fields
* shorter
* fix property spelling and off by one

---------

Co-authored-by: nimlgen <138685161+nimlgen@users.noreply.github.com>

| Name |
|---|
| accel |
| assembly |
| backends |
| datasets |
| gemm |
| hip_gpu_driver |
| hiprtc |
| junk |
| models |
| nv_gpu_driver |
| optimization |
| qcom_gpu_driver |
| archprobe.py |
| augment.py |
| autopad.py |
| disk_read_speed.py |
| dump_cache.py |
| export_model.py |
| gradcheck.py |
| hip_events.py |
| introspection.py |
| lr_scheduler.py |
| multitensor.py |
| onnx.py |
| onnx_ops.py |
| ring_copy.py |
| thneed.py |
| to_movement_ops.py |
| training.py |
| transfer_speed.py |