tinygrad/extra/sqtt
qazal 12f4cf0e49
rename amd/test_custom_kernel.py to test_asm_kernel (#16586)
* rename amd/test_custom_kernel.py to test_asm_kernel

* update
2026-06-12 16:11:01 +09:00
..
examples rename amd/test_custom_kernel.py to test_asm_kernel (#16586) 2026-06-12 16:11:01 +09:00
install_rocprof_decoder.py sqtt: match amd's TS_DELTA_SHORT offset (#15901) 2026-04-24 06:41:22 +03:00
README.md sqtt: remove old files (#15108) 2026-03-03 22:43:24 +09:00
rgptool.py diff devices for sdma (#14589) 2026-02-06 16:39:12 +03:00
roc.py viz/sqtt: move amd decoder to extra, don't import from ops_amd (#15969) 2026-04-30 00:49:15 +09:00
sqtt.h amd: sqtt on gfx12 (#12564) 2025-10-10 17:54:14 +08:00

SQTT Profiling

Getting SQ Thread Trace

VIZ=2 to enable SQTT profiling.

SQTT_ITRACE_SE_MASK=X to select shader engines for instruction tracing, -1 = all, 0 = disabled, >0 = SE bitmask, default 0b11.

SQTT_BUFFER_SIZE=X to change size of SQTT buffer (per shader engine, 6 SEs on 7900xtx) in megabytes, default 256.

Viewing the traces

  • Web UI: tinygrad/viz/serve.py
  • Command line: python -m tinygrad.renderer.amd.sqtt