mirror of https://github.com/tinygrad/tinygrad.git, synced 2026-06-24 02:14:17 +00:00
* connect to gpu
* rlc init?
* gfx comp start init
* early init is hardcoded, some progress with fw
* gart
* progress, next mqd
* ring setup, still does not execute anything
* ugh write correct reg
* pci2: vm
* pci2: start psp
* vm seems to work
* pci2: gfx start
* pci2: fix psp ring resp
* pci2: try ring
* pci2: mes and some fixes
* pci2: some progress
* pci2: progress
* pci2: mm
* pci2: discovery
* pci2: correct apertures
* pci2: b
* pci2: i
* pci2: l
* pci2: o
* pci2: cmu
* pci2: mes_kiq works
* pci2: mes
* pci2: kcq does not work(
* pci2: unhalt gfx
* ops_am
* minor
* check if amdgpu is there, or we will crash
* bring back graph, it just works
* less prints
* do not init mes (not used)
* remove unused files
* ops_am: start move into core
* ops_am: works
* clcks, but still slower
* faster + no mes_kiq
* vm frags + remove mes
* cleanup fw
* gmc tiny cleanup
* move to ops_amd
* comment out what we dont really need
* driverless
* close in speed
* am clean most of ips
* gmc to ips
* cleaner
* new vm walker
* comment old one
* remove unused autogens
* last write ups
* remove psp hardcoded values
* more
* add logs
* ih
* p2p and sdma
* vfio hal and interrupts
* smth
* amd dev iface
* minor after rebase
* bind for sdma
* Revert "bind for sdma"
This reverts commit
| | | |
|---|---|---|
| mlperf_bert | ||
| mlperf_resnet | ||
| mlperf_unet3d | ||
| openpilot | ||
| process_replay | ||
| external_benchmark_hcopt.py | ||
| external_benchmark_hip_compile.py | ||
| external_benchmark_load_stable_diffusion.py | ||
| external_benchmark_multitensor_allreduce.py | ||
| external_benchmark_openpilot.py | ||
| external_benchmark_resnet.py | ||
| external_benchmark_schedule.py | ||
| external_cl_half_max.py | ||
| external_debug_metal_sd_conv.py | ||
| external_fuzz_ampt.py | ||
| external_fuzz_tlsf.py | ||
| external_gpu_fail_osx.py | ||
| external_hip_compiler_bug.py | ||
| external_jit_failure.py | ||
| external_llama_eval.py | ||
| external_metal_compile_fail.py | ||
| external_model_benchmark.py | ||
| external_multi_gpu.py | ||
| external_osx_profiling.py | ||
| external_test_am.py | ||
| external_test_amd.py | ||
| external_test_datasets.py | ||
| external_test_embedding.py | ||
| external_test_example.py | ||
| external_test_hcq.py | ||
| external_test_hcq_fuzz_failures.py | ||
| external_test_hip_compile.py | ||
| external_test_hsa_driver.py | ||
| external_test_image.py | ||
| external_test_jit_on_models.py | ||
| external_test_llama3_ff.py | ||
| external_test_lm_head.py | ||
| external_test_losses.py | ||
| external_test_mamba.py | ||
| external_test_metrics.py | ||
| external_test_mnist_data_select.py | ||
| external_test_nv.py | ||
| external_test_onnx_backend.py | ||
| external_test_opt.py | ||
| external_test_optim.py | ||
| external_test_speed_llama.py | ||
| external_test_speed_theoretical.py | ||
| external_test_tlsf.py | ||
| external_test_train_gpt2.py | ||
| external_test_valid_remove.py | ||
| external_test_whisper_librispeech.py | ||
| external_test_yolo.py | ||
| external_test_yolov8.py | ||
| fuzz_graph.py | ||
| fuzz_kfd.py | ||
| fuzz_linearizer.py | ||
| fuzz_schedule.py | ||
| fuzz_shapetracker.py | ||
| fuzz_shapetracker_math.py | ||
| fuzz_symbolic.py | ||
| fuzz_uops.py | ||
| graph_batchnorm.py | ||
| speed_beam_v_hcopt.py | ||
| speed_compare_cuda_nv.py | ||
| speed_compare_cuda_ptx.py | ||
| speed_v_theoretical.py | ||
| verify_kernel.py | ||