TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels
-
Updated
Jun 26, 2026 - Python
TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels
An end-to-end agent project for GPU kernel implementation, analysis, profiling, and iterative optimization. It helps an agent turn PyTorch logic or an existing kernel into a high-performance GPU kernel through a structured, profile-driven workflow.
Production-grade HIP kernel optimization lab — matrix ops, reductions, shared memory patterns
Add a description, image, and links to the gpu-kernel topic page so that developers can more easily learn about it.
To associate your repository with the gpu-kernel topic, visit your repo's landing page and select "manage topics."