KPerfIR: Towards an Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads

Abstract

KPerfIR proposes a compiler-centric ecosystem that unifies GPU kernel introspection, performance analysis, and optimization pipelines for state-of-the-art AI workloads.

Publication
Proceedings of the 20th USENIX Symposium on Operating Systems Design and Implementation (OSDI)