A tool for top-down performance analysis of GPU-accelerated applications

Publication
Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP’20)