Performance Tool

Optimizing GPU-accelerated Applications with HPCToolkit

Presented our GPU performance tool

HPCToolkit

Our tool provides a profile view and a trace view for GPU-accelerated applications. The profile view identifies the CPU calling contexts in which GPU APIs are invoked, approximates calling contexts for GPU execution, and analyzes the instruction mix of GPU kernels. The trace view captures CPU and GPU activities for a large number of processes and threads with minimal overhead.

A Tool for Performance Analysis of GPU-accelerated Applications

Presented the prototype of our GPU performance tool

A performance analysis framework for exploiting GPU microarchitectural capability