Deep Learning

ValueExpert: exploring value patterns in GPU-accelerated applications

General-purpose GPUs have become common in modern computing systems to accelerate applications in many domains, including machine learning, high-performance computing, and autonomous driving. However, inefficiencies abound in GPU-accelerated …

A Performance Analysis Framework for Exploiting GPU Microarchitectural Capability

Presented our ICS'17 work.

Deep Learning on Modern Architectures

Discussed how state-of-the-art deep learning libraries optimize computations by utilizing architectural features.

Understanding the GPU microarchitecture to achieve bare-metal performance tuning

Convolution Methods

Introduced several convolution methods and analyzed their computational complexity, memory consumption, and data access patterns.
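As a rough illustration of the kind of trade-off such a comparison covers, the sketch below contrasts two common approaches: a direct sliding-window convolution and an im2col-style method that unrolls patches into rows before taking dot products. It is not taken from the talk. The function names (`conv2d_direct`, `im2col`, `conv2d_im2col`) and the toy inputs are illustrative assumptions, and "convolution" here means cross-correlation with no padding and stride 1, as is common in deep learning frameworks.

```python
def conv2d_direct(image, kernel):
    """Direct sliding-window convolution: no extra buffer, strided input reads."""
    H, W = len(image), len(image[0])
    R, S = len(kernel), len(kernel[0])
    out_h, out_w = H - R + 1, W - S + 1
    out = [[0] * out_w for _ in range(out_h)]
    for i in range(out_h):
        for j in range(out_w):
            acc = 0
            for r in range(R):
                for s in range(S):
                    acc += image[i + r][j + s] * kernel[r][s]
            out[i][j] = acc
    return out


def im2col(image, R, S):
    """Unroll every R x S patch into one row (out_h * out_w rows of R * S values)."""
    H, W = len(image), len(image[0])
    return [
        [image[i + r][j + s] for r in range(R) for s in range(S)]
        for i in range(H - R + 1)
        for j in range(W - S + 1)
    ]


def conv2d_im2col(image, kernel):
    """Convolution as a matrix-vector product over the unrolled patch matrix."""
    R, S = len(kernel), len(kernel[0])
    out_w = len(image[0]) - S + 1
    flat_kernel = [v for row in kernel for v in row]
    patches = im2col(image, R, S)
    flat = [sum(a * b for a, b in zip(p, flat_kernel)) for p in patches]
    # Reshape the flat result back into out_h rows of out_w values.
    return [flat[k:k + out_w] for k in range(0, len(flat), out_w)]


# Hypothetical toy inputs, chosen only to make the comparison concrete.
image = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
kernel = [[1, 0], [0, -1]]

direct = conv2d_direct(image, kernel)
unrolled = conv2d_im2col(image, kernel)
patch_matrix = im2col(image, 2, 2)
input_values = sum(len(row) for row in image)
buffer_values = sum(len(row) for row in patch_matrix)
print(direct == unrolled, input_values, buffer_values)
```

Both functions perform the same number of multiply-accumulates, but the im2col variant materializes a patch matrix that is larger than the input (36 values versus 16 in this toy case) in exchange for contiguous, GEMM-friendly accesses.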