Keren Zhou
Keren Zhou
Home
Experience
Projects
Featured
Publications
Talks
Students
Tags
News
Light
Dark
Automatic
Performance Analysis
DeepContext: A Context-aware, Cross-platform, and Cross-framework Tool for Performance Profiling and Analysis of Deep Learning Workloads
Performance profiling toolkit that unifies deep learning workload analysis across platforms and frameworks.
Qidong Zhao
,
Hao Wu
,
Yuming Hao
,
Zilingfeng Ye
,
Jiajia Li
,
Xu Liu
,
Keren Zhou
Cite
DOI
arXiv
GPA: A GPU Performance Advisor Based on Instruction Sampling
Developing efficient GPU kernels can be difficult because of the complexity of GPU architectures and programming models. Existing …
Keren Zhou
,
Xiaozhu Meng
,
Ryuichi Sai
,
John Mellor-Crummey
Cite
Project
DOI
URL
Measurement and Analysis of GPU-accelerated Applications with HPCToolkit
To address the challenge of performance analysis on the US DOE’s forthcoming exascale supercomputers, Rice University has been …
Keren Zhou
,
Laksono Adhianto
,
Jonathon Anderson
,
Aaron Cherian
,
Dejan Grubisic
,
Mark Krentel
,
Yumeng Liu
,
Xiaozhu Meng
,
John Mellor-Crummey
Cite
Project
DOI
URL
Cite
×