Keren Zhou
Keren Zhou
Home
Experience
Projects
Featured
Publications
Talks
Students
Tags
News
Light
Dark
Automatic
Systems
Mercury: Unlocking Multi-GPU Operator Optimization for LLMs via Remote Memory Scheduling
Remote memory scheduling framework that optimizes LLM operators across multi-GPU deployments.
Yue Guan
,
Xinwei Qiang
,
Zaifeng Pan
,
Daniels Johnson
,
Yuanwei Fang
,
Keren Zhou
,
Yuke Wang
,
Wanlu Li
,
Yufei Ding
,
Adnan Aziz
Cite
DOI
PDF
KPerfIR: Towards an Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads
An open, compiler-focused infrastructure for profiling and optimizing GPU kernels on AI workloads.
Yue Guan
,
Yuanwei Fang
,
Keren Zhou
,
Corbin Robeck
,
Manman Ren
,
Zhongkai Yu
,
Yufei Ding
,
Adnan Aziz
Cite
DOI
PDF
arXiv
Cite
×