Keren Zhou
Keren Zhou
Home
Experience
Projects
Featured
Publications
Talks
Students
Tags
News
Light
Dark
Automatic
Distributed Systems
Mercury: Unlocking Multi-GPU Operator Optimization for LLMs via Remote Memory Scheduling
Remote memory scheduling framework that optimizes LLM operators across multi-GPU deployments.
Yue Guan
,
Xinwei Qiang
,
Zaifeng Pan
,
Daniels Johnson
,
Yuanwei Fang
,
Keren Zhou
,
Yuke Wang
,
Wanlu Li
,
Yufei Ding
,
Adnan Aziz
Cite
DOI
PDF
Cite
×