Yanpeng Yu
Yanpeng Yu
Home
Publications
Contact
Light
Dark
Automatic
1
Efficient MoE Serving in the Memory-Bound Regime: Balance Activated Experts, Not Tokens
Yanpeng Yu\*
,
Haiyue Ma*
,
(*equal contribution)
,
Krish Agarwal
,
Nicolai Oswald
,
Qijing Huang
,
Hugo Linsenmaier
,
Chunhui Mei
,
Ritchie Zhao
,
Ritika Borkar
,
Bita Rouhani
,
David Nellans
,
Ronny Krashinsky
,
Anurag Khandelwal
PDF
CORD: Low-Latency, Bandwidth-Efficient and Scalable Release Consistency via Directory Ordering
Yanpeng Yu
,
Nicolai Oswald
,
Anurag Khandelwal
PDF
Code
GCS: Generalized cache coherence for efficient and scalable synchronization
Yanpeng Yu
,
Seung-seob Lee
,
Anurag Khandelwal
,
Lin Zhong
An Efficient Data Structure for Dynamic Graph on GPUs
Lei Zou
,
Fan Zhang
,
Yinnian Lin
,
Yanpeng Yu
PDF
MIND: In-Network Memory Management for Disaggregated Data Centers
Seung-seob Lee
,
Yanpeng Yu
,
Yupeng Tang
,
Anurag Khandelwal
,
Lin Zhong
,
Abhishek Bhattacharjee
PDF
Code
An example conference paper
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Duis posuere tellus ac convallis placerat. Proin tincidunt magna sed ex sollicitudin condimentum.
Yanpeng Yu
,
Robert Ford
PDF
Cite
Project
Slides
Cite
×