1

Efficient and Scalable Synchronization via Generalized Cache Coherence
Efficient MoE Serving in the Memory-Bound Regime: Balance Activated Experts, Not Tokens
An Efficient Data Structure for Dynamic Graph on GPUs
MIND: In-Network Memory Management for Disaggregated Data Centers