About
He is a PhD candidate in Computer Science and Engineering at The Chinese University of Hong Kong, supervised by Associate Professor Ming-Chang Yang. His research focuses on hardware-software co-optimization of computer systems, specifically including large language model inference optimization and high-performance indexing. His research has been published in top-tier conferences such as OSDI, SOSP, and ICML.
Education
PhD in Computer Science and Engineering
The Chinese University of Hong Kong (CUHK): 2020 - 2026 (Expected)
Bachelor of Engineering in Computer Science and Technology
Huazhong University of Science and Technology (HUST): 2016 - 2020
Publications
TileSparse: Arithmetic-Intensity-Aware Sparse Attention for Compute-Bound LLM Decoding
SEPH: Scalable, Efficient, and Predictable Hashing on Persistent Memory
OSDI'23 (First OSDI paper from CUHK)
Prefill-Decode Aggregation or Disaggregation? Unifying Both for Goodput-Optimized LLM Serving
Under Review, FAIsys Workshop 2025, ArXiv, 公众号
THash: A High-Performance Tiered Hashing for CXL-Based Tiered Memory Systems
Under Review
SparseServe: Unlocking Parallelism for Dynamic Sparse Attention in Long-Context LLM Serving
CARINA: An Efficient CXL-Oriented Embedding Serving System for Recommendation Models
Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value Stores
SOSP'24 (Second SOSP paper from CUHK)
Academic Service
External Reviewer
OSDI'26, Eurosys'26, ICML'26, SoCC'26, HPCA'25, NSDI'25, FAST'24, ATC'24, SIGMOD'24, TOS'24, DAC'20-21, NAS'24, ASP-DAC'24, CODES'20~23, ICCAD'22, RTAS'21
Contact
Email: cwang@cse.cuhk.edu.hk
Department: Computer Science and Engineering
University: The Chinese University of Hong Kong