Publications

Publications in reverse chronological order.

2026

  1. ICML
    TileSparse: Arithmetic-Intensity-Aware Sparse Attention for Compute-Bound LLM Decoding
    Chao Wang, Pengfei Zuo, Zhangyu Chen, and 3 more authors
    In Proceedings of the 43rd International Conference on Machine Learning (ICML), 2026
  2. ICLR
    DualMap: Enabling Both Cache Affinity and Load Balancing for Distributed LLM Serving
    Ying Yuan, Pengfei Zuo, Bo Wang, and 3 more authors
    In Proceedings of the 14th International Conference on Learning Representations (ICLR), 2026

2025

  1. arXiv
    Serving Large Language Models on Huawei CloudMatrix384
    Huawei and SiliconFlow
    arXiv, 2025
  2. arXiv
    Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation
    Yunkai Liang, Zhangyu Chen, Pengfei Zuo, and 3 more authors
    arXiv, 2025
  3. USENIX ATC
    Understanding and Detecting Fail-Slow Hardware Failure Bugs in Cloud Systems
    Gen Dong, Yu Hua, Yongle Zhang, and 2 more authors
    In Proceedings of the USENIX Annual Technical Conference (USENIX ATC), 2025
  4. DATE
    MPFS: A Scalable User-Space Persistent Memory File System for Multiple Processes
    Bo Ding, Wei Tong, Yu Hua, and 6 more authors
    In Proceedings of the Design, Automation and Test in Europe Conference (DATE), 2025
  5. FAST
    GPHash: An Efficient Hash Index for GPU with Byte-Granularity Persistent Memory
    Menglei Chen, Yu Hua, Zhangyu Chen, and 2 more authors
    In Proceedings of the 23rd USENIX Conference on File and Storage Technologies (FAST), 2025

2024

  1. TC
    Enabling Reliable Memory-Mapped I/O With Auto-Snapshot for Persistent Memory Systems
    Bo Ding, Wei Tong, Yu Hua, and 3 more authors
    IEEE Transactions on Computers (TC), 2024

2023

  1. TOS
    A High-performance RDMA-oriented Learned Key-value Store for Disaggregated Memory Systems
    Pengfei Li, Yu Hua, Pengfei Zuo, and 2 more authors
    ACM Transactions on Storage (TOS), 2023
  2. TCAD
    APPcache+: An STT-MRAM-Based Approximate Cache System With Low Power and Long Lifetime
    Wei Zhao, Dan Feng, Wei Tong, and 4 more authors
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2023
  3. JCST
    Approximate Similarity-Aware Compression for Non-Volatile Main Memory
    Zhangyu Chen, Yu Hua, Pengfei Zuo, and 2 more authors
    Journal of Computer Science and Technology (JCST), 2023
    Accepted and to appear
  4. FAST
    ROLEX: A Scalable RDMA-oriented Learned Key-Value Store for Disaggregated Memory Systems
    Pengfei Li, Yu Hua, Pengfei Zuo, and 2 more authors
    In Proceedings of the 21st USENIX Conference on File and Storage Technologies (FAST), 2023
  5. TACO
    Lock-Free High-Performance Hashing for Persistent Memory via PM-Aware Holistic Optimization
    Zhangyu Chen, Yu Hua, Luochangqi Ding, and 3 more authors
    ACM Transactions on Architecture and Code Optimization (TACO), 2023

2022

  1. ICCD
    RMMIO: Enabling Reliable Memory-Mapped I/O for Persistent Memory Systems
    Bo Ding, Wei Tong, Yu Hua, and 3 more authors
    In Proceedings of the 40th IEEE International Conference on Computer Design (ICCD), 2022
  2. ASPLOS
    Efficiently Detecting Concurrency Bugs in Persistent Memory Programs
    Zhangyu Chen, Yu Hua, Yongle Zhang, and 1 more author
    In Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022

2021

  1. DATE
    Improving the Energy Efficiency of STT-MRAM Based Approximate Cache
    Wei Zhao, Wei Tong, Dan Feng, and 6 more authors
    In Proceedings of the 24th Design, Automation and Test in Europe Conference (DATE), 2021

2020

  1. USENIX ATC
    Lock-free Concurrent Level Hashing for Persistent Memory
    Zhangyu Chen, Yu Hua, Bo Ding, and 1 more author
    In Proceedings of the USENIX Annual Technical Conference (USENIX ATC), 2020
  2. DAC
    Reducing Bit Writes in Non-volatile Main Memory by Similarity-aware Compression
    Zhangyu Chen, Yu Hua, Pengfei Zuo, and 2 more authors
    In Proceedings of the 57th Design Automation Conference (DAC), 2020

2019

  1. USENIX ATC
    Mitigating Asymmetric Read and Write Costs in Cuckoo Hashing for Storage Systems
    Yuanyuan Sun, Yu Hua, Zhangyu Chen, and 1 more author
    In Proceedings of the USENIX Annual Technical Conference (USENIX ATC), 2019