publications
2024
-
PUZZLE: Efficiently Aligning Large Language Models through Light-Weight Context SwitchIn 2024 USENIX Annual Technical Conference (USENIX ATC 24), 2024
2023
-
GraphSet: High Performance Graph Mining through Equivalent Set TransformationsIn Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2023, Denver, CO, USA, November 12-17, 2023, 2023
-
SmartMoE: Efficiently Training Sparsely-Activated Models through Combining Offline and Online ParallelizationIn 2023 USENIX Annual Technical Conference (USENIX ATC 23), 2023
2022
-
Critique of “A parallel framework for constraint-based Bayesian network learning via Markov blanket discovery” by SCC team from Tsinghua UniversityIEEE Transactions on Parallel and Distributed Systems, 2022
2021
-
Critique of “MemXCT: memory-centric X-ray CT reconstruction with massive parallelization” by SCC Team from Tsinghua UniversityIEEE Transactions on Parallel and Distributed Systems, 2021
2020
-
GraphPi: high performance graph pattern matching through effective redundancy eliminationIn Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2020, Virtual Event / Atlanta, Georgia, USA, November 9-19, 2020, 2020