[1]. Changlong Li, Yu Liang, Liang Shi, Chao Wang, Chun Jason Xue, Xuehai Zhou: Flexible and Efficient Memory Swapping Across Mobile Devices With LegoSwap. IEEE Trans. Parallel Distributed Syst. 35(1): 140-153 (2024)
[2]. Quan Li, Xike Xie, Chao Wang, S. Kevin Zhou: Prompt Learning with Extended Kalman Filter for Pre-trained Language Models. IJCAI 2024: 4452-4460
[3]. Junyuan Guo, Hao Tang, Teng Wang, Chao Wang: R4D-planes: Remapping Planes For Novel View Synthesis and Self-Supervised Decoupling of Monocular Videos. ACM Multimedia 2024: 6569-6577
[4]. Wenqi Lou, Lei Gong, Chao Wang, Jiaming Qian, Xuan Wang, Changlong Li, Xuehai Zhou: Unleashing Network/Accelerator Co-Exploration Potential on FPGAs: A Deeper Joint Search. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 43(10): 3041-3054 (2024)
[5]. Yingxue Gao, Teng Wang, Lei Gong, Chao Wang, Yiqing Hu, Yi Yang, Zhongming Liu, Xi Li, Xuehai Zhou: Enhancing Graph Random Walk Acceleration via Efficient Dataflow and Hybrid Memory Architecture. IEEE Trans. Computers 73(3): 887-901 (2024)
[6]. Enshuai Zhou, Yifan Hao, Rui Zhang, Yuxuan Guo, Zidong Du, Xishan Zhang, Xinkai Song, Chao Wang, Xuehai Zhou, Jiaming Guo, Qi Yi, Shaohui Peng, Di Huang, Ruizhi Chen, Qi Guo, Yunji Chen: Emergent Communication for Numerical Concepts Generalization. AAAI 2024: 17609-17617
[7]. Lei Gong, Chao Wang, Haojun Xia, Xianglan Chen, Xi Li, Xuehai Zhou: Enabling Fast and Memory-Efficient Acceleration for Pattern Matching Workloads: The Lightweight Automata Processing Engine. IEEE Trans. Computers 72(4): 1011-1025 (2023)
[8]. Yingxue Gao, Lei Gong, Chao Wang, Teng Wang, Xi Li, Xuehai Zhou: Algorithm/Hardware Co-Optimization for Sparsity-Aware SpMM Acceleration of GNNs. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 42(12): 4763-4776 (2023)
[9]. Wenqi Lou, Lei Gong, Chao Wang, Zidong Du, Xuehai Zhou: OctCNN: A High Throughput FPGA Accelerator for CNNs Using Octave Convolution Algorithm. IEEE Trans. Computers 71(8): 1847-1859 (2022)
[10]. Yuanbo Wen, Qi Guo, Zidong Du, Jianxing Xu, Zhenxing Zhang, Xing Hu, Wei Li, Rui Zhang, Chao Wang, Xuehai Zhou, Tianshi Chen: Enabling One-Size-Fits-All Compilation Optimization for Inference Across Machine Learning Computers. IEEE Trans. Computers 71(9): 2313-2326 (2022)
[11]. Teng Wang, Lei Gong, Chao Wang, Yang Yang, Yingxue Gao, Xuehai Zhou, Huaping Chen: ViA: A Novel Vision-Transformer Accelerator Based on FPGA. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 41(11): 4088-4099 (2022)
[12]. Yuanbo Wen, Qi Guo, Qiang Fu, Xiaqing Li, Jianxing Xu, Yanlin Tang, Yongwei Zhao, Xing Hu, Zidong Du, Ling Li, Chao Wang, Xuehai Zhou, Yunji Chen: BabelTower: Learning to Auto-parallelized Program Translation. ICML 2022: 23685-23700
[13]. Chao Wang, Lei Gong, Fahui Jia, Xuehai Zhou: An FPGA Based Accelerator for Clustering Algorithms With Custom Instructions. IEEE Trans. Computers 70(5): 725-732 (2021)
[14]. Chao Wang, Lihui Jin, Lei Gong, Chongchong Xu, Yahui Hu, Luchao Tan, Xuehai Zhou: Tinker: A Middleware for Deploying Multiple NN-Based Applications on a Single Machine. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 40(7): 14e95-1499 (2021)
[15]. Lei Gong, Chao Wang, Xi Li, Xuehai Zhou: Improving HW/SW Adaptability for Accelerating CNNs on FPGAs Through A Dynamic/Static Co-Reconfiguration Approach. IEEE Trans. Parallel Distributed Syst. 32(7): 1854-1865 (2021)
[16]. Chao Wang, Lei Gong, Xi Li, Qi Yu, Aili Wang, Patrick Hung, Xuehai Zhou: SOLAR: Services-Oriented Deep Learning Architectures-Deep Learning as a Service. IEEE Trans. Serv. Comput. 14(1): 262-273 (2021)
[17]. Changlong Li, Hang Zhuang, Qingfeng Wang, Chao Wang, Xuehai Zhou: LKSM: Light Weight Key-Value Store for Efficient Application Services on Local Distributed Mobile Devices. IEEE Trans. Serv. Comput. 14(4): 1026-1039 (2021)
[18]. Xi Zeng, Tian Zhi, Xuda Zhou, Zidong Du, Qi Guo, Shaoli Liu, Bingrui Wang, Yuanbo Wen, Chao Wang, Xuehai Zhou, Ling Li, Tianshi Chen, Ninghui Sun, Yunji Chen: Addressing Irregularity in Sparse Neural Networks Through a Cooperative Software/Hardware Approach. IEEE Trans. Computers 69(7): 968-985 (2020)
[19]. Chao Wang, Lei Gong, Xiang Ma, Xi Li, Xuehai Zhou: WooKong: A Ubiquitous Accelerator for Recommendation Algorithms With Custom Instruction Sets on FPGA. IEEE Trans. Computers 69(7): 1071-1082 (2020)
[20]. Xuan Wang, Chao Wang, Jing Cao, Lei Gong, Xuehai Zhou: WinoNN: Optimizing FPGA-Based Convolutional Neural Network Accelerators Using Sparse Winograd Algorithm. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 39(11): 4290-4302 (2020)
[21]. Chao Wang, Lei Gong, Xi Li, Xuehai Zhou: A Ubiquitous Machine Learning Accelerator With Automatic Parallelization on FPGA. IEEE Trans. Parallel Distributed Syst. 31(10): 2346-2359 (2020)
[22]. Lei Gong, Chao Wang, Xi Li, Huaping Chen, Xuehai Zhou: MALOC: A Fully Pipelined FPGA Accelerator for Convolutional Neural Networks With All Layers Mapped on Chip. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 37(11): 2601-2612 (2018)
[23]. Xuda Zhou, Zidong Du, Qi Guo, Shaoli Liu, Chengsi Liu, Chao Wang, Xuehai Zhou, Ling Li, Tianshi Chen, Yunji Chen: Cambricon-S: Addressing Irregularity in Sparse Neural Networks through A Cooperative Software/Hardware Approach. MICRO 2018: 15-28
[24]. Chao Wang, Xi Li, Aili Wang, Xuehai Zhou: A Classroom Scheduling Service for Smart Classes. IEEE Trans. Serv. Comput. 10(2): 155-164 (2017)
[25]. Chao Wang, Xi Li, Yunji Chen, Youhui Zhang, Oliver Diessel, Xuehai Zhou: Service-Oriented Architecture on FPGA-Based MPSoC. IEEE Trans. Parallel Distributed Syst. 28(10): 2993-3006 (2017)
[26]. Chao Wang, Lei Gong, Qi Yu, Xi Li, Yuan Xie, Xuehai Zhou: DLAU: A Scalable Deep Learning Accelerator Unit on FPGA. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 36(3): 513-517 (2017)
[27]. Bo Wan, Xi Li, Haizhao Luo, Chao Wang, Xianglan Chen, Xuehai Zhou: Work-in-Progress: TTI: A Timing ISA for LET Model in Safety-Critical Systems. RTSS 2017: 363-365
[28]. Chao Wang, Junneng Zhang, Xi Li, Aili Wang, Xuehai Zhou: Hardware Implementation on FPGA for Task-Level Parallel Dataflow Execution Engine. IEEE Trans. Parallel Distributed Syst. 27(8): 2303-2315 (2016)
[29]. Chao Wang, Xi Li, Junneng Zhang, Peng Chen, Yunji Chen, Xuehai Zhou, Ray C. C. Cheung: Architecture Support for Task Out-of-Order Execution in MPSoCs. IEEE Trans. Computers 64(5): 1296-1310 (2015)
[30]. Shaoli Liu, Tianshi Chen, Ling Li, Xi Li, Mingzhe Zhang, Chao Wang, Haibo Meng, Xuehai Zhou, Yunji Chen: FreeRider: Non-Local Adaptive Network-on-Chip Routing with Packet-Carried Propagation of Congestion Information. IEEE Trans. Parallel Distributed Syst. 26(8): 2272-2285 (2015)
[31]. Chao Wang, Xi Li, Junneng Zhang, Xuehai Zhou, Xiaoning Nie: MP-Tomasulo: A Dependency-Aware Automatic Parallel Execution Engine for Sequential Programs. ACM Trans. Archit. Code Optim. 10(2): 9:1-9:26 (2013)