Publications

A Google Scholar version of this list is also available on my Google Scholar profile.

Peer-Reviewed Publications

All publications below are full-length papers. The list is ordered by year.

In 2026:

ISCA'26 Accelerating MoE with Dynamic In-Switch Computing on Multi-GPUs

Qijun Zhang, Chen Zhang, Zhuoshan Zhou, Haibo Wang, Zhe Zhou, Zhipeng Tu, Guangyu Sun, Zhiyao Xie, Yijia Diao, Zhigang Ji, Jingwen Leng, Guanghui He, and Minyi Guo

International Symposium on Computer Architecture (ISCA), 2026.

DAC'26 G-Power: Architecture-level GPU Power Modeling with Aggregated Knowledge Foundations from Known GPUs

Qijun Zhang, Yao Lu, Shang Liu, Mengming Li, Chen Zhang, Dongbo Wang, and Zhiyao Xie

ACM/IEEE Design Automation Conference (DAC), 2026.

HPCA'26 Towards Compute-Aware In-Switch Computing for LLMs Tensor-Parallelism on Multi-GPU Systems

Chen Zhang, Qijun Zhang†, Zhuoshan Zhou, Yijia Diao, Haibo Wang, Zhe Zhou, Zhipeng Tu, Zhiyao Li, Guangyu Sun, Zhuoran Song, Zhigang Ji, Jingwen Leng, and Minyi Guo († Corresponding Author)

IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2026.

ASPDAC'26 ReadyPower: A Reliable, Interpretable, and Handy Architectural Power Model Based on Analytical Framework

Qijun Zhang, Shang Liu, Yao Lu, Mengming Li, and Zhiyao Xie

Asia and South Pacific Design Automation Conference (ASP-DAC), 2026.

Rank Second in the Track

ISCA'26 MoE-Hub: Taming Software Complexity for Seamless MoE Overlap with Hardware-Accelerated Communication on Multi-GPU Systems

Zhuoshan Zhou, Chen Zhang, Shuyi Zhang, Qijun Zhang, Haibo Wang, Zhe Zhou, Zhipeng Tu, Guangyu Sun, Yijia Diao, Zhigang Ji, Jingwen Leng, Guanghui He, and Minyi Guo

International Symposium on Computer Architecture (ISCA), 2026.

ISCA'26 ICP: Exploiting Instruction Correlation for Prefetching Irregular Memory Accesses

Mengming Li, Chenlu Miao, Buqing Xu, Qijun Zhang, Xiangfeng Sun, Ceyu Xu, Yuan Xie, Wenkai Li, Shang Liu, and Zhiyao Xie

International Symposium on Computer Architecture (ISCA), 2026.

DAC'26 FSGen: Agile Fused and Sparse Accelerator Generator with Accurate Power Model for LLM Applications

Jay Zhe-An Mok, Qijun Zhang, and Zhiyao Xie

ACM/IEEE Design Automation Conference (DAC), 2026.

DAC'26 COOL: A Cooling-Aware Point Transformer Framework for Thermal Prediction in Advanced 3D/3.5D IC Packaging

Yao Lu, Zhicheng Guo, Qijun Zhang, Shang Liu, Wenji Fang, Wenkai Li, and Zhiyao Xie

ACM/IEEE Design Automation Conference (DAC), 2026.

TCAD'26 A Self-Supervised and Cross-Design Netlist Power Model for Time-Based Layout Power Analysis

Wenkai Li, Yao Lu, Wenji Fang, Yugao Zhu, Ziyan Guo, Jing Wang, Mengming Li, Qijun Zhang, and Zhiyao Xie

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), Early Access.

TCAD'26 MFSPart: A Generalized Partitioning Framework for Multi-FPGA Systems and Its Ensemble-Based Extension

Yugao Zhu, Wenji Fang, Yao Lu, Shang Liu, Yanzhen Zhu, Qijun Zhang, and Zhiyao Xie

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), Early Access.

In 2025:

NeurIPS'25 ArchPower: Dataset for Architecture-Level Power Modeling of Modern CPU Design

Qijun Zhang, Yao Lu, Mengming Li, Shang Liu, and Zhiyao Xie

Advances in Neural Information Processing Systems (NeurIPS), D&B Track, 2025.

DAC'25 AutoPower: Automated Few-Shot Architecture-Level Power Modeling by Power Group Decoupling

Qijun Zhang, Yao Lu, Mengming Li, and Zhiyao Xie

ACM/IEEE Design Automation Conference (DAC), 2025.

HPCA'25 Integrating Prefetcher Selection with Dynamic Request Allocation Improves Prefetching Efficiency

Mengming Li*, Qijun Zhang*, Yongqing Ren, and Zhiyao Xie (* Equal Contribution)

IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2025.

TCAD'25 An Architecture-Level CPU Modeling Framework for Power and Other Design Qualities

Qijun Zhang, Mengming Li, Andrea Mondelli, and Zhiyao Xie

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2025.

ASPDAC'25 FirePower: Towards a Foundation with Generalizable Knowledge for Architecture-Level Power Modeling

Qijun Zhang, Mengming Li, Yao Lu, and Zhiyao Xie

Asia and South Pacific Design Automation Conference (ASP-DAC), 2025.

ASPDAC'25 Pointer: An Energy-Efficient ReRAM-based Point Cloud Recognition Accelerator with Inter-layer and Intra-layer Optimizations

Qijun Zhang, and Zhiyao Xie

Asia and South Pacific Design Automation Conference (ASP-DAC), 2025.

ISCA'25 Profile-Guided Temporal Prefetching

Mengming Li, Qijun Zhang, Yichuan Gao, Wenji Fang, Yao Lu, Yongqing Ren, and Zhiyao Xie

International Symposium on Computer Architecture (ISCA), 2025.

DAC'25 ATLAS: A Self-Supervised and Cross-Stage Netlist Power Model for Fine-Grained Time-Based Layout Power Analysis

Wenkai Li, Yao Lu, Wenji Fang, Jing Wang, Qijun Zhang, and Zhiyao Xie

ACM/IEEE Design Automation Conference (DAC), 2025.

TCAD'25 RTLCoder: Fully Open-Source and Efficient LLM-Assisted RTL Code Generation Technique

Shang Liu, Wenji Fang, Yao Lu, Jing Wang, Qijun Zhang, Hongce Zhang, and Zhiyao Xie

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2025.

TCAD'25 Transferable Pre-Synthesis PPA Estimation for RTL Designs with Data Augmentation Techniques

Wenji Fang, Yao Lu, Shang Liu, Qijun Zhang, Ceyu Xu, Lisa Wu Wills, Hongce Zhang, and Zhiyao Xie

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2025.

ASPDAC'25 Towards Big Data in AI for EDA Research: Generation of New Pseudo Circuits at RTL Stage

Shang Liu, Wenji Fang, Yao Lu, Qijun Zhang, and Zhiyao Xie

Asia and South Pacific Design Automation Conference (ASP-DAC), 2025.

In 2024:

ISLPED'24 Unleashing Flexibility of ML-based Power Estimators Through Efficient Development Strategies

Yao Lu*, Qijun Zhang*, and Zhiyao Xie (* Equal Contribution)

ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED), 2024.

Best Paper Nomination (8 nominees out of 167 submissions)

LAD'24 RTLCoder: Outperforming GPT-3.5 in Design RTL Generation with Our Open-Source Dataset and Lightweight Solution

Shang Liu, Wenji Fang, Yao Lu, Qijun Zhang, Hongce Zhang, and Zhiyao Xie

IEEE International Workshop on LLM-Aided Design (LAD), 2024.

Best Paper Nomination (6 nominees out of 50 submissions)

ASPDAC'24 RTLLM: An Open-Source Benchmark for Design RTL Generation with Large Language Model

Yao Lu, Shang Liu, Qijun Zhang, and Zhiyao Xie

Asia and South Pacific Design Automation Conference (ASP-DAC), 2024.

In 2023:

ICCAD'23 PANDA: Architecture-Level Power Evaluation by Unifying Analytical and Machine Learning Solutions

Qijun Zhang, Shiyu Li, Guanglei Zhou, Jingyu Pan, Chen-Chia Chang, Yiran Chen, and Zhiyao Xie

IEEE/ACM International Conference on Computer Aided Design (ICCAD), 2023.

ICCAD'23 MasterRTL: A Pre-Synthesis PPA Estimation Framework for Any RTL Design

Wenji Fang, Yao Lu, Shang Liu, Qijun Zhang, Ceyu Xu, Lisa Wu Wills, Hongce Zhang, and Zhiyao Xie

IEEE/ACM International Conference on Computer Aided Design (ICCAD), 2023.