Dawei Zhu

Hi! I am a Ph.D. student at the School of Computer Science, Peking University, advised by Prof. Sujian Li.

My research focuses on long-context language modeling, and I am also interested in agents and multimodality. Before starting my Ph.D., I received my bachelor's degree from the School of EECS, Peking University.

Representative Research Works

SELECTED AND RECENT

Publications

Research Papers

PaperBanana
PaperBanana: Automating Academic Illustration for AI Scientists
Dawei Zhu, Rui Meng, Yale Song, Xiyu Wei, Sujian Li, Tomas Pfister, Jinsung Yoon
ICML, 2026 (Spotlight)
DocLens
DocLens: A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding
Dawei Zhu, Rui Meng, Jiefeng Chen, Sujian Li, Tomas Pfister, Jinsung Yoon
ACL, 2026
Learning to Draft
Learning to Draft: Adaptive Speculative Decoding with Reinforcement Learning
Jiebin Zhang, Zhenghan Yu, Liang Wang, Nan Yang, Eugene J. Yu, Zheng Li, Yifan Song, Dawei Zhu, et al.
ICLR, 2026
LongRePS
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
Dawei Zhu*, Xiyu Wei*, Guangxiang Zhao, Wenhao Wu, Haosheng Zou, Junfeng Ran, Xun Wang, Lin Sun, Xiangzheng Zhang, Sujian Li
Findings of EMNLP, 2025
LongAttn
LongAttn: Selecting Long-context Training Data via Token-level Attention
Longyun Wu*, Dawei Zhu*, Guangxiang Zhao, Zhuocheng Yu, Junfeng Ran, Xiangyu Wong, Lin Sun, Sujian Li
Findings of ACL, 2025
PLD
PLD: A Choice-Theoretic List-Wise Knowledge Distillation
Ejafa Bassam, Dawei Zhu, Kaigui Bian
NeurIPS, 2025
MMTEB
MMTEB: Massive Multilingual Text Embedding Benchmark
Kenneth Enevoldsen, Isaac Chung, Imene Kerboua, Márton Kardos, Ashwin Mathur, Julian Stap, Nikhil Gala, et al., including Dawei Zhu
ICLR, 2025
WIKIGENBENCH
WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario
Jiebin Zhang, Eugene J. Yu, Qinyu Chen, Chenhao Xiong, Dawei Zhu, Han Qian, Mingbo Song, Weimin Xiong, et al.
COLING, 2025
More Tokens, Lower Precision
More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression
Jiebin Zhang, Dawei Zhu, Yifan Song, Wenhao Wu, Chuqiao Kuang, Xiaoguang Li, Lifeng Shang, Qun Liu, Sujian Li
Findings of ACL, 2025
EERPD
EERPD: Leveraging Emotion and Emotion Regulation for Improving Personality Detection
Zheng Li, Dawei Zhu, Qilong Ma, Weimin Xiong, Sujian Li
COLING, 2025
LongEmbed
LongEmbed: Extending Embedding Models for Long Context Retrieval
Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li
EMNLP, 2024
PoSE
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
Dawei Zhu, Nan Yang, Liang Wang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li
ICLR, 2024
CoUDA
CoUDA: Coherence Evaluation via Unified Data Augmentation
Dawei Zhu*, Wenhao Wu, Yifan Song, Fangwei Zhu, Ziqiang Cao, Sujian Li
NAACL, 2024
AgentBank
AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
Yifan Song, Weimin Xiong, Xiutian Zhao, Dawei Zhu, Wenhao Wu, Ke Wang, Cheng Li, Wei Peng, Sujian Li
Findings of EMNLP, 2024
FairEval
Large Language Models are not Fair Evaluators
Peiyi Wang, Lei Li, Liang Chen, Dawei Zhu, Binghuai Lin, Yunbo Cao, Qi Liu, Tianyu Liu, Zhifang Sui
ACL, 2024
Long Context Alignment
Long Context Alignment with Short Instructions and Synthesized Positions
Wenhao Wu, Yizhong Wang, Yao Fu, Xiang Yue, Dawei Zhu, Sujian Li
arXiv, 2024
BiGuid
Probing Bilingual Guidance for Cross-Lingual Summarization
Dawei Zhu, Wenhao Wu, Sujian Li
NLPCC, 2023
InfoCL
InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspective
Yifan Song, Peiyi Wang, Weimin Xiong, Dawei Zhu, Tianyu Liu, Zhifang Sui, Sujian Li
Findings of EMNLP, 2023
GraphPrompt
GraphPrompt: Graph-Based Prompt Templates for Biomedical Synonym Prediction
Hanwen Xu, Jiayou Zhang, Zhirui Wang, Shizhuo Zhang, Megh Bhalerao, Yucong Liu, Dawei Zhu, Sheng Wang
AAAI, 2023
RestGPT
RestGPT: Connecting Large Language Models with Real-World Applications via RESTful APIs
Yifan Song, Weimin Xiong, Dawei Zhu, Cheng Li, Ke Wang, Ye Tian, Sujian Li
arXiv, 2023
DocRED-FE
DocRED-FE: A Document-Level Fine-Grained Entity And Relation Extraction Dataset
Hongbo Wang, Weimin Xiong, Yifan Song, Dawei Zhu, Yu Xia, Sujian Li
ICASSP, 2023
ConFiguRe
ConFiguRe: Exploring Discourse-level Chinese Figures of Speech
Dawei Zhu, Qiusi Zhan, Zhejian Zhou, Yifan Song, Jiebin Zhang, Sujian Li
COLING, 2022

Surveys

Long Context Survey
A Comprehensive Survey on Long Context Language Modeling
Jiaheng Liu, Dawei Zhu, Zhiqi Bai, et al.
arXiv, 2025
Latent Reasoning Survey
A Survey on Latent Reasoning
Ruijie Zhu, Tianyang Peng, Tian Cheng, Xingwei Qu, Jiaheng Huang, Dawei Zhu, et al.
arXiv, 2025

Technical Reports

MiMo-V2-Flash
MiMo-V2-Flash Technical Report
Xiaomi LLM-Core Team, including Dawei Zhu
arXiv, 2026
MiMo-Audio
MiMo-Audio: Audio Language Models are Few-Shot Learners
Xiaomi LLM-Core Team, including Dawei Zhu
arXiv, 2025
MiMo-VL
MiMo-VL Technical Report
Xiaomi LLM-Core Team, including Dawei Zhu
arXiv, 2025
MiMo
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining
LLM-Core Xiaomi, including Dawei Zhu
arXiv, 2025

* denotes equal contribution. For a full list, please see Google Scholar.

Education & Work Experience

Peking University, Beijing, China
Ph.D. Student, School of Computer Science (Sept 2022 - Present)
Advisor: Prof. Sujian Li.
B.Sc. in Computer Science and Technology, School of EECS (Sept 2018 - Jun 2022)
Xiaomi MiMo, Beijing, China
Research Intern (Mar 2026 - Present)
Core contributor to the MiMo-V2 and V2.5 series.
Google Cloud AI, Sunnyvale, CA, USA
Student Researcher (Jun 2025 - Feb 2026)
Xiaomi MiMo, Beijing, China
Research Intern (Jan 2025 - Jun 2025)
Core contributor to MiMo-7B and MiMo-7B-VL.
Microsoft Research Asia, Beijing, China
Research Intern (Jun 2023 - Mar 2024)
Mentored by Liang Wang and Nan Yang. Researched long-context retrieval and efficient context window extension.

Awards & Community Contributions