Yizhen Zhang

I am a first-year master's student in IIGroup at Tsinghua University, supervised by Prof. Yujiu Yang. I received my Bachelor's degree in Computer Science and Technology from Harbin Institute of Technology, Shenzhen.

Previously, I was honoured to work with Prof. Wenjie Pei at HITsz. I am currently working closely with Ruilin Luo and Yang Ding at THU.

Email  /  WeChat  /  Github  /  Google Scholar

Education
Tsinghua University
M.Eng. in Artificial Intelligence (2024 - Present)
Advisor: Prof. Yujiu Yang
Harbin Institute of Technology, Shenzhen
B.Eng. in Computer Science and Technology (2020 - 2024)
Advisor: Prof. Wenjie Pei
Research

My current research focuses on Multimodal Large Language Models (MLLMs), specifically on reinforcement learning for aligning Vision-Language Models and on improving their reasoning ability in complex scenarios.

Publications

PixelCraft: A Multi-Agent System for High-Fidelity Visual Reasoning on Structured Images
Shuoshuo Zhang, Zijian Li, Yizhen Zhang, Jingjing Fu, Lei Song, Jiang Bian, Jun Zhang, Yujiu Yang, Rui Wang
Under Review
Preprint / Code

PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning
Yizhen Zhang, Yang Ding, Shuoshuo Zhang, Xinchen Zhang, Haoling Li, Zhong-zhi Li, Peijie Wang, Jie Wu, Lei Ji, Yelong Shen, Yujiu Yang, Yeyun Gong
NeurIPS 2025
Preprint / Code

Teaching Your Models to Understand Code via Focal Preference Alignment
Jie Wu, Haoling Li, Xin Zhang, Xiao Liu, Yangyu Huang, Jianwen Luo, Yizhen Zhang, Zuchao Li, Ruihang Chu, Yujiu Yang, Scarlett Li
EMNLP 2025 (Main)
Preprint / Code

Efficiently Building Large Language Models through Merging
Yizhen Zhang, Yang Ding, Jie Wu, Yujiu Yang
NeurIPS 2024 LMC Oral
Preprint

Experience
Microsoft, NLC Group
Research Intern (Feb. 2025 - Present)
Topic: Multimodal Large Language Model Reasoning
Mentors: Lei Ji, Yeyun Gong, Yelong Shen
Rednote, Communities Feed Team
Research Intern (Apr. 2024 - Aug. 2024)
Topic: Recommendation System & Computational Advertising
Honors & Awards
  • NeurIPS 2024 Large Language Model Merging Challenge (LMC) [link], Rank: 1/150, 2024
  • CVPR 2023 Workshop Image Matching Challenge [link], Silver medal, 2023