Homepage
I am a first-year PhD student of Multimedia Laboratory at The Chinese University of Hong Kong supervised by Prof. Wanli Ouyang. Before that, I obtained my Bachelor’s degree from the Honor Class of Artificial Intelligence at Shanghai Jiao Tong University. My research interests lie in Large Language Models, Multi-agent Systems, and Reinforcement Learning.
News
- 2025.08: ReSo was accepted by EMNLP 2025.
- 2025.04: I accepted the offer of Hong Kong PhD Fellowship Scheme.
- 2025.02: ComfyBench was accepted by CVPR 2025.
- 2025.02: I accepted the offer of PhD study at The Chinese University of Hong Kong.
- 2024.05: I interned at Shanghai Artificial Intelligence Laboratory under the supervision of Dr. Lei Bai.
Publications
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Guibin Zhang, Hejia Geng, Xiaohang Yu, Zhenfei Yin, Zaibin Zhang, Zelin Tan, Heng Zhou, Zhongzhi Li, Xiangyuan Xue, Yijiang Li, Yifan Zhou, Yang Chen, Chen Zhang, Yutao Fan, Zihu Wang, Songtao Huang, Yue Liao, Hongru Wang, Mengyue Yang, Heng Ji, Michael Littman, Jun Wang, Shuicheng Yan, Philip Torr, Lei Bai (arXiv preprint)
[Paper] [Code]Position: Intelligent Science Laboratory Requires the Integration of Cognitive and Embodied AI
Sha Zhang, Suorong Yang, Tong Xie, Xiangyuan Xue, Zixuan Hu, Rui Li, Wenxi Qu, Zhenfei Yin, Tianfan Fu, Di Hu, Andres M Bran, Nian Ran, Bram Hoex, Wangmeng Zuo, Philippe Schwaller, Wanli Ouyang, Lei Bai, Yanyong Zhang, Lingyu Duan, Shixiang Tang, Dongzhan Zhou (arXiv preprint)
[Paper]ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks
Heng Zhou, Hejia Geng, Xiangyuan Xue, Li Kang, Yiran Qin, Zhiyong Wang, Zhenfei Yin, Lei Bai
Conference on Empirical Methods in Natural Language Processing (EMNLP 2025)
[Paper] [Code]ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems
Xiangyuan Xue, Zeyu Lu, Di Huang, Zidong Wang, Wanli Ouyang, Lei Bai
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025)
[Paper] [Code] [Project]