- Statistical Foundations of Large Language Models.
- Representation Learning.
- MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning.
Peng Xia, Jinglu Wang, Yibo Peng, Kaide Zeng, Zihan Dong, Xian Wu, Xiangru Tang, Hongtu Zhu, Yun Li, Linjun Zhang, Shujie Liu, Yan Lu, and Huaxiu Yao
ICLR 2026.