Selected Publications


Exploring Visual Pretraining for Learning Language Intelligence
Zhonghan Zhao*, Yiming Zhang*, Wenwei Zhang*, ..., Gaoang Wang†, Kai Chen†
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , 2026
[Paper]
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale
Yicheng Zou, ..., Wenwei Zhang, et al.
arXiv, 2026
[Paper] [Project Page]
Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving
Songyang Gao*, Yuzhe Gu*, Zijian Wu*, Lingkai Kong*, Wenwei Zhang*, et al.
arXiv, 2025
Officially ranked 3rd among human competitors and 1st among AI competitors in CMO2025.
[Paper]
Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning
Haiteng Zhao*, Junhao Shen*, ..., Wenwei Zhang†, Kai Chen†
International Conference on Learning Representations (ICLR) , 2026
Surpasses AlphaGeometry2, SeedGeometry, and Human Gold Medalists.
[Paper] [Code] [Project Page]
RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy
Zhonghan Zhao*, Wenwei Zhang*, Haian Huang, Kuikun Liu, Jianfei Gao, Gaoang Wang†, Kai Chen†
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , 2026
[Paper]
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Zhonghua Wu, Qingyi Tao, Wentao Liu, Wei Li, Chen Change Loy
International Conference on Computer Vision (ICCV) , 2025
[Paper] [Code] [Project Page]
Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs
Yuzhe Gu, Wenwei Zhang†, Chengqi Lyu, Dahua Lin, Kai Chen†
International Conference on Learning Representations (ICLR) , 2025
[Paper] [Code]
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
Chengqi Lyu*, Songyang Gao*, Yuzhe Gu*, Wenwei Zhang*†, ..., Kai Chen†
Conference on Language Modeling (COLM) , 2025
[Paper] [Code] [Project Page]
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Zehui Chen*, Kuikun Liu*, Qiuchen Wang, Jiangning Liu, Wenwei Zhang, Kai Chen, Feng Zhao
International Conference on Learning Representations (ICLR) , 2025
[Paper] [Code] [Project Page]
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao
Findings of the Association for Computational Linguistics (ACL Findings) , 2024
[Paper] [Project Page]
InternLM: A Multilingual Language Model with Progressively Enhanced Capabilities
InternLM Team
GitHub, 2023
[Code] [Project Page]
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
Tao Gong*, Chengqi Lyu*, Shilong Zhang*, ..., Wenwei Zhang*, Ping Luo, Kai Chen
arXiv, 2023
[Paper] [Code] [Project Page]
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
Xiangtai Li*, Wenwei Zhang*, Jiangmiao Pang*, Kai Chen, Guangliang Cheng, Yunhai Tong, Chen Change Loy
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2022 (oral)
[Paper] [Code] [Project Page]
K-Net: Towards Unified Image Segmentation
Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy
Advances in Neural Information Processing Systems (NeurIPS) , 2021
New state of the art on COCO Panoptic Segmentation and ADE20K Semantic Segmentation datasets.
[Paper] [Code] [Project Page]
Seesaw Loss for Long-Tailed Instance Segmentation
Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2021
Key component for the 2nd runner-up method in LVIS Challenge 2020.
[Paper] [Code]
Exploring Data Augmentation for Multi-Modality 3D Object Detection
Wenwei Zhang, Zhe Wang, Chen Change Loy
International Conference on Learning Representations, Scene Representations For Autonomous Driving Workshop (ICLRW) , 2023
Method for the Best PKL Award in 3rd nuScenes detection challenge of 5th AI Driving Olympics, NeurIPS 2020
[Paper] [Code]
Side-Aware Boundary Localization for More Precise Object Detection
Jiaqi Wang, Wenwei Zhang, Yuhang Cao, Kai Chen, Jiangmiao Pang, Tao Gong, Jianping Shi, Chen Change Loy, Dahua Lin
European Conference on Computer Vision (ECCV) , 2020 (spotlight)
Key component for the 1st place method in COCO Detection Challenge 2019.
[Paper] [Code]
Robust Multi-Modality Multi-Object Tracking
Wenwei Zhang, Hui Zhou, Shuyang Sun, Zhe Wang, Jianping Shi, Chen Change Loy
International Conference on Computer Vision (ICCV) , 2019
[Paper] [Code] [Project Page] [Poster]
Before the Rise of Machines, The beginning of Consciousness and the Human Intelligence
H. J. Cai, Tianqi Cai, Wenwei Zhang, Kai Wang
Tsinghua University Press, 2017
Awarded the Wu Wenjun Award for Science and Technology of Artificial Intelligence in 2017.