Xinyi Chen


I am a first-year PhD student at Fudan University, jointly advised by Prof. Bowen Zhou and Prof. Xin Peng. In parallel with my PhD, I work as an intern at the Embodied AI Center, Shanghai AI Laboratory, collaborating with Jiangmiao Pang, Linning Xu, and Yilun Chen. Prior to my PhD, I earned my B.Eng. degree with honors from Nanjing University.

My research focuses on embodied AI, particularly on robotic manipulation through vision-language-action models and world models that enable robots to understand multimodal contexts, predict interaction dynamics, and generalize across diverse tasks. I am always happy to chat, collaborate, or just make new friends, so drop me a message anytime!

news

Feb 21, 2026 One paper has been accepted by CVPR 2026.
Jul 15, 2025 Joined the PhD program at Fudan University after graduating from Nanjing University.
Feb 27, 2025 Two papers have been accepted by CVPR 2025.
Jul 22, 2024 I started my internship at Shanghai AI Laboratory 😄.

selected publications

  1. MM-ACT: Learn from Multimodal Parallel Generation to Act
    Haotian Liang*, Xinyi Chen*, Bin Wang*, Mingkang Chen, and 11 more authors
    2025
  2. CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
    Hao Li*, Shuai Yang*, Yilun Chen, Xinyi Chen, and 7 more authors
    AAAI 2026 (Oral)
    2025
  3. RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
    Haifeng Huang*, Xinyi Chen*, Yilun Chen, Hao Li, and 5 more authors
    In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), Jun 2025
  4. GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
    Ning Gao*, Yilun Chen*, Shuai Yang*, Xinyi Chen*, and 6 more authors
    In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), Jun 2025