publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. arXiv 2025
    mmact.png
    MM-ACT: Learn from Multimodal Parallel Generation to Act
    Haotian Liang*, Xinyi Chen*, Bin Wang*, Mingkang Chen, and 11 more authors
    2025
  2. AAAI 2026 (Oral)
    cronusvla.png
    CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
    Hao Li*, Shuai Yang*, Yilun Chen, Xinyi Chen, and 7 more authors
    2025
  3. arXiv 2025
    internvlam1.png
    InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
    Xinyi Chen, Yilun Chen, Yanwei Fu, Ning Gao, and 25 more authors
    2025
  4. RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
    Haifeng Huang*, Xinyi Chen*, Yilun Chen, Hao Li, and 5 more authors
    In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), Jun 2025
  5. GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
    Ning Gao*, Yilun Chen*, Shuai Yang*, Xinyi Chen*, and 6 more authors
    In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), Jun 2025