Publications

2025

  1. MMM 2025
    Understanding the Roles of Visual Modality in Multimodal Dialogue: An Empirical Study
    Qian Cao, Ruihua Song, and Xu Chen
    In Proceedings of the 31st International Conference on Multimedia Modeling, 2025, Jan 2025

2024

  1. BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain
    Kaisi Guan, Qian Cao, Yuchong Sun, Xiting Wang, and Ruihua Song
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  2. See or Guess: Counterfactually Regularized Image Captioning
    Qian Cao, Xu Chen, Ruihua Song, Xiting Wang, Xinting Huang, and Yuchen Ren
    In Proceedings of the 32th ACM international conference on Multimedia, Oct 2024
  3. YuLan: An Open-source Large Language Model
    Yutao Zhu, Kun Zhou, Kelong Mao, Wentong Chen, Yiding Sun, Zhipeng Chen, Qian Cao, Yihan Wu, Yushuo Chen, Feng Wang, and  others
    arXiv preprint arXiv:2406.19853 Oct 2024

2022

  1. Multi-Modal Experience Inspired AI Creation
    Qian Cao, Xu Chen, Ruihua Song, Hao Jiang, Guang Yang, and Zhao Cao
    In Proceedings of the 30th ACM international conference on Multimedia, Oct 2022