💻 Publications

Decoupled Global-Local Alignment for Improving Compositional Understanding
Xiaoxing Hu*, Kaicheng Yang*, Jun Wang, Haoran Xu, Ziyong Feng, Yupei Wang (Equal First author)

Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs
Tiancheng Gu*, Kaicheng Yang*, Ziyong Feng, Xingjun Wang, Yanzhao Zhang, Dingkun Long, Yingda Chen, Weidong Cai, Jiankang Deng (Equal First author)

RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm
Tiancheng Gu*, Kaicheng Yang*, Chaoyi Zhang, Yin Xie, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng (Equal First author)

CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination
Kaicheng Yang*, Tiancheng Gu*, Xiang An, Haiqiang Jiang, Xiangzi Dai, Ziyong Feng, Weidong Cai, Jiankang Deng (First author)

ORID: Organ-Regional Information Driven Framework for Radiology Report Generation
Tiancheng Gu*, Kaicheng Yang*, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai (Oral, Equal First author)

RWKV-CLIP: A Robust Vision-Language Representation Learner
Tiancheng Gu*, Kaicheng Yang*, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng (Equal First author)

Multi-label Cluster Discrimination for Visual Representation Learning
Xiang An, Kaicheng Yang, Xiangzi Dai, Ziyong Feng, Jiankang Deng

ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
Kaicheng Yang, Jiankang Deng, Xiang An, Jiawei Li, Ziyong Feng, Jia Guo, Jing Yang, Tongliang Liu

Unicom: Universal and Compact Representation Learning for Image Retrieval
Xiang An, Jiankang Deng, Kaicheng Yang, Jiawei Li, Ziyong Feng, Jia Guo, Jing Yang, Tongliang Liu

CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis
Kaicheng Yang, Hua Xu, Kai Gao (Oral)

CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotations of Modality
Wenmeng Yu, Hua Xu, Fanyang Meng, Yilin Zhu, Yixiao Ma, Jiele Wu, Jiyun Zou, Kaicheng Yang