💻 Public Papers

ACM MM2025
sym

Decoupled Global-Local Alignment for Improving Compositional Understanding
Xiaoxing Hu*, Kaicheng Yang*, Jun Wang, Haoran Xu, Ziyong Feng, Yupei Wang (Equal First author)

Arxiv, 公众号, GitHub, Project Websit

ACM MM2025
sym

Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs
Tiancheng Gu*, Kaicheng Yang*, Ziyong Feng, Xingjun Wang, Yanzhao Zhang, Dingkun Long, Yingda Chen, Weidong Cai, Jiankang Deng (Equal First author)

Arxiv, 公众号, GitHub, Project Websit

ACM MM2025
sym

RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm
Tiancheng Gu*, Kaicheng Yang*, Chaoyi Zhang, Yin Xie, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng (Equal First author)

Arxiv, 公众号, GitHub, Project Websit

AAAI2025
sym

CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination
Kaicheng Yang*, Tiancheng Gu*, Xiang An, Haiqiang Jiang, Xiangzi Dai, Ziyong Feng, Weidong Cai, Jiankang Deng (First author)

Arxiv, 公众号

WACV2025
sym

ORID: Organ-Regional Information Driven Framework for Radiology Report Generation
Tiancheng Gu*, Kaicheng Yang*, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai (Oral, Equal First author)

Arxiv

EMNLP2024
sym

RWKV-CLIP: A Robust Vision-Language Representation Learner
Tiancheng Gu*, Kaicheng Yang*, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng (Equal First author)

Arxiv, Code, 公众号

ECCV2024
sym

Multi-label Cluster Discrimination for Visual Representation Learning
Xiang An, Kaicheng Yang, Xiangzi Dai, Ziyong Feng, Jiankang Deng

Arxiv, Code, 公众号

ICCV2023
sym

ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
Kaicheng Yang, Jiankang Deng, Xiang An, Jiawei Li, Ziyong Feng, Jia Guo, Jing Yang, Tongliang Liu

Arxiv, Code, 公众号

ICLR2023
sym

Unicom: Universal and Compact Representation Learning for Image Retrieval
Xiang An, Jiankang Deng, Kaicheng Yang, Jiawei Li, Ziyong Feng, Jia Guo, Jing Yang, Tongliang Liu

Arxiv, Code, 公众号

ACM MM2020
sym
ACL2020
sym

CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotations of Modality
Wenmeng Yu, Hua Xu, Fanyang Meng, Yilin Zhu, Yixiao Ma, Jiele Wu, Jiyun Zou, Kaicheng Yang

Code