Computer Vision and Pattern Recognition · Computer Science
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data
Sachin Mehta, Maxwell Horton, Fartash Faghri, Mohammad Hossein Sekhavat +4
2024-04-25
Computer Vision and Pattern Recognition · Computer Science
SuperCLIP: CLIP with Simple Classification Supervision
Weiheng Zhao, Zilong Huang, Jiashi Feng, Xinggang Wang
2025-12-17
Computer Vision and Pattern Recognition · Computer Science
Improved baselines for vision-language pre-training
Enrico Fini, Pietro Astolfi, Adriana Romero-Soriano, Jakob Verbeek +1
2023-11-07
Computer Vision and Pattern Recognition · Computer Science
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training
Haicheng Wang, Chen Ju, Weixiong Lin, Shuai Xiao +8
2024-12-03
Computer Vision and Pattern Recognition · Computer Science
Classification Done Right for Vision-Language Pre-Training
Zilong Huang, Qinghao Ye, Bingyi Kang, Jiashi Feng +1
2024-11-07
Computer Vision and Pattern Recognition · Computer Science
Learning from Children: Improving Image-Caption Pretraining via Curriculum
Hammad A. Ayyubi, Rahul Lokesh, Alireza Zareian, Bo Wu +1
2023-05-31
Computer Vision and Pattern Recognition · Computer Science
Retrieval-Enhanced Contrastive Vision-Text Models
Ahmet Iscen, Mathilde Caron, Alireza Fathi, Cordelia Schmid
2024-02-22
Computer Vision and Pattern Recognition · Computer Science
TULIP: Towards Unified Language-Image Pretraining
Zineng Tang, Long Lian, Seun Eisape, XuDong Wang +5
2025-04-09
Computer Vision and Pattern Recognition · Computer Science
Language-Image Alignment with Fixed Text Encoders
Jingfeng Yang, Ziyang Wu, Yue Zhao, Yi Ma
2025-06-05
Computer Vision and Pattern Recognition · Computer Science
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding
Le Zhang, Rabiul Awal, Aishwarya Agrawal
2024-04-26
Computer Vision and Pattern Recognition · Computer Science
Demonstrating and Reducing Shortcuts in Vision-Language Representation Learning
Maurits Bleeker, Mariya Hendriksen, Andrew Yates, Maarten de Rijke
2024-08-02
Computer Vision and Pattern Recognition · Computer Science
Learning Visual Composition through Improved Semantic Guidance
Austin Stone, Hagen Soltau, Robert Geirhos, Xi Yi +5
2025-04-07
Machine Learning · Computer Science
Finetuning CLIP to Reason about Pairwise Differences
Dylan Sam, Devin Willmott, Joao D. Semedo, J. Zico Kolter
2025-07-08
Computer Vision and Pattern Recognition · Computer Science
MedCLIP: Contrastive Learning from Unpaired Medical Images and Text
Zifeng Wang, Zhenbang Wu, Dinesh Agarwal, Jimeng Sun
2022-10-20
Computer Vision and Pattern Recognition · Computer Science
Unified Contrastive Learning in Image-Text-Label Space
Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Bin Xiao +3
2022-04-08
Computation and Language · Computer Science
Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality
Harman Singh, Pengchuan Zhang, Qifan Wang, Mengjiao Wang +3
2023-10-26
Computer Vision and Pattern Recognition · Computer Science
Contrastive Localized Language-Image Pre-Training
Hong-You Chen, Zhengfeng Lai, Haotian Zhang, Xinze Wang +6
2025-02-20
Computer Vision and Pattern Recognition · Computer Science
Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
Xin Xiao, Bohong Wu, Jiacong Wang, Chunyuan Li +2
2024-11-06
Computer Vision and Pattern Recognition · Computer Science
CLIP-Event: Connecting Text and Images with Event Structures
Manling Li, Ruochen Xu, Shuohang Wang, Luowei Zhou +5
2022-05-02