Related papers: Contrastive Learning for Multi-Modal Automatic Cod…

200 papers

Cross-lingual Cross-modal Retrieval (CCR) is an essential task in web search, which aims to break the barriers between modality and language simultaneously and achieve image-text retrieval in the multi-lingual scenario with a single model.…

Information Retrieval · Computer Science 2024-06-27 Zhijie Nie, Richong Zhang, Zhangchi Feng, Hailang Huang, Xudong Liu

Context: Code Clone Detection (CCD) is a software engineering task that is used for plagiarism detection, code search, and code comprehension. Recently, deep learning-based models have achieved an F1 score (a metric used to assess…

Software Engineering · Computer Science 2023-11-10 Mohamad Khajezade, Fatemeh Hendijani Fard, Mohamed S. Shehata

Modern Review Helpfulness Prediction systems depend on multiple modalities, typically texts and images. Unfortunately, those contemporary approaches pay scant attention to polishing representations of cross-modal relations and tend…

Computation and Language · Computer Science 2026-05-13 Thong Nguyen, Xiaobao Wu, Anh-Tuan Luu, Cong-Duy Nguyen, Zhen Hai, Lidong Bing

Modern Code Review (MCR) is a standard in all kinds of organizations that develop software. MCR pays for itself through perceived and proven benefits in quality assurance and knowledge transfer. However, the time invested in MCR is generally…

Software Engineering · Computer Science 2021-04-20 Robert Heumüller

Multi-modal Contrastive Representation learning aims to encode different modalities into a semantically aligned shared space. This paradigm shows remarkable generalization ability on numerous downstream tasks across various modalities.…

Machine Learning · Computer Science 2023-10-20 Zehan Wang, Yang Zhao, Xize Cheng, Haifeng Huang, Jiageng Liu, Li Tang, Linjun Li, Yongqi Wang, Aoxiong Yin, Ziang Zhang, Zhou Zhao

Machine unlearning offers effective solutions for revoking the influence of specific training data on pre-trained model parameters. While existing approaches address unlearning for classification and generative models, they overlook an…

Machine Learning · Computer Science 2025-08-19 Yihan Wang, Yiwei Lu, Guojun Zhang, Franziska Boenisch, Adam Dziedzic, Yaoliang Yu, Xiao-Shan Gao

Cross-modal retrieval (CMR) has been extensively applied in various domains, such as multimedia search engines and recommendation systems. Most existing CMR methods focus on image-to-text retrieval, whereas audio-to-text retrieval, a less…

Sound · Computer Science 2023-09-19 Kaiyi Luo, Xulong Zhang, Jianzong Wang, Huaxiong Li, Ning Cheng, Jing Xiao

Cross-lingual cross-modal retrieval (CCR) aims to retrieve visually relevant content based on non-English queries, without relying on human-labeled cross-modal data pairs during training. One popular approach involves utilizing machine…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 Yabing Wang, Le Wang, Qiang Zhou, Zhibin Wang, Hao Li, Gang Hua, Wei Tang

Modern Code Review (MCR) is a standard practice in software engineering, yet it demands substantial time and resource investments. Recent research has increasingly explored automating core review tasks using machine learning (ML) and deep…

Software Engineering · Computer Science 2025-08-26 Robert Heumüller, Frank Ortmeier

The cross-modal medical image-report retrieval task plays a significant role in clinical diagnosis and various medical generative tasks. Eliminating heterogeneity between different modalities to enhance semantic consistency is the key challenge…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Zeqiang Wei, Kai Jin, Xiuzhuang Zhou

Deep learning-based AMC methods have achieved remarkable performance, but their practical deployment remains constrained by the high cost of labeled data. Although self-supervised learning (SSL) reduces the reliance on labels, existing…

Signal Processing · Electrical Eng. & Systems 2026-05-13 Chenxu Wang, Shuang Wang, Lirong Han, Xinyu Hu, Hanlin Mo, Hantong Xing, Licheng Jiao

Human Activity Recognition is a field of research where input data can take many forms. Each of the possible input modalities describes human behaviour in a different way, and each has its own strengths and weaknesses. We explore the…

Computer Vision and Pattern Recognition · Computer Science 2022-10-07 Razvan Brinzea, Bulat Khaertdinov, Stylianos Asteriadis

The deployment of multimodal models in high-stakes domains, such as self-driving vehicles and medical diagnostics, demands not only strong predictive performance but also reliable mechanisms for detecting failures. In this work, we address…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Moru Liu, Hao Dong, Olga Fink, Mario Trapp

Contrastive learning (CL) methods effectively learn data representations in a self-supervised manner, where the encoder contrasts each positive sample against multiple negative samples via a one-vs-many softmax cross-entropy loss. By…

Machine Learning · Computer Science 2023-08-16 Huangjie Zheng, Xu Chen, Jiangchao Yao, Hongxia Yang, Chunyuan Li, Ya Zhang, Hao Zhang, Ivor Tsang, Jingren Zhou, Mingyuan Zhou

With the boom of e-commerce, Multimodal Review Helpfulness Prediction (MRHP), which aims to sort product reviews according to their predicted helpfulness scores, has become a research hotspot. Previous work on this task focuses on…

Computation and Language · Computer Science 2022-10-06 Wei Han, Hui Chen, Zhen Hai, Soujanya Poria, Lidong Bing

Contrastive Learning (CL) has emerged as a rising approach to address the challenge of sparse and noisy recommendation data. Although they have achieved promising results, most existing CL methods only perform either hand-crafted data or…

Information Retrieval · Computer Science 2023-11-22 Xiuyuan Qin, Huanhuan Yuan, Pengpeng Zhao, Junhua Fang, Fuzhen Zhuang, Guanfeng Liu, Victor Sheng

Code review is an effective software quality assurance activity; however, it is labor-intensive and time-consuming. Thus, a number of generation-based automatic code review (ACR) approaches have been proposed recently, which leverage deep…

Software Engineering · Computer Science 2023-03-14 Xin Zhou, Kisub Kim, Bowen Xu, DongGyun Han, Junda He, David Lo

Image-Text Retrieval (ITR) is challenging because it must bridge the visual and lingual modalities. Contrastive learning has been adopted by most prior art. Except for the limited amount of negative image-text pairs, the capability of contrastive learning…

Computer Vision and Pattern Recognition · Computer Science 2026-03-27 Haoran Wang, Dongliang He, Wenhao Wu, Boyang Xia, Min Yang, Fu Li, Yunlong Yu, Zhong Ji, Errui Ding, Jingdong Wang

Although one-hot encoding is commonly used for multiclass classification, it is not always the most effective encoding mechanism. Error Correcting Output Codes (ECOC) address multiclass classification by mapping each class to a unique…

Machine Learning · Computer Science 2025-08-15 Che-Yu Chou, Hung-Hsuan Chen

CLIP (Contrastive Language-Image Pre-training) uses contrastive learning from noisy image-text pairs to excel at recognizing a wide array of candidates, yet its focus on broad associations hinders its precision in distinguishing subtle…

Computer Vision and Pattern Recognition · Computer Science 2026-05-18 Ziyu Liu, Zeyi Sun, Yuhang Zang, Wei Li, Pan Zhang, Xiaoyi Dong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang