English
Related papers

Related papers: Multi-level Contrastive Learning for Script-based …

200 papers

Learning similarity metrics for glyphs and writing systems faces a fundamental challenge: while individual graphemes within invented alphabets can be reliably labeled, the historical relationships between different scripts remain uncertain…

Computer Vision and Pattern Recognition · Computer Science 2026-03-09 Claire Roman , Philippe Meyer

This work considers supervised contrastive learning for semantic segmentation. We apply contrastive learning to enhance the discriminative power of the multi-scale features extracted by semantic segmentation networks. Our key methodological…

Computer Vision and Pattern Recognition · Computer Science 2022-07-21 Theodoros Pissas , Claudio S. Ravasio , Lyndon Da Cruz , Christos Bergeles

An NLP model that understands stories should be able to understand the characters in them. To support the development of neural models for this purpose, we construct a benchmark, Story2Personality. The task is to predict a movie character's…

Artificial Intelligence · Computer Science 2022-10-21 Yisi Sang , Xiangyang Mou , Mo Yu , Dakuo Wang , Jing Li , Jeffrey Stanton

Multimodal models, such as the Contrastive Language-Image Pre-training (CLIP) model, have demonstrated remarkable success in aligning visual and linguistic representations. However, these models exhibit limitations when applied to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-02 Hiroshi Sasaki

Language tasks involving character-level manipulations (e.g., spelling corrections, arithmetic operations, word games) are challenging for models operating on subword units. To address this, we develop a causal intervention framework to…

Computation and Language · Computer Science 2023-12-20 Jing Huang , Zhengxuan Wu , Kyle Mahowald , Christopher Potts

Contrastive learning is a well-established paradigm in representation learning. The standard framework of contrastive learning minimizes the distance between "similar" instances and maximizes the distance between dissimilar ones in the…

Machine Learning · Computer Science 2025-02-06 Naghmeh Ghanooni , Barbod Pajoum , Harshit Rawal , Sophie Fellenz , Vo Nguyen Le Duy , Marius Kloft

Dense correspondence across semantically related images has been extensively studied, but still faces two challenges: 1) large variations in appearance, scale and pose exist even for objects from the same category, and 2) labeling…

Computer Vision and Pattern Recognition · Computer Science 2022-03-11 Taihong Xiao , Sifei Liu , Shalini De Mello , Zhiding Yu , Jan Kautz , Ming-Hsuan Yang

Recent multimodal models such as Contrastive Language-Image Pre-training (CLIP) have shown remarkable ability to align visual and linguistic representations. However, domains where small visual differences carry large semantic significance,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Hiroshi Sasaki

Contrastive learning is a discriminative approach that aims at grouping similar samples closer and diverse samples far from each other. It it an efficient technique to train an encoder generating distinguishable and informative…

Computer Vision and Pattern Recognition · Computer Science 2021-07-19 Qing Chen , Jian Zhang

This paper studies a text classification algorithm based on an improved Transformer to improve the performance and efficiency of the model in text classification tasks. Aiming at the shortcomings of the traditional Transformer model in…

Computation and Language · Computer Science 2025-01-24 Jia Gao , Guiran Liu , Binrong Zhu , Shicheng Zhou , Hongye Zheng , Xiaoxuan Liao

Large language models (LLMs) have demonstrated impressive performance and spurred numerous AI applications, in which role-playing agents (RPAs) are particularly popular, especially for fictional characters. The prerequisite for these RPAs…

Computation and Language · Computer Science 2024-10-10 Xinfeng Yuan , Siyu Yuan , Yuhan Cui , Tianhe Lin , Xintao Wang , Rui Xu , Jiangjie Chen , Deqing Yang

We study self-supervised learning on graphs using contrastive methods. A general scheme of prior methods is to optimize two-view representations of input graphs. In many studies, a single graph-level representation is computed as one of the…

Machine Learning · Computer Science 2021-07-22 Xinyi Xu , Cheng Deng , Yaochen Xie , Shuiwang Ji

We propose to improve text recognition from a new perspective by separating the text content from complex backgrounds. As vanilla GANs are not sufficiently robust to generate sequence-like characters in natural images, we propose an…

Computer Vision and Pattern Recognition · Computer Science 2020-12-15 Canjie Luo , Qingxiang Lin , Yuliang Liu , Lianwen Jin , Chunhua Shen

Inspired by the great success of language model (LM)-based pre-training, recent studies in visual document understanding have explored LM-based pre-training methods for modeling text within document images. Among them, pre-training that…

Computer Vision and Pattern Recognition · Computer Science 2023-09-25 Daehee Kim , Yoonsik Kim , DongHyun Kim , Yumin Lim , Geewook Kim , Taeho Kil

Contrastively trained vision-language models have achieved remarkable progress in vision and language representation learning, leading to state-of-the-art models for various downstream multimodal tasks. However, recent research has…

Computation and Language · Computer Science 2023-10-26 Harman Singh , Pengchuan Zhang , Qifan Wang , Mengjiao Wang , Wenhan Xiong , Jingfei Du , Yu Chen

Contrastive learning has revolutionized self-supervised image representation learning field, and recently been adapted to video domain. One of the greatest advantages of contrastive learning is that it allows us to flexibly define powerful…

Computer Vision and Pattern Recognition · Computer Science 2021-08-06 Haofei Kuang , Yi Zhu , Zhi Zhang , Xinyu Li , Joseph Tighe , Sören Schwertfeger , Cyrill Stachniss , Mu Li

Multi-modal semantic understanding requires integrating information from different modalities to extract users' real intention behind words. Most previous work applies a dual-encoder structure to separately encode image and text, but fails…

Computation and Language · Computer Science 2024-03-12 Ming Zhang , Ke Chang , Yunfang Wu

Large language models (LLMs) are becoming increasingly important for machine learning applications. However, it can be challenging to align LLMs with our intent, particularly when we want to generate content that is preferable over others…

Computation and Language · Computer Science 2024-04-09 Xiang Gao , Kamalika Das

Selecting an appropriate response from many candidates given the utterances in a multi-turn dialogue is the key problem for a retrieval-based dialogue system. Existing work formalizes the task as matching between the utterances and a…

Computation and Language · Computer Science 2022-03-03 Wentao Zhang , Shuang Xu , Haoran Huang

Recently, as an effective way of learning latent representations, contrastive learning has been increasingly popular and successful in various domains. The success of constrastive learning in single-label classifications motivates us to…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Son D. Dao , Ethan Zhao , Dinh Phung , Jianfei Cai
‹ Prev 1 2 3 10 Next ›