Related papers: DiffCSE: Difference-based Contrastive Learning for…

200 papers

Several prior studies have suggested that word frequency biases can cause the BERT model to learn indistinguishable sentence embeddings. Contrastive learning schemes such as SimCSE and ConSERT have already been adopted successfully in…

Computation and Language · Computer Science 2023-09-15 Pu Miao, Zeyao Du, Junlin Zhang

This paper presents SimCSE, a simple contrastive learning framework that greatly advances state-of-the-art sentence embeddings. We first describe an unsupervised approach, which takes an input sentence and predicts itself in a contrastive…

Computation and Language · Computer Science 2022-05-19 Tianyu Gao, Xingcheng Yao, Danqi Chen

Contrastive learning has been the dominant approach to train state-of-the-art sentence embeddings. Previous studies have typically learned sentence embeddings either through the use of human-annotated natural language inference (NLI) data…

Computation and Language · Computer Science 2023-10-25 Junlei Zhang, Zhenzhong Lan, Junxian He

Unsupervised sentence representation learning is one of the fundamental problems in natural language processing, with various downstream applications. Recently, contrastive learning has been widely adopted, which derives high-quality sentence…

Computation and Language · Computer Science 2023-05-29 Jiduan Liu, Jiahao Liu, Qifan Wang, Jingang Wang, Wei Wu, Yunsen Xian, Dongyan Zhao, Kai Chen, Rui Yan

Sample contrastive methods, typically referred to simply as contrastive, are the foundation of most unsupervised methods to learn text and sentence embeddings. On the other hand, a different class of self-supervised loss functions and…

Computation and Language · Computer Science 2023-10-30 Marco Farina, Duccio Pappadopulo

Contrastive learning has been extensively studied in sentence embedding learning, which assumes that the embeddings of different views of the same sentence are closer. The constraint brought by this assumption is weak, and a good sentence…

Computation and Language · Computer Science 2022-10-17 Xing Wu, Chaochen Gao, Zijia Lin, Jizhong Han, Zhongyuan Wang, Songlin Hu

Unsupervised sentence embedding aims to obtain the most appropriate embedding for a sentence to reflect its semantics. Contrastive learning has been attracting growing attention. For a sentence, current models utilize diverse data…

Computation and Language · Computer Science 2022-03-03 Hao Wang, Yangguang Li, Zhen Huang, Yong Dou, Lingpeng Kong, Jing Shao

Contrastive learning has been attracting much attention for learning unsupervised sentence embeddings. The current state-of-the-art unsupervised method is the unsupervised SimCSE (unsup-SimCSE). Unsup-SimCSE takes dropout as a minimal data…

Computation and Language · Computer Science 2022-09-13 Xing Wu, Chaochen Gao, Liangjun Zang, Jizhong Han, Zhongyuan Wang, Songlin Hu

This paper improves contrastive learning for sentence embeddings from two perspectives: handling dropout noise and addressing feature corruption. Specifically, for the first perspective, we identify that the dropout noise from negative…

Computation and Language · Computer Science 2023-12-25 Jiahao Xu, Wei Shao, Lihui Chen, Lemao Liu

This paper presents miCSE, a mutual information-based contrastive learning framework that significantly advances the state-of-the-art in few-shot sentence embedding. The proposed approach imposes alignment between the attention pattern of…

Computation and Language · Computer Science 2023-05-24 Tassilo Klein, Moin Nabi

Contrastive learning has been studied for improving the performance of learning sentence embeddings. The current state-of-the-art method is SimCSE, which takes dropout as the data augmentation method and feeds a pre-trained transformer…

Computation and Language · Computer Science 2021-11-25 Junlei Zhang, Zhenzhong Lan

This paper describes Difference-aware Deep continuous prompt for Contrastive Sentence Embeddings (D2CSE) that learns sentence embeddings. Compared to state-of-the-art approaches, D2CSE computes sentence vectors that are exceptional to…

Computation and Language · Computer Science 2023-04-19 Hyunjae Lee

Universal cross-lingual sentence embeddings map semantically similar cross-lingual sentences into a shared embedding space. Aligning cross-lingual sentence embeddings usually requires supervised cross-lingual parallel sentences. In this…

Computation and Language · Computer Science 2022-11-14 Yau-Shian Wang, Ashley Wu, Graham Neubig

We propose reCSE, a self-supervised contrastive learning sentence representation framework based on feature reshaping. This framework is different from the current advanced models that use discrete data augmentation methods, but instead…

Computation and Language · Computer Science 2024-08-27 Fufangchen Zhao, Jian Gao, Danfeng Yan

Learning sentence embeddings from dialogues has drawn increasing attention due to its low annotation cost and high domain adaptability. Conventional approaches employ the siamese-network for this task, which obtains the sentence embeddings…

Computation and Language · Computer Science 2021-09-28 Che Liu, Rui Wang, Jinghua Liu, Jian Sun, Fei Huang, Luo Si

Contrastive learning-based methods, such as unsup-SimCSE, have achieved state-of-the-art (SOTA) performances in learning unsupervised sentence embeddings. However, in previous studies, each embedding used for contrastive learning only…

Computation and Language · Computer Science 2023-05-19 Hongliang He, Junlei Zhang, Zhenzhong Lan, Yue Zhang

Generic sentence embeddings provide a coarse-grained approximation of semantic textual similarity but ignore specific aspects that make texts similar. Conversely, aspect-based sentence embeddings provide similarities between texts based on…

Computation and Language · Computer Science 2023-09-26 Tim Schopf, Emanuel Gerber, Malte Ostendorff, Florian Matthes

Following SimCSE, contrastive learning based methods have achieved the state-of-the-art (SOTA) performance in learning sentence embeddings. However, the unsupervised contrastive learning methods still lag far behind the supervised…

Computation and Language · Computer Science 2022-06-07 Wei Wang, Liangzhu Ge, Jingqiao Zhang, Cheng Yang

Traditional contrastive learning for sentence embeddings uses the encoder directly to extract sentence features, and then passes them to the contrastive loss function for learning. However, this method pays too much attention to the sentence body and…

Computation and Language · Computer Science 2023-06-19 Wei Zhang, Xu Chen

Learning sentence embeddings in an unsupervised manner is fundamental in natural language processing. Recent common practice is to couple pre-trained language models with unsupervised contrastive learning, whose success relies on augmenting…

Computation and Language · Computer Science 2022-10-20 Qiyu Wu, Chongyang Tao, Tao Shen, Can Xu, Xiubo Geng, Daxin Jiang