Related papers: Sentence Embeddings using Supervised Contrastive L…

Self-Guided Contrastive Learning for BERT Sentence Representations

Although BERT and its variants have reshaped the NLP landscape, it still remains unclear how best to derive sentence embeddings from such pre-trained Transformers. In this work, we propose a contrastive learning method that utilizes…

Computation and Language · Computer Science 2021-06-15 Taeuk Kim, Kang Min Yoo, Sang-goo Lee

Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP Tasks

Contextualized representations from a pre-trained language model are central to achieve a high performance on downstream NLP task. The pre-trained BERT and A Lite BERT (ALBERT) models can be fine-tuned to give state-of-the-art results in…

Computation and Language · Computer Science 2021-01-27 Hyunjin Choi, Judong Kim, Seongho Joe, Youngjune Gwon

SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples

Unsupervised sentence embedding aims to obtain the most appropriate embedding for a sentence to reflect its semantic. Contrastive learning has been attracting developing attention. For a sentence, current models utilize diverse data…

Computation and Language · Computer Science 2022-03-03 Hao Wang, Yangguang Li, Zhen Huang, Yong Dou, Lingpeng Kong, Jing Shao

Learning Sentence Embeddings for Coherence Modelling and Beyond

We present a novel and effective technique for performing text coherence tasks while facilitating deeper insights into the data. Despite obtaining ever-increasing task performance, modern deep-learning approaches to NLP tasks often only…

Computation and Language · Computer Science 2019-08-09 Tanner Bohn, Yining Hu, Jinhang Zhang, Charles X. Ling

Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings

Semantic representation learning for sentences is an important and well-studied problem in NLP. The current trend for this task involves training a Transformer-based sentence encoder through a contrastive objective with text, i.e.,…

Computation and Language · Computer Science 2022-09-21 Yiren Jian, Chongyang Gao, Soroush Vosoughi

DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations

Sentence embeddings are an important component of many natural language processing (NLP) systems. Like word embeddings, sentence embeddings are typically learned on large text corpora and then transferred to various downstream tasks, such…

Computation and Language · Computer Science 2021-05-28 John Giorgi, Osvald Nitski, Bo Wang, Gary Bader

Sentence transition matrix: An efficient approach that preserves sentence semantics

Sentence embedding is a significant research topic in the field of natural language processing (NLP). Generating sentence embedding vectors reflecting the intrinsic meaning of a sentence is a key factor to achieve an enhanced performance in…

Computation and Language · Computer Science 2019-01-17 Myeongjun Jang, Pilsung Kang

An Unsupervised Sentence Embedding Method by Mutual Information Maximization

BERT is inefficient for sentence-pair tasks such as clustering or semantic search as it needs to evaluate combinatorially many sentence pairs which is very time-consuming. Sentence BERT (SBERT) attempted to solve this challenge by learning…

Computation and Language · Computer Science 2021-02-08 Yan Zhang, Ruidan He, Zuozhu Liu, Kwan Hui Lim, Lidong Bing

Sentence Embeddings in NLI with Iterative Refinement Encoders

Sentence-level representations are necessary for various NLP tasks. Recurrent neural networks have proven to be very effective in learning distributed representations and can be trained efficiently on natural language inference tasks. We…

Computation and Language · Computer Science 2019-08-15 Aarne Talman, Anssi Yli-Jyrä, Jörg Tiedemann

ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Learning high-quality sentence representations benefits a wide range of natural language processing tasks. Though BERT-based pre-trained language models achieve high performance on many downstream tasks, the native derived sentence…

Computation and Language · Computer Science 2021-05-26 Yuanmeng Yan, Rumei Li, Sirui Wang, Fuzheng Zhang, Wei Wu, Weiran Xu

Unsupervised Attention-based Sentence-Level Meta-Embeddings from Contextualised Language Models

A variety of contextualised language models have been proposed in the NLP community, which are trained on diverse corpora to produce numerous Neural Language Models (NLMs). However, different NLMs have reported different levels of…

Computation and Language · Computer Science 2022-04-19 Keigo Takahashi, Danushka Bollegala

MCSE: Multimodal Contrastive Learning of Sentence Embeddings

Learning semantically meaningful sentence embeddings is an open problem in natural language processing. In this work, we propose a sentence embedding learning approach that exploits both visual and textual information via a multimodal…

Computation and Language · Computer Science 2022-04-26 Miaoran Zhang, Marius Mosbach, David Ifeoluwa Adelani, Michael A. Hedderich, Dietrich Klakow

Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning

Though language model text embeddings have revolutionized NLP research, their ability to capture high-level semantic information, such as relations between entities in text, is limited. In this paper, we propose a novel contrastive learning…

Computation and Language · Computer Science 2023-10-10 Christos Theodoropoulos, James Henderson, Andrei C. Coman, Marie-Francine Moens

SenTest: Evaluating Robustness of Sentence Encoders

Contrastive learning has proven to be an effective method for pre-training models using weakly labeled data in the vision domain. Sentence transformers are the NLP counterparts to this architecture, and have been growing in popularity due…

Computation and Language · Computer Science 2023-11-30 Tanmay Chavan, Shantanu Patankar, Aditya Kane, Omkar Gokhale, Geetanjali Kale, Raviraj Joshi

Contrastive Learning of Sentence Embeddings from Scratch

Contrastive learning has been the dominant approach to train state-of-the-art sentence embeddings. Previous studies have typically learned sentence embeddings either through the use of human-annotated natural language inference (NLI) data…

Computation and Language · Computer Science 2023-10-25 Junlei Zhang, Zhenzhong Lan, Junxian He

SimCSE: Simple Contrastive Learning of Sentence Embeddings

This paper presents SimCSE, a simple contrastive learning framework that greatly advances state-of-the-art sentence embeddings. We first describe an unsupervised approach, which takes an input sentence and predicts itself in a contrastive…

Computation and Language · Computer Science 2022-05-19 Tianyu Gao, Xingcheng Yao, Danqi Chen

RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning

Pre-trained language models (PLMs) have consistently demonstrated outstanding performance across a diverse spectrum of natural language processing tasks. Nevertheless, despite their success with unseen data, current PLM-based…

Computation and Language · Computer Science 2024-03-19 Javad Rafiei Asl, Prajwal Panzade, Eduardo Blanco, Daniel Takabi, Zhipeng Cai

An Efficient Self-Supervised Cross-View Training For Sentence Embedding

Self-supervised sentence representation learning is the task of constructing an embedding space for sentences without relying on human annotation efforts. One straightforward approach is to finetune a pretrained language model (PLM) with a…

Computation and Language · Computer Science 2023-11-07 Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Lalita Lowphansirikul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong

A Sentence is Worth 128 Pseudo Tokens: A Semantic-Aware Contrastive Learning Framework for Sentence Embeddings

Contrastive learning has shown great potential in unsupervised sentence embedding tasks, e.g., SimCSE. However, we find that these existing solutions are heavily affected by superficial features like the length of sentences or syntactic…

Computation and Language · Computer Science 2022-03-14 Haochen Tan, Wei Shao, Han Wu, Ke Yang, Linqi Song

Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models

Sentence embedding is essential for many NLP tasks, with contrastive learning methods achieving strong performance using annotated datasets like NLI. Yet, the reliance on manual labels limits scalability. Recent studies leverage large…

Computation and Language · Computer Science 2025-06-05 Liyang He, Chenglong Liu, Rui Li, Zhenya Huang, Shulan Ruan, Jun Zhou, Enhong Chen