Related papers: AnglE-optimized Text Embeddings

Scaling Sentence Embeddings with Large Language Models

Large language models (LLMs) have recently garnered significant interest. With in-context learning, LLMs achieve impressive results in various natural language tasks. However, the application of LLMs to sentence embeddings remains an area…

Computation and Language · Computer Science 2023-08-01 Ting Jiang, Shaohan Huang, Zhongzhi Luan, Deqing Wang, Fuzhen Zhuang

TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling

Recent efforts target spoken language models (SLMs) that not only listen but also speak for more natural human-LLM interaction. Joint speech-text modeling is a promising direction to achieve this. However, the effectiveness of recent speech…

Computation and Language · Computer Science 2026-02-06 Liang-Hsuan Tseng, Yi-Chang Chen, Kuan-Yi Lee, Da-Shan Shiu, Hung-yi Lee

BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models

Large language models (LLMs) call for extension of context to handle many critical applications. However, the existing approaches are prone to expensive costs and inferior quality of context extension. In this work, we propose Extensible…

Computation and Language · Computer Science 2024-02-20 Kun Luo, Zheng Liu, Shitao Xiao, Kang Liu

ULLME: A Unified Framework for Large Language Model Embeddings with Generation-Augmented Learning

Large Language Models (LLMs) excel in various natural language processing tasks, but leveraging them for dense passage embedding remains challenging. This is due to their causal attention mechanism and the misalignment between their…

Computation and Language · Computer Science 2024-08-08 Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Thien Huu Nguyen

Making Text Embedders Few-Shot Learners

Large language models (LLMs) with decoder-only architectures demonstrate remarkable in-context learning (ICL) capabilities. This feature enables them to effectively handle both familiar and novel tasks by utilizing examples provided within…

Information Retrieval · Computer Science 2024-09-25 Chaofan Li, MingHao Qin, Shitao Xiao, Jianlyu Chen, Kun Luo, Yingxia Shao, Defu Lian, Zheng Liu

LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text Classification

With the booming of Large Language Models (LLMs), prompt-learning has become a promising method mainly researched in various research areas. Recently, many attempts based on prompt-learning have been made to improve the performance of text…

Computation and Language · Computer Science 2024-06-07 Chun Liu, Hongguang Zhang, Kainan Zhao, Xinghai Ju, Lin Yang

Length-Induced Embedding Collapse in PLM-based Models

Text embeddings from PLM-based models enable a wide range of applications, yet their performance often degrades on longer texts. In this paper, we introduce a phenomenon we call Length Collapse, where embeddings of longer texts tend to…

Computation and Language · Computer Science 2025-06-11 Yuqi Zhou, Sunhao Dai, Zhanshuo Cao, Xiao Zhang, Jun Xu

An Effective Approach to Embedding Source Code by Combining Large Language and Sentence Embedding Models

The advent of large language models (LLMs) has significantly advanced artificial intelligence (AI) in software engineering (SE), with source code embeddings playing a crucial role in tasks such as source code clone detection and source code…

Software Engineering · Computer Science 2025-06-04 Zixiang Xian, Chenhui Cui, Rubing Huang, Chunrong Fang, Zhenyu Chen

Dissecting Deep Metric Learning Losses for Image-Text Retrieval

Visual-Semantic Embedding (VSE) is a prevalent approach in image-text retrieval by learning a joint embedding space between the image and language modalities where semantic similarities would be preserved. The triplet loss with…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Hong Xuan, Xi Chen

Text4Seg: Reimagining Image Segmentation as Text Generation

Multimodal Large Language Models (MLLMs) have shown exceptional capabilities in vision-language tasks; however, effectively integrating image segmentation into these models remains a significant challenge. In this paper, we introduce…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Mengcheng Lan, Chaofeng Chen, Yue Zhou, Jiaxing Xu, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang

Beyond MLE: Convex Learning for Text Generation

Maximum likelihood estimation (MLE) is a statistical method used to estimate the parameters of a probability distribution that best explain the observed data. In the context of text generation, MLE is often used to train generative language…

Computation and Language · Computer Science 2023-10-27 Chenze Shao, Zhengrui Ma, Min Zhang, Yang Feng

Improving General Text Embedding Model: Tackling Task Conflict and Data Imbalance through Model Merging

Text embeddings are vital for tasks such as text retrieval and semantic textual similarity (STS). Recently, the advent of pretrained language models, along with unified benchmarks like the Massive Text Embedding Benchmark (MTEB), has…

Computation and Language · Computer Science 2024-10-22 Mingxin Li, Zhijie Nie, Yanzhao Zhang, Dingkun Long, Richong Zhang, Pengjun Xie

Text4Seg++: Advancing Image Segmentation via Generative Language Modeling

Multimodal Large Language Models (MLLMs) have shown exceptional capabilities in vision-language tasks. However, effectively integrating image segmentation into these models remains a significant challenge. In this work, we propose a novel…

Computer Vision and Pattern Recognition · Computer Science 2025-09-09 Mengcheng Lan, Chaofeng Chen, Jiaxing Xu, Zongrui Li, Yiping Ke, Xudong Jiang, Yingchen Yu, Yunqing Zhao, Song Bai

Resonance RoPE: Improving Context Length Generalization of Large Language Models

This paper addresses the challenge of train-short-test-long (TSTL) scenarios in Large Language Models (LLMs) equipped with Rotary Position Embedding (RoPE), where models pre-trained on shorter sequences face difficulty with…

Computation and Language · Computer Science 2024-09-05 Suyuchen Wang, Ivan Kobyzev, Peng Lu, Mehdi Rezagholizadeh, Bang Liu

Auto-Stega: An Agent-Driven System for Lifelong Strategy Evolution in LLM-Based Text Steganography

With the rapid progress of LLMs, high quality generative text has become widely available as a cover for text steganography. However, prevailing methods rely on hand-crafted or pre-specified strategies and struggle to balance efficiency,…

Cryptography and Security · Computer Science 2025-10-09 Jiuan Zhou, Yu Cheng, Yuan Xie, Zhaoxia Yin

Adaptive Margin Ranking Loss for Knowledge Graph Embeddings via a Correntropy Objective Function

Translation-based embedding models have gained significant attention in link prediction tasks for knowledge graphs. TransE is the primary model among translation-based embeddings and is well-known for its low complexity and high efficiency.…

Computation and Language · Computer Science 2019-07-12 Mojtaba Nayyeri, Xiaotian Zhou, Sahar Vahdati, Hamed Shariat Yazdi, Jens Lehmann

Efficient Tuning and Inference for Large Language Models on Textual Graphs

Rich textual and topological information of textual graphs need to be modeled in real-world applications such as webpages, e-commerce, and academic articles. Practitioners have been long following the path of adopting a shallow text encoder…

Computation and Language · Computer Science 2024-07-25 Yun Zhu, Yaoke Wang, Haizhou Shi, Siliang Tang

Efficient Sentence Embedding via Semantic Subspace Analysis

A novel sentence embedding method built upon semantic subspace analysis, called semantic subspace sentence embedding (S3E), is proposed in this work. Given the fact that word embeddings can capture semantic relationship while semantically…

Computation and Language · Computer Science 2020-03-05 Bin Wang, Fenxiao Chen, Yuncheng Wang, C.-C. Jay Kuo

Out-of-the-Box Conditional Text Embeddings from Large Language Models

Conditional text embedding is a proposed representation that captures the shift in perspective on texts when conditioned on a specific aspect. Previous methods have relied on extensive training data for fine-tuning models, leading to…

Computation and Language · Computer Science 2025-04-24 Kosuke Yamada, Peinan Zhang

Seeing Text in the Dark: Algorithm and Benchmark

Localizing text in low-light environments is challenging due to visual degradations. Although a straightforward solution involves a two-stage pipeline with low-light image enhancement (LLE) as the initial step followed by detector, LLE is…

Computer Vision and Pattern Recognition · Computer Science 2024-04-25 Chengpei Xu, Hao Fu, Long Ma, Wenjing Jia, Chengqi Zhang, Feng Xia, Xiaoyu Ai, Binghao Li, Wenjie Zhang