English
Related papers

Related papers: Embedding-Aligned Language Models

200 papers

Training agents to act competently in complex 3D environments from high-dimensional visual information is challenging. Reinforcement learning is conventionally used to train such agents, but requires a carefully designed reward function,…

Machine Learning · Computer Science 2025-12-30 Adam Jelley , Yuhan Cao , Dave Bignell , Amos Storkey , Sam Devlin , Tabish Rashid

Large Language Models (LLMs) excel in various natural language processing tasks, but leveraging them for dense passage embedding remains challenging. This is due to their causal attention mechanism and the misalignment between their…

Computation and Language · Computer Science 2024-08-08 Hieu Man , Nghia Trung Ngo , Franck Dernoncourt , Thien Huu Nguyen

Recent studies have uncovered the potential of Large Language Models (LLMs) in addressing complex sequential decision-making tasks through the provision of high-level instructions. However, LLM-based agents lack specialization in tackling…

Artificial Intelligence · Computer Science 2024-05-28 Zihao Zhou , Bin Hu , Chenyang Zhao , Pu Zhang , Bin Liu

Large language models (LLMs) exhibit remarkable capabilities across diverse tasks, yet aligning them efficiently and effectively with human expectations remains a critical challenge. This thesis advances LLM alignment by introducing novel…

Computation and Language · Computer Science 2025-06-12 Yuxin Jiang

Ensembling large language models (LLMs) can effectively combine diverse strengths of different models, offering a promising approach to enhance performance across various tasks. However, existing methods typically rely on fixed weighting…

Machine Learning · Computer Science 2025-06-03 Yuqian Fu , Yuanheng Zhu , Jiajun Chai , Guojun Yin , Wei Lin , Qichao Zhang , Dongbin Zhao

Reinforcement Learning (RL) has traditionally focused on training specialized agents to optimize predefined reward functions within narrowly defined environments. However, the advent of powerful Large Language Models (LLMs) and increasingly…

Artificial Intelligence · Computer Science 2026-05-18 Fangming Cui , Ruixiao Zhu , Cheng Fang , Sunan Li , Jiahong Li

The advent of Large Language Models (LLMs) has revolutionized language understanding and human-like text generation, drawing interest from many other fields with this question in mind: What else are the LLMs capable of? Despite their…

Artificial Intelligence · Computer Science 2024-10-24 Nurullah Sevim , Mostafa Ibrahim , Sabit Ekin

Recently, Multimodal Large Language Models (MLLMs) have sparked great research interests owing to their exceptional content-reasoning and instruction-following capabilities. To effectively instruct an MLLM, in addition to conventional…

Computer Vision and Pattern Recognition · Computer Science 2024-09-27 Jiacheng Zhang , Yang Jiao , Shaoxiang Chen , Jingjing Chen , Yu-Gang Jiang

Large Language Models (LLMs) have proven immensely beneficial in education by capturing vast amounts of literature-based information, allowing them to generate context without relying on external sources. In this paper, we propose a…

Information Retrieval · Computer Science 2025-07-03 Umar Ali Khan , Ekram Khan , Fiza Khan , Athar Ali Moinuddin

Multi-modal Large Language Models (MLLMs) have advanced greatly in general tasks. However, they still face challenges in geometric reasoning, a task that requires synergistic integration of visual recognition proficiency and complex…

Computer Vision and Pattern Recognition · Computer Science 2026-02-10 Zhihao Li , Yao Du , Yang Liu , Yan Zhang , Yufang Liu , Mengdi Zhang , Xunliang Cai , Charles Ling , Boyu Wang

Recent SOTA approaches for embodied learning via interaction directly employ large language models (LLMs) as agents to determine the next steps in an environment. Due to their world knowledge and reasoning capabilities, LLM agents achieve…

Computation and Language · Computer Science 2024-07-15 Abhay Zala , Jaemin Cho , Han Lin , Jaehong Yoon , Mohit Bansal

Employing large language models (LLMs) to enable embodied agents has become popular, yet it presents several limitations in practice. In this work, rather than using LLMs directly as agents, we explore their use as tools for embodied agent…

Artificial Intelligence · Computer Science 2024-11-28 Yujeong Lee , Sangwoo Shin , Wei-Jin Park , Honguk Woo

Large Language Models (LLMs) have emerged as one of the most significant technological advancements in artificial intelligence in recent years. Their ability to understand, generate, and reason with natural language has transformed how we…

Artificial Intelligence · Computer Science 2025-07-03 Yanfei Zhang

Large Language Models (LLMs) are transforming artificial intelligence, enabling autonomous agents to perform diverse tasks across various domains. These agents, proficient in human-like text comprehension and generation, have the potential…

Artificial Intelligence · Computer Science 2024-04-10 Saikat Barua

Fine-tuning LLM-based text embedders via contrastive learning maps inputs and outputs into a new representational space, discarding the LLM's output semantics. We propose LLM2Vec-Gen, a self-supervised alternative that instead produces…

Computation and Language · Computer Science 2026-04-03 Parishad BehnamGhader , Vaibhav Adlakha , Fabian David Schmidt , Nicolas Chapados , Marius Mosbach , Siva Reddy

Inspired by the exceptional general intelligence of Large Language Models (LLMs), researchers have begun to explore their application in pioneering the next generation of recommender systems - systems that are conversational, explainable,…

Information Retrieval · Computer Science 2024-08-06 Wensheng Lu , Jianxun Lian , Wei Zhang , Guanghua Li , Mingyang Zhou , Hao Liao , Xing Xie

The increasing prevalence of Large Language Models (LMs) in critical applications highlights the need for controlled language generation strategies that are not only computationally efficient but that also enjoy performance guarantees. To…

Computation and Language · Computer Science 2026-03-16 Emily Cheng , Carmen Amo Alonso

Language Models and Vision Language Models have recently demonstrated unprecedented capabilities in terms of understanding human intentions, reasoning, scene understanding, and planning-like behaviour, in text form, among many others. In…

Agents built with large language models (LLMs) have shown great potential across a wide range of domains. However, in complex decision-making tasks, pure LLM-based agents tend to exhibit intrinsic bias in their choice of actions, which is…

Artificial Intelligence · Computer Science 2025-05-30 Zelai Xu , Chao Yu , Fei Fang , Yu Wang , Yi Wu

Language model (LM) agents have gained significant attention for their ability to autonomously complete tasks through interactions with environments, tools, and APIs. LM agents are primarily built with prompt engineering or supervised…

Artificial Intelligence · Computer Science 2025-07-22 Renxi Wang , Rifo Ahmad Genadi , Bilal El Bouardi , Yongxin Wang , Fajri Koto , Zhengzhong Liu , Timothy Baldwin , Haonan Li
‹ Prev 1 2 3 10 Next ›