Related papers: Aligner: Efficient Alignment by Learning to Correc…

Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction

The rapid advancement of large language models (LLMs) has led to significant improvements in their capabilities, but also to increased concerns about their alignment with human values and intentions. Current alignment strategies, including…

Computation and Language · Computer Science 2025-01-10 Hantao Lou , Jiaming Ji , Kaile Wang , Yaodong Yang

P-Aligner: Enabling Pre-Alignment of Language Models via Principled Instruction Synthesis

Large Language Models (LLMs) are expected to produce safe, helpful, and honest content during interaction with human users, but they frequently fail to align with such values when given flawed instructions, e.g., missing context, ambiguous…

Computation and Language · Computer Science 2025-08-07 Feifan Song , Bofei Gao , Yifan Song , Yi Liu , Weimin Xiong , Yuyang Song , Tianyu Liu , Guoyin Wang , Houfeng Wang

Aligners: Decoupling LLMs and Alignment

Large Language Models (LLMs) need to be aligned with human expectations to ensure their safety and utility in most applications. Alignment is challenging, costly, and needs to be repeated for every LLM and alignment criterion. We propose to…

Computation and Language · Computer Science 2024-10-07 Lilian Ngweta , Mayank Agarwal , Subha Maity , Alex Gittens , Yuekai Sun , Mikhail Yurochkin

InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance

With the rapid development of large language models (LLMs), they are not only used as general-purpose AI assistants but are also customized through further fine-tuning to meet the requirements of different applications. A pivotal factor in…

Computation and Language · Computer Science 2024-01-23 Pengyu Wang , Dong Zhang , Linyang Li , Chenkun Tan , Xinghao Wang , Ke Ren , Botian Jiang , Xipeng Qiu

Aligner: One Global Token is Worth Millions of Parameters When Aligning Large Language Models

We introduce Aligner, a novel Parameter-Efficient Fine-Tuning (PEFT) method for aligning multi-billion-parameter-sized Large Language Models (LLMs). Aligner employs a unique design that constructs a globally shared set of tunable tokens…

Computation and Language · Computer Science 2023-12-12 Zhou Ziheng , Yingnian Wu , Song-Chun Zhu , Demetri Terzopoulos

Efficient Model-agnostic Alignment via Bayesian Persuasion

With recent advancements in large language models (LLMs), alignment has emerged as an effective technique for keeping LLMs consensus with human intent. Current methods primarily involve direct training through Supervised Fine-tuning (SFT)…

Computation and Language · Computer Science 2024-05-30 Fengshuo Bai , Mingzhi Wang , Zhaowei Zhang , Boyuan Chen , Yinda Xu , Ying Wen , Yaodong Yang

MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models

Recent advancements in large language models (LLMs) focus on aligning to heterogeneous human expectations and values via multi-objective preference alignment. However, existing methods are dependent on the policy model parameters, which…

Computation and Language · Computer Science 2025-07-22 Kailai Yang , Zhiwei Liu , Qianqian Xie , Jimin Huang , Tianlin Zhang , Sophia Ananiadou

Towards Efficient and Effective Alignment of Large Language Models

Large language models (LLMs) exhibit remarkable capabilities across diverse tasks, yet aligning them efficiently and effectively with human expectations remains a critical challenge. This thesis advances LLM alignment by introducing novel…

Computation and Language · Computer Science 2025-06-12 Yuxin Jiang

Learning Backward Compatible Embeddings

Embeddings, low-dimensional vector representation of objects, are fundamental in building modern machine learning systems. In industrial settings, there is usually an embedding team that trains an embedding model to solve intended tasks…

Machine Learning · Statistics 2022-06-08 Weihua Hu , Rajas Bansal , Kaidi Cao , Nikhil Rao , Karthik Subbian , Jure Leskovec

DeclareAligner: A Leap Towards Efficient Optimal Alignments for Declarative Process Model Conformance Checking

In many engineering applications, processes must be followed precisely, making conformance checking between event logs and declarative process models crucial for ensuring adherence to desired behaviors. This is a critical area where…

Artificial Intelligence · Computer Science 2025-08-08 Jacobo Casas-Ramos , Manuel Lama , Manuel Mucientes

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

Aligning Large Language Models (LLMs) with human values and preferences is essential for making them helpful and safe. However, building efficient tools to perform alignment can be challenging, especially for the largest and most competent…

Computation and Language · Computer Science 2024-09-04 Gerald Shen , Zhilin Wang , Olivier Delalleau , Jiaqi Zeng , Yi Dong , Daniel Egert , Shengyang Sun , Jimmy Zhang , Sahil Jain , Ali Taghibakhshi , Markel Sanz Ausin , Ashwath Aithal , Oleksii Kuchaiev

We Think, Therefore We Align LLMs to Helpful, Harmless and Honest Before They Go Wrong

Alignment of Large Language Models (LLMs) is the ability to satisfy desired objectives during generation, which is critical for trustworthy deployment. In practice, alignment is often operationalized through multiple objectives such as…

Computation and Language · Computer Science 2026-05-19 Gautam Siddharth Kashyap , Mark Dras , Usman Naseem

Aligning Large Language Models for Controllable Recommendations

Inspired by the exceptional general intelligence of Large Language Models (LLMs), researchers have begun to explore their application in pioneering the next generation of recommender systems - systems that are conversational, explainable,…

Information Retrieval · Computer Science 2024-08-06 Wensheng Lu , Jianxun Lian , Wei Zhang , Guanghua Li , Mingyang Zhou , Hao Liao , Xing Xie

Efficient Alignment of Large Language Models via Data Sampling

LLM alignment ensures that large language models behave safely and effectively by aligning their outputs with human values, goals, and intentions. Aligning LLMs employ huge amounts of data, computation, and time. Moreover, curating data…

Machine Learning · Computer Science 2025-02-19 Amrit Khera , Rajat Ghosh , Debojyoti Dutta

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

The rapid development of large language models (LLMs) has not only provided numerous opportunities but also presented significant challenges. This becomes particularly evident when LLMs inadvertently generate harmful or toxic content,…

Computation and Language · Computer Science 2024-02-20 Kai Chen , Chunwei Wang , Kuo Yang , Jianhua Han , Lanqing Hong , Fei Mi , Hang Xu , Zhengying Liu , Wenyong Huang , Zhenguo Li , Dit-Yan Yeung , Lifeng Shang , Xin Jiang , Qun Liu

Decoupled Alignment for Robust Plug-and-Play Adaptation

We introduce a low-resource safety enhancement method for aligning large language models (LLMs) without the need for supervised fine-tuning (SFT) or reinforcement learning from human feedback (RLHF). Our main idea is to exploit knowledge…

Computation and Language · Computer Science 2024-06-07 Haozheng Luo , Jiahao Yu , Wenxin Zhang , Jialong Li , Jerry Yao-Chieh Hu , Xinyu Xing , Han Liu

PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter

This paper demonstrates that a progressively aligned language model can effectively bridge frozen vision encoders and large language models (LLMs). While the fundamental architecture and pre-training methods of vision encoders and LLMs have…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Junfei Xiao , Zheng Xu , Alan Yuille , Shen Yan , Boyu Wang

ALIGN: Prompt-based Attribute Alignment for Reliable, Responsible, and Personalized LLM-based Decision-Making

Large language models (LLMs) are increasingly being used as decision aids. However, users have diverse values and preferences that can affect their decision-making, which requires novel methods for LLM alignment and personalization.…

Computation and Language · Computer Science 2025-07-15 Bharadwaj Ravichandran , David Joy , Paul Elliott , Brian Hu , Jadie Adams , Christopher Funk , Emily Veenhuis , Anthony Hoogs , Arslan Basharat

EAGER-LLM: Enhancing Large Language Models as Recommenders through Exogenous Behavior-Semantic Integration

Large language models (LLMs) are increasingly leveraged as foundational backbones in the development of advanced recommender systems, offering enhanced capabilities through their extensive knowledge and reasoning. Existing llm-based…

Information Retrieval · Computer Science 2025-02-21 Minjie Hong , Yan Xia , Zehan Wang , Jieming Zhu , Ye Wang , Sihang Cai , Xiaoda Yang , Quanyu Dai , Zhenhua Dong , Zhimeng Zhang , Zhou Zhao

Align then Train: Efficient Retrieval Adapter Learning

Dense retrieval systems increasingly need to handle complex queries. In many realistic settings, users express intent through long instructions or task-specific descriptions, while target documents remain relatively simple and static. This…

Information Retrieval · Computer Science 2026-04-07 Seiji Maekawa , Moin Aminnaseri , Pouya Pezeshkpour , Estevam Hruschka