Related papers: Value Alignment from Unstructured Text

Efficient Alignment of Large Language Models via Data Sampling

LLM alignment ensures that large language models behave safely and effectively by aligning their outputs with human values, goals, and intentions. Aligning LLMs employ huge amounts of data, computation, and time. Moreover, curating data…

Machine Learning · Computer Science 2025-02-19 Amrit Khera , Rajat Ghosh , Debojyoti Dutta

Quantile Regression with Large Language Models for Price Prediction

Large Language Models (LLMs) have shown promise in structured prediction tasks, including regression, but existing approaches primarily focus on point estimates and lack systematic comparison across different methods. We investigate…

Computation and Language · Computer Science 2025-06-10 Nikhita Vedula , Dushyanta Dhyani , Laleh Jalali , Boris Oreshkin , Mohsen Bayati , Shervin Malmasi

Towards Efficient and Effective Alignment of Large Language Models

Large language models (LLMs) exhibit remarkable capabilities across diverse tasks, yet aligning them efficiently and effectively with human expectations remains a critical challenge. This thesis advances LLM alignment by introducing novel…

Computation and Language · Computer Science 2025-06-12 Yuxin Jiang

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

The rapid development of large language models (LLMs) has not only provided numerous opportunities but also presented significant challenges. This becomes particularly evident when LLMs inadvertently generate harmful or toxic content,…

Computation and Language · Computer Science 2024-02-20 Kai Chen , Chunwei Wang , Kuo Yang , Jianhua Han , Lanqing Hong , Fei Mi , Hang Xu , Zhengying Liu , Wenyong Huang , Zhenguo Li , Dit-Yan Yeung , Lifeng Shang , Xin Jiang , Qun Liu

LLM-as-classifier: Semi-Supervised, Iterative Framework for Hierarchical Text Classification using Large Language Models

The advent of Large Language Models (LLMs) has provided unprecedented capabilities for analyzing unstructured text data. However, deploying these models as reliable, robust, and scalable classifiers in production environments presents…

Computation and Language · Computer Science 2025-08-25 Doohee You , Andy Parisi , Zach Vander Velden , Lara Dantas Inojosa

From Instructions to Intrinsic Human Values -- A Survey of Alignment Goals for Big Models

Big models, exemplified by Large Language Models (LLMs), are models typically pre-trained on massive data and comprised of enormous parameters, which not only obtain significantly improved performance across diverse tasks but also present…

Artificial Intelligence · Computer Science 2023-09-06 Jing Yao , Xiaoyuan Yi , Xiting Wang , Jindong Wang , Xing Xie

Towards Accurate and Efficient Document Analytics with Large Language Models

Unstructured data formats account for over 80% of the data currently stored, and extracting value from such formats remains a considerable challenge. In particular, current approaches for managing unstructured documents do not support…

Databases · Computer Science 2024-05-09 Yiming Lin , Madelon Hulsebos , Ruiying Ma , Shreya Shankar , Sepanta Zeigham , Aditya G. Parameswaran , Eugene Wu

A Text is Worth Several Tokens: Text Embedding from LLMs Secretly Aligns Well with The Key Tokens

Text embeddings from large language models (LLMs) have achieved excellent results in tasks such as information retrieval, semantic textual similarity, etc. In this work, we show an interesting finding: when feeding a text into the LLM-based…

Computation and Language · Computer Science 2025-07-08 Zhijie Nie , Richong Zhang , Zhanyu Wu

Align to Structure: Aligning Large Language Models with Structural Information

Generating long, coherent text remains a challenge for large language models (LLMs), as they lack hierarchical planning and structured organization in discourse generation. We introduce Structural Alignment, a novel method that aligns LLMs…

Computation and Language · Computer Science 2026-02-04 Zae Myung Kim , Anand Ramachandran , Farideh Tavazoee , Joo-Kyung Kim , Oleg Rokhlenko , Dongyeop Kang

Embedding-Based Rankings of Educational Resources based on Learning Outcome Alignment: Benchmarking, Expert Validation, and Learner Performance

As the online learning landscape evolves, the need for personalization is increasingly evident. Although educational resources are burgeoning, educators face challenges selecting materials that both align with intended learning outcomes and…

Computers and Society · Computer Science 2025-12-16 Mohammadreza Molavi , Mohammad Moein , Mohammadreza Tavakoli , Abdolali Faraji , Stefan T. Mol , Gábor Kismihók

A Teacher Is Worth A Million Instructions

Large Language Models(LLMs) have shown exceptional abilities, yet training these models can be quite challenging. There is a strong dependence on the quality of data and finding the best instruction tuning set. Further, the inherent…

Machine Learning · Computer Science 2024-06-28 Nikhil Kothari , Ravindra Nayak , Shreyas Shetty , Amey Patil , Nikesh Garera

Aligning LLMs for Multilingual Consistency in Enterprise Applications

Large language models (LLMs) remain unreliable for global enterprise applications due to substantial performance gaps between high-resource and mid/low-resource languages, driven by English-centric pretraining and internal reasoning biases.…

Computation and Language · Computer Science 2025-10-28 Amit Agarwal , Hansa Meghwani , Hitesh Laxmichand Patel , Tao Sheng , Sujith Ravi , Dan Roth

Self-Alignment: Improving Alignment of Cultural Values in LLMs via In-Context Learning

Improving the alignment of Large Language Models (LLMs) with respect to the cultural values that they encode has become an increasingly important topic. In this work, we study whether we can exploit existing knowledge about cultural values…

Computation and Language · Computer Science 2025-09-09 Rochelle Choenni , Ekaterina Shutova

Bottom-Up and Top-Down Analysis of Values, Agendas, and Observations in Corpora and LLMs

Large language models (LLMs) generate diverse, situated, persuasive texts from a plurality of potential perspectives, influenced heavily by their prompts and training data. As part of LLM adoption, we seek to characterize - and ideally,…

Computation and Language · Computer Science 2024-11-11 Scott E. Friedman , Noam Benkler , Drisana Mosaphir , Jeffrey Rye , Sonja M. Schmer-Galunder , Micah Goldwater , Matthew McLure , Ruta Wheelock , Jeremy Gottlieb , Robert P. Goldman , Christopher Miller

TransformLLM: Adapting Large Language Models via LLM-Transformed Reading Comprehension Text

Large Language Models (LLMs) have shown promise in highly-specialized domains, however challenges are still present in aspects of accuracy and costs. These limitations restrict the usage of existing models in domain-specific tasks. While…

Computation and Language · Computer Science 2024-10-30 Iftach Arbel , Yehonathan Refael , Ofir Lindenbaum

Transforming Expert Knowledge into Scalable Ontology via Large Language Models

Having a unified, coherent taxonomy is essential for effective knowledge representation in domain-specific applications as diverse terminologies need to be mapped to underlying concepts. Traditional manual approaches to taxonomy alignment…

Artificial Intelligence · Computer Science 2025-06-12 Ikkei Itoku , David Theil , Evelyn Eichelsdoerfer Uehara , Sreyoshi Bhaduri , Junnosuke Kuroda , Toshi Yumoto , Alex Gil , Natalie Perez , Rajesh Cherukuri , Naumaan Nayyar

Value Lens: Using Large Language Models to Understand Human Values

The autonomous decision-making process, which is increasingly applied to computer systems, requires that the choices made by these systems align with human values. In this context, systems must assess how well their decisions reflect human…

Computers and Society · Computer Science 2025-12-19 Eduardo de la Cruz Fernández , Marcelo Karanik , Sascha Ossowski

Measuring Teaching with LLMs

Objective and scalable measurement of teaching quality is a persistent challenge in education. While Large Language Models (LLMs) offer potential, general-purpose models have struggled to reliably apply complex, authentic classroom…

Computation and Language · Computer Science 2025-11-07 Michael Hardy

Resource-Efficient Adaptation of Large Language Models for Text Embeddings via Prompt Engineering and Contrastive Fine-tuning

Large Language Models (LLMs) have become a cornerstone in Natural Language Processing (NLP), achieving impressive performance in text generation. Their token-level representations capture rich, human-aligned semantics. However, pooling…

Computation and Language · Computer Science 2025-09-25 Benedikt Roth , Stephan Rappensperger , Tianming Qiu , Hamza Imamović , Julian Wörmann , Hao Shen

Optimising Language Models for Downstream Tasks: A Post-Training Perspective

Language models (LMs) have demonstrated remarkable capabilities in NLP, yet adapting them efficiently and robustly to specific tasks remains challenging. As their scale and complexity grow, fine-tuning LMs on labelled data often…

Computation and Language · Computer Science 2025-06-27 Zhengyan Shi