Related papers: Refining Language Models with Compositional Explan…

Regularizing Explanations in Bayesian Convolutional Neural Networks

Neural networks are powerful function approximators with tremendous potential in learning complex distributions. However, they are prone to overfitting on spurious patterns. Bayesian inference provides a principled way to regularize neural…

Machine Learning · Computer Science 2024-12-02 Yanzhe Bekkemoen , Helge Langseth

Enhancing Recommendation Explanations through User-Centric Refinement

Generating natural language explanations for recommendations has become increasingly important in recommender systems. Traditional approaches typically treat user reviews as ground truth for explanations and focus on improving review…

Information Retrieval · Computer Science 2025-02-18 Jingsen Zhang , Zihang Tian , Xueyang Feng , Xu Chen

Consistency Regularization for Cross-Lingual Fine-Tuning

Fine-tuning pre-trained cross-lingual language models can transfer task-specific supervision from one language to the others. In this work, we propose to improve cross-lingual fine-tuning with consistency regularization. Specifically, we…

Computation and Language · Computer Science 2021-06-16 Bo Zheng , Li Dong , Shaohan Huang , Wenhui Wang , Zewen Chi , Saksham Singhal , Wanxiang Che , Ting Liu , Xia Song , Furu Wei

Investigating the Working of Text Classifiers

Text classification is one of the most widely studied tasks in natural language processing. Motivated by the principle of compositionality, large multilayer neural network models have been employed for this task in an attempt to effectively…

Computation and Language · Computer Science 2018-08-07 Devendra Singh Sachan , Manzil Zaheer , Ruslan Salakhutdinov

Regularizing Black-box Models for Improved Interpretability

Most of the work on interpretable machine learning has focused on designing either inherently interpretable models, which typically trade-off accuracy for interpretability, or post-hoc explanation systems, whose explanation quality can be…

Machine Learning · Computer Science 2020-11-10 Gregory Plumb , Maruan Al-Shedivat , Angel Alexander Cabrera , Adam Perer , Eric Xing , Ameet Talwalkar

Feature-informed Embedding Space Regularization For Audio Classification

Feature representations derived from models pre-trained on large-scale datasets have shown their generalizability on a variety of audio analysis tasks. Despite this generalizability, however, task-specific features can outperform if…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-13 Yun-Ning Hung , Alexander Lerch

Regularization Through Reasoning: Systematic Improvements in Language Model Classification via Explanation-Enhanced Fine-Tuning

Fine-tuning LLMs for classification typically maps inputs directly to labels. We ask whether attaching brief explanations to each label during fine-tuning yields better models. We evaluate conversational response quality along three axes:…

Machine Learning · Computer Science 2026-03-03 Vivswan Shah , Randy Cogill , Hanwei Yue , Gopinath Chennupati , Rinat Khaziev

Robust Reading Comprehension with Linguistic Constraints via Posterior Regularization

In spite of great advancements of machine reading comprehension (RC), existing RC models are still vulnerable and not robust to different types of adversarial examples. Neural models over-confidently predict wrong answers to semantic…

Computation and Language · Computer Science 2019-11-19 Mantong Zhou , Minlie Huang , Xiaoyan Zhu

Learning to Explain: Answering Why-Questions via Rephrasing

Providing plausible responses to why questions is a challenging but critical goal for language based human-machine interaction. Explanations are challenging in that they require many different forms of abstract knowledge and reasoning.…

Computation and Language · Computer Science 2019-06-05 Allen Nie , Erin D. Bennett , Noah D. Goodman

SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization

Transfer learning has fundamentally changed the landscape of natural language processing (NLP) research. Many existing state-of-the-art models are first pre-trained on a large text corpus and then fine-tuned on downstream tasks. However,…

Computation and Language · Computer Science 2021-09-10 Haoming Jiang , Pengcheng He , Weizhu Chen , Xiaodong Liu , Jianfeng Gao , Tuo Zhao

Supervising Model Attention with Human Explanations for Robust Natural Language Inference

Natural Language Inference (NLI) models are known to learn from biases and artefacts within their training data, impacting how well they generalise to other unseen datasets. Existing de-biasing approaches focus on preventing the models from…

Computation and Language · Computer Science 2022-05-03 Joe Stacey , Yonatan Belinkov , Marek Rei

Regularizing Black-box Models for Improved Interpretability (HILL 2019 Version)

Most of the work on interpretable machine learning has focused on designing either inherently interpretable models, which typically trade-off accuracy for interpretability, or post-hoc explanation systems, which lack guarantees about their…

Machine Learning · Computer Science 2019-06-05 Gregory Plumb , Maruan Al-Shedivat , Eric Xing , Ameet Talwalkar

Feature Normalization for Fine-tuning Self-Supervised Models in Speech Enhancement

Large, pre-trained representation models trained using self-supervised learning have gained popularity in various fields of machine learning because they are able to extract high-quality salient features from input data. As such, they have…

Audio and Speech Processing · Electrical Eng. & Systems 2023-06-16 Hejung Yang , Hong-Goo Kang

Investigating Training and Generalization in Faithful Self-Explanations of Large Language Models

Large language models have the potential to generate explanations for their own predictions in a variety of styles based on user instructions. Recent research has examined whether these self-explanations faithfully reflect the models'…

Computation and Language · Computer Science 2025-12-09 Tomoki Doi , Masaru Isonuma , Hitomi Yanaka

An Evaluation of the Human-Interpretability of Explanation

Recent years have seen a boom in interest in machine learning systems that can provide a human-understandable rationale for their predictions or decisions. However, exactly what kinds of explanation are truly human-interpretable remains…

Machine Learning · Computer Science 2019-08-30 Isaac Lage , Emily Chen , Jeffrey He , Menaka Narayanan , Been Kim , Sam Gershman , Finale Doshi-Velez

Training Deep Models to be Explained with Fewer Examples

Although deep models achieve high predictive performance, it is difficult for humans to understand the predictions they made. Explainability is important for real-world applications to justify their reliability. Many example-based…

Machine Learning · Statistics 2021-12-08 Tomoharu Iwata , Yuya Yoshikawa

Teaching Meaningful Explanations

The adoption of machine learning in high-stakes applications such as healthcare and law has lagged in part because predictions are not accompanied by explanations comprehensible to the domain user, who often holds the ultimate…

Artificial Intelligence · Computer Science 2018-09-12 Noel C. F. Codella , Michael Hind , Karthikeyan Natesan Ramamurthy , Murray Campbell , Amit Dhurandhar , Kush R. Varshney , Dennis Wei , Aleksandra Mojsilovic

An Extended Framework for Marginalized Domain Adaptation

We propose an extended framework for marginalized domain adaptation, aimed at addressing unsupervised, supervised and semi-supervised scenarios. We argue that the denoising principle should be extended to explicitly promote domain-invariant…

Computer Vision and Pattern Recognition · Computer Science 2017-02-21 Gabriela Csurka , Boris Chidlovski , Stephane Clinchant , Sophia Michel

Adding noise to the input of a model trained with a regularized objective

Regularization is a well studied problem in the context of neural networks. It is usually used to improve the generalization performance when the number of input samples is relatively small or heavily contaminated with noise. The…

Artificial Intelligence · Computer Science 2011-04-19 Salah Rifai , Xavier Glorot , Yoshua Bengio , Pascal Vincent

Regularization, Semi-supervision, and Supervision for a Plausible Attention-Based Explanation

Attention mechanism is contributing to the majority of recent advances in machine learning for natural language processing. Additionally, it results in an attention map that shows the proportional influence of each input in its decision.…

Computation and Language · Computer Science 2025-01-23 Duc Hau Nguyen , Cyrielle Mallart , Guillaume Gravier , Pascale Sébillot