Related papers: Evolutionary Retrofitting

Domain Adversarial Fine-Tuning as an Effective Regularizer

In Natural Language Processing (NLP), pretrained language models (LMs) that are transferred to downstream tasks have been recently shown to achieve state-of-the-art results. However, standard fine-tuning can degrade the general-domain…

Machine Learning · Computer Science 2020-10-07 Giorgos Vernikos , Katerina Margatina , Alexandra Chronopoulou , Ion Androutsopoulos

Reinforcement Learning without Human Feedback for Last Mile Fine-Tuning of Large Language Models

Reinforcement learning is used to align language models with human preference signals after first pre-training the model to predict the next token of text within a large corpus using likelihood maximization. Before being deployed in a…

Computation and Language · Computer Science 2024-08-30 Alec Solway

AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback

Large Language Models (LLMs) have demonstrated significant success across various domains. However, their application in complex decision-making tasks frequently necessitates intricate prompt engineering or fine-tuning, leading to…

Artificial Intelligence · Computer Science 2024-05-06 Wanpeng Zhang , Zongqing Lu

Beyond One-Size-Fits-All Pruning via Evolutionary Metric Search for Large Language Models

Post-training pruning has emerged as a crucial optimization technique as large language models (LLMs) continue to grow rapidly. However, the significant variations in weight distributions across different LLMs make fixed pruning strategies…

Computation and Language · Computer Science 2025-05-26 Shuqi Liu , Bowei He , Han Wu , Linqi Song

PASER: Post-Training Data Selection for Efficient Pruned Large Language Model Recovery

Model pruning is an effective approach for compressing large language models (LLMs). However, this process often leads to significant degradation of model capabilities. While post-training techniques such as instruction tuning are commonly…

Computation and Language · Computer Science 2026-02-13 Bowei He , Lihao Yin , Hui-Ling Zhen , Xiaokun Zhang , Mingxuan Yuan , Chen Ma

RE-Adapt: Reverse Engineered Adaptation of Large Language Models

We introduce RE-Adapt, an approach to fine-tuning large language models on new domains without degrading any pre-existing instruction-tuning. We reverse engineer an adapter which isolates what an instruction-tuned model has learned beyond…

Computation and Language · Computer Science 2024-05-27 William Fleshman , Benjamin Van Durme

AdaFilter: Adaptive Filter Fine-tuning for Deep Transfer Learning

There is an increasing number of pre-trained deep neural network models. However, it is still unclear how to effectively use these models for a new task. Transfer learning, which aims to transfer knowledge from source tasks to a target…

Computer Vision and Pattern Recognition · Computer Science 2019-12-10 Yunhui Guo , Yandong Li , Liqiang Wang , Tajana Rosing

Evolutionary Discovery of Reinforcement Learning Algorithms via Large Language Models

Reinforcement learning algorithms are defined by their learning update rules, which are typically hand-designed and fixed. We present an evolutionary framework for discovering reinforcement learning algorithms by searching directly over…

Machine Learning · Computer Science 2026-03-31 Alkis Sygkounas , Amy Loutfi , Andreas Persson

Machine Unlearning on Pre-trained Models by Residual Feature Alignment Using LoRA

Machine unlearning is an emerging technology that removes a subset of the training data from a trained model without significantly affecting the model performance on the remaining data. This topic is becoming increasingly important in…

Machine Learning · Computer Science 2026-05-12 Laiqiao Qin , Tianqing Zhu , Linlin Wang , Wanlei Zhou

REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration

Post-training for large language models (LLMs) is constrained by the high cost of acquiring new knowledge or correcting errors and by the unintended side effects that frequently arise from retraining. To address these issues, we introduce…

Computation and Language · Computer Science 2026-02-11 Yisu Wang , Ming Wang , Haoyuan Song , Wenjie Huang , Chaozheng Wang , Yi Xie , Xuming Ran

EA4LLM: A Gradient-Free Approach to Large Language Model Optimization via Evolutionary Algorithms

In recent years, large language models (LLMs) have made remarkable progress, with model optimization primarily relying on gradient-based optimizers such as Adam. However, these gradient-based methods impose stringent hardware requirements,…

Artificial Intelligence · Computer Science 2025-10-24 WenTao Liu , Siyu Song , Hao Hao , Aimin Zhou

Training Large Neural Networks With Low-Dimensional Error Feedback

Training deep neural networks typically relies on backpropagating high dimensional error signals a computationally intensive process with little evidence supporting its implementation in the brain. However, since most tasks involve…

Machine Learning · Computer Science 2026-01-15 Maher Hanut , Jonathan Kadmon

AutoLR: An Evolutionary Approach to Learning Rate Policies

The choice of a proper learning rate is paramount for good Artificial Neural Network training and performance. In the past, one had to rely on experience and trial-and-error to find an adequate learning rate. Presently, a plethora of state…

Neural and Evolutionary Computing · Computer Science 2020-07-09 Pedro Carvalho , Nuno Lourenço , Filipe Assunção , Penousal Machado

ReHear: Iterative Pseudo-Label Refinement for Semi-Supervised Speech Recognition via Audio Large Language Models

Semi-supervised learning in automatic speech recognition (ASR) typically relies on pseudo-labeling, which often suffers from confirmation bias and error accumulation due to noisy supervision. To address this limitation, we propose ReHear, a…

Computation and Language · Computer Science 2026-02-24 Zefang Liu , Chenyang Zhu , Sangwoo Cho , Shi-Xiong Zhang

Adaptive Explainable Continual Learning Framework for Regression Problems with Focus on Power Forecasts

Compared with traditional deep learning techniques, continual learning enables deep neural networks to learn continually and adaptively. Deep neural networks have to learn new tasks and overcome forgetting the knowledge obtained from the…

Machine Learning · Computer Science 2022-02-08 Yujiang He

Reinforcement Learning with Inverse Rewards for World Model Post-training

World models simulate dynamic environments, enabling agents to interact with diverse input modalities. Although recent advances have improved the visual quality and temporal consistency of video world models, their ability of accurately…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Yang Ye , Tianyu He , Shuo Yang , Jiang Bian

Evolving Learning Rate Optimizers for Deep Neural Networks

Artificial Neural Networks (ANNs) became popular due to their successful application difficult problems such image and speech recognition. However, when practitioners want to design an ANN they need to undergo laborious process of selecting…

Neural and Evolutionary Computing · Computer Science 2021-03-24 Pedro Carvalho , Nuno Lourenço , Penousal Machado

Narrowing the Focus: Learned Optimizers for Pretrained Models

In modern deep learning, the models are learned by applying gradient updates using an optimizer, which transforms the updates based on various statistics. Optimizers are often hand-designed and tuning their hyperparameters is a big part of…

Machine Learning · Computer Science 2024-10-08 Gus Kristiansen , Mark Sandler , Andrey Zhmoginov , Nolan Miller , Anirudh Goyal , Jihwan Lee , Max Vladymyrov

Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation

Fine-tuning is the most effective way of adapting pre-trained large language models (LLMs) to downstream applications. With the fast growth of LLM-enabled AI applications and democratization of open-souced LLMs, fine-tuning has become…

Machine Learning · Computer Science 2024-03-01 Kai Huang , Hanyun Yin , Heng Huang , Wei Gao

Multiplicative learning from observation-prediction ratios

Additive parameter updates, as used in gradient descent and its adaptive extensions, underpin most modern machine-learning optimization. Yet, such additive schemes often demand numerous iterations and intricate learning-rate schedules to…

Machine Learning · Computer Science 2026-03-25 Han Kim , Hyungjoon Soh , Vipul Periwal , Junghyo Jo