Related papers: Reinforcement Learning Based Argument Component De…

Compact Prompting in Instruction-tuned LLMs for Joint Argumentative Component Detection

Argumentative component detection (ACD) is a core subtask of Argument(ation) Mining (AM) and one of its most challenging aspects, as it requires jointly delimiting argumentative spans and classifying them into components such as claims and…

Computation and Language · Computer Science 2026-03-04 Sofiane Elguendouze , Erwan Hain , Elena Cabrio , Serena Villata

Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study

In a recent study, Reinforcement Learning (RL) used in combination with many-objective search, has been shown to outperform alternative techniques (random search and many-objective search) for online testing of Deep Neural Network-enabled…

Software Engineering · Computer Science 2024-03-21 Luca Giamattei , Matteo Biagiola , Roberto Pietrantuono , Stefano Russo , Paolo Tonella

Automated Cyber Defense with Generalizable Graph-based Reinforcement Learning Agents

Deep reinforcement learning (RL) is emerging as a viable strategy for automated cyber defense (ACD). The traditional RL approach represents networks as a list of computers in various states of safety or threat. Unfortunately, these models…

Machine Learning · Computer Science 2025-09-22 Isaiah J. King , Benjamin Bowman , H. Howie Huang

Joint RNN Model for Argument Component Boundary Detection

Argument Component Boundary Detection (ACBD) is an important sub-task in argumentation mining; it aims at identifying the word sequences that constitute argument components, and is usually considered as the first sub-task in the…

Computation and Language · Computer Science 2017-05-08 Minglan Li , Yang Gao , Hui Wen , Yang Du , Haijing Liu , Hao Wang

Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions

From a young age humans learn to use grammatical principles to hierarchically combine words into sentences. Action grammars is the parallel idea, that there is an underlying set of rules (a "grammar") that govern how we hierarchically…

Machine Learning · Computer Science 2019-10-24 Petros Christodoulou , Robert Tjarko Lange , Ali Shafti , A. Aldo Faisal

Deep Anomaly Detection and Search via Reinforcement Learning

Semi-supervised Anomaly Detection (AD) is a kind of data mining task which aims at learning features from partially-labeled datasets to help detect outliers. In this paper, we classify existing semi-supervised AD methods into two…

Machine Learning · Computer Science 2022-10-27 Chao Chen , Dawei Wang , Feng Mao , Zongzhang Zhang , Yang Yu

Return-Based Contrastive Representation Learning for Reinforcement Learning

Recently, various auxiliary tasks have been proposed to accelerate representation learning and improve sample efficiency in deep reinforcement learning (RL). However, existing auxiliary tasks do not take the characteristics of RL problems…

Machine Learning · Computer Science 2021-02-23 Guoqing Liu , Chuheng Zhang , Li Zhao , Tao Qin , Jinhua Zhu , Jian Li , Nenghai Yu , Tie-Yan Liu

RAPID: An Efficient Reinforcement Learning Algorithm for Small Language Models

Reinforcement learning (RL) has emerged as a promising strategy for finetuning small language models (SLMs) to solve targeted tasks such as math and coding. However, RL algorithms tend to be resource-intensive, taking a significant amount…

Machine Learning · Computer Science 2025-10-07 Lianghuan Huang , Sagnik Anupam , Insup Lee , Shuo Li , Osbert Bastani

Pre-training with Non-expert Human Demonstration for Deep Reinforcement Learning

Deep reinforcement learning (deep RL) has achieved superior performance in complex sequential tasks by using deep neural networks as function approximators to learn directly from raw input images. However, learning directly from raw images…

Machine Learning · Computer Science 2019-07-31 Gabriel V. de la Cruz , Yunshu Du , Matthew E. Taylor

Reinforcement Learning with Automated Auxiliary Loss Search

A good state representation is crucial to solving complicated reinforcement learning (RL) challenges. Many recent works focus on designing auxiliary losses for learning informative representations. Unfortunately, these handcrafted…

Machine Learning · Computer Science 2022-10-13 Tairan He , Yuge Zhang , Kan Ren , Minghuan Liu , Che Wang , Weinan Zhang , Yuqing Yang , Dongsheng Li

RLAD: Time Series Anomaly Detection through Reinforcement Learning and Active Learning

We introduce a new semi-supervised, time series anomaly detection algorithm that uses deep reinforcement learning (DRL) and active learning to efficiently learn and adapt to anomalies in real-world time series data. Our model - called RLAD…

Machine Learning · Computer Science 2021-04-02 Tong Wu , Jorge Ortiz

Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis

The advent of large pre-trained language models in the domain of Code Synthesis has shown remarkable performance on various benchmarks, treating the problem of Code Generation in a fashion similar to Natural Language Generation, trained…

Machine Learning · Computer Science 2023-10-23 Philip John Gorinski , Matthieu Zimmer , Gerasimos Lampouras , Derrick Goh Xin Deik , Ignacio Iacobacci

Reinforce-Ada: An Adaptive Sampling Framework under Non-linear RL Objectives

Reinforcement learning (RL) for large language model reasoning is frequently hindered by signal loss, a phenomenon where standard uniform sampling with small group sizes fails to uncover informative learning signals for difficult prompts.…

Machine Learning · Computer Science 2025-12-08 Wei Xiong , Chenlu Ye , Baohao Liao , Hanze Dong , Xinxing Xu , Christof Monz , Jiang Bian , Nan Jiang , Tong Zhang

Assisted Text Annotation Using Active Learning to Achieve High Quality with Little Effort

Large amounts of annotated data have become more important than ever, especially since the rise of deep learning techniques. However, manual annotations are costly. We propose a tool that enables researchers to create large, high-quality,…

Digital Libraries · Computer Science 2021-12-23 Franziska Weeber , Felix Hamborg , Karsten Donnay , Bela Gipp

Introduction to Reinforcement Learning

Reinforcement Learning (RL), a subfield of Artificial Intelligence (AI), focuses on training agents to make decisions by interacting with their environment to maximize cumulative rewards. This paper provides an overview of RL, covering its…

Artificial Intelligence · Computer Science 2024-12-04 Majid Ghasemi , Dariush Ebrahimi

Automatic Curricula via Expert Demonstrations

We propose Automatic Curricula via Expert Demonstrations (ACED), a reinforcement learning (RL) approach that combines the ideas of imitation learning and curriculum learning in order to solve challenging robotic manipulation tasks with…

Machine Learning · Computer Science 2022-02-08 Siyu Dai , Andreas Hofmann , Brian Williams

Learning to Detect Language Model Training Data via Active Reconstruction

Detecting LLM training data is generally framed as a membership inference attack (MIA) problem. However, conventional MIAs operate passively on fixed model weights, using log-likelihoods or text generations. In this work, we introduce…

Machine Learning · Computer Science 2026-02-24 Junjie Oscar Yin , John X. Morris , Vitaly Shmatikov , Sewon Min , Hannaneh Hajishirzi

APRIL: Active Preference-learning based Reinforcement Learning

This paper focuses on reinforcement learning (RL) with limited prior knowledge. In the domain of swarm robotics for instance, the expert can hardly design a reward function or demonstrate the target behavior, forbidding the use of both…

Machine Learning · Computer Science 2012-08-07 Riad Akrour , Marc Schoenauer , Michèle Sebag

Survey on reinforcement learning for language processing

In recent years some researchers have explored the use of reinforcement learning (RL) algorithms as key components in the solution of various natural language processing tasks. For instance, some of these algorithms leveraging deep neural…

Computation and Language · Computer Science 2026-04-29 Victor Uc-Cetina , Nicolas Navarro-Guerrero , Anabel Martin-Gonzalez , Cornelius Weber , Stefan Wermter

Reinforcement Learning Agent Training with Goals for Real World Tasks

Reinforcement Learning (RL) is a promising approach for solving various control, optimization, and sequential decision making tasks. However, designing reward functions for complex tasks (e.g., with multiple objectives and safety…

Artificial Intelligence · Computer Science 2021-07-23 Xuan Zhao , Marcos Campos