Related papers: Self-Regulated Interactive Sequence-to-Sequence Le…

200 papers

Sequence-to-sequence learning involves a trade-off between signal strength and annotation cost of training data. For example, machine translation data range from costly expert-generated translations that enable supervised learning, to weak…

Computation and Language · Computer Science 2020-04-24 Julia Kreutzer, Nathaniel Berger, Stefan Riezler

Sequential learning, also called lifelong learning, studies the problem of learning tasks in a sequence with access restricted to only the data of the current task. In this paper we look at a scenario with fixed model capacity, and…

Machine Learning · Statistics 2019-04-15 Rahaf Aljundi, Marcus Rohrbach, Tinne Tuytelaars

Self-supervised learning enables the training of large neural models without the need for large, labeled datasets. It has been generating breakthroughs in several fields, including computer vision, natural language processing, biology, and…

Computation and Language · Computer Science 2023-12-19 Luis Lugo, Valentin Vielzeuf

Feedback is essential for learning, but its effectiveness relies heavily on how well it engages students in the educational process. Generative AI offers novel opportunities to efficiently produce rich, formative feedback, ranging from…

Human-Computer Interaction · Computer Science 2026-04-16 Jie Cao, Chloe Qianhui Zhao, Christian Schunn, Elizabeth A. McLaughlin, Jionghao Lin, Kenneth R. Koedinger

Recent work has demonstrated that problems (particularly imitation learning and structured prediction) where a learner's predictions influence the input distribution it is tested on can be naturally addressed by an interactive approach…

Machine Learning · Computer Science 2014-06-24 Stephane Ross, J. Andrew Bagnell

A key challenge in Imitation Learning (IL) is that optimal state-action demonstrations are difficult for the teacher to provide. For example, in robotics, providing kinesthetic demonstrations on a robotic manipulator requires the teacher to…

Robotics · Computer Science 2021-04-05 Matthew Schmittle, Sanjiban Choudhury, Siddhartha S. Srinivasa

Reasoning models excel in complex problem solving but exhibit a concerning trade-off between reasoning capabilities and instruction-following abilities. Existing approaches for improving instruction following rely on stronger external…

Artificial Intelligence · Computer Science 2025-08-05 Qingyu Ren, Qianyu He, Bowei Zhang, Jie Zeng, Jiaqing Liang, Yanghua Xiao, Weikang Zhou, Zeye Sun, Fei Yu

Recently, self-supervised learning has seen explosive growth and use in a variety of machine learning tasks because of its ability to avoid the cost of annotating large-scale datasets. This paper gives an overview for best self supervised…

Machine Learning · Computer Science 2022-10-21 Naman Goyal

Machine Learning and Inference methods have become ubiquitous in our attempt to induce more abstract representations of natural language text, visual scenes, and other messy, naturally occurring data, and support decisions that depend on…

Machine Learning · Computer Science 2020-05-27 Dan Roth

Complex planning and scheduling problems have long been solved using various optimization or heuristic approaches. In recent years, imitation learning that aims to learn from expert demonstrations has been proposed as a viable alternative…

Machine Learning · Computer Science 2024-05-24 Qian Shao, Pradeep Varakantham, Shih-Fen Cheng

Self-supervision provides effective representations for downstream tasks without requiring labels. However, existing approaches lag behind fully supervised training and are often not thought beneficial beyond obviating or reducing the need…

Machine Learning · Computer Science 2019-10-30 Dan Hendrycks, Mantas Mazeika, Saurav Kadavath, Dawn Song

In sequential machine teaching, a teacher's objective is to provide the optimal sequence of inputs to sequential learners in order to guide them towards the best model. In this paper we extend this setting from current static one-data-set…

Machine Learning · Computer Science 2020-09-15 Mustafa Mert Celikok, Pierre-Alexandre Murena, Samuel Kaski

Reinforcement learning optimizes policies for expected cumulative reward. Need the supervision be so narrow? Reward is delayed and sparse for many tasks, making it a difficult and impoverished signal for end-to-end optimization. To augment…

Machine Learning · Computer Science 2017-03-10 Evan Shelhamer, Parsa Mahmoudieh, Max Argus, Trevor Darrell

Reinforcement learning (RL) has become widely adopted in robot control. Despite many successes, one major persisting problem can be very low data efficiency. One solution is interactive feedback, which has been shown to speed up RL…

Robotics · Computer Science 2026-04-29 Daniel Harnack, Julie Pivin-Bachler, Nicolás Navarro-Guerrero

Learning good representations without supervision is still an open issue in machine learning, and is particularly challenging for speech signals, which are often characterized by long sequences with a complex hierarchical structure. Some…

Machine Learning · Computer Science 2019-04-09 Santiago Pascual, Mirco Ravanelli, Joan Serrà, Antonio Bonafonte, Yoshua Bengio

To train neural machine translation models simultaneously on multiple tasks (languages), it is common to sample each task uniformly or in proportion to dataset sizes. As these methods offer little control over performance trade-offs, we…

Machine Learning · Computer Science 2019-09-17 Sébastien Jean, Orhan Firat, Melvin Johnson

When deploying autonomous agents in the real world, we need effective ways of communicating objectives to them. Traditional skill learning has revolved around reinforcement and imitation learning, each with rigid constraints on the format…

Artificial Intelligence · Computer Science 2019-11-21 Mark Woodward, Chelsea Finn, Karol Hausman

Adapting one's thought process based on corrective feedback is an essential ability in human learning, particularly in collaborative settings. In contrast, the current large language model training paradigm relies heavily on modeling vast,…

Artificial Intelligence · Computer Science 2026-02-19 Martin Klissarov, Jonathan Cook, Diego Antognini, Hao Sun, Jingling Li, Natasha Jaques, Claudiu Musat, Edward Grefenstette

Modern approaches to text-to-speech require the entire input character sequence to be processed before any audio is synthesised. This latency limits the suitability of such models for time-sensitive tasks like simultaneous interpretation.…

Audio and Speech Processing · Electrical Eng. & Systems 2021-06-16 Devang S Ram Mohan, Raphael Lenain, Lorenzo Foglianti, Tian Huey Teh, Marlene Staib, Alexandra Torresquintero, Jiameng Gao

Machine teaching addresses the problem of finding the best training data that can guide a learning algorithm to a target model with minimal effort. In conventional settings, a teacher provides data that are consistent with the true data…

Machine Learning · Computer Science 2019-11-04 Tomi Peltola, Mustafa Mert Çelikok, Pedram Daee, Samuel Kaski