English
Related papers

Related papers: Teraflop-scale Incremental Machine Learning

200 papers

We present a concrete design for Solomonoff's incremental machine learning system suitable for desktop computers. We use R5RS Scheme and its standard library with a few omissions as the reference machine. We introduce a Levin Search variant…

Artificial Intelligence · Computer Science 2017-09-12 Eray Özkural

Continuously learning new tasks using high-level ideas or knowledge is a key capability of humans. In this paper, we propose Lifelong reinforcement learning with Sequential linear temporal logic formulas and Reward Machines (LSRM), which…

Artificial Intelligence · Computer Science 2021-11-19 Xuejing Zheng , Chao Yu , Chen Chen , Jianye Hao , Hankz Hankui Zhuo

This paper explores a top-down approach to automating incremental advances in machine learning research through component-level innovation, facilitated by Large Language Models (LLMs). Our framework systematically generates novel…

Machine Learning · Computer Science 2024-09-10 Shervin Ardeshir

We introduce a lifelong imitation learning framework that enables continual policy refinement across sequential tasks under realistic memory and data constraints. Our approach departs from conventional experience replay by operating…

Computer Vision and Pattern Recognition · Computer Science 2026-03-13 Fanqi Yu , Matteo Tiezzi , Tommaso Apicella , Cigdem Beyan , Vittorio Murino

Large language models (LLMs) demonstrate exceptional performance in numerous tasks but still heavily rely on knowledge stored in their parameters. Moreover, updating this knowledge incurs high training costs. Retrieval-augmented generation…

Computation and Language · Computer Science 2024-06-07 Yanming Liu , Xinyue Peng , Xuhong Zhang , Weihao Liu , Jianwei Yin , Jiannan Cao , Tianyu Du

In this paper, we propose an incremental learning method for end-to-end Automatic Speech Recognition (ASR) which enables an ASR system to perform well on new tasks while maintaining the performance on its originally learned ones. To…

Audio and Speech Processing · Electrical Eng. & Systems 2021-09-17 Li Fu , Xiaoxiao Li , Libo Zi , Zhengchen Zhang , Youzheng Wu , Xiaodong He , Bowen Zhou

Recent work in incremental learning has introduced diverse approaches to tackle catastrophic forgetting from data augmentation to optimized training regimes. However, most of them focus on very few training steps. We propose a method for…

Computation and Language · Computer Science 2022-10-27 Karan Praharaj , Irina Matveeva

Generative large language models (LLMs) exhibit impressive capabilities, which can be further augmented by integrating a pre-trained vision model into the original LLM to create a multimodal LLM (MLLM). However, this integration often…

Computation and Language · Computer Science 2025-08-14 Shikhar Srivastava , Md Yousuf Harun , Robik Shrestha , Christopher Kanan

Incremental learning is the ability of systems to acquire knowledge over time, enabling their adaptation and generalization to novel tasks. It is a critical ability for intelligent, real-world systems, especially when data changes…

Machine Learning · Computer Science 2025-09-03 Mladjan Jovanovic , Peter Voss

Text data augmentation is a widely used strategy for mitigating data sparsity in natural language processing (NLP), particularly in low-resource settings where limited samples hinder effective semantic modeling. While augmentation can…

Computation and Language · Computer Science 2025-07-17 Payal Bhattad , Sai Manoj Pudukotai Dinakarrao , Anju Gupta

Existing research on large language models (LLMs) shows that they can solve information extraction tasks through multi-step planning. However, their extraction behavior on complex sentences and tasks is unstable, emerging issues such as…

Computation and Language · Computer Science 2024-08-30 Zepeng Ding , Ruiyang Ke , Wenhao Huang , Guochao Jiang , Yanda Li , Deqing Yang , Jiaqing Liang

We present a novel incremental learning approach for unsupervised word segmentation that combines features from probabilistic modeling and model selection. This includes super-additive penalties for addressing the cognitive burden imposed…

Computation and Language · Computer Science 2016-09-26 Ruey-Cheng Chen

With current state-of-the-art approaches aimed at enhancing the reasoning capabilities of Large Language Models(LLMs) through iterative preference learning inspired by AlphaZero, we propose to further enhance the step-wise reasoning…

Machine Learning · Computer Science 2024-12-24 Huchen Jiang , Yangyang Ma , Chaofan Ding , Kexin Luan , Xinhan Di

Continual Learning, also known as Lifelong Learning, aims to continually learn from new data as it becomes available. While prior research on continual learning in automatic speech recognition has focused on the adaptation of models across…

Machine Learning · Computer Science 2022-07-13 Muqiao Yang , Ian Lane , Shinji Watanabe

Large Language Models (LLMs) are typically static after training, yet real-world applications require continual adaptation to new knowledge without degrading existing capabilities. Standard approaches to updating models, like full…

Machine Learning · Computer Science 2026-04-08 Satyam Goyal , Anirudh Kanchi , Garv Shah , Prakhar Gupta

In learning-to-learn the goal is to infer a learning algorithm that works well on a class of tasks sampled from an unknown meta distribution. In contrast to previous work on batch learning-to-learn, we consider a scenario where tasks are…

Machine Learning · Statistics 2018-03-23 Giulia Denevi , Carlo Ciliberto , Dimitris Stamos , Massimiliano Pontil

In this chapter, we consider generative information retrieval evaluation from two distinct but interrelated perspectives. First, large language models (LLMs) themselves are rapidly becoming tools for evaluation, with current research…

Information Retrieval · Computer Science 2025-01-31 Marwah Alaofi , Negar Arabzadeh , Charles L. A. Clarke , Mark Sanderson

Retrieval-Augmented Generation (RAG) expands the knowledge of Large Language Models (LLMs), yet current static retrieval methods struggle with complex, multi-hop problems. While recent dynamic retrieval strategies offer improvements, they…

Computation and Language · Computer Science 2026-04-23 Haijian Liang , Zenghao Niu , Junjie Wu , Changwang Zhang , Wangchunshu Zhou , Jun Wang

Large Language Models (LLMs) have achieved significant advancements, however, the common learning paradigm treats LLMs as passive information repositories, neglecting their potential for active learning and alignment. Some approaches train…

Computation and Language · Computer Science 2024-12-18 Yiming Liang , Ge Zhang , Xingwei Qu , Tianyu Zheng , Jiawei Guo , Xinrun Du , Zhenzhu Yang , Jiaheng Liu , Chenghua Lin , Lei Ma , Wenhao Huang , Jiajun Zhang

Large Language Models (LLMs) have achieved remarkable capabilities, yet their improvement methods remain fundamentally constrained by human design. We present Self-Developing, a framework that enables LLMs to autonomously discover,…

Computation and Language · Computer Science 2025-06-11 Yoichi Ishibashi , Taro Yano , Masafumi Oyamada
‹ Prev 1 2 3 10 Next ›