Related papers: When Do Curricula Work?

200 papers

In humans and animals, curriculum learning -- presenting data in a curated order -- is critical to rapid learning and effective pedagogy. Yet in machine learning, curricula are not widely used and empirically often yield only moderate…

Machine Learning · Computer Science 2022-12-07 Luca Saglietti, Stefano Sarao Mannelli, Andrew Saxe

Curriculum learning, a training technique where data is presented to the model in order of example difficulty (e.g., from simpler to more complex documents), has shown limited success for pre-training language models. In this work, we…

Computation and Language · Computer Science 2025-09-29 Loris Schoenegger, Lukas Thoma, Terra Blevins, Benjamin Roth

Training machine learning models in a meaningful order, from the easy samples to the hard ones, using curriculum learning can provide performance improvements over the standard training approach based on random data shuffling, without any…

Machine Learning · Computer Science 2022-04-12 Petru Soviany, Radu Tudor Ionescu, Paolo Rota, Nicu Sebe

In this work, we (1) introduce Curriculum Instruction Tuning, (2) explore the potential advantages of employing diverse curriculum strategies, and (3) delineate a synthetic instruction-response generation framework that complements our…

Computation and Language · Computer Science 2024-06-18 Bruce W. Lee, Hyunsoo Cho, Kang Min Yoo

It is common knowledge that the quantity and quality of the training data play a significant role in the creation of a good machine learning model. In this paper, we take it one step further and demonstrate that the way the training…

Audio and Speech Processing · Electrical Eng. & Systems 2022-08-12 Georgios Karakasidis, Tamás Grósz, Mikko Kurimo

Curriculum learning (CL) - ordering training data from easy to hard - has become a popular strategy for improving reasoning in large language models (LLMs). Yet prior work employs disparate difficulty metrics and training setups, leaving…

Machine Learning · Computer Science 2025-10-28 Yaning Jia, Chunhui Zhang, Xingjian Diao, Xiangchi Yuan, Zhongyu Ouyang, Chiyu Ma, Soroush Vosoughi

Curriculum learning -- organizing training data from easy to hard -- has improved efficiency across machine learning domains, yet remains underexplored for language model pretraining. We present the first systematic investigation of curriculum…

Computation and Language · Computer Science 2026-01-29 Yang Zhang, Amr Mohamed, Hadi Abdine, Guokan Shang, Michalis Vazirgiannis

Curriculum design is a fundamental component of education. For example, when we learn mathematics at school, we build upon our knowledge of addition to learn multiplication. These and other concepts must be mastered before our first algebra…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Parantak Singh, You Li, Ankur Sikarwar, Weixian Lei, Daniel Gao, Morgan Bruce Talbot, Ying Sun, Mike Zheng Shou, Gabriel Kreiman, Mengmi Zhang

Neural ranking models are traditionally trained on a series of random batches, sampled uniformly from the entire training set. Curriculum learning has recently been shown to improve neural models' effectiveness by sampling batches…

Information Retrieval · Computer Science 2019-12-19 Gustavo Penha, Claudia Hauff

Current state-of-the-art NLP systems use large neural networks that require lots of computational resources for training. Inspired by human knowledge acquisition, researchers have proposed curriculum learning -- sequencing of tasks…

Computation and Language · Computer Science 2024-02-06 Maxim K. Surkov, Vladislav D. Mosin, Ivan P. Yamshchikov

Curriculum learning is a training strategy that sorts the training examples by some measure of their difficulty and gradually exposes them to the learner to improve the network performance. Motivated by our insights from implicit curriculum…

Machine Learning · Computer Science 2021-07-28 Vinu Sankar Sadasivan, Anirban Dasgupta

Curriculum learning (CL) is a commonly used machine learning training strategy. However, we still lack a clear theoretical understanding of CL's benefits. In this paper, we study the benefits of CL in the multitask linear regression problem…

Machine Learning · Statistics 2021-11-16 Ziping Xu, Ambuj Tewari

Experimental results have shown that curriculum learning, i.e., presenting simpler examples before more complex ones, can improve the efficiency of learning. Some recent theoretical results also showed that changing the sampling…

Machine Learning · Computer Science 2023-06-30 Emmanuel Abbe, Elisabetta Cornacchia, Aryo Lotfi

Training neural networks is traditionally done by providing a sequence of random mini-batches sampled uniformly from the entire training data. In this work, we analyze the effect of curriculum learning, which involves the non-uniform…

Machine Learning · Computer Science 2020-12-03 Guy Hacohen, Daphna Weinshall

Curriculum learning strategies in prior multi-task learning approaches arrange datasets in a difficulty hierarchy either based on human perception or by exhaustively searching the optimal arrangement. However, human perception of difficulty…

Machine Learning · Computer Science 2022-05-30 Neeraj Varshney, Swaroop Mishra, Chitta Baral

Recent advancements in data-to-text generation largely take on the form of neural end-to-end systems. Efforts have been dedicated to improving text generation systems by changing the order of training samples in a process known as…

Computation and Language · Computer Science 2021-02-09 Ernie Chang, Hui-Syuan Yeh, Vera Demberg

Curriculum learning -- ordering training examples in a sequence to aid machine learning -- takes inspiration from human learning, but has not gained widespread acceptance. Static strategies for scoring item difficulty rely on indirect proxy…

Machine Learning · Computer Science 2026-03-17 Zhenwei Tang, Amogh Inamdar, Ashton Anderson, Richard Zemel

Curriculum Learning emphasizes the order of training instances in a computational learning setup. The core hypothesis is that simpler instances should be learned early as building blocks to learn more complex ones. Despite its usefulness,…

Computation and Language · Computer Science 2016-11-21 Volkan Cirik, Eduard Hovy, Louis-Philippe Morency

Curriculum learning changes the order of pretraining data, but it remains unclear how ordering changes the learning dynamics. We pretrain models from 14M to 1B parameters for 300B tokens under three linguistically motivated…

Machine Learning · Computer Science 2026-05-12 Mohamed Elgaar, Hadi Amiri

Curriculum learning (CL) posits that machine learning models -- similar to humans -- may learn more efficiently from data that match their current learning progress. However, CL methods are still poorly understood and, in particular for…

Machine Learning · Computer Science 2023-08-24 Lucas Weber, Jaap Jumelet, Paul Michel, Elia Bruni, Dieuwke Hupkes