Related papers: MOLTE: a Modular Optimal Learning Testing Environm…

Learning Together to Perform Better: Teaching Small-Scale LLMs to Collaborate via Preferential Rationale Tuning

LLMssuch as GPT-4 have shown a remarkable ability to solve complex questions by generating step-by-step rationales. Prior works have utilized this capability to improve smaller and cheaper LMs (say, with 7B parameters). However, various…

Computation and Language · Computer Science 2025-06-04 Sohan Patnaik , Milan Aggarwal , Sumit Bhatia , Balaji Krishnamurthy

Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making

This paper introduces a simple efficient learning algorithms for general sequential decision making. The algorithm combines Optimism for exploration with Maximum Likelihood Estimation for model estimation, which is thus named OMLE. We prove…

Machine Learning · Computer Science 2022-11-24 Qinghua Liu , Praneeth Netrapalli , Csaba Szepesvári , Chi Jin

MOLE: MOdular Learning FramEwork via Mutual Information Maximization

This paper is to introduce an asynchronous and local learning framework for neural networks, named Modular Learning Framework (MOLE). This framework modularizes neural networks by layers, defines the training objective via mutual…

Machine Learning · Computer Science 2026-05-28 Tianchao Li , Yulong Pei

MLTEing Models: Negotiating, Evaluating, and Documenting Model and System Qualities

Many organizations seek to ensure that machine learning (ML) and artificial intelligence (AI) systems work as intended in production but currently do not have a cohesive methodology in place to do so. To fill this gap, we propose MLTE…

Software Engineering · Computer Science 2023-03-06 Katherine R. Maffey , Kyle Dotterrer , Jennifer Niemann , Iain Cruickshank , Grace A. Lewis , Christian Kästner

MOODE: An R Package for Multi-Objective Optimal Design of Experiments

We describe the R package MOODE and demonstrate its use to find multi-objective optimal experimental designs. Multi-Objective Optimal Design of Experiments (MOODE) targets the experimental objectives directly, ensuring that the full set of…

Computation · Statistics 2024-12-24 Vasiliki Koutra , Olga Egorova , Steven G. Gilmour , Luzia A. Trinca

Tailored Teaching with Balanced Difficulty: Elevating Reasoning in Multimodal Chain-of-Thought via Prompt Curriculum

The effectiveness of Multimodal Chain-of-Thought (MCoT) prompting is often limited by the use of randomly or manually selected examples. These examples fail to account for both model-specific knowledge distributions and the intrinsic…

Computation and Language · Computer Science 2025-10-14 Xinglong Yang , Quan Feng , Zhongying Pan , Xiang Chen , Yu Tian , Wentong Li , Shuofei Qiao , Yuxia Geng , Xingyu Zhao , Sheng-Jun Huang

SMITE: Enhancing Fairness in LLMs through Optimal In-Context Example Selection via Dynamic Validation

Large Language Models (LLMs) are widely used for downstream tasks such as tabular classification, where ensuring fairness in their outputs is critical for inclusivity, equal representation, and responsible AI deployment. This study…

Computation and Language · Computer Science 2025-08-26 Garima Chhikara , Kripabandhu Ghosh , Abhijnan Chakraborty

modOpt: A modular development environment and library for optimization algorithms

Recent advances in computing hardware and modeling software have given rise to new applications for numerical optimization. These new applications occasionally uncover bottlenecks in existing optimization algorithms and necessitate further…

Mathematical Software · Computer Science 2024-10-18 Anugrah Jo Joshy , John T. Hwang

YAMLE: Yet Another Machine Learning Environment

YAMLE: Yet Another Machine Learning Environment is an open-source framework that facilitates rapid prototyping and experimentation with machine learning (ML) models and methods. The key motivation is to reduce repetitive work when…

Machine Learning · Computer Science 2024-02-12 Martin Ferianc , Miguel Rodrigues

Combining Difficulty Ranking with Multi-Armed Bandits to Sequence Educational Content

As e-learning systems become more prevalent, there is a growing need for them to accommodate individual differences between students. This paper addresses the problem of how to personalize educational content to students in order to…

Artificial Intelligence · Computer Science 2019-04-24 Avi Segal , Yossi Ben David , Joseph Jay Williams , Kobi Gal , Yaar Shalom

MuTSE: A Human-in-the-Loop Multi-use Text Simplification Evaluator

As Large Language Models (LLMs) become increasingly prevalent in text simplification, systematically evaluating their outputs across diverse prompting strategies and architectures remains a critical methodological challenge in both NLP…

Computation and Language · Computer Science 2026-04-13 Rares-Alexandru Roscan , Gabriel Petre1 , Adrian-Marius Dumitran , Angela-Liliana Dumitran

METTLE: a METamorphic testing approach to assessing and validating unsupervised machine LEarning systems

Unsupervised machine learning is the training of an artificial intelligence system using information that is neither classified nor labeled, with a view to modeling the underlying structure or distribution in a dataset. Since unsupervised…

Software Engineering · Computer Science 2020-03-18 Xiaoyuan Xie , Zhiyi Zhang , Tsong Yueh Chen , Yang Liu , Pak-Lok Poon , Baowen Xu

Ecole: A Library for Learning Inside MILP Solvers

In this paper we describe Ecole (Extensible Combinatorial Optimization Learning Environments), a library to facilitate integration of machine learning in combinatorial optimization solvers. It exposes sequential decision making that must be…

Machine Learning · Computer Science 2021-04-08 Antoine Prouvost , Justin Dumouchelle , Maxime Gasse , Didier Chételat , Andrea Lodi

PLUME: Latent Reasoning Based Universal Multimodal Embedding

Universal multimodal embedding (UME) maps heterogeneous inputs into a shared retrieval space with a single model. Recent approaches improve UME by generating explicit chain-of-thought (CoT) rationales before extracting embeddings, enabling…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Chenwei He , Xiangzhao Hao , Tianyu Yang , Yuxiang Ma , Yuheng Jia , Lingxiang Wu , Chaoyang Zhao , Haiyun Guo , Jinqiao Wang

Equitable Multi-task Learning

Multi-task learning (MTL) has achieved great success in various research domains, such as CV, NLP and IR etc. Due to the complex and competing task correlation, naive training all tasks may lead to inequitable learning, i.e. some tasks are…

Machine Learning · Computer Science 2023-06-21 Jun Yuan , Rui Zhang

MolMem: Memory-Augmented Agentic Reinforcement Learning for Sample-Efficient Molecular Optimization

In drug discovery, molecular optimization aims to iteratively refine a lead compound to improve molecular properties while preserving structural similarity to the original molecule. However, each oracle evaluation is expensive, making…

Machine Learning · Computer Science 2026-04-15 Ziqing Wang , Yibo Wen , Abhishek Pandy , Han Liu , Kaize Ding

Supporting Instructors in Collaborating with Researchers using MOOClets

Most education and workplace learning takes place in classroom contexts far removed from laboratories or field sites with special arrangements for scientific research. But digital online resources provide a novel opportunity for large scale…

Computers and Society · Computer Science 2015-02-17 Joseph Jay Williams , Juho Kim , Brian C. Keegan

Tackling Multimodal Learning Challenges with Mixture-of-Expert: A Survey

Mixture-of-Experts (MoE) presents a naturally compatible and scalable framework for multimodal learning, demonstrating strong adaptability across diverse modalities and tasks. Despite its growing success, a comprehensive and systematic…

Machine Learning · Computer Science 2026-05-28 Liangwei Nathan Zheng , Wei Emma Zhang , Olaf Maennel , Lin Yue , Weitong Chen

MAPLE: Modality-Aware Post-training and Learning Ecosystem

Multimodal language models now integrate text, audio, and video for unified reasoning. Yet existing RL post-training pipelines treat all input signals as equally relevant, ignoring which modalities each task actually requires. This…

Artificial Intelligence · Computer Science 2026-02-13 Nikhil Verma , Minjung Kim , JooYoung Yoo , Kyung-Min Jin , Manasa Bharadwaj , Kevin Ferreira , Ko Keun Kim , Youngjoon Kim

Robust Imitation Learning for Automated Game Testing

Game development is a long process that involves many stages before a product is ready for the market. Human play testing is among the most time consuming, as testers are required to repeatedly perform tasks in the search for errors in the…

Machine Learning · Computer Science 2024-01-10 Pierluigi Vito Amadori , Timothy Bradley , Ryan Spick , Guy Moss