English
Related papers

Related papers: Adaptive Learn-then-Test: Statistically Valid and …

200 papers

Hyperparameter selection is a critical step in the deployment of artificial intelligence (AI) models, particularly in the current era of foundational, pre-trained, models. By framing hyperparameter selection as a multiple hypothesis testing…

Machine Learning · Computer Science 2025-02-07 Amirmohammad Farzaneh , Osvaldo Simeone

The increasing adoption of Artificial Intelligence (AI) in engineering problems calls for the development of calibration methods capable of offering robust statistical reliability guarantees. The calibration of black box AI models is…

Information Theory · Computer Science 2024-07-25 Amirmohammad Farzaneh , Sangwoo Park , Osvaldo Simeone

Existing test-time adaptation (TTA) approaches often adapt models with the unlabeled testing data stream. A recent attempt relaxed the assumption by introducing limited human annotation, referred to as Human-In-the-Loop Test-Time Adaptation…

Computer Vision and Pattern Recognition · Computer Science 2024-12-25 Yushu Li , Yongyi Su , Xulei Yang , Kui Jia , Xun Xu

Embedding tables are usually huge in click-through rate (CTR) prediction models. To train and deploy the CTR models efficiently and economically, it is necessary to compress their embedding tables at the training stage. To this end, we…

Machine Learning · Computer Science 2024-08-07 Shiwei Li , Huifeng Guo , Lu Hou , Wei Zhang , Xing Tang , Ruiming Tang , Rui Zhang , Ruixuan Li

Test-Time Adaptation (TTA) has recently emerged as a promising strategy for tackling the problem of machine learning model robustness under distribution shifts by adapting the model during inference without access to any labels. Because of…

Machine Learning · Computer Science 2024-07-22 Sebastian Cygert , Damian Sójka , Tomasz Trzciński , Bartłomiej Twardowski

Artificial neural networks (ANNs) require tremendous amount of data to train on. However, in classification models, most data features are often similar which can lead to increase in training time without significant improvement in the…

Machine Learning · Computer Science 2023-03-03 Sreelekha Guggilam , Varun Chandola , Abani Patra

Accelerated life testing (ALT) is typically used to assess the reliability of material's lifetime under desired stress levels. Recent advances in material engineering have made a variety of material alternatives readily available. To…

Methodology · Statistics 2020-01-17 Ye Chen , Qiong Zhang , Mingyang Li , Wenjun Cai

Learning-Based Testing (LBT) merges learning and testing processes to achieve both testing and behavioral adequacy. LBT utilizes active learning to infer the model of the System Under Test (SUT), enabling scalability for large and complex…

Software Engineering · Computer Science 2025-10-02 Sheikh Md. Mushfiqur Rahman , Nasir Eisty

While Large Language Models (LLMs) have exhibited remarkable emergent capabilities through extensive pre-training, they still face critical limitations in generalizing to specialized domains and handling diverse linguistic variations, known…

Computation and Language · Computer Science 2025-05-28 Jinwu Hu , Zhitian Zhang , Guohao Chen , Xutao Wen , Chao Shuai , Wei Luo , Bin Xiao , Yuanqing Li , Mingkui Tan

Learning through tests is a broadly used methodology in human learning and shows great effectiveness in improving learning outcome: a sequence of tests are made with increasing levels of difficulty; the learner takes these tests to identify…

Machine Learning · Computer Science 2021-03-15 Xuefeng Du , Haochen Zhang , Pengtao Xie

Meta-learning aims to solve unseen tasks with few labelled instances. Nevertheless, despite its effectiveness for quick learning in existing optimization-based methods, it has several flaws. Inconsequential connections are frequently seen…

Machine Learning · Computer Science 2023-04-07 Sambhavi Tiwari , Manas Gogoi , Shekhar Verma , Krishna Pratap Singh

This work proposes a novel strategy for social learning by introducing the critical feature of adaptation. In social learning, several distributed agents update continually their belief about a phenomenon of interest through: i) direct…

Multiagent Systems · Computer Science 2021-07-27 Virginia Bordignon , Vincenzo Matta , Ali H. Sayed

Most methods for decision-theoretic online learning are based on the Hedge algorithm, which takes a parameter called the learning rate. In most previous analyses the learning rate was carefully tuned to obtain optimal worst-case…

Machine Learning · Statistics 2015-03-04 Tim van Erven , Peter Grünwald , Wouter M. Koolen , Steven de Rooij

Test-time adaptation (TTA) addresses distribution shifts for streaming test data in unsupervised settings. Currently, most TTA methods can only deal with minor shifts and rely heavily on heuristic and empirical studies. To advance TTA under…

Machine Learning · Computer Science 2024-04-09 Shurui Gui , Xiner Li , Shuiwang Ji

Multi-task learning (MTL) aims to enhance the performance and efficiency of machine learning models by simultaneously training them on multiple tasks. However, MTL research faces two challenges: 1) effectively modeling the relationships…

Information Retrieval · Computer Science 2023-06-06 Danwei Li , Zhengyu Zhang , Siyang Yuan , Mingze Gao , Weilin Zhang , Chaofei Yang , Xi Liu , Jiyan Yang

The static ``train then deploy" paradigm fundamentally limits Large Language Models (LLMs) from dynamically adapting their weights in response to continuous streams of new information inherent in real-world tasks. Test-Time Training (TTT)…

Machine Learning · Computer Science 2026-04-08 Guhao Feng , Shengjie Luo , Kai Hua , Ge Zhang , Di He , Wenhao Huang , Tianle Cai

Decoding from large language models (LLMs) typically relies on fixed sampling hyperparameters (e.g., temperature, top-p), despite substantial variation in task difficulty and uncertainty across prompts and individual decoding steps. We…

Machine Learning · Computer Science 2026-03-17 Chloe H. Su , Zhe Ye , Samuel Tenka , Aidan Yang , Soonho Kong , Udaya Ghai

Large Language Models (LLMs) demonstrate promising capabilities in solving scientific problems but often suffer from the issue of hallucination. While integrating LLMs with tools can mitigate this issue, models fine-tuned on tool usage…

Machine Learning · Computer Science 2025-06-23 Bohan Lyu , Yadi Cao , Duncan Watson-Parris , Leon Bergen , Taylor Berg-Kirkpatrick , Rose Yu

The remarkable advancements in large language models (LLMs) have significantly enhanced the performance in few-shot learning settings. By using only a small number of labeled examples, referred to as demonstrations, LLMs can effectively…

Computation and Language · Computer Science 2023-11-23 Katerina Margatina , Timo Schick , Nikolaos Aletras , Jane Dwivedi-Yu

Multi-Task Learning (MTL) networks have emerged as a promising method for transferring learned knowledge across different tasks. However, MTL must deal with challenges such as: overfitting to low resource tasks, catastrophic forgetting, and…

Machine Learning · Computer Science 2022-04-22 Jonathan Pilault , Amine Elhattami , Christopher Pal
‹ Prev 1 2 3 10 Next ›