Related papers: Optimize Incompatible Parameters through Compatibi…

200 papers

Knowledge transfer has strong potential to improve hyperspectral image (HSI) classification, yet two inherent challenges fundamentally restrict effective cross-domain transfer: spectral variations caused by different sensors and semantic…

Computer Vision and Pattern Recognition · Computer Science 2025-12-11 Lu Huo, Wenjian Huang, Jianguo Zhang, Min Xu, Haimin Zhang

In this work, we explore the limitations of combining models by averaging intermediate features, referred to as model merging, and propose a new direction for achieving collective model intelligence through what we call compatible…

Machine Learning · Computer Science 2024-11-05 Jyothish Pari, Samy Jelassi, Pulkit Agrawal

Multimodal Large Language Models (MLLMs) have achieved success across various domains. However, their applicability tends to degrade when confronted with different types of data inputs, especially for MLLMs that have been fine-tuned for…

Computation and Language · Computer Science 2025-07-02 Yang Dai, Jianxiang An, Tianwei Lin, Hongyang He, Hongzhe Huang, Wenqiao Zhang, Zheqi Lv, Siliang Tang, Yueting Zhuang

Combining multiple machine learning models has long been a technique for enhancing performance, particularly in distributed settings. Traditional approaches, such as model ensembles, work well, but are expensive in terms of memory and…

Machine Learning · Computer Science 2024-12-23 Albert Manuel Orozco Camacho, Stefan Horoi, Guy Wolf, Eugene Belilovsky

Combining information from different sources is a common way to improve classification accuracy in Brain-Computer Interfacing (BCI). For instance, in small sample settings it is useful to integrate data from other subjects or sessions in…

Machine Learning · Statistics 2013-10-24 Wojciech Samek, Alexander Binder, Klaus-Robert Müller

Autonomous driving demands accurate perception and safe decision-making. To achieve this, automated vehicles are now equipped with multiple sensors (e.g., camera, Lidar, etc.), enabling them to exploit complementary environmental context by…

Computer Vision and Pattern Recognition · Computer Science 2022-02-24 Xiaoming Zeng, Zhendong Wang, Yang Hu

While fine-tuning pretrained models has become common practice, these models often underperform outside their specific domains. Recently developed model merging techniques enable the direct integration of multiple models, each fine-tuned…

Computer Vision and Pattern Recognition · Computer Science 2024-10-04 Guodong Du, Junlin Lee, Jing Li, Runhua Jiang, Yifei Guo, Shuyang Yu, Hanting Liu, Sim Kuan Goh, Ho-Kin Tang, Daojing He, Min Zhang

We consider a distributed learning setting where each agent/learner holds a specific parametric model and data source. The goal is to integrate information across a set of learners to enhance the prediction accuracy of a given learner. A…

Methodology · Statistics 2021-09-21 Jiaying Zhou, Jie Ding, Kean Ming Tan, Vahid Tarokh

In recent years, with the advent of massive computational power and the availability of huge amounts of data, deep neural networks have enabled the exploration of uncharted areas in several domains. But at times, they underperform due to…

Machine Learning · Computer Science 2020-08-14 Pramod Vadiraja, Muhammad Ali Chattha

In recent years, deep neural networks, which have achieved immense success in computer vision, speech recognition, and natural language processing, have been introduced in recommender systems to solve the collaborative filtering problem. On one hand,…

Information Retrieval · Computer Science 2020-10-14 Ge Fan, Wei Zeng, Shan Sun, Biao Geng, Weiyi Wang, Weibo Liu

This study addresses the Domain-Class Incremental Learning problem, a realistic but challenging continual learning scenario where both the domain distribution and target classes vary across tasks. To handle these diverse tasks, pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Longxiang Tang, Zhuotao Tian, Kai Li, Chunming He, Hantao Zhou, Hengshuang Zhao, Xiu Li, Jiaya Jia

Investigating better ways to reuse the released pre-trained language models (PLMs) can significantly reduce the computational cost and the potential environmental side-effects. This paper explores a novel PLM reuse paradigm, Knowledge…

Computation and Language · Computer Science 2022-10-12 Lei Li, Yankai Lin, Xuancheng Ren, Guangxiang Zhao, Peng Li, Jie Zhou, Xu Sun

We propose deep parameter interpolation (DPI), a general-purpose method for transforming an existing deep neural network architecture into one that accepts an additional scalar input. Recent deep generative models, including diffusion…

Image and Video Processing · Electrical Eng. & Systems 2025-11-27 Chicago Y. Park, Michael T. McCann, Cristina Garcia-Cardona, Brendt Wohlberg, Ulugbek S. Kamilov

Multi-party learning provides solutions for training joint models with decentralized data under legal and practical constraints. However, traditional multi-party learning approaches are confronted with obstacles such as system…

Machine Learning · Computer Science 2021-05-26 Yuan Gao, Jiawei Li, Maoguo Gong, Yu Xie, A. K. Qin

Multi-task model merging offers a promising paradigm for integrating multiple expert models into a unified model without additional training. Existing state-of-the-art techniques, such as Task Arithmetic and its variants, merge models by…

Artificial Intelligence · Computer Science 2025-05-15 Wenju Sun, Qingyong Li, Yangli-ao Geng, Boyang Li

Providing natural language-based explanations to justify recommendations helps to improve users' satisfaction and gain users' trust. However, as current explanation generation methods are commonly trained with an objective to mimic existing…

Information Retrieval · Computer Science 2024-08-22 Yurou Zhao, Yiding Sun, Ruidong Han, Fei Jiang, Lu Guan, Xiang Li, Wei Lin, Weizhi Ma, Jiaxin Mao

Large Language Models (LLMs) have achieved exceptional capabilities in open generation across various domains, yet they encounter difficulties with tasks that require intensive knowledge. To address these challenges, methods for integrating…

Computation and Language · Computer Science 2024-12-17 Fali Wang, Runxue Bao, Suhang Wang, Wenchao Yu, Yanchi Liu, Wei Cheng, Haifeng Chen

The concepts of conditional mutual information (CMI) and normalized conditional mutual information (NCMI) are introduced to measure the concentration and separation performance of a classification deep neural network (DNN) in the output…

Machine Learning · Computer Science 2023-09-19 En-Hui Yang, Shayan Mohajer Hamidi, Linfeng Ye, Renhao Tan, Beverly Yang

While deep neural networks have achieved remarkable performance, they tend to lack transparency in prediction. The pursuit of greater interpretability in neural networks often results in a degradation of their original performance. Some…

Computer Vision and Pattern Recognition · Computer Science 2024-08-09 Hefeng Wu, Hao Jiang, Keze Wang, Ziyi Tang, Xianghuan He, Liang Lin

Knowledge augmentation has significantly enhanced the performance of Large Language Models (LLMs) in knowledge-intensive tasks. However, existing methods typically operate on the simplistic premise that model performance equates with…

Computation and Language · Computer Science 2026-02-16 Hao Chen, Ye He, Yuchun Fan, Yukun Yan, Zhenghao Liu, Qingfu Zhu, Maosong Sun, Wanxiang Che