Related papers: Optimize Incompatible Parameters through Compatibi…

Enhancing Knowledge Transfer in Hyperspectral Image Classification via Cross-scene Knowledge Integration

Knowledge transfer has strong potential to improve hyperspectral image (HSI) classification, yet two inherent challenges fundamentally restrict effective cross-domain transfer: spectral variations caused by different sensors and semantic…

Computer Vision and Pattern Recognition · Computer Science · 2025-12-11 · Lu Huo, Wenjian Huang, Jianguo Zhang, Min Xu, Haimin Zhang

Collective Model Intelligence Requires Compatible Specialization

In this work, we explore the limitations of combining models by averaging intermediate features, referred to as model merging, and propose a new direction for achieving collective model intelligence through what we call compatible…

Machine Learning · Computer Science · 2024-11-05 · Jyothish Pari, Samy Jelassi, Pulkit Agrawal

Graft: Integrating the Domain Knowledge via Efficient Parameter Synergy for MLLMs

Multimodal Large Language Models (MLLMs) have achieved success across various domains. However, their applicability tends to degrade when confronted with different types of data inputs, especially for MLLMs that have been fine-tuned for…

Computation and Language · Computer Science · 2025-07-02 · Yang Dai, Jianxiang An, Tianwei Lin, Hongyang He, Hongzhe Huang, Wenqiao Zhang, Zheqi Lv, Siliang Tang, Yueting Zhuang

Non-Uniform Parameter-Wise Model Merging

Combining multiple machine learning models has long been a technique for enhancing performance, particularly in distributed settings. Traditional approaches, such as model ensembles, work well, but are expensive in terms of memory and…

Machine Learning · Computer Science · 2024-12-23 · Albert Manuel Orozco Camacho, Stefan Horoi, Guy Wolf, Eugene Belilovsky

Multiple Kernel Learning for Brain-Computer Interfacing

Combining information from different sources is a common way to improve classification accuracy in Brain-Computer Interfacing (BCI). For instance, in small sample settings it is useful to integrate data from other subjects or sessions in…

Machine Learning · Statistics · 2013-10-24 · Wojciech Samek, Alexander Binder, Klaus-Robert Müller

Enabling Efficient Deep Convolutional Neural Network-based Sensor Fusion for Autonomous Driving

Autonomous driving demands accurate perception and safe decision-making. To achieve this, automated vehicles are now equipped with multiple sensors (e.g., camera, Lidar, etc.), enabling them to exploit complementary environmental context by…

Computer Vision and Pattern Recognition · Computer Science · 2022-02-24 · Xiaoming Zeng, Zhendong Wang, Yang Hu

Parameter Competition Balancing for Model Merging

While fine-tuning pretrained models has become common practice, these models often underperform outside their specific domains. Recently developed model merging techniques enable the direct integration of multiple models, each fine-tuned…

Computer Vision and Pattern Recognition · Computer Science · 2024-10-04 · Guodong Du, Junlin Lee, Jing Li, Runhua Jiang, Yifei Guo, Shuyang Yu, Hanting Liu, Sim Kuan Goh, Ho-Kin Tang, Daojing He, Min Zhang

Model Linkage Selection for Cooperative Learning

We consider a distributed learning setting where each agent/learner holds a specific parametric model and data source. The goal is to integrate information across a set of learners to enhance the prediction accuracy of a given learner. A…

Methodology · Statistics · 2021-09-21 · Jiaying Zhou, Jie Ding, Kean Ming Tan, Vahid Tarokh

A Survey on Knowledge integration techniques with Artificial Neural Networks for seq-2-seq/time series models

In recent years, with the advent of massive computational power and the availability of huge amounts of data, Deep neural networks have enabled the exploration of uncharted areas in several domains. But at times, they under-perform due to…

Machine Learning · Computer Science · 2020-08-14 · Pramod Vadiraja, Muhammad Ali Chattha

A collaborative filtering model with heterogeneous neural networks for recommender systems

In recent years, deep neural network is introduced in recommender systems to solve the collaborative filtering problem, which has achieved immense success on computer vision, speech recognition and natural language processing. On one hand,…

Information Retrieval · Computer Science · 2020-10-14 · Ge Fan, Wei Zeng, Shan Sun, Biao Geng, Weiyi Wang, Weibo Liu

Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models

This study addresses the Domain-Class Incremental Learning problem, a realistic but challenging continual learning scenario where both the domain distribution and target classes vary across tasks. To handle these diverse tasks, pre-trained…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-09 · Longxiang Tang, Zhuotao Tian, Kai Li, Chunming He, Hantao Zhou, Hengshuang Zhao, Xiu Li, Jiaya Jia

From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models

Investigating better ways to reuse the released pre-trained language models (PLMs) can significantly reduce the computational cost and the potential environmental side-effects. This paper explores a novel PLM reuse paradigm, Knowledge…

Computation and Language · Computer Science · 2022-10-12 · Lei Li, Yankai Lin, Xuancheng Ren, Guangxiang Zhao, Peng Li, Jie Zhou, Xu Sun

Deep Parameter Interpolation for Scalar Conditioning

We propose deep parameter interpolation (DPI), a general-purpose method for transforming an existing deep neural network architecture into one that accepts an additional scalar input. Recent deep generative models, including diffusion…

Image and Video Processing · Electrical Eng. & Systems · 2025-11-27 · Chicago Y. Park, Michael T. McCann, Cristina Garcia-Cardona, Brendt Wohlberg, Ulugbek S. Kamilov

Towards Explainable Multi-Party Learning: A Contrastive Knowledge Sharing Framework

Multi-party learning provides solutions for training joint models with decentralized data under legal and practical constraints. However, traditional multi-party learning approaches are confronted with obstacles such as system…

Machine Learning · Computer Science · 2021-05-26 · Yuan Gao, Jiawei Li, Maoguo Gong, Yu Xie, A. K. Qin

CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging

Multi-task model merging offers a promising paradigm for integrating multiple expert models into a unified model without additional training. Existing state-of-the-art techniques, such as Task Arithmetic and its variants, merge models by…

Artificial Intelligence · Computer Science · 2025-05-15 · Wenju Sun, Qingyong Li, Yangli-ao Geng, Boyang Li

Aligning Explanations for Recommendation with Rating and Feature via Maximizing Mutual Information

Providing natural language-based explanations to justify recommendations helps to improve users' satisfaction and gain users' trust. However, as current explanation generation methods are commonly trained with an objective to mimic existing…

Information Retrieval · Computer Science · 2024-08-22 · Yurou Zhao, Yiding Sun, Ruidong Han, Fei Jiang, Lu Guan, Xiang Li, Wei Lin, Weizhi Ma, Jiaxin Mao

InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integration

Large Language Models (LLMs) have achieved exceptional capabilities in open generation across various domains, yet they encounter difficulties with tasks that require intensive knowledge. To address these challenges, methods for integrating…

Computation and Language · Computer Science · 2024-12-17 · Fali Wang, Runxue Bao, Suhang Wang, Wenchao Yu, Yanchi Liu, Wei Cheng, Haifeng Chen

Conditional Mutual Information Constrained Deep Learning for Classification

The concepts of conditional mutual information (CMI) and normalized conditional mutual information (NCMI) are introduced to measure the concentration and separation performance of a classification deep neural network (DNN) in the output…

Machine Learning · Computer Science · 2023-09-19 · En-Hui Yang, Shayan Mohajer Hamidi, Linfeng Ye, Renhao Tan, Beverly Yang

Improving Network Interpretability via Explanation Consistency Evaluation

While deep neural networks have achieved remarkable performance, they tend to lack transparency in prediction. The pursuit of greater interpretability in neural networks often results in a degradation of their original performance. Some…

Computer Vision and Pattern Recognition · Computer Science · 2024-08-09 · Hefeng Wu, Hao Jiang, Keze Wang, Ziyi Tang, Xianghuan He, Liang Lin

Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models

Knowledge augmentation has significantly enhanced the performance of Large Language Models (LLMs) in knowledge-intensive tasks. However, existing methods typically operate on the simplistic premise that model performance equates with…

Computation and Language · Computer Science · 2026-02-16 · Hao Chen, Ye He, Yuchun Fan, Yukun Yan, Zhenghao Liu, Qingfu Zhu, Maosong Sun, Wanxiang Che