Related papers: Cloud-Device Collaborative Learning for Multimodal…

Device-Cloud Collaborative Learning for Recommendation

With the rapid development of storage and computing power on mobile devices, it becomes critical and popular to deploy models on devices to save onerous communication latencies and to capture real-time features. While quite a lot of works…

Machine Learning · Computer Science 2021-06-18 Jiangchao Yao , Feng Wang , KunYang Jia , Bo Han , Jingren Zhou , Hongxia Yang

DC-CCL: Device-Cloud Collaborative Controlled Learning for Large Vision Models

Many large vision models have been deployed on the cloud for real-time services. Meanwhile, fresh samples are continuously generated on the served mobile device. How to leverage the device-side samples to improve the cloud-side large model…

Machine Learning · Computer Science 2023-03-21 Yucheng Ding , Chaoyue Niu , Fan Wu , Shaojie Tang , Chengfei Lyu , Guihai Chen

Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-world

When facing changing environments in the real world, the lightweight model on client devices suffers from severe performance drops under distribution shifts. The main limitations of the existing device model lie in (1) unable to update due…

Computer Vision and Pattern Recognition · Computer Science 2022-12-05 Yulu Gan , Mingjie Pan , Rongyu Zhang , Zijian Ling , Lingran Zhao , Jiaming Liu , Shanghang Zhang

Bridging On-Device and Cloud LLMs for Collaborative Reasoning: A Unified Methodology for Local Routing and Post-Training

Device-cloud collaboration holds promise for deploying large language models (LLMs), leveraging lightweight on-device models for efficiency while relying on powerful cloud models for superior reasoning. A central challenge in this setting…

Machine Learning · Computer Science 2026-05-26 Wenzhi Fang , Dong-Jun Han , Liangqi Yuan , Evan Chen , Christopher Brinton

Bridging Compressed Image Latents and Multimodal Large Language Models

This paper presents the first-ever study of adapting compressed image latents to suit the needs of downstream vision tasks that adopt Multimodal Large Language Models (MLLMs). MLLMs have extended the success of large language models to…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Chia-Hao Kao , Cheng Chien , Yu-Jen Tseng , Yi-Hsin Chen , Alessandro Gnutti , Shao-Yuan Lo , Wen-Hsiao Peng , Riccardo Leonardi

Backpropagation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration

In our increasingly interconnected world, where intelligent devices continually amass copious personalized multi-modal data, a pressing need arises to deliver high-quality, personalized device-aware services. However, this endeavor presents…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-11-20 Wei Ji , Li Li , Zheqi Lv , Wenqiao Zhang , Mengze Li , Zhen Wan , Wenqiang Lei , Roger Zimmermann

ECLM: Efficient Edge-Cloud Collaborative Learning with Continuous Environment Adaptation

Pervasive mobile AI applications primarily employ one of the two learning paradigms: cloud-based learning (with powerful large models) or on-device learning (with lightweight small models). Despite their own advantages, neither paradigm can…

Machine Learning · Computer Science 2023-11-21 Yan Zhuang , Zhenzhe Zheng , Yunfeng Shao , Bingshuai Li , Fan Wu , Guihai Chen

Large Language Models over Networks: Collaborative Intelligence under Resource Constraints

Large language models (LLMs) are transforming society, powering applications from smartphone assistants to autonomous driving. Yet cloud-based LLM services alone cannot serve a growing class of applications, including those operating under…

Signal Processing · Electrical Eng. & Systems 2026-05-12 Liangqi Yuan , Wenzhi Fang , Shiqiang Wang , H. Vincent Poor , Christopher G. Brinton

Unleashing MLLMs on the Edge: A Unified Framework for Cross-Modal ReID via Adaptive SVD Distillation

Practical cloud-edge deployment of Cross-Modal Re-identification (CM-ReID) faces challenges due to maintaining a fragmented ecosystem of specialized cloud models for diverse modalities. While Multi-Modal Large Language Models (MLLMs) offer…

Computer Vision and Pattern Recognition · Computer Science 2026-02-16 Hongbo Jiang , Jie Li , Xinqi Cai , Tianyu Xie , Yunhang Shen , Pingyang Dai , Liujuan Cao

Towards On-Device Personalization: Cloud-device Collaborative Data Augmentation for Efficient On-device Language Model

With the advancement of large language models (LLMs), significant progress has been achieved in various Natural Language Processing (NLP) tasks. However, existing LLMs still face two major challenges that hinder their broader adoption: (1)…

Information Retrieval · Computer Science 2026-01-28 Zhaofeng Zhong , Wei Yuan , Liang Qu , Tong Chen , Hao Wang , Xiangyu Zhao , Hongzhi Yin

Adaptive Guidance Semantically Enhanced via Multimodal LLM for Edge-Cloud Object Detection

Traditional object detection methods face performance degradation challenges in complex scenarios such as low-light conditions and heavy occlusions due to a lack of high-level semantic understanding. To address this, this paper proposes an…

Computer Vision and Pattern Recognition · Computer Science 2025-09-25 Yunqing Hu , Zheming Yang , Chang Zhao , Wen Ji

On-Device Large Language Models for Sequential Recommendation

On-device recommendation is critical for a number of real-world applications, especially in scenarios that have agreements on execution latency, user privacy, and robust functionality when internet connectivity is unstable or even…

Information Retrieval · Computer Science 2026-01-15 Xin Xia , Hongzhi Yin , Shane Culpepper

AIVD: Adaptive Edge-Cloud Collaboration for Accurate and Efficient Industrial Visual Detection

Multimodal large language models (MLLMs) demonstrate exceptional capabilities in semantic understanding and visual reasoning, yet they still face challenges in precise object localization and resource-constrained edge-cloud deployment. To…

Computer Vision and Pattern Recognition · Computer Science 2026-01-09 Yunqing Hu , Zheming Yang , Chang Zhao , Qi Guo , Meng Gao , Pengcheng Li , Wen Ji

AMMKD: Adaptive Multimodal Multi-teacher Distillation for Lightweight Vision-Language Models

The success of large-scale visual language pretraining (VLP) models has driven widespread adoption of image-text retrieval tasks. However, their deployment on mobile devices remains limited due to large model sizes and computational…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Yuqi Li , Chuanguang Yang , Junhao Dong , Zhengtao Yao , Haoyan Xu , Zeyu Dong , Hansheng Zeng , Zhulin An , Yingli Tian

CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration

Large Language Models (LLMs) exhibit remarkable human-like predictive capabilities. However, it is challenging to deploy LLMs to provide efficient and adaptive inference services at the edge. This paper proposes a novel Cloud-Edge…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-06-10 Hongpeng Jin , Yanzhao Wu

AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs

Multimodal Large Language Models (MLLMs) have demonstrated substantial value in unified text-image understanding and reasoning, primarily by converting images into sequences of patch-level tokens that align with their architectural…

Computer Vision and Pattern Recognition · Computer Science 2025-11-25 Xinliang Zhang , Lei Zhu , Hangzhou He , Shuang Zeng , Ourui Fu , Jiakui Hu , Zhengjian Yao , Yanye Lu

Adaptive Fault Tolerance Mechanisms of Large Language Models in Cloud Computing Environments

With the rapid evolution of Large Language Models (LLMs) and their large-scale experimentation in cloud-computing spaces, the challenge of guaranteeing their security and efficiency in a failure scenario has become a main issue. To ensure…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-18 Yihong Jin , Ze Yang , Xinhe Xu , Yihan Zhang , Shuyang Ji

Multimodal Large Language Models for End-to-End Affective Computing: Benchmarking and Boosting with Generative Knowledge Prompting

Multimodal Affective Computing (MAC) aims to recognize and interpret human emotions by integrating information from diverse modalities such as text, video, and audio. Recent advancements in Multimodal Large Language Models (MLLMs) have…

Artificial Intelligence · Computer Science 2025-08-05 Miaosen Luo , Jiesen Long , Zequn Li , Yunying Yang , Yuncheng Jiang , Sijie Mai

CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models

Large Language Models (LLMs) need to adapt to the continuous changes in data, tasks, and user preferences. Due to their massive size and the high costs associated with training, LLMs are not suitable for frequent retraining. However,…

Computation and Language · Computer Science 2024-12-11 Dongfang Li , Zetian Sun , Xinshuo Hu , Baotian Hu , Min Zhang

AdLoCo: adaptive batching significantly improves communications efficiency and convergence for Large Language Models

Scaling distributed training of Large Language Models (LLMs) requires not only algorithmic advances but also efficient utilization of heterogeneous hardware resources. While existing methods such as DiLoCo have demonstrated promising…

Machine Learning · Computer Science 2025-08-26 Nikolay Kutuzov , Makar Baderko , Stepan Kulibaba , Artem Dzhalilov , Daniel Bobrov , Maxim Mashtaler , Alexander Gasnikov