Related papers: Cloud-Device Collaborative Learning for Multimodal…

200 papers

With the rapid development of storage and computing power on mobile devices, it becomes critical and popular to deploy models on devices to save onerous communication latencies and to capture real-time features. While quite a lot of works…

Machine Learning · Computer Science · 2021-06-18 · Jiangchao Yao, Feng Wang, KunYang Jia, Bo Han, Jingren Zhou, Hongxia Yang

Many large vision models have been deployed on the cloud for real-time services. Meanwhile, fresh samples are continuously generated on the served mobile device. How to leverage the device-side samples to improve the cloud-side large model…

Machine Learning · Computer Science · 2023-03-21 · Yucheng Ding, Chaoyue Niu, Fan Wu, Shaojie Tang, Chengfei Lyu, Guihai Chen

When facing changing environments in the real world, the lightweight model on client devices suffers from severe performance drops under distribution shifts. The main limitations of the existing device model lie in (1) unable to update due…

Computer Vision and Pattern Recognition · Computer Science · 2022-12-05 · Yulu Gan, Mingjie Pan, Rongyu Zhang, Zijian Ling, Lingran Zhao, Jiaming Liu, Shanghang Zhang

Device-cloud collaboration holds promise for deploying large language models (LLMs), leveraging lightweight on-device models for efficiency while relying on powerful cloud models for superior reasoning. A central challenge in this setting…

Machine Learning · Computer Science · 2026-05-26 · Wenzhi Fang, Dong-Jun Han, Liangqi Yuan, Evan Chen, Christopher Brinton

This paper presents the first-ever study of adapting compressed image latents to suit the needs of downstream vision tasks that adopt Multimodal Large Language Models (MLLMs). MLLMs have extended the success of large language models to…

Computer Vision and Pattern Recognition · Computer Science · 2025-02-18 · Chia-Hao Kao, Cheng Chien, Yu-Jen Tseng, Yi-Hsin Chen, Alessandro Gnutti, Shao-Yuan Lo, Wen-Hsiao Peng, Riccardo Leonardi

In our increasingly interconnected world, where intelligent devices continually amass copious personalized multi-modal data, a pressing need arises to deliver high-quality, personalized device-aware services. However, this endeavor presents…

Distributed, Parallel, and Cluster Computing · Computer Science · 2024-11-20 · Wei Ji, Li Li, Zheqi Lv, Wenqiao Zhang, Mengze Li, Zhen Wan, Wenqiang Lei, Roger Zimmermann

Pervasive mobile AI applications primarily employ one of the two learning paradigms: cloud-based learning (with powerful large models) or on-device learning (with lightweight small models). Despite their own advantages, neither paradigm can…

Machine Learning · Computer Science · 2023-11-21 · Yan Zhuang, Zhenzhe Zheng, Yunfeng Shao, Bingshuai Li, Fan Wu, Guihai Chen

Large language models (LLMs) are transforming society, powering applications from smartphone assistants to autonomous driving. Yet cloud-based LLM services alone cannot serve a growing class of applications, including those operating under…

Signal Processing · Electrical Eng. & Systems · 2026-05-12 · Liangqi Yuan, Wenzhi Fang, Shiqiang Wang, H. Vincent Poor, Christopher G. Brinton

Practical cloud-edge deployment of Cross-Modal Re-identification (CM-ReID) faces challenges due to maintaining a fragmented ecosystem of specialized cloud models for diverse modalities. While Multi-Modal Large Language Models (MLLMs) offer…

Computer Vision and Pattern Recognition · Computer Science · 2026-02-16 · Hongbo Jiang, Jie Li, Xinqi Cai, Tianyu Xie, Yunhang Shen, Pingyang Dai, Liujuan Cao

With the advancement of large language models (LLMs), significant progress has been achieved in various Natural Language Processing (NLP) tasks. However, existing LLMs still face two major challenges that hinder their broader adoption: (1)…

Information Retrieval · Computer Science · 2026-01-28 · Zhaofeng Zhong, Wei Yuan, Liang Qu, Tong Chen, Hao Wang, Xiangyu Zhao, Hongzhi Yin

Traditional object detection methods face performance degradation challenges in complex scenarios such as low-light conditions and heavy occlusions due to a lack of high-level semantic understanding. To address this, this paper proposes an…

Computer Vision and Pattern Recognition · Computer Science · 2025-09-25 · Yunqing Hu, Zheming Yang, Chang Zhao, Wen Ji

On-device recommendation is critical for a number of real-world applications, especially in scenarios that have agreements on execution latency, user privacy, and robust functionality when internet connectivity is unstable or even…

Information Retrieval · Computer Science · 2026-01-15 · Xin Xia, Hongzhi Yin, Shane Culpepper

Multimodal large language models (MLLMs) demonstrate exceptional capabilities in semantic understanding and visual reasoning, yet they still face challenges in precise object localization and resource-constrained edge-cloud deployment. To…

Computer Vision and Pattern Recognition · Computer Science · 2026-01-09 · Yunqing Hu, Zheming Yang, Chang Zhao, Qi Guo, Meng Gao, Pengcheng Li, Wen Ji

The success of large-scale visual language pretraining (VLP) models has driven widespread adoption of image-text retrieval tasks. However, their deployment on mobile devices remains limited due to large model sizes and computational…

Computer Vision and Pattern Recognition · Computer Science · 2025-09-03 · Yuqi Li, Chuanguang Yang, Junhao Dong, Zhengtao Yao, Haoyan Xu, Zeyu Dong, Hansheng Zeng, Zhulin An, Yingli Tian

Large Language Models (LLMs) exhibit remarkable human-like predictive capabilities. However, it is challenging to deploy LLMs to provide efficient and adaptive inference services at the edge. This paper proposes a novel Cloud-Edge…

Distributed, Parallel, and Cluster Computing · Computer Science · 2025-06-10 · Hongpeng Jin, Yanzhao Wu

Multimodal Large Language Models (MLLMs) have demonstrated substantial value in unified text-image understanding and reasoning, primarily by converting images into sequences of patch-level tokens that align with their architectural…

Computer Vision and Pattern Recognition · Computer Science · 2025-11-25 · Xinliang Zhang, Lei Zhu, Hangzhou He, Shuang Zeng, Ourui Fu, Jiakui Hu, Zhengjian Yao, Yanye Lu

With the rapid evolution of Large Language Models (LLMs) and their large-scale experimentation in cloud-computing spaces, the challenge of guaranteeing their security and efficiency in a failure scenario has become a main issue. To ensure…

Distributed, Parallel, and Cluster Computing · Computer Science · 2025-03-18 · Yihong Jin, Ze Yang, Xinhe Xu, Yihan Zhang, Shuyang Ji

Multimodal Affective Computing (MAC) aims to recognize and interpret human emotions by integrating information from diverse modalities such as text, video, and audio. Recent advancements in Multimodal Large Language Models (MLLMs) have…

Artificial Intelligence · Computer Science · 2025-08-05 · Miaosen Luo, Jiesen Long, Zequn Li, Yunying Yang, Yuncheng Jiang, Sijie Mai

Large Language Models (LLMs) need to adapt to the continuous changes in data, tasks, and user preferences. Due to their massive size and the high costs associated with training, LLMs are not suitable for frequent retraining. However,…

Computation and Language · Computer Science · 2024-12-11 · Dongfang Li, Zetian Sun, Xinshuo Hu, Baotian Hu, Min Zhang

Scaling distributed training of Large Language Models (LLMs) requires not only algorithmic advances but also efficient utilization of heterogeneous hardware resources. While existing methods such as DiLoCo have demonstrated promising…
