Related papers: LLM-assisted Agentic Edge Intelligence Framework

Agentic AI Reasoning for Mobile Edge General Intelligence: Fundamentals, Approaches, and Directions

The rapid advancement of large language models (LLMs) has enabled an emergence of agentic artificial intelligence (AI) with powerful reasoning and autonomous decision-making capabilities. This integration with edge computing has led to the…

Artificial Intelligence · Computer Science 2026-02-10 Mingyi Luo, Ruichen Zhang, Xiangwang Hou, Jun Du, Chunxiao Jiang, Yong Ren, Dusit Niyato, Shiwen Mao

Mobile Edge Intelligence for Large Language Models: A Contemporary Survey

On-device large language models (LLMs), referring to running LLMs on edge devices, have raised considerable interest since they are more cost-effective, latency-efficient, and privacy-preserving compared with the cloud paradigm.…

Networking and Internet Architecture · Computer Science 2025-03-21 Guanqiao Qu, Qiyuan Chen, Wei Wei, Zheng Lin, Xianhao Chen, Kaibin Huang

Towards Edge General Intelligence via Large Language Models: Opportunities and Challenges

Edge Intelligence (EI) has been instrumental in delivering real-time, localized services by leveraging the computational capabilities of edge networks. The integration of Large Language Models (LLMs) empowers EI to evolve into the next…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-07 Handi Chen, Weipeng Deng, Shuo Yang, Jinfeng Xu, Zhihan Jiang, Edith C. H. Ngai, Jiangchuan Liu, Xue Liu

Toward Edge General Intelligence with Multiple-Large Language Model (Multi-LLM): Architecture, Trust, and Orchestration

Edge computing enables real-time data processing closer to its source, thus improving the latency and performance of edge-enabled AI applications. However, traditional AI models often fall short when dealing with complex, dynamic tasks that…

Networking and Internet Architecture · Computer Science 2025-07-02 Haoxiang Luo, Yinqiu Liu, Ruichen Zhang, Jiacheng Wang, Gang Sun, Dusit Niyato, Hongfang Yu, Zehui Xiong, Xianbin Wang, Xuemin Shen

Cognitive Edge Computing: A Comprehensive Survey on Optimizing Large Models and AI Agents for Pervasive Deployment

This article surveys Cognitive Edge Computing as a practical and methodical pathway for deploying reasoning-capable Large Language Models (LLMs) and autonomous AI agents on resource-constrained devices at the network edge. We present a…

Machine Learning · Computer Science 2025-11-10 Xubin Wang, Qing Li, Weijia Jia

Adaptive AI Agent Placement and Migration in Edge Intelligence Systems

The rise of LLMs such as ChatGPT and Claude fuels the need for AI agents capable of real-time task handling. However, migrating data-intensive, multi-modal edge workloads to cloud data centers, traditionally used for agent deployment,…

Artificial Intelligence · Computer Science 2025-08-06 Xingdan Wang, Jiayi He, Zhiqing Tang, Jianxiong Guo, Jiong Lou, Liping Qian, Tian Wang, Weijia Jia

EdgeReasoning: Characterizing Reasoning LLM Deployment on Edge GPUs

Edge intelligence paradigm is increasingly demanded by the emerging autonomous systems, such as robotics. Beyond ensuring privacy-preserving operation and resilience in connectivity-limited environments, edge deployment offers significant…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-05 Benjamin Kubwimana, Qijing Huang

EdgeFM: Efficient Edge Inference for Vision-Language Models

Vision-language models (VLMs) have demonstrated strong applicability in edge industrial applications, yet their deployment remains severely constrained by requirements for deterministic low latency and stable execution under resource…

Computer Vision and Pattern Recognition · Computer Science 2026-05-01 Mengling Deng, Yuanpeng Chen, Sheng Yang, Wei Tao, Wenhai Zhang, Hui Song, Linyuanhao Qin, Kai Zhao, Xiaojun Ye, Shanhui Mo, Jingli Fan, Shuang Zhang, Bei Liu, Tiankun Zhao, Xiangjing An

Distributed Threat Intelligence at the Edge Devices: A Large Language Model-Driven Approach

With the proliferation of edge devices, there is a significant increase in attack surface on these devices. The decentralized deployment of threat intelligence on edge devices, coupled with adaptive machine learning techniques such as the…

Cryptography and Security · Computer Science 2024-10-10 Syed Mhamudul Hasan, Alaa M. Alotaibi, Sajedul Talukder, Abdur R. Shahid

AI Flow at the Network Edge

Recent advancements in large language models (LLMs) and their multimodal variants have led to remarkable progress across various domains, demonstrating impressive capabilities and unprecedented potential. In the era of ubiquitous…

Signal Processing · Electrical Eng. & Systems 2025-02-14 Jiawei Shao, Xuelong Li

CLONE: Customizing LLMs for Efficient Latency-Aware Inference at the Edge

Deploying large language models (LLMs) on edge devices is crucial for delivering fast responses and ensuring data privacy. However, the limited storage, weight, and power of edge devices make it difficult to deploy LLM-powered applications.…

Hardware Architecture · Computer Science 2025-06-04 Chunlin Tian, Xinpeng Qin, Kahou Tam, Li Li, Zijian Wang, Yuanzhe Zhao, Minglei Zhang, Chengzhong Xu

Deep-Edge: An Efficient Framework for Deep Learning Model Update on Heterogeneous Edge

Deep Learning (DL) model-based AI services are increasingly offered in a variety of predictive analytics services such as computer vision, natural language processing, speech recognition. However, the quality of the DL models can degrade…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-11-04 Anirban Bhattacharjee, Ajay Dev Chhokra, Hongyang Sun, Shashank Shekhar, Aniruddha Gokhale, Gabor Karsai, Abhishek Dubey

EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting

Efficient adaption of large language models (LLMs) on edge devices is essential for applications requiring continuous and privacy-preserving adaptation and inference. However, existing tuning techniques fall short because of the high…

Machine Learning · Computer Science 2024-06-25 Zhongzhi Yu, Zheng Wang, Yuhan Li, Haoran You, Ruijie Gao, Xiaoya Zhou, Sreenidhi Reedy Bommu, Yang Katie Zhao, Yingyan Celine Lin

Networking-Aware Energy Efficiency in Agentic AI Inference: A Survey

The rapid emergence of Large Language Models (LLMs) has catalyzed Agentic artificial intelligence (AI), autonomous systems integrating perception, reasoning, and action into closed-loop pipelines for continuous adaptation. While unlocking…

Systems and Control · Electrical Eng. & Systems 2026-04-10 Xiaojing Chen, Haiqi Yu, Wei Ni, Dusit Niyato, Ruichen Zhang, Xin Wang, Shunqing Zhang, Shugong Xu

A Review on Edge Large Language Models: Design, Execution, and Applications

Large language models (LLMs) have revolutionized natural language processing with their exceptional understanding, synthesizing, and reasoning capabilities. However, deploying LLMs on resource-constrained edge devices presents significant…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-02-25 Yue Zheng, Yuhao Chen, Bin Qian, Xiufang Shi, Yuanchao Shu, Jiming Chen

EdgeShard: Efficient LLM Inference via Collaborative Edge Computing

Large language models (LLMs) have shown great potential in natural language processing and content generation. However, current LLMs heavily rely on cloud computing, leading to prolonged latency, high bandwidth cost, and privacy concerns.…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-24 Mingjin Zhang, Jiannong Cao, Xiaoming Shen, Zeyang Cui

EdgeFlow: Fast Cold Starts for LLMs on Mobile Devices

Deploying large language models (LLMs) on mobile devices is an emerging trend to enable data privacy and offline accessibility of LLM applications. Modern mobile neural processing units (NPUs) make such deployment increasingly feasible.…

Operating Systems · Computer Science 2026-04-13 Yongsheng Yan, Jiacheng Shen, Xuchuan Luo, Yangfan Zhou

Edge-First Language Model Inference: Models, Metrics, and Tradeoffs

The widespread adoption of Language Models (LMs) across industries is driving interest in deploying these services across the computing continuum, from the cloud to the network edge. This shift aims to reduce costs, lower latency, and…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-05-30 SiYoung Jang, Roberto Morabito

Large Language Models Empowered Autonomous Edge AI for Connected Intelligence

The evolution of wireless networks gravitates towards connected intelligence, a concept that envisions seamless interconnectivity among humans, objects, and intelligence in a hyper-connected cyber-physical world. Edge artificial…

Information Theory · Computer Science 2023-12-27 Yifei Shen, Jiawei Shao, Xinjie Zhang, Zehong Lin, Hao Pan, Dongsheng Li, Jun Zhang, Khaled B. Letaief

Multi-Agentic AI for Fairness-Aware and Accelerated Multi-modal Large Model Inference in Real-world Mobile Edge Networks

Generative AI (GenAI) has transformed applications in natural language processing and content creation, yet centralized inference remains hindered by high latency, limited customizability, and privacy concerns. Deploying large models (LMs)…

Systems and Control · Electrical Eng. & Systems 2026-02-10 Haiyuan Li, Hari Madhukumar, Shuangyi Yan, Yulei Wu, Dimitra Simeonidou