Related papers: Object-Attribute-Relation Representation Based Vid…

200 papers

Traditional video coding (VVC, HEVC) prioritizes human visual perception, transmitting substantial texture redundancy that severely hinders machine decision-making under constrained bandwidths. In dynamic channels, this redundancy causes…

Signal Processing · Electrical Eng. & Systems 2026-04-10 Chenxing Li, Yiping Duan, Han Jiao, Xiaoming Tao, Weiyao Lin, Mingquan Lu

Despite the widespread adoption of vision sensors in edge applications, such as surveillance, the transmission of video data consumes substantial spectrum resources. Semantic communication (SC) offers a solution by extracting and…

Computer Vision and Pattern Recognition · Computer Science 2026-01-06 Yubo Peng, Luping Xiang, Kun Yang, Kezhi Wang, Merouane Debbah

In Earth observation (EO) missions with Low Earth orbit (LEO) satellites, high-resolution image acquisition generates a massive data volume that poses a significant challenge for transmission under the limited satellite power budget, while…

Signal Processing · Electrical Eng. & Systems 2026-03-13 Hung Nguyen-Kha, Ti Ti Nguyen, Vu Nguyen Ha, Eva Lagunas, Symeon Chatzinotas, Bjorn Ottersten

Semantic communication has undergone considerable evolution due to the recent rapid development of artificial intelligence (AI), significantly enhancing both communication robustness and efficiency. Despite these advancements, most current…

Image and Video Processing · Electrical Eng. & Systems 2024-05-24 Jiarun Ding, Peiwen Jiang, Chao-Kai Wen, Shi Jin

3D Semantic Scene Graph Prediction aims to detect objects and their semantic relationships in 3D scenes, and has emerged as a crucial technology for robotics and AR/VR applications. While previous research has addressed dataset limitations…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 KunHo Heo, GiHyun Kim, SuYeon Kim, MyeongAh Cho

Task-oriented image semantic communication is a new communication paradigm, which aims to transmit semantics for artificial intelligence (AI) tasks while ignoring the reconstruction quality of the images. However, in some applications, such…

Information Theory · Computer Science 2022-12-05 Fangfang Liu, Wanjie Tong, Yang Yang, Zhengfen Sun, Caili Guo

Accurate and timely image transmission is critical for emerging time-sensitive applications such as remote sensing in satellite-assisted Internet of Things. However, the bandwidth limitation poses a significant challenge in existing…

Signal Processing · Electrical Eng. & Systems 2025-09-25 Xiaolei Yang, Zijing Wang, Zhijin Qin, Xiaoming Tao

Traditional video captioning requests a holistic description of the video, yet the detailed descriptions of the specific objects may not be available. Without associating the moving trajectories, these image-based data-driven methods cannot…

Computer Vision and Pattern Recognition · Computer Science 2020-07-15 Fangyi Zhu, Jenq-Neng Hwang, Zhanyu Ma, Guang Chen, Jun Guo

Semantic communications has received growing interest since it can remarkably reduce the amount of data to be transmitted without missing critical information. Most existing works explore the semantic encoding and transmission for text and…

Computer Vision and Pattern Recognition · Computer Science 2022-08-09 Danlan Huang, Feifei Gao, Xiaoming Tao, Qiyuan Du, Jianhua Lu

The advent of 6G networks demands unprecedented levels of intelligence, adaptability, and efficiency to address challenges such as ultra-high-speed data transmission, ultra-low latency, and massive connectivity in dynamic environments.…

Computer Vision and Pattern Recognition · Computer Science 2025-06-17 Chen Zhu, Kang Liang, Jianrong Bao, Zhouxiang Zhao, Zhaohui Yang, Zhaoyang Zhang, Mohammad Shikh-Bahaei

With the rapid advancement of image captioning and visual question answering at single-round level, the question of how to generate multi-round dialogue about visual content has not yet been well explored. Existing visual dialogue methods…

Computer Vision and Pattern Recognition · Computer Science 2020-06-16 Ziwei Wang, Zi Huang, Yadan Luo, Huimin Lu

Semantic communications is considered as a promising technology to increase the efficiency of next-generation communication systems, particularly targeting human-machine and machine-type communications. In contrast to the source-agnostic…

Information Theory · Computer Science 2023-07-20 Jialong Xu, Tze-Yang Tung, Bo Ai, Wei Chen, Yuxuan Sun, Deniz Gunduz

We present an AI-based framework for semantic transmission of multimedia data over band-limited, time-varying channels. The method targets scenarios where large content is split into multiple packets, with an unknown number potentially…

Multimedia · Computer Science 2026-01-29 Homa Esfahanizadeh, Nargis Fayaz, Jinfeng Du, Harish Viswanathan

Semantic- and task-oriented communication has emerged as a promising approach to reducing the latency and bandwidth requirements of next-generation mobile networks by transmitting only the most relevant information needed to complete a…

Information Theory · Computer Science 2024-09-27 Deniz Gündüz, Michèle A. Wigger, Tze-Yang Tung, Ping Zhang, Yong Xiao

Recently, by introducing large-scale datasets and strong transformer networks, video-language pre-training has shown great success, especially for retrieval. Yet, existing video-language transformer models do not explicitly fine-grained…

Computer Vision and Pattern Recognition · Computer Science 2022-05-19 Alex Jinpeng Wang, Yixiao Ge, Guanyu Cai, Rui Yan, Xudong Lin, Ying Shan, Xiaohu Qie, Mike Zheng Shou

Video question answering (Video QA) presents a powerful testbed for human-like intelligent behaviors. The task demands new capabilities to integrate video processing, language understanding, binding abstract linguistic concepts to concrete…

Computer Vision and Pattern Recognition · Computer Science 2021-07-12 Long Hoang Dang, Thao Minh Le, Vuong Le, Truyen Tran

Semantic communication is an increasingly popular framework for wireless image transmission due to its high communication efficiency. With the aid of the joint-source-and-channel (JSC) encoder implemented by neural network, semantic…

Information Theory · Computer Science 2022-12-02 Maojun Zhang, Yang Li, Zezhong Zhang, Guangxu Zhu, Caijun Zhong

Wireless extended reality (XR) has attracted wide attention as a promising technology to improve users' mobility and quality of experience. However, the ultra-high data rate requirement of wireless XR has hindered its development for many…

Signal Processing · Electrical Eng. & Systems 2023-03-14 Bowen Zhang, Zhijin Qin, Geoffrey Ye Li

This paper studies an end-to-end video semantic communication system for massive communication. In the considered system, the transmitter must continuously send the video to the receiver to facilitate character reconstruction in immersive…

Networking and Internet Architecture · Computer Science 2024-02-05 Haopeng Li, Haonan Tong, Sihua Wang, Nuocheng Yang, Zhaohui Yang, Changchuan Yin

This paper studies referring video object segmentation (RVOS) by boosting video-level visual-linguistic alignment. Recent approaches model the RVOS task as a sequence prediction problem and perform multi-modal interaction as well as…

Computer Vision and Pattern Recognition · Computer Science 2023-05-29 Zhuoyan Luo, Yicheng Xiao, Yong Liu, Shuyan Li, Yitong Wang, Yansong Tang, Xiu Li, Yujiu Yang