English
Related papers

Related papers: Multi-modal Knowledge Distillation-based Human Tra…

200 papers

Accurate prediction of future human positions is an essential task for modern video-surveillance systems. Current state-of-the-art models usually rely on a "history" of past tracked locations (e.g., 3 to 5 seconds) to predict a plausible…

Computer Vision and Pattern Recognition · Computer Science 2022-03-10 Alessio Monti , Angelo Porrello , Simone Calderara , Pasquale Coscia , Lamberto Ballan , Rita Cucchiara

Scene information plays a crucial role in trajectory forecasting systems for autonomous driving by providing semantic clues and constraints on potential future paths of traffic agents. Prevalent trajectory prediction techniques often take…

Computer Vision and Pattern Recognition · Computer Science 2023-06-27 Yuning Wang , Pu Zhang , Lei Bai , Jianru Xue

Depth estimation and scene segmentation are two important tasks in intelligent transportation systems. A joint modeling of these two tasks will reduce the requirement for both the storage and training efforts. This work explores how the…

Machine Learning · Computer Science 2025-05-16 Tiancong Cheng , Ying Zhang , Yuxuan Liang , Roger Zimmermann , Zhiwen Yu , Bin Guo

There is a growing need for social robots and intelligent agents that can effectively interact with and support users. For the interactions to be seamless, the agents need to analyse social scenes and behavioural cues from their (robot's)…

Robotics · Computer Science 2025-10-28 Tongfei Bian , Mathieu Chollet , Tanaya Guha

Vision-language models (VLMs) have become a promising approach to enhancing perception and decision-making in autonomous driving. The gap remains in applying VLMs to understand complex scenarios interacting with pedestrians and efficient…

Computer Vision and Pattern Recognition · Computer Science 2025-07-31 Haoxiang Gao , Li Zhang , Yu Zhao , Zhou Yang , Jinghan Cao

Accurate prediction of future trajectories of traffic agents is essential for ensuring safe autonomous driving. However, partially observed trajectories can significantly degrade the performance of even state-of-the-art models. Previous…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Peng Shu , Pengfei Zhu , Mengshi Qi , Liang Liu

Long-term trajectory forecasting is an important and challenging problem in the fields of computer vision, machine learning, and robotics. One fundamental difficulty stands in the evolution of the trajectory that becomes more and more…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Sourav Das , Guglielmo Camporese , Shaokang Cheng , Lamberto Ballan

Deep learning achieved great progress recently, however, it is not easy or efficient to further improve its performance by increasing the size of the model. Multi-modal learning can mitigate this challenge by introducing richer and more…

Artificial Intelligence · Computer Science 2025-10-07 Cairong Zhao , Yufeng Jin , Zifan Song , Haonan Chen , Duoqian Miao , Guosheng Hu

Multimodal dataset distillation aims to construct compact synthetic datasets that enable efficient compression and knowledge transfer from large-scale image-text data. However, existing approaches often fail to capture the complex,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-27 Shengbin Guo , Hang Zhao , Senqiao Yang , Chenyang Jiang , Yuhang Cheng , Xiangru Peng , Rui Shao , Zhuotao Tian

Accurate traffic flow prediction is vital for optimizing urban mobility, yet it remains difficult in many cities due to complex spatio-temporal dependencies and limited high-quality data. While deep graph-based models demonstrate strong…

Machine Learning · Computer Science 2025-04-04 Chenyang Yu , Xinpeng Xie , Yan Huang , Chenxi Qiu

Knowledge distillation as an efficient knowledge transfer technique, has achieved remarkable success in unimodal scenarios. However, in cross-modal settings, conventional distillation methods encounter significant challenges due to data and…

Computer Vision and Pattern Recognition · Computer Science 2025-07-10 Hui Li , Pengfei Yang , Juanyang Chen , Le Dong , Yanxin Chen , Quan Wang

Trajectory prediction is a fundamental task in Autonomous Vehicles (AVs) and Intelligent Transportation Systems (ITS), supporting efficient motion planning and real-time traffic safety management. Diffusion models have recently demonstrated…

Artificial Intelligence · Computer Science 2025-10-02 Bingzhang Wang , Kehua Chen , Yinhai Wang

Online HD map construction is a fundamental task in autonomous driving systems, aiming to acquire semantic information of map elements around the ego vehicle based on real-time sensor inputs. Recently, several approaches have achieved…

Computer Vision and Pattern Recognition · Computer Science 2025-08-25 Ziyang Yan , Ruikai Li , Zhiyong Cui , Bohan Li , Han Jiang , Yilong Ren , Aoyong Li , Zhenning Li , Sijia Wen , Haiyang Yu

Knowledge distillation is used, in generative language modeling, to train a smaller student model using the help of a larger teacher model, resulting in improved capabilities for the student model. In this paper, we formulate a more general…

Computation and Language · Computer Science 2025-02-26 Guanlin Liu , Anand Ramachandran , Tanmay Gangwani , Yan Fu , Abhinav Sethy

Trajectory prediction remains a critical yet challenging component in autonomous driving systems, requiring sophisticated reasoning capabilities while meeting strict real-time deployment constraints. While knowledge distillation has…

Artificial Intelligence · Computer Science 2026-04-14 Wenchang Duan

Multimodal transfer learning aims to transform pretrained representations of diverse modalities into a common domain space for effective multimodal fusion. However, conventional systems are typically built on the assumption that all…

Computer Vision and Pattern Recognition · Computer Science 2023-09-28 Yanan Wang , Donghuo Zeng , Shinya Wada , Satoshi Kurihara

Machine learning has proved to be very successful for making predictions in travel behavior modeling. However, most machine-learning models have complex model structures and offer little or no explanation as to how they arrive at these…

Machine Learning · Statistics 2019-10-31 Xilei Zhao , Zhengze Zhou , Xiang Yan , Pascal Van Hentenryck

In autonomous vehicle (AV) technology, the ability to accurately predict the movements of surrounding vehicles is paramount for ensuring safety and operational efficiency. Incorporating human decision-making insights enables AVs to more…

Artificial Intelligence · Computer Science 2024-03-01 Haicheng Liao , Yongkang Li , Zhenning Li , Chengyue Wang , Zhiyong Cui , Shengbo Eben Li , Chengzhong Xu

Deep learning methods usually require a large amount of training data and lack interpretability. In this paper, we propose a novel knowledge distillation and model interpretation framework for medical image classification that jointly…

Computer Vision and Pattern Recognition · Computer Science 2022-01-13 Thanh Nguyen-Duc , He Zhao , Jianfei Cai , Dinh Phung

The significance of mental health classification is paramount in contemporary society, where digital platforms serve as crucial sources for monitoring individuals' well-being. However, existing social media mental health datasets primarily…

Computation and Language · Computer Science 2024-11-08 Rina Carines Cabral , Siwen Luo , Josiah Poon , Soyeon Caren Han
‹ Prev 1 2 3 10 Next ›