Related papers: Cross-Modal Knowledge Transfer Without Task-Releva…

200 papers

Source-free cross-modal knowledge transfer is a crucial yet challenging task, which aims to transfer knowledge from one source modality (e.g., RGB) to the target modality (e.g., depth or infrared) with no access to the task-relevant (TR)…

Computer Vision and Pattern Recognition · Computer Science 2024-01-11 Jinjing Zhu, Yucheng Chen, Lin Wang

Neuromorphic spike data, an upcoming modality with high temporal resolution, has shown promising potential in autonomous driving by mitigating the challenges posed by high-velocity motion blur. However, training the spike depth estimation…

Computer Vision and Pattern Recognition · Computer Science 2024-07-22 Jiaming Liu, Qizhe Zhang, Xiaoqi Li, Jianing Li, Guanqun Wang, Ming Lu, Tiejun Huang, Shanghang Zhang

This paper strives for action recognition and detection in video modalities like RGB, depth maps or 3D-skeleton sequences when only limited modality-specific labeled examples are available. For the RGB, and derived optical-flow, modality…

Computer Vision and Pattern Recognition · Computer Science 2021-08-10 Fida Mohammad Thoker, Cees G. M. Snoek

Commonsense reasoning systems should be able to generalize to diverse reasoning cases. However, most state-of-the-art approaches depend on expensive data annotations and overfit to a specific benchmark without learning how to perform…

Artificial Intelligence · Computer Science 2022-06-23 Yu Jin Kim, Beong-woo Kwak, Youngwook Kim, Reinald Kim Amplayo, Seung-won Hwang, Jinyoung Yeo

Existing color-guided depth super-resolution (DSR) approaches require paired RGB-D data as training samples where the RGB image is used as structural guidance to recover the degraded depth map due to their geometrical similarity. However,…

Computer Vision and Pattern Recognition · Computer Science 2021-03-25 Baoli Sun, Xinchen Ye, Baopu Li, Haojie Li, Zhihui Wang, Rui Xu

In this study, we focus on heterogeneous knowledge transfer across entirely different model architectures, tasks, and modalities. Existing knowledge transfer methods (e.g., backbone sharing, knowledge distillation) often hinge on shared…

Machine Learning · Computer Science 2024-12-30 Kunxi Li, Tianyu Zhan, Kairui Fu, Shengyu Zhang, Kun Kuang, Jiwei Li, Zhou Zhao, Fan Wu, Fei Wu

Multimodal sensing has proven valuable for visual tracking, as different sensor types offer unique strengths in handling one specific challenging scene where object appearance varies. While a generalist model capable of leveraging all…

Computer Vision and Pattern Recognition · Computer Science 2024-12-02 Yuedong Tan, Zongwei Wu, Yuqian Fu, Zhuyun Zhou, Guolei Sun, Eduard Zamfi, Chao Ma, Danda Pani Paudel, Luc Van Gool, Radu Timofte

Many practical applications require training of semantic segmentation models on unlabelled datasets and their execution on low-resource hardware. Distillation from a trained source model may represent a solution for the first but does not…

Computer Vision and Pattern Recognition · Computer Science 2024-02-21 Claudia Cuttano, Antonio Tavera, Fabio Cermelli, Giuseppe Averta, Barbara Caputo

Cross-modality image segmentation aims to segment the target modalities using a method designed in the source modality. Deep generative models can translate the target modality images into the source modality, thus enabling cross-modality…

Image and Video Processing · Electrical Eng. & Systems 2024-04-11 Zihao Wang, Yingyu Yang, Yuzhou Chen, Tingting Yuan, Maxime Sermesant, Herve Delingette, Ona Wu

Transferring knowledge from a source domain to a target domain can be crucial for whole slide image classification, since the number of samples in a dataset is often limited due to high annotation costs. However, domain shift and task…

Computer Vision and Pattern Recognition · Computer Science 2024-07-12 Conghao Xiong, Yi Lin, Hao Chen, Hao Zheng, Dong Wei, Yefeng Zheng, Joseph J. Y. Sung, Irwin King

Recently, general salient object detection (SOD) has made great progress with the rapid development of deep neural networks. However, task-aware SOD has hardly been studied due to the lack of task-specific datasets. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2021-05-19 Jinming Su, Changqun Xia, Jia Li

Transfer learning is crucial in training deep neural networks on new target tasks. Current transfer learning methods always assume at least one of (i) source and target task label spaces overlap, (ii) source datasets are available, and…

Machine Learning · Computer Science 2025-02-21 Shin'ya Yamaguchi, Sekitoshi Kanai, Atsutoshi Kumagai, Daiki Chijiwa, Hisashi Kashima

Cross-modal knowledge distillation deals with transferring knowledge from a model trained with superior modalities (Teacher) to another model trained with weak modalities (Student). Existing approaches require paired training examples exist…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Long Zhao, Xi Peng, Yuxiao Chen, Mubbasir Kapadia, Dimitris N. Metaxas

With the development of foundation models such as large language models, zero-shot transfer learning has become increasingly significant. This is highlighted by the generative capabilities of NLP models like GPT-4, and the retrieval-based…

Machine Learning · Computer Science 2024-06-25 Yuhan Li, Peisong Wang, Zhixun Li, Jeffrey Xu Yu, Jia Li

Cross-media retrieval is a research hotspot in multimedia area, which aims to perform retrieval across different media types such as image and text. The performance of existing methods usually relies on labeled data for model training.…

Multimedia · Computer Science 2018-03-13 Xin Huang, Yuxin Peng

Multi-label zero-shot classification aims to predict multiple unseen class labels for an input image. It is more challenging than its single-label counterpart. On one hand, the unconstrained number of labels assigned to each image makes the…

Computer Vision and Pattern Recognition · Computer Science 2020-08-03 He Huang, Yuanwei Chen, Wei Tang, Wenhao Zheng, Qing-Guo Chen, Yao Hu, Philip Yu

Recently, although pre-trained language models have achieved great success on multilingual NLP (Natural Language Processing) tasks, the lack of training data on many tasks in low-resource languages still limits their performance. One…

Computation and Language · Computer Science 2023-10-10 Yuyang Zhang, Xiaofeng Han, Baojun Wang

In this work, we propose to utilize Convolutional Neural Networks to boost the performance of depth-induced salient object detection by capturing the high-level representative features for depth modality. We formulate the depth-induced…

Computer Vision and Pattern Recognition · Computer Science 2017-06-01 Hao Chen, Y. F. Li, Dan Su

Although existing cross-domain continual learning approaches successfully address many streaming tasks having domain shifts, they call for a fully labeled source domain hindering their feasibility in the privacy constrained environments.…

In reinforcement learning for visual navigation, it is common to develop a model for each new task, and train that model from scratch with task-specific interactions in 3D environments. However, this process is expensive; massive amounts of…

Computer Vision and Pattern Recognition · Computer Science 2022-05-02 Ziad Al-Halah, Santhosh K. Ramakrishnan, Kristen Grauman