Related papers: Multi-Semantic Interactive Learning for Object Det…

Selective Multi-Scale Learning for Object Detection

Pyramidal networks are standard methods for multi-scale object detection. Current researches on feature pyramid networks usually adopt layer connections to collect features from certain levels of the feature hierarchy, and do not consider…

Computer Vision and Pattern Recognition · Computer Science 2022-06-17 Junliang Chen , Weizeng Lu , Linlin Shen

Learning to recognize occluded and small objects with partial inputs

Recognizing multiple objects in an image is challenging due to occlusions, and becomes even more so when the objects are small. While promising, existing multi-label image recognition models do not explicitly learn context-based…

Computer Vision and Pattern Recognition · Computer Science 2023-10-31 Hasib Zunair , A. Ben Hamza

Visual saliency estimation by integrating features using multiple kernel learning

In the last few decades, significant achievements have been attained in predicting where humans look at images through different computational models. However, how to determine contributions of different visual features to overall saliency…

Computer Vision and Pattern Recognition · Computer Science 2013-07-23 Yasin Kavak , Erkut Erdem , Aykut Erdem

Multiple Instance Curriculum Learning for Weakly Supervised Object Detection

When supervising an object detector with weakly labeled data, most existing approaches are prone to trapping in the discriminative object parts, e.g., finding the face of a cat instead of the full body, due to lacking the supervision on the…

Computer Vision and Pattern Recognition · Computer Science 2017-11-28 Siyang Li , Xiangxin Zhu , Qin Huang , Hao Xu , C. -C. Jay Kuo

Object Relation Detection Based on One-shot Learning

Detecting the relations among objects, such as "cat on sofa" and "person ride horse", is a crucial task in image understanding, and beneficial to bridging the semantic gap between images and natural language. Despite the remarkable progress…

Computer Vision and Pattern Recognition · Computer Science 2018-07-17 Li Zhou , Jian Zhao , Jianshu Li , Li Yuan , Jiashi Feng

Multi-View Class Incremental Learning

Multi-view learning (MVL) has gained great success in integrating information from multiple perspectives of a dataset to improve downstream task performance. To make MVL methods more practical in an open-ended environment, this paper…

Machine Learning · Computer Science 2023-10-16 Depeng Li , Tianqi Wang , Junwei Chen , Kenji Kawaguchi , Cheng Lian , Zhigang Zeng

Interactive Masked Image Modeling for Multimodal Object Detection in Remote Sensing

Object detection in remote sensing imagery plays a vital role in various Earth observation applications. However, unlike object detection in natural scene images, this task is particularly challenging due to the abundance of small, often…

Computer Vision and Pattern Recognition · Computer Science 2024-09-16 Minh-Duc Vu , Zuheng Ming , Fangchen Feng , Bissmella Bahaduri , Anissa Mokraoui

Multi-scale Interactive Network for Salient Object Detection

Deep-learning based salient object detection methods achieve great progress. However, the variable scale and unknown category of salient objects are great challenges all the time. These are closely related to the utilization of multi-level…

Computer Vision and Pattern Recognition · Computer Science 2020-07-20 Youwei Pang , Xiaoqi Zhao , Lihe Zhang , Huchuan Lu

C-MIL: Continuation Multiple Instance Learning for Weakly Supervised Object Detection

Weakly supervised object detection (WSOD) is a challenging task when provided with image category supervision but required to simultaneously learn object locations and object detectors. Many WSOD approaches adopt multiple instance learning…

Computer Vision and Pattern Recognition · Computer Science 2019-04-12 Fang Wan , Chang Liu , Wei Ke , Xiangyang Ji , Jianbin Jiao , Qixiang Ye

An Iterative Classification and Semantic Segmentation Network for Old Landslide Detection Using High-Resolution Remote Sensing Images

Huge challenges exist for old landslide detection because their morphology features have been partially or strongly transformed over a long time and have little difference from their surrounding. Besides, small-sample problem also restrict…

Computer Vision and Pattern Recognition · Computer Science 2023-04-25 Zili Lu , Yuexing Peng , Wei Li , Junchuan Yu , Daqing Ge , Wei Xiang

Fully Convolutional Multi-Class Multiple Instance Learning

Multiple instance learning (MIL) can reduce the need for costly annotation in tasks such as semantic segmentation by weakening the required degree of supervision. We propose a novel MIL formulation of multi-class semantic segmentation…

Computer Vision and Pattern Recognition · Computer Science 2015-04-16 Deepak Pathak , Evan Shelhamer , Jonathan Long , Trevor Darrell

Improving Long-tailed Object Detection with Image-Level Supervision by Multi-Task Collaborative Learning

Data in real-world object detection often exhibits the long-tailed distribution. Existing solutions tackle this problem by mitigating the competition between the head and tail categories. However, due to the scarcity of training samples,…

Computer Vision and Pattern Recognition · Computer Science 2022-10-12 Bo Li , Yongqiang Yao , Jingru Tan , Xin Lu , Fengwei Yu , Ye Luo , Jianwei Lu

Interactive multiclass segmentation using superpixel classification

This paper adresses the problem of interactive multiclass segmentation. We propose a fast and efficient new interactive segmentation method called Superpixel Classification-based Interactive Segmentation (SCIS). From a few strokes drawn by…

Computer Vision and Pattern Recognition · Computer Science 2015-10-13 Bérengère Mathieu , Alain Crouzil , Jean-Baptiste Puel

Multi-Scale Saliency Detection using Dictionary Learning

Saliency detection has drawn a lot of attention of researchers in various fields over the past several years. Saliency is the perceptual quality that makes an object, person to draw the attention of humans at the very sight. Salient object…

Computer Vision and Pattern Recognition · Computer Science 2017-07-06 Shubham Pachori

MetaMIML: Meta Multi-Instance Multi-Label Learning

Multi-Instance Multi-Label learning (MIML) models complex objects (bags), each of which is associated with a set of interrelated labels and composed with a set of instances. Current MIML solutions still focus on a single-type of objects and…

Machine Learning · Computer Science 2021-11-09 Yuanlin Yang , Guoxian Yu , Jun Wang , Lei Liu , Carlotta Domeniconi , Maozu Guo

Semantic Representation and Dependency Learning for Multi-Label Image Recognition

Recently many multi-label image recognition (MLR) works have made significant progress by introducing pre-trained object detection models to generate lots of proposals or utilizing statistical label co-occurrence enhance the correlation…

Computer Vision and Pattern Recognition · Computer Science 2023-01-10 Tao Pu , Mingzhan Sun , Hefeng Wu , Tianshui Chen , Ling Tian , Liang Lin

Cross-scale Multi-instance Learning for Pathological Image Diagnosis

Analyzing high resolution whole slide images (WSIs) with regard to information across multiple scales poses a significant challenge in digital pathology. Multi-instance learning (MIL) is a common solution for working with high resolution…

Image and Video Processing · Electrical Eng. & Systems 2024-02-19 Ruining Deng , Can Cui , Lucas W. Remedios , Shunxing Bao , R. Michael Womick , Sophie Chiron , Jia Li , Joseph T. Roland , Ken S. Lau , Qi Liu , Keith T. Wilson , Yaohong Wang , Lori A. Coburn , Bennett A. Landman , Yuankai Huo

Efficiently Disentangling CLIP for Multi-Object Perception

Vision-language models like CLIP excel at recognizing the single, prominent object in a scene. However, they struggle in complex scenes containing multiple objects. We identify a fundamental reason for this limitation: VLM feature space…

Computer Vision and Pattern Recognition · Computer Science 2025-09-26 Samyak Rawlekar , Yujun Cai , Yiwei Wang , Ming-Hsuan Yang , Narendra Ahuja

Improving Feature Generalizability with Multitask Learning in Class Incremental Learning

Many deep learning applications, like keyword spotting, require the incorporation of new concepts (classes) over time, referred to as Class Incremental Learning (CIL). The major challenge in CIL is catastrophic forgetting, i.e., preserving…

Machine Learning · Computer Science 2022-04-28 Dong Ma , Chi Ian Tang , Cecilia Mascolo

SIL: Symbiotic Interactive Learning for Language-Conditioned Human-Agent Co-Adaptation

Today's autonomous agents, largely driven by foundation models (FMs), can understand natural language instructions and solve long-horizon tasks with human-like reasoning. However, current human-robot interaction largely follows a one-way…

Robotics · Computer Science 2026-03-17 Linus Nwankwo , Bjoern Ellensohn , Christian Rauch , Elmar Rueckert