Related papers: Align-Deform-Subtract: An Interventional Framework…

Functional Change Point Detection via Adjacent Deviation Subspace

This paper develops the concept of the Adjacent Deviation Subspace (ADS), a novel framework for reducing infinite-dimensional functional data into finite-dimensional vector or scalar representations while preserving critical information of…

Methodology · Statistics 2025-06-19 Luoyao Yu, Long Feng, Xuehu Zhu

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search

Recent works have shown that a rich set of semantic directions exist in the latent space of Generative Adversarial Networks (GANs), which enables various facial attribute editing applications. However, existing methods may suffer poor…

Computer Vision and Pattern Recognition · Computer Science 2021-05-28 Yuxuan Han, Jiaolong Yang, Ying Fu

Unsupervised Part-Based Disentangling of Object Shape and Appearance

Large intra-class variation is the result of changes in multiple object characteristics. Images, however, only show the superposition of different variable factors such as appearance or shape. Therefore, learning to disentangle and…

Computer Vision and Pattern Recognition · Computer Science 2019-06-18 Dominik Lorenz, Leonard Bereska, Timo Milbich, Björn Ommer

Explicitly Disentangled Representations in Object-Centric Learning

Extracting structured representations from raw visual data is an important and long-standing challenge in machine learning. Recently, techniques for unsupervised learning of object-centric representations have raised growing interest. In…

Computer Vision and Pattern Recognition · Computer Science 2025-01-24 Riccardo Majellaro, Jonathan Collu, Aske Plaat, Thomas M. Moerland

Updated version: A Video Anomaly Detection Framework based on Appearance-Motion Semantics Representation Consistency

Video anomaly detection is an essential but challenging task. The prevalent methods mainly investigate the reconstruction difference between normal and abnormal patterns but ignore the semantics consistency between appearance and motion…

Computer Vision and Pattern Recognition · Computer Science 2023-03-10 Xiangyu Huang, Caidan Zhao, Zhiqiang Wu

Instance-Invariant Domain Adaptive Object Detection via Progressive Disentanglement

Most state-of-the-art methods of object detection suffer from poor generalization ability when the training and test data are from different domains, e.g., with different styles. To address this problem, previous methods mainly use holistic…

Computer Vision and Pattern Recognition · Computer Science 2021-02-16 Aming Wu, Yahong Han, Linchao Zhu, Yi Yang

Axis-Aligned Document Dewarping

Document dewarping is crucial for many applications. However, existing learning-based methods rely heavily on supervised regression with annotated data without fully leveraging the inherent geometric properties of physical documents. Our…

Computer Vision and Pattern Recognition · Computer Science 2025-11-17 Chaoyun Wang, I-Chao Shen, Takeo Igarashi, Caigui Jiang

Multiscale Adaptive Representation of Signals: I. The Basic Framework

We introduce a framework for designing multi-scale, adaptive, shift-invariant frames and bi-frames for representing signals. The new framework, called AdaFrame, improves over dictionary learning-based techniques in terms of computational…

Computer Vision and Pattern Recognition · Computer Science 2015-07-20 Cheng Tai, Weinan E

Learning Debiased Representation via Disentangled Feature Augmentation

Image classification models tend to make decisions based on peripheral attributes of data items that have strong correlation with a target variable (i.e., dataset bias). These biased models suffer from the poor generalization capability…

Machine Learning · Computer Science 2021-10-26 Jungsoo Lee, Eungyeup Kim, Juyoung Lee, Jihyeon Lee, Jaegul Choo

Detect-and-describe: Joint learning framework for detection and description of objects

Traditional object detection answers two questions: "what" (what is the object?) and "where" (where is the object?). The "what" part of object detection can be refined further, i.e., "what type", "what shape", "what material", etc.…

Computer Vision and Pattern Recognition · Computer Science 2022-04-20 Addel Zafar, Umar Khalid

Enhancing Deformable Object Manipulation By Using Interactive Perception and Assistive Tools

In the field of robotic manipulation, the proficiency of deformable object manipulation lags behind human capabilities due to the inherent characteristics of deformable objects. These objects have infinite degrees of freedom, resulting in…

Robotics · Computer Science 2023-11-17 Peng Zhou

Understanding Deformable Alignment in Video Super-Resolution

Deformable convolution, originally proposed for the adaptation to geometric variations of objects, has recently shown compelling performance in aligning multiple frames and is increasingly adopted for video super-resolution. Despite its…

Computer Vision and Pattern Recognition · Computer Science 2020-09-16 Kelvin C. K. Chan, Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy

Disentangle Object and Non-object Infrared Features via Language Guidance

Infrared object detection focuses on identifying and locating objects in complex environments (e.g., dark, snow, and rain) where visible imaging cameras are disabled by poor illumination. However, due to low contrast and weak edge…

Computer Vision and Pattern Recognition · Computer Science 2026-01-15 Fan Liu, Ting Wu, Chuanyi Zhang, Liang Yao, Xing Ma, Yuhui Zheng

A Video Anomaly Detection Framework based on Appearance-Motion Semantics Representation Consistency

Video anomaly detection refers to the identification of events that deviate from the expected behavior. Due to the lack of anomalous samples in training, video anomaly detection becomes a very challenging task. Existing methods almost…

Computer Vision and Pattern Recognition · Computer Science 2022-04-11 Xiangyu Huang, Caidan Zhao, Yilin Wang, Zhiqiang Wu

Attribute Descent: Simulating Object-Centric Datasets on the Content Level and Beyond

This article aims to use graphic engines to simulate a large amount of training data that has free annotations and may strongly resemble real-world data. Between synthetic and real, a two-level domain gap exists, involving content…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Yue Yao, Liang Zheng, Xiaodong Yang, Milind Napthade, Tom Gedeon

Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers

Recent work has shown that object-centric representations can greatly help improve the accuracy of learning dynamics while also bringing interpretability. In this work, we take this idea one step further and ask the following question: "can…

Computer Vision and Pattern Recognition · Computer Science 2024-07-04 Sanket Gandhi, Atul, Samanyu Mahajan, Vishal Sharma, Rushil Gupta, Arnab Kumar Mondal, Parag Singla

ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation

Manipulating volumetric deformable objects in the real world, like plush toys and pizza dough, brings substantial challenges due to infinite shape variations, non-rigid motions, and partial observability. We introduce ACID, an…

Computer Vision and Pattern Recognition · Computer Science 2022-08-09 Bokui Shen, Zhenyu Jiang, Christopher Choy, Leonidas J. Guibas, Silvio Savarese, Anima Anandkumar, Yuke Zhu

DDAVS: Disentangled Audio Semantics and Delayed Bidirectional Alignment for Audio-Visual Segmentation

Audio-Visual Segmentation (AVS) aims to localize sound-producing objects at the pixel level by jointly leveraging auditory and visual information. However, existing methods often suffer from multi-source entanglement and audio-visual…

Computer Vision and Pattern Recognition · Computer Science 2025-12-24 Jingqi Tian, Yiheng Du, Haoji Zhang, Yuji Wang, Isaac Ning Lee, Xulong Bai, Tianrui Zhu, Jingxuan Niu, Yansong Tang

Align, Perturb and Decouple: Toward Better Leverage of Difference Information for RSI Change Detection

Change detection is a widely adopted technique in remote sensing imagery (RSI) analysis for the discovery of long-term geomorphic evolution. To highlight the areas of semantic changes, previous efforts mostly pay attention to learning…

Computer Vision and Pattern Recognition · Computer Science 2023-05-31 Supeng Wang, Yuxi Li, Ming Xie, Mingmin Chi, Yabiao Wang, Chengjie Wang, Wenbing Zhu

Unsupervised Part Segmentation through Disentangling Appearance and Shape

We study the problem of unsupervised discovery and segmentation of object parts, which, as an intermediate local representation, are capable of finding intrinsic object structure and providing more explainable recognition results. Recent…

Computer Vision and Pattern Recognition · Computer Science 2021-05-27 Shilong Liu, Lei Zhang, Xiao Yang, Hang Su, Jun Zhu