English
Related papers

Related papers: Structured State-Space Regularization for Generati…

200 papers

With a specific emphasis on control design objectives, achieving accurate system modeling with limited complexity is crucial in parametric system identification. The recently introduced deep structured state-space models (SSM), which…

Machine Learning · Computer Science 2024-03-25 Marco Forgione , Manas Mejari , Dario Piga

In the majority of GAN architectures, the latent space is defined as a set of vectors of given dimensionality. Such representations are not easily interpretable and do not capture spatial information of image content directly. In this work,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Maciej Sypetkowski

We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout as input to the deep network, which is then…

Computer Vision and Pattern Recognition · Computer Science 2019-11-06 Taesung Park , Ming-Yu Liu , Ting-Chun Wang , Jun-Yan Zhu

Quantizing images into discrete representations has been a fundamental problem in unified generative modeling. Predominant approaches learn the discrete representation either in a deterministic manner by selecting the best-matching token or…

Computer Vision and Pattern Recognition · Computer Science 2023-10-17 Jiahui Zhang , Fangneng Zhan , Christian Theobalt , Shijian Lu

Sparsity-regularized synthetic aperture radar (SAR) imaging framework has shown its remarkable performance to generate a feature enhanced high resolution image, in which a sparsity-inducing regularizer is involved by exploiting the sparsity…

Computer Vision and Pattern Recognition · Computer Science 2018-05-09 Biao Hou , Zaidao Wen , Licheng Jiao , Qian Wu

Generating structured textual content requires mechanisms that enforce coherence, stability, and adherence to predefined constraints while maintaining semantic fidelity. Conventional approaches often rely on rule-based heuristics or…

Computation and Language · Computer Science 2025-08-11 Derek Yotheringhay , Beatrix Nightingale , Maximilian Featherstone , Edmund Worthington , Hugo Ashdown

Lossy image compression networks aim to minimize the latent entropy of images while adhering to specific distortion constraints. However, optimizing the neural network can be challenging due to its nature of learning quantized latent…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Yingwen Zhang , Meng Wang , Xihua Sheng , Peilin Chen , Junru Li , Li Zhang , Shiqi Wang

When adopting a model-based formulation, solving inverse problems encountered in multiband imaging requires to define spatial and spectral regularizations. In most of the works of the literature, spectral information is extracted from the…

Image and Video Processing · Electrical Eng. & Systems 2023-07-03 Min Zhao , Nicolas Dobigeon , Jie Chen

Medical image segmentation models are typically optimised with voxel-wise losses that constrain predictions only in the output space. This leaves latent feature representations largely unconstrained, potentially limiting generalisation. We…

Image and Video Processing · Electrical Eng. & Systems 2026-03-02 Puru Vaish , Amin Ranem , Felix Meister , Tobias Heimann , Christoph Brune , Jelmer M. Wolterink

Autoencoders represent an effective approach for computing the underlying factors characterizing datasets of different types. The latent representation of autoencoders have been studied in the context of enabling interpolation between data…

Machine Learning · Computer Science 2020-10-23 Alon Oring , Zohar Yakhini , Yacov Hel-Or

In the past, optimization-based registration models have used spatially-varying regularization to account for deformation variations in different image regions. However, deep learning-based registration models have mostly relied on…

Image and Video Processing · Electrical Eng. & Systems 2023-03-14 Junyu Chen , Yihao Liu , Yufan He , Yong Du

Implicit neural representations (INR) have been recently proposed as deep learning (DL) based solutions for image compression. An image can be compressed by training an INR model with fewer weights than the number of image pixels to map the…

Image and Video Processing · Electrical Eng. & Systems 2022-10-28 Harry Gao , Weijie Gan , Zhixin Sun , Ulugbek S. Kamilov

Image tokenizers map images to sequences of discrete tokens, and are a crucial component of autoregressive transformer-based image generation. The tokens are typically associated with spatial locations in the input image, arranged in raster…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Carlos Esteves , Mohammed Suhail , Ameesh Makadia

We propose a regularization scheme for image reconstruction that leverages the power of deep learning while hinging on classic sparsity-promoting models. Many deep-learning-based models are hard to interpret and cumbersome to analyze…

Image and Video Processing · Electrical Eng. & Systems 2024-07-10 Mehrsa Pourya , Sebastian Neumayer , Michael Unser

Modern Latent Diffusion Models (LDMs) typically operate in low-level Variational Autoencoder (VAE) latent spaces that are primarily optimized for pixel-level reconstruction. To unify vision generation and understanding, a burgeoning trend…

Computer Vision and Pattern Recognition · Computer Science 2025-12-22 Shilong Zhang , He Zhang , Zhifei Zhang , Chongjian Ge , Shuchen Xue , Shaoteng Liu , Mengwei Ren , Soo Ye Kim , Yuqian Zhou , Qing Liu , Daniil Pakhomov , Kai Zhang , Zhe Lin , Ping Luo

The importance of regularization has been well established in image reconstruction -- which is the computational inversion of imaging forward model -- with applications including deconvolution for microscopy, tomographic reconstruction,…

Image and Video Processing · Electrical Eng. & Systems 2021-06-29 Sanjay Viswanath , Manu Ghulyani , Muthuvel Arigovindan

Progress in GANs has enabled the generation of high-resolution photorealistic images of astonishing quality. StyleGANs allow for compelling attribute modification on such images via mathematical operations on the latent style vectors in the…

Computer Vision and Pattern Recognition · Computer Science 2022-08-09 Tejan Karmali , Rishubh Parihar , Susmit Agrawal , Harsh Rangwani , Varun Jampani , Maneesh Singh , R. Venkatesh Babu

The goal of this paper is to provide a system identification-friendly introduction to the Structured State-space Models (SSMs). These models have become recently popular in the machine learning community since, owing to their…

Systems and Control · Electrical Eng. & Systems 2024-05-21 Fabio Bonassi , Carl Andersson , Per Mattsson , Thomas B. Schön

Recent image generation schemes typically capture image distribution in a pre-constructed latent space relying on a frozen image tokenizer. Though the performance of tokenizer plays an essential role to the successful generation, its…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Kai Qiu , Xiang Li , Jason Kuen , Hao Chen , Xiaohao Xu , Jiuxiang Gu , Yinyi Luo , Bhiksha Raj , Zhe Lin , Marios Savvides

RNNs and their variants have been widely adopted for image captioning. In RNNs, the production of a caption is driven by a sequence of latent states. Existing captioning models usually represent latent states as vectors, taking this…

Computer Vision and Pattern Recognition · Computer Science 2018-08-15 Bo Dai , Deming Ye , Dahua Lin
‹ Prev 1 2 3 10 Next ›