Related papers: Evaluating Generative Models via One-Dimensional C…

On the Distributed Evaluation of Generative Models

The evaluation of deep generative models has been extensively studied in the centralized setting, where the reference data are drawn from a single probability distribution. On the other hand, several applications of generative models…

Machine Learning · Computer Science 2024-06-12 Zixiao Wang, Farzan Farnia, Zhenghao Lin, Yunheng Shen, Bei Yu

Assessing Generative Models via Precision and Recall

Recent advances in generative modeling have led to an increased interest in the study of statistical divergences as means of model comparison. Commonly used evaluation methods, such as the Frechet Inception Distance (FID), correlate well…

Machine Learning · Statistics 2018-10-30 Mehdi S. M. Sajjadi, Olivier Bachem, Mario Lucic, Olivier Bousquet, Sylvain Gelly

Attribute Based Interpretable Evaluation Metrics for Generative Models

When the training dataset comprises a 1:1 proportion of dogs to cats, a generative model that produces 1:1 dogs and cats better resembles the training species distribution than another model with 3:1 dogs and cats. Can we capture this…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Dongkyun Kim, Mingi Kwon, Youngjung Uh

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Scaling up autoregressive models in vision has not proven as beneficial as in large language models. In this work, we investigate this scaling problem in the context of text-to-image generation, focusing on two critical factors: whether…

Computer Vision and Pattern Recognition · Computer Science 2024-10-18 Lijie Fan, Tianhong Li, Siyang Qin, Yuanzhen Li, Chen Sun, Michael Rubinstein, Deqing Sun, Kaiming He, Yonglong Tian

Rethinking FID: Towards a Better Evaluation Metric for Image Generation

As with many machine learning problems, the progress of image generation methods hinges on good evaluation metrics. One of the most popular is the Frechet Inception Distance (FID). FID estimates the distance between a distribution of…

Computer Vision and Pattern Recognition · Computer Science 2024-01-29 Sadeep Jayasumana, Srikumar Ramalingam, Andreas Veit, Daniel Glasner, Ayan Chakrabarti, Sanjiv Kumar

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

We systematically study a wide variety of generative models spanning semantically-diverse image datasets to understand and improve the feature extractors and metrics used to evaluate them. Using best practices in psychophysics, we measure…

Machine Learning · Computer Science 2023-12-06 George Stein, Jesse C. Cresswell, Rasa Hosseinzadeh, Yi Sui, Brendan Leigh Ross, Valentin Villecroze, Zhaoyan Liu, Anthony L. Caterini, J. Eric T. Taylor, Gabriel Loaiza-Ganem

Barcode Method for Generative Model Evaluation driven by Topological Data Analysis

Evaluating the performance of generative models in image synthesis is a challenging task. Although the Fréchet Inception Distance is a widely accepted evaluation metric, it integrates different aspects (e.g., fidelity and diversity) of…

Computer Vision and Pattern Recognition · Computer Science 2021-06-07 Ryoungwoo Jang, Minjee Kim, Da-in Eun, Kyungjin Cho, Jiyeon Seo, Namkug Kim

Vision2Code: A Multi-Domain Benchmark for Evaluating Image-to-Code Generation

Image-to-code generation tests whether a vision-language model (VLM) can recover the structure of an image enough to express it as executable code. Existing benchmarks either focus on narrow visual domains, depend on paired executable…

Computer Vision and Pattern Recognition · Computer Science 2026-05-13 Ajay Vikram Periasami, Junlin Wang, Bhuwan Dhingra

Variational Masked Diffusion Models

Masked diffusion models have recently emerged as a flexible framework for discrete generative modeling. However, a key limitation of standard masked diffusion is its inability to effectively capture dependencies among tokens that are…

Machine Learning · Computer Science 2025-10-28 Yichi Zhang, Alex Schwing, Zhizhen Zhao

A Characteristic Function Approach to Deep Implicit Generative Modeling

Implicit Generative Models (IGMs) such as GANs have emerged as effective data-driven models for generating samples, particularly images. In this paper, we formulate the problem of learning an IGM as minimizing the expected distance between…

Machine Learning · Computer Science 2020-06-18 Abdul Fatir Ansari, Jonathan Scarlett, Harold Soh

Measuring Style Similarity in Diffusion Models

Generative models are now widely used by graphic designers and artists. Prior works have shown that these models remember and often replicate content from their training data during generation. Hence as their proliferation increases, it has…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Gowthami Somepalli, Anubhav Gupta, Kamal Gupta, Shramay Palta, Micah Goldblum, Jonas Geiping, Abhinav Shrivastava, Tom Goldstein

Reliable Fidelity and Diversity Metrics for Generative Models

Devising indicative evaluation metrics for the image generation task remains an open problem. The most widely used metric for measuring the similarity between real and generated images has been the Fréchet Inception Distance (FID) score.…

Computer Vision and Pattern Recognition · Computer Science 2020-06-30 Muhammad Ferjad Naeem, Seong Joon Oh, Youngjung Uh, Yunjey Choi, Jaejun Yoo

Feature Likelihood Divergence: Evaluating the Generalization of Generative Models Using Samples

The past few years have seen impressive progress in the development of deep generative models capable of producing high-dimensional, complex, and photo-realistic data. However, current methods for evaluating such models remain incomplete:…

Machine Learning · Computer Science 2024-03-14 Marco Jiralerspong, Avishek Joey Bose, Ian Gemp, Chongli Qin, Yoram Bachrach, Gauthier Gidel

Distribution-Conditional Generation: From Class Distribution to Creative Generation

Text-to-image (T2I) diffusion models are effective at producing semantically aligned images, but their reliance on training data distributions limits their ability to synthesize truly novel, out-of-distribution concepts. Existing methods…

Computer Vision and Pattern Recognition · Computer Science 2025-05-07 Fu Feng, Yucheng Xie, Xu Yang, Jing Wang, Xin Geng

Image Generation Diversity Issues and How to Tame Them

Generative methods now produce outputs nearly indistinguishable from real data but often fail to fully capture the data distribution. Unlike quality issues, diversity limitations in generative models are hard to detect visually, requiring…

Computer Vision and Pattern Recognition · Computer Science 2024-12-13 Mischa Dombrowski, Weitong Zhang, Sarah Cechnicka, Hadrien Reynaud, Bernhard Kainz

Unleashing Text-to-Image Diffusion Models for Visual Perception

Diffusion models (DMs) have become the new trend of generative models and have demonstrated a powerful ability of conditional synthesis. Among those, text-to-image diffusion models pre-trained on large-scale image-text pairs are highly…

Computer Vision and Pattern Recognition · Computer Science 2023-03-06 Wenliang Zhao, Yongming Rao, Zuyan Liu, Benlin Liu, Jie Zhou, Jiwen Lu

Diffusion Models Need Visual Priors for Image Generation

Conventional class-guided diffusion models generally succeed in generating images with correct semantic content, but often struggle with texture details. This limitation stems from the usage of class priors, which only provide coarse and…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Xiaoyu Yue, Zidong Wang, Zeyu Lu, Shuyang Sun, Meng Wei, Wanli Ouyang, Lei Bai, Luping Zhou

FFAD: A Novel Metric for Assessing Generated Time Series Data Utilizing Fourier Transform and Auto-encoder

The success of deep learning-based generative models in producing realistic images, videos, and audios has led to a crucial consideration: how to effectively assess the quality of synthetic samples. While the Fréchet Inception Distance…

Machine Learning · Computer Science 2024-03-12 Yang Chen, Dustin J. Kempton, Rafal A. Angryk

Gram-MMD: A Texture-Aware Metric for Image Realism Assessment

Evaluating the realism of generated images remains a fundamental challenge in generative modeling. Existing distributional metrics such as the Frechet Inception Distance (FID) and CLIP-MMD (CMMD) compare feature distributions at a semantic…

Computer Vision and Pattern Recognition · Computer Science 2026-04-06 Joé Napolitano, Pascal Nguyen

Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens

Visual generation with discrete tokens has gained significant attention as it enables a unified token prediction paradigm shared with language models, promising seamless multimodal architectures. However, current discrete generation methods…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Yuqing Wang, Chuofan Ma, Zhijie Lin, Yao Teng, Lijun Yu, Shuai Wang, Jiaming Han, Jiashi Feng, Yi Jiang, Xihui Liu