Related papers: CGI: Identifying Conditional Generative Models wit…

You Only Submit One Image to Find the Most Suitable Generative Model

Deep generative models have achieved promising results in image generation, and various generative model hubs, e.g., Hugging Face and Civitai, have been developed that enable model developers to upload models and users to download models.…

Computer Vision and Pattern Recognition · Computer Science 2024-12-18 Zhi Zhou , Lan-Zhe Guo , Peng-Xiao Song , Yu-Feng Li

Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models

The rapid advancement of Text-to-Image(T2I) generative models has enabled the synthesis of high-quality images guided by textual descriptions. Despite this significant progress, these models are often susceptible in generating contents that…

Computer Vision and Pattern Recognition · Computer Science 2024-06-25 Yichen Sun , Zhixuan Chu , Zhan Qin , Kui Ren

Parallel Medical Imaging for Intelligent Medical Image Analysis: Concepts, Methods, and Applications

There has been much progress in data-driven artificial intelligence technology for medical image analysis in the last decades. However, it still remains challenging due to its distinctive complexity of acquiring and annotating image data,…

Computer Vision and Pattern Recognition · Computer Science 2021-06-30 Chao Gou , Tianyu Shen , Wenbo Zheng , Huadan Xue , Hui Yu , Qiang Ji , Zhengyu Jin , Fei-Yue Wang

Meta-probabilistic Modeling

Probabilistic graphical models (PGMs) are widely used to discover latent structure in data, but their success hinges on selecting an appropriate model design. In practice, model specification is difficult and often requires iterative…

Machine Learning · Computer Science 2026-04-08 Kevin Zhang , Yixin Wang

MCGM: Mask Conditional Text-to-Image Generative Model

Recent advancements in generative models have revolutionized the field of artificial intelligence, enabling the creation of highly-realistic and detailed images. In this study, we propose a novel Mask Conditional Text-to-Image Generative…

Computer Vision and Pattern Recognition · Computer Science 2024-10-02 Rami Skaik , Leonardo Rossi , Tomaso Fontanini , Andrea Prati

PIGMIL: Positive Instance Detection via Graph Updating for Multiple Instance Learning

Positive instance detection, especially for these in positive bags (true positive instances, TPIs), plays a key role for multiple instance learning (MIL) arising from a specific classification problem only provided with bag (a set of…

Computer Vision and Pattern Recognition · Computer Science 2016-12-13 Dongkuan Xu , Jia Wu , Wei Zhang , Yingjie Tian

ComfyGI: Automatic Improvement of Image Generation Workflows

Automatic image generation is no longer just of interest to researchers, but also to practitioners. However, current models are sensitive to the settings used and automatic optimization methods often require human involvement. To bridge…

Computer Vision and Pattern Recognition · Computer Science 2024-11-22 Dominik Sobania , Martin Briesch , Franz Rothlauf

CCDM: Continuous Conditional Diffusion Models for Image Generation

Continuous Conditional Generative Modeling (CCGM) estimates high-dimensional data distributions, such as images, conditioned on scalar continuous variables (aka regression labels). While Continuous Conditional Generative Adversarial…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Xin Ding , Yongwei Wang , Kao Zhang , Z. Jane Wang

Conditional Generative Modeling of Stochastic LTI Systems: A Behavioral Approach

This paper presents a data-driven model for Linear Time-Invariant (LTI) stochastic systems by sampling from the conditional probability distribution of future outputs given past input-outputs and future inputs. It operates in a fully…

Optimization and Control · Mathematics 2025-11-27 Jiayun Li , Yilin Mo

PromptMagician: Interactive Prompt Engineering for Text-to-Image Creation

Generative text-to-image models have gained great popularity among the public for their powerful capability to generate high-quality images based on natural language prompts. However, developing effective prompts for desired images can be…

Artificial Intelligence · Computer Science 2023-11-02 Yingchaojie Feng , Xingbo Wang , Kam Kwai Wong , Sijia Wang , Yuhong Lu , Minfeng Zhu , Baicheng Wang , Wei Chen

Conditional Image Generation with Pretrained Generative Model

In recent years, diffusion models have gained popularity for their ability to generate higher-quality images in comparison to GAN models. However, like any other large generative models, these models require a huge amount of data,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Rajesh Shrestha , Bowen Xie

PTMPicker: Facilitating Efficient Pretrained Model Selection for Application Developers

The rapid emergence of pretrained models (PTMs) has attracted significant attention from both Deep Learning (DL) researchers and downstream application developers. However, selecting appropriate PTMs remains challenging because existing…

Software Engineering · Computer Science 2025-12-01 Pei Liu , Terry Zhuo , Jiawei Deng , Zhenchang Xing , Qinghua Lu , Xiaoning Du , Hongyu Zhan

Fast-PGM: Fast Probabilistic Graphical Model Learning and Inference

Probabilistic graphical models (PGMs) serve as a powerful framework for modeling complex systems with uncertainty and extracting valuable insights from data. However, users face challenges when applying PGMs to their problems in terms of…

Machine Learning · Computer Science 2024-05-29 Jiantong Jiang , Zeyi Wen , Peiyu Yang , Atif Mansoor , Ajmal Mian

Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models

Generative models (e.g., GANs, diffusion models) learn the underlying data distribution in an unsupervised manner. However, many applications of interest require sampling from a particular region of the output space or sampling evenly over…

Computer Vision and Pattern Recognition · Computer Science 2022-10-18 Chen Henry Wu , Saman Motamed , Shaunak Srivastava , Fernando De la Torre

Composable Generative Models

Generative modeling has recently seen many exciting developments with the advent of deep generative architectures such as Variational Auto-Encoders (VAE) or Generative Adversarial Networks (GAN). The ability to draw synthetic i.i.d.…

Machine Learning · Computer Science 2021-02-19 Johan Leduc , Nicolas Grislain

PGC: Peak-Guided Calibration for Generalizable AI-Generated Image Detection

The rapid evolution of generative AI, from GANs to modern diffusion models, has resulted in increasingly subtle discriminative clues. These fine-grained signals are often overshadowed by dominant, high-fidelity image content (e.g., the main…

Computer Vision and Pattern Recognition · Computer Science 2026-05-21 Xiaoyu Zhou , Jianwei Fei , Peipeng Yu , Jingchang Xie , Chong Cheng , Zhihua Xia

Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models

The rapidly developing field of large multimodal models (LMMs) has led to the emergence of diverse models with remarkable capabilities. However, existing benchmarks fail to comprehensively, objectively and accurately evaluate whether LMMs…

Artificial Intelligence · Computer Science 2024-12-18 YiFan Zhang , Shanglin Lei , Runqi Qiao , Zhuoma GongQue , Xiaoshuai Song , Guanting Dong , Qiuna Tan , Zhe Wei , Peiqing Yang , Ye Tian , Yadong Xue , Xiaofei Wang , Honggang Zhang

Detecting AI-Generated Images via CLIP

As AI-generated image (AIGI) methods become more powerful and accessible, it has become a critical task to determine if an image is real or AI-generated. Because AIGI lack the signatures of photographs and have their own unique patterns,…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 A. G. Moskowitz , T. Gaona , J. Peterson

MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models

Recent multimodal image generators such as GPT-4o, Gemini 2.0 Flash, and Gemini 2.5 Pro excel at following complex instructions, editing images and maintaining concept consistency. However, they are still evaluated by disjoint toolkits:…

Computer Vision and Pattern Recognition · Computer Science 2025-05-29 Hang Hua , Ziyun Zeng , Yizhi Song , Yunlong Tang , Liu He , Daniel Aliaga , Wei Xiong , Jiebo Luo

Predictive Hypothesis Identification

While statistics focusses on hypothesis testing and on estimating (properties of) the true sampling distribution, in machine learning the performance of learning algorithms on future data is the primary issue. In this paper we bridge the…

Machine Learning · Computer Science 2009-12-30 Marcus Hutter