Yuanting Fan — Scifaro

When Policy Entropy Constraint Fails: Preserving Diversity in Flow-based RLHF via Perceptual Entropy

RLHF is widely used to align flow-matching text-to-image models with human preferences, but often leads to severe diversity collapse after fine-tuning. In RL, diversity is often assumed to correlate with policy entropy, motivating entropy…

Computer Vision and Pattern Recognition · Computer Science 2026-05-13 Xiaofeng Tan, Jun Liu, Bin-Bin Gao, Yuanting Fan, Xi Jiang, Chengjie Wang, Hongsong Wang, Feng Zheng

Large-Scale Universal Defect Generation: Foundation Models and Datasets

Existing defect/anomaly generation methods often rely on few-shot learning, which overfits to specific defect categories due to the lack of large-scale paired defect editing data. This issue is aggravated by substantial variations in defect…

Computer Vision and Pattern Recognition · Computer Science 2026-04-13 Yuanting Fan, Jun Liu, Bin-Bin Gao, Xiaochen Chen, Yuhuan Lin, Zhewei Dai, Jiawei Zhan, Chengjie Wang

ConsistentRFT: Reducing Visual Hallucinations in Flow-based Reinforcement Fine-Tuning

Reinforcement Fine-Tuning (RFT) on flow-based models is crucial for preference alignment. However, it often introduces visual hallucinations like over-optimized details and semantic misalignment. This work preliminarily explores why visual…

Computer Vision and Pattern Recognition · Computer Science 2026-02-04 Xiaofeng Tan, Jun Liu, Yuanting Fan, Bin-Bin Gao, Xi Jiang, Xiaochen Chen, Jinlong Peng, Chengjie Wang, Hongsong Wang, Feng Zheng

Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection

Few-shot anomaly detection (FSAD) methods identify anomalous regions with few known normal samples. Most existing methods rely on the generalization ability of pre-trained vision-language models (VLMs) to recognize potentially anomalous…

Computer Vision and Pattern Recognition · Computer Science 2025-10-31 Yuanting Fan, Jun Liu, Xiaochen Chen, Bin-Bin Gao, Jian Li, Yong Liu, Jinlong Peng, Chengjie Wang

TSAL: Few-shot Text Segmentation Based on Attribute Learning

Supervised learning has recently developed rapidly in scene text segmentation. However, the lack of high-quality datasets and the high cost of pixel annotation greatly limit its development. Considering the well-performed few-shot…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Chenming Li, Chengxu Liu, Yuanting Fan, Xiao Jin, Xingsong Hou, Xueming Qian

AdaDiffSR: Adaptive Region-aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution

Diffusion models (DMs) have shown promising results on single-image super-resolution and other image-to-image translation tasks. Benefiting from more computational resources and longer inference times, they are able to yield more realistic…

Computer Vision and Pattern Recognition · Computer Science 2024-10-24 Yuanting Fan, Chengxu Liu, Nengzhong Yin, Changlong Gao, Xueming Qian

Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera

Under-display camera (UDC) systems are the foundation of full-screen display devices in which the lens mounts under the display. The pixel array of light-emitting diodes used for display diffracts and attenuates incident light, causing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Chengxu Liu, Xuan Wang, Yuanting Fan, Shuai Li, Xueming Qian