Related papers: Boostlet.js: Image processing plugins for the web …

VideoBooth: Diffusion-based Video Generation with Image Prompts

Text-driven video generation witnesses rapid progress. However, merely using text prompts is not enough to depict the desired subject appearance that accurately aligns with users' intents, especially for customized content creation. In this…

Computer Vision and Pattern Recognition · Computer Science 2023-12-04 Yuming Jiang, Tianxing Wu, Shuai Yang, Chenyang Si, Dahua Lin, Yu Qiao, Chen Change Loy, Ziwei Liu

LayoutBERT: Masked Language Layout Model for Object Insertion

Image compositing is one of the most fundamental steps in creative workflows. It involves taking objects/parts of several images to create a new image, called a composite. Currently, this process is done manually by creating accurate masks…

Computer Vision and Pattern Recognition · Computer Science 2022-05-03 Kerem Turgutlu, Sanat Sharma, Jayant Kumar

Insert Anything: Image Insertion via In-Context Editing in DiT

This work presents Insert Anything, a unified framework for reference-based image insertion that seamlessly integrates objects from reference images into target scenes under flexible, user-specified control guidance. Instead of training…

Computer Vision and Pattern Recognition · Computer Science 2025-04-22 Wensong Song, Hong Jiang, Zongxing Yang, Ruijie Quan, Yi Yang

PhotoScout: Synthesis-Powered Multi-Modal Image Search

Due to the availability of increasingly large amounts of visual data, there is a growing need for tools that can help users find relevant images. While existing tools can perform image retrieval based on similarity or metadata, they fall…

Human-Computer Interaction · Computer Science 2024-01-22 Celeste Barnaby, Qiaochu Chen, Chenglong Wang, Isil Dillig

FastBlend: a Powerful Model-Free Toolkit Making Video Stylization Easier

With the emergence of diffusion models and rapid development in image processing, it has become effortless to generate fancy images in tasks such as style transfer and image editing. However, these impressive image processing approaches…

Computer Vision and Pattern Recognition · Computer Science 2023-11-17 Zhongjie Duan, Chengyu Wang, Cen Chen, Weining Qian, Jun Huang, Mingyi Jin

Text images processing system using artificial intelligence models

This work presents a text image classifier device that identifies textual content in images and then categorizes each image into one of four predefined categories: Invoice, Form, Letter, or Report. The device supports a gallery…

Computer Vision and Pattern Recognition · Computer Science 2025-12-15 Aya Kaysan Bahjat

FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network

We introduce a lightweight network to improve descriptors of keypoints within the same image. The network takes the original descriptors and the geometric properties of keypoints as the input, and uses an MLP-based self-boosting stage and a…

Computer Vision and Pattern Recognition · Computer Science 2023-03-29 Xinjiang Wang, Zeyu Liu, Yu Hu, Wei Xi, Wenxian Yu, Danping Zou

FusionBooster: A Unified Image Fusion Boosting Paradigm

In recent years, numerous ideas have emerged for designing a mutually reinforcing mechanism or extra stages for the image fusion task, ignoring the inevitable gaps between different vision tasks and the computational burden. We argue that…

Computer Vision and Pattern Recognition · Computer Science 2024-02-09 Chunyang Cheng, Tianyang Xu, Xiao-Jun Wu, Hui Li, Xi Li, Josef Kittler

Generative Photomontage

Text-to-image models are powerful tools for image creation. However, the generation process is akin to a dice roll and makes it difficult to achieve a single image that captures everything a user wants. In this paper, we propose a framework…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Sean J. Liu, Nupur Kumari, Ariel Shamir, Jun-Yan Zhu

A new Contrast Based Image Fusion using Wavelet Packets

Image fusion is a technique that combines complementary information from different images of the same scene so that the fused image is more suitable for segmentation, feature extraction, object recognition, and the human visual system. In this…

Information Theory · Computer Science 2008-12-04 R. Balasubramanian, Gaurav Bhatnagar

Deep Boosting: Joint Feature Selection and Analysis Dictionary Learning in Hierarchy

This work investigates how the traditional image classification pipelines can be extended into a deep architecture, inspired by recent successes of deep neural networks. We propose a deep boosting framework based on layer-by-layer joint…

Computer Vision and Pattern Recognition · Computer Science 2015-08-12 Zhanglin Peng, Ya Li, Zhaoquan Cai, Liang Lin

Facelet-Bank for Fast Portrait Manipulation

Digital face manipulation has become a popular and fascinating way to touch images with the prevalence of smartphones and social networks. With a wide variety of user preferences, facial expressions, and accessories, a general and flexible…

Computer Vision and Pattern Recognition · Computer Science 2018-04-02 Ying-Cong Chen, Huaijia Lin, Michelle Shu, Ruiyu Li, Xin Tao, Yangang Ye, Xiaoyong Shen, Jiaya Jia

TextBoost: Boosting Text Encoder for Personalized Text-to-Image Generation

In this paper, we introduce TextBoost, an efficient one-shot personalization approach for text-to-image diffusion models. Traditional personalization methods typically involve fine-tuning extensive portions of the model, leading to…

Computer Vision and Pattern Recognition · Computer Science 2026-05-20 NaHyeon Park, Kunhee Kim, Hyunjung Shim

Counterpoint: Orchestrating Large-Scale Custom Animated Visualizations

Custom animated visualizations of large, complex datasets are helpful across many domains, but they are hard to develop. Much of the difficulty arises from maintaining visualization state across many animated graphical elements that may…

Graphics · Computer Science 2024-10-10 Venkatesh Sivaraman, Frank Elavsky, Dominik Moritz, Adam Perer

ImageLab: Simplifying Image Processing Exploration for Novices and Experts Alike

Image processing holds immense potential for societal benefit, yet its full potential is often accessible only to tech-savvy experts. Bridging this knowledge gap and providing accessible tools for users of all backgrounds remains an…

Computer Vision and Pattern Recognition · Computer Science 2024-01-09 Sahan Dissanayaka, Oshan Mudanayaka, Thilina Halloluwa, Chameera De Silva

Curriculum Dataset Distillation

Most dataset distillation methods struggle to accommodate large-scale datasets due to their substantial computational and memory requirements. Recent research has begun to explore scalable disentanglement methods. However, there are still…

Computer Vision and Pattern Recognition · Computer Science 2025-07-14 Zhiheng Ma, Anjia Cao, Funing Yang, Yihong Gong, Xing Wei

Real-Time Image Analysis Software Suitable for Resource-Constrained Computing

Methods: We have developed a software suite (DataSet Tracker) for real-time analysis designed to run on computers, smartphones, and smart glasses hardware and suitable for resource-constrained, on-the-fly computing in microscopes without…

Quantitative Methods · Quantitative Biology 2025-08-13 Alexandre Matov

Browserbite: Cross-Browser Testing via Image Processing

Cross-browser compatibility testing is concerned with identifying perceptible differences in the way a Web page is rendered across different browsers or configurations thereof. Existing automated cross-browser compatibility testing methods…

Software Engineering · Computer Science 2015-03-12 Tõnis Saar, Marlon Dumas, Marti Kaljuve, Nataliia Semenenko

Multimodal Dataset Distillation Made Simple by Prototype-Guided Data Synthesis

Recent advances in multimodal learning have achieved remarkable success across diverse vision-language tasks. However, such progress heavily relies on large-scale image-text datasets, making training costly and inefficient. Prior efforts in…

Computer Vision and Pattern Recognition · Computer Science 2026-03-02 Junhyeok Choi, Sangwoo Mo, Minwoo Chae

PhotoBot: Reference-Guided Interactive Photography via Natural Language

We introduce PhotoBot, a framework for fully automated photo acquisition based on an interplay between high-level human language guidance and a robot photographer. We propose to communicate photography suggestions to the user via reference…

Computer Vision and Pattern Recognition · Computer Science 2024-12-30 Oliver Limoyo, Jimmy Li, Dmitriy Rivkin, Jonathan Kelly, Gregory Dudek