Related papers: PreCall: A Visual Interface for Threshold Optimiza…

ORES: Lowering Barriers with Participatory Machine Learning in Wikipedia

Algorithmic systems---from rule-based bots to machine learning classifiers---have a long history of supporting the essential work of content moderation and other curation work in peer production projects. From counter-vandalism to task…

Human-Computer Interaction · Computer Science 2020-08-21 Aaron Halfaker , R. Stuart Geiger

Keeping Community in the Loop: Understanding Wikipedia Stakeholder Values for Machine Learning-Based Systems

On Wikipedia, sophisticated algorithmic tools are used to assess the quality of edits and take corrective actions. However, algorithms can fail to solve the problems they were designed for if they conflict with the values of communities who…

Human-Computer Interaction · Computer Science 2020-01-15 C. Estelle Smith , Bowen Yu , Anjali Srivastava , Aaron Halfaker , Loren Terveen , Haiyi Zhu

ORES-Inspect: A technology probe for machine learning audits on enwiki

Auditing the machine learning (ML) models used on Wikipedia is important for ensuring that vandalism-detection processes remain fair and effective. However, conducting audits is challenging because stakeholders have diverse priorities and…

Human-Computer Interaction · Computer Science 2024-06-13 Zachary Levonian , Lauren Hagen , Lu Li , Jada Lilleboe , Solvejg Wastvedt , Aaron Halfaker , Loren Terveen

PREVIS -- A Combined Machine Learning and Visual Interpolation Approach for Interactive Reverse Engineering in Assembly Quality Control

We present PREVIS, a visual analytics tool, enhancing machine learning performance analysis in engineering applications. The presented toolchain allows for a direct comparison of regression models. In addition, we provide a methodology to…

Human-Computer Interaction · Computer Science 2022-09-27 Patrick Ruediger , Felix Claus , Viktor Leonhardt , Hans Hagen , Jan C. Aurich , Christoph Garth

InterVLS: Interactive Model Understanding and Improvement with Vision-Language Surrogates

Deep learning models are widely used in critical applications, highlighting the need for pre-deployment model understanding and improvement. Visual concept-based methods, while increasingly used for this purpose, face challenges: (1) most…

Artificial Intelligence · Computer Science 2024-06-27 Jinbin Huang , Wenbin He , Liang Gou , Liu Ren , Chris Bryan

Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations

Advances in text-based image generation and editing have revolutionized content creation, enabling users to create impressive content from imaginative text prompts. However, existing methods are not designed to work well with the…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Tiancheng Shen , Jun Hao Liew , Long Mai , Lu Qi , Jiashi Feng , Jiaya Jia

OARS: Process-Aware Online Alignment for Generative Real-World Image Super-Resolution

Aligning generative real-world image super-resolution models with human visual preference is challenging due to the perception--fidelity trade-off and diverse, unknown degradations. Prior approaches rely on offline preference optimization…

Computer Vision and Pattern Recognition · Computer Science 2026-03-16 Shijie Zhao , Xuanyu Zhang , Bin Chen , Weiqi Li , Qunliang Xing , Kexin Zhang , Yan Wang , Junlin Li , Li Zhang , Jian Zhang , Tianfan Xue

ORES: Open-vocabulary Responsible Visual Synthesis

Avoiding synthesizing specific visual concepts is an essential challenge in responsible visual synthesis. However, the visual concept that needs to be avoided for responsible visual synthesis tends to be diverse, depending on the region,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Minheng Ni , Chenfei Wu , Xiaodong Wang , Shengming Yin , Lijuan Wang , Zicheng Liu , Nan Duan

Enhanced User Interaction in Operating Systems through Machine Learning Language Models

With the large language model showing human-like logical reasoning and understanding ability, whether agents based on the large language model can simulate the interaction behavior of real users, so as to build a reliable virtual…

Information Retrieval · Computer Science 2024-03-05 Chenwei Zhang , Wenran Lu , Chunhe Ni , Hongbo Wang , Jiang Wu

Cross-Modal Adapter for Vision-Language Retrieval

Vision-language retrieval is an important multi-modal learning topic, where the goal is to retrieve the most relevant visual candidate for a given text query. Recently, pre-trained models, e.g., CLIP, show great potential on retrieval…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Haojun Jiang , Jianke Zhang , Rui Huang , Chunjiang Ge , Zanlin Ni , Shiji Song , Gao Huang

Visus: An Interactive System for Automatic Machine Learning Model Building and Curation

While the demand for machine learning (ML) applications is booming, there is a scarcity of data scientists capable of building such models. Automatic machine learning (AutoML) approaches have been proposed that help with this problem by…

Machine Learning · Computer Science 2019-07-08 Aécio Santos , Sonia Castelo , Cristian Felix , Jorge Piazentin Ono , Bowen Yu , Sungsoo Hong , Cláudio T. Silva , Enrico Bertini , Juliana Freire

Visual Persuasion: What Influences Decisions of Vision-Language Models?

The web is littered with images, once created for human consumption and now increasingly interpreted by agents using vision-language models (VLMs). These agents make visual decisions at scale, deciding what to click, recommend, or buy. Yet,…

Computer Vision and Pattern Recognition · Computer Science 2026-02-18 Manuel Cherep , Pranav M R , Pattie Maes , Nikhil Singh

Wikiformer: Pre-training with Structured Information of Wikipedia for Ad-hoc Retrieval

With the development of deep learning and natural language processing techniques, pre-trained language models have been widely used to solve information retrieval (IR) problems. Benefiting from the pre-training and fine-tuning paradigm,…

Information Retrieval · Computer Science 2024-01-02 Weihang Su , Qingyao Ai , Xiangsheng Li , Jia Chen , Yiqun Liu , Xiaolong Wu , Shengluan Hou

CANVAS: A Benchmark for Vision-Language Models on Tool-Based User Interface Design

User interface (UI) design is an iterative process in which designers progressively refine their work with design software such as Figma or Sketch. Recent advances in vision language models (VLMs) with tool invocation suggest these models…

Computer Vision and Pattern Recognition · Computer Science 2025-12-01 Daeheon Jeong , Seoyeon Byun , Kihoon Son , Dae Hyun Kim , Juho Kim

VizML: A Machine Learning Approach to Visualization Recommendation

Data visualization should be accessible for all analysts with data, not just the few with technical expertise. Visualization recommender systems aim to lower the barrier to exploring basic visualizations by automatically generating results…

Human-Computer Interaction · Computer Science 2018-08-16 Kevin Z. Hu , Michiel A. Bakker , Stephen Li , Tim Kraska , César A. Hidalgo

Learning Manipulation by Predicting Interaction

Representation learning approaches for robotic manipulation have boomed in recent years. Due to the scarcity of in-domain robot data, prevailing methodologies tend to leverage large-scale human video datasets to extract generalizable…

Robotics · Computer Science 2024-06-04 Jia Zeng , Qingwen Bu , Bangjun Wang , Wenke Xia , Li Chen , Hao Dong , Haoming Song , Dong Wang , Di Hu , Ping Luo , Heming Cui , Bin Zhao , Xuelong Li , Yu Qiao , Hongyang Li

User-Interactive Machine Learning Model for Identifying Structural Relationships of Code Features

Traditional machine learning based intelligent systems assist users by learning patterns in data and making recommendations. However, these systems are limited in that the user has little means of understanding the rationale behind the…

Human-Computer Interaction · Computer Science 2020-08-26 Ankit Gupta

NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding

Computer-aided medical image analysis is crucial for disease diagnosis and treatment planning, yet limited annotated datasets restrict medical-specific model development. While vision-language models (VLMs) like CLIP offer strong…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Zelin Peng , Yichen Zhao , Yu Huang , Piao Yang , Feilong Tang , Zhengqin Xu , Xiaokang Yang , Wei Shen

Simulation-Based Optimization of User Interfaces for Quality-Assuring Machine Learning Model Predictions

Quality-sensitive applications of machine learning (ML) require quality assurance (QA) by humans before the predictions of an ML model can be deployed. QA for ML (QA4ML) interfaces require users to view a large amount of data and perform…

Human-Computer Interaction · Computer Science 2023-09-01 Yu Zhang , Martijn Tennekes , Tim de Jong , Lyana Curier , Bob Coecke , Min Chen

PREVis: Perceived Readability Evaluation for Visualizations

We developed and validated an instrument to measure the perceived readability in data visualization: PREVis. Researchers and practitioners can easily use this instrument as part of their evaluations to compare the perceived readability of…

Human-Computer Interaction · Computer Science 2024-09-25 Anne-Flore Cabouat , Tingying He , Petra Isenberg , Tobias Isenberg