English
Related papers

Related papers: HiBug2: Efficient and Interpretable Error Slice Di…

200 papers

Machine learning (ML) models that achieve high average accuracy can still underperform on semantically coherent subsets ("slices") of data. This behavior can have significant societal consequences for the safety or bias of the model in…

Human-Computer Interaction · Computer Science 2024-02-12 Nari Johnson , Ángel Alexander Cabrera , Gregory Plumb , Ameet Talwalkar

As machine learning systems become democratized, it becomes increasingly important to help users easily debug their models. However, current data tools are still primitive when it comes to helping users trace model performance problems all…

Databases · Computer Science 2019-01-08 Yeounoh Chung , Tim Kraska , Neoklis Polyzotis , Ki Hyun Tae , Steven Euijong Whang

Machine learning models fit complex algorithms to arbitrarily large datasets. These algorithms are well-known to be high on performance and low on interpretability. We use interactive visualization of slices of predictor space to address…

Machine Learning · Statistics 2021-09-08 Catherine B. Hurley , Mark O'Connell , Katarina Domijan

Systematic failures of computer vision models on subsets with coherent visual patterns, known as error slices, pose a critical challenge for robust model evaluation. Existing slice discovery methods are primarily developed for image…

Computer Vision and Pattern Recognition · Computer Science 2026-01-01 Wei Zhang , Chaoqun Wang , Zixuan Guan , Sam Kao , Pengfei Zhao , Peng Wu , Sifeng He

We present a method for identifying groups of test examples -- slices -- on which a model under-performs, a task now known as slice discovery. We formalize coherence -- a requirement that erroneous predictions, within a slice, should be…

Machine Learning · Computer Science 2023-12-11 Fulton Wang , Julius Adebayo , Sarah Tan , Diego Garcia-Olano , Narine Kokhlikyan

We present deepflash2, a deep learning solution that facilitates the objective and reliable segmentation of ambiguous bioimages through multi-expert annotations and integrated quality assurance. Thereby, deepflash2 addresses typical…

Quantitative Methods · Quantitative Biology 2021-11-15 Matthias Griebel , Dennis Segebarth , Nikolai Stein , Nina Schukraft , Philip Tovote , Robert Blum , Christoph M. Flath

Machine learning models make mistakes, yet sometimes it is difficult to identify the systematic problems behind the mistakes. Practitioners engage in various activities, including error analysis, testing, auditing, and red-teaming, to form…

Software Engineering · Computer Science 2024-09-17 Chenyang Yang , Yining Hong , Grace A. Lewis , Tongshuang Wu , Christian Kästner

Writing correct distributed programs is hard. In spite of extensive testing and debugging, software faults persist even in commercial grade software. Many distributed systems, especially those employed in safety-critical environments,…

Distributed, Parallel, and Cluster Computing · Computer Science 2007-05-23 Neeraj Mittal , Vijay K. Garg

Error slice discovery is crucial to diagnose and mitigate model errors. Current clustering or discrete attribute-based slice discovery methods face key limitations: 1) clustering results in incoherent slices, while assigning discrete…

Computation and Language · Computer Science 2025-06-02 Shantanu Ghosh , Rayan Syed , Chenyu Wang , Vaibhav Choudhary , Binxu Li , Clare B. Poynton , Shyam Visweswaran , Kayhan Batmanghelich

Despite strong average-case performance, deep learning models often exhibit systematic errors on specific population groups, known as error slices. Identifying these groups and the root causes of their failures is critical for model…

Machine Learning · Computer Science 2026-05-29 Yael Konforti , Mateo Espinosa Zarlenga , Elaf Almahmoud , Mateja Jamnik

Automated slicing aims to identify subsets of evaluation data where a trained model performs anomalously. This is an important problem for machine learning pipelines in production since it plays a key role in model debugging and comparison,…

Machine Learning · Computer Science 2022-12-20 Zifan Liu , Evan Rosen , Paul Suganthan G. C

In real-world machine learning applications, data subsets correspond to especially critical outcomes: vulnerable cyclist detections are safety-critical in an autonomous driving task, and "question" sentences might be important to a dialogue…

Machine Learning · Computer Science 2020-03-03 Vincent S. Chen , Sen Wu , Zhenzhen Weng , Alexander Ratner , Christopher Ré

Large Language Models (LLMs) often exhibit systematic errors on specific subsets of data, known as error slices. For instance, a slice can correspond to a certain demographic, where a model does poorly in identifying toxic comments…

Machine Learning · Computer Science 2025-11-27 Minhui Zhang , Prahar Ijner , Yoav Wald , Elliot Creager

Machine learning models that achieve high overall accuracy often make systematic errors on important subsets (or slices) of data. Identifying underperforming slices is particularly challenging when working with high-dimensional inputs (e.g.…

Despite the great performance of deep learning models in many areas, they still make mistakes and underperform on certain subsets of data, i.e. error slices. Given a trained model, it is important to identify its semantically coherent error…

Machine Learning · Computer Science 2025-12-23 Han Yu , Hao Zou , Jiashuo Liu , Renzhe Xu , Yue He , Xingxuan Zhang , Peng Cui

Deep neural networks are increasingly employed in high-stakes medical applications, despite their tendency for shortcut learning in the presence of spurious correlations, which can have potentially fatal consequences in practice. Whereas a…

Artificial Intelligence · Computer Science 2025-07-30 Frederik Pahde , Thomas Wiegand , Sebastian Lapuschkin , Wojciech Samek

This paper contributes to interpretable machine learning via visual knowledge discovery in general line coordinates (GLC). The concepts of hyperblocks as interpretable dataset units and general line coordinates are combined to create a…

Machine Learning · Computer Science 2022-05-10 Charles Recaido , Boris Kovalerchuk

Variability models (e.g., feature models) are a common way for the representation of variabilities and commonalities of software artifacts. Such models can be translated to a logical representation and thus allow different operations for…

Software Engineering · Computer Science 2021-02-12 Viet-Man Le , Alexander Felfernig , Mathias Uta , David Benavides , José Galindo , Thi Ngoc Trang Tran

Biclustering is an unsupervised machine-learning approach aiming to cluster rows and columns simultaneously in a data matrix. Several biclustering algorithms have been proposed for handling numeric datasets. However, real-world data mining…

Machine Learning · Computer Science 2024-08-26 Adán José-García , Julie Jacques , Clément Chauvet , Vincent Sobanski , Clarisse Dhaenens

Deep learning models have been used to support analytics beyond simple aggregation, where deeper and wider models have been shown to yield great results. These models consume a huge amount of memory and computational operations. However,…

Machine Learning · Computer Science 2021-04-22 Shaofeng Cai , Gang Chen , Beng Chin Ooi , Jinyang Gao
‹ Prev 1 2 3 10 Next ›