Related papers: Inferring Generative Model Structure with Static A…

Learning the Structure of Generative Models without Labeled Data

Curating labeled training data has become the primary bottleneck in machine learning. Recent frameworks address this bottleneck with generative models to synthesize labels at scale from weak supervision sources. The generative model's…

Machine Learning · Computer Science 2017-09-12 Stephen H. Bach , Bryan He , Alexander Ratner , Christopher Ré

Socratic Learning: Augmenting Generative Models to Incorporate Latent Subsets in Training Data

A challenge in training discriminative models like neural networks is obtaining enough labeled training data. Recent approaches use generative models to combine weak supervision sources, like user-defined heuristics or knowledge bases, to…

Machine Learning · Computer Science 2017-09-29 Paroma Varma , Bryan He , Dan Iter , Peng Xu , Rose Yu , Christopher De Sa , Christopher Ré

Learning Dependency Structures for Weak Supervision Models

Labeling training data is a key bottleneck in the modern machine learning pipeline. Recent weak supervision approaches combine labels from multiple noisy sources by estimating their accuracies without access to ground truth labels; however,…

Machine Learning · Statistics 2019-03-15 Paroma Varma , Frederic Sala , Ann He , Alexander Ratner , Christopher Ré

Generative Modeling Helps Weak Supervision (and Vice Versa)

Many promising applications of supervised machine learning face hurdles in the acquisition of labeled data in sufficient quantity and quality, creating an expensive bottleneck. To overcome such limitations, techniques that do not depend on…

Machine Learning · Computer Science 2023-03-14 Benedikt Boecking , Nicholas Roberts , Willie Neiswanger , Stefano Ermon , Frederic Sala , Artur Dubrawski

Towards Robust Classification Model by Counterfactual and Invariant Data Generation

Despite the success of machine learning applications in science, industry, and society in general, many approaches are known to be non-robust, often relying on spurious correlations to make predictions. Spuriousness occurs when some…

Computer Vision and Pattern Recognition · Computer Science 2021-06-04 Chun-Hao Chang , George Alexandru Adam , Anna Goldenberg

CORAL: COde RepresentAtion Learning with Weakly-Supervised Transformers for Analyzing Data Analysis

Large scale analysis of source code, and in particular scientific source code, holds the promise of better understanding the data science process, identifying analytical best practices, and providing insights to the builders of scientific…

Machine Learning · Computer Science 2020-09-01 Ge Zhang , Mike A. Merrill , Yang Liu , Jeffrey Heer , Tim Althoff

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Spurious correlations threaten the validity of statistical classifiers. While model accuracy may appear high when the test data is from the same distribution as the training data, it can quickly degrade when the test distribution changes.…

Machine Learning · Computer Science 2020-12-21 Zhao Wang , Aron Culotta

Fighting Spurious Correlations in Text Classification via a Causal Learning Perspective

In text classification tasks, models often rely on spurious correlations for predictions, incorrectly associating irrelevant features with the target labels. This issue limits the robustness and generalization of models, especially when…

Machine Learning · Computer Science 2025-02-04 Yuqing Zhou , Ziwei Zhu

Deep Learning and Hierarchal Generative Models

It is argued that deep learning is efficient for data that is generated from hierarchal generative models. Examples of such generative models include wavelet scattering networks, functions of compositional structure, and deep rendering…

Machine Learning · Computer Science 2018-09-06 Elchanan Mossel

Generative Inverse Design of Crystal Structures via Diffusion Models with Transformers

Recent advances in deep learning have enabled the generation of realistic data by training generative models on large datasets of text, images, and audio. While these models have demonstrated exceptional performance in generating novel and…

Materials Science · Physics 2024-06-17 Izumi Takahara , Kiyou Shibata , Teruyasu Mizoguchi

Collaborative Cognitive Diagnosis with Disentangled Representation Learning for Learner Modeling

Learners sharing similar implicit cognitive states often display comparable observable problem-solving performances. Leveraging collaborative connections among such similar learners proves valuable in comprehending human learning. Motivated…

Machine Learning · Computer Science 2024-11-12 Weibo Gao , Qi Liu , Linan Yue , Fangzhou Yao , Hao Wang , Yin Gu , Zheng Zhang

Robust conditional GANs under missing or uncertain labels

Matching the performance of conditional Generative Adversarial Networks with little supervision is an important task, especially in venturing into new domains. We design a new training algorithm, which is robust to missing or ambiguous…

Machine Learning · Statistics 2019-06-11 Kiran Koshy Thekumparampil , Sewoong Oh , Ashish Khetan

CORAL: Learning Consistent Representations across Multi-step Training with Lighter Speculative Drafter

Speculative decoding is a powerful technique that accelerates Large Language Model (LLM) inference by leveraging a lightweight speculative draft model. However, existing designs suffers in performance due to misalignment between training…

Computation and Language · Computer Science 2025-05-27 Yepeng Weng , Dianwen Mei , Huishi Qiu , Xujie Chen , Li Liu , Jiang Tian , Zhongchao Shi

Weak Supervision with Incremental Source Accuracy Estimation

Motivated by the desire to generate labels for real-time data we develop a method to estimate the dependency structure and accuracy of weak supervision sources incrementally. Our method first estimates the dependency structure associated…

Machine Learning · Computer Science 2022-05-12 Richard Gresham Correro

Label-Retrieval-Augmented Diffusion Models for Learning from Noisy Labels

Learning from noisy labels is an important and long-standing problem in machine learning for real applications. One of the main research lines focuses on learning a label corrector to purify potential noisy labels. However, these methods…

Machine Learning · Computer Science 2023-12-05 Jian Chen , Ruiyi Zhang , Tong Yu , Rohan Sharma , Zhiqiang Xu , Tong Sun , Changyou Chen

Generative vs. Discriminative modeling under the lens of uncertainty quantification

Learning a parametric model from a given dataset indeed enables to capture intrinsic dependencies between random variables via a parametric conditional probability distribution and in turn predict the value of a label variable given…

Machine Learning · Statistics 2024-06-14 Elouan Argouarc'h , François Desbouvries , Eric Barat , Eiji Kawasaki

The Effects of Invertibility on the Representational Complexity of Encoders in Variational Autoencoders

Training and using modern neural-network based latent-variable generative models (like Variational Autoencoders) often require simultaneously training a generative direction along with an inferential(encoding) direction, which approximates…

Machine Learning · Computer Science 2021-07-13 Divyansh Pareek , Andrej Risteski

Iterative Label Improvement: Robust Training by Confidence Based Filtering and Dataset Partitioning

State-of-the-art, high capacity deep neural networks not only require large amounts of labelled training data, they are also highly susceptible to label errors in this data, typically resulting in large efforts and costs and therefore…

Machine Learning · Computer Science 2020-07-20 Christian Haase-Schütz , Rainer Stal , Heinz Hertlein , Bernhard Sick

Reliable Active Learning from Unreliable Labels via Neural Collapse Geometry

Active Learning (AL) promises to reduce annotation cost by prioritizing informative samples, yet its reliability is undermined when labels are noisy or when the data distribution shifts. In practice, annotators make mistakes, rare…

Machine Learning · Computer Science 2025-10-14 Atharv Goel , Sharat Agarwal , Saket Anand , Chetan Arora

Scene Graph Prediction with Limited Labels

Visual knowledge bases such as Visual Genome power numerous applications in computer vision, including visual question answering and captioning, but suffer from sparse, incomplete relationships. All scene graph models to date are limited to…

Computer Vision and Pattern Recognition · Computer Science 2019-12-03 Vincent S. Chen , Paroma Varma , Ranjay Krishna , Michael Bernstein , Christopher Re , Li Fei-Fei