Related papers: Topic Modeling as Multi-Objective Contrastive Opti…

Contrastive estimation reveals topic posterior information to linear models

Contrastive learning is an approach to representation learning that utilizes naturally occurring similar and dissimilar pairs of data points to find useful embeddings of data. In the context of document classification under topic modeling…

Machine Learning · Computer Science 2020-03-05 Christopher Tosh , Akshay Krishnamurthy , Daniel Hsu

Enhancing Context Through Contrast

Neural machine translation benefits from semantically rich representations. Considerable progress in learning such representations has been achieved by language modelling and mutual information maximization objectives using contrastive…

Computation and Language · Computer Science 2024-01-09 Kshitij Ambilduke , Aneesh Shetye , Diksha Bagade , Rishika Bhagwatkar , Khurshed Fitter , Prasad Vagdargi , Shital Chiddarwar

Enhancing Topic Interpretability for Neural Topic Modeling through Topic-wise Contrastive Learning

Data mining and knowledge discovery are essential aspects of extracting valuable insights from vast datasets. Neural topic models (NTMs) have emerged as a valuable unsupervised tool in this field. However, the predominant objective in NTMs,…

Artificial Intelligence · Computer Science 2024-12-24 Xin Gao , Yang Lin , Ruiqing Li , Yasha Wang , Xu Chu , Xinyu Ma , Hailong Yu

Momentum Contrastive Learning with Enhanced Negative Sampling and Hard Negative Filtering

Contrastive learning has become pivotal in unsupervised representation learning, with frameworks like Momentum Contrast (MoCo) effectively utilizing large negative sample sets to extract discriminative features. However, traditional…

Machine Learning · Computer Science 2025-01-29 Duy Hoang , Huy Ngo , Khoi Pham , Tri Nguyen , Gia Bao , Huy Phan

Paying Alignment Tax with Contrastive Learning

Current debiasing approaches often result a degradation in model capabilities such as factual accuracy and knowledge retention. Through systematic evaluation across multiple benchmarks, we demonstrate that existing debiasing methods face…

Machine Learning · Computer Science 2025-05-27 Buse Sibel Korkmaz , Rahul Nair , Elizabeth M. Daly , Antonio del Rio Chanona

Aligning Visual Contrastive learning models via Preference Optimization

Contrastive learning models have demonstrated impressive abilities to capture semantic similarities by aligning representations in the embedding space. However, their performance can be limited by the quality of the training data and its…

Computer Vision and Pattern Recognition · Computer Science 2025-03-27 Amirabbas Afzali , Borna Khodabandeh , Ali Rasekh , Mahyar JafariNodeh , Sepehr kazemi , Simon Gottschalk

Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings

Semantic representation learning for sentences is an important and well-studied problem in NLP. The current trend for this task involves training a Transformer-based sentence encoder through a contrastive objective with text, i.e.,…

Computation and Language · Computer Science 2022-09-21 Yiren Jian , Chongyang Gao , Soroush Vosoughi

Multi-Task Learning as Multi-Objective Optimization

In multi-task learning, multiple tasks are solved jointly, sharing inductive bias between them. Multi-task learning is inherently a multi-objective problem because different tasks may conflict, necessitating a trade-off. A common compromise…

Machine Learning · Computer Science 2019-01-14 Ozan Sener , Vladlen Koltun

Fixing a Broken ELBO

Recent work in unsupervised representation learning has focused on learning deep directed latent-variable models. Fitting these models by maximizing the marginal likelihood or evidence is typically intractable, thus a common approximation…

Machine Learning · Computer Science 2018-02-15 Alexander A. Alemi , Ben Poole , Ian Fischer , Joshua V. Dillon , Rif A. Saurous , Kevin Murphy

Investigating the Role of Negatives in Contrastive Representation Learning

Noise contrastive learning is a popular technique for unsupervised representation learning. In this approach, a representation is obtained via reduction to supervised learning, where given a notion of semantic similarity, the learner tries…

Machine Learning · Computer Science 2021-06-21 Jordan T. Ash , Surbhi Goel , Akshay Krishnamurthy , Dipendra Misra

Improving Contextualized Topic Models with Negative Sampling

Topic modeling has emerged as a dominant method for exploring large document collections. Recent approaches to topic modeling use large contextualized language models and variational autoencoders. In this paper, we propose a negative…

Computation and Language · Computer Science 2023-03-28 Suman Adhya , Avishek Lahiri , Debarshi Kumar Sanyal , Partha Pratim Das

Multi-Objective Learning to Predict Pareto Fronts Using Hypervolume Maximization

Real-world problems are often multi-objective with decision-makers unable to specify a priori which trade-off between the conflicting objectives is preferable. Intuitively, building machine learning solutions in such cases would entail…

Machine Learning · Computer Science 2021-10-20 Timo M. Deist , Monika Grewal , Frank J. W. M. Dankers , Tanja Alderliesten , Peter A. N. Bosman

Contrastive Learning for Neural Topic Model

Recent empirical studies show that adversarial topic models (ATM) can successfully capture semantic patterns of the document by differentiating a document with another dissimilar sample. However, utilizing that discriminative-generative…

Computation and Language · Computer Science 2021-10-26 Thong Nguyen , Anh Tuan Luu

Representing Mixtures of Word Embeddings with Mixtures of Topic Embeddings

A topic model is often formulated as a generative model that explains how each word of a document is generated given a set of topics and document-specific topic proportions. It is focused on capturing the word co-occurrences in a document…

Machine Learning · Computer Science 2022-03-16 Dongsheng Wang , Dandan Guo , He Zhao , Huangjie Zheng , Korawat Tanwisuth , Bo Chen , Mingyuan Zhou

A Contrastive Learning Approach to Mitigate Bias in Speech Models

Speech models may be affected by performance imbalance in different population subgroups, raising concerns about fair treatment across these groups. Prior attempts to mitigate unfairness either focus on user-defined subgroups, potentially…

Computation and Language · Computer Science 2024-09-17 Alkis Koudounas , Flavio Giobergia , Eliana Pastor , Elena Baralis

A Mathematical Perspective On Contrastive Learning

Multimodal contrastive learning is a methodology for linking different data modalities; the canonical example is linking image and text data. The methodology is typically framed as the identification of a set of encoders, one for each…

Machine Learning · Statistics 2025-06-02 Ricardo Baptista , Andrew M. Stuart , Son Tran

Mitigating Data Sparsity for Short Text Topic Modeling by Topic-Semantic Contrastive Learning

To overcome the data sparsity issue in short text topic modeling, existing methods commonly rely on data augmentation or the data characteristic of short texts to introduce more word co-occurrence information. However, most of them do not…

Computation and Language · Computer Science 2022-11-24 Xiaobao Wu , Anh Tuan Luu , Xinshuai Dong

Contrastive Learning-based Sentence Encoders Implicitly Weight Informative Words

The performance of sentence encoders can be significantly improved through the simple practice of fine-tuning using contrastive loss. A natural question arises: what characteristics do models acquire during contrastive learning? This paper…

Computation and Language · Computer Science 2023-10-25 Hiroto Kurita , Goro Kobayashi , Sho Yokoi , Kentaro Inui

Evaluating Negative Sampling Approaches for Neural Topic Models

Negative sampling has emerged as an effective technique that enables deep learning models to learn better representations by introducing the paradigm of learn-to-compare. The goal of this approach is to add robustness to deep learning…

Computation and Language · Computer Science 2025-03-26 Suman Adhya , Avishek Lahiri , Debarshi Kumar Sanyal , Partha Pratim Das

Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence

Topic models extract groups of words from documents, whose interpretation as a topic hopefully allows for a better understanding of the data. However, the resulting word groups are often not coherent, making them harder to interpret.…

Computation and Language · Computer Science 2021-06-18 Federico Bianchi , Silvia Terragni , Dirk Hovy