Related papers: A Generalized Probabilistic Framework for Compact …

Lossless Coding with Generalised Criteria

This paper presents prefix codes which minimize various criteria constructed as a convex combination of maximum codeword length and average codeword length or maximum redundancy and average redundancy, including a convex combination of the…

Information Theory · Computer Science 2011-02-11 Charalambos D. Charalambous , Themistoklis Charalambous , Farzad Rezaei

Optimal Merging Algorithms for Lossless Codes with Generalized Criteria

This paper presents lossless prefix codes optimized with respect to a pay-off criterion consisting of a convex combination of maximum codeword length and average codeword length. The optimal codeword lengths obtained are based on a new…

Information Theory · Computer Science 2012-08-18 Themistoklis Charalambous , Charalambos D. Charalambous , Farzad Rezaei

Unified Bayesian Frameworks for Multi-criteria Decision-making Problems

This paper introduces Bayesian frameworks for tackling various aspects of multi-criteria decision-making (MCDM) problems, leveraging a probabilistic interpretation of MCDM methods and challenges. By harnessing the flexibility of Bayesian…

Artificial Intelligence · Computer Science 2025-08-08 Majid Mohammadi

Probabilistic Models for Unified Collaborative and Content-Based Recommendation in Sparse-Data Environments

Recommender systems leverage product and community information to target products to consumers. Researchers have developed collaborative recommenders, content-based recommenders, and (largely ad-hoc) hybrid systems. We propose a unified…

Information Retrieval · Computer Science 2013-01-14 Alexandrin Popescul , Lyle H. Ungar , David M Pennock , Steve Lawrence

Abstractive Multi-Document Summarization via Phrase Selection and Merging

We propose an abstraction-based multi-document summarization framework that can construct new sentences by exploring more fine-grained syntactic units than sentences, namely, noun/verb phrases. Different from existing abstraction-based…

Computation and Language · Computer Science 2015-06-08 Lidong Bing , Piji Li , Yi Liao , Wai Lam , Weiwei Guo , Rebecca J. Passonneau

Inference and Evaluation of the Multinomial Mixture Model for Text Clustering

In this article, we investigate the use of a probabilistic model for unsupervised clustering in text collections. Unsupervised clustering has become a basic module for many intelligent text processing applications, such as information…

Information Retrieval · Computer Science 2016-08-16 Loïs Rigouste , Olivier Cappé , François Yvon

A Statistical Approach to Increase Classification Accuracy in Supervised Learning Algorithms

Probabilistic mixture models have been widely used for different machine learning and pattern recognition tasks such as clustering, dimensionality reduction, and classification. In this paper, we focus on trying to solve the most common…

Machine Learning · Computer Science 2020-04-08 Gustavo A Valencia-Zapata , Daniel Mejia , Gerhard Klimeck , Michael Zentner , Okan Ersoy

Sketching for Large-Scale Learning of Mixture Models

Learning parameters from voluminous data can be prohibitive in terms of memory and computational requirements. We propose a "compressive learning" framework where we estimate model parameters from a sketch of the training data. This sketch…

Machine Learning · Computer Science 2017-05-08 Nicolas Keriven , Anthony Bourrier , Rémi Gribonval , Patrick Pérez

SyncSDE: A Probabilistic Framework for Diffusion Synchronization

There have been many attempts to leverage multiple diffusion models for collaborative generation, extending beyond the original domain. A prominent approach involves synchronizing multiple diffusion trajectories by mixing the estimated…

Machine Learning · Computer Science 2025-06-04 Hyunjun Lee , Hyunsoo Lee , Sookwan Han

Constrained Sampling for Language Models Should Be Easy: An MCMC Perspective

Constrained decoding enables Language Models (LMs) to produce samples that provably satisfy hard constraints. However, existing constrained-decoding approaches often distort the underlying model distribution, a limitation that is especially…

Artificial Intelligence · Computer Science 2025-06-09 Emmanuel Anaya Gonzalez , Sairam Vaidya , Kanghee Park , Ruyi Ji , Taylor Berg-Kirkpatrick , Loris D'Antoni

List-Level Distribution Coupling with Applications to Speculative Decoding and Lossy Compression

We study a relaxation of the problem of coupling probability distributions -- a list of samples is generated from one distribution and an accept is declared if any one of these samples is identical to the sample generated from the other…

Machine Learning · Computer Science 2026-01-13 Joseph Rowan , Buu Phan , Ashish Khisti

An Uncertainty Framework for Classification

We define a generalized likelihood function based on uncertainty measures and show that maximizing such a likelihood function for different measures induces different types of classifiers. In the probabilistic framework, we obtain…

Machine Learning · Computer Science 2013-01-18 Loo-Nin Teow , Kia-Fock Loe

Compositional Distributed Learning for Multi-View Perception: A Maximal Coding Rate Reduction Perspective

In this letter, we formulate a compositional distributed learning framework for multi-view perception by leveraging the maximal coding rate reduction principle combined with subspace basis fusion. In the proposed algorithm, each agent…

Image and Video Processing · Electrical Eng. & Systems 2025-11-13 Zhuojun Tian , Mehdi Bennis

Variational Information Maximization for Feature Selection

Feature selection is one of the most fundamental problems in machine learning. An extensive body of work on information-theoretic feature selection exists which is based on maximizing mutual information between subsets of features and class…

Machine Learning · Statistics 2016-06-10 Shuyang Gao , Greg Ver Steeg , Aram Galstyan

Combining Independent Modules in Lexical Multiple-Choice Problems

Existing statistical approaches to natural language problems are very coarse approximations to the true complexity of language processing. As such, no single technique will be best for all problem instances. Many researchers are examining…

Machine Learning · Computer Science 2007-05-23 Peter D. Turney , Michael L. Littman , Jeffrey Bigham , Victor Shnayder

Marginal Likelihoods for Distributed Parameter Estimation of Gaussian Graphical Models

We consider distributed estimation of the inverse covariance matrix, also called the concentration or precision matrix, in Gaussian graphical models. Traditional centralized estimation often requires global inference of the covariance…

Machine Learning · Statistics 2015-06-15 Zhaoshi Meng , Dennis Wei , Ami Wiesel , Alfred O. Hero

Realistic Evaluation of Model Merging for Compositional Generalization

Merging has become a widespread way to cheaply combine individual models into a single model that inherits their capabilities and attains better performance. This popularity has spurred rapid development of many new merging methods, which…

Machine Learning · Computer Science 2024-09-30 Derek Tam , Yash Kant , Brian Lester , Igor Gilitschenski , Colin Raffel

Matching Catalogues by Probabilistic Pattern Classification

We consider the statistical problem of catalogue matching from a machine learning perspective with the goal of producing probabilistic outputs, and using all available information. A framework is provided that unifies two existing…

Astrophysics · Physics 2009-11-11 D. J. Rohde , M. R. Gallagher , M. J. Drinkwater , K. A. Pimbblet

Gem: Gaussian Mixture Model Embeddings for Numerical Feature Distributions

Embeddings are now used to underpin a wide variety of data management tasks, including entity resolution, dataset search and semantic type detection. Such applications often involve datasets with numerical columns, but there has been more…

Databases · Computer Science 2024-10-11 Hafiz Tayyab Rauf , Alex Bogatu , Norman W. Paton , Andre Freitas

Fusion of Probability Density Functions

Fusing probabilistic information is a fundamental task in signal and data processing with relevance to many fields of technology and science. In this work, we investigate the fusion of multiple probability density functions (pdfs) of a…

Signal Processing · Electrical Eng. & Systems 2023-01-20 Günther Koliander , Yousef El-Laham , Petar M. Djurić , Franz Hlawatsch