Related papers: Approximating Incomplete Kernel Matrices by the em…

Kernel matrix regression

We address the problem of filling missing entries in a kernel Gram matrix, given a related full Gram matrix. We attack this problem from the viewpoint of regression, assuming that the two kernel matrices can be considered as explanatory…

Quantitative Methods · Quantitative Biology 2011-11-10 Yoshihiro Yamanishi, Jean-Philippe Vert

Mutual Kernel Matrix Completion

With the huge influx of various data nowadays, extracting knowledge from them has become an interesting but tedious task among data scientists, particularly when the data come in heterogeneous form and have missing information. Many data…

Machine Learning · Computer Science 2017-05-11 Tsuyoshi Kato, Rachelle Rivero

Learning Shared Kernel Models: the Shared Kernel EM algorithm

Expectation maximisation (EM) is an unsupervised learning method for estimating the parameters of a finite mixture distribution. It works by introducing "hidden" or "latent" variables via Baum's auxiliary function $Q$ that allow the joint…

Machine Learning · Computer Science 2022-05-19 Graham W. Pulford

Open-Source GEMM Hardware Kernels Generator: Toward Numerically-Tailored Computations

Many scientific computing problems can be reduced to Matrix-Matrix Multiplications (MMM), making the General Matrix Multiply (GEMM) kernels in the Basic Linear Algebra Subroutine (BLAS) of interest to the high-performance computing…

Hardware Architecture · Computer Science 2023-05-31 Louis Ledoux, Marc Casas

Clustering on Multiple Incomplete Datasets via Collective Kernel Learning

Multiple datasets containing different types of features may be available for a given task. For instance, users' profiles can be used to group users for recommendation systems. In addition, a model can also use users' historical behaviors…

Machine Learning · Computer Science 2016-05-10 Weixiang Shao, Xiaoxiao Shi, Philip S. Yu

Fast model-based clustering of partial records

Partially recorded data are frequently encountered in many applications and usually clustered by first removing incomplete cases or features with missing values, or by imputing missing values, followed by application of a clustering…

Methodology · Statistics 2021-10-20 Emily M. Goren, Ranjan Maitra

Kernel Representation and Similarity Measure for Incomplete Data

Measuring similarity between incomplete data is a fundamental challenge in web mining, recommendation systems, and user behavior analysis. Traditional approaches either discard incomplete data or perform imputation as a preprocessing step,…

Machine Learning · Computer Science 2025-10-16 Yang Cao, Sikun Yang, Kai He, Wenjun Ma, Ming Liu, Yujiu Yang, Jian Weng

Kernel Alignment for Unsupervised Feature Selection via Matrix Factorization

By removing irrelevant and redundant features, feature selection aims to find a good representation of the original features. With the prevalence of unlabeled data, unsupervised feature selection has been proven effective in alleviating the…

Machine Learning · Computer Science 2024-03-25 Ziyuan Lin, Deanna Needell

A Non-Parametric Bootstrap for Spectral Clustering

Finite mixture modelling is a popular method in the field of clustering and is beneficial largely due to its soft cluster membership probabilities. A common method for fitting finite mixture models is to employ spectral clustering, which…

Machine Learning · Statistics 2024-03-22 Liam Welsh, Phillip Shreeves

A generalized EMS algorithm for model selection with incomplete data

Recently, a so-called E-MS algorithm was developed for model selection in the presence of missing data. Specifically, it performs the Expectation step (E step) and Model Selection step (MS step) alternately to find the minimum point of the…

Methodology · Statistics 2021-06-22 Ping-Feng Xu, Lai-Xu Shang, Man-Lai Tang, Na Shan, Guoliang Tian

Matching markers and unlabeled configurations in protein gels

Unlabeled shape analysis is a rapidly emerging and challenging area of statistics. This has been driven by various novel applications in bioinformatics. We consider here the situation where two configurations are matched under various…

Applications · Statistics 2012-09-28 Kanti V. Mardia, Emma M. Petty, Charles C. Taylor

Recovery Guarantees for Kernel-based Clustering under Non-parametric Mixture Models

Despite the ubiquity of kernel-based clustering, surprisingly few statistical guarantees exist beyond settings that consider strong structural assumptions on the data generation process. In this work, we take a step towards bridging this…

Machine Learning · Computer Science 2021-10-19 Leena Chennuru Vankadara, Sebastian Bordt, Ulrike von Luxburg, Debarghya Ghoshdastidar

Kernel Machines With Missing Responses

Missing responses is a missing data format in which outcomes are not always observed. In this work we develop kernel machines that can handle missing responses. First, we propose a kernel machine family that uses mainly the complete cases.…

Machine Learning · Statistics 2018-06-11 Tiantian Liu, Yair Goldberg

Kernel-based Approximate Bayesian Inference for Exponential Family Random Graph Models

Bayesian inference for exponential family random graph models (ERGMs) is a doubly-intractable problem because of the intractability of both the likelihood and posterior normalizing factor. Auxiliary variable based Markov Chain Monte Carlo…

Computation · Statistics 2020-07-15 Fan Yin, Carter T. Butts

Determinantal Clustering Processes - A Nonparametric Bayesian Approach to Kernel Based Semi-Supervised Clustering

Semi-supervised clustering is the task of clustering data points into clusters where only a fraction of the points are labelled. The true number of clusters in the data is often unknown and most models require this parameter as an input.…

Machine Learning · Computer Science 2013-09-27 Amar Shah, Zoubin Ghahramani

Geometry of EM and related iterative algorithms

The Expectation–Maximization (EM) algorithm is a simple meta-algorithm that has been used for many years as a methodology for statistical inference when there are missing measurements in the observed data or when the data is composed of…

Machine Learning · Statistics 2022-11-15 Hideitsu Hino, Shotaro Akaho, Noboru Murata

Generalized RBF kernel for incomplete data

We construct $\mathbf{genRBF}$ kernel, which generalizes the classical Gaussian RBF kernel to the case of incomplete data. We model the uncertainty contained in missing attributes making use of data distribution and associate every point with a…

Machine Learning · Computer Science 2017-05-03 Łukasz Struski, Marek Śmieja, Jacek Tabor

A Hybrid EM Algorithm for Linear Two-Way Interactions with Missing Data

We study an EM algorithm for estimating product-term regression models with missing data. The study of such problems in the likelihood tradition has thus far been restricted to an EM algorithm method using full numerical integration.…

Methodology · Statistics 2021-11-16 Dale S. Kim

Efficient Algorithms for Bayesian Network Parameter Learning from Incomplete Data

We propose an efficient family of algorithms to learn the parameters of a Bayesian network from incomplete data. In contrast to textbook approaches such as EM and the gradient method, our approach is non-iterative, yields closed form…

Machine Learning · Computer Science 2014-11-26 Guy Van den Broeck, Karthika Mohan, Arthur Choi, Judea Pearl

Kernels on Sample Sets via Nonparametric Divergence Estimates

Most machine learning algorithms, such as classification or regression, treat the individual data point as the object of interest. Here we consider extending machine learning algorithms to operate on groups of data points. We suggest…

Machine Learning · Computer Science 2021-01-15 Danica J. Sutherland, Liang Xiong, Barnabás Póczos, Jeff Schneider