English
Related papers

Related papers: Approximating Incomplete Kernel Matrices by the em…

200 papers

We address the problem of filling missing entries in a kernel Gram matrix, given a related full Gram matrix. We attack this problem from the viewpoint of regression, assuming that the two kernel matrices can be considered as explanatory…

Quantitative Methods · Quantitative Biology 2011-11-10 Yoshihiro Yamanishi , Jean-Philippe Vert

With the huge influx of various data nowadays, extracting knowledge from them has become an interesting but tedious task among data scientists, particularly when the data come in heterogeneous form and have missing information. Many data…

Machine Learning · Computer Science 2017-05-11 Tsuyoshi Kato , Rachelle Rivero

Expectation maximisation (EM) is an unsupervised learning method for estimating the parameters of a finite mixture distribution. It works by introducing "hidden" or "latent" variables via Baum's auxiliary function $Q$ that allow the joint…

Machine Learning · Computer Science 2022-05-19 Graham W. Pulford

Many scientific computing problems can be reduced to Matrix-Matrix Multiplications (MMM), making the General Matrix Multiply (GEMM) kernels in the Basic Linear Algebra Subroutine (BLAS) of interest to the high-performance computing…

Hardware Architecture · Computer Science 2023-05-31 Louis Ledoux , Marc Casas

Multiple datasets containing different types of features may be available for a given task. For instance, users' profiles can be used to group users for recommendation systems. In addition, a model can also use users' historical behaviors…

Machine Learning · Computer Science 2016-05-10 Weixiang Shao , Xiaoxiao Shi , Philip S. Yu

Partially recorded data are frequently encountered in many applications and usually clustered by first removing incomplete cases or features with missing values, or by imputing missing values, followed by application of a clustering…

Methodology · Statistics 2021-10-20 Emily M. Goren , Ranjan Maitra

Measuring similarity between incomplete data is a fundamental challenge in web mining, recommendation systems, and user behavior analysis. Traditional approaches either discard incomplete data or perform imputation as a preprocessing step,…

Machine Learning · Computer Science 2025-10-16 Yang Cao , Sikun Yang , Kai He , Wenjun Ma , Ming Liu , Yujiu Yang , Jian Weng

By removing irrelevant and redundant features, feature selection aims to find a good representation of the original features. With the prevalence of unlabeled data, unsupervised feature selection has been proven effective in alleviating the…

Machine Learning · Computer Science 2024-03-25 Ziyuan Lin , Deanna Needell

Finite mixture modelling is a popular method in the field of clustering and is beneficial largely due to its soft cluster membership probabilities. A common method for fitting finite mixture models is to employ spectral clustering, which…

Machine Learning · Statistics 2024-03-22 Liam Welsh , Phillip Shreeves

Recently, a so-called E-MS algorithm was developed for model selection in the presence of missing data. Specifically, it performs the Expectation step (E step) and Model Selection step (MS step) alternately to find the minimum point of the…

Methodology · Statistics 2021-06-22 Ping-Feng Xu , Lai-Xu Shang , Man-Lai Tang , Na Shan , Guoliang Tian

Unlabeled shape analysis is a rapidly emerging and challenging area of statistics. This has been driven by various novel applications in bioinformatics. We consider here the situation where two configurations are matched under various…

Applications · Statistics 2012-09-28 Kanti V. Mardia , Emma M. Petty , Charles C. Taylor

Despite the ubiquity of kernel-based clustering, surprisingly few statistical guarantees exist beyond settings that consider strong structural assumptions on the data generation process. In this work, we take a step towards bridging this…

Machine Learning · Computer Science 2021-10-19 Leena Chennuru Vankadara , Sebastian Bordt , Ulrike von Luxburg , Debarghya Ghoshdastidar

Missing responses is a missing data format in which outcomes are not always observed. In this work we develop kernel machines that can handle missing responses. First, we propose a kernel machine family that uses mainly the complete cases.…

Machine Learning · Statistics 2018-06-11 Tiantian Liu , Yair Goldberg

Bayesian inference for exponential family random graph models (ERGMs) is a doubly-intractable problem because of the intractability of both the likelihood and posterior normalizing factor. Auxiliary variable based Markov Chain Monte Carlo…

Computation · Statistics 2020-07-15 Fan Yin , Carter T. Butts

Semi-supervised clustering is the task of clustering data points into clusters where only a fraction of the points are labelled. The true number of clusters in the data is often unknown and most models require this parameter as an input.…

Machine Learning · Computer Science 2013-09-27 Amar Shah , Zoubin Ghahramani

The Expectation--Maximization (EM) algorithm is a simple meta-algorithm that has been used for many years as a methodology for statistical inference when there are missing measurements in the observed data or when the data is composed of…

Machine Learning · Statistics 2022-11-15 Hideitsu Hino , Shotaro Akaho , Noboru Murata

We construct $\bf genRBF$ kernel, which generalizes the classical Gaussian RBF kernel to the case of incomplete data. We model the uncertainty contained in missing attributes making use of data distribution and associate every point with a…

Machine Learning · Computer Science 2017-05-03 Łukasz Struski , Marek Śmieja , Jacek Tabor

We study an EM algorithm for estimating product-term regression models with missing data. The study of such problems in the likelihood tradition has thus far been restricted to an EM algorithm method using full numerical integration.…

Methodology · Statistics 2021-11-16 Dale S. Kim

We propose an efficient family of algorithms to learn the parameters of a Bayesian network from incomplete data. In contrast to textbook approaches such as EM and the gradient method, our approach is non-iterative, yields closed form…

Machine Learning · Computer Science 2014-11-26 Guy Van den Broeck , Karthika Mohan , Arthur Choi , Judea Pearl

Most machine learning algorithms, such as classification or regression, treat the individual data point as the object of interest. Here we consider extending machine learning algorithms to operate on groups of data points. We suggest…

Machine Learning · Computer Science 2021-01-15 Danica J. Sutherland , Liang Xiong , Barnabás Póczos , Jeff Schneider
‹ Prev 1 2 3 10 Next ›