Related papers: A Multiscale Environment for Learning by Diffusion

200 papers

This paper proposes and analyzes a novel clustering algorithm that combines graph-based diffusion geometry with techniques based on density and mode estimation. The proposed method is suitable for data generated from mixtures of…

Machine Learning · Statistics 2019-01-01 Mauro Maggioni, James M. Murphy

Clustering algorithms partition a dataset into groups of similar points. The primary contribution of this article is the Multiscale Spatially-Regularized Diffusion Learning (M-SRDL) clustering algorithm, which uses spatially-regularized…

Machine Learning · Computer Science 2022-04-08 Sam L. Polk, James M. Murphy

This article proposes an active learning method for high dimensional data, based on intrinsic data geometries learned through diffusion processes on graphs. Diffusion distances are used to parametrize low-dimensional structures on the…

Machine Learning · Computer Science 2019-05-31 Mauro Maggioni, James M. Murphy

Clustering algorithms, a basic class of machine learning methods, group data points into different categories based on their similarity or distribution. We present a clustering algorithm that finds hyperplanes to distinguish the data…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Luhong Diao, Jinying Gao, Manman Deng

The Multi-Task Learning (MTL) technique has been widely studied by researchers worldwide. The majority of current MTL studies adopt the hard parameter sharing structure, where hard layers tend to learn general representations over all tasks…

Information Retrieval · Computer Science 2021-01-25 Dehong Gao, Wenjing Yang, Huiling Zhou, Yi Wei, Yi Hu, Hao Wang

Unsupervised landmarks discovery (ULD) for an object category is a challenging computer vision problem. In pursuit of developing a robust ULD framework, we explore the potential of a recent paradigm of self-supervised learning algorithms,…

Computer Vision and Pattern Recognition · Computer Science 2024-03-26 Siddharth Tourani, Ahmed Alwheibi, Arif Mahmood, Muhammad Haris Khan

Clustering is an unsupervised machine learning methodology in which unlabeled elements or objects are grouped together, with the aim of constructing well-established clusters whose elements are classified according to their similarity. The…

Machine Learning · Statistics 2023-10-20 Dimitrios Saligkaras, Vasileios E. Papageorgiou

Diffusion models, widely recognized for their success in generative tasks, have not yet been applied to clustering. We introduce Clustering via Diffusion (CLUDI), a self-supervised framework that combines the generative power of diffusion…

Artificial Intelligence · Computer Science 2025-07-31 Roy Uziel, Irit Chelly, Oren Freifeld, Ari Pakman

Clustering is one of the most fundamental and widespread techniques in exploratory data analysis. Yet, the basic approach to clustering has not really changed: a practitioner hand-picks a task-specific clustering loss to optimize and fit…

Machine Learning · Computer Science 2019-11-01 Yibo Jiang, Nakul Verma

The problem of unsupervised learning and segmentation of hyperspectral images is a significant challenge in remote sensing. The high dimensionality of hyperspectral data, presence of substantial noise, and overlap of classes all contribute…

Computer Vision and Pattern Recognition · Computer Science 2018-10-17 James M. Murphy, Mauro Maggioni

Diffusion models have gained traction as powerful algorithms for synthesizing high-quality images. Central to these algorithms is the diffusion process, a set of equations which maps data to noise in a way that can significantly affect…

Machine Learning · Computer Science 2024-11-12 Subham Sekhar Sahoo, Aaron Gokaslan, Chris De Sa, Volodymyr Kuleshov

Masked diffusion models (MDMs) have achieved notable progress in modeling discrete data, while their potential in molecular generation remains underexplored. In this work, we explore their potential and introduce the surprising result that…

Machine Learning · Computer Science 2025-09-29 Hyunjin Seo, Taewon Kim, Sihyun Yu, SungSoo Ahn

A model based clustering procedure for data of mixed type, clustMD, is developed using a latent variable model. It is proposed that a latent variable, following a mixture of Gaussian distributions, generates the observed data of mixed type.…

Methodology · Statistics 2015-11-06 Damien McParland, Isobel Claire Gormley

Subspace clustering has been extensively studied from the hypothesis-and-test, algebraic, and spectral clustering based perspectives. Most assume that only a single type/class of subspace is present. Generalizations to multiple types are…

Computer Vision and Pattern Recognition · Computer Science 2019-04-04 Xun Xu, Loong-Fah Cheong, Zhuwen Li

Large-scale multi-layer networks with large numbers of nodes, edges, and layers arise across various domains, which poses a great computational challenge for the downstream analysis. In this paper, we develop an efficient randomized…

Computation · Statistics 2025-01-10 Wenqing Su, Xiao Guo, Xiangyu Chang, Ying Yang

Diffusion models have recently been successfully applied to a wide range of robotics applications for learning complex multi-modal behaviors from data. However, prior works have mostly been confined to single-robot and small-scale…

Robotics · Computer Science 2025-05-08 Yorai Shaoul, Itamar Mishani, Shivam Vats, Jiaoyang Li, Maxim Likhachev

This paper introduces a new clustering technique, called dimensional clustering, which clusters each data point by its latent pointwise dimension, which is a measure of the dimensionality of the data set local to that point.…

Machine Learning · Statistics 2018-05-29 Shohei Hidaka, Neeraj Kashyap

Predictive models trained on imbalanced data tend to produce biased results. This problem is exacerbated when there is not just one output label, but a set of them. This is the case for multilabel learning (MLL) algorithms used to classify…

Machine Learning · Computer Science 2025-01-22 Francisco Charte, Miguel Ángel Dávila, María Dolores Pérez-Godoy, María José del Jesus

Clustering high-dimensional spatiotemporal data using an unsupervised approach is a challenging problem for many data-driven applications. Existing state-of-the-art methods for unsupervised clustering use different similarity and distance…

Machine Learning · Computer Science 2023-09-15 Omar Faruque, Francis Ndikum Nji, Mostafa Cham, Rohan Mandar Salvi, Xue Zheng, Jianwu Wang

Clustering high-dimensional datasets is hard because interpoint distances become less informative in high-dimensional spaces. We present a clustering algorithm that performs nonlinear dimensionality reduction and clustering jointly. The…

Machine Learning · Computer Science 2018-03-06 Sohil Atul Shah, Vladlen Koltun