English
Related papers

Related papers: Mahalanobis Distance Metric Learning Algorithm for…

200 papers

For many machine learning algorithms such as $k$-Nearest Neighbor ($k$-NN) classifiers and $ k $-means clustering, often their success heavily depends on the metric used to calculate distances between different data points. An effective…

Computer Vision and Pattern Recognition · Computer Science 2010-03-03 Chunhua Shen , Junae Kim , Lei Wang

Effectively applying the K-means algorithm to clustering tasks with incomplete features remains an important research area due to its impact on real-world applications. Recent work has shown that unifying K-means clustering and imputation…

Machine Learning · Computer Science 2025-04-14 Lovis Kwasi Armah , Igor Melnykov

A number of machine learning algorithms are using a metric, or a distance, in order to compare individuals. The Euclidean distance is usually employed, but it may be more efficient to learn a parametric distance such as Mahalanobis metric.…

Machine Learning · Computer Science 2016-12-16 Hoel Le Capitaine

The need for appropriate ways to measure the distance or similarity between data is ubiquitous in machine learning, pattern recognition and data mining, but handcrafting such good metrics for specific problems is generally difficult. This…

Machine Learning · Computer Science 2019-01-25 Aurélien Bellet , Amaury Habrard , Marc Sebban

This report provides an exploration of different distance measures that can be used with the $K$-means algorithm for cluster analysis. Specifically, we investigate the Mahalanobis distance, and critically assess any benefits it may have…

Other Statistics · Statistics 2024-04-23 Zoe Shapcott

Common clustering algorithms require multiple scans of all the data to achieve convergence, and this is prohibitive when large databases, with data arriving in streams, must be processed. Some algorithms to extend the popular K-means method…

Applications · Statistics 2017-12-22 Giacomo Aletti , Alessandra Micheletti

In recent years, the crucial importance of metrics in machine learning algorithms has led to an increasing interest for optimizing distance and similarity functions. Most of the state of the art focus on learning Mahalanobis distances…

Machine Learning · Computer Science 2019-01-25 Aurelien Bellet , Amaury Habrard , Marc Sebban

A key element of any machine learning algorithm is the use of a function that measures the dis/similarity between data points. Given a task, such a function can be optimized with a metric learning algorithm. Although this research field has…

Machine Learning · Statistics 2019-09-05 Léo Gautheron , Emilie Morvant , Amaury Habrard , Marc Sebban

Mahalanobis metrics are widely used in machine learning in conjunction with methods like $k$-nearest neighbors, $k$-means clustering, and $k$-medians clustering. Despite their importance, there has not been any prior work on applying…

Machine Learning · Computer Science 2024-01-02 Lianke Qin , Aravind Reddy , Zhao Song

Online metric learning has been widely applied in classification and retrieval. It can automatically learn a suitable metric from data by restricting similar instances to be separated from dissimilar instances with a given margin. However,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-28 Wenbin Li , Yanfang Liu , Jing Huo , Yinghuan Shi , Yang Gao , Lei Wang , Jiebo Luo

Distance metric learning has attracted much attention in recent years, where the goal is to learn a distance metric based on user feedback. Conventional approaches to metric learning mainly focus on learning the Mahalanobis distance metric…

Machine Learning · Computer Science 2020-11-10 Zhongfang Zhuang , Xiangnan Kong , Elke Rundensteiner , Jihane Zouaoui , Aditya Arora

Distance metric learning is a successful way to enhance the performance of the nearest neighbor classifier. In most cases, however, the distribution of data does not obey a regular form and may change in different parts of the feature…

Computer Vision and Pattern Recognition · Computer Science 2018-03-19 Hossein Rajabzadeh , Mansoor Zolghadri Jahromi , Mohammad Sadegh Zare , Mostafa Fakhrahmad

A fundamental question in data analysis, machine learning and signal processing is how to compare between data points. The choice of the distance metric is specifically challenging for high-dimensional data sets, where the problem of…

Machine Learning · Statistics 2017-08-15 Almog Lahav , Ronen Talmon , Yuval Kluger

In this paper, we discuss Mahalanobis k-NN: A Statistical Lens designed to address the challenges of feature matching in learning-based point cloud registration when confronted with an arbitrary density of point clouds. We tackle this by…

Computer Vision and Pattern Recognition · Computer Science 2025-01-09 Tejas Anvekar , Shivanand Venkanna Sheshappanavar

The concepts of similarity and distance are crucial in data mining. We consider the problem of defining the distance between two data sets by comparing summary statistics computed from the data sets. The initial definition of our distance…

Data Structures and Algorithms · Computer Science 2019-02-05 Nikolaj Tatti

For many data mining and machine learning tasks, the quality of a similarity measure is the key for their performance. To automatically find a good similarity measure from datasets, metric learning and similarity learning are proposed and…

Machine Learning · Computer Science 2021-04-16 Dezhong Yao , Peilin Zhao , Chen Yu , Hai Jin , Bin Li

Data streams are often defined as large amounts of data flowing continuously at high speed. Moreover, these data are likely subject to changes in data distribution, known as concept drift. Given all the reasons mentioned above, learning…

Data stream mining aims at extracting meaningful knowledge from continually evolving data streams, addressing the challenges posed by nonstationary environments, particularly, concept drift which refers to a change in the underlying data…

Machine Learning · Computer Science 2025-01-03 Kleanthis Malialis , Jin Li , Christos G. Panayiotou , Marios M. Polycarpou

We propose a novel semiparametric classifier based on Mahalanobis distances of an observation from the competing classes. Our tool is a generalized additive model with the logistic link function that uses these distances as features to…

Methodology · Statistics 2025-02-05 Annesha Ghosh , Anil K. Ghosh , Rita SahaRay , Soham Sarkar

Metric learning is a key problem for many data mining and machine learning applications, and has long been dominated by Mahalanobis methods. Recent advances in nonlinear metric learning have demonstrated the potential power of…

Machine Learning · Statistics 2014-02-25 David M. Johnson , Caiming Xiong , Jason J. Corso
‹ Prev 1 2 3 10 Next ›