Related papers: Approximate kNN Classification for Biomedical Data

Rapid AkNN Query Processing for Fast Classification of Multidimensional Data in the Cloud

A $k$-nearest neighbor ($k$NN) query determines the $k$ nearest points, using distance metrics, from a specific location. An all $k$-nearest neighbor (A$k$NN) query constitutes a variation of a $k$NN query and retrieves the $k$ nearest…

Databases · Computer Science 2014-02-28 Nikolaos Nodarakis , Spyros Sioutas , Dimitrios Tsoumakos , Giannis Tzimas , Evaggelia Pitoura

Application of Deep Learning on Single-Cell RNA-sequencing Data Analysis: A Review

Single-cell RNA-sequencing (scRNA-seq) has become a routinely used technique to quantify the gene expression profile of thousands of single cells simultaneously. Analysis of scRNA-seq data plays an important role in the study of cell states…

Genomics · Quantitative Biology 2022-10-13 Matthew Brendel , Chang Su , Zilong Bai , Hao Zhang , Olivier Elemento , Fei Wang

Machine learning based biomedical image processing for echocardiographic images

The popularity of Artificial intelligence and machine learning have prompted researchers to use it in the recent researches. The proposed method uses K-Nearest Neighbor (KNN) algorithm for segmentation of medical images, extracting of image…

Image and Video Processing · Electrical Eng. & Systems 2023-03-17 Ayesha Heena , Nagashettappa Biradar , Najmuddin M. Maroof , Surbhi Bhatia , Rashmi Agarwal , Kanta Prasad

A new hashing based nearest neighbors selection technique for big datasets

KNN has the reputation to be the word simplest but efficient supervised learning algorithm used for either classification or regression. KNN prediction efficiency highly depends on the size of its training data but when this training data…

Machine Learning · Computer Science 2021-07-01 Jude Tchaye-Kondi , Yanlong Zhai , Liehuang Zhu

Minimax Rate Optimal Adaptive Nearest Neighbor Classification and Regression

k Nearest Neighbor (kNN) method is a simple and popular statistical method for classification and regression. For both classification and regression problems, existing works have shown that, if the distribution of the feature vector has…

Statistics Theory · Mathematics 2019-10-24 Puning Zhao , Lifeng Lai

Adaptive kNN using Expected Accuracy for Classification of Geo-Spatial Data

The k-Nearest Neighbor (kNN) classification approach is conceptually simple - yet widely applied since it often performs well in practical applications. However, using a global constant k does not always provide an optimal solution, e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2018-01-08 Mark Kibanov , Martin Becker , Juergen Mueller , Martin Atzmueller , Andreas Hotho , Gerd Stumme

Learned k-NN Distance Estimation

Big data mining is well known to be an important task for data science, because it can provide useful observations and new knowledge hidden in given large datasets. Proximity-based data analysis is particularly utilized in many real-life…

Databases · Computer Science 2022-11-29 Daichi Amagata , Yusuke Arai , Sumio Fujita , Takahiro Hara

Efficient Estimation of the number of neighbours in Probabilistic K Nearest Neighbour Classification

Probabilistic k-nearest neighbour (PKNN) classification has been introduced to improve the performance of original k-nearest neighbour (KNN) classification algorithm by explicitly modelling uncertainty in the classification of each feature…

Machine Learning · Computer Science 2013-05-07 Ji Won Yoon , Nial Friel

Efficient Estimation of k for the Nearest Neighbors Class of Methods

The k Nearest Neighbors (kNN) method has received much attention in the past decades, where some theoretical bounds on its performance were identified and where practical optimizations were proposed for making it work fairly well in high…

Machine Learning · Computer Science 2016-06-14 Aleksander Lodwich , Faisal Shafait , Thomas Breuel

AEkNN: An AutoEncoder kNN-based classifier with built-in dimensionality reduction

High dimensionality, i.e. data having a large number of variables, tends to be a challenge for most machine learning tasks, including classification. A classifier usually builds a model representing how a set of inputs explain the outputs.…

Machine Learning · Computer Science 2018-03-12 Francisco J. Pulgar , Francisco Charte , Antonio J. Rivera , María J. del Jesus

Nearest Neighbor CCP-Based Molecular Sequence Analysis

Molecular sequence analysis is crucial for comprehending several biological processes, including protein-protein interactions, functional annotation, and disease classification. The large number of sequences and the inherently complicated…

Genomics · Quantitative Biology 2025-10-10 Sarwan Ali , Prakash Chourasia , Bipin Koirala , Murray Patterson

Rates of Convergence for Large-scale Nearest Neighbor Classification

Nearest neighbor is a popular class of classification methods with many desirable properties. For a large data set which cannot be loaded into the memory of a single machine due to computation, communication, privacy, or ownership…

Machine Learning · Statistics 2019-11-01 Xingye Qiao , Jiexin Duan , Guang Cheng

Enhanced Single-Cell RNA-seq Embedding through Gene Expression and Data-Driven Gene-Gene Interaction Integration

Single-cell RNA sequencing (scRNA-seq) provides unprecedented insights into cellular heterogeneity, enabling detailed analysis of complex biological systems at single-cell resolution. However, the high dimensionality and technical noise…

Genomics · Quantitative Biology 2025-09-04 Hojjat Torabi Goudarzi , Maziyar Baran Pouyan

A Systematic Overview of Single-Cell Transcriptomics Databases, their Use cases, and Limitations

Rapid advancements in high-throughput single-cell RNA-seq (scRNA-seq) technologies and experimental protocols have led to the generation of vast amounts of genomic data that populates several online databases and repositories. Here, we…

Genomics · Quantitative Biology 2024-04-17 Mahnoor N. Gondal , Saad Ur Rehman Shah , Arul M. Chinnaiyan , Marcin Cieslik

RNA-seq data science: From raw data to effective interpretation

RNA-sequencing (RNA-seq) has become an exemplar technology in modern biology and clinical applications over the past decade. It has gained immense popularity in the recent years driven by continuous efforts of the bioinformatics community…

Genomics · Quantitative Biology 2021-02-17 Dhrithi Deshpande , Karishma Chhugani , Yutong Chang , Aaron Karlsberg , Caitlin Loeffler , Jinyang Zhang , Agata Muszynska , Jeremy Rotman , Laura Tao , Brunilda Balliu , Elizabeth Tseng , Eleazar Eskin , Fangqing Zhao , Pejman Mohammadi , Pawel P Labaj , Serghei Mangul

An Improved k-Nearest Neighbor Algorithm for Text Categorization

k is the most important parameter in a text categorization system based on k-Nearest Neighbor algorithm (kNN).In the classification process, k nearest documents to the test one in the training set are determined firstly. Then, the…

Computation and Language · Computer Science 2007-05-23 Baoli Li , Shiwen Yu , Qin Lu

Distributed Adaptive Nearest Neighbor Classifier: Algorithm and Theory

When data is of an extraordinarily large size or physically stored in different locations, the distributed nearest neighbor (NN) classifier is an attractive tool for classification. We propose a novel distributed adaptive NN classifier for…

Machine Learning · Statistics 2023-06-06 Ruiqi Liu , Ganggang Xu , Zuofeng Shang

DNNR: Differential Nearest Neighbors Regression

K-nearest neighbors (KNN) is one of the earliest and most established algorithms in machine learning. For regression tasks, KNN averages the targets within a neighborhood which poses a number of challenges: the neighborhood definition is…

Machine Learning · Computer Science 2022-05-18 Youssef Nader , Leon Sixt , Tim Landgraf

Adaptive Nearest Neighbor: A General Framework for Distance Metric Learning

$K$-NN classifier is one of the most famous classification algorithms, whose performance is crucially dependent on the distance metric. When we consider the distance metric as a parameter of $K$-NN, learning an appropriate distance metric…

Machine Learning · Computer Science 2019-11-26 Kun Song

Approximate Nearest Neighbour Search on Dynamic Datasets: An Investigation

Approximate k-Nearest Neighbour (ANN) methods are often used for mining information and aiding machine learning on large scale high-dimensional datasets. ANN methods typically differ in the index structure used for accelerating searches,…

Machine Learning · Computer Science 2025-02-04 Ben Harwood , Amir Dezfouli , Iadine Chades , Conrad Sanderson