Related papers: Discriminating sample groups with multi-way data

Multiway sparse distance weighted discrimination

Modern data often take the form of a multiway array. However, most classification methods are designed for vectors, i.e., 1-way arrays. Distance weighted discrimination (DWD) is a popular high-dimensional classification method that has been…

Methodology · Statistics 2021-10-12 Bin Guo , Lynn E. Eberly , Pierre-Gilles Henry , Christophe Lenglet , Eric F. Lock

Bayesian Distance Weighted Discrimination

Distance weighted discrimination (DWD) is a linear discrimination method that is particularly well-suited for classification tasks with high-dimensional data. The DWD coefficients minimize an intuitive objective function, which can solved…

Methodology · Statistics 2020-10-08 Eric F. Lock

Distance-weighted Support Vector Machine

A novel linear classification method that possesses the merits of both the Support Vector Machine (SVM) and the Distance-weighted Discrimination (DWD) is proposed in this article. The proposed Distance-weighted Support Vector Machine method…

Machine Learning · Statistics 2015-10-09 Xingye Qiao , Lingsong Zhang

Flexible High-dimensional Classification Machines and Their Asymptotic Properties

Classification is an important topic in statistics and machine learning with great potential in many real applications. In this paper, we investigate two popular large margin classification methods, Support Vector Machine (SVM) and Distance…

Machine Learning · Statistics 2013-10-14 Xingye Qiao , Lingsong Zhang

"Virus hunting" using radial distance weighted discrimination

Motivated by the challenge of using DNA-seq data to identify viruses in human blood samples, we propose a novel classification algorithm called "Radial Distance Weighted Discrimination" (or Radial DWD). This classifier is designed for…

Applications · Statistics 2016-02-10 Jie Xiong , D. P. Dittmer , J. S. Marron

Fast algorithms for large scale generalized distance weighted discrimination

High dimension low sample size statistical analysis is important in a wide range of applications. In such situations, the highly appealing discrimination method, support vector machine, can be improved to alleviate data piling at the…

Optimization and Control · Mathematics 2017-08-18 Xin Yee Lam , J. S. Marron , Defeng Sun , Kim-Chuan Toh

Flexible Bayesian Support Vector Machines for Brain Network-based Classification

Objective: Brain networks have gained increasing recognition as potential biomarkers in mental health studies, but there are limited approaches that can leverage complex brain networks for accurate classification. Our goal is to develop a…

Methodology · Statistics 2022-05-25 Jin Ming , Suprateek Kundu

A Foray into Parallel Optimisation Algorithms for High Dimension Low Sample Space Generalized Distance Weighted Discrimination problems

In many modern data sets, High dimension low sample size (HDLSS) data is prevalent in many fields of studies. There has been an increased focus recently on using machine learning and statistical methods to mine valuable information out of…

Optimization and Control · Mathematics 2023-05-23 Srivathsan Amruth , Xin Yee Lam

Support vector machine for functional data classification

In many applications, input data are sampled functions taking their values in infinite dimensional spaces rather than standard vectors. This fact has complex consequences on data analysis algorithms that motivate modifications of them. In…

Statistics Theory · Mathematics 2007-05-23 Fabrice Rossi , Nathalie Villa

Sequential Linear Discriminant Analysis in High Dimensions Using Individual Discriminant Functions

High dimensional classification has been highlighted for last two decades and much research has been conducted in order to circumvent challenges encountered in high dimensions. While existing methods have focused mainly on developing…

Methodology · Statistics 2022-11-16 Seungchul Baek

dCAM: Dimension-wise Class Activation Map for Explaining Multivariate Data Series Classification

Data series classification is an important and challenging problem in data science. Explaining the classification decisions by finding the discriminant parts of the input that led the algorithm to some decisions is a real need in many…

Machine Learning · Computer Science 2022-07-26 Paul Boniol , Mohammed Meftah , Emmanuel Remy , Themis Palpanas

Convex method for selection of fixed effects in high-dimensional linear mixed models

Analysis of high-dimensional data is currently a popular field of research, thanks to many applications e.g. in genetics (DNA data in genomewide association studies), spectrometry or web analysis. At the same time, the type of problems that…

Methodology · Statistics 2018-05-25 Jozef Jakubik

Multilevel Weighted Support Vector Machine for Classification on Healthcare Data with Missing Values

This work is motivated by the needs of predictive analytics on healthcare data as represented by Electronic Medical Records. Such data is invariably problematic: noisy, with missing entries, with imbalance in classes of interests, leading…

Machine Learning · Statistics 2016-09-28 Talayeh Razzaghi , Oleg Roderick , Ilya Safro , Nicholas Marko

Linear Discriminant Analysis with High-dimensional Mixed Variables

Datasets containing both categorical and continuous variables are frequently encountered in many areas, and with the rapid development of modern measurement technologies, the dimensions of these variables can be very high. Despite the…

Methodology · Statistics 2024-01-03 Binyan Jiang , Chenlei Leng , Cheng Wang , Zhongqing Yang , Xinyang Yu

Diagonal Discriminant Analysis with Feature Selection for High Dimensional Data

We introduce a new method of performing high dimensional discriminant analysis, which we call multiDA. We achieve this by constructing a hybrid model that seamlessly integrates a multiclass diagonal discriminant analysis model and feature…

Machine Learning · Statistics 2018-07-05 Sarah Elizabeth Romanes , John Thomas Ormerod , Jean YH Yang

Joint Dimensionality Reduction for Separable Embedding Estimation

Low-dimensional embeddings for data from disparate sources play critical roles in multi-modal machine learning, multimedia information retrieval, and bioinformatics. In this paper, we propose a supervised dimensionality reduction method…

Machine Learning · Computer Science 2021-01-15 Yanjun Li , Bihan Wen , Hao Cheng , Yoram Bresler

Speech Recognition: Increasing Efficiency of Support Vector Machines

With the advancement of communication and security technologies, it has become crucial to have robustness of embedded biometric systems. This paper presents the realization of such technologies which demands reliable and error-free…

Computer Vision and Pattern Recognition · Computer Science 2012-04-20 Aamir Khan , Muhammad Farhan , Asar Ali

Data-Driven Subgroup Identification for Linear Regression

Medical studies frequently require to extract the relationship between each covariate and the outcome with statistical confidence measures. To do this, simple parametric models are frequently used (e.g. coefficients of linear regression)…

Machine Learning · Computer Science 2023-05-02 Zachary Izzo , Ruishan Liu , James Zou

The classification for High-dimension low-sample size data

Huge amount of applications in various fields, such as gene expression analysis or computer vision, undergo data sets with high-dimensional low-sample-size (HDLSS), which has putted forward great challenges for standard statistical and…

Machine Learning · Computer Science 2022-06-07 Liran Shen , Meng Joo Er , Qingbo Yin

Linear classification methods for multivariate repeated measures data -- a simulation study

Researchers in the behavioral and social sciences use linear discriminant analysis (LDA) for predictions of group membership (classification) and for identifying the variables most relevant to group separation among a set of continuous…

Methodology · Statistics 2025-05-28 Ricarda Graf , Marina Zeldovich , Sarah Friedrich