Related papers: Classification of arrayCGH data using a fused SVM

A Fast and Flexible Method for the Segmentation of aCGH Data

Motivation: Array Comparative Genomic Hybridization (aCGH) is used to scan the entire genome for variations in DNA copy number. A central task in the analysis of aCGH data is the segmentation into groups of probes sharing the same DNA copy…

Quantitative Methods · Quantitative Biology 2008-04-29 Erez Ben-Yaacov , Yonina Eldar

Signal extraction and breakpoint identification for array CGH data using robust state space model

Array comparative genomic hybridization(CGH) is a high resolution technique to assess DNA copy number variation. Identifying breakpoints where copy number changes will enhance the understanding of the pathogenesis of human diseases, such as…

Applications · Statistics 2012-01-26 Bin Zhu , Jeremy M. G. Taylor , Peter X. -K. Song

Joint segmentation of many aCGH profiles using fast group LARS

Array-Based Comparative Genomic Hybridization (aCGH) is a method used to search for genomic regions with copy numbers variations. For a given aCGH profile, one challenge is to accurately segment it into regions of constant copy number.…

Quantitative Methods · Quantitative Biology 2009-10-08 Kevin Bleakley , Jean-Philippe Vert

Genomic Region Detection via Spatial Convex Clustering

Several modern genomic technologies, such as DNA-Methylation arrays, measure spatially registered probes that number in the hundreds of thousands across multiplechromosomes. The measured probes are by themselves less interesting…

Applications · Statistics 2016-11-16 John Nagorski , Genevera I. Allen

Spatial clustering of array CGH features in combination with hierarchical multiple testing

We propose a new approach for clustering DNA features using array CGH data from multiple tumor samples. We distinguish data-collapsing: joining contiguous DNA clones or probes with extremely similar data into regions, from clustering:…

Applications · Statistics 2010-12-21 Kyung In Kim , Etienne Roquain , Mark Van De Wiel

CGHTRIMMER: Discretizing noisy Array CGH Data

The development of cancer is largely driven by the gain or loss of subsets of the genome, promoting uncontrolled growth or disabling defenses against it. Identifying genomic regions whose DNA copy number deviates from the normal is…

Genomics · Quantitative Biology 2010-02-25 Charalampos E. Tsourakakis , David Tolliver , Maria A. Tsiarli , Stanley Shackney , Russell Schwartz

A hierarchical Bayesian model for inference of copy number variants and their association to gene expression

A number of statistical models have been successfully developed for the analysis of high-throughput data from a single source, but few methods are available for integrating data from different sources. Here we focus on integrating gene…

Applications · Statistics 2014-04-15 Alberto Cassese , Michele Guindani , Mahlet G. Tadesse , Francesco Falciani , Marina Vannucci

Cancer classification and pathway discovery using non-negative matrix factorization

Extracting genetic information from a full range of sequencing data is important for understanding diseases. We propose a novel method to effectively explore the landscape of genetic mutations and aggregate them to predict cancer type. We…

Genomics · Quantitative Biology 2018-10-10 Zexian Zeng , Andy Vo , Chengsheng Mao , Susan E Clare , Seema A Khan , Yuan Luo

Review on Feature Selection Techniques and the Impact of SVM for Cancer Classification using Gene Expression Profile

The DNA microarray technology has modernized the approach of biology research in such a way that scientists can now measure the expression levels of thousands of genes simultaneously in a single experiment. Gene expression profiles, which…

Computational Engineering, Finance, and Science · Computer Science 2011-09-07 G. Victo Sudha George , V. Cyril Raj

HybridRanker: Integrating network structure and disease knowledge to prioritize cancer candidate genes

One of the notable fields in studying the genetics of cancer is disease gene identification which affects disease treatment and drug discovery. Many researches have been done in this field. Genome-wide association studies (GWAS) are one of…

Computational Engineering, Finance, and Science · Computer Science 2016-04-27 Zahra Razaghi-Moghadama , Razieh Abdollahia , Sama Goliaeib , Morteza Ebrahimia

A Scalable Tool For Analyzing Genomic Variants Of Humans Using Knowledge Graphs and Machine Learning

The integration of knowledge graphs and graph machine learning (GML) in genomic data analysis offers several opportunities for understanding complex genetic relationships, especially at the RNA level. We present a comprehensive approach for…

Artificial Intelligence · Computer Science 2024-08-06 Shivika Prasanna , Ajay Kumar , Deepthi Rao , Eduardo Simoes , Praveen Rao

Gene selection for cancer classification using a hybrid of univariate and multivariate feature selection methods

Various approaches to gene selection for cancer classification based on microarray data can be found in the literature and they may be grouped into two categories: univariate methods and multivariate methods. Univariate methods look at each…

Quantitative Methods · Quantitative Biology 2015-06-18 Min Xu , Rudy Setiono

Supervised Convex Clustering

Clustering has long been a popular unsupervised learning approach to identify groups of similar objects and discover patterns from unlabeled data in many applications. Yet, coming up with meaningful interpretations of the estimated clusters…

Methodology · Statistics 2020-05-26 Minjie Wang , Tianyi Yao , Genevera I. Allen

An integrative sparse boosting analysis of cancer genomic commonality and difference

In cancer research, high-throughput profiling has been extensively conducted. In recent studies, the integrative analysis of data on multiple cancer patient groups/subgroups has been conducted. Such analysis has the potential to reveal the…

Methodology · Statistics 2022-12-01 Yifan Sun , Zhengyang Sun , Yu Jiang , Yang Li , Shuangge Ma

Supervised clustering of high dimensional data using regularized mixture modeling

Identifying relationships between molecular variations and their clinical presentations has been challenged by the heterogeneous causes of a disease. It is imperative to unveil the relationship between the high dimensional molecular…

Methodology · Statistics 2021-09-02 Wennan Chang , Changlin Wan , Yong Zang , Chi Zhang , Sha Cao

Improving Performance of a Group of Classification Algorithms Using Resampling and Feature Selection

In recent years the importance of finding a meaningful pattern from huge datasets has become more challenging. Data miners try to adopt innovative methods to face this problem by applying feature selection methods. In this paper we propose…

Machine Learning · Computer Science 2014-03-11 Mehdi Naseriparsa , Amir-masoud Bidgoli , Touraj Varaee

Semi-supervised Spectral Clustering for Classification

We propose a Classification Via Clustering (CVC) algorithm which enables existing clustering methods to be efficiently employed in classification problems. In CVC, training and test data are co-clustered and class-cluster distributions are…

Computer Vision and Pattern Recognition · Computer Science 2014-09-29 Arif Mahmood , Ajmal S. Mian

Supervised Bayesian joint graphical model for simultaneous network estimation and subgroup identification

Heterogeneity is a fundamental characteristic of cancer. To accommodate heterogeneity, subgroup identification has been extensively studied and broadly categorized into unsupervised and supervised analysis. Compared to unsupervised…

Methodology · Statistics 2026-02-25 Xing Qin , Xu Liu , Shuangge Ma , Mengyun Wu

Categorization of 33 computational methods to detect spatially variable genes from spatially resolved transcriptomics data

In the analysis of spatially resolved transcriptomics data, detecting spatially variable genes (SVGs) is crucial. Numerous computational methods exist, but varying SVG definitions and methodologies lead to incomparable results. We review 33…

Quantitative Methods · Quantitative Biology 2024-10-04 Guanao Yan , Shuo Harper Hua , Jingyi Jessica Li

Convolutional Support Vector Machine

The support vector machine (SVM) and deep learning (e.g., convolutional neural networks (CNNs)) are the two most famous algorithms in small and big data, respectively. Nonetheless, smaller datasets may be very important, costly, and not…

Machine Learning · Computer Science 2020-02-19 Wei-Chang Yeh