Related papers: Visualizing Count Data Regressions Using Rootogram…

200 papers

Count data are common in medical research. When these data have more zeros than expected by the most used count distributions, it is common to employ a zero-inflated regression model. However, the interpretability of these models is much…

Methodology · Statistics 2025-09-30 Gustavo H. A. Pereira, Jeremias Leão, Manoel Santos-Neto, Jianwen Cai

'Optimal cutpoints' for binary classification tasks are often established by testing which cutpoint yields the best discrimination, for example the Youden index, in a specific sample. This results in 'optimal' cutpoints that are highly…

Computation · Statistics 2020-02-24 Christian Thiele, Gerrit Hirschfeld

In this paper, we examine roots of graph polynomials where those roots can be considered as structural graph measures. More precisely, we prove analytical results for the roots of certain modified graph polynomials and also discuss…

Combinatorics · Mathematics 2024-11-11 Simon Brezovnik, Matthias Dehmer, Niko Tratnik, Petra Žigert Pleteršek

A regression method for proportional, or fractional, data with mixed effects is outlined, designed for analysis of datasets in which the outcomes have substantial weight at the bounds. In such cases a normal approximation is particularly…

Methodology · Statistics 2018-05-23 Colman Humphrey, Dan Swingley

Count data take on non-negative integer values and are challenging to properly analyze using standard linear-Gaussian methods such as linear regression and principal components analysis. Generalized linear models enable direct modeling of…

Methodology · Statistics 2020-01-14 F. William Townes

Although models for count data with over-dispersion have been widely considered in the literature, models for under-dispersion -- the opposite phenomenon -- have received less attention as it is only relatively common in particular research…

The purpose of this article is to introduce the reader to the ROOT data analysis software package, and demonstrate how it may be used to complement one's accident reconstruction analyses.

Popular Physics · Physics 2014-04-09 Bob Scurlock

This paper proposes a new generalized linear model with the fractional binomial distribution. Zero-inflated Poisson/negative binomial distributions are used for count data with many zeros. To analyze the association of such a count variable…

Methodology · Statistics 2025-08-01 Jeonghwa Lee, Chloe Breece

The root is an important organ of a plant since it is responsible for water and nutrient uptake. Analyzing and modelling variabilities in the geometry and topology of roots can help in assessing the plant's health, understanding its growth…

Graphics · Computer Science 2021-01-26 Guan Wang, Hamid Laga, Jinyuan Jia, Stanley J. Miklavcic, Anuj Srivastava

Count-valued autoregressions are widely used to analyse time-series of reported infectious-disease cases because of their close connection with discrete-time transmission models. However, when such models are applied directly to…

Applications · Statistics 2025-09-16 Justin J. Slater, Sindi Bebeziqi

Tensor-on-tensor (TOT) regression is an important tool for the analysis of tensor data, aiming to predict a set of response tensors from a corresponding set of predictor tensors. However, standard TOT regression is sensitive to outliers,…

Methodology · Statistics 2026-03-30 Mehdi Hirari, Fabio Centofanti, Mia Hubert, Stefan Van Aelst

Poisson regression is a popular tool for modeling count data and is applied in a vast array of applications from the social to the physical sciences and beyond. Real data, however, are often over- or under-dispersed and, thus, not conducive…

Applications · Statistics 2010-11-10 Kimberly F. Sellers, Galit Shmueli

Histograms provide a powerful means of summarizing large data sets by representing their distribution in a compact, binned form. The HistogramTools R package enhances R built-in histogram functionality, offering advanced methods for…

Databases · Computer Science 2025-04-02 Shubham Malhotra

Traditional boxplots are widely used for summarizing and visualizing the distribution of numerical data, yet they exhibit significant limitations when applied to skewed or heavy-tailed distributions, often leading to misclassification of…

Methodology · Statistics 2025-11-24 Mustafa Cavus

For the task of relevance analysis, the conventional Tukey's test may be applied to the set of all pairwise comparisons. However, few studies have discussed both nonparametric k-sample comparisons and relevance analysis in high…

Methodology · Statistics 2021-07-05 Xiaoping Shi

Due to a wide spectrum of applications in the real world, such as security, financial surveillance, and health risk, various deep anomaly detection models have been proposed and achieved state-of-the-art performance. However, besides being…

Machine Learning · Computer Science 2023-09-08 Xiao Han, Lu Zhang, Yongkai Wu, Shuhan Yuan

The $k$-core decomposition is a widely studied summary statistic that describes a graph's global connectivity structure. In this paper, we move beyond using $k$-core decomposition as a tool to summarize a graph and propose using $k$-core…

Statistics Theory · Mathematics 2016-11-29 Vishesh Karwa, Michael J. Pelsmajer, Sonja Petrović, Despina Stasi, Dane Wilburne

Topographs, introduced by Conway in 1997, are infinite trivalent planar trees used to visualize the values of binary quadratic forms. In this work, we study series whose terms are indexed by the vertices of a topograph and show that they…

Number Theory · Mathematics 2025-10-03 Nikita Kalinin

Regression evaluation has been performed for decades. Some metrics have been identified as robust against shifting and scaling of the data, but accounting for the different distributions of data is much more difficult (imbalance…

Machine Learning · Computer Science 2020-09-14 Mario Michael Krell, Bilal Wehbe

Variable trees are a new method for the exploration of discrete multivariate data. They display nested subsets and corresponding frequencies and percentages. Manual calculation of these quantities can be laborious, especially when there are…

Computation · Statistics 2021-02-08 Nick Barrowman, Richard J. Webster