Related papers: Selection Criterion for Log-Linear Models Using St…

Regression model selection via log-likelihood ratio and constrained minimum criterion

Although the log-likelihood is widely used in model selection, the log-likelihood ratio has had few applications in this area. We develop a log-likelihood ratio based method for selecting regression models by focusing on the set of models…

Methodology · Statistics 2021-09-28 Min Tsao

Law of the Iterated Logarithm and Model Selection Consistency for GLMs with Independent and Dependent Responses

We study the law of the iterated logarithm (LIL) for the maximum likelihood estimation of the parameters (as a convex optimization problem) in the generalized linear models with independent or weakly dependent ($\rho$-mixing, $m$-dependent)…

Statistics Theory · Mathematics 2020-04-28 Xiaowei Yang, Shuang Song, Huiming Zhang

Efficient comparison of independence structures of log-linear models

Log-linear models are a family of probability distributions which capture relationships between variables. They have been proven useful in a wide variety of fields such as epidemiology, economics and sociology. The interest in using these…

Machine Learning · Computer Science 2022-12-29 Jan Strappa, Facundo Bromberg

Statistical Learning of Arbitrary Computable Classifiers

Statistical learning theory chiefly studies restricted hypothesis classes, particularly those with finite Vapnik-Chervonenkis (VC) dimension. The fundamental quantity of interest is the sample complexity: the number of samples required to…

Machine Learning · Computer Science 2008-07-10 David Soloveichik

Learning and Designing Stochastic Processes from Logical Constraints

Stochastic processes offer a flexible mathematical formalism to model and reason about systems. Most analysis tools, however, start from the premises that models are fully specified, so that any parameters controlling the system's dynamics…

Systems and Control · Computer Science 2017-01-11 Luca Bortolussi, Guido Sanguinetti

Robust model selection using likelihood as data

Model selection is a central task in statistics, but standard methods are not robust in misspecified settings where the true data-generating process (DGP) is not in the set of candidate models. The key limitation is that existing methods --…

Methodology · Statistics 2026-03-10 Jongwoo Choi, Neil A. Spencer, Jeffrey W. Miller

The Loss Rank Criterion for Variable Selection in Linear Regression Analysis

Lasso and other regularization procedures are attractive methods for variable selection, subject to a proper choice of shrinkage parameter. Given a set of potential subsets produced by a regularization algorithm, a consistent model…

Methodology · Statistics 2014-02-26 Minh-Ngoc Tran

Probabilistic Invariant Learning with Randomized Linear Classifiers

Designing models that are both expressive and preserve known invariances of tasks is an increasingly hard problem. Existing solutions trade off invariance for computational or memory resources. In this work, we show how to leverage…

Machine Learning · Computer Science 2023-09-29 Leonardo Cotta, Gal Yehuda, Assaf Schuster, Chris J. Maddison

Likelihood-based model selection for stochastic block models

The stochastic block model (SBM) provides a popular framework for modeling community structures in networks. However, more attention has been devoted to problems concerning estimating the latent node labels and the model parameters than the…

Statistics Theory · Mathematics 2016-03-02 Y. X. Rachel Wang, Peter J. Bickel

Statistical Consistency of Finite-dimensional Unregularized Linear Classification

This manuscript studies statistical properties of linear classifiers obtained through minimization of an unregularized convex risk over a finite sample. Although the results are explicitly finite-dimensional, inputs may be passed through…

Machine Learning · Computer Science 2012-06-15 Matus Telgarsky

Learning Complexity Dimensions for a Continuous-Time Control System

This paper takes a computational learning theory approach to a problem of linear systems identification. It is assumed that input signals have only a finite number k of frequency components, and systems to be identified have dimension no…

Optimization and Control · Mathematics 2007-05-23 Pirkko Kuusela, Daniel Ocone, Eduardo D. Sontag

Local Log-linear Models for Capture-Recapture

Log-linear models are often used to estimate the size of a closed population using capture-recapture data. When capture probabilities are related to auxiliary covariates, one may select a separate model based on each of several post-strata.…

Methodology · Statistics 2014-06-11 Zachary T. Kurtz

Variable selection for general index models via sliced inverse regression

Variable selection, also known as feature selection in machine learning, plays an important role in modeling high dimensional data and is key to data-driven scientific discoveries. We consider here the problem of detecting influential…

Methodology · Statistics 2014-09-24 Bo Jiang, Jun S. Liu

Graphical Log-linear Models: Fundamental Concepts and Applications

We present a comprehensive study of graphical log-linear models for contingency tables. High dimensional contingency tables arise in many areas such as computational biology, collection of survey and census data and others. Analysis of…

Methodology · Statistics 2016-03-15 Niharika Gauraha

Model Selection with the Loss Rank Principle

A key issue in statistics and machine learning is to automatically select the "right" model complexity, e.g., the number of neighbors to be averaged over in k nearest neighbor (kNN) regression or the polynomial degree in regression with…

Machine Learning · Computer Science 2010-10-04 Marcus Hutter, Minh-Ngoc Tran

A Statistical Learning Theory Approach for Uncertain Linear and Bilinear Matrix Inequalities

In this paper, we consider the problem of minimizing a linear functional subject to uncertain linear and bilinear matrix inequalities, which depend in a possibly nonlinear way on a vector of uncertain parameters. Motivated by recent results…

Optimization and Control · Mathematics 2015-05-29 Mohammadreza Chamanbaz, Fabrizio Dabbene, Roberto Tempo, Venkatakrishnan Venkataramanan, Qing-Guo Wang

On some pitfalls of the log-linear modeling framework for capture-recapture studies in disease surveillance

In epidemiological studies, the capture-recapture (CRC) method is a powerful tool that can be used to estimate the number of diseased cases or potentially disease prevalence based on data from overlapping surveillance systems. Estimators…

Applications · Statistics 2023-06-21 Yuzi Zhang, Lin Ge, Lance A. Waller, Robert H. Lyles

Credal Learning Theory

Statistical learning theory is the foundation of machine learning, providing theoretical bounds for the risk of models learned from a (single) training set, assumed to issue from an unknown probability distribution. In actual deployment,…

Machine Learning · Computer Science 2024-10-25 Michele Caprio, Maryam Sultana, Eleni Elia, Fabio Cuzzolin

The log-linear group-lasso estimator and its asymptotic properties

We define the group-lasso estimator for the natural parameters of the exponential families of distributions representing hierarchical log-linear models under multinomial sampling scheme. Such estimator arises as the solution of a convex…

Statistics Theory · Mathematics 2012-07-31 Yuval Nardi, Alessandro Rinaldo

Convex method for selection of fixed effects in high-dimensional linear mixed models

Analysis of high-dimensional data is currently a popular field of research, thanks to many applications e.g. in genetics (DNA data in genomewide association studies), spectrometry or web analysis. At the same time, the type of problems that…

Methodology · Statistics 2018-05-25 Jozef Jakubik