Related papers: Selection Criterion for Log-Linear Models Using St…

200 papers

Although the log-likelihood is widely used in model selection, the log-likelihood ratio has had few applications in this area. We develop a log-likelihood ratio based method for selecting regression models by focusing on the set of models…

Methodology · Statistics 2021-09-28 Min Tsao

We study the law of the iterated logarithm (LIL) for the maximum likelihood estimation of the parameters (as a convex optimization problem) in the generalized linear models with independent or weakly dependent ($\rho$-mixing, $m$-dependent)…

Statistics Theory · Mathematics 2020-04-28 Xiaowei Yang, Shuang Song, Huiming Zhang

Log-linear models are a family of probability distributions that capture relationships between variables. They have proven useful in a wide variety of fields, such as epidemiology, economics, and sociology. The interest in using these…

Machine Learning · Computer Science 2022-12-29 Jan Strappa, Facundo Bromberg

Statistical learning theory chiefly studies restricted hypothesis classes, particularly those with finite Vapnik-Chervonenkis (VC) dimension. The fundamental quantity of interest is the sample complexity: the number of samples required to…

Machine Learning · Computer Science 2008-07-10 David Soloveichik

Stochastic processes offer a flexible mathematical formalism to model and reason about systems. Most analysis tools, however, start from the premise that models are fully specified, so that any parameters controlling the system's dynamics…

Systems and Control · Computer Science 2017-01-11 Luca Bortolussi, Guido Sanguinetti

Model selection is a central task in statistics, but standard methods are not robust in misspecified settings where the true data-generating process (DGP) is not in the set of candidate models. The key limitation is that existing methods --…

Methodology · Statistics 2026-03-10 Jongwoo Choi, Neil A. Spencer, Jeffrey W. Miller

Lasso and other regularization procedures are attractive methods for variable selection, subject to a proper choice of shrinkage parameter. Given a set of potential subsets produced by a regularization algorithm, a consistent model…

Methodology · Statistics 2014-02-26 Minh-Ngoc Tran

Designing models that are both expressive and preserve known invariances of tasks is an increasingly hard problem. Existing solutions trade off invariance for computational or memory resources. In this work, we show how to leverage…

Machine Learning · Computer Science 2023-09-29 Leonardo Cotta, Gal Yehuda, Assaf Schuster, Chris J. Maddison

The stochastic block model (SBM) provides a popular framework for modeling community structures in networks. However, more attention has been devoted to problems concerning estimating the latent node labels and the model parameters than the…

Statistics Theory · Mathematics 2016-03-02 Y. X. Rachel Wang, Peter J. Bickel

This manuscript studies statistical properties of linear classifiers obtained through minimization of an unregularized convex risk over a finite sample. Although the results are explicitly finite-dimensional, inputs may be passed through…

Machine Learning · Computer Science 2012-06-15 Matus Telgarsky

This paper takes a computational learning theory approach to a problem of linear systems identification. It is assumed that input signals have only a finite number k of frequency components, and systems to be identified have dimension no…

Optimization and Control · Mathematics 2007-05-23 Pirkko Kuusela, Daniel Ocone, Eduardo D. Sontag

Log-linear models are often used to estimate the size of a closed population using capture-recapture data. When capture probabilities are related to auxiliary covariates, one may select a separate model based on each of several post-strata.…

Methodology · Statistics 2014-06-11 Zachary T. Kurtz

Variable selection, also known as feature selection in machine learning, plays an important role in modeling high dimensional data and is key to data-driven scientific discoveries. We consider here the problem of detecting influential…

Methodology · Statistics 2014-09-24 Bo Jiang, Jun S. Liu

We present a comprehensive study of graphical log-linear models for contingency tables. High-dimensional contingency tables arise in many areas, such as computational biology and the collection of survey and census data. Analysis of…

Methodology · Statistics 2016-03-15 Niharika Gauraha

A key issue in statistics and machine learning is to automatically select the "right" model complexity, e.g., the number of neighbors to be averaged over in k nearest neighbor (kNN) regression or the polynomial degree in regression with…

Machine Learning · Computer Science 2010-10-04 Marcus Hutter, Minh-Ngoc Tran

In this paper, we consider the problem of minimizing a linear functional subject to uncertain linear and bilinear matrix inequalities, which depend in a possibly nonlinear way on a vector of uncertain parameters. Motivated by recent results…

Optimization and Control · Mathematics 2015-05-29 Mohammadreza Chamanbaz, Fabrizio Dabbene, Roberto Tempo, Venkatakrishnan Venkataramanan, Qing-Guo Wang

In epidemiological studies, the capture-recapture (CRC) method is a powerful tool that can be used to estimate the number of diseased cases or potentially disease prevalence based on data from overlapping surveillance systems. Estimators…

Applications · Statistics 2023-06-21 Yuzi Zhang, Lin Ge, Lance A. Waller, Robert H. Lyles

Statistical learning theory is the foundation of machine learning, providing theoretical bounds for the risk of models learned from a (single) training set, assumed to issue from an unknown probability distribution. In actual deployment,…

Machine Learning · Computer Science 2024-10-25 Michele Caprio, Maryam Sultana, Eleni Elia, Fabio Cuzzolin

We define the group-lasso estimator for the natural parameters of the exponential families of distributions representing hierarchical log-linear models under a multinomial sampling scheme. Such an estimator arises as the solution of a convex…

Statistics Theory · Mathematics 2012-07-31 Yuval Nardi, Alessandro Rinaldo

Analysis of high-dimensional data is currently a popular field of research, thanks to many applications, e.g., in genetics (DNA data in genome-wide association studies), spectrometry, or web analysis. At the same time, the type of problems that…

Methodology · Statistics 2018-05-25 Jozef Jakubik