中文
相关论文

相关论文: Selection Criterion for Log-Linear Models Using St…

200 篇论文

Although the log-likelihood is widely used in model selection, the log-likelihood ratio has had few applications in this area. We develop a log-likelihood ratio based method for selecting regression models by focusing on the set of models…

统计方法学 · 统计学 2021-09-28 Min Tsao

We study the law of the iterated logarithm (LIL) for the maximum likelihood estimation of the parameters (as a convex optimization problem) in the generalized linear models with independent or weakly dependent ($\rho$-mixing, $m$-dependent)…

统计理论 · 数学 2020-04-28 Xiaowei Yang , Shuang Song , Huiming Zhang

Log-linear models are a family of probability distributions which capture relationships between variables. They have been proven useful in a wide variety of fields such as epidemiology, economics and sociology. The interest in using these…

机器学习 · 计算机科学 2022-12-29 Jan Strappa , Facundo Bromberg

Statistical learning theory chiefly studies restricted hypothesis classes, particularly those with finite Vapnik-Chervonenkis (VC) dimension. The fundamental quantity of interest is the sample complexity: the number of samples required to…

机器学习 · 计算机科学 2008-07-10 David Soloveichik

Stochastic processes offer a flexible mathematical formalism to model and reason about systems. Most analysis tools, however, start from the premises that models are fully specified, so that any parameters controlling the system's dynamics…

系统与控制 · 计算机科学 2017-01-11 Luca Bortolussi , Guido Sanguinetti

Model selection is a central task in statistics, but standard methods are not robust in misspecified settings where the true data-generating process (DGP) is not in the set of candidate models. The key limitation is that existing methods --…

统计方法学 · 统计学 2026-03-10 Jongwoo Choi , Neil A. Spencer , Jeffrey W. Miller

Lasso and other regularization procedures are attractive methods for variable selection, subject to a proper choice of shrinkage parameter. Given a set of potential subsets produced by a regularization algorithm, a consistent model…

统计方法学 · 统计学 2014-02-26 Minh-Ngoc Tran

Designing models that are both expressive and preserve known invariances of tasks is an increasingly hard problem. Existing solutions tradeoff invariance for computational or memory resources. In this work, we show how to leverage…

机器学习 · 计算机科学 2023-09-29 Leonardo Cotta , Gal Yehuda , Assaf Schuster , Chris J. Maddison

The stochastic block model (SBM) provides a popular framework for modeling community structures in networks. However, more attention has been devoted to problems concerning estimating the latent node labels and the model parameters than the…

统计理论 · 数学 2016-03-02 Y. X. Rachel Wang , Peter J. Bickel

This manuscript studies statistical properties of linear classifiers obtained through minimization of an unregularized convex risk over a finite sample. Although the results are explicitly finite-dimensional, inputs may be passed through…

机器学习 · 计算机科学 2012-06-15 Matus Telgarsky

This paper takes a computational learning theory approach to a problem of linear systems identification. It is assumed that input signals have only a finite number k of frequency components, and systems to be identified have dimension no…

最优化与控制 · 数学 2007-05-23 Pirkko Kuusela , Daniel Ocone , Eduardo D. Sontag

Log-linear models are often used to estimate the size of a closed population using capture-recapture data. When capture probabilities are related to auxiliary covariates, one may select a separate model based on each of several post-strata.…

统计方法学 · 统计学 2014-06-11 Zachary T. Kurtz

Variable selection, also known as feature selection in machine learning, plays an important role in modeling high dimensional data and is key to data-driven scientific discoveries. We consider here the problem of detecting influential…

统计方法学 · 统计学 2014-09-24 Bo Jiang , Jun S. Liu

We present a comprehensive study of graphical log-linear models for contingency tables. High dimensional contingency tables arise in many areas such as computational biology, collection of survey and census data and others. Analysis of…

统计方法学 · 统计学 2016-03-15 Niharika Gauraha

A key issue in statistics and machine learning is to automatically select the "right" model complexity, e.g., the number of neighbors to be averaged over in k nearest neighbor (kNN) regression or the polynomial degree in regression with…

机器学习 · 计算机科学 2010-10-04 Marcus Hutter , Minh-Ngoc Tran

In this paper, we consider the problem of minimizing a linear functional subject to uncertain linear and bilinear matrix inequalities, which depend in a possibly nonlinear way on a vector of uncertain parameters. Motivated by recent results…

In epidemiological studies, the capture-recapture (CRC) method is a powerful tool that can be used to estimate the number of diseased cases or potentially disease prevalence based on data from overlapping surveillance systems. Estimators…

应用统计 · 统计学 2023-06-21 Yuzi Zhang , Lin Ge , Lance A. Waller , Robert H. Lyles

Statistical learning theory is the foundation of machine learning, providing theoretical bounds for the risk of models learned from a (single) training set, assumed to issue from an unknown probability distribution. In actual deployment,…

机器学习 · 计算机科学 2024-10-25 Michele Caprio , Maryam Sultana , Eleni Elia , Fabio Cuzzolin

We define the group-lasso estimator for the natural parameters of the exponential families of distributions representing hierarchical log-linear models under multinomial sampling scheme. Such estimator arises as the solution of a convex…

统计理论 · 数学 2012-07-31 Yuval Nardi , Alessandro Rinaldo

Analysis of high-dimensional data is currently a popular field of research, thanks to many applications e.g. in genetics (DNA data in genomewide association studies), spectrometry or web analysis. At the same time, the type of problems that…

统计方法学 · 统计学 2018-05-25 Jozef Jakubik
‹ 上一页 1 2 3 10 下一页 ›