Related papers: Addressing the Impact of Data Truncation and Param…

On the Selection of Loss Severity Distributions to Model Operational Risk

Accurate modeling of operational risk is important for a bank and the finance industry as a whole to prepare for potentially catastrophic losses. One approach to modeling operational is the loss distribution approach, which requires a bank…

Risk Management · Quantitative Finance 2021-07-09 Daniel Hadley , Harry Joe , Natalia Nolde

Estimation of Operational Risk Capital Charge under Parameter Uncertainty

Many banks adopt the Loss Distribution Approach to quantify the operational risk capital charge under Basel II requirements. It is common practice to estimate the capital charge using the 0.999 quantile of the annual loss distribution,…

Risk Management · Quantitative Finance 2009-04-14 Pavel V. Shevchenko

Modeling operational risk data reported above a time-varying threshold

Typically, operational risk losses are reported above a threshold. Fitting data reported above a constant threshold is a well known and studied problem. However, in practice, the losses are scaled for business and other factors before the…

Risk Management · Quantitative Finance 2009-07-31 Pavel V. Shevchenko , Grigory Temnov

Testing for an ignorable sampling bias under random double truncation

In clinical and epidemiological research doubly truncated data often appear. This is the case, for instance, when the data registry is formed by interval sampling. Double truncation generally induces a sampling bias on the target variable,…

Methodology · Statistics 2023-01-11 Jacobo de Uña-Álvarez

Coping with Selection Effects: A Primer on Regression with Truncated Data

The finite sensitivity of instruments or detection methods means that data sets in many areas of astronomy, for example cosmological or exoplanet surveys, are necessarily systematically incomplete. Such data sets, where the population being…

Instrumentation and Methods for Astrophysics · Physics 2020-10-14 Adam B. Mantz

Distribution Shift in Missing Data Imputation: A Risk-Based Perspective and Importance-Weighted Correction under MAR

Missing data imputation, where a model is trained on observed data to estimate unobserved values, is a fundamental problem in machine learning. In this paper, we rigorously formulate imputation model learning as a mean-squared error risk…

Machine Learning · Statistics 2026-05-14 Luke Shannon , Song Liu , Katarzyna Reluga

Investigating Targeting Strategies and Truncation in TMLE for the Average Treatment Effect under Practical Positivity Violations

Estimating average treatment effects from observational data is challenging under practical violations of the positivity assumption. Targeted Maximum Likelihood Estimators (TMLEs) are widely used because of their double robustness and…

Methodology · Statistics 2026-04-28 Yichen Xu , Susan Gruber , Mark J. van der Laan

Truncating the Exponential with a Uniform Distribution

For a sample of Exponentially distributed durations we aim at point estimation and a confidence interval for its parameter. A duration is only observed if it has ended within a certain time interval, determined by a Uniform distribution.…

Methodology · Statistics 2021-10-19 Rafael Weißbach , Dominik Wied

Model Uncertainty and Selection of Risk Models for Left-Truncated and Right-Censored Loss Data

Insurance loss data are usually in the form of left-truncation and right-censoring due to deductibles and policy limits respectively. This paper investigates the model uncertainty and selection procedure when various parametric models are…

Methodology · Statistics 2024-02-01 Qian Zhao , Sahadeb Upretee , Daoping Yu

A note on stratification errors in the analysis of clinical trials

Stratification in both the design and analysis of randomized clinical trials is common. Despite features in automated randomization systems to re-confirm the stratifying variables, incorrect values of these variables may be entered. These…

Methodology · Statistics 2023-07-24 Neal Thomas

Non-standard conditionally specified models for non-ignorable missing data

Data analyses typically rely upon assumptions about missingness mechanisms that lead to observed versus missing data. When the data are missing not at random, direct assumptions about the missingness mechanism, and indirect assumptions…

Methodology · Statistics 2016-03-22 Alexander M Franks , Edoardo M Airoldi , Donald B Rubin

Off-Policy Evaluation Under Nonignorable Missing Data

Off-Policy Evaluation (OPE) aims to estimate the value of a target policy using offline data collected from potentially different policies. In real-world applications, however, logged data often suffers from missingness. While OPE has been…

Machine Learning · Statistics 2025-07-10 Han Wang , Yang Xu , Wenbin Lu , Rui Song

On Reliability of Stochastic Networks

In recent times we hear increasingly often about cyber attacks on various commercial and strategic sites that manage to escape any defense. In this article, we model such attacks on networks via stochastic processes and predict the time of…

Probability · Mathematics 2019-01-23 Jewgeni H. Dshalalow , Ryan T. White

Estimation and imputation in Probabilistic Principal Component Analysis with Missing Not At Random data

Missing Not At Random (MNAR) values lead to significant biases in the data, since the probability of missingness depends on the unobserved values.They are ''not ignorable'' in the sense that they often require defining a model for the…

Statistics Theory · Mathematics 2020-06-11 Aude Sportisse , Claire Boyer , Julie Josse

Parameter Estimation Through Ignorance

Dynamical modelling lies at the heart of our understanding of physical systems. Its role in science is deeper than mere operational forecasting, in that it allows us to evaluate the adequacy of the mathematical structure of our models.…

Data Analysis, Statistics and Probability · Physics 2015-06-05 Hailiang Du , Leonard A. Smith

Parameter uncertainties for imperfect surrogate models in the low-noise regime

Bayesian regression determines model parameters by minimizing the expected loss, an upper bound to the true generalization error. However, the loss ignores misspecification, where models are imperfect. Parameter uncertainties from Bayesian…

Machine Learning · Statistics 2024-11-07 Thomas D Swinburne , Danny Perez

Shrinkage Estimation of Network Spillovers with Factor Structured Errors

This paper explores the estimation of a panel data model with cross-sectional interaction that is flexible both in its approach to specifying the network of connections between cross-sectional units, and in controlling for unobserved…

Econometrics · Economics 2021-11-23 Ayden Higgins , Federico Martellosio

Estimating survival parameters under conditionally independent left truncation

Databases derived from electronic health records (EHRs) are commonly subject to left truncation, a type of selection bias induced due to patients needing to survive long enough to satisfy certain entry criteria. Standard methods to adjust…

Methodology · Statistics 2022-03-01 Arjun Sondhi

Bayesian Inference for Left-Truncated Log-Logistic Distributions for Time-to-event Data Analysis

Parameter estimation is a foundational step in statistical modeling, enabling us to extract knowledge from data and apply it effectively. Bayesian estimation of parameters incorporates prior beliefs with observed data to infer distribution…

Methodology · Statistics 2025-06-24 Fahad Mostafa , Md Rejuan Haque , Md Mostafijur Rahman , Farzana Nasrin

The Quantification of Operational Risk using Internal Data, Relevant External Data and Expert Opinions

To quantify an operational risk capital charge under Basel II, many banks adopt a Loss Distribution Approach. Under this approach, quantification of the frequency and severity distributions of operational risk involves the bank's internal…

Risk Management · Quantitative Finance 2009-04-09 Dominik D. Lambrigger , Pavel V. Shevchenko , Mario V. Wüthrich