Related papers: Data Augmentation via Levy Processes
The classical notion of L\'evy process is generalized to one that takes as its values probabilities on a first order model equipped with a commutative semigroup. This is achieved by applying a convolution product on definable probabilities…
This paper proposes a method for the automatic creation of variables (in the case of regression) that complement the information contained in the initial input vector. The method works as a pre-processing step in which the continuous values…
Generalising well in supervised learning tasks relies on correctly extrapolating the training data to a large region of the input space. One way to achieve this is to constrain the predictions to be invariant to transformations on the input…
A wide range of machine learning algorithms iteratively add data to the training sample. Examples include semi-supervised learning, active learning, multi-armed bandits, and Bayesian optimization. We embed this kind of data addition into…
Based on the theory of independently scattered random measures, we introduce a natural generalisation of Gaussian space-time white noise to a Levy-type setting, which we call Levy-valued random measures. We determine the subclass of…
There have been many recent advances in the structure and measurement of distributed language models: those that map from words to a vector-space that is rich in information about word choice and composition. This vector-space is the…
Levy processes, which have stationary independent increments, are ideal for modelling the various types of noise that can arise in communication channels. If a Levy process admits exponential moments, then there exists a parametric family…
We present an exact sampling method for the first passage event of a Levy process. The idea is to embed the process into another one whose first passage event can be sampled exactly, and then recover the part belonging to the former from…
Fine-tuning is becoming widely used for leveraging the power of pre-trained foundation models in new downstream tasks. While there are many successes of fine-tuning on various tasks, recent studies have observed challenges in the…
Generative classifiers offer potential advantages over their discriminative counterparts, namely in the areas of data efficiency, robustness to data shift and adversarial examples, and zero-shot learning (Ng and Jordan,2002; Yogatama et…
Two different ways of trimming the sample path of a stochastic process in D[0, 1]: global ("trim as you go") trimming and record time ("lookback") trimming are analysed to find conditions for the corresponding operators to be continuous…
We derive factorization identities for a class of preemptive-resume queueing systems, with batch arrivals and catastrophes that, whenever they occur, eliminate multiple customers present in the system. These processes are quite general, as…
Gaussian process-based latent variable models are flexible and theoretically grounded tools for nonlinear dimension reduction, but generalizing to non-Gaussian data likelihoods within this nonlinear framework is statistically challenging.…
There are now many explainable AI methods for understanding the decisions of a machine learning model. Among these are those based on counterfactual reasoning, which involve simulating features changes and observing the impact on the…
While discriminative classifiers often yield strong predictive performance, missing feature values at prediction time can still be a challenge. Classifiers may not behave as expected under certain ways of substituting the missing values,…
We extend the idea of tempering stable Levy processes to tempering more general classes of Levy processes. We show that the original process can be decomposed into the sum of the tempered process and an independent point process of large…
We use Levy processes to generate joint prior distributions, and therefore penalty functions, for a location parameter as p grows large. This generalizes the class of local-global shrinkage rules based on scale mixtures of normals,…
Neural network based generative models with discriminative components are a powerful approach for semi-supervised learning. However, these techniques a) cannot account for model uncertainty in the estimation of the model's discriminative…
Data augmentation is a commonly applied technique with two seemingly related advantages. With this method one can increase the size of the training set generating new samples and also increase the invariance of the network against the…
This paper proposes a method for the automatic creation of variables (in the case of regression) that complement the information contained in the initial input vector. The method works as a pre-processing step in which the continuous values…