Related papers: Generation-by-Generation Dissection of the Respons…
We consider a continuous-space and continuous-time diffusion process under resetting with memory. A particle resets to a position chosen from its trajectory in the past according to a memory kernel. Depending on the form of the memory…
We propose a simple yet efficient mechanism for passive error correction in topological quantum memories. Our scheme relies on driven-dissipative ancilla systems which couple to local excitations (anyons) and make them "sink" in energy,…
We find that multifractal scaling is a robust property of a large class of continuous stochastic processes, constructed as exponentials of long-memory processes. The long memory is characterized by a power law kernel with tail exponent…
We considered a model for an infectious disease outbreak, when the depletion of susceptible individuals is negligible, and assumed that individuals adapt their behavior according to the information they receive about new cases. In line with…
A very large class of ODE epidemic models (2.2) discussed in this paper enjoys the property of admitting also an integral renewal formulation, with respect to an "age of infection kernel" a(t) which has a matrix exponential form (3.2). We…
Parallel LLM inference scaling involves sampling a set of $N>1$ responses for a single input prompt. However, these $N$ parallel responses tend to be generated independently from each other, partitioning compute resources and leaving…
Kernel continual learning by \citet{derakhshani2021kernel} has recently emerged as a strong continual learner due to its non-parametric ability to tackle task interference and catastrophic forgetting. Unfortunately its success comes at the…
Test-time scaling has emerged as an effective way to improve language models on challenging reasoning tasks. However, most existing methods treat each problem in isolation and do not systematically reuse knowledge from prior reasoning…
A central challenge in cognitive neuroscience is to explain how semantic and episodic memory, two major forms of declarative memory, typically associated with cortical and hippocampal processing, interact to support learning, recall, and…
Athermal systems across a large range of length scales, ranging from foams and granular bead packings to crumpled metallic sheets, exhibit slow stress relaxation when compressed. Experimentally they show a non-monotonic stress response when…
We prove limit theorems of an entirely new type for certain long memory regularly varying stationary infinitely divisible random processes. These theorems involve multiple phase transitions governed by how long the memory is. Apart from one…
While humans naturally learn and adapt from past experiences, large language models (LLMs) and their agentic counterparts struggle to retain reasoning from previous tasks and apply them in future contexts. To address this limitation, we…
Catastrophic forgetting in continual learning is a common destructive phenomenon in gradient-based neural networks that learn sequential tasks, and it is much different from forgetting in humans, who can learn and accumulate knowledge…
Large language models handle single-turn generation well, but multi-turn interactions still require the model to reconstruct user intent and task state from an expanding token history because internal representations do not persist across…
A generalised form of time-translation-invariance permits to re-derive the known generic phenomenology of ageing, which arises in classical many-body systems after a quench from an initially disordered system to a temperature $T\leq T_c$,…
Reverse Chain-of-Thought Generation (RCG) synthesizes reasoning traces from query-answer pairs, but runs the risk of producing post-hoc rationalizations: when models can see the answer during generation, the answer serves as a cognitive…
Predicting how cells respond to genetic perturbations is fundamental to understanding gene function, disease mechanisms, and therapeutic development. While recent deep learning approaches have shown promise in modeling single-cell…
It is argued that systems whose elements are renewed according to an extremal criterion can generally be expected to exhibit long-term memory. This is verified for the minimal extremally driven model, which is first defined and then solved…
We reconsider back-reaction from large amplitude, short-scale perturbations onto a long wavelength adiabatic mode. In a loop expansion of the long-mode power spectrum, this back-reaction appears first at 1-loop. Due to the separation…
Systems with long-range persistence and memory are shown to exhibit different precursory as well as recovery patterns in response to shocks of exogeneous versus endogeneous origins. By endogeneous, we envision either fluctuations resulting…