Thomas Cook — Scifaro

Continual Learning of Domain Knowledge from Human Feedback in Text-to-SQL

Large Language Models (LLMs) can generate SQL queries from natural language questions but struggle with database-specific schemas and tacit domain knowledge. We introduce a framework for continual learning from human feedback in…

Computation and Language · Computer Science 2025-12-01 Thomas Cook , Kelly Patel , Sivapriya Vellaichamy , Udari Madhushani Sehwag , Saba Rahimi , Zhen Zeng , Sumitra Ganesh

ScaleCall -- Agentic Tool Calling at Scale for Fintech: Challenges, Methods, and Deployment Insights

While Large Language Models (LLMs) excel at tool calling, deploying these capabilities in regulated enterprise environments such as fintech presents unique challenges due to on-premises constraints, regulatory compliance requirements, and…

Software Engineering · Computer Science 2025-11-04 Richard Osuagwu , Thomas Cook , Maraim Masoud , Koustav Ghosal , Riccardo Mattivi

Retrieval Augmented Generation (RAG) for Fintech: Agentic Design and Evaluation

Retrieval-Augmented Generation (RAG) systems often face limitations in specialized domains such as fintech, where domain-specific ontologies, dense terminology, and acronyms complicate effective retrieval and synthesis. This paper…

Artificial Intelligence · Computer Science 2025-10-30 Thomas Cook , Richard Osuagwu , Liman Tsatiashvili , Vrynsia Vrynsia , Koustav Ghosal , Maraim Masoud , Riccardo Mattivi

Prune 'n Predict: Optimizing LLM Decision-making with Conformal Prediction

Large language models (LLMs) are empowering decision-making in several applications, including tool or API usage and answering multiple-choice questions (MCQs). However, incorrect outputs pose significant risks in high-stakes domains like…

Machine Learning · Computer Science 2025-07-15 Harit Vishwakarma , Alan Mishler , Thomas Cook , Niccolò Dalmasso , Natraj Raman , Sumitra Ganesh

Hedging in Sequential Experiments

Experimentation involves risk. The investigator expends time and money in the pursuit of data that supports a hypothesis. In the end, the investigator may find that all of these costs were for naught and the data fail to reject the null.…

Risk Management · Quantitative Finance 2024-06-25 Thomas Cook , Patrick Flaherty

Semiparametric Efficient Inference in Adaptive Experiments

We consider the problem of efficient inference of the Average Treatment Effect in a sequential experiment where the policy governing the assignment of subjects to treatment or control can change over time. We first provide a central limit…

Machine Learning · Statistics 2024-03-05 Thomas Cook , Alan Mishler , Aaditya Ramdas

Cost-aware Generalized $\alpha$-investing for Multiple Hypothesis Testing

We consider the problem of sequential multiple hypothesis testing with nontrivial data collection costs. This problem appears, for example, when conducting biological experiments to identify differentially expressed genes of a disease…

Machine Learning · Computer Science 2023-11-06 Thomas Cook , Harsh Vardhan Dubey , Ji Ah Lee , Guangyu Zhu , Tingting Zhao , Patrick Flaherty

Heavy quarkonium suppression beyond the adiabatic limit

Many prior studies of in-medium quarkonium suppression have implicitly made use of an adiabatic approximation in which it was assumed that the heavy quark potential is a slowly varying function of time. In the adiabatic limit, one can…

High Energy Physics - Phenomenology · Physics 2019-10-30 Jacob Boyd , Thomas Cook , Ajaharul Islam , Michael Strickland