Computer Science - Scifaro

Pairwise Reflection Symmetry in Generalized Latin Rectangles

Many combinatorial designs ask for equal distribution of given symbols across the entries of a matrix. The paramount examples are Latin squares, where each symbol from $\{1,\dots,n\}$ appears once per row and column of an $n\times n$…

Discrete Mathematics · Computer Science 2026-06-26 Enrico Iurlano, Günther R. Raidl

VGB for Masked Diffusion Model: Efficient Test-time Scaling for Reward Satisfaction and Sample Editing

Inference-time scaling is a promising paradigm to improve generative models, especially when outputs must satisfy structural constraints or optimize downstream rewards. We consider Masked Diffusion Model (MDM) and introduce MDM-VGB, a…

Machine Learning · Computer Science 2026-06-26 Kijung Jeon, Thuy-Duong Vuong, Molei Tao

How Width and Data Shape Generalization Scaling Laws in Quadratic Neural Networks

Understanding how performance scales jointly with model size and data is a central problem in modern machine learning. Existing theoretical works on scaling laws typically describe generalization as a function of data or compute, often in…

Machine Learning · Computer Science 2026-06-26 Julius Girardin, Emanuele Troiani, Yizhou Xu, Vittorio Erba, Florent Krzakala, Lenka Zdeborová

Disentangling Continuous-Time Latent Dynamics: Identifiability of Latent SDEs via Diffusion Shifts

Causal representation learning for time series has developed strong identifiability results in discrete-time latent causal models, but identifiability in continuous-time latent stochastic differential equation (SDE) models remains largely…

Machine Learning · Computer Science 2026-06-26 Yuanyuan Wang, Wenjie Wang, Haoxuan Li, Mingming Gong, Kun Zhang

Estimation–Prediction Tradeoff in Causal Probabilistic Temporal Graphs

Temporal link prediction is usually evaluated by predictive performance on unseen edges, but in probabilistic temporal graphs this criterion can conflate model error with irreducible uncertainty. We study this issue by characterising an…

Machine Learning · Computer Science 2026-06-26 Aniq Ur Rahman

Recovering Sharp Conductivity Features in the Finite-Data Calderón Problem with Physics-Informed Neural Networks

Physics-informed neural networks (PINNs) have recently emerged as a promising framework for addressing the Calderón inverse problem from limited boundary data. In this work, we revisit neural Calderón inversion by introducing multiscale…

Machine Learning · Computer Science 2026-06-26 Ali AlHadi Kalout, Pablo Tejerina-Pérez, Konstantin Karchev, Pedro Tarancón-Álvarez, Leonid Sarieddine, Raul Jimenez, Max Engelstein, Guy David

Dangerous Liaisons of Convex Learning and Non-Affine Aggregation

Last-iterate convergence and generalization guarantees in first-order convex learning hinge on the monotonicity of the update operator. While linear averaging preserves the monotonicity of gradient updates, this property is often violated…

Machine Learning · Computer Science 2026-06-26 Thomas Boudou, Batiste Le Bars, Nirupam Gupta, Aurélien Bellet

Discrete Event Population Updates: finding game theoretic emergent behaviour in queueing systems with simulation

Strategic behaviour in queueing systems has been studied extensively in the behavioural queueing literature, but almost exclusively for systems that admit closed-form expressions for the cost or utility experienced by a strategic user.…

Computer Science and Game Theory · Computer Science 2026-06-26 Vincent Knight, Geraint I. Palmer-Liyu, Thomas Hutton

Fast and Feasible: Permutation-based Constrained Reranking for Revenue Maximization

Search and recommender systems have produced highly relevant search results. A natural next step in the development of such systems in e-commerce is to rerank these results to increase the platform's revenue from paid promotion products.…

Information Retrieval · Computer Science 2026-06-26 Svetlana Shirokovskikh, Anastasiia Soboleva, Ekaterina Solodneva, Aleksandr Katrutsa, Roman Loginov, Egor Samosvat

DG^VoiC: Speaker Clustering for Fraud Investigation under Real Call-Centre Conditions

Insurance fraud remains costly and operationally difficult, particularly in call-centre workflows where many customer interactions begin at FNOL. While recent fraud detection methods mainly rely on structured data, text, or images, repeated…

Sound · Computer Science 2026-06-26 Muhammad Shakeel Akram, Amal Htait, Abdul Hamid Sadka, Emma Meisingseth, Karishma Jaitly

A Flexible Encoding Model for Non-Unique Note Alignments

Symbolic music alignment links notes in a symbolic performance to their counterparts in a score. While existing alignment encoding formats provide unique correspondences between these notes, there are various musical practices and forms…

Sound · Computer Science 2026-06-26 Suhit Chiruthapudi, Adam Štefunko, Silvan Peter, Patricia Hu, Jan Hajič, Carlos Eduardo Cancino-Chacón

Performance Analysis and Optimal Design of ORB-Type GRAND Algorithms

Guessing Random Additive Noise Decoding (GRAND) performs decoding by sequentially guessing channel error patterns (EPs). Ordered Reliability Bits GRAND (ORBGRAND) is a notable instance suitable for efficient implementation, as it schedules…

Information Theory · Computer Science 2026-06-26 Li Wan, Wenyi Zhang

Dialogue to Detection: A Multimodal Hybrid NLP Pipeline for Insurance Fraud Detection

Insurance fraud imposes substantial financial losses and operational inefficiencies, raising premiums and impacting trust among legitimate policyholders. Early detection at FNOL remains a persistent challenge. Existing approaches rely…

Computation and Language · Computer Science 2026-06-26 Muhammad Shakeel Akram, Amal Htait, Abdul Hamid Sadka, Emma Meisingseth, Karishma Jaitly

Benchmarking on Tasks That Matter: Dataset Selection for Preserving Model Rankings

Benchmarks of machine learning models often include many datasets, making evaluation expensive. For efficiency, it is preferable to perform evaluations on small, representative datasets instead. The selection of such subsets typically…

Machine Learning · Computer Science 2026-06-26 Rostislav Gusev, Alexey Zaytsev

Grammar-Guided Hierarchical Parsing for Long-form Audio Activity Recognition

Long-form audio exhibits an inherent hierarchy: fine-grained events form sub-activities, which in turn constitute higher-level activities. Prior work often models these levels separately, leading to cross-level inconsistencies and requiring…

Sound · Computer Science 2026-06-26 Peng Zhang, Qingyu Luo, Philip J. B. Jackson, Wenwu Wang

AI Persuasive Framing in Collective Dilemmas

AI agents are promising tools that can act as flexible behavioral nudges to enhance human cooperation in addressing large-scale societal problems. However, evidence on whether AI agents can effectively boost cooperation remains mixed. We…

Computers and Society · Computer Science 2026-06-26 Anders Giovanni Møller, Alessia Galdeman, Arianna Pera, Luca Maria Aiello

Two-Stage Fine-Tuning for Protein Sequence Generation with Targeted Amino-Acid Composition

Protein language models are standard priors for biological sequence generation, but steering them toward explicit distributional design targets remains largely unexplored. We study a constrained protein generation problem in which sequences…

Machine Learning · Computer Science 2026-06-26 Violeta Basten-Romero, Rubén Muñoz-Tafalla, Anna María Díaz-Rovira, Bertran Miquel-Oliver, Isaac Filella-Merce, Víctor Guallar

Agentic AI-Powered Re-Identification: An Emerging, Scalable Threat to Mobility Microdata Privacy

The widespread collection of fine-grained location data by commercial data brokers creates a re-identification risk that is not widely recognised by the public. While prior research has established that mobility traces are highly unique and…

Cryptography and Security · Computer Science 2026-06-26 Oscar Thees, Roman Müller, Matthias Templ

MathModDB: A Database for Mathematical Models

When researchers need a mathematical model for a research problem, they face a fragmented landscape: relevant formulas, quantities, assumptions, and model variants are scattered across publications and domain-specific conventions. The…

Digital Libraries · Computer Science 2026-06-26 Jochen Fiedler, Christine Biedinger, Marco Reidelbach, Björn Schembera, Burkhard Schmidt, Aurela Shehu, Thomas Koprucki

An LLM-Powered Semantic Alignment Framework for Journal Recommendation

Journal recommendation is an important task in scholarly information systems. Existing approaches typically rely on supervised learning models, manually engineered features, or historical interaction data, which may limit their…

Information Retrieval · Computer Science 2026-06-26 Yanglin Yan, Zicheng Xie, Tianchen Gao, Rui Pan, Hansheng Wang