English
Related papers

Related papers: Distribution-Aware Algorithm Design with LLM Agent…

200 papers

Large Language Models (LLMs) have advanced the field of Combinatorial Optimization through automated heuristic generation. Instead of relying on manual design, this LLM-Driven Heuristic Design (LHD) process leverages LLMs to iteratively…

Machine Learning · Computer Science 2026-04-17 Rongzheng Wang , Yihong Huang , Muquan Li , Jiakai Li , Di Liang , Bob Simons , Pei Ke , Shuang Liang , Ke Qin

Can modifying the training data distribution guide optimizers toward solutions with improved generalization when training large language models (LLMs)? In this work, we theoretically analyze an in-context linear regression model with…

Machine Learning · Computer Science 2026-02-03 Tushaar Gangavarapu , Jiping Li , Christopher Vattheuer , Zhangyang Wang , Baharan Mirzasoleiman

Exploration is a fundamental problem in robotics. While sampling-based planners have shown high performance, they are oftentimes compute intensive and can exhibit high variance. To this end, we propose to directly learn the underlying…

Robotics · Computer Science 2022-07-15 Lukas Schmid , Chao Ni , Yuliang Zhong , Roland Siegwart , Olov Andersson

Large language models can predict real-valued quantities from heterogeneous inputs such as text, code, and molecular strings, but most training objectives score each decoded floating-point number independently, improving point estimates…

Machine Learning · Computer Science 2026-05-21 Jungsoo Park , Hyungjoo Chae , Ethan Mendes , Jay DeYoung , Varsha Kishore , Wei Xu , Alan Ritter

Large Language Models have recently emerged as a promising paradigm for automated heuristic design for NP-hard combinatorial optimization problems. Despite this progress, existing LLM-based methods typically rely on monolithic workflows…

Artificial Intelligence · Computer Science 2026-05-11 Yuping Yan , Jirui Han , Fei Ming , Yuanshuai Li , Yaochu Jin

We consider the problem of solving a large-scale system of linear equations in a distributed or federated manner by a taskmaster and a set of machines, each possessing a subset of the equations. We provide a comprehensive comparison of two…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-06-24 Boris Velasevic , Rohit Parasnis , Christopher G. Brinton , Navid Azizan

Large Language Models can generate synthetic survey responses at low cost, but their accuracy varies unpredictably across questions. We study the design problem of allocating a fixed budget of human respondents across estimation tasks when…

Artificial Intelligence · Computer Science 2026-04-21 Zikun Ye , Hema Yoganarasimhan

This paper demonstrates that progressive localization, the gradual increase of attention locality from early distributed layers to late localized layers, represents the optimal architecture for creating interpretable large language models…

Artificial Intelligence · Computer Science 2025-12-16 Joachim Diederich

LLM-based automatic heuristic design has shown promise for generating executable heuristics for combinatorial optimization, but existing methods mainly rely on delayed endpoint performance. We propose a \emph{teacher-aware evolutionary…

Artificial Intelligence · Computer Science 2026-05-12 Minyu Chen , Song Qin , Ling-I Wu , Jianxin Xue , Guoqiang Li

We consider the problem of hypothesis testing for discrete distributions. In the standard model, where we have sample access to an underlying distribution $p$, extensive research has established optimal bounds for uniformity testing,…

Machine Learning · Computer Science 2024-12-03 Maryam Aliakbarpour , Piotr Indyk , Ronitt Rubinfeld , Sandeep Silwal

Randomized linear solvers randomly compress and solve a linear system with compelling theoretical convergence rates and computational complexities. However, such solvers suffer a substantial disconnect between their theoretical rates and…

Numerical Analysis · Mathematics 2023-05-01 Vivak Patel , Mohammad Jahangoshahi , Daniel Adrian Maldonado

Agentic theorem provers often introduce intermediate lemmas, proof sketches, or subgoal decompositions before returning to tactic-level search. This can look like an expensive detour: if proving lemmas is itself hard, why should a learned…

Machine Learning · Computer Science 2026-05-11 Sho Sonoda , Shunta Akiyama , Yuya Uezato

The deployment of large language models' (LLMs) inference at the edge can facilitate prompt service responsiveness while protecting user privacy. However, it is critically challenged by the resource constraints of a single edge node.…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-01-21 Peirong Zheng , Wenchao Xu , Haozhao Wang , Jinyu Chen , Xuemin Shen

We study zeroth-order optimization where solutions must minimize a cost $d(s)$ while maintaining high probability under a complex generative prior $L(s)$ (e.g., a parameterized model). This reduces to sampling from a target distribution…

Machine Learning · Computer Science 2026-05-06 Pranjal Awasthi , Sreenivas Gollapudi , Ravi Kumar , Kamesh Munagala

Recent studies show that LLMs possess different skills and specialize in different tasks. In fact, we observe that their varied performance occur in several levels of granularity. For example, in the code optimization task, code LLMs excel…

Artificial Intelligence · Computer Science 2025-10-24 Yuanzhe Liu , Ryan Deng , Tim Kaler , Xuhao Chen , Charles E. Leiserson , Yao Ma , Jie Chen

Sampling-based motion planning is the predominant paradigm in many real-world robotic applications, but its performance is immensely dependent on the quality of the samples. The majority of traditional planners are inefficient as they use…

Robotics · Computer Science 2020-10-23 Tin Lai , Fabio Ramos

Diffusion-based large language models (dLLMs) are trained flexibly to model extreme dependence in the data distribution; however, how to best utilize this information at inference time remains an open problem. In this work, we uncover an…

In this work, we propose a robust approach to design distributed controllers for unknown-but-sparse linear and time-invariant systems. By leveraging modern techniques in distributed controller synthesis and structured linear inverse…

Optimization and Control · Mathematics 2019-10-14 Salar Fattahi , Nikolai Matni , Somayeh Sojoudi

Multi-robot task planning requires decomposing natural-language instructions into executable actions for heterogeneous robot teams. Conventional Planning Domain Definition Language (PDDL) planners provide rigorous guarantees but struggle to…

Robotics · Computer Science 2026-02-27 Tomoya Kawabe , Rin Takano

In many sequence learning tasks, such as program synthesis and document summarization, a key problem is searching over a large space of possible output sequences. We propose to learn representations of the outputs that are specifically…

Machine Learning · Computer Science 2021-08-09 Joey Hong , David Dohan , Rishabh Singh , Charles Sutton , Manzil Zaheer
‹ Prev 1 2 3 10 Next ›