Related papers: CEDAR: Context Engineering for Agentic Data Scienc…

DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning

In this work, we investigate the potential of large language models (LLMs) based agents to automate data science tasks, with the goal of comprehending task requirements, then building and training the best-fit machine learning models.…

Machine Learning · Computer Science 2024-05-29 Siyuan Guo , Cheng Deng , Ying Wen , Hechang Chen , Yi Chang , Jun Wang

Monadic Context Engineering

The proliferation of Large Language Models (LLMs) has catalyzed a shift towards autonomous agents capable of complex reasoning and tool use. However, current agent architectures are frequently constructed using imperative, ad hoc patterns.…

Artificial Intelligence · Computer Science 2026-01-23 Yifan Zhang , Yang Yuan , Mengdi Wang , Andrew Chi-Chih Yao

A Language for Describing Agentic LLM Contexts

Large language models are increasingly used within larger systems ("LLM agents"). These make a sequence of LLM calls, each call providing the LLM with a combination of instructions, observations, and interaction history. The design of the…

Artificial Intelligence · Computer Science 2026-05-05 Noga Peleg Pelc , Gal A. Kaminka , Yoav Goldberg

On Problems of Implicit Context Compression for Software Engineering Agents

LLM-based Software Engineering agents face a critical bottleneck: context length limitations cause failures on complex, long-horizon tasks. One promising solution is to encode context as continuous embeddings rather than discrete tokens,…

Software Engineering · Computer Science 2026-05-13 Kirill Gelvan , Igor Slinko , Felix Steinbauer , Egor Bogomolov , Florian Kofler , Yaroslav Zharov

Context Matters! Relaxing Goals with LLMs for Feasible 3D Scene Planning

Embodied agents need to plan and act reliably in real and complex 3D environments. Classical planning (e.g., PDDL) offers structure and guarantees, but in practice it fails under noisy perception and incorrect predicate grounding. On the…

Robotics · Computer Science 2026-03-10 Emanuele Musumeci , Michele Brienza , Francesco Argenziano , Abdel Hakim Drid , Vincenzo Suriani , Daniele Nardi , Domenico D. Bloisi

Declarative Data Services: Structured Agentic Discovery for Composing Data Systems

Agentic discovery has shown that LLM-driven search can find novel algorithms, designs, and code under benchmark conditions. Translating the paradigm to multi-system data backends surfaces a harder problem: the search space is heterogeneous,…

Artificial Intelligence · Computer Science 2026-05-27 Shanshan Ye , Duo Lu

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Large language model (LLM) applications such as agents and domain-specific reasoning increasingly rely on context adaptation: modifying inputs with instructions, strategies, or evidence, rather than weight updates. Prior approaches improve…

Machine Learning · Computer Science 2026-03-31 Qizheng Zhang , Changran Hu , Shubhangi Upasani , Boyuan Ma , Fenglu Hong , Vamsidhar Kamanuru , Jay Rainton , Chen Wu , Mengmeng Ji , Hanchen Li , Urmish Thakker , James Zou , Kunle Olukotun

LLM-Based Agentic Systems for Software Engineering: Challenges and Opportunities

Despite recent advancements in Large Language Models (LLMs), complex Software Engineering (SE) tasks require more collaborative and specialized approaches. This concept paper systematically reviews the emerging paradigm of LLM-based…

Software Engineering · Computer Science 2026-01-21 Yongjian Tang , Thomas Runkler

Context Engineering for Multi-Agent LLM Code Assistants Using Elicit, NotebookLM, ChatGPT, and Claude Code

Large Language Models (LLMs) have shown promise in automating code generation and software engineering tasks, yet they often struggle with complex, multi-file projects due to context limitations and knowledge gaps. We propose a novel…

Software Engineering · Computer Science 2025-08-13 Muhammad Haseeb

A Survey on Context-Aware Multi-Agent Systems: Techniques, Challenges and Future Directions

Research interest in autonomous agents is on the rise as an emerging topic. The notable achievements of Large Language Models (LLMs) have demonstrated the considerable potential to attain human-like intelligence in autonomous agents.…

Multiagent Systems · Computer Science 2025-01-30 Hung Du , Srikanth Thudumu , Rajesh Vasa , Kon Mouzakis

Meta Context Engineering via Agentic Skill Evolution

The operational efficacy of large language models relies heavily on their inference-time context. This has established Context Engineering (CE) as a formal discipline for optimizing these inputs. Current CE methods rely on manually crafted…

Artificial Intelligence · Computer Science 2026-02-12 Haoran Ye , Xuning He , Vincent Arak , Haonan Dong , Guojie Song

When LLMs Team Up: A Coordinated Attack Framework for Automated Cyber Intrusions

Automated intrusion-style workflows require LLM agents to reason over partial observations, tool outputs, and executable artifacts under bounded budgets. A single LLM instance often compresses evidence extraction, planning, execution, and…

Cryptography and Security · Computer Science 2026-05-12 Minfeng Qi , Tianqing Zhu , Zijie Xu , Congcong Zhu , Qin Wang , Wanlei Zhou

Context Matters: Evaluating Context Strategies for Automated ADR Generation Using LLMs

Architecture Decision Records (ADRs) play a critical role in preserving the rationale behind system design, yet their creation and maintenance are often neglected due to the associated authoring overhead. This paper investigates whether…

Software Engineering · Computer Science 2026-04-16 Aviral Gupta , Rudra Dhar , Daniel Feitosa , Karthik Vaidhyanathan

Improving Coherence and Persistence in Agentic AI for System Optimization

Designing high-performance system heuristics is a creative, iterative process requiring experts to form hypotheses and execute multi-step conceptual shifts. While Large Language Models (LLMs) show promise in automating this loop, they…

Artificial Intelligence · Computer Science 2026-03-24 Pantea Karimi , Kimia Noorbakhsh , Mohammad Alizadeh , Hari Balakrishnan

CESAR: Automatic Induction of Compositional Instructions for Multi-turn Dialogs

Instruction-based multitasking has played a critical role in the success of large language models (LLMs) in multi-turn dialog applications. While publicly available LLMs have shown promising performance, when exposed to complex instructions…

Computation and Language · Computer Science 2023-11-30 Taha Aksu , Devamanyu Hazarika , Shikib Mehri , Seokhwan Kim , Dilek Hakkani-Tür , Yang Liu , Mahdi Namazifar

AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science

Large language models (LLMs) have advanced the automation of data science workflows. Yet it remains unclear whether they can critically leverage external domain knowledge as human data scientists do in practice. To answer this question, we…

Machine Learning · Computer Science 2025-10-24 An Luo , Xun Xian , Jin Du , Fangqiao Tian , Ganghua Wang , Ming Zhong , Shengchun Zhao , Xuan Bi , Zirui Liu , Jiawei Zhou , Jayanth Srinivasa , Ashish Kundu , Charles Fleming , Mingyi Hong , Jie Ding

Conversational Challenges in AI-Powered Data Science: Obstacles, Needs, and Design Opportunities

Large Language Models (LLMs) are being increasingly employed in data science for tasks like data preprocessing and analytics. However, data scientists encounter substantial obstacles when conversing with LLM-powered chatbots and acting on…

Human-Computer Interaction · Computer Science 2023-10-26 Bhavya Chopra , Ananya Singha , Anna Fariha , Sumit Gulwani , Chris Parnin , Ashish Tiwari , Austin Z. Henley

AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science

Data science plays a critical role in transforming complex data into actionable insights across numerous domains. Recent developments in large language models (LLMs) and artificial intelligence (AI) agents have significantly automated data…

Machine Learning · Computer Science 2026-03-20 An Luo , Jin Du , Xun Xian , Robert Specht , Fangqiao Tian , Ganghua Wang , Xuan Bi , Charles Fleming , Ashish Kundu , Jayanth Srinivasa , Mingyi Hong , Rui Zhang , Tianxi Li , Galin Jones , Jie Ding

Data Agent: A Holistic Architecture for Orchestrating Data+AI Ecosystems

Traditional Data+AI systems utilize data-driven techniques to optimize performance, but they rely heavily on human experts to orchestrate system pipelines, enabling them to adapt to changes in data, queries, tasks, and environments. For…

Databases · Computer Science 2025-07-03 Zhaoyan Sun , Jiayi Wang , Xinyang Zhao , Jiachi Wang , Guoliang Li

Intrinsic Memory Agents: Heterogeneous Multi-Agent LLM Systems through Structured Contextual Memory

Multi-agent systems built on Large Language Models (LLMs) show exceptional promise for complex collaborative problem-solving, yet they face fundamental challenges stemming from context window limitations that impair memory consistency, role…

Artificial Intelligence · Computer Science 2026-01-13 Sizhe Yuen , Francisco Gomez Medina , Ting Su , Yali Du , Adam J. Sobey