Victor Dibia — Scifaro

Magentic-UI: Towards Human-in-the-loop Agentic Systems

AI agents powered by large language models are increasingly capable of autonomously completing complex, multi-step tasks using external tools. Yet, they still fall short of human-level performance in most domains including computer use,…

Artificial Intelligence · Computer Science 2025-07-31 Hussein Mozannar , Gagan Bansal , Cheng Tan , Adam Fourney , Victor Dibia , Jingya Chen , Jack Gerrits , Tyler Payne , Matheus Kunzler Maldaner , Madeleine Grunde-McLaughlin , Eric Zhu , Griffin Bassman , Jacob Alber , Peter Chang , Ricky Loynd , Friederike Niedtner , Ece Kamar , Maya Murad , Rafah Hosn , Saleema Amershi

Interactive Debugging and Steering of Multi-Agent AI Systems

Fully autonomous teams of LLM-powered AI agents are emerging that collaborate to perform complex tasks for users. What challenges do developers face when trying to build and debug these AI agent teams? In formative interviews with five AI…

Multiagent Systems · Computer Science 2025-03-06 Will Epperson , Gagan Bansal , Victor Dibia , Adam Fourney , Jack Gerrits , Erkang Zhu , Saleema Amershi

Concept Distillation from Strong to Weak Models via Hypotheses-to-Theories Prompting

Hand-crafting high quality prompts to optimize the performance of language models is a complicated and labor-intensive process. Furthermore, when migrating to newer, smaller, or weaker models (possibly due to latency or cost gains), prompts…

Artificial Intelligence · Computer Science 2025-02-25 Emmanuel Aboah Boateng , Cassiano O. Becker , Nabiha Asghar , Kabir Walia , Ashwin Srinivasan , Ehi Nosakhare , Soundar Srinivasan , Victor Dibia

Challenges in Human-Agent Communication

Remarkable advancements in modern generative foundation models have enabled the development of sophisticated and highly capable autonomous agents that can observe their environment, invoke tools, and communicate with other agents to solve…

Human-Computer Interaction · Computer Science 2024-12-17 Gagan Bansal , Jennifer Wortman Vaughan , Saleema Amershi , Eric Horvitz , Adam Fourney , Hussein Mozannar , Victor Dibia , Daniel S. Weld

Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks

Modern AI agents, driven by advances in large foundation models, promise to enhance our productivity and transform our lives by augmenting our knowledge and capabilities. To achieve this vision, AI agents must effectively plan, perform…

Artificial Intelligence · Computer Science 2024-11-08 Adam Fourney , Gagan Bansal , Hussein Mozannar , Cheng Tan , Eduardo Salinas , Erkang Zhu , Friederike Niedtner , Grace Proebsting , Griffin Bassman , Jack Gerrits , Jacob Alber , Peter Chang , Ricky Loynd , Robert West , Victor Dibia , Ahmed Awadallah , Ece Kamar , Rafah Hosn , Saleema Amershi

Data Analysis in the Era of Generative AI

This paper explores the potential of AI-powered tools to reshape data analysis, focusing on design considerations and challenges. We explore how the emergence of large language and multimodal models offers new opportunities to enhance…

Artificial Intelligence · Computer Science 2024-09-30 Jeevana Priya Inala , Chenglong Wang , Steven Drucker , Gonzalo Ramos , Victor Dibia , Nathalie Riche , Dave Brown , Dan Marshall , Jianfeng Gao

AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems

Multi-agent systems, where multiple agents (generative AI models + tools) collaborate, are emerging as an effective pattern for solving long-running, complex tasks in numerous domains. However, specifying their parameters (such as models,…

Software Engineering · Computer Science 2024-08-29 Victor Dibia , Jingya Chen , Gagan Bansal , Suff Syed , Adam Fourney , Erkang Zhu , Chi Wang , Saleema Amershi

Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications

The rapid development in the field of Large Language Models (LLMs) has led to a surge in applications that facilitate collaboration among multiple agents to assist humans in their daily tasks. However, a significant gap remains in assessing…

Computation and Language · Computer Science 2024-02-26 Negar Arabzadeh , Julia Kiseleva , Qingyun Wu , Chi Wang , Ahmed Awadallah , Victor Dibia , Adam Fourney , Charles Clarke

The Aleph & Other Metaphors for Image Generation

In this position paper, we reflect on fictional stories dealing with the infinite and how they connect with the current, fast-evolving field of image generation models. We draw attention to how some of these literary constructs can serve as…

Human-Computer Interaction · Computer Science 2024-02-13 Gonzalo Ramos , Rick Barraza , Victor Dibia , Sharon Lo

Axiomatic Preference Modeling for Longform Question Answering

The remarkable abilities of large language models (LLMs) like GPT-4 partially stem from post-training processes like Reinforcement Learning from Human Feedback (RLHF) involving human preferences encoded in a reward model. However, these…

Artificial Intelligence · Computer Science 2023-12-06 Corby Rosset , Guoqing Zheng , Victor Dibia , Ahmed Awadallah , Paul Bennett

Aligning Offline Metrics and Human Judgments of Value for Code Generation Models

Large language models have demonstrated great potential to assist programmers in generating code. For such human-AI pair programming scenarios, we empirically demonstrate that while generated code is most often evaluated in terms of their…

Software Engineering · Computer Science 2023-06-14 Victor Dibia , Adam Fourney , Gagan Bansal , Forough Poursabzi-Sangdeh , Han Liu , Saleema Amershi

LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models

Systems that support users in the automatic creation of visualizations must address several subtasks - understand the semantics of data, enumerate relevant visualization goals and generate visualization specifications. In this work, we pose…

Artificial Intelligence · Computer Science 2023-06-07 Victor Dibia

NeuralQA: A Usable Library for Question Answering (Contextual Query Expansion + BERT) on Large Datasets

Existing tools for Question Answering (QA) have challenges that limit their use in practice. They can be complex to set up or integrate with existing infrastructure, do not offer configurable interactive interfaces, and do not cover the…

Computation and Language · Computer Science 2020-12-01 Victor Dibia

Designing for Democratization: Introducing Novices to Artificial Intelligence Via Maker Kits

Existing research highlights the myriad of benefits realized when technology is sufficiently democratized and made accessible to non-technical or novice users. However, democratizing complex technologies such as artificial intelligence (AI)…

Human-Computer Interaction · Computer Science 2019-01-08 Victor Dibia , Aaron Cox , Justin Weisz

Data2Vis: Automatic Generation of Data Visualizations Using Sequence to Sequence Recurrent Neural Networks

Rapidly creating effective visualizations using expressive grammars is challenging for users who have limited time and limited skills in statistics and data visualization. Even high-level, dedicated visualization tools often require users…

Human-Computer Interaction · Computer Science 2018-11-06 Victor Dibia , Çağatay Demiralp

Beyond Heuristics: Learning Visualization Design

In this paper, we describe a research agenda for deriving design principles directly from data. We argue that it is time to go beyond manually curated and applied visualization design guidelines. We propose learning models of visualization…

Human-Computer Interaction · Computer Science 2018-08-17 Bahador Saket , Dominik Moritz , Halden Lin , Victor Dibia , Cagatay Demiralp , Jeffrey Heer