Interactive Debugging and Steering of Multi-Agent AI Systems

Will Epperson, Gagan Bansal, Victor Dibia, Adam Fourney, Jack Gerrits, et al. · 2025

Computer science

Fully autonomous teams of LLM-powered AI agents are emerging that collaborate to perform complex tasks for users. What challenges do developers face when trying to build and debug these AI agent teams? In formative interviews with five AI …

Challenges in Human-Agent Communication

Gagan Bansal, Jennifer Wortman Vaughan, Saleema Amershi, Eric Horvitz, Adam Fourney, et al. · 2024

Business, Computer science

Remarkable advancements in modern generative foundation models have enabled the development of sophisticated and highly capable autonomous agents that can observe their environment, invoke tools, and communicate with other agents to solve …

Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks

Adam Fourney, Gagan Bansal, Hussein Mozannar, Cheng Tan, Eduardo Salinas, et al. · 2024

Computer science, Psychology, Biology

Modern AI agents, driven by advances in large foundation models, promise to enhance our productivity and transform our lives by augmenting our knowledge and capabilities. To achieve this vision, AI agents must effectively plan, perform mul…

AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems

Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, et al. · 2024

Computer science

Multi-agent systems, where multiple agents (generative AI models + tools) collaborate, are emerging as an effective pattern for solving long-running, complex tasks in numerous domains. However, specifying their parameters (such as models, …

Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications

Negar Arabzadeh, Julia Kiseleva, Qingyun Wu, Chi Wang, Ahmed Hassan Awadallah, et al. · 2024

Computer science, Business, Engineering

The rapid development in the field of Large Language Models (LLMs) has led to a surge in applications that facilitate collaboration among multiple agents to assist humans in their daily tasks. However, a significant gap remains in assessin…

Axiomatic Preference Modeling for Longform Question Answering

Corby Rosset, Guo‐qing Zheng, Victor Dibia, Ahmed Hassan Awadallah, Paul N. Bennett · 2023

Computer science, Mathematics, Engineering

The remarkable abilities of large language models (LLMs) like GPT-4 partially stem from post-training processes like Reinforcement Learning from Human Feedback (RLHF) involving human preferences encoded in a reward model. However, these re…

LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models

Victor Dibia · 2023

Computer science

Systems that support users in the automatic creation of visualizations must address several subtasks - understand the semantics of data, enumerate relevant visualization goals and generate visualization specifications. In this work, we pos…

Axiomatic Preference Modeling for Longform Question Answering

Corby Rosset, Guo‐qing Zheng, Victor Dibia, Ahmed Hassan Awadallah, Paul N. Bennett · 2023

Computer science, Mathematics, Engineering

The remarkable abilities of large language models (LLMs) like ChatGPT and GPT-4 partially stem from the post-training processes involving human preferences encoded within a reward model as part of a Reinforcement Learning from Human Feedba…

Aligning Offline Metrics and Human Judgments of Value for Code Generation Models

Victor Dibia, Adam Fourney, Gagan Bansal, Forough Poursabzi-Sangdeh, Han Liu, et al. · 2023

Computer science, Economics, Mathematics

Large language models have demonstrated great potential to assist programmers in generating code. For such human-AI pair programming scenarios, we empirically demonstrate that while generated code is most often evaluated in terms of their…

Aligning Offline Metrics and Human Judgments of Value for Code Generation Models

Victor Dibia, Adam Fourney, Gagan Bansal, Forough Poursabzi-Sangdeh, Han Liu, et al. · 2022

Computer science, Mathematics, Economics

Large language models have demonstrated great potential to assist programmers in generating code. For such human-AI pair programming scenarios, we empirically demonstrate that while generated code is most often evaluated in terms of their …

NeuralQA: A Usable Library for Question Answering (Contextual Query Expansion + BERT) on Large Datasets

Victor Dibia · 2020

Computer science

Existing tools for Question Answering (QA) have challenges that limit their use in practice. They can be complex to set up or integrate with existing infrastructure, do not offer configurable interactive interfaces, and do not cover the fu…

Data2Vis: Automatic Generation of Data Visualizations Using Sequence-to-Sequence Recurrent Neural Networks

Victor Dibia, Çağatay Demiralp · 2019

Computer science, Biology, Philosophy

Rapidly creating effective visualizations using expressive grammars is challenging for users who have limited time and limited skills in statistics and data visualization. Even high-level, dedicated visualization tools often require users …

Beyond Heuristics: Learning Visualization Design

Bahador Saket, Dominik Moritz, Halden Lin, Victor Dibia, Çağatay Demiralp, et al. · 2018

Computer science

In this paper, we describe a research agenda for deriving design principles directly from data. We argue that it is time to go beyond manually curated and applied visualization design guidelines. We propose learning models of visualization…

Designing for Democratization: Introducing Novices to Artificial Intelligence Via Maker Kits

Victor Dibia, Maryam Ashoori, Aaron Cox · 2018

Computer science, Political science, Engineering

Existing research highlights the myriad of benefits realized when technology is sufficiently democratized and made accessible to non-technical or novice users. However, democratizing complex technologies such as artificial intelligence (AI)…

A Cognitive Assistant for Visualizing and Analyzing Exoplanets

Jeffrey O. Kephart, Victor Dibia, Jason Ellis, Biplav Srivastava, Kartik Talamadupula, et al. · 2018

Computer science, History, Biology

We demonstrate an embodied cognitive agent that helps scientists visualize and analyze exo-planets and their host stars. The prototype is situated in a room equipped with a large display, microphones, cameras, speakers, and pointing device…

Victor Dibia