Eldon Schoop
AgentBuilder: Exploring Scaffolds for Prototyping User Experiences of Interface Agents
Interface agents powered by generative AI models (referred to as "agents") can automate actions based on user commands. An important aspect of developing agents is their user experience (i.e., agent experience). There is a growing need to …
Athena: Intermediate Representations for Iterative Scaffolded App Generation with an LLM
It is challenging to generate the code for a complete user interface using a Large Language Model (LLM). User interfaces are complex and their implementations often consist of multiple, inter-related files that together specify the content…
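A minimal sketch of what an intermediate representation for this kind of scaffolded, iterative generation might look like, assuming the app is decomposed into screens and views before any concrete code is emitted. The structure and field names below are illustrative assumptions, not the paper's actual representation.

```python
# Hypothetical intermediate representation for scaffolded app generation.
# An LLM would fill in or refine this structure over several iterations,
# and a separate step would translate it into the final UI code files.
from dataclasses import dataclass, field


@dataclass
class View:
    kind: str                      # e.g. "text", "button", "list"
    label: str = ""
    children: list["View"] = field(default_factory=list)


@dataclass
class Screen:
    name: str
    views: list[View] = field(default_factory=list)


@dataclass
class AppSpec:
    title: str
    screens: list[Screen] = field(default_factory=list)


# An iterative workflow might ask the model to add or revise one screen per
# step, validating the spec between steps before generating code from it.
spec = AppSpec(
    title="Recipe Browser",
    screens=[Screen("Home", [View("list", "All recipes")])],
)
print(spec)
```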
From Interaction to Impact: Towards Safer AI Agents Through Understanding and Evaluating Mobile UI Operation Impacts
With advances in generative AI, there is increasing work towards creating autonomous agents that can manage daily tasks by operating user interfaces (UIs). While prior research has studied the mechanics of how AI agents might navigate UIs …
UICoder: Finetuning Large Language Models to Generate User Interface Code through Automated Feedback
Large language models (LLMs) struggle to consistently generate UI code that compiles and produces visually relevant designs. Existing approaches to improve generation rely on expensive human feedback or distilling a proprietary model. In t…
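The title points to a generate-filter-finetune loop driven by automated checks rather than human feedback. The sketch below shows the general shape of such a loop under that assumption; `generate_candidates`, `compiles`, and `finetune` are hypothetical placeholders, not the paper's actual pipeline.

```python
# Sketch of an automated-feedback loop: generate UI code with the current
# model, keep only samples that pass automated checks (e.g. compilation),
# and use the surviving samples as finetuning data for the next round.
# All functions below are hypothetical placeholders.

def generate_candidates(model, prompts, n_per_prompt=4):
    return [(p, model(p)) for p in prompts for _ in range(n_per_prompt)]

def compiles(code: str) -> bool:
    # Stand-in for invoking a real compiler or renderer on the generated code.
    return "TODO" not in code

def finetune(model, dataset):
    # Stand-in for an actual finetuning step; returns an updated model.
    return model

def improve(model, prompts, rounds=3):
    for _ in range(rounds):
        candidates = generate_candidates(model, prompts)
        kept = [(p, c) for p, c in candidates if compiles(c)]
        if kept:
            model = finetune(model, kept)
    return model

# Toy usage: a "model" that always emits the same snippet.
improved = improve(lambda p: 'Text("Hello")', ["a login screen"], rounds=1)
```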
AXNav: Replaying Accessibility Tests from Natural Language
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
Recent advancements in multimodal large language models (MLLMs) have been noteworthy, yet these general-domain MLLMs often fall short in their ability to comprehend and interact effectively with user interface (UI) screens. In this paper,…
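Grounded UI understanding is commonly framed as instruction-response pairs that reference screen regions by coordinates. The record below is an illustrative example of that framing only; the field names and format are assumptions, not Ferret-UI's actual data schema.

```python
# Illustrative grounded-UI training record: the instruction refers to a
# region of the screenshot by a normalized bounding box, and the response
# is grounded in that region. All field names are assumptions.
sample = {
    "image": "screenshot_001.png",
    "instruction": "What does the element at the given region do?",
    "region": {"x_min": 0.72, "y_min": 0.05, "x_max": 0.95, "y_max": 0.11},
    "response": "It is a 'Share' button that opens the system share sheet.",
}
print(sample["response"])
```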
Never-ending Learning of User Interfaces
Machine learning models have been trained to predict semantic information about user interfaces (UIs) to make apps more accessible, easier to test, and to automate. Currently, most models rely on datasets of static screenshots that are lab…
ILuvUI: Instruction-tuned LangUage-Vision modeling of UIs from Machine Conversations
Multimodal Vision-Language Models (VLMs) enable powerful applications from their fused understanding of images and language, but many perform poorly on UI tasks due to the lack of UI training data. In this paper, we adapt a recipe for gene…
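The abstract mentions adapting a recipe for generating training data from machine conversations. A heavily simplified sketch of that idea, assuming a text-only LLM is prompted with a textual description of a UI screen to produce question-answer pairs; `ask_llm` and the prompt are hypothetical placeholders, not the paper's actual recipe.

```python
# Sketch: turn a textual description of a UI screen into instruction-
# following training pairs by prompting a language model. `ask_llm` is a
# hypothetical placeholder for a real LLM call.
def ask_llm(prompt: str) -> str:
    return "Q: What does the blue button do?\nA: It submits the login form."

def make_training_pairs(ui_description: str) -> list[dict]:
    prompt = (
        "You are shown a description of a mobile UI screen:\n"
        f"{ui_description}\n"
        "Write a question a user might ask about this screen and its answer."
    )
    raw = ask_llm(prompt)
    question, answer = raw.split("\nA: ", 1)
    return [{"question": question.removeprefix("Q: "), "answer": answer}]

pairs = make_training_pairs(
    "A login screen with username, password, and a blue 'Sign in' button."
)
print(pairs[0])
```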
Predicting and Explaining Mobile UI Tappability with Vision Modeling and Saliency Analysis
We use a deep learning based approach to predict whether a selected element in a mobile UI screenshot will be perceived by users as tappable, based on pixels only instead of view hierarchies required by previous work. To help designers bet…
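A minimal sketch of the pixels-only framing described above: a small convolutional network takes the screenshot plus a mask marking the selected element and outputs a tappability probability. The architecture is illustrative only and is not the model from the paper.

```python
# Illustrative pixels-only tappability predictor: the screenshot (3 channels)
# is concatenated with a binary mask of the selected element (1 channel),
# and a small CNN outputs the probability that users perceive it as tappable.
import torch
import torch.nn as nn

class TappabilityNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(4, 16, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.head = nn.Linear(32, 1)

    def forward(self, screenshot, element_mask):
        x = torch.cat([screenshot, element_mask], dim=1)  # (B, 4, H, W)
        x = self.features(x).flatten(1)
        return torch.sigmoid(self.head(x))                # tappability prob.

model = TappabilityNet()
prob = model(torch.rand(1, 3, 128, 128), torch.zeros(1, 1, 128, 128))
print(prob.shape)  # torch.Size([1, 1])
```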
IMACS: Image Model Attribution Comparison Summaries
Developing a suitable Deep Neural Network (DNN) often requires significant iteration, where different model versions are evaluated and compared. While metrics such as accuracy are a powerful means to succinctly describe a model's performan…
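The title suggests summarizing how attribution maps differ between model versions. A sketch of one simple way to do that, assuming per-pixel attribution maps are already available for each model; the normalization and the difference metric below are assumptions, not the paper's method.

```python
# Sketch: summarize how two models' attribution maps differ on one image.
# Attributions are normalized to sum to 1 so the comparison reflects where
# each model looks rather than the overall magnitude. Placeholder data only.
import numpy as np

def normalize(attr: np.ndarray) -> np.ndarray:
    attr = np.abs(attr)
    return attr / (attr.sum() + 1e-8)

def attribution_shift(attr_a: np.ndarray, attr_b: np.ndarray) -> float:
    # Total variation distance between the two normalized attribution maps:
    # 0 means identical focus, 1 means completely disjoint focus.
    return 0.5 * np.abs(normalize(attr_a) - normalize(attr_b)).sum()

rng = np.random.default_rng(0)
a, b = rng.random((64, 64)), rng.random((64, 64))
print(f"attribution shift: {attribution_shift(a, b):.3f}")
```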
UMLAUT: Debugging Deep Learning Programs using Program Structure and Model Behavior
Training deep neural networks can generate non-descriptive error messages or produce unusual output without any explicit errors at all. While experts rely on tacit knowledge to apply debugging strategies, non-experts lack the experience re…
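A simplified illustration of the kind of heuristic check this line of work applies: inspecting the training setup for common silent mistakes. The specific rule below (a softmax output paired with a loss configured for raw logits) and the config format are assumed examples, not checks drawn from the paper.

```python
# Sketch of a heuristic debugging check for deep learning programs: scan a
# simple description of the training setup for a common silent error.
def check_softmax_vs_logits(config: dict) -> list[str]:
    warnings = []
    if config.get("output_activation") == "softmax" and config.get("loss_from_logits"):
        warnings.append(
            "Output layer applies softmax, but the loss expects raw logits; "
            "probabilities will be squashed twice and training may stall."
        )
    return warnings

config = {"output_activation": "softmax", "loss_from_logits": True}
for message in check_softmax_vs_logits(config):
    print("WARNING:", message)
```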
HindSight
Our perception of our surrounding environment is limited by the constraints of human biology. The field of augmented perception asks how our sensory capabilities can be usefully extended through computational means. We argue that spatial a…
Drill Sergeant
Mapping techniques from software tutorials onto physical craft processes can assist novices in building multi-material assemblies. By providing in-situ step instructions and progress tracking, generating dynamic feedback on technique, and …