Explanipedia

Visualizing Neural Network Imagination Open

Nevan Wichers, V. Tao, Riccardo Volpato, Fazl Barez · 2024

Computer science Psychology

In certain situations, neural networks will represent environment states in their hidden activations. Our goal is to visualize what environment states the networks are representing. We experiment with a recurrent neural network (RNN) archi…

Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation Open

Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers , et al. · 2024

Psychology Computer science

Reinforcement learning (RL) can align language models with non-differentiable reward signals, such as human preferences. However, a major challenge arises from the sparsity of these reward signals - typically, there is only a single reward…

Fusion-Eval: Integrating Assistant Evaluators with LLMs Open

Lei Shu, Nevan Wichers, Liangchen Luo, Yuntian Zhu, Yinxiao Liu , et al. · 2023

Computer science Geography Mathematics

Evaluating natural language systems poses significant challenges, particularly in the realms of natural language understanding and high-level reasoning. In this paper, we introduce 'Fusion-Eval', an innovative approach that leverages Large…

SiRA: Sparse Mixture of Low Rank Adaptation Open

Yun Zhu, Nevan Wichers, Chu‐Cheng Lin, Xinyi Wang, Tianlong Chen , et al. · 2023

Computer science Mathematics Physics

Parameter Efficient Tuning has been an prominent approach to adapt the Large Language Model to downstream tasks. Most previous works considers adding the dense trainable parameters, where all parameters are used to adapt certain task. We f…

SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition Open

Dylan Slack, Yinlam Chow, Bo Dai, Nevan Wichers · 2022

Computer science Engineering

Methods that extract policy primitives from offline demonstrations using deep generative models have shown promise at accelerating reinforcement learning(RL) for new tasks. Intuitively, these methods should also help to trainsafeRLagents b…

ActionBert: Leveraging User Actions for Semantic Understanding of User Interfaces Open

Zecheng He, Srinivas Sunkara, Xiaoxue Zang, Ying Xu, Lijuan Liu , et al. · 2021

Computer science Physics

As mobile devices are becoming ubiquitous, regularly interacting with a variety of user interfaces (UIs) is a common aspect of daily life for many people. To improve the accessibility of these devices and to enable their usage in a variety…

ActionBert: Leveraging User Actions for Semantic Understanding of User Interfaces Open

Zecheng He, Srinivas Sunkara, Xiaoxue Zang, Ying Xu, Lijuan Liu , et al. · 2020

Computer science Physics

As mobile devices are becoming ubiquitous, regularly interacting with a variety of user interfaces (UIs) is a common aspect of daily life for many people. To improve the accessibility of these devices and to enable their usage in a variety…

RL agents Implicitly Learning Human Preferences Open

Nevan Wichers · 2020

Computer science

In the real world, RL agents should be rewarded for fulfilling human preferences. We show that RL agents implicitly learn the preferences of humans in their environment. Training a classifier to predict if a simulated human's preferences a…

Resolving Spurious Correlations in Causal Models of Environments via Interventions Open

Sergei Volodin, Nevan Wichers, Jeremy Nixon · 2020

Computer science Psychology Mathematics

Causal models bring many benefits to decision-making systems (or agents) by making them interpretable, sample-efficient, and robust to changes in the input distribution. However, spurious correlations can lead to wrong causal models and pr…

Resolving Referring Expressions in Images With Labeled Elements Open

Nevan Wichers, Dilek Hakkani‐Tür, Jindong Chen · 2018

Computer science Geography Political science

Images may have elements containing text and a bounding box associated with them, for example, text identified via optical character recognition on a computer screen image, or a natural image with labeled objects. We present an end-to-end …

Hierarchical Long-term Video Prediction without Supervision Open

Nevan Wichers, Ruben Villegas, Dumitru Erhan, Honglak Lee · 2018

Computer science Physics

Much of recent research has been devoted to video prediction and generation, yet most of the previous works have demonstrated only limited success in generating videos on short-term horizons. The hierarchical video prediction method by Vil…

Nevan Wichers YOU? Author Swipe