Explanipedia

Caltech-UCSD Birds-200-2011 Dataset

Catherine Wah, Steve Branson, Peter Welinder, Pietro Perona, Serge Belongie · 2024

Computer science Biology

CUB-200-2011 is an extended version of CUB-200 [7], a challenging dataset of 200 bird species. The extended version roughly doubles the number of images per category and adds new part localization annotations. All images are annotated with…

Training language models to follow instructions with human feedback

Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, et al. · 2022

Computer science Materials science Philosophy

Making language models bigger does not inherently make them better at following a user's intent. For example, large language models can generate outputs that are untruthful, toxic, or simply not helpful to the user. In other words, these m…

Text and Code Embeddings by Contrastive Pre-Training

Arvind Neelakantan, Tao Xu, Raul Puri, Alec Radford, Jesse Michael Han, et al. · 2022

Computer science

Text embeddings are useful features in many applications such as semantic search and computing text similarity. Previous work typically trains models customized for different use cases, varying in dataset choice, training objective and mod…

Evaluating Large Language Models Trained on Code

Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Pondé de Oliveira Pinto, et al. · 2021

Computer science

We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we…

Sim2Real in Robotics and Automation: Applications and Challenges

Sebastian Höfer, Kostas E. Bekris, Ankur Handa, Juan Camilo Gamboa, Melissa Mozifian, et al. · 2021

Computer science Engineering

To perform reliably and consistently over sustained periods of time, large-scale automation critically relies on computer simulation. Simulation allows us and supervisory AI to effectively design, validate, and continuously improve complex…

Asymmetric self-play for automatic goal discovery in robotic manipulation

OpenAI, Matthias Plappert, Raul Sampedro, Tao Xu, Ilge Akkaya, et al. · 2021

Computer science Psychology

We train a single, goal-conditioned policy that can solve many robotic manipulation tasks, including tasks with previously unseen goals and objects. We rely on asymmetric self-play for goal discovery, where two agents, Alice and Bob, play …

Perspectives on Sim2Real Transfer for Robotics: A Summary of the R:SS 2020 Workshop

Sebastian Höfer, Kostas E. Bekris, Ankur Handa, Juan Camilo Gamboa Higuera, Florian Golemo, et al. · 2020

Computer science Engineering

This report presents the debates, posters, and discussions of the Sim2Real workshop held in conjunction with the 2020 edition of the "Robotics: Science and Systems" conference. Twelve leaders of the field took competing debate positions on …

Learning dexterous in-hand manipulation

OpenAI, Marcin Andrychowicz, Bowen Baker, Maciek Chociej, Rafał Józefowicz, Bob McGrew, et al. · 2019

Computer science Psychology

We use reinforcement learning (RL) to learn dexterous in-hand manipulation policies that can perform vision-based object reorientation on a physical Shadow Dexterous Hand. The training is performed in a simulated environment in which we ra…

Solving Rubik's Cube with a Robot Hand

OpenAI, Ilge Akkaya, Marcin Andrychowicz, Maciek Chociej, Mateusz Litwin, et al. · 2019

Computer science Mathematics

We demonstrate that models trained only in simulation can be used to solve a manipulation problem of unprecedented complexity on a real robot. This is made possible by two key components: a novel algorithm, which we call automatic domain r…

ORRB -- OpenAI Remote Rendering Backend

Maciek Chociej, Peter Welinder, Lilian Weng · 2019

Computer science

We present the OpenAI Remote Rendering Backend (ORRB), a system that allows fast and customizable rendering of robotics environments. It is based on the Unity3d game engine and interfaces with the MuJoCo physics simulation library. ORRB wa…

Domain Randomization and Generative Models for Robotic Grasping

Joshua Tobin, Lukas Biewald, Rocky Duan, Marcin Andrychowicz, Ankur Handa, et al. · 2018

Computer science Mathematics

Deep learning-based robotic grasping has made significant progress thanks to algorithmic improvements and increased data availability. However, state-of-the-art models are often trained on as few as hundreds or thousands of unique object i…

Asymmetric Actor Critic for Image-Based Robot Learning

Lerrel Pinto, Marcin Andrychowicz, Peter Welinder, Wojciech Zaremba, Pieter Abbeel · 2018

Computer science Mathematics

Deep reinforcement learning (RL) has proven a powerful technique in many sequential decision making domains. However, robotics poses many challenges for RL, most notably training on a physical system can be expensive and dangerous, which h…

Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

Matthias Plappert, Marcin Andrychowicz, Alex Ray, Bob McGrew, Bowen Baker, et al. · 2018

Computer science Psychology History

The purpose of this technical report is two-fold. First of all, it introduces a suite of challenging continuous control tasks (integrated with OpenAI Gym) based on currently existing robotics hardware. The tasks include pushing, sliding an…

Hindsight Experience Replay

Marcin Andrychowicz, Filip Wolski, Alex Ray, Jonas Schneider, Rachel Fong, et al. · 2017

Computer science Psychology Mathematics

Dealing with sparse rewards is one of the biggest challenges in Reinforcement Learning (RL). We present a novel technique called Hindsight Experience Replay which allows sample-efficient learning from rewards which are sparse and binary an…

Peter Welinder