Robert Nishihara
ESCHER
As distributed applications become increasingly complex, so do their scheduling requirements. This development calls for cluster schedulers that are not only general, but also evolvable. Unfortunately, most existing cluster schedulers are …
Hoplite
Task-based distributed frameworks (e.g., Ray, Dask, Hydro) have become increasingly popular for distributed applications that contain asynchronous and dynamic workloads, including asynchronous gradient descent, reinforcement learning, and …
Hoplite: Efficient Collective Communication for Task-Based Distributed Systems
Collective communication systems such as MPI offer high performance group communication primitives at the cost of application flexibility. Today, an increasing number of distributed applications (e.g., reinforcement learning) require flexib…
Hoplite: Efficient and Fault-Tolerant Collective Communication for Task-Based Distributed Systems
Task-based distributed frameworks (e.g., Ray, Dask, Hydro) have become increasingly popular for distributed applications that contain asynchronous and dynamic workloads, including asynchronous gradient descent, reinforcement learning, and …
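The Hoplite abstracts describe collective communication (broadcast, reduce) inside task-based frameworks with asynchronous, dynamic workloads. The sketch below is not Hoplite's API; it only illustrates the naive broadcast/reduce pattern such frameworks express with plain tasks and an object store, written against Ray's public Python API, which is the setting Hoplite aims to make efficient and fault-tolerant.

```python
# Not Hoplite's API: a minimal sketch of the broadcast/reduce pattern that
# task-based frameworks express with plain tasks and an object store.
import ray
import numpy as np

ray.init()

@ray.remote
def worker_step(weights, seed):
    # Each worker receives the broadcast weights and returns a gradient-like update.
    rng = np.random.default_rng(seed)
    return weights + rng.normal(scale=0.01, size=weights.shape)

weights = np.zeros(4)
weights_ref = ray.put(weights)                 # "broadcast": one copy in the object store
updates = [worker_step.remote(weights_ref, s) for s in range(8)]
reduced = np.mean(ray.get(updates), axis=0)    # "reduce": gather and average on the driver
print(reduced)
```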
Lineage stash
As cluster computing frameworks such as Spark, Dryad, Flink, and Ray are being deployed in mission critical applications and on larger and larger clusters, their ability to tolerate failures is growing in importance. These frameworks emplo…
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees
Monte Carlo Tree Search (MCTS) algorithms perform simulation-based search to improve policies online. During search, the simulation policy is adapted to explore the most promising lines of play. MCTS has been used by state-of-the-art pr…
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees
Monte Carlo Tree Search (MCTS) algorithms perform simulation-based search to improve policies online. During search, the simulation policy is adapted to explore the most promising lines of play. MCTS has been used by state-of-the-art progr…
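For orientation on the MCTS background these abstracts reference, the sketch below shows the standard UCT selection rule that vanilla MCTS uses to pick "promising lines of play". It is background only, not the paper's policy-gradient replacement for tree statistics.

```python
import math

def uct_score(q_value, parent_visits, child_visits, c=1.4):
    """Standard UCT rule: exploitation (mean value so far) plus an exploration
    bonus that shrinks as an action is visited more often."""
    if child_visits == 0:
        return float("inf")  # try each untried action at least once
    return q_value + c * math.sqrt(math.log(parent_visits) / child_visits)

def select_action(children):
    # children: dict mapping action -> (q_value, visit_count)
    parent_visits = sum(visits for _, visits in children.values()) or 1
    return max(children,
               key=lambda a: uct_score(children[a][0], parent_visits, children[a][1]))
```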
On Systems and Algorithms for Distributed Machine Learning
The advent of algorithms capable of leveraging vast quantities of data and computational resources has led to the proliferation of systems and tools aimed to facilitate the development and usage of these algorithms. Hardware trends, includ…
Tune: A Research Platform for Distributed Model Selection and Training
Modern machine learning algorithms are increasingly computationally demanding, requiring specialized hardware and distributed computation to achieve high performance in a reasonable time frame. Many hyperparameter search algorithms have be…
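A minimal sketch of the kind of hyperparameter sweep Tune runs. The entry points used here (tune.run, tune.report, ExperimentAnalysis.get_best_config) come from Ray 1.x-era releases and have changed in later versions, so treat the exact names as assumptions rather than the paper's definitive API.

```python
# Sketch of a Tune-style sweep; exact API names are Ray 1.x-era assumptions.
from ray import tune

def trainable(config):
    # Hypothetical objective: pretend a larger learning rate scores higher.
    for step in range(3):
        tune.report(mean_score=config["lr"] * (step + 1))

analysis = tune.run(
    trainable,
    config={"lr": tune.grid_search([0.001, 0.01, 0.1])},  # three trials, run in parallel
)
print(analysis.get_best_config(metric="mean_score", mode="max"))
```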
Ray RLlib: A Framework for Distributed Reinforcement Learning
Reinforcement learning (RL) algorithms involve the deep nesting of highly irregular computation patterns, each of which typically exhibits opportunities for distributed computation. We argue for distributing RL components in a composable w…
RLlib: Abstractions for Distributed Reinforcement Learning
Reinforcement learning (RL) algorithms involve the deep nesting of highly irregular computation patterns, each of which typically exhibits opportunities for distributed computation. We argue for distributing RL components in a composable w…
Ray RLLib: A Composable and Scalable Reinforcement Learning Library
Reinforcement learning (RL) algorithms involve the deep nesting of distinct components, where each component typically exhibits opportunities for distributed computation. Current RL libraries offer parallelism at the level of the entire pr…
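The RLlib papers argue for composing distributed RL from reusable components rather than parallelizing whole programs. As a rough illustration of the resulting user experience, the sketch below launches an RLlib algorithm as a single Tune trainable; the "PPO" trainable name and config keys are from Ray 1.x-era RLlib and are assumptions here, not the papers' definitive interface.

```python
# Sketch of driving an RLlib algorithm through Tune (Ray 1.x-era names assumed).
import ray
from ray import tune

ray.init()
tune.run(
    "PPO",
    stop={"training_iteration": 5},
    config={
        "env": "CartPole-v0",   # any registered Gym environment
        "num_workers": 2,       # parallel rollout workers
    },
)
```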
Ray: A Distributed Framework for Emerging AI Applications
The next generation of AI applications will continuously interact with the environment and learn from these interactions. These applications impose new and demanding systems requirements, both in terms of performance and flexibility. In th…
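The Ray paper describes a unified task and actor model for these workloads. The sketch below shows those two primitives through Ray's public Python API: stateless remote functions returning futures, and stateful actors whose method calls execute against per-instance state.

```python
# A small sketch of the task and actor primitives, using Ray's public Python API.
import ray

ray.init()

@ray.remote
def square(x):
    # A stateless task: runs anywhere in the cluster, returns a future (ObjectRef).
    return x * x

@ray.remote
class Counter:
    # A stateful actor: method calls execute serially against this instance's state.
    def __init__(self):
        self.total = 0
    def add(self, value):
        self.total += value
        return self.total

futures = [square.remote(i) for i in range(4)]
print(ray.get(futures))                 # [0, 1, 4, 9]

counter = Counter.remote()
print(ray.get(counter.add.remote(10)))  # 10
```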
Discovering Causal Signals in Images
This paper establishes the existence of observable footprints that reveal the "causal dispositions" of the object categories appearing in collections of images. We achieve this goal in two steps. First, we take a learning approach to obser…
Real-Time Machine Learning: The Missing Pieces
Machine learning applications are increasingly deployed not only to serve predictions using static models, but also as tightly-integrated components of feedback loops involving dynamic, real-time decision making. These applications pose a …