Koray Kavukcuoglu
Capabilities of Gemini Models in Medicine
Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong genera…
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
We introduce RecurrentGemma, a family of open language models which uses Google's novel Griffin architecture. Griffin combines linear recurrences with local attention to achieve excellent performance on language. It has a fixed-sized state…
Gemma: Open Models Based on Gemini Research and Technology
This work introduces Gemma, a family of lightweight, state-of-the-art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language unde…
Competition-level code generation with AlphaCode
Programming is a powerful and ubiquitous problem-solving tool. Systems that can assist programmers or even generate programs themselves could make programming more productive and accessible. Recent transformer-based neural network models s…
Improving alignment of dialogue agents via targeted human judgements
We present Sparrow, an information-seeking dialogue agent trained to be more helpful, correct, and harmless compared to prompted language model baselines. We use reinforcement learning from human feedback to train our models with two new a…
Magnetic control of tokamak plasmas through deep reinforcement learning
Unified Scaling Laws for Routed Language Models
The performance of a language model has been shown to be effectively modeled as a power-law in its parameter count. Here we study the scaling behaviors of Routing Networks: architectures that conditionally use only a subset of their parame…
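The power-law relationship mentioned in this abstract can be made concrete with a short sketch. This is a hypothetical illustration, not the paper's methodology: if loss follows L(N) = a · N^(−α) in parameter count N, then log L is linear in log N, so (a, α) can be recovered with an ordinary least-squares fit in log-log space.

```python
import math

def fit_power_law(ns, losses):
    """Least-squares fit of log(loss) = log(a) - alpha * log(n).

    Returns (a, alpha) for the assumed form loss = a * n**(-alpha).
    """
    xs = [math.log(n) for n in ns]
    ys = [math.log(l) for l in losses]
    k = len(xs)
    mx = sum(xs) / k
    my = sum(ys) / k
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    intercept = my - slope * mx
    return math.exp(intercept), -slope

# Synthetic data generated from a known power law (a=10, alpha=0.07),
# so the fit should recover those values up to floating-point error.
ns = [1e6, 1e7, 1e8, 1e9]
losses = [10.0 * n ** -0.07 for n in ns]
a, alpha = fit_power_law(ns, losses)
```

The paper's contribution is studying how such fits change when routing makes the effective parameter count differ from the total; the sketch only shows the baseline dense-model fitting procedure.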
Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world. In this paper, we present an analysis of Transformer-based…
Applying and improving AlphaFold at CASP14
We describe the operation and improvement of AlphaFold, the system that was entered by the team AlphaFold2 to the “human” category in the 14th Critical Assessment of Protein Structure Prediction (CASP14). The AlphaFold system entered in CA…
Author response for "Applying and improving AlphaFold at CASP14"
Highly accurate protein structure prediction for the human proteome
Highly accurate protein structure prediction with AlphaFold
Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort [1–4], the structures of around 100,000 unique proteins have been determ…
Bootstrap your own latent: A new approach to self-supervised Learning
We introduce Bootstrap Your Own Latent (BYOL), a new approach to self-supervised image representation learning. BYOL relies on two neural networks, referred to as online and target networks, that interact and learn from each other. From an…
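A key mechanism behind the online/target interaction described here is that the target network's weights track the online network's weights via an exponential moving average (EMA), so the target changes slowly and provides stable prediction targets. The following is a minimal sketch of that update rule alone, with toy weight lists standing in for real network parameters; it is not the authors' implementation.

```python
def ema_update(online, target, tau=0.99):
    """BYOL-style target update: target <- tau * target + (1 - tau) * online.

    In a real network this is applied per parameter tensor; here the
    'weights' are plain lists of floats for illustration.
    """
    return [tau * t + (1 - tau) * o for o, t in zip(online, target)]

online_w = [1.0, 2.0]   # hypothetical online-network weights (held fixed)
target_w = [0.0, 0.0]   # target starts from a different initialisation
for _ in range(3):      # three optimisation steps
    target_w = ema_update(online_w, target_w)
# After k steps with fixed online weights, target = (1 - tau**k) * online.
```

The slow-moving target is what lets BYOL avoid collapse without using negative pairs, which is the surprising part of the result.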
Protein structure prediction using multiple deep neural networks in the 13th Critical Assessment of Protein Structure Prediction (CASP13)
We describe AlphaFold, the protein structure prediction system that was entered by the group A7D in CASP13. Submissions were made by three free-modeling (FM) methods which combine the predictions of three neural networks. All three systems…
Human-level performance in 3D multiplayer games with population-based reinforcement learning
Artificial teamwork: Artificially intelligent agents are getting better and better at two-player games, but most real-world endeavors require teamwork. Jaderberg et al. designed a computer program that excels at playing the video game Quake…
The StreetLearn Environment and Dataset
Navigation is a rich and well-grounded problem domain that drives progress in many different areas of research: perception, planning, memory, exploration, and optimisation in particular. Historically these challenges have been separately c…
Learning to Navigate in Cities Without a Map
Navigating through unstructured environments is a basic capability of intelligent creatures, and thus is of fundamental interest in the study and development of artificial intelligence. Long-range navigation is a complex cognitive task tha…
Unsupervised Predictive Memory in a Goal-Directed Agent
Animals execute goal-directed behaviours despite the limited range and scope of their sensors. To cope, they explore environments and store memories maintaining estimates of important information that is not presently available. Recently, …
Efficient Neural Audio Synthesis
Sequential models achieve state-of-the-art results in audio, visual and textual domains with respect to both estimating the data distribution and generating high-quality samples. Efficient sampling for this class of models has however rema…
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters. A key challenge is to handle the increased amount of data and extended training time. We have developed a…
Parallel WaveNet: Fast High-Fidelity Speech Synthesis
The recently-developed WaveNet architecture is the current state of the art in realistic speech synthesis, consistently rated as more natural sounding for many different languages than any previous system. However, because WaveNet relies o…
Population Based Training of Neural Networks
Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of hyperparameters such as model architecture, loss function, and optimisation algorithm. In …
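The exploit/explore loop at the core of Population Based Training can be sketched in a few lines. This is a toy illustration (not DeepMind's implementation): the worst-performing member of the population copies the best member's weights and hyperparameters (exploit), then perturbs the hyperparameters (explore). Here a member is just a dict with a learning rate and a score.

```python
import random

def pbt_step(population, rng):
    """One exploit/explore step over a population of training runs.

    Each member is a dict {'lr': hyperparameter, 'score': performance}.
    Copying weights is modelled by copying the score directly.
    """
    best = max(population, key=lambda m: m["score"])
    worst = min(population, key=lambda m: m["score"])
    worst["score"] = best["score"]                    # exploit: copy weights
    worst["lr"] = best["lr"] * rng.choice([0.8, 1.2])  # explore: perturb hyperparam
    return population

rng = random.Random(0)
pop = [{"lr": 0.1, "score": 1.0}, {"lr": 0.5, "score": 3.0}]
pop = pbt_step(pop, rng)
```

In the paper this loop runs asynchronously across many workers while they continue training, so hyperparameter schedules are discovered online rather than fixed in advance.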
Neural Discrete Representation Learning
Learning useful representations without supervision remains a key challenge in machine learning. In this paper, we propose a simple yet powerful generative model that learns such discrete representations. Our model, the Vector Quantised-Va…
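The discrete bottleneck this abstract refers to is a nearest-neighbour lookup into a learned codebook: each encoder output is snapped to its closest codebook vector, and the index of that vector is the discrete latent code. The sketch below shows only that quantisation step with a hand-written codebook; the straight-through gradient estimator and codebook learning from the paper are omitted.

```python
def quantise(z, codebook):
    """Return (index, vector) of the codebook entry nearest to z in L2 distance."""
    best = min(
        range(len(codebook)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(z, codebook[i])),
    )
    return best, codebook[best]

# Hypothetical 2-D codebook with three entries; a real VQ-VAE learns
# hundreds of high-dimensional codes.
codebook = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]
idx, vec = quantise([0.9, 1.2], codebook)  # nearest entry is [1.0, 1.0]
```

Because the decoder only ever sees codebook vectors, the latent space is discrete by construction, which is what enables the autoregressive prior over codes described in the paper.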
Hierarchical Representations for Efficient Architecture Search
We explore efficient neural architecture search methods and show that a simple yet powerful evolutionary algorithm can discover new architectures with excellent performance. Our approach combines a novel hierarchical genetic representation…
Automated Curriculum Learning for Neural Networks
We introduce a method for automatically selecting the path, or syllabus, that a neural network follows through a curriculum so as to maximise learning efficiency. A measure of the amount that the network learns from each data sample is pro…
FeUdal Networks for Hierarchical Reinforcement Learning
We introduce FeUdal Networks (FuNs): a novel architecture for hierarchical reinforcement learning. Our approach is inspired by the feudal reinforcement learning proposal of Dayan and Hinton, and gains power and efficacy by decoupling end-t…
Understanding Synthetic Gradients and Decoupled Neural Interfaces
When training neural networks, the use of Synthetic Gradients (SG) allows layers or modules to be trained without update locking - without waiting for a true error gradient to be backpropagated - resulting in Decoupled Neural Interfaces (D…
Interaction Networks for Learning about Objects, Relations and Physics
Reasoning about objects, relations, and physics is central to human intelligence, and a key goal of artificial intelligence. Here we introduce the interaction network, a model which can reason about how objects in complex systems interact,…