Explanipedia

Experimental research on the TCV tokamak Open

B.P. Duval, Abbas Abdolmaleki, M. Agostini, C J Ajay, S. Alberti , et al. · 2024

Tokamak à configuration variable (TCV), recently celebrating 30 years of near-continual operation, continues in its missions to advance outstanding key physics and operational scenario issues for ITER and the design of future power plants …

Training Compute-Optimal Large Language Models Open

Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai , et al. · 2022

We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget. We find that current large language models are significantly undertrained, a consequence of the recent focus…

Magnetic control of tokamak plasmas through deep reinforcement learning Open

Jonas Degrave, F. Felici, Jonas Buchli, Michael Neunert, Brendan Tracey , et al. · 2022

Improving language models by retrieving from trillions of tokens Open

Sebastian Borgeaud, Arthur Mensch, Jordan Hoffmann, Trevor Cai, Eliza Rutherford , et al. · 2021

We enhance auto-regressive language models by conditioning on document chunks retrieved from a large corpus, based on local similarity with preceding tokens. With a $2$ trillion token database, our Retrieval-Enhanced Transformer (RETRO) ob…

Scaling Language Models: Methods, Analysis & Insights from Training Gopher Open

Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann , et al. · 2021

Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world. In this paper, we present an analysis of Transformer-based…

Transformation-based Adversarial Video Prediction on Large-Scale Data Open

Pauline Luc, Aidan Clark, Sander Dieleman, Diego de Las Casas, Yotam Doron , et al. · 2020

Recent breakthroughs in adversarial generative modeling have led to models capable of producing video samples of high quality, even on large and complex datasets of real-world video. In this work, we focus on the task of video prediction, …

DeepMind Control Suite Open

Yuval Tassa, Yotam Doron, Alistair Muldal, Tom Erez, Yazhe Li , et al. · 2018

The DeepMind Control Suite is a set of continuous control tasks with a standardised structure and interpretable rewards, intended to serve as performance benchmarks for reinforcement learning agents. The tasks are written in Python and pow…

Dawn of the Selfie Era: The Whos, Wheres, and Hows of Selfies on Instagram Open

Flávio Souza, Diego de Las Casas, Vinícius Flores, Sun-Bum Youn, Meeyoung Cha , et al. · 2015

Online interactions are increasingly involving images, especially those containing human faces, which are naturally attention grabbing and more effective at conveying feelings than text. To understand this new convention of digital culture…

Diego de Las Casas YOU? Author Swipe