Heewoo Jun
Shap-E: Generating Conditional 3D Implicit Functions
We present Shap-E, a conditional generative model for 3D assets. Unlike recent work on 3D generative models which produce a single output representation, Shap-E directly generates the parameters of implicit functions that can be rendered a…
Point-E: A System for Generating 3D Point Clouds from Complex Prompts
While recent work on text-conditional 3D object generation has shown promising results, the state-of-the-art methods typically require multiple GPU-hours to produce a single sample. This is in stark contrast to state-of-the-art generative …
Efficient Training of Language Models to Fill in the Middle
We show that autoregressive language models can learn to infill text after we apply a straightforward transformation to the dataset, which simply moves a span of text from the middle of a document to its end. While this data augmentation h…
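The data transformation described above can be sketched in a few lines: cut a random middle span out of a document and move it to the end, marking the pieces with sentinel tokens. The sentinel names here are illustrative, not the paper's exact tokens:

```python
import random

def fim_transform(doc: str,
                  sentinel_pre: str = "<PRE>",
                  sentinel_suf: str = "<SUF>",
                  sentinel_mid: str = "<MID>") -> str:
    """Move a random middle span of `doc` to the end, so an autoregressive
    model trained left-to-right learns to infill. Sentinel names are
    placeholders for whatever special tokens the tokenizer defines."""
    # Pick two distinct cut points and sort them into (i, j).
    i, j = sorted(random.sample(range(len(doc) + 1), 2))
    prefix, middle, suffix = doc[:i], doc[i:j], doc[j:]
    # Training example: prefix, then suffix, then the moved middle.
    return f"{sentinel_pre}{prefix}{sentinel_suf}{suffix}{sentinel_mid}{middle}"
```

At inference time the model is prompted with `<PRE>prefix<SUF>suffix<MID>` and generates the missing middle left to right.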
Evaluating Large Language Models Trained on Code
We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we…
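HumanEval scores functional correctness with the pass@k metric; a minimal sketch of the unbiased estimator commonly associated with that evaluation, 1 - C(n-c, k)/C(n, k), where n samples are generated per problem and c of them pass the unit tests:

```python
import math

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased estimate of the probability that at least one of k samples,
    drawn without replacement from n generated samples (c correct), passes."""
    if n - c < k:
        # Fewer than k incorrect samples exist, so any draw of k must
        # include at least one correct sample.
        return 1.0
    return 1.0 - math.comb(n - c, k) / math.comb(n, k)
```

With n = 2 samples of which c = 1 is correct, pass@1 is 0.5, as expected for a uniform draw of one sample.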
Scaling Laws for Autoregressive Generative Modeling
We identify empirical scaling laws for the cross-entropy loss in four domains: generative image modeling, video modeling, multimodal image↔text models, and mathematical problem solving. In all cases autoregressive Transform…
Jukebox: A Generative Model for Music
We introduce Jukebox, a model that generates music with singing in the raw audio domain. We tackle the long context of raw audio using a multi-scale VQ-VAE to compress it to discrete codes, and modeling those using autoregressive Transform…
Fast Spectrogram Inversion Using Multi-Head Convolutional Neural Networks
We propose the multi-head convolutional neural network (MCNN) architecture for waveform synthesis from spectrograms. Nonlinear interpolation in MCNN is employed with transposed convolution layers in parallel heads. MCNN achieves more th…
Language Modeling at Scale
We show how Zipf's Law can be used to scale up language modeling (LM) to take advantage of more training data and more GPUs. LM plays a key role in many important natural language applications such as speech recognition and machine transla…
Cold Fusion: Training Seq2Seq Models Together with Language Models
Sequence-to-sequence (Seq2Seq) models with attention have excelled at tasks which involve generating natural language sentences such as machine translation, image captioning and speech recognition. Performance has further been improved by …
Robust Speech Recognition Using Generative Adversarial Networks
This paper describes a general, scalable, end-to-end framework that uses the generative adversarial network (GAN) objective to enable robust speech recognition. Encoders trained with the proposed approach enjoy improved invariance by learn…
Deep Learning Scaling is Predictable, Empirically
Deep learning (DL) creates impactful advances following a virtuous recipe: model architecture search, creating large training data sets, and scaling computation. It is widely believed that growing training sets and models should improve ac…
Reducing Bias in Production Speech Models
Replacing hand-engineered pipelines with end-to-end deep learning systems has enabled strong results in applications like speech and object recognition. However, the causality and latency constraints of production systems put end-to-end sp…