Explanipedia

Towards Responsible Development of Generative AI for Education: An Evaluation-Driven Approach

Irina Jurenka, Markus Kunesch, Kevin R. McKee, Daniel Gillick, Shaojian Zhu, et al. · 2024

A major challenge facing the world is the provision of equitable and universal access to quality education. Recent advances in generative AI (gen AI) have created excitement about the potential of new technologies to offer a personal tutor…

Large-scale multilingual audio visual dubbing

Yi Yang, Brendan Shillingford, Yannis Assael, Miaosen Wang, Wendi Liu, et al. · 2020

We describe a system for large-scale audiovisual translation and dubbing, which translates videos from one language to another. The source language's speech content is transcribed to text, translated, and automatically synthesized into tar…

High Fidelity Speech Synthesis with Adversarial Networks

Mikołaj Bińkowski, Jeff Donahue, Sander Dieleman, Aidan Clark, Erich Elsen, et al. · 2019

Generative adversarial networks have seen rapid development in recent years and have led to remarkable improvements in generative modelling of images. However, their application in the audio domain has received limited attention, and autor…

Sample-efficient adaptive text-to-speech

Yutian Chen, Yannis Assael, Brendan Shillingford, David Budden, Scott Reed, et al. · 2018

We present a meta-learning approach for adaptive text-to-speech (TTS) with few data. During training, we learn a multi-speaker model using a shared conditional WaveNet core and independent learned embeddings for each speaker. The aim of tr…

Sample Efficient Adaptive Text-to-Speech

Yutian Chen, Yannis Assael, Brendan Shillingford, David Budden, Scott Reed, et al. · 2018

We present a meta-learning approach for adaptive text-to-speech (TTS) with few data. During training, we learn a multi-speaker model using a shared conditional WaveNet core and independent learned embeddings for each speaker. The aim of tr…

Parallel WaveNet: Fast High-Fidelity Speech Synthesis

Aäron van den Oord, Yazhe Li, I. Babuschkin, Karen Simonyan, Oriol Vinyals, et al. · 2017

The recently-developed WaveNet architecture is the current state of the art in realistic speech synthesis, consistently rated as more natural sounding for many different languages than any previous system. However, because WaveNet relies o…

Luis C. Cobo