Training Optimal Large Diffusion Language Models Article Swipe

PDF

Related Concepts

No concepts available.

Jinjie Ni , Qian Liu , Chao Du , Longxu Dou , Hang Yan , Zili Wang , Tianyu Pang , Michael Shieh ·

Uncle Sam recruitment poster (US)

YOU? · · 2025 · Open Access · · DOI: https://doi.org/10.48550/arxiv.2510.03280 · OA: W4414968636

We introduce Quokka, the first systematic scaling law for diffusion language models (DLMs), encompassing both compute-constrained and data-constrained regimes, and studying the key modeling and optimization designs. Quokka is a good friend of Chinchilla and provides wider scopes. We hope the results would bring short-term practical guidance in DLMs training and long-term inspirations for the whole AI community.

Related Topics

The Dancers At The End Of Time

The Dancers At The End Of Time

The Bureaucrats (1936 Film)

The Bureaucrats (1936 Film)

The False Mirror

The False Mirror

The Massacre At Chios

The Massacre At Chios

Weapons (2025 Film)

Weapons (2025 Film)

Squid Game Season 3

Squid Game Season 3

Technological Fix

Technological Fix

Electronic Colonialism

Electronic Colonialism

Lauren Sánchez

Collective Action Problem

Collective Action Problem

Shefali Jariwala

Shefali Jariwala

Hackers: Heroes Of The Computer Revolution

Hackers: Heroes Of The Computer Revolution

Community Fridge

Community Fridge

Compassion Fade

Compassion Fade

Takahiro Shiraishi

Takahiro Shiraishi

The Wealth Of Networks

The Wealth Of Networks

This Changes Everything (Book)

This Changes Everything (Book)

Silencing The Past

Silencing The Past

Direct Action: An Ethnography

Direct Action: An Ethnography

The Black Jacobins

The Black Jacobins

Caliban And The Witch

Caliban And The Witch

The Spirit Level (Wilkinson And Pickett Book)

The Spirit Level (Wilkinson And Pickett Book)

Mutual Aid: A Factor Of Evolution

Mutual Aid: A Factor Of Evolution

Orwell (Video Game)

Orwell (Video Game)

Kannappa (Film)

Kannappa (Film)

Finding more related topics…

Fetching topic information...