Exploring foci of:
arXiv (Cornell University)
CoDA: Coding LM via Diffusion Adaptation
September 2025 • Haolin Chen, Shiyu Wang, Can Qin, Bo Pang, Zuxin Liu, Jielin Qiu, Jianguo Zhang, Zhou Yingbo, Zeyuan Chen, Ran Xu, Shelby Heinecke, Silvio Savarese, …
Diffusion language models promise bidirectional context and infilling capabilities that autoregressive coders lack, yet practical systems remain heavyweight. We introduce CoDA, a 1.7B-parameter diffusion coder trained on TPU with a fully open-source training pipeline. CoDA pairs large-scale diffusion pre-training with code-centric mid-training and instruction tuning, enabling confidence-guided sampling that keeps inference latency competitive. On Humaneval, MBPP, and EvalPlus, CoDA-1.7B-Instruct matches or surpass…
Politics And The English Language
C (Programming Language)
Language Interpretation
Greek Language
List Of Iphone Models
Irish Language
Indonesian Language
Tagalog Language
Catalan Language
Scratch (Programming Language)
Scots Language
Vietnamese Language
Armenian Language
Thai Language
Coptic Language
Ukrainian Language
Odia Language
Azerbaijani Language
Sinhala Language
Breton Language
Swedish Language
Llama (Language Model)
Khmer Language
Mongolian Language