Exploring foci of:
arXiv (Cornell University)
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
June 2025 • Zhoujun Cheng, Shibo Hao, Tianyang Liu, Fan Zhou, Y. H. Xie, Feng Yao, Yuexin Bian, Yonghao Zhuang, Nolan Dey, Yuanyuan Zha, Yi Gu, Kun Zhou, Haijun …
Reinforcement learning (RL) has emerged as a promising approach to improve large language model (LLM) reasoning, yet most open efforts focus narrowly on math and code, limiting our understanding of its broader applicability to general reasoning. A key challenge lies in the lack of reliable, scalable RL reward signals across diverse reasoning domains. We introduce Guru, a curated RL reasoning corpus of 92K verifiable examples spanning six reasoning domains--Math, Code, Science, Logic, Simulation, and Tabular--each …
Learning Theory (Education)
From (TV Series)
The Man From Earth
Health Effects Arising From The September 11 Attacks
The Man From U.N.C.L.E.
Escape From Alcatraz (Film)
The Man From U.N.C.L.E. (Film)
From Zero World Tour
From Russia With Love (Film)
Letters From Iwo Jima
From Paris With Love (Film)
Freed From Desire
List Of Legendary Creatures From Japan
Attention (Machine Learning)
Far From The Madding Crowd
The Judge From Hell
From Up On Poppy Hill
The Boys From Brazil (Film)
Songs From The Big Chair
With A Little Help From My Friends
The Spy Who Came In From The Cold
Escape From The Planet Of The Apes
Cowboys From Hell
Killer Klowns From Outer Space