Exploring foci of:
arXiv (Cornell University)
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
February 2025 • Yuxiang Wei, Olivier Duchenne, Jade Copet, Quentin Carbonneaux, Lingming Zhang, Daniel Fried, Gabriel Synnaeve, Rishabh Singh, Sida I. Wang
The recent DeepSeek-R1 release has demonstrated the immense potential of reinforcement learning (RL) in enhancing the general reasoning capabilities of large language models (LLMs). While DeepSeek-R1 and other follow-up work primarily focus on applying RL to competitive coding and math problems, this paper introduces SWE-RL, the first approach to scale RL-based LLM reasoning for real-world software engineering. Leveraging a lightweight rule-based reward (e.g., the similarity score between ground-truth and LLM-gene…
Learning Curve
Open-Source License
Learning Theory (Education)
Experiential Learning
Open Outcry
Practice (Learning Method)
Open Education
Learning Environment
Machine Learning
French Open
Australian Open
U.S. Senior Open
U.S. Open (Golf)
Deep Learning
Reinforcement Learning
Open-Source Software
Free And Open-Source Software
Learning Standards
Attention (Machine Learning)
Open Society Foundations
Eyes Wide Open (Sabrina Carpenter Album)
Open University
Open Range (2003 Film)
List Of The Open Championship Venues