Xu, Jingjing
YOU?
Author Swipe
View article: Long Chain-of-Thought Fine-tuning via Understanding-to-Reasoning Transition
Long Chain-of-Thought Fine-tuning via Understanding-to-Reasoning Transition Open
Reasoning models have demonstrated remarkable performance on complex tasks by generating long reasoning traces prior to producing final answers. However, previous research on long-context scaling in language models has generally focused on…