A. Xiong
LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?
Competitive programming problems increasingly serve as valuable benchmarks to evaluate the coding capabilities of large language models (LLMs) due to their complexity and ease of verification. Yet, current coding benchmarks face limitation…