Optimizing Large Language Model Hyperparameters for Code Generation

Exploring foci of: arXiv (Cornell University) Optimizing Large Language Model Hyperparameters for Code Generation August 2024 • Chetan Arora, Ahnaf Ibn Sayeed, Sherlock A. Licorish, Fanyu Wang, Christoph Treude Large Language Models (LLMs), such as GPT models, are increasingly used in software engineering for various tasks, such as code generation, requirements management, and debugging. While automating these tasks has garnered significant attention, a systematic study on the impact of varying hyperparameters on code generation outcomes remains unexplored. This study aims to assess LLMs' code generation performance by exhaustively exploring the impact of various hyperparameters. Hyperparameters for LLMs are adjustable s… Open Article Page

Computer Science Programming Language Artificial Intelligence Open Article