Exploring foci of:
arXiv (Cornell University)
Using Large Language Models for Hyperparameter Optimization
December 2023 • Michael R. Zhang, Nishkrit Desai, Juhan Bae, Jonathan Lorraine, Jimmy Ba
This paper explores the use of foundational large language models (LLMs) in hyperparameter optimization (HPO). Hyperparameters are critical in determining the effectiveness of machine learning models, yet their optimization often relies on manual approaches in limited-budget settings. By prompting LLMs with dataset and model descriptions, we develop a methodology where LLMs suggest hyperparameter configurations, which are iteratively refined based on model performance. Our empirical evaluations on standard benchma…
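The loop the abstract describes (prompt with a task description, receive a suggested configuration, evaluate it, feed the result back) can be sketched in Python as below. This is a minimal illustration under stated assumptions, not the authors' implementation: `mock_llm` is a hypothetical stand-in for a real model call, `toy_validation_loss` is an invented objective, and the prompt wording, function names, and hyperparameter names are all made up for the example.

```python
import json
import math
import random
from typing import Callable, Dict, List, Tuple

Config = Dict[str, float]
History = List[Tuple[Config, float]]


def build_prompt(task_description: str, history: History) -> str:
    """Format the task description and all past trials as a text prompt."""
    lines = [task_description, "Previous trials (config -> validation loss):"]
    for config, loss in history:
        lines.append(f"{json.dumps(config)} -> {loss:.4f}")
    lines.append("Reply with the next config as JSON.")
    return "\n".join(lines)


def mock_llm(prompt: str, rng: random.Random) -> str:
    """Hypothetical stand-in for an LLM: perturbs the best trial in the prompt."""
    trials = []
    for line in prompt.splitlines():
        if line.startswith("{") and " -> " in line:
            cfg_text, loss_text = line.rsplit(" -> ", 1)
            trials.append((json.loads(cfg_text), float(loss_text)))
    if not trials:
        # No feedback yet: return a fixed initial guess.
        return json.dumps({"learning_rate": 0.1, "weight_decay": 0.01})
    best_cfg, _ = min(trials, key=lambda t: t[1])
    # Scale each value by a random factor between 10^-0.5 and 10^0.5.
    proposal = {k: v * 10 ** rng.uniform(-0.5, 0.5) for k, v in best_cfg.items()}
    return json.dumps(proposal)


def toy_validation_loss(config: Config) -> float:
    """Invented objective with its minimum at lr=1e-3, wd=1e-4 (log scale)."""
    lr_term = (math.log10(config["learning_rate"]) + 3) ** 2
    wd_term = (math.log10(config["weight_decay"]) + 4) ** 2
    return lr_term + wd_term


def llm_hpo(
    task_description: str,
    evaluate: Callable[[Config], float],
    llm: Callable[[str, random.Random], str],
    budget: int = 10,
    seed: int = 0,
) -> Tuple[Tuple[Config, float], History]:
    """Run `budget` suggest-evaluate rounds and return the best trial and history."""
    rng = random.Random(seed)
    history: History = []
    for _ in range(budget):
        prompt = build_prompt(task_description, history)
        config = json.loads(llm(prompt, rng))
        history.append((config, evaluate(config)))
    return min(history, key=lambda t: t[1]), history


if __name__ == "__main__":
    best, trials = llm_hpo("Tune an MLP classifier.", toy_validation_loss, mock_llm)
    print("best config:", best[0], "loss:", best[1])
```

Replacing `mock_llm` with a function that sends the prompt to an actual model and returns its JSON reply would keep the rest of the loop unchanged; parsing and validating a real model's output would need more care than `json.loads` alone.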
Artificial Intelligence
Computer Science
Machine Learning
Support Vector Machine
Algorithm
Programming Language