Efficient Exploration for LLMs Article Swipe

PDF

Related Concepts

Political science

Vikranth Dwaracherla , Seyed Mohammad Asghari , Botao Hao , Benjamin Van Roy ·

Uncle Sam recruitment poster (US)

YOU? · · 2024 · Open Access · · DOI: https://doi.org/10.48550/arxiv.2402.00396 · OA: W4391505508

We present evidence of substantial benefit from efficient exploration in gathering human feedback to improve large language models. In our experiments, an agent sequentially generates queries while fitting a reward model to the feedback received. Our best-performing agent generates queries using double Thompson sampling, with uncertainty represented by an epistemic neural network. Our results demonstrate that efficient exploration enables high levels of performance with far fewer queries. Further, both uncertainty estimation and the choice of exploration scheme play critical roles.

Related Topics

The Dancers At The End Of Time

The Dancers At The End Of Time

The Bureaucrats (1936 Film)

The Bureaucrats (1936 Film)

The False Mirror

The False Mirror

The Massacre At Chios

The Massacre At Chios

Weapons (2025 Film)

Weapons (2025 Film)

Squid Game Season 3

Squid Game Season 3

Technological Fix

Technological Fix

Electronic Colonialism

Electronic Colonialism

Lauren Sánchez

Collective Action Problem

Collective Action Problem

Shefali Jariwala

Shefali Jariwala

Hackers: Heroes Of The Computer Revolution

Hackers: Heroes Of The Computer Revolution

Community Fridge

Community Fridge

Compassion Fade

Compassion Fade

Takahiro Shiraishi

Takahiro Shiraishi

The Wealth Of Networks

The Wealth Of Networks

This Changes Everything (Book)

This Changes Everything (Book)

Silencing The Past

Silencing The Past

Direct Action: An Ethnography

Direct Action: An Ethnography

The Black Jacobins

The Black Jacobins

Caliban And The Witch

Caliban And The Witch

The Spirit Level (Wilkinson And Pickett Book)

The Spirit Level (Wilkinson And Pickett Book)

Mutual Aid: A Factor Of Evolution

Mutual Aid: A Factor Of Evolution

Orwell (Video Game)

Orwell (Video Game)

Kannappa (Film)

Kannappa (Film)

Finding more related topics…

Fetching topic information...