Explanipedia

OptionZero: Planning with Learned Options Open

Po‐Wei Huang, Pei-Chiun Peng, Hung Guei, Ti-Rong Wu · 2025

Planning with options -- a sequence of primitive actions -- has been shown effective in reinforcement learning within complex environments. Previous studies have focused on planning with predefined options or learned options through expert…

Game Solving with Online Fine-Tuning Open

Ti-Rong Wu, Hung Guei, Ting Han Wei, Chung-Chin Shih, Jui-Te Chin , et al. · 2023

Game solving is a similar, yet more difficult task than mastering a game. Solving a game typically means to find the game-theoretic value (outcome given optimal play), and optionally a full strategy to follow in order to achieve that outco…

MiniZero: Comparative Analysis of AlphaZero and MuZero on Go, Othello, and Atari Games Open

Ti-Rong Wu, Hung Guei, Po‐Wei Huang, Pei-Chiun Peng, Ting Han Wei , et al. · 2023

Computer science Mathematics Geography

This paper presents MiniZero, a zero-knowledge learning framework that supports four state-of-the-art algorithms, including AlphaZero, MuZero, Gumbel AlphaZero, and Gumbel MuZero. While these algorithms have demonstrated super-human perfor…

Optimistic Temporal Difference Learning for <i>2048</i> Open

Hung Guei, Lung-Pin Chen, I‐Chen Wu · 2021

Computer science Mathematics Philosophy

Temporal difference (TD) learning and its variants, such as multistage TD\n(MS-TD) learning and temporal coherence (TC) learning, have been successfully\napplied to 2048. These methods rely on the stochasticity of the environment of\n2048 …

Strength Adjustment and Assessment for MCTS-Based Programs [Research Frontier] Open

An-Jen Liu, Ti-Rong Wu, I‐Chen Wu, Hung Guei, Ting-Han Wei · 2020

Computer science Mathematics

2048 is a single-player stochastic puzzle game. This intriguing and addictive\ngame has been popular worldwide and has attracted researchers to develop\ngame-playing programs. Due to its simplicity and complexity, 2048 has become an\ninter…

On Strength Adjustment for MCTS-Based Programs Open

I‐Chen Wu, Ti-Rong Wu, An-Jen Liu, Hung Guei, Ting-Han Wei · 2019

Computer science Mathematics Materials science

This paper proposes an approach to strength adjustment for MCTS-based game-playing programs. In this approach, we use a softmax policy with a strength index z to choose moves. Most importantly, we filter low quality moves by excluding thos…

Hung Guei YOU? Author Swipe