Chengao Li
Gradient-Adaptive Policy Optimization: Towards Multi-Objective Alignment of Large Language Models
Reinforcement Learning from Human Feedback (RLHF) has emerged as a powerful technique for aligning large language models (LLMs) with human preferences. However, effectively aligning LLMs with diverse human preferences remains a significant…
Make sport-related self-control better: Ritualized behavior in Chinese athletes
Research suggests that ritualized behavior helps individuals gain self-control, thereby influencing their performance. Although ritualized behavior is most widely applied among athletes, these studies have been found to have no clear quant…
Controlling Large Language Models Through Concept Activation Vectors
As large language models (LLMs) are widely deployed across various domains, the ability to control their generated outputs has become more critical. This control involves aligning LLMs' outputs with human values and ethical principles or cu…