Zepeng Ding
FlashThink: An Early Exit Method For Efficient Reasoning
Large Language Models (LLMs) have shown impressive performance in reasoning tasks. However, LLMs tend to generate excessively long reasoning content, leading to significant computational overhead. Our observations indicate that even on sim…
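The early-exit idea lends itself to a brief illustration. The sketch below is a minimal, hypothetical rendering of such a loop (the callables `model_step` and `can_answer_now` are placeholders, not components from the paper): generation halts as soon as a lightweight check judges the partial reasoning sufficient to answer.

```python
def generate_with_early_exit(prompt, model_step, can_answer_now,
                             check_every=32, max_tokens=2048):
    """Emit reasoning tokens, but stop early once a lightweight check
    decides the partial reasoning already supports an answer.
    Hypothetical sketch; not the paper's actual implementation."""
    reasoning = []
    for i in range(max_tokens):
        token = model_step(prompt, reasoning)  # next reasoning token, or None at EOS
        if token is None:
            break
        reasoning.append(token)
        # Run the cheap sufficiency check only periodically, not every step.
        if (i + 1) % check_every == 0 and can_answer_now(prompt, reasoning):
            break  # early exit: skip the remaining reasoning tokens
    return "".join(reasoning)
```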
RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving
Multi-step planning has been widely employed to enhance the performance of large language models (LLMs) on downstream natural language processing (NLP) tasks; it decomposes the original task into multiple subtasks and guides LLMs to solv…
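The decompose-and-guide loop the abstract mentions can be sketched as follows; `decompose` and `solve_subtask` are hypothetical stand-ins for a planner and an LLM call, and the reinforcement-learned, adaptive ordering of subtasks is omitted.

```python
def solve_multistep(task, decompose, solve_subtask):
    """Decompose a task into subtasks and solve them sequentially,
    appending each intermediate answer to the running context
    (an illustrative sketch, not the RLAP framework itself)."""
    context = task
    answers = []
    for subtask in decompose(task):
        answer = solve_subtask(subtask, context)
        answers.append(answer)
        context += f"\nSolved: {subtask} -> {answer}"  # carry results forward
    return answers
```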
Mitigating Out-of-Entity Errors in Named Entity Recognition: A Sentence-Level Strategy
Many previous named entity recognition (NER) models suffer from the Out-of-Entity (OOE) problem, i.e., tokens in the entity mentions of test samples have not appeared in the training samples, which hinders the achievement of …
Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction
Existing research on large language models (LLMs) shows that they can solve information extraction tasks through multi-step planning. However, their extraction behavior on complex sentences and tasks is unstable, with emerging issues such as fa…
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization
Large Language Models (LLMs) have shown remarkable capabilities in language understanding and generation. Nonetheless, it has also been observed that LLMs tend to produce inaccurate responses to specific queries. This deficiency can be traced …
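How segmentation can be "challenged" is easy to see with a toy tokenizer. The snippet below is purely illustrative (a greedy longest-match vocabulary, not any real LLM tokenizer): a word the vocabulary covers becomes one opaque token, while a slightly perturbed variant shatters into pieces the model has rarely seen together.

```python
def greedy_tokenize(text, vocab):
    """Toy greedy longest-match tokenizer. Real LLMs use learned BPE
    merges, but the brittleness to small perturbations is analogous."""
    tokens, i = [], 0
    while i < len(text):
        for j in range(len(text), i, -1):  # try the longest substring first
            if text[i:j] in vocab or j == i + 1:  # single chars always allowed
                tokens.append(text[i:j])
                i = j
                break
    return tokens

vocab = {"straw", "berry", "strawberry"}
print(greedy_tokenize("strawberry", vocab))   # ['strawberry']: one opaque token
print(greedy_tokenize("strawberrry", vocab))  # misspelling shatters the word:
                                              # ['straw', 'b', 'e', 'r', 'r', 'r', 'y']
```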
P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models
In recent years, the rise of large language models (LLMs) has made it possible to perform named entity recognition (NER) directly, with no demonstration samples or only a few, through in-context learning (ICL). However, sta…
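One way to read the "point" idea in the title is to prompt with representative entity mentions per type instead of full demonstration sentences. The sketch below assembles such a prompt; it is an assumption about the general shape, not the paper's exact P-ICL template.

```python
def build_picl_style_prompt(type_points, sentence):
    """Assemble an NER prompt from representative entity mentions per
    type ("points") rather than full demonstration sentences.
    A hedged sketch of the idea, not the paper's exact format."""
    lines = ["Identify named entities in the sentence. Entity types and examples:"]
    for etype, points in type_points.items():
        lines.append(f"- {etype}: e.g., {', '.join(points)}")
    lines.append(f"Sentence: {sentence}")
    lines.append("Entities (type: mention):")
    return "\n".join(lines)

prompt = build_picl_style_prompt(
    {"PER": ["Marie Curie", "Alan Turing"],
     "ORG": ["UNESCO", "DeepMind"],
     "LOC": ["Paris", "Mount Fuji"]},
    "Ada Lovelace worked with Charles Babbage in London.",
)
print(prompt)
```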
Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction
Relation triple extraction, which outputs a set of triples from long sentences, plays a vital role in knowledge acquisition. Large language models can accurately extract triples from simple sentences through few-shot learning or fine-tunin…
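One simple reading of "model collaboration" for recall is a union-style merge of candidates from two extractors. The sketch below is an assumption-laden illustration (`llm_extract` and `assist_extract` are placeholder callables), not the paper's actual framework.

```python
def collaborative_extract(sentence, llm_extract, assist_extract):
    """Merge relational triples proposed by an LLM with candidates from
    a smaller assisting model, keeping the union to raise recall.
    Illustrative placeholder logic, not the paper's exact mechanism."""
    triples = set(llm_extract(sentence))      # precise, but may miss triples
    for triple in assist_extract(sentence):   # higher-recall candidate source
        triples.add(triple)
    return sorted(triples)
```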