Ningyuan Xi
Making Language Model a Hierarchical Classifier
Decoder-only language models, such as GPT and LLaMA, generally decode from the last layer. Motivated by humans' hierarchical thinking capability, we propose that a hierarchical decoder architecture could be built with different layers decodi…
Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling
Large Language Model Can Be a Foundation for Hidden Rationale-Based Retrieval
Despite the recent advancement of Retrieval-Augmented Generation (RAG) systems, most retrieval methodologies are developed for factual retrieval, which assumes that the query and positive documents are semantically similar. In this paper, we …
Multi-Party Supervised Fine-tuning of Language Models for Multi-Party Dialogue Generation
Large Language Models (LLMs) are usually fine-tuned to participate in dyadic or two-party dialogues and cannot adapt well to multi-party dialogues (MPD), which hinders their application in scenarios such as multi-personal meetin…
MeTHanol: Modularized Thinking Language Models with Intermediate Layer Thinking, Decoding and Bootstrapping Reasoning
Current research efforts focus on enhancing the thinking and reasoning capability of large language models (LLMs) through prompting, data-driven emergence, and inference-time computation. In this study, we consider stimulating language model…
LaMsS: When Large Language Models Meet Self-Skepticism
Hallucination is a major challenge for large language models (LLMs), preventing their further application in some fields. Humankind's skeptical thinking could help LLMs achieve self-cognition and self-reflection and alleviate their h…
A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio
Large Language Models (LLMs) often need Continual Pre-Training (CPT) to acquire unfamiliar language skills or adapt to new domains. The huge training cost of CPT calls for a cautious choice of key hyper-parameters, such as the mixture…
Surfactant Protein-C Regulates Alveolar Type 2 Epithelial Cell Lineages via the CD74 Receptor
This study suggests that SPC regulates the AT2 lineage in vitro and in vivo. SPC might influence the AT2 lineage during lung epithelium repair by activating a signaling mechanism involving the CD74 receptor.