Explanipedia

Developing ChemDFM as a large language foundation model for chemistry Open

Zihan Zhao, Da Ma, Lu Chen, Liangtai Sun, Zihao Li , et al. · 2025

Alignment for Efficient Tool Calling of Large Language Models Open

Hongshen Xu, Zihan Wang, Zichen Zhu, Lei Pan, Xingyu Chen , et al. · 2025

CLaw: Benchmarking Chinese Legal Knowledge in Large Language Models - A Fine-grained Corpus and Reasoning Analysis Open

Xinsen Xu, Liang Zhao, Hongshen Xu, Chenchenc · 2025

MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficient Mobile Task Automation Open

Zichen Zhu, Hao Tang, Y.F. Li, Dan Liu, Hongshen Xu , et al. · 2025

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? Open

Ruisheng Cao, Fangyu Lei, Haoyuan Wu, Jixuan Chen, Yeqiao Fu , et al. · 2024

Data science and engineering workflows often span multiple stages, from warehousing to orchestration, using tools like BigQuery, dbt, and Airbyte. As vision language models (VLMs) advance in multimodal understanding and code generation, VL…

Sparsity-Accelerated Training for Large Language Models Open

Da Ma, Lu Chen, Pengyu Wang, Hongshen Xu, Hanqi Li , et al. · 2024

Large language models (LLMs) have demonstrated proficiency across various natural language processing (NLP) tasks but often require additional training, such as continual pre-training and supervised fine-tuning. However, the costs associat…

CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions Open

Hanchong Zhang, Ruisheng Cao, Hongshen Xu, Lu Chen, Kai Yu · 2024

Recently, Large Language Models (LLMs) have been demonstrated to possess impressive capabilities in a variety of domains and tasks. We investigate the issue of prompt design in the multi-turn text-to-SQL task and attempt to enhance the LLM…

Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind Open

Hongchuan Zeng, Hongshen Xu, Lu Chen, Kai Yu · 2024

Large Language Models (LLMs) have ushered in a new era in Natural Language Processing, but their massive size demands effective compression techniques for practicality. Although numerous model compression techniques have been investigated,…

A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames Open

Hongshen Xu, Ruisheng Cao, Su Zhu, Sheng Jiang, Hanchong Zhang , et al. · 2024

Previous work on spoken language understanding (SLU) mainly focuses on single-intent settings, where each input utterance merely contains one user intent. This configuration significantly limits the surface form of user utterances and the …

Developing ChemDFM as a large language foundation model for chemistry Open

Zihan Zhao, Da Ma, Lu Chen, Liangtai Sun, Zihao Li , et al. · 2024

Artificial intelligence (AI) has played an increasingly important role in chemical research. However, most models currently used in chemistry are specialist models that require training and tuning for specific tasks. A more generic and eff…

ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought Open

Hanchong Zhang, Ruisheng Cao, Lu Chen, Hongshen Xu, Kai Yu · 2023

Recently Large Language Models (LLMs) have been proven to have strong abilities in various domains and tasks. We study the problem of prompt designing in the text-to-SQL task and attempt to improve the LLMs' reasoning ability when generati…

Large Language Models Are Semi-Parametric Reinforcement Learning Agents Open

Danyang Zhang, Chen Lü, Situo Zhang, Hongshen Xu, Zihan Zhao , et al. · 2023

Inspired by the insights in cognitive science with respect to human memory and reasoning mechanism, a novel evolvable LLM-based (Large Language Model) agent framework is proposed as REMEMBERER. By equipping the LLM with a long-term experie…

On the Structural Generalization in Text-to-SQL Open

Jieyu Li, Lu Chen, Ruisheng Cao, Zhu Su, Hongshen Xu , et al. · 2023

Exploring the generalization of a text-to-SQL parser is essential for a system to automatically adapt the real-world databases. Previous works provided investigations focusing on lexical diversity, including the influence of the synonym an…

Exploring Schema Generalizability of Text-to-SQL Open

Jieyu Li, Lu Chen, Ruisheng Cao, Zhu Su, Hongshen Xu , et al. · 2023

Exploring the generalizability of a text-to-SQL parser is essential for a system to automatically adapt the real-world databases. Previous investigation works mostly focus on lexical diversity, including the influence of the synonym and pe…

ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought Open

Hanchong Zhang, Ruisheng Cao, Lu Chen, Hongshen Xu, Kai Yu · 2023

Recently Large Language Models (LLMs) have been proven to have strong abilities in various domains and tasks. We study the problem of prompt designing in the text-to-SQL task and attempt to improve the LLMs’ reasoning ability when generati…

TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages Open

Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen , et al. · 2022

Recently, the structural reading comprehension (SRC) task on web pages has attracted increasing research interests. Although previous SRC work has leveraged extra information such as HTML tags or XPaths, the informative topology of web pag…

TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages Open

Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen , et al. · 2022

Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen, Kai Yu. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2022.

Select Open

Hongshen Xu · 2020

The purpose of this thesis is to provide an understanding of players' moral responses toward Non-Player Characters (NPCs) in video gameplay. The main research question for this thesis is what is the difference of moral response toward diff…

Heading control method of unmanned sailing boats based on fuzzy PID Open

Xuefei Zhang, Peng Yuan, Junzhe Tan, Shujie Wang, Hongshen Xu , et al. · 2019

[Objectives] In order to improve the anti-jamming ability and navigation stability of the unmanned sailing boats in the changeable and unknown environment and to realize accurate control of the heading of sailing boats,a fuzzy adapt…

Hongshen Xu YOU? Author Swipe