Explanipedia

Glyph: Scaling Context Windows via Visual-Text Compression Open

Jiale Cheng, Y. Liu, Xinyu Zhang, Yulin Fei, Wenyi Hong , et al. · 2025

Large language models (LLMs) increasingly rely on long-context modeling for tasks such as document understanding, code analysis, and multi-step reasoning. However, scaling context windows to the million-token level brings prohibitive compu…

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning Open

Haoming Wang, Haoyang Zou, Hongmei Song, Jiazhan Feng, Jun‐Jie Fang , et al. · 2025

The development of autonomous agents for graphical user interfaces (GUIs) presents major challenges in artificial intelligence. While recent advances in native agent models have shown promise by unifying perception, reasoning, action, and …

ReST-RL: Achieving Accurate Code Reasoning of LLMs with Optimized Self-Training and Decoding Open

Sining Zhoubian, Dan Zhang, Jie Tang · 2025

With respect to improving the reasoning accuracy of LLMs, the representative reinforcement learning (RL) method GRPO faces failure due to insignificant reward variance, while verification methods based on process reward models (PRMs) suffe…

GuARD: Effective Anomaly Detection through a Text-Rich and Graph-Informed Language Model Open

Yunhe Pang, Bo Chen, Fanjin Zhang, Yanghui Rao, Evgeny Kharlamov , et al. · 2025

UavNetSim-v1: A Python-based Simulation Platform for UAV Communication Networks Open

Zhou Zhao, Zipeng Dai, Linyi Huang, Cui Yang, Youjun Xiang , et al. · 2025

In unmanned aerial vehicle (UAV) networks, communication protocols and algorithms are essential for cooperation and collaboration between UAVs. Simulation provides a cost-effective solution for prototyping, debugging, and analyzing protoco…

ProtGO: universal protein function prediction utilizing multi-modal gene ontology knowledge Open

Boyan Wang, Yangli-ao Geng, Xingyi Cheng, Bo Chen, Zhilei Bei , et al. · 2025

Motivation As one of the recalcitrant challenges in life sciences and biomedicine, protein function prediction suffers from a deluge of AI-designed proteins, particularly having to face multi-modal information in the era of big data. Impor…

Colony Binary Classification Based on Persistent Homology Feature Extraction and Improved EfficientNet Open

Zumin Wang, Ke Yang, Jie Tang, Jun Gao, Yuhao Zhang , et al. · 2025

Classifying newly formed colonies is instrumental in uncovering sources of infection and enabling precision medicine, holding significant clinical value. However, due to the ambiguous features of early-stage colony images in culture dishes…

Deep learning-based automated segmentation for the quantitative diagnosis of cerebral small vessel disease via multisequence MRI Open

Huiyu Zhao, Miaoyi Zhang, Weijun Tang, Luyuan Jin, Jie Tang , et al. · 2025

Objective Existing visual scoring systems for cerebral small vessel disease (CSVD) cannot assess the global lesion load accurately and quantitatively. We aimed to develop an automated segmentation method based on deep learning (DL) to quan…

Small Language Model Makes an Effective Long Text Extractor Open

Yelin Chen, Fanjin Zhang, Jie Tang · 2025

Named Entity Recognition (NER) is a fundamental problem in natural language processing (NLP). However, the task of extracting longer entity spans (e.g., awards) from extended texts (e.g., homepages) is barely explored. Current NER methods …

StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error Open

Shih‐Mo Yang, Cunxiang Wang, Yidong Wang, Xiaotao Gu, Minlie Huang , et al. · 2025

Evaluating mathematical capabilities is critical for assessing the overall performance of large language models (LLMs). However, existing evaluation methods often focus solely on final answers, resulting in highly inaccurate and uninterpre…

HPSS: Heuristic Prompting Strategy Search for LLM Evaluators Open

Bosi Wen, Pei Ke, Yufei Sun, Cunxiang Wang, Xiaotao Gu , et al. · 2025

Since the adoption of large language models (LLMs) for text evaluation has become increasingly prevalent in the field of natural language processing (NLP), a series of existing works attempt to optimize the prompts for LLM evaluators to im…

Small Language Model Makes an Effective Long Text Extractor Open

Yelin Chen, Fanjin Zhang, Jie Tang · 2025

Named Entity Recognition (NER) is a fundamental problem in natural language processing (NLP). However, the task of extracting longer entity spans (e.g., awards) from extended texts (e.g., homepages) is barely explored. Current NER methods …

WCN25-1183 IMPLEMENTING CKD PATIENT EDUCATION IN A LOW-RESOURCE SETTING: A MIXED METHODS FEASIBILITY STUDY IN WESTERN KENYA Open

Christopher Owino, Jie Tang, Juddy Wachira, Mathew Koech · 2025

Effect of magnetization on antibacterial, lipid-lowering and antioxidant activities of isoquinoline alkaloids Open

Caihong Feng, Weijie Li, Xiaoling Wang, Jie Tang, Shun Yao · 2025

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models Open

Wenyi Hong, Yean Cheng, Zhuoyi Yang, Weihan Wang, Lefan Wang , et al. · 2025

In recent years, vision language models (VLMs) have made significant advancements in video understanding. However, a crucial capability - fine-grained motion comprehension - remains under-explored in current benchmarks. To address this gap…

Dynamic Scaling of Unit Tests for Code Reward Modeling Open

Zeyao Ma, Xiaokang Zhang, Jing Zhang, Jifan Yu, Sijia Luo , et al. · 2025

Current large language models (LLMs) often struggle to produce accurate responses on the first attempt for complex reasoning tasks like code generation. Prior research tackles this challenge by generating multiple candidate solutions and v…

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents Open

Yifan Xu, Xiao Liu, Xueqiao Sun, Siyi Cheng, Hao Yu , et al. · 2025

HPSS: Heuristic Prompting Strategy Search for LLM Evaluators Open

Bosi Wen, Pei Ke, Yufei Sun, Cunxiang Wang, Xiaotao Gu , et al. · 2025

A Machine Learning-Based Online Prognostic Prediction Model for Patients with Pancreatitis Complicated by Sepsis: Development and Validation in Two Retrospective Cohorts Open

Wen Zhang, Jie Tang, Qingqing Zhang, Miao Lu, Xueting Deng , et al. · 2025

Dynamic Scaling of Unit Tests for Code Reward Modeling Open

Zeyao Ma, Xiaokang Zhang, Jing Zhang, Jifan Yu, Sijia Luo , et al. · 2025

LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models Open

Jiayi Gui, Yiming Liu, Jiale Cheng, Xiaotao Gu, Xiao Liu , et al. · 2025

A Survey of Post-Training Scaling in Large Language Models Open

Hanyu Lai, Xiao Liu, Jun Gao, Jiale Cheng, Zehan Qi , et al. · 2025

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks Open

Yushi Bai, Shangqing Tu, Jiajie Zhang, Hao Peng, Xiaozhi Wang , et al. · 2025

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Open

Jiazheng Xu, Yu Huang, Jiale Cheng, Yuanming Yang, Jiajun Xu , et al. · 2024

Visual generative models have achieved remarkable progress in synthesizing photorealistic images and videos, yet aligning their outputs with human preferences across critical dimensions remains a persistent challenge. Though reinforcement …

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks Open

Yushi Bai, Shangqing Tu, Jiajie Zhang, Hao Peng, Xiaozhi Wang , et al. · 2024

This paper introduces LongBench v2, a benchmark designed to assess the ability of LLMs to handle long-context problems requiring deep understanding and reasoning across real-world multitasks. LongBench v2 consists of 503 challenging multip…

The Superalignment of Superhuman Intelligence with Large Language Models Open

Minlie Huang, Yingkang Wang, Shiyao Cui, Pei Ke, Jie Tang · 2024

We have witnessed superhuman intelligence thanks to the fast development of large language models and multimodal language models. As the application of such superhuman models becomes more and more popular, a critical question arises here: …

GuARD: Effective Anomaly Detection through a Text-Rich and Graph-Informed Language Model Open

Yunhe Pang, Bo Chen, Fanjin Zhang, Yanghui Rao, Jie Tang · 2024

Anomaly detection on text-rich graphs is widely prevalent in real life, such as detecting incorrectly assigned academic papers to authors and detecting bots in social networks. The remarkable capabilities of large language models (LLMs) pa…

Application Research on Improving Few shot Semi supervised Network In Fault Diagnosis of Rocket Artillery Rotation Machine Open

Xinlei Zheng, Zhipeng Zhang, Taotao Liang, Rui Liao, Jie Tang , et al. · 2024

The success of fault diagnosis based on deep learning is attributed to a large number of labeled samples. However, in the application of fault diagnosis for artillery rotating devices, the scarcity of labeled samples can easily lead to ove…

A Deep Learning-Based Framework for Bearing RUL Prediction to Optimize Laser Shock Peening Remanufacturing Open

Yuchen Liang, Yuqi Wang, An‐Ping Li, Chen Gu, Jie Tang , et al. · 2024

Accurate prediction of the remaining useful life (RUL) of bearings is crucial for maintaining the reliability and efficiency of industrial systems. This study introduces a novel methodology integrating advanced machine learning and optimiz…

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents Open

Yifan Xu, Xiao Liu, Xueqiao Sun, Siyi Cheng, Hao Yu , et al. · 2024

Autonomous agents have become increasingly important for interacting with the real world. Android agents, in particular, have been recently a frequently-mentioned interaction method. However, existing studies for training and evaluating An…

Jie Tang YOU? Author Swipe