Explanipedia

CoT-Evo: Evolutionary Distillation of Chain-of-Thought for Scientific Reasoning Open

Kehua Feng, Keyan Ding, Zhihui Zhu, Lei Liang, Qiang Zhang , et al. · 2025

While chain-of-thought (CoT) distillation from advanced large language models (LLMs) has proven effective in general reasoning tasks, it struggles in scientific domains where even advanced models often produce incorrect or superficial reas…

Enhancing Safe and Controllable Protein Generation via Knowledge Preference Optimization Open

Yuhao Wang, Keyan Ding, Kehua Feng, Zeyuan Wang, Ming Qin , et al. · 2025

Protein language models have emerged as powerful tools for sequence generation, offering substantial advantages in functional optimization and denovo design. However, these models also present significant risks of generating harmful protei…

KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction Open

Haibo Liu, Keyan Ding, Peilin Chen, Yinwei Wei, Liqiang Nie , et al. · 2025

Accurate prediction of protein-ligand binding affinity is critical for drug discovery. While recent deep learning approaches have demonstrated promising results, they often rely solely on structural features of proteins and ligands, overlo…

OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases Open

Yongrui Chen, Zhiqiang Liu, Jiaxuan Yu, Lin Ren, Nan Hu , et al. · 2025

Large Language Models (LLMs) have demonstrated substantial progress on reasoning tasks involving unstructured text, yet their capabilities significantly deteriorate when reasoning requires integrating structured external knowledge such as …

SciCUEval: A Comprehensive Dataset for Evaluating Scientific Context Understanding in Large Language Models Open

Jing Yu, Yuqi Tang, Kehua Feng, M. Sreenivasa Rao, Lei Liang , et al. · 2025

Large Language Models (LLMs) have shown impressive capabilities in contextual understanding and reasoning. However, evaluating their performance across diverse scientific domains remains underexplored, as existing benchmarks primarily focu…

SAFER: Advancing Safety Alignment via Efficient Ex-Ante Reasoning Open

Kehua Feng, Keyan Ding, Yu Jing, Menghan Li, Yuhao Wang , et al. · 2025

Recent advancements in large language models (LLMs) have accelerated progress toward artificial general intelligence, yet their potential to generate harmful content poses critical safety challenges. Existing alignment methods often strugg…

Integrating protein language models and automatic biofoundry for enhanced protein evolution Open

Qiang Zhang, Wanyi Chen, Ming Qin, Yuhao Wang, Zhongji Pu , et al. · 2025

SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration Open

Huajun Chen, Keyan Ding, Jing Yu, Junjie Huang, Yuchen Yang , et al. · 2025

Scientific research increasingly relies on specialized computational tools, yet effectively utilizing these tools demands substantial domain expertise. While Large Language Models (LLMs) show promise in tool automation, they struggle to se…

Boosting LLM’s Molecular Structure Elucidation with Knowledge Enhanced Tree Search Reasoning Open

Xiang Zhuang, Bin Wu, Jiyu Cui, Kehua Feng, Xiaotong Li , et al. · 2025

Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition Open

Kehua Feng, Keyan Ding, Tan Hongzhi, Kede Ma, Zhihua Wang , et al. · 2025

Enhancing Safe and Controllable Protein Generation via Knowledge Preference Optimization Open

Yuhao Wang, Keyan Ding, Kehua Feng, Zeyuan Wang, Ming Qin , et al. · 2025

Multi-purpose controllable protein generation via prompted language models Open

Zeyuan Wang, Binbin Chen, Keyan Ding, Jiawen Cao, Ming Qin , et al. · 2024

Deep learning is increasingly powerful for designing proteins that meet structural and functional requirements. However, most existing methods follow a conventional pipeline: first defining a backbone structure and then generating sequence…

Advancing biomolecular understanding and design following human instructions Open

Xiang Zhuang, Keyan Ding, Tao Lyu, Yinuo Jiang, Xiaotong Li , et al. · 2024

Understanding and designing biomolecules, such as proteins and small molecules, is central to advancing drug discovery, synthetic biology and enzyme engineering. Recent breakthroughs in artificial intelligence have revolutionized biomolecu…

SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks Open

Tianhao Li, Jingyu Lu, Chuangxin Chu, Tianyu Zeng, Yujia Zheng , et al. · 2024

Large language models (LLMs) have a transformative impact on a variety of scientific tasks across disciplines including biology, chemistry, medicine, and physics. However, ensuring the safety alignment of these models in scientific researc…

Retrosynthesis prediction with an iterative string editing model Open

Yuqiang Han, Xiaoyang Xu, Chang‐Yu Hsieh, Keyan Ding, Hongxia Xu , et al. · 2024

SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models Open

Kehua Feng, Keyan Ding, Weijie Wang, Xiang Zhuang, Zeyuan Wang , et al. · 2024

Large language models (LLMs) are playing an increasingly important role in scientific research, yet there remains a lack of comprehensive benchmarks to evaluate the breadth and depth of scientific knowledge embedded in these models. To add…

Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics Open

Zhangkai Ni, Yue Liu, Keyan Ding, Wenhan Yang, Hanli Wang , et al. · 2024

Deep learning-based methods have significantly influenced the blind image quality assessment (BIQA) field, however, these methods often require training using large amounts of human rating data. In contrast, traditional knowledge-based met…

Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition Open

Kehua Feng, Keyan Ding, Kede Ma, Zhihua Wang, Qiang Zhang , et al. · 2024

Reliable evaluation of large language models (LLMs) is impeded by two key challenges: objective metrics often fail to reflect human perception of natural language, and exhaustive human labeling is prohibitively expensive. Here, we propose …

Deep Shape-Texture Statistics for Completely Blind Image Quality Evaluation Open

Yixuan Li, Peilin Chen, Hanwei Zhu, Keyan Ding, Leida Li , et al. · 2024

Opinion-Unaware Blind Image Quality Assessment (OU-BIQA) models aim to predict image quality without training on reference images and subjective quality scores. Thereinto, image statistical comparison is a classic paradigm, while the perfo…

Learning Invariant Molecular Representation in Latent Discrete Space Open

Xiang Zhuang, Qiang Zhang, Keyan Ding, Yatao Bian, Xinghuan Wang , et al. · 2023

Molecular representation learning lays the foundation for drug discovery. However, existing methods suffer from poor out-of-distribution (OOD) generalization, particularly when data for training and testing originate from different environ…

InstructProtein: Aligning Human and Protein Language via Knowledge Instruction Open

Zeyuan Wang, Qiang Zhang, Keyan Ding, Ming Qin, Xiang Zhuang , et al. · 2023

Large Language Models (LLMs) have revolutionized the field of natural language processing, but they fall short in comprehending biological sequences such as proteins. To address this challenge, we propose InstructProtein, an innovative LLM…

Active Finetuning Protein Language Model: A Budget-Friendly Method for Directed Evolution Open

Ming Qin, Keyan Ding, Bin Wu, Zhenping Li, Haihong Yang , et al. · 2023

Directed evolution is a widely-used strategy of protein engineering to improve protein function via mimicking natural mutation and selection. Machine learning-assisted directed evolution (MLDE) approaches aim to learn a fitness predictor, …

Graph Sampling-based Meta-Learning for Molecular Property Prediction Open

Xiang Zhuang, Qiang Zhang, Bin Wu, Keyan Ding, Yin Fang , et al. · 2023

Molecular property is usually observed with a limited number of samples, and researchers have considered property prediction as a few-shot problem. One important fact that has been ignored by prior works is that each molecule can be record…

Graph Sampling-based Meta-Learning for Molecular Property Prediction Open

Xiang Zhuang, Qiang Zhang, Bin Wu, Keyan Ding, Yin Fang , et al. · 2023

Molecular property is usually observed with a limited number of samples, and researchers have considered property prediction as a few-shot problem. One important fact that has been ignored by prior works is that each molecule can be record…

Locally Adaptive Structure and Texture Similarity for Image Quality Assessment Open

Keyan Ding, Yi Liu, Xueyi Zou, Shiqi Wang, Kede Ma · 2021

The latest advances in full-reference image quality assessment (IQA) involve\nunifying structure and texture similarity based on deep representations. The\nresulting Deep Image Structure and Texture Similarity (DISTS) metric, however,\nmak…

A Comparative Study of Image Quality Assessment Models through Perceptual Optimization Open

Keyan Ding, Kede Ma, Shiqi Wang, Eero P. Simoncelli · 2020

The performance of objective image quality assessment (IQA) models has been evaluated primarily by comparing model predictions to human quality judgments. Perceptual datasets gathered for this purpose have provided useful benchmarks for im…

Image Quality Assessment: Unifying Structure and Texture Similarity Open

Keyan Ding, Kede Ma, Shiqi Wang, Eero P. Simoncelli · 2020

Objective measures of image quality generally operate by comparing pixels of a "degraded" image to those of the original. Relative to human observers, these measures are overly sensitive to resampling of texture regions (e.g., replacing on…

A Simple Method to improve Initialization Robustness for Active Contours driven by Local Region Fitting Energy Open

Keyan Ding · 2018

Active contour models based on local region fitting energy can segment images with intensity inhomogeneity effectively, but their segmentation results are easy to error if the initial contour is inappropriate. In this paper, we present a s…

Keyan Ding YOU? Author Swipe